
2025-07-04 09:10:51
Misaligned from Within: Large Language Models Reproduce Our Double-Loop Learning Blindness
Tim Rogers, Ben Teehankee
https://arxiv.org/abs/2507.02283 https…
Misaligned from Within: Large Language Models Reproduce Our Double-Loop Learning Blindness
Tim Rogers, Ben Teehankee
https://arxiv.org/abs/2507.02283 https…
Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/3]:
- MultiStream-LLM: Bridging Modalities for Robust Sign Language Translation
Marshall Thomas, Edward Fish, Richard Bowden
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/9]:
- DivScene: Towards Open-Vocabulary Object Navigation with Large Vision Language Models in Diverse ...
Wang, Zhang, Fang, Tian, Yang, Ma, Pan, Song, Yu
Harnessing Patterns to Support the Development of Hybrid Quantum Applications
Daniel Vietz, Martin Beisel, Johanna Barzen, Frank Leymann, Lavinia Stiliadou, Benjamin Weder
https://arxiv.org/abs/2507.00696
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/4]:
- Perceiving Beyond Language Priors: Enhancing Visual Comprehension and Attention in Multimodal Models
Aarti Ghatkesar, Ganesh Venkatesh
OFCnetLLM: Large Language Model for Network Monitoring and Alertness
Hong-Jun Yoon, Mariam Kiran, Danial Ebling, Joe Breen
https://arxiv.org/abs/2507.22711 https://
Lost in Translation? Converting RegExes for Log Parsing into Dynatrace Pattern Language
Julian Fragner, Christian Macho, Bernhard Dieber, Martin Pinzger
https://arxiv.org/abs/2506.19539
Why Are Parsing Actions for Understanding Message Hierarchies Not Random?
Daichi Kato, Ryo Ueda, Yusuke Miyao
https://arxiv.org/abs/2506.22366 https://
One Weird Trick to Untie Landin's Knot
Paulette Koronkevich, William J. Bowman
https://arxiv.org/abs/2507.21317 https://arxiv.org/pdf/2507.21317…
Getting Started Live-Coding with 🌀 Strudel
With Strudel, you can expressively write dynamic music pieces. It is an official port of the Tidal Cycles pattern language to JavaScript. You don’t need to know JavaScript or Tidal Cycles to make music with Strudel.
#electronicmusic
https://strudel.cc/workshop/getting-started/
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/4]:
- The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Info...
Li, Shi, Gao, Liu, Wang, Chen, Liu, Zhao, Wang, Metaxas
STARec: An Efficient Agent Framework for Recommender Systems via Autonomous Deliberate Reasoning
Chenghao Wu, Ruiyang Ren, Junjie Zhang, Ruirui Wang, Zhongrui Ma, Qi Ye, Wayne Xin Zhao
https://arxiv.org/abs/2508.18812
Tutorial on the Probabilistic Unification of Estimation Theory, Machine Learning, and Generative AI
Mohammed Elmusrati
https://arxiv.org/abs/2508.15719 https://
The Phases of Chaos
Tarek Anous, Diego M. Hofman
https://arxiv.org/abs/2506.20542 https://arxiv.org/pdf/2506.20542
LLM-Based Social Simulations Require a Boundary
Zengqing Wu, Run Peng, Takayuki Ito, Chuan Xiao
https://arxiv.org/abs/2506.19806 https://
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/9]:
- PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation
Zeng, Ni, Wang, Rim, Chung, Yang, Hong, Wong
Large Language Models Show Signs of Alignment with Human Neurocognition During Abstract Reasoning
Christopher Pinier, Sonia Acu\~na Vargas, Mariia Steeghs-Turchina, Dora Matzke, Claire E. Stevenson, Michael D. Nunez
https://arxiv.org/abs/2508.10057
This is everything I could never get out of #sonicpi:
Multi-Language Detection of Design Pattern Instances
Hugo Andrade, Jo\~ao Bispo, Filipe F. Correia
https://arxiv.org/abs/2506.03903 https://
Reverse Physician-AI Relationship: Full-process Clinical Diagnosis Driven by a Large Language Model
Shicheng Xu, Xin Huang, Zihao Wei, Liang Pang, Huawei Shen, Xueqi Cheng
https://arxiv.org/abs/2508.10492
Bhatt Conjectures: On Necessary-But-Not-Sufficient Benchmark Tautology for Human Like Reasoning
Manish Bhatt
https://arxiv.org/abs/2506.11423 https://
Automatic Semantic Alignment of Flow Pattern Representations for Exploration with Large Language Models
Weihan Zhang, Jun Tao
https://arxiv.org/abs/2508.06300 https://
Diagnostic-Guided Dynamic Profile Optimization for LLM-based User Simulators in Sequential Recommendation
Hongyang Liu, Zhu Sun, Tianjun Wei, Yan Wang, Jiajie Zhu, Xinghua Qu
https://arxiv.org/abs/2508.12645
Hidden in the Noise: Unveiling Backdoors in Audio LLMs Alignment through Latent Acoustic Pattern Triggers
Liang Lin, Miao Yu, Kaiwen Luo, Yibo Zhang, Lilan Peng, Dexian Wang, Xuehai Tang, Yuanhe Zhang, Xikang Yang, Zhenhong Zhou, Kun Wang, Yang Liu
https://arxiv.org/abs/2508.02175
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/9]:
- "Principal Components" Enable A New Language of Images
Xin Wen, Bingchen Zhao, Ismail Elezi, Jiankang Deng, Xiaojuan Qi
This https://arxiv.org/abs/2506.03903 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csSE_…
Superstudent intelligence in thermodynamics
Rebecca Loubet, Pascal Zittlau, Marco Hoffmann, Luisa Vollmer, Sophie Fellenz, Heike Leitte, Fabian Jirasek, Johannes Lenhard, Hans Hasse
https://arxiv.org/abs/2506.09822
Multimodal Behavioral Patterns Analysis with Eye-Tracking and LLM-Based Reasoning
Dongyang Guo, Yasmeen Abdrabou, Enkeleda Thaqi, Enkelejda Kasneci
https://arxiv.org/abs/2507.18252
Outsmarting Linear Neural Networks via an Incoherent Light-Driven Optical Extreme Learner with Data Reverberation
Bofeng Liu, Xu Mei, Sadman Shafi, Tunan Xia, Iam-Choon Khoo, Zhiwen Liu, Xingjie Ni
https://arxiv.org/abs/2508.08428
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/6]:
- Beyond Label Semantics: Language-Guided Action Anatomy for Few-shot Action Recognition
Zefeng Qian, Xincheng Yao, Yifei Huang, Chongyang Zhang, Jiangyong Ying, Hong Sun
Survey of LLM Agent Communication with MCP: A Software Design Pattern Centric Review
Anjana Sarkar, Soumyendu Sarkar
https://arxiv.org/abs/2506.05364 https…
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/5]:
- VRU-Accident: A Vision-Language Benchmark for Video Question Answering and Dense Captioning for A...
Younggun Kim, Ahmed S. Abdelrahman, Mohamed Abdel-Aty
Beyond Surface-Level Detection: Towards Cognitive-Driven Defense Against Jailbreak Attacks via Meta-Operations Reasoning
Rui Pu, Chaozhuo Li, Rui Ha, Litian Zhang, Lirong Qiu, Xi Zhang
https://arxiv.org/abs/2508.03054
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/5]:
- RadAlign: Advancing Radiology Report Generation with Vision-Language Concept Alignment
Difei Gu, Yunhe Gao, Yang Zhou, Mu Zhou, Dimitris Metaxas
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[5/7]:
- Calibrated and Robust Foundation Models for Vision-Language and Medical Image Tasks Under Distrib...
Behraj Khan, Tahir Qasim Syed, Nouman M. Durrani, Bilal Naseem, Shabir Ahmad, Rizwan Qureshi
…
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/7]:
- Zero-AVSR: Zero-Shot Audio-Visual Speech Recognition with LLMs by Learning Language-Agnostic Spee...
Jeong Hun Yeo, Minsu Kim, Chae Won Kim, Stavros Petridis, Yong Man Ro
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/4]:
- Bi-VLDoc: Bidirectional Vision-Language Modeling for Visually-Rich Document Understanding
Chuwei Luo, Guozhi Tang, Qi Zheng, Cong Yao, Lianwen Jin, Chenliang Li, Yang Xue, Luo Si
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/7]:
- SLGaussian: Fast Language Gaussian Splatting in Sparse Views
Kangjie Chen, BingQuan Dai, Minghan Qin, Dongbin Zhang, Peihao Li, Yingshuang Zou, Haoqian Wang
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/4]:
- IRL-VLA: Training an Vision-Language-Action Policy via Reward World Model
Jiang, Gao, Wang, Sun, Wang, Heng, Sun, Tang, Zhu, Chai, Wang, Gu, Jiang, Sun
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[5/5]:
Imagine, Verify, Execute: Memory-Guided Agentic Exploration with Vision-Language Models
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/5]:
- SpaCE-10: A Comprehensive Benchmark for Multimodal Large Language Models in Compositional Spatial...
Ziyang Gong, Wenhao Li, Oliver Ma, Songyuan Li, Jiayi Ji, Xue Yang, Gen Luo, Junchi Yan, Rongrong Ji
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/5]:
- CountingFruit: Language-Guided 3D Fruit Counting with Semantic Gaussian Splatting
Fengze Li, Yangle Liu, Jieming Ma, Hai-Ning Liang, Yaochun Shen, Huangxiang Li, Zhijing Wu
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[10/10]:
- Judging the Judges: Can Large Vision-Language Models Fairly Evaluate Chart Comprehension and Reas...
Laskar, Islam, Mahbub, Masry, Rahman, Bhuiyan, Nayeem, Joty, Hoque, Huang
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[6/10]:
- CountingFruit: Language-Guided 3D Fruit Counting with Semantic Gaussian Splatting
Fengze Li, Yangle Liu, Jieming Ma, Hai-Ning Liang, Yaochun Shen, Huangxiang Li, Zhijing Wu