2025-10-03 10:45:51
What MLLMs Learn about When they Learn about Multimodal Reasoning: Perception, Reasoning, or their Integration?
Jiwan Chung, Neel Joshi, Pratyusha Sharma, Youngjae Yu, Vibhav Vineet
https://arxiv.org/abs/2510.01719
What MLLMs Learn about When they Learn about Multimodal Reasoning: Perception, Reasoning, or their Integration?
Jiwan Chung, Neel Joshi, Pratyusha Sharma, Youngjae Yu, Vibhav Vineet
https://arxiv.org/abs/2510.01719
ActiveUMI: Robotic Manipulation with Active Perception from Robot-Free Human Demonstrations
Qiyuan Zeng, Chengmeng Li, Jude St. John, Zhongyi Zhou, Junjie Wen, Guorui Feng, Yichen Zhu, Yi Xu
https://arxiv.org/abs/2510.01607
windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations majored by the tasks in which each individual was asked to sort cards with other surfer’s name in the order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted
Did you see the sound? A Bayesian perspective on crossmodal perception in low vision https://www.biorxiv.org/content/10.64898/2025.12.24.696433v1 Temporal "audiovisual interactions are constrained by the presence of a usable visual signal"
"Did Joseph Carl Robnett Licklider (1915-1990) read “An Experiment in Time”? Could it be that he had a series of dreams between 1960 and 1968, and that he quickly wrote them down in his diary before breakfast? We can only speculate. But we do know for a fact that those dreams begat a nothing short of extraordinary sequence of writings."
TriAlignXA: An Explainable Trilemma Alignment Framework for Trustworthy Agri-product Grading
Jianfei Xie, Ziyang Li
https://arxiv.org/abs/2510.01990 https://
Sounds easy, looks nice: Crossmodal transfer of auditory processing fluency to visual object preference https://link.springer.com/article/10.3758/s13414-025-03177-5
Multimodal Feedback for Task Guidance in Augmented Reality
Hu Guo, Lily Patel, Rohan Gupt
https://arxiv.org/abs/2510.01690 https://arxiv.org/pdf/2510.01690…
Clink! Chop! Thud! -- Learning Object Sounds from Real-World Interactions
Mengyu Yang, Yiming Chen, Haozheng Pei, Siddhant Agarwal, Arun Balajee Vasudevan, James Hays
https://arxiv.org/abs/2510.02313
Nav-EE: Navigation-Guided Early Exiting for Efficient Vision-Language Models in Autonomous Driving
Haibo Hu, Lianming Huang, Xinyu Wang, Yufei Cui, Nan Guan, Chun Jason Xue
https://arxiv.org/abs/2510.01795
“Alliances are built on common values and a common threat perception,”
said Danish Defense Analyst Jacob Kaarsbo
“Trump shares neither of those with us
and I would argue he doesn’t share it with most Europeans.”
https://www.
MedQ-Bench: Evaluating and Exploring Medical Image Quality Assessment Abilities in MLLMs
Jiyao Liu, Jinjie Wei, Wanying Qu, Chenglong Ma, Junzhi Ning, Yunheng Li, Ying Chen, Xinzhe Luo, Pengcheng Chen, Xin Gao, Ming Hu, Huihui Xu, Xin Wang, Shujian Gao, Dingkang Yang, Zhongying Deng, Jin Ye, Lihao Liu, Junjun He, Ningsheng Xu
https://arxiv…
What Matters in RL-Based Methods for Object-Goal Navigation? An Empirical Study and A Unified Framework
Hongze Wang, Boyang Sun, Jiaxu Xing, Fan Yang, Marco Hutter, Dhruv Shah, Davide Scaramuzza, Marc Pollefeys
https://arxiv.org/abs/2510.01830
Perception of Mayfield shifting amid MVP-type run https://www.espn.com/nfl/story/_/id/46534098/perception-bucs-baker-mayfield-shifting-amid-mvp-type-run
📼 A unified model of memory and perception: How Hebbian learning explains our recall of past events
https://medicalxpress.com/news/2025-11-memory-perception-hebbian-recall-events.html
windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations majored by the tasks in which each individual was asked to sort cards with other surfer’s name in the order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted
Spectral peak picking improves tactile speech perception https://www.nature.com/articles/s41598-025-28930-6 "The algorithm is suitable for real-time use in wearable sensory substitution devices and could aid the development of effective haptic hearing aids."
LangGrasp: Leveraging Fine-Tuned LLMs for Language Interactive Robot Grasping with Ambiguous Instructions
Yunhan Lin, Wenqi Wu, Zhijie Zhang, Huasong Min
https://arxiv.org/abs/2510.02104
🇺🇦 #NowPlaying on #BBC6Music's #TheHueyShow
SHOLTO:
🎵 Tied To The Mast
#SHOLTO
#newRelease 🆕 single
https://sholto2.bandcamp.com/track/persephones-perception-2
https://open.spotify.com/track/3ZzN6oNgloLOJMNYgs5kJx
Circles or rectangles? What do you see? What does this tell you about your perception and consciousness?
Always fascinated by Anil Seth.
https://www.theguardian.com/commentisfree/2025/jul/05/optical-illusions-see-world-perception<…
Leider sehr nah an Erich Fromms Homophobie. Dennoch: Es geht um Rollen, die egal von welchem Geschlecht, eingenommen werden müssen, um psychisch gesunde Kinder zu erziehen und damit insgesamt zu einer friedlichen Gesellschaft zu kommen.
Beyond Perception: Wie entsteht Krieg? Die Folgen von Kindheitsprägung & Gefühlsstau | Dr. Hans-Joachim Maaz (#201)
Webseite der Episode:
Production Over Perception: Kenneth Murray Will Start in Dallas https://insidethestar.com/production-over-perception-kenneth-murray-will-start-in-dallas
An evening discussing falling enrollment in #STEM courses at universities across Europe, especially traditional studies like chemistry, geology and meteorology. I wonder if young people are unaware of just how interesting #STEM careers to be? Or do they have the perception it's "too hard" compared to other subjects where easier grades may be had? Or is it simply they think they can have "better"* jobs in other fields?
#AcademicChatter
*Where better might mean higher paid, more prestigious, more certain of employment, or less workload or some combination of all of these... ?
"When #billionaires own major media outlets, “news” becomes something closer to state messaging. Not because the government controls it, but because the financial class that funds political power also owns the platforms that shape public perception."
#kleptocracy
Legacy M…
Even #Nature now uses #AI to generate content: Sensory substitution devices and perception https://www.
I had thought we'd see Ekitike on the left, with Chiesa on the right. Slot, instead, puts Ekitike on the right and leaves Gakpo in.
Happy to see Jones make it into the midfield, as he's been good all year and a) Gravenberch needs to rest that ankle; and b) Mac Allister needs to rest period. Will be interested to see how far forward Szoboszlai plays.
Robbo and Frimpong make sense. I think Bradley has been better than the general perception, but Frimpong should get a run ou…
CHUCKLE -- When Humans Teach AI To Learn Emotions The Easy Way
Ankush Pratap Singh, Houwei Cao, Yong Liu
https://arxiv.org/abs/2510.09382 https://arxiv.org…
MLLM as a UI Judge: Benchmarking Multimodal LLMs for Predicting Human Perception of User Interfaces
Reuben A. Luera, Ryan Rossi, Franck Dernoncourt, Samyadeep Basu, Sungchul Kim, Subhojyoti Mukherjee, Puneet Mathur, Ruiyi Zhang, Jihyung Kil, Nedim Lipka, Seunghyun Yoon, Jiuxiang Gu, Zichao Wang, Cindy Xiong Bearfield, Branislav Kveton
https://
Audio-Guided Visual Perception for Audio-Visual Navigation
Yi Wang, Yinfeng Yu, Fuchun Sun, Liejun Wang, Wendong Zheng
https://arxiv.org/abs/2510.11760 https://
Omni-Captioner: Data Pipeline, Models, and Benchmark for Omni Detailed Perception
Ziyang Ma, Ruiyang Xu, Zhenghao Xing, Yunfei Chu, Yuxuan Wang, Jinzheng He, Jin Xu, Pheng-Ann Heng, Kai Yu, Junyang Lin, Eng Siong Chng, Xie Chen
https://arxiv.org/abs/2510.12720
The BBC publishes the results from its "Our Future, Our BBC" survey, completed by 872K viewers, showing only 43% say it is "effective" in being independent (Jake Kanter/Deadline)
https://deadline.com/2025/10/bbc-study-viewer…
The number of Computer Misuse Act 1990 prosecutions hasn't really risen in the last five years, despite the perception of a rise in "cyber crime". Maybe this indicates a ceiling on the resources allocated to investigate and prosecute, or maybe not.
https://questions-state…
Deceptive Planning Exploiting Inattention Blindness
Mustafa O. Karabag, Jesse Milzman, Ufuk Topcu
https://arxiv.org/abs/2510.02714 https://arxiv.org/pdf/25…
My lord, to be able to write science this well. This is like the opening of an econ or law paper.
From Donald Hoffman's "The Interface Theory of Perception",
https://sites.socsci.uci.edu/~ddhoff/interface.pdf
windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations majored by the tasks in which each individual was asked to sort cards with other surfer’s name in the order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted
🇺🇦 #NowPlaying on BBCRadio3's #RoundMidnight
SHOLTO:
🎵 Persephone's Perception
#SHOLTO
https://open.spotify.com/track/0mT6XUgnPE9ZpX7sQY79ZG
Although I must admit, I much prefer crystal-clear problems (URL NO CONNECT) with clean fixes over the sort I spend most of my time on, where I can only see the problem through the lens of other people's perception of slow vs. fast across something like a 10k-mile connection. With all the concomitant obscurants, both technical and cultural.
#Sysadminnery
New study indicates language, but not music, plays a powerful role in tactile perception https://www.fu-berlin.de/en/presse/informationen/fup/2025/fup_25_166-brain-language-laboratory-miller/index.html "Neuroscientist…
ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning
Hanyang Chen, Mark Zhao, Rui Yang, Qinwei Ma, Ke Yang, Jiarui Yao, Kangrui Wang, Hao Bai, Zhenhailong Wang, Rui Pan, Mengchao Zhang, Jose Barreiros, Aykut Onol, ChengXiang Zhai, Heng Ji, Manling Li, Huan Zhang, Tong Zhang
https://arxiv…
Trump corrupts the minds of Americans
Normalizes bigotry, misogyny,
graft and greed
From: @apron.rupar@threads.net
https://www.threads.com/@aaron.rupar/post/DP1koWjiWp9
Spotlight on Token Perception for Multimodal Reinforcement Learning
Siyuan Huang, Xiaoye Qu, Yafu Li, Yun Luo, Zefeng He, Daizong Liu, Yu Cheng
https://arxiv.org/abs/2510.09285 …
The Third Visual Pathway for Social Perception
David Pitcher
https://arxiv.org/abs/2512.09351 https://arxiv.org/pdf/2512.09351 https://arxiv.org/html/2512.09351
arXiv:2512.09351v1 Announce Type: new
Abstract: Influential models of primate visual cortex describe two functionally distinct pathways: a ventral pathway for object recognition and the dorsal pathway for spatial and action processing. However, recent human and non-human primate research suggests the existence of a third visual pathway projecting from early visual cortex through the motion-selective area V5/MT into the superior temporal sulcus (STS). Here we integrate anatomical, neuroimaging, and neuropsychological evidence demonstrating that this pathway specializes in processing dynamic social cues such as facial expressions, eye gaze, and body movements. This third pathway supports social perception by computing the actions and intentions of other people. These findings enhance our understanding of visual cortical organization and highlight the STS's critical role in social cognition, suggesting that visual processing encompasses a dedicated neural circuit for interpreting socially relevant motion and behavior.
toXiv_bot_toot
The fact that so many on the Fediverse remain so uneducated isn’t surprising. It only reveals how deeply bourgeois ideology continues to shape public consciousness, molding perception to serve the interests of capital, especially US imperialism.
Just as it took decades for many to see the war on terror for what it truly was, and even longer to grasp Palestine’s struggle against imperialism and settler colonialism, the same pattern will repeat.
I wholeheartedly reject imperialis…
FRIDAY, October 31st
From 7:30 – 9:30 PM
@ Yale CCAM Sound Art Series
(149 York St, New Haven, CT)
"SIGNIFICANTLY LESS DECEPTIVE"
An audio visual live performance by #Negativland SUE-C
Free, open to the public, but limited capacity – first come first serve!
The Adoption Paradox: A Comparative Analysis of Veterinary AI Adoption in China and the North America
Shumin Li, Xiaoyun Lai
https://arxiv.org/abs/2510.11758 https://
Generative Latent Video Compression
Zongyu Guo, Zhaoyang Jia, Jiahao Li, Xiaoyi Zhang, Bin Li, Yan Lu
https://arxiv.org/abs/2510.09987 https://arxiv.org/pd…
Beyond Seeing: Evaluating Multimodal LLMs on Tool-Enabled Image Perception, Transformation, and Reasoning
Xingang Guo, Utkarsh Tyagi, Advait Gosai, Paula Vergara, Ernesto Gabriel Hern\'andez Montoya, Chen Bo Calvin Zhang, Bin Hu, Yunzhong He, Bing Liu, Rakshith Sharma Srinivasa
https://arxiv.org/abs/2510.12712
windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations majored by the tasks in which each individual was asked to sort cards with other surfer’s name in the order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted
Two-stream network-driven vision-based tactile sensor for object feature extraction and fusion perception
Muxing Huang, Zibin Chen, Weiliang Xu, Zilan Li, Yuanzhi Zhou, Guoyuan Zhou, Wenjing Chen, Xinming Li
https://arxiv.org/abs/2510.12528
Role of thalamus in human conscious perception revealed by low-intensity focused ultrasound neuromodulation https://www.nature.com/articles/s41467-025-66832-3 these findings "underscore the modulatory potential of thalamocortical networks in shaping visual experience";
Fundamentals of Building Autonomous LLM Agents
Victor de Lamo Castrillo, Habtom Kahsay Gidey, Alexander Lenz, Alois Knoll
https://arxiv.org/abs/2510.09244 https://
Causal role of the individual alpha phase in #multisensory #perception https://www.biorxiv.org/content/10.1101/20
Evaluating Small Vision-Language Models on Distance-Dependent Traffic Perception
Nikos Theodoridis, Tim Brophy, Reenu Mohandas, Ganesh Sistu, Fiachra Collins, Anthony Scanlan, Ciaran Eising
https://arxiv.org/abs/2510.08352
Dart, Giants hope win over Eagles shifts narrative https://www.espn.com/nfl/story/_/id/46549170/jaxson-dart-giants-hope-win-eagles-shifts-narrative
Crosslisted article(s) found for cs.LG. https://arxiv.org/list/cs.LG/new
[3/5]:
- Team Xiaomi EV-AD VLA: Learning to Navigate Socially Through Proactive Risk Perception - Technica...
Xiao, Zhang, Tang, Cheng, Xu, Ding, Zhou, Chen, Ye, Hao
windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations majored by the tasks in which each individual was asked to sort cards with other surfer’s name in the order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted
Team Xiaomi EV-AD VLA: Learning to Navigate Socially Through Proactive Risk Perception - Technical Report for IROS 2025 RoboSense Challenge Social Navigation Track
Erjia Xiao, Lingfeng Zhang, Yingbo Tang, Hao Cheng, Renjing Xu, Wenbo Ding, Lei Zhou, Long Chen, Hangjun Ye, Xiaoshuai Hao
https://arxiv.org/abs/2510.07871
Advances in artificial vision systems: a comprehensive review of technologies, applications, and future directions https://link.springer.com/article/10.1007/s13534-025-00513-4 "clinical value hinges on durability, effective resolution, surgical practicality, and user t…
Active Semantic Perception
Huayi Tang, Pratik Chaudhari
https://arxiv.org/abs/2510.05430 https://arxiv.org/pdf/2510.05430
Generative inference unifies feedback processing for learning and perception in natural and artificial vision (here not prosthetic vision) https://www.biorxiv.org/content/10.1101/2025.10.21.683535v2
BLINK-Twice: You see, but do you observe? A Reasoning Benchmark on Visual Perception
Junyan Ye, Dongzhi Jiang, Jun He, Baichuan Zhou, Zilong Huang, Zhiyuan Yan, Hongsheng Li, Conghui He, Weijia Li
https://arxiv.org/abs/2510.09361
windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations majored by the tasks in which each individual was asked to sort cards with other surfer’s name in the order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted
windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations majored by the tasks in which each individual was asked to sort cards with other surfer’s name in the order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted
Causal role of the individual alpha phase in multisensory perception https://www.biorxiv.org/content/10.1101/2025.11.11.687884v2 more info in the X thread
Injecting Hallucinations in Autonomous Vehicles: A Component-Agnostic Safety Evaluation Framework
Alexandre Moreira Nascimento, Gabriel Kenji Godoy Shimanuki, L\'ucio Flavio Vismari, Jo\~ao Batista Camargo Jr, Jorge Rady de Almeida Jr, Paulo Sergio Cugnasca, Anna Carolina Muller Queiroz, Jeremy Noah Bailenson
https://arxiv.org/abs/2510…
windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations majored by the tasks in which each individual was asked to sort cards with other surfer’s name in the order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted
Enhanced pitch perception in early blind individuals and musicians is due to reduced internal noise https://www.biorxiv.org/content/10.1101/2025.11.25.690447v1
SPORTS: Simultaneous Panoptic Odometry, Rendering, Tracking and Segmentation for Urban Scenes Understanding
Zhiliu Yang, Jinyu Dai, Jianyuan Zhang, Zhu Yang
https://arxiv.org/abs/2510.12749
Long-term visual-to-tactile stimulation induces functional reorganization of thalamic pathways to achieve visual perception https://www.sciencedirect.com/science/article/pii/S105381192500655X using haptic sensory substitution;
MCOP: Multi-UAV Collaborative Occupancy Prediction
Zefu Lin, Wenbo Chen, Xiaojuan Jin, Yuran Yang, Lue Fan, Yixin Zhang, Yufeng Zhang, Zhaoxiang Zhang
https://arxiv.org/abs/2510.12679
Visual–tactile shape perception in Argus II participants: The impact of prolonged device use and blindness on performance https://jov.arvojournals.org/article.aspx?articleid=2810954 "data highlight individual differences in performance over prolonged device use and the …
Detect Anything via Next Point Prediction
Qing Jiang, Junan Huo, Xingyu Chen, Yuda Xiong, Zhaoyang Zeng, Yihao Chen, Tianhe Ren, Junzhi Yu, Lei Zhang
https://arxiv.org/abs/2510.12798
PolygMap: A Perceptive Locomotion Framework for Humanoid Robot Stair Climbing
Bingquan Li, Ning Wang, Tianwei Zhang, Zhicheng He, Yucong Wu
https://arxiv.org/abs/2510.12346 http…
SilvaScenes: Tree Segmentation and Species Classification from Under-Canopy Images in Natural Forests
David-Alexandre Duclos, William Guimont-Martin, Gabriel Jeanson, Arthur Larochelle-Tremblay, Th\'eo Defosse, Fr\'ed\'eric Moore, Philippe Nolet, Fran\c{c}ois Pomerleau, Philippe Gigu\`ere
https://arxiv.org/abs/2510.09458…
Automated Behavior Planning for Fruit Tree Pruning via Redundant Robot Manipulators: Addressing the Behavior Planning Challenge
Gaoyuan Liu, Bas Boom, Naftali Slob, Yuri Durodi\'e, Ann Now\'e, Bram Vanderborght
https://arxiv.org/abs/2510.12509
VITA-VLA: Efficiently Teaching Vision-Language Models to Act via Action Expert Distillation
Shaoqi Dong, Chaoyou Fu, Haihan Gao, Yi-Fan Zhang, Chi Yan, Chu Wu, Xiaoyu Liu, Yunhang Shen, Jing Huo, Deqiang Jiang, Haoyu Cao, Yang Gao, Xing Sun, Ran He, Caifeng Shan
https://arxiv.org/abs/2510.09607
Neural correlates of phosphene perception in blind individuals: A step toward a bidirectional cortical visual prosthesis https://www.science.org/doi/10.1126/sciadv.adv8846 "Cortical prostheses could one day restore functional vision in some blind subjects"
Robot Soccer Kit: Omniwheel Tracked Soccer Robots for Education
Gregoire Passault (UB, LaBRI), Clement Gaspard (UB, LaBRI), Olivier Ly (UB, LaBRI)
https://arxiv.org/abs/2510.11552
Wireless device uses light patterns to deliver information directly to the brain https://medicalxpress.com/news/2025-12-wireless-device-patterns-brain.html
Patterned wireless transcranial optogenetics generates artificial perception (in mice)
Have We Scene It All? Scene Graph-Aware Deep Point Cloud Compression
Nikolaos Stathoulopoulos, Christoforos Kanellakis, George Nikolakopoulos
https://arxiv.org/abs/2510.08512 ht…
Scalable Offline Metrics for Autonomous Driving
Animikh Aich, Adwait Kulkarni, Eshed Ohn-Bar
https://arxiv.org/abs/2510.08571 https://arxiv.org/pdf/2510.08…
Whole Body Model Predictive Control for Spin-Aware Quadrupedal Table Tennis
David Nguyen, Zulfiqar Zaidi, Kevin Karol, Jessica Hodgins, Zhaoming Xie
https://arxiv.org/abs/2510.08754
Online Generic Event Boundary Detection
Hyungrok Jung, Daneul Kim, Seunggyun Lim, Jeany Son, Jonghyun Choi
https://arxiv.org/abs/2510.06855 https://arxiv.o…
MoRe: Monocular Geometry Refinement via Graph Optimization for Cross-View Consistency
Dongki Jung, Jaehoon Choi, Yonghan Lee, Sungmin Eum, Heesung Kwon, Dinesh Manocha
https://arxiv.org/abs/2510.07119 …
FOGMACHINE -- Leveraging Discrete-Event Simulation and Scene Graphs for Modeling Hierarchical, Interconnected Environments under Partial Observations from Mobile Agents
Lars Ohnemus, Nils Hantke, Max Wei{\ss}er, Kai Furmans
https://arxiv.org/abs/2510.09483
Seeing like Meta: Smart glasses and the ethics of augmented reality #AR
SciVideoBench: Benchmarking Scientific Video Reasoning in Large Multimodal Models
Andong Deng, Taojiannan Yang, Shoubin Yu, Lincoln Spencer, Mohit Bansal, Chen Chen, Serena Yeung-Levy, Xiaohan Wang
https://arxiv.org/abs/2510.08559
IntentionVLA: Generalizable and Efficient Embodied Intention Reasoning for Human-Robot Interaction
Yandu Chen, Kefan Gu, Yuqing Wen, Yucheng Zhao, Tiancai Wang, Liqiang Nie
https://arxiv.org/abs/2510.07778
SpatialLadder: Progressive Training for Spatial Reasoning in Vision-Language Models
Hongxing Li, Dingming Li, Zixuan Wang, Yuchen Yan, Hang Wu, Wenqi Zhang, Yongliang Shen, Weiming Lu, Jun Xiao, Yueting Zhuang
https://arxiv.org/abs/2510.08531
Electrotactile characteristics of rectified random noise stimulation (r-tRNS) on the forehead: a comparison with tDCS https://ieeexplore.ieee.org/document/11252854 "r-tRNS has the potential to be effectively utilized in applications requiring electrotactile perception, such as sensor…
NavSpace: How Navigation Agents Follow Spatial Intelligence Instructions
Haolin Yang, Yuxing Long, Zhuoyuan Yu, Zihan Yang, Minghan Wang, Jiapeng Xu, Yihan Wang, Ziyan Yu, Wenzhe Cai, Lei Kang, Hao Dong
https://arxiv.org/abs/2510.08173
Semantic Segmentation Algorithm Based on Light Field and LiDAR Fusion
Jie Luo, Yuxuan Jiang, Xin Jin, Mingyu Liu, Yihui Fan
https://arxiv.org/abs/2510.06687 https://
Active Next-Best-View Optimization for Risk-Averse Path Planning
Amirhossein Mollaei Khass, Guangyi Liu, Vivek Pandey, Wen Jiang, Boshu Lei, Kostas Daniilidis, Nader Motee
https://arxiv.org/abs/2510.06481
Visual Representations inside the Language Model
Benlin Liu, Amita Kamath, Madeleine Grunde-McLaughlin, Winson Han, Ranjay Krishna
https://arxiv.org/abs/2510.04819 https://
Are We Using the Right Benchmark: An Evaluation Framework for Visual Token Compression Methods
Chenfei Liao, Wensong Wang, Zichen Wen, Xu Zheng, Yiyu Wang, Haocong He, Yuanhuiyi Lyu, Lutao Jiang, Xin Zou, Yuqian Fu, Bin Ren, Linfeng Zhang, Xuming Hu
https://arxiv.org/abs/2510.07143
DeRainMamba: A Frequency-Aware State Space Model with Detail Enhancement for Image Deraining
Zhiliang Zhu, Tao Zeng, Tao Yang, Guoliang Luo, Jiyong Zeng
https://arxiv.org/abs/2510.06746
Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models
Yunlong Tang, Jing Bi, Pinxin Liu, Zhenyu Pan, Zhangyun Tan, Qianxiang Shen, Jiani Liu, Hang Hua, Junjia Guo, Yunzhong Xiao, Chao Huang, Zhiyuan Wang, Susan Liang, Xinyi Liu, Yizhi Song, Yuhe Nie, Jia-Xing Zhong, Bozheng Li, Daiqing Qi, Ziyun Zeng, Ali Vosoughi, Luchuan Song, Zeliang Zhang, Daiki Shimada, Han Liu, Jiebo Luo, Chenliang Xu