2025-12-07 13:15:55
Production Over Perception: Kenneth Murray Will Start in Dallas https://insidethestar.com/production-over-perception-kenneth-murray-will-start-in-dallas
Production Over Perception: Kenneth Murray Will Start in Dallas https://insidethestar.com/production-over-perception-kenneth-murray-will-start-in-dallas
(paywalled) Appropriation of perceptual prostheses: an enactive approach to spatial perception https://link.springer.com/chapter/10.1007/978-3-031-91550-5_2 "analysis of the mechanisms of perception through perceptual prostheses", "A coupling device which is…
windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations majored by the tasks in which each individual was asked to sort cards with other surfer’s name in the order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted
The development of spatial perception with and without visual experience #blindness
Visual Representations inside the Language Model
Benlin Liu, Amita Kamath, Madeleine Grunde-McLaughlin, Winson Han, Ranjay Krishna
https://arxiv.org/abs/2510.04819 https://
CLEAR-IR: Clarity-Enhanced Active Reconstruction of Infrared Imagery
Nathan Shankar, Pawel Ladosz, Hujun Yin
https://arxiv.org/abs/2510.04883 https://arxiv…
Replaced article(s) found for cs.HC. https://arxiv.org/list/cs.HC/new
[1/2]:
- Understanding User Perception and Intention to Use Smart Homes for Energy Efficiency: A Survey
Alona Zharova, Hee-Eun Lee
A Study on the Data Distribution Gap in Music Emotion Recognition
Joann Ching, Gerhard Widmer
https://arxiv.org/abs/2510.04688 https://arxiv.org/pdf/2510.0…
The fact that so many on the Fediverse remain so uneducated isn’t surprising. It only reveals how deeply bourgeois ideology continues to shape public consciousness, molding perception to serve the interests of capital, especially US imperialism.
Just as it took decades for many to see the war on terror for what it truly was, and even longer to grasp Palestine’s struggle against imperialism and settler colonialism, the same pattern will repeat.
I wholeheartedly reject imperialis…
The Bayesian Origin of the Probability Weighting Function in Human Representation of Probabilities
Xin Tong, Thi Thu Uyen Hoang, Xue-Xin Wei, Michael Hahn
https://arxiv.org/abs/2510.04698
A4FN: an Agentic AI Architecture for Autonomous Flying Networks
Andr\'e Coelho, Pedro Ribeiro, Helder Fontes, Rui Campos
https://arxiv.org/abs/2510.03829 https://
Neural correlates of phosphene perception in blind individuals: A step toward a bidirectional cortical visual prosthesis https://www.science.org/doi/10.1126/sciadv.adv8846 "Cortical prostheses could one day restore functional vision in some blind subjects"
Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models
Yunlong Tang, Jing Bi, Pinxin Liu, Zhenyu Pan, Zhangyun Tan, Qianxiang Shen, Jiani Liu, Hang Hua, Junjia Guo, Yunzhong Xiao, Chao Huang, Zhiyuan Wang, Susan Liang, Xinyi Liu, Yizhi Song, Yuhe Nie, Jia-Xing Zhong, Bozheng Li, Daiqing Qi, Ziyun Zeng, Ali Vosoughi, Luchuan Song, Zeliang Zhang, Daiki Shimada, Han Liu, Jiebo Luo, Chenliang Xu
Bio-Inspired Robotic Houbara: From Development to Field Deployment for Behavioral Studies
Lyes Saad Saoud, Irfan Hussain
https://arxiv.org/abs/2510.04692 https://
I have long hated Donald Trump with every fiber of my being, and despise his enablers no end. So this new hellscape doesn’t change my perception of him and his administration but it does make me actually nauseous.
Seeing the Bigger Picture: 3D Latent Mapping for Mobile Manipulation Policy Learning
Sunghwan Kim, Woojeh Chung, Zhirui Dai, Dwait Bhatt, Arth Shukla, Hao Su, Yulun Tian, Nikolay Atanasov
https://arxiv.org/abs/2510.03885
"Did Joseph Carl Robnett Licklider (1915-1990) read “An Experiment in Time”? Could it be that he had a series of dreams between 1960 and 1968, and that he quickly wrote them down in his diary before breakfast? We can only speculate. But we do know for a fact that those dreams begat a nothing short of extraordinary sequence of writings."
Perception of Mayfield shifting amid MVP-type run https://www.espn.com/nfl/story/_/id/46534098/perception-bucs-baker-mayfield-shifting-amid-mvp-type-run
windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations majored by the tasks in which each individual was asked to sort cards with other surfer’s name in the order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted
Embodied Rhythms. Interdisciplinary Takes on the Perception of Rhythm in Reality, Arts, Cinematic and Immersive Experience
https://ift.tt/of7ckDd
updated: Tuesday, November 4, 2025 - 9:50amfull name / name of organization: Comunicazioni Sociali:…
via Input 4 RELCFP
📼 A unified model of memory and perception: How Hebbian learning explains our recall of past events
https://medicalxpress.com/news/2025-11-memory-perception-hebbian-recall-events.html
“Alliances are built on common values and a common threat perception,”
said Danish Defense Analyst Jacob Kaarsbo
“Trump shares neither of those with us
and I would argue he doesn’t share it with most Europeans.”
https://www.
Leider sehr nah an Erich Fromms Homophobie. Dennoch: Es geht um Rollen, die egal von welchem Geschlecht, eingenommen werden müssen, um psychisch gesunde Kinder zu erziehen und damit insgesamt zu einer friedlichen Gesellschaft zu kommen.
Beyond Perception: Wie entsteht Krieg? Die Folgen von Kindheitsprägung & Gefühlsstau | Dr. Hans-Joachim Maaz (#201)
Webseite der Episode:
Omni-Captioner: Data Pipeline, Models, and Benchmark for Omni Detailed Perception
Ziyang Ma, Ruiyang Xu, Zhenghao Xing, Yunfei Chu, Yuxuan Wang, Jinzheng He, Jin Xu, Pheng-Ann Heng, Kai Yu, Junyang Lin, Eng Siong Chng, Xie Chen
https://arxiv.org/abs/2510.12720
An evening discussing falling enrollment in #STEM courses at universities across Europe, especially traditional studies like chemistry, geology and meteorology. I wonder if young people are unaware of just how interesting #STEM careers to be? Or do they have the perception it's "too hard" compared to other subjects where easier grades may be had? Or is it simply they think they can have "better"* jobs in other fields?
#AcademicChatter
*Where better might mean higher paid, more prestigious, more certain of employment, or less workload or some combination of all of these... ?
windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations majored by the tasks in which each individual was asked to sort cards with other surfer’s name in the order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted
Spectral peak picking improves tactile speech perception https://www.nature.com/articles/s41598-025-28930-6 "The algorithm is suitable for real-time use in wearable sensory substitution devices and could aid the development of effective haptic hearing aids."
"When #billionaires own major media outlets, “news” becomes something closer to state messaging. Not because the government controls it, but because the financial class that funds political power also owns the platforms that shape public perception."
#kleptocracy
Legacy M…
Electrotactile characteristics of rectified random noise stimulation (r-tRNS) on the forehead: a comparison with tDCS https://ieeexplore.ieee.org/document/11252854 "r-tRNS has the potential to be effectively utilized in applications requiring electrotactile perception, such as sensor…
CHUCKLE -- When Humans Teach AI To Learn Emotions The Easy Way
Ankush Pratap Singh, Houwei Cao, Yong Liu
https://arxiv.org/abs/2510.09382 https://arxiv.org…
I had thought we'd see Ekitike on the left, with Chiesa on the right. Slot, instead, puts Ekitike on the right and leaves Gakpo in.
Happy to see Jones make it into the midfield, as he's been good all year and a) Gravenberch needs to rest that ankle; and b) Mac Allister needs to rest period. Will be interested to see how far forward Szoboszlai plays.
Robbo and Frimpong make sense. I think Bradley has been better than the general perception, but Frimpong should get a run ou…
The BBC publishes the results from its "Our Future, Our BBC" survey, completed by 872K viewers, showing only 43% say it is "effective" in being independent (Jake Kanter/Deadline)
https://deadline.com/2025/10/bbc-study-viewer…
Even #Nature now uses #AI to generate content: Sensory substitution devices and perception https://www.
My lord, to be able to write science this well. This is like the opening of an econ or law paper.
From Donald Hoffman's "The Interface Theory of Perception",
https://sites.socsci.uci.edu/~ddhoff/interface.pdf
The number of Computer Misuse Act 1990 prosecutions hasn't really risen in the last five years, despite the perception of a rise in "cyber crime". Maybe this indicates a ceiling on the resources allocated to investigate and prosecute, or maybe not.
https://questions-state…
Spotlight on Token Perception for Multimodal Reinforcement Learning
Siyuan Huang, Xiaoye Qu, Yafu Li, Yun Luo, Zefeng He, Daizong Liu, Yu Cheng
https://arxiv.org/abs/2510.09285 …
windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations majored by the tasks in which each individual was asked to sort cards with other surfer’s name in the order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted
Although I must admit, I much prefer crystal-clear problems (URL NO CONNECT) with clean fixes over the sort I spend most of my time on, where I can only see the problem through the lens of other people's perception of slow vs. fast across something like a 10k-mile connection. With all the concomitant obscurants, both technical and cultural.
#Sysadminnery
🇺🇦 #NowPlaying on BBCRadio3's #RoundMidnight
SHOLTO:
🎵 Persephone's Perception
#SHOLTO
https://open.spotify.com/track/0mT6XUgnPE9ZpX7sQY79ZG
ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning
Hanyang Chen, Mark Zhao, Rui Yang, Qinwei Ma, Ke Yang, Jiarui Yao, Kangrui Wang, Hao Bai, Zhenhailong Wang, Rui Pan, Mengchao Zhang, Jose Barreiros, Aykut Onol, ChengXiang Zhai, Heng Ji, Manling Li, Huan Zhang, Tong Zhang
https://arxiv…
FRIDAY, October 31st
From 7:30 – 9:30 PM
@ Yale CCAM Sound Art Series
(149 York St, New Haven, CT)
"SIGNIFICANTLY LESS DECEPTIVE"
An audio visual live performance by #Negativland SUE-C
Free, open to the public, but limited capacity – first come first serve!
Trump corrupts the minds of Americans
Normalizes bigotry, misogyny,
graft and greed
From: @apron.rupar@threads.net
https://www.threads.com/@aaron.rupar/post/DP1koWjiWp9
Did you see the sound? A Bayesian perspective on crossmodal perception in low vision https://www.biorxiv.org/content/10.64898/2025.12.24.696433v1 Temporal "audiovisual interactions are constrained by the presence of a usable visual signal"
MLLM as a UI Judge: Benchmarking Multimodal LLMs for Predicting Human Perception of User Interfaces
Reuben A. Luera, Ryan Rossi, Franck Dernoncourt, Samyadeep Basu, Sungchul Kim, Subhojyoti Mukherjee, Puneet Mathur, Ruiyi Zhang, Jihyung Kil, Nedim Lipka, Seunghyun Yoon, Jiuxiang Gu, Zichao Wang, Cindy Xiong Bearfield, Branislav Kveton
https://
The Adoption Paradox: A Comparative Analysis of Veterinary AI Adoption in China and the North America
Shumin Li, Xiaoyun Lai
https://arxiv.org/abs/2510.11758 https://
Beyond Seeing: Evaluating Multimodal LLMs on Tool-Enabled Image Perception, Transformation, and Reasoning
Xingang Guo, Utkarsh Tyagi, Advait Gosai, Paula Vergara, Ernesto Gabriel Hern\'andez Montoya, Chen Bo Calvin Zhang, Bin Hu, Yunzhong He, Bing Liu, Rakshith Sharma Srinivasa
https://arxiv.org/abs/2510.12712
Audio-Guided Visual Perception for Audio-Visual Navigation
Yi Wang, Yinfeng Yu, Fuchun Sun, Liejun Wang, Wendong Zheng
https://arxiv.org/abs/2510.11760 https://
🇺🇦 #NowPlaying on #BBC6Music's #TheHueyShow
SHOLTO:
🎵 Tied To The Mast
#SHOLTO
#newRelease 🆕 single
https://sholto2.bandcamp.com/track/persephones-perception-2
https://open.spotify.com/track/3ZzN6oNgloLOJMNYgs5kJx
Generative Latent Video Compression
Zongyu Guo, Zhaoyang Jia, Jiahao Li, Xiaoyi Zhang, Bin Li, Yan Lu
https://arxiv.org/abs/2510.09987 https://arxiv.org/pd…
Sounds easy, looks nice: Crossmodal transfer of auditory processing fluency to visual object preference https://link.springer.com/article/10.3758/s13414-025-03177-5
Two-stream network-driven vision-based tactile sensor for object feature extraction and fusion perception
Muxing Huang, Zibin Chen, Weiliang Xu, Zilan Li, Yuanzhi Zhou, Guoyuan Zhou, Wenjing Chen, Xinming Li
https://arxiv.org/abs/2510.12528
windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations majored by the tasks in which each individual was asked to sort cards with other surfer’s name in the order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted
Evaluating Small Vision-Language Models on Distance-Dependent Traffic Perception
Nikos Theodoridis, Tim Brophy, Reenu Mohandas, Ganesh Sistu, Fiachra Collins, Anthony Scanlan, Ciaran Eising
https://arxiv.org/abs/2510.08352
New study indicates language, but not music, plays a powerful role in tactile perception https://www.fu-berlin.de/en/presse/informationen/fup/2025/fup_25_166-brain-language-laboratory-miller/index.html "Neuroscientist…
The Third Visual Pathway for Social Perception
David Pitcher
https://arxiv.org/abs/2512.09351 https://arxiv.org/pdf/2512.09351 https://arxiv.org/html/2512.09351
arXiv:2512.09351v1 Announce Type: new
Abstract: Influential models of primate visual cortex describe two functionally distinct pathways: a ventral pathway for object recognition and the dorsal pathway for spatial and action processing. However, recent human and non-human primate research suggests the existence of a third visual pathway projecting from early visual cortex through the motion-selective area V5/MT into the superior temporal sulcus (STS). Here we integrate anatomical, neuroimaging, and neuropsychological evidence demonstrating that this pathway specializes in processing dynamic social cues such as facial expressions, eye gaze, and body movements. This third pathway supports social perception by computing the actions and intentions of other people. These findings enhance our understanding of visual cortical organization and highlight the STS's critical role in social cognition, suggesting that visual processing encompasses a dedicated neural circuit for interpreting socially relevant motion and behavior.
toXiv_bot_toot
Active Semantic Perception
Huayi Tang, Pratik Chaudhari
https://arxiv.org/abs/2510.05430 https://arxiv.org/pdf/2510.05430
Fundamentals of Building Autonomous LLM Agents
Victor de Lamo Castrillo, Habtom Kahsay Gidey, Alexander Lenz, Alois Knoll
https://arxiv.org/abs/2510.09244 https://
Dart, Giants hope win over Eagles shifts narrative https://www.espn.com/nfl/story/_/id/46549170/jaxson-dart-giants-hope-win-eagles-shifts-narrative
Crosslisted article(s) found for cs.LG. https://arxiv.org/list/cs.LG/new
[3/5]:
- Team Xiaomi EV-AD VLA: Learning to Navigate Socially Through Proactive Risk Perception - Technica...
Xiao, Zhang, Tang, Cheng, Xu, Ding, Zhou, Chen, Ye, Hao
windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations majored by the tasks in which each individual was asked to sort cards with other surfer’s name in the order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted
Team Xiaomi EV-AD VLA: Learning to Navigate Socially Through Proactive Risk Perception - Technical Report for IROS 2025 RoboSense Challenge Social Navigation Track
Erjia Xiao, Lingfeng Zhang, Yingbo Tang, Hao Cheng, Renjing Xu, Wenbo Ding, Lei Zhou, Long Chen, Hangjun Ye, Xiaoshuai Hao
https://arxiv.org/abs/2510.07871
BLINK-Twice: You see, but do you observe? A Reasoning Benchmark on Visual Perception
Junyan Ye, Dongzhi Jiang, Jun He, Baichuan Zhou, Zilong Huang, Zhiyuan Yan, Hongsheng Li, Conghui He, Weijia Li
https://arxiv.org/abs/2510.09361
Role of thalamus in human conscious perception revealed by low-intensity focused ultrasound neuromodulation https://www.nature.com/articles/s41467-025-66832-3 these findings "underscore the modulatory potential of thalamocortical networks in shaping visual experience";
Causal role of the individual alpha phase in #multisensory #perception https://www.biorxiv.org/content/10.1101/20
windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations majored by the tasks in which each individual was asked to sort cards with other surfer’s name in the order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted
Injecting Hallucinations in Autonomous Vehicles: A Component-Agnostic Safety Evaluation Framework
Alexandre Moreira Nascimento, Gabriel Kenji Godoy Shimanuki, L\'ucio Flavio Vismari, Jo\~ao Batista Camargo Jr, Jorge Rady de Almeida Jr, Paulo Sergio Cugnasca, Anna Carolina Muller Queiroz, Jeremy Noah Bailenson
https://arxiv.org/abs/2510…
Generative inference unifies feedback processing for learning and perception in natural and artificial vision (here not prosthetic vision) https://www.biorxiv.org/content/10.1101/2025.10.21.683535v2
windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations majored by the tasks in which each individual was asked to sort cards with other surfer’s name in the order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted
Advances in artificial vision systems: a comprehensive review of technologies, applications, and future directions https://link.springer.com/article/10.1007/s13534-025-00513-4 "clinical value hinges on durability, effective resolution, surgical practicality, and user t…
windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations majored by the tasks in which each individual was asked to sort cards with other surfer’s name in the order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted
SPORTS: Simultaneous Panoptic Odometry, Rendering, Tracking and Segmentation for Urban Scenes Understanding
Zhiliu Yang, Jinyu Dai, Jianyuan Zhang, Zhu Yang
https://arxiv.org/abs/2510.12749
MCOP: Multi-UAV Collaborative Occupancy Prediction
Zefu Lin, Wenbo Chen, Xiaojuan Jin, Yuran Yang, Lue Fan, Yixin Zhang, Yufeng Zhang, Zhaoxiang Zhang
https://arxiv.org/abs/2510.12679
Causal role of the individual alpha phase in multisensory perception https://www.biorxiv.org/content/10.1101/2025.11.11.687884v2 more info in the X thread
SilvaScenes: Tree Segmentation and Species Classification from Under-Canopy Images in Natural Forests
David-Alexandre Duclos, William Guimont-Martin, Gabriel Jeanson, Arthur Larochelle-Tremblay, Th\'eo Defosse, Fr\'ed\'eric Moore, Philippe Nolet, Fran\c{c}ois Pomerleau, Philippe Gigu\`ere
https://arxiv.org/abs/2510.09458…
Enhanced pitch perception in early blind individuals and musicians is due to reduced internal noise https://www.biorxiv.org/content/10.1101/2025.11.25.690447v1
PolygMap: A Perceptive Locomotion Framework for Humanoid Robot Stair Climbing
Bingquan Li, Ning Wang, Tianwei Zhang, Zhicheng He, Yucong Wu
https://arxiv.org/abs/2510.12346 http…
Long-term visual-to-tactile stimulation induces functional reorganization of thalamic pathways to achieve visual perception https://www.sciencedirect.com/science/article/pii/S105381192500655X using haptic sensory substitution;
Detect Anything via Next Point Prediction
Qing Jiang, Junan Huo, Xingyu Chen, Yuda Xiong, Zhaoyang Zeng, Yihao Chen, Tianhe Ren, Junzhi Yu, Lei Zhang
https://arxiv.org/abs/2510.12798
Automated Behavior Planning for Fruit Tree Pruning via Redundant Robot Manipulators: Addressing the Behavior Planning Challenge
Gaoyuan Liu, Bas Boom, Naftali Slob, Yuri Durodi\'e, Ann Now\'e, Bram Vanderborght
https://arxiv.org/abs/2510.12509
VITA-VLA: Efficiently Teaching Vision-Language Models to Act via Action Expert Distillation
Shaoqi Dong, Chaoyou Fu, Haihan Gao, Yi-Fan Zhang, Chi Yan, Chu Wu, Xiaoyu Liu, Yunhang Shen, Jing Huo, Deqiang Jiang, Haoyu Cao, Yang Gao, Xing Sun, Ran He, Caifeng Shan
https://arxiv.org/abs/2510.09607
Visual–tactile shape perception in Argus II participants: The impact of prolonged device use and blindness on performance https://jov.arvojournals.org/article.aspx?articleid=2810954 "data highlight individual differences in performance over prolonged device use and the …
Scalable Offline Metrics for Autonomous Driving
Animikh Aich, Adwait Kulkarni, Eshed Ohn-Bar
https://arxiv.org/abs/2510.08571 https://arxiv.org/pdf/2510.08…
Have We Scene It All? Scene Graph-Aware Deep Point Cloud Compression
Nikolaos Stathoulopoulos, Christoforos Kanellakis, George Nikolakopoulos
https://arxiv.org/abs/2510.08512 ht…
Robot Soccer Kit: Omniwheel Tracked Soccer Robots for Education
Gregoire Passault (UB, LaBRI), Clement Gaspard (UB, LaBRI), Olivier Ly (UB, LaBRI)
https://arxiv.org/abs/2510.11552
Online Generic Event Boundary Detection
Hyungrok Jung, Daneul Kim, Seunggyun Lim, Jeany Son, Jonghyun Choi
https://arxiv.org/abs/2510.06855 https://arxiv.o…
Whole Body Model Predictive Control for Spin-Aware Quadrupedal Table Tennis
David Nguyen, Zulfiqar Zaidi, Kevin Karol, Jessica Hodgins, Zhaoming Xie
https://arxiv.org/abs/2510.08754
MoRe: Monocular Geometry Refinement via Graph Optimization for Cross-View Consistency
Dongki Jung, Jaehoon Choi, Yonghan Lee, Sungmin Eum, Heesung Kwon, Dinesh Manocha
https://arxiv.org/abs/2510.07119 …
FOGMACHINE -- Leveraging Discrete-Event Simulation and Scene Graphs for Modeling Hierarchical, Interconnected Environments under Partial Observations from Mobile Agents
Lars Ohnemus, Nils Hantke, Max Wei{\ss}er, Kai Furmans
https://arxiv.org/abs/2510.09483
Wireless device uses light patterns to deliver information directly to the brain https://medicalxpress.com/news/2025-12-wireless-device-patterns-brain.html
Patterned wireless transcranial optogenetics generates artificial perception (in mice)
SciVideoBench: Benchmarking Scientific Video Reasoning in Large Multimodal Models
Andong Deng, Taojiannan Yang, Shoubin Yu, Lincoln Spencer, Mohit Bansal, Chen Chen, Serena Yeung-Levy, Xiaohan Wang
https://arxiv.org/abs/2510.08559
IntentionVLA: Generalizable and Efficient Embodied Intention Reasoning for Human-Robot Interaction
Yandu Chen, Kefan Gu, Yuqing Wen, Yucheng Zhao, Tiancai Wang, Liqiang Nie
https://arxiv.org/abs/2510.07778
SpatialLadder: Progressive Training for Spatial Reasoning in Vision-Language Models
Hongxing Li, Dingming Li, Zixuan Wang, Yuchen Yan, Hang Wu, Wenqi Zhang, Yongliang Shen, Weiming Lu, Jun Xiao, Yueting Zhuang
https://arxiv.org/abs/2510.08531
Active Next-Best-View Optimization for Risk-Averse Path Planning
Amirhossein Mollaei Khass, Guangyi Liu, Vivek Pandey, Wen Jiang, Boshu Lei, Kostas Daniilidis, Nader Motee
https://arxiv.org/abs/2510.06481
Semantic Segmentation Algorithm Based on Light Field and LiDAR Fusion
Jie Luo, Yuxuan Jiang, Xin Jin, Mingyu Liu, Yihui Fan
https://arxiv.org/abs/2510.06687 https://
NavSpace: How Navigation Agents Follow Spatial Intelligence Instructions
Haolin Yang, Yuxing Long, Zhuoyuan Yu, Zihan Yang, Minghan Wang, Jiapeng Xu, Yihan Wang, Ziyan Yu, Wenzhe Cai, Lei Kang, Hao Dong
https://arxiv.org/abs/2510.08173
Are We Using the Right Benchmark: An Evaluation Framework for Visual Token Compression Methods
Chenfei Liao, Wensong Wang, Zichen Wen, Xu Zheng, Yiyu Wang, Haocong He, Yuanhuiyi Lyu, Lutao Jiang, Xin Zou, Yuqian Fu, Bin Ren, Linfeng Zhang, Xuming Hu
https://arxiv.org/abs/2510.07143
DeRainMamba: A Frequency-Aware State Space Model with Detail Enhancement for Image Deraining
Zhiliang Zhu, Tao Zeng, Tao Yang, Guoliang Luo, Jiyong Zeng
https://arxiv.org/abs/2510.06746