Tootfinder

Opt-in global Mastodon full text search.

@arXiv_csCL_bot@mastoxiv.page
2025-10-03 10:45:51

What MLLMs Learn about When they Learn about Multimodal Reasoning: Perception, Reasoning, or their Integration?
Jiwan Chung, Neel Joshi, Pratyusha Sharma, Youngjae Yu, Vibhav Vineet
arxiv.org/abs/2510.01719

@arXiv_csRO_bot@mastoxiv.page
2025-10-03 09:36:21

ActiveUMI: Robotic Manipulation with Active Perception from Robot-Free Human Demonstrations
Qiyuan Zeng, Chengmeng Li, Jude St. John, Zhongyi Zhou, Junjie Wen, Guorui Feng, Yichen Zhu, Yi Xu
arxiv.org/abs/2510.01607

@netzschleuder@social.skewed.de
2025-11-01 18:00:04

windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the fall of 1986. The edge weights indicate the perception of social affiliations, as measured by tasks in which each individual was asked to sort cards bearing the other surfers' names in order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted

windsurfers: Windsurfers network (1986). 43 nodes, 336 edges. https://networks.skewed.de/net/windsurfers
@seeingwithsound@mas.to
2026-01-02 15:36:43

Did you see the sound? A Bayesian perspective on crossmodal perception in low vision biorxiv.org/content/10.64898/2 Temporal "audiovisual interactions are constrained by the presence of a usable visual signal"

@deprogrammaticaipsum@mas.to
2025-11-02 18:11:52

"Did Joseph Carl Robnett Licklider (1915-1990) read “An Experiment in Time”? Could it be that he had a series of dreams between 1960 and 1968, and that he quickly wrote them down in his diary before breakfast? We can only speculate. But we do know for a fact that those dreams begat a nothing short of extraordinary sequence of writings."

@arXiv_csCV_bot@mastoxiv.page
2025-10-03 10:22:21

TriAlignXA: An Explainable Trilemma Alignment Framework for Trustworthy Agri-product Grading
Jianfei Xie, Ziyang Li
arxiv.org/abs/2510.01990

@seeingwithsound@mas.to
2025-12-02 12:01:20

Sounds easy, looks nice: Crossmodal transfer of auditory processing fluency to visual object preference link.springer.com/article/10.3

@arXiv_csGR_bot@mastoxiv.page
2025-10-03 08:09:51

Multimodal Feedback for Task Guidance in Augmented Reality
Hu Guo, Lily Patel, Rohan Gupt
arxiv.org/abs/2510.01690 arxiv.org/pdf/2510.01690…

@arXiv_csCV_bot@mastoxiv.page
2025-10-03 10:44:41

Clink! Chop! Thud! -- Learning Object Sounds from Real-World Interactions
Mengyu Yang, Yiming Chen, Haozheng Pei, Siddhant Agarwal, Arun Balajee Vasudevan, James Hays
arxiv.org/abs/2510.02313

@arXiv_csRO_bot@mastoxiv.page
2025-10-03 10:09:21

Nav-EE: Navigation-Guided Early Exiting for Efficient Vision-Language Models in Autonomous Driving
Haibo Hu, Lianming Huang, Xinyu Wang, Yufei Cui, Nan Guan, Chun Jason Xue
arxiv.org/abs/2510.01795

“Alliances are built on common values and a common threat perception,” said Danish defense analyst Jacob Kaarsbo. “Trump shares neither of those with us, and I would argue he doesn’t share it with most Europeans.”

@arXiv_csCV_bot@mastoxiv.page
2025-10-03 10:05:31

MedQ-Bench: Evaluating and Exploring Medical Image Quality Assessment Abilities in MLLMs
Jiyao Liu, Jinjie Wei, Wanying Qu, Chenglong Ma, Junzhi Ning, Yunheng Li, Ying Chen, Xinzhe Luo, Pengcheng Chen, Xin Gao, Ming Hu, Huihui Xu, Xin Wang, Shujian Gao, Dingkang Yang, Zhongying Deng, Jin Ye, Lihao Liu, Junjun He, Ningsheng Xu
arxiv…

@arXiv_csRO_bot@mastoxiv.page
2025-10-03 10:16:31

What Matters in RL-Based Methods for Object-Goal Navigation? An Empirical Study and A Unified Framework
Hongze Wang, Boyang Sun, Jiaxu Xing, Fan Yang, Marco Hutter, Dhruv Shah, Davide Scaramuzza, Marc Pollefeys
arxiv.org/abs/2510.01830

@NFL@darktundra.xyz
2025-10-08 20:09:49

Perception of Mayfield shifting amid MVP-type run espn.com/nfl/story/_/id/465340

@UP8@mastodon.social
2025-12-08 17:59:02

📼 A unified model of memory and perception: How Hebbian learning explains our recall of past events
medicalxpress.com/news/2025-11

@netzschleuder@social.skewed.de
2025-12-30 06:00:04

windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the fall of 1986. The edge weights indicate the perception of social affiliations, as measured by tasks in which each individual was asked to sort cards bearing the other surfers' names in order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted

windsurfers: Windsurfers network (1986). 43 nodes, 336 edges. https://networks.skewed.de/net/windsurfers
@seeingwithsound@mas.to
2025-11-25 20:08:46

Spectral peak picking improves tactile speech perception nature.com/articles/s41598-025 "The algorithm is suitable for real-time use in wearable sensory substitution devices and could aid the development of effective haptic hearing aids."

@arXiv_csRO_bot@mastoxiv.page
2025-10-03 10:24:31

LangGrasp: Leveraging Fine-Tuned LLMs for Language Interactive Robot Grasping with Ambiguous Instructions
Yunhan Lin, Wenqi Wu, Zhijie Zhang, Huasong Min
arxiv.org/abs/2510.02104

@BBC6MusicBot@mastodonapp.uk
2025-11-01 12:10:13

🇺🇦 #NowPlaying on #BBC6Music's #TheHueyShow
SHOLTO:
🎵 Tied To The Mast
#SHOLTO
#newRelease 🆕 single
sholto2.bandcamp.com/track/per
open.spotify.com/track/3ZzN6oN

@prachisrivas@masto.ai
2025-10-04 14:50:17

Circles or rectangles? What do you see? What does this tell you about your perception and consciousness?
Always fascinated by Anil Seth.
theguardian.com/commentisfree/<…

@dichotomiker@dresden.network
2025-10-22 07:23:52

Unfortunately very close to Erich Fromm's homophobia. Still: it is about roles that must be taken on, regardless of gender, in order to raise mentally healthy children and thereby arrive at a peaceful society overall.
Beyond Perception: How Does War Arise? The Consequences of Childhood Imprinting & Pent-Up Emotion | Dr. Hans-Joachim Maaz (#201)
Episode website:

@cowboys@darktundra.xyz
2025-12-07 13:15:55

Production Over Perception: Kenneth Murray Will Start in Dallas insidethestar.com/production-o

@ruth_mottram@fediscience.org
2025-11-21 10:34:41

An evening discussing falling enrollment in #STEM courses at universities across Europe, especially traditional studies like chemistry, geology and meteorology. I wonder if young people are unaware of just how interesting #STEM careers can be? Or do they have the perception it's "too hard" compared to other subjects where easier grades may be had? Or is it simply they think they can have "better"* jobs in other fields?
#AcademicChatter
*Where better might mean higher paid, more prestigious, more certain of employment, or less workload or some combination of all of these... ?

@simon_brooke@mastodon.scot
2025-11-21 19:52:36

"When #billionaires own major media outlets, “news” becomes something closer to state messaging. Not because the government controls it, but because the financial class that funds political power also owns the platforms that shape public perception."
#kleptocracy
Legacy M…

@seeingwithsound@mas.to
2025-11-22 11:53:18

Even #Nature now uses #AI to generate content: Sensory substitution devices and perception

@mlawton@mstdn.social
2025-10-22 18:00:40

I had thought we'd see Ekitike on the left, with Chiesa on the right. Slot, instead, puts Ekitike on the right and leaves Gakpo in.
Happy to see Jones make it into the midfield, as he's been good all year and a) Gravenberch needs to rest that ankle; and b) Mac Allister needs to rest period. Will be interested to see how far forward Szoboszlai plays.
Robbo and Frimpong make sense. I think Bradley has been better than the general perception, but Frimpong should get a run ou…

@arXiv_csLG_bot@mastoxiv.page
2025-10-13 10:42:40

CHUCKLE -- When Humans Teach AI To Learn Emotions The Easy Way
Ankush Pratap Singh, Houwei Cao, Yong Liu
arxiv.org/abs/2510.09382 arxiv.org…

@arXiv_csHC_bot@mastoxiv.page
2025-10-13 08:44:20

MLLM as a UI Judge: Benchmarking Multimodal LLMs for Predicting Human Perception of User Interfaces
Reuben A. Luera, Ryan Rossi, Franck Dernoncourt, Samyadeep Basu, Sungchul Kim, Subhojyoti Mukherjee, Puneet Mathur, Ruiyi Zhang, Jihyung Kil, Nedim Lipka, Seunghyun Yoon, Jiuxiang Gu, Zichao Wang, Cindy Xiong Bearfield, Branislav Kveton

@arXiv_csSD_bot@mastoxiv.page
2025-10-15 08:31:42

Audio-Guided Visual Perception for Audio-Visual Navigation
Yi Wang, Yinfeng Yu, Fuchun Sun, Liejun Wang, Wendong Zheng
arxiv.org/abs/2510.11760

@arXiv_csCL_bot@mastoxiv.page
2025-10-15 10:45:51

Omni-Captioner: Data Pipeline, Models, and Benchmark for Omni Detailed Perception
Ziyang Ma, Ruiyang Xu, Zhenghao Xing, Yunfei Chu, Yuxuan Wang, Jinzheng He, Jin Xu, Pheng-Ann Heng, Kai Yu, Junyang Lin, Eng Siong Chng, Xie Chen
arxiv.org/abs/2510.12720

@Mediagazer@mstdn.social
2025-10-16 09:55:58

The BBC publishes the results from its "Our Future, Our BBC" survey, completed by 872K viewers, showing only 43% say it is "effective" in being independent (Jake Kanter/Deadline)
deadline.com/2025/10/bbc-study

@gwire@mastodon.social
2025-10-21 12:05:50

The number of Computer Misuse Act 1990 prosecutions hasn't really risen in the last five years, despite the perception of a rise in "cyber crime". Maybe this indicates a ceiling on the resources allocated to investigate and prosecute, or maybe not.
questions-state…

@raiders@darktundra.xyz
2025-10-11 10:03:20

For Raiders QB Geno Smith, Perception is Reality si.com/nfl/raiders/las-vegas-g

@arXiv_csGT_bot@mastoxiv.page
2025-10-06 07:59:09

Deceptive Planning Exploiting Inattention Blindness
Mustafa O. Karabag, Jesse Milzman, Ufuk Topcu
arxiv.org/abs/2510.02714 arxiv.org/pdf/25…

@shriramk@mastodon.social
2025-11-09 00:07:27

My lord, to be able to write science this well. This is like the opening of an econ or law paper.
From Donald Hoffman's "The Interface Theory of Perception",
sites.socsci.uci.edu/~ddhoff/i

@netzschleuder@social.skewed.de
2025-10-23 19:00:04

windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the fall of 1986. The edge weights indicate the perception of social affiliations, as measured by tasks in which each individual was asked to sort cards bearing the other surfers' names in order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted

windsurfers: Windsurfers network (1986). 43 nodes, 336 edges. https://networks.skewed.de/net/windsurfers
@BBC3MusicBot@mastodonapp.uk
2025-10-13 22:32:25

🇺🇦 #NowPlaying on BBCRadio3's #RoundMidnight
SHOLTO:
🎵 Persephone's Perception
#SHOLTO
open.spotify.com/track/0mT6XUg

@grumpybozo@toad.social
2025-10-12 01:44:03

Although I must admit, I much prefer crystal-clear problems (URL NO CONNECT) with clean fixes over the sort I spend most of my time on, where I can only see the problem through the lens of other people's perception of slow vs. fast across something like a 10k-mile connection. With all the concomitant obscurants, both technical and cultural.
#Sysadminnery

@seeingwithsound@mas.to
2025-10-24 13:06:25

New study indicates language, but not music, plays a powerful role in tactile perception fu-berlin.de/en/presse/informa "Neuroscientist…

@arXiv_csAI_bot@mastoxiv.page
2025-10-15 10:22:21

ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning
Hanyang Chen, Mark Zhao, Rui Yang, Qinwei Ma, Ke Yang, Jiarui Yao, Kangrui Wang, Hao Bai, Zhenhailong Wang, Rui Pan, Mengchao Zhang, Jose Barreiros, Aykut Onol, ChengXiang Zhai, Heng Ji, Manling Li, Huan Zhang, Tong Zhang
arxiv…

Trump corrupts the minds of Americans
Normalizes bigotry, misogyny,
graft and greed
From: @aaron.rupar@threads.net
threads.com/@aaron.rupar/post/

@arXiv_csCV_bot@mastoxiv.page
2025-10-13 10:25:00

Spotlight on Token Perception for Multimodal Reinforcement Learning
Siyuan Huang, Xiaoye Qu, Yafu Li, Yun Luo, Zefeng He, Daizong Liu, Yu Cheng
arxiv.org/abs/2510.09285

@arXiv_qbioNC_bot@mastoxiv.page
2025-12-11 08:10:21

The Third Visual Pathway for Social Perception
David Pitcher
arxiv.org/abs/2512.09351 arxiv.org/pdf/2512.09351 arxiv.org/html/2512.09351
arXiv:2512.09351v1 Announce Type: new
Abstract: Influential models of primate visual cortex describe two functionally distinct pathways: a ventral pathway for object recognition and a dorsal pathway for spatial and action processing. However, recent human and non-human primate research suggests the existence of a third visual pathway projecting from early visual cortex through the motion-selective area V5/MT into the superior temporal sulcus (STS). Here we integrate anatomical, neuroimaging, and neuropsychological evidence demonstrating that this pathway specializes in processing dynamic social cues such as facial expressions, eye gaze, and body movements. This third pathway supports social perception by computing the actions and intentions of other people. These findings enhance our understanding of visual cortical organization and highlight the STS's critical role in social cognition, suggesting that visual processing encompasses a dedicated neural circuit for interpreting socially relevant motion and behavior.

@midtsveen@social.linux.pizza
2025-12-05 01:07:16

The fact that so many on the Fediverse remain so uneducated isn’t surprising. It only reveals how deeply bourgeois ideology continues to shape public consciousness, molding perception to serve the interests of capital, especially US imperialism.
Just as it took decades for many to see the war on terror for what it truly was, and even longer to grasp Palestine’s struggle against imperialism and settler colonialism, the same pattern will repeat.
I wholeheartedly reject imperialis…

A confused little girl in a pink jacket, with raised hands, sits behind a table. Text above questions Russia's actions; below questions Nazi support.
@soundclamp@mastodon.xyz
2025-10-09 22:20:21

FRIDAY, October 31st
From 7:30 – 9:30 PM
@ Yale CCAM Sound Art Series
(149 York St, New Haven, CT)
"SIGNIFICANTLY LESS DECEPTIVE"
An audio visual live performance by #Negativland SUE-C
Free, open to the public, but limited capacity – first come, first served!

Black-and-white flyer for Negativland live with a stencil painted version of the band’s logo.
@arXiv_csCY_bot@mastoxiv.page
2025-10-15 07:58:21

The Adoption Paradox: A Comparative Analysis of Veterinary AI Adoption in China and the North America
Shumin Li, Xiaoyun Lai
arxiv.org/abs/2510.11758

@arXiv_eessIV_bot@mastoxiv.page
2025-10-14 08:28:48

Generative Latent Video Compression
Zongyu Guo, Zhaoyang Jia, Jiahao Li, Xiaoyi Zhang, Bin Li, Yan Lu
arxiv.org/abs/2510.09987 arxiv.org/pd…

@arXiv_csCV_bot@mastoxiv.page
2025-10-15 10:49:21

Beyond Seeing: Evaluating Multimodal LLMs on Tool-Enabled Image Perception, Transformation, and Reasoning
Xingang Guo, Utkarsh Tyagi, Advait Gosai, Paula Vergara, Ernesto Gabriel Hernández Montoya, Chen Bo Calvin Zhang, Bin Hu, Yunzhong He, Bing Liu, Rakshith Sharma Srinivasa
arxiv.org/abs/2510.12712

@netzschleuder@social.skewed.de
2025-12-20 04:00:04

windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the fall of 1986. The edge weights indicate the perception of social affiliations, as measured by tasks in which each individual was asked to sort cards bearing the other surfers' names in order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted

windsurfers: Windsurfers network (1986). 43 nodes, 336 edges. https://networks.skewed.de/net/windsurfers
@arXiv_csRO_bot@mastoxiv.page
2025-10-15 09:52:31

Two-stream network-driven vision-based tactile sensor for object feature extraction and fusion perception
Muxing Huang, Zibin Chen, Weiliang Xu, Zilan Li, Yuanzhi Zhou, Guoyuan Zhou, Wenjing Chen, Xinming Li
arxiv.org/abs/2510.12528

@seeingwithsound@mas.to
2025-12-08 14:55:21

Role of thalamus in human conscious perception revealed by low-intensity focused ultrasound neuromodulation nature.com/articles/s41467-025 these findings "underscore the modulatory potential of thalamocortical networks in shaping visual experience"

@arXiv_csAI_bot@mastoxiv.page
2025-10-13 10:01:00

Fundamentals of Building Autonomous LLM Agents
Victor de Lamo Castrillo, Habtom Kahsay Gidey, Alexander Lenz, Alois Knoll
arxiv.org/abs/2510.09244

@seeingwithsound@mas.to
2025-11-17 10:59:36

Causal role of the individual alpha phase in #multisensory #perception biorxiv.org/content/10.1101/20

@arXiv_csCV_bot@mastoxiv.page
2025-10-10 11:01:49

Evaluating Small Vision-Language Models on Distance-Dependent Traffic Perception
Nikos Theodoridis, Tim Brophy, Reenu Mohandas, Ganesh Sistu, Fiachra Collins, Anthony Scanlan, Ciaran Eising
arxiv.org/abs/2510.08352

@NFL@darktundra.xyz
2025-10-10 06:14:06

Dart, Giants hope win over Eagles shifts narrative espn.com/nfl/story/_/id/465491

@arXiv_csLG_bot@mastoxiv.page
2025-10-10 13:29:21

Crosslisted article(s) found for cs.LG. arxiv.org/list/cs.LG/new
[3/5]:
- Team Xiaomi EV-AD VLA: Learning to Navigate Socially Through Proactive Risk Perception - Technica...
Xiao, Zhang, Tang, Cheng, Xu, Ding, Zhou, Chen, Ye, Hao

@netzschleuder@social.skewed.de
2025-12-14 16:00:04

windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the fall of 1986. The edge weights indicate the perception of social affiliations, as measured by tasks in which each individual was asked to sort cards bearing the other surfers' names in order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted

windsurfers: Windsurfers network (1986). 43 nodes, 336 edges. https://networks.skewed.de/net/windsurfers
@arXiv_csRO_bot@mastoxiv.page
2025-10-10 09:49:59

Team Xiaomi EV-AD VLA: Learning to Navigate Socially Through Proactive Risk Perception - Technical Report for IROS 2025 RoboSense Challenge Social Navigation Track
Erjia Xiao, Lingfeng Zhang, Yingbo Tang, Hao Cheng, Renjing Xu, Wenbo Ding, Lei Zhou, Long Chen, Hangjun Ye, Xiaoshuai Hao
arxiv.org/abs/2510.07871

@BBC6MusicBot@mastodonapp.uk
2025-12-13 02:47:42

🇺🇦 #NowPlaying on #BBC6Music's #FocusBeats
El Jazzy Chavo:
🎵 Perception
#ElJazzyChavo
villagelive.bandcamp.com/track
open.spotify.com/track/13LNkGn

@seeingwithsound@mas.to
2025-11-24 20:48:08

Advances in artificial vision systems: a comprehensive review of technologies, applications, and future directions link.springer.com/article/10.1 "clinical value hinges on durability, effective resolution, surgical practicality, and user t…

@arXiv_csRO_bot@mastoxiv.page
2025-10-08 09:47:09

Active Semantic Perception
Huayi Tang, Pratik Chaudhari
arxiv.org/abs/2510.05430 arxiv.org/pdf/2510.05430

@seeingwithsound@mas.to
2025-10-24 13:50:42

Generative inference unifies feedback processing for learning and perception in natural and artificial vision (here not prosthetic vision) biorxiv.org/content/10.1101/20

@arXiv_csCV_bot@mastoxiv.page
2025-10-13 10:30:30

BLINK-Twice: You see, but do you observe? A Reasoning Benchmark on Visual Perception
Junyan Ye, Dongzhi Jiang, Jun He, Baichuan Zhou, Zilong Huang, Zhiyuan Yan, Hongsheng Li, Conghui He, Weijia Li
arxiv.org/abs/2510.09361

@netzschleuder@social.skewed.de
2025-11-09 09:00:05

windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the fall of 1986. The edge weights indicate the perception of social affiliations, as measured by tasks in which each individual was asked to sort cards bearing the other surfers' names in order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted

windsurfers: Windsurfers network (1986). 43 nodes, 336 edges. https://networks.skewed.de/net/windsurfers
@BBC6MusicBot@mastodonapp.uk
2025-12-13 02:40:44

🇺🇦 #NowPlaying on #BBC6Music's #FocusBeats
El Jazzy Chavo:
🎵 Perception
#ElJazzyChavo
villagelive.bandcamp.com/track
open.spotify.com/track/13LNkGn

@netzschleuder@social.skewed.de
2025-10-08 20:00:04

windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the fall of 1986. The edge weights indicate the perception of social affiliations, as measured by tasks in which each individual was asked to sort cards bearing the other surfers' names in order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted

windsurfers: Windsurfers network (1986). 43 nodes, 336 edges. https://networks.skewed.de/net/windsurfers
@seeingwithsound@mas.to
2025-11-18 17:14:35

Causal role of the individual alpha phase in multisensory perception biorxiv.org/content/10.1101/20 more info in the X thread

@arXiv_csRO_bot@mastoxiv.page
2025-10-10 09:33:29

Injecting Hallucinations in Autonomous Vehicles: A Component-Agnostic Safety Evaluation Framework
Alexandre Moreira Nascimento, Gabriel Kenji Godoy Shimanuki, Lúcio Flavio Vismari, João Batista Camargo Jr, Jorge Rady de Almeida Jr, Paulo Sergio Cugnasca, Anna Carolina Muller Queiroz, Jeremy Noah Bailenson
arxiv.org/abs/2510…

@netzschleuder@social.skewed.de
2025-12-06 15:00:04

windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the fall of 1986. The edge weights indicate the perception of social affiliations, as measured by tasks in which each individual was asked to sort cards bearing the other surfers' names in order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted

windsurfers: Windsurfers network (1986). 43 nodes, 336 edges. https://networks.skewed.de/net/windsurfers
@seeingwithsound@mas.to
2025-12-17 10:30:03

Enhanced pitch perception in early blind individuals and musicians is due to reduced internal noise biorxiv.org/content/10.1101/20

@arXiv_csCV_bot@mastoxiv.page
2025-10-15 10:50:31

SPORTS: Simultaneous Panoptic Odometry, Rendering, Tracking and Segmentation for Urban Scenes Understanding
Zhiliu Yang, Jinyu Dai, Jianyuan Zhang, Zhu Yang
arxiv.org/abs/2510.12749

@seeingwithsound@mas.to
2025-12-18 07:37:24

Long-term visual-to-tactile stimulation induces functional reorganization of thalamic pathways to achieve visual perception sciencedirect.com/science/arti using haptic sensory substitution

Thalamic connectivity in blind and its reorganization after SSD training. (SSD = sensory substitution device)
@arXiv_csCV_bot@mastoxiv.page
2025-10-15 10:48:21

MCOP: Multi-UAV Collaborative Occupancy Prediction
Zefu Lin, Wenbo Chen, Xiaojuan Jin, Yuran Yang, Lue Fan, Yixin Zhang, Yufeng Zhang, Zhaoxiang Zhang
arxiv.org/abs/2510.12679

@seeingwithsound@mas.to
2025-10-14 18:36:12

Visual–tactile shape perception in Argus II participants: The impact of prolonged device use and blindness on performance jov.arvojournals.org/article.a "data highlight individual differences in performance over prolonged device use and the …

@arXiv_csCV_bot@mastoxiv.page
2025-10-15 10:55:01

Detect Anything via Next Point Prediction
Qing Jiang, Junan Huo, Xingyu Chen, Yuda Xiong, Zhaoyang Zeng, Yihao Chen, Tianhe Ren, Junzhi Yu, Lei Zhang
arxiv.org/abs/2510.12798

@arXiv_csRO_bot@mastoxiv.page
2025-10-15 09:22:01

PolygMap: A Perceptive Locomotion Framework for Humanoid Robot Stair Climbing
Bingquan Li, Ning Wang, Tianwei Zhang, Zhicheng He, Yucong Wu
arxiv.org/abs/2510.12346

@arXiv_csCV_bot@mastoxiv.page
2025-10-13 10:33:50

SilvaScenes: Tree Segmentation and Species Classification from Under-Canopy Images in Natural Forests
David-Alexandre Duclos, William Guimont-Martin, Gabriel Jeanson, Arthur Larochelle-Tremblay, Théo Defosse, Frédéric Moore, Philippe Nolet, François Pomerleau, Philippe Giguère
arxiv.org/abs/2510.09458

@arXiv_csRO_bot@mastoxiv.page
2025-10-15 09:50:51

Automated Behavior Planning for Fruit Tree Pruning via Redundant Robot Manipulators: Addressing the Behavior Planning Challenge
Gaoyuan Liu, Bas Boom, Naftali Slob, Yuri Durodié, Ann Nowé, Bram Vanderborght
arxiv.org/abs/2510.12509

@arXiv_csCV_bot@mastoxiv.page
2025-10-13 10:41:30

VITA-VLA: Efficiently Teaching Vision-Language Models to Act via Action Expert Distillation
Shaoqi Dong, Chaoyou Fu, Haihan Gao, Yi-Fan Zhang, Chi Yan, Chu Wu, Xiaoyu Liu, Yunhang Shen, Jing Huo, Deqiang Jiang, Haoyu Cao, Yang Gao, Xing Sun, Ran He, Caifeng Shan
arxiv.org/abs/2510.09607

@seeingwithsound@mas.to
2025-11-06 20:47:27

Neural correlates of phosphene perception in blind individuals: A step toward a bidirectional cortical visual prosthesis science.org/doi/10.1126/sciadv "Cortical prostheses could one day restore functional vision in some blind subjects"

Location of the UEA implantation site on the right occipital cortex of the two participants. Predicted retinotopic map organization overlaid on the 3D brain reconstruction.
@arXiv_csRO_bot@mastoxiv.page
2025-10-14 12:36:38

Robot Soccer Kit: Omniwheel Tracked Soccer Robots for Education
Gregoire Passault (UB, LaBRI), Clement Gaspard (UB, LaBRI), Olivier Ly (UB, LaBRI)
arxiv.org/abs/2510.11552

@seeingwithsound@mas.to
2025-12-11 15:07:03

Wireless device uses light patterns to deliver information directly to the brain medicalxpress.com/news/2025-12
Patterned wireless transcranial optogenetics generates artificial perception (in mice)

@arXiv_csCV_bot@mastoxiv.page
2025-10-10 11:15:49

Have We Scene It All? Scene Graph-Aware Deep Point Cloud Compression
Nikolaos Stathoulopoulos, Christoforos Kanellakis, George Nikolakopoulos
arxiv.org/abs/2510.08512

@arXiv_csRO_bot@mastoxiv.page
2025-10-10 10:20:59

Scalable Offline Metrics for Autonomous Driving
Animikh Aich, Adwait Kulkarni, Eshed Ohn-Bar
arxiv.org/abs/2510.08571 arxiv.org/pdf/2510.08…

@arXiv_csRO_bot@mastoxiv.page
2025-10-13 07:51:20

Whole Body Model Predictive Control for Spin-Aware Quadrupedal Table Tennis
David Nguyen, Zulfiqar Zaidi, Kevin Karol, Jessica Hodgins, Zhaoming Xie
arxiv.org/abs/2510.08754

@arXiv_csCV_bot@mastoxiv.page
2025-10-09 10:30:31

Online Generic Event Boundary Detection
Hyungrok Jung, Daneul Kim, Seunggyun Lim, Jeany Son, Jonghyun Choi
arxiv.org/abs/2510.06855 arxiv.o…

@arXiv_csCV_bot@mastoxiv.page
2025-10-09 10:40:51

MoRe: Monocular Geometry Refinement via Graph Optimization for Cross-View Consistency
Dongki Jung, Jaehoon Choi, Yonghan Lee, Sungmin Eum, Heesung Kwon, Dinesh Manocha
arxiv.org/abs/2510.07119

@arXiv_csRO_bot@mastoxiv.page
2025-10-13 10:09:00

FOGMACHINE -- Leveraging Discrete-Event Simulation and Scene Graphs for Modeling Hierarchical, Interconnected Environments under Partial Observations from Mobile Agents
Lars Ohnemus, Nils Hantke, Max Weißer, Kai Furmans
arxiv.org/abs/2510.09483

@seeingwithsound@mas.to
2025-10-03 15:58:20

Seeing like Meta: Smart glasses and the ethics of augmented reality #AR

@arXiv_csCV_bot@mastoxiv.page
2025-10-10 11:20:19

SciVideoBench: Benchmarking Scientific Video Reasoning in Large Multimodal Models
Andong Deng, Taojiannan Yang, Shoubin Yu, Lincoln Spencer, Mohit Bansal, Chen Chen, Serena Yeung-Levy, Xiaohan Wang
arxiv.org/abs/2510.08559

@arXiv_csRO_bot@mastoxiv.page
2025-10-10 09:43:19

IntentionVLA: Generalizable and Efficient Embodied Intention Reasoning for Human-Robot Interaction
Yandu Chen, Kefan Gu, Yuqing Wen, Yucheng Zhao, Tiancai Wang, Liqiang Nie
arxiv.org/abs/2510.07778

@arXiv_csCV_bot@mastoxiv.page
2025-10-10 11:17:49

SpatialLadder: Progressive Training for Spatial Reasoning in Vision-Language Models
Hongxing Li, Dingming Li, Zixuan Wang, Yuchen Yan, Hang Wu, Wenqi Zhang, Yongliang Shen, Weiming Lu, Jun Xiao, Yueting Zhuang
arxiv.org/abs/2510.08531

@seeingwithsound@mas.to
2025-12-04 07:13:41

Electrotactile characteristics of rectified random noise stimulation (r-tRNS) on the forehead: a comparison with tDCS ieeexplore.ieee.org/document/1 "r-tRNS has the potential to be effectively utilized in applications requiring electrotactile perception, such as sensor…

@arXiv_csRO_bot@mastoxiv.page
2025-10-10 10:09:49

NavSpace: How Navigation Agents Follow Spatial Intelligence Instructions
Haolin Yang, Yuxing Long, Zhuoyuan Yu, Zihan Yang, Minghan Wang, Jiapeng Xu, Yihan Wang, Ziyan Yu, Wenzhe Cai, Lei Kang, Hao Dong
arxiv.org/abs/2510.08173

@arXiv_csCV_bot@mastoxiv.page
2025-10-09 10:21:01

Semantic Segmentation Algorithm Based on Light Field and LiDAR Fusion
Jie Luo, Yuxuan Jiang, Xin Jin, Mingyu Liu, Yihui Fan
arxiv.org/abs/2510.06687

@arXiv_csRO_bot@mastoxiv.page
2025-10-09 08:35:51

Active Next-Best-View Optimization for Risk-Averse Path Planning
Amirhossein Mollaei Khass, Guangyi Liu, Vivek Pandey, Wen Jiang, Boshu Lei, Kostas Daniilidis, Nader Motee
arxiv.org/abs/2510.06481

@arXiv_csCV_bot@mastoxiv.page
2025-10-07 12:38:32

Visual Representations inside the Language Model
Benlin Liu, Amita Kamath, Madeleine Grunde-McLaughlin, Winson Han, Ranjay Krishna
arxiv.org/abs/2510.04819

@arXiv_csCV_bot@mastoxiv.page
2025-10-09 10:43:01

Are We Using the Right Benchmark: An Evaluation Framework for Visual Token Compression Methods
Chenfei Liao, Wensong Wang, Zichen Wen, Xu Zheng, Yiyu Wang, Haocong He, Yuanhuiyi Lyu, Lutao Jiang, Xin Zou, Yuqian Fu, Bin Ren, Linfeng Zhang, Xuming Hu
arxiv.org/abs/2510.07143

@arXiv_csCV_bot@mastoxiv.page
2025-10-09 10:23:11

DeRainMamba: A Frequency-Aware State Space Model with Detail Enhancement for Image Deraining
Zhiliang Zhu, Tao Zeng, Tao Yang, Guoliang Luo, Jiyong Zeng
arxiv.org/abs/2510.06746

@arXiv_csCV_bot@mastoxiv.page
2025-10-07 12:45:32

Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models
Yunlong Tang, Jing Bi, Pinxin Liu, Zhenyu Pan, Zhangyun Tan, Qianxiang Shen, Jiani Liu, Hang Hua, Junjia Guo, Yunzhong Xiao, Chao Huang, Zhiyuan Wang, Susan Liang, Xinyi Liu, Yizhi Song, Yuhe Nie, Jia-Xing Zhong, Bozheng Li, Daiqing Qi, Ziyun Zeng, Ali Vosoughi, Luchuan Song, Zeliang Zhang, Daiki Shimada, Han Liu, Jiebo Luo, Chenliang Xu