2025-07-25 09:47:12
MeloKids: Multisensory VR System to Enhance Speech and Motor Coordination in Children with Hearing Loss
Yichen Yu, Qiaoran Wang
https://arxiv.org/abs/2507.18619
Implementing Zero Trust Architecture to Enhance Security and Resilience in the Pharmaceutical Supply Chain
Saeid Ghasemshirazi, Ghazaleh Shirvani, Marziye Ranjbar Tavakoli, Bahar Ghaedi, Mohammad Amin Langarizadeh
https://arxiv.org/abs/2508.15776
A Modular Residual Learning Framework to Enhance Model-Based Approach for Robust Locomotion
Min-Gyu Kim, Dongyun Kang, Hajun Kim, Hae-Won Park
https://arxiv.org/abs/2507.18138 h…
Healthy European peatlands require specific temperature and water level parameters #Europe
Micro-variations in timing and loudness affect music-evoked mental imagery https://www.nature.com/articles/s41598-025-12604-4 "repetitive quasi-isochronous drumming enhanced mental imagery vividness, with a stronger effect observed when the drumming contained random micro-variat…
Synthesizing Artifact Dataset for Pixel-level Detection
Dennis Menn, Feng Liang, Diana Marculescu
https://arxiv.org/abs/2509.19589 https://arxiv.org/pdf/25…
SMARTAPS: Tool-augmented LLMs for Operations Management
Timothy Tin Long Yu, Mahdi Mostajabdaveh, Jabo Serge Byusa, Rindra Ramamonjison, Giuseppe Carenini, Kun Mao, Zirui Zhou, Yong Zhang
https://arxiv.org/abs/2507.17927
MuST2-Learn: Multi-view Spatial-Temporal-Type Learning for Heterogeneous Municipal Service Time Estimation
Nadia Asif, Zhiqing Hong, Shaogang Ren, Xiaonan Zhang, Xiaojun Shang, Yukun Yuan
https://arxiv.org/abs/2508.16503
Extractive Fact Decomposition for Interpretable Natural Language Inference in one Forward Pass
Nicholas Popović, Michael Färber
https://arxiv.org/abs/2509.18901 https…
Assertion Messages with Large Language Models (LLMs) for Code
Ahmed Aljohani, Anamul Haque Mollah, Hyunsook Do
https://arxiv.org/abs/2509.19673 https://arx…
Above 99.9% Fidelity Single-Qubit Gates, Two-Qubit Gates, and Readout in a Single Superconducting Quantum Device
Fabian Marxer, Jakub Mrożek, Joona Andersson, Leonid Abdurakhimov, Janos Adam, Ville Bergholm, Rohit Beriwal, Chun Fai Chan, Saga Dahl, Soumya Ranjan Das, Frank Deppe, Olexiy Fedorets, Zheming Gao, Alejandro Gomez Frieiro, Daria Gusenkova, Andrew Guthrie, Tuukka Hiltunen, Hao Hsu, Eric Hyyppä, Joni Ikonen, Sinan Inel, Shan W. Jolin, Azad Karis, Seung-Goo Kim, Willia…
C-Koordinator: Interference-aware Management for Large-scale and Co-located Microservice Clusters
Shengye Song, Minxian Xu, Zuowei Zhang, Chengxi Gao, Fansong Zeng, Yu Ding, Kejiang Ye, Chengzhong Xu
https://arxiv.org/abs/2507.18005
Action-List Reinforcement Learning Syndrome Decoding for Binary Linear Block Codes
Milad Taghipour, Bane Vasic
https://arxiv.org/abs/2507.17893 https://arx…
U-Net Based Healthy 3D Brain Tissue Inpainting
Juexin Zhang, Ying Weng, Ke Chen
https://arxiv.org/abs/2507.18126 https://arxiv.org/pdf/2507.18126
Improving Disease Risk Estimation in Small Areas by Accounting for Spatiotemporal Local Discontinuities
Santafé, G., Adin, A., Ugarte, M. L
https://arxiv.org/abs/2509.19889
A Secure Affine Frequency Division Multiplexing for Wireless Communication Systems
Ping Wang, Zulin Wang, Yuanfang Ma, Xiaosi Tian, Yuanhan Ni
https://arxiv.org/abs/2509.18555 h…
Sentiment-Aware Mean-Variance Portfolio Optimization for Cryptocurrencies
Qizhao Chen
https://arxiv.org/abs/2508.16378 https://arxiv.org/pdf/2508.16378
Safe Reinforcement Learning-based Automatic Generation Control
Amr S. Mohamed, Emily Nguyen, Deepa Kundur
https://arxiv.org/abs/2507.17868 https://arxiv.or…
The TEA-ASLP System for Multilingual Conversational Speech Recognition and Speech Diarization in MLC-SLM 2025 Challenge
Hongfei Xue, Kaixun Huang, Zhikai Zhou, Shen Huang, Shidong Shang
https://arxiv.org/abs/2507.18051
"Anyone can adapt this model to explore the potential impact of religion, spirituality and health, including devotional practices."
—Benjamin Doolittle '94 M.Div., '97 M.D., Director of the Yale Program for Medicine, Spirituality, and Religion, speaking about a new research model to enhance investigation conducted at the intersection of religion and medicine
Effects of galactic environment on accretion dynamics onto a rotating centrally located black hole and on emergent analogue gravity
Ripon Sk, Sangita Chatterjee, Sankhasubhra Nag
https://arxiv.org/abs/2509.18833
A Divergence-free Preserving Mixed Finite Element Method for Thermally Driven Active Fluid Model
Nan Zheng, Qingguang Guan, Wenlong Pei, Wenju Zhao
https://arxiv.org/abs/2509.19053
Phase Stability and Superconductivity in Hydrogenated and Lithiated Janus GaXS2 (X = Ga, In) Monolayers
Jakkapat Seeyangnok, Udomsilp Pinsook
https://arxiv.org/abs/2509.19922 ht…
Improving Outdoor Multi-cell Fingerprinting-based Positioning via Mobile Data Augmentation
Tony Chahoud, Lorenzo Mario Amorosa, Riccardo Marini, Luca De Nardis
https://arxiv.org/abs/2509.19405
Molecular gas in a system of two interacting galaxies overlapping on the line-of-sight
Anaëlle Halle, Barbara Mazzilli Ciraulo, Daniel Maschmann, Anne-Laure Melchior, Françoise Combes
https://arxiv.org/abs/2507.18355
StrCGAN: A Generative Framework for Stellar Image Restoration
Shantanusinh Parmar
https://arxiv.org/abs/2509.19805 https://arxiv.org/pdf/2509.19805
Climate-Adaptive and Cascade-Constrained Machine Learning Prediction for Sea Surface Height under Greenhouse Warming
Tianmu Zheng, Ru Chen, Xin Su, Gang Huang, Bingzheng Yan
https://arxiv.org/abs/2509.18741
MultiSoundGen: Video-to-Audio Generation for Multi-Event Scenarios via SlowFast Contrastive Audio-Visual Pretraining and Direct Preference Optimization
Jianxuan Yang, Xiaoran Yang, Lipan Zhang, Xinyue Guo, Zhao Wang, Gongping Huang
https://arxiv.org/abs/2509.19999
Hey #dotnet folks and #security wonks, join our #livestream today to learn about FAPI 2.0 and how to enhance security at your organization with the latest specification.
Also, drop in and say h…
Microsoft and the NFL extend their partnership to bring real-time game data and analysis to coaches and players using Microsoft Copilot and Azure AI (Ali McCadden/CNBC)
https://www.cnbc.com/2025/08/20/microsoft-nfl-ai-analysis.html
Retrieval Enhanced Feedback via In-context Neural Error-book
Jongyeop Hyun, Bumsoo Kim
https://arxiv.org/abs/2508.16313 https://arxiv.org/pdf/2508.16313
Orcust: Stepwise-Feedback Reinforcement Learning for GUI Agent
Junyu Lu, Songxin Zhang, Zejian Xie, Zhuoyang Song, Jiaxing Zhang
https://arxiv.org/abs/2509.17917 https://…
A Biomimetic Vertebraic Soft Robotic Tail for High-Speed, High-Force Dynamic Maneuvering
Sicong Liu, Jianhui Liu, Fang Chen, Wenjian Yang, Juan Yi, Yu Zheng, Zheng Wang, Wanchao Chi, Chaoyang Song
https://arxiv.org/abs/2509.20219
SIM-CoT: Supervised Implicit Chain-of-Thought
Xilin Wei, Xiaoran Liu, Yuhang Zang, Xiaoyi Dong, Yuhang Cao, Jiaqi Wang, Xipeng Qiu, Dahua Lin
https://arxiv.org/abs/2509.20317 ht…
Self-Disguise Attack: Induce the LLM to disguise itself for AIGT detection evasion
Yinghan Zhou, Juan Wen, Wanli Peng, Zhengxian Wu, Ziwei Zhang, Yiming Xue
https://arxiv.org/abs/2508.15848
Efficient and optimal quantum state discrimination via quantum belief propagation
Christophe Piveteau, Joseph M. Renes
https://arxiv.org/abs/2509.19441
Factors Impacting Faculty Adoption of Project-Based Learning in Computing Education: a Survey
Ahmad D. Suleiman, Yiming Tang, Daqing Hou
https://arxiv.org/abs/2507.18039 https:/…
SCORE: Scaling audio generation using Standardized COmposite REwards
Jaemin Jung, Jaehun Kim, Inkyu Shin, Joon Son Chung
https://arxiv.org/abs/2509.19831
Comparative Analysis of STEM and non-STEM Teachers' Needs for Integrating AI into Educational Environments
Bahare Riahi, Veronica Catete
https://arxiv.org/abs/2509.16276 htt…
Multimodal Behavioral Patterns Analysis with Eye-Tracking and LLM-Based Reasoning
Dongyang Guo, Yasmeen Abdrabou, Enkeleda Thaqi, Enkelejda Kasneci
https://arxiv.org/abs/2507.18252
Optimizing Edge Gaming Slices through an Enhanced User Plane Function and Analytics in Beyond-5G Networks
Bruno Marques da Silva, Larissa Ferreira Rodrigues Moreira, Flávio de Oliveira Silva, Rodrigo Moreira
https://arxiv.org/abs/2507.17843
Frequency-Aware Ensemble Learning for BraTS 2025 Pediatric Brain Tumor Segmentation
Yuxiao Yi, Qingyao Zhuang, Zhi-Qin John Xu
https://arxiv.org/abs/2509.19353
Optimal Sizing of Community Photovoltaic and Battery Energy Storage Systems with Second-Life Batteries in Peer-to-Peer Energy Communities
Júlia Monar, Fernando García-Muñoz, Natalia Jorquera Bravo, Joaquín Aballay Araya, Vicente Castro Burgos
https://arxiv.org/abs/2509.18082
Agentic AI for Low-Altitude Semantic Wireless Networks: An Energy Efficient Design
Zhouxiang Zhao, Ran Yi, Yihan Cang, Boyang Jin, Zhaohui Yang, Mingzhe Chen, Chongwen Huang, Zhaoyang Zhang
https://arxiv.org/abs/2509.19791
Regression approaches for modelling genotype-environment interaction and making predictions into unseen environments
Maksym Hrachov, Hans-Peter Piepho, Niaz Md. Farhat Rahman, Waqas Ahmed Malik
https://arxiv.org/abs/2507.18125
FROQ: Observing Face Recognition Models for Efficient Quality Assessment
Žiga Babnik, Deepak Kumar Jain, Peter Peer, Vitomir Štruc
https://arxiv.org/abs/2509.17689 https…
Meson width predictions and symmetry emergence within the deep neural network
Xin Tong, Wei Feng, Weiwei Xu, Chao-Hsi Chang, Guo-Li Wang, Qiang Li
https://arxiv.org/abs/2509.17093
Compositional System Dynamics: The Higher Mathematics Underlying System Dynamics Diagrams & Practice
Xiaoyan Li, Evan Patterson, Patricia L. Mabry, Nathaniel D. Osgood
https://arxiv.org/abs/2509.18475
Crosslisted article(s) found for cs.CE. https://arxiv.org/list/cs.CE/new
[1/1]:
- Implementing Zero Trust Architecture to Enhance Security and Resilience in the Pharmaceutical Sup...
Ghasemshirazi, Shirvani, Tavakoli, Ghaedi, Langarizadeh
SpellerSSL: Self-Supervised Learning with P300 Aggregation for Speller BCIs
Jiazhen Hong, Geoff Mackellar, Soheila Ghane
https://arxiv.org/abs/2509.19401
QvTAD: Differential Relative Attribute Learning for Voice Timbre Attribute Detection
Zhiyu Wu, Jingyi Fang, Yufei Tang, Yuanzhong Zheng, Yaoxuan Wang, Haojun Fei
https://arxiv.org/abs/2508.15931
A HyperGraphMamba-Based Multichannel Adaptive Model for ncRNA Classification
Xin An, Ruijie Li, Qiao Ning, Hui Li, Qian Ma, Shikai Guo
https://arxiv.org/abs/2509.20240
How European, national and local policies enhance each other’s effectiveness. And how cities make EU policy work for business and society.
https://www.linkedin.com/posts/jaapburger_how-european-national-and-local-policies-a…
Low-Resource English-Tigrinya MT: Leveraging Multilingual Models, Custom Tokenizers, and Clean Evaluation Benchmarks
Hailay Kidu Teklehaymanot, Gebrearegawi Gidey, Wolfgang Nejdl
https://arxiv.org/abs/2509.20209
Why we need all the organisms: an exploration of the Monarch knowledge graph to aid mechanism discovery
Katherina Cortes, Daniel Korn, Sarah Gehrke, Kevin Schaper, Corey Cox, Patrick Golden, Aaron Odell, Bryan Laraway, Madan Krishnamurthy, Justin Reese, Harry Caufield, Sierra Moxon, Ellen Elias, Christopher J Mungall, Melissa Haendel
CyberSOCEval: Benchmarking LLMs Capabilities for Malware Analysis and Threat Intelligence Reasoning
Lauren Deason, Adam Bali, Ciprian Bejean, Diana Bolocan, James Crnkovich, Ioana Croitoru, Krishna Durai, Chase Midler, Calin Miron, David Molnar, Brad Moon, Bruno Ostarcevic, Alberto Peltea, Matt Rosenberg, Catalin Sandu, Arthur Saputkin, Sagar Shah, Daniel Stan, Ernest Szocs, Shengye Wan, Spencer Whitman, Sven Krasser, Joshua Saxe
Development of a Model Order Reduced Arbitrary Lagrangian Eulerian (MORALE) formulation for structures subjected to dynamic moving loads
Atul Anantheswar, Jannick Kehls, Ines Wollny, Tim Brepols, Stefanie Reese, Michael Kaliske
https://arxiv.org/abs/2509.20069
Tuning chiral anomaly signature in a Dirac semimetal via fast-ion implantation
Manasi Mandal, Eunbi Rha, Abhijatmedhi Chotrattanapituk, Denisse Córdova Carrizales, Alexander Lygo, Kevin B. Woller, Mouyang Cheng, Ryotaro Okabe, Guomin Zhu, Kiran Mak, Chu-Liang Fu, Chuhang Liu, Lijun Wu, Yimei Zhu, Susanne Stemmer, Mingda Li
https://ar…
A Deep Dive into Retrieval-Augmented Generation for Code Completion: Experience on WeChat
Zezhou Yang, Ting Peng, Cuiyun Gao, Chaozheng Wang, Hailiang Huang, Yuetang Deng
https://arxiv.org/abs/2507.18515
Deep chemical tagging - Identifying open clusters and moving groups in chemical space with graph attention networks
Lorenzo Spina, Milan Quandt Rodriguez, Laura Magrini, Leda Berni, Sara Lucatello, Marco Canducci
https://arxiv.org/abs/2509.18268
ExpFace: Exponential Angular Margin Loss for Deep Face Recognition
Jinhui Zheng, Xueyuan Gong
https://arxiv.org/abs/2509.19753 https://arxiv.org/pdf/2509.1…
BALANCE: Bitrate-Adaptive Limit-Aware Netcast Content Enhancement Utilizing QUBO and Quantum Annealing
Animesh Rajpurohit, Michael Kelley, Wei Wang, Krishna Murthy Kattiyan Ramamoorthy
https://arxiv.org/abs/2509.19616
Deep Learning-based Position-domain Channel Extrapolation for Cell-Free Massive MIMO
Jiajia Guo, Chao-Kai Wen, Xiao Li, Shi Jin
https://arxiv.org/abs/2507.17950
Statistical Inference Leveraging Synthetic Data with Distribution-Free Guarantees
Meshi Bashari, Yonghoon Lee, Roy Maor Lotan, Edgar Dobriban, Yaniv Romano
https://arxiv.org/abs/2509.20345
Reciprocal Beyond-Diagonal Reconfigurable Intelligent Surface (BD-RIS): Scattering Matrix Design via Manifold Optimization
Marko Fidanovski, Iván Alexander Morales Sandoval, Hyeon Seok Rou, Giuseppe Thadeu Freitas de Abreu, Emil Björnson
https://arxiv.org/abs/2509.20246
Group Relative Policy Optimization for Text-to-Speech with Large Language Models
Chang Liu, Ya-Jun Hu, Ying-Ying Gao, Shi-Lei Zhang, Zhen-Hua Ling
https://arxiv.org/abs/2509.18798
CoRaCMG: Contextual Retrieval-Augmented Framework for Commit Message Generation
Bo Xiong, Linghao Zhang, Chong Wang, Peng Liang
https://arxiv.org/abs/2509.18337
Tenure Under Pressure: Simulating the Disruptive Effects of AI on Academic Publishing
Shan Jiang
https://arxiv.org/abs/2509.16925 https://arxiv.org/pdf/250…
PS3: A Multimodal Transformer Integrating Pathology Reports with Histology Images and Biological Pathways for Cancer Survival Prediction
Manahil Raza, Ayesha Azam, Talha Qaiser, Nasir Rajpoot
https://arxiv.org/abs/2509.20022
Does spatialized audio enhance the creation of mental representations? Spoiler: No (for their SnapStick-based setup) https://www.frontiersin.org/journals/neuroscience/articles/10.3389/fnins.2025.1660373/full "seven blind individuals and se…
Table Detection with Active Learning
Somraj Gautam, Nachiketa Purohit, Gaurav Harit
https://arxiv.org/abs/2509.20003 https://arxiv.org/pdf/2509.20003
Extracting Higgs Self-Coupling Constraints through Triple Higgs Boson Production at Future Hadron Colliders
Benjamin Fuks, Andreas Papaefstathiou, Gilberto Tetlalmatzi-Xolocotzi
https://arxiv.org/abs/2509.16364
Enhancing CTAO Monitoring and Alarm Subsystems in Distributed Environments Using ServiMon
Kevin Munari, Alessandro Costa, Federico Incardona, Emilio Mastriani, Sebastiano Spinello, Stefano Germani, Pietro Bruno
https://arxiv.org/abs/2509.16366
Automated Labeling of Intracranial Arteries with Uncertainty Quantification Using Deep Learning
Javier Bisbal, Patrick Winter, Sebastian Jofre, Aaron Ponce, Sameer A. Ansari, Ramez Abdalla, Michael Markl, Oliver Welin Odeback, Sergio Uribe, Cristian Tejos, Julio Sotelo, Susanne Schnell, David Marlevi
https://arxiv.org/abs/2509.17726…
RaFD: Flow-Guided Radar Detection for Robust Autonomous Driving
Shuocheng Yang, Zikun Xu, Jiahao Wang, Shahid Nawaz, Jianqiang Wang, Shaobing Xu
https://arxiv.org/abs/2509.16261
Improving Test-Time Performance of RVQ-based Neural Codecs
Hyeongju Kim, Junhyeok Lee, Jacob Morton, Juheon Lee, Jinhyeok Yang
https://arxiv.org/abs/2509.19186
STAR: Speech-to-Audio Generation via Representation Learning
Zeyu Xie, Xuenan Xu, Yixuan Li, Mengyue Wu, Yuexian Zou
https://arxiv.org/abs/2509.17164
Through the Looking Glass: A Dual Perspective on Weakly-Supervised Few-Shot Segmentation
Jiaqi Ma, Guo-Sen Xie, Fang Zhao, Zechao Li
https://arxiv.org/abs/2508.16159
A LiDAR-Driven Fallback Longitudinal Controller for Safer Following in Sudden Braking Scenarios
Mohamed Sabry, Enrico Del Re, Walter Morales-Alvarez, Cristina Olaverri-Monreal
https://arxiv.org/abs/2509.16642
Triplet Loss Based Quantum Encoding for Class Separability
Marco Mordacci, Mahul Pandey, Paolo Santini, Michele Amoretti
https://arxiv.org/abs/2509.15705
Enhancing Noise Robustness for Neural Speech Codecs through Resource-Efficient Progressive Quantization Perturbation Simulation
Rui-Chen Zheng, Yang Ai, Hui-Peng Du, Zhen-Hua Ling
https://arxiv.org/abs/2509.19025
Enhancing Generative Auto-bidding with Offline Reward Evaluation and Policy Search
Zhiyu Mou, Yiqin Lv, Miao Xu, Cheems Wang, Yixiu Mao, Qichen Ye, Chao Li, Rongquan Bai, Chuan Yu, Jian Xu, Bo Zheng
https://arxiv.org/abs/2509.15927
Hybrid FIM and STAR-BD-RIS-Aided Wireless Communications with Short Packet Length: A Meta-TD3 Approach
Ayla Eftekhari, Maryam Cheraghy, Armin Farhadi, Mohammad Robat Mili, Qingqing Wu
https://arxiv.org/abs/2509.16417
DIO: Refining Mutual Information and Causal Chain to Enhance Machine Abstract Reasoning Ability
Ruizhuo Song, Beiming Yuan
https://arxiv.org/abs/2508.15387
Digging Into the Internal: Causality-Based Analysis of LLM Function Calling
Zhenlan Ji, Daoyuan Wu, Wenxuan Wang, Pingchuan Ma, Shuai Wang, Lei Ma
https://arxiv.org/abs/2509.16268
EcomMMMU: Strategic Utilization of Visuals for Robust Multimodal E-Commerce Models
Xinyi Ling, Hanwen Du, Zhihui Zhu, Xia Ning
https://arxiv.org/abs/2508.15721
On Fast Attitude Filtering Based on Matrix Fisher Distribution with Stability Guarantee
Shijie Wang, Haichao Gui, Rui Zhong
https://arxiv.org/abs/2509.17827
Core-elements Subsampling for Alternating Least Squares
Dunyao Xue, Mengyu Li, Cheng Meng, Jingyi Zhang
https://arxiv.org/abs/2509.18024 https://arxiv.org/…
Survey of Vision-Language-Action Models for Embodied Manipulation
Haoran Li, Yuhui Chen, Wenbo Cui, Weiheng Liu, Kai Liu, Mingcai Zhou, Zhengtao Zhang, Dongbin Zhao
https://arxiv.org/abs/2508.15201
MaskVCT: Masked Voice Codec Transformer for Zero-Shot Voice Conversion With Increased Controllability via Multiple Guidances
Junhyeok Lee, Helin Wang, Yaohan Guan, Thomas Thebaud, Laureano Moro-Velazquez, Jesús Villalba, Najim Dehak
https://arxiv.org/abs/2509.17143
DecipherGuard: Understanding and Deciphering Jailbreak Prompts for a Safer Deployment of Intelligent Software Systems
Rui Yang, Michael Fu, Chakkrit Tantithamthavorn, Chetan Arora, Gunel Gulmammadova, Joey Chua
https://arxiv.org/abs/2509.16870
Tensorized Multi-Task Learning for Personalized Modeling of Heterogeneous Individuals with High-Dimensional Data
Elif Konyar, Mostafa Reisi Gahrooei, Kamran Paynabar
https://arxiv.org/abs/2508.15676
Fusing Spectral Correlation Density Imaging with Deep Learning for Intelligent Fault Diagnosis in Rotating Machinery
Dilshara Herath, Chinthaka Abeyrathne, Chamindu Adithya, Chathura Seneviratne
https://arxiv.org/abs/2509.16580
Prompt-with-Me: in-IDE Structured Prompt Management for LLM-Driven Software Engineering
Ziyou Li, Agnia Sergeyuk, Maliheh Izadi
https://arxiv.org/abs/2509.17096
Fairness for the People, by the People: Minority Collective Action
Omri Ben-Dov, Samira Samadi, Amartya Sanyal, Alexandru Țifrea
https://arxiv.org/abs/2508.15374
See&Trek: Training-Free Spatial Prompting for Multimodal Large Language Model
Pengteng Li, Pinhao Song, Wuyang Li, Weiyu Guo, Huizai Yao, Yijie Xu, Dugang Liu, Hui Xiong
https://arxiv.org/abs/2509.16087
LLM-empowered Dynamic Prompt Routing for Vision-Language Models Tuning under Long-Tailed Distributions
Yongju Jia, Jiarui Ma, Xiangxian Li, Baiqiao Zhang, Xianhui Cao, Juan Liu, Yulong Bian
https://arxiv.org/abs/2508.15688
Waver: Wave Your Way to Lifelike Video Generation
Yifu Zhang, Hao Yang, Yuqi Zhang, Yifei Hu, Fengda Zhu, Chuang Lin, Xiaofeng Mei, Yi Jiang, Zehuan Yuan, Bingyue Peng
https://arxiv.org/abs/2508.15761 …