
2025-08-13 14:04:15
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[5/5]:
- Digital and Robotic Twinning for Validation of Proximity Operations and Formation Flying
Golan, Zin, Ahmed, Bates, Bell, Huc, Low, Bosse, D'Amico
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[5/5]:
- Digital and Robotic Twinning for Validation of Proximity Operations and Formation Flying
Golan, Zin, Ahmed, Bates, Bell, Huc, Low, Bosse, D'Amico
It's ironic that so many people are blinded by their own pattern recognition abilities that were billions of years in the making, misleading them into thinking LLM pattern generators have human-like intelligence
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/5]:
- Cut2Next: Generating Next Shot via In-Context Tuning
Jingwen He, Hongbo Liu, Jiajun Li, Ziqi Huang, Yu Qiao, Wanli Ouyang, Ziwei Liu
Understanding Human Limits in Pattern Recognition: A Computational Model of Sequential Reasoning in Rock, Paper, Scissors
Logan Cross, Erik Brockbank, Tobias Gerstenberg, Judith E. Fan, Daniel L. K. Yamins, Nick Haber
https://arxiv.org/abs/2508.06503
Federated Quantum Kernel-Based Long Short-term Memory for Human Activity Recognition
Yu-Chao Hsu, Jiun-Cheng Jiang, Chun-Hua Lin, Wei-Ting Chen, Kuo-Chung Peng, Prayag Tiwari, Samuel Yen-Chi Chen, En-Jui Kuo
https://arxiv.org/abs/2508.06078
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/5]:
- MUG: Pseudo Labeling Augmented Audio-Visual Mamba Network for Audio-Visual Video Parsing
Langyu Wang, Bingke Zhu, Yingying Chen, Yiyuan Zhang, Ming Tang, Jinqiao Wang
Solar Flare Prediction Using LSTM and DLSTM with Sliding Window Pattern Recognition
Zeinab Hassani, Davud Mohammadpur, Hossein Safari
https://arxiv.org/abs/2507.05313
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/5]:
- Achieving More with Less: Additive Prompt Tuning for Rehearsal-Free Class-Incremental Learning
Haoran Chen, Ping Wang, Zihan Zhou, Xu Zhang, Zuxuan Wu, Yu-Gang Jiang
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/5]:
- Efficient Annotation of Medieval Charters
Anguelos Nicolaou, Daniel Luger, Franziska Decker, Nicolas Renet, Vincent Christlein, Georg Vogeler
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[5/5]:
- Emotion-Qwen: A Unified Framework for Emotion and Vision Understanding
Huang, Li, Yan, Cheng, Han, Huang, Li, Li, Wang, Lian, Cheng, Peng
Talking with Oompa Loompas: A novel framework for evaluating linguistic acquisition of LLM agents
Sankalp Tattwadarshi Swain, Anshika Krishnatray, Dhruv Kumar, Jagat Sesh Challa
https://arxiv.org/abs/2509.07389
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/5]:
- MoCA: Identity-Preserving Text-to-Video Generation via Mixture of Cross Attention
Qi Xie, Yongjia Ma, Donglin Di, Xuehao Gao, Xun Yang
Single-Shot Multispectral Encoding: Advancing Optical Lithography for Encryption and Spectroscopy
Hyewon Shim, Geonwoong Park, Hyunsuk Yun, Sunmin Ryu, Yong-Young Noh, Cheol-Joo Kim
https://arxiv.org/abs/2508.05645
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/5]:
- SpaCE-10: A Comprehensive Benchmark for Multimodal Large Language Models in Compositional Spatial...
Ziyang Gong, Wenhao Li, Oliver Ma, Songyuan Li, Jiayi Ji, Xue Yang, Gen Luo, Junchi Yan, Rongrong Ji
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/5]:
- UltraRay: Introducing Full-Path Ray Tracing in Physics-Based Ultrasound Simulation
Felix Duelmer, Mohammad Farid Azampour, Magdalena Wysocki, Nassir Navab
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/5]:
- Prompt-aligned Gradient for Prompt Tuning
Beier Zhu, Yulei Niu, Yucheng Han, Yue Wu, Hanwang Zhang
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/3]:
- Objectomaly: Objectness-Aware Refinement for OoD Segmentation with Structural Consistency and Bou...
Jeonghoon Song, Sunghun Kim, Jaegyun Im, Byeongjoon Noh
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/3]:
- InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction
Yuhui Wu, Liyi Chen, Ruibin Li, Shihao Wang, Chenxi Xie, Lei Zhang
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/3]:
- SDR-GAIN: A High Real-Time Occluded Pedestrian Pose Completion Method for Autonomous Driving
Honghao Fu, Yongli Gu, Yidong Yan, Yilang Shen, Yiwen Wu, Libo Sun
[2025-08-14 Thu (UTC), 122 new articles found for cs.CV Computer Vision and Pattern Recognition]
toXiv_bot_toot
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/3]:
- MESH -- Understanding Videos Like Human: Measuring Hallucinations in Large Video Models
Yang, Chen, Wong, Lei, Chen, Li, Zhou, Cheng
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/3]:
- GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving
Chi Wan, Yixin Cui, Jiatong Du, Shuo Yang, Yulong Bai, Peng Yi, Nan Li, Yanjun Huang
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/3]:
- Total Disentanglement of Font Images into Style and Character Class Features
Daichi Haraguchi, Wataru Shimoda, Kota Yamaguchi, Seiichi Uchida
Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/1]:
- DeepTV: A neural network approach for total variation minimization
Andreas Langer, Sara Behnamian
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[9/9]:
- SOPHY: Learning to Generate Simulation-Ready Objects with Physical Materials
Junyi Cao, Evangelos Kalogerakis
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[8/9]:
- FQGA-single: Towards Fewer Training Epochs and Fewer Model Parameters for Image-to-Image Translat...
Cho Yang
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[7/9]:
- Learning Phonetic Context-Dependent Viseme for Enhancing Speech-Driven 3D Facial Animation
Hyung Kyu Kim, Hak Gu Kim
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[6/9]:
- UltraAD: Fine-Grained Ultrasound Anomaly Classification via Few-Shot CLIP Adaptation
Yue Zhou, Yuan Bi, Wenjuan Tong, Wei Wang, Nassir Navab, Zhongliang Jiang
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[5/9]:
- 3D Gaussian Splatting Data Compression with Mixture of Priors
Lei Liu, Zhenghao Chen, Dong Xu
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/9]:
- ROODI: Reconstructing Occluded Objects with Denoising Inpainters
Yeonjin Chang, Erqun Dong, Seunghyeon Seo, Nojun Kwak, Kwang Moo Yi
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/9]:
- TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction
Zhang, Liu, Li, Zhang, Liu, Wang, Ouyang, Xiong, Gao, Hou, Cheng
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/9]:
- Prompt-Softbox-Prompt: A Free-Text Embedding Control for Image Editing
Yitong Yang, Yinglin Wang, Tian Zhang, Jing Wang, Shuting He
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/9]:
- On Representation Learning with Feedback
Hao Li
https://arxiv…
[2025-08-12 Tue (UTC), 235 new articles found for cs.CV Computer Vision and Pattern Recognition]
toXiv_bot_toot
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/5]:
- A Challenging Benchmark of Anime Style Recognition
Haotang Li, Shengtao Guo, Kailin Lyu, Xiao Yang, Tianchen Chen, Jianqing Zhu, Huanqiang Zeng
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/3]:
- Nearest Neighbor Projection Removal Adversarial Training
Himanshu Singh, A. V. Subramanyam, Shivank Rajput, Mohan Kankanhalli
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/3]:
- TransitReID: Transit OD Data Collection with Occlusion-Resistant Dynamic Passenger Re-Identification
Kaicong Huang, Talha Azfar, Jack Reilly, Ruimin Ke
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/3]:
- Maximizing Information in Domain-Invariant Representation Improves Transfer Learning
Adrian Shuai Li, Elisa Bertino, Xuan-Hong Dang, Ankush Singla, Yuhai Tu, Mark N Wegman
Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/1]:
- Revisiting Deepfake Detection: Chronological Continual Learning and the Limits of Generalization
Fontana, Diko, Lanzino, Marini, Kaddar, Foresti, Cinque
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/4]:
- CARE: Enhancing Safety of Visual Navigation through Collision Avoidance via Repulsive Estimation
Joonkyung Kim, Joonyeol Sim, Woojun Kim, Katia Sycara, Changjoo Nam
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/4]:
- Can Large Pretrained Depth Estimation Models Help With Image Dehazing?
Hongfei Zhang, Kun Zhou, Ruizheng Wu, Jiangbo Lu
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/4]:
- M$^2$IV: Towards Efficient and Fine-grained Multimodal In-Context Learning via Representation Eng...
Yanshu Li, Yi Cao, Hongyang He, Qisen Cheng, Xiang Fu, Xi Xiao, Tianyang Wang, Ruixiang Tang
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/4]:
- Improved DDIM Sampling with Moment Matching Gaussian Mixtures
Prasad Gabbur
https:/…
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/4]:
- Attention-Enhanced Deep Learning Ensemble for Breast Density Classification in Mammography
Peyman Sharifian, Xiaotong Hong, Alireza Karimian, Mehdi Amini, Hossein Arabi
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/4]:
- Adaptation of Multi-modal Representation Models for Multi-task Surgical Computer Vision
Soham Walimbe, Britty Baby, Vinkle Srivastav, Nicolas Padoy
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/4]:
- Dance Like a Chicken: Low-Rank Stylization for Human Motion Diffusion
Haim Sawdayee, Chuan Guo, Guy Tevet, Bing Zhou, Jian Wang, Amit H. Bermano
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/4]:
- Boundary Learning by Using Weighted Propagation in Convolution Network
Wei Liu, Jiahao Chen, Chuni Liu, Xiaojuan Ban, Boyuan Ma, Hao Wang, Weihua Xue, Yu Guo
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/5]:
- Representation-Centric Survey of Skeletal Action Recognition and the ANUBIS Benchmark
Liu, Yang, Perera, Ji, Kim, Xu, Wang, Anwar, Gedeon, Wang, Qin
[2025-09-11 Thu (UTC), 59 new articles found for cs.CV Computer Vision and Pattern Recognition]
toXiv_bot_toot
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[5/5]:
- IntuiTF: MLLM-Guided Transfer Function Optimization for Direct Volume Rendering
Wang, Pan, Wang, Liu, Mao, Liu, Zhu, Huang, Chen, Zhang, Chen
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/5]:
- BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models
Yuming Li, Yikai Wang, Yuying Zhu, Zhongyu Zhao, Ming Lu, Qi She, Shanghang Zhang
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/5]:
- Conditional Video Generation for High-Efficiency Video Compression
Fangqiu Yi, Jingyu Xu, Jiawei Shao, Chi Zhang, Xuelong Li
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/5]:
- Large-scale Pre-training for Grounded Video Caption Generation
Evangelos Kazakos, Cordelia Schmid, Josef Sivic
Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/1]:
- Benchmarking Vision Transformers and CNNs for Thermal Photovoltaic Fault Detection with Explainab...
Serra Aksoy
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/4]:
- Tora2: Motion and Appearance Customized Diffusion Transformer for Multi-Entity Video Generation
Zhenghao Zhang, Junchao Liao, Xiangyu Meng, Long Qin, Weizhi Wang
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/4]:
- ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
Bai, Xia, Fu, Wang, Mu, Cao, Liu, Hu, Bai, Wan, Zhang
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/4]:
- Hespi: A pipeline for automatically detecting information from hebarium specimen sheets
Robert Turnbull, Emily Fitzgerald, Karen Thompson, Joanne L. Birch
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/4]:
- Bayesian Multi-Scale Neural Network for Crowd Counting
Abhinav Sagar
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/3]:
- Representation-Centric Survey of Skeletal Action Recognition and the ANUBIS Benchmark
Liu, Yang, Perera, Ji, Kim, Xu, Wang, Anwar, Gedeon, Wang, Qin
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/5]:
- EgoPrompt: Prompt Learning for Egocentric Action Recognition
Huaihai Lyu, Chaofan Chen, Yuheng Ji, Changsheng Xu
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[5/5]:
- Content Generation Models in Computational Pathology: A Comprehensive Survey on Methods, Applicat...
Yuan Zhang, Xinfeng Zhang, Xiaoming Qi, Xinyu Wu, Feng Chen, Guanyu Yang, Huazhu Fu
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/5]:
- ER-LoRA: Effective-Rank Guided Adaptation for Weather-Generalized Depth Estimation
Weilong Yan, Xin Zhang, Robby T. Tan
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/5]:
- CoreMark: Toward Robust and Universal Text Watermarking Technique
Jiale Meng, Yiming Li, Zheming Lu, Zewei He, Hao Luo, Tianwei Zhang
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/5]:
- CHIRLA: Comprehensive High-resolution Identification and Re-identification for Large-scale Analysis
Bessie Dominguez-Dager, Felix Escalona, Francisco Gomez-Donoso, Miguel Cazorla
Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/2]:
- Signal-Based Malware Classification Using 1D CNNs
Jack Wilkie, Hanan Hindy, Ivan Andonovic, Christos Tachtatzis, Robert Atkinson
Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/2]:
- LocoMamba: Vision-Driven Locomotion via End-to-End Deep Reinforcement Learning with Mamba
Yinuo Wang, Gavin Tao
[2025-07-10 Thu (UTC), 84 new articles found for cs.CV Computer Vision and Pattern Recognition]
toXiv_bot_toot
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[10/10]:
- Judging the Judges: Can Large Vision-Language Models Fairly Evaluate Chart Comprehension and Reas...
Laskar, Islam, Mahbub, Masry, Rahman, Bhuiyan, Nayeem, Joty, Hoque, Huang
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[9/10]:
- A Novel Automatic Real-time Motion Tracking Method in MRI-guided Radiotherapy Using Enhanced Trac...
Chen, Wang, Dai, Qin, Cao, Zhao, Chen, Wu, Tang
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[8/10]:
- AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detection via Multimodal La...
Zhou, Luo, Wu, Sun, Ji, Yan, Ding, Sun, Wu, Ji
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[7/10]:
- Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs
Shaojie Zhang, Jiahui Yang, Jianqin Yin, Zhenbo Luo, Jian Luan
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[6/10]:
- AniCrafter: Customizing Realistic Human-Centric Animation via Avatar-Background Conditioning in V...
Niu, Cao, Zhan, Zhu, Ma, Zhao, Zeng, Zhong, Sun, Zheng
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[5/10]:
- 4D mmWave Radar for Sensing Enhancement in Adverse Environments: Advances and Challenges
Xiangyuan Peng, Miao Tang, Huawei Sun, Kay Bierzynski, Lorenzo Servadei, Robert Wille
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/10]:
- MORPH-LER: Log-Euclidean Regularization for Population-Aware Image Registration
Mokshagna Sai Teja Karanam, Krithika Iyer, Sarang Joshi, Shireen Elhabian
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/10]:
- Dynamic EventNeRF: Reconstructing General Dynamic Scenes from Multi-view RGB and Event Streams
Viktor Rudnev, Gereon Fox, Mohamed Elgharib, Christian Theobalt, Vladislav Golyanik
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/10]:
- Deep Transformer Network for Monocular Pose Estimation of Shipborne Unmanned Aerial Vehicle
Maneesha Wickramasuriya, Taeyoung Lee, Murray Snyder
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/10]:
- AASeg: Attention Aware Network for Real Time Semantic Segmentation
Abhinav Sagar
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[5/5]:
- Multimodal Integration Challenges in Emotionally Expressive Child Avatars for Training Applications
Pegah Salehi, Sajad Amouei Sheshkal, Vajira Thambawita, Michael A. Riegler, P{\aa}l Halvorsen
…
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/5]:
- From Video to EEG: Adapting Joint Embedding Predictive Architecture to Uncover Visual Concepts in...
Amirabbas Hojjati, Lu Li, Ibrahim Hameed, Anis Yazidi, Pedro G. Lind, Rabindra Khadka
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/5]:
- GC-GAT: Multimodal Vehicular Trajectory Prediction using Graph Goal Conditioning and Cross-contex...
Mahir Gulzar, Yar Muhammad, Naveed Muhammad
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/5]:
- Enhancing Visual Re-ranking through Denoising Nearest Neighbor Graph via Continuous CRF
Jaeyoon Kim, Yoonki Cho, Taeyoung Kim, Sung-Eui Yoon
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/5]:
- UVOSAM: A Mask-free Paradigm for Unsupervised Video Object Segmentation via Segment Anything Model
Zhenghao Zhang, Shengfan Zhang, Zhichao Wei, Zuozhuo Dai, Siyu Zhu
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[5/6]:
- Live Demonstration: Neuromorphic Radar for Gesture Recognition
Satyapreet Singh Yadav, Akash K S, Chandra Sekhar Seelamantula, Chetan Singh Thakur
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/3]:
- AnomalyLMM: Bridging Generative Knowledge and Discriminative Retrieval for Text-Based Person Anom...
Hao Ju, Hu Zhang, Zhedong Zheng
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/3]:
- Auto-Connect: Connectivity-Preserving RigFormer with Direct Preference Optimization
Guo, Liu, Chen, Mao, Hu, Jiang, Yu, Xu, Liu, Xu, Chen, Guo
Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/1]:
- Sample-efficient Integration of New Modalities into Large Language Models
Osman Batur \.Ince, Andr\'e F. T. Martins, Oisin Mac Aodha, Edoardo M. Ponti
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[5/5]:
- STF: Shallow-Level Temporal Feedback to Enhance Spiking Transformers
Zheng, Zhu, Yu, Huang, Lv, Tang, Yu, Jin
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/5]:
- CountingFruit: Language-Guided 3D Fruit Counting with Semantic Gaussian Splatting
Fengze Li, Yangle Liu, Jieming Ma, Hai-Ning Liang, Yaochun Shen, Huangxiang Li, Zhijing Wu
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/5]:
- MetaOcc: Spatio-Temporal Fusion of Surround-View 4D Radar and Camera for 3D Occupancy Prediction ...
Yang, Zheng, Ai, Liu, Li, Lin, Yan, Bai, Ma, Huang, Zhu
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/5]:
- A Fast Text-Driven Approach for Generating Artistic Content
Marian Lupascu, Ryan Murdock, Ionut Mironica, Yijun Li
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[6/6]:
- IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks
Xiaoya Lu, Zeren Chen, Xuhao Hu, Yijin Zhou, Weichen Zhang, Dongrui Liu, Lu Sheng, Jing Shao
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/6]:
- EchoMimicV3: 1.3B Parameters are All You Need for Unified Multi-Modal and Multi-Task Human Animation
Rang Meng, Yan Wang, Weipeng Wu, Ruobing Zheng, Yuming Li, Chenguang Ma
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/6]:
- Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs
Gu, Yang, Feng, Wang, Zhang, Long, Chen, Cai, Deng
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/6]:
- CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation
Hui Zhang, Dexiang Hong, Yitong Wang, Jie Shao, Xinglong Wu, Zuxuan Wu, Yu-Gang Jiang
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/6]:
- Hulk: A Universal Knowledge Translator for Human-Centric Tasks
Wang, Wu, He, Guo, Zhu, Bai, Zhao, Wu, He, Ouyang, Tang
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[8/9]:
- Advancing Grounded Multimodal Named Entity Recognition via LLM-Based Reformulation and Box-Based ...
Jinyuan Li, Ziyan Li, Han Li, Jianfei Yu, Rui Xia, Di Sun, Gang Pan
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[6/6]:
- GRILL: Gradient Signal Restoration in Ill-Conditioned Layers to Enhance Adversarial Attacks on Au...
Chethan Krishnamurthy Ramanaik, Arjun Roy, Tobias Callies, Eirini Ntoutsi
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[5/6]:
- Multimodal Referring Segmentation: A Survey
Henghui Ding, Song Tang, Shuting He, Chang Liu, Zuxuan Wu, Yu-Gang Jiang
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/6]:
- BSMamba: Brightness and Semantic Modeling for Long-Range Interaction in Low-Light Image Enhancement
Tongshun Zhang, Pingping Liu, Mengen Cai, Zijian Zhang, Yubing Lu, Qiuzhan Zhou
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/6]:
- Knowledge Distillation for Underwater Feature Extraction and Matching via GAN-synthesized Images
Jinghe Yang, Mingming Gong, Ye Pu