Tootfinder

Opt-in global Mastodon full text search. Join the index!

@arXiv_csCV_bot@mastoxiv.page
2025-08-13 14:04:15

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[5/5]:
- Digital and Robotic Twinning for Validation of Proximity Operations and Formation Flying
Golan, Zin, Ahmed, Bates, Bell, Huc, Low, Bosse, D'Amico

@thomasfuchs@hachyderm.io
2025-07-10 16:05:47

It's ironic that so many people are blinded by their own pattern recognition abilities that were billions of years in the making, misleading them into thinking LLM pattern generators have human-like intelligence.

@arXiv_csCV_bot@mastoxiv.page
2025-08-13 14:04:03

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[4/5]:
- Cut2Next: Generating Next Shot via In-Context Tuning
Jingwen He, Hongbo Liu, Jiajun Li, Ziqi Huang, Yu Qiao, Wanli Ouyang, Ziwei Liu

@arXiv_qbioNC_bot@mastoxiv.page
2025-08-12 08:34:43

Understanding Human Limits in Pattern Recognition: A Computational Model of Sequential Reasoning in Rock, Paper, Scissors
Logan Cross, Erik Brockbank, Tobias Gerstenberg, Judith E. Fan, Daniel L. K. Yamins, Nick Haber
arxiv.org/abs/2508.06503

@arXiv_quantph_bot@mastoxiv.page
2025-08-11 09:46:49

Federated Quantum Kernel-Based Long Short-term Memory for Human Activity Recognition
Yu-Chao Hsu, Jiun-Cheng Jiang, Chun-Hua Lin, Wei-Ting Chen, Kuo-Chung Peng, Prayag Tiwari, Samuel Yen-Chi Chen, En-Jui Kuo
arxiv.org/abs/2508.06078

@arXiv_csCV_bot@mastoxiv.page
2025-08-13 14:03:51

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[3/5]:
- MUG: Pseudo Labeling Augmented Audio-Visual Mamba Network for Audio-Visual Video Parsing
Langyu Wang, Bingke Zhu, Yingying Chen, Yiyuan Zhang, Ming Tang, Jinqiao Wang

@arXiv_astrophSR_bot@mastoxiv.page
2025-07-09 08:20:52

Solar Flare Prediction Using LSTM and DLSTM with Sliding Window Pattern Recognition
Zeinab Hassani, Davud Mohammadpur, Hossein Safari
arxiv.org/abs/2507.05313

@arXiv_csCV_bot@mastoxiv.page
2025-08-13 14:03:40

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[2/5]:
- Achieving More with Less: Additive Prompt Tuning for Rehearsal-Free Class-Incremental Learning
Haoran Chen, Ping Wang, Zihan Zhou, Xu Zhang, Zuxuan Wu, Yu-Gang Jiang

@arXiv_csCV_bot@mastoxiv.page
2025-08-13 14:03:28

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[1/5]:
- Efficient Annotation of Medieval Charters
Anguelos Nicolaou, Daniel Luger, Franziska Decker, Nicolas Renet, Vincent Christlein, Georg Vogeler

@arXiv_csCV_bot@mastoxiv.page
2025-08-14 13:59:59

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[5/5]:
- Emotion-Qwen: A Unified Framework for Emotion and Vision Understanding
Huang, Li, Yan, Cheng, Han, Huang, Li, Li, Wang, Lian, Cheng, Peng

@arXiv_csCL_bot@mastoxiv.page
2025-09-10 10:01:01

Talking with Oompa Loompas: A novel framework for evaluating linguistic acquisition of LLM agents
Sankalp Tattwadarshi Swain, Anshika Krishnatray, Dhruv Kumar, Jagat Sesh Challa
arxiv.org/abs/2509.07389

@arXiv_csCV_bot@mastoxiv.page
2025-08-14 13:59:48

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[4/5]:
- MoCA: Identity-Preserving Text-to-Video Generation via Mixture of Cross Attention
Qi Xie, Yongjia Ma, Donglin Di, Xuehao Gao, Xun Yang

@arXiv_physicsoptics_bot@mastoxiv.page
2025-08-11 08:18:10

Single-Shot Multispectral Encoding: Advancing Optical Lithography for Encryption and Spectroscopy
Hyewon Shim, Geonwoong Park, Hyunsuk Yun, Sunmin Ryu, Yong-Young Noh, Cheol-Joo Kim
arxiv.org/abs/2508.05645

@arXiv_csCV_bot@mastoxiv.page
2025-08-14 13:59:38

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[3/5]:
- SpaCE-10: A Comprehensive Benchmark for Multimodal Large Language Models in Compositional Spatial...
Ziyang Gong, Wenhao Li, Oliver Ma, Songyuan Li, Jiayi Ji, Xue Yang, Gen Luo, Junchi Yan, Rongrong Ji

@arXiv_csCV_bot@mastoxiv.page
2025-08-14 13:59:28

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[2/5]:
- UltraRay: Introducing Full-Path Ray Tracing in Physics-Based Ultrasound Simulation
Felix Duelmer, Mohammad Farid Azampour, Magdalena Wysocki, Nassir Navab

@arXiv_csCV_bot@mastoxiv.page
2025-08-14 13:59:17

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[1/5]:
- Prompt-aligned Gradient for Prompt Tuning
Beier Zhu, Yulei Niu, Yucheng Han, Yue Wu, Hanwang Zhang

@arXiv_csCV_bot@mastoxiv.page
2025-07-14 12:59:44

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[3/3]:
- Objectomaly: Objectness-Aware Refinement for OoD Segmentation with Structural Consistency and Bou...
Jeonghoon Song, Sunghun Kim, Jaegyun Im, Byeongjoon Noh

@arXiv_csCV_bot@mastoxiv.page
2025-07-14 12:59:32

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[2/3]:
- InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction
Yuhui Wu, Liyi Chen, Ruibin Li, Shihao Wang, Chenxi Xie, Lei Zhang

@arXiv_csCV_bot@mastoxiv.page
2025-07-14 12:59:20

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[1/3]:
- SDR-GAIN: A High Real-Time Occluded Pedestrian Pose Completion Method for Autonomous Driving
Honghao Fu, Yongli Gu, Yidong Yan, Yilang Shen, Yiwen Wu, Libo Sun

@arXiv_csCV_bot@mastoxiv.page
2025-08-14 07:31:52

[2025-08-14 Thu (UTC), 122 new articles found for cs.CV Computer Vision and Pattern Recognition]
toXiv_bot_toot

@arXiv_csCV_bot@mastoxiv.page
2025-09-12 12:45:39

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[3/3]:
- MESH -- Understanding Videos Like Human: Measuring Hallucinations in Large Video Models
Yang, Chen, Wong, Lei, Chen, Li, Zhou, Cheng

@arXiv_csCV_bot@mastoxiv.page
2025-09-12 12:45:27

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[2/3]:
- GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving
Chi Wan, Yixin Cui, Jiatong Du, Shuo Yang, Yulong Bai, Peng Yi, Nan Li, Yanjun Huang

@arXiv_csCV_bot@mastoxiv.page
2025-09-12 12:45:15

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[1/3]:
- Total Disentanglement of Font Images into Style and Character Class Features
Daichi Haraguchi, Wataru Shimoda, Kota Yamaguchi, Seiichi Uchida

@arXiv_csCV_bot@mastoxiv.page
2025-09-12 11:06:30

Crosslisted article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[1/1]:
- DeepTV: A neural network approach for total variation minimization
Andreas Langer, Sara Behnamian

@arXiv_csCV_bot@mastoxiv.page
2025-08-12 19:14:22

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[9/9]:
- SOPHY: Learning to Generate Simulation-Ready Objects with Physical Materials
Junyi Cao, Evangelos Kalogerakis

@arXiv_csCV_bot@mastoxiv.page
2025-08-12 19:14:12

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[8/9]:
- FQGA-single: Towards Fewer Training Epochs and Fewer Model Parameters for Image-to-Image Translat...
Cho Yang

@arXiv_csCV_bot@mastoxiv.page
2025-08-12 19:14:01

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[7/9]:
- Learning Phonetic Context-Dependent Viseme for Enhancing Speech-Driven 3D Facial Animation
Hyung Kyu Kim, Hak Gu Kim

@arXiv_csCV_bot@mastoxiv.page
2025-08-12 19:13:51

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[6/9]:
- UltraAD: Fine-Grained Ultrasound Anomaly Classification via Few-Shot CLIP Adaptation
Yue Zhou, Yuan Bi, Wenjuan Tong, Wei Wang, Nassir Navab, Zhongliang Jiang

@arXiv_csCV_bot@mastoxiv.page
2025-08-12 19:13:41

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[5/9]:
- 3D Gaussian Splatting Data Compression with Mixture of Priors
Lei Liu, Zhenghao Chen, Dong Xu

@arXiv_csCV_bot@mastoxiv.page
2025-08-12 19:13:30

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[4/9]:
- ROODI: Reconstructing Occluded Objects with Denoising Inpainters
Yeonjin Chang, Erqun Dong, Seunghyeon Seo, Nojun Kwak, Kwang Moo Yi

@arXiv_csCV_bot@mastoxiv.page
2025-08-12 19:13:20

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[3/9]:
- TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction
Zhang, Liu, Li, Zhang, Liu, Wang, Ouyang, Xiong, Gao, Hou, Cheng

@arXiv_csCV_bot@mastoxiv.page
2025-08-12 19:13:10

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[2/9]:
- Prompt-Softbox-Prompt: A Free-Text Embedding Control for Image Editing
Yitong Yang, Yinglin Wang, Tian Zhang, Jing Wang, Shuting He

@arXiv_csCV_bot@mastoxiv.page
2025-08-12 19:13:00

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[1/9]:
- On Representation Learning with Feedback
Hao Li
arxiv…

@arXiv_csCV_bot@mastoxiv.page
2025-08-12 07:31:43

[2025-08-12 Tue (UTC), 235 new articles found for cs.CV Computer Vision and Pattern Recognition]
toXiv_bot_toot

@arXiv_csCV_bot@mastoxiv.page
2025-09-10 13:56:15

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[1/5]:
- A Challenging Benchmark of Anime Style Recognition
Haotang Li, Shengtao Guo, Kailin Lyu, Xiao Yang, Tianchen Chen, Jianqing Zhu, Huanqiang Zeng

@arXiv_csCV_bot@mastoxiv.page
2025-09-11 12:48:22

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[3/3]:
- Nearest Neighbor Projection Removal Adversarial Training
Himanshu Singh, A. V. Subramanyam, Shivank Rajput, Mohan Kankanhalli

@arXiv_csCV_bot@mastoxiv.page
2025-09-11 12:48:09

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[2/3]:
- TransitReID: Transit OD Data Collection with Occlusion-Resistant Dynamic Passenger Re-Identification
Kaicong Huang, Talha Azfar, Jack Reilly, Ruimin Ke

@arXiv_csCV_bot@mastoxiv.page
2025-09-11 12:47:57

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[1/3]:
- Maximizing Information in Domain-Invariant Representation Improves Transfer Learning
Adrian Shuai Li, Elisa Bertino, Xuan-Hong Dang, Ankush Singla, Yuhai Tu, Mark N Wegman

@arXiv_csCV_bot@mastoxiv.page
2025-09-11 10:57:41

Crosslisted article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[1/1]:
- Revisiting Deepfake Detection: Chronological Continual Learning and the Limits of Generalization
Fontana, Diko, Lanzino, Marini, Kaddar, Foresti, Cinque

@arXiv_csCV_bot@mastoxiv.page
2025-08-11 13:32:13

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[4/4]:
- CARE: Enhancing Safety of Visual Navigation through Collision Avoidance via Repulsive Estimation
Joonkyung Kim, Joonyeol Sim, Woojun Kim, Katia Sycara, Changjoo Nam

@arXiv_csCV_bot@mastoxiv.page
2025-08-11 13:32:01

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[3/4]:
- Can Large Pretrained Depth Estimation Models Help With Image Dehazing?
Hongfei Zhang, Kun Zhou, Ruizheng Wu, Jiangbo Lu

@arXiv_csCV_bot@mastoxiv.page
2025-08-11 13:31:49

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[2/4]:
- M²IV: Towards Efficient and Fine-grained Multimodal In-Context Learning via Representation Eng...
Yanshu Li, Yi Cao, Hongyang He, Qisen Cheng, Xiang Fu, Xi Xiao, Tianyang Wang, Ruixiang Tang

@arXiv_csCV_bot@mastoxiv.page
2025-08-11 13:31:37

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[1/4]:
- Improved DDIM Sampling with Moment Matching Gaussian Mixtures
Prasad Gabbur

@arXiv_csCV_bot@mastoxiv.page
2025-07-11 13:37:49

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[4/4]:
- Attention-Enhanced Deep Learning Ensemble for Breast Density Classification in Mammography
Peyman Sharifian, Xiaotong Hong, Alireza Karimian, Mehdi Amini, Hossein Arabi

@arXiv_csCV_bot@mastoxiv.page
2025-07-11 13:37:37

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[3/4]:
- Adaptation of Multi-modal Representation Models for Multi-task Surgical Computer Vision
Soham Walimbe, Britty Baby, Vinkle Srivastav, Nicolas Padoy

@arXiv_csCV_bot@mastoxiv.page
2025-07-11 13:37:25

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[2/4]:
- Dance Like a Chicken: Low-Rank Stylization for Human Motion Diffusion
Haim Sawdayee, Chuan Guo, Guy Tevet, Bing Zhou, Jian Wang, Amit H. Bermano

@arXiv_csCV_bot@mastoxiv.page
2025-07-11 13:37:13

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[1/4]:
- Boundary Learning by Using Weighted Propagation in Convolution Network
Wei Liu, Jiahao Chen, Chuni Liu, Xiaojuan Ban, Boyuan Ma, Hao Wang, Weihua Xue, Yu Guo

@arXiv_csCV_bot@mastoxiv.page
2025-09-09 17:22:34

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[1/5]:
- Representation-Centric Survey of Skeletal Action Recognition and the ANUBIS Benchmark
Liu, Yang, Perera, Ji, Kim, Xu, Wang, Anwar, Gedeon, Wang, Qin

@arXiv_csCV_bot@mastoxiv.page
2025-09-11 07:32:23

[2025-09-11 Thu (UTC), 59 new articles found for cs.CV Computer Vision and Pattern Recognition]
toXiv_bot_toot

@arXiv_csCV_bot@mastoxiv.page
2025-09-10 13:56:57

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[5/5]:
- IntuiTF: MLLM-Guided Transfer Function Optimization for Direct Volume Rendering
Wang, Pan, Wang, Liu, Mao, Liu, Zhu, Huang, Chen, Zhang, Chen

@arXiv_csCV_bot@mastoxiv.page
2025-09-10 13:56:47

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[4/5]:
- BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models
Yuming Li, Yikai Wang, Yuying Zhu, Zhongyu Zhao, Ming Lu, Qi She, Shanghang Zhang

@arXiv_csCV_bot@mastoxiv.page
2025-09-10 13:56:36

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[3/5]:
- Conditional Video Generation for High-Efficiency Video Compression
Fangqiu Yi, Jingyu Xu, Jiawei Shao, Chi Zhang, Xuelong Li

@arXiv_csCV_bot@mastoxiv.page
2025-09-10 13:56:26

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[2/5]:
- Large-scale Pre-training for Grounded Video Caption Generation
Evangelos Kazakos, Cordelia Schmid, Josef Sivic

@arXiv_csCV_bot@mastoxiv.page
2025-09-10 11:29:07

Crosslisted article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[1/1]:
- Benchmarking Vision Transformers and CNNs for Thermal Photovoltaic Fault Detection with Explainab...
Serra Aksoy

@arXiv_csCV_bot@mastoxiv.page
2025-07-10 14:11:02

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[4/4]:
- Tora2: Motion and Appearance Customized Diffusion Transformer for Multi-Entity Video Generation
Zhenghao Zhang, Junchao Liao, Xiangyu Meng, Long Qin, Weizhi Wang

@arXiv_csCV_bot@mastoxiv.page
2025-07-10 14:10:52

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[3/4]:
- ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
Bai, Xia, Fu, Wang, Mu, Cao, Liu, Hu, Bai, Wan, Zhang

@arXiv_csCV_bot@mastoxiv.page
2025-07-10 14:10:42

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[2/4]:
- Hespi: A pipeline for automatically detecting information from herbarium specimen sheets
Robert Turnbull, Emily Fitzgerald, Karen Thompson, Joanne L. Birch

@arXiv_csCV_bot@mastoxiv.page
2025-07-10 14:10:31

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[1/4]:
- Bayesian Multi-Scale Neural Network for Crowd Counting
Abhinav Sagar

@arXiv_csCV_bot@mastoxiv.page
2025-09-08 12:39:00

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[1/3]:
- Representation-Centric Survey of Skeletal Action Recognition and the ANUBIS Benchmark
Liu, Yang, Perera, Ji, Kim, Xu, Wang, Anwar, Gedeon, Wang, Qin

@arXiv_csCV_bot@mastoxiv.page
2025-08-08 14:04:39

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[4/5]:
- EgoPrompt: Prompt Learning for Egocentric Action Recognition
Huaihai Lyu, Chaofan Chen, Yuheng Ji, Changsheng Xu

@arXiv_csCV_bot@mastoxiv.page
2025-09-09 17:23:22

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[5/5]:
- Content Generation Models in Computational Pathology: A Comprehensive Survey on Methods, Applicat...
Yuan Zhang, Xinfeng Zhang, Xiaoming Qi, Xinyu Wu, Feng Chen, Guanyu Yang, Huazhu Fu

@arXiv_csCV_bot@mastoxiv.page
2025-09-09 17:23:10

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[4/5]:
- ER-LoRA: Effective-Rank Guided Adaptation for Weather-Generalized Depth Estimation
Weilong Yan, Xin Zhang, Robby T. Tan

@arXiv_csCV_bot@mastoxiv.page
2025-09-09 17:22:59

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[3/5]:
- CoreMark: Toward Robust and Universal Text Watermarking Technique
Jiale Meng, Yiming Li, Zheming Lu, Zewei He, Hao Luo, Tianwei Zhang

@arXiv_csCV_bot@mastoxiv.page
2025-09-09 17:22:46

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[2/5]:
- CHIRLA: Comprehensive High-resolution Identification and Re-identification for Large-scale Analysis
Bessie Dominguez-Dager, Felix Escalona, Francisco Gomez-Donoso, Miguel Cazorla

@arXiv_csCV_bot@mastoxiv.page
2025-09-09 13:57:37

Crosslisted article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[2/2]:
- Signal-Based Malware Classification Using 1D CNNs
Jack Wilkie, Hanan Hindy, Ivan Andonovic, Christos Tachtatzis, Robert Atkinson

@arXiv_csCV_bot@mastoxiv.page
2025-09-09 13:57:21

Crosslisted article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[1/2]:
- LocoMamba: Vision-Driven Locomotion via End-to-End Deep Reinforcement Learning with Mamba
Yinuo Wang, Gavin Tao

@arXiv_csCV_bot@mastoxiv.page
2025-07-10 07:31:31

[2025-07-10 Thu (UTC), 84 new articles found for cs.CV Computer Vision and Pattern Recognition]
toXiv_bot_toot

@arXiv_csCV_bot@mastoxiv.page
2025-07-08 23:22:20

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[10/10]:
- Judging the Judges: Can Large Vision-Language Models Fairly Evaluate Chart Comprehension and Reas...
Laskar, Islam, Mahbub, Masry, Rahman, Bhuiyan, Nayeem, Joty, Hoque, Huang

@arXiv_csCV_bot@mastoxiv.page
2025-07-08 23:22:09

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[9/10]:
- A Novel Automatic Real-time Motion Tracking Method in MRI-guided Radiotherapy Using Enhanced Trac...
Chen, Wang, Dai, Qin, Cao, Zhao, Chen, Wu, Tang

@arXiv_csCV_bot@mastoxiv.page
2025-07-08 23:21:59

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[8/10]:
- AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detection via Multimodal La...
Zhou, Luo, Wu, Sun, Ji, Yan, Ding, Sun, Wu, Ji

@arXiv_csCV_bot@mastoxiv.page
2025-07-08 23:21:48

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[7/10]:
- Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs
Shaojie Zhang, Jiahui Yang, Jianqin Yin, Zhenbo Luo, Jian Luan

@arXiv_csCV_bot@mastoxiv.page
2025-07-08 23:21:38

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[6/10]:
- AniCrafter: Customizing Realistic Human-Centric Animation via Avatar-Background Conditioning in V...
Niu, Cao, Zhan, Zhu, Ma, Zhao, Zeng, Zhong, Sun, Zheng

@arXiv_csCV_bot@mastoxiv.page
2025-07-08 23:21:28

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[5/10]:
- 4D mmWave Radar for Sensing Enhancement in Adverse Environments: Advances and Challenges
Xiangyuan Peng, Miao Tang, Huawei Sun, Kay Bierzynski, Lorenzo Servadei, Robert Wille

@arXiv_csCV_bot@mastoxiv.page
2025-07-08 23:21:17

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[4/10]:
- MORPH-LER: Log-Euclidean Regularization for Population-Aware Image Registration
Mokshagna Sai Teja Karanam, Krithika Iyer, Sarang Joshi, Shireen Elhabian

@arXiv_csCV_bot@mastoxiv.page
2025-07-08 23:21:07

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[3/10]:
- Dynamic EventNeRF: Reconstructing General Dynamic Scenes from Multi-view RGB and Event Streams
Viktor Rudnev, Gereon Fox, Mohamed Elgharib, Christian Theobalt, Vladislav Golyanik

@arXiv_csCV_bot@mastoxiv.page
2025-07-08 23:20:57

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[2/10]:
- Deep Transformer Network for Monocular Pose Estimation of Shipborne Unmanned Aerial Vehicle
Maneesha Wickramasuriya, Taeyoung Lee, Murray Snyder

@arXiv_csCV_bot@mastoxiv.page
2025-07-08 23:20:46

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[1/10]:
- AASeg: Attention Aware Network for Real Time Semantic Segmentation
Abhinav Sagar

@arXiv_csCV_bot@mastoxiv.page
2025-07-09 14:35:25

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[5/5]:
- Multimodal Integration Challenges in Emotionally Expressive Child Avatars for Training Applications
Pegah Salehi, Sajad Amouei Sheshkal, Vajira Thambawita, Michael A. Riegler, Pål Halvorsen

@arXiv_csCV_bot@mastoxiv.page
2025-07-09 14:35:15

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[4/5]:
- From Video to EEG: Adapting Joint Embedding Predictive Architecture to Uncover Visual Concepts in...
Amirabbas Hojjati, Lu Li, Ibrahim Hameed, Anis Yazidi, Pedro G. Lind, Rabindra Khadka

@arXiv_csCV_bot@mastoxiv.page
2025-07-09 14:35:05

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[3/5]:
- GC-GAT: Multimodal Vehicular Trajectory Prediction using Graph Goal Conditioning and Cross-contex...
Mahir Gulzar, Yar Muhammad, Naveed Muhammad

@arXiv_csCV_bot@mastoxiv.page
2025-07-09 14:34:55

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[2/5]:
- Enhancing Visual Re-ranking through Denoising Nearest Neighbor Graph via Continuous CRF
Jaeyoon Kim, Yoonki Cho, Taeyoung Kim, Sung-Eui Yoon

@arXiv_csCV_bot@mastoxiv.page
2025-07-09 14:34:35

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[1/5]:
- UVOSAM: A Mask-free Paradigm for Unsupervised Video Object Segmentation via Segment Anything Model
Zhenghao Zhang, Shengfan Zhang, Zhichao Wei, Zuozhuo Dai, Siyu Zhu

@arXiv_csCV_bot@mastoxiv.page
2025-08-07 14:40:51

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[5/6]:
- Live Demonstration: Neuromorphic Radar for Gesture Recognition
Satyapreet Singh Yadav, Akash K S, Chandra Sekhar Seelamantula, Chetan Singh Thakur

@arXiv_csCV_bot@mastoxiv.page
2025-09-08 12:39:21

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[3/3]:
- AnomalyLMM: Bridging Generative Knowledge and Discriminative Retrieval for Text-Based Person Anom...
Hao Ju, Hu Zhang, Zhedong Zheng

@arXiv_csCV_bot@mastoxiv.page
2025-09-08 12:39:11

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[2/3]:
- Auto-Connect: Connectivity-Preserving RigFormer with Direct Preference Optimization
Guo, Liu, Chen, Mao, Hu, Jiang, Yu, Xu, Liu, Xu, Chen, Guo

@arXiv_csCV_bot@mastoxiv.page
2025-09-08 10:57:34

Crosslisted article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[1/1]:
- Sample-efficient Integration of New Modalities into Large Language Models
Osman Batur İnce, André F. T. Martins, Oisin Mac Aodha, Edoardo M. Ponti

@arXiv_csCV_bot@mastoxiv.page
2025-08-08 14:04:49

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[5/5]:
- STF: Shallow-Level Temporal Feedback to Enhance Spiking Transformers
Zheng, Zhu, Yu, Huang, Lv, Tang, Yu, Jin

@arXiv_csCV_bot@mastoxiv.page
2025-08-08 14:04:29

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[3/5]:
- CountingFruit: Language-Guided 3D Fruit Counting with Semantic Gaussian Splatting
Fengze Li, Yangle Liu, Jieming Ma, Hai-Ning Liang, Yaochun Shen, Huangxiang Li, Zhijing Wu

@arXiv_csCV_bot@mastoxiv.page
2025-08-08 14:04:18

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[2/5]:
- MetaOcc: Spatio-Temporal Fusion of Surround-View 4D Radar and Camera for 3D Occupancy Prediction ...
Yang, Zheng, Ai, Liu, Li, Lin, Yan, Bai, Ma, Huang, Zhu

@arXiv_csCV_bot@mastoxiv.page
2025-08-08 14:04:08

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[1/5]:
- A Fast Text-Driven Approach for Generating Artistic Content
Marian Lupascu, Ryan Murdock, Ionut Mironica, Yijun Li

@arXiv_csCV_bot@mastoxiv.page
2025-08-07 14:41:02

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[6/6]:
- IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks
Xiaoya Lu, Zeren Chen, Xuhao Hu, Yijin Zhou, Weichen Zhang, Dongrui Liu, Lu Sheng, Jing Shao

@arXiv_csCV_bot@mastoxiv.page
2025-08-07 14:40:41

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[4/6]:
- EchoMimicV3: 1.3B Parameters are All You Need for Unified Multi-Modal and Multi-Task Human Animation
Rang Meng, Yan Wang, Weipeng Wu, Ruobing Zheng, Yuming Li, Chenguang Ma

@arXiv_csCV_bot@mastoxiv.page
2025-08-07 14:40:31

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[3/6]:
- Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs
Gu, Yang, Feng, Wang, Zhang, Long, Chen, Cai, Deng

@arXiv_csCV_bot@mastoxiv.page
2025-08-07 14:40:20

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[2/6]:
- CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation
Hui Zhang, Dexiang Hong, Yitong Wang, Jie Shao, Xinglong Wu, Zuxuan Wu, Yu-Gang Jiang

@arXiv_csCV_bot@mastoxiv.page
2025-08-07 14:40:10

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[1/6]:
- Hulk: A Universal Knowledge Translator for Human-Centric Tasks
Wang, Wu, He, Guo, Zhu, Bai, Zhao, Wu, He, Ouyang, Tang

@arXiv_csCV_bot@mastoxiv.page
2025-09-03 22:49:18

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[8/9]:
- Advancing Grounded Multimodal Named Entity Recognition via LLM-Based Reformulation and Box-Based ...
Jinyuan Li, Ziyan Li, Han Li, Jianfei Yu, Rui Xia, Di Sun, Gang Pan

@arXiv_csCV_bot@mastoxiv.page
2025-08-06 14:57:20

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[6/6]:
- GRILL: Gradient Signal Restoration in Ill-Conditioned Layers to Enhance Adversarial Attacks on Au...
Chethan Krishnamurthy Ramanaik, Arjun Roy, Tobias Callies, Eirini Ntoutsi

@arXiv_csCV_bot@mastoxiv.page
2025-08-06 14:57:10

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[5/6]:
- Multimodal Referring Segmentation: A Survey
Henghui Ding, Song Tang, Shuting He, Chang Liu, Zuxuan Wu, Yu-Gang Jiang

@arXiv_csCV_bot@mastoxiv.page
2025-08-06 14:57:00

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[4/6]:
- BSMamba: Brightness and Semantic Modeling for Long-Range Interaction in Low-Light Image Enhancement
Tongshun Zhang, Pingping Liu, Mengen Cai, Zijian Zhang, Yubing Lu, Qiuzhan Zhou

@arXiv_csCV_bot@mastoxiv.page
2025-08-06 14:56:49

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[3/6]:
- Knowledge Distillation for Underwater Feature Extraction and Matching via GAN-synthesized Images
Jinghe Yang, Mingming Gong, Ye Pu