Tootfinder

@arXiv_csCV_bot@mastoxiv.page
2025-10-14 16:14:50

Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/3]:
- ArtPerception: ASCII Art-based Jailbreak on LLMs with Recognition Pre-test
Guan-Yan Yang, Tzu-Yu Cheng, Ya-Wen Teng, Farn Wanga, Kuo-Hui Yeh

@arXiv_csSE_bot@mastoxiv.page
2025-10-14 10:48:28

Software Defect Prediction using Autoencoder Transformer Model
Seshu Barma, Mohanakrishnan Hariharan, Satish Arvapalli
https://arxiv.org/abs/2510.10840 https://

Software Defect Prediction using Autoencoder Transformer Model
An AI-ML-powered quality engineering approach uses AI-ML to enhance software quality assessments by predicting defects. Existing ML models struggle with noisy data types, imbalances, pattern recognition, feature extraction, and generalization. To address these challenges, we develop a new model, Adaptive Differential Evolution (ADE) based Quantum Variational Autoencoder-Transformer (QVAET) Model (ADE-QVAET). ADE combines with QVAET to obtain high-dimensional latent features and maintain sequent…

@arXiv_csCV_bot@mastoxiv.page
2025-10-13 14:53:19

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[5/5]:
- Frequency-Aware Ensemble Learning for BraTS 2025 Pediatric Brain Tumor Segmentation
Yuxiao Yi, Qingyao Zhuang, Zhi-Qin John Xu, Xiaowen Wang, Yan Ren, Tianming Qiu

@arXiv_csCV_bot@mastoxiv.page
2025-10-13 14:53:07

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/5]:
- ALISE: Annotation-Free LiDAR Instance Segmentation for Autonomous Driving
Yongxuan Lyu, Guangfeng Jiang, Hongsi Liu, Jun Liu

@arXiv_csCV_bot@mastoxiv.page
2025-10-13 14:52:55

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/5]:
- MedCAL-Bench: A Comprehensive Benchmark on Cold-Start Active Learning with Foundation Models for ...
Ning Zhu, Xiaochuan Ma, Shaoting Zhang, Guotai Wang

@trochee@dair-community.social
2025-10-26 15:17:51

> Companies that can find the courage to trust their people, who bet on caring and curation and small models as differentiators instead of just rebadging big company products will have better choices and they’ll make better choices.
Among the bangers here from @… :
Pattern Recognition And Repetition | blarg

@arXiv_csCV_bot@mastoxiv.page
2025-10-13 14:52:43

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/5]:
- Deep Learning for Sports Video Event Detection: Tasks, Datasets, Methods, and Challenges
Hao Xu, Arbind Agrahari Baniya, Sam Well, Mohamed Reda Bouadjenek, Richard Dazeley, Sunil Aryal

@arXiv_csCV_bot@mastoxiv.page
2025-10-13 14:52:31

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/5]:
- Improving Image Captioning Descriptiveness by Ranking and LLM-based Fusion
Luigi Celona, Simone Bianco, Marco Donzella, Paolo Napoletano

@arXiv_csNE_bot@mastoxiv.page
2025-10-08 07:35:39

From Neural Activity to Computation: Biological Reservoirs for Pattern Recognition in Digit Classification
Ludovico Iannello, Luca Ciampi, Fabrizio Tonelli, Gabriele Lagani, Lucio Maria Calcagnile, Federico Cremisi, Angelo Di Garbo, Giuseppe Amato
https://arxiv.org/abs/2510.05637

From Neural Activity to Computation: Biological Reservoirs for Pattern Recognition in Digit Classification
In this paper, we present a biologically grounded approach to reservoir computing (RC), in which a network of cultured biological neurons serves as the reservoir substrate. This system, referred to as biological reservoir computing (BRC), replaces artificial recurrent units with the spontaneous and evoked activity of living neurons. A multi-electrode array (MEA) enables simultaneous stimulation and readout across multiple sites: inputs are delivered through a subset of electrodes, while the rem…

@arXiv_csCV_bot@mastoxiv.page
2025-10-13 11:54:57

Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/2]:
- Identifying & Interactively Refining Ambiguous User Goals for Data Visualization Code Generation
Mert \.Inan, Anthony Sicilia, Alex Xie, Saujas Vaduguru, Daniel Fried, Malihe Alikhani

@arXiv_csCV_bot@mastoxiv.page
2025-10-13 11:54:42

Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/2]:
- Look before Transcription: End-to-End SlideASR with Visually-Anchored Policy Optimization
Rui Hu, Delai Qiu, Yining Wang, Shengping Liu, Jitao Sang

@arXiv_csRO_bot@mastoxiv.page
2025-10-09 10:06:51

TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics
Yi Han, Cheng Chi, Enshen Zhou, Shanyu Rong, Jingkun An, Pengwei Wang, Zhongyuan Wang, Lu Sheng, Shanghang Zhang
https://arxiv.org/abs/2510.07181

TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics
Vision-Language Models (VLMs) have shown remarkable capabilities in spatial reasoning, yet they remain fundamentally limited to qualitative precision and lack the computational precision required for real-world robotics. Current approaches fail to leverage metric cues from depth sensors and camera calibration, instead reducing geometric problems to pattern recognition tasks that cannot deliver the centimeter-level accuracy essential for robotic manipulation. We present TIGeR (Tool-Integrated Ge…

@arXiv_csAI_bot@mastoxiv.page
2025-10-08 10:20:59

RareAgent: Self-Evolving Reasoning for Drug Repurposing in Rare Diseases
Lang Qin, Zijian Gan, Xu Cao, Pengcheng Jiang, Yankai Jiang, Jiawei Han, Kaishun Wu, Jintai Chen
https://arxiv.org/abs/2510.05764

RareAgent: Self-Evolving Reasoning for Drug Repurposing in Rare Diseases
Computational drug repurposing for rare diseases is especially challenging when no prior associations exist between drugs and target diseases. Therefore, knowledge graph completion and message-passing GNNs have little reliable signal to learn and propagate, resulting in poor performance. We present RareAgent, a self-evolving multi-agent system that reframes this task from passive pattern recognition to active evidence-seeking reasoning. RareAgent organizes task-specific adversarial debates in w…

@arXiv_csCV_bot@mastoxiv.page
2025-10-14 22:05:25

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[8/8]:
- TC-GS: A Faster Gaussian Splatting Module Utilizing Tensor Cores
Liao, Ding, Cui, Gong, Hu, Wang, Li, Zhang, Wang, Fu

@arXiv_csCV_bot@mastoxiv.page
2025-10-14 22:05:05

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[7/8]:
- MultiCOIN: Multi-Modal COntrollable Video INbetweening
Tanveer, Zhou, Niklaus, Amiri, Zhang, Singh, Zhao

@arXiv_csCV_bot@mastoxiv.page
2025-10-14 22:04:45

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[6/8]:
- GeoVLM-R1: Reinforcement Fine-Tuning for Improved Remote Sensing Reasoning
Mustansar Fiaz, Hiyam Debary, Paolo Fraccaro, Danda Paudel, Luc Van Gool, Fahad Khan, Salman Khan

@arXiv_csCV_bot@mastoxiv.page
2025-10-14 22:04:25

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[5/8]:
- Context Guided Transformer Entropy Modeling for Video Compression
Junlong Tong, Wei Zhang, Yaohui Jin, Xiaoyu Shen

@arXiv_csCV_bot@mastoxiv.page
2025-10-14 22:04:05

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/8]:
- Boosting Adversarial Transferability via Commonality-Oriented Gradient Optimization
Yanting Gao, Yepeng Liu, Junming Liu, Qi Zhang, Hongyun Zhang, Duoqian Miao, Cairong Zhao

@arXiv_eessSP_bot@mastoxiv.page
2025-10-07 07:40:15

COMET: Co-Optimization of a CNN Model using Efficient-Hardware OBC Techniques
Boyang Chen, Mohd Tasleem Khan, George Goussetis, Mathini Sellathurai, Yuan Ding, Jo\~ao F. C. Mota
https://arxiv.org/abs/2510.03516

COMET: Co-Optimization of a CNN Model using Efficient-Hardware OBC Techniques
Convolutional Neural Networks (CNNs) are highly effective for computer vision and pattern recognition tasks; however, their computational intensity and reliance on hardware such as FPGAs pose challenges for deployment on low-power edge devices. In this work, we present COMET, a framework of CNN designs that employ efficient hardware offset-binary coding (OBC) techniques to enable co-optimization of performance and resource utilization. The approach formulates CNN inference with OBC representati…

@arXiv_csCV_bot@mastoxiv.page
2025-10-14 22:03:45

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/8]:
- Learning to Instruct for Visual Instruction Tuning
Zhihan Zhou, Feng Hong, Jiaan Luo, Jiangchao Yao, Dongsheng Li, Bo Han, Ya Zhang, Yanfeng Wang

@arXiv_csCV_bot@mastoxiv.page
2025-10-14 22:03:25

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/8]:
- Multimodal Alignment and Fusion: A Survey
Songtao Li, Hao Tang
https://

@arXiv_csCV_bot@mastoxiv.page
2025-10-14 22:03:06

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/8]:
- Invariant Feature Learning for Generalized Long-Tailed Classification
Kaihua Tang, Mingyuan Tao, Jiaxin Qi, Zhenguang Liu, Hanwang Zhang

@arXiv_csCV_bot@mastoxiv.page
2025-10-14 16:15:04

Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/3]:
- Adversarial Attacks Leverage Interference Between Features in Superposition
Edward Stevinson, Lucas Prieto, Melih Barsbey, Tolga Birdal

@arXiv_csCV_bot@mastoxiv.page
2025-10-14 16:14:34

Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/3]:
- Gradient-Sign Masking for Task Vector Transport Across Pre-Trained Models
Rinaldi, Panariello, Salici, Liu, Ciccone, Porrello, Calderara

@arXiv_csAI_bot@mastoxiv.page
2025-09-24 07:30:44

SPADE: A Large Language Model Framework for Soil Moisture Pattern Recognition and Anomaly Detection in Precision Agriculture
Yeonju Lee, Rui Qi Chen, Joseph Oboamah, Po Nien Su, Wei-zhen Liang, Yeyin Shi, Lu Gan, Yongsheng Chen, Xin Qiao, Jing Li
https://arxiv.org/abs/2509.18123

SPADE: A Large Language Model Framework for Soil Moisture Pattern Recognition and Anomaly Detection in Precision Agriculture
Accurate interpretation of soil moisture patterns is critical for irrigation scheduling and crop management, yet existing approaches for soil moisture time-series analysis either rely on threshold-based rules or data-hungry machine learning or deep learning models that are limited in adaptability and interpretability. In this study, we introduce SPADE (Soil moisture Pattern and Anomaly DEtection), an integrated framework that leverages large language models (LLMs) to jointly detect irrigation p…

@arXiv_csCV_bot@mastoxiv.page
2025-10-09 14:52:09

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/4]:
- DiffMI: Breaking Face Recognition Privacy via Diffusion-Driven Training-Free Model Inversion
Hanrui Wang, Shuo Wang, Chun-Shien Lu, Isao Echizen

@arXiv_csCV_bot@mastoxiv.page
2025-10-10 16:32:27

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[5/5]:
- TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics
Han, Chi, Zhou, Rong, An, Wang, Wang, Sheng, Zhang

@arXiv_csCV_bot@mastoxiv.page
2025-10-10 16:32:12

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/5]:
- HIVTP: A Training-Free Method to Improve VLMs Efficiency via Hierarchical Visual Token Pruning Us...
Jingqi Xu, Jingxi Lu, Chenghao Li, Sreetama Sarkar, Peter A. Beerel

@arXiv_csCV_bot@mastoxiv.page
2025-10-10 16:31:59

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/5]:
- VisionTS : Cross-Modal Time Series Foundation Model with Continual Pre-trained Vision Backbones
Lefei Shen, Mouxiang Chen, Xu Liu, Han Fu, Xiaoxue Ren, Jianling Sun, Zhuo Li, Chenghao Liu

@arXiv_csCV_bot@mastoxiv.page
2025-10-10 16:31:45

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/5]:
- Uncertainty-Aware Diffusion Guided Refinement of 3D Scenes
Bose, Dutta, Nag, Zhang, Li, Karydis, Chowdhury
…

@arXiv_csCV_bot@mastoxiv.page
2025-10-10 16:31:30

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/5]:
- PRVR: Partially Relevant Video Retrieval
Xianke Chen, Daizong Liu, Xun Yang, Xirong Li, Jianfeng Dong, Meng Wang, Xun Wang

@arXiv_csCV_bot@mastoxiv.page
2025-10-10 13:11:17

Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/2]:
- Splat the Net: Radiance Fields with Splattable Neural Primitives
Zhou, Nguyen, Magne, Golyanik, Leimk\"uhler, Theobalt

@arXiv_csCV_bot@mastoxiv.page
2025-10-10 13:11:01

Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/2]:
- DUA-D2C: Dynamic Uncertainty Aware Method for Overfitting Remediation in Deep Learning
Md. Saiful Bari Siddiqui, Md Mohaiminul Islam, Md. Golam Rabiul Alam

@arXiv_csCV_bot@mastoxiv.page
2025-10-09 14:51:57

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/4]:
- Unified Unsupervised Anomaly Detection via Matching Cost Filtering
Zhang, Cai, Wu, Zhang, Liu, Tao, Chai, Zhu

@arXiv_csCV_bot@mastoxiv.page
2025-10-09 14:51:45

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/4]:
- RGS-DR: Deferred Reflections and Residual Shading in 2D Gaussian Splatting
Georgios Kouros, Minye Wu, Tinne Tuytelaars

@arXiv_csCV_bot@mastoxiv.page
2025-10-09 14:51:33

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/4]:
- Spatiotemporal Tile-based Attention-guided LSTMs for Traffic Video Prediction
Tu Nguyen

@arXiv_csCV_bot@mastoxiv.page
2025-10-09 12:10:39

Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/2]:
- Revisiting Mixout: An Overlooked Path to Robust Finetuning
Masih Aminbeidokhti, Heitor Rapela Medeiros, Eric Granger, Marco Pedersoli

@arXiv_csCV_bot@mastoxiv.page
2025-10-09 12:10:25

Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/2]:
- Stacked Regression using Off-the-shelf, Stimulus-tuned and Fine-tuned Neural Networks for Predict...
Robert Scholz, Kunal Bagga, Christine Ahrends, Carlo Alberto Barbano

@arXiv_csCV_bot@mastoxiv.page
2025-10-07 20:41:13

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/7]:
- QGFace: Quality-Guided Joint Training For Mixed-Quality Face Recognition
Youzhe Song, Feng Wang

@arXiv_csCV_bot@mastoxiv.page
2025-09-29 11:18:17

$\gamma$-Quant: Towards Learnable Quantization for Low-bit Pattern Recognition
Mishal Fatima, Shashank Agnihotri, Marius Bock, Kanchana Vaishnavi Gandikota, Kristof Van Laerhoven, Michael Moeller, Margret Keuper
https://arxiv.org/abs/2509.22448

$γ$-Quant: Towards Learnable Quantization for Low-bit Pattern Recognition
Most pattern recognition models are developed on pre-proce\-ssed data. In computer vision, for instance, RGB images processed through image signal processing (ISP) pipelines designed to cater to human perception are the most frequent input to image analysis networks. However, many modern vision tasks operate without a human in the loop, raising the question of whether such pre-processing is optimal for automated analysis. Similarly, human activity recognition (HAR) on body-worn sensor data comm…

@arXiv_csCV_bot@mastoxiv.page
2025-10-03 10:36:51

Cross-Breed Pig Identification Using Auricular Vein Pattern Recognition: A Machine Learning Approach for Small-Scale Farming Applications
Emmanuel Nsengiyumvaa, Leonard Niyitegekaa, Eric Umuhoza
https://arxiv.org/abs/2510.02197

Cross-Breed Pig Identification Using Auricular Vein Pattern Recognition: A Machine Learning Approach for Small-Scale Farming Applications
Accurate livestock identification is a cornerstone of modern farming: it supports health monitoring, breeding programs, and productivity tracking. However, common pig identification methods, such as ear tags and microchips, are often unreliable, costly, target pure breeds, and thus impractical for small-scale farmers. To address this gap, we propose a noninvasive biometric identification approach that leverages uniqueness of the auricular vein patterns. To this end, we have collected 800 ear im…

@arXiv_csCV_bot@mastoxiv.page
2025-10-07 20:42:40

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[7/7]:
- What Lurks Within? Concept Auditing for Shared Diffusion Models at Scale
Xiaoyong Yuan, Xiaolong Ma, Linke Guo, Lan Zhang

@arXiv_csCV_bot@mastoxiv.page
2025-10-07 20:42:29

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[6/7]:
- Latent Visual Reasoning
Li, Sun, Liu, Wang, Wu, Yu, Chen, Barsoum, Chen, Liu
https:…

@arXiv_csCV_bot@mastoxiv.page
2025-10-07 20:42:16

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[5/7]:
- Deep Spectral Epipolar Representations for Dense Light Field Reconstruction
Noor Islam S. Mohammad

@arXiv_csCV_bot@mastoxiv.page
2025-10-07 20:42:04

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/7]:
- Resolving Task Objective Conflicts in Unified Model via Task-Aware Mixture-of-Experts
Jiaxing Zhang, Hao Tang

@arXiv_csCV_bot@mastoxiv.page
2025-10-07 20:41:53

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/7]:
- AutoDrive-QA: A Multiple-Choice Benchmark for Vision-Language Evaluation in Urban Autonomous Driving
Boshra Khalili, Andrew W. Smyth

@arXiv_csCV_bot@mastoxiv.page
2025-10-07 20:41:33

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/7]:
- STIV: Scalable Text and Image Conditioned Video Generation
Lin, Liu, Chen, Lu, Hu, Fu, Allardice, Lai, Song, Zhang, Chen, Fei, Li, Sun, Chang, Yang

@arXiv_csCV_bot@mastoxiv.page
2025-10-07 15:25:22

Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/3]:
- Social Agent: Mastering Dyadic Nonverbal Behavior Generation via Conversational LLM Agents
Zeyi Zhang, Yanju Zhou, Heyuan Yao, Tenglong Ao, Xiaohang Zhan, Libin Liu

@arXiv_csCV_bot@mastoxiv.page
2025-10-07 15:25:06

Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/3]:
- AI-Assisted Pleural Effusion Volume Estimation from Contrast-Enhanced CT Images
Sanhita Basu, Tomas Fr\"oding, Ali Teymur Kahraman, Dimitris Toumpanakis, Tobias Sj\"oblom

@arXiv_csCV_bot@mastoxiv.page
2025-10-07 15:24:51

Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/3]:
- VIFO: Visual Feature Empowered Multivariate Time Series Forecasting with Cross-Modal Fusion
Wang, Yu, Xu, Ma, Zhang, Feng, Zhang, Huang, Sun, Zhang

@arXiv_csCV_bot@mastoxiv.page
2025-10-06 11:07:35

Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/1]:
- Secure and Robust Watermarking for AI-generated Images: A Comprehensive Survey
Jie Cao, Qi Li, Zelin Zhang, Jianbing Ni

@arXiv_csCV_bot@mastoxiv.page
2025-10-06 13:17:39

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/3]:
- VirDA: Reusing Backbone for Unsupervised Domain Adaptation with Visual Reprogramming
Duy Nguyen, Dat Nguyen

@arXiv_csCV_bot@mastoxiv.page
2025-10-06 13:17:27

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/3]:
- HopaDIFF: Holistic-Partial Aware Fourier Conditioned Diffusion for Referring Human Action Segment...
Peng, Huang, Huang, Wen, Zheng, Chen, Yang, Wu, Hao, Stiefelhagen

@arXiv_csCV_bot@mastoxiv.page
2025-10-06 13:17:13

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/3]:
- Filter-Guided Diffusion for Controllable Image Generation
Zeqi Gu, Ethan Yang, Abe Davis

@arXiv_csCV_bot@mastoxiv.page
2025-10-03 12:18:47

Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/2]:
- ROI-GS: Interest-based Local Quality 3D Gaussian Splatting
Quoc-Anh Bui, Gilles Rougeron, G\'eraldine Morin, Simone Gasparini

@arXiv_csCV_bot@mastoxiv.page
2025-10-03 12:18:31

Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/2]:
- Development and Evaluation of an AI-Driven Telemedicine System for Prenatal Healthcare
Juan Barrientos, Michaelle P\'erez, Douglas Gonz\'alez, Favio Reyna, Julio Fajardo, Andrea Lara

@arXiv_csCV_bot@mastoxiv.page
2025-10-03 15:05:08

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[5/5]:
- HAMLET: Switch your Vision-Language-Action Model into a History-Aware Policy
Myungkyu Koo, Daewon Choi, Taeyoung Kim, Kyungmin Lee, Changyeon Kim, Younggyo Seo, Jinwoo Shin

@arXiv_csCV_bot@mastoxiv.page
2025-10-03 15:04:55

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/5]:
- Equivariant Splitting: Self-supervised learning from incomplete data
Victor Sechaud, J\'er\'emy Scanvic, Quentin Barth\'elemy, Patrice Abry, Juli\'an Tachella

@arXiv_csCV_bot@mastoxiv.page
2025-10-03 15:04:42

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/5]:
- RS-OOD: A Vision-Language Augmented Framework for Out-of-Distribution Detection in Remote Sensing
Chenhao Wang, Yingrui Ji, Yu Meng, Yunjian Zhang, Yao Zhu

@arXiv_csCV_bot@mastoxiv.page
2025-10-03 15:04:30

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/5]:
- Oh-A-DINO: Understanding and Enhancing Attribute-Level Information in Self-Supervised Object-Cent...
Stefan Sylvius Wagner, Stefan Harmeling

@arXiv_csCV_bot@mastoxiv.page
2025-10-03 15:04:17

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/5]:
- LiDAR-HMR: 3D Human Mesh Recovery from LiDAR
Bohao Fan, Wenzhao Zheng, Jianjiang Feng, Jie Zhou

@arXiv_csCV_bot@mastoxiv.page
2025-10-02 15:15:19

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/4]:
- SeMoBridge: Semantic Modality Bridge for Efficient Few-Shot Adaptation of CLIP
Christoph Timmermann, Hyunse Lee, Woojin Lee

@arXiv_csCV_bot@mastoxiv.page
2025-10-02 15:15:06

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/4]:
- PAN: Pillars-Attention-Based Network for 3D Object Detection
Bispo, Mitrev, Mariotti, Botty, Humphrey, Scanlan, Eising

@arXiv_csCV_bot@mastoxiv.page
2025-10-02 15:14:55

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/4]:
- Robustness and sex differences in skin cancer detection: logistic regression vs CNNs
Pedersen, Sydendal, Wulff, Raumanns, Petersen, Cheplygina

@arXiv_csCV_bot@mastoxiv.page
2025-10-02 15:14:42

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/4]:
- ZoDIAC: Zoneout Dropout Injection Attention Calculation
Zanyar Zohourianshahzadi, Terrance E. Boult, Jugal K. Kalita

@arXiv_csCV_bot@mastoxiv.page
2025-10-02 12:20:29

Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/2]:
- A Fast and Precise Method for Searching Rectangular Tumor Regions in Brain MR Images
Hidenori Takeshima, Shuki Maruyama

@arXiv_csCV_bot@mastoxiv.page
2025-10-02 12:20:12

Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/2]:
- EVO-LRP: Evolutionary Optimization of LRP for Interpretable Model Explanations
Emerald Zhang, Julian Weaver, Samantha R Santacruz, Edward Castillo

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 13:46:12

Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/2]:
- Scaling Up Temporal Domain Generalization via Temporal Experts Averaging
Liu, Miller, Saligrama, Saenko, Gong, Lim, Plummer

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 13:45:57

Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/2]:
- Hyperbolic Optimization
Yanke Wang, Kyriakos Flouris
https://

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 17:51:31

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[7/7]:
- DiffTex: Differentiable Texturing for Architectural Proxy Models
Weidan Xiong, Yongli Wu, Bochuan Zeng, Jianwei Guo, Dani Lischinski, Daniel Cohen-Or, Hui Huang

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 17:51:20

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[6/7]:
- YOLO26: Key Architectural Enhancements and Performance Benchmarking for Real-Time Object Detection
Ranjan Sapkota, Rahul Harsha Cheppally, Ajay Sharda, Manoj Karkee

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 17:51:10

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[5/7]:
- CoFFT: Chain of Foresight-Focus Thought for Visual Language Models
Zhang, Dong, Zhang, Jia, Dang, Fernando, Liu, Shou

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 17:51:00

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/7]:
- FoundBioNet: A Foundation-Based Model for IDH Genotyping of Glioma from Multi-Parametric MRI
Somayeh Farahani, Marjaneh Hejazi, Antonio Di Ieva, Sidong Liu

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 17:50:49

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/7]:
- Photography Perspective Composition: Towards Aesthetic Perspective Recommendation
Lujian Yao, Siming Zheng, Xinbin Yuan, Zhuoxuan Cai, Pu Wu, Jinwei Chen, Bo Li, Peng-Tao Jiang

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 17:50:29

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/7]:
- LFTR: Learning-Free Token Reduction for Multimodal Large Language Models
Zihui Zhao, Yingxin Li, Yang Li

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 17:50:10

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/7]:
- M$^{2}$SNet: Multi-scale in Multi-scale Subtraction Network for Medical Image Segmentation
Xiaoqi Zhao, Hongpeng Jia, Youwei Pang, Long Lv, Feng Tian, Lihe Zhang, Weibing Sun, Huchuan Lu

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 01:24:03

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[11/11]:
- Vidar: Embodied Video Diffusion Model for Generalist Manipulation
Yao Feng, Hengkai Tan, Xinyi Mao, Chendong Xiang, Guodong Liu, Shuhe Huang, Hang Su, Jun Zhu

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 01:23:51

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[10/11]:
- AdaRank: Adaptive Rank Pruning for Enhanced Model Merging
Chanhyuk Lee, Jiho Choi, Chanryeol Lee, Donggyun Kim, Seunghoon Hong

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 01:23:39

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[9/11]:
- Do Sparse Subnetworks Exhibit Cognitively Aligned Attention? Effects of Pruning on Saliency Map F...
Sanish Suwal, Dipkamal Bhusal, Michael Clifford, Nidhi Rastogi

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 01:23:27

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[8/11]:
- DEPFusion: Dual-Domain Enhancement and Priority-Guided Mamba Fusion for UAV Multispectral Object ...
Shucong Li, Zhenyu Liu, Zijie Hong, Zhiheng Zhou, Xianghai Cao

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 01:23:14

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[7/11]:
- BLADE: Block-Sparse Attention Meets Step Distillation for Efficient Video Generation
Youping Gu, Xiaolong Li, Yuhao Hu, Minqi Chen, Bohan Zhuang

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 01:23:02

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[6/11]:
- Controllable Reference Guided Diffusion with Local Global Fusion for Real World Remote Sensing Im...
Ce Wang, Wanjie Sun

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 01:22:50

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[5/11]:
- EarthMind: Leveraging Cross-Sensor Data for Advanced Earth Observation Interpretation with a Unif...
Shu, Ren, Xiong, Paudel, Van Gool, Demir, Sebe, Rota

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 01:22:38

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/11]:
- Advancing Marine Research: UWSAM Framework and UIIS10K Dataset for Precise Underwater Instance Se...
Hua Li, Shijie Lian, Zhiyuan Li, Runmin Cong, Chongyi Li, Laurence T. Yang, Weidong Zhang, Sam Kwong

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 01:22:25

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/11]:
- Beyond Synthetic Replays: Turning Diffusion Features into Few-Shot Class-Incremental Learning Kno...
Junsu Kim, Yunhoe Ku, Dongyoon Han, Seungryul Baek

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 01:22:13

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/11]:
- DeepFRC: An End-to-End Deep Learning Model for Functional Registration and Classification
Siyuan Jiang, Yihan Hu, Wenjie Li, Pengcheng Zeng

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 01:22:01

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/11]:
- Learning to Infer Unseen Single-/Multi-Attribute-Object Compositions with Graph Networks
Hui Chen, Jingjing Jiang, Nanning Zheng

@arXiv_csCV_bot@mastoxiv.page
2025-09-30 17:50:17

Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/4]:
- TraitSpaces: Towards Interpretable Visual Creativity for Human-AI Co-Creation
Prerna Luthra

@arXiv_csCV_bot@mastoxiv.page
2025-09-30 17:50:01

Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/4]:
- ReLumix: Extending Image Relighting to Video via Video Diffusion Models
Lezhong Wang, Shutong Jin, Ruiqi Cui, Anders Bjorholm Dahl, Jeppe Revall Frisvad, Siavash Bigdeli

@arXiv_csCV_bot@mastoxiv.page
2025-09-30 17:49:45

Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/4]:
- Robust Fine-Tuning from Non-Robust Pretrained Models: Mitigating Suboptimal Transfer With Adversa...
Ngnaw\'e, Heuillet, Sahoo, Pequignot, Ahmad, Durand, Precioso, Gagn\'e

@arXiv_csCV_bot@mastoxiv.page
2025-09-30 17:49:27

Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/4]:
- Q2E: Query-to-Event Decomposition for Zero-Shot Multilingual Text-to-Video Retrieval
Shubhashis Roy Dipta, Francis Ferraro

@arXiv_csCV_bot@mastoxiv.page
2025-09-29 17:05:47

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[6/6]:
- Geometry aware inference of steady state PDEs using Equivariant Neural Fields representations
Giovanni Catalani, Michael Bauerheim, Fr\'ed\'eric Tost, Xavier Bertrand, Joseph Morlier

@arXiv_csCV_bot@mastoxiv.page
2025-09-29 17:05:34

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[5/6]:
- Deep Learning for Clouds and Cloud Shadow Segmentation in Methane Satellite and Airborne Imaging ...
Perez-Carrasco, Nasr, Roche, Miller, Zhang, Park, Walker, Garraffo, Finkbeiner, Gautam, Wofsy

@arXiv_csCV_bot@mastoxiv.page
2025-09-29 17:05:22

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/6]:
- $A^2R^2$: Advancing Img2LaTeX Conversion via Visual Reasoning with Attention-Guided Refinement
Zhecheng Li, Guoxian Song, Yiwei Wang, Zhen Xiong, Junsong Yuan, Yujun Cai

@arXiv_csCV_bot@mastoxiv.page
2025-09-29 17:05:10

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/6]:
- Astraea: A Token-wise Acceleration Framework for Video Diffusion Transformers
Liu, Cheng, Miao, Liu, Chen, Lin, Yao, Chen, Leng, Feng, Guo

@arXiv_csCV_bot@mastoxiv.page
2025-09-29 17:04:58

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/6]:
- SPEED: Scalable, Precise, and Efficient Concept Erasure for Diffusion Models
Ouxiang Li, Yuan Wang, Xinting Hu, Houcheng Jiang, Tao Liang, Yanbin Hao, Guojun Ma, Fuli Feng

@arXiv_csCV_bot@mastoxiv.page
2025-09-29 17:04:45

Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/6]:
- Multi-View Hypercomplex Learning for Breast Cancer Screening
Eleonora Lopez, Eleonora Grassucci, Danilo Comminiello

@arXiv_csCV_bot@mastoxiv.page
2025-09-29 13:06:38

Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/2]:
- Deep Learning-Based Cross-Anatomy CT Synthesis Using Adapted nnResU-Net with Anatomical Feature P...
Gonz\'alez, Longuefosse, Benito, Mart\'in, Baldacci

@arXiv_csCV_bot@mastoxiv.page
2025-09-29 13:06:24

Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/2]:
- SGAligner : Cross-Modal Language-Aided 3D Scene Graph Alignment
Binod Singh, Sayan Deb Sarkar, Iro Armeni

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 07:43:26

[2025-10-01 Wed (UTC), 133 new articles found for cs.CV Computer Vision and Pattern Recognition]
toXiv_bot_toot

Tootfinder

Opt-in global Mastodon full text search. Join the index!