
2025-07-02 09:24:29
DIJE: Dense Image Jacobian Estimation for Robust Robotic Self-Recognition and Visual Servoing
Yasunori Toshimitsu, Kento Kawaharazuka, Akihiro Miki, Kei Okada, Masayuki Inaba
https://arxiv.org/abs/2507.00446
The Demon is in Ambiguity: Revisiting Situation Recognition with Single Positive Multi-Label Learning
Yiming Lin, Yuchen Niu, Shang Wang, Kaizhu Huang, Qiufeng Wang, Xiao-Bo Jin
https://arxiv.org/abs/2508.21816
MetaLab: Few-Shot Game Changer for Image Recognition
Chaofei Qi, Zhitai Liu, Jianbin Qiu
https://arxiv.org/abs/2507.22057 https://arxiv.org/pdf/2507.22057
Replaced article(s) found for cs.NE. https://arxiv.org/list/cs.NE/new
[1/1]:
- Preprocessing Methods for Memristive Reservoir Computing for Image Recognition
Rishona Daniels, Duna Wattad, Ronny Ronen, David Saad, Shahar Kvatinsky
Replaced article(s) found for cs.ET. https://arxiv.org/list/cs.ET/new
[1/1]:
- Preprocessing Methods for Memristive Reservoir Computing for Image Recognition
Rishona Daniels, Duna Wattad, Ronny Ronen, David Saad, Shahar Kvatinsky
Explicit Residual-Based Scalable Image Coding for Humans and Machines
Yui Tatsumi, Ziyue Zeng, Hiroshi Watanabe
https://arxiv.org/abs/2506.19297 https://…
Enhanced Image Recognition Using Gaussian Boson Sampling
Si-Qiu Gong, Ming-Cheng Chen, Hua-Liang Liu, Hao Su, Yi-Chao Gu, Hao-Yang Tang, Meng-Hao Jia, Yu-Hao Deng, Qian Wei, Hui Wang, Han-Sen Zhong, Xiao Jiang, Li Li, Nai-Le Liu, Chao-Yang Lu, Jian-Wei Pan
https://arxiv.org/abs/2506.19707
Entropy-Based Non-Invasive Reliability Monitoring of Convolutional Neural Networks
Amirhossein Nazeri, Wael Hafez
https://arxiv.org/abs/2508.21715 https://…
Benchmarking Filtered Approximate Nearest Neighbor Search Algorithms on Transformer-based Embedding Vectors
Patrick Iff, Paul Bruegger, Marcin Chrapek, Maciej Besta, Torsten Hoefler
https://arxiv.org/abs/2507.21989
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/5]:
- Contrastive Test-Time Composition of Multiple LoRA Models for Image Generation
Tuna Han Salih Meral, Enis Simsar, Federico Tombari, Pinar Yanardag
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/4]:
- DSAGL: Dual-Stream Attention-Guided Learning for Weakly Supervised Whole Slide Image Classification
Cao, Cheng, Li, Zhou, Zhang, Li, Li, Gu, Zhang, Liu, Wu
Alibaba, Tencent, and other Chinese AI companies temporarily disabled chatbot functions like image recognition during China's annual college entrance exams (Luz Ding/Bloomberg)
https://www.bloomberg.com/news/articles/20
Normalized Radon Cumulative Distribution Transforms for Invariance and Robustness in Optimal Transport Based Image Classification
Matthias Beckmann, Robert Beinert, Jonas Bresch
https://arxiv.org/abs/2506.08761
Adjustable AprilTags For Identity Secured Tasks
Hao Li
https://arxiv.org/abs/2508.12304 https://arxiv.org/pdf/2508.12304
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/5]:
- Image Captioning via Compact Bidirectional Architecture
Zijie Song, Yuanen Zhou, Zhenzhen Hu, Daqing Liu, Huixia Ben, Richang Hong, Meng Wang
Event-Enriched Image Analysis Grand Challenge at ACM Multimedia 2025
Thien-Phuc Tran, Minh-Quang Nguyen, Minh-Triet Tran, Tam V. Nguyen, Trong-Le Do, Duy-Nam Ly, Viet-Tham Huynh, Khanh-Duy Le, Mai-Khiem Tran, Trung-Nghia Le
https://arxiv.org/abs/2508.18904
Neural networks with image recognition by pairs
Polad Geidarov
https://arxiv.org/abs/2506.06322 https://arxiv.org/pdf/2506.06322
Diver-Robot Communication Dataset for Underwater Hand Gesture Recognition
Igor Kvasić, Derek Orbaugh Antillon, Đula Nađ, Christopher Walker, Iain Anderson, Nikola Mišković
https://arxiv.org/abs/2506.08974
MF-LPR²: Multi-Frame License Plate Image Restoration and Recognition using Optical Flow
Kihyun Na, Junseok Oh, Youngkwan Cho, Bumjin Kim, Sungmin Cho, Jinyoung Choi, Injung Kim
https://arxiv.org/abs/2508.14797
Applying Vision Transformers on Spectral Analysis of Astronomical Objects
Luis Felipe Strano Moraes, Ignacio Becker, Pavlos Protopapas, Guillermo Cabrera-Vives
https://arxiv.org/abs/2506.00294
IDFace: Face Template Protection for Efficient and Secure Identification
Sunpill Kim, Seunghun Paik, Chanwoo Hwang, Dongsoo Kim, Junbum Shin, Jae Hong Seo
https://arxiv.org/abs/2507.12050
ViT-FIQA: Assessing Face Image Quality using Vision Transformers
Andrea Atzori, Fadi Boutros, Naser Damer
https://arxiv.org/abs/2508.13957 https://arxiv.or…
Identifying Prompted Artist Names from Generated Images
Grace Su, Sheng-Yu Wang, Aaron Hertzmann, Eli Shechtman, Jun-Yan Zhu, Richard Zhang
https://arxiv.org/abs/2507.18633 http…
Real-Time Foreign Object Recognition Based on Improved Wavelet Scattering Deep Network and Edge Computing
He Zhichao, Shen Xiangyu, Zhang Yong, Xie Nan
https://arxiv.org/abs/2507.11043
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/5]:
- Efficient Image Generation with Variadic Attention Heads
Steven Walton, Ali Hassani, Xingqian Xu, Zhangyang Wang, Humphrey Shi
Preprocessing Methods for Memristive Reservoir Computing for Image Recognition
Rishona Daniels, Duna Wattad, Ronny Ronen, David Saad, Shahar Kvatinsky
https://arxiv.org/abs/2506.05588
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/5]:
- OneIG-Bench: Omni-dimensional Nuanced Evaluation for Image Generation
Chang, Fang, Xing, Wu, Cheng, Wang, Zeng, Yu, Chen
A Steel Surface Defect Detection Method Based on Lightweight Convolution Optimization
Cong Chen, Ming Chen, Hoileong Lee, Yan Li, Jiyang Yu
https://arxiv.org/abs/2507.15476
Recognition through Reasoning: Reinforcing Image Geo-localization with Large Vision-Language Models
Ling Li, Yao Zhou, Yuxuan Liang, Fugee Tsung, Jiaheng Wei
https://arxiv.org/abs/2506.14674
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/4]:
- Image Super-Resolution with Guarantees via Conformalized Generative Models
Eduardo Adame, Daniel Csillag, Guilherme Tegoni Goedert
SFATTI: Spiking FPGA Accelerator for Temporal Task-driven Inference -- A Case Study on MNIST
Alessio Caviglia, Filippo Marostica, Alessio Carpegna, Alessandro Savino, Stefano Di Carlo
https://arxiv.org/abs/2507.10561
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/4]:
- MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching
Yepeng Liu, Zhichao Sun, Baosheng Yu, Yitian Zhao, Bo Du, Yongchao Xu, Jun Cheng
GM-Skip: Metric-Guided Transformer Block Skipping for Efficient Vision-Language Models
Lianming Huang, Haibo Hu, Qiao Li, Xin He, Nan Guan, Chun Jason Xue
https://arxiv.org/abs/2508.18227
Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/2]:
- Time-Aware One Step Diffusion Network for Real-World Image Super-Resolution
Tainyi Zhang, Zheng-Peng Duan, Peng-Tao Jiang, Bo Li, Ming-Ming Cheng, Chun-Le Guo, Chongyi Li
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/6]:
- GenLit: Reformulating Single-Image Relighting as Video Generation
Shrisha Bharadwaj, Haiwen Feng, Giorgio Becherini, Victoria Fernandez Abrevaya, Michael J. Black
Handwritten Text Recognition of Historical Manuscripts Using Transformer-Based Models
Erez Meoded
https://arxiv.org/abs/2508.11499 https://arxiv.org/pdf/25…
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/5]:
- ODES: Domain Adaptation with Expert Guidance for Online Medical Image Segmentation
Md Shazid Islam, Sayak Nag, Arindam Dutta, Miraj Ahmed, Fahim Faisal Niloy, Amit K. Roy-Chowdhury
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/6]:
- TARDIS STRIDE: A Spatio-Temporal Road Image Dataset and World Model for Autonomy
Carrión, Bai, Castro, Panaganti, Zenith, Trang, Zhang, Perona, Malik
MIND: A Noise-Adaptive Denoising Framework for Medical Images Integrating Multi-Scale Transformer
Tao Tang, Chengxu Yang
https://arxiv.org/abs/2508.07817 https://
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[5/6]:
- Enhancing Weakly Supervised 3D Medical Image Segmentation through Probabilistic-aware Learning
Runmin Jiang, Zhaoxin Fan, Junhao Wu, Lenghan Zhu, Xin Huang, Tianyang Wang, Heng Huang, Min Xu
This https://arxiv.org/abs/2506.05588 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csNE_…
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/5]:
- PARTE: Part-Guided Texturing for 3D Human Reconstruction from a Single Image
Hyeongjin Nam, Donghwan Kim, Gyeongsik Moon, Kyoung Mu Lee
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/4]:
- Improving the Reasoning of Multi-Image Grounding in MLLMs via Reinforcement Learning
Bob Zhang, Haoran Li, Tao Zhang, Cilin Yan, Jiayin Cai, Yanbin Hao
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/4]:
- Context Diffusion: In-Context Aware Image Generation
Najdenkoska, Sinha, Dubey, Mahajan, Ramanathan, Radenovic
Attributes Shape the Embedding Space of Face Recognition Models
Pierrick Leroy, Antonio Mastropietro, Marco Nurisso, Francesco Vaccarino
https://arxiv.org/abs/2507.11372
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[5/7]:
- Calibrated and Robust Foundation Models for Vision-Language and Medical Image Tasks Under Distrib...
Behraj Khan, Tahir Qasim Syed, Nouman M. Durrani, Bilal Naseem, Shabir Ahmad, Rizwan Qureshi
…
Integrating Complexity and Biological Realism: High-Performance Spiking Neural Networks for Breast Cancer Detection
Zofia Rudnicka, Januszcz Szczepanski, Agnieszka Pregowska
https://arxiv.org/abs/2506.06265
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/3]:
- Color Image Set Recognition Based on Quaternionic Grassmannians
Xiang Xiang Wang, Tin-Yau Tam
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/4]:
- Omni²: Unifying Omnidirectional Image Generation and Editing in an Omni Model
Yang, Duan, Zhu, Liu, Liu, Xu, Ma, Min, Zhai, Le Callet
A Lightweight Face Quality Assessment Framework to Improve Face Verification Performance in Real-Time Screening Applications
Ahmed Aman Ibrahim, Hamad Mansour Alawar, Abdulnasser Abbas Zehi, Ahmed Mohammad Alkendi, Bilal Shafi Ashfaq Ahmed Mirza, Shan Ullah, Ismail Lujain Jaleel, Hassan Ugail
https://arxiv.org/abs/2507.15961
Crosslisted article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/2]:
- From Image Captioning to Visual Storytelling
Admitos Passadakis, Yingjin Song, Albert Gatt
This https://arxiv.org/abs/2505.24380 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCV_…
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/7]:
- Hyperspectral Image Generation with Unmixing Guided Diffusion Model
Shiyu Shen, Bin Pan, Ziye Zhang, Zhenwei Shi
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[5/7]:
- ForensicsSAM: Toward Robust and Unified Image Forgery Detection and Localization Resisting to Adv...
Rongxuan Peng, Shunquan Tan, Chenqi Kong, Anwei Luo, Alex C. Kot, Jiwu Huang
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[1/7]:
- Adaptively Clustering Neighbor Elements for Image-Text Generation
Zihua Wang, Xu Yang, Hanwang Zhang, Haiyang Xu, Ming Yan, Fei Huang, Yu Zhang
DART: Dual Adaptive Refinement Transfer for Open-Vocabulary Multi-Label Recognition
Haijing Liu, Tao Pu, Hefeng Wu, Keze Wang, Liang Lin
https://arxiv.org/abs/2508.05585 https:/…
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/5]:
- FrameBridge: Improving Image-to-Video Generation with Bridge Models
http…
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[8/9]:
- FQGA-single: Towards Fewer Training Epochs and Fewer Model Parameters for Image-to-Image Translat...
Cho Yang
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/3]:
- FlowAlign: Trajectory-Regularized, Inversion-Free Flow-based Image Editing
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/4]:
- RL-MoE: An Image-Based Privacy Preserving Approach In Intelligent Transportation System
Abdolazim Rezaei, Mehdi Sookhak, Mahboobeh Haghparast
Large Language Models Facilitate Vision Reflection in Image Classification
Guoyuan An, JaeYoon Kim, SungEui Yoon
https://arxiv.org/abs/2508.06525 https://a…
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/4]:
- GIE-Bench: Towards Grounded Evaluation for Text-Guided Image Editing
Yusu Qian, Jiasen Lu, Tsu-Jui Fu, Xinze Wang, Chen Chen, Yinfei Yang, Wenze Hu, Zhe Gan
Self-Aware Adaptive Alignment: Enabling Accurate Perception for Intelligent Transportation Systems
Tong Xiang, Hongxia Zhao, Fenghua Zhu, Yuanyuan Chen, Yisheng Lv
https://arxiv.org/abs/2508.13823
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/9]:
- Prompt-Softbox-Prompt: A Free-Text Embedding Control for Image Editing
Yitong Yang, Yinglin Wang, Tian Zhang, Jing Wang, Shuting He
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/4]:
- Can Large Pretrained Depth Estimation Models Help With Image Dehazing?
Hongfei Zhang, Kun Zhou, Ruizheng Wu, Jiangbo Lu
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/10]:
- MORPH-LER: Log-Euclidean Regularization for Population-Aware Image Registration
Mokshagna Sai Teja Karanam, Krithika Iyer, Sarang Joshi, Shireen Elhabian
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[8/10]:
- AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detection via Multimodal La...
Zhou, Luo, Wu, Sun, Ji, Yan, Ding, Sun, Wu, Ji
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/6]:
- CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation
Hui Zhang, Dexiang Hong, Yitong Wang, Jie Shao, Xinglong Wu, Zuxuan Wu, Yu-Gang Jiang
A Classification-Aware Super-Resolution Framework for Ship Targets in SAR Imagery
Ch Muhammad Awais, Marco Reggiannini, Davide Moroni, Oktay Karakus
https://arxiv.org/abs/2508.06407
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/10]:
- Conditional Balance: Improving Multi-Conditioning Trade-Offs in Image Generation
Nadav Z. Cohen, Oron Nir, Ariel Shamir
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/6]:
- BSMamba: Brightness and Semantic Modeling for Long-Range Interaction in Low-Light Image Enhancement
Tongshun Zhang, Pingping Liu, Mengen Cai, Zijian Zhang, Yubing Lu, Qiuzhan Zhou
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/4]:
- Training-free Geometric Image Editing on Diffusion Models
Hanshen Zhu, Zhen Zhu, Kaile Zhang, Yiming Gong, Yuliang Liu, Xiang Bai
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[2/4]:
- FaceLift: Learning Generalizable Single Image 3D Face Reconstruction from Synthetic Heads
Weijie Lyu, Yi Zhou, Ming-Hsuan Yang, Zhixin Shu
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[4/4]:
- Diffusion-based Iterative Counterfactual Explanations for Fetal Ultrasound Image Quality Assessment
Pegios, Lin, Weng, Svendsen, Bashir, Bigdeli, Christensen, Tolsgaard, Feragen