2025-10-14 13:45:18
Benchmarking foundation models for hyperspectral image classification: Application to cereal crop type mapping
Walid Elbarz, Mohamed Bourriz, Hicham Hajji, Hamd Ait Abdelali, Fran\c{c}ois Bourzeix
https://arxiv.org/abs/2510.11576
Benchmarking foundation models for hyperspectral image classification: Application to cereal crop type mapping
Walid Elbarz, Mohamed Bourriz, Hicham Hajji, Hamd Ait Abdelali, Fran\c{c}ois Bourzeix
https://arxiv.org/abs/2510.11576
Hybrid Vision Transformer and Quantum Convolutional Neural Network for Image Classification
Mingzhu Wang, Yun Shang
https://arxiv.org/abs/2510.12291 https://
Prototypical Contrastive Learning For Improved Few-Shot Audio Classification
Christos Sgouropoulos, Christos Nikou, Stefanos Vlachos, Vasileios Theiou, Christos Foukanelis, Theodoros Giannakopoulos
https://arxiv.org/abs/2509.10074
Zero-shot image privacy classification with Vision-Language Models
Alina Elena Baia, Alessio Xompero, Andrea Cavallaro
https://arxiv.org/abs/2510.09253 https://
Measuring dissimilarity between convex cones by means of max-min angles
Welington de Oliveira, Valentina Sessa, David Sossa
https://arxiv.org/abs/2511.10483 https://arxiv.org/pdf/2511.10483 https://arxiv.org/html/2511.10483
arXiv:2511.10483v1 Announce Type: new
Abstract: This work introduces a novel dissimilarity measure between two convex cones, based on the max-min angle between them. We demonstrate that this measure is closely related to the Pompeiu-Hausdorff distance, a well-established metric for comparing compact sets. Furthermore, we examine cone configurations where the measure admits simplified or analytic forms. For the specific case of polyhedral cones, a nonconvex cutting-plane method is deployed to compute, at least approximately, the measure between them. Our approach builds on a tailored version of Kelley's cutting-plane algorithm, which involves solving a challenging master program per iteration. When this master program is solved locally, our method yields an angle that satisfies certain necessary optimality conditions of the underlying nonconvex optimization problem yielding the dissimilarity measure between the cones. As an application of the proposed mathematical and algorithmic framework, we address the image-set classification task under limited data conditions, a task that falls within the scope of the \emph{Few-Shot Learning} paradigm. In this context, image sets belonging to the same class are modeled as polyhedral cones, and our dissimilarity measure proves useful for understanding whether two image sets belong to the same class.
toXiv_bot_toot
Entropy Regularizing Activation: Boosting Continuous Control, Large Language Models, and Image Classification with Activation as Entropy Constraints
Zilin Kang, Chonghua Liao, Tingqiang Xu, Huazhe Xu
https://arxiv.org/abs/2510.08549
Crosslisted article(s) found for cs.AI. https://arxiv.org/list/cs.AI/new
[2/8]:
- Label Semantics for Robust Hyperspectral Image Classification
Hassan, Roshni, Bari, Islam, Mohammed, Farazi, Rahman
Neuroplastic Modular Framework: Cross-Domain Image Classification of Garbage and Industrial Surfaces
Debojyoti Ghosh, Soumya K Ghosh, Adrijit Goswami
https://arxiv.org/abs/2510.05071
S$^3$F-Net: A Multi-Modal Approach to Medical Image Classification via Spatial-Spectral Summarizer Fusion Network
Md. Saiful Bari Siddiqui, Mohammed Imamul Hassan Bhuiyan
https://arxiv.org/abs/2509.23442
Accelerating Dynamic Image Graph Construction on FPGA for Vision GNNs
Anvitha Ramachandran, Dhruv Parikh, Viktor Prasanna
https://arxiv.org/abs/2509.25121 https://
Textual interpretation of transient image classifications from large language models
Fiorenzo Stoppa, Turan Bulmus, Steven Bloemen, Stephen J. Smartt, Paul J. Groot, Paul Vreeswijk, Ken W. Smith
https://arxiv.org/abs/2510.06931
NeuroSketch: An Effective Framework for Neural Decoding via Systematic Architectural Optimization
Gaorui Zhang, Zhizhang Yuan, Jialan Yang, Junru Chen, Li Meng, Yang Yang
https://arxiv.org/abs/2512.09524 https://arxiv.org/pdf/2512.09524 https://arxiv.org/html/2512.09524
arXiv:2512.09524v1 Announce Type: new
Abstract: Neural decoding, a critical component of Brain-Computer Interface (BCI), has recently attracted increasing research interest. Previous research has focused on leveraging signal processing and deep learning methods to enhance neural decoding performance. However, the in-depth exploration of model architectures remains underexplored, despite its proven effectiveness in other tasks such as energy forecasting and image classification. In this study, we propose NeuroSketch, an effective framework for neural decoding via systematic architecture optimization. Starting with the basic architecture study, we find that CNN-2D outperforms other architectures in neural decoding tasks and explore its effectiveness from temporal and spatial perspectives. Building on this, we optimize the architecture from macro- to micro-level, achieving improvements in performance at each step. The exploration process and model validations take over 5,000 experiments spanning three distinct modalities (visual, auditory, and speech), three types of brain signals (EEG, SEEG, and ECoG), and eight diverse decoding tasks. Experimental results indicate that NeuroSketch achieves state-of-the-art (SOTA) performance across all evaluated datasets, positioning it as a powerful tool for neural decoding. Our code and scripts are available at https://github.com/Galaxy-Dawn/NeuroSketch.
toXiv_bot_toot
A Comparison of Selected Image Transformation Techniques for Malware Classification
Rishit Agrawal, Kunal Bhatnagar, Andrew Do, Ronnit Rana, Mark Stamp
https://arxiv.org/abs/2509.10838
A Semantics-Aware Hierarchical Self-Supervised Approach to Classification of Remote Sensing Images
Giulio Weikmann, Gianmarco Perantoni, Lorenzo Bruzzone
https://arxiv.org/abs/2510.04916
Automatic Classification of Magnetic Chirality of Solar Filaments from H-Alpha Observations
Alexis Chalmers, Azim Ahmadzadeh
https://arxiv.org/abs/2509.18214 https://
Paperless-NGX has become the true successor to #DEVONthink 3 for me, which now focuses on AI and changed its payment model. The automatic document classification is impressively accurate. Container hosting makes self-hosting straightforward. If you want modern, intelligent document management that works cross-platform with full control,
Quantum Probabilistic Label Refining: Enhancing Label Quality for Robust Image Classification
Fang Qi, Lu Peng, Zhengming Ding
https://arxiv.org/abs/2510.00528 https://
Bayesian Modelling of Multi-Year Crop Type Classification Using Deep Neural Networks and Hidden Markov Models
Gianmarco Perantoni, Giulio Weikmann, Lorenzo Bruzzone
https://arxiv.org/abs/2510.07008
Large Language Model-Based Uncertainty-Adjusted Label Extraction for Artificial Intelligence Model Development in Upper Extremity Radiography
Hanna Kreutzer, Anne-Sophie Caselitz, Thomas Dratsch, Daniel Pinto dos Santos, Christiane Kuhl, Daniel Truhn, Sven Nebelung
https://arxiv.org/abs/2510.05664 …
microCLIP: Unsupervised CLIP Adaptation via Coarse-Fine Token Fusion for Fine-Grained Image Classification
Sathira Silva, Eman Ali, Chetan Arora, Muhammad Haris Khan
https://arxiv.org/abs/2510.02270
Crosslisted article(s) found for q-bio.NC. https://arxiv.org/list/q-bio.NC/new
[1/1]:
- Error correction in multiclass image classification of facial emotion on unbalanced samples
Andrey A. Lebedev, Victor B. Kazantsev, Sergey V. Stasenko
FireGNN: Neuro-Symbolic Graph Neural Networks with Trainable Fuzzy Rules for Interpretable Medical Image Classification
Prajit Sengupta, Islem Rekik
https://arxiv.org/abs/2509.10510
A Collaborative Framework for Quantum Optimisation and Quantum Neural Networks: Credit Feature Selection and Image Classification
JiaNing Long, Xuechen Liang
https://arxiv.org/abs/2509.11110
UMind: A Unified Multitask Network for Zero-Shot M/EEG Visual Decoding
Chengjian Xu, Yonghao Song, Zelin Liao, Haochuan Zhang, Qiong Wang, Qingqing Zheng
https://arxiv.org/abs/2509.14772
Combining Audio and Non-Audio Inputs in Evolved Neural Networks for Ovenbird
Sergio Poo Hernandez, Vadim Bulitko, Erin Bayne
https://arxiv.org/abs/2509.10566 https://
Object-Centric Case-Based Reasoning via Argumentation
Gabriel de Olim Gaul, Adam Gould, Avinash Kori, Francesca Toni
https://arxiv.org/abs/2510.00185 https://
Sparse Representations Improve Adversarial Robustness of Neural Network Classifiers
Killian Steunou, Sigurd Saue, Th\'eo Druilhe
https://arxiv.org/abs/2509.21130 https://
Hy-Facial: Hybrid Feature Extraction by Dimensionality Reduction Methods for Enhanced Facial Expression Classification
Xinjin Li, Yu Ma, Kaisen Ye, Jinghan Cao, Minghao Zhou, Yeyang Zhou
https://arxiv.org/abs/2509.26614
Replaced article(s) found for eess.IV. https://arxiv.org/list/eess.IV/new
[1/1]:
- Brain Tumor Classification on MRI in Light of Molecular Markers
Liu, Yuan, Zeng, Tang, Zhang, Lin, Xu, Huang, Wang
Imaging Modalities-Based Classification for Lung Cancer Detection
Sajim Ahmed, Muhammad Zain Chaudhary, Muhammad Zohaib Chaudhary, Mahmoud Abbass, Ahmed Sherif, Mohammad Mahbubur Rahman Khan Mamun
https://arxiv.org/abs/2509.16254
WISE: Weak-Supervision-Guided Step-by-Step Explanations for Multimodal LLMs in Image Classification
Yiwen Jiang, Deval Mehta, Siyuan Yan, Yaling Shen, Zimu Wang, Zongyuan Ge
https://arxiv.org/abs/2509.17740
Intuitions of Machine Learning Researchers about Transfer Learning for Medical Image Classification
Yucheng Lu, Hubert Dariusz Zaj\k{a}c, Veronika Cheplygina, Amelia Jim\'enez-S\'anchez
https://arxiv.org/abs/2510.00902
Training-Free Synthetic Data Generation with Dual IP-Adapter Guidance
Luc Boudier, Loris Manganelli, Eleftherios Tsonis, Nicolas Dufour, Vicky Kalogeiton
https://arxiv.org/abs/2509.22635
DinoAtten3D: Slice-Level Attention Aggregation of DinoV2 for 3D Brain MRI Anomaly Classification
Fazle Rafsani, Jay Shah, Catherine D. Chong, Todd J. Schwedt, Teresa Wu
https://arxiv.org/abs/2509.12512
ERDE: Entropy-Regularized Distillation for Early-exit
Martial Guidez, Stefan Duffner, Yannick Alpou, Oscar R\"oth, Christophe Garcia
https://arxiv.org/abs/2510.04856 https:…
Achieving Fair Skin Lesion Detection through Skin Tone Normalization and Channel Pruning
Zihan Wei, Tapabrata Chakraborti
https://arxiv.org/abs/2509.22712 https://
Uncertainty-Aware Concept Bottleneck Models with Enhanced Interpretability
Haifei Zhang, Patrick Barry, Eduardo Brandao
https://arxiv.org/abs/2510.00773 https://
Stratify or Die: Rethinking Data Splits in Image Segmentation
Naga Venkata Sai Jitin Jami, Thomas Altstidl, Jonas Mueller, Jindong Li, Dario Zanca, Bjoern Eskofier, Heike Leutheuser
https://arxiv.org/abs/2509.21056
Crosslisted article(s) found for eess.IV. https://arxiv.org/list/eess.IV/new
[1/1]:
- HQCNN: A Hybrid Quantum-Classical Neural Network for Medical Image Classification
, Fahim, Paul, Hossain, Ahmed, Chakraborty
ThinkFake: Reasoning in Multimodal Large Language Models for AI-Generated Image Detection
Tai-Ming Huang, Wei-Tung Lin, Kai-Lung Hua, Wen-Huang Cheng, Junichi Yamagishi, Jun-Cheng Chen
https://arxiv.org/abs/2509.19841
DF-LLaVA: Unlocking MLLM's potential for Synthetic Image Detection via Prompt-Guided Knowledge Injection
Zhuokang Shen, Kaisen Zhang, Bohan Jia, Yuan Fang, Zhou Yu, Shaohui Lin
https://arxiv.org/abs/2509.14957
Dual-View Alignment Learning with Hierarchical-Prompt for Class-Imbalance Multi-Label Classification
Sheng Huang, Jiexuan Yan, Beiyan Liu, Bo Liu, Richang Hong
https://arxiv.org/abs/2509.17747
ClustViT: Clustering-based Token Merging for Semantic Segmentation
Fabio Montello, Ronja G\"uldenring, Lazaros Nalpantidis
https://arxiv.org/abs/2510.01948 https://
Leveraging Geometric Visual Illusions as Perceptual Inductive Biases for Vision Models
Haobo Yang, Minghao Guo, Dequan Yang, Wenyu Wang
https://arxiv.org/abs/2509.15156 https://…
UltraUPConvNet: A UPerNet- and ConvNeXt-Based Multi-Task Network for Ultrasound Tissue Segmentation and Disease Prediction
Zhi Chen
https://arxiv.org/abs/2509.11108 https://
Curriculum Multi-Task Self-Supervision Improves Lightweight Architectures for Onboard Satellite Hyperspectral Image Segmentation
Hugo Carlesso, Josiane Mothe, Radu Tudor Ionescu
https://arxiv.org/abs/2509.13229