Tootfinder

Opt-in global Mastodon full text search. Join the index!

@arXiv_csLG_bot@mastoxiv.page
2025-10-08 10:46:49

Quantifying the Accuracy-Interpretability Trade-Off in Concept-Based Sidechannel Models
David Debot, Giuseppe Marra
arxiv.org/abs/2510.05670

@arXiv_csRO_bot@mastoxiv.page
2025-09-08 09:20:00

DeGuV: Depth-Guided Visual Reinforcement Learning for Generalization and Interpretability in Manipulation
Tien Pham, Xinyun Chi, Khang Nguyen, Manfred Huber, Angelo Cangelosi
arxiv.org/abs/2509.04970

@arXiv_statME_bot@mastoxiv.page
2025-10-08 09:08:59

Sparse-Group Factor Analysis for High-Dimensional Time Series
Xin Wang, Xialu Liu
arxiv.org/abs/2510.05370 arxiv.org/pdf/2510.05370

@arXiv_csCV_bot@mastoxiv.page
2025-10-07 12:38:32

Visual Representations inside the Language Model
Benlin Liu, Amita Kamath, Madeleine Grunde-McLaughlin, Winson Han, Ranjay Krishna
arxiv.org/abs/2510.04819

@arXiv_csSE_bot@mastoxiv.page
2025-10-06 09:16:59

Mechanistic Interpretability of Code Correctness in LLMs via Sparse Autoencoders
Kriz Tahimic, Charibeth Cheng
arxiv.org/abs/2510.02917 arx…

@arXiv_csAI_bot@mastoxiv.page
2025-10-08 14:27:53

Replaced article(s) found for cs.AI. arxiv.org/list/cs.AI/new
[4/6]:
- Cross-Document Cross-Lingual NLI via RST-Enhanced Graph Fusion and Interpretability Prediction
Mengying Yuan, Wenhao Wang, Zixuan Wang, Yujie Huang, Kangli Wei, Fei Li, Chong Teng, Donghong Ji

@arXiv_csHC_bot@mastoxiv.page
2025-08-08 09:28:52

CWEFS: Brain volume conduction effects inspired channel-wise EEG feature selection for multi-dimensional emotion recognition
Xueyuan Xu, Wenjia Dong, Fulin Wei, Li Zhuo
arxiv.org/abs/2508.05228

@arXiv_mathOC_bot@mastoxiv.page
2025-10-07 09:35:32

Optimal Regularization Under Uncertainty: Distributional Robustness and Convexity Constraints
Oscar Leong, Eliza O'Reilly, Yong Sheng Soh
arxiv.org/abs/2510.03464

@arXiv_csSD_bot@mastoxiv.page
2025-10-07 08:25:32

D\'esentrelacement Fr\'equentiel Doux pour les Codecs Audio Neuronaux
Beno\^it Gini\`es, Xiaoyu Bie, Olivier Fercoq, Ga\"el Richard
arxiv.org/abs/2510.03741

@arXiv_eessSP_bot@mastoxiv.page
2025-09-08 08:58:10

KGRAG-SC: Knowledge Graph RAG-Assisted Semantic Communication
Dayu Fan, Rui Meng, Song Gao, Xiaodong Xu
arxiv.org/abs/2509.04801 arxiv.org/…

@arXiv_astrophIM_bot@mastoxiv.page
2025-10-08 08:30:39

Interpreting anomaly detection of SDSS spectra
Edgar Ortiz Manrique, M\'ed\'eric Boquien
arxiv.org/abs/2510.05235 arxiv.org/pdf/251…

@arXiv_csIR_bot@mastoxiv.page
2025-09-04 09:26:31

Enhancing Interpretability and Effectiveness in Recommendation with Numerical Features via Learning to Contrast the Counterfactual samples
Xiaoxiao Xu, Hao Wu, Wenhui Yu, Lantao Hu, Peng Jiang, Kun Gai
arxiv.org/abs/2509.03187

@arXiv_qbioNC_bot@mastoxiv.page
2025-10-07 09:06:32

Atlas-free Brain Network Transformer
Shuai Huang, Xuan Kan, James J. Lah, Deqiang Qiu
arxiv.org/abs/2510.03306 arxiv.org/pdf/2510.03306

@arXiv_astrophEP_bot@mastoxiv.page
2025-09-08 08:49:40

Identifying Exoplanets with Deep Learning: A CNN and RNN Classifier for Kepler DR25 and Candidate Vetting
Bibin Thomas, Vittal Bhat M, Salman Arafath Mohammed, Abdul Wase Mohammed, Adis Abebaw Dessalegn, Mohit Mittal
arxiv.org/abs/2509.04793

@arXiv_csLG_bot@mastoxiv.page
2025-10-07 13:07:52

TopInG: Topologically Interpretable Graph Learning via Persistent Rationale Filtration
Cheng Xin, Fan Xu, Xin Ding, Jie Gao, Jiaxin Ding
arxiv.org/abs/2510.05102

@arXiv_physicsaoph_bot@mastoxiv.page
2025-09-08 08:14:50

High-Resolution Global Land Surface Temperature Retrieval via a Coupled Mechanism-Machine Learning Framework
Tian Xie, Huanfeng Shen, Menghui Jiang, Juan-Carlos Jim\'enez-Mu\~noz, Jos\'e A. Sobrino, Huifang Li, Chao Zeng
arxiv.org/abs/2509.04991

@arXiv_csAI_bot@mastoxiv.page
2025-10-08 10:30:29

Refusal Falls off a Cliff: How Safety Alignment Fails in Reasoning?
Qingyu Yin, Chak Tou Leong, Linyi Yang, Wenxuan Huang, Wenjie Li, Xiting Wang, Jaehong Yoon, YunXing, XingYu, Jinjin Gu
arxiv.org/abs/2510.06036

@arXiv_quantph_bot@mastoxiv.page
2025-10-06 09:23:29

Amplitude-based Input Attribution in Quantum Learning via Integrated Gradients
Nicholas S. DiBrita, Jason Han, Younghyun Cho, Hengrui Luo, Tirthak Patel
arxiv.org/abs/2510.02497

@arXiv_condmatmtrlsci_bot@mastoxiv.page
2025-09-05 07:45:50

Combining feature-based approaches with graph neural networks and symbolic regression for synergistic performance and interpretability
Rog\'erio Almeida Gouv\^ea, Pierre-Paul De Breuck, Tatiane Pretto, Gian-Marco Rignanese, Marcos Jos\'e Leite dos Santos
arxiv.org/abs/2509.03547

@arXiv_csRO_bot@mastoxiv.page
2025-10-08 10:20:49

EmbodiedCoder: Parameterized Embodied Mobile Manipulation via Modern Coding Model
Zefu Lin, Rongxu Cui, Chen Hanning, Xiangyu Wang, Junjia Xu, Xiaojuan Jin, Chen Wenbo, Hui Zhou, Lue Fan, Wenling Li, Zhaoxiang Zhang
arxiv.org/abs/2510.06207

@arXiv_eessAS_bot@mastoxiv.page
2025-10-08 08:01:59

Teaching Machines to Speak Using Articulatory Control
Akshay Anand, Chenxu Guo, Cheol Jun Cho, Jiachen Lian, Gopala Anumanchipalli
arxiv.org/abs/2510.05619

@arXiv_mathOC_bot@mastoxiv.page
2025-08-08 09:27:42

Exact and Heuristic Algorithms for Constrained Biclustering
Antonio M. Sudoso
arxiv.org/abs/2508.05493 arxiv.org/pdf/2508.05493

@arXiv_csSD_bot@mastoxiv.page
2025-10-07 08:24:32

Soft Disentanglement in Frequency Bands for Neural Audio Codecs
Benoit Ginies, Xiaoyu Bie, Olivier Fercoq, Ga\"el Richard
arxiv.org/abs/2510.03735

@arXiv_csLG_bot@mastoxiv.page
2025-10-08 10:57:39

Learning from Failures: Understanding LLM Alignment through Failure-Aware Inverse RL
Nyal Patel, Matthieu Bou, Arjun Jagota, Satyapriya Krishna, Sonali Parbhoo
arxiv.org/abs/2510.06092

@arXiv_statME_bot@mastoxiv.page
2025-10-07 10:16:12

Beyond Regularization: Inherently Sparse Principal Component Analysis
Jan O. Bauer
arxiv.org/abs/2510.03729 arxiv.org/pdf/2510.03729

@arXiv_csAI_bot@mastoxiv.page
2025-09-08 07:36:09

An Approach to Grounding AI Model Evaluations in Human-derived Criteria
Sasha Mitts
arxiv.org/abs/2509.04676 arxiv.org/pdf/2509.04676

@arXiv_csCL_bot@mastoxiv.page
2025-09-05 10:01:01

A Comprehensive Survey on Trustworthiness in Reasoning with Large Language Models
Yanbo Wang, Yongcan Yu, Jian Liang, Ran He
arxiv.org/abs/2509.03871

@arXiv_qbiobm_bot@mastoxiv.page
2025-10-06 08:31:39

SAE-RNA: A Sparse Autoencoder Model for Interpreting RNA Language Model Representations
Taehan Kim, Sangdae Nam
arxiv.org/abs/2510.02734 ar…

@arXiv_csLG_bot@mastoxiv.page
2025-09-08 10:08:50

Deep Reinforcement Learning for Ranking Utility Tuning in the Ad Recommender System at Pinterest
Xiao Yang, Mehdi Ben Ayed, Longyu Zhao, Fan Zhou, Yuchen Shen, Abe Engle, Jinfeng Zhuang, Ling Leng, Jiajing Xu, Charles Rosenberg, Prathibha Deshikachar
arxiv.org/abs/2509.05292

@arXiv_statML_bot@mastoxiv.page
2025-09-30 09:09:41

Sparse Deep Additive Model with Interactions: Enhancing Interpretability and Predictability
Yi-Ting Hung, Li-Hsiang Lin, Vince D. Calhoun
arxiv.org/abs/2509.23068

@arXiv_statAP_bot@mastoxiv.page
2025-10-07 08:54:52

Statistical Crime Linkage: Evaluating approaches within the Covenant for Using AI in Policing
Nathan A. Judd, Amy V. Tansell, Benjamin Costello, Liam Leonard, Jessica Woodhams, Rowland G. Seymour
arxiv.org/abs/2510.03730

@arXiv_csLG_bot@mastoxiv.page
2025-10-08 10:29:29

QDeepGR4J: Quantile-based ensemble of deep learning and GR4J hybrid rainfall-runoff models for extreme flow prediction with uncertainty quantification
Arpit Kapoor, Rohitash Chandra
arxiv.org/abs/2510.05453

@arXiv_csCV_bot@mastoxiv.page
2025-10-02 10:51:11

Uncertainty-Aware Concept Bottleneck Models with Enhanced Interpretability
Haifei Zhang, Patrick Barry, Eduardo Brandao
arxiv.org/abs/2510.00773

@arXiv_csMM_bot@mastoxiv.page
2025-08-25 07:35:20

Beyond Interpretability: Exploring the Comprehensibility of Adaptive Video Streaming through Large Language Models
Lianchen Jia, Chaoyang Li, Ziqi Yuan, Jiahui Chen, Tianchi Huang, Jiangchuan Liu, Lifeng Sun
arxiv.org/abs/2508.16448

@arXiv_csSI_bot@mastoxiv.page
2025-09-04 07:46:40

On the Optimization of Methods for Establishing Well-Connected Communities
Mohammad Dindoost, Oliver Alvarado Rodriguez, Bartosz Bryg, Minhyuk Park, George Chacko, Tandy Warnow, David A. Bader
arxiv.org/abs/2509.02590

@arXiv_csCY_bot@mastoxiv.page
2025-10-03 07:39:30

An Analysis of the New EU AI Act and A Proposed Standardization Framework for Machine Learning Fairness
Mike Teodorescu, Yongxu Sun, Haren N. Bhatia, Christos Makridis
arxiv.org/abs/2510.01281

@arXiv_eessIV_bot@mastoxiv.page
2025-10-03 08:01:21

GFSR-Net: Guided Focus via Segment-Wise Relevance Network for Interpretable Deep Learning in Medical Imaging
Jhonatan Contreras, Thomas Bocklitz
arxiv.org/abs/2510.01919

@arXiv_csSE_bot@mastoxiv.page
2025-10-01 10:19:47

Protocode: Prototype-Driven Interpretability for Code Generation in LLMs
Krishna Vamshi Bodla, Haizhao Yang
arxiv.org/abs/2509.25247 arxiv.…

@arXiv_mathST_bot@mastoxiv.page
2025-09-04 08:12:01

Reduce-Rank Matrix Integer-Valued Autoregressive Model
Kaiyan Cui, Tianyun Guo, Suping Wang
arxiv.org/abs/2509.03338 arxiv.org/pdf/2509.033…

@arXiv_eessSY_bot@mastoxiv.page
2025-10-03 08:34:11

Comparative Field Deployment of Reinforcement Learning and Model Predictive Control for Residential HVAC
Ozan Baris Mulayim, Elias N. Pergantis, Levi D. Reyes Premer, Bingqing Chen, Guannan Qu, Kevin J. Kircher, Mario Berg\'es
arxiv.org/abs/2510.01475

@arXiv_csNI_bot@mastoxiv.page
2025-09-03 09:09:33

SpliDT: Partitioned Decision Trees for Scalable Stateful Inference at Line Rate
Murayyiam Parvez, Annus Zulfiqar, Roman Beltiukov, Shir Landau Feibish, Walter Willinger, Arpit Gupta, Muhammad Shahbaz
arxiv.org/abs/2509.00397

@arXiv_statML_bot@mastoxiv.page
2025-09-29 10:04:17

Smoothing-Based Conformal Prediction for Balancing Efficiency and Interpretability
Mingyi Zheng, Hongyu Jiang, Yizhou Lu, Jiaye Teng
arxiv.org/abs/2509.22529

@arXiv_csCR_bot@mastoxiv.page
2025-09-29 09:53:48

Backdoor Attribution: Elucidating and Controlling Backdoor in Language Models
Miao Yu, Zhenhong Zhou, Moayad Aloqaily, Kun Wang, Biwei Huang, Stephen Wang, Yueming Jin, Qingsong Wen
arxiv.org/abs/2509.21761

@arXiv_qbioQM_bot@mastoxiv.page
2025-10-01 08:16:17

Commutative algebra neural network reveals genetic origins of diseases
JunJie Wee, Faisal Suwayyid, Mushal Zia, Hongsong Feng, Yuta Hozumi, Guo-Wei Wei
arxiv.org/abs/2509.26566

@arXiv_csDS_bot@mastoxiv.page
2025-09-30 10:30:01

Efficient Sketching and Nearest Neighbor Search Algorithms for Sparse Vector Sets
Sebastian Bruch, Franco Maria Nardini, Cosimo Rulli, Rossano Venturini
arxiv.org/abs/2509.24815

@arXiv_csCL_bot@mastoxiv.page
2025-08-15 10:15:22

eDIF: A European Deep Inference Fabric for Remote Interpretability of LLM
Irma Heithoff. Marc Guggenberger, Sandra Kalogiannis, Susanne Mayer, Fabian Maag, Sigurd Schacht, Carsten Lanquillon
arxiv.org/abs/2508.10553

@arXiv_csLG_bot@mastoxiv.page
2025-10-01 11:56:57

The Loss Kernel: A Geometric Probe for Deep Learning Interpretability
Maxwell Adam, Zach Furman, Jesse Hoogland
arxiv.org/abs/2509.26537 ar…

@arXiv_csAI_bot@mastoxiv.page
2025-09-05 09:49:31

A Foundation Model for Chest X-ray Interpretation with Grounded Reasoning via Online Reinforcement Learning
Qika Lin, Yifan Zhu, Bin Pu, Ling Huang, Haoran Luo, Jingying Ma, Zhen Peng, Tianzhe Zhao, Fangzhi Xu, Jian Zhang, Kai He, Zhonghong Ou, Swapnil Mishra, Mengling Feng
arxiv.org/abs/2509.03906

@arXiv_eessIV_bot@mastoxiv.page
2025-09-03 09:40:23

DRetNet: A Novel Deep Learning Framework for Diabetic Retinopathy Diagnosis
Idowu Paul Okuwobi, Jingyuan Liu, Jifeng Wan, Jiaojiao Jiang
arxiv.org/abs/2509.01072

@arXiv_csSD_bot@mastoxiv.page
2025-08-25 07:43:30

Beyond Transcription: Mechanistic Interpretability in ASR
Neta Glazer, Yael Segal-Feldman, Hilit Segev, Aviv Shamsian, Asaf Buchnick, Gill Hetz, Ethan Fetaya, Joseph Keshet, Aviv Navon
arxiv.org/abs/2508.15882

@arXiv_statML_bot@mastoxiv.page
2025-09-04 09:10:31

Bayesian Additive Regression Trees for functional ANOVA model
Seokhun Park, Insung Kong, Yongdai Kim
arxiv.org/abs/2509.03317 arxiv.org/pdf…

@arXiv_csRO_bot@mastoxiv.page
2025-09-03 13:40:53

AutoDrive-R$^2$: Incentivizing Reasoning and Self-Reflection Capacity for VLA Model in Autonomous Driving
Zhenlong Yuan, Jing Tang, Jinguo Luo, Rui Chen, Chengxuan Qian, Lei Sun, Xiangxiang Chu, Yujun Cai, Dapeng Zhang, Shuo Li
arxiv.org/abs/2509.01944

@arXiv_csCV_bot@mastoxiv.page
2025-08-26 12:30:37

Assessing the Noise Robustness of Class Activation Maps: A Framework for Reliable Model Interpretability
Syamantak Sarkar, Revoti P. Bora, Bhupender Kaushal, Sudhish N George, Kiran Raja
arxiv.org/abs/2508.18154

@arXiv_csCL_bot@mastoxiv.page
2025-10-02 10:47:11

Interpreting Language Models Through Concept Descriptions: A Survey
Nils Feldhus, Laura Kopf
arxiv.org/abs/2510.01048 arxiv.org/pdf/2510.01…

@arXiv_csCY_bot@mastoxiv.page
2025-09-30 10:21:31

Open Opportunities in AI Safety, Alignment, and Ethics (AI SAE)
Dylan Waldner
arxiv.org/abs/2509.24065 arxiv.org/pdf/2509.24065

@arXiv_eessSP_bot@mastoxiv.page
2025-09-01 08:48:42

Machine Intelligence on the Edge: Interpretable Cardiac Pattern Localisation Using Reinforcement Learning
Haozhe Tian, Qiyu Rao, Nina Moutonnet, Pietro Ferraro, Danilo Mandic
arxiv.org/abs/2508.21652

@arXiv_csSE_bot@mastoxiv.page
2025-10-02 10:05:31

Analyzing Latent Concepts in Code Language Models
Arushi Sharma, Vedant Pungliya, Christopher J. Quinn, Ali Jannesari
arxiv.org/abs/2510.00476

@arXiv_csLG_bot@mastoxiv.page
2025-09-05 10:29:01

Interpretable Clustering with Adaptive Heterogeneous Causal Structure Learning in Mixed Observational Data
Wenrui Li, Qinghao Zhang, Xiaowo Wang
arxiv.org/abs/2509.04415

@arXiv_astrophIM_bot@mastoxiv.page
2025-10-02 09:02:01

Architecturally Constrained Solutions to Ill-Conditioned Problems in QUBIC
Leonora Kardum
arxiv.org/abs/2510.00090 arxiv.org/pdf/2510.00090…

@arXiv_statME_bot@mastoxiv.page
2025-08-29 08:09:41

Interpretable Scalar-on-Image Linear Regression Models via the Generalized Dantzig Selector
Sijia Liao, Xiaoxiao Sun, Ning Hao, Hao Helen Zhang
arxiv.org/abs/2508.20278

@arXiv_csAI_bot@mastoxiv.page
2025-10-03 10:28:51

A Neuro-Fuzzy System for Interpretable Long-Term Stock Market Forecasting
Miha O\v{z}bot, Igor \v{S}krjanc, Vitomir \v{S}truc
arxiv.org/abs/2510.00960

@arXiv_csCL_bot@mastoxiv.page
2025-09-19 10:28:51

V-SEAM: Visual Semantic Editing and Attention Modulating for Causal Interpretability of Vision-Language Models
Qidong Wang, Junjie Hu, Ming Jiang
arxiv.org/abs/2509.14837

@arXiv_csCV_bot@mastoxiv.page
2025-08-18 09:54:10

AIM: Amending Inherent Interpretability via Self-Supervised Masking
Eyad Alshami, Shashank Agnihotri, Bernt Schiele, Margret Keuper
arxiv.org/abs/2508.11502

@arXiv_eessIV_bot@mastoxiv.page
2025-10-02 08:40:00

DPsurv: Dual-Prototype Evidential Fusion for Uncertainty-Aware and Interpretable Whole-Slide Image Survival Prediction
Yucheng Xing, Ling Huang, Jingying Ma, Ruping Hong, Jiangdong Qiu, Pei Liu, Kai He, Huazhu Fu, Mengling Feng
arxiv.org/abs/2510.00053

@arXiv_statML_bot@mastoxiv.page
2025-10-02 08:49:01

Bayesian Neural Networks for Functional ANOVA model
Seokhun Park, Choeun Kim, Jihu Lee, Yunseop Shin, Insung Kong, Yongdai Kim
arxiv.org/abs/2510.00545

@arXiv_csAI_bot@mastoxiv.page
2025-10-03 10:37:41

Typed Chain-of-Thought: A Curry-Howard Framework for Verifying LLM Reasoning
Elija Perrier
arxiv.org/abs/2510.01069 arxiv.org/pdf/2510.0106…

@arXiv_csLG_bot@mastoxiv.page
2025-09-04 10:30:11

EvolveSignal: A Large Language Model Powered Coding Agent for Discovering Traffic Signal Control Algorithms
Leizhen Wang, Peibo Duan, Hao Wang, Yue Wang, Jian Xu, Nan Zheng, Zhenliang Ma
arxiv.org/abs/2509.03335

@arXiv_csSE_bot@mastoxiv.page
2025-09-23 07:40:08

Constrained Co-evolutionary Metamorphic Differential Testing for Autonomous Systems with an Interpretability Approach
Hossein Yousefizadeh, Shenghui Gu, Lionel C. Briand, Ali Nasr
arxiv.org/abs/2509.16478

@arXiv_csRO_bot@mastoxiv.page
2025-09-01 09:32:42

Learning Agile Gate Traversal via Analytical Optimal Policy Gradient
Tianchen Sun, Bingheng Wang, Longbin Tang, Yichao Gao, Lin Zhao
arxiv.org/abs/2508.21592

@arXiv_csLG_bot@mastoxiv.page
2025-09-04 10:31:51

Initialization Schemes for Kolmogorov-Arnold Networks: An Empirical Study
Spyros Rigas, Dhruv Verma, Georgios Alexandridis, Yixuan Wang
arxiv.org/abs/2509.03417

@arXiv_csCV_bot@mastoxiv.page
2025-09-01 09:51:22

Unfolding Framework with Complex-Valued Deformable Attention for High-Quality Computer-Generated Hologram Generation
Haomiao Zhang, Zhangyuan Li, Yanling Piao, Zhi Li, Xiaodong Wang, Miao Cao, Xiongfei Su, Qiang Song, Xin Yuan
arxiv.org/abs/2508.21657

@arXiv_csSD_bot@mastoxiv.page
2025-10-01 09:52:27

MUSE-Explainer: Counterfactual Explanations for Symbolic Music Graph Classification Models
Baptiste Hilaire, Emmanouil Karystinaios, Gerhard Widmer
arxiv.org/abs/2509.26521

@arXiv_csCL_bot@mastoxiv.page
2025-10-01 11:18:47

Latent Thinking Optimization: Your Latent Reasoning Language Model Secretly Encodes Reward Signals in its Latent Thoughts
Hanwen Du, Yuxin Dong, Xia Ning
arxiv.org/abs/2509.26314

@arXiv_csAI_bot@mastoxiv.page
2025-10-02 10:37:21

A Neuro-Fuzzy System for Interpretable Long-Term Stock Market Forecasting
Miha O\v{z}bot, Igor \v{S}krjanc, Vitomir \v{S}truc
arxiv.org/abs/2510.00960

@arXiv_csLG_bot@mastoxiv.page
2025-09-04 10:32:01

LINKER: Learning Interactions Between Functional Groups and Residues With Chemical Knowledge-Enhanced Reasoning and Explainability
Phuc Pham, Viet Thanh Duy Nguyen, Truong-Son Hy
arxiv.org/abs/2509.03425

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 11:37:57

Interpret, prune and distill Donut : towards lightweight VLMs for VQA on document
Adnan Ben Mansour, Ayoub Karine, David Naccache
arxiv.org/abs/2509.26235

@arXiv_statME_bot@mastoxiv.page
2025-09-30 11:28:01

A more interpretable regression model for count data with excess of zeros
Gustavo H. A. Pereira, Jeremias Le\~ao, Manoel Santos-Neto, Jianwen Cai
arxiv.org/abs/2509.24916

@arXiv_csAI_bot@mastoxiv.page
2025-10-02 10:42:01

Typed Chain-of-Thought: A Curry-Howard Framework for Verifying LLM Reasoning
Elija Perrier
arxiv.org/abs/2510.01069 arxiv.org/pdf/2510.0106…

@arXiv_csCL_bot@mastoxiv.page
2025-09-30 14:10:52

Hyperdimensional Probe: Decoding LLM Representations via Vector Symbolic Architectures
Marco Bronzini, Carlo Nicolini, Bruno Lepri, Jacopo Staiano, Andrea Passerini
arxiv.org/abs/2509.25045

@arXiv_csLG_bot@mastoxiv.page
2025-09-23 12:47:50

Medical priority fusion: achieving dual optimization of sensitivity and interpretability in nipt anomaly detection
Xiuqi Ge, Zhibo Yao, Yaosong Du
arxiv.org/abs/2509.17924

@arXiv_csAI_bot@mastoxiv.page
2025-09-23 12:06:20

Improving Large Language Models Function Calling and Interpretability via Guided-Structured Templates
Hy Dang, Tianyi Liu, Zhuofeng Wu, Jingfeng Yang, Haoming Jiang, Tao Yang, Pei Chen, Zhengyang Wang, Helen Wang, Huasheng Li, Bing Yin, Meng Jiang
arxiv.org/abs/2509.18076

@arXiv_csCV_bot@mastoxiv.page
2025-09-29 11:25:17

UML-CoT: Structured Reasoning and Planning with Unified Modeling Language for Robotic Room Cleaning
Hongyu Chen, Guangrun Wang
arxiv.org/abs/2509.22628

@arXiv_csSD_bot@mastoxiv.page
2025-09-11 08:56:43

Behind the Scenes: Mechanistic Interpretability of LoRA-adapted Whisper for Speech Emotion Recognition
Yujian Ma, Jinqiu Sang, Ruizhe Li
arxiv.org/abs/2509.08454

@arXiv_csCL_bot@mastoxiv.page
2025-08-18 09:40:30

Model Interpretability and Rationale Extraction by Input Mask Optimization
Marc Brinner, Sina Zarriess
arxiv.org/abs/2508.11388 arxiv.org/p…

@arXiv_csCV_bot@mastoxiv.page
2025-09-29 11:17:07

Explaining multimodal LLMs via intra-modal token interactions
Jiawei Liang, Ruoyu Chen, Xianghao Jiao, Siyuan Liang, Shiming Liu, Qunli Zhang, Zheng Hu, Xiaochun Cao
arxiv.org/abs/2509.22415

@arXiv_csLG_bot@mastoxiv.page
2025-09-11 10:11:53

Interpretability as Alignment: Making Internal Understanding a Design Principle
Aadit Sengupta, Pratinav Seth, Vinay Kumar Sankarapu
arxiv.org/abs/2509.08592

@arXiv_csLG_bot@mastoxiv.page
2025-10-02 11:09:21

Privacy Preserved Federated Learning with Attention-Based Aggregation for Biometric Recognition
Kassahun Azezew, Minyechil Alehegn, Tsega Asresa, Bitew Mekuria, Tizazu Bayh, Ayenew Kassie, Amsalu Tesema, Animut Embiyale
arxiv.org/abs/2510.01113

@arXiv_csCV_bot@mastoxiv.page
2025-09-29 11:20:17

Where MLLMs Attend and What They Rely On: Explaining Autoregressive Token Generation
Ruoyu Chen, Xiaoqing Guo, Kangwei Liu, Siyuan Liang, Shiming Liu, Qunli Zhang, Hua Zhang, Xiaochun Cao
arxiv.org/abs/2509.22496

@arXiv_csAI_bot@mastoxiv.page
2025-09-29 10:38:27

REMA: A Unified Reasoning Manifold Framework for Interpreting Large Language Model
Bo Li, Guanzhi Deng, Ronghao Chen, Junrong Yue, Shuo Zhang, Qinghua Zhao, Linqi Song, Lijie Wen
arxiv.org/abs/2509.22518

@arXiv_csCL_bot@mastoxiv.page
2025-10-15 10:33:01

Analysing Moral Bias in Finetuned LLMs through Mechanistic Interpretability
Bianca Raimondi, Daniela Dalbagno, Maurizio Gabbrielli
arxiv.org/abs/2510.12229

@arXiv_csCV_bot@mastoxiv.page
2025-09-16 12:39:17

CLAIRE: A Dual Encoder Network with RIFT Loss and Phi-3 Small Language Model Based Interpretability for Cross-Modality Synthetic Aperture Radar and Optical Land Cover Segmentation
Debopom Sutradhar, Arefin Ittesafun Abian, Mohaimenul Azam Khan Raiaan, Reem E. Mohamed, Sheikh Izzal Azid, Sami Azam
arxiv.org/abs/2509.11952

@arXiv_csCL_bot@mastoxiv.page
2025-08-27 10:18:33

Interpretable by AI Mother Tongue: Native Symbolic Reasoning in Neural Models
Hung Ming Liu
arxiv.org/abs/2508.18988 arxiv.org/pdf/2508.189…

@arXiv_csLG_bot@mastoxiv.page
2025-09-30 14:37:21

Towards Understanding the Shape of Representations in Protein Language Models
Kosio Beshkov, Anders Malthe-S{\o}renssen
arxiv.org/abs/2509.24895

@arXiv_csAI_bot@mastoxiv.page
2025-08-28 09:20:01

Tracking World States with Language Models: State-Based Evaluation Using Chess
Romain Harang, Jason Naradowsky, Yaswitha Gujju, Yusuke Miyao
arxiv.org/abs/2508.19851

@arXiv_csCL_bot@mastoxiv.page
2025-09-26 10:18:51

RLBFF: Binary Flexible Feedback to bridge between Human Feedback & Verifiable Rewards
Zhilin Wang, Jiaqi Zeng, Olivier Delalleau, Ellie Evans, Daniel Egert, Hoo-Chang Shin, Felipe Soares, Yi Dong, Oleksii Kuchaiev
arxiv.org/abs/2509.21319

@arXiv_csLG_bot@mastoxiv.page
2025-09-29 11:32:27

(Sometimes) Less is More: Mitigating the Complexity of Rule-based Representation for Interpretable Classification
Luca Bergamin, Roberto Confalonieri, Fabio Aiolli
arxiv.org/abs/2509.22384

@arXiv_csCV_bot@mastoxiv.page
2025-09-26 10:19:41

MOSS-ChatV: Reinforcement Learning with Process Reasoning Reward for Video Temporal Reasoning
Sicheng Tao, Jungang Li, Yibo Yan, Junyan Zhang, Yubo Gao, Hanqian Li, ShuHang Xun, Yuxuan Fan, Hong Chen, Jianxiang He, Xuming Hu
arxiv.org/abs/2509.21113

@arXiv_csLG_bot@mastoxiv.page
2025-09-29 11:32:17

Enhancing Credit Risk Prediction: A Meta-Learning Framework Integrating Baseline Models, LASSO, and ECOC for Superior Accuracy
Haibo Wang, Lutfu S. Sua, Jun Huang, Figen Balo, Burak Dolar
arxiv.org/abs/2509.22381

@arXiv_csCV_bot@mastoxiv.page
2025-08-27 10:24:33

Interpretable Decision-Making for End-to-End Autonomous Driving
Mona Mirzaie, Bodo Rosenhahn
arxiv.org/abs/2508.18898 arxiv.org/pdf/2508.18…

@arXiv_csLG_bot@mastoxiv.page
2025-10-15 07:47:51

Think as a Doctor: An Interpretable AI Approach for ICU Mortality Prediction
Qingwen Li, Xiaohang Zhao, Xiao Han, Hailiang Huang, Lanjuan Liu
arxiv.org/abs/2510.11745