Tootfinder

Opt-in global Mastodon full text search. Join the index!

@arXiv_physicsaoph_bot@mastoxiv.page
2025-09-05 08:18:41

Finetuning AI Foundation Models to Develop Subgrid-Scale Parameterizations: A Case Study on Atmospheric Gravity Waves
Aman Gupta, Aditi Sheshadri, Sujit Roy, Johannes Schmude, Vishal Gaur, Wei Ji Leong, Manil Maskey, Rahul Ramachandran
arxiv.org/abs/2509.03816

@arXiv_csCV_bot@mastoxiv.page
2025-10-03 09:39:01

FideDiff: Efficient Diffusion Model for High-Fidelity Image Motion Deblurring
Xiaoyang Liu, Zhengyan Zhou, Zihang Xu, Jiezhang Cao, Zheng Chen, Yulun Zhang
arxiv.org/abs/2510.01641

@arXiv_csCL_bot@mastoxiv.page
2025-09-03 14:23:13

chDzDT: Word-level morphology-aware language model for Algerian social media text
Abdelkrime Aries
arxiv.org/abs/2509.01772 arxiv.org/pdf/2…

@arXiv_csDC_bot@mastoxiv.page
2025-09-03 08:53:33

LobRA: Multi-tenant Fine-tuning over Heterogeneous Data
Sheng Lin, Fangcheng Fu, Haoyang Li, Hao Ge, Xuanyu Wang, Jiawen Niu, Yaofeng Tu, Bin Cui
arxiv.org/abs/2509.01193

@arXiv_eessAS_bot@mastoxiv.page
2025-09-03 10:17:13

Speaker-Conditioned Phrase Break Prediction for Text-to-Speech with Phoneme-Level Pre-trained Language Model
Dong Yang, Yuki Saito, Takaaki Saeki, Tomoki Koriyama, Wataru Nakata, Detai Xin, Hiroshi Saruwatari
arxiv.org/abs/2509.00675

@arXiv_eessSP_bot@mastoxiv.page
2025-09-03 11:50:13

Fluid Antenna Port Prediction based on Large Language Models
Yali Zhang, Haifan Yin, Weidong Li, Emil Bjornson, Merouane Debbah
arxiv.org/abs/2509.01121

@arXiv_csCR_bot@mastoxiv.page
2025-09-30 11:22:01

StolenLoRA: Exploring LoRA Extraction Attacks via Synthetic Data
Yixu Wang, Yan Teng, Yingchun Wang, Xingjun Ma
arxiv.org/abs/2509.23594 ar…

@arXiv_csSE_bot@mastoxiv.page
2025-08-28 09:37:11

Smart Contract Intent Detection with Pre-trained Programming Language Model
Youwei Huang, Jianwen Li, Sen Fang, Yao Li, Peng Yang, Bin Hu, Tao Zhang
arxiv.org/abs/2508.20086

@arXiv_csLG_bot@mastoxiv.page
2025-10-02 11:12:31

Dirichlet-Prior Shaping: Guiding Expert Specialization in Upcycled MoEs
Leyla Mirvakhabova, Babak Ehteshami Bejnordi, Gaurav Kumar, Hanxue Liang, Wanru Zhao, Paul Whatmough
arxiv.org/abs/2510.01185

@arXiv_csCV_bot@mastoxiv.page
2025-10-03 10:14:51

Leveraging Prior Knowledge of Diffusion Model for Person Search
Giyeol Kim, Sooyoung Yang, Jihyong Oh, Myungjoo Kang, Chanho Eom
arxiv.org/abs/2510.01841

@arXiv_physicschemph_bot@mastoxiv.page
2025-09-03 10:11:33

Migration as a Probe: A Generalizable Benchmark Framework for Specialist vs. Generalist Machine-Learned Force Fields in Doped Materials
Yi Cao, Paulette Clancy
arxiv.org/abs/2509.00090

@arXiv_csRO_bot@mastoxiv.page
2025-08-19 10:59:00

Improving Pre-Trained Vision-Language-Action Policies with Model-Based Search
Cyrus Neary, Omar G. Younis, Artur Kuramshin, Ozgur Aslan, Glen Berseth
arxiv.org/abs/2508.12211

@arXiv_csSD_bot@mastoxiv.page
2025-10-01 08:50:17

LTA-L2S: Lexical Tone-Aware Lip-to-Speech Synthesis for Mandarin with Cross-Lingual Transfer Learning
Kang Yang, Yifan Liang, Fangkun Liu, Zhenping Xie, Chengshi Zheng
arxiv.org/abs/2509.25670

@arXiv_csCV_bot@mastoxiv.page
2025-10-02 10:54:01

NSARM: Next-Scale Autoregressive Modeling for Robust Real-World Image Super-Resolution
Xiangtao Kong, Rongyuan Wu, Shuaizheng Liu, Lingchen Sun, Lei Zhang
arxiv.org/abs/2510.00820

@arXiv_eessAS_bot@mastoxiv.page
2025-09-03 10:53:13

MixedG2P-T5: G2P-free Speech Synthesis for Mixed-script texts using Speech Self-Supervised Learning and Language Model
Joonyong Park, Daisuke Saito, Nobuaki Minematsu
arxiv.org/abs/2509.01391

@arXiv_eessIV_bot@mastoxiv.page
2025-09-30 08:51:31

Achieving Fair Skin Lesion Detection through Skin Tone Normalization and Channel Pruning
Zihan Wei, Tapabrata Chakraborti
arxiv.org/abs/2509.22712

@arXiv_csIR_bot@mastoxiv.page
2025-09-30 10:21:51

Investigating Multi-layer Representations for Dense Passage Retrieval
Zhongbin Xie, Thomas Lukasiewicz
arxiv.org/abs/2509.23861 arxiv.org/p…

@arXiv_statML_bot@mastoxiv.page
2025-09-29 09:49:28

Universal Inverse Distillation for Matching Models with Real-Data Supervision (No GANs)
Nikita Kornilov, David Li, Tikhon Mavrin, Aleksei Leonov, Nikita Gushchin, Evgeny Burnaev, Iaroslav Koshelev, Alexander Korotin
arxiv.org/abs/2509.22459

@arXiv_csAI_bot@mastoxiv.page
2025-09-22 08:08:01

MicroRCA-Agent: Microservice Root Cause Analysis Method Based on Large Language Model Agents
Pan Tang, Shixiang Tang, Huanqi Pu, Zhiqing Miao, Zhixing Wang
arxiv.org/abs/2509.15635

@arXiv_csLG_bot@mastoxiv.page
2025-09-30 14:45:31

TR2-D2: Tree Search Guided Trajectory-Aware Fine-Tuning for Discrete Diffusion
Sophia Tang, Yuchen Zhu, Molei Tao, Pranam Chatterjee
arxiv.org/abs/2509.25171

@arXiv_csHC_bot@mastoxiv.page
2025-08-15 07:47:02

Pre-trained Transformer-models using chronic invasive electrophysiology for symptom decoding without patient-individual training
Timon Merk, Saeed Salehi, Richard M. Koehler, Qiming Cui, Maria Olaru, Amelia Hahn, Nicole R. Provenza, Simon Little, Reza Abbasi-Asl, Phil A. Starr, Wolf-Julian Neumann
arxiv.org/abs/2508.10160

@arXiv_csCL_bot@mastoxiv.page
2025-09-30 14:10:25

Ultra-Fast Language Generation via Discrete Diffusion Divergence Instruct
Haoyang Zheng, Xinyang Liu, Cindy Xiangrui Kong, Nan Jiang, Zheyuan Hu, Weijian Luo, Wei Deng, Guang Lin
arxiv.org/abs/2509.25035

@arXiv_csIT_bot@mastoxiv.page
2025-09-26 07:36:11

A Deep Transfer Learning-Based Low-overhead Beam Prediction in Vehicle Communications
Zhiqiang Xiao, Yuwen Cao, Mondher Bouazizi, Tomoaki Ohtsuki, Shahid Mumtaz
arxiv.org/abs/2509.20659

@arXiv_econEM_bot@mastoxiv.page
2025-09-26 08:17:31

Recidivism and Peer Influence with LLM Text Embeddings in Low Security Correctional Facilities
Shanjukta Nath, Jiwon Hong, Jae Ho Chang, Keith Warren, Subhadeep Paul
arxiv.org/abs/2509.20634

@mia@hcommons.social
2025-10-09 08:17:27

Looking forward to reading this! “Making BERT Feel at Home. Modelling Domestic Space in 19th-Century British and Irish Fiction”, Journal of Computational Literary Studies4(1). doi: doi.org/10.48694/jcls.4164
By Guhr, S., Monaco, J., Sherman, A., Warner, M. & Algee-Hewitt, M

@arXiv_csCV_bot@mastoxiv.page
2025-09-26 10:25:41

A Sentinel-3 foundation model for ocean colour
Geoffrey Dawson, Remy Vandaele, Andrew Taylor, David Moffat, Helen Tamura-Wicks, Sarah Jackson, Rosie Lickorish, Paolo Fraccaro, Hywel Williams, Chunbo Luo, Anne Jones
arxiv.org/abs/2509.21273

@arXiv_csCL_bot@mastoxiv.page
2025-09-01 09:40:42

Efficient Code Embeddings from Code Generation Models
Daria Kryvosheieva, Saba Sturua, Michael G\"unther, Scott Martens, Han Xiao
arxiv.org/abs/2508.21290

@arXiv_condmatmtrlsci_bot@mastoxiv.page
2025-08-13 08:00:22

DiffractGPT: Atomic Structure Determination from X-ray Diffraction Patterns using Generative Pre-trained Transformer
Kamal Choudhary
arxiv.org/abs/2508.08349

@arXiv_physicsgeoph_bot@mastoxiv.page
2025-09-30 09:17:21

U-SWIFT: A Unified Surface Wave Inversion Framework with Transformer via Normalization of Dispersion Curves
Tianjian Cheng, Hongrui Xu, Jiayu Feng, Xiongyu Hu, Chaofan Yao
arxiv.org/abs/2509.24872

@arXiv_csRO_bot@mastoxiv.page
2025-08-28 08:06:51

LaVA-Man: Learning Visual Action Representations for Robot Manipulation
Chaoran Zhu, Hengyi Wang, Yik Lung Pang, Changjae Oh
arxiv.org/abs/2508.19391

@arXiv_eessSY_bot@mastoxiv.page
2025-09-16 11:12:16

BERT4beam: Large AI Model Enabled Generalized Beamforming Optimization
Yuhang Li, Yang Lu, Wei Chen, Bo Ai, Zhiguo Ding, Dusit Niyato
arxiv.org/abs/2509.11056

@arXiv_csIR_bot@mastoxiv.page
2025-08-29 07:54:51

ELIXIR: Efficient and LIghtweight model for eXplaIning Recommendations
Ben Kabongo, Vincent Guigue, Pirmin Lemberger
arxiv.org/abs/2508.20312

@arXiv_csGR_bot@mastoxiv.page
2025-10-07 08:52:42

Paris: A Decentralized Trained Open-Weight Diffusion Model
Zhiying Jiang, Raihan Seraj, Marcos Villagra, Bidhan Roy
arxiv.org/abs/2510.03434

@arXiv_csAR_bot@mastoxiv.page
2025-08-26 08:09:56

LLMulator: Generalizable Cost Modeling for Dataflow Accelerators with Input-Adaptive Control Flow
Kaiyan Chang, Wenlong Zhu, Shengwen Liang, Huawei Li, Ying Wang
arxiv.org/abs/2508.17826

@arXiv_astrophGA_bot@mastoxiv.page
2025-09-16 11:02:07

Radio Galaxy Zoo: Morphological classification by Fanaroff-Riley designation using self-supervised pre-training
Nutthawara Buatthaisong, Inigo Val Slijepcevic, Anna M. M. Scaife, Micah Bowles, Andrew Hopkins, Devina Mohan, Stanislav S Shabala, O. Ivy Wong
arxiv.org/abs/2509.11988

@arXiv_physicsplasmph_bot@mastoxiv.page
2025-09-17 09:02:10

FusionMAE: large-scale pretrained model to optimize and simplify diagnostic and control of fusion plasma
Zongyu Yang, Zhenghao Yang, Wenjing Tian, Jiyuan Li, Xiang Sun, Guohui Zheng, Songfen Liu, Niannian Wu, Rongpeng Li, Zhaohe Xu, Bo Li, Zhongbing Shi, Zhe Gao, Wei Chen, Xiaoquan Ji, Min Xu, Wulyu Zhong
arxiv.org/abs/2509.12945…

@arXiv_csAI_bot@mastoxiv.page
2025-09-22 07:33:01

Knowledge-Driven Hallucination in Large Language Models: An Empirical Study on Process Modeling
Humam Kourani, Anton Antonov, Alessandro Berti, Wil M. P. van der Aalst
arxiv.org/abs/2509.15336

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 11:42:07

FLOWER: A Flow-Matching Solver for Inverse Problems
Mehrsa Pourya, Bassam El Rawas, Michael Unser
arxiv.org/abs/2509.26287 arxiv.org/pdf/25…

@arXiv_csLG_bot@mastoxiv.page
2025-08-21 10:15:00

Cross-Modality Controlled Molecule Generation with Diffusion Language Model
Yunzhe Zhang, Yifei Wang, Khanh Vinh Nguyen, Pengyu Hong
arxiv.org/abs/2508.14748

@arXiv_csSD_bot@mastoxiv.page
2025-08-19 09:25:39

Exploring Self-Supervised Audio Models for Generalized Anomalous Sound Detection
Bing Han, Anbai Jiang, Xinhu Zheng, Wei-Qiang Zhang, Jia Liu, Pingyi Fan, Yanmin Qian
arxiv.org/abs/2508.12230

@arXiv_csCL_bot@mastoxiv.page
2025-08-25 10:00:40

Jet-Nemotron: Efficient Language Model with Post Neural Architecture Search
Yuxian Gu, Qinghao Hu, Shang Yang, Haocheng Xi, Junyu Chen, Song Han, Han Cai
arxiv.org/abs/2508.15884

@arXiv_csRO_bot@mastoxiv.page
2025-08-26 11:26:46

FlowVLA: Thinking in Motion with a Visual Chain of Thought
Zhide Zhong, Haodong Yan, Junfeng Li, Xiangchen Liu, Xin Gong, Wenxuan Song, Jiayi Chen, Haoang Li
arxiv.org/abs/2508.18269

@arXiv_quantph_bot@mastoxiv.page
2025-09-09 11:51:22

Classical Neural Networks on Quantum Devices via Tensor Network Disentanglers: A Case Study in Image Classification
Borja Aizpurua, Sukhbinder Singh, Rom\'an Or\'us
arxiv.org/abs/2509.06653

@arXiv_csSI_bot@mastoxiv.page
2025-08-12 08:42:03

Anatomy of a Machine Learning Ecosystem: 2 Million Models on Hugging Face
Benjamin Laufer, Hamidah Oderinwale, Jon Kleinberg
arxiv.org/abs/2508.06811

@arXiv_csCL_bot@mastoxiv.page
2025-08-29 10:18:21

Multi-Lingual Implicit Discourse Relation Recognition with Multi-Label Hierarchical Learning
Nelson Filipe Costa, Leila Kosseim
arxiv.org/abs/2508.20712

@arXiv_eessIV_bot@mastoxiv.page
2025-08-25 09:04:20

Time-Aware One Step Diffusion Network for Real-World Image Super-Resolution
Tainyi Zhang, Zheng-Peng Duan, Peng-Tao Jiang, Bo Li, Ming-Ming Cheng, Chun-Le Guo, Chongyi Li
arxiv.org/abs/2508.16557

@arXiv_csLG_bot@mastoxiv.page
2025-09-18 09:58:41

Towards a Physics Foundation Model
Florian Wiesner, Matthias Wessling, Stephen Baek
arxiv.org/abs/2509.13805 arxiv.org/pdf/2509.13805

@arXiv_csSE_bot@mastoxiv.page
2025-08-22 08:43:40

An Empirical Study of Knowledge Distillation for Code Understanding Tasks
Ruiqi Wang, Zezhou Yang, Cuiyun Gao, Xin Xia, Qing Liao
arxiv.org/abs/2508.15423

@arXiv_csCV_bot@mastoxiv.page
2025-09-30 15:02:46

DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder
Junyu Chen, Wenkun He, Yuchao Gu, Yuyang Zhao, Jincheng Yu, Junsong Chen, Dongyun Zou, Yujun Lin, Zhekai Zhang, Muyang Li, Haocheng Xi, Ligeng Zhu, Enze Xie, Song Han, Han Cai
arxiv.org/abs/2509.25182

@arXiv_csSD_bot@mastoxiv.page
2025-09-22 09:53:31

SONAR: Self-Distilled Continual Pre-training for Domain Adaptive Audio Representation
Yizhou Zhang, Yuan Gao, Wangjin Zhou, Zicheng Yuan, Keisuke Imoto, Tatsuya Kawahara
arxiv.org/abs/2509.15703

@arXiv_csGR_bot@mastoxiv.page
2025-08-18 07:37:50

StyleMM: Stylized 3D Morphable Face Model via Text-Driven Aligned Image Translation
Seungmi Lee, Kwan Yun, Junyong Noh
arxiv.org/abs/2508.11203

@arXiv_csAI_bot@mastoxiv.page
2025-10-07 12:14:52

Look-ahead Reasoning with a Learned Model in Imperfect Information Games
Ond\v{r}ej Kub\'i\v{c}ek, Viliam Lis\'y
arxiv.org/abs/2510.05048

@arXiv_statML_bot@mastoxiv.page
2025-08-13 09:32:02

In-Context Learning as Nonparametric Conditional Probability Estimation: Risk Bounds and Optimality
Chenrui Liu, Falong Tan, Chuanlong Xie, Yicheng Zeng, Lixing Zhu
arxiv.org/abs/2508.08673

@arXiv_csCL_bot@mastoxiv.page
2025-08-22 10:01:01

Exploiting Vocabulary Frequency Imbalance in Language Model Pre-training
Woojin Chung, Jeonghoon Kim
arxiv.org/abs/2508.15390 arxiv.org/pdf…

@arXiv_csCV_bot@mastoxiv.page
2025-09-30 15:02:56

PAD3R: Pose-Aware Dynamic 3D Reconstruction from Casual Videos
Ting-Hsuan Liao, Haowen Liu, Yiran Xu, Songwei Ge, Gengshan Yang, Jia-Bin Huang
arxiv.org/abs/2509.25183

@arXiv_physicsgeoph_bot@mastoxiv.page
2025-08-28 11:51:44

Replaced article(s) found for physics.geo-ph. arxiv.org/list/physics.geo-ph/
[1/1]:
- PRIME-DP: Pre-trained Integrated Model for Earthquake Data Processing
Ziye Yu, Yuqi Cai, Weitao Wang, Yanru An, Lu Li, Yueyang Xia, Yunpeng Zhang

@arXiv_csIR_bot@mastoxiv.page
2025-08-11 09:31:39

LMAR: Language Model Augmented Retriever for Domain-specific Knowledge Indexing
Yao Zhao, Yantian Ding, Zhiyue Zhang, Dapeng Yao, Yanxun Xu
arxiv.org/abs/2508.05672

@arXiv_csLG_bot@mastoxiv.page
2025-08-27 10:35:33

Composition and Alignment of Diffusion Models using Constrained Learning
Shervin Khalafi, Ignacio Hounie, Dongsheng Ding, Alejandro Ribeiro
arxiv.org/abs/2508.19104

@arXiv_csCR_bot@mastoxiv.page
2025-10-15 10:03:01

IP-Augmented Multi-Modal Malicious URL Detection Via Token-Contrastive Representation Enhancement and Multi-Granularity Fusion
Ye Tian, Yanqiu Yu, Liangliang Song, Zhiquan Liu, Yanbin Wang, Jianguo Sun
arxiv.org/abs/2510.12395

@arXiv_csCL_bot@mastoxiv.page
2025-08-25 10:03:40

Seeing is Believing: Emotion-Aware Audio-Visual Language Modeling for Expressive Speech Generation
Weiting Tan, Jiachen Lian, Hirofumi Inaguma, Paden Tomasello, Philipp Koehn, Xutai Ma
arxiv.org/abs/2508.16188

@arXiv_csCV_bot@mastoxiv.page
2025-09-16 12:43:17

FS-SAM2: Adapting Segment Anything Model 2 for Few-Shot Semantic Segmentation via Low-Rank Adaptation
Bernardo Forni, Gabriele Lombardi, Federico Pozzi, Mirco Planamente
arxiv.org/abs/2509.12105

@arXiv_csSE_bot@mastoxiv.page
2025-08-20 11:48:46

Replaced article(s) found for cs.SE. arxiv.org/list/cs.SE/new
[1/1]:
- "I see models being a whole other thing": An Empirical Study of Pre-Trained Model Naming Conventi...
Wenxin Jiang, Mingyu Kim, Chingwo Cheung, Heesoo Kim, George K. Thiruvathukal, James C. Davis

@arXiv_csRO_bot@mastoxiv.page
2025-09-23 08:38:20

FiLM-Nav: Efficient and Generalizable Navigation via VLM Fine-tuning
Naoki Yokoyama, Sehoon Ha
arxiv.org/abs/2509.16445 arxiv.org/pdf/2509.…

@arXiv_csLG_bot@mastoxiv.page
2025-10-09 10:43:11

Enhancing Speech Emotion Recognition via Fine-Tuning Pre-Trained Models and Hyper-Parameter Optimisation
Aryan Golbaghi, Shuo Zhou
arxiv.org/abs/2510.07052

@arXiv_eessIV_bot@mastoxiv.page
2025-08-20 09:44:50

UNICON: UNIfied CONtinual Learning for Medical Foundational Models
Mohammad Areeb Qazi, Munachiso S Nwadike, Ibrahim Almakky, Mohammad Yaqub, Numan Saeed
arxiv.org/abs/2508.14024

@arXiv_csSD_bot@mastoxiv.page
2025-09-24 09:21:54

Scalable Evaluation for Audio Identification via Synthetic Latent Fingerprint Generation
Aditya Bhattacharjee, Marco Pasini, Emmanouil Benetos
arxiv.org/abs/2509.18620

@arXiv_csIR_bot@mastoxiv.page
2025-08-14 08:15:52

Personalized Product Search Ranking: A Multi-Task Learning Approach with Tabular and Non-Tabular Data
Lalitesh Morishetti, Abhay Kumar, Jonathan Scott, Kaushiki Nag, Gunjan Sharma, Shanu Vashishtha, Rahul Sridhar, Rohit Chatter, Kannan Achan
arxiv.org/abs/2508.09636

@arXiv_csLG_bot@mastoxiv.page
2025-08-22 10:16:41

Amortized In-Context Mixed Effect Transformer Models: A Zero-Shot Approach for Pharmacokinetics
C\'esar Ali Ojeda Marin, Wilhelm Huisinga, Purity Kavwele, Niklas Hartung
arxiv.org/abs/2508.15659

@arXiv_csCL_bot@mastoxiv.page
2025-09-17 09:16:00

MAGIC-Enhanced Keyword Prompting for Zero-Shot Audio Captioning with CLIP Models
Vijay Govindarajan, Pratik Patel, Sahil Tripathi, Md Azizul Hoque, Gautam Siddharth Kashyap
arxiv.org/abs/2509.12591

@arXiv_condmatmtrlsci_bot@mastoxiv.page
2025-09-11 09:55:43

Benchmarking CHGNet Universal Machine Learning Interatomic Potential Against DFT and EXAFS: Case of Layered WS2 and MoS2
Pjotrs \v{Z}guns, Inga Pudza, Alexei Kuzmin
arxiv.org/abs/2509.08498

@arXiv_csLG_bot@mastoxiv.page
2025-09-10 10:29:21

General Demographic Foundation Models for Enhancing Predictive Performance Across Diseases
Li-Chin Chen, Ji-Tian Sheu, Yuh-Jue Chuang
arxiv.org/abs/2509.07330

@arXiv_csCR_bot@mastoxiv.page
2025-10-08 09:39:59

Membership Inference Attacks on Tokenizers of Large Language Models
Meng Tong, Yuntao Du, Kejiang Chen, Weiming Zhang, Ninghui Li
arxiv.org/abs/2510.05699

@arXiv_csCV_bot@mastoxiv.page
2025-08-22 10:05:51

Transfer learning optimization based on evolutionary selective fine tuning
Jacinto Colan, Ana Davila, Yasuhisa Hasegawa
arxiv.org/abs/2508.15367

@arXiv_csCL_bot@mastoxiv.page
2025-09-19 10:36:21

Can maiBERT Speak for Maithili?
Sumit Yadav, Raju Kumar Yadav, Utsav Maskey, Gautam Siddharth Kashyap Md Azizul Hoque, Ganesh Gautam
arxiv.org/abs/2509.15048

@arXiv_csSD_bot@mastoxiv.page
2025-08-21 09:12:59

ECHO: Frequency-aware Hierarchical Encoding for Variable-length Signal
Yucong Zhang, Juan Liu, Ming Li
arxiv.org/abs/2508.14689 arxiv.org/p…

@arXiv_csRO_bot@mastoxiv.page
2025-08-11 09:37:49

Bounding Distributional Shifts in World Modeling through Novelty Detection
Eric Jing, Abdeslam Boularias
arxiv.org/abs/2508.06096 arxiv.org…

@arXiv_csLG_bot@mastoxiv.page
2025-10-09 10:40:41

Sharpness-Aware Data Generation for Zero-shot Quantization
Dung Hoang-Anh, Cuong Pham Trung Le, Jianfei Cai, Thanh-Toan Do
arxiv.org/abs/2510.07018

@arXiv_csCV_bot@mastoxiv.page
2025-09-24 15:33:21

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[2/6]:
- SCoT: Straight Consistent Trajectory for Pre-Trained Diffusion Model Distillations
Zhangkai Wu, Xuhui Fan, Hongyu Wu, Longbing Cao

@arXiv_csSE_bot@mastoxiv.page
2025-10-08 09:37:39

Mellum: Production-Grade in-IDE Contextual Code Completion with Multi-File Project Understanding
Nikita Pavlichenko, Iurii Nazarov, Ivan Dolgov, Ekaterina Garanina, Dmitry Ustalov, Ivan Bondyrev, Kseniia Lysaniuk, Evgeniia Vu, Kirill Chekmenev, Joseph Shtok, Yaroslav Golubev, Anton Semenkin, Uladzislau Sazanovich
arxiv.org/abs/2510…

@arXiv_eessAS_bot@mastoxiv.page
2025-09-19 09:46:01

Mitigating data replication in text-to-audio generative diffusion models through anti-memorization guidance
Francisco Messina, Francesca Ronchini, Luca Comanducci, Paolo Bestagini, Fabio Antonacci
arxiv.org/abs/2509.14934

@arXiv_csCL_bot@mastoxiv.page
2025-09-18 10:16:51

Canary-1B-v2 & Parakeet-TDT-0.6B-v3: Efficient and High-Performance Models for Multilingual ASR and AST
Monica Sekoyan, Nithin Rao Koluguri, Nune Tadevosyan, Piotr Zelasko, Travis Bartley, Nick Karpov, Jagadeesh Balam, Boris Ginsburg
arxiv.org/abs/2509.14128

@arXiv_csLG_bot@mastoxiv.page
2025-08-21 10:10:00

Adaptively Robust LLM Inference Optimization under Prediction Uncertainty
Zixi Chen, Yinyu Ye, Zijie Zhou
arxiv.org/abs/2508.14544 arxiv.or…

@arXiv_csCV_bot@mastoxiv.page
2025-09-22 10:36:11

SegDINO3D: 3D Instance Segmentation Empowered by Both Image-Level and Object-Level 2D Features
Jinyuan Qu, Hongyang Li, Xingyu Chen, Shilong Liu, Yukai Shi, Tianhe Ren, Ruitao Jing, Lei Zhang
arxiv.org/abs/2509.16098

@arXiv_csRO_bot@mastoxiv.page
2025-10-13 09:04:40

CDE: Concept-Driven Exploration for Reinforcement Learning
Le Mao, Andrew H. Liu, Renos Zabounidis, Zachary Kingston, Joseph Campbell
arxiv.org/abs/2510.08851

@arXiv_csSD_bot@mastoxiv.page
2025-09-17 09:14:10

More Similar than Dissimilar: Modeling Annotators for Cross-Corpus Speech Emotion Recognition
James Tavernor, Emily Mower Provost
arxiv.org/abs/2509.12295

@arXiv_csCL_bot@mastoxiv.page
2025-09-19 10:18:51

HARNESS: Lightweight Distilled Arabic Speech Foundation Models
Vrunda N. sukhadia, Shammur Absar Chowdhury
arxiv.org/abs/2509.14689 arxiv.o…

@arXiv_csCV_bot@mastoxiv.page
2025-09-09 12:30:52

BIR-Adapter: A Low-Complexity Diffusion Model Adapter for Blind Image Restoration
Cem Eteke, Alexander Griessel, Wolfgang Kellerer, Eckehard Steinbach
arxiv.org/abs/2509.06904

@arXiv_csLG_bot@mastoxiv.page
2025-08-20 10:05:00

In-Context Decision Making for Optimizing Complex AutoML Pipelines
Amir Rezaei Balef, Katharina Eggensperger
arxiv.org/abs/2508.13657 arxiv…

@arXiv_eessAS_bot@mastoxiv.page
2025-09-19 08:45:01

SpeechOp: Inference-Time Task Composition for Generative Speech Processing
Justin Lovelace, Rithesh Kumar, Jiaqi Su, Ke Chen, Kilian Q Weinberger, Zeyu Jin
arxiv.org/abs/2509.14298

@arXiv_csCV_bot@mastoxiv.page
2025-09-15 09:59:21

Grad-CL: Source Free Domain Adaptation with Gradient Guided Feature Disalignment
Rini Smita Thakur, Rajeev Ranjan Dwivedi, Vinod K Kurmi
arxiv.org/abs/2509.10134

@arXiv_csLG_bot@mastoxiv.page
2025-10-14 13:41:38

Representation-Based Exploration for Language Models: From Test-Time to Post-Training
Jens Tuyls, Dylan J. Foster, Akshay Krishnamurthy, Jordan T. Ash
arxiv.org/abs/2510.11686

@arXiv_csCL_bot@mastoxiv.page
2025-09-18 08:49:31

Sparse Neurons Carry Strong Signals of Question Ambiguity in LLMs
Zhuoxuan Zhang, Jinhao Duan, Edward Kim, Kaidi Xu
arxiv.org/abs/2509.13664

@arXiv_csLG_bot@mastoxiv.page
2025-08-12 12:07:33

Towards Unveiling Predictive Uncertainty Vulnerabilities in the Context of the Right to Be Forgotten
Wei Qian, Chenxu Zhao, Yangyi Li, Wenqian Ye, Mengdi Huai
arxiv.org/abs/2508.07458

@arXiv_eessAS_bot@mastoxiv.page
2025-10-07 10:06:42

Enhancing Speaker Verification with w2v-BERT 2.0 and Knowledge Distillation guided Structured Pruning
Ze Li, Ming Cheng, Ming Li
arxiv.org/abs/2510.04213

@arXiv_csLG_bot@mastoxiv.page
2025-10-15 10:51:11

CoRA: Covariate-Aware Adaptation of Time Series Foundation Models
Guo Qin, Zhi Chen, Yong Liu, Zhiyuan Shi, Haixuan Liu, Xiangdong Huang, Jianmin Wang, Mingsheng Long
arxiv.org/abs/2510.12681

@arXiv_csLG_bot@mastoxiv.page
2025-08-15 10:07:52

Projected Coupled Diffusion for Test-Time Constrained Joint Generation
Hao Luan, Yi Xian Goh, See-Kiong Ng, Chun Kai Ling
arxiv.org/abs/2508.10531

@arXiv_csCV_bot@mastoxiv.page
2025-09-11 08:11:03

Two-Stage Swarm Intelligence Ensemble Deep Transfer Learning (SI-EDTL) for Vehicle Detection Using Unmanned Aerial Vehicles
Zeinab Ghasemi Darehnaei, Mohammad Shokouhifar, Hossein Yazdanjouei, S. M. J. Rastegar Fatemi
arxiv.org/abs/2509.08026

@arXiv_csCV_bot@mastoxiv.page
2025-10-10 16:31:59

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[3/5]:
- VisionTS : Cross-Modal Time Series Foundation Model with Continual Pre-trained Vision Backbones
Lefei Shen, Mouxiang Chen, Xu Liu, Han Fu, Xiaoxue Ren, Jianling Sun, Zhuo Li, Chenghao Liu

@arXiv_csCV_bot@mastoxiv.page
2025-09-08 09:47:20

Efficient Video-to-Audio Generation via Multiple Foundation Models Mapper
Gehui Chen, Guan'an Wang, Xiaowen Huang, Jitao Sang
arxiv.org/abs/2509.04957