
2025-08-20 10:18:30
GDNSQ: Gradual Differentiable Noise Scale Quantization for Low-bit Neural Networks
Sergey Salishev, Ian Akhremchik
https://arxiv.org/abs/2508.14004 https://
GDNSQ: Gradual Differentiable Noise Scale Quantization for Low-bit Neural Networks
Sergey Salishev, Ian Akhremchik
https://arxiv.org/abs/2508.14004 https://
Fully Spiking Actor-Critic Neural Network for Robotic Manipulation
Liwen Zhang, Heng Deng, Guanghui Sun
https://arxiv.org/abs/2508.12038 https://arxiv.org/…
Synchronization and semantization in deep spiking networks
Jonas Oberste-Frielinghaus, Anno C. Kurth, Julian G\"oltz, Laura Kriener, Junji Ito, Mihai A. Petrovici, Sonja Gr\"un
https://arxiv.org/abs/2508.12975
Beyond Semantic Understanding: Preserving Collaborative Frequency Components in LLM-based Recommendation
Minhao Wang, Yunhang He, Cong Xu, Zhangchi Zhu, Wei Zhang
https://arxiv.org/abs/2508.10312
Input Conditioned Layer Dropping in Speech Foundation Models
Abdul Hannan, Daniele Falavigna, Alessio Brutti
https://arxiv.org/abs/2507.07954 https://
Optimizing Fluid Antenna Configurations for Constructive Interference Precoding
Wenxuan Sun, Mingjie Shao, Luteng Zhu, Yao Ge, Tong Zhang, Zhi Liu
https://arxiv.org/abs/2507.11093
Error Exponents for Quantum Packing Problems via An Operator Layer Cake Theorem
Hao-Chung Cheng, Po-Chieh Liu
https://arxiv.org/abs/2507.06232 https://
What One Cannot, Two Can: Two-Layer Transformers Provably Represent Induction Heads on Any-Order Markov Chains
Chanakya Ekbote, Marco Bondaschi, Nived Rajaraman, Jason D. Lee, Michael Gastpar, Ashok Vardhan Makkuva, Paul Pu Liang
https://arxiv.org/abs/2508.07208
Fast Forward and Inverse Thermal Modeling for Parameter Estimation of Multi-Layer Composites -- Part II: Inverse Modeling and Applications
Gan Fu, Mitrofan Curti, Calina Ciuhu, Elena A. Lomonova
https://arxiv.org/abs/2507.06746
Ken Utilization Layer: Hebbian Replay Within a Student's Ken for Adaptive Knowledge Tracing
Grey Kuling, Marinka Zitnik
https://arxiv.org/abs/2507.00032
Implementing Neural Networks Over-the-Air via Reconfigurable Intelligent Surfaces
Meng Hua, Chenghong Bian, Haotian Wu, Deniz Gunduz
https://arxiv.org/abs/2508.01840 https://
AttZoom: Attention Zoom for Better Visual Features
Daniel DeAlcala, Aythami Morales, Julian Fierrez, Ruben Tolosana
https://arxiv.org/abs/2508.03625 https://
This https://arxiv.org/abs/2506.05588 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csNE_…
Understanding and Controlling Repetition Neurons and Induction Heads in In-Context Learning
Nhi Hoai Doan, Tatsuya Hiraoka, Kentaro Inui
https://arxiv.org/abs/2507.07810
Generalization performance of narrow one-hidden layer networks in the teacher-student setting
Jean Barbier, Federica Gerace, Alessandro Ingrosso, Clarissa Lauditi, Enrico M. Malatesta, Gibbs Nwemadji, Rodrigo P\'erez Ortiz
https://arxiv.org/abs/2507.00629
PatchDSU: Uncertainty Modeling for Out of Distribution Generalization in Keyword Spotting
Bronya Roni Chernyak, Yael Segal, Yosi Shrem, Joseph Keshet
https://arxiv.org/abs/2508.03190
This https://arxiv.org/abs/2503.14372 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_ees…
Trainable dynamical masking for readout-free optical computing
S. Bogdanov, E. Manuylovich, S. K. Turitsyn
https://arxiv.org/abs/2505.23464 https://…
Preprocessing Methods for Memristive Reservoir Computing for Image Recognition
Rishona Daniels, Duna Wattad, Ronny Ronen, David Saad, Shahar Kvatinsky
https://arxiv.org/abs/2506.05588
Emergent Semantics Beyond Token Embeddings: Transformer LMs with Frozen Visual Unicode Representations
A. Bochkov
https://arxiv.org/abs/2507.04886 https://…
Modeling the Path of Structural Strategic Deterrence: A Sand Table Simulation and Research Report on China's Military-Industrial Capability System against the United States Based on Rare Earth Supply Disconnection
Wei Meng
https://arxiv.org/abs/2505.21579
Liquid and solid layers in a thermal deep learning machine
Gang Huang, Lai Shun Chan, Hajime Yoshino, Ge Zhang, Yuliang Jin
https://arxiv.org/abs/2506.06789
ICWLM: A Multi-Task Wireless Large Model via In-Context Learning
Yuxuan Wen, Xiaoming Chen, Maojun Zhang, Zhaoyang Zhang
https://arxiv.org/abs/2507.18167 https://
Game-Theoretic Gradient Control for Robust Neural Network Training
Maria Zaitseva, Ivan Tomilov, Natalia Gusarova
https://arxiv.org/abs/2507.19143 https://…