
2025-06-06 09:56:30
This https://arxiv.org/abs/2212.02658 has been replaced.
link: https://scholar.google.com/scholar?q=a
This https://arxiv.org/abs/2505.18570 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csLG_…
AMD says it has acquired the team behind AI inference chip developer Untether AI, a day after announcing it acquired AI software optimization startup Brium (Dylan Martin/CRN)
https://www.crn.com/news/components-periph
Membership Inference Attacks on Sequence Models
Lorenzo Rossi, Michael Aerni, Jie Zhang, Florian Tramèr
https://arxiv.org/abs/2506.05126 https://
FlowSpec: Continuous Pipelined Speculative Decoding for Efficient Distributed LLM Inference
Xing Liu, Lizhuo Luo, Ming Tang, Chao Huang
https://arxiv.org/abs/2507.02620
Deep learning inference with the #EventHorizonTelescope I. Calibration improvements and a comprehensive synthetic data library / II. The ZINGULARITY framework for Bayesian artificial neural networks / III. ZINGULARITY results from the 2017 observations and predictions for future array expansions: https://www.aanda.org/articles/aa/full_html/2025/06/aa53784-25/aa53784-25.html / https://www.aanda.org/articles/aa/full_html/2025/06/aa53785-25/aa53785-25.html / https://www.aanda.org/articles/aa/full_html/2025/06/aa53786-25/aa53786-25.html -> Self-learning neural network cracks iconic black holes: https://www.astronomie.nl/nieuws/en/self-learning-neural-network-cracks-iconic-black-holes-4528
VeFIA: An Efficient Inference Auditing Framework for Vertical Federated Collaborative Software
Chung-ju Huang, Ziqi Zhang, Yinggui Wang, Binghui Wang, Tao Wei, Leye Wang
https://arxiv.org/abs/2507.02376
Lipschitz stability for Bayesian inference in porous medium tissue growth models
Tomasz Dębiec, Piotr Gwiazda, Błażej Miasojedow, Katarzyna Ryszewska, Zuzanna Szymańska, Aneta Wróblewska-Kamińska
https://arxiv.org/abs/2506.04769
Classification of Extremal Dependence in Financial Markets via Bootstrap Inference
Qian Hui, Sidney I. Resnick, Tiandong Wang
https://arxiv.org/abs/2506.04656
Differentiable Fuzzy Cosmic-Web for Field Level Inference
P. Rosselló, F.-S. Kitaura, D. Forero-Sánchez, F. Sinigaglia, G. Favole
https://arxiv.org/abs/2506.03969
Tokyo-based Sakana AI details a new Monte Carlo tree search-based technique that lets multiple LLMs cooperate on a single task, outperforming individual models (Ben Dickson/VentureBeat)
https://venturebeat.com/ai/sakana-ais-
Active inference as a unified model of collision avoidance behavior in human drivers
Julian F. Schumann, Johan Engstroem, Leif Johnson, Matthew O'Kelly, Joao Messias, Jens Kober, Arkady Zgonnikov
https://arxiv.org/abs/2506.02215
This https://arxiv.org/abs/2505.24502 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_qu…
Amortized variational transdimensional inference
Laurence Davies, Dan Mackinlay, Rafael Oliveira, Scott A. Sisson
https://arxiv.org/abs/2506.04749 https://…
Reconstructing North Korea's Plutonium Production History with Bayesian Inference-Based Reprocessing Waste Analysis
Benjamin Jung, Johannes Bosse, Malte Göttsche
https://arxiv.org/abs/2506.03865
Inverse design for robust inference in integrated computational spectrometry
Wenchao Ma, Raphaël Pestourie, Zin Lin, Steven G. Johnson
https://arxiv.org/abs/2506.02194
This https://arxiv.org/abs/2506.02814 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csDC_…
Bayesian Doubly Robust Causal Inference via Posterior Coupling
Shunichiro Orihara, Tomotaka Momozaki, Shonosuke Sugasawa
https://arxiv.org/abs/2506.04868 h…
Generator Based Inference (GBI)
Chi Lung Cheng, Ranit Das, Runze Li, Radha Mastandrea, Vinicius Mikuni, Benjamin Nachman, David Shih, Gup Singh
https://arxiv.org/abs/2506.00119
This https://arxiv.org/abs/2505.15380 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csSD_…
This https://arxiv.org/abs/2504.01759 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_ees…
This https://arxiv.org/abs/2505.23655 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCR_…
Exact operator inference with minimal data
Henrik Rosenberger, Benjamin Sanderse, Giovanni Stabile
https://arxiv.org/abs/2506.01244 https://
This https://arxiv.org/abs/2502.00805 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_…
This https://arxiv.org/abs/2503.18617 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_…
This https://arxiv.org/abs/2409.14202 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_eco…
This https://arxiv.org/abs/2504.10667 has been replaced.
link: https://scholar.google.com/scholar?q=a
This https://arxiv.org/abs/2505.24293 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csLG_…
This https://arxiv.org/abs/2505.19931 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_ees…
Reappraising the Elatina series: Solar dynamo clocking and inference of orbital periods
F. Stefani, T. Weier, G. M. Horstmann, G. Mamatsashvili
https://arxiv.org/abs/2506.02628
Breaking the HBM Bit Cost Barrier: Domain-Specific ECC for AI Inference Infrastructure
Rui Xie, Asad Ul Haq, Yunhua Fang, Linsen Ma, Sanchari Sen, Swagath Venkataramani, Liu Liu, Tong Zhang
https://arxiv.org/abs/2507.02654
The Spurious Factor Dilemma: Robust Inference in Heavy-Tailed Elliptical Factor Models
Jiang Hu, Jiahui Xie, Yangchun Zhang, Wang Zhou
https://arxiv.org/abs/2506.05116
This https://arxiv.org/abs/2411.13280 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_qbi…
Pyrefly vs. ty: Comparing Python’s Two New Rust-Based Type Checkers
#types
This https://arxiv.org/abs/2504.15268 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_qfi…
EARN: Efficient Inference Acceleration for LLM-based Generative Recommendation by Register Tokens
Chaoqun Yang, Xinyu Lin, Wenjie Wang, Yongqi Li, Teng Sun, Xianjing Han, Tat-Seng Chua
https://arxiv.org/abs/2507.00715
Evaluation of "As-Intended" Vehicle Dynamics using the Active Inference Framework
Kazuharu Kidera, Takuma Miyaguchi, Hideyoshi Yanagisawa
https://arxiv.org/abs/2506.00035
Dissecting the Impact of Mobile DVFS Governors on LLM Inference Performance and Energy Efficiency
Zongpu Zhang, Pranab Dash, Y. Charlie Hu, Qiang Xu, Jian Li, Haibing Guan
https://arxiv.org/abs/2507.02135
QuickSilver -- Speeding up LLM Inference through Dynamic Token Halting, KV Skipping, Contextual Token Fusion, and Adaptive Matryoshka Quantization
Danush Khanna, Aditya Kumar Guru, Srivarshinee Sridhar, Zidan Ahmed, Rubhav Bahirwani, Meetu Malhotra, Vinija Jain, Aman Chadha, Amitava Das, Kripabandhu Ghosh
https://arxiv.org/abs/2…
This https://arxiv.org/abs/2502.03956 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csLO_…
Adaptive Configuration Selection for Multi-Model Inference Pipelines in Edge Computing
Jinhao Sheng, Zhiqing Tang, Jianxiong Guo, Tian Wang
https://arxiv.org/abs/2506.02814
This https://arxiv.org/abs/2505.24765 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_qu…
A Survey of LLM Inference Systems
James Pan, Guoliang Li
https://arxiv.org/abs/2506.21901 https://arxiv.org/pdf/2506.21901
This https://arxiv.org/abs/2405.00295 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csGT_…
Everybody complaining about getting hammered with #AI traffic seems to think that these are crawlers scraping for training data.
How likely is it that this is a complete misconception and this is all inference time?
Most public companies give their crawlers and RAG agents different user agent strings. But what about security services trawling through their data?
Privacy Leaks by Adversaries: Adversarial Iterations for Membership Inference Attack
Jing Xue, Zhishen Sun, Haishan Ye, Luo Luo, Xiangyu Chang, Ivor Tsang, Guang Dai
https://arxiv.org/abs/2506.02711
LLM-Driven Treatment Effect Estimation Under Inference Time Text Confounding
Yuchen Ma, Dennis Frauen, Jonas Schweisthal, Stefan Feuerriegel
https://arxiv.org/abs/2507.02843
Neural simulation-based inference of the Higgs trilinear self-coupling via off-shell Higgs production
Aishik Ghosh, Maximilian Griese, Ulrich Haisch, Tae Hyoun Park
https://arxiv.org/abs/2507.02032
Variational Inference for Latent Variable Models in High Dimensions
Chenyang Zhong, Sumit Mukherjee, Bodhisattva Sen
https://arxiv.org/abs/2506.01893 https…
Simulation-Efficient Cosmological Inference with Multi-Fidelity SBI
Leander Thiele, Adrian E. Bayer, Naoya Takeishi
https://arxiv.org/abs/2507.00514 https:…
Parallel CPU-GPU Execution for LLM Inference on Constrained GPUs
Jiakun Fan, Yanglin Zhang, Xiangchen Li, Dimitrios S. Nikolopoulos
https://arxiv.org/abs/2506.03296
Grapheme-Coherent Phonemic and Prosodic Annotation of Speech by Implicit and Explicit Grapheme Conditioning
Hien Ohnaka, Yuma Shirahata, Byeongseon Park, Ryuichi Yamamoto
https://arxiv.org/abs/2506.04527
This https://arxiv.org/abs/2505.14884 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csLG_…
Information-Optimal Sensing and Control in High-Intensity Laser Experiments
A. Döpp, C. Eberle, J. Esslinger, S. Howard, F. Irshad, J. Schroeder, N. Weisse, S. Karsch
https://arxiv.org/abs/2506.04946
This https://arxiv.org/abs/2405.05969 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_…
CLONE: Customizing LLMs for Efficient Latency-Aware Inference at the Edge
Chunlin Tian, Xinpeng Qin, Kahou Tam, Li Li, Zijian Wang, Yuanzhe Zhao, Minglei Zhang, Chengzhong Xu
https://arxiv.org/abs/2506.02847
Combining Type Inference and Automated Unit Test Generation for Python
Lukas Krodinger, Stephan Lukasczyk, Gordon Fraser
https://arxiv.org/abs/2507.01477 h…
This https://arxiv.org/abs/2506.01969 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csDC_…
Randomization Inference with Sample Attrition
Xinran Li, Peizan Sheng, Zeyang Yu
https://arxiv.org/abs/2507.00795 https://arxiv.org/p…
Towards Better Attribute Inference Vulnerability Measures
Paul Francis, David Wagner
https://arxiv.org/abs/2507.01710 https://arxiv.o…
This https://arxiv.org/abs/2505.07802 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csRO_…
Quantum Bayesian inference with Suport vector states for intrusion detection
Nayema Mridha, Garrv Sipani, Eva R Gaarder, Shah Haque, Radhika Kuttala, Binay P Akhouri, Mohamad M Al Zein, Eric Howard
https://arxiv.org/abs/2507.00403
This https://arxiv.org/abs/2406.10554 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_sta…
At the edge of Donsker's Theorem: Asymptotics of multiscale scan statistics
Johann Köhne, Fabian Mies
https://arxiv.org/abs/2506.05112 https://
Bayesian inference of the magnetic field and chemical potential on holographic jet quenching in heavy-ion collisions
Liqiang Zhu, Zhan Gao, Weiyao Ke, Hanzhong Zhang
https://arxiv.org/abs/2506.00340
FlashMLA-ETAP: Efficient Transpose Attention Pipeline for Accelerating MLA Inference on NVIDIA H20 GPUs
Pencuo Zeren, Qiuming Luo, Rui Mao, Chang Kong
https://arxiv.org/abs/2506.01969
This https://arxiv.org/abs/2501.16007 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCR_…
System-performance and cost modeling of Large Language Model training and inference
Wenzhe Guo, Joyjit Kundu, Uras Tos, Weijiang Kong, Giuliano Sisto, Timon Evenblij, Manu Perumkunnil
https://arxiv.org/abs/2507.02456
Simulation-Based Inference for Adaptive Experiments
Brian M Cho, Aurélien Bibaut, Nathan Kallus
https://arxiv.org/abs/2506.02881 https://
This https://arxiv.org/abs/2409.18858 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCR_…
Causal Inference for Aggregated Treatment
Carolina Caetano, Gregorio Caetano, Brantly Callaway, Derek Dyal
https://arxiv.org/abs/2506.22885 https://…
This https://arxiv.org/abs/2501.08524 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_…
This https://arxiv.org/abs/2505.09999 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csDC_…
This https://arxiv.org/abs/2406.04655 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_sta…
Two-Sample Covariance Inference in High-Dimensional Elliptical Models
Nina Dörnemann
https://arxiv.org/abs/2507.02640 https://…
Memory Access Characterization of Large Language Models in CPU Environment and its Potential Impacts
Spencer Banasik
https://arxiv.org/abs/2506.01827 https…
A Frequentist Simulation-Based Inference Treatment of Sterile Neutrino Global Fits
Joshua Villarreal, Julia Woodward, John Hardin, Janet Conrad
https://arxiv.org/abs/2507.01153
Robust Estimation in Step-Stress Experiments under Exponential Lifetime Distributions
María Jaenada, Juan Manuel Millán, Leandro Pardo
https://arxiv.org/abs/2506.04445
Deep Recommender Models Inference: Automatic Asymmetric Data Flow Optimization
Giuseppe Ruggeri, Renzo Andri, Daniele Jahier Pagliari, Lukas Cavigelli
https://arxiv.org/abs/2507.01676
Flexible Selective Inference with Flow-based Transport Maps
Sifan Liu, Snigdha Panigrahi
https://arxiv.org/abs/2506.01150 https://arx…
DistMLIP: A Distributed Inference Platform for Machine Learning Interatomic Potentials
Kevin Han, Bowen Deng, Amir Barati Farimani, Gerbrand Ceder
https://arxiv.org/abs/2506.02023
ProjMC$^2$: Scalable and Stable Posterior Inference for Bayesian Spatial Factor Models with Application to Spatial Transcriptomics
Lu Zhang
https://arxiv.org/abs/2506.01098
Keyed Chaotic Tensor Transformations for Secure And Attributable Neural Inference
Peter David Fagan
https://arxiv.org/abs/2505.23655 https://
QPART: Adaptive Model Quantization and Dynamic Workload Balancing for Accuracy-aware Edge Inference
Xiangchen Li, Saeid Ghafouri, Bo Ji, Hans Vandierendonck, Deepu John, Dimitrios S. Nikolopoulos
https://arxiv.org/abs/2506.23934
This https://arxiv.org/abs/2408.06211 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_sta…
Ghidorah: Fast LLM Inference on Edge with Speculative Decoding and Hetero-Core Parallelism
Jinhui Wei, Ye Huang, Yuhui Zhou, Jiazhi Jiang, Jiangsu Du
https://arxiv.org/abs/2505.23219
Find a Scapegoat: Poisoning Membership Inference Attack and Defense to Federated Learning
Wenjin Mo, Zhiyuan Li, Minghong Fang, Mingwei Fang
https://arxiv.org/abs/2507.00423
Causal Inference in Panel Data with a Continuous Treatment
Zhiguo Xiao, Peikai Wu
https://arxiv.org/abs/2506.23226 https://arxiv.org/…
LLM-Mesh: Enabling Elastic Sharing for Serverless LLM Inference
Chuhao Xu, Zijun Li, Quan Chen, Han Zhao, Minyi Guo
https://arxiv.org/abs/2507.00507 https:…
Synopsis: Secure and private trend inference from encrypted semantic embeddings
Madelyne Xiao, Palak Jain, Micha Gorelick, Sarah Scheffler
https://arxiv.org/abs/2505.23880
SkyLB: A Locality-Aware Cross-Region Load Balancer for LLM Inference
Tian Xia, Ziming Mao, Jamison Kerney, Ethan J. Jackson, Zhifei Li, Jiarong Xing, Scott Shenker, Ion Stoica
https://arxiv.org/abs/2505.24095
This https://arxiv.org/abs/2304.12414 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_sta…
SiPipe: Bridging the CPU-GPU Utilization Gap for Efficient Pipeline-Parallel LLM Inference
Yongchao He, Bohan Zhao, Zheng Cao
https://arxiv.org/abs/2506.22033
Large Language Models for Statistical Inference: Context Augmentation with Applications to the Two-Sample Problem and Regression
Marc Ratkovic
https://arxiv.org/abs/2506.23862
Reluctant Interaction Inference after Additive Modeling
Yiling Huang, Snigdha Panigrahi, Guo Yu, Jacob Bien
https://arxiv.org/abs/2506.01219 https://
Cascadia: A Cascade Serving System for Large Language Models
Youhe Jiang, Fangcheng Fu, Wanru Zhao, Stephan Rabanser, Nicholas D. Lane, Binhang Yuan
https://arxiv.org/abs/2506.04203