Tootfinder

Opt-in global Mastodon full text search. Join the index!

@arXiv_csCV_bot@mastoxiv.page
2025-06-25 10:32:00

ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing
Long Xing, Qidong Huang, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Jinsong Li, Shuangrui Ding, Weiming Zhang, Nenghai Yu, Jiaqi Wang, Feng Wu, Dahua Lin
arxiv.org/abs/2506.19848

@arXiv_csAI_bot@mastoxiv.page
2025-07-25 08:14:52

Agentic AI framework for End-to-End Medical Data Inference
Soorya Ram Shimgekar, Shayan Vassef, Abhay Goyal, Navin Kumar, Koustuv Saha
arxiv.org/abs/2507.18115

@arXiv_csCL_bot@mastoxiv.page
2025-06-26 09:40:50

When Life Gives You Samples: The Benefits of Scaling up Inference Compute for Multilingual LLMs
Ammar Khairi, Daniel D'souza, Ye Shen, Julia Kreutzer, Sara Hooker
arxiv.org/abs/2506.20544

@arXiv_csCR_bot@mastoxiv.page
2025-08-25 09:03:50

Evaluating the Defense Potential of Machine Unlearning against Membership Inference Attacks
Aristeidis Sidiropoulos, Christos Chrysanthos Nikolaidis, Theodoros Tsiolakis, Nikolaos Pavlidis, Vasilis Perifanis, Pavlos S. Efraimidis
arxiv.org/abs/2508.16150

@arXiv_csDC_bot@mastoxiv.page
2025-07-25 09:20:42

Cloud Native System for LLM Inference Serving
Minxian Xu, Junhan Liao, Jingfeng Wu, Yiyuan He, Kejiang Ye, Chengzhong Xu
arxiv.org/abs/2507.18007

@arXiv_mathST_bot@mastoxiv.page
2025-08-26 09:07:26

Quasi-likelihood inference for SDE with mixed-effects observed at high frequency
Maud Delattre, Hiroki Masuda
arxiv.org/abs/2508.17910 arxi…

@arXiv_astrophHE_bot@mastoxiv.page
2025-08-25 09:10:10

Simulation-Based Inference for Direction Reconstruction of Ultra-High-Energy Cosmic Rays with Radio Arrays
Oscar Macias, Zachary Mason, Matthew Ho, Ars\`ene Ferri\`ere, Aur\'elien Benoit-L\'evy, Mat\'ias Tueros
arxiv.org/abs/2508.15991

@arXiv_astrophIM_bot@mastoxiv.page
2025-06-25 08:31:30

Validating Sequential Monte Carlo for Gravitational-Wave Inference
Michael J. Williams, Minas Karamanis, Yilin Luo, Uro\v{s} Seljak
arxiv.org/abs/2506.18977

@arXiv_csLG_bot@mastoxiv.page
2025-07-24 10:18:09

ViRN: Variational Inference and Distribution Trilateration for Long-Tailed Continual Representation Learning
Hao Dai, Chong Tang, Jagmohan Chauhan
arxiv.org/abs/2507.17368

@arXiv_csIT_bot@mastoxiv.page
2025-07-25 08:26:11

Minimax Data Sanitization with Distortion Constraint and Adversarial Inference
Amirarsalan Moatazedian, Yauhen Yakimenka, R\'emi A. Chou, J\"org Kliewer
arxiv.org/abs/2507.17942

@arXiv_statME_bot@mastoxiv.page
2025-08-25 09:22:10

Scalable Bayesian inference on high-dimensional multivariate linear regression
Xuan Cao, Kyoungjae Lee
arxiv.org/abs/2508.16446 arxiv.org/p…

@arXiv_csAR_bot@mastoxiv.page
2025-08-25 07:42:20

Bare-Metal RISC-V NVDLA SoC for Efficient Deep Learning Inference
Vineet Kumar (School of Electrical,Electronic Engineering, University College Dublin, Dublin, Ireland, Department of Electronic,Electrical Engineering, Trinity College Dublin, Dublin, Ireland), Ajay Kumar M (School of Electrical,Electronic Engineering, University College Dublin, Dublin, Ireland, Department of Electronic,Electrical Engineering, Trinity College Dublin, Dublin, Ireland), Yike Li (School of Electrical,Elec…

@fanf@mendeddrum.org
2025-08-21 11:42:03

from my link log —
Damas-Hindley-Milner inference two ways.
bernsteinbear.com/blog/type-in
saved 2024-10-17

@arXiv_astrophCO_bot@mastoxiv.page
2025-08-25 08:48:20

Simulation based inference of the ionization history from the 2D 21 cm power spectrum
Nadia Cooper, Carina Norregaard, Romain Meriot, Jonathan R. Pritchard
arxiv.org/abs/2508.16329

@arXiv_csCE_bot@mastoxiv.page
2025-06-24 09:02:50

Exact Conditional Score-Guided Generative Modeling for Amortized Inference in Uncertainty Quantification
Zezhong Zhang, Caroline Tatsuoka, Dongbin Xiu, Guannan Zhang
arxiv.org/abs/2506.18227

@arXiv_qbioNC_bot@mastoxiv.page
2025-08-26 08:57:06

A coalgebraic perspective on predictive processing
Manuel Baltieri, Filippo Torresan, Tomoya Nakai
arxiv.org/abs/2508.16877 arxiv.org/pdf/2…

@arXiv_csDC_bot@mastoxiv.page
2025-07-25 07:51:41

Flexible Vector Integration in Embedded RISC-V SoCs for End to End CNN Inference Acceleration
Dmitri Lyalikov
arxiv.org/abs/2507.17771 arxi…

@arXiv_csPL_bot@mastoxiv.page
2025-08-22 09:04:01

Probabilistic Inference for Datalog with Correlated Inputs
Jingbo Wang, Shashin Halalingaiah, Weiyi Chen, Chao Wang, Isil Dillig
arxiv.org/abs/2508.15166

@arXiv_physicsoptics_bot@mastoxiv.page
2025-07-25 08:53:02

Temporal Broadening of Attosecond Pulse Trains Induced by Multi-Band inference in Solid-State High-Order Harmonic Generation
Qing-Guo Fan, Kang Lai, Wen-hao Liu, Zhi Wang, Lin-Wang Wang, Jun-Wei Luo
arxiv.org/abs/2507.18019

@arXiv_csNI_bot@mastoxiv.page
2025-06-25 07:49:49

WiLLM: An Open Wireless LLM Communication System
Boyi Liu, Yongguang Lu, Jianguo Zhao, Qiang Yang, Wen Wu, Lin Chen, Jagmohan Chauhan, Jun Zhang
arxiv.org/abs/2506.19030

@arXiv_qbioPE_bot@mastoxiv.page
2025-07-25 08:47:52

ARTreeFormer: A Faster Attention-based Autoregressive Model for Phylogenetic Inference
Tianyu Xie, Yicong Mao, Cheng Zhang
arxiv.org/abs/2507.18380

@arXiv_csCL_bot@mastoxiv.page
2025-06-26 07:33:20

Inference Scaled GraphRAG: Improving Multi Hop Question Answering on Knowledge Graphs
Travis Thompson, Seung-Hwan Lim, Paul Liu, Ruoying He, Dongkuan Xu
arxiv.org/abs/2506.19967

@arXiv_physicsplasmph_bot@mastoxiv.page
2025-06-26 08:49:20

Tomography for Plasma Imaging: a Unifying Framework for Bayesian Inference
D. Hamm, C. Theiler, M. Simeoni, B. P. Duval, T. Debarre, L. Simons, J. R. Queralt
arxiv.org/abs/2506.20232

@arXiv_econEM_bot@mastoxiv.page
2025-07-25 07:46:22

Partitioned Wild Bootstrap for Panel Data Quantile Regression
Antonio F. Galvao, Carlos Lamarche, Thomas Parker
arxiv.org/abs/2507.18494 ar…

@arXiv_csCR_bot@mastoxiv.page
2025-07-25 09:19:22

LoRA-Leak: Membership Inference Attacks Against LoRA Fine-tuned Language Models
Delong Ran, Xinlei He, Tianshuo Cong, Anyu Wang, Qi Li, Xiaoyun Wang
arxiv.org/abs/2507.18302

@arXiv_csPF_bot@mastoxiv.page
2025-08-25 07:41:10

GreenLLM: SLO-Aware Dynamic Frequency Scaling for Energy-Efficient LLM Serving
Qunyou Liu, Darong Huang, Marina Zapater, David Atienza
arxiv.org/abs/2508.16449

@arXiv_statML_bot@mastoxiv.page
2025-06-26 09:25:51

LARP: Learner-Agnostic Robust Data Prefiltering
Kristian Minchev, Dimitar Iliev Dimitrov, Nikola Konstantinov
arxiv.org/abs/2506.20573

@arXiv_quantph_bot@mastoxiv.page
2025-07-25 10:05:02

Hybrid quantum-classical algorithm for near-optimal planning in POMDPs
Gilberto Cunha, Alexandra Ram\^oa, Andr\'e Sequeira, Michael de Oliveira, Lu\'is Barbosa
arxiv.org/abs/2507.18606

@arXiv_csRO_bot@mastoxiv.page
2025-06-25 13:12:20

Replaced article(s) found for cs.RO. arxiv.org/list/cs.RO/new
[1/2]:
- Stochastic Motion Planning as Gaussian Variational Inference: Theory and Algorithms
Hongzhe Yu, Yongxin Chen

@arXiv_csSE_bot@mastoxiv.page
2025-08-25 09:26:00

SATORI: Static Test Oracle Generation for REST APIs
Juan C. Alonso, Alberto Martin-Lopez, Sergio Segura, Gabriele Bavota, Antonio Ruiz-Cort\'es
arxiv.org/abs/2508.16318

@arXiv_statME_bot@mastoxiv.page
2025-07-25 09:26:52

How weak are weak factors? Uniform inference for signal strength in signal plus noise models
Anna Bykhovskaya, Vadim Gorin, Sasha Sodin
arxiv.org/abs/2507.18554

@arXiv_hepph_bot@mastoxiv.page
2025-07-24 08:41:30

A linear PDF model for Bayesian inference
Mark N. Costantini, Luca Mantani, James M. Moore, Maria Ubiali
arxiv.org/abs/2507.16913

@arXiv_csAR_bot@mastoxiv.page
2025-06-25 07:31:09

MEDEA: A Design-Time Multi-Objective Manager for Energy-Efficient DNN Inference on Heterogeneous Ultra-Low Power Platforms
Hossein Taji, Jos\'e Miranda, Miguel Pe\'on-Quir\'os, David Atienza
arxiv.org/abs/2506.19067

@arXiv_csCL_bot@mastoxiv.page
2025-08-25 10:06:00

A Probabilistic Inference Scaling Theory for LLM Self-Correction
Zhe Yang, Yichang Zhang, Yudong Wang, Ziyao Xu, Junyang Lin, Zhifang Sui
arxiv.org/abs/2508.16456

@cyrevolt@mastodon.social
2025-06-18 11:13:23

Right a few days before I'll be talking about patterns in DRAM init at #GPN23, #Binarly are posting on their type inference tooling:

@arXiv_csSD_bot@mastoxiv.page
2025-06-23 10:33:20

ITO-Master: Inference-Time Optimization for Audio Effects Modeling of Music Mastering Processors
Junghyun Koo, Marco A. Martinez-Ramirez, Wei-Hsiang Liao, Giorgio Fabbro, Michele Mancusi, Yuki Mitsufuji
arxiv.org/abs/2506.16889

@arXiv_astrophCO_bot@mastoxiv.page
2025-08-25 08:28:30

CIGaRS I: Combined simulation-based inference from SNae Ia and host photometry
Konstantin Karchev, Roberto Trotta, Raul Jimenez
arxiv.org/abs/2508.15899

@arXiv_eessIV_bot@mastoxiv.page
2025-07-25 09:45:22

UniSegDiff: Boosting Unified Lesion Segmentation via a Staged Diffusion Model
Yilong Hu, Shijie Chang, Lihe Zhang, Feng Tian, Weibing Sun, Huchuan Lu
arxiv.org/abs/2507.18362

@arXiv_grqc_bot@mastoxiv.page
2025-06-26 08:28:40

Waging a Campaign: Results from an Injection-Recovery Study involving 35 numerical Relativity Simulations and three Waveform Models
Sarp Ak\c{c}ay, Charlie Hoy, Jake Mac Uilliam
arxiv.org/abs/2506.19990

@arXiv_csOS_bot@mastoxiv.page
2025-06-26 08:44:30

MNN-AECS: Energy Optimization for LLM Decoding on Mobile Devices via Adaptive Core Selection
Zhengxiang Huang, Chaoyue Niu, Zhaode Wang, Jiarui Xue, Hanming Zhang, Yugang Wang, Zewei Xin, Xiaotang Jiang, Chengfei Lv, Fan Wu, Guihai Chen
arxiv.org/abs/2506.19884

@arXiv_mathST_bot@mastoxiv.page
2025-06-25 08:59:50

Statistical Inference for Optimal Transport Maps: Recent Advances and Perspectives
Sivaraman Balakrishnan, Tudor Manole, Larry Wasserman
arxiv.org/abs/2506.19025

@arXiv_csAI_bot@mastoxiv.page
2025-06-24 11:09:40

Decentralized Consensus Inference-based Hierarchical Reinforcement Learning for Multi-Constrained UAV Pursuit-Evasion Game
Xiang Yuming, Li Sizhao, Li Rongpeng, Zhao Zhifeng, Zhang Honggang
arxiv.org/abs/2506.18126

@arXiv_csLG_bot@mastoxiv.page
2025-08-22 10:15:21

Inductive Domain Transfer In Misspecified Simulation-Based Inference
Ortal Senouf, Antoine Wehenkel, C\'edric Vincent-Cuaz, Emmanuel Abb\'e, Pascal Frossard
arxiv.org/abs/2508.15593

@arXiv_csCR_bot@mastoxiv.page
2025-06-26 09:21:00

Retrieval-Confused Generation is a Good Defender for Privacy Violation Attack of Large Language Models
Wanli Peng, Xin Chen, Hang Fu, XinYu He, Xue Yiming, Juan Wen
arxiv.org/abs/2506.19889

@arXiv_csCV_bot@mastoxiv.page
2025-08-22 10:20:41

Scaling Group Inference for Diverse and High-Quality Generation
Gaurav Parmar, Or Patashnik, Daniil Ostashev, Kuan-Chieh Wang, Kfir Aberman, Srinivasa Narasimhan, Jun-Yan Zhu
arxiv.org/abs/2508.15773

@arXiv_csDC_bot@mastoxiv.page
2025-06-26 09:01:30

SuperSONIC: Cloud-Native Infrastructure for ML Inferencing
Dmitry Kondratyev, Benedikt Riedel, Yuan-Tang Chou, Miles Cochran-Branson, Noah Paladino, David Schultz, Mia Liu, Javier Duarte, Philip Harris, Shih-Chieh Hsu
arxiv.org/abs/2506.20657

@arXiv_csAR_bot@mastoxiv.page
2025-08-26 07:31:26

TMA-Adaptive FP8 Grouped GEMM: Eliminating Padding Requirements in Low-Precision Training and Inference on Hopper
Zhongling Su, Rong Fu, Weihan Cao, Jianfei Gao, Minxi Jin, Zhilin Pei, Hui Wang
arxiv.org/abs/2508.16584

@arXiv_statME_bot@mastoxiv.page
2025-06-25 09:05:20

gcor: A Python Implementation of Categorical Gini Correlation and Its Inference
Sameera Hewage
arxiv.org/abs/2506.19230

@arXiv_statML_bot@mastoxiv.page
2025-07-25 08:57:32

A Two-armed Bandit Framework for A/B Testing
Jinjuan Wang, Qianglin Wen, Yu Zhang, Xiaodong Yan, Chengchun Shi
arxiv.org/abs/2507.18118 arx…

@arXiv_csRO_bot@mastoxiv.page
2025-08-25 08:33:20

GPL-SLAM: A Laser SLAM Framework with Gaussian Process Based Extended Landmarks
Ali Emre Balc{\i} (TU Delft), Erhan Ege Keyvan (Middle East Technical University), Emre \"Ozkan (Middle East Technical University)
arxiv.org/abs/2508.16459

@arXiv_csLG_bot@mastoxiv.page
2025-08-21 10:10:00

Adaptively Robust LLM Inference Optimization under Prediction Uncertainty
Zixi Chen, Yinyu Ye, Zijie Zhou
arxiv.org/abs/2508.14544 arxiv.or…

@arXiv_statME_bot@mastoxiv.page
2025-07-24 08:49:39

Efficient Bayesian Inference for Spatial Point Patterns Using the Palm Likelihood
Kevin M. Collins, Erin M. Schliep
arxiv.org/abs/2507.17065

@arXiv_hepph_bot@mastoxiv.page
2025-08-25 09:36:10

Transport Properties of QGP within a Bayesian Holographic QCD Model
Bing Chen, Liqiang Zhu, Xun Chen, Defu Hou, Xurong Chen
arxiv.org/abs/2508.16167

@arXiv_csIT_bot@mastoxiv.page
2025-08-25 11:29:00

Replaced article(s) found for cs.IT. arxiv.org/list/cs.IT/new
[1/1]:
- Computation and Communication Co-scheduling for Multi-Task Remote Inference
Md Kamran Chowdhury Shisher, Adam Piaseczny, Yin Sun, Christopher G. Brinton

@arXiv_csCR_bot@mastoxiv.page
2025-07-24 09:24:30

Tab-MIA: A Benchmark Dataset for Membership Inference Attacks on Tabular Data in LLMs
Eyal German, Sagiv Antebi, Daniel Samira, Asaf Shabtai, Yuval Elovici
arxiv.org/abs/2507.17259

@arXiv_csAR_bot@mastoxiv.page
2025-08-25 08:11:00

Hardwired-Neurons Language Processing Units as General-Purpose Cognitive Substrates
Yang Liu, Yi Chen, Yongwei Zhao, Yifan Hao, Zifu Zheng, Weihao Kong, Zhangmai Li, Dongchen Jiang, Ruiyang Xia, Zhihong Ma, Zisheng Liu, Zhaoyong Wan, Yunqi Lu, Ximing Liu, Hongrui Guo, Zhihao Yang, Zhe Wang, Tianrui Ma, Mo Zou, Rui Zhang, Ling Li, Xing Hu, Zidong Du, Zhiwei Xu, Qi Guo, Tianshi Chen, Yunji Chen

@arXiv_csCV_bot@mastoxiv.page
2025-08-25 09:46:20

Forecast then Calibrate: Feature Caching as ODE for Efficient Diffusion Transformers
Shikang Zheng, Liang Feng, Xinyu Wang, Qinming Zhou, Peiliang Cai, Chang Zou, Jiacheng Liu, Yuqi Lin, Junjie Chen, Yue Ma, Linfeng Zhang
arxiv.org/abs/2508.16211

@arXiv_csAI_bot@mastoxiv.page
2025-08-25 08:26:50

Bridging the Gap in Ophthalmic AI: MM-Retinal-Reason Dataset and OphthaReason Model toward Dynamic Multimodal Reasoning
Ruiqi Wu, Yuang Yao, Tengfei Ma, Chenran Zhang, Na Su, Tao Zhou, Geng Chen, Wen Fan, Yi Zhou
arxiv.org/abs/2508.16129

@arXiv_csDC_bot@mastoxiv.page
2025-07-24 07:55:39

BrownoutServe: SLO-Aware Inference Serving under Bursty Workloads for MoE-based LLMs
Jianmin Hu, Minxian Xu, Kejiang Ye, Chengzhong Xu
arxiv.org/abs/2507.17133

@arXiv_csLG_bot@mastoxiv.page
2025-07-24 09:03:49

SiLQ: Simple Large Language Model Quantization-Aware Training
Steven K. Esser, Jeffrey L. McKinstry, Deepika Bablani, Rathinakumar Appuswamy, Dharmendra S. Modha
arxiv.org/abs/2507.16933

@arXiv_csCV_bot@mastoxiv.page
2025-06-25 10:32:30

Radial Attention: $O(n\log n)$ Sparse Attention with Energy Decay for Long Video Generation
Xingyang Li, Muyang Li, Tianle Cai, Haocheng Xi, Shuo Yang, Yujun Lin, Lvmin Zhang, Songlin Yang, Jinbo Hu, Kelly Peng, Maneesh Agrawala, Ion Stoica, Kurt Keutzer, Song Han
arxiv.org/abs/2506.19852

@arXiv_csDC_bot@mastoxiv.page
2025-07-24 07:47:19

BucketServe: Bucket-Based Dynamic Batching for Smart and Efficient LLM Inference Serving
Wanyi Zheng, Minxian Xu, Shengye Song, Kejiang Ye
arxiv.org/abs/2507.17120

@arXiv_csAI_bot@mastoxiv.page
2025-08-26 09:12:46

Bridging the Gap in Ophthalmic AI: MM-Retinal-Reason Dataset and OphthaReason Model toward Dynamic Multimodal Reasoning
Ruiqi Wu, Yuang Yao, Tengfei Ma, Chenran Zhang, Na Su, Tao Zhou, Geng Chen, Wen Fan, Yi Zhou
arxiv.org/abs/2508.16129

@arXiv_csCL_bot@mastoxiv.page
2025-06-26 07:39:50

A Modular Multitask Reasoning Framework Integrating Spatio-temporal Models and LLMs
Kethmi Hirushini Hettige, Jiahao Ji, Cheng Long, Shili Xiang, Gao Cong, Jingyuan Wang
arxiv.org/abs/2506.20073

@arXiv_statME_bot@mastoxiv.page
2025-06-24 11:30:30

Leveraging specificity for causal inference in observational studies
Wang Miao
arxiv.org/abs/2506.18469 arxiv.org/pdf…

@arXiv_csLG_bot@mastoxiv.page
2025-06-25 07:37:19

From Tiny Machine Learning to Tiny Deep Learning: A Survey
Shriyank Somvanshi, Md Monzurul Islam, Gaurab Chhetri, Rohit Chakraborty, Mahmuda Sultana Mimi, Swagat Ahmed Shuvo, Kazi Sifatul Islam, Syed Aaqib Javed, Sharif Ahmed Rafat, Anandi Dutta, Subasish Das
arxiv.org/abs/2506.18927

@arXiv_csCV_bot@mastoxiv.page
2025-08-25 09:56:50

A Lightweight Group Multiscale Bidirectional Interactive Network for Real-Time Steel Surface Defect Detection
Yong Zhang, Cunjian Chen, Qiang Gao, Yi Wang, Bin Fang
arxiv.org/abs/2508.16397

@arXiv_csCL_bot@mastoxiv.page
2025-06-26 09:38:30

GPTailor: Large Language Model Pruning Through Layer Cutting and Stitching
Guinan Su, Li Shen, Lu Yin, Shiwei Liu, Yanwu Yang, Jonas Geiping
arxiv.org/abs/2506.20480

@arXiv_csDC_bot@mastoxiv.page
2025-07-23 08:23:32

Collaborative Inference and Learning between Edge SLMs and Cloud LLMs: A Survey of Algorithms, Execution, and Open Challenges
Senyao Li, Haozhao Wang, Wenchao Xu, Rui Zhang, Song Guo, Jingling Yuan, Xian Zhong, Tianwei Zhang, Ruixuan Li
arxiv.org/abs/2507.16731

@arXiv_csCR_bot@mastoxiv.page
2025-06-25 09:24:20

PrivacyXray: Detecting Privacy Breaches in LLMs through Semantic Consistency and Probability Certainty
Jinwen He, Yiyang Lu, Zijin Lin, Kai Chen, Yue Zhao
arxiv.org/abs/2506.19563

@arXiv_csCV_bot@mastoxiv.page
2025-08-25 09:56:00

Exploiting Information Redundancy in Attention Maps for Extreme Quantization of Vision Transformers
Lucas Maisonnave, Karim Haroun, Tom Pegeot
arxiv.org/abs/2508.16311

@arXiv_csLG_bot@mastoxiv.page
2025-08-25 09:50:20

On the Evolution of Federated Post-Training Large Language Models: A Model Accessibility View
Tao Guo, Junxiao Wang, Fushuo Huo, Laizhong Cui, Song Guo, Jie Gui, Dacheng Tao
arxiv.org/abs/2508.16261

@arXiv_csCL_bot@mastoxiv.page
2025-07-25 10:11:02

GIIFT: Graph-guided Inductive Image-free Multimodal Machine Translation
Jiafeng Xiong, Yuting Zhao
arxiv.org/abs/2507.18562 arxiv.org/pdf/2…

@arXiv_csAR_bot@mastoxiv.page
2025-06-24 08:10:19

Embedded FPGA Acceleration of Brain-Like Neural Networks: Online Learning to Scalable Inference
Muhammad Ihsan Al Hafiz, Naresh Ravichandran, Anders Lansner, Pawel Herman, Artur Podobas
arxiv.org/abs/2506.18530

@arXiv_csCR_bot@mastoxiv.page
2025-06-24 11:41:50

HE-LRM: Encrypted Deep Learning Recommendation Models using Fully Homomorphic Encryption
Karthik Garimella, Austin Ebel, Gabrielle De Micheli, Brandon Reagen
arxiv.org/abs/2506.18150

@arXiv_statME_bot@mastoxiv.page
2025-07-24 09:33:19

Nonparametric inference for nonstationary spatial point processes
Izabel Nolau, Fl\'avio B. Gon\c{c}alves, Dani Gamerman
arxiv.org/abs/2507.17600

@arXiv_csCL_bot@mastoxiv.page
2025-06-26 09:45:00

DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation
Shansan Gong, Ruixiang Zhang, Huangjie Zheng, Jiatao Gu, Navdeep Jaitly, Lingpeng Kong, Yizhe Zhang
arxiv.org/abs/2506.20639

@arXiv_csDC_bot@mastoxiv.page
2025-07-25 09:20:42

Cloud Native System for LLM Inference Serving
Minxian Xu, Junhan Liao, Jingfeng Wu, Yiyuan He, Kejiang Ye, Chengzhong Xu
arxiv.org/abs/2507.18007

@arXiv_csAR_bot@mastoxiv.page
2025-07-22 07:33:40

Efficient LLM Inference: Bandwidth, Compute, Synchronization, and Capacity are all you need
Michael Davies, Neal Crago, Karthikeyan Sankaralingam, Christos Kozyrakis
arxiv.org/abs/2507.14397

@arXiv_statME_bot@mastoxiv.page
2025-06-24 10:58:00

Bayesian Inference for Left-Truncated Log-Logistic Distributions for Time-to-event Data Analysis
Fahad Mostafa, Md Rejuan Haque, Md Mostafijur Rahman, Farzana Nasrin
arxiv.org/abs/2506.17852

@arXiv_csCL_bot@mastoxiv.page
2025-07-25 10:13:12

System Report for CCL25-Eval Task 10: SRAG-MAV for Fine-Grained Chinese Hate Speech Recognition
Jiahao Wang, Ramen Liu, Longhui Zhang, Jing Li
arxiv.org/abs/2507.18580

@arXiv_csCV_bot@mastoxiv.page
2025-06-19 08:22:24

Dual-Stage Value-Guided Inference with Margin-Based Reward Adjustment for Fast and Faithful VLM Captioning
Ankan Deria, Adinath Madhavrao Dukre, Feilong Tang, Sara Atito, Sudipta Roy, Muhammad Awais, Muhammad Haris Khan, Imran Razzak
arxiv.org/abs/2506.15649

@arXiv_csDC_bot@mastoxiv.page
2025-07-25 07:51:41

Flexible Vector Integration in Embedded RISC-V SoCs for End to End CNN Inference Acceleration
Dmitri Lyalikov
arxiv.org/abs/2507.17771 arxi…

@arXiv_statME_bot@mastoxiv.page
2025-07-24 09:30:09

Doubly robust outlier resistant inference on causal treatment effect
Joonsung Kang
arxiv.org/abs/2507.17439 arxiv.org/pdf/2507.17439

@arXiv_csAR_bot@mastoxiv.page
2025-07-25 08:06:42

Sandwich: Separating Prefill-Decode Compilation for Efficient CPU LLM Serving
Juntao Zhao, Jiuru Li, Chuan Wu
arxiv.org/abs/2507.18454 arxi…

@arXiv_csDC_bot@mastoxiv.page
2025-08-22 08:20:20

Efficient Mixed-Precision Large Language Model Inference with TurboMind
Li Zhang, Youhe Jiang, Guoliang He, Xin Chen, Han Lv, Qian Yao, Fangcheng Fu, Kai Chen
arxiv.org/abs/2508.15601

@arXiv_csCL_bot@mastoxiv.page
2025-08-21 09:22:00

Comparing energy consumption and accuracy in text classification inference
Johannes Zschache, Tilman Hartwig
arxiv.org/abs/2508.14170 arxiv…

@arXiv_statME_bot@mastoxiv.page
2025-07-23 08:56:22

Predictive inference for discrete-valued time series
Maxime Faymonville, Carsten Jentsch, Efstathios Paparoditis
arxiv.org/abs/2507.16035

@arXiv_csDC_bot@mastoxiv.page
2025-07-22 09:40:50

Efficient Routing of Inference Requests across LLM Instances in Cloud-Edge Computing
Shibo Yu, Mohammad Goudarzi, Adel Nadjaran Toosi
arxiv.org/abs/2507.15553

@arXiv_statME_bot@mastoxiv.page
2025-07-22 11:00:50

Inference on Nonlinear Counterfactual Functionals under a Multiplicative IV Model
Yonghoon Lee, Mengxin Yu, Jiewen Liu, Chan Park, Yunshu Zhang, James M. Robins, Eric J. Tchetgen Tchetgen
arxiv.org/abs/2507.15612

@arXiv_csDC_bot@mastoxiv.page
2025-07-25 09:21:11

FCPO: Federated Continual Policy Optimization for Real-Time High-Throughput Edge Video Analytics
Lucas Liebe, Thanh-Tung Nguyen, Dongman Lee
arxiv.org/abs/2507.18047

@arXiv_statME_bot@mastoxiv.page
2025-08-25 09:11:10

Heterogeneous Quantile Treatment Effect Estimation for Longitudinal Data with High-Dimensional Confounding
Zhixin Qiu, Huichen Zhu, Wenjie Wang, Yanlin Tang
arxiv.org/abs/2508.16326

@arXiv_csDC_bot@mastoxiv.page
2025-06-26 08:46:40

WattsOnAI: Measuring, Analyzing, and Visualizing Energy and Carbon Footprint of AI Workloads
Hongzhen Huang, Kunming Zhang, Hanlong Liao, Kui Wu, Guoming Tang
arxiv.org/abs/2506.20535

@arXiv_statME_bot@mastoxiv.page
2025-08-25 08:32:40

Quasi Instrumental Variable Methods for Stable Hidden Confounding and Binary Outcome
Zhonghua Liu, Baoluo Sun, Ting Ye, David Richardson, Eric Tchetgen Tchetgen
arxiv.org/abs/2508.16096

@arXiv_csDC_bot@mastoxiv.page
2025-07-25 12:18:00

Replaced article(s) found for cs.DC. arxiv.org/list/cs.DC/new
[1/1]:
- Staleness-Centric Optimizations for Parallel Diffusion MoE Inference
Jiajun Luo, Lizhuo Luo, Jianru Xu, Jiajun Song, Rongwei Lu, Chen Tang, Zhi Wang

@arXiv_statME_bot@mastoxiv.page
2025-07-24 08:39:30

Bayesian Compressed Mixed-Effects Models
Sreya Sarkar, Kshitij Khare, Sanvesh Srivastava
arxiv.org/abs/2507.16961 arxiv.org/pdf/2507.16961

@arXiv_csDC_bot@mastoxiv.page
2025-07-25 12:18:00

Replaced article(s) found for cs.DC. arxiv.org/list/cs.DC/new
[1/1]:
- Staleness-Centric Optimizations for Parallel Diffusion MoE Inference
Jiajun Luo, Lizhuo Luo, Jianru Xu, Jiajun Song, Rongwei Lu, Chen Tang, Zhi Wang

@arXiv_statME_bot@mastoxiv.page
2025-06-24 11:43:40

Likelihood Ratio test for Poisson graph
Chen Shuyan, Liu Xin, Wang Shaoli
arxiv.org/abs/2506.18778 arxiv.org/pdf/2506…

@arXiv_statME_bot@mastoxiv.page
2025-06-25 08:48:40

Principal stratification with recurrent events truncated by a terminal event: A nested Bayesian nonparametric approach
Yuki Ohnishi, Michael O. Harhay, Fan Li
arxiv.org/abs/2506.19015

@arXiv_csDC_bot@mastoxiv.page
2025-07-17 09:12:30

Arctic Inference with Shift Parallelism: Fast and Efficient Open Source Inference System for Enterprise AI
Samyam Rajbhandari, Mert Hidayetoglu, Aurick Qiao, Ye Wang, Juncheng Yang, Jeff Rasley, Michael Wyatt, Yuxiong He
arxiv.org/abs/2507.11830