
2025-06-25 09:45:00
Adaptive Request Scheduling for CodeLLM Serving with SLA Guarantees
Shi Chang, Boyuan Chen, Kishanthan Thangarajah, Hanan Lutfiyya, Ahmed E. Hassan
https://arxiv.org/abs/2506.19677
On the Batch Size Selection in Stochastic Gradient Methods Using No-Replacement Sampling
Marco Boresta, Alberto De Santis, Stefano Lucidi
https://arxiv.org/abs/2506.08758
This https://arxiv.org/abs/2505.14884 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csLG_…
ASMOP: Additional sampling stochastic trust region method for multi-objective problems
Nataša Krklec Jerinkić, Luka Rutešić
https://arxiv.org/abs/2506.10976
AI Accelerators for Large Language Model Inference: Architecture Analysis and Scaling Strategies
Amit Sharma
https://arxiv.org/abs/2506.00008
This https://arxiv.org/abs/2401.06738 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…