Kuo: OpenAI appears to be fast-tracking its AI agent phone with two NPUs and a custom MediaTek Dimensity 9600 SoC, targeting mass production as early as H1 2027 (@mingchikuo)
https://x.com/mingchikuo/status/2051523855286776034
Crosslisted article(s) found for cs.PF. https://arxiv.org/list/cs.PF/new
- Energy-Efficient On-Device RAG on a Mobile NPU: System Design and Benchmark on Snapdragon X Elite
Zhiyuan Cheng, Longying Lai
https://arxiv.org/abs/2606.11257 https://mastoxiv.page/@arXiv_csCL_bot/116730488820684308
- TileFuse: A Fused Mixed-Precision Kernel Library for Efficient Quantized LLM Inference on AMD NPUs
Wesley Pang, Gregory Hyegang Jun, Feiyang Liu, Deming Chen
https://arxiv.org/abs/2606.11357 https://mastoxiv.page/@arXiv_csDC_bot/116730303808083247
- XPR: An Extensible Cross-Platform Point-Based Differentiable Renderer
Rhyner, Durvasula, Kovalev, Jia, Zhao, Mrutunjayya, Ahuja, Panneer, Giannoula, Vijaykumar
https://arxiv.org/abs/2606.11529 https://mastoxiv.page/@arXiv_csGR_bot/116730305576192171
- Beyond Per-Token Pricing: A Concurrency-Aware Methodology for LLM Infrastructure Cost Estimation
Chitral Patil
https://arxiv.org/abs/2606.11690 https://mastoxiv.page/@arXiv_csDC_bot/116730304985870885
- From Fork-Join to Asynchronous Tasks: Parallelizing Tiled Cholesky Decomposition with OpenMP and HPX
Alexander Strack, Alexander Van Craen, Dirk Pflüger
https://arxiv.org/abs/2606.11937 https://mastoxiv.page/@arXiv_csDC_bot/116730405487451638