
2025-09-15 09:36:11
Neural Scaling Laws for Deep Regression
Tilen Cadez, Kyoung-Min Kim
https://arxiv.org/abs/2509.10000 https://arxiv.org/pdf/2509.10000
Neural Scaling Laws for Deep Regression
Tilen Cadez, Kyoung-Min Kim
https://arxiv.org/abs/2509.10000 https://arxiv.org/pdf/2509.10000
OpenCodeReasoning-II: A Simple Test Time Scaling Approach via Self-Critique
Wasi Uddin Ahmad, Somshubra Majumdar, Aleksander Ficek, Sean Narenthiran, Mehrzad Samadi, Jocelyn Huang, Siddhartha Jain, Vahid Noroozi, Boris Ginsburg
https://arxiv.org/abs/2507.09075
Phenomenological Scaling Relations for SQM Stars with a Massive s-Quark in Gravitationally Strong Magnetic Fields under the Spherical Symmetry Approximation
{\L}ukasz Bratek, Joanna Ja{\l}ocha, Marek Kutschera
https://arxiv.org/abs/2507.10756
Scaling Up without Fading Out: Goal-Aware Sparse GNN for RL-based Generalized Planning
Sangwoo Jeon, Juchul Shin, Gyeong-Tae Kim, YeonJe Cho, Seongwoo Kim
https://arxiv.org/abs/2508.10747
Interestingn post and replies.
I think that Apple’s incentives are just not aligned with developers in niche segments - purely a matter of scaling and managing the teams.
In most cases open source alternatives are vastly better.
But the question remains, how can you finance the maintenance and evolution in the long term - volunteer work is not sustainable.
It is remarkable that after 25 years we still haven’t found a viable and scalable funding model for this.
LaSM: Layer-wise Scaling Mechanism for Defending Pop-up Attack on GUI Agents
Zihe Yan, Zhuosheng Zhang
https://arxiv.org/abs/2507.10610 https://
Scaling High-Performance Nanoribbon Transistors with Monolayer Transition Metal Dichalcogenides
Tara Pe\~na, Anton E. O. Persson, Andrey Krayev, \'Ashildur Fri{\dh}riksd\'ottir, Kathryn Neilson, Zhepeng Zhang, Anh Tuan Hoang, Jerry A. Yang, Lauren Hoang, Andrew J. Mannix, Paul C. McIntyre, Eric Pop
https://arxiv.org/abs/2509.09964
Despite hype and optimistic projections, the humanoid robot industry faces hurdles, from battery life and design to limited demand for large-scale deployments (Evan Ackerman/IEEE Spectrum)
https://spectrum.ieee.org/humanoid-robot-scaling
Scaling the memory wall using mixed-precision -- HPG-MxP on an exascale machine
Aditya Kashi, Nicholson Koukpaizan, Hao Lu, Michael Matheson, Sarp Oral, Feiyi Wang
https://arxiv.org/abs/2507.11512
"This convening is not an isolated event. It’s the first step in scaling a movement that reimagines how values-driven open science organisations can grow together, share resources, and strengthen the ecosystem we collectively serve. By working together, we can ensure that the future of open science is not only innovative, but also equitable, sustainable, and resilient."
https://www.carpentries.org/blog/2025/09/convening-to-reclaim-and-sustain-open-science-communities/
Great to see this forward looking collaborative work being done by @… @… @… @… and OLS
Scaling Learned Image Compression Models up to 1 Billion
Yuqi Li, Haotian Zhang, Li Li, Dong Liu, Feng Wu
https://arxiv.org/abs/2508.09075 https://arxiv.or…
Accurate Reduced Floating-Point Precision Implicit Monte Carlo
Simon Butson, Mathew Cleveland, Alex Long, Todd Palmer
https://arxiv.org/abs/2506.11962 http…
Universal Scaling Laws for Deep Indentation Beyond the Hertzian Regime
Tong Mu, Changhong Linghu, Yanju Liu, Jinsong Leng, Huajian Gao, K. Jimmy Hsia
https://arxiv.org/abs/2506.11461
Scaling Relations, Morphological Stability, and Asymptotic Freedom of Plasma-Surface Deposition Dynamics
Joel Saucedo, Uday Lamba, Hasitha Mahabaduge
https://arxiv.org/abs/2507.10645
Spin and thermal current scaling at a $Y$-junction of XX spin chains
Domenico Giuliano, Francesco Buccheri
https://arxiv.org/abs/2508.10267 https://arxiv.o…
Scaling Arabic Medical Chatbots Using Synthetic Data: Enhancing Generative AI with Synthetic Patient Records
Abdulrahman Allam, Seif Ahmed, Ali Hamdi, Khaled Shaban
https://arxiv.org/abs/2509.10108
🔧 #MatryoshkaRepresentationLearning technique allows scaling output dimensions from default 3072 💰 Priced at $0.15 per 1M input tokens with free tier available ⚡
Universal self-similarity of hierarchical communities formed through a general self-organizing principle
Shruti Tandon (equal), Nidhi Dilip Sonwane (equal), Tobias Braun, Norbert Marwan, Juergen Kurths, R. I. Sujith
https://arxiv.org/abs/2507.11159
Evidence of scaling advantage on an NP-Complete problem with enhanced quantum solvers
Quanfeng Lu, Shijie Wei, Keren Li, Pan Gao, Bao Yan, Muxi Zheng, Haoran Zhang, Jinfeng Zeng, Gui-Lu Long
https://arxiv.org/abs/2508.08869
US ethnic cleansing and what to do about it
Reposting link to source article instead of screenshot of tweet that had no alt text:
Data on arrests shows that ICE was heavily engaged in racial profiling in LA, because their arrest numbers fell by ~66% after that were ordered to stop making arrests based just in factors like skin color, with place, or language spoken.
#ICE #USPol
SwiftSpec: Ultra-Low Latency LLM Decoding by Scaling Asynchronous Speculative Decoding
Ziyi Zhang, Ziheng Jiang, Chengquan Jiang, Menghan Yu, Size Zheng, Haibin Lin, Henry Hoffmann, Xin Liu
https://arxiv.org/abs/2506.11309
Comparison of Localization Algorithms between Reduced-Scale and Real-Sized Vehicles Using Visual and Inertial Sensors
Tobias Kern, Leon Tolksdorf, Christian Birkner
https://arxiv.org/abs/2507.11241
FuXi-\beta: Towards a Lightweight and Fast Large-Scale Generative Recommendation Model
Yufei Ye, Wei Guo, Hao Wang, Hong Zhu, Yuyang Ye, Yong Liu, Huifeng Guo, Ruiming Tang, Defu Lian, Enhong Chen
https://arxiv.org/abs/2508.10615
Grids Often Outperform Implicit Neural Representations
Namhoon Kim, Sara Fridovich-Keil
https://arxiv.org/abs/2506.11139 https://arxi…
Scaling behaviour of rotating convection in a spherical shell with different Prandtl numbers
Wei Fan, Qi Wang, Yufeng Lin
https://arxiv.org/abs/2508.09416 https://
FractalSync: Lightweight Scalable Global Synchronization of Massive Bulk Synchronous Parallel AI Accelerators
Victor Isachi, Alessandro Nadalini, Riccardo Fiorani Gallotta, Angelo Garofalo, Francesco Conti, Davide Rossi
https://arxiv.org/abs/2506.11668
A Prediction for Maximum Supercooling in SU(N) Confinement Transition
Prateek Agrawal, Gaurang Ramakant Kane, Vazha Loladze, John March-Russell
https://arxiv.org/abs/2508.10091 …
Critical Ising correlations on a torus
Baran Bayraktaroglu, Konstantin Izyurov
https://arxiv.org/abs/2506.11324 https://arxiv.org/pdf…
Invariant measures on moduli spaces of twisted holomorphic 1-forms and strata of dilation surfaces
Paul Apisa, Nick Salter
https://arxiv.org/abs/2507.10685
Universal Driven Critical Dynamics near the Boundary
Yu-Rong Shu, Shuai Yin
https://arxiv.org/abs/2509.10049 https://arxiv.org/pdf/2509.10049
Duty-Cycling is Not Enough in Constrained IoT Networking: Revealing the Energy Savings of Dynamic Clock Scaling
Michel Rottleuthner, Thomas C. Schmidt, Matthias W\"ahlisch
https://arxiv.org/abs/2508.09620
Profiling Multi-Level Operator Costs for Bottleneck Diagnosis in High-Speed Data Planes
Zhiyuan Ren, Yutao Liu, Wenchi Cheng, Kun Yang
https://arxiv.org/abs/2508.09574 https://
"À la question de savoir si l'intensification des approches actuelles de l'IA pourrait conduire Š l'intelligence artificielle générale (AGI), ou Š une IA Š usage général qui égalerait ou surpasserait la cognition humaine, 76 % des personnes interrogées ont répondu qu'il était "improbable" ou "très improbable" que cela réussisse."
#IA
Primordial Black Hole Formation and Spin in Matter Domination Revisited
Weitao Ye, Yungui Gong, Tomohiro Harada, Zhaofeng Kang, Kazunori Kohri, Daiki Saito, Chul-Moon Yoo
https://arxiv.org/abs/2508.10070
Mechanics-Informed Machine Learning for Geospatial Modeling of Soil Liquefaction: Global and National Surrogate Models for Simulation and Near-Real-Time Response
Morgan D. Sanger, Mertcan Geyin, Brett W. Maurer
https://arxiv.org/abs/2509.10962
Time Scaling Makes Accelerated Gradient Flow and Proximal Method Faster in Multiobjective Optimization
Yingdong Yin
https://arxiv.org/abs/2508.07254 https://
On surface energies in scaling laws for singular perturbation problems for martensitic phase transitions
Angkana R\"uland, Camillo Tissot, Antonio Tribuzio, Christian Zillinger
https://arxiv.org/abs/2507.06773 https://arxiv.org/pdf/2507.06773 https://arxiv.org/html/2507.06773
arXiv:2507.06773v1 Announce Type: new
Abstract: The objective of this article is to compare different surface energies for multi-well singular perturbation problems associated with martensitic phase transformations involving higher order laminates. We deduce scaling laws in the singular perturbation parameter which are robust in the choice of the surface energy (e.g., diffuse, sharp, an interpolation thereof or discrete). Furthermore, we show that these scaling laws do not require the presence of isotropic surface energies but that generically also highly anisotropic surface energies yield the same scaling results. More precisely, the presence of essentially generic partial directional derivatives in the regularization terms suffices to produce the same scaling behaviour as in the isotropic setting. The only sensitive directional dependences are directly linked to the lamination directions of the well structure -- and even for these only the ``inner-most'' lamination direction is of significance in determining the scaling law. In view of experimental applications, this shows that also for higher-order laminates, the precise structure of the surface energies -- which is often very difficult to determine experimentally -- does not have a crucial impact on the scaling behaviour of the investigated structures but only enters when considering finer properties.
toXiv_bot_toot
Primordial Black Hole Formation in a Scalar Field Dominated Universe: Investigation of the Critical nature of the Collapse
Luis E. Padilla, Ethan Milligan, David J. Mulryne, Juan Carlos Hidalgo
https://arxiv.org/abs/2509.10431
Replaced article(s) found for q-bio.NC. https://arxiv.org/list/q-bio.NC/new
[1/1]:
- Neuronal correlations shape the scaling behavior of memory capacity and nonlinear computational c...
Shotaro Takasu, Toshio Aoyagi
Numerical analysis of the large deviation regime of a kinetic equation with a nonlocal Hamilton-Jacobi limit
H\'el\`ene Hivert, Tino Laidin
https://arxiv.org/abs/2509.10323 …
Biological Processing Units: Leveraging an Insect Connectome to Pioneer Biofidelic Neural Architectures
Siyu Yu, Zihan Qin, Tingshan Liu, Beiya Xu, R. Jacob Vogelstein, Jason Brown, Joshua T. Vogelstein
https://arxiv.org/abs/2507.10951
Internal Slack message: OpenAI has hired four high-profile engineers from Tesla, xAI, and Meta, including David Lau, former VP of software engineering at Tesla (Wired)
https://www.wired.com/story/openai-new-hires-scaling/
Compartmentalised Agentic Reasoning for Clinical NLI
Ma\"el Jullien, Lei Xu, Marco Valentino, Andr\'e Freitas
https://arxiv.org/abs/2509.10222 https://
Insights for Early Massive Black Hole Growth from JWST Detection of the [Ne v] {\lambda}3427 Emission Line
Benny Trakhtenbrot, Claudio Ricci, Ezequiel Treister, Michael J. Koss, Richard Mushotzky, Kyuseok Oh, Alessandro Peca, Franz E. Bauer, Kriti Kamal Gupta, Tomer Reiss
https://arxiv.org/abs/2507.10681
Crosslisted article(s) found for q-bio.BM. https://arxiv.org/list/q-bio.BM/new
[1/1]:
- Physical Principles of Size and Frequency Scaling of Active Cytoskeletal Spirals
Aman Soni, Shivani A. Yadav, Chaitanya A. Athale
Natively Trainable Sparse Attention for Hierarchical Point Cloud Datasets
Nicolas Lapautre, Maria Marchenko, Carlos Miguel Pati\~no, Xin Zhou
https://arxiv.org/abs/2508.10758 ht…
The Darkfield Approach to Measuring Vacuum Birefringence and Light-by-Light Couplings -- A Proof-of-Principle Experiment
Michal Sm\'id, Pooyan Khademi, Carsten B\"ahtz, Erik Brambrink, Jindrich Chalupsky, Tom E. Cowan, Samuele Di Dio Cafiso, Sebastian G\"ode, J\"org Grenzer, Vera Hajkova, Peter Hilz, Willi Hippler, Hauke H\"opner, Alzbeta Horynova, Oliver Humphries, Simon Jelinek, Libor Juha, Felix Karbstein, Alejandro Laso-Garcia, Robert L\"otzsch, Aim\…
Crosslisted article(s) found for physics.bio-ph. https://arxiv.org/list/physics.bio-ph/new
[1/1]:
- Physical Principles of Size and Frequency Scaling of Active Cytoskeletal Spirals
Aman Soni, Shivani A. Yadav, Chaitanya A. Athale
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[3/3]:
- Taccel: Scaling Up Vision-based Tactile Robotics via High-performance GPU Simulation
Li, Du, Yu, Li, Zhao, Liu, Jiang, Zhu, Huang
Crosslisted article(s) found for cond-mat.mes-hall. https://arxiv.org/list/cond-mat.mes-hall/new
[1/1]:
- Scaling High-Performance Nanoribbon Transistors with Monolayer Transition Metal Dichalcogenides
na, et al.
Dynamic scaling of growing interfaces
Pierre Le Doussal
https://arxiv.org/abs/2507.08341 https://arxiv.org/pdf/2507.08341
Exploring Quantum Annealing for Coarse-Grained Protein Folding
Timon Scheiber, Matthias Heller, Andreas Giebel
https://arxiv.org/abs/2508.10660 https://arx…
Admissibility of Stein Shrinkage for Batch Normalization in the Presence of Adversarial Attacks
Sofia Ivolgina, P. Thomas Fletcher, Baba C. Vemuri
https://arxiv.org/abs/2507.08261
SWE-Mirror: Scaling Issue-Resolving Datasets by Mirroring Issues Across Repositories
Junhao Wang, Daoguang Zan, Shulin Xin, Siyao Liu, Yurong Wu, Kai Shen
https://arxiv.org/abs/2509.08724
Handows: A Palm-Based Interactive Multi-Window Management System in Virtual Reality
Jindu Wang, Ke Zhou, Haoyu Ren, Per Ola Kristensson, Xiang Li
https://arxiv.org/abs/2508.09469
InGaN Nanopixel Arrays on Single Crystal GaN Substrate
Nirmal Anand, Sadat Tahmeed Azad, Christy Giji Jenson, Dipon Kumar Ghosh, Md Zunaid Baten, Pei-Cheng Ku, Grzegorz Muziol, Sharif Sadaf
https://arxiv.org/abs/2506.11408
Counting sums of two powers
Anand Patel
https://arxiv.org/abs/2507.08337 https://arxiv.org/pdf/2507.08337
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
Haozhan Li, Yuxin Zuo, Jiale Yu, Yuhao Zhang, Zhaohui Yang, Kaiyan Zhang, Xuekai Zhu, Yuchen Zhang, Tianxing Chen, Ganqu Cui, Dehui Wang, Dingxiang Luo, Yuchen Fan, Youbang Sun, Jia Zeng, Jiangmiao Pang, Shanghang Zhang, Yu Wang, Yao Mu, Bowen Zhou, Ning Ding
https://arxiv.org/a…
The Knowledge-Reasoning Dissociation: Fundamental Limitations of LLMs in Clinical Natural Language Inference
Ma\"el Jullien, Marco Valentino, Andr\'e Freitas
https://arxiv.org/abs/2508.10777
HMD says it will "scale back" US operations, citing "a challenging geopolitical and economic environment", and appears to have stopped US sales of Nokia devices (Dominic Preston/The Verge)
https://www.theverge.com/news/705046/hmd-gl…
The Hidden Width of Deep ResNets: Tight Error Bounds and Phase Diagrams
L\'ena\"ic Chizat
https://arxiv.org/abs/2509.10167 https://arxiv.org/pdf/2…
SSRL: Self-Search Reinforcement Learning
Yuchen Fan, Kaiyan Zhang, Heng Zhou, Yuxin Zuo, Yanxu Chen, Yu Fu, Xinwei Long, Xuekai Zhu, Che Jiang, Yuchen Zhang, Li Kang, Gang Chen, Cheng Huang, Zhizhou He, Bingning Wang, Lei Bai, Ning Ding, Bowen Zhou
https://arxiv.org/abs/2508.10874
RewardDance: Reward Scaling in Visual Generation
Jie Wu, Yu Gao, Zilyu Ye, Ming Li, Liang Li, Hanzhong Guo, Jie Liu, Zeyue Xue, Xiaoxia Hou, Wei Liu, Yan Zeng, Weilin Huang
https://arxiv.org/abs/2509.08826
A scalable quantum-neural hybrid variational algorithm for ground state estimation
Minwoo Kim, Kyoung Keun Park, Uihwan Jeong, Sanghyeon Lee, Taehyun Kim
https://arxiv.org/abs/2507.11002
Casimir scaling in glueballs in SU($N$) and Sp($2N$) gauge theories: hints from constituent approaches
F. Buisseret, C. Chevalier, V. Mathieu, C. Semay
https://arxiv.org/abs/2509.09454
Characterizing the Efficiency of Distributed Training: A Power, Performance, and Thermal Perspective
Seokjin Go, Joongun Park, Spandan More, Hanjiang Wu, Irene Wang, Aaron Jezghani, Tushar Krishna, Divya Mahajan
https://arxiv.org/abs/2509.10371
From Kardar-Parisi-Zhang scaling to soliton proliferation in Josephson junction arrays
Mikheil Tsitsishvili, Reinhold Egger, Karsten Flensberg, Sebastian Diehl
https://arxiv.org/abs/2509.08479
Optimal trace norms for Helmholtz problems
Benedikt Gr\"a{\ss}le
https://arxiv.org/abs/2506.11944 https://arxiv.org/pdf/2506.119…
Semi-empirical constraints on the HI mass function of star-forming galaxies and $\Omega_{\rm HI}$ at $z\sim 0.37$ from interferometric surveys
Francesco Sinigaglia, Alessandro Bianchetti, Giulia Rodighiero, Lucio Mayer, Miroslava Dessauges-Zavadsky, Ed Elson, Mattia Vaccari, Matt J. Jarvis
https://arxiv.org/abs/2506.11280…
The growth of magnetic energy during the nonlinear phase of the subsonic and supersonic small-scale dynamo
Neco Kriel, James R. Beattie, Mark R. Krumholz, Jennifer Schober, Patrick J. Armstrong
https://arxiv.org/abs/2509.09949
Certifying and learning quantum Ising Hamiltonians
Andreas Bluhm, Matthias C. Caro, Francisco Escudero Guti\'errez, Aadil Oufkir, Cambyse Rouz\'e
https://arxiv.org/abs/2509.10239
Cyclic Data Streaming on GPUs for Short Range Stencils Applied to Molecular Dynamics
Martin Rose, Simon Homes, Lukas Ramsperger, Jose Gracia, Christoph Niethammer, Jadran Vrabec
https://arxiv.org/abs/2507.11289
Source: Nvidia is scaling back DGX Cloud to primarily internal R&D use; DGX Cloud was initially envisioned to compete with major cloud providers like AWS (Anissa Gardizy/The Information)
https://www.theinformation.com/articles/nvidia-steps-back-cloud-ef…
Video Parallel Scaling: Aggregating Diverse Frame Subsets for VideoLLMs
Hyungjin Chung, Hyelin Nam, Jiyeon Kim, Hyojun Go, Byeongjun Park, Junho Kim, Joonseok Lee, Seongsu Ha, Byung-Hoon Kim
https://arxiv.org/abs/2509.08016
Hilbert subspace imprint: a new mechanism for non-thermalization
Hui Yu, Jiangping Hu, Shi-Xin Zhang
https://arxiv.org/abs/2506.11922 https://
Electron Heating in Hypersonic Flows: A New Thermodynamically Consistent Model
Felipe Martin Rodriguez Fuentes, Bernard Parent
https://arxiv.org/abs/2506.11457
Ergodicity detection algorithms: Scaling of ergodicity in random symbolic dynamics
M. S\"uzen
https://arxiv.org/abs/2508.08319 https://arxiv.org/pdf/2…
Coordinated Reinforcement Learning Prefetching Architecture for Multicore Systems
Mohammed Humaid Siddiqui, Fernando Guzman, Yufei Wu, Ruishu Ann
https://arxiv.org/abs/2509.10719
Compass-v3: Scaling Domain-Specific LLMs for Multilingual E-Commerce in Southeast Asia
Sophia Maria
https://arxiv.org/abs/2509.09121 https://arxiv.org/pdf/…
Sticker-TTS: Learn to Utilize Historical Experience with a Sticker-driven Test-Time Scaling Framework
Jie Chen, Jinhao Jiang, Yingqian Min, Zican Dong, Shijie Wang, Wayne Xin Zhao, Ji-Rong Wen
https://arxiv.org/abs/2509.05007
Pony AI says it rolled out 200 Gen-7 Robotaxis since mass production started two months ago, putting it on track for its 1,000-vehicle goal by the end of 2025 (Bloomberg)
https://www.bloomberg.com/news/articles/2025-08-13/po…
Microscopic calculation of two-particle-two-hole meson-exchange currents in $^{40}$Ar and asymmetric scaling properties for neutrino and electron scattering
V. L. Martinez-Consentino, J. Segovia, J. E. Amaro
https://arxiv.org/abs/2509.08916
Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search
Xin Lai, Junyi Li, Wei Li, Tao Liu, Tianjian Li, Hengshuang Zhao
https://arxiv.org/abs/2509.07969 …
Multi-agent Reinforcement Learning-based In-place Scaling Engine for Edge-cloud Systems
Jovan Prodanov, Bla\v{z} Bertalani\v{c}, Carolina Fortuna, Shih-Kai Chou, Matja\v{z} Branko Juri\v{c}, Ramon Sanchez-Iborra, Jernej Hribar
https://arxiv.org/abs/2507.07671
Multiparameter quantum metrology at Heisenberg scaling for an arbitrary two-channel linear interferometer with squeezed light
Atmadev Rai, Danilo Triggiani, Paolo Facchi, Vincenzo Tamma
https://arxiv.org/abs/2509.07574
Constraint correlation functions of the Ising model in the scaling limit
Ivan Balog, Adam Ran\c{c}on
https://arxiv.org/abs/2509.08557 https://arxiv.org/pdf…
Uncovering Scaling Laws for Large Language Models via Inverse Problems
Arun Verma, Zhaoxuan Wu, Zijian Zhou, Xiaoqiang Lin, Zhiliang Chen, Rachael Hwee Ling Sim, Rui Qiao, Jingtan Wang, Nhung Bui, Xinyuan Niu, Wenyang Hu, Gregory Kang Ruey Lau, Zi-Yu Khoo, Zitong Zhao, Xinyi Xu, Apivich Hemachandra, See-Kiong Ng, Bryan Kian Hsiang Low
https://
Speed Always Wins: A Survey on Efficient Architectures for Large Language Models
Weigao Sun, Jiaxi Hu, Yucheng Zhou, Jusen Du, Disen Lan, Kexin Wang, Tong Zhu, Xiaoye Qu, Yu Zhang, Xiaoyu Mo, Daizong Liu, Yuxuan Liang, Wenliang Chen, Guoqi Li, Yu Cheng
https://arxiv.org/abs/2508.09834
Unreal is all you need: Multimodal ISAC Data Simulation with Only One Engine
Kongwu Huang, Shiyi Mu, Jun Jiang, Yuan Gao, Shugong Xu
https://arxiv.org/abs/2507.08716
Meek Models Shall Inherit the Earth
Hans Gundlach, Jayson Lynch, Neil Thompson
https://arxiv.org/abs/2507.07931 https://arxiv.org/pdf…
Google and IBM believe the first industrial-scale quantum computer is in sight, potentially by 2030, but challenges like scaling from ~200 to 1M qubits remain (Richard Waters/Financial Times)
https://www.ft.com/content/2fe4b1a3-b0d3-403a-bdd3-f033b3e5e56a
Crown, Frame, Reverse: Layer-Wise Scaling Variants for LLM Pre-Training
Andrei Baroian, Kasper Notebomer
https://arxiv.org/abs/2509.06518 https://arxiv.org…
On the commutator scaling in Hamiltonian simulation with multi-product formulas
Kaoru Mizuta
https://arxiv.org/abs/2507.06557 https://
Scaling RL to Long Videos
Yukang Chen, Wei Huang, Baifeng Shi, Qinghao Hu, Hanrong Ye, Ligeng Zhu, Zhijian Liu, Pavlo Molchanov, Jan Kautz, Xiaojuan Qi, Sifei Liu, Hongxu Yin, Yao Lu, Song Han
https://arxiv.org/abs/2507.07966
CTTS: Collective Test-Time Scaling
Zhende Song, Shengji Tang, Peng Ye, Jiayuan Fan, Tao Chen
https://arxiv.org/abs/2508.03333 https://arxiv.org/pdf/2508.03…
On the Impact of Classical and Quantum Communication Networks Upon Modular Quantum Computing Architecture System Performance
Pau Escofet, Abhijit Das, Sahar Ben Rached, Santiago Rodrigo, Jordi Domingo, Fabio Sebastiano, Masoud Babaie, Batuhan Keskin, Edoardo Charbon, Peter Haring Bol\'ivar, Maurizio Palesi, Elena Blokhina, Bogdan Staszewski, Avishek Nag, Artur Garcia-S\'aez, Sergi Abadal, Eduard Alarc\'on, Carmen G. Almud\'ever
GPT-5's release was underwhelming, offering incremental improvements and failing to meet expectations, showing that pure scaling simply isn't the path to AGI (Gary Marcus/Marcus on AI)
https://garymarcus.substack.com/p/gpt-5-overdue-overhyped-and-underwhel…
HierMoE: Accelerating MoE Training with Hierarchical Token Deduplication and Expert Swap
Wenxiang Lin, Xinglin Pan, Lin Zhang, Shaohuai Shi, Xuan Wang, Xiaowen Chu
https://arxiv.org/abs/2508.09591
Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning
Ziyang Wang, Jaehong Yoon, Shoubin Yu, Md Mohaiminul Islam, Gedas Bertasius, Mohit Bansal
https://arxiv.org/abs/2507.06485
Symmetry-protected many-body Ramsey spectroscopy: precision scaling and robustness
Sijie Chen, Jiahao Huang, Min Zhuang, Chaohong Lee
https://arxiv.org/abs/2509.08291 https://…