2025-12-30 22:21:07
A reflection on AI advances in the past decade and how scaling and time-horizon trends might point to far greater capabilities in the decade ahead (Zhengdong Wang)
https://zhengdongwang.com/2025/12/30/2025-letter.html
A reflection on AI advances in the past decade and how scaling and time-horizon trends might point to far greater capabilities in the decade ahead (Zhengdong Wang)
https://zhengdongwang.com/2025/12/30/2025-letter.html
What a great read and overview, recommended !
"The State Of LLMs 2025: Progress, Problems, and Predictions"
#AI
If AI chatbot companies truly had what they claim they have (arbitrary scaling human-level intelligence)—they would use it exclusively themselves, prompting it to come up with schemes to make money and execute them.
In reality these companies all lose money (in historically unprecedented amounts), to fuel a drug dealer-like approach by giving it away for free and hoping enough people get addicted to sycophantic chatbots; with the goal to charge exorbitant fees for it in the future.
When I was a kid, I had a couple well-loved issues of G.I. Joe Special Missions, and this art from the cover of one of them was my absolute favorite. And I ran across it the other day again, and it still strikes me as extremely badass, even now. Just something about Snake Eyes scaling a wall in moonlight.
#art #comics …
I'm running into a really annoying render bug with the latest Kubuntu, which now uses Wayland. Anyone else seen this issue, and know how to fix it?
https://askubuntu.com/questions/1559959/chromium-apps-scaling-wrong-in-wayland-kubuntu-25-10
I need to read it properly, but this looks 🔥 https://arxiv.org/abs/2511.16652
BoN Appetit Team at LeWiDi-2025: Best-of-N Test-time Scaling Can Not Stomach Annotation Disagreements (Yet)
Tomas Ruiz, Siyao Peng, Barbara Plank, Carsten Schwemmer
https://arxiv.org/abs/2510.12516
Right after graduating in 2015, I was offered a job at the #BBC, working on scaling iPlayer. Well, continuing to scale iPlayer. It's one of my great what ifs.
It's an organisation I hugely admire, for many of the reasons that it is being attacked so viciously right now. Truth matters. Finding common ground matters. Keep the faith.
Lithuania scaling down exiled Belarusian oppositon leader's protection, her office pauses work: https://benborges.xyz/2025/10/08/lithuania-scaling-down-exiled-belarusian.html
#freshRss release notes:
"Scaling of user statistics in Web UI and CLI, to help instances with 1k users"
I feel a bit small with my installation for two users
https://github.com/FreshRSS/FreshRSS/r
Either KDE fixes the problems with their Wayland session (faulty font rendering with fractional scaling, no ability to remap touchpad gestures that are literally baked in at compile time, and so on) or I'll be shopping for a new desktop environment once they drop X11 in 2027. I'll not be railroaded into accepting a dogshit user experience because somebody wants to dunk on the chuds. Plasma isn't that special!
ARISE: An Adaptive Resolution-Aware Metric for Test-Time Scaling Evaluation in Large Reasoning Models
Zhangyue Yin, Qiushi Sun, Zhiyuan Zeng, Zhiyuan Yu, Qipeng Guo, Xuanjing Huang, Xipeng Qiu
https://arxiv.org/abs/2510.06014
DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving
Yingyan Li, Shuyao Shang, Weisong Liu, Bing Zhan, Haochen Wang, Yuqi Wang, Yuntao Chen, Xiaoman Wang, Yasong An, Chufeng Tang, Lu Hou, Lue Fan, Zhaoxiang Zhang
https://arxiv.org/abs/2510.12796
Q&A with Ilya Sutskever about model jaggedness, why we are moving beyond the "age of scaling", SSI's plan to straight-shot superintelligence, AGI, and more (Dwarkesh Patel/Dwarkesh Podcast)
https://www.dwarkesh.com/p/ilya-sutskever-2
Rob Gaskell of Sundial is presenting on a layer two protocol designed to enable bitcoin to generate yield.
Most bitcoin is still, in long term hodl. Not helping anyone.
Sure, you could lend your bitcoin for interest but that would count as a tax event and also involve losing custody.
What if a programmable sidechain to help with scaling, allow borrowing and lending and products retail and institutions like?
His solution is called Sundial and doesn't need new protocol changes or forks.
Hard to say what it actually does though? Presumably something like liquidity in sidechains? Didn't really seem to get what he actually is building. 🤷
#bitfest #bitcoin
On Uniformly Scaling Flows: A Density-Aligned Approach to Deep One-Class Classification
Faried Abu Zaid, Tim Katzke, Emmanuel M\"uller, Daniel Neider
https://arxiv.org/abs/2510.09452
Engineering atomic superradiance scaling in cavity QED system with collective and individual emission channels
Ruijin Sun, Xiang Guo, Andreas Ruschhaupt, Zhihai Wang
https://arxiv.org/abs/2510.12086
A new $1/(1-\rho)$-scaling bound for multiserver queues via a leave-one-out technique
Yige Hong
https://arxiv.org/abs/2510.11015 https://arxiv.org/pdf/2510…
Why is the UK scaling back jury trials, and why is it controversial? | Civil Rights News | Al Jazeera
https://www.aljazeera.com/news/2025/12/2/why-is-the-uk-scaling-back-jury-trials-and-why-is-it-controversial?traffic_source=rss
ElasticMoE: An Efficient Auto Scaling Method for Mixture-of-Experts Models
Gursimran Singh (Huawei Technologies Canada), Timothy Yu (Huawei Technologies Canada), Haley Li (Huawei Technologies Canada), Cheng Chen (Huawei Technologies Canada), Hanieh Sadri (Huawei Technologies Canada), Qintao Zhang (Huawei Technologies China), Yu Zhang (Huawei Technologies China), Ying Xiong (Huawei Technologies Canada), Yong Zhang (Huawei Technologies Canada), Zhenan Fan (Huawei Technologies Canada)
I missed this one:
#12886 [css-fonts-5] Text Fitting: Default scaling limit
https://github.com/w3c/csswg-drafts/issues/12886
Essentially a discussion how responsive text can satisfy 1.4.4, especially in this fit-to-container pitch from Google.
I added a comment …
Privacy Enhancement in Over-the-Air Federated Learning via Adaptive Receive Scaling
Faeze Moradi Kalarde, Ben Liang, Min Dong, Yahia A. Eldemerdash Ahmed, Ho Ting Cheng
https://arxiv.org/abs/2510.03860
Scaling Up: Lessons From The World's Best CEOs And Founders
Great Australian Pods Podcast Directory: https://www.greataustralianpods.com/scaling-up-lessons-from-the-worlds-best-ceos-and-founders/
David #Lammy,
“Trials are a fundamental part of our democratic settlement. Criminal trials without juries are a bad idea.”
And,
<<I have principles. If they are inconvenient, I have others,>>
Why is the UK scaling back jury trials, and why is it controversial? | Civil Rights News | Al Jazeera
Key Considerations for Auto-Scaling: Lessons from Benchmark Microservices
Majid Dashtbani, Ladan Tahvildari
https://arxiv.org/abs/2510.02585 https://arxiv.…
There's a type of guy whose only contribution at work is sheer volume of outputs, whether or not they serve any purpose. #AI tools ask us, "what if everyone could be that guy?"
It turns out that the result is bad for everyone. Systems lose their ability to evaluate whether outputs are fit for purpose. Shared intent disappears.
Scaling up trash only makes more trash. Work tha…
Allometric scaling of brain activity explained by avalanche criticality
Tiago S. A. N. Sim\~oes, Jos\'e S. Andrade Jr., Hans J. Herrmann, Stefano Zapperi, Lucilla de Arcangelis
https://arxiv.org/abs/2512.10834 https://arxiv.org/pdf/2512.10834 https://arxiv.org/html/2512.10834
arXiv:2512.10834v1 Announce Type: new
Abstract: Allometric scaling laws, such as Kleiber's law for metabolic rate, highlight how efficiency emerges with size across living systems. The brain, with its characteristic sublinear scaling of activity, has long posed a puzzle: why do larger brains operate with disproportionately lower firing rates? Here we show that this economy of scale is a universal outcome of avalanche dynamics. We derive analytical scaling laws directly from avalanche statistics, establishing that any system governed by critical avalanches must exhibit sublinear activity-size relations. This theoretical prediction is then verified in integrate-and-fire neuronal networks at criticality and in classical self-organized criticality models, demonstrating that the effect is not model-specific but generic. The predicted exponents align with experimental observations across mammal species, bridging dynamical criticality with the allometry of brain metabolism. Our results reveal avalanche criticality as a fundamental mechanism underlying Kleiber-like scaling in the brain.
toXiv_bot_toot
The evolution of influence operations
from crude Russian troll farms to sophisticated AI systems using large language models;
the discovery of GoLaxy documents revealing a "Smart Propaganda System" that collects millions of data points daily, builds psychological profiles, and generates resilient personas;
the fundamental challenges of measuring effectiveness;
GoLaxy's ties to Chinese intelligence agencies;
operations targeting Hong Kong's…
Unify Variables in Neural Scaling Laws for General Audio Representations via Embedding Effective Rank
Xuyao Deng, Yanjie Sun, Yong Dou, Kele Xu
https://arxiv.org/abs/2510.10948 …
Scaling Properties of Avalanche Activity in the Two-Dimensional Abelian Sandpile Model
Anubhav Ganguly
https://arxiv.org/abs/2510.09631 https://arxiv.org/p…
Neuralink scaling? At some point, with growing numbers of Neuralink brain implant recipients, Neuralink will be busier with trying to fix broken implants than with new implants for new patients. If not, where's support? https://chatgpt.com/share/690a6c48-8f98-8004-9d69-c2a05198d…
SITCOM: Scaling Inference-Time COMpute for VLAs
Ayudh Saxena, Harsh Shah, Sandeep Routray, Rishi Rajesh Shah, Esha Pahwa
https://arxiv.org/abs/2510.04041 https://
Scaling Law in LLM Simulated Personality: More Detailed and Realistic Persona Profile Is All You Need
Yuqi Bai, Tianyu Huang, Kun Sun, Yuting Chen
https://arxiv.org/abs/2510.11734
So, I guess no one has any idea how to fix Wayland vs. Chromium?
https://askubuntu.com/questions/1559959/chromium-apps-scaling-wrong-in-wayland-kubuntu-25-10
Towards Inference-time Scaling for Continuous Space Reasoning
Minghan Wang, Thuy-Trang Vu, Ehsan Shareghi, Gholamreza Haffari
https://arxiv.org/abs/2510.12167 https://
NaViL: Rethinking Scaling Properties of Native Multimodal Large Language Models under Data Constraints
Changyao Tian, Hao Li, Gen Luo, Xizhou Zhu, Weijie Su, Hanming Deng, Jinguo Zhu, Jie Shao, Ziran Zhu, Yunpeng Liu, Lewei Lu, Wenhai Wang, Hongsheng Li, Jifeng Dai
https://arxiv.org/abs/2510.08565 …
Scaling crossover of the generalized Jeffreys-type law
Fugui Ma
https://arxiv.org/abs/2510.07930 https://arxiv.org/pdf/2510.07930
Scaling of Magnetic Domain Walls in Perpendicular Magnetic Anisotropy Systems
Guowen Gong, Changmin Xiong, Lijun Zhu
https://arxiv.org/abs/2510.10230 https://
Kibble-Zurek Scaling and Spatial Statistics in Quenched Binary Bose Superfluids
Subhadeep Patra, Arko Roy, Seong-Ho Shinn, Adolfo del Campo, Mithun Thudiyangal
https://arxiv.org/abs/2510.12770
Pushing Test-Time Scaling Limits of Deep Search with Asymmetric Verification
Weihao Zeng, Keqing He, Chuqiao Kuang, Xiaoguang Li, Junxian He
https://arxiv.org/abs/2510.06135 htt…
CMOS 2.0 - Redefining the Future of Scaling
Moritz Brunion, Navaneeth Kunhi Purayil, Francesco Dell'Atti, Sebastian Lam, Refik Bilgic, Mehdi Tahoori, Luca Benini, Julien Ryckaert
https://arxiv.org/abs/2510.04535
Geometric Model Selection for Latent Space Network Models: Hypothesis Testing via Multidimensional Scaling and Resampling Techniques
Jieyun Wang, Anna L. Smith
https://arxiv.org/abs/2510.06136
"Scientists Develop Cigarette Butt Asphalt to Build Stronger Roads"
#Roads #Recycling
https://hap…
Well, that's two ways of putting it.
https://www.theverge.com/news/823750/european-union-ai-act-gdpr-changes
Monthly Rural-Urban Scaling of Road Accidents in England, Wales and Scotland (2019-2023)
Isabel Copsey, Quentin Hanley, Jack Sutton
https://arxiv.org/abs/2510.07351 https://
Superradiance and Superabsorption Engine of $N$ Two-Level Systems: $N^{2}$-Power Scaling at Near-Unity Efficiency
L. F. Alves da Silva, H. Sanchez, M. A. Ponte, M. H. Y. Moussa, Norton G. de Almeida
https://arxiv.org/abs/2510.12017
Zephyrus: Scaling Gateways Beyond the Petabit-Era with DPU-Augmented Hierarchical Co-Offloading
Yuemeng Xu, Haoran Chen, Jiarui Guo, Mingwei Cui, Qiuheng Yin, Cheng Dong, Daxiang Kang, Xian Wu, Chenmin Sun, Peng He, Yang Gao, Lirong Lai, Kai Wang, Hongyu Wu, Tong Yang, Xiyun Xu
https://arxiv.org/abs/2510.11043
Scaling Homomorphic Applications in Deployment
Ryan Marinelli, Angelica Chowdhury
https://arxiv.org/abs/2510.02376 https://arxiv.org/pdf/2510.02376
Universal scaling of shear thickening suspensions under acoustic perturbation
Anna R. Barth, Navneet Singh, Stephen J. Thornton, Pranav Kakhandiki, Edward Y. X. Ong, Meera Ramaswamy, Abhishek M. Shetty, Bulbul Chakraborty, James P. Sethna, Itai Cohen
https://arxiv.org/abs/2510.11820
xLSTM Scaling Laws: Competitive Performance with Linear Time-Complexity
Maximilian Beck, Kajetan Schweighofer, Sebastian B\"ock, Sebastian Lehner, Sepp Hochreiter
https://arxiv.org/abs/2510.02228 …
I gotta say, #cosmicdesktop gets many things right even in its alpha state. Using it right now. X11 apps and Electron garbage all work fine with fractional scaling. Can't say that about #gnome49. Oh, and quarter tiling by default! Optimistic for what the future brings for it.
Are neural scaling laws leading quantum chemistry astray?
Siwoo Lee, Adji Bousso Dieng
https://arxiv.org/abs/2509.26397 https://arxiv.org/pdf/2509.26397
Theory of Scaling Laws for In-Context Regression: Depth, Width, Context and Time
Blake Bordelon, Mary I. Letey, Cengiz Pehlevan
https://arxiv.org/abs/2510.01098 https://
Dual Data Scaling for Robust Two-Stage User-Defined Keyword Spotting
Zhiqi Ai, Han Cheng, Yuxin Wang, Shiyi Mu, Shugong Xu, Yongjin Zhou
https://arxiv.org/abs/2510.10740 https:/…
Lingxi: Repository-Level Issue Resolution Framework Enhanced by Procedural Knowledge Guided Scaling
Xu Yang, Jiayuan Zhou, Michael Pacheco, Wenhan Zhu, Pengfei He, Shaowei Wang, Kui Liu, Ruiqi Pan
https://arxiv.org/abs/2510.11838
Prompting Test-Time Scaling Is A Strong LLM Reasoning Data Augmentation
Sondos Mahmoud Bsharat, Zhiqiang Shen
https://arxiv.org/abs/2510.09599 https://arxi…
The EU unveils proposed updates to GDPR, including simplifying cookie permission pop-ups, and plans to water down the AI Act, after US and tech company pressure (The Verge)
https://www.theverge.com/news/823750/european-union-ai-act-gdpr-changes
On the Role of Temperature Sampling in Test-Time Scaling
Yuheng Wu, Azalia Mirhoseini, Thierry Tambe
https://arxiv.org/abs/2510.02611 https://arxiv.org/pdf…
Comparing Cross-Platform Performance via Node-to-Node Scaling Studies
Kenneth Weiss, Thomas M. Stitt, Daryl Hawkins, Olga Pearce, Stephanie Brink, Robert N. Rieben
https://arxiv.org/abs/2510.12166
BroRL: Scaling Reinforcement Learning via Broadened Exploration
Jian Hu, Mingjie Liu, Ximing Lu, Fang Wu, Zaid Harchaoui, Shizhe Diao, Yejin Choi, Pavlo Molchanov, Jun Yang, Jan Kautz, Yi Dong
https://arxiv.org/abs/2510.01180
A Tauberian approach to metric scaling limits of random discrete structures, with an application to random planar maps
William Fleurat
https://arxiv.org/abs/2510.05078 https://
Scaling up AI requires staggering amounts of power and water
— especially when considering that many areas are already dealing with strained grids or drought conditions.
Even when optimized, a single hyperscale facility can draw as muchpower as a mid-sized city
and millions of gallons of water annually.
Professor Romany Webb, deputy director of Columbia University's Sabin Center for Climate Change Law, explained the challenge:
"Data centers are incred…
BLAZER: Bootstrapping LLM-based Manipulation Agents with Zero-Shot Data Generation
Rocktim Jyoti Das, Harsh Singh, Diana Turmakhan, Muhammad Abdullah Sohail, Mingfei Han, Preslav Nakov, Fabio Pizzati, Ivan Laptev
https://arxiv.org/abs/2510.08572
AutoDAN-Reasoning: Enhancing Strategies Exploration based Jailbreak Attacks with Test-Time Scaling
Xiaogeng Liu, Chaowei Xiao
https://arxiv.org/abs/2510.05379 https://
Extreme events scaling in self-organized critical models
Abdul Quadir, Haider Hasan Jafri
https://arxiv.org/abs/2510.08733 https://arxiv.org/pdf/2510.08733…
DeepPrune: Parallel Scaling without Inter-trace Redundancy
Shangqing Tu, Yaxuan Li, Yushi Bai, Lei Hou, Juanzi Li
https://arxiv.org/abs/2510.08483 https://…
Generalized Parallel Scaling with Interdependent Generations
Harry Dong, David Brandfonbrener, Eryk Helenowski, Yun He, Mrinal Kumar, Han Fang, Yuejie Chi, Karthik Abinav Sankararaman
https://arxiv.org/abs/2510.01143
Go with Your Gut: Scaling Confidence for Autoregressive Image Generation
Harold Haodong Chen, Xianfeng Wu, Wen-Jie Shu, Rongjin Guo, Disen Lan, Harry Yang, Ying-Cong Chen
https://arxiv.org/abs/2509.26376
Anthropic commits to buy $30B in Azure capacity in a new deal with Microsoft and Nvidia, which commit to invest up to $5B and $10B, respectively, in Anthropic (Microsoft)
https://blogs.microsoft.com/blog/2025/11/18/microsoft-nvidia-…
Critical attention scaling in long-context transformers
Shi Chen, Zhengjiang Lin, Yury Polyanskiy, Philippe Rigollet
https://arxiv.org/abs/2510.05554 https://
Quenching, Fast and Slow: Breaking Kibble-Zurek Universal Scaling by Jumping along Geodesics
Thi Ha Kyaw, Guillermo Romero, Gaurav Saxena
https://arxiv.org/abs/2510.08528 https:…
CodeChemist: Functional Knowledge Transfer for Low-Resource Code Generation via Test-Time Scaling
Kaixin Wang, Tianlin Li, Xiaoyu Zhang, Aishan Liu, Xianglong Liu, Ziqi Liu, Zhiqiang Zhang, Jun Zhou, and Bin Shi
https://arxiv.org/abs/2510.00501
Generalized Parallel Scaling with Interdependent Generations
Harry Dong, David Brandfonbrener, Eryk Helenowski, Yun He, Mrinal Kumar, Han Fang, Yuejie Chi, Karthik Abinav Sankararaman
https://arxiv.org/abs/2510.01143
DiTSinger: Scaling Singing Voice Synthesis with Diffusion Transformer and Implicit Alignment
Zongcai Du, Guilin Deng, Xiaofeng Guo, Xin Gao, Linke Li, Kaichang Cheng, Fubo Han, Siyu Yang, Peng Liu, Pan Zhong, Qiang Fu
https://arxiv.org/abs/2510.09016
Scaling Language-Centric Omnimodal Representation Learning
Chenghao Xiao, Hou Pong Chan, Hao Zhang, Weiwen Xu, Mahani Aljunied, Yu Rong
https://arxiv.org/abs/2510.11693 https://…
Resolution scaling governs DINOv3 transfer performance in chest radiograph classification
Soroosh Tayebi Arasteh, Mina Shaigan, Christiane Kuhl, Jakob Nikolas Kather, Sven Nebelung, Daniel Truhn
https://arxiv.org/abs/2510.07191
Best-of-Majority: Minimax-Optimal Strategy for Pass@$k$ Inference Scaling
Qiwei Di, Kaixuan Ji, Xuheng Li, Heyang Zhao, Quanquan Gu
https://arxiv.org/abs/2510.03199 https://
Q&A with Rivian founder and CEO RJ Scaringe on founding Rivian in 2009, production challenges, the VW partnership, autonomy, AI, EVs, chips, CarPlay, and more (Ben Thompson/Stratechery)
https://stratechery.com/2025/an-interv
Verifier-free Test-Time Sampling for Vision Language Action Models
Suhyeok Jang, Dongyoung Kim, Changyeon Kim, Youngsuk Kim, Jinwoo Shin
https://arxiv.org/abs/2510.05681 https:/…
TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning
Jiaru Zou, Soumya Roy, Vinay Kumar Verma, Ziyi Wang, David Wipf, Pan Lu, Sumit Negi, James Zou, Jingrui He
https://arxiv.org/abs/2510.06217
Nonlinear Heisenberg Limit via Uncertainty Principle in Quantum Metrology
Binke Xia, Jingzheng Huang, Yuxiang Yang, Guihua Zeng
https://arxiv.org/abs/2510.09216 https://
Multi-Dimensional Autoscaling of Stream Processing Services on Edge Devices
Boris Sedlak, Philipp Raith, Andrea Morichetta, V\'ictor Casamayor Pujol, Schahram Dustdar
https://arxiv.org/abs/2510.06882
Crypto investor Roger Ver, aka "Bitcoin Jesus", reaches a deferred prosecution agreement with the US DOJ and pays ~$50M to resolve a tax evasion indictment (Ben Weiss/Fortune)
https://fortune.com/crypto/2025/10/14/roger…
Auto-scaling Continuous Memory for GUI Agent
Wenyi Wu, Kun Zhou, Ruoxin Yuan, Vivian Yu, Stephen Wang, Zhiting Hu, Biwei Huang
https://arxiv.org/abs/2510.09038 https://
MLE-Smith: Scaling MLE Tasks with Automated Multi-Agent Pipeline
Rushi Qiang, Yuchen Zhuang, Anikait Singh, Percy Liang, Chao Zhang, Sherry Yang, Bo Dai
https://arxiv.org/abs/2510.07307
Scaling LLM Multi-turn RL with End-to-end Summarization-based Context Management
Miao Lu, Weiwei Sun, Weihua Du, Zhan Ling, Xuesong Yao, Kang Liu, Jiecao Chen
https://arxiv.org/abs/2510.06727
Recursive Self-Aggregation Unlocks Deep Thinking in Large Language Models
Siddarth Venkatraman, Vineet Jain, Sarthak Mittal, Vedant Shah, Johan Obando-Ceron, Yoshua Bengio, Brian R. Bartoldson, Bhavya Kailkhura, Guillaume Lajoie, Glen Berseth, Nikolay Malkin, Moksh Jain
https://arxiv.org/abs/2509.26626…
Shape Happens: Automatic Feature Manifold Discovery in LLMs via Supervised Multi-Dimensional Scaling
Federico Tiblias, Irina Bigoulaeva, Jingcheng Niu, Simone Balloccu, Iryna Gurevych
https://arxiv.org/abs/2510.01025
Test-Time Scaling in Diffusion LLMs via Hidden Semi-Autoregressive Experts
Jihoon Lee, Hoyeon Moon, Kevin Zhai, Arun Kumar Chithanar, Anit Kumar Sahu, Soummya Kar, Chul Lee, Souradip Chakraborty, Amrit Singh Bedi
https://arxiv.org/abs/2510.05040
Revisiting Direct Speech-to-Text Translation with Speech LLMs: Better Scaling than CoT Prompting?
Oriol Pareras, Gerard I. G\'allego, Federico Costa, Cristina Espa\~na-Bonet, Javier Hernando
https://arxiv.org/abs/2510.03093
Shape Happens: Automatic Feature Manifold Discovery in LLMs via Supervised Multi-Dimensional Scaling
Federico Tiblias, Irina Bigoulaeva, Jingcheng Niu, Simone Balloccu, Iryna Gurevych
https://arxiv.org/abs/2510.01025
An interview with Sam Altman and OpenAI President Greg Brockman on the tepid initial reception to GPT-5's launch, scaling, reinforcement learning, AGI, and more (Steven Levy/Wired)
https://www.wired.com/story/sam-altman-says-the-gpt-5-haters-got-it-all-wron…
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI
Suwhan Choi, Jaeyoon Jung, Haebin Seong, Minchan Kim, Minyeong Kim, Yongjun Cho, Yoonshik Kim, Yubeen Park, Youngjae Yu, Yunsung Lee
https://arxiv.org/abs/2510.05684
Parallel Scaling Law: Unveiling Reasoning Generalization through A Cross-Linguistic Perspective
Wen Yang, Junhong Wu, Chong Li, Chengqing Zong, Jiajun Zhang
https://arxiv.org/abs/2510.02272
Scaling Spoken Language Models with Syllabic Speech Tokenization
Nicholas Lee, Cheol Jun Cho, Alan W Black, Gopala K. Anumanchipalli
https://arxiv.org/abs/2509.26634 https://
Laminar: A Scalable Asynchronous RL Post-Training Framework
Guangming Sheng, Yuxuan Tong, Borui Wan, Wang Zhang, Chaobo Jia, Xibin Wu, Yuqi Wu, Xiang Li, Chi Zhang, Yanghua Peng, Haibin Lin, Xin Liu, Chuan Wu
https://arxiv.org/abs/2510.12633