SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?
Xiang Deng, Jeff Da, Edwin Pan, Yannis Yiming He, Charles Ide, Kanak Garg, Niklas Lauffer, Andrew Park, Nitin Pasari, Chetan Rane, Karmini Sampath, Maya Krishnan, Srivatsa Kundurthy, Sean Hendryx, Zifan Wang, Chen Bo Calvin Zhang, Noah Jacobson, Bing Liu, Brad Kenstler
https://
Sobre mi anterior Retoot. Os he contado que en el trabajo, por temas del cliente, tuve que pedir que me dieran un ordenador con Windows. Lo primero que hice fue instalar el WSL y aplicaciones comi Emacs, Age... Y usando aplicaciones que son S
Software Libre o casi, como Libreoffice, KeePassXC, Vivaldi...
Folks creating free and open software for the common good: here’s a checklist of everything else you should be doing so corporations can make the best use of your free labour.
I mean, sure, some good security tips here and worth reading anyway but do fuck off with holding free software developers to account for supply chain attacks. It’s your fucking supply chain, not ours, you fucking corporation.
https://libre.fm is freedom-respecting scrobbling software, the open, community-driven alternative to https://last.fm and
Please don't upload my code on GitHub
This is a call to free/libre and open source software developers to not upload the work of others to GitHub.
🚫 #NoGitHub
AutoEmpirical: LLM-Based Automated Research for Empirical Software Fault Analysis
Jiongchi Yu, Weipeng Jiang, Xiaoyu Zhang, Qiang Hu, Xiaofei Xie, Chao Shen
https://arxiv.org/abs/2510.04997
Euclid preparation. Cosmology Likelihood for Observables in Euclid (CLOE). 5. Extensions beyond the standard modelling of theoretical probes and systematic effects
Collaboration, Goh, Nouri-Zonoz, Pamuk, Ballardini, Bose, Ca\~nas-Herrera, Casas, Franco-Abell\'an, Ili\'c, Keil, Kunz, Le Brun, Lepori, Martinelli, Sakr, Sorrenti, Teixeira, Tutusaus, Blot, Bonici, Bonvin, Camera, Cardone, Carrilho, Di Domizio, Durrer, Farrens, Beauchamps, Joudaki, Moretti, Pezzotta, S\'anchez, …
BuildBench: Benchmarking LLM Agents on Compiling Real-World Open-Source Software
Zehua Zhang, Ati Priya Bajaj, Divij Handa, Siyu Liu, Arvind S Raj, Hongkai Chen, Hulin Wang, Yibo Liu, Zion Leonahenahe Basque, Souradip Nath, Vishal Juneja, Nikhil Chapre, Yan Shoshitaishvili, Adam Doup\'e, Chitta Baral, Ruoyu Wang
https://arxiv.org/abs/2…
InsightQL: Advancing Human-Assisted Fuzzing with a Unified Code Database and Parameterized Query Interface
Wentao Gao, Renata Borovica-Gajic, Sang Kil Cha, Tian Qiu, Van-Thuan Pham
https://arxiv.org/abs/2510.04835