Have I missed something or is there no way with openssh to exempt certain netblocks from the MaxStartups setting?
There's PerSourcePenaltyExemptList but that's spoecifically for the "penalties" things which are separate from MaxStartups, right?
https://
Stress Testing Deliberative Alignment for Anti-Scheming Training
Bronson Schoen, Evgenia Nitishinskaya, Mikita Balesni, Axel H{\o}jmark, Felix Hofst\"atter, J\'er\'emy Scheurer, Alexander Meinke, Jason Wolfe, Teun van der Weij, Alex Lloyd, Nicholas Goldowsky-Dill, Angela Fan, Andrei Matveiakin, Rusheb Shah, Marcus Williams, Amelia Glaese, Boaz Barak, Wojciech Zaremba, Marius Hobbhahn
Functionize, which offers a cloud platform that uses AI to speed up software testing, raised a $41M Series B, bringing its total funding to $67M (Maria Deutscher/SiliconANGLE)
https://siliconangle.com/2025/08/19/functionize-nabs-41m-speed-software-testing…
Tuning Random Generators: Property-Based Testing as Probabilistic Programming
Ryan Tjoa, Poorva Garg, Harrison Goldstein, Todd Millstein, Benjamin Pierce, Guy Van den Broeck
https://arxiv.org/abs/2508.14394
Finally tried "testing/synctest" and I must say this is A PERFECT way of a writing tests for async and cancelable code. No more flaky checking of random time ranges. No more selecting timeouts small enough to not kill the testing at all, but big enough that they'll work on a busy CI machine.
I put sleeps 1s, 2s, 5s, 10s in a loop and the test itself finishes in 0s. All asserts are for the exact time too.
LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries
Ming Yin, Dinghan Shen, Silei Xu, Jianbing Han, Sixun Dong, Mian Zhang, Yebowen Hu, Shujian Liu, Simin Ma, Song Wang, Sathish Reddy Indurthi, Xun Wang, Yiran Chen, Kaiqiang Song
https://arxiv.org/abs/2508.15760
Hmmm, I wonder how long before it leaves beta? Almost the time when I nuke WIN10 and upgrade.
https://mxlinux.org/blog/mx-25-infinity-beta-1-isos-now-available-for-testing-purposes/
Robust Self-Testing of Multiqudit Supersinglet Slater States via Constant Number of Binary Measurements
Arturo Konderak, Wojciech Bruzda, Remigiusz Augusiak
https://arxiv.org/abs/2508.15546
A Simple Apparatus for Testing PMT Humidity Tolerance
A. Germer, K. Park, C. Skuse, C. Yang, D. S. Parno
https://arxiv.org/abs/2507.13545 https://
Methodological considerations for semialgebraic hypothesis testing with incomplete U-statistics
David Barnhill, Marina Garrote-L\'opez, Elizabeth Gross, Max Hill, Bryson Kagy, John A. Rhodes, Joy Z. Zhang
https://arxiv.org/abs/2507.13531
Wow, yet another Thunderbird update.
It seems that Thunderbird, which ought to be rather stable by now, is getting updated more often than Chrome.
Either somebody is writing crummy code, not testing, or has a low acceptance hurdle for proposed "enhancements."
Deep Learning Framework Testing via Heuristic Guidance Based on Multiple Model Measurements
Yinglong Zou, Juan Zhai, Chunrong Fang, Yanzhou Mu, Jiawei Liu, Zhenyu Chen
https://arxiv.org/abs/2507.15181
Development and testing of integrated readout electronics for next generation SiSeRO (Single electron Sensitive Read Out) devices
Tanmoy Chattopadhyay, Haley R. Stueber, Abigail Y. Pan, Sven Herrmann, Peter Orel, Kevan Donlon, Steven W. Allen, Marshall W. Bautz, Michael Cooper, Catherine E. Grant, Beverly LaMarr, Christopher Leitz, Andrew Malonis, Eric D. Miller, R. Glenn Morris, Gregory Prigozhin, Ilya Prigozhin, Artem Poliszczuk, Keith Warner, Daniel R. Wilkins
Of course, I don't know 'hardware', as you can tell from my technical description, but I have a sample from another tuning peg gear, and the peg and gear for testing, I get to Home Hardware and they have loose bolts of small dimension. I quickly learn that #6 is too large, #4 is too small and they have no #5's where the thread matches.
But you know what works? Do you remember those little chrome bolts with the hex-wrench heads that used to hold expansion cards in the ibm-pc? Perfect match, only 5mm too long, easily compensated by buying a matching nut and my 53-years owned pawnshop 5-string, my first banjo, is back in action!
🔥 Ukraine will become a testing ground for the weapons of the future: tests right on the front line: https://benborges.xyz/2025/07/18/ukraine-will-become-a-testing.html
Hypothesis testing for quantitative trait locus effects in both location and scale in genetic backcross studies
Guanfu Liu, Pengfei Li, Yukun Liu, Xiaolong Pu
https://arxiv.org/abs/2507.14253
DiCriTest: Testing Scenario Generation for Decision-Making Agents Considering Diversity and Criticality
Qitong Chu, Yufeng Yue, Danya Yao, Huaxin Pei
https://arxiv.org/abs/2508.11514
Google rolls out AI Mode to 180 countries and territories in English, after testing in the US, UK, and India, and plans to add more languages and regions "soon" (Abner Li/9to5Google)
https://9to5google.com/2025/08/21/google-ai-mode-countries-agentic/
AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework
Yu Yao, Salil Bhatnagar, Markus Mazzola, Vasileios Belagiannis, Igor Gilitschenski, Luigi Palmieri, Simon Razniewski, Marcel Hallgarten
https://arxiv.org/abs/2507.13729
I wish I had mämmi
#finland #testing
Testing the Generalized Second Law in $(2 1)$-Dimensional Cosmology: Holographic Entropy Bounds and Observational Constraints
Praveen Kumar Dhankar, Aritra Sanyal, Safiqul Islam, Farook Rahaman, Behnam Pourhassan
https://arxiv.org/abs/2508.13227
Leveraging the group structure of hypotheses for more powerful multiple testing with FDR control for the filtered rejection set
Marina Bogomolov, Shinjini Nandi
https://arxiv.org/abs/2509.15444

Leveraging the group structure of hypotheses for more powerful multiple testing with FDR control for the filtered rejection set
Modern biological studies often involve testing many hypotheses organized in a group or a hierarchical structure, such as a directed acyclic graph (DAG). In these studies, researchers often wish to control the false discovery rate (FDR) after filtering the discoveries to obtain interpretable results. For addressing this goal, Katsevich, Sabatti, and Bogomolov (2023, Journal of the American Statistical Association, 118(541), 165-176) developed a general method, Focused BH, that guarantees FDR co…
A Study of Anatomical Priors for Deep Learning-Based Segmentation of Pheochromocytoma in Abdominal CT
Tanjin Taher Toma, Tejas Sudharshan Mathai, Bikash Santra, Pritam Mukherjee, Jianfei Liu, Wesley Jong, Darwish Alabyad, Vivek Batheja, Abhishek Jha, Mayank Patel, Darko Pucar, Jayadira del Rivero, Karel Pacak, Ronald M. Summers
https://
Testing the cosmic distance duality relation with baryon acoustic oscillations and supernovae data
Tian-Nuo Li, Guo-Hong Du, Peng-Ju Wu, Jing-Zhao Qi, Jing-Fei Zhang, Xin Zhang
https://arxiv.org/abs/2507.13811
The International Institute for Strategic Studies
The Scale of Russian Sabotage Operations Against Europe’s Critical Infrastructure
19 August 2025
"IISS has created the most comprehensive open-source database of suspected and confirmed Russian sabotage operations targeting Europe."
site:
Source: Netflix is using Runway AI's video generation tools for production; Disney is testing out the tools and talked with Runway about possible uses for them (Rachel Metz/Bloomberg)
https://www.bloomberg.com/news/articles/20
You Don't Know Until You Click:Automated GUI Testing for Production-Ready Software Evaluation
Yutong Bian, Xianhao Lin, Yupeng Xie, Tianyang Liu, Mingchen Zhuge, Siyuan Lu, Haoming Tang, Jinlin Wang, Jiayi Zhang, Jiaqi Chen, Xiangru Tang, Yongxin Ni, Sirui Hong, Chenglin Wu
https://arxiv.org/abs/2508.14104
Estonia's Foreign Minister:
"Russia’s increasingly extensive testing of boundaries and growing aggressiveness must be met with a swift increase in political and economic pressure."
https://www.pravda.com.ua/eng/news/2025/09/19/7531620/
Unfolding the Atmospheric Muon Flux with IceCube: Investigating Stopping Muons and High-Energy Prompt Contributions
Pascal Gutjahr (for the IceCube Collaboration), Lucas Witthaus (for the IceCube Collaboration)
https://arxiv.org/abs/2507.14525
New drives have suitably pleased me after a bit of testing, time to start swapping them in... 12 hours per drive doesn't seem horrendous.
📊 Versatile use cases include summarizing articles explaining complex concepts testing knowledge modifying recipes comparing products and making informed decisions
✍️ Get key takeaways from articles pages or discussion threads without leaving your current browsing session maintaining focus and workflow efficiency
🔍 Ask questions about content you're reading and receive relevant answers and explanations using the current page's information for accurate context
Optimal Transport Based Testing in Factorial Design
Michel Groppe, Linus Niem\"oller, Shayan Hundrieser, David Ventzke, Anna Blob, Sarah K\"oster, Axel Munk
https://arxiv.org/abs/2509.13970
Distributed Shared Layered Storage Quantum Simulator: A novel quantum simulation system for efficient scaling and cost optimization
Mingyang Yu, Haorui Yang, Donglin Wang, Desheng Kong, Ji Du, Yulong Fu, Wei Wang, Jing Xu
https://arxiv.org/abs/2508.15542
NYC grants Waymo its first permit, which extends through late September, to test up to eight of its autonomous vehicles in Manhattan and Downtown Brooklyn (Samantha Subin/CNBC)
https://www.cnbc.com/2025/08/22/waymo-permit-new-york-city-nyc-rides.html
From Capabilities to Performance: Evaluating Key Functional Properties of LLM Architectures in Penetration Testing
Lanxiao Huang, Daksh Dave, Ming Jin, Tyler Cody, Peter Beling
https://arxiv.org/abs/2509.14289
Evaluating Multilingual and Code-Switched Alignment in LLMs via Synthetic Natural Language Inference
Samir Abdaljalil, Erchin Serpedin, Khalid Qaraqe, Hasan Kurban
https://arxiv.org/abs/2508.14735
On the Testing of complete causal mediation and its applications
Yichin Tsai, Wan-Tzu Chang, Jia Jyun Sie, Cathy SJ Fann, Iebin Lian
https://arxiv.org/abs/2507.14246
RUM: Rule LLM-Based Comprehensive Assessment on Testing Skills
Yue Wang, Zhenyu Chen, Yuan Zhao, Chunrong Fang, Ziyuan Wang, Song Huang
https://arxiv.org/abs/2508.12922 https://…
NuSeC: A Dataset for Nuclei Segmentation in Breast Cancer Histopathology Images
Refik Samet, Nooshin Nemati, Emrah Hancer, Serpil Sak, Bilge Ayca Kirmizi
https://arxiv.org/abs/2507.14272
Microsoft begins testing a Windows 11 feature for sharing the entire desktop with Copilot Vision; it requires first entering a special mode in the Copilot app (Zac Bowden/Windows Central)
https://www.
EvolMathEval: Towards Evolvable Benchmarks for Mathematical Reasoning via Evolutionary Testing
Shengbo Wang, Mingwei Liu, Zike Li, Anji Li, Yanlin Wang, Xin Peng, Zibin Zheng
https://arxiv.org/abs/2508.13003
xOffense: An AI-driven autonomous penetration testing framework with offensive knowledge-enhanced LLMs and multi agent systems
Phung Duc Luong, Le Tran Gia Bao, Nguyen Vu Khai Tam, Dong Huu Nguyen Khoa, Nguyen Huu Quyen, Van-Hau Pham, Phan The Duy
https://arxiv.org/abs/2509.13021
Neutralization of Levitated Charged Nanodiamond: Towards matter-wave interferometry with massive objects
Sela Liran, Or Dobkowski, Rafael Benjaminov, Peter Skakunenko, Michael Averbukh, Yaniv Bar-Haim, David Groswasser, Joshua H. Baraban, Ron Folman
https://arxiv.org/abs/2508.15625
XAMT: Cross-Framework API Matching for Testing Deep Learning Libraries
Bin Duan, Ruican Dong, Naipeng Dong, Dan Dongseong Kim, Guowei Yang
https://arxiv.org/abs/2508.12546 https…
Google hires NBA star Stephen Curry as a "performance advisor" for its Health, Pixel, and Cloud products, including testing Fitbit's new personal health coach (Jess Weatherbed/The Verge)
https://www.theverge.com/news/762146/google-pixel-stephen-curry…
Strong Confinement of a Nanoparticle in a Needle Paul Trap: Towards Matter-Wave Interferometry with Nanodiamonds
Peter Skakunenko, Daniel Folman, Yaniv Bar-Haim, Ron Folman
https://arxiv.org/abs/2508.14272
ORFuzz: Fuzzing the "Other Side" of LLM Safety -- Testing Over-Refusal
Haonan Zhang, Dongxia Wang, Yi Liu, Kexin Chen, Jiashui Wang, Xinlei Ying, Long Liu, Wenhai Wang
https://arxiv.org/abs/2508.11222
Design of high-efficiency UHV loading of nanodiamonds into a Paul trap: Towards Matter-Wave Interferometry with Massive Objects
Rafael Benjaminov, Sela Liran, Or Dobkowski, Yaniv Bar-Haim, Michael Averbukh, Ron Folman
https://arxiv.org/abs/2508.14722
Inside Google's Reliability Labs, where it stress tests Pixel phones and watches; Google claims the Pixel 10 Pro Fold can withstand 10 years of folding (Julian Chokkattu/Wired)
https://www.wired.com/story/google-reliability-labs-exclusive-look/
Crosslisted article(s) found for cs.SE. https://arxiv.org/list/cs.SE/new
[1/1]:
- Tuning Random Generators: Property-Based Testing as Probabilistic Programming
Ryan Tjoa, Poorva Garg, Harrison Goldstein, Todd Millstein, Benjamin Pierce, Guy Van den Broeck
Quantum control of Nitrogen-Vacancy spin in Diamonds: Towards matter-wave interferometry with massive objects
N. Levi, O. Feldman, Y. Rosenzweig, D. Groswasser, A. Elgarat, M. Gal-Katizri, R. Folman
https://arxiv.org/abs/2508.15504
India-based ride-hailing app Rapido starts testing its food delivery service Ownly in Bengaluru, marking its first serious move to challenge Swiggy and Zomato (Jagmeet Singh/TechCrunch)
https://techcrunch.com/2025/08/13/indias-rapido-beg…
Trapping and cooling of nanodiamonds in a Paul trap under ultra-high vacuum: Towards matter-wave interferometry with massive objects
Omer Feldman, Ben Baruch Shultz, Maria Muretova, Or Dobkowski, Yonathan Japha, David Grosswasser, Ron Folman
https://arxiv.org/abs/2508.14687
A Novel Mutation Based Method for Detecting FPGA Logic Synthesis Tool Bugs
Yi Zhang, He Jiang, Xiaochen Li, Shikai Guo, Peiyu Zou, Zun Wang
https://arxiv.org/abs/2508.15536 http…
Israel-based Terra Security, which offers an AI-driven penetration testing platform, raised a $30M Series A led by Felicis, bringing its total funding to $38M (Meir Orbach/CTech)
https://www.calcalistech.com/ctechnews/article/awdq1yv5k
Bridging Control Variates and Regression Adjustment in A/B Testing: From Design-Based to Model-Based Frameworks
Yu Zhang, Bokui Wan, Yongli Qin
https://arxiv.org/abs/2509.13944 …
Extremal Testing for Network Software using LLMs
Rathin Singha, Harry Qian, Srinath Saikrishnan, Tracy Zhao, Ryan Beckett, Siva Kesava Reddy Kakarla, George Varghese
https://arxiv.org/abs/2507.11898
Modulator-free, self-testing quantum random number generator
Ana Bl\'azquez-Co\'ido, Fadri Gr\"unenfelder, Anthony Martin, Raphael Houlmann, Hugo Zbinden, Davide Rusca
https://arxiv.org/abs/2507.12346
A Regression Testing Framework with Automated Assertion Generation for Machine Learning Notebooks
Yingao Elaine Yao, Vedant Nimje, Varun Viswanath, Saikat Dutta
https://arxiv.org/abs/2509.13656
Google is testing a Windows desktop app that brings Mac's Spotlight-like search bar to PC users, allowing them to search local files, Google Drive, and the web (Emma Roth/The Verge)
https://www.theverge.com/news/778940/google-app-windows-launch
Wireless Communication Performance Testing: From Laboratory Environment to Research Vessel
Andrei-Raoul Morariu, Andreas Strandberg, Bogdan Iancu, Jerker Bjorkqvist
https://arxiv.org/abs/2509.14740
Evaluating the Effectiveness of Coverage-Guided Fuzzing for Testing Deep Learning Library APIs
Feiran Qin, M. M. Abid Naziri, Hengyu Ai, Saikat Dutta, Marcelo d'Amorim
https://arxiv.org/abs/2509.14626
Nvidia, Discord, and Epic Games are testing game demos on Discord servers, letting users try a game without downloading it or signing up, starting with Fortnite (Sean Hollister/The Verge)
https://www.theverge.com/news/760894/play-
An Online A/B Testing Decision Support System for Web Usability Assessment Based on a Linguistic Decision-making Methodology: Case of Study a Virtual Learning Environment
Noe Zerme\~no, Cristina Zuheros, Lucas Daniel Del Rosso Calache, Francisco Herrera, Rosana Montes
https://arxiv.org/abs/2507.12118