Tootfinder

Opt-in global Mastodon full text search.

@frankel@mastodon.top
2025-05-24 08:01:05

Why #PropertyTesting Finds Bugs #UnitTesting Does Not
bu…

@arXiv_csSE_bot@mastoxiv.page
2025-06-23 10:57:00

Software Fairness Testing in Practice
Ronnie de Souza Santos, Matheus de Morais Leca, Reydne Santos, Cleyton Magalhaes
arxiv.org/abs/2506.17095

@frankel@mastodon.top
2025-08-23 16:14:05

A Better Vocabulary for #Testing
alperenkeles.com/posts/vocab-f

@grifferz@social.bitfolk.com
2025-07-22 21:12:30

Have I missed something or is there no way with openssh to exempt certain netblocks from the MaxStartups setting?
There's PerSourcePenaltyExemptList, but that's specifically for the "penalties" things, which are separate from MaxStartups, right?

@arXiv_statML_bot@mastoxiv.page
2025-06-23 09:19:50

Diffusion-Based Hypothesis Testing and Change-Point Detection
Sean Moushegian, Taposh Banerjee, Vahid Tarokh
arxiv.org/abs/2506.16089

@arXiv_mathAG_bot@mastoxiv.page
2025-07-23 09:27:32

Testing the variety hypothesis
A. Lerario, P. Roos Hoefgeest, M. Scolamiero, A. Tamai
arxiv.org/abs/2507.16705 arxiv.…

@Techmeme@techhub.social
2025-08-19 15:55:45

Functionize, which offers a cloud platform that uses AI to speed up software testing, raised a $41M Series B, bringing its total funding to $67M (Maria Deutscher/SiliconANGLE)
siliconangle.com/2025/08/19/fu

@benb@osintua.eu
2025-08-19 13:15:57

Joint training and NUCLEAR testing: what are Russia and Belarus preparing? #shorts: benborges.xyz/2025/08/19/joint

@arXiv_csAI_bot@mastoxiv.page
2025-07-23 10:14:52

ChatChecker: A Framework for Dialogue System Testing and Evaluation Through Non-cooperative User Simulation
Roman Mayr, Michel Schimpf, Thomas Bohné
arxiv.org/abs/2507.16792

@arXiv_statME_bot@mastoxiv.page
2025-07-24 09:20:59

Testing Against Tree Ordered Alternatives in One-way ANOVA
Subha Halder, Anjana Mondal, Somesh Kumar
arxiv.org/abs/2507.17229 arxiv.org/pdf…

@arXiv_mathST_bot@mastoxiv.page
2025-06-23 08:34:40

The Optimality of a Nested Generalized Pairwise Group Testing Procedure
Yaakov Malinovsky, Viktor Skorniakov
arxiv.org/abs/2506.15797

@arXiv_quantph_bot@mastoxiv.page
2025-07-24 10:12:29

Development of a Standardized Testing Environment for QRNGs based on Semiconductor Laser Phase Noise
Matthias Ostner, Innocenzo De Marco, Christian Roubal
arxiv.org/abs/2507.17471

@arXiv_csSE_bot@mastoxiv.page
2025-06-23 08:28:40

Regression Testing Optimization for ROS-based Autonomous Systems: A Comprehensive Review of Techniques
Yupeng Jiang, Shuaiyi Sun, Xi Zheng
arxiv.org/abs/2506.16101

@arXiv_grqc_bot@mastoxiv.page
2025-06-23 10:52:00

Testing Quantum-Corrected Black Holes with QPOs Observations: A Study of Particle Dynamics and Accretion Flow
G. Mustafa, Sushant G. Ghosh, Orhan Donmez, S. K. Maurya, Shakhzod Orzuev, Farruh Atamurotov
arxiv.org/abs/2506.16405

@arXiv_astrophHE_bot@mastoxiv.page
2025-06-24 10:57:10

Testing the Lense-Thirring Precession Origin of the QPO in Swift J1727.8-1613
Ruican Ma, Chris Done, Aya Kubota
arxiv.org/abs/2506.18857

@datascience@genomic.social
2025-06-21 10:00:01

{testthat} is great for automatic testing. Here are some tricks for the heavy user: #rstats

@arXiv_astrophCO_bot@mastoxiv.page
2025-06-23 09:02:00

Testing Light Unaffiliated Mass Clumps in MACS 0416 on galaxy and galaxy cluster scales using JWST
Marceau Limousin, Derek Perera, Liliya L. R. Williams, Jori Liesenborgs, Gregor Rihtarsic
arxiv.org/abs/2506.16034

@karlauerbach@sfba.social
2025-07-23 00:54:15

Wow, yet another Thunderbird update.
It seems that Thunderbird, which ought to be rather stable by now, is getting updated more often than Chrome.
Either somebody is writing crummy code, not testing, or has a low acceptance hurdle for proposed "enhancements."

@ErikJonker@mastodon.social
2025-08-23 06:15:49

"The Scale of Russian Sabotage Operations Against Europe’s Critical Infrastructure" by IISS.
iiss.org/research-paper/2025/0

Map from IISS of attacks on critical infrastructure in Europe by Russia

@EarthOrgUK@mastodon.energy
2025-06-22 09:51:02

LED Lighting: Mini Reviews - Real-world testing! - earth.org.uk/LED-lighting.html

@teledyn@mstdn.ca
2025-07-22 19:29:50

Of course, I don't know 'hardware', as you can tell from my technical description, but I have a sample from another tuning peg gear, and the peg and gear for testing, I get to Home Hardware and they have loose bolts of small dimension. I quickly learn that #6 is too large, #4 is too small and they have no #5's where the thread matches.
But you know what works? Do you remember those little chrome bolts with the hex-wrench heads that used to hold expansion cards in the ibm-pc? Perfect match, only 5mm too long, easily compensated by buying a matching nut and my 53-years owned pawnshop 5-string, my first banjo, is back in action!

@Dragofix@veganism.social
2025-07-18 01:08:44

Brazil’s Chamber of Deputies Approves Bill Banning Cosmetic Testing on Live Vertebrates vegconomist.com/politics-law/b

@davidaugust@mastodon.online
2025-07-17 19:27:57

President Epstein List has an insufficiency? Sounds right.
#USpol

@arXiv_csSE_bot@mastoxiv.page
2025-06-24 10:16:50

Deep Learning Framework Testing via Model Mutation: How Far Are We?
Yanzhou Mu, Rong Wang, Juan Zhai, Chunrong Fang, Xiang Chen, Zhiyuan Peng, Peiran Yang, Ruixiang Qian, Shaoyu Yang, Zhenyu Chen
arxiv.org/abs/2506.17638

@primonatura@mstdn.social
2025-06-18 18:00:36

"‘Shark Skin’ Coating for Airliners May Cut Fuel Use by 4% – Delta is Testing on its 767 Fleet"
#Aviation #Aeroplanes

@arXiv_csDS_bot@mastoxiv.page
2025-07-22 08:55:00

Characterizing and Testing Configuration Stability in Two-Dimensional Threshold Cellular Automata
Yonatan Nakar, Dana Ron
arxiv.org/abs/2507.14569

@arXiv_csCL_bot@mastoxiv.page
2025-08-22 10:16:01

LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries
Ming Yin, Dinghan Shen, Silei Xu, Jianbing Han, Sixun Dong, Mian Zhang, Yebowen Hu, Shujian Liu, Simin Ma, Song Wang, Sathish Reddy Indurthi, Xun Wang, Yiran Chen, Kaiqiang Song
arxiv.org/abs/2508.15760

@fanf@mendeddrum.org
2025-08-17 11:42:03

from my link log —
Mix-testing: revealing a new class of compiler concurrency bugs.
johnwickerson.wordpress.com/20
saved 2024-06-29

@arXiv_mathST_bot@mastoxiv.page
2025-07-23 08:39:22

Gaussian Sequence Model: Sample Complexities of Testing, Estimation and LFHT
Zeyu Jia, Yury Polyanskiy
arxiv.org/abs/2507.16734

@arXiv_statME_bot@mastoxiv.page
2025-06-23 10:36:30

Leveraging Optimal Transport for Distributed Two-Sample Testing: An Integrated Transportation Distance-based Framework
Zhengqi Lin, Yan Chen
arxiv.org/abs/2506.16047

@Techmeme@techhub.social
2025-06-23 16:01:18

Microsoft says it is testing a new aggregated gaming library in the Xbox PC app for Windows 11 with Xbox Insiders, integrating "leading PC storefronts" (Jez Corden/Windows Central)
windowscentral.com/gaming/pc-g

@j_honegger@swiss.social
2025-07-23 12:02:32

From #AnnafromUkraine @AnnafromUkraine@youtube.com
PROTEST IN KYIV: ANTI-CORRUPTION LAW & MOSCOW NO FLY ZONE Vlog 1113: War in #Ukraine
Why #Moscow has become the new testing gro…

@arXiv_csPL_bot@mastoxiv.page
2025-08-21 07:38:39

Tuning Random Generators: Property-Based Testing as Probabilistic Programming
Ryan Tjoa, Poorva Garg, Harrison Goldstein, Todd Millstein, Benjamin Pierce, Guy Van den Broeck
arxiv.org/abs/2508.14394

@arXiv_quantph_bot@mastoxiv.page
2025-08-22 10:01:21

Robust Self-Testing of Multiqudit Supersinglet Slater States via Constant Number of Binary Measurements
Arturo Konderak, Wojciech Bruzda, Remigiusz Augusiak
arxiv.org/abs/2508.15546

@benb@osintua.eu
2025-07-18 22:43:01

🔥 Ukraine will become a testing ground for the weapons of the future: tests right on the front line: benborges.xyz/2025/07/18/ukrai

@arXiv_econEM_bot@mastoxiv.page
2025-07-22 09:09:10

Testing Clustered Equal Predictive Ability with Unknown Clusters
Oguzhan Akgun, Alain Pirotte, Giovanni Urga, Zhenlin Yang
arxiv.org/abs/2507.14621

@arXiv_csSE_bot@mastoxiv.page
2025-06-24 09:59:50

Breaking Single-Tester Limits: Multi-Agent LLMs for Multi-User Feature Testing
Sidong Feng, Changhao Du, Huaxiao Liu, Qingnan Wang, Zhengwei Lv, Mengfei Wang, Chunyang Chen
arxiv.org/abs/2506.17539

@arXiv_csCR_bot@mastoxiv.page
2025-06-23 10:46:40

Towards Effective Complementary Security Analysis using Large Language Models
Jonas Wagner, Simon Müller, Christian Näther, Jan-Philipp Steghöfer, Andreas Both
arxiv.org/abs/2506.16899

@Mediagazer@mstdn.social
2025-07-21 20:45:42

Source: Netflix is using Runway AI's video generation tools for production; Disney is testing out the tools and talked with Runway about possible uses for them (Rachel Metz/Bloomberg)
bloomberg.com/news/articles/20

@edintone@mastodon.green
2025-06-16 06:57:10

‘Shark Skin’ Coating for Airliners May Cut Fuel Use by 4% – Delta is Testing on its 767 Fleet goodnewsnetwork.org/shark-skin

@arXiv_astrophCO_bot@mastoxiv.page
2025-07-23 08:46:42

Testing gravitational physics by combining DESI DR1 and weak lensing datasets using the E_G estimator
S. J. Rauhut, C. Blake, U. Andrade, H. E. Noriega, J. Aguilar, S. Ahlen, S. BenZvi, D. Bianchi, D. Brooks, T. Claybaugh, A. Cuceu, A. de la Macorra, J. DeRose, P. Doel, N. Emas, S. Ferraro, J. E. Forero-Romero, C. Garcia-Quintero, E. Gaztañaga, G. Gutierrez, S. Heydenreich, K. Honscheid, C. Howlett, D. Huterer, M. Ishak, S. Joudaki, R. Joyce, E. Jullo, R. Kehoe, D. Kirkby, A. Kremin,…

@arXiv_csLG_bot@mastoxiv.page
2025-08-18 09:45:30

DiCriTest: Testing Scenario Generation for Decision-Making Agents Considering Diversity and Criticality
Qitong Chu, Yufeng Yue, Danya Yao, Huaxin Pei
arxiv.org/abs/2508.11514

@arXiv_eessIV_bot@mastoxiv.page
2025-06-24 09:38:20

CT Radiomics-Based Explainable Machine Learning Model for Accurate Differentiation of Malignant and Benign Endometrial Tumors: A Two-Center Study
Tingrui Zhang, Honglin Wu, Zekun Jiang, Yingying Wang, Rui Ye, Huiming Ni, Chang Liu, Jin Cao, Xuan Sun, Rong Shao, Xiaorong Wei, Yingchun Sun
arxiv.org/abs/2506.18106

@arXiv_eessSY_bot@mastoxiv.page
2025-07-24 08:47:30

Multi-Angle Rotational Actuation in a 0.8-mm-Thick Preload-Free Piezoelectric Micromotor
Haijia Yu, Mingtong Chen, Zhengbao Yang
arxiv.org/abs/2507.17155

@Techmeme@techhub.social
2025-08-22 14:45:51

NYC grants Waymo its first permit, which extends through late September, to test up to eight of its autonomous vehicles in Manhattan and Downtown Brooklyn (Samantha Subin/CNBC)
cnbc.com/2025/08/22/waymo-perm

@arXiv_statCO_bot@mastoxiv.page
2025-07-23 08:50:02

inrep: A Comprehensive Framework for Adaptive Testing in R
Clievins Selva
arxiv.org/abs/2507.15893 arxiv.org/pdf/2507…

@malik@Mastodon.Social
2025-08-18 21:58:46

Plague and cholera in perfectly even measure?
mastodon.online/@9to5Mac/11505

@arXiv_csCG_bot@mastoxiv.page
2025-06-24 08:06:40

Optimal Parallel Algorithms for Convex Hulls in 2D and 3D under Noisy Primitive Operations
Michael T. Goodrich, Vinesh Sridhar
arxiv.org/abs/2506.17507

@arXiv_eessSP_bot@mastoxiv.page
2025-06-23 10:28:00

Refining Ray-Tracing Accuracy and Efficiency in the Context of FRMCS Urban Railway Channel Predictions
Romain Charbonnier, Thierry Tenoux, Yoann Corre
arxiv.org/abs/2506.16236

@selea@social.linux.pizza
2025-08-18 10:16:05

I wish I had mämmi
#finland #testing

@mrwedders@social.linux.pizza
2025-07-22 19:34:51

New drives have suitably pleased me after a bit of testing, time to start swapping them in... 12 hours per drive doesn't seem horrendous.

@arXiv_physicsinsdet_bot@mastoxiv.page
2025-07-21 08:24:00

A Simple Apparatus for Testing PMT Humidity Tolerance
A. Germer, K. Park, C. Skuse, C. Yang, D. S. Parno
arxiv.org/abs/2507.13545

@arXiv_csSE_bot@mastoxiv.page
2025-07-24 08:40:59

On the Feasibility of Quantum Unit Testing
Andriy Miranskyy, José Campos, Anila Mjeda, Lei Zhang, Ignacio García Rodríguez de Guzmán
arxiv.org/abs/2507.17235

@arXiv_hepph_bot@mastoxiv.page
2025-08-21 08:27:10

Testing the dark side of neutrino oscillations with the solar neutrino fog at Dark Matter experiments
Julia Gehrlein, Tanmay Kushwaha
arxiv.org/abs/2508.14166

@aardrian@toot.cafe
2025-06-04 18:14:03

Apropos of yet another conversation today, I’m a big fan of using automation in WCAG testing.
But I also know WCAG well enough to understand the limitations (and lies) of the tools.
adrianroselli.com/2025/04/auto

@arXiv_csCY_bot@mastoxiv.page
2025-06-19 08:08:33

Hypothesis Testing for Quantifying LLM-Human Misalignment in Multiple Choice Settings
Harbin Hong, Sebastian Caldas, Liu Leqi
arxiv.org/abs/2506.14997

@arXiv_csSE_bot@mastoxiv.page
2025-06-24 09:01:40

Challenges and Practices in Quantum Software Testing and Debugging: Insights from Practitioners
Jake Zappin, Trevor Stalnaker, Oscar Chaparro, Denys Poshyvanyk
arxiv.org/abs/2506.17306

@arXiv_csAI_bot@mastoxiv.page
2025-08-19 11:17:40

Bayesian Optimization-based Search for Agent Control in Automated Game Testing
Carlos Celemin
arxiv.org/abs/2508.13121 arxiv.org/pdf/2508.1…

@arXiv_csCR_bot@mastoxiv.page
2025-07-23 07:37:02

BACFuzz: Exposing the Silence on Broken Access Control Vulnerabilities in Web Applications
I Putu Arya Dharmaadi, Mohannad Alhanahnah, Van-Thuan Pham, Fadi Mohsen, Fatih Turkmen
arxiv.org/abs/2507.15984

@arXiv_mathST_bot@mastoxiv.page
2025-06-24 09:17:10

Testing Separability of High-Dimensional Covariance Matrices
Bongjung Sung, Peter D. Hoff
arxiv.org/abs/2506.17463 ar…

@arXiv_statML_bot@mastoxiv.page
2025-06-23 10:04:20

On Continuous Monitoring of Risk Violations under Unknown Shift
Alexander Timans, Rajeev Verma, Eric Nalisnick, Christian A. Naesseth
arxiv.org/abs/2506.16416

@arXiv_csSE_bot@mastoxiv.page
2025-06-23 10:40:20

Revolutionizing Validation and Verification: Explainable Testing Methodologies for Intelligent Automotive Decision-Making Systems
Halit Eris, Stefan Wagner
arxiv.org/abs/2506.16876

@arXiv_statME_bot@mastoxiv.page
2025-07-22 11:11:30

Testing Homogeneity in a heteroscedastic contaminated normal mixture
Xiaoqing Niu, Pengfei Li, Yuejiao Fu
arxiv.org/abs/2507.15630

@Techmeme@techhub.social
2025-07-16 14:40:48

Microsoft begins testing a Windows 11 feature for sharing the entire desktop with Copilot Vision; it requires first entering a special mode in the Copilot app (Zac Bowden/Windows Central)

@arXiv_astrophHE_bot@mastoxiv.page
2025-06-23 11:12:50

Possibilities for SETI at High Energy
Brian C. Lacki, Stephen DiKerby
arxiv.org/abs/2506.16351 arxiv.org/pdf/2506.163…

@arXiv_csCG_bot@mastoxiv.page
2025-06-24 09:12:50

How Hard is it to be a Star? Convex Geometry and the Real Hierarchy
Marcus Schaefer, Daniel Štefankovič
arxiv.org/abs/2506.18818

@arXiv_eessSY_bot@mastoxiv.page
2025-06-24 11:12:20

In silico evaluation of pramlintide dosing algorithms in artificial pancreas systems
Borja Pons Torres, Iván Sala Mira, Clara Furió-Novejarque, Ricardo Sanz, Pedro García, José-Luis Díez, Jorge Bondia
arxiv.org/abs/2506.17790

@arXiv_statME_bot@mastoxiv.page
2025-07-22 08:20:00

Hypothesis testing for quantitative trait locus effects in both location and scale in genetic backcross studies
Guanfu Liu, Pengfei Li, Yukun Liu, Xiaolong Pu
arxiv.org/abs/2507.14253

@arXiv_csCR_bot@mastoxiv.page
2025-07-24 09:48:09

Enabling Cyber Security Education through Digital Twins and Generative AI
Vita Santa Barletta, Vito Bavaro, Miriana Calvano, Antonio Curci, Antonio Piccinno, Davide Pio Posa
arxiv.org/abs/2507.17518

@arXiv_csSE_bot@mastoxiv.page
2025-07-23 08:30:12

StaAgent: An Agentic Framework for Testing Static Analyzers
Elijah Nnorom, Md Basim Uddin Ahmed, Jiho Shin, Hung Viet Pham, Song Wang
arxiv.org/abs/2507.15892

@arXiv_csAI_bot@mastoxiv.page
2025-08-19 11:12:10

EvolMathEval: Towards Evolvable Benchmarks for Mathematical Reasoning via Evolutionary Testing
Shengbo Wang, Mingwei Liu, Zike Li, Anji Li, Yanlin Wang, Xin Peng, Zibin Zheng
arxiv.org/abs/2508.13003

@Techmeme@techhub.social
2025-08-21 12:40:49

Google rolls out AI Mode to 180 countries and territories in English, after testing in the US, UK, and India, and plans to add more languages and regions "soon" (Abner Li/9to5Google)
9to5google.com/2025/08/21/goog

@arXiv_astrophCO_bot@mastoxiv.page
2025-07-24 09:42:39

Quantifying the Impact of 2D and 3D BAO Measurements on the Cosmic Distance Duality Relation with HII Galaxy observation
Jie Zheng (HNAS), Da-Chun Qiang (HNAS), Zhi-Qiang You (HNAS), Darshan Kumar (HNAS)
arxiv.org/abs/2507.17113

@arXiv_csCR_bot@mastoxiv.page
2025-06-23 10:41:40

Exploring Traffic Simulation and Cybersecurity Strategies Using Large Language Models
Lu Gao, Yongxin Liu, Hongyun Chen, Dahai Liu, Yunpeng Zhang, Jingran Sun
arxiv.org/abs/2506.16699

@arXiv_csSE_bot@mastoxiv.page
2025-06-23 10:52:10

Behavior Driven Development for 3D Games
Fernando Pastor Ricós, Beatriz Marín, I. S. W. B. Prasetya, Tanja E. J. Vos, Joseph Davidson, Karel Hovorka
arxiv.org/abs/2506.17057

@arXiv_quantph_bot@mastoxiv.page
2025-07-17 10:15:10

Modulator-free, self-testing quantum random number generator
Ana Blázquez-Coído, Fadri Grünenfelder, Anthony Martin, Raphael Houlmann, Hugo Zbinden, Davide Rusca
arxiv.org/abs/2507.12346

@Techmeme@techhub.social
2025-07-21 20:45:40

Source: Netflix is using Runway AI's video generation tools for production; Disney is testing out the tools and talked with Runway about possible uses for them (Rachel Metz/Bloomberg)
bloomberg.com/news/articles/20

@arXiv_statME_bot@mastoxiv.page
2025-07-22 08:14:40

On the Testing of complete causal mediation and its applications
Yichin Tsai, Wan-Tzu Chang, Jia Jyun Sie, Cathy SJ Fann, Iebin Lian
arxiv.org/abs/2507.14246

@arXiv_csSE_bot@mastoxiv.page
2025-07-22 11:22:20

Deep Learning Framework Testing via Heuristic Guidance Based on Multiple Model Measurements
Yinglong Zou, Juan Zhai, Chunrong Fang, Yanzhou Mu, Jiawei Liu, Zhenyu Chen
arxiv.org/abs/2507.15181

@arXiv_statME_bot@mastoxiv.page
2025-07-22 10:59:20

Multiple Hypothesis Testing To Estimate The Number Of Communities in Stochastic Block Models
Chetkar Jha, Mingyao Li, Ian Barnett
arxiv.org/abs/2507.15471

@Techmeme@techhub.social
2025-08-14 01:40:59

India-based ride-hailing app Rapido starts testing its food delivery service Ownly in Bengaluru, marking its first serious move to challenge Swiggy and Zomato (Jagmeet Singh/TechCrunch)
techcrunch.com/2025/08/13/indi

@arXiv_statME_bot@mastoxiv.page
2025-08-22 09:25:21

An adaptive procedure for detecting replicated signals with k-family-wise error rate control
Ninh Tran
arxiv.org/abs/2508.15363 arxiv.org…

@Techmeme@techhub.social
2025-06-18 16:42:05

Waymo applied for a NYC permit to test its cars with safety drivers and plans to start collecting mapping data with manually driven cars in Manhattan in July (Andrew J. Hawkins/The Verge)
theverge.com/news/689093/waymo

@arXiv_csSE_bot@mastoxiv.page
2025-07-21 08:46:40

Testing Autonomous Driving Systems -- What Really Matters and What Doesn't
Changwen Li, Joseph Sifakis, Rongjie Yan, Jian Zhang
arxiv.org/abs/2507.13661

@arXiv_csSE_bot@mastoxiv.page
2025-08-19 10:11:40

RUM: Rule LLM-Based Comprehensive Assessment on Testing Skills
Yue Wang, Zhenyu Chen, Yuan Zhao, Chunrong Fang, Ziyuan Wang, Song Huang
arxiv.org/abs/2508.12922

@Techmeme@techhub.social
2025-08-20 14:30:57

Google hires NBA star Stephen Curry as a "performance advisor" for its Health, Pixel, and Cloud products, including testing Fitbit's new personal health coach (Jess Weatherbed/The Verge)
theverge.com/news/762146/googl

@arXiv_csSE_bot@mastoxiv.page
2025-06-18 09:19:09

Navigating the growing field of research on AI for software testing -- the taxonomy for AI-augmented software testing and an ontology-driven literature survey
Ina K. Schieferdecker
arxiv.org/abs/2506.14640

@Techmeme@techhub.social
2025-06-14 21:16:27

In an Oxford study, LLMs correctly identified medical conditions 94.9% of the time when given test scenarios directly, vs. 34.5% when prompted by human subjects (Nick Mokey/VentureBeat)
venturebeat.com/ai/just-add-hu

@arXiv_csSE_bot@mastoxiv.page
2025-06-19 08:37:03

Large Language Models for Unit Testing: A Systematic Literature Review
Quanjun Zhang, Chunrong Fang, Siqi Gu, Ye Shang, Zhenyu Chen, Liang Xiao
arxiv.org/abs/2506.15227

@arXiv_csSE_bot@mastoxiv.page
2025-07-24 09:43:09

CASCADE: LLM-Powered JavaScript Deobfuscator at Google
Shan Jiang, Pranoy Kovuri, David Tao, Zhixun Tan
arxiv.org/abs/2507.17691

@Techmeme@techhub.social
2025-08-20 22:05:59

Inside Google's Reliability Labs, where it stress tests Pixel phones and watches; Google claims the Pixel 10 Pro Fold can withstand 10 years of folding (Julian Chokkattu/Wired)
wired.com/story/google-reliabi

@arXiv_csSE_bot@mastoxiv.page
2025-07-23 08:03:52

AlgoTune: Can Language Models Speed Up General-Purpose Numerical Programs?
Ori Press, Brandon Amos, Haoyu Zhao, Yikai Wu, Samuel K. Ainsworth, Dominik Krupke, Patrick Kidger, Touqir Sajed, Bartolomeo Stellato, Jisun Park, Nathanael Bosch, Eli Meril, Albert Steppi, Arman Zharmagambetov, Fangzhao Zhang, David Perez-Pineiro, Alberto Mercurio, Ni Zhan, Talor Abramovich, Kilian Lieret, Hanlin Zhang, Shirley Huang, Matthias Bethge, Ofir Press

@arXiv_csSE_bot@mastoxiv.page
2025-06-24 11:38:00

Build It Clean: Large-Scale Detection of Code Smells in Build Scripts
Mahzabin Tamanna, Yash Chandrani, Matthew Burrows, Brandon Wroblewski, Laurie Williams, Dominik Wermke
arxiv.org/abs/2506.17948

@arXiv_csSE_bot@mastoxiv.page
2025-06-24 10:22:40

May the Feedback Be with You! Unlocking the Power of Feedback-Driven Deep Learning Framework Fuzzing via LLMs
Shaoyu Yang, Chunrong Fang, Haifeng Lin, Xiang Chen, Zhenyu Chen
arxiv.org/abs/2506.17642

@Techmeme@techhub.social
2025-06-07 13:40:47

YouTube rolls out a tool to let some creators upload different thumbnails for each video dubbed into a different language, to help expand their global audience (Dan Whateley/Business Insider)
businessinsider.com/youtube-te

@arXiv_csSE_bot@mastoxiv.page
2025-06-23 10:36:00

Accountability of Robust and Reliable AI-Enabled Systems: A Preliminary Study and Roadmap
Filippo Scaramuzza, Damian A. Tamburri, Willem-Jan van den Heuvel
arxiv.org/abs/2506.16831

@arXiv_csSE_bot@mastoxiv.page
2025-08-21 07:57:10

You Don't Know Until You Click: Automated GUI Testing for Production-Ready Software Evaluation
Yutong Bian, Xianhao Lin, Yupeng Xie, Tianyang Liu, Mingchen Zhuge, Siyuan Lu, Haoming Tang, Jinlin Wang, Jiayi Zhang, Jiaqi Chen, Xiangru Tang, Yongxin Ni, Sirui Hong, Chenglin Wu
arxiv.org/abs/2508.14104

@arXiv_csSE_bot@mastoxiv.page
2025-08-18 08:38:50

ORFuzz: Fuzzing the "Other Side" of LLM Safety -- Testing Over-Refusal
Haonan Zhang, Dongxia Wang, Yi Liu, Kexin Chen, Jiashui Wang, Xinlei Ying, Long Liu, Wenhai Wang
arxiv.org/abs/2508.11222

@arXiv_csSE_bot@mastoxiv.page
2025-08-19 10:01:50

XAMT: Cross-Framework API Matching for Testing Deep Learning Libraries
Bin Duan, Ruican Dong, Naipeng Dong, Dan Dongseong Kim, Guowei Yang
arxiv.org/abs/2508.12546

@arXiv_csSE_bot@mastoxiv.page
2025-07-17 08:09:00

Extremal Testing for Network Software using LLMs
Rathin Singha, Harry Qian, Srinath Saikrishnan, Tracy Zhao, Ryan Beckett, Siva Kesava Reddy Kakarla, George Varghese
arxiv.org/abs/2507.11898