
2025-07-29 11:52:45
"#captcha art test" by wrong hands: https://wronghands1.com/2025/07/29/captcha-art-test/
"#captcha art test" by wrong hands: https://wronghands1.com/2025/07/29/captcha-art-test/
Automated Test Oracles for Flaky Cyber-Physical System Simulators: Approach and Evaluation
Baharin A. Jodat, Khouloud Gaaloul, Mehrdad Sabetzadeh, Shiva Nejati
https://arxiv.org/abs/2508.20902
Kiedy piszesz bibliotekę w #RustLang, bo chcesz bezpiecznego kodu, a zamiast tego dostajesz jakiś paskudny #heisenbug z przekłamaniem danych.
https://…
@… I am using `import.meta.resolve` in my import-module-string package (tested in Vitest) and needed to write a separate test suite using Node’s test runner to test the import.meta.resolve functionality.
h…
Evaluating Interactions between Automated Vehicles and Cyclists using a coupled In-the-Loop Test Environment
Michael Kaiser, Clemens Gro{\ss}, Lisa Marie Otto, Steffen M\"uller
https://arxiv.org/abs/2507.21859
Ex-Super Bowl champion doubts Travis Hunter's two-way NFL stardom, says Jaguars should 'test his armor'
https://www.cbssports.com/nfl/news…
China accepts first sanctioned Russian LNG cargo in test of US response, Bloomberg reports: https://benborges.xyz/2025/08/28/china-accepts-first-sanctioned-russian.html
Trump’s strategy of imposing sweeping tariffs on America’s main trading partners will face a major test in the US courts on Thursday,
four days after the president hailed the “powerful deal” reached with the EU -- and just hours before a new round of punishing import duties is set to come into effect.
Trump has underpinned his tariff policy with an emergency power that is now being challenged as unlawful in the federal courts.
On Thursday the US court of appeals for the fed…
New test unmasks illegal elephant ivory disguised as mammoth #AnimalRights
OpenAI and Anthropic publish findings from joint safety tests of each other's models, aimed at surfacing blind spots in their internal evaluations (Maxwell Zeff/TechCrunch)
https://techcrunch.com/2025/08/27/openai-co-founder-calls-for-ai…
A General Test for Independent and Identically Distributed Hypothesis
Tongyu Li, Jonas Mueller, Fang Yao
https://arxiv.org/abs/2506.22361 https://
Dann finden wir es doch heraus, landet ein Beitrag mit nur dem Hashtag:
#test
einfach auf
https://feddit.org/c/testing@kbin.earth?
Finally some good negative results #COVID19
Brazil offers America a lesson in democratic maturity. It is a test case for how countries recover from a populist fever.
https://economist.com/leaders/2025/08/28/brazil-offers-america-a-lesson-in-democratic-maturity
from The Ec…
After a discussion about why omitting coverage for tests is not a good idea, I added a tip to our test tutorial: https://python-basics-tutorial.readthedocs.io/en/latest/test/pytest/coverage.html#coverage-tip
Test-Time Consistency in Vision Language Models
Shih-Han Chou, Shivam Chandhok, James J. Little, Leonid Sigal
https://arxiv.org/abs/2506.22395 https://
A Graph-Based Test-Harness for LLM Evaluation
Jessica Lundin, Guillaume Chabot-Couture
https://arxiv.org/abs/2508.20810 https://arxiv.org/pdf/2508.20810
Intention-Driven Generation of Project-Specific Test Cases
Binhang Qi, Yun Lin, Xinyi Weng, Yuhuan Huang, Chenyan Liu, Hailong Sun, Jin Song Dong
https://arxiv.org/abs/2507.20619
So Starship exploded 4 times at last test launch? Like 1 time before the launch, and then 3 times during the test?
Is this fine
Researchers Quietly Planned a Test to Dim Sunlight Over 3,900 Square Miles - Slashdot
https://news.slashdot.org/story/25/07/27/2146205/researchers-quietly-planned-a-test-to-dim-sunlight-over-3900-square-miles
Resource-Efficient Hadamard Test Circuits for Nonlinear Dynamics on a Trapped-Ion Quantum Computer
Eleftherios Mastorakis, Muhammad Umer, Milena Guevara-Bertsch, Juris Ulmanis, Felix Rohde, Dimitris G. Angelakis
https://arxiv.org/abs/2507.19250
38% speedup on one of my test setups from a single-character code change.
This bug hiding in libscopehal for ages meant that if you tried to preallocate a buffer to avoid constant resizes, you'd end up reallocating even if the buffer was already the right size. If you're doing this every filter invocation, it adds up.
CR7 in Grödig? - Sinti und Roma müssen für Ronaldo-Test weichen #News #Nachrichten
#Wikipedia Editors Reject Founder's AI Review Proposal After ChatGPT Fails Basic Policy Test
https://slashdot.org/story/446032
Is the client vendor now responsible for you getting scammed when their agent gives away your financial information to scam sites?
https://guard.io/labs/scamlexity-we-put-agentic-ai-browsers-to-the-test-they-clicked-they-paid-they-fa…
Det burde man nok have forudset. Myndighederne anbefaler at danskerne gŸr noget. Kyniske virksomheder udnytter situationen og prakker danskerne alt muligt skrammel på. #betalingsmur #beredskab
In preparation for Python 3.14 and official free-threading / nogil support, I present argon2-cffi-bindings 25.1.0!
It comes with 3.14t wheels (and new ARM64 wheels for Windows for good measure). Currently, this is mostly to unblock everyone who wants to test things since it depends on a beta version of CFFI for 3.14 .
There’s no need for changes to argon2-cffi proper.
Bag of Coins: A Statistical Probe into Neural Confidence Structures
Agnideep Aich, Ashit Baran Aich, Md Monzur Murshed, Sameera Hewage, Bruce Wade
https://arxiv.org/abs/2507.19774
Logical Reasoning with Outcome Reward Models for Test-Time Scaling
Ramya Keerthy Thatikonda, Wray Buntine, Ehsan Shareghi
https://arxiv.org/abs/2508.19903 https://
A Reckless Judicial Nomination Puts the Senate to the Test (David French/New York Times)
https://www.nytimes.com/2025/06/29/opinion/bove-trump-judges-confirmation.html
http://www.memeorandum.com/250629/p92#a250629p92
Cash and Cognition: The Impact of Transfer Timing on Standardized Test Performance and Human Capital
Axel Eizmendi Larrinaga, Germ\'an Reyes
https://arxiv.org/abs/2507.21393
WhichYear 6/30/25
3566 pts
9.0 avg. years off
3️⃣ ⚪ ⚪ ⚪ 1️⃣
https://whichyr.com
Detection of a dark matter sub-halo near the Sun from pulsar timing: #DarkMatter been found in our galactic neighborhood? https://www.science.org/content/article/has-huge-blob-dark-matter-been-found-our-galactic-neighborhood - if confirmed, vast cloud could test predictions about the Milky Way’s hidden architecture.
UChicago researchers have developed a new liquid biopsy test that uses RNA modifications to detect early-stage colorectal cancer with 95% accuracy with a simple blood draw
https://biologicalsciences.uchicago.edu/news/liquid-biopsy-rna-cancer
Exploring AI-Enabled Test Practice, Affect, and Test Outcomes in Language Assessment
Jill Burstein, Ramsey Cardwell, Ping-Ling Chuang, Allison Michalowski, Steven Nydick
https://arxiv.org/abs/2508.17108
Okay, the magnets won't be in until Wednesday but got the 3D printed D-pad installed. Tracked down my old XBox Pro controller, fit the metal D-pad (D-dish?) onto it, and gave it a short test running around the map in Sea of Stars...
If you have a stock D-pad on the Legion Go, this is definitely a must have. The diagonals now exist and the D-pad is nice to use now. Is it the greatest? No, but it's about a 1000x better than it was.
The 3D print was found here:
Das passiert, wenn nazis an die Macht kommen.
>>Rassistischer Hintergrund? <<
>>Dänemark nimmt grönländischer Mutter Neugeborenes nach umstrittenem Test weg<<
Obwohl der Test für Grönländerinnen gesetzlich verboten war, interessien Nazis solche Verbote nicht. Mit Leuten, die mich auffordern, die SPD-geführte Regierung in Dänemark nicht als Nazis zu bezeichnen, diskutiere ich nicht.
So I first now removed the 3rd keyoxide test...removed it both from the server and the device now...
In this same post I will also put details on the removal of the final test, keeping only the main keyoxide...and it has now been removed.
Now my profile looks sane again.
#Keyoxide
Generating Highly Structured Test Inputs Leveraging Constraint-Guided Graph Refinement
Zhaorui Yang, Yuxin Qiu, Haichao Zhu, Qian Zhang
https://arxiv.org/abs/2507.21271 https://…
Watching 36 cars with active Advanced Driver Assistant Systems (#ADAS) being tested in real live situations is ... revealing. Huge differences in outcomes (crashes etc.) and paying more money does not guarantee a better outcome. Watch the video with English subtitles switched on, it's hilarious.
heise | Kochautomat: Bosch Cookit im Test
Der Cookit war schon immer der schärfste Konkurrent des Thermomix TM6. Aber kann er mit Vorwerks neuestem Modell TM7 mithalten?
https://www.heise.de/r…
Gravitational Microlensing of the Galactic Centre $\gamma$-Ray Excess: A New Test for Point-Like or Extended Emission?
Nada Salama, Florian List, Geraint Lewis
https://arxiv.org/abs/2508.19577
Replaced article(s) found for cs.LG. https://arxiv.org/list/cs.LG/new
[2/4]:
- MetaSel: A Test Selection Approach for Fine-tuned DNN Models
Amin Abbasishahkoo, Mahboubeh Dadkhah, Lionel Briand, Dayi Lin
Wow! The latest go blog has an excellent writeup explaining a test/synctest feature.
Testing Time (and other asyncronicities) - The Go Programming Language
#go
Anthropic tests AI running a real business with bizarre results https://www.artificialintelligence-news.com/news/anthropic-tests-ai-running-a-real-business-with-bizarre-results/
No Boats, Just Lies...
#Sims4 #TheSims4 #Comicstrip
A test of seven AI chatbots' abilities to identify news photos' location, date, and photographer showed all failed to consistently identify photos' provenance (Columbia Journalism Review)
https://www.cjr.org/tow_center/why-ai-models-are-bad-at-verifyi…
A clusterability test for directed graphs
Mario R. Guarracino, Pierre Miasnikof, Alexander Y. Shestopaloff, Houyem Demni, Cristi\'an Bravo, Yuri Lawryshyn
https://arxiv.org/abs/2506.20111
Who Wore It Best? The Greatest NFL Players by Jersey Number, 0-24 https://www.foxsports.com/stories/nfl/greatest-nfl-players-wear-every-jersey-number-0-24
X is testing using Community Notes to highlight posts that are liked by users with different perspectives (Sarah Perez/TechCrunch)
https://techcrunch.com/2025/07/24/x-to-test-using-community-notes-to-find-the-posts-everyone-likes/
Empathy in Explanation
Katherine M. Collins, Kartik Chandra, Adrian Weller, Jonathan Ragan-Kelley, Joshua B. Tenenbaum
https://arxiv.org/abs/2507.21081 https://
Agentic Program Repair from Test Failures at Scale: A Neuro-symbolic approach with static analysis and test execution feedback
Chandra Maddila, Adam Tait, Claire Chang, Daniel Cheng, Nauman Ahmad, Vijayaraghavan Murali, Marshall Roch, Arnaud Avondet, Aaron Meltzer, Victor Montalvao, Michael Hopko, Chris Waterson, Parth Thakkar, Renuka Fernandez, Kristian Kristensen, Sivan Barzily, Sherry Chen, Rui Abreu, Nachiappan Nagappan, Payam Shodjai, Killian Murphy, James Everingham, Aparna Raman…
When Is Causal Inference Possible? A Statistical Test for Unmeasured Confounding
Muye Liu, Jun Xie
https://arxiv.org/abs/2508.20366 https://arxiv.org/pdf/2…
Non-Hormonal Male Birth Control Pill Passes Key Test
https://gizmodo.com/non-hormonal-male-birth-control-pill-passes-key-test-2000633856
heise | Plastisch pinseln: iPad-App Feather im Test
Sie malen gerne, wollen es aber dreidimensional? Mit der iPad-App Feather zeichnet man begehbare 3D-Welten.
https://www.heise.de/tests/Plastisch…
MIRAGE: Scaling Test-Time Inference with Parallel Graph-Retrieval-Augmented Reasoning Chains
Kaiwen Wei, Rui Shan, Dongsheng Zou, Jianzhong Yang, Bi Zhao, Junnan Zhu, Jiang Zhong
https://arxiv.org/abs/2508.18260
The Trump administration is planning to change the visa system for skilled foreign workers,
a program at the center of a dispute between immigration hard-liners and tech industry leaders,
said the new director of U.S. Citizenship and Immigration Services.
Joseph Edlow, the director of U.S.C.I.S., said the test to become a U.S. citizen was "too easy" and should change.
“The test as it’s laid out right now, it’s not very difficult,”
Mr. Edlow said on Thur…
Russia test-fires Kalibr and Uran missiles during naval exercises in the Sea of Japan: https://benborges.xyz/2025/08/25/russia-testfires-kalibr-and-uran.html
The much anticipated tenth test flight of the #Starship - an assessment in https://milesobrien.substack.com/p/starship-at-the-crossroads-can-test - could come at 23:50 UTC tonight: SpaceX's webcast will be at https://x.com/i/broadcasts/1yoKMPRjeYYxQ while others are already running at https://www.youtube.com/watch?v=1qmknQKRsIM and https://www.youtube.com/live/C7WmlTp7ue0 and https://www.youtube.com/live/QFFysWdL4wI
Independence Testing for Mixed Data
Dana Bucalo Jeli\'c, Marija Cupari\'c, Bojana Milo\v{s}evi\'c
https://arxiv.org/abs/2507.20609 https://arxi…
A test of seven AI chatbots' abilities to identify news photos' location, date, and photographer showed all failed to consistently identify photos' provenance (Columbia Journalism Review)
https://www.cjr.org/tow_center/why-ai-models-are-bad-at-verifyi…
Heute vor 67 Jahren: Am 29.06.1958 kam es bei Enewetak zum Atomtest Operation Hardtack I, "Hickory". Dieser Test war Teil einer Serie von 35 #Atomtests, die die USA im Sommer 1958 auf den #Marshallinseln im Pazifik durchführten.
Is Lindblad for me?
Martino Stefanini, Aleksandra A. Ziolkowska, Dmitry Budker, Ulrich Poschinger, Ferdinand Schmidt-Kaler, Antoine Browaeys, Atac Imamoglu, Darrick Chang, Jamir Marino
https://arxiv.org/abs/2506.22436
Interactive Adversarial Testing of Autonomous Vehicles with Adjustable Confrontation Intensity
Yicheng Guo, Chengkai Xu, Jiaqi Liu, Hao Zhang, Peng Hang, Jian Sun
https://arxiv.org/abs/2507.21814
A Slice-Based Change Impact Analysis for Regression Test Case Prioritization of Object-Oriented Programs
S. Panda, D. Munjal, D. P. Mohapatra
https://arxiv.org/abs/2508.19056 ht…
AI-Driven Media & Synthetic Knowledge: Rethinking Society in Generative Futures
Katalin Feher
https://arxiv.org/abs/2507.19877 https://arxiv.org/pdf/25…
The next question after Turing's question: Introducing the Grow-AI test
Alexandru Tugui
https://arxiv.org/abs/2508.16277 https://arxiv.org/pdf/2508.162…
LLM-based Property-based Test Generation for Guardrailing Cyber-Physical Systems
Khashayar Etemadi, Marjan Sirjani, Mahshid Helali Moghadam, Per Strandberg, Paul Pettersson
https://arxiv.org/abs/2505.23549
Anthropic tested Claude's ability to manage a physical "storefront" to mixed results, as the AI struggled with pricing strategy and inventory management (Ryan Daws/AI News)
https://www.artificialintelligence-news.co
Overview of ADoBo at IberLEF 2025: Automatic Detection of Anglicisms in Spanish
Elena Alvarez-Mellado, Jordi Porta-Zamorano, Constantine Lignos, Julio Gonzalo
https://arxiv.org/abs/2507.21813
KI-Update kompakt: Gemini Live, AI Mode, GPT-5-Test, KI-Psychosen
Das "KI-Update" liefert werktäglich eine Zusammenfassung der wichtigsten KI-Entwicklungen.
https://www.…
Heute vor 67 Jahren: Am 30.05.1958 kam es bei Enewetak zum Atomtest Operation Hardtack I, "Tobacco". Dieser Test war Teil einer Serie von 35 #Atomtests, die die USA im Sommer 1958 auf den #Marshallinseln im Pazifik durchführten.
50 years ago #OTD at 16:09 UTC #Soyuz 19 and the final #Apollo spacecraft docked in orbit in the Apollo–Soyuz Test Project or Экспериментальный полёт «Союз»–«Аполлон» (the Americans and Soviets never agreed on which s/c comes first): https://www.nasa.gov/apollo-soyuz-test-project/ = various NASA materials, https://historycollection.jsc.nasa.gov/JSCHistoryPortal/history/astp.htm = many links, https://www.nasa.gov/wp-content/uploads/2023/04/sp-4209.pdf = a biiiig book, https://www.facebook.com/AstroDrewMorgan/posts/pfbid0wcN12LcAfRYYTmiWS71GQ47P42jukXVhd2QWA7eBzgg1wbdZAKq8LXVGTC9o7tKtl and https://www.facebook.com/ralf.heckel1/posts/pfbid02jcFp85PRy2UBUs4JVhPQcBnmddC8sQbPL3Vx26XoFMvbbsToNJmpLMDrRfce98HPl and https://www.facebook.com/groups/176051159106442/posts/24331420949809459 and https://www.facebook.com/groups/48800622850/posts/10162318303047851 = more visuals, https://www.youtube.com/live/Mu7iEyaOmDM = an online event, https://arstechnica.com/space/2025/07/not-that-into-peace-doves-the-apollo-soyuz-patch-nasa-rejected/ and https://www.spacerockethistory.com/2025/02/space-rocket-history-457-apollo-soyuz-test-project-soviet-concerns-with-apollo/ and http://www.collectspace.com/news/news-071425a-apollo-soyuz-test-project-astp-official-flight-kit-ofk-apk.html and https://www.nytimes.com/2025/07/14/science/apollo-soyuz-test-project-1975-anniversary.html?smid=bs-share = articles and https://scicomm.xyz/@Nick_Stevens_graphics@mastodon.art/114858897568849751 = a contemporary cartoon.
heise | Beziehungshelfer: Networking-App Dextr im Test
Das Adressbuch Dextr verknüpft Kontakte und visualisiert sie als Netzwerk. Wir haben die iOS- und iPadOS-App getestet
https://www.heise.de…
Model-Free Hovering and Source Seeking via Extremum Seeking Control: Experimental Demonstration
Ahmed A. Elgohary, Rohan Palanikumar, Sameh A. Eisa
https://arxiv.org/abs/2508.20836
Sequential Diagnosis with Language Models
Harsha Nori, Mayank Daswani, Christopher Kelly, Scott Lundberg, Marco Tulio Ribeiro, Marc Wilson, Xiaoxuan Liu, Viknesh Sounderajah, Jonathan Carlson, Matthew P Lungren, Bay Gross, Peter Hames, Mustafa Suleyman, Dominic King, Eric Horvitz
https://arxiv.org/abs/2506.22405
Noch einige der zuletzt hier besonders häufig geteilten #News:
CachyOS im Test: Wie schnell kann ein Linux sein?
https://www.
SATORI: Static Test Oracle Generation for REST APIs
Juan C. Alonso, Alberto Martin-Lopez, Sergio Segura, Gabriele Bavota, Antonio Ruiz-Cort\'es
https://arxiv.org/abs/2508.16318
RoD-TAL: A Benchmark for Answering Questions in Romanian Driving License Exams
Andrei Vlad Man, R\u{a}zvan-Alexandru Sm\u{a}du, Cristian-George Craciun, Dumitru-Clementin Cercel, Florin Pop, Mihaela-Claudia Cercel
https://arxiv.org/abs/2507.19666
heise | PV für unterwegs: Solar-Campingtisch TX-252 mit Doppelfunktion im Test
Am Solar-Campingtisch TX-252 kann man sowohl essen als auch Geräte wie Smartphones aufladen.
https://www.
Noch einige der zuletzt hier besonders häufig geteilten #News:
CachyOS im Test: Wie schnell kann ein Linux sein?
https://www.
YATE: The Role of Test Repair in LLM-Based Unit Test Generation
Michael Konstantinou, Renzo Degiovanni, Jie M. Zhang, Mark Harman, Mike Papadakis
https://arxiv.org/abs/2507.18316
LLM4VV: Evaluating Cutting-Edge LLMs for Generation and Evaluation of Directive-Based Parallel Programming Model Compiler Tests
Zachariah Sollenberger, Rahul Patel, Saieda Ali Zada, Sunita Chandrasekaran
https://arxiv.org/abs/2507.21447
Quantum-Based Software Engineering
Jianjun Zhao
https://arxiv.org/abs/2505.23674 https://arxiv.org/pdf/2505.23674
heise | Austauschen statt wegwerfen: Powerbanks mit Wechselakkus im Test
Streikt die Powerbank, landet sie meist im Schrott. Nachhaltiger gehts mit einer leicht zu öffnenden Powerbank für 18650-Akkus.
http…
Replaced article(s) found for cs.SE. https://arxiv.org/list/cs.SE/new
[1/1]:
- CoCoEvo: Co-Evolution of Programs and Test Cases to Enhance Code Generation
Kefan Li, Yuan Yuan, Hongyue Yu, Tingyu Guo, Shijie Cao
Resolving Build Conflicts via Example-Based and Rule-Based Program Transformations
Sheikh Shadab Towqir, Fei He, Todd Mytkowicz, Na Meng
https://arxiv.org/abs/2507.19432 https:/…
Boosting Skeleton-Driven SMT Solver Fuzzing by Leveraging LLM to Produce Formula Generators
Maolin Sun, Yibiao Yang, Yuming Zhou
https://arxiv.org/abs/2508.20340 https://…
Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization
Mingzhe Du, Luu Tuan Tuan, Yue Liu, Yuhao Qing, Dong Huang, Xinyi He, Qian Liu, Zejun Ma, See-kiong Ng
https://arxiv.org/abs/2505.23387
Search-Based Fuzzing For RESTful APIs That Use MongoDB
Hernan Ghianni, Man Zhang, Juan P. Galeotti, Andrea Arcuri
https://arxiv.org/abs/2507.20848 https://…
Black-Box Bug-Amplification for Multithreaded Software
Yeshayahu Weiss, Gal Amram, Achiya Elyasaf, Eitan Farchi, Oded Margalit, Gera Weiss
https://arxiv.org/abs/2507.21318 https…
Testing Is Not Boring: Characterizing Challenge in Software Testing Tasks
Davi Gama Hardman, Cesar Fran\c{c}a, Brody Stuart-Verner, Ronnie de Souza Santos
https://arxiv.org/abs/2507.20407