Tootfinder

Opt-in global Mastodon full text search. Join the index!

@thomasfuchs@hachyderm.io
2025-11-02 15:08:33

Computer scientists: The Turing test is a test of a machine's ability to exhibit intelligent behavior equivalent to that of a human.
Reality: The Turing test is a test of a human's ability not to be fooled by themselves that a machine is mimicking a human.
(Because the test uses a human to decide if something else is a human, it essentially explores the gullibility limit of an individual human—and says nothing about machine intelligence.)

@macandi@social.heise.de
2025-09-02 06:03:00

heise | 12 zu 1: Thunderbolt-5-Dock von Ugreen im Test
Das Revodok Max U715 macht aus einem Thunderbolt-5-Anschluss zwölf unterschiedliche Ports.
heise.de/tests/12-zu-1-Thunder

@cyrevolt@mastodon.social
2025-10-02 16:42:30

Getting there with SpacemiT K1x image building in @… :
running 2 tests
test starfive::visionfive2_hdr::test_hdr ... ok
test spacemit::k1x_hdr::sign ... ok
test result: ok. 2 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out; finished in 0.00s
Alright, time for some fresh air!

@arXiv_csSE_bot@mastoxiv.page
2025-09-03 08:47:33

JS-TOD: Detecting Order-Dependent Flaky Tests in Jest
Negar Hashemi, Amjed Tahir, Shawn Rasheed, August Shi, Rachel Blagojevic
arxiv.org/abs/2509.00466

At 1:09 a.m. EDT on November 2,
SpaceX launched a Falcon 9 rocket from Cape Canaveral, carrying Haven Demo and 17 other satellites,
including ones that will be operated by South Korea's Agency for Defense Development (ADD),
the Berlin-based company Exolaunch,
Turkey's Fergani Space,
and U.S. weather-forecasting outfit Tomorrow Companies,
and Starcloud, which carried an NVIDIA H100 AI chip to see if it’s possible to build an artificial intelligence-…

@waidler@bayerwald.social
2025-11-02 12:41:02

RE: bayerwald.social/@system/11548
Test Test Test Test ..... Zitierfunktion.

@filmfacts@social.tchncs.de
2025-09-30 11:16:59

Test
test
andreas-edler.de/blog/2025/09/

@arXiv_csCV_bot@mastoxiv.page
2025-09-03 14:55:53

ADVMEM: Adversarial Memory Initialization for Realistic Test-Time Adaptation via Tracklet-Based Benchmarking
Shyma Alhuwaider, Motasem Alfarra, Juan C. Perez, Merey Ramazanova, Bernard Ghanem
arxiv.org/abs/2509.02182

@hikingdude@mastodon.social
2025-11-01 19:43:56

Omg, there is a new Simon the Sourcerer game?
stadt-bremerhaven.de/simon-the

@arXiv_csCL_bot@mastoxiv.page
2025-09-01 09:44:32

Normality and the Turing Test
Alexandre Kabbach
arxiv.org/abs/2508.21382 arxiv.org/pdf/2508.21382

@UP8@mastodon.social
2025-10-03 14:04:28

🚀 Powering a path to Mars with reactor test bed
#space

@philip@mastodon.mallegolhansen.com
2025-11-01 21:49:47

@… Tangential but you may find it interesting: In Kent Beck’s book on Test Driven Development one of the examples is to build your own testing framework in Python, using that very same framework to test the code for the framework you are writing.
It was so much fun to do. I could imagine a parallel where you build a CI service, using the same to deploy changes to yo…

@arXiv_qbioQM_bot@mastoxiv.page
2025-10-03 08:33:21

To Remember, To Adapt, To Preempt: A Stable Continual Test-Time Adaptation Framework for Remote Physiological Measurement in Dynamic Domain Shifts
Shuyang Chu, Jingang Shi, Xu Cheng, Haoyu Chen, Xin Liu, Jian Xu, Guoying Zhao
arxiv.org/abs/2510.01282

@arXiv_csLG_bot@mastoxiv.page
2025-10-03 11:03:31

Test-Time Anchoring for Discrete Diffusion Posterior Sampling
Litu Rout, Andreas Lugmayr, Yasamin Jafarian, Srivatsan Varadharajan, Constantine Caramanis, Sanjay Shakkottai, Ira Kemelmacher-Shlizerman
arxiv.org/abs/2510.02291

@heiseonline@social.heise.de
2025-10-30 14:52:00

heise | Reale Ort in VR-Umgebungen verwandeln: "Meta Hyperscape" für Quest 3 im Test
Mit "Meta Hyperscape" lassen sich aus echten Räumen fotorealistische VR-Umgebungen erschaffen. Wir haben das in Innenräumen und im Freien ausprobiert.

@arXiv_csAI_bot@mastoxiv.page
2025-10-02 10:36:41

Test-Time Search in Neural Graph Coarsening Procedures for the Capacitated Vehicle Routing Problem
Yoonju Sim, Hyeonah Kim, Changhyun Kwon
arxiv.org/abs/2510.00958

@gedankenstuecke@scholar.social
2025-11-01 01:21:05

«if you have to choose a “pedia” to trust, you might choose the one assembled by a bunch of pedantic nerds saying “well, ACTUALLY” to each other until the heat death of the universe, over the one assembled by an LLM controlled by an insecure Nazi salute-throwing billionaire»
A Review of Grokipedia, Using Myself as Test Subject | Whatever
whatever.scalzi.com/2025/10/30

@raiders@darktundra.xyz
2025-09-01 20:01:29

Pete Carroll vs. Mike Vrabel: Raiders Face Brutal Week 1 Test In Foxborough raiderramble.com/2025/09/01/pe

@cowboys@darktundra.xyz
2025-09-03 04:04:51

Test your NFL knowledge with our Connections: Sports Edition team-themed games nytimes.com/athletic/6592714/2

@kcase@mastodon.social
2025-09-01 04:34:32

Just posted a public test build of OmniDiskSweeper 1.16:
omnistaging.omnigroup.com/omni
• Accessible Sizes — Each file now provides an accessibility title (e.g. “Desktop"), an accessibility value describing its size (e.g. “need permission to fully…

@Techmeme@techhub.social
2025-09-03 20:07:58

Sources: Apple plans an AI search tool, World Knowledge Answers, for spring 2026 as part of a Siri revamp; Apple and Google plan to test a Google model for Siri (Mark Gurman/Bloomberg)
bloomberg.com/news/articles/20

@thijs_lucas@norden.social
2025-09-29 15:59:09

Licht und Dashcam am Rad
spiegel.de/tests/fahrrad-zubeh

@davidaugust@mastodon.online
2025-08-03 06:48:11

My non-exhaustive test of @… triggered through @ap.brid.gy bridge between Bluesky and Fediverse has these results:
Bluesky reply to Fediverse post = no joy

@arXiv_statML_bot@mastoxiv.page
2025-10-01 09:45:47

Test time training enhances in-context learning of nonlinear functions
Kento Kuwataka, Taiji Suzuki
arxiv.org/abs/2509.25741 arxiv.org/pdf/…

@arXiv_qbioNC_bot@mastoxiv.page
2025-09-03 10:04:03

Improving Electroencephalogram-Based Deception Detection in Concealed Information Test under Low Stimulus Heterogeneity
Suhye Kim, Jaehoon Cheon, Taehee Kim, Seok Chan Kim, Chang-Hwan Im
arxiv.org/abs/2509.02234

@arXiv_csRO_bot@mastoxiv.page
2025-10-02 10:29:11

Compose Your Policies! Improving Diffusion-based or Flow-based Robot Policies via Test-time Distribution-level Composition
Jiahang Cao, Yize Huang, Hanzhong Guo, Rui Zhang, Mu Nan, Weijian Mai, Jiaxu Wang, Hao Cheng, Jingkai Sun, Gang Han, Wen Zhao, Qiang Zhang, Yijie Guo, Qihao Zheng, Chunfeng Song, Xiao Li, Ping Luo, Andrew F. Luo

@arXiv_statME_bot@mastoxiv.page
2025-09-03 13:09:23

Bayesian Estimation and Regularization Techniques in Categorical Data Analysis
Jan Kalina
arxiv.org/abs/2509.02222 arxiv.org/pdf/2509.02222…

@arXiv_csSE_bot@mastoxiv.page
2025-10-03 09:00:21

Clarifying Semantics of In-Context Examples for Unit Test Generation
Chen Yang, Lin Yang, Ziqi Wang, Dong Wang, Jianyi Zhou, Junjie Chen
arxiv.org/abs/2510.01994

@azonenberg@ioc.exchange
2025-09-02 08:15:24

Progress! Got the STM32H750 test board to the point of being able to light up some LEDs on GPIOs before jumping into an infinite loop.
Next step will be all of the RCC and PWR config to get it operational at the desired VOS and clock configuration, at which point I can start thinking about bringing up a UART.
But that will probably have to wait till tomorrow given that it's 1 in the morning and I have work tomorrow.

@fanf@mendeddrum.org
2025-09-02 20:42:03

from my link log —
R0ML’s ratio bozo test: Is your volume discount a good deal? Who nose!
blog.glyph.im/2025/08/r0mls-ra
saved 2025-08-10

@arXiv_csCV_bot@mastoxiv.page
2025-09-03 14:57:13

SynthGenNet: a self-supervised approach for test-time generalization using synthetic multi-source domain mixing of street view images
Pushpendra Dhakara, Prachi Chachodhia, Vaibhav Kumar
arxiv.org/abs/2509.02287

@arXiv_physicsinsdet_bot@mastoxiv.page
2025-09-03 08:54:52

Beam test results of the Intermediate Silicon Tracker for sPHENIX
C. W. Shih, G. Nukazuka, Y. Sugiyama, Y. Akiba, H. En'yo, T. Hachiya, S. Hasegawa, M. Hata, H. Imai, C. M. Kuo, M. Morita, I. Nakagawa, Y. Nakamura, G. Nakano, Y. Namimoto, R. Nouicer, M. Shibata, M. Shimomura, R. Takahama, K. Toho, M. Tsuruta, M. Watanabe
arxiv.…

@publicvoit@graz.social
2025-08-03 14:13:22

"In Deutschland stehen alle wild lebenden #Tiere unter dem allgemeinen Schutz des Bundesnaturschutzgesetzes. Das Töten ohne vernünftigen Grund ist verboten. Bei Lästlingen wie #Stubenfliegen, #Stechmücken

35-year-old former U.S. Army sergeant, Bajun “Baji” Mavalwalla II,
faces up to six years in prison
for protesting against ICE deportations
in what legal experts are calling a test case for the
Trump administration’s attempts to criminalize and punish dissent.
Mavalwalla was arrested and charged with “conspiracy to impede or injure officers”
after he was identified in a video taken at the protest and shared on Instagram.
Mavalwalla, who survived a ro…

@karlauerbach@sfba.social
2025-10-02 18:15:05

I was amused that on a recent episode of Gen V that a lab-test animal (that gets turned into a blob of blood and gore) is named "Elon Musk" in order to reduce sympathy for its fate.

@arXiv_csSD_bot@mastoxiv.page
2025-10-01 08:39:17

EMO-TTA: Improving Test-Time Adaptation of Audio-Language Models for Speech Emotion Recognition
Jiacheng Shi, Hongfei Du, Y. Alicia Hong, Ye Gao
arxiv.org/abs/2509.25495

@nemorosa@mastodon.nu
2025-10-02 08:14:50

Igår körde jag ett 5km-test. Uppvärmning 5km nerjogg. Alltihop tog 46 minuter, men halvmilen gick på facila 35:51. Pulsen höll sig stadig och låg.
Utan klockan hade jag inte kunnat hålla ordning på hur många varv jag sprungit, jag blir tankspridd när jag springer.
Det var ett väldigt trevligt besked, att jag kan tuffa runt på en idrottsplats i 7:10/km i snitt och bli tankspridd. Då går jag inte på max.
Trevligt att veta, Jönköping 10km går av stapeln den 18 oktober. Då …

Selfie med en idrottsplatsen som bakgrund. Solen i ansiktet och mörka moln bakom.
@grahamperrin@bsd.cafe
2025-10-03 02:03:22

alpha test period extended
👍
#FreeBSD

@deprogrammaticaipsum@mas.to
2025-10-02 18:31:05

"Let us return to 2000: a few days before Joel Spolsky published his Test, Mark Lucovsky gave a talk titled “Windows: A Software-Engineering Odyssey” at the 4th USENIX Windows System Symposium in Seattle, Washington. He was a member of the original Windows NT team from 1988 to the mid-2000s. The PowerPoint slides of the talk are still available online, and I seriously recommend you take a look at them.
Because part of the “odyssey” was, you guessed it, source control."

@hex@kolektiva.social
2025-10-02 21:37:36

Just to hammer this home a bit more, I knew a guy who joined the army. I met him after he went AWOL. When he went into the recruiter's office, they asked him if he had any open warrants out for his arrest because they couldn't recruit anyone with a criminal record. He said he did, then they said, "oh, actually, we can help you with all that, don't get caught before we ship you out."
He was just trying to keep himself out of jail. That's not supposed to happen, but it does. IIRC it was on a drug charge, which, also, they're not supposed to take anyone who tests positive for weed... but they also just tell you how to prepare for a drug test.
Another friend joined because she wanted to be part Army Corps of Engineers. A good chunk of the folks I went to school with joined the military after graduation because the other choices were working at the saw mill or working at the canary. If you join the military, you get to go to college. (Or you get to stay out of jail... as long as you don't go AWOL.)

@A_Katie_Mix@hessen.social
2025-11-02 19:38:10

🤾⛹️‍♂️🏃‍♂️ #DHB verlor das zweite Testspiel gegen Island vor der EM knapp und zeigte noch Schwächen – im Angriff oft zu unpräzise, in der Defensive anfällig. Das war vor zwei Tagen besser. Zehn Wochen vor dem Turnierstart in Dänemark, Schweden und Norwegen gilt: Grundordnung stärken Stabilität finden.
#Handball

@arXiv_csAI_bot@mastoxiv.page
2025-10-03 10:28:11

Test-Time Search in Neural Graph Coarsening Procedures for the Capacitated Vehicle Routing Problem
Yoonju Sim, Hyeonah Kim, Changhyun Kwon
arxiv.org/abs/2510.00958

@marcus@hachyderm.io
2025-08-03 15:57:57

nix-unit looks like a nice way to test your #nix libraries clan.lol/blog/nix-unit/

@arXiv_csSE_bot@mastoxiv.page
2025-10-02 10:06:21

CodeChemist: Functional Knowledge Transfer for Low-Resource Code Generation via Test-Time Scaling
Kaixin Wang, Tianlin Li, Xiaoyu Zhang, Aishan Liu, Xianglong Liu, Ziqi Liu, Zhiqiang Zhang, Jun Zhou, and Bin Shi
arxiv.org/abs/2510.00501

@arXiv_astrophSR_bot@mastoxiv.page
2025-10-01 08:45:18

Test particle sampling and particle acceleration in a 2D coronal plasmoid-mediated reconnecting current sheet
Eilif S. {\O}yre, Boris V. Gudiksen, Lyndsay Fletcher
arxiv.org/abs/2509.25447

@heiseonline@social.heise.de
2025-10-01 08:31:00

Güterverkehr: Autonomer Elektro-Truck überschreitet selbstständig Grenze
Einride will den grenzüberschreitenden autonomen Güterverkehr voranbringen. In einem ersten Test zwischen Schweden und Norwegen hat das geklappt.

@NFL@darktundra.xyz
2025-10-02 11:02:29

This Week in Sports Trivia: October 2, 2025 nytimes.com/athletic/6680446/2

@misterbrisby@social.tchncs.de
2025-10-31 11:08:48

Temu und Shein - Gefähr­liche Schnäpp­chen:
Stiftung Warentest: "Unsicheres Spielzeug, giftige Schwer­metalle in Schmuck, zu heiße Ladegeräte: Wir haben 162 Produkte von Temu und Shein getestet. 110 erfüllten nicht die EU-Stan­dards."
test.de/Temu-und-Shein-Gefaehr

@lilmikesf@c.im
2025-10-02 04:22:58

In 1934, Dorothy Thompson, wife of Sinclair Lewis, became first journalist expelled from #Hitler's Germany, after interviewing him and finding him "the very prototype of the little man "
By 1941, she'd written an influential WWII era essay that has stood the test of time, on who amongst us makes a likely #Nazi

It is an interesting and somewhat macabre parlor game to play at a large gathering of one's acquaintances: to speculate who in a showdown would go Nazi. By now, I think I know. I have gone through the experience many times—in Germany, in Austria, and in France. I have come to know the types: the born Nazis, the Nazis whom democracy itself has created, the certain-to-be fellow-travelers. And I also know those who never, under any conceivable circumstances, would become Nazis.
@memeorandum@universeodon.com
2025-08-30 10:01:42

With newly approved maps in Texas, GOP puts its gains with Latinos to the test (Claudia Grisales/NPR)
npr.org/2025/08/29/nx-s1-55126
memeorandum.com/250830/p4#a250

@gwire@mastodon.social
2025-09-03 11:58:34

Astounded to discover someone unaware that there's a UK national alert test scheduled for Sunday. Which indicates what kind of weird bubble I live in.

@Techmeme@techhub.social
2025-09-03 11:05:56

SK Hynix and ASML say they assembled the industry's first Twinscan NXE:5200B High-NA EUV lithography system at SK's M16 fab, initially to test next-gen tech (Anton Shilov/Tom's Hardware)
tomshard…

@davidaugust@mastodon.online
2025-08-03 06:11:14

This fediverse post is designed to test a bot running on Bluesky that can add alt text to images.
This post will have 2 replies, one from the fediverse and one from Bluesky, trying to trigger the bot.
This image in this post does not have alt text included with it by design.
#testing

@Dragofix@veganism.social
2025-08-31 20:43:09

Panic, Terror, and Near Drowning: Ban the Forced Swim Test #AnimalRights

@macandi@social.heise.de
2025-10-30 07:03:00

heise | Bildaufhübscher: Elgato Facecam 4K im Test
Mit Wechselfiltern und hoher Bildqualität will Elgatos Facecam 4K Streamer überzeugen. Wie schlägt sie sich gegenüber der Konkurrenz?

@azonenberg@ioc.exchange
2025-09-28 05:00:19

8 coax lanes, DC - 70 GHz, solderless compression fit... *drool*
hubersuhner.com/en/newsroom/bl

@arXiv_csCL_bot@mastoxiv.page
2025-10-01 11:38:17

Searching for Difficult-to-Translate Test Examples at Scale
Wenda Xu, Vil\'em Zouhar, Parker Riley, Mara Finkelstein, Markus Freitag, Daniel Deutsch
arxiv.org/abs/2509.26619

@fanf@mendeddrum.org
2025-09-03 20:42:03

from my link log —
Golang cryptographic assembly mutation testing.
words.filippo.io/assembly-muta
saved 2025-07-31

@arXiv_csCV_bot@mastoxiv.page
2025-09-01 10:00:22

DriveQA: Passing the Driving Knowledge Test
Maolin Wei, Wanzhou Liu, Eshed Ohn-Bar
arxiv.org/abs/2508.21824 arxiv.org/pdf/2508.21824

@arXiv_csSE_bot@mastoxiv.page
2025-09-03 12:00:13

Methodology for Test Case Allocation based on a Formalized ODD
Martin Skoglund, Fredrik Warg, Anders Thoren, Sasikumar Punnekkat, Hans Hansson
arxiv.org/abs/2509.02311

@heiseonline@social.heise.de
2025-10-29 14:37:00

heise | Heizlüfter mit Bitcoin-Miner: Ofen 2 im Test
Bitcoins schürfen und damit kostengünstig die Wohnung heizen: Das verspricht der Bitcoin-Heizlüfter Ofen 2 von 21energy. Wir testen, ob er sich rechnet.

@arXiv_statME_bot@mastoxiv.page
2025-10-02 08:59:41

Remote Auditing: Design-based Tests of Randomization, Selection, and Missingness with Broadly Accessible Satellite Imagery
Connor T. Jerzak, Adel Daoud
arxiv.org/abs/2510.00128

@arXiv_csSE_bot@mastoxiv.page
2025-10-02 10:35:11

GenIA-E2ETest: A Generative AI-Based Approach for End-to-End Test Automation
Elvis J\'unior, Alan Valejo, Jorge Valverde-Rebaza, V\^ania de Oliveira Neves
arxiv.org/abs/2510.01024

@arXiv_csLG_bot@mastoxiv.page
2025-10-01 11:58:27

Recursive Self-Aggregation Unlocks Deep Thinking in Large Language Models
Siddarth Venkatraman, Vineet Jain, Sarthak Mittal, Vedant Shah, Johan Obando-Ceron, Yoshua Bengio, Brian R. Bartoldson, Bhavya Kailkhura, Guillaume Lajoie, Glen Berseth, Nikolay Malkin, Moksh Jain
arxiv.org/abs/2509.26626

Vladimir Putin boasted this week that Russia has tested a
nuclear-powered super torpedo,
the Poseidon, that was unstoppable and more powerful than a nuclear missile,
the second announcement in a week of Russian trials involving
nuclear-powered weapons systems.

Days earlier, Russia announced the test of a new nuclear-powered cruise missile,
the Burevestnik or Skyfall,
that seemed in particular to irritate Donald Trump.
In the wake of the tests,…

@waidler@bayerwald.social
2025-11-02 12:27:41

RE: mastodon.social/@martinmoorlan
Toll! Und gleichzeitig soll dies für mich ein Test sein für die neue Zitierfunktion von Mastodon, die ich heute über das RC2 Softwarerelease eingebaut habe. Mal schauen, wie das Ganze na…

@NFL@darktundra.xyz
2025-11-02 17:01:51

Mahomes-Allen Part X will expose fatal flaws for both contenders ahead of NFL's trade deadline

cbssports.com/nfl/news/mahomes

@arXiv_statML_bot@mastoxiv.page
2025-10-01 10:08:47

Pretrain-Test Task Alignment Governs Generalization in In-Context Learning
Mary I. Letey, Jacob A. Zavatone-Veth, Yue M. Lu, Cengiz Pehlevan
arxiv.org/abs/2509.26551

@arXiv_csCL_bot@mastoxiv.page
2025-10-01 11:24:47

Text-Based Approaches to Item Alignment to Content Standards in Large-Scale Reading & Writing Tests
Yanbin Fu, Hong Jiao, Tianyi Zhou, Robert W. Lissitz, Nan Zhang, Ming Li, Qingshu Xu, Sydney Peters
arxiv.org/abs/2509.26431

@macandi@social.heise.de
2025-10-29 07:03:00

heise | Gezielt trainieren: Suunto Race 2 im Test
Suunto ergänzt bei seiner Sportuhr Race 2 ein Sprachfeedback und Trainingspläne. Wie gut ist die Fitness-Smartwatch?
heise.de/tests/Gezi…

@philip@mastodon.mallegolhansen.com
2025-09-29 20:55:58

My latest haul, for anyone interested, contains the following volumes:
Test Driven Development by Example: powells.com/book/test-driven-d
Building Serverless Applications on Knative:

@UP8@mastodon.social
2025-09-26 14:32:57

🚁 DoorDash plans to test drone deliveries in San Francisco warehouse
latimes.com/business/story/202
🆓

@arXiv_csSE_bot@mastoxiv.page
2025-10-02 10:01:31

Beyond Pass/Fail: The Story of Learning-Based Testing
Sheikh Md. Mushfiqur Rahman, Nasir Eisty
arxiv.org/abs/2510.00450 arxiv.org/pdf/2510.…

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 11:53:37

TTT3R: 3D Reconstruction as Test-Time Training
Xingyu Chen, Yue Chen, Yuliang Xiu, Andreas Geiger, Anpei Chen
arxiv.org/abs/2509.26645 arxi…

@heiseonline@social.heise.de
2025-09-26 14:28:00

heise | Toniebox 2 im Test: Audioplayer mit Spielfunktion fürs Kinderzimmer
Das beliebte Hörspielsystem Toniebox ist in der zweiten Version erschienen und unterstützt erstmals auch Spiele. Dazu kommen viele Detailänderungen.

@arXiv_csAI_bot@mastoxiv.page
2025-10-02 10:30:01

Logical Consistency Between Disagreeing Experts and Its Role in AI Safety
Andr\'es Corrada-Emmanuel
arxiv.org/abs/2510.00821 arxiv.org/…

@arXiv_csCL_bot@mastoxiv.page
2025-09-30 14:00:11

LatentEvolve: Self-Evolving Test-Time Scaling in Latent Space
Guibin Zhang, Fanci Meng, Guancheng Wan, Zherui Li, Kun Wang, Zhenfei Yin, Lei Bai, Shuicheng Yan
arxiv.org/abs/2509.24771

@arXiv_csSE_bot@mastoxiv.page
2025-09-03 11:39:03

ProbTest: Unit Testing for Probabilistic Programs (Extended Version)
Katrine Christensen, Mahsa Varshosaz, Ra\'ul Pardo
arxiv.org/abs/2509.02012

Trump directs Pentagon to test nuclear weapons, just before meeting China’s Xi
Donald Trump on Thursday morning said he directed the Pentagon to begin testing nuclear weapons
“on an equal basis” with Russia and China,
-- abruptly inserting nuclear issues into the discussion just before meeting his Chinese counterpart, Xi Jinping, for a high-stakes trade summit here.

The announcement signaled a reversal of decades of United States nuclear policy that could have far-re…

@arXiv_csLG_bot@mastoxiv.page
2025-10-02 12:43:02

Crosslisted article(s) found for cs.LG. arxiv.org/list/cs.LG/new
[5/5]:
- Compose Your Policies! Improving Diffusion-based or Flow-based Robot Policies via Test-time Distr...
Cao, Huang, Guo, Zhang, Nan, Mai, Wang, Cheng, Sun, Han, Zhao, Zhang, Guo, Zheng, Song, Li, Luo, Luo

@arXiv_csSE_bot@mastoxiv.page
2025-09-01 09:00:12

Reusable Test Suites for Reinforcement Learning
J{\o}rn Eirik Betten, Quentin Mazouni, Dennis Gross, Pedro Lind, Helge Spieker
arxiv.org/abs/2508.21553

@arXiv_csAI_bot@mastoxiv.page
2025-10-03 10:21:01

Logical Consistency Between Disagreeing Experts and Its Role in AI Safety
Andr\'es Corrada-Emmanuel
arxiv.org/abs/2510.00821 arxiv.org/…

@arXiv_csLG_bot@mastoxiv.page
2025-10-02 11:07:21

The Good, the Bad, and the Sampled: a No-Regret Approach to Safe Online Classification
Tavor Z. Baharav, Spyros Dragazis, Aldo Pacchiano
arxiv.org/abs/2510.01020

@arXiv_csSE_bot@mastoxiv.page
2025-10-01 10:58:17

EQ-Robin: Generating Multiple Minimal Unique-Cause MC/DC Test Suites
Robin Lee, Youngho Nam
arxiv.org/abs/2509.26458 arxiv.org/pdf/2509.264…

@arXiv_csAI_bot@mastoxiv.page
2025-09-03 07:39:40

Entropy-Guided Loop: Achieving Reasoning through Uncertainty-Aware Generation
Andrew G. A. Correa, Ana C. H de Matos
arxiv.org/abs/2509.00079

@heiseonline@social.heise.de
2025-10-24 14:45:00

heise | Endlich präziser spielen: PSVR2-Controller im Test mit der Apple Vision Pro
Endlich lässt sich die Apple Vision Pro auch mit VR-Controllern bedienen. Wir haben die Unterstützung der PSVR2-Controller in Spielen und Menüs getestet.

@arXiv_csCV_bot@mastoxiv.page
2025-09-03 14:58:23

Hues and Cues: Human vs. CLIP
Nuria Alabau-Bosque, Jorge Vila-Tom\'as, Paula Daud\'en-Oliver, Pablo Hern\'andez-C\'amara, Jose Manuel Ja\'en-Lorites, Valero Laparra, Jes\'us Malo
arxiv.org/abs/2509.02305

@arXiv_csLG_bot@mastoxiv.page
2025-10-03 11:01:01

StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?
Yanxu Chen, Zijun Yao, Yantao Liu, Jin Ye, Jianing Yu, Lei Hou, Juanzi Li
arxiv.org/abs/2510.02209

@arXiv_csSE_bot@mastoxiv.page
2025-10-02 10:32:31

Enhancing Software Testing Education: Understanding Where Students Struggle
Shiza Andleeb, Teo Mendoza, Lucas Cordova, Gursimran Walia, Jeffrey C. Carver
arxiv.org/abs/2510.00957

@arXiv_csCV_bot@mastoxiv.page
2025-09-03 14:58:03

Data-Driven Loss Functions for Inference-Time Optimization in Text-to-Image Generation
Sapir Esther Yiflach, Yuval Atzmon, Gal Chechik
arxiv.org/abs/2509.02295

@arXiv_csAI_bot@mastoxiv.page
2025-10-01 11:46:17

Probing the Critical Point (CritPt) of AI Reasoning: a Frontier Physics Research Benchmark
Minhui Zhu, Minyang Tian, Xiaocheng Yang, Tianci Zhou, Penghao Zhu, Eli Chertkov, Shengyan Liu, Yufeng Du, Lifan Yuan, Ziming Ji, Indranil Das, Junyi Cao, Yufeng Du, Jinchen He, Yifan Su, Jiabin Yu, Yikun Jiang, Yujie Zhang, Chang Liu, Ze-Min Huang, Weizhen Jia, Xinan Chen, Peixue Wu, Yunkai Wang, Juntai Zhou, Yong Zhao, Farshid Jafarpour, Jessie Shelton, Aaron Young, John Bartolotta, Wenchao Xu,…

@heiseonline@social.heise.de
2025-08-29 12:17:00

heise | heise Update vom 29. August 2025: Lesetipps fürs Wochenende
Der wöchentliche Newsletter von heise mit Pixel 10 im Test, KI findet Schwachstellen, Quälgeistern im Internet, Antriebswende und mobilen Solarpaneelen.

@arXiv_csSE_bot@mastoxiv.page
2025-09-03 10:26:13

A Privacy-Preserving Recommender for Filling Web Forms Using a Local Large Language Model
Amirreza Nayyeri, Abbas Rasoolzadegan
arxiv.org/abs/2509.01527

@arXiv_csSE_bot@mastoxiv.page
2025-09-01 08:21:03

Learning to Generate Unit Test via Adversarial Reinforcement Learning
Dongjun Lee, Changho Hwang, Kimin Lee
arxiv.org/abs/2508.21107 arxiv.…

@arXiv_csSE_bot@mastoxiv.page
2025-08-29 09:07:21

Automated Test Oracles for Flaky Cyber-Physical System Simulators: Approach and Evaluation
Baharin A. Jodat, Khouloud Gaaloul, Mehrdad Sabetzadeh, Shiva Nejati
arxiv.org/abs/2508.20902

@arXiv_csSE_bot@mastoxiv.page
2025-10-01 10:26:07

Automatically Generating Web Applications from Requirements Via Multi-Agent Test-Driven Development
Yuxuan Wan, Tingshuo Liang, Jiakai Xu, Jingyu Xiao, Yintong Huo, Michael R. Lyu
arxiv.org/abs/2509.25297

@arXiv_csSE_bot@mastoxiv.page
2025-09-30 11:47:41

DiffTester: Accelerating Unit Test Generation for Diffusion LLMs via Repetitive Pattern
Lekang Yang, Yuetong Liu, Yitong Zhang, Jia Li
arxiv.org/abs/2509.24975

@arXiv_csSE_bot@mastoxiv.page
2025-09-30 11:14:41

Unit Test Update through LLM-Driven Context Collection and Error-Type-Aware Refinement
Yuanhe Zhang, Zhiquan Yang, Shengyi Pan, Zhongxin Liu
arxiv.org/abs/2509.24419

@arXiv_csSE_bot@mastoxiv.page
2025-09-30 09:53:31

Navigating the Labyrinth: Path-Sensitive Unit Test Generation with Large Language Models
Dianshu Liao, Xin Yin, Shidong Pan, Chao Ni, Zhenchang Xing, Xiaoyu Sun
arxiv.org/abs/2509.23812