2024-03-07 06:48:05
Security Testing of RESTful APIs With Test Case Mutation
Sebastien Salva, Jarod Sue
https://arxiv.org/abs/2403.03701 https://arxiv.or…
Automated Control Logic Test Case Generation using Large Language Models
Heiko Koziolek, Virendra Ashiwal, Soumyadip Bandyopadhyay, Chandrika K R
https://arxiv.org/abs/2405.01874 …
“Violates.” The word you’re looking for is “violates.”
https://thehill.com/regulation/court-battles/4574362-trump-tests-gag-order-by-sharing-articles-on-judges-family/
Cowboys owner Jerry Jones ordered to take paternity test in ongoing 2022 case https://cowboyswire.usatoday.com/2024/02/29/jerry-jones-paternity-test/
A Dual Geometric Test for Forward-Flatness
Bernd Kolar, Johannes Schrotshamer, Markus Schöberl
https://arxiv.org/abs/2404.02816 https://
This https://arxiv.org/abs/2112.13190 has been replaced.
link: https://scholar.google.com/scholar?q=a
mask effectiveness stats, Canada
"... the irregular introduction of community mask mandates by 34 regional health authorities created a quasi-experiment useful for evaluating mask effects ... While mask mandate effects were variably significant with unadjusted case counts, they were large & robust when test-adjusted counts were used. We estimate that community mask mandates saved 1000s of lives, & 100s of millions of dollars in care costs, in Ontario in 2020."
#masks
Judge rules Jerry Jones must submit to paternity test amid lawsuit https://www.yardbarker.com/nfl/articles/judge_rules_jerry_jones_must_submit_to_paternity_test_amid_lawsuit/s1_17236_40035440
Automated User Story Generation with Test Case Specification Using Large Language Model
Tajmilur Rahman, Yuecai Zhu
https://arxiv.org/abs/2404.01558 https:…
Examining the robustness of LLM evaluation to the distributional assumptions of benchmarks
Melissa Ailem, Katerina Marazopoulou, Charlotte Siska, James Bono
https://arxiv.org/abs/2404.16966
This https://arxiv.org/abs/2401.11632 has been replaced.
link: https://scholar.google.com/scholar?q=a
Understanding random forests and overfitting: a visualization and simulation study
Lasai Barreñada, Paula Dhiman, Dirk Timmerman, Anne-Laure Boulesteix, Ben Van Calster
https://arxiv.org/abs/2402.18612
I made a little 3d-printable case for my “qtpy_synth” test board to help me play with CircuitPython synthio (and Arduino Mozzi). It works pretty well, I like it better than the standoffs I originally started with. And now can have some cool matching 3d-printed knobs! https://www.
Thanking David for the quick fix of a html-validate issue with acceptable values for the textarea tag’s autocomplete attribute (https://gitlab.com/html-validate/html-validate/-/issues/249).
If you’re not using html-validate, you should. Kitten* has it integr…
This https://arxiv.org/abs/2403.18442 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCV_…
I keep meeting students who feel they *must* test their variables for normality before analysis.
I tell them there's no need, & if the test tells them it's normal it's only because N is too small.
I decided to run some simulations to check, though, whether e.g., t-tests degraded more for non-normal data & small Ns.
I was a bit surprised by the result: CI coverage degrades for normal vars just as badly as other symmetric dists, but skewed distributions…
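A rough sketch of the kind of simulation described above, for anyone who wants to try it. This is not the poster's code: the sample size (n = 10), the three distributions, the repetition count, and the function name `ci_coverage` are all assumptions picked for illustration. It draws many small samples, builds the usual 95% t-interval for the mean, and counts how often the interval contains the true mean.

```python
import math
import random

# Two-sided 95% critical value of Student's t with 9 degrees of freedom.
# Hardcoded (assumption: n = 10) so the sketch needs only the standard library.
T_CRIT_DF9 = 2.262


def ci_coverage(draw, true_mean, n=10, reps=20000, seed=0):
    """Fraction of nominal-95% t-intervals (sample size n) that contain true_mean."""
    if n != 10:
        raise ValueError("T_CRIT_DF9 is only valid for n = 10")
    rng = random.Random(seed)
    hits = 0
    for _ in range(reps):
        xs = [draw(rng) for _ in range(n)]
        mean = sum(xs) / n
        # Sample standard deviation with the n - 1 denominator.
        sd = math.sqrt(sum((x - mean) ** 2 for x in xs) / (n - 1))
        half_width = T_CRIT_DF9 * sd / math.sqrt(n)
        if mean - half_width <= true_mean <= mean + half_width:
            hits += 1
    return hits / reps


# Illustrative choices: one normal, one symmetric non-normal, one skewed.
coverage = {
    "normal": ci_coverage(lambda r: r.gauss(0.0, 1.0), 0.0),
    "uniform (symmetric)": ci_coverage(lambda r: r.uniform(-1.0, 1.0), 0.0),
    # The mean of a lognormal(0, 1) variable is exp(0.5).
    "lognormal (skewed)": ci_coverage(
        lambda r: r.lognormvariate(0.0, 1.0), math.exp(0.5)
    ),
}

for name, cov in coverage.items():
    print(f"{name:22s} {cov:.3f}")
```

Each printed number is the empirical coverage of a nominal 95% interval, so values well below 0.95 mean the interval is too narrow or off-centre for that distribution. Swapping in other distributions or sample sizes requires changing the critical value to match the degrees of freedom.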
Test function approach to fully nonlinear equations in thin domains
Isabeau Birindelli, Ariela Briani, Hitoshi Ishii
https://arxiv.org/abs/2404.19577 https://arxiv.org/pdf/2404.19577
arXiv:2404.19577v1 Announce Type: new
Abstract: In this note we extend to fully nonlinear operators the well known result on thin domains of Hale and Raugel. The result is more general even in the case of the Laplacian.
SCOTUS sets "authority" test for judging public officials' actions on social media, by @… https://www.
Congressional Age Limit Proposed in #NorthDakota in Potential Test Case for Nation
https://www.usnews.co…
Winds of change: the nuclear and galaxy-scale outflows and the X-ray variability of 2MASS 0918+2117
P. Baldini, G. Lanzuisi, M. Brusa, A. Merloni, K. Gkimisi, M. Perna, I. E. Lopez, E. Bertola, Z. Igo, S. Waddell, B. Musiimenta, C. Aydar, R. Arcodia, G. A. Matzeu, A. Luminari, J. Buchner, C. Vignali, M. Dadina, A. Comastri, G. Cresci, S. Marchesi, R. Gilli, F. Tombesi, R. Serafinelli
NakedCapitalism.com is always worth a look, and it looks like particularly now: https://mastodon.social/@EverMama8_/112145647138090653
Large Language Models as Test Case Generators: Performance Evaluation and Enhancement
Kefan Li, Yuan Yuan
https://arxiv.org/abs/2404.13340 https://<…
Gravitational Repulsion in an Expanding Ball of Dust
Diogo P. L. Bragança
https://arxiv.org/abs/2402.18022 https://arxiv.org/pdf/…
Symfony: {{ dump() }} creates an iframe - why is that important? In my case, because I test for cy.iframe('iframe:nth-child(1)') in my cypress test, because the damn iframe changes names all the time and it's not in my domain to change.
https://winkelwagen.de/2024/03/14/symf…
there is a reason, in the age of climate crisis and massive inequality, that the go-to test case of an AI system is 'Planning a Holiday to a Foreign Country' - FFS
Testing for Asymmetric Information in Insurance with Deep Learning
Serguei Maliar, Bernard Salanie
https://arxiv.org/abs/2404.18207 https://
QuickerCheck: Implementing and Evaluating a Parallel Run-Time for QuickCheck
Robert Krook, Nicholas Smallbone, Bo Joel Svensson, Koen Claessen
https://arxiv.org/abs/2404.16062
Optical ray tracing of echelle spectrographs applied to the wavelength solution for precise radial velocities
Marcelo Tala Pinto, Adrian Kaminski, Andreas Quirrenbach, Mathias Zechmeister
https://arxiv.org/abs/2404.19691
My #Fairphone3 arrives on Wednesday. Looking forward to running #UbuntuTouch on it as a test case for viability of ethical phone for kids that helps dodge the worst of mobile apps, behaviours, tracking etc
This https://arxiv.org/abs/2311.06036 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
Another evening of layout on the OSHPark KU test board comes to a close.
Only 272 nets left to route and some of those are pretty close to hooked up (e.g. status/control GPIOs on the QSFP ports).
Still nothing on the main MCU, I'm saving that for last.
There's four more big 1210 caps I need to shove onto the VCCINT rail somehow, which might get tricky.
Worst case I can omit some of them and just accept that this board won't be able to max out the entire …
This https://arxiv.org/abs/2310.13485 has been replaced.
initial toot: https://mastoxiv.page/@ar…
This https://arxiv.org/abs/2403.03537 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csIT_…
ugghh trying to troubleshoot glitchy css animation on firefox. having trouble narrowing it down to a simple reproducible test case
Do Large Language Models Understand Conversational Implicature -- A case study with a chinese sitcom
Shisen Yue, Siyuan Song, Xinyuan Cheng, Hai Hu
https://arxiv.org/abs/2404.19509 https://arxiv.org/pdf/2404.19509
arXiv:2404.19509v1 Announce Type: new
Abstract: Understanding the non-literal meaning of an utterance is critical for large language models (LLMs) to become human-like social communicators. In this work, we introduce SwordsmanImp, the first Chinese multi-turn-dialogue-based dataset aimed at conversational implicature, sourced from dialogues in the Chinese sitcom My Own Swordsman. It includes 200 carefully handcrafted questions, all annotated on which Gricean maxims have been violated. We test eight closed-source and open-source LLMs under two tasks: a multiple-choice question task and an implicature explanation task. Our results show that GPT-4 attains human-level accuracy (94%) on multiple-choice questions. CausalLM demonstrates a 78.5% accuracy following GPT-4. Other models, including GPT-3.5 and several open-source models, demonstrate a lower accuracy ranging from 20% to 60% on multiple-choice questions. Human raters were asked to rate the explanation of the implicatures generated by LLMs on their reasonability, logic and fluency. While all models generate largely fluent and self-consistent text, their explanations score low on reasonability except for GPT-4, suggesting that most LLMs cannot produce satisfactory explanations of the implicatures in the conversation. Moreover, we find LLMs' performance does not vary significantly by Gricean maxims, suggesting that LLMs do not seem to process implicatures derived from different maxims differently. Our data and code are available at https://github.com/sjtu-compling/llm-pragmatics.
Uncertainty quantification in the Henry problem using the multilevel Monte Carlo method
Dmitry Logashenko, Alexander Litvinenko, Raul Tempone, Ekaterina Vasilyeva, Gabriel Wittum
https://arxiv.org/abs/2403.17018
Latest NFL news puts substantially more money into Cowboys CB Daron Bland's bank account https://www.yardbarker.com/nfl/articles/latest_nfl_news_puts_substantially_more_money_into_cowbo…
in case you were wondering, it's faster to schedule and get the results of a routine TB test from Quest Diagnostics than it is to get a DIGITAL street parking permit from #sfmta.
how much faster? no idea, still waiting on the confirmation it's in the system after having paid for it a week ago.
Did Sam Hartman's extra college year help his NFL future? Hartman thinks so https://www.espn.com/nfl/story/_/id/39913886/quarterback-sam-hartman-nfl-draft-notre-dame-wake-forest-nil-transfer-portal<…
This https://arxiv.org/abs/2306.01104 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_hept…
Today's project: getting the previous PC build I was using before the upgrade to current gen parts (other than the GPU which is last gen) encased in that Zalman T7. Gonna go in the "bedroom" so I can move this PC to the "living room". Will be a test bench of sorts, wanna explore some self hosting, media server etc options on this thing just to get myself familiar with the tools etc. Eventually thinking I wanna build a real NAS. Prob start with Deb Server, for simplici…
Assessing Delayed Treatment Benefits of Immunotherapy Using Long-Term Average Hazard: A Novel Test/Estimation Approach
Miki Horiguchi, Lu Tian, Kenneth L. Kehl, Hajime Uno
https://arxiv.org/abs/2403.10742
This https://arxiv.org/abs/2309.16120 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csSE_…
This https://arxiv.org/abs/2212.13146 has been replaced.
link: https://scholar.google.com/scholar?q=a
Cygnus OB2 as a test case for particle acceleration in young massive star clusters
Stefano Menchiari, Giovanni Morlino, Elena Amato, Niccolò Bucciantini, Maria Teresa Beltrán
https://arxiv.org/abs/2402.07784
Near-Universally-Optimal Differentially Private Minimum Spanning Trees
Richard Hladík, Jakub Tětek
https://arxiv.org/abs/2404.15035 https://
On the Second-Order Asymptotics of the Hoeffding Test and Other Divergence Tests
K. V. Harsha, Jithin Ravi, Tobias Koch
https://arxiv.org/abs/2403.03537 ht…
Confirmation of the centrality of the Huanan market among early COVID-19 cases
Florence Débarre, Michael Worobey
https://arxiv.org/abs/2403.05859 htt…
Passive detection of a random signal common to multi-sensor reference and surveillance arrays
David Ramírez, Ignacio Santamaria, Louis L. Scharf
https://arxiv.org/abs/2402.07583
A two-step approach for analyzing time to event data under non-proportional hazards
Jonas Brugger, Tim Friede, Florian Klinglmüller, Martin Posch, Robin Ristl, Franz König
https://arxiv.org/abs/2402.08336
Virtual Cylindrical PET for Efficient DOI Image Reconstruction with Sub-millimetre Resolution
Francisco E Enríquez-Mier-y-Terán, Andre Z Kyme, Georgios Angelis, Steven R Meikle
https://arxiv.org/abs/2403.16465
Mining Transactional Data To Produce Extended Association Rules Using Collaborative Apriori, Fsa-Red And M5p Predictive Algorithm As A Basis Of Business Actions
Feri Sulianta, Laksana Eka Angga, Thee Houw Liong
https://arxiv.org/abs/2403.04179
Enhancing Large Language Models for Text-to-Testcase Generation
Saranya Alagarsamy, Chakkrit Tantithamthavorn, Chetan Arora, Aldeida Aleti
https://arxiv.org/abs/2402.11910
This https://arxiv.org/abs/2103.01412 has been replaced.
link: https://scholar.google.com/scholar?q=a
This https://arxiv.org/abs/2404.08354 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCL_…
Algorithmic Collective Action in Recommender Systems: Promoting Songs by Reordering Playlists
Joachim Baumann, Celestine Mendler-Dünner
https://arxiv.org/abs/2404.04269
This https://arxiv.org/abs/2309.13211 has been replaced.
initial toot: https://mastoxiv.page/@ar…
MMT: Mutation Testing of Java Bytecode with Model Transformation -- An Illustrative Demonstration
Christoph Bockisch, Gabriele Taentzer, Daniel Neufeld
https://arxiv.org/abs/2404.14097
A New Statistic for Testing Covariance Equality in High-Dimensional Gaussian Low-Rank Models
Rémi Beisson, Pascal Vallet, Audrey Giremus, Guillaume Ginolhac
https://arxiv.org/abs/2404.07100
To impute or not to? Testing multivariate normality on incomplete dataset: Revisiting the BHEP test
Danijel Aleksić, Bojana Milošević
https://arxiv.org/abs/2404.07136
This https://arxiv.org/abs/2309.01338 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_grqc_…
This https://arxiv.org/abs/2305.07388 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
Mechanisms of Elevated Temperature Galling in Hardfacings
Samuel R. Rogers, David Stewart, Paul Taplin, David Dye
https://arxiv.org/abs/2403.07730 https://…
This https://arxiv.org/abs/2305.17496 has been replaced.
link: https://scholar.google.com/scholar?q=a
Humboldt Highway II -- computer cluster on renewable energies
Danyer Perez Adan, Luis Ignacio Estevez Banos, Tony Cass, Bjoern Felkers, Fernando Guzman, Thomas Hartmann, Beate Heinemann, Hannes Jung, Yves Kemp, Frank Lehner, Jürgen Nicklaus, David Gutierrez Menendez, Sandra Consuegra Rodriguez, Cesar Garcia Trapaga, Lidice Vaillant, Rodney Walker
This https://arxiv.org/abs/2309.02595 has been replaced.
initial toot: https://mastoxiv.page/@…
Tests for categorical data beyond Pearson: A distance covariance and energy distance approach
Fernando Castro-Prado, Wenceslao González-Manteiga, Javier Costas, Fernando Facal, Dominic Edelmann
https://arxiv.org/abs/2403.12711
LLM-Powered Test Case Generation for Detecting Tricky Bugs
Kaibo Liu, Yiyang Liu, Zhenpeng Chen, Jie M. Zhang, Yudong Han, Yun Ma, Ge Li, Gang Huang
https://arxiv.org/abs/2404.10304
On the Integration of Spectrum-Based Fault Localization Tools into IDEs
Attila Szatmári, Qusay Idrees Sarhan, Gergő Balogh, Péter Attila Soha, Árpád Beszédes
https://arxiv.org/abs/2403.11538
This https://arxiv.org/abs/2211.13525 has been replaced.
link: https://scholar.google.com/scholar?q=a
Mercury: An Efficiency Benchmark for LLM Code Synthesis
Mingzhe Du, Anh Tuan Luu, Bin Ji, See-Kiong Ng
https://arxiv.org/abs/2402.07844 https://
This https://arxiv.org/abs/2403.01971 has been replaced.
link: https://scholar.google.com/scholar?q=a