Tootfinder

Opt-in global Mastodon full text search. Join the index!

@samvarma@fosstodon.org
2025-06-04 15:32:47

This author is invaluable to me because they always have a fresh take that I haven't seen anywhere else. Was a fave follow on the bad place.
In this case, re #LLMs
#AI #LLM

@arXiv_csAI_bot@mastoxiv.page
2025-06-03 18:04:31

This arxiv.org/abs/2505.07453 has been replaced.
initial toot: mastoxiv.page/@arXiv_csAI_…

@arXiv_csCL_bot@mastoxiv.page
2025-07-04 09:52:11

MOTIF: Modular Thinking via Reinforcement Fine-tuning in LLMs
Purbesh Mitra, Sennur Ulukus
arxiv.org/abs/2507.02851 a…

@groupnebula563@mastodon.social
2025-07-05 01:32:59

#AI #honeypots huh

@arXiv_csCR_bot@mastoxiv.page
2025-07-04 09:57:01

Early Signs of Steganographic Capabilities in Frontier LLMs
Artur Zolkowski, Kei Nishimura-Gasparian, Robert McCarthy, Roland S. Zimmermann, David Lindner
arxiv.org/abs/2507.02737

@arXiv_csIR_bot@mastoxiv.page
2025-06-05 09:41:33

This arxiv.org/abs/2505.20730 has been replaced.
initial toot: mastoxiv.page/@arXiv_csIR_…

@arXiv_csSE_bot@mastoxiv.page
2025-06-05 07:24:15

CETBench: A Novel Dataset constructed via Transformations over Programs for Benchmarking LLMs for Code-Equivalence Checking
Neeva Oza, Ishaan Govil, Parul Gupta, Dinesh Khandelwal, Dinesh Garg, Parag Singla
arxiv.org/abs/2506.04019

@macandi@social.heise.de
2025-06-04 09:29:00

Sky: Controlling the Mac desktop with AI, from the makers of Shortcuts
Apple Intelligence so far offers no features for controlling the Mac. This is where the developers of Sky come in: their application uses LLMs for automation.

@lysander07@sigmoid.social
2025-06-03 12:35:05

LLMs are starving for knowledge graphs. Raphael Troncy was pointing out that many LLM company crawlers are constantly visiting their KGs. Some crawlers even perform explicit SPARQL queries on the KGs.
#knowledgegraphs #eswc2025

The image shows a presentation slide titled "LLMs are starving for KGs" (Large Language Models are starving for Knowledge Graphs). The slide is projected onto a screen and features a list of crawlers visiting various Knowledge Graphs (KGs), including OpenAI, ByteDance, Apple, Meta AI, Anthropic, Microsoft, DuckDuckGo, CommonCrawl, Amazon, and Perplexity. Each crawler is associated with a specific KG, and the number of requests made to each KG is listed. For example, OpenAI has made 3,430,585 re…

@arXiv_csLG_bot@mastoxiv.page
2025-06-05 11:01:27

This arxiv.org/abs/2506.02965 has been replaced.
initial toot: mastoxiv.page/@arXiv_csLG_…

@arXiv_csAR_bot@mastoxiv.page
2025-06-04 07:17:33

CLONE: Customizing LLMs for Efficient Latency-Aware Inference at the Edge
Chunlin Tian, Xinpeng Qin, Kahou Tam, Li Li, Zijian Wang, Yuanzhe Zhao, Minglei Zhang, Chengzhong Xu
arxiv.org/abs/2506.02847

@arXiv_csDB_bot@mastoxiv.page
2025-06-04 13:32:44

This arxiv.org/abs/2501.04901 has been replaced.
initial toot: mastoxiv.page/@arXiv_csDB_…

@arXiv_csHC_bot@mastoxiv.page
2025-06-03 16:34:01

This arxiv.org/abs/2503.16456 has been replaced.
initial toot: mastoxiv.page/@arXiv_csHC_…

@arXiv_csDC_bot@mastoxiv.page
2025-06-04 07:27:34

NestedFP: High-Performance, Memory-Efficient Dual-Precision Floating Point Support for LLMs
Haeun Lee, Omin Kwon, Yeonhong Park, Jae W. Lee
arxiv.org/abs/2506.02024

@JGraber@mastodon.social
2025-05-02 17:54:44

#Python Friday #277: Access Local #LLMs Through LM Studio
pythonfriday.dev/2025/05/277-a

@gla@mastodon.social
2025-07-03 04:45:08

I’ve written about automating away some boring part of parenthood with LLMs and AppleScript
#apple

@ubuntourist@mastodon.social
2025-05-31 23:52:16

'Failure Imminent': When LLMs In a Long-Running Vending Business Simulation Went Berserk
slashdot.org/story/25/05/31/21

@tante@tldr.nettime.org
2025-06-03 14:38:54

This is such a perfect analogy.
My goto is "asbestos". Super useful invention which bit us in the ass afterwards.
xoxo.zone/@annika/114614639082

@castarco@hachyderm.io
2025-05-04 22:28:42

Does anyone else have the impression that virtually all LLMs use a sort of "hyper-allistic" language?
As if we had a spectrum for allism disorder and LLMs were an extreme case of it.

@arXiv_csCV_bot@mastoxiv.page
2025-07-04 10:24:31

Bootstrapping Grounded Chain-of-Thought in Multimodal LLMs for Data-Efficient Model Adaptation
Jiaer Xia, Bingkui Tong, Yuhang Zang, Rui Shao, Kaiyang Zhou
arxiv.org/abs/2507.02859

@compfu@mograph.social
2025-07-04 19:43:51

Am I the only one who foresees the future #AI business model as enshittified ad-infused LLMs? Once LLMs are ingrained in every class and board room, you‘ll suddenly have to pay big bucks while the free plans will be riddled with ads. It‘ll be like that #BlackMirror episode where the teacher spews comme…

@arXiv_csIT_bot@mastoxiv.page
2025-07-04 08:24:21

On the Convergence of Large Language Model Optimizer for Black-Box Network Management
Hoon Lee, Wentao Zhou, Merouane Debbah, Inkyu Lee
arxiv.org/abs/2507.02689

@arXiv_csCL_bot@mastoxiv.page
2025-07-04 09:36:41

Self-Correction Bench: Revealing and Addressing the Self-Correction Blind Spot in LLMs
Ken Tsui
arxiv.org/abs/2507.02778

@arXiv_csAI_bot@mastoxiv.page
2025-06-03 17:48:46

This arxiv.org/abs/2501.07071 has been replaced.
initial toot: mastoxiv.page/@arXiv_csAI_…

@poppastring@dotnet.social
2025-05-31 19:35:26

A post from the archive 📫:
If LLMs Can Code, Why Are We Building More IDEs?
poppastring.com/blog/if-llms-c

@arXiv_statME_bot@mastoxiv.page
2025-06-04 14:00:58

This arxiv.org/abs/2505.19145 has been replaced.
initial toot: mastoxiv.page/@arXiv_sta…

@Techmeme@techhub.social
2025-06-05 06:01:39

A former employee says fewer than 10,000 people use Ola Krutrim's LLM chatbot, which supports 10 Indian languages, and that over 60% of them are random testers (Swathi Moorthy/The Economic Times)

@arXiv_csPL_bot@mastoxiv.page
2025-06-05 09:41:06

This arxiv.org/abs/2405.08965 has been replaced.
initial toot: mastoxiv.page/@arXiv_csPL_…

@arXiv_physicssocph_bot@mastoxiv.page
2025-06-03 16:49:23

This arxiv.org/abs/2407.04503 has been replaced.
initial toot: mastoxiv.page/@arX…

@arXiv_csRO_bot@mastoxiv.page
2025-06-05 10:00:55

This arxiv.org/abs/2505.20573 has been replaced.
initial toot: mastoxiv.page/@arXiv_csRO_…

@samir@functional.computer
2025-06-03 20:48:17

If LLMs were so good at writing code, they wouldn’t need a new thought leader yelling about them every day.
They might be. At this point, I do not care. Lots of people (including, most recently, Ptacek, Yegge, etc.) are trying to sell me something and I have no interest in listening.
If your thing is good, show, don’t tell.
But it’s not, is it?
These articles… you’re not trying to convince me, you’re trying to convince yourselves.
So please: keep them to yoursel…

@arXiv_csCY_bot@mastoxiv.page
2025-06-04 13:34:40

This arxiv.org/abs/2506.00095 has been replaced.
initial toot: mastoxiv.page/@arXiv_csCY_…

@arXiv_csIR_bot@mastoxiv.page
2025-07-04 08:10:01

When LLMs Disagree: Diagnosing Relevance Filtering Bias and Retrieval Divergence in SDG Search
William A. Ingram, Bipasha Banerjee, Edward A. Fox
arxiv.org/abs/2507.02139

@arXiv_csLG_bot@mastoxiv.page
2025-06-05 10:59:18

This arxiv.org/abs/2505.24298 has been replaced.
initial toot: mastoxiv.page/@arXiv_csLG_…

@arXiv_csCR_bot@mastoxiv.page
2025-07-04 09:12:41

PII Jailbreaking in LLMs via Activation Steering Reveals Personal Information Leakage
Krishna Kanth Nakka, Xue Jiang, Xuebing Zhou
arxiv.org/abs/2507.02332

@arXiv_csSE_bot@mastoxiv.page
2025-06-05 07:23:54

VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation
Yuansheng Ni, Ping Nie, Kai Zou, Xiang Yue, Wenhu Chen
arxiv.org/abs/2506.03930

@tante@tldr.nettime.org
2025-07-03 09:48:09

"LLMs are okay at coding, but at scale they build jumbled messes. I’ve scaled back my use of AI when coding and gone back to using my brain and pen and paper."
albertofortin.com/writing/codi

@arXiv_csAI_bot@mastoxiv.page
2025-07-04 07:31:41

Reasoning on a Budget: A Survey of Adaptive and Controllable Test-Time Compute in LLMs
Mohammad Ali Alomrani, Yingxue Zhang, Derek Li, Qianyi Sun, Soumyasundar Pal, Zhanguang Zhang, Yaochen Hu, Rohan Deepak Ajwani, Antonios Valkanas, Raika Karimi, Peng Cheng, Yunzhou Wang, Pengyi Liao, Hanrui Huang, Bin Wang, Jianye Hao, Mark Coates

@arXiv_csCL_bot@mastoxiv.page
2025-07-04 09:31:41

Can LLMs Identify Critical Limitations within Scientific Research? A Systematic Evaluation on AI Research Papers
Zhijian Xu, Yilun Zhao, Manasi Patwardhan, Lovekesh Vig, Arman Cohan
arxiv.org/abs/2507.02694

@arXiv_csSE_bot@mastoxiv.page
2025-06-05 07:23:42

Boosting Open-Source LLMs for Program Repair via Reasoning Transfer and LLM-Guided Reinforcement Learning
Xunzhu Tang, Jacques Klein, Tegawendé F. Bissyandé
arxiv.org/abs/2506.03921

@arXiv_csCR_bot@mastoxiv.page
2025-07-04 07:43:21

MGC: A Compiler Framework Exploiting Compositional Blindness in Aligned LLMs for Malware Generation
Lu Yan, Zhuo Zhang, Xiangzhe Xu, Shengwei An, Guangyu Shen, Zhou Xuan, Xuan Chen, Xiangyu Zhang
arxiv.org/abs/2507.02057

@arXiv_csAR_bot@mastoxiv.page
2025-06-03 07:17:05

ReTern: Exploiting Natural Redundancy and Sign Transformations for Enhanced Fault Tolerance in Compute-in-Memory based Ternary LLMs
Akul Malhotra, Sumeet Kumar Gupta
arxiv.org/abs/2506.01140

@arXiv_csHC_bot@mastoxiv.page
2025-06-05 07:18:43

Sampling Preferences Yields Simple Trustworthiness Scores
Sean Steinle
arxiv.org/abs/2506.03399 arxiv.org/pdf/2506.03…

@arXiv_csRO_bot@mastoxiv.page
2025-06-04 14:08:57

This arxiv.org/abs/2506.01538 has been replaced.
initial toot: mastoxiv.page/@arXiv_csRO_…

@arXiv_csCY_bot@mastoxiv.page
2025-06-05 09:38:07

This arxiv.org/abs/2506.00095 has been replaced.
initial toot: mastoxiv.page/@arXiv_csCY_…

@arXiv_csAI_bot@mastoxiv.page
2025-07-04 09:11:41

Data Diversification Methods In Alignment Enhance Math Performance In LLMs
Berkan Dokmeci, Qingyang Wu, Ben Athiwaratkun, Ce Zhang, Shuaiwen Leon Song, James Zou
arxiv.org/abs/2507.02173

@arXiv_csLG_bot@mastoxiv.page
2025-06-03 21:51:54

This arxiv.org/abs/2505.19433 has been replaced.
initial toot: mastoxiv.page/@arXiv_csLG_…

@arXiv_csIR_bot@mastoxiv.page
2025-06-05 07:19:10

GORACS: Group-level Optimal Transport-guided Coreset Selection for LLM-based Recommender Systems
Tiehua Mei, Hengrui Chen, Peng Yu, Jiaqing Liang, Deqing Yang
arxiv.org/abs/2506.04015

@arXiv_csSE_bot@mastoxiv.page
2025-06-05 07:21:33

Fault Localisation and Repair for DL Systems: An Empirical Study with LLMs
Jinhan Kim, Nargiz Humbatova, Gunel Jahangirova, Shin Yoo, Paolo Tonella
arxiv.org/abs/2506.03396

@arXiv_csCL_bot@mastoxiv.page
2025-07-03 10:17:10

The Thin Line Between Comprehension and Persuasion in LLMs
Adrian de Wynter, Tangming Yuan
arxiv.org/abs/2507.01936 a…

@arXiv_csCR_bot@mastoxiv.page
2025-06-04 07:26:46

BitBypass: A New Direction in Jailbreaking Aligned Large Language Models with Bitstream Camouflage
Kalyan Nakka, Nitesh Saxena
arxiv.org/abs/2506.02479

@arXiv_csAI_bot@mastoxiv.page
2025-06-03 16:10:09

This arxiv.org/abs/2406.13945 has been replaced.
initial toot: mastoxiv.page/@arXiv_csAI_…

@arXiv_csHC_bot@mastoxiv.page
2025-06-03 16:54:22

This arxiv.org/abs/2503.18792 has been replaced.
initial toot: mastoxiv.page/@arXiv_csHC_…

@arXiv_csCY_bot@mastoxiv.page
2025-06-03 07:26:13

The World As Large Language Models See It: Exploring the reliability of LLMs in representing geographical features
Omid Reza Abbasi, Franz Welscher, Georg Weinberger, Johannes Scholz
arxiv.org/abs/2506.00203

@arXiv_csLG_bot@mastoxiv.page
2025-06-05 11:00:19

This arxiv.org/abs/2506.00486 has been replaced.
initial toot: mastoxiv.page/@arXiv_csLG_…

@arXiv_csIR_bot@mastoxiv.page
2025-06-05 07:18:47

ProRank: Prompt Warmup via Reinforcement Learning for Small Language Models Reranking
Xianming Li, Aamir Shakir, Rui Huang, Julius Lipp, Jing Li
arxiv.org/abs/2506.03487

@arXiv_csSE_bot@mastoxiv.page
2025-06-03 16:16:02

This arxiv.org/abs/2401.16310 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_csSE_bot@mastoxiv.page
2025-06-05 09:44:15

This arxiv.org/abs/2506.02658 has been replaced.
initial toot: mastoxiv.page/@arXiv_csSE_…

@arXiv_csCY_bot@mastoxiv.page
2025-06-03 07:20:02

Comparative analysis of privacy-preserving open-source LLMs regarding extraction of diagnostic information from clinical CMR imaging reports
Sina Amirrajab, Volker Vehof, Michael Bietenbeck, Ali Yilmaz
arxiv.org/abs/2506.00060

@arXiv_csHC_bot@mastoxiv.page
2025-07-04 09:10:51

Misaligned from Within: Large Language Models Reproduce Our Double-Loop Learning Blindness
Tim Rogers, Ben Teehankee
arxiv.org/abs/2507.02283

@arXiv_csLG_bot@mastoxiv.page
2025-06-03 21:37:23

This arxiv.org/abs/2505.03793 has been replaced.
initial toot: mastoxiv.page/@arXiv_csLG_…

@arXiv_csCL_bot@mastoxiv.page
2025-06-03 08:19:57

Benford's Curse: Tracing Digit Bias to Numerical Hallucination in LLMs
Jiandong Shao, Yao Lu, Jianfei Yang
arxiv.org/abs/2506.01734

@arXiv_csCR_bot@mastoxiv.page
2025-06-04 13:33:00

This arxiv.org/abs/2404.16873 has been replaced.
initial toot: mastoxiv.page/@arXiv_csCR_…

@arXiv_csAI_bot@mastoxiv.page
2025-06-03 17:41:46

This arxiv.org/abs/2412.13147 has been replaced.
initial toot: mastoxiv.page/@arXiv_csAI_…

@arXiv_csSE_bot@mastoxiv.page
2025-06-04 07:31:51

Computational Thinking Reasoning in Large Language Models
Kechi Zhang, Ge Li, Jia Li, Huangzhao Zhang, Jingjing Xu, Hao Zhu, Lecheng Wang, Jia Li, Yihong Dong, Jing Mai, Bin Gu, Zhi Jin
arxiv.org/abs/2506.02658

@arXiv_csCY_bot@mastoxiv.page
2025-06-03 07:20:41

Evaluating Prompt Engineering Techniques for Accuracy and Confidence Elicitation in Medical LLMs
Nariman Naderi, Zahra Atf, Peter R Lewis, Aref Mahjoub far, Seyed Amir Ahmad Safavi-Naini, Ali Soroush
arxiv.org/abs/2506.00072

@arXiv_csCL_bot@mastoxiv.page
2025-07-04 09:42:51

Multimodal Mathematical Reasoning with Diverse Solving Perspective
Wenhao Shi, Zhiqiang Hu, Yi Bin, Yang Yang, See-Kiong Ng, Heng Tao Shen
arxiv.org/abs/2507.02804

@arXiv_csAI_bot@mastoxiv.page
2025-06-03 17:38:17

This arxiv.org/abs/2412.11934 has been replaced.
initial toot: mastoxiv.page/@arXiv_csAI_…

@arXiv_csCR_bot@mastoxiv.page
2025-06-03 17:30:50

This arxiv.org/abs/2501.18626 has been replaced.
initial toot: mastoxiv.page/@arXiv_csCR_…

@arXiv_csAI_bot@mastoxiv.page
2025-06-03 07:18:25

Evaluation of LLMs for mathematical problem solving
Ruonan Wang, Runxi Wang, Yunwen Shen, Chengfeng Wu, Qinglin Zhou, Rohitash Chandra
arxiv.org/abs/2506.00309

@arXiv_csCR_bot@mastoxiv.page
2025-07-03 09:06:10

SafePTR: Token-Level Jailbreak Defense in Multimodal LLMs via Prune-then-Restore Mechanism
Beitao Chen, Xinyu Lyu, Lianli Gao, Jingkuan Song, Heng Tao Shen
arxiv.org/abs/2507.01513

@arXiv_csSE_bot@mastoxiv.page
2025-07-04 09:30:01

LLMREI: Automating Requirements Elicitation Interviews with LLMs
Alexander Korn, Samuel Gorsch, Andreas Vogelsang
arxiv.org/abs/2507.02564

@arXiv_csAI_bot@mastoxiv.page
2025-06-03 07:27:17

Jailbreak-R1: Exploring the Jailbreak Capabilities of LLMs via Reinforcement Learning
Weiyang Guo, Zesheng Shi, Zhuo Li, Yequan Wang, Xuebo Liu, Wenya Wang, Fangming Liu, Min Zhang, Jing Li
arxiv.org/abs/2506.00782

@arXiv_csCL_bot@mastoxiv.page
2025-07-03 10:02:40

LLMs for Legal Subsumption in German Employment Contracts
Oliver Wardas, Florian Matthes
arxiv.org/abs/2507.01734 arx…

@arXiv_csCR_bot@mastoxiv.page
2025-07-04 09:54:51

Control at Stake: Evaluating the Security Landscape of LLM-Driven Email Agents
Jiangrong Wu, Yuhong Nan, Jianliang Wu, Zitong Yao, Zibin Zheng
arxiv.org/abs/2507.02699

@arXiv_csAI_bot@mastoxiv.page
2025-06-03 07:23:19

DrKGC: Dynamic Subgraph Retrieval-Augmented LLMs for Knowledge Graph Completion across General and Biomedical Domains
Yongkang Xiao, Sinian Zhang, Yi Dai, Huixue Zhou, Jue Hou, Jie Ding, Rui Zhang
arxiv.org/abs/2506.00708

@arXiv_csCR_bot@mastoxiv.page
2025-06-04 13:34:26

This arxiv.org/abs/2412.15289 has been replaced.
initial toot: mastoxiv.page/@arXiv_csCR_…

@arXiv_csSE_bot@mastoxiv.page
2025-06-04 13:38:28

This arxiv.org/abs/2501.07849 has been replaced.
initial toot: mastoxiv.page/@arXiv_csSE_…

@arXiv_csAI_bot@mastoxiv.page
2025-06-03 18:11:17

This arxiv.org/abs/2505.19165 has been replaced.
initial toot: mastoxiv.page/@arXiv_csAI_…

@arXiv_csCR_bot@mastoxiv.page
2025-06-03 17:52:02

This arxiv.org/abs/2505.18889 has been replaced.
initial toot: mastoxiv.page/@arXiv_csCR_…

@arXiv_csSE_bot@mastoxiv.page
2025-06-04 07:36:27

Reuse or Generate? Accelerating Code Editing via Edit-Oriented Speculative Decoding
Peiding Wang, Li Zhang, Fang Liu, Yinghao Zhu, Wang Xu, Lin Shi, Xiaoli Lian, Minxiao Li, Bo Shen, An Fu
arxiv.org/abs/2506.02780

@arXiv_csAI_bot@mastoxiv.page
2025-06-03 16:19:26

This arxiv.org/abs/2406.13948 has been replaced.
initial toot: mastoxiv.page/@arXiv_csAI_…

@arXiv_csCR_bot@mastoxiv.page
2025-06-03 17:36:14

This arxiv.org/abs/2502.11191 has been replaced.
initial toot: mastoxiv.page/@arXiv_csCR_…

@arXiv_csSE_bot@mastoxiv.page
2025-06-03 17:24:06

This arxiv.org/abs/2504.11711 has been replaced.
initial toot: mastoxiv.page/@arXiv_csSE_…

@arXiv_csAI_bot@mastoxiv.page
2025-06-03 07:26:17

CoP: Agentic Red-teaming for Large Language Models using Composition of Principles
Chen Xiong, Pin-Yu Chen, Tsung-Yi Ho
arxiv.org/abs/2506.00781

@arXiv_csCR_bot@mastoxiv.page
2025-06-04 07:33:29

ATAG: AI-Agent Application Threat Assessment with Attack Graphs
Parth Atulbhai Gandhi, Akansha Shukla, David Tayouri, Beni Ifland, Yuval Elovici, Rami Puzis, Asaf Shabtai
arxiv.org/abs/2506.02859

@arXiv_csSE_bot@mastoxiv.page
2025-06-03 17:33:48

This arxiv.org/abs/2505.23387 has been replaced.
initial toot: mastoxiv.page/@arXiv_csSE_…

@arXiv_csAI_bot@mastoxiv.page
2025-06-05 09:45:09

This arxiv.org/abs/2506.02139 has been replaced.
link: scholar.google.com/scholar?q=a

@arXiv_csAI_bot@mastoxiv.page
2025-06-03 18:05:22

This arxiv.org/abs/2505.08459 has been replaced.
initial toot: mastoxiv.page/@arXiv_csAI_…

@arXiv_csSE_bot@mastoxiv.page
2025-06-04 13:37:05

This arxiv.org/abs/2409.14644 has been replaced.
initial toot: mastoxiv.page/@arXiv_csSE_…

@arXiv_csCR_bot@mastoxiv.page
2025-07-04 09:24:21

Evaluating Language Models For Threat Detection in IoT Security Logs
Jorge J. Tejero-Fernández, Alfonso Sánchez-Macián
arxiv.org/abs/2507.02390

@arXiv_csSE_bot@mastoxiv.page
2025-06-05 09:43:03

This arxiv.org/abs/2503.20197 has been replaced.
initial toot: mastoxiv.page/@arXiv_csSE_…

@arXiv_csAI_bot@mastoxiv.page
2025-06-03 18:09:04

This arxiv.org/abs/2505.16978 has been replaced.
initial toot: mastoxiv.page/@arXiv_csAI_…

@arXiv_csCR_bot@mastoxiv.page
2025-06-03 16:55:16

This arxiv.org/abs/2408.16028 has been replaced.
initial toot: mastoxiv.page/@arXiv_csCR_…

@arXiv_csAI_bot@mastoxiv.page
2025-07-04 07:43:41

Measuring Scientific Capabilities of Language Models with a Systems Biology Dry Lab
Haonan Duan, Stephen Zhewen Lu, Caitlin Fiona Harrigan, Nishkrit Desai, Jiarui Lu, Michał Koziarski, Leonardo Cotta, Chris J. Maddison
arxiv.org/abs/2507.02083

@arXiv_csSE_bot@mastoxiv.page
2025-06-04 07:26:19

Flow2Code: Evaluating Large Language Models for Flowchart-based Code Generation Capability
Mengliang He, Jiayi Zeng, Yankai Jiang, Wei Zhang, Zeming Liu, Xiaoming Shi, Aimin Zhou
arxiv.org/abs/2506.02073

@arXiv_csSE_bot@mastoxiv.page
2025-06-05 07:21:30

Empirical Evaluation of Generalizable Automated Program Repair with Large Language Models
Viola Campos, Ridwan Shariffdeen, Adrian Ulges, Yannic Noller
arxiv.org/abs/2506.03283

@arXiv_csSE_bot@mastoxiv.page
2025-07-04 09:28:21

Meta-Fair: AI-Assisted Fairness Testing of Large Language Models
Miguel Romero-Arjona, José A. Parejo, Juan C. Alonso, Ana B. Sánchez, Aitor Arrieta, Sergio Segura
arxiv.org/abs/2507.02533

@arXiv_csSE_bot@mastoxiv.page
2025-06-05 07:22:50

From Theory to Practice: Real-World Use Cases on Trustworthy LLM-Driven Process Modeling, Prediction and Automation
Peter Pfeiffer, Alexander Rombach, Maxim Majlatow, Nijat Mehdiyev
arxiv.org/abs/2506.03801