Tootfinder

Opt-in global Mastodon full text search. Join the index!

@arXiv_csCV_bot@mastoxiv.page
2025-08-15 10:25:22

Performance of GPT-5 in Brain Tumor MRI Reasoning
Mojtaba Safari, Shansong Wang, Mingzhe Hu, Zach Eidex, Qiang Li, Xiaofeng Yang
arxiv.org/abs/2508.10865

@arXiv_csSE_bot@mastoxiv.page
2025-09-15 08:53:41

WALL: A Web Application for Automated Quality Assurance using Large Language Models
Seyed Moein Abtahi, Akramul Azim
arxiv.org/abs/2509.09918

@arXiv_csHC_bot@mastoxiv.page
2025-10-14 08:48:58

ROBOPSY PL[AI]: Using Role-Play to Investigate how LLMs Present Collective Memory
Margarete Jahrmann, Thomas Brandstetter, Stefan Glasauer
arxiv.org/abs/2510.09874

@arXiv_csCL_bot@mastoxiv.page
2025-09-12 09:23:29

Automated Classification of Tutors' Dialogue Acts Using Generative AI: A Case Study Using the CIMA Corpus
Liqun He, Jiaqi Xu
arxiv.org/abs/2509.09125

@Techmeme@techhub.social
2025-08-01 18:25:51

Source: GPT-5 improvements won't be comparable to the leaps in performance of earlier models, such as between GPT-3 in 2020 and GPT-4 in 2023 (The Information)
theinformation.com/articles/in

@arXiv_csDC_bot@mastoxiv.page
2025-09-11 08:56:33

Design and Implementation of Code Completion System Based on LLM and CodeBERT Hybrid Subsystem
Bingbing Zhang, Ziyu Lin, Yingxin Su
arxiv.org/abs/2509.08215

@ErikJonker@mastodon.social
2025-08-09 18:02:14

GPT-5 may be slightly disappointing, Genie 3 demo blew me away... Watch it.
#ai

@jdrm@social.linux.pizza
2025-08-06 09:04:05

Nos reíamos de que Reagan preguntara a una vidente decisiones de política durante su presidencia. Pues en Suecia estšn con la versión 3.0 de consultar a un oršculo: theguardian.com/technology/202

@arXiv_physicsedph_bot@mastoxiv.page
2025-08-13 08:59:32

The Boiling-Frog Problem of Physics Education
Gerd Kortemeyer
arxiv.org/abs/2508.08842 arxiv.org/pdf/2508.08842

@arXiv_csAI_bot@mastoxiv.page
2025-08-11 09:30:00

Retrieval Augmented Large Language Model System for Comprehensive Drug Contraindications
Byeonghun Bang, Jongsuk Yoon, Dong-Jin Chang, Seho Park, Yong Oh Lee
arxiv.org/abs/2508.06145

@arXiv_csCL_bot@mastoxiv.page
2025-10-07 12:18:02

Resource-Efficient Fine-Tuning of LLaMA-3.2-3B for Medical Chain-of-Thought Reasoning
Imran Mansha
arxiv.org/abs/2510.05003 arxiv.org/pdf/2…

@arXiv_csPL_bot@mastoxiv.page
2025-08-07 12:56:16

Replaced article(s) found for cs.PL. arxiv.org/list/cs.PL/new
[1/1]:
- RTLCoder: Outperforming GPT-3.5 in Design RTL Generation with Our Open-Source Dataset and Lightwe...
Shang Liu, Wenji Fang, Yao Lu, Qijun Zhang, Hongce Zhang, Zhiyao Xie

@arXiv_csAI_bot@mastoxiv.page
2025-08-06 09:49:50

Can Large Language Models Bridge the Gap in Environmental Knowledge?
Linda Smail (College of Interdisciplinary Studies, Zayed University, UAE), David Santandreu Calonge (Department of Academic Development, Mohamed bin Zayed University of Artificial Intelligence, UAE), Firuz Kamalov (School of Engineering, Applied Science,Technology, Canadian University Dubai, UAE), Nur H. Orak (Department of Environmental Engineering, Marmara University, T\"urkiye)

@arXiv_csCY_bot@mastoxiv.page
2025-08-07 08:33:34

Prompt Injection Vulnerability of Consensus Generating Applications in Digital Democracy
Jairo Gudi\~no-Rosero, Cl\'ement Contet, Umberto Grandi, C\'esar A. Hidalgo
arxiv.org/abs/2508.04281

@arXiv_csAR_bot@mastoxiv.page
2025-08-26 07:31:46

GPT-OSS-20B: A Comprehensive Deployment-Centric Analysis of OpenAI's Open-Weight Mixture of Experts Model
Deepak Kumar, Divakar Yadav, Yash Patel
arxiv.org/abs/2508.16700

@UP8@mastodon.social
2025-09-29 15:25:58

🧾 Multi-Modal Vision vs. Text-Based Parsing: Benchmarking LLM Strategies for Invoice Processing
#software

@arXiv_csCR_bot@mastoxiv.page
2025-09-19 07:38:11

Early Approaches to Adversarial Fine-Tuning for Prompt Injection Defense: A 2022 Study of GPT-3 and Contemporary Models
Gustavo Sandoval, Denys Fenchenko, Junyao Chen
arxiv.org/abs/2509.14271

@arXiv_csAI_bot@mastoxiv.page
2025-10-08 10:03:59

Large Language Model-Based Uncertainty-Adjusted Label Extraction for Artificial Intelligence Model Development in Upper Extremity Radiography
Hanna Kreutzer, Anne-Sophie Caselitz, Thomas Dratsch, Daniel Pinto dos Santos, Christiane Kuhl, Daniel Truhn, Sven Nebelung
arxiv.org/abs/2510.05664

@Techmeme@techhub.social
2025-09-29 19:26:02

Anthropic prices Claude Sonnet 4.5 at $3/1M input and $15/1M output tokens, same as Sonnet 4, cheaper than Opus at $15/$75 but higher than GPT-5 at $1.25/$10 (Simon Willison/Simon Willison's Weblog)
simonwillison.net/2025/Sep/29/

@arXiv_physicsedph_bot@mastoxiv.page
2025-09-11 07:56:02

Feedback That Clicks: Introductory Physics Students' Valued Features in AI Feedback Generated From Self-Crafted and Engineered Prompts
Amogh Sirnoorkar, N. Sanjay Rebello
arxiv.org/abs/2509.08516

@arXiv_csHC_bot@mastoxiv.page
2025-08-08 08:43:02

Charts-of-Thought: Enhancing LLM Visualization Literacy Through Structured Data Extraction
Amit Kumar Das, Mohammad Tarun, Klaus Mueller
arxiv.org/abs/2508.04842

@arXiv_csIR_bot@mastoxiv.page
2025-09-16 10:02:17

Do Large Language Models Favor Recent Content? A Study on Recency Bias in LLM-Based Reranking
Hanpei Fang, Sijie Tao, Nuo Chen, Kai-Xin Chang, Tetsuya Sakai
arxiv.org/abs/2509.11353

@arXiv_csSD_bot@mastoxiv.page
2025-09-30 20:39:23

Replaced article(s) found for cs.SD. arxiv.org/list/cs.SD/new
[1/1]:
- M6(GPT)3: Generating Multitrack Modifiable Multi-Minute MIDI Music from Text using Genetic algori...
Jakub Po\'cwiardowski, Mateusz Modrzejewski, Marek S. Tatara

@arXiv_csCY_bot@mastoxiv.page
2025-08-28 08:13:51

Should LLMs be WEIRD? Exploring WEIRDness and Human Rights in Large Language Models
Ke Zhou, Marios Constantinides, Daniele Quercia
arxiv.org/abs/2508.19269

@arXiv_csHC_bot@mastoxiv.page
2025-08-01 09:22:31

Exploring LLM-generated Culture-specific Affective Human-Robot Tactile Interaction
Qiaoqiao Ren, Tony Belpaeme
arxiv.org/abs/2507.22905 arx…

@arXiv_eessSY_bot@mastoxiv.page
2025-09-23 09:00:00

Synergies between Federated Foundation Models and Smart Power Grids
Seyyedali Hosseinalipour, Shimiao Li, Adedoyin Inaolaji, Filippo Malandra, Luis Herrera, Nicholas Mastronarde
arxiv.org/abs/2509.16496

@arXiv_csSE_bot@mastoxiv.page
2025-08-21 09:32:00

Assessing the Quality and Security of AI-Generated Code: A Quantitative Analysis
Abbas Sabra, Olivier Schmitt, Joseph Tyler
arxiv.org/abs/2508.14727

@arXiv_csCL_bot@mastoxiv.page
2025-09-03 14:37:23

An Ensemble Classification Approach in A Multi-Layered Large Language Model Framework for Disease Prediction
Ali Hamdi, Malak Mohamed, Rokaia Emad, Khaled Shaban
arxiv.org/abs/2509.02446

@arXiv_csCR_bot@mastoxiv.page
2025-07-22 07:53:50

Mitigating Trojanized Prompt Chains in Educational LLM Use Cases: Experimental Findings and Detection Tool Design
Richard M. Charles, James H. Curry, Richard B. Charles
arxiv.org/abs/2507.14207

@arXiv_csCY_bot@mastoxiv.page
2025-07-29 10:11:51

The Carbon Cost of Conversation, Sustainability in the Age of Language Models
Sayed Mahbub Hasan Amiri, Prasun Goswami, Md. Mainul Islam, Mohammad Shakhawat Hossen, Sayed Majhab Hasan Amiri, Naznin Akter
arxiv.org/abs/2507.20018

@arXiv_csCL_bot@mastoxiv.page
2025-09-01 09:48:02

Personality Matters: User Traits Predict LLM Preferences in Multi-Turn Collaborative Tasks
Sarfaroz Yunusov, Kaige Chen, Kazi Nishat Anwar, Ali Emami
arxiv.org/abs/2508.21628

@arXiv_physicsmedph_bot@mastoxiv.page
2025-08-26 08:37:56

Root Cause Analysis of Radiation Oncology Incidents Using Large Language Models
Yuntao Wang, Mariluz De Ornelas, Matthew T. Studenski, Elizabeth Bossart, Siamak P. Najad-Davarani, Yunze Yang
arxiv.org/abs/2508.17201

@arXiv_csCL_bot@mastoxiv.page
2025-09-17 10:37:50

The Few-shot Dilemma: Over-prompting Large Language Models
Yongjian Tang, Doruk Tuncel, Christian Koerner, Thomas Runkler
arxiv.org/abs/2509.13196

@arXiv_csAI_bot@mastoxiv.page
2025-07-25 07:52:32

Does visualization help AI understand data?
Victoria R. Li, Johnathan Sun, Martin Wattenberg
arxiv.org/abs/2507.18022 arxiv.org/pdf/2507.18…

@arXiv_csCL_bot@mastoxiv.page
2025-08-22 12:38:52

Replaced article(s) found for cs.CL. arxiv.org/list/cs.CL/new
[3/3]:
- CRISPR-GPT for Agentic Automation of Gene-editing Experiments
Qu, Huang, Yin, Zhan, Liu, Yin, Cousins, Johnson, Wang, Shah, Altman, Zhou, Wang, Cong

@arXiv_csCY_bot@mastoxiv.page
2025-07-16 07:41:31

Can Large Language Models Understand As Well As Apply Patent Regulations to Pass a Hands-On Patent Attorney Test?
Bhakti Khera, Rezvan Alamian, Pascal A. Scherz, Stephan M. Goetz
arxiv.org/abs/2507.10576

@arXiv_csCL_bot@mastoxiv.page
2025-07-28 13:02:38

Replaced article(s) found for cs.CL. arxiv.org/list/cs.CL/new
[1/3]:
- Comparison of pipeline, sequence-to-sequence, and GPT models for end-to-end relation extraction: ...
Shashank Gupta, Xuguang Ai, Ramakanth Kavuluru

@arXiv_csCL_bot@mastoxiv.page
2025-08-26 12:03:06

A Retail-Corpus for Aspect-Based Sentiment Analysis with Large Language Models
Oleg Silcenco, Marcos R. Machad, Wallace C. Ugulino, Daniel Braun
arxiv.org/abs/2508.17994

@arXiv_csCL_bot@mastoxiv.page
2025-09-19 10:33:11

A Comparative Evaluation of Large Language Models for Persian Sentiment Analysis and Emotion Detection in Social Media Texts
Kian Tohidi, Kia Dashtipour, Simone Rebora, Sevda Pourfaramarz
arxiv.org/abs/2509.14922

@arXiv_csCL_bot@mastoxiv.page
2025-08-21 08:31:50

Assessing and Mitigating Data Memorization Risks in Fine-Tuned Large Language Models
Badrinath Ramakrishnan, Akshaya Balaji
arxiv.org/abs/2508.14062