Tootfinder

Opt-in global Mastodon full text search. Join the index!

@arXiv_csLO_bot@mastoxiv.page
2025-06-13 07:43:10

StepProof: Step-by-step verification of natural language mathematical proofs
Xiaolin Hu, Qinghua Zhou, Bogdan Grechuk, Ivan Y. Tyukin
arxiv.org/abs/2506.10558

@arXiv_csRO_bot@mastoxiv.page
2025-06-13 07:53:30

A Navigation Framework Utilizing Vision-Language Models
Yicheng Duan, Kaiyu Tang
arxiv.org/abs/2506.10172 arxiv.org/p…

@arXiv_csDC_bot@mastoxiv.page
2025-06-12 07:25:31

EdgeProfiler: A Fast Profiling Framework for Lightweight LLMs on Edge Using Analytical Model
Alyssa Pinnock, Shakya Jayakody, Kawsher A Roxy, Md Rubel Ahmed
arxiv.org/abs/2506.09061

@arXiv_csCV_bot@mastoxiv.page
2025-08-12 12:48:13

ReferSplat: Referring Segmentation in 3D Gaussian Splatting
Shuting He, Guangquan Jie, Changshuo Wang, Yun Zhou, Shuming Hu, Guanbin Li, Henghui Ding
arxiv.org/abs/2508.08252

@arXiv_csIR_bot@mastoxiv.page
2025-08-11 09:24:30

ITDR: An Instruction Tuning Dataset for Enhancing Large Language Models in Recommendations
Zekun Liu, Xiaowen Huang, Jitao Sang
arxiv.org/abs/2508.05667

@arXiv_csRO_bot@mastoxiv.page
2025-07-11 08:57:41

LangNavBench: Evaluation of Natural Language Understanding in Semantic Navigation
Sonia Raychaudhuri, Enrico Cancelli, Tommaso Campari, Lamberto Ballan, Manolis Savva, Angel X. Chang
arxiv.org/abs/2507.07299

@arXiv_csRO_bot@mastoxiv.page
2025-06-13 08:10:10

Using Language and Road Manuals to Inform Map Reconstruction for Autonomous Driving
Akshar Tumu, Henrik I. Christensen, Marcell Vazquez-Chanlatte, Chikao Tsuchiya, Dhaval Bhanderi
arxiv.org/abs/2506.10317

@arXiv_csCE_bot@mastoxiv.page
2025-07-08 07:34:59

ElliottAgents: A Natural Language-Driven Multi-Agent System for Stock Market Analysis and Prediction
Jarosław A. Chudziak, Michał Wawer
arxiv.org/abs/2507.03435

@arXiv_csHC_bot@mastoxiv.page
2025-08-08 09:27:22

AI Conversational Tutors in Foreign Language Learning: A Mixed-Methods Evaluation Study
Nikolaos Avouris
arxiv.org/abs/2508.05156 arxiv.org…

@arXiv_csSE_bot@mastoxiv.page
2025-08-11 09:17:00

Position: Intelligent Coding Systems Should Write Programs with Justifications
Xiangzhe Xu, Shiwei Feng, Zian Su, Chengpeng Wang, Xiangyu Zhang
arxiv.org/abs/2508.06017

@arXiv_csCV_bot@mastoxiv.page
2025-07-10 09:14:01

Speak2Sign3D: A Multi-modal Pipeline for English Speech to American Sign Language Animation
Kazi Mahathir Rahman, Naveed Imtiaz Nafis, Md. Farhan Sadik, Mohammad Al Rafi, Mehedi Hasan Shahed
arxiv.org/abs/2507.06530

@arXiv_qbioQM_bot@mastoxiv.page
2025-08-08 07:59:42

Understanding protein function with a multimodal retrieval-augmented foundation model
Timothy Fei Truong Jr, Tristan Bepler
arxiv.org/abs/2508.04724

@arXiv_csCR_bot@mastoxiv.page
2025-07-08 07:48:00

Unveiling Privacy Policy Complexity: An Exploratory Study Using Graph Mining, Machine Learning, and Natural Language Processing
Vijayalakshmi Ramasamy, Seth Barrett, Gokila Dorai, Jessica Zumbach
arxiv.org/abs/2507.02968

@arXiv_csCY_bot@mastoxiv.page
2025-08-07 07:32:33

Health Insurance Coverage Rule Interpretation Corpus: Law, Policy, and Medical Guidance for Health Insurance Coverage Understanding
Mike Gartner
arxiv.org/abs/2508.03718

@arXiv_csLG_bot@mastoxiv.page
2025-06-03 21:51:54

This arxiv.org/abs/2505.19433 has been replaced.
initial toot: mastoxiv.page/@arXiv_csLG_…

@arXiv_csCL_bot@mastoxiv.page
2025-08-08 10:04:22

How Do LLMs Persuade? Linear Probes Can Uncover Persuasion Dynamics in Multi-Turn Conversations
Brandon Jaipersaud, David Krueger, Ekdeep Singh Lubana
arxiv.org/abs/2508.05625

@arXiv_csAI_bot@mastoxiv.page
2025-06-03 18:04:31

This arxiv.org/abs/2505.07453 has been replaced.
initial toot: mastoxiv.page/@arXiv_csAI_…

@arXiv_csCR_bot@mastoxiv.page
2025-07-10 08:25:21

False Alarms, Real Damage: Adversarial Attacks Using LLM-based Models on Text-based Cyber Threat Intelligence Systems
Samaneh Shafee, Alysson Bessani, Pedro M. Ferreira
arxiv.org/abs/2507.06252

@arXiv_csDB_bot@mastoxiv.page
2025-05-29 07:17:06

StreamLink: Large-Language-Model Driven Distributed Data Engineering System
Dawei Feng, Di Mei, Huiri Tan, Lei Ren, Xianying Lou, Zhangxi Tan
arxiv.org/abs/2505.21575

@arXiv_astrophIM_bot@mastoxiv.page
2025-07-03 08:32:00

SpecCLIP: Aligning and Translating Spectroscopic Measurements for Stars
Xiaosheng Zhao, Yang Huang, Guirong Xue, Xiao Kong, Jifeng Liu, Xiaoyu Tang, Timothy C. Beers, Yuan-Sen Ting, A-Li Luo
arxiv.org/abs/2507.01939

@arXiv_csHC_bot@mastoxiv.page
2025-08-05 11:27:10

Understanding User Preferences for Interaction Styles in Conversational Recommender Systems: The Predictive Role of System Qualities, User Experience, and Traits
Raj Mahmud, Shlomo Berkovsky, Mukesh Prasad, A. Baki Kocaballi
arxiv.org/abs/2508.02328

@arXiv_csMM_bot@mastoxiv.page
2025-07-04 08:44:01

VRAgent-R1: Boosting Video Recommendation with MLLM-based Agents via Reinforcement Learning
Siran Chen, Boyu Chen, Chenyun Yu, Yuxiao Luo, Ouyang Yi, Lei Cheng, Chengxiang Zhuo, Zang Li, Yali Wang
arxiv.org/abs/2507.02626

@arXiv_csCL_bot@mastoxiv.page
2025-06-27 09:59:39

skLEP: A Slovak General Language Understanding Benchmark
Marek Šuppa, Andrej Ridzik, Daniel Hládek, Tomáš Javůrek, Viktória Ondrejová, Kristína Sásiková, Martin Tamajka, Marián Šimko
arxiv.org/abs/2506.21508

@arXiv_qbioNC_bot@mastoxiv.page
2025-06-24 09:19:19

Challenges in Grounding Language in the Real World
Peter Lindes, Kaoutar Skiker
arxiv.org/abs/2506.17375 arxiv.org/pd…

@arXiv_csMA_bot@mastoxiv.page
2025-07-02 08:34:30

State and Memory is All You Need for Robust and Reliable AI Agents
Matthew Muhoberac, Atharva Parikh, Nirvi Vakharia, Saniya Virani, Aco Radujevic, Savannah Wood, Meghav Verma, Dimitri Metaxotos, Jeyaraman Soundararajan, Thierry Masquelin, Alexander G. Godfrey, Sean Gardner, Dobrila Rudnicki, Sam Michael, Gaurav Chopra
a…

@arXiv_csSE_bot@mastoxiv.page
2025-06-03 17:04:18

This arxiv.org/abs/2411.03079 has been replaced.
initial toot: mastoxiv.page/@arXiv_csSE_…

@arXiv_eessIV_bot@mastoxiv.page
2025-07-30 08:17:01

Querying GI Endoscopy Images: A VQA Approach
Gaurav Parajuli
arxiv.org/abs/2507.21165 arxiv.org/pdf/2507.21165

@arXiv_csCV_bot@mastoxiv.page
2025-07-04 10:24:31

Bootstrapping Grounded Chain-of-Thought in Multimodal LLMs for Data-Efficient Model Adaptation
Jiaer Xia, Bingkui Tong, Yuhang Zang, Rui Shao, Kaiyang Zhou
arxiv.org/abs/2507.02859

@arXiv_econGN_bot@mastoxiv.page
2025-06-03 16:34:48

This arxiv.org/abs/2504.15448 has been replaced.
initial toot: mastoxiv.page/@arXiv_eco…

@lysander07@sigmoid.social
2025-05-13 16:25:32

Last week, our students learned how to conduct a proper evaluation for an NLP experiment. To this end, we introduced a small text corpus with sentences about Joseph Fourier, who counts as one of the discoverers of the greenhouse effect, responsible for global warming.

Slide of the Information Service Engineering lecture 03, Natural Language Processing 02, section 2.6: Evaluation, Precision, and Recall
Headline: Experiment
Let's consider the following text corpus (FOURIERCORPUS):
1
In 1807, Fourier's work on heat transfer laid the foundation for understanding the greenhouse effect.
2
Joseph Fourier's energy balance analysis showed atmosphere's heat-trapping role.
3
Fourrier's calculations, though rudimentary, suggested that the atmosphere acts as an insulato…
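
The precision and recall evaluation named in the slide title can be sketched as follows. This is a minimal illustration, not the lecture's actual experiment: the query term, the exact-substring retrieval rule, and the relevance labels (all three sentences treated as relevant) are assumptions. The corpus sentences are copied from the slide text above, with the third left truncated as it appears.

```python
# Sentences as quoted on the slide; the third is truncated in the source.
corpus = {
    1: "In 1807, Fourier's work on heat transfer laid the foundation for understanding the greenhouse effect.",
    2: "Joseph Fourier's energy balance analysis showed atmosphere's heat-trapping role.",
    3: "Fourrier's calculations, though rudimentary, suggested that the atmosphere acts as an insulato…",
}

# Assumption: every sentence is about Joseph Fourier, so all three are relevant.
relevant = {1, 2, 3}


def retrieve(query, documents):
    """Hypothetical retrieval rule: case-sensitive exact substring match."""
    return {doc_id for doc_id, text in documents.items() if query in text}


def precision_recall(retrieved, relevant):
    """Return (precision, recall) for two sets of document IDs."""
    true_positives = len(retrieved & relevant)
    precision = true_positives / len(retrieved) if retrieved else 0.0
    recall = true_positives / len(relevant) if relevant else 0.0
    return precision, recall


retrieved = retrieve("Fourier", corpus)
precision, recall = precision_recall(retrieved, relevant)
print(f"retrieved={sorted(retrieved)} precision={precision:.2f} recall={recall:.2f}")
```

Under these assumptions, the exact match misses the sentence that spells the name "Fourrier", so every retrieved sentence is relevant but one relevant sentence is not retrieved.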

@arXiv_csAR_bot@mastoxiv.page
2025-06-23 08:05:39

DeepRTL2: A Versatile Model for RTL-Related Tasks
Yi Liu, Hongji Zhang, Yunhao Zhou, Zhengyuan Shi, Changran Xu, Qiang Xu
arxiv.org/abs/2506.15697

@arXiv_csCL_bot@mastoxiv.page
2025-07-25 10:06:32

AraTable: Benchmarking LLMs' Reasoning and Understanding of Arabic Tabular Data
Rana Alshaikh, Israa Alghanmi, Shelan Jeawak
arxiv.org/abs/2507.18442

@arXiv_csDB_bot@mastoxiv.page
2025-06-17 09:28:39

Datrics Text2SQL: A Framework for Natural Language to SQL Query Generation
Tetiana Gladkykh, Kyrylo Kirykov
arxiv.org/abs/2506.12234

@arXiv_csCR_bot@mastoxiv.page
2025-06-26 09:45:50

SV-LLM: An Agentic Approach for SoC Security Verification using Large Language Models
Dipayan Saha, Shams Tarek, Hasan Al Shaikh, Khan Thamid Hasan, Pavan Sai Nalluri, Md. Ajoad Hasan, Nashmin Alam, Jingbo Zhou, Sujan Kumar Saha, Mark Tehranipoor, Farimah Farahmandi
arxiv.org/abs/2506.20415

@arXiv_csSE_bot@mastoxiv.page
2025-06-24 12:06:40

Understanding Software Engineering Agents: A Study of Thought-Action-Result Trajectories
Islem Bouzenia, Michael Pradel
arxiv.org/abs/2506.18824

@arXiv_csHC_bot@mastoxiv.page
2025-06-23 11:40:20

Capturing Visualization Design Rationale
Maeve Hutchinson, Radu Jianu, Aidan Slingsby, Jo Wood, Pranava Madhyastha
arxiv.org/abs/2506.16571

@arXiv_csIR_bot@mastoxiv.page
2025-06-23 09:44:00

eSapiens: A Real-World NLP Framework for Multimodal Document Understanding and Enterprise Knowledge Processing
Isaac Shi, Zeyuan Li, Wenli Wang, Lewei He, Yang Yang, Tianyu Shi
arxiv.org/abs/2506.16768

@arXiv_csSE_bot@mastoxiv.page
2025-08-04 09:32:01

Can User Feedback Help Issue Detection? An Empirical Study on a One-billion-user Online Service System
Shuyao Jiang, Jiazhen Gu, Wujie Zheng, Yangfan Zhou, Michael R. Lyu
arxiv.org/abs/2508.00593

@arXiv_physicsgeoph_bot@mastoxiv.page
2025-05-28 07:34:40

SeisCoDE: 3D Seismic Interpretation Foundation Model with Contrastive Self-Distillation Learning
Goodluck Archibong, Ardiansyah Koeshidayatullah, Umair Waheed, Weichang Li, Dicky Harishidayat, Motaz Alfarraj
arxiv.org/abs/2505.20518

@arXiv_csMM_bot@mastoxiv.page
2025-07-15 09:12:51

LayLens: Improving Deepfake Understanding through Simplified Explanations
Abhijeet Narang, Parul Gupta, Liuyijia Su, Abhinav Dhall
arxiv.org/abs/2507.10066

@arXiv_csSE_bot@mastoxiv.page
2025-07-24 08:30:20

Evaluating Uncertainty and Quality of Visual Language Action-enabled Robots
Pablo Valle, Chengjie Lu, Shaukat Ali, Aitor Arrieta
arxiv.org/abs/2507.17049

@arXiv_csRO_bot@mastoxiv.page
2025-07-16 07:46:41

Vision Language Action Models in Robotic Manipulation: A Systematic Review
Muhayy Ud Din, Waseem Akram, Lyes Saad Saoud, Jan Rosell, Irfan Hussain
arxiv.org/abs/2507.10672

@arXiv_econGN_bot@mastoxiv.page
2025-06-19 08:39:22

Identifying economic narratives in large text corpora -- An integrated approach using Large Language Models
Tobias Schmidt, Kai-Robin Lange, Matthias Reccius, Henrik Müller, Michael Roos, Carsten Jentsch
arxiv.org/abs/2506.15041

@arXiv_csRO_bot@mastoxiv.page
2025-06-23 11:50:30

CodeDiffuser: Attention-Enhanced Diffusion Policy via VLM-Generated Code for Instruction Ambiguity
Guang Yin, Yitong Li, Yixuan Wang, Dale McConachie, Paarth Shah, Kunimatsu Hashimoto, Huan Zhang, Katherine Liu, Yunzhu Li
arxiv.org/abs/2506.16652

@arXiv_csCL_bot@mastoxiv.page
2025-07-17 10:10:50

Beyond Single Models: Enhancing LLM Detection of Ambiguity in Requests through Debate
Ana Davila, Jacinto Colan, Yasuhisa Hasegawa
arxiv.org/abs/2507.12370

@arXiv_csCL_bot@mastoxiv.page
2025-06-23 08:16:40

Rethinking LLM Training through Information Geometry and Quantum Metrics
Riccardo Di Sipio
arxiv.org/abs/2506.15830 a…

@arXiv_csCL_bot@mastoxiv.page
2025-07-17 08:34:00

ExpliCIT-QA: Explainable Code-Based Image Table Question Answering
Maximiliano Hormazábal Lagos, Álvaro Bueno Sáez, Pedro Alonso Doval, Jorge Alcalde Vesteiro, Héctor Cerezo-Costas
arxiv.org/abs/2507.11694

@arXiv_csSE_bot@mastoxiv.page
2025-07-14 09:10:22

NL in the Middle: Code Translation with LLMs and Intermediate Representations
Chi-en Amy Tai, Pengyu Nie, Lukasz Golab, Alexander Wong
arxiv.org/abs/2507.08627

@arXiv_csSE_bot@mastoxiv.page
2025-07-17 09:25:30

MERA Code: A Unified Framework for Evaluating Code Generation Across Tasks
Artem Chervyakov, Alexander Kharitonov, Pavel Zadorozhny, Adamenko Pavel, Rodion Levichev, Dmitrii Vorobev, Dmitrii Salikhov, Aidar Valeev, Alena Pestova, Maria Dziuba, Ilseyar Alimova, Artem Zavgorodnev, Aleksandr Medvedev, Stanislav Moiseev, Elena Bruches, Daniil Grebenkin, Roman Derunets, Vikulov Vladimir, Anton Emelyanov, Dmitrii Babaev, Vladimir V. Ivanov, Valentin Malykh, Alena Fenogenova