
2025-08-12 09:10:23
Towards General-Purpose Data Discovery: A Programming Languages Approach
Andrew Kang, Yashnil Saha, Sainyam Galhotra
https://arxiv.org/abs/2508.08074 https://
Towards General-Purpose Data Discovery: A Programming Languages Approach
Andrew Kang, Yashnil Saha, Sainyam Galhotra
https://arxiv.org/abs/2508.08074 https://
Maximizing GPU Efficiency via Optimal Adapter Caching: An Analytical Approach for Multi-Tenant LLM Serving
Ferran Agullo, Joan Oliveras, Chen Wang, Alberto Gutierrez-Torre, Olivier Tardieu, Alaa Youssef, Jordi Torres, Josep Ll. Berral
https://arxiv.org/abs/2508.08343
PyVeritas: On Verifying Python via LLM-Based Transpilation and Bounded Model Checking for C
Pedro Orvalho, Marta Kwiatkowska
https://arxiv.org/abs/2508.08171 https://
Astra: Toward General-Purpose Mobile Robots via Hierarchical Multimodal Learning
Sheng Chen, Peiyu He, Jiaxin Hu, Ziyang Liu, Yansheng Wang, Tao Xu, Chi Zhang, Chongchong Zhang, Chao An, Shiyu Cai, Duo Cao, Kangping Chen, Shuai Chu, Tianwei Chu, Mingdi Dan, Min Du, Weiwei Fang, Pengyou Fu, Junkai Hu, Xiaowei Jiang, Zhaodi Jiang, Fuxuan Li, Jun Li, Minghui Li, Mingyao Li, Yanchang Li, Zhibin Li, Guangming Liu, Kairui Liu, Lihao Liu, Weizhi Liu, Xiaoshun Liu, Yufei Liu, Yunfei Liu, Qiang…
I'm trying to write a general purpose Inspector UI object for #Clojure
so you have a function
`(inspect o)`
which, when evaluated, throws up a window showing in a sensible form the value of `o`.
Obviously, though, if `o` is lazy, you don't want the inspector to explore it all.
LazySeq implements an interface IPending, which isn't documented. Are all lazy …
This https://arxiv.org/abs/2506.06266 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCL_…
AI Risk-Management Standards Profile for General-Purpose AI (GPAI) and Foundation Models
Anthony M. Barrett, Jessica Newman, Brandie Nonnecke, Nada Madkour, Dan Hendrycks, Evan R. Murphy, Krystal Jackson, Deepika Raman
https://arxiv.org/abs/2506.23949
ProtocolLLM: RTL Benchmark for SystemVerilog Generation of Communication Protocols
Arnav Sheth, Ivaxi Sheth, Mario Fritz
https://arxiv.org/abs/2506.07945 h…
Teaching Astronomy with Large Language Models
Yuan-Sen Ting, Teaghan O'Briain
https://arxiv.org/abs/2506.06921 https://arxiv.org/…
DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment
Ke-Han Lu, Zhehuai Chen, Szu-Wei Fu, Chao-Han Huck Yang, Sung-Feng Huang, Chih-Kai Yang, Chee-En Yu, Chun-Wei Chen, Wei-Chih Chen, Chien-yu Huang, Yi-Cheng Lin, Yu-Xiang Lin, Chi-An Fu, Chun-Yi Kuan, Wenze Ren, Xuanjun Chen, Wei-Ping Huang, En-Pei Hu, Tzu-Quan Lin, Yuan-Kuei Wu, Kuan-Po Huang, Hsiao-Ying Huang, Huang-Cheng Chou, Kai-Wei Chang, Cheng-Han Chiang, Boris Ginsburg, Yu…
SafeCOMM: What about Safety Alignment in Fine-Tuned Telecom Large Language Models?
Aladin Djuhera, Swanand Ravindra Kadhe, Farhan Ahmed, Syed Zawad, Holger Boche, Walid Saad
https://arxiv.org/abs/2506.00062
Kwai Keye-VL Technical Report
Kwai Keye Team, Biao Yang, Bin Wen, Changyi Liu, Chenglong Chu, Chengru Song, Chongling Rao, Chuan Yi, Da Li, Dunju Zang, Fan Yang, Guorui Zhou, Hao Peng, Haojie Ding, Jiaming Huang, Jiangxia Cao, Jiankang Chen, Jingyun Hua, Jin Ouyang, Kaibing Chen, Kaiyu Jiang, Kaiyu Tang, Kun Gai, Shengnan Zhang, Siyang Mao, Sui Huang, Tianke Zhang, Tingting Gao, Wei Chen, Wei Yuan, Xiangyu Wu, Xiao Hu, Xingyu Lu, Yang Zhou, Yi-Fan Zhang, Yiping Yang, Yulong Chen, Zhenh…
Reconstructing Biological Pathways by Applying Selective Incremental Learning to (Very) Small Language Models
Pranta Saha, Joyce Reimer, Brook Byrns, Connor Burbridge, Neeraj Dhar, Jeffrey Chen, Steven Rayan, Gordon Broderick
https://arxiv.org/abs/2507.04432
On The Role of Pretrained Language Models in General-Purpose Text Embeddings: A Survey
Meishan Zhang, Xin Zhang, Xinping Zhao, Shouzheng Huang, Baotian Hu, Min Zhang
https://arxiv.org/abs/2507.20783
This https://arxiv.org/abs/2505.21652 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csRO_…
SAEL: Leveraging Large Language Models with Adaptive Mixture-of-Experts for Smart Contract Vulnerability Detection
Lei Yu, Shiqi Cheng, Zhirong Huang, Jingyuan Zhang, Chenjie Shen, Junyi Lu, Li Yang, Fengjun Zhang, Jiajia Ma
https://arxiv.org/abs/2507.22371
This https://arxiv.org/abs/2410.21801 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csIR_…
An instance of FreeCHR with refined operational semantics
Sascha Rechenberger, Thom Fr\"uhwirth
https://arxiv.org/abs/2505.22155 https://
A Large Language Model for Chemistry and Retrosynthesis Predictions
Yueqing Zhang, Wentao Liu, Yan Zhang, Danyang Xiong, Jihang Zhai, Hao Hao, YuCheng Gu, HaiBo Yang, Shuanhu Gao, Lianrui Hu, Aimin Zhou, Xiao He
https://arxiv.org/abs/2507.01444
aLLoyM: A large language model for alloy phase diagram prediction
Yuna Oikawa, Guillaume Deffrennes, Taichi Abe, Ryo Tamura, Koji Tsuda
https://arxiv.org/abs/2507.22558 https://…
Concept-Level AI for Telecom: Moving Beyond Large Language Models
Viswanath Kumarskandpriya, Abdulhalim Dandoush, Abbas Bradai, Ali Belgacem
https://arxiv.org/abs/2506.22359
For the first time AI systems crossed the gold-medal scoring threshold at the International Mathematical Olympiad for high-school students.
Both Google and OpenAI's models solved five out of six problems,
-- achieving the result using general-purpose “reasoning” models that processed mathematical concepts using natural language, in contrast to the previous approaches used by AI firms.
OpenAI’s breakthrough was achieved with a new experimental model centered on massively …
RooseBERT: A New Deal For Political Language Modelling
Deborah Dore, Elena Cabrio, Serena Villata
https://arxiv.org/abs/2508.03250 https://arxiv.org/pdf/25…
Taming Vision-Language Models for Medical Image Analysis: A Comprehensive Review
Haoneng Lin, Cheng Xu, Jing Qin
https://arxiv.org/abs/2506.18378 https://
OpenBEATs: A Fully Open-Source General-Purpose Audio Encoder
Shikhar Bharadwaj, Samuele Cornell, Kwanghee Choi, Satoru Fukayama, Hye-jin Shim, Soham Deshmukh, Shinji Watanabe
https://arxiv.org/abs/2507.14129
This just occured to me (too much sun and gin lemonade could be a factor): English is a funny language and when they say Artificial they mean Automated, and when they say Intelligence they don't mean smarts, they mean covertly gathering intel from prospective enemies!
Hence #ArtificialIntelligence, often promoted to General.
The purpose of any system is what it does, not what it consistently fails to do.
This https://arxiv.org/abs/2505.07453 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csAI_…
The Case for Instance-Optimized LLMs in OLAP Databases
Bardia Mohammadi, Laurent Bindschaedler
https://arxiv.org/abs/2507.04967 https://
A Careful Examination of Large Behavior Models for Multitask Dexterous Manipulation
TRI LBM Team, Jose Barreiros, Andrew Beaulieu, Aditya Bhat, Rick Cory, Eric Cousineau, Hongkai Dai, Ching-Hsin Fang, Kunimatsu Hashimoto, Muhammad Zubair Irshad, Masha Itkina, Naveen Kuppuswamy, Kuan-Hui Lee, Katherine Liu, Dale McConachie, Ian McMahon, Haruki Nishimura, Calder Phillips-Grafflin, Charles Richter, Paarth Shah, Krishnan Srinivasan, Blake Wulfe, Chen Xu, Mengchao Zhang, Alex Alspach, Maya …
AlgoTune: Can Language Models Speed Up General-Purpose Numerical Programs?
Ori Press, Brandon Amos, Haoyu Zhao, Yikai Wu, Samuel K. Ainsworth, Dominik Krupke, Patrick Kidger, Touqir Sajed, Bartolomeo Stellato, Jisun Park, Nathanael Bosch, Eli Meril, Albert Steppi, Arman Zharmagambetov, Fangzhao Zhang, David Perez-Pineiro, Alberto Mercurio, Ni Zhan, Talor Abramovich, Kilian Lieret, Hanlin Zhang, Shirley Huang, Matthias Bethge, Ofir Press
Hop, Skip, and Overthink: Diagnosing Why Reasoning Models Fumble during Multi-Hop Analysis
Anushka Yadav, Isha Nalawade, Srujana Pillarichety, Yashwanth Babu, Reshmi Ghosh, Samyadeep Basu, Wenlong Zhao, Ali Nasaeh, Sriram Balasubramanian, Soundararajan Srinivasan
https://arxiv.org/abs/2508.04699
Reasoning on a Budget: A Survey of Adaptive and Controllable Test-Time Compute in LLMs
Mohammad Ali Alomrani, Yingxue Zhang, Derek Li, Qianyi Sun, Soumyasundar Pal, Zhanguang Zhang, Yaochen Hu, Rohan Deepak Ajwani, Antonios Valkanas, Raika Karimi, Peng Cheng, Yunzhou Wang, Pengyi Liao, Hanrui Huang, Bin Wang, Jianye Hao, Mark Coates
https://
Open Scene Graphs for Open-World Object-Goal Navigation
Joel Loo, Zhanxin Wu, David Hsu
https://arxiv.org/abs/2508.04678 https://arxiv.org/pdf/2508.04678…
Exploring User Security and Privacy Attitudes and Concerns Toward the Use of General-Purpose LLM Chatbots for Mental Health
Jabari Kwesi, Jiaxun Cao, Riya Manchanda, Pardis Emami-Naeini
https://arxiv.org/abs/2507.10695
Long-Context Modeling Networks for Monaural Speech Enhancement: A Comparative Study
Qiquan Zhang, Moran Chen, Zeyang Song, Hexin Liu, Xiangyu Zhang, Haizhou Li
https://arxiv.org/abs/2507.04368
Thread and Memory-Safe Programming with CLASS
Lu\'is Caires (Instituto Superior T\'ecnico)
https://arxiv.org/abs/2505.20848 https://
Context-Aware Scientific Knowledge Extraction on Linked Open Data using Large Language Models
Sajratul Y. Rubaiat, Hasan M. Jamil
https://arxiv.org/abs/2506.17580
General-Purpose Robotic Navigation via LVLM-Orchestrated Perception, Reasoning, and Acting
Bernard Lange, Anil Yildiz, Mansur Arief, Shehryar Khattak, Mykel Kochenderfer, Georgios Georgakis
https://arxiv.org/abs/2506.17462
Shrinking the Generation-Verification Gap with Weak Verifiers
Jon Saad-Falcon, E. Kelly Buchanan, Mayee F. Chen, Tzu-Heng Huang, Brendan McLaughlin, Tanvir Bhathal, Shang Zhu, Ben Athiwaratkun, Frederic Sala, Scott Linderman, Azalia Mirhoseini, Christopher R\'e
https://arxiv.org/abs/2506.18203
Can Common VLMs Rival Medical VLMs? Evaluation and Strategic Insights
Yuan Zhong, Ruinan Jin, Xiaoxiao Li, Qi Dou
https://arxiv.org/abs/2506.17337 https://…
Can LLMs Write CI? A Study on Automatic Generation of GitHub Actions Configurations
Taher A. Ghaleb, Dulina Rathnayake
https://arxiv.org/abs/2507.17165 htt…
Optimized Execution of FreeCHR
Sascha Rechenberger, Thom Fr\"uhwirth
https://arxiv.org/abs/2506.14485 https://arxiv.org/pdf/2506…
Text-to-SQL Task-oriented Dialogue Ontology Construction
Renato Vukovic, Carel van Niekerk, Michael Heck, Benjamin Ruppik, Hsien-Chin Lin, Shutong Feng, Nurul Lubis, Milica Gasic
https://arxiv.org/abs/2507.23358
Programming by Backprop: LLMs Acquire Reusable Algorithmic Abstractions During Code Training
Jonathan Cook, Silvia Sapora, Arash Ahmadian, Akbir Khan, Tim Rocktaschel, Jakob Foerster, Laura Ruis
https://arxiv.org/abs/2506.18777
MUST-RAG: MUSical Text Question Answering with Retrieval Augmented Generation
Daeyong Kwon, SeungHeon Doh, Juhan Nam
https://arxiv.org/abs/2507.23334 https://
Investigating the Role of LLMs Hyperparameter Tuning and Prompt Engineering to Support Domain Modeling
Vladyslav Bulhakov, Giordano d'Aloisio, Claudio Di Sipio, Antinisca Di Marco, Davide Di Ruscio
https://arxiv.org/abs/2507.14735
I2I-STRADA -- Information to Insights via Structured Reasoning Agent for Data Analysis
SaiBarath Sundar, Pranav Satheesan, Udayaadithya Avadhanam
https://arxiv.org/abs/2507.17874
MCIF: Multimodal Crosslingual Instruction-Following Benchmark from Scientific Talks
Sara Papi, Maike Z\"ufle, Marco Gaido, Beatrice Savoldi, Danni Liu, Ioannis Douros, Luisa Bentivogli, Jan Niehues
https://arxiv.org/abs/2507.19634
Fine-Tuning Lowers Safety and Disrupts Evaluation Consistency
Kathleen C. Fraser, Hillary Dawkins, Isar Nejadgholi, Svetlana Kiritchenko
https://arxiv.org/abs/2506.17209
TRPrompt: Bootstrapping Query-Aware Prompt Optimization from Textual Rewards
Andreea Nica, Ivan Zakazov, Nicolas Mario Baldwin, Saibo Geng, Robert West
https://arxiv.org/abs/2507.18618
Versatile and Generalizable Manipulation via Goal-Conditioned Reinforcement Learning with Grounded Object Detection
Huiyi Wang, Fahim Shahriar, Alireza Azimi, Gautham Vasan, Rupam Mahmood, Colin Bellinger
https://arxiv.org/abs/2507.10814
Leveraging Synthetic Data for Question Answering with Multilingual LLMs in the Agricultural Domain
Rishemjit Kaur, Arshdeep Singh Bhankhar, Surangika Ranathunga, Jashanpreet Singh Salh, Sudhir Rajput, Vidhi, Kashish Mahendra, Bhavika Berwal, Ritesh Kumar
https://arxiv.org/abs/2507.16974…