
2025-06-24 09:26:50
General-Purpose Robotic Navigation via LVLM-Orchestrated Perception, Reasoning, and Acting
Bernard Lange, Anil Yildiz, Mansur Arief, Shehryar Khattak, Mykel Kochenderfer, Georgios Georgakis
https://arxiv.org/abs/2506.17462
Taming Vision-Language Models for Medical Image Analysis: A Comprehensive Review
Haoneng Lin, Cheng Xu, Jing Qin
https://arxiv.org/abs/2506.18378 https://
This just occurred to me (too much sun and gin lemonade could be a factor): English is a funny language and when they say Artificial they mean Automated, and when they say Intelligence they don't mean smarts, they mean covertly gathering intel from prospective enemies!
Hence #ArtificialIntelligence, often promoted to General.
The purpose of any system is what it does, not what it consistently fails to do.
AlgoTune: Can Language Models Speed Up General-Purpose Numerical Programs?
Ori Press, Brandon Amos, Haoyu Zhao, Yikai Wu, Samuel K. Ainsworth, Dominik Krupke, Patrick Kidger, Touqir Sajed, Bartolomeo Stellato, Jisun Park, Nathanael Bosch, Eli Meril, Albert Steppi, Arman Zharmagambetov, Fangzhao Zhang, David Perez-Pineiro, Alberto Mercurio, Ni Zhan, Talor Abramovich, Kilian Lieret, Hanlin Zhang, Shirley Huang, Matthias Bethge, Ofir Press
Programming by Backprop: LLMs Acquire Reusable Algorithmic Abstractions During Code Training
Jonathan Cook, Silvia Sapora, Arash Ahmadian, Akbir Khan, Tim Rocktäschel, Jakob Foerster, Laura Ruis
https://arxiv.org/abs/2506.18777
Context-Aware Scientific Knowledge Extraction on Linked Open Data using Large Language Models
Sajratul Y. Rubaiat, Hasan M. Jamil
https://arxiv.org/abs/2506.17580
TRPrompt: Bootstrapping Query-Aware Prompt Optimization from Textual Rewards
Andreea Nica, Ivan Zakazov, Nicolas Mario Baldwin, Saibo Geng, Robert West
https://arxiv.org/abs/2507.18618
Shrinking the Generation-Verification Gap with Weak Verifiers
Jon Saad-Falcon, E. Kelly Buchanan, Mayee F. Chen, Tzu-Heng Huang, Brendan McLaughlin, Tanvir Bhathal, Shang Zhu, Ben Athiwaratkun, Frederic Sala, Scott Linderman, Azalia Mirhoseini, Christopher Ré
https://arxiv.org/abs/2506.18203
Can LLMs Write CI? A Study on Automatic Generation of GitHub Actions Configurations
Taher A. Ghaleb, Dulina Rathnayake
https://arxiv.org/abs/2507.17165 htt…
For the first time, AI systems crossed the gold-medal scoring threshold at the International Mathematical Olympiad for high-school students.
Google's and OpenAI's systems each solved five out of six problems, achieving the result with general-purpose “reasoning” models that processed mathematical concepts using natural language, in contrast to the previous approaches used by AI firms.
OpenAI’s breakthrough was achieved with a new experimental model centered on massively …
Alibaba-backed Z.ai, formerly Zhipu, launches a general-purpose AI agent app, which lets users use natural language to book hotels, order takeaway, and more (Danielle Popov/South China Morning Post)
https://www.scmp.com/tech/tech-trends/arti…
I2I-STRADA -- Information to Insights via Structured Reasoning Agent for Data Analysis
SaiBarath Sundar, Pranav Satheesan, Udayaadithya Avadhanam
https://arxiv.org/abs/2507.17874
Leveraging Synthetic Data for Question Answering with Multilingual LLMs in the Agricultural Domain
Rishemjit Kaur, Arshdeep Singh Bhankhar, Surangika Ranathunga, Jashanpreet Singh Salh, Sudhir Rajput, Vidhi, Kashish Mahendra, Bhavika Berwal, Ritesh Kumar
https://arxiv.org/abs/2507.16974…
Can Common VLMs Rival Medical VLMs? Evaluation and Strategic Insights
Yuan Zhong, Ruinan Jin, Xiaoxiao Li, Qi Dou
https://arxiv.org/abs/2506.17337 https://…
OpenBEATs: A Fully Open-Source General-Purpose Audio Encoder
Shikhar Bharadwaj, Samuele Cornell, Kwanghee Choi, Satoru Fukayama, Hye-jin Shim, Soham Deshmukh, Shinji Watanabe
https://arxiv.org/abs/2507.14129
Exploring User Security and Privacy Attitudes and Concerns Toward the Use of General-Purpose LLM Chatbots for Mental Health
Jabari Kwesi, Jiaxun Cao, Riya Manchanda, Pardis Emami-Naeini
https://arxiv.org/abs/2507.10695
Optimized Execution of FreeCHR
Sascha Rechenberger, Thom Frühwirth
https://arxiv.org/abs/2506.14485 https://arxiv.org/pdf/2506…
Fine-Tuning Lowers Safety and Disrupts Evaluation Consistency
Kathleen C. Fraser, Hillary Dawkins, Isar Nejadgholi, Svetlana Kiritchenko
https://arxiv.org/abs/2506.17209
Towards General-Purpose Data Discovery: A Programming Languages Approach
Andrew Kang, Yashnil Saha, Sainyam Galhotra
https://arxiv.org/abs/2508.08074 https://
US ethnic cleansing and what to do about it
Reposting link to source article instead of screenshot of tweet that had no alt text:
Data on arrests shows that ICE was heavily engaged in racial profiling in LA, because their arrest numbers fell by ~66% after they were ordered to stop making arrests based just on factors like skin color, workplace, or language spoken.
#ICE #USPol
AudioSet-R: A Refined AudioSet with Multi-Stage LLM Label Reannotation
Yulin Sun, Qisheng Xu, Yi Su, Qian Zhu, Yong Dou, Xinwang Liu, Kele Xu
https://arxiv.org/abs/2508.15429 ht…
I'm trying to write a general purpose Inspector UI object for #Clojure
so you have a function
`(inspect o)`
which, when evaluated, throws up a window showing in a sensible form the value of `o`.
Obviously, though, if `o` is lazy, you don't want the inspector to explore it all.
LazySeq implements an interface IPending, which isn't documented. Are all lazy …
Investigating the Role of LLMs Hyperparameter Tuning and Prompt Engineering to Support Domain Modeling
Vladyslav Bulhakov, Giordano d'Aloisio, Claudio Di Sipio, Antinisca Di Marco, Davide Di Ruscio
https://arxiv.org/abs/2507.14735
DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment
Ke-Han Lu, Zhehuai Chen, Szu-Wei Fu, Chao-Han Huck Yang, Sung-Feng Huang, Chih-Kai Yang, Chee-En Yu, Chun-Wei Chen, Wei-Chih Chen, Chien-yu Huang, Yi-Cheng Lin, Yu-Xiang Lin, Chi-An Fu, Chun-Yi Kuan, Wenze Ren, Xuanjun Chen, Wei-Ping Huang, En-Pei Hu, Tzu-Quan Lin, Yuan-Kuei Wu, Kuan-Po Huang, Hsiao-Ying Huang, Huang-Cheng Chou, Kai-Wei Chang, Cheng-Han Chiang, Boris Ginsburg, Yu…
MedReseacher-R1: Expert-Level Medical Deep Researcher via A Knowledge-Informed Trajectory Synthesis Framework
Ailing Yu, Lan Yao, Jingnan Liu, Zhe Chen, Jiajun Yin, Yuan Wang, Xinhao Liao, Zhiling Ye, Ji Li, Yun Yue, Hansong Xiao, Hualei Zhou, Chunxiao Guo, Peng Wei, Jinjie Gu
https://arxiv.org/abs/2508.14880…
Kwai Keye-VL Technical Report
Kwai Keye Team, Biao Yang, Bin Wen, Changyi Liu, Chenglong Chu, Chengru Song, Chongling Rao, Chuan Yi, Da Li, Dunju Zang, Fan Yang, Guorui Zhou, Hao Peng, Haojie Ding, Jiaming Huang, Jiangxia Cao, Jiankang Chen, Jingyun Hua, Jin Ouyang, Kaibing Chen, Kaiyu Jiang, Kaiyu Tang, Kun Gai, Shengnan Zhang, Siyang Mao, Sui Huang, Tianke Zhang, Tingting Gao, Wei Chen, Wei Yuan, Xiangyu Wu, Xiao Hu, Xingyu Lu, Yang Zhou, Yi-Fan Zhang, Yiping Yang, Yulong Chen, Zhenh…
Astra: Toward General-Purpose Mobile Robots via Hierarchical Multimodal Learning
Sheng Chen, Peiyu He, Jiaxin Hu, Ziyang Liu, Yansheng Wang, Tao Xu, Chi Zhang, Chongchong Zhang, Chao An, Shiyu Cai, Duo Cao, Kangping Chen, Shuai Chu, Tianwei Chu, Mingdi Dan, Min Du, Weiwei Fang, Pengyou Fu, Junkai Hu, Xiaowei Jiang, Zhaodi Jiang, Fuxuan Li, Jun Li, Minghui Li, Mingyao Li, Yanchang Li, Zhibin Li, Guangming Liu, Kairui Liu, Lihao Liu, Weizhi Liu, Xiaoshun Liu, Yufei Liu, Yunfei Liu, Qiang…
ProtocolLLM: RTL Benchmark for SystemVerilog Generation of Communication Protocols
Arnav Sheth, Ivaxi Sheth, Mario Fritz
https://arxiv.org/abs/2506.07945 h…
aLLoyM: A large language model for alloy phase diagram prediction
Yuna Oikawa, Guillaume Deffrennes, Taichi Abe, Ryo Tamura, Koji Tsuda
https://arxiv.org/abs/2507.22558 https://…
Teaching Astronomy with Large Language Models
Yuan-Sen Ting, Teaghan O'Briain
https://arxiv.org/abs/2506.06921 https://arxiv.org/…
Maximizing GPU Efficiency via Optimal Adapter Caching: An Analytical Approach for Multi-Tenant LLM Serving
Ferran Agullo, Joan Oliveras, Chen Wang, Alberto Gutierrez-Torre, Olivier Tardieu, Alaa Youssef, Jordi Torres, Josep Ll. Berral
https://arxiv.org/abs/2508.08343
Reconstructing Biological Pathways by Applying Selective Incremental Learning to (Very) Small Language Models
Pranta Saha, Joyce Reimer, Brook Byrns, Connor Burbridge, Neeraj Dhar, Jeffrey Chen, Steven Rayan, Gordon Broderick
https://arxiv.org/abs/2507.04432
Concept-Level AI for Telecom: Moving Beyond Large Language Models
Viswanath Kumarskandpriya, Abdulhalim Dandoush, Abbas Bradai, Ali Belgacem
https://arxiv.org/abs/2506.22359
AI Risk-Management Standards Profile for General-Purpose AI (GPAI) and Foundation Models
Anthony M. Barrett, Jessica Newman, Brandie Nonnecke, Nada Madkour, Dan Hendrycks, Evan R. Murphy, Krystal Jackson, Deepika Raman
https://arxiv.org/abs/2506.23949
A Large Language Model for Chemistry and Retrosynthesis Predictions
Yueqing Zhang, Wentao Liu, Yan Zhang, Danyang Xiong, Jihang Zhai, Hao Hao, YuCheng Gu, HaiBo Yang, Shuanhu Gao, Lianrui Hu, Aimin Zhou, Xiao He
https://arxiv.org/abs/2507.01444
A Comparative Study of Decoding Strategies in Medical Text Generation
Oriana Presacan, Alireza Nik, Vajira Thambawita, Bogdan Ionescu, Michael Riegler
https://arxiv.org/abs/2508.13580
Versatile and Generalizable Manipulation via Goal-Conditioned Reinforcement Learning with Grounded Object Detection
Huiyi Wang, Fahim Shahriar, Alireza Azimi, Gautham Vasan, Rupam Mahmood, Colin Bellinger
https://arxiv.org/abs/2507.10814
SafeCOMM: What about Safety Alignment in Fine-Tuned Telecom Large Language Models?
Aladin Djuhera, Swanand Ravindra Kadhe, Farhan Ahmed, Syed Zawad, Holger Boche, Walid Saad
https://arxiv.org/abs/2506.00062
SAEL: Leveraging Large Language Models with Adaptive Mixture-of-Experts for Smart Contract Vulnerability Detection
Lei Yu, Shiqi Cheng, Zhirong Huang, Jingyuan Zhang, Chenjie Shen, Junyi Lu, Li Yang, Fengjun Zhang, Jiajia Ma
https://arxiv.org/abs/2507.22371
An instance of FreeCHR with refined operational semantics
Sascha Rechenberger, Thom Frühwirth
https://arxiv.org/abs/2505.22155 https://
On The Role of Pretrained Language Models in General-Purpose Text Embeddings: A Survey
Meishan Zhang, Xin Zhang, Xinping Zhao, Shouzheng Huang, Baotian Hu, Min Zhang
https://arxiv.org/abs/2507.20783
This https://arxiv.org/abs/2410.21801 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csIR_…
The Case for Instance-Optimized LLMs in OLAP Databases
Bardia Mohammadi, Laurent Bindschaedler
https://arxiv.org/abs/2507.04967 https://
PyVeritas: On Verifying Python via LLM-Based Transpilation and Bounded Model Checking for C
Pedro Orvalho, Marta Kwiatkowska
https://arxiv.org/abs/2508.08171 https://
This https://arxiv.org/abs/2505.07453 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csAI_…
This https://arxiv.org/abs/2505.21652 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csRO_…
RooseBERT: A New Deal For Political Language Modelling
Deborah Dore, Elena Cabrio, Serena Villata
https://arxiv.org/abs/2508.03250 https://arxiv.org/pdf/25…
Thread and Memory-Safe Programming with CLASS
Luís Caires (Instituto Superior Técnico)
https://arxiv.org/abs/2505.20848 https://
Long-Context Modeling Networks for Monaural Speech Enhancement: A Comparative Study
Qiquan Zhang, Moran Chen, Zeyang Song, Hexin Liu, Xiangyu Zhang, Haizhou Li
https://arxiv.org/abs/2507.04368
A Careful Examination of Large Behavior Models for Multitask Dexterous Manipulation
TRI LBM Team, Jose Barreiros, Andrew Beaulieu, Aditya Bhat, Rick Cory, Eric Cousineau, Hongkai Dai, Ching-Hsin Fang, Kunimatsu Hashimoto, Muhammad Zubair Irshad, Masha Itkina, Naveen Kuppuswamy, Kuan-Hui Lee, Katherine Liu, Dale McConachie, Ian McMahon, Haruki Nishimura, Calder Phillips-Grafflin, Charles Richter, Paarth Shah, Krishnan Srinivasan, Blake Wulfe, Chen Xu, Mengchao Zhang, Alex Alspach, Maya …
Reasoning on a Budget: A Survey of Adaptive and Controllable Test-Time Compute in LLMs
Mohammad Ali Alomrani, Yingxue Zhang, Derek Li, Qianyi Sun, Soumyasundar Pal, Zhanguang Zhang, Yaochen Hu, Rohan Deepak Ajwani, Antonios Valkanas, Raika Karimi, Peng Cheng, Yunzhou Wang, Pengyi Liao, Hanrui Huang, Bin Wang, Jianye Hao, Mark Coates
https://
This https://arxiv.org/abs/2506.06266 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCL_…
Open Scene Graphs for Open-World Object-Goal Navigation
Joel Loo, Zhanxin Wu, David Hsu
https://arxiv.org/abs/2508.04678 https://arxiv.org/pdf/2508.04678…
Hop, Skip, and Overthink: Diagnosing Why Reasoning Models Fumble during Multi-Hop Analysis
Anushka Yadav, Isha Nalawade, Srujana Pillarichety, Yashwanth Babu, Reshmi Ghosh, Samyadeep Basu, Wenlong Zhao, Ali Nasaeh, Sriram Balasubramanian, Soundararajan Srinivasan
https://arxiv.org/abs/2508.04699
Text-to-SQL Task-oriented Dialogue Ontology Construction
Renato Vukovic, Carel van Niekerk, Michael Heck, Benjamin Ruppik, Hsien-Chin Lin, Shutong Feng, Nurul Lubis, Milica Gasic
https://arxiv.org/abs/2507.23358
MUST-RAG: MUSical Text Question Answering with Retrieval Augmented Generation
Daeyong Kwon, SeungHeon Doh, Juhan Nam
https://arxiv.org/abs/2507.23334 https://
MCIF: Multimodal Crosslingual Instruction-Following Benchmark from Scientific Talks
Sara Papi, Maike Züfle, Marco Gaido, Beatrice Savoldi, Danni Liu, Ioannis Douros, Luisa Bentivogli, Jan Niehues
https://arxiv.org/abs/2507.19634