Infini-gram mini: Exact n-gram Search at the Internet Scale with FM-Index
Hao Xu, Jiacheng Liu, Yejin Choi, Noah A. Smith, Hannaneh Hajishirzi
https://arxiv.org/abs/2506.12229
Acting and Planning with Hierarchical Operational Models on a Mobile Robot: A Study with RAE+UPOM
Oscar Lima, Marc Vinci, Sunandita Patra, Sebastian Stock, Joachim Hertzberg, Martin Atzmueller, Malik Ghallab, Dana Nau, Paolo Traverso
https://arxiv.org/abs/2507.11345
Despite what many purchasing departments think, software developers are not fungible assets.
https://www.linkedin.com/posts/alsutton_im
Unreal is all you need: Multimodal ISAC Data Simulation with Only One Engine
Kongwu Huang, Shiyi Mu, Jun Jiang, Yuan Gao, Shugong Xu
https://arxiv.org/abs/2507.08716
So to summarize this whole adventure:
1. We spent a good 45 minutes getting an answer that we probably could have gotten in 5 minutes in the 2010s, or in maybe 1-2 hours in the 1990s.
2. The time investment wasn't a total waste, as we learned a lot along the way that we wouldn't have in the 2010s. Most relevant is the wide range of variation (e.g., a 2x factor depending on fiber intake!).
3. Most of the search engine results were confidently wrong answers that had no relation to reality. We were lucky to get one that had real citations we could start from (but that same article included the bogus 4.91 kcal/gram number). Next time I want to know a random factoid, I might just start on Google Scholar.
4. At least one page we chased citations through had a note at the top about being frozen due to NIH funding issues. The digital commons is under attack on multiple fronts.
All of this is yet another reason not to support the big LLM companies.
#AI
Simulation of surface x-ray emission from the ASTERICS ECR ion source
Thomas Thuillier, Andrea Cernuschi, Benjamin Cheymol
https://arxiv.org/abs/2507.06074
The Variable Radio Emission of V830 Tau and Its Putative Planet
Rachel A. Osten, Scott J. Wolk
https://arxiv.org/abs/2509.05082 https://arxiv.org/pdf/2509.…
AI-SearchPlanner: Modular Agentic Search via Pareto-Optimal Multi-Objective Reinforcement Learning
Lang Mei, Zhihan Yang, Chong Chen
https://arxiv.org/abs/2508.20368
SERP Interference Network and Its Applications in Search Advertising
Purak Jain, Sandeep Appala
https://arxiv.org/abs/2506.21598
TURA: Tool-Augmented Unified Retrieval Agent for AI Search
Zhejun Zhao, Yuehu Dong, Alley Liu, Lixue Zheng, Pingsheng Liu, Dongdong Shen, Long Xia, Jiashu Zhao, Dawei Yin
https://arxiv.org/abs/2508.04604