
2025-05-31 02:00:46
Great stuff! I can't recommend enough!
OK, I'll try, but it's like I'm trying to slam dunk a basketball, when I can't jump 6" off the ground...
You'll love his books. They're great. Life-lessons. Tragedy. Comedy. And they're a good deal. Did I mention cheap?
#ShortFiction #ProsePoetry
Predicting Microbial Ontology and Pathogen Risk from Environmental Metadata with Large Language Models
Hyunwoo Yoo, Gail L. Rosen
https://arxiv.org/abs/2507.21980 https://
Verisimilitude as Boon and Bane: How People Initiate Opportunistic Interactions at Professional Events in Social VR
Victoria Chang, Caro Williams-Pierce, Huaishu Peng, Ge Gao
https://arxiv.org/abs/2507.22241
from my link log —
Firefox's optimized zip format: reading zip files really quickly.
https://taras.glek.net/posts/optimized-zip-format/
saved 2025-07-03
@…
Overheard: "Pidgey is beak-deep in Mark Doyon's tragicomic new novel, 'Deep Fried.' Be like Pidgey."
Available now!
https://www.DeepFried.store/
SciMantify -- A Hybrid Approach for the Evolving Semantification of Scientific Knowledge
Lena John, Kheir Eddine Farfar, Sören Auer, Oliver Karras
https://arxiv.org/abs/2506.21819
“FOR REVIEW: Address formats around the world”
https://www.w3.org/blog/International/2025/08/06/for-review-address-formats-around-the-world/
The doc to review:
Enriching Object-Centric Event Data with Process Scopes: A Framework for Aggregation and Analysis
Shahrzad Khayatbashi, Majid Rafiei, Jiayuan Chen, Timotheus Kampik, Gregor Berg, Amin Jalali
https://arxiv.org/abs/2508.18830
CNN is expanding its podcast business by rebranding CNN Audio to CNN Podcasts and plans to introduce new talent and formats, including more video podcasts (Alex Weprin/The Hollywood Reporter)
https://www.hollywoodreporter.com/business
@… @… @… the library already checks for expected fail on those formats 😅
"New File Format Research and Documentation on the Sustainability of Digital Formats" | The Signal https://blogs.loc.gov/thesignal/2025/06/new-file-format-research/
A Technical Review on Comparison and Estimation of Steganographic Tools
Ms. Preeti P. Bhatt, Rakesh R. Savant
https://arxiv.org/abs/2508.19323 https://arxi…
Binsparse: A Specification for Cross-Platform Storage of Sparse Matrices and Tensors
Benjamin Brock, Willow Ahrens, Hameer Abbasi, Timothy A. Davis, Juni Kim, James Kitchen, Spencer Patty, Isaac Virshup, Erik Welch
https://arxiv.org/abs/2506.19175
Three small announcements:
1. RFC 9839, a guide to which Unicode characters you should never use: https://www.rfc-editor.org/rfc/rfc9839.html
2. Blog piece with background and context, “RFC 9839 and Bad Unicode”:
Excellent article by Ibrahim Diallo that serves as a quick overview of image formats for the web! ^^
What Learning React Won't Teach You: Image Formats
https://idiallo.com/blog/react-and-image-format?utm_source=tldrwebdev
Lyft ramps up its ad business with three new ad formats; CEO David Risher said that Lyft is on track to reach a $100M annualized ad revenue run rate by 2025 end (Kerry Flynn/Axios)
https://www.axios.com/2025/06/12/lyft-ads-sponsored-rides
Most of the time I like widescreen and ultra-widescreen, but when I'm just puttering around on a computer, I find 5:4/4:3 oddly cozy. Incidentally, it works really well for browsing the modern web, because images fit pretty well in both square and vertical formats while wider formats still look ok.
Yes, I'm coming at you live in 1280x1024.
Recommendations to overcome language barriers in the Vera C. Rubin Observatory Research Ecosystem
José Antonio Alonso Pavón, Andrés Alejandro Plazas Malagón
https://arxiv.org/abs/2507.18682
Efficient Mixed-Precision Large Language Model Inference with TurboMind
Li Zhang, Youhe Jiang, Guoliang He, Xin Chen, Han Lv, Qian Yao, Fangcheng Fu, Kai Chen
https://arxiv.org/abs/2508.15601
Smoothness Meets Autobidding: Tight Price of Anarchy Bounds for Simultaneous First-Price Auctions
Riccardo Colini-Baldeschi, Sophie Klumper, Twan Kroll, Stefano Leonardi, Guido Schäfer, Artem Tsikiridis
https://arxiv.org/abs/2506.20908
Bangla-Bayanno: A 52K-Pair Bengali Visual Question Answering Dataset with LLM-Assisted Translation Refinement
Mohammed Rakibul Hasan, Rafi Majid, Ahanaf Tahmid
https://arxiv.org/abs/2508.19887
🏗️ Supports distributable workers, multiple output formats & pluggable architecture for maximum flexibility
🔒 Execution without root privileges using #runc or #crun backends with #containerd
TreeReader: A Hierarchical Academic Paper Reader Powered by Language Models
Zijian Zhang, Pan Chen, Fangshi Du, Runlong Ye, Oliver Huang, Michael Liut, Alán Aspuru-Guzik
https://arxiv.org/abs/2507.18945
"The public broadcaster has recently drawn attention again and again for one-sided reporting on agriculture, specifically in favor of the farmers' association and right-leaning farmers. The high point so far is the second episode of the format 'Klar', co-produced with BR. It can be interpreted as advertising for the AfD." #ÖRR
WGRAMMAR: Leverage Prior Knowledge to Accelerate Structured Decoding
Ran Wang, Xiaoxuan Liu, Hao Ren, Gang Chen, Fanchao Qi, Maosong Sun
https://arxiv.org/abs/2507.16768
A Unified Transformer Architecture for Low-Latency and Scalable Wireless Signal Processing
Yuto Kawai, Rajeev Koodli
https://arxiv.org/abs/2508.17960 https://
Exactly 30 years ago, a small piece of technology history was written that would change how we consume music forever! On July 14, 1995, researchers at the Fraunhofer Institute settled on the file extension ".mp3". 🎧
Read the article: https://heise.de/-1…
Comparison of FTN-NOFDM and PCS-OFDM for Long-Haul Coherent Optical Communications
Haide Wang, Ji Zhou, Yongcheng Li, Weiping Liu, Changyuan Yu, Xiangjun Xin, Liangchuan Li
https://arxiv.org/abs/2508.17350
Generating Actionable Robot Knowledge Bases by Combining 3D Scene Graphs with Robot Ontologies
Giang Nguyen, Mihai Pomarlan, Sascha Jongebloed, Nils Leusmann, Minh Nhat Vu, Michael Beetz
https://arxiv.org/abs/2507.11770
A User Manual for cuHALLaR: A GPU Accelerated Low-Rank Semidefinite Programming Solver
Jacob Aguirre, Diego Cifuentes, Vincent Guigues, Renato D. C. Monteiro, Victor Hugo Nascimento, Arnesh Sujanani
https://arxiv.org/abs/2508.15951
High-Frequency First: A Two-Stage Approach for Improving Image INR
Sumit Kumar Dam, Mrityunjoy Gain, Eui-Nam Huh, Choong Seon Hong
https://arxiv.org/abs/2508.15582 https://
Evaluating recognition and recall formats of social network surveys in physics education research
Meagan Sundstrom, Justin Gambrell, Adrienne L. Traxler, Eric Brewe
https://arxiv.org/abs/2508.08417
@… that one looks decent! It’s about 1MB on disk but looks like it doesn’t support week date or day of year formats either
VulGuard: An Unified Tool for Evaluating Just-In-Time Vulnerability Prediction Models
Duong Nguyen, Manh Tran-Duc, Thanh Le-Cong, Triet Huynh Minh Le, M. Ali Babar, Quyet-Thang Huynh
https://arxiv.org/abs/2507.16685
Systematic Characterization of LLM Quantization: A Performance, Energy, and Quality Perspective
Tianyao Shi, Yi Ding
https://arxiv.org/abs/2508.16712 https://
LettinGo: Explore User Profile Generation for Recommendation System
Lu Wang, Di Zhang, Fangkai Yang, Pu Zhao, Jianfeng Liu, Yuefeng Zhan, Hao Sun, Qingwei Lin, Weiwei Deng, Dongmei Zhang, Feng Sun, Qi Zhang
https://arxiv.org/abs/2506.18309
Jack Unit: An Area- and Energy-Efficient Multiply-Accumulate (MAC) Unit Supporting Diverse Data Formats
Seock-Hwan Noh, Sungju Kim, Seohyun Kim, Daehoon Kim, Jaeha Kung, Yeseong Kim
https://arxiv.org/abs/2507.04772
This is interesting. Dream Theater will publish a graphic novel inspired by their most recent album «Parasomnia», with 8 stories, one for each song.
https://www.loudersound.com/bands-artists/dream-theater…
It boggles my mind how it is just impossible to pay for a digitally delivered version of AmigaOS 3.2.X so I can use it in an emulator or on my PiStorm. The sold formats are physical roms, cd and compact flash (with weird restrictions on how to buy the latter). All massive detours for my purposes. I guess it’s 3.1 or 3.9. #amiga
Just used this to shoot a patch scene that will go into an episode with no other added foley/ADR
iPhone 16: Edit Spatial Audio in Videos With Audio Mix - MacRumors https://www.macrumors.com/how-to/iphone-16-edit-spatial-audio-in-video-audio-mix/…
HybHuff: Lossless Compression for Hypergraphs via Entropy-Guided Huffman-Bitwise Coordination
Tianyu Zhao, Dongfang Zhao, Luanzheng Guo, Nathan Tallent
https://arxiv.org/abs/2506.15844
Collaborative Texture Filtering
Tomas Akenine-Möller, Pontus Ebelin, Matt Pharr, Bartlomiej Wronski
https://arxiv.org/abs/2506.17770 https://…
🔄 Key features include automatic garbage collection, extendable frontend formats & concurrent dependency resolution
⚡ Efficient instruction caching with build cache import/export capabilities & nested build job invocations
PickleBall: Secure Deserialization of Pickle-based Machine Learning Models
Andreas D. Kellas, Neophytos Christou, Wenxin Jiang, Penghui Li, Laurent Simon, Yaniv David, Vasileios P. Kemerlis, James C. Davis, Junfeng Yang
https://arxiv.org/abs/2508.15987
Music and Artificial Intelligence: Artistic Trends
Jordi Pons, Zack Zukowski, Julian D. Parker, CJ Carr, Josiah Taylor, Zach Evans
https://arxiv.org/abs/2508.11694 https://
👀 #QGIS vs #Arcgis from an industry perspective https://www.wigeogis.com/en/arcgis_vs_qgis
A workflow for generating synthetic LiDAR datasets in simulation environments
Abhishek Phadke, Shakib Mahmud Dipto, Pratip Rana
https://arxiv.org/abs/2506.17378
AraTable: Benchmarking LLMs' Reasoning and Understanding of Arabic Tabular Data
Rana Alshaikh, Israa Alghanmi, Shelan Jeawak
https://arxiv.org/abs/2507.18442 https://…
The {esquisse} package makes it easy to plot your data in different ways with a drag and drop interface: #rstats
I wouldn't have thought that quantitative analyses of retrospective national bibliographies would be that painful: data access via SRU, OAI, and REST API; another resource has a JSON dump, another one again consists of various ttl for which you have to set up your own sparql endpoint. And I've not even arrived at formats, metadata standards and cataloguing peculiarities 🤯 so everything's #FAIR
LLM-based Agents for Automated Confounder Discovery and Subgroup Analysis in Causal Inference
Po-Han Lee, Yu-Cheng Lin, Chan-Tung Ku, Chan Hsu, Pei-Cing Huang, Ping-Hsun Wu, Yihuang Kang
https://arxiv.org/abs/2508.07221
Stereo Sound Event Localization and Detection with Onscreen/offscreen Classification
Kazuki Shimada, Archontis Politis, Iran R. Roman, Parthasaarathy Sudarsanam, David Diaz-Guerra, Ruchi Pandey, Kengo Uchida, Yuichiro Koyama, Naoya Takahashi, Takashi Shibuya, Shusuke Takahashi, Tuomas Virtanen, Yuki Mitsufuji
https://arxiv.org/a…
STPFormer: A State-of-the-Art Pattern-Aware Spatio-Temporal Transformer for Traffic Forecasting
Jiayu Fang, Zhiqi Shao, S T Boris Choy, Junbin Gao
https://arxiv.org/abs/2508.13433
from my link log —
Can You Block It? A simple ad block tester.
https://canyoublockit.com/
saved 2025-08-05 https://dotat.at/:/Q4GFT.ht…
The Making of a Community Dark Matter Dataset with the National Science Data Fabric
Amy Roberts, Jack Marquez, Kin Hong NG, Kitty Mickelson, Aashish Panta, Giorgio Scorzelli, Amy Gooch, Prisca Cushman, Matthew Fritts, Himangshu Neog, Valerio Pascucci, Michela Taufer
https://arxiv.org/abs/2507.13297…
OceanVive: An Immersive Visualization System for Communicating Complex Oceanic Phenomena
Yang Ouyang, Yuchen Wu, Xiyuan Wang, Laixin Xie, Weicong Cheng, Jianping Gan, Quan Li, Xiaojuan Ma
https://arxiv.org/abs/2507.17218
summer reading recommendations from the newsletter of @…:
Summer 2025 Picks for Adults from the New York Public Library:
https://www.nypl.org/books-more/recommend…
Generating Inputs for Grammar Mining using Dynamic Symbolic Execution
Andreas Pointner (University of Applied Sciences Upper Austria, Austria), Josef Pichler (University of Applied Sciences Upper Austria, Austria), Herbert Prähofer (Johannes Kepler University Linz, Austria)
https://arxiv.org/abs/2508.03832
A Comparative Study of Delta Parquet, Iceberg, and Hudi for Automotive Data Engineering Use Cases
Dinesh Eswararaj, Ajay Babu Nellipudi, Vandana Kollati
https://arxiv.org/abs/2508.13396
Leveraging Hardware-Aware Computation in Mixed-Precision Matrix Multiply: A Tile-Centric Approach
Qiao Zhang, Rabab Alomairy, Dali Wang, Zhuowei Gu, Qinglei Cao
https://arxiv.org/abs/2508.14848
Revisiting Prompt Engineering: A Comprehensive Evaluation for LLM-based Personalized Recommendation
Genki Kusano, Kosuke Akimoto, Kunihiro Takeoka
https://arxiv.org/abs/2507.13525
POLARON: Precision-aware On-device Learning and Adaptive Runtime-cONfigurable AI acceleration
Mukul Lokhande, Santosh Kumar Vishvakarma
https://arxiv.org/abs/2506.08785
Survey of Surveys. II. Stellar parameters for 23 millions of stars
A. Turchi, E. Pancino, A. Avdeeva, F. Rossi, M. Tsantaki, P. M. Marrese, S. Marinoni, N. Sanna, G. Fanari, D. Alvarez Garay, M. Echeveste, S. Nedhath, S. Rani, E. Reggiani, S. Saracino, L. Steinbauer, G. Thomas, F. Gran, G. Guiglion
https://arxiv.org/abs/2507.059…
MRpro - open PyTorch-based MR reconstruction and processing package
Felix Frederik Zimmermann, Patrick Schuenke, Christoph S. Aigner, Bill A. Bernhardt, Mara Guastini, Johannes Hammacher, Noah Jaitner, Andreas Kofler, Leonid Lunin, Stefan Martin, Catarina Redshaw Kranich, Jakob Schattenfroh, David Schote, Yanglei Wu, Christoph Kolbitsch
https://
CMI-Bench: A Comprehensive Benchmark for Evaluating Music Instruction Following
Yinghao Ma, Siyou Li, Juntao Yu, Emmanouil Benetos, Akira Maezawa
https://arxiv.org/abs/2506.12285 …
FrameShift: Learning to Resize Fuzzer Inputs Without Breaking Them
Harrison Green, Claire Le Goues, Fraser Brown
https://arxiv.org/abs/2507.05421 https://
Jelly: a fast and convenient RDF serialization format
Piotr Sowinski, Karolina Bogacka, Anastasiya Danilenka, Nikita Kozlov
https://arxiv.org/abs/2506.11298
Design and Implementation of an OCR-Powered Pipeline for Table Extraction from Invoices
Parshva Dhilankumar Patel
https://arxiv.org/abs/2507.07029 https://…
UK satellite TV channel Islam Channel acquires politics magazine Tribune, plans to increase its print frequency, and launch new formats like podcasts and video (Rob Waugh/Press Gazette)
https://pressgazette.co.uk/publishers/magazines/islam-channel-tribune/…
ARPaCCino: An Agentic-RAG for Policy as Code Compliance
Francesco Romeo, Luigi Arena, Francesco Blefari, Francesco Aurelio Pironti, Matteo Lupinacci, Angelo Furfaro
https://arxiv.org/abs/2507.10584
NoteIt: A System Converting Instructional Videos to Interactable Notes Through Multimodal Video Understanding
Running Zhao, Zhihan Jiang, Xinchen Zhang, Chirui Chang, Handi Chen, Weipeng Deng, Luyao Jin, Xiaojuan Qi, Xun Qian, Edith C. H. Ngai
https://arxiv.org/abs/2508.14395
XR-NPE: High-Throughput Mixed-precision SIMD Neural Processing Engine for Extended Reality Perception Workloads
Tejas Chaudhari, Akarsh J., Tanushree Dewangan, Mukul Lokhande, Santosh Kumar Vishvakarma
https://arxiv.org/abs/2508.13049
Toward Efficient SpMV in Sparse LLMs via Block Extraction and Compressed Storage
Junqing Lin, Jingwei Sun, Mingge Lu, Guangzhong Sun
https://arxiv.org/abs/2507.12205
scDataset: Scalable Data Loading for Deep Learning on Large-Scale Single-Cell Omics
Davide D'Ascenzo, Sebastiano Cultrera di Montesano
https://arxiv.org/abs/2506.01883
From Explainable to Explanatory Artificial Intelligence: Toward a New Paradigm for Human-Centered Explanations through Generative AI
Christian Meske, Justin Brenne, Erdi Uenal, Sabahat Oelcer, Ayseguel Doganguen
https://arxiv.org/abs/2508.06352
Advancing Retrieval-Augmented Generation for Structured Enterprise and Internal Data
Chandana Cheerla
https://arxiv.org/abs/2507.12425 https://
Everything You Need to Know About CS Education: Open Results from a Survey of More Than 18,000 Participants
Katsiaryna Dzialets, Aleksandra Makeeva, Ilya Vlasov, Anna Potriasaeva, Aleksei Rostovskii, Yaroslav Golubev, Anastasiia Birillo
https://arxiv.org/abs/2508.05286
A factorisation-based regularised interior point method using the augmented system
Filippo Zanetti, Jacek Gondzio
https://arxiv.org/abs/2508.04370 https://…
LLaVA-RE: Binary Image-Text Relevancy Evaluation with Multimodal Large Language Model
Tao Sun, Oliver Liu, JinJin Li, Lan Ma
https://arxiv.org/abs/2508.05602 https://
LLMLog: Advanced Log Template Generation via LLM-driven Multi-Round Annotation
Fei Teng, Haoyang Li, Lei Chen
https://arxiv.org/abs/2508.09594 https://arxi…
Scaling the memory wall using mixed-precision -- HPG-MxP on an exascale machine
Aditya Kashi, Nicholson Koukpaizan, Hao Lu, Michael Matheson, Sarp Oral, Feiyi Wang
https://arxiv.org/abs/2507.11512
Structured Semantics from Unstructured Notes: Language Model Approaches to EHR-Based Decision Support
Wu Hao Ran, Xi Xi, Furong Li, Jingyi Lu, Jian Jiang, Hui Huang, Yuzhuan Zhang, Shi Li
https://arxiv.org/abs/2506.06340
MExplore: an entity-based visual analytics approach for medical expertise acquisition
Xiao Pang, Yan Huang, Chang Liu, JiYuan Liu, MingYou Liu
https://arxiv.org/abs/2507.12337
The Architecture of Trust: A Framework for AI-Augmented Real Estate Valuation in the Era of Structured Data
Petteri Teikari, Mike Jarrell, Maryam Azh, Harri Pesola
https://arxiv.org/abs/2508.02765
Error Detection and Correction for Interpretable Mathematics in Large Language Models
Yijin Yang, Cristina Cornelio, Mario Leiva, Paulo Shakarian
https://arxiv.org/abs/2508.03500
Echoes of Automation: The Increasing Use of LLMs in Newsmaking
Abolfazl Ansari, Delvin Ce Zhang, Nafis Irtiza Tripto, Dongwon Lee
https://arxiv.org/abs/2508.06445 https://
PRvL: Quantifying the Capabilities and Risks of Large Language Models for PII Redaction
Leon Garza, Anantaa Kotal, Aritran Piplai, Lavanya Elluri, Prajit Das, Aman Chadha
https://arxiv.org/abs/2508.05545
From Legacy to Standard: LLM-Assisted Transformation of Cybersecurity Playbooks into CACAO Format
Mehdi Akbari Gurabi, Lasse Nitz, Radu-Mihai Castravet, Roman Matzutt, Avikarsha Mandal, Stefan Decker
https://arxiv.org/abs/2508.03342
In-person, Online and Back Again -- A Tale of Three Hybrid Hackathons
Abasi-amefon Obot Affia-Jomants, Alexander Serebrenik, James D. Herbsleb, Alexander Nolte
https://arxiv.org/abs/2508.07301
Template-Based Schema Matching of Multi-Layout Tenancy Schedules: A Comparative Study of a Template-Based Hybrid Matcher and the ALITE Full Disjunction Model
Tim Uilkema, Yao Ma, Seyed Sahand Mohammadi Ziabari, Joep van Vliet
https://arxiv.org/abs/2507.02020
This https://arxiv.org/abs/2505.20368 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csIR_…
Marito: Structuring and Building Open Multilingual Terminologies for South African NLP
Vukosi Marivate, Isheanesu Dzingirai, Fiskani Banda, Richard Lastrucci, Thapelo Sindane, Keabetswe Madumo, Kayode Olaleye, Abiodun Modupe, Unarine Netshifhefhe, Herkulaas Combrink, Mohlatlego Nakeng, Matome Ledwaba
https://arxiv.org/abs/2508.03529…
CF-RAG: A Dataset and Method for Carbon Footprint QA Using Retrieval-Augmented Generation
Kaiwen Zhao, Bharathan Balaji, Stephen Lee
https://arxiv.org/abs/2508.03489 https://
Investigating Gender Bias in LLM-Generated Stories via Psychological Stereotypes
Shahed Masoudian, Gustavo Escobedo, Hannah Strauss, Markus Schedl
https://arxiv.org/abs/2508.03292
Capturing and Sharing Know-How through Visual Process Representations: A Human-Centred Approach to Teacher Workflows
Gloria Fernández-Nieto, Vanessa Echeverria, Yuheng Li, Yi-Shan Tsai, Lele Sha, Guanliang Chen, Dragan Gasevic, Zachari Swiecki
https://arxiv.org/abs/2508.04357
Evaluating Structured Output Robustness of Small Language Models for Open Attribute-Value Extraction from Clinical Notes
Nikita Neveditsin, Pawan Lingras, Vijay Mago
https://arxiv.org/abs/2507.01810
MetaExplainer: A Framework to Generate Multi-Type User-Centered Explanations for AI Systems
Shruthi Chari, Oshani Seneviratne, Prithwish Chakraborty, Pablo Meyer, Deborah L. McGuinness
https://arxiv.org/abs/2508.00300