
2025-07-02 11:36:18
Looks like EU funding of #FLOSS will continue: "Open digital infrastructure, from search and identity to cloud and software governance, is now a strategic European priority."
Looks like EU funding of #FLOSS will continue: "Open digital infrastructure, from search and identity to cloud and software governance, is now a strategic European priority."
Series D, Episode 01 - Rescue
DORIAN: Exactly.
AVON: You really are insane, aren't you?
DORIAN: By now I probably would be.
AVON: If it wasn't for this mysterious room.
DORIAN: And what it contains.
SOOLIN: [Enters] And what might that be, Dorian?
https://blake.torpidity.net/m/401/467…
This https://arxiv.org/abs/2505.23576 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csRO_…
"Comparative review on Primo Research Assistant, Scopus AI, Web of Science Research Assistant and a explainer for AI search for librarians" by Aaron Tay: https://musingsaboutlibrarianship.blogspot.com/2025/05/comparative-review-…
When LLMs Disagree: Diagnosing Relevance Filtering Bias and Retrieval Divergence in SDG Search
William A. Ingram, Bipasha Banerjee, Edward A. Fox
https://arxiv.org/abs/2507.02139 …
Check out my new “Media Literacy Roundup: 2 August 2025” newsletter on SubStack!
https://open.substack.com/pub/wfryer/p/media-literacy-roundup-2-august-2025?r=d4phj&utm_campaign=post&u…
Digital Collections Explorer: An Open-Source, Multimodal Viewer for Searching Digital Collections
Ying-Hsiang Huang, Benjamin Charles Germain Lee
https://arxiv.org/abs/2507.00961 …
Baidu plans to open-source its Ernie LLM on June 30; some say this could cement China's AI leadership, while others doubt it will be a "DeepSeek moment" (Kevin Williams/CNBC)
https://www.cnbc.com/2025/06/29/china-bigg
> As the Court weighs how to restore competition in the search market, Mozilla is asking it to seriously consider the unintended consequences of some of the proposed remedies, which, if adopted, could harm browser competition, weaken user choice and undermine the open web.
This should be called a Mozilla syndrome!
Modern search demands scalable personalisation. Join Piotr Kobziakowski
at this year's Berlin Buzzwords to discover how Vespa's multi-stage ranking and tensor framework can be used for hybrid queries, multimodal retrieval, and real-time machine learning. Learn how to deploy low-latency, high-relevance search systems at petabyte scale.
Learn more:
Can a Dark Inferno Melt Earth's Core?
Christopher Cappiello, Tansu Daylan
https://arxiv.org/abs/2505.24070 https://arxiv.org/pdf/…
🇺🇦 #NowPlaying on KEXP's #MiddayShow
Iggy and The Stooges:
🎵 Search and Destroy
#IggyandTheStooges
https://bastardon.bandcamp.com/track/search-and-destroy-iggy-and-the-stooges
https://open.spotify.com/track/00sydAz6PeOxYzwG1dRIPi
We're pleased to share that Search Guard is a Silver Partner and the sponsor of our annual Get-Together!
Learn more about Search Guard: https://search-guard.com/
Join us on Monday 16 June for an evening of food, drinks and networking, generously sponsored by Search Guard. It's the perf…
This https://arxiv.org/abs/2505.19253 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csIR_…
TokenSmith: Streamlining Data Editing, Search, and Inspection for Large-Scale Language Model Training and Interpretability
Mohammad Aflah Khan, Ameya Godbole, Johnny Tian-Zheng Wei, Ryan Wang, James Flemings, Krishna Gummadi, Willie Neiswanger, Robin Jia
https://arxiv.org/abs/2507.19419
Als Nicht-Entwickler kann man auch zu Open-Source Projekten beitragen.
Ich bin happy, daß ich für #n8n, einem no-code Prozess-Automatisierungstool mein zweites Template beitragen konnte:
The UK CMA says it has provisionally found Google meets the legal tests to designate it with "strategic market status" in general search and search advertising (Financial Times)
https://www.ft.com/content/26aa105f-fabb-4061-bd6d-fd13ba94f691
Searching for radio pulsars in old open clusters from the Parkes archive
S. B. Zhang, J. J. Wei, X. Yang, S. Dai, J. S. Wang, L. Toomey, S. Q. Wang, G. Hobbs, X. F. Wu, L. Staveley-Smith
https://arxiv.org/abs/2506.19236
I’ve been using Kagi for search for a while and using the Orion browser ever since Arc was shutdown. Its got some flakiness and not all the features of Arc (mainly missing the ability to sync my pinned pages and what I have open) https://mastodon.social/@kagihq/113074391235397771
First use of large area SiPM matrices coupled with NaI(Tl) scintillating crystal for low energy dark matter search
Edoardo Martinenghi, Valerio Toso, Fabrizio Bruno Armani, Andrea Castoldi, Giuseppe di Carlo, Luca Frontini, Niccol\`o Gallice, Chiara Guazzoni, Valentino Liberali, Alberto Stabile, Valeria Trabattoni, Andrea Zani, Davide D'Angelo
https://
Series B, Episode 01 - Redemption
AVON: Don't worry. At the right time, I will remind you of it.
[Flight deck]
VILA: [Accepts cup from Jenna and swallows pill] Thanks.
JENNA: [To Blake who has just entered flight deck with Avon] You all right?
https://blake.torpidity.net/m/201/242
Mixture of Encoders is a vector-native alternative that models both structured and unstructured data in a unified embedding space. Join Filip Makraduli as he introduces the method, demonstrates how it powers natural language search and real-time recommendations, and shares open-source tools and benchmarks for replacing complex hybrid stacks.
Learn more:
🇺🇦 #NowPlaying on #BBC6Music
Peaches:
🎵 Search And Destroy
#Peaches
https://open.spotify.com/track/0tHUdwmRvao8Hd0EV9WFSW
Time for another "review". This one's hard. While the book was quite interesting, it required me to be quite open-minded. Still, I think it's worth mentioning:
Robert Wright — Nonzero: The Logic of Human Destiny
The book basically focused on a thesis that both biological evolution and cultural evolution are a thing, they are directional and this directionality can be explained together using game theory — as eventually leading to more non-zero sum games.
It consists of three chapters. The first one is is focused on the history of civilization. It features many examples from different parts of the world, which makes it quite interesting. The author argues that the culture inevitably is evolving as information processing techniques improve — from writing to the Internet.
The second chapter is focused on biological evolution. Now, the argument is that it's not quite random, but actually directed towards greater complexity — eventually leading to the development of highly intelligent species, and a civilization.
The third chapter is quite speculative and metaphysical, and I'm just going to skip it.
The book is full of optimism. Capitalism creates freedom — because people are more productive when they're working for their own gain, so the free market eliminates slavery. Globalisation creates networks of interdependence that make wars uneconomic. Increased contacts between different cultures makes people more tolerant. And eventually, the humanity may be able to unite facing a common "external" enemy — the climate change.
What can I say? The examples are quite interesting, the whole theory seems self-consistent. Still, I repeatedly looked at the publication date (it's 1999), and wondered if author would write the same thing today (yes, I know I can search for his current opinions).
#books #bookstodon @…
Indie open gaming marketplace Itch.io abruptly deindexes NSFW content from its browse and search pages after payment processors raised concerns, following Steam (Jess Weatherbed/The Verge)
https://www.theverge.com/news/712890/itch-removes-adult-nsfw…
Q: how do you search a Matrix room?
A: you open my IRC logs
Deliberative Searcher: Improving LLM Reliability via Reinforcement Learning with constraints
Zhenyun Yin, Shujie Wang, Xuhong Wang, Xingjun Ma, Yinchun Wang
https://arxiv.org/abs/2507.16727
Series B, Episode 06 - Trial
ZIL: To be alone must not be feared. The Host is slow to recognize one who is alone. Though there are many, all stay alone. [Clears off a patch of ground] Do you hunger? [Tears the ground open, scoops up some of the lining of the opening and eats it.] Do you hunger?
BLAKE: What is it?
https://
Heh, got pitched based on my Quamina open-source GitHub repo on behalf of “Fabinvest” which turns out to be a PE fund owned by Qatari royal Jassim Al-Thani. A polite and restrained pitch, not pushy or scammy.
There’s too much money in the world and its owners are increasingly desperate in their search for a place to invest it. I suppose prospecting popular GitHub repos is less crazy than other things that I see going on.
SustainDiffusion: Optimising the Social and Environmental Sustainability of Stable Diffusion Models
Giordano d'Aloisio, Tosin Fadahunsi, Jay Choy, Rebecca Moussa, Federica Sarro
https://arxiv.org/abs/2507.15663
Monocular Vision-Based Swarm Robot Localization Using Equilateral Triangular Formations
Taewon Kang, Ji-Wook Kwon, Il Bae, Jin Hyo Kim
https://arxiv.org/abs/2507.19100 https://
Series B, Episode 06 - Trial
ZIL: To be alone must not be feared. The Host is slow to recognize one who is alone. Though there are many, all stay alone. [Clears off a patch of ground] Do you hunger? [Tears the ground open, scoops up some of the lining of the opening and eats it.] Do you hunger?
BLAKE: What is it?
https://
This https://arxiv.org/abs/2403.18213 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
🇺🇦 #NowPlaying on #BBC6Music
Peaches:
🎵 Search And Destroy
#Peaches
https://open.spotify.com/track/0tHUdwmRvao8Hd0EV9WFSW
Context-Aware Scientific Knowledge Extraction on Linked Open Data using Large Language Models
Sajratul Y. Rubaiat, Hasan M. Jamil
https://arxiv.org/abs/2506.17580
Announcing sff: A fast, on-the-fly SemanticFileFinder written in Rust! 🦀
It scans a directory (like your notes or a repo), finds the most semantically relevant text chunks for your query, and lets you open the file in a text editor of your choice.
No vector DBs, no GPU needed. Indexes ~2500 files with 10k chunks in 250ms on a CPU.
Perfect for searching Obsidian vaults, codebases, and more.
𝚌𝚊𝚛𝚐𝚘 𝚒𝚗𝚜𝚝𝚊𝚕𝚕 𝚜𝚏𝚏
𝚜𝚏𝚏 "𝚠𝚘𝚛𝚔𝚒𝚗𝚐 𝚠𝚒𝚝𝚑 𝚐𝚒𝚝"
Series B, Episode 04 - Horizon
SELMA: Thank you.
RO: You saved my life. While I live, you'll be welcome on Horizon.
BLAKE: Still Horizon?
RO: You can't return to the past.
BLAKE: [Into bracelet] Liberator, teleport now. [To Ro and Selma] Good-bye.
https://blake.torpidity.net/m/204/633
The Pandora's Box Problem with Sequential Inspections
Ali Aouad, Jingwei Ji, Yaron Shaposhnik
https://arxiv.org/abs/2507.07508 https://
Series B, Episode 04 - Horizon
SELMA: Thank you.
RO: You saved my life. While I live, you'll be welcome on Horizon.
BLAKE: Still Horizon?
RO: You can't return to the past.
BLAKE: [Into bracelet] Liberator, teleport now. [To Ro and Selma] Good-bye.
https://blake.torpidity.net/m/204/633
Tree-Structured Parzen Estimator Can Solve Black-Box Combinatorial Optimization More Efficiently
Kenshin Abe, Yunzhuo Wang, Shuhei Watanabe
https://arxiv.org/abs/2507.08053 https://arxiv.org/pdf/2507.08053 https://arxiv.org/html/2507.08053
arXiv:2507.08053v1 Announce Type: new
Abstract: Tree-structured Parzen estimator (TPE) is a versatile hyperparameter optimization (HPO) method supported by popular HPO tools. Since these HPO tools have been developed in line with the trend of deep learning (DL), the problem setups often used in the DL domain have been discussed for TPE such as multi-objective optimization and multi-fidelity optimization. However, the practical applications of HPO are not limited to DL, and black-box combinatorial optimization is actively utilized in some domains, e.g., chemistry and biology. As combinatorial optimization has been an untouched, yet very important, topic in TPE, we propose an efficient combinatorial optimization algorithm for TPE. In this paper, we first generalize the categorical kernel with the numerical kernel in TPE, enabling us to introduce a distance structure to the categorical kernel. Then we discuss modifications for the newly developed kernel to handle a large combinatorial search space. These modifications reduce the time complexity of the kernel calculation with respect to the size of a combinatorial search space. In the experiments using synthetic problems, we verified that our proposed method identifies better solutions with fewer evaluations than the original TPE. Our algorithm is available in Optuna, an open-source framework for HPO.
toXiv_bot_toot
We're thrilled to announce that @… has rejoined Berlin Buzzwords as a Platinum Partner!
Learn more about OpenSearch: https://opensearch.org/
Beyond the Buzz: A Pragmatic Take on Inference Disaggregation
Tiyasa Mitra, Ritika Borkar, Nidhi Bhatia, Ramon Matas, Shivam Raj, Dheevatsa Mudigere, Ritchie Zhao, Maximilian Golub, Arpan Dutta, Sailaja Madduri, Dharmesh Jani, Brian Pharris, Bita Darvish Rouhani
https://arxiv.org/abs/2506.05508
This https://arxiv.org/abs/2211.15412 has been replaced.
link: https://scholar.google.com/scholar?q=a
SPOT: Bridging Natural Language and Geospatial Search for Investigative Journalists
Lynn Khellaf, Ipek Baris Schlicht, Tilman Mirass, Julia Bayer, Tilman Wagner, Ruben Bouwmeester
https://arxiv.org/abs/2506.13188
Deep Learning and Model Independence
Martin King
https://arxiv.org/abs/2507.03438 https://arxiv.org/pdf/2507.03438
Implementation of full and simplified likelihoods in CheckMATE
I\~naki Lara, Krzysztof Rolbiecki
https://arxiv.org/abs/2507.08565 https://
A huge thank you to Qdrant for sponsoring the coffee breaks at Berlin Buzzwords! We're thrilled to have their support in keeping everyone energised and connected throughout the conference.
Learn more about Qdrant here: https://qdrant.tech/
#Blakes7 Series B, Episode 10 - Voice from the Past
AVON: [Restrains Blake] Blake.
BLAKE: Renounce!
AVON: Easy, easy!
BLAKE: Renounce!
AVON: [To Cally] Tranquilizer pack!
BLAKE: Renounce! [Cally administers the tranquilizer pack. Blake subsides]
To conclude the first evening of Berlin Buzzwords, Gregor Bransky invites you to join a tour of c-base. Afterwards, you can unwind at one of its recreational areas, enjoying a refreshing beverage by the waterside of the Spree.
A travel group will form during the Get-Together.
📅 When: Today – 7 pm
📍 Where: c-base, Rungestraße 20 | 10179 Berlin
Learn more:
🇺🇦 #NowPlaying on #BBC6Music's #CraigCharles
Jon Lucien:
🎵 Search For The Inner Self
#JonLucien
https://open.spotify.com/track/33nn9tbsuT7tggeViAPeDH
To conclude the first evening of Berlin Buzzwords, Gregor Bransky invites you to join a tour of c-base. Afterwards, you can unwind at one of its recreational areas, enjoying a refreshing beverage by the waterside of the Spree.
A travel group will form during the Get-Together.
📅 When: June 17, 2025 – 7 pm
📍 Where: c-base, Rungestraße 20 | 10179 Berlin
Learn more:
Series B, Episode 09 - Countdown
AVON: Approximately six million. It was colonized in the last century of the Old Calendar. At first they resisted political affiliation, but then they joined the Federation, and they have remained unswervingly loyal.
VILA: Then they're not likely to welcome us with open arms.
https://
Series D, Episode 01 - Rescue
TARRANT: He will if Orac's working. Now come on. We're wasting time. [starts to climb]
[Dayna does not follow, but continues to explore the small room. A hatch slides open in the floor.]
DAYNA: I knew it. Tarrant.
https://blake.torpidity.net/m/401/422
Apache Solr 9.8 introduces the LLM module, opening the doors to end-to-end natural language query support through vector-backed semantic search (K Nearest Neighbors). At Berlin Buzzwords 2025, Alessandro Benedetti discussed the open-source contributions from both an indexing and query perspective, as well as what's next for Solr in terms of Large Language Model integration.
Watch the full session here:
Want to make the most of your time around Berlin Buzzwords?
Check out our Satellite Events page to find out about other interesting meet-ups and conferences happening around #bbuzz: https://2025.berlinbuzzwords.de/sate…