2025-09-11 09:45:53
LLM Ensemble for RAG: Role of Context Length in Zero-Shot Question Answering for BioASQ Challenge
Dima Galat, Diego Molla-Aliod
https://arxiv.org/abs/2509.08596 https://
LLM Ensemble for RAG: Role of Context Length in Zero-Shot Question Answering for BioASQ Challenge
Dima Galat, Diego Molla-Aliod
https://arxiv.org/abs/2509.08596 https://
@… Haha, I appreciate that.
It’s a good question. My mind immediately *wants* to answer all the ones you didn’t ask (Taking guns away from everyone, getting to pick and choose who gets them, etc.)
But the question as (purposefully I’m sure) posed, is a tough one.
Cowboys starter gets angry at reporters over Micah Parson question https://www.sportingnews.com/us/ncaa-football/penn-state/news/cowboys-starter-angry-reporters-over-micah-parson-question/80f3074307cb…
Better Together: Leveraging Unpaired Multimodal Data for Stronger Unimodal Models
Sharut Gupta, Shobhita Sundaram, Chenyu Wang, Stefanie Jegelka, Phillip Isola
https://arxiv.org/abs/2510.08492
Interviews with security researchers about AI's potential for large-scale destruction, as experts remain divided and global regulatory frameworks lag (Stephen Witt/New York Times)
https://www.
Sincere question for the HCI community (and all other international research communities who disseminate research primarily at conferences):
1. When any conference is in the US, will international folks risk coming? I already know prominent folks who say they won't.
2. When any conference is outside the US, will any international students within the states risk going? My PhD student has been advised not to, so I'm giving his talk overseas for him.
How will we all …
High-dimensional Analysis of Synthetic Data Selection
Parham Rezaei, Filip Kovacevic, Francesco Locatello, Marco Mondelli
https://arxiv.org/abs/2510.08123 https://
StaR-KVQA: Structured Reasoning Traces for Implicit-Knowledge Visual Question Answering
Zhihao Wen, Wenkang Wei, Yuan Fang, Xingtong Yu, Hui Zhang, Weicheng Zhu, Xin Zhang
https://arxiv.org/abs/2510.06638
What a cool trivia question from the Lord of the Rings...
▶️ There was Another Way to Destroy the Ring and Defeat Sauron
https://youtube.com/watch?v=Qhnc8TbUrKM&si=gqLtw6pBQcg4byg7
Unix question: is there a version of seq(1) for letters instead of numbers?
Avoiding Knowledge Edit Skipping in Multi-hop Question Answering with Guided Decomposition
Yi Liu, Xiangrong Zhu, Xiangyu Liu, Wei Wei, Wei Hu
https://arxiv.org/abs/2509.07555 h…
Generative AI as a Safety Net for Survey Question Refinement
Erica Ann Metheney, Lauren Yehle
https://arxiv.org/abs/2509.08702 https://arxiv.org/pdf/2509.0…
#AdviceRequested!
We want to buy an electric car! It's exciting but also daunting to make car buying decisions, and harder to evaluate with electric than it was for gas.
Safety and reliability are the highest priorities — which was easier to evaluate with models like the Honda Civic that's been around for decades
Lucid looks really nice, but I question the relia…
Dear author,
Thank you for submitting your article manuscript. While we trust that many in the research community would welcome a study of vitamin, drug, and disease interaction as a timely intervention, we question its suitability for publication in a history journal.
The editors
Here's your regular reminder:
There is no debate over if cars will or will not be part of the future. They will not. They are a luxury we can no longer afford. The question is only if we will choose to rid our future of cars, or allow cars to rid us of our future.
#FuckCars
Monorepo vs Multi-repo vs #Git submodule vs Git Subtree: A Complete Guide for Developers
https://levelup.gitcon…
‘It’s a question of humanity’: how a small Spanish town made headlines over its immigration stance | Spain | The Guardian
https://www.theguardian.com/world/2025/oct/11/small-spanish-town-headlines-immigration-villamalea
> …
"Chatbots are turning on the flattery, patience, and support. Microsoft AI CEO Mustafa Suleyman said the “cool thing” about the company’s AI personal assistant is that it doesn’t “judge you for asking a stupid question.” It exhibits “kindness and empathy.” Here’s the rub: We need people to judge us. We need people to call us out for making stupid statements. Friction and conflict are key to developing resilience and learning how to function in society."
Asking For It: Question-Answering for Predicting Rule Infractions in Online Content Moderation
Mattia Samory, Diana Pamfile, Andrew To, Shruti Phadke
https://arxiv.org/abs/2510.06350
One other thing, while we don't claim that our mixed-effects logit model is the perfect way to account for non-independence between languages, we don't think it's correct, as Xia & Lindell assert, to just claim that our results are "counterintuitive", the fix-eff estimates are "unreliable" and that the high model fits are "unrealistic." Whether a mix model better captures the data-generat. process is ultimately an empirical question, not one to be decided by assertion. Take, for instance, our finding that once random effects for either subregion or language family are included, the estimated effect of L1_population reverses direction—from the negative value reported by Xia & Lindell et al. to a positive one.
NFL Week 6 injury report: Bengals' Ja'Marr Chase questionable vs. Packers due to illness
https://www.cbssports.com/nfl/news/nfl-week-6-injury-report-updates-tracker/
🔊 #NowPlaying on #BBCRadio3:
#TheEssay
- The Meaning and Magic of Music
How does music convey meaning to the listener? Catherine Coldstream examines this question in the context of classical music and religious faith.
Relisten now 👇
https://www.bbc.co.uk/programmes/m0029pvm
Stability with respect to periodic switching laws does not imply global stability under arbitrary switching
Ian D. Morris
https://arxiv.org/abs/2510.08074 https://
🇺🇦 #NowPlaying on KEXP's #Expansions
A.B.O.:
🎵 This Question
#ABO
https://deepclicks.bandcamp.com/track/this-question-original-mix
It's a question that comes up for a lot of us, so in the latest episode I talk about potential ways to cope with it.
Find the podcast by searching for That Hoarder: Overcome Compulsive Hoarding podcast in your podcast player.
#hoarding #hoardingdisorder
The Biggest Question Facing Each NFL Team in the Second Half of 2025 https://www.foxsports.com/stories/nfl/biggest-question-facing-each-nfl-team-second-half-2025
The Role of Exploration Modules in Small Language Models for Knowledge Graph Question Answering
Yi-Jie Cheng, Oscar Chew, Yun-Nung Chen
https://arxiv.org/abs/2509.07399 https://…
An interesting piece about the death and life of Edgar Allan Poe. #taphephobia
https://lithub.com/to-haunt-and-be-haunted-on-the-exhumation-of-edgar-allen-po…
VoiceAgentBench: Are Voice Assistants ready for agentic tasks?
Dhruv Jain, Harshit Shukla, Gautam Rajeev, Ashish Kulkarni, Chandra Khatri, Shubham Agarwal
https://arxiv.org/abs/2510.07978
"In the end, we are defined not just by our actions, but by the actions we tolerate."
Mike Monteiro with another banger
(Original title: How to eat with others)
https://buttondown.com/monteiro/archive/how-to-eat-with-others/
Revisiting the Question of Information Content of EXAFS Spectra through a Bayesian Approach
Lucy Haddad, Diego Gianolio, Andrei Sapelkin
https://arxiv.org/abs/2509.07950 https:/…
The question Mamdani won't answer (Politico)
https://www.politico.com/newsletters/playbook/2025/11/06/the-question-mamdani-wont-answer-00639459
http://www.memeorandum.com/251106/p8#a251106p8
Temporal Counterfactual Explanations of Behaviour Tree Decisions
Tamlin Love, Antonio Andriella, Guillem Aleny\`a
https://arxiv.org/abs/2509.07674 https://…
Number of integers represented by families of binary forms III: fewnomials
Etienne Fouvry, Michel Waldschmidt
https://arxiv.org/abs/2509.08335 https://arxi…
All you need is controlled-V: universality of a standard two-qubit gate by catalytic embedding
Robin Kaarsgaard
https://arxiv.org/abs/2509.07578 https://ar…
On the diagonal of quartic hypersurfaces and $(2,3)$-complete intersection $n$-folds
Elia Fiammengo, Morten L\"uders
https://arxiv.org/abs/2510.07111 https://
Now you know they're lying: different responses to the same question, multiple answers, indicate lying about the real reason
Tariffs aren't meant for revenue and will shrink over time, Bessent says
https://www.axios.com/2025/11/09/trump-tariffs-bessent-tax-revenue
I also have this idea that somewhere there's a a transbian policule coven who take money for hexes and curses against fascists and use it to fund the revolution.
Unlreated question... if a coven is a legally registered church, shouldn't paying for a hex be a legally tax deductable charitable donation? Asking for a friend.
Interesting question on road.cc: is there a bit of road that you just hate #cycling on?
For me, a short stretch of southbound Mearns Rd at Mearns Kirk. Slight rise, poor surface, from Eaglesham Rd to the roundabout:
https://www.…
Multi-Hop Question Answering: When Can Humans Help, and Where do They Struggle?
Jinyan Su, Claire Cardie, Jennifer Healey
https://arxiv.org/abs/2510.04493 https://
Love it when someone finally responds to me after literally months of silence, I ask a question and they respond correcting me and saying they've taken action now anyway "to avoid further iteration and delay".
SN 2022xlp: The second-known well-observed, intermediate-luminosity Iax supernova
D. B\'anhidi, B. Barna, T. Szalai, J. Vink\'o, I. B. B\'ir\'o, K. A. Bostroem, I. Cs\'anyi, K. W. Davis, R. J. Foley, L. Galbany, S. W. Jha, D. A. Howell, L. A. Kwok, A. P\'al, C. Pellegrino, C. Rojas-Bravo, P. Sz\'ekely, K. Taggart, G. Terreran, S. Tinyanont
From Keywords to Clusters: AI-Driven Analysis of YouTube Comments to Reveal Election Issue Salience in 2024
Raisa M. Simoes, Timoteo Kelly, Eduardo J. Simoes, Praveen Rao
https://arxiv.org/abs/2510.07821
The $L^p$-diameter of the space of contractible loops
Michael Brandenbursky, Egor Shelukhin
https://arxiv.org/abs/2509.07270 https://arxiv.org/pdf/2509.072…
“The real question, then, is not ‘what can we do?’, but ‘what are we afraid to do?’ Whose comfort are we protecting when we ask safe questions? Whose illusions do we preserve through politeness? Solidarity is not an optic; it is a disruption. It is noisy, uncomfortable, often isolating. It pulls reputation apart rather than polishing it.
…
We are too fluent in the language of outrage, too comfortable in the posture of virtue. History will not absolve spectatorship, even when specta…
Dynamic Connectivity with Expected Polylogarithmic Worst-Case Update Time
Simon Meierhans, Maximilian Probst Gutenberg
https://arxiv.org/abs/2510.08297 https://
Probing the Origin of Water in Planets within Habitable Zones by HWO
Yasuhiro Hasegawa, Courtney Dressing, Ludmila Carone
https://arxiv.org/abs/2510.07349 https://
Martial Arts have always been a part of #StarTrek. But we have come quite a bit since the famous hand chop, as we can see with the subject of today's #TrekTriviaTuesday question.
As always no googling and no spoiling the answer for others. Please boost after voting! :BoostOK:
V…
Finally, what Xia & Lindell call a "separation problem" is, in our view, a feature of our approach and not a bug.
If, e.g., all languages in a family are polysynthetic (or none are), that’s not a statistical artefact – it’s the signal. The outcome is well associated with genealogy, showing that family membership captures someth genuinely informative about the process. When the model finds that family explains a large share of the variance, that's not a failure–it's evidence that phylogenetic structure dominates the pattern.
So while Xia & Lindell insist that "autocorrelation due to relationships and distance cannot be captured in family or regional-level analyses", we see that as an empirical question – and we treated it as one.
The real test is whether a mixed model that explicitly represents phylogeny and geography performs worse than their alternative, where the entire shared history of languages and environments is effectively collapsed into a single dimension (an eigenvector).
In other words: we model relationships – Xia & Lindell summarise them into one number per language.
The Random Walk Pinning Model II: Upper bounds on the free energy and disorder relevance
Quentin Berger, Hubert Lacoin
https://arxiv.org/abs/2509.08769 https://
TEGRA: A Flexible & Scalable NextGen Mobile Core
Bilal Saleem, Omar Basit, Jiayi Meng, Iftekhar Alam, Ajay Thakur, Christian Maciocco, Muhammad Shahbaz, Y. Charlie Hu, Larry Peterson
https://arxiv.org/abs/2509.07410
Counterfactual Identifiability via Dynamic Optimal Transport
Fabio De Sousa Ribeiro, Ainkaran Santhirasekaram, Ben Glocker
https://arxiv.org/abs/2510.08294 https://
I have a question about the following:
Space-X satellites are relatively low orbit - they go around the world, their radio/signal footprint on the ground goes around the world with them.
So how does a single country, the US, issue "license" for radio spectrum that apply outside of the US geographic borders?
"SpaceX buys wireless spectrum from EchoStar in $17 billion deal"
Ordered a refurb Samsung S20 FE from Newegg, arrived today. I question the "Grade A - Excellent" rating that Reebeio gave it. Has mars all around the edge of the case and the back. Also has what looks to be a pressure crack in the back panel next to the cameras, like someone sat on it while in some kind of protective case and just put enough weight on it to crack the phone case a tiny bit.
However, the screen looks perfect and it powers on and boots to initialization mode no …
D-LEAF: Localizing and Correcting Hallucinations in Multimodal LLMs via Layer-to-head Attention Diagnostics
Tiancheng Yang, Lin Zhang, Jiaye Lin, Guimin Hu, Di Wang, Lijie Hu
https://arxiv.org/abs/2509.07864
Ramp says it has hit $1B in annualized revenue, after saying it had hit $700M in March; it was valued at $22.5B in July (Julie Bort/TechCrunch)
https://techcrunch.com/2025/09/09/ramp-says-it-has-hit-1b-in-annualized-revenue/
Memorization in Large Language Models in Medicine: Prevalence, Characteristics, and Implications
Anran Li, Lingfei Qian, Mengmeng Du, Yu Yin, Yan Hu, Zihao Sun, Yihang Fu, Erica Stutz, Xuguang Ai, Qianqian Xie, Rui Zhu, Jimin Huang, Yifan Yang, Siru Liu, Yih-Chung Tham, Lucila Ohno-Machado, Hyunghoon Cho, Zhiyong Lu, Hua Xu, Qingyu Chen
https://
Cowboys Defense Faces HUGE Question vs Giants! https://www.youtube.com/watch?v=Yy1kKR4n28A
Aligning LLMs for the Classroom with Knowledge-Based Retrieval -- A Comparative RAG Study
Amay Jain, Liu Cui, Si Chen
https://arxiv.org/abs/2509.07846 https://
Hakeem Jeffries dodges question on whether Mamdani is future of Democratic Party (Fox News)
https://www.foxnews.com/politics/hakeem-jeffries-dodges-question-whether-mamdani-future-democratic-party
http://www.memeorandum.com/251105/p164#a251105p164
PAC Learnability in the Presence of Performativity
Ivan Kirev, Lyuben Baltadzhiev, Nikola Konstantinov
https://arxiv.org/abs/2510.08335 https://arxiv.org/p…
Could early bye weeks be a good thing? Why there's an advantage and how six teams are approaching them https://www.espn.com/nfl/story/_/id/46509162/nfl-bye-weeks-2025-advantage-question-steelers-packers-falcons-be…
The nonexistence of sections of Stiefel varieties and stably free modules
Sebastian Gant
https://arxiv.org/abs/2509.07263 https://arxiv.org/pdf/2509.07263
Exploring the Viability of the Updated World3 Model for Examining the Impact of Computing on Planetary Boundaries
Nara Guliyeva, Eshta Bhardwaj, Christoph Becker
https://arxiv.org/abs/2510.07634
Culturally transmitted color categories in LLMs reflect a learning bias toward efficient compression
Nathaniel Imel, Noga Zaslavsky
https://arxiv.org/abs/2509.08093 https://
Patriots Rookie’s Status in Question vs. Raiders https://www.si.com/nfl/patriots/news/new-england-patriots-will-campbell-status-question-raiders
Opponent Shaping in LLM Agents
Marta Emili Garcia Segura, Stephen Hailes, Mirco Musolesi
https://arxiv.org/abs/2510.08255 https://arxiv.org/pdf/2510.08255
On roundness of rotation sets
Boris Perrot, Jan Boro\'nski, Alex Clark
https://arxiv.org/abs/2510.08235 https://arxiv.org/pdf/2510.08235
BcQLM: Efficient Vision-Language Understanding with Distilled Q-Gated Cross-Modal Fusion
Sike Xiang, Shuang Chen, Amir Atapour-Abarghouei
https://arxiv.org/abs/2509.08715 https:…
Firefighters question leaders’ role in Washington immigration raid
https://www.dailykos.com/stories/2025/9/4/2341497/-Firefighters-question-leaders-role-in-Washington-immigration-raid
Mailbag: Pick your poison stopping run, pass? https://www.dallascowboys.com/news/mailbag-pick-your-poison-stopping-run-pass
NFL Week 6 injury report: Jalen Carter's status in question for Eagles; Giants shorthanded at WR
https://www.cbssports.com/nfl/news/nfl-week-6-injury-report-updates-tracker/
Mailbag: Pick your poison stopping run, pass? https://www.dallascowboys.com/news/mailbag-pick-your-poison-stopping-run-pass
KERAG: Knowledge-Enhanced Retrieval-Augmented Generation for Advanced Question Answering
Yushi Sun, Kai Sun, Yifan Ethan Xu, Xiao Yang, Xin Luna Dong, Nan Tang, Lei Chen
https://arxiv.org/abs/2509.04716
Bill Cassidy Traps RFK Jr. With Nobel Prize Question
https://www.mediaite.com/media/news/senate-republican-bill-cassidy-masterfully-traps-rfk-jr-with-nobel-prize-question/
Cowboys will get a chance to answer the question of whether a big trade would help https://www.foxsports.com/articles/nfl/cowboys-will-get-a-chance-to-answer-the-question-of-whether-a-big-trade-would-help
Mean dimension and rate-distortion function revisited
Rui Yang
https://arxiv.org/abs/2510.08051 https://arxiv.org/pdf/2510.08051
A look at India's rationale for banning online real-money games, with IT minister Ashwini Vaishnaw citing 450M people losing a combined ~$2.3B to them (Vivek Kaul/Newslaundry)
https://www.newslaundry.com/2025/08/27/the-rs-444-question-why-in…
FocusMed: A Large Language Model-based Framework for Enhancing Medical Question Summarization with Focus Identification
Chao Liu, Ling Luo, Tengxiao Lv, Huan Zhuang, Lejing Yu, Jian Wang, Hongfei Lin
https://arxiv.org/abs/2510.04671
Lamar Jackson contract: Ravens QB sidesteps question about extension, says he's 'not worried about that'
https://www.cbssports.com/nfl/news/lamar-j…
MoLoRAG: Bootstrapping Document Understanding via Multi-modal Logic-aware Retrieval
Xixi Wu, Yanchao Tan, Nan Hou, Ruiyang Zhang, Hong Cheng
https://arxiv.org/abs/2509.07666 htt…
Replaced article(s) found for cs.CL. https://arxiv.org/list/cs.CL/new
[1/7]:
- Evaluating LLMs' Mathematical Reasoning in Financial Document Question Answering
Pragya Srivastava, Manuj Malik, Vivek Gupta, Tanuja Ganu, Dan Roth
SimpleQA Verified: A Reliable Factuality Benchmark to Measure Parametric Knowledge
Lukas Haas, Gal Yona, Giovanni D'Antonio, Sasha Goldshtein, Dipanjan Das
https://arxiv.org/abs/2509.07968
Are Humans as Brittle as Large Language Models?
Jiahui Li, Sean Papay, Roman Klinger
https://arxiv.org/abs/2509.07869 https://arxiv.org/pdf/2509.07869
Beyond Turn Limits: Training Deep Search Agents with Dynamic Context Window
Qiaoyu Tang, Hao Xiang, Le Yu, Bowen Yu, Yaojie Lu, Xianpei Han, Le Sun, WenJuan Zhang, Pengbo Wang, Shixuan Liu, Zhenru Zhang, Jianhong Tu, Hongyu Lin, Junyang Lin
https://arxiv.org/abs/2510.08276
AI Knowledge Assist: An Automated Approach for the Creation of Knowledge Bases for Conversational AI Agents
Md Tahmid Rahman Laskar, Julien Bouvier Tremblay, Xue-Yong Fu, Cheng Chen, Shashi Bhushan TN
https://arxiv.org/abs/2510.08149
StepChain GraphRAG: Reasoning Over Knowledge Graphs for Multi-Hop Question Answering
Tengjun Ni, Xin Yuan, Shenghong Li, Kai Wu, Ren Ping Liu, Wei Ni, Wenjie Zhang
https://arxiv.org/abs/2510.02827
Research on Multi-hop Inference Optimization of LLM Based on MQUAKE Framework
Zucheng Liang, Wenxin Wei, Kaijie Zhang, Hongyi Chen
https://arxiv.org/abs/2509.04770 https://
AMANDA: Agentic Medical Knowledge Augmentation for Data-Efficient Medical Visual Question Answering
Ziqing Wang, Chengsheng Mao, Xiaole Wen, Yuan Luo, Kaize Ding
https://arxiv.org/abs/2510.02328
Uncertainty as Feature Gaps: Epistemic Uncertainty Quantification of LLMs in Contextual Question-Answering
Yavuz Bakman, Sungmin Kang, Zhiqi Huang, Duygu Nur Yaldiz, Catarina G. Bel\'em, Chenyang Zhu, Anoop Kumar, Alfy Samuel, Salman Avestimehr, Daben Liu, Sai Praneeth Karimireddy
https://arxiv.org/abs/2510.02671