2025-10-02 09:20:40
Theory of Scaling Laws for In-Context Regression: Depth, Width, Context and Time
Blake Bordelon, Mary I. Letey, Cengiz Pehlevan
https://arxiv.org/abs/2510.01098 https://
Theory of Scaling Laws for In-Context Regression: Depth, Width, Context and Time
Blake Bordelon, Mary I. Letey, Cengiz Pehlevan
https://arxiv.org/abs/2510.01098 https://
Anthropic adds context editing and a memory tool to the Claude API, allowing AI agents to handle long-running tasks without frequently hitting context limits (Anthropic)
https://www.anthropic.com/news/context-management
Salesforce öffnet Slack für externe KI
Salesforce öffnet Slack für KI: Über API und Model Context Protocol erhalten Entwickler Zugriff auf Chatdaten, um kontextsensitive Agenten zu erstellen.
https://www.
the context, the context, the context
Strategic Fusion of Vision Language Models: Shapley-Credited Context-Aware Dawid-Skene for Multi-Label Tasks in Autonomous Driving
Yuxiang Feng, Keyang Zhang, Hassane Ouchouid, Ashwil Kaniamparambil, Ioannis Souflas, Panagiotis Angeloudis
https://arxiv.org/abs/2510.01126
How Does the Pretraining Distribution Shape In-Context Learning? Task Selection, Generalization, and Robustness
Wa\"iss Azizian, Ali Hasan
https://arxiv.org/abs/2510.01163 …
Variable Rate Image Compression via N-Gram Context based Swin-transformer
Priyanka Mudgal, Feng Liu
https://arxiv.org/abs/2510.00058 https://arxiv.org/pdf/…
Black-box Context-free Grammar Inference for Readable & Natural Grammars
Mohammad Rifat Arefin, Shanto Rahman, Christoph Csallner
https://arxiv.org/abs/2509.26616 https://…
A Measurement Study of Model Context Protocol
Hechuan Guo, Yongle Hao, Yue Zhang, Minghui Xu, Peizhuo Lyu, Jiezhi Chen, Xiuzhen Cheng
https://arxiv.org/abs/2509.25292 https://…
Memory-Augmented Log Analysis with Phi-4-mini: Enhancing Threat Detection in Structured Security Logs
Anbi Guo, Mahfuza Farooque
https://arxiv.org/abs/2510.00529 https://…
Leveraging Scene Context with Dual Networks for Sequential User Behavior Modeling
Xu Chen, Yunmeng Shu, Yuangang Pan, Jinsong Lan, Xiaoyong Zhu, Shuai Xiao, Haojin Zhu, Ivor W. Tsang, Bo Zheng
https://arxiv.org/abs/2509.26172
SparseServe: Unlocking Parallelism for Dynamic Sparse Attention in Long-Context LLM Serving
Qihui Zhou, Peiqi Yin, Pengfei Zuo, James Cheng
https://arxiv.org/abs/2509.24626 http…
#FotoVorschlag 'Fortbewegungsmittel' 'Means of transportation'
1/.
I do not have that many photos of my beloved means of transportation.
The photo itself has also some context which I've written in my blog:
Does anybody know of aggregations of dcat data (or any published dcat on data collections/services) in the context of cultural heritage institutions?
#openglam #dcat #culturalheritage
If you think there might be an AI bubble, but don't worry, even if it pops, how bad can it be?
Here's one number: seven tech companies (NVIDIA, Microsoft, Google, Apple, Meta, Tesla, Amazon) are worth ~20 trillion, ⇓ of total US stock market
For context: total subprime mortgages in 2008 were 1.3 trillion, total US mortgage debt 10 trillion
You have a 401k or index fonds? Your money is likely tied with the AI market. If you want that, that's fine. If you're …
Pretrain-Test Task Alignment Governs Generalization in In-Context Learning
Mary I. Letey, Jacob A. Zavatone-Veth, Yue M. Lu, Cengiz Pehlevan
https://arxiv.org/abs/2509.26551 htt…
My sabbatical officially starts today!
https://mastodon.social/@nocontexttrek/115073662238539617
Understanding and Leveraging the Expert Specialization of Context Faithfulness in Mixture-of-Experts LLMs
Jun Bai, Minghao Tong, Yang Liu, Zixia Jia, Zilong Zheng
https://arxiv.org/abs/2508.19594
Also a big thanks to the unknown-to-me person who already made the first PRs to add more configs over the weekend: https://codeberg.org/gedankenstuecke/freshrss-fulltext-settings
I've also expanded the README a bit to hopefully provide more context 🙂
The Complexity Trap: Simple Observation Masking Is as Efficient as LLM Summarization for Agent Context Management
Tobias Lindenbauer, Igor Slinko, Ludwig Felder, Egor Bogomolov, Yaroslav Zharov
https://arxiv.org/abs/2508.21433
Given the remarkable #GreensSurge in members and polling, and the ongoing chaotic infighting at 'Your Party', it's maybe worth reposting my questioning of both leaderships on the eco-planetary crisis and overshoot.
As UK politics turns both right and left, how do we get degrowth onto the agenda? – degrowthUK
Trying to figure out why Gemini AI in Gmail is so very bad. It seems to not have access to all of my email and whatever RAG-like thing they are doing is very stupid.
Here's Gemini saying my oldest email matching some keywords in 2021 is from March 2021. If I explicitly tell it to look in February 2021, it finds it. There's only 11 emails that match in all of 2021, it's not like I'm blowing up the context window.
A post from the archive 📫:
Building debugging context for Copilot Chat
https://www.poppastring.com/blog/building-debugging-context-for-copilot-chat
Debating degrowth: A response to Jason Hickel.
https://www.resilience.org/stories/2025-10-02/debating-degrowth-a-response-to-jason-hickel/
We (coordinators of the DegrowthUK website) will possibly write something about this. Broadly…
Why bother? He'll just want to rip them up so that he can add fake gold in their place.
https://www.theguardian.com/commentisfree/2025/sep/01/soldiers-landscaping-washington-dc-crime
Permutation closure for multiple context-free languages
Andrew Duncan, Murray Elder, Lisa Frenkel, Mengfan Lyu
https://arxiv.org/abs/2509.22239 https://arx…
Honestly, #emoji and icons in #Unicode are a true horror.
Yeah, sure. It's great that you don't have to use <img/> anymore and you can just paste a random Unicode character. You can get graphics into fields where only text was originally intended (like bug summaries). Even better, you can now easily get cool colorful icons on terminal with almost no effort.
However, it is an #accessibility nightmare. People are now encoding *information* in random graphical symbols. Symbols that require huge fonts to render, or huge character tables to describe.
Yeah, a bare <img/> carrying information sucks. However, you can add a *meaningful* alt-text to the image, and accessibility tools can use that text to provide meaningful context. Like "bug fix".
However, emojis and icons are symbolic. The best you can get is some description like "hammer and wrench", so people can kinda figure out that it's probably a "bug fix". Or maybe it was a "maintenance task"? Or you'll get a "unknown character 0x1F6E0". And I'm sure people will surely enjoy cross-referencing a "legend" of such "unknown characters".
Addiction (Speculatve)
Kind of a fucked-up metaphor, but I was thinking yesterday that parenting is a lot like addiction. If you separate me from my child, I'll take completely irrational and desperate actions to get them back, driven by a deep instinct that goes well beyond "love." I'll also make self-disadvantageous long-term decisions like forgoing sleep, working an extra job, or quitting a job to do some combination of providing for and/or being present with my child.
Even in parenting situations where love is absent, and beyond, I think, the possessiveness that sometimes festers in those situations, there's often (although not always) a craving for simple presence of the child.
In a healthy relationship, there's a whole lot more than this, but it's interesting to me that the same obsessive craving and absolute priority that we think of as diseased and/or monstrous in someone addicted to a hard drug can be healthy in the right context (that is, when it doesn't contribute to abusive or twisted parental relationships but instead exists alongside a healthy amount of love and respect).
Makes me wonder if there are ways to have a truly healthy drug addiction, although I recognize the answer might well be "no" and that even if it's "technically/theoretically yes" it might still be harmful to hype up or even merely discuss that possibility since it might help addicted people in harmful addictions more easily justify inaction. At minimum I think any "yes" answer here involves assuming utopian-level differences from our current society.
#Parenting #Addiction
📊 Full observability: Log every state change, measure latency, generate audit trails
🎨 #PHP8 features: String-backed enums for states/events, readonly value objects for context data
🔄 Domain events pattern: Emit events to outbox, handle side effects idempotently with retries
📋 Easy testing: Unit test state transitions without hitting database or network
📈 Auto-documentation: …
SlimPack: Fine-Grained Asymmetric Packing for Balanced and Efficient Variable-Length LLM Training
Yuliang Liu, Guohao Wu, Shenglong Zhang, Wei Zhang, Qianchao Zhu, Zhouyang Li, Chenyu Wang
https://arxiv.org/abs/2509.26246
DynaMIC: Dynamic Multimodal In-Context Learning Enabled Embodied Robot Counterfactual Resistance Ability
Tianqiang Yan, Ziqiao Lin, Sicheng Wang, Tianwei Zhang, Zhenglong Sun
https://arxiv.org/abs/2509.24413
POVQA: Preference-Optimized Video Question Answering with Rationales for Data Efficiency
Ashim Dahal, Ankit Ghimire, Saydul Akbar Murad, Nick Rahimi
https://arxiv.org/abs/2510.01009
Rethinking Thinking Tokens: LLMs as Improvement Operators
Lovish Madaan, Aniket Didolkar, Suchin Gururangan, John Quan, Ruan Silva, Ruslan Salakhutdinov, Manzil Zaheer, Sanjeev Arora, Anirudh Goyal
https://arxiv.org/abs/2510.01123
Dynamics of Majorana zero modes across hybrid Kitaev chain
Rajiv Kumar, Rohit Kumar Shukla, Levan Chotorlishvili, Sunil Kumar Mishra
https://arxiv.org/abs/2509.26134 https://
Test time training enhances in-context learning of nonlinear functions
Kento Kuwataka, Taiji Suzuki
https://arxiv.org/abs/2509.25741 https://arxiv.org/pdf/…
Analysis of Semantic Communication for Logic-based Hypothesis Deduction
Ahmet Faruk Saz, Siheng Xiong, Faramarz Fekri
https://arxiv.org/abs/2508.21755 https://
Designing Wine Tasting Experiences for All: The role of Human Diversity and Personal food memory
Xinyang Shan, Yuanyuan Xu, Yuqing Wang, Tian Xia, Yinshan Lin
https://arxiv.org/abs/2510.00607
Throttling for metric dimension and its variants
Boris Brimkov, Peter Diao, Jesse Geneson, Carolyn Reinhart, Shen-Fu Tsai, William Wang, Kyle Worley
https://arxiv.org/abs/2510.00530
from my link log —
Context parameters and API design in Kotlin.
https://serranofp.com/blog/context-params.html
saved 2025-10-20 https://
Test particle sampling and particle acceleration in a 2D coronal plasmoid-mediated reconnecting current sheet
Eilif S. {\O}yre, Boris V. Gudiksen, Lyndsay Fletcher
https://arxiv.org/abs/2509.25447
Does anybody know of aggregations of dcat data (or any published dcat on data collections/services) in the context of cultural heritage institutions?
#openglam #dcat #culturalheritage
Data Quality Taxonomy for Data Monetization
Eduardo Vyhmeister, Bastien Pietropoli, Andrea Visentin
https://arxiv.org/abs/2510.00089 https://arxiv.org/pdf/…
Asynchronous Nonlinear Sheaf Diffusion for Multi-Agent Coordination
Yichen Zhao, Tyler Hanks, Hans Riess, Samuel Cohen, Matthew Hale, James Fairbanks
https://arxiv.org/abs/2510.00270
Whenever I see a clip of someone throwing around the Video Game Line after a tragedy I gotta check the upload date to see if it was just an out of context clip from 2004 or something new.
Energy conditions and gravitational baryogenesis in $f(R, {\cal R})$ gravity
K. Atazadeh, S. Golsanamlou
https://arxiv.org/abs/2510.01148 https://arxiv.org…
In Apple Music context does adding a song to a playlist imply adding the album to your library?
As an aside, as a subscribed Apple Music customer is there some support that I can trach out to with similar questions and issues?
Enhancing Connectivity for Emergency Vehicles Through UAV Trajectory and Resource Allocation Optimization
S. Fatemeh Bozorgi, S. Mohammad Razavizadeh, Mohsen Rezaee
https://arxiv.org/abs/2509.26067
Context-Driven Performance Modeling for Causal Inference Operators on Neural Processing Units
Neelesh Gupta, Rakshith Jayanth, Dhruv Parikh, Viktor Prasanna
https://arxiv.org/abs/2509.25155
DeepCodeSeek: Real-Time API Retrieval for Context-Aware Code Generation
Esakkivel Esakkiraja, Denis Akhiyarov, Aditya Shanmugham, Chitra Ganapathy
https://arxiv.org/abs/2509.25716
In-Context Learning can Perform Continual Learning Like Humans
Liuwang Kang, Fan Wang, Shaoshan Liu, Hung-Chyun Chou, Chuan Lin, Ning Ding
https://arxiv.org/abs/2509.22764 https…
Fine-Grained Detection of Context-Grounded Hallucinations Using LLMs
Yehonatan Pesiakhovsky, Zorik Gekhman, Yosi Mass, Liat Ein-Dor, Roi Reichart
https://arxiv.org/abs/2509.22582
Personalized Vision via Visual In-Context Learning
Yuxin Jiang, Yuchao Gu, Yiren Song, Ivor Tsang, Mike Zheng Shou
https://arxiv.org/abs/2509.25172 https://
Crosslisted article(s) found for cs.AI. https://arxiv.org/list/cs.AI/new
[5/10]:
- In-Context Curiosity: Distilling Exploration for Decision-Pretrained Transformers on Bandit Tasks
Huitao Yang, Guanting Chen
Learnable Conformal Prediction with Context-Aware Nonconformity Functions for Robotic Planning and Perception
Divake Kumar, Sina Tayebati, Francesco Migliarba, Ranganath Krishnan, Amit Ranjan Trivedi
https://arxiv.org/abs/2509.21955
Computable measures of non-Markovianity for Gaussian free fermion systems
Giuliano Chiriac\`o
https://arxiv.org/abs/2509.25953 https://arxiv.org/pdf/2509.2…
Explainable and Resilient ML-Based Physical-Layer Attack Detectors
Aleksandra Knapi\'nska, Marija Furdek
https://arxiv.org/abs/2509.26530 https://arxiv…
Next Point-of-interest (POI) Recommendation Model Based on Multi-modal Spatio-temporal Context Feature Embedding
Lingyu Zhang, Guobin Wu, Yan Wang, Pengfei Xu, Jian Liang, Xuan Song, Yunhai Wang
https://arxiv.org/abs/2509.22661
Carbon and nitrogen as indicators of stellar evolution and age. A homogeneous sample of 44 open clusters from the Gaia-ESO Survey
G. Tautvai\v{s}ien\.e, A. Drazdauskas, \v{S}. Mikolaitis, R. Minkevi\v{c}i\=ut\.e, E. Stonkut\.e, S. Randich, A. Bragaglia, L. Magrini, R. Smiljanic, M. Ambrosch, V. Bagdonas, G. Casali, Y. Chorniy, C. Viscasillas V\'azquez
https…
U-DFA: A Unified DINOv2-Unet with Dual Fusion Attention for Multi-Dataset Medical Segmentation
Zulkaif Sajjad, Furqan Shaukat, Junaid Mir
https://arxiv.org/abs/2510.00585 https:…
Orchid: Orchestrating Context Across Creative Workflows with Generative AI
Srishti Palani, Gonzalo Ramos
https://arxiv.org/abs/2508.19517 https://arxiv.org…
Mapping Toxic Comments Across Demographics: A Dataset from German Public Broadcasting
Jan Fillies, Michael Peter Hoffmann, Rebecca Reichel, Roman Salzwedel, Sven Bodemer, Adrian Paschke
https://arxiv.org/abs/2508.21084
Novel very-high-frequency quasi-periodic oscillations of compact, non-singular objects
Jens Boos, Felix Wunsch
https://arxiv.org/abs/2510.00986 https://arx…
from my link log —
libfringe: an old Rust library for stackful coroutines.
https://github.com/edef1c/libfringe
saved 2025-10-28 https://dotat.at/:/S…
'Too much alignment; not enough culture': Re-balancing cultural alignment practices in LLMs
Eric J. W. Orlowski, Hakim Norhashim, Tristan Koh Ly Wey
https://arxiv.org/abs/2509.26167
Context-Specific Instruction: A Longitudinal Study on Debugging Skill Acquisition and Retention for Novice Programmers
Ziyi Zhang, Devjeet Roy, Venera Arnaoudova
https://arxiv.org/abs/2509.22420
TASP: Topology-aware Sequence Parallelism
Yida Wang (Capital Normal University, Infinigence-AI), Ke Hong (Tsinghua University, Infinigence-AI), Xiuhong Li (Infinigence-AI), Yuanchao Xu (Capital Normal University), Wenxun Wang (Tsinghua University), Guohao Dai (Infinigence-AI, Shanghai Jiao Tong University), Yu Wang (Tsinghua University)
https://
Strata: Hierarchical Context Caching for Long Context Language Model Serving
Zhiqiang Xie, Ziyi Xu, Mark Zhao, Yuwei An, Vikram Sharma Mailthody, Scott Mahlke, Michael Garland, Christos Kozyrakis
https://arxiv.org/abs/2508.18572
A Contextual Seven-Valued Logic (\emph{Saptabhang\=inaya}) for Quantum Systems
Partha Ghose
https://arxiv.org/abs/2510.01120 https://arxiv.org/pdf/2510.011…
Uncertainty-Aware Concept Bottleneck Models with Enhanced Interpretability
Haifei Zhang, Patrick Barry, Eduardo Brandao
https://arxiv.org/abs/2510.00773 https://
Finding Phones Fast: Low-Latency and Scalable Monitoring of Cellular Communications in Sensitive Areas
Martin Kotuliak, Simon Erni, Jakub Pol\'ak, Marc Roeschlin, Richard Baker, Ivan Martinovic, Srdjan \v{C}apkun
https://arxiv.org/abs/2509.25430
MCM-DPO: Multifaceted Cross-Modal Direct Preference Optimization for Alt-text Generation
Jinlan Fu, Shenzhen Huangfu, Hao Fei, Yichong Huang, Xiaoyu Shen, Xipeng Qiu, See-Kiong Ng
https://arxiv.org/abs/2510.00647
Counterfactual Scenarios for Automated Planning
Nicola Gigante, Francesco Leofante, Andrea Micheli
https://arxiv.org/abs/2508.21521 https://arxiv.org/pdf/2…
Anatomy-DT: A Cross-Diffusion Digital Twin for Anatomical Evolution
Moinak Bhattacharya, Gagandeep Singh, Prateek Prasanna
https://arxiv.org/abs/2509.25280 https://
Asymptotic Schwarzschild solutions in $f(R)$ gravity and their observable effects on the photon sphere of black holes
Miguel Aparicio Resco
https://arxiv.org/abs/2510.00702 http…
The Demon is in Ambiguity: Revisiting Situation Recognition with Single Positive Multi-Label Learning
Yiming Lin, Yuchen Niu, Shang Wang, Kaizhu Huang, Qiufeng Wang, Xiao-Bo Jin
https://arxiv.org/abs/2508.21816
COM-BOM: Bayesian Exemplar Search for Efficiently Exploring the Accuracy-Calibration Pareto Frontier
Gaoxiang Luo, Aryan Deshwal
https://arxiv.org/abs/2510.01178 https://…
Interpreting Language Models Through Concept Descriptions: A Survey
Nils Feldhus, Laura Kopf
https://arxiv.org/abs/2510.01048 https://arxiv.org/pdf/2510.01…
Unit Test Update through LLM-Driven Context Collection and Error-Type-Aware Refinement
Yuanhe Zhang, Zhiquan Yang, Shengyi Pan, Zhongxin Liu
https://arxiv.org/abs/2509.24419 htt…
MC-GNNAS-Dock: Multi-criteria GNN-based Algorithm Selection for Molecular Docking
Siyuan Cao, Hongxuan Wu, Jiabao Brad Wang, Yiliang Yuan, Mustafa Misir
https://arxiv.org/abs/2509.26377
ProfVLM: A Lightweight Video-Language Model for Multi-View Proficiency Estimation
Edoardo Bianchi, Jacopo Staiano, Antonio Liotta
https://arxiv.org/abs/2509.26278 https://
Data-Centric Elastic Pipeline Parallelism for Efficient Long-Context LLM Training
Shiju Wang, Yujie Wang, Ao Sun, Fangcheng Fu, Zijian Zhu, Bin Cui, Xu Han, Kaisheng Ma
https://arxiv.org/abs/2509.21275
Crosslisted article(s) found for cs.LG. https://arxiv.org/list/cs.LG/new
[7/7]:
- Pretrain-Test Task Alignment Governs Generalization in In-Context Learning
Mary I. Letey, Jacob A. Zavatone-Veth, Yue M. Lu, Cengiz Pehlevan
Hybrid Dialogue State Tracking for Persian Chatbots: A Language Model-Based Approach
Samin Mahdipour Aghabagher, Saeedeh Momtazi
https://arxiv.org/abs/2510.01052 https://…
AI Compute Architecture and Evolution Trends
Bor-Sung Liang
https://arxiv.org/abs/2508.21394 https://arxiv.org/pdf/2508.21394
Why Stop at Words? Unveiling the Bigger Picture through Line-Level OCR
Shashank Vempati, Nishit Anand, Gaurav Talebailkar, Arpan Garai, Chetan Arora
https://arxiv.org/abs/2508.21693
ErrorPrism: Reconstructing Error Propagation Paths in Cloud Service Systems
Junsong Pu, Yichen Li, Zhuangbin Chen, Jinyang Liu, Zhihan Jiang, Jianjun Chen, Rui Shi, Zibin Zheng, Tieying Zhang
https://arxiv.org/abs/2509.26463
GRAD: Generative Retrieval-Aligned Demonstration Sampler for Efficient Few-Shot Reasoning
Oussama Gabouj, Kamel Charaf, Ivan Zakazov, Nicolas Baldwin, Robert West
https://arxiv.org/abs/2510.01165
MultiFluxAI Enhancing Platform Engineering with Advanced Agent-Orchestrated Retrieval Systems
Sri Ram Macharla, Sridhar Murthy J, Anjaneyulu Pasala
https://arxiv.org/abs/2508.21307
TTT3R: 3D Reconstruction as Test-Time Training
Xingyu Chen, Yue Chen, Yuliang Xiu, Andreas Geiger, Anpei Chen
https://arxiv.org/abs/2509.26645 https://arxi…
Bridging Developer Instructions and Code Completion Through Instruction-Aware Fill-in-the-Middle Paradigm
Zhensu Sun, Chengran Yang, Chao Peng, Pengfei Gao, Xiaoning Du, Li Li, David Lo
https://arxiv.org/abs/2509.24637
CoT Vectors: Transferring and Probing the Reasoning Mechanisms of LLMs
Li Li, Ziyi Wang, Yongliang Wu, Jianfei Cai, Xu Yang
https://arxiv.org/abs/2510.00579 https://
Rethinking Transformer Connectivity: TLinFormer, A Path to Exact, Full Context-Aware Linear Attention
Zhongpan Tang
https://arxiv.org/abs/2508.20407 https://
Gather-Scatter Mamba: Accelerating Propagation with Efficient State Space Model
Hyun-kyu Ko, Youbin Kim, Jihyeon Park, Dongheok Park, Gyeongjin Kang, Wonjun Cho, Hyung Yi, Eunbyung Park
https://arxiv.org/abs/2510.00862
Model Context Protocols in Adaptive Transport Systems: A Survey
Gaurab Chhetri, Shriyank Somvanshi, Md Monzurul Islam, Shamyo Brotee, Mahmuda Sultana Mimi, Dipti Koirala, Biplov Pandey, Subasish Das
https://arxiv.org/abs/2508.19239
Med-RewardBench: Benchmarking Reward Models and Judges for Medical Multimodal Large Language Models
Meidan Ding, Jipeng Zhang, Wenxuan Wang, Cheng-Yi Li, Wei-Chieh Fang, Hsin-Yu Wu, Haiqin Zhong, Wenting Chen, Linlin Shen
https://arxiv.org/abs/2508.21430
KeySG: Hierarchical Keyframe-Based 3D Scene Graphs
Abdelrhman Werby, Dennis Rotondi, Fabio Scaparro, Kai O. Arras
https://arxiv.org/abs/2510.01049 https://…
Granite Embedding R2 Models
Parul Awasthy, Aashka Trivedi, Yulong Li, Meet Doshi, Riyaz Bhat, Vignesh P, Vishwajeet Kumar, Yushu Yang, Bhavani Iyer, Abraham Daniels, Rudra Murthy, Ken Barker, Martin Franz, Madison Lee, Todd Ward, Salim Roukos, David Cox, Luis Lastras, Jaydeep Sen, Radu Florian
https://arxiv.org/abs/2508.21085
Middo: Model-Informed Dynamic Data Optimization for Enhanced LLM Fine-Tuning via Closed-Loop Learning
Zinan Tang, Xin Gao, Qizhi Pei, Zhuoshi Pan, Mengzhang Cai, Jiang Wu, Conghui He, Lijun Wu
https://arxiv.org/abs/2508.21589
ILRe: Intermediate Layer Retrieval for Context Compression in Causal Language Models
Manlai Liang, Mandi Liu, Jiangzhou Ji, Huaijun Li, Haobo Yang, Yaohan He, Jinlong Li
https://arxiv.org/abs/2508.17892