Tootfinder

Opt-in global Mastodon full text search.

@arXiv_csDC_bot@mastoxiv.page
2025-09-03 08:24:53

DSDE: Dynamic Speculative Decoding with KLD Stability for Real-World Serving
Mingyu Yang, Jae-Young Choi, Kihyo Moon, Minsung Jang, Eunjoo Joen
arxiv.org/abs/2509.01083

@arXiv_csOS_bot@mastoxiv.page
2025-09-03 08:00:13

AdaptCache: KV Cache Native Storage Hierarchy for Low-Delay and High-Quality Language Model Serving
Shaoting Feng, Hanchen Li, Kuntai Du, Zhuohan Gu, Yuhan Liu, Jiayi Yao, Siddhant Ray, Samuel Shen, Yihua Cheng, Ganesh Ananthanarayanan, Junchen Jiang
arxiv.org/abs/2509.00105

@Mediagazer@mstdn.social
2025-08-03 15:45:32

The Senate confirms former Fox News host and prosecutor Jeanine Pirro as US attorney for DC; she had been serving in the role on an interim basis since May (Nnamdi Egwuonwu/NBC News)
nbcnews.com/politics/congress/

35-year-old former U.S. Army sergeant, Bajun “Baji” Mavalwalla II,
faces up to six years in prison
for protesting against ICE deportations
in what legal experts are calling a test case for the
Trump administration’s attempts to criminalize and punish dissent.
Mavalwalla was arrested and charged with “conspiracy to impede or injure officers”
after he was identified in a video taken at the protest and shared on Instagram.
Mavalwalla, who survived a ro…

@hex@kolektiva.social
2025-10-02 12:28:47

Two things are worth noticing right now:
1. The military brass *did not* respond well to Trump and Hegseth.
2. The deployment to #Portland keeps getting delayed.
The military will never say "no" to the president (unless he's literally ordering them to open fire on unarmed civilians or something equally obviously illegal). But there are ways to not comply that don't necessarily involve refusal. Brass showing that they aren't aligned with Trump may weaken his billionaire backers, who might be realizing now that weak dictators who can't lead their militaries tend to get toppled... and their oligarch-backers tend to end up against walls.
If folks being ordered to send troops to #PDX don't want to comply, delaying until there's an initial response from the lawsuit would be basically impossible to detect. The deployment to LA went far too fast, running into logistical challenges like troops sleeping on the floor. The delays we've already seen could indicate either a more careful approach or quiet resistance.
Trump will continue to escalate at every chance he gets. I would be surprised if PDX didn't give him a fight. I doubt the troops will become more interested in serving a guy who's stabbed them in the back and wasted their time at every opportunity.
It is still possible troops just won't deploy. Trump will make something up about how just the threat of an intervention was enough to make things safe or something like that. If we see that, it's 100% the military telling him to kick rocks because he's not competent enough to know when to back down.
Honestly, I think Trump wants revenge for the resistance PDX put up at the end of his last term. Any backing down from that is absolutely a big loss for him.

@arXiv_csCV_bot@mastoxiv.page
2025-09-03 15:02:33

GenCompositor: Generative Video Compositing with Diffusion Transformer
Shuzhou Yang, Xiaoyu Li, Xiaodong Cun, Guangzhi Wang, Lingen Li, Ying Shan, Jian Zhang
arxiv.org/abs/2509.02460

@memeorandum@universeodon.com
2025-10-30 15:30:58

These Republicans oppose DEI, but also cuts hitting Hispanic-serving colleges (Rachel Hatzipanagos/Washington Post)
washingtonpost.com/nation/2025
memeorandum.com/251030/p70#a25

@curiouscat@fosstodon.org
2025-09-03 16:06:27

Due to the devastating harm being done to the USA by the current administration, and the administration of Texas, I have donated again to the Texas Civil Rights Project txcivilrights.org (doubling my giving for 2025).
I will be doing the same for several other

@arXiv_csDC_bot@mastoxiv.page
2025-09-03 09:05:33

LiquidGEMM: Hardware-Efficient W4A8 GEMM Kernel for High-Performance LLM Serving
Huanqi Hu, Bowen Xiao, Shixuan Sun, Jianian Yin, Zhexi Zhang, Xiang Luo, Chengquan Jiang, Weiqi Xu, Xiaoying Jia, Xin Liu, Minyi Guo
arxiv.org/abs/2509.01229

@detondev@social.linux.pizza
2025-08-31 23:28:56

this is an underrated dees, its composition evokes dali and carrington landscapes in a way none of his other stuff does

apple Frankenstein kid and potato baby girl and apple pie serving cyborg parent and DNA ladder and chemtrails

@arXiv_csDB_bot@mastoxiv.page
2025-09-03 10:35:03

Batch Query Processing and Optimization for Agentic Workflows
Junyi Shen, Noppanat Wadlom, Yao Lu
arxiv.org/abs/2509.02121 arxiv.org/pdf/25…

@arXiv_grqc_bot@mastoxiv.page
2025-09-03 13:41:43

Loop Quantum Vector-Tensor Gravity and Its Spherically Symmetric Model
Shengzhi Li, Yongge Ma
arxiv.org/abs/2509.02056 arxiv.org/pdf/2509.0…

@inthehands@hachyderm.io
2025-08-31 15:23:25

The headline here is that
(1) the increasingly authoritarian US government demanded that Google suppress videos serving the public interest because those videos are embarrassing to government officials, and
(2) Google readily over-complied with that request, deleting not just the videos but the entire channel.
theguardian.com/us-news/2025/a

@arXiv_condmatmtrlsci_bot@mastoxiv.page
2025-10-03 08:41:31

exaPD: A highly parallelizable workflow for multi-element phase diagram (PD) construction
Feng Zhang, Zhuo Ye, Maxim Moraru, Ying Wai Li, Weiyi Xia, Yongxin Yao, Ryan Richard, Cai-Zhuang Wang
arxiv.org/abs/2510.01400

@arXiv_astrophGA_bot@mastoxiv.page
2025-09-03 13:25:23

Signal Drop in Magnification Profiles: Combining Lensing Simulations and Observations
David Crespo, Joaquín González-Nuevo, Laura Bonavera, Marcos M. Cueli, Hu Zou, Rebeca Fernández-Fernández, Jose M. Casas
arxiv.org/abs/2509.02213

@arXiv_csNE_bot@mastoxiv.page
2025-10-03 08:21:51

VarCoNet: A variability-aware self-supervised framework for functional connectome extraction from resting-state fMRI
Charalampos Lamprou, Aamna Alshehhi, Leontios J. Hadjileontiadis, Mohamed L. Seghier
arxiv.org/abs/2510.02120

@arXiv_astrophIM_bot@mastoxiv.page
2025-09-03 11:51:13

FPGA-Based RoCEv2-RDMA Readout Electronics for the CTAO-LST Advanced Camera
F. Marini, M. Bellato, A. Bergnoli, D. Corti, A. Griggio, R. Isocrate, L. Modenese, M. Toffano, C. Arcaro, F. Di Pierro, M. Mariotti, M. Mi, P. Wang
arxiv.org/abs/2509.02285

@arXiv_csLG_bot@mastoxiv.page
2025-10-01 14:18:14

Crosslisted article(s) found for cs.LG. arxiv.org/list/cs.LG/new
[1/7]:
- AdaptCache: KV Cache Native Storage Hierarchy for Low-Delay and High-Quality Language Model Serving
Feng, Li, Du, Gu, Liu, Yao, Ray, Shen, Cheng, Ananthanarayanan, Jiang

@arXiv_csCL_bot@mastoxiv.page
2025-10-01 11:13:17

Comparative Analysis of Ant Colony Optimization and Google OR-Tools for Solving the Open Capacitated Vehicle Routing Problem in Logistics
Assem Omar, Youssef Omar, Marwa Solayman, Hesham Mansour
arxiv.org/abs/2509.26216

@samir@functional.computer
2025-08-31 20:45:40

@… Yeah, I am not opposed to state varying across that level. Across applications, maintained by separate teams or even departments, of course! You don’t want to couple yourself to people who might have drastically different definitions of “success” to you.
But within a single application, serving a single purpose? Only if there’s no better way.…

@benb@osintua.eu
2025-09-17 13:22:13

Ukraine captures Kenyan serving in Russian army, who claims he was tricked into joining: benborges.xyz/2025/09/17/ukrai

@newsie@darktundra.xyz
2025-09-25 13:56:34

How Surveillance Firms Use ‘Democracy’ As a Cover for Serving ICE and Trump 404media.co/how-surveillance-f

@memeorandum@universeodon.com
2025-10-29 02:15:52

Another Trump-Appointed U.S. Attorney Found to be Serving Unlawfully, Federal Judge Rules (Yunior Rivas/Democracy Docket)
democracydocket.com/news-alert
memeorandum.com/251028/p156#a2

@NFL@darktundra.xyz
2025-09-22 21:46:29

Virginia Tech coaching search: Bruce Arians, Super Bowl winner and former Hokies QB, serving as consultant

cbssports.com/college-foo…

@davej@dice.camp
2025-08-31 15:42:36

I got Epsom Salt. That’s just uncanny. beige.party/@RickiTarr/1151240

Your salt identity is:
Epsom salt
"say less"

Flavour profile:
• identifies all exits within 7 seconds of entering
• adapts to chaos faster than others can comprehend it
• assembles IKEA furniture without looking at instructions
• low-key
• knows exactly who to befriend for maximum advantage
• pragmatic and resourceful
• resting bitch face

Best matches:
• Fleur de sel
• Kala namak

Arch enemies:
• Flaky salt
• Kosher salt

Serving suggestion:
2 cups combined with your favourite essential oil f…

@midtsveen@social.linux.pizza
2025-10-30 19:36:03

New #Pixelfed Post:
#Fediverse

@cosmos4u@scicomm.xyz
2025-10-30 01:44:22

The most up-to-date light curve of interstellar comet #ATLAS - x.com/AsteroidEnergy/status/19 - with sparse data points all the way to perihelion, bsky.app/profile/kwalsh4a.bsky from PUNCH serving the latest. So as seen from Earth the brightness is about 9.5 mag. - and should still be 10.something when the comet can be observed from the ground again, from about 10 November onwards.

@spamless@mastodon.social
2025-10-29 23:52:35

So, we had the Chez Dallman Pecan Chicken Bok Choy again. I wanted to take my wife's picture, but she protested and said she never looks good under the glaring kitchen light. I pushed the settings button in the camera app and said wryly, "Ah, there it is: turning on 'Make My Wife Beautiful' mode!" And I pushed at the screen.
My wife cracked up. (I did too.) It worked. She likes the photo, or at least she doesn't hate it. I can't post it here, though…

Me tonight at our kitchen table eating my wife's serving of my Pecan Chicken Bok Choy recipe.

@arXiv_csCR_bot@mastoxiv.page
2025-09-30 12:08:51

VeriLLM: A Lightweight Framework for Publicly Verifiable Decentralized Inference
Ke Wang, Felix Qu, Libin Xia, Zishuo Zhao, Chris Tong, Lynn Ai, Eric Yang
arxiv.org/abs/2509.24257

@arXiv_csSE_bot@mastoxiv.page
2025-09-30 11:43:11

Evaluating SAP Joule for Code Generation
Joshua Heisler, Johannes Reisinger, Andreas Fischer
arxiv.org/abs/2509.24828 arxiv.org/pdf/2509.24…

@arXiv_csDC_bot@mastoxiv.page
2025-09-30 10:07:51

A Predictive and Synergistic Two-Layer Scheduling Framework for LLM Serving
Yue Zhang, Yuansheng Chen, Xuan Mo, Alex Xi, Jialun Li, WeiGang Wu
arxiv.org/abs/2509.23384

@Dragofix@veganism.social
2025-10-29 19:15:15

Report finds dangerous mercury levels, highlights mislabeling in shark meat sold in EU news.mongabay.com/2025/10/repo

@metacurity@infosec.exchange
2025-10-27 11:05:26

Chatbots Are Pushing Sanctioned Russian Propaganda
wired.com/story/chatbots-are-p

@scott@carfree.city
2025-10-27 03:57:17

That's sad: what was far and away my favorite restaurant during last year's visit to Portland (Ore.) closed. Brunch-serving vegan diners are a particular type of place I love that seems to be an endangered species these days, cf Beach'N in SF, also closed this year.

@arXiv_quantph_bot@mastoxiv.page
2025-09-29 10:35:37

Resource-efficient universal photonic processor based on time-multiplexed hybrid architectures
Jonas Lammers, Laura Ares, Federico Pegoraro, Philip Held, Benjamin Brecht, Jan Sperling, Christine Silberhorn
arxiv.org/abs/2509.22521

@arXiv_csAR_bot@mastoxiv.page
2025-09-30 09:27:01

Fault Injection in On-Chip Interconnects: A Comparative Study of Wishbone, AXI-Lite, and AXI
Hongwei Zhao, Vianney Lapotre, Guy Gogniat
arxiv.org/abs/2509.24929

After two decades in Congress,
Darrell Issa’s career is all about serving the ultra-wealthy like himself
– and serving Donald Trump. It’s time to send him packing and take our country back. 
On the City Council,
Marni von Wilpert
flipped San Diego’s reddest seat blue.
Now, she’s ready to do it again – in Congress.

@arXiv_csDB_bot@mastoxiv.page
2025-08-27 08:43:52

Rethinking Caching for LLM Serving Systems: Beyond Traditional Heuristics
Jungwoo Kim, Minsang Kim, Jaeheon Lee, Chanwoo Moon, Heejin Kim, Taeho Hwang, Woosuk Chung, Yeseong Kim, Sungjin Lee
arxiv.org/abs/2508.18736

@arXiv_csDC_bot@mastoxiv.page
2025-09-30 10:44:41

SparseServe: Unlocking Parallelism for Dynamic Sparse Attention in Long-Context LLM Serving
Qihui Zhou, Peiqi Yin, Pengfei Zuo, James Cheng
arxiv.org/abs/2509.24626

@jerome@jasette.facil.services
2025-10-22 12:39:16

Drop Site uncovered new information about individuals, donor networks, and businesses helping Canary Mission, a pro-Israel organization serving the U.S.'s deportation and repression efforts.
dropsitenews.com/p/canary-miss

@davidaugust@mastodon.online
2025-10-25 04:16:33

"For me, our job as artists is to serve the story, serve the director, and serve the fellow actors. And if you do that, by osmosis you’re serving yourself because you’ll get the best out of yourself."
—David Oyelowo
#acting #coaching

@sauer_lauwarm@mastodon.social
2025-08-26 02:15:25

The application is currently not serving requests at this endpoint. It may not have been started or is still starting.

@arXiv_mathPR_bot@mastoxiv.page
2025-09-30 10:16:51

Zero-Waiting Load Balancing with Heterogeneous Servers in Heavy Traffic
Xin Liu, Lei Ying
arxiv.org/abs/2509.23918 arxiv.org/pdf/2509.23918…

@servelan@newsie.social
2025-09-09 15:40:23

As Trump Defunds Infrastructure, Water Systems Serving Millions Face Flood Risk - WhoWhatWhy
whowhatwhy.org/science/environ

@arXiv_csET_bot@mastoxiv.page
2025-09-30 08:54:41

Information Transmission in Quorum Sensing for Gut Microbiome
O. Tansel Baydas, Efe Yatgin, Ozgur B. Akan
arxiv.org/abs/2509.25057 arxiv.or…

@cyrevolt@mastodon.social
2025-09-25 03:05:17

happens to the best of us
blog.cloudflare.com/deep-dive-

@brian_gettler@mas.to
2025-10-25 01:26:56

I've been thinking a bit about historians' use of theory. I've decided it's often similar to a comment a cafeteria worker made to me decades ago. They were serving chili dogs and I asked if I could just have a bowl of chili instead. "Oh no, dear," she said, "this is chili dog chili, it's not for eating." I've long suspected that a good chunk of my field thinks of theory along these lines. We put it on the menu and let our students consume it, even …

@arXiv_csAI_bot@mastoxiv.page
2025-08-22 10:07:51

Measuring the environmental impact of delivering AI at Google Scale
Cooper Elsworth, Keguo Huang, David Patterson, Ian Schneider, Robert Sedivy, Savannah Goodman, Ben Townsend, Parthasarathy Ranganathan, Jeff Dean, Amin Vahdat, Ben Gomes, James Manyika
arxiv.org/abs/2508.15734

@Techmeme@techhub.social
2025-09-21 01:05:59

A profile of Noah Urban, who was a key member of the Scattered Spider group because of his social engineering skills and is serving a 10-year prison sentence (Margi Murphy/Bloomberg)

@arXiv_csLG_bot@mastoxiv.page
2025-09-30 14:36:01

DRIFT-Net: A Spectral-Coupled Neural Operator for PDEs Learning
Jiayi Li, Flora D. Salim
arxiv.org/abs/2509.24868 arxiv.org/pdf/2509.24868…

@arXiv_csCV_bot@mastoxiv.page
2025-09-30 15:01:26

GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts
Fan Yuan, Yuchen Yan, Yifan Jiang, Haoran Zhao, Tao Feng, Jinyan Chen, Yanwei Lou, Wenqi Zhang, Yongliang Shen, Weiming Lu, Jun Xiao, Yueting Zhuang
arxiv.org/abs/2509.25160

@arXiv_astrophSR_bot@mastoxiv.page
2025-09-29 10:02:28

Particle Acceleration and Transport in the Large-scale Current Sheet under an Erupting Magnetic Flux Rope
Hao Wu, Yang Guo, Rony Keppens, Chun Xia, Yang Su, Xiangliang Kong, Mingde Ding
arxiv.org/abs/2509.22265

@kurtsh@mastodon.social
2025-08-25 07:59:11

I appreciate the premise of reducing red meat in one's diet... but if you expect me to drop fish or seafood, you can go to hell.
✅ The Self-Importance of Luxury Dining: Eleven Madison Park is serving meat again—a sign of American tastes, and of fine-dining hubris - The Atlantic
archive.ph/BxO7Z

@arXiv_csOS_bot@mastoxiv.page
2025-09-26 08:55:01

Nova: Real-Time Agentic Vision-Language Model Serving with Adaptive Cross-Stage Parallelization
Yuhang Xu, Shengzhong Liu, Dong Zhang, Bingheng Yan, Fan Wu, Guihai Chen
arxiv.org/abs/2509.21301

@arXiv_mathOC_bot@mastoxiv.page
2025-08-11 09:14:39

LLM Serving Optimization with Variable Prefill and Decode Lengths
Meixuan Wang, Yinyu Ye, Zijie Zhou
arxiv.org/abs/2508.06133 arxiv.org/pdf…

@me@mastodon.peterjanes.ca
2025-09-27 15:06:46

From bsky.app/profile/did:plc:cysfy
> Prof. Lee says that the problem for Canada Post is the smartphone because people send texts, not letters. Fine, but don't gut it. I say, make Canada Post a pub…

A federal judge ruled Thursday that Trump’s former lawyer,
Alina Habba, has been unlawfully serving as the top federal prosecutor in New Jersey since last month.
U.S. District Judge Matthew Brann held that Habba’s term as the interim U.S. attorney ended in July,
and the Trump administration’s “novel series of legal and personnel moves” to keep her in the role
-- without getting confirmation from the U.S. Senate
-- didn’t follow procedures required by federal l…

@arXiv_csIR_bot@mastoxiv.page
2025-08-29 08:45:51

MPFormer: Adaptive Framework for Industrial Multi-Task Personalized Sequential Retriever
Yijia Sun, Shanshan Huang, Linxiao Che, Haitao Lu, Qiang Luo, Kun Gai, Guorui Zhou
arxiv.org/abs/2508.20400

@arXiv_astrophHE_bot@mastoxiv.page
2025-08-29 09:53:21

Very high-energy gamma-ray and neutrino emission from hadronic interaction in compact binary millisecond pulsars
Vittoria Vecchiotti, Manuel Linares
arxiv.org/abs/2508.20952

@arXiv_csDC_bot@mastoxiv.page
2025-09-30 07:41:11

FLAME: A Serving System Optimized for Large-Scale Generative Recommendation with Efficiency
Xianwen Guo, Bin Huang, Xiaomeng Wu, Guanlin Wu, Fangjian Li, Shijia Wang, Qiang Xiao, Chuanjiang Luo, Yong Li
arxiv.org/abs/2509.22681

@arXiv_csPF_bot@mastoxiv.page
2025-08-25 07:41:10

GreenLLM: SLO-Aware Dynamic Frequency Scaling for Energy-Efficient LLM Serving
Qunyou Liu, Darong Huang, Marina Zapater, David Atienza
arxiv.org/abs/2508.16449

@arXiv_csLG_bot@mastoxiv.page
2025-09-30 14:38:21

Intra-request branch orchestration for efficient LLM reasoning
Weifan Jiang, Rana Shahout, Yilun Du, Michael Mitzenmacher, Minlan Yu
arxiv.org/abs/2509.24957

@arXiv_csDC_bot@mastoxiv.page
2025-09-30 10:43:21

RServe: Overlapping Encoding and Prefill for Efficient LMM Inference
Tianyu Guo, Tianming Xu, Xianjie Chen, Junru Chen, Nong Xiao, Xianwei Zhang
arxiv.org/abs/2509.24381

@servelan@newsie.social
2025-09-23 22:13:25

"In 2024, the group focused on recruiting and retention, women serving on submarines and mothers reintegrating to military life after pregnancy"
Pentagon shutters women’s advisory group
taskandpurpose.com/news/defens

@Mediagazer@mstdn.social
2025-10-24 11:21:05

National Federation of Community Broadcasters gets a $1.25M MacArthur grant; NFCB represents ~200 stations mostly serving rural and underrepresented communities (Austin Fuller/Current)
current.org/2025/10/national-f

@arXiv_csNE_bot@mastoxiv.page
2025-09-29 07:36:15

Cycle is All You Need: More Is Different
Xin Li
arxiv.org/abs/2509.21340 arxiv.org/pdf/2509.21340

@arXiv_csAI_bot@mastoxiv.page
2025-09-26 09:37:11

Embodied AI: From LLMs to World Models
Tongtong Feng, Xin Wang, Yu-Gang Jiang, Wenwu Zhu
arxiv.org/abs/2509.20021 arxiv.org/pdf/2509.20021

@memeorandum@universeodon.com
2025-08-27 17:40:55

DHS moves to bar aid groups from serving undocumented immigrants (Brianna Sacks/Washington Post)
washingtonpost.com/weather/202
memeorandum.com/250827/p96#a25

@Techmeme@techhub.social
2025-10-21 15:25:51

Dario Amodei addresses "inaccurate claims" about Anthropic's policy stances after David Sacks said the "real issue" is "Anthropic's agenda to backdoor Woke AI" (Ashley Capoot/CNBC)
cnbc.com/2025/10/21/anthropic-

@arXiv_csSE_bot@mastoxiv.page
2025-10-14 10:12:48

Grounded AI for Code Review: Resource-Efficient Large-Model Serving in Enterprise Pipelines
Sayan Mandal, Hua Jiang
arxiv.org/abs/2510.10290

@metacurity@infosec.exchange
2025-10-20 10:46:14

The official Xubuntu website was compromised over the weekend (18/19 October 2025) briefly serving up Windows malware to users trying to download the distro.
omgubuntu.co.uk/2025/10/xubunt

@arXiv_quantph_bot@mastoxiv.page
2025-08-27 10:18:03

Optimal quantum simulation of linear non-unitary dynamics
Guang Hao Low, Rolando D. Somma
arxiv.org/abs/2508.19238 arxiv.org/pdf/2508.19238…

@arXiv_csDC_bot@mastoxiv.page
2025-08-26 09:56:36

ExpertWeave: Efficiently Serving Expert-Specialized Fine-Tuned Adapters at Scale
Ge Shi, Hanieh Sadri, Qian Wang, Yu Zhang, Ying Xiong, Yong Zhang, Zhenan Fan
arxiv.org/abs/2508.17624

@arXiv_csPF_bot@mastoxiv.page
2025-08-26 08:15:26

Systematic Characterization of LLM Quantization: A Performance, Energy, and Quality Perspective
Tianyao Shi, Yi Ding
arxiv.org/abs/2508.16712

@arXiv_csLG_bot@mastoxiv.page
2025-08-29 10:12:31

Structure-aware Hypergraph Transformer for Diagnosis Prediction in Electronic Health Records
Haiyan Wang, Ye Yuan
arxiv.org/abs/2508.20500

@arXiv_csDC_bot@mastoxiv.page
2025-10-01 08:44:47

Parallax: Efficient LLM Inference Service over Decentralized Environment
Chris Tong, Youhe Jiang, Gufeng Chen, Tianyi Zhao, Sibian Lu, Wenjie Qu, Eric Yang, Lynn Ai, Binhang Yuan
arxiv.org/abs/2509.26182

@Techmeme@techhub.social
2025-10-20 02:35:34

Alibaba Cloud details a GPU pooling system that it claims reduced the number of Nvidia H20 required by 82% when serving dozens of LLMs of up to 72B parameters (Vincent Chow/South China Morning Post)
scmp.com/business/article/3329

@servelan@newsie.social
2025-08-23 17:24:56

Justice Department says U.S. won't defend grants for Hispanic-serving colleges, calling them unconstitutional - CBS News
cbsnews.com/news/justice-depar

@arXiv_csCV_bot@mastoxiv.page
2025-09-25 08:26:42

Synthesizing Artifact Dataset for Pixel-level Detection
Dennis Menn, Feng Liang, Diana Marculescu
arxiv.org/abs/2509.19589 arxiv.org/pdf/25…

@arXiv_csAI_bot@mastoxiv.page
2025-09-25 09:08:42

Embodied AI: From LLMs to World Models
Tongtong Feng, Xin Wang, Yu-Gang Jiang, Wenwu Zhu
arxiv.org/abs/2509.20021 arxiv.org/pdf/2509.20021

@arXiv_csDC_bot@mastoxiv.page
2025-08-25 07:36:30

HyperFlexis: Joint Design of Algorithms and Systems for Multi-SLO Serving and Fast Scaling
Zahra Yousefijamarani, Xinglu Wang, Qian Wang, Morgan Lindsay Heisler, Taha Shabani, Niloofar Gholipour, Parham Yassini, Hong Chang, Kan Chen, Qiantao Zhang, Xiaolong Bai, Jiannan Wang, Ying Xiong, Yong Zhang, Zhenan Fan
arxiv.org/abs/2508.15…

@Mediagazer@mstdn.social
2025-08-19 08:26:02

City Matters, a free monthly newspaper serving the City of London since 2016, enters voluntary liquidation, citing rising print costs and declining ad revenue (Alice Brooker/Press Gazette)
pressgazette.co.uk/publishers/

A senior CIA officer who oversaw Russia analysis
has been stripped of her security clearance,
part of a sweeping removal of
37 serving and former officials accused of "betray[ing] their oath to the Constitution," the Economist reported on Aug. 21.
The officer, who served as the CIA’s top Russia and Eurasia analyst during the 2016 election
and helped produce the report detailing Moscow’s interference on behalf of Donald Trump,
was among the most s…

@arXiv_csDC_bot@mastoxiv.page
2025-09-23 09:52:40

Expert-as-a-Service: Towards Efficient, Scalable, and Robust Large-scale MoE Serving
Ziming Liu, Boyu Tian, Guoteng Wang, Zhen Jiang, Peng Sun, Zhenhua Han, Tian Tang, Xiaohe Hu, Yanmin Jia, Yan Zhang, He Liu, Mingjun Zhang, Yiqi Zhang, Qiaoling Chen, Shenggan Cheng, Mingyu Gao, Yang You, Siyuan Feng
arxiv.org/abs/2509.17863

@memeorandum@universeodon.com
2025-08-22 21:46:02

Justice Dept. declines to defend grants for Hispanic-serving colleges, calling them unconstitutional (Associated Press)
apnews.com/article/hispanic-co
memeorandum.com/250822/p117#a2

@arXiv_csDC_bot@mastoxiv.page
2025-08-26 07:58:46

Equinox: Holistic Fair Scheduling in Serving Large Language Models
Zhixiang Wei, James Yen, Jingyi Chen, Ziyang Zhang, Zhibai Huang, Chen Chen, Xingzi Yu, Yicheng Gu, Chenggang Wu, Yun Wang, Mingyuan Xia, Jie Wu, Hao Wang, Zhengwei Qi
arxiv.org/abs/2508.16646

@arXiv_csLG_bot@mastoxiv.page
2025-08-25 10:01:10

TinyML Towards Industry 4.0: Resource-Efficient Process Monitoring of a Milling Machine
Tim Langer, Matthias Widra, Volkhard Beyer
arxiv.org/abs/2508.16553

@arXiv_csDC_bot@mastoxiv.page
2025-08-27 08:25:23

Strata: Hierarchical Context Caching for Long Context Language Model Serving
Zhiqiang Xie, Ziyi Xu, Mark Zhao, Yuwei An, Vikram Sharma Mailthody, Scott Mahlke, Michael Garland, Christos Kozyrakis
arxiv.org/abs/2508.18572

@arXiv_csDC_bot@mastoxiv.page
2025-08-26 08:49:06

TokenLake: A Unified Segment-level Prefix Cache Pool for Fine-grained Elastic Long-Context LLM Serving
Bingyang Wu, Zili Zhang, Yinmin Zhong, Guanzhe Huang, Yibo Zhu, Xuanzhe Liu, Xin Jin
arxiv.org/abs/2508.17219

@arXiv_csDC_bot@mastoxiv.page
2025-08-29 08:06:41

Predictable LLM Serving on GPU Clusters
Erfan Darzi, Shreeanant Bharadwaj, Sree Bhargavi Balija
arxiv.org/abs/2508.20274 arxiv.org/pdf/2508…

@arXiv_csDC_bot@mastoxiv.page
2025-09-23 09:49:10

Disaggregated Prefill and Decoding Inference System for Large Language Model Serving on Multi-Vendor GPUs
Xing Chen, Rong Shi, Lu Zhao, Lingbin Wang, Xiao Jin, Yueqiang Chen, Hongfeng Sun
arxiv.org/abs/2509.17542

@arXiv_csDC_bot@mastoxiv.page
2025-09-23 07:58:50

ShadowServe: Interference-Free KV Cache Fetching for Distributed Prefix Caching
Xingyu Xiang, Raj Joshi, Yuhan Liu, Jiayi Yao, Chenxingyu Zhao, Junchen Jiang, Yang Zhou, Eddie Kohler, Minlan Yu
arxiv.org/abs/2509.16857

@arXiv_csDC_bot@mastoxiv.page
2025-08-28 08:04:11

Taming the Chaos: Coordinated Autoscaling for Heterogeneous and Disaggregated LLM Inference
Rongzhi Li, Ruogu Du, Zefang Chu, Sida Zhao, Chunlei Han, Zuocheng Shi, Yiwen Shao, Huanle Han, Long Huang, Zherui Liu, Shufan Liu
arxiv.org/abs/2508.19559

@arXiv_csDC_bot@mastoxiv.page
2025-10-15 07:37:21

FlexPipe: Adapting Dynamic LLM Serving Through Inflight Pipeline Refactoring in Fragmented Serverless Clusters
Yanying Lin, Shijie Peng, Chengzhi Lu, Chengzhong Xu, Kejiang Ye
arxiv.org/abs/2510.11938

@arXiv_csDC_bot@mastoxiv.page
2025-09-08 07:38:19

VoltanaLLM: Feedback-Driven Frequency Control and State-Space Routing for Energy-Efficient LLM Serving
Jiahuan Yu (University of Illinois Urbana-Champaign), Aryan Taneja (University of Illinois Urbana-Champaign), Junfeng Lin (Tsinghua University), Minjia Zhang (University of Illinois Urbana-Champaign)
arxiv.org/abs/2509.04827

@arXiv_csDC_bot@mastoxiv.page
2025-09-11 09:11:33

Hetis: Serving LLMs in Heterogeneous GPU Clusters with Fine-grained and Dynamic Parallelism
Zizhao Mo, Jianxiong Liao, Huanle Xu, Zhi Zhou, Chengzhong Xu
arxiv.org/abs/2509.08309

@arXiv_csDC_bot@mastoxiv.page
2025-08-06 09:00:20

Block: Balancing Load in LLM Serving with Context, Knowledge and Predictive Scheduling
Wei Da, Evangelia Kalyvianaki
arxiv.org/abs/2508.03611

@arXiv_csDC_bot@mastoxiv.page
2025-08-12 09:21:43

Kairos: Low-latency Multi-Agent Serving with Shared LLMs and Excessive Loads in the Public Cloud
Jinyuan Chen, Jiuchen Shi, Quan Chen, Minyi Guo
arxiv.org/abs/2508.06948