
2025-06-13 08:31:00
Data-driven balanced truncation for second-order systems with generalized proportional damping
Sean Reiter, Steffen W. R. Werner
https://arxiv.org/abs/2506.10118
Terabyte-Scale Analytics in the Blink of an Eye
Bowen Wu, Wei Cui, Carlo Curino, Matteo Interlandi, Rathijit Sen
https://arxiv.org/abs/2506.09226
“Move fast and break things.”
The things:
- Human rights
- Our habitat
- Democracy
#BigTech #SiliconValley #ventureCapital
Replaced article(s) found for cs.CL. https://arxiv.org/list/cs.CL/new
[1/5]:
- Cheap Learning: Maximising Performance of Language Models for Social Data Science Using Minimal Data
Castro-Gonzalez, Chung, Kirk, Francis, Williams, Johansson, Bright
Wow "individualised pricing", such fun. I guess this boils down to: what's the most we can charge you, personally, based on what dirt we have on you:
https://
#Microsoft outsourced administration of classified #DoD data to cheap workers in #China. 🇨🇳 🕵️
My latest update on
Dehazing Light Microscopy Images with Guided Conditional Flow Matching: finding a sweet spot between fidelity and realism
Anirban Ray, Ashesh, Florian Jug
https://arxiv.org/abs/2506.22397
To add a single example here (feel free to chime in with your own):
Problem: editing code is sometimes tedious because external APIs require boilerplate.
Solutions:
- Use LLM-generated code. Downsides: energy use, code theft, potential for legal liability, makes mistakes, etc. Upsides: popular among some peers, seems easy to use.
- Pick a better library (not always possible).
- Build internal functions to centralize boilerplate code, then use those (benefits: you get a better understanding of the external API, and a more-unit-testable internal code surface; probably less amortized effort).
- Develop a non-LLM system that actually reasons about code at something like the formal semantics level and suggests boilerplate fill-ins based on rules, while foregrounding which rules it's applying so you can see the logic behind the suggestions (needs research).
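As a hypothetical sketch of the third option above: suppose an external API (here the stdlib's `urllib.parse`, standing in for any boilerplate-heavy dependency) requires several calls to do one common thing. A small internal helper centralizes that boilerplate and gives you a unit-testable surface. The helper name `with_query` and the example URL are illustrative assumptions, not from the original post.

```python
# Hypothetical sketch: centralize boilerplate from an external API
# (urllib.parse, standing in for any verbose dependency) behind one
# small internal helper, so call sites stay short and testable.
from urllib.parse import urlencode, urlsplit, urlunsplit

def with_query(url, **params):
    """Return `url` with `params` merged into its query string."""
    parts = urlsplit(url)                 # split once, reuse all pieces
    extra = urlencode(params)             # encode the new parameters
    merged = f"{parts.query}&{extra}" if parts.query else extra
    return urlunsplit(parts._replace(query=merged))

# Call sites now need one line instead of the split/encode/join dance:
print(with_query("https://example.com/a?x=1", y=2))
```

Besides shortening call sites, the helper is a single place to test the merge logic, exactly the "more-unit-testable internal code surface" benefit the list describes.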
Obviously LLM use in coding goes beyond this single issue, but a similar analysis applies to each potential use of LLMs in coding. In all cases there are:
1. Existing practical solutions that require more effort (or in many cases only seem to, and take less effort once amortized).
2. Near-term researchable solutions that directly address the problem and which would be much more desirable in the long term.
Thus, in addition to the disastrous effects of LLMs on the climate, on data laborers, and on the digital commons, they tend to suck us into cheap-seeming but ultimately costly design practices while also crowding out better long-term solutions. Next time someone suggests how useful LLMs are for some task, try asking yourself (or them) what an ideal solution for that task would look like, and whether LLM use moves us closer to or farther from a world in which that solution exists.