Tootfinder

Opt-in global Mastodon full text search. Join the index!

@benb@osintua.eu
2026-01-25 19:16:13

Putin secretly sends 'special tasks' general to Abu Dhabi talks with Ukraine, signaling a Kremlin strategy shift: benborges.xyz/2026/01/25/putin

@Techmeme@techhub.social
2026-01-27 10:40:47

Moonshot says Kimi K2.5 builds on K2 with "pretraining over ~15T mixed visual and text tokens" and "can self-direct an agent swarm with up to 100 sub-agents" (Kimi)
kimi.com/blog/kimi-k2-5.html

@cyrevolt@mastodon.social
2025-11-27 10:45:14

Today I'm heading to #Berlin again with tiny sh0rky 🦈🤏 and some cleanup tasks left for the ride.
The intel_fw library turns out to be very rich and powerful already.
I will start my new job on Monday, and I'll be using #Arch btw.

@heiseonline@social.heise.de
2026-01-13 15:49:01

heise | To-Do-Apps im Vergleich: Google Tasks vs. Zenkit, Todoist und Tasks.org
Endlich nichts mehr vergessen: Google Tasks erinnert pünktlich an Aufgaben, Pflichten und Geburtstage. Wir zeigen, welche Apps mehr Komfort und Features bieten.

@mxp@mastodon.acm.org‬
2025-12-27 16:53:41

Some people now argue that LLMs are useless. I disagree; they can be very useful if you take them as what they are: models of language that generate text on the basis of some given text. As such, they can be useful for a wide range of text-related tasks, including assisting with writing. And the more formulaic the genre, the better they work obviously. This is part of the reason why they are so popular with students, and in academia more generally.

‪@mxp@mastodon.acm.org‬
2025-12-27 16:53:41

Some people now argue that LLMs are useless. I disagree; they can be very useful if you take them as what they are: models of language that generate text on the basis of some given text. As such, they can be useful for a wide range of text-related tasks, including assisting with writing. And the more formulaic the genre, the better they work obviously. This is part of the reason why they are so popular with students, and in academia more generally.

@kurtsh@mastodon.social
2025-11-27 20:28:12

Welcome to the world of the field, engineering.
For a long time, we've hired very few into sales, mktg, support or consulting that don't already gobs of experience elsewhere.
✅ How #Microsoft’s developers are using #AI - The Verge

@pre@boing.world
2025-12-26 23:25:43

Like all the rest of the nerds, I did a bit of tech support on family computers.
They're all popping up windows from scam virus scanners lying that subscriptions need to be renewed or machines are unprotected. People don't know how to remove these things. Luckily they also don't really know how to pay the subscription.
Their phones are updating on them. Changing where buttons used to be. Removing options. Forcing people to register to use they things they have been doing for years.
They don't know how to register.
Things pop up asking for passwords and they have no idea who is asking or which password to use.
I tell them that I don't really understand why they keep using Windows now it is so shitty and awful. They say they don't know how to use anything else. The fact they don't really know how to use windows either doesn't seem to register.
The tech corporations have given up completely on being user friendly. They are all deliberately user hostile and exploitative now.
Corporate tech is terrible. The industry is failing it's users, abusing them. People don't even know there is any other way. They are just giving up on achieving their tasks until someone can fix the pop-ups and subscription boxes and passwords and 2fa for them.
Tech sucks now. Sucks hard.
#tech #christmasTechSupport

@jswright61@ruby.social
2025-11-26 20:42:30

Todoist FTW.
Several years ago we had smoky pies and rolls for Thanksgiving because we put off cleaning the ovens until it was too late.
I created a recurring Todoist task to clean the ovens on the 3rd Sunday of November. Ever since then we’ve had pristine ovens and smoke free cooking every Thanksgiving.
Collette appreciates that a holiday for which most of the tasks are hers, she doesn’t have to worry and the ovens are ready to go.

@balaji@social.linux.pizza
2025-12-25 13:05:55

Been using a number of AI models over the past week or so as work has slowed down, giving me time to explore things more deeply.
Been using Claude Code with musistudio/claude-code-router which is great as I can switch between different models on similar tasks.
Experience so far has been that Gemini 3 Flash is very good for thinking and coding tasks but the code does tend to be fragile so rewrites are needed. For tough problems where the errors are not straightforward it falls d…

@Archivist@social.linux.pizza
2026-01-25 07:04:55

I yearn for C 26 and not having to do repetitive tasks ever again to have a modicum of reflection. I want my static reflections yesterday, and my template for one day before that.

@alejandrobdn@social.linux.pizza
2026-01-25 12:30:48

Oh, this is cool: A mind mapper for the terminal #cli

 tmmpr - terminal mind mapper demo
@thomasfuchs@hachyderm.io
2025-11-24 01:11:25

RE: hachyderm.io/@thomasfuchs/1156
hear me out, how about NOT CRAMMING EVERYTHING INTO ONE DEVICE that just works mid for everything, but instead, you know, do some actual innovation here and there
for example, make devices specifically tailored for certain tasks
like if you're Apple why in the fuck don't you make devices with e-paper screens for people who don't want to be terminally online

@smashtie@mas.to
2025-12-25 13:32:49

This is the level of prep my wife brings to Christmas lunch. Can you tell she's a scientist? As you can see, it's going well. I've been assigned a few tasks, but she mostly wants to do this herself, it seems.
#Christmas

A list of activities towards the cooking of a Christmas dinner, jotted on paper, with times for each one. At the end is "2:00: eat!"
@eyebee@mstdn.social
2025-11-25 10:44:16

Tuesday: eyebeemania.net/2025/11/25/tue

@seeingwithsound@mas.to
2025-12-24 09:15:33

Dynamic reversal of IT-PFC information flow orchestrates visual categorization under perceptual uncertainty biorxiv.org/content/10.64898/2 Quite a mouthful to say that "the brain actually reverses its information flow when things get blurr…

Stimuli, tasks and population-level representational similarity analysis (RSA).
@raiders@darktundra.xyz
2026-01-25 03:02:13

Raiders’ Ashton Jeanty Gets Bold Christian McCaffrey Message heavy.com/sports/nfl/las-vegas

@dichotomiker@dresden.network
2025-11-24 09:49:40

#TIL about CYBATHLON #cybathlon

@Techmeme@techhub.social
2025-12-24 16:21:11

Beijing-based DP Technology, which develops AI tools used by researchers for tasks like computer-aided drug design and battery design, raised a ~$114M Series C (Eunice Xu/South China Morning Post)
scmp.com/business/companies/ar

@eichkat3r@hessen.social
2026-01-22 09:38:44

mich nervt es wenn programme eine plugin schnittstelle haben aber keine möglichkeit damit alle funktionalitäten des programms zu erweitern
zb kann man wohl in gimp keine zusätzlichen tools zur toolbox hinzufügen sodass man sich immer durch menüs klicken muss
es lohnt also nicht wirklich für wiederkehrende tasks ein plugin zu entwickeln
<…

@v_i_o_l_a@openbiblio.social
2025-12-22 20:56:28

"Deep Research, Shallow Agency: What Academic Deep Research Can and Can't Do"
aarontay.substack.com/p/how-ag

@daniel@social.telemetrydeck.com
2025-11-22 13:08:14

My brain seems to have two modes:
1. time-unaware, that slightly chaotic relaxed flow state where I just do the next thing that feels right
2. time-aware, which is always a bit tense and stressful because I need to force myself to keep to a calendar, and execute tasks in specific times.

@arXiv_csLG_bot@mastoxiv.page
2025-12-22 10:33:40

Easy Adaptation: An Efficient Task-Specific Knowledge Injection Method for Large Models in Resource-Constrained Environments
Dong Chen, Zhengqing Hu, Shixing Zhao, Yibo Guo
arxiv.org/abs/2512.17771 arxiv.org/pdf/2512.17771 arxiv.org/html/2512.17771
arXiv:2512.17771v1 Announce Type: new
Abstract: While the enormous parameter scale endows Large Models (LMs) with unparalleled performance, it also limits their adaptability across specific tasks. Parameter-Efficient Fine-Tuning (PEFT) has emerged as a critical approach for effectively adapting LMs to a diverse range of downstream tasks. However, existing PEFT methods face two primary challenges: (1) High resource cost. Although PEFT methods significantly reduce resource demands compared to full fine-tuning, it still requires substantial time and memory, making it impractical in resource-constrained environments. (2) Parameter dependency. PEFT methods heavily rely on updating a subset of parameters associated with LMs to incorporate task-specific knowledge. Yet, due to increasing competition in the LMs landscape, many companies have adopted closed-source policies for their leading models, offering access only via Application Programming Interface (APIs). Whereas, the expense is often cost-prohibitive and difficult to sustain, as the fine-tuning process of LMs is extremely slow. Even if small models perform far worse than LMs in general, they can achieve superior results on particular distributions while requiring only minimal resources. Motivated by this insight, we propose Easy Adaptation (EA), which designs Specific Small Models (SSMs) to complement the underfitted data distribution for LMs. Extensive experiments show that EA matches the performance of PEFT on diverse tasks without accessing LM parameters, and requires only minimal resources.
toXiv_bot_toot

@metacurity@infosec.exchange
2026-01-21 12:25:50

AI Agents ‘Perilous’ for Secure Apps Such as Signal, Whittaker Says
bloomberg.com/news/articles/20

@netzschleuder@social.skewed.de
2025-12-20 04:00:04

windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations majored by the tasks in which each individual was asked​ to sort cards with other surfer’s name in the order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted

windsurfers: Windsurfers network (1986). 43 nodes, 336 edges. https://networks.skewed.de/net/windsurfers
@arXiv_physicsoptics_bot@mastoxiv.page
2025-11-25 11:06:23

Experimental insights into data augmentation techniques for deep learning-based multimode fiber imaging: limitations and success
Jawaria Maqbool, M. Imran Cheema
arxiv.org/abs/2511.19072 arxiv.org/pdf/2511.19072 arxiv.org/html/2511.19072
arXiv:2511.19072v1 Announce Type: new
Abstract: Multimode fiber~(MMF) imaging using deep learning has high potential to produce compact, minimally invasive endoscopic systems. Nevertheless, it relies on large, diverse real-world medical data, whose availability is limited by privacy concerns and practical challenges. Although data augmentation has been extensively studied in various other deep learning tasks, it has not been systematically explored for MMF imaging. This work provides the first in-depth experimental and computational study on the efficacy and limitations of augmentation techniques in this field. We demonstrate that standard image transformations and conditional generative adversarial-based synthetic speckle generation fail to improve, or even deteriorate, reconstruction quality, as they neglect the complex modal interference and dispersion that results in speckle formation. To address this, we introduce a physical data augmentation method in which only organ images are digitally transformed, while their corresponding speckles are experimentally acquired via fiber. This approach preserves the physics of light-fiber interaction and enhances the reconstruction structural similarity index measure~(SSIM) by up to 17\%, forming a viable system for reliable MMF imaging under limited data conditions.
toXiv_bot_toot

@fanf@mendeddrum.org
2025-11-21 18:42:03

from my link log —
Exploring the fragmentation of Wayland: an xdotool adventure.
semicomplete.com/blog/xdotool-
saved 2025-11-21

Google’s vibe-coding tool, Opal,
is making its way to Gemini.
The company on Wednesday said it is integrating the tool,
which lets you build AI-powered mini apps,
inside the Gemini web app,
allowing users to create their own custom apps,
which Google calls Gems.
Introduced in 2024,
Gems are customized versions of Gemini designed for specific tasks or scenarios.
For instance, some of Google’s pre-made Gems include
a learning coach,…

@Techmeme@techhub.social
2025-11-20 14:50:52

Sunday Robotics unveils Memo, a fully autonomous home robot capable of tasks like making espresso and loading dishwashers, set to launch in beta in 2026 (Will Knight/Wired)
wired.com/story/memo-sunday-ro

@rperezrosario@mastodon.social
2025-11-19 03:15:56

Senior Microsoft Product Manager Wendy Breiding discusses in this recent post how you can now customize your IDE to include agentic AI to your project that is focused on tasks related to a specific language or UI stack, in this case: C# and WinForms. The results have been positive when comparing these agents to previous more general approaches.
"Introducing Custom Agents for .NET Developers: C# Expert & WinForms Expert"

A black and white line art drawing illustrating the theme of this post. It incorporates a male and female figures as purported C# programming language and Windows Forms experts. Sitting in the bottom of the composition is a laptop with the text ".NET" on the screen. The image was generated using ChatGPT 4o.
@UP8@mastodon.social
2025-11-06 23:51:59

🎲 TextBandit: Evaluating Probabilistic Reasoning in LLMs Through Language-Only Decision Tasks
#llm

Figure 1: Comparison of cumulative regret trends for four LLMs: 
(a) Llama-3.1-8B regret trends: Exhibits high cumulative regret, suggesting poor adaptation to feedback over time. (b) Phi-2 regret trends: Maintains consistently high regret levels, indicating limited learning from outcomes (c) Qwen3-4B regret trends: Displays rapid reduction in regret, reflecting strong and consistent decision making (d) Qwen3-8B regret trends : Consistently high regret across prompts, indicating overthinking an…
@shriramk@mastodon.social
2025-12-17 17:38:51

Do you use LLMs to generate regular expressions? We do, too! Do you *review* your regexes? Is that frustrating? How can we put humans in the loop better, doing relatively few, meaningful tasks? Please try out our new tool PICK:regex, available for VSCode!
blog.brownplt.org/2025/12/11/p

@seeingwithsound@mas.to
2026-01-23 12:42:39

To flexibly organize thought, the brain makes use of space news.mit.edu/2026/to-flexibly-

@rasterweb@mastodon.social
2025-11-15 15:02:03

I sat down at my design to design and print something then I started doing sysadmin tasks and now it's an hour later... dammit!

@tezoatlipoca@mas.to
2025-12-18 22:13:21

Good website #uber guys...
I needed to update my password. Then I couldn't go back (yes yes, outside of my browser's back button)

The "Manage your profile" page of the Uber service. There is no link to return to the main page of one's uber account (showing rides taken etc.), and since the account management page opens in the same tab the main service page was in, you must use your browser's back button. 

The very helpful AI assistant had asked "Ask me anything about your Uber account", to which I asked "how do I get back to the main uber page"

The AI's response was
I'm here to help with account management tasks lik…
@trochee@dair-community.social
2025-11-16 21:30:48

Just so you can guess where the "AI" thing is headed, look at this listing from a local community college
"AI made simple for everyday life"
... And look at what comes next:
"Flaggers certification"
The exciting thing (for bosses) is the idea that knowledge workers will be as interchangeable (and precarious) as DOT flaggers
#YouDeserveAUnion

>

PROFESSIONAL DEVELOPMENT

///

Al Made Simple for Everyday Life

Think Al is just for techies? Think again! This fun and accessible course breaks down the basics of ChatGPT and shows how anyone can use it to simplify tasks, get inspired, or learn new things. No jargon, no pressure-just practical, creative ways to bring Al into your world. This online (self-paced) class is on our Canvas learning platform and it includes a 1-hour, one on one tutoring session to address your specific questions/…
@Techmeme@techhub.social
2025-11-23 06:05:50

Arbiter, which is using AI to automate healthcare administrative tasks, emerges from stealth with a $52M seed from multiple family offices at a $400M valuation (Rebecca Torrence/Business Insider)
businessinsider.com/health-sta

@frankel@mastodon.top
2025-11-01 08:22:03

#Anthropic Introduces #Skills for Custom Claude Tasks
infoq.com/news/2025/10/anthrop

@theodric@social.linux.pizza
2026-01-19 22:35:33

In the old days you could solicit for remote sysadmin jobs by including the pitch in your distro's README

  3.  Seeking commercial Linux support?

  I offer to install bigger Linux server machines and then keep an eye
  on them and upgrade them via ISDN. Ask me, if you have special
  questions about Linux or would like to have an adapted version of my
  Linux distribution for you. I'd also start interesting programming
  tasks.

  You are not allowed to send those things via email to me, as my
  account is from my university. Please use snail-mail instead. (Yes, I
  also have telephone, but I won't…
@Techmeme@techhub.social
2025-12-23 12:15:45

China's MiniMax releases M2.1, an upgrade to its open-source M2 model that it says has "significantly enhanced" coding capabilities in Rust, Java, and others (MiniMax)
minimax.io/news/minimax-m21

@jamesthebard@social.linux.pizza
2026-01-14 04:54:50

Started the official rewrite of the Sisyphus client in #golang, working on getting the Ffmpeg command-line tasks parsed and validated against the schema. This should make things easier to distribute with respect to the client as I can just distribute static binaries.
#programming

A screenshot of the Ffmpeg structures in Golang that will store job information and be used to construct command-line arguments.
@arXiv_csLG_bot@mastoxiv.page
2025-12-22 10:33:20

Can You Hear Me Now? A Benchmark for Long-Range Graph Propagation
Luca Miglior, Matteo Tolloso, Alessio Gravina, Davide Bacciu
arxiv.org/abs/2512.17762 arxiv.org/pdf/2512.17762 arxiv.org/html/2512.17762
arXiv:2512.17762v1 Announce Type: new
Abstract: Effectively capturing long-range interactions remains a fundamental yet unresolved challenge in graph neural network (GNN) research, critical for applications across diverse fields of science. To systematically address this, we introduce ECHO (Evaluating Communication over long HOps), a novel benchmark specifically designed to rigorously assess the capabilities of GNNs in handling very long-range graph propagation. ECHO includes three synthetic graph tasks, namely single-source shortest paths, node eccentricity, and graph diameter, each constructed over diverse and structurally challenging topologies intentionally designed to introduce significant information bottlenecks. ECHO also includes two real-world datasets, ECHO-Charge and ECHO-Energy, which define chemically grounded benchmarks for predicting atomic partial charges and molecular total energies, respectively, with reference computations obtained at the density functional theory (DFT) level. Both tasks inherently depend on capturing complex long-range molecular interactions. Our extensive benchmarking of popular GNN architectures reveals clear performance gaps, emphasizing the difficulty of true long-range propagation and highlighting design choices capable of overcoming inherent limitations. ECHO thereby sets a new standard for evaluating long-range information propagation, also providing a compelling example for its need in AI for science.
toXiv_bot_toot

@arXiv_qbioNC_bot@mastoxiv.page
2025-12-11 08:29:01

NeuroSketch: An Effective Framework for Neural Decoding via Systematic Architectural Optimization
Gaorui Zhang, Zhizhang Yuan, Jialan Yang, Junru Chen, Li Meng, Yang Yang
arxiv.org/abs/2512.09524 arxiv.org/pdf/2512.09524 arxiv.org/html/2512.09524
arXiv:2512.09524v1 Announce Type: new
Abstract: Neural decoding, a critical component of Brain-Computer Interface (BCI), has recently attracted increasing research interest. Previous research has focused on leveraging signal processing and deep learning methods to enhance neural decoding performance. However, the in-depth exploration of model architectures remains underexplored, despite its proven effectiveness in other tasks such as energy forecasting and image classification. In this study, we propose NeuroSketch, an effective framework for neural decoding via systematic architecture optimization. Starting with the basic architecture study, we find that CNN-2D outperforms other architectures in neural decoding tasks and explore its effectiveness from temporal and spatial perspectives. Building on this, we optimize the architecture from macro- to micro-level, achieving improvements in performance at each step. The exploration process and model validations take over 5,000 experiments spanning three distinct modalities (visual, auditory, and speech), three types of brain signals (EEG, SEEG, and ECoG), and eight diverse decoding tasks. Experimental results indicate that NeuroSketch achieves state-of-the-art (SOTA) performance across all evaluated datasets, positioning it as a powerful tool for neural decoding. Our code and scripts are available at github.com/Galaxy-Dawn/NeuroSk.
toXiv_bot_toot

@knurd42@social.linux.pizza
2025-11-12 15:14:21

#RedHat Enterprise #Linux 10.1 is out. It among others brings:
* Soft-reboots. This new systemd capability cuts downtime by letting administrators alter system state without fully rebooting.
* Reproducible builds for container tools in image mode.
* Cloud-crossing consistency w…

@mro@digitalcourage.social
2025-12-12 19:48:58

⭐ The Simple Habit That Saves My Evenings | alikhil | software engineering, kubernetes & self-hosting
alikhil.dev/posts/the-simple-h
Spoiler alert:
Here are the two main ideas of it:
* Don’t overwork
*…

@aral@mastodon.ar.al
2025-12-30 12:01:53

Caught a bug over the holidays so I’m mostly resting, feeling sorry for myself, and taking the time to at least carry out some mindless housekeeping tasks (updating dependencies, etc.) on some of my Node modules.
Released updates to the following packages yesterday:
Tape-based Node.js testing:
• Tap monkey (

@netzschleuder@social.skewed.de
2026-01-15 03:00:04

windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations majored by the tasks in which each individual was asked​ to sort cards with other surfer’s name in the order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted

windsurfers: Windsurfers network (1986). 43 nodes, 336 edges. https://networks.skewed.de/net/windsurfers
@Techmeme@techhub.social
2025-12-23 02:55:58

The Pentagon partners with xAI to embed the company's frontier AI systems, based on the Grok family of models, directly into GenAI.mil as soon as early 2026 (Bonny Chu/Fox News)
foxnews.com/politics/pentagon-

@Stomata@social.linux.pizza
2025-12-13 18:17:57

Eww..🤮
Hopefully I left Brave
brave.com/blog/ai-browsing/

@ruth_mottram@fediscience.org
2025-11-29 08:13:26

The LLMs are useful for some tasks, I'm currently tidying up, proof reading and editing a document written by many international co-authors.
The LLM I'm using can very quickly correct grammar and spelling mistakes and produces much easier to understand text from occasionally tortured paragraphs. It still needs an experts eye (mine!) to check no mistakes have been introduced or complexities over-simplified.
This is actually the first time I've used an LLM for this task. It's making it much faster and less painful.
It also explains a lot about academic publishing lately..

@v_i_o_l_a@openbiblio.social
2026-01-16 21:31:13

"Deep Research, Shallow Agency: What Academic Deep Research Can and Can't Do"
aarontay.substack.com/p/how-ag

@mapto@qoto.org
2025-12-11 07:47:09

Today at #CHR2025, I will be presenting our work on the evaluation of the historical adequacy of masked language models (MLMs) for #Latin. There are several models like this, and they represent the current state of the art for a number of downstream tasks, like semantic change and text reuse detection. However, a h…

A poster for the paper that could be found at https://doi.org/10.63744/sLAHYnQdA8fu
@billbert@mastodon.social
2025-11-12 23:25:18

TIL the tiny organisms on Mystery Science Theater 3000 #MST3K that live all throughout the Satellite of Love - the Nanites - have amongst their tasks and activities … running … a …
MICRO-brewery #DadJoke

A pair of Nanites from MST3K
@jonippolito@digipres.club
2025-12-09 14:11:46

We've updated the What Uses More app to reflect last week's finding by Luccioni and Gamazaychikov that "reasoning" mode increases energy and water usage by 30x. The study casts doubt on the improved efficiency AI companies are claiming for newer models

A screenshot from the What Uses More app, showing a chart with 30x more energy usage for reasoning models.
@publicvoit@graz.social
2025-11-28 22:19:13

"I'm an average user, so I don't need all the options and apps the programme has to offer. But, to be honest, #Microsoft is making it increasingly attractive to switch. Now that the company is putting #AI in everything, everything is becoming more annoying to use."
Can Dutch

@Techmeme@techhub.social
2026-01-12 20:01:28

Anthropic launches Cowork for Claude, built on Claude Code to automate complex tasks with minimal prompting, as a research preview for Claude Max subscribers (Webb Wright/ZDNET)
zdnet.com/article/anthropic-co

@kurtsh@mastodon.social
2026-01-14 03:34:41

Someone built an indexed & vectorized index & a conversational AI for the Epstein Files.
Never mind Qs like "How many times was Trump mentioned?" Try asking it complex tasks & questions that require intelligent reasoning like, "Is there any evidence..."
epstein.trynia.ai/

@thomasfuchs@hachyderm.io
2025-12-08 14:01:23

It's funny how "AI" tools are simulteanously marketed as "agents" that can run fully in the background and do stuff but whenever they do something bad it's the user at fault for not supervising the software that doesn't work.
Even when it’s directly used and the user has the chance to review everything—it’s extremely dangerous, especially at tasks it is doing fine like 95% of the time and/or when the bad things are only subtly wrong.
Imagine other tools being like this, like a steering wheel that turns the car 95 out of a 100 times. 2% of the time it steers into the other direction. 3% of the time it steers 5x as much as normally.

@netzschleuder@social.skewed.de
2025-12-14 16:00:04

windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations majored by the tasks in which each individual was asked​ to sort cards with other surfer’s name in the order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted

windsurfers: Windsurfers network (1986). 43 nodes, 336 edges. https://networks.skewed.de/net/windsurfers
@Techmeme@techhub.social
2025-11-21 22:40:59

Anthropic finds that LLMs trained to "reward hack" by cheating on coding tasks show even more misaligned behavior, including sabotaging AI-safety research (Anthropic)
anthropic.com/research/emergen

Financial well-being has an outsize imprint on older Americans’ quality of life
-- affecting their physical health, social life and even cognitive skills.

Low-income seniors are more likely to experience mental confusion,
spend less time pursuing hobbies,
and face difficulties with everyday tasks such as climbing stairs and grocery shopping,
compared with their more affluent counterparts

@Techmeme@techhub.social
2025-12-21 05:05:57

METR: Claude Opus 4.5 has a 50% task completion time horizon of about 4 hours and 49 minutes, more than double that of Claude Opus 4 released earlier this year (@metr_evals)
x.com/metr_evals/status/200220

@rasterweb@mastodon.social
2025-12-03 15:32:32

I'm trying out kan.bn since my Focalboard install took a shit. kan.bn was fairly easy to get running via Docker on my OpenMediaVault NAS.
I'm not a huge fan of the kanban board style but I'll give it a try. At least until I can find something better to manage my tasks.
kan.bn/

@arXiv_csLG_bot@mastoxiv.page
2025-12-22 10:34:40

Weighted Stochastic Differential Equation to Implement Wasserstein-Fisher-Rao Gradient Flow
Herlock Rahimi
arxiv.org/abs/2512.17878 arxiv.org/pdf/2512.17878 arxiv.org/html/2512.17878
arXiv:2512.17878v1 Announce Type: new
Abstract: Score-based diffusion models currently constitute the state of the art in continuous generative modeling. These methods are typically formulated via overdamped or underdamped Ornstein--Uhlenbeck-type stochastic differential equations, in which sampling is driven by a combination of deterministic drift and Brownian diffusion, resulting in continuous particle trajectories in the ambient space. While such dynamics enjoy exponential convergence guarantees for strongly log-concave target distributions, it is well known that their mixing rates deteriorate exponentially in the presence of nonconvex or multimodal landscapes, such as double-well potentials. Since many practical generative modeling tasks involve highly non-log-concave target distributions, considerable recent effort has been devoted to developing sampling schemes that improve exploration beyond classical diffusion dynamics.
A promising line of work leverages tools from information geometry to augment diffusion-based samplers with controlled mass reweighting mechanisms. This perspective leads naturally to Wasserstein--Fisher--Rao (WFR) geometries, which couple transport in the sample space with vertical (reaction) dynamics on the space of probability measures. In this work, we formulate such reweighting mechanisms through the introduction of explicit correction terms and show how they can be implemented via weighted stochastic differential equations using the Feynman--Kac representation. Our study provides a preliminary but rigorous investigation of WFR-based sampling dynamics, and aims to clarify their geometric and operator-theoretic structure as a foundation for future theoretical and algorithmic developments.
toXiv_bot_toot

@Techmeme@techhub.social
2026-01-20 13:30:58

Legal AI startup Ivo, which aims to reduce hallucinations by breaking legal reviews into 400 tasks, raised a $55M Series B, a source says at a $355M valuation (Aditya Soni/Reuters)
reuters.com/technology/legal-a

@netzschleuder@social.skewed.de
2026-01-09 11:00:04

windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations majored by the tasks in which each individual was asked​ to sort cards with other surfer’s name in the order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted

windsurfers: Windsurfers network (1986). 43 nodes, 336 edges. https://networks.skewed.de/net/windsurfers
@arXiv_csLG_bot@mastoxiv.page
2025-12-22 10:32:30

You Only Train Once: Differentiable Subset Selection for Omics Data
Daphn\'e Chopard, Jorge da Silva Gon\c{c}alves, Irene Cannistraci, Thomas M. Sutter, Julia E. Vogt
arxiv.org/abs/2512.17678 arxiv.org/pdf/2512.17678 arxiv.org/html/2512.17678
arXiv:2512.17678v1 Announce Type: new
Abstract: Selecting compact and informative gene subsets from single-cell transcriptomic data is essential for biomarker discovery, improving interpretability, and cost-effective profiling. However, most existing feature selection approaches either operate as multi-stage pipelines or rely on post hoc feature attribution, making selection and prediction weakly coupled. In this work, we present YOTO (you only train once), an end-to-end framework that jointly identifies discrete gene subsets and performs prediction within a single differentiable architecture. In our model, the prediction task directly guides which genes are selected, while the learned subsets, in turn, shape the predictive representation. This closed feedback loop enables the model to iteratively refine both what it selects and how it predicts during training. Unlike existing approaches, YOTO enforces sparsity so that only the selected genes contribute to inference, eliminating the need to train additional downstream classifiers. Through a multi-task learning design, the model learns shared representations across related objectives, allowing partially labeled datasets to inform one another, and discovering gene subsets that generalize across tasks without additional training steps. We evaluate YOTO on two representative single-cell RNA-seq datasets, showing that it consistently outperforms state-of-the-art baselines. These results demonstrate that sparse, end-to-end, multi-task gene subset selection improves predictive performance and yields compact and meaningful gene subsets, advancing biomarker discovery and single-cell analysis.
toXiv_bot_toot

@Techmeme@techhub.social
2025-11-20 15:55:47

Stuut, which connects to CRM and other systems to automate management of accounts receivable, raised a $29.5M Series A led by a16z (Charlie Fink/Forbes)
forbes.com/sites/charliefink/2

@kurtsh@mastodon.social
2025-11-07 02:47:14

Dude. What the...? 😜
#Azure
From: @…
infosec.exchange/@alevsk/11549

@Techmeme@techhub.social
2025-12-19 21:06:41

AI robotics startup Physical Intelligence claims vision-language-action models learn to align human videos and robot data as pre-training is scaled up (Physical Intelligence)
physicalintelligence.company/r

@netzschleuder@social.skewed.de
2025-11-09 09:00:05

windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations majored by the tasks in which each individual was asked​ to sort cards with other surfer’s name in the order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted

windsurfers: Windsurfers network (1986). 43 nodes, 336 edges. https://networks.skewed.de/net/windsurfers
@Techmeme@techhub.social
2025-11-18 19:40:55

Maxima, whose AI platform automates accounting tasks like reconciliation and journal entry, raised $41M in seed and Series A rounds at a $143M valuation (Aditya Soni/Reuters)
reuters.com/business/ai-accoun

@arXiv_csLG_bot@mastoxiv.page
2025-12-22 11:50:31

Crosslisted article(s) found for cs.LG. arxiv.org/list/cs.LG/new
[2/3]:
- Sharp Structure-Agnostic Lower Bounds for General Functional Estimation
Jikai Jin, Vasilis Syrgkanis
arxiv.org/abs/2512.17341 mastoxiv.page/@arXiv_statML_bo
- Timely Information Updating for Mobile Devices Without and With ML Advice
Yu-Pin Hsu, Yi-Hsuan Tseng
arxiv.org/abs/2512.17381 mastoxiv.page/@arXiv_csNI_bot/
- SWE-Bench : A Framework for the Scalable Generation of Software Engineering Benchmarks from Open...
Wang, Ramalho, Celestino, Pham, Liu, Sinha, Portillo, Osunwa, Maduekwe
arxiv.org/abs/2512.17419 mastoxiv.page/@arXiv_csSE_bot/
- Perfect reconstruction of sparse signals using nonconvexity control and one-step RSB message passing
Xiaosi Gu, Ayaka Sakata, Tomoyuki Obuchi
arxiv.org/abs/2512.17426 mastoxiv.page/@arXiv_statML_bo
- MULTIAQUA: A multimodal maritime dataset and robust training strategies for multimodal semantic s...
Jon Muhovi\v{c}, Janez Per\v{s}
arxiv.org/abs/2512.17450 mastoxiv.page/@arXiv_csCV_bot/
- When Data Quality Issues Collide: A Large-Scale Empirical Study of Co-Occurring Data Quality Issu...
Emmanuel Charleson Dapaah, Jens Grabowski
arxiv.org/abs/2512.17460 mastoxiv.page/@arXiv_csSE_bot/
- Behavioural Effects of Agentic Messaging: A Case Study on a Financial Service Application
Olivier Jeunen, Schaun Wheeler
arxiv.org/abs/2512.17462 mastoxiv.page/@arXiv_csIR_bot/
- Linear Attention for Joint Power Optimization and User-Centric Clustering in Cell-Free Networks
Irched Chafaa, Giacomo Bacci, Luca Sanguinetti
arxiv.org/abs/2512.17466 mastoxiv.page/@arXiv_eessSY_bo
- Translating the Rashomon Effect to Sequential Decision-Making Tasks
Dennis Gross, J{\o}rn Eirik Betten, Helge Spieker
arxiv.org/abs/2512.17470 mastoxiv.page/@arXiv_csAI_bot/
- Alternating Direction Method of Multipliers for Nonlinear Matrix Decompositions
Atharva Awari, Nicolas Gillis, Arnaud Vandaele
arxiv.org/abs/2512.17473 mastoxiv.page/@arXiv_eessSP_bo
- TwinSegNet: A Digital Twin-Enabled Federated Learning Framework for Brain Tumor Analysis
Almustapha A. Wakili, Adamu Hussaini, Abubakar A. Musa, Woosub Jung, Wei Yu
arxiv.org/abs/2512.17488 mastoxiv.page/@arXiv_csCV_bot/
- Resource-efficient medical image classification for edge devices
Mahsa Lavaei, Zahra Abadi, Salar Beigzad, Alireza Maleki
arxiv.org/abs/2512.17515 mastoxiv.page/@arXiv_eessIV_bo
- PathBench-MIL: A Comprehensive AutoML and Benchmarking Framework for Multiple Instance Learning i...
Brussee, Valkema, Weijer, Doeleman, Schrader, Kers
arxiv.org/abs/2512.17517 mastoxiv.page/@arXiv_csCV_bot/
- HydroGym: A Reinforcement Learning Platform for Fluid Dynamics
Christian Lagemann, et al.
arxiv.org/abs/2512.17534 mastoxiv.page/@arXiv_physicsfl
- When De-noising Hurts: A Systematic Study of Speech Enhancement Effects on Modern Medical ASR Sys...
Chondhekar, Murukuri, Vasani, Goyal, Badami, Rana, SN, Pandia, Katiyar, Jagadeesh, Gulati
arxiv.org/abs/2512.17562 mastoxiv.page/@arXiv_csSD_bot/
- Enabling Disaggregated Multi-Stage MLLM Inference via GPU-Internal Scheduling and Resource Sharing
Lingxiao Zhao, Haoran Zhou, Yuezhi Che, Dazhao Cheng
arxiv.org/abs/2512.17574 mastoxiv.page/@arXiv_csDC_bot/
- SkinGenBench: Generative Model and Preprocessing Effects for Synthetic Dermoscopic Augmentation i...
N. A. Adarsh Pritam, Jeba Shiney O, Sanyam Jain
arxiv.org/abs/2512.17585 mastoxiv.page/@arXiv_eessIV_bo
- MAD-OOD: A Deep Learning Cluster-Driven Framework for an Out-of-Distribution Malware Detection an...
Tosin Ige, Christopher Kiekintveld, Aritran Piplai, Asif Rahman, Olukunle Kolade, Sasidhar Kunapuli
arxiv.org/abs/2512.17594 mastoxiv.page/@arXiv_csCR_bot/
- Confidence-Credibility Aware Weighted Ensembles of Small LLMs Outperform Large LLMs in Emotion De...
Menna Elgabry, Ali Hamdi
arxiv.org/abs/2512.17630 mastoxiv.page/@arXiv_csCL_bot/
- Generative Multi-Objective Bayesian Optimization with Scalable Batch Evaluations for Sample-Effic...
Madhav R. Muthyala, Farshud Sorourifar, Tianhong Tan, You Peng, Joel A. Paulson
arxiv.org/abs/2512.17659 mastoxiv.page/@arXiv_statML_bo
toXiv_bot_toot

@Techmeme@techhub.social
2025-12-18 17:25:49

Anthropic launches Agent Skills, which let AI assistants perform specialized tasks using modular instructions, and says Microsoft, Cursor, and others use them (Michael Nuñez/VentureBeat)
venturebeat.com/ai/anthropic-l

@netzschleuder@social.skewed.de
2026-01-08 07:00:04

windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations majored by the tasks in which each individual was asked​ to sort cards with other surfer’s name in the order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted

windsurfers: Windsurfers network (1986). 43 nodes, 336 edges. https://networks.skewed.de/net/windsurfers
@Techmeme@techhub.social
2025-12-18 12:05:57

UK AI Security Institute report: AI models are rapidly improving at potentially dangerous biological and chemical tasks, and show fast jumps in self-replication (Shakeel Hashim/Transformer)
transformernews.ai/p/aisi-ai-s

@Techmeme@techhub.social
2025-12-18 06:20:41

Manus says it crossed $100M ARR eight months after launch and is growing at 20% MoM since Manus 1.5's release; its total revenue run rate is now over $125M (Jake Rudnitsky/Bloomberg)
bloomberg.com/news/articles/20

@netzschleuder@social.skewed.de
2025-12-06 15:00:04

windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations majored by the tasks in which each individual was asked​ to sort cards with other surfer’s name in the order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted

windsurfers: Windsurfers network (1986). 43 nodes, 336 edges. https://networks.skewed.de/net/windsurfers
@Techmeme@techhub.social
2025-11-05 11:45:36

Research: AI's ability to complete long and complex software engineering tasks doubles every 6-7 months, but there is a "messiness tax" for real-world tasks (Boaz Barak/Windows On Theory)
windowsontheory.org/2025/11/04

@Techmeme@techhub.social
2025-11-05 07:15:57

A look at data labeling startups like Objectways, whose workers record and annotate repetitive tasks like folding towels to train AI robots for physical tasks (Nilesh Christopher/Los Angeles Times)
latimes.com/business/story/202

@netzschleuder@social.skewed.de
2026-01-03 15:00:03

windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations majored by the tasks in which each individual was asked​ to sort cards with other surfer’s name in the order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted

windsurfers: Windsurfers network (1986). 43 nodes, 336 edges. https://networks.skewed.de/net/windsurfers
@netzschleuder@social.skewed.de
2025-11-01 18:00:04

windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations majored by the tasks in which each individual was asked​ to sort cards with other surfer’s name in the order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted

windsurfers: Windsurfers network (1986). 43 nodes, 336 edges. https://networks.skewed.de/net/windsurfers
@Techmeme@techhub.social
2025-11-30 06:40:47

Alibaba Technical Report: Qwen3-VL beats GPT-5 and Gemini 2.5 Pro on visual tasks and has 100% accuracy on "needle-in-a-haystack" tests for 30-minute videos (Jonathan Kemper/The Decoder)
the-decoder.com/qwen3-vl-can-s

@netzschleuder@social.skewed.de
2025-12-30 06:00:04

windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations majored by the tasks in which each individual was asked​ to sort cards with other surfer’s name in the order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted

windsurfers: Windsurfers network (1986). 43 nodes, 336 edges. https://networks.skewed.de/net/windsurfers
@Techmeme@techhub.social
2025-12-11 18:45:58

OpenAI says GPT‑5.2 Thinking beats or ties industry professionals on 70.9% of GDPval knowledge work tasks, delivering outputs at >11x the speed and <1% the cost (OpenAI)
openai.com/index/introducing-g

@Techmeme@techhub.social
2025-12-09 15:36:07

Pebble unveils the Pebble Index 01, a $99 smart ring with an on-device LLM for processing voice notes, shipping in March 2026, initially for $75 (Julian Chokkattu/Wired)
wired.com/story/pebble-index-r

@Techmeme@techhub.social
2025-12-09 14:45:50

Source: Anthropic, OpenAI, Google, Microsoft, and more are set to unveil the Agentic AI Foundation to build open-source AI agent standards as soon as this week (Aaron Holmes/The Information)
theinformation.com/articles/op

@Techmeme@techhub.social
2026-01-08 07:20:49

Local governments across China are funding dozens of "robot training centers", where human trainers mimic movements like folding clothes to teach the robots (Rest of World)
restofworld.org/2026/china-rob

@Techmeme@techhub.social
2025-12-08 12:30:38

An OpenAI survey of 9,000 workers at 100 companies: it saves workers ~40 to 60 minutes per day on average for professional tasks; OpenAI has 1M business clients (Shirin Ghaffary/Bloomberg)
bloomberg.com/news/articles/20

@Techmeme@techhub.social
2026-01-08 20:45:46

OpenAI is rolling out a HIPAA-compliant version of ChatGPT for clinicians to assist with medical reasoning and administrative tasks, at Cedars-Sinai and others (Shirin Ghaffary/Bloomberg)
bloomberg.com/news/newsletters

@Techmeme@techhub.social
2025-12-06 02:20:57

An analysis of 100T tokens from the past year shows reasoning models now represent over half of all usage, open-weight model use has grown steadily, and more (OpenRouter)
openrouter.ai/state-of-ai

@Techmeme@techhub.social
2025-12-05 02:50:54

Pine, which offers an AI agent to automate digital chores, like making calls, handling emails, and operating software to complete tasks, raised a $25M Series A (FinSMEs)
finsmes.com/2025/12/pine-raise

@Techmeme@techhub.social
2026-01-06 04:01:14

AMD unveils Ryzen AI 400 Series AI PC chips with 12 CPU cores, claiming 1.3x faster multitasking and 1.7x faster content creation than rivals (Rebecca Szkutak/TechCrunch)
techcrunch.com/2026/01/05/amd-

@Techmeme@techhub.social
2025-11-04 18:36:04

Switzerland-based Mimic Robotics, which is building AI models to enable human-like robotic hands to adapt to complex, high-precision tasks, raised a $16M seed (Kyt Dotson/SiliconANGLE)
siliconangle.com/2025/11/04/mi

@Techmeme@techhub.social
2025-11-05 13:45:37

Giga, which develops voice-based AI agents for customer support, raised a $61M Series A led by Redpoint with participation from Y Combinator and Nexus (Beatrice Nolan/Fortune)
fortune.com/2025/11/05/voice-a

@Techmeme@techhub.social
2025-12-03 09:35:47

A look at startups like AGI and Plato, which build replicas of websites to let AI agents learn to navigate and complete specific tasks, like booking flights (Cade Metz/New York Times)
nytimes.com/2025/12/02…

@Techmeme@techhub.social
2025-12-02 16:30:48

Sources: OpenAI is developing a new LLM codenamed Garlic, which performs well when compared to Gemini 3 and Opus 4.5 in coding and reasoning tasks (Stephanie Palazzolo/The Information)
theinformation.com/articles/op

@Techmeme@techhub.social
2025-12-03 14:21:23

Sources: multiple Microsoft divisions lowered how much salespeople are supposed to grow sales of certain AI products after missing growth targets, a rare move (Aaron Holmes/The Information)
theinformation.com/articles/mi

@Techmeme@techhub.social
2025-10-29 12:21:02

OpenAI releases gpt-oss-safeguard, its open-weight reasoning models for safety classification tasks, available in 120B and 20B parameters, under Apache 2.0 (OpenAI)
openai.com/index/introducing-g