Tootfinder

Opt-in global Mastodon full text search. Join the index!

@Techmeme@techhub.social
2026-05-07 13:11:49

Khosla-backed robotics startup Genesis AI unveils GENE-26.5, its first model, which can control robotic hands that it designed in-house to do tasks like cooking (Anna Heim/TechCrunch)
techcrunch.com/2026/05/06/khos

@gadgetboy@gadgetboy.social
2026-04-08 10:33:19

I found a solid iOS client for using my LM Studio-hosted models.
The Web Agent is interesting - launches Google in an in-app browser, parses the SERP, and delivers the results of your query back in the chat window - all while you watch what it's doing.
Qwen 3.5 35b performs well with these tasks, even if it's a little slow for interactive tasks on my hardware.
Find the app here:

@frankel@mastodon.top
2026-03-08 09:06:24

#ClaudeCode Performance: Unlock Deep #Thinking for Better Results
claudefa.st/blog/guide/perform

@simon_jf@mastodon.scot
2026-05-07 10:57:38

I know I'm in a ridiculously privileged situation to be able to moan about this, but work has really felt like **work** recently. Endless marking, paper revisions, reviewing, and admin. It seems that all of the crap tasks have basically bunched together in the last couple of weeks.

@netzschleuder@social.skewed.de
2026-04-08 15:00:04

windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations, measured by the tasks in which each individual was asked to sort cards with other surfers' names in order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted

windsurfers: Windsurfers network (1986). 43 nodes, 336 edges. https://networks.skewed.de/net/windsurfers

@Kingu@sakurajima.moe
2026-04-06 13:45:28

An agentic OS is an OS where every task is done by an entertainment-purposes-only agent.

@ErikJonker@mastodon.social
2026-03-30 13:40:13

Fun playing ARC-AGI-3, puzzles of which the most advanced AI models can solve only 1% 😀
Illustrates how AI models look extremely smart but are at the same time quite dumb.
#AI

@jdrm@social.linux.pizza
2026-03-06 07:03:49

I don't know if you saw this: what people are doing so they can write code themselves while their company thinks they're using a stochastic-parrot agent. danq.me/2026/03/03/ai-agent-lo

@Mediagazer@mstdn.social
2026-04-05 08:01:07

How Hollywood support staff are integrating AI into workflows, from mundane tasks to creative development, amid cost-cutting and workload demands (Mia Galuppo/The Hollywood Reporter)
hollywoodreporter.com/movies/m

@marjolica@social.linux.pizza
2026-05-06 09:42:32

For some reason Hannah Fry is optimistic AI will get better at this (but then maybe her job depends on it?). Personally I think mathematicians have better things to do.
'Brit mathematician lets AI agent loose with credit card – cue password leaks, CAPTCHA chaos and more
British mathematician Professor Hannah Fry has shared a cautionary experiment involving an AI agent, a set of tasks, and a bank card number Fry's team gave it "to show us what it could do."'

@Techmeme@techhub.social
2026-05-06 04:55:59

Google has quietly shut down Project Mariner, its Chrome-browsing AI agent for completing tasks on users' behalf, after highlighting it onstage at I/O 2025 (Max Zeff/@zeffmax)
x.com/zeffmax/status/205182449

@fell@ma.fellr.net
2026-05-05 10:27:21

It's so sad. With 📆 CalDAV we have a really nice open protocol for syncing events, todos and notes. The protocol, which is technically more of a file format (iCalendar), even supports quite complex recurrence rules and things like recurring tasks.
Unfortunately, client (and server) applications usually only implement a subset of what's possible.
Know some good ones? Let me know!
#CalDAV

@neverpanic@chaos.social
2026-03-31 18:31:27

I feel seen.
open.substack.com/pub/workchro

A 6-panel comic from Work Chronicles (workchronicles.com).

Panel 1: How to be more productive

Panel 2: Step 1: Write down your to-do list. We see a comic figure writing a to-do list.

Panel 3: Step 2: Identify high impact tasks. Close-up of the list with Tasks 1-6, tasks 3 and 4 are highlighted.

Panel 4: Step 3: Take a break and browse social media. A comic character is sitting on a bean-bag chair and is doom scrolling on a smartphone.

Panel 5: Step 4: It's night already. The same…

@DamonHD@mastodon.social
2026-05-05 12:33:27

#today I have been at our TTK Energy Group meeting where we all seemed to have major announcements! Now I am home and doing some #QA work on a course for Surrey uni, and I have a HUGE pile of other work chores/tasks to try to work through somehow...

@nebucatnetzer@social.linux.pizza
2026-04-05 12:29:18

It is really fun to have all my data directly accessible on my phone without having to rely on an active internet connection. Not that I don't have one but I just don't need one for most of the tasks.
This is mainly done through #Syncthing and an SD card in my #fairphone

@askesis@qoto.org
2026-04-30 22:58:11

Technology and Responsibility: Reflections on the New Tasks of Ethics
Hans Jonas (1973)
#etica

@Techmeme@techhub.social
2026-02-26 02:56:04

Anthropic unveils scheduled tasks in Cowork, enabling Claude to complete recurring tasks at specific times automatically (Claude/@claudeai)
x.com/claudeai/status/20267208

One useful way to think about today's chatbots is that they function more like secretaries than physicians.
They are remarkably effective at organising information, summarising text, and structuring complex documents.
These are the kinds of tasks where language models are already proving useful within healthcare systems,
for example in drafting clinical notes, summarising patient records, or generating referral letters.
The promise of AI in medicine remains real, …

@pre@boing.world
2026-05-08 09:29:51

A man called Confidence thinks, presumably confidently, that the best interface for an AI agent is not a chat window on a website but... Email!
Chat is synchronous, app specific, dies with a tab close.
But agent tasks can be asynchronous.
Email has identity, a wake event when messages arrive, has a way to reply asynchronously, can attach files, and agents can email each other.
Email is already everywhere.
He has a js library to deal with email to agent messages.
He does a demo including buying a domain and setting up his lib to handle email from it. Interestingly, there's still a web chat to it. Ha.
All of which makes sense but. Email? Really? Surely something more secure and encrypted is better? Something with sender signing so random hackers can't email it? I thought this was surely going to be satire.
#devWorld

@scottmiller42@mstdn.social
2026-05-03 01:55:24

A shoutout to the systems engineers that made Microsoft Windows so fragile, that a single file browser (explorer.exe) freezing causes my Edge downloads to pause, and closing the frozen file browser closes all my file browsers.
My ADHD brain needed those open file browsers so I could keep track of all my in-progress tasks. How do I resume now? Yeah, I suppose I could come up with a better system than that.

@Techmeme@techhub.social
2026-04-05 09:01:45

How Hollywood support staff are integrating AI into workflows, from mundane tasks to creative development, amid cost-cutting and workload demands (Mia Galuppo/The Hollywood Reporter)
hollywoodreporter.com/movies/m

@inthehands@hachyderm.io
2026-03-18 17:03:46

The other one I truly love is GitUp (gitup.co). Its visualization handles certain specific tasks better than anything else — tasks where I’m more concerned about the shape of the commit graph than the contents of individual commits.
Because of the way it does live updates of repo state and offers a whole-commit-graph-level undo, I’ll sometimes keep it open in the background while doing some fiddly thing in another tool (Fork, CLI, whatever) just so I can see what the ^*@# is happening.
Alas, its lack of support for commit signing means I use it less and less.

@mia@hcommons.social
2026-02-14 12:18:03

'Automate tasks, not jobs' - a great headline from a report on 'the AI opportunity for Scotland’s public services' stormid.com/research/

@arXiv_econTH_bot@mastoxiv.page
2026-04-03 07:55:41

Bridging Distant Ideas: the Impact of AI on R&D and Recombinant Innovation
Emanuele Bazzichi, Massimo Riccaboni, Fulvio Castellacci
arxiv.org/abs/2604.02189 arxiv.org/pdf/2604.02189 arxiv.org/html/2604.02189
arXiv:2604.02189v1 Announce Type: new
Abstract: We study how artificial intelligence (AI) affects firms' incentives to pursue incremental versus radical knowledge recombinations. We develop a model of recombinant innovation embedded in a Schumpeterian quality-ladder framework, in which innovation arises from recombining ideas across varying distances in a knowledge space. R&D consists of multiple tasks, a fraction of which can be performed by AI. AI facilitates access to distant knowledge domains, but at the same time it also increases the aggregate rate of creative destruction, shortening the monopoly duration that rewards radical innovations. Moreover, excessive reliance on AI may reduce the originality of research and lead to duplication of research efforts. We obtain three main results. First, higher AI productivity encourages more distant recombinations, if the direct facilitation effect is stronger than the indirect effect due to intensified competition from rivals. Second, the effect of increasing the share of AI-automated R&D tasks is non-monotonic: firms initially target more radical innovations, but beyond a threshold of human-AI complementarity, they shift the focus toward incremental innovations. Third, in the limiting case of full automation, the model predicts that optimal recombination distance collapses to zero, suggesting that fully AI-driven research would undermine the very knowledge creation that it seeks to accelerate.
toXiv_bot_toot

@aral@mastodon.ar.al
2026-02-23 17:41:10

🥳 New Kitten¹ release
• Added `initialise()` hook to `kitten.Component` instances.
This gets called at the end of the constructor and is handy if you don’t want to override the constructor and have to handle the `data` parameter and remember to call `super(data)`. You can still access passed data from `this.data`.

Note that the component is not part of the view hierarchy on the client at this point. If you have tasks you need to perform only once per page – for example, ins…

@netzschleuder@social.skewed.de
2026-03-02 08:00:05

windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations, measured by the tasks in which each individual was asked to sort cards with other surfers' names in order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted

windsurfers: Windsurfers network (1986). 43 nodes, 336 edges. https://networks.skewed.de/net/windsurfers

@trochee@dair-community.social
2026-03-28 21:33:37

To me, Banks is the first prophet of Judith and Octavia's Jihad.
> One of the most important tasks in setting up and running a stable and internally content civilisation is finding an acceptable balance between the desire for freedom of choice in one's actions (and the freedom from mortal fear in one's life) and the need to feel that even in a society so self-correctingly Utopian one is still contributing something.
A Few Notes on the Culture, by Iain M Banks

@ruth_mottram@fediscience.org
2026-04-23 20:29:25

It's amazing how bad I am at estimating how long simple tasks* will take.
I had a nearly finished manuscript. I just needed to tidy a few things in it and then upload it for review.
Two hours tops, I told my family.
5.5 hours later...
#AcademicChatter

@Techmeme@techhub.social
2026-05-05 17:49:43

Sources: Apple plans to let users choose from multiple third-party AI models to perform tasks like generating and editing text and images in iOS 27 (Mark Gurman/Bloomberg)
bloomberg.com/news/articles/20

@arXiv_csLG_bot@mastoxiv.page
2026-02-25 10:38:31

From Isolation to Integration: Building an Adaptive Expert Forest for Pre-Trained Model-based Class-Incremental Learning
Ruiqi Liu, Boyu Diao, Hangda Liu, Zhulin An, Fei Wang, Yongjun Xu
arxiv.org/abs/2602.20911 arxiv.org/pdf/2602.20911 arxiv.org/html/2602.20911
arXiv:2602.20911v1 Announce Type: new
Abstract: Class-Incremental Learning (CIL) requires models to learn new classes without forgetting old ones. A common method is to freeze a pre-trained model and train a new, lightweight adapter for each task. While this prevents forgetting, it treats the learned knowledge as a simple, unstructured collection and fails to use the relationships between tasks. To this end, we propose the Semantic-guided Adaptive Expert Forest (SAEF), a new method that organizes adapters into a structured hierarchy for better knowledge sharing. SAEF first groups tasks into conceptual clusters based on their semantic relationships. Then, within each cluster, it builds a balanced expert tree by creating new adapters from merging the adapters of similar tasks. At inference time, SAEF finds and activates a set of relevant experts from the forest for any given input. The final prediction is made by combining the outputs of these activated experts, weighted by how confident each expert is. Experiments on several benchmark datasets show that SAEF achieves SOTA performance.
toXiv_bot_toot

@wyri@toot-toot.wyrihaxim.us
2026-02-26 17:56:16

YOLO. Wasn't expecting to warm up to using an agent for the shitty tasks this easily
#YOLO #Junie #LLM

@seeingwithsound@mas.to
2026-03-29 21:14:11

MIRAGE: The illusion of visual understanding (by AI models) #LLM #AI

@frankel@mastodon.top
2026-02-18 09:00:44

SkillsBench: Benchmarking How Well Agent #Skills Work Across Diverse Tasks
#LLM

@azonenberg@ioc.exchange
2026-02-10 21:35:45

Anyone know of an Android to-do list application that is
* Completely device local, no network connectivity required or used
* No ads or spyware
* Doesn't time-out tasks even if they sit around for a year uncompleted (looking at you, google calendar)
* Supports recurring maintenance tasks for weekly, monthly, etc. cleaning or something
Open source preferred, but willing to pay a reasonable price if it's out there as a commercial tool

@fanf@mendeddrum.org
2026-02-23 12:42:03

from my link log —
Using nsnotifyd with a PowerDNS secondary.
blog.feld.me/posts/2026/02/nsn
saved 2026-02-23

@deepthoughts10@infosec.exchange
2026-02-25 13:55:32

Geoshitties for the win! If you use @… ‘s blocklists you’d have already blocked *.vercel.app which is a key link in the kill chain for this attack described by Microsoft. My advice: block Vercel for everyone in your org except for those that have a business need. #cybersecurity

@Techmeme@techhub.social
2026-05-05 15:15:59

Anthropic unveils 10 new AI agents for the financial sector, including for drafting pitch decks, reviewing financial statements, and escalating compliance cases (Shirin Ghaffary/Bloomberg)
bloomberg.com/news/articles/20

@cketti@social.int21.dev
2026-03-25 17:02:23

@… @… Not really. The only .java files I could find are here:

@chris@mstdn.chrisalemany.ca
2026-05-06 16:41:06

Finally! Some not-conflicted adults looking at the privacy concerns of LLM bots just slurping up your data without regulation or permission.
“OpenAI did not respect Canadian privacy laws when it trained its immensely popular ChatGPT tool, resulting in the collection and use of sensitive personal information, according to a joint investigation.
The federal privacy commissioner and his counterparts in Quebec, British Columbia and Alberta outlined their findings Wednesday morning into ChatGPT— a chatbot that generates conversational, human-like responses when users type in questions or tasks.
The privacy watchdogs launched their probe in 2023 following a complaint that the company unlawfully collected, used and disclosed personal information without consent.”
#OpenAI #ChatGPT #LLM #Canada #CanPoli #CdnPoli #Privacy

@zachleat@zachleat.com
2026-04-17 14:32:31

@… do you think folks are using it in a separate profile? The marketed tasks all mention access to automate private info

@Mediagazer@mstdn.social
2026-04-29 08:15:47

A case study of using AI to aid election coverage at Bay City News, starting in 2024 and refining later, aiming to reduce journalists' manual, repetitive tasks (Ciara Zavala/Local News Matters)
localnewsmatters.org/2026/04/2

What will people do when AI can handle most current white-collar tasks?
I don't know.
And that's the whole point.
Nobody knew what displaced agricultural workers would do, either,
-- until they did it.
The absence of a visible next chapter isn't evidence that there won't be one.
It's evidence that we're bad at predicting what humans will invent when constraints shift.

@Techmeme@techhub.social
2026-05-04 12:35:37

Enzo Health, whose AI tools help home health and hospice agencies automate tasks like patient intake and documentation review, raised a $20M Series A led by N47 (Brock E.W. Turner/Axios)
axios.com/pro/health-tech-deal

@netzschleuder@social.skewed.de
2026-02-27 12:00:04

windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations, measured by the tasks in which each individual was asked to sort cards with other surfers' names in order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted

windsurfers: Windsurfers network (1986). 43 nodes, 336 edges. https://networks.skewed.de/net/windsurfers

@Techmeme@techhub.social
2026-04-04 05:15:54

Generalist, which raised $140M at a $440M valuation in 2025, releases GEN-1, an AI model to help robots handle high-dexterity tasks typically done by humans (Anna Tong/Forbes)
forbes.com/sites/annatong/2026

@ruth_mottram@fediscience.org
2026-04-22 21:53:35

It's taken me most of the day, but I am finally down to fewer than 250 emails in my inbox. Loads of different tasks dealt with too - I guess I should go to bed now, but I also really need to finish this paper now...
#AcademicChatter

@simon_jf@mastodon.scot
2026-02-26 15:36:43

Possibly one of my least favourite tasks as an academic: getting a paper back to the page limit after adding the ACM guff and removing the space hacks 😐

@inthehands@hachyderm.io
2026-02-17 17:04:54

It’s important to distinguish two different hypothetical ways in which gen AI can constitute a massive wealth transfer:
Scenario 1, “LLMs are the new petrochemicals:” Gen AI is actually effective for all sorts of tasks as advertised. It becomes a necessity for economic participation / useful work / whatever, and ownership of the data model and/or data centers thus means control of high-value resources.
2/

@Techmeme@techhub.social
2026-04-15 17:31:00

Adobe unveils Firefly AI Assistant, which can orchestrate and execute multistep tasks across Creative Cloud apps, available in public beta in the coming weeks (Ivan Mehta/TechCrunch)
techcrunch.com/2026/04/15/adob

@arXiv_csLG_bot@mastoxiv.page
2026-02-25 10:42:41

Scaling Vision Transformers: Evaluating DeepSpeed for Image-Centric Workloads
Huy Trinh, Rebecca Ma, Zeqi Yu, Tahsin Reza
arxiv.org/abs/2602.21081 arxiv.org/pdf/2602.21081 arxiv.org/html/2602.21081
arXiv:2602.21081v1 Announce Type: new
Abstract: Vision Transformers (ViTs) have demonstrated remarkable potential in image processing tasks by utilizing self-attention mechanisms to capture global relationships within data. However, their scalability is hindered by significant computational and memory demands, especially for large-scale models with many parameters. This study aims to leverage DeepSpeed, a highly efficient distributed training framework that is commonly used for language models, to enhance the scalability and performance of ViTs. We evaluate intra- and inter-node training efficiency across multiple GPU configurations on various datasets like CIFAR-10 and CIFAR-100, exploring the impact of distributed data parallelism on training speed, communication overhead, and overall scalability (strong and weak scaling). By systematically varying software parameters, such as batch size and gradient accumulation, we identify key factors influencing performance of distributed training. The experiments in this study provide a foundational basis for applying DeepSpeed to image-related tasks. Future work will extend these investigations to deepen our understanding of DeepSpeed's limitations and explore strategies for optimizing distributed training pipelines for Vision Transformers.
toXiv_bot_toot

@netzschleuder@social.skewed.de
2026-02-24 19:00:04

windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the Fall of 1986. The edge weights indicate the perception of social affiliations, measured by the tasks in which each individual was asked to sort cards with other surfers' names in order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted

windsurfers: Windsurfers network (1986). 43 nodes, 336 edges. https://networks.skewed.de/net/windsurfers

@Techmeme@techhub.social
2026-04-17 07:25:51

Physical Intelligence says its new model, π0.7, can direct robots on tasks they weren't trained on, an "early sign" of generalization, surprising researchers (Connie Loizos/TechCrunch)
techcrunch.com/202…

@Mediagazer@mstdn.social
2026-03-24 14:40:39

Beehiiv now lets creators manage their accounts through AI platforms; the first iteration of Beehiiv MCP supports subscriber analysis and SEO optimization (Sara Fischer/Axios)
axios.com/2026/03/24/beehiiv-c

@Techmeme@techhub.social
2026-04-27 22:16:02

Xiaomi open sources MiMo-V2.5 and MiMo-V2.5-Pro under the MIT License, saying both models are among the most efficient available for agentic "claw" tasks (Carl Franzen/VentureBeat)
venturebeat.com/ai/open-source

@arXiv_csLG_bot@mastoxiv.page
2026-02-25 10:44:51

Why Pass@k Optimization Can Degrade Pass@1: Prompt Interference in LLM Post-training
Anas Barakat, Souradip Chakraborty, Khushbu Pahwa, Amrit Singh Bedi
arxiv.org/abs/2602.21189 arxiv.org/pdf/2602.21189 arxiv.org/html/2602.21189
arXiv:2602.21189v1 Announce Type: new
Abstract: Pass@k is a widely used performance metric for verifiable large language model tasks, including mathematical reasoning, code generation, and short-answer reasoning. It defines success if any of k independently sampled solutions passes a verifier. This multi-sample inference metric has motivated inference-aware fine-tuning methods that directly optimize pass@k. However, prior work reports a recurring trade-off: pass@k improves while pass@1 degrades under such methods. This trade-off is practically important because pass@1 often remains a hard operational constraint due to latency and cost budgets, imperfect verifier coverage, and the need for a reliable single-shot fallback. We study the origin of this trade-off and provide a theoretical characterization of when pass@k policy optimization can reduce pass@1 through gradient conflict induced by prompt interference. We show that pass@k policy gradients can conflict with pass@1 gradients because pass@k optimization implicitly reweights prompts toward low-success prompts; when these prompts are what we term negatively interfering, their upweighting can rotate the pass@k update direction away from the pass@1 direction. We illustrate our theoretical findings with large language model experiments on verifiable mathematical reasoning tasks.
toXiv_bot_toot

@frankel@mastodon.top
2026-02-20 17:11:49

Evaluating #AGENTS.md: Are Repository-Level Context Files Helpful for Coding Agents?
arxiv.org/abs/2602.11988

@Techmeme@techhub.social
2026-04-03 00:25:43

Source: Anthropic has acquired Coefficient Bio, which was developing a platform that enables AI to run biotech tasks such as planning drug research, for ~$400M (The Information)
theinformation.com/articles/an

@Techmeme@techhub.social
2026-04-02 11:06:28

How AI helped Medvi, a telehealth provider of GLP-1 weight-loss drugs with just two full-time employees, hit $401M in 2025 sales, as it tracks for $1.8B in 2026 (Erin Griffith/New York Times)
nytimes.com/2026/04/02/technol

@Techmeme@techhub.social
2026-03-02 04:40:41

Early data show wages are rising for AI-exposed jobs that place a high value on a "worker's tacit knowledge and experience", as textbook knowledge loses value (J. Scott Davis/Federal Reserve Bank of Dallas)
dallasfed.org/research/economi

@arXiv_csLG_bot@mastoxiv.page
2026-02-25 10:34:51

UrbanFM: Scaling Urban Spatio-Temporal Foundation Models
Wei Chen, Yuqian Wu, Junle Chen, Xiaofang Zhou, Yuxuan Liang
arxiv.org/abs/2602.20677 arxiv.org/pdf/2602.20677 arxiv.org/html/2602.20677
arXiv:2602.20677v1 Announce Type: new
Abstract: Urban systems, as dynamic complex systems, continuously generate spatio-temporal data streams that encode the fundamental laws of human mobility and city evolution. While AI for Science has witnessed the transformative power of foundation models in disciplines like genomics and meteorology, urban computing remains fragmented due to "scenario-specific" models, which are overfitted to specific regions or tasks, hindering their generalizability. To bridge this gap and advance spatio-temporal foundation models for urban systems, we adopt scaling as the central perspective and systematically investigate two key questions: what to scale and how to scale. Grounded in first-principles analysis, we identify three critical dimensions: heterogeneity, correlation, and dynamics, aligning these principles with the fundamental scientific properties of urban spatio-temporal data. Specifically, to address heterogeneity through data scaling, we construct WorldST. This billion-scale corpus standardizes diverse physical signals, such as traffic flow and speed, from over 100 global cities into a unified data format. To enable computation scaling for modeling correlations, we introduce the MiniST unit, a novel split mechanism that discretizes continuous spatio-temporal fields into learnable computational units to unify representations of grid-based and sensor-based observations. Finally, addressing dynamics via architecture scaling, we propose UrbanFM, a minimalist self-attention architecture designed with limited inductive biases to autonomously learn dynamic spatio-temporal dependencies from massive data. Furthermore, we establish EvalST, the largest-scale urban spatio-temporal benchmark to date. Extensive experiments demonstrate that UrbanFM achieves remarkable zero-shot generalization across unseen cities and tasks, marking a pivotal first step toward large-scale urban spatio-temporal foundation models.
toXiv_bot_toot

@Techmeme@techhub.social
2026-04-02 10:51:04

OpenClaw launches an official China mirror, with ByteDance providing the servers to host the Chinese-language service, as OpenClaw explodes in the country (Juro Osawa/The Information)
theinformation.com/briefings/b

@Techmeme@techhub.social
2026-03-19 15:11:07

DoorDash launches Tasks, a new app that pays delivery couriers in some markets to submit video clips and complete other tasks for training AI models (Natalie Lung/Bloomberg)
bloomberg.com/news/articles/20

@Techmeme@techhub.social
2026-03-01 15:05:43

A look at Hyundai's Atlas humanoid robot, slated for assembly tasks in 2028; Hyundai has invested billions in robotics since acquiring Boston Dynamics in 2021 (Hyonhee Shin/Bloomberg)
bloomberg.com/news/articles/20

@arXiv_csLG_bot@mastoxiv.page
2026-02-25 10:38:51

Hierarchic-EEG2Text: Assessing EEG-To-Text Decoding across Hierarchical Abstraction Levels
Anupam Sharma, Harish Katti, Prajwal Singh, Shanmuganathan Raman, Krishna Miyapuram
arxiv.org/abs/2602.20932 arxiv.org/pdf/2602.20932 arxiv.org/html/2602.20932
arXiv:2602.20932v1 Announce Type: new
Abstract: An electroencephalogram (EEG) records the spatially averaged electrical activity of neurons in the brain, measured from the human scalp. Prior studies have explored EEG-based classification of objects or concepts, often for passive viewing of briefly presented image or video stimuli, with limited classes. Because EEG exhibits a low signal-to-noise ratio, recognizing fine-grained representations across a large number of classes remains challenging; however, abstract-level object representations may exist. In this work, we investigate whether EEG captures object representations across multiple hierarchical levels, and propose episodic analysis, in which a Machine Learning (ML) model is evaluated across various, yet related, classification tasks (episodes). Unlike prior episodic EEG studies that rely on fixed or randomly sampled classes of equal cardinality, we adopt hierarchy-aware episode sampling using WordNet to generate episodes with variable classes of diverse hierarchy. We also present the largest episodic framework in the EEG domain for detecting observed text from EEG signals in the PEERS dataset, comprising 931,538 EEG samples under 1,610 object labels, acquired from 264 human participants (subjects) performing controlled cognitive tasks, enabling the study of neural dynamics underlying perception, decision-making, and performance monitoring.
We examine how the semantic abstraction level affects classification performance across multiple learning techniques and architectures, providing a comprehensive analysis. The models tend to improve performance when the classification categories are drawn from higher levels of the hierarchy, suggesting sensitivity to abstraction. Our work highlights abstraction depth as an underexplored dimension of EEG decoding and motivates future research in this direction.
toXiv_bot_toot

@Techmeme@techhub.social
2026-03-01 06:21:03

Multiple AWS developers say they are asked to take on new roles with AI tools' assistance, and engineers are now required to complete technical writing tasks (Financial Times)
ft.com/content/433f41f2-bf6d-4

@Techmeme@techhub.social
2026-03-31 23:40:58

Salesforce announces over 30 new features for Slack, including a meeting transcription feature and an operator mode to complete multi-step tasks on the desktop (Sabrina Ortiz/The Deep View)
thedeepview.com/articles/slack

@Techmeme@techhub.social
2026-03-23 22:15:44

Anthropic rolls out a computer use feature for Claude Cowork and the Claude Code desktop app, in research preview on macOS for Pro and Max subscribers (Blake Stimac/CNET)
cnet.com/tech/services-and-sof

@Techmeme@techhub.social
2026-03-30 02:36:14

Inside the rise and fall of Sora, whose team worked separately from OpenAI's core research team, as OpenAI shuts down Sora and redirects compute to other tasks (Wall Street Journal)
wsj.com/tech/ai/the-sudden-fal

@Techmeme@techhub.social
2026-02-09 20:55:44

An eight-month study at a US tech company finds AI tools didn't reduce work but intensified it, as employees worked faster and took on a broader range of tasks (Harvard Business Review)
hbr.org/2026/02/ai-doesnt-redu

@Techmeme@techhub.social
2026-02-26 16:40:46

Encord, whose software helps companies developing AI models manage training data for robots and other uses, raised $60M at a $500M pre-money valuation (Rocket Drew/The Information)
theinformation.com/articles/ro

@Techmeme@techhub.social
2026-02-24 13:01:39

Basis, which builds AI agents to help accounting firms with tasks like tax returns, raised $100M led by Accel at a $1.15B valuation, for $138M in total funding (Rebecca Torrence/Bloomberg)
bloomberg.com/news/articles/20

@Techmeme@techhub.social
2026-03-27 02:15:57

NY-based Blossom Health, which makes an "AI copilot" to augment psychiatrists' clinical decisions and automate office tasks, raised $20M in seed and Series A (Lily Mae Lazarus/Fortune)
fortune.com/2026/03/26/exclusi

@Techmeme@techhub.social
2026-03-26 16:25:57

Cohere launches Transcribe, its first voice model; the 2B-parameter, open-source speech recognition model handles tasks like notetaking and speech analysis (Ivan Mehta/TechCrunch)
techcrunch.com/2026/03/26/cohe

@Techmeme@techhub.social
2026-04-23 03:16:04

OpenAI releases ChatGPT for Clinicians, a tool for medical tasks like documentation and research, free for verified physicians, pharmacists, and more in the US (OpenAI)
openai.com/index/making-chatgp

@Techmeme@techhub.social
2026-03-24 14:40:50

Beehiiv now lets creators manage their accounts through AI platforms; the first iteration of Beehiiv MCP supports subscriber analysis and SEO optimization (Sara Fischer/Axios)
axios.com/2026/03/24/beehiiv-c

@Techmeme@techhub.social
2026-04-23 18:20:49

GPT-5.5 is rolling out to Plus, Pro, Business, and Enterprise users in ChatGPT and Codex, and GPT-5.5 Pro to Pro, Business, and Enterprise users in ChatGPT (The Verge)
theverge.com/ai-artificial-int

@Techmeme@techhub.social
2026-04-22 18:06:08

OpenAI announces workspace agents in ChatGPT, letting teams create Codex-powered shared agents for complex tasks, and says they are "an evolution of GPTs" (OpenAI)
openai.com/index/introducing-w

@Techmeme@techhub.social
2026-04-23 18:08:54

OpenAI unveils GPT-5.5, intended to be better at completing work without much direction, saying the model "kind of figures it out, deals with ambiguity" (Rachel Metz/Bloomberg)
bloomberg.com/news/articles/20

@Techmeme@techhub.social
2026-03-22 03:55:55

Hands-on with Gemini task automation on mobile: it's super impressive despite being very slow and failing at some tasks; it can order food, book Ubers, and more (Allison Johnson/The Verge)
theverge.com/tech/898282/gemin

@Techmeme@techhub.social
2026-04-20 16:15:40

Moonshot introduces Kimi K2.6, an open-weight model that it says shows strong improvements in long-horizon coding tasks, available under a modified MIT License (Kimi AI)
kimi.com/blog/kimi-k2-6

@Techmeme@techhub.social
2026-02-19 16:22:04

Google rolls out Gemini 3.1 Pro, which it says is "a step forward in core reasoning", for AI Pro and Ultra subscribers; the .1 increment is a first for Google (Abner Li/9to5Google)
9to5google.com/2026/02/19/goog

@Techmeme@techhub.social
2026-02-16 11:35:41

Alibaba debuts Qwen 3.5, adding "visual agentic capabilities" to independently execute tasks, and says it is 60% cheaper to use and 8x better at large workloads (Eduardo Baptista/Reuters)
reuters.com/world/china/alibab

@Techmeme@techhub.social
2026-03-19 18:35:48

Google moved some staffers working on Project Mariner, its AI agent that can navigate Chrome and complete tasks on a user's behalf, to higher-priority projects (Maxwell Zeff/Wired)
wired.com/story/google-shakes-

@Techmeme@techhub.social
2026-03-19 15:55:49

Cursor launches Composer 2, an AI agent trained solely on coding-related data to perform autonomous, lengthy coding tasks, to compete with Anthropic and OpenAI (Rachel Metz/Bloomberg)
bloomberg.com/news/articles/20

@Techmeme@techhub.social
2026-03-19 16:20:54

Meta plans to reduce its reliance on third-party vendors for content moderation, in favor of AI tools that it says are better at spotting scams and other tasks (Kurt Wagner/Bloomberg)
bloomberg.com/news/articles/20

@Techmeme@techhub.social
2026-03-17 04:40:46

Alibaba launches Wukong, an enterprise AI platform that coordinates multiple AI agents to handle complex tasks like document editing, currently in beta (Reuters)
reuters.com/world/asia-pacific

@Techmeme@techhub.social
2026-04-16 03:21:21

OpenAI updates Agents SDK with native sandboxing and an in-distribution harness for deploying and testing agents on long-horizon tasks (Lucas Ropek/TechCrunch)
techcrunch.com/2026/04/15/open

@Techmeme@techhub.social
2026-04-17 00:25:49

Alibaba unveils Qwen3.6-35B-A3B, an open-weight MoE model with 35B total and 3B active parameters, saying it rivals larger dense models in agentic coding tasks (Qwen)
qwen.ai/blog?id=qwen3.6-35b-a3b

@Techmeme@techhub.social
2026-03-16 20:10:46

Z.ai launches GLM-5-Turbo, a closed-source, faster, and cheaper variant of GLM-5 optimized for agent-driven workflows and OpenClaw-style tasks (Carl Franzen/VentureBeat)
venturebeat.com/technology/z-a

@Techmeme@techhub.social
2026-04-14 19:41:10

Anthropic redesigns Claude Code on desktop, adding a sidebar for managing multiple sessions, a drag-and-drop layout, an integrated terminal, and a file editor (Claude)
claude.com/blog/claude-code-de

@Techmeme@techhub.social
2026-02-12 06:26:00

Hong Kong-listed Zhipu AI surged 30% after releasing GLM-5, an open-source LLM with enhanced coding capabilities and support for long-running agent tasks (CNBC)
cnbc.com/2026/02/12/chinese-ai

@Techmeme@techhub.social
2026-04-14 17:50:52

Anthropic launches a repeatable routines feature for Claude Code as a research preview, allowing developers to schedule and automate software development tasks (Zac Hall/9to5Mac)
9to5mac.com/2026/04/14/anthrop

@Techmeme@techhub.social
2026-02-13 11:31:09

Baidu plans to let users access OpenClaw via its search app and integrate OpenClaw's capabilities into its e-commerce business and other services (Evelyn Cheng/CNBC)
cnbc.com/2026/02/13/baidu-open

@Techmeme@techhub.social
2026-02-11 17:21:01

Z.ai launches GLM-5, its flagship open-weight model, saying it has best-in-class performance among open-source models in reasoning, coding, and agentic tasks (Z.ai)
z.ai/blog/glm-5

@Techmeme@techhub.social
2026-02-10 10:55:55

Alibaba's DAMO Academy releases RynnBrain, an open-source foundation model to help robots perform real-world tasks like navigating rooms, trained on Qwen3-VL (Saritha Rai/Bloomberg)
bloomberg.com/news/articles/20

@Techmeme@techhub.social
2026-02-10 14:20:58

Cloud computing provider Nebius agrees to buy Tavily, which helps AI agents search for up-to-date information for tasks like coding, in a deal that a source says is worth $275M (Dina Bass/Bloomberg)
bloomberg.com/news/articles/20

@Techmeme@techhub.social
2026-03-13 11:02:52

STMicro plans to retrain workers and deploy humanoid robots in its older chip plants for repetitive and physically demanding tasks, aiming to avoid closures (Nathan Vifflin/Reuters)
reuters.com/business/stmicroel