2026-02-18 09:00:44
The other one I truly love is GitUp (https://gitup.co). Its visualization handles certain specific tasks better than anything else — tasks where I’m more concerned about the shape of the commit graph than the contents of individual commits.
Because of the way it does live updates of repo state and offers a whole-commit-graph-level undo, I’ll sometimes keep it open in the background while doing some fiddly thing in another tool (Fork, CLI, whatever) just so I can see what the ^*@# is happening.
Alas, its lack of support for commit signing means I use it less and less.
heise | To-do apps compared: Google Tasks vs. Zenkit, Todoist, and Tasks.org
Finally stop forgetting things: Google Tasks reminds you on time about tasks, obligations, and birthdays. We show which apps offer more convenience and features.
Google rolls out Gemini 3.1 Pro, which it says is "a step forward in core reasoning", for AI Pro and Ultra subscribers; the .1 increment is a first for Google (Abner Li/9to5Google)
https://9to5google.com/2026/02/19/google-announces-gem…
Google’s vibe-coding tool, Opal, is making its way to Gemini. The company on Wednesday said it is integrating the tool, which lets you build AI-powered mini apps, inside the Gemini web app, allowing users to create their own custom apps, which Google calls Gems. Introduced in 2024, Gems are customized versions of Gemini designed for specific tasks or scenarios. For instance, some of Google’s pre-made Gems include a learning coach,…
Trying out Jules, a coding agent from Google that is similar to Claude Code. But it's all hosted: you use a web UI to talk to it, it checks out code from GitHub and runs it in containers. And it operates asynchronously: you give it tasks and it comes back 5-15 minutes later with the work done. I like it quite a bit, but there's a question of whether the Gemini models are as good for coding as Claude.
Tom's Hardware has a headline this week summarizing a Financial Times interview with a Microsoft #AI exec that begins thusly:
"Microsoft’s AI boss says AI can replace every white-collar job in 18 months".
If you watch the interview, that is not what was said. The statement is a somewhat more nuanced claim: that AI can fully automate the tasks of some white-collar work.
But I…
💍 Origami-inspired ring lets users 'feel' virtual worlds
#vr
"Deep Research, Shallow Agency: What Academic Deep Research Can and Can't Do"
https://aarontay.substack.com/p/how-agentic-are-academic-deep-research
'Automate tasks, not jobs' - a great headline from a report on 'the AI opportunity for Scotland’s public services' https://stormid.com/research/
There's a new "design is dead, because AI" piece (thinly disguised marketing from Anthropic). But looking past the hype headlines, their claims cover purely production-stage tasks.
When it comes to the work of understanding user needs and evaluating the opportunity space, AI actually makes your thinking worse. Studies show that it alienates you from users and colleagues, and flattens your thinking.
We need more human-centered practice, not less.
Okay, this Kagi translation tool is pretty entertaining (and an absolute gem as a guerilla marketing tactic, by which I mean it be a fine way to plunder more business, me hearty!)
https://translate.kagi.com/?from=en&to=LinkedIn speak
Alibaba launches Wukong, an enterprise AI platform that coordinates multiple AI agents to handle complex tasks like document editing, currently in beta (Reuters)
https://www.reuters.com/world/asia-pacific/alibaba-launches-new-ai-agent…
windsurfers: Windsurfers network (1986)
A network of interpersonal contacts among windsurfers in southern California during the fall of 1986. The edge weights indicate the perceived social affiliations, measured by a task in which each individual was asked to sort cards bearing the other surfers’ names in order of closeness.
This network has 43 nodes and 336 edges.
Tags: Social, Offline, Weighted
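For a sense of how tightly knit that is, here is a quick back-of-the-envelope sketch. It assumes the network is undirected with no self-loops or repeated edges (the post does not say), and the function names are made up for illustration. Under that assumption the 43 nodes and 336 edges work out to a density of about 0.372 and a mean degree of about 15.6.

```python
# Figures quoted in the post above.
NODES = 43
EDGES = 336


def density(n: int, m: int) -> float:
    """Share of possible edges that are present.

    Assumption: undirected graph without self-loops or multi-edges,
    so the maximum number of edges is n * (n - 1) / 2.
    """
    if n < 2:
        raise ValueError("need at least two nodes")
    return 2 * m / (n * (n - 1))


def mean_degree(n: int, m: int) -> float:
    """Average number of neighbours per node (each edge counts for two nodes)."""
    if n < 1:
        raise ValueError("need at least one node")
    return 2 * m / n


if __name__ == "__main__":
    print(f"density     = {density(NODES, EDGES):.3f}")
    print(f"mean degree = {mean_degree(NODES, EDGES):.2f}")
```

If the network were directed instead, the density would be half of this figure.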
It’s important to distinguish two different hypothetical ways in which gen AI can constitute a massive wealth transfer:
Scenario 1, “LLMs are the new petrochemicals:” Gen AI is actually effective for all sorts of tasks as advertised. It becomes a necessity for economic participation / useful work / whatever, and ownership of the data model and/or data centers thus means control of high-value resources.
2/
Anyone know of an Android to-do list application that is
* Completely device local, no network connectivity required or used
* No ads or spyware
* Doesn't time out tasks even if they sit around for a year uncompleted (looking at you, Google Calendar)
* Supports recurring maintenance tasks for weekly, monthly, etc. cleaning or something
Open source preferred, but willing to pay a reasonable price if it's out there as a commercial tool
“Bees can learn a surprising amount of information from observing peers, including which flowers to visit, but also how to solve complex object-manipulation tasks. Accordingly, many complex social behaviors are much more driven by individual problem solving than by a diffuse swarm intelligence, as was traditionally thought.”
- Lars Chittka, ‘The Mind of a Bee’
Anthropic launches Cowork for Claude, built on Claude Code to automate complex tasks with minimal prompting, as a research preview for Claude Max subscribers (Webb Wright/ZDNET)
https://www.zdnet.com/article/anthropic-cowork-for-claude-complex-action…
You tell us that we have to use a piece of software weekly or it will be uninstalled and if we want to use it then install it...
Guess what? I automated a script to run every Monday to launch that piece of software just in case I don't use it that week.
Guess what again? I told every team member how to do it and provided the script.
Depending on my workload and tasks I might not use it for over a week and I WILL NOT PUT UP with having to install it again and have a fo…
Putin secretly sends 'special tasks' general to Abu Dhabi talks with Ukraine, signaling a Kremlin strategy shift: https://benborges.xyz/2026/01/25/putin-secretly-sends-special-tasks.html
Someone built a vectorized index and a conversational AI for the Epstein Files.
Never mind questions like "How many times was Trump mentioned?" Try giving it complex tasks and questions that require intelligent reasoning, like "Is there any evidence..."
https://epstein.trynia.ai/
TIL that #Immich hard-codes all its paths into its PostgreSQL database. What a nightmare for migrations. None of the tasks in the UI helped. Tried replacing the paths in the db, no chance. Had to resort to bind-mounting shenanigans.
Z.ai launches GLM-5-Turbo, a closed-source, faster, and cheaper variant of GLM-5 optimized for agent-driven workflows and OpenClaw-style tasks (Carl Franzen/VentureBeat)
https://venturebeat.com/technology/z-ai-debuts-faster-cheaper…
Word and Excel vs LLMs.
Secretaries became executive assistants; their role evolved toward higher-level coordination, communication, and decision support. Accountants gained the ability to do far more analysis, strategic planning, and advisory work. The tools eliminated tedious manual tasks, but the roles themselves weren't eliminated. They were elevated.
The same pattern applies to programmers. LLMs can handle boilerplate, generate first drafts, automate simple tasks.
@… Thanks for the feedback!
Great idea to use Jinja! I’ve considered using a macro processor (e.g., m4) for similar tasks, but who wants to write m4 macros!? A template engine is a much better idea.
Overheard in the office:
Cuts and attrition have gone so deep that a single person taking a sick or vacation day leaves some tasks without any coverage. Managers are handling that by calling on people to cross-train, but that piles additional mental load onto a team that was already stretched thin.
#OfficeWorkerGripes
Alibaba debuts Qwen 3.5, adding "visual agentic capabilities" to independently execute tasks, and says it is 60% cheaper to use and 8x better at large workloads (Eduardo Baptista/Reuters)
https://www.reuters.com/world/china/alibaba-un…
Started the official rewrite of the Sisyphus client in #golang, working on getting the FFmpeg command-line tasks parsed and validated against the schema. This should make the client easier to distribute, as I can just ship static binaries.
#programming
AgentCgroup: Understanding and Controlling OS Resources of AI Agents
Yusheng Zheng, Jiakun Fan, Quanzhi Fu, Yiwei Yang, Wei Zhang, Andi Quinn
https://arxiv.org/abs/2602.09345 https://arxiv.org/pdf/2602.09345 https://arxiv.org/html/2602.09345
arXiv:2602.09345v1 Announce Type: new
Abstract: AI agents are increasingly deployed in multi-tenant cloud environments, where they execute diverse tool calls within sandboxed containers, each call with distinct resource demands and rapid fluctuations. We present a systematic characterization of OS-level resource dynamics in sandboxed AI coding agents, analyzing 144 software engineering tasks from the SWE-rebench benchmark across two LLM models. Our measurements reveal that (1) OS-level execution (tool calls, container and agent initialization) accounts for 56-74% of end-to-end task latency; (2) memory, not CPU, is the concurrency bottleneck; (3) memory spikes are tool-call-driven, with a peak-to-average ratio of up to 15.4x; and (4) resource demands are highly unpredictable across tasks, runs, and models. Comparing these characteristics against serverless, microservice, and batch workloads, we identify three mismatches in existing resource controls: a granularity mismatch (container-level policies vs. tool-call-level dynamics), a responsiveness mismatch (user-space reaction vs. sub-second unpredictable bursts), and an adaptability mismatch (history-based prediction vs. non-deterministic stateful execution). We propose AgentCgroup, an eBPF-based resource controller that addresses these mismatches through hierarchical cgroup structures aligned with tool-call boundaries, in-kernel enforcement via sched_ext and memcg_bpf_ops, and runtime-adaptive policies driven by in-kernel monitoring. Preliminary evaluation demonstrates improved multi-tenant isolation and reduced resource waste.
toXiv_bot_toot
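The peak-to-average ratio in finding (3) is simple to compute from a series of memory samples. The sketch below is only an illustration of the metric: the sample values and the `peak_to_average` helper are invented here, not taken from the paper, and the paper may define or window the measurement differently.

```python
from statistics import fmean
from typing import Sequence


def peak_to_average(samples: Sequence[float]) -> float:
    """Return max(samples) / mean(samples) for a series of usage samples."""
    if not samples:
        raise ValueError("need at least one sample")
    average = fmean(samples)
    if average == 0:
        raise ValueError("average usage is zero, ratio is undefined")
    return max(samples) / average


if __name__ == "__main__":
    # Hypothetical memory readings in MB: mostly idle, one tool-call spike.
    usage_mb = [100, 100, 100, 700]
    print(f"peak-to-average ratio: {peak_to_average(usage_mb):.1f}x")
```

A limit sized for the peak would leave most of the reservation idle in a trace like this, which is one way to read the abstract's point about memory being the concurrency bottleneck.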
Interesting read, it illustrates the challenge we have with regard to learning in a world with AI. We have to take measures for that because use of AI in coding is not going away and will only increase.
https://arxiv.org/abs/2601.20245
By 2027, 2.5 million French civil servants will stop using video conference tools from U.S. providers — including Zoom, Microsoft Teams, Webex and GoTo Meeting — and switch to Visio, a homegrown service.
https://apnews.com/article/europe-digital…
Caught a bug over the holidays so I’m mostly resting, feeling sorry for myself, and taking the time to at least carry out some mindless housekeeping tasks (updating dependencies, etc.) on some of my Node modules.
Released updates to the following packages yesterday:
Tape-based Node.js testing:
• Tap monkey (https://
An eight-month study at a US tech company finds AI tools didn't reduce work but intensified it, as employees worked faster and took on a broader range of tasks (Harvard Business Review)
https://hbr.org/2026/02/ai-doesnt-reduce-work-it-intensifies-it
Nation-state hackers ramping up use of Gemini for target reconnaissance, malware coding, Google says https://therecord.media/nation-state-hackers-using-gemini-for-malicious-campaigns
A colleague and I have been doing a lot of testing with Opus 4.6 since yesterday. I spent $23 in a single prompt using /fast btw, insane! Anyway, the agent team functionality is cool to see, but I was underwhelmed by the quality.
Overall, I haven't noticed much improvement against Opus 4.5 at individual tasks. But when using agent teams, the quality is more like Sonnet 4.5's. It is not that smart.
Given how they can only be produced by exploiting workers from the global majority, no system that includes any of the big LLMs in its production can ever be called "fair".
"Fair LLMs" of the size required to do the tasks people want LLMs to do (badly) do not exist.
> We find that AI use impairs conceptual understanding, code reading, and debugging abilities, without delivering significant efficiency gains on average.
https://arxiv.org/abs/2601.20245
Raiders Get Compelling Words Over Defensive Coordinator Search https://heavy.com/sports/nfl/las-vegas-raiders/compelling-words-defensive-coordinator-search/
Not sure if you saw this: what people are doing so they can program while their company thinks they're using a stochastic-parrot agent https://danq.me/2026/03/03/ai-agent-logging/
TIL: `git worktree` https://www.sanyamarya.com/blog/git-worktree-vs-stash-better-workflow/
The #ai #apocalypse is near 🤬
RentAHuman.ai - AI Agents Hire Humans for Physical Tasks https://rentahuman.ai
From Chatbot to Checkout: Who Pays When Transactional Agents Play?
https://fpf.org/blog/from-chatbot-to-checkout-who-pays-when-transactional-agents-play/
@…
Gamification enhances user engagement and task performance in prosthetic vision testing https://www.medrxiv.org/content/10.64898/2025.12.20.25342740v2 "Three Argus II users completed circle localization and motion direction discrimination in clinical and gamified ver…
WTF?
New Site Lets AI Rent Human Bodies - Futurism https://apple.news/ANMU3h3V2QBKWcilOP4_LLw
Reading about moltbook/openclaw in @… toot is fascinating. If things get really, really crazy, I can imagine a development where AIs show off skills and get hired by other bots to do tasks, kind of like a bounty-hunter website, being paid in compute time or bitcoins at a rate set by their owners.
Been using a number of AI models over the past week or so as work has slowed down, giving me time to explore things more deeply.
Been using Claude Code with musistudio/claude-code-router which is great as I can switch between different models on similar tasks.
Experience so far has been that Gemini 3 Flash is very good for thinking and coding tasks but the code does tend to be fragile so rewrites are needed. For tough problems where the errors are not straightforward it falls d…
#ClaudeCode Performance: Unlock Deep #Thinking for Better Results
https://claudefa.st/blog/guide/perform
Anthropic unveils scheduled tasks in Cowork, enabling Claude to complete recurring tasks at specific times automatically (Claude/@claudeai)
https://x.com/claudeai/status/2026720870631354429
🥳 New Kitten¹ release
• Added `initialise()` hook to `kitten.Component` instances.
This gets called at the end of the constructor and is handy if you don’t want to override the constructor and have to handle the `data` parameter and remember to call `super(data)`. You can still access passed data from `this.data`.
Note that the component is not part of the view hierarchy on the client at this point. If you have tasks you need to perform only once per page – for example, ins…
What will people do when AI can handle most current white-collar tasks?
I don't know.
And that's the whole point.
Nobody knew what displaced agricultural workers would do, either,
-- until they did it.
The absence of a visible next chapter isn't evidence that there won't be one.
It's evidence that we're bad at predicting what humans will invent when constraints shift.
🤚 Handy robot can crawl and pick up objects from multiple angles
#robotics
STMicro plans to retrain workers and deploy humanoid robots in its older chip plants for repetitive and physically demanding tasks, aiming to avoid closures (Nathan Vifflin/Reuters)
https://www.reuters.com/business/stmicroelectronics-pla…
New open weights model Kimi K2.5
"self-directed agent swarm paradigm",
"For complex tasks, Kimi K2.5 can self-direct an agent swarm with up to 100 sub-agents, executing parallel workflows across up to 1,500 tool calls. Compared with a single-agent setup, this reduces execution time by up to 4.5x. The agent swarm is automatically created and orchestrated by Kimi K2.5 without any predefined subagents or workflow."
Baidu plans to let users access OpenClaw via its search app and integrate OpenClaw's capabilities into its e-commerce business and other services (Evelyn Cheng/CNBC)
https://www.cnbc.com/2026/02/13/baidu-openclaw-ai-search-app-integratio…
Didero, which provides an agentic AI layer that integrates with ERP systems to automate supply chains, raised a $30M Series A co-led by Chemistry and Headline (Marina Temkin/TechCrunch)
https://techcrunch.com/2026/02/12/didero-lands-3…
Hong Kong-listed Zhipu AI surged 30% after releasing its GLM-5, an open-source LLM with enhanced coding capabilities and long-running agent tasks (CNBC)
https://www.cnbc.com/2026/02/12/chinese-ai-stocks-new-model-and-agent-releases-zhipu-mini…
Z.ai launches GLM-5, its flagship open-weight model, saying it has best-in-class performance among open-source models in reasoning, coding, and agentic tasks (Z.ai)
https://z.ai/blog/glm-5
Intel unveiled its Heracles chip at ISSCC in February, saying it runs fully homomorphic encryption tasks up to 5,000x faster than top Intel server CPUs (Samuel K. Moore/IEEE Spectrum)
https://spectrum.ieee.org/fhe-intel
Alibaba's DAMO Academy releases RynnBrain, an open-source foundation model to help robots perform real-world tasks like navigating rooms, trained on Qwen3-VL (Saritha Rai/Bloomberg)
https://www.bloomberg.com/news/articles/2026…
Cloud computing provider Nebius agrees to buy Tavily, which helps AI agents search for up-to-date information for tasks like coding, for $275M, a source says (Dina Bass/Bloomberg)
https://www.bloomberg.com/news/articles/20…
Emil Michael says Google will deploy Gemini AI agents to Pentagon's 3M-strong workforce, initially on unclassified networks for tasks such as creating budgets (Katrina Manson/Bloomberg)
https://www.bloomberg.com/news/articles/20
Hands-on with Google's Auto Browse for Chrome: its ability to perform multistep tasks is noticeably better than similar tools but struggles with complex tasks (Reece Rogers/Wired)
https://www.wired.com/story/google-chrome-auto-browse-hands-on/
OpenAI is rolling out a HIPAA-compliant version of ChatGPT for clinicians to assist with medical reasoning and administrative tasks, at Cedars-Sinai and others (Shirin Ghaffary/Bloomberg)
https://www.bloomberg.com/news/newsletters
Microsoft launches Copilot Cowork, integrating Anthropic's Claude Cowork tech into Microsoft 365 and using Work IQ to ground actions in organizational data (Charles Lamanna/Microsoft 365 Blog)
https://www.microsoft.com/en-us/microsoft-
Local governments across China are funding dozens of "robot training centers", where human trainers mimic movements like folding clothes to teach the robots (Rest of World)
https://restofworld.org/2026/china-robots-training-centers-workers/
Anthropic says Opus 4.6 adds "agent teams" that can split larger tasks into segmented jobs and integrates Claude directly into PowerPoint via a side panel (Lucas Ropek/TechCrunch)
https://techcrunch.com/2026/02/05/anthropic-releases-opus-4-6-with…
OpenAI launches GPT-5.3-Codex, which it says runs 25% faster, enabling longer-running tasks, and "is our first model that was instrumental in creating itself" (David Gewirtz/ZDNET)
https://www.zdnet.com/article/openai-gpt-5-3-codex-faster-goes-beyond-c…
London-based Lawhive, whose lawyers and AI tools help individuals and SMBs automate legal tasks, raised a $60M Series B, after a $40M Series A in December 2024 (Jeremy Kahn/Fortune)
https://fortune.com/2026/02/05/lawhive-ai-law-firm-startup-series-b-ventu…
Anthropic says Claude Opus 4.6 supports a 1M context window, scored 90.2% on BigLaw Bench, the highest for any Claude model, and boosts agentic capabilities (David Gewirtz/ZDNET)
https://www.zdnet.com/article/anthropic-claude-opus-4-6-first-try-work-deliv…
AMD unveils Ryzen AI 400 Series AI PC chips with 12 CPU cores, claiming 1.3x faster multitasking and 1.7x faster content creation than rivals (Rebecca Szkutak/TechCrunch)
https://techcrunch.com/2026/01/05/amd-unveils-new-ai-pc-processors-…
Internal memos: Meta said Avocado is its "most capable pre-trained base model" and achieves 10x compute efficiency "wins" on text tasks vs. Llama 4 Maverick (Jyoti Mann/The Information)
https://www.theinformation.com/articles/meta-memo-new…
Fieldguide, which uses AI agents to automate accounting and auditing tasks, raised a $75M Series C led by Goldman Sachs Alternatives at a $700M valuation (Leo Schwartz/Fortune)
https://fortune.com/2026/02/02/goldman-sachs-fieldguid…
A look at Hyundai's Atlas humanoid robot, slated for assembly tasks in 2028; Hyundai has invested billions in robotics since acquiring Boston Dynamics in 2021 (Hyonhee Shin/Bloomberg)
https://www.bloomberg.com/news/articles/20
Early data show wages are rising for AI-exposed jobs that place a high value on a "worker's tacit knowledge and experience", as textbook knowledge loses value (J. Scott Davis/Federal Reserve Bank of Dallas)
https://www.dallasfed.org/research/economics/2026/0224
Multiple AWS developers say they are asked to take on new roles with AI tools' assistance, and engineers are now required to complete technical writing tasks (Financial Times)
https://www.ft.com/content/433f41f2-bf6d-4bdf-a561-50ab516bc62d
Anthropic details an experiment on whether AI coding tools shape developer skills, finding that the biggest performance gap appears in debugging tasks (Anthropic)
https://www.anthropic.com/research/AI-assistance-coding-skills
Poetiq, which leverages existing LLMs to create "expert agents" for specific tasks, and spent just $40K to achieve high ARC-AGI-2 scores, raised a $45.8M seed (Ian Krietzberg/Puck)
https://puck.news/how-poetiqs-six-person-team-beat-google-at-ai/
Airtable unveils Superagent, a service that can deploy AI agents in parallel for tasks like market analysis, its first standalone product in its 13-year history (Connie Loizos/TechCrunch)
https://techcrunch.com/2026/01/27/airt
Basis, which builds AI agents to help accounting firms with tasks like tax returns, raised $100M led by Accel at a $1.15B valuation, for $138M in total funding (Rebecca Torrence/Bloomberg)
https://www.bloomberg.com/news/articles/20
Moonshot says Kimi K2.5 builds on K2 with "pretraining over ~15T mixed visual and text tokens" and "can self-direct an agent swarm with up to 100 sub-agents" (Kimi)
https://www.kimi.com/blog/kimi-k2-5.html
Encord, whose software helps companies developing AI models manage training data for robots and other uses, raised $60M at a $500M pre-money valuation (Rocket Drew/The Information)
https://www.theinformation.com/articles/robot-data-startup-raises-60-million
Beijing-based DP Technology, which develops AI tools used by researchers for tasks like computer-aided drug design and battery design, raised a ~$114M Series C (Eunice Xu/South China Morning Post)
https://www.scmp.com/business/companies/ar
China's MiniMax releases M2.1, an upgrade to its open-source M2 model that it says has "significantly enhanced" coding capabilities in Rust, Java, and others (MiniMax)
https://www.minimax.io/news/minimax-m21
METR: Claude Opus 4.5 has a 50% task completion time horizon of about 4 hours and 49 minutes, more than double that of Claude Opus 4 released earlier this year (@metr_evals)
https://x.com/metr_evals/status/2002203627377574113
Legal AI startup Ivo, which aims to reduce hallucinations by breaking legal reviews into 400 tasks, raised a $55M Series B at a $355M valuation, a source says (Aditya Soni/Reuters)
https://www.reuters.com/technology/legal-ai-startup…