2026-04-30 19:45:49
North Korea-linked hackers stole ~$577M across the Drift Protocol and KelpDAO hacks in April, accounting for 76% of total crypto hack losses so far in 2026 (TRM Insights)
https://www.trmlabs.com/resources/blog/north-korea-sto…
North Korea-linked hackers stole ~$577M across the Drift Protocol and KelpDAO hacks in April, accounting for 76% of total crypto hack losses so far in 2026 (TRM Insights)
https://www.trmlabs.com/resources/blog/north-korea-sto…
"Europe could source half its critical materials from waste by 2050, study finds"
#Europe #Minerals #Resources
Space Force guardians provided critical support during high-profile U.S. military operations in Iran and Venezuela
— experience that underscores the need for additional resources to prepare the service for future conflicts,
a senior official told DefenseScoop in an exclusive interview Friday.
Operation Midnight Hammer and Operation Absolute Resolve were carried out by the joint force in June 2025 and January 2026, respectively.
Pentagon leadership have touted the rai…
3am on a public holiday is a good time for posting bug reproducers...
https://github.com/kube-rs/envtest/issues/58
Closed-Loop Integrated Sensing, Communication, and Control for Efficient Drone Flight
Jingli Li, Yiyan Ma, Bo Ai, Wei Chen, Weijie Yuan, Qingqing Cheng, Tongyang Xu, Guoyu Ma, Mi Yang, Yunlong Lu, Wenwei Yue, Christos Masouros, Zhangdui Zhong
https://arxiv.org/abs/2603.29220 https://arxiv.org/pdf/2603.29220 https://arxiv.org/html/2603.29220
arXiv:2603.29220v1 Announce Type: new
Abstract: Low-altitude wireless networks (LAWN) require drones to follow specific trajectories controlled by ground base stations (GBSs). However, given complex low-altitude channel conditions and limited spectrum and power resources, sensing errors and wireless link unreliability cannot be ignored, leading to trajectory deviations that threaten flight safety. To address this issue, this paper proposes an integrated sensing-communication-control (ISCC) closed-loop trajectory tracking approach, aiming to reveal the coupling mechanisms among communication, sensing, and control during drone flight. In detail, we incorporate sensing errors in trajectory state estimation, packet losses in control command transmission, and finite blocklength transmission effects into the closed-loop dynamics. First, through theoretical analysis, we identify the dominant role of the time-frequency resources allocated to control in ensuring system stability and derive a lower bound on the resources required to guarantee stable operation. Second, to minimize tracking error, we formulate a time-frequency resource allocation optimization problem for the sensing, communication, and control components, subject to constraints on communication rate and closed-loop stability. Accordingly, a solution algorithm based on successive convex approximation is proposed. Third, simulation results indicate that once stability is ensured, system performance is primarily determined by sensing accuracy, with the trajectory tracking error exhibiting an approximately linear dependence on the position error bound. Finally, it is shown that the proposed ISCC scheme avoids trajectory divergence under FBL transmission compared with ISCC designs ignoring control packet loss, and could achieve decimeter-level average tracking accuracy, reducing the error to only 17.37% of that observed in the baseline global navigation satellite system scheme.
toXiv_bot_toot
The war in Iran is threatening the global supply of helium, an essential component in cooling chip-making tools.
https://www.computing.co.uk/news/2026/chips-components/iran-war-chokes…
Why not eliminate single-representative districts and switch to some form of proportional representation? (Or just go with nationwide independent redistricting commissions.) Too much time and resources are spent on gerrymandering and redistricting legal warfare.
But of course, those that benefit under the current system (*cough* GOP *cough*) won’t change it.
#uspol
Gee, I wonder who we can blame for this?
The US auto industry is capable of building decent EVs. We have plenty of examples, some of which have been withdrawn from the market.
Most of us here in the US who drive EV's haven't been affected much (directly) by the Iran war increase in gasoline prices. But everyone with a gas/diesel motor is affected. And it appears that the US Congress and the trumpies are celebrating our dependence, our wasted resources, and our inability…
The US OMB recently published a new logging reference architecture M-26-14 replacing 2021's M-21-31.
I'm not familiar enough with CISA and the bureaucracy to confidently provide a useful analysis, but the two different memorandums are noticeably different that I'm sure someone with more familiarity could say something interesting about the potential consequences, positive or negative.
LinkedIn Is Scanning Your Browser Extensions. This Is How They Use the Data.
https://404privacy.com/blog/linkedin-is-scanning-your-browser-extensions-this-is-how-they-use-the-data/
I actually think that governments should be acting to allow copper lines to be replaced, but just the language choice of "divert precious resources to the maintenance of deteriorating legacy networks" screams regulatory capture.
https://www.theregister.com/2026/03/30/fcc
Google researchers warn that quantum computers may crack elliptic-curve cryptography, which helps secure crypto wallets, with 20x fewer resources than expected (Bloomberg)
https://www.bloomberg.com/news/articles/2026-03-31/g…
EarlySciRev: A Dataset of Early-Stage Scientific Revisions Extracted from LaTeX Writing Traces
L\'eane Jourdan, Julien Aubert-B\'educhaud, Yannis Chupin, Marah Baccari, Florian Boudin
https://arxiv.org/abs/2603.28515 https://arxiv.org/pdf/2603.28515 https://arxiv.org/html/2603.28515
arXiv:2603.28515v1 Announce Type: new
Abstract: Scientific writing is an iterative process that generates rich revision traces, yet publicly available resources typically expose only final or near-final versions of papers. This limits empirical study of revision behaviour and evaluation of large language models (LLMs) for scientific writing. We introduce EarlySciRev, a dataset of early-stage scientific text revisions automatically extracted from arXiv LaTeX source files. Our key observation is that commented-out text in LaTeX often preserves discarded or alternative formulations written by the authors themselves. By aligning commented segments with nearby final text, we extract paragraph-level candidate revision pairs and apply LLM-based filtering to retain genuine revisions. Starting from 1.28M candidate pairs, our pipeline yields 578k validated revision pairs, grounded in authentic early drafting traces. We additionally provide a human-annotated benchmark for revision detection. EarlySciRev complements existing resources focused on late-stage revisions or synthetic rewrites and supports research on scientific writing dynamics, revision modelling, and LLM-assisted editing.
toXiv_bot_toot
november17: November17 members (2009)
A network representing connections among members of the November 17 (N17) Greek terrorist group. Nodes are members, and an edge exists if two members have some connection in the past. Metadata include role, function, resources. Some attributes are missing.
This network has 22 nodes and 66 edges.
Tags: Social, Offline, Unweighted, Metadata
I’m glad the Wisconsin Bike Fed is getting in front of the ebike issue.
#bikeTooter
Statement in Response to DOJ Demand to Drop Ballroom Lawsuit (Carol Quillen/@SavingPlaces)
https://savingplaces.org/press-center/media-resources/statement-from-nationaltrust-4-27-2026
http://www.memeorandum.com/260427/p92#a260427p92
from my link log —
GPU architecture resources.
https://interplayoflight.wordpress.com/2020/05/09/gpu-architecture-resources/
saved 2020-05-15
In just 14 years, the Swiss glaciers lost almost 25% of their 2010 ice volume, which corresponds to a wastage of around 15 km3 of ice. New paper:
https://tc.copernicus.org/articles/20/3111/2026/
"Marking Gender: A Critical Analysis of Gender Representation in Library of Congress Subject Headings"
https://doi.org/10.5860/lrts.70n2.8680
"Although the problem of bias, prejudice, and marginalization has long been a subject of critical reflection and inquiry in library knowl…
FPF Releases Practitioner Guides on Privacy Enhancing Technologies for Education Stakeholders
https://fpf.org/blog/fpf-releases-practitioner-guides-on-privacy-enhancing-technologies-for-education-stakeholders/
Field staff at the federal agency that enforces civil rights laws in the workplace
say they are under intense pressure from leadership to bring in cases that fit the Trump administration’s priorities,
i.e. charges of discrimination against white men and charges of antisemitism on college campuses.
That pressure has led investigators and lawyers at the agency,
the Equal Employment Opportunity Commission,
to focus its thin resources on pursuing and fast-tracking c…
"Rainforests pushed to breaking point by new demands for resources, report says"
https://www.theguardian.com/environment/2026/may/20/rainforests-pushed-to-breaking-point-by-new-demands-for-resources-re…
"Rainforests pushed to breaking point by new demands for resources, report says"
https://www.theguardian.com/environment/2026/may/20/rainforests-pushed-to-breaking-point-by-new-demands-for-resources-re…
There are international and national issues to follow, but there are always local ones too. Here in MN, the legislative fight over mining in the Boundary Waters is a big deal. It feels like salt in the wound to threaten the crown jewel of MN’s outdoor resources while also messing everything else up. #nokings
After 17 years of many manuscripts cleaned submitted to #arxiv, I finally automated and wrote down the process, see: https://github.com/NERDSITU/research-resources/blob/main/a…
New research reveals repeated flooding is altering Florida freshwater resources #florida
Replaced article(s) found for cs.CL. https://arxiv.org/list/cs.CL/new
[3/5]:
- Can Small Language Models Handle Context-Summarized Multi-Turn Customer-Service QA? A Synthetic D...
Lakshan Cooray, Deshan Sumanathilaka, Pattigadapa Venkatesh Raju
https://arxiv.org/abs/2602.00665 https://mastoxiv.page/@arXiv_csCL_bot/116006686092324902
- SEAD: Self-Evolving Agent for Multi-Turn Service Dialogue
Dai, Gao, Zhang, Wang, Luo, Wang, Wang, Wu, Wang
https://arxiv.org/abs/2602.03548
- OmniRAG-Agent: Agentic Omnimodal Reasoning for Low-Resource Long Audio-Video Question Answering
Yifan Zhu, Xinyu Mu, Tao Feng, Zhonghong Ou, Yuning Gong, Haoran Luo
https://arxiv.org/abs/2602.03707
- GreekMMLU: A Native-Sourced Multitask Benchmark for Evaluating Language Models in Greek
Zhang, Konomi, Xypolopoulos, Divriotis, Skianis, Nikolentzos, Stamou, Shang, Vazirgiannis
https://arxiv.org/abs/2602.05150
- Using LLMs for Knowledge Component-level Correctness Labeling in Open-ended Coding Problems
Zhangqi Duan, Arnav Kankaria, Dhruv Kartik, Andrew Lan
https://arxiv.org/abs/2602.17542 https://mastoxiv.page/@arXiv_csCL_bot/116102514058414603
- MetaState: Persistent Working Memory Enhances Reasoning in Discrete Diffusion Language Models
Kejing Xia, Mingzhe Li, Lixuan Wei, Zhenbang Du, Xiangchi Yuan, Dachuan Shi, Qirui Jin, Wenke Lee
https://arxiv.org/abs/2603.01331 https://mastoxiv.page/@arXiv_csCL_bot/116165314672421581
- A Browser-based Open Source Assistant for Multimodal Content Verification
Milner, Foster, Karmakharm, Razuvayevskaya, Roberts, Porcellini, Teyssou, Bontcheva
https://arxiv.org/abs/2603.02842 https://mastoxiv.page/@arXiv_csCL_bot/116170368271004704
- Nw\=ach\=a Mun\=a: A Devanagari Speech Corpus and Proximal Transfer Benchmark for Nepal Bhasha ASR
Sharma, Shrestha, Poudel, Tiwari, Shrestha, Ghimire, Bal
https://arxiv.org/abs/2603.07554 https://mastoxiv.page/@arXiv_csCL_bot/116204797995674104
- Model Merging in the Era of Large Language Models: Methods, Applications, and Future Directions
Mingyang Song, Mao Zheng
https://arxiv.org/abs/2603.09938 https://mastoxiv.page/@arXiv_csCL_bot/116210189810004206
- AgentDrift: Unsafe Recommendation Drift Under Tool Corruption Hidden by Ranking Metrics in LLM Ag...
Zekun Wu, Adriano Koshiyama, Sahan Bulathwela, Maria Perez-Ortiz
https://arxiv.org/abs/2603.12564 https://mastoxiv.page/@arXiv_csCL_bot/116237800898328349
- GhanaNLP Parallel Corpora: Comprehensive Multilingual Resources for Low-Resource Ghanaian Languages
Gyamfi, Azunre, Moore, Budu, Asare, Owusu, Asiamah
https://arxiv.org/abs/2603.13793 https://mastoxiv.page/@arXiv_csCL_bot/116243544688031749
- sebis at ArchEHR-QA 2026: How Much Can You Do Locally? Evaluating Grounded EHR QA on a Single Not...
Ibrahim Ebrar Yurt, Fabian Karl, Tejaswi Choppa, Florian Matthes
https://arxiv.org/abs/2603.13962 https://mastoxiv.page/@arXiv_csCL_bot/116243646346563497
- ExPosST: Explicit Positioning with Adaptive Masking for LLM-Based Simultaneous Machine Translation
Yuzhe Shang, Pengzhi Gao, Yazheng Yang, Jiayao Ma, Wei Liu, Jian Luan, Jinsong Su
https://arxiv.org/abs/2603.14903 https://mastoxiv.page/@arXiv_csCL_bot/116243711232778054
- BanglaSocialBench: A Benchmark for Evaluating Sociopragmatic and Cultural Alignment of LLMs in Ba...
Tanvir Ahmed Sijan, S. M Golam Rifat, Pankaj Chowdhury Partha, Md. Tanjeed Islam, Md. Musfique Anwar
https://arxiv.org/abs/2603.15949 https://mastoxiv.page/@arXiv_csCL_bot/116249122231759766
- EngGPT2: Sovereign, Efficient and Open Intelligence
G. Ciarfaglia, et al.
https://arxiv.org/abs/2603.16430 https://mastoxiv.page/@arXiv_csCL_bot/116249228411487178
- HypeLoRA: Hyper-Network-Generated LoRA Adapters for Calibrated Language Model Fine-Tuning
Bartosz Trojan, Filip G\k{e}bala
https://arxiv.org/abs/2603.19278 https://mastoxiv.page/@arXiv_csCL_bot/116277612915482857
- Automatic Analysis of Collaboration Through Human Conversational Data Resources: A Review
Yi Yu, Maria Boritchev, Chlo\'e Clavel
https://arxiv.org/abs/2603.19292 https://mastoxiv.page/@arXiv_csCL_bot/116277620779254916
- Alignment Whack-a-Mole : Finetuning Activates Verbatim Recall of Copyrighted Books in Large Langu...
Xinyue Liu, Niloofar Mireshghallah, Jane C. Ginsburg, Tuhin Chakrabarty
https://arxiv.org/abs/2603.20957 https://mastoxiv.page/@arXiv_csCL_bot/116283538317671552
- KG-Hopper: Empowering Compact Open LLMs with Knowledge Graph Reasoning via Reinforcement Learning
Shuai Wang, Yinan Yu
https://arxiv.org/abs/2603.21440 https://mastoxiv.page/@arXiv_csCL_bot/116283595007808076
toXiv_bot_toot
TIL¹: witr² – Why is this running?
An interactive TUI that shows all the details about processes, their resources, environment, parents and children. Like ps, htop, lsof, ss etc. combine for a process-centered view.
I installed the forky Debian package³ on Debian trixie and it just works.
I played around with it and immediately liked id.
__
¹ok, technically it was yesterday. Thanks to @…
Working on extending features for Warlock, and this round of features requires a backend web service running in a centralized, controlled environment due to the requirement of privileged access to partner network resources, (aka, they require an API key and prior authorization to access certain data, thus cannot be distributed in an open source project).
SO, since this is a traditional web service, I opted to use the traditional technologies to power it, but wanted to try out Symfony s…
A hierarchy of spatial predictions across human visual cortex during natural vision #neuroscience
#Python is just doing great. We're not having impossible constraints, as some projects need old #setuptools for pkg_resources, and other projects are starting to require newer setuptools for some fancy new features. And ofc after promising to release pkg_resources standalone over a month ago, setuptools upstream didn't deliver.
#Gentoo
ScaleOps, which makes automated cloud spend tools, raised a $130M Series C led by Insight Partners at an $800M valuation, bringing its total funding to $210M (Meir Orbach/CTech)
https://www.calcalistech.com/ctechnews/article/skywz1dowe
The #Medieval Genealogy is one of the most useful portals to Medieval historical resources. Now with its 114th update. https://www.medievalgenealogy.org.uk/updates/update.shtml…
How some independent tech reporters are using AI, which they say allows them to do more reporting and recreate newsroom resources like editors and fact-checkers (Maxwell Zeff/Wired)
https://www.wired.com/story/tech-reporters-using-ai-write-edit-stories/
the "Deal man" from the Dover Museum sure looks familiar #stonks
https://www.dovermuseum.co.uk/Information-Resources/The-Collection/The-Deal-Man.as…
"Citation Analysis for Acquisitions Reveals Better Grades for Students Using Library Resources"
https://pal-ojs-tamu.tdl.org/pal/article/view/7214
"This study investigates the use of library resources by first-year students (English 101/102) at the University…
"Portugal has just used up its natural resources for 2026. Is the rest of Europe doing any better?"
#Portugal #Resources
In the next 2 weeks we will be ready with our anti surveillance campaign that will educate people that they are cameras everywhere and we need to notice them, map them or do something even more creative with them.
We will use some #openstreetMap resources for that and work with a few artists to create a vision for "something is looking at me, and I had not noticed".
We ne…
Mick Shots: How ’bout something good for today https://www.dallascowboys.com/news/mick-shots-how-bout-something-good-for-today
The fact the United States is currently creating facts claiming that Cuba isn't being starved by them, but that the Cuban government is actually at fault and using all the resources which would otherwise help the population (somehow), really makes me think about all the other times I've heard this rhetoric in history.
Working on extending features for Warlock, and this round of features requires a backend web service running in a centralized, controlled environment due to the requirement of privileged access to partner network resources, (aka, they require an API key and prior authorization to access certain data, thus cannot be distributed in an open source project).
SO, since this is a traditional web service, I opted to use the traditional technologies to power it, but wanted to try out Symfony s…
AFT boss Randi Weingarten tapped union resources worth over $1.4M to write 'manifesto' book (Carl Campanile/New York Post)
https://nypost.com/2026/05/19/us-news/aft-boss-randi-weingarten-tapped-union-resources-worth-over-1-4m-to-write-manifesto-book/
http://www.memeorandum.com/260519/p88#a260519p88
If you don’t have the resources to write and understand the code yourself, you don’t have the resources to maintain it either.
Any monkey with a keyboard can write code. Writing code has never been hard. People were churning out crappy code en masse way before generative AI and LLMs. I know because I’ve seen it, I’ve had to work with it, and I no doubt wrote (and continue to write) my share of it.
What’s never been easy, and what remains difficult, is figuring out the right probl…
Why do we need services like the Museum Data Service?
'Scientists Keep Finding Major Discoveries Lurking in Museum Backrooms' https://www.sciencealert.com/scientists-keep-finding-major-discoveries-lurking-in-museum-backroo…
Tools of Repression - PHR #immigration #protests
“When you have excellent resources, be it physical, medical, psychological, social, educational, that does increase player readiness.”
https://www.theguardian.com/sport/2026/mar/20/teen-sensatio…
Explore NutritionFacts.org Resources #nutrition
So one of the authors is Nicholas Carlini, who works for Anthropic. This is basically an ad for the three letter agencies to use Claude. It massively over-promises compared to what the actual paper says.
But, it is important. First, this is really about silencing people. The threat of identification is designed to make people afraid to talk online. There's a massive asymmetry between the fascists and the people. The fascists are weird racists and pedophiles who are obsessed with control. No one likes them. No one likes their ideas, because their ideas are creepy and bad.
When they talk about their ideas, that people should be murdered or kidnaped based on their skin color, that there should be a national dress code, that people's sex lives should be monitored, that children should be treated like objects that are owned by the parent (specifically, one parent), that people with different skin color or uteri should be considered as livestock, people fucking hate it because it's awful. When we talk about our ideas, that everyone should be able to eat and take care of themselves, that people who can't take care of themselves should be taken care of, that we should live in a society that values life, that we should live in harmony with nature, people like those ideas. When fascists out us for talking about those ideas, people support us. When we out people who are working as fascist goons those people have to face social consequences.
Everyone hates these people. The US government is currently less popular than it has ever been. The only way they can keep power is by making everyone think that they aren't extraordinarily unpopular. The only way to do that, the way authoritarian have always done it, is to make everyone afraid to talk.
But, yes, what this paper is saying is actually kind of bad. It looks like people who don't take any precautions at all in separating identities can be identified about 30% of the time (based on the results). It's unclear how this will actually work in the real world. Larger corpses will probably have more data, making connecting things easier.
This isn't as good as a human trying to dox someone. It's not going to work as well. It may only work in a small number of cases. There will be false positives (just like there are with people doing the work). It's probably not cheaper than hiring people. But it does mean that you can just dump money into a machine that has no ethical framework and get data out. That's the point. It's hard to find humans who will do evil shit like help dictatorships target human rights activists, but if a machine can do it for twice the price then it's a better deal for the dictatorship.
For most people, you just shouldn't care. This isn't for you. As long as you keep doing what you're doing, and you can keep everyone else doing what they're doing, then there aren't enough resources to actually target you. Even if they know who you are, there are just too many people who hate them and too few goons.
For people who might actually be targeted, there are a lot of things. First, keep in mind what you're putting into anonymous accounts. Any feature that's connected to your real life is a feature that can be extracted to identify you. This has always been true, it just may be easier to find now. Your identities should be totally siloed. It's also harder to identify you if you're writing anonymously as a collective. Collectives are better anyway because they can help check your thinking. When you write as a collective, you can help clean up each other's personal details and language. A collective develops its own voice, which is distinct from individual contributors. If you do this, and you also present your work as being from one "person," then it becomes even harder for anyone (systems or individuals) to really figure it out.
I'm not going to do a full deep dive on this because I just don't have time, but your existing threat model should *already cover these threats* if you need to make sure your writing remains anonymous.
This paper doesn't present any novel methodologies. It just extracts a bunch of features, which a human would extract as notes, and tries to correlate those between identities, which is how human researchers work. Linguistic forensics were mentioned (not by name) in the paper, but the actual methodology doesn't actually seem to use them.
So a thing with less ethics can do a worse job for more money (when adjusted for the real, not investor deflated, price of tokens). It's worth knowing. It's not the end of the world, but it is a good reminder to check your threat model and make sure it's up to date.
Okta reports Q1 revenue up 11% YoY to $765M, vs. $752M est., says the agentic AI build-out is spiking demand for its identity tools; OKTA jumps 7% after hours (Samantha Subin/CNBC)
https://www.cnbc.com/2026/05/28/okta-okta-earnings-q1-2027.html
Pattern Formation in a Spatial Public Goods Dilemma due to Diffusive or Directed Motion
Yuxuan Zhao, Kaisheng Zhu, Yefei Zhang, Daniel B. Cooney
https://arxiv.org/abs/2603.21025 https://arxiv.org/pdf/2603.21025 https://arxiv.org/html/2603.21025
arXiv:2603.21025v1 Announce Type: new
Abstract: The costly provision of public goods serves as a model problem for the evolution of cooperative behavior, presenting a social dilemma between the collective benefits of shared resources and the individual incentive to free-ride in resource production. The spatial structure of populations can also impact cooperation over public goods, as diffusion of public goods and intentional motion of individuals towards regions with greater resources can interact with population and public goods dynamics to produce heterogeneous patterns in the spatial distribution of strategies and resources. In this paper, we build off a model introduced by Young and Belmonte for the reaction dynamics of interacting individuals and explicit public good, deriving a system of PDEs that describes the spatial profiles of strategies and the public good in the presence of both diffusive motion of individuals and resources and chemotaxis-like directed motion of individuals in response to gradients in the concentration of public goods. Through linear stability analysis, we show that spatial patterns in strategic and public goods profiles can emerge due to either Turing instability with high defector diffusivity or a directed-motion instability through strong sensitivity of cooperators towards increasing resource concentration. We further explore the emergent spatial patterns with a mix of weakly nonlinear stability analysis and numerical simulation, showing that diffusion-driven instability appears to increase cooperation and public goods across the spatial domain, while directed motion of cooperators towards regions with great public goods provision tends to decrease cooperation and environmental quality across the environment.
toXiv_bot_toot
LombardoGraphia: Automatic Classification of Lombard Orthography Variants
Edoardo Signoroni, Pavel Rychl\'y
https://arxiv.org/abs/2603.28418 https://arxiv.org/pdf/2603.28418 https://arxiv.org/html/2603.28418
arXiv:2603.28418v1 Announce Type: new
Abstract: Lombard, an underresourced language variety spoken by approximately 3.8 million people in Northern Italy and Southern Switzerland, lacks a unified orthographic standard. Multiple orthographic systems exist, creating challenges for NLP resource development and model training. This paper presents the first study of automatic Lombard orthography classification and LombardoGraphia, a curated corpus of 11,186 Lombard Wikipedia samples tagged across 9 orthographic variants, and models for automatic orthography classification. We curate the dataset, processing and filtering raw Wikipedia content to ensure text suitable for orthographic analysis. We train 24 traditional and neural classification models with various features and encoding levels. Our best models achieve 96.06% and 85.78% overall and average class accuracy, though performance on minority classes remains challenging due to data imbalance. Our work provides crucial infrastructure for building variety-aware NLP resources for Lombard.
toXiv_bot_toot
The other day, my friend shared this video on consulting collectives, which is when "a small group of independent consultants or freelancers share clients, resources, and revenue on specific projects while maintaining their independence on others" :
https://www.instagram.com/p/DUDtOqGkTmY/…
@… Hello, Evan.
We're considering ways to prevent the URL of an ActivityPub Note object from being a URL of a completely unrelated third party, as this can be undesirable.
Do you know of any resources that describe the specifications …
Today is UN World Day for Glaciers, so I'm re-posting some relevant messages & personal experiences with disappearing glaciers in the Alps:
Interactive visualization of changes in glacier length
https://mastodon.thi.ng/@toxi/113191569344705725
One of the birth …
There are some changes for the 2026 tax filing season that people who are 65 years of age and older should be aware of.
The most recent being the enhanced deduction for seniors
https://www.irs.gov/newsroom/2026-filing-season-updates-and-resources-for-…
Do you have experience developing Open Educational Resources (OER) or developing educational software? The Journal of Open Source Education (JOSE) is looking for reviewers to help with our check-list driven peer review process. Happy to answer questions about reviewing for JOSE if you're interested.
https://forms.gle/Rizd3TcHnQKhrbYY7
Sony Pictures plans to close VFX and virtual production company Pixomondo and integrate some of its resources into other areas; Sony purchased Pixomondo in 2022 (Jon Creamer/Televisual)
https://www.televisual.com/news/sony-pictures-to-wind-down-pixomondo/
Short-term survival of #tardigrades (Ramazzottius cf. varieornatus and Hypsibius exemplaris) in #martian #regolith simulants (MGS-1 and OUCM-1): https://www.cambridge.org/core/journals/international-journal-of-astrobiology/article/shortterm-survival-of-tardigrades-ramazzottius-cf-varieornatus-and-hypsibius-exemplaris-in-martian-regolith-simulants-mgs1-and-oucm1/8A91986096FB533FB264DD056F549DF2 -> ‘Water bears’ reveal potential for adapting, protecting Martian resources: https://www.psu.edu/news/research/story/water-bears-reveal-potential-adapting-protecting-martian-resources - microscopic tardigrades help inform how simulated Martian soil might support plant life and mitigate contaminants shedding from human explorers, researchers report -> Scientists Finally Found Something Tardigrades Can’t Survive: https://gizmodo.com/scientists-finally-found-something-tardigrades-cant-survive-2000728358 - tardigrades are practically invincible on Earth, so scientists looked to outer space in search of their kryptonite.
The Accessibility Conformance Testing (ACT) Task Force has published its rules, letting you filter automated rules:
https://www.w3.org/WAI/standards-guidelines/act/rules/?requirements=a,aa&status=approved&imp…
I guess my work laptop will need to have a catastrophic failure before it will be replaced. Request for replacement was denied. It was supposed to be a new machine, when I arrived two years ago it was 'sorry, your laptop was assigned to someone else. Here is a refurbished model almost at end of life'.
Multiple black screens. Cannot connect two monitors, keyboard, mouse and headset without it complaining about resources in the USB-C hub. Monitors won't show full resolutio…
Now, more than ever, it's essential to practice good mental hygiene, work on emotional regulation, and guard against social media psyops. You never know when you'll need those inner resources.
Crosslisted article(s) found for cs.CL. https://arxiv.org/list/cs.CL/new
[1/2]:
- Bridge-RAG: An Abstract Bridge Tree Based Retrieval Augmented Generation Algorithm With Cuckoo Fi...
Li, Liu, Zong, Tao, Dai, Ren, Liu, Jiang, Yang
https://arxiv.org/abs/2603.26668 https://mastoxiv.page/@arXiv_csIR_bot/116322781593134028
- SRAG: RAG with Structured Data Improves Vector Retrieval
Shalin Shah, Srikanth Ryali, Ramasubbu Venkatesh
https://arxiv.org/abs/2603.26670 https://mastoxiv.page/@arXiv_csIR_bot/116322784870180864
- LITTA: Late-Interaction and Test-Time Alignment for Visually-Grounded Multimodal Retrieval
Seonok Kim
https://arxiv.org/abs/2603.26683 https://mastoxiv.page/@arXiv_csIR_bot/116322841916406330
- Agentic AI for Human Resources: LLM-Driven Candidate Assessment
Yuksel, Anees, Elneima, Hewavitharana, Al-Badrashiny, Sawaf
https://arxiv.org/abs/2603.26710 https://mastoxiv.page/@arXiv_csIR_bot/116322937601675587
- SEAR: Schema-Based Evaluation and Routing for LLM Gateways
Zecheng Zhang, Han Zheng, Yue Xu
https://arxiv.org/abs/2603.26728 https://mastoxiv.page/@arXiv_csDB_bot/116322627580095245
- SleepVLM: Explainable and Rule-Grounded Sleep Staging via a Vision-Language Model
Guifeng Deng, Pan Wang, Jiquan Wang, Shuying Rao, Junyi Xie, Wanjun Guo, Tao Li, Haiteng Jiang
https://arxiv.org/abs/2603.26738 https://mastoxiv.page/@arXiv_csCV_bot/116322739676378309
- Aesthetic Assessment of Chinese Handwritings Based on Vision Language Models
Chen Zheng, Yuxuan Lai, Haoyang Lu, Wentao Ma, Jitao Yang, Jian Wang
https://arxiv.org/abs/2603.26768 https://mastoxiv.page/@arXiv_csCV_bot/116323078149576728
- Learning to Select Visual In-Context Demonstrations
Eugene Lee, Yu-Chi Lin, Jiajie Diao
https://arxiv.org/abs/2603.26775 https://mastoxiv.page/@arXiv_csLG_bot/116322648878995047
- CRISP: Characterizing Relative Impact of Scholarly Publications
Hannah Collison, Benjamin Van Durme, Daniel Khashabi
https://arxiv.org/abs/2603.26791 https://mastoxiv.page/@arXiv_csDL_bot/116322621679820997
- GroupRAG: Cognitively Inspired Group-Aware Retrieval and Reasoning via Knowledge-Driven Problem S...
Xinyi Duan, Yuanrong Tang, Jiangtao Gong
https://arxiv.org/abs/2603.26807 https://mastoxiv.page/@arXiv_csIR_bot/116322959557860848
- In your own words: computationally identifying interpretable themes in free-text survey data
Jenny S Wang, Aliya Saperstein, Emma Pierson
https://arxiv.org/abs/2603.26930 https://mastoxiv.page/@arXiv_csCY_bot/116322780637316287
- Multilingual Stutter Event Detection for English, German, and Mandarin Speech
Felix Haas, Sebastian P. Bayerl
https://arxiv.org/abs/2603.26939 https://mastoxiv.page/@arXiv_csSD_bot/116322704289189130
- FormalProofBench: Can Models Write Graduate Level Math Proofs That Are Formally Verified?
Ravi, Ying, Nesterov, Krishnan, Uskuplu, Xia, Aswedige, Nashold
https://arxiv.org/abs/2603.26996 https://mastoxiv.page/@arXiv_csAI_bot/116322625941412681
- PHONOS: PHOnetic Neutralization for Online Streaming Applications
Waris Quamer, Mu-Ruei Tseng, Ghady Nasrallah, Ricardo Gutierrez-Osuna
https://arxiv.org/abs/2603.27001 https://mastoxiv.page/@arXiv_eessAS_bot/116322763598554193
- ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding
Jovana Kondic, et al.
https://arxiv.org/abs/2603.27064 https://mastoxiv.page/@arXiv_csCV_bot/116323214468792735
- daVinci-LLM:Towards the Science of Pretraining
Qin, Liu, Mi, Xie, Huang, Si, Lu, Feng, Wu, Liu, Luo, Hou, Guo, Qiao, Liu
https://arxiv.org/abs/2603.27164 https://mastoxiv.page/@arXiv_csAI_bot/116322653467105951
- LightMover: Generative Light Movement with Color and Intensity Controls
Zhou, Wang, Kim, Shu, Yu, Hold-Geoffroy, Chaturvedi, Wu, Lin, Cohen
https://arxiv.org/abs/2603.27209 https://mastoxiv.page/@arXiv_csCV_bot/116323263295656104
- Self-evolving AI agents for protein discovery and directed evolution
Tan, Zhang, Li, Yu, Zhong, Zhou, Dong, Hong
https://arxiv.org/abs/2603.27303 https://mastoxiv.page/@arXiv_csAI_bot/116322838641595927
- Inference-Time Structural Reasoning for Compositional Vision-Language Understanding
Amartya Bhattacharya
https://arxiv.org/abs/2603.27349 https://mastoxiv.page/@arXiv_csCV_bot/116323280006044500
- LLM Readiness Harness: Evaluation, Observability, and CI Gates for LLM/RAG Applications
Alexandre Cristov\~ao Maiorano
https://arxiv.org/abs/2603.27355 https://mastoxiv.page/@arXiv_csAI_bot/116322987708962414
- Heterogeneous Debate Engine: Identity-Grounded Cognitive Architecture for Resilient LLM-Based Eth...
Jakub Mas{\l}owski, Jaros{\l}aw A. Chudziak
https://arxiv.org/abs/2603.27404 https://mastoxiv.page/@arXiv_csAI_bot/116322999177460352
toXiv_bot_toot
Mick Shots: How ’bout something good for today https://www.dallascowboys.com/news/mick-shots-how-bout-something-good-for-today
Such a long article that can be summarised in "Venezuela is a US's banana republic and there won't be elections any time soon because Trump & friends only care about its natural resources".
Honestly, the article is a shame.
https://…
"For decades we've lived on abundant energy and resources that appeared to be infinite. But that foundation no longer exists."
A tour of every Spanish region with a stellar line-up of speakers.
It's the mind of thing needed in the UK, in addition to the Emergency Briefing film that's being rolled out.
NO ES NORMAL, no fue y no sera
http…
Okay, I'll give you that: LLMs aren't the root of the problem.
Capitalism is. The idea of infinite growth. The idea that people can't just live, they must with 40 hours a week to justify their existence, and they must be purchasing something all the time. Companies must keep selling new stuff. All the resources must be tapped into and exploited.
And companies are making software. They must keep selling new features and pointless complete redesigns nobody wanted. The code must keep being churned over and over again. Programmers must justify their existence by churning out absurd amounts of meaningless code. The companies must exploit them.
Then, companies are entering the #OpenSource "market". They are acquiring and enshittifying. They are hiring and exploiting. And then so many volunteers just jump on the bandwagon and keep cosplaying them. And they too churn out useless code, "sell" pointless complete makeovers, "profit" off their users (even if they actually aren't making any real profit).
And then come LLMs, perfect tools for the job. Perfect tools for exploitation, for churning out useless code, for creating addiction, and for turning everyone into mindless corpospeak bullshit machines.
#AI #LLM #NoAI #NoLLM #AntiCapitalism
"Market effects of open educational resources on U.S. textbook pricing (2003–2024)"
https://doi.org/10.1016/j.acalib.2026.103271
"[...] This work looks at whether the increase in the development and adoption of Open Educational Material between 2003 and 2024 corres…
After Cleveland.com faced backlash for posting AI-generated videos to promote its podcasts, an editor said "we don't have resources to do it any other way" (Sean Keeley/Awful Announcing)
https://awfulannouncing.com/newspapers/clevela…
Beneath Arctic ice, a vast fossil fuel footprint is colliding with Indigenous lands and wildlife #Arctic
"On Library Closing Music: Could It Be a Way to Promote Library Music Resources and Liberal Arts Education?"
https://muse.jhu.edu/article/990720
[closed access 🙄]
november17: November17 members (2009)
A network representing connections among members of the November 17 (N17) Greek terrorist group. Nodes are members, and an edge exists if two members have some connection in the past. Metadata include role, function, resources. Some attributes are missing.
This network has 22 nodes and 66 edges.
Tags: Social, Offline, Unweighted, Metadata
A surge in AI-generated "pro se" cases, or lawsuits filed by self-represented litigants, is democratizing the legal system but consuming more court resources (New York Times)
https://www.nytimes.com/2…
Explosive Misinformation: A Guide to Mushroom Clouds, ‘Sonic Weapons’ and Disintegration - bellingcat https://www.bellingcat.com/resources/2026/03/30/explosive-misinformation-a-guide-to-mushroom-clouds-sonic-weap…
Democratizing AI: A Comparative Study in Deep Learning Efficiency and Future Trends in Computational Processing
Lisan Al Amin, Md Ismail Hossain, Rupak Kumar Das, Mahbubul Islam, Saddam Mukta, Abdulaziz Tabbakh
https://arxiv.org/abs/2603.20920 https://arxiv.org/pdf/2603.20920 https://arxiv.org/html/2603.20920
arXiv:2603.20920v1 Announce Type: new
Abstract: The exponential growth in data has intensified the demand for computational power to train large-scale deep learning models. However, the rapid growth in model size and complexity raises concerns about equal and fair access to computational resources, particularly under increasing energy and infrastructure constraints. GPUs have emerged as essential for accelerating such workloads. This study benchmarks four deep learning models (Conv6, VGG16, ResNet18, CycleGAN) using TensorFlow and PyTorch on Intel Xeon CPUs and NVIDIA Tesla T4 GPUs. Our experiments demonstrate that, on average, GPU training achieves speedups ranging from 11x to 246x depending on model complexity, with lightweight models (Conv6) showing the highest acceleration (246x), mid-sized models (VGG16, ResNet18) achieving 51-116x speedups, and complex generative models (CycleGAN) reaching 11x improvements compared to CPU training. Additionally, in our PyTorch vs. TensorFlow comparison, we observed that TensorFlow's kernel-fusion optimizations reduce inference latency by approximately 15%. We also analyze GPU memory usage trends and projecting requirements through 2025 using polynomial regression. Our findings highlight that while GPUs are essential for sustaining AI's growth, democratized and shared access to GPU resources is critical for enabling research innovation across institutions with limited computational budgets.
toXiv_bot_toot
@13a@mastodon.social
And my question is - given the limited resources (people, materials, equipment) is MEER the best use of those resources, or do we end up with a "sunk cost" related problem :"well we've invested a lot of money here, and temps are down so we don't need to do anything else."
(I'm a chemist, worked in industry & the latter is a known problem; since retiring I have been engaged in advocacy and have seen how politicians respon.…
@13a@mastodon.social
And my question is - given the limited resources (people, materials, equipment) is MEER the best use of those resources, or do we end up with a "sunk cost" related problem :"well we've invested a lot of money here, and temps are down so we don't need to do anything else."
(I'm a chemist, worked in industry & the latter is a known problem; since retiring I have been engaged in advocacy and have seen how politicians respon.…
"Brazil's reserves run on too little funding, with Amazon getting just 20% needed"
#Brazil #Amazon #Environment
"Creating a writing and dissemination toolkit for faculty scholarly writing and publishing"
https://doi.org/10.3897/ese.2026.e183055
Amid a contentious feud with Pope Leo XIV regarding U.S. military interventions over the past several months,
including the war in Iran,
the Trump administration has
⚠️ended an $11 million contract with the Catholic Charities of the Archdiocese of Miami.
The contract through the Office of Refugee Resettlement (ORR) gave funds to the organization to provide housing and other resources for migrant childrenwho entered the country without parents or adult family members.
RE: https://mastodon.social/@hughsie/116272060914902883
It's important to stress what it means to run infrastructure.
In so many cases is it the part that takes most of the resources, far more than the actual development, especially when demand in…
California Committee Approves Bill to Stop Taxpayer Bailout of Oil Companies https://biologicaldiversity.org/w/news/press-releases/california-committee-approves-bill-to-stop-taxpayer-bailout-of-oil-compa…
The California Values Act (SB 54) ensures that no state and local resources are used to assist federal immigration enforcement and that our schools, our hospitals, and our courthouses are safe spaces for everyone in our community.
SB 54 was signed into law on October 5, 2017 and went into effect January 1, 2018.
POLICE AND SHERIFFS:
Cannot ask about your immigration status.
Cannot arrest you only for having a deportation order or for most other immigration violations.
If a convoy came under attack from Iranian missiles or drones, the escorting warship would have only seconds to respond.
Similar escort and air defence efforts have already been seen in the Red Sea against Houthi attacks, so there is a working model.
The problem is that such operations consume major resources and are extremely costly if they are to be sustained for every transit.
The danger would not come only from the air or the shore.
Iran could also rely on swarms…
The UK unveils Sovereign AI, a £500M fund to invest in domestic AI startups, starting with Callosum, which builds software to help different chips work together (Joel Khalili/Wired)
https://www.wired.com/story/the-uk-launches-its-dollar675-million-soverei…
"OER as Dynamic Digital Commons: Toward Maintenance and Governance"
https://doi.org/10.31274/jlsc.20076
"Academic libraries have been instrumental in supporting the creation and adoption of open educational resources (
Thousands of non-Mexican migrants deported from the US
are being left in southern Mexican cities far from the border,
often without legal status or resources.
Many are elderly, ill, or have lived most of their lives in the US,
and now face unsafe conditions and limited access to aid.
Human rights advocates warn the policy violates international protections and leaves deportees in a 'quasi-stateless limbo.'
Cases include a trans woman deported to …
Amazon begins three-hour deliveries in ~2K US cities and towns and one-hour deliveries in hundreds of those areas, after 2025 pilots; 90K products are eligible (Annie Palmer/CNBC)
https://www.cnbc.com/2026/03/17/amazon-rolls-out-1-ho…
Why Hasn’t Trump Mentioned Iran’s Oil?
Usually he encourages the seizure of natural resources as repayment for war
https://www.theatlantic.com/national-security/2026/03/trump-oil-iran-venezuela/686271/
Sources: Intel has begun testing production of "low-end/legacy iPhone, iPad, and Mac processors"; Apple thinks TSMC's resources will continue tilting toward AI (@mingchikuo)
https://x.com/mingchikuo/status/2054987772289810884
Since Trump’s decision to snatch Maduro in January and reboot relations with his successors,
the five-star hotel has become the nerve centre of Washington’s efforts to steer a country some now call a US protectorate
– and which Trump has even said he hopes to turn into the 51st state.
“It’s [effectively] the US embassy.
I don’t think anybody’s going to work at the actual embassy,”
said Phil Gunson, a Caracas-based political analyst for Crisis Group.
The condition of the economy of the Russian Federation is now more complicated than in recent years
Due to a strong ruble, high interest rates, shortage of labor resources and budget restrictions,
Reserves in the economy are largely exhausted,
said the Minister of Economic Development of the Russian Federation Maxim Reshetnikov,
speaking on Friday at the All-Russian Forum of Entrepreneurship Support Infrastructure "My Business" in Vsevolozhsk.