Conduent Data Breach Notification Letters Sent to Millions
as Ransomware Group Claims
⚡️ 8 Terabytes Stolen in One of the Largest U.S. Incidents.
Letters began reaching affected individuals this month detailing a major data breach at #Conduent Business Services, LLC,
a government technology contractor that processes payments, healthcare claims, and back-office services for clients n…
The United States has instructed its diplomats to lobby against foreign governments seeking to tighten control over how American technology companies handle citizens' data, according to an internal diplomatic cable seen by Reuters.
The built-in database backup and restore feature in Kitten¹ (that actually works and is in the Kitten Settings section of every Kitten app) just saved my ass (again) :)
Thank you, past me ;)
¹ https://kitten.small-web.org
arxiv_citation: arXiv citation networks (1993-2003)
Citations among papers posted on arxiv.org under the hep-ph and hep-th categories, between 1993 and 2003. This period begins a few months after arXiv was launched. If a paper i cites a paper j also in this data set, then a directed edge connects i to j. (Papers not in the data set are excluded.) These data were originally released as part of the 2003 KDD Cup.
This network has 27770 nodes and 352807 edges.
Tags: Informational,…
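The edge rule in the description (an edge from i to j only when both papers are in the data set) can be sketched as follows. This is a minimal illustration, not the dataset's actual loader: the function name `build_citation_graph`, the paper IDs, and the raw citation pairs are made up for the example.

```python
def build_citation_graph(papers, citations):
    """Return adjacency sets for a directed citation network.

    papers    -- iterable of paper IDs that belong to the data set
    citations -- iterable of (citing, cited) pairs, possibly referring
                 to papers outside the data set
    """
    in_set = set(papers)
    graph = {p: set() for p in in_set}
    for citing, cited in citations:
        # Keep the edge only if both endpoints are in the data set.
        if citing in in_set and cited in in_set:
            graph[citing].add(cited)
    return graph


# Hypothetical toy input: paper "D" is not part of the data set.
papers = ["A", "B", "C"]
citations = [("A", "B"), ("A", "D"), ("C", "A"), ("D", "B")]

graph = build_citation_graph(papers, citations)
num_nodes = len(graph)
num_edges = sum(len(targets) for targets in graph.values())
print(num_nodes, num_edges)
```

Storing each node's outgoing citations as a set keeps the graph directed and silently ignores repeated citation pairs.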
It is clear Trump now has another platform, after X, to control people in the US and feed them disinformation. They complement each other nicely with regard to age groups etc. If FB, WhatsApp, and LinkedIn also tighten their "moderation" with regard to anti-Trump sentiments, an Orwellian picture emerges.
https://www.
TikTok US says it is working to restore services after a power outage at a data center and any algorithm changes users noticed were likely due to the outage (Dan Whateley/Business Insider)
https://www.businessinsider.com/tiktok-outage-data-cent…
“Big Tech’s AI hype is distracting users from the rapid and dangerous expansion of giant, energy and water-intensive data centres […].
There is simply no evidence that AI will help the climate more than it will harm it.
Rather than relying on credible and substantiated data, Big Tech companies are writing themselves a blank cheque to pollute on the empty promise of future salvation. We cannot bet the climate on these baseless claims.”
"Is academic research becoming too competitive?
Nature examines the data. Applications for European research grants increased in 2025. Scientists say they’re feeling the competition."
Sobering numbers from funding institutions in Nature
https://www.nature.com/articles/d41586-025
On Website Technicals (2026-01) - Tech updates: mail chatter, new energy series, SSES peak tweak, Zenodo, data correction, anti-herding, parallelisation, order, coverage policy... - https://m.earth.org.uk/note-on-site-technicals-104.html
ProxyFL: A Proxy-Guided Framework for Federated Semi-Supervised Learning
Duowen Chen, Yan Wang
https://arxiv.org/abs/2602.21078 https://arxiv.org/pdf/2602.21078 https://arxiv.org/html/2602.21078
arXiv:2602.21078v1 Announce Type: new
Abstract: Federated Semi-Supervised Learning (FSSL) aims to collaboratively train a global model across clients by leveraging partially-annotated local data in a privacy-preserving manner. In FSSL, data heterogeneity is a challenging issue, which exists both across clients and within clients. External heterogeneity refers to the data distribution discrepancy across different clients, while internal heterogeneity represents the mismatch between labeled and unlabeled data within clients. Most FSSL methods typically design fixed or dynamic parameter aggregation strategies to collect client knowledge on the server (external) and / or filter out low-confidence unlabeled samples to reduce mistakes in local client (internal). But, the former is hard to precisely fit the ideal global distribution via direct weights, and the latter results in fewer data participation into FL training. To this end, we propose a proxy-guided framework called ProxyFL that focuses on simultaneously mitigating external and internal heterogeneity via a unified proxy. I.e., we consider the learnable weights of classifier as proxy to simulate the category distribution both locally and globally. For external, we explicitly optimize global proxy against outliers instead of direct weights; for internal, we re-include the discarded samples into training by a positive-negative proxy pool to mitigate the impact of potentially-incorrect pseudo-labels. Insight experiments & theoretical analysis show our significant performance and convergence in FSSL.
toXiv_bot_toot
My (Austrian) bank's online interface is rubbish (unless you download your history data every 3 months, you lose it), but they do have other features I don't want to miss (online debit card use with a TAN generator rather than an app).
Given that undesired parties (such as Klarna) apparently can use access to bank accounts, are there actually useful fintechs that, for example, just provide sensible data exports on the user's behalf?
🏗️ Problem 4: Architecture decay. Classes become dumping grounds for random data: $user->cached_data, $user->temp_flag, $user->random_stuff. Code becomes unmaintainable.
✅ Modern PHP solution: Declare all properties explicitly with types. Use public string $name; public int $age; public bool $isActive; in your class definitions.
Authoritarian governments don't build surveillance systems from scratch—they weaponize existing infrastructure.
The US has a history of illegal surveillance: COINTELPRO, Carnivore, PRISM, warrantless wiretapping. Tech companies too willingly comply with government data requests.
Once your data leaves your control, it can be accessed by any administration with enough legal pressure—or disregard for it.
Now is a good time to rethink smart home devices and reduce your digi…
All 15 #Rotorblätter (rotor blades) for the #Windpark Sundern#Allendorf wind farm have been delivered after a transport period of around four months.
The nighttime heavy-load transports along the 19.5-kilometre route attracted great public interest. Weather-related delays made flexible planning…
sp_baboons: Baboons' interactions (2020)
Network of interactions between a group of 20 Guinea baboons living in an enclosure of a Primate Center in France, between June 13th 2019 and July 10th 2019. The data set contains observational and wearable sensors data.
This network has 23 nodes and 3197 edges.
Tags: Social, Animal, Offline, Unweighted, Weighted, Temporal, Metadata
I'm sure algorithms designed by people with biases and an agenda, based on data from people with biases and an agenda, marketed by people with biases and an agenda and used by people with biases and an agenda will act perfectly neutrally!
Anyway here's a good read on epistemic vigilance in [commercial] LLMs (tl;dr it's not possible) https://bsky.app/profile/mjcrockett.bsky.social/post/3mfrbukoy5c2s
For more than a decade, Russia’s so-called #probiv market – a term derived from the verb “to pierce” or “to punch into a search bar” – has operated as a parallel information economy built on a network of corrupt officials, traffic police, bank employees and low-level security staff willing to sell access to restricted government or corporate databases.
While l…
Power prices surge in Virginia, home to the world's largest data center hub; record demand is expected during the winter storm, partly due to data center needs (Tim McLaughlin/Reuters)
https://www.reuters.com/business/energy/po
🥳 New Kitten¹ release
• Added `initialise()` hook to `kitten.Component` instances.
This gets called at the end of the constructor and is handy if you don’t want to override the constructor and have to handle the `data` parameter and remember to call `super(data)`. You can still access passed data from `this.data`.
Note that the component is not part of the view hierarchy on the client at this point. If you have tasks you need to perform only once per page – for example, ins…
twitter_higgs: Twitter, Higgs boson (2012)
Data on tweets related to the announcement of the discovery of a new fundamental particle with the features of the Higgs boson on 4th July 2012. Data covers 1-7 July 2012, and includes four types of networks: followers, retweets, replies, and mentions.
This network has 256491 nodes and 328132 edges.
Tags: Social, Online, Weighted, Multilayer
Localized Dynamics-Aware Domain Adaption for Off-Dynamics Offline Reinforcement Learning
Zhangjie Xia, Yu Yang, Pan Xu
https://arxiv.org/abs/2602.21072 https://arxiv.org/pdf/2602.21072 https://arxiv.org/html/2602.21072
arXiv:2602.21072v1 Announce Type: new
Abstract: Off-dynamics offline reinforcement learning (RL) aims to learn a policy for a target domain using limited target data and abundant source data collected under different transition dynamics. Existing methods typically address dynamics mismatch either globally over the state space or via pointwise data filtering; these approaches can miss localized cross-domain similarities or incur high computational cost. We propose Localized Dynamics-Aware Domain Adaptation (LoDADA), which exploits localized dynamics mismatch to better reuse source data. LoDADA clusters transitions from source and target datasets and estimates cluster-level dynamics discrepancy via domain discrimination. Source transitions from clusters with small discrepancy are retained, while those from clusters with large discrepancy are filtered out. This yields a fine-grained and scalable data selection strategy that avoids overly coarse global assumptions and expensive per-sample filtering. We provide theoretical insights and extensive experiments across environments with diverse global and local dynamics shifts. Results show that LoDADA consistently outperforms state-of-the-art off-dynamics offline RL methods by better leveraging localized distribution mismatch.
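The cluster-then-filter idea in the abstract can be sketched roughly as below. This is not the authors' implementation: the cluster assignments and the domain discriminator's outputs are taken as given inputs, and the discrepancy score (mean absolute log-odds of the discriminator within a cluster), the threshold, and all names are assumptions made for illustration.

```python
import math
from collections import defaultdict


def cluster_discrepancy(cluster_ids, p_target):
    """Mean |log-odds| of the discriminator per cluster (assumed score).

    p_target[i] is a hypothetical discriminator's probability that
    transition i comes from the target domain; values near 0.5 mean the
    domains are hard to tell apart, i.e. locally similar dynamics.
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    for c, p in zip(cluster_ids, p_target):
        p = min(max(p, 1e-6), 1 - 1e-6)  # avoid log(0)
        totals[c] += abs(math.log(p / (1 - p)))
        counts[c] += 1
    return {c: totals[c] / counts[c] for c in totals}


def filter_source(source_cluster_ids, discrepancy, threshold):
    """Indices of source transitions that fall in low-discrepancy clusters."""
    return [i for i, c in enumerate(source_cluster_ids)
            if discrepancy.get(c, math.inf) <= threshold]


# Toy data: cluster 0 looks alike across domains, cluster 1 does not.
cluster_ids = [0, 0, 0, 1, 1, 1]
p_target = [0.5, 0.55, 0.45, 0.02, 0.97, 0.03]
disc = cluster_discrepancy(cluster_ids, p_target)

# Cluster IDs of four hypothetical source transitions.
kept = filter_source([0, 1, 0, 1], disc, threshold=1.0)
print(kept)
```

Scoring whole clusters rather than single transitions is what makes the selection coarser than per-sample filtering but finer than one global decision.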
South Carolina doesn’t require hospitals to report when they admit patients with measles-related illnesses.
Available data shows that only 2% of the state’s measles cases have resulted in hospitalizations. Some infectious disease experts fear significant underreporting.
Some doctors say they lack information about the severity of measles complications as it spreads around them.
twitter_higgs: Twitter, Higgs boson (2012)
Data on tweets related to the announcement of the discovery of a new fundamental particle with the features of the Higgs boson on 4th July 2012. Data covers 1-7 July 2012, and includes four types of networks: followers, retweets, replies, and mentions.
This network has 116408 nodes and 150818 edges.
Tags: Social, Online, Weighted, Multilayer
Digital Realty, QTS, and NTT Data warn the data center industry is doing a poor job of combating local opposition; 24 US projects were blocked in January alone (Rafe Rosner-Uddin/Financial Times)
https://www.ft.com/content/f45d45fc-c0ea-463c-8edf-38fb99ed5c05
UrbanFM: Scaling Urban Spatio-Temporal Foundation Models
Wei Chen, Yuqian Wu, Junle Chen, Xiaofang Zhou, Yuxuan Liang
https://arxiv.org/abs/2602.20677 https://arxiv.org/pdf/2602.20677 https://arxiv.org/html/2602.20677
arXiv:2602.20677v1 Announce Type: new
Abstract: Urban systems, as dynamic complex systems, continuously generate spatio-temporal data streams that encode the fundamental laws of human mobility and city evolution. While AI for Science has witnessed the transformative power of foundation models in disciplines like genomics and meteorology, urban computing remains fragmented due to "scenario-specific" models, which are overfitted to specific regions or tasks, hindering their generalizability. To bridge this gap and advance spatio-temporal foundation models for urban systems, we adopt scaling as the central perspective and systematically investigate two key questions: what to scale and how to scale. Grounded in first-principles analysis, we identify three critical dimensions: heterogeneity, correlation, and dynamics, aligning these principles with the fundamental scientific properties of urban spatio-temporal data. Specifically, to address heterogeneity through data scaling, we construct WorldST. This billion-scale corpus standardizes diverse physical signals, such as traffic flow and speed, from over 100 global cities into a unified data format. To enable computation scaling for modeling correlations, we introduce the MiniST unit, a novel split mechanism that discretizes continuous spatio-temporal fields into learnable computational units to unify representations of grid-based and sensor-based observations. Finally, addressing dynamics via architecture scaling, we propose UrbanFM, a minimalist self-attention architecture designed with limited inductive biases to autonomously learn dynamic spatio-temporal dependencies from massive data. Furthermore, we establish EvalST, the largest-scale urban spatio-temporal benchmark to date. Extensive experiments demonstrate that UrbanFM achieves remarkable zero-shot generalization across unseen cities and tasks, marking a pivotal first step toward large-scale urban spatio-temporal foundation models.
citeseer: CiteSeer citations (2014)
Citations among papers indexed by the CiteSeer digital library. If a paper i cites a paper j also in this data set, then a directed edge connects i to j. (Papers not in the data set are excluded.) Self-loops may be present.
This network has 384413 nodes and 1751463 edges.
Tags: Informational, Citation, Unweighted
“So you have TikTok just smashing our young people’s brains all day long with video of carnage in Gaza … And this is why so many of us can’t have a sane conversation with younger Jews because anything that we try to say to them, they are hearing it through this wall of carnage. So I want to give data and information and facts and arguments and they are just seeing in their minds carnage and I sound obscene.”
– Former Obama speechwriter Sarah Hurwitz
“They trust their own eyes and…
Chinese government data: shipments of foreign-branded mobile phones, primarily iPhones, rose 128.4% YoY in November 2025 to 6.93M units (Reuters)
https://www.reuters.com/business/media-telecom/foreign-bran…
cora: CORA citations (1998)
Citations among papers indexed by CORA, from 1998, an early computer science research paper search engine. If a paper i cites a paper j also in this data set, then a directed edge connects i to j. (Papers not in the data set are excluded.) Self-loops may be present. The dates of these snapshots are uncertain.
This network has 23166 nodes and 91500 edges.
Tags: Informational, Citation, Unweighted
Apple says iPhone and iPad on iOS 26 and iPadOS 26 have become the first consumer devices NATO approved for use up to the "restricted" level of classified data (Elyse Betters Picaro/ZDNET)
https://www.zdnet.com/article/apple-iphone-ipad-nato-classified-sec…
product_space: Atlas of Economic Complexity export network
Two networks of economic products, where a pair of products are connected if they are exported at similar rates by the same countries. The data are a projection from a bipartite network of nations and the products they export. Edge weights represent a similarity score (called "proximity"). Data based on UN Comtrade worldwide trade patterns. SITC network based on the Standard International Trade Classification and HS …
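As a rough illustration of the projection described above, the sketch below computes a proximity score for product pairs from a bipartite country-to-product mapping. It assumes the commonly cited product-space definition, the minimum of the two conditional probabilities that a country exporting one product also exports the other. The description here does not spell out the formula, and the country and product names are invented.

```python
from itertools import combinations


def proximity(exports):
    """Project a country -> products mapping onto product pairs.

    exports -- dict mapping each country to the set of products it
               exports (a simplification: real data would first decide
               which exports count as significant).
    Returns {(p, q): score} for product pairs co-exported by at least
    one country.
    """
    exporters = {}
    for country, products in exports.items():
        for product in products:
            exporters.setdefault(product, set()).add(country)

    scores = {}
    for p, q in combinations(sorted(exporters), 2):
        both = len(exporters[p] & exporters[q])
        if both:
            # min(P(p | q), P(q | p)) equals both / max(count_p, count_q)
            scores[(p, q)] = both / max(len(exporters[p]), len(exporters[q]))
    return scores


# Invented toy data.
exports = {
    "Aland": {"shirts", "trousers"},
    "Bland": {"shirts", "trousers", "chips"},
    "Cland": {"chips"},
    "Dland": {"shirts"},
}
scores = proximity(exports)
print(scores)
```

Taking the minimum of the two conditional probabilities keeps the score symmetric, so the projected network can be undirected.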
Gambit Security: an unknown hacker used Claude to steal 150GB of Mexican government data, including 195M taxpayer records, in December 2025 and January 2026 (Bloomberg)
Diplomatic cable: US officials order diplomats to lobby against attempts to regulate how US tech companies handle foreigners' data, citing risks to AI services (Reuters)
https://www.reuters.com/sustainability/boa
A test of ChatGPT Health and Claude for Healthcare with data from Apple Health finds the chatbots provided questionable and inconsistent responses (Geoffrey A. Fowler/Washington Post)
Chainalysis and TRM Labs estimate that $2.7B was stolen in crypto in 2025 in total, up from $2.2B in 2024; the biggest hack was the $1.4B breach at Bybit (Lorenzo Franceschi-Bicchierai/TechCrunch)
https://techcrunch.com/2025/12/23/hackers-stole-…
Sam Altman says currently "the idea of putting data centers in space is ridiculous" and that it is "not something that's going to matter at scale this decade" (Lauren Edmonds/Business Insider)
https://www.businessinsider.com/sam-altman
Wikimedia data puts iOS 26 adoption at ~50% in January vs. iOS 18's 72% in 2025 as Apple slows auto-updates; Statcounter showed 15% after missing Safari changes (John Gruber/Daring Fireball)
https://daringfireball.net/2026/01/ios_26_adoption_rate_is_not_bizar…
Sources: SoftBank halts talks about a ~$50B acquisition of US data center operator Switch, a setback to Masayoshi Son's goal to roll out Stargate infrastructure (Bloomberg)
Nvidia reports Q4 revenue up 73% YoY to $68.13B, vs. $66.21B est., Data Center revenue up 75% to $62.3B, and forecasts Q1 revenue above estimates (Ian King/Bloomberg)
https://www.bloomberg.com/news/articles/2026-02-25/nvidia-…
The UK ICO fines Reddit £14.47M for unlawfully using children's personal information; Reddit began verifying user ages in July 2025 to comply with the UK OSA (Tom Singleton/BBC)
https://www.bbc.com/news/articles/cwyx0xggepjo
SOTU: Trump says he told tech companies they must build their own power plants for their data centers; sources: the WH expects to formalize the effort in March (Richard Valdmanis/Reuters)
https://www.reuters.com/business/energy/tr
Indian IT services provider Coforge agrees to acquire Encora, which offers AI tools for product, cloud, and data engineering, at an enterprise value of $2.35B (Reuters)
https://www.reuters.com/world/india/indias-coforge-acquire-us-based…