2026-01-24 00:46:54
FPF Releases an Updated Issue Brief on Vietnam’s Law on Protection of Personal Data and the Law on Data
https://fpf.org/blog/fpf-releases-updated-issue-brief-on-vietnams-law-on-protection-of-personal-data-and-the-law-on…
FPF Releases an Updated Issue Brief on Vietnam’s Law on Protection of Personal Data and the Law on Data
https://fpf.org/blog/fpf-releases-updated-issue-brief-on-vietnams-law-on-protection-of-personal-data-and-the-law-on…
Getting serious about data breaches in Kazakhstan
January 23: Kazakhstan Moves to Criminalize Mass Data Breaches
https://meyka.com/blog/january-23-kazakhstan-moves-to-criminalize-mass-data-breaches-2301/
Amazon plans to invest $12B in new data centers in Louisiana and says it worked with the local utility "to ensure we pay 100% of the costs" tied to the campus (Annie Palmer/CNBC)
https://www.cnbc.com/2026/02/23/amazon-louisiana-ai-data-centers.html
“Disease combinations are often unique – anonymization of health data is therefore particularly complex” An interview on reconstruction risks by @…: https://www.
Weekend Reads
* IRR data quality
https://labs.ripe.net/author/tobias-striffler/the-irr-landscape-data-quality-the-good-the-bad-and-the-outdated/
* Roy Arends on DNSSEC
AI company which trains its models on stolen data accuses other AI companies of stealing data 🤣
Anthropic Accuses Chinese Companies of Siphoning Data From Claude - Slashdot https://slashdot.org/story/26/02/23/1810225/anthropic-acc…
🥳 New Kitten¹ release
• Added `initialise()` hook to `kitten.Component` instances.
This gets called at the end of the constructor and is handy if you don’t want to override the constructor and have to handle the `data` parameter and remember to call `super(data)`. You can still access passed data from `this.data`.
Note that the component is not part of the view hierarchy on the client at this point. If you have tasks you need to perform only once per page – for example, ins…
For those who missed how the Coupang data breach in Korea became a flat-out crisis, the Korea Economic Institute of America has produced this nifty timeline.
https://keia.org/the-peninsula/the-coupang-data-breach-a-timeline/
A look at the top data journalism projects of 2025, including stories on conflict, climate, AI, and social media and disinformation (Hanna Duggal/Global Investigative Journalism Network)
https://gijn.org/stories/2025-editors-picks-data-journalism/
Reddit has been fined £14.47m by the UK's data watchdog for unlawfully using children's personal information.
#reddit
Dems eyeing 2028 are retreating from AI data centers (Axios)
https://www.axios.com/2026/02/22/democrats-2028-retreat-ai-data-centers
http://www.memeorandum.com/260222/p57#a260222p57
On 16WW Mains Inlet Water Temperature - Domestic mains water temperature data for 16WW on tap; seasonal min/max about 10C/20C in winter/summer. #dataset #water #temperature -
Nimble, whose AI agents structure real-time web data into tables that can be queried like a database, raised a $47M Series B, bringing its total funding to $75M (Ram Iyer/TechCrunch)
https://techcrunch.com/2026/02/24/nimble-way-raises-47m-to-giv…
Global data protection authorities warn generative AI companies against replicating real people https://therecord.media/data-protection-authorities-warn-ai-companies-of-sharing-images
I don’t think people understand how bad this is going to get.
https://www.tomshardware.com/pc-components/ram/data-centers-will-consume-70-percent-of-memory-chips-made-in-2026-supply-shortfall-will-cause-the-chip-shortage-to-spread-to-other-segments
twitter_higgs: Twitter, Higgs boson (2012)
Data on tweets related to the announcement of the discovery of a new fundamental particle with the features of the Higgs boson on 4th July 2012. Data covers 1-7 July 2012, and includes four types of networks: followers, retweets, replies, and mentions.
This network has 38918 nodes and 32523 edges.
Tags: Social, Online, Weighted, Multilayer
マインスイーパー面白いよねえ。
ソリティアもよくやってた。
最近はナンプレ(数独)が好き。
これとか、中級ぐらいの解きやすい良問で楽しかったから、みんなやってみ。
https://misskey.flowers/notes/ahvaij022u6h010n
“The core idea is that your conversations with an AI assistant should be as private as your conversations with a person. Not because you’re doing something wrong, but because privacy is what lets you think freely.”
Moxie Marlinspike, Confessions to a data lake https://confer.to/blog/2025/12/confess
Innovation and Data Privacy Are Not Natural Enemies: Insights from Korea’s Experience
https://fpf.org/blog/innovation-and-data-privacy-are-not-natural-enemies-insights-from-koreas-experience/
@…
Under the Trump administration, immigration and border agencies have received an unprecedented windfall of nearly
❌ $170 billion in new funding,
but their operations have also come at a major cost to the taxpayer, according to newly released data.
The cost of a single enforced deportation is 💥$18,245,
the Department of Homeland Security announced on Wednesday. (Last year, the figure was just over $17,009).
The agency announced the statistics while touting its
A pictures says more than 1000 words. How much more can an audio representation of your data tell you? #rstats
DATA: Spojení s menšími SPD pomohlo, účel splnilo i Spolu | @… - spolehlivé zpršvy
https://www.irozh…
Complaint by Coupang's U.S. investors may turn data leak probe into trade flash point
https://koreajoongangdaily.joins.com/news/2026-01-23/business/tech/Compl…
Microsoft confirms it does provide BitLocker recovery keys for encrypted data if it receives a valid legal order and the user has stored the keys on its servers (Thomas Brewster/Forbes)
http://www.forbes.com/sites/thomasbrewste
National survey finds microplastic pollution around Britain's coastline could be double than previously recorded https://phys.org/news/2026-02-national-survey-microplastic-pollution-britain.html
#Klutshnik v0.4.1 is out.
Klutshnik is a Key Mgmnt Service 4 data-at-rest. Keys r stored in a threshold setup& r never reconstructed only used in operations that hide their values. These keys r cheaply&securely updatable without reencrypting the encrypted data, providing forward-secrecy&post-compromise security. Klutshnik servers can use TLS, USB or BLE.
"Colombia poised for another drop in deforestation in 2025, data show"
#Colombia #Deforestation #Environment
Google Is Spending Over $4 Billion on a Data Center Company
https://gizmodo.com/google-is-spending-over-4-billion-on-a-data-center-company-2000702751
How often do NFL teams get coaching hires right? What 23 years of data tells us https://www.nytimes.com/athletic/7056940/2026/02/24/nfl-head-coach-hire-success-rate/
Turns out that Microsoft's BitLocker security for the data stored on your hard drive is just a placebo.
Might as well give your password to everyone:
https://techcrunch.com/2026/01/23/microsoft-gave-fbi-a-s…
The Trump administration admits even more ways DOGE accessed sensitive personal data (NPR)
https://www.npr.org/2026/01/23/nx-s1-5684185/doge-data-social-security-privacy
http://www.memeorandum.com/260123/p30#a260123p30
Ca ii K Polar Network Index of the Sun - A Proxy for Historical Polar Magnetic Field: #SolarCycle activity: https://www.swri.org/newsroom/press-releases/using-100-year-old-data-help-predict-future-solar-cycle-activity
Auch wir vom Team Troet.Cafe sprechen unsere ungebrochene Solidarität mit @… aus, dessen Leitung momentan von der U.S. Regierung mit Sanktionen angegriffen wird.
Der Präsident des Internationalen Strafgerichtshof erhielt auch eine Sanktion dieser Art, für ihn hieß das: Einreiseverbot, Schließung seiner Bankkonten, Schließu…
High-dimensional data has this property: it is extremely unlikely that there will be a data point situated at the exact center.
It’s the high dimensionality that’s important here. One person might be at some sort of average on •one• dimension, but for them to be at the average on •all• dimensions grows exponentially less likely as the number of dimensions increases.
It’s like trying to roll all threes with a set of dice. Odds of that with one die? 1 in 6. Odds with two dice? 1 in 36. Odds with 10 dice? 1 in ~60 million.
4/
RE: https://mastodon.social/@tomw/116125237989075874
Very important point!
Even if you keep those AI agents out of your data you are making everyone else's digital life and experience worse.
Microsoft provided the FBI with the recovery keys to unlock encrypted data on the hard drives of three laptops as part of a federal investigation, Forbes reported on Friday.
Many modern Windows computers rely on full-disk encryption, called #BitLocker, which is enabled by default.
This type of technology should prevent anyone except the device owner from accessing the data if the computer is …
Chainalysis and TRM Labs estimate that $2.7B was stolen in crypto in 2025 in total, up from $2.2B in 2024; the biggest hack was the $1.4B breach at Bybit (Lorenzo Franceschi-Bicchierai/TechCrunch)
https://techcrunch.com/2025/12/23/hackers-stole-…
Microsoft Gave FBI Keys To Unlock Encrypted Data, Exposing Major Privacy Flaw (Thomas Brewster/Forbes)
https://www.forbes.com/sites/thomasbrewster/2026/01/22/microsoft-gave-fbi-keys-to-unlock-bitlocker-encrypted-data/
http://www.memeorandum.com/260123/p25#a260123p25
FPF releases issue brief on Vietnam’s Law on Protection of Personal Data and the Law on Data
https://fpf.org/blog/fpf-releases-issue-brief-on-vietnams-law-on-protection-of-personal-data-and-the-law-on-data/
The Shinhan Card data breach has exposed the personal information of approximately 192,000 card merchants, the South Korea–based financial services company confirmed
https://thecyberexpress.com/shinhan-card-data-breach/amp/
Do you need inspiration how to present a dataset in a clear figure and what package to use? Check out #rstats
Turkey launches a review of how social media platforms handle children's data as it prepares new rules that include identity verification and age restrictions (Turkish Minute)
https://www.turkishminute.com/2026/02/21/turkey…
WAN-IFRA: news publications globally had an estimated combined revenue of $125.7B in 2025, down 0.01% YoY; print still accounted for 65% of total revenue (Charlotte Tobitt/Press Gazette)
https://pressgazette.co…
On 16WW Data Collections and Graphs - Open for research home #dataset - https://m.earth.org.uk/note-on-data.html
A recent study conducted by Crosstown LA, examining vehicular collisions in Los Angeles intersections,
found that some are far more perilous than others.
According to the numbers, the intersection at Figueroa and Slauson is the most dangerous in the city.
The study utilized data collected by the Los Angeles Police Department
Per the data, between 2021 and 2025 alone, that intersection was subject to some 66 traffic accidents.
The intersection of Sepulveda and R…
My sister used to teach courses for the University of Phoenix. Ugh.
University of Phoenix data breach impacts nearly 3.5 million individuals
https://www.bleepingcomputer.com/news/security/university-of-phoenix-data-…
PayPal Data Breach Exposes SSNs and Business PII of Customers for Over Six Months
https://cybersecuritynews.com/paypal-data-breach-expose-customer-data/
sp_high_school: High school temporal contacts (2013)
These data sets correspond to the contacts and friendship relations between students in a high school in Marseilles, France, in December 2013, as measured through several techniques.
This network has 329 nodes and 1437 edges.
Tags: Social, Offline, Unweighted, Weighted, Temporal, Metadata
Using the Browser’s <canvas> for Data Compression
When building static websites and Single-Page Applications (SPAs), we sometimes need functionality in JavaScript front ends—such as compression—that is usually handled on the back end instead. […]
🔄 https://jstrieb.github.io/posts/canvas
Gift article
Anthropic Accuses Chinese Companies of Siphoning Data From Claude
https://www.wsj.com/tech/ai/anthropic-accuses-chinese-companies-of-siphoning-data-from-claude-63a13afc?s…
Sam Altman says currently "the idea of putting data centers in space is ridiculous" and that it is "not something that's going to matter at scale this decade" (Lauren Edmonds/Business Insider)
https://www.businessinsider.com/sam-altman
If you just need a pretty figure from a dataset and not the full power of R, have a look at #gui
Climate scientists and meteorologists
are sounding the alarm
after White House budget director Russell Vought announced
the Trump administration will
❌break up the National Center for Atmospheric Research in Boulder, Colorado, known as NCAR.
“He is executing the playbook of Project 2025,” says Michael Mann,
scientist and co-author of
"Science Under Siege".
Without NCAR, “we will not have the sorts of observational data and climate m…
On 16WW Data Collections and Graphs - Open for research #dataset - https://m.earth.org.uk/note-on-data.html
Alphabet agrees to acquire clean energy developer Intersect for $4.75B in cash, and existing debt, as part of its push to dramatically expand AI data centers (Bloomberg)
https://www.bloomberg.com/news/articles/2025-12-22/alpha…
U.S. murders on pace for largest one-year drop on record (Julianna Bragg/Axios)
https://www.axios.com/2025/12/24/us-trump-murder-data-killing-crime-national-guard
http://www.memeorandum.com/251224/p9#a251224p9
Before you head out for the weekend, don't miss today's Metacurity for the crucial cybersecurity developments you should know, including
--A database with 149 million usernames and passwords was exposed on the internet,
--Venezuelan nationals who stole cash from ATMs using malware will be deported from US,
--FBI asked Microsoft to unlock encrypted laptops,
--Under Armour is investigating massive data breach,
--Tech investors want the US government to prob…
twitter_higgs: Twitter, Higgs boson (2012)
Data on tweets related to the announcement of the discovery of a new fundamental particle with the features of the Higgs boson on 4th July 2012. Data covers 1-7 July 2012, and includes four types of networks: followers, retweets, replies, and mentions.
This network has 256491 nodes and 328132 edges.
Tags: Social, Online, Weighted, Multilayer
The Age-Verification Trap — Verifying user’s ages undermines everyone’s data protection
Social media is going the way of alcohol, gambling, and other social sins: Societies are deciding it’s no longer kid stuff. Lawmakers point to compulsive use, exposure to harmful content, and mounting concerns about adolescent mental health. So, many propose to set a minimum age, usually 13 or 16.
🕵️
The UK ICO fines Reddit £14.47M for unlawfully using children's personal information; Reddit began verifying user ages in July 2025 to comply with the UK OSA (Tom Singleton/BBC)
https://www.bbc.com/news/articles/cwyx0xggepjo
twitter_higgs: Twitter, Higgs boson (2012)
Data on tweets related to the announcement of the discovery of a new fundamental particle with the features of the Higgs boson on 4th July 2012. Data covers 1-7 July 2012, and includes four types of networks: followers, retweets, replies, and mentions.
This network has 304691 nodes and 563069 edges.
Tags: Social, Online, Weighted, Multilayer
TikTok users in the US were presented with a new privacy policy; the changes were part of the app's ownership transition and now allow precise location tracking (Reece Rogers/Wired)
https://www.wired.com/story/tiktok-new-privacy-policy/
Korean police arrested two high school students for the 2024 hack of Seoul’s public bicycle service “Ttareungyi”
https://www.chosun.com/english/national-en/2026/02/23/X4AOVZXXORHNLI5P5BOHMRZQRY/
Advancing American Freedom Hires Former Heritage Foundation Legal, Economic and Data Directors (Julia Jensen/Advancing American Freedom)
https://advancingamericanfreedom.com/advancing-american-freedom-hires-former-heritage-foundation-legal-economic-and-data-directors/
http://www.memeorandum.com/251222/p93#a251222p93
Bitcoin miners retooling data centers for AI have boosted their stocks; the CoinShares Bitcoin Mining ETF is up about 90% YTD even as bitcoin has slumped (Vicky Ge Huang/Wall Street Journal)
https://www.wsj…
dnc: DNC emails (2016)
A network representing the exchange of emails among members of the Democratic National Committee, in the email data leak released by WikiLeaks in 2016.
This network has 2029 nodes and 12085 edges.
Tags: Social, Communication, Unweighted, Multigraph
https://networks.skewed.de/net/dnc
OpenAI scrambled for compute power after the Stargate project stalled; sources: OpenAI still plans to build its own data centers, but not in the near future (Anissa Gardizy/The Information)
https://www.theinformation.com/articles/inside-op…
The Canada Pension Plan Board and Australia's Goodman agree to invest $2.6B in data centers in Frankfurt, Amsterdam, and Paris, starting to build in June 2026 (Angus Whitley/Bloomberg)
https://www.bloomberg.com/news/articles/20
faa_routes: FAA Preferred Routes (2010)
A network of air traffic routes, from the FAA (Federal Aviation Administration) National Flight Data Center (NFDC) preferred routes database (www.fly.faa.gov). Date of extraction is prior to 2010. Nodes represent airports or service centers, and a directed edge is the preferred route between airport i and airport j.
This network has 1226 nodes and 2615 edges.
Tags: Transportation, Airport, Unweighted
So many publications are reporting RansomHouse's claim that it had hacked a key Apple contractor, Luxshare. This morning I saw at least twelve publications reporting, so I posted about it too.
But as HackRead notes, it's just a claim by a criminal group with no confirmation. "Nevertheless, until Luxshare confirms an incident or the attackers release verifiable data, the claim remains just that."
I really wish cyber and tech publications would stop doing this --…
crime: Rosenfeld crime network (1991)
A network of associations among suspects, victims, and/or witnesses involved in crimes in St. Louis in the 1990s. Data are derived from police records, via snowball sampling from five initial homicides. Left nodes are people, right nodes are crime events, and edges connect people to particular crimes events they were associated with. Metadata includes names, genders, and roles (suspects, victims, and/or witnesses).
This network has 1380 nodes…
Lightning AI merges with data center operator Voltage Park to create an "AI cloud" with a $2.5B valuation, managing 35,000 Nvidia GPUs across six data centers (Iain Martin/Forbes)
https://www.forbes.com/sites/iainmartin/20
Apple wins dismissal of parts of a class action alleging it violated CA privacy law by collecting user data from its apps despite users believing they opted out (Isaiah Poritz/Bloomberg Law)
https://news.bloomberglaw.com/tech-and
citeseer: CiteSeer citations (2014)
Citations among papers indexed by the CiteSeer digital library. If a paper i cites a paper j also in this data set, then a directed edge connects i to j. (Papers not in the data set are excluded.) Self-loops may be present.
This network has 384413 nodes and 1751463 edges.
Tags: Informational, Citation, Unweighted
A scam website is actively approaching victims of the Odido data breach in the Netherlands, inviting them to join a mass claim against the telecom company for a one-off payment of €50.
https://nltimes.nl/2026/02/20/fake-sit
add_health: Adolescent health (ADD HEALTH) (1994)
A directed network of friendships obtained through a social survey of high school students in 1994. The ADD HEALTH data are constructed from the in-school questionnaire; 90,118 students representing 84 communities took this survey in 1994-95. Some communities had only one school; others had two. Where there are two schools in a community students from one school were allowed to name friends in the other, the "sister school".
How tech companies, including Meta and OpenAI, are building data centers with their own private power plants, a risky bet that will increase carbon emissions (Evan Halper/Washington Post)
https://www.washingtonpost.com/business/2026/02/19/data-centers-power-gri…
The Fulu Foundation, a group founded by repair advocate and YouTuber Louis Rossmann, which pays out bounties to people who can remove user-hostile features on connected devices, is now offering a potential payout of $10,000 to encourage hackers and tinkerers to disable software features that require Ring devices to send data to Amazon.
law_firm: Lazega law firm network
Multiplex network with 3 edge types representing relationships (coworkers, friendship, advice) between partners and associates of a corporate law firm. Data hosted by Manlio De Domenico.
This network has 71 nodes and 2571 edges.
Tags: Social, Offline, Multilayer, Unweighted
https://net…
More than a million people in France hit by bank account data breach
https://www.connexionfrance.com/practical/more-than-a-million-people-in-france-hit-by-bank-account-data-breach/771322
pokec: Pokec online social network (2012)
The online social network of Pokec, a popular OSN in Slovakia, from 2012. Date covers about 10 years and more than 1.6 million people. Profile data contains gender, age, hobbies, interest, education etc. Profile metadata are in Slovak language. Friendships in Pokec are oriented.
This network has 1632804 nodes and 30622564 edges.
Tags: Social, Online, Metadata
S&P Global: data center deals hit $61B globally in 2025; debt issuance nearly doubled YoY to $182B, with Meta raising $62B debt since 2022, ~50% of that in 2025 (April Roach/CNBC)
https://www.cnbc.com/2025/12/19/data-center-deals-…
corporate_directors: Global corporate directors (2016)
Bipartite network of directors and the companies on whose boards they sit, spanning 54 countries worldwide, constructed from data collected by the Financial Times (c. Sept. 2016). Person nodes are annotated with age and gender. Company nodes are annotated with their country, sector, industry, and number of employees.
This network has 356638 nodes and 377060 edges.
Tags: Economic, Governance, Unweighted, Metadata
<…
cora: CORA citations (1998)
Citations among papers indexed by CORA, from 1998, an early computer science research paper search engine. If a paper i cites a paper j also in this data set, then a directed edge connects i to j. (Papers not in the data set are excluded.) Self-loops may be present. The dates of these snapshots are uncertain.
This network has 23166 nodes and 91500 edges.
Tags: Informational, Citation, Unweighted
Don't miss today's packed Metacurity for the most critical infosec developments you need to know, including
--DOGE workers shared SSN data with outsiders, derailed DISA operations,
--UK launches national fraud reporting service,
--China blames Taiwan for cyberattacks,
--EU proposes freezing out Chinese tech suppliers,
--New Zealand launches Manage My Health breach probe,
--Curl ends its bug bounty program due to AI flood,
--Cloudflare fixes WAF…
eu_procurements_alt: EU national procurement networks (2008-2016)
These 234 networks represent the annual national public procurement markets of 26 European countries from 2008-2016, inclusive. Data is sourced from Tenders Electronic Daily (TED), the official procurement portal of the European Union.
This network has 5038 nodes and 6325 edges.
Tags: Economic, Commerce, Weighted, Temporal
South Korean trade data: chip exports rose 134% YoY, while computer peripherals rose 129% in the first 20 days of February, extending gains driven by AI demand (Heesu Lee/Bloomberg)
https://www.bloomberg.com/news/articles/202…
high_tech_company: Krackhardt high tech company network
Multiplex network of 3 edge types representing relationships (advice, friendship, and “reports to”) between managers of a high-tech company. Data hosted by Manlio De Domenico.
This network has 21 nodes and 312 edges.
Tags: Social, Offline, Multilayer, Unweighted
https://
A look at the growing reliance of US data centers and the Pentagon on Chinese batteries, a dependence increasingly viewed as a national security threat (New York Times)
https://www.nytimes.com/2025/…
Analysis: Oracle has moved $66B of debt for building AI data centers off its balance sheet using SPVs; Meta has moved $30B, xAI moved $20B, and CoreWeave $2.6B (Tabby Kinder/Financial Times)
https://www.ft.com/content/0ae9d6cd-6b94-4e22-a559-f047734bef83
Anthropic's data shows software engineering accounts for ~50% of its AI agent tool calls; the remaining verticals are greenfields most founders are overlooking (Garry Tan/Garry's List)
https://garryslist.org/posts/half-the-ai-agent-market-i…
US farmers are increasingly rejecting multimillion-dollar offers from data center developers; some estimate ~40K acres are needed globally for new AI projects (Niamh Rowe/The Guardian)
https://www.theguardian.com/technology/2026/feb/21/us-farmers-datacenters
Intel says it struggled to satisfy demand for its server chips used in AI data centers, and forecasts Q1 2026 revenue and profit below market estimates (Reuters)
https://www.reuters.com/business/intel-forecasts-first-quarter-sales-profi…
Kargo, which uses cameras and sensors to inspect pallets in a warehouse and provide accurate inventory data, raised a $42M Series B led by Avenir (Colin Campbell/Axios)
https://www.axios.com/pro/supply-chain-deals/2025/12/22/warehouse-tech-kargo-42m-…
How some political candidates and activists across ideologies and professions are pushing back on the spread of data centers and AI in the US (Andrew R. Chow/Time)
https://time.com/7377579/ai-data-centers-people-movement-cover/
The US DOJ says at least two DOGE employees accessed Social Security data that was off-limits under a court ruling and shared agency data on third-party servers (April Rubin/Axios)
https://www.axios.com/2026/01/20/doge-employees-social-security-inform…
IBM shares fall 12% after Anthropic outlined in a blog post how Claude Code can automate the exploration and analysis phases of COBOL modernization (Pia Singh/CNBC)
https://www.cnbc.com/2026/02/23/ibm-is-the-latest-ai-casualty-…
Anthropic says DeepSeek, MiniMax, and Moonshot violated its ToS by prompting Claude a combined 16M times and using distillation to train their own products (Wall Street Journal)
https://www.wsj.com/tech/ai/anthropic-acc…