2025-11-10 20:18:00
Rumble übernimmt deutschen Cloudbetreiber Northern Data
Truth-Social-Host Rumble steigt ins Cloudgeschäft ein und übernimmt die deutsche Cloudfirma Northern Data. Deren Aktionäre erhalten weniger als angekündigt.
h…
Rumble übernimmt deutschen Cloudbetreiber Northern Data
Truth-Social-Host Rumble steigt ins Cloudgeschäft ein und übernimmt die deutsche Cloudfirma Northern Data. Deren Aktionäre erhalten weniger als angekündigt.
h…
"EU-US Data Transfers: Time to prepare for more trouble to come", #maxschrems…
Mississippi Governor Tate Reeves says xAI is set to spend $20B to build its MACROHARDRR data center in Southaven, its third in the greater Memphis area (Sophie Bates/Associated Press)
https://apnews.com/article/xai-musk-data-center-missis…
https://fortune.com/2025/12/10/coupang-ceo-resigns-south-korea-data-breach/
Wow
Coupang CEO resigns over historic South Korean data breach
Beyond Real Data: Synthetic Data through the Lens of Regularization
Amitis Shidani, Tyler Farghly, Yang Sun, Habib Ganjgahi, George Deligiannidis
https://arxiv.org/abs/2510.08095
What the data says about Ukraine’s mobilization crisis: https://benborges.xyz/2026/01/09/what-the-data-says-about.html
Japanese nuclear plant operator fabricated seismic risk data https://arstechnica.com/science/2026/01/japanese-nuclear-plant-operator-fabricated-seismic-risk-data/
can’t wait for ai bros to argue that csam laws are holding agi back https://www.404media.co/a-developer-accidentally-found-csam-in-ai-data-google-banned-him-for-it/
Synthetic Series-Symbol Data Generation for Time Series Foundation Models
Wenxuan Wang, Kai Wu, Yujian Betterest Li, Dan Wang, Xiaoyu Zhang
https://arxiv.org/abs/2510.08445 http…
Seoul cyber investigators seize data, devices from ‘South Korea’s Amazon’ following data breach https://therecord.media/seoul-cyber-investigators-seize-data-korea-tech-giant
»Instagram Data Leak Exposes Sensitive Info of 17.5M Accounts:
A significant security breach has compromised approximately 17.5 million Instagram user accounts, exposing sensitive personal information that is now circulating on the dark web«
Which of you is surprised or even shocked - as you are not now? Unfortunately, many still naively believe that they have nothing to hide.
🤷
Contrastive Decoding for Synthetic Data Generation in Low-Resource Language Modeling
Jannek Ulm, Kevin Du, V\'esteinn Sn{\ae}bjarnarson
https://arxiv.org/abs/2510.08245 http…
'Are you a data professional, researcher or practitioner (user) working with data within or across communities and sectors? This survey is for you!' https://qualtrics.ucl.ac.uk/jfe/form/SV_9S7AtciBfXf9K9o
Via Melissa Terras on a DH mailing list, so GLAMs…
Scaling up AI requires staggering amounts of power and water
— especially when considering that many areas are already dealing with strained grids or drought conditions.
Even when optimized, a single hyperscale facility can draw as muchpower as a mid-sized city
and millions of gallons of water annually.
Professor Romany Webb, deputy director of Columbia University's Sabin Center for Climate Change Law, explained the challenge:
"Data centers are incred…
Google says Cl0p hackers who exploited vulnerabilities in Oracle's E-Business Suite have stolen data from "dozens" of organizations since at least July 10 (Zack Whittaker/TechCrunch)
https://techcrunch.com/2025/10/09/dozens-of-o…
A template for data analysis projects structured as R packages (or not) https://github.com/Pakillo/template by @…
Future of Privacy Forum Appoints Matthew Reisman as Vice President of U.S. Policy
https://fpf.org/press-releases/future-of-privacy-forum-appoints-matthew-reisman-as-vice-president-of-u-s-policy/
New Jersey lawmakers OK plan to charge data centers for spiking electric costs - Route Fifty
https://www.route-fifty.com/infrastructure/2026/01/nj-lawmakers-ok-plan-charge-data-centers-spiking-electric-costs/410580/
DRACO: Data Replication and Collection Framework for Enhanced Data Availability and Robustness in IoT Networks
Waleed Bin Qaim, Oznur Ozkasap, Rabia Qadar, Moncef Gabbouj
https://arxiv.org/abs/2510.07464
"Did the Upper Great Highway closure make Sunset neighborhood streets less safe? Supervisor Alan Wong claimed it did at a January 8, 2026 press conference, citing a simple year-over-year map comparison of crash data. But my analysis, using the same DataSF crash data with rigorous statistical controls, finds no evidence to support that claim, and if anything, the data suggest the opposite."
oh nice, redpocket brought back my favorite sim plan (for my kids), and they upgraded it!
For $30/yr, you get unlimited text/talk, and 200MB per month. For those of you with low data needs (or who specifically want to limit their kids data usage): https://www.ebay.com/itm/136840233242
Trump Posted Unpublished Jobs Data Early in Social Media Post (Reade Pickert/Bloomberg)
https://www.bloomberg.com/news/articles/2026-01-09/trump-posted-unpublished-jobs-data-early-in-social-media-post
http://www.memeorandum.com/260109/p43#a260109p43
DL-PIM: Improving Data Locality in Processing-in-Memory Systems
Parker Hao Tian, Zahra Yousefijamarani, Alaa Alameldeen
https://arxiv.org/abs/2510.07719 https://
Apologies no alt text at the link destination.
It's a message from Andrew Feinstein on Twitter (although I've used a "nitter" link) from the end of October, saying that he and 2 other people at MOU have resigned as directors, and explaining some shenanigans over the "Your Party" membership data.
I don't fully understand all the ins and outs of the original connection between MOU and "Your Party", but evidently MOU was something else originally and then got lumbered with looking after YP money and membership data.
Main takeaway, it does seem that the "Your Party" contingent are remarkably bad at admin!
I don't feel personally worried about it, in fact it might be a good thing if "Your Party" properly falls over and everyone joins the Greens.
#YourParty #AndrewFeinstein #UKPol
のびてるね、投票数
QT: https://fedibird.com/@tukine/115859321870564945
TracE2E: Easily Deployable Middleware for Decentralized Data Traceability
Daniel Pressens\'e, Elisavet Kozyri
https://arxiv.org/abs/2510.08225 https://…
BLAZER: Bootstrapping LLM-based Manipulation Agents with Zero-Shot Data Generation
Rocktim Jyoti Das, Harsh Singh, Diana Turmakhan, Muhammad Abdullah Sohail, Mingfei Han, Preslav Nakov, Fabio Pizzati, Ivan Laptev
https://arxiv.org/abs/2510.08572
cora: CORA citations (1998)
Citations among papers indexed by CORA, from 1998, an early computer science research paper search engine. If a paper i cites a paper j also in this data set, then a directed edge connects i to j. (Papers not in the data set are excluded.) Self-loops may be present. The dates of these snapshots are uncertain.
This network has 23166 nodes and 91500 edges.
Tags: Informational, Citation, Unweighted
TCDRM: A Tenant Budget-Aware Data Replication Framework for Multi-Cloud Computing
Santatra Hagamalala Bernardin (IRIT-PYRAMIDE, IRIT), Riad Mokadem (IRIT-PYRAMIDE, IRIT), Franck Morvan (IRIT-PYRAMIDE, IRIT), Hasinarivo Ramanana, Hasimandimby Rakotoarivelo
https://arxiv.org/abs/2510.07833
https://www.koreatimes.co.kr/business/companies/20251209/police-raid-coupang-over-massive-data-breach
This is what happens when you do bad cybersecurity.
Police reportedly seek to check for possible lapses in Coupang'…
Advanced PHPUnit Data Provider shenanigans. #PHP
Part 1: https://peakd.com/hive-168588/@crell/fun-with-phpunit-data-providers
Part 2:
On 16WW Data Collections and Graphs - Open for research #dataset - https://www.earth.org.uk/note-on-data.html
Hyperspectral data augmentation with transformer-based diffusion models
Mattia Ferrari, Lorenzo Bruzzone
https://arxiv.org/abs/2510.08363 https://arxiv.org…
Woo hoo! You all get to start adding stuff to Cosmik Network’s Semble: a social bookmarking tool built on ATProto, so all your data is stored in your own account https://semble.so
I’ve been using it for weeks, have it as a PWA on my phone, and it’s been great to just get the basics of saving links somewhere.
You have noticed NFL TV ratings are soaring. Now understand why that is happening https://www.nytimes.com/athletic/6699995/2025/10/10/nfl-tv-ratings-nielsen-big-data/
The Milky Way Imaging Scroll Painting Survey - Data Release 1: #MWISP survey was conducted using the PMO 13.7 m telescope at a spatial resolution of approximately 50" and a velocity resolution of 0.16 km/s at 115 GHz. DR1 fully covered 2310 square degrees within the Galactic longitude (l) and latitude (b) range of 9.75 deg =< l=< 229.75 deg and |b| =< 5.25 deg."
Sistrix: among the 10 UK news sites with the highest search visibility scores, six saw double-digit percentage drops in Google search visibility since January (Charlotte Tobitt/Press Gazette)
https://pressgazette.co.uk/media-audie
Photometric Redshift Estimation for Rubin Observatory Data Preview 1 with Redshift Assessment Infrastructure Layers (RAIL)
T. Zhang, E. Charles, J. F. Crenshaw, S. J. Schmidt, P. Adari, J. Gschwend, S. Mau, B. Andrews, E. Aubourg, Y. Bains, K. Bechtol, A. Boucaud, D. Boutigny, P. Burchat, J. Chevalier, J. Chiang, H. -F. Chiang, D. Clowe, J. Cohen-Tanugi, C. Combet, A. Connolly, S. Dagoret-Campagne, P. N. Daly, F. Daruich, G. Daubard, J. De Vicente, H. Drass, K. Fanning, E. Gawiser, M. …
Irgend jemand scheint unter "Digitale Souveränität" etwas ganz anderes zu verstehen als der Rest der Welt.
#DigitaleSouveränität #DigitalSovereignty
A data fusion approach for mobility hub impact assessment and location selection: integrating hub usage data into a large-scale mode choice model
Xiyuan Ren, Joseph Y. J. Chow
https://arxiv.org/abs/2510.08366
som vanligt toppennyhetsbrev av @… https://dekaminski.se/?mailpoe…
Det kanadensiska samtalet om digital suveränitet är i full fart. Canadian Center for Policy Alternatives skriver under rubriken "Every data centre is a U.S military base".
https://www.policyalternatives.ca/news-research/every-data-centre-is…
A Developer Accidentally Found CSAM in AI Data. Google Banned Him For It
Mark Russo reported the dataset to all the right organizations, but still couldn't get into his accounts for months.
— by @…
🤦
Una lectura mšs que obligatoria para entender lo que conlleva “la nube”
https://www.cloudwards.net/deep-dives/the-data-center-water-wars/
@…
Yeah lmao just tested and I don’t even need to reboot the console, just restart the game
Weight data and played days don’t seem to be moved tho, just the rankings
(I do have an even older nnid account that I don’t use anymore, tho that has a different name (just “oldaccound” now) and it didn’t steal any saves)
(I don’t really remember if when moving to Pretendo I moved the mii and created a new similar one for the old account or if it was the other way around and I just created a similar mii to the new one)
(Back then I did try moving the save data between the accounts with a SaveMii, but it didn’t work for Wii Fit U (or for mk8, tho haven’t had issues with that one))
#WiiU
Prompts Generalize with Low Data: Non-vacuous Generalization Bounds for Optimizing Prompts with More Informative Priors
David Madras (Richard), Joshua Safyan (Richard), Qiuyi (Richard), Zhang
https://arxiv.org/abs/2510.08413
I rewrote a data analysis pipeline, moving it from #python to #julialang . I am now in love with the threading support in Julia.
The task is very parallelizable but each thread needs random read access to a tens-of-GB dataset. In Python (with multiprocessing, shared stores, etc) data bookkeeping was a nightmar…
‘Profound impacts’: record ocean heat is intensifying climate disasters, data shows
https://www.theguardian.com/environment/2026/jan/09/profound-impacts-record-ocean-heat-intensifying-climate-disa…
The Austrian Data Protection Authority ("DSB") issued a decision finding that Microsoft 365 Education illegally tracks students and uses student data for Microsoft's own purposes.
https://noyb.eu/en/noyb-win-microsoft-365-education-tracks-school-child…
EU Set the Global Standard on Privacy and AI. Now It’s Pulling Back | TechPolicy.Press https://techpolicy.press/eu-set-the-global-standard-on-privacy-and-ai-now-its-pulling-back
Curriculum Learning with Synthetic Data for Enhanced Pulmonary Nodule Detection in Chest Radiographs
Pranav Sambhu, Om Guin, Madhav Sambhu, Jinho Cha
https://arxiv.org/abs/2510.07681
The thing about a life-logger, is you input sensitive data about your life, lifestyle and activities, so privacy and data-integrity are some of the most important issues.
There can be no server, the data has to be yours and yours alone. Because you can’t tell what is happening to the data in a closed-source app, it must be completely free and open source.
You can’t trust a corporate diary, they must sell to anyone offering enough money.
So it is with my life log app, all data completely in your own device. No home server ever sees anything.
There is no home server. Just the code.
To achieve this Exocortex Log is a Progressive Web App. It downloads when you are online at the website and can be installed onto the homepage of your phone.
It keeps all data on the local device using indexdb.
This means you must be responsible for your own backups. Be sure to export and back up your data regularly. I have gaps in my ten year record where my phone was stolen and most recent backup was months prior.
Once installed it will work offline, airplane mode, no internet, down in the tube station at midnight, anywhere.
There's a blog on the website saying this and more: https://exocortexlog.com/news/articles/2025-12-06-release/
Who Stole Your Data? A Method for Detecting Unauthorized RAG Theft
Peiyang Liu, Ziqiang Cui, Di Liang, Wei Ye
https://arxiv.org/abs/2510.07728 https://arxi…
Stochastic Gradient Descent for Incomplete Tensor Linear Systems
Anna Ma, Deanna Needell, Alexander Xue
https://arxiv.org/abs/2510.07630 https://arxiv.org/…
High-dimensional Analysis of Synthetic Data Selection
Parham Rezaei, Filip Kovacevic, Francesco Locatello, Marco Mondelli
https://arxiv.org/abs/2510.08123 https://
Teen who allegedly stole millions of personal data records arrested in Spain https://therecord.media/spain-arrests-teen-suspect-data-theft-and-sale
White House claims "more than 1,000%" rise in assaults on ICE agents, data says otherwise (NPR)
https://www.npr.org/2025/10/10/nx-s1-5565146/white-house-claims-more-than-1-000-rise-in-assaults-on-ice-agents-data-says-otherwise
http://www.memeorandum.com/251010/p96#a251010p96
Protege, which offers an AI data platform to access high-quality, proprietary training data at scale, raised $30M from a16z, extending its $25M Series A (Duncan Riley/SiliconANGLE)
https://siliconangle.com/2026/01/08/protege-raises-30m-grow-…
92% of ICE Detainees
since Sept 2025 are Immigrants with No Criminal Convictions
ICE published new detention data today that brings the current population to a record-high of 68,990.
Plus: ICE records first detention death of 2026
https://austinkocher.substack.com/p/92
The NHS Barts Health Hospital wants to legally ban the publication, use, or sharing of data stolen by the Clop gang by anyone.
https://www.bankinfosecurity.com/uk-hospital-asks-court-to-stymie-ransomware-data-leak-a-30222
Uber launches Uber Intelligence, an insights platform that lets advertisers tap into Uber's data about customer trips and deliveries (Lara O'Reilly/Business Insider)
https://www.businessinsider.com/uber-ads-launches-intelli…
MobilityDuck: Mobility Data Management with DuckDB
Nhu Ngoc Hoang, Ngoc Hoa Pham, Viet Phuong Hoang, Esteban Zim\'anyi
https://arxiv.org/abs/2510.07963 https://
Robust Source-Free Domain Adaptation for Medical Image Segmentation based on Curriculum Learning
Ziqi Zhang, Yuexiang Li, Yawen Huang, Nanjun He, Tao Xu, Liwei Lin, Yefeng Zheng, Shaoxin Li, Feiyue Huang
https://arxiv.org/abs/2510.08393
FastUMI-100K: Advancing Data-driven Robotic Manipulation with a Large-scale UMI-style Dataset
Kehui Liu, Zhongjie Jia, Yang Li, Zhaxizhuoma, Pengan Chen, Song Liu, Xin Liu, Pingrui Zhang, Haoming Song, Xinyi Ye, Nieqing Cao, Zhigang Wang, Jia Zeng, Dong Wang, Yan Ding, Bin Zhao, Xuelong Li
https://arxiv.org/abs/2510.08022
An Airwallex executive warned in 2023 that China staff were pushing to access client data; Keith Rabois accuses Airwallex of enabling Chinese access to US data (Lucas Baird/Australian Financial Review)
https://www.afr.com/companies/finan…
A Developer Accidentally Found CSAM in AI Data. Google Banned Him For It https://www.404media.co/a-developer-accidentally-found-csam-in-ai-data-google-banned-him-for-it/
Lets be honest, we spend too much time cleaning data. {janitor} can help with that: #rstats #datasciece
Open Infrastructure Map
Open Infrastructure Map is a view of the world's infrastructure mapped in the OpenStreetMap database. This data isn't exposed on the default OSM map, so there built Open Infrastructure Map to visualise it.
🌎 https://openinframap.org
:mastodon:
Austria's privacy regulator finds that Microsoft violated EU law by illegally tracking students through its Microsoft 365 Education software (Suzanne Smalley/The Record)
https://therecord.media/microsoft-violated-eu-law-austria
Better Together: Leveraging Unpaired Multimodal Data for Stronger Unimodal Models
Sharut Gupta, Shobhita Sundaram, Chenyu Wang, Stefanie Jegelka, Phillip Isola
https://arxiv.org/abs/2510.08492
A district court judge denies Texas AG Ken Paxton's request for a restraining order preventing Samsung from collecting data on TV watchers (Ryan Autullo/Bloomberg Law)
https://news.bloomberglaw.com/tech-and-telecom-law/sa…
Coupang CEO Park Dae-jun resigns after the company incurred South Korea's largest-ever data breach, in which the personal data of 30M people was compromised (Mark Anderson/Bloomberg)
https://www.bloomberg.com/news/articles/20
R2RGEN: Real-to-Real 3D Data Generation for Spatially Generalized Manipulation
Xiuwei Xu, Angyuan Ma, Hankun Li, Bingyao Yu, Zheng Zhu, Jie Zhou, Jiwen Lu
https://arxiv.org/abs/2510.08547
Not only was Coupang's CEO forced to step down today but police raided the company's headquarters for a second time too.
Police raid Coupang headquarters for 2nd day over massive data breach
https://en.yna.co.kr/view/AEN20251210004400315
Kristi Noem and DHS do not deserve the benefit of the doubt (Philip Bump/MS NOW)
https://www.ms.now/opinion/kristi-noem-dhs-ice-data
http://www.memeorandum.com/260109/p130#a260109p130
On Website Technicals (2026-01) - Tech updates: new energy series, SSES peak tweak, Zenodo archives, data correction... - https://m.earth.org.uk/note-on-site-technicals-104.html
Starcloud, which launched a satellite with a Nvidia H100 chip in November, says the satellite is running and querying responses from Google's Gemma (Pia Singh/CNBC)
https://www.cnbc.com/2025/12/10/nvidia-backed-starcloud-tr…
Data privacy whistleblowers would get expanded protections under California proposal https://therecord.media/california-data-privacy-agency-whistleblower-protections-proposal
Dual-granularity Sinkhorn Distillation for Enhanced Learning from Long-tailed Noisy Data
Feng Hong, Yu Huang, Zihua Zhao, Zhihan Zhou, Jiangchao Yao, Dongsheng Li, Ya Zhang, Yanfeng Wang
https://arxiv.org/abs/2510.08179
Check out today's Metacurity for the most crucial cybersecurity developments you should know, including
--Man pleads guilty in the first successful US prosecution of a stalkerware operator,
--Korea warns of hacking forum that steals and sells data,
--NZ High court enjoins publication of stolen medical data,
--UK government launches $282m cyber action plan,
--Threat actor stole and threatens to leak data from insurer Prosura,
--Command injection flaw fou…
Sources: OpenAI has been slow to expand in-app checkouts for ChatGPT as the startup and its partners Shopify and Stripe struggle to standardize merchant data (Ann Gehan/The Information)
https://www.theinformation.com/articles/openais-shopping-ambitio…
It's a pretty big day in cybersecurity news, so don't miss today's Metacurity for the critical developments you should know, including
--Korean cops raid Coupang HQ looking for security lapses, breach perpetrator clues,
--Compromise NDAA bill is chock full of cyber provisions,
--FTC rejects petition from spyware company founder,
--Commonwealth Bank of Australia fined A$702k for breaching data rules,
--FBI warns of fake proof of life photos,
--Oz…
Several of Asia's top tycoons and conglomerates are joining the data center race as tech giants plan $240B in APAC hyperscale expansion over the next five years (Jonathan Burgos/Forbes)
http://www.forbes.com/sites/jonathanburgos/2…
Uber launches Uber Intelligence, an insights platform that lets advertisers tap into Uber's data about customer trips and deliveries (Lara O'Reilly/Business Insider)
https://www.businessinsider.com/uber-ads-launches-intelli…
https://www.bleepingcomputer.com/news/security/sandworm-hackers-use-data-wipers-to-disrupt-ukraines-grain-sector/
Sandworm hackers use data wipers to disrupt Ukraine's grain sector
The European Commission says Apple and Google's Android-iPhone data transfer tool, which will be available globally, is an example of the benefits of the DMA (Chance Miller/9to5Mac)
https://9to5mac.com/2025/12/09/iphone-android-switching-ios-26/
China bans TechInsights from working with or receiving data from Chinese entities, citing national security concerns, after a report on Huawei's Ascend AI chips (Dylan Butts/CNBC)
https://www.cnbc.com/2025/10/10/china-blac
Sources: Blue Origin has worked for over a year on tech for orbital AI data centers; SpaceX plans to use upgraded Starlink satellites for AI computing payloads (Wall Street Journal)
https://www.wsj.com/tech/bezos-and-musk-ra
Rumble agrees to acquire German AI infrastructure company Northern Data in an up to $970M deal, set to close in Q2 2026; both companies are backed by Tether (Billy Gray/Wall Street Journal)
https://www.wsj.com/b…
South Korean media: police raided Coupang's HQ, searching for evidence related to a historic data breach that compromised 30M people's personal information (Jane Lanhee Lee/Bloomberg)
https://www.bloomberg.com/news/articles/20
Snowflake agrees to acquire Observe, an observability platform built on Snowflake databases; the deal was previously reported to be valued at around $1B (Rebecca Szkutak/TechCrunch)
https://techcrunch.com/2026/01/08/snowflake-announces-…
Microsoft, Meta, and sources say, Google will not publish diversity reports and data this year; Amazon, Apple, and Nvidia released diversity data this year (Paresh Dave/Wired)
https://www.wired.com/story/google-microsoft-and-meta-have-s…
Polymarket agrees to its first media partnership to supply Dow Jones outlets, including the WSJ and Barron's, with real-time prediction market trading data (Katherine Doherty/Bloomberg)
https://www.bloomberg.com/news/articles/20
How the huge data center buildout is heating up local politics in US towns across red and blue states on issues like water, electricity, and noise (Evan Halper/Washington Post)
https://www.washingtonpost.com/business/2026/01/06/da…