
2025-06-05 15:03:48
You might want to fine-tune your notification settings:
“Apple Gave Governments Data on Thousands of Push Notifications” - https://www.404media.co/apple-gave-governments-data-on-thousands-of-push-notifications/
You might want to fine-tune your notification settings:
“Apple Gave Governments Data on Thousands of Push Notifications” - https://www.404media.co/apple-gave-governments-data-on-thousands-of-push-notifications/
Apple's July 2022 to June 2024 transparency report shows it gave governments data on thousands of push notifications, which can include unencrypted content (Joseph Cox/404 Media)
https://www.404media.co/apple-gave-governments-data-on-tho…
Data Sharing for Research Tracker
https://fpf.org/blog/data-sharing-for-research-tracker/
@…
Co-authored by Hannah Babinski, for…
This looks very cool.
'OpenAIRE in collaboration with Area Science Park organizes a hands-on workshop titled “Where LEGO Meets FAIR Data,” designed to introduce the principles of FAIR data through a creative, interactive simulation using LEGO metaphors.'
https://www.
"Apple provided governments around the world with data related to thousands of push notifications sent to its devices, which can identify a target’s specific device or in some cases include unencrypted content."
https://www.404media.co/apple-gave-governm
A coalition of seven leading civil society organisations has called on the European Commission to revoke the United Kingdom's data adequacy status, citing what they describe as a sustained and systemic erosion of privacy and data protection standards in the UK.
https://www.
Do you need inspiration how to present a dataset in a clear figure and what package to use? Check out #rstats
This https://arxiv.org/abs/2501.14755 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csDC_…
Trade-offs in Data Memorization via Strong Data Processing Inequalities
Vitaly Feldman, Guy Kornowski, Xin Lyu
https://arxiv.org/abs/2506.01855 https://
Amend Investigatory Powers Act: bar Govt from protected, private customer data
Amend the Act to specifically and completely bar any Government from attempting to compel any company to give them access to customers' data, which we think undermines privacy. We don't want any companies to pull privacy features from the UK market, potentially as a result of the Act and its use.
FDA tells drugmakers to redo studies over data integrity issue
(*CRO misconduct issue)
https://www.statnews.com/pharmalot/2025/04/04/fda-research-cro-raptim-india-data/
Mind Security, which offers AI-powered automated data loss prevention services to help prevent breaches, raised a $30M Series A (Kyt Dotson/SiliconANGLE)
https://siliconangle.com/2025/06/04/mind-raises-30m-help-businesses-prevent-da…
"Only two European states have net zero military emissions target, data shows"
#Europe #Climate #ClimateChange #Emissions
I have contributed a public statement in the currently open consultation on EU data retention plans: https://www.mayrhofer.eu.org/post/on-mass-data-retention/
You might want to do so as well.
An Algorithmic Pipeline for GDPR-Compliant Healthcare Data Anonymisation: Moving Toward Standardisation
Hamza Khan, Lore Menten, Liesbet M. Peeters
https://arxiv.org/abs/2506.02942
What Does Information Science Offer for Data Science Research?: A Review of Data and Information Ethics Literature
Brady D. Lund, Ting Wang
https://arxiv.org/abs/2506.03165
PSA: We had some image loading issues since yesterday on datasci.social, soon to be resolved by itself. Thx for understanding.
Reason: Due to migrating to a more stable provider: An additional cache for images was added that also stores the data in an efficient manner, but due to the large initial load some of its requests are timing out to the backend. We are increasing the timeouts, and also manually pushing data into it so it has data before the initial request.
Apple Gave Governments Data on Thousands of Push Notifications https://www.404media.co/apple-gave-governments-data-on-thousands-of-push-notifications/
This https://arxiv.org/abs/2504.20369 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csHC_…
U.S. Is Trimming Back Its Collection of Consumer Price Data (Ben Casselman/New York Times)
https://www.nytimes.com/2025/06/04/business/bls-price-data-collection.html
http://www.memeorandum.com/250604/p181#a250604p181
RT @…
#Hackathon with a 💰 £4,000 prize pot will take place from 9am-3pm (CEST) on 11, 12 and 16 June.
Entrants will be expected to use new data collected via our web-first panel survey to produce policy-relevant insights.
Registration closes…
The most energetic transients - tidal disruptions of high-mass stars: #ExtremeNuclearTransients (ENTs) are the most energetic transients yet observed.
twitter_higgs: Twitter, Higgs boson (2012)
Data on tweets related to the announcement of the discovery of a new fundamental particle with the features of the Higgs boson on 4th July 2012. Data covers 1-7 July 2012, and includes four types of networks: followers, retweets, replies, and mentions.
This network has 304691 nodes and 563069 edges.
Tags: Social, Online, Weighted, Multilayer
“Legitimate interests is now our legal basis for using your information to improve Meta Products.”
Alright, Zuck. 😂
Here’s what's included:
https://www.facebook.com/privacy/policy/?section_id=18.4-LegitimateInterestsWeRely
Users should be in control of what data their browser extensions can access, which is why Mozilla plans to build a data collection consent directly into Firefox's add-on installation flow
https://blog.mozilla.org/addons/2025/0
Rethinking the effects of data contamination in Code Intelligence
Zhen Yang, Hongyi Lin, Yifan He, Jie Xu, Zeyu Sun, Shuo Liu, Pengpeng Wang, Zhongxing Yu, Qingyuan Liang
https://arxiv.org/abs/2506.02791
Common Corpus: The Largest Collection of Ethical Data for LLM Pre-Training
Pierre-Carl Langlais, Carlos Rosas Hinostroza, Mattia Nee, Catherine Arnett, Pavel Chizhov, Eliot Krzystof Jones, Ir\`ene Girard, David Mach, Anastasia Stasenko, Ivan P. Yamshchikov
https://arxiv.org/abs/2506.01732
Here's a sort of rebuttal to AI 2027 that feels more informed than most of the ones I've read:
https://www.lesswrong.com/posts/XiMRyQcEyKCryST8T/slowdown-after-2028-compute-rlvr-uncertainty-moe-data-wall…
Ukraine's military intelligence agency (HUR) has gained access to sensitive data of Russia's strategic aircraft manufacturer Tupolev
Tupolev, a Soviet-era aerospace firm now fully integrated into Russia's defense-industrial complex, has been under international sanctions since 2022 for its role in Russia's war against Ukraine.
Its bombers have been widely used to launch long-range cruise missiles against Ukrainian cities and infrastructure.
According to the so…
Second-order AAA algorithms for structured data-driven modeling
Michael S. Ackermann, Ion Victor Gosea, Serkan Gugercin, Steffen W. R. Werner
https://arxiv.org/abs/2506.02241
Multimodal Financial Foundation Models (MFFMs): Progress, Prospects, and Challenges
Xiao-Yang Liu Yanglet, Yupeng Cao, Li Deng
https://arxiv.org/abs/2506.01973
Eighteen Exoplanet Host Stars from the NPOI Data Archive
Ellyn K. Baines, Jeremy Jones, James H. Clark III, Henrique R. Schmitt, Jordan M. Stone
https://arxiv.org/abs/2506.02934
This https://arxiv.org/abs/2503.17414 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCY_…
Differentially Private Distribution Release of Gaussian Mixture Models via KL-Divergence Minimization
Hang Liu, Anna Scaglione, Sean Peisert
https://arxiv.org/abs/2506.03467
This https://arxiv.org/abs/2506.01696 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_ees…
Labelling Data with Unknown References
Adrian de Wynter
https://arxiv.org/abs/2506.03083 https://arxiv.org/pdf/2506.03083
Splatting Physical Scenes: End-to-End Real-to-Sim from Imperfect Robot Data
Ben Moran, Mauro Comi, Steven Bohez, Tom Erez, Zhibin Li, Leonard Hasenclever
https://arxiv.org/abs/2506.04120
Incorporating Correlated Nugget Effects in Multivariate Spatial Models: An Application to Argo Ocean Data
Damilya Saduakhas, David Bolin, Xiaotian Jin, Alexandre B. Simas, Jonas Wallin
https://arxiv.org/abs/2506.03042
Signals as a First-Class Citizen When Querying Knowledge Graphs
Tobias Schwarzinger, Gernot Steindl, Thomas Fr\"uhwirth, Thomas Preindl, Konrad Diwold, Katrin Ehrenm\"uller, Fajar J. Ekaputra
https://arxiv.org/abs/2506.03826
https://coincentral.com/coinbase-hid-massive-data-breach-for-months-before-going-public/
“Coinbase knew about a TaskUs employee data breach in January but only disclosed it publicly in May after rejecting hackers' ransom demands.”
Trump Taps Palantir to Create Master Database on Every American | The New Republic
https://newrepublic.com/post/195904/trump-palantir-data-americans
Voor iedereen die zich nog afvroeg of privacy in de VS nog een rol van betekenis speelt of dat er …
A data model to connect the ESO Data Processing System (EDPS) to ELT data archives
Hugo Buddelmeijer, Gijs Verdoes Kleijn
https://arxiv.org/abs/2506.00899 …
Carbon-Aware Temporal Data Transfer Scheduling Across Cloud Datacenters
Elvis Rodrigues, Jacob Goldverg, Tevfik Kosar
https://arxiv.org/abs/2506.04117 http…
Combining social relations and interaction data in Recommender System with Graph Convolution Collaborative Filtering
Tin T. Tran, Vaclav Snasel, Loc Tan Nguyen
https://arxiv.org/abs/2506.02834
Analysis of Multiple Long Run Relations in Panel Data Models with Applications to Financial Ratios
Alexander Chudik, M. Hashem Pesaran, Ron P. Smith
https://arxiv.org/abs/2506.02135
@… So long as you are using the same user data (settings) you will get that. If you want it to run independently specific another directory to house user data with --user-data-dir=
@…
The {esquisse} package makes it easy to plot your data in different ways with a drag and drop interface: #rstats
🇺🇦 #NowPlaying on KEXP's #VarietyMix
Radiohead:
🎵 Weird Fishes/Arpeggi
#Radiohead
https://ghostdata.bandcamp.com/track/radiohead-weird-fishes-arpeggi-ghost-data-remix
https://open.spotify.com/track/4wajJ1o7jWIg62YqpkHC7S
Decentralized COVID-19 Health System Leveraging Blockchain
Lingsheng Chen, Shipeng Ye, Xiaoqi Li
https://arxiv.org/abs/2506.02674 https://
SMOTE-DP: Improving Privacy-Utility Tradeoff with Synthetic Data
Yan Zhou, Bradley Malin, Murat Kantarcioglu
https://arxiv.org/abs/2506.01907 https://
Trump Administration Backs Off Effort to Collect Data on Food Stamp Recipients (Zach Montague/New York Times)
https://www.nytimes.com/2025/06/03/us/politics/trump-administration-personal-data-food-stamp-recipients.html?unlocked_article_code=1.ME8.Ii3t.2hf3Htm_rfho&smid=url-share
http://www.memeorandum.com/250603/p131#a250603p131
This https://arxiv.org/abs/2412.03824 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csAI_…
This https://arxiv.org/abs/2505.19746 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCV_…
I really like this approach
https://infosec.exchange/@masek/114624183034377452
Dynamical Dark Energy from $F(R)$ Gravity Models Unifying Inflation with Dark Energy: Confronting the Latest Observational Data
S. D. Odintsov, V. K. Oikonomou, G. S. Sharov
https://arxiv.org/abs/2506.02245
I honestly don’t get how this is possible in 2025.
There’s no excuse for not having protected backups of everything critical. That was true 30 years ago, it is true today. Back then a bigger piece of the risk was hardware, less malware. Today it is unlikely for a disk failure to take your data because we all learned that RAID was worthwhile. But RAID is not a backup and backups today need to be resistant to intentional malicious deletion.
cora: CORA citations (1998)
Citations among papers indexed by CORA, from 1998, an early computer science research paper search engine. If a paper i cites a paper j also in this data set, then a directed edge connects i to j. (Papers not in the data set are excluded.) Self-loops may be present. The dates of these snapshots are uncertain.
This network has 23166 nodes and 91500 edges.
Tags: Informational, Citation, Unweighted
https://www.reuters.com/legal/legalindustry/marriott-wins-us-appeals-order-striking-down-data-breach-class-action-2025-06-03/
Marriott wins US appeals order striking down data breach class action
Nonsmooth data error estimates for exponential Runge--Kutta methods and applications to split exponential integrators
Qiumei Huang, Alexander Ostermann, Gangfan Zhong
https://arxiv.org/abs/2506.02778
Nearly 3,000 North Face website customer accounts breached as retail incidents continue https://therecord.media/north-face-customer-accounts-data-breach-notification
Optical spectroscopic signatures of the red giant evolutionary state
Ella Xi Wang, Melissa Ness, Thomas Nordlander, Andrew R. Casey, Sarah Martell, Marc Pinsonneault, Xiaoting Fu, Dennis Stello, Claudia Reyes, Marc Hon, Madeleine McKenzie, Mingjie Jian, Jie Yu, Sven Buder, Karin Lind, Joss Bland-Hawthorn, Daniel B. Zucker, Pradosh Barun Das, Richard de Grijs, Michael Hayden
Analysis of Server Throughput For Managed Big Data Analytics Frameworks
Emmanouil Anagnostakis, Polyvios Pratikakis
https://arxiv.org/abs/2506.03854 https:…
If you just need a pretty figure from a dataset and not the full power of R, have a look at #gui
Optimization of Functional Materials Design with Optimal Initial Data in Surrogate-Based Active Learning
Seongmin Kim, In-Saeng Suh
https://arxiv.org/abs/2506.03329
Brookfield says it plans to invest ~$10B to build an AI data center in the Swedish city of Strangnas, after committing to spend €20B in France earlier in 2025 (Supantha Mukherjee/Reuters)
https://www.reuters.com/technology/brookfi
AI Data Development: A Scorecard for the System Card Framework
Tadesse K. Bahiru, Haileleol Tibebu, Ioannis A. Kakadiaris
https://arxiv.org/abs/2506.02071 …
This https://arxiv.org/abs/2503.04149 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csSE_…
No such thing as a slow cybersecurity news day anymore, so don't miss today's Metacurity for the critical infosec developments you should know, including
--Top cyber vendors hope to clean up crazy threat group naming practices,
--Coinbase knew of data leak in January,
--Prolific swatter pleads guilty,
--Cartier confirms data breach,
--Abilene gropes for recovery after rejecting ransom payment,
--North Face customers' data stolen in credential s…
Simulating Complex Crossectional and Longitudinal Data using the simDAG R Package
Robin Denz, Nina Timmesfeld
https://arxiv.org/abs/2506.01498 https://
product_space: Atlas of Economic Complexity export network
Two networks of economic products, where a pair of products are connected if they are exported at similar rates by the same countries. The data are a projection from a bipartite network of nations and the products they export. Edges weights represent a similarity score (called "proximity"). Data based on UN Comtrade worldwide trade patterns. SITC network based on the Standard International Trade Classification and HS …
A Dynamic Framework for Semantic Grouping of Common Data Elements (CDE) Using Embeddings and Clustering
Madan Krishnamurthy, Daniel Korn, Melissa A Haendel, Christopher J Mungall, Anne E Thessen
https://arxiv.org/abs/2506.02160
The Curse of Dimensionality: De-identification Challenges in the Sharing of Highly Dimensional Datasets
https://fpf.org/blog/the-curse-of-dimensionality-de-identification-challenges-in-the-sharing-of-highl…
This https://arxiv.org/abs/2505.18458 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csDB_…
This https://arxiv.org/abs/2505.19193 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csLG_…
This https://arxiv.org/abs/2505.19641 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csAI_…
Mitigating Data Poisoning Attacks to Local Differential Privacy
Xiaolin Li, Ninghui Li, Boyang Wang, Wenhai Sun
https://arxiv.org/abs/2506.02156 https://…
Google says hackers loosely affiliated with the Com in the US, the UK, and Europe breached 20 companies in the US and Europe to steal Salesforce data (Margi Murphy/Bloomberg)
https://www.bloomberg.com/news/articles/20
Big Data-Driven Fraud Detection Using Machine Learning and Real-Time Stream Processing
Chen Liu, Hengyu Tang, Zhixiao Yang, Ke Zhou, Sangwhan Cha
https://arxiv.org/abs/2506.02008 …
Dynamically derived morphology from the recurrence patterns of close binary stars using Kepler data
Anisha R. V. Kashyap, D. Pawar, R. Misra, G. Ambika, Sandip V George
https://arxiv.org/abs/2506.04111
The Com strikes again.
Google Warns Hackers Stealing Salesforce Data From Companies
https://www.bloomberg.com/news/articles/20
Google warns of cybercriminals targeting Salesforce app to steal data, extort companies https://therecord.media/google-warns-cybercriminals-targeting-salesforce-apps
Integrating Expert Knowledge and Recursive Bayesian Inference: A Framework for Spatial and Spatio-Temporal Data Challenges
Mario Figueira, David Conesa, Antonio L\'opez-Qu\'ilez, H{\aa}vard Rue
https://arxiv.org/abs/2506.00221
Wall Street giants like Blackstone, KKR, and BlackRock are pouring hundreds of billions into AI data centers, creating concerns of "oversupply" and a bubble (Maureen Farrell/New York Times)
https://www.nytimes.com/2025/06/02/business/ai-da…
CityPulse: Real-Time Traffic Data Analytics and Congestion Prediction
Idriss Djiofack Teledjieu, Irzum Shafique
https://arxiv.org/abs/2506.01971 https://…
scDataset: Scalable Data Loading for Deep Learning on Large-Scale Single-Cell Omics
Davide D'Ascenzo, Sebastiano Cultrera di Montesano
https://arxiv.org/abs/2506.01883
Vintage NPOI: New and Updated Angular Diameters for 145 Stars
Ellyn K. Baines, James H. Clark III, Bradley I. Kingsley, Henrique R. Schmitt, Jordan M. Stone
https://arxiv.org/abs/2506.02912
D-Rex: Heterogeneity-Aware Reliability Framework and Adaptive Algorithms for Distributed Storage
Maxime Gonthier (University of Chicago, Argonne National Laboratory), Dante D. Sanchez-Gallegos (Universidad Carlos III de Madrid), Haochen Pan (University of Chicago), Bogdan Nicolae (Argonne National Laboratory), Sicheng Zhou (Southern University of Science and Technology), Hai Duc Nguyen (University of Chicago, Argonne National Laboratory), Valerie Hayot-Sasson (University of Chicago, Ar…
Germany's data protection commission fines Vodafone a record €45M due to data privacy violations linked to malicious behavior by third-party sales agents (Karin Matussek/Bloomberg)
https://www.bloomberg.com/news/articles/20
This https://arxiv.org/abs/2505.23470 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csLG_…
Tel Aviv-based Speedata, which is designing analytics processing units for big data workloads, raised a $44M Series B, aims to showcase its first APU this month (Kate Park/TechCrunch)
https://techcrunch.com/2025/06/03/speedata…
Investigating Timing-Based Information Leakage in Data Flow-Driven Real-Time Systems
Mohammad Fakhruddin Babar, Zain A. H. Hammadeh, Mohammad Hamad, Monowar Hasan
https://arxiv.org/abs/2506.01991
This https://arxiv.org/abs/2505.21938 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csLG_…
This https://arxiv.org/abs/2505.24603 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csLG_…
Brazil is piloting dWallet, a digital wallet program that allows users to monetize their data, the first nationwide initiative of its kind in the world (Gabriel Daros/Rest of World)
https://restofworld.org/2025/brazil-dwallet-user-data-pilot/
SubMIT: A Physics Analysis Facility at MIT
Josh Bendavid (Massachusetts Institute of Technology, CERN), Mariarosaria D'Alfonso (Massachusetts Institute of Technology), Jan Eysermans (Massachusetts Institute of Technology), Chad Freer (Massachusetts Institute of Technology), Maxim Goncharov (Massachusetts Institute of Technology), Matthew Heine (Massachusetts Institute of Technology), Luca Lavezzo (Massachusetts Institute of Technology), Marianne Moore (Massachusetts Institute of Te…
This https://arxiv.org/abs/2505.17226 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csLG_…
23andMe plans to hold a new auction for its DNA data, opening with a $305M bid led by Anne Wojcicki, after the ex-CEO challenged Regeneron's winning bid in May (Steven Church/Bloomberg)
https://www.bloomberg.com/news/articles/20
This https://arxiv.org/abs/2505.11197 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csLG_…