dblp_cite: DBLP citations (2014)
Citations among papers contained in the DBLP computer science bibliography. If a paper i cites a paper j also in this data set, then a directed edge connects i to j. (Papers not in the data set are excluded.) Self-loops may be present. This snapshot is from May 2014.
This network has 12590 nodes and 49759 edges.
Tags: Informational, Citation, Unweighted
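The edge convention described above (a directed edge from i to j when paper i cites paper j, with self-loops allowed) can be sketched in a few lines of Python. This is only an illustration: the post does not say how the data set is stored, so the `toy_edges` list and the function name `build_citation_graph` are hypothetical.

```python
from collections import defaultdict

def build_citation_graph(edges):
    """Build a directed, unweighted citation graph from (i, j) pairs.

    An edge (i, j) means "paper i cites paper j". Repeated pairs are
    ignored because the network is unweighted; self-loops are kept
    and counted separately.
    """
    out_adj = defaultdict(set)   # paper -> set of papers it cites
    in_deg = defaultdict(int)    # paper -> number of citations received
    self_loops = 0
    for i, j in edges:
        if j in out_adj[i]:
            continue  # duplicate edge, already recorded
        out_adj[i].add(j)
        in_deg[j] += 1
        if i == j:
            self_loops += 1
    return out_adj, in_deg, self_loops

# Hypothetical toy data, not taken from the DBLP snapshot.
toy_edges = [(1, 2), (1, 3), (2, 3), (3, 3)]
out_adj, in_deg, self_loops = build_citation_graph(toy_edges)
```

Here the in-degree of a node is the number of times that paper is cited within the data set, which is usually the first quantity examined in a citation network.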
Minimizing Communication for Parallel Symmetric Tensor Times Same Vector Computation
Hussam Al Daas (STFC, Scientific Computing Department, Rutherford Appleton Laboratory, Didcot, UK), Grey Ballard (Wake Forest University, Computer Science Department, Winston-Salem, NC, USA), Laura Grigori (EPFL, Institute of Mathematics, Lausanne, Switzerland, PSI, Center for Scientific Computing, Theory and Data, Villigen, Switzerland), Suraj Kumar (Institut national de recherche en sciences et technologi…
A new paper projecting Joshua tree habitat under future climate based on incredibly high-resolution distribution data, from Joshua Tree Genome Project collaborators at USGS. They estimate up to 80% loss of suitable habitat by 2100 under the worst-case climate scenario.
#JoshuaTree #science
Photoabsorption Cross Sections studied within the axially deformed Relativistic Quasiparticle Finite Amplitude Framework
C. Chen (Frontiers Science Center for Rare isotope, Lanzhou University, Lanzhou, China, School of Nuclear Science and Technology, Lanzhou University, Lanzhou, China), Y. F. Niu (Frontiers Science Center for Rare isotope, Lanzhou University, Lanzhou, China, School of Nuclear Science and Technology, Lanzhou University, Lanzhou, China), R. Xu (China Nuclear Data Center,…
A case study: the savings potential thanks to FAIR data in one Materials Science PhD project
Michael Seitz, Nick Garabedian, Ilia Bagov, Christian Greiner
https://arxiv.org/abs/2506.12043
MindGrab for BrainChop: Fast and Accurate Skull Stripping for Command Line and Browser
Armina Fani (Tri-Institutional Center for Translational Research in Neuroimaging and Data Science), Mike Doan (Tri-Institutional Center for Translational Research in Neuroimaging and Data Science), Isabelle Le (Tri-Institutional Center for Translational Research in Neuroimaging and Data Science), Alex Fedorov (Emory University), Malte Hoffmann (Harvard University), Chris Rorden (University of South C…

We developed MindGrab, a parameter- and memory-efficient deep fully-convolutional model for volumetric skull-stripping in head images of any modality. Its architecture, informed by a spectral interpretation of dilated convolutions, was trained exclusively on modality-agnostic synthetic data. MindGrab was evaluated on a retrospective dataset of 606 multimodal adult-brain scans (T1, T2, DWI, MRA, PDw MRI, EPI, CT, PET) sourced from the SynthStrip dataset. Performance was benchmarked against Synth…
High Signal: Data Science | Career | AI
Great Australian Pods Podcast Directory: #GreatAusPods
Procedures for Constraining Robotic Fiber Positioning for Highly Multiplexed Spectroscopic Surveys: The Case of FPS for SDSS-V
Ilija Medan, Tom Dwelly, Kevin R. Covey, Eleonora Zari, Michael R. Blanton, Joleen K. Carlberg, S. Drew Chojnowski, Alexander Ji, Yue Shen, John Donor, José Sánchez-Gallego, Sean Morrison, Héctor J. Ibarra-Medel, Conor Sayres, Keivan G. Stassun
Humans, Machine Learning, and Language Models in Union: A Cognitive Study on Table Unionability
Sreeram Marimuthu, Nina Klimenkova, Roee Shraga
https://arxiv.org/abs/2506.12990
🔍 How we can learn from experience: Thomas Bayes and the diagnosis of diseases in medicine
Risk & probability: the keys to better health decisions! 🩺 We explain how statistics & medicine fit together.
⏲ Time: 17:00-24:00
🗓️ Date: 21.06.2025
🧑‍🔬 Organizer: CIDAS Campus-Institut Data Science
📍 Venue: @…
Don Scott, General Manager of Azure AI at Microsoft, shares a win for the second consecutive year in this post in the AI and Machine Learning section of the official Azure blog. Gartner Group has identified Microsoft as an industry leader in data science and machine learning platforms.
"Microsoft recognized for second consecutive year as a Leader in the 2025 Gartner® Magic Quadrant™ for Data Science and Machine Learning Platforms"
This looks very cool.
'OpenAIRE in collaboration with Area Science Park organizes a hands-on workshop titled “Where LEGO Meets FAIR Data,” designed to introduce the principles of FAIR data through a creative, interactive simulation using LEGO metaphors.'
https://www.
The most energetic transients - tidal disruptions of high-mass stars: #ExtremeNuclearTransients (ENTs) are the most energetic transients yet observed.
Our partner conference, MICES – Mix-Camp E-Commerce Search, takes place on June 18th at TUECHTIG in Berlin. Focusing on e-commerce search, MICES brings together experts from various fields such as IT, product management, UX design, search management, information retrieval, data science and search engine vendors to discuss challenges, share ideas, best practices and case studies in e-commerce search.
Register for free and learn more here:
MLOps with Microservices: A Case Study on the Maritime Domain
Renato Cordeiro Ferreira (Jheronimus Academy of Data Science, Technical University of Eindhoven, Tilburg University), Rowanne Trapmann (Jheronimus Academy of Data Science, Technical University of Eindhoven, Tilburg University), Willem-Jan van den Heuvel (Jheronimus Academy of Data Science, Technical University of Eindhoven, Tilburg University)
SAS in ESA Datalabs: A New Platform for XMM-Newton Analysis
Esin G. Gulbahar, Camille M. Diez, Aitor Ibarra, Ivan Valtchanov, Richard Saxton, Ignacio de la Calle Perez, Jose Lopez-Miralles, Alejandro Gonzalez Ganzabal, Peter Kretschmar
https://arxiv.org/abs/2506.14444
Breakthrough Listen: A Technosignature Search Around 27 Eclipsing Exoplanets Selected from the Transiting Exoplanet Survey Satellite Catalogue
R. Barrett (University of Southern Queensland), C. D. Tremblay (SETI Institute, CSIRO Astronomy and Space Science), B. Addison (University of Southern Queensland Centre for Astrophysics, Swinburne University of Technology Centre for Astrophysics and Supercomputing), D. C. Price (International Centre for Radio Astronomy Research, SKA Observatory …
A Novel, Human-in-the-Loop Computational Grounded Theory Framework for Big Social Data
Lama Alqazlan, Zheng Fang, Michael Castelle, Rob Procter
https://arxiv.org/abs/2506.06083
SDSS-V Milky Way Mapper (MWM): ASPCAP Stellar Parameters and Abundances in SDSS-V Data Release 19
Szabolcs Mészáros, Paula Jofré, Jennifer A. Johnson, Jonathan C. Bird, Andrew R. Casey, Katia Cunha, Nathan De Lee, Peter Frinchaboy, Guillaume Guiglion, Viola Hegedűs, Alex P. Ji, Juna A. Kollmeier, Melissa K. Ness, Jonah Otto, Marc H. Pinsonneault, Alexandre Roman-Lopes, Amaya Sinha, Ying-Yi Song, Guy S. Stringfellow, Keivan G. Stassun, Jamie Tayar, Andrew Tkachenko…
What Does Information Science Offer for Data Science Research?: A Review of Data and Information Ethics Literature
Brady D. Lund, Ting Wang
https://arxiv.org/abs/2506.03165
📣 Calling all GÉANT Project partners…
Got an idea for digital research, data transfer or secure storage solutions to support open science?
The 2025 GÉANT Above-the-Net Services Incubator is officially open for proposals!
This is your opportunity to:
✅ Develop and test your innovative idea
✅ Deliver impact to the whole community through new, shared, open-source services
✅ Help shape the future of GÉANT's Above-the-Net services portfolio
Learn more: …
cora: CORA citations (1998)
Citations among papers indexed in 1998 by CORA, an early computer science research paper search engine. If a paper i cites a paper j also in this data set, then a directed edge connects i to j. (Papers not in the data set are excluded.) Self-loops may be present. The dates of these snapshots are uncertain.
This network has 23166 nodes and 91500 edges.
Tags: Informational, Citation, Unweighted
KramaBench: A Benchmark for AI Systems on Data-to-Insight Pipelines over Data Lakes
Eugenie Lai, Gerardo Vitagliano, Ziyu Zhang, Sivaprasad Sudhir, Om Chabra, Anna Zeng, Anton A. Zabreyko, Chenning Li, Ferdi Kossmann, Jialin Ding, Jun Chen, Markos Markakis, Matthew Russo, Weiyang Wang, Ziniu Wu, Michael J. Cafarella, Lei Cao, Samuel Madden, Tim Kraska
Teams of #PurdueFortWayne 🐘 students and faculty presented their projects on data science and industrial consulting at the Indiana Data Mine Symposium at #Purdue #WestLafayette
Next investigation should be into the councillors themselves
Warrnambool council abandons peer-reviewed flood study, citing 'supposed science' - ABC News
https://www.abc.net.au/news/2025-06-05/regiona…
Explainer-guided Targeted Adversarial Attacks against Binary Code Similarity Detection Models
Mingjie Chen (Zhejiang University), Tiancheng Zhu (Huazhong University of Science and Technology), Mingxue Zhang (The State Key Laboratory of Blockchain and Data Security, Zhejiang University, Hangzhou High-Tech Zone), Yiling He (University College London), Minghao Lin (University of Southern California), Penghui Li (Columbia University), Kui Ren (The State Key Laboratory of Blockchain and Data Security, Z…
Insights from a 30-Year international Partnership on Astronomical Archives
David R. Rodriguez, Maria Arevalo, Patrick Dowler, Javier Espinosa, Brian McLean, Chris Willott
https://arxiv.org/abs/2506.11888
Trustworthy Provenance for Big Data Science: a Modular Architecture Leveraging Blockchain in Federated Settings
Nicola Giuseppe Marchioro, Yannis Velegrakis, Valentine Anantharaj, Ian Foster, Sandro Luigi Fiore
https://arxiv.org/abs/2505.24675
A Tale of Two Systems: Characterizing Architectural Complexity on Machine Learning-Enabled Systems
Renato Cordeiro Ferreira (University of São Paulo, Jheronimus Academy of Data Science, Technical University of Eindhoven, Tilburg University)
https://arxiv.org/abs/2506.11295
The future of gravitational wave science unlocking LIGO potential: AI-driven data analysis and exploration
Yong Xiao, Li, Zin Nandar Win, He Wang, Hla Myo Tun, Win Thu Zar
https://arxiv.org/abs/2506.04584
lol, basically every single example in this post shows how the LLM is just generating context that's not in the actual image. But somehow this is sold as being better than "classical" computer vision.
I don't know folks, if I actually wanted to do "data science", with focus on the "science" bit, I'd be disturbed by that. 🤷‍♂️
https://fosstodon.org/@Posit/114597245963405210
KI4Demokratie: An AI-Based Platform for Monitoring and Fostering Democratic Discourse
Rudy Alexandro Garrido Veliz, Till Nikolaus Schaland, Simon Bergmoser, Florian Horwege, Somya Bansal, Ritesh Nahar, Martin Semmann, Jörg Forthmann, Seid Muhie Yimam
https://arxiv.org/abs/2506.09947…
sp_infectious: Art exhibit dynamic contacts (2011)
This dataset contains the daily dynamic contact networks collected during the Infectious SocioPatterns event that took place at the Science Gallery in Dublin, Ireland, during the artscience exhibition INFECTIOUS: STAY AWAY. Each file in the downloadable package contains a tab-separated list representing the active contacts during 20-second intervals of one day of data collection. Each line has the form “t i j”, where i and j are the a…
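A minimal sketch of reading the tab-separated “t i j” lines described above into per-interval contact sets. The post is cut off before it explains i and j, so this assumes they are integer IDs of the two participants in contact and that contacts are undirected; the sample lines and the name `parse_contacts` are hypothetical.

```python
from collections import defaultdict

def parse_contacts(lines):
    """Group tab-separated "t i j" records by timestamp t.

    Returns a dict mapping t to a set of contacts, where each contact
    is a frozenset of the two IDs (assumed undirected, so the pair
    (i, j) and the pair (j, i) count as the same contact).
    """
    contacts = defaultdict(set)
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        t, i, j = line.split("\t")
        contacts[int(t)].add(frozenset((int(i), int(j))))
    return contacts

# Hypothetical sample, not taken from the actual data files.
sample = ["0\t1\t2", "0\t2\t3", "20\t1\t2", ""]
contacts = parse_contacts(sample)
```

Each key of `contacts` then corresponds to one 20-second interval, and the size of its set is the number of contacts active in that interval.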
Series B, Episode 04 - Horizon
CALLY: [Enters] Orac, where did you get this information? [Holds up a data card. Avon takes it]
ORAC: I was instructed to obtain anything relating to the planet. The data was obtained by cross-referencing prisoner and execution lists. It is standard procedure.
https://blake.torpidity.n…
Observable Covariance and Principal Observable Analysis for Data on Metric Spaces
Ece Karacam, Washington Mio, Osman Berat Okutan
https://arxiv.org/abs/2506.04003
Roadmap for electronic structure, anharmonicity, and electron-phonon calculations in locally disordered inorganic and hybrid halide perovskites
Marios Zacharias, George Volonakis, Laurent Pedesseau, Claudine Katan, Feliciano Giustino, Jacky Even
https://arxiv.org/abs/2506.10402
D-Rex: Heterogeneity-Aware Reliability Framework and Adaptive Algorithms for Distributed Storage
Maxime Gonthier (University of Chicago, Argonne National Laboratory), Dante D. Sanchez-Gallegos (Universidad Carlos III de Madrid), Haochen Pan (University of Chicago), Bogdan Nicolae (Argonne National Laboratory), Sicheng Zhou (Southern University of Science and Technology), Hai Duc Nguyen (University of Chicago, Argonne National Laboratory), Valerie Hayot-Sasson (University of Chicago, Ar…
Savage-Dickey density ratio estimation with normalizing flows for Bayesian model comparison
Kiyam Lin, Alicja Polanska, Davide Piras, Alessio Spurio Mancini, Jason D. McEwen
https://arxiv.org/abs/2506.04339
Medium writer Paolo Perrone curates a short list of interesting algorithms and the rationale behind them, with graphs and diagrams to boot.
Algorithms that made this short list:
Wave Function Collapse
The Diffusion Model
Simulated Annealing
Sleep Sort
Bogosort
Boids
Shor's algorithm
Marching Cubes
Practical Byzantine Fault Tolerance
Boyer-Moore
"The 10 Weirdest, Most Brilliant Algorithms Ever Devised and What They Actually Do…
faculty_hiring: Faculty hiring networks (Comp. Sci., Business, History)
Three networks of faculty hiring in Computer Science Departments, Business Schools, and History Departments. Each node is a PhD-granting institution in the respective field, and a directed edge (i,j) indicates that a person received their PhD from node i and was tenure-track faculty at node j during time of collection (2011-2013). All data collected from faculty public rosters at the sampled institutions.
Thi…
Millimeter-wave observations of Euclid Deep Field South using the South Pole Telescope: A data release of temperature maps and catalogs
M. Archipley, A. Hryciuk, L. E. Bleem, K. Kornoelje, M. Klein, A. J. Anderson, B. Ansarinejad, M. Aravena, L. Balkenhol, P. S. Barry, K. Benabed, A. N. Bender, B. A. Benson, F. Bianchini, S. Bocquet, F. R. Bouchet, E. Camphuis, M. G. Campitiello, J. E. Carlstrom, J. Cathey, C. L. Chang, S. C. Chapman, P. Chaubal, P. M. Chichura, A. Chokshi, T. -L. Chou…
Edge interventions can mitigate demographic and prestige disparities in the Computer Science coauthorship network
Kate Barnes, Mia Ellis-Einhorn, Carolina Chávez-Ruelas, Nayera Hasan, Mohammad Fanous, Blair D. Sullivan, Sorelle Friedler, Aaron Clauset
https://arxiv.org/abs/2506.04435…
A Metrics-Oriented Architectural Model to Characterize Complexity on Machine Learning-Enabled Systems
Renato Cordeiro Ferreira (University of São Paulo, Jheronimus Academy of Data Science, Technical University of Eindhoven, Tilburg University)
https://arxiv.org/abs/2506.08153
Replaced article(s) found for stat.OT. https://arxiv.org/list/stat.OT/new/
[1/1]:
A Mathematical Lens for Teaching Data Science
https://
The SPHEREx Sky Simulator: Science Data Modeling for the First All-Sky Near-Infrared Spectral Survey
Brendan P. Crill, Yoonsoo P. Bach, Sean A. Bryan, Jean Choppin de Janvry, Ari J. Cukierman, C. Darren Dowell, Spencer W. Everett, Candice Fazar, Tatiana Goldina, Zhaoyu Huai, Howard Hui, Woong-Seob Jeong, Jae Hwan Kang, Phillip M. Korngut, Jae Joon Lee, Daniel C. Masters, Chi H. Nguyen, Jeonghyun Pyo, Teresa Symons, Yujin Yang, Michael Zemcov, Rachel Akeson, Matthew L. N. Ashby, James J…
#Blakes7 Series B, Episode 06 - Trial
THANIA: We reserve our opening declaration, sir.
SAMOR: Very well. Enter prosecution data. [A clerk presses some buttons.]
https://blake.torpidity.net/m/206/53
Challenging Spontaneous Quantum Collapse with XENONnT
E. Aprile, J. Aalbers, K. Abe, S. Ahmed Maouloud, L. Althueser, B. Andrieu, E. Angelino, D. Antón Martin, S. R. Armbruster, F. Arneodo, L. Baudis, M. Bazyk, L. Bellagamba, R. Biondi, A. Bismark, K. Boese, A. Brown, G. Bruno, R. Budnik, C. Cai, C. Capelli, J. M. R. Cardoso, A. P. Cimental Chávez, A. P. Colijn, J. Conrad, J. J. Cuenca-García, C. Curceanu, V. D'Andrea, L. C. Daniel Garcia, M. P. Decowski, A. Deist…
Analysis of points outcome in ATP Grand Slam Tennis using big data and machine learning
Martin Illum (Department of Applied Mathematics and Computer Science, Technical University of Denmark, Richard Petersens Plads, Denmark), Hans Christian Bechsøfft Mikkelsen (Department of Applied Mathematics and Computer Science, Technical University of Denmark, Richard Petersens Plads, Denmark), Emil Hovad (Department of Applied Mathematics and Computer Science, Technical University of Denmark, …
Motivando el uso y aprendizaje de Bash a través de concursos de programación
Luis Costero, Jorge Villarrubia, Francisco D. Igual
https://arxiv.org/abs/2506.00105
Is Your Training Pipeline Production-Ready? A Case Study in the Healthcare Domain
Daniel Lawand (University of São Paulo), Lucas Quaresma (University of São Paulo), Roberto Bolgheroni (University of São Paulo), Alfredo Goldman (University of São Paulo), Renato Cordeiro Ferreira (University of São Paulo, Jheronimus Academy of Data Science, Technical University of Eindhoven, Tilburg University)
Replaced article(s) found for physics.soc-ph. https://arxiv.org/list/physics.soc-ph/new/
[1/1]:
Growth of Science and Women: Methodological Challenges of Using Structured Big Data
TFW you've spent ages trying to solve a tricky problem and it. just. does. not. work. You take a break, come back, try something new, nope still nothi... Wait! It's there, there it is! That's the data...
🥳🎊
That is why we love doing #science and in an increasingly packed day, the opportunity to work away uninterrupted* on a focused problem is increasingly rare. *That's* why I love fieldwork...
Details in a #FieldDiary soon.
*Uninterrupted time helped today by failure of all internet phone communication in Qaanaaq..
#Fieldwork #LifeOfAScientist #AcademicChatter
PDRs4All XIV: CH radical and $H_3^+$ molecular ion in the irradiated protoplanetary disk d203-506
I. Schroetter, O. Berné, J. R. Goicoechea, J. H. Black, O. Roncero, F. Alarcon, P. Amiot, O. Asvany, C. Boersma, S. Brünken, J. Cami, L. Coudert, E. Dartois, A. Fuente, B. Gans, A. Gusdorf, U. Jacovella, M. A. Martin Drumel, T. Onaka, E. Peeters, E. Roueff, A. G. G. M. Tielens, M. Zannese
Hello there.
I, or maybe we, intend this to serve both as a diary and a reference.
We are a jack of many trades. As such, it is hard to squeeze everything into 1.5k characters, and this is by no means comprehensive. Still, sometimes labels are helpful.
This account is about IT and the interconnections between technologies (Fullstack, Data Science... sometimes even AI and the ethics of it). Politics, because our life is inevitably tied to it (especially with
#gsa #nyc #columbia #climate
The original Charney Report that says doubling CO2 will yield 1.5-4.5 degrees C temperature change (
#NOAA 's #GFDL and #NASA 's #GISS - incidentally, gfdl was also in the news yesterday
https://www.propublica.org/article/trump-noaa-budget-cuts-climate-change-modeling-princeton-gfdl
"The gutting of NOAA was outlined earlier this month in a leaked memo from the Office of Management and Budget that detailed steep reductions at the Department of Commerce, which houses the science agency. ... NOAA’s overall funding would be slashed by 27%…
most significant target is the office of Oceanic and Atmospheric Research ⎯ a nerve center of global climate science, data collection and modeling, including the Geophysical Fluid Dynamics Laboratory ⎯ which would be cut by 74%. “At this funding level, OAR is eliminated as a line office,” the memo stated."