2024-02-13 11:00:01
{dtrack} makes documentation of data wrangling part of the analysis and creates pretty flow charts: #rstats
{dtrack} makes documentation of data wrangling part of the analysis and creates pretty flow charts: #rstats
in Sachen #OpenAccess-#Verlagsverträge/-#Rahmenverträge:
"Are Transformative Agreements Worth It? An Analysis of Open Access Publication Data at the Universi…
Data analysis from the hydroacoustic stations of the Comprehensive Nuclear-Test-Ban Treaty Organization
has unveiled distinctive pressure signals linked to aircraft crashes of varying sizes in the ocean.
Notably, these signals were detected at distances ranging from two to five thousand kilometres,
highlighting the efficacy of underwater acoustic technology in event identification and classification in marine environments.
In this study, we investigate the plausibil…
This https://arxiv.org/abs/2401.05507 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCL_…
A Methodology for Questionnaire Analysis: Insights through Cluster Analysis of an Investor Competition Data
Carlos Henrique Q. Forster, Paulo Andr\'e Lima de Castro, Andrei Ramalho
https://arxiv.org/abs/2402.06759
Distributed Record Linkage in Healthcare Data with Apache Spark
Mohammad Heydari, Reza Sarshar, Mohammad Ali Soltanshahi
https://arxiv.org/abs/2404.07939 h…
Toward an Android Static Analysis Approach for Data Protection
Mugdha Khedkar, Eric Bodden
https://arxiv.org/abs/2402.07889 https://a…
Novel definition and quantitative analysis of branch structure with topological data analysis
Haruhisa Oda, Mayuko Kida, Yoichi Nakata, Hiroki Kurihara
https://arxiv.org/abs/2402.07436
in Sachen #OpenAccess-#Verlagsverträge/-#Rahmenverträge:
"Are Transformative Agreements Worth It? An Analysis of Open Access Publication Data at the Universi…
Pulling back symmetric Riemannian geometry for data analysis
Willem Diepeveen
https://arxiv.org/abs/2403.06612 https://arxiv.org/pdf/…
This https://arxiv.org/abs/2403.04403 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csPL_…
Semantic Data for Humanities and Social Sciences (SDHSS): an Ecosystem of CIDOC CRM Extensions for Research Data Production and Reuse
Francesco BerettaLARHRA, LARHRA PHN
https://arxiv.org/abs/2402.07531
This https://arxiv.org/abs/2404.05696 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csDB_…
Logistic Multidimensional Data Analysis for Ordinal Response Variables using a Cumulative Link function
Mark de Rooij, Ligaya Breemer, Dion Woestenburg, Frank Busing
https://arxiv.org/abs/2402.07629
This https://arxiv.org/abs/2402.01276 has been replaced.
link: https://scholar.google.com/scholar?q=a
This https://arxiv.org/abs/2402.19306 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_hepe…
Grr I let my data analysis run for 2h, just for it to blow up at the *last* crucial saving step with a stupid ImportError because I forgot to 'import pickle' 🤦 I should run #mypy more often... 😅
#Python #dataAnalysis
Comparative Analysis of ImageNet Pre-Trained Deep Learning Models and DINOv2 in Medical Imaging Classification
Yuning Huang, Jingchen Zou, Lanxi Meng, Xin Yue, Qing Zhao, Jianqiang Li, Changwei Song, Gabriel Jimenez, Shaowu Li, Guanghui Fu
https://arxiv.org/abs/2402.07595
Using Mathlink Cubes to Introduce Data Wrangling with Examples in R
Lucy D'Agostino McGowan
https://arxiv.org/abs/2402.07029 https://
Signed graphs in data sciences via communicability geometry
Fernando Diaz-Diaz, Ernesto Estrada
https://arxiv.org/abs/2403.07493 https://
An analysis of parameter compression and full-modeling techniques with Velocileptors for DESI 2024 and beyond
M. Maus, S. Chen, M. White, J. Aguilar, S. Ahlen, A. Aviles, S. Brieden, D. Brooks, T. Claybaugh, S. Cole, A. de la Macorra, Arjun Dey, P. Doel, S. Ferraro, N. Findlay, J. E. Forero-Romero, E. Gazta\~naga, H. Gil-Mar\'in, S. Gontcho A Gontcho, C. Hahn, K. Honscheid, C. Howlett, M. Ishak, S. Juneau, A. Kremin, Y. Lai, M. Landriau, M. E. Levi, M. Manera, R. Miquel, E. Mueller…
Time Series Analysis of Key Societal Events as Reflected in Complex Social Media Data Streams
Andy Skumanich, Han Kyul Kim
https://arxiv.org/abs/2403.07090
This https://arxiv.org/abs/2304.13406 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_sta…
No Language is an Island: Unifying Chinese and English in Financial Large Language Models, Instruction Data, and Benchmarks
Gang Hu, Ke Qin, Chenhan Yuan, Min Peng, Alejandro Lopez-Lira, Benyou Wang, Sophia Ananiadou, Wanlong Yu, Jimin Huang, Qianqian Xie
https://arxiv.org/abs/2403.06249…
This https://arxiv.org/abs/2111.00187 has been replaced.
link: https://scholar.google.com/scholar?q=a
This https://arxiv.org/abs/2306.03889 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_hepp…
Virtual reassembling of 3D fragments for the data-driven analysis of fracture mechanisms in composite materials
Thomas WilhelmInstitute of Stochastics, Ulm University, 89069 Ulm, Germany, Trang Thu V\~oInstitute of Particle Technology and Mineral Processing, TU Bergakademie Freiberg, 09599 Freiberg, Germany, Orkun FuratInstitute of Stochastics, Ulm University, 89069 Ulm, Germany, Urs A. PeukerInstitute of Particle Technology and Mineral Processing, TU Bergakademie Freiberg, 09599 Freib…
Data-driven sparse modeling of oscillations in plasma space propulsion
B. Bay\'on-Buj\'an, M. Merino
https://arxiv.org/abs/2403.06809 https://
Data driven approach to study the transition from dispersive to dissipative systems through dimensionality reduction techniques
Mairembam Kelvin Singh, A. Surjalal Sharma, N. Nimai Singh, Moirangthem Shubhakanta Singh
https://arxiv.org/abs/2403.06987
This https://arxiv.org/abs/2301.06136 has been replaced.
link: https://scholar.google.com/scholar?q=a
A Statistical and Multiwavelength Photometric Analysis of a Young Embedded Open Star Cluster: IC 1590
A. H. Sheikh, Biman J. Medhi
https://arxiv.org/abs/2402.07750
This https://arxiv.org/abs/2105.15106 has been replaced.
link: https://scholar.google.com/scholar?q=a
Used ChatGPT to get some nifty @obsidianmd dataview queries, its ideal for regex and formatting stuff, the data is all there, but I normally don’t have the patience to write and finesse the queries, since I don’t do data analysis often enough to remember all the details
This https://arxiv.org/abs/2404.03936 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCV_…
This https://arxiv.org/abs/2210.03859 has been replaced.
link: https://scholar.google.com/scholar?q=a
Low Cost Carriers induce specific and identifiable delay propagation patterns: an analysis of the EU and US systems
Sofia Gil-Rodrigo, Massimiliano Zanin
https://arxiv.org/abs/2402.07656
This https://arxiv.org/abs/2306.12690 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
Study of the Impact of the Big Data Era on Accounting and Auditing
Yuxiang Sun, Jingyi Li, Mengdie Lu, Zongying Guo
https://arxiv.org/abs/2403.07180 https:…
Accelerating Sparse Tensor Decomposition Using Adaptive Linearized Representation
Jan Laukemann, Ahmed E. Helal, S. Isaac Geronimo Anderson, Fabio Checconi, Yongseok Soh, Jesmin Jahan Tithi, Teresa Ranadive, Brian J Gravelle, Fabrizio Petrini, Jee Choi
https://arxiv.org/abs/2403.06348
Visualization for physics analysis improvement and applications in BESIII
Zhi-Jun Li, Ming-Kuan Yuan, Yun-Xuan Song, Yan-Gu Li, Jing-Shu Li, Sheng-Sen Sun, Xiao-Long Wang, Zheng-Yun You, Ya-Jun Mao
https://arxiv.org/abs/2404.07951
Performance Analysis of Matrix Multiplication for Deep Learning on the Edge
Cristian Ram\'irez, Adri\'an Castell\'o, H\'ector Mart\'inez, Enrique S. Quintana-Ort\'i
https://arxiv.org/abs/2403.07731
scRDiT: Generating single-cell RNA-seq data by diffusion transformers and accelerating sampling
Shengze Dong, Zhuorui Cui, Ding Liu, Jinzhi Lei
https://arxiv.org/abs/2404.06153
Topological Data Analysis of Monopoles in $U(1)$ Lattice Gauge Theory
Xavier Crean, Jeffrey Giansiracusa, Biagio Lucini
https://arxiv.org/abs/2403.07739 ht…
This https://arxiv.org/abs/2112.06432 has been replaced.
link: https://scholar.google.com/scholar?q=a
Augmenting Interpolation-Based Model Checking with Auxiliary Invariants (Extended Version)
Dirk Beyer, Po-Chun Chien, Nian-Ze Lee
https://arxiv.org/abs/2403.07821
Implementation of implicit filter for spatial spectra extraction
Kacper Nowak, Sergey Danilov, Vasco M\"uller, Caili Liu
https://arxiv.org/abs/2404.07398
Highly Accurate Disease Diagnosis and Highly Reproducible Biomarker Identification with PathFormer
Zehao Dong, Qihang Zhao, Philip R. O. Payne, Michael A Province, Carlos Cruchaga, Muhan Zhang, Tianyu Zhao, Yixin Chen, Fuhai Li
https://arxiv.org/abs/2402.07268
Analysis: Asia Pacific data center deals, which have totaled $840.47M in 2024, or 50% of the global total so far, are set to surpass 2023's $3.45B record high (Reuters)
https://www.reuters.com/markets/deals/ai-boom-set-fuel-…
De Casteljau's Algorithm in Geometric Data Analysis: Theory and Application
Martin Hanik, Esfandiar Nava-Yazdani, Christoph von Tycowicz
https://arxiv.org/abs/2402.07550
This https://arxiv.org/abs/2310.19181 has been replaced.
link: https://scholar.google.com/scholar?q=a
This is the best source of stock splits, delistings, and other corporate actions that I have ever seen: https://stockanalysis.com/actions/
Makes me wonder where do they get the data for it. I have tried perusing the web a bit and just can't find one conclusive answer.
This https://arxiv.org/abs/2403.16110 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csDB_…
Validating the Galaxy and Quasar Catalog-Level Blinding Scheme for the DESI 2024 analysis
U. Andrade, J. Mena-Fern\'andez, H. Awan, A. J. Ross, S. Brieden, J. Pan, A. de Mattia, J. Aguilar, S. Ahlen, O. Alves, D. Brooks, E. Buckley-Geer, E. Chaussidon, T. Claybaugh, S. Cole, A. de la Macorra, Arjun Dey, P. Doel, K. Fanning, J. E. Forero-Romero, E. Gazta\~naga, H. Gil-Mar\'in, S. Gontcho A Gontcho, J. Guy, C. Hahn, M. M. S Hanif, K. Honscheid, C. Howlett, D. Huterer, S. Juneau, …
Investigating the Soft X-ray Spectra of Solar Flare Onsets
Anant Telikicherla, Thomas N. Woods, Bennet D. Schwab
https://arxiv.org/abs/2403.05992 https://<…
Rethinking ASTE: A Minimalist Tagging Scheme Alongside Contrastive Learning
Qiao Sun, Liujia Yang, Minghao Ma, Nanyang Ye, Qinying Gu
https://arxiv.org/abs/2403.07342
FNSPID: A Comprehensive Financial News Dataset in Time Series
Zihan Dong, Xinyu Fan, Zhiyuan Peng
https://arxiv.org/abs/2402.06698 https://
No Language is an Island: Unifying Chinese and English in Financial Large Language Models, Instruction Data, and Benchmarks
Gang Hu, Ke Qin, Chenhan Yuan, Min Peng, Alejandro Lopez-Lira, Benyou Wang, Sophia Ananiadou, Wanlong Yu, Jimin Huang, Qianqian Xie
https://arxiv.org/abs/2403.06249…
This https://arxiv.org/abs/2404.06882 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_…
PEPSI: Pathology-Enhanced Pulse-Sequence-Invariant Representations for Brain MRI
Peirong Liu, Oula Puonti, Annabel Sorby-Adams, William T. Kimberly, Juan E. Iglesias
https://arxiv.org/abs/2403.06227
New slow-roll approximations for inflation in Einstein-Gauss-Bonnet gravity
E. O. Pozdeeva, M. A. Skugoreva, A. V. Toporensky, S. Yu. Vernov
https://arxiv.org/abs/2403.06147
Sensitivity analysis for publication bias in meta-analysis of sparse data based on exact likelihood
Taojun Hu, Yi Zhou, Sattoshi Hattori
https://arxiv.org/abs/2404.06837
Correlation and Autocorrelation of Data on Complex Networks
Rudy Arthur
https://arxiv.org/abs/2405.05125 https://arxiv.org/pdf/2405.0…
This https://arxiv.org/abs/2307.11153 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_hepp…
A template for data analysis projects structured as R packages (or not) #rstats #datascience
Study of the Impact of the Big Data Era on Accounting and Auditing
Yuxiang Sun, Jingyi Li, Mengdie Lu, Zongying Guo
https://arxiv.org/abs/2403.07180 https:…
A Comparison of Different Representations of Ordinal Patterns and Their Usability in Data Analysis
Alexander Schnurr, Angelika Silbernagel
https://arxiv.org/abs/2402.07478
IsoPredict: Dynamic Predictive Analysis for Detecting Unserializable Behaviors in Weakly Isolated Data Store Applications
Chujun Geng, Spyros Blanas, Michael D. Bond, Yang Wang
https://arxiv.org/abs/2404.04621
Performance Analysis of Matrix Multiplication for Deep Learning on the Edge
Cristian Ram\'irez, Adri\'an Castell\'o, H\'ector Mart\'inez, Enrique S. Quintana-Ort\'i
https://arxiv.org/abs/2403.07731
Rethinking ASTE: A Minimalist Tagging Scheme Alongside Contrastive Learning
Qiao Sun, Liujia Yang, Minghao Ma, Nanyang Ye, Qinying Gu
https://arxiv.org/abs/2403.07342
This https://arxiv.org/abs/2310.08560 has been replaced.
link: https://scholar.google.com/scholar?q=a
PEPSI: Pathology-Enhanced Pulse-Sequence-Invariant Representations for Brain MRI
Peirong Liu, Oula Puonti, Annabel Sorby-Adams, William T. Kimberly, Juan E. Iglesias
https://arxiv.org/abs/2403.06227
Evaluation and thermodynamic optimization of phase diagram of lithium niobate tantalate solid solutions
Umar Bashir, Detlef Klimm, Michael Rusing, Matthias Bickermann, Steffen Ganschow
https://arxiv.org/abs/2403.07527
Towards Data-center Level Carbon Modeling and Optimization for Deep Learning Inference
Shixin Ji, Zhuoping Yang, Xingzhen Chen, Jingtong Hu, Yiyu Shi, Alex K. Jones, Peipei Zhou
https://arxiv.org/abs/2403.04976
Investigating the Soft X-ray Spectra of Solar Flare Onsets
Anant Telikicherla, Thomas N. Woods, Bennet D. Schwab
https://arxiv.org/abs/2403.05992 https://<…
Integrating LSTM and BERT for Long-Sequence Data Analysis in Intelligent Tutoring Systems
Zhaoxing Li, Jujie Yang, Jindi Wang, Lei Shi, Sebastian Stein
https://arxiv.org/abs/2405.05136
This https://arxiv.org/abs/2305.19618 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_ees…
Evaluating Navigation and Comparison Performance of Computational Notebooks on Desktop and in Virtual Reality
Sungwon In, Erick Krokos, Kirsten Whitley, Chris North, Yalong Yang
https://arxiv.org/abs/2404.07161
Loglinear modeling with mixed numerical and categorical predictor variables through an Extended Stereotype Model
Mark de Rooij
https://arxiv.org/abs/2402.07634
Hydrogen Column Density Variability in a Sample of Local Compton-Thin AGN II
A. Pizzetti, N. Torres-Alba, S. Marchesi, J. Buchner, I. Cox, X. Zhao, S. Neal, D. Sengupta, R. Silver, M. Ajello
https://arxiv.org/abs/2403.06919
This https://arxiv.org/abs/2303.12018 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_hepp…
Evaluation and thermodynamic optimization of phase diagram of lithium niobate tantalate solid solutions
Umar Bashir, Detlef Klimm, Michael Rusing, Matthias Bickermann, Steffen Ganschow
https://arxiv.org/abs/2403.07527
SIDE-real: Truncated marginal neural ratio estimation for Supernova Ia Dust Extinction with real data
Konstantin Karchev, Matthew Grayling, Benjamin M. Boyd, Roberto Trotta, Kaisey S. Mandel, Christoph Weniger
https://arxiv.org/abs/2403.07871
A Repository for Formal Contexts
Tom Hanika, Robert J\"aschke
https://arxiv.org/abs/2404.04344 https://arxiv.org/pdf/2404.04344<…
Investigating Interaction Modes and User Agency in Human-LLM Collaboration for Domain-Specific Data Analysis
Jiajing Guo, Vikram Mohanty, Jorge Piazentin Ono, Hongtao Hao, Liang Gou, Liu Ren
https://arxiv.org/abs/2405.05548
A step towards the integration of machine learning and small area estimation
Tomasz \.Z\k{a}d{\l}o, Adam Chwila
https://arxiv.org/abs/2402.07521 https://…
Analysis of Distributed Optimization Algorithms on a Real Processing-In-Memory System
Steve Rhyner, Haocong Luo, Juan G\'omez-Luna, Mohammad Sadrosadati, Jiawei Jiang, Ataberk Olgun, Harshita Gupta, Ce Zhang, Onur Mutlu
https://arxiv.org/abs/2404.07164
GlossLM: Multilingual Pretraining for Low-Resource Interlinear Glossing
Michael GinnUniversity of Colorado, Lindia TjuatjaCarnegie Mellon University, Taiqi HeCarnegie Mellon University, Enora RiceUniversity of Colorado, Graham NeubigCarnegie Mellon University, Alexis PalmerUniversity of Colorado, Lori LevinCarnegie Mellon University
https://
Towards $21$-cm intensity mapping at $z=2.28$ with uGMRT using the tapered gridded estimator -- IV. Wideband analysis
Khandakar Md Asif ElahiIndian Institute of Technology Kharagpur, Kharagpur, India, Somnath BharadwajIndian Institute of Technology Kharagpur, Kharagpur, India, Srijita PalIndian Institute of Science, Bangalore, India, Abhik GhoshBanwarilal Bhalotia College, Asansol, India, Sk. Saiyad AliJadavpur University, Kolkata, India, Samir ChoudhuriIndian Institute of Technology M…
Investigating Interaction Modes and User Agency in Human-LLM Collaboration for Domain-Specific Data Analysis
Jiajing Guo, Vikram Mohanty, Jorge Piazentin Ono, Hongtao Hao, Liang Gou, Liu Ren
https://arxiv.org/abs/2405.05548
This https://arxiv.org/abs/2311.02206 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csDB_…
This https://arxiv.org/abs/2403.15721 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csDC_…
GuideGen: A Text-guided Framework for Joint CT Volume and Anatomical structure Generation
Linrui Dai, Rongzhao Zhang, Zhongzhen Huang, Xiaofan Zhang
https://arxiv.org/abs/2403.07247
Preliminary Guidelines For Combining Data Integration and Visual Data Analysis
Adam Coscia, Ashley Suh, Remco Chang, Alex Endert
https://arxiv.org/abs/2403.04757
A General Identification Algorithm For Data Fusion Problems Under Systematic Selection
Jaron J. R. Lee, AmirEmad Ghassami, Ilya Shpitser
https://arxiv.org/abs/2404.06602
Seeds of Stereotypes: A Large-Scale Textual Analysis of Race and Gender Associations with Diseases in Online Sources
Lasse Hyldig Hansen, Nikolaj Andersen, Jack Gallifant, Liam G. McCoy, James K Stone, Nura Izath, Marcela Aguirre-Jerez, Danielle S Bitterman, Judy Gichoya, Leo Anthony Celi
https://arxiv.org/abs/2405.05049<…
BOLD v4: A Centralized Bioinformatics Platform for DNA-based Biodiversity Data
Sujeevan Ratnasingham, Catherine Wei, Dean Chan, Jireh Agda, Josh Agda, Liliana Ballesteros-Mejia, Hamza Ait Boutou, Zak Mohammad El Bastami, Eddie Ma, Ramya Manjunath, Dana Rea, Chris Ho, Angela Telfer, Jaclyn McKeowan, Miduna Rahulan, Claudia Steinke, Justin Dorsheimer, Megan Milton, Paul D. N. Hebert
Analysis of Distributed Algorithms for Big-data
Rajendra Purohit, K R Chowdhary, S D Purohit
https://arxiv.org/abs/2404.06461 https://
Seeds of Stereotypes: A Large-Scale Textual Analysis of Race and Gender Associations with Diseases in Online Sources
Lasse Hyldig Hansen, Nikolaj Andersen, Jack Gallifant, Liam G. McCoy, James K Stone, Nura Izath, Marcela Aguirre-Jerez, Danielle S Bitterman, Judy Gichoya, Leo Anthony Celi
https://arxiv.org/abs/2405.05049<…
BOLD v4: A Centralized Bioinformatics Platform for DNA-based Biodiversity Data
Sujeevan Ratnasingham, Catherine Wei, Dean Chan, Jireh Agda, Josh Agda, Liliana Ballesteros-Mejia, Hamza Ait Boutou, Zak Mohammad El Bastami, Eddie Ma, Ramya Manjunath, Dana Rea, Chris Ho, Angela Telfer, Jaclyn McKeowan, Miduna Rahulan, Claudia Steinke, Justin Dorsheimer, Megan Milton, Paul D. N. Hebert