
2025-09-12 07:42:19
Optimizing the Variant Calling Pipeline Execution on Human Genomes Using GPU-Enabled Machines
Ajay Kumar, Praveen Rao, Peter Sanders
https://arxiv.org/abs/2509.09058 https://
Optimizing the Variant Calling Pipeline Execution on Human Genomes Using GPU-Enabled Machines
Ajay Kumar, Praveen Rao, Peter Sanders
https://arxiv.org/abs/2509.09058 https://
⚗️ Secrets of the dark genome could spark new drug discoveries
#drugs
Pangenome-guided sequence assembly via binary optimisation
Josh Cudby, James Bonfield, Chenxi Zhou, Richard Durbin, Sergii Strelchuk
https://arxiv.org/abs/2508.08200 https://
MicroTrace: A Lightweight R Tool for SNP-Based Pathogen Clustering in Outbreak Detection
Kaitao Lai
https://arxiv.org/abs/2507.08060 https://
Topological Sequence Analysis of Genomes: Delta Complex approaches
Jian Liu, Li Shen, Dong Chen, Guo-Wei Wei
https://arxiv.org/abs/2507.05452 https://
Scientists Decode 1918 Flu Virus Genome From Century-Old Tissue
https://scitechdaily.com/scientists-decode-1918-flu-virus-genome-from-century-old-tissue/
Minimum-Cost Synthetic Genome Planning: An Algorithmic Framework
Michail Patsakis, Ioannis Mouratidis, Ilias Georgakopoulos-Soares
https://arxiv.org/abs/2509.06234 https://
Crosslisted article(s) found for eess.IV. https://arxiv.org/list/eess.IV/new
[1/1]:
- The Protocol Genome A Self Supervised Learning Framework from DICOM Headers
Jimmy Joseph
When is String Reconstruction using de Bruijn Graphs Hard?
Ben Bals, Sebastiaan van Krieken, Solon P. Pissis, Leen Stougie, Hilde Verbeek
https://arxiv.org/abs/2508.03433 https:…
Liquid-liquid phase separation enables highly selective viral genome packaging
Layne B. Frechette, Michael F. Hagan
https://arxiv.org/abs/2508.17211 https://
FDR controlling procedures with dimension reduction and their application to GWAS with linkage disequilibrium score
Dayeon Jung, Yewon Kim, Junyong Park
https://arxiv.org/abs/2507.06049
Modelling transcriptional silencing and its coupling to 3D genome organisation
Massimiliano Semeraro, Giuseppe Negro, Davide Marenduzzo, Giada Forte
https://arxiv.org/abs/2507.02150
Scalable Quantum State Preparation for Encoding Genomic Data with Matrix Product States
Floyd M. Creevey, Hitham T. Hassan, James McCafferty, Lloyd C. L. Hollenberg, Sergii Strelchuk
https://arxiv.org/abs/2508.06184
A near-exact linear mixed model for genome-wide association studies
Zhibin Pu, Shufei Ge, Shijia Wang
https://arxiv.org/abs/2508.05278 https://arxiv.org/pd…
Um link para deixar cheganos a espumar da boca.
tl;dr - nunca existiu nenhuma espécie de "sangue português". Sempre foi uma gigantesca misturada, com imigrantes vindos de tudo quanto é sítio a contribuir para a caldeirada.
https://link.springer.com/article/10.118…
AuraGenome: An LLM-Powered Framework for On-the-Fly Reusable and Scalable Circular Genome Visualizations
Chi Zhang, Yu Dong, Yang Wang, Yuetong Han, Guihua Shan, Bixia Tang
https://arxiv.org/abs/2507.02877
Replaced article(s) found for cs.AR. https://arxiv.org/list/cs.AR/new
[1/1]:
- MARS: Processing-In-Memory Acceleration of Raw Signal Genome Analysis Inside the Storage Subsystem
Soysal, Koliogeorgi, Firtina, Ghiasi, Nadig, Mao, Oliveira, Liang, Zambaku, Sadrosadati, Mutlu
…
Predicting Antimicrobial Resistance (AMR) in Campylobacter, a Foodborne Pathogen, and Cost Burden Analysis Using Machine Learning
Shubham Mishra, The Anh Han, Bruno Silvester Lopes, Shatha Ghareeb, Zia Ush Shamszaman
https://arxiv.org/abs/2509.03551
A new paper projecting Joshua tree habitat under future climate based on incredibly high-resolution distribution data, from Joshua Tree Genome Project collaborators at USGS. They estimate up to 80% loss of suitable habitat by 2100 under the worst-case climate scenario.
#JoshuaTree #science
Suppression of errors in collectively coded information
Martin J. Falk, Leon Zhou, Yoshiya J. Matsubara, Kabir Husain, Jack W. Szostak, Arvind Murugan
https://arxiv.org/abs/2508.21806
A Machine Learning Framework for Breast Cancer Treatment Classification Using a Novel Dataset
Md Nahid Hasan, Md Monzur Murshed, Md Mahadi Hasan, Faysal A. Chowdhury
https://arxiv.org/abs/2507.06243
A Biased Random Key Genetic Algorithm for Solving the Longest Run Subsequence Problem
Christian Blum, Pedro Pinacho-Davidson
https://arxiv.org/abs/2508.14020 https://
BMFM-DNA: A SNP-aware DNA foundation model to capture variant effects
Hongyang Li, Sanjoy Dey, Bum Chul Kwon, Michael Danziger, Michal Rosen-Tzvi, Jianying Hu, James Kozloski, Ching-Huei Tsou, Bharath Dandala, Pablo Meyer
https://arxiv.org/abs/2507.05265
Friend or Foe
Oleksandr Cherendichenko, Josephine Solowiej-Wedderburn, Laura M. Carroll, Eric Libby
https://arxiv.org/abs/2509.00123 https://arxiv.org/pdf/…
AI, AGI, and learning efficiency
My 4-month-old kid is not DDoSing Wikipedia right now, nor will they ever do so before learning to speak, read, or write. Their entire "training corpus" will not top even 100 million "tokens" before they can speak & understand language, and do so with real intentionally.
Just to emphasize that point: 100 words-per-minute times 60 minutes-per-hour times 12 hours-per-day times 365 days-per-year times 4 years is a mere 105,120,000 words. That's a ludicrously *high* estimate of words-per-minute and hours-per-day, and 4 years old (the age of my other kid) is well after basic speech capabilities are developed in many children, etc. More likely the available "training data" is at least 1 or 2 orders of magnitude less than this.
The point here is that large language models, trained as they are on multiple *billions* of tokens, are not developing their behavioral capabilities in a way that's remotely similar to humans, even if you believe those capabilities are similar (they are by certain very biased ways of measurement; they very much aren't by others). This idea that humans must be naturally good at acquiring language is an old one (see e.g. #AI #LLM #AGI
Testing Homogeneity in a heteroscedastic contaminated normal mixture
Xiaoqing Niu, Pengfei Li, Yuejiao Fu
https://arxiv.org/abs/2507.15630 https://
Artificial Intelligence for CRISPR Guide RNA Design: Explainable Models and Off-Target Safety
Alireza Abbaszadeh, Armita Shahlai
https://arxiv.org/abs/2508.20130 https://…
Crosslisted article(s) found for physics.bio-ph. https://arxiv.org/list/physics.bio-ph/new
[1/1]:
- Liquid-liquid phase separation enables highly selective viral genome packaging
Layne B. Frechette, Michael F. Hagan
Investigating DNA words and their distributions across the tree of life
Charalampos Koilakos, Kimonas Provatas, Michail Patsakis, Aris Karatzikos, Ilias Georgakopoulos-Soares
https://arxiv.org/abs/2509.05539
FAIR sharing of Chromatin Tracing datasets using the newly developed 4DN FISH Omics Format
Rahi Navelkar, Andrea Cosolo, Bogdan Bintu, Yubao Cheng, Vincent Gardeux, Silvia Gutnik, Taihei Fujimori, Antonina Hafner, Atishay Jay, Bojing Blair Jia, Adam Paul Jussila, Gerard Llimos, Antonios Lioutas, Nuno MC Martins, William J Moore, Yodai Takei, Frances Wong, Kaifu Yang, Huaiying Zhang, Quan Zhu, Magda Bienko, Lacramioara Bintu, Long Cai, Bart Deplancke, Marcelo Nollmann, Susan E Mango, Bi…
A Novel cVAE-Augmented Deep Learning Framework for Pan-Cancer RNA-Seq Classification
Vinil Polepalli
https://arxiv.org/abs/2508.02743 https://arxiv.org/pdf…
Consistency and Central Limit Results for the Maximum Likelihood Estimator in the Admixture Model
Carola Sophia Heinzel
https://arxiv.org/abs/2507.19564 https://
Improving Genomic Models via Task-Specific Self-Pretraining
Sohan Mupparapu, Parameswari Krishnamurthy, Ratish Puduppully
https://arxiv.org/abs/2506.17766 …
Replaced article(s) found for q-bio.GN. https://arxiv.org/list/q-bio.GN/new
[1/1]:
- MARS: Processing-In-Memory Acceleration of Raw Signal Genome Analysis Inside the Storage Subsystem
Soysal, Koliogeorgi, Firtina, Ghiasi, Nadig, Mao, Oliveira, Liang, Zambaku, Sadrosadati, Mutl…
GlobDB: A comprehensive species-dereplicated microbial genome resource
Daan R. Speth (Centre for Microbiology and Environmental Systems Science, University of Vienna, Vienna, Austria), Nick Pullen (Centre for Microbiology and Environmental Systems Science, University of Vienna, Vienna, Austria), Samuel T. N. Aroney (Centre for Microbiome Research School of Biomedical Sciences, Queensland University of Technology, Translational Research Institute, Woolloongabba, Australia), Benjamin L. …
Functional Analysis of Variance for Association Studies
Olga A. Vsevolozhskaya, Dmitri V. Zaykin, Mark C. Greenwood, Changshuai Wei, Qing Lu
https://arxiv.org/abs/2508.11069 htt…
AGP: A Novel Arabidopsis thaliana Genomics-Phenomics Dataset and its HyperGraph Baseline Benchmarking
Manuel Serna-Aguilera, Fiona L. Goggin, Aranyak Goswami, Alexander Bucksch, Suxing Liu, Khoa Luu
https://arxiv.org/abs/2508.14934
Multimodal Modeling of CRISPR-Cas12 Activity Using Foundation Models and Chromatin Accessibility Data
Azim Dehghani Amirabad, Yanfei Zhang, Artem Moskalev, Sowmya Rajesh, Tommaso Mansi, Shuwei Li, Mangal Prakash, Rui Liao
https://arxiv.org/abs/2506.11182