Computational tools unlock the secrets of the genome

· News-Medical

PRiMeR (Sens et al. 2024) is a method that leverages genetic information to learn disease risk predictors across cohorts, circumventing the need for traditional longitudinal studies. With training on risk factors and genetic data from a healthy cohort, along with results from genome-wide association studies (GWAS), PRiMeR can assess risk for new patients. This method was validated on simulations of type 2 diabetes and Alzheimer's and Parkinson's disease onset. This method could facilitate more timely and targeted preventive strategies.

Circular extrachromosomal DNA (ecDNA) is a form of oncogene amplification found across cancer types and is associated with poor outcome in patients. EcDNAs drive tumor formation, evolution, and drug resistance by modulating oncogene copy-number and rewiring gene-regulatory networks. Two methods CoRAL (Zue et al. 2024) and Decoil (Giurgiu et al. 2024) resolve ecDNA structure using long-read sequencing data, profiling the landscape and evolution of focal amplifications in tumors.

Another method, DIISCO (Park et al. 2024), characterizes the temporal dynamics of cell–cell interactions in complex biological systems using single-cell RNA sequencing data, elucidating mechanisms underlying normal biological processes and disease progression. This method was demonstrated on simulated and experimental lymphoma–immune interaction data and revealed immune interactions of a cytotoxic T cell subtype that expands with therapy. This method can guide the design of improved treatments to promote cell states and crosstalk crucial for therapeutic response.

Finally, repetitive DNA poses significant challenges for accurate and efficient genome assembly and sequence alignment. This is particularly true for metagenomic data, where genome dynamics such as horizontal gene transfer, gene duplication, and gene loss/gain complicate accurate genome assembly from microbial communities. Detecting repeats is a crucial first step in overcoming these challenges. Azizpour et al. (2024) presents GraSSRep, a novel approach that detects and classifies DNA sequences into repetitive and non-repetitive categories in metagenomics data.

Source:

Cold Spring Harbor Laboratory