Genome Biology is calling for submissions to our Collection on the applications of large language models (LLMs) in genome analysis.
The integration of machine learning, particularly LLMs built using transformers, into biological research has opened up new avenues for understanding complex biological data. These advanced computational models have shown promise in processing vast datasets, from genomic sequences to protein structures, enabling researchers to extract meaningful insights and identify patterns that were previously unattainable.
The employment of LLMs in biology has enhanced our capabilities in areas such as genomics, transcriptomics, and proteomics. For instance, LLMs have been successfully applied to predict the functional consequences of genetic variants in both coding and non-coding genomes, facilitating targeted precision medicine and personalized therapies. Furthermore, these models enable researchers to automate the annotation of DNA and protein sequences, accelerating the pace of discovery and innovation in biotechnology and pharmaceutical research. Additionally, LLMs are increasingly being used to explore the regulation of gene expression, including the identification and characterization of regulatory elements such as enhancers, promoters, and transcription factor binding sites.
Future advancements may lead to the creation of more sophisticated models capable of integrating multi-omic data, facilitating the understanding of complex biological systems. Such innovations could potentially enable real-time analysis and prediction of biological responses, transforming our approach to disease modeling, drug discovery, and synthetic biology. The incorporation of regulatory elements into machine learning models will be crucial for uncovering the mechanisms that govern cellular behavior and tissue-specific gene regulation.
Topics accepted for submission include, but are not limited to, the following:
- Applications of LLMs in genomic data analysis
- Transformer models in protein sequence prediction
- Comparative studies of LLMs and traditional methods
- LLMs for functional prediction and annotation of proteins
- Machine learning approaches to the study of regulatory elements in gene expression
All manuscripts submitted to this journal, including those submitted to collections and special issues, are assessed in line with our editorial policies and the journal’s peer review process. Reviewers and editors are required to declare competing interests and can be excluded from the peer review process if a competing interest exists.
Image credit: [M] NicoElNino / Getty Images / iStock