Machine learning approaches to identify core and dispensable genes in pangenomes

Abstract

A gene in a given taxonomic group is either present in every individual (core) or absent in at least a single individual (dispensable). Previous pangenomic studies have identified certain functional differences between core and dispensable genes. However, identifying if a gene belongs to the core or dispensable portion of the genome requires the construction of a pangenome, which involves sequencing the genomes of many individuals. Here we aim to leverage the previously characterized core and dispensable gene content for two grass species [Brachypodium distachyon (L.) P. Beauv. and Oryza sativa L.] to construct a machine learning model capable of accurately classifying genes as core or dispensable using only a single annotated reference genome. Such a model may mitigate the need for pangenome construction, an expensive hurdle especially in orphan crops, which often lack the adequate genomic resources.

Machine learning approaches to identify core and dispensable genes in pangenomes

Published by nAlan E. Yocca, nPatrick P. Edgern on September 17, 2021

Abstract

Plant Biology

Exploring the mechanism of Suxin Hugan Fang in treating ulcerative colitis based on network pharmacology

Plant Biology

Effect of electrode size and distance to tissue on unipolar and bipolar voltage electrograms and their implications for a near-field cutoff

Plant Biology

Influence of different definitions of unintentional burns on the prevalence and risk factors in children living in rural areas in Zunyi, Southwest China

Machine learning approaches to identify core and dispensable genes in pangenomes

Published by nAlan E. Yocca, nPatrick P. Edgern on September 17, 2021

Abstract

Related Posts

Plant Biology

Exploring the mechanism of Suxin Hugan Fang in treating ulcerative colitis based on network pharmacology

Plant Biology

Effect of electrode size and distance to tissue on unipolar and bipolar voltage electrograms and their implications for a near-field cutoff

Plant Biology

Influence of different definitions of unintentional burns on the prevalence and risk factors in children living in rural areas in Zunyi, Southwest China