Briefings in Bioinformatics
FGeneBERT: function-driven pre-trained gene language model for metagenomics
AbstractMetagenomic data, comprising mixed multi-species genomes, are prevalent in diverse environments like oceans and soils, significantly impacting human health and ecological functions. However, current research relies on K-mer, which limits the ca…