Document type

Article

Version

Published version

Publication date

Publication license

cc-by (c) Bernabeu Lorenzo, Manuel et al., 2019
Please use this identifier to cite or link to this item: https://hdl.handle.net/2445/163053

Gene duplications in the E. coli genome: common themes among pathotypes

Journal Title

Director/Tutor

Journal ISSN

Volume Title

Abstract

Background: Gene duplication underlies a significant proportion of gene functional diversity and genome complexity in both eukaryotes and prokaryotes. Although several reports in the literature described the duplication of specific genes in E. coli, a detailed analysis of the extent of gene duplications in this microorganism is needed. Results: The genomes of the E. coli enteroaggregative strain 042 and other pathogenic strains contain duplications of the gene that codes for the global regulator Hha. To determine whether the presence of additional copies of the hha gene correlates with the presence of other genes, we performed a comparative genomic analysis between E. coli strains with and without hha duplications. The results showed that strains harboring additional copies of the hha gene also encode the yeeR irmA (aec69) gene cluster, which, in turn, is also duplicated in strain 042 and several other strains. The identification of these duplications prompted us to obtain a global map of gene duplications, first in strain 042 and later in other E. coli genomes. Duplications in the genomes of the enteroaggregative strain 042, the uropathogenic strain CFT073 and the enterohemorrhagic strain O145:H28 have been identified by a BLASTp protein similarity search. This algorithm was also used to evaluate the distribution of the identified duplicates among the genomes of a set of 28 representative E. coli strains. Despite the high genomic diversity of E. coli strains, we identified several duplicates in the genomes of almost all studied pathogenic strains. Most duplicated genes have no known function. Transcriptomic analysis also showed that most of these duplications are regulated by the H-NS/Hha proteins. Conclusions: Several duplicated genes are widely distributed among pathogenic E. coli strains. In addition, some duplicated genes are present only in specific pathotypes, and others are strain specific. This gene duplication analysis shows novel relationships between E. coli pathotypes and suggests that newly identified genes that are duplicated in a high percentage of pathogenic E. coli isolates may play a role in virulence. Our study also shows a relationship between the duplication of genes encoding regulators and genes encoding their targets.

Subject (English)

Citation

Citation

BERNABEU LORENZO, Manuel, et al. Gene duplications in the E. coli genome: common themes among pathotypes. Bmc Genomics. 2019. Vol. 20, num. 313. ISSN 1471-2164. [consulted: 14 of June of 2026]. Available at: https://hdl.handle.net/2445/163053

Export metadata

JSON - METS

Share record