Search for a gene
Search for a genome
e.g. type esc to find Escherichia
- Search for a gene - type the name of a gene or other identifier into the search box above
- Find a genome - click in the 'browse a genome' box above and start typing your genome name to find matching genomes
- View full list of all Ensembl Bacteria species
- Access Ensembl Bacteria programmatically
What's New in Release 50
Release 50 of Ensembl Bacteria has no major updates since the previous release. As for release 49, we only represent non-redundant bacterial genomes as defined by criteria set out by UniProt. See more details about this update in our blog.
Genomes
- A total of 31,332 bacterial and archaeal genomes
Data
- Annotation of pathogen-host interaction data (PHI-base version 2019-09-16
- Alignments to Rfam covariance models (Rfam 12.2) visible in separate track (‘Rfam models’)
- Updated protein features for all species using InterProScan version 81
- Updated pan-taxonomic compara (which includes key bacterial species)
- Bacterial species names used within our production processes now have the assembly accession as a suffix (e.g. streptococcus_pneumoniae_tigr4 is now named streptococcus_pneumoniae_tigr4_gca_000006885). Please amend any stored bookmarks for species pages.
Archive sites
The following archive sites are available to access previous versions of data:
- Release 49, December 2020 eg49-bacteria.ensembl.org
- Release 45, September 2019 eg45-bacteria.ensembl.org
- Release 40, July 2018 eg40-bacteria.ensembl.org
- Release 37, October 2017 eg37-bacteria.ensembl.org
Ensembl Bacteria
Ensembl Bacteria is a browser for bacterial and archaeal genomes. These are taken from the databases of the International Nucleotide Sequence Database Collaboration, the European Nucleotide Archive at the EBI, GenBank at the NCBI, and the DNA Database of Japan).
Data access
Data can be visualised through the Ensembl genome browser and accessed programmatically via our Perl and RESTful APIs. Data is also accessible through public MySQL databases and our FTP site containing full data dumps in FASTA, EMBL, GTF, GFF3, JSON and RDF formats. A selection of over 100 key bacterial genomes have been included in the pan-taxonomic compara, and genes from all genomes have been classified into families using HAMAP and PANTHER more details.
Ensembl Genomes is developed by EMBL-EBI and is powered by the Ensembl software system for the analysis and visualisation of genomic data. For details of our funding please click here.