Search for a gene
Search for a genome
e.g. type esc to find Escherichia
Access to 6000 Bacterial Genomes
- Search for a gene - type the name of a gene or other identifier into the search box above
- Find a genome - click in the 'browse a genome' box above and start typing your genome name to find matching genomes
- View full list of all Ensembl Bacteria species
- Access Ensembl Bacteria programmatically
What's new in release 18
Release 18 of Ensembl Bacteria has been loaded from EMBL-Bank release 115 into 27 multispecies Ensembl v71 databases. The current dataset contains 6,306 genomes (6,047 eubacteria and 162 archaea) containing 20,630,403 protein coding genes loaded from 508,569 INSDC entries.
- Generated stable IDs now have the prefix EB in place of ENA. The web interface provides mappings between all old and new identifiers (including pre-EG17 identifiers where available).
- Issue where some assemblies consisted of annotated WGSs rather than supercontig/chromosome now addressed.
- Addition of cross-references to Rhea.
Future Releases
Release 19 of Ensembl Genomes is scheduled for 2nd July 2013. Detailed notes for this can be found here.
Ensembl Bacteria
Over 6000 genome sequences from bacteria and archaea have been annotated and deposited in the public archives of the members of the International Nucleotide Sequence Database Collaboration. This site provides access to complete, annotated genomes from bacteria and archaea (present in the European Nucleotide Archive) through the Ensembl graphical user interface (genome browser). More details about the integration are provided here
Programmatic access is available through the Ensembl Perl and REST-ful APIs and through publicly accessible mysql databases, along with full data dumps (including DNA sequence and protein sequence in FASTA format, annotations in GTF format, and mysql dump files). Due to the large number of these databases, there is some modification to the APIs, and database and FTP site structure, compared to that used for other branches of the taxonomy (e.g. the storage of many genomes in one database; the provision of lookup services to identify genomes by INSDC identifiers, taxonomy identifiers, or partial names. Full details are available here.
BioMart access is not available, but we are working on providing new, more powerful data mining tools to allow users to exploit these genomes. A selection of over 100 key bacterial genomes has been included in the pan-taxonomic Compara, and genes from all genomes are classified into families using HAMAP and PANTHER (more details)
Ensembl Bacteria has been developed with the support of the Microme project, a resource for bacterial metabolism.
Ensembl Genomes is developed by EMBL-EBI and is powered by Ensembl software system for the analysis and visualisation of genomic data. For details of our funding please click here.




