Ensembl

From Wikipedia, the free encyclopedia

Ensembl is a bioinformatics research project, and more specifically a genome browser, that aims to "develop a software system which produces and maintains automatic annotation on selected eukaryotic genomes". It is run as a collaboration between the Wellcome Trust Sanger Institute and the European Bioinformatics Institute, an outstation of the European Molecular Biology Laboratory.

Software and data

The project is open source: all data and all software produced by the project can be freely accessed and used.

Most of the software produced and used is written in Perl and is based on the BioPerl infrastructure. The Perl API can readily be reused in other genomics projects, e.g. for annotating lists of genes or clones.
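As an illustration of what annotating a gene list involves, the following minimal sketch joins a list of gene symbols against a small in-memory annotation table. It is written in Python rather than Perl and does not use the Ensembl API. The table `ANNOTATION`, the function `annotate_gene_list`, and every identifier and coordinate in it are hypothetical placeholders, not real Ensembl data.

```python
from typing import Dict, List, Optional

# Hypothetical annotation records keyed by gene symbol.
# Identifiers and coordinates are placeholders, not real Ensembl data.
ANNOTATION: Dict[str, Dict[str, object]] = {
    "GENE_A": {"stable_id": "TOY00000000001", "chromosome": "1", "start": 1000, "end": 5000},
    "GENE_B": {"stable_id": "TOY00000000002", "chromosome": "2", "start": 200, "end": 900},
}


def annotate_gene_list(
    symbols: List[str],
    annotation: Dict[str, Dict[str, object]] = ANNOTATION,
) -> List[Dict[str, Optional[object]]]:
    """Return one row per input symbol, in input order.

    Symbols without a record in the annotation table are kept,
    with their "annotation" field set to None.
    """
    rows = []
    for symbol in symbols:
        rows.append({"symbol": symbol, "annotation": annotation.get(symbol)})
    return rows


rows = annotate_gene_list(["GENE_A", "GENE_X"])
```

In a real project the lookup table would be replaced by queries against an annotation source; keeping unmatched symbols in the output makes it easy to see which entries of the list could not be annotated.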

The website code uses an extensible plugin system that allows groups to adapt the website to their own data sets, e.g. Vega, which stores and displays manual annotation, and Gramene, which stores plant genomes.

Current species

The annotated genomes include most fully sequenced vertebrates and selected model organisms. All of them are eukaryotes; no prokaryotes are included. Currently these are:

  • Chordates
    • Mammals: Armadillo, Bush baby, Cat, Chimp, Cow, Dog, Elephant, Guinea pig, Hedgehog, Human, Macaque, Micro bat, Mouse, Opossum, Platypus, Rabbit, Rat, Shrew, Squirrel, Tenrec, Tree shrew, Horse (pre), Mouse lemur (pre), Pig (pre), Pika (pre)
    • Birds: Chicken
    • Fish: Takifugu rubripes (Fugu), Tetraodon nigroviridis (Green spotted pufferfish), Danio rerio (Zebrafish), Oryzias latipes (Medaka), Gasterosteus aculeatus (Stickleback), Petromyzon marinus (Sea lamprey) (pre)
    • Lizard: Anole Lizard (pre)
    • Frog: Xenopus tropicalis
    • Ancient relatives: Ciona intestinalis, Ciona savignyi
  • Invertebrates
  • Yeast: Saccharomyces cerevisiae (Baker's yeast)

Usage

The service is used by molecular biologists and bioinformaticians around the world who work with genome data from the organisms listed above. Its predictions of coding, regulatory and other elements in the genomes can be compared with primary research data and with common repositories of current genomic knowledge (biological databases).

The comparison of organisms (comparative genomics, also called intergenomics) with respect to their gene structures and encoded proteins is of special interest. The synteny view can also serve as useful educational material for school classes.
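To make the idea of synteny concrete, the sketch below groups orthologous gene pairs into blocks in which neighbouring genes in one species also have neighbouring orthologs on a single chromosome in the other species. This is a deliberately simplified illustration and not the method Ensembl uses. The `OrthologPair` type, the `synteny_blocks` function, its `max_gap` and `min_size` parameters, and the toy data are all assumptions made for the example.

```python
from typing import List, NamedTuple


class OrthologPair(NamedTuple):
    chrom_a: str  # chromosome in species A
    index_a: int  # gene order along that chromosome in species A
    chrom_b: str  # chromosome in species B
    index_b: int  # gene order along that chromosome in species B


def synteny_blocks(
    pairs: List[OrthologPair], max_gap: int = 1, min_size: int = 2
) -> List[List[OrthologPair]]:
    """Group ortholog pairs into runs that are adjacent in both species.

    Pairs are walked in species A order. A run continues while both
    chromosomes stay the same and the gene order advances by at most
    max_gap in A and changes by at most max_gap (in either direction)
    in B. Runs shorter than min_size are discarded.
    """
    ordered = sorted(pairs, key=lambda p: (p.chrom_a, p.index_a))
    blocks: List[List[OrthologPair]] = []
    current: List[OrthologPair] = []
    for pair in ordered:
        if current:
            prev = current[-1]
            contiguous = (
                pair.chrom_a == prev.chrom_a
                and pair.chrom_b == prev.chrom_b
                and 0 < pair.index_a - prev.index_a <= max_gap
                and 0 < abs(pair.index_b - prev.index_b) <= max_gap
            )
            if not contiguous:
                if len(current) >= min_size:
                    blocks.append(current)
                current = []
        current.append(pair)
    if len(current) >= min_size:
        blocks.append(current)
    return blocks


# Toy data: hypothetical gene orders, not taken from any real genome.
toy_pairs = [
    OrthologPair("1", 0, "4", 10),
    OrthologPair("1", 1, "4", 11),
    OrthologPair("1", 2, "4", 12),
    OrthologPair("1", 3, "7", 3),
    OrthologPair("2", 0, "7", 8),
    OrthologPair("2", 1, "7", 7),
]
toy_blocks = synteny_blocks(toy_pairs)
```

With the toy data, the first three pairs form one block (chromosome 1 in species A against chromosome 4 in species B) and the last two form a second, inverted block on chromosome 7. The isolated pair in between is dropped because it falls below `min_size`.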
