Jump to content
 







Main menu
   


Navigation  



Main page
Contents
Current events
Random article
About Wikipedia
Contact us
Donate
 




Contribute  



Help
Learn to edit
Community portal
Recent changes
Upload file
 








Search  

































Create account

Log in
 









Create account
 Log in
 




Pages for logged out editors learn more  



Contributions
Talk
 



















Contents

   



(Top)
 


1 Overview  





2 Example public databases for molecular biology  



2.1  Primary sequence databases  





2.2  Meta-databases  





2.3  Genome Databases  





2.4  Genome Browsers  





2.5  Protein sequence databases  





2.6  Protein structure Databases  





2.7  Protein-protein interactions  





2.8  Metabolic pathway Databases  





2.9  Microarray databases  





2.10  Mathematical Model Databases  





2.11  PCR / Real time PCR primer Databases  





2.12  Specialized databases  







3 Wiki style databases  





4 References  



4.1  See also  





4.2  External links  
















Biological database






العربية
Čeština
Español
فارسی
Français
Bahasa Melayu
Português
Suomi
 

Edit links
 









Article
Talk
 

















Read
Edit
View history
 








Tools
   


Actions  



Read
Edit
View history
 




General  



What links here
Related changes
Upload file
Special pages
Permanent link
Page information
Cite this page
Get shortened URL
Download QR code
Wikidata item
 




Print/export  



Download as PDF
Printable version
 




Print/export  







In other projects  



Wikimedia Commons
 
















Appearance
   

 






From Wikipedia, the free encyclopedia
 


This is an old revision of this page, as edited by Physicistjedi (talk | contribs)at02:09, 6 October 2008 (Wiki style databases). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.
(diff)  Previous revision | Latest revision (diff) | Newer revision  (diff)

Biological databases are libraries of life sciences information, collected from scientific experiments, published literature, high throughput experiment technology, and computational analyses. They contain information from research areas including genomics, proteomics, metabolomics, microarray gene expression, and phylogenetics. [1] Information contained in biological databases includes gene function, structure, localization (both cellular and chromosomal), clinical effects of mutations as well as similarities of biological sequences and structures.

Relational database concepts of computer science and Information retrieval concepts of digital libraries are important for understanding biological databases. Biological database design, development, and long-term management is a core area of the discipline of Bioinformatics. [2]. Data contents include gene sequences, textual descriptions, attributes and ontology classifications, citations, and tabular data. These are often described as semi-structured data, and can be represented as tables, key delimited records, and XML structures. Cross-references among databases are common, using database accession numbers.

Overview

Biological databases have become an important tool in assisting scientists to understand and explain a host of biological phenomena from the structure of biomolecules and their interaction, to the whole metabolism of organisms and to understanding the evolutionofspecies. This knowledge helps facilitate the fight against diseases, assists in the development of medications and in discovering basic relationships amongst species in the history of life.

The biological knowledge is distributed amongst many different general and specialized databases. This sometimes makes it difficult to ensure the consistency of information. Biological databases cross-reference other databases with accession numbers as one way of linking their related knowledge together.

An important resource for finding biological databases is a special yearly issue of the journal Nucleic Acids Research (NAR). The Database Issue of NAR is freely available, and categorizes many of the publicly available online databases related to biology and bioinformatics.


Example public databases for molecular biology

(from www.kokocinski.net)

Primary sequence databases

The International Nucleotide Sequence Database (INSD) consists of the following databases.

  1. DDBJ (DNA Data Bank of Japan)
  2. EMBL Nucleotide DB (European Molecular Biology Laboratory)
  3. GenBank [1] (National Center for Biotechnology Information)

These databanks represent the current knowledge about the sequences of all organisms. They interchange the stored information and are the source for many other databases.

Meta-databases

Strictly speaking a meta-database can be considered a database of databases, rather than any one integration project or technology. They collect data from different sources and usually makes them available in new and more convenient form, or with an emphasis on a particular disease or organism.

  1. Entrez[2] (National Center for Biotechnology Information)
  2. euGenes (Indiana University)
  3. GeneCards (Weizmann Inst.)
  4. SOURCE (Stanford University)
  5. mGen containing four of the world biggest databases GenBank, Refseq, EMBL and DDBJ - easy and simple program friendly gene extraction
  6. Bioinformatic Harvester[3] (Karlsruhe Institute of Technology) - Integrating 26 major protein/gene resources.
  7. MetaBase[4] (KOBIC) - A user contributed database of biological databases.

Genome Databases

These databases collect organism genome sequences, annotate and analyze them, and provide public access. Some add curation of experimental literature to improve computed annotations. These databases may hold many species genomes, or a single model organism genome.

  1. Ensembl provides automatic annotation databases for human, mouse, other vertebrate and eukaryote genomes.
  2. JGI Genomes of the DOE-Joint Genome Institute provides databases of many eukaryote and microbial genomes.
  3. CAMERA Resource for microbial genomics and metagenomics
  4. MGI Mouse Genome (Jackson Lab.)
  5. Corn, the Maize Genetics and Genomics Database
  6. Saccharomyces Genome Database, genome of the yeast model organism.
  7. Wormbase, genome of the model organism Caenorhabditis elegans
  8. Flybase, genome of the model organism Drosophila melanogaster
  9. Zebrafish Information Network, genome of this fish model organism.
  10. Viral Bioinformatics Resource Center Curated database containing annotated genome data for eleven virus families.
  11. ERIC (Enteropathogen Resource Integration Center) Curated database containing annotated genome data for five enteropathogens - Escherichia coli, Shigella, Salmonella, Yersinia enterocolitica, and Y. pestis.

Genome Browsers

Genome Browsers enable researchers to visualize and browse entire genomes (most have many complete genomes) with annotated data including gene prediction and structure, proteins, expression, regulation, variation, comparative analysis, etc. Annotated data is usually from multiple diverse sources.

  1. Integrated Microbial Genomes (IMG) system by the DOE-Joint Genome Institute
  2. UCSC Genome Bioinformatics Genome Browser and Tools (UCSC)
  3. Ensembl The Ensembl Genome Browser (Sanger Institute and EBI)
  4. GBrowse The GMOD GBrowse Project
  5. Pathway Tools Genome Browser
  6. X:Map A genome browser that shows Affymetrix Exon Microarray hit locations alongside the gene, transcript and exon data on a Google maps api
  7. Viral Genome Organizer (VGO) A genome browser providing visualization and analysis tools for annotated whole genomes from the eleven virus families in the VBRC (Viral Bioinformatics Resource Center) databases
  8. Apollo Genome Annotation Curation Tool A cross-platform, JAVA-based standalone genome viewer with enterprise-level functionality and customizations. The standard for many model organism databases.

Protein sequence databases

  1. UniProt[5] Universal Protein Resource (UniProt Consortium: EBI, Expasy, PIR)
  2. PIR Protein Information Resource (Georgetown University Medical Center (GUMC))
  3. Swiss-Prot[6] Protein Knowledgebase (Swiss Institute of Bioinformatics)
  4. PEDANT Protein Extraction, Description and ANalysis Tool (Forschungszentrum f. Umwelt & Gesundheit)
  5. PROSITE Database of Protein Families and Domains
  6. DIP Database of Interacting Proteins (Univ. of California)
  7. Pfam Protein families database of alignments and HMMs (Sanger Institute)
  8. ProDom Comprehensive set of Protein Domain Families (INRA/CNRS)
  9. SignalP 3.0 Server for signal peptide prediction (including cleavage site prediction), based on artificial neural networks and HMMs
  10. SUPERFAMILY Library of HMMs representing superfamilies and database of (superfamily and family) annotations for all completely sequenced organisms

Protein structure Databases

  1. Protein Data Bank[7] (PDB) (Research Collaboratory for Structural Bioinformatics (RCSB))
  2. CATH Protein Structure Classification
  3. SCOP Structural Classification of Proteins
  4. SWISS-MODEL Server and Repository for Protein Structure Models
  5. ModBase Database of Comparative Protein Structure Models (Sali Lab, UCSF)

Protein-protein interactions

  1. BioGRID [8] A General Repository for Interaction Datasets (Samuel Lunenfeld Research Institute)
  2. STRING: STRING is a database of known and predicted protein-protein interactions. (EMBL)
  3. DIP Database of Interacting Proteins

Metabolic pathway Databases

  1. BioCyc Database Collection including EcoCyc and MetaCyc
  2. KEGG PATHWAY Database[9] (Univ. of Kyoto)
  3. MANET database [10] (University of Illinois)
  4. Reactome[11] (Cold Spring Harbor Laboratory, EBI, Gene Ontology Consortium)

Microarray databases

  1. ArrayExpress (European Bioinformatics Institute)
  2. Gene Expression Omnibus (National Center for Biotechnology Information)
  3. maxd (Univ. of Manchester)
  4. SMD (Stanford University)
  5. GPX(Scottish Centre for Genomic Technology and Informatics)

Mathematical Model Databases

  1. CellML
  2. Biomodels Database

PCR / Real time PCR primer Databases

  1. PathoOligoDB: A free QPCR oligo database for pathogens

Specialized databases

  1. BIOMOVIE (ETH Zurich) movies related to biology and biotechnology
  2. CGAP Cancer Genes (National Cancer Institute)
  3. Clone Registry Clone Collections (National Center for Biotechnology Information)
  4. DBGET H.sapiens (Univ. of Kyoto)
  5. GDB Hum. Genome Db (Human Genome Organisation)
  6. SHMPD The Singapore Human Mutation and Polymorphism Database
  7. NCBI-UniGene (National Center for Biotechnology Information)
  8. OMIM Inherited Diseases (Online Mendelian Inheritance in Man)
  9. Off. Hum. Genome Db (HUGO Gene Nomenclature Committee)
  10. HGMD disease-causing mutations (HGMD Human Gene Mutation Database)
  11. PhenCode linking human mutations with phenotype
  12. List with SNP-Databases
  13. p53 The p53 Knowledgebase
  14. Edinburgh Mouse Atlas
  15. HvrBase++ Human and primate mitochondrial DNA
  16. PolygenicPathways Genes and risk factors implicated in Alzheimer's disease, Bipolar disorder or Schizophrenia
  17. Connectivity map Transcriptional expression data and correlation tools for drugs
  18. CTD The Comparative Toxicogenomics Database describes chemical-gene-disease interactions

Wiki style databases

  1. EcoliWiki
  2. Gene Wiki
  3. OpenWetWare
  4. PDBWiki
  5. Proteopedia
  6. Topsan
  7. WikiGenes
  8. WikiPathways
  9. WikiProfessional


References

  1. ^ Altman RB (2004). "Building successful biological databases". Brief. Bioinformatics. 5 (1): 4–5. PMID 15153301. {{cite journal}}: Unknown parameter |month= ignored (help)
  • ^ Bourne P (2005). "Will a biological database be different from a biological journal?". PLoS Comput. Biol. 1 (3): 179–81. doi:10.1371/journal.pcbi.0010034. PMID 16158097. {{cite journal}}: Unknown parameter |month= ignored (help)CS1 maint: unflagged free DOI (link)
  • See also

    Template:Harvesternavi

    External links


    Retrieved from "https://en.wikipedia.org/w/index.php?title=Biological_database&oldid=243332591"

    Categories: 
    Bioinformatics databases
    Bioinformatics
    Online databases
    Hidden categories: 
    CS1 errors: unsupported parameter
    CS1 maint: unflagged free DOI
     



    This page was last edited on 6 October 2008, at 02:09 (UTC).

    This version of the page has been revised. Besides normal editing, the reason for revision may have been that this version contains factual inaccuracies, vandalism, or material not compatible with the Creative Commons Attribution-ShareAlike License.



    Privacy policy

    About Wikipedia

    Disclaimers

    Contact Wikipedia

    Code of Conduct

    Developers

    Statistics

    Cookie statement

    Mobile view



    Wikimedia Foundation
    Powered by MediaWiki