- Course Project
- Database Search and Sequence Alignment
- Multiple Sequence Alignments
- Remember, none of these are optimal, so
looking at the various different alignments they produce can be
- Hidden Markov Models
- Molecular Biology Databases
- NAR annual
compilation of Molecular Biology Databases
- National Center for
Biotechnology Information (NCBI) and specific NCBI sites:
- UniProt (curated protein information, includes SwissProt and TrEMBL).
- PDB (protein structures)
- InterPro (protein families and motifs)
- The Gene Ontology (molecular biology conceptual structure, with mappings to particular genes), see also OBO
- Santa Cruz "Golden Path" (configurable human genome browser)
- DIP (database of interacting proteins) and MINT, a competitor.
- Kyoto Encylopedia of Genes and Genomes (KEGG) (metabolic and signalling pathways)
- Enzyme DB (US mirror) (enzyme activities), and Reactome, a human-specific competitor. Also consider MetaCyc
- The Human Genome Nomenclature Committee (HGNC) for gene names.
- The Human Protein Reference Database (HPRD)
- Regulatory information databases, such as Transfac, COMPEL,
and Mapper, EPD and CREB
- Information Hyperlinked over Proteins (IHOP) generated by text mining.