KEGG icon KEGG   KEGG2   ATLAS   PATHWAY   BRITE   DBGET   transparent  
 
KEGG Home
   Introduction
   Overview
   Release notes
   Current statistics
KEGG Identifiers
KGML
KEGG API
KEGG FTP
KegTools

Feedback
GenomeNet

KEGG Identifiers

KEGG Object Identifier

KEGG objects are biological entities from molecular to higher levels. Each object (except for GENES) is identified by the KEGG object identifer, consisting of a five-digit number prefixed by an upper-case alphabet, such as K05032 and D00336, or prefixed by a 2-4 letter code for PATHWAY and BRITE, such as map00010 and br08301.
Prefix Content Database
K
C
D
G
R
(code)
(code)
M
H
T
Gene/protein ortholog group
Chemical compound
Drug
Glycan
Reaction
Pathway map
Hierarchical text file
Pathway module
Human disease
Organism
ORTHOLOGY
COMPOUND
DRUG
GLYCAN
REACTION
PATHWAY
BRITE
MODULE
DISEASE
GENOME
Enter KEGG object identifiers to retrieve corresponding database entries:

   (Example) R07326 R00623 R00754 R01036 R04805 R04880 R06917
    

KEGG Organism Code

Each KEGG object can be uniquely identified without the database name becuase of the different prefix, but the general format to retrieve a database entry in KEGG and all other GenomeNet databases is:
db:entry
where "db" is the database name and "entry" is the entry name or the accession number (see DBGET for the list of database names and abbreviations). The above examples can then be written as ko:K05032 and drug:D00336 (or dr:D00336).

In addition to the internally used T number shown above, an organism in KEGG is given a three-letter KEGG organism code (with prefix "d" for draft genome and "e" for EST contigs), which is treated like a database name. Therefore, individual genes in an organism can be identified in the following way:
org:gene
where "org" is the KEGG organism code and "gene" is the KEGG GENES entry name (see below). The KEGG organism code is also used as a prefix to identify organism-specific pathway maps or BRITE functional hierarchies, such as hsa04930 and mmu04000.
Prefix Content Database
map
ko
br
"org"
Reference pathway
Reference pathway or BRITE hierarchy
Reference BRITE hierarchy
Organism-specific pathway or BRITE hierarchy
PATHWAY
PATHWAY or BRITE
BRITE
PATHWAY or BRITE

KEGG GENES Entry Name

Entry names of the KEGG GENES database are usually locus_tags given by the International Nucleotide Sequence Database Collaboration (INSDC). The major sequence databases such as NCBI and UniProt/Swiss-Prot use different sets of gene/protein identifiers. In order to facilitate the use of KEGG, automatic name conversion has been implemented for these identifiers.
Enter outside DB accession numbers to convert to KEGG GENES entries:
NCBI GeneID NCBI gi UniProt

   (Example) 3775638 3737440 3743551 3897645 3902295
    

KEGG Namespace

The KEGG identifiers, consisting of the KEGG object identifiers (prefix + five-digit numbers), the GENES entry names (locus_tags), and the enzyme identifiers (EC numbers), form the KEGG namespace. The KEGG identifiers summarized below are highly linked within the KEGG namespace representing different types of biological relationships.
Database Identifier Remark
PATHWAY map number Corresponds to a set of K numbers
BRITE ko/br number Corresponds to a set of K/C/D/G/R/A/T numbers
ORTHOLOGY K number Corresponds to a set of GENES entry names
GENES Entry name
(locus_tag)
 
LIGAND C/D/G/R/A number
EC number
 

Links to Outside Databases

In addition to the links within the KEGG namespace, the KEGG identifiers are also linked to outside database entries. The links usually represent the same biological meaning, such as the same gene or protein in the same organism, enabling the user to integrate information from different sources. In this respect, KEGG is a portal site to more specialized databases in various fields of biomedical sciences.
Database Identifier Outside links
GENES Entry name
(locus_tag)
NCBI gi number
Entrez Gene
OMIM
UniProt/Swiss-Prot
TIGR CMR
Sanger GeneDB
Other genome DBs
GENOME Organism code
(T number)
NCBI Taxonomy
RefSeq
JGI
TIGR
Sanger
Other genome DBs
ENZYME EC number
IUBMB
ExplorEnz
BRENDA
ExPASy
UM-BBD
ERGO
COMPOUND C number
PubChem
ChEBI
3DMET
DRUG D number
PubChem
GLYCAN G number
CarbBank
ORTHOLOGY K number
Gene Ontology
COG
TCDB
CAZy
KEGG pathways and BRITE functional hierarchies are higher-level objects, which are developed based on published literature and selected databases. Links to outside databases are given as a form of acknowledgement in this development.
Database Identifier Outside links
BRITE ko number
IUBMB Nomenclature
IUPHAR Nomenclature
SWISS-PROT Documents
TRANSFAC
TCDB
GPCR
NURSA
MEROPS
PROW
br number
IUPAC Nomenclature
LIPID MAPS
PATHWAY map number UM-BBD

Last updated: June 23, 2008
Copyright 1995-2008 Kanehisa Laboratories