From Wikipedia,
the free encyclopedia.
The International Nucleotide
Sequence Database (INSD)
consists of a joint effort to
collect and disseminate
databases about
DNA and
RNA sequences. It involves the
following computerized
databases:
DNA Data Bank of Japan (Japan),
GenBank (USA)
and the
EMBL Nucleotide Sequence Database
(European
Molecular Biology Laboratory,
Germany). New and updated data
on
nucleotide sequences
contributed by research teams to
each of the three databases are
synchronized on a daily basis,
thanks to continuous interactions
and meetings between the staffs of
the organizations in charge.
The DDBJ/EMBL/GenBank
synchronization is possible thanks
to a number of guidelines which
are produced and published by an
International Advisory. These
consist in a common definition of
the feature tables
[1] for the databases, which
regulate the content and
syntax
[2] of the database entries,
in the form of a common
DTD or Document Type
Definition.
The syntax is called INSDSeq
and its core consists of the
letter sequence of the
gene expression (aminoacid
sequence) and the letter sequence
for nucleotide bases in the gene
or decoded segment. In
[3] a DBFetch operation shows
a typical INSD entry.
Related topics