The majority of this document was taken from: Primer on Molecular Genetics, DOE Human Genome Program, U.S. Department of Eneregy, Office of Energy Research, June, 1992
Adenine (A): A nitrogenous base, one member of the base pair A-T (adenine-thymine).
Base pair (bp): Two nitrogenous bases (adenine and thymine or guanine and cytosine) held together by weak bonds. Two strands of DNA are held together in the shape of a double helix by the bonds between base pairs.
Base sequence: The order of nucleotide bases in a DNA molecule.
Base sequence analysis: A method, sometimes automated, for determining the base sequence.
Centimorgan (cM): A unit of measure of recombination frequency. One centimorgan is equal to a 1% chance that a marker at one genetic locus will be separated from a marker at a second locus due to crossing over in a single generation. In human beings, 1 centimorgan is equivalent, on average, to 1 million base pairs.
Centromere: A specialized chromosome region to which spindle fibers attach during cell division.
Clone bank: See genomic library.
Clones: A group of cells derived from a single ancestor.
Complementary DNA (cDNA): DNA that is synthesized from a messenger RNA template; the single-stranded form is often used as a probe in physical mapping.
Contigs: Groups of clones representing overlapping regions of a genome.
Cytosine (C): A nitrogenous base, one member of the base pair G-C (guanine and cytosine).
Deoxyribonucleotide: See nucleotide.
DNA (deoxyribonucleic acid): The molecule that encodes genetic information. DNA is a double-stranded molecule held together by weak bonds between base pairs of nucleotides. The four nucleotides in DNA contain the bases: adenine (A), guanine (G), cytosine (C), and thymine (T). In nature, base pairs form only between A and T and between G and C; thus the base sequence of each single strand can be deduced from that of its partner.
Double helix: The shape that two linear strands of DNA assume when bonded together.
EST: Expressed sequence tag. See sequence tagged site.
Evolutionarily conserved: See conserved sequence.
Exogenous DNA: DNA originating outside an organism.
Exons: The protein-coding DNA sequences of a gene. Compare introns.
Expressed gene: See gene expression.
FISH (fluorescence in situ hybridization): A physical mapping approach that uses fluorescein tags to detect hybridization of probes with metaphase chromosomes and with the less-condensed somatic interphase chromatin.
Gene families: Groups of closely related genes that make similar products.
Gene library: See genomic library.
Genetic engineering technologies: See recombinant DNA technologies.
Genetics: The study of the patterns of inheritance of specific traits.
Global Alignment:Alignment of A within B (like agrep) Allows for gaps.
Guanine (G): A nitrogenous base, one member of the base pair G-C (guanine and cytosine).
Heterozygosity: The presence of different alleles at one or more loci on homologous chromosomes.
Highest Scoring Pairs(hsp): The the highest scoring pair of alignments between two nucleotides or amino acids sequences.
Human gene therapy: Insertion of normal DNA directly into cells to correct
a genetic defect.
Interphase: The period in the cell cycle when DNA is replicated in the
nucleus; followed by mitosis.
In vitro: Outside a living organism.
Kilobase (kb): Unit of length for DNA fragments equal to 1000
nucleotides.
k-tuple (k-tup): a size "k" tuple of nucleotides or
amino acids.
Global Alignment:Alignment of A to B like (diff) Looks for Longest Common
Subsequence (LCS).
Localize: Determination of the original position (locus) of a
gene or other marker on a chromosome.
Locus (pl. loci): The position on a chromosome of a gene or
other chromosome marker; also, the DNA at that position. The use of
locus is sometimes restricted to mean regions of DNA that are
expressed. See gene expression.
Longest Commmon Subequence (lcs): The longest sequence of
nucleotides or amino acids common to two sequences.
Mapping: See gene mapping, linkage map, physical map.
Megabase (Mb): Unit of length for DNA fragments equal to 1 million
nucleotides and roughly equal to 1 cM.
Messenger RNA (mRNA): RNA that serves as a template for protein synthesis.
See genetic code.
Multifactorial or multigenic disorders: See polygenic disorders.
Multiple Alignment: A sequencing approach that simultaneously aligns
n sequences.
Mutation: Any heritable change in DNA sequence. Compare
polymorphism.
Nitrogenous base: A nitrogen-containing molecule having the chemical
properties of a base.
Nucleic acid: A large molecule composed of nucleotide subunits.
Nucleus: The cellular organelle in eukaryotes that contains the
genetic material.
Overlapping clones: See genomic library.
PCR: See polymerase chain reaction.
Phage: A virus for which the natural host is a bacterial cell.
Physical map: A map of the locations of identifiable landmarks on DNA
(e.g., restriction enzyme cutting sites, genes), regardless of
inheritance. Distance is measured in base pairs. For the human
genome, the lowest-resolution physical map is the banding
patterns on the 24 different chromosomes; the highest-resolution map
would be the complete nucleotide sequence of the chromosomes.
Polymerase chain reaction (PCR): A method for amplifying a DNA base
sequence using a heat-stable polymerase and two 20-base
primers, one complementary to the (+)-strand at one end of the
sequence to be amplified and the other complementary to the (-)-strand at the
other end. Because the newly synthesized DNA strands can subsequently serve as
additional templates for the same primer sequences, successive rounds of primer
annealing, strand elongation, and dissociation produce rapid and highly
specific amplification of the desired sequence. PCR also can be used to detect
the existence of the defined sequence in a DNA sample.
Promoter: A site on DNA to which RNA polymerase will bind and
initiate transcription.
Rare-cutter enzyme: See restriction enzyme cutting site.
Recombinant clones: Clones containing recombinant DNA
molecules. See recombinant DNA technologies.
Regulatory regions or sequences: A DNA base sequence that controls
gene expression.
Resolution: Degree of molecular detail on a physical map of DNA,
ranging from low to high.
Restriction enzyme, endonuclease: A protein that recognizes
specific, short nucleotide sequences and cuts DNA at those sites.
Bacteria contain over 400 such enzymes that recognize and cut over 100
different DNA sequences. See restriction enzyme cutting site.
Restriction fragment length polymorphism (RFLP): Variation between
individuals in DNA fragment sizes cut by specific restriction enzymes;
polymorphic sequences that result in RFLPs are used as markers
on both physical maps and genetic linkage maps. RFLPs are
usually caused by mutation at a cutting site. See marker.
RFLP: See restriction fragment length polymorphism.
Ribonucleic acid (RNA): A chemical found in the nucleus and
cytoplasm of cells; it plays an important role in protein synthesis and other
chemical activities of the cell. The structure of RNA is similar to that of
DNA. There are several classes of RNA molecules, including messenger RNA,
transfer RNA, ribosomal RNA, and other small RNAs, each serving a different
purpose.
Ribonucleotides: See nucleotide.
Ribosomal RNA (rRNA): A class of RNA found in the ribosomes of cells.
Sequence tagged site (STS): Short (200 to 500 base pairs) DNA
sequence that has a single occurrence in the human genome and whose location
and base sequence are known. Detectable by polymerase chain reaction,
STSs are useful for localizing and orienting the mapping and sequence data
reported from many different laboratories and serve as landmarks on the
developing physical map of the human genome. Expressed sequence tags
(ESTs) are STSs derived from cDNAs.
Somatic cells: Any cell in the body except gametes and their
precursors.
STS: See sequence tagged site.
Thymine (T): A nitrogenous base, one member of the base pair A-T
(adenine-thymine).
Transfer RNA (tRNA): A class of RNA having structures with triplet
nucleotide sequences that are complementary to the triplet
nucleotide coding sequences of mRNA. The role of tRNAs in protein
synthesis is to bond with amino acids and transfer them to the
ribosomes, where proteins are assembled according to the genetic code carried
by mRNA.
VLSI: Very large-scale integration allowing over 100,000 transistors on a
chip.
YAC: See yeast artificial chromosome.
Yeast artificial chromosome (YAC): A vector used to clone DNA fragments (up
to 400 kb); it is constructed from the telomeric, centromeric, and replication
origin sequences needed for replication in yeast cells. Compare cloning
vector, cosmid.