Recognition sequence
The recognition sequence, sometimes also referred to as the recognition site, of any DNA-binding protein motif that exhibits binding specificity is the DNA sequence (or subset thereof) to which the domain is specific. Many recognition sequences, notably those of restriction endonucleases, are palindromes.
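A palindromic sequence reads the same 5' to 3' on both strands, so it equals its own reverse complement. The following is a minimal Python sketch of that check; the helper names `reverse_complement` and `is_palindromic` are illustrative, not taken from any library, and the code assumes unambiguous A/C/G/T input.

```python
# Watson-Crick complements for the four DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the complementary strand, written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def is_palindromic(seq: str) -> bool:
    """True if the sequence reads the same 5' to 3' on both strands."""
    return seq.upper() == reverse_complement(seq)


print(is_palindromic("CTGCAG"))      # PstI sequence: True
print(is_palindromic("GGGGCGGAGC"))  # one sequence bound by Sp1: False
```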

The transcription factor Sp1, for example, binds the sequences 5'-(G/T)GGGCGG(G/A)(G/A)(C/T)-3', where (G/T) indicates that the domain will bind either a guanine or a thymine at this position.
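The degenerate notation above can be expanded mechanically. The sketch below, in Python, lists the allowed bases at each of the ten positions, enumerates every concrete sequence the pattern covers, and builds an equivalent regular expression. The names `SP1_POSITIONS`, `expand`, and `to_regex` are hypothetical and chosen only for this illustration.

```python
from itertools import product

# Sp1 pattern from the text, one entry per position.
# A multi-letter entry lists the bases allowed at that position.
SP1_POSITIONS = ["GT", "G", "G", "G", "C", "G", "G", "GA", "GA", "CT"]


def expand(positions):
    """Enumerate every concrete sequence covered by the degenerate pattern."""
    return ["".join(bases) for bases in product(*positions)]


def to_regex(positions):
    """Write the pattern as a regular expression with character classes."""
    return "".join(p if len(p) == 1 else f"[{p}]" for p in positions)


sp1_sequences = expand(SP1_POSITIONS)
print(len(sp1_sequences))       # 16 (four two-way choices: 2 * 2 * 2 * 2)
print(to_regex(SP1_POSITIONS))  # [GT]GGGCGG[GA][GA][CT]
```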

The restriction endonuclease PstI recognizes, binds, and cleaves the sequence 5'-CTGCAG-3'.

However, the terms recognition sequence and recognition site refer to different things. A recognition sequence is a specific sequence of bases, usually very short (fewer than 10 bases). A recognition site is a particular occurrence of such a sequence, specified by its position. A given recognition sequence can occur one or more times, or not at all, on a specific DNA fragment. Depending on its degree of specificity, a DNA-binding protein can bind to more than one specific sequence. PstI has a single sequence specificity, 5'-CTGCAG-3'. The DNA fragment below contains two PstI recognition sites, starting at bases 9 and 31, and the recognition sequence is the same at both. Sp1 has multiple sequence specificity: the pattern shown above covers 16 sequences. The same fragment contains two Sp1 recognition sites, starting at bases 17 and 41, whose recognition sequences are 5'-GGGGCGGAGC-3' and 5'-TGGGCGGAAC-3' respectively.

5'-AACGTTAGCTGCAGTCGGGGCGGAGCTAGGCTGCAGGAATTGGGCGGAACCT-3'
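The distinction between sequence and site can be illustrated by scanning the fragment above for each pattern and recording where every match starts. This is a minimal Python sketch that assumes positions are counted from 1 at the 5' end and that only the strand shown is searched. The function name `find_sites` is hypothetical, and the Sp1 pattern is written as a regular expression with one character class per ambiguous position.

```python
import re

# The example fragment from the text, written 5' to 3'.
FRAGMENT = "AACGTTAGCTGCAGTCGGGGCGGAGCTAGGCTGCAGGAATTGGGCGGAACCT"


def find_sites(pattern, seq):
    """Return (1-based start position, matched sequence) for every site.

    The lookahead lets overlapping occurrences be reported as well.
    """
    lookahead = re.compile(f"(?=({pattern}))")
    return [(m.start() + 1, m.group(1)) for m in lookahead.finditer(seq)]


pst1_sites = find_sites("CTGCAG", FRAGMENT)
sp1_sites = find_sites("[GT]GGGCGG[GA][GA][CT]", FRAGMENT)

for name, sites in (("PstI", pst1_sites), ("Sp1", sp1_sites)):
    for position, sequence in sites:
        print(f"{name} site at base {position}: 5'-{sequence}-3'")
```

Each tuple pairs a site (the position) with the recognition sequence found there. For PstI the sequence is identical at every site, while for Sp1 it can differ from site to site.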

See also

  • DNA-binding domain

  • Transcription factor#Classes, for more examples
The source of this article is Wikipedia, the free encyclopedia. The text of this article is licensed under the GFDL.
 