; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0006091 (gene) of Snake gourd v1 genome

Gene IDTan0006091
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionzinc finger protein ZAT2-like
Genome locationLG01:20494365..20495105
RNA-Seq ExpressionTan0006091
SyntenyTan0006091
Gene Ontology termsGO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR013087 - Zinc finger C2H2-type
IPR036236 - Zinc finger C2H2 superfamily
IPR044653 - C2H2-type zinc-finger protein AZF1/2/3-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596792.1 Zinc finger protein ZAT3, partial [Cucurbita argyrosperma subsp. sororia]1.6e-2039.07Show/hide
Query:  EKQERRTCTVCLLVFPSEKGVHGHMRSHPDRGWRGMKKLAPVPAAAAAAAATGHRMISSSPSDGCGHGHRHGVAENYYCCSKALSLRWSVTAKRGRKCIK
        +++ R  C  C   F SEK +HGHMRSHP+R WRGM       AAAAA AATG    SSS S G  H  R     +  C S  L+  WS TA RGR+ I 
Subjt:  EKQERRTCTVCLLVFPSEKGVHGHMRSHPDRGWRGMKKLAPVPAAAAAAAATGHRMISSSPSDGCGHGHRHGVAENYYCCSKALSLRWSVTAKRGRKCIK

Query:  NNTTSSNDTGSTSGSTSGSGSLLREKEIEIDDHDHEVILKKEYQCHICNKKFPSPQALGGHKSSGHHKLTEPAAAAPQLNLQPPTQSDYQMGSRPNKILD
          ++SS  +G    + +G               + + +L+ EY C  C K F SPQALGGHKSS            P+ N           G +  K++ 
Subjt:  NNTTSSNDTGSTSGSTSGSGSLLREKEIEIDDHDHEVILKKEYQCHICNKKFPSPQALGGHKSSGHHKLTEPAAAAPQLNLQPPTQSDYQMGSRPNKILD

Query:  FDLNQLPPQDHHGEA
        FDLN+LPP+D  GEA
Subjt:  FDLNQLPPQDHHGEA

XP_008449183.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103491133 [Cucumis melo]1.4e-1332.88Show/hide
Query:  DEKQERRTCTVCLLVFPSEKGVHGHMRSHPDRGWRGMKKLAPVPAAAAAAAATGHRMISSSPSDGCGHGHRHGVAENYYCCSKALSLRWSVTAKRGRK--
        +EK  +  C VC   F S K ++GHMRSHPDR W+G++   P+  A++++++      SSSP                          WS+TAKRG K  
Subjt:  DEKQERRTCTVCLLVFPSEKGVHGHMRSHPDRGWRGMKKLAPVPAAAAAAAATGHRMISSSPSDGCGHGHRHGVAENYYCCSKALSLRWSVTAKRGRK--

Query:  -CIKNNTTSSNDTGSTSGSTSGSGSLLREKEIEIDDHD-------HEVILKKEYQCHICNKKFPSPQALGGHKSSGHHKLTEPAAAAPQLNLQPPTQSDY
         C+    +SS+   S+S + +   SL    ++  D          +E+ LKK Y+C+ C K++ S QALGGHKS  H K           NL   T  + 
Subjt:  -CIKNNTTSSNDTGSTSGSTSGSGSLLREKEIEIDDHD-------HEVILKKEYQCHICNKKFPSPQALGGHKSSGHHKLTEPAAAAPQLNLQPPTQSDY

Query:  QMGSRPNKILDFDLNQLPPQDH
               KIL+FDLN+LP  +H
Subjt:  QMGSRPNKILDFDLNQLPPQDH

XP_022143488.1 zinc finger protein ZAT3-like [Momordica charantia]9.4e-1836.02Show/hide
Query:  CTVCLLVFPSEKGVHGHMRSHPDRGWRGMKKLAPVPAAAAAAAATGHRMISSSPSDGCGHGHRHGVAENYYCCSKALSLRWSVTAKRGRKCIKNNTTSSN
        C +C  +F + K ++GHMRSHPDR WRGM    P P++++++AA      +S+P                          WSVT +RGRK          
Subjt:  CTVCLLVFPSEKGVHGHMRSHPDRGWRGMKKLAPVPAAAAAAAATGHRMISSSPSDGCGHGHRHGVAENYYCCSKALSLRWSVTAKRGRKCIKNNTTSSN

Query:  DTGSTSGSTSGSGSLLREKEIEIDDHDHEVILKKE-YQCHICNKKFPSPQALGGHKSSGHHKLTEPAAAAPQLNLQPPTQSDY--QMGSRPNKILDFDLN
         T S + STS S           D+HD     K   Y+C++C K+F SPQALGGHKSS H+K               P ++D+  Q  S   + +DFDLN
Subjt:  DTGSTSGSTSGSGSLLREKEIEIDDHDHEVILKKE-YQCHICNKKFPSPQALGGHKSSGHHKLTEPAAAAPQLNLQPPTQSDY--QMGSRPNKILDFDLN

Query:  QLPPQDHHGEA
        +LPP D HGEA
Subjt:  QLPPQDHHGEA

XP_022951048.1 zinc finger protein ZAT2-like [Cucurbita moschata]1.6e-2038.6Show/hide
Query:  EKQERRTCTVCLLVFPSEKGVHGHMRSHPDRGWRGMKKLAPVPAAAAAAAATGHRMISSSPSDGCGHGHRHGVAENYYCCSKALSLRWSVTAKRGRKCIK
        +++ R  C  C   F SEK +HGHMRSHP+R WRGM       AAAAAA A      SSS S G  H  R     +  C S  L+  WS TA RGR+ I 
Subjt:  EKQERRTCTVCLLVFPSEKGVHGHMRSHPDRGWRGMKKLAPVPAAAAAAAATGHRMISSSPSDGCGHGHRHGVAENYYCCSKALSLRWSVTAKRGRKCIK

Query:  NNTTSSNDTGSTSGSTSGSGSLLREKEIEIDDHDHEVILKKEYQCHICNKKFPSPQALGGHKSSGHHKLTEPAAAAPQLNLQPPTQSDYQMGSRPNKILD
          ++SS  +G    + +G               + + +L+ EY C  C K F SPQALGGHKSS            P+ N           G +  K+L 
Subjt:  NNTTSSNDTGSTSGSTSGSGSLLREKEIEIDDHDHEVILKKEYQCHICNKKFPSPQALGGHKSSGHHKLTEPAAAAPQLNLQPPTQSDYQMGSRPNKILD

Query:  FDLNQLPPQDHHGEA
        FDLN+LPP+D  GEA
Subjt:  FDLNQLPPQDHHGEA

XP_023539493.1 zinc finger protein ZAT2-like [Cucurbita pepo subsp. pepo]1.0e-1938.14Show/hide
Query:  EKQERRTCTVCLLVFPSEKGVHGHMRSHPDRGWRGMKKLAPVPAAAAAAAATGHRMISSSPSDGCGHGHRHGVAENYYCCSKALSLRWSVTAKRGRKCIK
        +++ R  C  C   F SEK +HGHMRSHP+R WRGM      P +AAA  ATG    SSS S G  H  R     +  C S  L+  WS TA RGR+ I 
Subjt:  EKQERRTCTVCLLVFPSEKGVHGHMRSHPDRGWRGMKKLAPVPAAAAAAAATGHRMISSSPSDGCGHGHRHGVAENYYCCSKALSLRWSVTAKRGRKCIK

Query:  NNTTSSNDTGSTSGSTSGSGSLLREKEIEIDDHDHEVILKKEYQCHICNKKFPSPQALGGHKSSGHHKLTEPAAAAPQLNLQPPTQSDYQMGSRPNKILD
          ++SS  +G    + +G               + + +L+ EY C  C + F SPQALG HKSS   K TE                    G +  K+L 
Subjt:  NNTTSSNDTGSTSGSTSGSGSLLREKEIEIDDHDHEVILKKEYQCHICNKKFPSPQALGGHKSSGHHKLTEPAAAAPQLNLQPPTQSDYQMGSRPNKILD

Query:  FDLNQLPPQDHHGEA
        FDLN+LPP+D  GEA
Subjt:  FDLNQLPPQDHHGEA

TrEMBL top hitse value%identityAlignment
A0A0A0L2N1 Uncharacterized protein2.2e-1232.44Show/hide
Query:  DEKQERRTCTVCLLVFPSEKGVHGHMRSHPDRGWRGMKKLAPVPAAAAAAAATGHRMISSSPSDGCGHGHRHGVAENYYCCSKALSLRWSVTAKRGRKCI
        +EK  +  C VC   F S K ++GHMRSHPDR W+G++   P+  A++++++      SSSP                          WS TAKRG K I
Subjt:  DEKQERRTCTVCLLVFPSEKGVHGHMRSHPDRGWRGMKKLAPVPAAAAAAAATGHRMISSSPSDGCGHGHRHGVAENYYCCSKALSLRWSVTAKRGRKCI

Query:  KNNTTSSNDTGSTSGSTSGSGSLLREKEIEIDDHD--------------HEVILKKEYQCHICNKKFPSPQALGGHKSSGHHKLTEPAAAAPQLNLQPPT
           +T++    S+S S S S     + E    D D              +E+ L K Y+C+ C K++ S QALGGHKS  H K           N+  PT
Subjt:  KNNTTSSNDTGSTSGSTSGSGSLLREKEIEIDDHD--------------HEVILKKEYQCHICNKKFPSPQALGGHKSSGHHKLTEPAAAAPQLNLQPPT

Query:  QSDYQMGSRPNKILDFDLNQLPPQD
          +        KIL+FDLN+LP  +
Subjt:  QSDYQMGSRPNKILDFDLNQLPPQD

A0A1S3BLH2 LOW QUALITY PROTEIN: uncharacterized protein LOC1034911336.8e-1432.88Show/hide
Query:  DEKQERRTCTVCLLVFPSEKGVHGHMRSHPDRGWRGMKKLAPVPAAAAAAAATGHRMISSSPSDGCGHGHRHGVAENYYCCSKALSLRWSVTAKRGRK--
        +EK  +  C VC   F S K ++GHMRSHPDR W+G++   P+  A++++++      SSSP                          WS+TAKRG K  
Subjt:  DEKQERRTCTVCLLVFPSEKGVHGHMRSHPDRGWRGMKKLAPVPAAAAAAAATGHRMISSSPSDGCGHGHRHGVAENYYCCSKALSLRWSVTAKRGRK--

Query:  -CIKNNTTSSNDTGSTSGSTSGSGSLLREKEIEIDDHD-------HEVILKKEYQCHICNKKFPSPQALGGHKSSGHHKLTEPAAAAPQLNLQPPTQSDY
         C+    +SS+   S+S + +   SL    ++  D          +E+ LKK Y+C+ C K++ S QALGGHKS  H K           NL   T  + 
Subjt:  -CIKNNTTSSNDTGSTSGSTSGSGSLLREKEIEIDDHD-------HEVILKKEYQCHICNKKFPSPQALGGHKSSGHHKLTEPAAAAPQLNLQPPTQSDY

Query:  QMGSRPNKILDFDLNQLPPQDH
               KIL+FDLN+LP  +H
Subjt:  QMGSRPNKILDFDLNQLPPQDH

A0A2G9HH86 Uncharacterized protein1.6e-1033.33Show/hide
Query:  EKQERRTCTVCLLVFPSEKGVHGHMRSHPDRGWRGMKKLAPVPAA------AAAAAATGHRMISSS-PSDG---CGHGHRHGVAENYYCCSKALSLRWSV
        E  E   C VC   FP+EK +HGHMR HPDR WRGMK   PVPA        A  +     M S   P +G    G    HG A               +
Subjt:  EKQERRTCTVCLLVFPSEKGVHGHMRSHPDRGWRGMKKLAPVPAA------AAAAAATGHRMISSS-PSDG---CGHGHRHGVAENYYCCSKALSLRWSV

Query:  TAKRGRKCIKNNTTSSNDTGSTSGSTSGSGSLLREKEIEIDDHDH---EVILKKEYQCHICNKKFPSPQALGGHKS--------SGHHKLTEPAA---AA
         AK      K    +SN     SG   G G  L   ++  +  +    ++I K+ Y C IC+++F S QALGGHKS        SG    +  AA   A 
Subjt:  TAKRGRKCIKNNTTSSNDTGSTSGSTSGSGSLLREKEIEIDDHDH---EVILKKEYQCHICNKKFPSPQALGGHKS--------SGHHKLTEPAA---AA

Query:  PQLNLQPPTQSDYQMGSRPNKILDFDLNQLPPQD
        P +  +   QS            +FDLN+LPP+D
Subjt:  PQLNLQPPTQSDYQMGSRPNKILDFDLNQLPPQD

A0A6J1CQU4 zinc finger protein ZAT3-like4.6e-1836.02Show/hide
Query:  CTVCLLVFPSEKGVHGHMRSHPDRGWRGMKKLAPVPAAAAAAAATGHRMISSSPSDGCGHGHRHGVAENYYCCSKALSLRWSVTAKRGRKCIKNNTTSSN
        C +C  +F + K ++GHMRSHPDR WRGM    P P++++++AA      +S+P                          WSVT +RGRK          
Subjt:  CTVCLLVFPSEKGVHGHMRSHPDRGWRGMKKLAPVPAAAAAAAATGHRMISSSPSDGCGHGHRHGVAENYYCCSKALSLRWSVTAKRGRKCIKNNTTSSN

Query:  DTGSTSGSTSGSGSLLREKEIEIDDHDHEVILKKE-YQCHICNKKFPSPQALGGHKSSGHHKLTEPAAAAPQLNLQPPTQSDY--QMGSRPNKILDFDLN
         T S + STS S           D+HD     K   Y+C++C K+F SPQALGGHKSS H+K               P ++D+  Q  S   + +DFDLN
Subjt:  DTGSTSGSTSGSGSLLREKEIEIDDHDHEVILKKE-YQCHICNKKFPSPQALGGHKSSGHHKLTEPAAAAPQLNLQPPTQSDY--QMGSRPNKILDFDLN

Query:  QLPPQDHHGEA
        +LPP D HGEA
Subjt:  QLPPQDHHGEA

A0A6J1GGL7 zinc finger protein ZAT2-like7.5e-2138.6Show/hide
Query:  EKQERRTCTVCLLVFPSEKGVHGHMRSHPDRGWRGMKKLAPVPAAAAAAAATGHRMISSSPSDGCGHGHRHGVAENYYCCSKALSLRWSVTAKRGRKCIK
        +++ R  C  C   F SEK +HGHMRSHP+R WRGM       AAAAAA A      SSS S G  H  R     +  C S  L+  WS TA RGR+ I 
Subjt:  EKQERRTCTVCLLVFPSEKGVHGHMRSHPDRGWRGMKKLAPVPAAAAAAAATGHRMISSSPSDGCGHGHRHGVAENYYCCSKALSLRWSVTAKRGRKCIK

Query:  NNTTSSNDTGSTSGSTSGSGSLLREKEIEIDDHDHEVILKKEYQCHICNKKFPSPQALGGHKSSGHHKLTEPAAAAPQLNLQPPTQSDYQMGSRPNKILD
          ++SS  +G    + +G               + + +L+ EY C  C K F SPQALGGHKSS            P+ N           G +  K+L 
Subjt:  NNTTSSNDTGSTSGSTSGSGSLLREKEIEIDDHDHEVILKKEYQCHICNKKFPSPQALGGHKSSGHHKLTEPAAAAPQLNLQPPTQSDYQMGSRPNKILD

Query:  FDLNQLPPQDHHGEA
        FDLN+LPP+D  GEA
Subjt:  FDLNQLPPQDHHGEA

SwissProt top hitse value%identityAlignment
O65499 Zinc finger protein ZAT33.3e-0525.88Show/hide
Query:  RKPHEGTSKSTSVDVNNNDDEMKLVKELDEKQER--RTCTVCLLVFPSEKGVHGHMRSHPDRGWRGMKKLAPVPAAAAAAAATGHRMISSSPSDGCGHGH
        +K    T  S+S     +  + K  K+ D    +  R CT C   F S K + GHMR HP+R WRG+          AA++   ++++ +  S      H
Subjt:  RKPHEGTSKSTSVDVNNNDDEMKLVKELDEKQER--RTCTVCLLVFPSEKGVHGHMRSHPDRGWRGMKKLAPVPAAAAAAAATGHRMISSSPSDGCGHGH

Query:  R--------------HGVAENYYC--CSKALSLRWSVTAKR-GRKCIKNNTTSSNDTGSTSGSTSGSGSLLREKEIEIDDHDHE---VILKKEYQCHICN
                           E + C  C K      ++   R   K +K     +N T      ++ SG            HDH+   +     ++C+IC 
Subjt:  R--------------HGVAENYYC--CSKALSLRWSVTAKR-GRKCIKNNTTSSNDTGSTSGSTSGSGSLLREKEIEIDDHDHE---VILKKEYQCHICN

Query:  KKFPSPQALGGHKSSGHHKLTEP-AAAAPQLNLQPPTQSDYQMGSRPNKILDFDL
        + F S QALGGH      K  EP  + A  LN+ PPT  D          LD  L
Subjt:  KKFPSPQALGGHKSSGHHKLTEP-AAAAPQLNLQPPTQSDYQMGSRPNKILDFDL

Arabidopsis top hitse value%identityAlignment
AT2G17180.1 C2H2-like zinc finger protein1.3e-0427.23Show/hide
Query:  DEKQERRTCTVCLLVFPSEKGVHGHMRSHPDRGWRGM------KKLAPVPAAAAAA----AATGHRMISS--SPSDGCGHGHRHGVAENYYC--CSKALS
        D  Q  R CT C   F S K + GHMR HP+R WRG+      K+     AA++++    +   H + S     ++G        V E + C  C K   
Subjt:  DEKQERRTCTVCLLVFPSEKGVHGHMRSHPDRGWRGM------KKLAPVPAAAAAA----AATGHRMISS--SPSDGCGHGHRHGVAENYYC--CSKALS

Query:  LRWSVTAKRG-RKCIKNNTTSSNDTGSTSGSTSGSGSLLREKEIEIDDHDH----EVILKKEYQCHICNKKFPSPQALGGHKSSGHHK-LTEPAAAAPQL
           ++   R   K +K    + N T                   EI D D     +++    ++C+IC++ F S QALGGH      K   E       L
Subjt:  LRWSVTAKRG-RKCIKNNTTSSNDTGSTSGSTSGSGSLLREKEIEIDDHDH----EVILKKEYQCHICNKKFPSPQALGGHKSSGHHK-LTEPAAAAPQL

Query:  NLQPPTQSDYQMG
        N+   T SD  +G
Subjt:  NLQPPTQSDYQMG

AT3G49930.1 C2H2 and C2HC zinc fingers superfamily protein8.3e-0462.07Show/hide
Query:  KKEYQCHICNKKFPSPQALGGHKSSGHHK
        +K+Y+C +C K FPS QALGGHK+S H K
Subjt:  KKEYQCHICNKKFPSPQALGGHKSSGHHK

AT4G35280.1 C2H2-like zinc finger protein2.3e-0625.88Show/hide
Query:  RKPHEGTSKSTSVDVNNNDDEMKLVKELDEKQER--RTCTVCLLVFPSEKGVHGHMRSHPDRGWRGMKKLAPVPAAAAAAAATGHRMISSSPSDGCGHGH
        +K    T  S+S     +  + K  K+ D    +  R CT C   F S K + GHMR HP+R WRG+          AA++   ++++ +  S      H
Subjt:  RKPHEGTSKSTSVDVNNNDDEMKLVKELDEKQER--RTCTVCLLVFPSEKGVHGHMRSHPDRGWRGMKKLAPVPAAAAAAAATGHRMISSSPSDGCGHGH

Query:  R--------------HGVAENYYC--CSKALSLRWSVTAKR-GRKCIKNNTTSSNDTGSTSGSTSGSGSLLREKEIEIDDHDHE---VILKKEYQCHICN
                           E + C  C K      ++   R   K +K     +N T      ++ SG            HDH+   +     ++C+IC 
Subjt:  R--------------HGVAENYYC--CSKALSLRWSVTAKR-GRKCIKNNTTSSNDTGSTSGSTSGSGSLLREKEIEIDDHDHE---VILKKEYQCHICN

Query:  KKFPSPQALGGHKSSGHHKLTEP-AAAAPQLNLQPPTQSDYQMGSRPNKILDFDL
        + F S QALGGH      K  EP  + A  LN+ PPT  D          LD  L
Subjt:  KKFPSPQALGGHKSSGHHKLTEP-AAAAPQLNLQPPTQSDYQMGSRPNKILDFDL

AT5G56200.1 C2H2 type zinc finger transcription factor family9.8e-0531.71Show/hide
Query:  KLVKELDEKQERR----TCTVCLLVFPSEKGVHGHMRSHPDRGWRGM---KKLAPVPAAAAAAAATGH--RMISS--SPSDGCGHGHRHGVAENYYCCSK
        K+V  +  + ER      C VC   F S K ++GHMR HPDRGW+G+     L P P   ++++   H    ISS     D           EN      
Subjt:  KLVKELDEKQERR----TCTVCLLVFPSEKGVHGHMRSHPDRGWRGM---KKLAPVPAAAAAAAATGH--RMISS--SPSDGCGHGHRHGVAENYYCCSK

Query:  ALSLR--------WSVTAKRGRK
         L L         WS   KRGR+
Subjt:  ALSLR--------WSVTAKRGRK

AT5G56200.1 C2H2 type zinc finger transcription factor family2.2e-0123.98Show/hide
Query:  KKLAPVPAAAAAAAATGHRM---ISSSPSDGCGHGHRHGVAENYYC--CSKALSLRWSVTAKRGR----KCIKNNTTSSNDTGSTSGS---TSGSGSLLR
        K+L+ +   ++ +    H++   +  +  +G G G R    E + C  C+K+ S   ++   R      K ++N+   +N   S  G+    +G  S   
Subjt:  KKLAPVPAAAAAAAATGHRM---ISSSPSDGCGHGHRHGVAENYYC--CSKALSLRWSVTAKRGR----KCIKNNTTSSNDTGSTSGS---TSGSGSLLR

Query:  EKEIEIDDHDHEVILKKEYQCHICNKKFPSPQALGGHKS-------SGHHKLTEPAAA---APQLNLQPPTQSDYQMGSRPNKILDFDLNQLPPQD
                H+       ++ C+IC+K F + QALGGHK        S     T P AA       +    T++  ++     ++L+FDLN+LPP +
Subjt:  EKEIEIDDHDHEVILKKEYQCHICNKKFPSPQALGGHKS-------SGHHKLTEPAAA---APQLNLQPPTQSDYQMGSRPNKILDFDLNQLPPQD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAAAGCCTCATGAAGGCACCTCAAAAAGTACTAGTGTGGATGTTAATAATAATGATGATGAGATGAAATTAGTGAAAGAGTTGGATGAGAAGCAGGAACGCCGCAC
ATGCACCGTCTGCTTGCTTGTGTTCCCATCGGAGAAGGGGGTGCACGGTCATATGAGGTCTCATCCTGACAGGGGTTGGAGGGGAATGAAAAAATTAGCTCCAGTTCCAG
CTGCAGCTGCAGCAGCCGCCGCCACCGGTCACCGGATGATCTCTTCCTCCCCTTCGGATGGTTGCGGCCACGGCCACCGCCACGGCGTGGCTGAGAATTATTATTGTTGT
AGTAAGGCTTTGTCATTGAGATGGTCTGTTACTGCCAAGAGAGGGCGGAAGTGTATTAAAAATAATACTACTTCGTCTAACGACACTGGCTCTACTTCTGGCTCTACTTC
TGGCTCTGGTTCTCTTTTGAGGGAAAAGGAAATTGAAATTGATGATCATGATCATGAGGTGATTTTGAAGAAGGAATATCAATGTCATATTTGCAACAAAAAATTCCCAT
CTCCACAAGCATTGGGAGGGCACAAGTCCAGTGGCCACCACAAATTAACTGAACCAGCTGCTGCAGCTCCTCAACTCAATCTGCAGCCGCCTACTCAATCAGATTATCAG
ATGGGCTCACGCCCCAATAAAATATTAGATTTTGATCTCAACCAGCTCCCGCCACAAGACCACCATGGAGAAGCTACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGAAAGCCTCATGAAGGCACCTCAAAAAGTACTAGTGTGGATGTTAATAATAATGATGATGAGATGAAATTAGTGAAAGAGTTGGATGAGAAGCAGGAACGCCGCAC
ATGCACCGTCTGCTTGCTTGTGTTCCCATCGGAGAAGGGGGTGCACGGTCATATGAGGTCTCATCCTGACAGGGGTTGGAGGGGAATGAAAAAATTAGCTCCAGTTCCAG
CTGCAGCTGCAGCAGCCGCCGCCACCGGTCACCGGATGATCTCTTCCTCCCCTTCGGATGGTTGCGGCCACGGCCACCGCCACGGCGTGGCTGAGAATTATTATTGTTGT
AGTAAGGCTTTGTCATTGAGATGGTCTGTTACTGCCAAGAGAGGGCGGAAGTGTATTAAAAATAATACTACTTCGTCTAACGACACTGGCTCTACTTCTGGCTCTACTTC
TGGCTCTGGTTCTCTTTTGAGGGAAAAGGAAATTGAAATTGATGATCATGATCATGAGGTGATTTTGAAGAAGGAATATCAATGTCATATTTGCAACAAAAAATTCCCAT
CTCCACAAGCATTGGGAGGGCACAAGTCCAGTGGCCACCACAAATTAACTGAACCAGCTGCTGCAGCTCCTCAACTCAATCTGCAGCCGCCTACTCAATCAGATTATCAG
ATGGGCTCACGCCCCAATAAAATATTAGATTTTGATCTCAACCAGCTCCCGCCACAAGACCACCATGGAGAAGCTACTTAA
Protein sequenceShow/hide protein sequence
MRKPHEGTSKSTSVDVNNNDDEMKLVKELDEKQERRTCTVCLLVFPSEKGVHGHMRSHPDRGWRGMKKLAPVPAAAAAAAATGHRMISSSPSDGCGHGHRHGVAENYYCC
SKALSLRWSVTAKRGRKCIKNNTTSSNDTGSTSGSTSGSGSLLREKEIEIDDHDHEVILKKEYQCHICNKKFPSPQALGGHKSSGHHKLTEPAAAAPQLNLQPPTQSDYQ
MGSRPNKILDFDLNQLPPQDHHGEAT