; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0008197 (gene) of Snake gourd v1 genome

Gene IDTan0008197
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCCHC-type domain-containing protein
Genome locationLG09:10437192..10442262
RNA-Seq ExpressionTan0008197
SyntenyTan0008197
Gene Ontology termsNA
InterPro domainsIPR025836 - Zinc knuckle CX2CX4HX4C
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PPD84469.1 hypothetical protein GOBAR_DD18598 [Gossypium barbadense]3.6e-2231.1Show/hide
Query:  VGDGVRFYTILDLGEDNYSWGANMRVWIRIDVSKSLRRGIK-INIDGPMGGCWIPMKYEKLSKLCSHCGIIGHHFKDCSLFYKNVNSKNTFHQYGQWLRY
        VG+ +     +D  + N  W   +R+ I+I++S  +RR +K +  D     C   +KYE+L   C +CG+IGH  K C    ++        QYG WLR 
Subjt:  VGDGVRFYTILDLGEDNYSWGANMRVWIRIDVSKSLRRGIK-INIDGPMGGCWIPMKYEKLSKLCSHCGIIGHHFKDCSLFYKNVNSKNTFHQYGQWLRY

Query:  IGKLSKIAKTPISGGKGGDLVLK--PKVKEGRGGK-DSTDVSDQHRTPGLRKAFLILKTMEQEKWQFTGLYGQPDHRLRFQTWELLRKLQSIEDSPWVIG
        +  ++   +  I G  G ++++K  P  ++  G K D+ + S Q       K            ++FTG YG  D   R  +W++LRK+       W+IG
Subjt:  IGKLSKIAKTPISGGKGGDLVLK--PKVKEGRGGK-DSTDVSDQHRTPGLRKAFLILKTMEQEKWQFTGLYGQPDHRLRFQTWELLRKLQSIEDSPWVIG

Query:  GDLNEILRDYEKVGGEPRDRSLISNFRNVLYDCELKDLGCEGNLLTWCNIRNGD
        GD N IL + EK GG  + ++LI +FR V+ +  L DL  +     W N R GD
Subjt:  GDLNEILRDYEKVGGEPRDRSLISNFRNVLYDCELKDLGCEGNLLTWCNIRNGD

TXG69190.1 hypothetical protein EZV62_004125 [Acer yangbiense]5.1e-2428.38Show/hide
Query:  TMEQEK-WQFTGLYGQPDHRLRFQTWELLRKLQSIEDSPWVIGGDLNEILRDYEKVGGEPRDRSLISNFRNVLYDCELKDLGCEGNLLTWCNIRNGDDQM
        T+  +K W+ TG YG P+   R   W LLR+L  +   PW +GGD NEI+   EKVGG  R    + NF+  L DC L+DLG  G   TW N R+ +  +
Subjt:  TMEQEK-WQFTGLYGQPDHRLRFQTWELLRKLQSIEDSPWVIGGDLNEILRDYEKVGGEPRDRSLISNFRNVLYDCELKDLGCEGNLLTWCNIRNGDDQM

Query:  GQGSEFWVGTKGQIIERGTKEAYAQPRPLDFKVINDMEIELETLLHQEEQYWRQRQNTDDLHVADFITPTRSWDMSKLAQYFDEQEVDGEHDFPTQRRWF
         +  +  VG  G +       +    + LDF   +   I LE    +EE   R R   D           R W          E++     DF +Q  + 
Subjt:  GQGSEFWVGTKGQIIERGTKEAYAQPRPLDFKVINDMEIELETLLHQEEQYWRQRQNTDDLHVADFITPTRSWDMSKLAQYFDEQEVDGEHDFPTQRRWF

Query:  DVV-------DIVQQEDFELVLIGVWSIWNDRNNVVHNRPIMNFE--DRCGWIVDYLHRFKHLNSPGVSRSGVQKESLC----EVPRGCVKVFVDVACDE
        D V       DI+    FE + +  W +W  RN +V+ +        D   W   ++  FK   +  V    V K+ +       P G  K+  D   D 
Subjt:  DVV-------DIVQQEDFELVLIGVWSIWNDRNNVVHNRPIMNFE--DRCGWIVDYLHRFKHLNSPGVSRSGVQKESLC----EVPRGCVKVFVDVACDE

Query:  VNKRVGFGVAIVSADGKLIVTMENCGQNYISPQ----IDVRDGARLASQMGFKHCLNFSDSLVVISMVNN
          K  G GV I    G ++ ++       + PQ    + V  G RLA + G       SDSL V++++N+
Subjt:  VNKRVGFGVAIVSADGKLIVTMENCGQNYISPQ----IDVRDGARLASQMGFKHCLNFSDSLVVISMVNN

XP_010686122.1 PREDICTED: uncharacterized protein LOC104900404 [Beta vulgaris subsp. vulgaris]1.0e-2433.47Show/hide
Query:  ILDLGEDNYSWGANMRVWIRIDVSKSLRRGIKINI-DGPMGGCWIPMKYEKLSKLCSHCGIIGHHFKDCSLFYKNVNSKNTFHQYGQWLRYIGKLSKIAK
        +L++  D   W  + RV I +D+ K LRR  +I++ DG      + +KYE+L   C  CG+IGH  +DC          N   Q+G WLR   +  + +K
Subjt:  ILDLGEDNYSWGANMRVWIRIDVSKSLRRGIKINI-DGPMGGCWIPMKYEKLSKLCSHCGIIGHHFKDCSLFYKNVNSKNTFHQYGQWLRYIGKLSKIAK

Query:  TPISGGKGGDLVLKPKVKEGRGGKDSTDVSDQHRTPGLRKAFLILKTMEQ-EKWQFTGLYGQPDHRLRFQTWELLRKLQSIEDSPWVIGGDLNEILRDYE
             G+  D V         G K++ D +         K  +    + + E+W+F G+YG P+   + +TWEL+R L    D P V+GGD NEIL   E
Subjt:  TPISGGKGGDLVLKPKVKEGRGGKDSTDVSDQHRTPGLRKAFLILKTMEQ-EKWQFTGLYGQPDHRLRFQTWELLRKLQSIEDSPWVIGGDLNEILRDYE

Query:  KVGGEPRDRSLISNFRNVLYDCELKDLGCEGNLLTW
        K GG  R+R  +  FR V+  C L+DL   G   TW
Subjt:  KVGGEPRDRSLISNFRNVLYDCELKDLGCEGNLLTW

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]2.3e-4027.86Show/hide
Query:  VGDGVRFYTILDLGEDNYSWGANMRVWIRIDVSKSLRRGIKINIDGPMGGCWIPMKYEKLSKLCSHCGIIGHHFKDCSLFYKNVNSKNTFHQYGQWLRYI
        +G+ +  +   D  + N  WG+N+RV + +D+SK LRRGIK+N+DGP+GG WIP++YE+L   C HCG+               +S    HQYG WLRY 
Subjt:  VGDGVRFYTILDLGEDNYSWGANMRVWIRIDVSKSLRRGIKINIDGPMGGCWIPMKYEKLSKLCSHCGIIGHHFKDCSLFYKNVNSKNTFHQYGQWLRYI

Query:  GKL---------------------------------------------------SKIAKTPISGGKGGDLVLKP-KVKEGRGGKDSTDVSD-----QHRT
        G +                                                   S + +TP  G +       P  + EG    +  ++S+     +   
Subjt:  GKL---------------------------------------------------SKIAKTPISGGKGGDLVLKP-KVKEGRGGKDSTDVSD-----QHRT

Query:  PGLRKAFL-ILKTMEQE---KWQFTGLYGQPDHRLRFQTWELLRKLQSIEDSPWVIGGDLNEILRDYEKVGGEPRDRSLISNFRNVLYDCELKDLGCEGN
        P ++ ++   L  M+       +FTG YG P    R  TWELLR++ +++ SPW+IGGD+N IL +YE       D S I  FRN++  C L D+G +G 
Subjt:  PGLRKAFL-ILKTMEQE---KWQFTGLYGQPDHRLRFQTWELLRKLQSIEDSPWVIGGDLNEILRDYEKVGGEPRDRSLISNFRNVLYDCELKDLGCEGN

Query:  LLTWCNIRNGDDQM-----------------------------------------------GQGSEFWVGTKGQIIERGTKEAYAQPRPLDFKVINDMEI
        + TWCN R   DQ+                                               G+ + + +  + +  +    +AY QP PLDF +I+ +E 
Subjt:  LLTWCNIRNGDDQM-----------------------------------------------GQGSEFWVGTKGQIIERGTKEAYAQPRPLDFKVINDMEI

Query:  ELETLLHQEEQYWRQRQNTD
        +L  LL  EE +W+QR   D
Subjt:  ELETLLHQEEQYWRQRQNTD

XP_027118730.1 uncharacterized protein LOC113735973 [Coffea arabica]7.3e-2332.66Show/hide
Query:  SWGANMRVWIRIDVSKSLRRGIKINIDGPMGGCWIPMKYEKLSKLCSHCGIIGHHFKDCSLFYKNVNSKNTFHQYGQWLRYIGKLSKIAKTPISGGKGG-
        S G  +R+ ++++++  L+R +K+ I+G +  C +  +YE+L   C+ CG IGH  +DC                        KL  +AK   S    G 
Subjt:  SWGANMRVWIRIDVSKSLRRGIKINIDGPMGGCWIPMKYEKLSKLCSHCGIIGHHFKDCSLFYKNVNSKNTFHQYGQWLRYIGKLSKIAKTPISGGKGG-

Query:  -DLVLKPKVKEGRGGKDSTDVSDQHRTPGLRKAFLILKT-------------MEQEKWQFTGLYGQPDHRLRFQTWELLRKLQSIEDSPWVIGGDLNEIL
         D   +P V+       ST V+ Q R   L     IL T             M    W+ TG YG P+   R  TW+++RKL ++   PWV  GD NE+L
Subjt:  -DLVLKPKVKEGRGGKDSTDVSDQHRTPGLRKAFLILKT-------------MEQEKWQFTGLYGQPDHRLRFQTWELLRKLQSIEDSPWVIGGDLNEIL

Query:  RDYEKVGGEPRDRSLISNFRNVLYDCELKDLGCEGNLLTWCNIRNGDD
           E  G   R +  I NFR  L DC L DLG EGN  TWC  R+  D
Subjt:  RDYEKVGGEPRDRSLISNFRNVLYDCELKDLGCEGNLLTWCNIRNGDD

TrEMBL top hitse value%identityAlignment
A0A2N9E949 CCHC-type domain-containing protein5.1e-3033.45Show/hide
Query:  VGDGVRFYTILDLGEDNYSWGANMRVWIRIDVSKSLRRGIKINIDGPMGGCWIPMKYEKLSKLCSHCGIIGHHFKDCSLFYKNVNSKNTFH-QYGQWLRY
        VG+ +     +D+ ED  +WG  MRV +RIDVS  L R  ++ + G     W+ +KYEKL   C +CGI+GH  ++C L  ++  SK+  H +YG WLR 
Subjt:  VGDGVRFYTILDLGEDNYSWGANMRVWIRIDVSKSLRRGIKINIDGPMGGCWIPMKYEKLSKLCSHCGIIGHHFKDCSLFYKNVNSKNTFH-QYGQWLRY

Query:  IGKLSKIAKTPISGG--KGGDLVLKPKVKEG-----------------RGGKDSTDVSDQH-----RTPGLRKAFLI---LKTMEQEK-WQFTGLYGQPD
             K       G   K   +V   ++  G                 +G  +   V   H       P +   FL+   +K +E EK W+ TG YG P+
Subjt:  IGKLSKIAKTPISGG--KGGDLVLKPKVKEG-----------------RGGKDSTDVSDQH-----RTPGLRKAFLI---LKTMEQEK-WQFTGLYGQPD

Query:  HRLRFQTWELLRKLQSIEDSPWVIGGDLNEILRDYEKVGGEPRDRSLISNFRNVLYDCELKDLGCEGNLLTWCNIRNG
           R  +W+LL+ L      PWV+ GD NEIL + EK+G   R  S +++FR  L    L+DLG  G   TW N R G
Subjt:  HRLRFQTWELLRKLQSIEDSPWVIGGDLNEILRDYEKVGGEPRDRSLISNFRNVLYDCELKDLGCEGNLLTWCNIRNG

A0A2N9G3M9 CCHC-type domain-containing protein6.5e-2528.27Show/hide
Query:  VGDGVRFYTILDLGEDNYSWGANMRVWIRIDVSKSLRRGIKINIDGPMGGCWIPMKYEKLSKLCSHCGIIGHHFKDCSLFYKNVNSKNTFH-QYGQWLRY
        VG+ +     +D+ ED  +WG  MRV +RIDVS  L R  ++ + G     W+ +KYEKL   C +CGI+GH  ++C L  ++  SK+  H +YG WLR 
Subjt:  VGDGVRFYTILDLGEDNYSWGANMRVWIRIDVSKSLRRGIKINIDGPMGGCWIPMKYEKLSKLCSHCGIIGHHFKDCSLFYKNVNSKNTFH-QYGQWLRY

Query:  IGKLSKIAKTPISG--------------GKGGDLVLKP-----------------------------------------------KVKEGRGGKDSTDVS
             K       G                GG     P                                               KV++         V 
Subjt:  IGKLSKIAKTPISG--------------GKGGDLVLKP-----------------------------------------------KVKEGRGGKDSTDVS

Query:  DQHRTPGLRKAFLILKTMEQEK------------------WQFTGLYGQPDHRLRFQTWELLRKLQSIEDSPWVIGGDLNEILRDYEKVGGEPRDRSLIS
         Q R  GL   +  L  +E +K                  W+ TG YG P+   R  +W+LL+ L      PWV+ GD NEIL + EK+G   R  S ++
Subjt:  DQHRTPGLRKAFLILKTMEQEK------------------WQFTGLYGQPDHRLRFQTWELLRKLQSIEDSPWVIGGDLNEILRDYEKVGGEPRDRSLIS

Query:  NFRNVLYDCELKDLGCEGNLLTWCNIRNG
        +FR  L    L+DLG  G   TW N R G
Subjt:  NFRNVLYDCELKDLGCEGNLLTWCNIRNG

A0A6J1DX30 uncharacterized protein LOC1110248741.1e-4027.86Show/hide
Query:  VGDGVRFYTILDLGEDNYSWGANMRVWIRIDVSKSLRRGIKINIDGPMGGCWIPMKYEKLSKLCSHCGIIGHHFKDCSLFYKNVNSKNTFHQYGQWLRYI
        +G+ +  +   D  + N  WG+N+RV + +D+SK LRRGIK+N+DGP+GG WIP++YE+L   C HCG+               +S    HQYG WLRY 
Subjt:  VGDGVRFYTILDLGEDNYSWGANMRVWIRIDVSKSLRRGIKINIDGPMGGCWIPMKYEKLSKLCSHCGIIGHHFKDCSLFYKNVNSKNTFHQYGQWLRYI

Query:  GKL---------------------------------------------------SKIAKTPISGGKGGDLVLKP-KVKEGRGGKDSTDVSD-----QHRT
        G +                                                   S + +TP  G +       P  + EG    +  ++S+     +   
Subjt:  GKL---------------------------------------------------SKIAKTPISGGKGGDLVLKP-KVKEGRGGKDSTDVSD-----QHRT

Query:  PGLRKAFL-ILKTMEQE---KWQFTGLYGQPDHRLRFQTWELLRKLQSIEDSPWVIGGDLNEILRDYEKVGGEPRDRSLISNFRNVLYDCELKDLGCEGN
        P ++ ++   L  M+       +FTG YG P    R  TWELLR++ +++ SPW+IGGD+N IL +YE       D S I  FRN++  C L D+G +G 
Subjt:  PGLRKAFL-ILKTMEQE---KWQFTGLYGQPDHRLRFQTWELLRKLQSIEDSPWVIGGDLNEILRDYEKVGGEPRDRSLISNFRNVLYDCELKDLGCEGN

Query:  LLTWCNIRNGDDQM-----------------------------------------------GQGSEFWVGTKGQIIERGTKEAYAQPRPLDFKVINDMEI
        + TWCN R   DQ+                                               G+ + + +  + +  +    +AY QP PLDF +I+ +E 
Subjt:  LLTWCNIRNGDDQM-----------------------------------------------GQGSEFWVGTKGQIIERGTKEAYAQPRPLDFKVINDMEI

Query:  ELETLLHQEEQYWRQRQNTD
        +L  LL  EE +W+QR   D
Subjt:  ELETLLHQEEQYWRQRQNTD

A0A803PRV4 Uncharacterized protein3.8e-2531.41Show/hide
Query:  RVWIRIDVSKSLRRGIKINIDGPMGGCWIPMKYEKLSKLCSHCGIIGHHFKDCSLFYKNVNSKNT--FHQYGQWLRY--IGKLSKIAKTPISGGKGGDLV
        R+W  + ++K +  G  +   G     WI  +YE+L  +C  CG IGH  KDC      V  + T     YG+WL+   IG+     K+   G +G   +
Subjt:  RVWIRIDVSKSLRRGIKINIDGPMGGCWIPMKYEKLSKLCSHCGIIGHHFKDCSLFYKNVNSKNT--FHQYGQWLRY--IGKLSKIAKTPISGGKGGDLV

Query:  LKPKVKEGRGGKD------STDVSDQ-------------HRTPG-----LRK--AFLILKTMEQEK---------------------WQFTGLYGQPDHR
        ++     G G         S ++ DQ             H   G     LRK    ++L  M+ E                      W+FTG YG PD  
Subjt:  LKPKVKEGRGGKD------STDVSDQ-------------HRTPG-----LRK--AFLILKTMEQEK---------------------WQFTGLYGQPDHR

Query:  LRFQTWELLRKLQSIEDSPWVIGGDLNEILRDYEKVGGEPRDRSLISNFRNVLYDCELKDLGCEGNLLTWCNIRNGD
         RF +W+LL+++  +   PWV+GGD NEI+   EK GG P+   LI NFR  L  C L+++G EG+  TWCN R  D
Subjt:  LRFQTWELLRKLQSIEDSPWVIGGDLNEILRDYEKVGGEPRDRSLISNFRNVLYDCELKDLGCEGNLLTWCNIRNGD

A0A803QD63 Uncharacterized protein4.5e-2633.33Show/hide
Query:  RVWIRIDVSKSLRRGIKINIDGPMGGCWIPMKYEKLSKLCSHCGIIGHHFKDCSLFYKNV--NSKNTFHQYGQWLRYIGKLSKIAKTPISGGKGGDLVLK
        RVW  + ++K +  G  +   G     WI  KYE+   +C  CG IGH FKDC+     +  +       YG WL+       + +   + GK G ++L 
Subjt:  RVWIRIDVSKSLRRGIKINIDGPMGGCWIPMKYEKLSKLCSHCGIIGHHFKDCSLFYKNV--NSKNTFHQYGQWLRYIGKLSKIAKTPISGGKGGDLVLK

Query:  PKVKEGRGGKDSTDVSDQHRTPGLRKAFLILKTMEQEKWQFTGLYGQPDHRLRFQTWELLRKLQSIEDSPWVIGGDLNEILRDYEKVGGEPRDRSLISNF
                   S DV  Q  +         ++  + + W+FTG YG PD   R  +W+LL++L  +   PW +GG+ NEIL   EK+GG  +   LI+NF
Subjt:  PKVKEGRGGKDSTDVSDQHRTPGLRKAFLILKTMEQEKWQFTGLYGQPDHRLRFQTWELLRKLQSIEDSPWVIGGDLNEILRDYEKVGGEPRDRSLISNF

Query:  RNVLYDCELKDLGCEGNLLTWCNIR
        R  L  C+L+D+G EG+  TWCN R
Subjt:  RNVLYDCELKDLGCEGNLLTWCNIR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding1.3e-0435.82Show/hide
Query:  LDLGEDNYSWGANMRVWIRIDVSKSLRRGIKINIDGPMGGCWIPMKYEKLSKLCSHCGIIGHHFKDC
        +D+   N+  G   RV I ++++K L+  + IN D         + YE LSK+CS CGI GH    C
Subjt:  LDLGEDNYSWGANMRVWIRIDVSKSLRRGIKINIDGPMGGCWIPMKYEKLSKLCSHCGIIGHHFKDC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTTCAACAAATTCTTGCTAGCTCTGGAACACCCCCTTCTGTTTACGAAGTCGTCGGAGATGGGGTTCGTTTTTACACAATTCTAGATTTAGGGGAGGATAATTA
CTCCTGGGGTGCAAATATGCGAGTTTGGATACGGATCGATGTATCAAAGTCATTGCGGAGAGGGATTAAAATCAATATCGATGGCCCGATGGGAGGATGTTGGATCCCTA
TGAAGTATGAGAAACTTTCGAAATTATGTTCTCATTGCGGTATCATTGGGCATCACTTTAAAGATTGCAGCCTGTTTTACAAGAATGTCAACTCCAAGAATACCTTCCAT
CAATATGGCCAATGGCTTCGGTATATAGGTAAATTATCGAAAATAGCTAAAACCCCTATCTCTGGTGGGAAAGGAGGTGATCTTGTTTTGAAGCCAAAGGTCAAAGAGGG
GAGGGGAGGGAAGGATTCGACTGATGTTTCTGATCAACATAGAACTCCCGGATTGAGGAAGGCGTTTCTAATTCTAAAAACGATGGAACAAGAAAAATGGCAGTTTACGG
GCTTGTATGGACAACCTGACCATAGACTTAGGTTTCAAACTTGGGAGCTTTTGAGAAAGTTACAGTCCATAGAAGATTCCCCTTGGGTGATAGGAGGAGATTTGAACGAA
ATTTTAAGGGACTATGAGAAAGTGGGAGGAGAGCCAAGAGATAGATCCCTGATTTCAAATTTTCGTAATGTGTTGTATGACTGCGAGTTGAAGGATTTGGGTTGTGAAGG
GAATCTGTTGACTTGGTGTAACATAAGGAATGGTGATGATCAGATGGGGCAAGGAAGTGAATTCTGGGTTGGTACCAAAGGTCAAATTATTGAAAGGGGAACTAAAGAAG
CCTATGCACAACCTAGACCTTTAGATTTTAAGGTTATTAATGACATGGAAATCGAATTGGAAACACTTTTACACCAGGAAGAACAATATTGGCGACAGAGACAGAATACG
GATGATCTACATGTAGCAGATTTTATTACGCCTACCCGATCATGGGATATGAGTAAATTGGCTCAATATTTTGATGAACAGGAGGTGGATGGCGAACATGATTTCCCTAC
TCAAAGACGATGGTTTGACGTGGTGGATATAGTCCAACAAGAGGATTTTGAGTTAGTACTGATAGGAGTATGGTCAATTTGGAATGACAGGAACAACGTGGTTCATAATA
GACCCATTATGAATTTCGAGGATAGATGTGGTTGGATTGTTGACTACTTACATAGATTTAAACATCTAAATAGTCCAGGTGTCAGTAGGAGTGGGGTGCAGAAGGAGTCG
TTATGTGAGGTACCGAGGGGTTGTGTTAAGGTCTTTGTGGATGTGGCTTGTGATGAAGTGAATAAAAGAGTAGGGTTTGGGGTAGCTATTGTAAGTGCAGATGGGAAGTT
GATTGTGACAATGGAGAATTGTGGGCAGAATTATATCTCGCCACAGATTGATGTTCGTGATGGGGCTCGTTTGGCTTCTCAAATGGGCTTTAAACATTGTCTCAATTTTT
CTGATTCCTTGGTCGTGATTTCCATGGTCAATAATGGTCATAGTGAAGTTTTGGCGAAAGCTCTAACGGTTATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATTTTCAACAAATTCTTGCTAGCTCTGGAACACCCCCTTCTGTTTACGAAGTCGTCGGAGATGGGGTTCGTTTTTACACAATTCTAGATTTAGGGGAGGATAATTA
CTCCTGGGGTGCAAATATGCGAGTTTGGATACGGATCGATGTATCAAAGTCATTGCGGAGAGGGATTAAAATCAATATCGATGGCCCGATGGGAGGATGTTGGATCCCTA
TGAAGTATGAGAAACTTTCGAAATTATGTTCTCATTGCGGTATCATTGGGCATCACTTTAAAGATTGCAGCCTGTTTTACAAGAATGTCAACTCCAAGAATACCTTCCAT
CAATATGGCCAATGGCTTCGGTATATAGGTAAATTATCGAAAATAGCTAAAACCCCTATCTCTGGTGGGAAAGGAGGTGATCTTGTTTTGAAGCCAAAGGTCAAAGAGGG
GAGGGGAGGGAAGGATTCGACTGATGTTTCTGATCAACATAGAACTCCCGGATTGAGGAAGGCGTTTCTAATTCTAAAAACGATGGAACAAGAAAAATGGCAGTTTACGG
GCTTGTATGGACAACCTGACCATAGACTTAGGTTTCAAACTTGGGAGCTTTTGAGAAAGTTACAGTCCATAGAAGATTCCCCTTGGGTGATAGGAGGAGATTTGAACGAA
ATTTTAAGGGACTATGAGAAAGTGGGAGGAGAGCCAAGAGATAGATCCCTGATTTCAAATTTTCGTAATGTGTTGTATGACTGCGAGTTGAAGGATTTGGGTTGTGAAGG
GAATCTGTTGACTTGGTGTAACATAAGGAATGGTGATGATCAGATGGGGCAAGGAAGTGAATTCTGGGTTGGTACCAAAGGTCAAATTATTGAAAGGGGAACTAAAGAAG
CCTATGCACAACCTAGACCTTTAGATTTTAAGGTTATTAATGACATGGAAATCGAATTGGAAACACTTTTACACCAGGAAGAACAATATTGGCGACAGAGACAGAATACG
GATGATCTACATGTAGCAGATTTTATTACGCCTACCCGATCATGGGATATGAGTAAATTGGCTCAATATTTTGATGAACAGGAGGTGGATGGCGAACATGATTTCCCTAC
TCAAAGACGATGGTTTGACGTGGTGGATATAGTCCAACAAGAGGATTTTGAGTTAGTACTGATAGGAGTATGGTCAATTTGGAATGACAGGAACAACGTGGTTCATAATA
GACCCATTATGAATTTCGAGGATAGATGTGGTTGGATTGTTGACTACTTACATAGATTTAAACATCTAAATAGTCCAGGTGTCAGTAGGAGTGGGGTGCAGAAGGAGTCG
TTATGTGAGGTACCGAGGGGTTGTGTTAAGGTCTTTGTGGATGTGGCTTGTGATGAAGTGAATAAAAGAGTAGGGTTTGGGGTAGCTATTGTAAGTGCAGATGGGAAGTT
GATTGTGACAATGGAGAATTGTGGGCAGAATTATATCTCGCCACAGATTGATGTTCGTGATGGGGCTCGTTTGGCTTCTCAAATGGGCTTTAAACATTGTCTCAATTTTT
CTGATTCCTTGGTCGTGATTTCCATGGTCAATAATGGTCATAGTGAAGTTTTGGCGAAAGCTCTAACGGTTATTTAG
Protein sequenceShow/hide protein sequence
MDFQQILASSGTPPSVYEVVGDGVRFYTILDLGEDNYSWGANMRVWIRIDVSKSLRRGIKINIDGPMGGCWIPMKYEKLSKLCSHCGIIGHHFKDCSLFYKNVNSKNTFH
QYGQWLRYIGKLSKIAKTPISGGKGGDLVLKPKVKEGRGGKDSTDVSDQHRTPGLRKAFLILKTMEQEKWQFTGLYGQPDHRLRFQTWELLRKLQSIEDSPWVIGGDLNE
ILRDYEKVGGEPRDRSLISNFRNVLYDCELKDLGCEGNLLTWCNIRNGDDQMGQGSEFWVGTKGQIIERGTKEAYAQPRPLDFKVINDMEIELETLLHQEEQYWRQRQNT
DDLHVADFITPTRSWDMSKLAQYFDEQEVDGEHDFPTQRRWFDVVDIVQQEDFELVLIGVWSIWNDRNNVVHNRPIMNFEDRCGWIVDYLHRFKHLNSPGVSRSGVQKES
LCEVPRGCVKVFVDVACDEVNKRVGFGVAIVSADGKLIVTMENCGQNYISPQIDVRDGARLASQMGFKHCLNFSDSLVVISMVNNGHSEVLAKALTVI