; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004929 (gene) of Snake gourd v1 genome

Gene IDTan0004929
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCCHC-type domain-containing protein
Genome locationLG01:22204271..22205224
RNA-Seq ExpressionTan0004929
SyntenyTan0004929
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
OMO61345.1 reverse transcriptase [Corchorus capsularis]1.7e-2933.88Show/hide
Query:  EELIEDWSKLNLTAEEE-ETSVDVKFDSEVNVEERLDSCLVGKLICGRYISAEVMRNTFKVAWKIESGLEVEGIGKNLYFFRFRNEMDCLRVTHGRPWLF
        E L + W   NLT EE  E +VD +   E   E   ++CL+GKL+  R ++ EVMRN   + WK+  GL+V  IG+NL+ F+F ++++  RV    PW F
Subjt:  EELIEDWSKLNLTAEEE-ETSVDVKFDSEVNVEERLDSCLVGKLICGRYISAEVMRNTFKVAWKIESGLEVEGIGKNLYFFRFRNEMDCLRVTHGRPWLF

Query:  DRFLLVL------------------------DILLACFNKEMAQRLGNAMGSFVEFDEGNGELSWGASMRVKIRIDISKPLRRGIKLNLDGPMGG-VLIS
        ++ LLVL                        D+ L   N+ + + +G + G+  E D    +++WG  +R + R++++KPLRRG  + L  P GG +LIS
Subjt:  DRFLLVL------------------------DILLACFNKEMAQRLGNAMGSFVEFDEGNGELSWGASMRVKIRIDISKPLRRGIKLNLDGPMGG-VLIS

Query:  IKYEKLPEFCSHCGIIGHHFKDCNQ-FYKNSTQGVKFHQYEQYLR
         +YEKLP+FC  CG + H   +C +       +G    +Y  +LR
Subjt:  IKYEKLPEFCSHCGIIGHHFKDCNQ-FYKNSTQGVKFHQYEQYLR

XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]3.2e-4139.26Show/hide
Query:  LIEDWSKLNLTAEEEETSVDVKFDSEVNVEERLDSCLVGKLICGRYISAEVMRNTFKVAWKIE-SGLEVEGIGKNLYFFRFRNEMDCLRVTHGRPWLFDR
        L+E+W    LT+EE++ +VD+   +     + L+  L+ KL+  R IS  V++NT K+AWK++     V+ IG N++ F F    D  R+    PW FDR
Subjt:  LIEDWSKLNLTAEEEETSVDVKFDSEVNVEERLDSCLVGKLICGRYISAEVMRNTFKVAWKIE-SGLEVEGIGKNLYFFRFRNEMDCLRVTHGRPWLFDR

Query:  FLLVL------------------------DILLACFNKEMAQRLGNAMGSFVEFDEGNGELSWGASMRVKIRIDISKPLRRGIKLNLDGPMGGVLISIKY
         L+++                        D+ LAC NK MA RLGNA+G F + +       WG+ +RV++R D+ KPL RGIKLNLDGPMGG  I I+Y
Subjt:  FLLVL------------------------DILLACFNKEMAQRLGNAMGSFVEFDEGNGELSWGASMRVKIRIDISKPLRRGIKLNLDGPMGGVLISIKY

Query:  EKLPEFCSHCGIIGHHFKDCNQFYKNSTQGVKFHQYEQYLRF
        E+LP+F  HCG + H  KDC+    +S    K  QY  +LRF
Subjt:  EKLPEFCSHCGIIGHHFKDCNQFYKNSTQGVKFHQYEQYLRF

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]5.5e-4944.49Show/hide
Query:  EELIEDWSKLNLTAEEEETSVDVKFDSEVNVEERLDSCLVGKLICGRYISAEVMRNTFKVAWKIESGLEVEGIGKNLYFFRFRNEMDCLRVTHGRPWLFD
        E L+ DW K  LT+EE+E ++DV  D+    E+ L   LVGKL+  R ISA+V+     +AWK+E  L VE IGKNL+ F F  E D  RV    PW FD
Subjt:  EELIEDWSKLNLTAEEEETSVDVKFDSEVNVEERLDSCLVGKLICGRYISAEVMRNTFKVAWKIESGLEVEGIGKNLYFFRFRNEMDCLRVTHGRPWLFD

Query:  RFLLVL------------------------DILLACFNKEMAQRLGNAMGSFVEFDEGNGELSWGASMRVKIRIDISKPLRRGIKLNLDGPMGGVLISIK
        + L+VL                        D+ ++  NK MA RLGNA+G+FV+ D      SWGAS+R+++ IDI+KPLRRGIK+N+DGPMGG  I I+
Subjt:  RFLLVL------------------------DILLACFNKEMAQRLGNAMGSFVEFDEGNGELSWGASMRVKIRIDISKPLRRGIKLNLDGPMGGVLISIK

Query:  YEKLPEFCSHCGIIGHHFKDCNQFYKNSTQGVK-FHQYEQYLRFV
        YE+LP+FC  CG+IGH   DC+  Y  +    +   +Y  +LRFV
Subjt:  YEKLPEFCSHCGIIGHHFKDCNQFYKNSTQGVK-FHQYEQYLRFV

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]7.7e-4335.83Show/hide
Query:  ELIEDWSKLNLTAEEEETSVDVKFDSEVNVEERLDSCLVGKLICGRYISAEVMRNTFKVAWKIE-SGLEVEGIGKNLYFFRFRNEMDCLRVTHGRPWLFD
        +L+E+W    LT+EEEET++DV   +      RL+  LVGKL   R I+  VM+NT + AWK+E +  EV+ +G NL+ F F   +D  ++    PW FD
Subjt:  ELIEDWSKLNLTAEEEETSVDVKFDSEVNVEERLDSCLVGKLICGRYISAEVMRNTFKVAWKIE-SGLEVEGIGKNLYFFRFRNEMDCLRVTHGRPWLFD

Query:  RFLLVL------------------------DILLACFNKEMAQRLGNAMGSFVEFDEGNGELSWGASMRVKIRIDISKPLRRGIKLNLDGPMGGVLISIK
        R L+++                        D+ L C  ++MA RLGNA+G F E D  +    WG+++RV++ +DISKPLRRGIKLNLDGP+GG  I I+
Subjt:  RFLLVL------------------------DILLACFNKEMAQRLGNAMGSFVEFDEGNGELSWGASMRVKIRIDISKPLRRGIKLNLDGPMGGVLISIK

Query:  YEKLPEFCSHCGIIGHHFKDCNQFYKNSTQGVKFHQYEQYLRFVNRPPVIARSPFSDEPRKHQTSVKTTTASMGTPMATSPKILGR--AKSRGSPAAIGV
        YE+LP+FC HCG+               +   K HQY  +LR+                   Q +VK T   M  P        G     S  SP   G 
Subjt:  YEKLPEFCSHCGIIGHHFKDCNQFYKNSTQGVKFHQYEQYLRFVNRPPVIARSPFSDEPRKHQTSVKTTTASMGTPMATSPKILGR--AKSRGSPAAIGV

Query:  DGISKSP
         G+  +P
Subjt:  DGISKSP

XP_028124075.1 uncharacterized protein LOC114321128 [Camellia sinensis]9.8e-3031.97Show/hide
Query:  EELIEDWSKLNLTAEEEETSVDVKFDSEVNVEERLDSCLVGKLICGRYISAEVMRNTFKVAWKIESGLEVEGIGKNLYFFRFRNEMDCLRVTHGRPWLFD
        + L++    L+LT+EE+   V +  DS   +  + D CLVGKL+  R  + E M+NT    W+   G++V  IG NL+ F F + +D  RV    PW FD
Subjt:  EELIEDWSKLNLTAEEEETSVDVKFDSEVNVEERLDSCLVGKLICGRYISAEVMRNTFKVAWKIESGLEVEGIGKNLYFFRFRNEMDCLRVTHGRPWLFD

Query:  RFLLVL------------------------DILLACFNKEMAQRLGNAMGSFVEFDEGNGELSWGASMRVKIRIDISKPLRRGIKLNLDGPMGGVLISIK
        + LL+L                        ++ L   NK++ Q +GNA+G F++ D  +G ++WG +MR+++ ID+ KPLRRG+KL L      + +  K
Subjt:  RFLLVL------------------------DILLACFNKEMAQRLGNAMGSFVEFDEGNGELSWGASMRVKIRIDISKPLRRGIKLNLDGPMGGVLISIK

Query:  YEKLPEFCSHCGIIGHHFKDCNQ--FYKNSTQGVKFHQYEQYLRFVN-------RPPVIARSPFSDEPRKHQTSVKTTTASM----GTPMATSP
        YE+LP +C  CG +GH  ++C+    + + T+ V   QY  +LR  N       R   I +      P      ++T  A M    G P+  SP
Subjt:  YEKLPEFCSHCGIIGHHFKDCNQ--FYKNSTQGVKFHQYEQYLRFVN-------RPPVIARSPFSDEPRKHQTSVKTTTASM----GTPMATSP

TrEMBL top hitse value%identityAlignment
A0A2N9EEA9 CCHC-type domain-containing protein2.1e-3033.33Show/hide
Query:  EELIEDWSKLNLTAEEEETSVDVKFDSEVNVEERLDSCLVGKLICGRYISAEVMRNTFKVAWKIESGLEVEGIGKNLYFFRFRNEMDCLRVTHGRPWLFD
        E L++DW K +LT E+E    DV+ D+  N  E    CLVGKL+  +Y +   ++ T    W    G+    I  NL+ F+F N  +  RV  G PWLFD
Subjt:  EELIEDWSKLNLTAEEEETSVDVKFDSEVNVEERLDSCLVGKLICGRYISAEVMRNTFKVAWKIESGLEVEGIGKNLYFFRFRNEMDCLRVTHGRPWLFD

Query:  RFLLVLDIL------------LACF------------NKEMAQRLGNAMGSFVEFDEGNGELSWGASMRVKIRIDISKPLRRGIKLNLDGPMGGVLISIK
          LL+L+++             +CF             +E  +R+G+  G  ++ D     + WG ++R+++ +DIS+P  RG  +     +G + +S K
Subjt:  RFLLVLDIL------------LACF------------NKEMAQRLGNAMGSFVEFDEGNGELSWGASMRVKIRIDISKPLRRGIKLNLDGPMGGVLISIK

Query:  YEKLPEFCSHCGIIGHHFKDCNQFYKNSTQ-GVKFHQYEQYLR
        YE+LP  C HCG IGH  +DC    K   Q G  F QY  +LR
Subjt:  YEKLPEFCSHCGIIGHHFKDCNQFYKNSTQ-GVKFHQYEQYLR

A0A2N9FJK9 CCHC-type domain-containing protein1.2e-3032.92Show/hide
Query:  EELIEDWSKLNLTAEEEETSVDVKFDSEVNVEERLDSCLVGKLICGRYISAEVMRNTFKVAWKIESGLEVEGIGKNLYFFRFRNEMDCLRVTHGRPWLFD
        +EL+EDW + +LT E+E     +  ++  + E     CL+GKL+  ++ +   ++ T    W   SG+  + +G+NL+ F+F+N+ DC RV HG PWLFD
Subjt:  EELIEDWSKLNLTAEEEETSVDVKFDSEVNVEERLDSCLVGKLICGRYISAEVMRNTFKVAWKIESGLEVEGIGKNLYFFRFRNEMDCLRVTHGRPWLFD

Query:  RFLLVLDIL------------LACF------------NKEMAQRLGNAMGSFVEFDEGNGELSWGASMRVKIRIDISKPLRRGIKLNLDGPMGGVLISIK
          LLVL++               CF             K+  +RLG A+G+    D     + WG  +RV+I +D++KPL+RG +L   G  G   I+ K
Subjt:  RFLLVLDIL------------LACF------------NKEMAQRLGNAMGSFVEFDEGNGELSWGASMRVKIRIDISKPLRRGIKLNLDGPMGGVLISIK

Query:  YEKLPEFCSHCGIIGHHFKDCN-QFYKNSTQGVKFHQYEQYLR
        YE+LP  C HCG +GH  ++C  + +       +F  Y  +LR
Subjt:  YEKLPEFCSHCGIIGHHFKDCN-QFYKNSTQGVKFHQYEQYLR

A0A6J1BSZ1 uncharacterized protein LOC1110054811.6e-4139.26Show/hide
Query:  LIEDWSKLNLTAEEEETSVDVKFDSEVNVEERLDSCLVGKLICGRYISAEVMRNTFKVAWKIE-SGLEVEGIGKNLYFFRFRNEMDCLRVTHGRPWLFDR
        L+E+W    LT+EE++ +VD+   +     + L+  L+ KL+  R IS  V++NT K+AWK++     V+ IG N++ F F    D  R+    PW FDR
Subjt:  LIEDWSKLNLTAEEEETSVDVKFDSEVNVEERLDSCLVGKLICGRYISAEVMRNTFKVAWKIE-SGLEVEGIGKNLYFFRFRNEMDCLRVTHGRPWLFDR

Query:  FLLVL------------------------DILLACFNKEMAQRLGNAMGSFVEFDEGNGELSWGASMRVKIRIDISKPLRRGIKLNLDGPMGGVLISIKY
         L+++                        D+ LAC NK MA RLGNA+G F + +       WG+ +RV++R D+ KPL RGIKLNLDGPMGG  I I+Y
Subjt:  FLLVL------------------------DILLACFNKEMAQRLGNAMGSFVEFDEGNGELSWGASMRVKIRIDISKPLRRGIKLNLDGPMGGVLISIKY

Query:  EKLPEFCSHCGIIGHHFKDCNQFYKNSTQGVKFHQYEQYLRF
        E+LP+F  HCG + H  KDC+    +S    K  QY  +LRF
Subjt:  EKLPEFCSHCGIIGHHFKDCNQFYKNSTQGVKFHQYEQYLRF

A0A6J1DU55 uncharacterized protein LOC1110231352.7e-4944.49Show/hide
Query:  EELIEDWSKLNLTAEEEETSVDVKFDSEVNVEERLDSCLVGKLICGRYISAEVMRNTFKVAWKIESGLEVEGIGKNLYFFRFRNEMDCLRVTHGRPWLFD
        E L+ DW K  LT+EE+E ++DV  D+    E+ L   LVGKL+  R ISA+V+     +AWK+E  L VE IGKNL+ F F  E D  RV    PW FD
Subjt:  EELIEDWSKLNLTAEEEETSVDVKFDSEVNVEERLDSCLVGKLICGRYISAEVMRNTFKVAWKIESGLEVEGIGKNLYFFRFRNEMDCLRVTHGRPWLFD

Query:  RFLLVL------------------------DILLACFNKEMAQRLGNAMGSFVEFDEGNGELSWGASMRVKIRIDISKPLRRGIKLNLDGPMGGVLISIK
        + L+VL                        D+ ++  NK MA RLGNA+G+FV+ D      SWGAS+R+++ IDI+KPLRRGIK+N+DGPMGG  I I+
Subjt:  RFLLVL------------------------DILLACFNKEMAQRLGNAMGSFVEFDEGNGELSWGASMRVKIRIDISKPLRRGIKLNLDGPMGGVLISIK

Query:  YEKLPEFCSHCGIIGHHFKDCNQFYKNSTQGVK-FHQYEQYLRFV
        YE+LP+FC  CG+IGH   DC+  Y  +    +   +Y  +LRFV
Subjt:  YEKLPEFCSHCGIIGHHFKDCNQFYKNSTQGVK-FHQYEQYLRFV

A0A6J1DX30 uncharacterized protein LOC1110248743.7e-4335.83Show/hide
Query:  ELIEDWSKLNLTAEEEETSVDVKFDSEVNVEERLDSCLVGKLICGRYISAEVMRNTFKVAWKIE-SGLEVEGIGKNLYFFRFRNEMDCLRVTHGRPWLFD
        +L+E+W    LT+EEEET++DV   +      RL+  LVGKL   R I+  VM+NT + AWK+E +  EV+ +G NL+ F F   +D  ++    PW FD
Subjt:  ELIEDWSKLNLTAEEEETSVDVKFDSEVNVEERLDSCLVGKLICGRYISAEVMRNTFKVAWKIE-SGLEVEGIGKNLYFFRFRNEMDCLRVTHGRPWLFD

Query:  RFLLVL------------------------DILLACFNKEMAQRLGNAMGSFVEFDEGNGELSWGASMRVKIRIDISKPLRRGIKLNLDGPMGGVLISIK
        R L+++                        D+ L C  ++MA RLGNA+G F E D  +    WG+++RV++ +DISKPLRRGIKLNLDGP+GG  I I+
Subjt:  RFLLVL------------------------DILLACFNKEMAQRLGNAMGSFVEFDEGNGELSWGASMRVKIRIDISKPLRRGIKLNLDGPMGGVLISIK

Query:  YEKLPEFCSHCGIIGHHFKDCNQFYKNSTQGVKFHQYEQYLRFVNRPPVIARSPFSDEPRKHQTSVKTTTASMGTPMATSPKILGR--AKSRGSPAAIGV
        YE+LP+FC HCG+               +   K HQY  +LR+                   Q +VK T   M  P        G     S  SP   G 
Subjt:  YEKLPEFCSHCGIIGHHFKDCNQFYKNSTQGVKFHQYEQYLRFVNRPPVIARSPFSDEPRKHQTSVKTTTASMGTPMATSPKILGR--AKSRGSPAAIGV

Query:  DGISKSP
         G+  +P
Subjt:  DGISKSP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding8.6e-0821.26Show/hide
Query:  EEEETSVDVKFDSEVNVEERLDSCLVGKLICGRYISAEVMRNTFKVAWKIESGLEVEGIGKNLYFFRFRNEMDCLRVTHGRPW-LFDRFLLVLD------
        E+EE  + +  +    +      C++ K++ G  I   V+    +  WK    + V  + +  +  RF  E + +    G PW +   +LLV D      
Subjt:  EEEETSVDVKFDSEVNVEERLDSCLVGKLICGRYISAEVMRNTFKVAWKIESGLEVEGIGKNLYFFRFRNEMDCLRVTHGRPW-LFDRFLLVLD------

Query:  -----------------ILLACFNKEMAQRLGNAMGSFVEFDEGNGELSWGASMRVKIRIDISKPLRRGIKLNLDGPMGGVLISIKYEKLPEFCSHCGII
                         I    +++ +   +   +G  ++ D        G   RV I ++++KPL+  + +N      G    + YE L + CS CGI 
Subjt:  -----------------ILLACFNKEMAQRLGNAMGSFVEFDEGNGELSWGASMRVKIRIDISKPLRRGIKLNLDGPMGGVLISIKYEKLPEFCSHCGII

Query:  GHHFKDC
        GH    C
Subjt:  GHHFKDC

AT3G47920.1 unknown protein1.2e-0625.35Show/hide
Query:  FRFRNEMDCLRVTHGRPWLFDRFLLV------------------------LDILLACFNKEMAQRLGNAMGSFVEFDEGNGELSWGASMRVKIRIDISKP
        F F+NE+D L V     WLF+ + +                         + +L  C  +E A  + + +G  +  D  +  ++  A +RV++RI I+  
Subjt:  FRFRNEMDCLRVTHGRPWLFDRFLLV------------------------LDILLACFNKEMAQRLGNAMGSFVEFDEGNGELSWGASMRVKIRIDISKP

Query:  LRRGIKLNLDGPMGGVLISIKYEKLPEFCSHCGIIGHHFKDC
        LR   ++  D      LI  +YE+L   CS C  + HH   C
Subjt:  LRRGIKLNLDGPMGGVLISIKYEKLPEFCSHCGIIGHHFKDC

AT5G36228.1 nucleic acid binding;zinc ion binding3.3e-0721.1Show/hide
Query:  EDWSKL-NLTAEEEETSVDVKFDSEVNVEERLDSCLVGKLICGRYISAEVMRNTFKVAWKIESGLEVEG--IGKNLYFFRFRNEMDCLRVTHGRPWLFDR
        E W+ + ++    EE  + + + + V         L+G+++  +  S E  R   ++ ++   G +V G  +    +  RFR+E+D L      PW+F+ 
Subjt:  EDWSKL-NLTAEEEETSVDVKFDSEVNVEERLDSCLVGKLICGRYISAEVMRNTFKVAWKIESGLEVEG--IGKNLYFFRFRNEMDCLRVTHGRPWLFDR

Query:  FLLVLD----------------------ILLACFNKEMAQRLGNAMGSFVEFDEGNGELSWGASMRVKIRIDISKPLRRGIKLNLDGPMGGVLISIKYEK
        + + L                       I L   ++   + + + +G  V  D      S    +RVK+R+D ++PLR   ++         +I  +YEK
Subjt:  FLLVLD----------------------ILLACFNKEMAQRLGNAMGSFVEFDEGNGELSWGASMRVKIRIDISKPLRRGIKLNLDGPMGGVLISIKYEK

Query:  LPEFCSHCGIIGHHFKDC
        L   C++C  + H    C
Subjt:  LPEFCSHCGIIGHHFKDC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGGCTTCAAGCCAACCAAAAGGGCAAGAGGAACTGATAGAGGACTGGAGCAAACTTAATCTAACAGCAGAGGAAGAAGAGACTTCAGTCGATGTGAAGTTTGATTC
GGAGGTGAATGTGGAAGAGAGGCTGGATAGTTGTCTTGTGGGAAAACTGATCTGTGGAAGATATATTTCGGCAGAGGTTATGCGGAATACCTTCAAGGTAGCATGGAAGA
TCGAATCGGGACTAGAGGTCGAGGGTATTGGCAAGAATCTGTACTTCTTTCGATTCCGTAATGAGATGGATTGTTTGAGAGTGACTCACGGGAGACCATGGCTTTTCGAC
AGGTTCTTACTTGTTCTAGACATTCTGTTGGCATGCTTCAACAAAGAAATGGCTCAGAGATTAGGTAACGCTATGGGCTCTTTTGTAGAGTTTGACGAGGGGAATGGAGA
ACTGAGTTGGGGGGCAAGCATGAGAGTGAAAATTAGGATTGATATATCAAAACCTTTGAGAAGGGGAATTAAACTGAATCTAGATGGCCCCATGGGTGGAGTTCTGATTT
CTATCAAGTATGAGAAATTACCAGAATTTTGTTCTCATTGTGGAATCATAGGACACCATTTCAAGGACTGTAACCAATTTTACAAGAATTCGACCCAAGGTGTGAAATTC
CACCAATATGAACAGTACCTGAGATTCGTGAACAGACCCCCAGTGATAGCCCGATCCCCTTTCTCAGATGAACCAAGGAAGCATCAGACTAGTGTGAAAACAACAACAGC
GAGTATGGGTACACCGATGGCTACTTCTCCAAAGATCCTCGGAAGAGCGAAGAGCAGAGGATCCCCCGCCGCGATCGGCGTTGATGGGATCTCCAAATCACCACCGCATT
GA
mRNA sequenceShow/hide mRNA sequence
ATGGGGGCTTCAAGCCAACCAAAAGGGCAAGAGGAACTGATAGAGGACTGGAGCAAACTTAATCTAACAGCAGAGGAAGAAGAGACTTCAGTCGATGTGAAGTTTGATTC
GGAGGTGAATGTGGAAGAGAGGCTGGATAGTTGTCTTGTGGGAAAACTGATCTGTGGAAGATATATTTCGGCAGAGGTTATGCGGAATACCTTCAAGGTAGCATGGAAGA
TCGAATCGGGACTAGAGGTCGAGGGTATTGGCAAGAATCTGTACTTCTTTCGATTCCGTAATGAGATGGATTGTTTGAGAGTGACTCACGGGAGACCATGGCTTTTCGAC
AGGTTCTTACTTGTTCTAGACATTCTGTTGGCATGCTTCAACAAAGAAATGGCTCAGAGATTAGGTAACGCTATGGGCTCTTTTGTAGAGTTTGACGAGGGGAATGGAGA
ACTGAGTTGGGGGGCAAGCATGAGAGTGAAAATTAGGATTGATATATCAAAACCTTTGAGAAGGGGAATTAAACTGAATCTAGATGGCCCCATGGGTGGAGTTCTGATTT
CTATCAAGTATGAGAAATTACCAGAATTTTGTTCTCATTGTGGAATCATAGGACACCATTTCAAGGACTGTAACCAATTTTACAAGAATTCGACCCAAGGTGTGAAATTC
CACCAATATGAACAGTACCTGAGATTCGTGAACAGACCCCCAGTGATAGCCCGATCCCCTTTCTCAGATGAACCAAGGAAGCATCAGACTAGTGTGAAAACAACAACAGC
GAGTATGGGTACACCGATGGCTACTTCTCCAAAGATCCTCGGAAGAGCGAAGAGCAGAGGATCCCCCGCCGCGATCGGCGTTGATGGGATCTCCAAATCACCACCGCATT
GA
Protein sequenceShow/hide protein sequence
MGASSQPKGQEELIEDWSKLNLTAEEEETSVDVKFDSEVNVEERLDSCLVGKLICGRYISAEVMRNTFKVAWKIESGLEVEGIGKNLYFFRFRNEMDCLRVTHGRPWLFD
RFLLVLDILLACFNKEMAQRLGNAMGSFVEFDEGNGELSWGASMRVKIRIDISKPLRRGIKLNLDGPMGGVLISIKYEKLPEFCSHCGIIGHHFKDCNQFYKNSTQGVKF
HQYEQYLRFVNRPPVIARSPFSDEPRKHQTSVKTTTASMGTPMATSPKILGRAKSRGSPAAIGVDGISKSPPH