; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg021419 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg021419
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionZinc finger CCCH domain-containing protein
Genome locationscaffold4:601577..614371
RNA-Seq ExpressionSpg021419
SyntenySpg021419
Gene Ontology termsGO:0044237 - cellular metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0016301 - kinase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR000571 - Zinc finger, CCCH-type
IPR025558 - Domain of unknown function DUF4283
IPR036855 - Zinc finger, CCCH-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047189.1 hypothetical protein E6C27_scaffold83G00690 [Cucumis melo var. makuwa]6.9e-4037.66Show/hide
Query:  STVLRNRKVSYWIRKNQEVLKVNFADFWVVSRLFAHNSWKEIIEVLEFHFKSKISINPLFADKALLKFEDDKVVSSLDSVGKWKVFGNFHLLLEKWNKKR
        S+ L   K   W+ +N EV+  NF + W++++LFA +  ++I ++LE +F++KI INPLF + AL+  ++  +   + + GKW+V G+F+L  EKW+K +
Subjt:  STVLRNRKVSYWIRKNQEVLKVNFADFWVVSRLFAHNSWKEIIEVLEFHFKSKISINPLFADKALLKFEDDKVVSSLDSVGKWKVFGNFHLLLEKWNKKR

Query:  HSHPCFMEGYGGWIAIKNFPLDYWFRQSFEAIGEYFGGLVNISSETLNMTIVSEARIQVKTNLCGFMPATIELKDSYGGNIYLHFGDVAILDSPNIIHHS
        +S P  M+GYGGW+ IKN     W   + E                      SEARIQVK NLCGF+P+TIE+ D   GNI+L+FGD   L+ P     +
Subjt:  HSHPCFMEGYGGWIAIKNFPLDYWFRQSFEAIGEYFGGLVNISSETLNMTIVSEARIQVKTNLCGFMPATIELKDSYGGNIYLHFGDVAILDSPNIIHHS

Query:  LELSDFSNPMDLFRIKQVMEDEGFD----SHFQNPDVEY
        + +SDF   + L RI +V++DEG D      F+ P++ +
Subjt:  LELSDFSNPMDLFRIKQVMEDEGFD----SHFQNPDVEY

XP_008447068.1 PREDICTED: zinc finger CCCH domain-containing protein 37 [Cucumis melo]9.9e-3954.76Show/hide
Query:  GKLGC-YYMKTGDCKFGERCKFHHPIDRSAPKQAAQHNVKLTLAGLPRREVYISAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRLIVWGTLAAGSLM
        G++ C YYMKTGDCKFGERCKFHHPIDRSAPKQ A HNVKLTLAGLPRRE                                                  
Subjt:  GKLGC-YYMKTGDCKFGERCKFHHPIDRSAPKQAAQHNVKLTLAGLPRREVYISAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRLIVWGTLAAGSLM

Query:  GEDRIRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPPGEVMAMAAIQSVPGKEGEERIDESVNEQ
                     +AIICPYYLKTGTCKYG+TCKFDHPPPGEVM MAAIQS PGKEGE+RIDESV+E+
Subjt:  GEDRIRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPPGEVMAMAAIQSVPGKEGEERIDESVNEQ

XP_038888750.1 zinc finger CCCH domain-containing protein 37 isoform X1 [Benincasa hispida]8.1e-4157.4Show/hide
Query:  GKLGC-YYMKTGDCKFGERCKFHHPIDRSAPKQAAQHNVKLTLAGLPRREVYISAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRLIVWGTLAAGSLM
        G++ C +YMKTGDCKFGERCKFHHPIDRSAPKQ AQ  VKLTLAGLPRRE                                                  
Subjt:  GKLGC-YYMKTGDCKFGERCKFHHPIDRSAPKQAAQHNVKLTLAGLPRREVYISAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRLIVWGTLAAGSLM

Query:  GEDRIRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPPGEVMAMAAIQSVPGKEGEERIDESVNEQQ
                     DAIICPYYLKTGTCKYG+TCKFDHPPPGEVMAMAAIQSVPGKEGEERIDESVNEQQ
Subjt:  GEDRIRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPPGEVMAMAAIQSVPGKEGEERIDESVNEQQ

XP_038888751.1 zinc finger CCCH domain-containing protein 37 isoform X2 [Benincasa hispida]8.1e-4157.4Show/hide
Query:  GKLGC-YYMKTGDCKFGERCKFHHPIDRSAPKQAAQHNVKLTLAGLPRREVYISAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRLIVWGTLAAGSLM
        G++ C +YMKTGDCKFGERCKFHHPIDRSAPKQ AQ  VKLTLAGLPRRE                                                  
Subjt:  GKLGC-YYMKTGDCKFGERCKFHHPIDRSAPKQAAQHNVKLTLAGLPRREVYISAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRLIVWGTLAAGSLM

Query:  GEDRIRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPPGEVMAMAAIQSVPGKEGEERIDESVNEQQ
                     DAIICPYYLKTGTCKYG+TCKFDHPPPGEVMAMAAIQSVPGKEGEERIDESVNEQQ
Subjt:  GEDRIRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPPGEVMAMAAIQSVPGKEGEERIDESVNEQQ

XP_038904899.1 uncharacterized protein LOC120091119 isoform X2 [Benincasa hispida]4.5e-3953.85Show/hide
Query:  GKWKVFGNFHLLLEKWNKKRHSHPCFMEGYGGWIAIKNFPLDYWFRQSFEAIGEYFGGLVNISSETLNMTIVSEARIQVKTNLCGFMPATIELKDSYGGN
        GKW+ FG+FHL  E+WN   H  P ++ GYGGWI+IKN PLDYW +Q+FEAIG+YFGGL +I+ E LN+  V +A I+VK NLCGF+PATIE+ +   G+
Subjt:  GKWKVFGNFHLLLEKWNKKRHSHPCFMEGYGGWIAIKNFPLDYWFRQSFEAIGEYFGGLVNISSETLNMTIVSEARIQVKTNLCGFMPATIELKDSYGGN

Query:  IYLHFGDVAILDSPNIIHHSLELSDFSNPMDLFRIKQVMEDEG
        IYL+FGD++  + P+ +   L  SDF+NP+DL R+ +V   EG
Subjt:  IYLHFGDVAILDSPNIIHHSLELSDFSNPMDLFRIKQVMEDEG

TrEMBL top hitse value%identityAlignment
A0A1S3BGI6 zinc finger CCCH domain-containing protein 374.8e-3954.76Show/hide
Query:  GKLGC-YYMKTGDCKFGERCKFHHPIDRSAPKQAAQHNVKLTLAGLPRREVYISAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRLIVWGTLAAGSLM
        G++ C YYMKTGDCKFGERCKFHHPIDRSAPKQ A HNVKLTLAGLPRRE                                                  
Subjt:  GKLGC-YYMKTGDCKFGERCKFHHPIDRSAPKQAAQHNVKLTLAGLPRREVYISAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRLIVWGTLAAGSLM

Query:  GEDRIRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPPGEVMAMAAIQSVPGKEGEERIDESVNEQ
                     +AIICPYYLKTGTCKYG+TCKFDHPPPGEVM MAAIQS PGKEGE+RIDESV+E+
Subjt:  GEDRIRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPPGEVMAMAAIQSVPGKEGEERIDESVNEQ

A0A5A7U128 Uncharacterized protein3.3e-4037.66Show/hide
Query:  STVLRNRKVSYWIRKNQEVLKVNFADFWVVSRLFAHNSWKEIIEVLEFHFKSKISINPLFADKALLKFEDDKVVSSLDSVGKWKVFGNFHLLLEKWNKKR
        S+ L   K   W+ +N EV+  NF + W++++LFA +  ++I ++LE +F++KI INPLF + AL+  ++  +   + + GKW+V G+F+L  EKW+K +
Subjt:  STVLRNRKVSYWIRKNQEVLKVNFADFWVVSRLFAHNSWKEIIEVLEFHFKSKISINPLFADKALLKFEDDKVVSSLDSVGKWKVFGNFHLLLEKWNKKR

Query:  HSHPCFMEGYGGWIAIKNFPLDYWFRQSFEAIGEYFGGLVNISSETLNMTIVSEARIQVKTNLCGFMPATIELKDSYGGNIYLHFGDVAILDSPNIIHHS
        +S P  M+GYGGW+ IKN     W   + E                      SEARIQVK NLCGF+P+TIE+ D   GNI+L+FGD   L+ P     +
Subjt:  HSHPCFMEGYGGWIAIKNFPLDYWFRQSFEAIGEYFGGLVNISSETLNMTIVSEARIQVKTNLCGFMPATIELKDSYGGNIYLHFGDVAILDSPNIIHHS

Query:  LELSDFSNPMDLFRIKQVMEDEGFD----SHFQNPDVEY
        + +SDF   + L RI +V++DEG D      F+ P++ +
Subjt:  LELSDFSNPMDLFRIKQVMEDEGFD----SHFQNPDVEY

A0A5A7VAA0 Zinc finger CCCH domain-containing protein 376.3e-3954.17Show/hide
Query:  GKLGC-YYMKTGDCKFGERCKFHHPIDRSAPKQAAQHNVKLTLAGLPRREVYISAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRLIVWGTLAAGSLM
        G++ C YYMKTGDCKFGERCKFHHP+DRSAPKQ A HNVKLTLAGLPRRE                                                  
Subjt:  GKLGC-YYMKTGDCKFGERCKFHHPIDRSAPKQAAQHNVKLTLAGLPRREVYISAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRLIVWGTLAAGSLM

Query:  GEDRIRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPPGEVMAMAAIQSVPGKEGEERIDESVNEQ
                     +AIICPYYLKTGTCKYG+TCKFDHPPPGEVM MAAIQS PGKEGE+RIDESV+E+
Subjt:  GEDRIRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPPGEVMAMAAIQSVPGKEGEERIDESVNEQ

A0A6J1GM64 zinc finger CCCH domain-containing protein 37 isoform X21.4e-3855.83Show/hide
Query:  GKLGC-YYMKTGDCKFGERCKFHHPIDRSAPKQAAQHNVKLTLAGLPRREVYISAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRLIVWGTLAAGSLM
        G++ C YYMKTGDCKFGERCKFHHPIDRSAP QA+QHNVKLTLAGLPRRE                                                  
Subjt:  GKLGC-YYMKTGDCKFGERCKFHHPIDRSAPKQAAQHNVKLTLAGLPRREVYISAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRLIVWGTLAAGSLM

Query:  GEDRIRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPPGEVMAMAAIQSVPGKEGEERIDE
                     DAIICPYYLKTGTCKYGSTCKFDHPPPGEVMAM A+Q VPGKEGEERIDE
Subjt:  GEDRIRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPPGEVMAMAAIQSVPGKEGEERIDE

A0A6J1HW69 zinc finger CCCH domain-containing protein 371.4e-3855.83Show/hide
Query:  GKLGC-YYMKTGDCKFGERCKFHHPIDRSAPKQAAQHNVKLTLAGLPRREVYISAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRLIVWGTLAAGSLM
        G++ C YYMKTGDCKFGERCKFHHPIDRSAP QA+QHNVKLTLAGLPRRE                                                  
Subjt:  GKLGC-YYMKTGDCKFGERCKFHHPIDRSAPKQAAQHNVKLTLAGLPRREVYISAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRLIVWGTLAAGSLM

Query:  GEDRIRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPPGEVMAMAAIQSVPGKEGEERIDE
                     DAIICPYYLKTGTCKYGSTCKFDHPPPGEVMAM A+Q VPGKEGEERIDE
Subjt:  GEDRIRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPPGEVMAMAAIQSVPGKEGEERIDE

SwissProt top hitse value%identityAlignment
Q2QT65 Zinc finger CCCH domain-containing protein 661.0e-0932.86Show/hide
Query:  YYMKTGDCKFGERCKFHHPIDRSAPKQAAQHNVKLTL-AGLPRREVYI-SAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRLIVWGTLAAGSLMGEDR
        YYM+TG C+FG  CKF+HP +R     AA+ N +     G P  + Y+ +  C    T +    HP   +   N        +L V G            
Subjt:  YYMKTGDCKFGERCKFHHPIDRSAPKQAAQHNVKLTL-AGLPRREVYI-SAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRLIVWGTLAAGSLMGEDR

Query:  IRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPPGEVM
              + P+   C YYL+TG CK+ STCKF HP P   M
Subjt:  IRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPPGEVM

Q2R4J4 Zinc finger CCCH domain-containing protein 633.5e-1032.86Show/hide
Query:  YYMKTGDCKFGERCKFHHPIDRSAPKQAAQHNVKL-TLAGLPRREVYI-SAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRLIVWGTLAAGSLMGEDR
        YYM+TG C+FG  CKF+HP DR     AA+   +     G P  + Y+ +  C    T   C  H      A                T    + +G   
Subjt:  YYMKTGDCKFGERCKFHHPIDRSAPKQAAQHNVKL-TLAGLPRREVYI-SAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRLIVWGTLAAGSLMGEDR

Query:  IRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPPGEVM
              + P+   C YYL+TG CK+GSTCKF HP P   M
Subjt:  IRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPPGEVM

Q5NAW2 Zinc finger CCCH domain-containing protein 65.9e-1032.21Show/hide
Query:  YYMKTGDCKFGERCKFHHPIDRSAPK--QAAQHNVKLTL---AGLPRREVYI-------SAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRLIVWGTL
        YY++TG C FG+RC+++HP DR   +    A++   L     AG P  E Y+          C      Q+  V P+  +N+G      F  RL      
Subjt:  YYMKTGDCKFGERCKFHHPIDRSAPK--QAAQHNVKLTL---AGLPRREVYI-------SAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRLIVWGTL

Query:  AAGSLMGEDRIRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPPGEV
              GE               C YY+KTG CK+G+TCKF HP  G V
Subjt:  AAGSLMGEDRIRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPPGEV

Q5ZDJ6 Zinc finger CCCH domain-containing protein 81.2e-1836.81Show/hide
Query:  YYMKTGDCKFGERCKFHHPIDRSAPK-----QAAQHNVKLTLAGLPRREVYISAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRLIVWGTLAAGSLMG
        +YMKTG CKF +RCKFHHPIDRSAP      + A+ +V+LTLAGLPRRE                                                   
Subjt:  YYMKTGDCKFGERCKFHHPIDRSAPK-----QAAQHNVKLTLAGLPRREVYISAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRLIVWGTLAAGSLMG

Query:  EDRIRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPPGEVMA
                    DA++C +Y+KTG CK+G  CKFDHPPP E +A
Subjt:  EDRIRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPPGEVMA

Q941Q3 Zinc finger CCCH domain-containing protein 373.0e-2243.66Show/hide
Query:  YYMKTGDCKFGERCKFHHPIDR--SAPKQAAQH-NVKLTLAGLPRREVYISAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRLIVWGTLAAGSLMGED
        YYMKTG+CKFGERCKFHHP DR  +  KQA Q  NVKL+LAG PRRE                                                     
Subjt:  YYMKTGDCKFGERCKFHHPIDR--SAPKQAAQH-NVKLTLAGLPRREVYISAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRLIVWGTLAAGSLMGED

Query:  RIRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPPGEVMA
                   A+ CPYY+KTGTCKYG+TCKFDHPPPGEVMA
Subjt:  RIRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPPGEVMA

Arabidopsis top hitse value%identityAlignment
AT1G04990.2 Zinc finger C-x8-C-x5-C-x3-H type family protein6.1e-1026.32Show/hide
Query:  PIRCFVIGEDF-YGKLGC-YYMKTGDCKFGERCKFHHP--------------IDRSAPKQAAQHNVKLTLAGLPRREVYISAMCLVRFTLQECGVHPIWS
        P+   VIG     G+  C YY++TG C+FG  CKFHHP                 +  + A+   +  T   LPR +V  S + ++    Q       W+
Subjt:  PIRCFVIGEDF-YGKLGC-YYMKTGDCKFGERCKFHHP--------------IDRSAPKQAAQHNVKLTLAGLPRREVYISAMCLVRFTLQECGVHPIWS

Query:  --SNAGNSFFNKFLSRLIVWGTLAAGSLMGEDRIRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHP------PPGEVMAMAAIQSVPGK
            A NS +N          + +    +  +R  +  +  P+   C +++ TGTCKYG  CK+ HP      PP  ++    + + PG+
Subjt:  --SNAGNSFFNKFLSRLIVWGTLAAGSLMGEDRIRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHP------PPGEVMAMAAIQSVPGK

AT2G47850.1 Zinc finger C-x8-C-x5-C-x3-H type family protein4.6e-1032.89Show/hide
Query:  IGEDFY----GKLGC-YYMKTGDCKFGERCKFHHPIDRSAPKQAAQHNVKL-TLAGLPRREVYI-SAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRL
        +G D Y    G   C YYM+TG C +G RC+++HP DR++ +   +   +     G P  + Y+ +  C  +F       HP    NAG S  +  L+  
Subjt:  IGEDFY----GKLGC-YYMKTGDCKFGERCKFHHPIDRSAPKQAAQHNVKL-TLAGLPRREVYI-SAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRL

Query:  IVWGTLAAGSLMGEDRIRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPP
         ++G            +R   N       C YYLKTG CK+G TCKF HP P
Subjt:  IVWGTLAAGSLMGEDRIRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPP

AT2G47850.3 Zinc finger C-x8-C-x5-C-x3-H type family protein4.6e-1032.89Show/hide
Query:  IGEDFY----GKLGC-YYMKTGDCKFGERCKFHHPIDRSAPKQAAQHNVKL-TLAGLPRREVYI-SAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRL
        +G D Y    G   C YYM+TG C +G RC+++HP DR++ +   +   +     G P  + Y+ +  C  +F       HP    NAG S  +  L+  
Subjt:  IGEDFY----GKLGC-YYMKTGDCKFGERCKFHHPIDRSAPKQAAQHNVKL-TLAGLPRREVYI-SAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRL

Query:  IVWGTLAAGSLMGEDRIRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPP
         ++G            +R   N       C YYLKTG CK+G TCKF HP P
Subjt:  IVWGTLAAGSLMGEDRIRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPP

AT3G12680.1 floral homeotic protein (HUA1)2.1e-2343.66Show/hide
Query:  YYMKTGDCKFGERCKFHHPIDR--SAPKQAAQH-NVKLTLAGLPRREVYISAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRLIVWGTLAAGSLMGED
        YYMKTG+CKFGERCKFHHP DR  +  KQA Q  NVKL+LAG PRRE                                                     
Subjt:  YYMKTGDCKFGERCKFHHPIDR--SAPKQAAQH-NVKLTLAGLPRREVYISAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRLIVWGTLAAGSLMGED

Query:  RIRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPPGEVMA
                   A+ CPYY+KTGTCKYG+TCKFDHPPPGEVMA
Subjt:  RIRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPPGEVMA

AT3G48440.1 Zinc finger C-x8-C-x5-C-x3-H type family protein2.1e-1028.37Show/hide
Query:  YYMKTGDCKFGERCKFHHPIDRSAPKQAAQHNVKLTLAGLPRREVYISAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRLIVWGTLAAGSLMGEDRIR
        +YM+TG CKFG  CKF+HP+ R    Q A+ N       +  +E     + L+     +C  +       G   + +         T      + +  + 
Subjt:  YYMKTGDCKFGERCKFHHPIDRSAPKQAAQHNVKLTLAGLPRREVYISAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRLIVWGTLAAGSLMGEDRIR

Query:  TVPNIN-------PDAIICPYYLKTGTCKYGSTCKFDHPPP
        + P +N       P  + CPYY++ G+CKYG+ CKF+HP P
Subjt:  TVPNIN-------PDAIICPYYLKTGTCKYGSTCKFDHPPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACTCTTTTTATTGCGTTTGGTCCGAAGAGGAACGTTTTTTTGTGGAAGATGTGGCTTTCAACAAGGCGATTTCTCTTTCTTCTTCCCTTTTGCTTTGGTTGGAATG
TTCGTTGGTTGAGATTTTATGCCAGCCTATTCAGAATTTTTTCCGTAAGCAATTTCGGGATGCTTTTGGATTAATTCGTTTAGGCAAACTTCGTTCTTCTTCAGGCTGGT
TTTTGCGCTGTGTTGTTTGGCCTATTTCAGGTGGTAGACAATTCATCCATGTGCCAATTGGAAATTCCAAAGAGGGTTGTTCTTTATTCAAGGAGCTTGTGGTCGATTCC
ATTAGGAGTTTTATGGTTTCTAATGCTCCAGTAACTTGTTGGGAGGAACCTCCTATGAGTTATGCAGAAACTCTCAAGATTCCCTTGAAGTCATTAGCACTAGGTTCTGG
GGCTGTTGGTGTATCTAAGAATTCTACCGTTCTTCGGAATCGTAAGGTTTCTTACTGGATTAGAAAAAATCAGGAGGTTTTGAAGGTTAATTTTGCAGATTTTTGGGTGG
TGTCTAGATTATTTGCCCACAATAGTTGGAAAGAAATCATTGAGGTTTTGGAATTTCATTTCAAATCAAAGATTTCGATCAATCCGCTTTTTGCAGATAAAGCATTGCTC
AAATTTGAAGACGATAAAGTTGTTAGTTCATTAGATTCGGTTGGAAAATGGAAGGTCTTTGGTAATTTTCATCTTTTGCTTGAAAAATGGAACAAAAAGCGTCATAGTCA
TCCCTGTTTTATGGAGGGTTATGGTGGTTGGATTGCGATCAAGAATTTTCCTCTAGATTATTGGTTTCGGCAATCTTTTGAAGCTATTGGAGAATATTTTGGGGGTTTAG
TGAACATTTCAAGTGAAACTCTTAACATGACTATTGTTTCGGAAGCTAGAATTCAGGTTAAGACAAACTTATGTGGTTTTATGCCGGCTACTATTGAACTTAAAGATAGT
TACGGAGGGAATATTTATCTTCATTTTGGAGATGTGGCTATTCTGGATTCTCCAAATATTATTCACCATAGCCTTGAGTTGAGTGATTTTTCCAATCCAATGGACCTTTT
TAGAATCAAACAAGTAATGGAAGATGAAGGGTTTGATTCTCATTTTCAAAACCCAGATGTTGAATACTGTGGAAAAATTCTGGAAATTCCTTCTGTATCGAGGGACCCTA
AGGTAATTTACAAGTTTGTTCAATTACCTTGTGATGTTTGTAATATGAAAGGGCTTTCTATACTGCCAAACATTTCTGAAGTTGAAGAGTCACTGAAATTAAATTTTGAT
GATGATTTATTGTCAAGAAAGTCCTTTTCAGATCCCCCAGTAATTGAAAAGGTTATTTCGAATATTGTAAATGCCCGAGGTAATTTTCAAGACCTTGCAGTAATTGAGAA
AGAGAAAGAGTTATTAAATGCTGAAGAACATGCAGTAATTGAGAAAGAGAAGGAGTTATTAAATGCCAAAGGGCTGCATGTCTTTGGTCTTCCTTCTGCCGAGAGTGTAT
TTTCCAAGGAGCCTTTGCATGAAGTGGGGGAGGTTTTTAAGAATAAGGTGGATGCTGCATTTATTGATGAGATAGTAAATGATGGGGAGGTTTTTAAGAATAAGGAGGAT
GCTGCATTTATTGATGAGATAGTAAATGATGTTTTAACTCAAGTTTCAAACATGGCTGATAGATCTTTAACCGAGGGGCAACCTTTCAAATGTCCAGTAATTAATAATCC
AAAGGCTTTTTCAGATGAGAACACGGTTCAAGATACTTTTAATTCCGATTTAATTGGAAATTATTCTTTGTCTGAATATTTGGACTCCGTTATTCCATCAAGTGTTAAAG
AAGTACATGATAATTTCTGTAAATCTTATTCCAAATATTATGTGCGAAAGAAAGGGCCAAGTGTGGAAGGTGATAATTTAAAGGTTAATGCTGATGCTTCGGAAGAAGTT
GTCTCTAAGGTATTGGCTCCTCAAGAATCTATATTAAATCAGGATTATTCAAAGGTAGCAGACAAGTCAAATGATCTTGAAATCAATAGTTGTAATATCGGGCCAGAAGG
TGTGCTTTTCACAAGAGTTCTTTTTCCCTCTTCTAAAGCTAGTTTGGCGACAAAAGTATCATCACATTGTGATATTGAAAATTCAGATGATGAGTTAGAGGTTAGTATTA
GCAGTGAAGAGATTGATTTTCCTCCTTCGGAAAATCTTTTGAATGCTGATAGTTGTGATATTTTGTTTTTAGTTGCTTTTGTGTGTGTGTATACGAGCTTTCTGGAGCTC
AGGGAGATGAACGAGGAGTTCTCCCCCATCCGATGTTTTGTGATAGGGGAAGATTTTTATGGAAAGCTAGGATGTTATTATATGAAGACTGGGGACTGTAAATTTGGGGA
ACGATGCAAATTTCATCACCCTATTGATCGGTCAGCACCAAAGCAGGCTGCTCAACATAACGTGAAACTAACACTAGCAGGACTACCCAGGAGAGAGGTATATATATCTG
CTATGTGCCTTGTTAGGTTCACCTTACAAGAATGTGGAGTACACCCTATCTGGAGTAGTAATGCTGGAAACTCCTTTTTCAATAAGTTTCTCTCTCGCTTGATTGTTTGG
GGTACTTTAGCAGCTGGAAGTCTGATGGGTGAGGACAGAATTAGAACAGTACCAAATATTAATCCTGATGCCATAATTTGTCCATATTATCTGAAAACTGGTACGTGCAA
GTATGGTTCTACTTGCAAATTTGACCACCCACCTCCTGGAGAAGTAATGGCAATGGCAGCAATACAATCTGTTCCAGGAAAAGAAGGCGAAGAACGAATAGATGAATCAG
TCAATGAACAGCAGCAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAACTCTTTTTATTGCGTTTGGTCCGAAGAGGAACGTTTTTTTGTGGAAGATGTGGCTTTCAACAAGGCGATTTCTCTTTCTTCTTCCCTTTTGCTTTGGTTGGAATG
TTCGTTGGTTGAGATTTTATGCCAGCCTATTCAGAATTTTTTCCGTAAGCAATTTCGGGATGCTTTTGGATTAATTCGTTTAGGCAAACTTCGTTCTTCTTCAGGCTGGT
TTTTGCGCTGTGTTGTTTGGCCTATTTCAGGTGGTAGACAATTCATCCATGTGCCAATTGGAAATTCCAAAGAGGGTTGTTCTTTATTCAAGGAGCTTGTGGTCGATTCC
ATTAGGAGTTTTATGGTTTCTAATGCTCCAGTAACTTGTTGGGAGGAACCTCCTATGAGTTATGCAGAAACTCTCAAGATTCCCTTGAAGTCATTAGCACTAGGTTCTGG
GGCTGTTGGTGTATCTAAGAATTCTACCGTTCTTCGGAATCGTAAGGTTTCTTACTGGATTAGAAAAAATCAGGAGGTTTTGAAGGTTAATTTTGCAGATTTTTGGGTGG
TGTCTAGATTATTTGCCCACAATAGTTGGAAAGAAATCATTGAGGTTTTGGAATTTCATTTCAAATCAAAGATTTCGATCAATCCGCTTTTTGCAGATAAAGCATTGCTC
AAATTTGAAGACGATAAAGTTGTTAGTTCATTAGATTCGGTTGGAAAATGGAAGGTCTTTGGTAATTTTCATCTTTTGCTTGAAAAATGGAACAAAAAGCGTCATAGTCA
TCCCTGTTTTATGGAGGGTTATGGTGGTTGGATTGCGATCAAGAATTTTCCTCTAGATTATTGGTTTCGGCAATCTTTTGAAGCTATTGGAGAATATTTTGGGGGTTTAG
TGAACATTTCAAGTGAAACTCTTAACATGACTATTGTTTCGGAAGCTAGAATTCAGGTTAAGACAAACTTATGTGGTTTTATGCCGGCTACTATTGAACTTAAAGATAGT
TACGGAGGGAATATTTATCTTCATTTTGGAGATGTGGCTATTCTGGATTCTCCAAATATTATTCACCATAGCCTTGAGTTGAGTGATTTTTCCAATCCAATGGACCTTTT
TAGAATCAAACAAGTAATGGAAGATGAAGGGTTTGATTCTCATTTTCAAAACCCAGATGTTGAATACTGTGGAAAAATTCTGGAAATTCCTTCTGTATCGAGGGACCCTA
AGGTAATTTACAAGTTTGTTCAATTACCTTGTGATGTTTGTAATATGAAAGGGCTTTCTATACTGCCAAACATTTCTGAAGTTGAAGAGTCACTGAAATTAAATTTTGAT
GATGATTTATTGTCAAGAAAGTCCTTTTCAGATCCCCCAGTAATTGAAAAGGTTATTTCGAATATTGTAAATGCCCGAGGTAATTTTCAAGACCTTGCAGTAATTGAGAA
AGAGAAAGAGTTATTAAATGCTGAAGAACATGCAGTAATTGAGAAAGAGAAGGAGTTATTAAATGCCAAAGGGCTGCATGTCTTTGGTCTTCCTTCTGCCGAGAGTGTAT
TTTCCAAGGAGCCTTTGCATGAAGTGGGGGAGGTTTTTAAGAATAAGGTGGATGCTGCATTTATTGATGAGATAGTAAATGATGGGGAGGTTTTTAAGAATAAGGAGGAT
GCTGCATTTATTGATGAGATAGTAAATGATGTTTTAACTCAAGTTTCAAACATGGCTGATAGATCTTTAACCGAGGGGCAACCTTTCAAATGTCCAGTAATTAATAATCC
AAAGGCTTTTTCAGATGAGAACACGGTTCAAGATACTTTTAATTCCGATTTAATTGGAAATTATTCTTTGTCTGAATATTTGGACTCCGTTATTCCATCAAGTGTTAAAG
AAGTACATGATAATTTCTGTAAATCTTATTCCAAATATTATGTGCGAAAGAAAGGGCCAAGTGTGGAAGGTGATAATTTAAAGGTTAATGCTGATGCTTCGGAAGAAGTT
GTCTCTAAGGTATTGGCTCCTCAAGAATCTATATTAAATCAGGATTATTCAAAGGTAGCAGACAAGTCAAATGATCTTGAAATCAATAGTTGTAATATCGGGCCAGAAGG
TGTGCTTTTCACAAGAGTTCTTTTTCCCTCTTCTAAAGCTAGTTTGGCGACAAAAGTATCATCACATTGTGATATTGAAAATTCAGATGATGAGTTAGAGGTTAGTATTA
GCAGTGAAGAGATTGATTTTCCTCCTTCGGAAAATCTTTTGAATGCTGATAGTTGTGATATTTTGTTTTTAGTTGCTTTTGTGTGTGTGTATACGAGCTTTCTGGAGCTC
AGGGAGATGAACGAGGAGTTCTCCCCCATCCGATGTTTTGTGATAGGGGAAGATTTTTATGGAAAGCTAGGATGTTATTATATGAAGACTGGGGACTGTAAATTTGGGGA
ACGATGCAAATTTCATCACCCTATTGATCGGTCAGCACCAAAGCAGGCTGCTCAACATAACGTGAAACTAACACTAGCAGGACTACCCAGGAGAGAGGTATATATATCTG
CTATGTGCCTTGTTAGGTTCACCTTACAAGAATGTGGAGTACACCCTATCTGGAGTAGTAATGCTGGAAACTCCTTTTTCAATAAGTTTCTCTCTCGCTTGATTGTTTGG
GGTACTTTAGCAGCTGGAAGTCTGATGGGTGAGGACAGAATTAGAACAGTACCAAATATTAATCCTGATGCCATAATTTGTCCATATTATCTGAAAACTGGTACGTGCAA
GTATGGTTCTACTTGCAAATTTGACCACCCACCTCCTGGAGAAGTAATGGCAATGGCAGCAATACAATCTGTTCCAGGAAAAGAAGGCGAAGAACGAATAGATGAATCAG
TCAATGAACAGCAGCAGTAG
Protein sequenceShow/hide protein sequence
MNSFYCVWSEEERFFVEDVAFNKAISLSSSLLLWLECSLVEILCQPIQNFFRKQFRDAFGLIRLGKLRSSSGWFLRCVVWPISGGRQFIHVPIGNSKEGCSLFKELVVDS
IRSFMVSNAPVTCWEEPPMSYAETLKIPLKSLALGSGAVGVSKNSTVLRNRKVSYWIRKNQEVLKVNFADFWVVSRLFAHNSWKEIIEVLEFHFKSKISINPLFADKALL
KFEDDKVVSSLDSVGKWKVFGNFHLLLEKWNKKRHSHPCFMEGYGGWIAIKNFPLDYWFRQSFEAIGEYFGGLVNISSETLNMTIVSEARIQVKTNLCGFMPATIELKDS
YGGNIYLHFGDVAILDSPNIIHHSLELSDFSNPMDLFRIKQVMEDEGFDSHFQNPDVEYCGKILEIPSVSRDPKVIYKFVQLPCDVCNMKGLSILPNISEVEESLKLNFD
DDLLSRKSFSDPPVIEKVISNIVNARGNFQDLAVIEKEKELLNAEEHAVIEKEKELLNAKGLHVFGLPSAESVFSKEPLHEVGEVFKNKVDAAFIDEIVNDGEVFKNKED
AAFIDEIVNDVLTQVSNMADRSLTEGQPFKCPVINNPKAFSDENTVQDTFNSDLIGNYSLSEYLDSVIPSSVKEVHDNFCKSYSKYYVRKKGPSVEGDNLKVNADASEEV
VSKVLAPQESILNQDYSKVADKSNDLEINSCNIGPEGVLFTRVLFPSSKASLATKVSSHCDIENSDDELEVSISSEEIDFPPSENLLNADSCDILFLVAFVCVYTSFLEL
REMNEEFSPIRCFVIGEDFYGKLGCYYMKTGDCKFGERCKFHHPIDRSAPKQAAQHNVKLTLAGLPRREVYISAMCLVRFTLQECGVHPIWSSNAGNSFFNKFLSRLIVW
GTLAAGSLMGEDRIRTVPNINPDAIICPYYLKTGTCKYGSTCKFDHPPPGEVMAMAAIQSVPGKEGEERIDESVNEQQQ