; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr017484 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr017484
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionUnknown protein
Genome locationtig00153048:516266..522325
RNA-Seq ExpressionSgr017484
SyntenySgr017484
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7035686.1 hypothetical protein SDJN02_02484 [Cucurbita argyrosperma subsp. argyrosperma]1.0e-10983.98Show/hide
Query:  MKKMKGAVSQY-AVYEDPKTRFKHQSLLQDYEELEKETETVKRKLQIMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGDLVQQKRLNNQVPKNNKKG
        MKKMKGA SQ+ +V++D K RFKHQ+LLQDY ELEKETET KRKLQ+MKQKKMTL+AEVRFL+KRYEYLMKNQ  NDH NG+ VQQK+LN QV  N KKG
Subjt:  MKKMKGAVSQY-AVYEDPKTRFKHQSLLQDYEELEKETETVKRKLQIMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGDLVQQKRLNNQVPKNNKKG

Query:  KNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEELQ
        KNG RRRPAL PLPTISDINQKERI+RRIDIP Q+STP PVLDLNQKAKT RKKAN QNSTP  DLNQKERMYSGRDA ER +TPFFDLNQIS+EEEELQ
Subjt:  KNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEELQ

Query:  THYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR
        THY+ LRADELKKSLLRGGNDEQQNDIKISACRT+GDGPSRAGKRKISWQDQVALR
Subjt:  THYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR

XP_022135294.1 uncharacterized protein LOC111007291 [Momordica charantia]1.1e-11386.49Show/hide
Query:  MKKMKGAVSQYAVYEDPKTRFKHQSLLQDYEELEKETETVKRKLQIMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGD--LVQQKRLNNQVPKNNKK
        MKK+KG VSQYAVYEDPK RFKH SLLQDY+ELEKETETVKRKLQ+M QKKMTL+AEVRFLRKRYEYLMKNQSSN+  NGD   VQQK+LNNQV  NNKK
Subjt:  MKKMKGAVSQYAVYEDPKTRFKHQSLLQDYEELEKETETVKRKLQIMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGD--LVQQKRLNNQVPKNNKK

Query:  GKNGVRRRPALQPLPTISDINQKERIDRRIDIP-QQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEE
        GKNG RRRPALQPLP+ISDINQKE+IDR IDIP Q+N TPTPV DLNQKAKT+RKKANLQN  P LDLNQKERMYSGRDAGERN+TPFFDLNQISIEEEE
Subjt:  GKNGVRRRPALQPLPTISDINQKERIDRRIDIP-QQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEE

Query:  LQTHYEPLRADELKKSLLRGGNDEQQ-NDIKISACRTIGDGPSRAGKRKISWQDQVALR
        LQ HYEPLRA+ELKKSLLRGG+DEQQ NDIKISACRTIGDGPSRA KRKISWQDQVALR
Subjt:  LQTHYEPLRADELKKSLLRGGNDEQQ-NDIKISACRTIGDGPSRAGKRKISWQDQVALR

XP_022932147.1 uncharacterized protein LOC111438466 [Cucurbita moschata]2.6e-11084.38Show/hide
Query:  MKKMKGAVSQY-AVYEDPKTRFKHQSLLQDYEELEKETETVKRKLQIMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGDLVQQKRLNNQVPKNNKKG
        MKKMKGA SQ+ +V++D K RFKHQ+LLQDY ELEKETET KRKLQ+MKQKKMTL+AEVRFL+KRYEYLMKNQ  NDH NG+ VQQK+LN QV  N KKG
Subjt:  MKKMKGAVSQY-AVYEDPKTRFKHQSLLQDYEELEKETETVKRKLQIMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGDLVQQKRLNNQVPKNNKKG

Query:  KNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEELQ
        KNG RRRPALQPLPTISDINQKERI+RRIDIP Q+STP PVLDLNQKAKT RKKAN QNSTP  DLNQKERMYSGRDA ER +TPFFDLNQIS+EEEELQ
Subjt:  KNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEELQ

Query:  THYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR
        THY+ LRADELKKSLLRGGNDEQQNDIKISACRT+GDGPSRAGKRKISWQDQVALR
Subjt:  THYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR

XP_038879754.1 uncharacterized protein LOC120071506 isoform X1 [Benincasa hispida]6.5e-10983.4Show/hide
Query:  MKKMKGAVSQYAVYEDPKTRFKHQSLLQDYEELEK-------ETETVKRKLQIMKQKKMTLIAEVRFLRKRYEYLMKNQ-SSNDHL-NGDLVQQKRLNNQ
        MKKMKG VSQY V+ED KTRFKHQSLLQDY++LEK       ET TVKRKLQ+MK KKMTLIAEVRFLRKRYEYLMKNQ S+NDH  N + VQQK+LNNQ
Subjt:  MKKMKGAVSQYAVYEDPKTRFKHQSLLQDYEELEK-------ETETVKRKLQIMKQKKMTLIAEVRFLRKRYEYLMKNQ-SSNDHL-NGDLVQQKRLNNQ

Query:  VPKNNKKGKNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKT-SRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQ
        V  NNKKGKNG RRR  LQPLPTISD+NQKERID+ ID+P QNSTP PVLDLNQKAKT SRKKAN QNSTP  DLNQKERMYSGRDA ERN+TPFFDLNQ
Subjt:  VPKNNKKGKNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKT-SRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQ

Query:  ISIEEEELQTHYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR
        ISIEEEELQT+YEPLR DELKKSLLRGGNDEQQNDIKISACR+IGDGPSRAGKRKISWQDQVALR
Subjt:  ISIEEEELQTHYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR

XP_038879755.1 uncharacterized protein LOC120071506 isoform X2 [Benincasa hispida]5.3e-11185.66Show/hide
Query:  MKKMKGAVSQYAVYEDPKTRFKHQSLLQDYEELEKETETVKRKLQIMKQKKMTLIAEVRFLRKRYEYLMKNQ-SSNDHL-NGDLVQQKRLNNQVPKNNKK
        MKKMKG VSQY V+ED KTRFKHQSLLQDY++LEKET TVKRKLQ+MK KKMTLIAEVRFLRKRYEYLMKNQ S+NDH  N + VQQK+LNNQV  NNKK
Subjt:  MKKMKGAVSQYAVYEDPKTRFKHQSLLQDYEELEKETETVKRKLQIMKQKKMTLIAEVRFLRKRYEYLMKNQ-SSNDHL-NGDLVQQKRLNNQVPKNNKK

Query:  GKNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKT-SRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEE
        GKNG RRR  LQPLPTISD+NQKERID+ ID+P QNSTP PVLDLNQKAKT SRKKAN QNSTP  DLNQKERMYSGRDA ERN+TPFFDLNQISIEEEE
Subjt:  GKNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKT-SRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEE

Query:  LQTHYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR
        LQT+YEPLR DELKKSLLRGGNDEQQNDIKISACR+IGDGPSRAGKRKISWQDQVALR
Subjt:  LQTHYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR

TrEMBL top hitse value%identityAlignment
A0A1S3B8I5 uncharacterized protein LOC1034873441.5e-10684.5Show/hide
Query:  MKKMKGAVSQYAVYEDPKTRFKHQSLLQDYEELEKETETVKRKLQIMKQKKMTLIAEVRFLRKRYEYLMKNQ-SSNDHL-NGDLVQQKRLNNQVPKNNKK
        MKKMKG VSQY VYED KTRFKHQSLLQDY +LEKET TVKRKLQ+MKQKKMTLIAEVRFLRKRYEYLMKNQ S+ DH  NG+ VQQK+  NQV  NNKK
Subjt:  MKKMKGAVSQYAVYEDPKTRFKHQSLLQDYEELEKETETVKRKLQIMKQKKMTLIAEVRFLRKRYEYLMKNQ-SSNDHL-NGDLVQQKRLNNQVPKNNKK

Query:  GKNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKT-SRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEE
         KNG RRR ALQPLPTISDINQKE    RID+P QNSTP PVLDLNQKAKT SRKKAN  NS P  DLNQKERM SGRDA ERN+TPFFDLNQISIEEEE
Subjt:  GKNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKT-SRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEE

Query:  LQTHYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR
        LQTHYEPLR DELKKSLLRGGNDEQQNDIKISACR+IGDGPSRAGKRKISWQDQVALR
Subjt:  LQTHYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR

A0A5A7T0V7 Uncharacterized protein1.5e-10684.5Show/hide
Query:  MKKMKGAVSQYAVYEDPKTRFKHQSLLQDYEELEKETETVKRKLQIMKQKKMTLIAEVRFLRKRYEYLMKNQ-SSNDHL-NGDLVQQKRLNNQVPKNNKK
        MKKMKG VSQY VYED KTRFKHQSLLQDY +LEKET TVKRKLQ+MKQKKMTLIAEVRFLRKRYEYLMKNQ S+ DH  NG+ VQQK+  NQV  NNKK
Subjt:  MKKMKGAVSQYAVYEDPKTRFKHQSLLQDYEELEKETETVKRKLQIMKQKKMTLIAEVRFLRKRYEYLMKNQ-SSNDHL-NGDLVQQKRLNNQVPKNNKK

Query:  GKNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKT-SRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEE
         KNG RRR ALQPLPTISDINQKE    RID+P QNSTP PVLDLNQKAKT SRKKAN  NS P  DLNQKERM SGRDA ERN+TPFFDLNQISIEEEE
Subjt:  GKNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKT-SRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEE

Query:  LQTHYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR
        LQTHYEPLR DELKKSLLRGGNDEQQNDIKISACR+IGDGPSRAGKRKISWQDQVALR
Subjt:  LQTHYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR

A0A6J1C111 uncharacterized protein LOC1110072915.5e-11486.49Show/hide
Query:  MKKMKGAVSQYAVYEDPKTRFKHQSLLQDYEELEKETETVKRKLQIMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGD--LVQQKRLNNQVPKNNKK
        MKK+KG VSQYAVYEDPK RFKH SLLQDY+ELEKETETVKRKLQ+M QKKMTL+AEVRFLRKRYEYLMKNQSSN+  NGD   VQQK+LNNQV  NNKK
Subjt:  MKKMKGAVSQYAVYEDPKTRFKHQSLLQDYEELEKETETVKRKLQIMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGD--LVQQKRLNNQVPKNNKK

Query:  GKNGVRRRPALQPLPTISDINQKERIDRRIDIP-QQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEE
        GKNG RRRPALQPLP+ISDINQKE+IDR IDIP Q+N TPTPV DLNQKAKT+RKKANLQN  P LDLNQKERMYSGRDAGERN+TPFFDLNQISIEEEE
Subjt:  GKNGVRRRPALQPLPTISDINQKERIDRRIDIP-QQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEE

Query:  LQTHYEPLRADELKKSLLRGGNDEQQ-NDIKISACRTIGDGPSRAGKRKISWQDQVALR
        LQ HYEPLRA+ELKKSLLRGG+DEQQ NDIKISACRTIGDGPSRA KRKISWQDQVALR
Subjt:  LQTHYEPLRADELKKSLLRGGNDEQQ-NDIKISACRTIGDGPSRAGKRKISWQDQVALR

A0A6J1EVU7 uncharacterized protein LOC1114384661.3e-11084.38Show/hide
Query:  MKKMKGAVSQY-AVYEDPKTRFKHQSLLQDYEELEKETETVKRKLQIMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGDLVQQKRLNNQVPKNNKKG
        MKKMKGA SQ+ +V++D K RFKHQ+LLQDY ELEKETET KRKLQ+MKQKKMTL+AEVRFL+KRYEYLMKNQ  NDH NG+ VQQK+LN QV  N KKG
Subjt:  MKKMKGAVSQY-AVYEDPKTRFKHQSLLQDYEELEKETETVKRKLQIMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGDLVQQKRLNNQVPKNNKKG

Query:  KNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEELQ
        KNG RRRPALQPLPTISDINQKERI+RRIDIP Q+STP PVLDLNQKAKT RKKAN QNSTP  DLNQKERMYSGRDA ER +TPFFDLNQIS+EEEELQ
Subjt:  KNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEELQ

Query:  THYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR
        THY+ LRADELKKSLLRGGNDEQQNDIKISACRT+GDGPSRAGKRKISWQDQVALR
Subjt:  THYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR

A0A6J1HJN9 uncharacterized protein LOC1114651177.7e-10883.59Show/hide
Query:  MKKMKGAVSQY-AVYEDPKTRFKHQSLLQDYEELEKETETVKRKLQIMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGDLVQQKRLNNQVPKNNKKG
        MKKMKGA SQ+ +V++D K RFKHQ+LLQDY ELEKETET KRKLQ+MKQKKMTL+AEVRFLRKRYEYLMKNQ  NDH NG+ VQQK  + QV  N KKG
Subjt:  MKKMKGAVSQY-AVYEDPKTRFKHQSLLQDYEELEKETETVKRKLQIMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGDLVQQKRLNNQVPKNNKKG

Query:  KNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEELQ
        KNG RRRPALQPLPTISDINQKERI+RRIDIP Q+STP PVLDLNQKAKT RKKAN QNSTP  DLNQKERMYSGRDA ER +TPFFDLNQIS+EEEELQ
Subjt:  KNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEELQ

Query:  THYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR
        THY+ LRADELKKSLLRGGNDEQQNDIKISACRT+G+GPSRAGKRKISWQDQVALR
Subjt:  THYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G30630.1 unknown protein1.4e-1633.6Show/hide
Query:  DPKTRFKHQSLLQDYEELEKETETVKRKLQIMKQKKMTLIAEVRFLRKRYEYLMKNQ---SSNDHL----NGDLVQQKRLNNQVPKNNKKGKNGVRRRPA
        DP  R     +L    ELEKE E  +++L+++KQK++TL +EVRFLR+RYE+L ++Q   +S + L    +G L   ++     P   +K ++GVR    
Subjt:  DPKTRFKHQSLLQDYEELEKETETVKRKLQIMKQKKMTLIAEVRFLRKRYEYLMKNQ---SSNDHL----NGDLVQQKRLNNQVPKNNKKGKNGVRRRPA

Query:  LQPLPTISDI-NQKERIDRRIDIPQQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEELQTHYEPLRA
           L   + I N+KE +   +             DL++K K SR    L       DLN +     G  +G   V P FDLNQIS EEEE + + E + A
Subjt:  LQPLPTISDI-NQKERIDRRIDIPQQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEELQTHYEPLRA

Query:  DELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVAL
        + +K ++L     +   + K+  C  +    +RA KRK++WQD VAL
Subjt:  DELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVAL

AT5G57910.1 unknown protein4.9e-2232.52Show/hide
Query:  YEDPKTRFKHQSLLQDYEELEKETETVKRKLQIMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGDLVQQKRLNNQVPKNNKKGKNGVRRRPALQPLP
        +EDPK RF+H SL+QDY EL  ETE ++++LQ ++++K TL+AEVRFLR+RY +L ++Q                  Q  K  ++   G + R  + P  
Subjt:  YEDPKTRFKHQSLLQDYEELEKETETVKRKLQIMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGDLVQQKRLNNQVPKNNKKGKNGVRRRPALQPLP

Query:  TISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEE----LQTHYEPLRADE
             N+ E   + + +P                                DLN  E+ +       +   P FDLNQIS EEE+    +  + E  R +E
Subjt:  TISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEE----LQTHYEPLRADE

Query:  LK--KSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVA
            K L+    + QQ D+K S+CR  G+G   + KRKISWQD VA
Subjt:  LK--KSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAGATGAAAGGGGCCGTCTCCCAATACGCTGTGTATGAGGATCCAAAGACCAGATTCAAGCATCAGAGTCTCCTGCAAGATTATGAGGAATTGGAGAAGGAAAC
TGAAACTGTTAAGAGGAAATTGCAGATAATGAAGCAAAAGAAGATGACCCTGATTGCAGAAGTCCGATTCTTGAGGAAGAGATATGAGTACTTAATGAAGAACCAGTCAT
CAAACGACCATTTAAATGGAGATCTTGTGCAGCAGAAGCGACTTAATAATCAAGTGCCTAAAAATAATAAGAAGGGGAAGAATGGTGTGAGGAGGAGGCCCGCTTTGCAA
CCGCTTCCAACGATCTCTGATATAAACCAAAAGGAAAGAATTGACAGACGAATTGATATTCCTCAGCAGAATTCTACTCCAACCCCTGTCCTTGACTTAAACCAGAAGGC
AAAGACTTCTAGGAAGAAAGCCAATCTACAGAATTCAACACCAACTTTGGACTTGAACCAGAAGGAAAGAATGTACAGTGGGAGAGATGCTGGCGAGAGAAATGTCACTC
CGTTTTTTGACTTGAACCAAATTTCGATAGAGGAAGAAGAATTGCAGACCCATTACGAGCCACTGAGAGCAGATGAGCTGAAGAAAAGCCTTCTTCGAGGTGGGAACGAT
GAGCAGCAAAACGATATCAAGATTTCAGCGTGCAGGACCATTGGAGATGGTCCAAGTCGAGCTGGTAAAAGAAAGATTTCATGGCAAGACCAGGTGGCTTTAAGGGAGCA
CGGAAAAGAAGACGTGTACAATCAAGTGTATCACATACTCTACCGAAAAGTTGGAAGTTCATCTGGAGTCATGACTCGCAACAGCTGTGATCAGATCACTTTAGATAAGA
AGGCGAGCCCTGTGCTTCTTCCCCTGCTTGTGCGTCTCCATGACAACCACACTTTGAGTTCCTATTTGGCACATTTCGCACCAGAACTTAAAGTCATTCTTTTCAAGAGG
TTTTGCAGATTGGCTCAACGCTTGCTCTTCAGTTCCTGCCCTCCCGGGCGTGATCTGCAGGTGGTCTCTCAGCTTTCCGCTTTGCTTCGAATTGTTCTGGGTCAGGCTTT
GGCTGCAATTTCCACACCACCAGAGCATAAATACTTCAACCTTCTACTTCATGCAGACCAGCACCAGTTCTTGCTCGATAATGAGATCCCTCCTAATCTCCGCCTCCAGC
ATTCGCCGGCTAGCGATCTCTCGAATCATTATCTCTTCTTTTATCCGCTGTTTCACGAGCTCATGCTGGGCGTCGTCGCAGTTCCGCATTACGCCGGCGTTTGGAAGATC
ATCTGGACCTCGCAGAGGGTGGTCGGAGGAGTACCGAGAACTGGAAGGGGCGGCCGGCTCCGGCTGTTTGTGATCGATGGCTCGGAACCTGAAATCCATCAAAACCAAAG
AGAAAAGACTGGAGAATCTAAGAAGGGCGCGAAGAGAGAGATTAGTGGGGTTAAAGTTCTGCACACTGGAAATCAAAGTCTGGAAGGAGGATTTTGGTACGGAGAAAAGA
GACGAAAACGTGAGCTGCCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAAGATGAAAGGGGCCGTCTCCCAATACGCTGTGTATGAGGATCCAAAGACCAGATTCAAGCATCAGAGTCTCCTGCAAGATTATGAGGAATTGGAGAAGGAAAC
TGAAACTGTTAAGAGGAAATTGCAGATAATGAAGCAAAAGAAGATGACCCTGATTGCAGAAGTCCGATTCTTGAGGAAGAGATATGAGTACTTAATGAAGAACCAGTCAT
CAAACGACCATTTAAATGGAGATCTTGTGCAGCAGAAGCGACTTAATAATCAAGTGCCTAAAAATAATAAGAAGGGGAAGAATGGTGTGAGGAGGAGGCCCGCTTTGCAA
CCGCTTCCAACGATCTCTGATATAAACCAAAAGGAAAGAATTGACAGACGAATTGATATTCCTCAGCAGAATTCTACTCCAACCCCTGTCCTTGACTTAAACCAGAAGGC
AAAGACTTCTAGGAAGAAAGCCAATCTACAGAATTCAACACCAACTTTGGACTTGAACCAGAAGGAAAGAATGTACAGTGGGAGAGATGCTGGCGAGAGAAATGTCACTC
CGTTTTTTGACTTGAACCAAATTTCGATAGAGGAAGAAGAATTGCAGACCCATTACGAGCCACTGAGAGCAGATGAGCTGAAGAAAAGCCTTCTTCGAGGTGGGAACGAT
GAGCAGCAAAACGATATCAAGATTTCAGCGTGCAGGACCATTGGAGATGGTCCAAGTCGAGCTGGTAAAAGAAAGATTTCATGGCAAGACCAGGTGGCTTTAAGGGAGCA
CGGAAAAGAAGACGTGTACAATCAAGTGTATCACATACTCTACCGAAAAGTTGGAAGTTCATCTGGAGTCATGACTCGCAACAGCTGTGATCAGATCACTTTAGATAAGA
AGGCGAGCCCTGTGCTTCTTCCCCTGCTTGTGCGTCTCCATGACAACCACACTTTGAGTTCCTATTTGGCACATTTCGCACCAGAACTTAAAGTCATTCTTTTCAAGAGG
TTTTGCAGATTGGCTCAACGCTTGCTCTTCAGTTCCTGCCCTCCCGGGCGTGATCTGCAGGTGGTCTCTCAGCTTTCCGCTTTGCTTCGAATTGTTCTGGGTCAGGCTTT
GGCTGCAATTTCCACACCACCAGAGCATAAATACTTCAACCTTCTACTTCATGCAGACCAGCACCAGTTCTTGCTCGATAATGAGATCCCTCCTAATCTCCGCCTCCAGC
ATTCGCCGGCTAGCGATCTCTCGAATCATTATCTCTTCTTTTATCCGCTGTTTCACGAGCTCATGCTGGGCGTCGTCGCAGTTCCGCATTACGCCGGCGTTTGGAAGATC
ATCTGGACCTCGCAGAGGGTGGTCGGAGGAGTACCGAGAACTGGAAGGGGCGGCCGGCTCCGGCTGTTTGTGATCGATGGCTCGGAACCTGAAATCCATCAAAACCAAAG
AGAAAAGACTGGAGAATCTAAGAAGGGCGCGAAGAGAGAGATTAGTGGGGTTAAAGTTCTGCACACTGGAAATCAAAGTCTGGAAGGAGGATTTTGGTACGGAGAAAAGA
GACGAAAACGTGAGCTGCCGTGA
Protein sequenceShow/hide protein sequence
MKKMKGAVSQYAVYEDPKTRFKHQSLLQDYEELEKETETVKRKLQIMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGDLVQQKRLNNQVPKNNKKGKNGVRRRPALQ
PLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEELQTHYEPLRADELKKSLLRGGND
EQQNDIKISACRTIGDGPSRAGKRKISWQDQVALREHGKEDVYNQVYHILYRKVGSSSGVMTRNSCDQITLDKKASPVLLPLLVRLHDNHTLSSYLAHFAPELKVILFKR
FCRLAQRLLFSSCPPGRDLQVVSQLSALLRIVLGQALAAISTPPEHKYFNLLLHADQHQFLLDNEIPPNLRLQHSPASDLSNHYLFFYPLFHELMLGVVAVPHYAGVWKI
IWTSQRVVGGVPRTGRGGRLRLFVIDGSEPEIHQNQREKTGESKKGAKREISGVKVLHTGNQSLEGGFWYGEKRRKRELP