CuGenDBv2

Sgr002817 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr002817
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Unknown protein
Genome location: tig00001784:23369..29441
RNA-Seq Expression: Sgr002817
Synteny: Sgr002817
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
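The genome location string in the record above can be split into contig, start and end for downstream use. The following is a minimal sketch, not part of the database; `parse_location` is a hypothetical helper, and the span calculation assumes 1-based, inclusive coordinates, which the page does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


contig, start, end = parse_location("tig00001784:23369..29441")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(contig, start, end, span)
```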


Homology
GenBank top hits (e value, % identity, alignment)
KAG7035686.1 hypothetical protein SDJN02_02484 [Cucurbita argyrosperma subsp. argyrosperma]; e value: 1.9e-109; % identity: 83.98
Query:  MKKMKGAISQY-ALYEDPKTRFKHQSLLQDYEELEKETETVKRKLQMMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGDLVQQKRLNNQVPKNNKKG
        MKKMKGA SQ+ ++++D K RFKHQ+LLQDY ELEKETET KRKLQMMKQKKMTL+AEVRFL+KRYEYLMKNQ  NDH NG+ VQQK+LN QV  N KKG
Subjt:  MKKMKGAISQY-ALYEDPKTRFKHQSLLQDYEELEKETETVKRKLQMMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGDLVQQKRLNNQVPKNNKKG

Query:  KNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEELQ
        KNG RRRPAL PLPTISDINQKERI+RRIDIP Q+STP PVLDLNQKAKT RKKAN QNSTP  DLNQKERMYSGRDA ER +TPFFDLNQIS+EEEELQ
Subjt:  KNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEELQ

Query:  THYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR
        THY+ LRADELKKSLLRGGNDEQQNDIKISACRT+GDGPSRAGKRKISWQDQVALR
Subjt:  THYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR

XP_022135294.1 uncharacterized protein LOC111007291 [Momordica charantia]; e value: 1.7e-113; % identity: 86.1
Query:  MKKMKGAISQYALYEDPKTRFKHQSLLQDYEELEKETETVKRKLQMMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGD--LVQQKRLNNQVPKNNKK
        MKK+KG +SQYA+YEDPK RFKH SLLQDY+ELEKETETVKRKLQMM QKKMTL+AEVRFLRKRYEYLMKNQSSN+  NGD   VQQK+LNNQV  NNKK
Subjt:  MKKMKGAISQYALYEDPKTRFKHQSLLQDYEELEKETETVKRKLQMMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGD--LVQQKRLNNQVPKNNKK

Query:  GKNGVRRRPALQPLPTISDINQKERIDRRIDIP-QQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEE
        GKNG RRRPALQPLP+ISDINQKE+IDR IDIP Q+N TPTPV DLNQKAKT+RKKANLQN  P LDLNQKERMYSGRDAGERN+TPFFDLNQISIEEEE
Subjt:  GKNGVRRRPALQPLPTISDINQKERIDRRIDIP-QQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEE

Query:  LQTHYEPLRADELKKSLLRGGNDEQQ-NDIKISACRTIGDGPSRAGKRKISWQDQVALR
        LQ HYEPLRA+ELKKSLLRGG+DEQQ NDIKISACRTIGDGPSRA KRKISWQDQVALR
Subjt:  LQTHYEPLRADELKKSLLRGGNDEQQ-NDIKISACRTIGDGPSRAGKRKISWQDQVALR

XP_022932147.1 uncharacterized protein LOC111438466 [Cucurbita moschata]; e value: 5.0e-110; % identity: 84.38
Query:  MKKMKGAISQY-ALYEDPKTRFKHQSLLQDYEELEKETETVKRKLQMMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGDLVQQKRLNNQVPKNNKKG
        MKKMKGA SQ+ ++++D K RFKHQ+LLQDY ELEKETET KRKLQMMKQKKMTL+AEVRFL+KRYEYLMKNQ  NDH NG+ VQQK+LN QV  N KKG
Subjt:  MKKMKGAISQY-ALYEDPKTRFKHQSLLQDYEELEKETETVKRKLQMMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGDLVQQKRLNNQVPKNNKKG

Query:  KNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEELQ
        KNG RRRPALQPLPTISDINQKERI+RRIDIP Q+STP PVLDLNQKAKT RKKAN QNSTP  DLNQKERMYSGRDA ER +TPFFDLNQIS+EEEELQ
Subjt:  KNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEELQ

Query:  THYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR
        THY+ LRADELKKSLLRGGNDEQQNDIKISACRT+GDGPSRAGKRKISWQDQVALR
Subjt:  THYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR

XP_038879754.1 uncharacterized protein LOC120071506 isoform X1 [Benincasa hispida]; e value: 9.4e-109; % identity: 83.02
Query:  MKKMKGAISQYALYEDPKTRFKHQSLLQDYEELEK-------ETETVKRKLQMMKQKKMTLIAEVRFLRKRYEYLMKNQ-SSNDHL-NGDLVQQKRLNNQ
        MKKMKG +SQY ++ED KTRFKHQSLLQDY++LEK       ET TVKRKLQMMK KKMTLIAEVRFLRKRYEYLMKNQ S+NDH  N + VQQK+LNNQ
Subjt:  MKKMKGAISQYALYEDPKTRFKHQSLLQDYEELEK-------ETETVKRKLQMMKQKKMTLIAEVRFLRKRYEYLMKNQ-SSNDHL-NGDLVQQKRLNNQ

Query:  VPKNNKKGKNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKT-SRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQ
        V  NNKKGKNG RRR  LQPLPTISD+NQKERID+ ID+P QNSTP PVLDLNQKAKT SRKKAN QNSTP  DLNQKERMYSGRDA ERN+TPFFDLNQ
Subjt:  VPKNNKKGKNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKT-SRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQ

Query:  ISIEEEELQTHYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR
        ISIEEEELQT+YEPLR DELKKSLLRGGNDEQQNDIKISACR+IGDGPSRAGKRKISWQDQVALR
Subjt:  ISIEEEELQTHYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR

XP_038879755.1 uncharacterized protein LOC120071506 isoform X2 [Benincasa hispida]; e value: 7.7e-111; % identity: 85.27
Query:  MKKMKGAISQYALYEDPKTRFKHQSLLQDYEELEKETETVKRKLQMMKQKKMTLIAEVRFLRKRYEYLMKNQ-SSNDHL-NGDLVQQKRLNNQVPKNNKK
        MKKMKG +SQY ++ED KTRFKHQSLLQDY++LEKET TVKRKLQMMK KKMTLIAEVRFLRKRYEYLMKNQ S+NDH  N + VQQK+LNNQV  NNKK
Subjt:  MKKMKGAISQYALYEDPKTRFKHQSLLQDYEELEKETETVKRKLQMMKQKKMTLIAEVRFLRKRYEYLMKNQ-SSNDHL-NGDLVQQKRLNNQVPKNNKK

Query:  GKNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKT-SRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEE
        GKNG RRR  LQPLPTISD+NQKERID+ ID+P QNSTP PVLDLNQKAKT SRKKAN QNSTP  DLNQKERMYSGRDA ERN+TPFFDLNQISIEEEE
Subjt:  GKNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKT-SRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEE

Query:  LQTHYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR
        LQT+YEPLR DELKKSLLRGGNDEQQNDIKISACR+IGDGPSRAGKRKISWQDQVALR
Subjt:  LQTHYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR

TrEMBL top hits (e value, % identity, alignment)
A0A1S3B8I5 uncharacterized protein LOC103487344; e value: 2.1e-106; % identity: 84.11
Query:  MKKMKGAISQYALYEDPKTRFKHQSLLQDYEELEKETETVKRKLQMMKQKKMTLIAEVRFLRKRYEYLMKNQ-SSNDHL-NGDLVQQKRLNNQVPKNNKK
        MKKMKG +SQY +YED KTRFKHQSLLQDY +LEKET TVKRKLQMMKQKKMTLIAEVRFLRKRYEYLMKNQ S+ DH  NG+ VQQK+  NQV  NNKK
Subjt:  MKKMKGAISQYALYEDPKTRFKHQSLLQDYEELEKETETVKRKLQMMKQKKMTLIAEVRFLRKRYEYLMKNQ-SSNDHL-NGDLVQQKRLNNQVPKNNKK

Query:  GKNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKT-SRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEE
         KNG RRR ALQPLPTISDINQKE    RID+P QNSTP PVLDLNQKAKT SRKKAN  NS P  DLNQKERM SGRDA ERN+TPFFDLNQISIEEEE
Subjt:  GKNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKT-SRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEE

Query:  LQTHYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR
        LQTHYEPLR DELKKSLLRGGNDEQQNDIKISACR+IGDGPSRAGKRKISWQDQVALR
Subjt:  LQTHYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR

A0A5A7T0V7 Uncharacterized protein; e value: 2.1e-106; % identity: 84.11
Query:  MKKMKGAISQYALYEDPKTRFKHQSLLQDYEELEKETETVKRKLQMMKQKKMTLIAEVRFLRKRYEYLMKNQ-SSNDHL-NGDLVQQKRLNNQVPKNNKK
        MKKMKG +SQY +YED KTRFKHQSLLQDY +LEKET TVKRKLQMMKQKKMTLIAEVRFLRKRYEYLMKNQ S+ DH  NG+ VQQK+  NQV  NNKK
Subjt:  MKKMKGAISQYALYEDPKTRFKHQSLLQDYEELEKETETVKRKLQMMKQKKMTLIAEVRFLRKRYEYLMKNQ-SSNDHL-NGDLVQQKRLNNQVPKNNKK

Query:  GKNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKT-SRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEE
         KNG RRR ALQPLPTISDINQKE    RID+P QNSTP PVLDLNQKAKT SRKKAN  NS P  DLNQKERM SGRDA ERN+TPFFDLNQISIEEEE
Subjt:  GKNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKT-SRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEE

Query:  LQTHYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR
        LQTHYEPLR DELKKSLLRGGNDEQQNDIKISACR+IGDGPSRAGKRKISWQDQVALR
Subjt:  LQTHYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR

A0A6J1C111 uncharacterized protein LOC111007291; e value: 8.0e-114; % identity: 86.1
Query:  MKKMKGAISQYALYEDPKTRFKHQSLLQDYEELEKETETVKRKLQMMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGD--LVQQKRLNNQVPKNNKK
        MKK+KG +SQYA+YEDPK RFKH SLLQDY+ELEKETETVKRKLQMM QKKMTL+AEVRFLRKRYEYLMKNQSSN+  NGD   VQQK+LNNQV  NNKK
Subjt:  MKKMKGAISQYALYEDPKTRFKHQSLLQDYEELEKETETVKRKLQMMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGD--LVQQKRLNNQVPKNNKK

Query:  GKNGVRRRPALQPLPTISDINQKERIDRRIDIP-QQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEE
        GKNG RRRPALQPLP+ISDINQKE+IDR IDIP Q+N TPTPV DLNQKAKT+RKKANLQN  P LDLNQKERMYSGRDAGERN+TPFFDLNQISIEEEE
Subjt:  GKNGVRRRPALQPLPTISDINQKERIDRRIDIP-QQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEE

Query:  LQTHYEPLRADELKKSLLRGGNDEQQ-NDIKISACRTIGDGPSRAGKRKISWQDQVALR
        LQ HYEPLRA+ELKKSLLRGG+DEQQ NDIKISACRTIGDGPSRA KRKISWQDQVALR
Subjt:  LQTHYEPLRADELKKSLLRGGNDEQQ-NDIKISACRTIGDGPSRAGKRKISWQDQVALR

A0A6J1EVU7 uncharacterized protein LOC111438466; e value: 2.4e-110; % identity: 84.38
Query:  MKKMKGAISQY-ALYEDPKTRFKHQSLLQDYEELEKETETVKRKLQMMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGDLVQQKRLNNQVPKNNKKG
        MKKMKGA SQ+ ++++D K RFKHQ+LLQDY ELEKETET KRKLQMMKQKKMTL+AEVRFL+KRYEYLMKNQ  NDH NG+ VQQK+LN QV  N KKG
Subjt:  MKKMKGAISQY-ALYEDPKTRFKHQSLLQDYEELEKETETVKRKLQMMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGDLVQQKRLNNQVPKNNKKG

Query:  KNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEELQ
        KNG RRRPALQPLPTISDINQKERI+RRIDIP Q+STP PVLDLNQKAKT RKKAN QNSTP  DLNQKERMYSGRDA ER +TPFFDLNQIS+EEEELQ
Subjt:  KNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEELQ

Query:  THYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR
        THY+ LRADELKKSLLRGGNDEQQNDIKISACRT+GDGPSRAGKRKISWQDQVALR
Subjt:  THYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR

A0A6J1HJN9 uncharacterized protein LOC111465117; e value: 1.1e-107; % identity: 83.59
Query:  MKKMKGAISQY-ALYEDPKTRFKHQSLLQDYEELEKETETVKRKLQMMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGDLVQQKRLNNQVPKNNKKG
        MKKMKGA SQ+ ++++D K RFKHQ+LLQDY ELEKETET KRKLQMMKQKKMTL+AEVRFLRKRYEYLMKNQ  NDH NG+ VQQK  + QV  N KKG
Subjt:  MKKMKGAISQY-ALYEDPKTRFKHQSLLQDYEELEKETETVKRKLQMMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGDLVQQKRLNNQVPKNNKKG

Query:  KNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEELQ
        KNG RRRPALQPLPTISDINQKERI+RRIDIP Q+STP PVLDLNQKAKT RKKAN QNSTP  DLNQKERMYSGRDA ER +TPFFDLNQIS+EEEELQ
Subjt:  KNGVRRRPALQPLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEELQ

Query:  THYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR
        THY+ LRADELKKSLLRGGNDEQQNDIKISACRT+G+GPSRAGKRKISWQDQVALR
Subjt:  THYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT4G30630.1 unknown protein; e value: 6.9e-17; % identity: 34.01
Query:  DPKTRFKHQSLLQDYEELEKETETVKRKLQMMKQKKMTLIAEVRFLRKRYEYLMKNQ---SSNDHL----NGDLVQQKRLNNQVPKNNKKGKNGVRRRPA
        DP  R     +L    ELEKE E  +++L+M+KQK++TL +EVRFLR+RYE+L ++Q   +S + L    +G L   ++     P   +K ++GVR    
Subjt:  DPKTRFKHQSLLQDYEELEKETETVKRKLQMMKQKKMTLIAEVRFLRKRYEYLMKNQ---SSNDHL----NGDLVQQKRLNNQVPKNNKKGKNGVRRRPA

Query:  LQPLPTISDI-NQKERIDRRIDIPQQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEELQTHYEPLRA
           L   + I N+KE +   +             DL++K K SR    L       DLN +     G  +G   V P FDLNQIS EEEE + + E + A
Subjt:  LQPLPTISDI-NQKERIDRRIDIPQQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEELQTHYEPLRA

Query:  DELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVAL
        + +K ++L     +   + K+  C  +    +RA KRK++WQD VAL
Subjt:  DELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVAL

AT5G57910.1 unknown protein; e value: 7.1e-22; % identity: 32.52
Query:  YEDPKTRFKHQSLLQDYEELEKETETVKRKLQMMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGDLVQQKRLNNQVPKNNKKGKNGVRRRPALQPLP
        +EDPK RF+H SL+QDY EL  ETE ++++LQ ++++K TL+AEVRFLR+RY +L ++Q                  Q  K  ++   G + R  + P  
Subjt:  YEDPKTRFKHQSLLQDYEELEKETETVKRKLQMMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGDLVQQKRLNNQVPKNNKKGKNGVRRRPALQPLP

Query:  TISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEE----LQTHYEPLRADE
             N+ E   + + +P                                DLN  E+ +       +   P FDLNQIS EEE+    +  + E  R +E
Subjt:  TISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEE----LQTHYEPLRADE

Query:  LK--KSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVA
            K L+    + QQ D+K S+CR  G+G   + KRKISWQD VA
Subjt:  LK--KSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVA
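
The alignment blocks above follow the usual BLAST text layout, where the unlabelled middle line shows a letter for an identical residue, `+` for a conservative substitution and a space for a mismatch or gap. As a rough sketch of how per-segment identity could be tallied from that line, the snippet below counts one segment copied from the first GenBank hit; `midline_counts` is a hypothetical helper, and the % identity values reported on the page are computed over whole alignments, not single segments.

```python
def midline_counts(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identical, positive, columns) for one alignment segment."""
    if len(query) != len(midline):
        raise ValueError("query and midline must cover the same columns")
    # Letters on the midline mark identical residues.
    identical = sum(1 for c in midline if c.isalpha())
    # '+' marks a conservative substitution; positives include identities.
    positive = identical + midline.count("+")
    return identical, positive, len(midline)


# Last segment of the KAG7035686.1 alignment, copied from the record.
query = "THYEPLRADELKKSLLRGGNDEQQNDIKISACRTIGDGPSRAGKRKISWQDQVALR"
midline = "THY+ LRADELKKSLLRGGNDEQQNDIKISACRT+GDGPSRAGKRKISWQDQVALR"

identical, positive, columns = midline_counts(query, midline)
print(f"{identical}/{columns} identical, {positive}/{columns} positive")
```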


Sequences
CDS sequence
ATGAAGAAGATGAAAGGGGCCATCTCCCAATACGCTCTGTATGAGGATCCAAAGACCAGATTCAAGCATCAGAGTCTCCTGCAAGATTATGAGGAATTGGAGAAGGAAAC
CGAAACTGTTAAGAGGAAATTGCAGATGATGAAGCAAAAGAAGATGACCCTGATTGCAGAAGTCCGATTCTTGAGGAAGAGATATGAGTACTTAATGAAGAACCAGTCAT
CAAACGACCATTTAAATGGAGATCTTGTGCAGCAGAAGCGACTTAATAATCAAGTGCCTAAAAATAATAAGAAGGGGAAGAATGGTGTGAGGAGGAGGCCCGCTTTGCAA
CCGCTTCCAACGATCTCTGATATAAACCAAAAGGAAAGAATTGACAGACGAATTGATATTCCTCAGCAGAATTCTACTCCAACCCCTGTCCTTGACTTAAACCAGAAGGC
AAAGACTTCTAGGAAGAAAGCCAATCTACAGAATTCAACACCAACTTTGGACTTGAACCAGAAGGAAAGAATGTACAGTGGGAGAGATGCTGGCGAGAGAAATGTCACTC
CATTTTTTGACTTGAACCAAATTTCGATAGAGGAAGAGGAATTGCAGACCCATTACGAGCCACTGAGAGCAGATGAGCTGAAGAAAAGCCTTCTTCGAGGTGGGAACGAT
GAGCAGCAAAACGATATCAAGATTTCAGCGTGCAGGACCATTGGAGATGGTCCAAGTCGAGCTGGTAAAAGAAAGATTTCATGGCAAGACCAGGTGGCTTTAAGGGAGCA
CGGAAAAGAAGACGTGTACAATCAAGTGTATCACATACTCTACCGAAAAGTTGGAAGTTCATCTGGAGTCATGACTCGCAACAGCTGTGATCAGATCACTTTAGATAAGA
AGGCGAGCCCTGTGCTTCTTCCCCTGCTTGTGCGTCTCCATGACAACCACACTTTGAGTTCCTATTTGGCACATTTCGCACCAGAACTTAAAGTCATTCTTTTCAAGAGC
TCCTGGGCCACTTCCCTCTTTGGCCATGTGCTTCTTGCCTTGCAAGAAATGGCCCAGTATCAGCGTGATCTGCAGGTGGTCTCTCAGCTTTCCGCTTTGCTTCGAATTGT
TCTGGGTCAGGCTTCGGCTGCATTTTCCACACCACCAGAGCATAAATACTTCAACCTTCTACTTCATGCAGACCAGCACCAGGCAAAGGCTTCGGCTCTTCCATTTGAGG
CTCCGGAAATCGAAGCGAACCCCTCCGTCCGGCGGGCAGCTCTATGCATTGCTAGTTCTTGCTCGATAAGGAGATCCCTCCTAATCTCCGCCTCCAGCATTCGCCGGCTA
GCGATCTCTCGAATCATTATCTCTTCTTTTATCCGCTGTTTCACGAGCTCATGCTGGGCGTCGTCGCAGTTCCGCATTACGCCGGCGTTTGGAAGATCATCTGATTTCTC
CCGTTTGGTCGGCCATCAAACAAACACACACACAGTAAGAGACAGAGAGAGAGAGTCATTTGATAAGCTTAGGACCTCGCAGAGGGTGGTCGGAGGAGTACCGAGAACTG
GAAGGGGCGGCCGGCTCCGGCTGTTTGTCATCGATGGCTCGGAACCTGAAATCCATCAAAACCAAAGAGAAAAGACTGGAGAATCTAAGAAGGGCGCGAAGAGAGAGATT
AGTGGGGTTAAAGTTCTGCACACTGGAAATCAAAGTCTGGAAGGAGGATTTTGGTACGGAGAAAAGAGACGAAAACGTGAGCTGCAGTGA
mRNA sequence
ATGAAGAAGATGAAAGGGGCCATCTCCCAATACGCTCTGTATGAGGATCCAAAGACCAGATTCAAGCATCAGAGTCTCCTGCAAGATTATGAGGAATTGGAGAAGGAAAC
CGAAACTGTTAAGAGGAAATTGCAGATGATGAAGCAAAAGAAGATGACCCTGATTGCAGAAGTCCGATTCTTGAGGAAGAGATATGAGTACTTAATGAAGAACCAGTCAT
CAAACGACCATTTAAATGGAGATCTTGTGCAGCAGAAGCGACTTAATAATCAAGTGCCTAAAAATAATAAGAAGGGGAAGAATGGTGTGAGGAGGAGGCCCGCTTTGCAA
CCGCTTCCAACGATCTCTGATATAAACCAAAAGGAAAGAATTGACAGACGAATTGATATTCCTCAGCAGAATTCTACTCCAACCCCTGTCCTTGACTTAAACCAGAAGGC
AAAGACTTCTAGGAAGAAAGCCAATCTACAGAATTCAACACCAACTTTGGACTTGAACCAGAAGGAAAGAATGTACAGTGGGAGAGATGCTGGCGAGAGAAATGTCACTC
CATTTTTTGACTTGAACCAAATTTCGATAGAGGAAGAGGAATTGCAGACCCATTACGAGCCACTGAGAGCAGATGAGCTGAAGAAAAGCCTTCTTCGAGGTGGGAACGAT
GAGCAGCAAAACGATATCAAGATTTCAGCGTGCAGGACCATTGGAGATGGTCCAAGTCGAGCTGGTAAAAGAAAGATTTCATGGCAAGACCAGGTGGCTTTAAGGGAGCA
CGGAAAAGAAGACGTGTACAATCAAGTGTATCACATACTCTACCGAAAAGTTGGAAGTTCATCTGGAGTCATGACTCGCAACAGCTGTGATCAGATCACTTTAGATAAGA
AGGCGAGCCCTGTGCTTCTTCCCCTGCTTGTGCGTCTCCATGACAACCACACTTTGAGTTCCTATTTGGCACATTTCGCACCAGAACTTAAAGTCATTCTTTTCAAGAGC
TCCTGGGCCACTTCCCTCTTTGGCCATGTGCTTCTTGCCTTGCAAGAAATGGCCCAGTATCAGCGTGATCTGCAGGTGGTCTCTCAGCTTTCCGCTTTGCTTCGAATTGT
TCTGGGTCAGGCTTCGGCTGCATTTTCCACACCACCAGAGCATAAATACTTCAACCTTCTACTTCATGCAGACCAGCACCAGGCAAAGGCTTCGGCTCTTCCATTTGAGG
CTCCGGAAATCGAAGCGAACCCCTCCGTCCGGCGGGCAGCTCTATGCATTGCTAGTTCTTGCTCGATAAGGAGATCCCTCCTAATCTCCGCCTCCAGCATTCGCCGGCTA
GCGATCTCTCGAATCATTATCTCTTCTTTTATCCGCTGTTTCACGAGCTCATGCTGGGCGTCGTCGCAGTTCCGCATTACGCCGGCGTTTGGAAGATCATCTGATTTCTC
CCGTTTGGTCGGCCATCAAACAAACACACACACAGTAAGAGACAGAGAGAGAGAGTCATTTGATAAGCTTAGGACCTCGCAGAGGGTGGTCGGAGGAGTACCGAGAACTG
GAAGGGGCGGCCGGCTCCGGCTGTTTGTCATCGATGGCTCGGAACCTGAAATCCATCAAAACCAAAGAGAAAAGACTGGAGAATCTAAGAAGGGCGCGAAGAGAGAGATT
AGTGGGGTTAAAGTTCTGCACACTGGAAATCAAAGTCTGGAAGGAGGATTTTGGTACGGAGAAAAGAGACGAAAACGTGAGCTGCAGTGA
Protein sequence
MKKMKGAISQYALYEDPKTRFKHQSLLQDYEELEKETETVKRKLQMMKQKKMTLIAEVRFLRKRYEYLMKNQSSNDHLNGDLVQQKRLNNQVPKNNKKGKNGVRRRPALQ
PLPTISDINQKERIDRRIDIPQQNSTPTPVLDLNQKAKTSRKKANLQNSTPTLDLNQKERMYSGRDAGERNVTPFFDLNQISIEEEELQTHYEPLRADELKKSLLRGGND
EQQNDIKISACRTIGDGPSRAGKRKISWQDQVALREHGKEDVYNQVYHILYRKVGSSSGVMTRNSCDQITLDKKASPVLLPLLVRLHDNHTLSSYLAHFAPELKVILFKS
SWATSLFGHVLLALQEMAQYQRDLQVVSQLSALLRIVLGQASAAFSTPPEHKYFNLLLHADQHQAKASALPFEAPEIEANPSVRRAALCIASSCSIRRSLLISASSIRRL
AISRIIISSFIRCFTSSCWASSQFRITPAFGRSSDFSRLVGHQTNTHTVRDRERESFDKLRTSQRVVGGVPRTGRGGRLRLFVIDGSEPEIHQNQREKTGESKKGAKREI
SGVKVLHTGNQSLEGGFWYGEKRRKRELQ
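
The protein sequence above should correspond to a translation of the CDS. The sketch below shows one way to check this, assuming the standard nuclear genetic code (the page does not say which table was used); `translate` is a hypothetical helper, and only the first 57 nucleotides and the last 12 nucleotides of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start and end of the CDS, copied from the record.
cds_start = "ATGAAGAAGATGAAAGGGGCCATCTCCCAATACGCTCTGTATGAGGATCCAAAGACC"
cds_end = "GAGCTGCAGTGA"
print(translate(cds_start))
print(translate(cds_end))
```

Passing the full CDS text, line breaks included, to `translate` would allow a comparison against the complete protein sequence.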