; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr020081 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr020081
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionDUF3082 domain-containing protein
Genome locationtig00153447:176142..185974
RNA-Seq ExpressionSgr020081
SyntenySgr020081
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR021434 - Protein of unknown function DUF3082


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600385.1 hypothetical protein SDJN03_05618, partial [Cucurbita argyrosperma subsp. sororia]8.8e-10278.6Show/hide
Query:  MLQTQNFL-FSNFPFTLSLTHKAFLSPPTLSSTPFSSLCRPITF-HSVSPHHRLRTQQCSA--QISELSDASTTFDDDEGPIELPPTIFATTDNPSSLQV
        ML T N L  SNFP +LSLTH   LSPP     PFSSL RPIT   SV P   LRT QC    Q SELSDA+ +F DD+GPIELP TIFATTD+PSS+QV
Subjt:  MLQTQNFL-FSNFPFTLSLTHKAFLSPPTLSSTPFSSLCRPITF-HSVSPHHRLRTQQCSA--QISELSDASTTFDDDEGPIELPPTIFATTDNPSSLQV

Query:  ATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDSF
        ATSVLLTG+IS+FLFRSLRRRA+R KELKFRSAGVKKSLKEEA++SLKAISTGPI+SKS PSP+QAFLGAIAAGVIALILYKFTTTIEAALNRQT+SD+F
Subjt:  ATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDSF

Query:  SVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQS
        SV+QLTITIRTIV G+CYLATFVFGINAVGLFLYSGQLA+NS+MEE S+ KE ATKGDKQVS PNSTVE  LD TESSSSKDDQS
Subjt:  SVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQS

TYK11253.1 DUF3082 domain-containing protein [Cucumis melo var. makuwa]1.8e-10278.47Show/hide
Query:  MLQTQNFLFSNFP-FTLS---LTHKAFLSPPTLSSTPFSSLCRPITFHSVSPHHRLRTQQCSA--QISELSDASTTFDDDEGPIELPPTIFATTDNPSSL
        M  TQN L SNFP FTLS     HK FLSPP    T  SSL RPITFHS+SP   L T +C    Q ++L+DA  TF DD GP+ELPPTIFATTDNPSSL
Subjt:  MLQTQNFLFSNFP-FTLS---LTHKAFLSPPTLSSTPFSSLCRPITFHSVSPHHRLRTQQCSA--QISELSDASTTFDDDEGPIELPPTIFATTDNPSSL

Query:  QVATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD
        QVATSVLLTG+IS+FLFRSLRRRA+R KELKFRSAGVKKSLKEEA+DSLKAISTGPI SKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQT+SD
Subjt:  QVATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD

Query:  SFSVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSS
        +FSV+QLTITIRTIV GLCYLATFVFGINA+GLFLYSGQLA+NS+MEE SKDKE   K D+QVSPP ST E  LDSTESS+SKDDQSS
Subjt:  SFSVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSS

XP_022142844.1 uncharacterized protein LOC111012858 [Momordica charantia]2.6e-11483.97Show/hide
Query:  MLQTQNFLFSNFPFTLSLTHKAFLSPPTLSSTPFSSLCRPITFHSVSPHHRLRTQQCSAQISELSDASTTFDDDEGPIELPPTIFATTDNPSSLQVATSV
        MLQT NFL S FP TLSLTHK+ LSPP+L     SSL RPITF  +S H RLR QQCS QISELS+A+ TFD+D+GP+ELPPTIFATTD+PSSLQVATSV
Subjt:  MLQTQNFLFSNFPFTLSLTHKAFLSPPTLSSTPFSSLCRPITFHSVSPHHRLRTQQCSAQISELSDASTTFDDDEGPIELPPTIFATTDNPSSLQVATSV

Query:  LLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDSFSVQQ
        LLTG+ISIFLFRSLRRRA+RAKELKFRS GVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD+FSV+Q
Subjt:  LLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDSFSVQQ

Query:  LTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSSRSNCQ
        +TITIRTIV G+CYLATFVFGINAVGLFLYSGQLAVNS+ME+ S DKETAT  DKQVSPPNSTVE  LDSTESSS+KDDQSS SN Q
Subjt:  LTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSSRSNCQ

XP_031739166.1 uncharacterized protein LOC101221005 [Cucumis sativus]2.0e-10177.78Show/hide
Query:  MLQTQNFLFSNFP-FTLS---LTHKAFLSPPTLSSTPFSSLCRPITFHSVSP--HHRLRTQQCSAQISELSDASTTFDDDEGPIELPPTIFATTDNPSSL
        M  TQN L SNFP FTLS     HK FLSPP    T  SSL RPITFHSVSP  +HR     C  Q ++L+DA  TF DD GP+ELPPTIFATTD+PSSL
Subjt:  MLQTQNFLFSNFP-FTLS---LTHKAFLSPPTLSSTPFSSLCRPITFHSVSP--HHRLRTQQCSAQISELSDASTTFDDDEGPIELPPTIFATTDNPSSL

Query:  QVATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD
        QVATSVLLTG+IS+FLFRSLRRRA+R KELKFRS GVKKSLKEEA+DSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEA+LNRQT+SD
Subjt:  QVATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD

Query:  SFSVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSS
        +FSV+QLTITIRTIV GLCYLATFVFGINA+GLFLYSGQLA+NS+MEE SKD E   K D+QVSPP ST E  LDSTESS+SKDDQSS
Subjt:  SFSVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSS

XP_038905614.1 uncharacterized protein LOC120091579 [Benincasa hispida]2.4e-10780.55Show/hide
Query:  MLQTQNFLFSNFP-FTLSLT--HKAFLSPPTLSSTPFSSLCRPITFHSVSP---HHRLRTQQCSAQISELSDASTTFDDDEGPIELPPTIFATTDNPSSL
        MLQTQN L SNFP FTLSLT  HK FLSPPT SS+  SSL RPI FHSVSP   HH      C  Q SEL+DA  TF DD GP+ELP TIFATTD+PSSL
Subjt:  MLQTQNFLFSNFP-FTLSLT--HKAFLSPPTLSSTPFSSLCRPITFHSVSP---HHRLRTQQCSAQISELSDASTTFDDDEGPIELPPTIFATTDNPSSL

Query:  QVATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD
        QVATSVLLTG+IS+FLFRSLRRRA+R KELKFRSAGVKKSLKEEA+DSLKAISTGPIESKSTPSPIQAFLGAIAAGVIA+ILYKFTTTIEAALNRQT+SD
Subjt:  QVATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD

Query:  SFSVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSSRSNCQ
        +FSV+QLTITIRTIV GLCYLATFVFGINA+GLFLYSGQLA+NS+MEE SKDKE   KGDKQVSPPNST E  L+STESS+S+DDQSS SN Q
Subjt:  SFSVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSSRSNCQ

TrEMBL top hitse value%identityAlignment
A0A1S4E0Y7 LOW QUALITY PROTEIN: uncharacterized protein LOC1034962101.6e-10177.78Show/hide
Query:  MLQTQNFLFSNFP-FTLS---LTHKAFLSPPTLSSTPFSSLCRPITFHSVSPHHRLRTQQCSA--QISELSDASTTFDDDEGPIELPPTIFATTDNPSSL
        M  TQN L SN P FTLS     HK FLSPP    T  SSL RPITFHS+SP   L T +C    Q ++L+DA  TF DD GP+ELPPTIFATTDNPSSL
Subjt:  MLQTQNFLFSNFP-FTLS---LTHKAFLSPPTLSSTPFSSLCRPITFHSVSPHHRLRTQQCSA--QISELSDASTTFDDDEGPIELPPTIFATTDNPSSL

Query:  QVATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD
        QVATSVLLTG+IS+FLFRSLRRRA+R KELKFRSAGVKKSLKEEA+DSLKAISTGPI SKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQT+SD
Subjt:  QVATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD

Query:  SFSVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSS
        +FSV+QLTITIRTIV GLCYLATFVFGINA+GLFLYSGQLA+NS+MEE SKDKE   K D+QVSPP ST E  L+STESS+SKDDQSS
Subjt:  SFSVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSS

A0A5A7UXD2 DUF3082 domain-containing protein1.6e-10177.78Show/hide
Query:  MLQTQNFLFSNFP-FTLS---LTHKAFLSPPTLSSTPFSSLCRPITFHSVSPHHRLRTQQCSA--QISELSDASTTFDDDEGPIELPPTIFATTDNPSSL
        M  TQN L SN P FTLS     HK FLSPP    T  SSL RPITFHS+SP   L T +C    Q ++L+DA  TF DD GP+ELPPTIFATTDNPSSL
Subjt:  MLQTQNFLFSNFP-FTLS---LTHKAFLSPPTLSSTPFSSLCRPITFHSVSPHHRLRTQQCSA--QISELSDASTTFDDDEGPIELPPTIFATTDNPSSL

Query:  QVATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD
        QVATSVLLTG+IS+FLFRSLRRRA+R KELKFRSAGVKKSLKEEA+DSLKAISTGPI SKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQT+SD
Subjt:  QVATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD

Query:  SFSVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSS
        +FSV+QLTITIRTIV GLCYLATFVFGINA+GLFLYSGQLA+NS+MEE SKDKE   K D+QVSPP ST E  L+STESS+SKDDQSS
Subjt:  SFSVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSS

A0A5B7BP03 Uncharacterized protein (Fragment)8.3e-8265.38Show/hide
Query:  MLQTQNFLFSNFPFTLSLTHKAFLSPPTLSS-TPFSSLCRPITFHSVSPHHRLRTQQCSAQISELSDASTTFDDDEGPIELP---PTIFATTDNPSSLQV
        MLQ+Q+ L SNFPF L   H    SP + SS +P + L RPIT   VS H R R +   AQ+ E    +TT  +DEGPIELP   P+IFA TD+PS+LQV
Subjt:  MLQTQNFLFSNFPFTLSLTHKAFLSPPTLSS-TPFSSLCRPITFHSVSPHHRLRTQQCSAQISELSDASTTFDDDEGPIELP---PTIFATTDNPSSLQV

Query:  ATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDSF
        ATSVLLTG+IS+FLFRSLRRRA+RAKELKFRS+G KKSLKEEA+DSLKA++  P+++KS PSP+QA LG + AGVIALILYKFTTTIEAALNRQT+SD+F
Subjt:  ATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDSF

Query:  SVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSS
        SV+Q+TITIRTIV G+CYLATFVFGIN+VGL LYSGQLA+NS+M + S  KET  K + Q+S PNST ++  DS+E SSS  DQSS
Subjt:  SVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSS

A0A5D3CH90 DUF3082 domain-containing protein8.5e-10378.47Show/hide
Query:  MLQTQNFLFSNFP-FTLS---LTHKAFLSPPTLSSTPFSSLCRPITFHSVSPHHRLRTQQCSA--QISELSDASTTFDDDEGPIELPPTIFATTDNPSSL
        M  TQN L SNFP FTLS     HK FLSPP    T  SSL RPITFHS+SP   L T +C    Q ++L+DA  TF DD GP+ELPPTIFATTDNPSSL
Subjt:  MLQTQNFLFSNFP-FTLS---LTHKAFLSPPTLSSTPFSSLCRPITFHSVSPHHRLRTQQCSA--QISELSDASTTFDDDEGPIELPPTIFATTDNPSSL

Query:  QVATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD
        QVATSVLLTG+IS+FLFRSLRRRA+R KELKFRSAGVKKSLKEEA+DSLKAISTGPI SKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQT+SD
Subjt:  QVATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD

Query:  SFSVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSS
        +FSV+QLTITIRTIV GLCYLATFVFGINA+GLFLYSGQLA+NS+MEE SKDKE   K D+QVSPP ST E  LDSTESS+SKDDQSS
Subjt:  SFSVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSS

A0A6J1CM21 uncharacterized protein LOC1110128581.3e-11483.97Show/hide
Query:  MLQTQNFLFSNFPFTLSLTHKAFLSPPTLSSTPFSSLCRPITFHSVSPHHRLRTQQCSAQISELSDASTTFDDDEGPIELPPTIFATTDNPSSLQVATSV
        MLQT NFL S FP TLSLTHK+ LSPP+L     SSL RPITF  +S H RLR QQCS QISELS+A+ TFD+D+GP+ELPPTIFATTD+PSSLQVATSV
Subjt:  MLQTQNFLFSNFPFTLSLTHKAFLSPPTLSSTPFSSLCRPITFHSVSPHHRLRTQQCSAQISELSDASTTFDDDEGPIELPPTIFATTDNPSSLQVATSV

Query:  LLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDSFSVQQ
        LLTG+ISIFLFRSLRRRA+RAKELKFRS GVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD+FSV+Q
Subjt:  LLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDSFSVQQ

Query:  LTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSSRSNCQ
        +TITIRTIV G+CYLATFVFGINAVGLFLYSGQLAVNS+ME+ S DKETAT  DKQVSPPNSTVE  LDSTESSS+KDDQSS SN Q
Subjt:  LTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSSRSNCQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G15110.1 unknown protein9.8e-5955.38Show/hide
Query:  HSVSPHHRLRTQQCS-AQISELSDASTTFDDDEGPIELPP----------TIFATTDNPSSLQVATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVK
        HS+S   R+R      + + E++D +    +++GPIELP           +IFAT+D+P+ LQ+ATSVLLTG+I++FL RS+RRRA+RAKEL FRS G K
Subjt:  HSVSPHHRLRTQQCS-AQISELSDASTTFDDDEGPIELPP----------TIFATTDNPSSLQVATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVK

Query:  KSLKEEALDSLKAISTGPIE-SKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDSFSVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYS
        KSLKEEA+D+LKA+S+ PIE   STPS  QAFLGAIAAGVIALILYKFT T+E+ LNRQT+SD+FSV+Q+T+T+RTI+ G+CYLATFVFG+NA GL LYS
Subjt:  KSLKEEALDSLKAISTGPIE-SKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDSFSVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYS

Query:  GQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSS
        GQLA N    E +  K T   GD           ++ D++E + S +DQSS
Subjt:  GQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGCAGACCCAGAACTTTCTCTTCTCCAACTTCCCATTCACTCTCTCTCTAACCCACAAAGCCTTTCTCTCTCCTCCCACCCTCTCATCCACCCCCTTCTCCTCTCT
CTGCAGACCCATCACCTTCCACTCTGTTTCACCCCACCATCGACTAAGAACCCAGCAATGCTCAGCCCAAATCTCGGAACTCTCAGACGCCAGCACCACTTTTGATGATG
ACGAGGGCCCAATTGAACTCCCACCCACCATTTTTGCTACCACAGATAACCCTTCTTCTCTCCAAGTTGCTACCAGCGTTCTCCTTACAGGGTCCATCTCCATCTTTCTC
TTTCGCTCCCTCCGCCGCCGCGCTCAGCGGGCCAAAGAGCTGAAATTTAGGTCCGCTGGAGTAAAGAAGTCCTTGAAGGAGGAAGCATTGGATAGCTTGAAAGCAATCAG
TACAGGTCCAATTGAATCTAAGTCGACGCCTTCACCCATACAAGCATTCTTGGGAGCAATAGCAGCCGGTGTTATTGCACTGATCTTATATAAGTTCACCACTACCATCG
AAGCTGCTCTGAATCGACAGACGATGTCTGATAGCTTCTCGGTTCAACAGCTGACAATAACCATAAGAACTATCGTGAAGGGATTATGCTACCTTGCGACATTTGTTTTC
GGAATTAATGCTGTCGGTTTATTCCTTTACTCTGGTCAGTTGGCCGTAAATTCCATGATGGAAGAAGCTTCCAAAGATAAAGAAACTGCAACTAAAGGTGATAAGCAAGT
TAGCCCACCAAATTCAACAGTTGAAGCGAACCTCGATAGCACCGAATCAAGCAGCAGCAAGGATGATCAAAGTTCAAGGTCAAATTGTCAAAATCCAACTCCAGCCGCGC
CGACGCCGAACGTAGCCGGCCGGTGCGGCTCCAGCTTCCTGTCCTTCCAGAAGCAGGAGGCGTCCGGCGCCAGCTTCGCCGACGACCTCATCGGCTCCGACTTGCACCGC
GTCAGCACGAACGGCTCGTACGCCTTCGCACTCACTAGCTTCTGCTCTATCATCGTCGCCATCGAAACGCCGCCCCCCACCGCCGCCTGCGCCGGAAACGAACACGACCA
CCTCGCCGGCTGCACCGCCGCTTGCGTGTTCTCCGTCGGTTTCGTCTCCACGTTGACCCGCTTCTTCGCCCCATCTCGGCCGTTCGTTTTCTCCACTGCCGTCTTCTTCT
CCCTCGTCGGAACGCACCTGATGAAGTCGGTGCTGCAGACCCATGTCTCCTTGGAGACCTCCATTGATAGCTTTGGCTCGTACATCATCAGTAGCAAGCAATCTGGTAGG
ACGGAAGCTTGTGTTTCTCTCTCCGGTTCCGGCGGCGGAGTTTCTCGCTCCTTTGACTGGTCGAGACCCATTTCGTCGTTTTGGATATTGGGTCTGATTTCTTCGTGGGT
CGCTTTCACTTCGTCTTCCCAAGACATCATCATCGTCTCTTTTCCTTGGATTGACCCGCCTGTGTTTTCTTCTTCTTCCCAATTCCCATCAACTTCAGTTCCACCTTCGT
CTGCCGCTTGTTTTGTGTGTCCTGATAATGAAGGCGAAGTGGGCTCACCACTTTCGCCATTTTCGCCATCTTCACCTTCTTCTTCTTCACTCTGTTCTTCGTCTTCTTCT
GTTTCGTGCTCTGGTTCGTTCCCATGGGGTAGTTTGGATTCATCTTCTTCGGCTGATTCCGGCCTCTCTGCATCTAGAGCCACTTCAGGGTCACAGCTGGTTGGAATTGC
CATTGTTTCTTCTGCTAACTTCTCTTGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGCAGACCCAGAACTTTCTCTTCTCCAACTTCCCATTCACTCTCTCTCTAACCCACAAAGCCTTTCTCTCTCCTCCCACCCTCTCATCCACCCCCTTCTCCTCTCT
CTGCAGACCCATCACCTTCCACTCTGTTTCACCCCACCATCGACTAAGAACCCAGCAATGCTCAGCCCAAATCTCGGAACTCTCAGACGCCAGCACCACTTTTGATGATG
ACGAGGGCCCAATTGAACTCCCACCCACCATTTTTGCTACCACAGATAACCCTTCTTCTCTCCAAGTTGCTACCAGCGTTCTCCTTACAGGGTCCATCTCCATCTTTCTC
TTTCGCTCCCTCCGCCGCCGCGCTCAGCGGGCCAAAGAGCTGAAATTTAGGTCCGCTGGAGTAAAGAAGTCCTTGAAGGAGGAAGCATTGGATAGCTTGAAAGCAATCAG
TACAGGTCCAATTGAATCTAAGTCGACGCCTTCACCCATACAAGCATTCTTGGGAGCAATAGCAGCCGGTGTTATTGCACTGATCTTATATAAGTTCACCACTACCATCG
AAGCTGCTCTGAATCGACAGACGATGTCTGATAGCTTCTCGGTTCAACAGCTGACAATAACCATAAGAACTATCGTGAAGGGATTATGCTACCTTGCGACATTTGTTTTC
GGAATTAATGCTGTCGGTTTATTCCTTTACTCTGGTCAGTTGGCCGTAAATTCCATGATGGAAGAAGCTTCCAAAGATAAAGAAACTGCAACTAAAGGTGATAAGCAAGT
TAGCCCACCAAATTCAACAGTTGAAGCGAACCTCGATAGCACCGAATCAAGCAGCAGCAAGGATGATCAAAGTTCAAGGTCAAATTGTCAAAATCCAACTCCAGCCGCGC
CGACGCCGAACGTAGCCGGCCGGTGCGGCTCCAGCTTCCTGTCCTTCCAGAAGCAGGAGGCGTCCGGCGCCAGCTTCGCCGACGACCTCATCGGCTCCGACTTGCACCGC
GTCAGCACGAACGGCTCGTACGCCTTCGCACTCACTAGCTTCTGCTCTATCATCGTCGCCATCGAAACGCCGCCCCCCACCGCCGCCTGCGCCGGAAACGAACACGACCA
CCTCGCCGGCTGCACCGCCGCTTGCGTGTTCTCCGTCGGTTTCGTCTCCACGTTGACCCGCTTCTTCGCCCCATCTCGGCCGTTCGTTTTCTCCACTGCCGTCTTCTTCT
CCCTCGTCGGAACGCACCTGATGAAGTCGGTGCTGCAGACCCATGTCTCCTTGGAGACCTCCATTGATAGCTTTGGCTCGTACATCATCAGTAGCAAGCAATCTGGTAGG
ACGGAAGCTTGTGTTTCTCTCTCCGGTTCCGGCGGCGGAGTTTCTCGCTCCTTTGACTGGTCGAGACCCATTTCGTCGTTTTGGATATTGGGTCTGATTTCTTCGTGGGT
CGCTTTCACTTCGTCTTCCCAAGACATCATCATCGTCTCTTTTCCTTGGATTGACCCGCCTGTGTTTTCTTCTTCTTCCCAATTCCCATCAACTTCAGTTCCACCTTCGT
CTGCCGCTTGTTTTGTGTGTCCTGATAATGAAGGCGAAGTGGGCTCACCACTTTCGCCATTTTCGCCATCTTCACCTTCTTCTTCTTCACTCTGTTCTTCGTCTTCTTCT
GTTTCGTGCTCTGGTTCGTTCCCATGGGGTAGTTTGGATTCATCTTCTTCGGCTGATTCCGGCCTCTCTGCATCTAGAGCCACTTCAGGGTCACAGCTGGTTGGAATTGC
CATTGTTTCTTCTGCTAACTTCTCTTGCTGA
Protein sequenceShow/hide protein sequence
MLQTQNFLFSNFPFTLSLTHKAFLSPPTLSSTPFSSLCRPITFHSVSPHHRLRTQQCSAQISELSDASTTFDDDEGPIELPPTIFATTDNPSSLQVATSVLLTGSISIFL
FRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDSFSVQQLTITIRTIVKGLCYLATFVF
GINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSSRSNCQNPTPAAPTPNVAGRCGSSFLSFQKQEASGASFADDLIGSDLHR
VSTNGSYAFALTSFCSIIVAIETPPPTAACAGNEHDHLAGCTAACVFSVGFVSTLTRFFAPSRPFVFSTAVFFSLVGTHLMKSVLQTHVSLETSIDSFGSYIISSKQSGR
TEACVSLSGSGGGVSRSFDWSRPISSFWILGLISSWVAFTSSSQDIIIVSFPWIDPPVFSSSSQFPSTSVPPSSAACFVCPDNEGEVGSPLSPFSPSSPSSSSLCSSSSS
VSCSGSFPWGSLDSSSSADSGLSASRATSGSQLVGIAIVSSANFSC