; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr003793 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr003793
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionDUF3082 domain-containing protein
Genome locationtig00002439:28202..38093
RNA-Seq ExpressionSgr003793
SyntenySgr003793
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR021434 - Protein of unknown function DUF3082


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600385.1 hypothetical protein SDJN03_05618, partial [Cucurbita argyrosperma subsp. sororia]2.5e-10178.6Show/hide
Query:  MLQTQNFL-SSNFPFTLSLTHKAFLSPPTLSSTPFSSLCRPITF-HSVSPHHRLRTQQFSA--QISELSDASTTFDDDEGPIELPPTIFATTDNPSSLQV
        ML T N L SSNFP +LSLTH   LSPP     PFSSL RPIT   SV P   LRT Q     Q SELSDA+ +F DD+GPIELP TIFATTD+PSS+QV
Subjt:  MLQTQNFL-SSNFPFTLSLTHKAFLSPPTLSSTPFSSLCRPITF-HSVSPHHRLRTQQFSA--QISELSDASTTFDDDEGPIELPPTIFATTDNPSSLQV

Query:  ATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDSF
        ATSVLLTG+IS+FLFRSLRRRA+R KELKFRSAGVKKSLKEEA++SLKAISTGPI+SKS PSP+QAFLGAIAAGVIALILYKFTTTIEAALNRQT+SD+F
Subjt:  ATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDSF

Query:  SVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQS
        SV+QLTITIRTIV G+CYLATFVFGINAVGLFLYSGQLA+NS+MEE S+ KE ATKGDKQVS PNSTVE  LD TESSSSKDDQS
Subjt:  SVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQS

TYK11253.1 DUF3082 domain-containing protein [Cucumis melo var. makuwa]5.1e-10278.47Show/hide
Query:  MLQTQNFLSSNFP-FTLS---LTHKAFLSPPTLSSTPFSSLCRPITFHSVSPHHRLRTQQFSA--QISELSDASTTFDDDEGPIELPPTIFATTDNPSSL
        M  TQN LSSNFP FTLS     HK FLSPP    T  SSL RPITFHS+SP   L T +     Q ++L+DA  TF DD GP+ELPPTIFATTDNPSSL
Subjt:  MLQTQNFLSSNFP-FTLS---LTHKAFLSPPTLSSTPFSSLCRPITFHSVSPHHRLRTQQFSA--QISELSDASTTFDDDEGPIELPPTIFATTDNPSSL

Query:  QVATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD
        QVATSVLLTG+IS+FLFRSLRRRA+R KELKFRSAGVKKSLKEEA+DSLKAISTGPI SKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQT+SD
Subjt:  QVATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD

Query:  SFSVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSS
        +FSV+QLTITIRTIV GLCYLATFVFGINA+GLFLYSGQLA+NS+MEE SKDKE   K D+QVSPP ST E  LDSTESS+SKDDQSS
Subjt:  SFSVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSS

XP_022142844.1 uncharacterized protein LOC111012858 [Momordica charantia]1.0e-11383.97Show/hide
Query:  MLQTQNFLSSNFPFTLSLTHKAFLSPPTLSSTPFSSLCRPITFHSVSPHHRLRTQQFSAQISELSDASTTFDDDEGPIELPPTIFATTDNPSSLQVATSV
        MLQT NFLSS FP TLSLTHK+ LSPP+L     SSL RPITF  +S H RLR QQ S QISELS+A+ TFD+D+GP+ELPPTIFATTD+PSSLQVATSV
Subjt:  MLQTQNFLSSNFPFTLSLTHKAFLSPPTLSSTPFSSLCRPITFHSVSPHHRLRTQQFSAQISELSDASTTFDDDEGPIELPPTIFATTDNPSSLQVATSV

Query:  LLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDSFSVQQ
        LLTG+ISIFLFRSLRRRA+RAKELKFRS GVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD+FSV+Q
Subjt:  LLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDSFSVQQ

Query:  LTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSSRSNCQ
        +TITIRTIV G+CYLATFVFGINAVGLFLYSGQLAVNS+ME+ S DKETAT  DKQVSPPNSTVE  LDSTESSS+KDDQSS SN Q
Subjt:  LTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSSRSNCQ

XP_031739166.1 uncharacterized protein LOC101221005 [Cucumis sativus]4.3e-10177.78Show/hide
Query:  MLQTQNFLSSNFP-FTLS---LTHKAFLSPPTLSSTPFSSLCRPITFHSVSP--HHRLRTQQFSAQISELSDASTTFDDDEGPIELPPTIFATTDNPSSL
        M  TQN LSSNFP FTLS     HK FLSPP    T  SSL RPITFHSVSP  +HR        Q ++L+DA  TF DD GP+ELPPTIFATTD+PSSL
Subjt:  MLQTQNFLSSNFP-FTLS---LTHKAFLSPPTLSSTPFSSLCRPITFHSVSP--HHRLRTQQFSAQISELSDASTTFDDDEGPIELPPTIFATTDNPSSL

Query:  QVATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD
        QVATSVLLTG+IS+FLFRSLRRRA+R KELKFRS GVKKSLKEEA+DSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEA+LNRQT+SD
Subjt:  QVATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD

Query:  SFSVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSS
        +FSV+QLTITIRTIV GLCYLATFVFGINA+GLFLYSGQLA+NS+MEE SKD E   K D+QVSPP ST E  LDSTESS+SKDDQSS
Subjt:  SFSVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSS

XP_038905614.1 uncharacterized protein LOC120091579 [Benincasa hispida]1.8e-10780.89Show/hide
Query:  MLQTQNFLSSNFP-FTLSLT--HKAFLSPPTLSSTPFSSLCRPITFHSVSP---HHRLRTQQFSAQISELSDASTTFDDDEGPIELPPTIFATTDNPSSL
        MLQTQN LSSNFP FTLSLT  HK FLSPPT SS+  SSL RPI FHSVSP   HH     QF    SEL+DA  TF DD GP+ELP TIFATTD+PSSL
Subjt:  MLQTQNFLSSNFP-FTLSLT--HKAFLSPPTLSSTPFSSLCRPITFHSVSP---HHRLRTQQFSAQISELSDASTTFDDDEGPIELPPTIFATTDNPSSL

Query:  QVATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD
        QVATSVLLTG+IS+FLFRSLRRRA+R KELKFRSAGVKKSLKEEA+DSLKAISTGPIESKSTPSPIQAFLGAIAAGVIA+ILYKFTTTIEAALNRQT+SD
Subjt:  QVATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD

Query:  SFSVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSSRSNCQ
        +FSV+QLTITIRTIV GLCYLATFVFGINA+GLFLYSGQLA+NS+MEE SKDKE   KGDKQVSPPNST E  L+STESS+S+DDQSS SN Q
Subjt:  SFSVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSSRSNCQ

TrEMBL top hitse value%identityAlignment
A0A1S4E0Y7 LOW QUALITY PROTEIN: uncharacterized protein LOC1034962104.7e-10177.78Show/hide
Query:  MLQTQNFLSSNFP-FTLS---LTHKAFLSPPTLSSTPFSSLCRPITFHSVSPHHRLRTQQFSA--QISELSDASTTFDDDEGPIELPPTIFATTDNPSSL
        M  TQN LSSN P FTLS     HK FLSPP    T  SSL RPITFHS+SP   L T +     Q ++L+DA  TF DD GP+ELPPTIFATTDNPSSL
Subjt:  MLQTQNFLSSNFP-FTLS---LTHKAFLSPPTLSSTPFSSLCRPITFHSVSPHHRLRTQQFSA--QISELSDASTTFDDDEGPIELPPTIFATTDNPSSL

Query:  QVATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD
        QVATSVLLTG+IS+FLFRSLRRRA+R KELKFRSAGVKKSLKEEA+DSLKAISTGPI SKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQT+SD
Subjt:  QVATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD

Query:  SFSVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSS
        +FSV+QLTITIRTIV GLCYLATFVFGINA+GLFLYSGQLA+NS+MEE SKDKE   K D+QVSPP ST E  L+STESS+SKDDQSS
Subjt:  SFSVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSS

A0A5A7UXD2 DUF3082 domain-containing protein4.7e-10177.78Show/hide
Query:  MLQTQNFLSSNFP-FTLS---LTHKAFLSPPTLSSTPFSSLCRPITFHSVSPHHRLRTQQFSA--QISELSDASTTFDDDEGPIELPPTIFATTDNPSSL
        M  TQN LSSN P FTLS     HK FLSPP    T  SSL RPITFHS+SP   L T +     Q ++L+DA  TF DD GP+ELPPTIFATTDNPSSL
Subjt:  MLQTQNFLSSNFP-FTLS---LTHKAFLSPPTLSSTPFSSLCRPITFHSVSPHHRLRTQQFSA--QISELSDASTTFDDDEGPIELPPTIFATTDNPSSL

Query:  QVATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD
        QVATSVLLTG+IS+FLFRSLRRRA+R KELKFRSAGVKKSLKEEA+DSLKAISTGPI SKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQT+SD
Subjt:  QVATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD

Query:  SFSVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSS
        +FSV+QLTITIRTIV GLCYLATFVFGINA+GLFLYSGQLA+NS+MEE SKDKE   K D+QVSPP ST E  L+STESS+SKDDQSS
Subjt:  SFSVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSS

A0A5B7BP03 Uncharacterized protein (Fragment)7.5e-8365.73Show/hide
Query:  MLQTQNFLSSNFPFTLSLTHKAFLSPPTLSS-TPFSSLCRPITFHSVSPHHRLRTQQFSAQISELSDASTTFDDDEGPIELP---PTIFATTDNPSSLQV
        MLQ+Q+ LSSNFPF L   H    SP + SS +P + L RPIT   VS H R R + + AQ+ E    +TT  +DEGPIELP   P+IFA TD+PS+LQV
Subjt:  MLQTQNFLSSNFPFTLSLTHKAFLSPPTLSS-TPFSSLCRPITFHSVSPHHRLRTQQFSAQISELSDASTTFDDDEGPIELP---PTIFATTDNPSSLQV

Query:  ATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDSF
        ATSVLLTG+IS+FLFRSLRRRA+RAKELKFRS+G KKSLKEEA+DSLKA++  P+++KS PSP+QA LG + AGVIALILYKFTTTIEAALNRQT+SD+F
Subjt:  ATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDSF

Query:  SVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSS
        SV+Q+TITIRTIV G+CYLATFVFGIN+VGL LYSGQLA+NS+M + S  KET  K + Q+S PNST ++  DS+E SSS  DQSS
Subjt:  SVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSS

A0A5D3CH90 DUF3082 domain-containing protein2.5e-10278.47Show/hide
Query:  MLQTQNFLSSNFP-FTLS---LTHKAFLSPPTLSSTPFSSLCRPITFHSVSPHHRLRTQQFSA--QISELSDASTTFDDDEGPIELPPTIFATTDNPSSL
        M  TQN LSSNFP FTLS     HK FLSPP    T  SSL RPITFHS+SP   L T +     Q ++L+DA  TF DD GP+ELPPTIFATTDNPSSL
Subjt:  MLQTQNFLSSNFP-FTLS---LTHKAFLSPPTLSSTPFSSLCRPITFHSVSPHHRLRTQQFSA--QISELSDASTTFDDDEGPIELPPTIFATTDNPSSL

Query:  QVATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD
        QVATSVLLTG+IS+FLFRSLRRRA+R KELKFRSAGVKKSLKEEA+DSLKAISTGPI SKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQT+SD
Subjt:  QVATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD

Query:  SFSVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSS
        +FSV+QLTITIRTIV GLCYLATFVFGINA+GLFLYSGQLA+NS+MEE SKDKE   K D+QVSPP ST E  LDSTESS+SKDDQSS
Subjt:  SFSVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSS

A0A6J1CM21 uncharacterized protein LOC1110128584.8e-11483.97Show/hide
Query:  MLQTQNFLSSNFPFTLSLTHKAFLSPPTLSSTPFSSLCRPITFHSVSPHHRLRTQQFSAQISELSDASTTFDDDEGPIELPPTIFATTDNPSSLQVATSV
        MLQT NFLSS FP TLSLTHK+ LSPP+L     SSL RPITF  +S H RLR QQ S QISELS+A+ TFD+D+GP+ELPPTIFATTD+PSSLQVATSV
Subjt:  MLQTQNFLSSNFPFTLSLTHKAFLSPPTLSSTPFSSLCRPITFHSVSPHHRLRTQQFSAQISELSDASTTFDDDEGPIELPPTIFATTDNPSSLQVATSV

Query:  LLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDSFSVQQ
        LLTG+ISIFLFRSLRRRA+RAKELKFRS GVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSD+FSV+Q
Subjt:  LLTGSISIFLFRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDSFSVQQ

Query:  LTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSSRSNCQ
        +TITIRTIV G+CYLATFVFGINAVGLFLYSGQLAVNS+ME+ S DKETAT  DKQVSPPNSTVE  LDSTESSS+KDDQSS SN Q
Subjt:  LTITIRTIVKGLCYLATFVFGINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSSRSNCQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G15110.1 unknown protein5.7e-5955.38Show/hide
Query:  HSVSPHHRLRTQQFS-AQISELSDASTTFDDDEGPIELPP----------TIFATTDNPSSLQVATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVK
        HS+S   R+R      + + E++D +    +++GPIELP           +IFAT+D+P+ LQ+ATSVLLTG+I++FL RS+RRRA+RAKEL FRS G K
Subjt:  HSVSPHHRLRTQQFS-AQISELSDASTTFDDDEGPIELPP----------TIFATTDNPSSLQVATSVLLTGSISIFLFRSLRRRAQRAKELKFRSAGVK

Query:  KSLKEEALDSLKAISTGPIE-SKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDSFSVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYS
        KSLKEEA+D+LKA+S+ PIE   STPS  QAFLGAIAAGVIALILYKFT T+E+ LNRQT+SD+FSV+Q+T+T+RTI+ G+CYLATFVFG+NA GL LYS
Subjt:  KSLKEEALDSLKAISTGPIE-SKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDSFSVQQLTITIRTIVKGLCYLATFVFGINAVGLFLYS

Query:  GQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSS
        GQLA N    E +  K T   GD           ++ D++E + S +DQSS
Subjt:  GQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGCAGACCCAGAACTTTCTCTCCTCCAACTTCCCATTCACTCTCTCTCTAACCCACAAAGCCTTTCTCTCTCCTCCCACCCTCTCATCCACCCCCTTCTCCTCTCT
CTGCAGACCCATCACCTTCCACTCTGTTTCACCCCACCATCGACTCAGAACTCAGCAATTCTCAGCCCAAATCTCGGAACTCTCAGACGCCAGCACCACTTTTGATGATG
ACGAGGGCCCAATTGAACTCCCACCCACCATTTTTGCTACCACAGATAACCCTTCTTCTCTCCAAGTTGCTACCAGCGTTCTCCTTACAGGGTCCATCTCCATCTTTCTC
TTTCGCTCCCTCCGCCGCCGCGCTCAGCGGGCCAAAGAGCTGAAATTTAGGTCTGCTGGAGTAAAGAAGTCCTTGAAGGAGGAAGCATTGGATAGCTTGAAAGCAATCAG
TACAGGTCCAATTGAATCTAAGTCGACGCCTTCACCCATACAAGCATTCTTGGGAGCAATAGCAGCCGGTGTTATTGCACTGATCTTATATAAGTTCACCACTACCATAG
AAGCTGCTCTGAACCGACAGACGATGTCTGATAGCTTCTCGGTTCAACAGCTGACAATAACCATAAGAACTATCGTGAAGGGATTATGCTACCTTGCGACATTTGTTTTC
GGAATTAATGCTGTGGGTTTATTCCTTTACTCTGGTCAGTTGGCCGTAAATTCCATGATGGAAGAAGCTTCCAAAGATAAAGAAACTGCAACTAAAGGTGATAAGCAAGT
TAGCCCACCAAATTCAACAGTTGAAGCGAACCTCGATAGCACCGAATCAAGCAGCAGCAAGGATGATCAAAGTTCAAGGTCAAATTGTCAAAATCCAACTCCAGCCGCGC
CGACGCCGAACGTAGCCGGCCGGTGCGGCTCCAGCTTCCTGTCCTTCCAGAAGCAGGAGGCGTCCGGCGCCAGCTTCGCCGACGACCTCATCGGCTCCGACTTGCACCGC
GTCAGCACGAACGGCTCGTACGCCTTCGCACTCACTAGCTTCTGCTCTATCATCGTCGCCATCGAAACGCCGCCCCCCACCGCCGCCTGCGCCGGAAACGAACACGACCA
CCTCGCCGGCTGCACCGCCGCTTGCGTGTTCTCCGTCGGTTTCGTCTCCACGTTGACCCGCTTCTTCGCCCCATCTCGGCCGTTCGTTTTCTCCACTGCCGTCTTCTTCT
CCCTCGTCGGAACGCACCTGATGAAGTCGGTGCTGCAGACCCATGTCTCCTTGGATACCTCCATTGATAGCTTTGGCTCGTACATCATCAGGAGCAAGCAATCTGGTAGG
ACGGCAGCTTGTGTTTCTCTCTCCGGTTCCGGCGGCGGAGTTTCTCGCTCCTTTGACTGGTCGGGACCCATTTCGTCGTTTTGGATATTGGGTCTGATTTCTTCGTGGGT
CGCTTTCTCTTCGTCTTCCCAAGACATCATCATCGTCTCTTTTCCTTGGATTGACCCGCCTGTGTTTTCTTCTTCTTCCCAATTCCCATCAACTTCAGTTCCACCTTCGT
CTGCCGCTTGTTTTGTGTGTCCTGATAATGAAGGCGAAGTGGGCTCACCACTTTCGCCATTTTCGCCATCTTCACCTTCTTCTTCTTCACTCTGTTCTTCGTCTTCTTCT
GTTTCGTGCTCTGGTTCGTTCCCATGGGGTAGTTTGGATTCATCTTCTTCGGCTGATTCCGGCCTCTCTGCATCTAGAGCCACTTCAGGGTCACAGCTGGTTGGAATTGC
CATTGTTTCTTCTGCTAACTTCTCTTGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGCAGACCCAGAACTTTCTCTCCTCCAACTTCCCATTCACTCTCTCTCTAACCCACAAAGCCTTTCTCTCTCCTCCCACCCTCTCATCCACCCCCTTCTCCTCTCT
CTGCAGACCCATCACCTTCCACTCTGTTTCACCCCACCATCGACTCAGAACTCAGCAATTCTCAGCCCAAATCTCGGAACTCTCAGACGCCAGCACCACTTTTGATGATG
ACGAGGGCCCAATTGAACTCCCACCCACCATTTTTGCTACCACAGATAACCCTTCTTCTCTCCAAGTTGCTACCAGCGTTCTCCTTACAGGGTCCATCTCCATCTTTCTC
TTTCGCTCCCTCCGCCGCCGCGCTCAGCGGGCCAAAGAGCTGAAATTTAGGTCTGCTGGAGTAAAGAAGTCCTTGAAGGAGGAAGCATTGGATAGCTTGAAAGCAATCAG
TACAGGTCCAATTGAATCTAAGTCGACGCCTTCACCCATACAAGCATTCTTGGGAGCAATAGCAGCCGGTGTTATTGCACTGATCTTATATAAGTTCACCACTACCATAG
AAGCTGCTCTGAACCGACAGACGATGTCTGATAGCTTCTCGGTTCAACAGCTGACAATAACCATAAGAACTATCGTGAAGGGATTATGCTACCTTGCGACATTTGTTTTC
GGAATTAATGCTGTGGGTTTATTCCTTTACTCTGGTCAGTTGGCCGTAAATTCCATGATGGAAGAAGCTTCCAAAGATAAAGAAACTGCAACTAAAGGTGATAAGCAAGT
TAGCCCACCAAATTCAACAGTTGAAGCGAACCTCGATAGCACCGAATCAAGCAGCAGCAAGGATGATCAAAGTTCAAGGTCAAATTGTCAAAATCCAACTCCAGCCGCGC
CGACGCCGAACGTAGCCGGCCGGTGCGGCTCCAGCTTCCTGTCCTTCCAGAAGCAGGAGGCGTCCGGCGCCAGCTTCGCCGACGACCTCATCGGCTCCGACTTGCACCGC
GTCAGCACGAACGGCTCGTACGCCTTCGCACTCACTAGCTTCTGCTCTATCATCGTCGCCATCGAAACGCCGCCCCCCACCGCCGCCTGCGCCGGAAACGAACACGACCA
CCTCGCCGGCTGCACCGCCGCTTGCGTGTTCTCCGTCGGTTTCGTCTCCACGTTGACCCGCTTCTTCGCCCCATCTCGGCCGTTCGTTTTCTCCACTGCCGTCTTCTTCT
CCCTCGTCGGAACGCACCTGATGAAGTCGGTGCTGCAGACCCATGTCTCCTTGGATACCTCCATTGATAGCTTTGGCTCGTACATCATCAGGAGCAAGCAATCTGGTAGG
ACGGCAGCTTGTGTTTCTCTCTCCGGTTCCGGCGGCGGAGTTTCTCGCTCCTTTGACTGGTCGGGACCCATTTCGTCGTTTTGGATATTGGGTCTGATTTCTTCGTGGGT
CGCTTTCTCTTCGTCTTCCCAAGACATCATCATCGTCTCTTTTCCTTGGATTGACCCGCCTGTGTTTTCTTCTTCTTCCCAATTCCCATCAACTTCAGTTCCACCTTCGT
CTGCCGCTTGTTTTGTGTGTCCTGATAATGAAGGCGAAGTGGGCTCACCACTTTCGCCATTTTCGCCATCTTCACCTTCTTCTTCTTCACTCTGTTCTTCGTCTTCTTCT
GTTTCGTGCTCTGGTTCGTTCCCATGGGGTAGTTTGGATTCATCTTCTTCGGCTGATTCCGGCCTCTCTGCATCTAGAGCCACTTCAGGGTCACAGCTGGTTGGAATTGC
CATTGTTTCTTCTGCTAACTTCTCTTGCTGA
Protein sequenceShow/hide protein sequence
MLQTQNFLSSNFPFTLSLTHKAFLSPPTLSSTPFSSLCRPITFHSVSPHHRLRTQQFSAQISELSDASTTFDDDEGPIELPPTIFATTDNPSSLQVATSVLLTGSISIFL
FRSLRRRAQRAKELKFRSAGVKKSLKEEALDSLKAISTGPIESKSTPSPIQAFLGAIAAGVIALILYKFTTTIEAALNRQTMSDSFSVQQLTITIRTIVKGLCYLATFVF
GINAVGLFLYSGQLAVNSMMEEASKDKETATKGDKQVSPPNSTVEANLDSTESSSSKDDQSSRSNCQNPTPAAPTPNVAGRCGSSFLSFQKQEASGASFADDLIGSDLHR
VSTNGSYAFALTSFCSIIVAIETPPPTAACAGNEHDHLAGCTAACVFSVGFVSTLTRFFAPSRPFVFSTAVFFSLVGTHLMKSVLQTHVSLDTSIDSFGSYIIRSKQSGR
TAACVSLSGSGGGVSRSFDWSGPISSFWILGLISSWVAFSSSSQDIIIVSFPWIDPPVFSSSSQFPSTSVPPSSAACFVCPDNEGEVGSPLSPFSPSSPSSSSLCSSSSS
VSCSGSFPWGSLDSSSSADSGLSASRATSGSQLVGIAIVSSANFSC