; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr012364 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr012364
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionDUF1685 domain-containing protein
Genome locationtig00153348:23268..24023
RNA-Seq ExpressionSgr012364
SyntenySgr012364
Gene Ontology termsNA
InterPro domainsIPR012881 - Protein of unknown function DUF1685


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057058.1 DUF1685 domain-containing protein [Cucumis melo var. makuwa]2.8e-7366.41Show/hide
Query:  MDVEQVLNLFDSFWFERRIFNSDHPFSSTSSQNPELDNPEKEPLNLPAEAAAFPRVTSIRTRSMSEDLSSKLTFISTSHSPDSVLFSPKLQTILSSKDIA
        MD EQ+LNLFDSFWFER +FN  HPF   S+  P++ +P+    +LP E+   PR   + TRS+SEDLSSKL+F+S+S+SPDSVLFSPKLQTI SSKDIA
Subjt:  MDVEQVLNLFDSFWFERRIFNSDHPFSSTSSQNPELDNPEKEPLNLPAEAAAFPRVTSIRTRSMSEDLSSKLTFISTSHSPDSVLFSPKLQTILSSKDIA

Query:  GSESPENNRK-ERPQQLKAASRKRIRGRGRRRASESKSLSELEFEELKGFMDLGFVFSEEDRDSSLASIIPGLNRLGKRDQEEDEEDEEEEEASVDK--S
        G+ESPE +RK E  ++ K   R+R RGR R R SES+SLSELEFEELKGFMDLGFVFSEED+ SSLASI+PGLNRLGK++++E +E+EEEEE    K   
Subjt:  GSESPENNRK-ERPQQLKAASRKRIRGRGRRRASESKSLSELEFEELKGFMDLGFVFSEEDRDSSLASIIPGLNRLGKRDQEEDEEDEEEEEASVDK--S

Query:  QISRPYLSEAWEAMERDEEK----KLPL-MNWRFPALSNEIDMKDNLKWWAHTVASTVR
        +ISRPYLSEAWEAME +EEK    K PL M WRFP  SN+IDMKDNLKWWAH VASTVR
Subjt:  QISRPYLSEAWEAMERDEEK----KLPL-MNWRFPALSNEIDMKDNLKWWAHTVASTVR

XP_022140169.1 uncharacterized protein LOC111010901 [Momordica charantia]1.6e-7668.75Show/hide
Query:  MDVEQVLNLFDSFWFERRIFNSDHPFSSTSSQNPELDNPEKEPLNLPAEA-AAFPRVTSIRTRSMSEDLSSKLTFISTSHSPDSVLFSPKLQTILSSKDI
        MDVEQ+LNLFDS WFER IFN           NPE +N EK   N P  +    PR   I TRS+SEDLSSKL+F+S S+SPDS+LFSPKLQTI SSKDI
Subjt:  MDVEQVLNLFDSFWFERRIFNSDHPFSSTSSQNPELDNPEKEPLNLPAEA-AAFPRVTSIRTRSMSEDLSSKLTFISTSHSPDSVLFSPKLQTILSSKDI

Query:  AGSESPENNRKERPQQLKAASRKRIRGRG---RRRASESKSLSELEFEELKGFMDLGFVFSEEDRDSSLASIIPGLNRLGKRDQEEDEEDEEEEEASVDK
        A +ESPENNRKE   +L+   RKR +GRG   RRR  ESKSLSELEFEELKGFMDLGFVFSEED+DSSLASIIPGLNRL K++ EE+EE EE E      
Subjt:  AGSESPENNRKERPQQLKAASRKRIRGRG---RRRASESKSLSELEFEELKGFMDLGFVFSEEDRDSSLASIIPGLNRLGKRDQEEDEEDEEEEEASVDK

Query:  SQISRPYLSEAWEAMERDEE-KKLPLMNWRFPALSNEIDMKDNLKWWAHTVASTVR
          ISRPYLSEAWEAME++ E KK PLM W FPAL NEIDMKDNLKWWAHTVASTVR
Subjt:  SQISRPYLSEAWEAMERDEE-KKLPLMNWRFPALSNEIDMKDNLKWWAHTVASTVR

XP_023001638.1 uncharacterized protein LOC111495710 [Cucurbita maxima]7.2e-7469.17Show/hide
Query:  MDVEQVLNLFDSFWFERRIFNSDHPFSSTSSQNPELDNPEKEPL-NLPAEAAAFPRVTSIRTRSMSEDLSSKLTFISTSHSPDSVLFSPKLQTILSSKDI
        MDVEQVLNLFDSFWFER IFN  HPF  T+ QNP  +N +++PL N P E    PR+     RS+SEDLSSKLTF+S S SPDSVLFSPKLQTILSSKDI
Subjt:  MDVEQVLNLFDSFWFERRIFNSDHPFSSTSSQNPELDNPEKEPL-NLPAEAAAFPRVTSIRTRSMSEDLSSKLTFISTSHSPDSVLFSPKLQTILSSKDI

Query:  AGSESPENNRKERPQQLKAASRKRIRGRGRRRASESKSLSELEFEELKGFMDLGFVFSEEDRDSSLASIIPGLNRLGKRDQEEDEEDEEEEEASVDKSQI
        AG E PE +R+   +Q K   R+RI GR   R SES+SLSELEFEELKGFMDLGFVFSE D+ SSLA I+PGLNRLGKRD+EE+EE+EEEEE       I
Subjt:  AGSESPENNRKERPQQLKAASRKRIRGRGRRRASESKSLSELEFEELKGFMDLGFVFSEEDRDSSLASIIPGLNRLGKRDQEEDEEDEEEEEASVDKSQI

Query:  SRPYLSEAWEAMERDEE-KKLPLMNWRFPALSNEIDMKDNLKWWAHTVASTVR
        SRPYLSEAW AME++EE KK  +M WR PA  NEIDMKDNLKWWAH VASTVR
Subjt:  SRPYLSEAWEAMERDEE-KKLPLMNWRFPALSNEIDMKDNLKWWAHTVASTVR

XP_031737252.1 uncharacterized protein LOC105434498 [Cucumis sativus]8.3e-7063.36Show/hide
Query:  MDVEQVLNLFDSFWFERRIFNSDHPFSSTSSQNPELDNPE-KEPLNLPAEAAAFPRVTSIRTRSMSEDLSSKLTFISTSHSPDSVLFSPKLQTILSSKDI
        MD + +LNLFDSFWF+R++ N +HPF S    NP++  P+ ++P  LP E+   PR   +RTRS+SEDLSSKL+F+S S+SPDSVL SPKLQTI SSKDI
Subjt:  MDVEQVLNLFDSFWFERRIFNSDHPFSSTSSQNPELDNPE-KEPLNLPAEAAAFPRVTSIRTRSMSEDLSSKLTFISTSHSPDSVLFSPKLQTILSSKDI

Query:  AGSESPENNRK-ERPQQLKAASRKRIRGRGRRRASESKSLSELEFEELKGFMDLGFVFSEEDRDSSLASIIPGLNRLGKRDQEEDEEDEEEEEASVDK--
        AG+ESPE + K E  ++ K   R+R+RGR R R SES+SLSELEFEELKGFMDLGFVFSEED+ SSLASI+PGLNRLGKR+++ ++E EEEEE   ++  
Subjt:  AGSESPENNRK-ERPQQLKAASRKRIRGRGRRRASESKSLSELEFEELKGFMDLGFVFSEEDRDSSLASIIPGLNRLGKRDQEEDEEDEEEEEASVDK--

Query:  --SQISRPYLSEAWEAMERDEEK----KLPL-MNWRFPALSNEIDMKDNLKWWAHTVASTVR
           +ISRPYLSEAWEA+  +EEK    K PL M WRFP  SN+IDMKDNLKWWAH VASTVR
Subjt:  --SQISRPYLSEAWEAMERDEEK----KLPL-MNWRFPALSNEIDMKDNLKWWAHTVASTVR

XP_038895996.1 uncharacterized protein LOC120084174 [Benincasa hispida]6.1e-7366.93Show/hide
Query:  MDVEQVLNLFDSFWFERRIFNSDHPFSSTSSQNPELDNPEKEPLNLPAEAAAFPRVTSIRTRSMSEDLSSKLTFISTSHSPDSVLFSPKLQTILSSKDIA
        MD EQ+LNLFDSFWFE  IFN  HPF S    NP+   PE +  +LP E    PR   +RTRS+SEDLSSKL+F+S S+SPDSVLFSPKLQTI SSKDIA
Subjt:  MDVEQVLNLFDSFWFERRIFNSDHPFSSTSSQNPELDNPEKEPLNLPAEAAAFPRVTSIRTRSMSEDLSSKLTFISTSHSPDSVLFSPKLQTILSSKDIA

Query:  GSESPENNRK----ERPQQLKAASRKRIRGRGRRRASESKSLSELEFEELKGFMDLGFVFSEEDRDSSLASIIPGLNRLGKRDQEEDEEDEEEEEASVDK
        G+ESPEN+RK     RP   K  SR+++RGR R R SES+SLSELEFEELKGFMDLGFVFSEED+ SSLASI+PGLNRLGK ++E+ EE+EE +      
Subjt:  GSESPENNRK----ERPQQLKAASRKRIRGRGRRRASESKSLSELEFEELKGFMDLGFVFSEEDRDSSLASIIPGLNRLGKRDQEEDEEDEEEEEASVDK

Query:  SQISRPYLSEAWEAMERDEEK-KLPL-MNWRFPALSNEIDMKDNLKWWAHTVASTVR
         +ISRPYLSEAWEAME +EE+ K PL M W+FP  SN+IDMKDNLKWWAH VASTVR
Subjt:  SQISRPYLSEAWEAMERDEEK-KLPL-MNWRFPALSNEIDMKDNLKWWAHTVASTVR

TrEMBL top hitse value%identityAlignment
A0A0A0LV26 Uncharacterized protein4.0e-7063.36Show/hide
Query:  MDVEQVLNLFDSFWFERRIFNSDHPFSSTSSQNPELDNPE-KEPLNLPAEAAAFPRVTSIRTRSMSEDLSSKLTFISTSHSPDSVLFSPKLQTILSSKDI
        MD + +LNLFDSFWF+R++ N +HPF S    NP++  P+ ++P  LP E+   PR   +RTRS+SEDLSSKL+F+S S+SPDSVL SPKLQTI SSKDI
Subjt:  MDVEQVLNLFDSFWFERRIFNSDHPFSSTSSQNPELDNPE-KEPLNLPAEAAAFPRVTSIRTRSMSEDLSSKLTFISTSHSPDSVLFSPKLQTILSSKDI

Query:  AGSESPENNRK-ERPQQLKAASRKRIRGRGRRRASESKSLSELEFEELKGFMDLGFVFSEEDRDSSLASIIPGLNRLGKRDQEEDEEDEEEEEASVDK--
        AG+ESPE + K E  ++ K   R+R+RGR R R SES+SLSELEFEELKGFMDLGFVFSEED+ SSLASI+PGLNRLGKR+++ ++E EEEEE   ++  
Subjt:  AGSESPENNRK-ERPQQLKAASRKRIRGRGRRRASESKSLSELEFEELKGFMDLGFVFSEEDRDSSLASIIPGLNRLGKRDQEEDEEDEEEEEASVDK--

Query:  --SQISRPYLSEAWEAMERDEEK----KLPL-MNWRFPALSNEIDMKDNLKWWAHTVASTVR
           +ISRPYLSEAWEA+  +EEK    K PL M WRFP  SN+IDMKDNLKWWAH VASTVR
Subjt:  --SQISRPYLSEAWEAMERDEEK----KLPL-MNWRFPALSNEIDMKDNLKWWAHTVASTVR

A0A5A7UPL7 DUF1685 domain-containing protein1.3e-7366.41Show/hide
Query:  MDVEQVLNLFDSFWFERRIFNSDHPFSSTSSQNPELDNPEKEPLNLPAEAAAFPRVTSIRTRSMSEDLSSKLTFISTSHSPDSVLFSPKLQTILSSKDIA
        MD EQ+LNLFDSFWFER +FN  HPF   S+  P++ +P+    +LP E+   PR   + TRS+SEDLSSKL+F+S+S+SPDSVLFSPKLQTI SSKDIA
Subjt:  MDVEQVLNLFDSFWFERRIFNSDHPFSSTSSQNPELDNPEKEPLNLPAEAAAFPRVTSIRTRSMSEDLSSKLTFISTSHSPDSVLFSPKLQTILSSKDIA

Query:  GSESPENNRK-ERPQQLKAASRKRIRGRGRRRASESKSLSELEFEELKGFMDLGFVFSEEDRDSSLASIIPGLNRLGKRDQEEDEEDEEEEEASVDK--S
        G+ESPE +RK E  ++ K   R+R RGR R R SES+SLSELEFEELKGFMDLGFVFSEED+ SSLASI+PGLNRLGK++++E +E+EEEEE    K   
Subjt:  GSESPENNRK-ERPQQLKAASRKRIRGRGRRRASESKSLSELEFEELKGFMDLGFVFSEEDRDSSLASIIPGLNRLGKRDQEEDEEDEEEEEASVDK--S

Query:  QISRPYLSEAWEAMERDEEK----KLPL-MNWRFPALSNEIDMKDNLKWWAHTVASTVR
        +ISRPYLSEAWEAME +EEK    K PL M WRFP  SN+IDMKDNLKWWAH VASTVR
Subjt:  QISRPYLSEAWEAMERDEEK----KLPL-MNWRFPALSNEIDMKDNLKWWAHTVASTVR

A0A6J1CEZ9 uncharacterized protein LOC1110109017.6e-7768.75Show/hide
Query:  MDVEQVLNLFDSFWFERRIFNSDHPFSSTSSQNPELDNPEKEPLNLPAEA-AAFPRVTSIRTRSMSEDLSSKLTFISTSHSPDSVLFSPKLQTILSSKDI
        MDVEQ+LNLFDS WFER IFN           NPE +N EK   N P  +    PR   I TRS+SEDLSSKL+F+S S+SPDS+LFSPKLQTI SSKDI
Subjt:  MDVEQVLNLFDSFWFERRIFNSDHPFSSTSSQNPELDNPEKEPLNLPAEA-AAFPRVTSIRTRSMSEDLSSKLTFISTSHSPDSVLFSPKLQTILSSKDI

Query:  AGSESPENNRKERPQQLKAASRKRIRGRG---RRRASESKSLSELEFEELKGFMDLGFVFSEEDRDSSLASIIPGLNRLGKRDQEEDEEDEEEEEASVDK
        A +ESPENNRKE   +L+   RKR +GRG   RRR  ESKSLSELEFEELKGFMDLGFVFSEED+DSSLASIIPGLNRL K++ EE+EE EE E      
Subjt:  AGSESPENNRKERPQQLKAASRKRIRGRG---RRRASESKSLSELEFEELKGFMDLGFVFSEEDRDSSLASIIPGLNRLGKRDQEEDEEDEEEEEASVDK

Query:  SQISRPYLSEAWEAMERDEE-KKLPLMNWRFPALSNEIDMKDNLKWWAHTVASTVR
          ISRPYLSEAWEAME++ E KK PLM W FPAL NEIDMKDNLKWWAHTVASTVR
Subjt:  SQISRPYLSEAWEAMERDEE-KKLPLMNWRFPALSNEIDMKDNLKWWAHTVASTVR

A0A6J1EH41 uncharacterized protein LOC1114340584.9e-6865.22Show/hide
Query:  MDVEQVLNLFDSFWFERRIFNSDHPFSSTSSQNPELDNPEKEPL-NLPAEAAAFPRVTSIRTRSMSEDLSSKLTFISTSHSPDSVLFSPKLQTILSSKDI
        MDVEQVL+LFDS WFER IFN  HPF  T+ QNP  +N +++PL N P E    PR+     RS+SEDLSSKLTF+S S SPDSVLFSPKLQTILSSK+I
Subjt:  MDVEQVLNLFDSFWFERRIFNSDHPFSSTSSQNPELDNPEKEPL-NLPAEAAAFPRVTSIRTRSMSEDLSSKLTFISTSHSPDSVLFSPKLQTILSSKDI

Query:  AGSESPENNRKERPQQLKAASRKRIRGRGRRRASESKSLSELEFEELKGFMDLGFVFSEEDRDSSLASIIPGLNRLGKRDQEEDEEDEEEEEASVDKSQI
        AG E PE +R+   +Q K   R+RI GR   R SES+SLSELEFEE+KGFMDLGFVFSE D+ SSLA I+PGLNRLGKRD+EE+E              I
Subjt:  AGSESPENNRKERPQQLKAASRKRIRGRGRRRASESKSLSELEFEELKGFMDLGFVFSEEDRDSSLASIIPGLNRLGKRDQEEDEEDEEEEEASVDKSQI

Query:  SRPYLSEAWEAMERDEE-KKLPLMNWRFPALSNEIDMKDNLKWWAHTVASTVR
        SRPYLSEAW AME +EE KK  +M WR PA  NEIDMKDNLKWWAH VASTVR
Subjt:  SRPYLSEAWEAMERDEE-KKLPLMNWRFPALSNEIDMKDNLKWWAHTVASTVR

A0A6J1KR35 uncharacterized protein LOC1114957103.5e-7469.17Show/hide
Query:  MDVEQVLNLFDSFWFERRIFNSDHPFSSTSSQNPELDNPEKEPL-NLPAEAAAFPRVTSIRTRSMSEDLSSKLTFISTSHSPDSVLFSPKLQTILSSKDI
        MDVEQVLNLFDSFWFER IFN  HPF  T+ QNP  +N +++PL N P E    PR+     RS+SEDLSSKLTF+S S SPDSVLFSPKLQTILSSKDI
Subjt:  MDVEQVLNLFDSFWFERRIFNSDHPFSSTSSQNPELDNPEKEPL-NLPAEAAAFPRVTSIRTRSMSEDLSSKLTFISTSHSPDSVLFSPKLQTILSSKDI

Query:  AGSESPENNRKERPQQLKAASRKRIRGRGRRRASESKSLSELEFEELKGFMDLGFVFSEEDRDSSLASIIPGLNRLGKRDQEEDEEDEEEEEASVDKSQI
        AG E PE +R+   +Q K   R+RI GR   R SES+SLSELEFEELKGFMDLGFVFSE D+ SSLA I+PGLNRLGKRD+EE+EE+EEEEE       I
Subjt:  AGSESPENNRKERPQQLKAASRKRIRGRGRRRASESKSLSELEFEELKGFMDLGFVFSEEDRDSSLASIIPGLNRLGKRDQEEDEEDEEEEEASVDKSQI

Query:  SRPYLSEAWEAMERDEE-KKLPLMNWRFPALSNEIDMKDNLKWWAHTVASTVR
        SRPYLSEAW AME++EE KK  +M WR PA  NEIDMKDNLKWWAH VASTVR
Subjt:  SRPYLSEAWEAMERDEE-KKLPLMNWRFPALSNEIDMKDNLKWWAHTVASTVR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05870.1 Protein of unknown function (DUF1685)1.0e-0431.15Show/hide
Query:  SESKSLSELEFEELKGFMDLGFVFSEEDRDSSLASIIPGLNRLGKRDQE--EDEEDEEEEEASVDKSQISRPYLSEAWEAMERDEEKKLPLMNWRFPAL-
        + SKSL++ + E+L+G +DLGF FS  D    L + +P L       Q+  +D++++  E +SV+    S P ++              P+ NW+  +  
Subjt:  SESKSLSELEFEELKGFMDLGFVFSEEDRDSSLASIIPGLNRLGKRDQE--EDEEDEEEEEASVDKSQISRPYLSEAWEAMERDEEKKLPLMNWRFPAL-

Query:  SNEIDMKDNLKWWAHTVASTVR
         N  D+K  LK+WA  VA TV+
Subjt:  SNEIDMKDNLKWWAHTVASTVR

AT1G05870.2 Protein of unknown function (DUF1685)1.0e-0431.15Show/hide
Query:  SESKSLSELEFEELKGFMDLGFVFSEEDRDSSLASIIPGLNRLGKRDQE--EDEEDEEEEEASVDKSQISRPYLSEAWEAMERDEEKKLPLMNWRFPAL-
        + SKSL++ + E+L+G +DLGF FS  D    L + +P L       Q+  +D++++  E +SV+    S P ++              P+ NW+  +  
Subjt:  SESKSLSELEFEELKGFMDLGFVFSEEDRDSSLASIIPGLNRLGKRDQE--EDEEDEEEEEASVDKSQISRPYLSEAWEAMERDEEKKLPLMNWRFPAL-

Query:  SNEIDMKDNLKWWAHTVASTVR
         N  D+K  LK+WA  VA TV+
Subjt:  SNEIDMKDNLKWWAHTVASTVR

AT2G31560.1 Protein of unknown function (DUF1685)2.4e-0631.67Show/hide
Query:  SESKSLSELEFEELKGFMDLGFVFSEEDRDSSLASIIPGLNRLGKRDQEEDEEDEEEEEASVDKSQISRPYLSEAWEAMERDEEKKLPLMNWRFPALSNE
        + +KSL++ + EELKG +DLGF FS  D    L + +P L       Q+  ++ ++    S ++   S P  + A            P+ NW+  +  ++
Subjt:  SESKSLSELEFEELKGFMDLGFVFSEEDRDSSLASIIPGLNRLGKRDQEEDEEDEEEEEASVDKSQISRPYLSEAWEAMERDEEKKLPLMNWRFPALSNE

Query:  -IDMKDNLKWWAHTVASTVR
          D+K  LK+WA TVA TVR
Subjt:  -IDMKDNLKWWAHTVASTVR

AT2G31560.2 Protein of unknown function (DUF1685)2.4e-0631.67Show/hide
Query:  SESKSLSELEFEELKGFMDLGFVFSEEDRDSSLASIIPGLNRLGKRDQEEDEEDEEEEEASVDKSQISRPYLSEAWEAMERDEEKKLPLMNWRFPALSNE
        + +KSL++ + EELKG +DLGF FS  D    L + +P L       Q+  ++ ++    S ++   S P  + A            P+ NW+  +  ++
Subjt:  SESKSLSELEFEELKGFMDLGFVFSEEDRDSSLASIIPGLNRLGKRDQEEDEEDEEEEEASVDKSQISRPYLSEAWEAMERDEEKKLPLMNWRFPALSNE

Query:  -IDMKDNLKWWAHTVASTVR
          D+K  LK+WA TVA TVR
Subjt:  -IDMKDNLKWWAHTVASTVR

AT2G42760.1 unknown protein9.2e-3543.33Show/hide
Query:  EQVLNLFDSFWFERRIFNSDHPFSSTSSQNPELDNPEKEPLNLPAEAAA---FPRVTSIRTRSMSED----LSSKLTFISTSH-----SPDSVL----FS
        E++L LF+  W ER IF  D    +  S+       EKE L    E  A   FP V+ +  R+MS++     SSK +  S+S      SP SVL      
Subjt:  EQVLNLFDSFWFERRIFNSDHPFSSTSSQNPELDNPEKEPLNLPAEAAA---FPRVTSIRTRSMSED----LSSKLTFISTSH-----SPDSVL----FS

Query:  PKLQTILSSKDIAGSESPENNRKERPQQLKAASRKRIRGRGRRRASESKSLSELEFEELKGFMDLGFVFSEED-RDSSLASIIPGLNRLGKRDQ-EEDEE
         KLQTILS K++      E   +ER    K   RK+ + +   R  + KS+S+LE+EELKGFMDLGFVFSE+D +DS L SI+PGL RL K+D     EE
Subjt:  PKLQTILSSKDIAGSESPENNRKERPQQLKAASRKRIRGRGRRRASESKSLSELEFEELKGFMDLGFVFSEED-RDSSLASIIPGLNRLGKRDQ-EEDEE

Query:  DEEEEEASVDKSQISRPYLSEAWEAMERDEEKK--LPLMNWRF--PALSNEIDMKDNLKWWAHTVASTVR
        +EEEEE  +  ++ +RPYLSEAW+     + KK   P + WR   PA ++E+D+KDNL+ WAH VAST+R
Subjt:  DEEEEEASVDKSQISRPYLSEAWEAMERDEEKK--LPLMNWRF--PALSNEIDMKDNLKWWAHTVASTVR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGTTGAGCAAGTTCTTAATCTGTTTGATTCTTTCTGGTTCGAGCGTAGAATCTTCAACAGCGACCACCCCTTTTCATCAACGTCGTCCCAAAACCCAGAACTTGA
CAATCCAGAGAAAGAGCCATTAAACTTGCCGGCGGAAGCTGCTGCTTTTCCACGCGTTACATCGATTCGCACGAGGTCCATGAGCGAAGATCTGAGCTCCAAGTTGACCT
TTATCTCCACTTCCCATTCACCCGACTCGGTTCTCTTCTCTCCAAAGCTTCAAACAATCCTCTCCAGCAAAGACATAGCCGGATCAGAGTCACCGGAAAACAACCGGAAG
GAAAGGCCGCAACAACTCAAAGCAGCGTCGAGAAAGAGAATTAGAGGAAGAGGGAGGAGACGGGCGTCGGAAAGCAAGAGCCTATCGGAGCTGGAATTTGAAGAGCTAAA
AGGTTTCATGGATCTGGGCTTCGTTTTCTCGGAGGAAGACAGAGATTCGAGCTTGGCGTCGATCATTCCCGGTTTGAACAGGCTGGGCAAAAGAGATCAGGAAGAAGACG
AAGAAGACGAAGAAGAAGAAGAGGCCTCCGTCGACAAGTCGCAAATCTCGAGGCCTTACCTATCGGAAGCTTGGGAGGCGATGGAAAGAGACGAGGAAAAGAAGCTGCCA
TTGATGAACTGGAGGTTTCCTGCTTTGAGCAATGAAATTGATATGAAAGACAATCTCAAATGGTGGGCTCATACTGTTGCTTCTACTGTTAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGACGTTGAGCAAGTTCTTAATCTGTTTGATTCTTTCTGGTTCGAGCGTAGAATCTTCAACAGCGACCACCCCTTTTCATCAACGTCGTCCCAAAACCCAGAACTTGA
CAATCCAGAGAAAGAGCCATTAAACTTGCCGGCGGAAGCTGCTGCTTTTCCACGCGTTACATCGATTCGCACGAGGTCCATGAGCGAAGATCTGAGCTCCAAGTTGACCT
TTATCTCCACTTCCCATTCACCCGACTCGGTTCTCTTCTCTCCAAAGCTTCAAACAATCCTCTCCAGCAAAGACATAGCCGGATCAGAGTCACCGGAAAACAACCGGAAG
GAAAGGCCGCAACAACTCAAAGCAGCGTCGAGAAAGAGAATTAGAGGAAGAGGGAGGAGACGGGCGTCGGAAAGCAAGAGCCTATCGGAGCTGGAATTTGAAGAGCTAAA
AGGTTTCATGGATCTGGGCTTCGTTTTCTCGGAGGAAGACAGAGATTCGAGCTTGGCGTCGATCATTCCCGGTTTGAACAGGCTGGGCAAAAGAGATCAGGAAGAAGACG
AAGAAGACGAAGAAGAAGAAGAGGCCTCCGTCGACAAGTCGCAAATCTCGAGGCCTTACCTATCGGAAGCTTGGGAGGCGATGGAAAGAGACGAGGAAAAGAAGCTGCCA
TTGATGAACTGGAGGTTTCCTGCTTTGAGCAATGAAATTGATATGAAAGACAATCTCAAATGGTGGGCTCATACTGTTGCTTCTACTGTTAGATGA
Protein sequenceShow/hide protein sequence
MDVEQVLNLFDSFWFERRIFNSDHPFSSTSSQNPELDNPEKEPLNLPAEAAAFPRVTSIRTRSMSEDLSSKLTFISTSHSPDSVLFSPKLQTILSSKDIAGSESPENNRK
ERPQQLKAASRKRIRGRGRRRASESKSLSELEFEELKGFMDLGFVFSEEDRDSSLASIIPGLNRLGKRDQEEDEEDEEEEEASVDKSQISRPYLSEAWEAMERDEEKKLP
LMNWRFPALSNEIDMKDNLKWWAHTVASTVR