; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg036200 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg036200
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionDUF1685 domain-containing protein
Genome locationscaffold5:40889344..40891855
RNA-Seq ExpressionSpg036200
SyntenySpg036200
Gene Ontology termsNA
InterPro domainsIPR012881 - Protein of unknown function DUF1685


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057058.1 DUF1685 domain-containing protein [Cucumis melo var. makuwa]7.3e-8474.1Show/hide
Query:  MDVEQVLNLFDCFWFERQIFNKHSFPSNLQNPQPENPFQNSPPPAEPFVPRLRTRSISEDLSTKLSFMSNSNSPDSVLFSPKLQTILSSKEIAEAESPEN
        MD EQ+LNLFD FWFER +FNKH F SNLQ PQ ++P  +S P     +PRL TRSISEDLS+KLSFMS+SNSPDSVLFSPKLQTI SSK+IA AESPE 
Subjt:  MDVEQVLNLFDCFWFERQIFNKHSFPSNLQNPQPENPFQNSPPPAEPFVPRLRTRSISEDLSTKLSFMSNSNSPDSVLFSPKLQTILSSKEIAEAESPEN

Query:  GRS-GAGRRPKTELRKRIGGRKRRGSESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKRGEEEGEEEEEEEE-----VCGEISRPYLS
         R     RRPKTE R+R  GR+ R SES+SLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGK+ E+E +EEEEEEE     + GEISRPYLS
Subjt:  GRS-GAGRRPKTELRKRIGGRKRRGSESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKRGEEEGEEEEEEEE-----VCGEISRPYLS

Query:  EAWEAMEKEEE----LKKPLMMKMKWRFPANEIDMKDNLKWWAHTVASTVR
        EAWEAME+EEE     KKPLM  MKWRFP+N+IDMKDNLKWWAH VASTVR
Subjt:  EAWEAMEKEEE----LKKPLMMKMKWRFPANEIDMKDNLKWWAHTVASTVR

XP_022927119.1 uncharacterized protein LOC111434058 [Cucurbita moschata]4.7e-8373.88Show/hide
Query:  MDVEQVLNLFDCFWFERQIFNKHSFPSNLQNPQPEN----PFQNSPPPAEPFVPRLRTRSISEDLSTKLSFMSNSNSPDSVLFSPKLQTILSSKEIAEAE
        MDVEQVL+LFD  WFER+IFNKH FP+N QNP+PEN    P +NS PP EPFVPR+  RSISEDLS+KL+FMS+S+SPDSVLFSPKLQTILSSKEIA  E
Subjt:  MDVEQVLNLFDCFWFERQIFNKHSFPSNLQNPQPEN----PFQNSPPPAEPFVPRLRTRSISEDLSTKLSFMSNSNSPDSVLFSPKLQTILSSKEIAEAE

Query:  SPENGRSGAGRRPKTELRKRIGGRKRRGSESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKRGEEEGEEEEEEEEVCGEISRPYLSEA
         PE  R    R+ K + R+RIGGR  RGSES+SLSELEFEE+KGFMDLGFVFSE DK SSLA IVPGLNRLGKR       +EEEEE+ G ISRPYLSEA
Subjt:  SPENGRSGAGRRPKTELRKRIGGRKRRGSESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKRGEEEGEEEEEEEEVCGEISRPYLSEA

Query:  WEAMEKEEELKKPLMMKMKWRFPANEIDMKDNLKWWAHTVASTVR
        W AME+EEELKK L+  MKWR PANEIDMKDNLKWWAH VASTVR
Subjt:  WEAMEKEEELKKPLMMKMKWRFPANEIDMKDNLKWWAHTVASTVR

XP_023001638.1 uncharacterized protein LOC111495710 [Cucurbita maxima]2.9e-8876.73Show/hide
Query:  MDVEQVLNLFDCFWFERQIFNKHSFPSNLQNPQPEN----PFQNSPPPAEPFVPRLRTRSISEDLSTKLSFMSNSNSPDSVLFSPKLQTILSSKEIAEAE
        MDVEQVLNLFD FWFER+IFNKH FP+N QNP+PEN    P +NS PP EPFVPR+  RSISEDLS+KL+FMS+S+SPDSVLFSPKLQTILSSK+IA  E
Subjt:  MDVEQVLNLFDCFWFERQIFNKHSFPSNLQNPQPEN----PFQNSPPPAEPFVPRLRTRSISEDLSTKLSFMSNSNSPDSVLFSPKLQTILSSKEIAEAE

Query:  SPENGRSGAGRRPKTELRKRIGGRKRRGSESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKRGEEEGEEEEEEEEVCGEISRPYLSEA
         PE  R    R+ K + R+RIGGR  RGSES+SLSELEFEELKGFMDLGFVFSE DK SSLA IVPGLNRLGKR EEE EEEEEEEE+ G ISRPYLSEA
Subjt:  SPENGRSGAGRRPKTELRKRIGGRKRRGSESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKRGEEEGEEEEEEEEVCGEISRPYLSEA

Query:  WEAMEKEEELKKPLMMKMKWRFPANEIDMKDNLKWWAHTVASTVR
        W AME+EEE+KK L+  MKWR PANEIDMKDNLKWWAH VASTVR
Subjt:  WEAMEKEEELKKPLMMKMKWRFPANEIDMKDNLKWWAHTVASTVR

XP_031737252.1 uncharacterized protein LOC105434498 [Cucumis sativus]1.0e-8271.88Show/hide
Query:  MDVEQVLNLFDCFWFERQIFNKHSFPSNLQ--NPQPENPFQNSPPPAEPF-VPRLRTRSISEDLSTKLSFMSNSNSPDSVLFSPKLQTILSSKEIAEAES
        MD + +LNLFD FWF+RQ+ N H FPSN Q   PQ ++P    P P E F +PRLRTRSISEDLS+KLSFMSNSNSPDSVL SPKLQTI SSK+IA AES
Subjt:  MDVEQVLNLFDCFWFERQIFNKHSFPSNLQ--NPQPENPFQNSPPPAEPF-VPRLRTRSISEDLSTKLSFMSNSNSPDSVLFSPKLQTILSSKEIAEAES

Query:  PENG-RSGAGRRPKTELRKRIGGRKRRGSESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKR-------GEEEGEEEEEEEEVCGEIS
        PE   +    RRPKTE R+R+ GR+ R SES+SLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKR       GEEE EE+EEE ++ GEIS
Subjt:  PENG-RSGAGRRPKTELRKRIGGRKRRGSESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKR-------GEEEGEEEEEEEEVCGEIS

Query:  RPYLSEAWEAM----EKEEELKKPLMMKMKWRFPANEIDMKDNLKWWAHTVASTVR
        RPYLSEAWEA+    EKEE LK+PLM  MKWRFP+N+IDMKDNLKWWAH VASTVR
Subjt:  RPYLSEAWEAM----EKEEELKKPLMMKMKWRFPANEIDMKDNLKWWAHTVASTVR

XP_038895996.1 uncharacterized protein LOC120084174 [Benincasa hispida]1.8e-9079.1Show/hide
Query:  MDVEQVLNLFDCFWFERQIFNKHSFPSNLQNPQPENPFQNSPPPAEPF-VPRLRTRSISEDLSTKLSFMSNSNSPDSVLFSPKLQTILSSKEIAEAESPE
        MD EQ+LNLFD FWFE +IFNKH FPSN QNPQPEN  Q++  P EPF VPRLRTRSISEDLS+KLSFMSNSNSPDSVLFSPKLQTI SSK+IA AESPE
Subjt:  MDVEQVLNLFDCFWFERQIFNKHSFPSNLQNPQPENPFQNSPPPAEPF-VPRLRTRSISEDLSTKLSFMSNSNSPDSVLFSPKLQTILSSKEIAEAESPE

Query:  NGRS-GAGRRPKTELRKRIGGRKRRGSESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKRGEEEGEEEEEEEEVCGEISRPYLSEAWE
        N R  G  RRPKTE R+++ GR+ R SES+SLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGK  EEE  EEEEE ++ GEISRPYLSEAWE
Subjt:  NGRS-GAGRRPKTELRKRIGGRKRRGSESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKRGEEEGEEEEEEEEVCGEISRPYLSEAWE

Query:  AM-EKEEELKKPLMMKMKWRFPANEIDMKDNLKWWAHTVASTVR
        AM E+EEELK PL   MKW+FP+N+IDMKDNLKWWAH VASTVR
Subjt:  AM-EKEEELKKPLMMKMKWRFPANEIDMKDNLKWWAHTVASTVR

TrEMBL top hitse value%identityAlignment
A0A0A0LV26 Uncharacterized protein5.1e-8371.88Show/hide
Query:  MDVEQVLNLFDCFWFERQIFNKHSFPSNLQ--NPQPENPFQNSPPPAEPF-VPRLRTRSISEDLSTKLSFMSNSNSPDSVLFSPKLQTILSSKEIAEAES
        MD + +LNLFD FWF+RQ+ N H FPSN Q   PQ ++P    P P E F +PRLRTRSISEDLS+KLSFMSNSNSPDSVL SPKLQTI SSK+IA AES
Subjt:  MDVEQVLNLFDCFWFERQIFNKHSFPSNLQ--NPQPENPFQNSPPPAEPF-VPRLRTRSISEDLSTKLSFMSNSNSPDSVLFSPKLQTILSSKEIAEAES

Query:  PENG-RSGAGRRPKTELRKRIGGRKRRGSESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKR-------GEEEGEEEEEEEEVCGEIS
        PE   +    RRPKTE R+R+ GR+ R SES+SLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKR       GEEE EE+EEE ++ GEIS
Subjt:  PENG-RSGAGRRPKTELRKRIGGRKRRGSESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKR-------GEEEGEEEEEEEEVCGEIS

Query:  RPYLSEAWEAM----EKEEELKKPLMMKMKWRFPANEIDMKDNLKWWAHTVASTVR
        RPYLSEAWEA+    EKEE LK+PLM  MKWRFP+N+IDMKDNLKWWAH VASTVR
Subjt:  RPYLSEAWEAM----EKEEELKKPLMMKMKWRFPANEIDMKDNLKWWAHTVASTVR

A0A5A7UPL7 DUF1685 domain-containing protein3.5e-8474.1Show/hide
Query:  MDVEQVLNLFDCFWFERQIFNKHSFPSNLQNPQPENPFQNSPPPAEPFVPRLRTRSISEDLSTKLSFMSNSNSPDSVLFSPKLQTILSSKEIAEAESPEN
        MD EQ+LNLFD FWFER +FNKH F SNLQ PQ ++P  +S P     +PRL TRSISEDLS+KLSFMS+SNSPDSVLFSPKLQTI SSK+IA AESPE 
Subjt:  MDVEQVLNLFDCFWFERQIFNKHSFPSNLQNPQPENPFQNSPPPAEPFVPRLRTRSISEDLSTKLSFMSNSNSPDSVLFSPKLQTILSSKEIAEAESPEN

Query:  GRS-GAGRRPKTELRKRIGGRKRRGSESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKRGEEEGEEEEEEEE-----VCGEISRPYLS
         R     RRPKTE R+R  GR+ R SES+SLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGK+ E+E +EEEEEEE     + GEISRPYLS
Subjt:  GRS-GAGRRPKTELRKRIGGRKRRGSESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKRGEEEGEEEEEEEE-----VCGEISRPYLS

Query:  EAWEAMEKEEE----LKKPLMMKMKWRFPANEIDMKDNLKWWAHTVASTVR
        EAWEAME+EEE     KKPLM  MKWRFP+N+IDMKDNLKWWAH VASTVR
Subjt:  EAWEAMEKEEE----LKKPLMMKMKWRFPANEIDMKDNLKWWAHTVASTVR

A0A6J1CEZ9 uncharacterized protein LOC1110109012.4e-8072.06Show/hide
Query:  MDVEQVLNLFDCFWFERQIFNKHSFPSNLQNPQPENPFQNSPPPAEPFVPRLRTRSISEDLSTKLSFMSNSNSPDSVLFSPKLQTILSSKEIAEAESPEN
        MDVEQ+LNLFD  WFER IFN+   PSN +    E P +NSPPP+   +PR+ TRSISEDLS+KLSFMSNSNSPDS+LFSPKLQTI SSK+IA  ESPEN
Subjt:  MDVEQVLNLFDCFWFERQIFNKHSFPSNLQNPQPENPFQNSPPPAEPFVPRLRTRSISEDLSTKLSFMSNSNSPDSVLFSPKLQTILSSKEIAEAESPEN

Query:  GRSGAGRRPKTELRKRI----GGRKRRGSESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKRGEEEGEEEEEEEEVCGEISRPYLSEA
         R       +TE RKR      GR+RR  ESKSLSELEFEELKGFMDLGFVFSEEDK SSLASI+PGLNRL K+   E EEEEE EE  GEISRPYLSEA
Subjt:  GRSGAGRRPKTELRKRI----GGRKRRGSESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKRGEEEGEEEEEEEEVCGEISRPYLSEA

Query:  WEAMEKEEELKKPLMMKMKWRFPA--NEIDMKDNLKWWAHTVASTVR
        WEAMEKE ELKKP +  M+W FPA  NEIDMKDNLKWWAHTVASTVR
Subjt:  WEAMEKEEELKKPLMMKMKWRFPA--NEIDMKDNLKWWAHTVASTVR

A0A6J1EH41 uncharacterized protein LOC1114340582.3e-8373.88Show/hide
Query:  MDVEQVLNLFDCFWFERQIFNKHSFPSNLQNPQPEN----PFQNSPPPAEPFVPRLRTRSISEDLSTKLSFMSNSNSPDSVLFSPKLQTILSSKEIAEAE
        MDVEQVL+LFD  WFER+IFNKH FP+N QNP+PEN    P +NS PP EPFVPR+  RSISEDLS+KL+FMS+S+SPDSVLFSPKLQTILSSKEIA  E
Subjt:  MDVEQVLNLFDCFWFERQIFNKHSFPSNLQNPQPEN----PFQNSPPPAEPFVPRLRTRSISEDLSTKLSFMSNSNSPDSVLFSPKLQTILSSKEIAEAE

Query:  SPENGRSGAGRRPKTELRKRIGGRKRRGSESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKRGEEEGEEEEEEEEVCGEISRPYLSEA
         PE  R    R+ K + R+RIGGR  RGSES+SLSELEFEE+KGFMDLGFVFSE DK SSLA IVPGLNRLGKR       +EEEEE+ G ISRPYLSEA
Subjt:  SPENGRSGAGRRPKTELRKRIGGRKRRGSESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKRGEEEGEEEEEEEEVCGEISRPYLSEA

Query:  WEAMEKEEELKKPLMMKMKWRFPANEIDMKDNLKWWAHTVASTVR
        W AME+EEELKK L+  MKWR PANEIDMKDNLKWWAH VASTVR
Subjt:  WEAMEKEEELKKPLMMKMKWRFPANEIDMKDNLKWWAHTVASTVR

A0A6J1KR35 uncharacterized protein LOC1114957101.4e-8876.73Show/hide
Query:  MDVEQVLNLFDCFWFERQIFNKHSFPSNLQNPQPEN----PFQNSPPPAEPFVPRLRTRSISEDLSTKLSFMSNSNSPDSVLFSPKLQTILSSKEIAEAE
        MDVEQVLNLFD FWFER+IFNKH FP+N QNP+PEN    P +NS PP EPFVPR+  RSISEDLS+KL+FMS+S+SPDSVLFSPKLQTILSSK+IA  E
Subjt:  MDVEQVLNLFDCFWFERQIFNKHSFPSNLQNPQPEN----PFQNSPPPAEPFVPRLRTRSISEDLSTKLSFMSNSNSPDSVLFSPKLQTILSSKEIAEAE

Query:  SPENGRSGAGRRPKTELRKRIGGRKRRGSESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKRGEEEGEEEEEEEEVCGEISRPYLSEA
         PE  R    R+ K + R+RIGGR  RGSES+SLSELEFEELKGFMDLGFVFSE DK SSLA IVPGLNRLGKR EEE EEEEEEEE+ G ISRPYLSEA
Subjt:  SPENGRSGAGRRPKTELRKRIGGRKRRGSESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKRGEEEGEEEEEEEEVCGEISRPYLSEA

Query:  WEAMEKEEELKKPLMMKMKWRFPANEIDMKDNLKWWAHTVASTVR
        W AME+EEE+KK L+  MKWR PANEIDMKDNLKWWAH VASTVR
Subjt:  WEAMEKEEELKKPLMMKMKWRFPANEIDMKDNLKWWAHTVASTVR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G31560.1 Protein of unknown function (DUF1685)2.7e-0429.37Show/hide
Query:  SESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKRGEEEGEEEEEEEEVCGEISRPYLSEAWE----AMEKEEELKKPLMMK--MKWRF
        + +KSL++ + EELKG +DLGF FS  D+   L + +P L                  E+C  +S+ +L +  +    + E+++    P        W+ 
Subjt:  SESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKRGEEEGEEEEEEEEVCGEISRPYLSEAWE----AMEKEEELKKPLMMK--MKWRF

Query:  PA---NEIDMKDNLKWWAHTVASTVR
         +   +  D+K  LK+WA TVA TVR
Subjt:  PA---NEIDMKDNLKWWAHTVASTVR

AT2G31560.2 Protein of unknown function (DUF1685)2.7e-0429.37Show/hide
Query:  SESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKRGEEEGEEEEEEEEVCGEISRPYLSEAWE----AMEKEEELKKPLMMK--MKWRF
        + +KSL++ + EELKG +DLGF FS  D+   L + +P L                  E+C  +S+ +L +  +    + E+++    P        W+ 
Subjt:  SESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKRGEEEGEEEEEEEEVCGEISRPYLSEAWE----AMEKEEELKKPLMMK--MKWRF

Query:  PA---NEIDMKDNLKWWAHTVASTVR
         +   +  D+K  LK+WA TVA TVR
Subjt:  PA---NEIDMKDNLKWWAHTVASTVR

AT2G42760.1 unknown protein5.8e-3140.67Show/hide
Query:  EQVLNLFDCFWFERQIFNKHSFPSN------------LQNPQPENPFQNSPPPAEPFVPRLRTRSISEDLSTKLSFMSNSN-----SPDSVL----FSPK
        E++L LF+  W ER IF K     N            L+  + E   +N   P    V R  +       S+K S  S+S+     SP SVL       K
Subjt:  EQVLNLFDCFWFERQIFNKHSFPSN------------LQNPQPENPFQNSPPPAEPFVPRLRTRSISEDLSTKLSFMSNSN-----SPDSVL----FSPK

Query:  LQTILSSKEIAEAESPENGRSGAGRRPKTELRKRIGGRKRRGSESKSLSELEFEELKGFMDLGFVFSEED-KGSSLASIVPGLNRLGKRGE---EEGEEE
        LQTILS KE+      E  R  + +  + + +K+    + R  + KS+S+LE+EELKGFMDLGFVFSE+D K S L SI+PGL RL K+ +   +E EEE
Subjt:  LQTILSSKEIAEAESPENGRSGAGRRPKTELRKRIGGRKRRGSESKSLSELEFEELKGFMDLGFVFSEED-KGSSLASIVPGLNRLGKRGE---EEGEEE

Query:  EEEEEVCG-EISRPYLSEAWEAMEKEEELKKPLMMKMKWRFP----ANEIDMKDNLKWWAHTVASTVR
        EEE+++ G   +RPYLSEAW+     +  KK +  ++KWR P    A+E+D+KDNL+ WAH VAST+R
Subjt:  EEEEEVCG-EISRPYLSEAWEAMEKEEELKKPLMMKMKWRFP----ANEIDMKDNLKWWAHTVASTVR

AT2G43340.1 Protein of unknown function (DUF1685)3.6e-0429.03Show/hide
Query:  SKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKRGEEEGEEEEEEEEVCGEISRPYLSE------AWEAMEKEEELKKPLMMKMKWRFPA
        +KSL++ + EELKG +DLGF F+ E+    L + +P L                  E+C  +S+ ++ +      +    +K   L  P+     W+  +
Subjt:  SKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKRGEEEGEEEEEEEEVCGEISRPYLSE------AWEAMEKEEELKKPLMMKMKWRFPA

Query:  ---NEIDMKDNLKWWAHTVASTVR
           N  D+K  LK+WA  VA TVR
Subjt:  ---NEIDMKDNLKWWAHTVASTVR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAGGAGAAAGGTGAGGGATTACAAAAGTGGTGATGGTCGGCCTCGACCTCGGGAAGAGGCCGAGCACAGCAGCCTCGACCTCGGGAAGAGGTTAAGCACAACCCC
CACCGGGCGCTTCGCCCCTGGATCCCGTACTTGGAAAATACCTTTAAAGTTTGTGCTCGACCTCGGCAAGAGGGGCTCGGTTTTGGCCTCTACCAGGCGTGGGCTCTATA
GGCTCGCATGTAAGCGTGGGCTCTATAGGCTCGCATGTGAAGCGTGGGCACTAAGCAGTAAGCTTAGGCTCGCGAGTAGACGTGGGCTCTATAGGCTCGCATGGAGGCGT
GGGCACTATAGGCTCGTATGTGAAGCGTGGGCACTAAGTAGTAAGCTTAGGCTCGCGAGTAGACGTGGGCATTATAGGCTCGCATGTAAGCGTGGACTCTATAGGCTCGC
ATGTGAAGCGTGGGCACTAAGCAGTAAGCTTAGGCTCCCGCCATTGATGGACGTCGAGCAAGTTCTGAATCTCTTCGATTGCTTCTGGTTCGAGCGTCAAATCTTCAACA
AACACTCCTTTCCCTCAAACCTTCAAAACCCACAACCTGAAAATCCATTCCAAAACTCACCGCCGCCAGCGGAGCCTTTCGTTCCACGGCTTCGCACGAGGTCCATCAGC
GAAGATTTGAGCACCAAATTGAGCTTTATGTCCAATTCCAACTCGCCGGATTCGGTTCTGTTCTCACCGAAGCTTCAAACGATTCTTTCCAGCAAAGAAATCGCCGAAGC
AGAGTCGCCGGAGAACGGCCGGAGTGGAGCTGGGCGGCGGCCGAAAACAGAGTTGAGAAAGAGAATCGGAGGGAGAAAAAGAAGGGGATCGGAAAGTAAGAGCCTTTCGG
AGCTGGAATTTGAGGAGCTAAAAGGGTTTATGGATTTGGGATTTGTTTTCTCGGAGGAAGATAAAGGTTCGAGCTTGGCGTCGATCGTTCCCGGATTGAACAGGCTGGGG
AAAAGGGGAGAAGAAGAAGGAGAAGAAGAAGAAGAAGAAGAAGAAGTTTGTGGTGAAATTTCGAGGCCTTATCTGTCGGAAGCTTGGGAGGCTATGGAGAAAGAGGAGGA
ATTGAAGAAGCCATTGATGATGAAGATGAAGTGGAGGTTTCCTGCTAATGAGATTGATATGAAGGATAATCTCAAATGGTGGGCTCATACTGTTGCTTCTACTGTTAGAT
GA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAGGAGAAAGGTGAGGGATTACAAAAGTGGTGATGGTCGGCCTCGACCTCGGGAAGAGGCCGAGCACAGCAGCCTCGACCTCGGGAAGAGGTTAAGCACAACCCC
CACCGGGCGCTTCGCCCCTGGATCCCGTACTTGGAAAATACCTTTAAAGTTTGTGCTCGACCTCGGCAAGAGGGGCTCGGTTTTGGCCTCTACCAGGCGTGGGCTCTATA
GGCTCGCATGTAAGCGTGGGCTCTATAGGCTCGCATGTGAAGCGTGGGCACTAAGCAGTAAGCTTAGGCTCGCGAGTAGACGTGGGCTCTATAGGCTCGCATGGAGGCGT
GGGCACTATAGGCTCGTATGTGAAGCGTGGGCACTAAGTAGTAAGCTTAGGCTCGCGAGTAGACGTGGGCATTATAGGCTCGCATGTAAGCGTGGACTCTATAGGCTCGC
ATGTGAAGCGTGGGCACTAAGCAGTAAGCTTAGGCTCCCGCCATTGATGGACGTCGAGCAAGTTCTGAATCTCTTCGATTGCTTCTGGTTCGAGCGTCAAATCTTCAACA
AACACTCCTTTCCCTCAAACCTTCAAAACCCACAACCTGAAAATCCATTCCAAAACTCACCGCCGCCAGCGGAGCCTTTCGTTCCACGGCTTCGCACGAGGTCCATCAGC
GAAGATTTGAGCACCAAATTGAGCTTTATGTCCAATTCCAACTCGCCGGATTCGGTTCTGTTCTCACCGAAGCTTCAAACGATTCTTTCCAGCAAAGAAATCGCCGAAGC
AGAGTCGCCGGAGAACGGCCGGAGTGGAGCTGGGCGGCGGCCGAAAACAGAGTTGAGAAAGAGAATCGGAGGGAGAAAAAGAAGGGGATCGGAAAGTAAGAGCCTTTCGG
AGCTGGAATTTGAGGAGCTAAAAGGGTTTATGGATTTGGGATTTGTTTTCTCGGAGGAAGATAAAGGTTCGAGCTTGGCGTCGATCGTTCCCGGATTGAACAGGCTGGGG
AAAAGGGGAGAAGAAGAAGGAGAAGAAGAAGAAGAAGAAGAAGAAGTTTGTGGTGAAATTTCGAGGCCTTATCTGTCGGAAGCTTGGGAGGCTATGGAGAAAGAGGAGGA
ATTGAAGAAGCCATTGATGATGAAGATGAAGTGGAGGTTTCCTGCTAATGAGATTGATATGAAGGATAATCTCAAATGGTGGGCTCATACTGTTGCTTCTACTGTTAGAT
GA
Protein sequenceShow/hide protein sequence
MERRKVRDYKSGDGRPRPREEAEHSSLDLGKRLSTTPTGRFAPGSRTWKIPLKFVLDLGKRGSVLASTRRGLYRLACKRGLYRLACEAWALSSKLRLASRRGLYRLAWRR
GHYRLVCEAWALSSKLRLASRRGHYRLACKRGLYRLACEAWALSSKLRLPPLMDVEQVLNLFDCFWFERQIFNKHSFPSNLQNPQPENPFQNSPPPAEPFVPRLRTRSIS
EDLSTKLSFMSNSNSPDSVLFSPKLQTILSSKEIAEAESPENGRSGAGRRPKTELRKRIGGRKRRGSESKSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLG
KRGEEEGEEEEEEEEVCGEISRPYLSEAWEAMEKEEELKKPLMMKMKWRFPANEIDMKDNLKWWAHTVASTVR