; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC09g1921 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC09g1921
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionProtein of unknown function (DUF1685)
Genome locationMC09:23882610..23884608
RNA-Seq ExpressionMC09g1921
SyntenyMC09g1921
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
MBA0717830.1 hypothetical protein [Gossypium laxum]2.76e-4446.43Show/hide
Query:  DDGCDDDEVV----NLCRLSVCTSSCMYGDNSAELYENGIGCGSMSQMSIESFEGDADDE----ELSNEEKNYSESGFYSLPATPPRIQ---RKSGNKDC
        DD  DD +V     NL RLS+C SS    ++  +  + G+   SMS++SIESF+ D D+E    ELS++     E   YSLPATPPR +   R +  KD 
Subjt:  DDGCDDDEVV----NLCRLSVCTSSCMYGDNSAELYENGIGCGSMSQMSIESFEGDADDE----ELSNEEKNYSESGFYSLPATPPRIQ---RKSGNKDC

Query:  MSENEAKKC----TRDVRKRR---RRRMEGAENCKSGGMGIKMGY------------SERQSGGVMLIARPKGGRRSMCMHLEEVKACRDLGFELEMPSR
         SENEA++     ++D+RKRR    R ++    CK  G      Y            S    GGV++I RPKGG+RS+CM +EEVKACRDLG ELEMP R
Subjt:  MSENEAKKC----TRDVRKRR---RRRMEGAENCKSGGMGIKMGY------------SERQSGGVMLIARPKGGRRSMCMHLEEVKACRDLGFELEMPSR

Query:  ISISPASALDINNGGNSHI-----SSPGDDPQHVKARLKRWAQAVALASTPR
        +S+S ++  + ++GGNS I     SSPGDDP+ VKARLK WAQAVALAST R
Subjt:  ISISPASALDINNGGNSHI-----SSPGDDPQHVKARLKRWAQAVALASTPR

XP_006444637.1 uncharacterized protein LOC18047012 [Citrus clementina]2.63e-4645.49Show/hide
Query:  THIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSCMYGDNSAELYENGIGCGS-MSQMSIESFEGDADDEELSNEEK----------NYSESGFY
        T + ++HD   D  DD  D      +  NL RLSVCTSS    ++  +  ++G G    MS++SIESF+GDAD EE S+E++          +  E+G+Y
Subjt:  THIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSCMYGDNSAELYENGIGCGS-MSQMSIESFEGDADDEELSNEEK----------NYSESGFY

Query:  SLPATPPR--IQRKSGN---------KDCMSENEAKK------CTRDVRKRRRRRMEGAENCKSGGMGIKMGYSERQSG---GVMLIARPKGGRRSMCMH
        S+P TPPR   +R+S +         KD  SENEA+K        R++R+RR        N K        G S++  G   GV++I RPKGGRRS+CM 
Subjt:  SLPATPPR--IQRKSGN---------KDCMSENEAKK------CTRDVRKRRRRRMEGAENCKSGGMGIKMGYSERQSG---GVMLIARPKGGRRSMCMH

Query:  LEEVKACRDLGFELE------MPSRISISPASALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR
        LEE+KACRDLGFELE      MP R+S+S  S LD ++GGNS      ISSPGDDP+ VKARLK WAQAVALAST R
Subjt:  LEEVKACRDLGFELE------MPSRISISPASALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR

XP_006492440.1 uncharacterized protein LOC102609729 [Citrus sinensis]2.63e-4645.49Show/hide
Query:  THIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSCMYGDNSAELYENGIGCGS-MSQMSIESFEGDADDEELSNEEK----------NYSESGFY
        T + ++HD   D  DD  D      +  NL RLSVCTSS    ++  +  ++G G    MS++SIESF+GDAD EE S+E++          +  E+G+Y
Subjt:  THIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSCMYGDNSAELYENGIGCGS-MSQMSIESFEGDADDEELSNEEK----------NYSESGFY

Query:  SLPATPPR--IQRKSGN---------KDCMSENEAKK------CTRDVRKRRRRRMEGAENCKSGGMGIKMGYSERQSG---GVMLIARPKGGRRSMCMH
        S+P TPPR   +R+S +         KD  SENEA+K        R++R+RR        N K        G S++  G   GV++I RPKGGRRS+CM 
Subjt:  SLPATPPR--IQRKSGN---------KDCMSENEAKK------CTRDVRKRRRRRMEGAENCKSGGMGIKMGYSERQSG---GVMLIARPKGGRRSMCMH

Query:  LEEVKACRDLGFELE------MPSRISISPASALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR
        LEE+KACRDLGFELE      MP R+S+S  S LD ++GGNS      ISSPGDDP+ VKARLK WAQAVALAST R
Subjt:  LEEVKACRDLGFELE------MPSRISISPASALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR

XP_021676111.1 uncharacterized protein LOC110661706 [Hevea brasiliensis]2.16e-4746.79Show/hide
Query:  LDYKDDH-MDDGCDDDE---VVNLCRLSVCTSSCMYGDNSAELYENGIGCGSM----SQMSIESFEGDADDEELSNEEKNYS----------ESGFYSLP
        L+ +DDH  DDG D D      NL RLSVCTSS MY  +  +  +    C  M    S++SIESF+ D   EELS+E++ +           E   YSLP
Subjt:  LDYKDDH-MDDGCDDDE---VVNLCRLSVCTSSCMYGDNSAELYENGIGCGSM----SQMSIESFEGDADDEELSNEEKNYS----------ESGFYSLP

Query:  ATPPRIQRKSG--------NKDCMSENEAKKCTRDVRKRRRRRM-------------EGAENCKSGGMGIKMGY-----SERQSGGVMLIARPKGGRRSM
        ATPPR + + G         K+  SENEA+K +     RRRRR+             +  E C  GGMG   GY      E + GGV+LI RPKGG RS+
Subjt:  ATPPRIQRKSG--------NKDCMSENEAKKCTRDVRKRRRRRM-------------EGAENCKSGGMGIKMGY-----SERQSGGVMLIARPKGGRRSM

Query:  CMHLEEVKACRDLGFELE------MPSRISISPASALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR
        CM L+EVKACRDLGFELE      MP R+S+S  S LD ++GGNS      ISSPGDDP+ VKARLK WAQAVALAS  R
Subjt:  CMHLEEVKACRDLGFELE------MPSRISISPASALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR

XP_022151790.1 uncharacterized protein LOC111019688 [Momordica charantia]5.13e-219100Show/hide
Query:  MYAFSLSLTLRADKLKSSRNKPNPSLPLPLTVYPLHNLPCAPNFHCLCPHLFLFSFVQKKTHIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSC
        MYAFSLSLTLRADKLKSSRNKPNPSLPLPLTVYPLHNLPCAPNFHCLCPHLFLFSFVQKKTHIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSC
Subjt:  MYAFSLSLTLRADKLKSSRNKPNPSLPLPLTVYPLHNLPCAPNFHCLCPHLFLFSFVQKKTHIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSC

Query:  MYGDNSAELYENGIGCGSMSQMSIESFEGDADDEELSNEEKNYSESGFYSLPATPPRIQRKSGNKDCMSENEAKKCTRDVRKRRRRRMEGAENCKSGGMG
        MYGDNSAELYENGIGCGSMSQMSIESFEGDADDEELSNEEKNYSESGFYSLPATPPRIQRKSGNKDCMSENEAKKCTRDVRKRRRRRMEGAENCKSGGMG
Subjt:  MYGDNSAELYENGIGCGSMSQMSIESFEGDADDEELSNEEKNYSESGFYSLPATPPRIQRKSGNKDCMSENEAKKCTRDVRKRRRRRMEGAENCKSGGMG

Query:  IKMGYSERQSGGVMLIARPKGGRRSMCMHLEEVKACRDLGFELEMPSRISISPASALDINNGGNSHISSPGDDPQHVKARLKRWAQAVALASTPRPTIY
        IKMGYSERQSGGVMLIARPKGGRRSMCMHLEEVKACRDLGFELEMPSRISISPASALDINNGGNSHISSPGDDPQHVKARLKRWAQAVALASTPRPTIY
Subjt:  IKMGYSERQSGGVMLIARPKGGRRSMCMHLEEVKACRDLGFELEMPSRISISPASALDINNGGNSHISSPGDDPQHVKARLKRWAQAVALASTPRPTIY

TrEMBL top hitse value%identityAlignment
A0A067H441 Uncharacterized protein1.28e-4645.49Show/hide
Query:  THIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSCMYGDNSAELYENGIGCGS-MSQMSIESFEGDADDEELSNEEK----------NYSESGFY
        T + ++HD   D  DD  D      +  NL RLSVCTSS    ++  +  ++G G    MS++SIESF+GDAD EE S+E++          +  E+G+Y
Subjt:  THIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSCMYGDNSAELYENGIGCGS-MSQMSIESFEGDADDEELSNEEK----------NYSESGFY

Query:  SLPATPPR--IQRKSGN---------KDCMSENEAKK------CTRDVRKRRRRRMEGAENCKSGGMGIKMGYSERQSG---GVMLIARPKGGRRSMCMH
        S+P TPPR   +R+S +         KD  SENEA+K        R++R+RR        N K        G S++  G   GV++I RPKGGRRS+CM 
Subjt:  SLPATPPR--IQRKSGN---------KDCMSENEAKK------CTRDVRKRRRRRMEGAENCKSGGMGIKMGYSERQSG---GVMLIARPKGGRRSMCMH

Query:  LEEVKACRDLGFELE------MPSRISISPASALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR
        LEE+KACRDLGFELE      MP R+S+S  S LD ++GGNS      ISSPGDDP+ VKARLK WAQAVALAST R
Subjt:  LEEVKACRDLGFELE------MPSRISISPASALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR

A0A6A6M7M5 Uncharacterized protein2.07e-4445.55Show/hide
Query:  LDYKDDH-MDDGCDDDEVV---NLCRLSVCTSSCMYGDNSAELYENGIGCGSM----SQMSIESFEGDADDEELSNEEKNYS----------ESGFYSLP
        L+ +DDH  DDG D+D      NL RLSVCTSS MY     +  +    C  M    S++S ESF+ D   EELSN+E+             E+G YSLP
Subjt:  LDYKDDH-MDDGCDDDEVV---NLCRLSVCTSSCMYGDNSAELYENGIGCGSM----SQMSIESFEGDADDEELSNEEKNYS----------ESGFYSLP

Query:  ATPPRIQRKSGN--------KDCMSENEAKK--CTRDVRKRRRR-----RMEGAENCKSG---GMGIKMGY---------SERQSGGVMLIARPKGGRRS
        ATPPR +   G         K+  SENEA+K     ++R RRRR      ++G +  K      M + MG           E + GG+++I RPKGGRRS
Subjt:  ATPPRIQRKSGN--------KDCMSENEAKK--CTRDVRKRRRR-----RMEGAENCKSG---GMGIKMGY---------SERQSGGVMLIARPKGGRRS

Query:  MCMHLEEVKACRDLGFELE------MPSRISISPASALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR
        +CM +EEVKACRDLGFELE      MP R+S+S  S LD ++GGNS      ISSPGDDP+ VKARLK WAQAVALAS  R
Subjt:  MCMHLEEVKACRDLGFELE------MPSRISISPASALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR

A0A6J1DC65 uncharacterized protein LOC1110196882.48e-219100Show/hide
Query:  MYAFSLSLTLRADKLKSSRNKPNPSLPLPLTVYPLHNLPCAPNFHCLCPHLFLFSFVQKKTHIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSC
        MYAFSLSLTLRADKLKSSRNKPNPSLPLPLTVYPLHNLPCAPNFHCLCPHLFLFSFVQKKTHIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSC
Subjt:  MYAFSLSLTLRADKLKSSRNKPNPSLPLPLTVYPLHNLPCAPNFHCLCPHLFLFSFVQKKTHIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSC

Query:  MYGDNSAELYENGIGCGSMSQMSIESFEGDADDEELSNEEKNYSESGFYSLPATPPRIQRKSGNKDCMSENEAKKCTRDVRKRRRRRMEGAENCKSGGMG
        MYGDNSAELYENGIGCGSMSQMSIESFEGDADDEELSNEEKNYSESGFYSLPATPPRIQRKSGNKDCMSENEAKKCTRDVRKRRRRRMEGAENCKSGGMG
Subjt:  MYGDNSAELYENGIGCGSMSQMSIESFEGDADDEELSNEEKNYSESGFYSLPATPPRIQRKSGNKDCMSENEAKKCTRDVRKRRRRRMEGAENCKSGGMG

Query:  IKMGYSERQSGGVMLIARPKGGRRSMCMHLEEVKACRDLGFELEMPSRISISPASALDINNGGNSHISSPGDDPQHVKARLKRWAQAVALASTPRPTIY
        IKMGYSERQSGGVMLIARPKGGRRSMCMHLEEVKACRDLGFELEMPSRISISPASALDINNGGNSHISSPGDDPQHVKARLKRWAQAVALASTPRPTIY
Subjt:  IKMGYSERQSGGVMLIARPKGGRRSMCMHLEEVKACRDLGFELEMPSRISISPASALDINNGGNSHISSPGDDPQHVKARLKRWAQAVALASTPRPTIY

A0A7J9A1M9 Uncharacterized protein1.34e-4446.43Show/hide
Query:  DDGCDDDEVV----NLCRLSVCTSSCMYGDNSAELYENGIGCGSMSQMSIESFEGDADDE----ELSNEEKNYSESGFYSLPATPPRIQ---RKSGNKDC
        DD  DD +V     NL RLS+C SS    ++  +  + G+   SMS++SIESF+ D D+E    ELS++     E   YSLPATPPR +   R +  KD 
Subjt:  DDGCDDDEVV----NLCRLSVCTSSCMYGDNSAELYENGIGCGSMSQMSIESFEGDADDE----ELSNEEKNYSESGFYSLPATPPRIQ---RKSGNKDC

Query:  MSENEAKKC----TRDVRKRR---RRRMEGAENCKSGGMGIKMGY------------SERQSGGVMLIARPKGGRRSMCMHLEEVKACRDLGFELEMPSR
         SENEA++     ++D+RKRR    R ++    CK  G      Y            S    GGV++I RPKGG+RS+CM +EEVKACRDLG ELEMP R
Subjt:  MSENEAKKC----TRDVRKRR---RRRMEGAENCKSGGMGIKMGY------------SERQSGGVMLIARPKGGRRSMCMHLEEVKACRDLGFELEMPSR

Query:  ISISPASALDINNGGNSHI-----SSPGDDPQHVKARLKRWAQAVALASTPR
        +S+S ++  + ++GGNS I     SSPGDDP+ VKARLK WAQAVALAST R
Subjt:  ISISPASALDINNGGNSHI-----SSPGDDPQHVKARLKRWAQAVALASTPR

V4TWG0 Uncharacterized protein1.28e-4645.49Show/hide
Query:  THIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSCMYGDNSAELYENGIGCGS-MSQMSIESFEGDADDEELSNEEK----------NYSESGFY
        T + ++HD   D  DD  D      +  NL RLSVCTSS    ++  +  ++G G    MS++SIESF+GDAD EE S+E++          +  E+G+Y
Subjt:  THIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSCMYGDNSAELYENGIGCGS-MSQMSIESFEGDADDEELSNEEK----------NYSESGFY

Query:  SLPATPPR--IQRKSGN---------KDCMSENEAKK------CTRDVRKRRRRRMEGAENCKSGGMGIKMGYSERQSG---GVMLIARPKGGRRSMCMH
        S+P TPPR   +R+S +         KD  SENEA+K        R++R+RR        N K        G S++  G   GV++I RPKGGRRS+CM 
Subjt:  SLPATPPR--IQRKSGN---------KDCMSENEAKK------CTRDVRKRRRRRMEGAENCKSGGMGIKMGYSERQSG---GVMLIARPKGGRRSMCMH

Query:  LEEVKACRDLGFELE------MPSRISISPASALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR
        LEE+KACRDLGFELE      MP R+S+S  S LD ++GGNS      ISSPGDDP+ VKARLK WAQAVALAST R
Subjt:  LEEVKACRDLGFELE------MPSRISISPASALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G31560.1 Protein of unknown function (DUF1685)1.2e-0436.56Show/hide
Query:  LEEVKACRDLGFEL---EMPSRISISPASALDINNG------------------------------GNSHISSPGDDPQHVKARLKRWAQAVA
        LEE+K C DLGF     E+P   +  PA  L  +                                 N  ISSPGDDP  VKARLK WAQ VA
Subjt:  LEEVKACRDLGFEL---EMPSRISISPASALDINNG------------------------------GNSHISSPGDDPQHVKARLKRWAQAVA

AT2G31560.2 Protein of unknown function (DUF1685)1.2e-0436.56Show/hide
Query:  LEEVKACRDLGFEL---EMPSRISISPASALDINNG------------------------------GNSHISSPGDDPQHVKARLKRWAQAVA
        LEE+K C DLGF     E+P   +  PA  L  +                                 N  ISSPGDDP  VKARLK WAQ VA
Subjt:  LEEVKACRDLGFEL---EMPSRISISPASALDINNG------------------------------GNSHISSPGDDPQHVKARLKRWAQAVA

AT2G46940.1 unknown protein9.3e-2639.59Show/hide
Query:  LDYKDDHMDDGCDDDEVVN-LCRLSVCTSSCMYGDNSAELYENGIGCGSMSQMSIESFEGDAD---DEELSNE----EKNYSESGFYSLPATPPRIQRK-
        LD + ++ DDG   +   N L RLS+CT+S    D   E + + +G    S +S+ESF  D     D E+S++    + +   SGFYSLP    R +RK 
Subjt:  LDYKDDHMDDGCDDDEVVN-LCRLSVCTSSCMYGDNSAELYENGIGCGSMSQMSIESFEGDAD---DEELSNE----EKNYSESGFYSLPATPPRIQRK-

Query:  --SGNKDCMSENEAKKCTRDVRKRRRRRMEGAENCKSGGMGIKMGYSERQSGG---VMLIARPKGG-RRSMCMHLEEVKACRDLGFELEMPSRISISPAS
          SG  +   E+E K          +RR+   ++ +    G      ER S G   + ++ + +GG ++SM M  EEVKACRDLGFEL++P R+S    S
Subjt:  --SGNKDCMSENEAKKCTRDVRKRRRRRMEGAENCKSGGMGIKMGYSERQSGG---VMLIARPKGG-RRSMCMHLEEVKACRDLGFELEMPSRISISPAS

Query:  ALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR
          + ++GGNS      ISSPGDDP+ VKARLK WAQAVALAS  R
Subjt:  ALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR

AT3G62070.1 unknown protein1.3e-2742.31Show/hide
Query:  VVNLCRLSVCTSSCMYGDNSAELYENGIGCGSMSQMSIESFEGDADDEELSNE---EKNYSES-------GFYSLPATPPRIQRK---SGNKDCMSENE-
        V +L RLS+CTSS    D +A   +      S    S+E+F+G   D E+S++   ++N  ES       GFYSLP TP R +RK   SG  D    N+ 
Subjt:  VVNLCRLSVCTSSCMYGDNSAELYENGIGCGSMSQMSIESFEGDADDEELSNE---EKNYSES-------GFYSLPATPPRIQRK---SGNKDCMSENE-

Query:  AKKCTRDVRKRRRRRMEGAE-NCKSGGMGIKMGYSERQSGGVMLIARPKGGRRSMCMHLEEVKACRDLGFELE--MPSRISISPASA---LDINNGGNS-
          KC+   R+RR ++ +G    C  GG             G+ ++ R +GG RS+ M L+EVKACRDLGFELE  +P RIS+S   +      ++GG+S 
Subjt:  AKKCTRDVRKRRRRRMEGAE-NCKSGGMGIKMGYSERQSGGVMLIARPKGGRRSMCMHLEEVKACRDLGFELE--MPSRISISPASA---LDINNGGNS-

Query:  ----HISSPGDDPQHVKARLKRWAQAVALASTPR
             ISSPGDDP+ VKARLK WAQAVALAS  R
Subjt:  ----HISSPGDDPQHVKARLKRWAQAVALASTPR

AT4G01670.1 unknown protein2.1e-3040.25Show/hide
Query:  DDDEVVNLCRLSVCTSSCMYGDNSAELYENGIGCGSMS-QMSIESFEGDADDEELSNEEKNYSESGFYSLPATPPRIQRK---------------SGNKD
        ++++  +L RLSVC++   Y  + A+      G  S S +  +    G+   EEL+  + +   +G  SLPATPPR +R+               SG+K 
Subjt:  DDDEVVNLCRLSVCTSSCMYGDNSAELYENGIGCGSMS-QMSIESFEGDADDEELSNEEKNYSESGFYSLPATPPRIQRK---------------SGNKD

Query:  CMSENEAKKCTRDVRKRRRRRME---GAENCKSGGMGIKMGYSERQSGGVMLIARPKGGRRSMCMHLEEVKACRDLGFELEMPSRISISPASALDINNGG
          SENE +K   + R+RRR + E     ++ +   +G +        GGV+++ RP GG R +CM LEEVKAC+DLGFELE P R+S S  S +D ++GG
Subjt:  CMSENEAKKCTRDVRKRRRRRME---GAENCKSGGMGIKMGYSERQSGGVMLIARPKGGRRSMCMHLEEVKACRDLGFELEMPSRISISPASALDINNGG

Query:  NS------HISSPGDDPQHVKARLKRWAQAVALAST
        NS       ISSPGDDP+ VKARLK WAQAVA  ST
Subjt:  NS------HISSPGDDPQHVKARLKRWAQAVALAST


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATGCTTTTAGCTTATCACTGACACTAAGAGCAGATAAATTGAAGTCCTCTAGAAATAAGCCCAACCCTTCTCTCCCTCTCCCTCTAACTGTCTATCCCCTTCACAA
TCTCCCATGTGCTCCTAATTTCCACTGCCTCTGCCCCCATCTCTTTCTTTTCAGCTTTGTACAAAAGAAAACACACATAATGGCCTTGCATGATCACCGACTTGATTACA
AAGATGATCATATGGATGATGGATGTGATGATGATGAAGTTGTCAACTTGTGTAGGCTATCCGTTTGTACCAGCAGTTGTATGTATGGTGACAATAGTGCCGAGCTGTAT
GAAAATGGGATTGGATGTGGCAGCATGTCTCAAATGTCCATAGAAAGTTTTGAAGGTGACGCGGATGACGAGGAGTTGTCCAATGAAGAGAAAAATTACTCTGAATCCGG
TTTTTACTCACTACCGGCTACACCTCCACGGATTCAGAGAAAGAGTGGTAATAAAGATTGCATGAGCGAAAATGAAGCTAAAAAGTGTACGAGGGACGTGAGGAAAAGGA
GAAGAAGGAGAATGGAAGGAGCCGAAAATTGCAAGTCGGGGGGAATGGGAATTAAGATGGGTTATAGTGAGAGGCAGAGCGGTGGAGTAATGTTGATAGCAAGGCCCAAG
GGTGGGAGGAGATCGATGTGCATGCACTTGGAGGAAGTGAAAGCATGCAGAGATCTTGGGTTTGAGCTGGAGATGCCCTCTCGTATTTCCATCTCACCTGCTTCAGCACT
CGATATCAATAATGGTGGAAATTCCCACATCTCTAGCCCTGGGGATGATCCACAGCACGTTAAGGCTCGGCTCAAGAGATGGGCGCAAGCGGTGGCTCTTGCTTCTACGC
CTCGCCCCACCATCTACTAA
mRNA sequenceShow/hide mRNA sequence
TTTGGCAAATGGCTATACTGCACTGTACCTGTACGCGGTTTCAGTTTATTTTTAGGAGTTAGGCCGACAAAATAGGCCACAATGTCATTTTACATGTTCCACGAAAAAAA
AAACTCCAATATTCATCATGTTTTACTTTGATAATTCAAAGGGACTCATACTGTATATATGTATGCTTTTAGCTTATCACTGACACTAAGAGCAGATAAATTGAAGTCCT
CTAGAAATAAGCCCAACCCTTCTCTCCCTCTCCCTCTAACTGTCTATCCCCTTCACAATCTCCCATGTGCTCCTAATTTCCACTGCCTCTGCCCCCATCTCTTTCTTTTC
AGCTTTGTACAAAAGAAAACACACATAATGGCCTTGCATGATCACCGACTTGATTACAAAGATGATCATATGGATGATGGATGTGATGATGATGAAGTTGTCAACTTGTG
TAGGCTATCCGTTTGTACCAGCAGTTGTATGTATGGTGACAATAGTGCCGAGCTGTATGAAAATGGGATTGGATGTGGCAGCATGTCTCAAATGTCCATAGAAAGTTTTG
AAGGTGACGCGGATGACGAGGAGTTGTCCAATGAAGAGAAAAATTACTCTGAATCCGGTTTTTACTCACTACCGGCTACACCTCCACGGATTCAGAGAAAGAGTGGTAAT
AAAGATTGCATGAGCGAAAATGAAGCTAAAAAGTGTACGAGGGACGTGAGGAAAAGGAGAAGAAGGAGAATGGAAGGAGCCGAAAATTGCAAGTCGGGGGGAATGGGAAT
TAAGATGGGTTATAGTGAGAGGCAGAGCGGTGGAGTAATGTTGATAGCAAGGCCCAAGGGTGGGAGGAGATCGATGTGCATGCACTTGGAGGAAGTGAAAGCATGCAGAG
ATCTTGGGTTTGAGCTGGAGATGCCCTCTCGTATTTCCATCTCACCTGCTTCAGCACTCGATATCAATAATGGTGGAAATTCCCACATCTCTAGCCCTGGGGATGATCCA
CAGCACGTTAAGGCTCGGCTCAAGAGATGGGCGCAAGCGGTGGCTCTTGCTTCTACGCCTCGCCCCACCATCTACTAACCGATTGCCTCCTCTGCTAAATCTTCACAACT
TTATGGGTTTTCACAACAGGAAGAATATTTTCAAGCTTCCTTCCCTACCTATTTTCTTCTTTTCAACGGATTTTTGGTTGTTCACTGATATATAGATGAGCTTTCTTGTA
AATATGTGGTGTAAAGTAGAGGCATTGATGCAATTCCAAAAGCCACTGACTTTCCTGGTTTGATTAGGAACAGTTCCCGTTTTACGCGGTGTATGCAATTTGTTTCGACT
TATTATGGAGTAAACAATAGTCCAAAATCAGATTTTGTTTTAGTTTCTCTTTTAGCATCTAGATCGAAGTTGAATGCACACTTTTCCAACAAGTTTCGCGTCCCTCACTG
GAGTAGTAAACAACACTAACACGTTTTCTTATCATTTGCTCCCAGAAGAAAGTTTCGTTTGTTTACAGGCAAACAGCGAACCGACGTTTGCCATCAACATTTTGCCACTA
ATCATAGATGGCACAAATACCAATGAGAATTCAAAACAATAATAATTTGAGCTTCACAACGGAGCAGCAGTCTTGTCGGAGTTTCCAATAATGGCAATTGGGACGATCAA
GGATGATATAAAAACATGGGATAACTATGCGTTGCTGTCACTTAAAGACCAGATTATGGCAAGTTTTATCACATGGGTATGCACAATCAATAGCTTTCACACTGGCTCTA
TCGAAGTACCAGTCTCCAACGGACACTGCAATCGCCTAAGAGCACAAAAGTAAGTCATGCAAAAAAATTTATCAGGGATTAAGGTAATAAATTGGCTAAATAAGAAAAAA
GTTTCATACAAAACATACAAAAAAGA
Protein sequenceShow/hide protein sequence
MYAFSLSLTLRADKLKSSRNKPNPSLPLPLTVYPLHNLPCAPNFHCLCPHLFLFSFVQKKTHIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSCMYGDNSAELY
ENGIGCGSMSQMSIESFEGDADDEELSNEEKNYSESGFYSLPATPPRIQRKSGNKDCMSENEAKKCTRDVRKRRRRRMEGAENCKSGGMGIKMGYSERQSGGVMLIARPK
GGRRSMCMHLEEVKACRDLGFELEMPSRISISPASALDINNGGNSHISSPGDDPQHVKARLKRWAQAVALASTPRPTIY