; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g42010 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g42010
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionProtein of unknown function (DUF1685)
Genome locationchr9:31926713..31927715
RNA-Seq ExpressionMoc09g42010
SyntenyMoc09g42010
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_006444637.1 uncharacterized protein LOC18047012 [Citrus clementina]5.8e-3845.49Show/hide
Query:  THIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSCMYGDNSAELYENGIGCG-SMSQMSIESFEGDADDEELSNEEK----------NYSESGFY
        T + ++HD   D  DD  D      +  NL RLSVCTSS    ++  +  ++G G    MS++SIESF+GDA DEE S+E++          +  E+G+Y
Subjt:  THIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSCMYGDNSAELYENGIGCG-SMSQMSIESFEGDADDEELSNEEK----------NYSESGFY

Query:  SLPATPPR--IQRKSGN---------KDCMSENEA------KKCTRDVRKRRRRRMEGAENCKSGGMGIKMGYSERQSG---GVMLIARPKGGRRSMCMH
        S+P TPPR   +R+S +         KD  SENEA      +K  R++R+RR        N K        G S++  G   GV++I RPKGGRRS+CM 
Subjt:  SLPATPPR--IQRKSGN---------KDCMSENEA------KKCTRDVRKRRRRRMEGAENCKSGGMGIKMGYSERQSG---GVMLIARPKGGRRSMCMH

Query:  LEEVKACRDLGFE------LEMPSRISISPASALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR
        LEE+KACRDLGFE      LEMP R+S+S  S LD ++GGNS      ISSPGDDP+ VKARLK WAQAVALAST R
Subjt:  LEEVKACRDLGFE------LEMPSRISISPASALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR

XP_006492440.1 uncharacterized protein LOC102609729 [Citrus sinensis]5.8e-3845.49Show/hide
Query:  THIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSCMYGDNSAELYENGIGCG-SMSQMSIESFEGDADDEELSNEEK----------NYSESGFY
        T + ++HD   D  DD  D      +  NL RLSVCTSS    ++  +  ++G G    MS++SIESF+GDA DEE S+E++          +  E+G+Y
Subjt:  THIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSCMYGDNSAELYENGIGCG-SMSQMSIESFEGDADDEELSNEEK----------NYSESGFY

Query:  SLPATPPR--IQRKS---------GNKDCMSENEA------KKCTRDVRKRRRRRMEGAENCKSGGMGIKMGYSERQSG---GVMLIARPKGGRRSMCMH
        S+P TPPR   +R+S           KD  SENEA      +K  R++R+RR        N K        G S++  G   GV++I RPKGGRRS+CM 
Subjt:  SLPATPPR--IQRKS---------GNKDCMSENEA------KKCTRDVRKRRRRRMEGAENCKSGGMGIKMGYSERQSG---GVMLIARPKGGRRSMCMH

Query:  LEEVKACRDLGFE------LEMPSRISISPASALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR
        LEE+KACRDLGFE      LEMP R+S+S  S LD ++GGNS      ISSPGDDP+ VKARLK WAQAVALAST R
Subjt:  LEEVKACRDLGFE------LEMPSRISISPASALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR

XP_021279771.1 uncharacterized protein LOC110413326 [Herrania umbratica]2.5e-3645.05Show/hide
Query:  QKKTHIMALHDHRLDYKDDHMDDGCDDD-EVVNLCRLSVCTSSCMYGDNSAELYENGIGCGS-MSQMSIESFEGDADDEELSNEEK----------NYSE
        +  T + ++H+   D  D   D   D      NL RLS+CTSS MY +   +  ++    G  +S++SIESF+ D  DEE S+E++          +  E
Subjt:  QKKTHIMALHDHRLDYKDDHMDDGCDDD-EVVNLCRLSVCTSSCMYGDNSAELYENGIGCGS-MSQMSIESFEGDADDEELSNEEK----------NYSE

Query:  SGFYSLPATPPRIQRKSG---------NKDCMSENEAK-------KCTRDVRKRR--RRRMEGAEN-CKS---------GG----MGIKMGYSERQSGGV
         G YSLPATPPR + ++G          KD  SENEA+       K ++++RKRR  R R    E+ C S         GG     G   G SE  S G+
Subjt:  SGFYSLPATPPRIQRKSG---------NKDCMSENEAK-------KCTRDVRKRR--RRRMEGAEN-CKS---------GG----MGIKMGYSERQSGGV

Query:  MLIARPKGGRRSMCMHLEEVKACRDLGFE------LEMPSRISISPASALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR
        ++I RPKGGRRS+CM LEEVKACRDLGFE      LEMPSR+S+S  S LD ++GGNS     HISSPGDDP+ VKARLK WAQAVALAST R
Subjt:  MLIARPKGGRRSMCMHLEEVKACRDLGFE------LEMPSRISISPASALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR

XP_021676111.1 uncharacterized protein LOC110661706 [Hevea brasiliensis]1.5e-3846.74Show/hide
Query:  LDYKDDH-MDDGCDDDE---VVNLCRLSVCTSSCMYGDNSAELYENGIGCGSMSQMSIESFEGDADDEELSNEEKNY----------SESGFYSLPATPP
        L+ +DDH  DDG D D      NL RLSVCTSS    D+      +G+    +S++SIESF+    DEELS+E++ +           E   YSLPATPP
Subjt:  LDYKDDH-MDDGCDDDE---VVNLCRLSVCTSSCMYGDNSAELYENGIGCGSMSQMSIESFEGDADDEELSNEEKNY----------SESGFYSLPATPP

Query:  RIQRKS--------GNKDCMSENEAKKCTRDVRKRRRRRM-------------EGAENCKSGGMGIKMGY-----SERQSGGVMLIARPKGGRRSMCMHL
        R + +         G K+  SENEA+K +     RRRRR+             +  E C  GGMG   GY      E + GGV+LI RPKGG RS+CM L
Subjt:  RIQRKS--------GNKDCMSENEAKKCTRDVRKRRRRRM-------------EGAENCKSGGMGIKMGY-----SERQSGGVMLIARPKGGRRSMCMHL

Query:  EEVKACRDLGFE------LEMPSRISISPASALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR
        +EVKACRDLGFE      LEMP R+S+S  S LD ++GGNS      ISSPGDDP+ VKARLK WAQAVALAS  R
Subjt:  EEVKACRDLGFE------LEMPSRISISPASALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR

XP_022151790.1 uncharacterized protein LOC111019688 [Momordica charantia]3.8e-170100Show/hide
Query:  MYAFSLSLTLRADKLKSSRNKPNPSLPLPLTVYPLHNLPCAPNFHCLCPHLFLFSFVQKKTHIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSC
        MYAFSLSLTLRADKLKSSRNKPNPSLPLPLTVYPLHNLPCAPNFHCLCPHLFLFSFVQKKTHIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSC
Subjt:  MYAFSLSLTLRADKLKSSRNKPNPSLPLPLTVYPLHNLPCAPNFHCLCPHLFLFSFVQKKTHIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSC

Query:  MYGDNSAELYENGIGCGSMSQMSIESFEGDADDEELSNEEKNYSESGFYSLPATPPRIQRKSGNKDCMSENEAKKCTRDVRKRRRRRMEGAENCKSGGMG
        MYGDNSAELYENGIGCGSMSQMSIESFEGDADDEELSNEEKNYSESGFYSLPATPPRIQRKSGNKDCMSENEAKKCTRDVRKRRRRRMEGAENCKSGGMG
Subjt:  MYGDNSAELYENGIGCGSMSQMSIESFEGDADDEELSNEEKNYSESGFYSLPATPPRIQRKSGNKDCMSENEAKKCTRDVRKRRRRRMEGAENCKSGGMG

Query:  IKMGYSERQSGGVMLIARPKGGRRSMCMHLEEVKACRDLGFELEMPSRISISPASALDINNGGNSHISSPGDDPQHVKARLKRWAQAVALASTPRPTIY
        IKMGYSERQSGGVMLIARPKGGRRSMCMHLEEVKACRDLGFELEMPSRISISPASALDINNGGNSHISSPGDDPQHVKARLKRWAQAVALASTPRPTIY
Subjt:  IKMGYSERQSGGVMLIARPKGGRRSMCMHLEEVKACRDLGFELEMPSRISISPASALDINNGGNSHISSPGDDPQHVKARLKRWAQAVALASTPRPTIY

TrEMBL top hitse value%identityAlignment
A0A067H441 Uncharacterized protein2.8e-3845.49Show/hide
Query:  THIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSCMYGDNSAELYENGIGCG-SMSQMSIESFEGDADDEELSNEEK----------NYSESGFY
        T + ++HD   D  DD  D      +  NL RLSVCTSS    ++  +  ++G G    MS++SIESF+GDA DEE S+E++          +  E+G+Y
Subjt:  THIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSCMYGDNSAELYENGIGCG-SMSQMSIESFEGDADDEELSNEEK----------NYSESGFY

Query:  SLPATPPR--IQRKS---------GNKDCMSENEA------KKCTRDVRKRRRRRMEGAENCKSGGMGIKMGYSERQSG---GVMLIARPKGGRRSMCMH
        S+P TPPR   +R+S           KD  SENEA      +K  R++R+RR        N K        G S++  G   GV++I RPKGGRRS+CM 
Subjt:  SLPATPPR--IQRKS---------GNKDCMSENEA------KKCTRDVRKRRRRRMEGAENCKSGGMGIKMGYSERQSG---GVMLIARPKGGRRSMCMH

Query:  LEEVKACRDLGFE------LEMPSRISISPASALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR
        LEE+KACRDLGFE      LEMP R+S+S  S LD ++GGNS      ISSPGDDP+ VKARLK WAQAVALAST R
Subjt:  LEEVKACRDLGFE------LEMPSRISISPASALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR

A0A6A6M7M5 Uncharacterized protein2.0e-3645.49Show/hide
Query:  LDYKDDH-MDDGCDDDEVV---NLCRLSVCTSSCMYGDNSAELYENGIGCGSMSQMSIESFEGDADDEELSNEEKNY----------SESGFYSLPATPP
        L+ +DDH  DDG D+D      NL RLSVCTSS MY     +  +       +S++S ESF+    DEELSN+E+             E+G YSLPATPP
Subjt:  LDYKDDH-MDDGCDDDEVV---NLCRLSVCTSSCMYGDNSAELYENGIGCGSMSQMSIESFEGDADDEELSNEEKNY----------SESGFYSLPATPP

Query:  RIQRKSGN--------KDCMSENEAK--KCTRDVRKRRRR-----RMEGAENCK---SGGMGIKMG---------YSERQSGGVMLIARPKGGRRSMCMH
        R +   G         K+  SENEA+  K   ++R RRRR      ++G +  K      M + MG           E + GG+++I RPKGGRRS+CM 
Subjt:  RIQRKSGN--------KDCMSENEAK--KCTRDVRKRRRR-----RMEGAENCK---SGGMGIKMG---------YSERQSGGVMLIARPKGGRRSMCMH

Query:  LEEVKACRDLGFE------LEMPSRISISPASALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR
        +EEVKACRDLGFE      LEMP R+S+S  S LD ++GGNS      ISSPGDDP+ VKARLK WAQAVALAS  R
Subjt:  LEEVKACRDLGFE------LEMPSRISISPASALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR

A0A6J0ZYX2 uncharacterized protein LOC1104133261.2e-3645.05Show/hide
Query:  QKKTHIMALHDHRLDYKDDHMDDGCDDD-EVVNLCRLSVCTSSCMYGDNSAELYENGIGCGS-MSQMSIESFEGDADDEELSNEEK----------NYSE
        +  T + ++H+   D  D   D   D      NL RLS+CTSS MY +   +  ++    G  +S++SIESF+ D  DEE S+E++          +  E
Subjt:  QKKTHIMALHDHRLDYKDDHMDDGCDDD-EVVNLCRLSVCTSSCMYGDNSAELYENGIGCGS-MSQMSIESFEGDADDEELSNEEK----------NYSE

Query:  SGFYSLPATPPRIQRKSG---------NKDCMSENEAK-------KCTRDVRKRR--RRRMEGAEN-CKS---------GG----MGIKMGYSERQSGGV
         G YSLPATPPR + ++G          KD  SENEA+       K ++++RKRR  R R    E+ C S         GG     G   G SE  S G+
Subjt:  SGFYSLPATPPRIQRKSG---------NKDCMSENEAK-------KCTRDVRKRR--RRRMEGAEN-CKS---------GG----MGIKMGYSERQSGGV

Query:  MLIARPKGGRRSMCMHLEEVKACRDLGFE------LEMPSRISISPASALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR
        ++I RPKGGRRS+CM LEEVKACRDLGFE      LEMPSR+S+S  S LD ++GGNS     HISSPGDDP+ VKARLK WAQAVALAST R
Subjt:  MLIARPKGGRRSMCMHLEEVKACRDLGFE------LEMPSRISISPASALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR

A0A6J1DC65 uncharacterized protein LOC1110196881.8e-170100Show/hide
Query:  MYAFSLSLTLRADKLKSSRNKPNPSLPLPLTVYPLHNLPCAPNFHCLCPHLFLFSFVQKKTHIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSC
        MYAFSLSLTLRADKLKSSRNKPNPSLPLPLTVYPLHNLPCAPNFHCLCPHLFLFSFVQKKTHIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSC
Subjt:  MYAFSLSLTLRADKLKSSRNKPNPSLPLPLTVYPLHNLPCAPNFHCLCPHLFLFSFVQKKTHIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSC

Query:  MYGDNSAELYENGIGCGSMSQMSIESFEGDADDEELSNEEKNYSESGFYSLPATPPRIQRKSGNKDCMSENEAKKCTRDVRKRRRRRMEGAENCKSGGMG
        MYGDNSAELYENGIGCGSMSQMSIESFEGDADDEELSNEEKNYSESGFYSLPATPPRIQRKSGNKDCMSENEAKKCTRDVRKRRRRRMEGAENCKSGGMG
Subjt:  MYGDNSAELYENGIGCGSMSQMSIESFEGDADDEELSNEEKNYSESGFYSLPATPPRIQRKSGNKDCMSENEAKKCTRDVRKRRRRRMEGAENCKSGGMG

Query:  IKMGYSERQSGGVMLIARPKGGRRSMCMHLEEVKACRDLGFELEMPSRISISPASALDINNGGNSHISSPGDDPQHVKARLKRWAQAVALASTPRPTIY
        IKMGYSERQSGGVMLIARPKGGRRSMCMHLEEVKACRDLGFELEMPSRISISPASALDINNGGNSHISSPGDDPQHVKARLKRWAQAVALASTPRPTIY
Subjt:  IKMGYSERQSGGVMLIARPKGGRRSMCMHLEEVKACRDLGFELEMPSRISISPASALDINNGGNSHISSPGDDPQHVKARLKRWAQAVALASTPRPTIY

V4TWG0 Uncharacterized protein2.8e-3845.49Show/hide
Query:  THIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSCMYGDNSAELYENGIGCG-SMSQMSIESFEGDADDEELSNEEK----------NYSESGFY
        T + ++HD   D  DD  D      +  NL RLSVCTSS    ++  +  ++G G    MS++SIESF+GDA DEE S+E++          +  E+G+Y
Subjt:  THIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSCMYGDNSAELYENGIGCG-SMSQMSIESFEGDADDEELSNEEK----------NYSESGFY

Query:  SLPATPPR--IQRKSGN---------KDCMSENEA------KKCTRDVRKRRRRRMEGAENCKSGGMGIKMGYSERQSG---GVMLIARPKGGRRSMCMH
        S+P TPPR   +R+S +         KD  SENEA      +K  R++R+RR        N K        G S++  G   GV++I RPKGGRRS+CM 
Subjt:  SLPATPPR--IQRKSGN---------KDCMSENEA------KKCTRDVRKRRRRRMEGAENCKSGGMGIKMGYSERQSG---GVMLIARPKGGRRSMCMH

Query:  LEEVKACRDLGFE------LEMPSRISISPASALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR
        LEE+KACRDLGFE      LEMP R+S+S  S LD ++GGNS      ISSPGDDP+ VKARLK WAQAVALAST R
Subjt:  LEEVKACRDLGFE------LEMPSRISISPASALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G31560.1 Protein of unknown function (DUF1685)1.2e-0436.56Show/hide
Query:  LEEVKACRDLGFEL---EMPSRISISPASALDINNG------------------------------GNSHISSPGDDPQHVKARLKRWAQAVA
        LEE+K C DLGF     E+P   +  PA  L  +                                 N  ISSPGDDP  VKARLK WAQ VA
Subjt:  LEEVKACRDLGFEL---EMPSRISISPASALDINNG------------------------------GNSHISSPGDDPQHVKARLKRWAQAVA

AT2G31560.2 Protein of unknown function (DUF1685)1.2e-0436.56Show/hide
Query:  LEEVKACRDLGFEL---EMPSRISISPASALDINNG------------------------------GNSHISSPGDDPQHVKARLKRWAQAVA
        LEE+K C DLGF     E+P   +  PA  L  +                                 N  ISSPGDDP  VKARLK WAQ VA
Subjt:  LEEVKACRDLGFEL---EMPSRISISPASALDINNG------------------------------GNSHISSPGDDPQHVKARLKRWAQAVA

AT2G46940.1 unknown protein9.3e-2639.59Show/hide
Query:  LDYKDDHMDDGCDDDEVVN-LCRLSVCTSSCMYGDNSAELYENGIGCGSMSQMSIESFEGDAD---DEELSNE----EKNYSESGFYSLPATPPRIQRK-
        LD + ++ DDG   +   N L RLS+CT+S    D   E + + +G    S +S+ESF  D     D E+S++    + +   SGFYSLP    R +RK 
Subjt:  LDYKDDHMDDGCDDDEVVN-LCRLSVCTSSCMYGDNSAELYENGIGCGSMSQMSIESFEGDAD---DEELSNE----EKNYSESGFYSLPATPPRIQRK-

Query:  --SGNKDCMSENEAKKCTRDVRKRRRRRMEGAENCKSGGMGIKMGYSERQSGG---VMLIARPKGG-RRSMCMHLEEVKACRDLGFELEMPSRISISPAS
          SG  +   E+E K          +RR+   ++ +    G      ER S G   + ++ + +GG ++SM M  EEVKACRDLGFEL++P R+S    S
Subjt:  --SGNKDCMSENEAKKCTRDVRKRRRRRMEGAENCKSGGMGIKMGYSERQSGG---VMLIARPKGG-RRSMCMHLEEVKACRDLGFELEMPSRISISPAS

Query:  ALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR
          + ++GGNS      ISSPGDDP+ VKARLK WAQAVALAS  R
Subjt:  ALDINNGGNS-----HISSPGDDPQHVKARLKRWAQAVALASTPR

AT3G62070.1 unknown protein1.3e-2742.31Show/hide
Query:  VVNLCRLSVCTSSCMYGDNSAELYENGIGCGSMSQMSIESFEGDADDEELSNE---EKNYSES-------GFYSLPATPPRIQRK---SGNKDCMSENE-
        V +L RLS+CTSS    D +A   +      S    S+E+F+G   D E+S++   ++N  ES       GFYSLP TP R +RK   SG  D    N+ 
Subjt:  VVNLCRLSVCTSSCMYGDNSAELYENGIGCGSMSQMSIESFEGDADDEELSNE---EKNYSES-------GFYSLPATPPRIQRK---SGNKDCMSENE-

Query:  AKKCTRDVRKRRRRRMEGAE-NCKSGGMGIKMGYSERQSGGVMLIARPKGGRRSMCMHLEEVKACRDLGFELE--MPSRISISPASA---LDINNGGNS-
          KC+   R+RR ++ +G    C  GG             G+ ++ R +GG RS+ M L+EVKACRDLGFELE  +P RIS+S   +      ++GG+S 
Subjt:  AKKCTRDVRKRRRRRMEGAE-NCKSGGMGIKMGYSERQSGGVMLIARPKGGRRSMCMHLEEVKACRDLGFELE--MPSRISISPASA---LDINNGGNS-

Query:  ----HISSPGDDPQHVKARLKRWAQAVALASTPR
             ISSPGDDP+ VKARLK WAQAVALAS  R
Subjt:  ----HISSPGDDPQHVKARLKRWAQAVALASTPR

AT4G01670.1 unknown protein2.1e-3040.25Show/hide
Query:  DDDEVVNLCRLSVCTSSCMYGDNSAELYENGIGCGSMS-QMSIESFEGDADDEELSNEEKNYSESGFYSLPATPPRIQRK---------------SGNKD
        ++++  +L RLSVC++   Y  + A+      G  S S +  +    G+   EEL+  + +   +G  SLPATPPR +R+               SG+K 
Subjt:  DDDEVVNLCRLSVCTSSCMYGDNSAELYENGIGCGSMS-QMSIESFEGDADDEELSNEEKNYSESGFYSLPATPPRIQRK---------------SGNKD

Query:  CMSENEAKKCTRDVRKRRRRRME---GAENCKSGGMGIKMGYSERQSGGVMLIARPKGGRRSMCMHLEEVKACRDLGFELEMPSRISISPASALDINNGG
          SENE +K   + R+RRR + E     ++ +   +G +        GGV+++ RP GG R +CM LEEVKAC+DLGFELE P R+S S  S +D ++GG
Subjt:  CMSENEAKKCTRDVRKRRRRRME---GAENCKSGGMGIKMGYSERQSGGVMLIARPKGGRRSMCMHLEEVKACRDLGFELEMPSRISISPASALDINNGG

Query:  NS------HISSPGDDPQHVKARLKRWAQAVALAST
        NS       ISSPGDDP+ VKARLK WAQAVA  ST
Subjt:  NS------HISSPGDDPQHVKARLKRWAQAVALAST


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATGCTTTTAGCTTATCACTGACACTAAGAGCAGATAAATTGAAGTCCTCTAGAAATAAGCCCAACCCTTCTCTCCCTCTCCCTCTAACTGTCTATCCCCTTCACAA
TCTCCCATGTGCTCCTAATTTCCACTGCCTCTGCCCCCATCTCTTTCTTTTCAGCTTTGTACAAAAGAAAACACACATAATGGCCTTGCATGATCACCGACTTGATTACA
AAGATGATCATATGGATGATGGATGTGATGATGATGAAGTTGTCAACTTGTGTAGGCTATCCGTTTGTACCAGCAGTTGTATGTATGGTGACAATAGTGCCGAGCTGTAT
GAAAATGGGATTGGATGTGGCAGCATGTCTCAAATGTCCATAGAAAGTTTTGAAGGTGACGCGGATGACGAGGAGTTGTCCAATGAAGAGAAAAATTACTCTGAATCCGG
TTTTTACTCACTACCGGCTACACCTCCACGGATTCAGAGAAAGAGTGGTAATAAAGATTGCATGAGCGAAAATGAAGCTAAAAAGTGTACGAGGGACGTGAGGAAAAGGA
GAAGAAGGAGAATGGAAGGAGCCGAAAATTGCAAGTCGGGGGGAATGGGAATTAAGATGGGTTATAGTGAGAGGCAGAGCGGTGGAGTAATGTTGATAGCAAGGCCCAAG
GGTGGGAGGAGATCGATGTGCATGCACTTGGAGGAAGTGAAAGCATGCAGAGATCTTGGGTTTGAGCTGGAGATGCCCTCTCGTATTTCCATCTCACCTGCTTCAGCACT
CGATATCAATAATGGTGGAAATTCCCACATCTCTAGCCCTGGGGATGATCCACAGCACGTTAAGGCTCGGCTCAAGAGATGGGCGCAAGCGGTGGCTCTTGCTTCTACGC
CTCGCCCCACCATCTACTAA
mRNA sequenceShow/hide mRNA sequence
ATGTATGCTTTTAGCTTATCACTGACACTAAGAGCAGATAAATTGAAGTCCTCTAGAAATAAGCCCAACCCTTCTCTCCCTCTCCCTCTAACTGTCTATCCCCTTCACAA
TCTCCCATGTGCTCCTAATTTCCACTGCCTCTGCCCCCATCTCTTTCTTTTCAGCTTTGTACAAAAGAAAACACACATAATGGCCTTGCATGATCACCGACTTGATTACA
AAGATGATCATATGGATGATGGATGTGATGATGATGAAGTTGTCAACTTGTGTAGGCTATCCGTTTGTACCAGCAGTTGTATGTATGGTGACAATAGTGCCGAGCTGTAT
GAAAATGGGATTGGATGTGGCAGCATGTCTCAAATGTCCATAGAAAGTTTTGAAGGTGACGCGGATGACGAGGAGTTGTCCAATGAAGAGAAAAATTACTCTGAATCCGG
TTTTTACTCACTACCGGCTACACCTCCACGGATTCAGAGAAAGAGTGGTAATAAAGATTGCATGAGCGAAAATGAAGCTAAAAAGTGTACGAGGGACGTGAGGAAAAGGA
GAAGAAGGAGAATGGAAGGAGCCGAAAATTGCAAGTCGGGGGGAATGGGAATTAAGATGGGTTATAGTGAGAGGCAGAGCGGTGGAGTAATGTTGATAGCAAGGCCCAAG
GGTGGGAGGAGATCGATGTGCATGCACTTGGAGGAAGTGAAAGCATGCAGAGATCTTGGGTTTGAGCTGGAGATGCCCTCTCGTATTTCCATCTCACCTGCTTCAGCACT
CGATATCAATAATGGTGGAAATTCCCACATCTCTAGCCCTGGGGATGATCCACAGCACGTTAAGGCTCGGCTCAAGAGATGGGCGCAAGCGGTGGCTCTTGCTTCTACGC
CTCGCCCCACCATCTACTAA
Protein sequenceShow/hide protein sequence
MYAFSLSLTLRADKLKSSRNKPNPSLPLPLTVYPLHNLPCAPNFHCLCPHLFLFSFVQKKTHIMALHDHRLDYKDDHMDDGCDDDEVVNLCRLSVCTSSCMYGDNSAELY
ENGIGCGSMSQMSIESFEGDADDEELSNEEKNYSESGFYSLPATPPRIQRKSGNKDCMSENEAKKCTRDVRKRRRRRMEGAENCKSGGMGIKMGYSERQSGGVMLIARPK
GGRRSMCMHLEEVKACRDLGFELEMPSRISISPASALDINNGGNSHISSPGDDPQHVKARLKRWAQAVALASTPRPTIY