CuGenDBv2

Moc09g18370 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g18370
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr9:14403772..14415408
RNA-Seq Expression: Moc09g18370
Synteny: Moc09g18370
Gene Ontology terms: NA
InterPro domains: IPR021109 - Aspartic peptidase domain superfamily
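
The genome location above is written as chromosome:start..end. As a minimal sketch, assuming the coordinates are 1-based and inclusive (this page does not state the convention), the field can be parsed and the gene span computed as follows. The names `parse_location` and `span_length` are hypothetical helpers for illustration, not part of CuGenDB.

```python
import re

# Hypothetical helpers for illustration; not a CuGenDB API.
# Matches strings of the form "chrom:start..end", e.g. "chr9:14403772..14415408".
LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return match.group("chrom"), int(match.group("start")), int(match.group("end"))


def span_length(start, end):
    """Length in bp, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr9:14403772..14415408")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 11637 bp on chr9.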


Homology
GenBank top hits | e value | % identity | Alignment
XP_022150861.1 uncharacterized protein LOC111018906 [Momordica charantia] | 7.0e-74 | 61.15
Query:  PPRWKEDRPAVINTIHGGPSGGQSRQKRKALAREAAHEVCTSYLKEPMIPILFDDQDNEGVHVPHNDALVIAPLIDHVKVRRVLVDGGASANMLSFSTYT
        PPR ++DR AVINTI  GPSGGQS  KRK LAREA  EVC    ++P   I F D D EGVH+PHNDALVIAPLIDHV VRRVLVDGGAS N+LS +TY 
Subjt:  PPRWKEDRPAVINTIHGGPSGGQSRQKRKALAREAAHEVCTSYLKEPMIPILFDDQDNEGVHVPHNDALVIAPLIDHVKVRRVLVDGGASANMLSFSTYT

Query:  ALGWERIHLKLSPTPLVGFAGESLSAEGCISLPVTIGEGDQQVTKVVEFVVINRSSAYNTIIGRPLIHDLRAVPSTYHQVLNCPTSAGIATVRGEQKTSR
        ALGW R  LK SPTPLVGF+GE +S EGCI LPV IG+ D QVT++ EFVVI   SAYN I GRP+IH  RAV ST HQVL   T  G+ TVRGEQKTSR
Subjt:  ALGWERIHLKLSPTPLVGFAGESLSAEGCISLPVTIGEGDQQVTKVVEFVVINRSSAYNTIIGRPLIHDLRAVPSTYHQVLNCPTSAGIATVRGEQKTSR

Query:  ECYATAIKGTTTCAAVTDAAEPCSDEPELSR--------GSLAEKLELVPLLGPEKQPQV
        ECYA+A+KG++  A    A+     +PE +          +  EKLELVPLL P++Q Q+
Subjt:  ECYATAIKGTTTCAAVTDAAEPCSDEPELSR--------GSLAEKLELVPLLGPEKQPQV

XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia] | 1.9e-71 | 51.37
Query:  KDPDQKSPSPKKQRSDGWGSSRRADDNQSRGRRDEKAP-----SDRRGTKFNKREKVEPKGSAREEKREKSPPPRWKEDRPAVINTIHGGPSGGQSRQKR
        K P++   +P+++  D +    R   + +    + K+       D    KF  + +        E KR ++PP R   DRPAVINTI GGPSGGQS  KR
Subjt:  KDPDQKSPSPKKQRSDGWGSSRRADDNQSRGRRDEKAP-----SDRRGTKFNKREKVEPKGSAREEKREKSPPPRWKEDRPAVINTIHGGPSGGQSRQKR

Query:  KALAREAAHEVCTSYLKEPMIPILFDDQDNEGVHVPHNDALVIAPLIDHVKVRRVLVDGGASANMLSFSTYTALGWERIHLKLSPTPLVGFAGESLSAEG
        K LAR A  EVC    + P  PI FD  D   VH+PHNDALVIAPLIDHV VRRVLVDGGASAN+LS  TY ALGW R  LK SPTPLVGF+GES+  EG
Subjt:  KALAREAAHEVCTSYLKEPMIPILFDDQDNEGVHVPHNDALVIAPLIDHVKVRRVLVDGGASANMLSFSTYTALGWERIHLKLSPTPLVGFAGESLSAEG

Query:  CISLPVTIGEGDQQVTKVVEFVVINRSSAYNTIIGRPLIHDLRAVPSTYHQVLNCPTSAGIATVRGEQKTSRECYATAIKGTTTCAAVT-----DAAEPC
        CI LPVT+G+   +VT++ EFVV++  SAYN I GRP+IH  RA+PST HQVL   T  G+ TVRGEQ  SRECYA+ +KGT+ CA  T        E  
Subjt:  CISLPVTIGEGDQQVTKVVEFVVINRSSAYNTIIGRPLIHDLRAVPSTYHQVLNCPTSAGIATVRGEQKTSRECYATAIKGTTTCAAVT-----DAAEPC

Query:  SDEPELSRGSLAEKLELVPLLGPEKQPQV
        +D P     +  E+LELVPLL  EKQ Q+
Subjt:  SDEPELSRGSLAEKLELVPLLGPEKQPQV

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia] | 4.7e-78 | 41.37
Query:  RSEVDLLRDQFQREIEDLKRQC-RPVDPHRVAEQEEPHFSQVILDAPIPSRFKAPVKSSYDGSGDSISYVEVFRGKDGFPGRERRHEVPSISNSLGGLNK
        R E DL++ +F  ++E LK +C +   P    +  E  F+  I++APIP +FK P    YDGS D   YVEVF G   F       +  +   +L G  +
Subjt:  RSEVDLLRDQFQREIEDLKRQC-RPVDPHRVAEQEEPHFSQVILDAPIPSRFKAPVKSSYDGSGDSISYVEVFRGKDGFPGRERRHEVPSISNSLGGLNK

Query:  I-----------VVPTVEAPIHGQLSATEKDVHQPILSSTV-------VEVELDDRVRKPTAGLLERDARQYIDDLELWKANGARRSNRGKDPDQKSPSP
        +               +     GQ S    D       +T+       + V+L +      A +L+ +A++ ID  EL +    R     K  DQK  S 
Subjt:  I-----------VVPTVEAPIHGQLSATEKDVHQPILSSTV-------VEVELDDRVRKPTAGLLERDARQYIDDLELWKANGARRSNRGKDPDQKSPSP

Query:  KKQRSDGWGSSRRADDNQSRG--RRDEKAPSDRR----------------GTKFNKREKVEPKGSA--REEKREKSPPPRWKEDRPAVINTIHGGPSGGQ
        KK++ D     + +  + SR   RR E  PS  R                   + K+   +P+ ++  ++E+R++S  P  +EDRPAVINTI GGPSGGQ
Subjt:  KKQRSDGWGSSRRADDNQSRG--RRDEKAPSDRR----------------GTKFNKREKVEPKGSA--REEKREKSPPPRWKEDRPAVINTIHGGPSGGQ

Query:  SRQKRKALAREAAHEVCTSYLKEPMIPILFDDQDNEGVHVPHNDALVIAPLIDHVKVRRVLVDGGASANMLSFSTYTALGWERIHLKLSPTPLVGFAGES
           KRK LA EA  +V     ++P   I F D D EGVH+PHNDALVIAPLIDHV VRRVLVDGGASAN+LS  TY AL   R  LK SPTPLVGF+ ES
Subjt:  SRQKRKALAREAAHEVCTSYLKEPMIPILFDDQDNEGVHVPHNDALVIAPLIDHVKVRRVLVDGGASANMLSFSTYTALGWERIHLKLSPTPLVGFAGES

Query:  LSAEGCISLPVTIGEGDQQVTKVVEFVVINRSSAYNTIIGRPLIHDLRAVPSTYHQVLNCPTSAGIATVRGEQKTSRECYATAIKGTTTCAAVTDAAEPC
        +S EGCI LPVTIG+   QVT++ EFVVI+   AYN I  RP+IH  +AVPS  HQVL   T  G+ TVRGEQKTSRECYA+A+K ++ CA     ++  
Subjt:  LSAEGCISLPVTIGEGDQQVTKVVEFVVINRSSAYNTIIGRPLIHDLRAVPSTYHQVLNCPTSAGIATVRGEQKTSRECYATAIKGTTTCAAVTDAAEPC

Query:  SDEPELSRGS
         D P  ++GS
Subjt:  SDEPELSRGS

XP_022157676.1 uncharacterized protein LOC111024332 [Momordica charantia] | 2.8e-75 | 47.87
Query:  LSSTVVEVELDDRVRKPTAGLLERDARQYIDDLELWKANGAR------RSNRGKD--PDQKSPSPKKQRSDGWGSSRRADDNQSRGRRDE----------
        ++   + V+L +      A +L++ A++ ID  EL +    R      R   GKD   D KS   K   S G    RRA++  +R R  E          
Subjt:  LSSTVVEVELDDRVRKPTAGLLERDARQYIDDLELWKANGAR------RSNRGKD--PDQKSPSPKKQRSDGWGSSRRADDNQSRGRRDE----------

Query:  ---------------KAPSDRRGTKFNKREKVEPKGSAREEKREKSPPPRWKEDRPAVINTIHGGPSGGQSRQKRKALAREAAHEVCTSYLKEPMIPILF
                       K P   RG    +R K +   + ++E+R++S  P  + DRPAVINTI GGPSGGQS  KRK LAREA  EVC    + P  PI F
Subjt:  ---------------KAPSDRRGTKFNKREKVEPKGSAREEKREKSPPPRWKEDRPAVINTIHGGPSGGQSRQKRKALAREAAHEVCTSYLKEPMIPILF

Query:  DDQDNEGVHVPHNDALVIAPLIDHVKVRRVLVDGGASANMLSFSTYTALGWERIHLKLSPTPLVGFAGESLSAEGCISLPVTIGEGDQQVTKVVEFVVIN
        D  D E VH+PHNDALVIAPLIDHV VRRVLVDGGASAN+LS  TY ALGW R  LK SPTPLVGF+GES+  EGCI LPVT+G+   +VT++ EFVV++
Subjt:  DDQDNEGVHVPHNDALVIAPLIDHVKVRRVLVDGGASANMLSFSTYTALGWERIHLKLSPTPLVGFAGESLSAEGCISLPVTIGEGDQQVTKVVEFVVIN

Query:  RSSAYNTIIGRPLIHDLRAVPSTYHQVLNCPTSAGIATVRGEQKTSRECYATAIKGTTTCAAVTDAAEPCSDEPELSRGSLA---EKLELVPLLGPEKQ
          S YN I GRP+IH  R +PST HQVL   T  G+ TVRGEQ  SRECYA A+KG++ CA  T        E +L R   A   E+LELVPLL PEKQ
Subjt:  RSSAYNTIIGRPLIHDLRAVPSTYHQVLNCPTSAGIATVRGEQKTSRECYATAIKGTTTCAAVTDAAEPCSDEPELSRGSLA---EKLELVPLLGPEKQ

XP_022158844.1 uncharacterized protein LOC111025310 [Momordica charantia] | 5.7e-84 | 57.27
Query:  KPTAGLLE--RDARQYIDDLELWKANGARRSNRGKDPDQKSPSPKKQRSDGWGSSRRADDNQSRGRRDEKAPSDRRGTKFNK------------------
        +P A L E    ARQYID LELWKANGARRS+RG+D D KSP  KK+  D   SSRRADD++SR RRDE+  S+RRG KF+K                  
Subjt:  KPTAGLLE--RDARQYIDDLELWKANGARRSNRGKDPDQKSPSPKKQRSDGWGSSRRADDNQSRGRRDEKAPSDRRGTKFNK------------------

Query:  ------------------------------------------------------------REKVEPKGSAREEKREKSPPPRWKEDRPAVINTIHGGPSG
                                                                    RE+ E +GSAREEKRE+S PPR KEDRPAVINTIHGGPSG
Subjt:  ------------------------------------------------------------REKVEPKGSAREEKREKSPPPRWKEDRPAVINTIHGGPSG

Query:  GQSRQKRKALAREAAHEVCTSYLKEPMIPILFDDQDNEGVHVPHNDALVIAPLIDHVKVRRVLVDGGASANMLSFSTYTALGWERIHLKLSPTPLVGFAG
         +S QKRKALARE AHEVCTSY K P++PILFD+QD E VH+PHNDALVIAPLIDHVKVRRV VDGGASAN+ SFSTYTALGWER HLK   T LVGFA 
Subjt:  GQSRQKRKALAREAAHEVCTSYLKEPMIPILFDDQDNEGVHVPHNDALVIAPLIDHVKVRRVLVDGGASANMLSFSTYTALGWERIHLKLSPTPLVGFAG

Query:  ESLSAEGCISLPVTIGEGDQQVTKVVEFVVINRSSAY
        ES+S EGCISLPVTI EG+ QVT+V EFVVI+RSSAY
Subjt:  ESLSAEGCISLPVTIGEGDQQVTKVVEFVVINRSSAY

TrEMBL top hits | e value | % identity | Alignment
A0A6J1DCR3 uncharacterized protein LOC111018906 | 3.4e-74 | 61.15
Query:  PPRWKEDRPAVINTIHGGPSGGQSRQKRKALAREAAHEVCTSYLKEPMIPILFDDQDNEGVHVPHNDALVIAPLIDHVKVRRVLVDGGASANMLSFSTYT
        PPR ++DR AVINTI  GPSGGQS  KRK LAREA  EVC    ++P   I F D D EGVH+PHNDALVIAPLIDHV VRRVLVDGGAS N+LS +TY 
Subjt:  PPRWKEDRPAVINTIHGGPSGGQSRQKRKALAREAAHEVCTSYLKEPMIPILFDDQDNEGVHVPHNDALVIAPLIDHVKVRRVLVDGGASANMLSFSTYT

Query:  ALGWERIHLKLSPTPLVGFAGESLSAEGCISLPVTIGEGDQQVTKVVEFVVINRSSAYNTIIGRPLIHDLRAVPSTYHQVLNCPTSAGIATVRGEQKTSR
        ALGW R  LK SPTPLVGF+GE +S EGCI LPV IG+ D QVT++ EFVVI   SAYN I GRP+IH  RAV ST HQVL   T  G+ TVRGEQKTSR
Subjt:  ALGWERIHLKLSPTPLVGFAGESLSAEGCISLPVTIGEGDQQVTKVVEFVVINRSSAYNTIIGRPLIHDLRAVPSTYHQVLNCPTSAGIATVRGEQKTSR

Query:  ECYATAIKGTTTCAAVTDAAEPCSDEPELSR--------GSLAEKLELVPLLGPEKQPQV
        ECYA+A+KG++  A    A+     +PE +          +  EKLELVPLL P++Q Q+
Subjt:  ECYATAIKGTTTCAAVTDAAEPCSDEPELSR--------GSLAEKLELVPLLGPEKQPQV

A0A6J1DD03 uncharacterized protein LOC111019899 | 9.2e-72 | 51.37
Query:  KDPDQKSPSPKKQRSDGWGSSRRADDNQSRGRRDEKAP-----SDRRGTKFNKREKVEPKGSAREEKREKSPPPRWKEDRPAVINTIHGGPSGGQSRQKR
        K P++   +P+++  D +    R   + +    + K+       D    KF  + +        E KR ++PP R   DRPAVINTI GGPSGGQS  KR
Subjt:  KDPDQKSPSPKKQRSDGWGSSRRADDNQSRGRRDEKAP-----SDRRGTKFNKREKVEPKGSAREEKREKSPPPRWKEDRPAVINTIHGGPSGGQSRQKR

Query:  KALAREAAHEVCTSYLKEPMIPILFDDQDNEGVHVPHNDALVIAPLIDHVKVRRVLVDGGASANMLSFSTYTALGWERIHLKLSPTPLVGFAGESLSAEG
        K LAR A  EVC    + P  PI FD  D   VH+PHNDALVIAPLIDHV VRRVLVDGGASAN+LS  TY ALGW R  LK SPTPLVGF+GES+  EG
Subjt:  KALAREAAHEVCTSYLKEPMIPILFDDQDNEGVHVPHNDALVIAPLIDHVKVRRVLVDGGASANMLSFSTYTALGWERIHLKLSPTPLVGFAGESLSAEG

Query:  CISLPVTIGEGDQQVTKVVEFVVINRSSAYNTIIGRPLIHDLRAVPSTYHQVLNCPTSAGIATVRGEQKTSRECYATAIKGTTTCAAVT-----DAAEPC
        CI LPVT+G+   +VT++ EFVV++  SAYN I GRP+IH  RA+PST HQVL   T  G+ TVRGEQ  SRECYA+ +KGT+ CA  T        E  
Subjt:  CISLPVTIGEGDQQVTKVVEFVVINRSSAYNTIIGRPLIHDLRAVPSTYHQVLNCPTSAGIATVRGEQKTSRECYATAIKGTTTCAAVT-----DAAEPC

Query:  SDEPELSRGSLAEKLELVPLLGPEKQPQV
        +D P     +  E+LELVPLL  EKQ Q+
Subjt:  SDEPELSRGSLAEKLELVPLLGPEKQPQV

A0A6J1DPC9 uncharacterized protein LOC111022280 | 2.3e-78 | 41.37
Query:  RSEVDLLRDQFQREIEDLKRQC-RPVDPHRVAEQEEPHFSQVILDAPIPSRFKAPVKSSYDGSGDSISYVEVFRGKDGFPGRERRHEVPSISNSLGGLNK
        R E DL++ +F  ++E LK +C +   P    +  E  F+  I++APIP +FK P    YDGS D   YVEVF G   F       +  +   +L G  +
Subjt:  RSEVDLLRDQFQREIEDLKRQC-RPVDPHRVAEQEEPHFSQVILDAPIPSRFKAPVKSSYDGSGDSISYVEVFRGKDGFPGRERRHEVPSISNSLGGLNK

Query:  I-----------VVPTVEAPIHGQLSATEKDVHQPILSSTV-------VEVELDDRVRKPTAGLLERDARQYIDDLELWKANGARRSNRGKDPDQKSPSP
        +               +     GQ S    D       +T+       + V+L +      A +L+ +A++ ID  EL +    R     K  DQK  S 
Subjt:  I-----------VVPTVEAPIHGQLSATEKDVHQPILSSTV-------VEVELDDRVRKPTAGLLERDARQYIDDLELWKANGARRSNRGKDPDQKSPSP

Query:  KKQRSDGWGSSRRADDNQSRG--RRDEKAPSDRR----------------GTKFNKREKVEPKGSA--REEKREKSPPPRWKEDRPAVINTIHGGPSGGQ
        KK++ D     + +  + SR   RR E  PS  R                   + K+   +P+ ++  ++E+R++S  P  +EDRPAVINTI GGPSGGQ
Subjt:  KKQRSDGWGSSRRADDNQSRG--RRDEKAPSDRR----------------GTKFNKREKVEPKGSA--REEKREKSPPPRWKEDRPAVINTIHGGPSGGQ

Query:  SRQKRKALAREAAHEVCTSYLKEPMIPILFDDQDNEGVHVPHNDALVIAPLIDHVKVRRVLVDGGASANMLSFSTYTALGWERIHLKLSPTPLVGFAGES
           KRK LA EA  +V     ++P   I F D D EGVH+PHNDALVIAPLIDHV VRRVLVDGGASAN+LS  TY AL   R  LK SPTPLVGF+ ES
Subjt:  SRQKRKALAREAAHEVCTSYLKEPMIPILFDDQDNEGVHVPHNDALVIAPLIDHVKVRRVLVDGGASANMLSFSTYTALGWERIHLKLSPTPLVGFAGES

Query:  LSAEGCISLPVTIGEGDQQVTKVVEFVVINRSSAYNTIIGRPLIHDLRAVPSTYHQVLNCPTSAGIATVRGEQKTSRECYATAIKGTTTCAAVTDAAEPC
        +S EGCI LPVTIG+   QVT++ EFVVI+   AYN I  RP+IH  +AVPS  HQVL   T  G+ TVRGEQKTSRECYA+A+K ++ CA     ++  
Subjt:  LSAEGCISLPVTIGEGDQQVTKVVEFVVINRSSAYNTIIGRPLIHDLRAVPSTYHQVLNCPTSAGIATVRGEQKTSRECYATAIKGTTTCAAVTDAAEPC

Query:  SDEPELSRGS
         D P  ++GS
Subjt:  SDEPELSRGS

A0A6J1DYW5 uncharacterized protein LOC111024332 | 1.4e-75 | 47.87
Query:  LSSTVVEVELDDRVRKPTAGLLERDARQYIDDLELWKANGAR------RSNRGKD--PDQKSPSPKKQRSDGWGSSRRADDNQSRGRRDE----------
        ++   + V+L +      A +L++ A++ ID  EL +    R      R   GKD   D KS   K   S G    RRA++  +R R  E          
Subjt:  LSSTVVEVELDDRVRKPTAGLLERDARQYIDDLELWKANGAR------RSNRGKD--PDQKSPSPKKQRSDGWGSSRRADDNQSRGRRDE----------

Query:  ---------------KAPSDRRGTKFNKREKVEPKGSAREEKREKSPPPRWKEDRPAVINTIHGGPSGGQSRQKRKALAREAAHEVCTSYLKEPMIPILF
                       K P   RG    +R K +   + ++E+R++S  P  + DRPAVINTI GGPSGGQS  KRK LAREA  EVC    + P  PI F
Subjt:  ---------------KAPSDRRGTKFNKREKVEPKGSAREEKREKSPPPRWKEDRPAVINTIHGGPSGGQSRQKRKALAREAAHEVCTSYLKEPMIPILF

Query:  DDQDNEGVHVPHNDALVIAPLIDHVKVRRVLVDGGASANMLSFSTYTALGWERIHLKLSPTPLVGFAGESLSAEGCISLPVTIGEGDQQVTKVVEFVVIN
        D  D E VH+PHNDALVIAPLIDHV VRRVLVDGGASAN+LS  TY ALGW R  LK SPTPLVGF+GES+  EGCI LPVT+G+   +VT++ EFVV++
Subjt:  DDQDNEGVHVPHNDALVIAPLIDHVKVRRVLVDGGASANMLSFSTYTALGWERIHLKLSPTPLVGFAGESLSAEGCISLPVTIGEGDQQVTKVVEFVVIN

Query:  RSSAYNTIIGRPLIHDLRAVPSTYHQVLNCPTSAGIATVRGEQKTSRECYATAIKGTTTCAAVTDAAEPCSDEPELSRGSLA---EKLELVPLLGPEKQ
          S YN I GRP+IH  R +PST HQVL   T  G+ TVRGEQ  SRECYA A+KG++ CA  T        E +L R   A   E+LELVPLL PEKQ
Subjt:  RSSAYNTIIGRPLIHDLRAVPSTYHQVLNCPTSAGIATVRGEQKTSRECYATAIKGTTTCAAVTDAAEPCSDEPELSRGSLA---EKLELVPLLGPEKQ

A0A6J1E0L8 uncharacterized protein LOC111025310 | 2.8e-84 | 57.27
Query:  KPTAGLLE--RDARQYIDDLELWKANGARRSNRGKDPDQKSPSPKKQRSDGWGSSRRADDNQSRGRRDEKAPSDRRGTKFNK------------------
        +P A L E    ARQYID LELWKANGARRS+RG+D D KSP  KK+  D   SSRRADD++SR RRDE+  S+RRG KF+K                  
Subjt:  KPTAGLLE--RDARQYIDDLELWKANGARRSNRGKDPDQKSPSPKKQRSDGWGSSRRADDNQSRGRRDEKAPSDRRGTKFNK------------------

Query:  ------------------------------------------------------------REKVEPKGSAREEKREKSPPPRWKEDRPAVINTIHGGPSG
                                                                    RE+ E +GSAREEKRE+S PPR KEDRPAVINTIHGGPSG
Subjt:  ------------------------------------------------------------REKVEPKGSAREEKREKSPPPRWKEDRPAVINTIHGGPSG

Query:  GQSRQKRKALAREAAHEVCTSYLKEPMIPILFDDQDNEGVHVPHNDALVIAPLIDHVKVRRVLVDGGASANMLSFSTYTALGWERIHLKLSPTPLVGFAG
         +S QKRKALARE AHEVCTSY K P++PILFD+QD E VH+PHNDALVIAPLIDHVKVRRV VDGGASAN+ SFSTYTALGWER HLK   T LVGFA 
Subjt:  GQSRQKRKALAREAAHEVCTSYLKEPMIPILFDDQDNEGVHVPHNDALVIAPLIDHVKVRRVLVDGGASANMLSFSTYTALGWERIHLKLSPTPLVGFAG

Query:  ESLSAEGCISLPVTIGEGDQQVTKVVEFVVINRSSAY
        ES+S EGCISLPVTI EG+ QVT+V EFVVI+RSSAY
Subjt:  ESLSAEGCISLPVTIGEGDQQVTKVVEFVVINRSSAY

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGAGGAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAA
GAACAGTGCAAAGTTGTCACCCTCAGGAGTGGACTGGCATATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACTGAACCAATTGTAAAGATA
CCAGAAAATCCAACAACACCAGAAAAAGAAAATATTAGAAAAGGAGACTTTGAAGAGTGCTCTGCTATAAATAGCTTGAATCCTATTATGTTTGATGAGTTTTAT
GACTCGTTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAAGATATGGCTAATCCTATTGGAAAAATACAAAAAGAAGAATGCAAGTCG
TTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAATATGCGTATCTAGGGGATAACGACACTTTACCTGTTCGAGAA
GTCGTGCAACATATCTACAACTTAAGGACTTCATTGGATTTTGCGGTTTTACCTTCATGGCCTCCAGCGCTAGCTGCTATCCTTGGTCATCCATCTCCCATTAAG
AGAGTTGTGTTTTTGGAGGAAAGAGCACGTGACACTCTCCCTTGCTTGAAAGAGAAGGGTGAAGCTCTCAATCCCGATAAAAACTCTCCTCAGAGGTCCTCCCAC
GGTGACCATTCGTTTCGGTCTGAAGTGGACCTCCTCCGGGATCAGTTTCAGAGGGAGATAGAAGATCTCAAGCGACAGTGCAGGCCTGTAGATCCACATCGCGTG
GCCGAGCAAGAGGAACCGCATTTCTCCCAAGTAATCCTAGACGCACCTATCCCATCGAGGTTCAAGGCCCCGGTCAAGAGTTCCTACGACGGGTCTGGAGATTCG
ATCTCATACGTGGAGGTGTTTCGAGGGAAAGATGGATTTCCTGGCCGCGAGCGACGCCATGAAGTGCCGAGCATTTCAAATAGCCTTGGAGGGCTCAACAAGATT
GTGGTACCGACAGTTGAAGCCCCGATCCATGGACAGTTATCGGCAACTGAGAAGGATGTTCATCAACCAATTCTCAGCTCGACAGTTGTTGAAGTTGAACTTGAC
GATAGAGTTCGGAAGCCGACCGCCGGCCTCCTTGAACGAGATGCTCGCCAGTACATTGACGACTTGGAGTTATGGAAAGCCAATGGAGCCCGGCGAAGCAACCGT
GGCAAAGATCCGGACCAAAAGTCCCCTTCTCCCAAGAAGCAACGCAGCGATGGTTGGGGTTCGTCTCGGCGAGCCGACGACAATCAGAGTAGAGGTCGTCGCGAC
GAGAAAGCCCCTTCAGACCGTCGAGGGACGAAGTTCAACAAGCGTGAAAAAGTAGAGCCAAAGGGGTCGGCTCGGGAGGAGAAGCGAGAGAAGTCGCCACCACCG
AGATGGAAGGAAGATCGTCCCGCAGTTATAAATACCATCCATGGGGGCCCGAGTGGGGGACAGTCAAGGCAGAAGAGGAAAGCTCTGGCTCGAGAGGCAGCACAC
GAGGTCTGTACCTCGTACCTCAAGGAGCCTATGATACCAATCCTGTTCGACGACCAAGACAACGAGGGAGTGCACGTGCCCCATAATGACGCCCTGGTAATTGCC
CCACTCATAGATCACGTGAAGGTGAGAAGAGTTCTTGTCGACGGTGGAGCGTCGGCCAATATGTTATCATTCTCGACCTACACGGCCCTAGGGTGGGAGAGGATA
CACCTGAAGCTCAGCCCGACGCCTTTGGTCGGTTTTGCAGGGGAGTCACTCAGTGCGGAAGGATGCATCTCGCTCCCTGTTACCATCGGCGAGGGAGATCAACAA
GTAACTAAGGTTGTGGAATTTGTTGTGATTAATCGGAGCTCTGCGTACAACACCATAATTGGTCGGCCCTTGATTCATGATCTCAGGGCAGTTCCGTCCACTTAC
CACCAAGTCTTGAACTGCCCCACCTCGGCCGGAATTGCGACAGTCCGAGGTGAGCAAAAAACGTCCAGAGAATGCTACGCCACCGCAATAAAGGGAACAACCACT
TGTGCAGCGGTCACGGACGCGGCAGAGCCATGTTCTGACGAACCAGAGCTTAGTCGTGGTAGCCTAGCTGAAAAGCTAGAACTTGTCCCCCTGCTGGGGCCAGAA
AAGCAGCCACAAGTTCGTTTTTTCCGGCGAGGTGCAGCGGCGGCGCTTCTTCGGATTGAACAGCAGCAGTGGCGGCGTTCCTTCGGACCGAACAGCAGCAGCAGT
GGCGCTCCTTCGGCAGTCATTGATGGCAGCAGCGGCGCACCTTCAGTCTGCGGCGGCAGTGCCGTTTCTGCGGCGTTCCGACGGCGGCCCTTGGCCACGCTTACC
CACACCTGTCCGGAGTTCGATTCGCGGTACCCACTCCTATTTAGACTCAAACCAGCATTGCCCATCGCTTTTGGCATCAGAACAGCAAGCCTAAGGACGTTCGAA
CTCAGTTTTGGAAGCCCGAACCTCTTTGGCATTGACCCACACCTGTACGAAGTCGGTTTTGGTGTGCTTAGGTGTGAGATTGACGCAAAACTTAAGGGCGCGGTT
GGTCGATGCGAGGAGTCTTTTGGAAGGGAAGCTATTGGGGCCTTGGGTAGAAATGGTCAAGGGCCAATAGATGGTGAAGTCATCGGGGCCTTGGGTGAAGGACTC
GTTGGCGACCATGGTGATGAGACGGAGGAGACATGA
mRNA sequence
ATGAGGAATTTCGAGACCCAATTGGGACAGCTCGCCAATGAATTGAAGAATAGACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAA
GAACAGTGCAAAGTTGTCACCCTCAGGAGTGGACTGGCATATGATGGACCAACAATGCCAACAACAGATGTACAGATTCCGTCCACTGAACCAATTGTAAAGATA
CCAGAAAATCCAACAACACCAGAAAAAGAAAATATTAGAAAAGGAGACTTTGAAGAGTGCTCTGCTATAAATAGCTTGAATCCTATTATGTTTGATGAGTTTTAT
GACTCGTTAGTTACAGAGATTGAAGAAGAGCTTGATAAGATAGCAGAAGGACCAGAAGATATGGCTAATCCTATTGGAAAAATACAAAAAGAAGAATGCAAGTCG
TTACTTCCGTCCATAGTGGAACCACCCACGTTGGAGCAGAAGCCATTGCCGTCGCATTTGAAATATGCGTATCTAGGGGATAACGACACTTTACCTGTTCGAGAA
GTCGTGCAACATATCTACAACTTAAGGACTTCATTGGATTTTGCGGTTTTACCTTCATGGCCTCCAGCGCTAGCTGCTATCCTTGGTCATCCATCTCCCATTAAG
AGAGTTGTGTTTTTGGAGGAAAGAGCACGTGACACTCTCCCTTGCTTGAAAGAGAAGGGTGAAGCTCTCAATCCCGATAAAAACTCTCCTCAGAGGTCCTCCCAC
GGTGACCATTCGTTTCGGTCTGAAGTGGACCTCCTCCGGGATCAGTTTCAGAGGGAGATAGAAGATCTCAAGCGACAGTGCAGGCCTGTAGATCCACATCGCGTG
GCCGAGCAAGAGGAACCGCATTTCTCCCAAGTAATCCTAGACGCACCTATCCCATCGAGGTTCAAGGCCCCGGTCAAGAGTTCCTACGACGGGTCTGGAGATTCG
ATCTCATACGTGGAGGTGTTTCGAGGGAAAGATGGATTTCCTGGCCGCGAGCGACGCCATGAAGTGCCGAGCATTTCAAATAGCCTTGGAGGGCTCAACAAGATT
GTGGTACCGACAGTTGAAGCCCCGATCCATGGACAGTTATCGGCAACTGAGAAGGATGTTCATCAACCAATTCTCAGCTCGACAGTTGTTGAAGTTGAACTTGAC
GATAGAGTTCGGAAGCCGACCGCCGGCCTCCTTGAACGAGATGCTCGCCAGTACATTGACGACTTGGAGTTATGGAAAGCCAATGGAGCCCGGCGAAGCAACCGT
GGCAAAGATCCGGACCAAAAGTCCCCTTCTCCCAAGAAGCAACGCAGCGATGGTTGGGGTTCGTCTCGGCGAGCCGACGACAATCAGAGTAGAGGTCGTCGCGAC
GAGAAAGCCCCTTCAGACCGTCGAGGGACGAAGTTCAACAAGCGTGAAAAAGTAGAGCCAAAGGGGTCGGCTCGGGAGGAGAAGCGAGAGAAGTCGCCACCACCG
AGATGGAAGGAAGATCGTCCCGCAGTTATAAATACCATCCATGGGGGCCCGAGTGGGGGACAGTCAAGGCAGAAGAGGAAAGCTCTGGCTCGAGAGGCAGCACAC
GAGGTCTGTACCTCGTACCTCAAGGAGCCTATGATACCAATCCTGTTCGACGACCAAGACAACGAGGGAGTGCACGTGCCCCATAATGACGCCCTGGTAATTGCC
CCACTCATAGATCACGTGAAGGTGAGAAGAGTTCTTGTCGACGGTGGAGCGTCGGCCAATATGTTATCATTCTCGACCTACACGGCCCTAGGGTGGGAGAGGATA
CACCTGAAGCTCAGCCCGACGCCTTTGGTCGGTTTTGCAGGGGAGTCACTCAGTGCGGAAGGATGCATCTCGCTCCCTGTTACCATCGGCGAGGGAGATCAACAA
GTAACTAAGGTTGTGGAATTTGTTGTGATTAATCGGAGCTCTGCGTACAACACCATAATTGGTCGGCCCTTGATTCATGATCTCAGGGCAGTTCCGTCCACTTAC
CACCAAGTCTTGAACTGCCCCACCTCGGCCGGAATTGCGACAGTCCGAGGTGAGCAAAAAACGTCCAGAGAATGCTACGCCACCGCAATAAAGGGAACAACCACT
TGTGCAGCGGTCACGGACGCGGCAGAGCCATGTTCTGACGAACCAGAGCTTAGTCGTGGTAGCCTAGCTGAAAAGCTAGAACTTGTCCCCCTGCTGGGGCCAGAA
AAGCAGCCACAAGTTCGTTTTTTCCGGCGAGGTGCAGCGGCGGCGCTTCTTCGGATTGAACAGCAGCAGTGGCGGCGTTCCTTCGGACCGAACAGCAGCAGCAGT
GGCGCTCCTTCGGCAGTCATTGATGGCAGCAGCGGCGCACCTTCAGTCTGCGGCGGCAGTGCCGTTTCTGCGGCGTTCCGACGGCGGCCCTTGGCCACGCTTACC
CACACCTGTCCGGAGTTCGATTCGCGGTACCCACTCCTATTTAGACTCAAACCAGCATTGCCCATCGCTTTTGGCATCAGAACAGCAAGCCTAAGGACGTTCGAA
CTCAGTTTTGGAAGCCCGAACCTCTTTGGCATTGACCCACACCTGTACGAAGTCGGTTTTGGTGTGCTTAGGTGTGAGATTGACGCAAAACTTAAGGGCGCGGTT
GGTCGATGCGAGGAGTCTTTTGGAAGGGAAGCTATTGGGGCCTTGGGTAGAAATGGTCAAGGGCCAATAGATGGTGAAGTCATCGGGGCCTTGGGTGAAGGACTC
GTTGGCGACCATGGTGATGAGACGGAGGAGACATGA
Protein sequence
MRNFETQLGQLANELKNRPQGSFPGHTELPKREGKEQCKVVTLRSGLAYDGPTMPTTDVQIPSTEPIVKIPENPTTPEKENIRKGDFEECSAINSLNPIMFDEFY
DSLVTEIEEELDKIAEGPEDMANPIGKIQKEECKSLLPSIVEPPTLEQKPLPSHLKYAYLGDNDTLPVREVVQHIYNLRTSLDFAVLPSWPPALAAILGHPSPIK
RVVFLEERARDTLPCLKEKGEALNPDKNSPQRSSHGDHSFRSEVDLLRDQFQREIEDLKRQCRPVDPHRVAEQEEPHFSQVILDAPIPSRFKAPVKSSYDGSGDS
ISYVEVFRGKDGFPGRERRHEVPSISNSLGGLNKIVVPTVEAPIHGQLSATEKDVHQPILSSTVVEVELDDRVRKPTAGLLERDARQYIDDLELWKANGARRSNR
GKDPDQKSPSPKKQRSDGWGSSRRADDNQSRGRRDEKAPSDRRGTKFNKREKVEPKGSAREEKREKSPPPRWKEDRPAVINTIHGGPSGGQSRQKRKALAREAAH
EVCTSYLKEPMIPILFDDQDNEGVHVPHNDALVIAPLIDHVKVRRVLVDGGASANMLSFSTYTALGWERIHLKLSPTPLVGFAGESLSAEGCISLPVTIGEGDQQ
VTKVVEFVVINRSSAYNTIIGRPLIHDLRAVPSTYHQVLNCPTSAGIATVRGEQKTSRECYATAIKGTTTCAAVTDAAEPCSDEPELSRGSLAEKLELVPLLGPE
KQPQVRFFRRGAAAALLRIEQQQWRRSFGPNSSSSGAPSAVIDGSSGAPSVCGGSAVSAAFRRRPLATLTHTCPEFDSRYPLLFRLKPALPIAFGIRTASLRTFE
LSFGSPNLFGIDPHLYEVGFGVLRCEIDAKLKGAVGRCEESFGREAIGALGRNGQGPIDGEVIGALGEGLVGDHGDETEET