; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g16970 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g16970
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRibonuclease H
Genome locationchr2:12730743..12736863
RNA-Seq ExpressionMoc02g16970
SyntenyMoc02g16970
Gene Ontology termsGO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR005162 - Retrotransposon gag domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]6.3e-6538.82Show/hide
Query:  YDGSGDPISYVEVFEGKMDFLAASDAMQCRVFQIALEGSVMLWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPHSHLGTVKQRDNESLTEYITRFMDE
        YDGS DP  YVEVFE  MDF AASDA++CR F+IAL GS  LWYR+L   SI +Y QLRR F+  FS+R   K   +HL T++Q++ E+L EY+TRF +E
Subjt:  YDGSGDPISYVEVFEGKMDFLAASDAMQCRVFQIALEGSVMLWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPHSHLGTVKQRDNESLTEYITRFMDE

Query:  HVKVVSCTDDIAMMYFTTGLNDRNLTIEFESRLSASLNEMLARARQYIDGLELWKANGAR------RSSRGKGRDQKSPPPRSNAVMIGARLD------G
         +KV  C+DD AM YF TGL D  LT++      A+  E+L +A++ IDG EL +    R      R   GK  +   P  +        R +      G
Subjt:  HVKVVSCTDDIAMMYFTTGLNDRNLTIEFESRLSASLNEMLARARQYIDGLELWKANGAR------RSSRGKGRDQKSPPPRSNAVMIGARLD------G

Query:  PMR--------------------IE----------------APQRESPFRPSRAE--------------------VRQSYLKKYVDSRERAEPKGSARED
        P R                    IE                AP+R S  +  R                      ++  Y KK+V      +P+ S+ E 
Subjt:  PMR--------------------IE----------------APQRESPFRPSRAE--------------------VRQSYLKKYVDSRERAEPKGSARED

Query:  K--RERSPPPRRKEDRPAVINTIHGGPSGGKSGQKRKALAQEAAHE-----------------------------------------VRRTLVDGGTSTN
        K  R+RS  P R+ DRPAVINTI GGPSGG+SG+KRK LA+ A  E                                         V R LVDGGTS N
Subjt:  K--RERSPPPRRKEDRPAVINTIHGGPSGGKSGQKRKALAQEAAHE-----------------------------------------VRRTLVDGGTSTN

Query:  ILSFSTYTALGWERRHLKRNPTPLVGFAGELVSAEGCISLPVTVGEGDQHVTKVTE
        ILS  TY ALGW R  LK++PTPLVGF+GE V  EG I LPVT+G+    VT++ E
Subjt:  ILSFSTYTALGWERRHLKRNPTPLVGFAGELVSAEGCISLPVTVGEGDQHVTKVTE

XP_022149029.1 uncharacterized protein LOC111017548 [Momordica charantia]1.9e-8560.69Show/hide
Query:  MDFLAASDAMQCRVFQIALEGSVMLWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPHSHLGTVKQRDNESLTEYITRFMDEHVKVVSCTDDIAMMYFT
        MDFLAASDA++CR FQIALEGSV LWY+QLKPRSIDSYQQLRRLFINQFSARQLLKLP SHL TVKQRDNESLTEYI R MDEHVKVVSCTDDIAMMYFT
Subjt:  MDFLAASDAMQCRVFQIALEGSVMLWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPHSHLGTVKQRDNESLTEYITRFMDEHVKVVSCTDDIAMMYFT

Query:  TGLNDRNLTIEFESRLSASLNEMLARARQYIDGLELWKANGARRSSRGKGRDQKSPPP-------------------------RSNAVMIGARLD-----
        TGLNDRNLTIEF SR  ASLN+MLARARQYIDGLELWKA GARRSSRGK RDQ+S PP                         R ++   G + D     
Subjt:  TGLNDRNLTIEFESRLSASLNEMLARARQYIDGLELWKANGARRSSRGKGRDQKSPPP-------------------------RSNAVMIGARLD-----

Query:  ---------------------GPMRIEAP------------QRESPFRPSRA---------EVRQSYLKKYVDSRERAEPKGSAREDKRERSPPPRRKED
                              P ++  P             ++     SR           +R+ YLKKYV SRERA+P+GS RE+KRERS PP RKED
Subjt:  ---------------------GPMRIEAP------------QRESPFRPSRA---------EVRQSYLKKYVDSRERAEPKGSAREDKRERSPPPRRKED

Query:  RPAVINTIHGGPSGGKSG
        RPAVINTIHGGPSG KSG
Subjt:  RPAVINTIHGGPSGGKSG

XP_022152851.1 uncharacterized protein LOC111020475 [Momordica charantia]1.6e-8160.83Show/hide
Query:  MSSYDGSGDPISYVEVFEGKMDFLAASDAMQCRVFQIALEGSVMLWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPHSHLGTVKQRDNESLTEYITRF
        MSSYDGSGDPISYVEVFEGKMDFLA SDAM+C  FQI LEGS  LWYRQLK RSIDSYQQLRRLFINQFS RQ LKLP SHLGTVKQRDNES T YI RF
Subjt:  MSSYDGSGDPISYVEVFEGKMDFLAASDAMQCRVFQIALEGSVMLWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPHSHLGTVKQRDNESLTEYITRF

Query:  MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFESRLSASLNEMLARARQYIDGLELWKANGARRSSRGKGRDQKSPPPRSNAVMIGARLDGPM--RIEAP
        MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEF S   A LNEM ARARQYIDGLELW A+GA   +  +       PPRS AVMI   L   M  RI+  
Subjt:  MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFESRLSASLNEMLARARQYIDGLELWKANGARRSSRGKGRDQKSPPPRSNAVMIGARLDGPM--RIEAP

Query:  QRESPF------------RPSRAE-----------------------------VRQSYLKKYVDSRERAEPKGSAREDKRERSPPPRRKEDRPAVINTIH
         R+  F            RPS                                +R+ YLKKYV + ERA P+ SA E+KRERS  P+R+EDR       H
Subjt:  QRESPF------------RPSRAE-----------------------------VRQSYLKKYVDSRERAEPKGSAREDKRERSPPPRRKEDRPAVINTIH

Query:  G-GPSGGKSGQKRK
          GPSGG+SGQK +
Subjt:  G-GPSGGKSGQKRK

XP_022157448.1 uncharacterized protein LOC111024144 [Momordica charantia]6.9e-6439.81Show/hide
Query:  MSSYDGSGDPISYVEVFEGKMDFLAASDAMQCRVFQIALEGSVMLWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPHSHLGTVKQRDNESLTEYITRF
        M  YDG  DP  YVEVFEG +DF AAS+A++CR FQIAL G+  LWYR+L  RSI +Y QL++ FI+QFS RQ  +   +HL T++Q++ E+L EY+TRF
Subjt:  MSSYDGSGDPISYVEVFEGKMDFLAASDAMQCRVFQIALEGSVMLWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPHSHLGTVKQRDNESLTEYITRF

Query:  MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFESRLSASLNEMLARARQYIDGLELWKANGARRSSRGKGRDQKSP-------------------PPRSN
          E +KV  C+DD  M YF TGL D  LT++      ++  E+L +A++ IDG EL K    R   R    DQK P                     RS+
Subjt:  MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFESRLSASLNEMLARARQYIDGLELWKANGARRSSRGKGRDQKSP-------------------PPRSN

Query:  AVMIGARLD--GPMRIEAP---------------------QRESPFRPSRAE------------------------------VRQSYLKKYVDSRERAEP
          M  +  +  GP     P                     +R    R  R +                              ++  Y KKYV      +P
Subjt:  AVMIGARLD--GPMRIEAP---------------------QRESPFRPSRAE------------------------------VRQSYLKKYVDSRERAEP

Query:  KGSAREDKRER--SPPPRRKEDRPAVINTIHGGPSGGKSGQKRKALAQEAAHEVRRTLVDGGTSTNILSFSTYTALGWERRHLKRNPTPLVGFAGELVSA
          S+ E K+ER  S  P R++DRPAVINTI G PSGG+SG KRK L       VR+  VD G S NILS +TY ALGW R  LK++ TPLVGFAGE V+ 
Subjt:  KGSAREDKRER--SPPPRRKEDRPAVINTIHGGPSGGKSGQKRKALAQEAAHEVRRTLVDGGTSTNILSFSTYTALGWERRHLKRNPTPLVGFAGELVSA

Query:  EGCISLPVTVGEGDQHVTKVTE
        E CI L +T+G+GD  V ++ E
Subjt:  EGCISLPVTVGEGDQHVTKVTE

XP_022158844.1 uncharacterized protein LOC111025310 [Momordica charantia]1.1e-7746.24Show/hide
Query:  MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFESRLSASLNEMLARARQYIDGLELWKANGARRSSRGKGRDQKSPPPR---------------------
        MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEF SR  ASLNEM ARARQYIDGLELWKANGARRSSRG+ RD KSPP +                     
Subjt:  MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFESRLSASLNEMLARARQYIDGLELWKANGARRSSRGKGRDQKSPPPR---------------------

Query:  ----------------------SNAVMIGARLDGPMRIEAPQRESPFRPSRAE-----------------------------VRQSYLKKYVDSRERAEP
                              S A +     D  M +     E   RPS                                +R  YLKKYV SRE+AE 
Subjt:  ----------------------SNAVMIGARLDGPMRIEAPQRESPFRPSRAE-----------------------------VRQSYLKKYVDSRERAEP

Query:  KGSAREDKRERSPPPRRKEDRPAVINTIHGGPSGGKSGQKRKALAQEAAHE-----------------------------------------VRRTLVDG
        +GSARE+KRERS PPR KEDRPAVINTIHGGPSG KSGQKRKALA+E AHE                                         VRR  VDG
Subjt:  KGSAREDKRERSPPPRRKEDRPAVINTIHGGPSGGKSGQKRKALAQEAAHE-----------------------------------------VRRTLVDG

Query:  GTSTNILSFSTYTALGWERRHLKRNPTPLVGFAGELVSAEGCISLPVTVGEGDQHVTKVTELKYPTPTRVATVRGEQK--TSRECYTTAMKGTATCAAIM
        G S NI SFSTYTALGWERRHLK   T LVGFA E VS EGCISLPVT+ EG+  VT+V E      +    V    +   S+ C TT  +G        
Subjt:  GTSTNILSFSTYTALGWERRHLKRNPTPLVGFAGELVSAEGCISLPVTVGEGDQHVTKVTELKYPTPTRVATVRGEQK--TSRECYTTAMKGTATCAAIM

Query:  ELEATVPPCTNEAEPSRGTPAEKLELVPLMGPDKQRSCG
                    A+P     + + ELVPL+GPD+Q S G
Subjt:  ELEATVPPCTNEAEPSRGTPAEKLELVPLMGPDKQRSCG

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088133.0e-6538.82Show/hide
Query:  YDGSGDPISYVEVFEGKMDFLAASDAMQCRVFQIALEGSVMLWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPHSHLGTVKQRDNESLTEYITRFMDE
        YDGS DP  YVEVFE  MDF AASDA++CR F+IAL GS  LWYR+L   SI +Y QLRR F+  FS+R   K   +HL T++Q++ E+L EY+TRF +E
Subjt:  YDGSGDPISYVEVFEGKMDFLAASDAMQCRVFQIALEGSVMLWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPHSHLGTVKQRDNESLTEYITRFMDE

Query:  HVKVVSCTDDIAMMYFTTGLNDRNLTIEFESRLSASLNEMLARARQYIDGLELWKANGAR------RSSRGKGRDQKSPPPRSNAVMIGARLD------G
         +KV  C+DD AM YF TGL D  LT++      A+  E+L +A++ IDG EL +    R      R   GK  +   P  +        R +      G
Subjt:  HVKVVSCTDDIAMMYFTTGLNDRNLTIEFESRLSASLNEMLARARQYIDGLELWKANGAR------RSSRGKGRDQKSPPPRSNAVMIGARLD------G

Query:  PMR--------------------IE----------------APQRESPFRPSRAE--------------------VRQSYLKKYVDSRERAEPKGSARED
        P R                    IE                AP+R S  +  R                      ++  Y KK+V      +P+ S+ E 
Subjt:  PMR--------------------IE----------------APQRESPFRPSRAE--------------------VRQSYLKKYVDSRERAEPKGSARED

Query:  K--RERSPPPRRKEDRPAVINTIHGGPSGGKSGQKRKALAQEAAHE-----------------------------------------VRRTLVDGGTSTN
        K  R+RS  P R+ DRPAVINTI GGPSGG+SG+KRK LA+ A  E                                         V R LVDGGTS N
Subjt:  K--RERSPPPRRKEDRPAVINTIHGGPSGGKSGQKRKALAQEAAHE-----------------------------------------VRRTLVDGGTSTN

Query:  ILSFSTYTALGWERRHLKRNPTPLVGFAGELVSAEGCISLPVTVGEGDQHVTKVTE
        ILS  TY ALGW R  LK++PTPLVGF+GE V  EG I LPVT+G+    VT++ E
Subjt:  ILSFSTYTALGWERRHLKRNPTPLVGFAGELVSAEGCISLPVTVGEGDQHVTKVTE

A0A6J1D5T3 uncharacterized protein LOC1110175489.1e-8660.69Show/hide
Query:  MDFLAASDAMQCRVFQIALEGSVMLWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPHSHLGTVKQRDNESLTEYITRFMDEHVKVVSCTDDIAMMYFT
        MDFLAASDA++CR FQIALEGSV LWY+QLKPRSIDSYQQLRRLFINQFSARQLLKLP SHL TVKQRDNESLTEYI R MDEHVKVVSCTDDIAMMYFT
Subjt:  MDFLAASDAMQCRVFQIALEGSVMLWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPHSHLGTVKQRDNESLTEYITRFMDEHVKVVSCTDDIAMMYFT

Query:  TGLNDRNLTIEFESRLSASLNEMLARARQYIDGLELWKANGARRSSRGKGRDQKSPPP-------------------------RSNAVMIGARLD-----
        TGLNDRNLTIEF SR  ASLN+MLARARQYIDGLELWKA GARRSSRGK RDQ+S PP                         R ++   G + D     
Subjt:  TGLNDRNLTIEFESRLSASLNEMLARARQYIDGLELWKANGARRSSRGKGRDQKSPPP-------------------------RSNAVMIGARLD-----

Query:  ---------------------GPMRIEAP------------QRESPFRPSRA---------EVRQSYLKKYVDSRERAEPKGSAREDKRERSPPPRRKED
                              P ++  P             ++     SR           +R+ YLKKYV SRERA+P+GS RE+KRERS PP RKED
Subjt:  ---------------------GPMRIEAP------------QRESPFRPSRA---------EVRQSYLKKYVDSRERAEPKGSAREDKRERSPPPRRKED

Query:  RPAVINTIHGGPSGGKSG
        RPAVINTIHGGPSG KSG
Subjt:  RPAVINTIHGGPSGGKSG

A0A6J1DIZ8 uncharacterized protein LOC1110204758.0e-8260.83Show/hide
Query:  MSSYDGSGDPISYVEVFEGKMDFLAASDAMQCRVFQIALEGSVMLWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPHSHLGTVKQRDNESLTEYITRF
        MSSYDGSGDPISYVEVFEGKMDFLA SDAM+C  FQI LEGS  LWYRQLK RSIDSYQQLRRLFINQFS RQ LKLP SHLGTVKQRDNES T YI RF
Subjt:  MSSYDGSGDPISYVEVFEGKMDFLAASDAMQCRVFQIALEGSVMLWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPHSHLGTVKQRDNESLTEYITRF

Query:  MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFESRLSASLNEMLARARQYIDGLELWKANGARRSSRGKGRDQKSPPPRSNAVMIGARLDGPM--RIEAP
        MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEF S   A LNEM ARARQYIDGLELW A+GA   +  +       PPRS AVMI   L   M  RI+  
Subjt:  MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFESRLSASLNEMLARARQYIDGLELWKANGARRSSRGKGRDQKSPPPRSNAVMIGARLDGPM--RIEAP

Query:  QRESPF------------RPSRAE-----------------------------VRQSYLKKYVDSRERAEPKGSAREDKRERSPPPRRKEDRPAVINTIH
         R+  F            RPS                                +R+ YLKKYV + ERA P+ SA E+KRERS  P+R+EDR       H
Subjt:  QRESPF------------RPSRAE-----------------------------VRQSYLKKYVDSRERAEPKGSAREDKRERSPPPRRKEDRPAVINTIH

Query:  G-GPSGGKSGQKRK
          GPSGG+SGQK +
Subjt:  G-GPSGGKSGQKRK

A0A6J1DTD9 uncharacterized protein LOC1110241443.4e-6439.81Show/hide
Query:  MSSYDGSGDPISYVEVFEGKMDFLAASDAMQCRVFQIALEGSVMLWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPHSHLGTVKQRDNESLTEYITRF
        M  YDG  DP  YVEVFEG +DF AAS+A++CR FQIAL G+  LWYR+L  RSI +Y QL++ FI+QFS RQ  +   +HL T++Q++ E+L EY+TRF
Subjt:  MSSYDGSGDPISYVEVFEGKMDFLAASDAMQCRVFQIALEGSVMLWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPHSHLGTVKQRDNESLTEYITRF

Query:  MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFESRLSASLNEMLARARQYIDGLELWKANGARRSSRGKGRDQKSP-------------------PPRSN
          E +KV  C+DD  M YF TGL D  LT++      ++  E+L +A++ IDG EL K    R   R    DQK P                     RS+
Subjt:  MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFESRLSASLNEMLARARQYIDGLELWKANGARRSSRGKGRDQKSP-------------------PPRSN

Query:  AVMIGARLD--GPMRIEAP---------------------QRESPFRPSRAE------------------------------VRQSYLKKYVDSRERAEP
          M  +  +  GP     P                     +R    R  R +                              ++  Y KKYV      +P
Subjt:  AVMIGARLD--GPMRIEAP---------------------QRESPFRPSRAE------------------------------VRQSYLKKYVDSRERAEP

Query:  KGSAREDKRER--SPPPRRKEDRPAVINTIHGGPSGGKSGQKRKALAQEAAHEVRRTLVDGGTSTNILSFSTYTALGWERRHLKRNPTPLVGFAGELVSA
          S+ E K+ER  S  P R++DRPAVINTI G PSGG+SG KRK L       VR+  VD G S NILS +TY ALGW R  LK++ TPLVGFAGE V+ 
Subjt:  KGSAREDKRER--SPPPRRKEDRPAVINTIHGGPSGGKSGQKRKALAQEAAHEVRRTLVDGGTSTNILSFSTYTALGWERRHLKRNPTPLVGFAGELVSA

Query:  EGCISLPVTVGEGDQHVTKVTE
        E CI L +T+G+GD  V ++ E
Subjt:  EGCISLPVTVGEGDQHVTKVTE

A0A6J1E0L8 uncharacterized protein LOC1110253105.3e-7846.24Show/hide
Query:  MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFESRLSASLNEMLARARQYIDGLELWKANGARRSSRGKGRDQKSPPPR---------------------
        MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEF SR  ASLNEM ARARQYIDGLELWKANGARRSSRG+ RD KSPP +                     
Subjt:  MDEHVKVVSCTDDIAMMYFTTGLNDRNLTIEFESRLSASLNEMLARARQYIDGLELWKANGARRSSRGKGRDQKSPPPR---------------------

Query:  ----------------------SNAVMIGARLDGPMRIEAPQRESPFRPSRAE-----------------------------VRQSYLKKYVDSRERAEP
                              S A +     D  M +     E   RPS                                +R  YLKKYV SRE+AE 
Subjt:  ----------------------SNAVMIGARLDGPMRIEAPQRESPFRPSRAE-----------------------------VRQSYLKKYVDSRERAEP

Query:  KGSAREDKRERSPPPRRKEDRPAVINTIHGGPSGGKSGQKRKALAQEAAHE-----------------------------------------VRRTLVDG
        +GSARE+KRERS PPR KEDRPAVINTIHGGPSG KSGQKRKALA+E AHE                                         VRR  VDG
Subjt:  KGSAREDKRERSPPPRRKEDRPAVINTIHGGPSGGKSGQKRKALAQEAAHE-----------------------------------------VRRTLVDG

Query:  GTSTNILSFSTYTALGWERRHLKRNPTPLVGFAGELVSAEGCISLPVTVGEGDQHVTKVTELKYPTPTRVATVRGEQK--TSRECYTTAMKGTATCAAIM
        G S NI SFSTYTALGWERRHLK   T LVGFA E VS EGCISLPVT+ EG+  VT+V E      +    V    +   S+ C TT  +G        
Subjt:  GTSTNILSFSTYTALGWERRHLKRNPTPLVGFAGELVSAEGCISLPVTVGEGDQHVTKVTELKYPTPTRVATVRGEQK--TSRECYTTAMKGTATCAAIM

Query:  ELEATVPPCTNEAEPSRGTPAEKLELVPLMGPDKQRSCG
                    A+P     + + ELVPL+GPD+Q S G
Subjt:  ELEATVPPCTNEAEPSRGTPAEKLELVPLMGPDKQRSCG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCTCCTATGACGGGTCCGGGGATCCAATCTCGTATGTTGAGGTGTTCGAAGGGAAGATGGATTTTTTGGCTGCAAGCGACGCCATGCAGTGCCGAGTATTTCAAAT
AGCTCTAGAGGGCTCGGTGATGTTATGGTACCGACAGTTGAAGCCCCGGTCCATCGACAGTTATCAACAGCTGAGAAGGTTGTTCATCAATCAATTCTCAGCTCGGCAGT
TGTTGAAGTTGCCCCACTCGCACCTCGGAACAGTGAAGCAACGGGATAATGAATCCCTGACGGAGTACATCACTCGGTTCATGGACGAGCACGTCAAGGTGGTGAGTTGT
ACCGACGACATCGCCATGATGTACTTCACCACTGGGTTGAACGATAGAAATCTAACGATAGAGTTCGAAAGCCGTCTGTCGGCCTCGCTGAACGAGATGCTCGCCCGAGC
TCGCCAGTACATTGATGGCCTGGAGCTGTGGAAGGCCAACGGAGCCAGGCGGAGTAGCCGCGGTAAAGGTCGGGACCAGAAGTCCCCTCCTCCCAGAAGCAACGCGGTAA
TGATCGGAGCTCGTCTTGACGGGCCGATGAGAATAGAGGCGCCACAACGAGAGAGCCCCTTCAGACCGTCGAGGGCCGAAGTTCGACAGAGTTATTTGAAGAAGTACGTC
GACAGCAGAGAAAGAGCTGAGCCAAAAGGATCAGCTCGGGAGGATAAGCGAGAAAGGTCACCGCCGCCCAGACGAAAGGAAGATCGTCCTGCTGTTATAAACACCATTCA
TGGGGGTCCGAGCGGGGGAAAGTCAGGGCAGAAAAGAAAGGCTCTAGCTCAAGAAGCAGCGCATGAGGTCAGAAGAACTCTTGTCGACGGTGGAACGTCGACTAATATAT
TATCTTTCTCGACCTACACGGCTCTAGGGTGGGAAAGAAGACACTTGAAGCGCAACCCAACACCTTTGGTCGGCTTCGCAGGAGAGTTAGTTAGCGCGGAAGGATGTATC
TCGCTCCCTGTCACCGTCGGCGAAGGAGATCAGCACGTGACCAAAGTCACAGAGTTGAAGTATCCGACCCCGACTAGAGTCGCAACAGTTCGAGGAGAGCAAAAAACATC
GAGAGAATGCTACACAACTGCGATGAAAGGGACCGCCACTTGCGCGGCAATCATGGAGCTAGAAGCAACTGTGCCACCATGTACCAATGAGGCAGAGCCCAGCCGCGGCA
CCCCAGCAGAAAAGCTAGAACTTGTCCCCTTGATGGGACCAGATAAGCAGAGGAGTTGTGGTTGCTTTTCCAAGTCCCTTCAAGGATCGTTCATTGCGGTTTTGTCTTTT
CGACAAGGGGGTGACGTCCGCGACCAGTCGACACTTTACGTGGATGGTTCGTCCAATGAGAAGGGTTGTGGGGCAGGCATGCTGCTCCTCGGCCCAGGTGACCTTCGGTT
TGAGTACGCGCTCCGGTTCAGTTTCCGAGCTTCGAACAACGAAGCCTTGATAAATGGTCTGAAGGTCGCAAGGGGGATGTGCGTGAAGCGTCTCTTGATCCTCAGCGACT
CCCAACTGATCGTCAACCAGGTAACCGAGGAATACCAAGTGAAGGATACTCGTATGGAAAGGTATCTGGCCAAGACTCAAGGGCTCCTCGCCCAGTTTGAAGATTATGTG
ATTCGACAGGTGCCGAGGTCAGAACACTCCAACGCCGATGCATTGGCTCGTTTAGCCTTGGCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGCTCCTATGACGGGTCCGGGGATCCAATCTCGTATGTTGAGGTGTTCGAAGGGAAGATGGATTTTTTGGCTGCAAGCGACGCCATGCAGTGCCGAGTATTTCAAAT
AGCTCTAGAGGGCTCGGTGATGTTATGGTACCGACAGTTGAAGCCCCGGTCCATCGACAGTTATCAACAGCTGAGAAGGTTGTTCATCAATCAATTCTCAGCTCGGCAGT
TGTTGAAGTTGCCCCACTCGCACCTCGGAACAGTGAAGCAACGGGATAATGAATCCCTGACGGAGTACATCACTCGGTTCATGGACGAGCACGTCAAGGTGGTGAGTTGT
ACCGACGACATCGCCATGATGTACTTCACCACTGGGTTGAACGATAGAAATCTAACGATAGAGTTCGAAAGCCGTCTGTCGGCCTCGCTGAACGAGATGCTCGCCCGAGC
TCGCCAGTACATTGATGGCCTGGAGCTGTGGAAGGCCAACGGAGCCAGGCGGAGTAGCCGCGGTAAAGGTCGGGACCAGAAGTCCCCTCCTCCCAGAAGCAACGCGGTAA
TGATCGGAGCTCGTCTTGACGGGCCGATGAGAATAGAGGCGCCACAACGAGAGAGCCCCTTCAGACCGTCGAGGGCCGAAGTTCGACAGAGTTATTTGAAGAAGTACGTC
GACAGCAGAGAAAGAGCTGAGCCAAAAGGATCAGCTCGGGAGGATAAGCGAGAAAGGTCACCGCCGCCCAGACGAAAGGAAGATCGTCCTGCTGTTATAAACACCATTCA
TGGGGGTCCGAGCGGGGGAAAGTCAGGGCAGAAAAGAAAGGCTCTAGCTCAAGAAGCAGCGCATGAGGTCAGAAGAACTCTTGTCGACGGTGGAACGTCGACTAATATAT
TATCTTTCTCGACCTACACGGCTCTAGGGTGGGAAAGAAGACACTTGAAGCGCAACCCAACACCTTTGGTCGGCTTCGCAGGAGAGTTAGTTAGCGCGGAAGGATGTATC
TCGCTCCCTGTCACCGTCGGCGAAGGAGATCAGCACGTGACCAAAGTCACAGAGTTGAAGTATCCGACCCCGACTAGAGTCGCAACAGTTCGAGGAGAGCAAAAAACATC
GAGAGAATGCTACACAACTGCGATGAAAGGGACCGCCACTTGCGCGGCAATCATGGAGCTAGAAGCAACTGTGCCACCATGTACCAATGAGGCAGAGCCCAGCCGCGGCA
CCCCAGCAGAAAAGCTAGAACTTGTCCCCTTGATGGGACCAGATAAGCAGAGGAGTTGTGGTTGCTTTTCCAAGTCCCTTCAAGGATCGTTCATTGCGGTTTTGTCTTTT
CGACAAGGGGGTGACGTCCGCGACCAGTCGACACTTTACGTGGATGGTTCGTCCAATGAGAAGGGTTGTGGGGCAGGCATGCTGCTCCTCGGCCCAGGTGACCTTCGGTT
TGAGTACGCGCTCCGGTTCAGTTTCCGAGCTTCGAACAACGAAGCCTTGATAAATGGTCTGAAGGTCGCAAGGGGGATGTGCGTGAAGCGTCTCTTGATCCTCAGCGACT
CCCAACTGATCGTCAACCAGGTAACCGAGGAATACCAAGTGAAGGATACTCGTATGGAAAGGTATCTGGCCAAGACTCAAGGGCTCCTCGCCCAGTTTGAAGATTATGTG
ATTCGACAGGTGCCGAGGTCAGAACACTCCAACGCCGATGCATTGGCTCGTTTAGCCTTGGCCTAA
Protein sequenceShow/hide protein sequence
MSSYDGSGDPISYVEVFEGKMDFLAASDAMQCRVFQIALEGSVMLWYRQLKPRSIDSYQQLRRLFINQFSARQLLKLPHSHLGTVKQRDNESLTEYITRFMDEHVKVVSC
TDDIAMMYFTTGLNDRNLTIEFESRLSASLNEMLARARQYIDGLELWKANGARRSSRGKGRDQKSPPPRSNAVMIGARLDGPMRIEAPQRESPFRPSRAEVRQSYLKKYV
DSRERAEPKGSAREDKRERSPPPRRKEDRPAVINTIHGGPSGGKSGQKRKALAQEAAHEVRRTLVDGGTSTNILSFSTYTALGWERRHLKRNPTPLVGFAGELVSAEGCI
SLPVTVGEGDQHVTKVTELKYPTPTRVATVRGEQKTSRECYTTAMKGTATCAAIMELEATVPPCTNEAEPSRGTPAEKLELVPLMGPDKQRSCGCFSKSLQGSFIAVLSF
RQGGDVRDQSTLYVDGSSNEKGCGAGMLLLGPGDLRFEYALRFSFRASNNEALINGLKVARGMCVKRLLILSDSQLIVNQVTEEYQVKDTRMERYLAKTQGLLAQFEDYV
IRQVPRSEHSNADALARLALA