; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g42060 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g42060
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRibonuclease H
Genome locationchr6:33005131..33006687
RNA-Seq ExpressionMoc06g42060
SyntenyMoc06g42060
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144467.1 uncharacterized protein LOC111014147 [Momordica charantia]6.2e-13083.9Show/hide
Query:  MEIKDQRLLKWPERMKAPSAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLIRRGYLKEYVEEPKVTQNDESDKSPAREIRTIMGGPIERESGRKRKADV
        MEIKDQRLLKWPERMKAPSAKRSK RYCLFHR HGHATQDCFDLKEEVEGLIRRGYLKEYVEEPK TQN ESDKSPAREIRTIMGGPIERESGRKRKADV
Subjt:  MEIKDQRLLKWPERMKAPSAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLIRRGYLKEYVEEPKVTQNDESDKSPAREIRTIMGGPIERESGRKRKADV

Query:  GEARTSREQNEVYHAYTTNRPVTIEFSEDEATHLLHPHNDALVITLKIANVKVHRVLVDGDSSADIISWTAYKAMDLGEKVLKSSPAPLVGFGGERVIPE
         EART REQNE                                    IANVKVHRVLVDG S ADI+SWTAYKAMDLGEKVLKSSPAPLVGFGGERVIPE
Subjt:  GEARTSREQNEVYHAYTTNRPVTIEFSEDEATHLLHPHNDALVITLKIANVKVHRVLVDGDSSADIISWTAYKAMDLGEKVLKSSPAPLVGFGGERVIPE

Query:  GRIELPVTFGSGPKSVTKMVDFLVVNYTSSYNAILGRPTMHMLRAIPSTYHQSMKFPTPGGVGEIKGEQRVSRECYYTSMRDNDRTSTKGGC
        GRIE PVTFGSGPKSVTKMVD LVVNYTSSYNAILGRPTMHMLRAIPSTYHQSMKFPTPGGVGEIKGEQRVSRECYYTSMRDNDRTSTKGGC
Subjt:  GRIELPVTFGSGPKSVTKMVDFLVVNYTSSYNAILGRPTMHMLRAIPSTYHQSMKFPTPGGVGEIKGEQRVSRECYYTSMRDNDRTSTKGGC

XP_022150028.1 uncharacterized protein LOC111018300 [Momordica charantia]8.1e-13078.26Show/hide
Query:  VPIEQVLMEIKDQRLLKWPERMKAPSAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLIRRGYLKEYVEEPKVTQNDESD-KSPAREIRTIMGGPIERES
        VP+EQVLMEIK QRLL+WPERM AP +KRSKGRYCLFHRDH HATQDCFDLK+EV+ LI+RGYLKEYVE+PK TQN E+D KSPAREIRTIMGGP+ERE 
Subjt:  VPIEQVLMEIKDQRLLKWPERMKAPSAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLIRRGYLKEYVEEPKVTQNDESD-KSPAREIRTIMGGPIERES

Query:  GRKRKADVGEARTSREQNEVYHAYTTNRPVTIEFSEDEATHLLHPHNDALVITLKIANVKVHRVLVDGDSSADIISWTAYKAMDLGEKVLKSSPAPLVGF
        GRKRKA + E RTS+ Q+E+YH +   +P  IEFSEDEATHLLHPHND LVITLKIAN KVHR+LVDG SSADIIS TAYKAMDLGE+  KSSPA LV F
Subjt:  GRKRKADVGEARTSREQNEVYHAYTTNRPVTIEFSEDEATHLLHPHNDALVITLKIANVKVHRVLVDGDSSADIISWTAYKAMDLGEKVLKSSPAPLVGF

Query:  GGERVIPEGRIELPVTFGSGPKSVTKMVDFLVVNYTSSYNAILGRPTMHMLRAIPSTYHQSMKFPTPGGVGEIKGEQRVSRECYYTSMRDNDRTSTKGG
         GERVIPEGR EL VTFGSGPKS+T ++DFLV++Y SSYNAILGRPT+HML+AIPSTYHQS+ FPT GG+GEIK EQRVSRECYYTSM+ NDR ST GG
Subjt:  GGERVIPEGRIELPVTFGSGPKSVTKMVDFLVVNYTSSYNAILGRPTMHMLRAIPSTYHQSMKFPTPGGVGEIKGEQRVSRECYYTSMRDNDRTSTKGG

XP_022156748.1 uncharacterized protein LOC111023587 [Momordica charantia]1.6e-13080.26Show/hide
Query:  YTPTTVPIEQVLMEIKDQRLLKWPERMKAPSAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLIRRGYLKEYVEEPKVTQNDESDKSPAREIRTIMGGPI
        YT TT+P+EQVLMEIKDQRLLKWPERMKAPS KRSKGRYCLFHRDH HATQDCFDLKEEVEGLIRRGYLK                              
Subjt:  YTPTTVPIEQVLMEIKDQRLLKWPERMKAPSAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLIRRGYLKEYVEEPKVTQNDESDKSPAREIRTIMGGPI

Query:  ERESGRKRKADVGEARTSREQNEVYHAYTTNRPVTIEFSEDEATHLLHPHNDALVITLKIANVKVHRVLVDGDSSADIISWTAYKAMDLGEKVLKSSPAP
        E     K+KADV EAR +REQNEVYHAY T+RPVTIEFSEDEAT L H HNDALVITLKIANVKVHR+LVDG SSADIISWTAYKAMDL EKVLKSSPAP
Subjt:  ERESGRKRKADVGEARTSREQNEVYHAYTTNRPVTIEFSEDEATHLLHPHNDALVITLKIANVKVHRVLVDGDSSADIISWTAYKAMDLGEKVLKSSPAP

Query:  LVGFGGERVIPEGRIELPVTFGSGPKSVTKMVDFLVVNYTSSYNAILGRPTMHMLRAIPSTYHQSMKFPTPGGVGEIKGEQRVSRECYYTSMRDNDRTST
        LVGFGGERVI EGRIELPVTFGSGPK VTKMVDFLVVNYTSSYN ILGR TMHML+ IPSTYHQSMKFPTPGGV EIKGEQRVSRECYYTSMR NDRTST
Subjt:  LVGFGGERVIPEGRIELPVTFGSGPKSVTKMVDFLVVNYTSSYNAILGRPTMHMLRAIPSTYHQSMKFPTPGGVGEIKGEQRVSRECYYTSMRDNDRTST

Query:  KGGC
        KGGC
Subjt:  KGGC

XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia]2.8e-21587.7Show/hide
Query:  MREKVPPKFKLPTVKQFDGTTDPVDHLDAYREWMDIYGVSEAVRCRVFSTTLNGSARIWFRQLKRGSISSFKSLARAFVAQFVGGRCRSRPVAYLLTIKQ
        MREKVPPKFKLPTVKQFD TTDPVDHLDAYREWMDIYGVSEAVRCRVFSTTLNGSARIWFRQLKRGSISSFKSLARAFV QFVGGRCRSRPVAYLLTIKQ
Subjt:  MREKVPPKFKLPTVKQFDGTTDPVDHLDAYREWMDIYGVSEAVRCRVFSTTLNGSARIWFRQLKRGSISSFKSLARAFVAQFVGGRCRSRPVAYLLTIKQ

Query:  RTTESLRDYVAWFNEEKLQVEGLTDAVSLLAFMSGVRDEHL---------------------------------EPDGKRTDPKRERSGDKPQGSRWEKR
        RTTESLRDYVA FNEEKLQVEGLTDAVSLLAFMSGVRDEHL                                 EPDGKRTDPKRERSGDKPQGSRWEKR
Subjt:  RTTESLRDYVAWFNEEKLQVEGLTDAVSLLAFMSGVRDEHL---------------------------------EPDGKRTDPKRERSGDKPQGSRWEKR

Query:  DRSSQKDPPRKFEKYTPTTVPIEQVLMEIKDQRLLKWPERMKAPSAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLIRRGYLKEYVEEPKVTQNDESDK
        DRSSQKDPPRKFEKYTPTTVPIEQVLMEIKDQRLLKWPERMKA SAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLIRRGYLKEYVEEPK TQN ESDK
Subjt:  DRSSQKDPPRKFEKYTPTTVPIEQVLMEIKDQRLLKWPERMKAPSAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLIRRGYLKEYVEEPKVTQNDESDK

Query:  SPAREIRTIMGGPIERESGRKRKADVGEARTSREQNEVYHAYTTNRPVTIEFSEDEATHLLHPHNDALVITLKIANVKVHRVLVDGDSSADIISWTAYKA
        SPAREIRTIMGGPIERESGRKRKADV EARTSREQNEVYHAYTTNRPVTIEFSEDEATHLLHPHNDALVI LKIANVKVHRVLVDG SSADI+SWTAYKA
Subjt:  SPAREIRTIMGGPIERESGRKRKADVGEARTSREQNEVYHAYTTNRPVTIEFSEDEATHLLHPHNDALVITLKIANVKVHRVLVDGDSSADIISWTAYKA

Query:  MDLGEKVLKSSPAPLVGFGGERVIPEGRIELPVTFGSGPKSVTKMVD
        MDL EKVLK SPAPLVGFG ERVIPEGRIELPVTFG+     +K  D
Subjt:  MDLGEKVLKSSPAPLVGFGGERVIPEGRIELPVTFGSGPKSVTKMVD

XP_022159368.1 uncharacterized protein LOC111025785 [Momordica charantia]1.5e-14482.83Show/hide
Query:  LKRGSISSFKSLARAFVAQFVGGRCRSRPVAYLLTIKQRTTESLRDYVAWFNEEKLQVEGLTDAVSLLAFMSGVRDEHL---------------------
        +KRGSISSFKSLARAFV QFVGGRCRSRPVAYLLTIKQRTTESL DYVA FN+EKLQ+E LTD VSLLAFMSGVRDEHL                     
Subjt:  LKRGSISSFKSLARAFVAQFVGGRCRSRPVAYLLTIKQRTTESLRDYVAWFNEEKLQVEGLTDAVSLLAFMSGVRDEHL---------------------

Query:  ------------EPDGKRTDPKRERSGDKPQGSRWEKRDRSSQKDPPRKFEKYTPTTVPIEQVLMEIKDQRLLKWPERMKAPSAKRSKGRYCLFHRDHGH
                    EPDGKRTD KRERSGDKPQGSRWEKRDRSSQKDPPRKFEKYTPTTVP+EQVLMEIKDQRLLKWPERMK PS KRSKGRYCLFHRDH H
Subjt:  ------------EPDGKRTDPKRERSGDKPQGSRWEKRDRSSQKDPPRKFEKYTPTTVPIEQVLMEIKDQRLLKWPERMKAPSAKRSKGRYCLFHRDHGH

Query:  ATQDCFDLKEEVEGLIRRGYLKEYVEEPKVTQNDESDKSPAREIRTIMGGPIERESGRKRKADVGEARTSREQNEVYHAYTTNRPVTIEFSEDEATHLLH
        ATQD FDLKEEVEGLIRRGYL+EYVEEPK TQN ES+KSPAREIRTIMGGPIERES RKRKADV EAR SREQNEVYHAYTTNR VTIEFSEDEATHLLH
Subjt:  ATQDCFDLKEEVEGLIRRGYLKEYVEEPKVTQNDESDKSPAREIRTIMGGPIERESGRKRKADVGEARTSREQNEVYHAYTTNRPVTIEFSEDEATHLLH

Query:  PHNDALVITLKIANVKVHRVLVDGDSSADIIS
        PHNDALVITLKIANVKVHR+LVDG SSADIIS
Subjt:  PHNDALVITLKIANVKVHRVLVDGDSSADIIS

TrEMBL top hitse value%identityAlignment
A0A6J1CTS4 uncharacterized protein LOC1110141473.0e-13083.9Show/hide
Query:  MEIKDQRLLKWPERMKAPSAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLIRRGYLKEYVEEPKVTQNDESDKSPAREIRTIMGGPIERESGRKRKADV
        MEIKDQRLLKWPERMKAPSAKRSK RYCLFHR HGHATQDCFDLKEEVEGLIRRGYLKEYVEEPK TQN ESDKSPAREIRTIMGGPIERESGRKRKADV
Subjt:  MEIKDQRLLKWPERMKAPSAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLIRRGYLKEYVEEPKVTQNDESDKSPAREIRTIMGGPIERESGRKRKADV

Query:  GEARTSREQNEVYHAYTTNRPVTIEFSEDEATHLLHPHNDALVITLKIANVKVHRVLVDGDSSADIISWTAYKAMDLGEKVLKSSPAPLVGFGGERVIPE
         EART REQNE                                    IANVKVHRVLVDG S ADI+SWTAYKAMDLGEKVLKSSPAPLVGFGGERVIPE
Subjt:  GEARTSREQNEVYHAYTTNRPVTIEFSEDEATHLLHPHNDALVITLKIANVKVHRVLVDGDSSADIISWTAYKAMDLGEKVLKSSPAPLVGFGGERVIPE

Query:  GRIELPVTFGSGPKSVTKMVDFLVVNYTSSYNAILGRPTMHMLRAIPSTYHQSMKFPTPGGVGEIKGEQRVSRECYYTSMRDNDRTSTKGGC
        GRIE PVTFGSGPKSVTKMVD LVVNYTSSYNAILGRPTMHMLRAIPSTYHQSMKFPTPGGVGEIKGEQRVSRECYYTSMRDNDRTSTKGGC
Subjt:  GRIELPVTFGSGPKSVTKMVDFLVVNYTSSYNAILGRPTMHMLRAIPSTYHQSMKFPTPGGVGEIKGEQRVSRECYYTSMRDNDRTSTKGGC

A0A6J1D8C9 uncharacterized protein LOC1110183003.9e-13078.26Show/hide
Query:  VPIEQVLMEIKDQRLLKWPERMKAPSAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLIRRGYLKEYVEEPKVTQNDESD-KSPAREIRTIMGGPIERES
        VP+EQVLMEIK QRLL+WPERM AP +KRSKGRYCLFHRDH HATQDCFDLK+EV+ LI+RGYLKEYVE+PK TQN E+D KSPAREIRTIMGGP+ERE 
Subjt:  VPIEQVLMEIKDQRLLKWPERMKAPSAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLIRRGYLKEYVEEPKVTQNDESD-KSPAREIRTIMGGPIERES

Query:  GRKRKADVGEARTSREQNEVYHAYTTNRPVTIEFSEDEATHLLHPHNDALVITLKIANVKVHRVLVDGDSSADIISWTAYKAMDLGEKVLKSSPAPLVGF
        GRKRKA + E RTS+ Q+E+YH +   +P  IEFSEDEATHLLHPHND LVITLKIAN KVHR+LVDG SSADIIS TAYKAMDLGE+  KSSPA LV F
Subjt:  GRKRKADVGEARTSREQNEVYHAYTTNRPVTIEFSEDEATHLLHPHNDALVITLKIANVKVHRVLVDGDSSADIISWTAYKAMDLGEKVLKSSPAPLVGF

Query:  GGERVIPEGRIELPVTFGSGPKSVTKMVDFLVVNYTSSYNAILGRPTMHMLRAIPSTYHQSMKFPTPGGVGEIKGEQRVSRECYYTSMRDNDRTSTKGG
         GERVIPEGR EL VTFGSGPKS+T ++DFLV++Y SSYNAILGRPT+HML+AIPSTYHQS+ FPT GG+GEIK EQRVSRECYYTSM+ NDR ST GG
Subjt:  GGERVIPEGRIELPVTFGSGPKSVTKMVDFLVVNYTSSYNAILGRPTMHMLRAIPSTYHQSMKFPTPGGVGEIKGEQRVSRECYYTSMRDNDRTSTKGG

A0A6J1DRG9 uncharacterized protein LOC1110235877.9e-13180.26Show/hide
Query:  YTPTTVPIEQVLMEIKDQRLLKWPERMKAPSAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLIRRGYLKEYVEEPKVTQNDESDKSPAREIRTIMGGPI
        YT TT+P+EQVLMEIKDQRLLKWPERMKAPS KRSKGRYCLFHRDH HATQDCFDLKEEVEGLIRRGYLK                              
Subjt:  YTPTTVPIEQVLMEIKDQRLLKWPERMKAPSAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLIRRGYLKEYVEEPKVTQNDESDKSPAREIRTIMGGPI

Query:  ERESGRKRKADVGEARTSREQNEVYHAYTTNRPVTIEFSEDEATHLLHPHNDALVITLKIANVKVHRVLVDGDSSADIISWTAYKAMDLGEKVLKSSPAP
        E     K+KADV EAR +REQNEVYHAY T+RPVTIEFSEDEAT L H HNDALVITLKIANVKVHR+LVDG SSADIISWTAYKAMDL EKVLKSSPAP
Subjt:  ERESGRKRKADVGEARTSREQNEVYHAYTTNRPVTIEFSEDEATHLLHPHNDALVITLKIANVKVHRVLVDGDSSADIISWTAYKAMDLGEKVLKSSPAP

Query:  LVGFGGERVIPEGRIELPVTFGSGPKSVTKMVDFLVVNYTSSYNAILGRPTMHMLRAIPSTYHQSMKFPTPGGVGEIKGEQRVSRECYYTSMRDNDRTST
        LVGFGGERVI EGRIELPVTFGSGPK VTKMVDFLVVNYTSSYN ILGR TMHML+ IPSTYHQSMKFPTPGGV EIKGEQRVSRECYYTSMR NDRTST
Subjt:  LVGFGGERVIPEGRIELPVTFGSGPKSVTKMVDFLVVNYTSSYNAILGRPTMHMLRAIPSTYHQSMKFPTPGGVGEIKGEQRVSRECYYTSMRDNDRTST

Query:  KGGC
        KGGC
Subjt:  KGGC

A0A6J1DWY0 uncharacterized protein LOC1110252931.8e-21587.7Show/hide
Query:  MREKVPPKFKLPTVKQFDGTTDPVDHLDAYREWMDIYGVSEAVRCRVFSTTLNGSARIWFRQLKRGSISSFKSLARAFVAQFVGGRCRSRPVAYLLTIKQ
        MREKVPPKFKLPTVKQFD TTDPVDHLDAYREWMDIYGVSEAVRCRVFSTTLNGSARIWFRQLKRGSISSFKSLARAFV QFVGGRCRSRPVAYLLTIKQ
Subjt:  MREKVPPKFKLPTVKQFDGTTDPVDHLDAYREWMDIYGVSEAVRCRVFSTTLNGSARIWFRQLKRGSISSFKSLARAFVAQFVGGRCRSRPVAYLLTIKQ

Query:  RTTESLRDYVAWFNEEKLQVEGLTDAVSLLAFMSGVRDEHL---------------------------------EPDGKRTDPKRERSGDKPQGSRWEKR
        RTTESLRDYVA FNEEKLQVEGLTDAVSLLAFMSGVRDEHL                                 EPDGKRTDPKRERSGDKPQGSRWEKR
Subjt:  RTTESLRDYVAWFNEEKLQVEGLTDAVSLLAFMSGVRDEHL---------------------------------EPDGKRTDPKRERSGDKPQGSRWEKR

Query:  DRSSQKDPPRKFEKYTPTTVPIEQVLMEIKDQRLLKWPERMKAPSAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLIRRGYLKEYVEEPKVTQNDESDK
        DRSSQKDPPRKFEKYTPTTVPIEQVLMEIKDQRLLKWPERMKA SAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLIRRGYLKEYVEEPK TQN ESDK
Subjt:  DRSSQKDPPRKFEKYTPTTVPIEQVLMEIKDQRLLKWPERMKAPSAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLIRRGYLKEYVEEPKVTQNDESDK

Query:  SPAREIRTIMGGPIERESGRKRKADVGEARTSREQNEVYHAYTTNRPVTIEFSEDEATHLLHPHNDALVITLKIANVKVHRVLVDGDSSADIISWTAYKA
        SPAREIRTIMGGPIERESGRKRKADV EARTSREQNEVYHAYTTNRPVTIEFSEDEATHLLHPHNDALVI LKIANVKVHRVLVDG SSADI+SWTAYKA
Subjt:  SPAREIRTIMGGPIERESGRKRKADVGEARTSREQNEVYHAYTTNRPVTIEFSEDEATHLLHPHNDALVITLKIANVKVHRVLVDGDSSADIISWTAYKA

Query:  MDLGEKVLKSSPAPLVGFGGERVIPEGRIELPVTFGSGPKSVTKMVD
        MDL EKVLK SPAPLVGFG ERVIPEGRIELPVTFG+     +K  D
Subjt:  MDLGEKVLKSSPAPLVGFGGERVIPEGRIELPVTFGSGPKSVTKMVD

A0A6J1DYL6 uncharacterized protein LOC1110257857.3e-14582.83Show/hide
Query:  LKRGSISSFKSLARAFVAQFVGGRCRSRPVAYLLTIKQRTTESLRDYVAWFNEEKLQVEGLTDAVSLLAFMSGVRDEHL---------------------
        +KRGSISSFKSLARAFV QFVGGRCRSRPVAYLLTIKQRTTESL DYVA FN+EKLQ+E LTD VSLLAFMSGVRDEHL                     
Subjt:  LKRGSISSFKSLARAFVAQFVGGRCRSRPVAYLLTIKQRTTESLRDYVAWFNEEKLQVEGLTDAVSLLAFMSGVRDEHL---------------------

Query:  ------------EPDGKRTDPKRERSGDKPQGSRWEKRDRSSQKDPPRKFEKYTPTTVPIEQVLMEIKDQRLLKWPERMKAPSAKRSKGRYCLFHRDHGH
                    EPDGKRTD KRERSGDKPQGSRWEKRDRSSQKDPPRKFEKYTPTTVP+EQVLMEIKDQRLLKWPERMK PS KRSKGRYCLFHRDH H
Subjt:  ------------EPDGKRTDPKRERSGDKPQGSRWEKRDRSSQKDPPRKFEKYTPTTVPIEQVLMEIKDQRLLKWPERMKAPSAKRSKGRYCLFHRDHGH

Query:  ATQDCFDLKEEVEGLIRRGYLKEYVEEPKVTQNDESDKSPAREIRTIMGGPIERESGRKRKADVGEARTSREQNEVYHAYTTNRPVTIEFSEDEATHLLH
        ATQD FDLKEEVEGLIRRGYL+EYVEEPK TQN ES+KSPAREIRTIMGGPIERES RKRKADV EAR SREQNEVYHAYTTNR VTIEFSEDEATHLLH
Subjt:  ATQDCFDLKEEVEGLIRRGYLKEYVEEPKVTQNDESDKSPAREIRTIMGGPIERESGRKRKADVGEARTSREQNEVYHAYTTNRPVTIEFSEDEATHLLH

Query:  PHNDALVITLKIANVKVHRVLVDGDSSADIIS
        PHNDALVITLKIANVKVHR+LVDG SSADIIS
Subjt:  PHNDALVITLKIANVKVHRVLVDGDSSADIIS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGAAAAGGTCCCTCCAAAATTCAAGCTACCCACGGTGAAGCAATTCGACGGGACGACCGACCCAGTGGATCATCTAGATGCTTATCGGGAATGGATGGAT
ATCTATGGAGTGTCGGAAGCGGTCAGGTGTCGGGTATTCTCGACTACTCTCAACGGGTCGGCCAGAATATGGTTCCGACAATTAAAACGAGGGTCGATCTCGAGT
TTCAAGAGCTTGGCTAGAGCATTCGTGGCTCAGTTTGTAGGGGGGCGGTGTCGGAGCCGACCCGTGGCTTATCTCTTAACCATTAAGCAGAGGACGACAGAGAGT
CTACGCGACTATGTAGCCTGGTTCAACGAGGAGAAGCTGCAGGTAGAAGGCCTTACAGACGCTGTATCTCTGCTGGCCTTCATGTCCGGCGTCAGGGATGAACAT
TTGGAACCTGACGGAAAGCGAACCGACCCAAAGAGGGAGAGGTCGGGAGATAAACCGCAGGGGTCGAGATGGGAGAAGAGGGATCGGAGTAGCCAGAAAGATCCA
CCCCGAAAATTTGAAAAGTATACCCCGACCACCGTTCCAATCGAGCAAGTGCTGATGGAGATCAAAGACCAAAGGTTGCTTAAATGGCCGGAGAGGATGAAGGCC
CCGTCAGCTAAGCGCAGCAAAGGCAGGTATTGTCTTTTCCACAGGGATCACGGCCATGCAACTCAGGATTGTTTTGATCTCAAGGAAGAGGTGGAAGGACTAATC
CGAAGGGGCTACCTCAAGGAGTATGTAGAGGAACCTAAAGTGACCCAGAACGACGAAAGCGACAAGTCTCCCGCAAGAGAGATTCGAACTATAATGGGGGGCCCC
ATAGAAAGAGAATCCGGGAGAAAAAGGAAAGCAGATGTAGGTGAAGCTAGGACGAGTCGCGAGCAAAATGAGGTCTACCACGCGTATACTACAAACCGGCCGGTG
ACAATTGAATTTTCGGAGGACGAAGCAACTCACCTCCTTCACCCTCATAACGATGCACTTGTTATCACGTTGAAGATAGCAAATGTTAAAGTACATCGAGTCTTG
GTGGATGGGGACAGCTCGGCGGATATCATCTCCTGGACAGCCTATAAAGCTATGGATCTAGGAGAAAAGGTGCTCAAGAGTAGTCCAGCACCACTGGTAGGATTT
GGGGGAGAACGGGTCATCCCCGAAGGGAGGATTGAGTTGCCAGTAACATTCGGAAGTGGGCCGAAGAGTGTCACCAAGATGGTGGACTTTTTGGTCGTTAACTAC
ACATCCTCCTACAACGCAATACTGGGAAGACCGACGATGCATATGCTCAGAGCGATACCATCCACATATCACCAATCCATGAAGTTCCCAACACCGGGTGGGGTA
GGAGAAATTAAAGGAGAACAACGAGTGTCACGGGAATGCTACTATACCTCAATGAGGGACAACGACAGAACTTCTACTAAGGGGGGCTGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGAGAAAAGGTCCCTCCAAAATTCAAGCTACCCACGGTGAAGCAATTCGACGGGACGACCGACCCAGTGGATCATCTAGATGCTTATCGGGAATGGATGGAT
ATCTATGGAGTGTCGGAAGCGGTCAGGTGTCGGGTATTCTCGACTACTCTCAACGGGTCGGCCAGAATATGGTTCCGACAATTAAAACGAGGGTCGATCTCGAGT
TTCAAGAGCTTGGCTAGAGCATTCGTGGCTCAGTTTGTAGGGGGGCGGTGTCGGAGCCGACCCGTGGCTTATCTCTTAACCATTAAGCAGAGGACGACAGAGAGT
CTACGCGACTATGTAGCCTGGTTCAACGAGGAGAAGCTGCAGGTAGAAGGCCTTACAGACGCTGTATCTCTGCTGGCCTTCATGTCCGGCGTCAGGGATGAACAT
TTGGAACCTGACGGAAAGCGAACCGACCCAAAGAGGGAGAGGTCGGGAGATAAACCGCAGGGGTCGAGATGGGAGAAGAGGGATCGGAGTAGCCAGAAAGATCCA
CCCCGAAAATTTGAAAAGTATACCCCGACCACCGTTCCAATCGAGCAAGTGCTGATGGAGATCAAAGACCAAAGGTTGCTTAAATGGCCGGAGAGGATGAAGGCC
CCGTCAGCTAAGCGCAGCAAAGGCAGGTATTGTCTTTTCCACAGGGATCACGGCCATGCAACTCAGGATTGTTTTGATCTCAAGGAAGAGGTGGAAGGACTAATC
CGAAGGGGCTACCTCAAGGAGTATGTAGAGGAACCTAAAGTGACCCAGAACGACGAAAGCGACAAGTCTCCCGCAAGAGAGATTCGAACTATAATGGGGGGCCCC
ATAGAAAGAGAATCCGGGAGAAAAAGGAAAGCAGATGTAGGTGAAGCTAGGACGAGTCGCGAGCAAAATGAGGTCTACCACGCGTATACTACAAACCGGCCGGTG
ACAATTGAATTTTCGGAGGACGAAGCAACTCACCTCCTTCACCCTCATAACGATGCACTTGTTATCACGTTGAAGATAGCAAATGTTAAAGTACATCGAGTCTTG
GTGGATGGGGACAGCTCGGCGGATATCATCTCCTGGACAGCCTATAAAGCTATGGATCTAGGAGAAAAGGTGCTCAAGAGTAGTCCAGCACCACTGGTAGGATTT
GGGGGAGAACGGGTCATCCCCGAAGGGAGGATTGAGTTGCCAGTAACATTCGGAAGTGGGCCGAAGAGTGTCACCAAGATGGTGGACTTTTTGGTCGTTAACTAC
ACATCCTCCTACAACGCAATACTGGGAAGACCGACGATGCATATGCTCAGAGCGATACCATCCACATATCACCAATCCATGAAGTTCCCAACACCGGGTGGGGTA
GGAGAAATTAAAGGAGAACAACGAGTGTCACGGGAATGCTACTATACCTCAATGAGGGACAACGACAGAACTTCTACTAAGGGGGGCTGTTGA
Protein sequenceShow/hide protein sequence
MREKVPPKFKLPTVKQFDGTTDPVDHLDAYREWMDIYGVSEAVRCRVFSTTLNGSARIWFRQLKRGSISSFKSLARAFVAQFVGGRCRSRPVAYLLTIKQRTTES
LRDYVAWFNEEKLQVEGLTDAVSLLAFMSGVRDEHLEPDGKRTDPKRERSGDKPQGSRWEKRDRSSQKDPPRKFEKYTPTTVPIEQVLMEIKDQRLLKWPERMKA
PSAKRSKGRYCLFHRDHGHATQDCFDLKEEVEGLIRRGYLKEYVEEPKVTQNDESDKSPAREIRTIMGGPIERESGRKRKADVGEARTSREQNEVYHAYTTNRPV
TIEFSEDEATHLLHPHNDALVITLKIANVKVHRVLVDGDSSADIISWTAYKAMDLGEKVLKSSPAPLVGFGGERVIPEGRIELPVTFGSGPKSVTKMVDFLVVNY
TSSYNAILGRPTMHMLRAIPSTYHQSMKFPTPGGVGEIKGEQRVSRECYYTSMRDNDRTSTKGGC