; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g20170 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g20170
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRibonuclease H
Genome locationchr6:15803381..15808340
RNA-Seq ExpressionMoc06g20170
SyntenyMoc06g20170
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142767.1 uncharacterized protein LOC111012805 [Momordica charantia]7.5e-9272.87Show/hide
Query:  RDEHLSFSFEKKTPSTFSEALSRAQ------------------------------------------SQKDPPRKFEKYTPTTVPLEQVLMEIKDQRLLK
        RDEHLSFSF K+TPSTFSEALSRAQ                                           QKDPP+KFEKYT TTVPLEQVLMEIK+QRLLK
Subjt:  RDEHLSFSFEKKTPSTFSEALSRAQ------------------------------------------SQKDPPRKFEKYTPTTVPLEQVLMEIKDQRLLK

Query:  WPERMKAPSTKRSKGQYCLFHRDHVHATQDCFDLKEEVEGLIQSGYLKEYLEDSKATQNGESDKSPAREIRTIIGGPIERESGRKRKAGLREAMASREQN
        WPERM A STKRSKG+YCLFH DH HATQDCFDLKEEVEGLI  GYLKEY+E+ KATQNGESDKSPAREIRTI+GGPIERESGRKRK  +REA ASREQN
Subjt:  WPERMKAPSTKRSKGQYCLFHRDHVHATQDCFDLKEEVEGLIQSGYLKEYLEDSKATQNGESDKSPAREIRTIIGGPIERESGRKRKAGLREAMASREQN

Query:  EVYHAYITDRSVTIEFSEDEATHLLHPHNNALVIALKIANVKVHRILVDGGSSADIIS
        EVYH Y T+R VTIEFSEDEATHLLHPHN+ALVI LKIANV+VHRILVDGGSSADIIS
Subjt:  EVYHAYITDRSVTIEFSEDEATHLLHPHNNALVIALKIANVKVHRILVDGGSSADIIS

XP_022158621.1 uncharacterized protein LOC111025072 [Momordica charantia]3.2e-7485.71Show/hide
Query:  MEIKDQRLLKWPERMKAPSTKRSKGQYCLFHRDHVHATQDCFDLKEEVEGLIQSGYLKEYLEDSKATQNGESDKSPAREIRTIIGGPIERESGRKRKAGL
        MEIKDQRLLKWPERMKAPSTKRSKG+YCLFHRDH HATQDCFDL EEVEGLI+ GYL+EY+E+ KATQNGESDKSPAREIRTI+GGPIERESGRKRKA +
Subjt:  MEIKDQRLLKWPERMKAPSTKRSKGQYCLFHRDHVHATQDCFDLKEEVEGLIQSGYLKEYLEDSKATQNGESDKSPAREIRTIIGGPIERESGRKRKAGL

Query:  REAMASREQNEVYHAYITDRSVTIEFSEDEATHLLHPHNNALVIALKIANVKVHRILVDGGSSADIIS
        +EA A+REQNEVYHAY T+R VTI+FSEDEATHLLHPHN+AL I LKIANVKVHRILVDGG+ ADIIS
Subjt:  REAMASREQNEVYHAYITDRSVTIEFSEDEATHLLHPHNNALVIALKIANVKVHRILVDGGSSADIIS

XP_022158669.1 uncharacterized protein LOC111025131 [Momordica charantia]5.6e-87100Show/hide
Query:  MEIKDQRLLKWPERMKAPSTKRSKGQYCLFHRDHVHATQDCFDLKEEVEGLIQSGYLKEYLEDSKATQNGESDKSPAREIRTIIGGPIERESGRKRKAGL
        MEIKDQRLLKWPERMKAPSTKRSKGQYCLFHRDHVHATQDCFDLKEEVEGLIQSGYLKEYLEDSKATQNGESDKSPAREIRTIIGGPIERESGRKRKAGL
Subjt:  MEIKDQRLLKWPERMKAPSTKRSKGQYCLFHRDHVHATQDCFDLKEEVEGLIQSGYLKEYLEDSKATQNGESDKSPAREIRTIIGGPIERESGRKRKAGL

Query:  REAMASREQNEVYHAYITDRSVTIEFSEDEATHLLHPHNNALVIALKIANVKVHRILVDGGSSADIIS
        REAMASREQNEVYHAYITDRSVTIEFSEDEATHLLHPHNNALVIALKIANVKVHRILVDGGSSADIIS
Subjt:  REAMASREQNEVYHAYITDRSVTIEFSEDEATHLLHPHNNALVIALKIANVKVHRILVDGGSSADIIS

XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia]2.2e-13952.56Show/hide
Query:  MREKVLPKFKLPTVKQFDGTTDPVDHLDAYREWMDIYGVSEAVRCWVFSTTLNGSARIWFRQLKRGSISSFKSLARAFVTQFIGGRCWSRPVAYLLTIKQ
        MREKV PKFKLPTVKQFD TTDPVDHLDAYREWMDIYGVSEAVRC VFSTTLNGSARIWFRQLKRGSISSFKSLARAFVTQF+GGRC SRPVAYLLTIKQ
Subjt:  MREKVLPKFKLPTVKQFDGTTDPVDHLDAYREWMDIYGVSEAVRCWVFSTTLNGSARIWFRQLKRGSISSFKSLARAFVTQFIGGRCWSRPVAYLLTIKQ

Query:  RTAESLHDYVARFNEEKLQEWVDVEKVRAPKSELERPYACWSDTKKEWVTLIWVRLADGTTKLAELHRSRPFEVVGGRQNEENVIEKGCSTLSKLEKSST
        RT ESL DYVARFNEEKLQ                                                                                 
Subjt:  RTAESLHDYVARFNEEKLQEWVDVEKVRAPKSELERPYACWSDTKKEWVTLIWVRLADGTTKLAELHRSRPFEVVGGRQNEENVIEKGCSTLSKLEKSST

Query:  AMFAVCYLGAAGCATADARGGRAATDREKETLEYASECPTRRRCLVENVGRGSSLLQNAVTVGISVADEGREAPSYGCCFTVGDAPVLSRAAMDRIRERP
                                                                                                            
Subjt:  AMFAVCYLGAAGCATADARGGRAATDREKETLEYASECPTRRRCLVENVGRGSSLLQNAVTVGISVADEGREAPSYGCCFTVGDAPVLSRAAMDRIRERP

Query:  TAGVVATVSRNAAAVGIERRRSSEECRRRDEHLSFSFEKKTPSTFSEALSRAQ------------------------------------------SQKDP
          G+   VS  A   G+           RDEHLSFSF K+TP+TFSEALSRAQ                                          SQKDP
Subjt:  TAGVVATVSRNAAAVGIERRRSSEECRRRDEHLSFSFEKKTPSTFSEALSRAQ------------------------------------------SQKDP

Query:  PRKFEKYTPTTVPLEQVLMEIKDQRLLKWPERMKAPSTKRSKGQYCLFHRDHVHATQDCFDLKEEVEGLIQSGYLKEYLEDSKATQNGESDKSPAREIRT
        PRKFEKYTPTTVP+EQVLMEIKDQRLLKWPERMKA S KRSKG+YCLFHRDH HATQDCFDLKEEVEGLI+ GYLKEY+E+ KATQNGESDKSPAREIRT
Subjt:  PRKFEKYTPTTVPLEQVLMEIKDQRLLKWPERMKAPSTKRSKGQYCLFHRDHVHATQDCFDLKEEVEGLIQSGYLKEYLEDSKATQNGESDKSPAREIRT

Query:  IIGGPIERESGRKRKAGLREAMASREQNEVYHAYITDRSVTIEFSEDEATHLLHPHNNALVIALKIANVKVHRILVDGGSSADIIS
        I+GGPIERESGRKRKA +REA  SREQNEVYHAY T+R VTIEFSEDEATHLLHPHN+ALVI LKIANVKVHR+LVDGGSSADI+S
Subjt:  IIGGPIERESGRKRKAGLREAMASREQNEVYHAYITDRSVTIEFSEDEATHLLHPHNNALVIALKIANVKVHRILVDGGSSADIIS

XP_022159368.1 uncharacterized protein LOC111025785 [Momordica charantia]1.6e-10251.45Show/hide
Query:  LKRGSISSFKSLARAFVTQFIGGRCWSRPVAYLLTIKQRTAESLHDYVARFNEEKLQ--EWVDVEKVRAPKSELERPYACWSDTKKEWVTLIWVRLADGT
        +KRGSISSFKSLARAFVTQF+GGRC SRPVAYLLTIKQRT ESLHDYVARFN+EKLQ     DV  + A  S +   +  +S  KK   T          
Subjt:  LKRGSISSFKSLARAFVTQFIGGRCWSRPVAYLLTIKQRTAESLHDYVARFNEEKLQ--EWVDVEKVRAPKSELERPYACWSDTKKEWVTLIWVRLADGT

Query:  TKLAELHRSRPFEVVGGRQNEENVIEKGCSTLSKLEKSSTAMFAVCYLGAAGCATADARGGRAATDREKETLEYASECPTRRRCLVENVGRGSSLLQNAV
                                      TLS+ ++         Y+ A                       Y+   P  +R                 
Subjt:  TKLAELHRSRPFEVVGGRQNEENVIEKGCSTLSKLEKSSTAMFAVCYLGAAGCATADARGGRAATDREKETLEYASECPTRRRCLVENVGRGSSLLQNAV

Query:  TVGISVADEGREAPSYGCCFTVGDAPVLSRAAMDRIRERPTAGVVATVSRNAAAVGIERRRSSEECRRRDEHLSFSFEKKTPSTFSEALSRAQSQKDPPR
               D+ RE          GD P  SR                                              +EK+  S          SQKDPPR
Subjt:  TVGISVADEGREAPSYGCCFTVGDAPVLSRAAMDRIRERPTAGVVATVSRNAAAVGIERRRSSEECRRRDEHLSFSFEKKTPSTFSEALSRAQSQKDPPR

Query:  KFEKYTPTTVPLEQVLMEIKDQRLLKWPERMKAPSTKRSKGQYCLFHRDHVHATQDCFDLKEEVEGLIQSGYLKEYLEDSKATQNGESDKSPAREIRTII
        KFEKYTPTTVPLEQVLMEIKDQRLLKWPERMK PSTKRSKG+YCLFHRDH HATQD FDLKEEVEGLI+ GYL+EY+E+ KATQNGES+KSPAREIRTI+
Subjt:  KFEKYTPTTVPLEQVLMEIKDQRLLKWPERMKAPSTKRSKGQYCLFHRDHVHATQDCFDLKEEVEGLIQSGYLKEYLEDSKATQNGESDKSPAREIRTII

Query:  GGPIERESGRKRKAGLREAMASREQNEVYHAYITDRSVTIEFSEDEATHLLHPHNNALVIALKIANVKVHRILVDGGSSADIIS
        GGPIERES RKRKA +REA  SREQNEVYHAY T+RSVTIEFSEDEATHLLHPHN+ALVI LKIANVKVHRILVDGGSSADIIS
Subjt:  GGPIERESGRKRKAGLREAMASREQNEVYHAYITDRSVTIEFSEDEATHLLHPHNNALVIALKIANVKVHRILVDGGSSADIIS

TrEMBL top hitse value%identityAlignment
A0A6J1CNT2 uncharacterized protein LOC1110128053.6e-9272.87Show/hide
Query:  RDEHLSFSFEKKTPSTFSEALSRAQ------------------------------------------SQKDPPRKFEKYTPTTVPLEQVLMEIKDQRLLK
        RDEHLSFSF K+TPSTFSEALSRAQ                                           QKDPP+KFEKYT TTVPLEQVLMEIK+QRLLK
Subjt:  RDEHLSFSFEKKTPSTFSEALSRAQ------------------------------------------SQKDPPRKFEKYTPTTVPLEQVLMEIKDQRLLK

Query:  WPERMKAPSTKRSKGQYCLFHRDHVHATQDCFDLKEEVEGLIQSGYLKEYLEDSKATQNGESDKSPAREIRTIIGGPIERESGRKRKAGLREAMASREQN
        WPERM A STKRSKG+YCLFH DH HATQDCFDLKEEVEGLI  GYLKEY+E+ KATQNGESDKSPAREIRTI+GGPIERESGRKRK  +REA ASREQN
Subjt:  WPERMKAPSTKRSKGQYCLFHRDHVHATQDCFDLKEEVEGLIQSGYLKEYLEDSKATQNGESDKSPAREIRTIIGGPIERESGRKRKAGLREAMASREQN

Query:  EVYHAYITDRSVTIEFSEDEATHLLHPHNNALVIALKIANVKVHRILVDGGSSADIIS
        EVYH Y T+R VTIEFSEDEATHLLHPHN+ALVI LKIANV+VHRILVDGGSSADIIS
Subjt:  EVYHAYITDRSVTIEFSEDEATHLLHPHNNALVIALKIANVKVHRILVDGGSSADIIS

A0A6J1DWY0 uncharacterized protein LOC1110252931.4e-13952.56Show/hide
Query:  MREKVLPKFKLPTVKQFDGTTDPVDHLDAYREWMDIYGVSEAVRCWVFSTTLNGSARIWFRQLKRGSISSFKSLARAFVTQFIGGRCWSRPVAYLLTIKQ
        MREKV PKFKLPTVKQFD TTDPVDHLDAYREWMDIYGVSEAVRC VFSTTLNGSARIWFRQLKRGSISSFKSLARAFVTQF+GGRC SRPVAYLLTIKQ
Subjt:  MREKVLPKFKLPTVKQFDGTTDPVDHLDAYREWMDIYGVSEAVRCWVFSTTLNGSARIWFRQLKRGSISSFKSLARAFVTQFIGGRCWSRPVAYLLTIKQ

Query:  RTAESLHDYVARFNEEKLQEWVDVEKVRAPKSELERPYACWSDTKKEWVTLIWVRLADGTTKLAELHRSRPFEVVGGRQNEENVIEKGCSTLSKLEKSST
        RT ESL DYVARFNEEKLQ                                                                                 
Subjt:  RTAESLHDYVARFNEEKLQEWVDVEKVRAPKSELERPYACWSDTKKEWVTLIWVRLADGTTKLAELHRSRPFEVVGGRQNEENVIEKGCSTLSKLEKSST

Query:  AMFAVCYLGAAGCATADARGGRAATDREKETLEYASECPTRRRCLVENVGRGSSLLQNAVTVGISVADEGREAPSYGCCFTVGDAPVLSRAAMDRIRERP
                                                                                                            
Subjt:  AMFAVCYLGAAGCATADARGGRAATDREKETLEYASECPTRRRCLVENVGRGSSLLQNAVTVGISVADEGREAPSYGCCFTVGDAPVLSRAAMDRIRERP

Query:  TAGVVATVSRNAAAVGIERRRSSEECRRRDEHLSFSFEKKTPSTFSEALSRAQ------------------------------------------SQKDP
          G+   VS  A   G+           RDEHLSFSF K+TP+TFSEALSRAQ                                          SQKDP
Subjt:  TAGVVATVSRNAAAVGIERRRSSEECRRRDEHLSFSFEKKTPSTFSEALSRAQ------------------------------------------SQKDP

Query:  PRKFEKYTPTTVPLEQVLMEIKDQRLLKWPERMKAPSTKRSKGQYCLFHRDHVHATQDCFDLKEEVEGLIQSGYLKEYLEDSKATQNGESDKSPAREIRT
        PRKFEKYTPTTVP+EQVLMEIKDQRLLKWPERMKA S KRSKG+YCLFHRDH HATQDCFDLKEEVEGLI+ GYLKEY+E+ KATQNGESDKSPAREIRT
Subjt:  PRKFEKYTPTTVPLEQVLMEIKDQRLLKWPERMKAPSTKRSKGQYCLFHRDHVHATQDCFDLKEEVEGLIQSGYLKEYLEDSKATQNGESDKSPAREIRT

Query:  IIGGPIERESGRKRKAGLREAMASREQNEVYHAYITDRSVTIEFSEDEATHLLHPHNNALVIALKIANVKVHRILVDGGSSADIIS
        I+GGPIERESGRKRKA +REA  SREQNEVYHAY T+R VTIEFSEDEATHLLHPHN+ALVI LKIANVKVHR+LVDGGSSADI+S
Subjt:  IIGGPIERESGRKRKAGLREAMASREQNEVYHAYITDRSVTIEFSEDEATHLLHPHNNALVIALKIANVKVHRILVDGGSSADIIS

A0A6J1DYL6 uncharacterized protein LOC1110257857.8e-10351.45Show/hide
Query:  LKRGSISSFKSLARAFVTQFIGGRCWSRPVAYLLTIKQRTAESLHDYVARFNEEKLQ--EWVDVEKVRAPKSELERPYACWSDTKKEWVTLIWVRLADGT
        +KRGSISSFKSLARAFVTQF+GGRC SRPVAYLLTIKQRT ESLHDYVARFN+EKLQ     DV  + A  S +   +  +S  KK   T          
Subjt:  LKRGSISSFKSLARAFVTQFIGGRCWSRPVAYLLTIKQRTAESLHDYVARFNEEKLQ--EWVDVEKVRAPKSELERPYACWSDTKKEWVTLIWVRLADGT

Query:  TKLAELHRSRPFEVVGGRQNEENVIEKGCSTLSKLEKSSTAMFAVCYLGAAGCATADARGGRAATDREKETLEYASECPTRRRCLVENVGRGSSLLQNAV
                                      TLS+ ++         Y+ A                       Y+   P  +R                 
Subjt:  TKLAELHRSRPFEVVGGRQNEENVIEKGCSTLSKLEKSSTAMFAVCYLGAAGCATADARGGRAATDREKETLEYASECPTRRRCLVENVGRGSSLLQNAV

Query:  TVGISVADEGREAPSYGCCFTVGDAPVLSRAAMDRIRERPTAGVVATVSRNAAAVGIERRRSSEECRRRDEHLSFSFEKKTPSTFSEALSRAQSQKDPPR
               D+ RE          GD P  SR                                              +EK+  S          SQKDPPR
Subjt:  TVGISVADEGREAPSYGCCFTVGDAPVLSRAAMDRIRERPTAGVVATVSRNAAAVGIERRRSSEECRRRDEHLSFSFEKKTPSTFSEALSRAQSQKDPPR

Query:  KFEKYTPTTVPLEQVLMEIKDQRLLKWPERMKAPSTKRSKGQYCLFHRDHVHATQDCFDLKEEVEGLIQSGYLKEYLEDSKATQNGESDKSPAREIRTII
        KFEKYTPTTVPLEQVLMEIKDQRLLKWPERMK PSTKRSKG+YCLFHRDH HATQD FDLKEEVEGLI+ GYL+EY+E+ KATQNGES+KSPAREIRTI+
Subjt:  KFEKYTPTTVPLEQVLMEIKDQRLLKWPERMKAPSTKRSKGQYCLFHRDHVHATQDCFDLKEEVEGLIQSGYLKEYLEDSKATQNGESDKSPAREIRTII

Query:  GGPIERESGRKRKAGLREAMASREQNEVYHAYITDRSVTIEFSEDEATHLLHPHNNALVIALKIANVKVHRILVDGGSSADIIS
        GGPIERES RKRKA +REA  SREQNEVYHAY T+RSVTIEFSEDEATHLLHPHN+ALVI LKIANVKVHRILVDGGSSADIIS
Subjt:  GGPIERESGRKRKAGLREAMASREQNEVYHAYITDRSVTIEFSEDEATHLLHPHNNALVIALKIANVKVHRILVDGGSSADIIS

A0A6J1E032 uncharacterized protein LOC1110251312.7e-87100Show/hide
Query:  MEIKDQRLLKWPERMKAPSTKRSKGQYCLFHRDHVHATQDCFDLKEEVEGLIQSGYLKEYLEDSKATQNGESDKSPAREIRTIIGGPIERESGRKRKAGL
        MEIKDQRLLKWPERMKAPSTKRSKGQYCLFHRDHVHATQDCFDLKEEVEGLIQSGYLKEYLEDSKATQNGESDKSPAREIRTIIGGPIERESGRKRKAGL
Subjt:  MEIKDQRLLKWPERMKAPSTKRSKGQYCLFHRDHVHATQDCFDLKEEVEGLIQSGYLKEYLEDSKATQNGESDKSPAREIRTIIGGPIERESGRKRKAGL

Query:  REAMASREQNEVYHAYITDRSVTIEFSEDEATHLLHPHNNALVIALKIANVKVHRILVDGGSSADIIS
        REAMASREQNEVYHAYITDRSVTIEFSEDEATHLLHPHNNALVIALKIANVKVHRILVDGGSSADIIS
Subjt:  REAMASREQNEVYHAYITDRSVTIEFSEDEATHLLHPHNNALVIALKIANVKVHRILVDGGSSADIIS

A0A6J1E1G3 uncharacterized protein LOC1110250721.5e-7485.71Show/hide
Query:  MEIKDQRLLKWPERMKAPSTKRSKGQYCLFHRDHVHATQDCFDLKEEVEGLIQSGYLKEYLEDSKATQNGESDKSPAREIRTIIGGPIERESGRKRKAGL
        MEIKDQRLLKWPERMKAPSTKRSKG+YCLFHRDH HATQDCFDL EEVEGLI+ GYL+EY+E+ KATQNGESDKSPAREIRTI+GGPIERESGRKRKA +
Subjt:  MEIKDQRLLKWPERMKAPSTKRSKGQYCLFHRDHVHATQDCFDLKEEVEGLIQSGYLKEYLEDSKATQNGESDKSPAREIRTIIGGPIERESGRKRKAGL

Query:  REAMASREQNEVYHAYITDRSVTIEFSEDEATHLLHPHNNALVIALKIANVKVHRILVDGGSSADIIS
        +EA A+REQNEVYHAY T+R VTI+FSEDEATHLLHPHN+AL I LKIANVKVHRILVDGG+ ADIIS
Subjt:  REAMASREQNEVYHAYITDRSVTIEFSEDEATHLLHPHNNALVIALKIANVKVHRILVDGGSSADIIS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGAGAAGGTCCTTCCAAAATTCAAGCTACCCACGGTGAAACAATTCGACGGGACGACCGACCCAGTGGACCATCTAGATGCTTATCGGGAATGGATGGATATCTA
CGGAGTGTCGGAGGCGGTCAGGTGCTGGGTATTCTCGACTACATTAAACGGGTCAGCTAGAATATGGTTCCGACAATTAAAACGAGGGTCAATCTCGAGTTTCAAGAGCT
TGGCCAGAGCATTCGTGACCCAGTTTATAGGGGGACGATGTTGGAGTCGACCTGTGGCTTATCTTTTAACTATTAAACAAAGGACGGCAGAGAGTTTACACGACTATGTA
GCTCGGTTCAACGAGGAGAAGCTCCAGGAGTGGGTAGATGTCGAGAAGGTCCGGGCTCCCAAATCCGAGCTCGAACGGCCTTATGCTTGCTGGTCTGATACCAAAAAGGA
ATGGGTAACGCTGATTTGGGTCCGATTAGCGGATGGGACCACGAAACTTGCCGAGCTTCACCGATCTAGACCGTTCGAGGTCGTTGGGGGTCGTCAGAATGAAGAAAACG
TCATAGAAAAGGGCTGTTCGACGTTGTCGAAGCTGGAGAAGTCGTCAACTGCCATGTTCGCCGTGTGCTACCTTGGTGCAGCAGGTTGCGCCACTGCAGACGCTAGAGGA
GGTCGTGCTGCTACTGATCGGGAGAAAGAAACGTTGGAATATGCTTCTGAATGCCCAACCCGCCGTCGCTGCCTCGTGGAGAATGTGGGTCGCGGTTCCTCGCTACTCCA
GAACGCCGTCACCGTTGGGATATCTGTTGCCGACGAGGGTAGAGAAGCGCCGTCGTATGGTTGCTGCTTCACCGTAGGAGACGCGCCGGTGCTCAGTCGCGCTGCCATGG
ATCGGATTCGTGAGCGACCTACTGCAGGCGTCGTCGCCACCGTGTCGAGGAATGCGGCCGCCGTGGGAATTGAACGCCGTCGCTCGTCGGAGGAGTGCCGTCGCCGGGAT
GAGCATTTGTCATTTTCGTTCGAAAAGAAAACACCGAGTACCTTCTCGGAGGCACTGAGCCGAGCTCAGAGCCAGAAAGATCCACCCCGAAAATTTGAAAAGTATACCCC
GACCACCGTTCCACTCGAGCAAGTGCTAATGGAGATCAAAGACCAAAGGTTGCTTAAGTGGCCGGAAAGGATGAAGGCCCCGTCAACTAAACGAAGTAAAGGCCAATATT
GCCTTTTCCACCGGGATCACGTCCATGCAACTCAGGATTGTTTTGATCTCAAGGAAGAGGTGGAAGGACTAATCCAAAGCGGCTACCTCAAAGAGTATTTAGAGGACTCT
AAAGCGACACAAAACGGTGAAAGCGACAAGTCTCCAGCTCGAGAGATTCGAACTATAATAGGAGGCCCCATAGAAAGAGAATCTGGGAGAAAAAGAAAAGCAGGTCTACG
AGAAGCAATGGCGAGTCGCGAACAAAATGAAGTCTACCACGCGTATATTACAGACCGGTCAGTGACGATCGAGTTTTCAGAGGACGAGGCGACTCACCTCCTCCACCCTC
ATAACAATGCACTGGTTATCGCTTTGAAGATAGCAAATGTGAAAGTACATCGAATTTTGGTGGATGGGGGCAGCTCGGCGGATATCATCTCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGAGAGAAGGTCCTTCCAAAATTCAAGCTACCCACGGTGAAACAATTCGACGGGACGACCGACCCAGTGGACCATCTAGATGCTTATCGGGAATGGATGGATATCTA
CGGAGTGTCGGAGGCGGTCAGGTGCTGGGTATTCTCGACTACATTAAACGGGTCAGCTAGAATATGGTTCCGACAATTAAAACGAGGGTCAATCTCGAGTTTCAAGAGCT
TGGCCAGAGCATTCGTGACCCAGTTTATAGGGGGACGATGTTGGAGTCGACCTGTGGCTTATCTTTTAACTATTAAACAAAGGACGGCAGAGAGTTTACACGACTATGTA
GCTCGGTTCAACGAGGAGAAGCTCCAGGAGTGGGTAGATGTCGAGAAGGTCCGGGCTCCCAAATCCGAGCTCGAACGGCCTTATGCTTGCTGGTCTGATACCAAAAAGGA
ATGGGTAACGCTGATTTGGGTCCGATTAGCGGATGGGACCACGAAACTTGCCGAGCTTCACCGATCTAGACCGTTCGAGGTCGTTGGGGGTCGTCAGAATGAAGAAAACG
TCATAGAAAAGGGCTGTTCGACGTTGTCGAAGCTGGAGAAGTCGTCAACTGCCATGTTCGCCGTGTGCTACCTTGGTGCAGCAGGTTGCGCCACTGCAGACGCTAGAGGA
GGTCGTGCTGCTACTGATCGGGAGAAAGAAACGTTGGAATATGCTTCTGAATGCCCAACCCGCCGTCGCTGCCTCGTGGAGAATGTGGGTCGCGGTTCCTCGCTACTCCA
GAACGCCGTCACCGTTGGGATATCTGTTGCCGACGAGGGTAGAGAAGCGCCGTCGTATGGTTGCTGCTTCACCGTAGGAGACGCGCCGGTGCTCAGTCGCGCTGCCATGG
ATCGGATTCGTGAGCGACCTACTGCAGGCGTCGTCGCCACCGTGTCGAGGAATGCGGCCGCCGTGGGAATTGAACGCCGTCGCTCGTCGGAGGAGTGCCGTCGCCGGGAT
GAGCATTTGTCATTTTCGTTCGAAAAGAAAACACCGAGTACCTTCTCGGAGGCACTGAGCCGAGCTCAGAGCCAGAAAGATCCACCCCGAAAATTTGAAAAGTATACCCC
GACCACCGTTCCACTCGAGCAAGTGCTAATGGAGATCAAAGACCAAAGGTTGCTTAAGTGGCCGGAAAGGATGAAGGCCCCGTCAACTAAACGAAGTAAAGGCCAATATT
GCCTTTTCCACCGGGATCACGTCCATGCAACTCAGGATTGTTTTGATCTCAAGGAAGAGGTGGAAGGACTAATCCAAAGCGGCTACCTCAAAGAGTATTTAGAGGACTCT
AAAGCGACACAAAACGGTGAAAGCGACAAGTCTCCAGCTCGAGAGATTCGAACTATAATAGGAGGCCCCATAGAAAGAGAATCTGGGAGAAAAAGAAAAGCAGGTCTACG
AGAAGCAATGGCGAGTCGCGAACAAAATGAAGTCTACCACGCGTATATTACAGACCGGTCAGTGACGATCGAGTTTTCAGAGGACGAGGCGACTCACCTCCTCCACCCTC
ATAACAATGCACTGGTTATCGCTTTGAAGATAGCAAATGTGAAAGTACATCGAATTTTGGTGGATGGGGGCAGCTCGGCGGATATCATCTCCTAG
Protein sequenceShow/hide protein sequence
MREKVLPKFKLPTVKQFDGTTDPVDHLDAYREWMDIYGVSEAVRCWVFSTTLNGSARIWFRQLKRGSISSFKSLARAFVTQFIGGRCWSRPVAYLLTIKQRTAESLHDYV
ARFNEEKLQEWVDVEKVRAPKSELERPYACWSDTKKEWVTLIWVRLADGTTKLAELHRSRPFEVVGGRQNEENVIEKGCSTLSKLEKSSTAMFAVCYLGAAGCATADARG
GRAATDREKETLEYASECPTRRRCLVENVGRGSSLLQNAVTVGISVADEGREAPSYGCCFTVGDAPVLSRAAMDRIRERPTAGVVATVSRNAAAVGIERRRSSEECRRRD
EHLSFSFEKKTPSTFSEALSRAQSQKDPPRKFEKYTPTTVPLEQVLMEIKDQRLLKWPERMKAPSTKRSKGQYCLFHRDHVHATQDCFDLKEEVEGLIQSGYLKEYLEDS
KATQNGESDKSPAREIRTIIGGPIERESGRKRKAGLREAMASREQNEVYHAYITDRSVTIEFSEDEATHLLHPHNNALVIALKIANVKVHRILVDGGSSADIIS