CuGenDBv2

Moc03g15890 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g15890
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Ribonuclease H
Genome location: chr3:10572015..10573091
RNA-Seq Expression: Moc03g15890
Synteny: Moc03g15890
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR005162 - Retrotransposon gag domain


Homology
GenBank top hits (e value, % identity, alignment)
XP_022151719.1 uncharacterized protein LOC111019634 [Momordica charantia] (e value: 9.2e-123; identity: 70.19%)
Query:  MRHRLRTMEEMYAEATRA--NRTASPSRVP-GAPGEKRAPSIQPGDREPIPNVEGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSK
        MR ++ TME+MY+E  +A   R+ S +RV      E+R   + P  +      E  +Y+ +  DLR+HL  K+  + R+ + SPS S    NSN +A+S 
Subjt:  MRHRLRTMEEMYAEATRA--NRTASPSRVP-GAPGEKRAPSIQPGDREPIPNVEGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSK

Query:  YKPLMPEAVITREEFDLMKHKLDEQVETLKAKCGKKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCR
        Y P+ PE VITREEFD +K K D QVE LKAKC KKE SFDDGDLGESPFT DILEA IP KFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCR
Subjt:  YKPLMPEAVITREEFDLMKHKLDEQVETLKAKCGKKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCR

Query:  AFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYDRNTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLG
         FQIALTGSARLWY+RLPARSISTYSQLRKEFI QFSSRHYDR TATHL TIRQKEGETLREYVTRFQEEQLKVAHCSD SAMCYFLT LADETLTVKL 
Subjt:  AFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYDRNTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLG

Query:  EEAPATFAEVLQKAKKVIDGQDLLRTKTGRPEKQIDQKKLSQERRKADSKSRDMGSFIF
        EEAPATF EVLQKAKK+IDGQ+LLRTKT RPEK+IDQ + S+++ K DSK+RD G   F
Subjt:  EEAPATFAEVLQKAKKVIDGQDLLRTKTGRPEKQIDQKKLSQERRKADSKSRDMGSFIF

XP_022155128.1 uncharacterized protein LOC111022267 [Momordica charantia] (e value: 6.0e-122; identity: 77.81%)
Query:  EGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLMPEAVITREEFDLMKHKLDEQVETLKAKCGKKECSFDDGDLGESPFTPD
        E  +Y+ +  DLR+HL  K+  + R+ + SPS S    NSN +A+S Y P+ P+ VITREEFD +K K D QVE LKA C KKE SFDDGDLGE PFT D
Subjt:  EGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLMPEAVITREEFDLMKHKLDEQVETLKAKCGKKECSFDDGDLGESPFTPD

Query:  ILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYDRNTATHLATIR
        ILEAPI PKFKTPTMKPYDGSK+PKDYV+VFEGLM+FQAATDAIKCRAFQIA TGSARLWY+RLPARSISTYSQLRKEFISQFSSR+YDR TATHLATIR
Subjt:  ILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYDRNTATHLATIR

Query:  QKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQDLLRTKTGRPEKQIDQKKLSQERRKADSKSRD
        QK+GETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAD+TLTVKLGEEAPATFAEVLQKAKKVIDGQ+LLRTKTGRPEK+IDQ+++ +++ KA SKSRD
Subjt:  QKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQDLLRTKTGRPEKQIDQKKLSQERRKADSKSRD

Query:  MG
         G
Subjt:  MG

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia] (e value: 1.1e-131; identity: 78.59%)
Query:  VPGAPGEKRAPSIQPGDREPIPNVEGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLMPEAVITREEFDLMKHKLDEQVETL
        +PGAPGEK APSIQPG+REPIPN EGVDYSLRDNDLRKHLT+KKK+AS EPEDS SYSREFSNSNLKAQSKYKPL+PEAVI REEFDLMKH+ DEQVE L
Subjt:  VPGAPGEKRAPSIQPGDREPIPNVEGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLMPEAVITREEFDLMKHKLDEQVETL

Query:  KAKCGKKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLR
        KA+C KKE  FDD DLGESPFT DI+EAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGSARLW +RLPARSISTYSQLR
Subjt:  KAKCGKKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLR

Query:  KEFISQFSSRHYDRNTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQDLLRTKTG
        KEFI QFS RHYDR TATHLATIRQKE                                   DETLTVKLGEEAPATFAEVLQ AKKVIDGQ+LLRTKT 
Subjt:  KEFISQFSSRHYDRNTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQDLLRTKTG

Query:  RPEKQIDQKKLSQERRKADSKSRDMGS
        RPEKQIDQK+LSQ++RK DSKS+D GS
Subjt:  RPEKQIDQKKLSQERRKADSKSRDMGS

XP_022156088.1 uncharacterized protein LOC111023060 [Momordica charantia] (e value: 5.6e-120; identity: 69.01%)
Query:  MRHRLRTMEEMYAEATRANRTASPSRVPGAPGEK--RAPSIQPGDREPIPNVEGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKY
        MR ++RTMEEMY      N+    +      G++       + GD    P    VD      DLR HL  K+  + R    S  + +   NSN +A+S Y
Subjt:  MRHRLRTMEEMYAEATRANRTASPSRVPGAPGEK--RAPSIQPGDREPIPNVEGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKY

Query:  KPLMPEAVITREEFDLMKHKLDEQVETLKAKCGKKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRA
         P+ PE VITREEF+ +K K D QVE LK +C KKE +FDDGDLGESPFT DILEA IPPKFKTPTMK YDGSKDPKDYVEVFEGLMDFQAATDAIKCRA
Subjt:  KPLMPEAVITREEFDLMKHKLDEQVETLKAKCGKKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRA

Query:  FQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYDRNTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGE
        FQIALTGSARLWY+RLPARSISTYSQLRKEFISQF SRHYDR T THLATIRQKEG+TL+EY+TRFQEEQLKV HCSDDS+MCYFLTGLADET TVKLGE
Subjt:  FQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYDRNTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGE

Query:  EAPATFAEVLQKAKKVIDGQDLLRTKTGRPEKQIDQKKLSQERRKADSKSRDMGS
        EA ATFAEVLQ  KK IDGQ+LLRTKT RPEKQIDQKK SQ++RKADSKS+D GS
Subjt:  EAPATFAEVLQKAKKVIDGQDLLRTKTGRPEKQIDQKKLSQERRKADSKSRDMGS

XP_022159250.1 uncharacterized protein LOC111025663 [Momordica charantia] (e value: 4.7e-119; identity: 92.89%)
Query:  MKHKLDEQVETLKAKCGKKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYQRL
        MKHK DEQVE LKA+C KKECSFDDGDLGESPFT DILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALT SARLWY+RL
Subjt:  MKHKLDEQVETLKAKCGKKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYQRL

Query:  PARSISTYSQLRKEFISQFSSRHYDRNTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKV
        PARSISTYSQLRKEFISQFSSRHYDR TATHLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKLGEEAPATFAEVLQKAKKV
Subjt:  PARSISTYSQLRKEFISQFSSRHYDRNTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKV

Query:  IDGQDLLRTKTGRPEKQIDQKKLSQERRKADSKSRDMGS
        IDGQ+LL+TKT RPEKQIDQKKL+QE+RKADSKS+D GS
Subjt:  IDGQDLLRTKTGRPEKQIDQKKLSQERRKADSKSRDMGS

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DDW5 uncharacterized protein LOC111019634 (e value: 4.5e-123; identity: 70.19%)
Query:  MRHRLRTMEEMYAEATRA--NRTASPSRVP-GAPGEKRAPSIQPGDREPIPNVEGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSK
        MR ++ TME+MY+E  +A   R+ S +RV      E+R   + P  +      E  +Y+ +  DLR+HL  K+  + R+ + SPS S    NSN +A+S 
Subjt:  MRHRLRTMEEMYAEATRA--NRTASPSRVP-GAPGEKRAPSIQPGDREPIPNVEGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSK

Query:  YKPLMPEAVITREEFDLMKHKLDEQVETLKAKCGKKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCR
        Y P+ PE VITREEFD +K K D QVE LKAKC KKE SFDDGDLGESPFT DILEA IP KFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCR
Subjt:  YKPLMPEAVITREEFDLMKHKLDEQVETLKAKCGKKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCR

Query:  AFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYDRNTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLG
         FQIALTGSARLWY+RLPARSISTYSQLRKEFI QFSSRHYDR TATHL TIRQKEGETLREYVTRFQEEQLKVAHCSD SAMCYFLT LADETLTVKL 
Subjt:  AFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYDRNTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLG

Query:  EEAPATFAEVLQKAKKVIDGQDLLRTKTGRPEKQIDQKKLSQERRKADSKSRDMGSFIF
        EEAPATF EVLQKAKK+IDGQ+LLRTKT RPEK+IDQ + S+++ K DSK+RD G   F
Subjt:  EEAPATFAEVLQKAKKVIDGQDLLRTKTGRPEKQIDQKKLSQERRKADSKSRDMGSFIF

A0A6J1DM55 uncharacterized protein LOC111022267 (e value: 2.9e-122; identity: 77.81%)
Query:  EGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLMPEAVITREEFDLMKHKLDEQVETLKAKCGKKECSFDDGDLGESPFTPD
        E  +Y+ +  DLR+HL  K+  + R+ + SPS S    NSN +A+S Y P+ P+ VITREEFD +K K D QVE LKA C KKE SFDDGDLGE PFT D
Subjt:  EGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLMPEAVITREEFDLMKHKLDEQVETLKAKCGKKECSFDDGDLGESPFTPD

Query:  ILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYDRNTATHLATIR
        ILEAPI PKFKTPTMKPYDGSK+PKDYV+VFEGLM+FQAATDAIKCRAFQIA TGSARLWY+RLPARSISTYSQLRKEFISQFSSR+YDR TATHLATIR
Subjt:  ILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYDRNTATHLATIR

Query:  QKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQDLLRTKTGRPEKQIDQKKLSQERRKADSKSRD
        QK+GETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAD+TLTVKLGEEAPATFAEVLQKAKKVIDGQ+LLRTKTGRPEK+IDQ+++ +++ KA SKSRD
Subjt:  QKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQDLLRTKTGRPEKQIDQKKLSQERRKADSKSRD

Query:  MG
         G
Subjt:  MG

A0A6J1DPC9 uncharacterized protein LOC111022280 (e value: 5.3e-132; identity: 78.59%)
Query:  VPGAPGEKRAPSIQPGDREPIPNVEGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLMPEAVITREEFDLMKHKLDEQVETL
        +PGAPGEK APSIQPG+REPIPN EGVDYSLRDNDLRKHLT+KKK+AS EPEDS SYSREFSNSNLKAQSKYKPL+PEAVI REEFDLMKH+ DEQVE L
Subjt:  VPGAPGEKRAPSIQPGDREPIPNVEGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLMPEAVITREEFDLMKHKLDEQVETL

Query:  KAKCGKKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLR
        KA+C KKE  FDD DLGESPFT DI+EAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGSARLW +RLPARSISTYSQLR
Subjt:  KAKCGKKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLR

Query:  KEFISQFSSRHYDRNTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQDLLRTKTG
        KEFI QFS RHYDR TATHLATIRQKE                                   DETLTVKLGEEAPATFAEVLQ AKKVIDGQ+LLRTKT 
Subjt:  KEFISQFSSRHYDRNTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQDLLRTKTG

Query:  RPEKQIDQKKLSQERRKADSKSRDMGS
        RPEKQIDQK+LSQ++RK DSKS+D GS
Subjt:  RPEKQIDQKKLSQERRKADSKSRDMGS

A0A6J1DPN4 uncharacterized protein LOC111023060 (e value: 2.7e-120; identity: 69.01%)
Query:  MRHRLRTMEEMYAEATRANRTASPSRVPGAPGEK--RAPSIQPGDREPIPNVEGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKY
        MR ++RTMEEMY      N+    +      G++       + GD    P    VD      DLR HL  K+  + R    S  + +   NSN +A+S Y
Subjt:  MRHRLRTMEEMYAEATRANRTASPSRVPGAPGEK--RAPSIQPGDREPIPNVEGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKY

Query:  KPLMPEAVITREEFDLMKHKLDEQVETLKAKCGKKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRA
         P+ PE VITREEF+ +K K D QVE LK +C KKE +FDDGDLGESPFT DILEA IPPKFKTPTMK YDGSKDPKDYVEVFEGLMDFQAATDAIKCRA
Subjt:  KPLMPEAVITREEFDLMKHKLDEQVETLKAKCGKKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRA

Query:  FQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYDRNTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGE
        FQIALTGSARLWY+RLPARSISTYSQLRKEFISQF SRHYDR T THLATIRQKEG+TL+EY+TRFQEEQLKV HCSDDS+MCYFLTGLADET TVKLGE
Subjt:  FQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYDRNTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGE

Query:  EAPATFAEVLQKAKKVIDGQDLLRTKTGRPEKQIDQKKLSQERRKADSKSRDMGS
        EA ATFAEVLQ  KK IDGQ+LLRTKT RPEKQIDQKK SQ++RKADSKS+D GS
Subjt:  EAPATFAEVLQKAKKVIDGQDLLRTKTGRPEKQIDQKKLSQERRKADSKSRDMGS

A0A6J1DY58 uncharacterized protein LOC111025663 (e value: 2.3e-119; identity: 92.89%)
Query:  MKHKLDEQVETLKAKCGKKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYQRL
        MKHK DEQVE LKA+C KKECSFDDGDLGESPFT DILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALT SARLWY+RL
Subjt:  MKHKLDEQVETLKAKCGKKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYQRL

Query:  PARSISTYSQLRKEFISQFSSRHYDRNTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKV
        PARSISTYSQLRKEFISQFSSRHYDR TATHLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LTVKLGEEAPATFAEVLQKAKKV
Subjt:  PARSISTYSQLRKEFISQFSSRHYDRNTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKV

Query:  IDGQDLLRTKTGRPEKQIDQKKLSQERRKADSKSRDMGS
        IDGQ+LL+TKT RPEKQIDQKKL+QE+RKADSKS+D GS
Subjt:  IDGQDLLRTKTGRPEKQIDQKKLSQERRKADSKSRDMGS

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGCGCCATCGGTTGCGCACAATGGAAGAAATGTACGCCGAGGCAACGCGTGCTAACCGAACTGCGTCTCCCTCTAGGGTCCCGGGCGCACCTGGTGAAAAGAGAGCCCC
ATCCATCCAACCTGGCGACCGCGAGCCCATTCCCAACGTTGAAGGAGTGGATTATAGCTTGCGGGACAACGATCTGAGAAAGCATCTCACTGAAAAGAAGAAGAGAGCAT
CTCGGGAGCCAGAAGACTCTCCTTCCTACTCCCGAGAGTTCTCGAACTCGAACCTAAAGGCTCAGTCAAAATACAAGCCTCTGATGCCAGAAGCTGTGATAACTAGAGAA
GAGTTCGACCTGATGAAGCACAAGCTCGATGAGCAGGTCGAGACGCTTAAGGCCAAGTGCGGGAAGAAAGAATGTTCGTTCGACGATGGTGACTTGGGAGAATCACCATT
CACCCCGGACATCCTGGAGGCTCCAATCCCTCCGAAGTTCAAAACTCCCACGATGAAGCCTTATGATGGGTCTAAGGACCCTAAAGACTATGTTGAGGTCTTCGAAGGCC
TCATGGACTTTCAAGCGGCAACAGATGCAATCAAGTGCCGCGCCTTCCAGATCGCGCTCACCGGTAGCGCACGCCTGTGGTATCAAAGACTGCCGGCTAGGTCGATCTCG
ACCTACTCTCAACTGAGAAAGGAGTTCATCAGTCAGTTCTCTTCTCGGCATTATGACAGAAATACAGCGACTCATCTCGCTACTATCAGGCAGAAGGAGGGAGAGACGCT
GAGGGAGTATGTCACGAGGTTCCAGGAGGAGCAGTTGAAGGTCGCGCACTGCTCCGATGATTCGGCCATGTGCTACTTCCTCACCGGCCTGGCCGATGAGACCTTAACAG
TAAAACTTGGAGAGGAGGCTCCAGCCACCTTCGCCGAAGTCCTGCAGAAGGCAAAGAAGGTCATTGATGGGCAAGATCTCCTCCGAACCAAAACTGGCCGACCTGAGAAG
CAGATCGACCAGAAGAAGTTGAGCCAAGAGAGGAGGAAGGCTGATTCCAAGTCTAGAGATATGGGATCCTTCATCTTCCGCCAGTAG
mRNA sequence
ATGCGCCATCGGTTGCGCACAATGGAAGAAATGTACGCCGAGGCAACGCGTGCTAACCGAACTGCGTCTCCCTCTAGGGTCCCGGGCGCACCTGGTGAAAAGAGAGCCCC
ATCCATCCAACCTGGCGACCGCGAGCCCATTCCCAACGTTGAAGGAGTGGATTATAGCTTGCGGGACAACGATCTGAGAAAGCATCTCACTGAAAAGAAGAAGAGAGCAT
CTCGGGAGCCAGAAGACTCTCCTTCCTACTCCCGAGAGTTCTCGAACTCGAACCTAAAGGCTCAGTCAAAATACAAGCCTCTGATGCCAGAAGCTGTGATAACTAGAGAA
GAGTTCGACCTGATGAAGCACAAGCTCGATGAGCAGGTCGAGACGCTTAAGGCCAAGTGCGGGAAGAAAGAATGTTCGTTCGACGATGGTGACTTGGGAGAATCACCATT
CACCCCGGACATCCTGGAGGCTCCAATCCCTCCGAAGTTCAAAACTCCCACGATGAAGCCTTATGATGGGTCTAAGGACCCTAAAGACTATGTTGAGGTCTTCGAAGGCC
TCATGGACTTTCAAGCGGCAACAGATGCAATCAAGTGCCGCGCCTTCCAGATCGCGCTCACCGGTAGCGCACGCCTGTGGTATCAAAGACTGCCGGCTAGGTCGATCTCG
ACCTACTCTCAACTGAGAAAGGAGTTCATCAGTCAGTTCTCTTCTCGGCATTATGACAGAAATACAGCGACTCATCTCGCTACTATCAGGCAGAAGGAGGGAGAGACGCT
GAGGGAGTATGTCACGAGGTTCCAGGAGGAGCAGTTGAAGGTCGCGCACTGCTCCGATGATTCGGCCATGTGCTACTTCCTCACCGGCCTGGCCGATGAGACCTTAACAG
TAAAACTTGGAGAGGAGGCTCCAGCCACCTTCGCCGAAGTCCTGCAGAAGGCAAAGAAGGTCATTGATGGGCAAGATCTCCTCCGAACCAAAACTGGCCGACCTGAGAAG
CAGATCGACCAGAAGAAGTTGAGCCAAGAGAGGAGGAAGGCTGATTCCAAGTCTAGAGATATGGGATCCTTCATCTTCCGCCAGTAG
Protein sequence
MRHRLRTMEEMYAEATRANRTASPSRVPGAPGEKRAPSIQPGDREPIPNVEGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLMPEAVITRE
EFDLMKHKLDEQVETLKAKCGKKECSFDDGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYQRLPARSIS
TYSQLRKEFISQFSSRHYDRNTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPATFAEVLQKAKKVIDGQDLLRTKTGRPEK
QIDQKKLSQERRKADSKSRDMGSFIFRQ