; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g17640 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g17640
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRibonuclease H
Genome locationchr8:13377055..13379136
RNA-Seq ExpressionMoc08g17640
SyntenyMoc08g17640
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151719.1 uncharacterized protein LOC111019634 [Momordica charantia]1.1e-9058.52Show/hide
Query:  TMEEMYAEATRA--------NRTASPSMAPG-----APREKGAPSIQHGDREPIPNDRGVDYSLRDNDLRKHLTEKKKRASRESEDSPSYSREFSNSNLK
        TME+MY+E  +A        NR A   M         P +KG P  + G+ E   + RG        DLR+HL  K+  + R+ + SPS S    NSN +
Subjt:  TMEEMYAEATRA--------NRTASPSMAPG-----APREKGAPSIQHGDREPIPNDRGVDYSLRDNDLRKHLTEKKKRASRESEDSPSYSREFSNSNLK

Query:  AQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATNA
        A+S Y P+ PE VITREEFD +K KFD QVEALKA+CEKK+ SFDDGDLGESPFT+DILEA IP KFKTP MKPYDGSKDPKDYVEVFEGLMDFQAAT+A
Subjt:  AQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATNA

Query:  IKCRAFQIALTGSARLW----------------------------------------QREGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLT
        IKCR FQIALTGSARLW                                        Q+EGETLREYVTRFQEEQLKVAHCSD SAMCYFLT LADETLT
Subjt:  IKCRAFQIALTGSARLW----------------------------------------QREGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLT

Query:  VKLGEEAPTTFAEVLQKVKKVIDGQELLRTKTGRPEKRIDQKKLNQEMRRPD
        VKL EEAP TF EVLQK KK+IDGQELLRTKT RPEK+IDQ + +++  + D
Subjt:  VKLGEEAPTTFAEVLQKVKKVIDGQELLRTKTGRPEKRIDQKKLNQEMRRPD

XP_022155128.1 uncharacterized protein LOC111022267 [Momordica charantia]1.3e-9165.16Show/hide
Query:  DYSLRDNDLRKHLTEKKKRASRESEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILE
        +Y+ +  DLR+HL  K+  + R+ + SPS S    NSN +A+S Y P+ P+ VITREEFD +K KFD QVEALKA CEKK+ SFDDGDLGE PFT DILE
Subjt:  DYSLRDNDLRKHLTEKKKRASRESEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILE

Query:  APIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATNAIKCRAFQIALTGSARLW----------------------------------------QRE
        API PKFKTP MKPYDGSK+PKDYV+VFEGLM+FQAAT+AIKCRAFQIA TGSARLW                                        Q++
Subjt:  APIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATNAIKCRAFQIALTGSARLW----------------------------------------QRE

Query:  GETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPTTFAEVLQKVKKVIDGQELLRTKTGRPEKRIDQKKLNQE
        GETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAD+TLTVKLGEEAP TFAEVLQK KKVIDGQELLRTKTGRPEK+IDQ+++ ++
Subjt:  GETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPTTFAEVLQKVKKVIDGQELLRTKTGRPEKRIDQKKLNQE

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]1.4e-10675.17Show/hide
Query:  PGAPREKGAPSIQHGDREPIPNDRGVDYSLRDNDLRKHLTEKKKRASRESEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALK
        PGAP EKGAPSIQ G+REPIPND GVDYSLRDNDLRKHLT+KKK+AS E EDS SYSREFSNSNLKAQSKYKPL PEAVI REEFDLMKH+FDEQVEALK
Subjt:  PGAPREKGAPSIQHGDREPIPNDRGVDYSLRDNDLRKHLTEKKKRASRESEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALK

Query:  ARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATNAIKCRAFQIALTGSARLWQRE--GETLREYVTRFQ
        ARCEKK+  FDD DLGESPFT+DI+EAPIPPKFKTP MKPYDGSKDPKDYVEVFEGLMDFQAAT+AIKC AFQIALTGSARLW R     ++  Y    +
Subjt:  ARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATNAIKCRAFQIALTGSARLWQRE--GETLREYVTRFQ

Query:  E--EQLKVAHCSDDSAM-CYFLTGLADETLTVKLGEEAPTTFAEVLQKVKKVIDGQELLRTKTGRPEKRIDQKKLNQEMRRPDMSS
        E   Q    H    +A     +    DETLTVKLGEEAP TFAEVLQ  KKVIDGQELLRTKT RPEK+IDQK+L+Q+ R+ D  S
Subjt:  E--EQLKVAHCSDDSAM-CYFLTGLADETLTVKLGEEAPTTFAEVLQKVKKVIDGQELLRTKTGRPEKRIDQKKLNQEMRRPDMSS

XP_022156088.1 uncharacterized protein LOC111023060 [Momordica charantia]6.9e-9056.02Show/hide
Query:  LDDMHHRSRTMEEMYAEATRANRTASPSMAPGAPREKGAPSIQHGDREPIPNDRG------VDYSLRDNDLRKHLTEKKKRASRESEDSPSYSREFSNSN
        ++ M  + RTMEEMY +  +              R +    + H D     +++G      VD      DLR HL  +K+ +S   E + +Y  +  NSN
Subjt:  LDDMHHRSRTMEEMYAEATRANRTASPSMAPGAPREKGAPSIQHGDREPIPNDRG------VDYSLRDNDLRKHLTEKKKRASRESEDSPSYSREFSNSN

Query:  LKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAAT
         +A+S Y P+APE VITREEF+ +K KFD QVEALK RCEKK+ +FDDGDLGESPFT+DILEA IPPKFKTP MK YDGSKDPKDYVEVFEGLMDFQAAT
Subjt:  LKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAAT

Query:  NAIKCRAFQIALTGSARLW----------------------------------------QREGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADET
        +AIKCRAFQIALTGSARLW                                        Q+EG+TL+EY+TRFQEEQLKV HCSDDS+MCYFLTGLADET
Subjt:  NAIKCRAFQIALTGSARLW----------------------------------------QREGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADET

Query:  LTVKLGEEAPTTFAEVLQKVKKVIDGQELLRTKTGRPEKRIDQKKLNQEMRRPDMSS
         TVKLGEEA  TFAEVLQ  KK IDGQELLRTKT RPEK+IDQKK +Q+ R+ D  S
Subjt:  LTVKLGEEAPTTFAEVLQKVKKVIDGQELLRTKTGRPEKRIDQKKLNQEMRRPDMSS

XP_022159327.1 uncharacterized protein LOC111025738 [Momordica charantia]3.1e-9856.39Show/hide
Query:  ARHTNQELPPAHPKPSKANRGQGGTSRKTSQRASQGADPEALSTLQHELDDMHHRSRTMEEMYAEATRANRTASPSMAPGAPREKGAPSIQHGDREPIPN
        AR T  +L PAHPKP KANRG+GG SR+T+  A+     E    LQ E++ M  +  TMEEMY E  +A    S S    A  E+G              
Subjt:  ARHTNQELPPAHPKPSKANRGQGGTSRKTSQRASQGADPEALSTLQHELDDMHHRSRTMEEMYAEATRANRTASPSMAPGAPREKGAPSIQHGDREPIPN

Query:  DRGVDYSLRDNDLRKHLTEKKKRASRESEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTA
                   DLR HL+ K+  + R+   SPS S +  NSN +A+S Y P+ PE VITREEFD +K KFD QVE LKARCE K  +FDDGDLGESPFT+
Subjt:  DRGVDYSLRDNDLRKHLTEKKKRASRESEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTA

Query:  DILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATNAIKCRAFQIALTGSARLW---------------------------------------
        DILEA IP KFKTP MKPYDGSKDPKDYVEVFEGLM FQAAT+AIK RAFQIALT SARLW                                       
Subjt:  DILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATNAIKCRAFQIALTGSARLW---------------------------------------

Query:  -QREGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPTTFAEVLQKVKKVIDGQELLRTKTGRPEKRIDQKKLNQEMRRPDMSS
         Q+E ETLREYVT FQEEQLKVAH SDDSA+CYFLT L DETLTVKLGEEAP TFAEVLQK KKVIDGQEL RTKTGR EK+IDQKK +QE R+ +  S
Subjt:  -QREGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPTTFAEVLQKVKKVIDGQELLRTKTGRPEKRIDQKKLNQEMRRPDMSS

TrEMBL top hitse value%identityAlignment
A0A6J1DDW5 uncharacterized protein LOC1110196345.1e-9158.52Show/hide
Query:  TMEEMYAEATRA--------NRTASPSMAPG-----APREKGAPSIQHGDREPIPNDRGVDYSLRDNDLRKHLTEKKKRASRESEDSPSYSREFSNSNLK
        TME+MY+E  +A        NR A   M         P +KG P  + G+ E   + RG        DLR+HL  K+  + R+ + SPS S    NSN +
Subjt:  TMEEMYAEATRA--------NRTASPSMAPG-----APREKGAPSIQHGDREPIPNDRGVDYSLRDNDLRKHLTEKKKRASRESEDSPSYSREFSNSNLK

Query:  AQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATNA
        A+S Y P+ PE VITREEFD +K KFD QVEALKA+CEKK+ SFDDGDLGESPFT+DILEA IP KFKTP MKPYDGSKDPKDYVEVFEGLMDFQAAT+A
Subjt:  AQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATNA

Query:  IKCRAFQIALTGSARLW----------------------------------------QREGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLT
        IKCR FQIALTGSARLW                                        Q+EGETLREYVTRFQEEQLKVAHCSD SAMCYFLT LADETLT
Subjt:  IKCRAFQIALTGSARLW----------------------------------------QREGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLT

Query:  VKLGEEAPTTFAEVLQKVKKVIDGQELLRTKTGRPEKRIDQKKLNQEMRRPD
        VKL EEAP TF EVLQK KK+IDGQELLRTKT RPEK+IDQ + +++  + D
Subjt:  VKLGEEAPTTFAEVLQKVKKVIDGQELLRTKTGRPEKRIDQKKLNQEMRRPD

A0A6J1DM55 uncharacterized protein LOC1110222676.1e-9265.16Show/hide
Query:  DYSLRDNDLRKHLTEKKKRASRESEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILE
        +Y+ +  DLR+HL  K+  + R+ + SPS S    NSN +A+S Y P+ P+ VITREEFD +K KFD QVEALKA CEKK+ SFDDGDLGE PFT DILE
Subjt:  DYSLRDNDLRKHLTEKKKRASRESEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILE

Query:  APIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATNAIKCRAFQIALTGSARLW----------------------------------------QRE
        API PKFKTP MKPYDGSK+PKDYV+VFEGLM+FQAAT+AIKCRAFQIA TGSARLW                                        Q++
Subjt:  APIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATNAIKCRAFQIALTGSARLW----------------------------------------QRE

Query:  GETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPTTFAEVLQKVKKVIDGQELLRTKTGRPEKRIDQKKLNQE
        GETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLAD+TLTVKLGEEAP TFAEVLQK KKVIDGQELLRTKTGRPEK+IDQ+++ ++
Subjt:  GETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPTTFAEVLQKVKKVIDGQELLRTKTGRPEKRIDQKKLNQE

A0A6J1DPC9 uncharacterized protein LOC1110222806.7e-10775.17Show/hide
Query:  PGAPREKGAPSIQHGDREPIPNDRGVDYSLRDNDLRKHLTEKKKRASRESEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALK
        PGAP EKGAPSIQ G+REPIPND GVDYSLRDNDLRKHLT+KKK+AS E EDS SYSREFSNSNLKAQSKYKPL PEAVI REEFDLMKH+FDEQVEALK
Subjt:  PGAPREKGAPSIQHGDREPIPNDRGVDYSLRDNDLRKHLTEKKKRASRESEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALK

Query:  ARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATNAIKCRAFQIALTGSARLWQRE--GETLREYVTRFQ
        ARCEKK+  FDD DLGESPFT+DI+EAPIPPKFKTP MKPYDGSKDPKDYVEVFEGLMDFQAAT+AIKC AFQIALTGSARLW R     ++  Y    +
Subjt:  ARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATNAIKCRAFQIALTGSARLWQRE--GETLREYVTRFQ

Query:  E--EQLKVAHCSDDSAM-CYFLTGLADETLTVKLGEEAPTTFAEVLQKVKKVIDGQELLRTKTGRPEKRIDQKKLNQEMRRPDMSS
        E   Q    H    +A     +    DETLTVKLGEEAP TFAEVLQ  KKVIDGQELLRTKT RPEK+IDQK+L+Q+ R+ D  S
Subjt:  E--EQLKVAHCSDDSAM-CYFLTGLADETLTVKLGEEAPTTFAEVLQKVKKVIDGQELLRTKTGRPEKRIDQKKLNQEMRRPDMSS

A0A6J1DPN4 uncharacterized protein LOC1110230603.3e-9056.02Show/hide
Query:  LDDMHHRSRTMEEMYAEATRANRTASPSMAPGAPREKGAPSIQHGDREPIPNDRG------VDYSLRDNDLRKHLTEKKKRASRESEDSPSYSREFSNSN
        ++ M  + RTMEEMY +  +              R +    + H D     +++G      VD      DLR HL  +K+ +S   E + +Y  +  NSN
Subjt:  LDDMHHRSRTMEEMYAEATRANRTASPSMAPGAPREKGAPSIQHGDREPIPNDRG------VDYSLRDNDLRKHLTEKKKRASRESEDSPSYSREFSNSN

Query:  LKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAAT
         +A+S Y P+APE VITREEF+ +K KFD QVEALK RCEKK+ +FDDGDLGESPFT+DILEA IPPKFKTP MK YDGSKDPKDYVEVFEGLMDFQAAT
Subjt:  LKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAAT

Query:  NAIKCRAFQIALTGSARLW----------------------------------------QREGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADET
        +AIKCRAFQIALTGSARLW                                        Q+EG+TL+EY+TRFQEEQLKV HCSDDS+MCYFLTGLADET
Subjt:  NAIKCRAFQIALTGSARLW----------------------------------------QREGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADET

Query:  LTVKLGEEAPTTFAEVLQKVKKVIDGQELLRTKTGRPEKRIDQKKLNQEMRRPDMSS
         TVKLGEEA  TFAEVLQ  KK IDGQELLRTKT RPEK+IDQKK +Q+ R+ D  S
Subjt:  LTVKLGEEAPTTFAEVLQKVKKVIDGQELLRTKTGRPEKRIDQKKLNQEMRRPDMSS

A0A6J1DZJ1 uncharacterized protein LOC1110257381.5e-9856.39Show/hide
Query:  ARHTNQELPPAHPKPSKANRGQGGTSRKTSQRASQGADPEALSTLQHELDDMHHRSRTMEEMYAEATRANRTASPSMAPGAPREKGAPSIQHGDREPIPN
        AR T  +L PAHPKP KANRG+GG SR+T+  A+     E    LQ E++ M  +  TMEEMY E  +A    S S    A  E+G              
Subjt:  ARHTNQELPPAHPKPSKANRGQGGTSRKTSQRASQGADPEALSTLQHELDDMHHRSRTMEEMYAEATRANRTASPSMAPGAPREKGAPSIQHGDREPIPN

Query:  DRGVDYSLRDNDLRKHLTEKKKRASRESEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTA
                   DLR HL+ K+  + R+   SPS S +  NSN +A+S Y P+ PE VITREEFD +K KFD QVE LKARCE K  +FDDGDLGESPFT+
Subjt:  DRGVDYSLRDNDLRKHLTEKKKRASRESEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTA

Query:  DILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATNAIKCRAFQIALTGSARLW---------------------------------------
        DILEA IP KFKTP MKPYDGSKDPKDYVEVFEGLM FQAAT+AIK RAFQIALT SARLW                                       
Subjt:  DILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATNAIKCRAFQIALTGSARLW---------------------------------------

Query:  -QREGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPTTFAEVLQKVKKVIDGQELLRTKTGRPEKRIDQKKLNQEMRRPDMSS
         Q+E ETLREYVT FQEEQLKVAH SDDSA+CYFLT L DETLTVKLGEEAP TFAEVLQK KKVIDGQEL RTKTGR EK+IDQKK +QE R+ +  S
Subjt:  -QREGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPTTFAEVLQKVKKVIDGQELLRTKTGRPEKRIDQKKLNQEMRRPDMSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTCATGCCCAACTCGAACCGCCGAGATTACCACCACTAGCATTGAAAAGTGCTAAGTATAAGGGCCGAGGTGCGACCTGGTCAAAGTCCGATCTACTGGGA
AGCTCGACGGGTCCGATGTCCGCCCAAGTGTTCAGGTCGGTCCGGAGACCGGGTTCGAGCTACGATCAGAACAAACACTCGAGAGCTCGGTGCAAGAATAGTCGA
GGACCAGGTCCGAGCAGGGCAAGAGGGAGATCTGCCGTGCAGATTGCCCGCCATACGAATCAAGAGCTACCACCTGCTCACCCGAAACCCTCAAAGGCCAACAGA
GGCCAAGGAGGGACGTCGAGAAAAACCTCCCAAAGGGCCAGTCAGGGAGCAGACCCCGAAGCTCTGTCTACTCTCCAGCACGAGTTGGATGATATGCACCATCGA
TCGCGCACAATGGAAGAAATGTACGCCGAGGCAACGCGTGCTAACCGAACTGCGTCTCCCTCTATGGCTCCGGGCGCACCCAGAGAAAAGGGAGCTCCATCTATC
CAACATGGCGATCGCGAGCCCATTCCCAACGATAGAGGAGTGGATTATAGCTTGCGGGATAACGATCTGAGAAAACATCTCACTGAAAAGAAGAAGAGAGCATCT
CGGGAGTCGGAAGACTCTCCTTCCTATTCTCGAGAATTCTCCAACTCGAACCTAAAGGCTCAATCAAAATACAAGCCTCTAGCACCAGAAGCTGTGATCACTAGG
GAAGAGTTCGACCTGATGAAGCACAAGTTCGATGAGCAGGTCGAGGCGCTTAAGGCCAGGTGCGAGAAGAAAGACTGTTCGTTCGACGATGGCGACTTGGGAGAA
TCGCCATTCACCGCGGACATCCTGGAGGCTCCAATCCCTCCGAAGTTCAAGACTCCCGCCATGAAGCCCTATGACGGGTCTAAGGACCCTAAAGACTATGTTGAG
GTCTTCGAGGGCCTCATGGACTTTCAAGCGGCGACAAATGCGATCAAATGCCGCGCCTTCCAAATCGCACTTACCGGCAGCGCGCGCCTGTGGCAGAGGGAAGGA
GAAACGCTGAGAGAATATGTCACACGGTTCCAGGAGGAGCAACTTAAGGTCGCGCACTGCTCCGATGATTCGGCCATGTGCTACTTCCTCACGGGTCTGGCTGAT
GAGACCTTGACAGTAAAACTTGGAGAGGAGGCTCCAACCACCTTCGCCGAAGTATTGCAGAAGGTGAAAAAAGTCATTGATGGACAGGAGCTACTCCGAACCAAA
ACTGGCCGACCTGAGAAGCGGATCGACCAAAAGAAGTTGAACCAGGAGATGAGGAGGCCTGATATGTCAAGTCCAAGGATAACGGTCCATCCTCCTCCAATAGCA
GAACAGAGTACCGTAGGACGGAGAGTGGCCCTACCCGGAGCCGACCTTATGAACGGTACACCCCAACCACCATCCCCATCTCCGAGATACTCACGAACATCGAGG
AGAGCGGGATGGAAAAACTCCTCAAGCGACCTGAGAAGCTCTGAGGAGACCCAGAAAAACGTAGCAAAGACAAGTACTGTCGCTTTCATCGAGATCACTGCCACA
ATACGACAAGTTGCTGGGAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTCATGCCCAACTCGAACCGCCGAGATTACCACCACTAGCATTGAAAAGTGCTAAGTATAAGGGCCGAGGTGCGACCTGGTCAAAGTCCGATCTACTGGGA
AGCTCGACGGGTCCGATGTCCGCCCAAGTGTTCAGGTCGGTCCGGAGACCGGGTTCGAGCTACGATCAGAACAAACACTCGAGAGCTCGGTGCAAGAATAGTCGA
GGACCAGGTCCGAGCAGGGCAAGAGGGAGATCTGCCGTGCAGATTGCCCGCCATACGAATCAAGAGCTACCACCTGCTCACCCGAAACCCTCAAAGGCCAACAGA
GGCCAAGGAGGGACGTCGAGAAAAACCTCCCAAAGGGCCAGTCAGGGAGCAGACCCCGAAGCTCTGTCTACTCTCCAGCACGAGTTGGATGATATGCACCATCGA
TCGCGCACAATGGAAGAAATGTACGCCGAGGCAACGCGTGCTAACCGAACTGCGTCTCCCTCTATGGCTCCGGGCGCACCCAGAGAAAAGGGAGCTCCATCTATC
CAACATGGCGATCGCGAGCCCATTCCCAACGATAGAGGAGTGGATTATAGCTTGCGGGATAACGATCTGAGAAAACATCTCACTGAAAAGAAGAAGAGAGCATCT
CGGGAGTCGGAAGACTCTCCTTCCTATTCTCGAGAATTCTCCAACTCGAACCTAAAGGCTCAATCAAAATACAAGCCTCTAGCACCAGAAGCTGTGATCACTAGG
GAAGAGTTCGACCTGATGAAGCACAAGTTCGATGAGCAGGTCGAGGCGCTTAAGGCCAGGTGCGAGAAGAAAGACTGTTCGTTCGACGATGGCGACTTGGGAGAA
TCGCCATTCACCGCGGACATCCTGGAGGCTCCAATCCCTCCGAAGTTCAAGACTCCCGCCATGAAGCCCTATGACGGGTCTAAGGACCCTAAAGACTATGTTGAG
GTCTTCGAGGGCCTCATGGACTTTCAAGCGGCGACAAATGCGATCAAATGCCGCGCCTTCCAAATCGCACTTACCGGCAGCGCGCGCCTGTGGCAGAGGGAAGGA
GAAACGCTGAGAGAATATGTCACACGGTTCCAGGAGGAGCAACTTAAGGTCGCGCACTGCTCCGATGATTCGGCCATGTGCTACTTCCTCACGGGTCTGGCTGAT
GAGACCTTGACAGTAAAACTTGGAGAGGAGGCTCCAACCACCTTCGCCGAAGTATTGCAGAAGGTGAAAAAAGTCATTGATGGACAGGAGCTACTCCGAACCAAA
ACTGGCCGACCTGAGAAGCGGATCGACCAAAAGAAGTTGAACCAGGAGATGAGGAGGCCTGATATGTCAAGTCCAAGGATAACGGTCCATCCTCCTCCAATAGCA
GAACAGAGTACCGTAGGACGGAGAGTGGCCCTACCCGGAGCCGACCTTATGAACGGTACACCCCAACCACCATCCCCATCTCCGAGATACTCACGAACATCGAGG
AGAGCGGGATGGAAAAACTCCTCAAGCGACCTGAGAAGCTCTGAGGAGACCCAGAAAAACGTAGCAAAGACAAGTACTGTCGCTTTCATCGAGATCACTGCCACA
ATACGACAAGTTGCTGGGAATTGA
Protein sequenceShow/hide protein sequence
MSHAQLEPPRLPPLALKSAKYKGRGATWSKSDLLGSSTGPMSAQVFRSVRRPGSSYDQNKHSRARCKNSRGPGPSRARGRSAVQIARHTNQELPPAHPKPSKANR
GQGGTSRKTSQRASQGADPEALSTLQHELDDMHHRSRTMEEMYAEATRANRTASPSMAPGAPREKGAPSIQHGDREPIPNDRGVDYSLRDNDLRKHLTEKKKRAS
RESEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFTADILEAPIPPKFKTPAMKPYDGSKDPKDYVE
VFEGLMDFQAATNAIKCRAFQIALTGSARLWQREGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADETLTVKLGEEAPTTFAEVLQKVKKVIDGQELLRTK
TGRPEKRIDQKKLNQEMRRPDMSSPRITVHPPPIAEQSTVGRRVALPGADLMNGTPQPPSPSPRYSRTSRRAGWKNSSSDLRSSEETQKNVAKTSTVAFIEITAT
IRQVAGN