; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g10870 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g10870
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRibonuclease H
Genome locationchr4:8125599..8129466
RNA-Seq ExpressionMoc04g10870
SyntenyMoc04g10870
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0030430 - host cell cytoplasm (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR041588 - Integrase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022149417.1 protein NYNRIN-like [Momordica charantia]5.0e-8389.02Show/hide
Query:  KLASAYEIDLARSVPVEILDTPSILEPDVMEVDTPSPTWMDPIVEFIKGNPPQDPKEQKKMARRADRFTLREGMLYRRGFSLPLLKYVTPEEGLYILREV
        KLASAYE DLARSVPVEILD+PSILEPDVMEVDTPSP+W+DPIVEFIKGNPPQDPKEQKKMARRA RFTLREG LYRRGFSLPLLK VTPEEGLYILRE+
Subjt:  KLASAYEIDLARSVPVEILDTPSILEPDVMEVDTPSPTWMDPIVEFIKGNPPQDPKEQKKMARRADRFTLREGMLYRRGFSLPLLKYVTPEEGLYILREV

Query:  HEGVCGNHSGARSLSAKVVRQGYYWPSFEQDAKQFVKACDNCQCFASIIHQPPELLTLISAPWPFAQWGVDII
        +EG CGN SGARSLSAKVVRQGYYWP+ EQDA+QFVK CDNCQ FA+IIHQP ELLT ISAPWPFAQWGVDII
Subjt:  HEGVCGNHSGARSLSAKVVRQGYYWPSFEQDAKQFVKACDNCQCFASIIHQPPELLTLISAPWPFAQWGVDII

XP_022150038.1 uncharacterized protein K02A2.6-like [Momordica charantia]2.3e-8086.78Show/hide
Query:  AKLASAYEIDLARSVPVEILDTPSILEPDVMEVDTPSPTWMDPIVEFIKGNPPQDPKEQKKMARRADRFTLREGMLYRRGFSLPLLKYVTPEEGLYILRE
        AKLAS YE DLARSVPVEILDTPSILE DV  VDTPSPTWMDPIVEFIKGNPPQ+PKEQKKM RRA RFTLRE MLYRRGFSLPLLK VTPEEGLYILRE
Subjt:  AKLASAYEIDLARSVPVEILDTPSILEPDVMEVDTPSPTWMDPIVEFIKGNPPQDPKEQKKMARRADRFTLREGMLYRRGFSLPLLKYVTPEEGLYILRE

Query:  VHEGVCGNHSGARSLSAKVVRQGYYWPSFEQDAKQFVKACDNCQCFASIIHQPPELLTLISAPWPFAQWGVDII
        +HEGVCGNHSG RSLSAKV+RQGYYWPS EQDA+ FVKACDNCQ FA+IIHQ P LLT IS PW FAQWGVDII
Subjt:  VHEGVCGNHSGARSLSAKVVRQGYYWPSFEQDAKQFVKACDNCQCFASIIHQPPELLTLISAPWPFAQWGVDII

XP_022154406.1 uncharacterized protein LOC111021683 [Momordica charantia]1.9e-12746.89Show/hide
Query:  RANQEADPEALSTLQRELDDMPPGAPGEKGAPSIQPGDREPIPNDGGMDYSLRDNDMRKHLTEKKKRASREPEDSLSYSRKFSNSNLKAQSKYKPLAPEA
        RAN+ A P            + PGAPGEKGAPSIQPGDREPIPNDGGMDYSLRDNDMRKHLTEKKKRASREPEDSLSYSRKFSNSNLKAQSKYKPLAPEA
Subjt:  RANQEADPEALSTLQRELDDMPPGAPGEKGAPSIQPGDREPIPNDGGMDYSLRDNDMRKHLTEKKKRASREPEDSLSYSRKFSNSNLKAQSKYKPLAPEA

Query:  MITREEFDLMKHKFGEQVEALKARCEKKECSFDNGDLGESPFTPDILEAPIPPKFKTPTMKPYDG-----------------------SKDPKDYVEVFE
        MITREEFDLMKHKFGEQVEALKARCEKKECSFDNGDLGESPFTPDILEAPIPPKFKTPTMKPYDG                        K P ++ E  E
Subjt:  MITREEFDLMKHKFGEQVEALKARCEKKECSFDNGDLGESPFTPDILEAPIPPKFKTPTMKPYDG-----------------------SKDPKDYVEVFE

Query:  GLMD------------------------------------------------------------------------------------------FQAAT-
          +                                                                                           FQA T 
Subjt:  GLMD------------------------------------------------------------------------------------------FQAAT-

Query:  --------------------------------------DAIKCRA--------------------FQIALTGSARQ------------------------
                                               A+K +A                    + I + GS+ +                        
Subjt:  --------------------------------------DAIKCRA--------------------FQIALTGSARQ------------------------

Query:  -------------------------------------------------------------------------------------AKLASAYEIDLARSV
                                                                                             AKLASAYEIDLARSV
Subjt:  -------------------------------------------------------------------------------------AKLASAYEIDLARSV

Query:  PVEILDTPSILEPDVMEVDTPSPTWMDPIVEFIKGNPPQDPKEQKKMARRADRFTLREGMLYRRGFSLPLLKYVTPEEGLYILREVHEGVCGNHSGARSL
        PVEILDTPSILEPDVMEVDTPSPTWMDPIVEFIKGNPPQDPKEQKKMARRADRFTLREGMLYRRGFSLPLLKYVTPEEGLYILREVHEG           
Subjt:  PVEILDTPSILEPDVMEVDTPSPTWMDPIVEFIKGNPPQDPKEQKKMARRADRFTLREGMLYRRGFSLPLLKYVTPEEGLYILREVHEGVCGNHSGARSL

Query:  SAKVVRQGYYWPSFEQDAKQFVKACDNCQCFASIIHQPPELLTLISAPWPFAQWGVDII
                        DAKQFVKACDNCQCFASIIHQPPELLTLISAPWPFAQWGVDII
Subjt:  SAKVVRQGYYWPSFEQDAKQFVKACDNCQCFASIIHQPPELLTLISAPWPFAQWGVDII

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]5.9e-8488.95Show/hide
Query:  PGAPGEKGAPSIQPGDREPIPNDGGMDYSLRDNDMRKHLTEKKKRASREPEDSLSYSRKFSNSNLKAQSKYKPLAPEAMITREEFDLMKHKFGEQVEALK
        PGAPGEKGAPSIQPG+REPIPND G+DYSLRDND+RKHLT+KKK+AS EPEDSLSYSR+FSNSNLKAQSKYKPL PEA+I REEFDLMKH+F EQVEALK
Subjt:  PGAPGEKGAPSIQPGDREPIPNDGGMDYSLRDNDMRKHLTEKKKRASREPEDSLSYSRKFSNSNLKAQSKYKPLAPEAMITREEFDLMKHKFGEQVEALK

Query:  ARCEKKECSFDNGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSAR
        ARCEKKE  FD+ DLGESPFT DI+EAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGSAR
Subjt:  ARCEKKECSFDNGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSAR

XP_022158092.1 uncharacterized protein LOC111024661 [Momordica charantia]3.5e-8488.51Show/hide
Query:  AKLASAYEIDLARSVPVEILDTPSILEPDVMEVDTPSPTWMDPIVEFIKGNPPQDPKEQKKMARRADRFTLREGMLYRRGFSLPLLKYVTPEEGLYILRE
        AKLASAYE DLARSVPVEILD+PSIL+PDVME+DTPSP+WMDPI+EFIKGNPPQD KEQK+MARRA RF LR+G+LYRRGFSLPLLK VTPEEGLYILRE
Subjt:  AKLASAYEIDLARSVPVEILDTPSILEPDVMEVDTPSPTWMDPIVEFIKGNPPQDPKEQKKMARRADRFTLREGMLYRRGFSLPLLKYVTPEEGLYILRE

Query:  VHEGVCGNHSGARSLSAKVVRQGYYWPSFEQDAKQFVKACDNCQCFASIIHQPPELLTLISAPWPFAQWGVDII
        +HEGVCGNHSGARSLSAKVVRQGYYWP+ E DAKQFVK CDNCQ FASIIHQPPELLT ISAPWPFAQWGVDII
Subjt:  VHEGVCGNHSGARSLSAKVVRQGYYWPSFEQDAKQFVKACDNCQCFASIIHQPPELLTLISAPWPFAQWGVDII

TrEMBL top hitse value%identityAlignment
A0A6J1D7W6 Ribonuclease H2.4e-8389.02Show/hide
Query:  KLASAYEIDLARSVPVEILDTPSILEPDVMEVDTPSPTWMDPIVEFIKGNPPQDPKEQKKMARRADRFTLREGMLYRRGFSLPLLKYVTPEEGLYILREV
        KLASAYE DLARSVPVEILD+PSILEPDVMEVDTPSP+W+DPIVEFIKGNPPQDPKEQKKMARRA RFTLREG LYRRGFSLPLLK VTPEEGLYILRE+
Subjt:  KLASAYEIDLARSVPVEILDTPSILEPDVMEVDTPSPTWMDPIVEFIKGNPPQDPKEQKKMARRADRFTLREGMLYRRGFSLPLLKYVTPEEGLYILREV

Query:  HEGVCGNHSGARSLSAKVVRQGYYWPSFEQDAKQFVKACDNCQCFASIIHQPPELLTLISAPWPFAQWGVDII
        +EG CGN SGARSLSAKVVRQGYYWP+ EQDA+QFVK CDNCQ FA+IIHQP ELLT ISAPWPFAQWGVDII
Subjt:  HEGVCGNHSGARSLSAKVVRQGYYWPSFEQDAKQFVKACDNCQCFASIIHQPPELLTLISAPWPFAQWGVDII

A0A6J1D8D8 Ribonuclease H1.1e-8086.78Show/hide
Query:  AKLASAYEIDLARSVPVEILDTPSILEPDVMEVDTPSPTWMDPIVEFIKGNPPQDPKEQKKMARRADRFTLREGMLYRRGFSLPLLKYVTPEEGLYILRE
        AKLAS YE DLARSVPVEILDTPSILE DV  VDTPSPTWMDPIVEFIKGNPPQ+PKEQKKM RRA RFTLRE MLYRRGFSLPLLK VTPEEGLYILRE
Subjt:  AKLASAYEIDLARSVPVEILDTPSILEPDVMEVDTPSPTWMDPIVEFIKGNPPQDPKEQKKMARRADRFTLREGMLYRRGFSLPLLKYVTPEEGLYILRE

Query:  VHEGVCGNHSGARSLSAKVVRQGYYWPSFEQDAKQFVKACDNCQCFASIIHQPPELLTLISAPWPFAQWGVDII
        +HEGVCGNHSG RSLSAKV+RQGYYWPS EQDA+ FVKACDNCQ FA+IIHQ P LLT IS PW FAQWGVDII
Subjt:  VHEGVCGNHSGARSLSAKVVRQGYYWPSFEQDAKQFVKACDNCQCFASIIHQPPELLTLISAPWPFAQWGVDII

A0A6J1DNM4 Ribonuclease H9.4e-12846.89Show/hide
Query:  RANQEADPEALSTLQRELDDMPPGAPGEKGAPSIQPGDREPIPNDGGMDYSLRDNDMRKHLTEKKKRASREPEDSLSYSRKFSNSNLKAQSKYKPLAPEA
        RAN+ A P            + PGAPGEKGAPSIQPGDREPIPNDGGMDYSLRDNDMRKHLTEKKKRASREPEDSLSYSRKFSNSNLKAQSKYKPLAPEA
Subjt:  RANQEADPEALSTLQRELDDMPPGAPGEKGAPSIQPGDREPIPNDGGMDYSLRDNDMRKHLTEKKKRASREPEDSLSYSRKFSNSNLKAQSKYKPLAPEA

Query:  MITREEFDLMKHKFGEQVEALKARCEKKECSFDNGDLGESPFTPDILEAPIPPKFKTPTMKPYDG-----------------------SKDPKDYVEVFE
        MITREEFDLMKHKFGEQVEALKARCEKKECSFDNGDLGESPFTPDILEAPIPPKFKTPTMKPYDG                        K P ++ E  E
Subjt:  MITREEFDLMKHKFGEQVEALKARCEKKECSFDNGDLGESPFTPDILEAPIPPKFKTPTMKPYDG-----------------------SKDPKDYVEVFE

Query:  GLMD------------------------------------------------------------------------------------------FQAAT-
          +                                                                                           FQA T 
Subjt:  GLMD------------------------------------------------------------------------------------------FQAAT-

Query:  --------------------------------------DAIKCRA--------------------FQIALTGSARQ------------------------
                                               A+K +A                    + I + GS+ +                        
Subjt:  --------------------------------------DAIKCRA--------------------FQIALTGSARQ------------------------

Query:  -------------------------------------------------------------------------------------AKLASAYEIDLARSV
                                                                                             AKLASAYEIDLARSV
Subjt:  -------------------------------------------------------------------------------------AKLASAYEIDLARSV

Query:  PVEILDTPSILEPDVMEVDTPSPTWMDPIVEFIKGNPPQDPKEQKKMARRADRFTLREGMLYRRGFSLPLLKYVTPEEGLYILREVHEGVCGNHSGARSL
        PVEILDTPSILEPDVMEVDTPSPTWMDPIVEFIKGNPPQDPKEQKKMARRADRFTLREGMLYRRGFSLPLLKYVTPEEGLYILREVHEG           
Subjt:  PVEILDTPSILEPDVMEVDTPSPTWMDPIVEFIKGNPPQDPKEQKKMARRADRFTLREGMLYRRGFSLPLLKYVTPEEGLYILREVHEGVCGNHSGARSL

Query:  SAKVVRQGYYWPSFEQDAKQFVKACDNCQCFASIIHQPPELLTLISAPWPFAQWGVDII
                        DAKQFVKACDNCQCFASIIHQPPELLTLISAPWPFAQWGVDII
Subjt:  SAKVVRQGYYWPSFEQDAKQFVKACDNCQCFASIIHQPPELLTLISAPWPFAQWGVDII

A0A6J1DPC9 uncharacterized protein LOC1110222802.9e-8488.95Show/hide
Query:  PGAPGEKGAPSIQPGDREPIPNDGGMDYSLRDNDMRKHLTEKKKRASREPEDSLSYSRKFSNSNLKAQSKYKPLAPEAMITREEFDLMKHKFGEQVEALK
        PGAPGEKGAPSIQPG+REPIPND G+DYSLRDND+RKHLT+KKK+AS EPEDSLSYSR+FSNSNLKAQSKYKPL PEA+I REEFDLMKH+F EQVEALK
Subjt:  PGAPGEKGAPSIQPGDREPIPNDGGMDYSLRDNDMRKHLTEKKKRASREPEDSLSYSRKFSNSNLKAQSKYKPLAPEAMITREEFDLMKHKFGEQVEALK

Query:  ARCEKKECSFDNGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSAR
        ARCEKKE  FD+ DLGESPFT DI+EAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGSAR
Subjt:  ARCEKKECSFDNGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSAR

A0A6J1DWA1 Ribonuclease H1.7e-8488.51Show/hide
Query:  AKLASAYEIDLARSVPVEILDTPSILEPDVMEVDTPSPTWMDPIVEFIKGNPPQDPKEQKKMARRADRFTLREGMLYRRGFSLPLLKYVTPEEGLYILRE
        AKLASAYE DLARSVPVEILD+PSIL+PDVME+DTPSP+WMDPI+EFIKGNPPQD KEQK+MARRA RF LR+G+LYRRGFSLPLLK VTPEEGLYILRE
Subjt:  AKLASAYEIDLARSVPVEILDTPSILEPDVMEVDTPSPTWMDPIVEFIKGNPPQDPKEQKKMARRADRFTLREGMLYRRGFSLPLLKYVTPEEGLYILRE

Query:  VHEGVCGNHSGARSLSAKVVRQGYYWPSFEQDAKQFVKACDNCQCFASIIHQPPELLTLISAPWPFAQWGVDII
        +HEGVCGNHSGARSLSAKVVRQGYYWP+ E DAKQFVK CDNCQ FASIIHQPPELLT ISAPWPFAQWGVDII
Subjt:  VHEGVCGNHSGARSLSAKVVRQGYYWPSFEQDAKQFVKACDNCQCFASIIHQPPELLTLISAPWPFAQWGVDII

SwissProt top hitse value%identityAlignment
Q4R6I1 Gypsy retrotransposon integrase-like protein 12.5e-0833.33Show/hide
Query:  NPPQDPKEQKKMARRADRFTLREGMLYRRGFSLP--LLKYVTPEEGLYILREVHEGVCGNHSGARSLSAKVVRQGYYWPSFEQDAKQFVKACDNCQCFA-
        +P   P E+  + R A +F  ++  L+  G       L  V+ EE   +LRE HE   G H G  S +  +V   YYW S   D KQ+V AC +CQ    
Subjt:  NPPQDPKEQKKMARRADRFTLREGMLYRRGFSLP--LLKYVTPEEGLYILREVHEGVCGNHSGARSLSAKVVRQGYYWPSFEQDAKQFVKACDNCQCFA-

Query:  SIIHQPPELLTLISAPW
        ++I  P + L  +  PW
Subjt:  SIIHQPPELLTLISAPW

Q5RBK0 Gypsy retrotransposon integrase-like protein 16.5e-0934.19Show/hide
Query:  NPPQDPKEQKKMARRADRFTLREGMLYRRGFSLP--LLKYVTPEEGLYILREVHEGVCGNHSGARSLSAKVVRQGYYWPSFEQDAKQFVKACDNCQCFA-
        +P   P E++ + R A +F  +E  L+  G       L  V+ EE   +LRE HE   G H G  S +  +V   YYW S   D KQ+V AC +CQ    
Subjt:  NPPQDPKEQKKMARRADRFTLREGMLYRRGFSLP--LLKYVTPEEGLYILREVHEGVCGNHSGARSLSAKVVRQGYYWPSFEQDAKQFVKACDNCQCFA-

Query:  SIIHQPPELLTLISAPW
        ++I  P + L  +  PW
Subjt:  SIIHQPPELLTLISAPW

Q66H30 Gypsy retrotransposon integrase-like protein 12.5e-0834.19Show/hide
Query:  NPPQDPKEQKKMARRADRFTLREGMLYRRGFSLP--LLKYVTPEEGLYILREVHEGVCGNHSGARSLSAKVVRQGYYWPSFEQDAKQFVKACDNCQCFAS
        +P     E+  + R A +F  +E  L+  G       L  V+ EE   +LRE HE   G H G  S +  +V   YYW S   D KQ+V AC +CQ   S
Subjt:  NPPQDPKEQKKMARRADRFTLREGMLYRRGFSLP--LLKYVTPEEGLYILREVHEGVCGNHSGARSLSAKVVRQGYYWPSFEQDAKQFVKACDNCQCFAS

Query:  -IIHQPPELLTLISAPW
         +I  P + L+++  PW
Subjt:  -IIHQPPELLTLISAPW

Q8K259 Gypsy retrotransposon integrase-like protein 11.7e-0935.04Show/hide
Query:  NPPQDPKEQKKMARRADRFTLREGMLYRRGFSLP--LLKYVTPEEGLYILREVHEGVCGNHSGARSLSAKVVRQGYYWPSFEQDAKQFVKACDNCQCFA-
        +P   P E+  + R A +F  +E  L+  G       L  V+ EE   +LRE HE   G H G  S +  +V  GYYW S   D KQ+V AC +CQ    
Subjt:  NPPQDPKEQKKMARRADRFTLREGMLYRRGFSLP--LLKYVTPEEGLYILREVHEGVCGNHSGARSLSAKVVRQGYYWPSFEQDAKQFVKACDNCQCFA-

Query:  SIIHQPPELLTLISAPW
        ++I  P + L ++  PW
Subjt:  SIIHQPPELLTLISAPW

Q9NXP7 Gypsy retrotransposon integrase-like protein 11.1e-0834.82Show/hide
Query:  PKEQKKMARRADRFTLREGMLYRRGFSLP--LLKYVTPEEGLYILREVHEGVCGNHSGARSLSAKVVRQGYYWPSFEQDAKQFVKACDNCQCFA-SIIHQ
        P E+  + R A +F  +E  L+  G       L  V+ EE   +LRE HE   G H G  S +  +V   YYW S   D KQ+V AC +CQ    ++I  
Subjt:  PKEQKKMARRADRFTLREGMLYRRGFSLP--LLKYVTPEEGLYILREVHEGVCGNHSGARSLSAKVVRQGYYWPSFEQDAKQFVKACDNCQCFA-SIIHQ

Query:  PPELLTLISAPW
        P + L  +  PW
Subjt:  PPELLTLISAPW

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCACCAAAGTTCGACCTGACTGGGGTCCGACCTGCTCGGAACCCGACAGGTCCACTCTGTTGTTCAGGTCGGAACCGAAGACCGGGTTCGAACTCGATTCGTGCAG
AACCGTTGCAAGGATGGTGCATCCAGCAAACTCTGCCAATACGACAGAACAGAAGGGTGTGAACGCTGATAACGACCCTCAGCGAGACCTCGGTGCAAGGATAGTCGAGG
ACCAGGTCCGAGCAGGGCAAGAGGGAGATCTGTCGCGTAGATCTGCCCGTCATGCGAACCAAGAGCTACCACCTGCTCACCCGAAACCCTCAAAAGCAAACAGAGGCCGA
GGAGGGACGTCGAGAAAGACCTCCCAAAGGGCCAACCAGGAAGCAGACCCTGAAGCTCTGTCTACTCTCCAGCGCGAGTTGGATGATATGCCCCCGGGCGCACCCGGTGA
AAAGGGAGCTCCGTCTATCCAACCTGGCGATCGCGAGCCCATTCCCAACGATGGAGGAATGGATTACAGCTTGCGGGATAACGATATGAGAAAGCATCTCACTGAAAAGA
AGAAGAGAGCATCTCGGGAGCCGGAAGACTCTCTTTCCTACTCCCGAAAATTCTCCAACTCGAACCTAAAGGCTCAATCAAAATACAAACCTCTAGCACCAGAAGCTATG
ATCACTAGGGAAGAGTTCGACCTGATGAAGCACAAGTTCGGTGAGCAGGTTGAGGCGCTTAAGGCCAGGTGCGAGAAGAAAGAATGTTCGTTCGACAATGGCGACTTGGG
AGAATCGCCATTCACCCCGGACATCCTGGAGGCTCCAATCCCTCCGAAGTTCAAGACTCCTACCATGAAGCCCTATGATGGGTCTAAGGACCCTAAAGACTATGTTGAGG
TCTTCGAAGGCCTCATGGACTTTCAAGCGGCGACAGATGCGATCAAATGCCGCGCCTTCCAGATCGCGCTCACCGGCAGCGCGCGCCAAGCTAAATTGGCATCAGCGTAC
GAGATCGACCTGGCTAGATCGGTCCCGGTCGAGATCTTGGACACTCCTTCAATCTTGGAGCCAGATGTGATGGAGGTTGATACTCCATCACCCACTTGGATGGACCCAAT
CGTGGAGTTCATCAAAGGAAACCCACCGCAAGATCCGAAGGAGCAAAAGAAGATGGCGCGAAGAGCGGATCGGTTCACGCTCCGAGAAGGAATGTTGTACCGACGTGGCT
TCTCTCTGCCCCTCCTCAAGTATGTGACTCCCGAAGAAGGCCTTTACATTCTTAGGGAAGTTCATGAAGGGGTGTGTGGAAACCACTCTGGCGCCAGGTCGTTGTCGGCC
AAAGTGGTTCGACAAGGGTACTATTGGCCTAGTTTCGAGCAGGATGCAAAGCAGTTTGTGAAAGCTTGTGACAACTGCCAGTGTTTCGCAAGCATTATTCATCAACCTCC
CGAACTGCTCACCCTCATCTCGGCCCCATGGCCATTTGCGCAATGGGGGGTAGACATCATAGAACAGAATGGCCAGACATTACAATGCCCGAGTTCGACCTCGAAGCTTC
CAAGTTGGACATTTGGTCTTAAGGAAAATTCAGAGTCATGTTGGCACATTCGACCTGGAACTTATATGCTGGTCGATCTGGAGGGAAAAATGCTTGCGCATCCATGGAAC
GCGGAGCACTTGAAGCGCTATTACCCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCACCAAAGTTCGACCTGACTGGGGTCCGACCTGCTCGGAACCCGACAGGTCCACTCTGTTGTTCAGGTCGGAACCGAAGACCGGGTTCGAACTCGATTCGTGCAG
AACCGTTGCAAGGATGGTGCATCCAGCAAACTCTGCCAATACGACAGAACAGAAGGGTGTGAACGCTGATAACGACCCTCAGCGAGACCTCGGTGCAAGGATAGTCGAGG
ACCAGGTCCGAGCAGGGCAAGAGGGAGATCTGTCGCGTAGATCTGCCCGTCATGCGAACCAAGAGCTACCACCTGCTCACCCGAAACCCTCAAAAGCAAACAGAGGCCGA
GGAGGGACGTCGAGAAAGACCTCCCAAAGGGCCAACCAGGAAGCAGACCCTGAAGCTCTGTCTACTCTCCAGCGCGAGTTGGATGATATGCCCCCGGGCGCACCCGGTGA
AAAGGGAGCTCCGTCTATCCAACCTGGCGATCGCGAGCCCATTCCCAACGATGGAGGAATGGATTACAGCTTGCGGGATAACGATATGAGAAAGCATCTCACTGAAAAGA
AGAAGAGAGCATCTCGGGAGCCGGAAGACTCTCTTTCCTACTCCCGAAAATTCTCCAACTCGAACCTAAAGGCTCAATCAAAATACAAACCTCTAGCACCAGAAGCTATG
ATCACTAGGGAAGAGTTCGACCTGATGAAGCACAAGTTCGGTGAGCAGGTTGAGGCGCTTAAGGCCAGGTGCGAGAAGAAAGAATGTTCGTTCGACAATGGCGACTTGGG
AGAATCGCCATTCACCCCGGACATCCTGGAGGCTCCAATCCCTCCGAAGTTCAAGACTCCTACCATGAAGCCCTATGATGGGTCTAAGGACCCTAAAGACTATGTTGAGG
TCTTCGAAGGCCTCATGGACTTTCAAGCGGCGACAGATGCGATCAAATGCCGCGCCTTCCAGATCGCGCTCACCGGCAGCGCGCGCCAAGCTAAATTGGCATCAGCGTAC
GAGATCGACCTGGCTAGATCGGTCCCGGTCGAGATCTTGGACACTCCTTCAATCTTGGAGCCAGATGTGATGGAGGTTGATACTCCATCACCCACTTGGATGGACCCAAT
CGTGGAGTTCATCAAAGGAAACCCACCGCAAGATCCGAAGGAGCAAAAGAAGATGGCGCGAAGAGCGGATCGGTTCACGCTCCGAGAAGGAATGTTGTACCGACGTGGCT
TCTCTCTGCCCCTCCTCAAGTATGTGACTCCCGAAGAAGGCCTTTACATTCTTAGGGAAGTTCATGAAGGGGTGTGTGGAAACCACTCTGGCGCCAGGTCGTTGTCGGCC
AAAGTGGTTCGACAAGGGTACTATTGGCCTAGTTTCGAGCAGGATGCAAAGCAGTTTGTGAAAGCTTGTGACAACTGCCAGTGTTTCGCAAGCATTATTCATCAACCTCC
CGAACTGCTCACCCTCATCTCGGCCCCATGGCCATTTGCGCAATGGGGGGTAGACATCATAGAACAGAATGGCCAGACATTACAATGCCCGAGTTCGACCTCGAAGCTTC
CAAGTTGGACATTTGGTCTTAAGGAAAATTCAGAGTCATGTTGGCACATTCGACCTGGAACTTATATGCTGGTCGATCTGGAGGGAAAAATGCTTGCGCATCCATGGAAC
GCGGAGCACTTGAAGCGCTATTACCCCTGA
Protein sequenceShow/hide protein sequence
MGTKVRPDWGPTCSEPDRSTLLFRSEPKTGFELDSCRTVARMVHPANSANTTEQKGVNADNDPQRDLGARIVEDQVRAGQEGDLSRRSARHANQELPPAHPKPSKANRGR
GGTSRKTSQRANQEADPEALSTLQRELDDMPPGAPGEKGAPSIQPGDREPIPNDGGMDYSLRDNDMRKHLTEKKKRASREPEDSLSYSRKFSNSNLKAQSKYKPLAPEAM
ITREEFDLMKHKFGEQVEALKARCEKKECSFDNGDLGESPFTPDILEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARQAKLASAY
EIDLARSVPVEILDTPSILEPDVMEVDTPSPTWMDPIVEFIKGNPPQDPKEQKKMARRADRFTLREGMLYRRGFSLPLLKYVTPEEGLYILREVHEGVCGNHSGARSLSA
KVVRQGYYWPSFEQDAKQFVKACDNCQCFASIIHQPPELLTLISAPWPFAQWGVDIIEQNGQTLQCPSSTSKLPSWTFGLKENSESCWHIRPGTYMLVDLEGKMLAHPWN
AEHLKRYYP