CuGenDBv2

Moc02g16230 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc02g16230
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr2:12193963..12198582
RNA-Seq Expression: Moc02g16230
Synteny: Moc02g16230
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
                     GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR001878 - Zinc finger, CCHC-type
                  IPR036875 - Zinc finger, CCHC-type superfamily
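
The genome location above is given as a `chrom:start..end` string. As a minimal sketch (not part of the CuGenDB record), the snippet below parses that string and computes the length of the genomic span. The helper names `parse_location` and `span_length` are hypothetical, and the calculation assumes the coordinates are 1-based and inclusive at both ends, which the page itself does not state.

```python
import re


def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr2:12193963..12198582")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the span of Moc02g16230 is 4620 bp; with half-open coordinates it would be one base shorter.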


Homology
GenBank top hits: e-value, % identity, alignment
PON60333.1 Zinc finger, CCHC-type [Parasponia andersonii] | e-value: 6.3e-51 | % identity: 35.28
Query:  EVLDYLDSKELELPL-EGKPDDMGEKEWKKLDRKVL------------------------------------------------------GTSIAANLNE
        ++ DYL +K+L  PL   KP+ M + +W+ LDR+VL                                                      G S+A ++NE
Subjt:  EVLDYLDSKELELPL-EGKPDDMGEKEWKKLDRKVL------------------------------------------------------GTSIAANLNE

Query:  FDALITKLVAIDLEFSDEVYAILLLRSLP-------------------------------DILNVDRG-------------------RNNNRGYGNRGKS
        F+ ++++L ++++ F +EV A++LL SLP                               ++  +D G                   +N+NRG  ++ KS
Subjt:  FDALITKLVAIDLEFSDEVYAILLLRSLP-------------------------------DILNVDRG-------------------RNNNRGYGNRGKS

Query:  KNNRSRSRSS-RFECWNCGKTGHLKRNCKAPKKNDGNEAGANVTKQIHDALVLAVESAHDTWVMDSGASFHTTRQRDILENYIAGNHGKVYLADGEPLDI
        +N R++SRS  R ECWNCGKTGH+K+NC+AP+K+D       +T ++ DAL+L+V++  D+WV+DSGASFHTT  RD+LENYIAGN+GKVYLADGEPLDI
Subjt:  KNNRSRSRSS-RFECWNCGKTGHLKRNCKAPKKNDGNEAGANVTKQIHDALVLAVESAHDTWVMDSGASFHTTRQRDILENYIAGNHGKVYLADGEPLDI

Query:  I------------------------EYDKNLISVGQFDNEGCEISFGQGNWKVTKGAMVIARGSKSGTLVETKTIRN
        +                           +NLISVGQ D EG  ++F  G+WKV+KGAMV+ARG+K+GTL  T   R+
Subjt:  I------------------------EYDKNLISVGQFDNEGCEISFGQGNWKVTKGAMVIARGSKSGTLVETKTIRN

RVW42863.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | e-value: 1.6e-49 | % identity: 36.27
Query:  EVLDYLDSKELELPLEG-KPDDMGEKEWKKLDRKVLG-----------------------------TSIAANLNEFDALITKLVAIDLEFSDEVYAILLL
        ++ DYL  ++L LPL G KP+ M  +EW  LDR+VLG                              S+A +LNEF+ +  +L +++++F DE+ A+++L
Subjt:  EVLDYLDSKELELPLEG-KPDDMGEKEWKKLDRKVLG-----------------------------TSIAANLNEFDALITKLVAIDLEFSDEVYAILLL

Query:  RSLPD---------------------------------------------ILNVD-RGRNNNR----GYGNRGKSKNNRSRSRS-SRFECWNCGKTGHLK
         SLP+                                              LN++ RGR NNR    G  N   S  NRS+SRS  + +CWNCGKTGH K
Subjt:  RSLPD---------------------------------------------ILNVD-RGRNNNR----GYGNRGKSKNNRSRSRS-SRFECWNCGKTGHLK

Query:  RNCKAPKKNDGNEAGANVTKQIHDALVLAVESAHDTWVMDSGASFHTTRQRDILENYIAGNHGKVYLADGEPLDII------------------------
        R CK+PKK + +++   VT+++ DAL+LAV+S  D WV+DSGASFHTT  R+I++NY+AG+ GKVYLADG  LD++                        
Subjt:  RNCKAPKKNDGNEAGANVTKQIHDALVLAVESAHDTWVMDSGASFHTTRQRDILENYIAGNHGKVYLADGEPLDII------------------------

Query:  EYDKNLISVGQFDNEGCEISFGQGNWKVTKGAMVIARGSKSGTLVETK----TIRNEDHSTTTP-----EEPVMESEEEVVELDGPVVEIEELDFSMSRE
        +  +NLISVGQ D+EG  I F  G WKVTKGA V+ARG K+GTL  T     TI   D ST T         + E   +++   G + E++ +DF M   
Subjt:  EYDKNLISVGQFDNEGCEISFGQGNWKVTKGAMVIARGSKSGTLVETK----TIRNEDHSTTTP-----EEPVMESEEEVVELDGPVVEIEELDFSMSRE

Query:  EPLPLQKR
          L  QK+
Subjt:  EPLPLQKR

RVW67125.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | e-value: 5.9e-49 | % identity: 36.52
Query:  EVLDYLDSKELELPLEG-KPDDMGEKEWKKLDRK------------------VLGTSIAANLNEFDALITKLVAIDLEFSDEVYAILLLRSLPD------
        ++ DYL  ++L LPL G KP+ M  +EW  LDR+                      S+A +LNEF+ +  +L +++++F DE+ A+++L SLP+      
Subjt:  EVLDYLDSKELELPLEG-KPDDMGEKEWKKLDRK------------------VLGTSIAANLNEFDALITKLVAIDLEFSDEVYAILLLRSLPD------

Query:  ---------------------------------------ILNVD-RGRNNNR----GYGNRGKSKNNRSRSRS-SRFECWNCGKTGHLKRNCKAPKKNDG
                                                LN++ RGR NNR    G  N   S  NRS+SRS  + +CWNCGKTGH KR CK+PKK + 
Subjt:  ---------------------------------------ILNVD-RGRNNNR----GYGNRGKSKNNRSRSRS-SRFECWNCGKTGHLKRNCKAPKKNDG

Query:  NEAGANVTKQIHDALVLAVESAHDTWVMDSGASFHTTRQRDILENYIAGNHGKVYLADGEPLDII------------------------EYDKNLISVGQ
        +++   VT+++ DAL+LAV+S  D WV+DSGASFHTT  R+I++NY+AG+ GKVYLADG  LD++                        +  +NLISVGQ
Subjt:  NEAGANVTKQIHDALVLAVESAHDTWVMDSGASFHTTRQRDILENYIAGNHGKVYLADGEPLDII------------------------EYDKNLISVGQ

Query:  FDNEGCEISFGQGNWKVTKGAMVIARGSKSGTLVETK----TIRNEDHSTTTP-----EEPVMESEEEVVELDGPVVEIEELDFSMSREEPLPLQKR
         D+EG  I F  G WKVTKGA V+ARG K+GTL  T     TI   D ST T         + E   +++   G + E++ +DF M     L  QK+
Subjt:  FDNEGCEISFGQGNWKVTKGAMVIARGSKSGTLVETK----TIRNEDHSTTTP-----EEPVMESEEEVVELDGPVVEIEELDFSMSREEPLPLQKR

RVW87710.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | e-value: 5.4e-50 | % identity: 36.54
Query:  EVLDYLDSKELELPLEG-KPDDMGEKEWKKLDRKVLGT--------------------------SIAANLNEFDALITKLVAIDLEFSDEVYAILLLRSL
        ++ DYL  ++L LPL G KP+ M  +EW  LDR+VLG                           S+A +LNEF+ +  +L +++++F DE+ A+++L SL
Subjt:  EVLDYLDSKELELPLEG-KPDDMGEKEWKKLDRKVLGT--------------------------SIAANLNEFDALITKLVAIDLEFSDEVYAILLLRSL

Query:  PD---------------------------------------------ILNVD-RGRNNNR----GYGNRGKSKNNRSRSRS-SRFECWNCGKTGHLKRNC
        P+                                              LN++ RGR NNR    G  N   S  NRS+SRS  + +CWNCGKTGH KR C
Subjt:  PD---------------------------------------------ILNVD-RGRNNNR----GYGNRGKSKNNRSRSRS-SRFECWNCGKTGHLKRNC

Query:  KAPKKNDGNEAGANVTKQIHDALVLAVESAHDTWVMDSGASFHTTRQRDILENYIAGNHGKVYLADGEPLDII------------------------EYD
        K+PKK + +++   VT+++ DAL+LAV+S  D WV+DSGASFHTT  R+I++NY+AG+ GKVYLADG  LD++                        +  
Subjt:  KAPKKNDGNEAGANVTKQIHDALVLAVESAHDTWVMDSGASFHTTRQRDILENYIAGNHGKVYLADGEPLDII------------------------EYD

Query:  KNLISVGQFDNEGCEISFGQGNWKVTKGAMVIARGSKSGTLVETK----TIRNEDHSTTTP-----EEPVMESEEEVVELDGPVVEIEELDFSMSREEPL
        +NLISVGQ D+EG  I F  G WKVTKGA V+ARG K+GTL  T     TI   D ST T         + E   +++   G + E++ +DF M     L
Subjt:  KNLISVGQFDNEGCEISFGQGNWKVTKGAMVIARGSKSGTLVETK----TIRNEDHSTTTP-----EEPVMESEEEVVELDGPVVEIEELDFSMSREEPL

Query:  PLQKR
          QK+
Subjt:  PLQKR

XP_022152845.1 uncharacterized protein LOC111020469 [Momordica charantia] | e-value: 6.3e-83 | % identity: 53.44
Query:  VLDYLDSKELELPLEGKPDDMGEKEWKKLDRKVLG-----------------------------------------------------TSIAANLNEFDA
        VLDYL SKELE PLEGKPDDMGEKEWKKLDRKVLG                                                     T I A+LNEFD 
Subjt:  VLDYLDSKELELPLEGKPDDMGEKEWKKLDRKVLG-----------------------------------------------------TSIAANLNEFDA

Query:  LITKLVAIDLEFSDEVYAILLLRSLPD---------------------------------------------ILNVDRGRNNNRGYGNRGKSKNNRSRSR
        LI KLVA+DLEFS EVYAILLLRSLPD                                             +LNVDRGRNNNRGYGNRGKSKNNRSRSR
Subjt:  LITKLVAIDLEFSDEVYAILLLRSLPD---------------------------------------------ILNVDRGRNNNRGYGNRGKSKNNRSRSR

Query:  SSRFECWNCGKTGHLKRNCKAPKKNDGNEAGANVTKQIHDALVLAVESAHDTWVMDSGASFHTTRQRDILENYIAGNHGKVYLADGEPLDIIEYDK----
        +SRFECWNCGK GHLK NCKAPKKN+GNEA ANV +QIHDALV+AVESAHDTWVMDS                  GNHGKVYLADGEPLDII   +    
Subjt:  SSRFECWNCGKTGHLKRNCKAPKKNDGNEAGANVTKQIHDALVLAVESAHDTWVMDSGASFHTTRQRDILENYIAGNHGKVYLADGEPLDIIEYDK----

Query:  ----NLISVGQFDNEGCEISFGQGNWKVTKGAMVIARGSKSGTLV----ETKTIRNEDHSTTT
            ++  + + DNEGCEISFGQGNWKVTKGAMVIARGSKSGTL     +   +   DHS+ T
Subjt:  ----NLISVGQFDNEGCEISFGQGNWKVTKGAMVIARGSKSGTLV----ETKTIRNEDHSTTT

TrEMBL top hits: e-value, % identity, alignment
A0A2N9ESI4 Uncharacterized protein | e-value: 9.5e-53 | % identity: 38.62
Query:  EVLDYLDSKELELPLEG-KPDDMGEKEWKKLDRKVL------------------------------------------------------GTSIAANLNE
        ++ DYL  K+L LPL G KP+DM + EW  LDR+VL                                                      GT++A +LNE
Subjt:  EVLDYLDSKELELPLEG-KPDDMGEKEWKKLDRKVL------------------------------------------------------GTSIAANLNE

Query:  FDALITKLVAIDLEFSDEVYAILLLRSLPD-----------------ILNVD-RGRNNNRGYGNRGKSKNNRSRSRSS---RFECWNCGKTGHLKRNCKA
        F+ +  +L ++++EF DE+ A+++L SLP+                  LN++ RGR  +R Y NRG+SK+ + RS+S    + ECWNCGKTGH+++NC  
Subjt:  FDALITKLVAIDLEFSDEVYAILLLRSLPD-----------------ILNVD-RGRNNNRGYGNRGKSKNNRSRSRSS---RFECWNCGKTGHLKRNCKA

Query:  PKKNDGNEAGANVTKQIHDALVLAVESAHDTWVMDSGASFHTTRQRDILENYIAGNHGKVYLADGEPLDII------------------------EYDKN
         KK + N++   VT+++HDAL+L+V+S  ++WV+DSGASFHTT  R+I++NY+AG+ GKVYLAD E LD++                        E  +N
Subjt:  PKKNDGNEAGANVTKQIHDALVLAVESAHDTWVMDSGASFHTTRQRDILENYIAGNHGKVYLADGEPLDII------------------------EYDKN

Query:  LISVGQFDNEGCEISFGQGNWKVTKGAMVIARGSKSGTLVETKTIRN
        LISVGQ D EG  I F  G WK+TKGAMV+ARG K+GTL  T + R+
Subjt:  LISVGQFDNEGCEISFGQGNWKVTKGAMVIARGSKSGTLVETKTIRN

A0A2N9GTI4 Uncharacterized protein | e-value: 2.0e-50 | % identity: 37.08
Query:  EVLDYLDSKELELPLEG-KPDDMGEKEWKKLDRK-----------------------------------VLGTSIAANLNEFDALITKLVAIDLEFSDEV
        ++ DYL  K+L LPL G KP+DM + EW  LDR+                                     GT++A +LNEF+ +  +L ++++EF DE+
Subjt:  EVLDYLDSKELELPLEG-KPDDMGEKEWKKLDRK-----------------------------------VLGTSIAANLNEFDALITKLVAIDLEFSDEV

Query:  YAILLLRSLPD---------------------------------------------ILNVD-RGRNNNRGYGNRGKSKNNRSRSRSS---RFECWNCGKT
         A+++L SLP+                                              LN++ RGR  +R Y NRG+SK+ + RS+S    + ECWNCGKT
Subjt:  YAILLLRSLPD---------------------------------------------ILNVD-RGRNNNRGYGNRGKSKNNRSRSRSS---RFECWNCGKT

Query:  GHLKRNCKAPKKNDGNEAGANVTKQIHDALVLAVESAHDTWVMDSGASFHTTRQRDILENYIAGNHGKVYLADGEPLDII--------------------
        GH+++NC   KK + N++   VT+++HDAL+L+V+S  ++WV+DSGASFHTT  R+I++NY+AG+ GKVYLAD E LD++                    
Subjt:  GHLKRNCKAPKKNDGNEAGANVTKQIHDALVLAVESAHDTWVMDSGASFHTTRQRDILENYIAGNHGKVYLADGEPLDII--------------------

Query:  ----EYDKNLISVGQFDNEGCEISFGQGNWKVTKGAMVIARGSKSGTLVETKTIRN
            E  +NLISVGQ D EG  I F  G WK+TKGAMV+ARG K+GTL  T + R+
Subjt:  ----EYDKNLISVGQFDNEGCEISFGQGNWKVTKGAMVIARGSKSGTLVETKTIRN

A0A2N9HHD8 Uncharacterized protein | e-value: 2.0e-50 | % identity: 37.08
Query:  EVLDYLDSKELELPLEG-KPDDMGEKEWKKLDRK-----------------------------------VLGTSIAANLNEFDALITKLVAIDLEFSDEV
        ++ DYL  K+L LPL G KP+DM + EW  LDR+                                     GT++A +LNEF+ +  +L ++++EF DE+
Subjt:  EVLDYLDSKELELPLEG-KPDDMGEKEWKKLDRK-----------------------------------VLGTSIAANLNEFDALITKLVAIDLEFSDEV

Query:  YAILLLRSLPD---------------------------------------------ILNVD-RGRNNNRGYGNRGKSKNNRSRSRSS---RFECWNCGKT
         A+++L SLP+                                              LN++ RGR  +R Y NRG+SK+ + RS+S    + ECWNCGKT
Subjt:  YAILLLRSLPD---------------------------------------------ILNVD-RGRNNNRGYGNRGKSKNNRSRSRSS---RFECWNCGKT

Query:  GHLKRNCKAPKKNDGNEAGANVTKQIHDALVLAVESAHDTWVMDSGASFHTTRQRDILENYIAGNHGKVYLADGEPLDII--------------------
        GH+++NC   KK + N++   VT+++HDAL+L+V+S  ++WV+DSGASFHTT  R+I++NY+AG+ GKVYLAD E LD++                    
Subjt:  GHLKRNCKAPKKNDGNEAGANVTKQIHDALVLAVESAHDTWVMDSGASFHTTRQRDILENYIAGNHGKVYLADGEPLDII--------------------

Query:  ----EYDKNLISVGQFDNEGCEISFGQGNWKVTKGAMVIARGSKSGTLVETKTIRN
            E  +NLISVGQ D EG  I F  G WK+TKGAMV+ARG K+GTL  T + R+
Subjt:  ----EYDKNLISVGQFDNEGCEISFGQGNWKVTKGAMVIARGSKSGTLVETKTIRN

A0A2P5CH01 Zinc finger, CCHC-type | e-value: 3.1e-51 | % identity: 35.28
Query:  EVLDYLDSKELELPL-EGKPDDMGEKEWKKLDRKVL------------------------------------------------------GTSIAANLNE
        ++ DYL +K+L  PL   KP+ M + +W+ LDR+VL                                                      G S+A ++NE
Subjt:  EVLDYLDSKELELPL-EGKPDDMGEKEWKKLDRKVL------------------------------------------------------GTSIAANLNE

Query:  FDALITKLVAIDLEFSDEVYAILLLRSLP-------------------------------DILNVDRG-------------------RNNNRGYGNRGKS
        F+ ++++L ++++ F +EV A++LL SLP                               ++  +D G                   +N+NRG  ++ KS
Subjt:  FDALITKLVAIDLEFSDEVYAILLLRSLP-------------------------------DILNVDRG-------------------RNNNRGYGNRGKS

Query:  KNNRSRSRSS-RFECWNCGKTGHLKRNCKAPKKNDGNEAGANVTKQIHDALVLAVESAHDTWVMDSGASFHTTRQRDILENYIAGNHGKVYLADGEPLDI
        +N R++SRS  R ECWNCGKTGH+K+NC+AP+K+D       +T ++ DAL+L+V++  D+WV+DSGASFHTT  RD+LENYIAGN+GKVYLADGEPLDI
Subjt:  KNNRSRSRSS-RFECWNCGKTGHLKRNCKAPKKNDGNEAGANVTKQIHDALVLAVESAHDTWVMDSGASFHTTRQRDILENYIAGNHGKVYLADGEPLDI

Query:  I------------------------EYDKNLISVGQFDNEGCEISFGQGNWKVTKGAMVIARGSKSGTLVETKTIRN
        +                           +NLISVGQ D EG  ++F  G+WKV+KGAMV+ARG+K+GTL  T   R+
Subjt:  I------------------------EYDKNLISVGQFDNEGCEISFGQGNWKVTKGAMVIARGSKSGTLVETKTIRN

A0A6J1DF43 uncharacterized protein LOC111020469 | e-value: 3.0e-83 | % identity: 53.44
Query:  VLDYLDSKELELPLEGKPDDMGEKEWKKLDRKVLG-----------------------------------------------------TSIAANLNEFDA
        VLDYL SKELE PLEGKPDDMGEKEWKKLDRKVLG                                                     T I A+LNEFD 
Subjt:  VLDYLDSKELELPLEGKPDDMGEKEWKKLDRKVLG-----------------------------------------------------TSIAANLNEFDA

Query:  LITKLVAIDLEFSDEVYAILLLRSLPD---------------------------------------------ILNVDRGRNNNRGYGNRGKSKNNRSRSR
        LI KLVA+DLEFS EVYAILLLRSLPD                                             +LNVDRGRNNNRGYGNRGKSKNNRSRSR
Subjt:  LITKLVAIDLEFSDEVYAILLLRSLPD---------------------------------------------ILNVDRGRNNNRGYGNRGKSKNNRSRSR

Query:  SSRFECWNCGKTGHLKRNCKAPKKNDGNEAGANVTKQIHDALVLAVESAHDTWVMDSGASFHTTRQRDILENYIAGNHGKVYLADGEPLDIIEYDK----
        +SRFECWNCGK GHLK NCKAPKKN+GNEA ANV +QIHDALV+AVESAHDTWVMDS                  GNHGKVYLADGEPLDII   +    
Subjt:  SSRFECWNCGKTGHLKRNCKAPKKNDGNEAGANVTKQIHDALVLAVESAHDTWVMDSGASFHTTRQRDILENYIAGNHGKVYLADGEPLDIIEYDK----

Query:  ----NLISVGQFDNEGCEISFGQGNWKVTKGAMVIARGSKSGTLV----ETKTIRNEDHSTTT
            ++  + + DNEGCEISFGQGNWKVTKGAMVIARGSKSGTL     +   +   DHS+ T
Subjt:  ----NLISVGQFDNEGCEISFGQGNWKVTKGAMVIARGSKSGTLV----ETKTIRNEDHSTTT

SwissProt top hits: e-value, % identity, alignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e-value: 3.3e-18 | % identity: 26.9
Query:  GTSIAANLNEFDALITKLVAIDLEFSDEVYAILLLRSLPD-----------------------------------------ILNVDRGRNNNR---GYGN
        GT+  ++LN F+ LIT+L  + ++  +E  AILLL SLP                                          ++   RGR+  R    YG 
Subjt:  GTSIAANLNEFDALITKLVAIDLEFSDEVYAILLLRSLPD-----------------------------------------ILNVDRGRNNNR---GYGN

Query:  RGKSKNNRSRSRSSRFECWNCGKTGHLKRNCKAPKKNDGNEAGAN------VTKQIHDALVLAVESAHD---------TWVMDSGASFHTTRQRDILENY
         G    +++RS+S    C+NC + GH KR+C  P+K  G  +G           Q +D +VL +    +          WV+D+ AS H T  RD+   Y
Subjt:  RGKSKNNRSRSRSSRFECWNCGKTGHLKRNCKAPKKNDGNEAGAN------VTKQIHDALVLAVESAHD---------TWVMDSGASFHTTRQRDILENY

Query:  IAGNHGKVYLAD------------------------GEPLDIIEYDKNLISVGQFDNEGCEISFGQGNWKVTKGAMVIARGSKSGTLVET
        +AG+ G V + +                         +   + +   NLIS    D +G E  F    W++TKG++VIA+G   GTL  T
Subjt:  IAGNHGKVYLAD------------------------GEPLDIIEYDKNLISVGQFDNEGCEISFGQGNWKVTKGAMVIARGSKSGTLVET

Arabidopsis top hits: e-value, % identity, alignment
No hits found

Sequences
CDS sequence
ATGGAGCCAAAATTGAAAGAAGTATTAGATTATTTGGACTCGAAAGAGTTGGAATTGCCATTAGAAGGAAAGCCGGATGATATGGGAGAAAAAGAATGGAAGAAG
TTGGACAGGAAAGTGTTGGGTACATCTATTGCTGCCAATTTAAATGAGTTTGACGCGTTGATTACTAAACTGGTAGCTATTGATTTAGAATTCAGTGATGAAGTT
TATGCTATTTTGTTATTAAGATCTTTGCCTGATATATTGAATGTGGATAGAGGAAGAAATAATAACAGAGGTTATGGGAATCGAGGCAAGTCGAAAAACAACAGA
AGCAGGTCGAGAAGCAGCAGGTTTGAGTGTTGGAATTGTGGTAAGACTGGACACTTGAAGAGGAATTGCAAAGCCCCGAAGAAAAATGATGGGAACGAAGCCGGT
GCTAATGTTACTAAGCAGATACATGATGCTTTGGTTCTTGCAGTTGAGAGCGCTCATGACACATGGGTGATGGATTCAGGTGCGTCTTTTCATACTACAAGACAA
CGTGACATTCTTGAAAATTATATTGCAGGAAATCATGGAAAGGTCTATCTTGCCGATGGAGAGCCTTTGGATATCATTGAATATGATAAAAACTTGATTTCTGTG
GGGCAGTTCGATAATGAAGGATGTGAAATATCCTTTGGTCAAGGAAACTGGAAAGTTACAAAGGGTGCCATGGTGATTGCTCGAGGAAGCAAGTCAGGAACTTTA
GTTGAGACCAAGACAATAAGAAATGAAGATCATAGTACAACTACTCCTGAGGAACCAGTTATGGAATCTGAGGAAGAGGTTGTTGAACTTGATGGACCAGTTGTT
GAAATTGAAGAATTGGACTTCTCCATGAGTAGAGAGGAGCCGCTACCGTTGCAGAAGAGACGTGATGCTTGTTTTGTCTCCAAGTGGGAGATTGTTGGGAATTAA
mRNA sequence
ATGGAGCCAAAATTGAAAGAAGTATTAGATTATTTGGACTCGAAAGAGTTGGAATTGCCATTAGAAGGAAAGCCGGATGATATGGGAGAAAAAGAATGGAAGAAG
TTGGACAGGAAAGTGTTGGGTACATCTATTGCTGCCAATTTAAATGAGTTTGACGCGTTGATTACTAAACTGGTAGCTATTGATTTAGAATTCAGTGATGAAGTT
TATGCTATTTTGTTATTAAGATCTTTGCCTGATATATTGAATGTGGATAGAGGAAGAAATAATAACAGAGGTTATGGGAATCGAGGCAAGTCGAAAAACAACAGA
AGCAGGTCGAGAAGCAGCAGGTTTGAGTGTTGGAATTGTGGTAAGACTGGACACTTGAAGAGGAATTGCAAAGCCCCGAAGAAAAATGATGGGAACGAAGCCGGT
GCTAATGTTACTAAGCAGATACATGATGCTTTGGTTCTTGCAGTTGAGAGCGCTCATGACACATGGGTGATGGATTCAGGTGCGTCTTTTCATACTACAAGACAA
CGTGACATTCTTGAAAATTATATTGCAGGAAATCATGGAAAGGTCTATCTTGCCGATGGAGAGCCTTTGGATATCATTGAATATGATAAAAACTTGATTTCTGTG
GGGCAGTTCGATAATGAAGGATGTGAAATATCCTTTGGTCAAGGAAACTGGAAAGTTACAAAGGGTGCCATGGTGATTGCTCGAGGAAGCAAGTCAGGAACTTTA
GTTGAGACCAAGACAATAAGAAATGAAGATCATAGTACAACTACTCCTGAGGAACCAGTTATGGAATCTGAGGAAGAGGTTGTTGAACTTGATGGACCAGTTGTT
GAAATTGAAGAATTGGACTTCTCCATGAGTAGAGAGGAGCCGCTACCGTTGCAGAAGAGACGTGATGCTTGTTTTGTCTCCAAGTGGGAGATTGTTGGGAATTAA
Protein sequence
MEPKLKEVLDYLDSKELELPLEGKPDDMGEKEWKKLDRKVLGTSIAANLNEFDALITKLVAIDLEFSDEVYAILLLRSLPDILNVDRGRNNNRGYGNRGKSKNNR
SRSRSSRFECWNCGKTGHLKRNCKAPKKNDGNEAGANVTKQIHDALVLAVESAHDTWVMDSGASFHTTRQRDILENYIAGNHGKVYLADGEPLDIIEYDKNLISV
GQFDNEGCEISFGQGNWKVTKGAMVIARGSKSGTLVETKTIRNEDHSTTTPEEPVMESEEEVVELDGPVVEIEELDFSMSREEPLPLQKRRDACFVSKWEIVGN
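
The CDS and protein sequences listed above can be cross-checked against each other by translation. The following is a minimal sketch, not the method CuGenDB uses: it assumes the standard genetic code, reads the CDS in frame from its first base, and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence
    lines can be pasted in as-is. Unknown codons become 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGGAGCCAAAATTGAAAGAAGTA"))
```

The first eight codons of the CDS translate to `MEPKLKEV`, which matches the start of the listed protein sequence; passing the full CDS to `translate` allows the same comparison over the whole record.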