; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g17640 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g17640
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr2:13210372..13215047
RNA-Seq ExpressionMoc02g17640
SyntenyMoc02g17640
Gene Ontology termsGO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG69190.1 hypothetical protein EZV62_004125 [Acer yangbiense]3.6e-2525.37Show/hide
Query:  KNWRLTGFYGHPEEANIDLSWDLLRWLRGSNDDPWLVGGDFNGILSHSEKDGGRRKDDKLINEFSKSA--CSIPRTLG----RNKTGNFRNCLKVAEAKL
        K WRLTGFYGHP        W+LLR L G +  PW VGGDFN I+  SEK GG  + +  +  F ++   C + R LG    R    N R+     + +L
Subjt:  KNWRLTGFYGHPEEANIDLSWDLLRWLRGSNDDPWLVGGDFNGILSHSEKDGGRRKDDKLINEFSKSA--CSIPRTLG----RNKTGNFRNCLKVAEAKL

Query:  QSAIHDLRFAPNRDAFQ------HATTNMNQLLKEEERIMTSGAEE-------------------------DKLIWNYEKTGV----YLVRSGYKVALLN
           + +  +      F         + +   LL+  ++   +G                            D +++   K  +    +L    ++V    
Subjt:  QSAIHDLRFAPNRDAFQ------HATTNMNQLLKEEERIMTSGAEE-------------------------DKLIWNYEKTGV----YLVRSGYKVALLN

Query:  NSCGQAPSSSSSE--ELVEWANNYVMEFREANSNPFPGRVTNTAEILWQPPDEGIYKINTDTSFLASDQHAGLGIIIRNDRGQVMATATKYLENIQSVDM
        N      SS +    ++++WA +++ +F+ A +      V       W+P   G YKINTD +     +  G+G++IR+  G VMA+  +    +     
Subjt:  NSCGQAPSSSSSE--ELVEWANNYVMEFREANSNPFPGRVTNTAEILWQPPDEGIYKINTDTSFLASDQHAGLGIIIRNDRGQVMATATKYLENIQSVDM

Query:  AEAVAAVEGLQLASKIGVNPVILETDSSRIFNLFSQPSEDLSKTGEIVLKAKNFWTQTLHASFNFMKRKGNKAAHLFAWRALLLREFSIWMEDWPLELKS
         EAVA + G +LA + G+ P  +E+DS  + NL +      ++ G ++       + +  +S +F+ R  N  AH  A  +L      +W+ED PL ++S
Subjt:  AEAVAAVEGLQLASKIGVNPVILETDSSRIFNLFSQPSEDLSKTGEIVLKAKNFWTQTLHASFNFMKRKGNKAAHLFAWRALLLREFSIWMEDWPLELKS

Query:  CLEMECLEEL
         +  +C   L
Subjt:  CLEMECLEEL

XP_022140628.1 uncharacterized protein LOC111011237 [Momordica charantia]8.7e-9690.82Show/hide
Query:  ELVEWANNYVMEFREANSNPFPGRVTNTAEILWQPPDEGIYKINTDTSFLASDQHAGLGIIIRNDRGQVMATATKYLENIQSVDMAEAVAAVEGLQLASK
        +LVEWAN YVMEFREANSNPFPGRVTNTAE+LW PPD+ IYKINTD SFLASDQHAGLGIIIRNDRGQVMA+ATKYLENIQSVDMAEA+ AVEGLQLASK
Subjt:  ELVEWANNYVMEFREANSNPFPGRVTNTAEILWQPPDEGIYKINTDTSFLASDQHAGLGIIIRNDRGQVMATATKYLENIQSVDMAEAVAAVEGLQLASK

Query:  IGVNPVILETDSSRIFNLFSQPSEDLSKTGEIVLKAKNFWTQTLHASFNFMKRKGNKAAHLFAWRALLLREFSIWMEDWPLELKSCLEMECLEELL
        IGVNPVILETDSSRIFNLFSQPSEDLS+TGEIVLKAKNFWTQ+LHASFNF+KR+GNKAAH+ A RALLLREFSIWMEDWPLELKSCLEMECLEEL+
Subjt:  IGVNPVILETDSSRIFNLFSQPSEDLSKTGEIVLKAKNFWTQTLHASFNFMKRKGNKAAHLFAWRALLLREFSIWMEDWPLELKSCLEMECLEELL

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]3.3e-8753.55Show/hide
Query:  GAEEDKLIWNYEKTGVYLVRSGYKVALLNNSCGQAPSSSSSE----------------------------------------------------------
        GAEED+LIWNYEKTGVY VRSGYKVALLNN C QAPSSSSSE                                                          
Subjt:  GAEEDKLIWNYEKTGVYLVRSGYKVALLNNSCGQAPSSSSSE----------------------------------------------------------

Query:  ----------------------------------------------------------------------ELVEWANNYVMEFREANSNPFPGRVTNTAE
                                                                              ELVEWAN Y MEFREA SNP  GRVTNTAE
Subjt:  ----------------------------------------------------------------------ELVEWANNYVMEFREANSNPFPGRVTNTAE

Query:  ILWQPPDEGIYKINTDTSFLASDQHAGLGIIIRNDRGQVMATATKYLENIQSVDMAEAVAAVEGLQLASKIGVNPVILETDSSRIFNLFSQPSEDLSKTG
        ILWQPPDEGIYKINTD SFLASDQHAGLGIII NDRGQVMA ATKYLENIQSVDMAEA+AAVEGLQLAS+IG++P +                EDLS+TG
Subjt:  ILWQPPDEGIYKINTDTSFLASDQHAGLGIIIRNDRGQVMATATKYLENIQSVDMAEAVAAVEGLQLASKIGVNPVILETDSSRIFNLFSQPSEDLSKTG

Query:  EIVLKAKNFWTQTLHASFNFMKRKGNKAAHLFAWRALLLREFSIWMEDWPLELKSCLEMECLEELL
        EIVLKAKNFWTQ+LHASFNF+KR+GNKAAH+ A RALLL EFSIWMEDWPLELKSCLEMECLEELL
Subjt:  EIVLKAKNFWTQTLHASFNFMKRKGNKAAHLFAWRALLLREFSIWMEDWPLELKSCLEMECLEELL

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]9.2e-1386Show/hide
Query:  GRNKTGNFRNCLKVAEAKLQSAIHDLRFAPNRDAFQHATTNMNQLLKEEE
        GRNKTGNFRN LKVAE  LQSAIHDL FAPNR+AFQ A TNMNQLLKEEE
Subjt:  GRNKTGNFRNCLKVAEAKLQSAIHDLRFAPNRDAFQHATTNMNQLLKEEE

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]3.6e-3345.16Show/hide
Query:  EELVEWANNYVMEFREANSNPFPGR------VTNTAEI-------LWQPPDEGIYKINTDTSFLASDQHAGLG-IIIRNDRGQVMATATKYLENIQSVDM
        ++L  W + Y+  F+  N+N              +++I       +W P +EG++K+ TD SF + D +AGLG IIIR+ RGQV+A+ATKYLE++ SVD 
Subjt:  EELVEWANNYVMEFREANSNPFPGR------VTNTAEI-------LWQPPDEGIYKINTDTSFLASDQHAGLG-IIIRNDRGQVMATATKYLENIQSVDM

Query:  AEAVAAVEGLQLASKIGVNPVILETDSSRIFNLFSQPSEDLSKTGEIVLKAKNFWTQTLHASFNFMKRKGNKAAHLFAWRALLLRE
        AEA+AAVEGL++A + G++P++LETDS RI+NLF++  E LSKTG I+   K      L  S++F KR GN  AHL A RAL  +E
Subjt:  AEAVAAVEGLQLASKIGVNPVILETDSSRIFNLFSQPSEDLSKTGEIVLKAKNFWTQTLHASFNFMKRKGNKAAHLFAWRALLLRE

XP_022150944.1 uncharacterized protein LOC111018973 [Momordica charantia]8.3e-2234.6Show/hide
Query:  NSCGQAPS----SSSSEELVEWANNYVMEFREANSNPFPGRVT--NTAEILWQPPDEGIYKINTDTSFLASDQHAGLGIIIRNDRGQVMATATKYLENIQ
        N+  Q P       S  +LV W+ NY+  ++ A  +     V        +W+PP   + K+N D +F      AG+G+IIR+  G V  TA + L    
Subjt:  NSCGQAPS----SSSSEELVEWANNYVMEFREANSNPFPGRVT--NTAEILWQPPDEGIYKINTDTSFLASDQHAGLGIIIRNDRGQVMATATKYLENIQ

Query:  SVDMAEAVAAVEGLQLASKIGVNPVILETDSSRIFNLFSQPSEDLSKTGEIVLKAKNFW-TQTLHASFNFMKRKGNKAAHLFAWRALLLREFSIWMEDWP
         VD  E  A  EG+ LA + G     +ETDS RIFNL +    D S+ G +    K F  +     SF+F  R GN  AHL A  AL      IW+E+WP
Subjt:  SVDMAEAVAAVEGLQLASKIGVNPVILETDSSRIFNLFSQPSEDLSKTGEIVLKAKNFW-TQTLHASFNFMKRKGNKAAHLFAWRALLLREFSIWMEDWP

Query:  LELKSCLEMEC
         E+ S L ++C
Subjt:  LELKSCLEMEC

TrEMBL top hitse value%identityAlignment
A0A5C7IIT4 Uncharacterized protein1.7e-2525.37Show/hide
Query:  KNWRLTGFYGHPEEANIDLSWDLLRWLRGSNDDPWLVGGDFNGILSHSEKDGGRRKDDKLINEFSKSA--CSIPRTLG----RNKTGNFRNCLKVAEAKL
        K WRLTGFYGHP        W+LLR L G +  PW VGGDFN I+  SEK GG  + +  +  F ++   C + R LG    R    N R+     + +L
Subjt:  KNWRLTGFYGHPEEANIDLSWDLLRWLRGSNDDPWLVGGDFNGILSHSEKDGGRRKDDKLINEFSKSA--CSIPRTLG----RNKTGNFRNCLKVAEAKL

Query:  QSAIHDLRFAPNRDAFQ------HATTNMNQLLKEEERIMTSGAEE-------------------------DKLIWNYEKTGV----YLVRSGYKVALLN
           + +  +      F         + +   LL+  ++   +G                            D +++   K  +    +L    ++V    
Subjt:  QSAIHDLRFAPNRDAFQ------HATTNMNQLLKEEERIMTSGAEE-------------------------DKLIWNYEKTGV----YLVRSGYKVALLN

Query:  NSCGQAPSSSSSE--ELVEWANNYVMEFREANSNPFPGRVTNTAEILWQPPDEGIYKINTDTSFLASDQHAGLGIIIRNDRGQVMATATKYLENIQSVDM
        N      SS +    ++++WA +++ +F+ A +      V       W+P   G YKINTD +     +  G+G++IR+  G VMA+  +    +     
Subjt:  NSCGQAPSSSSSE--ELVEWANNYVMEFREANSNPFPGRVTNTAEILWQPPDEGIYKINTDTSFLASDQHAGLGIIIRNDRGQVMATATKYLENIQSVDM

Query:  AEAVAAVEGLQLASKIGVNPVILETDSSRIFNLFSQPSEDLSKTGEIVLKAKNFWTQTLHASFNFMKRKGNKAAHLFAWRALLLREFSIWMEDWPLELKS
         EAVA + G +LA + G+ P  +E+DS  + NL +      ++ G ++       + +  +S +F+ R  N  AH  A  +L      +W+ED PL ++S
Subjt:  AEAVAAVEGLQLASKIGVNPVILETDSSRIFNLFSQPSEDLSKTGEIVLKAKNFWTQTLHASFNFMKRKGNKAAHLFAWRALLLREFSIWMEDWPLELKS

Query:  CLEMECLEEL
         +  +C   L
Subjt:  CLEMECLEEL

A0A6J1CIF1 uncharacterized protein LOC1110112374.2e-9690.82Show/hide
Query:  ELVEWANNYVMEFREANSNPFPGRVTNTAEILWQPPDEGIYKINTDTSFLASDQHAGLGIIIRNDRGQVMATATKYLENIQSVDMAEAVAAVEGLQLASK
        +LVEWAN YVMEFREANSNPFPGRVTNTAE+LW PPD+ IYKINTD SFLASDQHAGLGIIIRNDRGQVMA+ATKYLENIQSVDMAEA+ AVEGLQLASK
Subjt:  ELVEWANNYVMEFREANSNPFPGRVTNTAEILWQPPDEGIYKINTDTSFLASDQHAGLGIIIRNDRGQVMATATKYLENIQSVDMAEAVAAVEGLQLASK

Query:  IGVNPVILETDSSRIFNLFSQPSEDLSKTGEIVLKAKNFWTQTLHASFNFMKRKGNKAAHLFAWRALLLREFSIWMEDWPLELKSCLEMECLEELL
        IGVNPVILETDSSRIFNLFSQPSEDLS+TGEIVLKAKNFWTQ+LHASFNF+KR+GNKAAH+ A RALLLREFSIWMEDWPLELKSCLEMECLEEL+
Subjt:  IGVNPVILETDSSRIFNLFSQPSEDLSKTGEIVLKAKNFWTQTLHASFNFMKRKGNKAAHLFAWRALLLREFSIWMEDWPLELKSCLEMECLEELL

A0A6J1DAR4 uncharacterized protein LOC1110189541.6e-8753.55Show/hide
Query:  GAEEDKLIWNYEKTGVYLVRSGYKVALLNNSCGQAPSSSSSE----------------------------------------------------------
        GAEED+LIWNYEKTGVY VRSGYKVALLNN C QAPSSSSSE                                                          
Subjt:  GAEEDKLIWNYEKTGVYLVRSGYKVALLNNSCGQAPSSSSSE----------------------------------------------------------

Query:  ----------------------------------------------------------------------ELVEWANNYVMEFREANSNPFPGRVTNTAE
                                                                              ELVEWAN Y MEFREA SNP  GRVTNTAE
Subjt:  ----------------------------------------------------------------------ELVEWANNYVMEFREANSNPFPGRVTNTAE

Query:  ILWQPPDEGIYKINTDTSFLASDQHAGLGIIIRNDRGQVMATATKYLENIQSVDMAEAVAAVEGLQLASKIGVNPVILETDSSRIFNLFSQPSEDLSKTG
        ILWQPPDEGIYKINTD SFLASDQHAGLGIII NDRGQVMA ATKYLENIQSVDMAEA+AAVEGLQLAS+IG++P +                EDLS+TG
Subjt:  ILWQPPDEGIYKINTDTSFLASDQHAGLGIIIRNDRGQVMATATKYLENIQSVDMAEAVAAVEGLQLASKIGVNPVILETDSSRIFNLFSQPSEDLSKTG

Query:  EIVLKAKNFWTQTLHASFNFMKRKGNKAAHLFAWRALLLREFSIWMEDWPLELKSCLEMECLEELL
        EIVLKAKNFWTQ+LHASFNF+KR+GNKAAH+ A RALLL EFSIWMEDWPLELKSCLEMECLEELL
Subjt:  EIVLKAKNFWTQTLHASFNFMKRKGNKAAHLFAWRALLLREFSIWMEDWPLELKSCLEMECLEELL

A0A6J1DAR4 uncharacterized protein LOC1110189544.5e-1386Show/hide
Query:  GRNKTGNFRNCLKVAEAKLQSAIHDLRFAPNRDAFQHATTNMNQLLKEEE
        GRNKTGNFRN LKVAE  LQSAIHDL FAPNR+AFQ A TNMNQLLKEEE
Subjt:  GRNKTGNFRNCLKVAEAKLQSAIHDLRFAPNRDAFQHATTNMNQLLKEEE

A0A6J1DAR4 uncharacterized protein LOC1110189541.7e-3345.16Show/hide
Query:  EELVEWANNYVMEFREANSNPFPGR------VTNTAEI-------LWQPPDEGIYKINTDTSFLASDQHAGLG-IIIRNDRGQVMATATKYLENIQSVDM
        ++L  W + Y+  F+  N+N              +++I       +W P +EG++K+ TD SF + D +AGLG IIIR+ RGQV+A+ATKYLE++ SVD 
Subjt:  EELVEWANNYVMEFREANSNPFPGR------VTNTAEI-------LWQPPDEGIYKINTDTSFLASDQHAGLG-IIIRNDRGQVMATATKYLENIQSVDM

Query:  AEAVAAVEGLQLASKIGVNPVILETDSSRIFNLFSQPSEDLSKTGEIVLKAKNFWTQTLHASFNFMKRKGNKAAHLFAWRALLLRE
        AEA+AAVEGL++A + G++P++LETDS RI+NLF++  E LSKTG I+   K      L  S++F KR GN  AHL A RAL  +E
Subjt:  AEAVAAVEGLQLASKIGVNPVILETDSSRIFNLFSQPSEDLSKTGEIVLKAKNFWTQTLHASFNFMKRKGNKAAHLFAWRALLLRE

A0A6J1DBJ7 uncharacterized protein LOC1110189734.0e-2234.6Show/hide
Query:  NSCGQAPS----SSSSEELVEWANNYVMEFREANSNPFPGRVT--NTAEILWQPPDEGIYKINTDTSFLASDQHAGLGIIIRNDRGQVMATATKYLENIQ
        N+  Q P       S  +LV W+ NY+  ++ A  +     V        +W+PP   + K+N D +F      AG+G+IIR+  G V  TA + L    
Subjt:  NSCGQAPS----SSSSEELVEWANNYVMEFREANSNPFPGRVT--NTAEILWQPPDEGIYKINTDTSFLASDQHAGLGIIIRNDRGQVMATATKYLENIQ

Query:  SVDMAEAVAAVEGLQLASKIGVNPVILETDSSRIFNLFSQPSEDLSKTGEIVLKAKNFW-TQTLHASFNFMKRKGNKAAHLFAWRALLLREFSIWMEDWP
         VD  E  A  EG+ LA + G     +ETDS RIFNL +    D S+ G +    K F  +     SF+F  R GN  AHL A  AL      IW+E+WP
Subjt:  SVDMAEAVAAVEGLQLASKIGVNPVILETDSSRIFNLFSQPSEDLSKTGEIVLKAKNFW-TQTLHASFNFMKRKGNKAAHLFAWRALLLREFSIWMEDWP

Query:  LELKSCLEMEC
         E+ S L ++C
Subjt:  LELKSCLEMEC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein4.7e-0726.79Show/hide
Query:  EELVEWANNYVMEFREANSNPFPGRVTNTAEILWQPPDEGIYKINTDTSFLASDQHAGLGIIIRNDRGQVMATATKYLENIQSVDMAEAVAAVEGLQLAS
        E+  EW+       RE        +V     + W+ P     K NTD ++   +   G+G I+RN+ G V+    + L   ++V  AE  A    +   S
Subjt:  EELVEWANNYVMEFREANSNPFPGRVTNTAEILWQPPDEGIYKINTDTSFLASDQHAGLGIIIRNDRGQVMATATKYLENIQSVDMAEAVAAVEGLQLAS

Query:  KIGVNPVILETDSSRIFNLFSQPSEDLSKTGEIVLKAKNFWTQTLH----ASFNFMKRKGNKAAHLFA
        +     +I E+D+  + NL +  S+D   T +  L+      Q LH      F F  R GNK A   A
Subjt:  KIGVNPVILETDSSRIFNLFSQPSEDLSKTGEIVLKAKNFWTQTLH----ASFNFMKRKGNKAAHLFA

AT4G29090.1 Ribonuclease H-like superfamily protein4.3e-0825.43Show/hide
Query:  SSEELVEWANNYVMEFR---EANSNPFPGRVTNTAEILWQPPDEGIYKINTDTSFLASDQHAGLGIIIRNDRGQVMATATKYLENIQSVDMAEAVAAVEG
        +++E++  A + + E+R   EA S     +V  ++   W+PP     K NTD ++   ++  G+G ++RN++G+V     + L  ++SV  AE  A    
Subjt:  SSEELVEWANNYVMEFR---EANSNPFPGRVTNTAEILWQPPDEGIYKINTDTSFLASDQHAGLGIIIRNDRGQVMATATKYLENIQSVDMAEAVAAVEG

Query:  LQLASKIGVNPVILETDSSRIFNLFSQPSEDLSKTGEIVLKAKNFWTQTLHASFNFMKRKGNKAAHLFAWRAL
        +   S+   N VI E+DS  +  + +   E        +   +   +Q     F F+ R+GN  A   A  +L
Subjt:  LQLASKIGVNPVILETDSSRIFNLFSQPSEDLSKTGEIVLKAKNFWTQTLHASFNFMKRKGNKAAHLFAWRAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGGTTGATGTTATAAGTGAAAGTCCTACAACCACGGATAAAAATAATGGGGGTGAGCAAATTGCTGATTTGATGGATTTGAAGGTTATGGAAAGCATGCGGAAGCC
AACTAATAAGGCAAAAGGGCCTTTGGAGTCATTTTGTGTGAGTGGACCTTCTTCTGGTAATGGGCTAAATAAGAGGGCGGGATGGAAAAGATTACCTCGTGAAGCCCAAT
CAAATTTGGGTATGGCTTCTATGCCTTTGGGGAAACGAAAGGAGTATTTGGATGATGCTGATTTGGGTACTAAAGGGGACAAACTTCACGCACGGAAGCAAAAACAAATT
AAAATTGGAGTGGCAGATGATAGTTTGGTTATTGGCTCAGGCCCACAGGGTGTTTCTGATTGTTTGGCGGTGGCTGATTCTCAGCCCGCCTACCCCAATGATTGGCCTAT
TTTAGAATGGACAAAGGCAACAAAAGAAAAGATGAATAATTTGAAGTTTCGGCTTGGGTTTCACAATTGTATTGTCGTTGATTGTCAAGAAAAAGTGGTGGAGTTTGAGG
GTAAAAATTGGCGGCTTACAGGATTTTACGGCCACCCAGAGGAGGCGAACATAGATTTATCTTGGGACCTTTTGCGTTGGTTACGAGGAAGCAATGATGATCCTTGGCTA
GTTGGTGGTGATTTCAATGGTATCTTGAGTCATAGTGAAAAAGATGGTGGGAGAAGGAAAGACGACAAACTAATTAATGAATTTTCTAAATCAGCGTGCTCTATCCCTCG
CACACTGGGGCGCAATAAAACAGGGAATTTTCGAAATTGTCTAAAGGTAGCTGAAGCTAAGTTGCAATCTGCAATCCACGATCTTCGTTTTGCTCCAAATCGAGATGCAT
TTCAACATGCCACAACAAATATGAATCAGTTATTAAAGGAAGAGGAGCGCATCATGACATCGGGTGCGGAAGAGGATAAGCTAATCTGGAACTATGAGAAGACAGGGGTG
TACTTGGTCAGAAGTGGGTATAAGGTGGCTTTGCTGAATAATTCATGTGGTCAGGCCCCCTCCTCTTCATCTTCTGAAGAGCTTGTTGAATGGGCAAATAATTATGTTAT
GGAGTTTAGGGAAGCTAACTCTAATCCTTTTCCGGGGAGAGTTACAAATACAGCAGAGATTTTATGGCAACCACCAGACGAAGGAATATATAAAATTAACACTGATACCT
CTTTTTTAGCTTCAGATCAGCATGCAGGATTAGGAATCATCATCCGTAATGATAGAGGGCAAGTTATGGCTACAGCTACGAAGTACCTGGAGAATATTCAATCGGTGGAT
ATGGCGGAAGCGGTTGCTGCAGTGGAGGGACTTCAACTGGCGTCGAAAATTGGTGTCAACCCAGTGATCTTGGAGACGGATTCATCTCGTATTTTCAATCTTTTCTCTCA
GCCTTCGGAGGACCTGTCGAAAACGGGAGAAATCGTTTTGAAGGCGAAGAATTTCTGGACTCAAACTTTACATGCAAGTTTCAATTTCATGAAAAGGAAGGGTAATAAAG
CGGCTCACCTGTTTGCTTGGCGGGCTCTCCTTCTTCGTGAGTTTTCGATCTGGATGGAGGATTGGCCACTAGAGTTGAAGAGCTGTTTAGAAATGGAATGTTTGGAGGAG
CTTCTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCGGTTGATGTTATAAGTGAAAGTCCTACAACCACGGATAAAAATAATGGGGGTGAGCAAATTGCTGATTTGATGGATTTGAAGGTTATGGAAAGCATGCGGAAGCC
AACTAATAAGGCAAAAGGGCCTTTGGAGTCATTTTGTGTGAGTGGACCTTCTTCTGGTAATGGGCTAAATAAGAGGGCGGGATGGAAAAGATTACCTCGTGAAGCCCAAT
CAAATTTGGGTATGGCTTCTATGCCTTTGGGGAAACGAAAGGAGTATTTGGATGATGCTGATTTGGGTACTAAAGGGGACAAACTTCACGCACGGAAGCAAAAACAAATT
AAAATTGGAGTGGCAGATGATAGTTTGGTTATTGGCTCAGGCCCACAGGGTGTTTCTGATTGTTTGGCGGTGGCTGATTCTCAGCCCGCCTACCCCAATGATTGGCCTAT
TTTAGAATGGACAAAGGCAACAAAAGAAAAGATGAATAATTTGAAGTTTCGGCTTGGGTTTCACAATTGTATTGTCGTTGATTGTCAAGAAAAAGTGGTGGAGTTTGAGG
GTAAAAATTGGCGGCTTACAGGATTTTACGGCCACCCAGAGGAGGCGAACATAGATTTATCTTGGGACCTTTTGCGTTGGTTACGAGGAAGCAATGATGATCCTTGGCTA
GTTGGTGGTGATTTCAATGGTATCTTGAGTCATAGTGAAAAAGATGGTGGGAGAAGGAAAGACGACAAACTAATTAATGAATTTTCTAAATCAGCGTGCTCTATCCCTCG
CACACTGGGGCGCAATAAAACAGGGAATTTTCGAAATTGTCTAAAGGTAGCTGAAGCTAAGTTGCAATCTGCAATCCACGATCTTCGTTTTGCTCCAAATCGAGATGCAT
TTCAACATGCCACAACAAATATGAATCAGTTATTAAAGGAAGAGGAGCGCATCATGACATCGGGTGCGGAAGAGGATAAGCTAATCTGGAACTATGAGAAGACAGGGGTG
TACTTGGTCAGAAGTGGGTATAAGGTGGCTTTGCTGAATAATTCATGTGGTCAGGCCCCCTCCTCTTCATCTTCTGAAGAGCTTGTTGAATGGGCAAATAATTATGTTAT
GGAGTTTAGGGAAGCTAACTCTAATCCTTTTCCGGGGAGAGTTACAAATACAGCAGAGATTTTATGGCAACCACCAGACGAAGGAATATATAAAATTAACACTGATACCT
CTTTTTTAGCTTCAGATCAGCATGCAGGATTAGGAATCATCATCCGTAATGATAGAGGGCAAGTTATGGCTACAGCTACGAAGTACCTGGAGAATATTCAATCGGTGGAT
ATGGCGGAAGCGGTTGCTGCAGTGGAGGGACTTCAACTGGCGTCGAAAATTGGTGTCAACCCAGTGATCTTGGAGACGGATTCATCTCGTATTTTCAATCTTTTCTCTCA
GCCTTCGGAGGACCTGTCGAAAACGGGAGAAATCGTTTTGAAGGCGAAGAATTTCTGGACTCAAACTTTACATGCAAGTTTCAATTTCATGAAAAGGAAGGGTAATAAAG
CGGCTCACCTGTTTGCTTGGCGGGCTCTCCTTCTTCGTGAGTTTTCGATCTGGATGGAGGATTGGCCACTAGAGTTGAAGAGCTGTTTAGAAATGGAATGTTTGGAGGAG
CTTCTGTAA
Protein sequenceShow/hide protein sequence
MSVDVISESPTTTDKNNGGEQIADLMDLKVMESMRKPTNKAKGPLESFCVSGPSSGNGLNKRAGWKRLPREAQSNLGMASMPLGKRKEYLDDADLGTKGDKLHARKQKQI
KIGVADDSLVIGSGPQGVSDCLAVADSQPAYPNDWPILEWTKATKEKMNNLKFRLGFHNCIVVDCQEKVVEFEGKNWRLTGFYGHPEEANIDLSWDLLRWLRGSNDDPWL
VGGDFNGILSHSEKDGGRRKDDKLINEFSKSACSIPRTLGRNKTGNFRNCLKVAEAKLQSAIHDLRFAPNRDAFQHATTNMNQLLKEEERIMTSGAEEDKLIWNYEKTGV
YLVRSGYKVALLNNSCGQAPSSSSSEELVEWANNYVMEFREANSNPFPGRVTNTAEILWQPPDEGIYKINTDTSFLASDQHAGLGIIIRNDRGQVMATATKYLENIQSVD
MAEAVAAVEGLQLASKIGVNPVILETDSSRIFNLFSQPSEDLSKTGEIVLKAKNFWTQTLHASFNFMKRKGNKAAHLFAWRALLLREFSIWMEDWPLELKSCLEMECLEE
LL