CuGenDBv2

Sgr020980 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr020980
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Endo/exonuclease/phosphatase domain-containing protein
Genome location: tig00153578:514185..527960
RNA-Seq Expression: Sgr020980
Synteny: Sgr020980
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
  IPR044730 - Ribonuclease H-like domain, plant type


Homology
GenBank top hits (e-value, % identity, alignment)
TXG69190.1 hypothetical protein EZV62_004125 [Acer yangbiense] (e-value: 7.2e-20; identity: 25.17%)
Query:  GLCLFWEDEVDVTIRSYSSNHIDWFISWDAGN-WQFTGFYGQPTANNRYLTWELLRRLHNSDDSAWVVGGDLNELLRYSKKEN-----------------
        GLCL W+ E+DV + SYS  HID  ++      W+ TGFYG P    R   W LLRRL       W VGGD NE++  S+K                   
Subjt:  GLCLFWEDEVDVTIRSYSSNHIDWFISWDAGN-WQFTGFYGQPTANNRYLTWELLRRLHNSDDSAWVVGGDLNELLRYSKKEN-----------------

Query:  ---------------------------------------------VQVNHLDWSKLDHRAIELVLCKQQE------------CQWNKRQHLNFAIKESWD
                                                       + HLD+ K DHR I L +  ++E            C W +R      ++  + 
Subjt:  ---------------------------------------------VQVNHLDWSKLDHRAIELVLCKQQE------------CQWNKRQHLNFAIKESWD

Query:  SLPDCRRIITIEGR------------WDSVKDRAKPLDYH-----LH--------ASFAALRLWGK---EGS---------WNPPSIAWIKLNSNAAYCA
         LP    ++  +G+            W  V  R   L Y      +H        ASF       K    GS         W P      K+N++A    
Subjt:  SLPDCRRIITIEGR------------WDSVKDRAKPLDYH-----LH--------ASFAALRLWGK---EGS---------WNPPSIAWIKLNSNAAYCA

Query:  NKSSTGVGVVLRDNLGKIVTAMHKSFDVRFSPLHVKLHATLEGVCLAQQMGYSRIVVKLDYRETILLLNQQIETLSEATNWVEDIPSITNTFCFFSFLHV
            TG+GVV+RD  G ++ ++ +SF     P  V+  A L G  LA + G     ++ D    + L+N      +E    + DI ++ +   F S   V
Subjt:  NKSSTGVGVVLRDNLGKIVTAMHKSFDVRFSPLHVKLHATLEGVCLAQQMGYSRIVVKLDYRETILLLNQQIETLSEATNWVEDIPSITNTFCFFSFLHV

Query:  KREMNGVTHSLAKEAIRFHLSILWNDNYP
         R  N V HSLAK ++ F    +W ++ P
Subjt:  KREMNGVTHSLAKEAIRFHLSILWNDNYP

TXG73106.1 hypothetical protein EZV62_001685 [Acer yangbiense] (e-value: 2.2e-16; identity: 36.47%)
Query:  GLCLFWEDEVDVTIRSYSSNHIDWFISW-DAGNWQFTGFYGQPTANNRYLTWELLRRLHNSDDSAWVVGGDLNELLRYSKK-----ENVQVNHLDWSKLD
        GL L W+D VD+TI  +SS HID  +   D   W F+GFYG P ANNR  +W LLRRL ++D+  WV GGD NELL  + K     +NV V+  +   L 
Subjt:  GLCLFWEDEVDVTIRSYSSNHIDWFISW-DAGNWQFTGFYGQPTANNRYLTWELLRRLHNSDDSAWVVGGDLNELLRYSKK-----ENVQVNHLDWSKLD

Query:  HRAIELVLCKQQECQWNKRQHLNFAIKESWDSLPDCRRIITIEGRWDSVKDRAKPLDYHLHASFAALRLW
            +L         WN ++     I+E  D      R +   G  D  KD       HLH +     +W
Subjt:  HRAIELVLCKQQECQWNKRQHLNFAIKESWDSLPDCRRIITIEGRWDSVKDRAKPLDYHLHASFAALRLW

VVA21182.1 PREDICTED: reverse mRNAase [Prunus dulcis] (e-value: 2.5e-20; identity: 26.89%)
Query:  GLCLFWEDEVDVTIRSYSSNHIDWFIS--WDAGNWQFTGFYGQPTANNRYLTWELLRRLHNSDDSAWVVGGDLNELLRYSKKENVQVNHLDWSKLDHRAI
        GLCL W +E+ VT  S+ +NHID  +      G W+FTGFYG P    R+ +W+LLRRL  ++   W+  GD NE+LR  +K             DH   
Subjt:  GLCLFWEDEVDVTIRSYSSNHIDWFIS--WDAGNWQFTGFYGQPTANNRYLTWELLRRLHNSDDSAWVVGGDLNELLRYSKKENVQVNHLDWSKLDHRAI

Query:  ELVLCKQQECQWNKRQHLNFAIKESWDSLPDCRRIITIEGRWDSVKDRAKPLDYHLHASFAALRLWGKEGSWNPPSIAWIKLNSNAAYCANKSSTGVGVV
          VL K   C W      N   + +W         I + G    +++  +        +  A+  W +     PP  A IK+N N  Y    S  GVG+V
Subjt:  ELVLCKQQECQWNKRQHLNFAIKESWDSLPDCRRIITIEGRWDSVKDRAKPLDYHLHASFAALRLWGKEGSWNPPSIAWIKLNSNAAYCANKSSTGVGVV

Query:  LRDNLGKIVTAMHKSFDVRFSPLHVKLHATLEGVCLAQQMGYSRIVVKLDYRETILLLNQQIETLSEATNWVEDIPSITNTFCFFSFLHVKREMNGVTHS
        +R++ G  +       +   +P+H KL A  EG+  A +       ++ D +  +  + +    LS      ED  ++  +         +R  NGV   
Subjt:  LRDNLGKIVTAMHKSFDVRFSPLHVKLHATLEGVCLAQQMGYSRIVVKLDYRETILLLNQQIETLSEATNWVEDIPSITNTFCFFSFLHVKREMNGVTHS

Query:  LAKEAIRFHLSILWNDNYPQWLIDATEVELL
        LA+ A+       W  + P +L D   ++++
Subjt:  LAKEAIRFHLSILWNDNYPQWLIDATEVELL

XP_022154440.1 uncharacterized protein LOC111021711 [Momordica charantia] (e-value: 1.6e-19; identity: 35.09%)
Query:  DYHLHASFAALRLWGKE-GSWNPPSIAWIKLNSNAAYCANKSSTGVGVVLRDNLGKIVTAMHKSFDVRFSPLHVKLHATLEGVCLAQQMGYSRIVVKLDY
        D HL  S  A    GK+   W PPS  W+K+N + A   + S TG+GVV +++ G+I+ AM +  DV + PL V+  A LEG+ L Q +    + V  D 
Subjt:  DYHLHASFAALRLWGKE-GSWNPPSIAWIKLNSNAAYCANKSSTGVGVVLRDNLGKIVTAMHKSFDVRFSPLHVKLHATLEGVCLAQQMGYSRIVVKLDY

Query:  RETILLLNQQIETLSEATNWVEDIPSITNTFCFFSFLHVKREMNGVTHSLAKEAIRFHLSILWNDNYPQWL
        + +I L++Q+ +   +   WV+DI      F    F+H++R  + VTH LA  AI  + + +W   +P WL
Subjt:  RETILLLNQQIETLSEATNWVEDIPSITNTFCFFSFLHVKREMNGVTHSLAKEAIRFHLSILWNDNYPQWL

XP_022158489.1 uncharacterized protein LOC111024968 [Momordica charantia] (e-value: 1.3e-16; identity: 37.09%)
Query:  WNPPSIAWIKLNSNAAYCANKSSTGVGVVLRDNLGKIVTAMHKSFDVRFSPLHVKLHATLEGVCLAQQMGYSRIVVKLDYRETILLLNQQIETLSEATNW
        W PP    +K+N++AA   ++  +GVGV+LR+   +IV AM   F +  +PL  K+ A  EG+ LA ++G  R+VV+ D  E + L+  +     EA +W
Subjt:  WNPPSIAWIKLNSNAAYCANKSSTGVGVVLRDNLGKIVTAMHKSFDVRFSPLHVKLHATLEGVCLAQQMGYSRIVVKLDYRETILLLNQQIETLSEATNW

Query:  VEDIPSITNTFCFFSFLHVKREMNGVTHSLAKEAIRFHLSILWNDNYPQWL
        VEDI +    F    F HV RE N V + L +E I      LW  ++P WL
Subjt:  VEDIPSITNTFCFFSFLHVKREMNGVTHSLAKEAIRFHLSILWNDNYPQWL

TrEMBL top hits (e-value, % identity, alignment)
A0A5C7IIT4 Uncharacterized protein (e-value: 3.5e-20; identity: 25.17%)
Query:  GLCLFWEDEVDVTIRSYSSNHIDWFISWDAGN-WQFTGFYGQPTANNRYLTWELLRRLHNSDDSAWVVGGDLNELLRYSKKEN-----------------
        GLCL W+ E+DV + SYS  HID  ++      W+ TGFYG P    R   W LLRRL       W VGGD NE++  S+K                   
Subjt:  GLCLFWEDEVDVTIRSYSSNHIDWFISWDAGN-WQFTGFYGQPTANNRYLTWELLRRLHNSDDSAWVVGGDLNELLRYSKKEN-----------------

Query:  ---------------------------------------------VQVNHLDWSKLDHRAIELVLCKQQE------------CQWNKRQHLNFAIKESWD
                                                       + HLD+ K DHR I L +  ++E            C W +R      ++  + 
Subjt:  ---------------------------------------------VQVNHLDWSKLDHRAIELVLCKQQE------------CQWNKRQHLNFAIKESWD

Query:  SLPDCRRIITIEGR------------WDSVKDRAKPLDYH-----LH--------ASFAALRLWGK---EGS---------WNPPSIAWIKLNSNAAYCA
         LP    ++  +G+            W  V  R   L Y      +H        ASF       K    GS         W P      K+N++A    
Subjt:  SLPDCRRIITIEGR------------WDSVKDRAKPLDYH-----LH--------ASFAALRLWGK---EGS---------WNPPSIAWIKLNSNAAYCA

Query:  NKSSTGVGVVLRDNLGKIVTAMHKSFDVRFSPLHVKLHATLEGVCLAQQMGYSRIVVKLDYRETILLLNQQIETLSEATNWVEDIPSITNTFCFFSFLHV
            TG+GVV+RD  G ++ ++ +SF     P  V+  A L G  LA + G     ++ D    + L+N      +E    + DI ++ +   F S   V
Subjt:  NKSSTGVGVVLRDNLGKIVTAMHKSFDVRFSPLHVKLHATLEGVCLAQQMGYSRIVVKLDYRETILLLNQQIETLSEATNWVEDIPSITNTFCFFSFLHV

Query:  KREMNGVTHSLAKEAIRFHLSILWNDNYP
         R  N V HSLAK ++ F    +W ++ P
Subjt:  KREMNGVTHSLAKEAIRFHLSILWNDNYP

A0A5E4F0G7 PREDICTED: reverse mRNAase (e-value: 1.2e-20; identity: 26.89%)
Query:  GLCLFWEDEVDVTIRSYSSNHIDWFIS--WDAGNWQFTGFYGQPTANNRYLTWELLRRLHNSDDSAWVVGGDLNELLRYSKKENVQVNHLDWSKLDHRAI
        GLCL W +E+ VT  S+ +NHID  +      G W+FTGFYG P    R+ +W+LLRRL  ++   W+  GD NE+LR  +K             DH   
Subjt:  GLCLFWEDEVDVTIRSYSSNHIDWFIS--WDAGNWQFTGFYGQPTANNRYLTWELLRRLHNSDDSAWVVGGDLNELLRYSKKENVQVNHLDWSKLDHRAI

Query:  ELVLCKQQECQWNKRQHLNFAIKESWDSLPDCRRIITIEGRWDSVKDRAKPLDYHLHASFAALRLWGKEGSWNPPSIAWIKLNSNAAYCANKSSTGVGVV
          VL K   C W      N   + +W         I + G    +++  +        +  A+  W +     PP  A IK+N N  Y    S  GVG+V
Subjt:  ELVLCKQQECQWNKRQHLNFAIKESWDSLPDCRRIITIEGRWDSVKDRAKPLDYHLHASFAALRLWGKEGSWNPPSIAWIKLNSNAAYCANKSSTGVGVV

Query:  LRDNLGKIVTAMHKSFDVRFSPLHVKLHATLEGVCLAQQMGYSRIVVKLDYRETILLLNQQIETLSEATNWVEDIPSITNTFCFFSFLHVKREMNGVTHS
        +R++ G  +       +   +P+H KL A  EG+  A +       ++ D +  +  + +    LS      ED  ++  +         +R  NGV   
Subjt:  LRDNLGKIVTAMHKSFDVRFSPLHVKLHATLEGVCLAQQMGYSRIVVKLDYRETILLLNQQIETLSEATNWVEDIPSITNTFCFFSFLHVKREMNGVTHS

Query:  LAKEAIRFHLSILWNDNYPQWLIDATEVELL
        LA+ A+       W  + P +L D   ++++
Subjt:  LAKEAIRFHLSILWNDNYPQWLIDATEVELL

A0A6J1DJL6 uncharacterized protein LOC111021711 (e-value: 7.7e-20; identity: 35.09%)
Query:  DYHLHASFAALRLWGKE-GSWNPPSIAWIKLNSNAAYCANKSSTGVGVVLRDNLGKIVTAMHKSFDVRFSPLHVKLHATLEGVCLAQQMGYSRIVVKLDY
        D HL  S  A    GK+   W PPS  W+K+N + A   + S TG+GVV +++ G+I+ AM +  DV + PL V+  A LEG+ L Q +    + V  D 
Subjt:  DYHLHASFAALRLWGKE-GSWNPPSIAWIKLNSNAAYCANKSSTGVGVVLRDNLGKIVTAMHKSFDVRFSPLHVKLHATLEGVCLAQQMGYSRIVVKLDY

Query:  RETILLLNQQIETLSEATNWVEDIPSITNTFCFFSFLHVKREMNGVTHSLAKEAIRFHLSILWNDNYPQWL
        + +I L++Q+ +   +   WV+DI      F    F+H++R  + VTH LA  AI  + + +W   +P WL
Subjt:  RETILLLNQQIETLSEATNWVEDIPSITNTFCFFSFLHVKREMNGVTHSLAKEAIRFHLSILWNDNYPQWL

A0A803PI84 Uncharacterized protein (e-value: 2.9e-19; identity: 22.92%)
Query:  GLCLFWEDEVDVTIRSYSSNHIDWFISWDAG-NWQFTGFYGQPTANNRYLTWELLRRLHN-SDDSAWVVGGDLNELLRYSKKENVQVNHLDWS-----KL
        G+ L W++ VDV++ S ++NH D F+S+D G  W F+  YG P + N+  TWELL RL + S    W++ GDLNE+  +S K    + H +        L
Subjt:  GLCLFWEDEVDVTIRSYSSNHIDWFISWDAG-NWQFTGFYGQPTANNRYLTWELLRRLHN-SDDSAWVVGGDLNELLRYSKKENVQVNHLDWS-----KL

Query:  DHRAIELVLCKQQECQWNKRQHLNFAIKES---WDSLPDCR---RIITIEGRWDSVK------DRAKPLDYHLHASFAAL------------RLWGKEGS
        D   ++    +  E  W K +    A+KES    ++   C    R+ ++ G+   V         +  L    H +F  +             +  K G 
Subjt:  DHRAIELVLCKQQECQWNKRQHLNFAIKES---WDSLPDCR---RIITIEGRWDSVK------DRAKPLDYHLHASFAAL------------RLWGKEGS

Query:  WNPPSIAW-------------------------------------------------IKLNSNAAYCANKSSTGVGVVLRDNLGKIVTAMHKSFDVRFSP
        W     +W                                                 +K+N +AA   +++  GVG++++++ G++V A  K    R+ P
Subjt:  WNPPSIAW-------------------------------------------------IKLNSNAAYCANKSSTGVGVVLRDNLGKIVTAMHKSFDVRFSP

Query:  LHVKLHATLEGVCLAQQMGYSRIVVKLDYRETILLLNQQIETLSEATNWVEDIPSITNTFCFFSFLHVKREMNGVTHSLAKEAIRFHLSILWNDNYP
          ++  A + G+  A     S  +++ D    +  +N     +S   + V DI +  +        HVK++ N   H LAK+A+      +W +  P
Subjt:  LHVKLHATLEGVCLAQQMGYSRIVVKLDYRETILLLNQQIETLSEATNWVEDIPSITNTFCFFSFLHVKREMNGVTHSLAKEAIRFHLSILWNDNYP

A0A803PSG1 Uncharacterized protein (e-value: 1.4e-21; identity: 26.78%)
Query:  LFWEDEVDVTIRSYSSNHIDWFISW-DAGNWQFTGFYGQPTANNRYLTWELLRRLHN-SDDSAWVVGGDLNELLRYSKKENVQVNHLDWSKLDHRAIEL-
        L W+ + D+TI +YSSNHID F+ + D  ++ FTGFYG P  N R  TW  L+R  N +    W+V GD NE+L  ++K    + +    +     I+L 
Subjt:  LFWEDEVDVTIRSYSSNHIDWFISW-DAGNWQFTGFYGQPTANNRYLTWELLRRLHN-SDDSAWVVGGDLNELLRYSKKENVQVNHLDWSKLDHRAIEL-

Query:  ----VLCKQQECQW-NKRQ----------HLNFAIKESWDSLPDCRRIITIEGRWDSVKDRAKPLDYHLHA-SFAALR-LWGKEG---------------
            ++ + + C W NK Q             F  ++ W     C  II  +   +S     + +D      SFA LR  +G  G               
Subjt:  ----VLCKQQECQW-NKRQ----------HLNFAIKESWDSLPDCRRIITIEGRWDSVKDRAKPLDYHLHA-SFAALR-LWGKEG---------------

Query:  --SWNPPSIAWIKLNSNAAYCANKSSTGVGVVLRDNLGKIVTAMHKSFDVRFSPLHVKLHATLEGVCLAQQMGYSRIVVKLDYRETILLLNQQIETLSEA
           W  P  + +KLN++A+   + S  G+G +LR++ G+IV AM  S      P  ++  A    +    Q+  S  +++ D    +  L +    L+  
Subjt:  --SWNPPSIAWIKLNSNAAYCANKSSTGVGVVLRDNLGKIVTAMHKSFDVRFSPLHVKLHATLEGVCLAQQMGYSRIVVKLDYRETILLLNQQIETLSEA

Query:  TNWVEDIPSITNTFCFFSFLHVKREMNGVTHSLAKEAIRFHLSILWNDNYP
           + D+  + + F      HV R     TH LAK A+      +W +  P
Subjt:  TNWVEDIPSITNTFCFFSFLHVKREMNGVTHSLAKEAIRFHLSILWNDNYP

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT2G02650.1 Ribonuclease H-like superfamily protein (e-value: 2.6e-04; identity: 21.38%)
Query:  WNPPSIAWIKLNSNAAYCANKSSTGVGVVLRDNLGKIVTAMHKSFDVRFSPLHVKLHATLEGVCLAQQMGYSRIVVKLDYRETILLLNQQIE------TL
        WNPP   W+K N ++ Y      T  G  +R+  G IV   +         LH +    L  + +    G   +  + D +  + L+N   +       +
Subjt:  WNPPSIAWIKLNSNAAYCANKSSTGVGVVLRDNLGKIVTAMHKSFDVRFSPLHVKLHATLEGVCLAQQMGYSRIVVKLDYRETILLLNQQIE------TL

Query:  SEATNWVEDIPSITNTFCFFSFLHVKREMNGVTHSLAKEAIRFHLSILWNDNYPQWLID
         +  +W+  +P     +C   F  V RE N    +LA                P WL++
Subjt:  SEATNWVEDIPSITNTFCFFSFLHVKREMNGVTHSLAKEAIRFHLSILWNDNYPQWLID

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e-value: 2.3e-08; identity: 24.32%)
Query:  WNKRQHLNFAIKESWDSLPDCRRIITIEGRWDSVKD-RAKPLDYHLHASFAALRLWGKEGSWNPPSIAWIKLNSNAAYCANKSSTGVGVVLRDNLGKIVT
        W  R  L F  KE +D+    RR +     W + ++   K     +  + +          W  P   W+K N++A +       G+G +LR+  G ++ 
Subjt:  WNKRQHLNFAIKESWDSLPDCRRIITIEGRWDSVKD-RAKPLDYHLHASFAALRLWGKEGSWNPPSIAWIKLNSNAAYCANKSSTGVGVVLRDNLGKIVT

Query:  AMHKSFDVRFSPLHVKLHATLEGVCLAQQMGYSRIVVKLDYRETILLLNQQ--IETLSEATNWVEDIPSITNTFCFFSFLHVKREMNGVTHSLAKEAIRF
           ++     + L  +L A    V    +  Y RI+ + D +  + LLN      TL  A   +EDI  + + F    F    R  N V   +A+E+I F
Subjt:  AMHKSFDVRFSPLHVKLHATLEGVCLAQQMGYSRIVVKLDYRETILLLNQQ--IETLSEATNWVEDIPSITNTFCFFSFLHVKREMNGVTHSLAKEAIRF

Query:  HLSILWNDNY--------PQWL
                NY        PQWL
Subjt:  HLSILWNDNY--------PQWL

AT4G29090.1 Ribonuclease H-like superfamily protein (e-value: 6.1e-09; identity: 24.46%)
Query:  GSWNPPSIAWIKLNSNAAYCANKSSTGVGVVLRDNLGKIVTAMHKSFDVRFSPLHVKLHATLEGVCLAQQMGYSRIVVKLDYRETILLLNQQIETLSEAT
        G W PP   W+K N++A +  +    G+G VLR+  G++     ++     S L  +L A    V    +  Y+ ++ + D +  I +LN   E      
Subjt:  GSWNPPSIAWIKLNSNAAYCANKSSTGVGVVLRDNLGKIVTAMHKSFDVRFSPLHVKLHATLEGVCLAQQMGYSRIVVKLDYRETILLLNQQIETLSEAT

Query:  NWVEDIPSITNTFCFFSFLHVKREMNGVTHSLAKEAIRF
          ++D+  + + F    F+ + RE N +   +A+E++ F
Subjt:  NWVEDIPSITNTFCFFSFLHVKREMNGVTHSLAKEAIRF


Sequences
CDS sequence
ATGGAGAAGAAATGTTGTGTAGTTGAACTGGAGAAGGAGATGAAATTTGTTGAAGAAGTTTGCTGCCATTTGCATGCAGAGAAGACTAAAAGTTTCTACCTTTCTAGAGC
CTGGAGGTGGGCGACGGATGGTTGTGGACTTTGCTTGTTTTGGGAAGATGAGGTTGATGTGACTATTCGTTCTTATTCTTCTAATCACATTGATTGGTTTATTTCCTGGG
ATGCCGGCAACTGGCAATTCACTGGTTTTTATGGCCAACCAACTGCTAATAATCGATATCTTACTTGGGAGCTCTTAAGACGACTTCATAATAGTGATGATTCAGCTTGG
GTGGTGGGTGGTGACTTGAATGAGCTTCTTCGTTACTCGAAAAAGGAAAATGTGCAAGTAAACCATTTGGATTGGTCCAAATTGGACCATCGAGCTATTGAGCTAGTTTT
ATGTAAGCAACAAGAATGTCAGTGGAATAAGAGACAACATCTTAATTTTGCAATCAAGGAGAGTTGGGATTCACTTCCAGATTGTAGAAGAATAATTACTATTGAAGGAA
GATGGGATTCAGTAAAGGATCGGGCCAAGCCACTAGACTATCATTTGCATGCTTCTTTTGCTGCTCTACGTTTGTGGGGCAAGGAGGGTAGTTGGAATCCACCTTCTATT
GCCTGGATTAAACTCAATTCGAATGCTGCTTATTGTGCCAATAAGTCTTCGACTGGTGTTGGTGTTGTGCTTCGGGATAATTTGGGCAAAATCGTTACTGCAATGCACAA
GTCTTTTGATGTTCGTTTTTCACCCTTGCATGTCAAACTTCACGCAACTCTTGAAGGAGTTTGCTTAGCACAACAAATGGGTTATTCTCGGATTGTGGTGAAGTTAGATT
ATAGAGAAACAATTTTATTACTGAACCAGCAGATAGAGACTTTGAGTGAAGCTACTAACTGGGTTGAAGATATCCCTAGCATCACTAATACTTTCTGCTTTTTCTCCTTT
TTGCATGTAAAAAGGGAGATGAATGGAGTGACTCACTCTCTTGCTAAGGAAGCTATTAGATTTCATCTTTCTATTTTATGGAATGATAATTATCCTCAATGGCTCATTGA
TGCAACAGAGGTTGAACTTCTGCCTCTCCCTCTGCATTCAGCCTCTCTCCGTCCGTATCCGCTTCAAGCTTCAGCCCCTGTAAAACCTCCGGCTCCGACCGCCGGCCACC
ACCCTTCACGGCCTCCTCATACGGCGGCTCCCTATCGACCGACCGGATTATGCAAGTATTTGGACTGTGTTTGGGCATTTGAATTTATTCAAAAGAAAAGAAAGAGGTTT
ATTAGCTTGCAAAATGGAAGCCCTTCCTTTCTCTCTTGTTCCACGCCTCCTTCCCTTGTGTTCACTGCCACTGCCGCCCCCTCCCTCGCCCTAGCTACGTGCGCACCCTT
TCTCACTGTTCCTCCTCTCCCTCGATCTCAGGTTCTAATTCCTATGTCAAGTTTTAAATTGTTGCCGGATTGTCGCCATGCCGATGAAACTGAACTG
mRNA sequence
ATGGAGAAGAAATGTTGTGTAGTTGAACTGGAGAAGGAGATGAAATTTGTTGAAGAAGTTTGCTGCCATTTGCATGCAGAGAAGACTAAAAGTTTCTACCTTTCTAGAGC
CTGGAGGTGGGCGACGGATGGTTGTGGACTTTGCTTGTTTTGGGAAGATGAGGTTGATGTGACTATTCGTTCTTATTCTTCTAATCACATTGATTGGTTTATTTCCTGGG
ATGCCGGCAACTGGCAATTCACTGGTTTTTATGGCCAACCAACTGCTAATAATCGATATCTTACTTGGGAGCTCTTAAGACGACTTCATAATAGTGATGATTCAGCTTGG
GTGGTGGGTGGTGACTTGAATGAGCTTCTTCGTTACTCGAAAAAGGAAAATGTGCAAGTAAACCATTTGGATTGGTCCAAATTGGACCATCGAGCTATTGAGCTAGTTTT
ATGTAAGCAACAAGAATGTCAGTGGAATAAGAGACAACATCTTAATTTTGCAATCAAGGAGAGTTGGGATTCACTTCCAGATTGTAGAAGAATAATTACTATTGAAGGAA
GATGGGATTCAGTAAAGGATCGGGCCAAGCCACTAGACTATCATTTGCATGCTTCTTTTGCTGCTCTACGTTTGTGGGGCAAGGAGGGTAGTTGGAATCCACCTTCTATT
GCCTGGATTAAACTCAATTCGAATGCTGCTTATTGTGCCAATAAGTCTTCGACTGGTGTTGGTGTTGTGCTTCGGGATAATTTGGGCAAAATCGTTACTGCAATGCACAA
GTCTTTTGATGTTCGTTTTTCACCCTTGCATGTCAAACTTCACGCAACTCTTGAAGGAGTTTGCTTAGCACAACAAATGGGTTATTCTCGGATTGTGGTGAAGTTAGATT
ATAGAGAAACAATTTTATTACTGAACCAGCAGATAGAGACTTTGAGTGAAGCTACTAACTGGGTTGAAGATATCCCTAGCATCACTAATACTTTCTGCTTTTTCTCCTTT
TTGCATGTAAAAAGGGAGATGAATGGAGTGACTCACTCTCTTGCTAAGGAAGCTATTAGATTTCATCTTTCTATTTTATGGAATGATAATTATCCTCAATGGCTCATTGA
TGCAACAGAGGTTGAACTTCTGCCTCTCCCTCTGCATTCAGCCTCTCTCCGTCCGTATCCGCTTCAAGCTTCAGCCCCTGTAAAACCTCCGGCTCCGACCGCCGGCCACC
ACCCTTCACGGCCTCCTCATACGGCGGCTCCCTATCGACCGACCGGATTATGCAAGTATTTGGACTGTGTTTGGGCATTTGAATTTATTCAAAAGAAAAGAAAGAGGTTT
ATTAGCTTGCAAAATGGAAGCCCTTCCTTTCTCTCTTGTTCCACGCCTCCTTCCCTTGTGTTCACTGCCACTGCCGCCCCCTCCCTCGCCCTAGCTACGTGCGCACCCTT
TCTCACTGTTCCTCCTCTCCCTCGATCTCAGGTTCTAATTCCTATGTCAAGTTTTAAATTGTTGCCGGATTGTCGCCATGCCGATGAAACTGAACTG
Protein sequence
MEKKCCVVELEKEMKFVEEVCCHLHAEKTKSFYLSRAWRWATDGCGLCLFWEDEVDVTIRSYSSNHIDWFISWDAGNWQFTGFYGQPTANNRYLTWELLRRLHNSDDSAW
VVGGDLNELLRYSKKENVQVNHLDWSKLDHRAIELVLCKQQECQWNKRQHLNFAIKESWDSLPDCRRIITIEGRWDSVKDRAKPLDYHLHASFAALRLWGKEGSWNPPSI
AWIKLNSNAAYCANKSSTGVGVVLRDNLGKIVTAMHKSFDVRFSPLHVKLHATLEGVCLAQQMGYSRIVVKLDYRETILLLNQQIETLSEATNWVEDIPSITNTFCFFSF
LHVKREMNGVTHSLAKEAIRFHLSILWNDNYPQWLIDATEVELLPLPLHSASLRPYPLQASAPVKPPAPTAGHHPSRPPHTAAPYRPTGLCKYLDCVWAFEFIQKKRKRF
ISLQNGSPSFLSCSTPPSLVFTATAAPSLALATCAPFLTVPPLPRSQVLIPMSSFKLLPDCRHADETEL
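
As a quick consistency check, the CDS listed above can be translated with the standard genetic code and compared with the listed protein sequence. The sketch below is only an illustration and is not part of the CuGenDB record: the names `CODON_TABLE`, `translate`, and `CDS_PREFIX` are hypothetical, and only the first 42 nucleotides of the CDS are used.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon.

    Whitespace is removed first; any trailing partial codon is ignored,
    and codons with unexpected characters are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


# First 42 nucleotides of the CDS sequence shown in this record.
CDS_PREFIX = "ATGGAGAAGAAATGTTGTGTAGTTGAACTGGAGAAGGAGATG"

print(translate(CDS_PREFIX))
```

Pasting the full CDS in place of `CDS_PREFIX` allows the same comparison against the complete protein sequence; line breaks in the pasted text are tolerated because whitespace is stripped before translation.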