; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr017654 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr017654
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionENDO3c domain-containing protein
Genome locationtig00153054:474159..481834
RNA-Seq ExpressionSgr017654
SyntenySgr017654
Gene Ontology termsNA
InterPro domainsIPR003854 - Gibberellin regulated protein
IPR023170 - Helix-hairpin-helix, base-excision DNA repair, C-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022136993.1 putative DNA glycosylase At3g47830 [Momordica charantia]2.8e-5650.51Show/hide
Query:  MQKNRKKRRLQLQPPQKPERKPEPEPEPVSDSHSNRPRTKSARIINGFNGPTGDPCPAHPSPTADECLSVRDDLLTLHGFPREFVKYRKERQRL-SECCS
        MQK  K+RRLQ              PE V+DSHSN+PR KSA  I   NG T DP PAH SPT+D+CLS+RDDLL LHGFPREFVKYRKERQR  SECCS
Subjt:  MQKNRKKRRLQLQPPQKPERKPEPEPEPVSDSHSNRPRTKSARIINGFNGPTGDPCPAHPSPTADECLSVRDDLLTLHGFPREFVKYRKERQRL-SECCS

Query:  VV-GGGEPQDN-----------------------NTTEANSERAFASLKSAFATWEDFGDPFVSRPLED------------------FDSLT------CL
        V  GGGEP D+                       NTTEANSERAFASLKSAFATWED      S+ +ED                    SL+      C+
Subjt:  VV-GGGEPQDN-----------------------NTTEANSERAFASLKSAFATWEDFGDPFVSRPLED------------------FDSLT------CL

Query:  Q-------------------------------------------VFEIAKFIGWVPEEADRNKAYLHLNQRIPNHLKFDLNCLLYTHGKLCSK
        +                                           VFEIAKFIGWVP+EADRNKAYLHLNQRIPNHLKFDLNCLLYTHGKLCSK
Subjt:  Q-------------------------------------------VFEIAKFIGWVPEEADRNKAYLHLNQRIPNHLKFDLNCLLYTHGKLCSK

XP_022954941.1 putative DNA glycosylase At3g47830 [Cucurbita moschata]6.1e-5647.99Show/hide
Query:  KKRRLQLQPPQKPERKPEPEPEPVSDSHSNRPRTKSARIINGFNGPTGDPCPAHPSPTADECLSVRDDLLTLHGFPREFVKYRKERQRLSECCSVV--GG
        KKRRLQ             +PEP  DS +N PR KS R +NGF+  T +P PA+ SPT+DECLSVRDDLL L+GFPREFVKYRKER+RLSECCSVV   G
Subjt:  KKRRLQLQPPQKPERKPEPEPEPVSDSHSNRPRTKSARIINGFNGPTGDPCPAHPSPTADECLSVRDDLLTLHGFPREFVKYRKERQRLSECCSVV--GG

Query:  GEPQDN-----------------NTTEANSERAFASLKSAFATWEDFGDPFVSRPLED------------------FDSLT------CLQ----------
        GE  +N                 NTTEANSE+AFASLKSAFATWED      S+ +ED                    SL+      CL+          
Subjt:  GEPQDN-----------------NTTEANSERAFASLKSAFATWEDFGDPFVSRPLED------------------FDSLT------CLQ----------

Query:  ---------------------------------VFEIAKFIGWVPEEADRNKAYLHLNQRIPNHLKFDLNCLLYTHGKLCSKYAELLHGVREVVEKRR
                                         VFEIAKFIGWVP+EADRNK YLHLNQRIPNHLKFDLNCLLYTHGKLCSK ++   G +  V + +
Subjt:  ---------------------------------VFEIAKFIGWVPEEADRNKAYLHLNQRIPNHLKFDLNCLLYTHGKLCSKYAELLHGVREVVEKRR

XP_022994647.1 putative DNA glycosylase At3g47830 [Cucurbita maxima]3.3e-5749.66Show/hide
Query:  KKRRLQLQPPQKPERKPEPEPEPVSDSHSNRPRTKSARIINGFNGPTGDPCPAHPSPTADECLSVRDDLLTLHGFPREFVKYRKERQRLSECCSVV--GG
        KKRRLQ             +PEP  DSHSN PR KS R +NGF+  T +P PA+ SPT+DECLSVRDDLL LHGFPREFV YRKER+RLSECCSVV   G
Subjt:  KKRRLQLQPPQKPERKPEPEPEPVSDSHSNRPRTKSARIINGFNGPTGDPCPAHPSPTADECLSVRDDLLTLHGFPREFVKYRKERQRLSECCSVV--GG

Query:  GEPQDN-----------------NTTEANSERAFASLKSAFATWEDFGDPFVSRPLED------------------FDSLT------CLQ----------
        GE  +N                 NTTEANSE+AFASLKSAFATWED      S+ +ED                    SL+      CL+          
Subjt:  GEPQDN-----------------NTTEANSERAFASLKSAFATWEDFGDPFVSRPLED------------------FDSLT------CLQ----------

Query:  ---------------------------------VFEIAKFIGWVPEEADRNKAYLHLNQRIPNHLKFDLNCLLYTHGKLCSKYAELLHGVRE
                                         VFEIAKFIGWVP+EADRNK YLHLNQRIPNHLKFDLNCLLYTHGKLCSK ++   G R+
Subjt:  ---------------------------------VFEIAKFIGWVPEEADRNKAYLHLNQRIPNHLKFDLNCLLYTHGKLCSKYAELLHGVRE

XP_023541273.1 putative DNA glycosylase At3g47830 [Cucurbita pepo subsp. pepo]9.5e-5748.66Show/hide
Query:  KKRRLQLQPPQKPERKPEPEPEPVSDSHSNRPRTKSARIINGFNGPTGDPCPAHPSPTADECLSVRDDLLTLHGFPREFVKYRKERQRLSECCSVV--GG
        KKRRLQ             +PEP  DS +N PR KS R +NGFN  T +P PA+ SPT+DECLSVRDDLL LHGFPREFVKYRKER+RLSECCSVV   G
Subjt:  KKRRLQLQPPQKPERKPEPEPEPVSDSHSNRPRTKSARIINGFNGPTGDPCPAHPSPTADECLSVRDDLLTLHGFPREFVKYRKERQRLSECCSVV--GG

Query:  GEPQDN-----------------NTTEANSERAFASLKSAFATWEDFGDPFVSRPLED------------------FDSLT------CLQ----------
        GE  +N                 NTTEANSE+AFASLKSAFATWED      S+ +ED                    SL+      CL+          
Subjt:  GEPQDN-----------------NTTEANSERAFASLKSAFATWEDFGDPFVSRPLED------------------FDSLT------CLQ----------

Query:  ---------------------------------VFEIAKFIGWVPEEADRNKAYLHLNQRIPNHLKFDLNCLLYTHGKLCSKYAELLHGVREVVEKRR
                                         VFEIAKFIGWVP+EADRNK YLHLNQRIPNHLKFDLNCLLYTHGKLCSK ++   G +  V + +
Subjt:  ---------------------------------VFEIAKFIGWVPEEADRNKAYLHLNQRIPNHLKFDLNCLLYTHGKLCSKYAELLHGVREVVEKRR

XP_038894941.1 putative DNA glycosylase At3g47830 [Benincasa hispida]8.0e-5650Show/hide
Query:  KKRRLQLQPPQKPERKPEPEPEPVSDSHSNRPRTKSARIINGFNGPTGDPCPAHPSPTADECLSVRDDLLTLHGFPREFVKYRKERQRLSECCSVVGGG-
        KKRRLQ         KPEP P    DS  N P  KS R +NGFN  T DP PAH SPT+DECLSVRDDLL LHGFPREF+KYRKER+RLSECCS+V GG 
Subjt:  KKRRLQLQPPQKPERKPEPEPEPVSDSHSNRPRTKSARIINGFNGPTGDPCPAHPSPTADECLSVRDDLLTLHGFPREFVKYRKERQRLSECCSVVGGG-

Query:  ---------EPQD---------------NNTTEANSERAFASLKSAFATWEDFGDPFVSRPLED------------------FDSLT------CLQ----
                 EP D                NTTEANSERAF+SLKSAFATWED      S+ +ED                    SL+      CL+    
Subjt:  ---------EPQD---------------NNTTEANSERAFASLKSAFATWEDFGDPFVSRPLED------------------FDSLT------CLQ----

Query:  ---------------------------------------VFEIAKFIGWVPEEADRNKAYLHLNQRIPNHLKFDLNCLLYTHGKLCSK
                                               VFEIAKFIGWVP++ADRNK YLHLNQRIPNHLKFDLNCLLYTHGKLCSK
Subjt:  ---------------------------------------VFEIAKFIGWVPEEADRNKAYLHLNQRIPNHLKFDLNCLLYTHGKLCSK

TrEMBL top hitse value%identityAlignment
A0A1S3AW45 putative DNA glycosylase At3g478301.1e-5548.26Show/hide
Query:  KKRRLQLQPPQKPERKPEPEPEPVSDSHSNRPRTKSARIINGFNGPTGDPCPAHPSPTADECLSVRDDLLTLHGFPREFVKYRKERQRLSECCSVVGG--
        KKRRLQ             +PEP +DSHS+RPR KS R +NGFN  T +P PAH SPT+DECLSVRDDLL LHGFPREF+KYRKER+RLSECCS V G  
Subjt:  KKRRLQLQPPQKPERKPEPEPEPVSDSHSNRPRTKSARIINGFNGPTGDPCPAHPSPTADECLSVRDDLLTLHGFPREFVKYRKERQRLSECCSVVGG--

Query:  GEPQDN-----------------------NTTEANSERAFASLKSAFATWEDFGDPFVSRPLED------------------FDSLT------CLQ----
         E  DN                       NTTEANSERAF SLKSAF+TWED      S+ +ED                    SL+      CL+    
Subjt:  GEPQDN-----------------------NTTEANSERAFASLKSAFATWEDFGDPFVSRPLED------------------FDSLT------CLQ----

Query:  ---------------------------------------VFEIAKFIGWVPEEADRNKAYLHLNQRIPNHLKFDLNCLLYTHGKLCSK
                                               VF+IAKF+GWVP++ADRNK YLHLN+RIPNHLKFDLNCLLYTHGKLCSK
Subjt:  ---------------------------------------VFEIAKFIGWVPEEADRNKAYLHLNQRIPNHLKFDLNCLLYTHGKLCSK

A0A5D3BJ43 Putative DNA glycosylase1.1e-5548.26Show/hide
Query:  KKRRLQLQPPQKPERKPEPEPEPVSDSHSNRPRTKSARIINGFNGPTGDPCPAHPSPTADECLSVRDDLLTLHGFPREFVKYRKERQRLSECCSVVGG--
        KKRRLQ             +PEP +DSHS+RPR KS R +NGFN  T +P PAH SPT+DECLSVRDDLL LHGFPREF+KYRKER+RLSECCS V G  
Subjt:  KKRRLQLQPPQKPERKPEPEPEPVSDSHSNRPRTKSARIINGFNGPTGDPCPAHPSPTADECLSVRDDLLTLHGFPREFVKYRKERQRLSECCSVVGG--

Query:  GEPQDN-----------------------NTTEANSERAFASLKSAFATWEDFGDPFVSRPLED------------------FDSLT------CLQ----
         E  DN                       NTTEANSERAF SLKSAF+TWED      S+ +ED                    SL+      CL+    
Subjt:  GEPQDN-----------------------NTTEANSERAFASLKSAFATWEDFGDPFVSRPLED------------------FDSLT------CLQ----

Query:  ---------------------------------------VFEIAKFIGWVPEEADRNKAYLHLNQRIPNHLKFDLNCLLYTHGKLCSK
                                               VF+IAKF+GWVP++ADRNK YLHLN+RIPNHLKFDLNCLLYTHGKLCSK
Subjt:  ---------------------------------------VFEIAKFIGWVPEEADRNKAYLHLNQRIPNHLKFDLNCLLYTHGKLCSK

A0A6J1C919 putative DNA glycosylase At3g478301.3e-5650.51Show/hide
Query:  MQKNRKKRRLQLQPPQKPERKPEPEPEPVSDSHSNRPRTKSARIINGFNGPTGDPCPAHPSPTADECLSVRDDLLTLHGFPREFVKYRKERQRL-SECCS
        MQK  K+RRLQ              PE V+DSHSN+PR KSA  I   NG T DP PAH SPT+D+CLS+RDDLL LHGFPREFVKYRKERQR  SECCS
Subjt:  MQKNRKKRRLQLQPPQKPERKPEPEPEPVSDSHSNRPRTKSARIINGFNGPTGDPCPAHPSPTADECLSVRDDLLTLHGFPREFVKYRKERQRL-SECCS

Query:  VV-GGGEPQDN-----------------------NTTEANSERAFASLKSAFATWEDFGDPFVSRPLED------------------FDSLT------CL
        V  GGGEP D+                       NTTEANSERAFASLKSAFATWED      S+ +ED                    SL+      C+
Subjt:  VV-GGGEPQDN-----------------------NTTEANSERAFASLKSAFATWEDFGDPFVSRPLED------------------FDSLT------CL

Query:  Q-------------------------------------------VFEIAKFIGWVPEEADRNKAYLHLNQRIPNHLKFDLNCLLYTHGKLCSK
        +                                           VFEIAKFIGWVP+EADRNKAYLHLNQRIPNHLKFDLNCLLYTHGKLCSK
Subjt:  Q-------------------------------------------VFEIAKFIGWVPEEADRNKAYLHLNQRIPNHLKFDLNCLLYTHGKLCSK

A0A6J1GSC0 putative DNA glycosylase At3g478303.0e-5647.99Show/hide
Query:  KKRRLQLQPPQKPERKPEPEPEPVSDSHSNRPRTKSARIINGFNGPTGDPCPAHPSPTADECLSVRDDLLTLHGFPREFVKYRKERQRLSECCSVV--GG
        KKRRLQ             +PEP  DS +N PR KS R +NGF+  T +P PA+ SPT+DECLSVRDDLL L+GFPREFVKYRKER+RLSECCSVV   G
Subjt:  KKRRLQLQPPQKPERKPEPEPEPVSDSHSNRPRTKSARIINGFNGPTGDPCPAHPSPTADECLSVRDDLLTLHGFPREFVKYRKERQRLSECCSVV--GG

Query:  GEPQDN-----------------NTTEANSERAFASLKSAFATWEDFGDPFVSRPLED------------------FDSLT------CLQ----------
        GE  +N                 NTTEANSE+AFASLKSAFATWED      S+ +ED                    SL+      CL+          
Subjt:  GEPQDN-----------------NTTEANSERAFASLKSAFATWEDFGDPFVSRPLED------------------FDSLT------CLQ----------

Query:  ---------------------------------VFEIAKFIGWVPEEADRNKAYLHLNQRIPNHLKFDLNCLLYTHGKLCSKYAELLHGVREVVEKRR
                                         VFEIAKFIGWVP+EADRNK YLHLNQRIPNHLKFDLNCLLYTHGKLCSK ++   G +  V + +
Subjt:  ---------------------------------VFEIAKFIGWVPEEADRNKAYLHLNQRIPNHLKFDLNCLLYTHGKLCSKYAELLHGVREVVEKRR

A0A6J1JWG0 putative DNA glycosylase At3g478301.6e-5749.66Show/hide
Query:  KKRRLQLQPPQKPERKPEPEPEPVSDSHSNRPRTKSARIINGFNGPTGDPCPAHPSPTADECLSVRDDLLTLHGFPREFVKYRKERQRLSECCSVV--GG
        KKRRLQ             +PEP  DSHSN PR KS R +NGF+  T +P PA+ SPT+DECLSVRDDLL LHGFPREFV YRKER+RLSECCSVV   G
Subjt:  KKRRLQLQPPQKPERKPEPEPEPVSDSHSNRPRTKSARIINGFNGPTGDPCPAHPSPTADECLSVRDDLLTLHGFPREFVKYRKERQRLSECCSVV--GG

Query:  GEPQDN-----------------NTTEANSERAFASLKSAFATWEDFGDPFVSRPLED------------------FDSLT------CLQ----------
        GE  +N                 NTTEANSE+AFASLKSAFATWED      S+ +ED                    SL+      CL+          
Subjt:  GEPQDN-----------------NTTEANSERAFASLKSAFATWEDFGDPFVSRPLED------------------FDSLT------CLQ----------

Query:  ---------------------------------VFEIAKFIGWVPEEADRNKAYLHLNQRIPNHLKFDLNCLLYTHGKLCSKYAELLHGVRE
                                         VFEIAKFIGWVP+EADRNK YLHLNQRIPNHLKFDLNCLLYTHGKLCSK ++   G R+
Subjt:  ---------------------------------VFEIAKFIGWVPEEADRNKAYLHLNQRIPNHLKFDLNCLLYTHGKLCSKYAELLHGVRE

SwissProt top hitse value%identityAlignment
F4IQJ4 Gibberellin-regulated protein 117.6e-1762.9Show/hide
Query:  IDCEGLCQRRCGAHSRPKLCMRACGTCCVRCKCVPPGTSGNREMCGTCYTDMTTHGNKTKCP
        IDC   CQ RC   SRP LC RACGTCC RC CV PGTSGN + C  CY  +TTHG + KCP
Subjt:  IDCEGLCQRRCGAHSRPKLCMRACGTCCVRCKCVPPGTSGNREMCGTCYTDMTTHGNKTKCP

F4JCQ3 Putative DNA glycosylase At3g478302.5e-2834.87Show/hide
Query:  DSHSNRPRTKSARIINGFNGPTGDPCPAHPSPTADECLSVRDDLLTLHGFPREFVKYRKERQRLSECCSVVGGGEPQDN---------------------
        D  S  P  KS   ++G     G+P P    PTA+EC  VRD LL+LHGFP EF  YR++R R     S V   + Q N                     
Subjt:  DSHSNRPRTKSARIINGFNGPTGDPCPAHPSPTADECLSVRDDLLTLHGFPREFVKYRKERQRLSECCSVVGGGEPQDN---------------------

Query:  ----NTTEANSERAFASLKSAFATWED-----------------------------------------------FGDPFVSRPLEDF-----DSLTCL--
            NTTE+NS+RAFASLK+ F  W+D                                                    V   L  F      +++C+  
Subjt:  ----NTTEANSERAFASLKSAFATWED-----------------------------------------------FGDPFVSRPLEDF-----DSLTCL--

Query:  ------------QVFEIAKFIGWVPEEADRNKAYLHLNQRIPNHLKFDLNCLLYTHGKLCS
                     VFEIAK +GWVP+ ADRNK Y+HLN++IP+ LKFDLNCLLYTHGK+CS
Subjt:  ------------QVFEIAKFIGWVPEEADRNKAYLHLNQRIPNHLKFDLNCLLYTHGKLCS

P46688 Gibberellin-regulated protein 21.3e-1661.29Show/hide
Query:  IDCEGLCQRRCGAHSRPKLCMRACGTCCVRCKCVPPGTSGNREMCGTCYTDMTTHGNKTKCP
        IDC G C+ RC   SR KLC+RAC +CC RC CVPPGTSGN  +C  CY  +TTHG + KCP
Subjt:  IDCEGLCQRRCGAHSRPKLCMRACGTCCVRCKCVPPGTSGNREMCGTCYTDMTTHGNKTKCP

Q93X17 Snakin-22.8e-1969.35Show/hide
Query:  IDCEGLCQRRCGAHSRPKLCMRACGTCCVRCKCVPPGTSGNREMCGTCYTDMTTHGNKTKCP
        IDC G C  RC   SRP+LC RACGTCC RC CVPPGTSGN E C  CY  +TTHGNK KCP
Subjt:  IDCEGLCQRRCGAHSRPKLCMRACGTCCVRCKCVPPGTSGNREMCGTCYTDMTTHGNKTKCP

Q9LFR3 Gibberellin-regulated protein 147.4e-2067.74Show/hide
Query:  IDCEGLCQRRCGAHSRPKLCMRACGTCCVRCKCVPPGTSGNREMCGTCYTDMTTHGNKTKCP
        IDC  LC  RCG HSR  +CMRAC TCC RCKCVPPGT GN+E CG+CY +M T G K+KCP
Subjt:  IDCEGLCQRRCGAHSRPKLCMRACGTCCVRCKCVPPGTSGNREMCGTCYTDMTTHGNKTKCP

Arabidopsis top hitse value%identityAlignment
AT2G18420.1 Gibberellin-regulated family protein5.4e-1862.9Show/hide
Query:  IDCEGLCQRRCGAHSRPKLCMRACGTCCVRCKCVPPGTSGNREMCGTCYTDMTTHGNKTKCP
        IDC   CQ RC   SRP LC RACGTCC RC CV PGTSGN + C  CY  +TTHG + KCP
Subjt:  IDCEGLCQRRCGAHSRPKLCMRACGTCCVRCKCVPPGTSGNREMCGTCYTDMTTHGNKTKCP

AT3G47830.1 DNA glycosylase superfamily protein1.8e-2934.87Show/hide
Query:  DSHSNRPRTKSARIINGFNGPTGDPCPAHPSPTADECLSVRDDLLTLHGFPREFVKYRKERQRLSECCSVVGGGEPQDN---------------------
        D  S  P  KS   ++G     G+P P    PTA+EC  VRD LL+LHGFP EF  YR++R R     S V   + Q N                     
Subjt:  DSHSNRPRTKSARIINGFNGPTGDPCPAHPSPTADECLSVRDDLLTLHGFPREFVKYRKERQRLSECCSVVGGGEPQDN---------------------

Query:  ----NTTEANSERAFASLKSAFATWED-----------------------------------------------FGDPFVSRPLEDF-----DSLTCL--
            NTTE+NS+RAFASLK+ F  W+D                                                    V   L  F      +++C+  
Subjt:  ----NTTEANSERAFASLKSAFATWED-----------------------------------------------FGDPFVSRPLEDF-----DSLTCL--

Query:  ------------QVFEIAKFIGWVPEEADRNKAYLHLNQRIPNHLKFDLNCLLYTHGKLCS
                     VFEIAK +GWVP+ ADRNK Y+HLN++IP+ LKFDLNCLLYTHGK+CS
Subjt:  ------------QVFEIAKFIGWVPEEADRNKAYLHLNQRIPNHLKFDLNCLLYTHGKLCS

AT4G09600.1 GAST1 protein homolog 34.6e-1758.06Show/hide
Query:  IDCEGLCQRRCGAHSRPKLCMRACGTCCVRCKCVPPGTSGNREMCGTCYTDMTTHGNKTKCP
        IDC G C+ RC   SRP LC+RAC +CC RC CVPPGT+GN  +C  CY  +TT G + KCP
Subjt:  IDCEGLCQRRCGAHSRPKLCMRACGTCCVRCKCVPPGTSGNREMCGTCYTDMTTHGNKTKCP

AT4G09610.1 GAST1 protein homolog 29.2e-1861.29Show/hide
Query:  IDCEGLCQRRCGAHSRPKLCMRACGTCCVRCKCVPPGTSGNREMCGTCYTDMTTHGNKTKCP
        IDC G C+ RC   SR KLC+RAC +CC RC CVPPGTSGN  +C  CY  +TTHG + KCP
Subjt:  IDCEGLCQRRCGAHSRPKLCMRACGTCCVRCKCVPPGTSGNREMCGTCYTDMTTHGNKTKCP

AT5G14920.1 Gibberellin-regulated family protein5.2e-2167.74Show/hide
Query:  IDCEGLCQRRCGAHSRPKLCMRACGTCCVRCKCVPPGTSGNREMCGTCYTDMTTHGNKTKCP
        IDC  LC  RCG HSR  +CMRAC TCC RCKCVPPGT GN+E CG+CY +M T G K+KCP
Subjt:  IDCEGLCQRRCGAHSRPKLCMRACGTCCVRCKCVPPGTSGNREMCGTCYTDMTTHGNKTKCP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGGAGAAGCCCACGACAGGGCTTCATATGATAGAGGACGTATCGGCTTCGAAAACGCTCGGCGTGATGCAGAAGAACCGTAAAAAGAGGCGTCTTCAACTTCAACC
TCCTCAAAAGCCTGAACGAAAACCAGAACCAGAGCCAGAGCCAGTGTCCGATTCTCACTCGAACAGACCTCGAACAAAATCGGCCAGAATTATTAATGGCTTCAATGGGC
CCACCGGGGACCCGTGTCCTGCTCATCCTTCGCCCACTGCAGACGAATGTCTATCCGTAAGAGACGATCTCTTGACTCTTCATGGTTTCCCGCGAGAGTTCGTCAAGTAT
CGGAAGGAGAGACAGAGGTTGAGTGAGTGCTGCTCCGTCGTGGGCGGCGGTGAGCCCCAGGATAATAACACCACGGAAGCCAATTCTGAGAGGGCTTTTGCTTCTCTCAA
ATCTGCTTTTGCTACCTGGGAAGATTTTGGTGATCCCTTTGTTTCAAGACCATTGGAAGATTTTGATTCTCTAACATGTTTGCAGGTCTTTGAGATCGCAAAGTTCATCG
GTTGGGTCCCGGAGGAGGCAGACAGGAACAAAGCATATCTCCATCTTAACCAAAGGATCCCAAATCATCTCAAGTTCGATCTCAACTGTCTTCTTTACACTCATGGCAAG
CTCTGTTCGAAATACGCTGAGCTTCTCCATGGCGTGCGCGAAGTCGTTGAAAAACGCCGTCTGGTTCACCGCGTACAGCTCCACATACGGCTTCGTCCTCGGGTCCTTTA
TCAGAGCGTTGTCCGTCGACAGCAGCCCCAGCCCTCTCTGCAGATTCTGCGCGTCCGCAAACTTGGGGTAGATGTCGGGGTCCGTCGGTGTCGTGGTGCTGAAGTTGAAG
AGACGGTCGGTGAACTCCTTGCAGTGAGAGAAGCCGATGGTGTGGCCGCCGGAGAGCGCCACCATTCCTGGACTGTGAAGCCTTTAGCGTTGAAGAGGTTGATGATGTCG
TCCAAGGATTGGGTCACCTTCGGGAGATTTCCTTGGACATTTGAGAGTTTTGAAGTGAGACCATCTTTGCGGCCCAACCGACATTGTAGTAGGGGCCACCACCATACGAT
GAGGAGAGTTCGAGGTTGGTCTTGGCGTGGACACGACGTCGAAGGCGTCGCCGGAGAGGGAGTGGTTGATTTCGTCTTCACGCTCGGCCTTGTTGAAGGAATTGGAGGAG
ATAAGGACGGAAGCGTCGCAGCCGTCGACCATGCAGTCGTGGAAGAAGAGGCGGAGGGTGCCGGCGGCGGTGACGGGGCTGGTGATCTGCTTGTTGGTGACGGTGTCACG
GATGATCTTCTCAAAGTTGGGGCAAGTTTTCTGGTAATAACCGAGGGTGAGTTTGGATTGGACGAGGGAGGAGAAGGGAATAATGGAAAGGAAGAGAAGAAAAAGCAAAG
AAAAAGGTGCCATGGCGGAAGAGGAGGGAGAACCTGCTTCCTCCTTATCTATATTCCTCAATGGGGGAAGCCTCTTCCGATGGAGGGGAGGGCAATGAAGAATAAGAGAA
GAGGGAGGAAGCAAAAGAAAAAAACCCAACTTAGGACCAAAATGAATGGGGGAGGAGGAGAATATCTGAGAAAAAACGGGATCGACATAAAATGGTGTCGGGATTACTTC
TTTGGACGGGAGAAACTTTTGGGCTTTCGTTATCTATCAAAAACAAAACCCTTGGTGTCGTCTGATGTATACCCAGAAGAAGGGTTGGCAATGGCAGATGGGGGAGCAGT
GAGAGGCCCAAACAGAAGGCTCATGCAGTACATAGATTGTGAAGGGTTGTGCCAGCGGCGGTGCGGCGCGCACTCGAGGCCGAAGTTGTGCATGAGGGCGTGCGGGACGT
GCTGCGTGCGGTGCAAGTGCGTGCCGCCGGGCACTTCAGGGAACCGTGAGATGTGCGGCACCTGCTACACCGACATGACCACCCACGGCAACAAGACCAAGTGCCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTGGAGAAGCCCACGACAGGGCTTCATATGATAGAGGACGTATCGGCTTCGAAAACGCTCGGCGTGATGCAGAAGAACCGTAAAAAGAGGCGTCTTCAACTTCAACC
TCCTCAAAAGCCTGAACGAAAACCAGAACCAGAGCCAGAGCCAGTGTCCGATTCTCACTCGAACAGACCTCGAACAAAATCGGCCAGAATTATTAATGGCTTCAATGGGC
CCACCGGGGACCCGTGTCCTGCTCATCCTTCGCCCACTGCAGACGAATGTCTATCCGTAAGAGACGATCTCTTGACTCTTCATGGTTTCCCGCGAGAGTTCGTCAAGTAT
CGGAAGGAGAGACAGAGGTTGAGTGAGTGCTGCTCCGTCGTGGGCGGCGGTGAGCCCCAGGATAATAACACCACGGAAGCCAATTCTGAGAGGGCTTTTGCTTCTCTCAA
ATCTGCTTTTGCTACCTGGGAAGATTTTGGTGATCCCTTTGTTTCAAGACCATTGGAAGATTTTGATTCTCTAACATGTTTGCAGGTCTTTGAGATCGCAAAGTTCATCG
GTTGGGTCCCGGAGGAGGCAGACAGGAACAAAGCATATCTCCATCTTAACCAAAGGATCCCAAATCATCTCAAGTTCGATCTCAACTGTCTTCTTTACACTCATGGCAAG
CTCTGTTCGAAATACGCTGAGCTTCTCCATGGCGTGCGCGAAGTCGTTGAAAAACGCCGTCTGGTTCACCGCGTACAGCTCCACATACGGCTTCGTCCTCGGGTCCTTTA
TCAGAGCGTTGTCCGTCGACAGCAGCCCCAGCCCTCTCTGCAGATTCTGCGCGTCCGCAAACTTGGGGTAGATGTCGGGGTCCGTCGGTGTCGTGGTGCTGAAGTTGAAG
AGACGGTCGGTGAACTCCTTGCAGTGAGAGAAGCCGATGGTGTGGCCGCCGGAGAGCGCCACCATTCCTGGACTGTGAAGCCTTTAGCGTTGAAGAGGTTGATGATGTCG
TCCAAGGATTGGGTCACCTTCGGGAGATTTCCTTGGACATTTGAGAGTTTTGAAGTGAGACCATCTTTGCGGCCCAACCGACATTGTAGTAGGGGCCACCACCATACGAT
GAGGAGAGTTCGAGGTTGGTCTTGGCGTGGACACGACGTCGAAGGCGTCGCCGGAGAGGGAGTGGTTGATTTCGTCTTCACGCTCGGCCTTGTTGAAGGAATTGGAGGAG
ATAAGGACGGAAGCGTCGCAGCCGTCGACCATGCAGTCGTGGAAGAAGAGGCGGAGGGTGCCGGCGGCGGTGACGGGGCTGGTGATCTGCTTGTTGGTGACGGTGTCACG
GATGATCTTCTCAAAGTTGGGGCAAGTTTTCTGGTAATAACCGAGGGTGAGTTTGGATTGGACGAGGGAGGAGAAGGGAATAATGGAAAGGAAGAGAAGAAAAAGCAAAG
AAAAAGGTGCCATGGCGGAAGAGGAGGGAGAACCTGCTTCCTCCTTATCTATATTCCTCAATGGGGGAAGCCTCTTCCGATGGAGGGGAGGGCAATGAAGAATAAGAGAA
GAGGGAGGAAGCAAAAGAAAAAAACCCAACTTAGGACCAAAATGAATGGGGGAGGAGGAGAATATCTGAGAAAAAACGGGATCGACATAAAATGGTGTCGGGATTACTTC
TTTGGACGGGAGAAACTTTTGGGCTTTCGTTATCTATCAAAAACAAAACCCTTGGTGTCGTCTGATGTATACCCAGAAGAAGGGTTGGCAATGGCAGATGGGGGAGCAGT
GAGAGGCCCAAACAGAAGGCTCATGCAGTACATAGATTGTGAAGGGTTGTGCCAGCGGCGGTGCGGCGCGCACTCGAGGCCGAAGTTGTGCATGAGGGCGTGCGGGACGT
GCTGCGTGCGGTGCAAGTGCGTGCCGCCGGGCACTTCAGGGAACCGTGAGATGTGCGGCACCTGCTACACCGACATGACCACCCACGGCAACAAGACCAAGTGCCCTTAA
Protein sequenceShow/hide protein sequence
MLEKPTTGLHMIEDVSASKTLGVMQKNRKKRRLQLQPPQKPERKPEPEPEPVSDSHSNRPRTKSARIINGFNGPTGDPCPAHPSPTADECLSVRDDLLTLHGFPREFVKY
RKERQRLSECCSVVGGGEPQDNNTTEANSERAFASLKSAFATWEDFGDPFVSRPLEDFDSLTCLQVFEIAKFIGWVPEEADRNKAYLHLNQRIPNHLKFDLNCLLYTHGK
LCSKYAELLHGVREVVEKRRLVHRVQLHIRLRPRVLYQSVVRRQQPQPSLQILRVRKLGVDVGVRRCRGAEVEETVGELLAVREADGVAAGERHHSWTVKPLALKRLMMS
SKDWVTFGRFPWTFESFEVRPSLRPNRHCSRGHHHTMRRVRGWSWRGHDVEGVAGEGVVDFVFTLGLVEGIGGDKDGSVAAVDHAVVEEEAEGAGGGDGAGDLLVGDGVT
DDLLKVGASFLVITEGEFGLDEGGEGNNGKEEKKKQRKRCHGGRGGRTCFLLIYIPQWGKPLPMEGRAMKNKRRGRKQKKKTQLRTKMNGGGGEYLRKNGIDIKWCRDYF
FGREKLLGFRYLSKTKPLVSSDVYPEEGLAMADGGAVRGPNRRLMQYIDCEGLCQRRCGAHSRPKLCMRACGTCCVRCKCVPPGTSGNREMCGTCYTDMTTHGNKTKCP