; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr027310 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr027310
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionN-acetyltransferase domain-containing protein
Genome locationtig00153048:2972049..2979775
RNA-Seq ExpressionSgr027310
SyntenySgr027310
Gene Ontology termsGO:0009751 - response to salicylic acid (biological process)
GO:0016740 - transferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN72326.1 hypothetical protein VITISV_041246 [Vitis vinifera]3.1e-11652.42Show/hide
Query:  MRKEISLQFLYANRQTAIEGVTKVQNDVCIRAMKDIMQRAI---SYRNIAAADIPPPP------HSAGFSGGA------------------VAPEERAMF
        M KEI LQF + NR  A +G+ K+  +VC+R +KDIM  AI     +N+  + +  PP      +  G  G                    V P++R MF
Subjt:  MRKEISLQFLYANRQTAIEGVTKVQNDVCIRAMKDIMQRAI---SYRNIAAADIPPPP------HSAGFSGGA------------------VAPEERAMF

Query:  VTFSKGYPVHEWEVREFFTREHGDCIESFQMQEVEPNEQALFARIDFRSASTIDSILRGQQRRKFIINGKHIWARKFIPKRLQP----PPPPPLPPQDGN
        VTFSKGYPV+EWEVREFF R +GDCIES  MQEVE NEQ+LFARI F SASTI+ IL G  + KF IN KH+WARKF+PKR +P     P     P    
Subjt:  VTFSKGYPVHEWEVREFFTREHGDCIESFQMQEVEPNEQALFARIDFRSASTIDSILRGQQRRKFIINGKHIWARKFIPKRLQP----PPPPPLPPQDGN

Query:  TGRH---------SPSTSPQALLVPISTRPSILGRGVRHEEIATGVKMMIRGGGCGE-EQRRSRKSEGKKMITNCFTMDSGSSGEEPTSWEDLCSINLMP
        + +H          P   P    +   T    L  G  H  I   +        CG  EQ ++  S   ++  NC  MD+GS+GEEP SW++L +INLMP
Subjt:  TGRH---------SPSTSPQALLVPISTRPSILGRGVRHEEIATGVKMMIRGGGCGE-EQRRSRKSEGKKMITNCFTMDSGSSGEEPTSWEDLCSINLMP

Query:  SELFVKFRKELQGFRVGINLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVTRFLNLQVGIGHNFQMHATGWKWKLTTCLGGDGVS
        SELF+KFRKE+QG RVG+NLEFYNAP NEY+AKLVLKPL P++RWKFIYEP+  D+ +LSKKIP+T+FLNLQVG+GH+FQ+H TGWKWKLTTCLGG+G+S
Subjt:  SELFVKFRKELQGFRVGINLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVTRFLNLQVGIGHNFQMHATGWKWKLTTCLGGDGVS

Query:  RIRNKTSISPFPGMDLRFGWRADYVLPEITGDI
        RIRNKTS+  FPG+D RFGW ADYVLPEITG +
Subjt:  RIRNKTSISPFPGMDLRFGWRADYVLPEITGDI

XP_004144030.1 uncharacterized protein LOC101208324 [Cucumis sativus]4.5e-9992.06Show/hide
Query:  MDSGSSGEEPTSWEDLCSINLMPSELFVKFRKELQGFRVGINLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVTRFLNLQVGIGH
        MDSGSSG+EPTSWEDLCSINLMPSELF+KFRKELQGFRVG+NLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVT+FLNLQVGIGH
Subjt:  MDSGSSGEEPTSWEDLCSINLMPSELFVKFRKELQGFRVGINLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVTRFLNLQVGIGH

Query:  NFQMHATGWKWKLTTCLGGDGVSRIRNKTSISPFPGMDLRFGWRADYVLPEITGDIEFDRALGTGEPLFNMNSGKLEASLDRIEAILTH
        NFQMHATGWKWKLTTCLGGDGVSRIRNK+SISPFPG+D RFGWRADYVLPEITG      ALGTGEPLFNMNSG+LEASLDRIEAI+TH
Subjt:  NFQMHATGWKWKLTTCLGGDGVSRIRNKTSISPFPGMDLRFGWRADYVLPEITGDIEFDRALGTGEPLFNMNSGKLEASLDRIEAILTH

XP_008450920.1 PREDICTED: uncharacterized protein LOC103492367 [Cucumis melo]4.5e-9992.06Show/hide
Query:  MDSGSSGEEPTSWEDLCSINLMPSELFVKFRKELQGFRVGINLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVTRFLNLQVGIGH
        MDSGSSG+EPTSWEDLCSINLMPSELF+KFRKELQGFRVG+NLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVT+FLNLQVGIGH
Subjt:  MDSGSSGEEPTSWEDLCSINLMPSELFVKFRKELQGFRVGINLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVTRFLNLQVGIGH

Query:  NFQMHATGWKWKLTTCLGGDGVSRIRNKTSISPFPGMDLRFGWRADYVLPEITGDIEFDRALGTGEPLFNMNSGKLEASLDRIEAILTH
        NFQMHATGWKWKLTTCLGGDGVSRIRNK+SISPFPG+D RFGWRADYVLPEITG      ALGTGEPLFNMNSG+LEASLDRIEAI+TH
Subjt:  NFQMHATGWKWKLTTCLGGDGVSRIRNKTSISPFPGMDLRFGWRADYVLPEITGDIEFDRALGTGEPLFNMNSGKLEASLDRIEAILTH

XP_022147544.1 uncharacterized protein LOC111016448 isoform X1 [Momordica charantia]1.0e-10394.82Show/hide
Query:  NCFTMDSGSSGEEPTSWEDLCSINLMPSELFVKFRKELQGFRVGINLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVTRFLNLQV
        NCFTMDSGSSG+EPTSWEDLCSINLMPSELF+KFRKELQGFRVGINLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVTRFLNLQV
Subjt:  NCFTMDSGSSGEEPTSWEDLCSINLMPSELFVKFRKELQGFRVGINLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVTRFLNLQV

Query:  GIGHNFQMHATGWKWKLTTCLGGDGVSRIRNKTSISPFPGMDLRFGWRADYVLPEITGDIEFDRALGTGEPLFNMNSGKLEASLDRIEAILTH
        GIGHNFQMHATGWKWKLTTCLGGDGVSRIRNKTSISPFPGMD RFGWRADYVLPEITG      ALGTGEPLFNMNSGKLEASLDRIEAI+TH
Subjt:  GIGHNFQMHATGWKWKLTTCLGGDGVSRIRNKTSISPFPGMDLRFGWRADYVLPEITGDIEFDRALGTGEPLFNMNSGKLEASLDRIEAILTH

XP_022147546.1 uncharacterized protein LOC111016448 isoform X3 [Momordica charantia]1.1e-10094.71Show/hide
Query:  MDSGSSGEEPTSWEDLCSINLMPSELFVKFRKELQGFRVGINLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVTRFLNLQVGIGH
        MDSGSSG+EPTSWEDLCSINLMPSELF+KFRKELQGFRVGINLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVTRFLNLQVGIGH
Subjt:  MDSGSSGEEPTSWEDLCSINLMPSELFVKFRKELQGFRVGINLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVTRFLNLQVGIGH

Query:  NFQMHATGWKWKLTTCLGGDGVSRIRNKTSISPFPGMDLRFGWRADYVLPEITGDIEFDRALGTGEPLFNMNSGKLEASLDRIEAILTH
        NFQMHATGWKWKLTTCLGGDGVSRIRNKTSISPFPGMD RFGWRADYVLPEITG      ALGTGEPLFNMNSGKLEASLDRIEAI+TH
Subjt:  NFQMHATGWKWKLTTCLGGDGVSRIRNKTSISPFPGMDLRFGWRADYVLPEITGDIEFDRALGTGEPLFNMNSGKLEASLDRIEAILTH

TrEMBL top hitse value%identityAlignment
A0A0A0M256 Uncharacterized protein2.2e-9992.06Show/hide
Query:  MDSGSSGEEPTSWEDLCSINLMPSELFVKFRKELQGFRVGINLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVTRFLNLQVGIGH
        MDSGSSG+EPTSWEDLCSINLMPSELF+KFRKELQGFRVG+NLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVT+FLNLQVGIGH
Subjt:  MDSGSSGEEPTSWEDLCSINLMPSELFVKFRKELQGFRVGINLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVTRFLNLQVGIGH

Query:  NFQMHATGWKWKLTTCLGGDGVSRIRNKTSISPFPGMDLRFGWRADYVLPEITGDIEFDRALGTGEPLFNMNSGKLEASLDRIEAILTH
        NFQMHATGWKWKLTTCLGGDGVSRIRNK+SISPFPG+D RFGWRADYVLPEITG      ALGTGEPLFNMNSG+LEASLDRIEAI+TH
Subjt:  NFQMHATGWKWKLTTCLGGDGVSRIRNKTSISPFPGMDLRFGWRADYVLPEITGDIEFDRALGTGEPLFNMNSGKLEASLDRIEAILTH

A0A5D3CFM5 Uncharacterized protein2.2e-9992.06Show/hide
Query:  MDSGSSGEEPTSWEDLCSINLMPSELFVKFRKELQGFRVGINLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVTRFLNLQVGIGH
        MDSGSSG+EPTSWEDLCSINLMPSELF+KFRKELQGFRVG+NLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVT+FLNLQVGIGH
Subjt:  MDSGSSGEEPTSWEDLCSINLMPSELFVKFRKELQGFRVGINLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVTRFLNLQVGIGH

Query:  NFQMHATGWKWKLTTCLGGDGVSRIRNKTSISPFPGMDLRFGWRADYVLPEITGDIEFDRALGTGEPLFNMNSGKLEASLDRIEAILTH
        NFQMHATGWKWKLTTCLGGDGVSRIRNK+SISPFPG+D RFGWRADYVLPEITG      ALGTGEPLFNMNSG+LEASLDRIEAI+TH
Subjt:  NFQMHATGWKWKLTTCLGGDGVSRIRNKTSISPFPGMDLRFGWRADYVLPEITGDIEFDRALGTGEPLFNMNSGKLEASLDRIEAILTH

A0A6J1D1B3 uncharacterized protein LOC111016448 isoform X15.0e-10494.82Show/hide
Query:  NCFTMDSGSSGEEPTSWEDLCSINLMPSELFVKFRKELQGFRVGINLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVTRFLNLQV
        NCFTMDSGSSG+EPTSWEDLCSINLMPSELF+KFRKELQGFRVGINLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVTRFLNLQV
Subjt:  NCFTMDSGSSGEEPTSWEDLCSINLMPSELFVKFRKELQGFRVGINLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVTRFLNLQV

Query:  GIGHNFQMHATGWKWKLTTCLGGDGVSRIRNKTSISPFPGMDLRFGWRADYVLPEITGDIEFDRALGTGEPLFNMNSGKLEASLDRIEAILTH
        GIGHNFQMHATGWKWKLTTCLGGDGVSRIRNKTSISPFPGMD RFGWRADYVLPEITG      ALGTGEPLFNMNSGKLEASLDRIEAI+TH
Subjt:  GIGHNFQMHATGWKWKLTTCLGGDGVSRIRNKTSISPFPGMDLRFGWRADYVLPEITGDIEFDRALGTGEPLFNMNSGKLEASLDRIEAILTH

A0A6J1D2N1 uncharacterized protein LOC111016448 isoform X35.2e-10194.71Show/hide
Query:  MDSGSSGEEPTSWEDLCSINLMPSELFVKFRKELQGFRVGINLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVTRFLNLQVGIGH
        MDSGSSG+EPTSWEDLCSINLMPSELF+KFRKELQGFRVGINLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVTRFLNLQVGIGH
Subjt:  MDSGSSGEEPTSWEDLCSINLMPSELFVKFRKELQGFRVGINLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVTRFLNLQVGIGH

Query:  NFQMHATGWKWKLTTCLGGDGVSRIRNKTSISPFPGMDLRFGWRADYVLPEITGDIEFDRALGTGEPLFNMNSGKLEASLDRIEAILTH
        NFQMHATGWKWKLTTCLGGDGVSRIRNKTSISPFPGMD RFGWRADYVLPEITG      ALGTGEPLFNMNSGKLEASLDRIEAI+TH
Subjt:  NFQMHATGWKWKLTTCLGGDGVSRIRNKTSISPFPGMDLRFGWRADYVLPEITGDIEFDRALGTGEPLFNMNSGKLEASLDRIEAILTH

A5BGQ4 N-acetyltransferase domain-containing protein1.5e-11652.42Show/hide
Query:  MRKEISLQFLYANRQTAIEGVTKVQNDVCIRAMKDIMQRAI---SYRNIAAADIPPPP------HSAGFSGGA------------------VAPEERAMF
        M KEI LQF + NR  A +G+ K+  +VC+R +KDIM  AI     +N+  + +  PP      +  G  G                    V P++R MF
Subjt:  MRKEISLQFLYANRQTAIEGVTKVQNDVCIRAMKDIMQRAI---SYRNIAAADIPPPP------HSAGFSGGA------------------VAPEERAMF

Query:  VTFSKGYPVHEWEVREFFTREHGDCIESFQMQEVEPNEQALFARIDFRSASTIDSILRGQQRRKFIINGKHIWARKFIPKRLQP----PPPPPLPPQDGN
        VTFSKGYPV+EWEVREFF R +GDCIES  MQEVE NEQ+LFARI F SASTI+ IL G  + KF IN KH+WARKF+PKR +P     P     P    
Subjt:  VTFSKGYPVHEWEVREFFTREHGDCIESFQMQEVEPNEQALFARIDFRSASTIDSILRGQQRRKFIINGKHIWARKFIPKRLQP----PPPPPLPPQDGN

Query:  TGRH---------SPSTSPQALLVPISTRPSILGRGVRHEEIATGVKMMIRGGGCGE-EQRRSRKSEGKKMITNCFTMDSGSSGEEPTSWEDLCSINLMP
        + +H          P   P    +   T    L  G  H  I   +        CG  EQ ++  S   ++  NC  MD+GS+GEEP SW++L +INLMP
Subjt:  TGRH---------SPSTSPQALLVPISTRPSILGRGVRHEEIATGVKMMIRGGGCGE-EQRRSRKSEGKKMITNCFTMDSGSSGEEPTSWEDLCSINLMP

Query:  SELFVKFRKELQGFRVGINLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVTRFLNLQVGIGHNFQMHATGWKWKLTTCLGGDGVS
        SELF+KFRKE+QG RVG+NLEFYNAP NEY+AKLVLKPL P++RWKFIYEP+  D+ +LSKKIP+T+FLNLQVG+GH+FQ+H TGWKWKLTTCLGG+G+S
Subjt:  SELFVKFRKELQGFRVGINLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVTRFLNLQVGIGHNFQMHATGWKWKLTTCLGGDGVS

Query:  RIRNKTSISPFPGMDLRFGWRADYVLPEITGDI
        RIRNKTS+  FPG+D RFGW ADYVLPEITG +
Subjt:  RIRNKTSISPFPGMDLRFGWRADYVLPEITGDI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49290.1 unknown protein2.5e-2333.17Show/hide
Query:  EISLQFLYANRQTAIEGVTKVQNDVCIRAMKDIMQRA----------------------------------ISYRNIAAADIPPP-PH------------
        + +L+ +   R+    GVTK   DVC RA  D+ + A                                  +S + +  A  PPP PH            
Subjt:  EISLQFLYANRQTAIEGVTKVQNDVCIRAMKDIMQRA----------------------------------ISYRNIAAADIPPP-PH------------

Query:  -----SAGFSGGAVAPEERAMFVTFSKGYPVHEWEVREFFTREHGDCIESFQMQEVEPNEQALFAR--IDFRSASTIDSILRGQQRRKFIINGKHIWARK
                   G +A ++R +F+TFSKGYP+ E EVR +FTR  G+ IE+ +MQEVE NEQ LFA+  +  + AS +D I+  + R KF I+GKH+WARK
Subjt:  -----SAGFSGGAVAPEERAMFVTFSKGYPVHEWEVREFFTREHGDCIESFQMQEVEPNEQALFAR--IDFRSASTIDSILRGQQRRKFIINGKHIWARK

Query:  FIPKRLQP
        ++ K   P
Subjt:  FIPKRLQP

AT1G64870.1 unknown protein1.5e-2034Show/hide
Query:  MRKEISLQFLYANRQTAIEGVTKVQNDVCIRAMKDIMQRAIS-----------------------------YRNIAAADIPPPPHSAGFSGG--------
        M+ +ISLQ ++ +R TAI G+      +C R   DI+QR +                                NI A D  P  +S  F  G        
Subjt:  MRKEISLQFLYANRQTAIEGVTKVQNDVCIRAMKDIMQRAIS-----------------------------YRNIAAADIPPPPHSAGFSGG--------

Query:  AVAPEERAMFVTFSKGYPVHEWEVREFFTREHG-DCIESFQMQEVEPN------------EQALFARIDFRSASTIDSILRGQQRRKFIINGKHIWARKF
            +ER +F+TFS+G+PV   EV   FT  +G DC+ES  M E   N            +Q LFA++   S  T+D IL GQ+++K+ INGKHIWARKF
Subjt:  AVAPEERAMFVTFSKGYPVHEWEVREFFTREHG-DCIESFQMQEVEPN------------EQALFARIDFRSASTIDSILRGQQRRKFIINGKHIWARKF

AT2G06010.1 OBP3-responsive gene 45.3e-8275.14Show/hide
Query:  SSGEEPTSWEDLCSINLMPSELFVKFRKELQGFRVGINLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVTRFLNLQVGIGHNFQM
        S  EEP SWEDL  INLMPSELF+KFRKELQG RVG+NLE YN P N+Y AKLVLKPL P ++WKFIYEP+ Q++R+LSKKIPVTRFLNLQVG+GHNFQM
Subjt:  SSGEEPTSWEDLCSINLMPSELFVKFRKELQGFRVGINLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVTRFLNLQVGIGHNFQM

Query:  HATGWKWKLTTCLGGDGVSRIRNKTSISPFPGMDLRFGWRADYVLPEITGDIEFDRALGTGEPLFNMNSGKLEASLDRIEAILTH
        +A GWKWKLT+CLGGDGVSRIRNKT++   PG+D RFGWRAD+VLPE+TG      ALGT EPLFNM+SG+LEASLDR+EAI+TH
Subjt:  HATGWKWKLTTCLGGDGVSRIRNKTSISPFPGMDLRFGWRADYVLPEITGDIEFDRALGTGEPLFNMNSGKLEASLDRIEAILTH

AT3G45200.1 unknown protein1.3e-1930.69Show/hide
Query:  MRKEISLQFLYANRQTAIEGVTKVQNDVCIRAMKDIMQRAISYRNIAAAD--------IPPPPHSA-------------------------------GFS
        ++K ISLQ +Y NR +AI G+      VC R   DI+ R +   ++++ D        IP  PH                                 G++
Subjt:  MRKEISLQFLYANRQTAIEGVTKVQNDVCIRAMKDIMQRAISYRNIAAAD--------IPPPPHSA-------------------------------GFS

Query:  GGAVAPE-ERAMFVTFSKGYPVHEWEVREFFTREHGD-CIESFQMQEVEP-----------NEQALFARIDFRSASTIDSILRGQQRRKFIINGKHIWAR
           +A E +R +F+TFS+GYPV   E+ E FT+E+G+ C+E   MQ                +Q+LFAR+   S +T+D +L  +Q+++ +I GK+IWAR
Subjt:  GGAVAPE-ERAMFVTFSKGYPVHEWEVREFFTREHGD-CIESFQMQEVEP-----------NEQALFARIDFRSASTIDSILRGQQRRKFIINGKHIWAR

Query:  KF
        K+
Subjt:  KF

AT5G11220.1 unknown protein2.4e-2133Show/hide
Query:  MRKEISLQFLYANRQTAIEGVTKVQNDVCIRAMKDIMQRAI----SYRNIAAAD----IPPPPHSA-----------------------------GFSGG
        + K+ISLQ  + +R +AI G+      VC R   DI+QRA+    SY  +        IP  PH                               G++  
Subjt:  MRKEISLQFLYANRQTAIEGVTKVQNDVCIRAMKDIMQRAI----SYRNIAAAD----IPPPPHSA-----------------------------GFSGG

Query:  AVAPE-ERAMFVTFSKGYPVHEWEVREFFTREHGD-CIESFQMQEVEPN-----------EQALFARIDFRSASTIDSILRGQQRRKFIINGKHIWARKF
         +A + ER MF+TFS+G+PV + EV+ FFT+ +G+ C+E   M+E   N           +Q+LFA++   S +T+D IL G++ ++F  NGKHIWARK+
Subjt:  AVAPE-ERAMFVTFSKGYPVHEWEVREFFTREHGD-CIESFQMQEVEPN-----------EQALFARIDFRSASTIDSILRGQQRRKFIINGKHIWARKF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGAAGGAGATCTCCCTCCAGTTCCTCTACGCCAACCGCCAAACCGCCATTGAAGGAGTGACCAAAGTTCAAAACGATGTTTGTATCAGAGCAATGAAAGACATCAT
GCAGAGAGCCATAAGTTACAGAAACATCGCCGCCGCCGATATTCCACCACCACCACATTCAGCCGGATTTTCCGGTGGCGCAGTGGCGCCGGAAGAAAGGGCGATGTTCG
TGACGTTCTCGAAGGGGTACCCGGTGCATGAGTGGGAGGTCCGGGAGTTCTTCACCAGAGAGCATGGCGATTGCATAGAGTCGTTTCAGATGCAAGAAGTGGAGCCGAAC
GAGCAAGCGCTGTTCGCTCGGATAGATTTCCGGTCAGCCTCCACCATAGACTCGATTCTCCGCGGTCAGCAAAGAAGGAAGTTCATCATCAATGGGAAACATATCTGGGC
TCGGAAGTTCATCCCGAAGCGACTACAGCCGCCGCCGCCGCCGCCGCTGCCTCCACAGGATGGCAACACGGGGAGGCATTCTCCATCTACGTCCCCACAGGCACTTTTAG
TCCCCATCTCTACCCGTCCTTCCATTTTGGGGCGGGGGGTGAGGCATGAAGAGATAGCAACGGGCGTGAAGATGATGATCAGAGGCGGTGGCTGTGGAGAAGAACAAAGA
AGATCAAGAAAGAGTGAGGGGAAGAAGATGATCACCAATTGCTTCACTATGGATTCCGGATCAAGTGGCGAGGAACCCACTTCTTGGGAGGACCTTTGCAGTATCAATTT
GATGCCCTCCGAATTGTTTGTGAAGTTCCGCAAAGAGTTACAGGGCTTTCGAGTCGGTATCAATTTGGAGTTCTATAATGCTCCGTGTAATGAATATGAAGCCAAGCTTG
TGCTGAAGCCATTATATCCGAACCAGCGTTGGAAATTTATCTATGAGCCAATCCGTCAAGACATACGTCTTCTTTCCAAAAAGATTCCTGTAACCAGATTTTTAAATCTT
CAGGTTGGTATTGGACATAATTTTCAGATGCACGCCACTGGTTGGAAATGGAAGCTAACCACATGTTTGGGTGGGGATGGTGTATCTCGCATACGGAATAAGACATCAAT
TAGCCCATTTCCAGGGATGGATTTGCGCTTCGGATGGAGGGCGGATTACGTGCTTCCTGAAATTACAGGTGACATTGAATTCGACAGGGCTCTGGGTACCGGTGAACCAT
TGTTTAACATGAACTCAGGAAAGCTGGAAGCCTCCCTTGATAGAATTGAGGCCATTCTCACTCACAGGCGGACTCGGAATCTTCAAACGGAGACCAGATTGTCTTCTCCT
TTGTTTTCTCATGTGTGGAAATCGATGTTTTTTGCTTCTCTTCATGCTCAATTTGTTTATCTGACATTGTTTGTTGAAGAACAAGGGGGTAATTTCCCGCTCCTGCTCCG
TCTTCGTTTTCGTCTTCGTCTTCTGCCCTGTTCAGAGGGTGAATTTGTTCGCAGGAAGCTCGATTTCAATGTCGGCAATCTCGATTTATAG
mRNA sequenceShow/hide mRNA sequence
ATGAGGAAGGAGATCTCCCTCCAGTTCCTCTACGCCAACCGCCAAACCGCCATTGAAGGAGTGACCAAAGTTCAAAACGATGTTTGTATCAGAGCAATGAAAGACATCAT
GCAGAGAGCCATAAGTTACAGAAACATCGCCGCCGCCGATATTCCACCACCACCACATTCAGCCGGATTTTCCGGTGGCGCAGTGGCGCCGGAAGAAAGGGCGATGTTCG
TGACGTTCTCGAAGGGGTACCCGGTGCATGAGTGGGAGGTCCGGGAGTTCTTCACCAGAGAGCATGGCGATTGCATAGAGTCGTTTCAGATGCAAGAAGTGGAGCCGAAC
GAGCAAGCGCTGTTCGCTCGGATAGATTTCCGGTCAGCCTCCACCATAGACTCGATTCTCCGCGGTCAGCAAAGAAGGAAGTTCATCATCAATGGGAAACATATCTGGGC
TCGGAAGTTCATCCCGAAGCGACTACAGCCGCCGCCGCCGCCGCCGCTGCCTCCACAGGATGGCAACACGGGGAGGCATTCTCCATCTACGTCCCCACAGGCACTTTTAG
TCCCCATCTCTACCCGTCCTTCCATTTTGGGGCGGGGGGTGAGGCATGAAGAGATAGCAACGGGCGTGAAGATGATGATCAGAGGCGGTGGCTGTGGAGAAGAACAAAGA
AGATCAAGAAAGAGTGAGGGGAAGAAGATGATCACCAATTGCTTCACTATGGATTCCGGATCAAGTGGCGAGGAACCCACTTCTTGGGAGGACCTTTGCAGTATCAATTT
GATGCCCTCCGAATTGTTTGTGAAGTTCCGCAAAGAGTTACAGGGCTTTCGAGTCGGTATCAATTTGGAGTTCTATAATGCTCCGTGTAATGAATATGAAGCCAAGCTTG
TGCTGAAGCCATTATATCCGAACCAGCGTTGGAAATTTATCTATGAGCCAATCCGTCAAGACATACGTCTTCTTTCCAAAAAGATTCCTGTAACCAGATTTTTAAATCTT
CAGGTTGGTATTGGACATAATTTTCAGATGCACGCCACTGGTTGGAAATGGAAGCTAACCACATGTTTGGGTGGGGATGGTGTATCTCGCATACGGAATAAGACATCAAT
TAGCCCATTTCCAGGGATGGATTTGCGCTTCGGATGGAGGGCGGATTACGTGCTTCCTGAAATTACAGGTGACATTGAATTCGACAGGGCTCTGGGTACCGGTGAACCAT
TGTTTAACATGAACTCAGGAAAGCTGGAAGCCTCCCTTGATAGAATTGAGGCCATTCTCACTCACAGGCGGACTCGGAATCTTCAAACGGAGACCAGATTGTCTTCTCCT
TTGTTTTCTCATGTGTGGAAATCGATGTTTTTTGCTTCTCTTCATGCTCAATTTGTTTATCTGACATTGTTTGTTGAAGAACAAGGGGGTAATTTCCCGCTCCTGCTCCG
TCTTCGTTTTCGTCTTCGTCTTCTGCCCTGTTCAGAGGGTGAATTTGTTCGCAGGAAGCTCGATTTCAATGTCGGCAATCTCGATTTATAG
Protein sequenceShow/hide protein sequence
MRKEISLQFLYANRQTAIEGVTKVQNDVCIRAMKDIMQRAISYRNIAAADIPPPPHSAGFSGGAVAPEERAMFVTFSKGYPVHEWEVREFFTREHGDCIESFQMQEVEPN
EQALFARIDFRSASTIDSILRGQQRRKFIINGKHIWARKFIPKRLQPPPPPPLPPQDGNTGRHSPSTSPQALLVPISTRPSILGRGVRHEEIATGVKMMIRGGGCGEEQR
RSRKSEGKKMITNCFTMDSGSSGEEPTSWEDLCSINLMPSELFVKFRKELQGFRVGINLEFYNAPCNEYEAKLVLKPLYPNQRWKFIYEPIRQDIRLLSKKIPVTRFLNL
QVGIGHNFQMHATGWKWKLTTCLGGDGVSRIRNKTSISPFPGMDLRFGWRADYVLPEITGDIEFDRALGTGEPLFNMNSGKLEASLDRIEAILTHRRTRNLQTETRLSSP
LFSHVWKSMFFASLHAQFVYLTLFVEEQGGNFPLLLRLRFRLRLLPCSEGEFVRRKLDFNVGNLDL