CuGenDBv2

HG10022249 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10022249
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: N-lysine methyltransferase SETD6 isoform X1
Genome location: Chr05:22314561..22322490
RNA-Seq Expression: HG10022249
Synteny: HG10022249
Gene Ontology terms: GO:0016740 - transferase activity (molecular function)
InterPro domains: IPR036464 - Rubisco LSMT, substrate-binding domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_008463512.1 PREDICTED: ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit N-methyltransferase, chloroplastic isoform X2 [Cucumis melo] (e value: 1.1e-179; % identity: 69.2)
Query:  MLLGIRFNNIWRWRASPVISTSHAFHFNGHFSTSPACEELHSLTGSNDDTFLPWLERKAKMKISSVLSIGKSSIGRFLFASETIRAGDCILK-----QIS
        MLLG+R  NIWRW+ SPV+STS+AF+FN HFST    +EL+SL  SNDDTFLPWLE+KA+ KISSVLSIGKSSIGRFLFASETIRAGDCILK     QIS
Subjt:  MLLGIRFNNIWRWRASPVISTSHAFHFNGHFSTSPACEELHSLTGSNDDTFLPWLERKAKMKISSVLSIGKSSIGRFLFASETIRAGDCILK-----QIS

Query:  PDILPLPIRDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKALESFPEIID
        PDILPLPIRDLLGNEIGNVAKLAVVILLEQKLG GSEWAPYI RLPQP EMHNTIFW ESELEMIRKS LYEESLNQRSQI+REF AIRKALE+FPEIID
Subjt:  PDILPLPIRDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKALESFPEIID

Query:  RINCDDFMHAYSLVTSRAWRSQKDVSL---------------------------VIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQALLSYR
        RI+CDDFMHAY+LVTSRAWRS + VSL                           V+ADRDYAPGEHVLIRYGKYSNATLMLDFGF LPYNIHDQ  +  +
Subjt:  RINCDDFMHAYSLVTSRAWRSQKDVSL---------------------------VIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQALLSYR

Query:  NYPCYIDYTETESSFEFKDFALSCFPYVYKCSQVDTERLSEKNFIAWEVRSATGKGRGLPQSLRAFARILSCSNPQGLYLILYPLSFLHWQEEFDIFLVL
                 +  +  + + +  SC P       V     +  +F   EVR ATGKGRGLPQSLRAFARILSC+NPQ                        
Subjt:  NYPCYIDYTETESSFEFKDFALSCFPYVYKCSQVDTERLSEKNFIAWEVRSATGKGRGLPQSLRAFARILSCSNPQGLYLILYPLSFLHWQEEFDIFLVL

Query:  VDCTELNELSSEAVNGDGRLARIPLKNVNKEVEAHRILLSQFKQLVEEYNASIEALGPIDPLCLCKKLARRRLMAQHLLTGELRIIKSAIAWLENYCDAI
            ELNELS EAVNGDGRLARIPLKNVNKEVEAHRILLSQFKQLVEEYNASIEA+ P+D  CLC+KLARRRLMAQHLLTGE+RI+KSAIAWLENYCDAI
Subjt:  VDCTELNELSSEAVNGDGRLARIPLKNVNKEVEAHRILLSQFKQLVEEYNASIEALGPIDPLCLCKKLARRRLMAQHLLTGELRIIKSAIAWLENYCDAI

XP_011655345.1 actin-histidine N-methyltransferase [Cucumis sativus] (e value: 2.4e-179; % identity: 68.8)
Query:  MLLGIRFNNIWRWRASPVISTSHAFHFNGHFSTSPACEELHSLTGSNDDTFLPWLERKAKMKISSVLSIGKSSIGRFLFASETIRAGDCILK-----QIS
        M LGIR NNIWRW+ SPV+ TS+AF+FN HFSTS   +EL+SL  SNDDTFLPWLERKA+ KISSVLSIGKSSIGRFLFASETIRAGDCILK     QIS
Subjt:  MLLGIRFNNIWRWRASPVISTSHAFHFNGHFSTSPACEELHSLTGSNDDTFLPWLERKAKMKISSVLSIGKSSIGRFLFASETIRAGDCILK-----QIS

Query:  PDILPLPIRDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKALESFPEIID
        PD LPLPIRDLLGNEIGNVAKLAVV+LLE KLG GSEWAPYI RLPQPWEMHNTIFW ESELEMIRKSSLYEESLNQRSQI+REFLAIRKALE+FPEIID
Subjt:  PDILPLPIRDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKALESFPEIID

Query:  RINCDDFMHAYSLVTSRAWRSQKDVSL---------------------------VIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQALLSYR
        RI+CDDFMHAY+LVTSRAWRS + VSL                           V+ADRD+APGEHVLIRYGKYSNATLMLDFGFALPYNIHDQ      
Subjt:  RINCDDFMHAYSLVTSRAWRSQKDVSL---------------------------VIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQALLSYR

Query:  NYPCYIDYTETESSFEFKDFALSCFPYVYKCSQVDTERLSEKNFIAWEVRSATGKGRGLPQSLRAFARILSCSNPQGLYLILYPLSFLHWQEEFDIFLVL
          P      +  +  + + +  SC         V     +  +F   EVRSATGKGRGLPQSLRAFARILSC+NPQ                        
Subjt:  NYPCYIDYTETESSFEFKDFALSCFPYVYKCSQVDTERLSEKNFIAWEVRSATGKGRGLPQSLRAFARILSCSNPQGLYLILYPLSFLHWQEEFDIFLVL

Query:  VDCTELNELSSEAVNGDGRLARIPLKNVNKEVEAHRILLSQFKQLVEEYNASIEALGPIDPLCLCKKLARRRLMAQHLLTGELRIIKSAIAWLENYCDAI
            ELN+LSSEAVNGDGRLARIPLKNV KEVEAHRILLSQFKQLVEEYNASIEA+ P+D  CL +KLARRRL+AQHLLTGE+R++KSAIAWLENYC+AI
Subjt:  VDCTELNELSSEAVNGDGRLARIPLKNVNKEVEAHRILLSQFKQLVEEYNASIEALGPIDPLCLCKKLARRRLMAQHLLTGELRIIKSAIAWLENYCDAI

XP_038889893.1 fructose-bisphosphate aldolase-lysine N-methyltransferase, chloroplastic isoform X1 [Benincasa hispida] (e value: 8.7e-198; % identity: 74.4)
Query:  MLLGIRFNNIWRWRASPVISTSHAFHFNGHFSTSPACEELHSLTGSNDDTFLPWLERKAKMKISSVLSIGKSSIGRFLFASETIRAGDCILK-----QIS
        MLLGIRFN I RWR SPVIS SHAFHFNG+FST+   EEL  L  SN+DTFLPWLERKA+MKISSVLSIGKSSIGRFLFASETIRAGDCILK     QIS
Subjt:  MLLGIRFNNIWRWRASPVISTSHAFHFNGHFSTSPACEELHSLTGSNDDTFLPWLERKAKMKISSVLSIGKSSIGRFLFASETIRAGDCILK-----QIS

Query:  PDILPLPIRDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKALESFPEIID
        PD+LPLPI++LLGNEIGNVAKLAVV+LLEQKLG GSEWAPYITRLPQPWEMHNT+FWNESELEMIR SSLYEESLNQRSQIEREFLAIRKALE+FPEIID
Subjt:  PDILPLPIRDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKALESFPEIID

Query:  RINCDDFMHAYSLVTSRAWRSQKDVSL---------------------------VIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQALLSYR
        RINCDDFMHA++LVTSRAWRS + VSL                           VIADRDYAPGE VLIRYGKYSNATLMLDFGFALPYNIHDQ  +  +
Subjt:  RINCDDFMHAYSLVTSRAWRSQKDVSL---------------------------VIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQALLSYR

Query:  NYPCYIDYTETESSFEFKDFALSCFPYVYKCSQVDTERLSEKNFIAWEVRSATGKGRGLPQSLRAFARILSCSNPQGLYLILYPLSFLHWQEEFDIFLVL
              D  +  +  + + +  SC P       V +   +  +F   EVR ATGKGRGLPQSLRAFARILSCSNPQGLYL LY LSFL+W +  DI LVL
Subjt:  NYPCYIDYTETESSFEFKDFALSCFPYVYKCSQVDTERLSEKNFIAWEVRSATGKGRGLPQSLRAFARILSCSNPQGLYLILYPLSFLHWQEEFDIFLVL

Query:  VDCTELNELSSEAVNGDGRLARIPLKNVNKEVEAHRILLSQFKQLVEEYNASIEALGPIDPLCLCKKLARRRLMAQHLLTGELRIIKSAIAWLENYCDAI
        V CTELNELS+EAVNGDGRLARIPLKNVNKEVEAH+ILLSQFKQLVEEYNASIEAL P+D  CLCKKLA RRLMA HLLTGELRI+KSAIAWLE YCDAI
Subjt:  VDCTELNELSSEAVNGDGRLARIPLKNVNKEVEAHRILLSQFKQLVEEYNASIEALGPIDPLCLCKKLARRRLMAQHLLTGELRIIKSAIAWLENYCDAI

XP_038889894.1 fructose-bisphosphate aldolase-lysine N-methyltransferase, chloroplastic isoform X2 [Benincasa hispida] (e value: 1.2e-199; % identity: 75.15)
Query:  MLLGIRFNNIWRWRASPVISTSHAFHFNGHFSTSPACEELHSLTGSNDDTFLPWLERKAKMKISSVLSIGKSSIGRFLFASETIRAGDCILKQISPDILP
        MLLGIRFN I RWR SPVIS SHAFHFNG+FST+   EEL  L  SN+DTFLPWLERKA+MKISSVLSIGKSSIGRFLFASETIRAGDCILKQISPD+LP
Subjt:  MLLGIRFNNIWRWRASPVISTSHAFHFNGHFSTSPACEELHSLTGSNDDTFLPWLERKAKMKISSVLSIGKSSIGRFLFASETIRAGDCILKQISPDILP

Query:  LPIRDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKALESFPEIIDRINCD
        LPI++LLGNEIGNVAKLAVV+LLEQKLG GSEWAPYITRLPQPWEMHNT+FWNESELEMIR SSLYEESLNQRSQIEREFLAIRKALE+FPEIIDRINCD
Subjt:  LPIRDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKALESFPEIIDRINCD

Query:  DFMHAYSLVTSRAWRSQKDVSL---------------------------VIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQALLSYRNYPCY
        DFMHA++LVTSRAWRS + VSL                           VIADRDYAPGE VLIRYGKYSNATLMLDFGFALPYNIHDQ  +  +     
Subjt:  DFMHAYSLVTSRAWRSQKDVSL---------------------------VIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQALLSYRNYPCY

Query:  IDYTETESSFEFKDFALSCFPYVYKCSQVDTERLSEKNFIAWEVRSATGKGRGLPQSLRAFARILSCSNPQGLYLILYPLSFLHWQEEFDIFLVLVDCTE
         D  +  +  + + +  SC P       V +   +  +F   EVR ATGKGRGLPQSLRAFARILSCSNPQGLYL LY LSFL+W +  DI LVLV CTE
Subjt:  IDYTETESSFEFKDFALSCFPYVYKCSQVDTERLSEKNFIAWEVRSATGKGRGLPQSLRAFARILSCSNPQGLYLILYPLSFLHWQEEFDIFLVLVDCTE

Query:  LNELSSEAVNGDGRLARIPLKNVNKEVEAHRILLSQFKQLVEEYNASIEALGPIDPLCLCKKLARRRLMAQHLLTGELRIIKSAIAWLENYCDAI
        LNELS+EAVNGDGRLARIPLKNVNKEVEAH+ILLSQFKQLVEEYNASIEAL P+D  CLCKKLA RRLMA HLLTGELRI+KSAIAWLE YCDAI
Subjt:  LNELSSEAVNGDGRLARIPLKNVNKEVEAHRILLSQFKQLVEEYNASIEALGPIDPLCLCKKLARRRLMAQHLLTGELRIIKSAIAWLENYCDAI

XP_038889895.1 fructose-bisphosphate aldolase-lysine N-methyltransferase, chloroplastic isoform X3 [Benincasa hispida] (e value: 5.5e-184; % identity: 70.6)
Query:  MLLGIRFNNIWRWRASPVISTSHAFHFNGHFSTSPACEELHSLTGSNDDTFLPWLERKAKMKISSVLSIGKSSIGRFLFASETIRAGDCILK-----QIS
        MLLGIRFN I RWR SPVIS SHAFHFNG+FST+   EEL  L  SN+DTFLPWLERKA+MKISSVLSIGKSSIGRFLFASETIRAGDCILK     QIS
Subjt:  MLLGIRFNNIWRWRASPVISTSHAFHFNGHFSTSPACEELHSLTGSNDDTFLPWLERKAKMKISSVLSIGKSSIGRFLFASETIRAGDCILK-----QIS

Query:  PDILPLPIRDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKALESFPEIID
        PD+LPLPI++LLGNEIGNVAKLAVV+LLEQKLG GSEWAPYITRLPQPWEMHNT+FWNESELEMIR SSLYEESLNQRSQIEREFLAIRKALE+FPEIID
Subjt:  PDILPLPIRDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKALESFPEIID

Query:  RINCDDFMHAYSLVTSRAWRSQKDVSL---------------------------VIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQALLSYR
        RINCDDFMHA++LVTSRAWRS + VSL                           VIADRDYAPGE VLIRYGKYSNATLMLDFGFALPYNIHDQ  +  +
Subjt:  RINCDDFMHAYSLVTSRAWRSQKDVSL---------------------------VIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQALLSYR

Query:  NYPCYIDYTETESSFEFKDFALSCFPYVYKCSQVDTERLSEKNFIAWEVRSATGKGRGLPQSLRAFARILSCSNPQGLYLILYPLSFLHWQEEFDIFLVL
              D  +  +  + + +  SC P       V +   +  +F   EVR ATGKGRGLPQSLRAFARILSCSNPQ                        
Subjt:  NYPCYIDYTETESSFEFKDFALSCFPYVYKCSQVDTERLSEKNFIAWEVRSATGKGRGLPQSLRAFARILSCSNPQGLYLILYPLSFLHWQEEFDIFLVL

Query:  VDCTELNELSSEAVNGDGRLARIPLKNVNKEVEAHRILLSQFKQLVEEYNASIEALGPIDPLCLCKKLARRRLMAQHLLTGELRIIKSAIAWLENYCDAI
            ELNELS+EAVNGDGRLARIPLKNVNKEVEAH+ILLSQFKQLVEEYNASIEAL P+D  CLCKKLA RRLMA HLLTGELRI+KSAIAWLE YCDAI
Subjt:  VDCTELNELSSEAVNGDGRLARIPLKNVNKEVEAHRILLSQFKQLVEEYNASIEALGPIDPLCLCKKLARRRLMAQHLLTGELRIIKSAIAWLENYCDAI

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KNP0 Uncharacterized protein (e value: 8.9e-180; % identity: 68.94)
Query:  MLLGIRFNNIWRWRASPVISTSHAFHFNGHFSTSPACEELHSLTGSNDDTFLPWLERKAKMKISSVLSIGKSSIGRFLFASETIRAGDCILK-----QIS
        M LGIR NNIWRW+ SPV+ TS+AF+FN HFSTS   +EL+SL  SNDDTFLPWLERKA+ KISSVLSIGKSSIGRFLFASETIRAGDCILK     QIS
Subjt:  MLLGIRFNNIWRWRASPVISTSHAFHFNGHFSTSPACEELHSLTGSNDDTFLPWLERKAKMKISSVLSIGKSSIGRFLFASETIRAGDCILK-----QIS

Query:  PDILPLPIRDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKALESFPEIID
        PD LPLPIRDLLGNEIGNVAKLAVV+LLE KLG GSEWAPYI RLPQPWEMHNTIFW ESELEMIRKSSLYEESLNQRSQI+REFLAIRKALE+FPEIID
Subjt:  PDILPLPIRDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKALESFPEIID

Query:  RINCDDFMHAYSLVTSRAWRSQKDVSL--------------------------VIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQALLSYRN
        RI+CDDFMHAY+LVTSRAWRS + VSL                          V+ADRD+APGEHVLIRYGKYSNATLMLDFGFALPYNIHDQ       
Subjt:  RINCDDFMHAYSLVTSRAWRSQKDVSL--------------------------VIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQALLSYRN

Query:  YPCYIDYTETESSFEFKDFALSCFPYVYKCSQVDTERLSEKNFIAWEVRSATGKGRGLPQSLRAFARILSCSNPQGLYLILYPLSFLHWQEEFDIFLVLV
         P      +  +  + + +  SC         V     +  +F   EVRSATGKGRGLPQSLRAFARILSC+NPQ                         
Subjt:  YPCYIDYTETESSFEFKDFALSCFPYVYKCSQVDTERLSEKNFIAWEVRSATGKGRGLPQSLRAFARILSCSNPQGLYLILYPLSFLHWQEEFDIFLVLV

Query:  DCTELNELSSEAVNGDGRLARIPLKNVNKEVEAHRILLSQFKQLVEEYNASIEALGPIDPLCLCKKLARRRLMAQHLLTGELRIIKSAIAWLENYCDAI
           ELN+LSSEAVNGDGRLARIPLKNV KEVEAHRILLSQFKQLVEEYNASIEA+ P+D  CL +KLARRRL+AQHLLTGE+R++KSAIAWLENYC+AI
Subjt:  DCTELNELSSEAVNGDGRLARIPLKNVNKEVEAHRILLSQFKQLVEEYNASIEALGPIDPLCLCKKLARRRLMAQHLLTGELRIIKSAIAWLENYCDAI

A0A1S3CJF6 histone-lysine N-methyltransferase setd3 isoform X1 (e value: 2.6e-179; % identity: 68.38)
Query:  MLLGIRFNNIWRWRASPVISTSHAFHFNGHFSTSPACEELHSLTGSNDDTFLPWLERKAKMKISSVLSIGKSSIGRFLFASETIRAGDCILK--------
        MLLG+R  NIWRW+ SPV+STS+AF+FN HFST    +EL+SL  SNDDTFLPWLE+KA+ KISSVLSIGKSSIGRFLFASETIRAGDCILK        
Subjt:  MLLGIRFNNIWRWRASPVISTSHAFHFNGHFSTSPACEELHSLTGSNDDTFLPWLERKAKMKISSVLSIGKSSIGRFLFASETIRAGDCILK--------

Query:  ---QISPDILPLPIRDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKALES
           QISPDILPLPIRDLLGNEIGNVAKLAVVILLEQKLG GSEWAPYI RLPQP EMHNTIFW ESELEMIRKS LYEESLNQRSQI+REF AIRKALE+
Subjt:  ---QISPDILPLPIRDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKALES

Query:  FPEIIDRINCDDFMHAYSLVTSRAWRSQKDVSL---------------------------VIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQ
        FPEIIDRI+CDDFMHAY+LVTSRAWRS + VSL                           V+ADRDYAPGEHVLIRYGKYSNATLMLDFGF LPYNIHDQ
Subjt:  FPEIIDRINCDDFMHAYSLVTSRAWRSQKDVSL---------------------------VIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQ

Query:  ALLSYRNYPCYIDYTETESSFEFKDFALSCFPYVYKCSQVDTERLSEKNFIAWEVRSATGKGRGLPQSLRAFARILSCSNPQGLYLILYPLSFLHWQEEF
          +  +         +  +  + + +  SC P       V     +  +F   EVR ATGKGRGLPQSLRAFARILSC+NPQ                  
Subjt:  ALLSYRNYPCYIDYTETESSFEFKDFALSCFPYVYKCSQVDTERLSEKNFIAWEVRSATGKGRGLPQSLRAFARILSCSNPQGLYLILYPLSFLHWQEEF

Query:  DIFLVLVDCTELNELSSEAVNGDGRLARIPLKNVNKEVEAHRILLSQFKQLVEEYNASIEALGPIDPLCLCKKLARRRLMAQHLLTGELRIIKSAIAWLE
                  ELNELS EAVNGDGRLARIPLKNVNKEVEAHRILLSQFKQLVEEYNASIEA+ P+D  CLC+KLARRRLMAQHLLTGE+RI+KSAIAWLE
Subjt:  DIFLVLVDCTELNELSSEAVNGDGRLARIPLKNVNKEVEAHRILLSQFKQLVEEYNASIEALGPIDPLCLCKKLARRRLMAQHLLTGELRIIKSAIAWLE

Query:  NYCDAI
        NYCDAI
Subjt:  NYCDAI

A0A1S3CJW1 ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit N-methyltransferase, chloroplastic isoform X2 (e value: 5.2e-180; % identity: 69.2)
Query:  MLLGIRFNNIWRWRASPVISTSHAFHFNGHFSTSPACEELHSLTGSNDDTFLPWLERKAKMKISSVLSIGKSSIGRFLFASETIRAGDCILK-----QIS
        MLLG+R  NIWRW+ SPV+STS+AF+FN HFST    +EL+SL  SNDDTFLPWLE+KA+ KISSVLSIGKSSIGRFLFASETIRAGDCILK     QIS
Subjt:  MLLGIRFNNIWRWRASPVISTSHAFHFNGHFSTSPACEELHSLTGSNDDTFLPWLERKAKMKISSVLSIGKSSIGRFLFASETIRAGDCILK-----QIS

Query:  PDILPLPIRDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKALESFPEIID
        PDILPLPIRDLLGNEIGNVAKLAVVILLEQKLG GSEWAPYI RLPQP EMHNTIFW ESELEMIRKS LYEESLNQRSQI+REF AIRKALE+FPEIID
Subjt:  PDILPLPIRDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKALESFPEIID

Query:  RINCDDFMHAYSLVTSRAWRSQKDVSL---------------------------VIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQALLSYR
        RI+CDDFMHAY+LVTSRAWRS + VSL                           V+ADRDYAPGEHVLIRYGKYSNATLMLDFGF LPYNIHDQ  +  +
Subjt:  RINCDDFMHAYSLVTSRAWRSQKDVSL---------------------------VIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQALLSYR

Query:  NYPCYIDYTETESSFEFKDFALSCFPYVYKCSQVDTERLSEKNFIAWEVRSATGKGRGLPQSLRAFARILSCSNPQGLYLILYPLSFLHWQEEFDIFLVL
                 +  +  + + +  SC P       V     +  +F   EVR ATGKGRGLPQSLRAFARILSC+NPQ                        
Subjt:  NYPCYIDYTETESSFEFKDFALSCFPYVYKCSQVDTERLSEKNFIAWEVRSATGKGRGLPQSLRAFARILSCSNPQGLYLILYPLSFLHWQEEFDIFLVL

Query:  VDCTELNELSSEAVNGDGRLARIPLKNVNKEVEAHRILLSQFKQLVEEYNASIEALGPIDPLCLCKKLARRRLMAQHLLTGELRIIKSAIAWLENYCDAI
            ELNELS EAVNGDGRLARIPLKNVNKEVEAHRILLSQFKQLVEEYNASIEA+ P+D  CLC+KLARRRLMAQHLLTGE+RI+KSAIAWLENYCDAI
Subjt:  VDCTELNELSSEAVNGDGRLARIPLKNVNKEVEAHRILLSQFKQLVEEYNASIEALGPIDPLCLCKKLARRRLMAQHLLTGELRIIKSAIAWLENYCDAI

A0A6J1GPC6 fructose-bisphosphate aldolase-lysine N-methyltransferase, chloroplastic isoform X1 (e value: 1.0e-167; % identity: 63.98)
Query:  MLLGIRFNNIWRWRASPVISTSHAFHFNGHFSTSPACEELHS--------LTGSNDDTFLPWLERKAKMKISSVLSIGKSSIGRFLFASETIRAGDCILK
        MLLG RF N+WRW  SP ISTSHAFHFN  F TS   E   S         T   DD FLPWLERK+  +ISSVLSIGKS IGRFLFASETIRAGDCILK
Subjt:  MLLGIRFNNIWRWRASPVISTSHAFHFNGHFSTSPACEELHS--------LTGSNDDTFLPWLERKAKMKISSVLSIGKSSIGRFLFASETIRAGDCILK

Query:  -----QISPDILPLPIRDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKAL
             QISPD+LP PIRDLLG+EIGNVAK+A+VILLEQKLG  S+WAPYI RLP+PWEMHNTIFW+E ELEMIRKSSLYEESLNQRSQIEREFLAI++AL
Subjt:  -----QISPDILPLPIRDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKAL

Query:  ESFPEIIDRINCDDFMHAYSLVTSRAWRSQKDVSL---------------------------VIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIH
        E+FPEI+D INCDDFMHAY+LVTSRAWRS K  SL                           VIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIH
Subjt:  ESFPEIIDRINCDDFMHAYSLVTSRAWRSQKDVSL---------------------------VIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIH

Query:  DQALLSYRNYPCYIDYTETESSFEFKDFALSCFPYVYKCSQVDTERLSEKNFIAWEVRSATGKGRGLPQSLRAFARILSCSNPQGLYLILYPLSFLHWQE
        DQ L+  +        T  +     +   L           V+       +F   EVRSATGKGRGLPQSLRAFARILSC++PQ                
Subjt:  DQALLSYRNYPCYIDYTETESSFEFKDFALSCFPYVYKCSQVDTERLSEKNFIAWEVRSATGKGRGLPQSLRAFARILSCSNPQGLYLILYPLSFLHWQE

Query:  EFDIFLVLVDCTELNELSSEAVNGDGRLARIPLKNVNKEVEAHRILLSQFKQLVEEYNASIEALGPIDPLCLCKKLARRRLMAQHLLTGELRIIKSAIAW
                    ELN+L +EA NGDGRLARIPLKN+N+EVEAH+ILLSQFKQLVEEY ASIEAL P+D  C CKK+A+RRLMAQHLLTGELRI+KSA AW
Subjt:  EFDIFLVLVDCTELNELSSEAVNGDGRLARIPLKNVNKEVEAHRILLSQFKQLVEEYNASIEALGPIDPLCLCKKLARRRLMAQHLLTGELRIIKSAIAW

Query:  LENYCDAI
        L NYCD I
Subjt:  LENYCDAI

A0A6J1JPU4 fructose-bisphosphate aldolase-lysine N-methyltransferase, chloroplastic isoform X1 (e value: 5.4e-161; % identity: 62.4)
Query:  MLLGIRFNNIWRWRASPVISTSHAFHFNGHFSTSPACEELHS--------LTGSNDDTFLPWLERKAKMKISSVLSIGKSSIGRFLFASETIRAGDCILK
        MLLG RF N+WRW  SP I TSH+F  N  F TS   E   S         T   DD FLPWLERK+   ISSVLSIGKS IGRFLFASETIRAGDCILK
Subjt:  MLLGIRFNNIWRWRASPVISTSHAFHFNGHFSTSPACEELHS--------LTGSNDDTFLPWLERKAKMKISSVLSIGKSSIGRFLFASETIRAGDCILK

Query:  -----QISPDILPLPIRDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKAL
             QISPD+LP PIRDLLG+EIGNVAK+A+VILLEQKLG  S+WAPYI RLP+PWEMHNTIFW+E ELEMIRK SL+EESLNQRSQIEREFLAI++AL
Subjt:  -----QISPDILPLPIRDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKAL

Query:  ESFPEIIDRINCDDFMHAYSLVTSRAWRSQKDVSL---------------------------VIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIH
        E+FPEIID IN DDFMHAY+LVTSRAWRS K  SL                           VIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIH
Subjt:  ESFPEIIDRINCDDFMHAYSLVTSRAWRSQKDVSL---------------------------VIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIH

Query:  DQALLSYRNYPCYIDYTETESSFEFKDFALSCFPYVYKCSQVDTERLSEKNFIAWEVRSATGKGRGLPQSLRAFARILSCSNPQGLYLILYPLSFLHWQE
        DQ L+  +        T  +     +   L           V+       +F   EV+SATGKGRGLPQSLRAFARILSC++PQ                
Subjt:  DQALLSYRNYPCYIDYTETESSFEFKDFALSCFPYVYKCSQVDTERLSEKNFIAWEVRSATGKGRGLPQSLRAFARILSCSNPQGLYLILYPLSFLHWQE

Query:  EFDIFLVLVDCTELNELSSEAVNGDGRLARIPLKNVNKEVEAHRILLSQFKQLVEEYNASIEALGPIDPLCLCKKLARRRLMAQHLLTGELRIIKSAIAW
                    ELN+L +EA NGDGRLARIPLKN+N+EVEAH+ILLSQFKQ VEEY ASIEAL P+D  C CKK+A+RRLMAQHLLTGELRI+KSA AW
Subjt:  EFDIFLVLVDCTELNELSSEAVNGDGRLARIPLKNVNKEVEAHRILLSQFKQLVEEYNASIEALGPIDPLCLCKKLARRRLMAQHLLTGELRIIKSAIAW

Query:  LENYCDAI
        L NYCD I
Subjt:  LENYCDAI

SwissProt top hits (e value, % identity, alignment)
B0VX69 Actin-histidine N-methyltransferase (e value: 5.1e-07; % identity: 23.5)
Query:  PLPIRDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKALESFPE-----II
        PL  +D +   +GN+A LA  +L E +  P S W PYI  LP   E    +++ E E+  ++ +    +  +Q     R++    K +++ P      + 
Subjt:  PLPIRDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKALESFPE-----II

Query:  DRINCDDFMHAYSLVTSR--------------------------------AWRSQKDVSLVIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQ
        D    +D+  A S V +R                                 +  + D    +A +D+  GE + I YG  SNA  ++  GF    N HD+
Subjt:  DRINCDDFMHAYSLVTSR--------------------------------AWRSQKDVSLVIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQ

B7ZUF3 Actin-histidine N-methyltransferase (e value: 7.8e-08; % identity: 24)
Query:  PLPIRDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKALESFPE-----II
        PL  +D +   +GN+  LA  +L E +  P S W PYI  LP   E    +++NE E++ ++ +    +  +Q     R++    K +++ P      + 
Subjt:  PLPIRDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKALESFPE-----II

Query:  DRINCDDFMHAYSLVTSR--------------------------------AWRSQKDVSLVIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQ
        D    DD+  A S V +R                                 +  + D    +A +D+  GE + I YG  SNA  ++  GF    N+HD+
Subjt:  DRINCDDFMHAYSLVTSR--------------------------------AWRSQKDVSLVIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQ

Q12504 Ribosomal lysine N-methyltransferase 4 (e value: 2.1e-08; % identity: 23.04)
Query:  DTFLPWLERKAKMKISSVLSIGK---SSIGRFLFASETIRAGDCILKQISPDILPL----------PIRDLLGNEIGNVAKLAVVILLEQK-LGPGSEWA
        + F+ WL+  A++++S  + I      + GR + A++ I+  + + K     +L +           ++D   NE G+   L + IL E + L   S WA
Subjt:  DTFLPWLERKAKMKISSVLSIGK---SSIGRFLFASETIRAGDCILKQISPDILPL----------PIRDLLGNEIGNVAKLAVVILLEQK-LGPGSEWA

Query:  PYITRLPQPWEMHNTIFWNESELEMIRKSSLYEE--SLNQRSQIEREFLAIRKALESFPEIIDRINCDDFMHAYSLVTSRAWRSQKDVSLVIADRDYAPG
        PY     +P +M+  IFW+++EL++++ S + E       +   ER   +I++    F  +      D+F +  S++ S ++  +   S V  + +    
Subjt:  PYITRLPQPWEMHNTIFWNESELEMIRKSSLYEE--SLNQRSQIEREFLAIRKALESFPEIIDRINCDDFMHAYSLVTSRAWRSQKDVSLVIADRDYAPG

Query:  EHVL
        E  L
Subjt:  EHVL

Q5ZML9 Actin-histidine N-methyltransferase (e value: 3.9e-07; % identity: 23.47)
Query:  RDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKALESFPE-----IIDRIN
        +D +   +GN+  LA  +L E +  P S W PYI  LP   E    +++ E E++ +R +    +  +Q     R++    K +++ P      + D   
Subjt:  RDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKALESFPE-----IIDRIN

Query:  CDDFMHAYSLVTSR--------------------------------AWRSQKDVSLVIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQ
         DD+  A S V +R                                 +  + D    +A +D+  GE + I YG  SNA  ++  GF    N HD+
Subjt:  CDDFMHAYSLVTSR--------------------------------AWRSQKDVSLVIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQ

Q7SXS7 Actin-histidine N-methyltransferase (e value: 6.0e-08; % identity: 25)
Query:  PLPIRDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKALESFPE-----II
        PL  +D +   +GNV  LA+ +L E +  P S W PYI  LP   E    +++ E E+  +  +   ++ L+Q     R++    K + + P      + 
Subjt:  PLPIRDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKALESFPE-----II

Query:  DRINCDDFMHAYSLVTSR--------------------------------AWRSQKDVSLVIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQ
        D    DD+  A S V +R                                 +  + D    +A +DY  GE + I YG  SNA  ++  GF    N HD+
Subjt:  DRINCDDFMHAYSLVTSR--------------------------------AWRSQKDVSLVIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQ

Arabidopsis top hits (e value, % identity, alignment)
AT3G07670.1 Rubisco methyltransferase family protein (e value: 3.0e-10; % identity: 25.27)
Query:  LSIGKSSIG-RFLFASETIRAGDCILKQISPDILPLPIRDLLGNEIGNVAK---------LAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESE
        ++I +  IG R L AS+ +R G+ +L  + P ++     +    E G V K         LA  ++ E  L   S W  YI+ LP+  + ++ ++W  +E
Subjt:  LSIGKSSIG-RFLFASETIRAGDCILKQISPDILPLPIRDLLGNEIGNVAK---------LAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESE

Query:  LEM-IRKSSLYEESLNQRSQIEREFLAIRKALES-FPEIIDR--INCDDFMHAYSLVTSR--------------AW-----------------RSQKDVS
        L+M +  S + E ++ + + +   +  +R  + S  P++  +   N + F  ++ ++ SR               W                 +S K V 
Subjt:  LEM-IRKSSLYEESLNQRSQIEREFLAIRKALES-FPEIIDR--INCDDFMHAYSLVTSR--------------AW-----------------RSQKDVS

Query:  LVIADRDYAPGEHVLIRYGKYSNATLMLDFGF-----ALPYNIHDQALLSYRNYPCYIDYTETESSFEFKDFALS---CFP
        +   DR Y PGE V I YG  SN  L+L +GF       P +  + AL   +N  CY      E     K   LS   CFP
Subjt:  LVIADRDYAPGEHVLIRYGKYSNATLMLDFGF-----ALPYNIHDQALLSYRNYPCYIDYTETESSFEFKDFALS---CFP

AT3G55080.1 SET domain-containing protein (e value: 2.3e-95; % identity: 42.76)
Query:  SNDDTFLPWLERKAKMKISSVLSIGKSSIGRFLFASETIRAGDCILK-----QISPDILPLPIRDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRL
        S D+ FLPWLER A  KI++ LSIGKS+ GR LFAS+ I AGDC+LK     QI+PD LP  IR LL NE+GN+  LA V++ E+K+G  S W PYI+RL
Subjt:  SNDDTFLPWLERKAKMKISSVLSIGKSSIGRFLFASETIRAGDCILK-----QISPDILPLPIRDLLGNEIGNVAKLAVVILLEQKLGPGSEWAPYITRL

Query:  PQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKALESF-PEIIDRINCDDFMHAYSLVTSRAWRSQKDVSL-----------------
        PQP EMH++IFW E EL MIR S++++E++ Q++QIE++F  + +A +   P + +R + +DFM+AY+LV SRAW + K +SL                 
Subjt:  PQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKALESF-PEIIDRINCDDFMHAYSLVTSRAWRSQKDVSL-----------------

Query:  ----------VIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQALLSYRNYPCYIDYTETESSFEFKDFALSCFPYVYKCSQVDTERLSEKNF
                  V ADR+Y+PG+ V I+YG++SNATLMLDFGF  PYNIHD+  +        +D    +     K   L    +      ++    S   F
Subjt:  ----------VIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQALLSYRNYPCYIDYTETESSFEFKDFALSCFPYVYKCSQVDTERLSEKNF

Query:  IAWEVRSATGKGRGLPQSLRAFARILSCSNPQGLYLILYPLSFLHWQEEFDIFLVLVDCTELNELSSEAVNGDGRLARIPLKNVNKEVEAHRILLSQFKQ
           EV+SA GKG+G+PQSLRAFAR+L C  PQ                            ELN+LS EA   DGRLAR+P K+ N+E+EAH+ILLS   +
Subjt:  IAWEVRSATGKGRGLPQSLRAFARILSCSNPQGLYLILYPLSFLHWQEEFDIFLVLVDCTELNELSSEAVNGDGRLARIPLKNVNKEVEAHRILLSQFKQ

Query:  LVEEYNASIEALGPIDPLCLCKKLARRRLMAQHLLTGELRIIKSAIAWLENYCDAI
        L+E+++  I+ +   +   + ++ A RR MA+ LL GELR+++SA  WL +YC  +
Subjt:  LVEEYNASIEALGPIDPLCLCKKLARRRLMAQHLLTGELRIIKSAIAWLENYCDAI

AT3G55080.2 SET domain-containing protein (e value: 1.5e-70; % identity: 38.96)
Query:  LAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKALESF-PEIIDRINCDDFMHAYSLVTSRAWR
        LA V++ E+K+G  S W PYI+RLPQP EMH++IFW E EL MIR S++++E++ Q++QIE++F  + +A +   P + +R + +DFM+AY+LV SRAW 
Subjt:  LAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKALESF-PEIIDRINCDDFMHAYSLVTSRAWR

Query:  SQKDVSL--------------------------------VIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQALLSYRNYPCYIDYTETESSF
        + K +SL                                V ADR+Y+PG+ V I+YG++SNATLMLDFGF  PYNIHD+  +        +D    +   
Subjt:  SQKDVSL--------------------------------VIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQALLSYRNYPCYIDYTETESSF

Query:  EFKDFALSCFPYVYKCSQVDTERLSEKNFIAWEVRSATGKGRGLPQSLRAFARILSCSNPQGLYLILYPLSFLHWQEEFDIFLVLVDCTELNELSSEAVN
          K   L    +      ++    S   F   EV+SA GKG+G+PQSLRAFAR+L C  PQ                            ELN+LS EA  
Subjt:  EFKDFALSCFPYVYKCSQVDTERLSEKNFIAWEVRSATGKGRGLPQSLRAFARILSCSNPQGLYLILYPLSFLHWQEEFDIFLVLVDCTELNELSSEAVN

Query:  GDGRLARIPLKNVNKEVEAHRILLSQFKQLVEEYNASIEALGPIDPLCLCKKLARRRLMAQHLLTGELRIIKSAIAWLENYCDAI
         DGRLAR+P K+ N+E+EAH+ILLS   +L+E+++  I+ +   +   + ++ A RR MA+ LL GELR+++SA  WL +YC  +
Subjt:  GDGRLARIPLKNVNKEVEAHRILLSQFKQLVEEYNASIEALGPIDPLCLCKKLARRRLMAQHLLTGELRIIKSAIAWLENYCDAI

AT5G14260.1 Rubisco methyltransferase family protein (e value: 7.5e-06; % identity: 28.57)
Query:  FLFASETIRAGDCILKQISPDILPLPIRDLLGNE----------IGNVAKLAVVILLEQKLGPGSEWAPYITRLPQ-----PWEMHNTIFWNESELEMIR
        ++ ASE ++ GD       PD L + +  +LGNE          +  +A LA+ ++ E+K G  S W PYI  L +       +  + + W+E+EL+ + 
Subjt:  FLFASETIRAGDCILKQISPDILPLPIRDLLGNE----------IGNVAKLAVVILLEQKLGPGSEWAPYITRLPQ-----PWEMHNTIFWNESELEMIR

Query:  KSSLYEESLNQRSQIEREF
         S    E L +   I+RE+
Subjt:  KSSLYEESLNQRSQIEREF

AT5G14260.3 Rubisco methyltransferase family protein (e value: 7.5e-06; % identity: 28.57)
Query:  FLFASETIRAGDCILKQISPDILPLPIRDLLGNE----------IGNVAKLAVVILLEQKLGPGSEWAPYITRLPQ-----PWEMHNTIFWNESELEMIR
        ++ ASE ++ GD       PD L + +  +LGNE          +  +A LA+ ++ E+K G  S W PYI  L +       +  + + W+E+EL+ + 
Subjt:  FLFASETIRAGDCILKQISPDILPLPIRDLLGNE----------IGNVAKLAVVILLEQKLGPGSEWAPYITRLPQ-----PWEMHNTIFWNESELEMIR

Query:  KSSLYEESLNQRSQIEREF
         S    E L +   I+RE+
Subjt:  KSSLYEESLNQRSQIEREF


Sequences
CDS sequence
ATGCTTTTGGGAATTCGATTCAACAATATATGGCGATGGCGGGCATCTCCAGTAATTTCTACTTCTCATGCTTTCCACTTCAATGGCCACTTCTCAACCTCTCCTGCATG
TGAGGAACTACATTCCTTGACTGGGAGTAATGATGACACATTTCTACCATGGTTGGAACGGAAAGCAAAGATGAAGATCTCGTCAGTGCTTTCTATTGGGAAATCTTCCA
TTGGAAGGTTTCTGTTTGCTTCTGAGACTATACGGGCTGGAGATTGTATTTTAAAGCAAATTTCACCTGATATTCTTCCTTTACCCATTAGAGATCTTTTAGGCAACGAG
ATTGGAAATGTTGCCAAGCTCGCTGTTGTAATTCTTCTTGAACAGAAATTGGGCCCGGGTTCTGAATGGGCGCCTTACATTACCCGACTTCCTCAACCATGGGAGATGCA
TAACACAATATTTTGGAATGAAAGTGAGTTGGAGATGATTCGTAAAAGCTCTTTGTATGAGGAATCACTTAATCAAAGATCACAGATTGAAAGGGAATTTCTGGCAATCA
GGAAAGCTCTGGAAAGTTTCCCTGAAATTATTGATAGGATCAATTGCGATGATTTCATGCATGCATATTCCCTTGTTACTTCTAGAGCATGGAGAAGCCAAAAGGATGTC
TCTCTGGTCATTGCTGATCGTGATTATGCCCCTGGTGAACATGTACTCATAAGGTATGGAAAATATTCAAATGCTACGTTAATGTTGGACTTTGGGTTTGCGCTTCCATA
CAACATTCATGATCAGGCATTGTTAAGTTATCGGAATTATCCATGTTATATAGACTATACGGAGACAGAATCCAGTTTTGAGTTTAAGGATTTTGCTCTGAGTTGCTTCC
CATATGTATATAAATGCAGTCAGGTAGATACTGAGCGTCTGAGCGAGAAGAATTTTATTGCTTGGGAAGTGAGATCTGCCACTGGGAAAGGGCGAGGTCTTCCCCAATCA
CTTCGTGCATTTGCTCGTATTTTATCTTGCAGTAATCCTCAGGGTTTGTATCTCATTCTTTATCCGCTGTCATTTCTGCATTGGCAGGAAGAGTTTGATATCTTCTTGGT
CCTTGTTGATTGTACAGAGTTAAATGAATTAAGTTCTGAAGCTGTTAATGGTGATGGTCGGTTGGCTCGAATTCCACTGAAGAATGTCAATAAAGAGGTTGAAGCACATC
GGATTTTGCTTTCTCAATTTAAACAATTAGTTGAAGAGTATAATGCATCTATTGAGGCACTAGGGCCTATTGATCCTCTCTGTTTGTGCAAAAAGTTGGCACGGCGAAGG
CTGATGGCCCAACATCTTCTCACCGGTGAGCTTCGCATCATCAAGTCCGCTATTGCTTGGCTGGAGAACTATTGTGATGCCATTTAG
mRNA sequence
ATGCTTTTGGGAATTCGATTCAACAATATATGGCGATGGCGGGCATCTCCAGTAATTTCTACTTCTCATGCTTTCCACTTCAATGGCCACTTCTCAACCTCTCCTGCATG
TGAGGAACTACATTCCTTGACTGGGAGTAATGATGACACATTTCTACCATGGTTGGAACGGAAAGCAAAGATGAAGATCTCGTCAGTGCTTTCTATTGGGAAATCTTCCA
TTGGAAGGTTTCTGTTTGCTTCTGAGACTATACGGGCTGGAGATTGTATTTTAAAGCAAATTTCACCTGATATTCTTCCTTTACCCATTAGAGATCTTTTAGGCAACGAG
ATTGGAAATGTTGCCAAGCTCGCTGTTGTAATTCTTCTTGAACAGAAATTGGGCCCGGGTTCTGAATGGGCGCCTTACATTACCCGACTTCCTCAACCATGGGAGATGCA
TAACACAATATTTTGGAATGAAAGTGAGTTGGAGATGATTCGTAAAAGCTCTTTGTATGAGGAATCACTTAATCAAAGATCACAGATTGAAAGGGAATTTCTGGCAATCA
GGAAAGCTCTGGAAAGTTTCCCTGAAATTATTGATAGGATCAATTGCGATGATTTCATGCATGCATATTCCCTTGTTACTTCTAGAGCATGGAGAAGCCAAAAGGATGTC
TCTCTGGTCATTGCTGATCGTGATTATGCCCCTGGTGAACATGTACTCATAAGGTATGGAAAATATTCAAATGCTACGTTAATGTTGGACTTTGGGTTTGCGCTTCCATA
CAACATTCATGATCAGGCATTGTTAAGTTATCGGAATTATCCATGTTATATAGACTATACGGAGACAGAATCCAGTTTTGAGTTTAAGGATTTTGCTCTGAGTTGCTTCC
CATATGTATATAAATGCAGTCAGGTAGATACTGAGCGTCTGAGCGAGAAGAATTTTATTGCTTGGGAAGTGAGATCTGCCACTGGGAAAGGGCGAGGTCTTCCCCAATCA
CTTCGTGCATTTGCTCGTATTTTATCTTGCAGTAATCCTCAGGGTTTGTATCTCATTCTTTATCCGCTGTCATTTCTGCATTGGCAGGAAGAGTTTGATATCTTCTTGGT
CCTTGTTGATTGTACAGAGTTAAATGAATTAAGTTCTGAAGCTGTTAATGGTGATGGTCGGTTGGCTCGAATTCCACTGAAGAATGTCAATAAAGAGGTTGAAGCACATC
GGATTTTGCTTTCTCAATTTAAACAATTAGTTGAAGAGTATAATGCATCTATTGAGGCACTAGGGCCTATTGATCCTCTCTGTTTGTGCAAAAAGTTGGCACGGCGAAGG
CTGATGGCCCAACATCTTCTCACCGGTGAGCTTCGCATCATCAAGTCCGCTATTGCTTGGCTGGAGAACTATTGTGATGCCATTTAG
Protein sequence
MLLGIRFNNIWRWRASPVISTSHAFHFNGHFSTSPACEELHSLTGSNDDTFLPWLERKAKMKISSVLSIGKSSIGRFLFASETIRAGDCILKQISPDILPLPIRDLLGNE
IGNVAKLAVVILLEQKLGPGSEWAPYITRLPQPWEMHNTIFWNESELEMIRKSSLYEESLNQRSQIEREFLAIRKALESFPEIIDRINCDDFMHAYSLVTSRAWRSQKDV
SLVIADRDYAPGEHVLIRYGKYSNATLMLDFGFALPYNIHDQALLSYRNYPCYIDYTETESSFEFKDFALSCFPYVYKCSQVDTERLSEKNFIAWEVRSATGKGRGLPQS
LRAFARILSCSNPQGLYLILYPLSFLHWQEEFDIFLVLVDCTELNELSSEAVNGDGRLARIPLKNVNKEVEAHRILLSQFKQLVEEYNASIEALGPIDPLCLCKKLARRR
LMAQHLLTGELRIIKSAIAWLENYCDAI