; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0023249 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0023249
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionfactor of DNA methylation 4-like
Genome locationchr7:46397813..46399846
RNA-Seq ExpressionLag0023249
SyntenyLag0023249
Gene Ontology termsGO:0031047 - gene silencing by RNA (biological process)
InterPro domainsIPR005379 - Uncharacterised domain XH


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF8396789.1 hypothetical protein HHK36_018422 [Tetracentron sinense]3.0e-7537.67Show/hide
Query:  MKHLG-----DIKRKMVQIQQDLDAKEEEFENMQSMNQSLITKLERRTCDEVEDARKELIKVFGSPSTQSFIGVKRMGGLDCKPFFTATNLKHDKEEADH
        MKH+G     ++K+KM ++ ++L  KE E E ++ +NQ+LI K ER++ DE++DARKELI      S ++ IGVKRMG LD KPF  A+  K+  EEAD 
Subjt:  MKHLG-----DIKRKMVQIQQDLDAKEEEFENMQSMNQSLITKLERRTCDEVEDARKELIKVFGSPSTQSFIGVKRMGGLDCKPFFTATNLKHDKEEADH

Query:  RAAELCSLWEEYLRDPEWRPFRIIQDIRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRKATLKEGIAH-----
        +A ELCSLWEEYLRDPEW PF+++ ++  + +EI+D+ DE LK LKN+ GDEV  AV TAL+E NEYNPSGR  + ELWNF+E RKATLKEG+A      
Subjt:  RAAELCSLWEEYLRDPEWRPFRIIQDIRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRKATLKEGIAH-----

Query:  ------------LHTEAQN--------------KDQNLK----------------------------------------------KENEKL-------QK
                    +H E Q               K+Q L+                                              +ENEKL       +K
Subjt:  ------------LHTEAQN--------------KDQNLK----------------------------------------------KENEKL-------QK

Query:  KIIELEKRLDTRQA--------LELEIERLKASLEVMKDDD----AKKEMERFQQLFEEKEEEEKCLENIIQNLTCRVRRTDDEVQDAREELV---EAFD
        ++ E  K L+ R+A        L +E E+LK  L+V         +  EM+   +  EEK ++ + +E + Q L  + R ++ E+QDAR+EL+   +  D
Subjt:  KIIELEKRLDTRQA--------LELEIERLKASLEVMKDDD----AKKEMERFQQLFEEKEEEEKCLENIIQNLTCRVRRTDDEVQDAREELV---EAFD

Query:  C-----QPIGAF------IGVKRMGELGFEPFFTAMKLKYDKEEEAHERAMELWSQW-EYFLNHDWHPFKVIKDNG------------------------
        C     Q +  F      IG+KR+GEL  +PF      K+   E+   ++ EL S W E   N DWHPFK I  +G                        
Subjt:  C-----QPIGAF------IGVKRMGELGFEPFFTAMKLKYDKEEEAHERAMELWSQW-EYFLNHDWHPFKVIKDNG------------------------

Query:  --GKAG--IIDENDERLKNLKDEYGDEVYKAVVTALMEMNEYIP--RSTGLELWNNKEGRKATLKEGVAHLLKQW-TLHKRRRT
          GK    IID+ND ++K LKD +G+EVYKAV  AL+E+NE+ P  R    ELWN KEGR+A+LKE +  +LKQ+  L   +RT
Subjt:  --GKAG--IIDENDERLKNLKDEYGDEVYKAVVTALMEMNEYIP--RSTGLELWNNKEGRKATLKEGVAHLLKQW-TLHKRRRT

KAG7019188.1 Factor of DNA methylation 4, partial [Cucurbita argyrosperma subsp. argyrosperma]1.3e-7351.23Show/hide
Query:  TATNLKHDKEEADHRAAELCSLWEEYLRDPEWRPFRIIQDIRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRK
        T  +L +  E+ D    E   ++ E +R       R+ QD R   + I+ E++++   LK+    +  +     L++    N + R  + +     E  +
Subjt:  TATNLKHDKEEADHRAAELCSLWEEYLRDPEWRPFRIIQDIRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRK

Query:  ATLKEGIAHLHTEAQNKDQNLKKENEKLQKKIIELEKRLDTRQALELEIERLKASLEVMK------DDDAKKEMERFQQLFEEKEEEEKCLENIIQNLTC
        ATL++  A         +Q  +KE EKL KKIIELE++LD RQALELEIERLK SLEVMK      DDDAKK+M++ QQ   EKEEE +  +NI QNL  
Subjt:  ATLKEGIAHLHTEAQNKDQNLKKENEKLQKKIIELEKRLDTRQALELEIERLKASLEVMK------DDDAKKEMERFQQLFEEKEEEEKCLENIIQNLTC

Query:  RVRRTDDEVQDAREELVEAFDCQPIGAFIGVKRMGELGFEPFFTAMKLKYDKEEEAHERAMELWSQWEYFL-NHDWHPFKVIKDNGGKA-GIIDENDERL
        + RRT+DEVQDAR+EL+  +      AFIGVKRMG+L  +PF TA+KLKY K EEA E+A+EL S+WE  L +  WHPF++I+D+GG+A  IIDENDE L
Subjt:  RVRRTDDEVQDAREELVEAFDCQPIGAFIGVKRMGELGFEPFFTAMKLKYDKEEEAHERAMELWSQWEYFL-NHDWHPFKVIKDNGGKA-GIIDENDERL

Query:  KNLKDEYGDEVYKAVVTALMEMNEYIP--RSTGLELWNNKEGRKATLKEGVAHLLKQWTLHKRRR
        KNL++EYGDEVYKAVVTALMEMNEY P  R T LELWN KEGRKATLKEG AH+LKQW LHKRR+
Subjt:  KNLKDEYGDEVYKAVVTALMEMNEYIP--RSTGLELWNNKEGRKATLKEGVAHLLKQWTLHKRRR

KAG8368247.1 hypothetical protein BUALT_Bualt15G0025500 [Buddleja alternifolia]2.7e-8438.19Show/hide
Query:  MKHLG------DIKRKMVQIQQDLDAKEEEFENMQSMNQSLITKLERRTCDEVEDARKELIKVFGSPSTQSFIGVKRMGGLDCKPFFTATNLKHDKEEAD
        +KH+G      ++K+K+  I+ +L+ KEEE E + ++NQ+L+ K ERR+ DE++DARKELI      S+++FIGVKRMG LD K F TA   K+ +EEAD
Subjt:  MKHLG------DIKRKMVQIQQDLDAKEEEFENMQSMNQSLITKLERRTCDEVEDARKELIKVFGSPSTQSFIGVKRMGGLDCKPFFTATNLKHDKEEAD

Query:  HRAAELCSLWEEYLRDPEWRPFRIIQDIRG-RAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRKATLKEGIAHL--
         +A ELC+ W+ ++RDP W PF+I+    G R K II+E DE +K L+N+ G E +KAV TALMEINEYNPSGR  + ELWN ++ R+ATLKEGI+HL  
Subjt:  HRAAELCSLWEEYLRDPEWRPFRIIQDIRG-RAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRKATLKEGIAHL--

Query:  -----------------------------------------------------------------------------------------------HTEAQ
                                                                                                         EA 
Subjt:  -----------------------------------------------------------------------------------------------HTEAQ

Query:  NKDQNL------------------------------KKENEKLQKKIIELEKRLDTR--QALELEIERLKASLEVMKDDDAKKEMERFQQLFEEKEEEEK
        N+ +NL                              K+E E LQ K+I+LEK+LD +  QALELEI RLK  L+V  +++  K++   +Q  EEKEEE +
Subjt:  NKDQNL------------------------------KKENEKLQKKIIELEKRLDTR--QALELEIERLKASLEVMKDDDAKKEMERFQQLFEEKEEEEK

Query:  CLENIIQNLTCRVRRTDDEVQDAREELVEAFDCQPIGAFIGVKRMGELGFEPFFTAMKLKYDKEEEAHERAMELWSQWE-YFLNHDWHPFKVIKDNGGKA
         LE + Q L  +  R +DE+  AR+ELV     Q    FIGVKRM EL  +PF  A K KY  E++   +A+EL ++W+ +  +  W+PFK++     K 
Subjt:  CLENIIQNLTCRVRRTDDEVQDAREELVEAFDCQPIGAFIGVKRMGELGFEPFFTAMKLKYDKEEEAHERAMELWSQWE-YFLNHDWHPFKVIKDNGGKA

Query:  G--IIDENDERLKNLKDEYGDEVYKAVVTALMEMNEYIPRSTGL-ELWNNKEGRKATLKEGVAHLLKQWTLHKRRR
           I+DE DE+LK L++E G+E Y+AV TAL E+N+Y P   G+ ELWN K+ R+ATL EG++HL+ QW+L KR+R
Subjt:  G--IIDENDERLKNLKDEYGDEVYKAVVTALMEMNEYIPRSTGL-ELWNNKEGRKATLKEGVAHLLKQWTLHKRRR

XP_021828671.1 factor of DNA methylation 4-like [Prunus avium]5.8e-9541.7Show/hide
Query:  DIKRKMVQIQQDLDAKEEEFENMQSMNQSLITKLERRTCDEVEDARKELIKVFGSPSTQSFIGVKRMGGLDCKPFFTATNLKHDKEEADHRAAELCSLWE
        + ++KM +I++ L  KEEE+ +++ + ++LI  +ERR  DEV++ARK +I      S+++ IGVK MG LD KPF TAT  ++ +EEAD +A ELCSLW+
Subjt:  DIKRKMVQIQQDLDAKEEEFENMQSMNQSLITKLERRTCDEVEDARKELIKVFGSPSTQSFIGVKRMGGLDCKPFFTATNLKHDKEEADHRAAELCSLWE

Query:  EYLRDPEWRPFRIIQDIRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRKATLKEGIAHL--------------
        E+LRDP W PFRII D  G+ KEII+E D+ LK LKN+ GDEVY+ V TA+ME+NEYN SGR T+ ELWNF+EGRKA L+EG+  L              
Subjt:  EYLRDPEWRPFRIIQDIRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRKATLKEGIAHL--------------

Query:  -------------------------------------------------HTEAQNKD--------------------------------------QNLKK
                                                         H+EAQ K+                                      + LK+
Subjt:  -------------------------------------------------HTEAQNKD--------------------------------------QNLKK

Query:  ENEKLQKKIIELEKRLDTRQALELEIERLKASLEVMK------DDDAKKEMERFQQLFEEKEEEEKCLENIIQNLTCRVRRTDDEVQDAREELVEAFDCQ
          EKL K+IIELEK+LD +Q LELEIER++ +L+VMK      D +AKK+M+  Q+  +EK+EE   +E +   L  + RR++DEV++AR+EL+      
Subjt:  ENEKLQKKIIELEKRLDTRQALELEIERLKASLEVMK------DDDAKKEMERFQQLFEEKEEEEKCLENIIQNLTCRVRRTDDEVQDAREELVEAFDCQ

Query:  PIGAFIGVKRMGELGFEPFFTAMKLKYDKEEEAHERAMELWSQW-EYFLNHDWHPFKVIKD-NGGKAGIIDENDERLKNLKDEYGDEVYKAVVTALMEMN
           A IGVK MG L  + F TA K KY  EEEA  +A+EL S W E+  +  WHPF++I D  G    II+E D++LK LK+E GDEVY+ V TA+ME+N
Subjt:  PIGAFIGVKRMGELGFEPFFTAMKLKYDKEEEAHERAMELWSQW-EYFLNHDWHPFKVIKD-NGGKAGIIDENDERLKNLKDEYGDEVYKAVVTALMEMN

Query:  EY--IPRSTGLELWNNKEGRKATLKEGVAHLLKQWTLHKRRR
        EY    R T  ELWN +EGRKA+++EGV  LL +W L ++R+
Subjt:  EY--IPRSTGLELWNNKEGRKATLKEGVAHLLKQWTLHKRRR

XP_042485078.1 factor of DNA methylation 4-like [Macadamia integrifolia]1.8e-8344.97Show/hide
Query:  MKHLGD-----IKRKMVQIQQDLDAKEEEFENMQSMNQSLITKLERRTCDEVEDARKELIKVFGSPSTQSFIGVKRMGGLDCKPFFTATNLKHDKEEADH
        MKH  D     +++ +  ++++LD KE + E+++S+NQ LI K ER + DE++DAR++LI      S  + IGVKRMG LD KPF  A   K+  EEA+ 
Subjt:  MKHLGD-----IKRKMVQIQQDLDAKEEEFENMQSMNQSLITKLERRTCDEVEDARKELIKVFGSPSTQSFIGVKRMGGLDCKPFFTATNLKHDKEEADH

Query:  RAAELCSLWEEYLRDPEWRPFRIIQDIRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRKATLKEGIAHLHTEA
        +AA++CSLW+E+LR P W PF+II  + G+ +EII+E DE LK+LK+D G+EVY+AV  AL EIN+YNPSGR  + ELWNF+E RKATL E +A      
Subjt:  RAAELCSLWEEYLRDPEWRPFRIIQDIRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRKATLKEGIAHLHTEA

Query:  QNKDQNLKKENEKLQKKIIELEKRLDTRQALELEIERLKASLEVMK------DDDAKKEMERFQQLFEEKEEEEKCLENIIQNLTCRVRRTDDEVQDARE
                     + K++I+  K          EI+ LK  LEV K      D   +K++E  ++  ++K+ E K L+++ Q L  + R ++DE+QDAR 
Subjt:  QNKDQNLKKENEKLQKKIIELEKRLDTRQALELEIERLKASLEVMK------DDDAKKEMERFQQLFEEKEEEEKCLENIIQNLTCRVRRTDDEVQDARE

Query:  ELVEAFDCQPIGAFIGVKRMGELGFEPFFTAMKLKYDKEEEAHERAMELWSQW-EYFLNHDWHPFKVIKDNGGKAGIIDENDERLKNLKDEYGDEVYKAV
        EL+         A IGVKRMG L  + F  A + KY  EEE  E+A  + S W E+  N  WHPFK+I  +G    IIDE DE+LK LK++ G+EVY+AV
Subjt:  ELVEAFDCQPIGAFIGVKRMGELGFEPFFTAMKLKYDKEEEAHERAMELWSQW-EYFLNHDWHPFKVIKDNGGKAGIIDENDERLKNLKDEYGDEVYKAV

Query:  VTALMEMNEYIP--RSTGLELWNNKEGRKATLKEGVAHLLKQWTLHK
         TAL E+N+Y P  R    ELWN KE RKATLKEGVA++LKQ  ++K
Subjt:  VTALMEMNEYIP--RSTGLELWNNKEGRKATLKEGVAHLLKQWTLHK

TrEMBL top hitse value%identityAlignment
A0A6J1HIA9 protein INVOLVED IN DE NOVO 2-like isoform X11.4e-7350.82Show/hide
Query:  TATNLKHDKEEADHRAAELCSLWEEYLRDPEWRPFRIIQDIRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRK
        T  +L +  E+ D    E   ++ E +R       R+ QD R   + I+ E++++   LK+    +  +     L++    N + R  + +     E  +
Subjt:  TATNLKHDKEEADHRAAELCSLWEEYLRDPEWRPFRIIQDIRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRK

Query:  ATLKEGIAHLHTEAQNKDQNLKKENEKLQKKIIELEKRLDTRQALELEIERLKASLEVMK------DDDAKKEMERFQQLFEEKEEEEKCLENIIQNLTC
        ATL++  A         +Q  +KE EKL KKIIELE++LD RQALELEIERLK SLEVMK      DDD KK+M++ QQ   EKEEE +  +NI QNL  
Subjt:  ATLKEGIAHLHTEAQNKDQNLKKENEKLQKKIIELEKRLDTRQALELEIERLKASLEVMK------DDDAKKEMERFQQLFEEKEEEEKCLENIIQNLTC

Query:  RVRRTDDEVQDAREELVEAFDCQPIGAFIGVKRMGELGFEPFFTAMKLKYDKEEEAHERAMELWSQWEYFL-NHDWHPFKVIKDNGGKA-GIIDENDERL
        + RRT+DEVQDAR+EL+  +      AFIGVKRMG+L  +PF TA+KLKY K EEA E+A+EL S+WE  L +  WHPF++I+D+GG+A  IIDENDE L
Subjt:  RVRRTDDEVQDAREELVEAFDCQPIGAFIGVKRMGELGFEPFFTAMKLKYDKEEEAHERAMELWSQWEYFL-NHDWHPFKVIKDNGGKA-GIIDENDERL

Query:  KNLKDEYGDEVYKAVVTALMEMNEYIP--RSTGLELWNNKEGRKATLKEGVAHLLKQWTLHKRRRT
        KNL++EYGDEVYKAVVTALMEMNEY P  R T LELWN KEGRKATLKEG AH+LKQW LHKRR++
Subjt:  KNLKDEYGDEVYKAVVTALMEMNEYIP--RSTGLELWNNKEGRKATLKEGVAHLLKQWTLHKRRRT

A0A6J1HLH1 protein INVOLVED IN DE NOVO 2-like isoform X21.4e-7350.82Show/hide
Query:  TATNLKHDKEEADHRAAELCSLWEEYLRDPEWRPFRIIQDIRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRK
        T  +L +  E+ D    E   ++ E +R       R+ QD R   + I+ E++++   LK+    +  +     L++    N + R  + +     E  +
Subjt:  TATNLKHDKEEADHRAAELCSLWEEYLRDPEWRPFRIIQDIRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRK

Query:  ATLKEGIAHLHTEAQNKDQNLKKENEKLQKKIIELEKRLDTRQALELEIERLKASLEVMK------DDDAKKEMERFQQLFEEKEEEEKCLENIIQNLTC
        ATL++  A         +Q  +KE EKL KKIIELE++LD RQALELEIERLK SLEVMK      DDD KK+M++ QQ   EKEEE +  +NI QNL  
Subjt:  ATLKEGIAHLHTEAQNKDQNLKKENEKLQKKIIELEKRLDTRQALELEIERLKASLEVMK------DDDAKKEMERFQQLFEEKEEEEKCLENIIQNLTC

Query:  RVRRTDDEVQDAREELVEAFDCQPIGAFIGVKRMGELGFEPFFTAMKLKYDKEEEAHERAMELWSQWEYFL-NHDWHPFKVIKDNGGKA-GIIDENDERL
        + RRT+DEVQDAR+EL+  +      AFIGVKRMG+L  +PF TA+KLKY K EEA E+A+EL S+WE  L +  WHPF++I+D+GG+A  IIDENDE L
Subjt:  RVRRTDDEVQDAREELVEAFDCQPIGAFIGVKRMGELGFEPFFTAMKLKYDKEEEAHERAMELWSQWEYFL-NHDWHPFKVIKDNGGKA-GIIDENDERL

Query:  KNLKDEYGDEVYKAVVTALMEMNEYIP--RSTGLELWNNKEGRKATLKEGVAHLLKQWTLHKRRRT
        KNL++EYGDEVYKAVVTALMEMNEY P  R T LELWN KEGRKATLKEG AH+LKQW LHKRR++
Subjt:  KNLKDEYGDEVYKAVVTALMEMNEYIP--RSTGLELWNNKEGRKATLKEGVAHLLKQWTLHKRRRT

A0A6J1I318 protein INVOLVED IN DE NOVO 2-like isoform X24.0e-7354.43Show/hide
Query:  QDIRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRKATLKEGIAHLHTEAQNKDQNLKKENEKLQKKIIELEKR
        QD R   + I+ E++++   LK+    +  +     L++    N + R  + +     E  +ATL++  A         +Q  +KE EKL KKIIELE++
Subjt:  QDIRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRKATLKEGIAHLHTEAQNKDQNLKKENEKLQKKIIELEKR

Query:  LDTRQALELEIERLKASLEVMK------DDDAKKEMERFQQLFEEKEEEEKCLENIIQNLTCRVRRTDDEVQDAREELVEAFDCQPIGAFIGVKRMGELG
        LD RQALELEIERLK SLEV+K      DDDAKK+M++ QQ   EKEEE +  +NI QNL  + RRT+DEVQDAR+EL+  +      AFIGVKRMG+L 
Subjt:  LDTRQALELEIERLKASLEVMK------DDDAKKEMERFQQLFEEKEEEEKCLENIIQNLTCRVRRTDDEVQDAREELVEAFDCQPIGAFIGVKRMGELG

Query:  FEPFFTAMKLKYDKEEEAHERAMELWSQWEYFL-NHDWHPFKVIKDNGGKA-GIIDENDERLKNLKDEYGDEVYKAVVTALMEMNEYIP--RSTGLELWN
         +PF TA+KLKY K EEA E+A+EL S+WE  L +  WHPF++I+D+GG+A  IIDENDE LKNL++EYGDEVYKAVVTALMEMNEY P  R T LELWN
Subjt:  FEPFFTAMKLKYDKEEEAHERAMELWSQWEYFL-NHDWHPFKVIKDNGGKA-GIIDENDERLKNLKDEYGDEVYKAVVTALMEMNEYIP--RSTGLELWN

Query:  NKEGRKATLKEGVAHLLKQWTLHKRRR
         KEGRKATLKEG AH+LKQW LHKRR+
Subjt:  NKEGRKATLKEGVAHLLKQWTLHKRRR

A0A6J1I5Q0 protein INVOLVED IN DE NOVO 2-like isoform X14.0e-7354.43Show/hide
Query:  QDIRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRKATLKEGIAHLHTEAQNKDQNLKKENEKLQKKIIELEKR
        QD R   + I+ E++++   LK+    +  +     L++    N + R  + +     E  +ATL++  A         +Q  +KE EKL KKIIELE++
Subjt:  QDIRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRKATLKEGIAHLHTEAQNKDQNLKKENEKLQKKIIELEKR

Query:  LDTRQALELEIERLKASLEVMK------DDDAKKEMERFQQLFEEKEEEEKCLENIIQNLTCRVRRTDDEVQDAREELVEAFDCQPIGAFIGVKRMGELG
        LD RQALELEIERLK SLEV+K      DDDAKK+M++ QQ   EKEEE +  +NI QNL  + RRT+DEVQDAR+EL+  +      AFIGVKRMG+L 
Subjt:  LDTRQALELEIERLKASLEVMK------DDDAKKEMERFQQLFEEKEEEEKCLENIIQNLTCRVRRTDDEVQDAREELVEAFDCQPIGAFIGVKRMGELG

Query:  FEPFFTAMKLKYDKEEEAHERAMELWSQWEYFL-NHDWHPFKVIKDNGGKA-GIIDENDERLKNLKDEYGDEVYKAVVTALMEMNEYIP--RSTGLELWN
         +PF TA+KLKY K EEA E+A+EL S+WE  L +  WHPF++I+D+GG+A  IIDENDE LKNL++EYGDEVYKAVVTALMEMNEY P  R T LELWN
Subjt:  FEPFFTAMKLKYDKEEEAHERAMELWSQWEYFL-NHDWHPFKVIKDNGGKA-GIIDENDERLKNLKDEYGDEVYKAVVTALMEMNEYIP--RSTGLELWN

Query:  NKEGRKATLKEGVAHLLKQWTLHKRRR
         KEGRKATLKEG AH+LKQW LHKRR+
Subjt:  NKEGRKATLKEGVAHLLKQWTLHKRRR

A0A6P5TNU0 factor of DNA methylation 4-like2.8e-9541.7Show/hide
Query:  DIKRKMVQIQQDLDAKEEEFENMQSMNQSLITKLERRTCDEVEDARKELIKVFGSPSTQSFIGVKRMGGLDCKPFFTATNLKHDKEEADHRAAELCSLWE
        + ++KM +I++ L  KEEE+ +++ + ++LI  +ERR  DEV++ARK +I      S+++ IGVK MG LD KPF TAT  ++ +EEAD +A ELCSLW+
Subjt:  DIKRKMVQIQQDLDAKEEEFENMQSMNQSLITKLERRTCDEVEDARKELIKVFGSPSTQSFIGVKRMGGLDCKPFFTATNLKHDKEEADHRAAELCSLWE

Query:  EYLRDPEWRPFRIIQDIRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRKATLKEGIAHL--------------
        E+LRDP W PFRII D  G+ KEII+E D+ LK LKN+ GDEVY+ V TA+ME+NEYN SGR T+ ELWNF+EGRKA L+EG+  L              
Subjt:  EYLRDPEWRPFRIIQDIRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRKATLKEGIAHL--------------

Query:  -------------------------------------------------HTEAQNKD--------------------------------------QNLKK
                                                         H+EAQ K+                                      + LK+
Subjt:  -------------------------------------------------HTEAQNKD--------------------------------------QNLKK

Query:  ENEKLQKKIIELEKRLDTRQALELEIERLKASLEVMK------DDDAKKEMERFQQLFEEKEEEEKCLENIIQNLTCRVRRTDDEVQDAREELVEAFDCQ
          EKL K+IIELEK+LD +Q LELEIER++ +L+VMK      D +AKK+M+  Q+  +EK+EE   +E +   L  + RR++DEV++AR+EL+      
Subjt:  ENEKLQKKIIELEKRLDTRQALELEIERLKASLEVMK------DDDAKKEMERFQQLFEEKEEEEKCLENIIQNLTCRVRRTDDEVQDAREELVEAFDCQ

Query:  PIGAFIGVKRMGELGFEPFFTAMKLKYDKEEEAHERAMELWSQW-EYFLNHDWHPFKVIKD-NGGKAGIIDENDERLKNLKDEYGDEVYKAVVTALMEMN
           A IGVK MG L  + F TA K KY  EEEA  +A+EL S W E+  +  WHPF++I D  G    II+E D++LK LK+E GDEVY+ V TA+ME+N
Subjt:  PIGAFIGVKRMGELGFEPFFTAMKLKYDKEEEAHERAMELWSQW-EYFLNHDWHPFKVIKD-NGGKAGIIDENDERLKNLKDEYGDEVYKAVVTALMEMN

Query:  EY--IPRSTGLELWNNKEGRKATLKEGVAHLLKQWTLHKRRR
        EY    R T  ELWN +EGRKA+++EGV  LL +W L ++R+
Subjt:  EY--IPRSTGLELWNNKEGRKATLKEGVAHLLKQWTLHKRRR

SwissProt top hitse value%identityAlignment
F4JH53 Factor of DNA methylation 21.3e-4448.28Show/hide
Query:  MKHLGD-----IKRKMVQIQQDLDAKEEEFENMQSMNQSLITKLERRTCDEVEDARKELIK-VFGSPSTQSFIGVKRMGGLDCKPFFTATNLKHDKEEAD
        MKHLGD     ++ KM ++  +LD K+ E E+++SMN  L+TK ER++ DE++ AR+++I  + G    +S IGVKRMG LD KPF     L++   EA 
Subjt:  MKHLGD-----IKRKMVQIQQDLDAKEEEFENMQSMNQSLITKLERRTCDEVEDARKELIK-VFGSPSTQSFIGVKRMGGLDCKPFFTATNLKHDKEEAD

Query:  HRAAELCSLWEEYLRDPEWRPFRIIQDIRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRKATLKEGIAHLHTE
          AA LCS W+E L++P W+PF+  +     A+E++DE+DE LK LK +WG EV+ AV  AL+E+NEYN SGR    ELWNF+EGRKATLKE I  + T+
Subjt:  HRAAELCSLWEEYLRDPEWRPFRIIQDIRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRKATLKEGIAHLHTE

Query:  AQN
         +N
Subjt:  AQN

Q8VZ79 Protein INVOLVED IN DE NOVO 22.0e-4235.92Show/hide
Query:  NLKHDKEEADHRAAELCSLWEEYLRDPE--WRPFRIIQD-IRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRK
        ++K  +E    ++ EL  L EE  ++ +  +R    IQ+      ++I+D+++++ + L+++      K    A  E+  +N + R+ + E    +  + 
Subjt:  NLKHDKEEADHRAAELCSLWEEYLRDPE--WRPFRIIQD-IRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRK

Query:  ATLKEGIAHLHTEAQNKDQNLKK-------ENEKLQKKIIELEKRLDTRQALELEIERLKASLEVMK----DDDAK--KEMERFQQLFEEKEEEEKCLEN
        A+    +     E Q  D+ +KK       + E+L +KII LE++ D +QA+ELE+E+LK  L VMK    D DA+  KE++   +   EKE +   L+ 
Subjt:  ATLKEGIAHLHTEAQNKDQNLKK-------ENEKLQKKIIELEKRLDTRQALELEIERLKASLEVMK----DDDAK--KEMERFQQLFEEKEEEEKCLEN

Query:  IIQNLTCRVRRTDDEVQDAREELVEAFDCQPIGAFIGVKRMGELGFEPFFTAMKLKYDKEEEAHERAMELWSQWEYFL-NHDWHPFKVIK--DNGGKAGI
          Q L  R RRT+DE+Q+A +ELV     +     IGVKRMGEL  +PF  AM+ KY  +++  +RA+E+   WE++L + DWHPFK +K  +   +  +
Subjt:  IIQNLTCRVRRTDDEVQDAREELVEAFDCQPIGAFIGVKRMGELGFEPFFTAMKLKYDKEEEAHERAMELWSQWEYFL-NHDWHPFKVIK--DNGGKAGI

Query:  IDENDERLKNLKDEYGDEVYKAVVTALMEMNEYIP--RSTGLELWNNKEGRKATLKEGVAHLLKQWTLHKRRR
        ID+ DE+L+ LK + GD  Y AV  AL+E+NEY P  R    ELWN K  +KATL+EGV  LL QW   KR+R
Subjt:  IDENDERLKNLKDEYGDEVYKAVVTALMEMNEYIP--RSTGLELWNNKEGRKATLKEGVAHLLKQWTLHKRRR

Q9LHB1 Factor of DNA methylation 33.6e-3941.96Show/hide
Query:  EAQNKDQNLKKENEKLQKKIIELEKRLDTRQALELEIERLKASLEVMK------DDDAKKEMERFQQLFEEKEEEEKCLENIIQNLTCRVRRTDDEVQDA
        +AQ    + K + EKL K+I  LE++LD +Q LELE+++LK+ L VM+        +   ++E F +   E E E   L    Q+L  + R+++DE+Q+A
Subjt:  EAQNKDQNLKKENEKLQKKIIELEKRLDTRQALELEIERLKASLEVMK------DDDAKKEMERFQQLFEEKEEEEKCLENIIQNLTCRVRRTDDEVQDA

Query:  REELVEAFDCQPIGAFIGVKRMGELGFEPFFTAMKLKYDKEEEAHERAMELWSQW-EYFLNHDWHPFKVIKDNGGK--AGIIDENDERLKNLKDEYGDEV
        R  L+   + + +G  IGVKRMGEL  +PF  AM++KY  +E+  + A+E+   W EY  + DWHPFK IK    +    +IDE+DE+L+ LK+E GD+ 
Subjt:  REELVEAFDCQPIGAFIGVKRMGELGFEPFFTAMKLKYDKEEEAHERAMELWSQW-EYFLNHDWHPFKVIKDNGGK--AGIIDENDERLKNLKDEYGDEV

Query:  YKAVVTALMEMNEYIP--RSTGLELWNNKEGRKATLKEGVAHLLKQWTLHKRRRT
        Y+AV  AL+E+NEY P  R    ELWN +E RKATL+EGV  LL+QW   K  ++
Subjt:  YKAVVTALMEMNEYIP--RSTGLELWNNKEGRKATLKEGVAHLLKQWTLHKRRRT

Q9LMH6 Factor of DNA methylation 45.8e-3739.21Show/hide
Query:  NFEEGRKATLKEGIAHLHTEAQNK-DQNL-------KKENEKLQKKIIELEKRLDTRQALELEIERLKASLEVM--------KDDDAKKEMERFQQLFEE
        N  E RK   ++    + T+ QNK D+++       ++E ++L+K++ ELE+++D  QALELEIER++  L+VM        +D   K+ +E+ ++  +E
Subjt:  NFEEGRKATLKEGIAHLHTEAQNK-DQNL-------KKENEKLQKKIIELEKRLDTRQALELEIERLKASLEVM--------KDDDAKKEMERFQQLFEE

Query:  KEEEEKCLENIIQNLTCRVRRTDDEVQDAREELVEAFDCQPIGAFIGVKRMGELGFEPFFTAMKLKYDKEEEAHERAMELWSQWEYFLNHD-WHPFKVIK
        KEE+ +  E++ Q L  +   T+DE+QDAR+ L+ +       A+IGVKRMG L   PF    K KY    EA ++A EL S WE  L    WHP KV++
Subjt:  KEEEEKCLENIIQNLTCRVRRTDDEVQDAREELVEAFDCQPIGAFIGVKRMGELGFEPFFTAMKLKYDKEEEAHERAMELWSQWEYFLNHD-WHPFKVIK

Query:  DNGGKAGIIDENDERLKNLKDEYGDEVYKAVVTALMEMNEY--IPRSTGLELWNNKEGRKATLKEGVAHLLKQWTLHK
         +G     ++E DE+L+ L+ E G+EVY AV  AL E NEY    R    ELWN K+ RKA++KEGV +L+  W   K
Subjt:  DNGGKAGIIDENDERLKNLKDEYGDEVYKAVVTALMEMNEY--IPRSTGLELWNNKEGRKATLKEGVAHLLKQWTLHK

Q9S9P3 Factor of DNA methylation 13.4e-4549.5Show/hide
Query:  MKHLGD-----IKRKMVQIQQDLDAKEEEFENMQSMNQSLITKLERRTCDEVEDARKELIK-VFGSPSTQSFIGVKRMGGLDCKPFFTATNLKHDKEEAD
        MKHLGD     +++KM ++  +LD K+ E E ++SMN  L+TK ER++ DE++ ARK+LI  + G    ++ IGVKRMG LD KPF     L++   EA 
Subjt:  MKHLGD-----IKRKMVQIQQDLDAKEEEFENMQSMNQSLITKLERRTCDEVEDARKELIK-VFGSPSTQSFIGVKRMGGLDCKPFFTATNLKHDKEEAD

Query:  HRAAELCSLWEEYLRDPEWRPFRIIQDIRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRKATLKEGIAHLHTE
          AA LCS W+E L++P W+PF+  +     A+E++DE+DE LK LK +WG EV+ AV TAL+E+NEYN SGR T  ELWNF+EGRKATLKE I  +  +
Subjt:  HRAAELCSLWEEYLRDPEWRPFRIIQDIRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRKATLKEGIAHLHTE

Arabidopsis top hitse value%identityAlignment
AT1G15910.1 XH/XS domain-containing protein2.4e-4649.5Show/hide
Query:  MKHLGD-----IKRKMVQIQQDLDAKEEEFENMQSMNQSLITKLERRTCDEVEDARKELIK-VFGSPSTQSFIGVKRMGGLDCKPFFTATNLKHDKEEAD
        MKHLGD     +++KM ++  +LD K+ E E ++SMN  L+TK ER++ DE++ ARK+LI  + G    ++ IGVKRMG LD KPF     L++   EA 
Subjt:  MKHLGD-----IKRKMVQIQQDLDAKEEEFENMQSMNQSLITKLERRTCDEVEDARKELIK-VFGSPSTQSFIGVKRMGGLDCKPFFTATNLKHDKEEAD

Query:  HRAAELCSLWEEYLRDPEWRPFRIIQDIRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRKATLKEGIAHLHTE
          AA LCS W+E L++P W+PF+  +     A+E++DE+DE LK LK +WG EV+ AV TAL+E+NEYN SGR T  ELWNF+EGRKATLKE I  +  +
Subjt:  HRAAELCSLWEEYLRDPEWRPFRIIQDIRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRKATLKEGIAHLHTE

AT3G12550.1 XH/XS domain-containing protein2.6e-4041.96Show/hide
Query:  EAQNKDQNLKKENEKLQKKIIELEKRLDTRQALELEIERLKASLEVMK------DDDAKKEMERFQQLFEEKEEEEKCLENIIQNLTCRVRRTDDEVQDA
        +AQ    + K + EKL K+I  LE++LD +Q LELE+++LK+ L VM+        +   ++E F +   E E E   L    Q+L  + R+++DE+Q+A
Subjt:  EAQNKDQNLKKENEKLQKKIIELEKRLDTRQALELEIERLKASLEVMK------DDDAKKEMERFQQLFEEKEEEEKCLENIIQNLTCRVRRTDDEVQDA

Query:  REELVEAFDCQPIGAFIGVKRMGELGFEPFFTAMKLKYDKEEEAHERAMELWSQW-EYFLNHDWHPFKVIKDNGGK--AGIIDENDERLKNLKDEYGDEV
        R  L+   + + +G  IGVKRMGEL  +PF  AM++KY  +E+  + A+E+   W EY  + DWHPFK IK    +    +IDE+DE+L+ LK+E GD+ 
Subjt:  REELVEAFDCQPIGAFIGVKRMGELGFEPFFTAMKLKYDKEEEAHERAMELWSQW-EYFLNHDWHPFKVIKDNGGK--AGIIDENDERLKNLKDEYGDEV

Query:  YKAVVTALMEMNEYIP--RSTGLELWNNKEGRKATLKEGVAHLLKQWTLHKRRRT
        Y+AV  AL+E+NEY P  R    ELWN +E RKATL+EGV  LL+QW   K  ++
Subjt:  YKAVVTALMEMNEYIP--RSTGLELWNNKEGRKATLKEGVAHLLKQWTLHKRRRT

AT3G48670.1 XH/XS domain-containing protein1.4e-4335.92Show/hide
Query:  NLKHDKEEADHRAAELCSLWEEYLRDPE--WRPFRIIQD-IRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRK
        ++K  +E    ++ EL  L EE  ++ +  +R    IQ+      ++I+D+++++ + L+++      K    A  E+  +N + R+ + E    +  + 
Subjt:  NLKHDKEEADHRAAELCSLWEEYLRDPE--WRPFRIIQD-IRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRK

Query:  ATLKEGIAHLHTEAQNKDQNLKK-------ENEKLQKKIIELEKRLDTRQALELEIERLKASLEVMK----DDDAK--KEMERFQQLFEEKEEEEKCLEN
        A+    +     E Q  D+ +KK       + E+L +KII LE++ D +QA+ELE+E+LK  L VMK    D DA+  KE++   +   EKE +   L+ 
Subjt:  ATLKEGIAHLHTEAQNKDQNLKK-------ENEKLQKKIIELEKRLDTRQALELEIERLKASLEVMK----DDDAK--KEMERFQQLFEEKEEEEKCLEN

Query:  IIQNLTCRVRRTDDEVQDAREELVEAFDCQPIGAFIGVKRMGELGFEPFFTAMKLKYDKEEEAHERAMELWSQWEYFL-NHDWHPFKVIK--DNGGKAGI
          Q L  R RRT+DE+Q+A +ELV     +     IGVKRMGEL  +PF  AM+ KY  +++  +RA+E+   WE++L + DWHPFK +K  +   +  +
Subjt:  IIQNLTCRVRRTDDEVQDAREELVEAFDCQPIGAFIGVKRMGELGFEPFFTAMKLKYDKEEEAHERAMELWSQWEYFL-NHDWHPFKVIK--DNGGKAGI

Query:  IDENDERLKNLKDEYGDEVYKAVVTALMEMNEYIP--RSTGLELWNNKEGRKATLKEGVAHLLKQWTLHKRRR
        ID+ DE+L+ LK + GD  Y AV  AL+E+NEY P  R    ELWN K  +KATL+EGV  LL QW   KR+R
Subjt:  IDENDERLKNLKDEYGDEVYKAVVTALMEMNEYIP--RSTGLELWNNKEGRKATLKEGVAHLLKQWTLHKRRR

AT3G48670.2 XH/XS domain-containing protein1.4e-4335.92Show/hide
Query:  NLKHDKEEADHRAAELCSLWEEYLRDPE--WRPFRIIQD-IRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRK
        ++K  +E    ++ EL  L EE  ++ +  +R    IQ+      ++I+D+++++ + L+++      K    A  E+  +N + R+ + E    +  + 
Subjt:  NLKHDKEEADHRAAELCSLWEEYLRDPE--WRPFRIIQD-IRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRK

Query:  ATLKEGIAHLHTEAQNKDQNLKK-------ENEKLQKKIIELEKRLDTRQALELEIERLKASLEVMK----DDDAK--KEMERFQQLFEEKEEEEKCLEN
        A+    +     E Q  D+ +KK       + E+L +KII LE++ D +QA+ELE+E+LK  L VMK    D DA+  KE++   +   EKE +   L+ 
Subjt:  ATLKEGIAHLHTEAQNKDQNLKK-------ENEKLQKKIIELEKRLDTRQALELEIERLKASLEVMK----DDDAK--KEMERFQQLFEEKEEEEKCLEN

Query:  IIQNLTCRVRRTDDEVQDAREELVEAFDCQPIGAFIGVKRMGELGFEPFFTAMKLKYDKEEEAHERAMELWSQWEYFL-NHDWHPFKVIK--DNGGKAGI
          Q L  R RRT+DE+Q+A +ELV     +     IGVKRMGEL  +PF  AM+ KY  +++  +RA+E+   WE++L + DWHPFK +K  +   +  +
Subjt:  IIQNLTCRVRRTDDEVQDAREELVEAFDCQPIGAFIGVKRMGELGFEPFFTAMKLKYDKEEEAHERAMELWSQWEYFL-NHDWHPFKVIK--DNGGKAGI

Query:  IDENDERLKNLKDEYGDEVYKAVVTALMEMNEYIP--RSTGLELWNNKEGRKATLKEGVAHLLKQWTLHKRRR
        ID+ DE+L+ LK + GD  Y AV  AL+E+NEY P  R    ELWN K  +KATL+EGV  LL QW   KR+R
Subjt:  IDENDERLKNLKDEYGDEVYKAVVTALMEMNEYIP--RSTGLELWNNKEGRKATLKEGVAHLLKQWTLHKRRR

AT4G00380.1 XH/XS domain-containing protein9.1e-4648.28Show/hide
Query:  MKHLGD-----IKRKMVQIQQDLDAKEEEFENMQSMNQSLITKLERRTCDEVEDARKELIK-VFGSPSTQSFIGVKRMGGLDCKPFFTATNLKHDKEEAD
        MKHLGD     ++ KM ++  +LD K+ E E+++SMN  L+TK ER++ DE++ AR+++I  + G    +S IGVKRMG LD KPF     L++   EA 
Subjt:  MKHLGD-----IKRKMVQIQQDLDAKEEEFENMQSMNQSLITKLERRTCDEVEDARKELIK-VFGSPSTQSFIGVKRMGGLDCKPFFTATNLKHDKEEAD

Query:  HRAAELCSLWEEYLRDPEWRPFRIIQDIRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRKATLKEGIAHLHTE
          AA LCS W+E L++P W+PF+  +     A+E++DE+DE LK LK +WG EV+ AV  AL+E+NEYN SGR    ELWNF+EGRKATLKE I  + T+
Subjt:  HRAAELCSLWEEYLRDPEWRPFRIIQDIRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRKATLKEGIAHLHTE

Query:  AQN
         +N
Subjt:  AQN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCATTTGGGCGATATCAAGAGAAAAATGGTGCAGATTCAACAAGATTTGGACGCCAAGGAAGAAGAATTTGAAAACATGCAAAGCATGAATCAATCCCTCATCAC
CAAATTAGAGCGCAGAACCTGCGATGAAGTTGAAGACGCGCGCAAAGAATTGATCAAAGTGTTTGGCAGTCCGTCGACCCAATCCTTCATTGGCGTCAAGAGAATGGGAG
GTCTTGACTGCAAGCCATTCTTCACAGCCACAAATTTGAAGCATGACAAAGAAGAAGCAGATCACAGAGCAGCAGAGTTGTGCTCACTGTGGGAGGAGTACCTTCGTGAC
CCGGAGTGGCGCCCTTTCAGGATCATACAAGACATTCGAGGACGAGCTAAGGAAATTATTGATGAAAATGATGAGATGTTAAAAAATTTGAAGAATGATTGGGGAGATGA
AGTTTACAAGGCTGTTGCCACAGCCTTGATGGAAATAAACGAGTATAATCCAAGTGGTAGGCTTACAGTGTTGGAGCTTTGGAACTTTGAAGAAGGAAGAAAAGCGACAT
TAAAGGAAGGAATAGCTCATCTACATACTGAAGCACAAAACAAGGATCAGAATCTCAAGAAAGAGAATGAGAAGCTTCAGAAAAAGATCATAGAGCTGGAAAAGAGACTT
GATACAAGACAAGCATTAGAGTTGGAAATTGAGAGGTTGAAGGCTTCGTTAGAAGTCATGAAAGATGATGATGCCAAGAAAGAAATGGAACGGTTTCAACAACTTTTTGA
GGAGAAGGAAGAAGAAGAAAAATGCTTAGAAAACATCATTCAAAACCTTACGTGCAGAGTGCGCAGAACCGACGATGAAGTTCAAGATGCGCGTGAAGAATTGGTTGAAG
CGTTTGATTGTCAGCCGATCGGAGCCTTTATTGGTGTCAAGAGAATGGGAGAACTTGGCTTCGAACCATTCTTCACAGCCATGAAGTTGAAGTATGACAAAGAAGAAGAA
GCACATGAGAGAGCAATGGAGTTGTGGTCACAGTGGGAGTACTTTCTTAACCATGATTGGCATCCTTTCAAGGTTATAAAGGACAATGGAGGAAAAGCAGGAATTATTGA
TGAAAATGATGAGAGGTTAAAAAATTTGAAGGATGAGTATGGAGATGAAGTTTACAAGGCTGTTGTCACAGCCTTGATGGAAATGAATGAGTATATCCCAAGGTCTACAG
GATTGGAGCTGTGGAACAATAAAGAGGGAAGAAAAGCCACATTAAAGGAAGGAGTAGCTCATTTACTGAAACAATGGACACTGCACAAAAGAAGGAGAACCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGCATTTGGGCGATATCAAGAGAAAAATGGTGCAGATTCAACAAGATTTGGACGCCAAGGAAGAAGAATTTGAAAACATGCAAAGCATGAATCAATCCCTCATCAC
CAAATTAGAGCGCAGAACCTGCGATGAAGTTGAAGACGCGCGCAAAGAATTGATCAAAGTGTTTGGCAGTCCGTCGACCCAATCCTTCATTGGCGTCAAGAGAATGGGAG
GTCTTGACTGCAAGCCATTCTTCACAGCCACAAATTTGAAGCATGACAAAGAAGAAGCAGATCACAGAGCAGCAGAGTTGTGCTCACTGTGGGAGGAGTACCTTCGTGAC
CCGGAGTGGCGCCCTTTCAGGATCATACAAGACATTCGAGGACGAGCTAAGGAAATTATTGATGAAAATGATGAGATGTTAAAAAATTTGAAGAATGATTGGGGAGATGA
AGTTTACAAGGCTGTTGCCACAGCCTTGATGGAAATAAACGAGTATAATCCAAGTGGTAGGCTTACAGTGTTGGAGCTTTGGAACTTTGAAGAAGGAAGAAAAGCGACAT
TAAAGGAAGGAATAGCTCATCTACATACTGAAGCACAAAACAAGGATCAGAATCTCAAGAAAGAGAATGAGAAGCTTCAGAAAAAGATCATAGAGCTGGAAAAGAGACTT
GATACAAGACAAGCATTAGAGTTGGAAATTGAGAGGTTGAAGGCTTCGTTAGAAGTCATGAAAGATGATGATGCCAAGAAAGAAATGGAACGGTTTCAACAACTTTTTGA
GGAGAAGGAAGAAGAAGAAAAATGCTTAGAAAACATCATTCAAAACCTTACGTGCAGAGTGCGCAGAACCGACGATGAAGTTCAAGATGCGCGTGAAGAATTGGTTGAAG
CGTTTGATTGTCAGCCGATCGGAGCCTTTATTGGTGTCAAGAGAATGGGAGAACTTGGCTTCGAACCATTCTTCACAGCCATGAAGTTGAAGTATGACAAAGAAGAAGAA
GCACATGAGAGAGCAATGGAGTTGTGGTCACAGTGGGAGTACTTTCTTAACCATGATTGGCATCCTTTCAAGGTTATAAAGGACAATGGAGGAAAAGCAGGAATTATTGA
TGAAAATGATGAGAGGTTAAAAAATTTGAAGGATGAGTATGGAGATGAAGTTTACAAGGCTGTTGTCACAGCCTTGATGGAAATGAATGAGTATATCCCAAGGTCTACAG
GATTGGAGCTGTGGAACAATAAAGAGGGAAGAAAAGCCACATTAAAGGAAGGAGTAGCTCATTTACTGAAACAATGGACACTGCACAAAAGAAGGAGAACCTGA
Protein sequenceShow/hide protein sequence
MKHLGDIKRKMVQIQQDLDAKEEEFENMQSMNQSLITKLERRTCDEVEDARKELIKVFGSPSTQSFIGVKRMGGLDCKPFFTATNLKHDKEEADHRAAELCSLWEEYLRD
PEWRPFRIIQDIRGRAKEIIDENDEMLKNLKNDWGDEVYKAVATALMEINEYNPSGRLTVLELWNFEEGRKATLKEGIAHLHTEAQNKDQNLKKENEKLQKKIIELEKRL
DTRQALELEIERLKASLEVMKDDDAKKEMERFQQLFEEKEEEEKCLENIIQNLTCRVRRTDDEVQDAREELVEAFDCQPIGAFIGVKRMGELGFEPFFTAMKLKYDKEEE
AHERAMELWSQWEYFLNHDWHPFKVIKDNGGKAGIIDENDERLKNLKDEYGDEVYKAVVTALMEMNEYIPRSTGLELWNNKEGRKATLKEGVAHLLKQWTLHKRRRT