; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg005809 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg005809
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionfactor of DNA methylation 4
Genome locationscaffold11:2034341..2041791
RNA-Seq ExpressionSpg005809
SyntenySpg005809
Gene Ontology termsGO:0031047 - gene silencing by RNA (biological process)
InterPro domainsIPR005379 - Uncharacterised domain XH
IPR005380 - XS domain
IPR005381 - Zinc finger-XS domain
IPR038588 - XS domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583426.1 putative serine/threonine-protein kinase PBL10, partial [Cucurbita argyrosperma subsp. sororia]3.9e-29159.23Show/hide
Query:  YRHCSLSVDSAPHFFYVSFRAPLSHRRRSSYASLIISCERIVTTGGLGGCPMKIVKLRLASCKSILSNRSKMSVGPKKESETDSLKKYQDFYYADLKHGH
        Y HCSLSVDS+PH                                                   +L +R KMSVG KKESETDSLKKYQD YYADLK G 
Subjt:  YRHCSLSVDSAPHFFYVSFRAPLSHRRRSSYASLIISCERIVTTGGLGGCPMKIVKLRLASCKSILSNRSKMSVGPKKESETDSLKKYQDFYYADLKHGH

Query:  IRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASGGGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQPVHKEQRYDRDQPQPVRKEQRHDRHQPQPVC
        IRIK  GS YRCPFCHG+SGKED+++K+LLRHASG GR SQSW+IKERAKH ALERYMNKYF  EDQPQPV KEQR DR QPQPV KEQ +DR Q Q +C
Subjt:  IRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASGGGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQPVHKEQRYDRDQPQPVRKEQRHDRHQPQPVC

Query:  KKQRYDHDQPQPVC--------------------------------------------------------------------------------------
        K++RYD DQPQ VC                                                                                      
Subjt:  KKQRYDHDQPQPVC--------------------------------------------------------------------------------------

Query:  ----------------------------------------------KEQH----------------HDQPQPVRKEQRYDH------------------D
                                                      KEQH                HDQPQPV KEQ YDH                  D
Subjt:  ----------------------------------------------KEQH----------------HDQPQPVRKEQRYDH------------------D

Query:  QPQPISKEKRCDRDQ-QPVRKEQR--CDRDQ-------------QPVCKEQRCDRDQ-------------------QPVRKEQRYDCDQLFVWPWMAIVA
        QPQ I KE+  DRD+ Q VRKEQ   CDR Q             QPV KEQ  DRDQ                   QP+RKEQ YD DQLFVWPWMAIVA
Subjt:  QPQPISKEKRCDRDQ-QPVRKEQR--CDRDQ-------------QPVCKEQRCDRDQ-------------------QPVRKEQRYDCDQLFVWPWMAIVA

Query:  NIQTEIHAGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDHHGKKDYIVSRDRGKKLYGWVARDDDYNSK
        NIQTEIHAGR VGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSG A+VEFNKDWDGFRNA+MFENSFEVDH GKKDY VSRDRGKKLYGWVARDDDYNSK
Subjt:  NIQTEIHAGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDHHGKKDYIVSRDRGKKLYGWVARDDDYNSK

Query:  SIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDETVKTYNE--------------------SKIAF
        S+FGDYLRKNGDLKTV GKEAEDNSKAL+LVSNL NTLE KNLHLKEI HKVLETNASLNNMMEQMDET K YNE                     K+  
Subjt:  SIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDETVKTYNE--------------------SKIAF

Query:  SL----------------------------------NERATLEQRKAEDEVLRLAGEQQKEKEKLHKKIIELEQKLDARQALELEIERLKGSLEVMKHMG
         L                                  NERATLEQ+KAEDEVL LAGEQQKEKEKLHKKIIELEQKLDARQALELEIERLKGSLEVMKHMG
Subjt:  SL----------------------------------NERATLEQRKAEDEVLRLAGEQQKEKEKLHKKIIELEQKLDARQALELEIERLKGSLEVMKHMG

Query:  EDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTRAFISVKRMGDLDSKPFLAATKLKYVKEEADEKAVELC
        EDGDDDAKKKMDQIQQDL+EKE+EFEYFQNINQNLIIKERRTNDEVQDARKELI+V+GGSSTRAFI VKRMGDLDSKPF  A KLKY KEEADEKAVELC
Subjt:  EDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTRAFISVKRMGDLDSKPFLAATKLKYVKEEADEKAVELC

Query:  SQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVVTALMEMNEYNPSGRYTVLELWNFKEGRKATL
        S+WED+LRDPSWHPFRIIE+DGGQAK                 EIIDENDEMLKNL+NEYGDEVYKAVVTALMEMNEYNPSGRYTVLELWNFKEGRKATL
Subjt:  SQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVVTALMEMNEYNPSGRYTVLELWNFKEGRKATL

Query:  KEGVAHILRQWKLHKRRR
        KEG AHIL+QWKLHKRR+
Subjt:  KEGVAHILRQWKLHKRRR

KAG7019188.1 Factor of DNA methylation 4, partial [Cucurbita argyrosperma subsp. argyrosperma]3.0e-30771.07Show/hide
Query:  YRHCSLSVDSAPHFFYVSFRAPLSHRRRSSYASLIISCERIVTTGGLGGCPMKIVKLRLASCKSILS-NRSKMSVGPKKESETDSLKKYQDFYYADLKHG
        Y HCSLSVDS+PH         L HR     + +  +    V  G              +   S LS +  KMSVG KKESETDSLKKYQD YYADLK G
Subjt:  YRHCSLSVDSAPHFFYVSFRAPLSHRRRSSYASLIISCERIVTTGGLGGCPMKIVKLRLASCKSILS-NRSKMSVGPKKESETDSLKKYQDFYYADLKHG

Query:  HIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASGGGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQPVHKEQRYDRDQPQPVRKEQRHDRHQPQPV
         IRIK  GS YRCPFCHG+SGKED+++K+LLRHASG GR SQSW+IKERAKH ALERYMNKYF  EDQPQPV KEQR DR QPQPV KEQ +DR Q Q +
Subjt:  HIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASGGGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQPVHKEQRYDRDQPQPVRKEQRHDRHQPQPV

Query:  CKKQRYDHDQPQPVC--------------KEQHH--DQPQPVRKEQRYDHDQPQPISKEKRCDRDQ-QPVRKEQRCDRDQ-QPVCKEQRCDR--------
        CK++RYD DQPQ VC              KEQH+  D+PQ VRKEQ YD D+PQ + KE+  DRDQ Q V KE+  DRDQ QPV KEQ  D+        
Subjt:  CKKQRYDHDQPQPVC--------------KEQHH--DQPQPVRKEQRYDHDQPQPISKEKRCDRDQ-QPVRKEQRCDRDQ-QPVCKEQRCDR--------

Query:  --DQ-QPVRKEQRYDCDQLFVWPWMAIVANIQTEIHAGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDH
          DQ QP+RKEQ YD DQLFVWPWMAIVANIQTEIHAGR VGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSG A+VEFNKDWDGFRNA+MFENSFEVDH
Subjt:  --DQ-QPVRKEQRYDCDQLFVWPWMAIVANIQTEIHAGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDH

Query:  HGKKDYIVSRDRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDETVK
         GKKDY VSRDRGKKLYGWVARDDDYNSKS+FGDYLRKNGDLKTV GKEAEDNSKAL+LVSNL NTLE KNLHLKEI HKVLETNASLNNMMEQMDET K
Subjt:  HGKKDYIVSRDRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDETVK

Query:  TYNE--------------------SKIAFSL----------------------------------NERATLEQRKAEDEVLRLAGEQQKEKEKLHKKIIE
         YNE                     K+   L                                  NERATLEQ+KAEDEVL LAGEQQKEKEKLHKKIIE
Subjt:  TYNE--------------------SKIAFSL----------------------------------NERATLEQRKAEDEVLRLAGEQQKEKEKLHKKIIE

Query:  LEQKLDARQALELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTRAFISVKRM
        LEQKLDARQALELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDL+EKE+EFEYFQNINQNLIIKERRTNDEVQDARKELI+V+GGSSTRAFI VKRM
Subjt:  LEQKLDARQALELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTRAFISVKRM

Query:  GDLDSKPFLAATKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVVTA
        GDLDSKPF  A KLKY KEEADEKAVELCS+WED+LRDPSWHPFRIIE+DGGQAK                 EIIDENDEMLKNL+NEYGDEVYKAVVTA
Subjt:  GDLDSKPFLAATKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVVTA

Query:  LMEMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILRQWKLHKRRR
        LMEMNEYNPSGRYTVLELWNFKEGRKATLKEG AHIL+QWKLHKRR+
Subjt:  LMEMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILRQWKLHKRRR

XP_022964643.1 protein INVOLVED IN DE NOVO 2-like isoform X2 [Cucurbita moschata]3.5e-30068.11Show/hide
Query:  MSVGPKKESETDSLKKYQDFYYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASGGGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQPV
        MSVG KKESETDSLKKYQDFYYADLK G IRIK  GS YRCPFCHG+SGKED+++K+LLRHASG GR SQSW+IKERAKH ALERYMNKYF  EDQPQPV
Subjt:  MSVGPKKESETDSLKKYQDFYYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASGGGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQPV

Query:  HKEQRYDRDQPQPVRKEQRHDRHQPQPVCKKQRYDHDQPQPVCKEQHHDQPQP--------------------------------------------VRK
         KEQR D  QPQPV KEQ +DR Q Q +CK+QRYD D+PQ VCKE+ +D+ QP                                            VRK
Subjt:  HKEQRYDRDQPQPVRKEQRHDRHQPQPVCKKQRYDHDQPQPVCKEQHHDQPQP--------------------------------------------VRK

Query:  EQRYDHDQPQPISKEKRCDRDQ---------------------------------QPVRKEQRCDRDQ-------------QPVCKEQRCDRDQ------
        E+ YDHDQPQP+ KE+  DRDQ                                 Q VRKEQ  DRD+             QPV KEQ  DRDQ      
Subjt:  EQRYDHDQPQPISKEKRCDRDQ---------------------------------QPVRKEQRCDRDQ-------------QPVCKEQRCDRDQ------

Query:  -------------QPVRKEQRYDCDQLFVWPWMAIVANIQTEIHAGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMF
                     QP+RKEQ YD DQLFVWPWMAIVANIQTEIHAGR VGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSG A+VEFNKDWDGFRNA+MF
Subjt:  -------------QPVRKEQRYDCDQLFVWPWMAIVANIQTEIHAGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMF

Query:  ENSFEVDHHGKKDYIVSRDRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMM
        ENSFEVDH GKKDY VSRDRGKKL+GWVARDDDYNSKS+FGDYLRKNGDLKTVSGKEAEDNSKAL+LVSNL NTLE KNLHLKEI HKVLETNASLNNMM
Subjt:  ENSFEVDHHGKKDYIVSRDRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMM

Query:  EQMDETVKTYNE--------------------SKIAFSL----------------------------------NERATLEQRKAEDEVLRLAGEQQKEKE
        EQMDET K YNE                     K+   L                                  NERATLEQ+KAEDEVL LAGEQQKEKE
Subjt:  EQMDETVKTYNE--------------------SKIAFSL----------------------------------NERATLEQRKAEDEVLRLAGEQQKEKE

Query:  KLHKKIIELEQKLDARQALELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTR
        KLHKKIIELEQKLDARQALELEIERLKGSLEVMKHMGEDGDDD KKKMDQIQQDL+EKE+EFEYFQNINQNLIIKERRTNDEVQDARKELI+V+GGSSTR
Subjt:  KLHKKIIELEQKLDARQALELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTR

Query:  AFISVKRMGDLDSKPFLAATKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDE
        AFI VKRMGDLDSKPF  A KLKY KEEADEKAVELCS+WED+LRDPSWHPFRIIE+DGGQAK                 EIIDENDEMLKNL+NEYGDE
Subjt:  AFISVKRMGDLDSKPFLAATKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDE

Query:  VYKAVVTALMEMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILRQWKLHKRRRT
        VYKAVVTALMEMNEYNPSGRYTVLELWNFKEGRKATLKEG AHIL+QWKLHKRR++
Subjt:  VYKAVVTALMEMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILRQWKLHKRRRT

XP_022970505.1 protein INVOLVED IN DE NOVO 2-like isoform X2 [Cucurbita maxima]3.2e-29363.58Show/hide
Query:  MSVGPKKESETDSLKKYQDFYYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASGGGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQP-
        MSVG KKESETDSLKKYQDFYYADLK G IRIK  GS YRCPFCHG+SGKED+++K+LLRHASG GR SQSW+IKERAKH ALERYMNKYF  EDQPQP 
Subjt:  MSVGPKKESETDSLKKYQDFYYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASGGGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQP-

Query:  -------------------------------------------------------VHKEQRYDRDQPQPVRKEQRHDRHQPQPVCKKQRYDHDQPQPVCK
                                                               VHKEQ YDRD+PQ VRKEQ +DR +PQ VCK+  YDHDQPQPV K
Subjt:  -------------------------------------------------------VHKEQRYDRDQPQPVRKEQRHDRHQPQPVCKKQRYDHDQPQPVCK

Query:  EQHH------------------------------------------------DQPQPVRKEQRYDHDQPQPISKEKRCDRDQ------------------
        EQ++                                                D+PQ VRKE+ +DHDQPQP+ KE+  DRDQ                  
Subjt:  EQHH------------------------------------------------DQPQPVRKEQRYDHDQPQPISKEKRCDRDQ------------------

Query:  -QPVRKEQRCDRDQ-----------------------------QPVCKEQRCDRDQ-------------------QPVRKEQRYDCDQLFVWPWMAIVAN
         Q V+KEQ  DRD+                             QPV K+Q  DRDQ                   QP+RKEQ YD DQLFVWPWMAIVAN
Subjt:  -QPVRKEQRCDRDQ-----------------------------QPVCKEQRCDRDQ-------------------QPVRKEQRYDCDQLFVWPWMAIVAN

Query:  IQTEIHAGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDHHGKKDYIVSRDRGKKLYGWVARDDDYNSKS
        IQTEIHAGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSG A+VEFNKDWDGFRNA+MFENSFEVDHHGKKDY VSRDRGKKLYGWVARDDDYNSKS
Subjt:  IQTEIHAGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDHHGKKDYIVSRDRGKKLYGWVARDDDYNSKS

Query:  IFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDETVKTYNE--------------------SKIAFS
        +FGDYLRKNGDLKTV GKEAEDNSKAL+LVSNL NTLE KNLHLKEI HKVLETNASLNNMMEQMDE VK YN+                     K+   
Subjt:  IFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDETVKTYNE--------------------SKIAFS

Query:  L----------------------------------NERATLEQRKAEDEVLRLAGEQQKEKEKLHKKIIELEQKLDARQALELEIERLKGSLEVMKHMGE
        L                                  NERATLEQ+KAEDEVL LAGEQQKEKEKLHKKIIELEQKLDARQALELEIERLKGSLEV+KHMGE
Subjt:  L----------------------------------NERATLEQRKAEDEVLRLAGEQQKEKEKLHKKIIELEQKLDARQALELEIERLKGSLEVMKHMGE

Query:  DGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTRAFISVKRMGDLDSKPFLAATKLKYVKEEADEKAVELCS
        DGDDDAKKKMDQIQQ L+EKE+EFEYFQNINQNLIIKERRTNDEVQDARKELI+V+GGSSTRAFI VKRMGDLDSKPF  A KLKY KEEADEKAVELCS
Subjt:  DGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTRAFISVKRMGDLDSKPFLAATKLKYVKEEADEKAVELCS

Query:  QWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVVTALMEMNEYNPSGRYTVLELWNFKEGRKATLK
        +WED+LRDPSWHPFRIIE+DGG+AK                 EIIDENDEMLKNL+NEYGDEVYKAVVTALMEMNEYNPSGRYTVLELWNFKEGRKATLK
Subjt:  QWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVVTALMEMNEYNPSGRYTVLELWNFKEGRKATLK

Query:  EGVAHILRQWKLHKRRR
        EG AHIL+QWKLHKRR+
Subjt:  EGVAHILRQWKLHKRRR

XP_038895220.1 factor of DNA methylation 4-like [Benincasa hispida]6.4e-30273.95Show/hide
Query:  MSVGPKKESETDSLKKYQDFYYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASGGGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQPV
        MS+G KKESETDSLKKYQDFYYADLK G IRIK  GS+YRCPFCHG+SGKED ++KDLL HAS  GR SQSWS KERAKH ALERYMNKYF  EDQ Q V
Subjt:  MSVGPKKESETDSLKKYQDFYYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASGGGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQPV

Query:  HKEQRYDRDQPQPVRKEQRHDRHQPQPVCKKQRYDHDQPQPVCKEQHH--DQPQPVRKEQRYDHDQPQPIS---KEKRCDRDQ-QPVRKEQRCDRDQ---
          +QR D DQPQPV KEQRHDR+Q Q V K+QRY+ DQ QPV KEQH+  DQ QPV KEQ YD D+PQPI    KE+R DRD+ Q V KEQR DRD+   
Subjt:  HKEQRYDRDQPQPVRKEQRHDRHQPQPVCKKQRYDHDQPQPVCKEQHH--DQPQPVRKEQRYDHDQPQPIS---KEKRCDRDQ-QPVRKEQRCDRDQ---

Query:  ------------QPVCKEQRCDRDQ-------------------QPVRKEQRYDCDQLFVWPWMAIVANIQTEIHAGRHVGESGSKLRDEFMRQGFNPLK
                    QPVCKEQ  DRDQ                   QPVRKE+ YD DQ FVWPWMAIVANIQTEIHAGRHVGESGSKLRDEF+RQGFNPLK
Subjt:  ------------QPVCKEQRCDRDQ-------------------QPVRKEQRYDCDQLFVWPWMAIVANIQTEIHAGRHVGESGSKLRDEFMRQGFNPLK

Query:  VHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDHHGKKDYIVSRDRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVS
        VHPLWNRFGHSG AIVEFNKDWDGFRNAI+FENSFEV+HHGKKDY VSRDRGK+LYGWVARDDDY SK+IFGDYLRKNGDLKTVSGKEAEDNSKALKLVS
Subjt:  VHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDHHGKKDYIVSRDRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVS

Query:  NLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDETVKTYNE--------------------SKIAFSL------------------------------
        NLA+TLETKNLHLKEIAHKVLETNASLNNMMEQMDETVK YN+                     K+   L                              
Subjt:  NLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDETVKTYNE--------------------SKIAFSL------------------------------

Query:  ----NERATLEQRKAEDEVLRLAGEQQKEKEKLHKKIIELEQKLDARQALELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNIN
            NERATLEQ+KAEDEVL LAGEQQKEKEKLHKKII LEQKLDARQALELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDL+EKE+EFEYFQNIN
Subjt:  ----NERATLEQRKAEDEVLRLAGEQQKEKEKLHKKIIELEQKLDARQALELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNIN

Query:  QNLIIKERRTNDEVQDARKELINVFGGSSTRAFISVKRMGDLDSKPFLAATKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIW
        QNLIIKERRTNDEVQDARKELINV+ GSSTRAFI VKRMGDLDSKPF  ATKLKYVKEEA+EKAVELCS WEDQL DPSWHPFRIIE++ GQAK      
Subjt:  QNLIIKERRTNDEVQDARKELINVFGGSSTRAFISVKRMGDLDSKPFLAATKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIW

Query:  KGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVVTALMEMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILRQWKLHKRRRT
                   EIIDEND MLKNLKNEYGDEV++ VVTALMEMNEYNPSGRYTV+ELWNFKEGRKATLKEGVAHIL+QWKLHKRRRT
Subjt:  KGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVVTALMEMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILRQWKLHKRRRT

TrEMBL top hitse value%identityAlignment
A0A1S3C697 factor of DNA methylation 41.5e-28571.69Show/hide
Query:  MSVGPKKESETDSLKKYQDFYYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASGGGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQPV
        MSVG KKESETD LKK Q+FYY DLK G +RIK  GS+YRCPFCH KSG+ED ++K+LLRHASG GR SQ WS KE AKH ALERYMNKYF PEDQ Q  
Subjt:  MSVGPKKESETDSLKKYQDFYYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASGGGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQPV

Query:  HKEQRYDRDQPQPVRKEQRHDRHQPQPVCKKQRYDHDQPQPVCKEQHHD--QPQPVRKEQRYDHDQPQPISKEKRCDRDQ-QPVRKEQRCDRDQ-QPVCK
          +   DRD+PQ V KEQR+DR Q QPV K+ RY+HD+PQPV KE+H D  +PQP+ K+Q YD D+PQP+ K++  DRD+ Q V KEQ  DRDQ QP  K
Subjt:  HKEQRYDRDQPQPVRKEQRHDRHQPQPVCKKQRYDHDQPQPVCKEQHHD--QPQPVRKEQRYDHDQPQPISKEKRCDRDQ-QPVRKEQRCDRDQ-QPVCK

Query:  EQ-----RCDRDQQPVRKEQRYDCDQLFVWPWMAIVANIQTEIHAGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMF
         +       +   QPVR +Q ++ DQ FVWPWMAIVANIQTEIH GRHVG+SGSKLRDEF+RQGFN LKVHPLWNRFGHSG AIVEFNKDWDGFRNAI+F
Subjt:  EQ-----RCDRDQQPVRKEQRYDCDQLFVWPWMAIVANIQTEIHAGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMF

Query:  ENSFEVDHHGKKDYIVSRDRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMM
        E SFEVDHHGKKDY VSR+RGK+LYGWVARDDD+ SKS+FGDYLRKNGDLKTVSGKEAEDNSKA KLVSNLANTLETK+LHLKEI HKVLETNASLNNMM
Subjt:  ENSFEVDHHGKKDYIVSRDRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMM

Query:  EQMDETVKTYNE--------------------SKIAF----------------------------------SLNERATLEQRKAEDEVLRLAGEQQKEKE
        EQMDETVK YN+                     K+                                     +NERATLEQ+KAEDEVL LAGEQQKEKE
Subjt:  EQMDETVKTYNE--------------------SKIAF----------------------------------SLNERATLEQRKAEDEVLRLAGEQQKEKE

Query:  KLHKKIIELEQKLDARQALELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTR
        KLHKKIIELEQKLDARQALELEIERLKGSLEVMKHMGEDGDDDAKKKM Q+QQDLDEKE+E EYFQNINQ+LIIKERR+NDEVQDARKELINV+GGSSTR
Subjt:  KLHKKIIELEQKLDARQALELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTR

Query:  AFISVKRMGDLDSKPFLAATKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDE
        AFI+VKRMGDLDSKPF  ATKLKY K+EAD KA+ELCS+WEDQLRD SWHPFRII +D GQAK                 EII+ENDE LKNLKNEYGDE
Subjt:  AFISVKRMGDLDSKPFLAATKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDE

Query:  VYKAVVTALMEMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILRQWKLHKRRRT
        V+KAVVTALMEMNEYNPSGRY V ELWNFKEGRKATLKEGVAHIL+QWKLHKRR+T
Subjt:  VYKAVVTALMEMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILRQWKLHKRRRT

A0A6J1HIA9 protein INVOLVED IN DE NOVO 2-like isoform X11.6e-29062.16Show/hide
Query:  MSVGPKKESETDSLKKYQDFYYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASGGGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQP-
        MSVG KKESETDSLKKYQDFYYADLK G IRIK  GS YRCPFCHG+SGKED+++K+LLRHASG GR SQSW+IKERAKH ALERYMNKYF  EDQPQP 
Subjt:  MSVGPKKESETDSLKKYQDFYYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASGGGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQP-

Query:  -------------------------------------------------------VHKEQRYDRDQPQPVRKEQRHDRHQPQPVCKKQR-----------
                                                               VHKEQ YDRD+PQ VRKEQ +D  +PQ VCK+Q            
Subjt:  -------------------------------------------------------VHKEQRYDRDQPQPVRKEQRHDRHQPQPVCKKQR-----------

Query:  -----------------YDHDQPQPVCKEQHHD--QPQP------------------------------------------------VRKEQRYDHDQPQ
                         YD DQPQPV KEQ++D  QPQP                                                VRKE+ YDHDQPQ
Subjt:  -----------------YDHDQPQPVCKEQHHD--QPQP------------------------------------------------VRKEQRYDHDQPQ

Query:  PISKEKRCDRDQ---------------------------------QPVRKEQRCDRDQ-------------QPVCKEQRCDRDQ----------------
        P+ KE+  DRDQ                                 Q VRKEQ  DRD+             QPV KEQ  DRDQ                
Subjt:  PISKEKRCDRDQ---------------------------------QPVRKEQRCDRDQ-------------QPVCKEQRCDRDQ----------------

Query:  ---QPVRKEQRYDCDQLFVWPWMAIVANIQTEIHAGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDHHG
           QP+RKEQ YD DQLFVWPWMAIVANIQTEIHAGR VGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSG A+VEFNKDWDGFRNA+MFENSFEVDH G
Subjt:  ---QPVRKEQRYDCDQLFVWPWMAIVANIQTEIHAGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDHHG

Query:  KKDYIVSRDRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDETVKTY
        KKDY VSRDRGKKL+GWVARDDDYNSKS+FGDYLRKNGDLKTVSGKEAEDNSKAL+LVSNL NTLE KNLHLKEI HKVLETNASLNNMMEQMDET K Y
Subjt:  KKDYIVSRDRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDETVKTY

Query:  NE--------------------SKIAFSL----------------------------------NERATLEQRKAEDEVLRLAGEQQKEKEKLHKKIIELE
        NE                     K+   L                                  NERATLEQ+KAEDEVL LAGEQQKEKEKLHKKIIELE
Subjt:  NE--------------------SKIAFSL----------------------------------NERATLEQRKAEDEVLRLAGEQQKEKEKLHKKIIELE

Query:  QKLDARQALELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTRAFISVKRMGD
        QKLDARQALELEIERLKGSLEVMKHMGEDGDDD KKKMDQIQQDL+EKE+EFEYFQNINQNLIIKERRTNDEVQDARKELI+V+GGSSTRAFI VKRMGD
Subjt:  QKLDARQALELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTRAFISVKRMGD

Query:  LDSKPFLAATKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVVTALM
        LDSKPF  A KLKY KEEADEKAVELCS+WED+LRDPSWHPFRIIE+DGGQAK                 EIIDENDEMLKNL+NEYGDEVYKAVVTALM
Subjt:  LDSKPFLAATKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVVTALM

Query:  EMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILRQWKLHKRRRT
        EMNEYNPSGRYTVLELWNFKEGRKATLKEG AHIL+QWKLHKRR++
Subjt:  EMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILRQWKLHKRRRT

A0A6J1HLH1 protein INVOLVED IN DE NOVO 2-like isoform X21.7e-30068.11Show/hide
Query:  MSVGPKKESETDSLKKYQDFYYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASGGGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQPV
        MSVG KKESETDSLKKYQDFYYADLK G IRIK  GS YRCPFCHG+SGKED+++K+LLRHASG GR SQSW+IKERAKH ALERYMNKYF  EDQPQPV
Subjt:  MSVGPKKESETDSLKKYQDFYYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASGGGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQPV

Query:  HKEQRYDRDQPQPVRKEQRHDRHQPQPVCKKQRYDHDQPQPVCKEQHHDQPQP--------------------------------------------VRK
         KEQR D  QPQPV KEQ +DR Q Q +CK+QRYD D+PQ VCKE+ +D+ QP                                            VRK
Subjt:  HKEQRYDRDQPQPVRKEQRHDRHQPQPVCKKQRYDHDQPQPVCKEQHHDQPQP--------------------------------------------VRK

Query:  EQRYDHDQPQPISKEKRCDRDQ---------------------------------QPVRKEQRCDRDQ-------------QPVCKEQRCDRDQ------
        E+ YDHDQPQP+ KE+  DRDQ                                 Q VRKEQ  DRD+             QPV KEQ  DRDQ      
Subjt:  EQRYDHDQPQPISKEKRCDRDQ---------------------------------QPVRKEQRCDRDQ-------------QPVCKEQRCDRDQ------

Query:  -------------QPVRKEQRYDCDQLFVWPWMAIVANIQTEIHAGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMF
                     QP+RKEQ YD DQLFVWPWMAIVANIQTEIHAGR VGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSG A+VEFNKDWDGFRNA+MF
Subjt:  -------------QPVRKEQRYDCDQLFVWPWMAIVANIQTEIHAGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMF

Query:  ENSFEVDHHGKKDYIVSRDRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMM
        ENSFEVDH GKKDY VSRDRGKKL+GWVARDDDYNSKS+FGDYLRKNGDLKTVSGKEAEDNSKAL+LVSNL NTLE KNLHLKEI HKVLETNASLNNMM
Subjt:  ENSFEVDHHGKKDYIVSRDRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMM

Query:  EQMDETVKTYNE--------------------SKIAFSL----------------------------------NERATLEQRKAEDEVLRLAGEQQKEKE
        EQMDET K YNE                     K+   L                                  NERATLEQ+KAEDEVL LAGEQQKEKE
Subjt:  EQMDETVKTYNE--------------------SKIAFSL----------------------------------NERATLEQRKAEDEVLRLAGEQQKEKE

Query:  KLHKKIIELEQKLDARQALELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTR
        KLHKKIIELEQKLDARQALELEIERLKGSLEVMKHMGEDGDDD KKKMDQIQQDL+EKE+EFEYFQNINQNLIIKERRTNDEVQDARKELI+V+GGSSTR
Subjt:  KLHKKIIELEQKLDARQALELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTR

Query:  AFISVKRMGDLDSKPFLAATKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDE
        AFI VKRMGDLDSKPF  A KLKY KEEADEKAVELCS+WED+LRDPSWHPFRIIE+DGGQAK                 EIIDENDEMLKNL+NEYGDE
Subjt:  AFISVKRMGDLDSKPFLAATKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDE

Query:  VYKAVVTALMEMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILRQWKLHKRRRT
        VYKAVVTALMEMNEYNPSGRYTVLELWNFKEGRKATLKEG AHIL+QWKLHKRR++
Subjt:  VYKAVVTALMEMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILRQWKLHKRRRT

A0A6J1I318 protein INVOLVED IN DE NOVO 2-like isoform X21.5e-29363.58Show/hide
Query:  MSVGPKKESETDSLKKYQDFYYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASGGGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQP-
        MSVG KKESETDSLKKYQDFYYADLK G IRIK  GS YRCPFCHG+SGKED+++K+LLRHASG GR SQSW+IKERAKH ALERYMNKYF  EDQPQP 
Subjt:  MSVGPKKESETDSLKKYQDFYYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASGGGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQP-

Query:  -------------------------------------------------------VHKEQRYDRDQPQPVRKEQRHDRHQPQPVCKKQRYDHDQPQPVCK
                                                               VHKEQ YDRD+PQ VRKEQ +DR +PQ VCK+  YDHDQPQPV K
Subjt:  -------------------------------------------------------VHKEQRYDRDQPQPVRKEQRHDRHQPQPVCKKQRYDHDQPQPVCK

Query:  EQHH------------------------------------------------DQPQPVRKEQRYDHDQPQPISKEKRCDRDQ------------------
        EQ++                                                D+PQ VRKE+ +DHDQPQP+ KE+  DRDQ                  
Subjt:  EQHH------------------------------------------------DQPQPVRKEQRYDHDQPQPISKEKRCDRDQ------------------

Query:  -QPVRKEQRCDRDQ-----------------------------QPVCKEQRCDRDQ-------------------QPVRKEQRYDCDQLFVWPWMAIVAN
         Q V+KEQ  DRD+                             QPV K+Q  DRDQ                   QP+RKEQ YD DQLFVWPWMAIVAN
Subjt:  -QPVRKEQRCDRDQ-----------------------------QPVCKEQRCDRDQ-------------------QPVRKEQRYDCDQLFVWPWMAIVAN

Query:  IQTEIHAGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDHHGKKDYIVSRDRGKKLYGWVARDDDYNSKS
        IQTEIHAGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSG A+VEFNKDWDGFRNA+MFENSFEVDHHGKKDY VSRDRGKKLYGWVARDDDYNSKS
Subjt:  IQTEIHAGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDHHGKKDYIVSRDRGKKLYGWVARDDDYNSKS

Query:  IFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDETVKTYNE--------------------SKIAFS
        +FGDYLRKNGDLKTV GKEAEDNSKAL+LVSNL NTLE KNLHLKEI HKVLETNASLNNMMEQMDE VK YN+                     K+   
Subjt:  IFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDETVKTYNE--------------------SKIAFS

Query:  L----------------------------------NERATLEQRKAEDEVLRLAGEQQKEKEKLHKKIIELEQKLDARQALELEIERLKGSLEVMKHMGE
        L                                  NERATLEQ+KAEDEVL LAGEQQKEKEKLHKKIIELEQKLDARQALELEIERLKGSLEV+KHMGE
Subjt:  L----------------------------------NERATLEQRKAEDEVLRLAGEQQKEKEKLHKKIIELEQKLDARQALELEIERLKGSLEVMKHMGE

Query:  DGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTRAFISVKRMGDLDSKPFLAATKLKYVKEEADEKAVELCS
        DGDDDAKKKMDQIQQ L+EKE+EFEYFQNINQNLIIKERRTNDEVQDARKELI+V+GGSSTRAFI VKRMGDLDSKPF  A KLKY KEEADEKAVELCS
Subjt:  DGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTRAFISVKRMGDLDSKPFLAATKLKYVKEEADEKAVELCS

Query:  QWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVVTALMEMNEYNPSGRYTVLELWNFKEGRKATLK
        +WED+LRDPSWHPFRIIE+DGG+AK                 EIIDENDEMLKNL+NEYGDEVYKAVVTALMEMNEYNPSGRYTVLELWNFKEGRKATLK
Subjt:  QWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVVTALMEMNEYNPSGRYTVLELWNFKEGRKATLK

Query:  EGVAHILRQWKLHKRRR
        EG AHIL+QWKLHKRR+
Subjt:  EGVAHILRQWKLHKRRR

A0A6J1I5Q0 protein INVOLVED IN DE NOVO 2-like isoform X11.0e-28458.71Show/hide
Query:  MSVGPKKESETDSLKKYQDFYYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASGGGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQP-
        MSVG KKESETDSLKKYQDFYYADLK G IRIK  GS YRCPFCHG+SGKED+++K+LLRHASG GR SQSW+IKERAKH ALERYMNKYF  EDQPQP 
Subjt:  MSVGPKKESETDSLKKYQDFYYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASGGGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQP-

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------VHKEQRYDRDQPQPVRKEQRHDRHQPQPVCKKQRYDHDQPQPVCKEQHH--------------------
                                       VHKEQ YDRD+PQ VRKEQ +DR +PQ VCK+  YDHDQPQPV KEQ++                    
Subjt:  -------------------------------VHKEQRYDRDQPQPVRKEQRHDRHQPQPVCKKQRYDHDQPQPVCKEQHH--------------------

Query:  ----------------------------DQPQPVRKEQRYDHDQPQPISKEKRCDRDQ-------------------QPVRKEQRCDRDQ----------
                                    D+PQ VRKE+ +DHDQPQP+ KE+  DRDQ                   Q V+KEQ  DRD+          
Subjt:  ----------------------------DQPQPVRKEQRYDHDQPQPISKEKRCDRDQ-------------------QPVRKEQRCDRDQ----------

Query:  -------------------QPVCKEQRCDRDQ-------------------QPVRKEQRYDCDQLFVWPWMAIVANIQTEIHAGRHVGESGSKLRDEFMR
                           QPV K+Q  DRDQ                   QP+RKEQ YD DQLFVWPWMAIVANIQTEIHAGRHVGESGSKLRDEFMR
Subjt:  -------------------QPVCKEQRCDRDQ-------------------QPVRKEQRYDCDQLFVWPWMAIVANIQTEIHAGRHVGESGSKLRDEFMR

Query:  QGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDHHGKKDYIVSRDRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNS
        QGFNPLKVHPLWNRFGHSG A+VEFNKDWDGFRNA+MFENSFEVDHHGKKDY VSRDRGKKLYGWVARDDDYNSKS+FGDYLRKNGDLKTV GKEAEDNS
Subjt:  QGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDHHGKKDYIVSRDRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNS

Query:  KALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDETVKTYNE--------------------SKIAFSL-----------------------
        KAL+LVSNL NTLE KNLHLKEI HKVLETNASLNNMMEQMDE VK YN+                     K+   L                       
Subjt:  KALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDETVKTYNE--------------------SKIAFSL-----------------------

Query:  -----------NERATLEQRKAEDEVLRLAGEQQKEKEKLHKKIIELEQKLDARQALELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDLDEKEDEF
                   NERATLEQ+KAEDEVL LAGEQQKEKEKLHKKIIELEQKLDARQALELEIERLKGSLEV+KHMGEDGDDDAKKKMDQIQQ L+EKE+EF
Subjt:  -----------NERATLEQRKAEDEVLRLAGEQQKEKEKLHKKIIELEQKLDARQALELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDLDEKEDEF

Query:  EYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTRAFISVKRMGDLDSKPFLAATKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQA
        EYFQNINQNLIIKERRTNDEVQDARKELI+V+GGSSTRAFI VKRMGDLDSKPF  A KLKY KEEADEKAVELCS+WED+LRDPSWHPFRIIE+DGG+A
Subjt:  EYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTRAFISVKRMGDLDSKPFLAATKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQA

Query:  KNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVVTALMEMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILRQWKLHKRRR
        K                 EIIDENDEMLKNL+NEYGDEVYKAVVTALMEMNEYNPSGRYTVLELWNFKEGRKATLKEG AHIL+QWKLHKRR+
Subjt:  KNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVVTALMEMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILRQWKLHKRRR

SwissProt top hitse value%identityAlignment
Q8VZ79 Protein INVOLVED IN DE NOVO 22.4e-11035.7Show/hide
Query:  ESETDSLKKYQDFYYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASG-GGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQPVHKEQRY
        ESE D   +Y D  Y +LK G +++++    + CP+C  K  K  ++YKDLL+HASG G   S   S KE+A H AL +Y+                   
Subjt:  ESETDSLKKYQDFYYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASG-GGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQPVHKEQRY

Query:  DRDQPQPVRKEQRHDRHQPQPVCKKQRYDHDQPQPVCKEQHHDQPQPVRKEQRYDHDQPQPISKEKRCDRDQQPVRKEQRCDRDQQPVCKEQRCDRDQQP
                +++      + +P  K+Q                                                        ++  P+   Q CD D++ 
Subjt:  DRDQPQPVRKEQRHDRHQPQPVCKKQRYDHDQPQPVCKEQHHDQPQPVRKEQRYDHDQPQPISKEKRCDRDQQPVRKEQRCDRDQQPVCKEQRCDRDQQP

Query:  VRKEQRYDCDQLFVWPWMAIVANI-QTEIHAGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDHHGKKDY
                     V+PW  IV NI  T+   GR  GESGSKLRDE++ +GFNP +V PLWN  GHSG AIVEFNKDW+G  N ++F+ ++ VD HGKKD+
Subjt:  VRKEQRYDCDQLFVWPWMAIVANI-QTEIHAGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDHHGKKDY

Query:  IVSRDRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDET-VKTYNE-
        +        LYGW+AR DDYN  +I G+ LRK GDLKT++    E+  K   LV NL   +E K   +KEI       +  LN +ME+ ++   K Y E 
Subjt:  IVSRDRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDET-VKTYNE-

Query:  -----------------------------SKIAFSLN------------------------------ERATLEQRKAEDEVLRLAGEQQKEKEKLHKKII
                                      K+    N                              E A +EQ+KA++EV +LA +Q+++KE+LH+KII
Subjt:  -----------------------------SKIAFSLN------------------------------ERATLEQRKAEDEVLRLAGEQQKEKEKLHKKII

Query:  ELEQKLDARQALELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTRAFISVKR
         LE++ D +QA+ELE+E+LKG L VMKHM  DGD +  K++D I +DL EKE +       NQ LI++ERRTNDE+Q+A KEL+N+    +T   I VKR
Subjt:  ELEQKLDARQALELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTRAFISVKR

Query:  MGDLDSKPFLAATKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVVT
        MG+L +KPF+ A + KY +++ +++AVE+   WE  L+D  WHPF+ ++ +    +                 E+ID+ DE L+ LK + GD  Y AV  
Subjt:  MGDLDSKPFLAATKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVVT

Query:  ALMEMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILRQWKLHKRRR
        AL+E+NEYNPSGRY   ELWNFK  +KATL+EGV  +L QW+  KR+R
Subjt:  ALMEMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILRQWKLHKRRR

Q9LHB1 Factor of DNA methylation 31.4e-9437.8Show/hide
Query:  VWPWMAIVANI-QTEIHAGRH-VGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDHHGKKDYIVSRDRGKKLYG
        VWPW  ++ NI  T    GR   GESG KL+DE +R+GFNP++V  +W+RFGHSG  IVEFN+DW+G ++A++F+ ++E D HGKKD++        LY 
Subjt:  VWPWMAIVANI-QTEIHAGRH-VGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDHHGKKDYIVSRDRGKKLYG

Query:  WVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDETVKTYNESK------------
        W+A  DDY   +I G+ LRK GDLK++     E+  K  KL+  L   +E K   LK++  K  + +  L    E+ ++ ++ Y+E              
Subjt:  WVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDETVKTYNESK------------

Query:  ----------------------------------------IAFSLNERA---------TLEQRKAEDEVLRLAGEQQKEKEKLHKKIIELEQKLDARQAL
                                                +A  L + A          LEQ+K  ++  RLA + + +KEKLHK+I  LE++LD +Q L
Subjt:  ----------------------------------------IAFSLNERA---------TLEQRKAEDEVLRLAGEQQKEKEKLHKKIIELEQKLDARQAL

Query:  ELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTRAFISVKRMGDLDSKPFLAA
        ELE+++LK  L VM+ +  D   +   K++   +DL E E E  +    NQ+L+++ER++NDE+Q+AR+ LI+          I VKRMG+LD+KPF+ A
Subjt:  ELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTRAFISVKRMGDLDSKPFLAA

Query:  TKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVVTALMEMNEYNPSG
         ++KY +E+ ++ AVE+   WE+ L+DP WHPF+ I+ +  +                T  E+IDE+DE L+ LKNE GD+ Y+AV  AL+E+NEYNPSG
Subjt:  TKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVVTALMEMNEYNPSG

Query:  RYTVLELWNFKEGRKATLKEGVAHILRQWKLHKRRRT
        RY   ELWNF+E RKATL+EGV  +L QW   K  ++
Subjt:  RYTVLELWNFKEGRKATLKEGVAHILRQWKLHKRRRT

Q9LMH6 Factor of DNA methylation 42.9e-12438.74Show/hide
Query:  YYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASGGGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQP--------VHKEQRYDR----
        YY+++K G  ++K+  S +RCPFC+    K DY++ DLLRHASG G  S++   +++A+H ALERYM KY RP ++P+P        + KE+   +    
Subjt:  YYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASGGGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQP--------VHKEQRYDR----

Query:  ----DQPQPVRKEQRHDRHQPQPVCKKQRYDHDQPQPVCKEQHHDQPQPV--RKEQRYDHDQPQPISKEKRCDRDQQPVRKEQRCDRDQQPVCKEQRCDR
            ++ + +  E     H  +   K    D        + +  D+P P    +++ Y   +P  +S  K  D   Q +                     
Subjt:  ----DQPQPVRKEQRHDRHQPQPVCKKQRYDHDQPQPVCKEQHHDQPQPV--RKEQRYDHDQPQPISKEKRCDRDQQPVRKEQRCDRDQQPVCKEQRCDR

Query:  DQQPVRKEQRYDCDQLFVWPWMAIVANIQTEIH--AGRHVGESGSKLRDEFMRQGFNPLKVHPLWN-RFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDH
          Q +      + DQ++V PW  I+AN++   +    ++ GESGSK+R++ +++GFNP KV PLWN R G +G AIV+F K+W+GFRNA MF+  FEV  
Subjt:  DQQPVRKEQRYDCDQLFVWPWMAIVANIQTEIH--AGRHVGESGSKLRDEFMRQGFNPLKVHPLWN-RFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDH

Query:  HGKKDYIVSRDRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDETVK
         GK+D+ ++RD G KLYGWVA+ DDY S++  GD+LRK GDLK+VSGKEAED  K   LVSNL NTL TK+ +L+++     +T++ L   M++ DE + 
Subjt:  HGKKDYIVSRDRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDETVK

Query:  TYNESKIAFSLNER------------------------------------------------------ATLEQRKAEDEVLRLAGEQQKEKEKLHKKIIE
        T+NE         R                                                      AT EQ KA+++++RLA +QQ+EK++L K++ E
Subjt:  TYNESKIAFSLNER------------------------------------------------------ATLEQRKAEDEVLRLAGEQQKEKEKLHKKIIE

Query:  LEQKLDARQALELEIERLKGSLEVMKHM--GEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTRAFISVK
        LE+K+DA QALELEIER++G L+VM HM  GE  D   K+ +++ +++L EKE+++EY +++ Q L++K   TNDE+QDARK LI      +TRA+I VK
Subjt:  LEQKLDARQALELEIERLKGSLEVMKHM--GEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTRAFISVK

Query:  RMGDLDSKPFLAATKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVV
        RMG LD  PF    K KY   EAD+KA ELCS WE+ L D +WHP +++E DG                     E ++E DE L+ L+ E G+EVY AV 
Subjt:  RMGDLDSKPFLAATKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVV

Query:  TALMEMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILRQWKLHK
         AL E NEYN SGRY V ELWNFK+ RKA++KEGV +++  WK  K
Subjt:  TALMEMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILRQWKLHK

Q9S9P3 Factor of DNA methylation 11.5e-8331.44Show/hide
Query:  MSVGPKKESETDS-LKKYQDFYYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASGGGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQP
        MS+  ++   ++S ++ Y +  Y  L+ G  ++KV G + RCPFC GK  K+DY+YK+L  HA+G  + S + S  ++A H AL  ++        +P  
Subjt:  MSVGPKKESETDS-LKKYQDFYYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASGGGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQP

Query:  VHKEQRYDRDQPQPVRKEQRHDRHQPQPVCKKQRYDHDQPQPVCKEQHHDQPQPVRKEQRYDHDQPQPISKEKRCDRDQQPVRKEQRCDRDQQPVCKEQR
                                 P+P     + D  +P P                                                          
Subjt:  VHKEQRYDRDQPQPVRKEQRHDRHQPQPVCKKQRYDHDQPQPVCKEQHHDQPQPVRKEQRYDHDQPQPISKEKRCDRDQQPVRKEQRCDRDQQPVCKEQR

Query:  CDRDQQPVRKEQRYDCDQLFVWPWMAIVANIQTEIHAGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDH
                          ++VWPWM IV N   E      + +S   L+       F P++V+  W         I +FN DW GF  A   E  FE   
Subjt:  CDRDQQPVRKEQRYDCDQLFVWPWMAIVANIQTEIHAGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDH

Query:  HGKKDYIV-SRDRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQ-----
          KK++   S D   K YGW AR DD+ S+   G+YL K G L+TVS    ++      ++  L++ +   N  L ++ +    T  SL  ++++     
Subjt:  HGKKDYIV-SRDRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQ-----

Query:  ---MDETVK------------TYNESKIAFSLNER-----------------------------------------ATLEQRKAEDEVLRLAGEQQKEKE
            DET K             Y++ K++  L+ +                                         A+ EQ+KA++ VLRL  E Q++KE
Subjt:  ---MDETVK------------TYNESKIAFSLNER-----------------------------------------ATLEQRKAEDEVLRLAGEQQKEKE

Query:  KLHKKIIELEQKLDARQALELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELI-NVFGGSST
            KI+ LE++LD +Q LE+EI+ LKG L+VMKH+G+D D+  +KKM ++  +LD+K+ E E  +++N  L+ KER++NDE+Q ARK+LI  + G    
Subjt:  KLHKKIIELEQKLDARQALELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELI-NVFGGSST

Query:  RAFISVKRMGDLDSKPFLAATKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGD
           I VKRMG+LD KPFL   KL+Y   EA  +A  LCS W++ L++PSW PF+  E  G  A+                 E++DE+DE LK LK E+G 
Subjt:  RAFISVKRMGDLDSKPFLAATKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGD

Query:  EVYKAVVTALMEMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILRQWKLHKRRRT
        EV+ AV TAL+EMNEYN SGRYT  ELWNFKEGRKATLKE +  I    K+ KR+RT
Subjt:  EVYKAVVTALMEMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILRQWKLHKRRRT

Q9SAI1 Factor of DNA methylation 51.3e-7931.51Show/hide
Query:  ESETDSLKKYQDFYYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASGGGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQPVHKEQRYD
        ESE D    Y +  Y  L +G  ++KV    +RCPFC GK  K+ Y+YK+LL HASG  + S S S K++A H AL +YM                    
Subjt:  ESETDSLKKYQDFYYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASGGGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQPVHKEQRYD

Query:  RDQPQPVRKEQRHDRHQPQPVCKKQRYDHDQPQPVCKEQHHDQPQPVRKEQRYDHDQPQPISKEKRCDRDQQPVRKEQRCDRDQQPVCKEQRCDRDQQPV
                               +   D D P+P       +Q Q V                                                     
Subjt:  RDQPQPVRKEQRHDRHQPQPVCKKQRYDHDQPQPVCKEQHHDQPQPVRKEQRYDHDQPQPISKEKRCDRDQQPVRKEQRCDRDQQPVCKEQRCDRDQQPV

Query:  RKEQRYDCDQLFVWPWMAIVANIQTEIHAGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDHHGKKDYIV
                D ++VWPWM IV N          + +S   L+       FNPL+V  LW         I +FN  W GF++    E  +E+   G+KD+I 
Subjt:  RKEQRYDCDQLFVWPWMAIVANIQTEIHAGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDHHGKKDYIV

Query:  SR-DRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDETVKTYNE---
         R D   K YGW AR DDYNS+    +YL K G L++ S    E+      +V +LAN +   N  L ++ +   E   SL  ++ + DE  + Y +   
Subjt:  SR-DRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDETVKTYNE---

Query:  ----------------------------------------------------------SKIAFSLNERATLEQRKAEDEVLRLAGEQQKEKEKLHKKIIE
                                                                  S +  S  + A+LEQ+K +D VLRL  E +++KE+   KI++
Subjt:  ----------------------------------------------------------SKIAFSLNERATLEQRKAEDEVLRLAGEQQKEKEKLHKKIIE

Query:  LEQKLDARQALELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGG-SSTRAFISVKR
        LE++LD++Q L++EI+ LKG L+VMKH  ED D+  KKKM +++++L+EK  E +  ++ N  L++KER++NDE+ +ARK LI       S R  I VKR
Subjt:  LEQKLDARQALELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGG-SSTRAFISVKR

Query:  MGDLDSKPFLAATKLK-YVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVV
        MG+L+ KPF+ A + +  V+EEA  +   LCS+W+++++D +W PF+ +   G + K                 E++DE DE +K L+ E+G+EV  AV 
Subjt:  MGDLDSKPFLAATKLK-YVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVV

Query:  TALMEMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILRQWKLHKRRR
        TAL E+NE+NPSGRY+V ELWN K+GRKATLKE + +I +Q K  KRRR
Subjt:  TALMEMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILRQWKLHKRRR

Arabidopsis top hitse value%identityAlignment
AT1G13790.1 XH/XS domain-containing protein2.1e-12538.74Show/hide
Query:  YYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASGGGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQP--------VHKEQRYDR----
        YY+++K G  ++K+  S +RCPFC+    K DY++ DLLRHASG G  S++   +++A+H ALERYM KY RP ++P+P        + KE+   +    
Subjt:  YYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASGGGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQP--------VHKEQRYDR----

Query:  ----DQPQPVRKEQRHDRHQPQPVCKKQRYDHDQPQPVCKEQHHDQPQPV--RKEQRYDHDQPQPISKEKRCDRDQQPVRKEQRCDRDQQPVCKEQRCDR
            ++ + +  E     H  +   K    D        + +  D+P P    +++ Y   +P  +S  K  D   Q +                     
Subjt:  ----DQPQPVRKEQRHDRHQPQPVCKKQRYDHDQPQPVCKEQHHDQPQPV--RKEQRYDHDQPQPISKEKRCDRDQQPVRKEQRCDRDQQPVCKEQRCDR

Query:  DQQPVRKEQRYDCDQLFVWPWMAIVANIQTEIH--AGRHVGESGSKLRDEFMRQGFNPLKVHPLWN-RFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDH
          Q +      + DQ++V PW  I+AN++   +    ++ GESGSK+R++ +++GFNP KV PLWN R G +G AIV+F K+W+GFRNA MF+  FEV  
Subjt:  DQQPVRKEQRYDCDQLFVWPWMAIVANIQTEIH--AGRHVGESGSKLRDEFMRQGFNPLKVHPLWN-RFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDH

Query:  HGKKDYIVSRDRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDETVK
         GK+D+ ++RD G KLYGWVA+ DDY S++  GD+LRK GDLK+VSGKEAED  K   LVSNL NTL TK+ +L+++     +T++ L   M++ DE + 
Subjt:  HGKKDYIVSRDRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDETVK

Query:  TYNESKIAFSLNER------------------------------------------------------ATLEQRKAEDEVLRLAGEQQKEKEKLHKKIIE
        T+NE         R                                                      AT EQ KA+++++RLA +QQ+EK++L K++ E
Subjt:  TYNESKIAFSLNER------------------------------------------------------ATLEQRKAEDEVLRLAGEQQKEKEKLHKKIIE

Query:  LEQKLDARQALELEIERLKGSLEVMKHM--GEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTRAFISVK
        LE+K+DA QALELEIER++G L+VM HM  GE  D   K+ +++ +++L EKE+++EY +++ Q L++K   TNDE+QDARK LI      +TRA+I VK
Subjt:  LEQKLDARQALELEIERLKGSLEVMKHM--GEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTRAFISVK

Query:  RMGDLDSKPFLAATKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVV
        RMG LD  PF    K KY   EAD+KA ELCS WE+ L D +WHP +++E DG                     E ++E DE L+ L+ E G+EVY AV 
Subjt:  RMGDLDSKPFLAATKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVV

Query:  TALMEMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILRQWKLHK
         AL E NEYN SGRY V ELWNFK+ RKA++KEGV +++  WK  K
Subjt:  TALMEMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILRQWKLHK

AT3G12550.1 XH/XS domain-containing protein1.0e-9537.8Show/hide
Query:  VWPWMAIVANI-QTEIHAGRH-VGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDHHGKKDYIVSRDRGKKLYG
        VWPW  ++ NI  T    GR   GESG KL+DE +R+GFNP++V  +W+RFGHSG  IVEFN+DW+G ++A++F+ ++E D HGKKD++        LY 
Subjt:  VWPWMAIVANI-QTEIHAGRH-VGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDHHGKKDYIVSRDRGKKLYG

Query:  WVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDETVKTYNESK------------
        W+A  DDY   +I G+ LRK GDLK++     E+  K  KL+  L   +E K   LK++  K  + +  L    E+ ++ ++ Y+E              
Subjt:  WVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDETVKTYNESK------------

Query:  ----------------------------------------IAFSLNERA---------TLEQRKAEDEVLRLAGEQQKEKEKLHKKIIELEQKLDARQAL
                                                +A  L + A          LEQ+K  ++  RLA + + +KEKLHK+I  LE++LD +Q L
Subjt:  ----------------------------------------IAFSLNERA---------TLEQRKAEDEVLRLAGEQQKEKEKLHKKIIELEQKLDARQAL

Query:  ELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTRAFISVKRMGDLDSKPFLAA
        ELE+++LK  L VM+ +  D   +   K++   +DL E E E  +    NQ+L+++ER++NDE+Q+AR+ LI+          I VKRMG+LD+KPF+ A
Subjt:  ELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTRAFISVKRMGDLDSKPFLAA

Query:  TKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVVTALMEMNEYNPSG
         ++KY +E+ ++ AVE+   WE+ L+DP WHPF+ I+ +  +                T  E+IDE+DE L+ LKNE GD+ Y+AV  AL+E+NEYNPSG
Subjt:  TKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVVTALMEMNEYNPSG

Query:  RYTVLELWNFKEGRKATLKEGVAHILRQWKLHKRRRT
        RY   ELWNF+E RKATL+EGV  +L QW   K  ++
Subjt:  RYTVLELWNFKEGRKATLKEGVAHILRQWKLHKRRRT

AT3G48670.1 XH/XS domain-containing protein1.7e-11135.7Show/hide
Query:  ESETDSLKKYQDFYYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASG-GGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQPVHKEQRY
        ESE D   +Y D  Y +LK G +++++    + CP+C  K  K  ++YKDLL+HASG G   S   S KE+A H AL +Y+                   
Subjt:  ESETDSLKKYQDFYYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASG-GGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQPVHKEQRY

Query:  DRDQPQPVRKEQRHDRHQPQPVCKKQRYDHDQPQPVCKEQHHDQPQPVRKEQRYDHDQPQPISKEKRCDRDQQPVRKEQRCDRDQQPVCKEQRCDRDQQP
                +++      + +P  K+Q                                                        ++  P+   Q CD D++ 
Subjt:  DRDQPQPVRKEQRHDRHQPQPVCKKQRYDHDQPQPVCKEQHHDQPQPVRKEQRYDHDQPQPISKEKRCDRDQQPVRKEQRCDRDQQPVCKEQRCDRDQQP

Query:  VRKEQRYDCDQLFVWPWMAIVANI-QTEIHAGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDHHGKKDY
                     V+PW  IV NI  T+   GR  GESGSKLRDE++ +GFNP +V PLWN  GHSG AIVEFNKDW+G  N ++F+ ++ VD HGKKD+
Subjt:  VRKEQRYDCDQLFVWPWMAIVANI-QTEIHAGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDHHGKKDY

Query:  IVSRDRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDET-VKTYNE-
        +        LYGW+AR DDYN  +I G+ LRK GDLKT++    E+  K   LV NL   +E K   +KEI       +  LN +ME+ ++   K Y E 
Subjt:  IVSRDRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDET-VKTYNE-

Query:  -----------------------------SKIAFSLN------------------------------ERATLEQRKAEDEVLRLAGEQQKEKEKLHKKII
                                      K+    N                              E A +EQ+KA++EV +LA +Q+++KE+LH+KII
Subjt:  -----------------------------SKIAFSLN------------------------------ERATLEQRKAEDEVLRLAGEQQKEKEKLHKKII

Query:  ELEQKLDARQALELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTRAFISVKR
         LE++ D +QA+ELE+E+LKG L VMKHM  DGD +  K++D I +DL EKE +       NQ LI++ERRTNDE+Q+A KEL+N+    +T   I VKR
Subjt:  ELEQKLDARQALELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTRAFISVKR

Query:  MGDLDSKPFLAATKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVVT
        MG+L +KPF+ A + KY +++ +++AVE+   WE  L+D  WHPF+ ++ +    +                 E+ID+ DE L+ LK + GD  Y AV  
Subjt:  MGDLDSKPFLAATKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVVT

Query:  ALMEMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILRQWKLHKRRR
        AL+E+NEYNPSGRY   ELWNFK  +KATL+EGV  +L QW+  KR+R
Subjt:  ALMEMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILRQWKLHKRRR

AT3G48670.2 XH/XS domain-containing protein1.7e-11135.7Show/hide
Query:  ESETDSLKKYQDFYYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASG-GGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQPVHKEQRY
        ESE D   +Y D  Y +LK G +++++    + CP+C  K  K  ++YKDLL+HASG G   S   S KE+A H AL +Y+                   
Subjt:  ESETDSLKKYQDFYYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASG-GGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQPVHKEQRY

Query:  DRDQPQPVRKEQRHDRHQPQPVCKKQRYDHDQPQPVCKEQHHDQPQPVRKEQRYDHDQPQPISKEKRCDRDQQPVRKEQRCDRDQQPVCKEQRCDRDQQP
                +++      + +P  K+Q                                                        ++  P+   Q CD D++ 
Subjt:  DRDQPQPVRKEQRHDRHQPQPVCKKQRYDHDQPQPVCKEQHHDQPQPVRKEQRYDHDQPQPISKEKRCDRDQQPVRKEQRCDRDQQPVCKEQRCDRDQQP

Query:  VRKEQRYDCDQLFVWPWMAIVANI-QTEIHAGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDHHGKKDY
                     V+PW  IV NI  T+   GR  GESGSKLRDE++ +GFNP +V PLWN  GHSG AIVEFNKDW+G  N ++F+ ++ VD HGKKD+
Subjt:  VRKEQRYDCDQLFVWPWMAIVANI-QTEIHAGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDHHGKKDY

Query:  IVSRDRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDET-VKTYNE-
        +        LYGW+AR DDYN  +I G+ LRK GDLKT++    E+  K   LV NL   +E K   +KEI       +  LN +ME+ ++   K Y E 
Subjt:  IVSRDRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDET-VKTYNE-

Query:  -----------------------------SKIAFSLN------------------------------ERATLEQRKAEDEVLRLAGEQQKEKEKLHKKII
                                      K+    N                              E A +EQ+KA++EV +LA +Q+++KE+LH+KII
Subjt:  -----------------------------SKIAFSLN------------------------------ERATLEQRKAEDEVLRLAGEQQKEKEKLHKKII

Query:  ELEQKLDARQALELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTRAFISVKR
         LE++ D +QA+ELE+E+LKG L VMKHM  DGD +  K++D I +DL EKE +       NQ LI++ERRTNDE+Q+A KEL+N+    +T   I VKR
Subjt:  ELEQKLDARQALELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTRAFISVKR

Query:  MGDLDSKPFLAATKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVVT
        MG+L +KPF+ A + KY +++ +++AVE+   WE  L+D  WHPF+ ++ +    +                 E+ID+ DE L+ LK + GD  Y AV  
Subjt:  MGDLDSKPFLAATKLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVVT

Query:  ALMEMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILRQWKLHKRRR
        AL+E+NEYNPSGRY   ELWNFK  +KATL+EGV  +L QW+  KR+R
Subjt:  ALMEMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILRQWKLHKRRR

AT5G59390.1 XH/XS domain-containing protein3.1e-9740.36Show/hide
Query:  DRDQQPVRKEQRYDCDQLFVWPWMAIVANIQTEIH-AGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDH
        + D   V+++QR      FVWPW+ +VANI TE+  +GR VG+SGS LRDEF  +GFNP +V P+W+  GHSG A+VEF KD++GF +A+ FE SF +D 
Subjt:  DRDQQPVRKEQRYDCDQLFVWPWMAIVANIQTEIH-AGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDH

Query:  HGKKDYIVS-RDRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETN--------------
        HGK+D+    R R  KLYGWVAR+DDYN     G  ++K  DLK++S    ED  K + LV N++ T+E K    +E+  KV ET+              
Subjt:  HGKKDYIVS-RDRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTVSGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETN--------------

Query:  -----------------------------ASLNNMMEQMDETVKTYNE-----------SKIAFSLNERATLEQRKAEDEVLRLAGEQQ-----KEKEKL
                                     A L    E++DE  +   +           S++   +N++A  EQ +A +E ++LA + Q     KEKEKL
Subjt:  -----------------------------ASLNNMMEQMDETVKTYNE-----------SKIAFSLNERATLEQRKAEDEVLRLAGEQQ-----KEKEKL

Query:  HKKIIELEQKLDARQALELEIERLKGSLEVMKHM-GEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSS---
        HK+I+E+E KL+  Q LELEIE+LKG+  VMKHM G DGD D  +K+ + Q  LD +E            L  KER TNDE QD  KE+I V+  +    
Subjt:  HKKIIELEQKLDARQALELEIERLKGSLEVMKHM-GEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSS---

Query:  TRAFISVKRMGDLDSKPFLAAT--KLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNE
         +  I VKRMG L+  PFL A   K K  + +A+ KA++LCS WE  + D  W PFR+ E+DG                  TP  ++DENDE L+ LKN+
Subjt:  TRAFISVKRMGDLDSKPFLAAT--KLKYVKEEADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNE

Query:  YGDEVYKAVVTALMEMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILR
        YG+EVY  VV   +E+ E+N SG Y +LELWN++E RKAT++E    +L+
Subjt:  YGDEVYKAVVTALMEMNEYNPSGRYTVLELWNFKEGRKATLKEGVAHILR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
AACCAGAATTCCCCTCAGTTATGCCCTTCTCAATGCTTCCCTCCGCCCTCTGAGCCACCGGAAACTCTCTGTTGTTACCGTCATTGTTCTCTTTCGGTTGACTCCGCTCC
TCATTTTTTCTACGTCTCTTTCCGCGCTCCACTGAGCCACCGGAGACGTTCCTCTTATGCTTCACTGATCATTTCGTGTGAAAGGATCGTTACGACTGGAGGGTTAGGTG
GTTGTCCCATGAAAATAGTCAAGTTACGACTTGCAAGTTGTAAATCTATTCTTTCTAATAGGTCCAAGATGTCTGTTGGCCCGAAGAAAGAATCTGAGACTGATAGTTTG
AAGAAGTATCAAGACTTCTATTATGCAGATTTGAAACATGGTCACATAAGAATTAAAGTTTTTGGTTCTAAGTACAGATGTCCATTCTGCCATGGGAAGAGTGGGAAGGA
GGACTACCGATATAAGGATCTTCTCCGACATGCTTCTGGTGGAGGGAGATGCTCACAGAGCTGGAGCATAAAAGAAAGGGCAAAGCACTCAGCTTTGGAGAGGTATATGA
ATAAGTATTTTCGCCCTGAGGATCAACCTCAACCTGTCCACAAGGAGCAGCGTTATGATCGTGATCAACCTCAACCCGTCCGCAAGGAGCAGCGCCATGATCGTCATCAA
CCTCAACCTGTCTGTAAGAAGCAACGTTATGATCATGATCAACCTCAACCTGTCTGCAAGGAGCAACATCATGATCAACCTCAACCTGTTCGCAAGGAGCAACGTTATGA
TCATGATCAACCTCAACCTATCAGCAAGGAGAAGCGTTGTGATCGTGATCAACAACCTGTTCGCAAGGAGCAGCGTTGTGATCGTGATCAACAACCTGTCTGCAAGGAGC
AGCGCTGTGATCGTGATCAACAACCTGTCCGCAAGGAGCAGCGCTATGATTGCGATCAGTTGTTTGTCTGGCCTTGGATGGCTATTGTAGCAAACATACAAACTGAGATA
CATGCTGGACGACATGTTGGGGAAAGTGGTTCCAAACTTCGAGATGAATTTATGAGACAAGGTTTTAACCCTTTGAAGGTTCATCCTTTGTGGAATCGCTTCGGTCATTC
TGGACTTGCAATTGTTGAGTTTAACAAGGATTGGGACGGTTTTAGAAATGCCATAATGTTTGAAAATAGCTTTGAAGTTGACCATCATGGGAAGAAAGACTATATTGTTT
CAAGGGACCGGGGAAAGAAATTGTATGGTTGGGTGGCAAGGGATGATGATTACAATTCAAAAAGCATATTTGGAGATTATCTGCGGAAGAATGGGGACTTGAAAACTGTA
TCCGGTAAAGAGGCTGAAGATAACAGTAAAGCATTAAAGCTTGTCTCAAACTTAGCCAATACACTGGAAACTAAGAATCTACACCTTAAAGAGATCGCTCACAAAGTTCT
AGAGACTAATGCATCCTTGAATAATATGATGGAACAGATGGACGAGACGGTTAAAACTTACAATGAAAGTAAGATTGCATTTTCTTTGAATGAAAGGGCCACTTTGGAGC
AAAGGAAGGCAGAAGATGAAGTCTTGCGCCTAGCAGGGGAACAACAGAAAGAGAAGGAGAAGCTTCACAAAAAGATCATAGAGCTGGAACAGAAACTTGATGCAAGACAA
GCATTAGAGTTGGAAATTGAGAGGTTGAAGGGTTCCTTAGAAGTCATGAAACATATGGGTGAGGATGGTGATGATGATGCCAAGAAAAAAATGGATCAGATTCAACAAGA
TCTGGACGAGAAAGAAGATGAATTTGAATACTTTCAAAACATCAATCAAAATCTTATCATCAAAGAGCGTAGAACCAATGATGAAGTTCAAGATGCGCGCAAAGAATTGA
TCAATGTGTTTGGAGGTTCATCGACCCGAGCCTTTATTAGTGTCAAGAGAATGGGAGATCTTGACAGCAAACCATTCTTAGCAGCCACAAAGTTGAAATATGTCAAGGAA
GAAGCAGATGAGAAAGCAGTAGAGTTGTGCTCACAGTGGGAGGACCAACTTCGTGACCCTAGCTGGCATCCTTTCAGAATTATAGAGAACGATGGAGGACAAGCTAAGAA
TTTCTGGTTGATTTGGAAAGGAGTGGATTTGCCTTGGCCAACGCCATATGAAATTATCGATGAAAATGATGAAATGTTAAAAAATTTGAAGAATGAGTATGGAGATGAAG
TTTACAAGGCTGTTGTCACAGCCTTGATGGAAATGAACGAGTATAACCCAAGTGGTAGATATACAGTATTGGAGCTGTGGAACTTTAAGGAGGGAAGAAAAGCGACATTA
AAGGAAGGAGTAGCTCATATACTGAGGCAATGGAAACTGCACAAAAGAAGGAGAACCTGA
mRNA sequenceShow/hide mRNA sequence
AACCAGAATTCCCCTCAGTTATGCCCTTCTCAATGCTTCCCTCCGCCCTCTGAGCCACCGGAAACTCTCTGTTGTTACCGTCATTGTTCTCTTTCGGTTGACTCCGCTCC
TCATTTTTTCTACGTCTCTTTCCGCGCTCCACTGAGCCACCGGAGACGTTCCTCTTATGCTTCACTGATCATTTCGTGTGAAAGGATCGTTACGACTGGAGGGTTAGGTG
GTTGTCCCATGAAAATAGTCAAGTTACGACTTGCAAGTTGTAAATCTATTCTTTCTAATAGGTCCAAGATGTCTGTTGGCCCGAAGAAAGAATCTGAGACTGATAGTTTG
AAGAAGTATCAAGACTTCTATTATGCAGATTTGAAACATGGTCACATAAGAATTAAAGTTTTTGGTTCTAAGTACAGATGTCCATTCTGCCATGGGAAGAGTGGGAAGGA
GGACTACCGATATAAGGATCTTCTCCGACATGCTTCTGGTGGAGGGAGATGCTCACAGAGCTGGAGCATAAAAGAAAGGGCAAAGCACTCAGCTTTGGAGAGGTATATGA
ATAAGTATTTTCGCCCTGAGGATCAACCTCAACCTGTCCACAAGGAGCAGCGTTATGATCGTGATCAACCTCAACCCGTCCGCAAGGAGCAGCGCCATGATCGTCATCAA
CCTCAACCTGTCTGTAAGAAGCAACGTTATGATCATGATCAACCTCAACCTGTCTGCAAGGAGCAACATCATGATCAACCTCAACCTGTTCGCAAGGAGCAACGTTATGA
TCATGATCAACCTCAACCTATCAGCAAGGAGAAGCGTTGTGATCGTGATCAACAACCTGTTCGCAAGGAGCAGCGTTGTGATCGTGATCAACAACCTGTCTGCAAGGAGC
AGCGCTGTGATCGTGATCAACAACCTGTCCGCAAGGAGCAGCGCTATGATTGCGATCAGTTGTTTGTCTGGCCTTGGATGGCTATTGTAGCAAACATACAAACTGAGATA
CATGCTGGACGACATGTTGGGGAAAGTGGTTCCAAACTTCGAGATGAATTTATGAGACAAGGTTTTAACCCTTTGAAGGTTCATCCTTTGTGGAATCGCTTCGGTCATTC
TGGACTTGCAATTGTTGAGTTTAACAAGGATTGGGACGGTTTTAGAAATGCCATAATGTTTGAAAATAGCTTTGAAGTTGACCATCATGGGAAGAAAGACTATATTGTTT
CAAGGGACCGGGGAAAGAAATTGTATGGTTGGGTGGCAAGGGATGATGATTACAATTCAAAAAGCATATTTGGAGATTATCTGCGGAAGAATGGGGACTTGAAAACTGTA
TCCGGTAAAGAGGCTGAAGATAACAGTAAAGCATTAAAGCTTGTCTCAAACTTAGCCAATACACTGGAAACTAAGAATCTACACCTTAAAGAGATCGCTCACAAAGTTCT
AGAGACTAATGCATCCTTGAATAATATGATGGAACAGATGGACGAGACGGTTAAAACTTACAATGAAAGTAAGATTGCATTTTCTTTGAATGAAAGGGCCACTTTGGAGC
AAAGGAAGGCAGAAGATGAAGTCTTGCGCCTAGCAGGGGAACAACAGAAAGAGAAGGAGAAGCTTCACAAAAAGATCATAGAGCTGGAACAGAAACTTGATGCAAGACAA
GCATTAGAGTTGGAAATTGAGAGGTTGAAGGGTTCCTTAGAAGTCATGAAACATATGGGTGAGGATGGTGATGATGATGCCAAGAAAAAAATGGATCAGATTCAACAAGA
TCTGGACGAGAAAGAAGATGAATTTGAATACTTTCAAAACATCAATCAAAATCTTATCATCAAAGAGCGTAGAACCAATGATGAAGTTCAAGATGCGCGCAAAGAATTGA
TCAATGTGTTTGGAGGTTCATCGACCCGAGCCTTTATTAGTGTCAAGAGAATGGGAGATCTTGACAGCAAACCATTCTTAGCAGCCACAAAGTTGAAATATGTCAAGGAA
GAAGCAGATGAGAAAGCAGTAGAGTTGTGCTCACAGTGGGAGGACCAACTTCGTGACCCTAGCTGGCATCCTTTCAGAATTATAGAGAACGATGGAGGACAAGCTAAGAA
TTTCTGGTTGATTTGGAAAGGAGTGGATTTGCCTTGGCCAACGCCATATGAAATTATCGATGAAAATGATGAAATGTTAAAAAATTTGAAGAATGAGTATGGAGATGAAG
TTTACAAGGCTGTTGTCACAGCCTTGATGGAAATGAACGAGTATAACCCAAGTGGTAGATATACAGTATTGGAGCTGTGGAACTTTAAGGAGGGAAGAAAAGCGACATTA
AAGGAAGGAGTAGCTCATATACTGAGGCAATGGAAACTGCACAAAAGAAGGAGAACCTGA
Protein sequenceShow/hide protein sequence
NQNSPQLCPSQCFPPPSEPPETLCCYRHCSLSVDSAPHFFYVSFRAPLSHRRRSSYASLIISCERIVTTGGLGGCPMKIVKLRLASCKSILSNRSKMSVGPKKESETDSL
KKYQDFYYADLKHGHIRIKVFGSKYRCPFCHGKSGKEDYRYKDLLRHASGGGRCSQSWSIKERAKHSALERYMNKYFRPEDQPQPVHKEQRYDRDQPQPVRKEQRHDRHQ
PQPVCKKQRYDHDQPQPVCKEQHHDQPQPVRKEQRYDHDQPQPISKEKRCDRDQQPVRKEQRCDRDQQPVCKEQRCDRDQQPVRKEQRYDCDQLFVWPWMAIVANIQTEI
HAGRHVGESGSKLRDEFMRQGFNPLKVHPLWNRFGHSGLAIVEFNKDWDGFRNAIMFENSFEVDHHGKKDYIVSRDRGKKLYGWVARDDDYNSKSIFGDYLRKNGDLKTV
SGKEAEDNSKALKLVSNLANTLETKNLHLKEIAHKVLETNASLNNMMEQMDETVKTYNESKIAFSLNERATLEQRKAEDEVLRLAGEQQKEKEKLHKKIIELEQKLDARQ
ALELEIERLKGSLEVMKHMGEDGDDDAKKKMDQIQQDLDEKEDEFEYFQNINQNLIIKERRTNDEVQDARKELINVFGGSSTRAFISVKRMGDLDSKPFLAATKLKYVKE
EADEKAVELCSQWEDQLRDPSWHPFRIIENDGGQAKNFWLIWKGVDLPWPTPYEIIDENDEMLKNLKNEYGDEVYKAVVTALMEMNEYNPSGRYTVLELWNFKEGRKATL
KEGVAHILRQWKLHKRRRT