CuGenDBv2

Sgr015371 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr015371
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: DDE Tnp4 domain-containing protein
Genome location: tig00003469:741214..743357
RNA-Seq Expression: Sgr015371
Synteny: Sgr015371
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: IPR027806 - Harbinger transposase-derived nuclease domain
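
The genome location in this record is written as `contig:start..end`. As a minimal sketch, the string can be split into its parts and the genomic span computed. The helper names `Location` and `parse_location` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'contig:start..end' location."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location string such as 'tig00003469:741214..743357'."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    contig, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(contig, start, end)


loc = parse_location("tig00003469:741214..743357")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 2144 bp on contig tig00003469.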


Homology
GenBank top hits | e value | % identity
GAV85037.1 DDE_4 domain-containing protein [Cephalotus follicularis] | 5.5e-162 | 61.82
Query:  DEDASSGSQGDTHMVDGHNKRQSGLPSSSGHRKRSRKATGDAIVDAMLEIAAASKMRATAIMRNEDRFSISKCIKVLDEMQGVDQQAYFLALDLFENPNA
        DEDASSGS  D +M+DGHNKRQS  PS+SG  KRSRKATGDAIVDAMLEIAA SKMRA AIM+NEDRFSISKCIKVLDEMQGVDQ+ YFLALDL ENP+A
Subjt:  DEDASSGSQGDTHMVDGHNKRQSGLPSSSGHRKRSRKATGDAIVDAMLEIAAASKMRATAIMRNEDRFSISKCIKVLDEMQGVDQQAYFLALDLFENPNA

Query:  RETFISL-----------KNLSTAMDDFDLELDEMELVAAAAGYYYYNSLTRQPCPSLSPSGCGFMTEVLSGHDDACREMLRMDKHVFHKLCDILRHRSL
        RE FISL           K +S +MDDF+LELDEMELVAAAAGYYYYN++T+QP  SLSPSGCGFMTE+ +G+DD CREM RMDK VF+KLC  LR R L
Subjt:  RETFISL-----------KNLSTAMDDFDLELDEMELVAAAAGYYYYNSLTRQPCPSLSPSGCGFMTEVLSGHDDACREMLRMDKHVFHKLCDILRHRSL

Query:  LRDTAGVMIEEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLSREFLQPTPNITPVEIFESTSLI----NCL----------------
        LRDTAGVMIEEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLSREFLQP P  TP EI  S        +C+                
Subjt:  LRDTAGVMIEEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLSREFLQPTPNITPVEIFESTSLI----NCL----------------

Query:  -------------------------------WE----------------------FVSGKYYLVDMGYSNMEGFIAPFHDVRYNLHKFRGASHLPTNAKE
                                       WE                         GKYYLVD  Y NMEGFIAP   VRY    ++GA+ LP NAKE
Subjt:  -------------------------------WE----------------------FVSGKYYLVDMGYSNMEGFIAPFHDVRYNLHKFRGASHLPTNAKE

Query:  LFNHRHASLSNVIKKSFHVLKTRFPILKLAPQYAFHIQRDIVIATCVIHNFIRCAERKDWLFMSTEGMIIDEEEPNYVEFPEMHLTTLVQDHVAFSLRDS
        LFNHRH+ L N I + F VLKTRFPILKLAPQYAFHIQRDIVIA+CV+HNFIR  ER DWLF   EGM + EE P+  + P+MHL + +Q+ ++ +LR+S
Subjt:  LFNHRHASLSNVIKKSFHVLKTRFPILKLAPQYAFHIQRDIVIATCVIHNFIRCAERKDWLFMSTEGMIIDEEEPNYVEFPEMHLTTLVQDHVAFSLRDS

Query:  IALAMWDDFMTKWDQW
        IA AMW+DF+ +WD+W
Subjt:  IALAMWDDFMTKWDQW

KAA3479824.1 DDE_4 domain-containing protein [Gossypium australe] | 1.2e-161 | 56.73
Query:  MNYVDEDASSGSQGDTHMVDGHNKRQSGLPSSSGHRKRSRKATGDAIVDAMLEIAAASKMRATAIMRNEDRFSISKCIKVLDEMQGVDQQAYFLALDLFE
        MNYVD++AS+GS  D +M++G NKRQS  PSSSG RKRSRKATGDAIVDAMLEIAAASK+RA+AIM+NED+FSISKCIKVLDEMQGVDQ+ YFLALDLFE
Subjt:  MNYVDEDASSGSQGDTHMVDGHNKRQSGLPSSSGHRKRSRKATGDAIVDAMLEIAAASKMRATAIMRNEDRFSISKCIKVLDEMQGVDQQAYFLALDLFE

Query:  NPNARETFISLK---------------------------------------------------NLSTAMDDFDLELDEMELVAAAAGYYYYNSLTRQP-C
        NPNARE FISLK                                                   ++ST MDD DLELDEMELVAAAAGYYYYNS+TRQ  C
Subjt:  NPNARETFISLK---------------------------------------------------NLSTAMDDFDLELDEMELVAAAAGYYYYNSLTRQP-C

Query:  PSLSPSGCGFMTEVLSGHDDACREMLRMDKHVFHKLCDILRHRSLLRDTAGVMIEEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLS
         SL PSG GFM EVL G DD CREMLRMDKHVFHKLC  LRHR +LRDTAGVMIEEQLAIFL+IVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLS
Subjt:  PSLSPSGCGFMTEVLSGHDDACREMLRMDKHVFHKLCDILRHRSLLRDTAGVMIEEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLS

Query:  REFLQPTPNITPVEIFE----------------------STSLI--------NCL---------------------------------------------
        REFLQP    TP EI +                      S SL+        +C+                                             
Subjt:  REFLQPTPNITPVEIFE----------------------STSLI--------NCL---------------------------------------------

Query:  --WE----------------------FVSGKYYLVDMGYSNMEGFIAPFHDVRYNLHKFRGASHLPTNAKELFNHRHASLSNVIKKSFHVLKTRFPILKL
          WE                         GKYYLVD GYSNMEGF+AP+  VRY+LH++RGA+ LP NAKELFNHRH+SL NVI+++F VLKTRFPILKL
Subjt:  --WE----------------------FVSGKYYLVDMGYSNMEGFIAPFHDVRYNLHKFRGASHLPTNAKELFNHRHASLSNVIKKSFHVLKTRFPILKL

Query:  APQYAFHIQRDIVIATCVIHNFIRCAERKDWLFMSTEGMIIDEEEPNYVEFPEMHLTTLVQDHVAFSLRDSIALAMWDDFMTKWDQW
        APQYAFHIQRDIVIA CV+HN+IR  ER DWLF S EG  +D E P++ E PE+   +  Q+ +A SLR+SIA  MW+DF+ KWDQW
Subjt:  APQYAFHIQRDIVIATCVIHNFIRCAERKDWLFMSTEGMIIDEEEPNYVEFPEMHLTTLVQDHVAFSLRDSIALAMWDDFMTKWDQW

KAF7820599.1 putative nuclease HARBI1 [Senna tora] | 1.4e-168 | 62.76
Query:  MNYVDEDASSGSQGDTHMVDGHNKRQSGLPSSSGHRKRSRKATGDAIVDAMLEIAAASKMRATAIMRNEDRFSISKCIKVLDEMQGVDQQAYFLALDLFE
        MNYV+EDASSGS  D  M+DGH KRQ+G+PSSSG RKRSRKATGDAIVDAMLEIAAASKMRATAIM+NEDRFSISKCIKVLDEMQGVDQ+ YF ALDLFE
Subjt:  MNYVDEDASSGSQGDTHMVDGHNKRQSGLPSSSGHRKRSRKATGDAIVDAMLEIAAASKMRATAIMRNEDRFSISKCIKVLDEMQGVDQQAYFLALDLFE

Query:  NPNARETFISLK------------NLSTAMDDFDLELDEMELVAAAAGYYYYNSLTRQPCPSLSPSGCGFMTEVLSGHDDACREMLRMDKHVFHKLCDIL
        +P ARE FISLK            ++S ++DD DLEL+EMELVAAAAGYYYYNS+ +QPC SLSP  CGFMTE+L+  DD+CREMLRMDKHVFHKLCDIL
Subjt:  NPNARETFISLK------------NLSTAMDDFDLELDEMELVAAAAGYYYYNSLTRQPCPSLSPSGCGFMTEVLSGHDDACREMLRMDKHVFHKLCDIL

Query:  RHRSLLRDTAGVMIEEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLSREFLQPTPNITPVEIFESTSLI----NCL-----------
        R RS+LRDTAGV+IEEQLAIFLNI+GHNERNRVIQERFQHSGETISRHFNNVLKAIKSLSREFLQP    TP EI  S        +C+           
Subjt:  RHRSLLRDTAGVMIEEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLSREFLQPTPNITPVEIFESTSLI----NCL-----------

Query:  ------------------------------------WE----------------------FVSGKYYLVDMGYSNMEGFIAPFHDVRYNLHKFRGASHLP
                                            WE                         GKYYLVDMGYSNM GFI+PF  VRY+ +++RGA+ LP
Subjt:  ------------------------------------WE----------------------FVSGKYYLVDMGYSNMEGFIAPFHDVRYNLHKFRGASHLP

Query:  TNAKELFNHRHASLSNVIKKSFHVLKTRFPILKLAPQYAFHIQRDIVIATCVIHNFIRCAERKDWLFMSTEGMIIDEEEPNYVEFPEMHLTTLVQDHVAF
         NAKELFNHRH  L N I++SF+VLKTRFPILKLAPQYAFH+QRDIVIA CV+HNFIR  ER DW+F S  G  +DE      E P + L + +Q+ +AF
Subjt:  TNAKELFNHRHASLSNVIKKSFHVLKTRFPILKLAPQYAFHIQRDIVIATCVIHNFIRCAERKDWLFMSTEGMIIDEEEPNYVEFPEMHLTTLVQDHVAF

Query:  SLRDSIALAMWDDFMTKWDQW
        SLRDSIA AMWDDF+ KW++W
Subjt:  SLRDSIALAMWDDFMTKWDQW

PSR88066.1 Nuclease [Actinidia chinensis var. chinensis] | 4.5e-164 | 63.31
Query:  YVDEDASSGSQGDTHMVDGHNKRQSGLPSSSGHRKRSRKATGDAIVDAMLEIAAASKMRATAIMRNEDRFSISKCIKVLDEMQGVDQQAYFLALDLFENP
        +VDEDASSGS  D  M+DGHNKRQS  P SSG RKRSRKATGDAIVDAMLEIAAASKMRAT IM+NEDRFSISKCIKVLDEMQGVDQ+ YFLALDLFENP
Subjt:  YVDEDASSGSQGDTHMVDGHNKRQSGLPSSSGHRKRSRKATGDAIVDAMLEIAAASKMRATAIMRNEDRFSISKCIKVLDEMQGVDQQAYFLALDLFENP

Query:  NARETFISLKNLSTAMDDFDLELDEMELVAAAAGYYYYNSLTRQPCPSLSPSGCGFMTEVLSGHDDACREMLRMDKHVFHKLCDILRHRSLLRDTAGVMI
        NARETFI+LK+L  AMDD DLELDEMELVAAAAGYYYYNS+T+QP  S SPSG  F+ EVL+G DD CR+M RMDKHVFHKL D LR + +LRDT GVMI
Subjt:  NARETFISLKNLSTAMDDFDLELDEMELVAAAAGYYYYNSLTRQPCPSLSPSGCGFMTEVLSGHDDACREMLRMDKHVFHKLCDILRHRSLLRDTAGVMI

Query:  EEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLSREFLQPTPNITPVEIFESTSLI----NCL-------------------------
        EEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLSREFLQP P  TP EI  S        +C+                         
Subjt:  EEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLSREFLQPTPNITPVEIFESTSLI----NCL-------------------------

Query:  ----------------------WE----------------------FVSGKYYLVDMGYSNMEGFIAPFHDVRYNLHKFRGASHLPTNAKELFNHRHASL
                              WE                         GKYYLVD GYSN EGFIAPF  +RY++ ++RGA  LP NAKELFNHRH+SL
Subjt:  ----------------------WE----------------------FVSGKYYLVDMGYSNMEGFIAPFHDVRYNLHKFRGASHLPTNAKELFNHRHASL

Query:  SNVIKKSFHVLKTRFPILKLAPQYAFHIQRDIVIATCVIHNFIRCAERKDWLFMSTEGMIIDEEEPNYVEFPEMHLTTLVQDHVAFSLRDSIALAMWDDF
         N I+KSF VLKTRFPIL++APQYAFHIQRDIVIA CV+HN IR  ER DWLF   EG  + EE P++ + P+  L + +Q+   +SLR+SIA AMW+DF
Subjt:  SNVIKKSFHVLKTRFPILKLAPQYAFHIQRDIVIATCVIHNFIRCAERKDWLFMSTEGMIIDEEEPNYVEFPEMHLTTLVQDHVAFSLRDSIALAMWDDF

Query:  MTKWDQW
        ++KW +W
Subjt:  MTKWDQW

TXG60196.1 hypothetical protein EZV62_014769 [Acer yangbiense] | 6.5e-171 | 66.94
Query:  MNYVDEDASSGSQGDTHMVDGHNKRQSGLPSSSGHRKRSRKATGDAIVDAMLEIAAASKMRATAIMRNEDRFSISKCIKVLDEMQGVDQQAYFLALDLFE
        MNYVDED +SGS  DT+M+DGH+KRQ   PSSSG RKRSRK  GDAIVDAMLEIAAASKMRA AIM+NE+RFSIS+CIKVLDEMQGVDQ+ YFLALDLFE
Subjt:  MNYVDEDASSGSQGDTHMVDGHNKRQSGLPSSSGHRKRSRKATGDAIVDAMLEIAAASKMRATAIMRNEDRFSISKCIKVLDEMQGVDQQAYFLALDLFE

Query:  NPNARETFISLK---------------------------NLSTAMDDFDLELDEMELVAAAAGYYYYNSLTRQPCPSLSPSGCGFMTEVLSGHDDACREM
        NPNARE FISLK                           + S +MDDFD+ELDEMELVAAAAGYYYYNS+T++P   +S SG GFMT+VL GHDD CREM
Subjt:  NPNARETFISLK---------------------------NLSTAMDDFDLELDEMELVAAAAGYYYYNSLTRQPCPSLSPSGCGFMTEVLSGHDDACREM

Query:  LRMDKHVFHKLCDILRHRSLLRDTAGVMIEEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLSREFLQPTPNITPVEIFES-------
         RMDKH+FHKLCDILR RS+LRDT+GVMIEEQLAIFLNIVGHNERNRVIQERFQHSGETISR+FNNVLKAIKSLSREFLQP P  TP EI  S       
Subjt:  LRMDKHVFHKLCDILRHRSLLRDTAGVMIEEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLSREFLQPTPNITPVEIFES-------

Query:  -----TSLINCLWEF------VSGKYYLVDMGYSNMEGFIAPFHDVRYNLHKFRGASHLPTNAKELFNHRHASLSNVIKKSFHVLKTRFPILKLAPQYAF
             T  I   + +      +SGKYYLVDMGY+N +GFIAP+  VR +LH+F GA+ LP NAKELFNHRH+SL NVI++ F VLKTRFPILK+APQYAF
Subjt:  -----TSLINCLWEF------VSGKYYLVDMGYSNMEGFIAPFHDVRYNLHKFRGASHLPTNAKELFNHRHASLSNVIKKSFHVLKTRFPILKLAPQYAF

Query:  HIQRDIVIATCVIHNFIRCAERKDWLFMSTEGMIIDEEEPNYVEFPEMHLTTLVQDHVAFSLRDSIALAMWDDFMTKWDQW
        HIQRDIVIA CV+HNFIR  ER DWLF + EG ++ EE P+  +F +M L   +Q+ +A + RDSIA  MW+DF+ KWD+W
Subjt:  HIQRDIVIATCVIHNFIRCAERKDWLFMSTEGMIIDEEEPNYVEFPEMHLTTLVQDHVAFSLRDSIALAMWDDFMTKWDQW

TrEMBL top hits | e value | % identity
A0A1Q3CXR1 DDE_4 domain-containing protein | 2.7e-162 | 61.82
Query:  DEDASSGSQGDTHMVDGHNKRQSGLPSSSGHRKRSRKATGDAIVDAMLEIAAASKMRATAIMRNEDRFSISKCIKVLDEMQGVDQQAYFLALDLFENPNA
        DEDASSGS  D +M+DGHNKRQS  PS+SG  KRSRKATGDAIVDAMLEIAA SKMRA AIM+NEDRFSISKCIKVLDEMQGVDQ+ YFLALDL ENP+A
Subjt:  DEDASSGSQGDTHMVDGHNKRQSGLPSSSGHRKRSRKATGDAIVDAMLEIAAASKMRATAIMRNEDRFSISKCIKVLDEMQGVDQQAYFLALDLFENPNA

Query:  RETFISL-----------KNLSTAMDDFDLELDEMELVAAAAGYYYYNSLTRQPCPSLSPSGCGFMTEVLSGHDDACREMLRMDKHVFHKLCDILRHRSL
        RE FISL           K +S +MDDF+LELDEMELVAAAAGYYYYN++T+QP  SLSPSGCGFMTE+ +G+DD CREM RMDK VF+KLC  LR R L
Subjt:  RETFISL-----------KNLSTAMDDFDLELDEMELVAAAAGYYYYNSLTRQPCPSLSPSGCGFMTEVLSGHDDACREMLRMDKHVFHKLCDILRHRSL

Query:  LRDTAGVMIEEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLSREFLQPTPNITPVEIFESTSLI----NCL----------------
        LRDTAGVMIEEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLSREFLQP P  TP EI  S        +C+                
Subjt:  LRDTAGVMIEEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLSREFLQPTPNITPVEIFESTSLI----NCL----------------

Query:  -------------------------------WE----------------------FVSGKYYLVDMGYSNMEGFIAPFHDVRYNLHKFRGASHLPTNAKE
                                       WE                         GKYYLVD  Y NMEGFIAP   VRY    ++GA+ LP NAKE
Subjt:  -------------------------------WE----------------------FVSGKYYLVDMGYSNMEGFIAPFHDVRYNLHKFRGASHLPTNAKE

Query:  LFNHRHASLSNVIKKSFHVLKTRFPILKLAPQYAFHIQRDIVIATCVIHNFIRCAERKDWLFMSTEGMIIDEEEPNYVEFPEMHLTTLVQDHVAFSLRDS
        LFNHRH+ L N I + F VLKTRFPILKLAPQYAFHIQRDIVIA+CV+HNFIR  ER DWLF   EGM + EE P+  + P+MHL + +Q+ ++ +LR+S
Subjt:  LFNHRHASLSNVIKKSFHVLKTRFPILKLAPQYAFHIQRDIVIATCVIHNFIRCAERKDWLFMSTEGMIIDEEEPNYVEFPEMHLTTLVQDHVAFSLRDS

Query:  IALAMWDDFMTKWDQW
        IA AMW+DF+ +WD+W
Subjt:  IALAMWDDFMTKWDQW

A0A2R6PAQ7 Nuclease | 2.2e-164 | 63.31
Query:  YVDEDASSGSQGDTHMVDGHNKRQSGLPSSSGHRKRSRKATGDAIVDAMLEIAAASKMRATAIMRNEDRFSISKCIKVLDEMQGVDQQAYFLALDLFENP
        +VDEDASSGS  D  M+DGHNKRQS  P SSG RKRSRKATGDAIVDAMLEIAAASKMRAT IM+NEDRFSISKCIKVLDEMQGVDQ+ YFLALDLFENP
Subjt:  YVDEDASSGSQGDTHMVDGHNKRQSGLPSSSGHRKRSRKATGDAIVDAMLEIAAASKMRATAIMRNEDRFSISKCIKVLDEMQGVDQQAYFLALDLFENP

Query:  NARETFISLKNLSTAMDDFDLELDEMELVAAAAGYYYYNSLTRQPCPSLSPSGCGFMTEVLSGHDDACREMLRMDKHVFHKLCDILRHRSLLRDTAGVMI
        NARETFI+LK+L  AMDD DLELDEMELVAAAAGYYYYNS+T+QP  S SPSG  F+ EVL+G DD CR+M RMDKHVFHKL D LR + +LRDT GVMI
Subjt:  NARETFISLKNLSTAMDDFDLELDEMELVAAAAGYYYYNSLTRQPCPSLSPSGCGFMTEVLSGHDDACREMLRMDKHVFHKLCDILRHRSLLRDTAGVMI

Query:  EEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLSREFLQPTPNITPVEIFESTSLI----NCL-------------------------
        EEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLSREFLQP P  TP EI  S        +C+                         
Subjt:  EEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLSREFLQPTPNITPVEIFESTSLI----NCL-------------------------

Query:  ----------------------WE----------------------FVSGKYYLVDMGYSNMEGFIAPFHDVRYNLHKFRGASHLPTNAKELFNHRHASL
                              WE                         GKYYLVD GYSN EGFIAPF  +RY++ ++RGA  LP NAKELFNHRH+SL
Subjt:  ----------------------WE----------------------FVSGKYYLVDMGYSNMEGFIAPFHDVRYNLHKFRGASHLPTNAKELFNHRHASL

Query:  SNVIKKSFHVLKTRFPILKLAPQYAFHIQRDIVIATCVIHNFIRCAERKDWLFMSTEGMIIDEEEPNYVEFPEMHLTTLVQDHVAFSLRDSIALAMWDDF
         N I+KSF VLKTRFPIL++APQYAFHIQRDIVIA CV+HN IR  ER DWLF   EG  + EE P++ + P+  L + +Q+   +SLR+SIA AMW+DF
Subjt:  SNVIKKSFHVLKTRFPILKLAPQYAFHIQRDIVIATCVIHNFIRCAERKDWLFMSTEGMIIDEEEPNYVEFPEMHLTTLVQDHVAFSLRDSIALAMWDDF

Query:  MTKWDQW
        ++KW +W
Subjt:  MTKWDQW

A0A5B6WEV3 DDE_4 domain-containing protein | 5.9e-162 | 56.73
Query:  MNYVDEDASSGSQGDTHMVDGHNKRQSGLPSSSGHRKRSRKATGDAIVDAMLEIAAASKMRATAIMRNEDRFSISKCIKVLDEMQGVDQQAYFLALDLFE
        MNYVD++AS+GS  D +M++G NKRQS  PSSSG RKRSRKATGDAIVDAMLEIAAASK+RA+AIM+NED+FSISKCIKVLDEMQGVDQ+ YFLALDLFE
Subjt:  MNYVDEDASSGSQGDTHMVDGHNKRQSGLPSSSGHRKRSRKATGDAIVDAMLEIAAASKMRATAIMRNEDRFSISKCIKVLDEMQGVDQQAYFLALDLFE

Query:  NPNARETFISLK---------------------------------------------------NLSTAMDDFDLELDEMELVAAAAGYYYYNSLTRQP-C
        NPNARE FISLK                                                   ++ST MDD DLELDEMELVAAAAGYYYYNS+TRQ  C
Subjt:  NPNARETFISLK---------------------------------------------------NLSTAMDDFDLELDEMELVAAAAGYYYYNSLTRQP-C

Query:  PSLSPSGCGFMTEVLSGHDDACREMLRMDKHVFHKLCDILRHRSLLRDTAGVMIEEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLS
         SL PSG GFM EVL G DD CREMLRMDKHVFHKLC  LRHR +LRDTAGVMIEEQLAIFL+IVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLS
Subjt:  PSLSPSGCGFMTEVLSGHDDACREMLRMDKHVFHKLCDILRHRSLLRDTAGVMIEEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLS

Query:  REFLQPTPNITPVEIFE----------------------STSLI--------NCL---------------------------------------------
        REFLQP    TP EI +                      S SL+        +C+                                             
Subjt:  REFLQPTPNITPVEIFE----------------------STSLI--------NCL---------------------------------------------

Query:  --WE----------------------FVSGKYYLVDMGYSNMEGFIAPFHDVRYNLHKFRGASHLPTNAKELFNHRHASLSNVIKKSFHVLKTRFPILKL
          WE                         GKYYLVD GYSNMEGF+AP+  VRY+LH++RGA+ LP NAKELFNHRH+SL NVI+++F VLKTRFPILKL
Subjt:  --WE----------------------FVSGKYYLVDMGYSNMEGFIAPFHDVRYNLHKFRGASHLPTNAKELFNHRHASLSNVIKKSFHVLKTRFPILKL

Query:  APQYAFHIQRDIVIATCVIHNFIRCAERKDWLFMSTEGMIIDEEEPNYVEFPEMHLTTLVQDHVAFSLRDSIALAMWDDFMTKWDQW
        APQYAFHIQRDIVIA CV+HN+IR  ER DWLF S EG  +D E P++ E PE+   +  Q+ +A SLR+SIA  MW+DF+ KWDQW
Subjt:  APQYAFHIQRDIVIATCVIHNFIRCAERKDWLFMSTEGMIIDEEEPNYVEFPEMHLTTLVQDHVAFSLRDSIALAMWDDFMTKWDQW

A0A5C7HV80 DDE Tnp4 domain-containing protein | 3.1e-171 | 66.94
Query:  MNYVDEDASSGSQGDTHMVDGHNKRQSGLPSSSGHRKRSRKATGDAIVDAMLEIAAASKMRATAIMRNEDRFSISKCIKVLDEMQGVDQQAYFLALDLFE
        MNYVDED +SGS  DT+M+DGH+KRQ   PSSSG RKRSRK  GDAIVDAMLEIAAASKMRA AIM+NE+RFSIS+CIKVLDEMQGVDQ+ YFLALDLFE
Subjt:  MNYVDEDASSGSQGDTHMVDGHNKRQSGLPSSSGHRKRSRKATGDAIVDAMLEIAAASKMRATAIMRNEDRFSISKCIKVLDEMQGVDQQAYFLALDLFE

Query:  NPNARETFISLK---------------------------NLSTAMDDFDLELDEMELVAAAAGYYYYNSLTRQPCPSLSPSGCGFMTEVLSGHDDACREM
        NPNARE FISLK                           + S +MDDFD+ELDEMELVAAAAGYYYYNS+T++P   +S SG GFMT+VL GHDD CREM
Subjt:  NPNARETFISLK---------------------------NLSTAMDDFDLELDEMELVAAAAGYYYYNSLTRQPCPSLSPSGCGFMTEVLSGHDDACREM

Query:  LRMDKHVFHKLCDILRHRSLLRDTAGVMIEEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLSREFLQPTPNITPVEIFES-------
         RMDKH+FHKLCDILR RS+LRDT+GVMIEEQLAIFLNIVGHNERNRVIQERFQHSGETISR+FNNVLKAIKSLSREFLQP P  TP EI  S       
Subjt:  LRMDKHVFHKLCDILRHRSLLRDTAGVMIEEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLSREFLQPTPNITPVEIFES-------

Query:  -----TSLINCLWEF------VSGKYYLVDMGYSNMEGFIAPFHDVRYNLHKFRGASHLPTNAKELFNHRHASLSNVIKKSFHVLKTRFPILKLAPQYAF
             T  I   + +      +SGKYYLVDMGY+N +GFIAP+  VR +LH+F GA+ LP NAKELFNHRH+SL NVI++ F VLKTRFPILK+APQYAF
Subjt:  -----TSLINCLWEF------VSGKYYLVDMGYSNMEGFIAPFHDVRYNLHKFRGASHLPTNAKELFNHRHASLSNVIKKSFHVLKTRFPILKLAPQYAF

Query:  HIQRDIVIATCVIHNFIRCAERKDWLFMSTEGMIIDEEEPNYVEFPEMHLTTLVQDHVAFSLRDSIALAMWDDFMTKWDQW
        HIQRDIVIA CV+HNFIR  ER DWLF + EG ++ EE P+  +F +M L   +Q+ +A + RDSIA  MW+DF+ KWD+W
Subjt:  HIQRDIVIATCVIHNFIRCAERKDWLFMSTEGMIIDEEEPNYVEFPEMHLTTLVQDHVAFSLRDSIALAMWDDFMTKWDQW

A0A6A6K5L4 DDE Tnp4 domain-containing protein | 2.9e-161 | 61.49
Query:  MNYVDEDASSGSQGDTHMVDGHNKRQSGLPSSSGHRKRSRKATGDAIVDAMLEIAAASKMRATAIMRNEDRFSISKCIKVLDEMQGVDQQAYFLALDLFE
        M ++DEDASSGS  D +M+DGHNKRQS  PSSSG  KRSRKATGDAIVDAMLEIAAASKMRA+AIM+NEDRFSISKCIKVLDEMQ               
Subjt:  MNYVDEDASSGSQGDTHMVDGHNKRQSGLPSSSGHRKRSRKATGDAIVDAMLEIAAASKMRATAIMRNEDRFSISKCIKVLDEMQGVDQQAYFLALDLFE

Query:  NPNARETFISLKNLSTAMDDFDLELDEMELVAAAAGYYYYNSLTRQPCPSLSPSGCGFMTEVLSGHDDACREMLRMDKHVFHKLCDILRHRSLLRDTAGV
                    + ST+MDDFDLELDEMELVAAAAGYYYYNS+ R PC S  PSG GFMTEVL GHDD CREM RMDKHVFHKLC+ LR R +LRDTAGV
Subjt:  NPNARETFISLKNLSTAMDDFDLELDEMELVAAAAGYYYYNSLTRQPCPSLSPSGCGFMTEVLSGHDDACREMLRMDKHVFHKLCDILRHRSLLRDTAGV

Query:  MIEEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLSREFLQPTPNITPVEIFESTSLI----NCL-----------------------
        MIEEQLAIFLNI+GHNERNRVIQERFQHSGETISRHFNNVLKAIKSLSREFLQP P  TP EIF S+       +C+                       
Subjt:  MIEEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLSREFLQPTPNITPVEIFESTSLI----NCL-----------------------

Query:  ------------------------WE----------------------FVSGKYYLVDMGYSNMEGFIAPFHDVRYNLHKFRGASHLPTNAKELFNHRHA
                                WE                         GKYYLVD GYSNMEGFIAP+  VRY+LH+FRGA+ LP NA+ELFNHRH+
Subjt:  ------------------------WE----------------------FVSGKYYLVDMGYSNMEGFIAPFHDVRYNLHKFRGASHLPTNAKELFNHRHA

Query:  SLSNVIKKSFHVLKTRFPILKLAPQYAFHIQRDIVIATCVIHNFIRCAERKDWLFMSTEGMIIDEEEPNYVEFPEMHLTTLVQDHVAFSLRDSIALAMWD
        SL NVI++SF VLK RFPILK+APQY FHIQRDIVIA CV+HN+IRC ER DWLF S +G+ + EE P++ + PEM L + +Q+ +AFSLR+SIA AMW+
Subjt:  SLSNVIKKSFHVLKTRFPILKLAPQYAFHIQRDIVIATCVIHNFIRCAERKDWLFMSTEGMIIDEEEPNYVEFPEMHLTTLVQDHVAFSLRDSIALAMWD

Query:  DFMTKWDQW
        DF+ KWDQW
Subjt:  DFMTKWDQW

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT1G43722.1 unknown protein | 5.2e-17 | 26.01
Query:  ELVAAAAGYYYYNSLTRQPCPSLSPSGCGFMTEVLSGHDDACREMLRMDKHVFHKLCDILRHRSLLRDTAGVMIEEQLAIFLNIVGHNERNRVIQERFQH
        ELV   A  YY     R P       G   +   L     AC ++LRM    F  LC++L+    L+ T  + IEE +A+FL I GHNE  R +  RF  
Subjt:  ELVAAAAGYYYYNSLTRQPCPSLSPSGCGFMTEVLSGHDDACREMLRMDKHVFHKLCDILRHRSLLRDTAGVMIEEQLAIFLNIVGHNERNRVIQERFQH

Query:  SGETISRHFNNVLKAIKSLSREFLQPTPNITPVEIFESTSLINCLWEFVSG-------------------------------------------------
        + ET+ R F  VL A + L+ ++++         I E   +    W + SG                                                 
Subjt:  SGETISRHFNNVLKAIKSLSREFLQPTPNITPVEIFESTSLINCLWEFVSG-------------------------------------------------

Query:  ---------------------------KYYLVDMGYSNMEGFIAPFHD-----VRYNLHKFRGASHLPTNAKELFNHRHASLSNVIKKSFHVLKTR
                                   KYYLVD GY N +G +AP+       VRY++ +F      P N  ELFN  H SL +VI+++F + K +
Subjt:  ---------------------------KYYLVDMGYSNMEGFIAPFHD-----VRYNLHKFRGASHLPTNAKELFNHRHASLSNVIKKSFHVLKTR

AT4G10890.1 unknown protein | 2.5e-11 | 29.63
Query:  ILRHRSLLRDTAGVMIEEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLSREFLQPTPNITPVEIFESTSLINCLWEFVSG-KYYLVD
        +L  R  L++   V +EE +A+FL  V  N   R I  R+Q S   + R  ++VL A+   + + L+ +      ++ +  +         S  KYYLV+
Subjt:  ILRHRSLLRDTAGVMIEEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLSREFLQPTPNITPVEIFESTSLINCLWEFVSG-KYYLVD

Query:  MGYSNMEGFIAPFHDVRYNLHKFRGASHLPTNAKELFNHRHASLSNVIKKSFHVLKTRFPIL
          Y    G++ P   + Y+L +F G    P   +ELFN +H  L +VI ++F V K ++ IL
Subjt:  MGYSNMEGFIAPFHDVRYNLHKFRGASHLPTNAKELFNHRHASLSNVIKKSFHVLKTRFPIL

AT5G28730.1 unknown protein | 5.9e-13 | 25.23
Query:  ACREMLRMDKHVFHKLCDILRHRSLLRDTAGVMIEEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLSREFLQPTPNITPVEIFESTS
        +C+ ++RM    F +LC+IL  +  L+ +  + ++E +AIFL I   N+  R I  RF H+ ETI R F++VLKA++ L+ E+++P       ++ E  +
Subjt:  ACREMLRMDKHVFHKLCDILRHRSLLRDTAGVMIEEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLSREFLQPTPNITPVEIFESTS

Query:  LINCL------WEFV----------------------------------------------------SGKYYLVDMGYSNMEGFIAPFHDVRYNLHKFRG
        + N L      W F+                                                      KYYLVD GY+N  G++AP+            
Subjt:  LINCL------WEFV----------------------------------------------------SGKYYLVDMGYSNMEGFIAPFHDVRYNLHKFRG

Query:  ASHLPTNAKELFNHRHASLSNV
         + L  N  E  N +     NV
Subjt:  ASHLPTNAKELFNHRHASLSNV

AT5G35695.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912) | 1.5e-19 | 37.34
Query:  KYYLVDMGYSNMEGFIAPFHDVRYNLHKFRGASHLPTNAKELFNHRHASLSNVIKKSFHVLKTRFPILKLAPQYAFHIQRDIVIATCVIHNFIRCAERKD
        K+YLVD G++N   F+APF  VRY+L +F G    P    ELFN RH SL NVI++ F + K+RF I K AP +++  Q  +V+    +HNF+R   R D
Subjt:  KYYLVDMGYSNMEGFIAPFHDVRYNLHKFRGASHLPTNAKELFNHRHASLSNVIKKSFHVLKTRFPILKLAPQYAFHIQRDIVIATCVIHNFIRCAERKD

Query:  WL----FMSTEGMIIDEE----EPNYVEFPEMHLTTLVQDHVAFSL-RDSIALAMWDD
               +  EG +++ E      N ++  E  L    QD    ++ R S+A  MW D
Subjt:  WL----FMSTEGMIIDEE----EPNYVEFPEMHLTTLVQDHVAFSL-RDSIALAMWDD

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912) | 1.4e-46 | 35.04
Query:  GCGFMTEVLSGHDDACREMLRMDKHVFHKLCDILRHRSLLRDTAGVMIEEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLSREFLQP
        G  F+ ++L+G ++ C E  RMDK VF+KLCD+L+ R LLR T  + IE QLAIFL I+GHN R R +QE F +SGETISRHFNNVL A+ ++S++F QP
Subjt:  GCGFMTEVLSGHDDACREMLRMDKHVFHKLCDILRHRSLLRDTAGVMIEEQLAIFLNIVGHNERNRVIQERFQHSGETISRHFNNVLKAIKSLSREFLQP

Query:  TPN--------------ITPVEIFESTSLI-----------------NCL---------------WE--------------------FVSGKYYLVDMGY
          N              +  V+ F    ++                 N L               WE                       GKYY+VD  Y
Subjt:  TPN--------------ITPVEIFESTSLI-----------------NCL---------------WE--------------------FVSGKYYLVDMGY

Query:  SNMEGFIAPFHDVRYNLHKFRGASHLPTNAKELFNHRHASLSNVIKKSFHVLKTRFPILKLAPQYAFHIQRDIVIATCVIHNFIRCAERKDWLF-MSTEG
         N+ GFIAP+H V  N  +          AKE+FN RH  L   I ++F  LK RFPIL  AP Y    Q  +VIA C +HN++R  +  D +F M  E 
Subjt:  SNMEGFIAPFHDVRYNLHKFRGASHLPTNAKELFNHRHASLSNVIKKSFHVLKTRFPILKLAPQYAFHIQRDIVIATCVIHNFIRCAERKDWLF-MSTEG

Query:  MIIDEEEPNYVEFPEMHLTTLVQDH--------VAFSLRDSIALAMWDDFM
         + +  E   V   E  +  + Q+H         +  LRD IA  +W+ ++
Subjt:  MIIDEEEPNYVEFPEMHLTTLVQDH--------VAFSLRDSIALAMWDDFM


Sequences
CDS sequence
ATGAATTATGTGGACGAGGATGCCTCTTCGGGTTCTCAAGGTGATACACACATGGTGGACGGACACAATAAGCGTCAATCTGGGCTTCCTTCCAGTTCTGGTCATCGGAA
GAGAAGTCGTAAAGCTACTGGTGATGCTATTGTTGATGCCATGCTTGAAATAGCAGCTGCTTCAAAAATGAGGGCAACTGCAATTATGAGGAACGAGGACCGATTTTCTA
TAAGCAAATGCATCAAAGTATTGGATGAAATGCAAGGTGTTGATCAGCAGGCTTACTTTCTTGCTTTGGATTTATTTGAGAACCCCAATGCCAGAGAGACTTTTATCTCT
CTTAAGAATTTATCAACTGCAATGGATGATTTTGACTTAGAGTTGGATGAGATGGAATTAGTTGCAGCAGCAGCTGGCTACTATTATTATAATAGCTTAACTAGACAGCC
TTGCCCTAGTTTATCACCTAGTGGATGTGGGTTTATGACTGAAGTGCTGAGTGGTCATGATGATGCATGCCGGGAAATGCTTAGGATGGATAAGCATGTTTTTCACAAGT
TATGTGATATTCTCCGACATAGAAGCTTACTACGGGATACAGCCGGTGTTATGATAGAGGAGCAGCTTGCAATTTTTTTGAACATTGTTGGTCATAATGAACGTAACAGA
GTAATACAAGAAAGGTTTCAGCACTCTGGTGAAACTATTAGTCGGCATTTCAACAATGTGCTGAAAGCAATCAAGTCTTTATCACGTGAATTTCTACAACCTACTCCCAA
CATCACTCCTGTGGAAATATTTGAAAGTACGTCTCTTATCAATTGTTTATGGGAATTTGTCTCAGGCAAGTATTATTTAGTCGACATGGGATATTCAAACATGGAAGGGT
TTATTGCTCCATTTCATGATGTCAGGTACAATCTTCACAAATTTAGAGGGGCTAGTCATTTACCTACAAACGCAAAGGAGTTATTTAATCATAGGCATGCATCTTTGAGC
AATGTCATTAAGAAGTCATTTCACGTGCTAAAAACTCGTTTTCCTATTCTGAAACTTGCTCCTCAATATGCATTTCATATCCAAAGAGACATAGTCATTGCAACATGTGT
TATTCACAATTTCATCCGTTGTGCGGAAAGGAAAGATTGGTTGTTTATGAGTACTGAAGGAATGATTATAGATGAAGAAGAGCCCAATTATGTCGAATTTCCCGAGATGC
ATTTGACAACCTTGGTGCAAGATCACGTTGCTTTCTCGTTGCGGGATTCAATTGCTTTAGCTATGTGGGATGACTTTATGACTAAATGGGATCAGTGGTGA
mRNA sequence
ATGAATTATGTGGACGAGGATGCCTCTTCGGGTTCTCAAGGTGATACACACATGGTGGACGGACACAATAAGCGTCAATCTGGGCTTCCTTCCAGTTCTGGTCATCGGAA
GAGAAGTCGTAAAGCTACTGGTGATGCTATTGTTGATGCCATGCTTGAAATAGCAGCTGCTTCAAAAATGAGGGCAACTGCAATTATGAGGAACGAGGACCGATTTTCTA
TAAGCAAATGCATCAAAGTATTGGATGAAATGCAAGGTGTTGATCAGCAGGCTTACTTTCTTGCTTTGGATTTATTTGAGAACCCCAATGCCAGAGAGACTTTTATCTCT
CTTAAGAATTTATCAACTGCAATGGATGATTTTGACTTAGAGTTGGATGAGATGGAATTAGTTGCAGCAGCAGCTGGCTACTATTATTATAATAGCTTAACTAGACAGCC
TTGCCCTAGTTTATCACCTAGTGGATGTGGGTTTATGACTGAAGTGCTGAGTGGTCATGATGATGCATGCCGGGAAATGCTTAGGATGGATAAGCATGTTTTTCACAAGT
TATGTGATATTCTCCGACATAGAAGCTTACTACGGGATACAGCCGGTGTTATGATAGAGGAGCAGCTTGCAATTTTTTTGAACATTGTTGGTCATAATGAACGTAACAGA
GTAATACAAGAAAGGTTTCAGCACTCTGGTGAAACTATTAGTCGGCATTTCAACAATGTGCTGAAAGCAATCAAGTCTTTATCACGTGAATTTCTACAACCTACTCCCAA
CATCACTCCTGTGGAAATATTTGAAAGTACGTCTCTTATCAATTGTTTATGGGAATTTGTCTCAGGCAAGTATTATTTAGTCGACATGGGATATTCAAACATGGAAGGGT
TTATTGCTCCATTTCATGATGTCAGGTACAATCTTCACAAATTTAGAGGGGCTAGTCATTTACCTACAAACGCAAAGGAGTTATTTAATCATAGGCATGCATCTTTGAGC
AATGTCATTAAGAAGTCATTTCACGTGCTAAAAACTCGTTTTCCTATTCTGAAACTTGCTCCTCAATATGCATTTCATATCCAAAGAGACATAGTCATTGCAACATGTGT
TATTCACAATTTCATCCGTTGTGCGGAAAGGAAAGATTGGTTGTTTATGAGTACTGAAGGAATGATTATAGATGAAGAAGAGCCCAATTATGTCGAATTTCCCGAGATGC
ATTTGACAACCTTGGTGCAAGATCACGTTGCTTTCTCGTTGCGGGATTCAATTGCTTTAGCTATGTGGGATGACTTTATGACTAAATGGGATCAGTGGTGA
Protein sequence
MNYVDEDASSGSQGDTHMVDGHNKRQSGLPSSSGHRKRSRKATGDAIVDAMLEIAAASKMRATAIMRNEDRFSISKCIKVLDEMQGVDQQAYFLALDLFENPNARETFIS
LKNLSTAMDDFDLELDEMELVAAAAGYYYYNSLTRQPCPSLSPSGCGFMTEVLSGHDDACREMLRMDKHVFHKLCDILRHRSLLRDTAGVMIEEQLAIFLNIVGHNERNR
VIQERFQHSGETISRHFNNVLKAIKSLSREFLQPTPNITPVEIFESTSLINCLWEFVSGKYYLVDMGYSNMEGFIAPFHDVRYNLHKFRGASHLPTNAKELFNHRHASLS
NVIKKSFHVLKTRFPILKLAPQYAFHIQRDIVIATCVIHNFIRCAERKDWLFMSTEGMIIDEEEPNYVEFPEMHLTTLVQDHVAFSLRDSIALAMWDDFMTKWDQW