; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G31780 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G31780
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionBED-type domain-containing protein
Genome locationChr1:26451921..26453790
RNA-Seq ExpressionCSPI01G31780
SyntenyCSPI01G31780
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR007021 - Domain of unknown function DUF659
IPR008906 - HAT, C-terminal dimerisation domain
IPR012337 - Ribonuclease H-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0041802.1 hypothetical protein E6C27_scaffold67G001750 [Cucumis melo var. makuwa]3.4e-16068.47Show/hide
Query:  MARRLLEAKRPQLIWSPCAAHCLDLMLEDIYQICNIRRTLKRGMEISNIIYVRPGLLNMMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTL
        MA RLLEAKRPQLIWSPCAAHCLDLMLEDIY+I NIR+ LKRGMEISN IYVRPGLLNMMRRFTNQK+LVR AKTRFATACITLSSI  QKNNLRKMFT 
Subjt:  MARRLLEAKRPQLIWSPCAAHCLDLMLEDIYQICNIRRTLKRGMEISNIIYVRPGLLNMMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTL

Query:  DEWKDSK----------------CKFLDWLM------------------ARRSHLGYMYEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLH
        DEWKDSK                  F   ++                   ++  +GY+YE  DR + AIAKSFNNNEEKYKDIFTIID+RWELQLHRPLH
Subjt:  DEWKDSK----------------CKFLDWLM------------------ARRSHLGYMYEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLH

Query:  AAGYYLNPSFYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALFGQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSA
        AAGYYLNPSFYYSNP+ QEDDEIVNGLYSCITKMVASL++QDKIL ELSKYKRA+ALFGQPLAI QRDKISPVEWWDNFGQSTPNLQKFA+RILGLT SA
Subjt:  AAGYYLNPSFYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALFGQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSA

Query:  SGCLRNWSVIEQ--------------------------------------------------LIGRLDDDSEEEDELVFDDDSLTWGDVSRAVGAKEPSF
        SGC RNWSV EQ                                                  LIGRLDDDSEEEDELVFDDD LTWGDVSRA GAKEP+F
Subjt:  SGCLRNWSVIEQ--------------------------------------------------LIGRLDDDSEEEDELVFDDDSLTWGDVSRAVGAKEPSF

Query:  YSKASTSRAKTNVSCSSLSTMQ--RKQVNLDDFIL-EEDTGGYKSNERVNEDEDQFSDDEFDL
        YS+A  S A TNVSCSS ST Q   KQ+NLDD    EEDT GYKSNE VNEDEDQFSDDEFDL
Subjt:  YSKASTSRAKTNVSCSSLSTMQ--RKQVNLDDFIL-EEDTGGYKSNERVNEDEDQFSDDEFDL

KAA0050353.1 hypothetical protein E6C27_scaffold88G00840 [Cucumis melo var. makuwa]3.4e-16068.47Show/hide
Query:  MARRLLEAKRPQLIWSPCAAHCLDLMLEDIYQICNIRRTLKRGMEISNIIYVRPGLLNMMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTL
        MA RLLEAKRPQLIWSPCAAHCLDLMLEDIY+I NIR+ LKRGMEISN IYVRPGLLNMMRRFTNQK+LVR AKTRFATACITLSSI  QKNNLRKMFT 
Subjt:  MARRLLEAKRPQLIWSPCAAHCLDLMLEDIYQICNIRRTLKRGMEISNIIYVRPGLLNMMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTL

Query:  DEWKDSK----------------CKFLDWLM------------------ARRSHLGYMYEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLH
        DEWKDSK                  F   ++                   ++  +GY+YE  DR + AIAKSFNNNEEKYKDIFTIID+RWELQLHRPLH
Subjt:  DEWKDSK----------------CKFLDWLM------------------ARRSHLGYMYEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLH

Query:  AAGYYLNPSFYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALFGQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSA
        AAGYYLNPSFYYSNP+ QEDDEIVNGLYSCITKMVASL++QDKIL ELSKYKRA+ALFGQPLAI QRDKISPVEWWDNFGQSTPNLQKFA+RILGLT SA
Subjt:  AAGYYLNPSFYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALFGQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSA

Query:  SGCLRNWSVIEQ--------------------------------------------------LIGRLDDDSEEEDELVFDDDSLTWGDVSRAVGAKEPSF
        SGC RNWSV EQ                                                  LIGRLDDDSEEEDELVFDDD LTWGDVSRA GAKEP+F
Subjt:  SGCLRNWSVIEQ--------------------------------------------------LIGRLDDDSEEEDELVFDDDSLTWGDVSRAVGAKEPSF

Query:  YSKASTSRAKTNVSCSSLSTMQ--RKQVNLDDFIL-EEDTGGYKSNERVNEDEDQFSDDEFDL
        YS+A  S A TNVSCSS ST Q   KQ+NLDD    EEDT GYKSNE VNEDEDQFSDDEFDL
Subjt:  YSKASTSRAKTNVSCSSLSTMQ--RKQVNLDDFIL-EEDTGGYKSNERVNEDEDQFSDDEFDL

KAA0062061.1 hypothetical protein E6C27_scaffold89G004030 [Cucumis melo var. makuwa]1.3e-15968.47Show/hide
Query:  MARRLLEAKRPQLIWSPCAAHCLDLMLEDIYQICNIRRTLKRGMEISNIIYVRPGLLNMMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTL
        MA RLLEAKRPQLIWSPCAAHCLDLMLEDIY+I NIR+ LKRGMEISN IYVRPGLLNMMRRFT QK+LVR AKTRFATACITLSSI  QKNNLRKMFT 
Subjt:  MARRLLEAKRPQLIWSPCAAHCLDLMLEDIYQICNIRRTLKRGMEISNIIYVRPGLLNMMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTL

Query:  DEWKDSK----------------CKFLDWLM------------------ARRSHLGYMYEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLH
        DEWKDSK                  F   ++                   ++  +GY+YE  DR + AIAKSFNNNEEKYKDIFTIID+RWELQLHRPLH
Subjt:  DEWKDSK----------------CKFLDWLM------------------ARRSHLGYMYEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLH

Query:  AAGYYLNPSFYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALFGQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSA
        AAGYYLNPSFYYSNP+ QEDDEIVNGLYSCITKMVASL+VQDKIL ELSKYKRA+ALFGQPLAI QRDKISPVEWWDNFGQSTPNLQKFA+RILGLT SA
Subjt:  AAGYYLNPSFYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALFGQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSA

Query:  SGCLRNWSVIEQ--------------------------------------------------LIGRLDDDSEEEDELVFDDDSLTWGDVSRAVGAKEPSF
        SGC RNWSV EQ                                                  LIGRLDDDSEEEDELVFDDD LTWGDVSRA GAKEP+F
Subjt:  SGCLRNWSVIEQ--------------------------------------------------LIGRLDDDSEEEDELVFDDDSLTWGDVSRAVGAKEPSF

Query:  YSKASTSRAKTNVSCSSLSTMQ--RKQVNLDDFIL-EEDTGGYKSNERVNEDEDQFSDDEFDL
        YS+A  S A TNVSCSS ST Q   KQ+NLDD    EEDT GYKSNE VNEDEDQFSDDEFDL
Subjt:  YSKASTSRAKTNVSCSSLSTMQ--RKQVNLDDFIL-EEDTGGYKSNERVNEDEDQFSDDEFDL

XP_031737060.1 uncharacterized protein LOC101204843 [Cucumis sativus]9.3e-16670.28Show/hide
Query:  MARRLLEAKRPQLIWSPCAAHCLDLMLEDIYQICNIRRTLKRGMEISNIIYVRPGLLNMMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTL
        MA RLLEAKRPQLIWSPCAAHCLDLMLEDIY+I NIR+ LKRG+EISN IYVRPGLLNMMRRFTNQK+LVR AKTRFATACITLSSI  QKNNLRKMFT 
Subjt:  MARRLLEAKRPQLIWSPCAAHCLDLMLEDIYQICNIRRTLKRGMEISNIIYVRPGLLNMMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTL

Query:  DEWKDSK----------------CKFLDWLM------------------ARRSHLGYMYEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLH
        DEWK+SK                  F   ++                   ++  +GY+YE  DR + AIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLH
Subjt:  DEWKDSK----------------CKFLDWLM------------------ARRSHLGYMYEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLH

Query:  AAGYYLNPSFYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALFGQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSA
        AAGYYLN SFYYSNPN QEDDEIVNGLYSCITKMVASLEVQDKIL ELSKYKRA+ALFGQPLAI QRDKISPVEWWDNFGQSTPNLQKFAVRILGLT SA
Subjt:  AAGYYLNPSFYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALFGQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSA

Query:  SGCLRNWSVIEQ--------------------------------------------------LIGRLDDDSEEEDELVFDDDSLTWGDVSRAVGAKEPSF
        SGC RNWSV EQ                                                  LIGRLDDDSEE+DELVF+DDSLTWGDVSRAVGAKEPSF
Subjt:  SGCLRNWSVIEQ--------------------------------------------------LIGRLDDDSEEEDELVFDDDSLTWGDVSRAVGAKEPSF

Query:  YSKASTSRAKTNVSCSSLSTMQRKQVNLDDFIL-EEDTGGYKSNERVNEDEDQFSDDEFDL
        YS+ASTSR KT VSCSS ST QRKQVNLDDF L EEDT GYKSNE +NEDEDQF+DDEFDL
Subjt:  YSKASTSRAKTNVSCSSLSTMQRKQVNLDDFIL-EEDTGGYKSNERVNEDEDQFSDDEFDL

XP_031741477.1 uncharacterized protein LOC105435633 [Cucumis sativus]1.0e-16470.07Show/hide
Query:  MARRLLEAKRPQLIWSPCAAHCLDLMLEDIYQICNIRRTLKRGMEISNIIYVRPGLLNMMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTL
        MA RLLEAKRPQLIWSPCAAHCLDLMLEDIY+I NIR+ LKRG+EISN IYV PGLLNMMRRFTNQK+LVR AKTRFATACITLSSI  QKNNLRKMFT 
Subjt:  MARRLLEAKRPQLIWSPCAAHCLDLMLEDIYQICNIRRTLKRGMEISNIIYVRPGLLNMMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTL

Query:  DEWKDSK----------------CKFLDWLM------------------ARRSHLGYMYEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLH
        DEWK+SK                  F   ++                   ++  +GY+YE  DR + AIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLH
Subjt:  DEWKDSK----------------CKFLDWLM------------------ARRSHLGYMYEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLH

Query:  AAGYYLNPSFYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALFGQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSA
        AAGYYLN SFYYSNPN QEDDEIVNGLYSCITKMVASLEVQDKIL ELSKYKRA+ALFGQPLAI QRDKISPVEWWDNFGQSTPNLQKF VRILGLT SA
Subjt:  AAGYYLNPSFYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALFGQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSA

Query:  SGCLRNWSVIEQ--------------------------------------------------LIGRLDDDSEEEDELVFDDDSLTWGDVSRAVGAKEPSF
        SGC RNWSV EQ                                                  LIGRLDDDSEE+DELVF+DDSLTWGDVSRAVGAKEPSF
Subjt:  SGCLRNWSVIEQ--------------------------------------------------LIGRLDDDSEEEDELVFDDDSLTWGDVSRAVGAKEPSF

Query:  YSKASTSRAKTNVSCSSLSTMQRKQVNLDDFIL-EEDTGGYKSNERVNEDEDQFSDDEFDL
        YS+ASTSR KT VSCSS ST QRKQVNLDDF L EEDT GYKSNE +NEDEDQFSDDEFDL
Subjt:  YSKASTSRAKTNVSCSSLSTMQRKQVNLDDFIL-EEDTGGYKSNERVNEDEDQFSDDEFDL

TrEMBL top hitse value%identityAlignment
A0A5A7TY62 BED-type domain-containing protein1.7e-16068.47Show/hide
Query:  MARRLLEAKRPQLIWSPCAAHCLDLMLEDIYQICNIRRTLKRGMEISNIIYVRPGLLNMMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTL
        MA RLLEAKRPQLIWSPCAAHCLDLMLEDIY+I NIR+ LKRGMEISN IYVRPGLLNMMRRFTNQK+LVR AKTRFATACITLSSI  QKNNLRKMFT 
Subjt:  MARRLLEAKRPQLIWSPCAAHCLDLMLEDIYQICNIRRTLKRGMEISNIIYVRPGLLNMMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTL

Query:  DEWKDSK----------------CKFLDWLM------------------ARRSHLGYMYEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLH
        DEWKDSK                  F   ++                   ++  +GY+YE  DR + AIAKSFNNNEEKYKDIFTIID+RWELQLHRPLH
Subjt:  DEWKDSK----------------CKFLDWLM------------------ARRSHLGYMYEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLH

Query:  AAGYYLNPSFYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALFGQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSA
        AAGYYLNPSFYYSNP+ QEDDEIVNGLYSCITKMVASL++QDKIL ELSKYKRA+ALFGQPLAI QRDKISPVEWWDNFGQSTPNLQKFA+RILGLT SA
Subjt:  AAGYYLNPSFYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALFGQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSA

Query:  SGCLRNWSVIEQ--------------------------------------------------LIGRLDDDSEEEDELVFDDDSLTWGDVSRAVGAKEPSF
        SGC RNWSV EQ                                                  LIGRLDDDSEEEDELVFDDD LTWGDVSRA GAKEP+F
Subjt:  SGCLRNWSVIEQ--------------------------------------------------LIGRLDDDSEEEDELVFDDDSLTWGDVSRAVGAKEPSF

Query:  YSKASTSRAKTNVSCSSLSTMQ--RKQVNLDDFIL-EEDTGGYKSNERVNEDEDQFSDDEFDL
        YS+A  S A TNVSCSS ST Q   KQ+NLDD    EEDT GYKSNE VNEDEDQFSDDEFDL
Subjt:  YSKASTSRAKTNVSCSSLSTMQ--RKQVNLDDFIL-EEDTGGYKSNERVNEDEDQFSDDEFDL

A0A5A7U370 Uncharacterized protein1.7e-16068.47Show/hide
Query:  MARRLLEAKRPQLIWSPCAAHCLDLMLEDIYQICNIRRTLKRGMEISNIIYVRPGLLNMMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTL
        MA RLLEAKRPQLIWSPCAAHCLDLMLEDIY+I NIR+ LKRGMEISN IYVRPGLLNMMRRFTNQK+LVR AKTRFATACITLSSI  QKNNLRKMFT 
Subjt:  MARRLLEAKRPQLIWSPCAAHCLDLMLEDIYQICNIRRTLKRGMEISNIIYVRPGLLNMMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTL

Query:  DEWKDSK----------------CKFLDWLM------------------ARRSHLGYMYEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLH
        DEWKDSK                  F   ++                   ++  +GY+YE  DR + AIAKSFNNNEEKYKDIFTIID+RWELQLHRPLH
Subjt:  DEWKDSK----------------CKFLDWLM------------------ARRSHLGYMYEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLH

Query:  AAGYYLNPSFYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALFGQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSA
        AAGYYLNPSFYYSNP+ QEDDEIVNGLYSCITKMVASL++QDKIL ELSKYKRA+ALFGQPLAI QRDKISPVEWWDNFGQSTPNLQKFA+RILGLT SA
Subjt:  AAGYYLNPSFYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALFGQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSA

Query:  SGCLRNWSVIEQ--------------------------------------------------LIGRLDDDSEEEDELVFDDDSLTWGDVSRAVGAKEPSF
        SGC RNWSV EQ                                                  LIGRLDDDSEEEDELVFDDD LTWGDVSRA GAKEP+F
Subjt:  SGCLRNWSVIEQ--------------------------------------------------LIGRLDDDSEEEDELVFDDDSLTWGDVSRAVGAKEPSF

Query:  YSKASTSRAKTNVSCSSLSTMQ--RKQVNLDDFIL-EEDTGGYKSNERVNEDEDQFSDDEFDL
        YS+A  S A TNVSCSS ST Q   KQ+NLDD    EEDT GYKSNE VNEDEDQFSDDEFDL
Subjt:  YSKASTSRAKTNVSCSSLSTMQ--RKQVNLDDFIL-EEDTGGYKSNERVNEDEDQFSDDEFDL

A0A5A7V8P5 BED-type domain-containing protein6.3e-16068.47Show/hide
Query:  MARRLLEAKRPQLIWSPCAAHCLDLMLEDIYQICNIRRTLKRGMEISNIIYVRPGLLNMMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTL
        MA RLLEAKRPQLIWSPCAAHCLDLMLEDIY+I NIR+ LKRGMEISN IYVRPGLLNMMRRFT QK+LVR AKTRFATACITLSSI  QKNNLRKMFT 
Subjt:  MARRLLEAKRPQLIWSPCAAHCLDLMLEDIYQICNIRRTLKRGMEISNIIYVRPGLLNMMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTL

Query:  DEWKDSK----------------CKFLDWLM------------------ARRSHLGYMYEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLH
        DEWKDSK                  F   ++                   ++  +GY+YE  DR + AIAKSFNNNEEKYKDIFTIID+RWELQLHRPLH
Subjt:  DEWKDSK----------------CKFLDWLM------------------ARRSHLGYMYEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLH

Query:  AAGYYLNPSFYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALFGQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSA
        AAGYYLNPSFYYSNP+ QEDDEIVNGLYSCITKMVASL+VQDKIL ELSKYKRA+ALFGQPLAI QRDKISPVEWWDNFGQSTPNLQKFA+RILGLT SA
Subjt:  AAGYYLNPSFYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALFGQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSA

Query:  SGCLRNWSVIEQ--------------------------------------------------LIGRLDDDSEEEDELVFDDDSLTWGDVSRAVGAKEPSF
        SGC RNWSV EQ                                                  LIGRLDDDSEEEDELVFDDD LTWGDVSRA GAKEP+F
Subjt:  SGCLRNWSVIEQ--------------------------------------------------LIGRLDDDSEEEDELVFDDDSLTWGDVSRAVGAKEPSF

Query:  YSKASTSRAKTNVSCSSLSTMQ--RKQVNLDDFIL-EEDTGGYKSNERVNEDEDQFSDDEFDL
        YS+A  S A TNVSCSS ST Q   KQ+NLDD    EEDT GYKSNE VNEDEDQFSDDEFDL
Subjt:  YSKASTSRAKTNVSCSSLSTMQ--RKQVNLDDFIL-EEDTGGYKSNERVNEDEDQFSDDEFDL

A0A5A7VJR4 BED-type domain-containing protein8.2e-16068.25Show/hide
Query:  MARRLLEAKRPQLIWSPCAAHCLDLMLEDIYQICNIRRTLKRGMEISNIIYVRPGLLNMMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTL
        MA RLLEAKRPQLIWSPCAAHCLDLMLEDIY+I NIR+ LKRGMEISN IYVRPGLLNMMRRFTNQK+LVR AKTRFATACITLSSI  QKNNLRKMFT 
Subjt:  MARRLLEAKRPQLIWSPCAAHCLDLMLEDIYQICNIRRTLKRGMEISNIIYVRPGLLNMMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTL

Query:  DEWKDSK----------------CKFLDWLM------------------ARRSHLGYMYEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLH
        DEWKDSK                  F   ++                   ++  +GY+YE  DR + AIAKSFNNNEEKYKDIFTIID+RWELQLHRPLH
Subjt:  DEWKDSK----------------CKFLDWLM------------------ARRSHLGYMYEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLH

Query:  AAGYYLNPSFYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALFGQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSA
        AAGYYLNPSFYYSNP+ QEDDEIVNGLYSCITKMVASL++QDKIL ELSKYKRA+ALFGQPLAI QRDKISPVEWWDNFGQSTPNLQKFA+RILGLT SA
Subjt:  AAGYYLNPSFYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALFGQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSA

Query:  SGCLRNWSVIEQ--------------------------------------------------LIGRLDDDSEEEDELVFDDDSLTWGDVSRAVGAKEPSF
        SGC RNWSV EQ                                                  LIGRLDDDSEEEDELVFDDD LTWGDVSRA GAKEP+F
Subjt:  SGCLRNWSVIEQ--------------------------------------------------LIGRLDDDSEEEDELVFDDDSLTWGDVSRAVGAKEPSF

Query:  YSKASTSRAKTNVSCSSLSTMQ--RKQVNLDDFIL-EEDTGGYKSNERVNEDEDQFSDDEFDL
        YS+A  S A TNVSCS  ST Q   KQ+NLDD    EEDT GYKSNE VNEDEDQFSDDEFDL
Subjt:  YSKASTSRAKTNVSCSSLSTMQ--RKQVNLDDFIL-EEDTGGYKSNERVNEDEDQFSDDEFDL

A0A5D3C113 Uncharacterized protein6.1e-15572.04Show/hide
Query:  MARRLLEAKRPQLIWSPCAAHCLDLMLEDIYQICNIRRTLKRGMEISNIIYVRPGLLNMMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTL
        MA  LLE+K PQLIWSPCAAHCLDLMLEDIY+I NIR+ LKRG+EISN IYVRPGLLNMMRRFTNQK+LVR AKTRFATACITLSSI  QKNNLRKMFT 
Subjt:  MARRLLEAKRPQLIWSPCAAHCLDLMLEDIYQICNIRRTLKRGMEISNIIYVRPGLLNMMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTL

Query:  DEWKDSK----------------CKFLDWLM------------------ARRSHLGYMYEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLH
        +EWKDSK                  F   ++                   ++  +GY+YE  DR + AIAKSFNNNEEKYKDIFTIID+RWELQLHRPLH
Subjt:  DEWKDSK----------------CKFLDWLM------------------ARRSHLGYMYEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLH

Query:  AAGYYLNPSFYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALFGQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSA
        AAGYYLNPSFYYSNP+ QEDDEIVNGLYSCITKMVASL++QDKIL ELSKYKRA+ALFGQPLAI QRDKISPVEWWDNFGQSTPNLQKFA+RILGLT SA
Subjt:  AAGYYLNPSFYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALFGQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSA

Query:  SGCLRNWSVI---------EQLIGRLDDDSEEEDELVFDDDSLTWGDVSRAVGAKEPSFYSKASTSRAKTNVSCSSLSTMQ--RKQVNLDDFILE-EDTG
        SG       I         E LIGRLDDDSEEEDELVFDDD+LTWGDVS A GAKEP+FYS+A  S A TNVSCSS ST +   KQ+NLDD   E EDT 
Subjt:  SGCLRNWSVI---------EQLIGRLDDDSEEEDELVFDDDSLTWGDVSRAVGAKEPSFYSKASTSRAKTNVSCSSLSTMQ--RKQVNLDDFILE-EDTG

Query:  GYKSNERVNEDEDQFSDDEFDL
        GYKSNE VNEDEDQFSDDEFDL
Subjt:  GYKSNERVNEDEDQFSDDEFDL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G22220.1 hAT transposon superfamily3.6e-4335.29Show/hide
Query:  PQLIWSPCAAHCLDLMLEDIYQICNIRRTLKRGMEISNIIYVRPGLLNMMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTLDEWKDSKCKF
        P L W PCAAHC+D MLE+  ++  IR  +++   ++ IIY   G+LN+MR+FT   D+V+   T  AT   T+  I   K  L+ M T  EW D  C +
Subjt:  PQLIWSPCAAHCLDLMLEDIYQICNIRRTLKRGMEISNIIYVRPGLLNMMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTLDEWKDSKCKF

Query:  L---------------DWLMA---------------------RRSHLGYMYEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLHAAGYYLNP
                        D+  A                     R+  +GY+Y    R + AI  +  + EE Y   + IID+ W   L +PL+AAG+YLNP
Subjt:  L---------------DWLMA---------------------RRSHLGYMYEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLHAAGYYLNP

Query:  SFYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALFGQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSAS-GCLRNW
         F+YS  + +   EI   +  CI K+V  + +QD ++ +++ YK A  +FG+ LAI  RD + P EWW  +G+S  NL +FA+RIL  T S+S G +RN 
Subjt:  SFYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALFGQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSAS-GCLRNW

Query:  SVIEQL
        + I Q+
Subjt:  SVIEQL

AT3G22220.2 hAT transposon superfamily3.6e-4335.29Show/hide
Query:  PQLIWSPCAAHCLDLMLEDIYQICNIRRTLKRGMEISNIIYVRPGLLNMMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTLDEWKDSKCKF
        P L W PCAAHC+D MLE+  ++  IR  +++   ++ IIY   G+LN+MR+FT   D+V+   T  AT   T+  I   K  L+ M T  EW D  C +
Subjt:  PQLIWSPCAAHCLDLMLEDIYQICNIRRTLKRGMEISNIIYVRPGLLNMMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTLDEWKDSKCKF

Query:  L---------------DWLMA---------------------RRSHLGYMYEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLHAAGYYLNP
                        D+  A                     R+  +GY+Y    R + AI  +  + EE Y   + IID+ W   L +PL+AAG+YLNP
Subjt:  L---------------DWLMA---------------------RRSHLGYMYEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLHAAGYYLNP

Query:  SFYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALFGQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSAS-GCLRNW
         F+YS  + +   EI   +  CI K+V  + +QD ++ +++ YK A  +FG+ LAI  RD + P EWW  +G+S  NL +FA+RIL  T S+S G +RN 
Subjt:  SFYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALFGQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSAS-GCLRNW

Query:  SVIEQL
        + I Q+
Subjt:  SVIEQL

AT4G15020.1 hAT transposon superfamily5.1e-4535.41Show/hide
Query:  PQLIWSPCAAHCLDLMLEDIYQICNIRRTLKRGMEISNIIYVRPGLLNMMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTLDEWKDSKCKF
        P L W PCAAHC+D MLE+  ++  I  T+++   I+  +Y   G+LN+M +FT+  D++  A +  AT   TL  I   K+NL+ M T  EW  ++C +
Subjt:  PQLIWSPCAAHCLDLMLEDIYQICNIRRTLKRGMEISNIIYVRPGLLNMMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTLDEWKDSKCKF

Query:  LD---------------W--------------------LMARRSHLGYMYEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLHAAGYYLNPS
         +               W                       +R  +GY+Y    R + AI K+   N E Y   + IID+ WE Q H PL AAG++LNP 
Subjt:  LD---------------W--------------------LMARRSHLGYMYEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLHAAGYYLNPS

Query:  FYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALFGQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSAS-GCLRNWS
         +Y N N +   E++  +  CI ++V   ++QDKI+ EL+ YK A  +FG+ LAI  RD + P EWW  +G+S  NL +FA+RIL  T S+S  C RN  
Subjt:  FYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALFGQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSAS-GCLRNWS

Query:  VIEQL
         +E +
Subjt:  VIEQL

AT4G15020.2 hAT transposon superfamily5.1e-4535.41Show/hide
Query:  PQLIWSPCAAHCLDLMLEDIYQICNIRRTLKRGMEISNIIYVRPGLLNMMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTLDEWKDSKCKF
        P L W PCAAHC+D MLE+  ++  I  T+++   I+  +Y   G+LN+M +FT+  D++  A +  AT   TL  I   K+NL+ M T  EW  ++C +
Subjt:  PQLIWSPCAAHCLDLMLEDIYQICNIRRTLKRGMEISNIIYVRPGLLNMMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTLDEWKDSKCKF

Query:  LD---------------W--------------------LMARRSHLGYMYEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLHAAGYYLNPS
         +               W                       +R  +GY+Y    R + AI K+   N E Y   + IID+ WE Q H PL AAG++LNP 
Subjt:  LD---------------W--------------------LMARRSHLGYMYEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLHAAGYYLNPS

Query:  FYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALFGQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSAS-GCLRNWS
         +Y N N +   E++  +  CI ++V   ++QDKI+ EL+ YK A  +FG+ LAI  RD + P EWW  +G+S  NL +FA+RIL  T S+S  C RN  
Subjt:  FYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALFGQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSAS-GCLRNWS

Query:  VIEQL
         +E +
Subjt:  VIEQL

AT5G33406.1 hAT dimerisation domain-containing protein / transposase-related1.2e-5937.61Show/hide
Query:  MMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTLDEWKDSK-------CKFLDWLM---------------------------ARRSHLGYM
        MMR+FT  ++L R A TR AT+ ITL+     K+NLRKM   DEW  SK        K   +                              R+  +GY+
Subjt:  MMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTLDEWKDSK-------CKFLDWLM---------------------------ARRSHLGYM

Query:  YEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLHAAGYYLNPSFYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALF
        Y   D+ +  I KSF   EE YK  F IID+RW++QLHRPLHAAGYYLNP F+Y  P+    +E++ G   C+ ++V  +E QDKI+ EL  +K+A  LF
Subjt:  YEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLHAAGYYLNPSFYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRAKALF

Query:  GQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSASGCLRNWSVI------------------------------------------------
        G P+AI  R K+SP EWW  +G STPNLQ FA+++L LT SA+GC RNW V                                                 
Subjt:  GQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSASGCLRNWSVI------------------------------------------------

Query:  --EQLIGRLDDDSE--EEDELVFDDDSLTWGDVSRAVGAKEPSFYSKASTS
          E L GR++++S   E D+LVF++D LTW +V  A GA +P +Y+  ST+
Subjt:  --EQLIGRLDDDSE--EEDELVFDDDSLTWGDVSRAVGAKEPSFYSKASTS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAGGAGATTGTTAGAAGCAAAGCGACCACAGTTAATATGGTCTCCATGTGCCGCTCATTGCTTAGATTTGATGTTGGAGGATATATACCAGATCTGTAATATTCG
CAGAACATTGAAAAGAGGCATGGAGATTAGCAACATCATATATGTTCGTCCTGGATTGTTAAACATGATGCGACGTTTTACTAACCAAAAGGACTTAGTTAGACTAGCTA
AGACTCGTTTTGCTACTGCTTGCATTACATTATCGAGTATACGTTGTCAAAAGAATAACCTGAGGAAGATGTTTACTTTAGATGAATGGAAGGATAGCAAATGTAAGTTC
TTAGATTGGTTGATGGCGAGAAGAAGCCACCTAGGATATATGTATGAGGGCGGGGATAGAGGTCAGGGAGCTATTGCTAAGTCCTTCAATAATAATGAAGAAAAATACAA
GGACATTTTCACCATAATTGATAAAAGATGGGAGCTTCAGTTGCATCGTCCTCTGCATGCAGCGGGGTATTACTTAAACCCGTCATTCTATTATTCGAATCCCAACAGCC
AAGAGGATGATGAAATAGTTAATGGACTCTACTCATGCATTACGAAAATGGTTGCTTCATTGGAAGTACAAGACAAAATACTTGTAGAGCTAAGCAAGTATAAGAGAGCT
AAAGCATTATTCGGACAACCTTTAGCAATTGGACAAAGGGACAAAATATCTCCAGTGGAATGGTGGGATAATTTTGGACAATCAACTCCGAACTTGCAAAAGTTTGCTGT
GAGAATTTTAGGTCTTACTTATAGTGCTTCTGGATGCTTGCGTAATTGGAGTGTAATTGAACAGTTGATTGGAAGATTGGATGACGATTCTGAGGAGGAGGATGAGTTGG
TATTTGACGACGATTCTTTAACGTGGGGTGATGTTTCAAGAGCTGTCGGAGCAAAAGAACCATCATTCTATTCTAAAGCTAGTACCTCAAGAGCAAAGACTAATGTTTCA
TGTTCATCCTTGTCTACTATGCAACGCAAACAAGTAAATTTGGATGACTTCATCTTGGAAGAAGATACTGGTGGCTATAAGTCTAACGAAAGAGTGAATGAAGACGAGGA
TCAATTTAGTGATGATGAGTTTGATCTTTAG
mRNA sequenceShow/hide mRNA sequence
GCCAAGGTTGGATGCACTGTTATGGCTGATGGATGGATCGATAGAAGAAATAGGACATTAATTAACTTTTTAGTTAACAGTCCTAAAGACACCATGTTTATAGAGTCCAT
CGATGCTTCATCTTATGTGAAAGATGGAAAGAAGATGTTTGAGTTACTTGACAATTTTGCAGACTGAATTGGAGAAGCGAATGTTGTACAAGTAGTTACAGATACTGCCT
CATCAAATGTGATGGCAAGGAGATTGTTAGAAGCAAAGCGACCACAGTTAATATGGTCTCCATGTGCCGCTCATTGCTTAGATTTGATGTTGGAGGATATATACCAGATC
TGTAATATTCGCAGAACATTGAAAAGAGGCATGGAGATTAGCAACATCATATATGTTCGTCCTGGATTGTTAAACATGATGCGACGTTTTACTAACCAAAAGGACTTAGT
TAGACTAGCTAAGACTCGTTTTGCTACTGCTTGCATTACATTATCGAGTATACGTTGTCAAAAGAATAACCTGAGGAAGATGTTTACTTTAGATGAATGGAAGGATAGCA
AATGTAAGTTCTTAGATTGGTTGATGGCGAGAAGAAGCCACCTAGGATATATGTATGAGGGCGGGGATAGAGGTCAGGGAGCTATTGCTAAGTCCTTCAATAATAATGAA
GAAAAATACAAGGACATTTTCACCATAATTGATAAAAGATGGGAGCTTCAGTTGCATCGTCCTCTGCATGCAGCGGGGTATTACTTAAACCCGTCATTCTATTATTCGAA
TCCCAACAGCCAAGAGGATGATGAAATAGTTAATGGACTCTACTCATGCATTACGAAAATGGTTGCTTCATTGGAAGTACAAGACAAAATACTTGTAGAGCTAAGCAAGT
ATAAGAGAGCTAAAGCATTATTCGGACAACCTTTAGCAATTGGACAAAGGGACAAAATATCTCCAGTGGAATGGTGGGATAATTTTGGACAATCAACTCCGAACTTGCAA
AAGTTTGCTGTGAGAATTTTAGGTCTTACTTATAGTGCTTCTGGATGCTTGCGTAATTGGAGTGTAATTGAACAGTTGATTGGAAGATTGGATGACGATTCTGAGGAGGA
GGATGAGTTGGTATTTGACGACGATTCTTTAACGTGGGGTGATGTTTCAAGAGCTGTCGGAGCAAAAGAACCATCATTCTATTCTAAAGCTAGTACCTCAAGAGCAAAGA
CTAATGTTTCATGTTCATCCTTGTCTACTATGCAACGCAAACAAGTAAATTTGGATGACTTCATCTTGGAAGAAGATACTGGTGGCTATAAGTCTAACGAAAGAGTGAAT
GAAGACGAGGATCAATTTAGTGATGATGAGTTTGATCTTTAG
Protein sequenceShow/hide protein sequence
MARRLLEAKRPQLIWSPCAAHCLDLMLEDIYQICNIRRTLKRGMEISNIIYVRPGLLNMMRRFTNQKDLVRLAKTRFATACITLSSIRCQKNNLRKMFTLDEWKDSKCKF
LDWLMARRSHLGYMYEGGDRGQGAIAKSFNNNEEKYKDIFTIIDKRWELQLHRPLHAAGYYLNPSFYYSNPNSQEDDEIVNGLYSCITKMVASLEVQDKILVELSKYKRA
KALFGQPLAIGQRDKISPVEWWDNFGQSTPNLQKFAVRILGLTYSASGCLRNWSVIEQLIGRLDDDSEEEDELVFDDDSLTWGDVSRAVGAKEPSFYSKASTSRAKTNVS
CSSLSTMQRKQVNLDDFILEEDTGGYKSNERVNEDEDQFSDDEFDL