; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10008921 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10008921
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionhAT transposon superfamily
Genome locationChr06:748220..752654
RNA-Seq ExpressionHG10008921
SyntenyHG10008921
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
GO:0008422 - beta-glucosidase activity (molecular function)
InterPro domainsIPR003656 - Zinc finger, BED-type
IPR007021 - Domain of unknown function DUF659
IPR012337 - Ribonuclease H-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042569.1 HAT transposon superfamily isoform 2 [Cucumis melo var. makuwa]3.8e-25069.9Show/hide
Query:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS
        VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKE SSGKKQKLAEVKTVEN PS+SMCKS
Subjt:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS

Query:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA
        VVSME PSPIAKVFPT TPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFT PSAETLKTTWLERIKTEVSLQSKDIEKEW 
Subjt:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA

Query:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN
        TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKS+DASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN
Subjt:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN

Query:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN
        +ILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTG QELIRTGISKPVSSFLS QSILKQRSRLKHMFNSP+YTTN Y+NKPQSISC+AIIEDN
Subjt:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN

Query:  DFWRAVEECVAISEPFLRVLRE------------------------------------------------------------------------------
        DFWRAVEECVAISEPFLRVLRE                                                                              
Subjt:  DFWRAVEECVAISEPFLRVLRE------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------LARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL
                   LARQMRTKP ESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL
Subjt:  -----------LARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL

KGN49576.2 hypothetical protein Csa_000026 [Cucumis sativus]1.2e-24866.71Show/hide
Query:  DLLLQFDFAAPLPVERELRAKGVGI---FQMVVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATR
        +L+L FD +         +  GV +   + ++VREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATR
Subjt:  DLLLQFDFAAPLPVERELRAKGVGI---FQMVVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATR

Query:  EEIKETSSGKKQKLAEVKTVENAPSMSMCKSVVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFT
        EEIKE S+GKKQKLAEVKTVE+ PS+SMCKSVVS+E PSP+AKVFPT TPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFT
Subjt:  EEIKETSSGKKQKLAEVKTVENAPSMSMCKSVVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFT

Query:  APSAETLKTTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIM
         PSAETLKTTWLERIKTEVSLQSKDIEKEW TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKS+DASTYFKNTKCL DLFDSVIQDFGHENVVQIIM
Subjt:  APSAETLKTTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIM

Query:  DSSLNYSGIANHILQTYGTIFVSPCASQCLNAILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSR
        DSSLNYSG ANHILQTYGTIFVSPCASQCLN+ILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTG QELIRTGISKPVSSFLSLQSILKQRSR
Subjt:  DSSLNYSGIANHILQTYGTIFVSPCASQCLNAILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSR

Query:  LKHMFNSPEYTTNPYSNKPQSISCLAIIEDNDFWRAVEECVAISEPFLRVLRE-----------------------------------------------
        LKHMFNSP+YTTN Y+NKPQSISC+AIIEDNDFWRAVEECVAISEPFLRVLRE                                               
Subjt:  LKHMFNSPEYTTNPYSNKPQSISCLAIIEDNDFWRAVEECVAISEPFLRVLRE-----------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------LARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNA
                                                  LARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDG DLNTRQFNA
Subjt:  ------------------------------------------LARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNA

Query:  AMFGASDHIFNL
        AMFGA+DHIFNL
Subjt:  AMFGASDHIFNL

XP_004145979.2 uncharacterized protein LOC101215128 isoform X1 [Cucumis sativus]1.3e-24768.87Show/hide
Query:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS
        +VREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKE S+GKKQKLAEVKTVE+ PS+SMCKS
Subjt:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS

Query:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA
        VVS+E PSP+AKVFPT TPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFT PSAETLKTTWLERIKTEVSLQSKDIEKEW 
Subjt:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA

Query:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN
        TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKS+DASTYFKNTKCL DLFDSVIQDFGHENVVQIIMDSSLNYSG ANHILQTYGTIFVSPCASQCLN
Subjt:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN

Query:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN
        +ILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTG QELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSP+YTTN Y+NKPQSISC+AIIEDN
Subjt:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN

Query:  DFWRAVEECVAISEPFLRVLRE------------------------------------------------------------------------------
        DFWRAVEECVAISEPFLRVLRE                                                                              
Subjt:  DFWRAVEECVAISEPFLRVLRE------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------LARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL
                   LARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDG DLNTRQFNAAMFGA+DHIFNL
Subjt:  -----------LARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL

XP_008437565.1 PREDICTED: uncharacterized protein LOC103482941 [Cucumis melo]1.3e-25069.9Show/hide
Query:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS
        +VREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKE SSGKKQKLAEVKTVEN PS+SMCKS
Subjt:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS

Query:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA
        VVSME PSPIAKVFPT TPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFT PSAETLKTTWLERIKTEVSLQSKDIEKEW 
Subjt:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA

Query:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN
        TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKS+DASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN
Subjt:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN

Query:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN
        +ILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTG QELIRTGISKPVSSFLS QSILKQRSRLKHMFNSP+YTTN Y+NKPQSISC+AIIEDN
Subjt:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN

Query:  DFWRAVEECVAISEPFLRVLRE------------------------------------------------------------------------------
        DFWRAVEECVAISEPFLRVLRE                                                                              
Subjt:  DFWRAVEECVAISEPFLRVLRE------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------LARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL
                   LARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL
Subjt:  -----------LARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL

XP_038874524.1 uncharacterized protein LOC120067148 isoform X1 [Benincasa hispida]4.6e-25671.51Show/hide
Query:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS
        +VREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS
Subjt:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS

Query:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA
        V+SMEAPSPIAKVFPT TPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA
Subjt:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA

Query:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN
        TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDS LNYSGIANHILQTYGTIFVSPCASQCLN
Subjt:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN

Query:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN
        AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISC+AIIEDN
Subjt:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN

Query:  DFWRAVEECVAISEPFLRVLRE------------------------------------------------------------------------------
        DFWRAVEECVAISEPFLRVLRE                                                                              
Subjt:  DFWRAVEECVAISEPFLRVLRE------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------LARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL
                   LARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL
Subjt:  -----------LARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL

TrEMBL top hitse value%identityAlignment
A0A1S4DSA2 uncharacterized protein LOC1034829416.3e-25169.9Show/hide
Query:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS
        +VREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKE SSGKKQKLAEVKTVEN PS+SMCKS
Subjt:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS

Query:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA
        VVSME PSPIAKVFPT TPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFT PSAETLKTTWLERIKTEVSLQSKDIEKEW 
Subjt:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA

Query:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN
        TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKS+DASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN
Subjt:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN

Query:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN
        +ILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTG QELIRTGISKPVSSFLS QSILKQRSRLKHMFNSP+YTTN Y+NKPQSISC+AIIEDN
Subjt:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN

Query:  DFWRAVEECVAISEPFLRVLRE------------------------------------------------------------------------------
        DFWRAVEECVAISEPFLRVLRE                                                                              
Subjt:  DFWRAVEECVAISEPFLRVLRE------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------LARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL
                   LARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL
Subjt:  -----------LARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL

A0A5A7TMH8 HAT transposon superfamily isoform 21.8e-25069.9Show/hide
Query:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS
        VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKE SSGKKQKLAEVKTVEN PS+SMCKS
Subjt:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS

Query:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA
        VVSME PSPIAKVFPT TPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFT PSAETLKTTWLERIKTEVSLQSKDIEKEW 
Subjt:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA

Query:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN
        TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKS+DASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN
Subjt:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN

Query:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN
        +ILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTG QELIRTGISKPVSSFLS QSILKQRSRLKHMFNSP+YTTN Y+NKPQSISC+AIIEDN
Subjt:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN

Query:  DFWRAVEECVAISEPFLRVLRE------------------------------------------------------------------------------
        DFWRAVEECVAISEPFLRVLRE                                                                              
Subjt:  DFWRAVEECVAISEPFLRVLRE------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------LARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL
                   LARQMRTKP ESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL
Subjt:  -----------LARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL

A0A6J1E643 uncharacterized protein LOC111430305 isoform X17.2e-24768.91Show/hide
Query:  MVVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCK
        ++VREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQK+AEVKT+ENAPSMS CK
Subjt:  MVVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCK

Query:  SVVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEW
        SVVSMEAPSPIAKVFPT TPMAPPSL NHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFT PSAETLK+TWLERIKTEVSLQSKDIEKEW
Subjt:  SVVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEW

Query:  ATTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCL
        ATTGCTIIVDTWTDNKSRALINFLVSSPS+TFFHKS+DAS YFKNTKCLADLFDSVIQDFGHENVVQIIMDSS NY+GIANHILQTYGTIFVSPCASQCL
Subjt:  ATTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCL

Query:  NAILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIED
        N+ILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTG QELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPY+NKPQSISC+AIIED
Subjt:  NAILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIED

Query:  NDFWRAVEECVAISEPFLRVLRE-----------------------------------------------------------------------------
        NDFWRAVEECVAISEPFLRVLRE                                                                             
Subjt:  NDFWRAVEECVAISEPFLRVLRE-----------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------LARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL
                    LARQM+TKPLESDPIQFDDIDMTSEWVEESEN SPTQWLDRFG SLDGGDLNTRQFNAA+F ASDHIFNL
Subjt:  ------------LARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL

A0A6J1E893 uncharacterized protein LOC111430305 isoform X21.2e-24669.02Show/hide
Query:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS
        +VREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQK+AEVKT+ENAPSMS CKS
Subjt:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS

Query:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA
        VVSMEAPSPIAKVFPT TPMAPPSL NHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFT PSAETLK+TWLERIKTEVSLQSKDIEKEWA
Subjt:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA

Query:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN
        TTGCTIIVDTWTDNKSRALINFLVSSPS+TFFHKS+DAS YFKNTKCLADLFDSVIQDFGHENVVQIIMDSS NY+GIANHILQTYGTIFVSPCASQCLN
Subjt:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN

Query:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN
        +ILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTG QELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPY+NKPQSISC+AIIEDN
Subjt:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN

Query:  DFWRAVEECVAISEPFLRVLRE------------------------------------------------------------------------------
        DFWRAVEECVAISEPFLRVLRE                                                                              
Subjt:  DFWRAVEECVAISEPFLRVLRE------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------LARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL
                   LARQM+TKPLESDPIQFDDIDMTSEWVEESEN SPTQWLDRFG SLDGGDLNTRQFNAA+F ASDHIFNL
Subjt:  -----------LARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL

A0A6J1IDC4 uncharacterized protein LOC111471543 isoform X11.6e-24668.91Show/hide
Query:  MVVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCK
        ++VREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREE KETSSGKKQK+AEVKTVENAPSMS CK
Subjt:  MVVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCK

Query:  SVVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEW
        SVVSMEAPSPIAKVFPT TPMAPPSL NHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFT PSAETLK+TWLERIKTEVSLQSKDIEKEW
Subjt:  SVVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEW

Query:  ATTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCL
        ATTGCTIIVDTWTDNKSRALINFLVSSPS+TFFHKS+DAS YFKNTKCLADLFDSVIQDFGHENVVQIIMDSS NY+GIANHILQTYGTIFVSPCASQCL
Subjt:  ATTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCL

Query:  NAILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIED
        N+ILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTG QELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPY+NKPQSISC+AIIED
Subjt:  NAILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIED

Query:  NDFWRAVEECVAISEPFLRVLRE-----------------------------------------------------------------------------
        NDFWRAVEECVAISEPFLRVLRE                                                                             
Subjt:  NDFWRAVEECVAISEPFLRVLRE-----------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------LARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL
                    LARQM+TKPLESDPIQFDDIDMTSEWVEESEN SPTQWLDRFG SLDGGDLNTRQFNAA+F ASDHIFNL
Subjt:  ------------LARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G79740.1 hAT transposon superfamily5.2e-16548.9Show/hide
Query:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS
        +VREKDICWEYAEKLDGNKVKCKFC RVLNGGISRLKHHLSRLPS+GVNPC+KVRDDV+DRVR+IL+ +++   T+  K             P +S    
Subjt:  VVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAEVKTVENAPSMSMCKS

Query:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA
            +AP+    VFP++ P A       + AE+SI+LFFFENK+DF++ARS SY  M+DA+ KCGPGF APS    KT WL+R+K+++SLQ KD EKEW 
Subjt:  VVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWA

Query:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN
        TTGCTII + WTDNKSRALINF VSSPSR FFHKS+DAS+YFKN+KCLADLFDSVIQD G E++VQIIMD+S  Y+GI+NH+LQ Y TIFVSPCASQCLN
Subjt:  TTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLN

Query:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN
         ILEEFSKVDWVN+CI QAQ ISKF+YN+S +LDL+R+ TG Q++IR+G+++ VS+FLSLQS++KQ++RLKHMFN PEYTTN  +NKPQSISC+ I+EDN
Subjt:  AILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDN

Query:  DFWRAVEECVAISEPFLRVLRELA----------------------------------------------------------------------------
        DFWRAVEE VAISEP L+VLRE++                                                                            
Subjt:  DFWRAVEECVAISEPFLRVLRELA----------------------------------------------------------------------------

Query:  ------------------------------------------------------------------------------------RQM-------------
                                                                                            +QM             
Subjt:  ------------------------------------------------------------------------------------RQM-------------

Query:  -------------RTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDH-IFNL
                     R   LE+DPI  +DIDM SEWVEE+EN SP QWLDRFG++LDGGDLNTRQF  A+F A+DH IF L
Subjt:  -------------RTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDH-IFNL

AT3G17450.1 hAT dimerisation domain-containing protein2.9e-4626.67Show/hide
Query:  WEY--AEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAE-------VKTVENAP------
        WE+  A+     KVKC +C ++++GGI+R K HL+R+P   V PC    ++V  ++      +E +K   +GK+Q   +        +TV   P      
Subjt:  WEY--AEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGKKQKLAE-------VKTVENAP------

Query:  --------------------SMSMCKSVVS------MEAPSPIAKVFPTATPMAPPS----------LHNHENAEKSIALFFFENKLDFSIARSSSYQLM
                            S    KS  S       EA +  A++ P  +P +             + + ++   SI+ F     +    A S  +Q M
Subjt:  --------------------SMSMCKSVVS------MEAPSPIAKVFPTATPMAPPS----------LHNHENAEKSIALFFFENKLDFSIARSSSYQLM

Query:  IDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQ
        I+ IG  G GF  PS++      L+   + +    ++    W  TGC+I+ DTWT+ + + +I+FLVS P   +FH SIDA+   ++   L    D ++ 
Subjt:  IDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQ

Query:  DFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLNAILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMR-RFTGSQELIRTGISKPVSS
        D G ENVVQ+I  ++  +      + +    ++ +PCA  C   +LE+FSK+++V+ C+ +AQ I++F+YN + LL+LM+  FT   +L+R  + +  S 
Subjt:  DFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLNAILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMR-RFTGSQELIRTGISKPVSS

Query:  FLSLQSILKQRSRLKHMFNSPEYTTNPYSNK-PQSISCLAIIEDNDFWRAVEECVAISEPFLRVL
        F +LQS++  ++ L+ +F S  +  +  + K  +      ++    FW+ V+  +   +P ++V+
Subjt:  FLSLQSILKQRSRLKHMFNSPEYTTNPYSNK-PQSISCLAIIEDNDFWRAVEECVAISEPFLRVL

AT3G22220.1 hAT transposon superfamily1.0e-4331Show/hide
Query:  PPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALIN
        P S    +    ++  F F+   DF  A S + Q  IDAI   G G + P+ E L+   L+    EV  +  + +  W  TGC+++V     N+   ++ 
Subjt:  PPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALIN

Query:  FLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLNAILEEFSKVDWVNRCILQAQT
        FLV  P +  F KS+DAS    +   L +L   V+++ G  NVVQ+I     +Y+     ++  Y +++  PCA+ C++ +LEEF K+DW+   I QA+T
Subjt:  FLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLNAILEEFSKVDWVNRCILQAQT

Query:  ISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDNDFWRAVEECVAISEPFLRVLR
        +++ +YN S +L+LMR+FT   ++++   +   ++F ++  I   +  L+ M  S E+    YS +   ++    I D DFW+A+     I+ P LRVLR
Subjt:  ISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDNDFWRAVEECVAISEPFLRVLR

AT4G15020.1 hAT transposon superfamily7.8e-4426.25Show/hide
Query:  REKDICWEYAEKL---DGNKVKCKFCLRVL-NGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAIL---ATREEIKETSSGKKQKLAEVKTVE-----
        +++D  W++ E     D  +++C +C ++   GGI+R+K HL+    +G   C +V +DV   ++  +     R+  +  SS +   +A +  +E     
Subjt:  REKDICWEYAEKL---DGNKVKCKFCLRVL-NGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAIL---ATREEIKETSSGKKQKLAEVKTVE-----

Query:  --------------------------------------NA---PSMSMCKSVVSMEAPSPIAKVFPTATPMAPPSLHNHENA-EKSIALFFFENKLDFSI
                                              NA    S S    ++  +  + I     +   +  PS  + EN    +I  F F    DF  
Subjt:  --------------------------------------NA---PSMSMCKSVVSMEAPSPIAKVFPTATPMAPPSLHNHENA-EKSIALFFFENKLDFSI

Query:  ARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCL
          S ++Q MIDAI   G G +AP+ + L+   L+    E++ +  + +  W  TGC+I+V+    +K   ++NFLV  P +  F KS+DAS    +   L
Subjt:  ARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCL

Query:  ADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLNAILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRT
         +L   ++++ G  NVVQ+I      Y      ++  Y +++  PCA+ C++ +LEEF K+ W++  I QAQ I++F+YN S +L+LM +FT   +++  
Subjt:  ADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLNAILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRT

Query:  GISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDNDFWRAVEECVAISEPFLRVLRELARQMR
          S   ++F +L  I + +S L+ M  S E+    YS +P  +  +  + D  FW+AV     ++ P LR LR +  + R
Subjt:  GISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDNDFWRAVEECVAISEPFLRVLRELARQMR

AT4G15020.2 hAT transposon superfamily7.8e-4426.25Show/hide
Query:  REKDICWEYAEKL---DGNKVKCKFCLRVL-NGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAIL---ATREEIKETSSGKKQKLAEVKTVE-----
        +++D  W++ E     D  +++C +C ++   GGI+R+K HL+    +G   C +V +DV   ++  +     R+  +  SS +   +A +  +E     
Subjt:  REKDICWEYAEKL---DGNKVKCKFCLRVL-NGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAIL---ATREEIKETSSGKKQKLAEVKTVE-----

Query:  --------------------------------------NA---PSMSMCKSVVSMEAPSPIAKVFPTATPMAPPSLHNHENA-EKSIALFFFENKLDFSI
                                              NA    S S    ++  +  + I     +   +  PS  + EN    +I  F F    DF  
Subjt:  --------------------------------------NA---PSMSMCKSVVSMEAPSPIAKVFPTATPMAPPSLHNHENA-EKSIALFFFENKLDFSI

Query:  ARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCL
          S ++Q MIDAI   G G +AP+ + L+   L+    E++ +  + +  W  TGC+I+V+    +K   ++NFLV  P +  F KS+DAS    +   L
Subjt:  ARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVSLQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCL

Query:  ADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLNAILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRT
         +L   ++++ G  NVVQ+I      Y      ++  Y +++  PCA+ C++ +LEEF K+ W++  I QAQ I++F+YN S +L+LM +FT   +++  
Subjt:  ADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCLNAILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRT

Query:  GISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDNDFWRAVEECVAISEPFLRVLRELARQMR
          S   ++F +L  I + +S L+ M  S E+    YS +P  +  +  + D  FW+AV     ++ P LR LR +  + R
Subjt:  GISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDNDFWRAVEECVAISEPFLRVLRELARQMR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTGGGATTTGCTTTTACAGTTCGATTTTGCTGCTCCTTTGCCTGTCGAGAGAGAATTGAGGGCAAAGGGTGTTGGGATTTTTCAAATGGTGGTCCGTGAGAAAGA
TATTTGTTGGGAATATGCCGAGAAATTAGATGGTAACAAGGTGAAGTGCAAATTTTGTCTGAGAGTTTTGAATGGTGGGATTAGTAGATTGAAGCATCATTTATCTCGAT
TACCGAGTAGAGGTGTAAATCCATGTAGTAAAGTGAGGGACGATGTTTCTGATAGAGTGAGAGCCATACTAGCAACTAGAGAGGAGATCAAGGAAACATCCAGTGGGAAA
AAGCAGAAGCTAGCTGAAGTCAAGACGGTTGAAAATGCACCATCGATGTCAATGTGTAAATCTGTTGTTTCAATGGAGGCCCCATCACCAATTGCCAAAGTTTTTCCAAC
TGCTACTCCCATGGCTCCCCCGTCACTACACAACCATGAAAATGCTGAGAAAAGCATTGCTTTATTCTTTTTTGAGAATAAGCTAGATTTTAGTATAGCTAGATCTTCAT
CCTATCAGCTAATGATCGATGCAATAGGGAAATGTGGCCCTGGATTTACAGCCCCTTCTGCTGAAACTCTGAAGACTACTTGGTTGGAGAGGATCAAAACTGAAGTGAGC
CTTCAGTCAAAGGATATTGAGAAAGAGTGGGCTACCACCGGCTGCACAATCATTGTAGACACATGGACTGACAATAAATCAAGAGCTTTGATTAACTTTTTGGTTTCATC
CCCATCCCGGACCTTTTTTCACAAATCCATCGATGCATCTACATATTTCAAGAACACAAAGTGCCTTGCTGATTTATTTGATTCCGTCATTCAAGATTTCGGCCATGAAA
ATGTAGTGCAGATTATCATGGACAGTAGTTTGAATTATTCAGGTATTGCAAATCATATCCTTCAGACTTACGGGACTATATTTGTGTCTCCCTGTGCTTCACAGTGTCTG
AATGCAATTTTGGAGGAATTTTCAAAGGTAGATTGGGTAAACAGATGTATCCTGCAAGCACAAACCATATCAAAATTTCTATATAATAGTTCCTCACTGCTTGACCTGAT
GCGAAGGTTCACTGGCAGTCAAGAACTCATTCGGACTGGGATATCGAAACCCGTATCGAGTTTCCTGTCTTTGCAATCTATTCTGAAGCAAAGGTCAAGACTGAAGCATA
TGTTCAACAGCCCTGAATACACCACAAATCCTTATTCAAATAAACCACAGAGCATTTCTTGTCTTGCCATTATAGAAGATAATGATTTCTGGAGGGCAGTGGAAGAATGT
GTGGCAATATCAGAGCCTTTCCTAAGAGTCTTGAGAGAATTGGCTAGACAGATGAGAACAAAACCCCTGGAATCTGACCCTATTCAGTTTGACGACATTGATATGACTTC
GGAGTGGGTAGAGGAGAGCGAAAACCAAAGCCCGACGCAGTGGCTCGACAGATTTGGTTCTTCTTTGGATGGGGGCGACTTGAATACCAGACAGTTCAATGCTGCCATGT
TTGGTGCAAGTGACCACATATTTAATCTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTTGGGATTTGCTTTTACAGTTCGATTTTGCTGCTCCTTTGCCTGTCGAGAGAGAATTGAGGGCAAAGGGTGTTGGGATTTTTCAAATGGTGGTCCGTGAGAAAGA
TATTTGTTGGGAATATGCCGAGAAATTAGATGGTAACAAGGTGAAGTGCAAATTTTGTCTGAGAGTTTTGAATGGTGGGATTAGTAGATTGAAGCATCATTTATCTCGAT
TACCGAGTAGAGGTGTAAATCCATGTAGTAAAGTGAGGGACGATGTTTCTGATAGAGTGAGAGCCATACTAGCAACTAGAGAGGAGATCAAGGAAACATCCAGTGGGAAA
AAGCAGAAGCTAGCTGAAGTCAAGACGGTTGAAAATGCACCATCGATGTCAATGTGTAAATCTGTTGTTTCAATGGAGGCCCCATCACCAATTGCCAAAGTTTTTCCAAC
TGCTACTCCCATGGCTCCCCCGTCACTACACAACCATGAAAATGCTGAGAAAAGCATTGCTTTATTCTTTTTTGAGAATAAGCTAGATTTTAGTATAGCTAGATCTTCAT
CCTATCAGCTAATGATCGATGCAATAGGGAAATGTGGCCCTGGATTTACAGCCCCTTCTGCTGAAACTCTGAAGACTACTTGGTTGGAGAGGATCAAAACTGAAGTGAGC
CTTCAGTCAAAGGATATTGAGAAAGAGTGGGCTACCACCGGCTGCACAATCATTGTAGACACATGGACTGACAATAAATCAAGAGCTTTGATTAACTTTTTGGTTTCATC
CCCATCCCGGACCTTTTTTCACAAATCCATCGATGCATCTACATATTTCAAGAACACAAAGTGCCTTGCTGATTTATTTGATTCCGTCATTCAAGATTTCGGCCATGAAA
ATGTAGTGCAGATTATCATGGACAGTAGTTTGAATTATTCAGGTATTGCAAATCATATCCTTCAGACTTACGGGACTATATTTGTGTCTCCCTGTGCTTCACAGTGTCTG
AATGCAATTTTGGAGGAATTTTCAAAGGTAGATTGGGTAAACAGATGTATCCTGCAAGCACAAACCATATCAAAATTTCTATATAATAGTTCCTCACTGCTTGACCTGAT
GCGAAGGTTCACTGGCAGTCAAGAACTCATTCGGACTGGGATATCGAAACCCGTATCGAGTTTCCTGTCTTTGCAATCTATTCTGAAGCAAAGGTCAAGACTGAAGCATA
TGTTCAACAGCCCTGAATACACCACAAATCCTTATTCAAATAAACCACAGAGCATTTCTTGTCTTGCCATTATAGAAGATAATGATTTCTGGAGGGCAGTGGAAGAATGT
GTGGCAATATCAGAGCCTTTCCTAAGAGTCTTGAGAGAATTGGCTAGACAGATGAGAACAAAACCCCTGGAATCTGACCCTATTCAGTTTGACGACATTGATATGACTTC
GGAGTGGGTAGAGGAGAGCGAAAACCAAAGCCCGACGCAGTGGCTCGACAGATTTGGTTCTTCTTTGGATGGGGGCGACTTGAATACCAGACAGTTCAATGCTGCCATGT
TTGGTGCAAGTGACCACATATTTAATCTGTGA
Protein sequenceShow/hide protein sequence
MGWDLLLQFDFAAPLPVERELRAKGVGIFQMVVREKDICWEYAEKLDGNKVKCKFCLRVLNGGISRLKHHLSRLPSRGVNPCSKVRDDVSDRVRAILATREEIKETSSGK
KQKLAEVKTVENAPSMSMCKSVVSMEAPSPIAKVFPTATPMAPPSLHNHENAEKSIALFFFENKLDFSIARSSSYQLMIDAIGKCGPGFTAPSAETLKTTWLERIKTEVS
LQSKDIEKEWATTGCTIIVDTWTDNKSRALINFLVSSPSRTFFHKSIDASTYFKNTKCLADLFDSVIQDFGHENVVQIIMDSSLNYSGIANHILQTYGTIFVSPCASQCL
NAILEEFSKVDWVNRCILQAQTISKFLYNSSSLLDLMRRFTGSQELIRTGISKPVSSFLSLQSILKQRSRLKHMFNSPEYTTNPYSNKPQSISCLAIIEDNDFWRAVEEC
VAISEPFLRVLRELARQMRTKPLESDPIQFDDIDMTSEWVEESENQSPTQWLDRFGSSLDGGDLNTRQFNAAMFGASDHIFNL