; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10012471 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10012471
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionhAT transposon superfamily
Genome locationChr01:21507318..21511287
RNA-Seq ExpressionHG10012471
SyntenyHG10012471
Gene Ontology termsGO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR008906 - HAT, C-terminal dimerisation domain
IPR012337 - Ribonuclease H-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052155.1 uncharacterized protein E6C27_scaffold1589G00250 [Cucumis melo var. makuwa]0.0e+0071.77Show/hide
Query:  MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDGLHVAYRNRGRQLMGNRNVGANMTPL---------------------------------------
        MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDG HV YRNRGRQLMGNRNVG NMTPL                                       
Subjt:  MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDGLHVAYRNRGRQLMGNRNVGANMTPL---------------------------------------

Query:  -----RIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEEDEKEESMHHISKERFMDGDKRSSKDLRSTFRGMSPSGGSEP
             RIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEE+EKEES+HHISKERF+DGDKR SKDL+STFRGMSP GGSEP
Subjt:  -----RIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEEDEKEESMHHISKERFMDGDKRSSKDLRSTFRGMSPSGGSEP

Query:  SVKRSRLDSVFLKTTKRQTEQVHKQALVKRGGSRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQEEVATIKSY
        SVKRSRLDSVFLKTTKRQTEQV KQALVKRGG+RRSRKEVM+AICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQL+SGRLLQEEVATIKSY
Subjt:  SVKRSRLDSVFLKTTKRQTEQVHKQALVKRGGSRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQEEVATIKSY

Query:  L---------------------------------------------------------------------------------------------------
        L                                                                                                   
Subjt:  L---------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------GLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEWTSSRFSKSGEG
                                                           GLELLRP+VTRNASSF TLQCLL+H+ SLRRMFVS+EWTSSRFSKSGEG
Subjt:  ---------------------------------------------------GLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEWTSSRFSKSGEG

Query:  QEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIYSDMYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRYRP
        QEVE+ VLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIY+DMYRAKFAIQSIHGDDARKYGPFWNVID+NWNSLFCHPLHMAAFFLNPSYRYRP
Subjt:  QEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIYSDMYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRYRP

Query:  DFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTELAISTRTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTCSSLCCEHN
        DFVAHSEVVRGLNECIVRLESD+SRRISASMQISDYNSAKSDFGTELAISTRTELDP         AAWWQQHGISCLELQQIAVRILSQTCSSLC EHN
Subjt:  DFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTELAISTRTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTCSSLCCEHN

Query:  WTPFTKEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNESISLDHILMEHLLDDWIVEPQKQDMQEDEEILCPGLEPLDAYENDLIDYEDGTSEGR
        WTPF KEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNES+SLDHILMEHLLDDWIVEPQKQ MQEDEEILCPG+EPLDAYENDLIDYEDG+S+GR
Subjt:  WTPFTKEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNESISLDHILMEHLLDDWIVEPQKQDMQEDEEILCPGLEPLDAYENDLIDYEDGTSEGR

Query:  KGCLQLVGLTDVDTLDVNPANGGGSIDNDADVKFYDNDNELSD
        KGCLQLVGLTDVDTLDVNPANGG S DNDADVKFY  DNELSD
Subjt:  KGCLQLVGLTDVDTLDVNPANGGGSIDNDADVKFYDNDNELSD

XP_008438995.1 PREDICTED: uncharacterized protein LOC103483923 [Cucumis melo]0.0e+0071.29Show/hide
Query:  MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDGLHVAYRNRGRQLMGNRNVGANMTPL---------------------------------------
        MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDG HV YRNRGRQLMGNRNVG NMTPL                                       
Subjt:  MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDGLHVAYRNRGRQLMGNRNVGANMTPL---------------------------------------

Query:  -----RIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEEDEKEESMHHISKERFMDGDKRSSKDLRSTFRGMSPSGGSEP
             RIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEE+EKEES+HHISKERF+DGDKR SKDL+STFRGM+P GGSEP
Subjt:  -----RIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEEDEKEESMHHISKERFMDGDKRSSKDLRSTFRGMSPSGGSEP

Query:  SVKRSRLDSVFLKTTKRQTEQVHKQALVKRGGSRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQEEVATIKSY
        SVKRSRLDSVFLKTTKRQTEQV KQALVKRGG+RRSRKEVM+AICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQL+SGRLLQEEVATIKSY
Subjt:  SVKRSRLDSVFLKTTKRQTEQVHKQALVKRGGSRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQEEVATIKSY

Query:  L---------------------------------------------------------------------------------------------------
        L                                                                                                   
Subjt:  L---------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------GLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEWTSSRFSKSGEG
                                                           GLELLRP+VTRNASSF TLQCLL+H+ SLRRMFVS+EWTSSRFSKS EG
Subjt:  ---------------------------------------------------GLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEWTSSRFSKSGEG

Query:  QEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIYSDMYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRYRP
        QEVE+ VLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIY+DMYRAKFAIQSIHGDDARKYGPFWNVID+NWNSLFCHPLHMAAFFLNPSYRYRP
Subjt:  QEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIYSDMYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRYRP

Query:  DFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTELAISTRTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTCSSLCCEHN
        DFVAHSEV RGLNECIVRLESD+SRRISASMQISDYNSAKSDFGTELAISTRTELDP         AAWWQQHGISCLELQQIAVRILSQTCSSLC EHN
Subjt:  DFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTELAISTRTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTCSSLCCEHN

Query:  WTPFTKEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNESISLDHILMEHLLDDWIVEPQKQDMQEDEEILCPGLEPLDAYENDLIDYEDGTSEGR
        WTPF KEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNES+SLDHILMEHLLDDWIVEPQKQ MQEDEEILCPG+EPLDAYENDLIDYEDG+S+GR
Subjt:  WTPFTKEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNESISLDHILMEHLLDDWIVEPQKQDMQEDEEILCPGLEPLDAYENDLIDYEDGTSEGR

Query:  KGCLQLVGLTDVDTLDVNPANGGGSIDNDADVKFYDNDNELSD
        KGCLQLVGLTD+DTLDVNPANGG S DNDADVKFY  DNELSD
Subjt:  KGCLQLVGLTDVDTLDVNPANGGGSIDNDADVKFYDNDNELSD

XP_011651096.1 uncharacterized protein LOC101213851 [Cucumis sativus]0.0e+0072.12Show/hide
Query:  MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDGLHVAYRNRGRQLMGNRNVGANMTPL---------------------------------------
        MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDG HV YRNRGRQLMGNRNVG NMTPL                                       
Subjt:  MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDGLHVAYRNRGRQLMGNRNVGANMTPL---------------------------------------

Query:  -----RIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEEDEKEESMHHISKERFMDGDKRSSKDLRSTFRGMSPSGGSEP
             RIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEE+EKEES+HHISKERF+DGDKR SKDL+STFRGMSP GGSEP
Subjt:  -----RIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEEDEKEESMHHISKERFMDGDKRSSKDLRSTFRGMSPSGGSEP

Query:  SVKRSRLDSVFLKTTKRQTEQVHKQALVKRGGSRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQEEVATIKSY
        SVKRSRLDSVFLKTTKRQTEQV KQALVKRGG+RRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQL+SGRLLQEEVATIKSY
Subjt:  SVKRSRLDSVFLKTTKRQTEQVHKQALVKRGGSRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQEEVATIKSY

Query:  L---------------------------------------------------------------------------------------------------
        L                                                                                                   
Subjt:  L---------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------GLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEWTSSRFSKSGEG
                                                           GLELLRPAVTRNASSF TLQCLL+HR +LRRMFVSNEWTSSRFSKSGEG
Subjt:  ---------------------------------------------------GLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEWTSSRFSKSGEG

Query:  QEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIYSDMYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRYRP
        QEVE+ VLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIY+DMYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCH LHMAAFFLNPSYRYRP
Subjt:  QEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIYSDMYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRYRP

Query:  DFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTELAISTRTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTCSSLCCEHN
        DFVAHSEVVRGLNECIVRLESD+SRRISASMQISDYNSAKSDFGTELAISTRTELDP         AAWWQQHGISCLELQQIAVRILSQTCSSLC EHN
Subjt:  DFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTELAISTRTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTCSSLCCEHN

Query:  WTPFTKEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNESISLDHILMEHLLDDWIVEPQKQDMQEDEEILCPGLEPLDAYENDLIDYEDGTSEGR
        WTPF KEHSQRHN+LSQRKMADLLYVHYNL+LRERQLRKQSNESISLDHILMEHLLDDWIVEP+KQ MQEDEEILCPG+EPLDAYENDLIDYEDGTSEGR
Subjt:  WTPFTKEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNESISLDHILMEHLLDDWIVEPQKQDMQEDEEILCPGLEPLDAYENDLIDYEDGTSEGR

Query:  KGCLQLVGLTDVDTLDVNPANGGGSIDNDADVKFYDNDNELSD
        KGCLQLVGLTDVDTLDVNPANGG S DNDADVKFY  DNELSD
Subjt:  KGCLQLVGLTDVDTLDVNPANGGGSIDNDADVKFYDNDNELSD

XP_022137898.1 uncharacterized protein LOC111009208 isoform X1 [Momordica charantia]0.0e+0068.96Show/hide
Query:  MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDGLHVAYRNRGRQLMGNRNVGANMTPL---------------------------------------
        MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDGLHVAYR+RGRQLMG+RN+ ANMT L                                       
Subjt:  MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDGLHVAYRNRGRQLMGNRNVGANMTPL---------------------------------------

Query:  -----RIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEEDEKEESMHHISKERFMDGDKRSSKDLRSTFRGMSPSGGSEP
             RIPGEVAPCKHAP+EVYLKIKENMKWHRTGRRH  TDANEISAYFMQSDNE+EEDEKEES+HHISKER +DGDKR SKDLRSTFRGMSP GGSEP
Subjt:  -----RIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEEDEKEESMHHISKERFMDGDKRSSKDLRSTFRGMSPSGGSEP

Query:  SVKRSRLDSVFLKTTKRQTEQVHKQALVKRGGSRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQEEVATIKSY
        SVKRSRLDSVFLKTTKRQTEQVHKQALVKRG +RRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYG GLVGPSCQLISGR LQEE+ATIK+Y
Subjt:  SVKRSRLDSVFLKTTKRQTEQVHKQALVKRGGSRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQEEVATIKSY

Query:  L---------------------------------------------------------------------------------------------------
        L                                                                                                   
Subjt:  L---------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------GLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEWTSSRFSKSGEG
                                                           G+ELLRPAVT+NASSF TLQCLLDHRASLRRMF+SNEWTS RFSKSGEG
Subjt:  ---------------------------------------------------GLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEWTSSRFSKSGEG

Query:  QEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIYSDMYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRYRP
        +EVE+ VLN SFWKKVQYVCKSVEPVLQVLQKVD VQSLS+SSIY+DMYRAKFAI+SIH DDARKYGPFWNVID+NWNSLFCHPLHMAA+FLNPSYRYRP
Subjt:  QEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIYSDMYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRYRP

Query:  DFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTELAISTRTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTCSSLCCEHN
        DFVAHSEVVRGLNECIVRLESDNSRRISAS+QISDYNSAKSDFGTELAISTRTELDP         AAWWQQHGISCLELQQIAVRILSQTCSSLCCEHN
Subjt:  DFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTELAISTRTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTCSSLCCEHN

Query:  WTPFTKEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNESISLDHILMEHLLDDWIVEPQKQDMQEDEEILCPGLEPLDAYENDLIDYEDGTS-EG
        W+PF KE SQRHN LSQR+MAD+LYVHYNLRLRERQ RK+S++SISLDHIL EHLLD+WIVEPQKQ MQEDEEILCPG+EPLDAYENDLIDYEDGTS EG
Subjt:  WTPFTKEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNESISLDHILMEHLLDDWIVEPQKQDMQEDEEILCPGLEPLDAYENDLIDYEDGTS-EG

Query:  RKGCLQLVGLTDVDTLDVNPANGGGSIDNDADVKFYDNDNELSD
        RKGCLQLV LTDV+TLDVNPANGG S DNDADVKFY  D+ELSD
Subjt:  RKGCLQLVGLTDVDTLDVNPANGGGSIDNDADVKFYDNDNELSD

XP_038882020.1 uncharacterized protein LOC120073319 isoform X1 [Benincasa hispida]0.0e+0072.48Show/hide
Query:  MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDGLHVAYRNRGRQLMGNRNVGANMTPL---------------------------------------
        MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDGLHVAYRNRGRQLMGNRNVGANMTPL                                       
Subjt:  MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDGLHVAYRNRGRQLMGNRNVGANMTPL---------------------------------------

Query:  -----RIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEEDEKEESMHHISKERFMDGDKRSSKDLRSTFRGMSPSGGSEP
             RIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEEDEKEES+HHISKERF+DGDKR SKDLRSTFRGMSP GGSEP
Subjt:  -----RIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEEDEKEESMHHISKERFMDGDKRSSKDLRSTFRGMSPSGGSEP

Query:  SVKRSRLDSVFLKTTKRQTEQVHKQALVKRGGSRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQEEVATIKSY
        SVKRSRLDSVFLKTTKRQTEQVHKQALVKRGG+RRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLE VGQYGSGLVGPSCQLISGRLLQEEVATIKSY
Subjt:  SVKRSRLDSVFLKTTKRQTEQVHKQALVKRGGSRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQEEVATIKSY

Query:  L---------------------------------------------------------------------------------------------------
        L                                                                                                   
Subjt:  L---------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------GLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEWTSSRFSKSGEG
                                                           GLELLRPAVTRNAS+FTTLQCLLDHRASLRRMFVSNEWTSSRFSK+GEG
Subjt:  ---------------------------------------------------GLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEWTSSRFSKSGEG

Query:  QEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIYSDMYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRYRP
        QEVE+ VLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQ+LSISSIY+DMYRAKFAIQS+HGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRYRP
Subjt:  QEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIYSDMYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRYRP

Query:  DFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTELAISTRTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTCSSLCCEHN
        DFVAHSEVVRGLNECIVRLESD+SRRISASMQISDYNSAKSDFGTELAISTRTELDP         AAWWQQHGISCLELQQIAVRILSQTCSSLCCEHN
Subjt:  DFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTELAISTRTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTCSSLCCEHN

Query:  WTPFTKEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNESISLDHILMEHLLDDWIVEPQKQDMQEDEEILCPGLEPLDAYENDLIDYEDGTSEGR
        WTPF+KEHSQ HNTLSQRKMADLLYVHYNLRLRERQLRKQSN S+SLDHILMEHLLD+W VEPQKQ +QEDEEILCPG+EPLDAYENDLIDYE+GTSEGR
Subjt:  WTPFTKEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNESISLDHILMEHLLDDWIVEPQKQDMQEDEEILCPGLEPLDAYENDLIDYEDGTSEGR

Query:  KGCLQLVGLTDVDTLDVNPANGGGSIDNDADVKFYDNDNELSD
        KGCLQLVGLTDVDTLDVNP NGG S DNDADVKFY  DNELSD
Subjt:  KGCLQLVGLTDVDTLDVNPANGGGSIDNDADVKFYDNDNELSD

TrEMBL top hitse value%identityAlignment
A0A0A0L859 Uncharacterized protein0.0e+0072.12Show/hide
Query:  MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDGLHVAYRNRGRQLMGNRNVGANMTPL---------------------------------------
        MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDG HV YRNRGRQLMGNRNVG NMTPL                                       
Subjt:  MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDGLHVAYRNRGRQLMGNRNVGANMTPL---------------------------------------

Query:  -----RIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEEDEKEESMHHISKERFMDGDKRSSKDLRSTFRGMSPSGGSEP
             RIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEE+EKEES+HHISKERF+DGDKR SKDL+STFRGMSP GGSEP
Subjt:  -----RIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEEDEKEESMHHISKERFMDGDKRSSKDLRSTFRGMSPSGGSEP

Query:  SVKRSRLDSVFLKTTKRQTEQVHKQALVKRGGSRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQEEVATIKSY
        SVKRSRLDSVFLKTTKRQTEQV KQALVKRGG+RRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQL+SGRLLQEEVATIKSY
Subjt:  SVKRSRLDSVFLKTTKRQTEQVHKQALVKRGGSRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQEEVATIKSY

Query:  L---------------------------------------------------------------------------------------------------
        L                                                                                                   
Subjt:  L---------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------GLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEWTSSRFSKSGEG
                                                           GLELLRPAVTRNASSF TLQCLL+HR +LRRMFVSNEWTSSRFSKSGEG
Subjt:  ---------------------------------------------------GLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEWTSSRFSKSGEG

Query:  QEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIYSDMYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRYRP
        QEVE+ VLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIY+DMYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCH LHMAAFFLNPSYRYRP
Subjt:  QEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIYSDMYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRYRP

Query:  DFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTELAISTRTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTCSSLCCEHN
        DFVAHSEVVRGLNECIVRLESD+SRRISASMQISDYNSAKSDFGTELAISTRTELDP         AAWWQQHGISCLELQQIAVRILSQTCSSLC EHN
Subjt:  DFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTELAISTRTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTCSSLCCEHN

Query:  WTPFTKEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNESISLDHILMEHLLDDWIVEPQKQDMQEDEEILCPGLEPLDAYENDLIDYEDGTSEGR
        WTPF KEHSQRHN+LSQRKMADLLYVHYNL+LRERQLRKQSNESISLDHILMEHLLDDWIVEP+KQ MQEDEEILCPG+EPLDAYENDLIDYEDGTSEGR
Subjt:  WTPFTKEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNESISLDHILMEHLLDDWIVEPQKQDMQEDEEILCPGLEPLDAYENDLIDYEDGTSEGR

Query:  KGCLQLVGLTDVDTLDVNPANGGGSIDNDADVKFYDNDNELSD
        KGCLQLVGLTDVDTLDVNPANGG S DNDADVKFY  DNELSD
Subjt:  KGCLQLVGLTDVDTLDVNPANGGGSIDNDADVKFYDNDNELSD

A0A1S3AXN6 uncharacterized protein LOC1034839230.0e+0071.29Show/hide
Query:  MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDGLHVAYRNRGRQLMGNRNVGANMTPL---------------------------------------
        MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDG HV YRNRGRQLMGNRNVG NMTPL                                       
Subjt:  MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDGLHVAYRNRGRQLMGNRNVGANMTPL---------------------------------------

Query:  -----RIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEEDEKEESMHHISKERFMDGDKRSSKDLRSTFRGMSPSGGSEP
             RIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEE+EKEES+HHISKERF+DGDKR SKDL+STFRGM+P GGSEP
Subjt:  -----RIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEEDEKEESMHHISKERFMDGDKRSSKDLRSTFRGMSPSGGSEP

Query:  SVKRSRLDSVFLKTTKRQTEQVHKQALVKRGGSRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQEEVATIKSY
        SVKRSRLDSVFLKTTKRQTEQV KQALVKRGG+RRSRKEVM+AICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQL+SGRLLQEEVATIKSY
Subjt:  SVKRSRLDSVFLKTTKRQTEQVHKQALVKRGGSRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQEEVATIKSY

Query:  L---------------------------------------------------------------------------------------------------
        L                                                                                                   
Subjt:  L---------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------GLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEWTSSRFSKSGEG
                                                           GLELLRP+VTRNASSF TLQCLL+H+ SLRRMFVS+EWTSSRFSKS EG
Subjt:  ---------------------------------------------------GLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEWTSSRFSKSGEG

Query:  QEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIYSDMYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRYRP
        QEVE+ VLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIY+DMYRAKFAIQSIHGDDARKYGPFWNVID+NWNSLFCHPLHMAAFFLNPSYRYRP
Subjt:  QEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIYSDMYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRYRP

Query:  DFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTELAISTRTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTCSSLCCEHN
        DFVAHSEV RGLNECIVRLESD+SRRISASMQISDYNSAKSDFGTELAISTRTELDP         AAWWQQHGISCLELQQIAVRILSQTCSSLC EHN
Subjt:  DFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTELAISTRTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTCSSLCCEHN

Query:  WTPFTKEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNESISLDHILMEHLLDDWIVEPQKQDMQEDEEILCPGLEPLDAYENDLIDYEDGTSEGR
        WTPF KEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNES+SLDHILMEHLLDDWIVEPQKQ MQEDEEILCPG+EPLDAYENDLIDYEDG+S+GR
Subjt:  WTPFTKEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNESISLDHILMEHLLDDWIVEPQKQDMQEDEEILCPGLEPLDAYENDLIDYEDGTSEGR

Query:  KGCLQLVGLTDVDTLDVNPANGGGSIDNDADVKFYDNDNELSD
        KGCLQLVGLTD+DTLDVNPANGG S DNDADVKFY  DNELSD
Subjt:  KGCLQLVGLTDVDTLDVNPANGGGSIDNDADVKFYDNDNELSD

A0A5A7UEM6 Uncharacterized protein0.0e+0071.77Show/hide
Query:  MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDGLHVAYRNRGRQLMGNRNVGANMTPL---------------------------------------
        MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDG HV YRNRGRQLMGNRNVG NMTPL                                       
Subjt:  MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDGLHVAYRNRGRQLMGNRNVGANMTPL---------------------------------------

Query:  -----RIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEEDEKEESMHHISKERFMDGDKRSSKDLRSTFRGMSPSGGSEP
             RIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEE+EKEES+HHISKERF+DGDKR SKDL+STFRGMSP GGSEP
Subjt:  -----RIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEEDEKEESMHHISKERFMDGDKRSSKDLRSTFRGMSPSGGSEP

Query:  SVKRSRLDSVFLKTTKRQTEQVHKQALVKRGGSRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQEEVATIKSY
        SVKRSRLDSVFLKTTKRQTEQV KQALVKRGG+RRSRKEVM+AICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQL+SGRLLQEEVATIKSY
Subjt:  SVKRSRLDSVFLKTTKRQTEQVHKQALVKRGGSRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQEEVATIKSY

Query:  L---------------------------------------------------------------------------------------------------
        L                                                                                                   
Subjt:  L---------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------GLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEWTSSRFSKSGEG
                                                           GLELLRP+VTRNASSF TLQCLL+H+ SLRRMFVS+EWTSSRFSKSGEG
Subjt:  ---------------------------------------------------GLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEWTSSRFSKSGEG

Query:  QEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIYSDMYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRYRP
        QEVE+ VLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIY+DMYRAKFAIQSIHGDDARKYGPFWNVID+NWNSLFCHPLHMAAFFLNPSYRYRP
Subjt:  QEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIYSDMYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRYRP

Query:  DFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTELAISTRTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTCSSLCCEHN
        DFVAHSEVVRGLNECIVRLESD+SRRISASMQISDYNSAKSDFGTELAISTRTELDP         AAWWQQHGISCLELQQIAVRILSQTCSSLC EHN
Subjt:  DFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTELAISTRTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTCSSLCCEHN

Query:  WTPFTKEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNESISLDHILMEHLLDDWIVEPQKQDMQEDEEILCPGLEPLDAYENDLIDYEDGTSEGR
        WTPF KEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNES+SLDHILMEHLLDDWIVEPQKQ MQEDEEILCPG+EPLDAYENDLIDYEDG+S+GR
Subjt:  WTPFTKEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNESISLDHILMEHLLDDWIVEPQKQDMQEDEEILCPGLEPLDAYENDLIDYEDGTSEGR

Query:  KGCLQLVGLTDVDTLDVNPANGGGSIDNDADVKFYDNDNELSD
        KGCLQLVGLTDVDTLDVNPANGG S DNDADVKFY  DNELSD
Subjt:  KGCLQLVGLTDVDTLDVNPANGGGSIDNDADVKFYDNDNELSD

A0A6J1CBM4 uncharacterized protein LOC111009208 isoform X20.0e+0068.96Show/hide
Query:  MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDGLHVAYRNRGRQLMGNRNVGANMTPL---------------------------------------
        MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDGLHVAYR+RGRQLMG+RN+ ANMT L                                       
Subjt:  MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDGLHVAYRNRGRQLMGNRNVGANMTPL---------------------------------------

Query:  -----RIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEEDEKEESMHHISKERFMDGDKRSSKDLRSTFRGMSPSGGSEP
             RIPGEVAPCKHAP+EVYLKIKENMKWHRTGRRH  TDANEISAYFMQSDNE+EEDEKEES+HHISKER +DGDKR SKDLRSTFRGMSP GGSEP
Subjt:  -----RIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEEDEKEESMHHISKERFMDGDKRSSKDLRSTFRGMSPSGGSEP

Query:  SVKRSRLDSVFLKTTKRQTEQVHKQALVKRGGSRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQEEVATIKSY
        SVKRSRLDSVFLKTTKRQTEQVHKQALVKRG +RRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYG GLVGPSCQLISGR LQEE+ATIK+Y
Subjt:  SVKRSRLDSVFLKTTKRQTEQVHKQALVKRGGSRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQEEVATIKSY

Query:  L---------------------------------------------------------------------------------------------------
        L                                                                                                   
Subjt:  L---------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------GLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEWTSSRFSKSGEG
                                                           G+ELLRPAVT+NASSF TLQCLLDHRASLRRMF+SNEWTS RFSKSGEG
Subjt:  ---------------------------------------------------GLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEWTSSRFSKSGEG

Query:  QEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIYSDMYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRYRP
        +EVE+ VLN SFWKKVQYVCKSVEPVLQVLQKVD VQSLS+SSIY+DMYRAKFAI+SIH DDARKYGPFWNVID+NWNSLFCHPLHMAA+FLNPSYRYRP
Subjt:  QEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIYSDMYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRYRP

Query:  DFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTELAISTRTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTCSSLCCEHN
        DFVAHSEVVRGLNECIVRLESDNSRRISAS+QISDYNSAKSDFGTELAISTRTELDP         AAWWQQHGISCLELQQIAVRILSQTCSSLCCEHN
Subjt:  DFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTELAISTRTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTCSSLCCEHN

Query:  WTPFTKEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNESISLDHILMEHLLDDWIVEPQKQDMQEDEEILCPGLEPLDAYENDLIDYEDGTS-EG
        W+PF KE SQRHN LSQR+MAD+LYVHYNLRLRERQ RK+S++SISLDHIL EHLLD+WIVEPQKQ MQEDEEILCPG+EPLDAYENDLIDYEDGTS EG
Subjt:  WTPFTKEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNESISLDHILMEHLLDDWIVEPQKQDMQEDEEILCPGLEPLDAYENDLIDYEDGTS-EG

Query:  RKGCLQLVGLTDVDTLDVNPANGGGSIDNDADVKFYDNDNELSD
        RKGCLQLV LTDV+TLDVNPANGG S DNDADVKFY  D+ELSD
Subjt:  RKGCLQLVGLTDVDTLDVNPANGGGSIDNDADVKFYDNDNELSD

E5GC38 DNA binding protein0.0e+0071.29Show/hide
Query:  MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDGLHVAYRNRGRQLMGNRNVGANMTPL---------------------------------------
        MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDG HV YRNRGRQLMGNRNVG NMTPL                                       
Subjt:  MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDGLHVAYRNRGRQLMGNRNVGANMTPL---------------------------------------

Query:  -----RIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEEDEKEESMHHISKERFMDGDKRSSKDLRSTFRGMSPSGGSEP
             RIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEE+EKEES+HHISKERF+DGDKR SKDL+STFRGM+P GGSEP
Subjt:  -----RIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSDNEEEEDEKEESMHHISKERFMDGDKRSSKDLRSTFRGMSPSGGSEP

Query:  SVKRSRLDSVFLKTTKRQTEQVHKQALVKRGGSRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQEEVATIKSY
        SVKRSRLDSVFLKTTKRQTEQV KQALVKRGG+RRSRKEVM+AICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQL+SGRLLQEEVATIKSY
Subjt:  SVKRSRLDSVFLKTTKRQTEQVHKQALVKRGGSRRSRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQEEVATIKSY

Query:  L---------------------------------------------------------------------------------------------------
        L                                                                                                   
Subjt:  L---------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------GLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEWTSSRFSKSGEG
                                                           GLELLRP+VTRNASSF TLQCLL+H+ SLRRMFVS+EWTSSRFSKS EG
Subjt:  ---------------------------------------------------GLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEWTSSRFSKSGEG

Query:  QEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIYSDMYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRYRP
        QEVE+ VLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIY+DMYRAKFAIQSIHGDDARKYGPFWNVID+NWNSLFCHPLHMAAFFLNPSYRYRP
Subjt:  QEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIYSDMYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRYRP

Query:  DFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTELAISTRTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTCSSLCCEHN
        DFVAHSEV RGLNECIVRLESD+SRRISASMQISDYNSAKSDFGTELAISTRTELDP         AAWWQQHGISCLELQQIAVRILSQTCSSLC EHN
Subjt:  DFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTELAISTRTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTCSSLCCEHN

Query:  WTPFTKEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNESISLDHILMEHLLDDWIVEPQKQDMQEDEEILCPGLEPLDAYENDLIDYEDGTSEGR
        WTPF KEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNES+SLDHILMEHLLDDWIVEPQKQ MQEDEEILCPG+EPLDAYENDLIDYEDG+S+GR
Subjt:  WTPFTKEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNESISLDHILMEHLLDDWIVEPQKQDMQEDEEILCPGLEPLDAYENDLIDYEDGTSEGR

Query:  KGCLQLVGLTDVDTLDVNPANGGGSIDNDADVKFYDNDNELSD
        KGCLQLVGLTD+DTLDVNPANGG S DNDADVKFY  DNELSD
Subjt:  KGCLQLVGLTDVDTLDVNPANGGGSIDNDADVKFYDNDNELSD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G17450.1 hAT dimerisation domain-containing protein5.3e-12738.34Show/hide
Query:  MRENLEGCRSNKKPRQSEDDE-QSYLNFH--SNDDEED---------------GLH------------------VAYRNRGRQLMGNR-----NVGANMT
        M+ENL   RS KK RQSED+  QS  +FH  +NDDE D               GL                   +A   R +++  N      + G N  
Subjt:  MRENLEGCRSNKKPRQSEDDE-QSYLNFH--SNDDEED---------------GLH------------------VAYRNRGRQLMGNR-----NVGANMT

Query:  P---LRIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYF--MQSDNEEEEDEKEESMHHISKERFMDGDKRSSKDLRSTFRGMSPSGGS
             RIPGEVAPCK APEEVY+KIKENMKWHR G+R  + D    +  F  +  D ++EED ++   +  S++R M G+ R SKD R +F   +    S
Subjt:  P---LRIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYF--MQSDNEEEEDEKEESMHHISKERFMDGDKRSSKDLRSTFRGMSPSGGS

Query:  EPSVKRSRLDSVFLKTTKRQTEQVHKQALVKRGGSRR--SRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQEEVAT
        E   KR+R+          Q+    KQ  +    S R  SRK+V S+I KF  + G+P ++ANS+YF KM+E +G YG G V PS QL SGRLLQEE++T
Subjt:  EPSVKRSRLDSVFLKTTKRQTEQVHKQALVKRGGSRR--SRKEVMSAICKFFCYAGIPFQSANSVYFHKMLETVGQYGSGLVGPSCQLISGRLLQEEVAT

Query:  IKSYL-----------------------------------------------------------------------------------------------
        IKSYL                                                                                               
Subjt:  IKSYL-----------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------GLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEW-TSSRFS
                                                               GL+LLRPAV R+AS FTTLQ L+DH+ASLR +F S+ W  S   +
Subjt:  -------------------------------------------------------GLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEW-TSSRFS

Query:  KSGEGQEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKV-DSVQSLSISSIYSDMYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNP
        KS EG+EVE  VL+  FWKKVQYV KSV+PV+QV+  + D    LS+   Y  M  AK AI+SIH DDARKYGPFW VI+  WN LF HPL++AA+F NP
Subjt:  KSGEGQEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKV-DSVQSLSISSIYSDMYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNP

Query:  SYRYRPDFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTELAISTRTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTCSS
        +Y+YRPDF+A SEVVRG+NECIVRLE DN+RRI+A MQI DY  AK+DFGT++AI TRTELDP         +AWWQQHGISCLELQ++AVRILS TCSS
Subjt:  SYRYRPDFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTELAISTRTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTCSS

Query:  LCCEHNWTPFTKEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRK----QSNESISLDHILMEHLLDDWIVEPQKQDMQEDEEILCPGLEPLDAYENDLI
        + CE  W+ + + +SQ  +   ++   DL YVHYNLRLRE+QL++    +     +L+H L++ LL DW+V  +K    E+EE     L   D  E +  
Subjt:  LCCEHNWTPFTKEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRK----QSNESISLDHILMEHLLDDWIVEPQKQDMQEDEEILCPGLEPLDAYENDLI

Query:  DYEDGTSEGRKGCLQLVGLTDVDTLDVNPANGGGSIDNDADVKFYDNDNELSD
        D+ED   E  K  ++                  G++D + +      D++LSD
Subjt:  DYEDGTSEGRKGCLQLVGLTDVDTLDVNPANGGGSIDNDADVKFYDNDNELSD

AT3G22220.1 hAT transposon superfamily3.9e-3731.66Show/hide
Query:  KSYLGLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEWTSSRFSKSGEGQEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIYSD
        K   G ++++P  T +A++FTT+  + D +  L+ M  S+EW    +SK   G  +   + +  FWK +        P+L+VL+ V S +  ++  +Y+ 
Subjt:  KSYLGLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEWTSSRFSKSGEGQEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIYSD

Query:  MYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRYRPDFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTEL
        MYRAK AI++ +     +Y  +W +ID  W      PL+ A F+LNP + Y  D    SE+   + +CI +L  D + +      I+ Y +A   FG  L
Subjt:  MYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRYRPDFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTEL

Query:  AISTRTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTC-SSLCCEHNWTPFTKEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNESIS
        AI  R  + P         A WW  +G SCL L + A+RILSQTC SS+    N T  ++ +  + N++ ++++ DL++V YN+RLR        ++++ 
Subjt:  AISTRTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTC-SSLCCEHNWTPFTKEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNESIS

Query:  -LDHILMEHLLDDWIVEPQ
         L H  ME +L+DW+   Q
Subjt:  -LDHILMEHLLDDWIVEPQ

AT3G22220.2 hAT transposon superfamily3.9e-3731.66Show/hide
Query:  KSYLGLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEWTSSRFSKSGEGQEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIYSD
        K   G ++++P  T +A++FTT+  + D +  L+ M  S+EW    +SK   G  +   + +  FWK +        P+L+VL+ V S +  ++  +Y+ 
Subjt:  KSYLGLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEWTSSRFSKSGEGQEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIYSD

Query:  MYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRYRPDFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTEL
        MYRAK AI++ +     +Y  +W +ID  W      PL+ A F+LNP + Y  D    SE+   + +CI +L  D + +      I+ Y +A   FG  L
Subjt:  MYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRYRPDFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTEL

Query:  AISTRTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTC-SSLCCEHNWTPFTKEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNESIS
        AI  R  + P         A WW  +G SCL L + A+RILSQTC SS+    N T  ++ +  + N++ ++++ DL++V YN+RLR        ++++ 
Subjt:  AISTRTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTC-SSLCCEHNWTPFTKEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNESIS

Query:  -LDHILMEHLLDDWIVEPQ
         L H  ME +L+DW+   Q
Subjt:  -LDHILMEHLLDDWIVEPQ

AT4G15020.1 hAT transposon superfamily8.8e-3732.48Show/hide
Query:  GLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEWTSSRFSKSGEGQEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIYSDMYRA
        G ++L PA + +A++F TL  + + +++L+ M  S EW    +S+   G  +   + + +FWK V  V     P+L+ L+ V S +  ++  +Y+ +YRA
Subjt:  GLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEWTSSRFSKSGEGQEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIYSDMYRA

Query:  KFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRYRPDFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTELAIST
        K AI++ H  +   Y  +W +ID  W      PL  A FFLNP   Y  +    SE++  + +CI RL  D+  +     +++ Y +A   FG  LAI  
Subjt:  KFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRYRPDFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTELAIST

Query:  RTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTC-SSLCCEHNWTPFTKEH-SQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNESISLDH
        R  + P         A WW  +G SCL L + A+RILSQTC SS+ C  N  P   EH  Q  N++ Q++++DL++V YN+RLR+       +    L H
Subjt:  RTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTC-SSLCCEHNWTPFTKEH-SQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNESISLDH

Query:  ILMEHLLDDWI
          ++ +L +W+
Subjt:  ILMEHLLDDWI

AT5G33406.1 hAT dimerisation domain-containing protein / transposase-related1.6e-4332.31Show/hide
Query:  GLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEWTSSRFSKSGEGQEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIYSDMYRA
        G  L RPA+TR A+SF TL      + +LR+M  S+EW +S+++K   G +++ F    SFWK V +  K   P++QVL+ VD  +   +  IY  M +A
Subjt:  GLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEWTSSRFSKSGEGQEVEIFVLNPSFWKKVQYVCKSVEPVLQVLQKVDSVQSLSISSIYSDMYRA

Query:  KFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRY-RPDFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTELAIS
        K  I          Y   + +ID  W+     PLH A ++LNP + Y +PD + + EV+ G   C+ RL      +     ++  +  A   FG  +AI 
Subjt:  KFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRY-RPDFVAHSEVVRGLNECIVRLESDNSRRISASMQISDYNSAKSDFGTELAIS

Query:  TRTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTCSSLCCEHNWTPFTKEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNESISLDHI
         RT++ P         A WW  +G S   LQ  A+++LS TCS+  CE NW  F   H++R N L+Q ++ D+++V YN  L+ R  R  + + I L+ I
Subjt:  TRTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTCSSLCCEHNWTPFTKEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNESISLDHI

Query:  LMEHLLDDWI---VEPQKQDMQEDE
              ++W+   +E    D + D+
Subjt:  LMEHLLDDWI---VEPQKQDMQEDE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGAAAACCTGGAAGGTTGTCGTTCCAATAAGAAACCAAGGCAATCTGAAGATGATGAACAATCGTATTTGAACTTCCATTCCAATGATGATGAAGAAGATGGTTT
ACATGTGGCTTATAGAAACAGAGGAAGGCAACTGATGGGAAACAGGAATGTTGGTGCTAACATGACTCCTCTAAGAATTCCTGGAGAGGTAGCTCCTTGTAAACATGCTC
CTGAGGAAGTGTATCTTAAGATCAAAGAGAATATGAAATGGCATCGTACTGGTAGGAGACATGTACAGACTGATGCCAATGAGATATCGGCTTATTTTATGCAATCAGAC
AATGAAGAAGAAGAAGATGAGAAAGAGGAATCCATGCATCATATTAGCAAGGAAAGGTTTATGGATGGTGACAAAAGATCGAGCAAAGATTTGAGAAGTACTTTCAGGGG
AATGTCCCCCAGTGGTGGATCTGAACCGTCAGTTAAAAGATCAAGGTTAGATTCTGTTTTTCTGAAAACCACCAAAAGACAAACTGAACAGGTGCACAAACAAGCTTTAG
TAAAAAGAGGAGGCAGTAGGAGGTCACGGAAAGAAGTGATGTCTGCAATTTGCAAATTCTTTTGCTATGCAGGAATTCCGTTTCAATCTGCAAATTCTGTTTACTTTCAT
AAGATGTTGGAGACAGTTGGTCAATATGGGTCAGGCTTGGTTGGCCCTTCGTGTCAATTGATATCTGGTCGGTTATTACAGGAGGAAGTTGCAACCATTAAGTCTTACCT
GGGGTTGGAACTTCTGAGACCTGCAGTTACTCGGAATGCCTCAAGCTTTACTACTTTGCAGTGCTTGCTGGACCACAGAGCTAGTTTACGGAGAATGTTTGTCTCCAACG
AGTGGACTTCTAGCAGGTTTTCCAAATCTGGTGAGGGACAAGAAGTAGAGATATTTGTATTAAATCCTTCATTTTGGAAGAAGGTGCAGTATGTTTGTAAATCTGTGGAA
CCAGTATTGCAAGTTCTTCAAAAAGTCGATTCTGTTCAAAGCTTGTCAATTTCATCAATATATAGTGATATGTACAGAGCCAAGTTTGCTATACAATCCATTCATGGTGA
TGATGCCAGGAAATATGGACCATTCTGGAATGTGATAGATAGCAACTGGAATTCTTTATTTTGCCACCCTTTACATATGGCTGCTTTTTTCTTAAACCCATCATACAGAT
ATCGTCCCGATTTTGTGGCGCATTCGGAGGTGGTTCGTGGACTTAATGAATGCATAGTTCGACTAGAGTCTGACAATTCCAGAAGGATTTCTGCATCTATGCAGATTTCT
GACTATAATTCAGCAAAGTCTGATTTTGGAACTGAGCTGGCTATTAGTACAAGAACAGAACTTGATCCAGTCATCAATTCCATTTTTTCTTTAATAGCTGCATGGTGGCA
ACAACATGGAATCAGTTGCTTAGAACTGCAACAAATAGCTGTTCGCATACTGAGTCAGACATGTTCATCTTTGTGTTGTGAACACAACTGGACCCCATTCACTAAGGAAC
ACAGTCAAAGGCACAATACTTTGTCTCAGAGAAAAATGGCTGATTTGTTGTATGTTCACTACAACCTGCGACTTCGAGAACGCCAACTAAGAAAGCAATCTAATGAGTCT
ATTTCTCTTGATCATATTCTTATGGAACATTTATTGGATGATTGGATTGTGGAACCCCAGAAACAAGACATGCAAGAAGATGAGGAAATCCTTTGTCCTGGATTGGAGCC
GCTTGATGCATATGAGAATGATTTGATTGACTATGAGGATGGGACTTCAGAGGGCCGGAAGGGTTGCCTTCAACTGGTTGGTTTGACTGATGTAGACACATTGGATGTCA
ATCCTGCCAACGGAGGCGGTTCCATAGACAACGATGCCGATGTTAAGTTCTATGACAATGACAATGAGCTAAGTGACTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGAGAAAACCTGGAAGGTTGTCGTTCCAATAAGAAACCAAGGCAATCTGAAGATGATGAACAATCGTATTTGAACTTCCATTCCAATGATGATGAAGAAGATGGTTT
ACATGTGGCTTATAGAAACAGAGGAAGGCAACTGATGGGAAACAGGAATGTTGGTGCTAACATGACTCCTCTAAGAATTCCTGGAGAGGTAGCTCCTTGTAAACATGCTC
CTGAGGAAGTGTATCTTAAGATCAAAGAGAATATGAAATGGCATCGTACTGGTAGGAGACATGTACAGACTGATGCCAATGAGATATCGGCTTATTTTATGCAATCAGAC
AATGAAGAAGAAGAAGATGAGAAAGAGGAATCCATGCATCATATTAGCAAGGAAAGGTTTATGGATGGTGACAAAAGATCGAGCAAAGATTTGAGAAGTACTTTCAGGGG
AATGTCCCCCAGTGGTGGATCTGAACCGTCAGTTAAAAGATCAAGGTTAGATTCTGTTTTTCTGAAAACCACCAAAAGACAAACTGAACAGGTGCACAAACAAGCTTTAG
TAAAAAGAGGAGGCAGTAGGAGGTCACGGAAAGAAGTGATGTCTGCAATTTGCAAATTCTTTTGCTATGCAGGAATTCCGTTTCAATCTGCAAATTCTGTTTACTTTCAT
AAGATGTTGGAGACAGTTGGTCAATATGGGTCAGGCTTGGTTGGCCCTTCGTGTCAATTGATATCTGGTCGGTTATTACAGGAGGAAGTTGCAACCATTAAGTCTTACCT
GGGGTTGGAACTTCTGAGACCTGCAGTTACTCGGAATGCCTCAAGCTTTACTACTTTGCAGTGCTTGCTGGACCACAGAGCTAGTTTACGGAGAATGTTTGTCTCCAACG
AGTGGACTTCTAGCAGGTTTTCCAAATCTGGTGAGGGACAAGAAGTAGAGATATTTGTATTAAATCCTTCATTTTGGAAGAAGGTGCAGTATGTTTGTAAATCTGTGGAA
CCAGTATTGCAAGTTCTTCAAAAAGTCGATTCTGTTCAAAGCTTGTCAATTTCATCAATATATAGTGATATGTACAGAGCCAAGTTTGCTATACAATCCATTCATGGTGA
TGATGCCAGGAAATATGGACCATTCTGGAATGTGATAGATAGCAACTGGAATTCTTTATTTTGCCACCCTTTACATATGGCTGCTTTTTTCTTAAACCCATCATACAGAT
ATCGTCCCGATTTTGTGGCGCATTCGGAGGTGGTTCGTGGACTTAATGAATGCATAGTTCGACTAGAGTCTGACAATTCCAGAAGGATTTCTGCATCTATGCAGATTTCT
GACTATAATTCAGCAAAGTCTGATTTTGGAACTGAGCTGGCTATTAGTACAAGAACAGAACTTGATCCAGTCATCAATTCCATTTTTTCTTTAATAGCTGCATGGTGGCA
ACAACATGGAATCAGTTGCTTAGAACTGCAACAAATAGCTGTTCGCATACTGAGTCAGACATGTTCATCTTTGTGTTGTGAACACAACTGGACCCCATTCACTAAGGAAC
ACAGTCAAAGGCACAATACTTTGTCTCAGAGAAAAATGGCTGATTTGTTGTATGTTCACTACAACCTGCGACTTCGAGAACGCCAACTAAGAAAGCAATCTAATGAGTCT
ATTTCTCTTGATCATATTCTTATGGAACATTTATTGGATGATTGGATTGTGGAACCCCAGAAACAAGACATGCAAGAAGATGAGGAAATCCTTTGTCCTGGATTGGAGCC
GCTTGATGCATATGAGAATGATTTGATTGACTATGAGGATGGGACTTCAGAGGGCCGGAAGGGTTGCCTTCAACTGGTTGGTTTGACTGATGTAGACACATTGGATGTCA
ATCCTGCCAACGGAGGCGGTTCCATAGACAACGATGCCGATGTTAAGTTCTATGACAATGACAATGAGCTAAGTGACTAA
Protein sequenceShow/hide protein sequence
MRENLEGCRSNKKPRQSEDDEQSYLNFHSNDDEEDGLHVAYRNRGRQLMGNRNVGANMTPLRIPGEVAPCKHAPEEVYLKIKENMKWHRTGRRHVQTDANEISAYFMQSD
NEEEEDEKEESMHHISKERFMDGDKRSSKDLRSTFRGMSPSGGSEPSVKRSRLDSVFLKTTKRQTEQVHKQALVKRGGSRRSRKEVMSAICKFFCYAGIPFQSANSVYFH
KMLETVGQYGSGLVGPSCQLISGRLLQEEVATIKSYLGLELLRPAVTRNASSFTTLQCLLDHRASLRRMFVSNEWTSSRFSKSGEGQEVEIFVLNPSFWKKVQYVCKSVE
PVLQVLQKVDSVQSLSISSIYSDMYRAKFAIQSIHGDDARKYGPFWNVIDSNWNSLFCHPLHMAAFFLNPSYRYRPDFVAHSEVVRGLNECIVRLESDNSRRISASMQIS
DYNSAKSDFGTELAISTRTELDPVINSIFSLIAAWWQQHGISCLELQQIAVRILSQTCSSLCCEHNWTPFTKEHSQRHNTLSQRKMADLLYVHYNLRLRERQLRKQSNES
ISLDHILMEHLLDDWIVEPQKQDMQEDEEILCPGLEPLDAYENDLIDYEDGTSEGRKGCLQLVGLTDVDTLDVNPANGGGSIDNDADVKFYDNDNELSD