; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Csor.00g020820 (gene) of Silver-seed gourd (wild; sororia) v1 genome

Gene IDCsor.00g020820
OrganismCucurbita argyrosperma subsp. sororia (Silver-seed gourd (wild; sororia) v1)
DescriptionDDE Tnp4 domain-containing protein
Genome locationCsor_Chr04:2497948..2499252
RNA-Seq ExpressionCsor.00g020820
SyntenyCsor.00g020820
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600319.1 Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. sororia]3.51e-296100Show/hide
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
        MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH

Query:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PSELELTSSSFEDIAGLPNCCGVISCTSIVAGFRGDKDDSTVLMSTTLFKDIEEGRLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEE
        PSELELTSSSFEDIAGLPNCCGVISCTSIVAGFRGDKDDSTVLMSTTLFKDIEEGRLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEE
Subjt:  PSELELTSSSFEDIAGLPNCCGVISCTSIVAGFRGDKDDSTVLMSTTLFKDIEEGRLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEE

Query:  SFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSPDEKACMIQKALALR
        SFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSPDEKACMIQKALALR
Subjt:  SFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSPDEKACMIQKALALR

Query:  ARELHT
        ARELHT
Subjt:  ARELHT

KAG7030976.1 Protein ALP1-like protein [Cucurbita argyrosperma subsp. argyrosperma]2.88e-29599.75Show/hide
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
        MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH

Query:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PSELELTSSSFEDIAGLPNCCGVISCTSIVAGFRGDKDDSTVLMSTTLFKDIEEGRLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEE
        PSELELTSSSFEDIAGLPNCCGVISCTSIVAGFRGDKDDSTVLMST LFKDIEEGRLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEE
Subjt:  PSELELTSSSFEDIAGLPNCCGVISCTSIVAGFRGDKDDSTVLMSTTLFKDIEEGRLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEE

Query:  SFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSPDEKACMIQKALALR
        SFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSPDEKACMIQKALALR
Subjt:  SFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSPDEKACMIQKALALR

Query:  ARELHT
        ARELHT
Subjt:  ARELHT

XP_022941624.1 protein ALP1-like [Cucurbita moschata]3.92e-29399.26Show/hide
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
        MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH

Query:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PSELELTSSSFEDIAGLPNCCGVISCTSIVAGFRGDKDDSTVLMSTTLFKDIEEGRLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEE
        PSELELTSSSFEDIAGLPNCCGVISCTSIVAGFRGDKDDSTVLMSTTLFKDIEE RLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEE
Subjt:  PSELELTSSSFEDIAGLPNCCGVISCTSIVAGFRGDKDDSTVLMSTTLFKDIEEGRLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEE

Query:  SFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSPDEKACMIQKALALR
        SFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSPDEKA M+QKALALR
Subjt:  SFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSPDEKACMIQKALALR

Query:  ARELHT
        ARELHT
Subjt:  ARELHT

XP_022980954.1 protein ALP1-like [Cucurbita maxima]2.17e-29098.52Show/hide
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
        MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH

Query:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PSELELTSSSFEDIAGLPNCCGVISCTSIVAGFRGDKDDSTVLMSTTLFKDIEEGRLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEE
        PSELELTSS+FEDIAGLPNCCGVISCTSIVAGFRGDKDDSTVLMSTTLFKDIEE RLLGSPPVYLHG+AVNQYLFGHG+YPLLPWLMVPFAGAVSGSTEE
Subjt:  PSELELTSSSFEDIAGLPNCCGVISCTSIVAGFRGDKDDSTVLMSTTLFKDIEEGRLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEE

Query:  SFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSPDEKACMIQKALALR
        SFNEAHRLM IPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSPDEKA MIQKALALR
Subjt:  SFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSPDEKACMIQKALALR

Query:  ARELHT
        ARELHT
Subjt:  ARELHT

XP_023536005.1 protein ALP1-like [Cucurbita pepo subsp. pepo]3.36e-29499.51Show/hide
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
        MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH

Query:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PSELELTSSSFEDIAGLPNCCGVISCTSIVAGFRGDKDDSTVLMSTTLFKDIEEGRLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEE
        PSELELTSS+FEDIAGLPNCCGVISCTSIVAGFRGDKDDSTVLMSTTLFKDIEEGRLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEE
Subjt:  PSELELTSSSFEDIAGLPNCCGVISCTSIVAGFRGDKDDSTVLMSTTLFKDIEEGRLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEE

Query:  SFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSPDEKACMIQKALALR
        SFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSPDEKA MIQKALALR
Subjt:  SFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSPDEKACMIQKALALR

Query:  ARELHT
        ARELHT
Subjt:  ARELHT

TrEMBL top hitse value%identityAlignment
A0A0A0LBX6 DDE Tnp4 domain-containing protein1.17e-24180Show/hide
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
        MDS +LAALLSSLISQLLLLL LLFPSSNPHSL SNS+ DS+FYANLF    HFLFSQ  AASL FLSVSRKRKRT+ S+ LELG S         GRVH
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH

Query:  -LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
         L RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFP
Subjt:  -LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP

Query:  CPSELELTSSSFEDIAGLPNCCGVISCT----------------------------SIVAGFRGDKDDSTVLMSTTLFKDIEEGRLLGSPPVYLHGMAVN
        CP+ELELTSS+FED+AGLPNCCGV+SCT                            SIVAGFRG+KDDSTVLMS+TLFKDIE+GRLL SPPVYLHG+AVN
Subjt:  CPSELELTSSSFEDIAGLPNCCGVISCT----------------------------SIVAGFRGDKDDSTVLMSTTLFKDIEEGRLLGSPPVYLHGMAVN

Query:  QYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLD
        +YLFGHGEYPLLPWL+VPFAGAVSGSTEESFNEAHRLMCIPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESL+SLD
Subjt:  QYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLD

Query:  HSSQYVGIGLNEDSPDEKACMIQKALALRARELHT
        H SQYV  GLN DS +EKA +IQ+ALALRARELH+
Subjt:  HSSQYVGIGLNEDSPDEKACMIQKALALRARELHT

A0A5D3CRB2 Putative nuclease HARBI19.14e-23979.31Show/hide
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
        MDS +LAALLSSLISQLLLLL LLFPSSNPHSL SNS+ DS+FYANLF    HFLFSQ  AASL FLSVSRKRKRT+  + LELG S         GRVH
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH

Query:  -LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
         L RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFP
Subjt:  -LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP

Query:  CPSELELTSSSFEDIAGLPNCCGVISCT----------------------------SIVAGFRGDKDDSTVLMSTTLFKDIEEGRLLGSPPVYLHGMAVN
        CP+ELELTSS+FED+AGLPNCCGV+SCT                            SIVAGFRG+KDDSTVLMS+TLFKDIE+GRLL SPPVYLHG+AVN
Subjt:  CPSELELTSSSFEDIAGLPNCCGVISCT----------------------------SIVAGFRGDKDDSTVLMSTTLFKDIEEGRLLGSPPVYLHGMAVN

Query:  QYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLD
        +YLFG GEYPLLPWL+VPFAGAVSGSTEESFNEAHRLMCIPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESL+SLD
Subjt:  QYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLD

Query:  HSSQYVGIGLNEDSPDEKACMIQKALALRARELHT
        H SQYV  GLN DS +EKA +IQ+ALA RARELH+
Subjt:  HSSQYVGIGLNEDSPDEKACMIQKALALRARELHT

A0A6J1D7F1 protein ALP1-like1.24e-25988.18Show/hide
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
        MDSR+LAAL+SSLISQLLL L LLFPSSNPHSLLSN  SDS+FYAN FPL  HFLFSQ+IA+SLSFLSVSRKRKRTH  E LEL PS  GG  GGRGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH

Query:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        LL TR PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPL+LSAEIRLGVGLSRLATGCDFSTIS+QFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PSELELTSSSFEDIAGLPNCCGVISCTSIVAGFRGDKDDSTVLMSTTLFKDIEEGRLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEE
        P+ELE TSS+FE +AGLPNCCGV++CTSIVAGFRGDKDDSTVLMS+TLFKDIEEGRLL SPPVYLHGMAVNQY FGHGEYPLLPWLMVPF+GAVSGSTEE
Subjt:  PSELELTSSSFEDIAGLPNCCGVISCTSIVAGFRGDKDDSTVLMSTTLFKDIEEGRLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEE

Query:  SFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSPDEKACMIQKALALR
        SFN+AHRLMCIPALKAI+SLRNWGVLSQPM EEFKTAVAYIGACSILHNALLMREDF+AMADEWE LASLDH SQY+G GLNEDS DEKA +IQ+ALALR
Subjt:  SFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSPDEKACMIQKALALR

Query:  ARELHT
        ARELH+
Subjt:  ARELHT

A0A6J1FNZ2 protein ALP1-like1.90e-29399.26Show/hide
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
        MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH

Query:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PSELELTSSSFEDIAGLPNCCGVISCTSIVAGFRGDKDDSTVLMSTTLFKDIEEGRLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEE
        PSELELTSSSFEDIAGLPNCCGVISCTSIVAGFRGDKDDSTVLMSTTLFKDIEE RLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEE
Subjt:  PSELELTSSSFEDIAGLPNCCGVISCTSIVAGFRGDKDDSTVLMSTTLFKDIEEGRLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEE

Query:  SFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSPDEKACMIQKALALR
        SFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSPDEKA M+QKALALR
Subjt:  SFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSPDEKACMIQKALALR

Query:  ARELHT
        ARELHT
Subjt:  ARELHT

A0A6J1J0M5 protein ALP1-like1.05e-29098.52Show/hide
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
        MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH

Query:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PSELELTSSSFEDIAGLPNCCGVISCTSIVAGFRGDKDDSTVLMSTTLFKDIEEGRLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEE
        PSELELTSS+FEDIAGLPNCCGVISCTSIVAGFRGDKDDSTVLMSTTLFKDIEE RLLGSPPVYLHG+AVNQYLFGHG+YPLLPWLMVPFAGAVSGSTEE
Subjt:  PSELELTSSSFEDIAGLPNCCGVISCTSIVAGFRGDKDDSTVLMSTTLFKDIEEGRLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEE

Query:  SFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSPDEKACMIQKALALR
        SFNEAHRLM IPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSPDEKA MIQKALALR
Subjt:  SFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSPDEKACMIQKALALR

Query:  ARELHT
        ARELHT
Subjt:  ARELHT

SwissProt top hitse value%identityAlignment
Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 12.4e-2125.15Show/hide
Query:  LSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVHLLRTRSPD-------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSAEIRLGVGLS
        L  ++K  +    + +   P D    D        LR  SP        +F++ FR + +TF ++  L+   L  R P G        LS E ++ + L 
Subjt:  LSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVHLLRTRSPD-------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSAEIRLGVGLS

Query:  RLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPSELELTSSSFEDIAGLPNCCGVISCTSI--------------------------
        RLA+G    ++   FGV +S       +    L    +  + +P    +E   S FE++ GLPNCCG I  T I                          
Subjt:  RLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPSELELTSSSFEDIAGLPNCCGVISCTSI--------------------------

Query:  -----------VAGFRGDKDDSTVLMSTTLFKDIEEGRLL-GSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAI
                   V G+ G    S +L  +  FK  E  ++L G+P     G  + +Y+ G   YPLLPWL+ P        +  +FNE H  +   A  A 
Subjt:  -----------VAGFRGDKDDSTVLMSTTLFKDIEEGRLL-GSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAI

Query:  ISLR-NWGVLSQPM-HEEFKTAVAYIGACSILHNALLMREDF
          L+ +W +LS+ M   + +   + I  C +LHN ++   D+
Subjt:  ISLR-NWGVLSQPM-HEEFKTAVAYIGACSILHNALLMREDF

Q9M2U3 Protein ALP1-like1.3e-2226.73Show/hide
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+PL L+   R+ V L RL +G   S I + FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWV

Query:  EFPCPSELELTSSSFEDIAGLPNCCGVISCTSIV--------------------------------------AGFRGDKDDSTVLMSTTLFKDIEEGRLL
            PS+L+   S FE I+GLPNCCG I  T IV                                      AG+ G  +D  VL ++  +K +E+G+ L
Subjt:  EFPCPSELELTSSSFEDIAGLPNCCGVISCTSIV--------------------------------------AGFRGDKDDSTVLMSTTLFKDIEEGRLL

Query:  GSPPVYL-HGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRN-WGVLSQPMHEEFKTAV-AYIGACSILHNALLMRE
            + L     + +Y+ G   +PLLPWL+ P+ G  +   +  FN+ H      A  A+  L++ W +++  M    +  +   I  C +LHN ++  E
Subjt:  GSPPVYL-HGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRN-WGVLSQPMHEEFKTAV-AYIGACSILHNALLMRE

Query:  DFT
        D T
Subjt:  DFT

Arabidopsis top hitse value%identityAlignment
AT3G19120.1 PIF / Ping-Pong family of plant transposases1.7e-1424.43Show/hide
Query:  SLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHF-LFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVHLLRTRSPD--
        +++S LL L   L P+S   S  S SS  S   ++L    +   L    +A+ LSFL+V+R    + SS      PS       G   V   R  + D  
Subjt:  SLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHF-LFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVHLLRTRSPD--

Query:  ----------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTN-FRFWVE
                   +R+ + ++   F  +   L+P +       S L L A+  + + LSRLA GC   T++ ++ +   +       + R+L T  +  +++
Subjt:  ----------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTN-FRFWVE

Query:  FPC-PSELELTSSSFEDIAGLPNCCGVISCT---------------------------SIVAGFR-----------GDKDDSTVLMSTTLFKDIEEGRLL
         P     L  T+  FE++  LPN CG I  T                            +VA  +           G +DDS+    + L+K +  G ++
Subjt:  FPC-PSELELTSSSFEDIAGLPNCCGVISCT---------------------------SIVAGFR-----------GDKDDSTVLMSTTLFKDIEEGRLL

Query:  GSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISL--RNWGVLSQPMHEEFKTAVAYIGACSILHN
            + + G  V  Y+ G   YPLL +LM PF+   SG+  E+  +   +     +   I L    W +L Q ++     A   I AC +LHN
Subjt:  GSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISL--RNWGVLSQPMHEEFKTAVAYIGACSILHN

AT3G55350.1 PIF / Ping-Pong family of plant transposases9.1e-2426.73Show/hide
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+PL L+   R+ V L RL +G   S I + FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWV

Query:  EFPCPSELELTSSSFEDIAGLPNCCGVISCTSIV--------------------------------------AGFRGDKDDSTVLMSTTLFKDIEEGRLL
            PS+L+   S FE I+GLPNCCG I  T IV                                      AG+ G  +D  VL ++  +K +E+G+ L
Subjt:  EFPCPSELELTSSSFEDIAGLPNCCGVISCTSIV--------------------------------------AGFRGDKDDSTVLMSTTLFKDIEEGRLL

Query:  GSPPVYL-HGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRN-WGVLSQPMHEEFKTAV-AYIGACSILHNALLMRE
            + L     + +Y+ G   +PLLPWL+ P+ G  +   +  FN+ H      A  A+  L++ W +++  M    +  +   I  C +LHN ++  E
Subjt:  GSPPVYL-HGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRN-WGVLSQPMHEEFKTAV-AYIGACSILHNALLMRE

Query:  DFT
        D T
Subjt:  DFT

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)1.7e-2225.15Show/hide
Query:  LSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVHLLRTRSPD-------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSAEIRLGVGLS
        L  ++K  +    + +   P D    D        LR  SP        +F++ FR + +TF ++  L+   L  R P G        LS E ++ + L 
Subjt:  LSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVHLLRTRSPD-------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSAEIRLGVGLS

Query:  RLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPSELELTSSSFEDIAGLPNCCGVISCTSI--------------------------
        RLA+G    ++   FGV +S       +    L    +  + +P    +E   S FE++ GLPNCCG I  T I                          
Subjt:  RLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPSELELTSSSFEDIAGLPNCCGVISCTSI--------------------------

Query:  -----------VAGFRGDKDDSTVLMSTTLFKDIEEGRLL-GSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAI
                   V G+ G    S +L  +  FK  E  ++L G+P     G  + +Y+ G   YPLLPWL+ P        +  +FNE H  +   A  A 
Subjt:  -----------VAGFRGDKDDSTVLMSTTLFKDIEEGRLL-GSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAI

Query:  ISLR-NWGVLSQPM-HEEFKTAVAYIGACSILHNALLMREDF
          L+ +W +LS+ M   + +   + I  C +LHN ++   D+
Subjt:  ISLR-NWGVLSQPM-HEEFKTAVAYIGACSILHNALLMREDF

AT4G29780.1 unknown protein3.1e-1625.77Show/hide
Query:  DSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCR----VLCTNFRFWVEF
        D FR  FRM+ STF  +   L+  +       RD + +P       R+GV + RLATG     +S++FG+  S       ++CR    VL   +  W   
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCR----VLCTNFRFWVEF

Query:  PCPSELELTSSSFEDIAGLPNCCGVISCTSI--------VAGF-------RGDKDDSTVLMSTTLFKD-IEEGRLLGSPPVY----------------LH
        P  SE+  T + FE +  +PN  G I  T I        VA +       R  K   ++ +   +  D I     +G+P                     
Subjt:  PCPSELELTSSSFEDIAGLPNCCGVISCTSI--------VAGF-------RGDKDDSTVLMSTTLFKD-IEEGRLLGSPPVY----------------LH

Query:  GMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLR-NWGVLSQPMHEEFKTAVAYIGACSILHNALLMRED
        GM  + ++ G+  +PL  +L+VP+       T+ +FNE+   +   A  A   L+  W  L +    + +     +GAC +LHN   MR++
Subjt:  GMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLR-NWGVLSQPMHEEFKTAVAYIGACSILHNALLMRED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCCCGGCAATTGGCTGCTTTACTCTCTTCTTTGATCTCCCAACTACTCCTCCTGCTATTGCTACTCTTCCCTTCCTCCAACCCACATTCCCTTTTGTCCAATTC
TTCATCTGATTCCAATTTCTATGCTAATCTCTTTCCTCTCTTCAATCACTTCCTGTTTTCCCAGCAAATTGCTGCATCCCTTTCGTTTCTCTCCGTTTCGCGTAAGAGGA
AGAGGACGCATTCGTCGGAGCTGCTTGAATTAGGGCCATCCGATAGCGGTGGTGAGGACGGCGGCCGTGGACGAGTCCATCTGTTGCGGACTCGGAGTCCTGATTCTTTC
AGGAATCACTTTCGAATGACCTCCTCGACGTTTGAATGGCTCTCTGGTTTGCTCGAGCCTCTTCTCGAGTGTCGGGACCCGGTAGGTTCGCCTCTTGATCTCTCCGCTGA
GATTCGACTCGGTGTTGGCTTGTCCCGGCTGGCCACTGGCTGCGATTTCTCGACGATTTCGGACCAATTTGGCGTCTCGGAGTCTGTAGCGAGGTTCTGTGCTAAGCAAT
TGTGTCGTGTTCTCTGCACTAATTTTCGCTTTTGGGTTGAATTCCCTTGCCCCAGTGAGCTTGAATTAACATCCTCGAGCTTTGAAGATATTGCTGGGCTTCCGAATTGC
TGTGGCGTGATTTCTTGTACAAGTATTGTTGCAGGATTTCGTGGCGATAAAGATGACTCGACGGTGCTTATGTCCACAACGCTGTTTAAAGACATTGAAGAAGGAAGGCT
ACTGGGTTCTCCTCCTGTTTACCTTCATGGGATGGCTGTGAATCAATACTTATTTGGACATGGCGAATATCCTTTGCTTCCATGGTTAATGGTGCCTTTTGCAGGAGCTG
TTTCAGGGTCAACTGAAGAGAGTTTCAATGAAGCTCATCGATTGATGTGCATTCCAGCTCTGAAAGCCATCATTAGTTTGAGAAATTGGGGAGTTTTGAGCCAACCAATG
CATGAGGAATTCAAAACTGCTGTTGCATACATTGGTGCTTGCTCAATTCTTCACAATGCTTTGTTGATGAGGGAGGATTTTACTGCCATGGCTGATGAATGGGAGAGCTT
AGCTTCACTCGATCATAGCTCTCAGTATGTTGGTATTGGATTGAATGAGGATTCACCTGATGAGAAGGCTTGTATGATACAGAAAGCCTTGGCTCTCAGAGCTAGAGAGC
TTCACACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATTCCCGGCAATTGGCTGCTTTACTCTCTTCTTTGATCTCCCAACTACTCCTCCTGCTATTGCTACTCTTCCCTTCCTCCAACCCACATTCCCTTTTGTCCAATTC
TTCATCTGATTCCAATTTCTATGCTAATCTCTTTCCTCTCTTCAATCACTTCCTGTTTTCCCAGCAAATTGCTGCATCCCTTTCGTTTCTCTCCGTTTCGCGTAAGAGGA
AGAGGACGCATTCGTCGGAGCTGCTTGAATTAGGGCCATCCGATAGCGGTGGTGAGGACGGCGGCCGTGGACGAGTCCATCTGTTGCGGACTCGGAGTCCTGATTCTTTC
AGGAATCACTTTCGAATGACCTCCTCGACGTTTGAATGGCTCTCTGGTTTGCTCGAGCCTCTTCTCGAGTGTCGGGACCCGGTAGGTTCGCCTCTTGATCTCTCCGCTGA
GATTCGACTCGGTGTTGGCTTGTCCCGGCTGGCCACTGGCTGCGATTTCTCGACGATTTCGGACCAATTTGGCGTCTCGGAGTCTGTAGCGAGGTTCTGTGCTAAGCAAT
TGTGTCGTGTTCTCTGCACTAATTTTCGCTTTTGGGTTGAATTCCCTTGCCCCAGTGAGCTTGAATTAACATCCTCGAGCTTTGAAGATATTGCTGGGCTTCCGAATTGC
TGTGGCGTGATTTCTTGTACAAGTATTGTTGCAGGATTTCGTGGCGATAAAGATGACTCGACGGTGCTTATGTCCACAACGCTGTTTAAAGACATTGAAGAAGGAAGGCT
ACTGGGTTCTCCTCCTGTTTACCTTCATGGGATGGCTGTGAATCAATACTTATTTGGACATGGCGAATATCCTTTGCTTCCATGGTTAATGGTGCCTTTTGCAGGAGCTG
TTTCAGGGTCAACTGAAGAGAGTTTCAATGAAGCTCATCGATTGATGTGCATTCCAGCTCTGAAAGCCATCATTAGTTTGAGAAATTGGGGAGTTTTGAGCCAACCAATG
CATGAGGAATTCAAAACTGCTGTTGCATACATTGGTGCTTGCTCAATTCTTCACAATGCTTTGTTGATGAGGGAGGATTTTACTGCCATGGCTGATGAATGGGAGAGCTT
AGCTTCACTCGATCATAGCTCTCAGTATGTTGGTATTGGATTGAATGAGGATTCACCTGATGAGAAGGCTTGTATGATACAGAAAGCCTTGGCTCTCAGAGCTAGAGAGC
TTCACACTTAA
Protein sequenceShow/hide protein sequence
MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVHLLRTRSPDSF
RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPSELELTSSSFEDIAGLPNC
CGVISCTSIVAGFRGDKDDSTVLMSTTLFKDIEEGRLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPM
HEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSPDEKACMIQKALALRARELHT