CuGenDBv2

CmaCh04G004830 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene ID: CmaCh04G004830
Organism: Cucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
Description: DDE Tnp4 domain-containing protein
Genome location: Cma_Chr04:2459802..2461106
RNA-Seq Expression: CmaCh04G004830
Synteny: CmaCh04G004830
Gene Ontology terms:
  GO:0016021 - integral component of membrane (cellular component)
  GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR027806 - Harbinger transposase-derived nuclease domain
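
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, the snippet below splits such a string and computes the span length. The helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re


def parse_location(location: str):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("Cma_Chr04:2459802..2461106")
print(seqid, start, end, span_length(start, end))
```

Under that coordinate assumption the span is 1305 bp.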


Homology
GenBank top hits | e value | % identity | Alignment
KAG6600319.1 Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. sororia] | 4.7e-222 | 92.17 | shown below
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
        MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH

Query:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGVAVNQ
        PSELELTSS+FEDIAGLPNCCGVISCT                            SIVAGFRGDKDDSTVLMSTTLFKDIEE RLLGSPPVYLHG+AVNQ
Subjt:  PSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGVAVNQ

Query:  YLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
        YLFGHG+YPLLPWLMVPFAGAVSGSTEESFNEAHRLM IPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
Subjt:  YLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH

Query:  SSQYVGIGLNEDSPDEKASMIQKALALRARELHT
        SSQYVGIGLNEDSPDEKA MIQKALALRARELHT
Subjt:  SSQYVGIGLNEDSPDEKASMIQKALALRARELHT

KAG7030976.1 Protein ALP1-like protein [Cucurbita argyrosperma subsp. argyrosperma] | 2.3e-221 | 91.94 | shown below
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
        MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH

Query:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGVAVNQ
        PSELELTSS+FEDIAGLPNCCGVISCT                            SIVAGFRGDKDDSTVLMST LFKDIEE RLLGSPPVYLHG+AVNQ
Subjt:  PSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGVAVNQ

Query:  YLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
        YLFGHG+YPLLPWLMVPFAGAVSGSTEESFNEAHRLM IPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
Subjt:  YLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH

Query:  SSQYVGIGLNEDSPDEKASMIQKALALRARELHT
        SSQYVGIGLNEDSPDEKA MIQKALALRARELHT
Subjt:  SSQYVGIGLNEDSPDEKASMIQKALALRARELHT

XP_022941624.1 protein ALP1-like [Cucurbita moschata] | 7.3e-223 | 92.17 | shown below
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
        MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH

Query:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGVAVNQ
        PSELELTSS+FEDIAGLPNCCGVISCT                            SIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHG+AVNQ
Subjt:  PSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGVAVNQ

Query:  YLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
        YLFGHG+YPLLPWLMVPFAGAVSGSTEESFNEAHRLM IPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
Subjt:  YLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH

Query:  SSQYVGIGLNEDSPDEKASMIQKALALRARELHT
        SSQYVGIGLNEDSPDEKA+M+QKALALRARELHT
Subjt:  SSQYVGIGLNEDSPDEKASMIQKALALRARELHT

XP_022980954.1 protein ALP1-like [Cucurbita maxima] | 3.5e-225 | 93.55 | shown below
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
        MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH

Query:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGVAVNQ
        PSELELTSSAFEDIAGLPNCCGVISCT                            SIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGVAVNQ
Subjt:  PSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGVAVNQ

Query:  YLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
        YLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
Subjt:  YLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH

Query:  SSQYVGIGLNEDSPDEKASMIQKALALRARELHT
        SSQYVGIGLNEDSPDEKASMIQKALALRARELHT
Subjt:  SSQYVGIGLNEDSPDEKASMIQKALALRARELHT

XP_023536005.1 protein ALP1-like [Cucurbita pepo subsp. pepo] | 5.6e-223 | 92.63 | shown below
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
        MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH

Query:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGVAVNQ
        PSELELTSSAFEDIAGLPNCCGVISCT                            SIVAGFRGDKDDSTVLMSTTLFKDIEE RLLGSPPVYLHG+AVNQ
Subjt:  PSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGVAVNQ

Query:  YLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
        YLFGHG+YPLLPWLMVPFAGAVSGSTEESFNEAHRLM IPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
Subjt:  YLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH

Query:  SSQYVGIGLNEDSPDEKASMIQKALALRARELHT
        SSQYVGIGLNEDSPDEKASMIQKALALRARELHT
Subjt:  SSQYVGIGLNEDSPDEKASMIQKALALRARELHT

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LBX6 DDE Tnp4 domain-containing protein | 9.3e-208 | 85.75 | shown below
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRV-
        MDS +LAALLSSLISQLLLLL LLFPSSNPHSL SNS+ DS+FYANLF    HFLFSQ  AASL FLSVSRKRKRT+ S+ LELG S         GRV 
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRV-

Query:  HLLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
        HL RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFP
Subjt:  HLLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP

Query:  CPSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGVAVN
        CP+ELELTSSAFED+AGLPNCCGV+SCTRFKIIRN++FYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMS+TLFKDIE+ RLL SPPVYLHGVAVN
Subjt:  CPSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGVAVN

Query:  QYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLD
        +YLFGHG+YPLLPWL+VPFAGAVSGSTEESFNEAHRLM IPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESL+SLD
Subjt:  QYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLD

Query:  HSSQYVGIGLNEDSPDEKASMIQKALALRARELHT
        H SQYV  GLN DS +EKAS+IQ+ALALRARELH+
Subjt:  HSSQYVGIGLNEDSPDEKASMIQKALALRARELHT

A0A5D3CRB2 Putative nuclease HARBI1 | 1.9e-205 | 85.06 | shown below
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRV-
        MDS +LAALLSSLISQLLLLL LLFPSSNPHSL SNS+ DS+FYAN   LF HFLFSQ  AASL FLSVSRKRKRT+  + LELG S         GRV 
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRV-

Query:  HLLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
        HL RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFP
Subjt:  HLLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP

Query:  CPSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGVAVN
        CP+ELELTSSAFED+AGLPNCCGV+SCTRFKIIRN++FYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMS+TLFKDIE+ RLL SPPVYLHGVAVN
Subjt:  CPSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGVAVN

Query:  QYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLD
        +YLFG G+YPLLPWL+VPFAGAVSGSTEESFNEAHRLM IPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESL+SLD
Subjt:  QYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLD

Query:  HSSQYVGIGLNEDSPDEKASMIQKALALRARELHT
        H SQYV  GLN DS +EKAS+IQ+ALA RARELH+
Subjt:  HSSQYVGIGLNEDSPDEKASMIQKALALRARELHT

A0A6J1D7F1 protein ALP1-like | 9.7e-197 | 82.03 | shown below
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
        MDSR+LAAL+SSLISQLLL L LLFPSSNPHSLLSN  SDS+FYAN FPL  HFLFSQ+IA+SLSFLSVSRKRKRTH  E LEL PS  GG  GGRGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH

Query:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        LL TR PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPL+LSAEIRLGVGLSRLATGCDFSTIS+QFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGVAVNQ
        P+ELE TSSAFE +AGLPNCCGV++CT                            SIVAGFRGDKDDSTVLMS+TLFKDIEE RLL SPPVYLHG+AVNQ
Subjt:  PSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGVAVNQ

Query:  YLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
        Y FGHG+YPLLPWLMVPF+GAVSGSTEESFN+AHRLM IPALKAI+SLRNWGVLSQPM EEFKTAVAYIGACSILHNALLMREDF+AMADEWE LASLDH
Subjt:  YLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH

Query:  SSQYVGIGLNEDSPDEKASMIQKALALRARELHT
         SQY+G GLNEDS DEKAS+IQ+ALALRARELH+
Subjt:  SSQYVGIGLNEDSPDEKASMIQKALALRARELHT

A0A6J1FNZ2 protein ALP1-like | 3.5e-223 | 92.17 | shown below
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
        MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH

Query:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGVAVNQ
        PSELELTSS+FEDIAGLPNCCGVISCT                            SIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHG+AVNQ
Subjt:  PSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGVAVNQ

Query:  YLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
        YLFGHG+YPLLPWLMVPFAGAVSGSTEESFNEAHRLM IPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
Subjt:  YLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH

Query:  SSQYVGIGLNEDSPDEKASMIQKALALRARELHT
        SSQYVGIGLNEDSPDEKA+M+QKALALRARELHT
Subjt:  SSQYVGIGLNEDSPDEKASMIQKALALRARELHT

A0A6J1J0M5 protein ALP1-like | 1.7e-225 | 93.55 | shown below
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
        MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH

Query:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGVAVNQ
        PSELELTSSAFEDIAGLPNCCGVISCT                            SIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGVAVNQ
Subjt:  PSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGVAVNQ

Query:  YLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
        YLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
Subjt:  YLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH

Query:  SSQYVGIGLNEDSPDEKASMIQKALALRARELHT
        SSQYVGIGLNEDSPDEKASMIQKALALRARELHT
Subjt:  SSQYVGIGLNEDSPDEKASMIQKALALRARELHT

SwissProt top hits | e value | % identity | Alignment
Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 | 5.9e-26 | 26.9 | shown below
Query:  LSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVHLLRTRSPD-------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSAEIRLGVGLS
        L  ++K  +    + +   P D    D        LR  SP        +F++ FR + +TF ++  L+   L  R P G        LS E ++ + L 
Subjt:  LSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVHLLRTRSPD-------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSAEIRLGVGLS

Query:  RLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPSELELTSSAFEDIAGLPNCCGVISCTR----FKIIRNTNFYED-----SIATQL
        RLA+G    ++   FGV +S       +    L    +  + +P    +E   S FE++ GLPNCCG I  T        ++ ++ + D     S+  Q 
Subjt:  RLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPSELELTSSAFEDIAGLPNCCGVISCTR----FKIIRNTNFYED-----SIATQL

Query:  VVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLL-GSPPVYLHGVAVNQYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAI
        V D   R L++V G+ G    S +L  +  FK  E  ++L G+P     G  + +Y+ G   YPLLPWL+ P        +  +FNE H  +   A  A 
Subjt:  VVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLL-GSPPVYLHGVAVNQYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAI

Query:  ISLR-NWGVLSQPM-HEEFKTAVAYIGACSILHNALLMREDF
          L+ +W +LS+ M   + +   + I  C +LHN ++   D+
Subjt:  ISLR-NWGVLSQPM-HEEFKTAVAYIGACSILHNALLMREDF

Q9M2U3 Protein ALP1-like | 2.8e-28 | 29.18 | shown below
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+PL L+   R+ V L RL +G   S I + FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWV

Query:  EFPCPSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEE-E
            PS+L+   S FE I+GLPNCCG I  T   I+ N    E             S+  Q VVD   R L ++AG+ G  +D  VL ++  +K +E+ +
Subjt:  EFPCPSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEE-E

Query:  RLLGSPPVYLHGVAVNQYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRN-WGVLSQPMHEEFKTAV-AYIGACSILHNALLM
        RL G          + +Y+ G   +PLLPWL+ P+ G  +   +  FN+ H   +  A  A+  L++ W +++  M    +  +   I  C +LHN ++ 
Subjt:  RLLGSPPVYLHGVAVNQYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRN-WGVLSQPMHEEFKTAV-AYIGACSILHNALLM

Query:  REDFT
         ED T
Subjt:  REDFT

Arabidopsis top hits | e value | % identity | Alignment
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714) | 2.8e-15 | 30.54 | shown below
Query:  RLATGCDFSTISDQFGV-SESVARFCAKQLCRVLCTNFRFWVEFPCPSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRI
        RLA G  +  +  +FG  S S A      +C+++       ++ P P   + + +       LPNC GV+   RF++       + SI  Q +VDS+ R 
Subjt:  RLATGCDFSTISDQFGV-SESVARFCAKQLCRVLCTNFRFWVEFPCPSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRI

Query:  LSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGVAVNQYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHR------LMSIPALKAIISL
        + I AG+        +   T LF  I EE L G+P    +GV V +Y+ G    PLLPWL+ P+      S EESF E         L S+    A +  
Subjt:  LSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGVAVNQYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHR------LMSIPALKAIISL

Query:  RNWGVLS---QPMHEEFKTAVAYIGACSILHNALLMRED
        R W +L    +P   EF   V   G   +LHN L+   D
Subjt:  RNWGVLS---QPMHEEFKTAVAYIGACSILHNALLMRED

AT3G19120.1 PIF / Ping-Pong family of plant transposases | 2.9e-20 | 25.95 | shown below
Query:  SLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHF-LFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVHLLRTRSPD--
        +++S LL L   L P+S   S  S SS  S   ++L    +   L    +A+ LSFL+V+R    + SS      PS       G   V   R  + D  
Subjt:  SLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHF-LFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVHLLRTRSPD--

Query:  ----------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTN-FRFWVE
                   +R+ + ++   F  +   L+P +       S L L A+  + + LSRLA GC   T++ ++ +   +       + R+L T  +  +++
Subjt:  ----------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTN-FRFWVE

Query:  FPC-PSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNT-----NFY-----EDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLL
         P     L  T+  FE++  LPN CG I  T  K+ R T     N Y      D++  Q+V D       +     G +DDS+    + L+K +    ++
Subjt:  FPC-PSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNT-----NFY-----EDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLL

Query:  GSPPVYLHGVAVNQYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISL--RNWGVLSQPMHEEFKTAVAYIGACSILHN
            + + G  V  Y+ G   YPLL +LM PF+   SG+  E+  +   +     +   I L    W +L Q ++     A   I AC +LHN
Subjt:  GSPPVYLHGVAVNQYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISL--RNWGVLSQPMHEEFKTAVAYIGACSILHN

AT3G55350.1 PIF / Ping-Pong family of plant transposases | 2.0e-29 | 29.18 | shown below
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+PL L+   R+ V L RL +G   S I + FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWV

Query:  EFPCPSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEE-E
            PS+L+   S FE I+GLPNCCG I  T   I+ N    E             S+  Q VVD   R L ++AG+ G  +D  VL ++  +K +E+ +
Subjt:  EFPCPSELELTSSAFEDIAGLPNCCGVISCTRFKIIRNTNFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEE-E

Query:  RLLGSPPVYLHGVAVNQYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRN-WGVLSQPMHEEFKTAV-AYIGACSILHNALLM
        RL G          + +Y+ G   +PLLPWL+ P+ G  +   +  FN+ H   +  A  A+  L++ W +++  M    +  +   I  C +LHN ++ 
Subjt:  RLLGSPPVYLHGVAVNQYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLRN-WGVLSQPMHEEFKTAV-AYIGACSILHNALLM

Query:  REDFT
         ED T
Subjt:  REDFT

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912) | 4.2e-27 | 26.9 | shown below
Query:  LSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVHLLRTRSPD-------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSAEIRLGVGLS
        L  ++K  +    + +   P D    D        LR  SP        +F++ FR + +TF ++  L+   L  R P G        LS E ++ + L 
Subjt:  LSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVHLLRTRSPD-------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSAEIRLGVGLS

Query:  RLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPSELELTSSAFEDIAGLPNCCGVISCTR----FKIIRNTNFYED-----SIATQL
        RLA+G    ++   FGV +S       +    L    +  + +P    +E   S FE++ GLPNCCG I  T        ++ ++ + D     S+  Q 
Subjt:  RLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPSELELTSSAFEDIAGLPNCCGVISCTR----FKIIRNTNFYED-----SIATQL

Query:  VVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLL-GSPPVYLHGVAVNQYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAI
        V D   R L++V G+ G    S +L  +  FK  E  ++L G+P     G  + +Y+ G   YPLLPWL+ P        +  +FNE H  +   A  A 
Subjt:  VVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLL-GSPPVYLHGVAVNQYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAI

Query:  ISLR-NWGVLSQPM-HEEFKTAVAYIGACSILHNALLMREDF
          L+ +W +LS+ M   + +   + I  C +LHN ++   D+
Subjt:  ISLR-NWGVLSQPM-HEEFKTAVAYIGACSILHNALLMREDF

AT4G29780.1 unknown protein | 8.5e-20 | 26.49 | shown below
Query:  DSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCR----VLCTNFRFWVEF
        D FR  FRM+ STF  +   L+  +       RD + +P       R+GV + RLATG     +S++FG+  S       ++CR    VL   +  W   
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCR----VLCTNFRFWVEF

Query:  PCPSELELTSSAFEDIAGLPNCCGVISCTRFKII------------RNTNFYED---SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEE
        P  SE+  T + FE +  +PN  G I  T   II            R+T   +    SI  Q VV++      +  G  G   D  +L  ++L       
Subjt:  PCPSELELTSSAFEDIAGLPNCCGVISCTRFKII------------RNTNFYED---SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEE

Query:  RLLGSPPVYLHGVAVNQYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLR-NWGVLSQPMHEEFKTAVAYIGACSILHNALLMR
            S      G+  + ++ G+  +PL  +L+VP+       T+ +FNE+   +   A  A   L+  W  L +    + +     +GAC +LHN   MR
Subjt:  RLLGSPPVYLHGVAVNQYLFGHGDYPLLPWLMVPFAGAVSGSTEESFNEAHRLMSIPALKAIISLR-NWGVLSQPMHEEFKTAVAYIGACSILHNALLMR

Query:  ED
        ++
Subjt:  ED
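
The alignments above use the usual BLAST-style match line between `Query:` and `Subjt:`: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. Because the `Subjt:` rows in this listing repeat the query residues, the sketch below reads only the query row and the match line. The function name `midline_stats` is hypothetical, and the sketch covers a single alignment row, not a whole hit.

```python
def midline_stats(query: str, midline: str):
    """Return (identities, positives, row_length) for one alignment row.

    Identities are positions where the match line repeats the query residue;
    positives additionally count '+' (conservative substitution) positions.
    """
    if len(query) != len(midline):
        raise ValueError("query row and match line must have equal length")
    identities = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Last row of the first GenBank hit (KAG6600319.1), copied from the listing.
query_row = "SSQYVGIGLNEDSPDEKASMIQKALALRARELHT"
match_row = "SSQYVGIGLNEDSPDEKA MIQKALALRARELHT"
identities, positives, length = midline_stats(query_row, match_row)
print(f"{identities}/{length} identical ({100 * identities / length:.2f}%)")
```

Summing identities and row lengths over every row of a hit gives an overall percentage. Whether that matches the tabulated % identity depends on how the database counts gapped columns, which this page does not specify.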


Sequences
CDS sequence
ATGGATTCCCGGCAATTGGCTGCTTTACTCTCTTCTTTGATCTCCCAACTACTCCTCCTGCTATTGCTTCTCTTCCCTTCCTCCAACCCACATTCTCTTTTGTCCAATTC
TTCATCTGATTCCAATTTCTATGCTAATCTCTTTCCTCTCTTCAATCACTTCCTGTTTTCCCAGCAAATTGCTGCATCCCTTTCGTTTCTCTCCGTTTCGCGTAAGAGGA
AGAGGACGCATTCGTCGGAGCTGCTTGAATTAGGGCCATCCGATAGCGGTGGTGAGGACGGCGGCCGTGGACGAGTTCATCTGTTGCGGACTCGGAGTCCTGATTCTTTC
AGGAATCACTTTCGGATGACCTCCTCGACGTTTGAATGGCTCTCTGGTTTGCTCGAGCCGCTTCTCGAGTGTCGTGACCCGGTAGGTTCGCCTCTTGATCTCTCCGCTGA
GATTCGACTCGGTGTTGGCTTGTCCCGGCTGGCCACAGGCTGCGATTTCTCGACGATTTCGGACCAATTTGGCGTCTCGGAGTCGGTAGCGAGGTTCTGTGCTAAGCAAT
TGTGTCGTGTTCTCTGCACTAATTTTCGCTTTTGGGTTGAATTCCCTTGCCCCAGTGAGCTCGAATTAACATCCTCAGCCTTTGAAGATATTGCTGGGCTTCCGAATTGC
TGTGGCGTGATTTCTTGTACAAGGTTCAAGATCATTAGAAATACCAATTTTTATGAAGATAGCATCGCTACTCAACTTGTTGTTGATTCCTCGTCACGAATCCTTAGTAT
TGTTGCAGGATTTCGTGGCGATAAAGACGACTCCACGGTGCTTATGTCCACAACGCTGTTTAAAGACATTGAAGAAGAAAGGCTACTGGGTTCTCCTCCTGTTTACCTTC
ATGGGGTGGCTGTGAATCAATACTTGTTTGGACATGGCGACTATCCTTTGCTTCCATGGTTAATGGTGCCTTTTGCAGGAGCTGTTTCAGGGTCAACTGAAGAGAGTTTC
AATGAAGCTCATCGCTTGATGTCCATTCCAGCTCTGAAAGCCATCATTAGTTTGAGAAATTGGGGAGTTTTGAGCCAACCAATGCATGAGGAATTCAAAACTGCTGTTGC
ATACATTGGTGCTTGCTCAATTCTTCACAATGCTTTGTTGATGAGGGAGGATTTTACTGCCATGGCTGATGAATGGGAGAGCTTAGCTTCACTCGATCATAGCTCTCAGT
ATGTTGGTATTGGATTGAATGAGGATTCACCTGATGAGAAGGCTTCTATGATACAGAAGGCCTTGGCTCTGAGAGCTAGAGAGCTTCACACTTAA
mRNA sequence
ATGGATTCCCGGCAATTGGCTGCTTTACTCTCTTCTTTGATCTCCCAACTACTCCTCCTGCTATTGCTTCTCTTCCCTTCCTCCAACCCACATTCTCTTTTGTCCAATTC
TTCATCTGATTCCAATTTCTATGCTAATCTCTTTCCTCTCTTCAATCACTTCCTGTTTTCCCAGCAAATTGCTGCATCCCTTTCGTTTCTCTCCGTTTCGCGTAAGAGGA
AGAGGACGCATTCGTCGGAGCTGCTTGAATTAGGGCCATCCGATAGCGGTGGTGAGGACGGCGGCCGTGGACGAGTTCATCTGTTGCGGACTCGGAGTCCTGATTCTTTC
AGGAATCACTTTCGGATGACCTCCTCGACGTTTGAATGGCTCTCTGGTTTGCTCGAGCCGCTTCTCGAGTGTCGTGACCCGGTAGGTTCGCCTCTTGATCTCTCCGCTGA
GATTCGACTCGGTGTTGGCTTGTCCCGGCTGGCCACAGGCTGCGATTTCTCGACGATTTCGGACCAATTTGGCGTCTCGGAGTCGGTAGCGAGGTTCTGTGCTAAGCAAT
TGTGTCGTGTTCTCTGCACTAATTTTCGCTTTTGGGTTGAATTCCCTTGCCCCAGTGAGCTCGAATTAACATCCTCAGCCTTTGAAGATATTGCTGGGCTTCCGAATTGC
TGTGGCGTGATTTCTTGTACAAGGTTCAAGATCATTAGAAATACCAATTTTTATGAAGATAGCATCGCTACTCAACTTGTTGTTGATTCCTCGTCACGAATCCTTAGTAT
TGTTGCAGGATTTCGTGGCGATAAAGACGACTCCACGGTGCTTATGTCCACAACGCTGTTTAAAGACATTGAAGAAGAAAGGCTACTGGGTTCTCCTCCTGTTTACCTTC
ATGGGGTGGCTGTGAATCAATACTTGTTTGGACATGGCGACTATCCTTTGCTTCCATGGTTAATGGTGCCTTTTGCAGGAGCTGTTTCAGGGTCAACTGAAGAGAGTTTC
AATGAAGCTCATCGCTTGATGTCCATTCCAGCTCTGAAAGCCATCATTAGTTTGAGAAATTGGGGAGTTTTGAGCCAACCAATGCATGAGGAATTCAAAACTGCTGTTGC
ATACATTGGTGCTTGCTCAATTCTTCACAATGCTTTGTTGATGAGGGAGGATTTTACTGCCATGGCTGATGAATGGGAGAGCTTAGCTTCACTCGATCATAGCTCTCAGT
ATGTTGGTATTGGATTGAATGAGGATTCACCTGATGAGAAGGCTTCTATGATACAGAAGGCCTTGGCTCTGAGAGCTAGAGAGCTTCACACTTAA
Protein sequence
MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVHLLRTRSPDSF
RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPSELELTSSAFEDIAGLPNC
CGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGVAVNQYLFGHGDYPLLPWLMVPFAGAVSGSTEESF
NEAHRLMSIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSPDEKASMIQKALALRARELHT
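
The CDS and protein sequences above can be cross-checked by translating the CDS. The snippet below is a minimal sketch that assumes the standard genetic code (NCBI translation table 1). The page does not say which table was used, and the function name `translate` is hypothetical. Unknown codons are reported as `X`, and translation stops at the first stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and last five codons of the CDS listed above.
print(translate("ATGGATTCCCGGCAATTGGCTGCT"))
print(translate("GAGCTTCACACTTAA"))
```

The two fragments translate to `MDSRQLAA` and `ELHT`, which agree with the start and end of the protein sequence shown above.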