; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh04G005170 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh04G005170
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionDDE Tnp4 domain-containing protein
Genome locationCmo_Chr04:2567867..2569171
RNA-Seq ExpressionCmoCh04G005170
SyntenyCmoCh04G005170
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600319.1 Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. sororia]3.0e-22492.86Show/hide
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
        MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH

Query:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PSELELTSSSFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGMAVNQ
        PSELELTSSSFEDIAGLPNCCGVISCT                            SIVAGFRGDKDDSTVLMSTTLFKDIEE RLLGSPPVYLHGMAVNQ
Subjt:  PSELELTSSSFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGMAVNQ

Query:  YLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
        YLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
Subjt:  YLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH

Query:  SSQYVGIGLNEDSPDEKATMLQKALALRARELHT
        SSQYVGIGLNEDSPDEKA M+QKALALRARELHT
Subjt:  SSQYVGIGLNEDSPDEKATMLQKALALRARELHT

KAG7030976.1 Protein ALP1-like protein [Cucurbita argyrosperma subsp. argyrosperma]1.5e-22392.63Show/hide
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
        MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH

Query:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PSELELTSSSFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGMAVNQ
        PSELELTSSSFEDIAGLPNCCGVISCT                            SIVAGFRGDKDDSTVLMST LFKDIEE RLLGSPPVYLHGMAVNQ
Subjt:  PSELELTSSSFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGMAVNQ

Query:  YLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
        YLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
Subjt:  YLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH

Query:  SSQYVGIGLNEDSPDEKATMLQKALALRARELHT
        SSQYVGIGLNEDSPDEKA M+QKALALRARELHT
Subjt:  SSQYVGIGLNEDSPDEKATMLQKALALRARELHT

XP_022941624.1 protein ALP1-like [Cucurbita moschata]5.4e-22693.55Show/hide
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
        MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH

Query:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PSELELTSSSFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGMAVNQ
        PSELELTSSSFEDIAGLPNCCGVISCT                            SIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGMAVNQ
Subjt:  PSELELTSSSFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGMAVNQ

Query:  YLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
        YLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
Subjt:  YLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH

Query:  SSQYVGIGLNEDSPDEKATMLQKALALRARELHT
        SSQYVGIGLNEDSPDEKATMLQKALALRARELHT
Subjt:  SSQYVGIGLNEDSPDEKATMLQKALALRARELHT

XP_022980954.1 protein ALP1-like [Cucurbita maxima]5.6e-22392.17Show/hide
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
        MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH

Query:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PSELELTSSSFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGMAVNQ
        PSELELTSS+FEDIAGLPNCCGVISCT                            SIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHG+AVNQ
Subjt:  PSELELTSSSFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGMAVNQ

Query:  YLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
        YLFGHG+YPLLPWLMVPFAGAVSGSTEESFNEAHRLM IPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
Subjt:  YLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH

Query:  SSQYVGIGLNEDSPDEKATMLQKALALRARELHT
        SSQYVGIGLNEDSPDEKA+M+QKALALRARELHT
Subjt:  SSQYVGIGLNEDSPDEKATMLQKALALRARELHT

XP_023536005.1 protein ALP1-like [Cucurbita pepo subsp. pepo]3.9e-22492.63Show/hide
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
        MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH

Query:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PSELELTSSSFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGMAVNQ
        PSELELTSS+FEDIAGLPNCCGVISCT                            SIVAGFRGDKDDSTVLMSTTLFKDIEE RLLGSPPVYLHGMAVNQ
Subjt:  PSELELTSSSFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGMAVNQ

Query:  YLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
        YLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
Subjt:  YLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH

Query:  SSQYVGIGLNEDSPDEKATMLQKALALRARELHT
        SSQYVGIGLNEDSPDEKA+M+QKALALRARELHT
Subjt:  SSQYVGIGLNEDSPDEKATMLQKALALRARELHT

TrEMBL top hitse value%identityAlignment
A0A0A0LBX6 DDE Tnp4 domain-containing protein4.2e-20885.29Show/hide
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRV-
        MDS +LAALLSSLISQLLLLL LLFPSSNPHSL SNS+ DS+FYANLF    HFLFSQ  AASL FLSVSRKRKRT+ S+ LELG S         GRV 
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRV-

Query:  HLLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
        HL RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFP
Subjt:  HLLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP

Query:  CPSELELTSSSFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGMAVN
        CP+ELELTSS+FED+AGLPNCCGV+SCTRFKIIRN++FYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMS+TLFKDIE+ RLL SPPVYLHG+AVN
Subjt:  CPSELELTSSSFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGMAVN

Query:  QYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLD
        +YLFGHGEYPLLPWL+VPFAGAVSGSTEESFNEAHRLMCIPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESL+SLD
Subjt:  QYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLD

Query:  HSSQYVGIGLNEDSPDEKATMLQKALALRARELHT
        H SQYV  GLN DS +EKA+++Q+ALALRARELH+
Subjt:  HSSQYVGIGLNEDSPDEKATMLQKALALRARELHT

A0A5D3CRB2 Putative nuclease HARBI18.7e-20684.6Show/hide
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRV-
        MDS +LAALLSSLISQLLLLL LLFPSSNPHSL SNS+ DS+FYAN   LF HFLFSQ  AASL FLSVSRKRKRT+  + LELG S         GRV 
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRV-

Query:  HLLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
        HL RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFP
Subjt:  HLLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP

Query:  CPSELELTSSSFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGMAVN
        CP+ELELTSS+FED+AGLPNCCGV+SCTRFKIIRN++FYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMS+TLFKDIE+ RLL SPPVYLHG+AVN
Subjt:  CPSELELTSSSFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGMAVN

Query:  QYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLD
        +YLFG GEYPLLPWL+VPFAGAVSGSTEESFNEAHRLMCIPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESL+SLD
Subjt:  QYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLD

Query:  HSSQYVGIGLNEDSPDEKATMLQKALALRARELHT
        H SQYV  GLN DS +EKA+++Q+ALA RARELH+
Subjt:  HSSQYVGIGLNEDSPDEKATMLQKALALRARELHT

A0A6J1D7F1 protein ALP1-like6.7e-19882.03Show/hide
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
        MDSR+LAAL+SSLISQLLL L LLFPSSNPHSLLSN  SDS+FYAN FPL  HFLFSQ+IA+SLSFLSVSRKRKRTH  E LEL PS  GG  GGRGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH

Query:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        LL TR PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPL+LSAEIRLGVGLSRLATGCDFSTIS+QFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PSELELTSSSFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGMAVNQ
        P+ELE TSS+FE +AGLPNCCGV++CT                            SIVAGFRGDKDDSTVLMS+TLFKDIEE RLL SPPVYLHGMAVNQ
Subjt:  PSELELTSSSFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGMAVNQ

Query:  YLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
        Y FGHGEYPLLPWLMVPF+GAVSGSTEESFN+AHRLMCIPALKAI+SLRNWGVLSQPM EEFKTAVAYIGACSILHNALLMREDF+AMADEWE LASLDH
Subjt:  YLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH

Query:  SSQYVGIGLNEDSPDEKATMLQKALALRARELHT
         SQY+G GLNEDS DEKA+++Q+ALALRARELH+
Subjt:  SSQYVGIGLNEDSPDEKATMLQKALALRARELHT

A0A6J1FNZ2 protein ALP1-like2.6e-22693.55Show/hide
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
        MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH

Query:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PSELELTSSSFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGMAVNQ
        PSELELTSSSFEDIAGLPNCCGVISCT                            SIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGMAVNQ
Subjt:  PSELELTSSSFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGMAVNQ

Query:  YLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
        YLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
Subjt:  YLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH

Query:  SSQYVGIGLNEDSPDEKATMLQKALALRARELHT
        SSQYVGIGLNEDSPDEKATMLQKALALRARELHT
Subjt:  SSQYVGIGLNEDSPDEKATMLQKALALRARELHT

A0A6J1J0M5 protein ALP1-like2.7e-22392.17Show/hide
Query:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
        MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVH

Query:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LLRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PSELELTSSSFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGMAVNQ
        PSELELTSS+FEDIAGLPNCCGVISCT                            SIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHG+AVNQ
Subjt:  PSELELTSSSFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGMAVNQ

Query:  YLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
        YLFGHG+YPLLPWLMVPFAGAVSGSTEESFNEAHRLM IPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH
Subjt:  YLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDH

Query:  SSQYVGIGLNEDSPDEKATMLQKALALRARELHT
        SSQYVGIGLNEDSPDEKA+M+QKALALRARELHT
Subjt:  SSQYVGIGLNEDSPDEKATMLQKALALRARELHT

SwissProt top hitse value%identityAlignment
Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 17.7e-2626.9Show/hide
Query:  LSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVHLLRTRSPD-------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSAEIRLGVGLS
        L  ++K  +    + +   P D    D        LR  SP        +F++ FR + +TF ++  L+   L  R P G        LS E ++ + L 
Subjt:  LSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVHLLRTRSPD-------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSAEIRLGVGLS

Query:  RLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPSELELTSSSFEDIAGLPNCCGVISCTR----FKIIRNTNFYED-----SIATQL
        RLA+G    ++   FGV +S       +    L    +  + +P    +E   S FE++ GLPNCCG I  T        ++ ++ + D     S+  Q 
Subjt:  RLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPSELELTSSSFEDIAGLPNCCGVISCTR----FKIIRNTNFYED-----SIATQL

Query:  VVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLL-GSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAI
        V D   R L++V G+ G    S +L  +  FK  E  ++L G+P     G  + +Y+ G   YPLLPWL+ P        +  +FNE H  +   A  A 
Subjt:  VVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLL-GSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAI

Query:  ISLR-NWGVLSQPM-HEEFKTAVAYIGACSILHNALLMREDF
          L+ +W +LS+ M   + +   + I  C +LHN ++   D+
Subjt:  ISLR-NWGVLSQPM-HEEFKTAVAYIGACSILHNALLMREDF

Q9M2U3 Protein ALP1-like4.8e-2829.18Show/hide
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+PL L+   R+ V L RL +G   S I + FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWV

Query:  EFPCPSELELTSSSFEDIAGLPNCCGVISCTRFKIIRNTNFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEE-E
            PS+L+   S FE I+GLPNCCG I  T   I+ N    E             S+  Q VVD   R L ++AG+ G  +D  VL ++  +K +E+ +
Subjt:  EFPCPSELELTSSSFEDIAGLPNCCGVISCTRFKIIRNTNFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEE-E

Query:  RLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRN-WGVLSQPMHEEFKTAV-AYIGACSILHNALLM
        RL G          + +Y+ G   +PLLPWL+ P+ G  +   +  FN+ H      A  A+  L++ W +++  M    +  +   I  C +LHN ++ 
Subjt:  RLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRN-WGVLSQPMHEEFKTAV-AYIGACSILHNALLM

Query:  REDFT
         ED T
Subjt:  REDFT

Arabidopsis top hitse value%identityAlignment
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714)1.8e-1430.25Show/hide
Query:  RLATGCDFSTISDQFGV-SESVARFCAKQLCRVLCTNFRFWVEFPCPSELELTSSSFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRI
        RLA G  +  +  +FG  S S A      +C+++       ++ P P      S +      LPNC GV+   RF++       + SI  Q +VDS+ R 
Subjt:  RLATGCDFSTISDQFGV-SESVARFCAKQLCRVLCTNFRFWVEFPCPSELELTSSSFEDIAGLPNCCGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRI

Query:  LSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALK----AIISLR-
        + I AG+        +   T LF  I EE L G+P    +G+ V +Y+ G    PLLPWL+ P+      S EESF E    +    L     A   +R 
Subjt:  LSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALK----AIISLR-

Query:  NWGVLS---QPMHEEFKTAVAYIGACSILHNALLMRED
         W +L    +P   EF   V   G   +LHN L+   D
Subjt:  NWGVLS---QPMHEEFKTAVAYIGACSILHNALLMRED

AT3G19120.1 PIF / Ping-Pong family of plant transposases6.5e-2025.95Show/hide
Query:  SLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHF-LFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVHLLRTRSPD--
        +++S LL L   L P+S   S  S SS  S   ++L    +   L    +A+ LSFL+V+R    + SS      PS       G   V   R  + D  
Subjt:  SLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHF-LFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVHLLRTRSPD--

Query:  ----------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTN-FRFWVE
                   +R+ + ++   F  +   L+P +       S L L A+  + + LSRLA GC   T++ ++ +   +       + R+L T  +  +++
Subjt:  ----------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTN-FRFWVE

Query:  FPC-PSELELTSSSFEDIAGLPNCCGVISCTRFKIIRNT-----NFY-----EDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLL
         P     L  T+  FE++  LPN CG I  T  K+ R T     N Y      D++  Q+V D       +     G +DDS+    + L+K +    ++
Subjt:  FPC-PSELELTSSSFEDIAGLPNCCGVISCTRFKIIRNT-----NFY-----EDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLL

Query:  GSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISL--RNWGVLSQPMHEEFKTAVAYIGACSILHN
            + + G  V  Y+ G   YPLL +LM PF+   SG+  E+  +   +     +   I L    W +L Q ++     A   I AC +LHN
Subjt:  GSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISL--RNWGVLSQPMHEEFKTAVAYIGACSILHN

AT3G55350.1 PIF / Ping-Pong family of plant transposases3.4e-2929.18Show/hide
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+PL L+   R+ V L RL +G   S I + FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWV

Query:  EFPCPSELELTSSSFEDIAGLPNCCGVISCTRFKIIRNTNFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEE-E
            PS+L+   S FE I+GLPNCCG I  T   I+ N    E             S+  Q VVD   R L ++AG+ G  +D  VL ++  +K +E+ +
Subjt:  EFPCPSELELTSSSFEDIAGLPNCCGVISCTRFKIIRNTNFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEE-E

Query:  RLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRN-WGVLSQPMHEEFKTAV-AYIGACSILHNALLM
        RL G          + +Y+ G   +PLLPWL+ P+ G  +   +  FN+ H      A  A+  L++ W +++  M    +  +   I  C +LHN ++ 
Subjt:  RLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLRN-WGVLSQPMHEEFKTAV-AYIGACSILHNALLM

Query:  REDFT
         ED T
Subjt:  REDFT

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)5.5e-2726.9Show/hide
Query:  LSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVHLLRTRSPD-------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSAEIRLGVGLS
        L  ++K  +    + +   P D    D        LR  SP        +F++ FR + +TF ++  L+   L  R P G        LS E ++ + L 
Subjt:  LSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVHLLRTRSPD-------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSAEIRLGVGLS

Query:  RLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPSELELTSSSFEDIAGLPNCCGVISCTR----FKIIRNTNFYED-----SIATQL
        RLA+G    ++   FGV +S       +    L    +  + +P    +E   S FE++ GLPNCCG I  T        ++ ++ + D     S+  Q 
Subjt:  RLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPSELELTSSSFEDIAGLPNCCGVISCTR----FKIIRNTNFYED-----SIATQL

Query:  VVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLL-GSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAI
        V D   R L++V G+ G    S +L  +  FK  E  ++L G+P     G  + +Y+ G   YPLLPWL+ P        +  +FNE H  +   A  A 
Subjt:  VVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLL-GSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAI

Query:  ISLR-NWGVLSQPM-HEEFKTAVAYIGACSILHNALLMREDF
          L+ +W +LS+ M   + +   + I  C +LHN ++   D+
Subjt:  ISLR-NWGVLSQPM-HEEFKTAVAYIGACSILHNALLMREDF

AT4G29780.1 unknown protein6.5e-2026.82Show/hide
Query:  DSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCR----VLCTNFRFWVEF
        D FR  FRM+ STF  +   L+  +       RD + +P       R+GV + RLATG     +S++FG+  S       ++CR    VL   +  W   
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCR----VLCTNFRFWVEF

Query:  PCPSELELTSSSFEDIAGLPNCCGVISCTRFKII------------RNTNFYED---SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEE
        P  SE+  T + FE +  +PN  G I  T   II            R+T   +    SI  Q VV++      +  G  G   D  +L  ++L       
Subjt:  PCPSELELTSSSFEDIAGLPNCCGVISCTRFKII------------RNTNFYED---SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEE

Query:  RLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLR-NWGVLSQPMHEEFKTAVAYIGACSILHNALLMR
            S      GM  + ++ G+  +PL  +L+VP+       T+ +FNE+   +   A  A   L+  W  L +    + +     +GAC +LHN   MR
Subjt:  RLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEESFNEAHRLMCIPALKAIISLR-NWGVLSQPMHEEFKTAVAYIGACSILHNALLMR

Query:  ED
        ++
Subjt:  ED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCCCGGCAATTGGCTGCTTTACTCTCTTCTTTGATCTCCCAACTACTCCTCCTGCTATTGCTACTCTTCCCTTCCTCCAATCCACATTCCCTTTTGTCCAATTC
TTCATCTGATTCCAATTTCTATGCTAATCTCTTTCCTCTCTTCAATCACTTCCTATTTTCCCAGCAAATTGCTGCATCCCTTTCGTTTCTCTCCGTTTCGCGTAAGAGGA
AGAGGACGCATTCGTCGGAGCTGCTTGAATTAGGGCCATCCGATAGCGGTGGTGAGGACGGCGGCCGTGGACGAGTCCATCTGTTGCGGACTCGGAGTCCTGATTCTTTC
AGGAATCACTTTCGAATGACCTCCTCGACGTTTGAATGGCTCTCTGGTTTGCTCGAGCCTCTTCTCGAGTGTCGGGACCCGGTAGGTTCGCCTCTTGATCTCTCCGCTGA
GATTCGACTCGGTGTTGGCTTGTCCCGGCTGGCCACTGGCTGCGATTTTTCGACGATTTCGGACCAATTTGGCGTCTCGGAGTCTGTAGCGAGGTTCTGTGCTAAGCAAT
TGTGTCGTGTTCTCTGCACTAATTTTCGCTTTTGGGTTGAATTCCCTTGCCCCAGTGAGCTTGAATTAACATCCTCGAGCTTTGAAGATATTGCTGGGCTTCCGAATTGC
TGTGGCGTGATTTCTTGTACAAGGTTCAAGATCATTAGAAATACCAATTTTTATGAAGATAGCATCGCTACTCAACTTGTTGTTGATTCCTCGTCACGAATCCTTAGTAT
TGTTGCAGGATTTCGTGGCGATAAAGACGACTCCACGGTGCTTATGTCCACAACGCTGTTTAAAGACATTGAAGAAGAAAGGCTACTGGGTTCTCCTCCTGTTTACCTTC
ATGGGATGGCTGTGAATCAATACTTATTTGGACATGGCGAATATCCTTTGCTTCCATGGTTAATGGTGCCTTTTGCAGGAGCTGTTTCAGGGTCAACTGAAGAGAGTTTC
AATGAAGCTCATCGATTGATGTGCATTCCAGCTCTGAAAGCCATCATTAGTTTGAGAAATTGGGGAGTTTTGAGCCAACCAATGCATGAGGAATTCAAAACTGCTGTTGC
ATACATTGGTGCTTGCTCAATTCTTCACAATGCTTTGTTGATGAGGGAGGATTTTACTGCCATGGCTGATGAATGGGAGAGCTTAGCTTCACTCGATCATAGCTCTCAGT
ATGTTGGTATTGGATTGAATGAGGATTCACCTGATGAGAAGGCTACTATGCTACAGAAAGCCTTGGCTCTGAGAGCTAGAGAGCTTCACACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATTCCCGGCAATTGGCTGCTTTACTCTCTTCTTTGATCTCCCAACTACTCCTCCTGCTATTGCTACTCTTCCCTTCCTCCAATCCACATTCCCTTTTGTCCAATTC
TTCATCTGATTCCAATTTCTATGCTAATCTCTTTCCTCTCTTCAATCACTTCCTATTTTCCCAGCAAATTGCTGCATCCCTTTCGTTTCTCTCCGTTTCGCGTAAGAGGA
AGAGGACGCATTCGTCGGAGCTGCTTGAATTAGGGCCATCCGATAGCGGTGGTGAGGACGGCGGCCGTGGACGAGTCCATCTGTTGCGGACTCGGAGTCCTGATTCTTTC
AGGAATCACTTTCGAATGACCTCCTCGACGTTTGAATGGCTCTCTGGTTTGCTCGAGCCTCTTCTCGAGTGTCGGGACCCGGTAGGTTCGCCTCTTGATCTCTCCGCTGA
GATTCGACTCGGTGTTGGCTTGTCCCGGCTGGCCACTGGCTGCGATTTTTCGACGATTTCGGACCAATTTGGCGTCTCGGAGTCTGTAGCGAGGTTCTGTGCTAAGCAAT
TGTGTCGTGTTCTCTGCACTAATTTTCGCTTTTGGGTTGAATTCCCTTGCCCCAGTGAGCTTGAATTAACATCCTCGAGCTTTGAAGATATTGCTGGGCTTCCGAATTGC
TGTGGCGTGATTTCTTGTACAAGGTTCAAGATCATTAGAAATACCAATTTTTATGAAGATAGCATCGCTACTCAACTTGTTGTTGATTCCTCGTCACGAATCCTTAGTAT
TGTTGCAGGATTTCGTGGCGATAAAGACGACTCCACGGTGCTTATGTCCACAACGCTGTTTAAAGACATTGAAGAAGAAAGGCTACTGGGTTCTCCTCCTGTTTACCTTC
ATGGGATGGCTGTGAATCAATACTTATTTGGACATGGCGAATATCCTTTGCTTCCATGGTTAATGGTGCCTTTTGCAGGAGCTGTTTCAGGGTCAACTGAAGAGAGTTTC
AATGAAGCTCATCGATTGATGTGCATTCCAGCTCTGAAAGCCATCATTAGTTTGAGAAATTGGGGAGTTTTGAGCCAACCAATGCATGAGGAATTCAAAACTGCTGTTGC
ATACATTGGTGCTTGCTCAATTCTTCACAATGCTTTGTTGATGAGGGAGGATTTTACTGCCATGGCTGATGAATGGGAGAGCTTAGCTTCACTCGATCATAGCTCTCAGT
ATGTTGGTATTGGATTGAATGAGGATTCACCTGATGAGAAGGCTACTATGCTACAGAAAGCCTTGGCTCTGAGAGCTAGAGAGCTTCACACTTAA
Protein sequenceShow/hide protein sequence
MDSRQLAALLSSLISQLLLLLLLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQQIAASLSFLSVSRKRKRTHSSELLELGPSDSGGEDGGRGRVHLLRTRSPDSF
RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLSRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPSELELTSSSFEDIAGLPNC
CGVISCTRFKIIRNTNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSTTLFKDIEEERLLGSPPVYLHGMAVNQYLFGHGEYPLLPWLMVPFAGAVSGSTEESF
NEAHRLMCIPALKAIISLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFTAMADEWESLASLDHSSQYVGIGLNEDSPDEKATMLQKALALRARELHT