; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg002950 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg002950
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionDDE Tnp4 domain-containing protein
Genome locationscaffold6:4795903..4797207
RNA-Seq ExpressionSpg002950
SyntenySpg002950
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037135.1 putative nuclease HARBI1 [Cucumis melo var. makuwa]9.5e-20785.52Show/hide
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDRGRV-
        MDS +LAALLSSLISQLLLLLFLLFPSSNPHSL SNS+ DS+FYAN   LF HFLFSQ+ AASL FLSVSRKRKRT+  + L+L  S         GRV 
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDRGRV-

Query:  HLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
        HLFRTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVG PLDLS EIRLGVGL RLATGCDF TIS+QFGVSESVARFC+KQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFP

Query:  CANELELTSSGFEDLSGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKDIEEGRLLDSPPVYLHGVSVN
        C NELELTSS FEDL+GLPNCCGV+SCTRFKIIRNSHFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTL+KDIE+GRLL+SPPVYLHGV+VN
Subjt:  CANELELTSSGFEDLSGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKDIEEGRLLDSPPVYLHGVSVN

Query:  QYLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLTSLD
        +YL G GEYPLLPWL++PF+GAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+SLD
Subjt:  QYLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLTSLD

Query:  HSSQYVGVGLNQGSTDEKASVIQRALALRARELHS
        H SQYV  GLN  ST+EKASVIQRALA RARELHS
Subjt:  HSSQYVGVGLNQGSTDEKASVIQRALALRARELHS

KAG6600319.1 Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. sororia]1.9e-20785.25Show/hide
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDRGRVH
        MDSRQLAALLSSLISQLLLLL LLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQ+IAASLSFLSVSRKRKRTHSSE L+L PSD GG DG RGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDRGRVH

Query:  LFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        L RTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVG PLDLSAEIRLGVGLSRLATGCDF TIS+QFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  ANELELTSSGFEDLSGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKDIEEGRLLDSPPVYLHGVSVNQ
         +ELELTSS FED++GLPNCCGVISCT                            SIVAGFRGDKDDSTVLMS+TL+KDIEEGRLL SPPVYLHG++VNQ
Subjt:  ANELELTSSGFEDLSGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKDIEEGRLLDSPPVYLHGVSVNQ

Query:  YLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLTSLDH
        YL GHGEYPLLPWLM+PF+GAVSGSTEESFNEAHRLMCIPALKAI+SLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDF+AMADEWESL SLDH
Subjt:  YLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLTSLDH

Query:  SSQYVGVGLNQGSTDEKASVIQRALALRARELHS
        SSQYVG+GLN+ S DEKA +IQ+ALALRARELH+
Subjt:  SSQYVGVGLNQGSTDEKASVIQRALALRARELHS

KAG7030976.1 Protein ALP1-like protein [Cucurbita argyrosperma subsp. argyrosperma]9.5e-20785.02Show/hide
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDRGRVH
        MDSRQLAALLSSLISQLLLLL LLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQ+IAASLSFLSVSRKRKRTHSSE L+L PSD GG DG RGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDRGRVH

Query:  LFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        L RTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVG PLDLSAEIRLGVGLSRLATGCDF TIS+QFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  ANELELTSSGFEDLSGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKDIEEGRLLDSPPVYLHGVSVNQ
         +ELELTSS FED++GLPNCCGVISCT                            SIVAGFRGDKDDSTVLMS+ L+KDIEEGRLL SPPVYLHG++VNQ
Subjt:  ANELELTSSGFEDLSGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKDIEEGRLLDSPPVYLHGVSVNQ

Query:  YLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLTSLDH
        YL GHGEYPLLPWLM+PF+GAVSGSTEESFNEAHRLMCIPALKAI+SLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDF+AMADEWESL SLDH
Subjt:  YLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLTSLDH

Query:  SSQYVGVGLNQGSTDEKASVIQRALALRARELHS
        SSQYVG+GLN+ S DEKA +IQ+ALALRARELH+
Subjt:  SSQYVGVGLNQGSTDEKASVIQRALALRARELHS

KGN57516.1 hypothetical protein Csa_011580 [Cucumis sativus]4.6e-20986.21Show/hide
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDRGRV-
        MDS +LAALLSSLISQLLLLLFLLFPSSNPHSL SNS+ DS+FYANLF    HFLFSQ+ AASL FLSVSRKRKRT+ S+ L+L  S         GRV 
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDRGRV-

Query:  HLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
        HLFRTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVG PLDLS EIRLGVGL RLATGCDF TIS+QFGVSESVARFC+KQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFP

Query:  CANELELTSSGFEDLSGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKDIEEGRLLDSPPVYLHGVSVN
        C NELELTSS FEDL+GLPNCCGV+SCTRFKIIRNSHFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTL+KDIE+GRLL+SPPVYLHGV+VN
Subjt:  CANELELTSSGFEDLSGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKDIEEGRLLDSPPVYLHGVSVN

Query:  QYLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLTSLD
        +YL GHGEYPLLPWL++PF+GAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+SLD
Subjt:  QYLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLTSLD

Query:  HSSQYVGVGLNQGSTDEKASVIQRALALRARELHS
        H SQYV  GLN  ST+EKASVIQRALALRARELHS
Subjt:  HSSQYVGVGLNQGSTDEKASVIQRALALRARELHS

XP_023536005.1 protein ALP1-like [Cucurbita pepo subsp. pepo]5.1e-20885.48Show/hide
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDRGRVH
        MDSRQLAALLSSLISQLLLLL LLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQ+IAASLSFLSVSRKRKRTHSSE L+L PSD GG DG RGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDRGRVH

Query:  LFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        L RTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVG PLDLSAEIRLGVGLSRLATGCDF TIS+QFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  ANELELTSSGFEDLSGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKDIEEGRLLDSPPVYLHGVSVNQ
         +ELELTSS FED++GLPNCCGVISCT                            SIVAGFRGDKDDSTVLMS+TL+KDIEEGRLL SPPVYLHG++VNQ
Subjt:  ANELELTSSGFEDLSGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKDIEEGRLLDSPPVYLHGVSVNQ

Query:  YLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLTSLDH
        YL GHGEYPLLPWLM+PF+GAVSGSTEESFNEAHRLMCIPALKAI+SLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDF+AMADEWESL SLDH
Subjt:  YLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLTSLDH

Query:  SSQYVGVGLNQGSTDEKASVIQRALALRARELHS
        SSQYVG+GLN+ S DEKAS+IQ+ALALRARELH+
Subjt:  SSQYVGVGLNQGSTDEKASVIQRALALRARELHS

TrEMBL top hitse value%identityAlignment
A0A0A0LBX6 DDE Tnp4 domain-containing protein2.2e-20986.21Show/hide
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDRGRV-
        MDS +LAALLSSLISQLLLLLFLLFPSSNPHSL SNS+ DS+FYANLF    HFLFSQ+ AASL FLSVSRKRKRT+ S+ L+L  S         GRV 
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDRGRV-

Query:  HLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
        HLFRTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVG PLDLS EIRLGVGL RLATGCDF TIS+QFGVSESVARFC+KQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFP

Query:  CANELELTSSGFEDLSGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKDIEEGRLLDSPPVYLHGVSVN
        C NELELTSS FEDL+GLPNCCGV+SCTRFKIIRNSHFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTL+KDIE+GRLL+SPPVYLHGV+VN
Subjt:  CANELELTSSGFEDLSGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKDIEEGRLLDSPPVYLHGVSVN

Query:  QYLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLTSLD
        +YL GHGEYPLLPWL++PF+GAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+SLD
Subjt:  QYLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLTSLD

Query:  HSSQYVGVGLNQGSTDEKASVIQRALALRARELHS
        H SQYV  GLN  ST+EKASVIQRALALRARELHS
Subjt:  HSSQYVGVGLNQGSTDEKASVIQRALALRARELHS

A0A5D3CRB2 Putative nuclease HARBI14.6e-20785.52Show/hide
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDRGRV-
        MDS +LAALLSSLISQLLLLLFLLFPSSNPHSL SNS+ DS+FYAN   LF HFLFSQ+ AASL FLSVSRKRKRT+  + L+L  S         GRV 
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDRGRV-

Query:  HLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
        HLFRTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVG PLDLS EIRLGVGL RLATGCDF TIS+QFGVSESVARFC+KQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFP

Query:  CANELELTSSGFEDLSGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKDIEEGRLLDSPPVYLHGVSVN
        C NELELTSS FEDL+GLPNCCGV+SCTRFKIIRNSHFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTL+KDIE+GRLL+SPPVYLHGV+VN
Subjt:  CANELELTSSGFEDLSGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKDIEEGRLLDSPPVYLHGVSVN

Query:  QYLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLTSLD
        +YL G GEYPLLPWL++PF+GAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+SLD
Subjt:  QYLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLTSLD

Query:  HSSQYVGVGLNQGSTDEKASVIQRALALRARELHS
        H SQYV  GLN  ST+EKASVIQRALA RARELHS
Subjt:  HSSQYVGVGLNQGSTDEKASVIQRALALRARELHS

A0A6J1D7F1 protein ALP1-like3.2e-20083.41Show/hide
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDRGRVH
        MDSR+LAAL+SSLISQLLL LFLLFPSSNPHSLLSN  SDS+FYAN FPL  HFLFSQEIA+SLSFLSVSRKRKRTH  EQL+LEPS GGGG G RGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDRGRVH

Query:  LFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        L  TR PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVG PL+LSAEIRLGVGLSRLATGCDF TISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  ANELELTSSGFEDLSGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKDIEEGRLLDSPPVYLHGVSVNQ
         NELE TSS FE L+GLPNCCGV++CT                            SIVAGFRGDKDDSTVLMSSTL+KDIEEGRLLDSPPVYLHG++VNQ
Subjt:  ANELELTSSGFEDLSGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKDIEEGRLLDSPPVYLHGVSVNQ

Query:  YLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLTSLDH
        Y  GHGEYPLLPWLM+PFSGAVSGSTEESFN+AHRLMCIPALKAIVSLRNWGVLSQPM EEFKTAVAYIGACSILHNALLMREDFSAMADEWE L SLDH
Subjt:  YLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLTSLDH

Query:  SSQYVGVGLNQGSTDEKASVIQRALALRARELHS
         SQY+G GLN+ STDEKASVIQRALALRARELHS
Subjt:  SSQYVGVGLNQGSTDEKASVIQRALALRARELHS

A0A6J1FNZ2 protein ALP1-like7.9e-20784.79Show/hide
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDRGRVH
        MDSRQLAALLSSLISQLLLLL LLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQ+IAASLSFLSVSRKRKRTHSSE L+L PSD GG DG RGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDRGRVH

Query:  LFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        L RTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVG PLDLSAEIRLGVGLSRLATGCDF TIS+QFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  ANELELTSSGFEDLSGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKDIEEGRLLDSPPVYLHGVSVNQ
         +ELELTSS FED++GLPNCCGVISCT                            SIVAGFRGDKDDSTVLMS+TL+KDIEE RLL SPPVYLHG++VNQ
Subjt:  ANELELTSSGFEDLSGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKDIEEGRLLDSPPVYLHGVSVNQ

Query:  YLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLTSLDH
        YL GHGEYPLLPWLM+PF+GAVSGSTEESFNEAHRLMCIPALKAI+SLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDF+AMADEWESL SLDH
Subjt:  YLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLTSLDH

Query:  SSQYVGVGLNQGSTDEKASVIQRALALRARELHS
        SSQYVG+GLN+ S DEKA+++Q+ALALRARELH+
Subjt:  SSQYVGVGLNQGSTDEKASVIQRALALRARELHS

A0A6J1J0M5 protein ALP1-like3.0e-20685.02Show/hide
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDRGRVH
        MDSRQLAALLSSLISQLLLLL LLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQ+IAASLSFLSVSRKRKRTHSSE L+L PSD GG DG RGRVH
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDRGRVH

Query:  LFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        L RTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVG PLDLSAEIRLGVGLSRLATGCDF TIS+QFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  ANELELTSSGFEDLSGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKDIEEGRLLDSPPVYLHGVSVNQ
         +ELELTSS FED++GLPNCCGVISCT                            SIVAGFRGDKDDSTVLMS+TL+KDIEE RLL SPPVYLHGV+VNQ
Subjt:  ANELELTSSGFEDLSGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKDIEEGRLLDSPPVYLHGVSVNQ

Query:  YLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLTSLDH
        YL GHG+YPLLPWLM+PF+GAVSGSTEESFNEAHRLM IPALKAI+SLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDF+AMADEWESL SLDH
Subjt:  YLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLTSLDH

Query:  SSQYVGVGLNQGSTDEKASVIQRALALRARELHS
        SSQYVG+GLN+ S DEKAS+IQ+ALALRARELH+
Subjt:  SSQYVGVGLNQGSTDEKASVIQRALALRARELHS

SwissProt top hitse value%identityAlignment
Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 11.7e-2827.78Show/hide
Query:  LSVSRKRKRTHSSEQLQLEPSDGGGGDGDRGRVHLFRTRSPD-------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGL----PLDLSAEIRLGVGLS
        L  ++K  +    +++   P D    D D       R  SP        +F++ FR + +TF ++  L+   L  R P GL       LS E ++ + L 
Subjt:  LSVSRKRKRTHSSEQLQLEPSDGGGGDGDRGRVHLFRTRSPD-------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGL----PLDLSAEIRLGVGLS

Query:  RLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCANELELTSSGFEDLSGLPNCCGVISCTR----FKIIRNSHFYED-----SIATQL
        RLA+G   +++   FGV +S       +    L    +  + +P ++ +E   S FE++ GLPNCCG I  T        ++ S  + D     S+  Q 
Subjt:  RLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCANELELTSSGFEDLSGLPNCCGVISCTR----FKIIRNSHFYED-----SIATQL

Query:  VVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKDIEEGRLLDSPPVYL-HGVSVNQYLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAI
        V D   R L++V G+ G    S +L  S  +K  E  ++LD  P  L  G  + +Y++G   YPLLPWL+ P        +  +FNE H  +   A  A 
Subjt:  VVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKDIEEGRLLDSPPVYL-HGVSVNQYLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAI

Query:  VSLR-NWGVLSQPM-HEEFKTAVAYIGACSILHNALLMREDF
          L+ +W +LS+ M   + +   + I  C +LHN ++   D+
Subjt:  VSLR-NWGVLSQPM-HEEFKTAVAYIGACSILHNALLMREDF

Q9M2U3 Protein ALP1-like3.7e-2828.53Show/hide
Query:  DGDRGRVHLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRV
        DG   R++   T  P +F + F+++  TF+++  L++     +     D  G PL L+   R+ V L RL +G     I E FG+++S       +    
Subjt:  DGDRGRVHLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRV

Query:  LCTNFRFWVEFPCANELELTSSGFEDLSGLPNCCGVISCTRFKIIRNSHFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSST
        +       + +P  ++L+   S FE +SGLPNCCG I  T   I+ N    E             S+  Q VVD   R L ++AG+ G  +D  VL +S 
Subjt:  LCTNFRFWVEFPCANELELTSSGFEDLSGLPNCCGVISCTRFKIIRNSHFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSST

Query:  LYKDIEEGRLLDSPPVYL-HGVSVNQYLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPMHEEFKTAV-AYIGAC
         YK +E+G+ L+   + L     + +Y++G   +PLLPWL+ P+ G  +   +  FN+ H      A  A+  L++ W +++  M    +  +   I  C
Subjt:  LYKDIEEGRLLDSPPVYL-HGVSVNQYLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPMHEEFKTAV-AYIGAC

Query:  SILHNALLMRED
         +LHN ++  ED
Subjt:  SILHNALLMRED

Arabidopsis top hitse value%identityAlignment
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714)1.3e-1232.07Show/hide
Query:  LPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKDIEEGRLLDSPPVYL-HGVSVNQYLLGHGEYPLLPWLM
        LPNC GV+   RF++       + SI  Q +VDS+ R + I AG+        +   + L+   EE  +L   P  L +GV V +Y+LG    PLLPWL+
Subjt:  LPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKDIEEGRLLDSPPVYL-HGVSVNQYLLGHGEYPLLPWLM

Query:  MPFSGAVSGSTEESFNEAHRLMCIPALK----AIVSLR-NWGVLS---QPMHEEFKTAVAYIGACSILHNALLMREDFSAMADE
         P+      S EESF E    +    L     A   +R  W +L    +P   EF   V   G   +LHN L+   D     +E
Subjt:  MPFSGAVSGSTEESFNEAHRLMCIPALK----AIVSLR-NWGVLS---QPMHEEFKTAVAYIGACSILHNALLMREDFSAMADE

AT3G19120.1 PIF / Ping-Pong family of plant transposases1.3e-2326.46Show/hide
Query:  SLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHF-LFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDRGRVHLFRTRSPD--
        +++S LL L   L P+S   S  S SS  S   ++L    +   L    +A+ LSFL+V+R    + SS +           DGD   V  FR  + D  
Subjt:  SLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHF-LFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDRGRVHLFRTRSPD--

Query:  ----------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTN-FRFWVE
                   +R+ + ++   F  +   L+P +   +     L L A+  + + LSRLA GC   T++ ++ +   +       + R+L T  +  +++
Subjt:  ----------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTN-FRFWVE

Query:  FPCA-NELELTSSGFEDLSGLPNCCGVISCTRFKIIRNS----------HFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKDIEEGRLL
         P     L  T+ GFE+L+ LPN CG I  T  K+ R +           +  D++  Q+V D       +     G +DDS+    S LYK +  G ++
Subjt:  FPCA-NELELTSSGFEDLSGLPNCCGVISCTRFKIIRNS----------HFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKDIEEGRLL

Query:  DSPPVYLHGVSVNQYLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAIVSL--RNWGVLSQPMHEEFKTAVAYIGACSILHN
            + + G  V  Y++G   YPLL +LM PFS   SG+  E+  +   +     +   + L    W +L Q ++     A   I AC +LHN
Subjt:  DSPPVYLHGVSVNQYLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAIVSL--RNWGVLSQPMHEEFKTAVAYIGACSILHN

AT3G55350.1 PIF / Ping-Pong family of plant transposases2.6e-2928.53Show/hide
Query:  DGDRGRVHLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRV
        DG   R++   T  P +F + F+++  TF+++  L++     +     D  G PL L+   R+ V L RL +G     I E FG+++S       +    
Subjt:  DGDRGRVHLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRV

Query:  LCTNFRFWVEFPCANELELTSSGFEDLSGLPNCCGVISCTRFKIIRNSHFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSST
        +       + +P  ++L+   S FE +SGLPNCCG I  T   I+ N    E             S+  Q VVD   R L ++AG+ G  +D  VL +S 
Subjt:  LCTNFRFWVEFPCANELELTSSGFEDLSGLPNCCGVISCTRFKIIRNSHFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSST

Query:  LYKDIEEGRLLDSPPVYL-HGVSVNQYLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPMHEEFKTAV-AYIGAC
         YK +E+G+ L+   + L     + +Y++G   +PLLPWL+ P+ G  +   +  FN+ H      A  A+  L++ W +++  M    +  +   I  C
Subjt:  LYKDIEEGRLLDSPPVYL-HGVSVNQYLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPMHEEFKTAV-AYIGAC

Query:  SILHNALLMRED
         +LHN ++  ED
Subjt:  SILHNALLMRED

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)1.2e-2927.78Show/hide
Query:  LSVSRKRKRTHSSEQLQLEPSDGGGGDGDRGRVHLFRTRSPD-------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGL----PLDLSAEIRLGVGLS
        L  ++K  +    +++   P D    D D       R  SP        +F++ FR + +TF ++  L+   L  R P GL       LS E ++ + L 
Subjt:  LSVSRKRKRTHSSEQLQLEPSDGGGGDGDRGRVHLFRTRSPD-------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGL----PLDLSAEIRLGVGLS

Query:  RLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCANELELTSSGFEDLSGLPNCCGVISCTR----FKIIRNSHFYED-----SIATQL
        RLA+G   +++   FGV +S       +    L    +  + +P ++ +E   S FE++ GLPNCCG I  T        ++ S  + D     S+  Q 
Subjt:  RLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCANELELTSSGFEDLSGLPNCCGVISCTR----FKIIRNSHFYED-----SIATQL

Query:  VVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKDIEEGRLLDSPPVYL-HGVSVNQYLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAI
        V D   R L++V G+ G    S +L  S  +K  E  ++LD  P  L  G  + +Y++G   YPLLPWL+ P        +  +FNE H  +   A  A 
Subjt:  VVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKDIEEGRLLDSPPVYL-HGVSVNQYLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAI

Query:  VSLR-NWGVLSQPM-HEEFKTAVAYIGACSILHNALLMREDF
          L+ +W +LS+ M   + +   + I  C +LHN ++   D+
Subjt:  VSLR-NWGVLSQPM-HEEFKTAVAYIGACSILHNALLMREDF

AT4G29780.1 unknown protein1.1e-1927.18Show/hide
Query:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCR----VLCTNFRFWVEFPCANE
        D FR  FRM+ STF  +   L+  +  ++ + L   + A  R+GV + RLATG     +SE+FG+  S       ++CR    VL   +  W   P  +E
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCR----VLCTNFRFWVEFPCANE

Query:  LELTSSGFEDLSGLPNCCGVISCTRFKIIR---------NSHFYED------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKD-IEEGRLLD
        +  T + FE +  +PN  G I  T   II          N    E       SI  Q VV++      +  G  G   D  +L  S+L +     G L D
Subjt:  LELTSSGFEDLSGLPNCCGVISCTRFKIIR---------NSHFYED------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKD-IEEGRLLD

Query:  SPPVYLHGVSVNQYLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPMHEEFKTAVAYIGACSILHNALLMRED
        S            +++G+  +PL  +L++P++      T+ +FNE+   +   A  A   L+  W  L +    + +     +GAC +LHN   MR++
Subjt:  SPPVYLHGVSVNQYLLGHGEYPLLPWLMMPFSGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPMHEEFKTAVAYIGACSILHNALLMRED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCTCGTCAATTGGCTGCTTTACTCTCTTCTTTGATCTCCCAGCTGCTGCTCCTCCTCTTTCTTCTCTTCCCTTCCTCCAACCCACATTCCCTTTTGTCCAATTC
CTCTTCCGATTCCAATTTCTATGCTAATCTCTTCCCTCTCTTCAACCACTTCCTCTTTTCCCAGGAAATTGCCGCCTCCCTTTCGTTCCTCTCCGTTTCGCGCAAGAGGA
AGAGGACGCATTCGTCGGAGCAGCTCCAATTGGAGCCATCCGATGGCGGCGGCGGAGACGGCGACCGTGGACGAGTCCATCTGTTTCGGACTCGGAGTCCTGATTCTTTC
AGAAATCACTTCAGAATGACTTCCTCGACGTTTGAATGGCTCTCTGGTTTGCTCGAGCCCCTTCTCGAGTGTCGCGACCCGGTGGGTTTGCCTCTCGATCTCTCCGCCGA
GATTCGACTCGGTGTCGGCCTGTCTCGGCTGGCCACCGGCTGCGATTTCTTGACAATTTCGGAGCAATTCGGCGTCTCGGAGTCGGTAGCGAGGTTCTGTGCTAAGCAAT
TGTGTCGAGTTCTATGTACCAATTTTCGCTTCTGGGTCGAATTCCCTTGCGCCAATGAGCTCGAATTAACATCCTCGGGCTTTGAAGATCTTTCTGGGCTTCCGAATTGC
TGTGGCGTGATTTCTTGTACAAGGTTCAAGATCATTAGAAATAGCCATTTTTATGAAGATAGCATCGCTACTCAACTTGTTGTTGATTCCTCGTCACGAATCCTTAGTAT
TGTTGCAGGATTTCGTGGCGATAAGGACGACTCGACGGTGCTTATGTCCTCGACGCTGTATAAAGACATTGAAGAAGGAAGGCTGCTGGATTCTCCTCCGGTTTACCTTC
ATGGGGTGTCAGTGAATCAGTACTTGTTGGGACATGGCGAATACCCTTTGCTTCCATGGTTAATGATGCCTTTTTCAGGAGCTGTTTCAGGGTCAACTGAGGAGAGTTTC
AACGAAGCCCATCGATTGATGTGCATTCCAGCTCTGAAAGCAATCGTTAGTTTGAGAAATTGGGGAGTTCTGAGCCAACCAATGCATGAGGAGTTCAAAACTGCTGTTGC
TTATATTGGTGCTTGCTCTATTCTTCATAATGCATTGTTGATGAGGGAGGATTTTTCTGCCATGGCTGATGAATGGGAGAGCTTAACTTCACTTGATCATAGTTCCCAGT
ATGTTGGGGTTGGATTAAATCAGGGTTCAACTGATGAGAAGGCTTCTGTAATACAGAGGGCATTGGCTCTGAGAGCTAGAGAGCTTCATAGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATTCTCGTCAATTGGCTGCTTTACTCTCTTCTTTGATCTCCCAGCTGCTGCTCCTCCTCTTTCTTCTCTTCCCTTCCTCCAACCCACATTCCCTTTTGTCCAATTC
CTCTTCCGATTCCAATTTCTATGCTAATCTCTTCCCTCTCTTCAACCACTTCCTCTTTTCCCAGGAAATTGCCGCCTCCCTTTCGTTCCTCTCCGTTTCGCGCAAGAGGA
AGAGGACGCATTCGTCGGAGCAGCTCCAATTGGAGCCATCCGATGGCGGCGGCGGAGACGGCGACCGTGGACGAGTCCATCTGTTTCGGACTCGGAGTCCTGATTCTTTC
AGAAATCACTTCAGAATGACTTCCTCGACGTTTGAATGGCTCTCTGGTTTGCTCGAGCCCCTTCTCGAGTGTCGCGACCCGGTGGGTTTGCCTCTCGATCTCTCCGCCGA
GATTCGACTCGGTGTCGGCCTGTCTCGGCTGGCCACCGGCTGCGATTTCTTGACAATTTCGGAGCAATTCGGCGTCTCGGAGTCGGTAGCGAGGTTCTGTGCTAAGCAAT
TGTGTCGAGTTCTATGTACCAATTTTCGCTTCTGGGTCGAATTCCCTTGCGCCAATGAGCTCGAATTAACATCCTCGGGCTTTGAAGATCTTTCTGGGCTTCCGAATTGC
TGTGGCGTGATTTCTTGTACAAGGTTCAAGATCATTAGAAATAGCCATTTTTATGAAGATAGCATCGCTACTCAACTTGTTGTTGATTCCTCGTCACGAATCCTTAGTAT
TGTTGCAGGATTTCGTGGCGATAAGGACGACTCGACGGTGCTTATGTCCTCGACGCTGTATAAAGACATTGAAGAAGGAAGGCTGCTGGATTCTCCTCCGGTTTACCTTC
ATGGGGTGTCAGTGAATCAGTACTTGTTGGGACATGGCGAATACCCTTTGCTTCCATGGTTAATGATGCCTTTTTCAGGAGCTGTTTCAGGGTCAACTGAGGAGAGTTTC
AACGAAGCCCATCGATTGATGTGCATTCCAGCTCTGAAAGCAATCGTTAGTTTGAGAAATTGGGGAGTTCTGAGCCAACCAATGCATGAGGAGTTCAAAACTGCTGTTGC
TTATATTGGTGCTTGCTCTATTCTTCATAATGCATTGTTGATGAGGGAGGATTTTTCTGCCATGGCTGATGAATGGGAGAGCTTAACTTCACTTGATCATAGTTCCCAGT
ATGTTGGGGTTGGATTAAATCAGGGTTCAACTGATGAGAAGGCTTCTGTAATACAGAGGGCATTGGCTCTGAGAGCTAGAGAGCTTCATAGTTAA
Protein sequenceShow/hide protein sequence
MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDRGRVHLFRTRSPDSF
RNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCANELELTSSGFEDLSGLPNC
CGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYKDIEEGRLLDSPPVYLHGVSVNQYLLGHGEYPLLPWLMMPFSGAVSGSTEESF
NEAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLTSLDHSSQYVGVGLNQGSTDEKASVIQRALALRARELHS