CuGenDBv2

Lag0019642 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0019642
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: DDE Tnp4 domain-containing protein
Genome location: chr5:44142915..44144225
RNA-Seq Expression: Lag0019642
Synteny: Lag0019642
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
                     GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR027806 - Harbinger transposase-derived nuclease domain
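The genome location above can be split into its parts and turned into a span length. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

chrom, start, end = parse_location("chr5:44142915..44144225")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 1311 bp on chr5.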


Homology
GenBank top hits (e value, % identity, alignment)
KAA0037135.1 putative nuclease HARBI1 [Cucumis melo var. makuwa]  e value: 2.1e-206  % identity: 85.13
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDGDRGR
        MDS +LAALLSSLISQLLLLLFLLFPSSNPHSL SNS+ DS+FYAN   LF HFLFSQ+ AASL FLSVSRKRKRT+  + L+L  S           GR
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDGDRGR

Query:  V-HLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVE
        V HLFRTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVG PLDLS EIRLGVGL RLATGCDF TIS+QFGVSESVARFC+KQLCRVLCTNFRFWVE
Subjt:  V-HLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVE

Query:  FPCANELELTSSGFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRDIEEGRLLDSPPVYLHGVA
        FPC NELELTSS FEDLAGLPNCCGV+SCTRFKIIRNSHFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTL++DIE+GRLL+SPPVYLHGVA
Subjt:  FPCANELELTSSGFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRDIEEGRLLDSPPVYLHGVA

Query:  VNQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLAS
        VN+YL G GEYPLLPWL++PFAGAVSGSTEESFN+AHRLMCIPALKAIVSLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S
Subjt:  VNQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLAS

Query:  LNHSSQYVGVGLNQGSTDEKASVIQRALALRARELHS
        L+H SQYV  GLN  ST+EKASVIQRALA RARELHS
Subjt:  LNHSSQYVGVGLNQGSTDEKASVIQRALALRARELHS

KAG6600319.1 Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. sororia]  e value: 4.8e-206  % identity: 85.09
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDGDRGR
        MDSRQLAALLSSLISQLLLLL LLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQ+IAASLSFLSVSRKRKRTHSSE L+L PSD GG   DG RGR
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDGDRGR

Query:  VHLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEF
        VHL RTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVG PLDLSAEIRLGVGLSRLATGCDF TIS+QFGVSESVARFCAKQLCRVLCTNFRFWVEF
Subjt:  VHLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEF

Query:  PCANELELTSSGFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRDIEEGRLLDSPPVYLHGVAV
        PC +ELELTSS FED+AGLPNCCGVISCT                            SIVAGFRGDKDDSTVLMS+TL++DIEEGRLL SPPVYLHG+AV
Subjt:  PCANELELTSSGFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRDIEEGRLLDSPPVYLHGVAV

Query:  NQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASL
        NQYL GHGEYPLLPWLM+PFAGAVSGSTEESFN+AHRLMCIPALKAI+SLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDF+AMADEWESLASL
Subjt:  NQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASL

Query:  NHSSQYVGVGLNQGSTDEKASVIQRALALRARELHS
        +HSSQYVG+GLN+ S DEKA +IQ+ALALRARELH+
Subjt:  NHSSQYVGVGLNQGSTDEKASVIQRALALRARELHS

KAG7030976.1 Protein ALP1-like protein [Cucurbita argyrosperma subsp. argyrosperma]  e value: 2.4e-205  % identity: 84.86
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDGDRGR
        MDSRQLAALLSSLISQLLLLL LLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQ+IAASLSFLSVSRKRKRTHSSE L+L PSD GG   DG RGR
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDGDRGR

Query:  VHLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEF
        VHL RTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVG PLDLSAEIRLGVGLSRLATGCDF TIS+QFGVSESVARFCAKQLCRVLCTNFRFWVEF
Subjt:  VHLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEF

Query:  PCANELELTSSGFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRDIEEGRLLDSPPVYLHGVAV
        PC +ELELTSS FED+AGLPNCCGVISCT                            SIVAGFRGDKDDSTVLMS+ L++DIEEGRLL SPPVYLHG+AV
Subjt:  PCANELELTSSGFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRDIEEGRLLDSPPVYLHGVAV

Query:  NQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASL
        NQYL GHGEYPLLPWLM+PFAGAVSGSTEESFN+AHRLMCIPALKAI+SLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDF+AMADEWESLASL
Subjt:  NQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASL

Query:  NHSSQYVGVGLNQGSTDEKASVIQRALALRARELHS
        +HSSQYVG+GLN+ S DEKA +IQ+ALALRARELH+
Subjt:  NHSSQYVGVGLNQGSTDEKASVIQRALALRARELHS

KGN57516.1 hypothetical protein Csa_011580 [Cucumis sativus]  e value: 1.3e-208  % identity: 85.81
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDGDRGR
        MDS +LAALLSSLISQLLLLLFLLFPSSNPHSL SNS+ DS+FYANLF    HFLFSQ+ AASL FLSVSRKRKRT+ S+ L+L  S           GR
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDGDRGR

Query:  V-HLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVE
        V HLFRTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVG PLDLS EIRLGVGL RLATGCDF TIS+QFGVSESVARFC+KQLCRVLCTNFRFWVE
Subjt:  V-HLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVE

Query:  FPCANELELTSSGFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRDIEEGRLLDSPPVYLHGVA
        FPC NELELTSS FEDLAGLPNCCGV+SCTRFKIIRNSHFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTL++DIE+GRLL+SPPVYLHGVA
Subjt:  FPCANELELTSSGFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRDIEEGRLLDSPPVYLHGVA

Query:  VNQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLAS
        VN+YL GHGEYPLLPWL++PFAGAVSGSTEESFN+AHRLMCIPALKAIVSLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S
Subjt:  VNQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLAS

Query:  LNHSSQYVGVGLNQGSTDEKASVIQRALALRARELHS
        L+H SQYV  GLN  ST+EKASVIQRALALRARELHS
Subjt:  LNHSSQYVGVGLNQGSTDEKASVIQRALALRARELHS

XP_023536005.1 protein ALP1-like [Cucurbita pepo subsp. pepo]  e value: 1.3e-206  % identity: 85.32
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDGDRGR
        MDSRQLAALLSSLISQLLLLL LLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQ+IAASLSFLSVSRKRKRTHSSE L+L PSD GG   DG RGR
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDGDRGR

Query:  VHLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEF
        VHL RTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVG PLDLSAEIRLGVGLSRLATGCDF TIS+QFGVSESVARFCAKQLCRVLCTNFRFWVEF
Subjt:  VHLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEF

Query:  PCANELELTSSGFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRDIEEGRLLDSPPVYLHGVAV
        PC +ELELTSS FED+AGLPNCCGVISCT                            SIVAGFRGDKDDSTVLMS+TL++DIEEGRLL SPPVYLHG+AV
Subjt:  PCANELELTSSGFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRDIEEGRLLDSPPVYLHGVAV

Query:  NQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASL
        NQYL GHGEYPLLPWLM+PFAGAVSGSTEESFN+AHRLMCIPALKAI+SLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDF+AMADEWESLASL
Subjt:  NQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASL

Query:  NHSSQYVGVGLNQGSTDEKASVIQRALALRARELHS
        +HSSQYVG+GLN+ S DEKAS+IQ+ALALRARELH+
Subjt:  NHSSQYVGVGLNQGSTDEKASVIQRALALRARELHS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LBX6 DDE Tnp4 domain-containing protein  e value: 6.5e-209  % identity: 85.81
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDGDRGR
        MDS +LAALLSSLISQLLLLLFLLFPSSNPHSL SNS+ DS+FYANLF    HFLFSQ+ AASL FLSVSRKRKRT+ S+ L+L  S           GR
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDGDRGR

Query:  V-HLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVE
        V HLFRTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVG PLDLS EIRLGVGL RLATGCDF TIS+QFGVSESVARFC+KQLCRVLCTNFRFWVE
Subjt:  V-HLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVE

Query:  FPCANELELTSSGFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRDIEEGRLLDSPPVYLHGVA
        FPC NELELTSS FEDLAGLPNCCGV+SCTRFKIIRNSHFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTL++DIE+GRLL+SPPVYLHGVA
Subjt:  FPCANELELTSSGFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRDIEEGRLLDSPPVYLHGVA

Query:  VNQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLAS
        VN+YL GHGEYPLLPWL++PFAGAVSGSTEESFN+AHRLMCIPALKAIVSLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S
Subjt:  VNQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLAS

Query:  LNHSSQYVGVGLNQGSTDEKASVIQRALALRARELHS
        L+H SQYV  GLN  ST+EKASVIQRALALRARELHS
Subjt:  LNHSSQYVGVGLNQGSTDEKASVIQRALALRARELHS

A0A5D3CRB2 Putative nuclease HARBI1  e value: 1.0e-206  % identity: 85.13
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDGDRGR
        MDS +LAALLSSLISQLLLLLFLLFPSSNPHSL SNS+ DS+FYAN   LF HFLFSQ+ AASL FLSVSRKRKRT+  + L+L  S           GR
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDGDRGR

Query:  V-HLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVE
        V HLFRTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVG PLDLS EIRLGVGL RLATGCDF TIS+QFGVSESVARFC+KQLCRVLCTNFRFWVE
Subjt:  V-HLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVE

Query:  FPCANELELTSSGFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRDIEEGRLLDSPPVYLHGVA
        FPC NELELTSS FEDLAGLPNCCGV+SCTRFKIIRNSHFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTL++DIE+GRLL+SPPVYLHGVA
Subjt:  FPCANELELTSSGFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRDIEEGRLLDSPPVYLHGVA

Query:  VNQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLAS
        VN+YL G GEYPLLPWL++PFAGAVSGSTEESFN+AHRLMCIPALKAIVSLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S
Subjt:  VNQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLAS

Query:  LNHSSQYVGVGLNQGSTDEKASVIQRALALRARELHS
        L+H SQYV  GLN  ST+EKASVIQRALA RARELHS
Subjt:  LNHSSQYVGVGLNQGSTDEKASVIQRALALRARELHS

A0A6J1D7F1 protein ALP1-like  e value: 1.9e-200  % identity: 83.26
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDGDRGR
        MDSR+LAAL+SSLISQLLL LFLLFPSSNPHSLLSN  SDS+FYAN FPL  HFLFSQEIA+SLSFLSVSRKRKRTH  EQL+LEPS GGGG   G RGR
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDGDRGR

Query:  VHLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEF
        VHL  TR PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVG PL+LSAEIRLGVGLSRLATGCDF TISEQFGVSESVARFCAKQLCRVLCTNFRFWVEF
Subjt:  VHLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEF

Query:  PCANELELTSSGFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRDIEEGRLLDSPPVYLHGVAV
        PC NELE TSS FE LAGLPNCCGV++CT                            SIVAGFRGDKDDSTVLMSSTL++DIEEGRLLDSPPVYLHG+AV
Subjt:  PCANELELTSSGFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRDIEEGRLLDSPPVYLHGVAV

Query:  NQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASL
        NQY  GHGEYPLLPWLM+PF+GAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPM EEFKTAVAYIGACSILHNALLMREDFSAMADEWE LASL
Subjt:  NQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASL

Query:  NHSSQYVGVGLNQGSTDEKASVIQRALALRARELHS
        +H SQY+G GLN+ STDEKASVIQRALALRARELHS
Subjt:  NHSSQYVGVGLNQGSTDEKASVIQRALALRARELHS

A0A6J1FNZ2 protein ALP1-like  e value: 1.9e-205  % identity: 84.63
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDGDRGR
        MDSRQLAALLSSLISQLLLLL LLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQ+IAASLSFLSVSRKRKRTHSSE L+L PSD GG   DG RGR
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDGDRGR

Query:  VHLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEF
        VHL RTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVG PLDLSAEIRLGVGLSRLATGCDF TIS+QFGVSESVARFCAKQLCRVLCTNFRFWVEF
Subjt:  VHLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEF

Query:  PCANELELTSSGFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRDIEEGRLLDSPPVYLHGVAV
        PC +ELELTSS FED+AGLPNCCGVISCT                            SIVAGFRGDKDDSTVLMS+TL++DIEE RLL SPPVYLHG+AV
Subjt:  PCANELELTSSGFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRDIEEGRLLDSPPVYLHGVAV

Query:  NQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASL
        NQYL GHGEYPLLPWLM+PFAGAVSGSTEESFN+AHRLMCIPALKAI+SLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDF+AMADEWESLASL
Subjt:  NQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASL

Query:  NHSSQYVGVGLNQGSTDEKASVIQRALALRARELHS
        +HSSQYVG+GLN+ S DEKA+++Q+ALALRARELH+
Subjt:  NHSSQYVGVGLNQGSTDEKASVIQRALALRARELHS

A0A6J1J0M5 protein ALP1-like  e value: 7.4e-205  % identity: 84.86
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDGDRGR
        MDSRQLAALLSSLISQLLLLL LLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQ+IAASLSFLSVSRKRKRTHSSE L+L PSD GG   DG RGR
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDGDRGR

Query:  VHLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEF
        VHL RTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVG PLDLSAEIRLGVGLSRLATGCDF TIS+QFGVSESVARFCAKQLCRVLCTNFRFWVEF
Subjt:  VHLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEF

Query:  PCANELELTSSGFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRDIEEGRLLDSPPVYLHGVAV
        PC +ELELTSS FED+AGLPNCCGVISCT                            SIVAGFRGDKDDSTVLMS+TL++DIEE RLL SPPVYLHGVAV
Subjt:  PCANELELTSSGFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRDIEEGRLLDSPPVYLHGVAV

Query:  NQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASL
        NQYL GHG+YPLLPWLM+PFAGAVSGSTEESFN+AHRLM IPALKAI+SLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDF+AMADEWESLASL
Subjt:  NQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASL

Query:  NHSSQYVGVGLNQGSTDEKASVIQRALALRARELHS
        +HSSQYVG+GLN+ S DEKAS+IQ+ALALRARELH+
Subjt:  NHSSQYVGVGLNQGSTDEKASVIQRALALRARELHS

SwissProt top hits (e value, % identity, alignment)
Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1  e value: 4.1e-27  % identity: 28.67
Query:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGL----PLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCANEL
        +F++ FR + +TF ++  L+   L  R P GL       LS E ++ + L RLA+G   +++   FGV +S       +    L    +  + +P ++ +
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGL----PLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCANEL

Query:  ELTSSGFEDLAGLPNCCGVISCTR----FKIIRNSHFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRDIEEGRLLDSPPVYL-H
        E   S FE++ GLPNCCG I  T        ++ S  + D     S+  Q V D   R L++V G+ G    S +L  S  ++  E  ++LD  P  L  
Subjt:  ELTSSGFEDLAGLPNCCGVISCTR----FKIIRNSHFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRDIEEGRLLDSPPVYL-H

Query:  GVAVNQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLR-NWGVLSQPM-HEEFKTAVAYIGACSILHNALLMREDF
        G  + +Y++G   YPLLPWL+ P        +  +FN+ H  +   A  A   L+ +W +LS+ M   + +   + I  C +LHN ++   D+
Subjt:  GVAVNQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLR-NWGVLSQPM-HEEFKTAVAYIGACSILHNALLMREDF

Q9M2U3 Protein ALP1-like  e value: 1.1e-27  % identity: 28.21
Query:  DGDRGRVHLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRV
        DG   R++   T  P +F + F+++  TF+++  L++     +     D  G PL L+   R+ V L RL +G     I E FG+++S       +    
Subjt:  DGDRGRVHLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRV

Query:  LCTNFRFWVEFPCANELELTSSGFEDLAGLPNCCGVISCTRFKIIRNSHFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSST
        +       + +P  ++L+   S FE ++GLPNCCG I  T   I+ N    E             S+  Q VVD   R L ++AG+ G  +D  VL +S 
Subjt:  LCTNFRFWVEFPCANELELTSSGFEDLAGLPNCCGVISCTRFKIIRNSHFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSST

Query:  LYRDIEEGRLLDSPPVYL-HGVAVNQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRN-WGVLSQPMHEEFKTAV-AYIGAC
         Y+ +E+G+ L+   + L     + +Y++G   +PLLPWL+ P+ G  +   +  FNK H      A  A+  L++ W +++  M    +  +   I  C
Subjt:  LYRDIEEGRLLDSPPVYL-HGVAVNQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRN-WGVLSQPMHEEFKTAV-AYIGAC

Query:  SILHNALLMRED
         +LHN ++  ED
Subjt:  SILHNALLMRED

Arabidopsis top hits (e value, % identity, alignment)
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714)  e value: 5.0e-12  % identity: 31.52
Query:  LPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRDIEEGRLLDSPPVYL-HGVAVNQYLLGHGEYPLLPWLM
        LPNC GV+   RF++       + SI  Q +VDS+ R + I AG+        +   + L+   EE  +L   P  L +GV V +Y+LG    PLLPWL+
Subjt:  LPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRDIEEGRLLDSPPVYL-HGVAVNQYLLGHGEYPLLPWLM

Query:  MPFAGAVSGSTEESFNKAHRLMCIPALK----AIVSLR-NWGVLS---QPMHEEFKTAVAYIGACSILHNALLMREDFSAMADE
         P+      S EESF +    +    L     A   +R  W +L    +P   EF   V   G   +LHN L+   D     +E
Subjt:  MPFAGAVSGSTEESFNKAHRLMCIPALK----AIVSLR-NWGVLS---QPMHEEFKTAVAYIGACSILHNALLMREDFSAMADE

AT3G19120.1 PIF / Ping-Pong family of plant transposases  e value: 5.3e-22  % identity: 25.82
Query:  SLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHF-LFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDGDRGRVHLFRTRSPD
        +++S LL L   L P+S   S  S SS  S   ++L    +   L    +A+ LSFL+V+R    + SS +           DGD     V  FR  + D
Subjt:  SLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHF-LFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDGDRGRVHLFRTRSPD

Query:  ------------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTN-FRFW
                     +R+ + ++   F  +   L+P +   +     L L A+  + + LSRLA GC   T++ ++ +   +       + R+L T  +  +
Subjt:  ------------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTN-FRFW

Query:  VEFPCA-NELELTSSGFEDLAGLPNCCGVISCTRFKIIRNS----------HFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRDIEEGR
        ++ P     L  T+ GFE+L  LPN CG I  T  K+ R +           +  D++  Q+V D       +     G +DDS+    S LY+ +  G 
Subjt:  VEFPCA-NELELTSSGFEDLAGLPNCCGVISCTRFKIIRNS----------HFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRDIEEGR

Query:  LLDSPPVYLHGVAVNQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSL--RNWGVLSQPMHEEFKTAVAYIGACSILHN
        ++    + + G  V  Y++G   YPLL +LM PF+   SG+  E+      +     +   + L    W +L Q ++     A   I AC +LHN
Subjt:  LLDSPPVYLHGVAVNQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSL--RNWGVLSQPMHEEFKTAVAYIGACSILHN

AT3G55350.1 PIF / Ping-Pong family of plant transposases  e value: 7.7e-29  % identity: 28.21
Query:  DGDRGRVHLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRV
        DG   R++   T  P +F + F+++  TF+++  L++     +     D  G PL L+   R+ V L RL +G     I E FG+++S       +    
Subjt:  DGDRGRVHLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRV

Query:  LCTNFRFWVEFPCANELELTSSGFEDLAGLPNCCGVISCTRFKIIRNSHFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSST
        +       + +P  ++L+   S FE ++GLPNCCG I  T   I+ N    E             S+  Q VVD   R L ++AG+ G  +D  VL +S 
Subjt:  LCTNFRFWVEFPCANELELTSSGFEDLAGLPNCCGVISCTRFKIIRNSHFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSST

Query:  LYRDIEEGRLLDSPPVYL-HGVAVNQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRN-WGVLSQPMHEEFKTAV-AYIGAC
         Y+ +E+G+ L+   + L     + +Y++G   +PLLPWL+ P+ G  +   +  FNK H      A  A+  L++ W +++  M    +  +   I  C
Subjt:  LYRDIEEGRLLDSPPVYL-HGVAVNQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRN-WGVLSQPMHEEFKTAV-AYIGAC

Query:  SILHNALLMRED
         +LHN ++  ED
Subjt:  SILHNALLMRED

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)  e value: 2.9e-28  % identity: 28.67
Query:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGL----PLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCANEL
        +F++ FR + +TF ++  L+   L  R P GL       LS E ++ + L RLA+G   +++   FGV +S       +    L    +  + +P ++ +
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGL----PLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCANEL

Query:  ELTSSGFEDLAGLPNCCGVISCTR----FKIIRNSHFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRDIEEGRLLDSPPVYL-H
        E   S FE++ GLPNCCG I  T        ++ S  + D     S+  Q V D   R L++V G+ G    S +L  S  ++  E  ++LD  P  L  
Subjt:  ELTSSGFEDLAGLPNCCGVISCTR----FKIIRNSHFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRDIEEGRLLDSPPVYL-H

Query:  GVAVNQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLR-NWGVLSQPM-HEEFKTAVAYIGACSILHNALLMREDF
        G  + +Y++G   YPLLPWL+ P        +  +FN+ H  +   A  A   L+ +W +LS+ M   + +   + I  C +LHN ++   D+
Subjt:  GVAVNQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLR-NWGVLSQPM-HEEFKTAVAYIGACSILHNALLMREDF

AT4G29780.1 unknown protein  e value: 2.5e-19  % identity: 27.18
Query:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCR----VLCTNFRFWVEFPCANE
        D FR  FRM+ STF  +   L+  +  ++ + L   + A  R+GV + RLATG     +SE+FG+  S       ++CR    VL   +  W   P  +E
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCR----VLCTNFRFWVEFPCANE

Query:  LELTSSGFEDLAGLPNCCGVISCTRFKIIR---------NSHFYED------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRD-IEEGRLLD
        +  T + FE +  +PN  G I  T   II          N    E       SI  Q VV++      +  G  G   D  +L  S+L R     G L D
Subjt:  LELTSSGFEDLAGLPNCCGVISCTRFKIIR---------NSHFYED------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRD-IEEGRLLD

Query:  SPPVYLHGVAVNQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLR-NWGVLSQPMHEEFKTAVAYIGACSILHNALLMRED
        S            +++G+  +PL  +L++P+       T+ +FN++   +   A  A   L+  W  L +    + +     +GAC +LHN   MR++
Subjt:  SPPVYLHGVAVNQYLLGHGEYPLLPWLMMPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLR-NWGVLSQPMHEEFKTAVAYIGACSILHNALLMRED


Sequences
CDS sequence
ATGGATTCTCGTCAATTGGCTGCTTTACTCTCTTCTTTGATCTCCCAGCTGCTGCTCCTCCTCTTTCTTCTCTTCCCTTCCTCCAACCCACATTCCCTTTTGTCCAATTC
CTCTTCCGATTCCAATTTCTATGCTAATCTCTTCCCTCTCTTCAACCACTTCCTCTTTTCCCAGGAAATTGCCGCCTCCCTTTCGTTCCTCTCCGTTTCGCGCAAGAGGA
AGAGGACGCATTCGTCGGAGCAGCTCCAATTGGAGCCATCAGATGGCGGCGGCGGAGACGGCGACGGCGACCGTGGACGAGTCCATCTGTTTCGGACTCGGAGTCCTGAT
TCTTTCAGAAACCACTTCAGAATGACTTCCTCGACGTTTGAATGGCTCTCTGGTTTGCTCGAGCCCCTTCTCGAGTGTCGCGACCCGGTGGGTTTGCCTCTCGATCTCTC
CGCCGAGATTCGACTCGGTGTCGGCCTGTCTCGGCTGGCCACCGGCTGCGATTTCTTGACAATTTCGGAGCAATTCGGCGTCTCGGAGTCGGTAGCGAGGTTCTGTGCTA
AGCAATTGTGTCGAGTTCTCTGTACCAATTTTCGCTTCTGGGTCGAATTCCCTTGCGCCAATGAGCTCGAATTAACATCCTCCGGCTTTGAAGATCTTGCTGGGCTTCCG
AATTGCTGTGGCGTGATTTCTTGTACAAGGTTCAAGATCATTAGAAATAGCCATTTTTATGAAGATAGCATCGCTACTCAACTTGTTGTTGATTCCTCGTCACGAATCCT
TAGTATTGTTGCAGGATTTCGTGGCGATAAGGACGACTCGACGGTGCTTATGTCCTCGACGCTGTATAGAGACATTGAAGAAGGAAGGCTTCTGGATTCTCCTCCGGTTT
ACCTTCATGGGGTGGCTGTGAATCAGTACTTGTTGGGACATGGCGAATATCCGTTGCTTCCATGGTTAATGATGCCTTTTGCAGGAGCTGTTTCAGGGTCAACTGAGGAG
AGTTTCAACAAAGCCCATCGATTGATGTGCATTCCAGCTCTGAAAGCAATCGTTAGTTTGAGAAATTGGGGAGTTCTGAGCCAACCAATGCATGAGGAGTTCAAAACTGC
TGTTGCTTATATTGGTGCTTGCTCTATTCTTCATAATGCTTTGTTGATGAGGGAGGATTTTTCTGCCATGGCTGATGAATGGGAGAGCTTAGCTTCACTTAATCATAGTT
CTCAGTATGTTGGGGTTGGATTAAATCAGGGTTCAACTGATGAGAAGGCTTCTGTAATACAGAGGGCATTGGCTCTGAGAGCTAGAGAGCTTCATAGTTAA
mRNA sequence
ATGGATTCTCGTCAATTGGCTGCTTTACTCTCTTCTTTGATCTCCCAGCTGCTGCTCCTCCTCTTTCTTCTCTTCCCTTCCTCCAACCCACATTCCCTTTTGTCCAATTC
CTCTTCCGATTCCAATTTCTATGCTAATCTCTTCCCTCTCTTCAACCACTTCCTCTTTTCCCAGGAAATTGCCGCCTCCCTTTCGTTCCTCTCCGTTTCGCGCAAGAGGA
AGAGGACGCATTCGTCGGAGCAGCTCCAATTGGAGCCATCAGATGGCGGCGGCGGAGACGGCGACGGCGACCGTGGACGAGTCCATCTGTTTCGGACTCGGAGTCCTGAT
TCTTTCAGAAACCACTTCAGAATGACTTCCTCGACGTTTGAATGGCTCTCTGGTTTGCTCGAGCCCCTTCTCGAGTGTCGCGACCCGGTGGGTTTGCCTCTCGATCTCTC
CGCCGAGATTCGACTCGGTGTCGGCCTGTCTCGGCTGGCCACCGGCTGCGATTTCTTGACAATTTCGGAGCAATTCGGCGTCTCGGAGTCGGTAGCGAGGTTCTGTGCTA
AGCAATTGTGTCGAGTTCTCTGTACCAATTTTCGCTTCTGGGTCGAATTCCCTTGCGCCAATGAGCTCGAATTAACATCCTCCGGCTTTGAAGATCTTGCTGGGCTTCCG
AATTGCTGTGGCGTGATTTCTTGTACAAGGTTCAAGATCATTAGAAATAGCCATTTTTATGAAGATAGCATCGCTACTCAACTTGTTGTTGATTCCTCGTCACGAATCCT
TAGTATTGTTGCAGGATTTCGTGGCGATAAGGACGACTCGACGGTGCTTATGTCCTCGACGCTGTATAGAGACATTGAAGAAGGAAGGCTTCTGGATTCTCCTCCGGTTT
ACCTTCATGGGGTGGCTGTGAATCAGTACTTGTTGGGACATGGCGAATATCCGTTGCTTCCATGGTTAATGATGCCTTTTGCAGGAGCTGTTTCAGGGTCAACTGAGGAG
AGTTTCAACAAAGCCCATCGATTGATGTGCATTCCAGCTCTGAAAGCAATCGTTAGTTTGAGAAATTGGGGAGTTCTGAGCCAACCAATGCATGAGGAGTTCAAAACTGC
TGTTGCTTATATTGGTGCTTGCTCTATTCTTCATAATGCTTTGTTGATGAGGGAGGATTTTTCTGCCATGGCTGATGAATGGGAGAGCTTAGCTTCACTTAATCATAGTT
CTCAGTATGTTGGGGTTGGATTAAATCAGGGTTCAACTGATGAGAAGGCTTCTGTAATACAGAGGGCATTGGCTCTGAGAGCTAGAGAGCTTCATAGTTAA
Protein sequence
MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFPLFNHFLFSQEIAASLSFLSVSRKRKRTHSSEQLQLEPSDGGGGDGDGDRGRVHLFRTRSPD
SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGLPLDLSAEIRLGVGLSRLATGCDFLTISEQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCANELELTSSGFEDLAGLP
NCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLYRDIEEGRLLDSPPVYLHGVAVNQYLLGHGEYPLLPWLMMPFAGAVSGSTEE
SFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASLNHSSQYVGVGLNQGSTDEKASVIQRALALRARELHS
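
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code and a CDS that starts in frame; `translate` and `CODON_TABLE` are illustrative names, not part of any CuGenDB tool. It is shown on the first and last few codons of the CDS; the full sequence can be pasted in the same way.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 21 nt and last 15 nt of the CDS listed above.
prefix = translate("ATGGATTCTCGTCAATTGGCT")
suffix = translate("GAGCTTCATAGTTAA")
print(prefix, suffix)
```

The two fragments translate to the first seven and last four residues of the protein sequence above.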