CuGenDBv2

Tan0008800 (gene) of Snake gourd v1 genome

Gene ID: Tan0008800
Organism: Trichosanthes anguina (Snake gourd v1)
Description: DDE Tnp4 domain-containing protein
Genome location: LG01:6603135..6604886
RNA-Seq Expression: Tan0008800
Synteny: Tan0008800
Gene Ontology terms:
  GO:0016021 - integral component of membrane (cellular component)
  GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR027806 - Harbinger transposase-derived nuclease domain
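
The genome location field above can be split into its parts and converted to a span length. The following is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(loc):
    """Split 'SEQID:START..END' into (seqid, start, end, span_in_bp).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    return seqid, start, end, end - start + 1


# Location string copied from the record above.
print(parse_location("LG01:6603135..6604886"))
```

Under the inclusive-coordinate assumption the gene spans 1752 bp on LG01.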


Homology
GenBank top hits (e value, % identity, alignment)
KAA0037135.1 putative nuclease HARBI1 [Cucumis melo var. makuwa] | e value: 2.5e-215 | identity: 89.1%
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFNHFLFSKELAASLSFLSVSRKRKRTHSSEHLELGPSDSGGGDGHGRV-HLFR
        MDS +LAALLSSLISQLLLLLFLLFPSSNPHSL SNS+ DS+FYANLF HFLFS++ AASL FLSVSRKRKRT+  +HLELG S       HGRV HLFR
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFNHFLFSKELAASLSFLSVSRKRKRTHSSEHLELGPSDSGGGDGHGRV-HLFR

Query:  TRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLFRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNE
        TR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL+RLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNE
Subjt:  TRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLFRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNE

Query:  LELTSSAFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYLHGVAVNQYLF
        LELTSSAFEDLAGLPNCCGV+SCTRFKIIRNSHFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTLFKDI  GRLL+SPPVYLHGVAVN+YLF
Subjt:  LELTSSAFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYLHGVAVNQYLF

Query:  GHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASLDHSSQ
        G GEYPLLPWL+VPFA AVSGSTEESFN+AHRLMCIPALKAIVSLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+SLDH SQ
Subjt:  GHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASLDHSSQ

Query:  YVGDGLNQDSTDEKASVIQRALALRARELHS
        YV  GLN DST+EKASVIQRALA RARELHS
Subjt:  YVGDGLNQDSTDEKASVIQRALALRARELHS

KAG6600319.1 Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.8e-205 | identity: 86.18%
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYAN---LFNHFLFSKELAASLSFLSVSRKRKRTHSSEHLELGPSDSGGGD-GHGRVH
        MDSRQLAALLSSLISQLLLLL LLFPSSNPHSLLSNSSSDSNFYAN   LFNHFLFS+++AASLSFLSVSRKRKRTHSSE LELGPSDSGG D G GRVH
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYAN---LFNHFLFSKELAASLSFLSVSRKRKRTHSSEHLELGPSDSGGGD-GHGRVH

Query:  LFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLFRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        L RTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGL RLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLFRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PNELELTSSAFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYLHGVAVNQ
        P+ELELTSS+FED+AGLPNCCGVISCT                            SIVAGFRGDKDDSTVLMS+TLFKDI +GRLL SPPVYLHG+AVNQ
Subjt:  PNELELTSSAFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYLHGVAVNQ

Query:  YLFGHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASLDH
        YLFGHGEYPLLPWLMVPFA AVSGSTEESFN+AHRLMCIPALKAI+SLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDF+AMADEWESLASLDH
Subjt:  YLFGHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASLDH

Query:  SSQYVGDGLNQDSTDEKASVIQRALALRARELHS
        SSQYVG GLN+DS DEKA +IQ+ALALRARELH+
Subjt:  SSQYVGDGLNQDSTDEKASVIQRALALRARELHS

KAG7030976.1 Protein ALP1-like protein [Cucurbita argyrosperma subsp. argyrosperma] | e value: 8.8e-205 | identity: 85.94%
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYAN---LFNHFLFSKELAASLSFLSVSRKRKRTHSSEHLELGPSDSGGGD-GHGRVH
        MDSRQLAALLSSLISQLLLLL LLFPSSNPHSLLSNSSSDSNFYAN   LFNHFLFS+++AASLSFLSVSRKRKRTHSSE LELGPSDSGG D G GRVH
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYAN---LFNHFLFSKELAASLSFLSVSRKRKRTHSSEHLELGPSDSGGGD-GHGRVH

Query:  LFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLFRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        L RTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGL RLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLFRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PNELELTSSAFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYLHGVAVNQ
        P+ELELTSS+FED+AGLPNCCGVISCT                            SIVAGFRGDKDDSTVLMS+ LFKDI +GRLL SPPVYLHG+AVNQ
Subjt:  PNELELTSSAFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYLHGVAVNQ

Query:  YLFGHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASLDH
        YLFGHGEYPLLPWLMVPFA AVSGSTEESFN+AHRLMCIPALKAI+SLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDF+AMADEWESLASLDH
Subjt:  YLFGHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASLDH

Query:  SSQYVGDGLNQDSTDEKASVIQRALALRARELHS
        SSQYVG GLN+DS DEKA +IQ+ALALRARELH+
Subjt:  SSQYVGDGLNQDSTDEKASVIQRALALRARELHS

KGN57516.1 hypothetical protein Csa_011580 [Cucumis sativus] | e value: 2.0e-217 | identity: 89.79%
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFNHFLFSKELAASLSFLSVSRKRKRTHSSEHLELGPSDSGGGDGHGRV-HLFR
        MDS +LAALLSSLISQLLLLLFLLFPSSNPHSL SNS+ DS+FYANLF HFLFS++ AASL FLSVSRKRKRT+ S+HLELG S       HGRV HLFR
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFNHFLFSKELAASLSFLSVSRKRKRTHSSEHLELGPSDSGGGDGHGRV-HLFR

Query:  TRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLFRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNE
        TR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL+RLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNE
Subjt:  TRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLFRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNE

Query:  LELTSSAFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYLHGVAVNQYLF
        LELTSSAFEDLAGLPNCCGV+SCTRFKIIRNSHFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTLFKDI  GRLL+SPPVYLHGVAVN+YLF
Subjt:  LELTSSAFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYLHGVAVNQYLF

Query:  GHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASLDHSSQ
        GHGEYPLLPWL+VPFA AVSGSTEESFN+AHRLMCIPALKAIVSLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+SLDH SQ
Subjt:  GHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASLDHSSQ

Query:  YVGDGLNQDSTDEKASVIQRALALRARELHS
        YV  GLN DST+EKASVIQRALALRARELHS
Subjt:  YVGDGLNQDSTDEKASVIQRALALRARELHS

XP_023536005.1 protein ALP1-like [Cucurbita pepo subsp. pepo] | e value: 2.1e-206 | identity: 86.64%
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYAN---LFNHFLFSKELAASLSFLSVSRKRKRTHSSEHLELGPSDSGGGD-GHGRVH
        MDSRQLAALLSSLISQLLLLL LLFPSSNPHSLLSNSSSDSNFYAN   LFNHFLFS+++AASLSFLSVSRKRKRTHSSE LELGPSDSGG D G GRVH
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYAN---LFNHFLFSKELAASLSFLSVSRKRKRTHSSEHLELGPSDSGGGD-GHGRVH

Query:  LFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLFRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        L RTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGL RLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLFRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PNELELTSSAFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYLHGVAVNQ
        P+ELELTSSAFED+AGLPNCCGVISCT                            SIVAGFRGDKDDSTVLMS+TLFKDI +GRLL SPPVYLHG+AVNQ
Subjt:  PNELELTSSAFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYLHGVAVNQ

Query:  YLFGHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASLDH
        YLFGHGEYPLLPWLMVPFA AVSGSTEESFN+AHRLMCIPALKAI+SLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDF+AMADEWESLASLDH
Subjt:  YLFGHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASLDH

Query:  SSQYVGDGLNQDSTDEKASVIQRALALRARELHS
        SSQYVG GLN+DS DEKAS+IQ+ALALRARELH+
Subjt:  SSQYVGDGLNQDSTDEKASVIQRALALRARELHS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LBX6 DDE Tnp4 domain-containing protein | e value: 9.8e-218 | identity: 89.79%
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFNHFLFSKELAASLSFLSVSRKRKRTHSSEHLELGPSDSGGGDGHGRV-HLFR
        MDS +LAALLSSLISQLLLLLFLLFPSSNPHSL SNS+ DS+FYANLF HFLFS++ AASL FLSVSRKRKRT+ S+HLELG S       HGRV HLFR
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFNHFLFSKELAASLSFLSVSRKRKRTHSSEHLELGPSDSGGGDGHGRV-HLFR

Query:  TRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLFRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNE
        TR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL+RLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNE
Subjt:  TRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLFRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNE

Query:  LELTSSAFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYLHGVAVNQYLF
        LELTSSAFEDLAGLPNCCGV+SCTRFKIIRNSHFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTLFKDI  GRLL+SPPVYLHGVAVN+YLF
Subjt:  LELTSSAFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYLHGVAVNQYLF

Query:  GHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASLDHSSQ
        GHGEYPLLPWL+VPFA AVSGSTEESFN+AHRLMCIPALKAIVSLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+SLDH SQ
Subjt:  GHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASLDHSSQ

Query:  YVGDGLNQDSTDEKASVIQRALALRARELHS
        YV  GLN DST+EKASVIQRALALRARELHS
Subjt:  YVGDGLNQDSTDEKASVIQRALALRARELHS

A0A5D3CRB2 Putative nuclease HARBI1 | e value: 1.2e-215 | identity: 89.1%
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFNHFLFSKELAASLSFLSVSRKRKRTHSSEHLELGPSDSGGGDGHGRV-HLFR
        MDS +LAALLSSLISQLLLLLFLLFPSSNPHSL SNS+ DS+FYANLF HFLFS++ AASL FLSVSRKRKRT+  +HLELG S       HGRV HLFR
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFNHFLFSKELAASLSFLSVSRKRKRTHSSEHLELGPSDSGGGDGHGRV-HLFR

Query:  TRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLFRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNE
        TR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL+RLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNE
Subjt:  TRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLFRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNE

Query:  LELTSSAFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYLHGVAVNQYLF
        LELTSSAFEDLAGLPNCCGV+SCTRFKIIRNSHFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTLFKDI  GRLL+SPPVYLHGVAVN+YLF
Subjt:  LELTSSAFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYLHGVAVNQYLF

Query:  GHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASLDHSSQ
        G GEYPLLPWL+VPFA AVSGSTEESFN+AHRLMCIPALKAIVSLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+SLDH SQ
Subjt:  GHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASLDHSSQ

Query:  YVGDGLNQDSTDEKASVIQRALALRARELHS
        YV  GLN DST+EKASVIQRALA RARELHS
Subjt:  YVGDGLNQDSTDEKASVIQRALALRARELHS

A0A6J1D7F1 protein ALP1-like | e value: 3.5e-199 | identity: 83.37%
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLF---NHFLFSKELAASLSFLSVSRKRKRTHSSEHLELGPSDSGGGDGHGRVHL
        MDSR+LAAL+SSLISQLLL LFLLFPSSNPHSLLSN  SDS+FYAN F    HFLFS+E+A+SLSFLSVSRKRKRTH  E LEL PS  GGG G GRVHL
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLF---NHFLFSKELAASLSFLSVSRKRKRTHSSEHLELGPSDSGGGDGHGRVHL

Query:  FRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLFRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCP
          TR PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPL+LSAEIRLGVGL RLATGCDFSTIS+QFGVSESVARFCAKQLCRVLCTNFRFWVEFPCP
Subjt:  FRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLFRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCP

Query:  NELELTSSAFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYLHGVAVNQY
        NELE TSSAFE LAGLPNCCGV++CT                            SIVAGFRGDKDDSTVLMSSTLFKDI +GRLLDSPPVYLHG+AVNQY
Subjt:  NELELTSSAFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYLHGVAVNQY

Query:  LFGHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASLDHS
         FGHGEYPLLPWLMVPF+ AVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPM EEFKTAVAYIGACSILHNALLMREDFSAMADEWE LASLDH 
Subjt:  LFGHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASLDHS

Query:  SQYVGDGLNQDSTDEKASVIQRALALRARELHS
        SQY+G+GLN+DSTDEKASVIQRALALRARELHS
Subjt:  SQYVGDGLNQDSTDEKASVIQRALALRARELHS

A0A6J1FNZ2 protein ALP1-like | e value: 7.3e-205 | identity: 85.71%
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYAN---LFNHFLFSKELAASLSFLSVSRKRKRTHSSEHLELGPSDSGGGD-GHGRVH
        MDSRQLAALLSSLISQLLLLL LLFPSSNPHSLLSNSSSDSNFYAN   LFNHFLFS+++AASLSFLSVSRKRKRTHSSE LELGPSDSGG D G GRVH
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYAN---LFNHFLFSKELAASLSFLSVSRKRKRTHSSEHLELGPSDSGGGD-GHGRVH

Query:  LFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLFRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        L RTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGL RLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLFRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PNELELTSSAFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYLHGVAVNQ
        P+ELELTSS+FED+AGLPNCCGVISCT                            SIVAGFRGDKDDSTVLMS+TLFKDI + RLL SPPVYLHG+AVNQ
Subjt:  PNELELTSSAFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYLHGVAVNQ

Query:  YLFGHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASLDH
        YLFGHGEYPLLPWLMVPFA AVSGSTEESFN+AHRLMCIPALKAI+SLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDF+AMADEWESLASLDH
Subjt:  YLFGHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASLDH

Query:  SSQYVGDGLNQDSTDEKASVIQRALALRARELHS
        SSQYVG GLN+DS DEKA+++Q+ALALRARELH+
Subjt:  SSQYVGDGLNQDSTDEKASVIQRALALRARELHS

A0A6J1J0M5 protein ALP1-like | e value: 1.2e-204 | identity: 86.18%
Query:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYAN---LFNHFLFSKELAASLSFLSVSRKRKRTHSSEHLELGPSDSGGGD-GHGRVH
        MDSRQLAALLSSLISQLLLLL LLFPSSNPHSLLSNSSSDSNFYAN   LFNHFLFS+++AASLSFLSVSRKRKRTHSSE LELGPSDSGG D G GRVH
Subjt:  MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYAN---LFNHFLFSKELAASLSFLSVSRKRKRTHSSEHLELGPSDSGGGD-GHGRVH

Query:  LFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLFRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
        L RTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGL RLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  LFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLFRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  PNELELTSSAFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYLHGVAVNQ
        P+ELELTSSAFED+AGLPNCCGVISCT                            SIVAGFRGDKDDSTVLMS+TLFKDI + RLL SPPVYLHGVAVNQ
Subjt:  PNELELTSSAFEDLAGLPNCCGVISCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYLHGVAVNQ

Query:  YLFGHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASLDH
        YLFGHG+YPLLPWLMVPFA AVSGSTEESFN+AHRLM IPALKAI+SLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDF+AMADEWESLASLDH
Subjt:  YLFGHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASLDH

Query:  SSQYVGDGLNQDSTDEKASVIQRALALRARELHS
        SSQYVG GLN+DS DEKAS+IQ+ALALRARELH+
Subjt:  SSQYVGDGLNQDSTDEKASVIQRALALRARELHS

SwissProt top hits (e value, % identity, alignment)
Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 | e value: 4.5e-26 | identity: 28.67%
Query:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSAEIRLGVGLFRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNEL
        +F++ FR + +TF ++  L+   L  R P G        LS E ++ + L RLA+G    ++   FGV +S       +    L    +  + +P  + +
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSAEIRLGVGLFRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNEL

Query:  ELTSSAFEDLAGLPNCCGVISCTR----FKIIRNSHFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYL-H
        E   S FE++ GLPNCCG I  T        ++ S  + D     S+  Q V D   R L++V G+ G    S +L  S  FK   + ++LD  P  L  
Subjt:  ELTSSAFEDLAGLPNCCGVISCTR----FKIIRNSHFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYL-H

Query:  GVAVNQYLFGHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCIPALKAIVSLR-NWGVLSQPM-HEEFKTAVAYIGACSILHNALLMREDF
        G  + +Y+ G   YPLLPWL+ P        +  +FN+ H  +   A  A   L+ +W +LS+ M   + +   + I  C +LHN ++   D+
Subjt:  GVAVNQYLFGHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCIPALKAIVSLR-NWGVLSQPM-HEEFKTAVAYIGACSILHNALLMREDF

Q9M2U3 Protein ALP1-like | e value: 5.3e-27 | identity: 26.93%
Query:  LAASLSFLSVSRKRKRTHSSEHLELGPSDSGGGDGHGRVHLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSAEIRLGVGLF
        LAA+ +  S S      ++ +  +         DG  R     +  P +F + F+++  TF+++  L++     +     D  G+PL L+   R+ V L 
Subjt:  LAASLSFLSVSRKRKRTHSSEHLELGPSDSGGGDGHGRVHLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSAEIRLGVGLF

Query:  RLATGCDFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVISCTRFKIIRNSHFYED----------
        RL +G   S I + FG+++S       RF      R +  +   W     P++L+   S FE ++GLPNCCG I  T   I+ N    E           
Subjt:  RLATGCDFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVISCTRFKIIRNSHFYED----------

Query:  --SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYL-HGVAVNQYLFGHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLM
          S+  Q VVD   R L ++AG+ G  +D  VL +S  +K +  G+ L+   + L     + +Y+ G   +PLLPWL+ P+    +   +  FNK H   
Subjt:  --SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYL-HGVAVNQYLFGHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLM

Query:  CIPALKAIVSLRN-WGVLSQPMHEEFKTAV-AYIGACSILHNALLMRED
           A  A+  L++ W +++  M    +  +   I  C +LHN ++  ED
Subjt:  CIPALKAIVSLRN-WGVLSQPMHEEFKTAV-AYIGACSILHNALLMRED

Arabidopsis top hits (e value, % identity, alignment)
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714) | e value: 1.2e-13 | identity: 26.67%
Query:  ANLFNHFLFSKELAASLSFLSVSRKRKRTHSSEHLELGPSDSGGGDG------HGRVHLFRTRSPDSFR--NHFRMTSSTFEWLSGLLEPLLECRDPVGS
        A + N FL + +L     FLS S+  +       L + PS S             R     T   D  R   +FRM+ STF  L  +L            
Subjt:  ANLFNHFLFSKELAASLSFLSVSRKRKRTHSSEHLELGPSDSGGGDG------HGRVHLFRTRSPDSFR--NHFRMTSSTFEWLSGLLEPLLECRDPVGS

Query:  PLDLSAEIRLGVGLFRLATGCDFSTISDQFGV-SESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVISCTRFKIIRNSHFYE
            S+       +FRLA G  +  +  +FG  S S A      +C+++       ++ P P+        F     LPNC GV+   RF++       +
Subjt:  PLDLSAEIRLGVGLFRLATGCDFSTISDQFGV-SESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVISCTRFKIIRNSHFYE

Query:  DSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYLHGVAVNQYLFGHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCI
         SI  Q +VDS+ R + I AG+        +   + LF  I +  L  +P    +GV V +Y+ G    PLLPWL+ P+      S EESF +    +  
Subjt:  DSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYLHGVAVNQYLFGHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCI

Query:  PALK----AIVSLR-NWGVLS---QPMHEEFKTAVAYIGACSILHNALLMREDFSAMADE
          L     A   +R  W +L    +P   EF   V   G   +LHN L+   D     +E
Subjt:  PALK----AIVSLR-NWGVLS---QPMHEEFKTAVAYIGACSILHNALLMREDFSAMADE

AT3G19120.1 PIF / Ping-Pong family of plant transposases | e value: 1.5e-21 | identity: 25.77%
Query:  SLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFNHF----LFSKELAASLSFLSVSRKRKRTHSSEHLELGPSDSGGGDGHGRVHLFRTRSPD---
        +++S LL L   L P+S   S  S SS  S   ++L +      L    LA+ LSFL+V+R    + SS             DG   V  FR  + D   
Subjt:  SLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFNHF----LFSKELAASLSFLSVSRKRKRTHSSEHLELGPSDSGGGDGHGRVHLFRTRSPD---

Query:  ---------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLFRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTN-FRFWVEF
                  +R+ + ++   F  +   L+P +       S L L A+  + + L RLA GC   T++ ++ +   +       + R+L T  +  +++ 
Subjt:  ---------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLFRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTN-FRFWVEF

Query:  PC-PNELELTSSAFEDLAGLPNCCGVISCTRFKIIRNS----------HFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLD
        P     L  T+  FE+L  LPN CG I  T  K+ R +           +  D++  Q+V D       +     G +DDS+    S L+K +  G ++ 
Subjt:  PC-PNELELTSSAFEDLAGLPNCCGVISCTRFKIIRNS----------HFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLD

Query:  SPPVYLHGVAVNQYLFGHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCIPALKAIVSL--RNWGVLSQPMHEEFKTAVAYIGACSILHN
           + + G  V  Y+ G   YPLL +LM PF+   SG+  E+      +     +   + L    W +L Q ++     A   I AC +LHN
Subjt:  SPPVYLHGVAVNQYLFGHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCIPALKAIVSL--RNWGVLSQPMHEEFKTAVAYIGACSILHN

AT3G55350.1 PIF / Ping-Pong family of plant transposases | e value: 3.8e-28 | identity: 26.93%
Query:  LAASLSFLSVSRKRKRTHSSEHLELGPSDSGGGDGHGRVHLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSAEIRLGVGLF
        LAA+ +  S S      ++ +  +         DG  R     +  P +F + F+++  TF+++  L++     +     D  G+PL L+   R+ V L 
Subjt:  LAASLSFLSVSRKRKRTHSSEHLELGPSDSGGGDGHGRVHLFRTRSPDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSAEIRLGVGLF

Query:  RLATGCDFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVISCTRFKIIRNSHFYED----------
        RL +G   S I + FG+++S       RF      R +  +   W     P++L+   S FE ++GLPNCCG I  T   I+ N    E           
Subjt:  RLATGCDFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVISCTRFKIIRNSHFYED----------

Query:  --SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYL-HGVAVNQYLFGHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLM
          S+  Q VVD   R L ++AG+ G  +D  VL +S  +K +  G+ L+   + L     + +Y+ G   +PLLPWL+ P+    +   +  FNK H   
Subjt:  --SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYL-HGVAVNQYLFGHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLM

Query:  CIPALKAIVSLRN-WGVLSQPMHEEFKTAV-AYIGACSILHNALLMRED
           A  A+  L++ W +++  M    +  +   I  C +LHN ++  ED
Subjt:  CIPALKAIVSLRN-WGVLSQPMHEEFKTAV-AYIGACSILHNALLMRED

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912) | e value: 3.2e-27 | identity: 28.67%
Query:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSAEIRLGVGLFRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNEL
        +F++ FR + +TF ++  L+   L  R P G        LS E ++ + L RLA+G    ++   FGV +S       +    L    +  + +P  + +
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSAEIRLGVGLFRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNEL

Query:  ELTSSAFEDLAGLPNCCGVISCTR----FKIIRNSHFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYL-H
        E   S FE++ GLPNCCG I  T        ++ S  + D     S+  Q V D   R L++V G+ G    S +L  S  FK   + ++LD  P  L  
Subjt:  ELTSSAFEDLAGLPNCCGVISCTR----FKIIRNSHFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYL-H

Query:  GVAVNQYLFGHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCIPALKAIVSLR-NWGVLSQPM-HEEFKTAVAYIGACSILHNALLMREDF
        G  + +Y+ G   YPLLPWL+ P        +  +FN+ H  +   A  A   L+ +W +LS+ M   + +   + I  C +LHN ++   D+
Subjt:  GVAVNQYLFGHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCIPALKAIVSLR-NWGVLSQPM-HEEFKTAVAYIGACSILHNALLMREDF

AT4G29780.1 unknown protein | e value: 1.9e-19 | identity: 26.57%
Query:  SDSGGGDGHGRV-----------HLFRTRSP-DSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSAEIRLGVGLFRLATGCDFSTISDQF
        S SG G  H R+            + R   P D FR  FRM+ STF  +   L+  +       RD + +P       R+GV ++RLATG     +S++F
Subjt:  SDSGGGDGHGRV-----------HLFRTRSP-DSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSAEIRLGVGLFRLATGCDFSTISDQF

Query:  GVSESVARFCAKQLCR----VLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVISCTRFKIIR---------NSHFYED------SIATQLVVDSS
        G+  S       ++CR    VL   +  W   P  +E+  T + FE +  +PN  G I  T   II          N    E       SI  Q VV++ 
Subjt:  GVSESVARFCAKQLCR----VLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVISCTRFKIIR---------NSHFYED------SIATQLVVDSS

Query:  SRILSIVAGFRGDKDDSTVLMSSTLFKD-IGDGRLLDSPPVYLHGVAVNQYLFGHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCIPALKAIVSLR-
             +  G  G   D  +L  S+L +     G L DS            ++ G+  +PL  +L+VP+       T+ +FN++   +   A  A   L+ 
Subjt:  SRILSIVAGFRGDKDDSTVLMSSTLFKD-IGDGRLLDSPPVYLHGVAVNQYLFGHGEYPLLPWLMVPFAEAVSGSTEESFNKAHRLMCIPALKAIVSLR-

Query:  NWGVLSQPMHEEFKTAVAYIGACSILHNALLMRED
         W  L +    + +     +GAC +LHN   MR++
Subjt:  NWGVLSQPMHEEFKTAVAYIGACSILHNALLMRED


Sequences
CDS sequence
ATGGATTCCCGTCAATTGGCTGCTTTACTCTCTTCTCTGATCTCCCAACTCCTCCTCCTCCTCTTTCTCCTCTTCCCTTCCTCCAACCCACATTCCCTTTTGTCCAATTC
CTCTTCCGATTCCAATTTCTATGCTAATCTCTTCAACCACTTCCTCTTTTCCAAGGAACTTGCCGCCTCCCTTTCCTTTCTCTCCGTTTCGCGTAAGAGGAAGAGGACGC
ATTCCTCGGAGCACCTCGAATTGGGGCCATCCGATAGCGGCGGCGGCGATGGCCATGGACGAGTCCATCTGTTTCGGACTCGAAGTCCTGATTCCTTCAGAAATCACTTC
AGAATGACCTCCTCAACGTTTGAATGGCTCTCTGGTTTGCTCGAGCCCCTTCTCGAGTGTCGCGACCCGGTAGGTTCGCCTCTCGATCTCTCCGCCGAGATTCGACTCGG
CGTCGGCCTGTTCCGGCTGGCCACCGGCTGCGATTTCTCGACAATTTCGGACCAATTTGGCGTCTCGGAGTCGGTAGCGAGGTTCTGTGCTAAGCAATTGTGTCGAGTTC
TCTGTACCAATTTTCGCTTCTGGGTTGAATTCCCTTGCCCCAATGAGCTCGAATTAACATCCTCGGCCTTTGAAGATCTTGCTGGGCTTCCGAATTGCTGTGGCGTGATT
TCTTGTACAAGGTTCAAGATCATTAGAAATAGCCATTTTTATGAAGATAGCATCGCTACTCAACTTGTTGTTGATTCCTCGTCACGAATCCTTAGTATTGTTGCAGGATT
TCGTGGCGATAAGGACGACTCAACGGTGCTTATGTCCTCGACTCTGTTTAAAGACATTGGAGATGGAAGGCTTCTGGATTCTCCTCCGGTTTACCTTCATGGGGTGGCTG
TGAATCAGTACTTGTTTGGACATGGCGAATACCCTTTGCTTCCATGGTTAATGGTGCCTTTTGCAGAAGCTGTTTCAGGGTCAACTGAAGAGAGTTTCAATAAAGCTCAC
CGATTGATGTGCATTCCAGCTCTGAAAGCAATCGTTAGTTTGAGAAATTGGGGAGTTTTGAGCCAACCAATGCATGAGGAGTTCAAAACTGCTGTTGCTTACATTGGTGC
TTGCTCAATTCTTCACAATGCTTTGTTGATGAGGGAGGATTTTTCTGCCATGGCTGATGAATGGGAGAGCTTAGCTTCACTTGATCATAGCTCTCAGTATGTTGGGGATG
GATTGAATCAGGATTCAACTGACGAGAAGGCTTCTGTGATACAGAGGGCATTGGCTCTGAGAGCTAGAGAGCTTCATAGTTAA
mRNA sequence
AAATAAAGAATGAAATTGAAAGCCACGTGAAACCAAAAACAGACTCGAGAGCATTGTCTGAGCAAGCAGCCAGGTGTTCCTATGGCGTAATTTCTACAAGATTCACACCT
TCCACGTCCGGCGCTGACAATAACCAACAACCCATTTCTCTCTCTCTCTCTCTCTATAGCAGACGCTCGATTCTCGAGAGAATCGAATCCCAATTCCCATCCCCGACAAT
TGCAGCTGCAGCTGCAACTGCAACTGCAACTGACACTCACCCATTAATGGATTCCCGTCAATTGGCTGCTTTACTCTCTTCTCTGATCTCCCAACTCCTCCTCCTCCTCT
TTCTCCTCTTCCCTTCCTCCAACCCACATTCCCTTTTGTCCAATTCCTCTTCCGATTCCAATTTCTATGCTAATCTCTTCAACCACTTCCTCTTTTCCAAGGAACTTGCC
GCCTCCCTTTCCTTTCTCTCCGTTTCGCGTAAGAGGAAGAGGACGCATTCCTCGGAGCACCTCGAATTGGGGCCATCCGATAGCGGCGGCGGCGATGGCCATGGACGAGT
CCATCTGTTTCGGACTCGAAGTCCTGATTCCTTCAGAAATCACTTCAGAATGACCTCCTCAACGTTTGAATGGCTCTCTGGTTTGCTCGAGCCCCTTCTCGAGTGTCGCG
ACCCGGTAGGTTCGCCTCTCGATCTCTCCGCCGAGATTCGACTCGGCGTCGGCCTGTTCCGGCTGGCCACCGGCTGCGATTTCTCGACAATTTCGGACCAATTTGGCGTC
TCGGAGTCGGTAGCGAGGTTCTGTGCTAAGCAATTGTGTCGAGTTCTCTGTACCAATTTTCGCTTCTGGGTTGAATTCCCTTGCCCCAATGAGCTCGAATTAACATCCTC
GGCCTTTGAAGATCTTGCTGGGCTTCCGAATTGCTGTGGCGTGATTTCTTGTACAAGGTTCAAGATCATTAGAAATAGCCATTTTTATGAAGATAGCATCGCTACTCAAC
TTGTTGTTGATTCCTCGTCACGAATCCTTAGTATTGTTGCAGGATTTCGTGGCGATAAGGACGACTCAACGGTGCTTATGTCCTCGACTCTGTTTAAAGACATTGGAGAT
GGAAGGCTTCTGGATTCTCCTCCGGTTTACCTTCATGGGGTGGCTGTGAATCAGTACTTGTTTGGACATGGCGAATACCCTTTGCTTCCATGGTTAATGGTGCCTTTTGC
AGAAGCTGTTTCAGGGTCAACTGAAGAGAGTTTCAATAAAGCTCACCGATTGATGTGCATTCCAGCTCTGAAAGCAATCGTTAGTTTGAGAAATTGGGGAGTTTTGAGCC
AACCAATGCATGAGGAGTTCAAAACTGCTGTTGCTTACATTGGTGCTTGCTCAATTCTTCACAATGCTTTGTTGATGAGGGAGGATTTTTCTGCCATGGCTGATGAATGG
GAGAGCTTAGCTTCACTTGATCATAGCTCTCAGTATGTTGGGGATGGATTGAATCAGGATTCAACTGACGAGAAGGCTTCTGTGATACAGAGGGCATTGGCTCTGAGAGC
TAGAGAGCTTCATAGTTAAAATTTCAATAACAAGAATCCAGTTTTTTGGTGGAATAAGTACAGCTCATTTGAAAGAGATTTCATCTCTTAGGATATTTATTGAAGGCAGC
TCATCCAGCTCCATTCAACGATTACTATTGTAGGTAATTGATGATTCTTTTACAAAACACTATAACAACACGTTCTCCCAGATTTTGGATGTTCAGCTTTTT
Protein sequence
MDSRQLAALLSSLISQLLLLLFLLFPSSNPHSLLSNSSSDSNFYANLFNHFLFSKELAASLSFLSVSRKRKRTHSSEHLELGPSDSGGGDGHGRVHLFRTRSPDSFRNHF
RMTSSTFEWLSGLLEPLLECRDPVGSPLDLSAEIRLGVGLFRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVI
SCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIGDGRLLDSPPVYLHGVAVNQYLFGHGEYPLLPWLMVPFAEAVSGSTEESFNKAH
RLMCIPALKAIVSLRNWGVLSQPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASLDHSSQYVGDGLNQDSTDEKASVIQRALALRARELHS