CuGenDBv2

Cla97C05G092530 (gene) of Watermelon (97103) v2.5 genome

Gene ID: Cla97C05G092530
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: DDE Tnp4 domain-containing protein
Genome location: Cla97Chr05:10622142..10623425
RNA-Seq Expression: Cla97C05G092530
Synteny: Cla97C05G092530
Gene Ontology terms:
  GO:0016021 - integral component of membrane (cellular component)
  GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR027806 - Harbinger transposase-derived nuclease domain
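The genome location field above follows a `seqid:start..end` pattern. The sketch below shows one way to split it into parts and compute the span length. The names `Location` and `parse_location` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints count.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string such as the Genome location field."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Location(seqid, start, end)


loc = parse_location("Cla97Chr05:10622142..10623425")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the interval for this gene spans 1284 bp.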


Homology
GenBank top hits (e value, % identity, alignment)
KAA0037135.1 putative nuclease HARBI1 [Cucumis melo var. makuwa]; e value: 8.7e-221; % identity: 91.57
Query:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNRSDHLELEPSHGRVHHLFRTRTP
        MDSPRLAALLSSLISQL LLLFLLFPSSNP SLFSNST DS+FY N   LFTHFLFSQ+FAASL FLSVSRKRKRTN  DHLEL  SHGRVHHLFRTRTP
Subjt:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNRSDHLELEPSHGRVHHLFRTRTP

Query:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT
        DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGC+FSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELT
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT

Query:  SSAFEDLAGIPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE
        SSAFEDLAG+PNCCGVVSCTRFKIIRNS+FYEDS+ATQLVVDSSSRILSIVAGFRG+KDD+TVLMSSTLFKDIEQGRLL++P VYLHGVAVN+YL G GE
Subjt:  SSAFEDLAGIPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE

Query:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET
        YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S D RSQYVE 
Subjt:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET

Query:  RLNDDSTNEKTSLIQRALALRARELHS
         LN DSTNEK S+IQRALA RARELHS
Subjt:  RLNDDSTNEKTSLIQRALALRARELHS
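In the alignment blocks on this page, the middle line between `Query:` and `Subjt:` marks identical residues with the letter itself, conservative substitutions with `+`, and mismatches or gaps with a space. The sketch below counts identities and positives for one segment under that reading. The function name `alignment_stats` is hypothetical, and the percentage it prints applies to the single segment passed in, so it is not expected to reproduce the whole-alignment % identity reported for the hit.

```python
def alignment_stats(query: str, midline: str) -> dict:
    """Count identities and positives from a BLAST-style midline.

    The midline is padded to the query length because trailing spaces
    (mismatches at the end of a block) are often lost when text is copied.
    """
    length = len(query)
    midline = midline.ljust(length)[:length]
    identities = sum(ch.isalpha() for ch in midline)   # letter = identical residue
    positives = identities + midline.count("+")        # '+' = conservative change
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "percent_identity": 100.0 * identities / length if length else 0.0,
    }


# Final segment of the first GenBank hit, copied from the block above.
segment = alignment_stats(
    "RLNDDSTNEKTSLIQRALALRARELHS",
    " LN DSTNEK S+IQRALA RARELHS",
)
print(segment)
```

To approximate the reported figure, the counts from every segment of a hit would need to be summed before dividing.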

KGN57516.1 hypothetical protein Csa_011580 [Cucumis sativus]; e value: 2.7e-222; % identity: 91.8
Query:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNRSDHLELEPSHGRVHHLFRTRTP
        MDSPRLAALLSSLISQL LLLFLLFPSSNP SLFSNS  DS+FY N   LF HFLFSQ+FAASL FLSVSRKRKRTNRSDHLEL  SHGRVHHLFRTRTP
Subjt:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNRSDHLELEPSHGRVHHLFRTRTP

Query:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT
        DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGC+FSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELT
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT

Query:  SSAFEDLAGIPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE
        SSAFEDLAG+PNCCGVVSCTRFKIIRNS+FYEDS+ATQLVVDSSSRILSIVAGFRG+KDD+TVLMSSTLFKDIEQGRLL++P VYLHGVAVN+YL GHGE
Subjt:  SSAFEDLAGIPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE

Query:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET
        YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S D +SQYVE 
Subjt:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET

Query:  RLNDDSTNEKTSLIQRALALRARELHS
         LN DSTNEK S+IQRALALRARELHS
Subjt:  RLNDDSTNEKTSLIQRALALRARELHS

XP_008456140.1 PREDICTED: uncharacterized protein LOC103496169 [Cucumis melo]; e value: 9.7e-196; % identity: 91.94
Query:  FSQEFAASLSFLSVSRKRKRTNRSDHLELEPSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCN
        F + FAASL FLSVSRKRKRTN  DHLEL  SHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGC+
Subjt:  FSQEFAASLSFLSVSRKRKRTNRSDHLELEPSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCN

Query:  FSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGIPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFR
        FSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAG+PNCCGVVSCTRFKIIRNS+FYEDS+ATQLVVDSSSRILSIVAGFR
Subjt:  FSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGIPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFR

Query:  GDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
        G+KDD+TVLMSSTLFKDIEQGRLL++P VYLHGVAVN+YL G GEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
Subjt:  GDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF

Query:  KTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVETRLNDDSTNEKTSLIQRALALRARELHS
        KTAVAYIGACSILHNALLMREDFSAMADEWESL+S D RSQYVE  LN DSTNEK S+IQRALA RARELHS
Subjt:  KTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVETRLNDDSTNEKTSLIQRALALRARELHS

XP_023536005.1 protein ALP1-like [Cucurbita pepo subsp. pepo]; e value: 3.6e-190; % identity: 79.77
Query:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNRSDHLELEPS--------HGRVH
        MDS +LAALLSSLISQL LLL LLFPSSNP SL SNS+SDSNFY NLFPLF HFLFSQ+ AASLSFLSVSRKRKRT+ S+ LEL PS         GRV 
Subjt:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNRSDHLELEPS--------HGRVH

Query:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
        HL RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGC+FSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP

Query:  CPNELELTSSAFEDLAGIPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVN
        CP+ELELTSSAFED+AG+PNCCGV+SCT                            SIVAGFRGDKDD+TVLMS+TLFKDIE+GRLL +P VYLHG+AVN
Subjt:  CPNELELTSSAFEDLAGIPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVN

Query:  QYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD
        QYL GHGEYPLLPWL+VPFAGAVSGSTEESFNEAHRLMCIPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESLAS D
Subjt:  QYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD

Query:  QRSQYVETRLNDDSTNEKTSLIQRALALRARELHS
          SQYV   LN+DS +EK S+IQ+ALALRARELH+
Subjt:  QRSQYVETRLNDDSTNEKTSLIQRALALRARELHS

XP_038880641.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like [Benincasa hispida]; e value: 4.1e-202; % identity: 85.25
Query:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNRSDHLELEPSHGRVHHLFRTRTP
        MDSP+LAALLSSLISQL LLLFLLFPSSNP SLFSNST DSNFY N   LFTHFLFSQ+FAASL FLSVSRKRKRTN SDHLEL  SHGR  HLFRTRTP
Subjt:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNRSDHLELEPSHGRVHHLFRTRTP

Query:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT
        DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGC+FS IS QFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT

Query:  SSAFEDLAGIPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE
        SSAFEDLAG+PNCCGVV+CT                            SIVAGFRGDKDD+TVLMSSTLFKDIEQGRLLDAP VYLHGVAVNQYL GHGE
Subjt:  SSAFEDLAGIPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE

Query:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET
        YPLLPWL++PFAGAVSGSTEESFN+AHRLMCIPALKAIVSLRNWGVLS+P+HEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD RSQYVE 
Subjt:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET

Query:  RLNDDSTNEKTSLIQRALALRARELHS
        +LN+DSTNEK S+IQRALA+RARELHS
Subjt:  RLNDDSTNEKTSLIQRALALRARELHS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LBX6 DDE Tnp4 domain-containing protein; e value: 1.3e-222; % identity: 91.8
Query:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNRSDHLELEPSHGRVHHLFRTRTP
        MDSPRLAALLSSLISQL LLLFLLFPSSNP SLFSNS  DS+FY N   LF HFLFSQ+FAASL FLSVSRKRKRTNRSDHLEL  SHGRVHHLFRTRTP
Subjt:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNRSDHLELEPSHGRVHHLFRTRTP

Query:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT
        DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGC+FSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELT
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT

Query:  SSAFEDLAGIPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE
        SSAFEDLAG+PNCCGVVSCTRFKIIRNS+FYEDS+ATQLVVDSSSRILSIVAGFRG+KDD+TVLMSSTLFKDIEQGRLL++P VYLHGVAVN+YL GHGE
Subjt:  SSAFEDLAGIPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE

Query:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET
        YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S D +SQYVE 
Subjt:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET

Query:  RLNDDSTNEKTSLIQRALALRARELHS
         LN DSTNEK S+IQRALALRARELHS
Subjt:  RLNDDSTNEKTSLIQRALALRARELHS

A0A1S3C2M8 uncharacterized protein LOC103496169; e value: 4.7e-196; % identity: 91.94
Query:  FSQEFAASLSFLSVSRKRKRTNRSDHLELEPSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCN
        F + FAASL FLSVSRKRKRTN  DHLEL  SHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGC+
Subjt:  FSQEFAASLSFLSVSRKRKRTNRSDHLELEPSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCN

Query:  FSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGIPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFR
        FSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAG+PNCCGVVSCTRFKIIRNS+FYEDS+ATQLVVDSSSRILSIVAGFR
Subjt:  FSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGIPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFR

Query:  GDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
        G+KDD+TVLMSSTLFKDIEQGRLL++P VYLHGVAVN+YL G GEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
Subjt:  GDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF

Query:  KTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVETRLNDDSTNEKTSLIQRALALRARELHS
        KTAVAYIGACSILHNALLMREDFSAMADEWESL+S D RSQYVE  LN DSTNEK S+IQRALA RARELHS
Subjt:  KTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVETRLNDDSTNEKTSLIQRALALRARELHS

A0A5D3CRB2 Putative nuclease HARBI1; e value: 4.2e-221; % identity: 91.57
Query:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNRSDHLELEPSHGRVHHLFRTRTP
        MDSPRLAALLSSLISQL LLLFLLFPSSNP SLFSNST DS+FY N   LFTHFLFSQ+FAASL FLSVSRKRKRTN  DHLEL  SHGRVHHLFRTRTP
Subjt:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNRSDHLELEPSHGRVHHLFRTRTP

Query:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT
        DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGC+FSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELT
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT

Query:  SSAFEDLAGIPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE
        SSAFEDLAG+PNCCGVVSCTRFKIIRNS+FYEDS+ATQLVVDSSSRILSIVAGFRG+KDD+TVLMSSTLFKDIEQGRLL++P VYLHGVAVN+YL G GE
Subjt:  SSAFEDLAGIPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE

Query:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET
        YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S D RSQYVE 
Subjt:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET

Query:  RLNDDSTNEKTSLIQRALALRARELHS
         LN DSTNEK S+IQRALA RARELHS
Subjt:  RLNDDSTNEKTSLIQRALALRARELHS

A0A6J1FNZ2 protein ALP1-like; e value: 1.2e-188; % identity: 78.85
Query:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNRSDHLELEPS--------HGRVH
        MDS +LAALLSSLISQL LLL LLFPSSNP SL SNS+SDSNFY NLFPLF HFLFSQ+ AASLSFLSVSRKRKRT+ S+ LEL PS         GRV 
Subjt:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNRSDHLELEPS--------HGRVH

Query:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
        HL RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGC+FSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP

Query:  CPNELELTSSAFEDLAGIPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVN
        CP+ELELTSS+FED+AG+PNCCGV+SCT                            SIVAGFRGDKDD+TVLMS+TLFKDIE+ RLL +P VYLHG+AVN
Subjt:  CPNELELTSSAFEDLAGIPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVN

Query:  QYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD
        QYL GHGEYPLLPWL+VPFAGAVSGSTEESFNEAHRLMCIPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESLAS D
Subjt:  QYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD

Query:  QRSQYVETRLNDDSTNEKTSLIQRALALRARELHS
          SQYV   LN+DS +EK +++Q+ALALRARELH+
Subjt:  QRSQYVETRLNDDSTNEKTSLIQRALALRARELHS

A0A6J1J0M5 protein ALP1-like; e value: 2.1e-188; % identity: 79.31
Query:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNRSDHLELEPS--------HGRVH
        MDS +LAALLSSLISQL LLL LLFPSSNP SL SNS+SDSNFY NLFPLF HFLFSQ+ AASLSFLSVSRKRKRT+ S+ LEL PS         GRV 
Subjt:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNRSDHLELEPS--------HGRVH

Query:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
        HL RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGC+FSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP

Query:  CPNELELTSSAFEDLAGIPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVN
        CP+ELELTSSAFED+AG+PNCCGV+SCT                            SIVAGFRGDKDD+TVLMS+TLFKDIE+ RLL +P VYLHGVAVN
Subjt:  CPNELELTSSAFEDLAGIPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVN

Query:  QYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD
        QYL GHG+YPLLPWL+VPFAGAVSGSTEESFNEAHRLM IPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESLAS D
Subjt:  QYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD

Query:  QRSQYVETRLNDDSTNEKTSLIQRALALRARELHS
          SQYV   LN+DS +EK S+IQ+ALALRARELH+
Subjt:  QRSQYVETRLNDDSTNEKTSLIQRALALRARELHS

SwissProt top hits (e value, % identity, alignment)
Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1; e value: 3.1e-27; % identity: 28.33
Query:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNEL
        +F++ FR + +TF ++  L+   L  R P G        LSVE ++ + L RLA+G +  ++   FGV +S       +    L    +  + +P  + +
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNEL

Query:  ELTSSAFEDLAGIPNCCGVVSCTR----FKIIRNSNFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDA-PAVYLH
        E   S FE++ G+PNCCG +  T        ++ S+ + D     S+  Q V D   R L++V G+ G    + +L  S  FK  E  ++LD  P     
Subjt:  ELTSSAFEDLAGIPNCCGVVSCTR----FKIIRNSNFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDA-PAVYLH

Query:  GVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF
        G  + +Y++G   YPLLPWLI P        +  +FNE H  +   A  A   L+ +W +LS+ +   + +   + I  C +LHN ++   D+
Subjt:  GVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF

Q9M2U3 Protein ALP1-like; e value: 8.1e-28; % identity: 27.72
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+P  LS+  R+ V L RL +G + S I + FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWV

Query:  EFPCPNELELTSSAFEDLAGIPNCCGVVSCTRFKIIRNSNFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGR
            P++L+   S FE ++G+PNCCG +  T   I+ N    E             S+  Q VVD   R L ++AG+ G  +D  VL +S  +K +E+G+
Subjt:  EFPCPNELELTSSAFEDLAGIPNCCGVVSCTRFKIIRNSNFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGR

Query:  LLDAPAVYL-HGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLM
         L+   + L     + +Y++G   +PLLPWL+ P+ G  +   +  FN+ H      A  A+  L++ W +++  +    +  +   I  C +LHN ++ 
Subjt:  LLDAPAVYL-HGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLM

Query:  RED
         ED
Subjt:  RED

Arabidopsis top hits (e value, % identity, alignment)
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714); e value: 6.2e-15; % identity: 27.85
Query:  HFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGV-SESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAF
        +FRM+ STF  L  +L                S        ++RLA G ++  +  +FG  S S A      +C+++       ++ P P+        F
Subjt:  HFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGV-SESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAF

Query:  EDLAGIPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGEYPLL
             +PNC GVV   RF++       + SI  Q +VDS+ R + I AG+        +   + LF  I +  L  AP    +GV V +Y+LG    PLL
Subjt:  EDLAGIPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGEYPLL

Query:  PWLIVPFAGAVSGSTEESFNEAHRLMCIPALK----AIVSLR-NWGVLS---QPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQ
        PWL+ P+      S EESF E    +    L     A   +R  W +L    +P   EF   V   G   +LHN L+   D     D  E   +  +   
Subjt:  PWLIVPFAGAVSGSTEESFNEAHRLMCIPALK----AIVSLR-NWGVLS---QPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQ

Query:  YVETRLNDDSTNEKTS
          E R +DD   E  S
Subjt:  YVETRLNDDSTNEKTS

AT3G19120.1 PIF / Ping-Pong family of plant transposases; e value: 1.2e-18; % identity: 24.8
Query:  SSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNRSDHLELEPSHGRVHHL---------FRTRTPD------------SFRN
        S+ P SL S S++     P LF  FT        A+ LSFL+V+R    ++ S      PS      L         FR  T D             +R+
Subjt:  SSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNRSDHLELEPSHGRVHHL---------FRTRTPD------------SFRN

Query:  HFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTN-FRFWVEFPC-PNELELTSSA
         + ++   F  +   L+P +       S L L  +  + + L RLA GC+  T++ ++ +   +       + R+L T  +  +++ P     L  T+  
Subjt:  HFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTN-FRFWVEFPC-PNELELTSSA

Query:  FEDLAGIPNCCGVVSCTRFKIIRNS-----NFY-----EDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQ
        FE+L  +PN CG +  T  K+ R +     N Y      D++  Q+V D       +     G +DD++    S L+K +  G ++    + + G  V  
Subjt:  FEDLAGIPNCCGVVSCTRFKIIRNS-----NFY-----EDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQ

Query:  YLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSL--RNWGVLSQPIHEEFKTAVAYIGACSILHN
        Y++G   YPLL +L+ PF+   SG+  E+  +   +     +   + L    W +L Q ++     A   I AC +LHN
Subjt:  YLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSL--RNWGVLSQPIHEEFKTAVAYIGACSILHN

AT3G55350.1 PIF / Ping-Pong family of plant transposases; e value: 5.8e-29; % identity: 27.72
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+P  LS+  R+ V L RL +G + S I + FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWV

Query:  EFPCPNELELTSSAFEDLAGIPNCCGVVSCTRFKIIRNSNFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGR
            P++L+   S FE ++G+PNCCG +  T   I+ N    E             S+  Q VVD   R L ++AG+ G  +D  VL +S  +K +E+G+
Subjt:  EFPCPNELELTSSAFEDLAGIPNCCGVVSCTRFKIIRNSNFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGR

Query:  LLDAPAVYL-HGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLM
         L+   + L     + +Y++G   +PLLPWL+ P+ G  +   +  FN+ H      A  A+  L++ W +++  +    +  +   I  C +LHN ++ 
Subjt:  LLDAPAVYL-HGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLM

Query:  RED
         ED
Subjt:  RED

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912); e value: 2.2e-28; % identity: 28.33
Query:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNEL
        +F++ FR + +TF ++  L+   L  R P G        LSVE ++ + L RLA+G +  ++   FGV +S       +    L    +  + +P  + +
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNEL

Query:  ELTSSAFEDLAGIPNCCGVVSCTR----FKIIRNSNFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDA-PAVYLH
        E   S FE++ G+PNCCG +  T        ++ S+ + D     S+  Q V D   R L++V G+ G    + +L  S  FK  E  ++LD  P     
Subjt:  ELTSSAFEDLAGIPNCCGVVSCTR----FKIIRNSNFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDA-PAVYLH

Query:  GVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF
        G  + +Y++G   YPLLPWLI P        +  +FNE H  +   A  A   L+ +W +LS+ +   + +   + I  C +LHN ++   D+
Subjt:  GVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF

AT4G29780.1 unknown protein; e value: 4.9e-20; % identity: 26.73
Query:  DSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCR----VLCTNFRFWVEF
        D FR  FRM+ STF  +   L+  +       RD + +P       R+GV ++RLATG     +S++FG+  S       ++CR    VL   +  W   
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCR----VLCTNFRFWVEF

Query:  PCPNELELTSSAFEDLAGIPNCCGVVSCTRFKIIR---------NSNFYED------SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKD-IEQ
        P  +E+  T + FE +  IPN  G +  T   II          N    E       SI  Q VV++      +  G  G   D  +L  S+L +    +
Subjt:  PCPNELELTSSAFEDLAGIPNCCGVVSCTRFKIIR---------NSNFYED------SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKD-IEQ

Query:  GRLLDAPAVYLHGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPIHEEFKTAVAYIGACSILHNALLM
        G L D+            +++G+  +PL  +L+VP+       T+ +FNE+   +   A  A   L+  W  L +    + +     +GAC +LHN   M
Subjt:  GRLLDAPAVYLHGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPIHEEFKTAVAYIGACSILHNALLM

Query:  RED
        R++
Subjt:  RED


Sequences
CDS sequence
ATGGATTCTCCTCGATTGGCTGCTTTACTCTCTTCTTTGATCTCCCAACTCTTCCTCCTCCTCTTCCTCCTCTTTCCTTCCTCCAACCCACGTTCCCTTTTCTCCAATTC
CACTTCCGATTCCAATTTCTATCCCAATCTCTTCCCTCTCTTCACCCACTTCCTCTTTTCCCAAGAATTTGCCGCTTCCCTTTCCTTTCTCTCCGTTTCCCGCAAGAGGA
AGAGGACCAATCGCTCCGACCATCTCGAATTGGAGCCATCCCATGGACGAGTTCATCATCTGTTTCGGACTCGGACCCCTGATTCTTTCAGAAATCATTTCAGGATGACC
TCCTCAACGTTTGAATGGCTTTCTGGTTTGCTTGAGCCCCTTTTGGAGTGTCGCGACCCGGTGGGTTCGCCTCTCGATCTCTCCGTTGAGATTCGACTCGGTGTTGGTCT
CTATCGCCTCGCCACCGGCTGCAATTTCTCTACAATCTCGGACCAATTTGGCGTTTCGGAGTCGGTAGCGAGGTTCTGTGCCAAACAATTGTGTCGAGTTCTCTGTACTA
ATTTTCGCTTCTGGGTCGAATTCCCTTGCCCCAATGAGCTGGAATTAACATCCTCAGCTTTTGAAGATCTTGCTGGGATTCCGAATTGCTGTGGCGTTGTTTCTTGTACA
AGGTTCAAGATCATTAGAAATAGCAATTTTTATGAAGATAGCATCGCTACTCAACTTGTTGTTGATTCCTCGTCACGAATCCTTAGTATTGTTGCAGGATTTCGTGGCGA
TAAGGACGATACCACGGTGCTTATGTCCTCAACGCTGTTTAAAGACATTGAACAAGGAAGGCTTCTGGATGCTCCTGCGGTTTACCTTCATGGGGTGGCTGTGAATCAGT
ACTTGCTTGGACATGGCGAATACCCTTTGCTTCCATGGTTAATAGTGCCTTTTGCAGGAGCTGTTTCAGGGTCAACTGAAGAGAGCTTCAATGAAGCTCACCGATTGATG
TGCATTCCAGCTCTGAAAGCAATTGTTAGTTTGAGAAATTGGGGAGTTTTGAGCCAACCAATTCACGAGGAGTTCAAAACTGCTGTTGCTTACATTGGTGCTTGCTCAAT
TCTTCATAATGCTTTGTTGATGAGGGAGGATTTTTCTGCCATGGCTGATGAGTGGGAGAGCTTAGCTTCGTTTGATCAAAGATCTCAGTATGTTGAAACTAGATTGAATG
ATGATTCAACTAATGAAAAGACTTCTCTTATACAGAGGGCATTGGCTCTGAGAGCTAGAGAGCTTCACAGTTAA
mRNA sequence
ATGGATTCTCCTCGATTGGCTGCTTTACTCTCTTCTTTGATCTCCCAACTCTTCCTCCTCCTCTTCCTCCTCTTTCCTTCCTCCAACCCACGTTCCCTTTTCTCCAATTC
CACTTCCGATTCCAATTTCTATCCCAATCTCTTCCCTCTCTTCACCCACTTCCTCTTTTCCCAAGAATTTGCCGCTTCCCTTTCCTTTCTCTCCGTTTCCCGCAAGAGGA
AGAGGACCAATCGCTCCGACCATCTCGAATTGGAGCCATCCCATGGACGAGTTCATCATCTGTTTCGGACTCGGACCCCTGATTCTTTCAGAAATCATTTCAGGATGACC
TCCTCAACGTTTGAATGGCTTTCTGGTTTGCTTGAGCCCCTTTTGGAGTGTCGCGACCCGGTGGGTTCGCCTCTCGATCTCTCCGTTGAGATTCGACTCGGTGTTGGTCT
CTATCGCCTCGCCACCGGCTGCAATTTCTCTACAATCTCGGACCAATTTGGCGTTTCGGAGTCGGTAGCGAGGTTCTGTGCCAAACAATTGTGTCGAGTTCTCTGTACTA
ATTTTCGCTTCTGGGTCGAATTCCCTTGCCCCAATGAGCTGGAATTAACATCCTCAGCTTTTGAAGATCTTGCTGGGATTCCGAATTGCTGTGGCGTTGTTTCTTGTACA
AGGTTCAAGATCATTAGAAATAGCAATTTTTATGAAGATAGCATCGCTACTCAACTTGTTGTTGATTCCTCGTCACGAATCCTTAGTATTGTTGCAGGATTTCGTGGCGA
TAAGGACGATACCACGGTGCTTATGTCCTCAACGCTGTTTAAAGACATTGAACAAGGAAGGCTTCTGGATGCTCCTGCGGTTTACCTTCATGGGGTGGCTGTGAATCAGT
ACTTGCTTGGACATGGCGAATACCCTTTGCTTCCATGGTTAATAGTGCCTTTTGCAGGAGCTGTTTCAGGGTCAACTGAAGAGAGCTTCAATGAAGCTCACCGATTGATG
TGCATTCCAGCTCTGAAAGCAATTGTTAGTTTGAGAAATTGGGGAGTTTTGAGCCAACCAATTCACGAGGAGTTCAAAACTGCTGTTGCTTACATTGGTGCTTGCTCAAT
TCTTCATAATGCTTTGTTGATGAGGGAGGATTTTTCTGCCATGGCTGATGAGTGGGAGAGCTTAGCTTCGTTTGATCAAAGATCTCAGTATGTTGAAACTAGATTGAATG
ATGATTCAACTAATGAAAAGACTTCTCTTATACAGAGGGCATTGGCTCTGAGAGCTAGAGAGCTTCACAGTTAA
Protein sequence
MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNRSDHLELEPSHGRVHHLFRTRTPDSFRNHFRMT
SSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGIPNCCGVVSCT
RFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLM
CIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVETRLNDDSTNEKTSLIQRALALRARELHS
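The protein sequence above can be checked against the CDS by translating it codon by codon. The sketch below does this with the standard genetic code (NCBI translation table 1). Using that table for this nuclear gene is an assumption, and the `translate` helper is a hypothetical name, not a tool provided by the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 1.

    Whitespace and line breaks are removed first, so a wrapped sequence
    can be pasted in directly. Unknown codons become 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon reached
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGGATTCTCCTCGATTGGCT"))
```

Passing the full CDS text and comparing the result with the listed protein sequence (with line breaks removed) is a quick consistency check for a record like this one.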