; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC05G090820 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC05G090820
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionDDE Tnp4 domain-containing protein
Genome locationCiama_Chr05:11608754..11610037
RNA-Seq ExpressionCaUC05G090820
SyntenyCaUC05G090820
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037135.1 putative nuclease HARBI1 [Cucumis melo var. makuwa]9.3e-22392.27Show/hide
Query:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTP
        MDSPRLAALLSSLISQL LLLFLLFPSSNP SLFSNST DS+FY N   LFTHFLFSQ+FAASL FLSVSRKRKRTNP DHLEL  SHGRVHHLFRTRTP
Subjt:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTP

Query:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT
        DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGC+FSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELT
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT

Query:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE
        SSAFEDLAGLPNCCGVVSCTRFKIIRNS+FYEDS+ATQLVVDSSSRILSIVAGFRG+KDD+TVLMSSTLFKDIEQGRLL++P VYLHGVAVN+YL G GE
Subjt:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE

Query:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET
        YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S D RSQYVE 
Subjt:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET

Query:  RLNDDSTNEKASLIQRALALRARELHS
         LN DSTNEKAS+IQRALA RARELHS
Subjt:  RLNDDSTNEKASLIQRALALRARELHS

KGN57516.1 hypothetical protein Csa_011580 [Cucumis sativus]2.1e-22292.04Show/hide
Query:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTP
        MDSPRLAALLSSLISQL LLLFLLFPSSNP SLFSNS  DS+FY N   LF HFLFSQ+FAASL FLSVSRKRKRTN SDHLEL  SHGRVHHLFRTRTP
Subjt:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTP

Query:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT
        DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGC+FSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELT
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT

Query:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE
        SSAFEDLAGLPNCCGVVSCTRFKIIRNS+FYEDS+ATQLVVDSSSRILSIVAGFRG+KDD+TVLMSSTLFKDIEQGRLL++P VYLHGVAVN+YL GHGE
Subjt:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE

Query:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET
        YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S D +SQYVE 
Subjt:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET

Query:  RLNDDSTNEKASLIQRALALRARELHS
         LN DSTNEKAS+IQRALALRARELHS
Subjt:  RLNDDSTNEKASLIQRALALRARELHS

XP_008456140.1 PREDICTED: uncharacterized protein LOC103496169 [Cucumis melo]8.0e-19892.74Show/hide
Query:  FSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCN
        F + FAASL FLSVSRKRKRTNP DHLEL  SHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGC+
Subjt:  FSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCN

Query:  FSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFR
        FSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNS+FYEDS+ATQLVVDSSSRILSIVAGFR
Subjt:  FSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFR

Query:  GDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
        G+KDD+TVLMSSTLFKDIEQGRLL++P VYLHGVAVN+YL G GEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
Subjt:  GDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF

Query:  KTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVETRLNDDSTNEKASLIQRALALRARELHS
        KTAVAYIGACSILHNALLMREDFSAMADEWESL+S D RSQYVE  LN DSTNEKAS+IQRALA RARELHS
Subjt:  KTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVETRLNDDSTNEKASLIQRALALRARELHS

XP_023536005.1 protein ALP1-like [Cucurbita pepo subsp. pepo]3.2e-19180.23Show/hide
Query:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPS--------HGRVH
        MDS +LAALLSSLISQL LLL LLFPSSNP SL SNS+SDSNFY NLFPLF HFLFSQ+ AASLSFLSVSRKRKRT+ S+ LEL PS         GRV 
Subjt:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPS--------HGRVH

Query:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
        HL RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGC+FSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP

Query:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVN
        CP+ELELTSSAFED+AGLPNCCGV+SCT                            SIVAGFRGDKDD+TVLMS+TLFKDIE+GRLL +P VYLHG+AVN
Subjt:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVN

Query:  QYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD
        QYL GHGEYPLLPWL+VPFAGAVSGSTEESFNEAHRLMCIPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESLAS D
Subjt:  QYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD

Query:  QRSQYVETRLNDDSTNEKASLIQRALALRARELHS
          SQYV   LN+DS +EKAS+IQ+ALALRARELH+
Subjt:  QRSQYVETRLNDDSTNEKASLIQRALALRARELHS

XP_038880641.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like [Benincasa hispida]3.3e-20485.95Show/hide
Query:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTP
        MDSP+LAALLSSLISQL LLLFLLFPSSNP SLFSNST DSNFY N   LFTHFLFSQ+FAASL FLSVSRKRKRTNPSDHLEL  SHGR  HLFRTRTP
Subjt:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTP

Query:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT
        DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGC+FS IS QFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT

Query:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE
        SSAFEDLAGLPNCCGVV+CT                            SIVAGFRGDKDD+TVLMSSTLFKDIEQGRLLDAP VYLHGVAVNQYL GHGE
Subjt:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE

Query:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET
        YPLLPWL++PFAGAVSGSTEESFN+AHRLMCIPALKAIVSLRNWGVLS+P+HEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD RSQYVE 
Subjt:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET

Query:  RLNDDSTNEKASLIQRALALRARELHS
        +LN+DSTNEKAS+IQRALA+RARELHS
Subjt:  RLNDDSTNEKASLIQRALALRARELHS

TrEMBL top hitse value%identityAlignment
A0A0A0LBX6 DDE Tnp4 domain-containing protein1.0e-22292.04Show/hide
Query:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTP
        MDSPRLAALLSSLISQL LLLFLLFPSSNP SLFSNS  DS+FY N   LF HFLFSQ+FAASL FLSVSRKRKRTN SDHLEL  SHGRVHHLFRTRTP
Subjt:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTP

Query:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT
        DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGC+FSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELT
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT

Query:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE
        SSAFEDLAGLPNCCGVVSCTRFKIIRNS+FYEDS+ATQLVVDSSSRILSIVAGFRG+KDD+TVLMSSTLFKDIEQGRLL++P VYLHGVAVN+YL GHGE
Subjt:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE

Query:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET
        YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S D +SQYVE 
Subjt:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET

Query:  RLNDDSTNEKASLIQRALALRARELHS
         LN DSTNEKAS+IQRALALRARELHS
Subjt:  RLNDDSTNEKASLIQRALALRARELHS

A0A1S3C2M8 uncharacterized protein LOC1034961693.9e-19892.74Show/hide
Query:  FSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCN
        F + FAASL FLSVSRKRKRTNP DHLEL  SHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGC+
Subjt:  FSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCN

Query:  FSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFR
        FSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNS+FYEDS+ATQLVVDSSSRILSIVAGFR
Subjt:  FSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFR

Query:  GDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
        G+KDD+TVLMSSTLFKDIEQGRLL++P VYLHGVAVN+YL G GEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
Subjt:  GDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF

Query:  KTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVETRLNDDSTNEKASLIQRALALRARELHS
        KTAVAYIGACSILHNALLMREDFSAMADEWESL+S D RSQYVE  LN DSTNEKAS+IQRALA RARELHS
Subjt:  KTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVETRLNDDSTNEKASLIQRALALRARELHS

A0A5D3CRB2 Putative nuclease HARBI14.5e-22392.27Show/hide
Query:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTP
        MDSPRLAALLSSLISQL LLLFLLFPSSNP SLFSNST DS+FY N   LFTHFLFSQ+FAASL FLSVSRKRKRTNP DHLEL  SHGRVHHLFRTRTP
Subjt:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTP

Query:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT
        DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGC+FSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELT
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT

Query:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE
        SSAFEDLAGLPNCCGVVSCTRFKIIRNS+FYEDS+ATQLVVDSSSRILSIVAGFRG+KDD+TVLMSSTLFKDIEQGRLL++P VYLHGVAVN+YL G GE
Subjt:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE

Query:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET
        YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S D RSQYVE 
Subjt:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET

Query:  RLNDDSTNEKASLIQRALALRARELHS
         LN DSTNEKAS+IQRALA RARELHS
Subjt:  RLNDDSTNEKASLIQRALALRARELHS

A0A6J1FNZ2 protein ALP1-like1.1e-18979.31Show/hide
Query:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPS--------HGRVH
        MDS +LAALLSSLISQL LLL LLFPSSNP SL SNS+SDSNFY NLFPLF HFLFSQ+ AASLSFLSVSRKRKRT+ S+ LEL PS         GRV 
Subjt:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPS--------HGRVH

Query:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
        HL RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGC+FSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP

Query:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVN
        CP+ELELTSS+FED+AGLPNCCGV+SCT                            SIVAGFRGDKDD+TVLMS+TLFKDIE+ RLL +P VYLHG+AVN
Subjt:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVN

Query:  QYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD
        QYL GHGEYPLLPWL+VPFAGAVSGSTEESFNEAHRLMCIPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESLAS D
Subjt:  QYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD

Query:  QRSQYVETRLNDDSTNEKASLIQRALALRARELHS
          SQYV   LN+DS +EKA+++Q+ALALRARELH+
Subjt:  QRSQYVETRLNDDSTNEKASLIQRALALRARELHS

A0A6J1J0M5 protein ALP1-like1.9e-18979.77Show/hide
Query:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPS--------HGRVH
        MDS +LAALLSSLISQL LLL LLFPSSNP SL SNS+SDSNFY NLFPLF HFLFSQ+ AASLSFLSVSRKRKRT+ S+ LEL PS         GRV 
Subjt:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPS--------HGRVH

Query:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
        HL RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGC+FSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP

Query:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVN
        CP+ELELTSSAFED+AGLPNCCGV+SCT                            SIVAGFRGDKDD+TVLMS+TLFKDIE+ RLL +P VYLHGVAVN
Subjt:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVN

Query:  QYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD
        QYL GHG+YPLLPWL+VPFAGAVSGSTEESFNEAHRLM IPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESLAS D
Subjt:  QYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD

Query:  QRSQYVETRLNDDSTNEKASLIQRALALRARELHS
          SQYV   LN+DS +EKAS+IQ+ALALRARELH+
Subjt:  QRSQYVETRLNDDSTNEKASLIQRALALRARELHS

SwissProt top hitse value%identityAlignment
Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 11.8e-2728.67Show/hide
Query:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNEL
        +F++ FR + +TF ++  L+   L  R P G        LSVE ++ + L RLA+G +  ++   FGV +S       +    L    +  + +P  + +
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNEL

Query:  ELTSSAFEDLAGLPNCCGVVSCTR----FKIIRNSNFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDA-PAVYLH
        E   S FE++ GLPNCCG +  T        ++ S+ + D     S+  Q V D   R L++V G+ G    + +L  S  FK  E  ++LD  P     
Subjt:  ELTSSAFEDLAGLPNCCGVVSCTR----FKIIRNSNFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDA-PAVYLH

Query:  GVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF
        G  + +Y++G   YPLLPWLI P        +  +FNE H  +   A  A   L+ +W +LS+ +   + +   + I  C +LHN ++   D+
Subjt:  GVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF

Q9M2U3 Protein ALP1-like4.8e-2828.05Show/hide
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+P  LS+  R+ V L RL +G + S I + FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWV

Query:  EFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGR
            P++L+   S FE ++GLPNCCG +  T   I+ N    E             S+  Q VVD   R L ++AG+ G  +D  VL +S  +K +E+G+
Subjt:  EFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGR

Query:  LLDAPAVYL-HGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLM
         L+   + L     + +Y++G   +PLLPWL+ P+ G  +   +  FN+ H      A  A+  L++ W +++  +    +  +   I  C +LHN ++ 
Subjt:  LLDAPAVYL-HGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLM

Query:  RED
         ED
Subjt:  RED

Arabidopsis top hitse value%identityAlignment
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714)3.6e-1528.16Show/hide
Query:  HFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGV-SESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAF
        +FRM+ STF  L  +L                S        ++RLA G ++  +  +FG  S S A      +C+++       ++ P P+        F
Subjt:  HFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGV-SESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAF

Query:  EDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGEYPLL
             LPNC GVV   RF++       + SI  Q +VDS+ R + I AG+        +   + LF  I +  L  AP    +GV V +Y+LG    PLL
Subjt:  EDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGEYPLL

Query:  PWLIVPFAGAVSGSTEESFNEAHRLMCIPALK----AIVSLR-NWGVLS---QPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQ
        PWL+ P+      S EESF E    +    L     A   +R  W +L    +P   EF   V   G   +LHN L+   D     D  E   +  +   
Subjt:  PWLIVPFAGAVSGSTEESFNEAHRLMCIPALK----AIVSLR-NWGVLS---QPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQ

Query:  YVETRLNDDSTNEKAS
          E R +DD   E  S
Subjt:  YVETRLNDDSTNEKAS

AT3G19120.1 PIF / Ping-Pong family of plant transposases4.2e-1925.07Show/hide
Query:  SSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHL---------FRTRTPD------------SFRN
        S+ P SL S S++     P LF  FT        A+ LSFL+V+R    ++ S      PS      L         FR  T D             +R+
Subjt:  SSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHL---------FRTRTPD------------SFRN

Query:  HFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTN-FRFWVEFPC-PNELELTSSA
         + ++   F  +   L+P +       S L L  +  + + L RLA GC+  T++ ++ +   +       + R+L T  +  +++ P     L  T+  
Subjt:  HFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTN-FRFWVEFPC-PNELELTSSA

Query:  FEDLAGLPNCCGVVSCTRFKIIRNS-----NFY-----EDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQ
        FE+L  LPN CG +  T  K+ R +     N Y      D++  Q+V D       +     G +DD++    S L+K +  G ++    + + G  V  
Subjt:  FEDLAGLPNCCGVVSCTRFKIIRNS-----NFY-----EDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQ

Query:  YLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSL--RNWGVLSQPIHEEFKTAVAYIGACSILHN
        Y++G   YPLL +L+ PF+   SG+  E+  +   +     +   + L    W +L Q ++     A   I AC +LHN
Subjt:  YLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSL--RNWGVLSQPIHEEFKTAVAYIGACSILHN

AT3G55350.1 PIF / Ping-Pong family of plant transposases3.4e-2928.05Show/hide
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+P  LS+  R+ V L RL +G + S I + FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWV

Query:  EFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGR
            P++L+   S FE ++GLPNCCG +  T   I+ N    E             S+  Q VVD   R L ++AG+ G  +D  VL +S  +K +E+G+
Subjt:  EFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGR

Query:  LLDAPAVYL-HGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLM
         L+   + L     + +Y++G   +PLLPWL+ P+ G  +   +  FN+ H      A  A+  L++ W +++  +    +  +   I  C +LHN ++ 
Subjt:  LLDAPAVYL-HGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLM

Query:  RED
         ED
Subjt:  RED

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)1.3e-2828.67Show/hide
Query:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNEL
        +F++ FR + +TF ++  L+   L  R P G        LSVE ++ + L RLA+G +  ++   FGV +S       +    L    +  + +P  + +
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNEL

Query:  ELTSSAFEDLAGLPNCCGVVSCTR----FKIIRNSNFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDA-PAVYLH
        E   S FE++ GLPNCCG +  T        ++ S+ + D     S+  Q V D   R L++V G+ G    + +L  S  FK  E  ++LD  P     
Subjt:  ELTSSAFEDLAGLPNCCGVVSCTR----FKIIRNSNFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDA-PAVYLH

Query:  GVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF
        G  + +Y++G   YPLLPWLI P        +  +FNE H  +   A  A   L+ +W +LS+ +   + +   + I  C +LHN ++   D+
Subjt:  GVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF

AT4G29780.1 unknown protein6.4e-2026.4Show/hide
Query:  DSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCR----VLCTNFRFWVEF
        D FR  FRM+ STF  +   L+  +       RD + +P       R+GV ++RLATG     +S++FG+  S       ++CR    VL   +  W   
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCR----VLCTNFRFWVEF

Query:  PCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIR---------NSNFYED------SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKD-IEQ
        P  +E+  T + FE +  +PN  G +  T   II          N    E       SI  Q VV++      +  G  G   D  +L  S+L +    +
Subjt:  PCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIR---------NSNFYED------SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKD-IEQ

Query:  GRLLDAPAVYLHGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPIHEEFKTAVAYIGACSILHNALLM
        G L D+            +++G+  +PL  +L+VP+       T+ +FNE+   +   A  A   L+  W  L +    + +     +GAC +LHN   M
Subjt:  GRLLDAPAVYLHGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPIHEEFKTAVAYIGACSILHNALLM

Query:  RED
        R++
Subjt:  RED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCTCCTCGATTGGCTGCTTTACTCTCTTCTTTGATCTCCCAACTCTTCCTCCTCCTCTTCCTCCTCTTTCCTTCCTCCAACCCACGTTCCCTTTTCTCCAATTC
CACTTCCGATTCCAATTTCTATCCCAATCTCTTCCCTCTCTTCACCCACTTCCTCTTTTCCCAAGAATTTGCCGCTTCCCTTTCCTTTCTCTCCGTTTCCCGCAAGAGGA
AGAGGACCAATCCCTCCGACCATCTCGAATTGGAGCCATCCCATGGACGAGTTCATCATCTGTTTCGGACTCGGACCCCTGATTCTTTCAGAAATCATTTCAGGATGACC
TCCTCAACGTTTGAATGGCTTTCTGGTTTGCTTGAGCCCCTTTTGGAGTGTCGCGACCCGGTGGGTTCGCCTCTCGATCTCTCCGTTGAGATTCGACTCGGTGTTGGTCT
CTATCGCCTCGCCACCGGCTGCAATTTCTCTACAATCTCGGACCAATTTGGCGTTTCGGAGTCGGTAGCGAGGTTCTGTGCCAAACAATTGTGTCGAGTTCTCTGTACTA
ATTTTCGCTTCTGGGTCGAATTCCCTTGCCCCAATGAGCTGGAATTAACATCCTCAGCTTTTGAAGATCTTGCTGGGCTTCCGAATTGCTGTGGCGTTGTTTCTTGTACA
AGGTTCAAGATCATTAGAAATAGCAATTTTTATGAAGATAGCATCGCTACTCAACTTGTTGTTGATTCCTCGTCACGAATCCTTAGTATTGTTGCAGGATTTCGTGGCGA
TAAGGACGATACCACGGTGCTTATGTCCTCAACGCTGTTTAAAGACATTGAACAAGGAAGGCTTCTGGATGCTCCTGCGGTTTACCTTCATGGGGTGGCTGTGAATCAGT
ACTTGCTTGGACATGGCGAATACCCTTTGCTTCCATGGTTAATAGTGCCTTTTGCAGGAGCTGTTTCAGGGTCAACTGAAGAGAGCTTCAATGAAGCTCACCGATTGATG
TGCATTCCAGCTCTGAAAGCAATTGTTAGTTTGAGAAATTGGGGAGTTTTGAGCCAACCAATTCACGAGGAGTTCAAAACTGCTGTTGCTTACATTGGTGCTTGCTCAAT
TCTTCATAATGCTTTGTTGATGAGGGAGGATTTTTCTGCCATGGCTGATGAGTGGGAGAGCTTAGCTTCATTTGATCAAAGATCTCAGTATGTTGAAACTAGATTGAATG
ATGATTCAACTAATGAAAAGGCTTCTCTTATACAGAGGGCATTGGCTCTGAGAGCTAGAGAGCTTCACAGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATTCTCCTCGATTGGCTGCTTTACTCTCTTCTTTGATCTCCCAACTCTTCCTCCTCCTCTTCCTCCTCTTTCCTTCCTCCAACCCACGTTCCCTTTTCTCCAATTC
CACTTCCGATTCCAATTTCTATCCCAATCTCTTCCCTCTCTTCACCCACTTCCTCTTTTCCCAAGAATTTGCCGCTTCCCTTTCCTTTCTCTCCGTTTCCCGCAAGAGGA
AGAGGACCAATCCCTCCGACCATCTCGAATTGGAGCCATCCCATGGACGAGTTCATCATCTGTTTCGGACTCGGACCCCTGATTCTTTCAGAAATCATTTCAGGATGACC
TCCTCAACGTTTGAATGGCTTTCTGGTTTGCTTGAGCCCCTTTTGGAGTGTCGCGACCCGGTGGGTTCGCCTCTCGATCTCTCCGTTGAGATTCGACTCGGTGTTGGTCT
CTATCGCCTCGCCACCGGCTGCAATTTCTCTACAATCTCGGACCAATTTGGCGTTTCGGAGTCGGTAGCGAGGTTCTGTGCCAAACAATTGTGTCGAGTTCTCTGTACTA
ATTTTCGCTTCTGGGTCGAATTCCCTTGCCCCAATGAGCTGGAATTAACATCCTCAGCTTTTGAAGATCTTGCTGGGCTTCCGAATTGCTGTGGCGTTGTTTCTTGTACA
AGGTTCAAGATCATTAGAAATAGCAATTTTTATGAAGATAGCATCGCTACTCAACTTGTTGTTGATTCCTCGTCACGAATCCTTAGTATTGTTGCAGGATTTCGTGGCGA
TAAGGACGATACCACGGTGCTTATGTCCTCAACGCTGTTTAAAGACATTGAACAAGGAAGGCTTCTGGATGCTCCTGCGGTTTACCTTCATGGGGTGGCTGTGAATCAGT
ACTTGCTTGGACATGGCGAATACCCTTTGCTTCCATGGTTAATAGTGCCTTTTGCAGGAGCTGTTTCAGGGTCAACTGAAGAGAGCTTCAATGAAGCTCACCGATTGATG
TGCATTCCAGCTCTGAAAGCAATTGTTAGTTTGAGAAATTGGGGAGTTTTGAGCCAACCAATTCACGAGGAGTTCAAAACTGCTGTTGCTTACATTGGTGCTTGCTCAAT
TCTTCATAATGCTTTGTTGATGAGGGAGGATTTTTCTGCCATGGCTGATGAGTGGGAGAGCTTAGCTTCATTTGATCAAAGATCTCAGTATGTTGAAACTAGATTGAATG
ATGATTCAACTAATGAAAAGGCTTCTCTTATACAGAGGGCATTGGCTCTGAGAGCTAGAGAGCTTCACAGTTAA
Protein sequenceShow/hide protein sequence
MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSLFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTPDSFRNHFRMT
SSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCNFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCT
RFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLM
CIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVETRLNDDSTNEKASLIQRALALRARELHS