CuGenDBv2

CcUC05G092980 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC05G092980
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: DDE Tnp4 domain-containing protein
Genome location: CicolChr05:10962620..10963903
RNA-Seq Expression: CcUC05G092980
Synteny: CcUC05G092980
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component); GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR027806 - Harbinger transposase-derived nuclease domain


Homology
GenBank top hits    e-value    % identity    Alignment
KAA0037135.1 putative nuclease HARBI1 [Cucumis melo var. makuwa]    1.2e-222    92.27
Query:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSPFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTP
        MDSPRLAALLSSLISQL LLLFLLFPSSNP S FSNST DS+FY N   LFTHFLFSQ+FAASL FLSVSRKRKRTNP DHLEL  SHGRVHHLFRTRTP
Subjt:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSPFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTP

Query:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT
        DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELT
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT

Query:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE
        SSAFEDLAGLPNCCGVVSCTRFKIIRNS+FYEDS+ATQLVVDSSSRILSIVAGFRG+KDD+TVLMSSTLFKDIEQGRLL++P VYLHGVAVN+YL G GE
Subjt:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE

Query:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET
        YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S D RSQYVE 
Subjt:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET

Query:  RLNDDSTNEKASLIQRALALRARELHS
         LN DSTNEKAS+IQRALA RARELHS
Subjt:  RLNDDSTNEKASLIQRALALRARELHS

KGN57516.1 hypothetical protein Csa_011580 [Cucumis sativus]    2.7e-222    92.04
Query:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSPFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTP
        MDSPRLAALLSSLISQL LLLFLLFPSSNP S FSNS  DS+FY N   LF HFLFSQ+FAASL FLSVSRKRKRTN SDHLEL  SHGRVHHLFRTRTP
Subjt:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSPFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTP

Query:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT
        DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELT
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT

Query:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE
        SSAFEDLAGLPNCCGVVSCTRFKIIRNS+FYEDS+ATQLVVDSSSRILSIVAGFRG+KDD+TVLMSSTLFKDIEQGRLL++P VYLHGVAVN+YL GHGE
Subjt:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE

Query:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET
        YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S D +SQYVE 
Subjt:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET

Query:  RLNDDSTNEKASLIQRALALRARELHS
         LN DSTNEKAS+IQRALALRARELHS
Subjt:  RLNDDSTNEKASLIQRALALRARELHS

XP_008456140.1 PREDICTED: uncharacterized protein LOC103496169 [Cucumis melo]    2.1e-198    93.01
Query:  FSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD
        F + FAASL FLSVSRKRKRTNP DHLEL  SHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD
Subjt:  FSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD

Query:  FSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFR
        FSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNS+FYEDS+ATQLVVDSSSRILSIVAGFR
Subjt:  FSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFR

Query:  GDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
        G+KDD+TVLMSSTLFKDIEQGRLL++P VYLHGVAVN+YL G GEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
Subjt:  GDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF

Query:  KTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVETRLNDDSTNEKASLIQRALALRARELHS
        KTAVAYIGACSILHNALLMREDFSAMADEWESL+S D RSQYVE  LN DSTNEKAS+IQRALA RARELHS
Subjt:  KTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVETRLNDDSTNEKASLIQRALALRARELHS

XP_023536005.1 protein ALP1-like [Cucurbita pepo subsp. pepo]    3.2e-191    80.23
Query:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSPFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPS--------HGRVH
        MDS +LAALLSSLISQL LLL LLFPSSNP S  SNS+SDSNFY NLFPLF HFLFSQ+ AASLSFLSVSRKRKRT+ S+ LEL PS         GRV 
Subjt:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSPFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPS--------HGRVH

Query:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
        HL RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP

Query:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVN
        CP+ELELTSSAFED+AGLPNCCGV+SCT                            SIVAGFRGDKDD+TVLMS+TLFKDIE+GRLL +P VYLHG+AVN
Subjt:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVN

Query:  QYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD
        QYL GHGEYPLLPWL+VPFAGAVSGSTEESFNEAHRLMCIPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESLAS D
Subjt:  QYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD

Query:  QRSQYVETRLNDDSTNEKASLIQRALALRARELHS
          SQYV   LN+DS +EKAS+IQ+ALALRARELH+
Subjt:  QRSQYVETRLNDDSTNEKASLIQRALALRARELHS

XP_038880641.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like [Benincasa hispida]    3.3e-204    85.95
Query:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSPFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTP
        MDSP+LAALLSSLISQL LLLFLLFPSSNP S FSNST DSNFY N   LFTHFLFSQ+FAASL FLSVSRKRKRTNPSDHLEL  SHGR  HLFRTRTP
Subjt:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSPFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTP

Query:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT
        DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFS IS QFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT

Query:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE
        SSAFEDLAGLPNCCGVV+CT                            SIVAGFRGDKDD+TVLMSSTLFKDIEQGRLLDAP VYLHGVAVNQYL GHGE
Subjt:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE

Query:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET
        YPLLPWL++PFAGAVSGSTEESFN+AHRLMCIPALKAIVSLRNWGVLS+P+HEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD RSQYVE 
Subjt:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET

Query:  RLNDDSTNEKASLIQRALALRARELHS
        +LN+DSTNEKAS+IQRALA+RARELHS
Subjt:  RLNDDSTNEKASLIQRALALRARELHS

TrEMBL top hits    e-value    % identity    Alignment
A0A0A0LBX6 DDE Tnp4 domain-containing protein    1.3e-222    92.04
Query:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSPFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTP
        MDSPRLAALLSSLISQL LLLFLLFPSSNP S FSNS  DS+FY N   LF HFLFSQ+FAASL FLSVSRKRKRTN SDHLEL  SHGRVHHLFRTRTP
Subjt:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSPFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTP

Query:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT
        DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELT
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT

Query:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE
        SSAFEDLAGLPNCCGVVSCTRFKIIRNS+FYEDS+ATQLVVDSSSRILSIVAGFRG+KDD+TVLMSSTLFKDIEQGRLL++P VYLHGVAVN+YL GHGE
Subjt:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE

Query:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET
        YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S D +SQYVE 
Subjt:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET

Query:  RLNDDSTNEKASLIQRALALRARELHS
         LN DSTNEKAS+IQRALALRARELHS
Subjt:  RLNDDSTNEKASLIQRALALRARELHS

A0A1S3C2M8 uncharacterized protein LOC103496169    1.0e-198    93.01
Query:  FSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD
        F + FAASL FLSVSRKRKRTNP DHLEL  SHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD
Subjt:  FSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD

Query:  FSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFR
        FSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNS+FYEDS+ATQLVVDSSSRILSIVAGFR
Subjt:  FSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFR

Query:  GDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
        G+KDD+TVLMSSTLFKDIEQGRLL++P VYLHGVAVN+YL G GEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
Subjt:  GDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF

Query:  KTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVETRLNDDSTNEKASLIQRALALRARELHS
        KTAVAYIGACSILHNALLMREDFSAMADEWESL+S D RSQYVE  LN DSTNEKAS+IQRALA RARELHS
Subjt:  KTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVETRLNDDSTNEKASLIQRALALRARELHS

A0A5D3CRB2 Putative nuclease HARBI1    5.9e-223    92.27
Query:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSPFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTP
        MDSPRLAALLSSLISQL LLLFLLFPSSNP S FSNST DS+FY N   LFTHFLFSQ+FAASL FLSVSRKRKRTNP DHLEL  SHGRVHHLFRTRTP
Subjt:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSPFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTP

Query:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT
        DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELT
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELT

Query:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE
        SSAFEDLAGLPNCCGVVSCTRFKIIRNS+FYEDS+ATQLVVDSSSRILSIVAGFRG+KDD+TVLMSSTLFKDIEQGRLL++P VYLHGVAVN+YL G GE
Subjt:  SSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGE

Query:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET
        YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S D RSQYVE 
Subjt:  YPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVET

Query:  RLNDDSTNEKASLIQRALALRARELHS
         LN DSTNEKAS+IQRALA RARELHS
Subjt:  RLNDDSTNEKASLIQRALALRARELHS

A0A6J1FNZ2 protein ALP1-like    1.1e-189    79.31
Query:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSPFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPS--------HGRVH
        MDS +LAALLSSLISQL LLL LLFPSSNP S  SNS+SDSNFY NLFPLF HFLFSQ+ AASLSFLSVSRKRKRT+ S+ LEL PS         GRV 
Subjt:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSPFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPS--------HGRVH

Query:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
        HL RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP

Query:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVN
        CP+ELELTSS+FED+AGLPNCCGV+SCT                            SIVAGFRGDKDD+TVLMS+TLFKDIE+ RLL +P VYLHG+AVN
Subjt:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVN

Query:  QYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD
        QYL GHGEYPLLPWL+VPFAGAVSGSTEESFNEAHRLMCIPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESLAS D
Subjt:  QYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD

Query:  QRSQYVETRLNDDSTNEKASLIQRALALRARELHS
          SQYV   LN+DS +EKA+++Q+ALALRARELH+
Subjt:  QRSQYVETRLNDDSTNEKASLIQRALALRARELHS

A0A6J1J0M5 protein ALP1-like    1.9e-189    79.77
Query:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSPFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPS--------HGRVH
        MDS +LAALLSSLISQL LLL LLFPSSNP S  SNS+SDSNFY NLFPLF HFLFSQ+ AASLSFLSVSRKRKRT+ S+ LEL PS         GRV 
Subjt:  MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSPFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPS--------HGRVH

Query:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
        HL RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFP

Query:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVN
        CP+ELELTSSAFED+AGLPNCCGV+SCT                            SIVAGFRGDKDD+TVLMS+TLFKDIE+ RLL +P VYLHGVAVN
Subjt:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVN

Query:  QYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD
        QYL GHG+YPLLPWL+VPFAGAVSGSTEESFNEAHRLM IPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESLAS D
Subjt:  QYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD

Query:  QRSQYVETRLNDDSTNEKASLIQRALALRARELHS
          SQYV   LN+DS +EKAS+IQ+ALALRARELH+
Subjt:  QRSQYVETRLNDDSTNEKASLIQRALALRARELHS

SwissProt top hits    e-value    % identity    Alignment
Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1    2.4e-27    28.67
Query:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNEL
        +F++ FR + +TF ++  L+   L  R P G        LSVE ++ + L RLA+G    ++   FGV +S       +    L    +  + +P  + +
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNEL

Query:  ELTSSAFEDLAGLPNCCGVVSCTR----FKIIRNSNFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDA-PAVYLH
        E   S FE++ GLPNCCG +  T        ++ S+ + D     S+  Q V D   R L++V G+ G    + +L  S  FK  E  ++LD  P     
Subjt:  ELTSSAFEDLAGLPNCCGVVSCTR----FKIIRNSNFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDA-PAVYLH

Query:  GVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF
        G  + +Y++G   YPLLPWLI P        +  +FNE H  +   A  A   L+ +W +LS+ +   + +   + I  C +LHN ++   D+
Subjt:  GVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF

Q9M2U3 Protein ALP1-like    6.2e-28    28.05
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+P  LS+  R+ V L RL +G   S I + FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWV

Query:  EFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGR
            P++L+   S FE ++GLPNCCG +  T   I+ N    E             S+  Q VVD   R L ++AG+ G  +D  VL +S  +K +E+G+
Subjt:  EFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGR

Query:  LLDAPAVYL-HGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLM
         L+   + L     + +Y++G   +PLLPWL+ P+ G  +   +  FN+ H      A  A+  L++ W +++  +    +  +   I  C +LHN ++ 
Subjt:  LLDAPAVYL-HGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLM

Query:  RED
         ED
Subjt:  RED

Arabidopsis top hits    e-value    % identity    Alignment
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714)    3.6e-15    28.16
Query:  HFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGV-SESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAF
        +FRM+ STF  L  +L                S        ++RLA G  +  +  +FG  S S A      +C+++       ++ P P+        F
Subjt:  HFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGV-SESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAF

Query:  EDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGEYPLL
             LPNC GVV   RF++       + SI  Q +VDS+ R + I AG+        +   + LF  I +  L  AP    +GV V +Y+LG    PLL
Subjt:  EDLAGLPNCCGVVSCTRFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGEYPLL

Query:  PWLIVPFAGAVSGSTEESFNEAHRLMCIPALK----AIVSLR-NWGVLS---QPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQ
        PWL+ P+      S EESF E    +    L     A   +R  W +L    +P   EF   V   G   +LHN L+   D     D  E   +  +   
Subjt:  PWLIVPFAGAVSGSTEESFNEAHRLMCIPALK----AIVSLR-NWGVLS---QPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQ

Query:  YVETRLNDDSTNEKAS
          E R +DD   E  S
Subjt:  YVETRLNDDSTNEKAS

AT3G19120.1 PIF / Ping-Pong family of plant transposases    4.6e-18    24.3
Query:  SLISQLFLLLFLLFPSSNPRSPFSNSTSDSNFYPNLFPLFTHF-LFSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHL---------FRTRTPD
        +++S L  L   L P+S   S  S S+  S    +L    +   L     A+ LSFL+V+R    ++ S      PS      L         FR  T D
Subjt:  SLISQLFLLLFLLFPSSNPRSPFSNSTSDSNFYPNLFPLFTHF-LFSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHL---------FRTRTPD

Query:  ------------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTN-FRFW
                     +R+ + ++   F  +   L+P +       S L L  +  + + L RLA GC   T++ ++ +   +       + R+L T  +  +
Subjt:  ------------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTN-FRFW

Query:  VEFPC-PNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNS-----NFY-----EDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGR
        ++ P     L  T+  FE+L  LPN CG +  T  K+ R +     N Y      D++  Q+V D       +     G +DD++    S L+K +  G 
Subjt:  VEFPC-PNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNS-----NFY-----EDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGR

Query:  LLDAPAVYLHGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSL--RNWGVLSQPIHEEFKTAVAYIGACSILHN
        ++    + + G  V  Y++G   YPLL +L+ PF+   SG+  E+  +   +     +   + L    W +L Q ++     A   I AC +LHN
Subjt:  LLDAPAVYLHGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSL--RNWGVLSQPIHEEFKTAVAYIGACSILHN

AT3G55350.1 PIF / Ping-Pong family of plant transposases    4.4e-29    28.05
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+P  LS+  R+ V L RL +G   S I + FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVA-----RFCAKQLCRVLCTNFRFWV

Query:  EFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGR
            P++L+   S FE ++GLPNCCG +  T   I+ N    E             S+  Q VVD   R L ++AG+ G  +D  VL +S  +K +E+G+
Subjt:  EFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSNFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGR

Query:  LLDAPAVYL-HGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLM
         L+   + L     + +Y++G   +PLLPWL+ P+ G  +   +  FN+ H      A  A+  L++ W +++  +    +  +   I  C +LHN ++ 
Subjt:  LLDAPAVYL-HGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLM

Query:  RED
         ED
Subjt:  RED

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)    1.7e-28    28.67
Query:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNEL
        +F++ FR + +TF ++  L+   L  R P G        LSVE ++ + L RLA+G    ++   FGV +S       +    L    +  + +P  + +
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNEL

Query:  ELTSSAFEDLAGLPNCCGVVSCTR----FKIIRNSNFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDA-PAVYLH
        E   S FE++ GLPNCCG +  T        ++ S+ + D     S+  Q V D   R L++V G+ G    + +L  S  FK  E  ++LD  P     
Subjt:  ELTSSAFEDLAGLPNCCGVVSCTR----FKIIRNSNFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDA-PAVYLH

Query:  GVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF
        G  + +Y++G   YPLLPWLI P        +  +FNE H  +   A  A   L+ +W +LS+ +   + +   + I  C +LHN ++   D+
Subjt:  GVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF

AT4G29780.1 unknown protein    4.9e-20    26.4
Query:  DSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCR----VLCTNFRFWVEF
        D FR  FRM+ STF  +   L+  +       RD + +P       R+GV ++RLATG     +S++FG+  S       ++CR    VL   +  W   
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCR----VLCTNFRFWVEF

Query:  PCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIR---------NSNFYED------SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKD-IEQ
        P  +E+  T + FE +  +PN  G +  T   II          N    E       SI  Q VV++      +  G  G   D  +L  S+L +    +
Subjt:  PCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIR---------NSNFYED------SIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKD-IEQ

Query:  GRLLDAPAVYLHGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPIHEEFKTAVAYIGACSILHNALLM
        G L D+            +++G+  +PL  +L+VP+       T+ +FNE+   +   A  A   L+  W  L +    + +     +GAC +LHN   M
Subjt:  GRLLDAPAVYLHGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPIHEEFKTAVAYIGACSILHNALLM

Query:  RED
        R++
Subjt:  RED


Sequences
CDS sequence
ATGGATTCTCCTCGATTGGCTGCTTTACTCTCTTCTTTGATCTCCCAACTCTTCCTCCTCCTCTTCCTCCTCTTCCCTTCCTCCAACCCACGTTCCCCTTTCTCCAATTC
CACTTCCGATTCCAATTTCTATCCCAATCTCTTCCCTCTCTTCACCCACTTCCTCTTTTCCCAAGAATTTGCCGCTTCCCTTTCCTTTCTCTCCGTTTCCCGCAAGAGGA
AGAGGACCAATCCCTCCGACCATCTCGAATTGGAGCCATCCCATGGACGAGTTCATCATCTGTTTCGGACTCGGACCCCTGATTCTTTCAGAAATCATTTCAGGATGACC
TCCTCAACGTTTGAATGGCTTTCTGGTTTGCTTGAGCCCCTTTTGGAGTGTCGCGACCCGGTGGGTTCGCCTCTCGATCTCTCCGTTGAGATTCGACTCGGTGTTGGTCT
CTATCGCCTCGCCACCGGCTGCGATTTCTCTACAATCTCGGACCAATTTGGCGTTTCGGAGTCGGTAGCGAGGTTCTGTGCCAAACAATTGTGTCGAGTTCTCTGTACTA
ATTTTCGCTTCTGGGTCGAATTCCCTTGCCCCAATGAGCTGGAATTAACATCCTCAGCTTTTGAAGATCTTGCTGGGCTTCCGAATTGCTGTGGCGTTGTATCTTGTACA
AGGTTCAAGATCATTAGAAATAGCAATTTTTATGAAGATAGCATCGCTACTCAACTTGTTGTTGATTCCTCGTCACGAATCCTTAGTATTGTTGCAGGATTTCGTGGCGA
TAAGGACGATACCACGGTGCTTATGTCCTCAACGCTGTTTAAAGACATTGAACAAGGAAGGCTTCTGGATGCTCCTGCGGTTTACCTTCATGGGGTGGCTGTGAATCAGT
ACTTGCTTGGACATGGTGAATACCCTTTGCTTCCATGGTTAATAGTGCCTTTTGCAGGAGCTGTTTCAGGGTCAACTGAAGAGAGCTTCAATGAAGCTCACCGATTGATG
TGCATTCCAGCTCTGAAAGCAATTGTTAGTTTGAGAAATTGGGGAGTTTTGAGCCAACCAATTCACGAGGAGTTCAAAACTGCTGTTGCTTACATTGGTGCTTGCTCAAT
TCTTCATAATGCTTTGTTGATGAGGGAGGATTTTTCTGCCATGGCTGATGAGTGGGAGAGCTTAGCTTCATTTGATCAAAGATCTCAGTACGTTGAAACTAGATTGAATG
ATGATTCAACTAATGAAAAGGCTTCTCTTATTCAGAGGGCATTGGCTCTGAGAGCCAGAGAGCTTCACAGTTAA
mRNA sequence
ATGGATTCTCCTCGATTGGCTGCTTTACTCTCTTCTTTGATCTCCCAACTCTTCCTCCTCCTCTTCCTCCTCTTCCCTTCCTCCAACCCACGTTCCCCTTTCTCCAATTC
CACTTCCGATTCCAATTTCTATCCCAATCTCTTCCCTCTCTTCACCCACTTCCTCTTTTCCCAAGAATTTGCCGCTTCCCTTTCCTTTCTCTCCGTTTCCCGCAAGAGGA
AGAGGACCAATCCCTCCGACCATCTCGAATTGGAGCCATCCCATGGACGAGTTCATCATCTGTTTCGGACTCGGACCCCTGATTCTTTCAGAAATCATTTCAGGATGACC
TCCTCAACGTTTGAATGGCTTTCTGGTTTGCTTGAGCCCCTTTTGGAGTGTCGCGACCCGGTGGGTTCGCCTCTCGATCTCTCCGTTGAGATTCGACTCGGTGTTGGTCT
CTATCGCCTCGCCACCGGCTGCGATTTCTCTACAATCTCGGACCAATTTGGCGTTTCGGAGTCGGTAGCGAGGTTCTGTGCCAAACAATTGTGTCGAGTTCTCTGTACTA
ATTTTCGCTTCTGGGTCGAATTCCCTTGCCCCAATGAGCTGGAATTAACATCCTCAGCTTTTGAAGATCTTGCTGGGCTTCCGAATTGCTGTGGCGTTGTATCTTGTACA
AGGTTCAAGATCATTAGAAATAGCAATTTTTATGAAGATAGCATCGCTACTCAACTTGTTGTTGATTCCTCGTCACGAATCCTTAGTATTGTTGCAGGATTTCGTGGCGA
TAAGGACGATACCACGGTGCTTATGTCCTCAACGCTGTTTAAAGACATTGAACAAGGAAGGCTTCTGGATGCTCCTGCGGTTTACCTTCATGGGGTGGCTGTGAATCAGT
ACTTGCTTGGACATGGTGAATACCCTTTGCTTCCATGGTTAATAGTGCCTTTTGCAGGAGCTGTTTCAGGGTCAACTGAAGAGAGCTTCAATGAAGCTCACCGATTGATG
TGCATTCCAGCTCTGAAAGCAATTGTTAGTTTGAGAAATTGGGGAGTTTTGAGCCAACCAATTCACGAGGAGTTCAAAACTGCTGTTGCTTACATTGGTGCTTGCTCAAT
TCTTCATAATGCTTTGTTGATGAGGGAGGATTTTTCTGCCATGGCTGATGAGTGGGAGAGCTTAGCTTCATTTGATCAAAGATCTCAGTACGTTGAAACTAGATTGAATG
ATGATTCAACTAATGAAAAGGCTTCTCTTATTCAGAGGGCATTGGCTCTGAGAGCCAGAGAGCTTCACAGTTAA
Protein sequence
MDSPRLAALLSSLISQLFLLLFLLFPSSNPRSPFSNSTSDSNFYPNLFPLFTHFLFSQEFAASLSFLSVSRKRKRTNPSDHLELEPSHGRVHHLFRTRTPDSFRNHFRMT
SSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCT
RFKIIRNSNFYEDSIATQLVVDSSSRILSIVAGFRGDKDDTTVLMSSTLFKDIEQGRLLDAPAVYLHGVAVNQYLLGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLM
CIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDQRSQYVETRLNDDSTNEKASLIQRALALRARELHS