; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0004404 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0004404
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionDDE Tnp4 domain-containing protein
Genome locationchr06:11400498..11402231
RNA-Seq ExpressionPay0004404
SyntenyPay0004404
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037135.1 putative nuclease HARBI1 [Cucumis melo var. makuwa]2.1e-243100Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSF
        MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSF
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA

Query:  FEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPL
        FEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPL
Subjt:  FEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPL

Query:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLN
        LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLN
Subjt:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLN

Query:  VDSTNEKASVIQRALAQRARELHS
        VDSTNEKASVIQRALAQRARELHS
Subjt:  VDSTNEKASVIQRALAQRARELHS

KGN57516.1 hypothetical protein Csa_011580 [Cucumis sativus]1.6e-23898.35Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSF
        MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNS PDSSFYANLF HFLFSQDFAASLPFLSVSRKRKRTN  DHLELGSSHGRVHHLFRTRTPDSF
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA

Query:  FEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPL
        FEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFG GEYPL
Subjt:  FEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPL

Query:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLN
        LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDH+SQYVEAGLN
Subjt:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLN

Query:  VDSTNEKASVIQRALAQRARELHS
        VDSTNEKASVIQRALA RARELHS
Subjt:  VDSTNEKASVIQRALAQRARELHS

XP_008456140.1 PREDICTED: uncharacterized protein LOC103496169 [Cucumis melo]2.3e-21399.19Show/hide
Query:  FSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD
        F + FAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD
Subjt:  FSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD

Query:  FSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFR
        FSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFR
Subjt:  FSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFR

Query:  GNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
        GNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
Subjt:  GNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF

Query:  KTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLNVDSTNEKASVIQRALAQRARELHS
        KTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLNVDSTNEKASVIQRALAQRARELHS
Subjt:  KTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLNVDSTNEKASVIQRALAQRARELHS

XP_023536005.1 protein ALP1-like [Cucurbita pepo subsp. pepo]8.8e-18979.77Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYAN---LFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSS--------HGRVH
        MDS +LAALLSSLISQLLLLL LLFPSSNPHSL SNS+ DS+FYAN   LF HFLFSQ  AASL FLSVSRKRKRT+  + LELG S         GRV 
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYAN---LFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSS--------HGRVH

Query:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP
        HL RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP

Query:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVN
        CP+ELELTSSAFED+AGLPNCCGV+SCT                            SIVAGFRG+KDDSTVLMS+TLFKDIE+GRLL SPPVYLHG+AVN
Subjt:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVN

Query:  KYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLD
        +YLFG GEYPLLPWL+VPFAGAVSGSTEESFNEAHRLMCIPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESL+SLD
Subjt:  KYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLD

Query:  HRSQYVEAGLNVDSTNEKASVIQRALAQRARELHS
        H SQYV  GLN DS +EKAS+IQ+ALA RARELH+
Subjt:  HRSQYVEAGLNVDSTNEKASVIQRALAQRARELHS

XP_038880641.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like [Benincasa hispida]1.3e-20887.26Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSF
        MDSP+LAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDS+FYANLFTHFLFSQDFAASLPFLSVSRKRKRTNP DHLELGSSHGR  HLFRTRTPDSF
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFS IS QFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA

Query:  FEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPL
        FEDLAGLPNCCGVV+CT                            SIVAGFRG+KDDSTVLMSSTLFKDIEQGRLL++PPVYLHGVAVN+YLFG GEYPL
Subjt:  FEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPL

Query:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLN
        LPWL++PFAGAVSGSTEESFN+AHRLMCIPALKAIVSLRNWGVLS+P+HEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S DHRSQYVE  LN
Subjt:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLN

Query:  VDSTNEKASVIQRALAQRARELHS
         DSTNEKAS+IQRALA RARELHS
Subjt:  VDSTNEKASVIQRALAQRARELHS

TrEMBL top hitse value%identityAlignment
A0A0A0LBX6 DDE Tnp4 domain-containing protein7.6e-23998.35Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSF
        MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNS PDSSFYANLF HFLFSQDFAASLPFLSVSRKRKRTN  DHLELGSSHGRVHHLFRTRTPDSF
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA

Query:  FEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPL
        FEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFG GEYPL
Subjt:  FEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPL

Query:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLN
        LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDH+SQYVEAGLN
Subjt:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLN

Query:  VDSTNEKASVIQRALAQRARELHS
        VDSTNEKASVIQRALA RARELHS
Subjt:  VDSTNEKASVIQRALAQRARELHS

A0A1S3C2M8 uncharacterized protein LOC1034961691.1e-21399.19Show/hide
Query:  FSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD
        F + FAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD
Subjt:  FSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD

Query:  FSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFR
        FSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFR
Subjt:  FSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFR

Query:  GNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
        GNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
Subjt:  GNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF

Query:  KTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLNVDSTNEKASVIQRALAQRARELHS
        KTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLNVDSTNEKASVIQRALAQRARELHS
Subjt:  KTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLNVDSTNEKASVIQRALAQRARELHS

A0A5D3CRB2 Putative nuclease HARBI11.0e-243100Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSF
        MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSF
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA

Query:  FEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPL
        FEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPL
Subjt:  FEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPL

Query:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLN
        LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLN
Subjt:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLN

Query:  VDSTNEKASVIQRALAQRARELHS
        VDSTNEKASVIQRALAQRARELHS
Subjt:  VDSTNEKASVIQRALAQRARELHS

A0A6J1FNZ2 protein ALP1-like3.0e-18778.85Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYAN---LFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSS--------HGRVH
        MDS +LAALLSSLISQLLLLL LLFPSSNPHSL SNS+ DS+FYAN   LF HFLFSQ  AASL FLSVSRKRKRT+  + LELG S         GRV 
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYAN---LFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSS--------HGRVH

Query:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP
        HL RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP

Query:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVN
        CP+ELELTSS+FED+AGLPNCCGV+SCT                            SIVAGFRG+KDDSTVLMS+TLFKDIE+ RLL SPPVYLHG+AVN
Subjt:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVN

Query:  KYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLD
        +YLFG GEYPLLPWL+VPFAGAVSGSTEESFNEAHRLMCIPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESL+SLD
Subjt:  KYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLD

Query:  HRSQYVEAGLNVDSTNEKASVIQRALAQRARELHS
        H SQYV  GLN DS +EKA+++Q+ALA RARELH+
Subjt:  HRSQYVEAGLNVDSTNEKASVIQRALAQRARELHS

A0A6J1J0M5 protein ALP1-like5.2e-18779.31Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYAN---LFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSS--------HGRVH
        MDS +LAALLSSLISQLLLLL LLFPSSNPHSL SNS+ DS+FYAN   LF HFLFSQ  AASL FLSVSRKRKRT+  + LELG S         GRV 
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYAN---LFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSS--------HGRVH

Query:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP
        HL RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP

Query:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVN
        CP+ELELTSSAFED+AGLPNCCGV+SCT                            SIVAGFRG+KDDSTVLMS+TLFKDIE+ RLL SPPVYLHGVAVN
Subjt:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVN

Query:  KYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLD
        +YLFG G+YPLLPWL+VPFAGAVSGSTEESFNEAHRLM IPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESL+SLD
Subjt:  KYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLD

Query:  HRSQYVEAGLNVDSTNEKASVIQRALAQRARELHS
        H SQYV  GLN DS +EKAS+IQ+ALA RARELH+
Subjt:  HRSQYVEAGLNVDSTNEKASVIQRALAQRARELHS

SwissProt top hitse value%identityAlignment
Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 12.3e-2729.01Show/hide
Query:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNEL
        +F++ FR + +TF ++  L+   L  R P G        LSVE ++ + L RLA+G    ++   FGV +S     + +    L    +  + +P  + +
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNEL

Query:  ELTSSAFEDLAGLPNCCGVVSCTR----FKIIRNSHFYED-----SVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYL-H
        E   S FE++ GLPNCCG +  T        ++ S  + D     S+  Q V D   R L++V G+ G    S +L  S  FK  E  ++L+  P  L  
Subjt:  ELTSSAFEDLAGLPNCCGVVSCTR----FKIIRNSHFYED-----SVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYL-H

Query:  GVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF
        G  + +Y+ G   YPLLPWLI P        +  +FNE H  +   A  A   L+ +W +LS+ +   + +   + I  C +LHN ++   D+
Subjt:  GVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF

Q9M2U3 Protein ALP1-like9.5e-2928.38Show/hide
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVA-----RFCSKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+P  LS+  R+ V L RL +G   S I + FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVA-----RFCSKQLCRVLCTNFRFWV

Query:  EFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYED------------SVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGR
            P++L+   S FE ++GLPNCCG +  T   I+ N    E             S+  Q VVD   R L ++AG+ G+ +D  VL +S  +K +E+G+
Subjt:  EFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYED------------SVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGR

Query:  LLNSPPVYL-HGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLM
         LN   + L     + +Y+ G   +PLLPWL+ P+ G  +   +  FN+ H      A  A+  L++ W +++  +    +  +   I  C +LHN ++ 
Subjt:  LLNSPPVYL-HGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLM

Query:  RED
         ED
Subjt:  RED

Arabidopsis top hitse value%identityAlignment
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714)1.1e-1427.68Show/hide
Query:  HFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGV-SESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAF
        +FRM+ STF  L  +L                S        ++RLA G  +  +  +FG  S S A      +C+++       ++ P P+        F
Subjt:  HFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGV-SESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAF

Query:  EDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYL-HGVAVNKYLFGRGEYPL
             LPNC GVV   RF++       + S+  Q +VDS+ R + I AG+        +   + LF   E+  +L+  P  L +GV V +Y+ G    PL
Subjt:  EDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYL-HGVAVNKYLFGRGEYPL

Query:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALK----AIVSLR-NWGVLS---QPIHEEFKTAVAYIGACSILHNALLMREDFSAMADE
        LPWL+ P+      S EESF E    +    L     A   +R  W +L    +P   EF   V   G   +LHN L+   D     +E
Subjt:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALK----AIVSLR-NWGVLS---QPIHEEFKTAVAYIGACSILHNALLMREDFSAMADE

AT3G19120.1 PIF / Ping-Pong family of plant transposases7.0e-1924.3Show/hide
Query:  SLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHF----LFSQDFAASLPFLSVSRKRKRT---------NPPDHLELGSSHGRVHHLFRTRTPD
        +++S LL L   L P+S   S  S S+  S+  ++L +      L     A+ L FL+V+R    +         +PP  L  G         FR  T D
Subjt:  SLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHF----LFSQDFAASLPFLSVSRKRKRT---------NPPDHLELGSSHGRVHHLFRTRTPD

Query:  ------------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTN-FRFW
                     +R+ + ++   F  +   L+P +       S L L  +  + + L RLA GC   T++ ++ +   +    +  + R+L T  +  +
Subjt:  ------------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTN-FRFW

Query:  VEFPC-PNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNS----------HFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGR
        ++ P     L  T+  FE+L  LPN CG +  T  K+ R +           +  D+V  Q+V D       +     G +DDS+    S L+K +  G 
Subjt:  VEFPC-PNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNS----------HFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGR

Query:  LLNSPPVYLHGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSL--RNWGVLSQPIHEEFKTAVAYIGACSILHN
        ++    + + G  V  Y+ G   YPLL +L+ PF+   SG+  E+  +   +     +   + L    W +L Q ++     A   I AC +LHN
Subjt:  LLNSPPVYLHGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSL--RNWGVLSQPIHEEFKTAVAYIGACSILHN

AT3G55350.1 PIF / Ping-Pong family of plant transposases6.8e-3028.38Show/hide
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVA-----RFCSKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+P  LS+  R+ V L RL +G   S I + FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVA-----RFCSKQLCRVLCTNFRFWV

Query:  EFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYED------------SVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGR
            P++L+   S FE ++GLPNCCG +  T   I+ N    E             S+  Q VVD   R L ++AG+ G+ +D  VL +S  +K +E+G+
Subjt:  EFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYED------------SVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGR

Query:  LLNSPPVYL-HGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLM
         LN   + L     + +Y+ G   +PLLPWL+ P+ G  +   +  FN+ H      A  A+  L++ W +++  +    +  +   I  C +LHN ++ 
Subjt:  LLNSPPVYL-HGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLM

Query:  RED
         ED
Subjt:  RED

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)1.7e-2829.01Show/hide
Query:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNEL
        +F++ FR + +TF ++  L+   L  R P G        LSVE ++ + L RLA+G    ++   FGV +S     + +    L    +  + +P  + +
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNEL

Query:  ELTSSAFEDLAGLPNCCGVVSCTR----FKIIRNSHFYED-----SVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYL-H
        E   S FE++ GLPNCCG +  T        ++ S  + D     S+  Q V D   R L++V G+ G    S +L  S  FK  E  ++L+  P  L  
Subjt:  ELTSSAFEDLAGLPNCCGVVSCTR----FKIIRNSHFYED-----SVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYL-H

Query:  GVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF
        G  + +Y+ G   YPLLPWLI P        +  +FNE H  +   A  A   L+ +W +LS+ +   + +   + I  C +LHN ++   D+
Subjt:  GVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF

AT4G29780.1 unknown protein1.1e-1925.83Show/hide
Query:  DSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCR----VLCTNFRFWVEF
        D FR  FRM+ STF  +   L+  +       RD + +P       R+GV ++RLATG     +S++FG+  S       ++CR    VL   +  W   
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCR----VLCTNFRFWVEF

Query:  PCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIR---------NSHFYED------SVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQG
        P  +E+  T + FE +  +PN  G +  T   II          N    E       S+  Q VV++      +  G  G+  D  +L  S+L       
Subjt:  PCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIR---------NSHFYED------SVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQG

Query:  RLLNSPPVYLHGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPIHEEFKTAVAYIGACSILHNALLMR
            S      G+  + ++ G   +PL  +L+VP+       T+ +FNE+   +   A  A   L+  W  L +    + +     +GAC +LHN   MR
Subjt:  RLLNSPPVYLHGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPIHEEFKTAVAYIGACSILHNALLMR

Query:  ED
        ++
Subjt:  ED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCTCCTCGTTTAGCTGCTTTACTCTCTTCTTTGATCTCCCAACTCCTTCTCCTCCTCTTCCTCCTCTTCCCTTCCTCCAACCCACATTCCCTTTTCTCCAATTC
CACTCCCGATTCCAGTTTCTATGCCAATCTCTTCACCCACTTCCTCTTTTCCCAGGATTTTGCCGCCTCCCTTCCCTTTCTCTCTGTTTCCCGCAAGAGGAAGAGAACCA
ATCCCCCCGACCATCTCGAATTGGGGTCATCCCATGGACGAGTTCATCATCTGTTTCGGACTCGGACTCCTGATTCTTTCAGAAATCACTTCAGAATGACTTCTTCAACG
TTTGAATGGCTCTCTGGTTTGCTCGAGCCCCTTCTCGAGTGTCGTGACCCGGTAGGTTCGCCTCTTGATCTCTCCGTTGAGATTCGACTAGGTGTTGGTTTGTATCGCCT
CGCCACTGGCTGCGATTTCTCCACAATCTCCGACCAATTTGGTGTTTCGGAGTCTGTAGCGAGGTTCTGTTCTAAACAATTGTGTCGAGTTCTCTGTACTAATTTTCGCT
TCTGGGTTGAATTCCCTTGCCCCAATGAGCTGGAATTAACATCCTCGGCTTTTGAAGATCTTGCTGGGCTTCCGAATTGCTGCGGTGTGGTTTCTTGTACAAGGTTCAAG
ATCATTAGAAATAGCCATTTTTATGAAGATAGCGTCGCAACTCAACTTGTTGTTGATTCTTCGTCACGAATCCTTAGTATTGTTGCAGGATTTCGTGGCAATAAGGACGA
CTCGACAGTGCTTATGTCCTCGACGCTGTTTAAAGACATTGAACAAGGGAGGCTTCTGAATTCTCCTCCGGTTTACCTTCATGGGGTGGCTGTGAATAAGTACTTGTTTG
GACGTGGTGAATATCCTTTGCTTCCATGGTTAATAGTGCCTTTTGCAGGAGCTGTTTCAGGGTCAACTGAAGAGAGTTTCAATGAAGCTCACCGATTGATGTGCATTCCA
GCTCTGAAAGCAATTGTTAGTTTGAGGAATTGGGGAGTTTTGAGTCAACCAATTCATGAGGAGTTCAAAACTGCTGTTGCTTACATTGGTGCTTGCTCAATTCTTCATAA
TGCTTTGTTGATGAGGGAGGATTTTTCTGCCATGGCTGATGAGTGGGAGAGCTTATCTTCACTTGATCATAGATCTCAGTATGTTGAAGCTGGATTGAATGTGGACTCAA
CTAATGAGAAGGCTTCTGTTATACAGAGGGCATTGGCTCAGAGAGCTAGAGAGCTTCACAGTTAG
mRNA sequenceShow/hide mRNA sequence
CGCAACGAAAAAAAATTATAAACCCTAAATTGAAAGTAACCTAAGAAAAAGATAGCAAGTTGAAAGGCATGTGAAACCAAAAAAGAGCGTGGAGACTGGCTAGCTGATTT
GTGCAGGCAACCAAGCAGGTGTTGATATGGCGTAATTTCTACAAGATTCAACAACTTCCACGCCGGCGCTGACAAAAAGAAACCGAAAACCCATCTCCTTCGCAGCCGCT
CGATTCCCGAGAGAATCAAAAATCCCAATTCCCAACTACTGCAACTGCAAAACTGACACACACCCATTAATGGATTCTCCTCGTTTAGCTGCTTTACTCTCTTCTTTGAT
CTCCCAACTCCTTCTCCTCCTCTTCCTCCTCTTCCCTTCCTCCAACCCACATTCCCTTTTCTCCAATTCCACTCCCGATTCCAGTTTCTATGCCAATCTCTTCACCCACT
TCCTCTTTTCCCAGGATTTTGCCGCCTCCCTTCCCTTTCTCTCTGTTTCCCGCAAGAGGAAGAGAACCAATCCCCCCGACCATCTCGAATTGGGGTCATCCCATGGACGA
GTTCATCATCTGTTTCGGACTCGGACTCCTGATTCTTTCAGAAATCACTTCAGAATGACTTCTTCAACGTTTGAATGGCTCTCTGGTTTGCTCGAGCCCCTTCTCGAGTG
TCGTGACCCGGTAGGTTCGCCTCTTGATCTCTCCGTTGAGATTCGACTAGGTGTTGGTTTGTATCGCCTCGCCACTGGCTGCGATTTCTCCACAATCTCCGACCAATTTG
GTGTTTCGGAGTCTGTAGCGAGGTTCTGTTCTAAACAATTGTGTCGAGTTCTCTGTACTAATTTTCGCTTCTGGGTTGAATTCCCTTGCCCCAATGAGCTGGAATTAACA
TCCTCGGCTTTTGAAGATCTTGCTGGGCTTCCGAATTGCTGCGGTGTGGTTTCTTGTACAAGGTTCAAGATCATTAGAAATAGCCATTTTTATGAAGATAGCGTCGCAAC
TCAACTTGTTGTTGATTCTTCGTCACGAATCCTTAGTATTGTTGCAGGATTTCGTGGCAATAAGGACGACTCGACAGTGCTTATGTCCTCGACGCTGTTTAAAGACATTG
AACAAGGGAGGCTTCTGAATTCTCCTCCGGTTTACCTTCATGGGGTGGCTGTGAATAAGTACTTGTTTGGACGTGGTGAATATCCTTTGCTTCCATGGTTAATAGTGCCT
TTTGCAGGAGCTGTTTCAGGGTCAACTGAAGAGAGTTTCAATGAAGCTCACCGATTGATGTGCATTCCAGCTCTGAAAGCAATTGTTAGTTTGAGGAATTGGGGAGTTTT
GAGTCAACCAATTCATGAGGAGTTCAAAACTGCTGTTGCTTACATTGGTGCTTGCTCAATTCTTCATAATGCTTTGTTGATGAGGGAGGATTTTTCTGCCATGGCTGATG
AGTGGGAGAGCTTATCTTCACTTGATCATAGATCTCAGTATGTTGAAGCTGGATTGAATGTGGACTCAACTAATGAGAAGGCTTCTGTTATACAGAGGGCATTGGCTCAG
AGAGCTAGAGAGCTTCACAGTTAGGATTTCAATCACAAGAATGCAGTTGTTTAGTGAAATAAGTACAGCCCATTTGAAAGAGATTTCATCTCTTAGGATATTTATTGAAG
GCAGCTCATCCAGCTCCATTCAACAGTTACTATTGTAGGTAATTGATGATTCTTTTACAAATCTATATAACAACATTTTCTCCC
Protein sequenceShow/hide protein sequence
MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSFRNHFRMTSST
FEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFK
IIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIP
ALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLNVDSTNEKASVIQRALAQRARELHS