CuGenDBv2

CSPI03G18330 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI03G18330
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: DDE Tnp4 domain-containing protein
Genome location: Chr3:13966587..13968359
RNA-Seq Expression: CSPI03G18330
Synteny: CSPI03G18330
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
                     GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR027806 - Harbinger transposase-derived nuclease domain
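The genome location above uses a `chromosome:start..end` pattern. As a minimal sketch, the span of the locus can be derived from it, assuming the coordinates are 1-based and inclusive (a common genome-browser convention, but not stated on this page). The helper name `parse_location` is hypothetical.

```python
import re

def parse_location(location):
    """Split a string such as 'Chr3:13966587..13968359' into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("Chr3:13966587..13968359")
# Assumption: 1-based, inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 1773 bp.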


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0037135.1 putative nuclease HARBI1 [Cucumis melo var. makuwa] | e-value: 3.5e-238 | identity: 98.35%
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFAHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSF
        MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNS PDSSFYANLF HFLFSQDFAASLPFLSVSRKRKRTN  DHLELGSSHGRVHHLFRTRTPDSF
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFAHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA

Query:  FEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPL
        FEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFG GEYPL
Subjt:  FEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPL

Query:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLN
        LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDH+SQYVEAGLN
Subjt:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLN

Query:  VDSTNEKASVIQRALALRARELHS
        VDSTNEKASVIQRALA RARELHS
Subjt:  VDSTNEKASVIQRALALRARELHS

KGN57516.1 hypothetical protein Csa_011580 [Cucumis sativus] | e-value: 1.8e-242 | identity: 100%
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFAHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSF
        MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFAHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSF
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFAHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA

Query:  FEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPL
        FEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPL
Subjt:  FEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPL

Query:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLN
        LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLN
Subjt:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLN

Query:  VDSTNEKASVIQRALALRARELHS
        VDSTNEKASVIQRALALRARELHS
Subjt:  VDSTNEKASVIQRALALRARELHS

XP_008456140.1 PREDICTED: uncharacterized protein LOC103496169 [Cucumis melo] | e-value: 2.0e-209 | identity: 97.85%
Query:  FSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD
        F + FAASLPFLSVSRKRKRTN  DHLELGSSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD
Subjt:  FSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD

Query:  FSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFR
        FSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFR
Subjt:  FSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFR

Query:  GNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
        GNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFG GEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
Subjt:  GNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF

Query:  KTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLNVDSTNEKASVIQRALALRARELHS
        KTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDH+SQYVEAGLNVDSTNEKASVIQRALA RARELHS
Subjt:  KTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLNVDSTNEKASVIQRALALRARELHS

XP_023536005.1 protein ALP1-like [Cucurbita pepo subsp. pepo] | e-value: 7.2e-191 | identity: 80.46%
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFA---HFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSS--------HGRVH
        MDS +LAALLSSLISQLLLLL LLFPSSNPHSL SNS+ DS+FYANLF    HFLFSQ  AASL FLSVSRKRKRT+ S+ LELG S         GRV 
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFA---HFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSS--------HGRVH

Query:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP
        HL RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP

Query:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVN
        CP+ELELTSSAFED+AGLPNCCGV+SCT                            SIVAGFRG+KDDSTVLMS+TLFKDIE+GRLL SPPVYLHG+AVN
Subjt:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVN

Query:  KYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLD
        +YLFGHGEYPLLPWL+VPFAGAVSGSTEESFNEAHRLMCIPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESL+SLD
Subjt:  KYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLD

Query:  HKSQYVEAGLNVDSTNEKASVIQRALALRARELHS
        H SQYV  GLN DS +EKAS+IQ+ALALRARELH+
Subjt:  HKSQYVEAGLNVDSTNEKASVIQRALALRARELHS

XP_038880641.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like [Benincasa hispida] | e-value: 8.5e-208 | identity: 86.79%
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFAHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSF
        MDSP+LAALLSSLISQLLLLLFLLFPSSNPHSLFSNS PDS+FYANLF HFLFSQDFAASLPFLSVSRKRKRTN SDHLELGSSHGR  HLFRTRTPDSF
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFAHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFS IS QFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA

Query:  FEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPL
        FEDLAGLPNCCGVV+CT                            SIVAGFRG+KDDSTVLMSSTLFKDIEQGRLL++PPVYLHGVAVN+YLFGHGEYPL
Subjt:  FEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPL

Query:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLN
        LPWL++PFAGAVSGSTEESFN+AHRLMCIPALKAIVSLRNWGVLS+P+HEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S DH+SQYVE  LN
Subjt:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLN

Query:  VDSTNEKASVIQRALALRARELHS
         DSTNEKAS+IQRALA+RARELHS
Subjt:  VDSTNEKASVIQRALALRARELHS

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LBX6 DDE Tnp4 domain-containing protein | e-value: 8.7e-243 | identity: 100%
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFAHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSF
        MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFAHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSF
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFAHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA

Query:  FEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPL
        FEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPL
Subjt:  FEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPL

Query:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLN
        LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLN
Subjt:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLN

Query:  VDSTNEKASVIQRALALRARELHS
        VDSTNEKASVIQRALALRARELHS
Subjt:  VDSTNEKASVIQRALALRARELHS

A0A1S3C2M8 uncharacterized protein LOC103496169 | e-value: 9.7e-210 | identity: 97.85%
Query:  FSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD
        F + FAASLPFLSVSRKRKRTN  DHLELGSSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD
Subjt:  FSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD

Query:  FSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFR
        FSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFR
Subjt:  FSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFR

Query:  GNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
        GNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFG GEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
Subjt:  GNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF

Query:  KTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLNVDSTNEKASVIQRALALRARELHS
        KTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDH+SQYVEAGLNVDSTNEKASVIQRALA RARELHS
Subjt:  KTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLNVDSTNEKASVIQRALALRARELHS

A0A5D3CRB2 Putative nuclease HARBI1 | e-value: 1.7e-238 | identity: 98.35%
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFAHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSF
        MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNS PDSSFYANLF HFLFSQDFAASLPFLSVSRKRKRTN  DHLELGSSHGRVHHLFRTRTPDSF
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFAHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA

Query:  FEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPL
        FEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFG GEYPL
Subjt:  FEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPL

Query:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLN
        LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDH+SQYVEAGLN
Subjt:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLN

Query:  VDSTNEKASVIQRALALRARELHS
        VDSTNEKASVIQRALA RARELHS
Subjt:  VDSTNEKASVIQRALALRARELHS

A0A6J1FNZ2 protein ALP1-like | e-value: 2.5e-189 | identity: 79.54%
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFA---HFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSS--------HGRVH
        MDS +LAALLSSLISQLLLLL LLFPSSNPHSL SNS+ DS+FYANLF    HFLFSQ  AASL FLSVSRKRKRT+ S+ LELG S         GRV 
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFA---HFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSS--------HGRVH

Query:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP
        HL RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP

Query:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVN
        CP+ELELTSS+FED+AGLPNCCGV+SCT                            SIVAGFRG+KDDSTVLMS+TLFKDIE+ RLL SPPVYLHG+AVN
Subjt:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVN

Query:  KYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLD
        +YLFGHGEYPLLPWL+VPFAGAVSGSTEESFNEAHRLMCIPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESL+SLD
Subjt:  KYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLD

Query:  HKSQYVEAGLNVDSTNEKASVIQRALALRARELHS
        H SQYV  GLN DS +EKA+++Q+ALALRARELH+
Subjt:  HKSQYVEAGLNVDSTNEKASVIQRALALRARELHS

A0A6J1J0M5 protein ALP1-like | e-value: 4.3e-189 | identity: 80%
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFA---HFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSS--------HGRVH
        MDS +LAALLSSLISQLLLLL LLFPSSNPHSL SNS+ DS+FYANLF    HFLFSQ  AASL FLSVSRKRKRT+ S+ LELG S         GRV 
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFA---HFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSS--------HGRVH

Query:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP
        HL RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP

Query:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVN
        CP+ELELTSSAFED+AGLPNCCGV+SCT                            SIVAGFRG+KDDSTVLMS+TLFKDIE+ RLL SPPVYLHGVAVN
Subjt:  CPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVN

Query:  KYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLD
        +YLFGHG+YPLLPWL+VPFAGAVSGSTEESFNEAHRLM IPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESL+SLD
Subjt:  KYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLD

Query:  HKSQYVEAGLNVDSTNEKASVIQRALALRARELHS
        H SQYV  GLN DS +EKAS+IQ+ALALRARELH+
Subjt:  HKSQYVEAGLNVDSTNEKASVIQRALALRARELHS

SwissProt top hits (e-value, % identity, alignment)
Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 | e-value: 3.1e-27 | identity: 29.01%
Query:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNEL
        +F++ FR + +TF ++  L+   L  R P G        LSVE ++ + L RLA+G    ++   FGV +S     + +    L    +  + +P  + +
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNEL

Query:  ELTSSAFEDLAGLPNCCGVVSCTR----FKIIRNSHFYED-----SVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYL-H
        E   S FE++ GLPNCCG +  T        ++ S  + D     S+  Q V D   R L++V G+ G    S +L  S  FK  E  ++L+  P  L  
Subjt:  ELTSSAFEDLAGLPNCCGVVSCTR----FKIIRNSHFYED-----SVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYL-H

Query:  GVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF
        G  + +Y+ G   YPLLPWLI P        +  +FNE H  +   A  A   L+ +W +LS+ +   + +   + I  C +LHN ++   D+
Subjt:  GVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF

Q9M2U3 Protein ALP1-like | e-value: 9.6e-29 | identity: 28.38%
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVA-----RFCSKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+P  LS+  R+ V L RL +G   S I + FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVA-----RFCSKQLCRVLCTNFRFWV

Query:  EFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYED------------SVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGR
            P++L+   S FE ++GLPNCCG +  T   I+ N    E             S+  Q VVD   R L ++AG+ G+ +D  VL +S  +K +E+G+
Subjt:  EFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYED------------SVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGR

Query:  LLNSPPVYL-HGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLM
         LN   + L     + +Y+ G   +PLLPWL+ P+ G  +   +  FN+ H      A  A+  L++ W +++  +    +  +   I  C +LHN ++ 
Subjt:  LLNSPPVYL-HGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLM

Query:  RED
         ED
Subjt:  RED

Arabidopsis top hits (e-value, % identity, alignment)
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714) | e-value: 1.1e-14 | identity: 27.68%
Query:  HFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGV-SESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAF
        +FRM+ STF  L  +L                S        ++RLA G  +  +  +FG  S S A      +C+++       ++ P P+        F
Subjt:  HFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGV-SESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAF

Query:  EDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYL-HGVAVNKYLFGHGEYPL
             LPNC GVV   RF++       + S+  Q +VDS+ R + I AG+        +   + LF   E+  +L+  P  L +GV V +Y+ G    PL
Subjt:  EDLAGLPNCCGVVSCTRFKIIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYL-HGVAVNKYLFGHGEYPL

Query:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALK----AIVSLR-NWGVLS---QPIHEEFKTAVAYIGACSILHNALLMREDFSAMADE
        LPWL+ P+      S EESF E    +    L     A   +R  W +L    +P   EF   V   G   +LHN L+   D     +E
Subjt:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALK----AIVSLR-NWGVLS---QPIHEEFKTAVAYIGACSILHNALLMREDFSAMADE

AT3G19120.1 PIF / Ping-Pong family of plant transposases | e-value: 3.5e-18 | identity: 24.23%
Query:  SLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLF----AHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHH------LFRTRTPD---
        +++S LL L   L P+S   S  S S+  S+  ++L     A  L     A+ L FL+V+R    ++ S      S    +         FR  T D   
Subjt:  SLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLF----AHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHH------LFRTRTPD---

Query:  ---------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTN-FRFWVEF
                  +R+ + ++   F  +   L+P +       S L L  +  + + L RLA GC   T++ ++ +   +    +  + R+L T  +  +++ 
Subjt:  ---------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTN-FRFWVEF

Query:  PC-PNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNS----------HFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLN
        P     L  T+  FE+L  LPN CG +  T  K+ R +           +  D+V  Q+V D       +     G +DDS+    S L+K +  G ++ 
Subjt:  PC-PNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNS----------HFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLN

Query:  SPPVYLHGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSL--RNWGVLSQPIHEEFKTAVAYIGACSILHN
           + + G  V  Y+ G   YPLL +L+ PF+   SG+  E+  +   +     +   + L    W +L Q ++     A   I AC +LHN
Subjt:  SPPVYLHGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSL--RNWGVLSQPIHEEFKTAVAYIGACSILHN

AT3G55350.1 PIF / Ping-Pong family of plant transposases | e-value: 6.8e-30 | identity: 28.38%
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVA-----RFCSKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+P  LS+  R+ V L RL +G   S I + FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVA-----RFCSKQLCRVLCTNFRFWV

Query:  EFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYED------------SVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGR
            P++L+   S FE ++GLPNCCG +  T   I+ N    E             S+  Q VVD   R L ++AG+ G+ +D  VL +S  +K +E+G+
Subjt:  EFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIRNSHFYED------------SVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGR

Query:  LLNSPPVYL-HGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLM
         LN   + L     + +Y+ G   +PLLPWL+ P+ G  +   +  FN+ H      A  A+  L++ W +++  +    +  +   I  C +LHN ++ 
Subjt:  LLNSPPVYL-HGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLM

Query:  RED
         ED
Subjt:  RED

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912) | e-value: 2.2e-28 | identity: 29.01%
Query:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNEL
        +F++ FR + +TF ++  L+   L  R P G        LSVE ++ + L RLA+G    ++   FGV +S     + +    L    +  + +P  + +
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNEL

Query:  ELTSSAFEDLAGLPNCCGVVSCTR----FKIIRNSHFYED-----SVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYL-H
        E   S FE++ GLPNCCG +  T        ++ S  + D     S+  Q V D   R L++V G+ G    S +L  S  FK  E  ++L+  P  L  
Subjt:  ELTSSAFEDLAGLPNCCGVVSCTR----FKIIRNSHFYED-----SVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYL-H

Query:  GVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF
        G  + +Y+ G   YPLLPWLI P        +  +FNE H  +   A  A   L+ +W +LS+ +   + +   + I  C +LHN ++   D+
Subjt:  GVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF

AT4G29780.1 unknown protein | e-value: 8.3e-20 | identity: 25.83%
Query:  DSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCR----VLCTNFRFWVEF
        D FR  FRM+ STF  +   L+  +       RD + +P       R+GV ++RLATG     +S++FG+  S       ++CR    VL   +  W   
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCR----VLCTNFRFWVEF

Query:  PCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIR---------NSHFYED------SVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQG
        P  +E+  T + FE +  +PN  G +  T   II          N    E       S+  Q VV++      +  G  G+  D  +L  S+L       
Subjt:  PCPNELELTSSAFEDLAGLPNCCGVVSCTRFKIIR---------NSHFYED------SVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQG

Query:  RLLNSPPVYLHGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPIHEEFKTAVAYIGACSILHNALLMR
            S      G+  + ++ G+  +PL  +L+VP+       T+ +FNE+   +   A  A   L+  W  L +    + +     +GAC +LHN   MR
Subjt:  RLLNSPPVYLHGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPIHEEFKTAVAYIGACSILHNALLMR

Query:  ED
        ++
Subjt:  ED
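
In the alignments above, the unlabeled middle line annotates each column in the usual BLAST style: a letter for an identical residue, `+` for a conservative substitution, and a blank for a mismatch. As a rough sketch, the identity of one alignment segment can be tallied from that line. The function name `midline_stats` is hypothetical, and the figure it yields applies only to the segment passed in, not to the whole-alignment %identity reported for each hit.

```python
def midline_stats(query, midline):
    """Return (identical, positives, length) for one alignment segment.

    Assumes the BLAST-style midline convention: letter = identical residue,
    '+' = conservative substitution, space = mismatch or gap.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(ch.isalpha() for ch in midline)
    positives = identical + midline.count("+")
    return identical, positives, len(midline)

# Final segment of the KAA0037135.1 alignment shown in this record.
query   = "VDSTNEKASVIQRALALRARELHS"
midline = "VDSTNEKASVIQRALA RARELHS"
identical, positives, length = midline_stats(query, midline)
print(f"{identical}/{length} identical ({100 * identical / length:.2f}%)")
```

Summing the identical counts and lengths over every segment of a hit, rather than averaging per-segment percentages, gives a figure comparable to the reported %identity.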


Sequences
CDS sequence
ATGGATTCTCCTCGTTTAGCTGCTTTACTCTCTTCTTTGATCTCCCAACTCCTTCTCCTCCTCTTCCTCCTCTTCCCTTCCTCCAACCCACATTCCCTTTTCTCCAATTC
CGCCCCAGATTCCAGTTTCTATGCCAATCTCTTCGCCCACTTCCTCTTTTCCCAGGATTTTGCCGCTTCCCTTCCCTTTCTCTCTGTTTCCCGCAAGAGGAAGAGAACCA
ATCGCTCCGACCATCTCGAATTGGGGTCTTCCCATGGACGAGTTCATCATCTGTTTCGGACCCGGACTCCTGATTCTTTCAGAAATCACTTCAGAATGACTTCTTCAACG
TTTGAATGGCTCTCTGGTTTGCTCGAGCCCCTTCTCGAGTGTCGTGACCCGGTGGGTTCGCCTCTCGATCTCTCCGTTGAGATTCGACTAGGTGTTGGTTTGTATCGCCT
CGCCACTGGCTGCGATTTCTCCACAATCTCCGACCAATTTGGTGTTTCGGAGTCTGTAGCGAGGTTCTGTTCTAAACAATTGTGTCGAGTTCTCTGTACTAATTTTCGCT
TCTGGGTTGAATTCCCTTGCCCCAATGAGCTGGAATTAACATCCTCGGCTTTTGAAGATCTTGCTGGGCTTCCGAATTGCTGCGGTGTGGTTTCTTGTACAAGGTTCAAG
ATTATTAGAAATAGCCATTTTTATGAAGATAGCGTCGCAACTCAACTTGTTGTTGATTCTTCGTCACGAATCCTTAGTATTGTTGCAGGATTTCGTGGCAATAAGGACGA
CTCGACAGTGCTTATGTCCTCGACGCTGTTTAAAGACATTGAACAAGGAAGGCTTCTGAATTCTCCTCCGGTTTACCTTCATGGGGTGGCTGTGAATAAGTACTTGTTTG
GACATGGTGAATATCCTTTGCTTCCATGGTTAATAGTGCCTTTTGCAGGAGCTGTTTCAGGGTCAACTGAAGAGAGTTTCAATGAAGCTCACCGATTGATGTGCATTCCA
GCTCTGAAAGCAATTGTTAGTTTGAGAAATTGGGGAGTTTTGAGTCAACCAATTCATGAGGAGTTCAAAACTGCTGTTGCTTACATTGGTGCTTGCTCAATTCTTCATAA
TGCTTTGTTGATGAGGGAGGATTTTTCTGCCATGGCTGATGAGTGGGAGAGCTTATCTTCACTTGATCATAAATCTCAGTATGTTGAAGCTGGATTGAATGTGGACTCAA
CTAATGAGAAAGCTTCTGTTATACAGAGAGCATTGGCTCTGAGAGCTAGAGAGCTTCACAGTTAG
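
The CDS above and the protein sequence listed in this record are related by translation. The sketch below shows that step on two short fragments copied from the CDS (its first 21 and last 12 nucleotides), assuming the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative and do not come from the database.

```python
BASES = "TCAG"
# Standard genetic code, ordered with T, C, A, G at each codon position.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        residue = CODON_TABLE[cds[i:i + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

print(translate("ATGGATTCTCCTCGTTTAGCT"))  # start of the CDS
print(translate("CTTCACAGTTAG"))           # end of the CDS, including the stop codon
```

The two fragments translate to `MDSPRLA` and `LHS`, which agree with the first and last residues of the protein sequence in this record.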
mRNA sequence
GAAAAAGAAAAAAAAAGTCATAAACCCTAAATTGAAAGGAACTTAAGAAAAAGATAGCAAATAAAGAAGCAAGTTGAAAGGCATGTGAAACCAAAAAAGAGCGTGGAGAT
TGGCCAGCTGATTTGTGCAGGCAACCAAGCAGGTGTTGATATGGCGTAATTTCTACAAGATTCAACAACTCCCACGCCCGCGCTGACAAAAAAACCGAAAACCCATCTCC
TTCGCAGCCGCTCGATTCCCGAGAGAATCGAAAATTCCCAATTCCCAACTACTGCAACTGCAAAACTGACACACACCCATTAATGGATTCTCCTCGTTTAGCTGCTTTAC
TCTCTTCTTTGATCTCCCAACTCCTTCTCCTCCTCTTCCTCCTCTTCCCTTCCTCCAACCCACATTCCCTTTTCTCCAATTCCGCCCCAGATTCCAGTTTCTATGCCAAT
CTCTTCGCCCACTTCCTCTTTTCCCAGGATTTTGCCGCTTCCCTTCCCTTTCTCTCTGTTTCCCGCAAGAGGAAGAGAACCAATCGCTCCGACCATCTCGAATTGGGGTC
TTCCCATGGACGAGTTCATCATCTGTTTCGGACCCGGACTCCTGATTCTTTCAGAAATCACTTCAGAATGACTTCTTCAACGTTTGAATGGCTCTCTGGTTTGCTCGAGC
CCCTTCTCGAGTGTCGTGACCCGGTGGGTTCGCCTCTCGATCTCTCCGTTGAGATTCGACTAGGTGTTGGTTTGTATCGCCTCGCCACTGGCTGCGATTTCTCCACAATC
TCCGACCAATTTGGTGTTTCGGAGTCTGTAGCGAGGTTCTGTTCTAAACAATTGTGTCGAGTTCTCTGTACTAATTTTCGCTTCTGGGTTGAATTCCCTTGCCCCAATGA
GCTGGAATTAACATCCTCGGCTTTTGAAGATCTTGCTGGGCTTCCGAATTGCTGCGGTGTGGTTTCTTGTACAAGGTTCAAGATTATTAGAAATAGCCATTTTTATGAAG
ATAGCGTCGCAACTCAACTTGTTGTTGATTCTTCGTCACGAATCCTTAGTATTGTTGCAGGATTTCGTGGCAATAAGGACGACTCGACAGTGCTTATGTCCTCGACGCTG
TTTAAAGACATTGAACAAGGAAGGCTTCTGAATTCTCCTCCGGTTTACCTTCATGGGGTGGCTGTGAATAAGTACTTGTTTGGACATGGTGAATATCCTTTGCTTCCATG
GTTAATAGTGCCTTTTGCAGGAGCTGTTTCAGGGTCAACTGAAGAGAGTTTCAATGAAGCTCACCGATTGATGTGCATTCCAGCTCTGAAAGCAATTGTTAGTTTGAGAA
ATTGGGGAGTTTTGAGTCAACCAATTCATGAGGAGTTCAAAACTGCTGTTGCTTACATTGGTGCTTGCTCAATTCTTCATAATGCTTTGTTGATGAGGGAGGATTTTTCT
GCCATGGCTGATGAGTGGGAGAGCTTATCTTCACTTGATCATAAATCTCAGTATGTTGAAGCTGGATTGAATGTGGACTCAACTAATGAGAAAGCTTCTGTTATACAGAG
AGCATTGGCTCTGAGAGCTAGAGAGCTTCACAGTTAGGATTTCTATCACAAAAATGCAGTTGTTTAGTGAAATAAGTACAGCCCATTTGAAAGAGATTTCCATCTCTTAA
GATATTTATTGAAGGCAGCTCATCCAGCTCCATTCAACAATTACTATTGTAGGTAATTGATGATTCTTTTACAAATCTCTATAACAACATTTTCTCCCAAATTTTAGGTG
TTCAGCTAATTCT
Protein sequence
MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSAPDSSFYANLFAHFLFSQDFAASLPFLSVSRKRKRTNRSDHLELGSSHGRVHHLFRTRTPDSFRNHFRMTSST
FEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTRFK
IIRNSHFYEDSVATQLVVDSSSRILSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGHGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIP
ALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHKSQYVEAGLNVDSTNEKASVIQRALALRARELHS