; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc06g0162491 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc06g0162491
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionDDE Tnp4 domain-containing protein
Genome locationCMiso1.1chr06:11418635..11420701
RNA-Seq ExpressionCmc06g0162491
SyntenyCmc06g0162491
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0016020 - membrane (cellular component)
GO:0016787 - hydrolase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037135.1 putative nuclease HARBI1 [Cucumis melo var. makuwa]1.1e-22293.4Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSF
        MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSF
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA

Query:  FEDLAGLPNCCGVVSCT----------------------------SIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPL
        FEDLAGLPNCCGVVSCT                            SIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPL
Subjt:  FEDLAGLPNCCGVVSCT----------------------------SIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPL

Query:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLN
        LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLN
Subjt:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLN

Query:  VDSTNEKASVIQRALAQRARELHS
        VDSTNEKASVIQRALAQRARELHS
Subjt:  VDSTNEKASVIQRALAQRARELHS

KGN57516.1 hypothetical protein Csa_011580 [Cucumis sativus]8.4e-21891.75Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSF
        MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNS PDSSFYANLF HFLFSQDFAASLPFLSVSRKRKRTN  DHLELGSSHGRVHHLFRTRTPDSF
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA

Query:  FEDLAGLPNCCGVVSCT----------------------------SIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPL
        FEDLAGLPNCCGVVSCT                            SIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFG GEYPL
Subjt:  FEDLAGLPNCCGVVSCT----------------------------SIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPL

Query:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLN
        LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDH+SQYVEAGLN
Subjt:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLN

Query:  VDSTNEKASVIQRALAQRARELHS
        VDSTNEKASVIQRALA RARELHS
Subjt:  VDSTNEKASVIQRALAQRARELHS

XP_008456140.1 PREDICTED: uncharacterized protein LOC103496169 [Cucumis melo]1.2e-19291.67Show/hide
Query:  FSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD
        F + FAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD
Subjt:  FSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD

Query:  FSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCT----------------------------SIVAGFR
        FSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCT                            SIVAGFR
Subjt:  FSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCT----------------------------SIVAGFR

Query:  GNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
        GNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
Subjt:  GNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF

Query:  KTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLNVDSTNEKASVIQRALAQRARELHS
        KTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLNVDSTNEKASVIQRALAQRARELHS
Subjt:  KTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLNVDSTNEKASVIQRALAQRARELHS

XP_023536005.1 protein ALP1-like [Cucurbita pepo subsp. pepo]3.2e-19385.26Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYAN---LFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSS--------HGRVH
        MDS +LAALLSSLISQLLLLL LLFPSSNPHSL SNS+ DS+FYAN   LF HFLFSQ  AASL FLSVSRKRKRT+  + LELG S         GRV 
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYAN---LFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSS--------HGRVH

Query:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP
        HL RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP

Query:  CPNELELTSSAFEDLAGLPNCCGVVSCTSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTE
        CP+ELELTSSAFED+AGLPNCCGV+SCTSIVAGFRG+KDDSTVLMS+TLFKDIE+GRLL SPPVYLHG+AVN+YLFG GEYPLLPWL+VPFAGAVSGSTE
Subjt:  CPNELELTSSAFEDLAGLPNCCGVVSCTSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTE

Query:  ESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLNVDSTNEKASVIQRALAQ
        ESFNEAHRLMCIPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESL+SLDH SQYV  GLN DS +EKAS+IQ+ALA 
Subjt:  ESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLNVDSTNEKASVIQRALAQ

Query:  RARELHS
        RARELH+
Subjt:  RARELHS

XP_038880641.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like [Benincasa hispida]4.8e-21393.43Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSF
        MDSP+LAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDS+FYANLFTHFLFSQDFAASLPFLSVSRKRKRTNP DHLELGSSHGR  HLFRTRTPDSF
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFS IS QFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA

Query:  FEDLAGLPNCCGVVSCTSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMC
        FEDLAGLPNCCGVV+CTSIVAGFRG+KDDSTVLMSSTLFKDIEQGRLL++PPVYLHGVAVN+YLFG GEYPLLPWL++PFAGAVSGSTEESFN+AHRLMC
Subjt:  FEDLAGLPNCCGVVSCTSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMC

Query:  IPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLNVDSTNEKASVIQRALAQRARELHS
        IPALKAIVSLRNWGVLS+P+HEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S DHRSQYVE  LN DSTNEKAS+IQRALA RARELHS
Subjt:  IPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLNVDSTNEKASVIQRALAQRARELHS

TrEMBL top hitse value%identityAlignment
A0A0A0LBX6 DDE Tnp4 domain-containing protein4.1e-21891.75Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSF
        MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNS PDSSFYANLF HFLFSQDFAASLPFLSVSRKRKRTN  DHLELGSSHGRVHHLFRTRTPDSF
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA

Query:  FEDLAGLPNCCGVVSCT----------------------------SIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPL
        FEDLAGLPNCCGVVSCT                            SIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFG GEYPL
Subjt:  FEDLAGLPNCCGVVSCT----------------------------SIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPL

Query:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLN
        LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDH+SQYVEAGLN
Subjt:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLN

Query:  VDSTNEKASVIQRALAQRARELHS
        VDSTNEKASVIQRALA RARELHS
Subjt:  VDSTNEKASVIQRALAQRARELHS

A0A1S3C2M8 uncharacterized protein LOC1034961695.9e-19391.67Show/hide
Query:  FSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD
        F + FAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD
Subjt:  FSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD

Query:  FSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCT----------------------------SIVAGFR
        FSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCT                            SIVAGFR
Subjt:  FSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCT----------------------------SIVAGFR

Query:  GNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
        GNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF
Subjt:  GNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEF

Query:  KTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLNVDSTNEKASVIQRALAQRARELHS
        KTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLNVDSTNEKASVIQRALAQRARELHS
Subjt:  KTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLNVDSTNEKASVIQRALAQRARELHS

A0A5D3CRB2 Putative nuclease HARBI15.5e-22393.4Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSF
        MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSF
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSA

Query:  FEDLAGLPNCCGVVSCT----------------------------SIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPL
        FEDLAGLPNCCGVVSCT                            SIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPL
Subjt:  FEDLAGLPNCCGVVSCT----------------------------SIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPL

Query:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLN
        LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLN
Subjt:  LPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLN

Query:  VDSTNEKASVIQRALAQRARELHS
        VDSTNEKASVIQRALAQRARELHS
Subjt:  VDSTNEKASVIQRALAQRARELHS

A0A6J1FNZ2 protein ALP1-like1.1e-19184.28Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYAN---LFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSS--------HGRVH
        MDS +LAALLSSLISQLLLLL LLFPSSNPHSL SNS+ DS+FYAN   LF HFLFSQ  AASL FLSVSRKRKRT+  + LELG S         GRV 
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYAN---LFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSS--------HGRVH

Query:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP
        HL RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP

Query:  CPNELELTSSAFEDLAGLPNCCGVVSCTSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTE
        CP+ELELTSS+FED+AGLPNCCGV+SCTSIVAGFRG+KDDSTVLMS+TLFKDIE+ RLL SPPVYLHG+AVN+YLFG GEYPLLPWL+VPFAGAVSGSTE
Subjt:  CPNELELTSSAFEDLAGLPNCCGVVSCTSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTE

Query:  ESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLNVDSTNEKASVIQRALAQ
        ESFNEAHRLMCIPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESL+SLDH SQYV  GLN DS +EKA+++Q+ALA 
Subjt:  ESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLNVDSTNEKASVIQRALAQ

Query:  RARELHS
        RARELH+
Subjt:  RARELHS

A0A6J1J0M5 protein ALP1-like1.9e-19184.77Show/hide
Query:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYAN---LFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSS--------HGRVH
        MDS +LAALLSSLISQLLLLL LLFPSSNPHSL SNS+ DS+FYAN   LF HFLFSQ  AASL FLSVSRKRKRT+  + LELG S         GRV 
Subjt:  MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYAN---LFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSS--------HGRVH

Query:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP
        HL RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFSTISDQFGVSESVARFC+KQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFP

Query:  CPNELELTSSAFEDLAGLPNCCGVVSCTSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTE
        CP+ELELTSSAFED+AGLPNCCGV+SCTSIVAGFRG+KDDSTVLMS+TLFKDIE+ RLL SPPVYLHGVAVN+YLFG G+YPLLPWL+VPFAGAVSGSTE
Subjt:  CPNELELTSSAFEDLAGLPNCCGVVSCTSIVAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTE

Query:  ESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLNVDSTNEKASVIQRALAQ
        ESFNEAHRLM IPALKAI+SLRNWGVLSQP+HEEFKTAVAYIGACSILHNALLMREDF+AMADEWESL+SLDH SQYV  GLN DS +EKAS+IQ+ALA 
Subjt:  ESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLNVDSTNEKASVIQRALAQ

Query:  RARELHS
        RARELH+
Subjt:  RARELHS

SwissProt top hitse value%identityAlignment
Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 12.8e-2226.62Show/hide
Query:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNEL
        +F++ FR + +TF ++  L+   L  R P G        LSVE ++ + L RLA+G    ++   FGV +S     + +    L    +  + +P  + +
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNEL

Query:  ELTSSAFEDLAGLPNCCGVVSCTSI-------------------------------------VAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYL-H
        E   S FE++ GLPNCCG +  T I                                     V G+ G    S +L  S  FK  E  ++L+  P  L  
Subjt:  ELTSSAFEDLAGLPNCCGVVSCTSI-------------------------------------VAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYL-H

Query:  GVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF
        G  + +Y+ G   YPLLPWLI P        +  +FNE H  +   A  A   L+ +W +LS+ +   + +   + I  C +LHN ++   D+
Subjt:  GVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF

Q9M2U3 Protein ALP1-like1.6e-2225.91Show/hide
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVA-----RFCSKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+P  LS+  R+ V L RL +G   S I + FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVA-----RFCSKQLCRVLCTNFRFWV

Query:  EFPCPNELELTSSAFEDLAGLPNCCGVVSCTSIV--------------------------------------AGFRGNKDDSTVLMSSTLFKDIEQGRLL
            P++L+   S FE ++GLPNCCG +  T IV                                      AG+ G+ +D  VL +S  +K +E+G+ L
Subjt:  EFPCPNELELTSSAFEDLAGLPNCCGVVSCTSIV--------------------------------------AGFRGNKDDSTVLMSSTLFKDIEQGRLL

Query:  NSPPVYL-HGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLMRE
        N   + L     + +Y+ G   +PLLPWL+ P+ G  +   +  FN+ H      A  A+  L++ W +++  +    +  +   I  C +LHN ++  E
Subjt:  NSPPVYL-HGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLMRE

Query:  D
        D
Subjt:  D

Arabidopsis top hitse value%identityAlignment
AT3G19120.1 PIF / Ping-Pong family of plant transposases5.4e-1323.04Show/hide
Query:  SLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHF----LFSQDFAASLPFLSVSRKRKRT---------NPPDHLELGSSHGRVHHLFRTRTPD
        +++S LL L   L P+S   S  S S+  S+  ++L +      L     A+ L FL+V+R    +         +PP  L  G         FR  T D
Subjt:  SLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHF----LFSQDFAASLPFLSVSRKRKRT---------NPPDHLELGSSHGRVHHLFRTRTPD

Query:  ------------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTN-FRFW
                     +R+ + ++   F  +   L+P +       S L L  +  + + L RLA GC   T++ ++ +   +    +  + R+L T  +  +
Subjt:  ------------SFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTN-FRFW

Query:  VEFPC-PNELELTSSAFEDLAGLPNCCGVVSCT---------------------------SIVAGFR-----------GNKDDSTVLMSSTLFKDIEQGR
        ++ P     L  T+  FE+L  LPN CG +  T                            +VA  +           G +DDS+    S L+K +  G 
Subjt:  VEFPC-PNELELTSSAFEDLAGLPNCCGVVSCT---------------------------SIVAGFR-----------GNKDDSTVLMSSTLFKDIEQGR

Query:  LLNSPPVYLHGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSL--RNWGVLSQPIHEEFKTAVAYIGACSILHN
        ++    + + G  V  Y+ G   YPLL +L+ PF+   SG+  E+  +   +     +   + L    W +L Q ++     A   I AC +LHN
Subjt:  LLNSPPVYLHGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSL--RNWGVLSQPIHEEFKTAVAYIGACSILHN

AT3G55350.1 PIF / Ping-Pong family of plant transposases1.2e-2325.91Show/hide
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVA-----RFCSKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+P  LS+  R+ V L RL +G   S I + FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVA-----RFCSKQLCRVLCTNFRFWV

Query:  EFPCPNELELTSSAFEDLAGLPNCCGVVSCTSIV--------------------------------------AGFRGNKDDSTVLMSSTLFKDIEQGRLL
            P++L+   S FE ++GLPNCCG +  T IV                                      AG+ G+ +D  VL +S  +K +E+G+ L
Subjt:  EFPCPNELELTSSAFEDLAGLPNCCGVVSCTSIV--------------------------------------AGFRGNKDDSTVLMSSTLFKDIEQGRLL

Query:  NSPPVYL-HGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLMRE
        N   + L     + +Y+ G   +PLLPWL+ P+ G  +   +  FN+ H      A  A+  L++ W +++  +    +  +   I  C +LHN ++  E
Subjt:  NSPPVYL-HGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRN-WGVLSQPIHEEFKTAV-AYIGACSILHNALLMRE

Query:  D
        D
Subjt:  D

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)2.0e-2326.62Show/hide
Query:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNEL
        +F++ FR + +TF ++  L+   L  R P G        LSVE ++ + L RLA+G    ++   FGV +S     + +    L    +  + +P  + +
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNEL

Query:  ELTSSAFEDLAGLPNCCGVVSCTSI-------------------------------------VAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYL-H
        E   S FE++ GLPNCCG +  T I                                     V G+ G    S +L  S  FK  E  ++L+  P  L  
Subjt:  ELTSSAFEDLAGLPNCCGVVSCTSI-------------------------------------VAGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYL-H

Query:  GVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF
        G  + +Y+ G   YPLLPWLI P        +  +FNE H  +   A  A   L+ +W +LS+ +   + +   + I  C +LHN ++   D+
Subjt:  GVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPI-HEEFKTAVAYIGACSILHNALLMREDF

AT4G29780.1 unknown protein5.2e-1625.09Show/hide
Query:  DSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCR----VLCTNFRFWVEF
        D FR  FRM+ STF  +   L+  +       RD + +P       R+GV ++RLATG     +S++FG+  S       ++CR    VL   +  W   
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCR----VLCTNFRFWVEF

Query:  PCPNELELTSSAFEDLAGLPNCCGVVSCTSI--------VAGF-------RGNKDDSTVLMSST-----LFKDI---EQGRLLN---------SPPVYLH
        P  +E+  T + FE +  +PN  G +  T I        VA +       R  K   ++ +        +F D+     G L +         S      
Subjt:  PCPNELELTSSAFEDLAGLPNCCGVVSCTSI--------VAGF-------RGNKDDSTVLMSST-----LFKDI---EQGRLLN---------SPPVYLH

Query:  GVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPIHEEFKTAVAYIGACSILHNALLMRED
        G+  + ++ G   +PL  +L+VP+       T+ +FNE+   +   A  A   L+  W  L +    + +     +GAC +LHN   MR++
Subjt:  GVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLR-NWGVLSQPIHEEFKTAVAYIGACSILHNALLMRED


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCTCCTCGTTTAGCTGCTTTACTCTCTTCTTTGATCTCCCAACTCCTTCTCCTCCTCTTCCTCCTCTTCCCTTCCTCCAACCCACATTCCCTTTTCTCTAATTC
CACTCCCGATTCCAGTTTCTATGCCAATCTCTTCACCCACTTCCTCTTTTCCCAGGATTTTGCCGCCTCCCTTCCCTTTCTCTCTGTTTCCCGCAAGAGGAAGAGAACCA
ATCCCCCCGACCATCTCGAATTGGGGTCATCCCATGGACGAGTTCATCATCTGTTTCGGACTCGGACTCCTGATTCTTTCAGAAATCACTTCAGAATGACTTCTTCAACG
TTTGAATGGCTCTCTGGTTTGCTCGAGCCCCTTCTCGAGTGTCGTGACCCGGTAGGTTCGCCTCTTGATCTCTCCGTTGAGATTCGACTAGGTGTTGGTTTGTATCGCCT
CGCCACTGGCTGCGATTTCTCCACAATCTCCGACCAATTTGGTGTTTCGGAGTCTGTAGCGAGGTTCTGTTCTAAACAATTGTGTCGAGTTCTCTGTACTAATTTTCGCT
TCTGGGTTGAATTCCCTTGCCCCAATGAGCTGGAATTAACATCCTCGGCTTTTGAAGATCTTGCTGGGCTTCCGAATTGCTGCGGTGTGGTTTCTTGTACAAGTATTGTT
GCAGGATTTCGTGGCAATAAGGACGACTCGACAGTGCTTATGTCCTCGACGCTGTTTAAAGACATTGAACAAGGGAGGCTTCTGAATTCTCCTCCGGTTTACCTTCATGG
GGTGGCTGTGAATAAGTACTTGTTTGGACGTGGTGAATATCCTTTGCTTCCATGGTTAATAGTGCCTTTTGCAGGAGCTGTTTCAGGGTCAACTGAAGAGAGTTTCAATG
AAGCTCACCGATTGATGTGCATTCCAGCTCTGAAAGCAATTGTTAGTTTGAGGAATTGGGGAGTTTTGAGTCAACCAATTCATGAGGAGTTCAAAACTGCTGTTGCTTAC
ATTGGTGCTTGCTCAATTCTTCATAATGCTTTGTTGATGAGGGAGGATTTTTCTGCCATGGCTGATGAGTGGGAGAGCTTATCTTCACTTGATCATAGATCTCAGTATGT
TGAAGCTGGATTGAATGTGGACTCAACTAATGAGAAGGCTTCTGTTATACAGAGGGCATTGGCTCAGAGAGCTAGAGAGCTTCACAGTTAG
mRNA sequenceShow/hide mRNA sequence
TTCGTTTAGATTATGTCATAAAGTTAATCGATTGTGTACAAATTAACTTGAACACTCGCGCAAGGAAAAAAAATCATAAACCCTAAATTGAAAGTAACCTAAGAAAAAGA
TAGCAAGTTGAAAGGCATGTGAAACCAAAAAAGAGCGTGGAGACTGGCTAGCTGATTTGTGCAGGCAACCAAGCAGGTGTTGATATGGCGTAATTTCTACAAGATTCAAC
AACTTCCACGCCGGCGCTGACAAAAAGAAACCGAAAACCCATCTCCTTCGCAGCCGCTCGATTCCCGAGAGAATCAAAAATCCCAATTCCCAACTACTGCAACTGCAAAA
CTGACACACACCCATTAATGGATTCTCCTCGTTTAGCTGCTTTACTCTCTTCTTTGATCTCCCAACTCCTTCTCCTCCTCTTCCTCCTCTTCCCTTCCTCCAACCCACAT
TCCCTTTTCTCTAATTCCACTCCCGATTCCAGTTTCTATGCCAATCTCTTCACCCACTTCCTCTTTTCCCAGGATTTTGCCGCCTCCCTTCCCTTTCTCTCTGTTTCCCG
CAAGAGGAAGAGAACCAATCCCCCCGACCATCTCGAATTGGGGTCATCCCATGGACGAGTTCATCATCTGTTTCGGACTCGGACTCCTGATTCTTTCAGAAATCACTTCA
GAATGACTTCTTCAACGTTTGAATGGCTCTCTGGTTTGCTCGAGCCCCTTCTCGAGTGTCGTGACCCGGTAGGTTCGCCTCTTGATCTCTCCGTTGAGATTCGACTAGGT
GTTGGTTTGTATCGCCTCGCCACTGGCTGCGATTTCTCCACAATCTCCGACCAATTTGGTGTTTCGGAGTCTGTAGCGAGGTTCTGTTCTAAACAATTGTGTCGAGTTCT
CTGTACTAATTTTCGCTTCTGGGTTGAATTCCCTTGCCCCAATGAGCTGGAATTAACATCCTCGGCTTTTGAAGATCTTGCTGGGCTTCCGAATTGCTGCGGTGTGGTTT
CTTGTACAAGTATTGTTGCAGGATTTCGTGGCAATAAGGACGACTCGACAGTGCTTATGTCCTCGACGCTGTTTAAAGACATTGAACAAGGGAGGCTTCTGAATTCTCCT
CCGGTTTACCTTCATGGGGTGGCTGTGAATAAGTACTTGTTTGGACGTGGTGAATATCCTTTGCTTCCATGGTTAATAGTGCCTTTTGCAGGAGCTGTTTCAGGGTCAAC
TGAAGAGAGTTTCAATGAAGCTCACCGATTGATGTGCATTCCAGCTCTGAAAGCAATTGTTAGTTTGAGGAATTGGGGAGTTTTGAGTCAACCAATTCATGAGGAGTTCA
AAACTGCTGTTGCTTACATTGGTGCTTGCTCAATTCTTCATAATGCTTTGTTGATGAGGGAGGATTTTTCTGCCATGGCTGATGAGTGGGAGAGCTTATCTTCACTTGAT
CATAGATCTCAGTATGTTGAAGCTGGATTGAATGTGGACTCAACTAATGAGAAGGCTTCTGTTATACAGAGGGCATTGGCTCAGAGAGCTAGAGAGCTTCACAGTTAGGA
TTTCAATCACAAGAATGCAGTTGTTTAGTGAAATAAGTACAGCCCATTTGAAAGAGATTTCATCTCTTAGGATATTTATTGAAGGCAGCTCATCCAGCTCCATTCAACAG
TTACTATTGTAGGTAATTGATGATTCTTTTACAAATCTATATAACAACATTTTCTCCCAATTTTTGGGTGTTCAGCTCTTTATTTTATCTCTTTTCCTTTTTACACAAAA
GTGGTTTAAATTTTTTTTAGTTTCTATGTCTAGAATTAGTTTAGTGGTTTTATGGTTATCAAAAAAAGTAGGCAAAAGACTGGGATATTTGGAGATACCCTTTAAGGCAT
GGGTAATCTTCAAGTGGAAAATTTGATAATTTAGGACTTTTGTTGGGGGTAATGATTCTCAGAATGACAAGAATGACCACAAGAAATAGAATCATATTGAATCAATGTGA
TCA
Protein sequenceShow/hide protein sequence
MDSPRLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSSFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPPDHLELGSSHGRVHHLFRTRTPDSFRNHFRMTSST
FEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSTISDQFGVSESVARFCSKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVSCTSIV
AGFRGNKDDSTVLMSSTLFKDIEQGRLLNSPPVYLHGVAVNKYLFGRGEYPLLPWLIVPFAGAVSGSTEESFNEAHRLMCIPALKAIVSLRNWGVLSQPIHEEFKTAVAY
IGACSILHNALLMREDFSAMADEWESLSSLDHRSQYVEAGLNVDSTNEKASVIQRALAQRARELHS