CuGenDBv2

Bhi01G001266 (gene) of Wax gourd (B227) v1 genome

Gene ID: Bhi01G001266
Organism: Benincasa hispida cv. B227 (Wax gourd (B227) v1)
Description: DDE Tnp4 domain-containing protein
Genome location: chr1:35298746..35301896
RNA-Seq Expression: Bhi01G001266
Synteny: Bhi01G001266
Gene Ontology terms:
  GO:0016021 - integral component of membrane (cellular component)
  GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR027806 - Harbinger transposase-derived nuclease domain
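The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a convention this page does not state), the span of the gene could be computed as follows. `parse_location` is a hypothetical helper name used only for illustration; it is not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split 'chrom:start..end' into its parts and compute the span length.

    Assumes 1-based, inclusive coordinates (an assumption, not stated on the page).
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    # Inclusive coordinates: both endpoints count toward the length.
    return chrom, start, end, end - start + 1


print(parse_location("chr1:35298746..35301896"))
# ('chr1', 35298746, 35301896, 3151)
```

Under that assumption the gene spans 3151 bp on chr1. With half-open coordinates the length would instead be 3150 bp.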


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0037135.1 putative nuclease HARBI1 [Cucumis melo var. makuwa] (e-value: 8.7e-229; identity: 93.63%)
Query:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSNFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPSDHLELGSSHGRFDHLFRTRTPDSF
        MDSP+LAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDS+FYANLFTHFLFSQDFAASLPFLSVSRKRKRTNP DHLELGSSHGR  HLFRTRTPDSF
Subjt:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSNFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPSDHLELGSSHGRFDHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFS IS QFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSA

Query:  FEDLAGLPNCCGVVTCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGEYPL
        FEDLAGLPNCCGVV+CTRFKIIRNSHFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTLFKDIEQGRLL++PPVYLHGVAVN+YLFG GEYPL
Subjt:  FEDLAGLPNCCGVVTCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGEYPL

Query:  LPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSKPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHRSQYVEDKLN
        LPWL++PFAGAVSGSTEESFN+AHRLMCIPALKAIVSLRNWGVLS+P+HEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S DHRSQYVE  LN
Subjt:  LPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSKPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHRSQYVEDKLN

Query:  EDSTNEKASIIQRALAVRARELHS
         DSTNEKAS+IQRALA RARELHS
Subjt:  EDSTNEKASIIQRALAVRARELHS

KGN57516.1 hypothetical protein Csa_011580 [Cucumis sativus] (e-value: 4.3e-228; identity: 93.16%)
Query:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSNFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPSDHLELGSSHGRFDHLFRTRTPDSF
        MDSP+LAALLSSLISQLLLLLFLLFPSSNPHSLFSNS PDS+FYANLF HFLFSQDFAASLPFLSVSRKRKRTN SDHLELGSSHGR  HLFRTRTPDSF
Subjt:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSNFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPSDHLELGSSHGRFDHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFS IS QFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSA

Query:  FEDLAGLPNCCGVVTCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGEYPL
        FEDLAGLPNCCGVV+CTRFKIIRNSHFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTLFKDIEQGRLL++PPVYLHGVAVN+YLFGHGEYPL
Subjt:  FEDLAGLPNCCGVVTCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGEYPL

Query:  LPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSKPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHRSQYVEDKLN
        LPWL++PFAGAVSGSTEESFN+AHRLMCIPALKAIVSLRNWGVLS+P+HEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S DH+SQYVE  LN
Subjt:  LPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSKPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHRSQYVEDKLN

Query:  EDSTNEKASIIQRALAVRARELHS
         DSTNEKAS+IQRALA+RARELHS
Subjt:  EDSTNEKASIIQRALAVRARELHS

XP_008456140.1 PREDICTED: uncharacterized protein LOC103496169 [Cucumis melo] (e-value: 1.1e-199; identity: 92.47%)
Query:  FSQDFAASLPFLSVSRKRKRTNPSDHLELGSSHGRFDHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD
        F + FAASLPFLSVSRKRKRTNP DHLELGSSHGR  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD
Subjt:  FSQDFAASLPFLSVSRKRKRTNPSDHLELGSSHGRFDHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD

Query:  FSKISHQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVTCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFR
        FS IS QFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVV+CTRFKIIRNSHFYEDS+ATQLVVDSSSRILSIVAGFR
Subjt:  FSKISHQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVTCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFR

Query:  GDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGEYPLLPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSKPMHEEF
        G+KDDSTVLMSSTLFKDIEQGRLL++PPVYLHGVAVN+YLFG GEYPLLPWL++PFAGAVSGSTEESFN+AHRLMCIPALKAIVSLRNWGVLS+P+HEEF
Subjt:  GDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGEYPLLPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSKPMHEEF

Query:  KTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHRSQYVEDKLNEDSTNEKASIIQRALAVRARELHS
        KTAVAYIGACSILHNALLMREDFSAMADEWESL+S DHRSQYVE  LN DSTNEKAS+IQRALA RARELHS
Subjt:  KTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHRSQYVEDKLNEDSTNEKASIIQRALAVRARELHS

XP_023536005.1 protein ALP1-like [Cucurbita pepo subsp. pepo] (e-value: 3.9e-189; identity: 80%)
Query:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSNFYAN---LFTHFLFSQDFAASLPFLSVSRKRKRTNPSDHLELGSS--------HGRFD
        MDS QLAALLSSLISQLLLLL LLFPSSNPHSL SNS+ DSNFYAN   LF HFLFSQ  AASL FLSVSRKRKRT+ S+ LELG S         GR  
Subjt:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSNFYAN---LFTHFLFSQDFAASLPFLSVSRKRKRTNPSDHLELGSS--------HGRFD

Query:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
        HL RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFS IS QFGVSESVARFCAKQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVARFCAKQLCRVLCTNFRFWVEFP

Query:  CPNELELTSSAFEDLAGLPNCCGVVTCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVN
        CP+ELELTSSAFED+AGLPNCCGV++CT                            SIVAGFRGDKDDSTVLMS+TLFKDIE+GRLL +PPVYLHG+AVN
Subjt:  CPNELELTSSAFEDLAGLPNCCGVVTCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVN

Query:  QYLFGHGEYPLLPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSKPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD
        QYLFGHGEYPLLPWLM+PFAGAVSGSTEESFN+AHRLMCIPALKAI+SLRNWGVLS+PMHEEFKTAVAYIGACSILHNALLMREDF+AMADEWESLAS D
Subjt:  QYLFGHGEYPLLPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSKPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD

Query:  HRSQYVEDKLNEDSTNEKASIIQRALAVRARELHS
        H SQYV   LNEDS +EKAS+IQ+ALA+RARELH+
Subjt:  HRSQYVEDKLNEDSTNEKASIIQRALAVRARELHS

XP_038880641.1 protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1-like [Benincasa hispida] (e-value: 8.4e-224; identity: 93.4%)
Query:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSNFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPSDHLELGSSHGRFDHLFRTRTPDSF
        MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSNFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPSDHLELGSSHGRFDHLFRTRTPDSF
Subjt:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSNFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPSDHLELGSSHGRFDHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSA

Query:  FEDLAGLPNCCGVVTCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGEYPL
        FEDLAGLPNCCGVVTCT                            SIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGEYPL
Subjt:  FEDLAGLPNCCGVVTCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGEYPL

Query:  LPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSKPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHRSQYVEDKLN
        LPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSKPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHRSQYVEDKLN
Subjt:  LPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSKPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHRSQYVEDKLN

Query:  EDSTNEKASIIQRALAVRARELHS
        EDSTNEKASIIQRALAVRARELHS
Subjt:  EDSTNEKASIIQRALAVRARELHS
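The % identity reported for each hit summarizes alignments like the ones above. In the standard BLAST layout, the midline between the Query and Subjt rows shows a letter for an identical residue, `+` for a conservative substitution, and a blank for a mismatch or gap. The following is a rough sketch of how identity could be tallied from a single alignment block under that convention. `midline_identity` is a hypothetical helper, the input strings are assumed to have their `Query:` prefix and leading padding already removed, and the values reported by the database may be computed over the full alignment rather than per block.

```python
def midline_identity(query, midline):
    """Percent of aligned columns whose midline letter matches the query residue."""
    # Trailing blanks of a midline are often stripped; pad to the query length.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query row")
    if not query:
        raise ValueError("empty alignment block")
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return 100.0 * identical / len(query)


# Short fragments taken from the first alignment above.
print(round(midline_identity("MDSPQL", "MDSP+L"), 2))  # 83.33
print(round(midline_identity("HGRFDH", "HGR  H"), 2))  # 66.67
```

Counting only letters in the midline means that `+` columns count toward similarity but not identity, which is the usual distinction between "identities" and "positives" in BLAST output.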

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LBX6 DDE Tnp4 domain-containing protein (e-value: 2.1e-228; identity: 93.16%)
Query:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSNFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPSDHLELGSSHGRFDHLFRTRTPDSF
        MDSP+LAALLSSLISQLLLLLFLLFPSSNPHSLFSNS PDS+FYANLF HFLFSQDFAASLPFLSVSRKRKRTN SDHLELGSSHGR  HLFRTRTPDSF
Subjt:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSNFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPSDHLELGSSHGRFDHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFS IS QFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSA

Query:  FEDLAGLPNCCGVVTCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGEYPL
        FEDLAGLPNCCGVV+CTRFKIIRNSHFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTLFKDIEQGRLL++PPVYLHGVAVN+YLFGHGEYPL
Subjt:  FEDLAGLPNCCGVVTCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGEYPL

Query:  LPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSKPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHRSQYVEDKLN
        LPWL++PFAGAVSGSTEESFN+AHRLMCIPALKAIVSLRNWGVLS+P+HEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S DH+SQYVE  LN
Subjt:  LPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSKPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHRSQYVEDKLN

Query:  EDSTNEKASIIQRALAVRARELHS
         DSTNEKAS+IQRALA+RARELHS
Subjt:  EDSTNEKASIIQRALAVRARELHS

A0A1S3C2M8 uncharacterized protein LOC103496169 (e-value: 5.3e-200; identity: 92.47%)
Query:  FSQDFAASLPFLSVSRKRKRTNPSDHLELGSSHGRFDHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD
        F + FAASLPFLSVSRKRKRTNP DHLELGSSHGR  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD
Subjt:  FSQDFAASLPFLSVSRKRKRTNPSDHLELGSSHGRFDHLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCD

Query:  FSKISHQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVTCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFR
        FS IS QFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVV+CTRFKIIRNSHFYEDS+ATQLVVDSSSRILSIVAGFR
Subjt:  FSKISHQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVTCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFR

Query:  GDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGEYPLLPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSKPMHEEF
        G+KDDSTVLMSSTLFKDIEQGRLL++PPVYLHGVAVN+YLFG GEYPLLPWL++PFAGAVSGSTEESFN+AHRLMCIPALKAIVSLRNWGVLS+P+HEEF
Subjt:  GDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGEYPLLPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSKPMHEEF

Query:  KTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHRSQYVEDKLNEDSTNEKASIIQRALAVRARELHS
        KTAVAYIGACSILHNALLMREDFSAMADEWESL+S DHRSQYVE  LN DSTNEKAS+IQRALA RARELHS
Subjt:  KTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHRSQYVEDKLNEDSTNEKASIIQRALAVRARELHS

A0A5D3CRB2 Putative nuclease HARBI1 (e-value: 4.2e-229; identity: 93.63%)
Query:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSNFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPSDHLELGSSHGRFDHLFRTRTPDSF
        MDSP+LAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDS+FYANLFTHFLFSQDFAASLPFLSVSRKRKRTNP DHLELGSSHGR  HLFRTRTPDSF
Subjt:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSNFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPSDHLELGSSHGRFDHLFRTRTPDSF

Query:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSA
        RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFS IS QFGVSESVARFC+KQLCRVLCTNFRFWVEFPCPNELELTSSA
Subjt:  RNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSA

Query:  FEDLAGLPNCCGVVTCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGEYPL
        FEDLAGLPNCCGVV+CTRFKIIRNSHFYEDS+ATQLVVDSSSRILSIVAGFRG+KDDSTVLMSSTLFKDIEQGRLL++PPVYLHGVAVN+YLFG GEYPL
Subjt:  FEDLAGLPNCCGVVTCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGEYPL

Query:  LPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSKPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHRSQYVEDKLN
        LPWL++PFAGAVSGSTEESFN+AHRLMCIPALKAIVSLRNWGVLS+P+HEEFKTAVAYIGACSILHNALLMREDFSAMADEWESL+S DHRSQYVE  LN
Subjt:  LPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSKPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHRSQYVEDKLN

Query:  EDSTNEKASIIQRALAVRARELHS
         DSTNEKAS+IQRALA RARELHS
Subjt:  EDSTNEKASIIQRALAVRARELHS

A0A6J1FNZ2 protein ALP1-like (e-value: 1.4e-187; identity: 79.08%)
Query:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSNFYAN---LFTHFLFSQDFAASLPFLSVSRKRKRTNPSDHLELGSS--------HGRFD
        MDS QLAALLSSLISQLLLLL LLFPSSNPHSL SNS+ DSNFYAN   LF HFLFSQ  AASL FLSVSRKRKRT+ S+ LELG S         GR  
Subjt:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSNFYAN---LFTHFLFSQDFAASLPFLSVSRKRKRTNPSDHLELGSS--------HGRFD

Query:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
        HL RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFS IS QFGVSESVARFCAKQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVARFCAKQLCRVLCTNFRFWVEFP

Query:  CPNELELTSSAFEDLAGLPNCCGVVTCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVN
        CP+ELELTSS+FED+AGLPNCCGV++CT                            SIVAGFRGDKDDSTVLMS+TLFKDIE+ RLL +PPVYLHG+AVN
Subjt:  CPNELELTSSAFEDLAGLPNCCGVVTCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVN

Query:  QYLFGHGEYPLLPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSKPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD
        QYLFGHGEYPLLPWLM+PFAGAVSGSTEESFN+AHRLMCIPALKAI+SLRNWGVLS+PMHEEFKTAVAYIGACSILHNALLMREDF+AMADEWESLAS D
Subjt:  QYLFGHGEYPLLPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSKPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD

Query:  HRSQYVEDKLNEDSTNEKASIIQRALAVRARELHS
        H SQYV   LNEDS +EKA+++Q+ALA+RARELH+
Subjt:  HRSQYVEDKLNEDSTNEKASIIQRALAVRARELHS

A0A6J1J0M5 protein ALP1-like (e-value: 2.3e-187; identity: 79.54%)
Query:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSNFYAN---LFTHFLFSQDFAASLPFLSVSRKRKRTNPSDHLELGSS--------HGRFD
        MDS QLAALLSSLISQLLLLL LLFPSSNPHSL SNS+ DSNFYAN   LF HFLFSQ  AASL FLSVSRKRKRT+ S+ LELG S         GR  
Subjt:  MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSNFYAN---LFTHFLFSQDFAASLPFLSVSRKRKRTNPSDHLELGSS--------HGRFD

Query:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVARFCAKQLCRVLCTNFRFWVEFP
        HL RTR+PDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLS EIRLGVGL RLATGCDFS IS QFGVSESVARFCAKQLCRVLCTNFRFWVEFP
Subjt:  HLFRTRTPDSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVARFCAKQLCRVLCTNFRFWVEFP

Query:  CPNELELTSSAFEDLAGLPNCCGVVTCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVN
        CP+ELELTSSAFED+AGLPNCCGV++CT                            SIVAGFRGDKDDSTVLMS+TLFKDIE+ RLL +PPVYLHGVAVN
Subjt:  CPNELELTSSAFEDLAGLPNCCGVVTCTRFKIIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVN

Query:  QYLFGHGEYPLLPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSKPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD
        QYLFGHG+YPLLPWLM+PFAGAVSGSTEESFN+AHRLM IPALKAI+SLRNWGVLS+PMHEEFKTAVAYIGACSILHNALLMREDF+AMADEWESLAS D
Subjt:  QYLFGHGEYPLLPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRNWGVLSKPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFD

Query:  HRSQYVEDKLNEDSTNEKASIIQRALAVRARELHS
        H SQYV   LNEDS +EKAS+IQ+ALA+RARELH+
Subjt:  HRSQYVEDKLNEDSTNEKASIIQRALAVRARELHS

SwissProt top hits (e-value, % identity, alignment)
Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 (e-value: 1.8e-27; identity: 29.35%)
Query:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNEL
        +F++ FR + +TF ++  L+   L  R P G        LSVE ++ + L RLA+G     +   FGV +S       +    L    +  + +P  + +
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNEL

Query:  ELTSSAFEDLAGLPNCCGVVTCTR----FKIIRNSHFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYL-H
        E   S FE++ GLPNCCG +  T        ++ S  + D     S+  Q V D   R L++V G+ G    S +L  S  FK  E  ++LD  P  L  
Subjt:  ELTSSAFEDLAGLPNCCGVVTCTR----FKIIRNSHFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYL-H

Query:  GVAVNQYLFGHGEYPLLPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLR-NWGVLSKPM-HEEFKTAVAYIGACSILHNALLMREDF
        G  + +Y+ G   YPLLPWL+ P        +  +FN+ H  +   A  A   L+ +W +LSK M   + +   + I  C +LHN ++   D+
Subjt:  GVAVNQYLFGHGEYPLLPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLR-NWGVLSKPM-HEEFKTAVAYIGACSILHNALLMREDF

Q9M2U3 Protein ALP1-like (e-value: 2.8e-28; identity: 28.71%)
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVA-----RFCAKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+P  LS+  R+ V L RL +G   S I   FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVA-----RFCAKQLCRVLCTNFRFWV

Query:  EFPCPNELELTSSAFEDLAGLPNCCGVVTCTRFKIIRNSHFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGR
            P++L+   S FE ++GLPNCCG +  T   I+ N    E             S+  Q VVD   R L ++AG+ G  +D  VL +S  +K +E+G+
Subjt:  EFPCPNELELTSSAFEDLAGLPNCCGVVTCTRFKIIRNSHFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGR

Query:  LLDAPPVYL-HGVAVNQYLFGHGEYPLLPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRN-WGVLSKPMHEEFKTAV-AYIGACSILHNALLM
         L+   + L     + +Y+ G   +PLLPWL+ P+ G  +   +  FNK H      A  A+  L++ W +++  M    +  +   I  C +LHN ++ 
Subjt:  LLDAPPVYL-HGVAVNQYLFGHGEYPLLPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRN-WGVLSKPMHEEFKTAV-AYIGACSILHNALLM

Query:  RED
         ED
Subjt:  RED

Arabidopsis top hits (e-value, % identity, alignment)
AT3G19120.1 PIF / Ping-Pong family of plant transposases (e-value: 5.0e-17; identity: 23.47%)
Query:  SLISQLLLLLFLLFPSSNPHSLFSNSTPDSNFYANLFTHF----LFSQDFAASLPFLSVSRKRKRTNPSDHLELGS-----SHGRF----------DHLF
        +++S LL L   L P+S   S  S S+  S   ++L +      L     A+ L FL+V+R    ++ S      S     + G +          DH++
Subjt:  SLISQLLLLLFLLFPSSNPHSLFSNSTPDSNFYANLFTHF----LFSQDFAASLPFLSVSRKRKRTNPSDHLELGS-----SHGRF----------DHLF

Query:  RTRTP---DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVARFCAKQLCRVLCTN-FRFWVEF
            P     +R+ + ++   F  +   L+P +       S L L  +  + + L RLA GC    ++ ++ +   +       + R+L T  +  +++ 
Subjt:  RTRTP---DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVARFCAKQLCRVLCTN-FRFWVEF

Query:  PC-PNELELTSSAFEDLAGLPNCCGVVTCTRFKIIRNS----------HFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLD
        P     L  T+  FE+L  LPN CG +  T  K+ R +           +  D++  Q+V D       +     G +DDS+    S L+K +  G ++ 
Subjt:  PC-PNELELTSSAFEDLAGLPNCCGVVTCTRFKIIRNS----------HFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLD

Query:  APPVYLHGVAVNQYLFGHGEYPLLPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSL--RNWGVLSKPMHEEFKTAVAYIGACSILHN
           + + G  V  Y+ G   YPLL +LM PF+   SG+  E+      +     +   + L    W +L + ++     A   I AC +LHN
Subjt:  APPVYLHGVAVNQYLFGHGEYPLLPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSL--RNWGVLSKPMHEEFKTAVAYIGACSILHN

AT3G55350.1 PIF / Ping-Pong family of plant transposases (e-value: 2.0e-29; identity: 28.71%)
Query:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVA-----RFCAKQLCRVLCTNFRFWV
        P +F + F+++  TF+++  L++     +     D  G+P  LS+  R+ V L RL +G   S I   FG+++S       RF      R +  +   W 
Subjt:  PDSFRNHFRMTSSTFEWLSGLLEPLLECR-----DPVGSPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVA-----RFCAKQLCRVLCTNFRFWV

Query:  EFPCPNELELTSSAFEDLAGLPNCCGVVTCTRFKIIRNSHFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGR
            P++L+   S FE ++GLPNCCG +  T   I+ N    E             S+  Q VVD   R L ++AG+ G  +D  VL +S  +K +E+G+
Subjt:  EFPCPNELELTSSAFEDLAGLPNCCGVVTCTRFKIIRNSHFYED------------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGR

Query:  LLDAPPVYL-HGVAVNQYLFGHGEYPLLPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRN-WGVLSKPMHEEFKTAV-AYIGACSILHNALLM
         L+   + L     + +Y+ G   +PLLPWL+ P+ G  +   +  FNK H      A  A+  L++ W +++  M    +  +   I  C +LHN ++ 
Subjt:  LLDAPPVYL-HGVAVNQYLFGHGEYPLLPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLRN-WGVLSKPMHEEFKTAV-AYIGACSILHNALLM

Query:  RED
         ED
Subjt:  RED

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912) (e-value: 1.3e-28; identity: 29.35%)
Query:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNEL
        +F++ FR + +TF ++  L+   L  R P G        LSVE ++ + L RLA+G     +   FGV +S       +    L    +  + +P  + +
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLECRDPVG----SPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNEL

Query:  ELTSSAFEDLAGLPNCCGVVTCTR----FKIIRNSHFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYL-H
        E   S FE++ GLPNCCG +  T        ++ S  + D     S+  Q V D   R L++V G+ G    S +L  S  FK  E  ++LD  P  L  
Subjt:  ELTSSAFEDLAGLPNCCGVVTCTR----FKIIRNSHFYED-----SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYL-H

Query:  GVAVNQYLFGHGEYPLLPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLR-NWGVLSKPM-HEEFKTAVAYIGACSILHNALLMREDF
        G  + +Y+ G   YPLLPWL+ P        +  +FN+ H  +   A  A   L+ +W +LSK M   + +   + I  C +LHN ++   D+
Subjt:  GVAVNQYLFGHGEYPLLPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLR-NWGVLSKPM-HEEFKTAVAYIGACSILHNALLMREDF

AT4G29780.1 unknown protein (e-value: 1.1e-19; identity: 26.11%)
Query:  FDHLFRTRTP-DSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVARFCAKQLCR----V
        +D + R   P D FR  FRM+ STF  +   L+  +       RD + +P       R+GV ++RLATG     +S +FG+  S       ++CR    V
Subjt:  FDHLFRTRTP-DSFRNHFRMTSSTFEWLSGLLEPLLE-----CRDPVGSPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVARFCAKQLCR----V

Query:  LCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVTCTRFKIIR---------NSHFYED------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLM
        L   +  W   P  +E+  T + FE +  +PN  G +  T   II          N    E       SI  Q VV++      +  G  G   D  +L 
Subjt:  LCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVTCTRFKIIR---------NSHFYED------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLM

Query:  SSTLFKD-IEQGRLLDAPPVYLHGVAVNQYLFGHGEYPLLPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLR-NWGVLSKPMHEEFKTAVAYIG
         S+L +    +G L D+            ++ G+  +PL  +L++P+       T+ +FN++   +   A  A   L+  W  L K    + +     +G
Subjt:  SSTLFKD-IEQGRLLDAPPVYLHGVAVNQYLFGHGEYPLLPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLR-NWGVLSKPMHEEFKTAVAYIG

Query:  ACSILHNALLMRED
        AC +LHN   MR++
Subjt:  ACSILHNALLMRED

AT5G12010.1 unknown protein (e-value: 8.3e-20; identity: 25.93%)
Query:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVARFCAKQLCR----VLCTNFRFWVEFPCPNE
        + F+  FRM+ STFE +   L   +  ++       + V  R+ V ++RLATG     +S +FG+  S       ++C+    VL   +  W   P    
Subjt:  DSFRNHFRMTSSTFEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVARFCAKQLCR----VLCTNFRFWVEFPCPNE

Query:  LELTSSAFEDLAGLPNCCGVVTCTRFKIIR-----NSHFYED----------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDA
        L      FE ++G+PN  G +  T   II       S+F +           SI  Q VV+       +  G+ G   D  VL  S L++    G LL  
Subjt:  LELTSSAFEDLAGLPNCCGVVTCTRFKIIR-----NSHFYED----------SIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDA

Query:  PPVYLHGVAVNQYLFGHGEYPLLPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLR-NWGVLSKPMHEEFKTAVAYIGACSILHNALLMRED
                    ++ G   +PLL W+++P+       T+ +FN+    +   A +A   L+  W  L K    + +     +GAC +LHN   MRE+
Subjt:  PPVYLHGVAVNQYLFGHGEYPLLPWLMLPFAGAVSGSTEESFNKAHRLMCIPALKAIVSLR-NWGVLSKPMHEEFKTAVAYIGACSILHNALLMRED


Sequences
CDS sequence
ATGGATTCCCCTCAATTGGCTGCTTTACTCTCTTCTTTGATCTCTCAACTCCTCCTCCTTCTCTTCCTCCTCTTCCCTTCCTCCAACCCACATTCCCTTTTCTCCAATTC
CACTCCCGATTCCAATTTCTATGCCAATCTATTCACCCACTTCCTCTTTTCCCAGGATTTTGCCGCTTCCCTTCCCTTTCTCTCTGTTTCCCGCAAGAGGAAGAGAACCA
ATCCCTCCGACCACCTCGAATTGGGCTCATCCCATGGTCGATTTGATCATTTGTTTCGGACTCGGACTCCTGATTCTTTCAGAAATCACTTCAGAATGACCTCCTCAACG
TTCGAATGGCTCTCTGGTTTGCTTGAGCCCCTTCTCGAGTGTCGTGACCCGGTGGGTTCGCCTCTTGATCTCTCCGTTGAGATTCGACTCGGTGTTGGCCTGTATCGCCT
CGCCACCGGCTGCGATTTCTCCAAAATCTCCCACCAATTTGGCGTCTCGGAATCGGTAGCGAGGTTCTGTGCTAAACAATTGTGTCGAGTTCTCTGTACTAATTTTCGCT
TCTGGGTTGAATTCCCTTGCCCCAATGAGCTCGAATTAACGTCCTCGGCTTTTGAAGATCTTGCTGGGCTTCCGAATTGCTGTGGCGTGGTTACTTGTACAAGGTTCAAG
ATCATTAGAAATAGCCATTTTTATGAAGATAGCATCGCTACTCAACTTGTTGTTGATTCCTCGTCACGAATCCTTAGTATTGTTGCAGGATTTCGTGGTGATAAGGACGA
TTCCACGGTGCTTATGTCTTCGACGCTGTTTAAAGACATCGAACAAGGAAGGCTTCTGGATGCTCCTCCAGTTTACCTTCATGGGGTGGCTGTCAATCAGTACTTGTTTG
GACATGGTGAATACCCTTTGCTTCCATGGTTAATGCTGCCTTTTGCAGGTGCTGTTTCAGGGTCAACTGAAGAGAGTTTCAATAAAGCTCACCGATTGATGTGCATTCCA
GCTCTGAAAGCAATTGTTAGTTTGAGAAATTGGGGAGTTTTGAGCAAACCAATGCATGAGGAGTTCAAAACTGCAGTTGCTTATATTGGTGCATGCTCAATTCTTCATAA
TGCTTTGTTGATGAGGGAGGACTTTTCTGCCATGGCTGATGAGTGGGAGAGCTTAGCTTCATTTGATCATAGATCTCAGTATGTTGAAGATAAATTGAATGAGGATTCAA
CTAATGAGAAGGCTTCTATTATACAGAGGGCGCTGGCTGTGAGAGCTAGAGAGCTTCACAGTTAA
mRNA sequence
TGTAACTCGTATACTCACATTGAGAGAGGAAAAAAAAAAAAAAAAAACTCATAAACCCTAACTCCAAAGGAAACGTAAGAGAAAGATACCAAATAAAGGAGCAAGTTGAC
AGGCACATGAAACCAAAAAGATCGCGGAGATTGGCGAGCTGGTCTGACCAGGCAGGTGTTGGTATGCCGTAATTTCTACAAGATTCAACATCTCCCAGGTCCGGCGCTGA
CAAAAAAAAAACTGAAAACCCATCTCCTTCGCAGCCGCTCGATTCCCAAGAGAATCGAAAGTCCCAATTCTCAACTACTGCAACTGCAAAACTGACACAAACCCATTAAT
GGATTCCCCTCAATTGGCTGCTTTACTCTCTTCTTTGATCTCTCAACTCCTCCTCCTTCTCTTCCTCCTCTTCCCTTCCTCCAACCCACATTCCCTTTTCTCCAATTCCA
CTCCCGATTCCAATTTCTATGCCAATCTATTCACCCACTTCCTCTTTTCCCAGGATTTTGCCGCTTCCCTTCCCTTTCTCTCTGTTTCCCGCAAGAGGAAGAGAACCAAT
CCCTCCGACCACCTCGAATTGGGCTCATCCCATGGTCGATTTGATCATTTGTTTCGGACTCGGACTCCTGATTCTTTCAGAAATCACTTCAGAATGACCTCCTCAACGTT
CGAATGGCTCTCTGGTTTGCTTGAGCCCCTTCTCGAGTGTCGTGACCCGGTGGGTTCGCCTCTTGATCTCTCCGTTGAGATTCGACTCGGTGTTGGCCTGTATCGCCTCG
CCACCGGCTGCGATTTCTCCAAAATCTCCCACCAATTTGGCGTCTCGGAATCGGTAGCGAGGTTCTGTGCTAAACAATTGTGTCGAGTTCTCTGTACTAATTTTCGCTTC
TGGGTTGAATTCCCTTGCCCCAATGAGCTCGAATTAACGTCCTCGGCTTTTGAAGATCTTGCTGGGCTTCCGAATTGCTGTGGCGTGGTTACTTGTACAAGGTTCAAGAT
CATTAGAAATAGCCATTTTTATGAAGATAGCATCGCTACTCAACTTGTTGTTGATTCCTCGTCACGAATCCTTAGTATTGTTGCAGGATTTCGTGGTGATAAGGACGATT
CCACGGTGCTTATGTCTTCGACGCTGTTTAAAGACATCGAACAAGGAAGGCTTCTGGATGCTCCTCCAGTTTACCTTCATGGGGTGGCTGTCAATCAGTACTTGTTTGGA
CATGGTGAATACCCTTTGCTTCCATGGTTAATGCTGCCTTTTGCAGGTGCTGTTTCAGGGTCAACTGAAGAGAGTTTCAATAAAGCTCACCGATTGATGTGCATTCCAGC
TCTGAAAGCAATTGTTAGTTTGAGAAATTGGGGAGTTTTGAGCAAACCAATGCATGAGGAGTTCAAAACTGCAGTTGCTTATATTGGTGCATGCTCAATTCTTCATAATG
CTTTGTTGATGAGGGAGGACTTTTCTGCCATGGCTGATGAGTGGGAGAGCTTAGCTTCATTTGATCATAGATCTCAGTATGTTGAAGATAAATTGAATGAGGATTCAACT
AATGAGAAGGCTTCTATTATACAGAGGGCGCTGGCTGTGAGAGCTAGAGAGCTTCACAGTTAAAATTTCAATCACAAGAATGCAGTTGTTTGCTGAAATAAGTACAGCTC
AATTGAAGGAGATTTCCATCTCTTAGGATATTTATTGAAGGCAGCTCATCCAGCTCCATTCAACAATTACTATTAATTGCCTACTTTTCAGTCATCTTGCTGTTAAAAGC
TCTATGAAAGCTTAAATTTCATTGCTGCCTGCCCCATGGTTAATTTGGCAGACACCACACACCACATCCTGCTGTGGGTTAAAGTTAGACTAGTTTCTTTTTTCACCATT
TGAGATCATTGAATTCTCAACTCTTAACTATCACTGTGTCTTCTGGTTGGACTATGAAGGTAAACATTTCCGGCCTATTCGGAATGACTTTCCAAGTGCTCAAAAAAAGA
TGTTAAAATGCAGCAAAATCGTTTGTAAGCACATG
Protein sequence
MDSPQLAALLSSLISQLLLLLFLLFPSSNPHSLFSNSTPDSNFYANLFTHFLFSQDFAASLPFLSVSRKRKRTNPSDHLELGSSHGRFDHLFRTRTPDSFRNHFRMTSST
FEWLSGLLEPLLECRDPVGSPLDLSVEIRLGVGLYRLATGCDFSKISHQFGVSESVARFCAKQLCRVLCTNFRFWVEFPCPNELELTSSAFEDLAGLPNCCGVVTCTRFK
IIRNSHFYEDSIATQLVVDSSSRILSIVAGFRGDKDDSTVLMSSTLFKDIEQGRLLDAPPVYLHGVAVNQYLFGHGEYPLLPWLMLPFAGAVSGSTEESFNKAHRLMCIP
ALKAIVSLRNWGVLSKPMHEEFKTAVAYIGACSILHNALLMREDFSAMADEWESLASFDHRSQYVEDKLNEDSTNEKASIIQRALAVRARELHS
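
The protein sequence above should correspond to a translation of the CDS listed earlier. A minimal sketch of that check follows, assuming the standard genetic code (NCBI translation table 1). `translate` is a hypothetical helper written for illustration, and the example input is only the first six codons of the CDS followed by the TAA stop codon that ends it.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGGATTCCCCTCAATTGTAA"))  # MDSPQL
```

Passing the full CDS with its line breaks removed should reproduce the protein sequence shown above, provided the standard-code assumption holds for this gene.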