; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0001279 (gene) of Chayote v1 genome

Gene IDSed0001279
OrganismSechium edule (Chayote v1)
DescriptionDDE Tnp4 domain-containing protein
Genome locationLG13:3316135..3317701
RNA-Seq ExpressionSed0001279
SyntenySed0001279
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037135.1 putative nuclease HARBI1 [Cucumis melo var. makuwa]1.5e-18879.02Show/hide
Query:  MDSRQLAALLSSLTSQLLLLLFLL---SDPHSLFPNSIPNSNFYANLFNHFLFSDQIAATL----LSRKRKRPNSPDILDLQPSTGGGEGGGGGDHWFRT
        MDS +LAALLSSL SQLLLLLFLL   S+PHSLF NS P+S+FYANLF HFLFS   AA+L    +SRKRKR N PD L+L      G   G   H FRT
Subjt:  MDSRQLAALLSSLTSQLLLLLFLL---SDPHSLFPNSIPNSNFYANLFNHFLFSDQIAATL----LSRKRKRPNSPDILDLQPSTGGGEGGGGGDHWFRT

Query:  RSLDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTNFRFWVEFPCHNEL
        R+ DSFRNHFRMTSSTFEWLSGLLEPLL+CRDPVGSPLDLS E+RLG+GL RLATGCDFSTISD+FGVSESVARFC+KQLCRVLCTNFRFWVEFPC NEL
Subjt:  RSLDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTNFRFWVEFPCHNEL

Query:  ESTSSGFEDIAGLPNCCGVISCTRFKIVRNSHVCEESIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPVYLHGVAVNQYLFG
        E TSS FED+AGLPNCCGV+SCTRFKI+RNSH  E+S+A QLVVDSSSRILSIVAGFRG KDDSTVLMSSTLFKD+E+GR+L+SPPVYLHGVAVN+YLFG
Subjt:  ESTSSGFEDIAGLPNCCGVISCTRFKIVRNSHVCEESIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPVYLHGVAVNQYLFG

Query:  HGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLRNWGILSQSMHEEFKTAVAYIGACSILHNALLMRDDFSAMADEWESLASLNH----
         GEYPLLPWL+VPF GAVSGS EESFNEAH+LMCIPALKAIVSLRNWG+LSQ +HEEFKTAVAYIGACSILHNALLMR+DFSAMADEWESL+SL+H    
Subjt:  HGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLRNWGILSQSMHEEFKTAVAYIGACSILHNALLMRDDFSAMADEWESLASLNH----

Query:  IGVGLNEDFSDEKASLIQGALALRAKELH
        +  GLN D ++EKAS+IQ ALA RA+ELH
Subjt:  IGVGLNEDFSDEKASLIQGALALRAKELH

KAG6600319.1 Protein ALP1-like protein, partial [Cucurbita argyrosperma subsp. sororia]1.6e-17976.91Show/hide
Query:  MDSRQLAALLSSLTSQLLLLLFLL---SDPHSLFPNSIPNSNFYAN---LFNHFLFSDQIAATL----LSRKRKRPNSPDILDLQPSTGGGEGGGGGD-H
        MDSRQLAALLSSL SQLLLLL LL   S+PHSL  NS  +SNFYAN   LFNHFLFS QIAA+L    +SRKRKR +S ++L+L PS  GGE GG G  H
Subjt:  MDSRQLAALLSSLTSQLLLLLFLL---SDPHSLFPNSIPNSNFYAN---LFNHFLFSDQIAATL----LSRKRKRPNSPDILDLQPSTGGGEGGGGGD-H

Query:  WFRTRSLDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
          RTRS DSFRNHFRMTSSTFEWLSGLLEPLL+CRDPVGSPLDLSAE+RLG+GLSRLATGCDFSTISD+FGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  WFRTRSLDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  HNELESTSSGFEDIAGLPNCCGVISCTRFKIVRNSHVCEESIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPVYLHGVAVNQ
         +ELE TSS FEDIAGLPNCCGVISCT                            SIVAGFRG KDDSTVLMS+TLFKD+EEGR+L SPPVYLHG+AVNQ
Subjt:  HNELESTSSGFEDIAGLPNCCGVISCTRFKIVRNSHVCEESIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPVYLHGVAVNQ

Query:  YLFGHGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLRNWGILSQSMHEEFKTAVAYIGACSILHNALLMRDDFSAMADEWESLASLNH
        YLFGHGEYPLLPWLMVPF GAVSGS EESFNEAH+LMCIPALKAI+SLRNWG+LSQ MHEEFKTAVAYIGACSILHNALLMR+DF+AMADEWESLASL+H
Subjt:  YLFGHGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLRNWGILSQSMHEEFKTAVAYIGACSILHNALLMRDDFSAMADEWESLASLNH

Query:  ----IGVGLNEDFSDEKASLIQGALALRAKELH
            +G+GLNED  DEKA +IQ ALALRA+ELH
Subjt:  ----IGVGLNEDFSDEKASLIQGALALRAKELH

KAG7030976.1 Protein ALP1-like protein [Cucurbita argyrosperma subsp. argyrosperma]8.1e-17976.67Show/hide
Query:  MDSRQLAALLSSLTSQLLLLLFLL---SDPHSLFPNSIPNSNFYAN---LFNHFLFSDQIAATL----LSRKRKRPNSPDILDLQPSTGGGEGGGGGD-H
        MDSRQLAALLSSL SQLLLLL LL   S+PHSL  NS  +SNFYAN   LFNHFLFS QIAA+L    +SRKRKR +S ++L+L PS  GGE GG G  H
Subjt:  MDSRQLAALLSSLTSQLLLLLFLL---SDPHSLFPNSIPNSNFYAN---LFNHFLFSDQIAATL----LSRKRKRPNSPDILDLQPSTGGGEGGGGGD-H

Query:  WFRTRSLDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
          RTRS DSFRNHFRMTSSTFEWLSGLLEPLL+CRDPVGSPLDLSAE+RLG+GLSRLATGCDFSTISD+FGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  WFRTRSLDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  HNELESTSSGFEDIAGLPNCCGVISCTRFKIVRNSHVCEESIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPVYLHGVAVNQ
         +ELE TSS FEDIAGLPNCCGVISCT                            SIVAGFRG KDDSTVLMS+ LFKD+EEGR+L SPPVYLHG+AVNQ
Subjt:  HNELESTSSGFEDIAGLPNCCGVISCTRFKIVRNSHVCEESIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPVYLHGVAVNQ

Query:  YLFGHGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLRNWGILSQSMHEEFKTAVAYIGACSILHNALLMRDDFSAMADEWESLASLNH
        YLFGHGEYPLLPWLMVPF GAVSGS EESFNEAH+LMCIPALKAI+SLRNWG+LSQ MHEEFKTAVAYIGACSILHNALLMR+DF+AMADEWESLASL+H
Subjt:  YLFGHGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLRNWGILSQSMHEEFKTAVAYIGACSILHNALLMRDDFSAMADEWESLASLNH

Query:  ----IGVGLNEDFSDEKASLIQGALALRAKELH
            +G+GLNED  DEKA +IQ ALALRA+ELH
Subjt:  ----IGVGLNEDFSDEKASLIQGALALRAKELH

KGN57516.1 hypothetical protein Csa_011580 [Cucumis sativus]5.1e-18979.25Show/hide
Query:  MDSRQLAALLSSLTSQLLLLLFLL---SDPHSLFPNSIPNSNFYANLFNHFLFSDQIAATL----LSRKRKRPNSPDILDLQPSTGGGEGGGGGDHWFRT
        MDS +LAALLSSL SQLLLLLFLL   S+PHSLF NS P+S+FYANLF HFLFS   AA+L    +SRKRKR N  D L+L      G   G   H FRT
Subjt:  MDSRQLAALLSSLTSQLLLLLFLL---SDPHSLFPNSIPNSNFYANLFNHFLFSDQIAATL----LSRKRKRPNSPDILDLQPSTGGGEGGGGGDHWFRT

Query:  RSLDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTNFRFWVEFPCHNEL
        R+ DSFRNHFRMTSSTFEWLSGLLEPLL+CRDPVGSPLDLS E+RLG+GL RLATGCDFSTISD+FGVSESVARFC+KQLCRVLCTNFRFWVEFPC NEL
Subjt:  RSLDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTNFRFWVEFPCHNEL

Query:  ESTSSGFEDIAGLPNCCGVISCTRFKIVRNSHVCEESIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPVYLHGVAVNQYLFG
        E TSS FED+AGLPNCCGV+SCTRFKI+RNSH  E+S+A QLVVDSSSRILSIVAGFRG KDDSTVLMSSTLFKD+E+GR+L+SPPVYLHGVAVN+YLFG
Subjt:  ESTSSGFEDIAGLPNCCGVISCTRFKIVRNSHVCEESIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPVYLHGVAVNQYLFG

Query:  HGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLRNWGILSQSMHEEFKTAVAYIGACSILHNALLMRDDFSAMADEWESLASLNH----
        HGEYPLLPWL+VPF GAVSGS EESFNEAH+LMCIPALKAIVSLRNWG+LSQ +HEEFKTAVAYIGACSILHNALLMR+DFSAMADEWESL+SL+H    
Subjt:  HGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLRNWGILSQSMHEEFKTAVAYIGACSILHNALLMRDDFSAMADEWESLASLNH----

Query:  IGVGLNEDFSDEKASLIQGALALRAKELH
        +  GLN D ++EKAS+IQ ALALRA+ELH
Subjt:  IGVGLNEDFSDEKASLIQGALALRAKELH

XP_023536005.1 protein ALP1-like [Cucurbita pepo subsp. pepo]4.3e-18077.14Show/hide
Query:  MDSRQLAALLSSLTSQLLLLLFLL---SDPHSLFPNSIPNSNFYAN---LFNHFLFSDQIAATL----LSRKRKRPNSPDILDLQPSTGGGEGGGGGD-H
        MDSRQLAALLSSL SQLLLLL LL   S+PHSL  NS  +SNFYAN   LFNHFLFS QIAA+L    +SRKRKR +S ++L+L PS  GGE GG G  H
Subjt:  MDSRQLAALLSSLTSQLLLLLFLL---SDPHSLFPNSIPNSNFYAN---LFNHFLFSDQIAATL----LSRKRKRPNSPDILDLQPSTGGGEGGGGGD-H

Query:  WFRTRSLDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
          RTRS DSFRNHFRMTSSTFEWLSGLLEPLL+CRDPVGSPLDLSAE+RLG+GLSRLATGCDFSTISD+FGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  WFRTRSLDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  HNELESTSSGFEDIAGLPNCCGVISCTRFKIVRNSHVCEESIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPVYLHGVAVNQ
         +ELE TSS FEDIAGLPNCCGVISCT                            SIVAGFRG KDDSTVLMS+TLFKD+EEGR+L SPPVYLHG+AVNQ
Subjt:  HNELESTSSGFEDIAGLPNCCGVISCTRFKIVRNSHVCEESIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPVYLHGVAVNQ

Query:  YLFGHGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLRNWGILSQSMHEEFKTAVAYIGACSILHNALLMRDDFSAMADEWESLASLNH
        YLFGHGEYPLLPWLMVPF GAVSGS EESFNEAH+LMCIPALKAI+SLRNWG+LSQ MHEEFKTAVAYIGACSILHNALLMR+DF+AMADEWESLASL+H
Subjt:  YLFGHGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLRNWGILSQSMHEEFKTAVAYIGACSILHNALLMRDDFSAMADEWESLASLNH

Query:  ----IGVGLNEDFSDEKASLIQGALALRAKELH
            +G+GLNED  DEKAS+IQ ALALRA+ELH
Subjt:  ----IGVGLNEDFSDEKASLIQGALALRAKELH

TrEMBL top hitse value%identityAlignment
A0A0A0LBX6 DDE Tnp4 domain-containing protein2.5e-18979.25Show/hide
Query:  MDSRQLAALLSSLTSQLLLLLFLL---SDPHSLFPNSIPNSNFYANLFNHFLFSDQIAATL----LSRKRKRPNSPDILDLQPSTGGGEGGGGGDHWFRT
        MDS +LAALLSSL SQLLLLLFLL   S+PHSLF NS P+S+FYANLF HFLFS   AA+L    +SRKRKR N  D L+L      G   G   H FRT
Subjt:  MDSRQLAALLSSLTSQLLLLLFLL---SDPHSLFPNSIPNSNFYANLFNHFLFSDQIAATL----LSRKRKRPNSPDILDLQPSTGGGEGGGGGDHWFRT

Query:  RSLDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTNFRFWVEFPCHNEL
        R+ DSFRNHFRMTSSTFEWLSGLLEPLL+CRDPVGSPLDLS E+RLG+GL RLATGCDFSTISD+FGVSESVARFC+KQLCRVLCTNFRFWVEFPC NEL
Subjt:  RSLDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTNFRFWVEFPCHNEL

Query:  ESTSSGFEDIAGLPNCCGVISCTRFKIVRNSHVCEESIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPVYLHGVAVNQYLFG
        E TSS FED+AGLPNCCGV+SCTRFKI+RNSH  E+S+A QLVVDSSSRILSIVAGFRG KDDSTVLMSSTLFKD+E+GR+L+SPPVYLHGVAVN+YLFG
Subjt:  ESTSSGFEDIAGLPNCCGVISCTRFKIVRNSHVCEESIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPVYLHGVAVNQYLFG

Query:  HGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLRNWGILSQSMHEEFKTAVAYIGACSILHNALLMRDDFSAMADEWESLASLNH----
        HGEYPLLPWL+VPF GAVSGS EESFNEAH+LMCIPALKAIVSLRNWG+LSQ +HEEFKTAVAYIGACSILHNALLMR+DFSAMADEWESL+SL+H    
Subjt:  HGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLRNWGILSQSMHEEFKTAVAYIGACSILHNALLMRDDFSAMADEWESLASLNH----

Query:  IGVGLNEDFSDEKASLIQGALALRAKELH
        +  GLN D ++EKAS+IQ ALALRA+ELH
Subjt:  IGVGLNEDFSDEKASLIQGALALRAKELH

A0A5D3CRB2 Putative nuclease HARBI17.2e-18979.02Show/hide
Query:  MDSRQLAALLSSLTSQLLLLLFLL---SDPHSLFPNSIPNSNFYANLFNHFLFSDQIAATL----LSRKRKRPNSPDILDLQPSTGGGEGGGGGDHWFRT
        MDS +LAALLSSL SQLLLLLFLL   S+PHSLF NS P+S+FYANLF HFLFS   AA+L    +SRKRKR N PD L+L      G   G   H FRT
Subjt:  MDSRQLAALLSSLTSQLLLLLFLL---SDPHSLFPNSIPNSNFYANLFNHFLFSDQIAATL----LSRKRKRPNSPDILDLQPSTGGGEGGGGGDHWFRT

Query:  RSLDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTNFRFWVEFPCHNEL
        R+ DSFRNHFRMTSSTFEWLSGLLEPLL+CRDPVGSPLDLS E+RLG+GL RLATGCDFSTISD+FGVSESVARFC+KQLCRVLCTNFRFWVEFPC NEL
Subjt:  RSLDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTNFRFWVEFPCHNEL

Query:  ESTSSGFEDIAGLPNCCGVISCTRFKIVRNSHVCEESIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPVYLHGVAVNQYLFG
        E TSS FED+AGLPNCCGV+SCTRFKI+RNSH  E+S+A QLVVDSSSRILSIVAGFRG KDDSTVLMSSTLFKD+E+GR+L+SPPVYLHGVAVN+YLFG
Subjt:  ESTSSGFEDIAGLPNCCGVISCTRFKIVRNSHVCEESIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPVYLHGVAVNQYLFG

Query:  HGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLRNWGILSQSMHEEFKTAVAYIGACSILHNALLMRDDFSAMADEWESLASLNH----
         GEYPLLPWL+VPF GAVSGS EESFNEAH+LMCIPALKAIVSLRNWG+LSQ +HEEFKTAVAYIGACSILHNALLMR+DFSAMADEWESL+SL+H    
Subjt:  HGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLRNWGILSQSMHEEFKTAVAYIGACSILHNALLMRDDFSAMADEWESLASLNH----

Query:  IGVGLNEDFSDEKASLIQGALALRAKELH
        +  GLN D ++EKAS+IQ ALA RA+ELH
Subjt:  IGVGLNEDFSDEKASLIQGALALRAKELH

A0A6J1D7F1 protein ALP1-like1.4e-17374.31Show/hide
Query:  MDSRQLAALLSSLTSQLLLLLFLL---SDPHSLFPNSIPNSNFYANLF---NHFLFSDQIAATL----LSRKRKRPNSPDILDLQPSTGGGEGGGGGDHW
        MDSR+LAAL+SSL SQLLL LFLL   S+PHSL  N   +S+FYAN F    HFLFS +IA++L    +SRKRKR + P+ L+L+PS GGG GG G  H 
Subjt:  MDSRQLAALLSSLTSQLLLLLFLL---SDPHSLFPNSIPNSNFYANLF---NHFLFSDQIAATL----LSRKRKRPNSPDILDLQPSTGGGEGGGGGDHW

Query:  FRTRSLDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTNFRFWVEFPCH
          TR  DSFRNHFRMTSSTFEWLSGLLEPLL+CRDPVGSPL+LSAE+RLG+GLSRLATGCDFSTIS++FGVSESVARFCAKQLCRVLCTNFRFWVEFPC 
Subjt:  FRTRSLDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTNFRFWVEFPCH

Query:  NELESTSSGFEDIAGLPNCCGVISCTRFKIVRNSHVCEESIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPVYLHGVAVNQY
        NELESTSS FE +AGLPNCCGV++CT                            SIVAGFRG KDDSTVLMSSTLFKD+EEGR+LDSPPVYLHG+AVNQY
Subjt:  NELESTSSGFEDIAGLPNCCGVISCTRFKIVRNSHVCEESIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPVYLHGVAVNQY

Query:  LFGHGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLRNWGILSQSMHEEFKTAVAYIGACSILHNALLMRDDFSAMADEWESLASLNH-
         FGHGEYPLLPWLMVPF GAVSGS EESFN+AH+LMCIPALKAIVSLRNWG+LSQ M EEFKTAVAYIGACSILHNALLMR+DFSAMADEWE LASL+H 
Subjt:  LFGHGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLRNWGILSQSMHEEFKTAVAYIGACSILHNALLMRDDFSAMADEWESLASLNH-

Query:  ---IGVGLNEDFSDEKASLIQGALALRAKELH
           IG GLNED +DEKAS+IQ ALALRA+ELH
Subjt:  ---IGVGLNEDFSDEKASLIQGALALRAKELH

A0A6J1FNZ2 protein ALP1-like6.7e-17976.44Show/hide
Query:  MDSRQLAALLSSLTSQLLLLLFLL---SDPHSLFPNSIPNSNFYAN---LFNHFLFSDQIAATL----LSRKRKRPNSPDILDLQPSTGGGEGGGGGD-H
        MDSRQLAALLSSL SQLLLLL LL   S+PHSL  NS  +SNFYAN   LFNHFLFS QIAA+L    +SRKRKR +S ++L+L PS  GGE GG G  H
Subjt:  MDSRQLAALLSSLTSQLLLLLFLL---SDPHSLFPNSIPNSNFYAN---LFNHFLFSDQIAATL----LSRKRKRPNSPDILDLQPSTGGGEGGGGGD-H

Query:  WFRTRSLDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
          RTRS DSFRNHFRMTSSTFEWLSGLLEPLL+CRDPVGSPLDLSAE+RLG+GLSRLATGCDFSTISD+FGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  WFRTRSLDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  HNELESTSSGFEDIAGLPNCCGVISCTRFKIVRNSHVCEESIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPVYLHGVAVNQ
         +ELE TSS FEDIAGLPNCCGVISCT                            SIVAGFRG KDDSTVLMS+TLFKD+EE R+L SPPVYLHG+AVNQ
Subjt:  HNELESTSSGFEDIAGLPNCCGVISCTRFKIVRNSHVCEESIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPVYLHGVAVNQ

Query:  YLFGHGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLRNWGILSQSMHEEFKTAVAYIGACSILHNALLMRDDFSAMADEWESLASLNH
        YLFGHGEYPLLPWLMVPF GAVSGS EESFNEAH+LMCIPALKAI+SLRNWG+LSQ MHEEFKTAVAYIGACSILHNALLMR+DF+AMADEWESLASL+H
Subjt:  YLFGHGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLRNWGILSQSMHEEFKTAVAYIGACSILHNALLMRDDFSAMADEWESLASLNH

Query:  ----IGVGLNEDFSDEKASLIQGALALRAKELH
            +G+GLNED  DEKA+++Q ALALRA+ELH
Subjt:  ----IGVGLNEDFSDEKASLIQGALALRAKELH

A0A6J1J0M5 protein ALP1-like2.6e-17876.67Show/hide
Query:  MDSRQLAALLSSLTSQLLLLLFLL---SDPHSLFPNSIPNSNFYAN---LFNHFLFSDQIAATL----LSRKRKRPNSPDILDLQPSTGGGEGGGGGD-H
        MDSRQLAALLSSL SQLLLLL LL   S+PHSL  NS  +SNFYAN   LFNHFLFS QIAA+L    +SRKRKR +S ++L+L PS  GGE GG G  H
Subjt:  MDSRQLAALLSSLTSQLLLLLFLL---SDPHSLFPNSIPNSNFYAN---LFNHFLFSDQIAATL----LSRKRKRPNSPDILDLQPSTGGGEGGGGGD-H

Query:  WFRTRSLDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTNFRFWVEFPC
          RTRS DSFRNHFRMTSSTFEWLSGLLEPLL+CRDPVGSPLDLSAE+RLG+GLSRLATGCDFSTISD+FGVSESVARFCAKQLCRVLCTNFRFWVEFPC
Subjt:  WFRTRSLDSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTNFRFWVEFPC

Query:  HNELESTSSGFEDIAGLPNCCGVISCTRFKIVRNSHVCEESIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPVYLHGVAVNQ
         +ELE TSS FEDIAGLPNCCGVISCT                            SIVAGFRG KDDSTVLMS+TLFKD+EE R+L SPPVYLHGVAVNQ
Subjt:  HNELESTSSGFEDIAGLPNCCGVISCTRFKIVRNSHVCEESIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPVYLHGVAVNQ

Query:  YLFGHGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLRNWGILSQSMHEEFKTAVAYIGACSILHNALLMRDDFSAMADEWESLASLNH
        YLFGHG+YPLLPWLMVPF GAVSGS EESFNEAH+LM IPALKAI+SLRNWG+LSQ MHEEFKTAVAYIGACSILHNALLMR+DF+AMADEWESLASL+H
Subjt:  YLFGHGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLRNWGILSQSMHEEFKTAVAYIGACSILHNALLMRDDFSAMADEWESLASLNH

Query:  ----IGVGLNEDFSDEKASLIQGALALRAKELH
            +G+GLNED  DEKAS+IQ ALALRA+ELH
Subjt:  ----IGVGLNEDFSDEKASLIQGALALRAKELH

SwissProt top hitse value%identityAlignment
Q94K49 Protein ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 11.1e-2930.03Show/hide
Query:  SFRNHFRMTSSTFEWLSGLLEPLLDCRDPVG----SPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTNFRFWVEFPCHNEL
        +F++ FR + +TF ++  L+   L  R P G        LS E ++ I L RLA+G    ++   FGV +S       +    L    +  + +P  + +
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLDCRDPVG----SPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTNFRFWVEFPCHNEL

Query:  ESTSSGFEDIAGLPNCCGVISCTRF-----KIVRNSHVCEE----SIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPVYL-H
        E   S FE++ GLPNCCG I  T        +  +   C++    S+  Q V D   R L++V G+ GG   S +L  S  FK  E  ++LD  P  L  
Subjt:  ESTSSGFEDIAGLPNCCGVISCTRF-----KIVRNSHVCEE----SIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPVYL-H

Query:  GVAVNQYLFGHGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLR-NWGILSQSM-HEEFKTAVAYIGACSILHNALLMRDDF
        G  + +Y+ G   YPLLPWL+ P +      +  +FNE H+ +   A  A   L+ +W ILS+ M   + +   + I  C +LHN ++   D+
Subjt:  GVAVNQYLFGHGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLR-NWGILSQSM-HEEFKTAVAYIGACSILHNALLMRDDF

Q9M2U3 Protein ALP1-like8.0e-2828.38Show/hide
Query:  SFRNHFRMTSSTFEWLSGLLEPLLDCR-----DPVGSPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTNFRFWVEFPCHNE
        +F + F+++  TF+++  L++     +     D  G+PL L+   R+ + L RL +G   S I + FG+++S       +    +       + +P  ++
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLDCR-----DPVGSPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTNFRFWVEFPCHNE

Query:  LESTSSGFEDIAGLPNCCGVISCTRFKIVRNSHVCEE------------SIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPV
        L+   S FE I+GLPNCCG I  T   IV N    E             S+  Q VVD   R L ++AG+ G  +D  VL +S  +K VE+G+ L+   +
Subjt:  LESTSSGFEDIAGLPNCCGVISCTRFKIVRNSHVCEE------------SIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPV

Query:  YL-HGVAVNQYLFGHGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLRN-WGILSQSMHEEFKTAV-AYIGACSILHNALLMRDD
         L     + +Y+ G   +PLLPWL+ P++G  +   +  FN+ H      A  A+  L++ W I++  M    +  +   I  C +LHN ++  +D
Subjt:  YL-HGVAVNQYLFGHGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLRN-WGILSQSMHEEFKTAV-AYIGACSILHNALLMRDD

Arabidopsis top hitse value%identityAlignment
AT1G72270.1 CONTAINS InterPro DOMAIN/s: Ribosome 60S biogenesis N-terminal (InterPro:IPR021714)2.5e-1626.23Show/hide
Query:  HFRMTSSTFEWLSGLLEPLLDCRDPVGSPLDLSAEVRLGIGLSRLATGCDFSTISDRFGV-SESVARFCAKQLCRVLCTNFRFWVEFPCHNELESTSSGF
        +FRM+ STF  L  +L                S+       + RLA G  +  +  RFG  S S A      +C++        +      +L+     F
Subjt:  HFRMTSSTFEWLSGLLEPLLDCRDPVGSPLDLSAEVRLGIGLSRLATGCDFSTISDRFGV-SESVARFCAKQLCRVLCTNFRFWVEFPCHNELESTSSGF

Query:  EDIAGLPNCCGVISCTRFKIVRNSHVCEESIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPVYL-HGVAVNQYLFGHGEYPL
             LPNC GV+   RF++       + SI  Q +VDS+ R + I AG+        +   + LF   EE  VL   P  L +GV V +Y+ G    PL
Subjt:  EDIAGLPNCCGVISCTRFKIVRNSHVCEESIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPVYL-HGVAVNQYLFGHGEYPL

Query:  LPWLMVPFEGAVSGSNEESFNEAHKLMCIPALK----AIVSLR-NWGILSQSMHEEFKTAVAY-IGACSILHNALLMRDDFSAMADEWESLASLNHIGVG
        LPWL+ P++     S+EESF E    +    L     A   +R  W IL +    E    + + I    +LHN L+   D     +E  +       G  
Subjt:  LPWLMVPFEGAVSGSNEESFNEAHKLMCIPALK----AIVSLR-NWGILSQSMHEEFKTAVAY-IGACSILHNALLMRDDFSAMADEWESLASLNHIGVG

Query:  LNEDFSDEKASLIQGALALRAKEL
          +D  +E+    +G     +K +
Subjt:  LNEDFSDEKASLIQGALALRAKEL

AT3G19120.1 PIF / Ping-Pong family of plant transposases1.4e-2228.97Show/hide
Query:  SPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTN-FRFWVEFPC-HNELESTSSGFEDIAGLPNCCGVISCTRFKI------
        S L L A+  + + LSRLA GC   T++ R+ +   +       + R+L T  +  +++ P     L  T+ GFE++  LPN CG I  T  K+      
Subjt:  SPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTN-FRFWVEFPC-HNELESTSSGFEDIAGLPNCCGVISCTRFKI------

Query:  -VRNSHVCE---ESIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPVYLHGVAVNQYLFGHGEYPLLPWLMVPFEGAVSGSNE
          RN + C+   +++  Q+V D       +     GG+DDS+    S L+K +  G ++    + + G  V  Y+ G   YPLL +LM PF    SG+  
Subjt:  -VRNSHVCE---ESIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPVYLHGVAVNQYLFGHGEYPLLPWLMVPFEGAVSGSNE

Query:  ESFNEAHKLMCIPALKAIVSL--RNWGILSQSMHEEFKTAVAYIGACSILHN
        E+  +   +     +   + L    W IL QS++     A   I AC +LHN
Subjt:  ESFNEAHKLMCIPALKAIVSL--RNWGILSQSMHEEFKTAVAYIGACSILHN

AT3G55350.1 PIF / Ping-Pong family of plant transposases5.7e-2928.38Show/hide
Query:  SFRNHFRMTSSTFEWLSGLLEPLLDCR-----DPVGSPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTNFRFWVEFPCHNE
        +F + F+++  TF+++  L++     +     D  G+PL L+   R+ + L RL +G   S I + FG+++S       +    +       + +P  ++
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLDCR-----DPVGSPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTNFRFWVEFPCHNE

Query:  LESTSSGFEDIAGLPNCCGVISCTRFKIVRNSHVCEE------------SIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPV
        L+   S FE I+GLPNCCG I  T   IV N    E             S+  Q VVD   R L ++AG+ G  +D  VL +S  +K VE+G+ L+   +
Subjt:  LESTSSGFEDIAGLPNCCGVISCTRFKIVRNSHVCEE------------SIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPV

Query:  YL-HGVAVNQYLFGHGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLRN-WGILSQSMHEEFKTAV-AYIGACSILHNALLMRDD
         L     + +Y+ G   +PLLPWL+ P++G  +   +  FN+ H      A  A+  L++ W I++  M    +  +   I  C +LHN ++  +D
Subjt:  YL-HGVAVNQYLFGHGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLRN-WGILSQSMHEEFKTAV-AYIGACSILHNALLMRDD

AT3G63270.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)7.9e-3130.03Show/hide
Query:  SFRNHFRMTSSTFEWLSGLLEPLLDCRDPVG----SPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTNFRFWVEFPCHNEL
        +F++ FR + +TF ++  L+   L  R P G        LS E ++ I L RLA+G    ++   FGV +S       +    L    +  + +P  + +
Subjt:  SFRNHFRMTSSTFEWLSGLLEPLLDCRDPVG----SPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTNFRFWVEFPCHNEL

Query:  ESTSSGFEDIAGLPNCCGVISCTRF-----KIVRNSHVCEE----SIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPVYL-H
        E   S FE++ GLPNCCG I  T        +  +   C++    S+  Q V D   R L++V G+ GG   S +L  S  FK  E  ++LD  P  L  
Subjt:  ESTSSGFEDIAGLPNCCGVISCTRF-----KIVRNSHVCEE----SIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPVYL-H

Query:  GVAVNQYLFGHGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLR-NWGILSQSM-HEEFKTAVAYIGACSILHNALLMRDDF
        G  + +Y+ G   YPLLPWL+ P +      +  +FNE H+ +   A  A   L+ +W ILS+ M   + +   + I  C +LHN ++   D+
Subjt:  GVAVNQYLFGHGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLR-NWGILSQSM-HEEFKTAVAYIGACSILHNALLMRDDF

AT4G29780.1 unknown protein9.1e-1924.47Show/hide
Query:  STGGGEGGGGGDHWFRTRSL-------------DSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSE
        ++G G G      W + R+              D FR  FRM+ STF  +   L+  +  ++ +     + A  R+G+ + RLATG     +S+RFG+  
Subjt:  STGGGEGGGGGDHWFRTRSL-------------DSFRNHFRMTSSTFEWLSGLLEPLLDCRDPVGSPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSE

Query:  SVARFCAKQLCR----VLCTNFRFWVEFPCHNELESTSSGFEDIAGLPNCCGVISCTRFKIV---------------RNSHVCEESIAAQLVVDSSSRIL
        S       ++CR    VL   +  W   P  +E+ ST + FE +  +PN  G I  T   I+                 +     SI  Q VV++     
Subjt:  SVARFCAKQLCR----VLCTNFRFWVEFPCHNELESTSSGFEDIAGLPNCCGVISCTRFKIV---------------RNSHVCEESIAAQLVVDSSSRIL

Query:  SIVAGFRGGKDDSTVLMSSTLFKD-VEEGRVLDSPPVYLHGVAVNQYLFGHGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLR-NWGI
         +  G  G   D  +L  S+L +     G + DS            ++ G+  +PL  +L+VP+        + +FNE+   +   A  A   L+  W  
Subjt:  SIVAGFRGGKDDSTVLMSSTLFKD-VEEGRVLDSPPVYLHGVAVNQYLFGHGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPALKAIVSLR-NWGI

Query:  LSQSMHEEFKTAVAYIGACSILHNALLMRDD
        L +    + +     +GAC +LHN   MR +
Subjt:  LSQSMHEEFKTAVAYIGACSILHNALLMRDD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCCCGTCAATTAGCCGCTTTACTCTCTTCCTTAACCTCCCAGCTTCTCCTCCTCCTCTTTCTCCTCTCCGACCCACATTCCCTTTTCCCCAATTCCATTCCCAA
TTCCAATTTCTACGCCAATCTCTTCAACCACTTCCTCTTCTCCGACCAAATCGCCGCTACCCTTTTGTCCCGCAAGCGAAAGAGGCCTAACTCGCCGGACATTCTCGATT
TACAGCCATCCACCGGCGGCGGAGAAGGCGGAGGCGGCGGAGACCATTGGTTTCGGACTCGGAGTCTGGACTCGTTCAGAAACCACTTCAGGATGACCTCCTCCACGTTT
GAATGGCTCTCTGGTTTGCTCGAGCCGCTTCTCGACTGTCGCGACCCGGTAGGTTCCCCTCTTGACCTCTCCGCCGAGGTTCGGCTCGGTATCGGCCTGTCTCGGCTGGC
TACTGGCTGCGATTTCTCCACGATTTCGGACCGGTTCGGGGTCTCCGAATCGGTTGCGAGGTTTTGTGCTAAACAGCTGTGTAGAGTTTTATGTACTAATTTTCGGTTCT
GGGTTGAGTTCCCTTGTCACAATGAGTTAGAGTCAACATCCTCAGGTTTTGAAGATATTGCTGGACTTCCCAATTGTTGTGGGGTGATTTCTTGTACTAGGTTTAAGATC
GTTAGGAATAGTCATGTTTGTGAAGAGAGCATTGCTGCTCAACTTGTTGTTGATTCGTCGTCGCGAATACTTAGTATTGTTGCGGGATTTCGCGGCGGTAAGGATGATTC
GACCGTGCTTATGTCGTCGACGCTGTTTAAAGACGTCGAAGAAGGAAGGGTACTGGATTCTCCTCCAGTTTACCTTCATGGGGTGGCTGTGAATCAGTACTTGTTTGGAC
ATGGTGAATACCCTTTGCTTCCATGGTTAATGGTGCCATTTGAAGGAGCTGTTTCAGGGTCAAATGAAGAGAGTTTCAATGAAGCTCACAAATTGATGTGCATTCCAGCT
TTGAAAGCAATTGTTAGTTTGAGAAATTGGGGAATTTTGAGTCAATCTATGCACGAGGAGTTTAAAACTGCTGTTGCTTACATTGGTGCTTGCTCAATTCTTCATAATGC
TTTGTTGATGAGGGACGATTTTTCTGCCATGGCTGATGAATGGGAGAGCTTAGCTTCACTTAATCATATTGGGGTTGGATTGAATGAGGATTTCAGTGATGAGAAGGCTT
CTTTGATACAAGGGGCATTGGCTTTGAGAGCTAAAGAGCTTCATAGATGA
mRNA sequenceShow/hide mRNA sequence
AGGTGTTCTCATGGCGTAACTCCTACAAGATCCTCACCTTCCAATCCGGCGTTGACGAATAACCCACCAACCCTTTCTCTTCTCTCTCACTCAAACGCTCGATTTTCAAA
ATCCCCAATTTCTCTGCAACTCCAAGAACTCAACCATCACCCATTCATGGATTCCCGTCAATTAGCCGCTTTACTCTCTTCCTTAACCTCCCAGCTTCTCCTCCTCCTCT
TTCTCCTCTCCGACCCACATTCCCTTTTCCCCAATTCCATTCCCAATTCCAATTTCTACGCCAATCTCTTCAACCACTTCCTCTTCTCCGACCAAATCGCCGCTACCCTT
TTGTCCCGCAAGCGAAAGAGGCCTAACTCGCCGGACATTCTCGATTTACAGCCATCCACCGGCGGCGGAGAAGGCGGAGGCGGCGGAGACCATTGGTTTCGGACTCGGAG
TCTGGACTCGTTCAGAAACCACTTCAGGATGACCTCCTCCACGTTTGAATGGCTCTCTGGTTTGCTCGAGCCGCTTCTCGACTGTCGCGACCCGGTAGGTTCCCCTCTTG
ACCTCTCCGCCGAGGTTCGGCTCGGTATCGGCCTGTCTCGGCTGGCTACTGGCTGCGATTTCTCCACGATTTCGGACCGGTTCGGGGTCTCCGAATCGGTTGCGAGGTTT
TGTGCTAAACAGCTGTGTAGAGTTTTATGTACTAATTTTCGGTTCTGGGTTGAGTTCCCTTGTCACAATGAGTTAGAGTCAACATCCTCAGGTTTTGAAGATATTGCTGG
ACTTCCCAATTGTTGTGGGGTGATTTCTTGTACTAGGTTTAAGATCGTTAGGAATAGTCATGTTTGTGAAGAGAGCATTGCTGCTCAACTTGTTGTTGATTCGTCGTCGC
GAATACTTAGTATTGTTGCGGGATTTCGCGGCGGTAAGGATGATTCGACCGTGCTTATGTCGTCGACGCTGTTTAAAGACGTCGAAGAAGGAAGGGTACTGGATTCTCCT
CCAGTTTACCTTCATGGGGTGGCTGTGAATCAGTACTTGTTTGGACATGGTGAATACCCTTTGCTTCCATGGTTAATGGTGCCATTTGAAGGAGCTGTTTCAGGGTCAAA
TGAAGAGAGTTTCAATGAAGCTCACAAATTGATGTGCATTCCAGCTTTGAAAGCAATTGTTAGTTTGAGAAATTGGGGAATTTTGAGTCAATCTATGCACGAGGAGTTTA
AAACTGCTGTTGCTTACATTGGTGCTTGCTCAATTCTTCATAATGCTTTGTTGATGAGGGACGATTTTTCTGCCATGGCTGATGAATGGGAGAGCTTAGCTTCACTTAAT
CATATTGGGGTTGGATTGAATGAGGATTTCAGTGATGAGAAGGCTTCTTTGATACAAGGGGCATTGGCTTTGAGAGCTAAAGAGCTTCATAGATGAAATTTCAATCATAA
GAATTCAGTGGTTTGATGGAATAAGTACAGCTCATTTGGCAGAGATTTCATCTCTTAGAATATTTATTGAAGGTTGCTCATCCAGCTCCATTCTACGATTACTATTGTAG
GTAATTGATGATTCATTTACAAATCAC
Protein sequenceShow/hide protein sequence
MDSRQLAALLSSLTSQLLLLLFLLSDPHSLFPNSIPNSNFYANLFNHFLFSDQIAATLLSRKRKRPNSPDILDLQPSTGGGEGGGGGDHWFRTRSLDSFRNHFRMTSSTF
EWLSGLLEPLLDCRDPVGSPLDLSAEVRLGIGLSRLATGCDFSTISDRFGVSESVARFCAKQLCRVLCTNFRFWVEFPCHNELESTSSGFEDIAGLPNCCGVISCTRFKI
VRNSHVCEESIAAQLVVDSSSRILSIVAGFRGGKDDSTVLMSSTLFKDVEEGRVLDSPPVYLHGVAVNQYLFGHGEYPLLPWLMVPFEGAVSGSNEESFNEAHKLMCIPA
LKAIVSLRNWGILSQSMHEEFKTAVAYIGACSILHNALLMRDDFSAMADEWESLASLNHIGVGLNEDFSDEKASLIQGALALRAKELHR