; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g09450 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g09450
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionhistidine protein methyltransferase 1 homolog
Genome locationchr5:7317078..7328966
RNA-Seq ExpressionMoc05g09450
SyntenyMoc05g09450
Gene Ontology termsGO:0006285 - base-excision repair, AP site formation (biological process)
GO:0032259 - methylation (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0005634 - nucleus (cellular component)
GO:0140078 - class I DNA-(apurinic or apyrimidinic site) endonuclease activity (molecular function)
GO:0000703 - oxidized pyrimidine nucleobase lesion DNA N-glycosylase activity (molecular function)
GO:0051539 - 4 iron, 4 sulfur cluster binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0003677 - DNA binding (molecular function)
InterPro domainsIPR030841 - Endonuclease III-like protein 1
IPR029063 - S-adenosyl-L-methionine-dependent methyltransferase
IPR023170 - Helix-hairpin-helix, base-excision DNA repair, C-terminal
IPR011257 - DNA glycosylase
IPR004036 - Endonuclease III-like, conserved site-2
IPR004035 - Endonuclease III, iron-sulphur binding site
IPR000445 - Helix-hairpin-helix motif
IPR003265 - HhH-GPD domain
IPR003651 - Endonuclease III-like, iron-sulphur cluster loop motif


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008453987.1 PREDICTED: uncharacterized protein LOC103494545 [Cucumis melo]4.7e-19295.43Show/hide
Query:  MRAPSLLAQCLPGLVPQDKGSHSTPSISERDVHLPSPAVEILPSKTAHPYKYAGENVDLHGLNVFKGRVSVADIIGFNNSESVSSKPEGYLKCWDSSIDL
        MRAPSLL+QCLPGL+PQDKGSHS+PSISERDVHLPSPAVEILPSKTAHPYKYAGENVDL GLNVFKGRVSVADIIGFN SES SSKPEG+LK WDSSIDL
Subjt:  MRAPSLLAQCLPGLVPQDKGSHSTPSISERDVHLPSPAVEILPSKTAHPYKYAGENVDLHGLNVFKGRVSVADIIGFNNSESVSSKPEGYLKCWDSSIDL

Query:  VNVLKHEIRDGQLSFRGKRVLELGCSYGLPGVFACLKGASIVHFQDLSAETVRCTTIPNVLANLEQARDRQSRQPESPLTPSRHTLTPSVHFYAGDWEEL
        VNVLKHEIRDGQLSFRGKRVLELGCSYGLPGVFACLKGASIVHFQDLSAETVRCTTIPNVLANLEQARDRQSRQPESPLTPSRHTL PSVHFYAGDWEEL
Subjt:  VNVLKHEIRDGQLSFRGKRVLELGCSYGLPGVFACLKGASIVHFQDLSAETVRCTTIPNVLANLEQARDRQSRQPESPLTPSRHTLTPSVHFYAGDWEEL

Query:  STVLSVVRGDGFEVPTGMSLSFSEEDFMDGCSSQDGSIIGHESSSRRSRKLSGSRAWERASETDQGEGGYDVILMAEIPYSLNSLKKLYALIKRCVRPPY
         TVLSVVRGDGFE PTGMSLSFSEEDFMDGCSSQDGSIIGHESSSRRSRKLSGSRAWERASE DQGEGGYD+ILMAEIP+SLNSLKKLYALIK+CVRPPY
Subjt:  STVLSVVRGDGFEVPTGMSLSFSEEDFMDGCSSQDGSIIGHESSSRRSRKLSGSRAWERASETDQGEGGYDVILMAEIPYSLNSLKKLYALIKRCVRPPY

Query:  GVLYLATKKNYVGFNSGARHLRSLVDEEGVFGAHLVKEMTDRDIWKFFLK
        GVLYLATKKNYVGFNSGARHLRSLVDEEGVFGAHLVKEMTDRD+WKFFLK
Subjt:  GVLYLATKKNYVGFNSGARHLRSLVDEEGVFGAHLVKEMTDRDIWKFFLK

XP_022145216.1 endonuclease III homolog 1, chloroplastic-like isoform X2 [Momordica charantia]6.2e-19299.41Show/hide
Query:  LKSSNGVSESETRVFVRRRVKKNAEVQDSGPQVEPNVDAKRCCPPDIEDFAFKRTKESPGSWKSKPPPLDPLVTGIEVSNPIRQKGIAERGKPPVNWEKI
        ++SSNGVSESETRVFVRRRVKKNAEVQDSGPQVEPNVDAKRCCPPDIEDFAFKRTKESPGSWKSKPPPLDPLVTGIEVSNPIRQKGIAERGKPPVNWEKI
Subjt:  LKSSNGVSESETRVFVRRRVKKNAEVQDSGPQVEPNVDAKRCCPPDIEDFAFKRTKESPGSWKSKPPPLDPLVTGIEVSNPIRQKGIAERGKPPVNWEKI

Query:  LEGIREMRSSEDAPVDTMGCGQAASTLPPKERRFSVLASSLLSSQTKDHVTHGAATRLQENGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKI
        LEGIREMRSSEDAPVDTMGCGQAASTLPPKERRFSVLASSLLSSQTKDHVTHGAATRLQENGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKI
Subjt:  LEGIREMRSSEDAPVDTMGCGQAASTLPPKERRFSVLASSLLSSQTKDHVTHGAATRLQENGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKI

Query:  CLIKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRISNRLGWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINTLLVGFGQTIC
        CLIKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRISNRLGWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINTLLVGFGQTIC
Subjt:  CLIKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRISNRLGWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINTLLVGFGQTIC

Query:  TPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKKL
        TPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKKL
Subjt:  TPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKKL

XP_022145230.1 histidine protein methyltransferase 1 homolog [Momordica charantia]2.8e-200100Show/hide
Query:  MRAPSLLAQCLPGLVPQDKGSHSTPSISERDVHLPSPAVEILPSKTAHPYKYAGENVDLHGLNVFKGRVSVADIIGFNNSESVSSKPEGYLKCWDSSIDL
        MRAPSLLAQCLPGLVPQDKGSHSTPSISERDVHLPSPAVEILPSKTAHPYKYAGENVDLHGLNVFKGRVSVADIIGFNNSESVSSKPEGYLKCWDSSIDL
Subjt:  MRAPSLLAQCLPGLVPQDKGSHSTPSISERDVHLPSPAVEILPSKTAHPYKYAGENVDLHGLNVFKGRVSVADIIGFNNSESVSSKPEGYLKCWDSSIDL

Query:  VNVLKHEIRDGQLSFRGKRVLELGCSYGLPGVFACLKGASIVHFQDLSAETVRCTTIPNVLANLEQARDRQSRQPESPLTPSRHTLTPSVHFYAGDWEEL
        VNVLKHEIRDGQLSFRGKRVLELGCSYGLPGVFACLKGASIVHFQDLSAETVRCTTIPNVLANLEQARDRQSRQPESPLTPSRHTLTPSVHFYAGDWEEL
Subjt:  VNVLKHEIRDGQLSFRGKRVLELGCSYGLPGVFACLKGASIVHFQDLSAETVRCTTIPNVLANLEQARDRQSRQPESPLTPSRHTLTPSVHFYAGDWEEL

Query:  STVLSVVRGDGFEVPTGMSLSFSEEDFMDGCSSQDGSIIGHESSSRRSRKLSGSRAWERASETDQGEGGYDVILMAEIPYSLNSLKKLYALIKRCVRPPY
        STVLSVVRGDGFEVPTGMSLSFSEEDFMDGCSSQDGSIIGHESSSRRSRKLSGSRAWERASETDQGEGGYDVILMAEIPYSLNSLKKLYALIKRCVRPPY
Subjt:  STVLSVVRGDGFEVPTGMSLSFSEEDFMDGCSSQDGSIIGHESSSRRSRKLSGSRAWERASETDQGEGGYDVILMAEIPYSLNSLKKLYALIKRCVRPPY

Query:  GVLYLATKKNYVGFNSGARHLRSLVDEEGVFGAHLVKEMTDRDIWKFFLK
        GVLYLATKKNYVGFNSGARHLRSLVDEEGVFGAHLVKEMTDRDIWKFFLK
Subjt:  GVLYLATKKNYVGFNSGARHLRSLVDEEGVFGAHLVKEMTDRDIWKFFLK

XP_022972108.1 histidine protein methyltransferase 1 homolog [Cucurbita maxima]4.7e-19296Show/hide
Query:  MRAPSLLAQCLPGLVPQDKGSHSTPSISERDVHLPSPAVEILPSKTAHPYKYAGENVDLHGLNVFKGRVSVADIIGFNNSESVSSKPEGYLKCWDSSIDL
        MRAPSLLAQCLPGLVPQDKGS S+PSISERDVHLPSPAVEILPSKTAHPYKYAGENVDL GLNVFKGRVSVADIIGFN SES+SSKPEGYLK WDSSIDL
Subjt:  MRAPSLLAQCLPGLVPQDKGSHSTPSISERDVHLPSPAVEILPSKTAHPYKYAGENVDLHGLNVFKGRVSVADIIGFNNSESVSSKPEGYLKCWDSSIDL

Query:  VNVLKHEIRDGQLSFRGKRVLELGCSYGLPGVFACLKGASIVHFQDLSAETVRCTTIPNVLANLEQARDRQSRQPESPLTPSRHTLTPSVHFYAGDWEEL
        VNVLKHEIRDGQLSFRGKRVLELGCSYGLPGVFACLKGASIVHFQDLSAE+VRCTTIPNVLANLEQARDRQSRQPESPLTPSRH+L+PSVHFYAGDWEEL
Subjt:  VNVLKHEIRDGQLSFRGKRVLELGCSYGLPGVFACLKGASIVHFQDLSAETVRCTTIPNVLANLEQARDRQSRQPESPLTPSRHTLTPSVHFYAGDWEEL

Query:  STVLSVVRGDGFEVPTGMSLSFSEEDFMDGCSSQDGSIIGHESSSRRSRKLSGSRAWERASETDQGEGGYDVILMAEIPYSLNSLKKLYALIKRCVRPPY
         TVLSVVRGDGFE PTGMSLSFSEEDFMDGCSSQDGSIIGHESSSRRSRKLSGSRAWERASE+DQGEGGYDVILMAEIPYSLNSLKKLY+LIK+CVRPPY
Subjt:  STVLSVVRGDGFEVPTGMSLSFSEEDFMDGCSSQDGSIIGHESSSRRSRKLSGSRAWERASETDQGEGGYDVILMAEIPYSLNSLKKLYALIKRCVRPPY

Query:  GVLYLATKKNYVGFNSGARHLRSLVDEEGVFGAHLVKEMTDRDIWKFFLK
        GVLYLATKKNYVGFNSGARHLRSLVDEEGVFGAHLVKEMTDRDIWKFFLK
Subjt:  GVLYLATKKNYVGFNSGARHLRSLVDEEGVFGAHLVKEMTDRDIWKFFLK

XP_038897202.1 histidine protein methyltransferase 1 homolog [Benincasa hispida]1.6e-19296Show/hide
Query:  MRAPSLLAQCLPGLVPQDKGSHSTPSISERDVHLPSPAVEILPSKTAHPYKYAGENVDLHGLNVFKGRVSVADIIGFNNSESVSSKPEGYLKCWDSSIDL
        MRAPSLL+QCLPGLVPQDKGSHS+PSISERDVHLPSPAVEILPSKTAHPYKYAGENVDL GLNVFKGRVSVADIIGFN SES SSKPEG+LK WDSSIDL
Subjt:  MRAPSLLAQCLPGLVPQDKGSHSTPSISERDVHLPSPAVEILPSKTAHPYKYAGENVDLHGLNVFKGRVSVADIIGFNNSESVSSKPEGYLKCWDSSIDL

Query:  VNVLKHEIRDGQLSFRGKRVLELGCSYGLPGVFACLKGASIVHFQDLSAETVRCTTIPNVLANLEQARDRQSRQPESPLTPSRHTLTPSVHFYAGDWEEL
        VNVLKHEIRDGQLSFRGKRVLELGCSYGLPGVFACLKGASIVHFQDLSAETVRCTTIPNVLANLEQARDRQSRQPESPLTPSRHTL PSVHFYAGDW+EL
Subjt:  VNVLKHEIRDGQLSFRGKRVLELGCSYGLPGVFACLKGASIVHFQDLSAETVRCTTIPNVLANLEQARDRQSRQPESPLTPSRHTLTPSVHFYAGDWEEL

Query:  STVLSVVRGDGFEVPTGMSLSFSEEDFMDGCSSQDGSIIGHESSSRRSRKLSGSRAWERASETDQGEGGYDVILMAEIPYSLNSLKKLYALIKRCVRPPY
         TVLSVVRGDGFE PTGMSLSFSEEDFMDGCSSQDGSIIGHESSSRRSRKLSGSRAWERASE DQGEGGYDVILMAEIPYSLNSLKKLYALIK+CVRPPY
Subjt:  STVLSVVRGDGFEVPTGMSLSFSEEDFMDGCSSQDGSIIGHESSSRRSRKLSGSRAWERASETDQGEGGYDVILMAEIPYSLNSLKKLYALIKRCVRPPY

Query:  GVLYLATKKNYVGFNSGARHLRSLVDEEGVFGAHLVKEMTDRDIWKFFLK
        GVLYLATKKNYVGFNSGARHLRSLVDEEGVFGAHLVKEMTDRD+WKFFLK
Subjt:  GVLYLATKKNYVGFNSGARHLRSLVDEEGVFGAHLVKEMTDRDIWKFFLK

TrEMBL top hitse value%identityAlignment
A0A1S3BXP6 uncharacterized protein LOC1034945452.3e-19295.43Show/hide
Query:  MRAPSLLAQCLPGLVPQDKGSHSTPSISERDVHLPSPAVEILPSKTAHPYKYAGENVDLHGLNVFKGRVSVADIIGFNNSESVSSKPEGYLKCWDSSIDL
        MRAPSLL+QCLPGL+PQDKGSHS+PSISERDVHLPSPAVEILPSKTAHPYKYAGENVDL GLNVFKGRVSVADIIGFN SES SSKPEG+LK WDSSIDL
Subjt:  MRAPSLLAQCLPGLVPQDKGSHSTPSISERDVHLPSPAVEILPSKTAHPYKYAGENVDLHGLNVFKGRVSVADIIGFNNSESVSSKPEGYLKCWDSSIDL

Query:  VNVLKHEIRDGQLSFRGKRVLELGCSYGLPGVFACLKGASIVHFQDLSAETVRCTTIPNVLANLEQARDRQSRQPESPLTPSRHTLTPSVHFYAGDWEEL
        VNVLKHEIRDGQLSFRGKRVLELGCSYGLPGVFACLKGASIVHFQDLSAETVRCTTIPNVLANLEQARDRQSRQPESPLTPSRHTL PSVHFYAGDWEEL
Subjt:  VNVLKHEIRDGQLSFRGKRVLELGCSYGLPGVFACLKGASIVHFQDLSAETVRCTTIPNVLANLEQARDRQSRQPESPLTPSRHTLTPSVHFYAGDWEEL

Query:  STVLSVVRGDGFEVPTGMSLSFSEEDFMDGCSSQDGSIIGHESSSRRSRKLSGSRAWERASETDQGEGGYDVILMAEIPYSLNSLKKLYALIKRCVRPPY
         TVLSVVRGDGFE PTGMSLSFSEEDFMDGCSSQDGSIIGHESSSRRSRKLSGSRAWERASE DQGEGGYD+ILMAEIP+SLNSLKKLYALIK+CVRPPY
Subjt:  STVLSVVRGDGFEVPTGMSLSFSEEDFMDGCSSQDGSIIGHESSSRRSRKLSGSRAWERASETDQGEGGYDVILMAEIPYSLNSLKKLYALIKRCVRPPY

Query:  GVLYLATKKNYVGFNSGARHLRSLVDEEGVFGAHLVKEMTDRDIWKFFLK
        GVLYLATKKNYVGFNSGARHLRSLVDEEGVFGAHLVKEMTDRD+WKFFLK
Subjt:  GVLYLATKKNYVGFNSGARHLRSLVDEEGVFGAHLVKEMTDRDIWKFFLK

A0A6J1CTU5 Endonuclease III homolog3.0e-19299.41Show/hide
Query:  LKSSNGVSESETRVFVRRRVKKNAEVQDSGPQVEPNVDAKRCCPPDIEDFAFKRTKESPGSWKSKPPPLDPLVTGIEVSNPIRQKGIAERGKPPVNWEKI
        ++SSNGVSESETRVFVRRRVKKNAEVQDSGPQVEPNVDAKRCCPPDIEDFAFKRTKESPGSWKSKPPPLDPLVTGIEVSNPIRQKGIAERGKPPVNWEKI
Subjt:  LKSSNGVSESETRVFVRRRVKKNAEVQDSGPQVEPNVDAKRCCPPDIEDFAFKRTKESPGSWKSKPPPLDPLVTGIEVSNPIRQKGIAERGKPPVNWEKI

Query:  LEGIREMRSSEDAPVDTMGCGQAASTLPPKERRFSVLASSLLSSQTKDHVTHGAATRLQENGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKI
        LEGIREMRSSEDAPVDTMGCGQAASTLPPKERRFSVLASSLLSSQTKDHVTHGAATRLQENGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKI
Subjt:  LEGIREMRSSEDAPVDTMGCGQAASTLPPKERRFSVLASSLLSSQTKDHVTHGAATRLQENGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKI

Query:  CLIKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRISNRLGWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINTLLVGFGQTIC
        CLIKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRISNRLGWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINTLLVGFGQTIC
Subjt:  CLIKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRISNRLGWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINTLLVGFGQTIC

Query:  TPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKKL
        TPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKKL
Subjt:  TPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKKL

A0A6J1CVP5 Endonuclease III homolog3.0e-19299.41Show/hide
Query:  LKSSNGVSESETRVFVRRRVKKNAEVQDSGPQVEPNVDAKRCCPPDIEDFAFKRTKESPGSWKSKPPPLDPLVTGIEVSNPIRQKGIAERGKPPVNWEKI
        ++SSNGVSESETRVFVRRRVKKNAEVQDSGPQVEPNVDAKRCCPPDIEDFAFKRTKESPGSWKSKPPPLDPLVTGIEVSNPIRQKGIAERGKPPVNWEKI
Subjt:  LKSSNGVSESETRVFVRRRVKKNAEVQDSGPQVEPNVDAKRCCPPDIEDFAFKRTKESPGSWKSKPPPLDPLVTGIEVSNPIRQKGIAERGKPPVNWEKI

Query:  LEGIREMRSSEDAPVDTMGCGQAASTLPPKERRFSVLASSLLSSQTKDHVTHGAATRLQENGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKI
        LEGIREMRSSEDAPVDTMGCGQAASTLPPKERRFSVLASSLLSSQTKDHVTHGAATRLQENGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKI
Subjt:  LEGIREMRSSEDAPVDTMGCGQAASTLPPKERRFSVLASSLLSSQTKDHVTHGAATRLQENGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKI

Query:  CLIKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRISNRLGWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINTLLVGFGQTIC
        CLIKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRISNRLGWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINTLLVGFGQTIC
Subjt:  CLIKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRISNRLGWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINTLLVGFGQTIC

Query:  TPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKKL
        TPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKKL
Subjt:  TPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKKL

A0A6J1CVR1 histidine protein methyltransferase 1 homolog1.3e-200100Show/hide
Query:  MRAPSLLAQCLPGLVPQDKGSHSTPSISERDVHLPSPAVEILPSKTAHPYKYAGENVDLHGLNVFKGRVSVADIIGFNNSESVSSKPEGYLKCWDSSIDL
        MRAPSLLAQCLPGLVPQDKGSHSTPSISERDVHLPSPAVEILPSKTAHPYKYAGENVDLHGLNVFKGRVSVADIIGFNNSESVSSKPEGYLKCWDSSIDL
Subjt:  MRAPSLLAQCLPGLVPQDKGSHSTPSISERDVHLPSPAVEILPSKTAHPYKYAGENVDLHGLNVFKGRVSVADIIGFNNSESVSSKPEGYLKCWDSSIDL

Query:  VNVLKHEIRDGQLSFRGKRVLELGCSYGLPGVFACLKGASIVHFQDLSAETVRCTTIPNVLANLEQARDRQSRQPESPLTPSRHTLTPSVHFYAGDWEEL
        VNVLKHEIRDGQLSFRGKRVLELGCSYGLPGVFACLKGASIVHFQDLSAETVRCTTIPNVLANLEQARDRQSRQPESPLTPSRHTLTPSVHFYAGDWEEL
Subjt:  VNVLKHEIRDGQLSFRGKRVLELGCSYGLPGVFACLKGASIVHFQDLSAETVRCTTIPNVLANLEQARDRQSRQPESPLTPSRHTLTPSVHFYAGDWEEL

Query:  STVLSVVRGDGFEVPTGMSLSFSEEDFMDGCSSQDGSIIGHESSSRRSRKLSGSRAWERASETDQGEGGYDVILMAEIPYSLNSLKKLYALIKRCVRPPY
        STVLSVVRGDGFEVPTGMSLSFSEEDFMDGCSSQDGSIIGHESSSRRSRKLSGSRAWERASETDQGEGGYDVILMAEIPYSLNSLKKLYALIKRCVRPPY
Subjt:  STVLSVVRGDGFEVPTGMSLSFSEEDFMDGCSSQDGSIIGHESSSRRSRKLSGSRAWERASETDQGEGGYDVILMAEIPYSLNSLKKLYALIKRCVRPPY

Query:  GVLYLATKKNYVGFNSGARHLRSLVDEEGVFGAHLVKEMTDRDIWKFFLK
        GVLYLATKKNYVGFNSGARHLRSLVDEEGVFGAHLVKEMTDRDIWKFFLK
Subjt:  GVLYLATKKNYVGFNSGARHLRSLVDEEGVFGAHLVKEMTDRDIWKFFLK

A0A6J1I8X6 histidine protein methyltransferase 1 homolog2.3e-19296Show/hide
Query:  MRAPSLLAQCLPGLVPQDKGSHSTPSISERDVHLPSPAVEILPSKTAHPYKYAGENVDLHGLNVFKGRVSVADIIGFNNSESVSSKPEGYLKCWDSSIDL
        MRAPSLLAQCLPGLVPQDKGS S+PSISERDVHLPSPAVEILPSKTAHPYKYAGENVDL GLNVFKGRVSVADIIGFN SES+SSKPEGYLK WDSSIDL
Subjt:  MRAPSLLAQCLPGLVPQDKGSHSTPSISERDVHLPSPAVEILPSKTAHPYKYAGENVDLHGLNVFKGRVSVADIIGFNNSESVSSKPEGYLKCWDSSIDL

Query:  VNVLKHEIRDGQLSFRGKRVLELGCSYGLPGVFACLKGASIVHFQDLSAETVRCTTIPNVLANLEQARDRQSRQPESPLTPSRHTLTPSVHFYAGDWEEL
        VNVLKHEIRDGQLSFRGKRVLELGCSYGLPGVFACLKGASIVHFQDLSAE+VRCTTIPNVLANLEQARDRQSRQPESPLTPSRH+L+PSVHFYAGDWEEL
Subjt:  VNVLKHEIRDGQLSFRGKRVLELGCSYGLPGVFACLKGASIVHFQDLSAETVRCTTIPNVLANLEQARDRQSRQPESPLTPSRHTLTPSVHFYAGDWEEL

Query:  STVLSVVRGDGFEVPTGMSLSFSEEDFMDGCSSQDGSIIGHESSSRRSRKLSGSRAWERASETDQGEGGYDVILMAEIPYSLNSLKKLYALIKRCVRPPY
         TVLSVVRGDGFE PTGMSLSFSEEDFMDGCSSQDGSIIGHESSSRRSRKLSGSRAWERASE+DQGEGGYDVILMAEIPYSLNSLKKLY+LIK+CVRPPY
Subjt:  STVLSVVRGDGFEVPTGMSLSFSEEDFMDGCSSQDGSIIGHESSSRRSRKLSGSRAWERASETDQGEGGYDVILMAEIPYSLNSLKKLYALIKRCVRPPY

Query:  GVLYLATKKNYVGFNSGARHLRSLVDEEGVFGAHLVKEMTDRDIWKFFLK
        GVLYLATKKNYVGFNSGARHLRSLVDEEGVFGAHLVKEMTDRDIWKFFLK
Subjt:  GVLYLATKKNYVGFNSGARHLRSLVDEEGVFGAHLVKEMTDRDIWKFFLK

SwissProt top hitse value%identityAlignment
A7M7B9 Endonuclease III-like protein 13.3e-6348.88Show/hide
Query:  KRTKESPGSWKSKPPPLDPLVTGIEVSNPIRQKGIAERGKPPVNWEKILEGIREMRSSEDAPVDTMGCGQAAST-LPPKERRFSVLASSLLSSQTKDHVT
        +RT+  P +++++P P  P         P  +         P NW++ LE IREMR   DAPVD MG  +   T  PP+  R+ VL S +LSSQTKD VT
Subjt:  KRTKESPGSWKSKPPPLDPLVTGIEVSNPIRQKGIAERGKPPVNWEKILEGIREMRSSEDAPVDTMGCGQAAST-LPPKERRFSVLASSLLSSQTKDHVT

Query:  HGAATRLQENGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLIKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRIS
          A  RL++ G LT D++ + D+AT+  +IYPVGF+  K + +K+   I   KYGGDIP ++EEL+ LPG+GPK+AHL M +AWN V GI VDTHVHRI+
Subjt:  HGAATRLQENGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLIKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRIS

Query:  NRLGWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINTLLVGFGQTICTPLRPKCGNCSVSDLCPSA
        NRL WV     K++T  PEETRVALE WLP++ W  IN LLVGFGQ  C P+ P+C  C   D+CP+A
Subjt:  NRLGWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINTLLVGFGQTICTPLRPKCGNCSVSDLCPSA

B9DFZ0 Endonuclease III homolog 2, chloroplastic3.6e-11059.41Show/hide
Query:  KSSNGVSESETRVFVRRRVKKNAEVQDSGPQVEPNVDAKR--CCPPDIEDFAFKRTKESPGSWKSKPPPLDPLVTGIEVSNPIRQKGIAERGKPPVNWEK
        ++++G SESETRV +R++  K  +++           A++  C  PDIED  +K+T    G+  S+   L+  +   E S        A  G PP NWEK
Subjt:  KSSNGVSESETRVFVRRRVKKNAEVQDSGPQVEPNVDAKR--CCPPDIEDFAFKRTKESPGSWKSKPPPLDPLVTGIEVSNPIRQKGIAERGKPPVNWEK

Query:  ILEGIREMRSSEDAPVDTMGCGQAASTLPPKERRFSVLASSLLSSQTKDHVTHGAATRLQENGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAK
        +LEGIR+M+ SE+APV+ + C +  S LPPKERRF VL  +LLSSQTK+H+T  A  RL +NGLLT +A+DKADE+TIK LIYPVGFY+ KA N+KK+AK
Subjt:  ILEGIREMRSSEDAPVDTMGCGQAASTLPPKERRFSVLASSLLSSQTKDHVTHGAATRLQENGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAK

Query:  ICLIKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRISNRLGWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINTLLVGFGQTI
        ICL++Y GDIPR+LEELL LPG+GPKIAHL++ +AWNDVQGICVDTHVHRI NRLGWVS  G+KQKTS+PEETRVAL+ WLPK EWV IN LLVGFGQTI
Subjt:  ICLIKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRISNRLGWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINTLLVGFGQTI

Query:  CTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKKL
        CTPLRP CG CS++++CPSAFKE+ S S KLK+S  +KKL
Subjt:  CTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKKL

P78549 Endonuclease III-like protein 11.2e-6053.3Show/hide
Query:  PVNWEKILEGIREMRSSEDAPVDTMGCGQA-ASTLPPKERRFSVLASSLLSSQTKDHVTHGAATRLQENGLLTADAMDKADEATIKSLIYPVGFYSTKAR
        P +W++ L  IR MR+ +DAPVD +G      S+ PPK RR+ VL S +LSSQTKD VT GA  RL+  G LT D++ + D+AT+  LIYPVGF+ +K +
Subjt:  PVNWEKILEGIREMRSSEDAPVDTMGCGQA-ASTLPPKERRFSVLASSLLSSQTKDHVTHGAATRLQENGLLTADAMDKADEATIKSLIYPVGFYSTKAR

Query:  NLKKIAKICLIKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRISNRLGWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINTLL
         +K+ + I    YGGDIP S+ EL+ LPG+GPK+AHL M +AW  V GI VDTHVHRI+NRL W     +K+ T +PEETR ALE WLP+E W  IN LL
Subjt:  NLKKIAKICLIKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRISNRLGWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINTLL

Query:  VGFGQTICTPLRPKCGNCSVSDLCPSA
        VGFGQ  C P+ P+C  C    LCP+A
Subjt:  VGFGQTICTPLRPKCGNCSVSDLCPSA

Q2KID2 Endonuclease III-like protein 19.9e-6052.42Show/hide
Query:  PVNWEKILEGIREMRSSEDAPVDTMGCGQAAS-TLPPKERRFSVLASSLLSSQTKDHVTHGAATRLQENGLLTADAMDKADEATIKSLIYPVGFYSTKAR
        P +W + L+ IR MRS +DAPVD +G       +  PK RR+ VL S +LSSQTKD VT GA  RL+  G LT D++ + D++T+ +LIYPVGF+ +K +
Subjt:  PVNWEKILEGIREMRSSEDAPVDTMGCGQAAS-TLPPKERRFSVLASSLLSSQTKDHVTHGAATRLQENGLLTADAMDKADEATIKSLIYPVGFYSTKAR

Query:  NLKKIAKICLIKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRISNRLGWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINTLL
         +K+ + I   +Y GDIP S+ EL+ LPG+GPK+AHL M +AW  V GI VDTHVHRI+NRL W     +K+ T +PEETR ALE WLP+E W  IN LL
Subjt:  NLKKIAKICLIKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRISNRLGWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINTLL

Query:  VGFGQTICTPLRPKCGNCSVSDLCPSA
        VGFGQ  C P+RP+C  C    LCP+A
Subjt:  VGFGQTICTPLRPKCGNCSVSDLCPSA

Q9SIC4 Endonuclease III homolog 1, chloroplastic2.1e-11061.68Show/hide
Query:  GVSESETRVFVRRRVKKNAEVQDSGPQVEPNVDAKRCCP-PDIEDFAFKRTKESPGSWKSKPPPLDPLVTGIEVSNPIRQKGIAERGKPPVNWEKILEGI
        G S SETRV+ R++  K    +         V+  + C  PDIEDFA+K+T  SP S +S         T I V++      +   G PP NW ++LEGI
Subjt:  GVSESETRVFVRRRVKKNAEVQDSGPQVEPNVDAKRCCP-PDIEDFAFKRTKESPGSWKSKPPPLDPLVTGIEVSNPIRQKGIAERGKPPVNWEKILEGI

Query:  REMRSSEDAPVDTMGCGQAASTLPPKERRFSVLASSLLSSQTKDHVTHGAATRLQENGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLIK
        R+MRSSEDAPVD+MGC +A S LPP ERRF+VL  +LLSSQTKD V + A  RL +NGLLT +A+DKADE+TIK LIYPVGFY+ KA  +KKIA+ICL+K
Subjt:  REMRSSEDAPVDTMGCGQAASTLPPKERRFSVLASSLLSSQTKDHVTHGAATRLQENGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLIK

Query:  YGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRISNRLGWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINTLLVGFGQTICTPLR
        Y GDIP SL++LL LPGIGPK+AHLI+ +AWNDVQGICVDTHVHRI NRLGWVS  G+KQKT++PEETRVAL+ WLPKEEWV IN LLVGFGQ ICTP+R
Subjt:  YGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRISNRLGWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINTLLVGFGQTICTPLR

Query:  PKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKK
        P+C  CSVS LCP+AFKE+SSPS KLK+S+ +K+
Subjt:  PKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKK

Arabidopsis top hitse value%identityAlignment
AT1G05900.2 endonuclease III 22.6e-11159.41Show/hide
Query:  KSSNGVSESETRVFVRRRVKKNAEVQDSGPQVEPNVDAKR--CCPPDIEDFAFKRTKESPGSWKSKPPPLDPLVTGIEVSNPIRQKGIAERGKPPVNWEK
        ++++G SESETRV +R++  K  +++           A++  C  PDIED  +K+T    G+  S+   L+  +   E S        A  G PP NWEK
Subjt:  KSSNGVSESETRVFVRRRVKKNAEVQDSGPQVEPNVDAKR--CCPPDIEDFAFKRTKESPGSWKSKPPPLDPLVTGIEVSNPIRQKGIAERGKPPVNWEK

Query:  ILEGIREMRSSEDAPVDTMGCGQAASTLPPKERRFSVLASSLLSSQTKDHVTHGAATRLQENGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAK
        +LEGIR+M+ SE+APV+ + C +  S LPPKERRF VL  +LLSSQTK+H+T  A  RL +NGLLT +A+DKADE+TIK LIYPVGFY+ KA N+KK+AK
Subjt:  ILEGIREMRSSEDAPVDTMGCGQAASTLPPKERRFSVLASSLLSSQTKDHVTHGAATRLQENGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAK

Query:  ICLIKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRISNRLGWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINTLLVGFGQTI
        ICL++Y GDIPR+LEELL LPG+GPKIAHL++ +AWNDVQGICVDTHVHRI NRLGWVS  G+KQKTS+PEETRVAL+ WLPK EWV IN LLVGFGQTI
Subjt:  ICLIKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRISNRLGWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINTLLVGFGQTI

Query:  CTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKKL
        CTPLRP CG CS++++CPSAFKE+ S S KLK+S  +KKL
Subjt:  CTPLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKKL

AT2G31450.1 DNA glycosylase superfamily protein1.5e-11161.68Show/hide
Query:  GVSESETRVFVRRRVKKNAEVQDSGPQVEPNVDAKRCCP-PDIEDFAFKRTKESPGSWKSKPPPLDPLVTGIEVSNPIRQKGIAERGKPPVNWEKILEGI
        G S SETRV+ R++  K    +         V+  + C  PDIEDFA+K+T  SP S +S         T I V++      +   G PP NW ++LEGI
Subjt:  GVSESETRVFVRRRVKKNAEVQDSGPQVEPNVDAKRCCP-PDIEDFAFKRTKESPGSWKSKPPPLDPLVTGIEVSNPIRQKGIAERGKPPVNWEKILEGI

Query:  REMRSSEDAPVDTMGCGQAASTLPPKERRFSVLASSLLSSQTKDHVTHGAATRLQENGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLIK
        R+MRSSEDAPVD+MGC +A S LPP ERRF+VL  +LLSSQTKD V + A  RL +NGLLT +A+DKADE+TIK LIYPVGFY+ KA  +KKIA+ICL+K
Subjt:  REMRSSEDAPVDTMGCGQAASTLPPKERRFSVLASSLLSSQTKDHVTHGAATRLQENGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICLIK

Query:  YGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRISNRLGWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINTLLVGFGQTICTPLR
        Y GDIP SL++LL LPGIGPK+AHLI+ +AWNDVQGICVDTHVHRI NRLGWVS  G+KQKT++PEETRVAL+ WLPKEEWV IN LLVGFGQ ICTP+R
Subjt:  YGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRISNRLGWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINTLLVGFGQTICTPLR

Query:  PKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKK
        P+C  CSVS LCP+AFKE+SSPS KLK+S+ +K+
Subjt:  PKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKK

AT2G31450.2 DNA glycosylase superfamily protein6.7e-11261.42Show/hide
Query:  SSNGVSESETRVFVRRRVKKNAEVQDSGPQVEPNVDAKRCCP-PDIEDFAFKRTKESPGSWKSKPPPLDPLVTGIEVSNPIRQKGIAERGKPPVNWEKIL
        S+ G S SETRV+ R++  K    +         V+  + C  PDIEDFA+K+T  SP S +S         T I V++      +   G PP NW ++L
Subjt:  SSNGVSESETRVFVRRRVKKNAEVQDSGPQVEPNVDAKRCCP-PDIEDFAFKRTKESPGSWKSKPPPLDPLVTGIEVSNPIRQKGIAERGKPPVNWEKIL

Query:  EGIREMRSSEDAPVDTMGCGQAASTLPPKERRFSVLASSLLSSQTKDHVTHGAATRLQENGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKIC
        EGIR+MRSSEDAPVD+MGC +A S LPP ERRF+VL  +LLSSQTKD V + A  RL +NGLLT +A+DKADE+TIK LIYPVGFY+ KA  +KKIA+IC
Subjt:  EGIREMRSSEDAPVDTMGCGQAASTLPPKERRFSVLASSLLSSQTKDHVTHGAATRLQENGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKIC

Query:  LIKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRISNRLGWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINTLLVGFGQTICT
        L+KY GDIP SL++LL LPGIGPK+AHLI+ +AWNDVQGICVDTHVHRI NRLGWVS  G+KQKT++PEETRVAL+ WLPKEEWV IN LLVGFGQ ICT
Subjt:  LIKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRISNRLGWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINTLLVGFGQTICT

Query:  PLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKK
        P+RP+C  CSVS LCP+AFKE+SSPS KLK+S+ +K+
Subjt:  PLRPKCGNCSVSDLCPSAFKESSSPSPKLKRSSSTKK

AT2G43320.1 S-adenosyl-L-methionine-dependent methyltransferases superfamily protein1.6e-15075.71Show/hide
Query:  MRAPSLLAQCLPGLVPQDKGSHSTPSISERDVHLPSPAVEILPSKTAHPYKYAGENVDLHGLNVFKGRVSVADIIGFNNSESVSSKPEGYLKCWDSSIDL
        MRAPSLLAQCLPGL+PQD+G  S  ++SE+D+ LP+PAVEI+PSKT   ++Y+GEN+D  GL VFKG+VSVADIIG + SE+   K EG LK W+SS+ L
Subjt:  MRAPSLLAQCLPGLVPQDKGSHSTPSISERDVHLPSPAVEILPSKTAHPYKYAGENVDLHGLNVFKGRVSVADIIGFNNSESVSSKPEGYLKCWDSSIDL

Query:  VNVLKHEIRDGQLSFRGKRVLELGCSYGLPGVFACLKGASIVHFQDLSAETVRCTTIPNVLANLEQARDRQSRQPESPLTPSRHTLTPSVHFYAGDWEEL
        VNVLK+EIRDGQLSFRGKRVLELGC++G+PG+FACLKGAS VHFQDLSAET+RCTTIPNVLANLEQARDRQSRQPESPLTPSR  ++ SV FYAG+WEEL
Subjt:  VNVLKHEIRDGQLSFRGKRVLELGCSYGLPGVFACLKGASIVHFQDLSAETVRCTTIPNVLANLEQARDRQSRQPESPLTPSRHTLTPSVHFYAGDWEEL

Query:  STVLSVVRGDGFE--VPTGMSLSFSEEDFMDGCSSQDGSIIGH-ESSSRRSRKLSGSRAWERASETDQ-GEGGYDVILMAEIPYSLNSLKKLYALIKRCV
        STVLS++R D  E  +P  M+LSFSEEDFMDGCSSQDGSI G  + SSRRSRKLSGSRAWERA+ET Q GE GYDVILM EIPYS+ SLKKLY+LIK+C+
Subjt:  STVLSVVRGDGFE--VPTGMSLSFSEEDFMDGCSSQDGSIIGH-ESSSRRSRKLSGSRAWERASETDQ-GEGGYDVILMAEIPYSLNSLKKLYALIKRCV

Query:  RPPYGVLYLATKKNYVGFNSGARHLRSLVDEEGVFGAHLVKEMTDRDIWKFFLK
        RPPYGV+YLA KK YVGFNSGA+HLR+LVDEE + GAHLVKE TDRDIWKFFLK
Subjt:  RPPYGVLYLATKKNYVGFNSGARHLRSLVDEEGVFGAHLVKEMTDRDIWKFFLK

AT2G43320.2 S-adenosyl-L-methionine-dependent methyltransferases superfamily protein1.6e-15075.71Show/hide
Query:  MRAPSLLAQCLPGLVPQDKGSHSTPSISERDVHLPSPAVEILPSKTAHPYKYAGENVDLHGLNVFKGRVSVADIIGFNNSESVSSKPEGYLKCWDSSIDL
        MRAPSLLAQCLPGL+PQD+G  S  ++SE+D+ LP+PAVEI+PSKT   ++Y+GEN+D  GL VFKG+VSVADIIG + SE+   K EG LK W+SS+ L
Subjt:  MRAPSLLAQCLPGLVPQDKGSHSTPSISERDVHLPSPAVEILPSKTAHPYKYAGENVDLHGLNVFKGRVSVADIIGFNNSESVSSKPEGYLKCWDSSIDL

Query:  VNVLKHEIRDGQLSFRGKRVLELGCSYGLPGVFACLKGASIVHFQDLSAETVRCTTIPNVLANLEQARDRQSRQPESPLTPSRHTLTPSVHFYAGDWEEL
        VNVLK+EIRDGQLSFRGKRVLELGC++G+PG+FACLKGAS VHFQDLSAET+RCTTIPNVLANLEQARDRQSRQPESPLTPSR  ++ SV FYAG+WEEL
Subjt:  VNVLKHEIRDGQLSFRGKRVLELGCSYGLPGVFACLKGASIVHFQDLSAETVRCTTIPNVLANLEQARDRQSRQPESPLTPSRHTLTPSVHFYAGDWEEL

Query:  STVLSVVRGDGFE--VPTGMSLSFSEEDFMDGCSSQDGSIIGH-ESSSRRSRKLSGSRAWERASETDQ-GEGGYDVILMAEIPYSLNSLKKLYALIKRCV
        STVLS++R D  E  +P  M+LSFSEEDFMDGCSSQDGSI G  + SSRRSRKLSGSRAWERA+ET Q GE GYDVILM EIPYS+ SLKKLY+LIK+C+
Subjt:  STVLSVVRGDGFE--VPTGMSLSFSEEDFMDGCSSQDGSIIGH-ESSSRRSRKLSGSRAWERASETDQ-GEGGYDVILMAEIPYSLNSLKKLYALIKRCV

Query:  RPPYGVLYLATKKNYVGFNSGARHLRSLVDEEGVFGAHLVKEMTDRDIWKFFLK
        RPPYGV+YLA KK YVGFNSGA+HLR+LVDEE + GAHLVKE TDRDIWKFFLK
Subjt:  RPPYGVLYLATKKNYVGFNSGARHLRSLVDEEGVFGAHLVKEMTDRDIWKFFLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGCTCCATCATTACTTGCTCAATGCTTGCCTGGCTTGGTGCCCCAAGATAAAGGAAGCCACAGCACTCCGTCCATTTCAGAGAGAGATGTCCATCTTCCCTCACC
AGCCGTTGAGATTCTCCCTTCAAAGACAGCTCATCCGTACAAATATGCAGGGGAGAATGTAGATTTGCATGGTCTCAATGTTTTTAAGGGTAGAGTTAGCGTTGCCGATA
TAATTGGCTTCAATAACTCAGAATCCGTATCTTCAAAGCCTGAAGGTTATCTGAAATGTTGGGACAGTTCCATTGATCTTGTTAATGTCCTGAAGCACGAGATTCGTGAC
GGACAGCTGAGCTTTAGGGGTAAAAGAGTACTTGAGCTGGGTTGTAGCTATGGACTTCCTGGGGTTTTTGCTTGCCTCAAGGGAGCAAGCATTGTGCACTTTCAAGACCT
CAGTGCAGAAACTGTTAGATGCACAACCATACCGAACGTTCTTGCTAATCTCGAACAAGCTCGGGACAGGCAGAGCAGACAGCCAGAGAGTCCCTTAACTCCGTCCAGAC
ACACTTTGACCCCGTCGGTACATTTTTATGCCGGTGATTGGGAAGAGCTCTCGACGGTCTTATCTGTTGTCAGGGGTGATGGATTTGAAGTTCCTACTGGAATGAGCTTG
AGCTTCTCTGAAGAAGATTTTATGGATGGTTGCAGCAGTCAAGATGGCAGCATCATCGGCCATGAGTCTTCCTCAAGGAGATCGAGGAAATTATCGGGAAGCCGAGCATG
GGAGAGGGCTAGCGAGACCGATCAAGGAGAAGGTGGATATGATGTCATTTTAATGGCAGAAATTCCATATTCTCTCAACTCTTTGAAGAAACTATACGCGCTTATAAAAA
GATGTGTGAGGCCACCATATGGAGTGCTATACTTAGCTACAAAGAAGAATTACGTCGGTTTCAACAGCGGAGCGAGGCATTTGAGGAGTCTGGTCGATGAGGAAGGCGTT
TTTGGAGCTCACTTGGTGAAGGAGATGACCGACCGAGACATTTGGAAGTTCTTTCTCAAGTCTTCCAATGGCGTTTCCGAGTCTGAAACTCGTGTATTCGTGAGGAGAAG
AGTGAAAAAGAATGCGGAAGTTCAAGATAGCGGGCCTCAAGTTGAACCTAATGTCGACGCTAAACGCTGCTGTCCTCCCGATATTGAAGATTTTGCATTCAAAAGAACAA
AGGAATCCCCTGGATCATGGAAGTCAAAGCCTCCACCTCTAGATCCTCTTGTCACAGGAATTGAAGTTTCTAATCCAATTAGACAAAAAGGCATTGCAGAAAGAGGTAAG
CCACCTGTGAATTGGGAAAAAATCCTTGAAGGAATTCGTGAAATGAGATCCTCTGAAGATGCACCAGTAGATACCATGGGATGTGGGCAAGCTGCTAGTACTCTTCCTCC
CAAGGAAAGAAGATTTTCTGTCTTGGCATCTTCTCTTCTCTCAAGCCAAACCAAAGACCACGTGACTCATGGAGCAGCAACACGTCTCCAAGAAAATGGCCTTCTTACTG
CTGATGCTATGGACAAAGCTGATGAAGCAACCATTAAAAGCTTGATATACCCGGTTGGATTTTATTCAACAAAGGCTAGGAATTTGAAGAAGATTGCAAAAATATGTCTT
ATAAAGTATGGTGGGGATATACCTAGGTCATTGGAGGAGCTACTTCTACTACCTGGGATAGGTCCTAAGATTGCACATTTGATCATGATTATGGCATGGAACGATGTTCA
GGGGATATGTGTAGATACTCACGTGCACCGTATTTCCAATCGGCTTGGATGGGTGTCTGGAAAAGGCTCAAAACAGAAAACATCCACTCCTGAAGAAACTAGAGTGGCAT
TAGAACTGTGGCTGCCAAAGGAAGAATGGGTTCCAATTAATACTCTTCTGGTGGGTTTTGGACAGACTATTTGCACTCCTCTTAGACCCAAATGTGGAAATTGCAGTGTA
AGCGACTTGTGCCCATCAGCATTCAAGGAGTCTTCAAGCCCATCTCCCAAATTGAAGCGTTCAAGCTCCACCAAAAAGTTATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGAGCTCCATCATTACTTGCTCAATGCTTGCCTGGCTTGGTGCCCCAAGATAAAGGAAGCCACAGCACTCCGTCCATTTCAGAGAGAGATGTCCATCTTCCCTCACC
AGCCGTTGAGATTCTCCCTTCAAAGACAGCTCATCCGTACAAATATGCAGGGGAGAATGTAGATTTGCATGGTCTCAATGTTTTTAAGGGTAGAGTTAGCGTTGCCGATA
TAATTGGCTTCAATAACTCAGAATCCGTATCTTCAAAGCCTGAAGGTTATCTGAAATGTTGGGACAGTTCCATTGATCTTGTTAATGTCCTGAAGCACGAGATTCGTGAC
GGACAGCTGAGCTTTAGGGGTAAAAGAGTACTTGAGCTGGGTTGTAGCTATGGACTTCCTGGGGTTTTTGCTTGCCTCAAGGGAGCAAGCATTGTGCACTTTCAAGACCT
CAGTGCAGAAACTGTTAGATGCACAACCATACCGAACGTTCTTGCTAATCTCGAACAAGCTCGGGACAGGCAGAGCAGACAGCCAGAGAGTCCCTTAACTCCGTCCAGAC
ACACTTTGACCCCGTCGGTACATTTTTATGCCGGTGATTGGGAAGAGCTCTCGACGGTCTTATCTGTTGTCAGGGGTGATGGATTTGAAGTTCCTACTGGAATGAGCTTG
AGCTTCTCTGAAGAAGATTTTATGGATGGTTGCAGCAGTCAAGATGGCAGCATCATCGGCCATGAGTCTTCCTCAAGGAGATCGAGGAAATTATCGGGAAGCCGAGCATG
GGAGAGGGCTAGCGAGACCGATCAAGGAGAAGGTGGATATGATGTCATTTTAATGGCAGAAATTCCATATTCTCTCAACTCTTTGAAGAAACTATACGCGCTTATAAAAA
GATGTGTGAGGCCACCATATGGAGTGCTATACTTAGCTACAAAGAAGAATTACGTCGGTTTCAACAGCGGAGCGAGGCATTTGAGGAGTCTGGTCGATGAGGAAGGCGTT
TTTGGAGCTCACTTGGTGAAGGAGATGACCGACCGAGACATTTGGAAGTTCTTTCTCAAGTCTTCCAATGGCGTTTCCGAGTCTGAAACTCGTGTATTCGTGAGGAGAAG
AGTGAAAAAGAATGCGGAAGTTCAAGATAGCGGGCCTCAAGTTGAACCTAATGTCGACGCTAAACGCTGCTGTCCTCCCGATATTGAAGATTTTGCATTCAAAAGAACAA
AGGAATCCCCTGGATCATGGAAGTCAAAGCCTCCACCTCTAGATCCTCTTGTCACAGGAATTGAAGTTTCTAATCCAATTAGACAAAAAGGCATTGCAGAAAGAGGTAAG
CCACCTGTGAATTGGGAAAAAATCCTTGAAGGAATTCGTGAAATGAGATCCTCTGAAGATGCACCAGTAGATACCATGGGATGTGGGCAAGCTGCTAGTACTCTTCCTCC
CAAGGAAAGAAGATTTTCTGTCTTGGCATCTTCTCTTCTCTCAAGCCAAACCAAAGACCACGTGACTCATGGAGCAGCAACACGTCTCCAAGAAAATGGCCTTCTTACTG
CTGATGCTATGGACAAAGCTGATGAAGCAACCATTAAAAGCTTGATATACCCGGTTGGATTTTATTCAACAAAGGCTAGGAATTTGAAGAAGATTGCAAAAATATGTCTT
ATAAAGTATGGTGGGGATATACCTAGGTCATTGGAGGAGCTACTTCTACTACCTGGGATAGGTCCTAAGATTGCACATTTGATCATGATTATGGCATGGAACGATGTTCA
GGGGATATGTGTAGATACTCACGTGCACCGTATTTCCAATCGGCTTGGATGGGTGTCTGGAAAAGGCTCAAAACAGAAAACATCCACTCCTGAAGAAACTAGAGTGGCAT
TAGAACTGTGGCTGCCAAAGGAAGAATGGGTTCCAATTAATACTCTTCTGGTGGGTTTTGGACAGACTATTTGCACTCCTCTTAGACCCAAATGTGGAAATTGCAGTGTA
AGCGACTTGTGCCCATCAGCATTCAAGGAGTCTTCAAGCCCATCTCCCAAATTGAAGCGTTCAAGCTCCACCAAAAAGTTATGA
Protein sequenceShow/hide protein sequence
MRAPSLLAQCLPGLVPQDKGSHSTPSISERDVHLPSPAVEILPSKTAHPYKYAGENVDLHGLNVFKGRVSVADIIGFNNSESVSSKPEGYLKCWDSSIDLVNVLKHEIRD
GQLSFRGKRVLELGCSYGLPGVFACLKGASIVHFQDLSAETVRCTTIPNVLANLEQARDRQSRQPESPLTPSRHTLTPSVHFYAGDWEELSTVLSVVRGDGFEVPTGMSL
SFSEEDFMDGCSSQDGSIIGHESSSRRSRKLSGSRAWERASETDQGEGGYDVILMAEIPYSLNSLKKLYALIKRCVRPPYGVLYLATKKNYVGFNSGARHLRSLVDEEGV
FGAHLVKEMTDRDIWKFFLKSSNGVSESETRVFVRRRVKKNAEVQDSGPQVEPNVDAKRCCPPDIEDFAFKRTKESPGSWKSKPPPLDPLVTGIEVSNPIRQKGIAERGK
PPVNWEKILEGIREMRSSEDAPVDTMGCGQAASTLPPKERRFSVLASSLLSSQTKDHVTHGAATRLQENGLLTADAMDKADEATIKSLIYPVGFYSTKARNLKKIAKICL
IKYGGDIPRSLEELLLLPGIGPKIAHLIMIMAWNDVQGICVDTHVHRISNRLGWVSGKGSKQKTSTPEETRVALELWLPKEEWVPINTLLVGFGQTICTPLRPKCGNCSV
SDLCPSAFKESSSPSPKLKRSSSTKKL