; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg10066 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg10066
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Descriptionprotein SIEVE ELEMENT OCCLUSION B-like
Genome locationCarg_Chr17:1007713..1023909
RNA-Seq ExpressionCarg10066
SyntenyCarg10066
Gene Ontology termsGO:0010088 - phloem development (biological process)
InterPro domainsIPR027942 - Sieve element occlusion, N-terminal
IPR027944 - Sieve element occlusion, C-terminal
IPR039299 - Protein SIEVE ELEMENT OCCLUSION


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6574941.1 Protein SIEVE ELEMENT OCCLUSION B, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0089.12Show/hide
Query:  MATALKAPTTSHGLLHPKQSSTSKEEMSVRHYSDELVTGHIYAQHRDDDSTKIDLPNYISVIESIITTADRITETVHRGSEGRVVYSDDSLASNVVIEPP
        MATALKAPTTSHGLLHPKQSSTSKEEMSVRHYSD+LVTGHIYAQHRDDDSTKIDLPNYISVIESIITTADRITETVHRGSEGRVVYSDDSLASNVVIEPP
Subjt:  MATALKAPTTSHGLLHPKQSSTSKEEMSVRHYSDELVTGHIYAQHRDDDSTKIDLPNYISVIESIITTADRITETVHRGSEGRVVYSDDSLASNVVIEPP

Query:  LCTLHRISSELSCKAPGIEKAHETTLKIFEILANYPWEAKAALTLLAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKSLI
        LCTLH ISSELSCKAPGIEKAHETTLKIFEILANYPWEAKAALTLLAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKSLI
Subjt:  LCTLHRISSELSCKAPGIEKAHETTLKIFEILANYPWEAKAALTLLAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKSLI

Query:  QSCLQAIKYMDEIREFSKYDVKELPALPAALRLIPL------------------------NQPQLYLNELSDKITSVLTDIERHLYAIRDLKEEVDLYRW
        QSCLQAIKYMDEIREFSKYDVKELPALPAALRLIPL                        NQPQLYLNELSDKITS+LTDIERHLYAI DLKEEVDLYRW
Subjt:  QSCLQAIKYMDEIREFSKYDVKELPALPAALRLIPL------------------------NQPQLYLNELSDKITSVLTDIERHLYAIRDLKEEVDLYRW

Query:  LVDHIEHYHTGIPLVISKL-----------------------------------------DDIRALHKIYDELKARNANYEIVWIPIIPEPYHEDDRKMY
        LVDHIEHYHTGIPLVISKL                                         DDIRALHKIYDELKARNANYEIVWIPIIPEPYHEDDRKMY
Subjt:  LVDHIEHYHTGIPLVISKL-----------------------------------------DDIRALHKIYDELKARNANYEIVWIPIIPEPYHEDDRKMY

Query:  EYLRSTMKWLSIQFTTKISGMRYIEEKWQFREDPLVVVLNPNSKVEFMNAIHLIRVWENEAIPFTQARTESLLKKHWPESTLLKFTHQPRLPNWIKSQKS
        E+LRSTMKWLSIQFTTKISGMRYIEEKWQFREDPLVVVLN NSKVEFMNAIHLIRVWENEAIPFTQARTESLLKKHWPESTLLKFTHQPRLPNWIKS+KS
Subjt:  EYLRSTMKWLSIQFTTKISGMRYIEEKWQFREDPLVVVLNPNSKVEFMNAIHLIRVWENEAIPFTQARTESLLKKHWPESTLLKFTHQPRLPNWIKSQKS

Query:  IIFYGGKNQAWIQQFEEKVEVLKSDPLIIDGGSFEIVRIGKDAIGEDDPKLMARFWKVQWGYFIVKSQIKGSSASETTEDILRLISYQNEDGWAVLTVGS
        IIFYGGKNQAWIQQFEEKVEVLKSDPLIIDGGSFEIVRIGKDAIGEDDPKLMARFWKVQWGYFIVKSQIKGSSASETTEDILRLISYQNEDGWAVLTVGS
Subjt:  IIFYGGKNQAWIQQFEEKVEVLKSDPLIIDGGSFEIVRIGKDAIGEDDPKLMARFWKVQWGYFIVKSQIKGSSASETTEDILRLISYQNEDGWAVLTVGS

Query:  APVLVGRDVLILRLIDDFPKWKQTLRLKAFPDAFREYFNDLAMKTHQCDRVILPGFSGWIPM
        APVLVGRDVLILRLIDDFPKWKQTLRLKAFPDAFREYFNDLAMKTHQCDRVILPGFSGWIPM
Subjt:  APVLVGRDVLILRLIDDFPKWKQTLRLKAFPDAFREYFNDLAMKTHQCDRVILPGFSGWIPM

KAG7013507.1 Protein SIEVE ELEMENT OCCLUSION B [Cucurbita argyrosperma subsp. argyrosperma]0.0e+00100Show/hide
Query:  MATALKAPTTSHGLLHPKQSSTSKEEMSVRHYSDELVTGHIYAQHRDDDSTKIDLPNYISVIESIITTADRITETVHRGSEGRVVYSDDSLASNVVIEPP
        MATALKAPTTSHGLLHPKQSSTSKEEMSVRHYSDELVTGHIYAQHRDDDSTKIDLPNYISVIESIITTADRITETVHRGSEGRVVYSDDSLASNVVIEPP
Subjt:  MATALKAPTTSHGLLHPKQSSTSKEEMSVRHYSDELVTGHIYAQHRDDDSTKIDLPNYISVIESIITTADRITETVHRGSEGRVVYSDDSLASNVVIEPP

Query:  LCTLHRISSELSCKAPGIEKAHETTLKIFEILANYPWEAKAALTLLAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKSLI
        LCTLHRISSELSCKAPGIEKAHETTLKIFEILANYPWEAKAALTLLAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKSLI
Subjt:  LCTLHRISSELSCKAPGIEKAHETTLKIFEILANYPWEAKAALTLLAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKSLI

Query:  QSCLQAIKYMDEIREFSKYDVKELPALPAALRLIPLNQPQLYLNELSDKITSVLTDIERHLYAIRDLKEEVDLYRWLVDHIEHYHTGIPLVISKLDDIRA
        QSCLQAIKYMDEIREFSKYDVKELPALPAALRLIPLNQPQLYLNELSDKITSVLTDIERHLYAIRDLKEEVDLYRWLVDHIEHYHTGIPLVISKLDDIRA
Subjt:  QSCLQAIKYMDEIREFSKYDVKELPALPAALRLIPLNQPQLYLNELSDKITSVLTDIERHLYAIRDLKEEVDLYRWLVDHIEHYHTGIPLVISKLDDIRA

Query:  LHKIYDELKARNANYEIVWIPIIPEPYHEDDRKMYEYLRSTMKWLSIQFTTKISGMRYIEEKWQFREDPLVVVLNPNSKVEFMNAIHLIRVWENEAIPFT
        LHKIYDELKARNANYEIVWIPIIPEPYHEDDRKMYEYLRSTMKWLSIQFTTKISGMRYIEEKWQFREDPLVVVLNPNSKVEFMNAIHLIRVWENEAIPFT
Subjt:  LHKIYDELKARNANYEIVWIPIIPEPYHEDDRKMYEYLRSTMKWLSIQFTTKISGMRYIEEKWQFREDPLVVVLNPNSKVEFMNAIHLIRVWENEAIPFT

Query:  QARTESLLKKHWPESTLLKFTHQPRLPNWIKSQKSIIFYGGKNQAWIQQFEEKVEVLKSDPLIIDGGSFEIVRIGKDAIGEDDPKLMARFWKVQWGYFIV
        QARTESLLKKHWPESTLLKFTHQPRLPNWIKSQKSIIFYGGKNQAWIQQFEEKVEVLKSDPLIIDGGSFEIVRIGKDAIGEDDPKLMARFWKVQWGYFIV
Subjt:  QARTESLLKKHWPESTLLKFTHQPRLPNWIKSQKSIIFYGGKNQAWIQQFEEKVEVLKSDPLIIDGGSFEIVRIGKDAIGEDDPKLMARFWKVQWGYFIV

Query:  KSQIKGSSASETTEDILRLISYQNEDGWAVLTVGSAPVLVGRDVLILRLIDDFPKWKQTLRLKAFPDAFREYFNDLAMKTHQCDRVILPGFSGWIPMPPT
        KSQIKGSSASETTEDILRLISYQNEDGWAVLTVGSAPVLVGRDVLILRLIDDFPKWKQTLRLKAFPDAFREYFNDLAMKTHQCDRVILPGFSGWIPMPPT
Subjt:  KSQIKGSSASETTEDILRLISYQNEDGWAVLTVGSAPVLVGRDVLILRLIDDFPKWKQTLRLKAFPDAFREYFNDLAMKTHQCDRVILPGFSGWIPMPPT

Query:  VASALLKQPTAMTKEESSMKYYSDDLVTGYIYDKHRDDDTTKIDLPHYISVIENIMTLANRITDAVLRGTDGRLVPSDESLTSNVSIEPPLCALHNITSE
        VASALLKQPTAMTKEESSMKYYSDDLVTGYIYDKHRDDDTTKIDLPHYISVIENIMTLANRITDAVLRGTDGRLVPSDESLTSNVSIEPPLCALHNITSE
Subjt:  VASALLKQPTAMTKEESSMKYYSDDLVTGYIYDKHRDDDTTKIDLPHYISVIENIMTLANRITDAVLRGTDGRLVPSDESLTSNVSIEPPLCALHNITSE

Query:  LSCKAPGIENAHEITLKIFELLANYPWEAKAALTLIAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVAMLKKHLDSLRYRQVLLSPNSLINSCLQAIKYM
        LSCKAPGIENAHEITLKIFELLANYPWEAKAALTLIAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVAMLKKHLDSLRYRQVLLSPNSLINSCLQAIKYM
Subjt:  LSCKAPGIENAHEITLKIFELLANYPWEAKAALTLIAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVAMLKKHLDSLRYRQVLLSPNSLINSCLQAIKYM

Query:  NQIREFSKYDVKELPELPAALRQIPLITYWVIHTIVASRIEISSYLSETENQSQKYLNELSEKIAIVLAVLEKHLDAIREQYEEVDLYRWLVDHIEHYHT
        NQIREFSKYDVKELPELPAALRQIPLITYWVIHTIVASRIEISSYLSETENQSQKYLNELSEKIAIVLAVLEKHLDAIREQYEEVDLYRWLVDHIEHYHT
Subjt:  NQIREFSKYDVKELPELPAALRQIPLITYWVIHTIVASRIEISSYLSETENQSQKYLNELSEKIAIVLAVLEKHLDAIREQYEEVDLYRWLVDHIEHYHT

Query:  DITLVMSKLLSGKIEAKPLIDGSTLREVSIQESLAGKNVVLVISELNISDDDMRALHQVYNELKRDNKHEIVWIPIIPERFLEEDRRRYEYLRSTMKWYS
        DITLVMSKLLSGKIEAKPLIDGSTLREVSIQESLAGKNVVLVISELNISDDDMRALHQVYNELKRDNKHEIVWIPIIPERFLEEDRRRYEYLRSTMKWYS
Subjt:  DITLVMSKLLSGKIEAKPLIDGSTLREVSIQESLAGKNVVLVISELNISDDDMRALHQVYNELKRDNKHEIVWIPIIPERFLEEDRRRYEYLRSTMKWYS

Query:  MQFTTRVVGMRYIEEKWQLREDPLVVVLNPQSKVEFTNAIHLIRVWGTEAIPFTHNRTELLLRKHWPESTLVKFTHQPRLLSWAPTGAAPSLLQSKHAST
        MQFTTRVVGMRYIEEKWQLREDPLVVVLNPQSKVEFTNAIHLIRVWGTEAIPFTHNRTELLLRKHWPESTLVKFTHQPRLLSWAPTGAAPSLLQSKHAST
Subjt:  MQFTTRVVGMRYIEEKWQLREDPLVVVLNPQSKVEFTNAIHLIRVWGTEAIPFTHNRTELLLRKHWPESTLVKFTHQPRLLSWAPTGAAPSLLQSKHAST

Query:  HKEEVGTKHFPDELVTGHIYAKHRDDDSTKIDLPSYISVIENIITTADQIIDTVHRGIDGRLVHSDASLAFNVVIEPPLCTLHRISSELSCKAPGIEKAH
        HKEEVGTKHFPDELVTGHIYAKHRDDDSTKIDLPSYISVIENIITTADQIIDTVHRGIDGRLVHSDASLAFNVVIEPPLCTLHRISSELSCKAPGIEKAH
Subjt:  HKEEVGTKHFPDELVTGHIYAKHRDDDSTKIDLPSYISVIENIITTADQIIDTVHRGIDGRLVHSDASLAFNVVIEPPLCTLHRISSELSCKAPGIEKAH

Query:  ETTLEIFEILANYPWEAKAALTLIAFAADYGDLWHLHHYSHADPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKSLIQSCLQAIKYMDEIREFSKYDVK
        ETTLEIFEILANYPWEAKAALTLIAFAADYGDLWHLHHYSHADPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKSLIQSCLQAIKYMDEIREFSKYDVK
Subjt:  ETTLEIFEILANYPWEAKAALTLIAFAADYGDLWHLHHYSHADPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKSLIQSCLQAIKYMDEIREFSKYDVK

Query:  ELSELPAALRQIPLESATEIFE
        ELSELPAALRQIPLESATEIFE
Subjt:  ELSELPAALRQIPLESATEIFE

XP_022959049.1 protein SIEVE ELEMENT OCCLUSION B-like [Cucurbita moschata]0.0e+0089.27Show/hide
Query:  MATALKAPTTSHGLLHPKQSSTSKEEMSVRHYSDELVTGHIYAQHRDDDSTKIDLPNYISVIESIITTADRITETVHRGSEGRVVYSDDSLASNVVIEPP
        MATALKAPTTSHGLLHPKQSSTSKEEMSVRHYSD+LVTGHIYAQHRDDDSTKIDLPNYIS+IESIITTADRITETVHRGSEGRVVYSDDSLASNVVIEPP
Subjt:  MATALKAPTTSHGLLHPKQSSTSKEEMSVRHYSDELVTGHIYAQHRDDDSTKIDLPNYISVIESIITTADRITETVHRGSEGRVVYSDDSLASNVVIEPP

Query:  LCTLHRISSELSCKAPGIEKAHETTLKIFEILANYPWEAKAALTLLAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKSLI
        LCTLH ISSELSCKAPG+EKAHETTLKIFEILANYPWEAKAALTLLAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKSLI
Subjt:  LCTLHRISSELSCKAPGIEKAHETTLKIFEILANYPWEAKAALTLLAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKSLI

Query:  QSCLQAIKYMDEIREFSKYDVKELPALPAALRLIPL------------------------NQPQLYLNELSDKITSVLTDIERHLYAIRDLKEEVDLYRW
        QSCLQAIKYMDEIREFSKYDVKEL ALPAALRLIPL                        NQPQLYLNELSDKITSVLTDIERHLYAIRDLKEEVDLYRW
Subjt:  QSCLQAIKYMDEIREFSKYDVKELPALPAALRLIPL------------------------NQPQLYLNELSDKITSVLTDIERHLYAIRDLKEEVDLYRW

Query:  LVDHIEHYHTGIPLVISKL-----------------------------------------DDIRALHKIYDELKARNANYEIVWIPIIPEPYHEDDRKMY
        LVDHIEHYHTGIPLVISKL                                         DDIRALHKIYDELKARNANYEIVWIPIIPEPYHEDDRKMY
Subjt:  LVDHIEHYHTGIPLVISKL-----------------------------------------DDIRALHKIYDELKARNANYEIVWIPIIPEPYHEDDRKMY

Query:  EYLRSTMKWLSIQFTTKISGMRYIEEKWQFREDPLVVVLNPNSKVEFMNAIHLIRVWENEAIPFTQARTESLLKKHWPESTLLKFTHQPRLPNWIKSQKS
        EYLRSTMKWLSIQFTTKISGMRYIEEKWQFREDPLVVVLNPNSKVEFMNAIHLIRVWENEAIPFTQARTESLLKKHWPESTLLKFTHQPRLPNWIKSQKS
Subjt:  EYLRSTMKWLSIQFTTKISGMRYIEEKWQFREDPLVVVLNPNSKVEFMNAIHLIRVWENEAIPFTQARTESLLKKHWPESTLLKFTHQPRLPNWIKSQKS

Query:  IIFYGGKNQAWIQQFEEKVEVLKSDPLIIDGGSFEIVRIGKDAIGEDDPKLMARFWKVQWGYFIVKSQIKGSSASETTEDILRLISYQNEDGWAVLTVGS
        IIFYGGKNQAWIQQFEEKVEVLKSDPLIIDGGSFEIVRIGKDAIGEDDPKLMARFWKVQWGYFIVKSQIKGSSASETTEDILRLISYQNEDGWAVLTVGS
Subjt:  IIFYGGKNQAWIQQFEEKVEVLKSDPLIIDGGSFEIVRIGKDAIGEDDPKLMARFWKVQWGYFIVKSQIKGSSASETTEDILRLISYQNEDGWAVLTVGS

Query:  APVLVGRDVLILRLIDDFPKWKQTLRLKAFPDAFREYFNDLAMKTHQCDRVILPGFSGWIPM
        APVL+GRDVLILRLIDDFPKWKQTLRLKAFPDAFREYFNDLAMKTHQCDRVILPGFSGWIPM
Subjt:  APVLVGRDVLILRLIDDFPKWKQTLRLKAFPDAFREYFNDLAMKTHQCDRVILPGFSGWIPM

XP_023006704.1 protein SIEVE ELEMENT OCCLUSION B-like [Cucurbita maxima]0.0e+0082.02Show/hide
Query:  MATALKAPTTSHGLLHPKQSSTSKEEMSVRHYSDELVTGHIYAQHRDDDSTKIDLPNYISVIESIITTADRITETVHRGSEGRVVYSDDSLASNVVIEPP
        MATALKAP T+ GLLHPKQSSTSKEE+S+RHYSDELVTGHIYA+H DDDSTKIDLPNYISVIESIITTADRITETVHRGSEGR+VYSDDSLASNVVIEPP
Subjt:  MATALKAPTTSHGLLHPKQSSTSKEEMSVRHYSDELVTGHIYAQHRDDDSTKIDLPNYISVIESIITTADRITETVHRGSEGRVVYSDDSLASNVVIEPP

Query:  LCTLHRISSELSCKAPGIEKAHETTLKIFEILANYPWEAKAALTLLAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKSLI
        LCTLHRISSELSCKAPGIEKAHETTLKIFEILANYPWEAKAALTLLAFA DYGDLWHLYHYSHTDPLAKSLA+IKRVATLKKHLDSLRYRQVLLNPKSLI
Subjt:  LCTLHRISSELSCKAPGIEKAHETTLKIFEILANYPWEAKAALTLLAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKSLI

Query:  QSCLQAIKYMDEIREFSKYDVKELPALPAALRLIPL------------------------NQPQLYLNELSDKITSVLTDIERHLYAIRDLKEEVDLYRW
        QSCLQAIKYM+EIREFSKYDVKELP LPAALRLIPL                        NQPQLYLNELSDKI  VL +IERHL AIR  ++EVDLYRW
Subjt:  QSCLQAIKYMDEIREFSKYDVKELPALPAALRLIPL------------------------NQPQLYLNELSDKITSVLTDIERHLYAIRDLKEEVDLYRW

Query:  LVDHIEHYHTGIPLVISKL-----------------------------------------DDIRALHKIYDELKARNANYEIVWIPIIPEPYHEDDRKMY
        LVDH+EHYHTGIPLV+SKL                                         DDIRALHKIYDELKAR+ANYEIVWIPI PEPYHEDD + Y
Subjt:  LVDHIEHYHTGIPLVISKL-----------------------------------------DDIRALHKIYDELKARNANYEIVWIPIIPEPYHEDDRKMY

Query:  EYLRSTMKWLSIQFTTKISGMRYIEEKWQFREDPLVVVLNPNSKVEFMNAIHLIRVWENEAIPFTQARTESLLKKHWPESTLLKFTHQPRLPNWIKSQKS
        EYLRS+MKW SIQFTTKISGMRYIEEKWQFREDPLVVVLNP SKVEFMNAIHLIRVWENEAIPFT ARTE LLKKHWPESTLLKFTHQPRLPNWI SQKS
Subjt:  EYLRSTMKWLSIQFTTKISGMRYIEEKWQFREDPLVVVLNPNSKVEFMNAIHLIRVWENEAIPFTQARTESLLKKHWPESTLLKFTHQPRLPNWIKSQKS

Query:  IIFYGGKNQAWIQQFEEKVEVLKSDPLIIDGGSFEIVRIGKDAIGEDDPKLMARFWKVQWGYFIVKSQIKGSSASETTEDILRLISYQNEDGWAVLTVGS
        IIFYGGK+Q+WIQQFEEKVE+LK DPLII+GGSFEIVRIGKDA  EDDPKLMARFWKVQWGYF+VKSQIKGSSASETTEDILRLISYQNEDGWAVLTVG 
Subjt:  IIFYGGKNQAWIQQFEEKVEVLKSDPLIIDGGSFEIVRIGKDAIGEDDPKLMARFWKVQWGYFIVKSQIKGSSASETTEDILRLISYQNEDGWAVLTVGS

Query:  APVLVGRDVLILRLIDDFPKWKQTLRLKAFPDAFREYFNDLAMKTHQCDRVILPGFSGWIPM
        APVLVGRD+LILRL+DDFP+WK TLRLKAFPDAFR+YFNDLAMK H+CD++ LPGFSG IPM
Subjt:  APVLVGRDVLILRLIDDFPKWKQTLRLKAFPDAFREYFNDLAMKTHQCDRVILPGFSGWIPM

XP_023522594.1 protein SIEVE ELEMENT OCCLUSION B-like [Cucurbita pepo subsp. pepo]0.0e+0083.23Show/hide
Query:  MATALKAPTTSHGLLHPKQSSTSKEEMSVRHYSDELVTGHIYAQHRDDDSTKIDLPNYISVIESIITTADRITETVHRGSEGRVVYSDDSLASNVVIEPP
        MATALKAPTTSHGLLHPKQSSTSKEEMS+RHYSDELVTGHIYA+H DDDSTKIDLPNYISVIESIITTADRITETVHRGSEGRVVYSDDSLASNVVIEPP
Subjt:  MATALKAPTTSHGLLHPKQSSTSKEEMSVRHYSDELVTGHIYAQHRDDDSTKIDLPNYISVIESIITTADRITETVHRGSEGRVVYSDDSLASNVVIEPP

Query:  LCTLHRISSELSCKAPGIEKAHETTLKIFEILANYPWEAKAALTLLAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKSLI
        LCTLH ISSELSCKAPGIEKAHETTLKIFEIL NYPWEAKAALTLLAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKSLI
Subjt:  LCTLHRISSELSCKAPGIEKAHETTLKIFEILANYPWEAKAALTLLAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKSLI

Query:  QSCLQAIKYMDEIREFSKYDVKELPALPAALRLIPL------------------------NQPQLYLNELSDKITSVLTDIERHLYAIRDLKEEVDLYRW
        QSCLQAIKYMDEIREFSKYDVKELPALPAALRLIPL                        NQPQLYLNELSDKITSVLTDIERHLY IR   EEVDLYRW
Subjt:  QSCLQAIKYMDEIREFSKYDVKELPALPAALRLIPL------------------------NQPQLYLNELSDKITSVLTDIERHLYAIRDLKEEVDLYRW

Query:  LVDHIEHYHTGIPLVISKL-----------------------------------------DDIRALHKIYDELKARNANYEIVWIPIIPEPYHEDDRKMY
        LVDHIEHYHT IPLV SKL                                         DDIRALHKIYDELKARNANYEIVWIPIIPEPYH DD K Y
Subjt:  LVDHIEHYHTGIPLVISKL-----------------------------------------DDIRALHKIYDELKARNANYEIVWIPIIPEPYHEDDRKMY

Query:  EYLRSTMKWLSIQFTTKISGMRYIEEKWQFREDPLVVVLNPNSKVEFMNAIHLIRVWENEAIPFTQARTESLLKKHWPESTLLKFTHQPRLPNWIKSQKS
        EYLRSTMKW SIQFTT+ISGMRYI+EKWQFREDPLVVVLNPNSKVEFMNAIHLIRVWENEAIPFTQARTESLLKKHWPESTL KFTHQPRLPN IKSQKS
Subjt:  EYLRSTMKWLSIQFTTKISGMRYIEEKWQFREDPLVVVLNPNSKVEFMNAIHLIRVWENEAIPFTQARTESLLKKHWPESTLLKFTHQPRLPNWIKSQKS

Query:  IIFYGGKNQAWIQQFEEKVEVLKSDPLIIDGGSFEIVRIGKDAIGEDDPKLMARFWKVQWGYFIVKSQIKGSSASETTEDILRLISYQNEDGWAVLTVGS
        IIFYGG+N +WIQQFEEKVE+LK+DPLIIDGGSFEIVRIGKDAIGEDDPKLMARFWKVQWGYFIVKSQIKG SASET+EDILRL+SY+NEDGWAV+T+GS
Subjt:  IIFYGGKNQAWIQQFEEKVEVLKSDPLIIDGGSFEIVRIGKDAIGEDDPKLMARFWKVQWGYFIVKSQIKGSSASETTEDILRLISYQNEDGWAVLTVGS

Query:  APVLVGRDVLILRLIDDFPKWKQTLRLKAFPDAFREYFNDLAMKTHQCDRVILPGFSGWIPM
        AP+LVG D+LILRL DDFP WKQ L  KAFP+AFR+ FN+LA+KTH+CDRVI P FSGWIPM
Subjt:  APVLVGRDVLILRLIDDFPKWKQTLRLKAFPDAFREYFNDLAMKTHQCDRVILPGFSGWIPM

TrEMBL top hitse value%identityAlignment
A0A0A0LIA4 Uncharacterized protein5.1e-28173.95Show/hide
Query:  MATALKAPTT-SHGLLHPKQS-STSKEEMSVRHYSDELVTGHIYAQHRDDDSTKIDLPNYISVIESIITTADRITETVHRGSEGRVVYSDDSLASNVVIE
        MAT  KAPT  +  LLH KQS +T+KEE+S RHYSDE+VT HIYA+HRDDD+ KIDL NYISVIESIITTADRIT+TVHRGSEGR+VYS+DSLAS  VIE
Subjt:  MATALKAPTT-SHGLLHPKQS-STSKEEMSVRHYSDELVTGHIYAQHRDDDSTKIDLPNYISVIESIITTADRITETVHRGSEGRVVYSDDSLASNVVIE

Query:  PPLCTLHRISSELSCKAPGIEKAHETTLKIFEILANYPWEAKAALTLLAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKS
        PPLCTLHRISSELSCK PGIEKAHETT++IFEILANYPWEAKAALTLLAFA DYGDLWHLYHYS  DPLAKSLAIIK+VATLKKHLDSLRYRQVLLNPKS
Subjt:  PPLCTLHRISSELSCKAPGIEKAHETTLKIFEILANYPWEAKAALTLLAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKS

Query:  LIQSCLQAIKYMDEIREFSKYDVKELPALPAALRLIPL------------------------NQPQLYLNELSDKITSVLTDIERHLYAIRDLKEEVDLY
        LIQSCLQAIKYM+EI+EF+KYDVKELP LPAALRLIPL                        NQPQ YLNELS+KI  VL ++E+HL AIR   EEVDLY
Subjt:  LIQSCLQAIKYMDEIREFSKYDVKELPALPAALRLIPL------------------------NQPQLYLNELSDKITSVLTDIERHLYAIRDLKEEVDLY

Query:  RWLVDHIEHYHTGIPLVISKL-----------------------------------------DDIRALHKIYDELKARNANYEIVWIPIIPEPYHEDDRK
        RWLVDHIEHYHT I LV+ KL                                         DDI+A HKIY+ELK R+  YEIVWIPIIPEPY E+DRK
Subjt:  RWLVDHIEHYHTGIPLVISKL-----------------------------------------DDIRALHKIYDELKARNANYEIVWIPIIPEPYHEDDRK

Query:  MYEYLRSTMKWLSIQFTTKISGMRYIEEKWQFREDPLVVVLNPNSKVEFMNAIHLIRVWENEAIPFTQARTESLLKKHWPESTLLKFTHQPRLPNWIKSQ
         YEYLRSTMKW S++FTTKISGMRYIEEKWQ REDPLVVVLNP SKVEFMNAIHLIRVWENEA PFT  RT++LL+++WPESTL+KFTHQPRL NWI   
Subjt:  MYEYLRSTMKWLSIQFTTKISGMRYIEEKWQFREDPLVVVLNPNSKVEFMNAIHLIRVWENEAIPFTQARTESLLKKHWPESTLLKFTHQPRLPNWIKSQ

Query:  KSIIFYGGKNQAWIQQFEEKVEVLKSDPLIIDGGSFEIVRIGKDAIGEDDPKLMARFWKVQWGYFIVKSQIKGSSASETTEDILRLISYQNEDGWAVLTV
        K+I+FYGGK+  WIQQFEE+ E+L+SDPLI+DGGSFEIVRIGKDAIG+DDP LMARFW  QWGYF+VKSQIKGSSASETTEDILRLISYQNEDGW VLTV
Subjt:  KSIIFYGGKNQAWIQQFEEKVEVLKSDPLIIDGGSFEIVRIGKDAIGEDDPKLMARFWKVQWGYFIVKSQIKGSSASETTEDILRLISYQNEDGWAVLTV

Query:  GSAPVLVGRDVLILRLIDDFPKWKQTLRLKAFPDAFREYFNDLAMKTHQCDRVILPGFSGWIPM
        G+APVLVGR +LIL+L++DFPKWKQ LR+KAFPD FREYFN+LA  +HQCDRVILPGFSGWIPM
Subjt:  GSAPVLVGRDVLILRLIDDFPKWKQTLRLKAFPDAFREYFNDLAMKTHQCDRVILPGFSGWIPM

A0A1S3CNB5 protein SIEVE ELEMENT OCCLUSION B-like7.8e-28273.8Show/hide
Query:  MATALKAPTT-SHGLLHPKQSS-TSKEEMSVRHYSDELVTGHIYAQHRDDDSTKIDLPNYISVIESIITTADRITETVHRGSEGRVVYSDDSLASNVVIE
        MAT  KAPTT +  LLH KQS+ T KEE+S RHYSDE+VTGHIYA+HRDDD+TKIDL +YISVIESIITTADRIT+TVHRGSEGR+VYS+DSLAS  VIE
Subjt:  MATALKAPTT-SHGLLHPKQSS-TSKEEMSVRHYSDELVTGHIYAQHRDDDSTKIDLPNYISVIESIITTADRITETVHRGSEGRVVYSDDSLASNVVIE

Query:  PPLCTLHRISSELSCKAPGIEKAHETTLKIFEILANYPWEAKAALTLLAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKS
        PPLCTLH ISSELSCKAPGIEKAHETT++IFEILANYPWEAKAALTLLAFATDYGDLWHLYHYS  DPLAKSLAIIK+V TLKKHLDSLRYRQVLLNPKS
Subjt:  PPLCTLHRISSELSCKAPGIEKAHETTLKIFEILANYPWEAKAALTLLAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKS

Query:  LIQSCLQAIKYMDEIREFSKYDVKELPALPAALRLIPL------------------------NQPQLYLNELSDKITSVLTDIERHLYAIRDLKEEVDLY
        LIQSCLQA+KYM EI+EFSKYD KELP LPAALRLIPL                        NQPQ YLNELS+KI  VL ++E+HL AIR+  EEVDLY
Subjt:  LIQSCLQAIKYMDEIREFSKYDVKELPALPAALRLIPL------------------------NQPQLYLNELSDKITSVLTDIERHLYAIRDLKEEVDLY

Query:  RWLVDHIEHYHTGIPLVISKL-----------------------------------------DDIRALHKIYDELKARNANYEIVWIPIIPEPYHEDDRK
        RWLVDHIEHYHT I  VI+KL                                         DDIRA HKIY+ELK R+  YEIVW+PII EPY E+DRK
Subjt:  RWLVDHIEHYHTGIPLVISKL-----------------------------------------DDIRALHKIYDELKARNANYEIVWIPIIPEPYHEDDRK

Query:  MYEYLRSTMKWLSIQFTTKISGMRYIEEKWQFREDPLVVVLNPNSKVEFMNAIHLIRVWENEAIPFTQARTESLLKKHWPESTLLKFTHQPRLPNWIKSQ
         YEYLRSTMKW S++FTTKISGMRYIEEKWQ REDPLVVVLNP SKVEFMNAIHL+RVWENEAIPFT  RT++LL+++WPESTL+KFTHQPRL NWI   
Subjt:  MYEYLRSTMKWLSIQFTTKISGMRYIEEKWQFREDPLVVVLNPNSKVEFMNAIHLIRVWENEAIPFTQARTESLLKKHWPESTLLKFTHQPRLPNWIKSQ

Query:  KSIIFYGGKNQAWIQQFEEKVEVLKSDPLIIDGGSFEIVRIGKDAIGEDDPKLMARFWKVQWGYFIVKSQIKGSSASETTEDILRLISYQNEDGWAVLTV
        KSI+FYGGKN  WIQQFEE+ E+L+SDPLI+DGGSFEIVRIGKDA G+DDP LMARFW  QWGYF+VKSQIKGSSASETTEDILRLISYQNEDGW VLTV
Subjt:  KSIIFYGGKNQAWIQQFEEKVEVLKSDPLIIDGGSFEIVRIGKDAIGEDDPKLMARFWKVQWGYFIVKSQIKGSSASETTEDILRLISYQNEDGWAVLTV

Query:  GSAPVLVGRDVLILRLIDDFPKWKQTLRLKAFPDAFREYFNDLAMKTHQCDRVILPGFSGWIPM
        GSAPVLVGR +LIL+L++++PKWKQ+LR+KAFPD  REYFN+LA+++HQCDRVILPGFSGWIPM
Subjt:  GSAPVLVGRDVLILRLIDDFPKWKQTLRLKAFPDAFREYFNDLAMKTHQCDRVILPGFSGWIPM

A0A5D3E2S2 Protein SIEVE ELEMENT OCCLUSION B-like7.8e-28273.8Show/hide
Query:  MATALKAPTT-SHGLLHPKQSS-TSKEEMSVRHYSDELVTGHIYAQHRDDDSTKIDLPNYISVIESIITTADRITETVHRGSEGRVVYSDDSLASNVVIE
        MAT  KAPTT +  LLH KQS+ T KEE+S RHYSDE+VTGHIYA+HRDDD+TKIDL +YISVIESIITTADRIT+TVHRGSEGR+VYS+DSLAS  VIE
Subjt:  MATALKAPTT-SHGLLHPKQSS-TSKEEMSVRHYSDELVTGHIYAQHRDDDSTKIDLPNYISVIESIITTADRITETVHRGSEGRVVYSDDSLASNVVIE

Query:  PPLCTLHRISSELSCKAPGIEKAHETTLKIFEILANYPWEAKAALTLLAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKS
        PPLCTLH ISSELSCKAPGIEKAHETT++IFEILANYPWEAKAALTLLAFATDYGDLWHLYHYS  DPLAKSLAIIK+V TLKKHLDSLRYRQVLLNPKS
Subjt:  PPLCTLHRISSELSCKAPGIEKAHETTLKIFEILANYPWEAKAALTLLAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKS

Query:  LIQSCLQAIKYMDEIREFSKYDVKELPALPAALRLIPL------------------------NQPQLYLNELSDKITSVLTDIERHLYAIRDLKEEVDLY
        LIQSCLQA+KYM EI+EFSKYD KELP LPAALRLIPL                        NQPQ YLNELS+KI  VL ++E+HL AIR+  EEVDLY
Subjt:  LIQSCLQAIKYMDEIREFSKYDVKELPALPAALRLIPL------------------------NQPQLYLNELSDKITSVLTDIERHLYAIRDLKEEVDLY

Query:  RWLVDHIEHYHTGIPLVISKL-----------------------------------------DDIRALHKIYDELKARNANYEIVWIPIIPEPYHEDDRK
        RWLVDHIEHYHT I  VI+KL                                         DDIRA HKIY+ELK R+  YEIVW+PII EPY E+DRK
Subjt:  RWLVDHIEHYHTGIPLVISKL-----------------------------------------DDIRALHKIYDELKARNANYEIVWIPIIPEPYHEDDRK

Query:  MYEYLRSTMKWLSIQFTTKISGMRYIEEKWQFREDPLVVVLNPNSKVEFMNAIHLIRVWENEAIPFTQARTESLLKKHWPESTLLKFTHQPRLPNWIKSQ
         YEYLRSTMKW S++FTTKISGMRYIEEKWQ REDPLVVVLNP SKVEFMNAIHL+RVWENEAIPFT  RT++LL+++WPESTL+KFTHQPRL NWI   
Subjt:  MYEYLRSTMKWLSIQFTTKISGMRYIEEKWQFREDPLVVVLNPNSKVEFMNAIHLIRVWENEAIPFTQARTESLLKKHWPESTLLKFTHQPRLPNWIKSQ

Query:  KSIIFYGGKNQAWIQQFEEKVEVLKSDPLIIDGGSFEIVRIGKDAIGEDDPKLMARFWKVQWGYFIVKSQIKGSSASETTEDILRLISYQNEDGWAVLTV
        KSI+FYGGKN  WIQQFEE+ E+L+SDPLI+DGGSFEIVRIGKDA G+DDP LMARFW  QWGYF+VKSQIKGSSASETTEDILRLISYQNEDGW VLTV
Subjt:  KSIIFYGGKNQAWIQQFEEKVEVLKSDPLIIDGGSFEIVRIGKDAIGEDDPKLMARFWKVQWGYFIVKSQIKGSSASETTEDILRLISYQNEDGWAVLTV

Query:  GSAPVLVGRDVLILRLIDDFPKWKQTLRLKAFPDAFREYFNDLAMKTHQCDRVILPGFSGWIPM
        GSAPVLVGR +LIL+L++++PKWKQ+LR+KAFPD  REYFN+LA+++HQCDRVILPGFSGWIPM
Subjt:  GSAPVLVGRDVLILRLIDDFPKWKQTLRLKAFPDAFREYFNDLAMKTHQCDRVILPGFSGWIPM

A0A6J1H6V1 protein SIEVE ELEMENT OCCLUSION B-like0.0e+0089.27Show/hide
Query:  MATALKAPTTSHGLLHPKQSSTSKEEMSVRHYSDELVTGHIYAQHRDDDSTKIDLPNYISVIESIITTADRITETVHRGSEGRVVYSDDSLASNVVIEPP
        MATALKAPTTSHGLLHPKQSSTSKEEMSVRHYSD+LVTGHIYAQHRDDDSTKIDLPNYIS+IESIITTADRITETVHRGSEGRVVYSDDSLASNVVIEPP
Subjt:  MATALKAPTTSHGLLHPKQSSTSKEEMSVRHYSDELVTGHIYAQHRDDDSTKIDLPNYISVIESIITTADRITETVHRGSEGRVVYSDDSLASNVVIEPP

Query:  LCTLHRISSELSCKAPGIEKAHETTLKIFEILANYPWEAKAALTLLAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKSLI
        LCTLH ISSELSCKAPG+EKAHETTLKIFEILANYPWEAKAALTLLAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKSLI
Subjt:  LCTLHRISSELSCKAPGIEKAHETTLKIFEILANYPWEAKAALTLLAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKSLI

Query:  QSCLQAIKYMDEIREFSKYDVKELPALPAALRLIPL------------------------NQPQLYLNELSDKITSVLTDIERHLYAIRDLKEEVDLYRW
        QSCLQAIKYMDEIREFSKYDVKEL ALPAALRLIPL                        NQPQLYLNELSDKITSVLTDIERHLYAIRDLKEEVDLYRW
Subjt:  QSCLQAIKYMDEIREFSKYDVKELPALPAALRLIPL------------------------NQPQLYLNELSDKITSVLTDIERHLYAIRDLKEEVDLYRW

Query:  LVDHIEHYHTGIPLVISKL-----------------------------------------DDIRALHKIYDELKARNANYEIVWIPIIPEPYHEDDRKMY
        LVDHIEHYHTGIPLVISKL                                         DDIRALHKIYDELKARNANYEIVWIPIIPEPYHEDDRKMY
Subjt:  LVDHIEHYHTGIPLVISKL-----------------------------------------DDIRALHKIYDELKARNANYEIVWIPIIPEPYHEDDRKMY

Query:  EYLRSTMKWLSIQFTTKISGMRYIEEKWQFREDPLVVVLNPNSKVEFMNAIHLIRVWENEAIPFTQARTESLLKKHWPESTLLKFTHQPRLPNWIKSQKS
        EYLRSTMKWLSIQFTTKISGMRYIEEKWQFREDPLVVVLNPNSKVEFMNAIHLIRVWENEAIPFTQARTESLLKKHWPESTLLKFTHQPRLPNWIKSQKS
Subjt:  EYLRSTMKWLSIQFTTKISGMRYIEEKWQFREDPLVVVLNPNSKVEFMNAIHLIRVWENEAIPFTQARTESLLKKHWPESTLLKFTHQPRLPNWIKSQKS

Query:  IIFYGGKNQAWIQQFEEKVEVLKSDPLIIDGGSFEIVRIGKDAIGEDDPKLMARFWKVQWGYFIVKSQIKGSSASETTEDILRLISYQNEDGWAVLTVGS
        IIFYGGKNQAWIQQFEEKVEVLKSDPLIIDGGSFEIVRIGKDAIGEDDPKLMARFWKVQWGYFIVKSQIKGSSASETTEDILRLISYQNEDGWAVLTVGS
Subjt:  IIFYGGKNQAWIQQFEEKVEVLKSDPLIIDGGSFEIVRIGKDAIGEDDPKLMARFWKVQWGYFIVKSQIKGSSASETTEDILRLISYQNEDGWAVLTVGS

Query:  APVLVGRDVLILRLIDDFPKWKQTLRLKAFPDAFREYFNDLAMKTHQCDRVILPGFSGWIPM
        APVL+GRDVLILRLIDDFPKWKQTLRLKAFPDAFREYFNDLAMKTHQCDRVILPGFSGWIPM
Subjt:  APVLVGRDVLILRLIDDFPKWKQTLRLKAFPDAFREYFNDLAMKTHQCDRVILPGFSGWIPM

A0A6J1L2X0 protein SIEVE ELEMENT OCCLUSION B-like0.0e+0082.02Show/hide
Query:  MATALKAPTTSHGLLHPKQSSTSKEEMSVRHYSDELVTGHIYAQHRDDDSTKIDLPNYISVIESIITTADRITETVHRGSEGRVVYSDDSLASNVVIEPP
        MATALKAP T+ GLLHPKQSSTSKEE+S+RHYSDELVTGHIYA+H DDDSTKIDLPNYISVIESIITTADRITETVHRGSEGR+VYSDDSLASNVVIEPP
Subjt:  MATALKAPTTSHGLLHPKQSSTSKEEMSVRHYSDELVTGHIYAQHRDDDSTKIDLPNYISVIESIITTADRITETVHRGSEGRVVYSDDSLASNVVIEPP

Query:  LCTLHRISSELSCKAPGIEKAHETTLKIFEILANYPWEAKAALTLLAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKSLI
        LCTLHRISSELSCKAPGIEKAHETTLKIFEILANYPWEAKAALTLLAFA DYGDLWHLYHYSHTDPLAKSLA+IKRVATLKKHLDSLRYRQVLLNPKSLI
Subjt:  LCTLHRISSELSCKAPGIEKAHETTLKIFEILANYPWEAKAALTLLAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKSLI

Query:  QSCLQAIKYMDEIREFSKYDVKELPALPAALRLIPL------------------------NQPQLYLNELSDKITSVLTDIERHLYAIRDLKEEVDLYRW
        QSCLQAIKYM+EIREFSKYDVKELP LPAALRLIPL                        NQPQLYLNELSDKI  VL +IERHL AIR  ++EVDLYRW
Subjt:  QSCLQAIKYMDEIREFSKYDVKELPALPAALRLIPL------------------------NQPQLYLNELSDKITSVLTDIERHLYAIRDLKEEVDLYRW

Query:  LVDHIEHYHTGIPLVISKL-----------------------------------------DDIRALHKIYDELKARNANYEIVWIPIIPEPYHEDDRKMY
        LVDH+EHYHTGIPLV+SKL                                         DDIRALHKIYDELKAR+ANYEIVWIPI PEPYHEDD + Y
Subjt:  LVDHIEHYHTGIPLVISKL-----------------------------------------DDIRALHKIYDELKARNANYEIVWIPIIPEPYHEDDRKMY

Query:  EYLRSTMKWLSIQFTTKISGMRYIEEKWQFREDPLVVVLNPNSKVEFMNAIHLIRVWENEAIPFTQARTESLLKKHWPESTLLKFTHQPRLPNWIKSQKS
        EYLRS+MKW SIQFTTKISGMRYIEEKWQFREDPLVVVLNP SKVEFMNAIHLIRVWENEAIPFT ARTE LLKKHWPESTLLKFTHQPRLPNWI SQKS
Subjt:  EYLRSTMKWLSIQFTTKISGMRYIEEKWQFREDPLVVVLNPNSKVEFMNAIHLIRVWENEAIPFTQARTESLLKKHWPESTLLKFTHQPRLPNWIKSQKS

Query:  IIFYGGKNQAWIQQFEEKVEVLKSDPLIIDGGSFEIVRIGKDAIGEDDPKLMARFWKVQWGYFIVKSQIKGSSASETTEDILRLISYQNEDGWAVLTVGS
        IIFYGGK+Q+WIQQFEEKVE+LK DPLII+GGSFEIVRIGKDA  EDDPKLMARFWKVQWGYF+VKSQIKGSSASETTEDILRLISYQNEDGWAVLTVG 
Subjt:  IIFYGGKNQAWIQQFEEKVEVLKSDPLIIDGGSFEIVRIGKDAIGEDDPKLMARFWKVQWGYFIVKSQIKGSSASETTEDILRLISYQNEDGWAVLTVGS

Query:  APVLVGRDVLILRLIDDFPKWKQTLRLKAFPDAFREYFNDLAMKTHQCDRVILPGFSGWIPM
        APVLVGRD+LILRL+DDFP+WK TLRLKAFPDAFR+YFNDLAMK H+CD++ LPGFSG IPM
Subjt:  APVLVGRDVLILRLIDDFPKWKQTLRLKAFPDAFREYFNDLAMKTHQCDRVILPGFSGWIPM

SwissProt top hitse value%identityAlignment
Q93XX2 Protein SIEVE ELEMENT OCCLUSION A1.5e-1924.32Show/hide
Query:  SDDLVTGYIYDKHRDDDTTKIDLPHYISVIENIMTLANRITDAVLRGTDGRLVPSDESLTSNVSIEPPLCALHNITSELSCK-------------APGIE
        SDD V      K    D    D+   +SV+ +I    + +           LV  D     + S E     +  I+ E+ CK                ++
Subjt:  SDDLVTGYIYDKHRDDDTTKIDLPHYISVIENIMTLANRITDAVLRGTDGRLVPSDESLTSNVSIEPPLCALHNITSELSCK-------------APGIE

Query:  NAHEITLKIFELLANYPWEAKAALTLIAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVAMLKKHLDSLRYRQVLLSPNSLINSCLQAIKYMNQIREFSKY
        + +  T  +  L++ Y W+AK  L L A A  YG    L     T+ L KSLA+IK++  +    ++L  R  L     L+   +     +         
Subjt:  NAHEITLKIFELLANYPWEAKAALTLIAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVAMLKKHLDSLRYRQVLLSPNSLINSCLQAIKYMNQIREFSKY

Query:  DVKELP--ELPAALR-QIPLITYWVIHTIVASRIEISSYLSETENQSQKY-----LNELSEKI----AIVLAVLEKHLDAIREQYEEVDLYRWLVDHIEH
        D+ +LP   + AA    IP   YW++  ++     IS      ++Q   +     ++E SE++    A +L   +K    I E   E +    +      
Subjt:  DVKELP--ELPAALR-QIPLITYWVIHTIVASRIEISSYLSETENQSQKY-----LNELSEKI----AIVLAVLEKHLDAIREQYEEVDLYRWLVDHIEH

Query:  YHTDITLVMSKLLSGKIEAKPLIDGSTLREVSIQESLAGKNVVLVISELNISDDDMRALHQVYNELKRDNKHEIVWIPIIPERFLEEDRRRYEYLRSTMK
         H D+   + +LL   I+      G + R V I   L  K+V+L+IS+L   + ++  L  +Y E  + +  EI+W+P + + + E D  ++E L   M+
Subjt:  YHTDITLVMSKLLSGKIEAKPLIDGSTLREVSIQESLAGKNVVLVISELNISDDDMRALHQVYNELKRDNKHEIVWIPIIPERFLEEDRRRYEYLRSTMK

Query:  WYSM--QFTTRVVGMRYIEEKWQLREDPLVVVLNPQSKVEFTNAIHLIRVWGTEAIPFTHNR-TELLLRKHWPESTLVKFT
        WY +      R   +R++ E W  +  P++V L+P+ +V  TNA  ++ +W   A PFT  R  +L   + W    L+  T
Subjt:  WYSM--QFTTRVVGMRYIEEKWQLREDPLVVVLNPQSKVEFTNAIHLIRVWGTEAIPFTHNR-TELLLRKHWPESTLVKFT

Q9FXE2 Protein SIEVE ELEMENT OCCLUSION C1.4e-1723.51Show/hide
Query:  SDDLVTGYIYDKHRDDDTTKIDLPHYISVIENIMTLANRITDAVLRGTDGRLVPSDESLTSNV---SIEPPLCALHNITSELSCKAPGIENAHEITLKIF
        ++D++   +   H D D   +D    +  +E I++        VL+    R + ++  +T+     S E    A+  I+ ++ C   G     + T+ +F
Subjt:  SDDLVTGYIYDKHRDDDTTKIDLPHYISVIENIMTLANRITDAVLRGTDGRLVPSDESLTSNV---SIEPPLCALHNITSELSCKAPGIENAHEITLKIF

Query:  ELLANYPWEAKAALTLIAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVAMLKKHLDSLRYRQVLLSPNSLINSCLQAIKYMNQIREFSKYDVKELP----
        +LL  Y W+AKA L L   A  YG L    H +  DP+A S+A + ++      ++  ++R  L S N LI + +   K    I +F K   K+      
Subjt:  ELLANYPWEAKAALTLIAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVAMLKKHLDSLRYRQVLLSPNSLINSCLQAIKYMNQIREFSKYDVKELP----

Query:  ELPAALRQIPLITYWVIHTIVASRIEISSYLSETEN-------------QSQKYLNELSE---KIAIVLAVLEKHLDAIREQYEEVDLYRWLVDHIEHYH
         L   L  I L TY V+ + +    +I  Y  +T+              +S++   ELS    ++  +   L K ++    Q EE    R    +IE  H
Subjt:  ELPAALRQIPLITYWVIHTIVASRIEISSYLSETEN-------------QSQKYLNELSE---KIAIVLAVLEKHLDAIREQYEEVDLYRWLVDHIEHYH

Query:  TDITLVMSKLLSGKIEAKPLIDGSTLREVSIQESLAGKNVVLVISELNISDDDMRALHQVYNELKRDN---KHEIVWIPI-IPERFLEEDRRRYEYLRST
         D   V+  L S + +  PL   S  R++SI E +  K  +L++S+  + +     L Q+Y+     N    +EI+W+PI   +++ +E++  +++  ++
Subjt:  TDITLVMSKLLSGKIEAKPLIDGSTLREVSIQESLAGKNVVLVISELNISDDDMRALHQVYNELKRDN---KHEIVWIPI-IPERFLEEDRRRYEYLRST

Query:  MKWYSMQ--FTTRVVGMRYIEEKWQLRE-DPLVVVLNPQSKVEFTNAIHLIRVWGTEAIPFTHNRTELLLRKH-WPESTLVKFTH
        + W S++  +      + + +++W  ++ + ++VV++   +    NA+ ++ +WG +A PF+ +R + L ++H W  + L+   H
Subjt:  MKWYSMQ--FTTRVVGMRYIEEKWQLRE-DPLVVVLNPQSKVEFTNAIHLIRVWGTEAIPFTHNRTELLLRKH-WPESTLVKFTH

Q9SS87 Protein SIEVE ELEMENT OCCLUSION B2.8e-4228.43Show/hide
Query:  KQPTAMTKEESSMKYYSDDLVTGYIYDKHRDDDTTKIDLPHYISVIENIM---TLANRITDAVL--RGTDGRLVPSD-ESLTSNVSIEPPLCALHNITSE
        K P+      + +   SD+ +   +  +    D  ++ +   +S++E+I+   TL +  T+A +    T+ +L+ S   S+  +VS      A+  +  E
Subjt:  KQPTAMTKEESSMKYYSDDLVTGYIYDKHRDDDTTKIDLPHYISVIENIM---TLANRITDAVL--RGTDGRLVPSD-ESLTSNVSIEPPLCALHNITSE

Query:  LSCKAPGIENAHEITLKIFELLANYPWEAKAALTLIAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVAMLKKHLDSLRYRQVLLSPNSLINSCLQAIKYM
        ++ K+    ++HEIT+ +FE L+++ W+ K  LTL AFA +YG+ W L  +   + LAKSLA++K V +  +    +    V    N LI         +
Subjt:  LSCKAPGIENAHEITLKIFELLANYPWEAKAALTLIAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVAMLKKHLDSLRYRQVLLSPNSLINSCLQAIKYM

Query:  NQIREF-SKYDVKELPELPAALRQIPLITYWVIHTIVA--SRIEISSYLSETENQSQKYLNELSEKIAIVLAVLEKHL-DAIREQYEEVDLYR------W
         ++ E   +Y   ++P+L   L  IP+  YW I +++A  S+I + + +      +Q  L E S  +A  L  +  HL + +R  Y  ++  R       
Subjt:  NQIREF-SKYDVKELPELPAALRQIPLITYWVIHTIVA--SRIEISSYLSETENQSQKYLNELSEKIAIVLAVLEKHL-DAIREQYEEVDLYR------W

Query:  LVDHIEHYHTDITLVMSKLLSGKIEAKPLIDGSTLREVSIQESLAGKNVVLVISELNISDDDMRALHQVYNELKR-----DNK----HEIVWIPII-PER
        L    +  H D   +++ L+  K    PL DG T R+V + + L  K V+L+IS+LNI  D++    Q+Y E +R     D K    +E+VW+P++ P  
Subjt:  LVDHIEHYHTDITLVMSKLLSGKIEAKPLIDGSTLREVSIQESLAGKNVVLVISELNISDDDMRALHQVYNELKR-----DNK----HEIVWIPII-PER

Query:  FLEED---RRRYEYLRSTMKWYSMQFTTRVVG--MRYIEEKWQLREDPLVVVLNPQSKVEFTNAIHLIRVWGTEAIPFTHNRTELLLRK
          E     ++++E LR  M WYS+     +    + ++  +W     P++VV++PQ      NA+H+I +WGTEA PFT +R E L R+
Subjt:  FLEED---RRRYEYLRSTMKWYSMQFTTRVVG--MRYIEEKWQLREDPLVVVLNPQSKVEFTNAIHLIRVWGTEAIPFTHNRTELLLRK

Arabidopsis top hitse value%identityAlignment
AT1G67790.1 unknown protein2.2e-2621.7Show/hide
Query:  LHRISSELSCKAPGIEKAHETTLKIFEILANYPWEAKAALTLLAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKSLIQSC
        + RIS ++ C   G  +  + T+ +F++L  Y W+AKA L L   A  YG L    H +  DP+A S+A + ++      ++  ++R  L +   LI++ 
Subjt:  LHRISSELSCKAPGIEKAHETTLKIFEILANYPWEAKAALTLLAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKSLIQSC

Query:  LQAIKYMDEIREFSKYDVKELPALPAALRLIPLNQPQLYLNELSDKITSVLTDIERHLYAIRDLKEEVDLYRWLVD-HIEHYHTGIPLVISKLDDIR---
        +   K    I +F K               IP  Q +L  N L + ++++     R + +     +++  ++      I      + L++     +    
Subjt:  LQAIKYMDEIREFSKYDVKELPALPAALRLIPLNQPQLYLNELSDKITSVLTDIERHLYAIRDLKEEVDLYRWLVD-HIEHYHTGIPLVISKLDDIR---

Query:  -ALHKIYDELKARNA--NYEIVWIPI-IPEPYHEDDRKMYEYLRSTMKWLSIQFTTKISG--MRYIEEKWQFRE-DPLVVVLNPNSKVEFMNAIHLIRVW
          L ++YD     N   NYEI+W+PI   + + +++++++++  +++ W+S++    +S   + + +++W +++ + ++VV++ N +   MNA+ ++ +W
Subjt:  -ALHKIYDELKARNA--NYEIVWIPI-IPEPYHEDDRKMYEYLRSTMKWLSIQFTTKISG--MRYIEEKWQFRE-DPLVVVLNPNSKVEFMNAIHLIRVW

Query:  ENEAIPFTQARTESLLKKH-WPESTLLKFTHQPRLPNWIKSQKSIIFYGGKNQAWIQQFEEKVEVLKSDPLIIDGGSFEIVRIGKDAIGED----DPKLM
          +A PF+ +R + L K+H W  + LL   H    P +    + I  +G +N  WI +F      +++    ++       R  + A+ E      P L 
Subjt:  ENEAIPFTQARTESLLKKH-WPESTLLKFTHQPRLPNWIKSQKSIIFYGGKNQAWIQQFEEKVEVLKSDPLIIDGGSFEIVRIGKDAIGED----DPKLM

Query:  ARFW-KVQWGYFIVKSQIK-----GSSASETTEDILRLI--SYQNEDGWAVLTVGSAPVLVGRDVLILRL
          FW +++    I +S++K      S      E++  L+   Y    GW ++  GS    V  + +  R+
Subjt:  ARFW-KVQWGYFIVKSQIK-----GSSASETTEDILRLI--SYQNEDGWAVLTVGSAPVLVGRDVLILRL

AT3G01670.1 unknown protein1.1e-2024.32Show/hide
Query:  SDDLVTGYIYDKHRDDDTTKIDLPHYISVIENIMTLANRITDAVLRGTDGRLVPSDESLTSNVSIEPPLCALHNITSELSCK-------------APGIE
        SDD V      K    D    D+   +SV+ +I    + +           LV  D     + S E     +  I+ E+ CK                ++
Subjt:  SDDLVTGYIYDKHRDDDTTKIDLPHYISVIENIMTLANRITDAVLRGTDGRLVPSDESLTSNVSIEPPLCALHNITSELSCK-------------APGIE

Query:  NAHEITLKIFELLANYPWEAKAALTLIAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVAMLKKHLDSLRYRQVLLSPNSLINSCLQAIKYMNQIREFSKY
        + +  T  +  L++ Y W+AK  L L A A  YG    L     T+ L KSLA+IK++  +    ++L  R  L     L+   +     +         
Subjt:  NAHEITLKIFELLANYPWEAKAALTLIAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVAMLKKHLDSLRYRQVLLSPNSLINSCLQAIKYMNQIREFSKY

Query:  DVKELP--ELPAALR-QIPLITYWVIHTIVASRIEISSYLSETENQSQKY-----LNELSEKI----AIVLAVLEKHLDAIREQYEEVDLYRWLVDHIEH
        D+ +LP   + AA    IP   YW++  ++     IS      ++Q   +     ++E SE++    A +L   +K    I E   E +    +      
Subjt:  DVKELP--ELPAALR-QIPLITYWVIHTIVASRIEISSYLSETENQSQKY-----LNELSEKI----AIVLAVLEKHLDAIREQYEEVDLYRWLVDHIEH

Query:  YHTDITLVMSKLLSGKIEAKPLIDGSTLREVSIQESLAGKNVVLVISELNISDDDMRALHQVYNELKRDNKHEIVWIPIIPERFLEEDRRRYEYLRSTMK
         H D+   + +LL   I+      G + R V I   L  K+V+L+IS+L   + ++  L  +Y E  + +  EI+W+P + + + E D  ++E L   M+
Subjt:  YHTDITLVMSKLLSGKIEAKPLIDGSTLREVSIQESLAGKNVVLVISELNISDDDMRALHQVYNELKRDNKHEIVWIPIIPERFLEEDRRRYEYLRSTMK

Query:  WYSM--QFTTRVVGMRYIEEKWQLREDPLVVVLNPQSKVEFTNAIHLIRVWGTEAIPFTHNR-TELLLRKHWPESTLVKFT
        WY +      R   +R++ E W  +  P++V L+P+ +V  TNA  ++ +W   A PFT  R  +L   + W    L+  T
Subjt:  WYSM--QFTTRVVGMRYIEEKWQLREDPLVVVLNPQSKVEFTNAIHLIRVWGTEAIPFTHNR-TELLLRKHWPESTLVKFT

AT3G01680.1 CONTAINS InterPro DOMAIN/s: Mediator complex subunit Med28 (InterPro:IPR021640)2.0e-4328.43Show/hide
Query:  KQPTAMTKEESSMKYYSDDLVTGYIYDKHRDDDTTKIDLPHYISVIENIM---TLANRITDAVL--RGTDGRLVPSD-ESLTSNVSIEPPLCALHNITSE
        K P+      + +   SD+ +   +  +    D  ++ +   +S++E+I+   TL +  T+A +    T+ +L+ S   S+  +VS      A+  +  E
Subjt:  KQPTAMTKEESSMKYYSDDLVTGYIYDKHRDDDTTKIDLPHYISVIENIM---TLANRITDAVL--RGTDGRLVPSD-ESLTSNVSIEPPLCALHNITSE

Query:  LSCKAPGIENAHEITLKIFELLANYPWEAKAALTLIAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVAMLKKHLDSLRYRQVLLSPNSLINSCLQAIKYM
        ++ K+    ++HEIT+ +FE L+++ W+ K  LTL AFA +YG+ W L  +   + LAKSLA++K V +  +    +    V    N LI         +
Subjt:  LSCKAPGIENAHEITLKIFELLANYPWEAKAALTLIAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVAMLKKHLDSLRYRQVLLSPNSLINSCLQAIKYM

Query:  NQIREF-SKYDVKELPELPAALRQIPLITYWVIHTIVA--SRIEISSYLSETENQSQKYLNELSEKIAIVLAVLEKHL-DAIREQYEEVDLYR------W
         ++ E   +Y   ++P+L   L  IP+  YW I +++A  S+I + + +      +Q  L E S  +A  L  +  HL + +R  Y  ++  R       
Subjt:  NQIREF-SKYDVKELPELPAALRQIPLITYWVIHTIVA--SRIEISSYLSETENQSQKYLNELSEKIAIVLAVLEKHL-DAIREQYEEVDLYR------W

Query:  LVDHIEHYHTDITLVMSKLLSGKIEAKPLIDGSTLREVSIQESLAGKNVVLVISELNISDDDMRALHQVYNELKR-----DNK----HEIVWIPII-PER
        L    +  H D   +++ L+  K    PL DG T R+V + + L  K V+L+IS+LNI  D++    Q+Y E +R     D K    +E+VW+P++ P  
Subjt:  LVDHIEHYHTDITLVMSKLLSGKIEAKPLIDGSTLREVSIQESLAGKNVVLVISELNISDDDMRALHQVYNELKR-----DNK----HEIVWIPII-PER

Query:  FLEED---RRRYEYLRSTMKWYSMQFTTRVVG--MRYIEEKWQLREDPLVVVLNPQSKVEFTNAIHLIRVWGTEAIPFTHNRTELLLRK
          E     ++++E LR  M WYS+     +    + ++  +W     P++VV++PQ      NA+H+I +WGTEA PFT +R E L R+
Subjt:  FLEED---RRRYEYLRSTMKWYSMQFTTRVVG--MRYIEEKWQLREDPLVVVLNPQSKVEFTNAIHLIRVWGTEAIPFTHNRTELLLRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCACTGCACTCAAGGCACCCACCACTTCACATGGATTGCTGCATCCTAAGCAATCATCCACCTCTAAGGAGGAAATGAGCGTGAGACATTACTCCGACGAACTTGT
CACCGGTCACATTTACGCCCAACATCGCGACGATGACTCAACTAAAATTGACCTTCCAAATTACATTTCAGTTATTGAGAGCATCATTACCACTGCCGATCGAATTACTG
AAACCGTTCATCGCGGTAGCGAAGGGCGTGTGGTGTATTCAGATGATTCATTGGCATCTAATGTTGTGATCGAGCCTCCGCTATGTACTCTTCATCGTATCTCTAGCGAG
TTGTCATGCAAGGCTCCGGGGATAGAGAAAGCACACGAGACGACACTAAAAATCTTCGAAATATTGGCTAATTATCCATGGGAAGCCAAGGCAGCTCTCACTTTGTTAGC
CTTTGCAACAGATTATGGAGACTTATGGCATCTCTATCATTATTCCCATACCGATCCATTGGCTAAATCATTGGCCATTATCAAGCGAGTAGCTACCTTGAAGAAGCACT
TAGACTCACTTCGATACCGACAAGTGCTTCTCAATCCCAAGAGCCTTATTCAAAGCTGTTTGCAAGCAATCAAATACATGGATGAGATACGAGAATTCTCCAAATATGAT
GTCAAGGAGCTTCCTGCGTTGCCAGCTGCTCTTCGTCTGATTCCATTGAATCAGCCACAGCTATATTTGAATGAATTGTCGGACAAAATTACGAGCGTACTCACCGACAT
TGAAAGGCATCTATACGCCATCCGAGATCTGAAAGAGGAAGTTGATCTCTACCGATGGTTGGTTGACCACATTGAGCATTATCATACTGGCATTCCCTTGGTTATTTCCA
AGCTGGACGATATTAGAGCTTTACATAAGATTTACGATGAATTGAAAGCTAGAAACGCTAATTATGAGATAGTTTGGATTCCAATTATCCCGGAGCCTTATCATGAAGAT
GATCGCAAGATGTATGAGTATCTGCGTTCTACAATGAAGTGGCTCTCAATCCAGTTTACCACAAAAATATCTGGCATGAGATACATCGAGGAGAAGTGGCAATTTAGAGA
AGATCCATTAGTTGTGGTACTCAACCCAAATTCTAAAGTAGAATTCATGAATGCAATTCATCTCATTCGTGTTTGGGAAAATGAAGCTATCCCATTTACACAAGCAAGAA
CTGAATCTTTACTCAAAAAACATTGGCCCGAGTCAACTCTCCTCAAATTCACTCACCAACCAAGGCTACCAAATTGGATTAAGAGTCAGAAAAGTATTATATTTTATGGA
GGAAAGAATCAAGCATGGATCCAACAATTTGAAGAAAAAGTAGAAGTTTTGAAAAGCGATCCTTTGATAATCGATGGGGGTTCATTTGAGATCGTACGCATTGGAAAAGA
TGCAATAGGAGAGGATGATCCTAAACTTATGGCTCGTTTTTGGAAAGTACAATGGGGTTATTTTATAGTGAAGAGTCAGATAAAAGGTTCAAGTGCAAGCGAGACAACTG
AAGATATTTTAAGATTGATTTCTTACCAAAATGAAGATGGTTGGGCAGTTCTTACTGTAGGCTCAGCCCCTGTGTTAGTTGGTCGTGACGTTTTGATTTTGAGATTGATT
GATGACTTTCCAAAATGGAAACAAACTTTGCGCTTGAAGGCTTTCCCCGATGCTTTTAGAGAATACTTCAATGACCTGGCTATGAAGACTCATCAATGTGATCGAGTTAT
TCTTCCTGGATTTAGTGGATGGATTCCTATGCCACCCACTGTTGCTTCTGCATTGCTTAAGCAGCCAACCGCCATGACGAAGGAGGAGTCGAGCATGAAATATTACTCGG
ACGACCTCGTCACTGGCTACATTTACGACAAACATCGTGACGACGATACAACCAAAATCGATCTCCCTCATTACATCTCAGTTATCGAGAATATCATGACTCTTGCCAAC
CGAATTACCGACGCCGTTCTTCGGGGTACCGACGGACGCCTAGTACCTTCAGATGAATCTCTGACATCTAATGTTTCAATTGAGCCTCCGCTTTGTGCTCTTCACAATAT
CACGAGCGAGCTTTCGTGCAAGGCTCCCGGGATCGAAAATGCACACGAGATTACACTAAAAATCTTCGAATTATTGGCTAATTATCCATGGGAAGCCAAGGCAGCGCTCA
CTTTGATAGCCTTTGCAACGGATTATGGAGATTTATGGCATCTCTACCATTATTCCCATACCGATCCATTGGCTAAGTCATTGGCCATTATCAAGCGAGTAGCTATGTTG
AAGAAGCACTTGGACTCACTTCGATACCGTCAAGTGCTACTCAGCCCCAACAGTTTGATCAACAGCTGCTTGCAAGCAATAAAATACATGAACCAAATTAGAGAATTCTC
CAAATATGATGTCAAGGAGCTTCCTGAATTGCCTGCTGCTCTTCGTCAAATCCCATTAATCACTTATTGGGTTATACACACAATTGTTGCTTCTAGAATTGAGATCTCCA
GCTATCTTAGCGAAACCGAGAACCAATCGCAGAAATACTTGAATGAATTGTCTGAAAAGATCGCCATAGTATTGGCCGTGCTTGAAAAGCATCTAGACGCCATCCGAGAA
CAATATGAGGAGGTCGACCTCTACCGATGGCTGGTTGACCACATTGAGCATTATCATACGGACATTACATTGGTTATGTCTAAGCTTCTTAGTGGCAAAATTGAAGCCAA
GCCACTTATTGATGGCTCTACCCTAAGAGAGGTTAGCATTCAAGAAAGTTTAGCGGGGAAGAATGTGGTGTTGGTGATTTCTGAATTGAATATCTCAGATGATGACATGA
GAGCTCTTCATCAAGTTTACAATGAATTGAAAAGAGACAATAAGCATGAGATTGTTTGGATTCCAATTATCCCAGAACGTTTTCTTGAGGAAGATCGAAGGAGATATGAG
TATCTGCGGTCTACGATGAAGTGGTACTCAATGCAATTCACTACAAGAGTGGTTGGCATGAGGTATATTGAAGAGAAGTGGCAATTGAGAGAGGACCCATTAGTTGTGGT
ACTCAATCCACAGTCTAAAGTGGAATTTACTAATGCAATTCATTTGATTCGAGTTTGGGGAACCGAAGCAATCCCTTTTACTCATAATCGAACTGAGCTTCTTTTGAGAA
AACATTGGCCTGAATCAACCCTCGTCAAGTTCACTCATCAACCAAGGTTATTGAGTTGGGCACCCACCGGCGCTGCACCTTCTTTGCTGCAGTCTAAGCACGCATCCACC
CACAAGGAGGAGGTTGGCACCAAGCATTTCCCCGACGAACTCGTCACCGGACACATTTACGCCAAACATCGTGATGACGATAGTACCAAAATTGATCTCCCCAGTTACAT
CTCAGTTATCGAGAATATTATCACAACTGCTGATCAAATTATTGATACTGTTCATCGGGGAATCGACGGCCGCTTGGTACACTCCGATGCAAGTTTGGCATTCAATGTTG
TGATCGAGCCTCCACTTTGTACCCTTCATCGTATCTCTAGCGAGTTGTCATGCAAAGCTCCCGGGATAGAAAAAGCACACGAGACGACACTAGAAATCTTCGAAATATTG
GCTAATTATCCATGGGAAGCCAAGGCAGCTCTCACATTGATAGCCTTTGCAGCCGATTATGGAGACTTATGGCATCTCCATCATTACTCCCATGCTGATCCATTGGCTAA
ATCATTGGCCATTATCAAGCGAGTAGCTACCTTGAAGAAGCACTTAGACTCACTTCGATACCGACAAGTGCTTCTCAATCCCAAGAGCCTTATTCAAAGCTGTTTGCAAG
CAATCAAATACATGGATGAGATACGAGAATTCTCCAAATATGATGTCAAGGAACTTTCTGAGTTACCCGCTGCTCTTCGTCAGATCCCATTGGAATCAGCCACAGAGATA
TTTGAATGA
mRNA sequenceShow/hide mRNA sequence
CCTAACAATTTCAATTCACAACACAACCGAACCCTATTCGACCTTCTTTCCTTCTGTTTGCACTTCTCTCTTCATCCAAACCATGGCCACTGCACTCAAGGCACCCACCA
CTTCACATGGATTGCTGCATCCTAAGCAATCATCCACCTCTAAGGAGGAAATGAGCGTGAGACATTACTCCGACGAACTTGTCACCGGTCACATTTACGCCCAACATCGC
GACGATGACTCAACTAAAATTGACCTTCCAAATTACATTTCAGTTATTGAGAGCATCATTACCACTGCCGATCGAATTACTGAAACCGTTCATCGCGGTAGCGAAGGGCG
TGTGGTGTATTCAGATGATTCATTGGCATCTAATGTTGTGATCGAGCCTCCGCTATGTACTCTTCATCGTATCTCTAGCGAGTTGTCATGCAAGGCTCCGGGGATAGAGA
AAGCACACGAGACGACACTAAAAATCTTCGAAATATTGGCTAATTATCCATGGGAAGCCAAGGCAGCTCTCACTTTGTTAGCCTTTGCAACAGATTATGGAGACTTATGG
CATCTCTATCATTATTCCCATACCGATCCATTGGCTAAATCATTGGCCATTATCAAGCGAGTAGCTACCTTGAAGAAGCACTTAGACTCACTTCGATACCGACAAGTGCT
TCTCAATCCCAAGAGCCTTATTCAAAGCTGTTTGCAAGCAATCAAATACATGGATGAGATACGAGAATTCTCCAAATATGATGTCAAGGAGCTTCCTGCGTTGCCAGCTG
CTCTTCGTCTGATTCCATTGAATCAGCCACAGCTATATTTGAATGAATTGTCGGACAAAATTACGAGCGTACTCACCGACATTGAAAGGCATCTATACGCCATCCGAGAT
CTGAAAGAGGAAGTTGATCTCTACCGATGGTTGGTTGACCACATTGAGCATTATCATACTGGCATTCCCTTGGTTATTTCCAAGCTGGACGATATTAGAGCTTTACATAA
GATTTACGATGAATTGAAAGCTAGAAACGCTAATTATGAGATAGTTTGGATTCCAATTATCCCGGAGCCTTATCATGAAGATGATCGCAAGATGTATGAGTATCTGCGTT
CTACAATGAAGTGGCTCTCAATCCAGTTTACCACAAAAATATCTGGCATGAGATACATCGAGGAGAAGTGGCAATTTAGAGAAGATCCATTAGTTGTGGTACTCAACCCA
AATTCTAAAGTAGAATTCATGAATGCAATTCATCTCATTCGTGTTTGGGAAAATGAAGCTATCCCATTTACACAAGCAAGAACTGAATCTTTACTCAAAAAACATTGGCC
CGAGTCAACTCTCCTCAAATTCACTCACCAACCAAGGCTACCAAATTGGATTAAGAGTCAGAAAAGTATTATATTTTATGGAGGAAAGAATCAAGCATGGATCCAACAAT
TTGAAGAAAAAGTAGAAGTTTTGAAAAGCGATCCTTTGATAATCGATGGGGGTTCATTTGAGATCGTACGCATTGGAAAAGATGCAATAGGAGAGGATGATCCTAAACTT
ATGGCTCGTTTTTGGAAAGTACAATGGGGTTATTTTATAGTGAAGAGTCAGATAAAAGGTTCAAGTGCAAGCGAGACAACTGAAGATATTTTAAGATTGATTTCTTACCA
AAATGAAGATGGTTGGGCAGTTCTTACTGTAGGCTCAGCCCCTGTGTTAGTTGGTCGTGACGTTTTGATTTTGAGATTGATTGATGACTTTCCAAAATGGAAACAAACTT
TGCGCTTGAAGGCTTTCCCCGATGCTTTTAGAGAATACTTCAATGACCTGGCTATGAAGACTCATCAATGTGATCGAGTTATTCTTCCTGGATTTAGTGGATGGATTCCT
ATGCCACCCACTGTTGCTTCTGCATTGCTTAAGCAGCCAACCGCCATGACGAAGGAGGAGTCGAGCATGAAATATTACTCGGACGACCTCGTCACTGGCTACATTTACGA
CAAACATCGTGACGACGATACAACCAAAATCGATCTCCCTCATTACATCTCAGTTATCGAGAATATCATGACTCTTGCCAACCGAATTACCGACGCCGTTCTTCGGGGTA
CCGACGGACGCCTAGTACCTTCAGATGAATCTCTGACATCTAATGTTTCAATTGAGCCTCCGCTTTGTGCTCTTCACAATATCACGAGCGAGCTTTCGTGCAAGGCTCCC
GGGATCGAAAATGCACACGAGATTACACTAAAAATCTTCGAATTATTGGCTAATTATCCATGGGAAGCCAAGGCAGCGCTCACTTTGATAGCCTTTGCAACGGATTATGG
AGATTTATGGCATCTCTACCATTATTCCCATACCGATCCATTGGCTAAGTCATTGGCCATTATCAAGCGAGTAGCTATGTTGAAGAAGCACTTGGACTCACTTCGATACC
GTCAAGTGCTACTCAGCCCCAACAGTTTGATCAACAGCTGCTTGCAAGCAATAAAATACATGAACCAAATTAGAGAATTCTCCAAATATGATGTCAAGGAGCTTCCTGAA
TTGCCTGCTGCTCTTCGTCAAATCCCATTAATCACTTATTGGGTTATACACACAATTGTTGCTTCTAGAATTGAGATCTCCAGCTATCTTAGCGAAACCGAGAACCAATC
GCAGAAATACTTGAATGAATTGTCTGAAAAGATCGCCATAGTATTGGCCGTGCTTGAAAAGCATCTAGACGCCATCCGAGAACAATATGAGGAGGTCGACCTCTACCGAT
GGCTGGTTGACCACATTGAGCATTATCATACGGACATTACATTGGTTATGTCTAAGCTTCTTAGTGGCAAAATTGAAGCCAAGCCACTTATTGATGGCTCTACCCTAAGA
GAGGTTAGCATTCAAGAAAGTTTAGCGGGGAAGAATGTGGTGTTGGTGATTTCTGAATTGAATATCTCAGATGATGACATGAGAGCTCTTCATCAAGTTTACAATGAATT
GAAAAGAGACAATAAGCATGAGATTGTTTGGATTCCAATTATCCCAGAACGTTTTCTTGAGGAAGATCGAAGGAGATATGAGTATCTGCGGTCTACGATGAAGTGGTACT
CAATGCAATTCACTACAAGAGTGGTTGGCATGAGGTATATTGAAGAGAAGTGGCAATTGAGAGAGGACCCATTAGTTGTGGTACTCAATCCACAGTCTAAAGTGGAATTT
ACTAATGCAATTCATTTGATTCGAGTTTGGGGAACCGAAGCAATCCCTTTTACTCATAATCGAACTGAGCTTCTTTTGAGAAAACATTGGCCTGAATCAACCCTCGTCAA
GTTCACTCATCAACCAAGGTTATTGAGTTGGGCACCCACCGGCGCTGCACCTTCTTTGCTGCAGTCTAAGCACGCATCCACCCACAAGGAGGAGGTTGGCACCAAGCATT
TCCCCGACGAACTCGTCACCGGACACATTTACGCCAAACATCGTGATGACGATAGTACCAAAATTGATCTCCCCAGTTACATCTCAGTTATCGAGAATATTATCACAACT
GCTGATCAAATTATTGATACTGTTCATCGGGGAATCGACGGCCGCTTGGTACACTCCGATGCAAGTTTGGCATTCAATGTTGTGATCGAGCCTCCACTTTGTACCCTTCA
TCGTATCTCTAGCGAGTTGTCATGCAAAGCTCCCGGGATAGAAAAAGCACACGAGACGACACTAGAAATCTTCGAAATATTGGCTAATTATCCATGGGAAGCCAAGGCAG
CTCTCACATTGATAGCCTTTGCAGCCGATTATGGAGACTTATGGCATCTCCATCATTACTCCCATGCTGATCCATTGGCTAAATCATTGGCCATTATCAAGCGAGTAGCT
ACCTTGAAGAAGCACTTAGACTCACTTCGATACCGACAAGTGCTTCTCAATCCCAAGAGCCTTATTCAAAGCTGTTTGCAAGCAATCAAATACATGGATGAGATACGAGA
ATTCTCCAAATATGATGTCAAGGAACTTTCTGAGTTACCCGCTGCTCTTCGTCAGATCCCATTGGAATCAGCCACAGAGATATTTGAATGATTTGTCTGAAAAAATGGCT
AGGGTACTCGACGAACTCGAAAAGCATCTCGAAATATTACGAGAACAACATGAGGAGGTTGATCTCTACCGGTGGCTGGTTGACCATATTGAGCATTATCGTACCGACAT
TACATTGGTTGTTCCAAAGCTTCTTAGCGGCAAAACAGAAACCAAGCCACTTATTGATGGCTCTACCCTAAGAGAGGTTGGTGTTCATGAAAGTTTGTCAGGAAAGAACG
TGATATTGGTCATTTCTGGGTTGGATATCTCCGAGGATGATATCAAAGCTATTCATAATGTTTATGATGAATTGAAAAGTAGAGGCACTAATTATGAGATAGTTTGGATT
CCAATTATCCTGGCGTCTAATCATGAAAATGATCACAAGAAATATGAGTATGATCACAAGAAATATGAGTATCTGCGTTCTACAATGAAGTGGTACTCAATCCAGTTTAC
TACAAAAATATCGGGCATGAGATACCTTGAGGAGAAGTGGCAACTTAGAGAAGATCCATTAGTTGTGGTACTCAGCCCACAGTCTGAAGTGGTGTTCATTAATGCAATTC
ATCTAATTCGAGTTTGGGGAACTGAAGCAATCGATTTTAAGGAAGATAGAGCCAAGTTTTTATTGAGAAAAAATTGGCCCGATTCAACTCTTGTCAAGTTCGCTCACCAA
CCAAGATTGCAAAGTTGGATCAAGCAAGAGAAAAGCATCTTATTCTATGGTGGCAAAGAACCAATGTGGATCCAACAATTTGAAGAGAGGGTAGAAATTTTGAAGAGTGA
TCCGTTGATAAGGGACGGTGGTTCATTTGAGATCGTACGCATAGGAAAGAATGCAAAAGGAGAGGATGATCCTGCACTCATGGCTCGTTTTTGGAAAATACAATGGGGCT
ATTTTATAGTCAAAAGTCAGTTGATTGGTTCAAGTGCGAGCGAGACAACCGAAGACATTCTCAGGTTGATTTCTTACCAAAATGAAGATGGTTGGGTTGTTCTTTCTGTA
GGGTCTGCGCCTGTGTTAGTTGGTCGTGGGATATTGATTTTGAAGTTGCTTGAGGAATTCCCAAAATGGAAGCAGAGTTTGCGCCTAAAAGCTTTCCCAGATGCTTTTAG
AGATTACTTTAATGAGCTGGCTCTCAAGAGTCATCAATGTGATCGAGTAATTCTTCCAGGGTTTAGTGGATATATTCCTATGATTGTTAATTGTCCTGAGTGTCCTCGTT
TCATGGAGACTGGTATTAGCTTCAAGTGCTGCCACGGAGGTGCTCATATGTGAAGATGATCGACTCGACCTGATGCTAGAAGTCATATCCATACCTTTTCTTATAATACA
CCATTACTATTATTACTATGAATGATACCATTTGGTCCTTCATATGGATCTATACAACTTTATATTAAAATCGAGTCTATTAAGTCTCGATGTTGTCCTGAATCAGCTTT
TGTATGAATTATATTATTTAAGATAAATAATCATCTGGTCTTTATTT
Protein sequenceShow/hide protein sequence
MATALKAPTTSHGLLHPKQSSTSKEEMSVRHYSDELVTGHIYAQHRDDDSTKIDLPNYISVIESIITTADRITETVHRGSEGRVVYSDDSLASNVVIEPPLCTLHRISSE
LSCKAPGIEKAHETTLKIFEILANYPWEAKAALTLLAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKSLIQSCLQAIKYMDEIREFSKYD
VKELPALPAALRLIPLNQPQLYLNELSDKITSVLTDIERHLYAIRDLKEEVDLYRWLVDHIEHYHTGIPLVISKLDDIRALHKIYDELKARNANYEIVWIPIIPEPYHED
DRKMYEYLRSTMKWLSIQFTTKISGMRYIEEKWQFREDPLVVVLNPNSKVEFMNAIHLIRVWENEAIPFTQARTESLLKKHWPESTLLKFTHQPRLPNWIKSQKSIIFYG
GKNQAWIQQFEEKVEVLKSDPLIIDGGSFEIVRIGKDAIGEDDPKLMARFWKVQWGYFIVKSQIKGSSASETTEDILRLISYQNEDGWAVLTVGSAPVLVGRDVLILRLI
DDFPKWKQTLRLKAFPDAFREYFNDLAMKTHQCDRVILPGFSGWIPMPPTVASALLKQPTAMTKEESSMKYYSDDLVTGYIYDKHRDDDTTKIDLPHYISVIENIMTLAN
RITDAVLRGTDGRLVPSDESLTSNVSIEPPLCALHNITSELSCKAPGIENAHEITLKIFELLANYPWEAKAALTLIAFATDYGDLWHLYHYSHTDPLAKSLAIIKRVAML
KKHLDSLRYRQVLLSPNSLINSCLQAIKYMNQIREFSKYDVKELPELPAALRQIPLITYWVIHTIVASRIEISSYLSETENQSQKYLNELSEKIAIVLAVLEKHLDAIRE
QYEEVDLYRWLVDHIEHYHTDITLVMSKLLSGKIEAKPLIDGSTLREVSIQESLAGKNVVLVISELNISDDDMRALHQVYNELKRDNKHEIVWIPIIPERFLEEDRRRYE
YLRSTMKWYSMQFTTRVVGMRYIEEKWQLREDPLVVVLNPQSKVEFTNAIHLIRVWGTEAIPFTHNRTELLLRKHWPESTLVKFTHQPRLLSWAPTGAAPSLLQSKHAST
HKEEVGTKHFPDELVTGHIYAKHRDDDSTKIDLPSYISVIENIITTADQIIDTVHRGIDGRLVHSDASLAFNVVIEPPLCTLHRISSELSCKAPGIEKAHETTLEIFEIL
ANYPWEAKAALTLIAFAADYGDLWHLHHYSHADPLAKSLAIIKRVATLKKHLDSLRYRQVLLNPKSLIQSCLQAIKYMDEIREFSKYDVKELSELPAALRQIPLESATEI
FE