; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021497 (gene) of Snake gourd v1 genome

Gene IDTan0021497
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein SIEVE ELEMENT OCCLUSION B-like
Genome locationLG06:77133771..77140966
RNA-Seq ExpressionTan0021497
SyntenyTan0021497
Gene Ontology termsGO:0010088 - phloem development (biological process)
InterPro domainsIPR027942 - Sieve element occlusion, N-terminal
IPR027944 - Sieve element occlusion, C-terminal
IPR039299 - Protein SIEVE ELEMENT OCCLUSION


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6574938.1 Protein SIEVE ELEMENT OCCLUSION B, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0080.92Show/hide
Query:  MSVLAPKIPSTPLANPRLQTHKEELSLKNLSDDVVAAHVYSKHREDDRVKIDVDNYILFLESLFANVAKINEAASQGTHEGRAMYLEDSFKANVIVDPPV
        MSVLAPKIPSTP+ANPRLQTHK+ELSLKNLSD+VVA H+YSKHR+DD VKIDV+NYI FLES+FA+V +I+EA+ QG H+GR  + EDS+KANV +DPP+
Subjt:  MSVLAPKIPSTPLANPRLQTHKEELSLKNLSDDVVAAHVYSKHREDDRVKIDVDNYILFLESLFANVAKINEAASQGTHEGRAMYLEDSFKANVIVDPPV

Query:  DILEKVSIKVLFILLAFKTPGIEKAHQTTLEILDILISYSWEAKAILSLAAFATDYGVIWHLNNYSHFDPLAKSLANIRQSASLKKHLDSFKYRQVVFSS
        DIL+K+S K     LAFK+PG EKAH+TTLEILDIL+SY WEAKAIL L AF +DYG +WHLN++SH DPLAK+LANI QSASLKKHLDSFKYRQVVFSS
Subjt:  DILEKVSIKVLFILLAFKTPGIEKAHQTTLEILDILISYSWEAKAILSLAAFATDYGVIWHLNNYSHFDPLAKSLANIRQSASLKKHLDSFKYRQVVFSS

Query:  RSIVYLCLQVIKLMNQVRLFSKYDSKEIPELSSALRQIPLFTYWVIHSIVACRTEISSYLANTESQSQIYLNELSERLNAILNILEDHLNIFREQQEEID
        RS+++LCLQVIKLMNQ+RLF+K+D+KEIPEL+SALRQIPLFTYWVIH+IVA   EISSYL NTE QSQIYLNEL+ERLNAILNIL D+LNIF+EQ  EI+
Subjt:  RSIVYLCLQVIKLMNQVRLFSKYDSKEIPELSSALRQIPLFTYWVIHSIVACRTEISSYLANTESQSQIYLNELSERLNAILNILEDHLNIFREQQEEID

Query:  LYRWLIDHIDQFPTEITLVVSKLLEGKADAKPLINCSTLKEETIEDSLKGKNVILLISGLDISNDDIRALSLVYDELKREDDYKIVWIPMINLDIHGEES
        L+RWLIDHID +PTEITLVVSKLLEGK +AKPLINCSTL EE IED+L+ KNVILLISGL+ISNDDIRAL+L+YDELKRED+YKIVWIPMIN     EE+
Subjt:  LYRWLIDHIDQFPTEITLVVSKLLEGKADAKPLINCSTLKEETIEDSLKGKNVILLISGLDISNDDIRALSLVYDELKREDDYKIVWIPMINLDIHGEES

Query:  RKKYEFMRSKMKWYAVQYTTKIAGLRYLEEIWQLRDDALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANALLRKNWPESTIVKFSDQPRLKSWID
        RK+YE +RS MKWYAVQYTTKIAGLR+LEEIWQ+R+DALMVVL+SKSKVKFSNAIHLLRVWG NAIPF+ ERANALLRKNWPESTIVKF +QPRL+SWID
Subjt:  RKKYEFMRSKMKWYAVQYTTKIAGLRYLEEIWQLRDDALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANALLRKNWPESTIVKFSDQPRLKSWID

Query:  QERSIIFYGGKDLDWIQRFEEKVVDIKNDRSMRDNGISFEIVRIGNNM--IKVEDMSRFWIIQWGFFVIKSQLTGSSASETTEDILRLISYENENGWAIL
        QERSIIFYGGKD +WIQRFEEKVVDIKNDRSMRD+GI+FEIV IGNN+       MSRFWI QWGFFV+KSQLTGSSASETTEDILRLISYENENGWAIL
Subjt:  QERSIIFYGGKDLDWIQRFEEKVVDIKNDRSMRDNGISFEIVRIGNNM--IKVEDMSRFWIIQWGFFVIKSQLTGSSASETTEDILRLISYENENGWAIL

Query:  AVGSAPLLVGRGNLILAVFEDFNLWKKSLNMKGFPNSFKDHFNELALRTHKCERVTLPGFSGWIPMIVNCPECPRFMETGINFNCCHGRVPN
         VGSAPLLVGRGNLIL VFEDFN WKK+LN+KGFPN+F+D+FNE+AL+TH+CERVTLPGFSGWIPMIVNCPECPRFMETGINFNCCHGR PN
Subjt:  AVGSAPLLVGRGNLILAVFEDFNLWKKSLNMKGFPNSFKDHFNELALRTHKCERVTLPGFSGWIPMIVNCPECPRFMETGINFNCCHGRVPN

XP_022959465.1 protein SIEVE ELEMENT OCCLUSION B-like [Cucurbita moschata]0.0e+0080.64Show/hide
Query:  MSVLAPKIPSTPLANPRLQTHKEELSLKNLSDDVVAAHVYSKHREDDRVKIDVDNYILFLESLFANVAKINEAASQGTHEGRAMYLEDSFKANVIVDPPV
        MSVLAPKIPSTP+ANPRLQTH++ELSLKNLSD+VVA H+YSKHR+DD VKIDV+NYI FLES+FA+V +I+E++ QG H+G   + EDS+KANV +DPP+
Subjt:  MSVLAPKIPSTPLANPRLQTHKEELSLKNLSDDVVAAHVYSKHREDDRVKIDVDNYILFLESLFANVAKINEAASQGTHEGRAMYLEDSFKANVIVDPPV

Query:  DILEKVSIKVLFILLAFKTPGIEKAHQTTLEILDILISYSWEAKAILSLAAFATDYGVIWHLNNYSHFDPLAKSLANIRQSASLKKHLDSFKYRQVVFSS
        DIL+ +SIK     LAFK PG EKAHQTTLEILDIL+SY WEAKAIL L AF +DYG +WHLN++SH DPLAK+LANI QSASLKKHLDSFKYRQVVFSS
Subjt:  DILEKVSIKVLFILLAFKTPGIEKAHQTTLEILDILISYSWEAKAILSLAAFATDYGVIWHLNNYSHFDPLAKSLANIRQSASLKKHLDSFKYRQVVFSS

Query:  RSIVYLCLQVIKLMNQVRLFSKYDSKEIPELSSALRQIPLFTYWVIHSIVACRTEISSYLANTESQSQIYLNELSERLNAILNILEDHLNIFREQQEEID
        RS+++LCLQVIKLMNQ+RLFSK+D+KEIPEL+SALRQIPLFTYWVIH+IVA   EISSYL NTE QSQIYLNEL+ERLNAILNIL D+LNIF+EQ  EI+
Subjt:  RSIVYLCLQVIKLMNQVRLFSKYDSKEIPELSSALRQIPLFTYWVIHSIVACRTEISSYLANTESQSQIYLNELSERLNAILNILEDHLNIFREQQEEID

Query:  LYRWLIDHIDQFPTEITLVVSKLLEGKADAKPLINCSTLKEETIEDSLKGKNVILLISGLDISNDDIRALSLVYDELKREDDYKIVWIPMINLDIHGEES
        L+RWLIDHID +PTEITLVVSKLLEGK +AKPLINCSTL EE IED+L+ KNVILLISGL+ISNDDIRAL+L+YDELKRED+YKIVWIPMIN     EE+
Subjt:  LYRWLIDHIDQFPTEITLVVSKLLEGKADAKPLINCSTLKEETIEDSLKGKNVILLISGLDISNDDIRALSLVYDELKREDDYKIVWIPMINLDIHGEES

Query:  RKKYEFMRSKMKWYAVQYTTKIAGLRYLEEIWQLRDDALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANALLRKNWPESTIVKFSDQPRLKSWID
        RK+YE +RS MKWYAVQYTTKIAGLR+LEEIWQ+R+DALMVVL+SKSKVKFSNAIHLLRVWG NAIPF+ ERANALLRKNWPESTIVKF +QPRL+SWID
Subjt:  RKKYEFMRSKMKWYAVQYTTKIAGLRYLEEIWQLRDDALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANALLRKNWPESTIVKFSDQPRLKSWID

Query:  QERSIIFYGGKDLDWIQRFEEKVVDIKNDRSMRDNGISFEIVRIGNNM--IKVEDMSRFWIIQWGFFVIKSQLTGSSASETTEDILRLISYENENGWAIL
        QERSIIFYGGKD +WIQRFEEKVVDIKNDRSMRD GI+FEIV +GNN+       MSRFWI QWGFFV+KSQLTGSSASETTEDILRLISYENENGWAIL
Subjt:  QERSIIFYGGKDLDWIQRFEEKVVDIKNDRSMRDNGISFEIVRIGNNM--IKVEDMSRFWIIQWGFFVIKSQLTGSSASETTEDILRLISYENENGWAIL

Query:  AVGSAPLLVGRGNLILAVFEDFNLWKKSLNMKGFPNSFKDHFNELALRTHKCERVTLPGFSGWIPMIVNCPECPRFMETGINFNCCHGRVPN
         VGSAPLLVGRGNLIL VFEDFN WKK+LN+KGFPN+F+D+FNE+AL+TH+CERVTLPGFSGWIPMIVNCPECPRFMETGINFNCCHGR PN
Subjt:  AVGSAPLLVGRGNLILAVFEDFNLWKKSLNMKGFPNSFKDHFNELALRTHKCERVTLPGFSGWIPMIVNCPECPRFMETGINFNCCHGRVPN

XP_023006705.1 protein SIEVE ELEMENT OCCLUSION B-like [Cucurbita maxima]0.0e+0081.36Show/hide
Query:  MSVLAPKIPSTPLANPRLQTHKEELSLKNLSDDVVAAHVYSKHREDDRVKIDVDNYILFLESLFANVAKINEAASQGTHEGRAMYLEDSFKANVIVDPPV
        MSVLAPKIPSTP+ANPRLQTHK+ELSLKNLSDDVV  H+YSKHR+DD VKIDV+NYI FL+S+FA V +I+EA+ QG H+GR +Y EDS+KANV +DPP+
Subjt:  MSVLAPKIPSTPLANPRLQTHKEELSLKNLSDDVVAAHVYSKHREDDRVKIDVDNYILFLESLFANVAKINEAASQGTHEGRAMYLEDSFKANVIVDPPV

Query:  DILEKVSIKVLFILLAFKTPGIEKAHQTTLEILDILISYSWEAKAILSLAAFATDYGVIWHLNNYSHFDPLAKSLANIRQSASLKKHLDSFKYRQVVFSS
        DIL+K+SIK     LAFK PGIE AHQTTLEILDIL SY WEAKAIL L AF +DYG +WHLN++SHFD LAK+LANI QSASLKKHLDSFKYRQVVFSS
Subjt:  DILEKVSIKVLFILLAFKTPGIEKAHQTTLEILDILISYSWEAKAILSLAAFATDYGVIWHLNNYSHFDPLAKSLANIRQSASLKKHLDSFKYRQVVFSS

Query:  RSIVYLCLQVIKLMNQVRLFSKYDSKEIPELSSALRQIPLFTYWVIHSIVACRTEISSYLANTESQSQIYLNELSERLNAILNILEDHLNIFREQQEEID
        RS+++LCLQVIKLMNQ+RLFSK+D+KEIPEL+SALRQIPLFTYWVIH+IVA   EISSYL NTESQSQIYLNEL+ERLNAILNIL D+LNIF+EQ  EI+
Subjt:  RSIVYLCLQVIKLMNQVRLFSKYDSKEIPELSSALRQIPLFTYWVIHSIVACRTEISSYLANTESQSQIYLNELSERLNAILNILEDHLNIFREQQEEID

Query:  LYRWLIDHIDQFPTEITLVVSKLLEGKADAKPLINCSTLKEETIEDSLKGKNVILLISGLDISNDDIRALSLVYDELKREDDYKIVWIPMINLDIHGEES
        L+RWLIDHID FPTEITLVVSKLLEGK +AKPLINCSTL EE IED+L  KNVILLISGL+ISNDDI+AL+L+YDELKRED+YKIVWIPMIN     +ES
Subjt:  LYRWLIDHIDQFPTEITLVVSKLLEGKADAKPLINCSTLKEETIEDSLKGKNVILLISGLDISNDDIRALSLVYDELKREDDYKIVWIPMINLDIHGEES

Query:  RKKYEFMRSKMKWYAVQYTTKIAGLRYLEEIWQLRDDALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANALLRKNWPESTIVKFSDQPRLKSWID
        RK+YE +RS MKWYAVQYTTKIAGLR+LEEIWQ+R+DALMVVL+SKSKVKFSNAIHLLRVWG NAIPF+ ERANALLRKNWPESTIVKF +QPRL+SWID
Subjt:  RKKYEFMRSKMKWYAVQYTTKIAGLRYLEEIWQLRDDALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANALLRKNWPESTIVKFSDQPRLKSWID

Query:  QERSIIFYGGKDLDWIQRFEEKVVDIKNDRSMRDNGISFEIVRIGNN--MIKVEDMSRFWIIQWGFFVIKSQLTGSSASETTEDILRLISYENENGWAIL
        Q RSIIFYGGKD DWIQ FEEKVVDIKNDRSMR++GI+FEIVRIGNN        MSRFWI QWGFFV+KSQLTGSSASETTEDILRLISYENENGWAIL
Subjt:  QERSIIFYGGKDLDWIQRFEEKVVDIKNDRSMRDNGISFEIVRIGNN--MIKVEDMSRFWIIQWGFFVIKSQLTGSSASETTEDILRLISYENENGWAIL

Query:  AVGSAPLLVGRGNLILAVFEDFNLWKKSLNMKGFPNSFKDHFNELALRTHKCERVTLPGFSGWIPMIVNCPECPRFMETGINFNCCHGRVPN
         VGSAPLLVGRGNLIL VFEDFN WKK+LN+KGFPN+F+D+FNE+AL TH+CERVTLPGFSGWIPMIVNCPECPRFMETGINFNCCHGR PN
Subjt:  AVGSAPLLVGRGNLILAVFEDFNLWKKSLNMKGFPNSFKDHFNELALRTHKCERVTLPGFSGWIPMIVNCPECPRFMETGINFNCCHGRVPN

XP_023547453.1 protein SIEVE ELEMENT OCCLUSION B-like [Cucurbita pepo subsp. pepo]0.0e+0081.07Show/hide
Query:  MSVLAPKIPSTPLANPRLQTHKEELSLKNLSDDVVAAHVYSKHREDDRVKIDVDNYILFLESLFANVAKINEAASQGTHEGRAMYLEDSFKANVIVDPPV
        MSVLAPKIPSTP+ANPRLQTHK+ELSLKNLSD+VVA H+YSKHR+DD VKIDV+NYI FLES+FANV +I++A+ QG H+GR ++ EDS+KANV +DPP+
Subjt:  MSVLAPKIPSTPLANPRLQTHKEELSLKNLSDDVVAAHVYSKHREDDRVKIDVDNYILFLESLFANVAKINEAASQGTHEGRAMYLEDSFKANVIVDPPV

Query:  DILEKVSIKVLFILLAFKTPGIEKAHQTTLEILDILISYSWEAKAILSLAAFATDYGVIWHLNNYSHFDPLAKSLANIRQSASLKKHLDSFKYRQVVFSS
        DIL+K+SIK     L FK PGIE AH+TTLEILDIL+SY WEAKAIL L AF +DYG +WHLN++SH DPLAK+LANI QSASLKKHLDSFKYRQVVFSS
Subjt:  DILEKVSIKVLFILLAFKTPGIEKAHQTTLEILDILISYSWEAKAILSLAAFATDYGVIWHLNNYSHFDPLAKSLANIRQSASLKKHLDSFKYRQVVFSS

Query:  RSIVYLCLQVIKLMNQVRLFSKYDSKEIPELSSALRQIPLFTYWVIHSIVACRTEISSYLANTESQSQIYLNELSERLNAILNILEDHLNIFREQQEEID
        RS+++LCLQVIKLMNQ+RLFSK+D+KEIPEL+SALRQIPLFTYWVIH+IVA   EISSYL NTE QSQIYLNEL+ERLNAILNIL D+LNIF+EQ  EI+
Subjt:  RSIVYLCLQVIKLMNQVRLFSKYDSKEIPELSSALRQIPLFTYWVIHSIVACRTEISSYLANTESQSQIYLNELSERLNAILNILEDHLNIFREQQEEID

Query:  LYRWLIDHIDQFPTEITLVVSKLLEGKADAKPLINCSTLKEETIEDSLKGKNVILLISGLDISNDDIRALSLVYDELKREDDYKIVWIPMINLDIHGEES
        L+RWLIDHID FPTEITLVVSKLLEGK +AKPLINCSTL EE IED+L+ KNVILLISGL+ISNDDIRAL+L+YDELKRED+YKIVWIPMIN   + EE+
Subjt:  LYRWLIDHIDQFPTEITLVVSKLLEGKADAKPLINCSTLKEETIEDSLKGKNVILLISGLDISNDDIRALSLVYDELKREDDYKIVWIPMINLDIHGEES

Query:  RKKYEFMRSKMKWYAVQYTTKIAGLRYLEEIWQLRDDALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANALLRKNWPESTIVKFSDQPRLKSWID
        RKKYE +RS MKWYAVQYTTKIAGLR+LEEIWQ+R+DALMVVL+SKSKVKFSNAIHLLRVWG NAIPF+ ERANALLRKNWPESTIVKF +QPRL+SWID
Subjt:  RKKYEFMRSKMKWYAVQYTTKIAGLRYLEEIWQLRDDALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANALLRKNWPESTIVKFSDQPRLKSWID

Query:  QERSIIFYGGKDLDWIQRFEEKVVDIKNDRSMRDNGISFEIVRIGNNM--IKVEDMSRFWIIQWGFFVIKSQLTGSSASETTEDILRLISYENENGWAIL
        QERSIIFYGGKD DWIQRFEEKVVDIKNDRSMRDNGI+FEIVRIGNN+       MSRFWI QWGFF++KSQL+GSSASETTEDILRLISYENENGWA+L
Subjt:  QERSIIFYGGKDLDWIQRFEEKVVDIKNDRSMRDNGISFEIVRIGNNM--IKVEDMSRFWIIQWGFFVIKSQLTGSSASETTEDILRLISYENENGWAIL

Query:  AVGSAPLLVGRGNLILAVFEDFNLWKKSLNMKGFPNSFKDHFNELALRTHKCERVTLPGFSGWIPMIVNCPECPRFMETGINFNCCHGRVPN
         VGSAPLLVGRGNLI+ VFEDF  WKK+LN+KGFPN+F D+FNE+AL+TH+CERVTLPGFSGWIPMIVNCPECPRFMETGINFNCCHGR PN
Subjt:  AVGSAPLLVGRGNLILAVFEDFNLWKKSLNMKGFPNSFKDHFNELALRTHKCERVTLPGFSGWIPMIVNCPECPRFMETGINFNCCHGRVPN

XP_038906722.1 protein SIEVE ELEMENT OCCLUSION B-like [Benincasa hispida]0.0e+0080.49Show/hide
Query:  MSVLAPKIPSTPLANPRLQTHKEELSLKNLSDDVVAAHVYSKHREDDRVKIDVDNYILFLESLFANVAKINEAASQGTHEGRAMYLEDSFKANVIVDPPV
        MSVLAPKIPSTP+ + RLQTHKEELSLK LSD+VVA H+YSKHR+DDRVKIDVDNYI FLES+F+NV +INEAASQG HEGR +Y EDS+KANV +DPP+
Subjt:  MSVLAPKIPSTPLANPRLQTHKEELSLKNLSDDVVAAHVYSKHREDDRVKIDVDNYILFLESLFANVAKINEAASQGTHEGRAMYLEDSFKANVIVDPPV

Query:  DILEKVSIKVLFILLAFKTPGIEKAHQTTLEILDILISYSWEAKAILSLAAFATDYGVIWHLNNYSHFDPLAKSLANIRQSASLKKHLDSFKYRQVVFSS
        DIL+KVS +     L FK+PGIEKAHQTTLEILDILISY WEAKA + LAAF +DYG++WHLN++S FDPLAKSLANI  S SLKKHLDSFKYRQVVFSS
Subjt:  DILEKVSIKVLFILLAFKTPGIEKAHQTTLEILDILISYSWEAKAILSLAAFATDYGVIWHLNNYSHFDPLAKSLANIRQSASLKKHLDSFKYRQVVFSS

Query:  RSIVYLCLQVIKLMNQVRLFSKYDSKEIPELSSALRQIPLFTYWVIHSIVACRTEISSYLANTESQSQIYLNELSERLNAILNILEDHLNIFREQQEEID
        RS+++LC QVIKLMNQ+RLFSKYDSKEIPEL++ LRQIP+FTYWVIH+IVA  TEISSYL N+E QSQ YLNEL+ERLNAIL+ILED+LNIF+E+ EEI+
Subjt:  RSIVYLCLQVIKLMNQVRLFSKYDSKEIPELSSALRQIPLFTYWVIHSIVACRTEISSYLANTESQSQIYLNELSERLNAILNILEDHLNIFREQQEEID

Query:  LYRWLIDHIDQFPTEITLVVSKLLEGKADAKPLINCSTLKEETIEDSLKGKNVILLISGLDISNDDIRALSLVYDELKREDDYKIVWIPMINLDIHGEES
        LYRWLIDHIDQFPT+ITLVVSKLLEGK++AKPLINCSTLKEE IED+L+ KNVILLISGL ISNDDIRAL+LVY+ELKRED+YKIVWIP+IN  +  EES
Subjt:  LYRWLIDHIDQFPTEITLVVSKLLEGKADAKPLINCSTLKEETIEDSLKGKNVILLISGLDISNDDIRALSLVYDELKREDDYKIVWIPMINLDIHGEES

Query:  RKKYEFMRSKMKWYAVQYTTKIAGLRYLEEIWQLRDDALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANALLRKNWPESTIVKFSDQPRLKSWID
         K+YE++RS MKWYAVQY TKIAGLR+LEEIWQLRDDALMVVL+SKSK+KFSNAIHLLRVWG NAIPFS E+ANALLRKNWPESTIVKF++QPRL+SWID
Subjt:  RKKYEFMRSKMKWYAVQYTTKIAGLRYLEEIWQLRDDALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANALLRKNWPESTIVKFSDQPRLKSWID

Query:  QERSIIFYGGKDLDWIQRFEEKVVDIKNDRSMRDNGISFEIVRIGNN--MIKVED----MSRFWIIQWGFFVIKSQLTGSSASETTEDILRLISYENENG
        QER+IIFYGGKD DWIQ+FEEKVVDIKNDRSMRD+GI+FEIVRIGN+    K ED     S FWIIQWG+F+IKSQLTGSSA+ETTEDILRLISYENENG
Subjt:  QERSIIFYGGKDLDWIQRFEEKVVDIKNDRSMRDNGISFEIVRIGNN--MIKVED----MSRFWIIQWGFFVIKSQLTGSSASETTEDILRLISYENENG

Query:  WAILAVGSAPLLVGRGNLILAVFEDFNLWKKSLNMKGFPNSFKDHFNELALRTHKCERVTLPGFSGWIPMIVNCPECPRFMETGINFNCCHGRVPNK
        WAILAVGSAPL+VGRGNLIL VFEDFN WK++LN+KGF NSFKDHFNELAL+ H+C+RV LP FSGWIPMIVNCPECPRFMETGINFNCCHGR  NK
Subjt:  WAILAVGSAPLLVGRGNLILAVFEDFNLWKKSLNMKGFPNSFKDHFNELALRTHKCERVTLPGFSGWIPMIVNCPECPRFMETGINFNCCHGRVPNK

TrEMBL top hitse value%identityAlignment
A0A0A0LNR0 Uncharacterized protein0.0e+0079.45Show/hide
Query:  MSVLAPKIPSTPLANPRLQTHKEELSLKNLSDDVVAAHVYSKHREDDRVKIDVDNYILFLESLFANVAKINEAASQGTHEGRAMYLEDSFKANV-IVDPP
        M++LAPK PSTP+ +PRLQTHKE+LSLKNLSD+ VA H+YSKHR+DD +KIDVDNYI FL+SLF+NV +INEA+SQG HEGR +Y E+S+K+NV  +DPP
Subjt:  MSVLAPKIPSTPLANPRLQTHKEELSLKNLSDDVVAAHVYSKHREDDRVKIDVDNYILFLESLFANVAKINEAASQGTHEGRAMYLEDSFKANV-IVDPP

Query:  VDILEKVSIKVLFILLAFKTPGIEKAHQTTLEILDILISYSWEAKAILSLAAFATDYGVIWHLNNYSHFDPLAKSLANIRQSASLKKHLDSFKYRQVVFS
         DIL+KVSIK     LAFK+PGIEKAHQTTLEILDIL+SY WEAKAIL LAAF +DYG++WHLN++S FDPLAKSLANI  S SLKKHLDSF YRQV+FS
Subjt:  VDILEKVSIKVLFILLAFKTPGIEKAHQTTLEILDILISYSWEAKAILSLAAFATDYGVIWHLNNYSHFDPLAKSLANIRQSASLKKHLDSFKYRQVVFS

Query:  SRSIVYLCLQVIKLMNQVRLFSKYDSKEIPELSSALRQIPLFTYWVIHSIVACRTEISSYLANTESQSQIYLNELSERLNAILNILEDHLNIFREQQEEI
        SRS++YLC ++IKLMNQ+RLFSKYDSKEIPEL+SALRQIPLF+YWVIH+IVA  TEISSYLANTESQS  YLNEL+ERLNAILN L D LNIF+EQ EEI
Subjt:  SRSIVYLCLQVIKLMNQVRLFSKYDSKEIPELSSALRQIPLFTYWVIHSIVACRTEISSYLANTESQSQIYLNELSERLNAILNILEDHLNIFREQQEEI

Query:  DLYRWLIDHIDQFPTEITLVVSKLLEGKADAKPLINCSTLKEETIEDSLKGKNVILLI-SGLDISNDDIRALSLVYDELKREDDYKIVWIPMINLDIHGE
        +LYRWLIDHIDQFPTEITLVVSKLLEGK +AKPLINCST  EE IED+L+ KNVILLI SGL+ISNDDIRAL LVY+ELKRED+YKIVWIP++N +   E
Subjt:  DLYRWLIDHIDQFPTEITLVVSKLLEGKADAKPLINCSTLKEETIEDSLKGKNVILLI-SGLDISNDDIRALSLVYDELKREDDYKIVWIPMINLDIHGE

Query:  ESRKKYEFMRSKMKWYAVQYTTKIAGLRYLEEIWQLRDDALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANALLRKNWPESTIVKFSDQPRLKSW
        ES K+YE +RS MKWYAVQY TKIAGLR+LEEIWQLRDDALMVVL+SKSK+KFSNAIHLLRVWG NAIPF+ ERANALLRKNWPESTIVKF++QPRL+SW
Subjt:  ESRKKYEFMRSKMKWYAVQYTTKIAGLRYLEEIWQLRDDALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANALLRKNWPESTIVKFSDQPRLKSW

Query:  IDQERSIIFYGGKDLDWIQRFEEKVVDIKNDRSMRDNGISFEIVRIG--NNMIKVED-----MSRFWIIQWGFFVIKSQLTGSSASETTEDILRLISYEN
        IDQE++IIFYGGKD+DWIQ+FEEKVVDIKNDRSMRDNGI+FEIV IG   N  K ED     M+RFWI QWGFF+IKSQLTGSSASETTEDILRLISYEN
Subjt:  IDQERSIIFYGGKDLDWIQRFEEKVVDIKNDRSMRDNGISFEIVRIG--NNMIKVED-----MSRFWIIQWGFFVIKSQLTGSSASETTEDILRLISYEN

Query:  ENGWAILAVGSAPLLVGRGNLILAVFEDFNLWKKSLNMKGFPNSFKDHFNELALRTHKCERVTLPGFSGWIPMIVNCPECPRFMETGINFNCCHGR
        ENGWAIL VGSAPL+V RGNL+L VFED N WKK+LN+KGFPNSFKD+F +LALRTH+CE+V LPGFSGWIPMIVNCPECPRFMETGINFNCCHGR
Subjt:  ENGWAILAVGSAPLLVGRGNLILAVFEDFNLWKKSLNMKGFPNSFKDHFNELALRTHKCERVTLPGFSGWIPMIVNCPECPRFMETGINFNCCHGR

A0A1S3CNP9 protein SIEVE ELEMENT OCCLUSION B-like0.0e+0079.42Show/hide
Query:  MSVLAPKIPSTPLANPRLQTHKEELSLKNLSDDVVAAHVYSKHREDDRVKIDVDNYILFLESLFANVAKINEAASQGTHEGRAMYLEDSFKANV-IVDPP
        M+VLAPK PSTP+ +PRLQTHKE+LSLKNLSD+ VA H+YSKHR+DD +KIDVDNYI FL+S+F+N+ +INEA+SQG HEGR +Y ED +K+NV  +DPP
Subjt:  MSVLAPKIPSTPLANPRLQTHKEELSLKNLSDDVVAAHVYSKHREDDRVKIDVDNYILFLESLFANVAKINEAASQGTHEGRAMYLEDSFKANV-IVDPP

Query:  VDILEKVSIKVLFILLAFKTPGIEKAHQTTLEILDILISYSWEAKAILSLAAFATDYGVIWHLNNYSHFDPLAKSLANIRQSASLKKHLDSFKYRQVVFS
        VDIL+KVSIK     LAFK+PGIEKAHQTTLEILD+LI Y WEAKAIL LAAF +DYG++WHLN++S FDPLAKSLANI  S SLKKHLDSF YRQVVFS
Subjt:  VDILEKVSIKVLFILLAFKTPGIEKAHQTTLEILDILISYSWEAKAILSLAAFATDYGVIWHLNNYSHFDPLAKSLANIRQSASLKKHLDSFKYRQVVFS

Query:  SRSIVYLCLQVIKLMNQVRLFSKYDSKEIPELSSALRQIPLFTYWVIHSIVACRTEISSYLANTESQSQIYLNELSERLNAILNILEDHLNIFREQQEEI
        SRS++YLC ++IKLMNQ+RLFSKYDSKEIPEL+SALRQIPLFTYWVIH+IVA  TEISSYLA+TE+QS  YLNEL+ERLNAILN L D LNIFREQ EEI
Subjt:  SRSIVYLCLQVIKLMNQVRLFSKYDSKEIPELSSALRQIPLFTYWVIHSIVACRTEISSYLANTESQSQIYLNELSERLNAILNILEDHLNIFREQQEEI

Query:  DLYRWLIDHIDQFPTEITLVVSKLLEGKADAKPLINCSTLKEETIEDSLKGKNVILLI-SGLDISNDDIRALSLVYDELKREDDYKIVWIPMINLDIHGE
        +LYRWLIDHIDQFPTEITLVVSKLLEGKA+AKPLINCST  EE IED+L+ KNVILLI SGL ISNDD+RAL+LVY+ELKRED+YKIVWIP++N     E
Subjt:  DLYRWLIDHIDQFPTEITLVVSKLLEGKADAKPLINCSTLKEETIEDSLKGKNVILLI-SGLDISNDDIRALSLVYDELKREDDYKIVWIPMINLDIHGE

Query:  ESRKKYEFMRSKMKWYAVQYTTKIAGLRYLEEIWQLRDDALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANALLRKNWPESTIVKFSDQPRLKSW
        ES K+YE++RS MKWYAVQY TKIAGLR+LEEIWQLRDDALMVVL+SKSK+KFSNAIHLLRVWG NAIPF+ ERANALLRKNWPESTIVKF++QPRL+SW
Subjt:  ESRKKYEFMRSKMKWYAVQYTTKIAGLRYLEEIWQLRDDALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANALLRKNWPESTIVKFSDQPRLKSW

Query:  IDQERSIIFYGGKDLDWIQRFEEKVVDIKNDRSMRDNGISFEIVRIGNN--MIKVED----MSRFWIIQWGFFVIKSQLTGSSASETTEDILRLISYENE
        IDQ ++I+FYGGKD+DWIQ+FEEKVVDIKNDRSMRDNGI+FEIV +G N    K ED    MSRFWI QWGFF+IKSQLTGSSASETTEDILRLISYENE
Subjt:  IDQERSIIFYGGKDLDWIQRFEEKVVDIKNDRSMRDNGISFEIVRIGNN--MIKVED----MSRFWIIQWGFFVIKSQLTGSSASETTEDILRLISYENE

Query:  NGWAILAVGSAPLLVGRGNLILAVFEDFNLWKKSLNMKGFPNSFKDHFNELALRTHKCERVTLPGFSGWIPMIVNCPECPRFMETGINFNCCHGR
        NGWAILAVGSAPL+V RGNL+L VFED N WKK+LN+KGFPNSFKD+F+ELA RTH+CE+V LPGFSGWIPMIVNCPECPRFMETGINFNCCHGR
Subjt:  NGWAILAVGSAPLLVGRGNLILAVFEDFNLWKKSLNMKGFPNSFKDHFNELALRTHKCERVTLPGFSGWIPMIVNCPECPRFMETGINFNCCHGR

A0A5A7UVD0 Protein SIEVE ELEMENT OCCLUSION B-like0.0e+0079.42Show/hide
Query:  MSVLAPKIPSTPLANPRLQTHKEELSLKNLSDDVVAAHVYSKHREDDRVKIDVDNYILFLESLFANVAKINEAASQGTHEGRAMYLEDSFKANV-IVDPP
        M+VLAPK PSTP+ +PRLQTHKE+LSLKNLSD+ VA H+YSKHR+DD +KIDVDNYI FL+S+F+N+ +INEA+SQG HEGR +Y ED +K+NV  +DPP
Subjt:  MSVLAPKIPSTPLANPRLQTHKEELSLKNLSDDVVAAHVYSKHREDDRVKIDVDNYILFLESLFANVAKINEAASQGTHEGRAMYLEDSFKANV-IVDPP

Query:  VDILEKVSIKVLFILLAFKTPGIEKAHQTTLEILDILISYSWEAKAILSLAAFATDYGVIWHLNNYSHFDPLAKSLANIRQSASLKKHLDSFKYRQVVFS
        VDIL+KVSIK     LAFK+PGIEKAHQTTLEILD+LI Y WEAKAIL LAAF +DYG++WHLN++S FDPLAKSLANI  S SLKKHLDSF YRQVVFS
Subjt:  VDILEKVSIKVLFILLAFKTPGIEKAHQTTLEILDILISYSWEAKAILSLAAFATDYGVIWHLNNYSHFDPLAKSLANIRQSASLKKHLDSFKYRQVVFS

Query:  SRSIVYLCLQVIKLMNQVRLFSKYDSKEIPELSSALRQIPLFTYWVIHSIVACRTEISSYLANTESQSQIYLNELSERLNAILNILEDHLNIFREQQEEI
        SRS++YLC ++IKLMNQ+RLFSKYDSKEIPEL+SALRQIPLFTYWVIH+IVA  TEISSYLA+TE+QS  YLNEL+ERLNAILN L D LNIFREQ EEI
Subjt:  SRSIVYLCLQVIKLMNQVRLFSKYDSKEIPELSSALRQIPLFTYWVIHSIVACRTEISSYLANTESQSQIYLNELSERLNAILNILEDHLNIFREQQEEI

Query:  DLYRWLIDHIDQFPTEITLVVSKLLEGKADAKPLINCSTLKEETIEDSLKGKNVILLI-SGLDISNDDIRALSLVYDELKREDDYKIVWIPMINLDIHGE
        +LYRWLIDHIDQFPTEITLVVSKLLEGKA+AKPLINCST  EE IED+L+ KNVILLI SGL ISNDD+RAL+LVY+ELKRED+YKIVWIP++N     E
Subjt:  DLYRWLIDHIDQFPTEITLVVSKLLEGKADAKPLINCSTLKEETIEDSLKGKNVILLI-SGLDISNDDIRALSLVYDELKREDDYKIVWIPMINLDIHGE

Query:  ESRKKYEFMRSKMKWYAVQYTTKIAGLRYLEEIWQLRDDALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANALLRKNWPESTIVKFSDQPRLKSW
        ES K+YE++RS MKWYAVQY TKIAGLR+LEEIWQLRDDALMVVL+SKSK+KFSNAIHLLRVWG NAIPF+ ERANALLRKNWPESTIVKF++QPRL+SW
Subjt:  ESRKKYEFMRSKMKWYAVQYTTKIAGLRYLEEIWQLRDDALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANALLRKNWPESTIVKFSDQPRLKSW

Query:  IDQERSIIFYGGKDLDWIQRFEEKVVDIKNDRSMRDNGISFEIVRIGNN--MIKVED----MSRFWIIQWGFFVIKSQLTGSSASETTEDILRLISYENE
        IDQ ++I+FYGGKD+DWIQ+FEEKVVDIKNDRSMRDNGI+FEIV +G N    K ED    MSRFWI QWGFF+IKSQLTGSSASETTEDILRLISYENE
Subjt:  IDQERSIIFYGGKDLDWIQRFEEKVVDIKNDRSMRDNGISFEIVRIGNN--MIKVED----MSRFWIIQWGFFVIKSQLTGSSASETTEDILRLISYENE

Query:  NGWAILAVGSAPLLVGRGNLILAVFEDFNLWKKSLNMKGFPNSFKDHFNELALRTHKCERVTLPGFSGWIPMIVNCPECPRFMETGINFNCCHGR
        NGWAILAVGSAPL+V RGNL+L VFED N WKK+LN+KGFPNSFKD+F+ELA RTH+CE+V LPGFSGWIPMIVNCPECPRFMETGINFNCCHGR
Subjt:  NGWAILAVGSAPLLVGRGNLILAVFEDFNLWKKSLNMKGFPNSFKDHFNELALRTHKCERVTLPGFSGWIPMIVNCPECPRFMETGINFNCCHGR

A0A6J1H6C7 protein SIEVE ELEMENT OCCLUSION B-like0.0e+0080.64Show/hide
Query:  MSVLAPKIPSTPLANPRLQTHKEELSLKNLSDDVVAAHVYSKHREDDRVKIDVDNYILFLESLFANVAKINEAASQGTHEGRAMYLEDSFKANVIVDPPV
        MSVLAPKIPSTP+ANPRLQTH++ELSLKNLSD+VVA H+YSKHR+DD VKIDV+NYI FLES+FA+V +I+E++ QG H+G   + EDS+KANV +DPP+
Subjt:  MSVLAPKIPSTPLANPRLQTHKEELSLKNLSDDVVAAHVYSKHREDDRVKIDVDNYILFLESLFANVAKINEAASQGTHEGRAMYLEDSFKANVIVDPPV

Query:  DILEKVSIKVLFILLAFKTPGIEKAHQTTLEILDILISYSWEAKAILSLAAFATDYGVIWHLNNYSHFDPLAKSLANIRQSASLKKHLDSFKYRQVVFSS
        DIL+ +SIK     LAFK PG EKAHQTTLEILDIL+SY WEAKAIL L AF +DYG +WHLN++SH DPLAK+LANI QSASLKKHLDSFKYRQVVFSS
Subjt:  DILEKVSIKVLFILLAFKTPGIEKAHQTTLEILDILISYSWEAKAILSLAAFATDYGVIWHLNNYSHFDPLAKSLANIRQSASLKKHLDSFKYRQVVFSS

Query:  RSIVYLCLQVIKLMNQVRLFSKYDSKEIPELSSALRQIPLFTYWVIHSIVACRTEISSYLANTESQSQIYLNELSERLNAILNILEDHLNIFREQQEEID
        RS+++LCLQVIKLMNQ+RLFSK+D+KEIPEL+SALRQIPLFTYWVIH+IVA   EISSYL NTE QSQIYLNEL+ERLNAILNIL D+LNIF+EQ  EI+
Subjt:  RSIVYLCLQVIKLMNQVRLFSKYDSKEIPELSSALRQIPLFTYWVIHSIVACRTEISSYLANTESQSQIYLNELSERLNAILNILEDHLNIFREQQEEID

Query:  LYRWLIDHIDQFPTEITLVVSKLLEGKADAKPLINCSTLKEETIEDSLKGKNVILLISGLDISNDDIRALSLVYDELKREDDYKIVWIPMINLDIHGEES
        L+RWLIDHID +PTEITLVVSKLLEGK +AKPLINCSTL EE IED+L+ KNVILLISGL+ISNDDIRAL+L+YDELKRED+YKIVWIPMIN     EE+
Subjt:  LYRWLIDHIDQFPTEITLVVSKLLEGKADAKPLINCSTLKEETIEDSLKGKNVILLISGLDISNDDIRALSLVYDELKREDDYKIVWIPMINLDIHGEES

Query:  RKKYEFMRSKMKWYAVQYTTKIAGLRYLEEIWQLRDDALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANALLRKNWPESTIVKFSDQPRLKSWID
        RK+YE +RS MKWYAVQYTTKIAGLR+LEEIWQ+R+DALMVVL+SKSKVKFSNAIHLLRVWG NAIPF+ ERANALLRKNWPESTIVKF +QPRL+SWID
Subjt:  RKKYEFMRSKMKWYAVQYTTKIAGLRYLEEIWQLRDDALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANALLRKNWPESTIVKFSDQPRLKSWID

Query:  QERSIIFYGGKDLDWIQRFEEKVVDIKNDRSMRDNGISFEIVRIGNNM--IKVEDMSRFWIIQWGFFVIKSQLTGSSASETTEDILRLISYENENGWAIL
        QERSIIFYGGKD +WIQRFEEKVVDIKNDRSMRD GI+FEIV +GNN+       MSRFWI QWGFFV+KSQLTGSSASETTEDILRLISYENENGWAIL
Subjt:  QERSIIFYGGKDLDWIQRFEEKVVDIKNDRSMRDNGISFEIVRIGNNM--IKVEDMSRFWIIQWGFFVIKSQLTGSSASETTEDILRLISYENENGWAIL

Query:  AVGSAPLLVGRGNLILAVFEDFNLWKKSLNMKGFPNSFKDHFNELALRTHKCERVTLPGFSGWIPMIVNCPECPRFMETGINFNCCHGRVPN
         VGSAPLLVGRGNLIL VFEDFN WKK+LN+KGFPN+F+D+FNE+AL+TH+CERVTLPGFSGWIPMIVNCPECPRFMETGINFNCCHGR PN
Subjt:  AVGSAPLLVGRGNLILAVFEDFNLWKKSLNMKGFPNSFKDHFNELALRTHKCERVTLPGFSGWIPMIVNCPECPRFMETGINFNCCHGRVPN

A0A6J1L0W4 protein SIEVE ELEMENT OCCLUSION B-like0.0e+0081.36Show/hide
Query:  MSVLAPKIPSTPLANPRLQTHKEELSLKNLSDDVVAAHVYSKHREDDRVKIDVDNYILFLESLFANVAKINEAASQGTHEGRAMYLEDSFKANVIVDPPV
        MSVLAPKIPSTP+ANPRLQTHK+ELSLKNLSDDVV  H+YSKHR+DD VKIDV+NYI FL+S+FA V +I+EA+ QG H+GR +Y EDS+KANV +DPP+
Subjt:  MSVLAPKIPSTPLANPRLQTHKEELSLKNLSDDVVAAHVYSKHREDDRVKIDVDNYILFLESLFANVAKINEAASQGTHEGRAMYLEDSFKANVIVDPPV

Query:  DILEKVSIKVLFILLAFKTPGIEKAHQTTLEILDILISYSWEAKAILSLAAFATDYGVIWHLNNYSHFDPLAKSLANIRQSASLKKHLDSFKYRQVVFSS
        DIL+K+SIK     LAFK PGIE AHQTTLEILDIL SY WEAKAIL L AF +DYG +WHLN++SHFD LAK+LANI QSASLKKHLDSFKYRQVVFSS
Subjt:  DILEKVSIKVLFILLAFKTPGIEKAHQTTLEILDILISYSWEAKAILSLAAFATDYGVIWHLNNYSHFDPLAKSLANIRQSASLKKHLDSFKYRQVVFSS

Query:  RSIVYLCLQVIKLMNQVRLFSKYDSKEIPELSSALRQIPLFTYWVIHSIVACRTEISSYLANTESQSQIYLNELSERLNAILNILEDHLNIFREQQEEID
        RS+++LCLQVIKLMNQ+RLFSK+D+KEIPEL+SALRQIPLFTYWVIH+IVA   EISSYL NTESQSQIYLNEL+ERLNAILNIL D+LNIF+EQ  EI+
Subjt:  RSIVYLCLQVIKLMNQVRLFSKYDSKEIPELSSALRQIPLFTYWVIHSIVACRTEISSYLANTESQSQIYLNELSERLNAILNILEDHLNIFREQQEEID

Query:  LYRWLIDHIDQFPTEITLVVSKLLEGKADAKPLINCSTLKEETIEDSLKGKNVILLISGLDISNDDIRALSLVYDELKREDDYKIVWIPMINLDIHGEES
        L+RWLIDHID FPTEITLVVSKLLEGK +AKPLINCSTL EE IED+L  KNVILLISGL+ISNDDI+AL+L+YDELKRED+YKIVWIPMIN     +ES
Subjt:  LYRWLIDHIDQFPTEITLVVSKLLEGKADAKPLINCSTLKEETIEDSLKGKNVILLISGLDISNDDIRALSLVYDELKREDDYKIVWIPMINLDIHGEES

Query:  RKKYEFMRSKMKWYAVQYTTKIAGLRYLEEIWQLRDDALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANALLRKNWPESTIVKFSDQPRLKSWID
        RK+YE +RS MKWYAVQYTTKIAGLR+LEEIWQ+R+DALMVVL+SKSKVKFSNAIHLLRVWG NAIPF+ ERANALLRKNWPESTIVKF +QPRL+SWID
Subjt:  RKKYEFMRSKMKWYAVQYTTKIAGLRYLEEIWQLRDDALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANALLRKNWPESTIVKFSDQPRLKSWID

Query:  QERSIIFYGGKDLDWIQRFEEKVVDIKNDRSMRDNGISFEIVRIGNN--MIKVEDMSRFWIIQWGFFVIKSQLTGSSASETTEDILRLISYENENGWAIL
        Q RSIIFYGGKD DWIQ FEEKVVDIKNDRSMR++GI+FEIVRIGNN        MSRFWI QWGFFV+KSQLTGSSASETTEDILRLISYENENGWAIL
Subjt:  QERSIIFYGGKDLDWIQRFEEKVVDIKNDRSMRDNGISFEIVRIGNN--MIKVEDMSRFWIIQWGFFVIKSQLTGSSASETTEDILRLISYENENGWAIL

Query:  AVGSAPLLVGRGNLILAVFEDFNLWKKSLNMKGFPNSFKDHFNELALRTHKCERVTLPGFSGWIPMIVNCPECPRFMETGINFNCCHGRVPN
         VGSAPLLVGRGNLIL VFEDFN WKK+LN+KGFPN+F+D+FNE+AL TH+CERVTLPGFSGWIPMIVNCPECPRFMETGINFNCCHGR PN
Subjt:  AVGSAPLLVGRGNLILAVFEDFNLWKKSLNMKGFPNSFKDHFNELALRTHKCERVTLPGFSGWIPMIVNCPECPRFMETGINFNCCHGRVPN

SwissProt top hitse value%identityAlignment
Q93XX2 Protein SIEVE ELEMENT OCCLUSION A6.9e-4524.01Show/hide
Query:  HKEELSLKNLSDDVVAAHVYSKHREDDRVKIDVDNYILFLESLF-ANVAKINEAASQGTHEGRAMYLEDSFKANVIVDPPVDILEKVSIKVLFILL----
        H+    + +LSDD V A    K    D +  DV + +  +  +F ++V  I+ +A +      ++  +D +  +   +   D+++++S ++    L    
Subjt:  HKEELSLKNLSDDVVAAHVYSKHREDDRVKIDVDNYILFLESLF-ANVAKINEAASQGTHEGRAMYLEDSFKANVIVDPPVDILEKVSIKVLFILL----

Query:  --AFKTPG--IEKAHQTTLEILDILISYSWEAKAILSLAAFATDYGVIWHLNNYSHFDPLAKSLANIRQSASLKKHLDSFKYRQVVFSSRSIVYLCLQVI
             T G  ++  + TT  +L ++  Y W+AK +L L+A A  YGV   L      + L KSLA I+Q  S             +FS ++ ++  L   
Subjt:  --AFKTPG--IEKAHQTTLEILDILISYSWEAKAILSLAAFATDYGVIWHLNNYSHFDPLAKSLANIRQSASLKKHLDSFKYRQVVFSSRSIVYLCLQVI

Query:  KLMNQVRL---FSKYDSKEIP--ELSSALR-QIPLFTYWVIHSIVACRTEISSYLANTESQSQIY-----LNELSERLNAILNILEDHLN----------
        +++ Q  +    +  D  ++P   +++A    IP   YW++  ++ C + IS      + Q   +     ++E SERL  I   L +             
Subjt:  KLMNQVRL---FSKYDSKEIP--ELSSALR-QIPLFTYWVIHSIVACRTEISSYLANTESQSQIY-----LNELSERLNAILNILEDHLN----------

Query:  IFREQQEEIDLYRWLIDHIDQFPTEITLV--VSKLLEGKADAKPLINCSTLKEETIEDSLKGKNVILLISGLDISNDDIRALSLVYDELKREDDYKIVWI
        I  E QE I  +  +I H+D  P  + L+  +  L  G   +K  +  + L +         K+V+LLIS L+    ++  L  +Y E  ++  ++I+W+
Subjt:  IFREQQEEIDLYRWLIDHIDQFPTEITLV--VSKLLEGKADAKPLINCSTLKEETIEDSLKGKNVILLISGLDISNDDIRALSLVYDELKREDDYKIVWI

Query:  PMINLDIHGEESRKKYEFMRSKMKWYAVQYTTKI--AGLRYLEEIWQLRDDALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANAL-LRKNWPEST
        P+   D   E    K+E +   M+WY +    K+  A +R++ E W  ++  ++V L+ K +V  +NA  ++ +W   A PF+T R   L   + W    
Subjt:  PMINLDIHGEESRKKYEFMRSKMKWYAVQYTTKI--AGLRYLEEIWQLRDDALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANAL-LRKNWPEST

Query:  IVKFSDQPRLKSWIDQERSIIFYGGKDLDWIQRFEEKVVDIKNDRSMRDNGISFEIVRIGN-----------NMIKVEDMSR----------FWIIQWGF
        ++  +D   L   +D  + I  YGG+D+ WI+ F     ++      +   I  E+V +G            N I+ E++S           FW      
Subjt:  IVKFSDQPRLKSWIDQERSIIFYGGKDLDWIQRFEEKVVDIKNDRSMRDNGISFEIVRIGN-----------NMIKVEDMSR----------FWIIQWGF

Query:  FVIKSQLTGSSASETTE------------DILRLISYENE-NGWAILAVGSAPLLVGRGNLILAVFEDFNLWKKSLNMKGFPNSFKDHFNELALRTHKCE
        +  K ++  +   +  E            +++ ++ Y  E +GW +++  S  ++  +GNL      +FN W+ ++  KGF  +  DH   + L  H C 
Subjt:  FVIKSQLTGSSASETTE------------DILRLISYENE-NGWAILAVGSAPLLVGRGNLILAVFEDFNLWKKSLNMKGFPNSFKDHFNELALRTHKCE

Query:  RVTLPGFSGWIPMIVNCPECPRFMETGINFNCC
        R  LP  +G IP  V C EC R ME    + CC
Subjt:  RVTLPGFSGWIPMIVNCPECPRFMETGINFNCC

Q9FXE2 Protein SIEVE ELEMENT OCCLUSION C3.5e-2022.1Show/hide
Query:  GIEKAHQTTLEILDILISYSWEAKAILSLAAFATDYGVIWHLNNYSHFDPLAKSLANIRQSASLKKHLDSFKYRQVVFSSRSIVYLCLQVIK-LMNQVRL
        G  +  + T+ + D+L  Y W+AKA+L L   A  YG +    + +  DP+A S+A + Q       ++  K+R  + S   ++   + V K ++   ++
Subjt:  GIEKAHQTTLEILDILISYSWEAKAILSLAAFATDYGVIWHLNNYSHFDPLAKSLANIRQSASLKKHLDSFKYRQVVFSSRSIVYLCLQVIK-LMNQVRL

Query:  FSKYDSKEIPELSSALRQIPLFTYWVIHSIVACRTEISSY------------LANTESQSQIYLNELSERLNAILNILEDHLNIFREQQEEIDLYRWLID
          K    +   L   L  I L TY V+ S + C  +I  +             A    +S+    ELS     +LNI   H  + ++ +   D    + +
Subjt:  FSKYDSKEIPELSSALRQIPLFTYWVIHSIVACRTEISSY------------LANTESQSQIYLNELSERLNAILNILEDHLNIFREQQEEIDLYRWLID

Query:  HIDQFPTEITLV-------VSKLLEGKADAKPLINCSTLKEETIEDSLKGKNVILLISGLDISNDDIRALSLVYD---ELKREDDYKIVWIPMINLDIHG
         I+Q    I +        V  LL    D  PL   S     T    ++ K  +LL+S   +       L  +YD       E +Y+I+W+P+ +     
Subjt:  HIDQFPTEITLV-------VSKLLEGKADAKPLINCSTLKEETIEDSLKGKNVILLISGLDISNDDIRALSLVYD---ELKREDDYKIVWIPMINLDIHG

Query:  EESRKKYEFMRSKMKWYAVQYTTKIAG--LRYLEEIWQLRD-DALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANALLRKNWPESTIVKFSDQPR
        +E ++ ++F  + + W +V+    ++   L + ++ W  +D +A++VV++S  +    NA+ ++ +WG+ A PFS  R + L +++     ++     P 
Subjt:  EESRKKYEFMRSKMKWYAVQYTTKIAG--LRYLEEIWQLRD-DALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANALLRKNWPESTIVKFSDQPR

Query:  LKSWIDQERSIIFYGGKDLDWIQRFEEKVVDIKNDRSMRDNGISFEIVRIGNNMIKVEDMSR------------FWIIQWGFFVIKSQ---LTGSSASET
         +      R I  +G ++LDWI  F      +   R +++ G   E++ + N       M              FW+        K +   +  S     
Subjt:  LKSWIDQERSIIFYGGKDLDWIQRFEEKVVDIKNDRSMRDNGISFEIVRIGNNMIKVEDMSR------------FWIIQWGFFVIKSQ---LTGSSASET

Query:  TEDILRLI--SYENENGWAILAVGS
         E++  L+   Y    GW I+  GS
Subjt:  TEDILRLI--SYENENGWAILAVGS

Q9SS87 Protein SIEVE ELEMENT OCCLUSION B5.1e-5625.22Show/hide
Query:  LESLFANVAKINEAASQGTHEGRAMYLEDSFKANVIVDPPVDILEKVSIKV--LFILLAFKTPGIEKAHQTTLEILDILISYSWEAKAILSLAAFATDYG
        +  L + V  I + A+  + +  A  L    +  ++    + +L+ VS  +  +   +A+K+     +H+ T+ + + L S+ W+ K +L+LAAFA +YG
Subjt:  LESLFANVAKINEAASQGTHEGRAMYLEDSFKANVIVDPPVDILEKVSIKV--LFILLAFKTPGIEKAHQTTLEILDILISYSWEAKAILSLAAFATDYG

Query:  VIWHLNNYSHFDPLAKSLANIR-QSASLKKHLDSFK--YRQVVFSSRSIVYLCLQVIKLMNQVRLFSKYDSKEIPELSSALRQIPLFTYWVIHSIVACRT
          W L  +   + LAKSLA ++      +  L+S       ++   +S+    +++ +L +      +Y + ++P+LS  L  IP+  YW I S++AC +
Subjt:  VIWHLNNYSHFDPLAKSLANIR-QSASLKKHLDSFK--YRQVVFSSRSIVYLCLQVIKLMNQVRLFSKYDSKEIPELSSALRQIPLFTYWVIHSIVACRT

Query:  EIS--SYLANTESQSQIYLNELSERLNAILNILEDHLN-----IFR--EQQEEIDLYRWLIDHIDQFPTEITLVVSKLLEGKADAKPLINCSTLKEETIE
        +I+  + + +    +Q+ L E S   N + NI  DHL       +R  E+Q   +  + L    D    +   +++ L+  K    PL +  T K +   
Subjt:  EIS--SYLANTESQSQIYLNELSERLNAILNILEDHLN-----IFR--EQQEEIDLYRWLIDHIDQFPTEITLVVSKLLEGKADAKPLINCSTLKEETIE

Query:  DSLKGKNVILLISGLDISNDDIRALSLVYDELKR---------EDDYKIVWIPMINLDIHGEES---RKKYEFMRSKMKWYAVQYTTKIAG--LRYLEEI
        D L+ K V+LLIS L+I  D++     +Y E +R            Y++VW+P+++     E S   +KK+E +R  M WY+V     I    + ++   
Subjt:  DSLKGKNVILLISGLDISNDDIRALSLVYDELKR---------EDDYKIVWIPMINLDIHGEES---RKKYEFMRSKMKWYAVQYTTKIAG--LRYLEEI

Query:  WQLRDDALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANALLRKNWPESTIVKFSDQPRLKSWIDQERSIIFYGGKDLDWIQRFEEKVVDIKNDRS
        W   +  ++VV++ +      NA+H++ +WG  A PF+  R   L R+      ++       + +WI  +  I  YGG DLDWI+RF      +    +
Subjt:  WQLRDDALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANALLRKNWPESTIVKFSDQPRLKSWIDQERSIIFYGGKDLDWIQRFEEKVVDIKNDRS

Query:  MRDNGISFEIVRIG-------------NNMIKVEDMSRFW---IIQWGFFV-IKSQL-------TGSSASETTEDILRLISYENENGWAILAVGSAPLLV
         +D+ ++ E+  +G             + +I+ E++S  W    + W F+  ++S L             +  + I +++SY+   GWA+L+ G   +++
Subjt:  MRDNGISFEIVRIG-------------NNMIKVEDMSRFW---IIQWGFFV-IKSQL-------TGSSASETTEDILRLISYENENGWAILAVGSAPLLV

Query:  GRGNLILAVFEDFNLWKKSLNMKGFPNSFKDHFNELALRT--HKCERVT--LPGFSGWIPMIVNCPECPRFMETGINFNCCH
          G +   +      WK  +  KG+  +  DH ++  LR     C      +   SG IP  +NC EC R ME  ++F+CCH
Subjt:  GRGNLILAVFEDFNLWKKSLNMKGFPNSFKDHFNELALRT--HKCERVT--LPGFSGWIPMIVNCPECPRFMETGINFNCCH

Arabidopsis top hitse value%identityAlignment
AT1G67790.1 unknown protein3.3e-1819.96Show/hide
Query:  GIEKAHQTTLEILDILISYSWEAKAILSLAAFATDYGVIWHLNNYSHFDPLAKSLANIRQSASLKKHLDSFKYRQVVFSSRSIVYLCLQVIK-LMNQVRL
        G  +  + T+ + D+L  Y W+AKA+L L   A  YG +    + +  DP+A S+A + Q       ++  K+R  + S   ++   + V K ++   ++
Subjt:  GIEKAHQTTLEILDILISYSWEAKAILSLAAFATDYGVIWHLNNYSHFDPLAKSLANIRQSASLKKHLDSFKYRQVVFSSRSIVYLCLQVIK-LMNQVRL

Query:  FSKYDSKEIPELSSALRQIPLFTYWVIHSIVACRTEISSYLANTESQSQIYLNELSERLNAILNILEDHLNIFREQQEEIDLYRWLIDHIDQFPTEITLV
          K    +   L   L  I L TY V+ S + C  +I  +                                  +Q ++I +            TE+   
Subjt:  FSKYDSKEIPELSSALRQIPLFTYWVIHSIVACRTEISSYLANTESQSQIYLNELSERLNAILNILEDHLNIFREQQEEIDLYRWLIDHIDQFPTEITLV

Query:  VSKLLEGKADAKPLINCSTLKEETIEDSLKGKNVILLISGLDISNDDIRALSLVYD---ELKREDDYKIVWIPMINLDIHGEESRKKYEFMRSKMKWYAV
        V+ LL  K   +PL                                    L  +YD       E +Y+I+W+P+ +     +E ++ ++F  + + W +V
Subjt:  VSKLLEGKADAKPLINCSTLKEETIEDSLKGKNVILLISGLDISNDDIRALSLVYD---ELKREDDYKIVWIPMINLDIHGEESRKKYEFMRSKMKWYAV

Query:  QYTTKIAG--LRYLEEIWQLRD-DALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANALLRKNWPESTIVKFSDQPRLKSWIDQERSIIFYGGKDL
        +    ++   L + ++ W  +D +A++VV++S  +    NA+ ++ +WG+ A PFS  R + L +++     ++     P  +      R I  +G ++L
Subjt:  QYTTKIAG--LRYLEEIWQLRD-DALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANALLRKNWPESTIVKFSDQPRLKSWIDQERSIIFYGGKDL

Query:  DWIQRFEEKVVDIKNDRSMRDNGISFEIVRIGNNMIKVEDMSR------------FWIIQWGFFVIKSQ---LTGSSASETTEDILRLI--SYENENGWA
        DWI  F      +   R +++ G   E++ + N       M              FW+        K +   +  S      E++  L+   Y    GW 
Subjt:  DWIQRFEEKVVDIKNDRSMRDNGISFEIVRIGNNMIKVEDMSR------------FWIIQWGFFVIKSQ---LTGSSASETTEDILRLI--SYENENGWA

Query:  ILAVGS
        I+  GS
Subjt:  ILAVGS

AT3G01670.1 unknown protein4.9e-4624.01Show/hide
Query:  HKEELSLKNLSDDVVAAHVYSKHREDDRVKIDVDNYILFLESLF-ANVAKINEAASQGTHEGRAMYLEDSFKANVIVDPPVDILEKVSIKVLFILL----
        H+    + +LSDD V A    K    D +  DV + +  +  +F ++V  I+ +A +      ++  +D +  +   +   D+++++S ++    L    
Subjt:  HKEELSLKNLSDDVVAAHVYSKHREDDRVKIDVDNYILFLESLF-ANVAKINEAASQGTHEGRAMYLEDSFKANVIVDPPVDILEKVSIKVLFILL----

Query:  --AFKTPG--IEKAHQTTLEILDILISYSWEAKAILSLAAFATDYGVIWHLNNYSHFDPLAKSLANIRQSASLKKHLDSFKYRQVVFSSRSIVYLCLQVI
             T G  ++  + TT  +L ++  Y W+AK +L L+A A  YGV   L      + L KSLA I+Q  S             +FS ++ ++  L   
Subjt:  --AFKTPG--IEKAHQTTLEILDILISYSWEAKAILSLAAFATDYGVIWHLNNYSHFDPLAKSLANIRQSASLKKHLDSFKYRQVVFSSRSIVYLCLQVI

Query:  KLMNQVRL---FSKYDSKEIP--ELSSALR-QIPLFTYWVIHSIVACRTEISSYLANTESQSQIY-----LNELSERLNAILNILEDHLN----------
        +++ Q  +    +  D  ++P   +++A    IP   YW++  ++ C + IS      + Q   +     ++E SERL  I   L +             
Subjt:  KLMNQVRL---FSKYDSKEIP--ELSSALR-QIPLFTYWVIHSIVACRTEISSYLANTESQSQIY-----LNELSERLNAILNILEDHLN----------

Query:  IFREQQEEIDLYRWLIDHIDQFPTEITLV--VSKLLEGKADAKPLINCSTLKEETIEDSLKGKNVILLISGLDISNDDIRALSLVYDELKREDDYKIVWI
        I  E QE I  +  +I H+D  P  + L+  +  L  G   +K  +  + L +         K+V+LLIS L+    ++  L  +Y E  ++  ++I+W+
Subjt:  IFREQQEEIDLYRWLIDHIDQFPTEITLV--VSKLLEGKADAKPLINCSTLKEETIEDSLKGKNVILLISGLDISNDDIRALSLVYDELKREDDYKIVWI

Query:  PMINLDIHGEESRKKYEFMRSKMKWYAVQYTTKI--AGLRYLEEIWQLRDDALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANAL-LRKNWPEST
        P+   D   E    K+E +   M+WY +    K+  A +R++ E W  ++  ++V L+ K +V  +NA  ++ +W   A PF+T R   L   + W    
Subjt:  PMINLDIHGEESRKKYEFMRSKMKWYAVQYTTKI--AGLRYLEEIWQLRDDALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANAL-LRKNWPEST

Query:  IVKFSDQPRLKSWIDQERSIIFYGGKDLDWIQRFEEKVVDIKNDRSMRDNGISFEIVRIGN-----------NMIKVEDMSR----------FWIIQWGF
        ++  +D   L   +D  + I  YGG+D+ WI+ F     ++      +   I  E+V +G            N I+ E++S           FW      
Subjt:  IVKFSDQPRLKSWIDQERSIIFYGGKDLDWIQRFEEKVVDIKNDRSMRDNGISFEIVRIGN-----------NMIKVEDMSR----------FWIIQWGF

Query:  FVIKSQLTGSSASETTE------------DILRLISYENE-NGWAILAVGSAPLLVGRGNLILAVFEDFNLWKKSLNMKGFPNSFKDHFNELALRTHKCE
        +  K ++  +   +  E            +++ ++ Y  E +GW +++  S  ++  +GNL      +FN W+ ++  KGF  +  DH   + L  H C 
Subjt:  FVIKSQLTGSSASETTE------------DILRLISYENE-NGWAILAVGSAPLLVGRGNLILAVFEDFNLWKKSLNMKGFPNSFKDHFNELALRTHKCE

Query:  RVTLPGFSGWIPMIVNCPECPRFMETGINFNCC
        R  LP  +G IP  V C EC R ME    + CC
Subjt:  RVTLPGFSGWIPMIVNCPECPRFMETGINFNCC

AT3G01680.1 CONTAINS InterPro DOMAIN/s: Mediator complex subunit Med28 (InterPro:IPR021640)3.6e-5725.22Show/hide
Query:  LESLFANVAKINEAASQGTHEGRAMYLEDSFKANVIVDPPVDILEKVSIKV--LFILLAFKTPGIEKAHQTTLEILDILISYSWEAKAILSLAAFATDYG
        +  L + V  I + A+  + +  A  L    +  ++    + +L+ VS  +  +   +A+K+     +H+ T+ + + L S+ W+ K +L+LAAFA +YG
Subjt:  LESLFANVAKINEAASQGTHEGRAMYLEDSFKANVIVDPPVDILEKVSIKV--LFILLAFKTPGIEKAHQTTLEILDILISYSWEAKAILSLAAFATDYG

Query:  VIWHLNNYSHFDPLAKSLANIR-QSASLKKHLDSFK--YRQVVFSSRSIVYLCLQVIKLMNQVRLFSKYDSKEIPELSSALRQIPLFTYWVIHSIVACRT
          W L  +   + LAKSLA ++      +  L+S       ++   +S+    +++ +L +      +Y + ++P+LS  L  IP+  YW I S++AC +
Subjt:  VIWHLNNYSHFDPLAKSLANIR-QSASLKKHLDSFK--YRQVVFSSRSIVYLCLQVIKLMNQVRLFSKYDSKEIPELSSALRQIPLFTYWVIHSIVACRT

Query:  EIS--SYLANTESQSQIYLNELSERLNAILNILEDHLN-----IFR--EQQEEIDLYRWLIDHIDQFPTEITLVVSKLLEGKADAKPLINCSTLKEETIE
        +I+  + + +    +Q+ L E S   N + NI  DHL       +R  E+Q   +  + L    D    +   +++ L+  K    PL +  T K +   
Subjt:  EIS--SYLANTESQSQIYLNELSERLNAILNILEDHLN-----IFR--EQQEEIDLYRWLIDHIDQFPTEITLVVSKLLEGKADAKPLINCSTLKEETIE

Query:  DSLKGKNVILLISGLDISNDDIRALSLVYDELKR---------EDDYKIVWIPMINLDIHGEES---RKKYEFMRSKMKWYAVQYTTKIAG--LRYLEEI
        D L+ K V+LLIS L+I  D++     +Y E +R            Y++VW+P+++     E S   +KK+E +R  M WY+V     I    + ++   
Subjt:  DSLKGKNVILLISGLDISNDDIRALSLVYDELKR---------EDDYKIVWIPMINLDIHGEES---RKKYEFMRSKMKWYAVQYTTKIAG--LRYLEEI

Query:  WQLRDDALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANALLRKNWPESTIVKFSDQPRLKSWIDQERSIIFYGGKDLDWIQRFEEKVVDIKNDRS
        W   +  ++VV++ +      NA+H++ +WG  A PF+  R   L R+      ++       + +WI  +  I  YGG DLDWI+RF      +    +
Subjt:  WQLRDDALMVVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANALLRKNWPESTIVKFSDQPRLKSWIDQERSIIFYGGKDLDWIQRFEEKVVDIKNDRS

Query:  MRDNGISFEIVRIG-------------NNMIKVEDMSRFW---IIQWGFFV-IKSQL-------TGSSASETTEDILRLISYENENGWAILAVGSAPLLV
         +D+ ++ E+  +G             + +I+ E++S  W    + W F+  ++S L             +  + I +++SY+   GWA+L+ G   +++
Subjt:  MRDNGISFEIVRIG-------------NNMIKVEDMSRFW---IIQWGFFV-IKSQL-------TGSSASETTEDILRLISYENENGWAILAVGSAPLLV

Query:  GRGNLILAVFEDFNLWKKSLNMKGFPNSFKDHFNELALRT--HKCERVT--LPGFSGWIPMIVNCPECPRFMETGINFNCCH
          G +   +      WK  +  KG+  +  DH ++  LR     C      +   SG IP  +NC EC R ME  ++F+CCH
Subjt:  GRGNLILAVFEDFNLWKKSLNMKGFPNSFKDHFNELALRT--HKCERVT--LPGFSGWIPMIVNCPECPRFMETGINFNCCH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGTCCTAGCTCCCAAAATTCCCTCCACCCCTTTGGCCAATCCCAGGCTGCAAACTCACAAGGAAGAGTTGAGTTTGAAAAACTTATCAGACGATGTTGTTGCAGC
TCATGTTTATTCTAAACACCGTGAAGATGATAGGGTCAAAATTGATGTCGATAATTACATTTTATTTCTTGAAAGTCTTTTTGCGAATGTTGCTAAAATCAATGAAGCTG
CTTCTCAAGGTACTCATGAAGGGCGTGCAATGTATTTAGAAGATTCCTTTAAAGCTAATGTCATCGTTGATCCACCCGTTGATATTCTTGAGAAAGTTTCTATCAAGGTG
TTATTTATATTATTGGCTTTCAAGACTCCTGGGATAGAGAAAGCTCACCAAACAACTCTTGAAATCCTTGACATACTGATTAGCTATTCTTGGGAAGCTAAGGCAATTCT
GAGTCTTGCAGCATTTGCTACTGATTATGGAGTTATATGGCACCTTAACAATTACTCTCATTTTGACCCACTTGCCAAATCCTTGGCAAACATTCGCCAAAGTGCTTCTT
TGAAAAAGCACTTGGACTCTTTCAAATATCGACAAGTAGTTTTCAGCTCAAGGAGTATAGTTTACTTGTGTTTACAAGTGATCAAACTCATGAATCAAGTGAGACTTTTC
TCCAAATATGATTCCAAGGAAATTCCTGAGTTGTCTTCTGCCCTTCGTCAAATTCCATTGTTTACTTATTGGGTTATACACAGTATTGTTGCTTGTAGAACTGAAATCTC
AAGCTATCTGGCTAATACCGAGAGTCAATCACAAATATACTTGAATGAATTGAGTGAGAGACTCAATGCCATCCTCAACATACTTGAAGACCATCTGAATATTTTTCGAG
AACAACAAGAGGAAATTGATCTATATCGATGGCTTATTGATCATATTGACCAGTTTCCTACTGAAATAACATTAGTCGTGTCTAAGCTGCTTGAAGGCAAGGCTGATGCC
AAGCCTTTAATAAATTGTTCCACTCTGAAAGAGGAAACAATTGAAGATTCATTGAAAGGCAAGAATGTGATATTACTCATTTCTGGATTAGACATTTCTAATGATGATAT
CCGAGCTCTGAGTTTAGTGTACGATGAGTTGAAAAGAGAGGACGATTACAAGATTGTATGGATTCCTATGATTAATTTGGATATTCATGGAGAAGAAAGTCGTAAAAAAT
ATGAGTTTATGAGGTCAAAAATGAAATGGTATGCAGTTCAATATACAACAAAAATTGCTGGATTAAGATATTTAGAAGAGATATGGCAACTTAGAGATGATGCTTTAATG
GTTGTCCTCAACTCAAAATCAAAAGTCAAATTTTCAAATGCCATTCACTTGCTTCGTGTTTGGGGAATTAATGCAATCCCTTTTTCCACTGAGAGGGCAAATGCTTTATT
GAGAAAGAATTGGCCAGAATCAACCATTGTGAAATTCAGCGACCAACCAAGATTAAAAAGCTGGATTGATCAAGAGAGGAGTATTATCTTCTATGGAGGAAAAGATCTTG
ATTGGATCCAACGGTTCGAAGAGAAAGTGGTGGACATAAAAAATGATAGATCAATGAGAGACAATGGAATTAGTTTCGAGATTGTACGTATAGGAAACAACATGATCAAA
GTAGAAGACATGTCTCGTTTTTGGATCATACAATGGGGATTTTTTGTAATCAAAAGTCAACTAACAGGTTCAAGTGCGAGTGAAACAACAGAAGACATCTTACGTTTGAT
TTCATACGAAAATGAAAATGGTTGGGCTATTTTAGCTGTCGGTTCAGCTCCTCTACTTGTGGGTCGTGGGAATTTGATTTTGGCAGTGTTTGAAGATTTTAACTTATGGA
AAAAGAGTTTGAATATGAAGGGTTTTCCCAATTCTTTCAAAGATCATTTCAATGAGTTGGCTCTTCGGACTCATAAATGTGAAAGAGTAACTCTTCCAGGTTTTAGTGGA
TGGATTCCTATGATTGTTAATTGTCCTGAGTGTCCTCGCTTCATGGAGACTGGTATTAACTTCAACTGCTGCCACGGTCGGGTTCCTAACAAATGA
mRNA sequenceShow/hide mRNA sequence
GTTTTTCCTGTTGTCTTTACCTTTATGTCATTTTCATTTTAAAATGTTTTATAAATAAGCTATTCATGTTAGGGCAACAAAACACACAAACACATTATTCCCCTTTGTCT
CCTTAGACAAGATCTCCAATCTTTCAGCCTTATACAACCATGTCTGTCCTAGCTCCCAAAATTCCCTCCACCCCTTTGGCCAATCCCAGGCTGCAAACTCACAAGGAAGA
GTTGAGTTTGAAAAACTTATCAGACGATGTTGTTGCAGCTCATGTTTATTCTAAACACCGTGAAGATGATAGGGTCAAAATTGATGTCGATAATTACATTTTATTTCTTG
AAAGTCTTTTTGCGAATGTTGCTAAAATCAATGAAGCTGCTTCTCAAGGTACTCATGAAGGGCGTGCAATGTATTTAGAAGATTCCTTTAAAGCTAATGTCATCGTTGAT
CCACCCGTTGATATTCTTGAGAAAGTTTCTATCAAGGTGTTATTTATATTATTGGCTTTCAAGACTCCTGGGATAGAGAAAGCTCACCAAACAACTCTTGAAATCCTTGA
CATACTGATTAGCTATTCTTGGGAAGCTAAGGCAATTCTGAGTCTTGCAGCATTTGCTACTGATTATGGAGTTATATGGCACCTTAACAATTACTCTCATTTTGACCCAC
TTGCCAAATCCTTGGCAAACATTCGCCAAAGTGCTTCTTTGAAAAAGCACTTGGACTCTTTCAAATATCGACAAGTAGTTTTCAGCTCAAGGAGTATAGTTTACTTGTGT
TTACAAGTGATCAAACTCATGAATCAAGTGAGACTTTTCTCCAAATATGATTCCAAGGAAATTCCTGAGTTGTCTTCTGCCCTTCGTCAAATTCCATTGTTTACTTATTG
GGTTATACACAGTATTGTTGCTTGTAGAACTGAAATCTCAAGCTATCTGGCTAATACCGAGAGTCAATCACAAATATACTTGAATGAATTGAGTGAGAGACTCAATGCCA
TCCTCAACATACTTGAAGACCATCTGAATATTTTTCGAGAACAACAAGAGGAAATTGATCTATATCGATGGCTTATTGATCATATTGACCAGTTTCCTACTGAAATAACA
TTAGTCGTGTCTAAGCTGCTTGAAGGCAAGGCTGATGCCAAGCCTTTAATAAATTGTTCCACTCTGAAAGAGGAAACAATTGAAGATTCATTGAAAGGCAAGAATGTGAT
ATTACTCATTTCTGGATTAGACATTTCTAATGATGATATCCGAGCTCTGAGTTTAGTGTACGATGAGTTGAAAAGAGAGGACGATTACAAGATTGTATGGATTCCTATGA
TTAATTTGGATATTCATGGAGAAGAAAGTCGTAAAAAATATGAGTTTATGAGGTCAAAAATGAAATGGTATGCAGTTCAATATACAACAAAAATTGCTGGATTAAGATAT
TTAGAAGAGATATGGCAACTTAGAGATGATGCTTTAATGGTTGTCCTCAACTCAAAATCAAAAGTCAAATTTTCAAATGCCATTCACTTGCTTCGTGTTTGGGGAATTAA
TGCAATCCCTTTTTCCACTGAGAGGGCAAATGCTTTATTGAGAAAGAATTGGCCAGAATCAACCATTGTGAAATTCAGCGACCAACCAAGATTAAAAAGCTGGATTGATC
AAGAGAGGAGTATTATCTTCTATGGAGGAAAAGATCTTGATTGGATCCAACGGTTCGAAGAGAAAGTGGTGGACATAAAAAATGATAGATCAATGAGAGACAATGGAATT
AGTTTCGAGATTGTACGTATAGGAAACAACATGATCAAAGTAGAAGACATGTCTCGTTTTTGGATCATACAATGGGGATTTTTTGTAATCAAAAGTCAACTAACAGGTTC
AAGTGCGAGTGAAACAACAGAAGACATCTTACGTTTGATTTCATACGAAAATGAAAATGGTTGGGCTATTTTAGCTGTCGGTTCAGCTCCTCTACTTGTGGGTCGTGGGA
ATTTGATTTTGGCAGTGTTTGAAGATTTTAACTTATGGAAAAAGAGTTTGAATATGAAGGGTTTTCCCAATTCTTTCAAAGATCATTTCAATGAGTTGGCTCTTCGGACT
CATAAATGTGAAAGAGTAACTCTTCCAGGTTTTAGTGGATGGATTCCTATGATTGTTAATTGTCCTGAGTGTCCTCGCTTCATGGAGACTGGTATTAACTTCAACTGCTG
CCACGGTCGGGTTCCTAACAAATGATCAGAAACTCAAAGCTACAAAACTTTGATTGCATATTCTACTAAATAATTATCATGTTGTTCTCTTTGTTGCATTTTATGTATGC
CCATTAGGGCAATTTAATTACCACATCAATCTTGAATTTTTATATACTATTATTATTACTACTACTAGCATATTTGTACTAAATAAATGTTGCCATTTGGTTCCTTTTGT
TGCATCTTATGTATGTATGCCCATTAGGGCACATGGTTTACACACCATTTTCAAAATCTCTTCTATGTATGTTATTTGGAG
Protein sequenceShow/hide protein sequence
MSVLAPKIPSTPLANPRLQTHKEELSLKNLSDDVVAAHVYSKHREDDRVKIDVDNYILFLESLFANVAKINEAASQGTHEGRAMYLEDSFKANVIVDPPVDILEKVSIKV
LFILLAFKTPGIEKAHQTTLEILDILISYSWEAKAILSLAAFATDYGVIWHLNNYSHFDPLAKSLANIRQSASLKKHLDSFKYRQVVFSSRSIVYLCLQVIKLMNQVRLF
SKYDSKEIPELSSALRQIPLFTYWVIHSIVACRTEISSYLANTESQSQIYLNELSERLNAILNILEDHLNIFREQQEEIDLYRWLIDHIDQFPTEITLVVSKLLEGKADA
KPLINCSTLKEETIEDSLKGKNVILLISGLDISNDDIRALSLVYDELKREDDYKIVWIPMINLDIHGEESRKKYEFMRSKMKWYAVQYTTKIAGLRYLEEIWQLRDDALM
VVLNSKSKVKFSNAIHLLRVWGINAIPFSTERANALLRKNWPESTIVKFSDQPRLKSWIDQERSIIFYGGKDLDWIQRFEEKVVDIKNDRSMRDNGISFEIVRIGNNMIK
VEDMSRFWIIQWGFFVIKSQLTGSSASETTEDILRLISYENENGWAILAVGSAPLLVGRGNLILAVFEDFNLWKKSLNMKGFPNSFKDHFNELALRTHKCERVTLPGFSG
WIPMIVNCPECPRFMETGINFNCCHGRVPNK