CuGenDBv2

Lag0038951 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0038951
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Regulator of Vps4 activity in the MVB pathway protein
Genome location: chr2:31623764..31628100
RNA-Seq Expression: Lag0038951
Synteny: Lag0038951
Gene Ontology terms:
  GO:0015031 - protein transport (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
  IPR005061 - Vacuolar protein sorting-associated protein Ist1
  IPR042277 - Vacuolar protein sorting-associated protein IST1-like


Homology
GenBank top hits (e value, % identity, alignment)
KAG6607619.1 IST1-like protein, partial [Cucurbita argyrosperma subsp. sororia]; e value 3.3e-256; % identity 81.15
Query:  MSMLNSFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL
        MSMLNSFF++GFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL
Subjt:  MSMLNSFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL

Query:  DLKEAISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL
        DLKEAISSVCFAAPRCADLTELLQVQ+ FAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL
Subjt:  DLKEAISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL

Query:  NGPAQFVSASKLPLPKEKHDETYNATPDLASAPQPDSDSELDMLDFPEVPKMSVSPHPPTV-EAGPPSSLIPPPSVPLPPPESDHDSFKYSG-FPEI-PQ
        NG AQFVSASKLPLPK+KHDETY+ATPDL SAPQPDSDSELD LDFPEVPK+SVS HPPT  +A     LIPPPSV  PPPE+D DSFK SG  PEI PQ
Subjt:  NGPAQFVSASKLPLPKEKHDETYNATPDLASAPQPDSDSELDMLDFPEVPKMSVSPHPPTV-EAGPPSSLIPPPSVPLPPPESDHDSFKYSG-FPEI-PQ

Query:  DLHLRHDEATLVRSVSPSNHEMNISVGEDKQFLPFITPPSLSSSFSHRQTDLSPPSETRTPEEKFGFEPKLEPEINSPPPSVSR----------------
        DLHLRH+E   VRSVSPSN  +NISVGEDKQFLPFITPPSLSSS SHRQTDL P S+++TPEEK GF+P+ E EINSPPPSVSR                
Subjt:  DLHLRHDEATLVRSVSPSNHEMNISVGEDKQFLPFITPPSLSSSFSHRQTDLSPPSETRTPEEKFGFEPKLEPEINSPPPSVSR----------------

Query:  ----------SP-----FKPQFEQEINSPPPSASRTKSEVNVDISVDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKNDHDPEINCENPF
                  SP     FKPQFEQ ++SPPPS SR KSEVNVD+SVDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKND D E++CENPF
Subjt:  ----------SP-----FKPQFEQEINSPPPSASRTKSEVNVDISVDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKNDHDPEINCENPF

Query:  HGATASPPDHKETFQLN----------QQDSFQEHYKVSRNSDVSDHHQLEQQKSGFDSSPGDSPSERIVHPHQPQRLPSMDDDPYFSYPNLFTSQKPNL
        HGA  +PPDHK+TFQ +           QDSF+EH++VS +SDV+ HHQLE QKSGFDSSP +SPSERI HPHQPQRLPSMDDDPYFSYPNLFTSQKPNL
Subjt:  HGATASPPDHKETFQLN----------QQDSFQEHYKVSRNSDVSDHHQLEQQKSGFDSSPGDSPSERIVHPHQPQRLPSMDDDPYFSYPNLFTSQKPNL

Query:  GSDHSSGGGS
        GSD SSGGGS
Subjt:  GSDHSSGGGS

XP_022926259.1 uncharacterized protein LOC111433438 [Cucurbita moschata]; e value 1.8e-254; % identity 80.49
Query:  MSMLNSFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL
        MSMLNSFF++GFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL
Subjt:  MSMLNSFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL

Query:  DLKEAISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL
        DLKEAISSVCFAAPRCADLTELLQVQ+ FAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL
Subjt:  DLKEAISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL

Query:  NGPAQFVSASKLPLPKEKHDETYNATPDLASAPQPDSDSELDMLDFPEVPKMSVSPHPPTV-EAGPPSSLIPPPSVPLPPPESDHDSFKYSG-FPEI-PQ
        NG AQFVSASKLPLPK+KHDETY+ATPDL SAPQPDSDSELD LDFPEVPK+SVS HPPT  +A     LIPPPSV  PPPE+D DSFK SG  PEI PQ
Subjt:  NGPAQFVSASKLPLPKEKHDETYNATPDLASAPQPDSDSELDMLDFPEVPKMSVSPHPPTV-EAGPPSSLIPPPSVPLPPPESDHDSFKYSG-FPEI-PQ

Query:  DLHLRHDEATLVRSVSPSNHEMNISVGEDKQFLPFITPPSLSSSFSHRQTDLSPPSETRTPEEKFGFEPKLEPEINSPPPSVSR----------------
        DLHL+ +E   VRSVSPSN  +N+SVGEDKQFLPFITPPSLSSSFSHRQTD+ P S+++TPEEKFGF+P+ E EINSPPPSVSR                
Subjt:  DLHLRHDEATLVRSVSPSNHEMNISVGEDKQFLPFITPPSLSSSFSHRQTDLSPPSETRTPEEKFGFEPKLEPEINSPPPSVSR----------------

Query:  ----------SP-----FKPQFEQEINSPPPSASRTKSEVNVDISVDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKNDHDPEINCENPF
                  SP     FKPQFEQ ++SPPPS SRTKSEVNVD+SV+LQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKND D E++CENPF
Subjt:  ----------SP-----FKPQFEQEINSPPPSASRTKSEVNVDISVDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKNDHDPEINCENPF

Query:  HGATASPPDHKETFQLN----------QQDSFQEHYKVSRNSDVSDHHQLEQQKSGFDSSPGDSPSERIVHPHQPQRLPSMDDDPYFSYPNLFTSQKPNL
        HGA  +PPDHK+TFQ +           QDSF+EH++VS + DV+ HHQLE QKSGFDSSP +SPSERI HPHQ QRLPSMDDDPYFSYPNLFTSQKPNL
Subjt:  HGATASPPDHKETFQLN----------QQDSFQEHYKVSRNSDVSDHHQLEQQKSGFDSSPGDSPSERIVHPHQPQRLPSMDDDPYFSYPNLFTSQKPNL

Query:  GSDHSSGGGS
        GSD SSGGGS
Subjt:  GSDHSSGGGS

XP_022981875.1 uncharacterized protein LOC111480885 [Cucurbita maxima]; e value 2.3e-257; % identity 81.64
Query:  MSMLNSFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL
        MSMLNSFF+KGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL
Subjt:  MSMLNSFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL

Query:  DLKEAISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL
        DLKEAISSVCFAAPRCADLTELLQVQ+ FAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL
Subjt:  DLKEAISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL

Query:  NGPAQFVSASKLPLPKEKHDETYNATPDLASAPQPDSDSELDMLDFPEVPKMSVSPHPPTV-EAGPPSSLIPPPSVPLPPPESDHDSFKYSG-FPEI-PQ
        NG AQFVSASKLPLPK+KHDETY+ATPDL SAPQPDSDSELD LDFPEVPK+SVS HPPT  +A     LIPPPSV  PPPE+D DSFK SG  PEI PQ
Subjt:  NGPAQFVSASKLPLPKEKHDETYNATPDLASAPQPDSDSELDMLDFPEVPKMSVSPHPPTV-EAGPPSSLIPPPSVPLPPPESDHDSFKYSG-FPEI-PQ

Query:  DLHLRHDEATLVRSVSPSNHEMNISVGEDKQFLPFITPPSLSSSFSHRQTDLSPPSETRTPEEKFGFEPKLEPEINSPPPSVSR----------------
        DLHLRH+E   VRSVSPSN  +NISVGEDKQFLPFITPPSLSSSFSHRQTDL P S+++TPEEKFGF+P+ E EINSPP SVSR                
Subjt:  DLHLRHDEATLVRSVSPSNHEMNISVGEDKQFLPFITPPSLSSSFSHRQTDLSPPSETRTPEEKFGFEPKLEPEINSPPPSVSR----------------

Query:  ----------SP-----FKPQFEQEINSPPPSASRTKSEVNVDISVDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKNDHDPEINCENPF
                  SP     FKPQFEQ ++SPPPS SRTKSEVNVD+SVDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKND D E++CENPF
Subjt:  ----------SP-----FKPQFEQEINSPPPSASRTKSEVNVDISVDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKNDHDPEINCENPF

Query:  HGATASPPDHKETFQLN----------QQDSFQEHYKVSRNSDVSDHHQLEQQKSGFDSSPGDSPSERIVHPHQPQRLPSMDDDPYFSYPNLFTSQKPNL
        HGA  +PPDHK+TFQ +           QDSF+EH++VS +SDV+ HHQLE QKSGFDSSP +SPSERI HPHQPQRLPSMDDDPYFSYPNLFTSQKPNL
Subjt:  HGATASPPDHKETFQLN----------QQDSFQEHYKVSRNSDVSDHHQLEQQKSGFDSSPGDSPSERIVHPHQPQRLPSMDDDPYFSYPNLFTSQKPNL

Query:  GSDHSSGGGS
        GSD SSGGGS
Subjt:  GSDHSSGGGS

XP_023521005.1 uncharacterized protein LOC111784587 [Cucurbita pepo subsp. pepo]; e value 7.8e-258; % identity 81.48
Query:  MSMLNSFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL
        MSMLNSFF++GFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL
Subjt:  MSMLNSFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL

Query:  DLKEAISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL
        DLKEAISSVCFAAPRCADLTELLQVQ+ FAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL
Subjt:  DLKEAISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL

Query:  NGPAQFVSASKLPLPKEKHDETYNATPDLASAPQPDSDSELDMLDFPEVPKMSVSPHPPTV-EAGPPSSLIPPPSVPLPPPESDHDSFKYSG-FPEI-PQ
        NG AQFVSASKLPLPK+KHDETY+ATPDL SAPQPDSDSELD LDFPEVPK+SVS HPPT  +A     LIPPPSV  PPPE+D DSFK SG  PEI PQ
Subjt:  NGPAQFVSASKLPLPKEKHDETYNATPDLASAPQPDSDSELDMLDFPEVPKMSVSPHPPTV-EAGPPSSLIPPPSVPLPPPESDHDSFKYSG-FPEI-PQ

Query:  DLHLRHDEATLVRSVSPSNHEMNISVGEDKQFLPFITPPSLSSSFSHRQTDLSPPSETRTPEEKFGFEPKLEPEINSPPPSVSR----------------
        DLHLRH+E   VRSVSPSN  +NISVGEDKQFLPFITPPSLSSSFSHRQTDL P S+++TPEEK GF+P+ E EINSPPPSVSR                
Subjt:  DLHLRHDEATLVRSVSPSNHEMNISVGEDKQFLPFITPPSLSSSFSHRQTDLSPPSETRTPEEKFGFEPKLEPEINSPPPSVSR----------------

Query:  ----------SP-----FKPQFEQEINSPPPSASRTKSEVNVDISVDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKNDHDPEINCENPF
                  SP     FKPQFEQ ++SPPPS SRTKSEVNVD+SVDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKND D E++CENPF
Subjt:  ----------SP-----FKPQFEQEINSPPPSASRTKSEVNVDISVDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKNDHDPEINCENPF

Query:  HGATASPPDHKETFQLN----------QQDSFQEHYKVSRNSDVSDHHQLEQQKSGFDSSPGDSPSERIVHPHQPQRLPSMDDDPYFSYPNLFTSQKPNL
        HGA  +PPDHK+TFQ +           QDSF+EH++VS +SDV+ HHQLE QKSGFDSSP +SPSERI HPHQPQRLPSMDDDPYFSYPNLFTSQKPNL
Subjt:  HGATASPPDHKETFQLN----------QQDSFQEHYKVSRNSDVSDHHQLEQQKSGFDSSPGDSPSERIVHPHQPQRLPSMDDDPYFSYPNLFTSQKPNL

Query:  GSDHSSGGGS
        GSD SSGGGS
Subjt:  GSDHSSGGGS

XP_038884985.1 uncharacterized protein LOC120075564 [Benincasa hispida]; e value 1.3e-252; % identity 82.28
Query:  MSMLNSFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL
        MSMLNSFF+KGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL
Subjt:  MSMLNSFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL

Query:  DLKEAISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL
        DLKEAISSVCFAAPRCADLTELLQVQ+ FAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAE+H+LDW+PA TEAEFNKSPEDLL
Subjt:  DLKEAISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL

Query:  NGPAQFVSASKLPLPKEKHDETYNATPDLASAPQPDSDSELDMLDFPEVPKMSVSPHPPTVEAGPPSSLIPPPSVPLPPPESDHDSFKYSGFPEI-PQDL
        NG AQFVSASKLPLP+E H+E YNATPDLAS+PQPDSDSELDMLDFPEVPK+SV P P T +AG  SS++PPPSVP P  E++H S KYS  PE  PQDL
Subjt:  NGPAQFVSASKLPLPKEKHDETYNATPDLASAPQPDSDSELDMLDFPEVPKMSVSPHPPTVEAGPPSSLIPPPSVPLPPPESDHDSFKYSGFPEI-PQDL

Query:  HLRHDEATLVRSVSPSNHEMNISVGEDKQFLPFITPPSLSSSFSHRQTDLSPPSETRTPEEKFGFEPKLEPEINSPPPSVSRSP-----FKPQFEQEINS
        HLRH+E T VRSVSPS  +MNISVGEDKQFLPFITPPSLSSSFSHRQTD  PPS++  PEEKFGF+P++E EINSP PSVSR+      FKP+FE EIN+
Subjt:  HLRHDEATLVRSVSPSNHEMNISVGEDKQFLPFITPPSLSSSFSHRQTDLSPPSETRTPEEKFGFEPKLEPEINSPPPSVSRSP-----FKPQFEQEINS

Query:  PPPSASRTKSEVNVDISVDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKNDHDPEINCENPFHGATASPPDHKETFQLNQ----------
        PP S SRT SEVN D+ VDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKND DPEI+CENPFHGATAS PDHKETFQLNQ          
Subjt:  PPPSASRTKSEVNVDISVDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKNDHDPEINCENPFHGATASPPDHKETFQLNQ----------

Query:  -----QDSFQEHYKVSRNSDVSDHHQLEQQKSGFDSSPGDSPSERIVHPHQPQRLPSMDDDPYFSYPNLFTSQKPNLGSDHSSGGGS
             QDSFQE + VS + DVS+HHQ E +K GFDSSPGDSPS  I  PHQPQRLPSMDDDPYFSYPNLFT QKPN GSDH S GGS
Subjt:  -----QDSFQEHYKVSRNSDVSDHHQLEQQKSGFDSSPGDSPSERIVHPHQPQRLPSMDDDPYFSYPNLFTSQKPNLGSDHSSGGGS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KBZ0 Uncharacterized protein; e value 3.9e-247; % identity 80.61
Query:  MSMLNSFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL
        MSMLNSFF+KGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL
Subjt:  MSMLNSFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL

Query:  DLKEAISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL
        DLKEAISSVCFAAPRCADLTEL+QVQ+ F AKYGKEF+SAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDW+PA TEAEFNKSPEDLL
Subjt:  DLKEAISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL

Query:  NGPAQFVSASKLPLPKEKHDETYNATPDLASAPQPDSDSELDMLDFPEVPKMSVSPHPPTVEAGPPSSLIPPPSVPLPPPESDHDSFKYSGFPEI-PQDL
        NG  QFV ASKLPLP+EKH+ET N T DLAS PQPDSDSELDMLDFPEVPKMSV P  PT++AG   S+IPPPSV  PP E+DH SF+YSG PE  PQ+L
Subjt:  NGPAQFVSASKLPLPKEKHDETYNATPDLASAPQPDSDSELDMLDFPEVPKMSVSPHPPTVEAGPPSSLIPPPSVPLPPPESDHDSFKYSGFPEI-PQDL

Query:  HLRHDEATLVRSVSPSNHEMNISVGEDKQFLPFITPPSLSSSFSHRQTDLSPPSETRTPEEKFGFEPKLEPEINSPPPSVSRSP-----FKPQFEQEINS
        H RH+E TLVRSVSPSN +MN+SVGEDKQFLPFITPPSLS SFS RQT+LSP S+  TPEEKF  +P++  EINSP PSVSR+      FKP+FE EINS
Subjt:  HLRHDEATLVRSVSPSNHEMNISVGEDKQFLPFITPPSLSSSFSHRQTDLSPPSETRTPEEKFGFEPKLEPEINSPPPSVSRSP-----FKPQFEQEINS

Query:  PPPSASRTKSEVNVDISVDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKNDHDPEINCENPFHGATASPPDHKETFQLNQ----------
         P S SRT SEVN D+ VDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKND DPEI+CENPFHG TASP DH ET++LNQ          
Subjt:  PPPSASRTKSEVNVDISVDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKNDHDPEINCENPFHGATASPPDHKETFQLNQ----------

Query:  -----QDSFQEHYKVSRNSDVSDHHQLEQQKSGFDSSPGDSPSE-RIVHPHQPQRLPSMDDDPYFSYPNLFTSQKPNLGSDHSSGGGS
             QDSFQEH+K+SR+ +VSDH QL+ +KS FD SPG+SPSE + V PHQPQRLPSMDDDPYFSYPNLFTSQKPN G DHSS GGS
Subjt:  -----QDSFQEHYKVSRNSDVSDHHQLEQQKSGFDSSPGDSPSE-RIVHPHQPQRLPSMDDDPYFSYPNLFTSQKPNLGSDHSSGGGS

A0A1S4E4M8 uncharacterized protein LOC103502259; e value 4.8e-245; % identity 80.78
Query:  MSMLNSFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL
        MSMLNSFF+KGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL
Subjt:  MSMLNSFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL

Query:  DLKEAISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL
        DLKEAISSVCFAAPRCADLTEL+QVQ+ F AKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDW+PA TEAEFNKSPEDLL
Subjt:  DLKEAISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL

Query:  NGPAQFVSASKLPLPKEKHDETYNATPDLASAPQPDSDSELDMLDFPEVPKMSVSPHPPTVEAGPPSSLIPPPSVPLPPPESDHDSFKYSGFPEI-PQDL
        NG   FV  SKLPLP+EKH+ET N T DLAS PQPDSDSELDMLDFPEVPKMSV P  PT++AG   S+I PPSV  P PE+D  SFKYSG PE  PQ+L
Subjt:  NGPAQFVSASKLPLPKEKHDETYNATPDLASAPQPDSDSELDMLDFPEVPKMSVSPHPPTVEAGPPSSLIPPPSVPLPPPESDHDSFKYSGFPEI-PQDL

Query:  HLRHDEATLVRSVSPSNHEMNISVGEDKQFLPFITPPSLSSSFSHRQTDLSPPSETRTPEEKFGFEPKLEPEINSPPPSVSRSP-----FKPQFEQEINS
        H RH E TLVRSVSPSN +MN+SVGEDKQFLPFITPPSLS SFSHRQTDLSPPS+  TPEEKF  +P++  EINSP PSVSR+      FKP+F+ EINS
Subjt:  HLRHDEATLVRSVSPSNHEMNISVGEDKQFLPFITPPSLSSSFSHRQTDLSPPSETRTPEEKFGFEPKLEPEINSPPPSVSRSP-----FKPQFEQEINS

Query:  PPPSASRTKSEVNVDISVDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKNDHDPEINCENPFHGATASPPDHKETFQLNQQDSF------
         P S SRT  EVN D+SVDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKND DPEI+CENPFHGATASP  H ET++LNQQDS       
Subjt:  PPPSASRTKSEVNVDISVDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKNDHDPEINCENPFHGATASPPDHKETFQLNQQDSF------

Query:  ---------QEHYKVSRNSDVSDHHQLEQQKSGFDSSPGDSPSE-RIVHPHQPQRLPSMDDDPYFSYPNLFTSQKPNLGSDHSSGGGS
                 QEH+K+SR+ +VSDH QLE +KS FDSSPG+SPSE + V PHQPQRLPSMDDDPYFSYPNLFTSQKPN G DHSS GGS
Subjt:  ---------QEHYKVSRNSDVSDHHQLEQQKSGFDSSPGDSPSE-RIVHPHQPQRLPSMDDDPYFSYPNLFTSQKPNLGSDHSSGGGS

A0A6J1EEM3 uncharacterized protein LOC111433438; e value 8.7e-255; % identity 80.49
Query:  MSMLNSFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL
        MSMLNSFF++GFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL
Subjt:  MSMLNSFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL

Query:  DLKEAISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL
        DLKEAISSVCFAAPRCADLTELLQVQ+ FAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL
Subjt:  DLKEAISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL

Query:  NGPAQFVSASKLPLPKEKHDETYNATPDLASAPQPDSDSELDMLDFPEVPKMSVSPHPPTV-EAGPPSSLIPPPSVPLPPPESDHDSFKYSG-FPEI-PQ
        NG AQFVSASKLPLPK+KHDETY+ATPDL SAPQPDSDSELD LDFPEVPK+SVS HPPT  +A     LIPPPSV  PPPE+D DSFK SG  PEI PQ
Subjt:  NGPAQFVSASKLPLPKEKHDETYNATPDLASAPQPDSDSELDMLDFPEVPKMSVSPHPPTV-EAGPPSSLIPPPSVPLPPPESDHDSFKYSG-FPEI-PQ

Query:  DLHLRHDEATLVRSVSPSNHEMNISVGEDKQFLPFITPPSLSSSFSHRQTDLSPPSETRTPEEKFGFEPKLEPEINSPPPSVSR----------------
        DLHL+ +E   VRSVSPSN  +N+SVGEDKQFLPFITPPSLSSSFSHRQTD+ P S+++TPEEKFGF+P+ E EINSPPPSVSR                
Subjt:  DLHLRHDEATLVRSVSPSNHEMNISVGEDKQFLPFITPPSLSSSFSHRQTDLSPPSETRTPEEKFGFEPKLEPEINSPPPSVSR----------------

Query:  ----------SP-----FKPQFEQEINSPPPSASRTKSEVNVDISVDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKNDHDPEINCENPF
                  SP     FKPQFEQ ++SPPPS SRTKSEVNVD+SV+LQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKND D E++CENPF
Subjt:  ----------SP-----FKPQFEQEINSPPPSASRTKSEVNVDISVDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKNDHDPEINCENPF

Query:  HGATASPPDHKETFQLN----------QQDSFQEHYKVSRNSDVSDHHQLEQQKSGFDSSPGDSPSERIVHPHQPQRLPSMDDDPYFSYPNLFTSQKPNL
        HGA  +PPDHK+TFQ +           QDSF+EH++VS + DV+ HHQLE QKSGFDSSP +SPSERI HPHQ QRLPSMDDDPYFSYPNLFTSQKPNL
Subjt:  HGATASPPDHKETFQLN----------QQDSFQEHYKVSRNSDVSDHHQLEQQKSGFDSSPGDSPSERIVHPHQPQRLPSMDDDPYFSYPNLFTSQKPNL

Query:  GSDHSSGGGS
        GSD SSGGGS
Subjt:  GSDHSSGGGS

A0A6J1EU69 uncharacterized protein LOC111436655 isoform X1; e value 3.2e-233; % identity 78.33
Query:  MSMLNSFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL
        MSML SFF KGFK AQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL
Subjt:  MSMLNSFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL

Query:  DLKEAISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL
        DLKEAISSVCFAAPRCADLTELLQVQ+ FA KYGKEFVSAATELMPNCGVNRQL+ELLSVRAPSPEKKLKLLKEIAEEHD+DWDPAGTEAEFNKSPEDLL
Subjt:  DLKEAISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL

Query:  NGPAQFVSASKLPLPKEKHDETYNATPDLASAPQPDSDSELDMLDFPEVPKMSVSPHPPTVEAGPPSSLIPPPSVPLPPPESDHDSFKYSGFPEI-PQDL
        NG AQFVSASKLPLP+EKH+E YN       APQPDSDSELDMLDFPEVPK+SVSPH  T++ G  SS+IPPPS   PPPE+ HDS KYS  PE  P+DL
Subjt:  NGPAQFVSASKLPLPKEKHDETYNATPDLASAPQPDSDSELDMLDFPEVPKMSVSPHPPTVEAGPPSSLIPPPSVPLPPPESDHDSFKYSGFPEI-PQDL

Query:  HLRHDEATLVRSVSPSNHEMNISVGEDKQFLPFITPPSLSSSFSHRQTDLSPPSETRTPEEKFGFEPKLEPEINSPPPSVSRSP---FKPQFEQEINSPP
         LR  E T VRSVSPS  +MNISVGEDKQFLPFI PPS                  RTPEEKF F+P+ EPEINSPPPSVSR+P   F  ++EQEI SPP
Subjt:  HLRHDEATLVRSVSPSNHEMNISVGEDKQFLPFITPPSLSSSFSHRQTDLSPPSETRTPEEKFGFEPKLEPEINSPPPSVSRSP---FKPQFEQEINSPP

Query:  PSASRTKSEVNVDISVDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKNDHDPEINCENPFHGATASPPDHKETFQLNQ------------
        PSASRTKS    D+S+DLQDVLAAAQAA ETAERAAAAARSAASL KVRID LT K ND  PEI CENPFHGAT SPPDH++TF LNQ            
Subjt:  PSASRTKSEVNVDISVDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKNDHDPEINCENPFHGATASPPDHKETFQLNQ------------

Query:  ----QDSFQEHYKVSRNSDVSDHHQLEQQKSGFDSSPGDSPSERIVHPHQPQRLPSMDDDPYFSYPNLFTSQKPNLGSDHSSGGGS
            QDSFQEH+K S  SDVSDHH+ E QKSGFDSSPGDSPSE I  PHQPQRLPSMDDDPYFSYPNLFTSQKPN G DHSS  GS
Subjt:  ----QDSFQEHYKVSRNSDVSDHHQLEQQKSGFDSSPGDSPSERIVHPHQPQRLPSMDDDPYFSYPNLFTSQKPNLGSDHSSGGGS

A0A6J1J0W1 uncharacterized protein LOC111480885; e value 1.1e-257; % identity 81.64
Query:  MSMLNSFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL
        MSMLNSFF+KGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL
Subjt:  MSMLNSFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL

Query:  DLKEAISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL
        DLKEAISSVCFAAPRCADLTELLQVQ+ FAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL
Subjt:  DLKEAISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL

Query:  NGPAQFVSASKLPLPKEKHDETYNATPDLASAPQPDSDSELDMLDFPEVPKMSVSPHPPTV-EAGPPSSLIPPPSVPLPPPESDHDSFKYSG-FPEI-PQ
        NG AQFVSASKLPLPK+KHDETY+ATPDL SAPQPDSDSELD LDFPEVPK+SVS HPPT  +A     LIPPPSV  PPPE+D DSFK SG  PEI PQ
Subjt:  NGPAQFVSASKLPLPKEKHDETYNATPDLASAPQPDSDSELDMLDFPEVPKMSVSPHPPTV-EAGPPSSLIPPPSVPLPPPESDHDSFKYSG-FPEI-PQ

Query:  DLHLRHDEATLVRSVSPSNHEMNISVGEDKQFLPFITPPSLSSSFSHRQTDLSPPSETRTPEEKFGFEPKLEPEINSPPPSVSR----------------
        DLHLRH+E   VRSVSPSN  +NISVGEDKQFLPFITPPSLSSSFSHRQTDL P S+++TPEEKFGF+P+ E EINSPP SVSR                
Subjt:  DLHLRHDEATLVRSVSPSNHEMNISVGEDKQFLPFITPPSLSSSFSHRQTDLSPPSETRTPEEKFGFEPKLEPEINSPPPSVSR----------------

Query:  ----------SP-----FKPQFEQEINSPPPSASRTKSEVNVDISVDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKNDHDPEINCENPF
                  SP     FKPQFEQ ++SPPPS SRTKSEVNVD+SVDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKND D E++CENPF
Subjt:  ----------SP-----FKPQFEQEINSPPPSASRTKSEVNVDISVDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKNDHDPEINCENPF

Query:  HGATASPPDHKETFQLN----------QQDSFQEHYKVSRNSDVSDHHQLEQQKSGFDSSPGDSPSERIVHPHQPQRLPSMDDDPYFSYPNLFTSQKPNL
        HGA  +PPDHK+TFQ +           QDSF+EH++VS +SDV+ HHQLE QKSGFDSSP +SPSERI HPHQPQRLPSMDDDPYFSYPNLFTSQKPNL
Subjt:  HGATASPPDHKETFQLN----------QQDSFQEHYKVSRNSDVSDHHQLEQQKSGFDSSPGDSPSERIVHPHQPQRLPSMDDDPYFSYPNLFTSQKPNL

Query:  GSDHSSGGGS
        GSD SSGGGS
Subjt:  GSDHSSGGGS

SwissProt top hits (e value, % identity, alignment)
P53990 IST1 homolog; e value 7.8e-19; % identity 30.45
Query:  GFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPLDLKEAISSVC
        GFKA + +  L+L I R+KLL  ++    ++ R++IA  L  G++  ARIRVEHIIRE+ ++ A EILEL+C+L++ R  +I++ +E    L E++S++ 
Subjt:  GFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPLDLKEAISSVC

Query:  FAAPRC-ADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDP-AGTEAEFNKSPE-DLLN------
        +AAPR  +++ EL  V     AKY KE+            VN +L+  LSV AP      + L EIA+ +++ ++P +   AE     E DL++      
Subjt:  FAAPRC-ADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDP-AGTEAEFNKSPE-DLLN------

Query:  ----GPAQFVSA----------SKLPLPKEKHDETYNATPDLASAPQPDSD-SELDMLDFPEVPKMSVSPHPPTVEAGPPS-------------------
            GP +  S             +P+P      + N TP     P+  SD + L M  +   P +    HPP + A PPS                   
Subjt:  ----GPAQFVSA----------SKLPLPKEKHDETYNATPDLASAPQPDSD-SELDMLDFPEVPKMSVSPHPPTVEAGPPS-------------------

Query:  --SLIPPPSVPLPP-PESDHDSFKYSGFPEIPQDL
             P  S  LP  P  ++D+F     P +P  L
Subjt:  --SLIPPPSVPLPP-PESDHDSFKYSGFPEIPQDL

Q3ZBV1 IST1 homolog; e value 1.5e-17; % identity 29.55
Query:  GFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPLDLKEAISSVC
        G KA + +  L+L I R+KLL  ++    ++ R++IA  L  G++  ARIRVEHIIRE+ ++ A EILEL+C+L++ R  +I++ +E    L E++S++ 
Subjt:  GFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPLDLKEAISSVC

Query:  FAAPRC-ADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDP-AGTEAEFNKSPE-DLLN------
        +AAPR  +++ EL  V     AKY KE+            VN +L+  LSV AP      + L EIA+ +++ ++P +   AE     E DL++      
Subjt:  FAAPRC-ADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDP-AGTEAEFNKSPE-DLLN------

Query:  ----GPAQ-----FVSA-----SKLPLPKEKHDETYNATPDLASAPQPDSD-SELDMLDFPEVPKMSVSPHPPTVEAGPPS----------------SLI
            GP +     F +        +P+P      + N TP     P+  SD + L +  +   P +    HPP + A PPS                 ++
Subjt:  ----GPAQ-----FVSA-----SKLPLPKEKHDETYNATPDLASAPQPDSD-SELDMLDFPEVPKMSVSPHPPTVEAGPPS----------------SLI

Query:  PPPSVPLPPPE------SDHDSFKYSGFPEIPQDL
         P   P PP +        +D+F     P +P  L
Subjt:  PPPSVPLPPPE------SDHDSFKYSGFPEIPQDL

Q54I39 IST1-like protein; e value 1.0e-18; % identity 29.73
Query:  FSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPLDLKEAIS
        F   + + + K  LKL + RI++L+N++   ++  +R++A+LL    E +ARIRVE IIR+E ++   +I+E+ CEL+  R+ +I    E PL++KE+I 
Subjt:  FSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPLDLKEAIS

Query:  SVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNC----GVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLLNGP
        ++ +++ R   + EL Q++    AKYGK   + A     NC     VN +++  LS   P P    + L EIAE+ ++DW      +++   P+ ++  P
Subjt:  SVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNC----GVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLLNGP

Query:  AQFVSASKL--PLPKEKHDETYNATPDLASAPQPDSDSELDMLDFPEVPKMSVSPHPPT
               ++  P P+  H   +   P +   P P    +      P  P MS  P  PT
Subjt:  AQFVSASKL--PLPKEKHDETYNATPDLASAPQPDSDSELDMLDFPEVPKMSVSPHPPT

Q568Z6 IST1 homolog; e value 9.2e-20; % identity 29.91
Query:  GFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPLDLKEAISSVC
        GFKA + +  L+L I R+KLL  ++    ++ R++IA  L  G++  ARIRVEHIIRE+ ++ A EILEL+C+L++ R  +I++ +E    L E++S++ 
Subjt:  GFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPLDLKEAISSVC

Query:  FAAPRC-ADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDP-AGTEAEFNKSPE-DLLN------
        +AAPR  +++ EL  V     AKY KE+            VN +L+  LSV AP      + L EIA+ +++ ++P +   AE     E DL++      
Subjt:  FAAPRC-ADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDP-AGTEAEFNKSPE-DLLN------

Query:  ------------------GPAQFVSASKLPLPKEKHDETYNATPDLASAPQPDSD-SELDMLDFPEVPKMSVSPHPPTVEAGPPS---------------
                          G         +P+P      + NA P     P+  SD S L +  +   P +    HPP + A PPS               
Subjt:  ------------------GPAQFVSASKLPLPKEKHDETYNATPDLASAPQPDSD-SELDMLDFPEVPKMSVSPHPPTVEAGPPS---------------

Query:  --SLIPPPSVPLPPPESDHDSFKYSGFPEIP
           + P P  P  PP    D++     PE+P
Subjt:  --SLIPPPSVPLPPPESDHDSFKYSGFPEIP

Q9CX00 IST1 homolog; e value 1.2e-19; % identity 29.97
Query:  GFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPLDLKEAISSVC
        GFKA + +  L+L I R+KLL  ++    ++ R++IA  L  G++  ARIRVEHIIRE+ ++ A EILEL+C+L++ R  +I++ +E    L E++S++ 
Subjt:  GFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPLDLKEAISSVC

Query:  FAAPRC-ADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWD---------PAGTEAEF---NKSPE
        +AAPR  +++ EL  V     AKY KE+            VN +L+  LSV AP      + L EIA+ +++ ++         P G E +      + +
Subjt:  FAAPRC-ADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWD---------PAGTEAEF---NKSPE

Query:  DLLNGPAQ-----FVSA-----SKLPLPKEKHDETYNATPDLASAPQPDSD-SELDMLDFPEVPKMSVSPHPPTVEAGPPS-----------------SL
            GP +     F +        +P+P      + NA P     P+  SD S L +  +   P +    HPP + A PPS                  +
Subjt:  DLLNGPAQ-----FVSA-----SKLPLPKEKHDETYNATPDLASAPQPDSD-SELDMLDFPEVPKMSVSPHPPTVEAGPPS-----------------SL

Query:  IPPPSVPLPPPESDHDSFKYSGFPEIP
         P P  P  PP    D++     PE+P
Subjt:  IPPPSVPLPPPESDHDSFKYSGFPEIP

Arabidopsis top hits (e value, % identity, alignment)
AT1G25420.1 Regulator of Vps4 activity in the MVB pathway protein; e value 1.7e-61; % identity 57.07
Query:  MSMLNSFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL
        MS+LN  F++G   A+CKT L L I R+KLL+N+R++QLK M+++IA  L+ GQE  ARIRVEH+IRE N+ AA EILELFCE I+ R+PI+E+++ECP 
Subjt:  MSMLNSFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL

Query:  DLKEAISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL
        +L+EAI+S+ FAAPRC+++ +LLQ++  F  KYGKEF+  A+EL P+ GVNR +IE LS  +PS   +LK+LKEIA+E+ L+WD + TEAEF KS EDLL
Subjt:  DLKEAISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL

Query:  NGPAQ
         G  Q
Subjt:  NGPAQ

AT1G34220.1 Regulator of Vps4 activity in the MVB pathway protein; e value 5.0e-114; % identity 43.95
Query:  MSMLNSFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL
        MSML+SFF+KGFKAA+CKTLLKLTIPRIKL+RNRRE Q+KQMRR+IAKLLETGQEATARIRVEHIIREE MMAAQEILELFCELI VRLPIIE QRECPL
Subjt:  MSMLNSFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL

Query:  DLKEAISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNR------------------------------QLIELLSVRAPSPEKKLK
        DLKEAISSVCFAAPRC+DLTEL QVQI F +KYGKEFV+AA+EL P+ GVNR                              QL+ELLSVRAPSPE KLK
Subjt:  DLKEAISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNR------------------------------QLIELLSVRAPSPEKKLK

Query:  LLKEIAEEHDLDWDPAGTEAEFNKSPEDLLNGPAQFVSASKLPLPKEKHDETYNATPDLASAPQPDSDSELDMLDFPEVPKMSVSPHPPTVEAGPPSSLI
        LLKEIAEEH+LDWDPA TE +  KS EDLL+GP QF   SKLPLP+E++++T N T   A+  + DSDSE D+LDFPEVP + + P P       P    
Subjt:  LLKEIAEEHDLDWDPAGTEAEFNKSPEDLLNGPAQFVSASKLPLPKEKHDETYNATPDLASAPQPDSDSELDMLDFPEVPKMSVSPHPPTVEAGPPSSLI

Query:  PPPSVPLPPPESDHDSFKYSGFPEIPQDLHLRHDEATLVRSVSPSNH---EMNISVGEDKQFLPFITPPSLSSSFSHRQTDLSPPSETRTPEEKFGFEPK
                      D+ K + +     DL    + A + ++ S  +    + + +V E +Q     + P L  SF  +                      
Subjt:  PPPSVPLPPPESDHDSFKYSGFPEIPQDLHLRHDEATLVRSVSPSNH---EMNISVGEDKQFLPFITPPSLSSSFSHRQTDLSPPSETRTPEEKFGFEPK

Query:  LEPEINSPPPSVSRSPFKPQFEQEINSPPPSASRTKSEVNVDIS-VDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKNDHDPEINCENPF
             N  PPS+            +   P   S    +    IS  DLQDVL AAQAAA++AERAA+AARSAASLA++RI+ELT+K +D  PE   ENPF
Subjt:  LEPEINSPPPSVSRSPFKPQFEQEINSPPPSASRTKSEVNVDIS-VDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKNDHDPEINCENPF

Query:  HGAT-------------------------------------ASPPDHK----------------------------ETFQLNQQDSFQEHY---------
        H  +                                       P  H                             E  Q + Q+S   +Y         
Subjt:  HGAT-------------------------------------ASPPDHK----------------------------ETFQLNQQDSFQEHY---------

Query:  ------------KVSRNSDVSDHHQLEQQKSGFD------SSPGD---------SPSERIV--HPHQPQRLPSMDDDPYFSYPNLFTSQKPNLGSDHSSG
                      S   D++ H     +K  FD      SS GD         SP +R+     HQ  RLPSM+DDPY+SYPNLFTSQK     D SSG
Subjt:  ------------KVSRNSDVSDHHQLEQQKSGFD------SSPGD---------SPSERIV--HPHQPQRLPSMDDDPYFSYPNLFTSQKPNLGSDHSSG

Query:  GGS
          S
Subjt:  GGS

AT1G34220.2 Regulator of Vps4 activity in the MVB pathway protein; e value 2.6e-118; % identity 45.77
Query:  MSMLNSFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL
        MSML+SFF+KGFKAA+CKTLLKLTIPRIKL+RNRRE Q+KQMRR+IAKLLETGQEATARIRVEHIIREE MMAAQEILELFCELI VRLPIIE QRECPL
Subjt:  MSMLNSFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPL

Query:  DLKEAISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL
        DLKEAISSVCFAAPRC+DLTEL QVQI F +KYGKEFV+AA+EL P+ GVNR+L+ELLSVRAPSPE KLKLLKEIAEEH+LDWDPA TE +  KS EDLL
Subjt:  DLKEAISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLL

Query:  NGPAQFVSASKLPLPKEKHDETYNATPDLASAPQPDSDSELDMLDFPEVPKMSVSPHPPTVEAGPPSSLIPPPSVPLPPPESDHDSFKYSGFPEIPQDLH
        +GP QF   SKLPLP+E++++T N T   A+  + DSDSE D+LDFPEVP + + P P       P                  D+ K + +     DL 
Subjt:  NGPAQFVSASKLPLPKEKHDETYNATPDLASAPQPDSDSELDMLDFPEVPKMSVSPHPPTVEAGPPSSLIPPPSVPLPPPESDHDSFKYSGFPEIPQDLH

Query:  LRHDEATLVRSVSPSNH---EMNISVGEDKQFLPFITPPSLSSSFSHRQTDLSPPSETRTPEEKFGFEPKLEPEINSPPPSVSRSPFKPQFEQEINSPPP
           + A + ++ S  +    + + +V E +Q     + P L  SF  +                           N  PPS+            +   P 
Subjt:  LRHDEATLVRSVSPSNH---EMNISVGEDKQFLPFITPPSLSSSFSHRQTDLSPPSETRTPEEKFGFEPKLEPEINSPPPSVSRSPFKPQFEQEINSPPP

Query:  SASRTKSEVNVDIS-VDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKNDHDPEINCENPFHGAT--------------------------
          S    +    IS  DLQDVL AAQAAA++AERAA+AARSAASLA++RI+ELT+K +D  PE   ENPFH  +                          
Subjt:  SASRTKSEVNVDIS-VDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKNDHDPEINCENPFHGAT--------------------------

Query:  -----------ASPPDHK----------------------------ETFQLNQQDSFQEHY---------------------KVSRNSDVSDHHQLEQQK
                     P  H                             E  Q + Q+S   +Y                       S   D++ H     +K
Subjt:  -----------ASPPDHK----------------------------ETFQLNQQDSFQEHY---------------------KVSRNSDVSDHHQLEQQK

Query:  SGFD------SSPGD---------SPSERIV--HPHQPQRLPSMDDDPYFSYPNLFTSQKPNLGSDHSSGGGS
          FD      SS GD         SP +R+     HQ  RLPSM+DDPY+SYPNLFTSQK     D SSG  S
Subjt:  SGFD------SSPGD---------SPSERIV--HPHQPQRLPSMDDDPYFSYPNLFTSQKPNLGSDHSSGGGS

AT2G19710.1 Regulator of Vps4 activity in the MVB pathway protein    e value: 2.1e-51    %identity: 48.33
Query:  LNSFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPLDLK
        +     +GFK A+CKT L++   R+K+L+N++EIQ+KQ+RR++A+LLE+GQ  TARIRVEH++REE  +AA E++ ++CEL+VVRL +IE+Q+ CP+DLK
Subjt:  LNSFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPLDLK

Query:  EAISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLLNGP
        EA++SV FA+ R +D+ EL ++   F  KYGK+F ++A EL P+ GV+R L+E LS +AP    K+K+L  IAEEH++ W+ A +  E +    +LLNG 
Subjt:  EAISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLLNGP

Query:  AQFVSASKL
          F  AS +
Subjt:  AQFVSASKL

AT4G35730.1 Regulator of Vps4 activity in the MVB pathway protein    e value: 1.1e-65    %identity: 57.47
Query:  SFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPLDLKEA
        S F +GF +++CKT  K+ + RIKL+RN+R + +KQMRRDIA LL++GQ+ATARIRVEH+IRE+N+ AA EI+ELFCELIV RL II  Q++CP+DLKE 
Subjt:  SFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRDIAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPLDLKEA

Query:  ISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLLNGPAQ
        I+S+ FAAPRC+++ EL  ++  FA KYGK+FVSAAT+L P+CGVNR LI+ LSVR P  E KLK++KEIA+E  +DWD   TE E  K  E+ ++GP +
Subjt:  ISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLIELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLLNGPAQ

Query:  FVSASKLPLPKEKHDETYNAT
        FVSAS LP+ +   +E  + T
Subjt:  FVSASKLPLPKEKHDETYNAT


Sequences
CDS sequence
ATGTTTGGAAACGAACGACGCGTTTGGCGTCGCAAGCAAACGCATACGGCGTCGCATCGTCTCGCCATTCCTTTTCGAATTCTCAACTCGATCACAGCTTTCACACTTGT
CGATTCTTCCTTTCGGCTTCTTATCATCCTGCAATTCGGTGGTCAAAAGGGTTTTTGTTTTGATCTTCATTCGATCTTCATCTCCATGTCGATGCTCAACTCCTTCTTCA
GCAAGGGCTTCAAAGCTGCCCAATGTAAAACCCTGCTTAAATTGACAATTCCACGAATAAAGCTGCTGAGGAATCGGAGGGAGATTCAGCTTAAGCAGATGCGACGAGAC
ATAGCTAAGCTTCTTGAAACAGGCCAAGAAGCTACGGCTCGCATTCGGGTAGAGCATATAATCAGAGAAGAGAATATGATGGCTGCTCAGGAAATTCTTGAGTTGTTTTG
CGAGCTTATTGTCGTTCGTCTTCCAATTATTGAAACACAAAGGGAATGTCCTCTAGACTTGAAAGAAGCTATCTCAAGTGTGTGTTTTGCTGCTCCAAGGTGCGCTGATC
TTACAGAGCTTCTCCAGGTTCAGATTCATTTTGCTGCTAAATATGGGAAGGAATTTGTATCAGCTGCAACTGAGCTCATGCCTAACTGTGGTGTTAATCGTCAGTTGATA
GAGCTGCTTTCTGTTCGTGCTCCTTCACCTGAAAAGAAACTGAAGCTCTTAAAGGAAATTGCTGAAGAGCACGACTTAGATTGGGATCCTGCTGGAACGGAAGCTGAGTT
TAACAAATCTCCAGAAGATTTGCTTAATGGACCGGCGCAGTTTGTCAGTGCATCAAAATTACCCCTTCCTAAAGAGAAACATGACGAAACGTATAATGCTACTCCTGATT
TGGCTAGTGCACCACAACCTGATTCTGATTCAGAATTGGACATGTTAGACTTTCCTGAAGTTCCAAAGATGTCAGTATCTCCACATCCTCCTACTGTAGAGGCTGGACCT
CCATCATCACTGATCCCACCCCCATCTGTGCCACTACCGCCGCCCGAATCTGATCACGATTCATTCAAATATTCTGGTTTTCCTGAAATCCCACAAGATTTACACCTAAG
ACATGATGAGGCGACATTGGTGAGATCTGTTTCTCCCAGCAACCATGAAATGAATATCTCTGTTGGTGAAGATAAACAGTTTTTGCCTTTTATTACTCCGCCATCATTGT
CATCTTCGTTTTCTCATAGACAAACTGATTTGTCGCCACCCTCTGAAACAAGGACTCCCGAGGAGAAGTTTGGTTTTGAACCAAAGCTCGAACCCGAGATAAATTCACCA
CCACCCTCTGTTTCAAGGTCCCCTTTCAAACCACAGTTCGAACAGGAGATAAATTCACCACCACCCTCTGCTTCAAGGACAAAGAGTGAGGTCAATGTGGATATCAGTGT
GGATTTGCAGGATGTATTGGCCGCTGCTCAGGCTGCTGCTGAAACTGCTGAACGTGCTGCTGCTGCAGCTCGCTCTGCAGCTAGTCTTGCAAAGGTTAGAATTGACGAGC
TCACAAAGAAAAAGAACGATCACGACCCTGAAATTAATTGCGAGAATCCATTCCATGGAGCTACTGCAAGTCCACCAGATCATAAAGAAACTTTCCAACTAAACCAACAA
GATTCTTTCCAAGAACATTACAAGGTAAGCCGGAATTCAGATGTTTCTGATCACCACCAGCTCGAGCAACAGAAATCGGGTTTCGATTCCTCGCCCGGTGATTCACCATC
TGAACGTATAGTACATCCTCATCAACCCCAGAGGCTTCCGTCAATGGATGACGACCCGTACTTTTCATACCCGAATTTATTTACATCACAGAAACCAAATCTGGGATCTG
ATCATTCATCTGGTGGTGGTTCCTGA
mRNA sequence
ATGTTTGGAAACGAACGACGCGTTTGGCGTCGCAAGCAAACGCATACGGCGTCGCATCGTCTCGCCATTCCTTTTCGAATTCTCAACTCGATCACAGCTTTCACACTTGT
CGATTCTTCCTTTCGGCTTCTTATCATCCTGCAATTCGGTGGTCAAAAGGGTTTTTGTTTTGATCTTCATTCGATCTTCATCTCCATGTCGATGCTCAACTCCTTCTTCA
GCAAGGGCTTCAAAGCTGCCCAATGTAAAACCCTGCTTAAATTGACAATTCCACGAATAAAGCTGCTGAGGAATCGGAGGGAGATTCAGCTTAAGCAGATGCGACGAGAC
ATAGCTAAGCTTCTTGAAACAGGCCAAGAAGCTACGGCTCGCATTCGGGTAGAGCATATAATCAGAGAAGAGAATATGATGGCTGCTCAGGAAATTCTTGAGTTGTTTTG
CGAGCTTATTGTCGTTCGTCTTCCAATTATTGAAACACAAAGGGAATGTCCTCTAGACTTGAAAGAAGCTATCTCAAGTGTGTGTTTTGCTGCTCCAAGGTGCGCTGATC
TTACAGAGCTTCTCCAGGTTCAGATTCATTTTGCTGCTAAATATGGGAAGGAATTTGTATCAGCTGCAACTGAGCTCATGCCTAACTGTGGTGTTAATCGTCAGTTGATA
GAGCTGCTTTCTGTTCGTGCTCCTTCACCTGAAAAGAAACTGAAGCTCTTAAAGGAAATTGCTGAAGAGCACGACTTAGATTGGGATCCTGCTGGAACGGAAGCTGAGTT
TAACAAATCTCCAGAAGATTTGCTTAATGGACCGGCGCAGTTTGTCAGTGCATCAAAATTACCCCTTCCTAAAGAGAAACATGACGAAACGTATAATGCTACTCCTGATT
TGGCTAGTGCACCACAACCTGATTCTGATTCAGAATTGGACATGTTAGACTTTCCTGAAGTTCCAAAGATGTCAGTATCTCCACATCCTCCTACTGTAGAGGCTGGACCT
CCATCATCACTGATCCCACCCCCATCTGTGCCACTACCGCCGCCCGAATCTGATCACGATTCATTCAAATATTCTGGTTTTCCTGAAATCCCACAAGATTTACACCTAAG
ACATGATGAGGCGACATTGGTGAGATCTGTTTCTCCCAGCAACCATGAAATGAATATCTCTGTTGGTGAAGATAAACAGTTTTTGCCTTTTATTACTCCGCCATCATTGT
CATCTTCGTTTTCTCATAGACAAACTGATTTGTCGCCACCCTCTGAAACAAGGACTCCCGAGGAGAAGTTTGGTTTTGAACCAAAGCTCGAACCCGAGATAAATTCACCA
CCACCCTCTGTTTCAAGGTCCCCTTTCAAACCACAGTTCGAACAGGAGATAAATTCACCACCACCCTCTGCTTCAAGGACAAAGAGTGAGGTCAATGTGGATATCAGTGT
GGATTTGCAGGATGTATTGGCCGCTGCTCAGGCTGCTGCTGAAACTGCTGAACGTGCTGCTGCTGCAGCTCGCTCTGCAGCTAGTCTTGCAAAGGTTAGAATTGACGAGC
TCACAAAGAAAAAGAACGATCACGACCCTGAAATTAATTGCGAGAATCCATTCCATGGAGCTACTGCAAGTCCACCAGATCATAAAGAAACTTTCCAACTAAACCAACAA
GATTCTTTCCAAGAACATTACAAGGTAAGCCGGAATTCAGATGTTTCTGATCACCACCAGCTCGAGCAACAGAAATCGGGTTTCGATTCCTCGCCCGGTGATTCACCATC
TGAACGTATAGTACATCCTCATCAACCCCAGAGGCTTCCGTCAATGGATGACGACCCGTACTTTTCATACCCGAATTTATTTACATCACAGAAACCAAATCTGGGATCTG
ATCATTCATCTGGTGGTGGTTCCTGA
Protein sequence
MFGNERRVWRRKQTHTASHRLAIPFRILNSITAFTLVDSSFRLLIILQFGGQKGFCFDLHSIFISMSMLNSFFSKGFKAAQCKTLLKLTIPRIKLLRNRREIQLKQMRRD
IAKLLETGQEATARIRVEHIIREENMMAAQEILELFCELIVVRLPIIETQRECPLDLKEAISSVCFAAPRCADLTELLQVQIHFAAKYGKEFVSAATELMPNCGVNRQLI
ELLSVRAPSPEKKLKLLKEIAEEHDLDWDPAGTEAEFNKSPEDLLNGPAQFVSASKLPLPKEKHDETYNATPDLASAPQPDSDSELDMLDFPEVPKMSVSPHPPTVEAGP
PSSLIPPPSVPLPPPESDHDSFKYSGFPEIPQDLHLRHDEATLVRSVSPSNHEMNISVGEDKQFLPFITPPSLSSSFSHRQTDLSPPSETRTPEEKFGFEPKLEPEINSP
PPSVSRSPFKPQFEQEINSPPPSASRTKSEVNVDISVDLQDVLAAAQAAAETAERAAAAARSAASLAKVRIDELTKKKNDHDPEINCENPFHGATASPPDHKETFQLNQQ
DSFQEHYKVSRNSDVSDHHQLEQQKSGFDSSPGDSPSERIVHPHQPQRLPSMDDDPYFSYPNLFTSQKPNLGSDHSSGGGS