; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh01G001060 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh01G001060
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionVHS domain-containing protein
Genome locationCmo_Chr01:470209..473757
RNA-Seq ExpressionCmoCh01G001060
SyntenyCmoCh01G001060
Gene Ontology termsGO:0030136 - clathrin-coated vesicle (cellular component)
GO:0032588 - trans-Golgi network membrane (cellular component)
GO:0035091 - phosphatidylinositol binding (molecular function)
GO:0043130 - ubiquitin binding (molecular function)
InterPro domainsIPR002014 - VHS domain
IPR008942 - ENTH/VHS
IPR013809 - ENTH domain
IPR016024 - Armadillo-type fold
IPR035802 - Tepsin, ENTH/VHS domain
IPR039273 - AP-4 complex accessory subunit Tepsin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606757.1 Protein MODIFIED TRANSPORT TO THE VACUOLE 1, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0097.62Show/hide
Query:  MIDAATSDEDKVTPVYKLEEISEVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN
        MIDAATSDEDKVTPVYKLEEI EVLRSSHVSIVKEFSEFILKRLEHKSPIV+QKALRLIKYG+GKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN
Subjt:  MIDAATSDEDKVTPVYKLEEISEVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN

Query:  KAVRDTAQDAISAIFAEEDNRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRKNGTSGHRGPNLQRSLTTEMEYDN
        KAVRDTAQDAISAIFAEEDNRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRKNGTSGHRGPNLQRSLTTEMEYDN
Subjt:  KAVRDTAQDAISAIFAEEDNRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRKNGTSGHRGPNLQRSLTTEMEYDN

Query:  RYEPVEYGRETLGTSKSMISGPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLKSPSW
        RYEPVEYGRETLGTSKS I+GPWN DSWANKVEATNGNLSSGSSER+TREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLKSPSW
Subjt:  RYEPVEYGRETLGTSKSMISGPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLKSPSW

Query:  QVRFKALCILESIVRKNDDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDAGVLEAEN
        QVRFKALC+LESIVRKNDDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDAG LEAEN
Subjt:  QVRFKALCILESIVRKNDDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDAGVLEAEN

Query:  LSKTPLVDVLFGDGVNTVASTSELKNDDDPFSDVSFQTTDNGENPDDLFSGMNVDNSQVSNENKRPASEQKNEHGVFDVLGSSSEAAVQEHTRKDVNDLM
        LSKTPLVD LFGDGVNTVASTSELKNDDDPFSDVSFQTTDNGENPDDLFSGMNVDNSQVSNENKRPASEQKNEHGVFDV GSSSEA VQEHTRKDVNDLM
Subjt:  LSKTPLVDVLFGDGVNTVASTSELKNDDDPFSDVSFQTTDNGENPDDLFSGMNVDNSQVSNENKRPASEQKNEHGVFDVLGSSSEAAVQEHTRKDVNDLM

Query:  SGLSIHEDGLKSNDIGDSKDSLSESLYSVSGQPNHQYQTKDSSVNGIYSSPMVGTNMNAAFFPGMTYLPSGMMFNPAFSSRPMGYAPTGNFFAQQNLVSA
        SGLSIHEDGLKSNDIGDSKDSLSESLYSVS QPNHQYQTKDSSVNGIYSSPMVGTNMNAAFFPGMTYLPSGMMFNPAFSS+P GYAPTGNFFAQQNLVSA
Subjt:  SGLSIHEDGLKSNDIGDSKDSLSESLYSVSGQPNHQYQTKDSSVNGIYSSPMVGTNMNAAFFPGMTYLPSGMMFNPAFSSRPMGYAPTGNFFAQQNLVSA

Query:  MSNYQQFGNPHLQSSGGSAGNGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPKRVV
        MSNYQQFGNP LQSSGGSAGNGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPKRVV
Subjt:  MSNYQQFGNPHLQSSGGSAGNGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPKRVV

KAG7036470.1 VHS domain-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0090.19Show/hide
Query:  MIDAATSDEDKVTPVYKLEEISEVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN
        MIDAATSDEDKVT VYKLEEI EVLRSSHVSIVKEFSEFILKRLEHKSPIV+QKALRLIKYG+GKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN
Subjt:  MIDAATSDEDKVTPVYKLEEISEVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN

Query:  KAVRDTAQDAISAIFAEEDNRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRKNGTSGHRGPNLQRSLTTEMEYDN
        KAVRDTAQDAISAIFAEEDNRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRKNGTSGHRGPNLQRSLTTEMEYDN
Subjt:  KAVRDTAQDAISAIFAEEDNRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRKNGTSGHRGPNLQRSLTTEMEYDN

Query:  RYEPVEYGRETLGTSKSMISGPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLKSPSW
        RYEPVEYGRETLGTSKS I+GPWN DSWANKVEATNGNLSSGSSER+TREERLLETIATAGGVRIQPTRDAIQAFLVEAA                    
Subjt:  RYEPVEYGRETLGTSKSMISGPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLKSPSW

Query:  QVRFKALCILESIVRKNDDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDAGVLEAEN
                                     +ENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDAG LEAEN
Subjt:  QVRFKALCILESIVRKNDDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDAGVLEAEN

Query:  LSKTPLVDVLFGDGVNTVASTSELKNDDDPFSDVSFQTTDNGENPDDLFSGMNVDNSQVSNENKRPASEQKNEHGVFDVLGSSSEAAVQEHTRKDVNDLM
        LSKTPLVD LFGDGVNTVASTSELKNDDDPFSDVSFQTTDNGENPDDLFSGMNVDNSQVSNENKRPASEQKNEHGVFDV GSSSEA VQEHTRKDVNDLM
Subjt:  LSKTPLVDVLFGDGVNTVASTSELKNDDDPFSDVSFQTTDNGENPDDLFSGMNVDNSQVSNENKRPASEQKNEHGVFDVLGSSSEAAVQEHTRKDVNDLM

Query:  SGLSIHEDGLKSNDIGDSKDSLSESLYSVSGQPNHQYQTKDSSVNGIYSSPMVGTNMNAAFFPGMTYLPSGMMFNPAFSSRPMGYAPTGNFFAQQNLVSA
        SGLSIHEDGLKSNDIGDSKDSLSESLYSVS QPNHQYQTKDSSVNGIYSSPMVGTNMNAAFFPGMTYLPSGMMFNPAFSS+P GYAPTGNFFAQQNLVSA
Subjt:  SGLSIHEDGLKSNDIGDSKDSLSESLYSVSGQPNHQYQTKDSSVNGIYSSPMVGTNMNAAFFPGMTYLPSGMMFNPAFSSRPMGYAPTGNFFAQQNLVSA

Query:  MSNYQQFGNPHLQSSGGSAGNGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPKRVV
        MSNYQQFGNP LQSSGGSAGNGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPKRVV
Subjt:  MSNYQQFGNPHLQSSGGSAGNGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPKRVV

XP_022949030.1 VHS domain-containing protein At3g16270-like [Cucurbita moschata]0.0e+00100Show/hide
Query:  MIDAATSDEDKVTPVYKLEEISEVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN
        MIDAATSDEDKVTPVYKLEEISEVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN
Subjt:  MIDAATSDEDKVTPVYKLEEISEVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN

Query:  KAVRDTAQDAISAIFAEEDNRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRKNGTSGHRGPNLQRSLTTEMEYDN
        KAVRDTAQDAISAIFAEEDNRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRKNGTSGHRGPNLQRSLTTEMEYDN
Subjt:  KAVRDTAQDAISAIFAEEDNRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRKNGTSGHRGPNLQRSLTTEMEYDN

Query:  RYEPVEYGRETLGTSKSMISGPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLKSPSW
        RYEPVEYGRETLGTSKSMISGPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLKSPSW
Subjt:  RYEPVEYGRETLGTSKSMISGPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLKSPSW

Query:  QVRFKALCILESIVRKNDDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDAGVLEAEN
        QVRFKALCILESIVRKNDDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDAGVLEAEN
Subjt:  QVRFKALCILESIVRKNDDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDAGVLEAEN

Query:  LSKTPLVDVLFGDGVNTVASTSELKNDDDPFSDVSFQTTDNGENPDDLFSGMNVDNSQVSNENKRPASEQKNEHGVFDVLGSSSEAAVQEHTRKDVNDLM
        LSKTPLVDVLFGDGVNTVASTSELKNDDDPFSDVSFQTTDNGENPDDLFSGMNVDNSQVSNENKRPASEQKNEHGVFDVLGSSSEAAVQEHTRKDVNDLM
Subjt:  LSKTPLVDVLFGDGVNTVASTSELKNDDDPFSDVSFQTTDNGENPDDLFSGMNVDNSQVSNENKRPASEQKNEHGVFDVLGSSSEAAVQEHTRKDVNDLM

Query:  SGLSIHEDGLKSNDIGDSKDSLSESLYSVSGQPNHQYQTKDSSVNGIYSSPMVGTNMNAAFFPGMTYLPSGMMFNPAFSSRPMGYAPTGNFFAQQNLVSA
        SGLSIHEDGLKSNDIGDSKDSLSESLYSVSGQPNHQYQTKDSSVNGIYSSPMVGTNMNAAFFPGMTYLPSGMMFNPAFSSRPMGYAPTGNFFAQQNLVSA
Subjt:  SGLSIHEDGLKSNDIGDSKDSLSESLYSVSGQPNHQYQTKDSSVNGIYSSPMVGTNMNAAFFPGMTYLPSGMMFNPAFSSRPMGYAPTGNFFAQQNLVSA

Query:  MSNYQQFGNPHLQSSGGSAGNGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPKRVV
        MSNYQQFGNPHLQSSGGSAGNGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPKRVV
Subjt:  MSNYQQFGNPHLQSSGGSAGNGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPKRVV

XP_022997847.1 VHS domain-containing protein At3g16270-like [Cucurbita maxima]0.0e+0096.73Show/hide
Query:  MIDAATSDEDKVTPVYKLEEISEVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN
        MIDAATSDEDKVTPVYKLEEI EVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKYG+GKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN
Subjt:  MIDAATSDEDKVTPVYKLEEISEVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN

Query:  KAVRDTAQDAISAIFAEEDNRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRKNGTSGHRGPNLQRSLTTEMEYDN
        KAVRDTAQDAISAIFAEEDNRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRKNGTSGHRGPNLQRSLTTEMEYDN
Subjt:  KAVRDTAQDAISAIFAEEDNRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRKNGTSGHRGPNLQRSLTTEMEYDN

Query:  RYEPVEYGRETLGTSKSMISGPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLKSPSW
        RYEPVEYGRETLGTSKS ISGPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLKSPSW
Subjt:  RYEPVEYGRETLGTSKSMISGPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLKSPSW

Query:  QVRFKALCILESIVRKNDDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDAGVLEAEN
        QVRFKALC+LESIVRKNDDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMN SEKS PNNPSSTIQMPDLIDTSDAGVLE EN
Subjt:  QVRFKALCILESIVRKNDDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDAGVLEAEN

Query:  LSKTPLVDVLFGDGVNTVASTSELKNDDDPFSDVSFQTTDNGENPDDLFSGMNVDNSQVSNENKRPASEQKNEHGVFDVLGSSSEAAVQEHTRKDVNDLM
        LSKTPLVD LFGDGVNT+ STSELKNDDDPFSDVSFQTTD+GENPDDLFSGM VDNSQVSNE+KRPASEQKNEHGVFDV GSSSEAAVQEHTRKDVN+LM
Subjt:  LSKTPLVDVLFGDGVNTVASTSELKNDDDPFSDVSFQTTDNGENPDDLFSGMNVDNSQVSNENKRPASEQKNEHGVFDVLGSSSEAAVQEHTRKDVNDLM

Query:  SGLSIHEDGLKSNDIGDSKDSLSESLYSVSGQPNHQYQTKDSSVNGIYSSPMVGTNMNAAFFPGMTYLPSGMMFNPAFSSRPMGYAPTGNFFAQQNLVSA
        SGLSIHEDGLKSNDIGDSKDSLSESLYSVS QPNHQYQTKDSSVNGIYSSPMVGTNMNAA FPG TYLPSGMMFNPAFSS+PMGYAPTGNFFAQQNL SA
Subjt:  SGLSIHEDGLKSNDIGDSKDSLSESLYSVSGQPNHQYQTKDSSVNGIYSSPMVGTNMNAAFFPGMTYLPSGMMFNPAFSSRPMGYAPTGNFFAQQNLVSA

Query:  MSNYQQFGNPHLQSSGGSAGNGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPKRVV
        MSN QQFGNPHLQSSGGSAGNGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPK+VV
Subjt:  MSNYQQFGNPHLQSSGGSAGNGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPKRVV

XP_023524865.1 VHS domain-containing protein At3g16270-like [Cucurbita pepo subsp. pepo]0.0e+0097.18Show/hide
Query:  MIDAATSDEDKVTPVYKLEEISEVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN
        MIDAATSDEDKVTPVYKLEEI EVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRL KY +GKSGVEF REMQRHSVAVRQLLHYKGQPDPLKGDALN
Subjt:  MIDAATSDEDKVTPVYKLEEISEVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN

Query:  KAVRDTAQDAISAIFAEEDNRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRKNGTSGHRGPNLQRSLTTEMEYDN
        KAVRDTAQDAISAIFAEEDNRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRKNGTSGHRGPNLQRSLTTEMEYDN
Subjt:  KAVRDTAQDAISAIFAEEDNRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRKNGTSGHRGPNLQRSLTTEMEYDN

Query:  RYEPVEYGRETLGTSKSMISGPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLKSPSW
        RYEPVEYGRETLGTSKS ISGPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLKSPSW
Subjt:  RYEPVEYGRETLGTSKSMISGPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLKSPSW

Query:  QVRFKALCILESIVRKNDDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDAGVLEAEN
        QVRFKALC+LESIVRK DDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDAGVLE EN
Subjt:  QVRFKALCILESIVRKNDDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDAGVLEAEN

Query:  LSKTPLVDVLFGDGVNTVASTSELKNDDDPFSDVSFQTTDNGENPDDLFSGMNVDNSQVSNENKRPASEQKNEHGVFDVLGSSSEAAVQEHTRKDVNDLM
        LSKTPLVD LFGDGVNTV STSELKNDDDPFSDVSFQTTDNGENPDDLFSGMNVDN QVSNENKRPASEQKNEHGVF V GSSSEAAVQEHTRKDVNDLM
Subjt:  LSKTPLVDVLFGDGVNTVASTSELKNDDDPFSDVSFQTTDNGENPDDLFSGMNVDNSQVSNENKRPASEQKNEHGVFDVLGSSSEAAVQEHTRKDVNDLM

Query:  SGLSIHEDGLKSNDIGDSKDSLSESLYSVSGQPNHQYQTKDSSVNGIYSSPMVGTNMNAAFFPGMTYLPSGMMFNPAFSSRPMGYAPTGNFFAQQNLVSA
        SGLSIHE+ LKSNDIGDSKDSLSESLYSVS QPNHQYQTKDSSVNGIYSSPMVGTNMNAAF PGMTYLPSGMMFNPAFSS+PMGYAPTGNFFAQQNLVSA
Subjt:  SGLSIHEDGLKSNDIGDSKDSLSESLYSVSGQPNHQYQTKDSSVNGIYSSPMVGTNMNAAFFPGMTYLPSGMMFNPAFSSRPMGYAPTGNFFAQQNLVSA

Query:  MSNYQQFGNPHLQSSGGSAGNGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPKRVV
        MSNYQQFGNPHLQSSGGSAGNGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPKRVV
Subjt:  MSNYQQFGNPHLQSSGGSAGNGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPKRVV

TrEMBL top hitse value%identityAlignment
A0A0A0L8E2 VHS domain-containing protein5.0e-31083.28Show/hide
Query:  MIDAATSDEDKVTPVYKLEEISEVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN
        MIDAATSDEDKVTPVYKLEEI EVLRSSHVSIVKEFSEFILKRLEHKSP+VKQKALRLIKY +GKSGVEFRREMQR+SVAVRQL HYKGQPDPLKGDALN
Subjt:  MIDAATSDEDKVTPVYKLEEISEVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN

Query:  KAVRDTAQDAISAIFAEEDNRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRKNGTSGHRGPNLQRSLTTEMEYDN
        KAVRDTA +AIS+IFAEEDN+PAPSENLN RIQGFGNSNYEPP EDKKSFLSEVVGLGSASIKQGLSN AQGHSSRKNGTS HRG NLQRSLTTEMEYDN
Subjt:  KAVRDTAQDAISAIFAEEDNRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRKNGTSGHRGPNLQRSLTTEMEYDN

Query:  RYEPVEYGRETLGTSKSMISGPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLKSPSW
        RYEPVEYGRETLGTSKS  SG WN DS       +NG+ SSGSSE KTRE+RLL+TIATAGGVR+QPTRD+IQAFLVEA KLDA+ALS+ALE+KLKSPSW
Subjt:  RYEPVEYGRETLGTSKSMISGPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLKSPSW

Query:  QVRFKALCILESIVRKNDDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDAG------
        QVRFKALCILESIVR+NDDD FSIV SYFSENQ+AVIGCSESPQASLREKA+KVMPLLDGGKGVPS+N  EKS P+N SSTIQMPDLIDTSDAG      
Subjt:  QVRFKALCILESIVRKNDDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDAG------

Query:  -VLEAENLSKTPLVDVLFGDGVNTVASTSELKNDDDPFSDVSFQTTDNGENPDDLFSGMNVDNSQVSNENKRPASEQKNEHGVFDVLGSSSEAAVQEHTR
          +E ENLS TPLVD LFGDG+NTV STSELKNDDDPFSDVSF T +  ENPDDLFSGMN DN+QVSNENK+ A E KNE GVFD+ GSSSE AVQEH R
Subjt:  -VLEAENLSKTPLVDVLFGDGVNTVASTSELKNDDDPFSDVSFQTTDNGENPDDLFSGMNVDNSQVSNENKRPASEQKNEHGVFDVLGSSSEAAVQEHTR

Query:  KDVNDLMSGLSIHEDGLKSNDIGDSKDSLSESLYSVSGQPNHQYQTKDSSVNGIYSSPMVGTNMNAAFFPGMTYLPSGMMFNPAFSSRPMGYAPTGNFFA
        KDVNDLMSGLSIHED LK  D GDSKDSLSESL+S S QPNHQ Q    S+NGIYSSPM G+NMNAAFFPGMTYLPSGM+FNPAFSS+PM YA TGNFF 
Subjt:  KDVNDLMSGLSIHEDGLKSNDIGDSKDSLSESLYSVSGQPNHQYQTKDSSVNGIYSSPMVGTNMNAAFFPGMTYLPSGMMFNPAFSSRPMGYAPTGNFFA

Query:  QQNLVSAMSNYQQFGNPHLQSS--GGSAGNGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPKRVV
        QQ L+SAMSNYQQFGNP+LQS+  GG  G+GGYSSP PDIFQPNLA Q S+SVMNSSKKEDTRAFDFIS+H+AAARDPKRVV
Subjt:  QQNLVSAMSNYQQFGNPHLQSS--GGSAGNGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPKRVV

A0A1S3BH26 VHS domain-containing protein At3g162700.0e+0084.58Show/hide
Query:  MIDAATSDEDKVTPVYKLEEISEVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN
        MIDAATSDEDKVTPVYKLEEI EVLRSSHVSIVKEFSEFILKRLEHKSP+VKQKALRLIKY +GKSGVEFRREMQR+SVAVRQL HYKGQPDPLKGDALN
Subjt:  MIDAATSDEDKVTPVYKLEEISEVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN

Query:  KAVRDTAQDAISAIFAEEDNRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRKNGTSGHRGPNLQRSLTTEMEYDN
        KAVRDTA +AIS+IFAEEDN+PAPSENLN RIQGFGNSNYEPP+EDKKSFLSEVVGLGSASIKQGLSN AQGHSSRKNGTS HRG NLQRSLTTEMEYDN
Subjt:  KAVRDTAQDAISAIFAEEDNRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRKNGTSGHRGPNLQRSLTTEMEYDN

Query:  RYEPVEYGRETLGTSKSMISGPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLKSPSW
        RYEPVEYGRETLGT++S  SG WN DS       +NG+ SSGSSE KTRE+RLL+TIATAGGVR+QPTRD+IQAFLVEA KLDA+ALS+ALE+KLKSPSW
Subjt:  RYEPVEYGRETLGTSKSMISGPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLKSPSW

Query:  QVRFKALCILESIVRKNDDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDAG------
        QVRFKALCILESIVR+NDDDHFSIV SYFSENQ+AVIGCSESPQASLREKASKVMPLLDGGKGVPSMN SEKS P+N SSTIQMPDLIDTSDAG      
Subjt:  QVRFKALCILESIVRKNDDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDAG------

Query:  -VLEAENLSKTPLVDVLFGDGVNTVASTSELKNDDDPFSDVSFQTTDNGENPDDLFSGMNVDNSQVSNENKRPASEQKNEHGVFDVLGSSSEAAVQEHTR
          +E ENLS TPLVD LFGDG+NTV STSELKNDDDPFSDVSF T +  ENPDDLFSGMN DN+QVSNENK+PA EQKNE GVFD+ GSSSE AVQEH R
Subjt:  -VLEAENLSKTPLVDVLFGDGVNTVASTSELKNDDDPFSDVSFQTTDNGENPDDLFSGMNVDNSQVSNENKRPASEQKNEHGVFDVLGSSSEAAVQEHTR

Query:  KDVNDLMSGLSIHEDGLKSNDIGDSKDSLSESLYSVSGQPNHQYQTKDSSVNGIYSSPMVGTNMNAAFFPGMTYLPSGMMFNPAFSSRPMGYAPTGNFFA
        KDVNDLMSGLSIHED LKS D GDSKDSLSESL+S SGQPNHQ      S+NGIYSSPM GTNMNAAFFPGMTYLPSGMMFNPAFSS+PM YA +GNFF 
Subjt:  KDVNDLMSGLSIHEDGLKSNDIGDSKDSLSESLYSVSGQPNHQYQTKDSSVNGIYSSPMVGTNMNAAFFPGMTYLPSGMMFNPAFSSRPMGYAPTGNFFA

Query:  QQNLVSAMSNYQQFGNPHLQS-SGGSAGNGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPKRVV
        QQ L+SAMSNYQQFGNP+LQS SGG  G+GGYSSPLPDIFQPNLA Q S+SVMNSSKKEDTRAFDFIS+H+AAARDPKRVV
Subjt:  QQNLVSAMSNYQQFGNPHLQS-SGGSAGNGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPKRVV

A0A6J1GBM7 VHS domain-containing protein At3g16270-like0.0e+00100Show/hide
Query:  MIDAATSDEDKVTPVYKLEEISEVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN
        MIDAATSDEDKVTPVYKLEEISEVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN
Subjt:  MIDAATSDEDKVTPVYKLEEISEVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN

Query:  KAVRDTAQDAISAIFAEEDNRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRKNGTSGHRGPNLQRSLTTEMEYDN
        KAVRDTAQDAISAIFAEEDNRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRKNGTSGHRGPNLQRSLTTEMEYDN
Subjt:  KAVRDTAQDAISAIFAEEDNRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRKNGTSGHRGPNLQRSLTTEMEYDN

Query:  RYEPVEYGRETLGTSKSMISGPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLKSPSW
        RYEPVEYGRETLGTSKSMISGPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLKSPSW
Subjt:  RYEPVEYGRETLGTSKSMISGPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLKSPSW

Query:  QVRFKALCILESIVRKNDDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDAGVLEAEN
        QVRFKALCILESIVRKNDDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDAGVLEAEN
Subjt:  QVRFKALCILESIVRKNDDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDAGVLEAEN

Query:  LSKTPLVDVLFGDGVNTVASTSELKNDDDPFSDVSFQTTDNGENPDDLFSGMNVDNSQVSNENKRPASEQKNEHGVFDVLGSSSEAAVQEHTRKDVNDLM
        LSKTPLVDVLFGDGVNTVASTSELKNDDDPFSDVSFQTTDNGENPDDLFSGMNVDNSQVSNENKRPASEQKNEHGVFDVLGSSSEAAVQEHTRKDVNDLM
Subjt:  LSKTPLVDVLFGDGVNTVASTSELKNDDDPFSDVSFQTTDNGENPDDLFSGMNVDNSQVSNENKRPASEQKNEHGVFDVLGSSSEAAVQEHTRKDVNDLM

Query:  SGLSIHEDGLKSNDIGDSKDSLSESLYSVSGQPNHQYQTKDSSVNGIYSSPMVGTNMNAAFFPGMTYLPSGMMFNPAFSSRPMGYAPTGNFFAQQNLVSA
        SGLSIHEDGLKSNDIGDSKDSLSESLYSVSGQPNHQYQTKDSSVNGIYSSPMVGTNMNAAFFPGMTYLPSGMMFNPAFSSRPMGYAPTGNFFAQQNLVSA
Subjt:  SGLSIHEDGLKSNDIGDSKDSLSESLYSVSGQPNHQYQTKDSSVNGIYSSPMVGTNMNAAFFPGMTYLPSGMMFNPAFSSRPMGYAPTGNFFAQQNLVSA

Query:  MSNYQQFGNPHLQSSGGSAGNGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPKRVV
        MSNYQQFGNPHLQSSGGSAGNGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPKRVV
Subjt:  MSNYQQFGNPHLQSSGGSAGNGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPKRVV

A0A6J1IW71 VHS domain-containing protein At3g16270-like0.0e+0084.12Show/hide
Query:  MIDAATSDEDKVTPVYKLEEISEVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN
        MIDAATSDEDKVTPVYKLEEI EVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKY +GKSGVEFRREMQRHSVAVRQL HYKGQPDPLKGDALN
Subjt:  MIDAATSDEDKVTPVYKLEEISEVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN

Query:  KAVRDTAQDAISAIFAEEDNRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRKNGTSGHRGPNLQRSLTTEMEYDN
        KAVR+TA DAISAIFAEEDN+PAPSENLN RIQGFGNSNYEPP EDKKSFLSEVVGLGSASIKQGLSN AQGHSSRKNGTS  RGPNLQRSLTTE+EYDN
Subjt:  KAVRDTAQDAISAIFAEEDNRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRKNGTSGHRGPNLQRSLTTEMEYDN

Query:  RYEPVEYGRETLGTSKSMISGPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLKSPSW
        RYEPVEYGRETLGTSKS ISG WN DS       +NGN SSGSS  KTREERLLETIATAGGVR+QPTRDAIQAFLVEAA LDA+ALS+ALE+KLKSPSW
Subjt:  RYEPVEYGRETLGTSKSMISGPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLKSPSW

Query:  QVRFKALCILESIVRKNDDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDAG------
        QVRFKALCILESIVR++ D+HFSIV SYFSENQDAVIGCSESPQASLR+KASKVMPLLDGGKGVP MNDSEKS P+N SSTIQMPDL+DTSDAG      
Subjt:  QVRFKALCILESIVRKNDDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDAG------

Query:  -VLEAENLSKTPLVDVLFGDGVNTVASTSELKNDDDPFSDVSFQTTDNGENPDDLFSGMNVDNSQVSNENKRPASEQKNEHGVFDVLGSSSEAAVQEHTR
          LE ENLS  PLVD LFG G+NTV STSELKNDDDPFSDV F TT+  ENPDD+FSGMN +N+QV++ENK+P SEQKNE GVFD+ GSSSE AVQEH R
Subjt:  -VLEAENLSKTPLVDVLFGDGVNTVASTSELKNDDDPFSDVSFQTTDNGENPDDLFSGMNVDNSQVSNENKRPASEQKNEHGVFDVLGSSSEAAVQEHTR

Query:  KDVNDLMSGLSIHEDGLKSNDIGDSKDSLSESLYSVSGQPNHQYQTKDSSVNGIYSSPMVGTNMNAAFFPGMTYLPSGMMFNPAFSSRPMGYAPTGNFFA
        KDV DLMSGLSIHED LK+ D GDSKDSLSESL+SVS QPNHQ Q    S+ G YSSPMVGTNMNA FFPGM YLPSGMMFNPAFSS+PMGYAPTGNFF 
Subjt:  KDVNDLMSGLSIHEDGLKSNDIGDSKDSLSESLYSVSGQPNHQYQTKDSSVNGIYSSPMVGTNMNAAFFPGMTYLPSGMMFNPAFSSRPMGYAPTGNFFA

Query:  QQNLVSAMSNYQQFGNPHLQSSGGSAGNGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPKRVV
        QQ L+SAMSNYQQFGNP+LQSSG     GGYSSPLPDIFQPNLA Q  SSVMNSSKKEDTRAFDFIS+H+AAARDPKRVV
Subjt:  QQNLVSAMSNYQQFGNPHLQSSGGSAGNGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPKRVV

A0A6J1K668 VHS domain-containing protein At3g16270-like0.0e+0096.73Show/hide
Query:  MIDAATSDEDKVTPVYKLEEISEVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN
        MIDAATSDEDKVTPVYKLEEI EVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKYG+GKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN
Subjt:  MIDAATSDEDKVTPVYKLEEISEVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN

Query:  KAVRDTAQDAISAIFAEEDNRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRKNGTSGHRGPNLQRSLTTEMEYDN
        KAVRDTAQDAISAIFAEEDNRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRKNGTSGHRGPNLQRSLTTEMEYDN
Subjt:  KAVRDTAQDAISAIFAEEDNRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRKNGTSGHRGPNLQRSLTTEMEYDN

Query:  RYEPVEYGRETLGTSKSMISGPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLKSPSW
        RYEPVEYGRETLGTSKS ISGPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLKSPSW
Subjt:  RYEPVEYGRETLGTSKSMISGPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLKSPSW

Query:  QVRFKALCILESIVRKNDDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDAGVLEAEN
        QVRFKALC+LESIVRKNDDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMN SEKS PNNPSSTIQMPDLIDTSDAGVLE EN
Subjt:  QVRFKALCILESIVRKNDDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDAGVLEAEN

Query:  LSKTPLVDVLFGDGVNTVASTSELKNDDDPFSDVSFQTTDNGENPDDLFSGMNVDNSQVSNENKRPASEQKNEHGVFDVLGSSSEAAVQEHTRKDVNDLM
        LSKTPLVD LFGDGVNT+ STSELKNDDDPFSDVSFQTTD+GENPDDLFSGM VDNSQVSNE+KRPASEQKNEHGVFDV GSSSEAAVQEHTRKDVN+LM
Subjt:  LSKTPLVDVLFGDGVNTVASTSELKNDDDPFSDVSFQTTDNGENPDDLFSGMNVDNSQVSNENKRPASEQKNEHGVFDVLGSSSEAAVQEHTRKDVNDLM

Query:  SGLSIHEDGLKSNDIGDSKDSLSESLYSVSGQPNHQYQTKDSSVNGIYSSPMVGTNMNAAFFPGMTYLPSGMMFNPAFSSRPMGYAPTGNFFAQQNLVSA
        SGLSIHEDGLKSNDIGDSKDSLSESLYSVS QPNHQYQTKDSSVNGIYSSPMVGTNMNAA FPG TYLPSGMMFNPAFSS+PMGYAPTGNFFAQQNL SA
Subjt:  SGLSIHEDGLKSNDIGDSKDSLSESLYSVSGQPNHQYQTKDSSVNGIYSSPMVGTNMNAAFFPGMTYLPSGMMFNPAFSSRPMGYAPTGNFFAQQNLVSA

Query:  MSNYQQFGNPHLQSSGGSAGNGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPKRVV
        MSN QQFGNPHLQSSGGSAGNGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPK+VV
Subjt:  MSNYQQFGNPHLQSSGGSAGNGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPKRVV

SwissProt top hitse value%identityAlignment
G3V8Y7 AP-4 complex accessory subunit Tepsin3.9e-0831.97Show/hide
Query:  TSDEDKVTPVYKLEEISEVLRSSHVSIVKE--FSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALNKAV
        TSD+D   P Y  EEI+++   SH S+       E++L RL+  S  VK K L+++ Y        F   ++R+S  +++   + G PDPL G++L + V
Subjt:  TSDEDKVTPVYKLEEISEVLRSSHVSIVKE--FSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALNKAV

Query:  RDTAQDAISAIFAEEDNRPAPSE--------------NLNSRIQGFG
        R  AQD  S +F++   +P PS+                +S +QGFG
Subjt:  RDTAQDAISAIFAEEDNRPAPSE--------------NLNSRIQGFG

Q3U3N6 AP-4 complex accessory subunit Tepsin4.7e-0624.79Show/hide
Query:  TSDEDKVTPVYKLEEISEVLRSSHVSIVKE--FSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALNKAV
        TSD+    P Y  EEI+++   SH S+       E++L RL+  S  VK K L+++ Y  G     F   ++R+S  +++   + G PDPL G++L + V
Subjt:  TSDEDKVTPVYKLEEISEVLRSSHVSIVKE--FSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALNKAV

Query:  RDTAQDAISAIFAEEDNRP-------APSENLN------SRIQGFG----NSNYEPPAEDKKSFLSEVVGLGSASIKQGLSN------LAQGHSSRKNGT
        R  AQD  S +F++   +P        P   +       S +QGFG    +S      E   S +     + + +++ G  N      L  G S +   T
Subjt:  RDTAQDAISAIFAEEDNRP-------APSENLN------SRIQGFG----NSNYEPPAEDKKSFLSEVVGLGSASIKQGLSN------LAQGHSSRKNGT

Query:  --SGHRGPNLQRSLTTEM--EYDNRYEPVEYGRETLGTSKSMISGP--WNPDSWANKVEATNGNLSSGSSERK---------------------TREERL
          + H  PN    L   +      R++P + G    G    + S P   N    +N   A++    SGS                          +E  L
Subjt:  --SGHRGPNLQRSLTTEM--EYDNRYEPVEYGRETLGTSKSMISGP--WNPDSWANKVEATNGNLSSGSSERK---------------------TREERL

Query:  LETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLKSPSWQVRFKALCILES
        + T+    G R+  +R+  Q F+ E   L+  A+   L  +L   S   + +ALC + S
Subjt:  LETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLKSPSWQVRFKALCILES

Q9C5H4 Protein MODIFIED TRANSPORT TO THE VACUOLE 11.8e-17854.19Show/hide
Query:  MIDAATSDEDKVTPVYKLEEISEVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN
        MIDA TSDEDKV PVYKLEEI ++LRSSHVSIVKEFSEFILKRL++KSPIVKQKALRLIKY +GKSG EFRREMQR+SVAVR L HYKG PDPLKGDALN
Subjt:  MIDAATSDEDKVTPVYKLEEISEVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN

Query:  KAVRDTAQDAISAIFAEED-NRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRK--NGTSGHRGPNLQRSLTTEME
        KAVR+TA + ISAIF+EE+  +PA  E++N RI+GFGN+N++ P+ D KSFLSEVVG+GSASIKQG+SN AQGH  +K  NG+S +RGPNL RSLT E E
Subjt:  KAVRDTAQDAISAIFAEED-NRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRK--NGTSGHRGPNLQRSLTTEME

Query:  YDNRYEPVEYGRE-TLGTSKSMISGPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLK
          +RY+PV+ G++   GTSK+   G     SW +     + + +S   E KTREE+LLETI T+GGVR+QPTRDA+  F++EAAK+DA+ALS AL+ KL 
Subjt:  YDNRYEPVEYGRE-TLGTSKSMISGPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLK

Query:  SPSWQVRFKALCILESIVRKNDDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDA---
        SP WQVR KALC+LE+I+RK +D++FSIV +YFSEN DA+  C+ESPQ+SLREKA+KV+ LL+GG+    M+ S+ +      + + +PDLIDT D+   
Subjt:  SPSWQVRFKALCILESIVRKNDDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDA---

Query:  -GVLEAENLSKT-----PLV-DVLFGDGVNTVASTSELKNDDDPFSDVSFQTTDNGENPDDLFSGMNVDNSQVSNENKRPASEQKNEHGVFDVLGSSSEA
           L A +   T     PL+ D  FGD  +   S+SE K DDDPF+DVSF   +  E+ DDLFSGM V         K  A    +   +FD+ GS+++ 
Subjt:  -GVLEAENLSKT-----PLV-DVLFGDGVNTVASTSELKNDDDPFSDVSFQTTDNGENPDDLFSGMNVDNSQVSNENKRPASEQKNEHGVFDVLGSSSEA

Query:  AVQEHTRKDVNDLMSGLSIHEDGLKSNDIGDSKDSLSESLYSVSGQPNHQYQTKDSSVNGIYSSPMVGTNMNAAFFPGMT--YLPSGMMFNPAFSSRPMG
          +    K++NDLM   SI E+   SN  G S  +L + L+++    +H  Q  ++ V GI  S   G   N     G+     P GMM NPAF+S+P+ 
Subjt:  AVQEHTRKDVNDLMSGLSIHEDGLKSNDIGDSKDSLSESLYSVSGQPNHQYQTKDSSVNGIYSSPMVGTNMNAAFFPGMT--YLPSGMMFNPAFSSRPMG

Query:  YAPTGNFFA-QQNLVSAMSNYQQFGNPHLQSSGG--SAG-NGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPKRV
        YA   +  A QQ  +  MSN+QQFGN + Q SG   S G +GG  S LPDIFQPN   Q  +S MN SKKEDTRAFDFIS+H+ +ARD KRV
Subjt:  YAPTGNFFA-QQNLVSAMSNYQQFGNPHLQSSGG--SAG-NGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPKRV

Arabidopsis top hitse value%identityAlignment
AT3G16270.1 ENTH/VHS family protein1.3e-17954.19Show/hide
Query:  MIDAATSDEDKVTPVYKLEEISEVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN
        MIDA TSDEDKV PVYKLEEI ++LRSSHVSIVKEFSEFILKRL++KSPIVKQKALRLIKY +GKSG EFRREMQR+SVAVR L HYKG PDPLKGDALN
Subjt:  MIDAATSDEDKVTPVYKLEEISEVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALN

Query:  KAVRDTAQDAISAIFAEED-NRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRK--NGTSGHRGPNLQRSLTTEME
        KAVR+TA + ISAIF+EE+  +PA  E++N RI+GFGN+N++ P+ D KSFLSEVVG+GSASIKQG+SN AQGH  +K  NG+S +RGPNL RSLT E E
Subjt:  KAVRDTAQDAISAIFAEED-NRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRK--NGTSGHRGPNLQRSLTTEME

Query:  YDNRYEPVEYGRE-TLGTSKSMISGPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLK
          +RY+PV+ G++   GTSK+   G     SW +     + + +S   E KTREE+LLETI T+GGVR+QPTRDA+  F++EAAK+DA+ALS AL+ KL 
Subjt:  YDNRYEPVEYGRE-TLGTSKSMISGPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLK

Query:  SPSWQVRFKALCILESIVRKNDDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDA---
        SP WQVR KALC+LE+I+RK +D++FSIV +YFSEN DA+  C+ESPQ+SLREKA+KV+ LL+GG+    M+ S+ +      + + +PDLIDT D+   
Subjt:  SPSWQVRFKALCILESIVRKNDDDHFSIVASYFSENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDA---

Query:  -GVLEAENLSKT-----PLV-DVLFGDGVNTVASTSELKNDDDPFSDVSFQTTDNGENPDDLFSGMNVDNSQVSNENKRPASEQKNEHGVFDVLGSSSEA
           L A +   T     PL+ D  FGD  +   S+SE K DDDPF+DVSF   +  E+ DDLFSGM V         K  A    +   +FD+ GS+++ 
Subjt:  -GVLEAENLSKT-----PLV-DVLFGDGVNTVASTSELKNDDDPFSDVSFQTTDNGENPDDLFSGMNVDNSQVSNENKRPASEQKNEHGVFDVLGSSSEA

Query:  AVQEHTRKDVNDLMSGLSIHEDGLKSNDIGDSKDSLSESLYSVSGQPNHQYQTKDSSVNGIYSSPMVGTNMNAAFFPGMT--YLPSGMMFNPAFSSRPMG
          +    K++NDLM   SI E+   SN  G S  +L + L+++    +H  Q  ++ V GI  S   G   N     G+     P GMM NPAF+S+P+ 
Subjt:  AVQEHTRKDVNDLMSGLSIHEDGLKSNDIGDSKDSLSESLYSVSGQPNHQYQTKDSSVNGIYSSPMVGTNMNAAFFPGMT--YLPSGMMFNPAFSSRPMG

Query:  YAPTGNFFA-QQNLVSAMSNYQQFGNPHLQSSGG--SAG-NGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPKRV
        YA   +  A QQ  +  MSN+QQFGN + Q SG   S G +GG  S LPDIFQPN   Q  +S MN SKKEDTRAFDFIS+H+ +ARD KRV
Subjt:  YAPTGNFFA-QQNLVSAMSNYQQFGNPHLQSSGG--SAG-NGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFISEHIAAARDPKRV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCGATGCGGCGACTTCGGATGAGGATAAGGTCACGCCGGTGTACAAATTGGAAGAGATTTCTGAAGTGTTGAGATCTTCGCATGTCAGTATTGTCAAGGAATTTTC
GGAATTTATCTTGAAGAGGCTTGAACATAAGAGCCCGATTGTCAAACAGAAGGCTCTCAGGTTGATTAAGTATGGAATTGGGAAATCTGGTGTGGAATTCAGAAGGGAAA
TGCAGAGACACTCTGTGGCTGTCCGCCAGTTACTTCATTACAAGGGACAACCAGACCCCCTTAAAGGTGATGCACTTAATAAAGCTGTGAGGGATACTGCTCAGGACGCC
ATTTCTGCGATCTTTGCTGAAGAGGACAACAGGCCTGCCCCATCTGAGAATCTTAACAGTCGAATTCAAGGTTTTGGGAACTCAAATTATGAACCACCAGCAGAAGATAA
AAAATCATTTCTTAGCGAGGTAGTTGGTTTAGGAAGTGCATCAATCAAGCAAGGATTAAGTAATCTTGCACAAGGTCATTCATCGAGAAAGAATGGCACTAGTGGCCACA
GGGGTCCCAATCTTCAGAGGTCGTTGACTACTGAAATGGAGTATGACAATAGATATGAACCAGTTGAATATGGCCGTGAGACTCTCGGGACATCAAAGAGTATGATTAGT
GGACCATGGAACCCGGATTCTTGGGCAAATAAGGTGGAAGCTACTAATGGGAACCTGAGTTCTGGGTCTTCGGAGAGAAAAACTCGAGAAGAGAGGTTACTGGAGACCAT
TGCAACAGCAGGTGGTGTGCGCATACAACCAACTCGAGATGCCATTCAAGCATTTCTTGTGGAAGCTGCAAAGTTAGATGCAATGGCGCTGAGTAGTGCTCTTGAATCAA
AGCTTAAATCCCCATCATGGCAGGTTCGTTTCAAAGCTCTCTGCATCCTTGAGTCGATCGTTAGGAAAAATGATGACGATCATTTTTCAATTGTGGCATCGTATTTCAGT
GAAAATCAAGATGCAGTGATTGGATGTTCTGAATCTCCCCAAGCATCTCTTAGGGAAAAAGCTAGCAAGGTTATGCCACTTTTAGATGGAGGAAAAGGAGTCCCCTCCAT
GAATGATTCTGAAAAGTCCCGGCCAAACAACCCCAGTTCCACTATTCAGATGCCAGACTTAATAGACACAAGTGATGCAGGTGTTTTAGAGGCTGAAAACCTGTCGAAAA
CTCCATTAGTAGATGTCTTATTTGGAGATGGCGTAAACACCGTCGCAAGCACCAGTGAACTAAAGAATGATGATGACCCATTTTCAGATGTCTCTTTTCAGACAACTGAT
AATGGAGAAAATCCAGATGATCTTTTTTCTGGGATGAACGTCGATAATAGTCAGGTTAGTAATGAAAATAAAAGGCCTGCCTCGGAACAGAAAAATGAACATGGAGTTTT
TGATGTTCTTGGATCAAGTTCTGAAGCTGCAGTACAAGAACACACAAGGAAAGATGTTAATGACTTAATGAGTGGTTTGTCCATCCATGAAGATGGCTTGAAGAGTAATG
ATATAGGAGATTCCAAGGATTCACTGTCTGAATCTTTATATTCTGTTTCCGGTCAGCCAAACCATCAGTATCAGACTAAAGATTCTTCTGTAAATGGCATATACAGTTCA
CCAATGGTTGGGACAAATATGAATGCTGCCTTCTTCCCTGGAATGACATATCTTCCGTCCGGCATGATGTTCAATCCAGCCTTTTCATCTCGGCCAATGGGTTATGCTCC
CACAGGAAACTTCTTTGCTCAACAAAATTTAGTATCAGCCATGTCCAATTACCAACAGTTTGGGAACCCTCATCTCCAATCAAGTGGTGGAAGTGCGGGTAATGGAGGAT
ATTCTTCACCCCTTCCAGACATATTCCAGCCAAACCTTGCAACACAGCCATCTAGTTCCGTGATGAATAGTTCAAAGAAAGAAGATACCAGAGCTTTTGATTTTATCTCA
GAGCATATTGCAGCTGCTCGGGATCCAAAGCGGGTAGTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTGATTGCTAGGACCTCAATTCCGATACCCAATTACGTTTCACGTCTACCCCTTGACGATCTCCGGACAAAGATCGCCTGAATTGGGAGCTGCAAAAGTTTCAATTCC
AGACGAATCTGAGTTGGTTTGCGCGGTGAGACTGCTTGGAGAACAAGGAAAACATCGACTAAGCCAAAGGGTTTTCTGCATTTCTTTGCTTTCTGGGATTTGGATCAGGG
GTGGGATTGATTTAGGTCGGAGAGCTGTAGAGTCGTACTGGAGGTCGCGTATGATCGATGCGGCGACTTCGGATGAGGATAAGGTCACGCCGGTGTACAAATTGGAAGAG
ATTTCTGAAGTGTTGAGATCTTCGCATGTCAGTATTGTCAAGGAATTTTCGGAATTTATCTTGAAGAGGCTTGAACATAAGAGCCCGATTGTCAAACAGAAGGCTCTCAG
GTTGATTAAGTATGGAATTGGGAAATCTGGTGTGGAATTCAGAAGGGAAATGCAGAGACACTCTGTGGCTGTCCGCCAGTTACTTCATTACAAGGGACAACCAGACCCCC
TTAAAGGTGATGCACTTAATAAAGCTGTGAGGGATACTGCTCAGGACGCCATTTCTGCGATCTTTGCTGAAGAGGACAACAGGCCTGCCCCATCTGAGAATCTTAACAGT
CGAATTCAAGGTTTTGGGAACTCAAATTATGAACCACCAGCAGAAGATAAAAAATCATTTCTTAGCGAGGTAGTTGGTTTAGGAAGTGCATCAATCAAGCAAGGATTAAG
TAATCTTGCACAAGGTCATTCATCGAGAAAGAATGGCACTAGTGGCCACAGGGGTCCCAATCTTCAGAGGTCGTTGACTACTGAAATGGAGTATGACAATAGATATGAAC
CAGTTGAATATGGCCGTGAGACTCTCGGGACATCAAAGAGTATGATTAGTGGACCATGGAACCCGGATTCTTGGGCAAATAAGGTGGAAGCTACTAATGGGAACCTGAGT
TCTGGGTCTTCGGAGAGAAAAACTCGAGAAGAGAGGTTACTGGAGACCATTGCAACAGCAGGTGGTGTGCGCATACAACCAACTCGAGATGCCATTCAAGCATTTCTTGT
GGAAGCTGCAAAGTTAGATGCAATGGCGCTGAGTAGTGCTCTTGAATCAAAGCTTAAATCCCCATCATGGCAGGTTCGTTTCAAAGCTCTCTGCATCCTTGAGTCGATCG
TTAGGAAAAATGATGACGATCATTTTTCAATTGTGGCATCGTATTTCAGTGAAAATCAAGATGCAGTGATTGGATGTTCTGAATCTCCCCAAGCATCTCTTAGGGAAAAA
GCTAGCAAGGTTATGCCACTTTTAGATGGAGGAAAAGGAGTCCCCTCCATGAATGATTCTGAAAAGTCCCGGCCAAACAACCCCAGTTCCACTATTCAGATGCCAGACTT
AATAGACACAAGTGATGCAGGTGTTTTAGAGGCTGAAAACCTGTCGAAAACTCCATTAGTAGATGTCTTATTTGGAGATGGCGTAAACACCGTCGCAAGCACCAGTGAAC
TAAAGAATGATGATGACCCATTTTCAGATGTCTCTTTTCAGACAACTGATAATGGAGAAAATCCAGATGATCTTTTTTCTGGGATGAACGTCGATAATAGTCAGGTTAGT
AATGAAAATAAAAGGCCTGCCTCGGAACAGAAAAATGAACATGGAGTTTTTGATGTTCTTGGATCAAGTTCTGAAGCTGCAGTACAAGAACACACAAGGAAAGATGTTAA
TGACTTAATGAGTGGTTTGTCCATCCATGAAGATGGCTTGAAGAGTAATGATATAGGAGATTCCAAGGATTCACTGTCTGAATCTTTATATTCTGTTTCCGGTCAGCCAA
ACCATCAGTATCAGACTAAAGATTCTTCTGTAAATGGCATATACAGTTCACCAATGGTTGGGACAAATATGAATGCTGCCTTCTTCCCTGGAATGACATATCTTCCGTCC
GGCATGATGTTCAATCCAGCCTTTTCATCTCGGCCAATGGGTTATGCTCCCACAGGAAACTTCTTTGCTCAACAAAATTTAGTATCAGCCATGTCCAATTACCAACAGTT
TGGGAACCCTCATCTCCAATCAAGTGGTGGAAGTGCGGGTAATGGAGGATATTCTTCACCCCTTCCAGACATATTCCAGCCAAACCTTGCAACACAGCCATCTAGTTCCG
TGATGAATAGTTCAAAGAAAGAAGATACCAGAGCTTTTGATTTTATCTCAGAGCATATTGCAGCTGCTCGGGATCCAAAGCGGGTAGTCTGAGTTTAGGATTGTTGGCGA
CGTTGGTGAGGTGATGAATGAAAGCCACAGTTCTGTGCCATTAGAGATTGATGGCTCCATAACGTAGGCACTGTCGACGAACCACAACACGGTACACACTTGTGAAAAAT
GCGTAGATAGAAGTAGGAAAAAGAGAAAATGGACGTTGAAGAAGAATGAGAGTTGCTTTTACATAATCAGCTGTTTATACTGTGGTAATTCCTGGACTGGTTGTGTTCCA
ATCTGTAGAGTTTCATTTTGCCCAAAAAAGGAAAGAAAAAGAAAAAAAGAAAAAAACTTATTATTGTAGTGGTTATGGTAATAATGCAGCTTGTTCTTTCATGTATATGC
CTGTGGTTATACTTTATGCTGAGTTCTTAGATTGACACTCATAAGCTGAGTTATTAATATAACACTTCATTCTTGAATGGTACAATCAAAGGAAACTATTGTTA
Protein sequenceShow/hide protein sequence
MIDAATSDEDKVTPVYKLEEISEVLRSSHVSIVKEFSEFILKRLEHKSPIVKQKALRLIKYGIGKSGVEFRREMQRHSVAVRQLLHYKGQPDPLKGDALNKAVRDTAQDA
ISAIFAEEDNRPAPSENLNSRIQGFGNSNYEPPAEDKKSFLSEVVGLGSASIKQGLSNLAQGHSSRKNGTSGHRGPNLQRSLTTEMEYDNRYEPVEYGRETLGTSKSMIS
GPWNPDSWANKVEATNGNLSSGSSERKTREERLLETIATAGGVRIQPTRDAIQAFLVEAAKLDAMALSSALESKLKSPSWQVRFKALCILESIVRKNDDDHFSIVASYFS
ENQDAVIGCSESPQASLREKASKVMPLLDGGKGVPSMNDSEKSRPNNPSSTIQMPDLIDTSDAGVLEAENLSKTPLVDVLFGDGVNTVASTSELKNDDDPFSDVSFQTTD
NGENPDDLFSGMNVDNSQVSNENKRPASEQKNEHGVFDVLGSSSEAAVQEHTRKDVNDLMSGLSIHEDGLKSNDIGDSKDSLSESLYSVSGQPNHQYQTKDSSVNGIYSS
PMVGTNMNAAFFPGMTYLPSGMMFNPAFSSRPMGYAPTGNFFAQQNLVSAMSNYQQFGNPHLQSSGGSAGNGGYSSPLPDIFQPNLATQPSSSVMNSSKKEDTRAFDFIS
EHIAAARDPKRVV