; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0016593 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0016593
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionRegulator of Vps4 activity in the MVB pathway protein, putative
Genome locationchr06:17061848..17063705
RNA-Seq ExpressionPay0016593
SyntenyPay0016593
Gene Ontology termsGO:0015031 - protein transport (biological process)
InterPro domainsIPR005061 - Vacuolar protein sorting-associated protein Ist1
IPR042277 - Vacuolar protein sorting-associated protein IST1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0063421.1 putative Regulator of Vps4 activity in the MVB pathway protein [Cucumis melo var. makuwa]7.9e-21999.03Show/hide
Query:  MGRKLDALLGRNRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKE
        MGRKLDALLGRNRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKE
Subjt:  MGRKLDALLGRNRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKE

Query:  CPGEVKEAISSLIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPPIIVKEKL
        CPGEVKEAISSLIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPPII+KEKL
Subjt:  CPGEVKEAISSLIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPPIIVKEKL

Query:  NQKLEVKKEADLYNPEVTNTTDDLHEVITSEKLASELVKAKKFKDVASAAEEAFQSAAKAINFLKSESQDIDSDHEDGSHLQKNGKSSELNSSYESKLNT
        NQKLEVKKEADLYNPEVTNTTDDLHEVITSEKLASELVKAKKFKDVASAAEEAFQSAAKAINFLKSESQDIDSDHEDGSHLQKNGKSSELNSSYESKLNT
Subjt:  NQKLEVKKEADLYNPEVTNTTDDLHEVITSEKLASELVKAKKFKDVASAAEEAFQSAAKAINFLKSESQDIDSDHEDGSHLQKNGKSSELNSSYESKLNT

Query:  KEASIPSSKIDYSDDRFSFEKICPVGLESCSSQYEDADMEETNQEELPKEPAIESTSSVLNLCSDINNTKEQSSEPNSTFYNESDGDKIISKDNEVASGK
        KEASIPSSKIDYSDDRF FEKICPVGLESCSSQYEDADMEETNQEELPKEPAIESTSSVLNL SDINNTKEQSSEPNSTFYNESDGDKIISKDNEVASGK
Subjt:  KEASIPSSKIDYSDDRFSFEKICPVGLESCSSQYEDADMEETNQEELPKEPAIESTSSVLNLCSDINNTKEQSSEPNSTFYNESDGDKIISKDNEVASGK

Query:  EISEENIKVHEPK
        EISEENIKVHE K
Subjt:  EISEENIKVHEPK

XP_008461584.1 PREDICTED: uncharacterized protein LOC103500154 [Cucumis melo]1.7e-221100Show/hide
Query:  MGRKLDALLGRNRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKE
        MGRKLDALLGRNRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKE
Subjt:  MGRKLDALLGRNRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKE

Query:  CPGEVKEAISSLIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPPIIVKEKL
        CPGEVKEAISSLIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPPIIVKEKL
Subjt:  CPGEVKEAISSLIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPPIIVKEKL

Query:  NQKLEVKKEADLYNPEVTNTTDDLHEVITSEKLASELVKAKKFKDVASAAEEAFQSAAKAINFLKSESQDIDSDHEDGSHLQKNGKSSELNSSYESKLNT
        NQKLEVKKEADLYNPEVTNTTDDLHEVITSEKLASELVKAKKFKDVASAAEEAFQSAAKAINFLKSESQDIDSDHEDGSHLQKNGKSSELNSSYESKLNT
Subjt:  NQKLEVKKEADLYNPEVTNTTDDLHEVITSEKLASELVKAKKFKDVASAAEEAFQSAAKAINFLKSESQDIDSDHEDGSHLQKNGKSSELNSSYESKLNT

Query:  KEASIPSSKIDYSDDRFSFEKICPVGLESCSSQYEDADMEETNQEELPKEPAIESTSSVLNLCSDINNTKEQSSEPNSTFYNESDGDKIISKDNEVASGK
        KEASIPSSKIDYSDDRFSFEKICPVGLESCSSQYEDADMEETNQEELPKEPAIESTSSVLNLCSDINNTKEQSSEPNSTFYNESDGDKIISKDNEVASGK
Subjt:  KEASIPSSKIDYSDDRFSFEKICPVGLESCSSQYEDADMEETNQEELPKEPAIESTSSVLNLCSDINNTKEQSSEPNSTFYNESDGDKIISKDNEVASGK

Query:  EISEENIKVHEPK
        EISEENIKVHEPK
Subjt:  EISEENIKVHEPK

XP_011651369.1 uncharacterized protein LOC105434874 [Cucumis sativus]2.0e-19087.38Show/hide
Query:  MGRKLDALLGRNRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKE
        MGRKLDALLGRNRYLK SKFKTLANMAISRTSILKN HRARCSLARADVLQLLNL YQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVAL QMNKE
Subjt:  MGRKLDALLGRNRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKE

Query:  CPGEVKEAISSLIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPPIIVKEKL
        CPGEVKEAISSLIFAASRCGEFPELQEIRRIFELKFG EFASSAIDLRNNCGVS KMIQKFSTKQP+AETKLKVLKEIASENGIPL+LE+E PII+K K 
Subjt:  CPGEVKEAISSLIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPPIIVKEKL

Query:  NQKLEVKKEADLYNPEVTNTTDDLHEVITSEKLASELVKAKKFKDVASAAEEAFQSAA-------KAINFLKSESQDIDSDHEDGSHLQKNGKSSELNSS
        NQKLE+KKEADLYNPEVTNTTDDLHEVITSEK+ASELVK+KKFKDVASA EE FQSAA        AIN  KS+SQDIDSDHEDGSHLQKNGK S+LNSS
Subjt:  NQKLEVKKEADLYNPEVTNTTDDLHEVITSEKLASELVKAKKFKDVASAAEEAFQSAA-------KAINFLKSESQDIDSDHEDGSHLQKNGKSSELNSS

Query:  YESKLNTKEASIPSSKIDYSDDRFSFEKICPVGLESCSSQYEDADMEETNQEELPKEPAIESTSSVLNLCSDINNTKEQSSEPNSTFYNESDGDKIISKD
        YE KLNTKE  I S KIDYSDDRFSFEKICPV LESCSSQYEDADMEETNQEELPKEPA ESTS  L   SD+NN KEQSS+PNS FYNESDGDKI + D
Subjt:  YESKLNTKEASIPSSKIDYSDDRFSFEKICPVGLESCSSQYEDADMEETNQEELPKEPAIESTSSVLNLCSDINNTKEQSSEPNSTFYNESDGDKIISKD

Query:  NEVASGKEISEENIKVHEPK
        NEVASGKEISEE+I+VHE K
Subjt:  NEVASGKEISEENIKVHEPK

XP_038875570.1 uncharacterized protein LOC120067980 isoform X1 [Benincasa hispida]5.9e-17482.42Show/hide
Query:  MGRKLDALLGRNRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKE
        MGRKLDALLGRNRYLKPSKFKTLANMAISRTSILKNQHRAR SLARADVLQLLNL YQHRAQLRVEIVIKENNMLDAL MIEDYCCLL+EKVAL QMNKE
Subjt:  MGRKLDALLGRNRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKE

Query:  CPGEVKEAISSLIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPPIIVK-EK
        CPGEVKEAISSLIFAASRCGEFPELQEIRRI ELKFGTEFASSA++LRNNCGVSPKMIQKFSTKQP+AETKLKVLKEIASENGI  +LEEEP II+K EK
Subjt:  CPGEVKEAISSLIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPPIIVK-EK

Query:  LNQKLEVKKEADLYNPEVTNTTDDLHEVITSEKLASELVKAKKFKDVASAAEEAFQSAA-------KAINFLKSESQDIDSDHEDGSHLQKNGKSSELNS
        LNQKLE K+ ADL NPEV NTTD+LHEVITS++ A E VKAKKFKDVASA +EAFQSAA        AI   KSE+QDIDSDHEDGSHLQ + KSS+L+S
Subjt:  LNQKLEVKKEADLYNPEVTNTTDDLHEVITSEKLASELVKAKKFKDVASAAEEAFQSAA-------KAINFLKSESQDIDSDHEDGSHLQKNGKSSELNS

Query:  SYESKLNTKEASIPSSKIDYSDDRFSFEKICPVGLESCSSQYEDADMEETNQEELPKEPAIESTSSVLNLCSDINNTKEQSSEPNSTFYNESDGDKIISK
        S+E KLNTKEASIPSSKIDYSDDRFSFEKICP  LESCSS+YEDADM E+NQ ELPKE A E  SSVL L SDI+N KEQSS+ NS FYNE +GDKII K
Subjt:  SYESKLNTKEASIPSSKIDYSDDRFSFEKICPVGLESCSSQYEDADMEETNQEELPKEPAIESTSSVLNLCSDINNTKEQSSEPNSTFYNESDGDKIISK

Query:  DNEVASGKEISEENIKVHEPK
        +NEVAS  EISE+ I+VHE K
Subjt:  DNEVASGKEISEENIKVHEPK

XP_038875575.1 uncharacterized protein LOC120067980 isoform X2 [Benincasa hispida]2.4e-17582.62Show/hide
Query:  MGRKLDALLGRNRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKE
        MGRKLDALLGRNRYLKPSKFKTLANMAISRTSILKNQHRAR SLARADVLQLLNL YQHRAQLRVEIVIKENNMLDAL MIEDYCCLL+EKVAL QMNKE
Subjt:  MGRKLDALLGRNRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKE

Query:  CPGEVKEAISSLIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPPIIVKEKL
        CPGEVKEAISSLIFAASRCGEFPELQEIRRI ELKFGTEFASSA++LRNNCGVSPKMIQKFSTKQP+AETKLKVLKEIASENGI  +LEEEP II+KEKL
Subjt:  CPGEVKEAISSLIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPPIIVKEKL

Query:  NQKLEVKKEADLYNPEVTNTTDDLHEVITSEKLASELVKAKKFKDVASAAEEAFQSAA-------KAINFLKSESQDIDSDHEDGSHLQKNGKSSELNSS
        NQKLE K+ ADL NPEV NTTD+LHEVITS++ A E VKAKKFKDVASA +EAFQSAA        AI   KSE+QDIDSDHEDGSHLQ + KSS+L+SS
Subjt:  NQKLEVKKEADLYNPEVTNTTDDLHEVITSEKLASELVKAKKFKDVASAAEEAFQSAA-------KAINFLKSESQDIDSDHEDGSHLQKNGKSSELNSS

Query:  YESKLNTKEASIPSSKIDYSDDRFSFEKICPVGLESCSSQYEDADMEETNQEELPKEPAIESTSSVLNLCSDINNTKEQSSEPNSTFYNESDGDKIISKD
        +E KLNTKEASIPSSKIDYSDDRFSFEKICP  LESCSS+YEDADM E+NQ ELPKE A E  SSVL L SDI+N KEQSS+ NS FYNE +GDKII K+
Subjt:  YESKLNTKEASIPSSKIDYSDDRFSFEKICPVGLESCSSQYEDADMEETNQEELPKEPAIESTSSVLNLCSDINNTKEQSSEPNSTFYNESDGDKIISKD

Query:  NEVASGKEISEENIKVHEPK
        NEVAS  EISE+ I+VHE K
Subjt:  NEVASGKEISEENIKVHEPK

TrEMBL top hitse value%identityAlignment
A0A0A0L6Y2 Uncharacterized protein9.8e-19187.38Show/hide
Query:  MGRKLDALLGRNRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKE
        MGRKLDALLGRNRYLK SKFKTLANMAISRTSILKN HRARCSLARADVLQLLNL YQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVAL QMNKE
Subjt:  MGRKLDALLGRNRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKE

Query:  CPGEVKEAISSLIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPPIIVKEKL
        CPGEVKEAISSLIFAASRCGEFPELQEIRRIFELKFG EFASSAIDLRNNCGVS KMIQKFSTKQP+AETKLKVLKEIASENGIPL+LE+E PII+K K 
Subjt:  CPGEVKEAISSLIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPPIIVKEKL

Query:  NQKLEVKKEADLYNPEVTNTTDDLHEVITSEKLASELVKAKKFKDVASAAEEAFQSAA-------KAINFLKSESQDIDSDHEDGSHLQKNGKSSELNSS
        NQKLE+KKEADLYNPEVTNTTDDLHEVITSEK+ASELVK+KKFKDVASA EE FQSAA        AIN  KS+SQDIDSDHEDGSHLQKNGK S+LNSS
Subjt:  NQKLEVKKEADLYNPEVTNTTDDLHEVITSEKLASELVKAKKFKDVASAAEEAFQSAA-------KAINFLKSESQDIDSDHEDGSHLQKNGKSSELNSS

Query:  YESKLNTKEASIPSSKIDYSDDRFSFEKICPVGLESCSSQYEDADMEETNQEELPKEPAIESTSSVLNLCSDINNTKEQSSEPNSTFYNESDGDKIISKD
        YE KLNTKE  I S KIDYSDDRFSFEKICPV LESCSSQYEDADMEETNQEELPKEPA ESTS  L   SD+NN KEQSS+PNS FYNESDGDKI + D
Subjt:  YESKLNTKEASIPSSKIDYSDDRFSFEKICPVGLESCSSQYEDADMEETNQEELPKEPAIESTSSVLNLCSDINNTKEQSSEPNSTFYNESDGDKIISKD

Query:  NEVASGKEISEENIKVHEPK
        NEVASGKEISEE+I+VHE K
Subjt:  NEVASGKEISEENIKVHEPK

A0A1S3CEY6 uncharacterized protein LOC1035001548.3e-222100Show/hide
Query:  MGRKLDALLGRNRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKE
        MGRKLDALLGRNRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKE
Subjt:  MGRKLDALLGRNRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKE

Query:  CPGEVKEAISSLIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPPIIVKEKL
        CPGEVKEAISSLIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPPIIVKEKL
Subjt:  CPGEVKEAISSLIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPPIIVKEKL

Query:  NQKLEVKKEADLYNPEVTNTTDDLHEVITSEKLASELVKAKKFKDVASAAEEAFQSAAKAINFLKSESQDIDSDHEDGSHLQKNGKSSELNSSYESKLNT
        NQKLEVKKEADLYNPEVTNTTDDLHEVITSEKLASELVKAKKFKDVASAAEEAFQSAAKAINFLKSESQDIDSDHEDGSHLQKNGKSSELNSSYESKLNT
Subjt:  NQKLEVKKEADLYNPEVTNTTDDLHEVITSEKLASELVKAKKFKDVASAAEEAFQSAAKAINFLKSESQDIDSDHEDGSHLQKNGKSSELNSSYESKLNT

Query:  KEASIPSSKIDYSDDRFSFEKICPVGLESCSSQYEDADMEETNQEELPKEPAIESTSSVLNLCSDINNTKEQSSEPNSTFYNESDGDKIISKDNEVASGK
        KEASIPSSKIDYSDDRFSFEKICPVGLESCSSQYEDADMEETNQEELPKEPAIESTSSVLNLCSDINNTKEQSSEPNSTFYNESDGDKIISKDNEVASGK
Subjt:  KEASIPSSKIDYSDDRFSFEKICPVGLESCSSQYEDADMEETNQEELPKEPAIESTSSVLNLCSDINNTKEQSSEPNSTFYNESDGDKIISKDNEVASGK

Query:  EISEENIKVHEPK
        EISEENIKVHEPK
Subjt:  EISEENIKVHEPK

A0A5D3CIE3 Putative Regulator of Vps4 activity in the MVB pathway protein3.8e-21999.03Show/hide
Query:  MGRKLDALLGRNRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKE
        MGRKLDALLGRNRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKE
Subjt:  MGRKLDALLGRNRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKE

Query:  CPGEVKEAISSLIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPPIIVKEKL
        CPGEVKEAISSLIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPPII+KEKL
Subjt:  CPGEVKEAISSLIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPPIIVKEKL

Query:  NQKLEVKKEADLYNPEVTNTTDDLHEVITSEKLASELVKAKKFKDVASAAEEAFQSAAKAINFLKSESQDIDSDHEDGSHLQKNGKSSELNSSYESKLNT
        NQKLEVKKEADLYNPEVTNTTDDLHEVITSEKLASELVKAKKFKDVASAAEEAFQSAAKAINFLKSESQDIDSDHEDGSHLQKNGKSSELNSSYESKLNT
Subjt:  NQKLEVKKEADLYNPEVTNTTDDLHEVITSEKLASELVKAKKFKDVASAAEEAFQSAAKAINFLKSESQDIDSDHEDGSHLQKNGKSSELNSSYESKLNT

Query:  KEASIPSSKIDYSDDRFSFEKICPVGLESCSSQYEDADMEETNQEELPKEPAIESTSSVLNLCSDINNTKEQSSEPNSTFYNESDGDKIISKDNEVASGK
        KEASIPSSKIDYSDDRF FEKICPVGLESCSSQYEDADMEETNQEELPKEPAIESTSSVLNL SDINNTKEQSSEPNSTFYNESDGDKIISKDNEVASGK
Subjt:  KEASIPSSKIDYSDDRFSFEKICPVGLESCSSQYEDADMEETNQEELPKEPAIESTSSVLNLCSDINNTKEQSSEPNSTFYNESDGDKIISKDNEVASGK

Query:  EISEENIKVHEPK
        EISEENIKVHE K
Subjt:  EISEENIKVHEPK

A0A6J1F3M1 uncharacterized protein LOC1114418671.8e-13668.25Show/hide
Query:  MGRKLDALLGRNRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKE
        MGRKLDALLGRNRYLKPSK K L NMAISRT+ILKNQHRARCSLARADVLQLLNL YQHRAQLRVEIVIKENNM+DALGMIEDYC LL+EKVAL QMNKE
Subjt:  MGRKLDALLGRNRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKE

Query:  CPGEVKEAISSLIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPPIIVKEKL
        CPGEVKEAISSLIFA+SR GEFPELQEIRRIFELKFG EFASSA++LRNNCGVSPKMIQKFSTKQP AETKLKVLKEIASENGI L+LEEEP I++  K+
Subjt:  CPGEVKEAISSLIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPPIIVKEKL

Query:  NQKLEVKKEADLYNPEVTNTTDDLHEVITSEKLASELVKAKKFKDVASAAEEAFQSAA-------KAINFLKSESQDIDSDHEDGSHLQKNGKSSELNSS
        NQK+E KK ADL NPEV NTTD+ HEVI SE++ASE V+ KKFKD ASAA+EAFQSAA        AI   +SESQDI        H+++  ++S+L SS
Subjt:  NQKLEVKKEADLYNPEVTNTTDDLHEVITSEKLASELVKAKKFKDVASAAEEAFQSAA-------KAINFLKSESQDIDSDHEDGSHLQKNGKSSELNSS

Query:  YESKLNTKEASIPSSKIDYSDDRFSFEKICPVGLESCSSQYEDADMEETNQEELPKEPAIESTSSVLNLCSDINNTKEQ--SSEPNSTFYNESDGDKIIS
        +E K+N  E                        LESCSS+YED D  E+NQEE P+EPA E   S  NL SD++N KEQ  SS+PNS  +NE + DK++S
Subjt:  YESKLNTKEASIPSSKIDYSDDRFSFEKICPVGLESCSSQYEDADMEETNQEELPKEPAIESTSSVLNLCSDINNTKEQ--SSEPNSTFYNESDGDKIIS

Query:  KDNEVASGKEISEENIKVHEPK
        K++EV S  +ISEE+I+V E K
Subjt:  KDNEVASGKEISEENIKVHEPK

A0A6J1J838 uncharacterized protein LOC1114821341.3e-13769.43Show/hide
Query:  MGRKLDALLGRNRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKE
        MGRKLDALLGRNRYLKPSK K LANMAISRT+ILKNQHRARCSLARADVLQLLNL YQHRAQLRVEIVIKENNM+DALGMIEDYC LL+EKVAL QMNKE
Subjt:  MGRKLDALLGRNRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKE

Query:  CPGEVKEAISSLIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPPIIVKEKL
        CPGEVKEAISSLIFA+SR GEFPELQEIRRIFELKFG EFA SAIDLRNNCGVSPKMIQKFSTKQP AETKLKVLKEIASENGI L+LEEEP I++  K+
Subjt:  CPGEVKEAISSLIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPPIIVKEKL

Query:  NQKLEVKKEADLYNPEVTNTTDDLHEVITSEKLASELVKAKKFKDVASAAEEAFQSAA-------KAINFLKSESQDIDSDHEDGSHLQKNGKSSELNSS
        NQKLE KK ADL NPEV NTTD+ HEVI SE+ ASE V+AKKFKD ASAA+EAFQSAA        AI   +SESQDI        H+++  ++S+L SS
Subjt:  NQKLEVKKEADLYNPEVTNTTDDLHEVITSEKLASELVKAKKFKDVASAAEEAFQSAA-------KAINFLKSESQDIDSDHEDGSHLQKNGKSSELNSS

Query:  YESKLNTKEASIPSSKIDYSDDRFSFEKICPVGLESCSSQYEDADMEETNQEELPKEPAIESTSSVLNLCSDINNTKEQ--SSEPNSTFYNESDGDKIIS
        +E K+N  E                        LESCSS+YED D  E+NQEELP+EPA E   S  NL SDI+N KE+  SS+PNS  +NE + DK+ S
Subjt:  YESKLNTKEASIPSSKIDYSDDRFSFEKICPVGLESCSSQYEDADMEETNQEELPKEPAIESTSSVLNLCSDINNTKEQ--SSEPNSTFYNESDGDKIIS

Query:  KDNEVASGKEISEENIKVHEPK
        K++EV S  +ISEE+I+V E K
Subjt:  KDNEVASGKEISEENIKVHEPK

SwissProt top hitse value%identityAlignment
P53990 IST1 homolog1.1e-1027.57Show/hide
Query:  KPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKECPGEVKEAISSLIFA
        K  + +    + I+R  +L+ +       AR ++   L  G   RA++RVE +I+E+ +++A+ ++E YC LL+ +  L Q  KE    + E++S+LI+A
Subjt:  KPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKECPGEVKEAISSLIFA

Query:  ASRC-GEFPELQEIRRIFELKFGTEFASSAIDLRNNCG-VSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLN-----LEEEPP
        A R   E  EL+ +      K+  E+    +   N  G V+ +++ K S + P      + L EIA    +P       + E PP
Subjt:  ASRC-GEFPELQEIRRIFELKFGTEFASSAIDLRNNCG-VSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLN-----LEEEPP

Q3ZBV1 IST1 homolog6.7e-1127.42Show/hide
Query:  LKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKECPGEVKEAISSLIF
        +K  + +    + I+R  +L+ +       AR ++   L  G   RA++RVE +I+E+ +++A+ ++E YC LL+ +  L Q  KE    + E++S+LI+
Subjt:  LKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKECPGEVKEAISSLIF

Query:  AASRC-GEFPELQEIRRIFELKFGTEFASSAIDLRNNCG-VSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLN-----LEEEPP
        AA R   E  EL+ +      K+  E+    +   N  G V+ +++ K S + P      + L EIA    +P       + E PP
Subjt:  AASRC-GEFPELQEIRRIFELKFGTEFASSAIDLRNNCG-VSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLN-----LEEEPP

Q54I39 IST1-like protein3.2e-1328.57Show/hide
Query:  KFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKECPGEVKEAISSLIFAASR
        K K    +A+SR  ILKN+        + +V +LL    +  A++RVE +I++  +++   +IE  C LL  ++ L     E P E+KE+I +L++++ R
Subjt:  KFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKECPGEVKEAISSLIFAASR

Query:  CGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPP
          + PEL++I+   + K+G    + A +   +  V+PK++ K S   P      + L EIA +  +     + PP
Subjt:  CGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPP

Q568Z6 IST1 homolog8.7e-1127.91Show/hide
Query:  KPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKECPGEVKEAISSLIFA
        K  + +    + I+R  +L+ +       AR ++   L  G   RA++RVE +I+E+ +++A+ ++E YC LL+ +  L Q  KE    + E++S+LI+A
Subjt:  KPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKECPGEVKEAISSLIFA

Query:  ASRC-GEFPELQEIRRIFELKFGTEFASSAIDLRNNCG-VSPKMIQKFSTKQPTAETKLKVLKEIASENGIP
        A R   E  EL+ +      K+  E+    +   N  G V+ +++ K S + P      + L EIA    +P
Subjt:  ASRC-GEFPELQEIRRIFELKFGTEFASSAIDLRNNCG-VSPKMIQKFSTKQPTAETKLKVLKEIASENGIP

Q9CX00 IST1 homolog8.7e-1127.91Show/hide
Query:  KPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKECPGEVKEAISSLIFA
        K  + +    + I+R  +L+ +       AR ++   L  G   RA++RVE +I+E+ +++A+ ++E YC LL+ +  L Q  KE    + E++S+LI+A
Subjt:  KPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKECPGEVKEAISSLIFA

Query:  ASRC-GEFPELQEIRRIFELKFGTEFASSAIDLRNNCG-VSPKMIQKFSTKQPTAETKLKVLKEIASENGIP
        A R   E  EL+ +      K+  E+    +   N  G V+ +++ K S + P      + L EIA    +P
Subjt:  ASRC-GEFPELQEIRRIFELKFGTEFASSAIDLRNNCG-VSPKMIQKFSTKQPTAETKLKVLKEIASENGIP

Arabidopsis top hitse value%identityAlignment
AT1G13340.1 Regulator of Vps4 activity in the MVB pathway protein1.5e-5045Show/hide
Query:  MGRKLDALLGRNRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKE
        MG+KLDALLGR+   K +KFK+L  +A++R SILKNQ +AR S A +DV +LL LG    A  RV+ V+K+ N LD L  I  Y  L ++++ LF+ N++
Subjt:  MGRKLDALLGRNRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKE

Query:  CPGEVKEAISSLIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPPIIVKEKL
        CP E+ EA+S L+FAASR GEFPELQEIR +   +FG + A+ +I+LR+NCGV PK+IQK ST+ P  E ++K LKEIA+EN I L L++          
Subjt:  CPGEVKEAISSLIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPPIIVKEKL

Query:  NQKLEVKKEADLYNPEVTNTTDDLHEVITSEKLASELVKAKKFKDVASAAEEAFQSAAKA
             ++  +D+   ++T+          S+ +       KK+KDVA AA+ AF+SAA A
Subjt:  NQKLEVKKEADLYNPEVTNTTDDLHEVITSEKLASELVKAKKFKDVASAAEEAFQSAAKA

AT1G25420.1 Regulator of Vps4 activity in the MVB pathway protein4.3e-2938.82Show/hide
Query:  NRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKECPGEVKEAISS
        NR +  +K KT  N+AI+R  +L+N+   +    + ++   L  G +  A++RVE VI+E N+  A  ++E +C  ++ +V + +  KECP E++EAI+S
Subjt:  NRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKECPGEVKEAISS

Query:  LIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASE
        +IFAA RC E P+L +I+ +F  K+G EF   A +LR + GV+  +I+K S   P+   +LK+LKEIA E
Subjt:  LIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASE

AT1G34220.2 Regulator of Vps4 activity in the MVB pathway protein2.1e-3138.01Show/hide
Query:  NRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKECPGEVKEAISS
        N+  K +K KTL  + I R  +++N+  A+    R ++ +LL  G +  A++RVE +I+E  M+ A  ++E +C L+  ++ + +  +ECP ++KEAISS
Subjt:  NRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKECPGEVKEAISS

Query:  LIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASEN
        + FAA RC +  ELQ+++ +F  K+G EF ++A +L+ + GV+ K+++  S + P+ ETKLK+LKEIA E+
Subjt:  LIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASEN

AT2G19710.1 Regulator of Vps4 activity in the MVB pathway protein1.3e-3040.12Show/hide
Query:  RYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKECPGEVKEAISSL
        R  KP+K KT   MA SR  ILKN+   +    R ++ QLL  G    A++RVE V++E   + A  +I  YC LLV ++ + +  K CP ++KEA++S+
Subjt:  RYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKECPGEVKEAISSL

Query:  IFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGI
        +FA+ R  + PEL EI + F  K+G +F++SA++LR + GVS  +++K S K P   TK+K+L  IA E+ +
Subjt:  IFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGI

AT4G35730.1 Regulator of Vps4 activity in the MVB pathway protein7.1e-3235.57Show/hide
Query:  RYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKECPGEVKEAISSL
        R    SK KT A MA++R  +++N+        R D+  LL  G    A++RVE VI+E N+  A  +IE +C L+V ++ +    K+CP ++KE I+SL
Subjt:  RYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKECPGEVKEAISSL

Query:  IFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPPIIVK---EKLNQKLEVKKE
        IFAA RC E PEL ++R IF  K+G +F S+A DLR +CGV+  +I K S + P  E KLK++KEIA E  +  +  E    ++K   E ++   +    
Subjt:  IFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPPIIVK---EKLNQKLEVKKE

Query:  ADL-YNPEVTNTTDDLHEVITSEKLASELVKAKKFKDVASAAEEAFQSAAKAI
        + L  N    N   D  + +   +  S +     + D  SAAE A + A +A+
Subjt:  ADL-YNPEVTNTTDDLHEVITSEKLASELVKAKKFKDVASAAEEAFQSAAKAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGGAAACTGGATGCTTTACTGGGAAGGAACAGATATTTGAAGCCTTCCAAGTTTAAGACCCTTGCTAATATGGCCATTTCTAGGACTTCCATTCTAAAAAACCA
GCACCGCGCCCGTTGCTCCCTTGCCCGTGCCGACGTTCTTCAGCTCTTGAATCTTGGTTACCAACATCGTGCTCAGCTTAGAGTTGAGATTGTGATTAAGGAGAATAACA
TGTTGGATGCTCTTGGAATGATAGAGGATTACTGCTGCCTTCTCGTAGAAAAGGTCGCTCTTTTTCAGATGAACAAGGAATGCCCTGGTGAAGTTAAGGAGGCGATATCC
AGTCTGATCTTTGCAGCTTCAAGATGTGGAGAGTTTCCAGAACTTCAAGAGATTCGTCGGATTTTCGAGTTGAAGTTTGGTACGGAATTTGCAAGCAGTGCCATTGATTT
ACGCAACAACTGTGGTGTTAGTCCTAAGATGATTCAAAAATTTTCAACAAAACAGCCCACTGCCGAAACTAAACTGAAAGTGCTAAAGGAAATTGCTTCAGAAAATGGCA
TTCCTCTGAATTTAGAGGAAGAACCACCAATAATCGTCAAGGAAAAGCTAAACCAAAAGTTGGAAGTTAAGAAAGAAGCAGATTTGTATAATCCTGAAGTCACAAACACC
ACAGATGATCTACATGAAGTTATTACAAGTGAAAAGCTGGCATCGGAATTAGTGAAAGCAAAGAAGTTTAAAGATGTAGCTAGTGCCGCAGAAGAAGCCTTCCAATCAGC
AGCTAAAGCAATCAACTTTTTGAAGTCCGAATCACAGGATATTGATTCAGATCATGAAGATGGCTCTCATCTACAAAAGAATGGAAAAAGCTCAGAGTTGAATAGTTCTT
ATGAATCTAAGCTTAATACAAAAGAAGCAAGCATACCTTCCAGTAAGATTGACTATTCAGACGACAGATTTAGTTTTGAAAAGATTTGTCCCGTTGGGCTTGAAAGTTGC
AGTTCTCAATATGAAGATGCAGACATGGAAGAAACCAATCAAGAAGAATTACCAAAAGAACCTGCCATAGAAAGTACATCCTCTGTTCTAAACTTATGTTCAGATATTAA
TAATACGAAGGAGCAATCATCAGAACCAAATTCAACCTTCTACAACGAGTCAGATGGCGACAAGATAATCAGTAAAGATAATGAAGTTGCCAGTGGAAAAGAAATTTCAG
AAGAGAATATCAAAGTTCACGAACCAAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAGGAAACTGGATGCTTTACTGGGAAGGAACAGATATTTGAAGCCTTCCAAGTTTAAGACCCTTGCTAATATGGCCATTTCTAGGACTTCCATTCTAAAAAACCA
GCACCGCGCCCGTTGCTCCCTTGCCCGTGCCGACGTTCTTCAGCTCTTGAATCTTGGTTACCAACATCGTGCTCAGCTTAGAGTTGAGATTGTGATTAAGGAGAATAACA
TGTTGGATGCTCTTGGAATGATAGAGGATTACTGCTGCCTTCTCGTAGAAAAGGTCGCTCTTTTTCAGATGAACAAGGAATGCCCTGGTGAAGTTAAGGAGGCGATATCC
AGTCTGATCTTTGCAGCTTCAAGATGTGGAGAGTTTCCAGAACTTCAAGAGATTCGTCGGATTTTCGAGTTGAAGTTTGGTACGGAATTTGCAAGCAGTGCCATTGATTT
ACGCAACAACTGTGGTGTTAGTCCTAAGATGATTCAAAAATTTTCAACAAAACAGCCCACTGCCGAAACTAAACTGAAAGTGCTAAAGGAAATTGCTTCAGAAAATGGCA
TTCCTCTGAATTTAGAGGAAGAACCACCAATAATCGTCAAGGAAAAGCTAAACCAAAAGTTGGAAGTTAAGAAAGAAGCAGATTTGTATAATCCTGAAGTCACAAACACC
ACAGATGATCTACATGAAGTTATTACAAGTGAAAAGCTGGCATCGGAATTAGTGAAAGCAAAGAAGTTTAAAGATGTAGCTAGTGCCGCAGAAGAAGCCTTCCAATCAGC
AGCTAAAGCAATCAACTTTTTGAAGTCCGAATCACAGGATATTGATTCAGATCATGAAGATGGCTCTCATCTACAAAAGAATGGAAAAAGCTCAGAGTTGAATAGTTCTT
ATGAATCTAAGCTTAATACAAAAGAAGCAAGCATACCTTCCAGTAAGATTGACTATTCAGACGACAGATTTAGTTTTGAAAAGATTTGTCCCGTTGGGCTTGAAAGTTGC
AGTTCTCAATATGAAGATGCAGACATGGAAGAAACCAATCAAGAAGAATTACCAAAAGAACCTGCCATAGAAAGTACATCCTCTGTTCTAAACTTATGTTCAGATATTAA
TAATACGAAGGAGCAATCATCAGAACCAAATTCAACCTTCTACAACGAGTCAGATGGCGACAAGATAATCAGTAAAGATAATGAAGTTGCCAGTGGAAAAGAAATTTCAG
AAGAGAATATCAAAGTTCACGAACCAAAGTAA
Protein sequenceShow/hide protein sequence
MGRKLDALLGRNRYLKPSKFKTLANMAISRTSILKNQHRARCSLARADVLQLLNLGYQHRAQLRVEIVIKENNMLDALGMIEDYCCLLVEKVALFQMNKECPGEVKEAIS
SLIFAASRCGEFPELQEIRRIFELKFGTEFASSAIDLRNNCGVSPKMIQKFSTKQPTAETKLKVLKEIASENGIPLNLEEEPPIIVKEKLNQKLEVKKEADLYNPEVTNT
TDDLHEVITSEKLASELVKAKKFKDVASAAEEAFQSAAKAINFLKSESQDIDSDHEDGSHLQKNGKSSELNSSYESKLNTKEASIPSSKIDYSDDRFSFEKICPVGLESC
SSQYEDADMEETNQEELPKEPAIESTSSVLNLCSDINNTKEQSSEPNSTFYNESDGDKIISKDNEVASGKEISEENIKVHEPK