; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS013231 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS013231
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionRegulator of Vps4 activity in the MVB pathway protein, putative
Genome locationscaffold459:1400935..1402533
RNA-Seq ExpressionMS013231
SyntenyMS013231
Gene Ontology termsGO:0015031 - protein transport (biological process)
InterPro domainsIPR005061 - Vacuolar protein sorting-associated protein Ist1
IPR042277 - Vacuolar protein sorting-associated protein IST1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011651369.1 uncharacterized protein LOC105434874 [Cucumis sativus]8.0e-13167.84Show/hide
Query:  MGRKLDALLGKNRHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKE
        MGRKLDALLG+NR+LK SKFK LANMAISRT+ILKN H+AR SLA+ADVLQLLNL +QHRA+LRVEIVIKE+NMLDALGMIEDYC LL+E+V LL+M+KE
Subjt:  MGRKLDALLGKNRHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKE

Query:  CPGEVKEAISTLIFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASENGITLHLEEEPEITIKEQL
        CPGEVKEAIS+LIFAASRCGEFPELQEIRRIFELKFG EFASSA+DLRNNCGV  KMIQKFSTKQPS ETKLKVL+EIASENGI LHLE+E  I IK + 
Subjt:  CPGEVKEAISTLIFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASENGITLHLEEEPEITIKEQL

Query:  NQKLEAKTSADLDNPEVINATDNLHEDMF--KNESELVRAKKFNDVASAAQEAFQSAAYAAVAARAAIELSRSESQDVEGD----SHLQRS-MTPDMDSS
        NQKLE K  ADL NPEV N TD+LHE +   K  SELV++KKF DVASA +E FQSAAYAA+AA+AAI LS+S+SQD++ D    SHLQ++    D++SS
Subjt:  NQKLEAKTSADLDNPEVINATDNLHEDMF--KNESELVRAKKFNDVASAAQEAFQSAAYAAVAARAAIELSRSESQDVEGD----SHLQRS-MTPDMDSS

Query:  HEPKLNIEEASTPPSKIDHSDDIFSFEKIHPD--ESSSSECEDEDIAERNQETHLEHAGESPEKPVKES-----KPSDPN---LDSSNPDSISQNELEND
        +EPKLN +E      KID+SDD FSFEKI P   ES SS+ ED D+ E NQE       E P++P KES     K SD N     SS P+SI  NE + D
Subjt:  HEPKLNIEEASTPPSKIDHSDDIFSFEKIHPD--ESSSSECEDEDIAERNQETHLEHAGESPEKPVKES-----KPSDPN---LDSSNPDSISQNELEND

Query:  KLVSKENAFASKTEISDEDIRVRESK
        K+ + +N  AS  EIS+EDIRV ESK
Subjt:  KLVSKENAFASKTEISDEDIRVRESK

XP_022148119.1 uncharacterized protein LOC111016871 [Momordica charantia]2.9e-21399.02Show/hide
Query:  MGRKLDALLGKNRHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKE
        MGRKLDALLGKNRHLKPSKFKALANMAISRTAILKNQHQARSSLA+ADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKE
Subjt:  MGRKLDALLGKNRHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKE

Query:  CPGEVKEAISTLIFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASENGITLHLEEEPEITIKEQL
        CPGEVKEAISTLIFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASENGITLHLEEEPEITIKEQL
Subjt:  CPGEVKEAISTLIFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASENGITLHLEEEPEITIKEQL

Query:  NQKLEAKTSADLDNPEVINATDNLHEDMFKNESELVRAKKFNDVASAAQEAFQSAAYAAVAARAAIELSRSESQDVEGDSHLQRSMTPDMDSSHEPKLNI
        NQKLEAKTSADLDNPEVINAT+NLHE+MFKNESELVRAKKFNDVASAAQEAFQSAAYAAVAARAAIELSRSESQDVE DSHLQRSMTPDMDSSHEPKLNI
Subjt:  NQKLEAKTSADLDNPEVINATDNLHEDMFKNESELVRAKKFNDVASAAQEAFQSAAYAAVAARAAIELSRSESQDVEGDSHLQRSMTPDMDSSHEPKLNI

Query:  EEASTPPSKIDHSDDIFSFEKIHPDESSSSECEDEDIAERNQETHLEHAGESPEKPVKESKPSDPNLDSSNPDSISQNELENDKLVSKENAFASKTEISD
        EEASTPPSKIDHSDDIFSFEKIHPDESSSSECEDEDIAERNQETHLEHAGESPEKPVKESKPSDPNLDSSNPDSISQNELENDKLVSKENAFASKTEISD
Subjt:  EEASTPPSKIDHSDDIFSFEKIHPDESSSSECEDEDIAERNQETHLEHAGESPEKPVKESKPSDPNLDSSNPDSISQNELENDKLVSKENAFASKTEISD

Query:  EDIRVRESK
        EDIRVRESK
Subjt:  EDIRVRESK

XP_022983568.1 uncharacterized protein LOC111482134 [Cucurbita maxima]1.6e-13168.48Show/hide
Query:  MGRKLDALLGKNRHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKE
        MGRKLDALLG+NR+LKPSK K LANMAISRTAILKNQH+AR SLA+ADVLQLLNL +QHRA+LRVEIVIKE+NM+DALGMIEDYC LL+E+V LL+M+KE
Subjt:  MGRKLDALLGKNRHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKE

Query:  CPGEVKEAISTLIFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASENGITLHLEEEPEITIKEQL
        CPGEVKEAIS+LIFA+SR GEFPELQEIRRIFELKFG+EFA SA+DLRNNCGV  KMIQKFSTKQP+ ETKLKVL+EIASENGITLHLEEEP I IK  +
Subjt:  CPGEVKEAISTLIFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASENGITLHLEEEPEITIKEQL

Query:  NQKLEAKTSADLDNPEVINATDNLHEDMFKNE--SELVRAKKFNDVASAAQEAFQSAAYAAVAARAAIELSRSESQDVEGDSHLQRSMTPDMDSSHEPKL
        NQKLE K SADLDNPEVIN TDN HE +   E  SE VRAKKF D ASAAQEAFQSAAYAA+AA+AAIELSRSESQD+    H++     D++SSHEPK+
Subjt:  NQKLEAKTSADLDNPEVINATDNLHEDMFKNE--SELVRAKKFNDVASAAQEAFQSAAYAAVAARAAIELSRSESQDVEGDSHLQRSMTPDMDSSHEPKL

Query:  NIEEASTPPSKIDHSDDIFSFEKIHPDESSSSECEDEDIAERNQETHLEHAGESPEKPVKESKPSDPNLD-----------SSNPDSISQNELENDKLVS
        N +E                       ES SSE ED+D AE NQE       E PE+P KE   SDPNLD           SS P+SIS NE E+DKL S
Subjt:  NIEEASTPPSKIDHSDDIFSFEKIHPDESSSSECEDEDIAERNQETHLEHAGESPEKPVKESKPSDPNLD-----------SSNPDSISQNELENDKLVS

Query:  KENAFASKTEISDEDIRVRESK
        KE+   SK +IS+EDIRV ESK
Subjt:  KENAFASKTEISDEDIRVRESK

XP_038875570.1 uncharacterized protein LOC120067980 isoform X1 [Benincasa hispida]6.8e-13872.68Show/hide
Query:  MGRKLDALLGKNRHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKE
        MGRKLDALLG+NR+LKPSKFK LANMAISRT+ILKNQH+ARSSLA+ADVLQLLNL +QHRA+LRVEIVIKE+NMLDAL MIEDYC LL+E+V LL+M+KE
Subjt:  MGRKLDALLGKNRHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKE

Query:  CPGEVKEAISTLIFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASENGITLHLEEEPEITIK-EQ
        CPGEVKEAIS+LIFAASRCGEFPELQEIRRI ELKFG EFASSAV+LRNNCGV  KMIQKFSTKQPS ETKLKVL+EIASENGIT HLEEEP I IK E+
Subjt:  CPGEVKEAISTLIFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASENGITLHLEEEPEITIK-EQ

Query:  LNQKLEAKTSADLDNPEVINATDNLHEDMFKNES--ELVRAKKFNDVASAAQEAFQSAAYAAVAARAAIELSRSESQDVEGD----SHLQR-SMTPDMDS
        LNQKLEAK +ADLDNPEVIN TDNLHE +  +ES  E V+AKKF DVASA QEAFQSAAYAA+AA+AAI LS+SE+QD++ D    SHLQ    + D+DS
Subjt:  LNQKLEAKTSADLDNPEVINATDNLHEDMFKNES--ELVRAKKFNDVASAAQEAFQSAAYAAVAARAAIELSRSESQDVEGD----SHLQR-SMTPDMDS

Query:  SHEPKLNIEEASTPPSKIDHSDDIFSFEKIHPD--ESSSSECEDEDIAERNQ-ETHLEHAGESPEKP-VKESKPSDPNLDSSNPDSISQNELENDKLVSK
        SHEPKLN +EAS P SKID+SDD FSFEKI P   ES SSE ED D+AE NQ E   E A E      + +S  S+    SS  +SI  NE E DK++ K
Subjt:  SHEPKLNIEEASTPPSKIDHSDDIFSFEKIHPD--ESSSSECEDEDIAERNQ-ETHLEHAGESPEKP-VKESKPSDPNLDSSNPDSISQNELENDKLVSK

Query:  ENAFASKTEISDEDIRVRESK
        EN  ASK EIS EDIRV ESK
Subjt:  ENAFASKTEISDEDIRVRESK

XP_038875575.1 uncharacterized protein LOC120067980 isoform X2 [Benincasa hispida]2.8e-13972.86Show/hide
Query:  MGRKLDALLGKNRHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKE
        MGRKLDALLG+NR+LKPSKFK LANMAISRT+ILKNQH+ARSSLA+ADVLQLLNL +QHRA+LRVEIVIKE+NMLDAL MIEDYC LL+E+V LL+M+KE
Subjt:  MGRKLDALLGKNRHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKE

Query:  CPGEVKEAISTLIFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASENGITLHLEEEPEITIKEQL
        CPGEVKEAIS+LIFAASRCGEFPELQEIRRI ELKFG EFASSAV+LRNNCGV  KMIQKFSTKQPS ETKLKVL+EIASENGIT HLEEEP I IKE+L
Subjt:  CPGEVKEAISTLIFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASENGITLHLEEEPEITIKEQL

Query:  NQKLEAKTSADLDNPEVINATDNLHEDMFKNES--ELVRAKKFNDVASAAQEAFQSAAYAAVAARAAIELSRSESQDVEGD----SHLQR-SMTPDMDSS
        NQKLEAK +ADLDNPEVIN TDNLHE +  +ES  E V+AKKF DVASA QEAFQSAAYAA+AA+AAI LS+SE+QD++ D    SHLQ    + D+DSS
Subjt:  NQKLEAKTSADLDNPEVINATDNLHEDMFKNES--ELVRAKKFNDVASAAQEAFQSAAYAAVAARAAIELSRSESQDVEGD----SHLQR-SMTPDMDSS

Query:  HEPKLNIEEASTPPSKIDHSDDIFSFEKIHPD--ESSSSECEDEDIAERNQ-ETHLEHAGESPEKP-VKESKPSDPNLDSSNPDSISQNELENDKLVSKE
        HEPKLN +EAS P SKID+SDD FSFEKI P   ES SSE ED D+AE NQ E   E A E      + +S  S+    SS  +SI  NE E DK++ KE
Subjt:  HEPKLNIEEASTPPSKIDHSDDIFSFEKIHPD--ESSSSECEDEDIAERNQ-ETHLEHAGESPEKP-VKESKPSDPNLDSSNPDSISQNELENDKLVSKE

Query:  NAFASKTEISDEDIRVRESK
        N  ASK EIS EDIRV ESK
Subjt:  NAFASKTEISDEDIRVRESK

TrEMBL top hitse value%identityAlignment
A0A0A0L6Y2 Uncharacterized protein3.9e-13167.84Show/hide
Query:  MGRKLDALLGKNRHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKE
        MGRKLDALLG+NR+LK SKFK LANMAISRT+ILKN H+AR SLA+ADVLQLLNL +QHRA+LRVEIVIKE+NMLDALGMIEDYC LL+E+V LL+M+KE
Subjt:  MGRKLDALLGKNRHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKE

Query:  CPGEVKEAISTLIFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASENGITLHLEEEPEITIKEQL
        CPGEVKEAIS+LIFAASRCGEFPELQEIRRIFELKFG EFASSA+DLRNNCGV  KMIQKFSTKQPS ETKLKVL+EIASENGI LHLE+E  I IK + 
Subjt:  CPGEVKEAISTLIFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASENGITLHLEEEPEITIKEQL

Query:  NQKLEAKTSADLDNPEVINATDNLHEDMF--KNESELVRAKKFNDVASAAQEAFQSAAYAAVAARAAIELSRSESQDVEGD----SHLQRS-MTPDMDSS
        NQKLE K  ADL NPEV N TD+LHE +   K  SELV++KKF DVASA +E FQSAAYAA+AA+AAI LS+S+SQD++ D    SHLQ++    D++SS
Subjt:  NQKLEAKTSADLDNPEVINATDNLHEDMF--KNESELVRAKKFNDVASAAQEAFQSAAYAAVAARAAIELSRSESQDVEGD----SHLQRS-MTPDMDSS

Query:  HEPKLNIEEASTPPSKIDHSDDIFSFEKIHPD--ESSSSECEDEDIAERNQETHLEHAGESPEKPVKES-----KPSDPN---LDSSNPDSISQNELEND
        +EPKLN +E      KID+SDD FSFEKI P   ES SS+ ED D+ E NQE       E P++P KES     K SD N     SS P+SI  NE + D
Subjt:  HEPKLNIEEASTPPSKIDHSDDIFSFEKIHPD--ESSSSECEDEDIAERNQETHLEHAGESPEKPVKES-----KPSDPN---LDSSNPDSISQNELEND

Query:  KLVSKENAFASKTEISDEDIRVRESK
        K+ + +N  AS  EIS+EDIRV ESK
Subjt:  KLVSKENAFASKTEISDEDIRVRESK

A0A5D3CIE3 Putative Regulator of Vps4 activity in the MVB pathway protein7.3e-13067.21Show/hide
Query:  MGRKLDALLGKNRHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKE
        MGRKLDALLG+NR+LKPSKFK LANMAISRT+ILKNQH+AR SLA+ADVLQLLNLG+QHRA+LRVEIVIKE+NMLDALGMIEDYC LL+E+V L +M+KE
Subjt:  MGRKLDALLGKNRHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKE

Query:  CPGEVKEAISTLIFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASENGITLHLEEEPEITIKEQL
        CPGEVKEAIS+LIFAASRCGEFPELQEIRRIFELKFG EFASSA+DLRNNCGV  KMIQKFSTKQP+ ETKLKVL+EIASENGI L+LEEEP I IKE+L
Subjt:  CPGEVKEAISTLIFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASENGITLHLEEEPEITIKEQL

Query:  NQKLEAKTSADLDNPEVINATDNLHEDMF--KNESELVRAKKFNDVASAAQEAFQSAAYAAVAARAAIELSRSESQDVEGD----SHLQRS-MTPDMDSS
        NQKLE K  ADL NPEV N TD+LHE +   K  SELV+AKKF DVASAA+EAFQSAA        AI   +SESQD++ D    SHLQ++  + +++SS
Subjt:  NQKLEAKTSADLDNPEVINATDNLHEDMF--KNESELVRAKKFNDVASAAQEAFQSAAYAAVAARAAIELSRSESQDVEGD----SHLQRS-MTPDMDSS

Query:  HEPKLNIEEASTPPSKIDHSDDIFSFEKIHPD--ESSSSECEDEDIAERNQETHLEHAGESPEKPVKESKPSDPNL---------DSSNPDSISQNELEN
        +E KLN +EAS P SKID+SDD F FEKI P   ES SS+ ED D+ E NQE       E P++P  ES  S  NL          SS P+S   NE + 
Subjt:  HEPKLNIEEASTPPSKIDHSDDIFSFEKIHPD--ESSSSECEDEDIAERNQETHLEHAGESPEKPVKESKPSDPNL---------DSSNPDSISQNELEN

Query:  DKLVSKENAFASKTEISDEDIRVRESK
        DK++SK+N  AS  EIS+E+I+V ESK
Subjt:  DKLVSKENAFASKTEISDEDIRVRESK

A0A6J1D319 uncharacterized protein LOC1110168711.4e-21399.02Show/hide
Query:  MGRKLDALLGKNRHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKE
        MGRKLDALLGKNRHLKPSKFKALANMAISRTAILKNQHQARSSLA+ADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKE
Subjt:  MGRKLDALLGKNRHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKE

Query:  CPGEVKEAISTLIFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASENGITLHLEEEPEITIKEQL
        CPGEVKEAISTLIFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASENGITLHLEEEPEITIKEQL
Subjt:  CPGEVKEAISTLIFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASENGITLHLEEEPEITIKEQL

Query:  NQKLEAKTSADLDNPEVINATDNLHEDMFKNESELVRAKKFNDVASAAQEAFQSAAYAAVAARAAIELSRSESQDVEGDSHLQRSMTPDMDSSHEPKLNI
        NQKLEAKTSADLDNPEVINAT+NLHE+MFKNESELVRAKKFNDVASAAQEAFQSAAYAAVAARAAIELSRSESQDVE DSHLQRSMTPDMDSSHEPKLNI
Subjt:  NQKLEAKTSADLDNPEVINATDNLHEDMFKNESELVRAKKFNDVASAAQEAFQSAAYAAVAARAAIELSRSESQDVEGDSHLQRSMTPDMDSSHEPKLNI

Query:  EEASTPPSKIDHSDDIFSFEKIHPDESSSSECEDEDIAERNQETHLEHAGESPEKPVKESKPSDPNLDSSNPDSISQNELENDKLVSKENAFASKTEISD
        EEASTPPSKIDHSDDIFSFEKIHPDESSSSECEDEDIAERNQETHLEHAGESPEKPVKESKPSDPNLDSSNPDSISQNELENDKLVSKENAFASKTEISD
Subjt:  EEASTPPSKIDHSDDIFSFEKIHPDESSSSECEDEDIAERNQETHLEHAGESPEKPVKESKPSDPNLDSSNPDSISQNELENDKLVSKENAFASKTEISD

Query:  EDIRVRESK
        EDIRVRESK
Subjt:  EDIRVRESK

A0A6J1F3M1 uncharacterized protein LOC1114418677.3e-13067.54Show/hide
Query:  MGRKLDALLGKNRHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKE
        MGRKLDALLG+NR+LKPSK K L NMAISRTAILKNQH+AR SLA+ADVLQLLNL +QHRA+LRVEIVIKE+NM+DALGMIEDYC LL+E+V LL+M+KE
Subjt:  MGRKLDALLGKNRHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKE

Query:  CPGEVKEAISTLIFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASENGITLHLEEEPEITIKEQL
        CPGEVKEAIS+LIFA+SR GEFPELQEIRRIFELKFG+EFASSAV+LRNNCGV  KMIQKFSTKQP+ ETKLKVL+EIASENGITLHLEEEP I IK  +
Subjt:  CPGEVKEAISTLIFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASENGITLHLEEEPEITIKEQL

Query:  NQKLEAKTSADLDNPEVINATDNLHEDMFKNE--SELVRAKKFNDVASAAQEAFQSAAYAAVAARAAIELSRSESQDVEGDSHLQRSMTPDMDSSHEPKL
        NQK+E K SADLDNPEV+N TDN HE +   E  SE VR KKF D ASAA+EAFQSAAYAA+AA+AAIELSRSESQD+    H++     D++SSHEPK+
Subjt:  NQKLEAKTSADLDNPEVINATDNLHEDMFKNE--SELVRAKKFNDVASAAQEAFQSAAYAAVAARAAIELSRSESQDVEGDSHLQRSMTPDMDSSHEPKL

Query:  NIEEASTPPSKIDHSDDIFSFEKIHPDESSSSECEDEDIAERNQETHLEHAGESPEKPVKESKPSDPNLD-----------SSNPDSISQNELENDKLVS
        N +E                       ES SSE ED+D AE NQE       E PE+P KE   SDPNLD           SS P+SIS NE E+DKL+S
Subjt:  NIEEASTPPSKIDHSDDIFSFEKIHPDESSSSECEDEDIAERNQETHLEHAGESPEKPVKESKPSDPNLD-----------SSNPDSISQNELENDKLVS

Query:  KENAFASKTEISDEDIRVRESK
        KE+   SK +IS+EDIRV ESK
Subjt:  KENAFASKTEISDEDIRVRESK

A0A6J1J838 uncharacterized protein LOC1114821347.8e-13268.48Show/hide
Query:  MGRKLDALLGKNRHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKE
        MGRKLDALLG+NR+LKPSK K LANMAISRTAILKNQH+AR SLA+ADVLQLLNL +QHRA+LRVEIVIKE+NM+DALGMIEDYC LL+E+V LL+M+KE
Subjt:  MGRKLDALLGKNRHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKE

Query:  CPGEVKEAISTLIFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASENGITLHLEEEPEITIKEQL
        CPGEVKEAIS+LIFA+SR GEFPELQEIRRIFELKFG+EFA SA+DLRNNCGV  KMIQKFSTKQP+ ETKLKVL+EIASENGITLHLEEEP I IK  +
Subjt:  CPGEVKEAISTLIFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASENGITLHLEEEPEITIKEQL

Query:  NQKLEAKTSADLDNPEVINATDNLHEDMFKNE--SELVRAKKFNDVASAAQEAFQSAAYAAVAARAAIELSRSESQDVEGDSHLQRSMTPDMDSSHEPKL
        NQKLE K SADLDNPEVIN TDN HE +   E  SE VRAKKF D ASAAQEAFQSAAYAA+AA+AAIELSRSESQD+    H++     D++SSHEPK+
Subjt:  NQKLEAKTSADLDNPEVINATDNLHEDMFKNE--SELVRAKKFNDVASAAQEAFQSAAYAAVAARAAIELSRSESQDVEGDSHLQRSMTPDMDSSHEPKL

Query:  NIEEASTPPSKIDHSDDIFSFEKIHPDESSSSECEDEDIAERNQETHLEHAGESPEKPVKESKPSDPNLD-----------SSNPDSISQNELENDKLVS
        N +E                       ES SSE ED+D AE NQE       E PE+P KE   SDPNLD           SS P+SIS NE E+DKL S
Subjt:  NIEEASTPPSKIDHSDDIFSFEKIHPDESSSSECEDEDIAERNQETHLEHAGESPEKPVKESKPSDPNLD-----------SSNPDSISQNELENDKLVS

Query:  KENAFASKTEISDEDIRVRESK
        KE+   SK +IS+EDIRV ESK
Subjt:  KENAFASKTEISDEDIRVRESK

SwissProt top hitse value%identityAlignment
P53990 IST1 homolog2.1e-0927.49Show/hide
Query:  KPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKECPGEVKEAISTLIFA
        K  + +    + I+R  +L+ +    +  A+ ++   L  G   RA++RVE +I+E  +++A+ ++E YC LL+ R  L++  KE    + E++STLI+A
Subjt:  KPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKECPGEVKEAISTLIFA

Query:  ASRC-GEFPELQEIRRIFELKFGREFASSAVDLRNNCG-VRLKMIQKFSTKQPSIETKLKVLEEIASENGI
        A R   E  EL+ +      K+ +E+    +   N  G V  +++ K S + P      + L EIA    +
Subjt:  ASRC-GEFPELQEIRRIFELKFGREFASSAVDLRNNCG-VRLKMIQKFSTKQPSIETKLKVLEEIASENGI

Q3ZBV1 IST1 homolog1.2e-0927.33Show/hide
Query:  LKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKECPGEVKEAISTLIF
        +K  + +    + I+R  +L+ +    +  A+ ++   L  G   RA++RVE +I+E  +++A+ ++E YC LL+ R  L++  KE    + E++STLI+
Subjt:  LKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKECPGEVKEAISTLIF

Query:  AASRC-GEFPELQEIRRIFELKFGREFASSAVDLRNNCG-VRLKMIQKFSTKQPSIETKLKVLEEIASENGI
        AA R   E  EL+ +      K+ +E+    +   N  G V  +++ K S + P      + L EIA    +
Subjt:  AASRC-GEFPELQEIRRIFELKFGREFASSAVDLRNNCG-VRLKMIQKFSTKQPSIETKLKVLEEIASENGI

Q54I39 IST1-like protein1.4e-1330.06Show/hide
Query:  KFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKECPGEVKEAISTLIFAASR
        K K    +A+SR  ILKN+        + +V +LL   ++  A++RVE +I++  +++   +IE  C LL  R++L+  + E P E+KE+I TL++++ R
Subjt:  KFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKECPGEVKEAISTLIFAASR

Query:  CGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASE
          + PEL++I+   + K+G+   + A +   +  V  K++ K S   P      + L EIA +
Subjt:  CGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASE

Q568Z6 IST1 homolog2.1e-0927.49Show/hide
Query:  KPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKECPGEVKEAISTLIFA
        K  + +    + I+R  +L+ +    +  A+ ++   L  G   RA++RVE +I+E  +++A+ ++E YC LL+ R  L++  KE    + E++STLI+A
Subjt:  KPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKECPGEVKEAISTLIFA

Query:  ASRC-GEFPELQEIRRIFELKFGREFASSAVDLRNNCG-VRLKMIQKFSTKQPSIETKLKVLEEIASENGI
        A R   E  EL+ +      K+ +E+    +   N  G V  +++ K S + P      + L EIA    +
Subjt:  ASRC-GEFPELQEIRRIFELKFGREFASSAVDLRNNCG-VRLKMIQKFSTKQPSIETKLKVLEEIASENGI

Q9CX00 IST1 homolog2.1e-0927.49Show/hide
Query:  KPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKECPGEVKEAISTLIFA
        K  + +    + I+R  +L+ +    +  A+ ++   L  G   RA++RVE +I+E  +++A+ ++E YC LL+ R  L++  KE    + E++STLI+A
Subjt:  KPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKECPGEVKEAISTLIFA

Query:  ASRC-GEFPELQEIRRIFELKFGREFASSAVDLRNNCG-VRLKMIQKFSTKQPSIETKLKVLEEIASENGI
        A R   E  EL+ +      K+ +E+    +   N  G V  +++ K S + P      + L EIA    +
Subjt:  ASRC-GEFPELQEIRRIFELKFGREFASSAVDLRNNCG-VRLKMIQKFSTKQPSIETKLKVLEEIASENGI

Arabidopsis top hitse value%identityAlignment
AT1G13340.1 Regulator of Vps4 activity in the MVB pathway protein2.6e-4736.57Show/hide
Query:  MGRKLDALLGKNRHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKE
        MG+KLDALLG  R  K +KFK+L  +A++R +ILKNQ QAR S A +DV +LL LG    A  RV+ V+K+ N LD L  I  Y  L ++R+ L   +++
Subjt:  MGRKLDALLGKNRHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKE

Query:  CPGEVKEAISTLIFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASENGITLHLEEEPEITIKEQL
        CP E+ EA+S L+FAASR GEFPELQEIR +   +FG++ A+ +++LR+NCGV  K+IQK ST+ P  E ++K L+EIA+EN I L L++    T     
Subjt:  CPGEVKEAISTLIFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASENGITLHLEEEPEITIKEQL

Query:  NQKLEAKTSADLDNPEVINATDNLHEDMFKNESELVRAKKFNDVASAAQEAFQSAAYAAVAARAAIELSRSESQDVEGDSHLQRSMTPDMDSSHEPKLNI
              + ++D+   + + + D   E    ++S     KK+ DVA AAQ AF+SAA+AA AA+AA+ELS+   +  +   ++            E   + 
Subjt:  NQKLEAKTSADLDNPEVINATDNLHEDMFKNESELVRAKKFNDVASAAQEAFQSAAYAAVAARAAIELSRSESQDVEGDSHLQRSMTPDMDSSHEPKLNI

Query:  EEASTPPSKIDHSDDIFSFEKIHPDESSSSECEDEDIAERNQETHLEHAGESPEKPVKESKPSDPNLDSSNPDSISQNELENDKLVSKENA
         E      + + +DD    E     ES  S  + EDI +    +  E   +  EK        +    S    +IS+++ E +++V    A
Subjt:  EEASTPPSKIDHSDDIFSFEKIHPDESSSSECEDEDIAERNQETHLEHAGESPEKPVKESKPSDPNLDSSNPDSISQNELENDKLVSKENA

AT1G25420.1 Regulator of Vps4 activity in the MVB pathway protein9.4e-2938.82Show/hide
Query:  NRHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKECPGEVKEAIST
        NR +  +K K   N+AI+R  +L+N+   +    + ++   L  G +  A++RVE VI+E N+  A  ++E +C  ++ RV +L   KECP E++EAI++
Subjt:  NRHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKECPGEVKEAIST

Query:  LIFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASE
        +IFAA RC E P+L +I+ +F  K+G+EF   A +LR + GV   +I+K S   PS   +LK+L+EIA E
Subjt:  LIFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASE

AT1G34220.2 Regulator of Vps4 activity in the MVB pathway protein2.2e-3036.84Show/hide
Query:  NRHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKECPGEVKEAIST
        N+  K +K K L  + I R  +++N+ +A+    + ++ +LL  G +  A++RVE +I+E  M+ A  ++E +C L+  R+ ++   +ECP ++KEAIS+
Subjt:  NRHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKECPGEVKEAIST

Query:  LIFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASEN
        + FAA RC +  ELQ+++ +F  K+G+EF ++A +L+ + GV  K+++  S + PS ETKLK+L+EIA E+
Subjt:  LIFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASEN

AT2G19710.1 Regulator of Vps4 activity in the MVB pathway protein2.7e-2834.91Show/hide
Query:  RHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKECPGEVKEAISTL
        R  KP+K K    MA SR  ILKN+ + +    + ++ QLL  G    A++RVE V++E   + A  +I  YC LL+ R+ ++   K CP ++KEA++++
Subjt:  RHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKECPGEVKEAISTL

Query:  IFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASENGITLH----LEEEPEITIKEQLNQKLEAKT
        +FA+ R  + PEL EI + F  K+G++F++SAV+LR + GV   +++K S K P   TK+K+L  IA E+ +       +E +P+ T  E LN     + 
Subjt:  IFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASENGITLH----LEEEPEITIKEQLNQKLEAKT

Query:  SADLDNPEVINA
        ++ ++    IN+
Subjt:  SADLDNPEVINA

AT4G35730.1 Regulator of Vps4 activity in the MVB pathway protein3.5e-3132.88Show/hide
Query:  RHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKECPGEVKEAISTL
        R    SK K  A MA++R  +++N+        + D+  LL  G    A++RVE VI+E N+  A  +IE +C L++ R+ ++   K+CP ++KE I++L
Subjt:  RHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKECPGEVKEAISTL

Query:  IFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASENGITLHLEEEPEITIKEQLN-----QKLEAK
        IFAA RC E PEL ++R IF  K+G++F S+A DLR +CGV   +I K S + P  E KLK+++EIA E  +     E  +  +K Q       +K  + 
Subjt:  IFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASENGITLHLEEEPEITIKEQLN-----QKLEAK

Query:  TSADLDNPEVINATDNLHEDMFKNESELVRAKKFNDVASAAQEAFQSAAYAAVAARAAIEL-----SRSESQDVEGDSHLQRSMTPDMDSSH
        +S  + N   IN   +  + + ++ S +     ++D  SAA+ A + A  A  AA+ A  L     S ++   V  D    +  +  MD  H
Subjt:  TSADLDNPEVINATDNLHEDMFKNESELVRAKKFNDVASAAQEAFQSAAYAAVAARAAIEL-----SRSESQDVEGDSHLQRSMTPDMDSSH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGGAAACTGGATGCTCTACTGGGAAAGAACAGGCATTTGAAGCCTTCCAAGTTCAAGGCTCTCGCTAATATGGCCATTTCCAGAACTGCCATCCTCAAGAACCA
GCACCAAGCCCGCAGCTCCCTCGCCCAGGCCGACGTTCTCCAGCTCTTGAACCTCGGTCACCAACATCGCGCTAAGCTTAGAGTTGAGATTGTGATCAAGGAGAGTAACA
TGTTGGATGCACTTGGGATGATAGAGGATTACTGCTATCTCCTCATGGAAAGAGTGGATCTTCTTCGGATGAGCAAGGAGTGCCCGGGTGAAGTTAAGGAGGCCATATCG
ACTCTGATCTTTGCAGCTTCTAGGTGCGGAGAGTTTCCAGAACTTCAGGAGATTCGCCGGATTTTCGAGTTGAAATTTGGGAGAGAGTTCGCCAGCAGTGCTGTGGATTT
ACGCAACAACTGCGGAGTTCGTCTCAAGATGATTCAAAAGTTCTCAACAAAACAGCCCAGCATTGAAACTAAACTGAAAGTGCTCGAGGAAATTGCTTCTGAGAATGGCA
TTACTCTGCATTTAGAGGAAGAACCTGAAATAACCATCAAGGAACAGCTAAATCAGAAACTGGAAGCCAAGACATCAGCAGACTTGGATAATCCTGAAGTCATAAACGCC
ACCGATAACCTACATGAAGATATGTTCAAGAACGAGTCAGAATTAGTGAGAGCAAAGAAGTTCAACGATGTAGCTAGTGCGGCTCAAGAGGCCTTCCAATCAGCAGCTTA
TGCAGCAGTAGCAGCTAGAGCTGCAATCGAGCTTTCAAGGTCTGAATCTCAGGATGTAGAAGGTGACTCTCATCTTCAAAGAAGCATGACCCCAGATATGGATAGTTCCC
ATGAACCTAAACTTAATATAGAAGAAGCTAGCACGCCTCCCAGTAAGATTGACCATTCAGATGACATATTTAGCTTTGAAAAAATTCATCCTGATGAAAGTTCGAGCTCA
GAATGTGAAGATGAAGACATAGCAGAAAGAAATCAAGAAACTCATCTCGAACATGCAGGCGAATCGCCAGAAAAACCTGTCAAAGAAAGCAAGCCATCTGATCCAAATTT
GGATTCATCAAATCCAGATTCAATCTCCCAGAATGAGCTAGAGAATGACAAGTTGGTCAGTAAAGAAAATGCATTTGCCAGTAAAACTGAAATTTCAGATGAGGATATTA
GAGTTCGTGAATCGAAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAAGGAAACTGGATGCTCTACTGGGAAAGAACAGGCATTTGAAGCCTTCCAAGTTCAAGGCTCTCGCTAATATGGCCATTTCCAGAACTGCCATCCTCAAGAACCA
GCACCAAGCCCGCAGCTCCCTCGCCCAGGCCGACGTTCTCCAGCTCTTGAACCTCGGTCACCAACATCGCGCTAAGCTTAGAGTTGAGATTGTGATCAAGGAGAGTAACA
TGTTGGATGCACTTGGGATGATAGAGGATTACTGCTATCTCCTCATGGAAAGAGTGGATCTTCTTCGGATGAGCAAGGAGTGCCCGGGTGAAGTTAAGGAGGCCATATCG
ACTCTGATCTTTGCAGCTTCTAGGTGCGGAGAGTTTCCAGAACTTCAGGAGATTCGCCGGATTTTCGAGTTGAAATTTGGGAGAGAGTTCGCCAGCAGTGCTGTGGATTT
ACGCAACAACTGCGGAGTTCGTCTCAAGATGATTCAAAAGTTCTCAACAAAACAGCCCAGCATTGAAACTAAACTGAAAGTGCTCGAGGAAATTGCTTCTGAGAATGGCA
TTACTCTGCATTTAGAGGAAGAACCTGAAATAACCATCAAGGAACAGCTAAATCAGAAACTGGAAGCCAAGACATCAGCAGACTTGGATAATCCTGAAGTCATAAACGCC
ACCGATAACCTACATGAAGATATGTTCAAGAACGAGTCAGAATTAGTGAGAGCAAAGAAGTTCAACGATGTAGCTAGTGCGGCTCAAGAGGCCTTCCAATCAGCAGCTTA
TGCAGCAGTAGCAGCTAGAGCTGCAATCGAGCTTTCAAGGTCTGAATCTCAGGATGTAGAAGGTGACTCTCATCTTCAAAGAAGCATGACCCCAGATATGGATAGTTCCC
ATGAACCTAAACTTAATATAGAAGAAGCTAGCACGCCTCCCAGTAAGATTGACCATTCAGATGACATATTTAGCTTTGAAAAAATTCATCCTGATGAAAGTTCGAGCTCA
GAATGTGAAGATGAAGACATAGCAGAAAGAAATCAAGAAACTCATCTCGAACATGCAGGCGAATCGCCAGAAAAACCTGTCAAAGAAAGCAAGCCATCTGATCCAAATTT
GGATTCATCAAATCCAGATTCAATCTCCCAGAATGAGCTAGAGAATGACAAGTTGGTCAGTAAAGAAAATGCATTTGCCAGTAAAACTGAAATTTCAGATGAGGATATTA
GAGTTCGTGAATCGAAG
Protein sequenceShow/hide protein sequence
MGRKLDALLGKNRHLKPSKFKALANMAISRTAILKNQHQARSSLAQADVLQLLNLGHQHRAKLRVEIVIKESNMLDALGMIEDYCYLLMERVDLLRMSKECPGEVKEAIS
TLIFAASRCGEFPELQEIRRIFELKFGREFASSAVDLRNNCGVRLKMIQKFSTKQPSIETKLKVLEEIASENGITLHLEEEPEITIKEQLNQKLEAKTSADLDNPEVINA
TDNLHEDMFKNESELVRAKKFNDVASAAQEAFQSAAYAAVAARAAIELSRSESQDVEGDSHLQRSMTPDMDSSHEPKLNIEEASTPPSKIDHSDDIFSFEKIHPDESSSS
ECEDEDIAERNQETHLEHAGESPEKPVKESKPSDPNLDSSNPDSISQNELENDKLVSKENAFASKTEISDEDIRVRESK