; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G04530 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G04530
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRegulator of Vps4 activity in the MVB pathway protein
Genome locationChr7:3382225..3384654
RNA-Seq ExpressionCSPI07G04530
SyntenyCSPI07G04530
Gene Ontology termsGO:0015031 - protein transport (biological process)
InterPro domainsIPR005061 - Vacuolar protein sorting-associated protein Ist1
IPR042277 - Vacuolar protein sorting-associated protein IST1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031582.1 Ist1 domain-containing protein [Cucumis melo var. makuwa]4.7e-24084.84Show/hide
Query:  KKLIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISSLIFASARFG
        KKLIKTALCRLKLL+KKRYSIVKLLREDLCELINNGYQQIAFKRVEQL+QDE LME YDLIEN CE ILVKFSH+RKHKTCPDD+ EAISSLIFASARFG
Subjt:  KKLIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISSLIFASARFG

Query:  DFPELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYPAVLALENCPDWHKKQVLQNGDQTTVKGKVESEELE
        DFPELKCVR LFEERF KSF VAAVELSPGNLVNKQIKEKLVM+ VSNHEKQRLINEIARDCF PAVLALE  PD  +KQVL N DQTTV+GKVESEELE
Subjt:  DFPELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYPAVLALENCPDWHKKQVLQNGDQTTVKGKVESEELE

Query:  RVVISLDSTRDCNCGVLDSWCHSTSSSPSVSQSFPNDASTSEFLPFCEEAMVYFDDVVELSDPSTEYGDLLDQRFFKFKSLITSTREENVGDGDDQSLIE
        RVVISLDS RD NCGVLD WCHSTSSS SV QSFPNDAS+SEFLPFCEEA+VYFDDVVELSDPSTE+GDLLDQRFFKFK  ITSTREENV D DDQSLIE
Subjt:  RVVISLDSTRDCNCGVLDSWCHSTSSSPSVSQSFPNDASTSEFLPFCEEAMVYFDDVVELSDPSTEYGDLLDQRFFKFKSLITSTREENVGDGDDQSLIE

Query:  NHYDSNKKAVSGRSNQIINGSPKEPSRRSMRREQENHSLNHTKKKLMKCYCLSCHSLSSELQNYCMEKQCYVYSECRTDCSVIQSSEANNSDYDLSSKNL
         H  SNKK+VSGRSNQIING PKEPSR+ M REQENHSLNHTKKK  KC CLSCHSLS ELQNY +E+QCYVYSECRTD  +I+SSEAN  DYDLSSK+L
Subjt:  NHYDSNKKAVSGRSNQIINGSPKEPSRRSMRREQENHSLNHTKKKLMKCYCLSCHSLSSELQNYCMEKQCYVYSECRTDCSVIQSSEANNSDYDLSSKNL

Query:  WNAESNEEIEFITFSRAKDQRNYGIGTVVYDVFVYSQCQPDENKETNTKLKELSTNGKHEPSKFTKCMKGVDKYPSHVHPKLPNYEEMAAKFICLKREYL
        WNAESNEEIEFITFSRAK+QRNYGIGTVVYDVFVYS CQPDENKETNTKL+E      HEPSKFTKCMKG  KYPSHVHPKLP+Y+E+A +FI LKREYL
Subjt:  WNAESNEEIEFITFSRAKDQRNYGIGTVVYDVFVYSQCQPDENKETNTKLKELSTNGKHEPSKFTKCMKGVDKYPSHVHPKLPNYEEMAAKFICLKREYL

Query:  ERSMKGRN
        +RSMK  N
Subjt:  ERSMKGRN

KAE8645768.1 hypothetical protein Csa_020499 [Cucumis sativus]2.4e-28497.44Show/hide
Query:  KKLIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISSLIFASARFG
        KKLIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDE LMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISSLIFASARFG
Subjt:  KKLIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISSLIFASARFG

Query:  DFPELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYPAVLALENCPDWHKKQVLQNGDQTTVKGKVESEELE
        DFPELKCVRKLFEERF KSFIVAAVELSPGNLVNKQIKEKLVM+PV NHEKQRLINEIARDCF+PAVLALENCPDWHKKQVLQNGDQTTVKGKVESEELE
Subjt:  DFPELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYPAVLALENCPDWHKKQVLQNGDQTTVKGKVESEELE

Query:  RVVISLDSTRDCNCGVLDSWCHSTSSSPSVSQSFPNDASTSEFLPFCEEAMVYFDDVVELSDPSTEYGDLLDQRFFKFKSLITSTREENVGDGDDQSLIE
        RVVISLDSTRD NCGVLDSWCHSTSSSPSVSQSFPNDASTSEFLPFCEEAMVYFDDVVELSDPSTEYGDLLDQRFFKFKSLITSTREENVGDGDDQSLIE
Subjt:  RVVISLDSTRDCNCGVLDSWCHSTSSSPSVSQSFPNDASTSEFLPFCEEAMVYFDDVVELSDPSTEYGDLLDQRFFKFKSLITSTREENVGDGDDQSLIE

Query:  NHYDSNKKAVSGRSNQIINGSPKEPSRRSMRREQENHSLNHTKKKLMKCYCLSCHSLSSELQNYCMEKQCYVYSECRTDCSVIQSSEANNSDYDLSSKNL
        NHY SNKKAVSGRSNQIINGSPKEPSRRSMRREQENHSLNHTKKKLMKCYCLSCHSLSSELQNYC+E+QCYV+SECRTD SVIQSSEANNSDYDLSSKNL
Subjt:  NHYDSNKKAVSGRSNQIINGSPKEPSRRSMRREQENHSLNHTKKKLMKCYCLSCHSLSSELQNYCMEKQCYVYSECRTDCSVIQSSEANNSDYDLSSKNL

Query:  WNAESNEEIEFITFSRAKDQRNYGIGTVVYDVFVYSQCQPDENKETNTKLKELSTNGKHEPSKFTKCMKGVDKYPSHVHPKLPNYEEMAAKFICLKREYL
        WNAESNEEIEFITFSRAKDQRNYGIGTVVYDVFVYSQCQPDENKETNTKLKELSTNGKHEPSKFTKCMKG DKYPSHVHPKLPNYEE+AAKFICLKREYL
Subjt:  WNAESNEEIEFITFSRAKDQRNYGIGTVVYDVFVYSQCQPDENKETNTKLKELSTNGKHEPSKFTKCMKGVDKYPSHVHPKLPNYEEMAAKFICLKREYL

Query:  ERSMKGRN
        ERSMKGRN
Subjt:  ERSMKGRN

XP_016901704.1 PREDICTED: uncharacterized protein LOC103495691 [Cucumis melo]9.4e-24185.04Show/hide
Query:  KKLIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISSLIFASARFG
        KKLIKTALCRLKLL+KKRYSIVKLLREDLCELINNGYQQIAFKRVEQL+QDE LME YDLIEN CE ILVKFSH+RKHKTCPDD+ EAISSLIFASARFG
Subjt:  KKLIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISSLIFASARFG

Query:  DFPELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYPAVLALENCPDWHKKQVLQNGDQTTVKGKVESEELE
        DFPELKCVR LFEERF KSF VAAVELSPGNLVNKQIKEKLVM+ VSNHEKQRLINEIARDCF PAVLALE  PD  +KQVLQN DQTTV+GKVESEELE
Subjt:  DFPELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYPAVLALENCPDWHKKQVLQNGDQTTVKGKVESEELE

Query:  RVVISLDSTRDCNCGVLDSWCHSTSSSPSVSQSFPNDASTSEFLPFCEEAMVYFDDVVELSDPSTEYGDLLDQRFFKFKSLITSTREENVGDGDDQSLIE
        RVVISLDS RD NCGVLD WCHSTSSS SV QSFPNDAS+SEFLPFCEEA+VYFDDVVELSDPSTE+GDLLDQRFFKFK  ITSTREENV D DDQSLIE
Subjt:  RVVISLDSTRDCNCGVLDSWCHSTSSSPSVSQSFPNDASTSEFLPFCEEAMVYFDDVVELSDPSTEYGDLLDQRFFKFKSLITSTREENVGDGDDQSLIE

Query:  NHYDSNKKAVSGRSNQIINGSPKEPSRRSMRREQENHSLNHTKKKLMKCYCLSCHSLSSELQNYCMEKQCYVYSECRTDCSVIQSSEANNSDYDLSSKNL
         H  SNKK+VSGRSNQIING PKEPSR+ M REQENHSLNHTKKK  KC CLSCHSLS ELQNY +E+QCYVYSECRTD  +I+SSEAN  DYDLSSK+L
Subjt:  NHYDSNKKAVSGRSNQIINGSPKEPSRRSMRREQENHSLNHTKKKLMKCYCLSCHSLSSELQNYCMEKQCYVYSECRTDCSVIQSSEANNSDYDLSSKNL

Query:  WNAESNEEIEFITFSRAKDQRNYGIGTVVYDVFVYSQCQPDENKETNTKLKELSTNGKHEPSKFTKCMKGVDKYPSHVHPKLPNYEEMAAKFICLKREYL
        WNAESNEEIEFITFSRAK+QRNYGIGTVVYDVFVYS CQPDENKETNTKL+E      HEPSKFTKCMKG  KYPSHVHPKLP+Y+E+A +FI LKREYL
Subjt:  WNAESNEEIEFITFSRAKDQRNYGIGTVVYDVFVYSQCQPDENKETNTKLKELSTNGKHEPSKFTKCMKGVDKYPSHVHPKLPNYEEMAAKFICLKREYL

Query:  ERSMKGRN
        +RSMK  N
Subjt:  ERSMKGRN

XP_031744656.1 uncharacterized protein LOC105436231 [Cucumis sativus]4.7e-24097.2Show/hide
Query:  TCPDDITEAISSLIFASARFGDFPELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYPAVLALENCPDWHKK
        TCPDDITEAISSLIFASARFGDFPELKCVRKLFEERF KSFIVAAVELSPGNLVNKQIKEKLVM+PV NHEKQRLINEIARDCF+PAVLALENCPDWHKK
Subjt:  TCPDDITEAISSLIFASARFGDFPELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYPAVLALENCPDWHKK

Query:  QVLQNGDQTTVKGKVESEELERVVISLDSTRDCNCGVLDSWCHSTSSSPSVSQSFPNDASTSEFLPFCEEAMVYFDDVVELSDPSTEYGDLLDQRFFKFK
        QVLQNGDQTTVKGKVESEELERVVISLDSTRD NCGVLDSWCHSTSSSPSVSQSFPNDASTSEFLPFCEEAMVYFDDVVELSDPSTEYGDLLDQRFFKFK
Subjt:  QVLQNGDQTTVKGKVESEELERVVISLDSTRDCNCGVLDSWCHSTSSSPSVSQSFPNDASTSEFLPFCEEAMVYFDDVVELSDPSTEYGDLLDQRFFKFK

Query:  SLITSTREENVGDGDDQSLIENHYDSNKKAVSGRSNQIINGSPKEPSRRSMRREQENHSLNHTKKKLMKCYCLSCHSLSSELQNYCMEKQCYVYSECRTD
        SLITSTREENVGDGDDQSLIENHY SNKKAVSGRSNQIINGSPKEPSRRSMRREQENHSLNHTKKKLMKCYCLSCHSLSSELQNYC+E+QCYV+SECRTD
Subjt:  SLITSTREENVGDGDDQSLIENHYDSNKKAVSGRSNQIINGSPKEPSRRSMRREQENHSLNHTKKKLMKCYCLSCHSLSSELQNYCMEKQCYVYSECRTD

Query:  CSVIQSSEANNSDYDLSSKNLWNAESNEEIEFITFSRAKDQRNYGIGTVVYDVFVYSQCQPDENKETNTKLKELSTNGKHEPSKFTKCMKGVDKYPSHVH
         SVIQSSEANNSDYDLSSKNLWNAESNEEIEFITFSRAKDQRNYGIGTVVYDVFVYSQCQPDENKETNTKLKELSTNGKHEPSKFTKCMKG DKYPSHVH
Subjt:  CSVIQSSEANNSDYDLSSKNLWNAESNEEIEFITFSRAKDQRNYGIGTVVYDVFVYSQCQPDENKETNTKLKELSTNGKHEPSKFTKCMKGVDKYPSHVH

Query:  PKLPNYEEMAAKFICLKREYLERSMKGRN
        PKLPNYEE+AAKFICLKREYLERSMKGRN
Subjt:  PKLPNYEEMAAKFICLKREYLERSMKGRN

XP_038888606.1 uncharacterized protein LOC120078409 [Benincasa hispida]6.8e-20772.42Show/hide
Query:  KKLIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHK------TCPDDITEAISSLIF
        KKLI+TALCRL++L+KKRYSIVK LREDL EL+NNGYQQIAF+RVEQLIQDEILMEAYDLIENFCE ILVKFSHI+KHK      TCPDDI EAISSLIF
Subjt:  KKLIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHK------TCPDDITEAISSLIF

Query:  ASARFGDFPELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYPAVLALENCPDWHKKQVLQNGDQTTVKGKV
        ASAR GDFPELK VRKLFE+RF +SF VAAVEL PGNLVN QIKEKL+++PVS+HEKQR IN+IARDCFYP++LALE  PDWH+KQV  N D+T  +GK 
Subjt:  ASARFGDFPELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYPAVLALENCPDWHKKQVLQNGDQTTVKGKV

Query:  ESEELERVVISLDSTRDCNCGVLDSWCHSTSSSPSVSQSFP-----------NDASTSEFLPFCEEAMVYFDDVVELSDPSTEYGDLLDQRFFKFKSLIT
        E E+LE+VVISL+S+RD N  V+D+ CHSTSSS  V QS P           NDAS+ E LPFC+E++VY D++VELSD S EYGD LDQRFFKFKSL+T
Subjt:  ESEELERVVISLDSTRDCNCGVLDSWCHSTSSSPSVSQSFP-----------NDASTSEFLPFCEEAMVYFDDVVELSDPSTEYGDLLDQRFFKFKSLIT

Query:  STREENVGDGDDQSLIENHYDSNKKAVSGRSNQIINGSPKEPSRRSMRREQENHSLNHTKKKLMKCYCLSCHSLSSELQNYCMEKQCYVYSECRTDCSVI
        S   ENV DG+DQSLIE H DSN K VSGRSNQII GSPKE    SM REQENHSLN+TKKK MKC CLSCHSLSSELQNYC+E+ CYVYSECRTD  V+
Subjt:  STREENVGDGDDQSLIENHYDSNKKAVSGRSNQIINGSPKEPSRRSMRREQENHSLNHTKKKLMKCYCLSCHSLSSELQNYCMEKQCYVYSECRTDCSVI

Query:  QSSEANNSDYDLSSKNLWNAESNEEIEFITFSRAKDQRNYGIGTVVYDVFVYSQCQPDENKETNTKLKELSTNGKHEPS--------KFTKCMKGVDKYP
        +SS   ++DYDLSSK+ WN +SNEEIEF TFSR K   NYG GTVVYDVFVYS CQP ENKETN K +EL TNGK+E S        KFTKCMKG DKYP
Subjt:  QSSEANNSDYDLSSKNLWNAESNEEIEFITFSRAKDQRNYGIGTVVYDVFVYSQCQPDENKETNTKLKELSTNGKHEPS--------KFTKCMKGVDKYP

Query:  SHVHPKLPNYEEMAAKFICLKREYLERSMKGRN
        SHVHPKLP+Y+E+AAKFI LKREYL++SMKG N
Subjt:  SHVHPKLPNYEEMAAKFICLKREYLERSMKGRN

TrEMBL top hitse value%identityAlignment
A0A0A0K3R2 Uncharacterized protein7.6e-29797.53Show/hide
Query:  MKDDKDCRISYFLNLFQGSKKLIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTC
        MKDDKDCRISYFLNLFQGSKKLIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDE LMEAYDLIENFCEIILVKFSHIRKHKTC
Subjt:  MKDDKDCRISYFLNLFQGSKKLIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTC

Query:  PDDITEAISSLIFASARFGDFPELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYPAVLALENCPDWHKKQV
        PDDITEAISSLIFASARFGDFPELKCVRKLFEERF KSFIVAAVELSPGNLVNKQIKEKLVM+PV NHEKQRLINEIARDCF+PAVLALENCPDWHKKQV
Subjt:  PDDITEAISSLIFASARFGDFPELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYPAVLALENCPDWHKKQV

Query:  LQNGDQTTVKGKVESEELERVVISLDSTRDCNCGVLDSWCHSTSSSPSVSQSFPNDASTSEFLPFCEEAMVYFDDVVELSDPSTEYGDLLDQRFFKFKSL
        LQNGDQTTVKGKVESEELERVVISLDSTRD NCGVLDSWCHSTSSSPSVSQSFPNDASTSEFLPFCEEAMVYFDDVVELSDPSTEYGDLLDQRFFKFKSL
Subjt:  LQNGDQTTVKGKVESEELERVVISLDSTRDCNCGVLDSWCHSTSSSPSVSQSFPNDASTSEFLPFCEEAMVYFDDVVELSDPSTEYGDLLDQRFFKFKSL

Query:  ITSTREENVGDGDDQSLIENHYDSNKKAVSGRSNQIINGSPKEPSRRSMRREQENHSLNHTKKKLMKCYCLSCHSLSSELQNYCMEKQCYVYSECRTDCS
        ITSTREENVGDGDDQSLIENHY SNKKAVSGRSNQIINGSPKEPSRRSMRREQENHSLNHTKKKLMKCYCLSCHSLSSELQNYC+E+QCYV+SECRTD S
Subjt:  ITSTREENVGDGDDQSLIENHYDSNKKAVSGRSNQIINGSPKEPSRRSMRREQENHSLNHTKKKLMKCYCLSCHSLSSELQNYCMEKQCYVYSECRTDCS

Query:  VIQSSEANNSDYDLSSKNLWNAESNEEIEFITFSRAKDQRNYGIGTVVYDVFVYSQCQPDENKETNTKLKELSTNGKHEPSKFTKCMKGVDKYPSHVHPK
        VIQSSEANNSDYDLSSKNLWNAESNEEIEFITFSRAKDQRNYGIGTVVYDVFVYSQCQPDENKETNTKLKELSTNGKHEPSKFTKCMKG DKYPSHVHPK
Subjt:  VIQSSEANNSDYDLSSKNLWNAESNEEIEFITFSRAKDQRNYGIGTVVYDVFVYSQCQPDENKETNTKLKELSTNGKHEPSKFTKCMKGVDKYPSHVHPK

Query:  LPNYEEMAAKFICLKREYLERSMKGRN
        LPNYEE+AAKFICLKREYLERSMKGRN
Subjt:  LPNYEEMAAKFICLKREYLERSMKGRN

A0A1S4E0G2 uncharacterized protein LOC1034956914.6e-24185.04Show/hide
Query:  KKLIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISSLIFASARFG
        KKLIKTALCRLKLL+KKRYSIVKLLREDLCELINNGYQQIAFKRVEQL+QDE LME YDLIEN CE ILVKFSH+RKHKTCPDD+ EAISSLIFASARFG
Subjt:  KKLIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISSLIFASARFG

Query:  DFPELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYPAVLALENCPDWHKKQVLQNGDQTTVKGKVESEELE
        DFPELKCVR LFEERF KSF VAAVELSPGNLVNKQIKEKLVM+ VSNHEKQRLINEIARDCF PAVLALE  PD  +KQVLQN DQTTV+GKVESEELE
Subjt:  DFPELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYPAVLALENCPDWHKKQVLQNGDQTTVKGKVESEELE

Query:  RVVISLDSTRDCNCGVLDSWCHSTSSSPSVSQSFPNDASTSEFLPFCEEAMVYFDDVVELSDPSTEYGDLLDQRFFKFKSLITSTREENVGDGDDQSLIE
        RVVISLDS RD NCGVLD WCHSTSSS SV QSFPNDAS+SEFLPFCEEA+VYFDDVVELSDPSTE+GDLLDQRFFKFK  ITSTREENV D DDQSLIE
Subjt:  RVVISLDSTRDCNCGVLDSWCHSTSSSPSVSQSFPNDASTSEFLPFCEEAMVYFDDVVELSDPSTEYGDLLDQRFFKFKSLITSTREENVGDGDDQSLIE

Query:  NHYDSNKKAVSGRSNQIINGSPKEPSRRSMRREQENHSLNHTKKKLMKCYCLSCHSLSSELQNYCMEKQCYVYSECRTDCSVIQSSEANNSDYDLSSKNL
         H  SNKK+VSGRSNQIING PKEPSR+ M REQENHSLNHTKKK  KC CLSCHSLS ELQNY +E+QCYVYSECRTD  +I+SSEAN  DYDLSSK+L
Subjt:  NHYDSNKKAVSGRSNQIINGSPKEPSRRSMRREQENHSLNHTKKKLMKCYCLSCHSLSSELQNYCMEKQCYVYSECRTDCSVIQSSEANNSDYDLSSKNL

Query:  WNAESNEEIEFITFSRAKDQRNYGIGTVVYDVFVYSQCQPDENKETNTKLKELSTNGKHEPSKFTKCMKGVDKYPSHVHPKLPNYEEMAAKFICLKREYL
        WNAESNEEIEFITFSRAK+QRNYGIGTVVYDVFVYS CQPDENKETNTKL+E      HEPSKFTKCMKG  KYPSHVHPKLP+Y+E+A +FI LKREYL
Subjt:  WNAESNEEIEFITFSRAKDQRNYGIGTVVYDVFVYSQCQPDENKETNTKLKELSTNGKHEPSKFTKCMKGVDKYPSHVHPKLPNYEEMAAKFICLKREYL

Query:  ERSMKGRN
        +RSMK  N
Subjt:  ERSMKGRN

A0A5D3C6Z3 Ist1 domain-containing protein2.3e-24084.84Show/hide
Query:  KKLIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISSLIFASARFG
        KKLIKTALCRLKLL+KKRYSIVKLLREDLCELINNGYQQIAFKRVEQL+QDE LME YDLIEN CE ILVKFSH+RKHKTCPDD+ EAISSLIFASARFG
Subjt:  KKLIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISSLIFASARFG

Query:  DFPELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYPAVLALENCPDWHKKQVLQNGDQTTVKGKVESEELE
        DFPELKCVR LFEERF KSF VAAVELSPGNLVNKQIKEKLVM+ VSNHEKQRLINEIARDCF PAVLALE  PD  +KQVL N DQTTV+GKVESEELE
Subjt:  DFPELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYPAVLALENCPDWHKKQVLQNGDQTTVKGKVESEELE

Query:  RVVISLDSTRDCNCGVLDSWCHSTSSSPSVSQSFPNDASTSEFLPFCEEAMVYFDDVVELSDPSTEYGDLLDQRFFKFKSLITSTREENVGDGDDQSLIE
        RVVISLDS RD NCGVLD WCHSTSSS SV QSFPNDAS+SEFLPFCEEA+VYFDDVVELSDPSTE+GDLLDQRFFKFK  ITSTREENV D DDQSLIE
Subjt:  RVVISLDSTRDCNCGVLDSWCHSTSSSPSVSQSFPNDASTSEFLPFCEEAMVYFDDVVELSDPSTEYGDLLDQRFFKFKSLITSTREENVGDGDDQSLIE

Query:  NHYDSNKKAVSGRSNQIINGSPKEPSRRSMRREQENHSLNHTKKKLMKCYCLSCHSLSSELQNYCMEKQCYVYSECRTDCSVIQSSEANNSDYDLSSKNL
         H  SNKK+VSGRSNQIING PKEPSR+ M REQENHSLNHTKKK  KC CLSCHSLS ELQNY +E+QCYVYSECRTD  +I+SSEAN  DYDLSSK+L
Subjt:  NHYDSNKKAVSGRSNQIINGSPKEPSRRSMRREQENHSLNHTKKKLMKCYCLSCHSLSSELQNYCMEKQCYVYSECRTDCSVIQSSEANNSDYDLSSKNL

Query:  WNAESNEEIEFITFSRAKDQRNYGIGTVVYDVFVYSQCQPDENKETNTKLKELSTNGKHEPSKFTKCMKGVDKYPSHVHPKLPNYEEMAAKFICLKREYL
        WNAESNEEIEFITFSRAK+QRNYGIGTVVYDVFVYS CQPDENKETNTKL+E      HEPSKFTKCMKG  KYPSHVHPKLP+Y+E+A +FI LKREYL
Subjt:  WNAESNEEIEFITFSRAKDQRNYGIGTVVYDVFVYSQCQPDENKETNTKLKELSTNGKHEPSKFTKCMKGVDKYPSHVHPKLPNYEEMAAKFICLKREYL

Query:  ERSMKGRN
        +RSMK  N
Subjt:  ERSMKGRN

A0A6J1D262 uncharacterized protein LOC1110166732.6e-15159.27Show/hide
Query:  LIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISSLIFASARFGDF
        +IK AL RLK+L+KKRYSI++ LREDL ELI+NGYQQIAF RVEQLI+DE LMEAYDLIENFCE IL+ FSH+RKHKTCPDD+ EAISSLIFASAR GD 
Subjt:  LIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISSLIFASARFGDF

Query:  PELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYPAVLALENCPDWHKKQVLQNGDQTTVKGKVE-------
        PELK VRKLFEERF +SF   AVEL PGNLVN QIKEKL+++PVS+HEKQ LI+EIARDCF PA+LALE CPDWH+KQVL+NGDQT  + K E       
Subjt:  PELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYPAVLALENCPDWHKKQVLQNGDQTTVKGKVE-------

Query:  SEELERVVISLDSTRDCNCGVLDSWCHSTSSSPSVSQSFPNDASTSEFLPFCEEAMVYFDDVVELSDPSTEYGDLLDQRFFKFKSLITSTREENVGDGDD
        +EE ER V+                C S S     S +   DAS+ E  PFCEE +VY DDVVEL +PS E G L DQR FKFKS +T  R ENV  G+ 
Subjt:  SEELERVVISLDSTRDCNCGVLDSWCHSTSSSPSVSQSFPNDASTSEFLPFCEEAMVYFDDVVELSDPSTEYGDLLDQRFFKFKSLITSTREENVGDGDD

Query:  QSLIENHYDSNKK-AVSGRSNQIINGSPKEPSRRSMRREQENHSLNHTKKKLMKCYCLSCHS--LSSELQNYCMEKQCYVYSECRTDCSVIQSSEANNSD
        Q LIE H++++KK +VS RS+Q +N SPK   RRS    QE+ SL+H KK+LMKC CLSCHS  L S ++NYC+E+  YVYSE       +  SE    D
Subjt:  QSLIENHYDSNKK-AVSGRSNQIINGSPKEPSRRSMRREQENHSLNHTKKKLMKCYCLSCHS--LSSELQNYCMEKQCYVYSECRTDCSVIQSSEANNSD

Query:  YDLSSKNLWNAESNEEIEFITFSRAKDQRNYGIGTVVYDVFVYSQCQPDENKETNTKLKELSTNGKHE--------PSKFTKCMKGVDKYPSHVHPKLPN
              N  + +    +EF T  R +  RN  +GT+VYDVFVYS CQP ENKETN K +E+ST  KHE         S FTKC K  DKYPSHVHPKLP+
Subjt:  YDLSSKNLWNAESNEEIEFITFSRAKDQRNYGIGTVVYDVFVYSQCQPDENKETNTKLKELSTNGKHE--------PSKFTKCMKGVDKYPSHVHPKLPN

Query:  YEEMAAKFICLKREYLER
        Y+E+AAKFI LKREY +R
Subjt:  YEEMAAKFICLKREYLER

A0A6J1I0X9 uncharacterized protein LOC1114680611.9e-7439.42Show/hide
Query:  KKLIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHK-------------TCPDDITE
        KKLI+ ALCRLK+L+KKRYSI + LREDL ELI+NGYQQIAF RVEQL++DE LMEAYDLIENFCE ILVKFSH+RK+K             TCPDD+ E
Subjt:  KKLIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHK-------------TCPDDITE

Query:  AISSLIFASARFGDFPELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYPAVLALENCPDWHKKQVLQNGDQ
        AISSLIFASAR GDFPELK VRK FEER+ + F +AAVEL PGNLVN QIKEKLVM+ VS+ EKQRL+NEIARDCF+PA+LALE CPDW+          
Subjt:  AISSLIFASARFGDFPELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYPAVLALENCPDWHKKQVLQNGDQ

Query:  TTVKGKVESEELERVVISLDSTRDCNCGVLDSWCHSTSSSPSVSQSFPNDASTSEFLPFCEEAMVYFDDVVELSDPSTEYGDLLDQRFFKFKSLITSTRE
                                                                                                            
Subjt:  TTVKGKVESEELERVVISLDSTRDCNCGVLDSWCHSTSSSPSVSQSFPNDASTSEFLPFCEEAMVYFDDVVELSDPSTEYGDLLDQRFFKFKSLITSTRE

Query:  ENVGDGDDQSLIENHYDSNKKAVSGRSNQIINGSPKEPSRRSMRREQENHSLNHTKKKLMKCYCLSCHSLSSELQ-NYCMEKQ--CYVYSECRTDCSV--
                                                       + H L+ T  +           LS E+   Y ++KQ  CYV+SECR D  V  
Subjt:  ENVGDGDDQSLIENHYDSNKKAVSGRSNQIINGSPKEPSRRSMRREQENHSLNHTKKKLMKCYCLSCHSLSSELQ-NYCMEKQ--CYVYSECRTDCSV--

Query:  IQSSEANNSDYDLSSKNLWNAESNEEIEFITFSRAKDQRNYGIGTVVYDVFVYSQCQPDENKETNTKLKELSTNGKHEPSKFTKCMKGVDKYPSHVHPKL
        IQS   N  DYD      W    N+EI F          +Y IG++VYDVFVY  CQP EN                             KYP HVHPKL
Subjt:  IQSSEANNSDYDLSSKNLWNAESNEEIEFITFSRAKDQRNYGIGTVVYDVFVYSQCQPDENKETNTKLKELSTNGKHEPSKFTKCMKGVDKYPSHVHPKL

Query:  PNYEEMAAKFICLKREYLER
        P+Y+E+AAKFI +KREY+++
Subjt:  PNYEEMAAKFICLKREYLER

SwissProt top hitse value%identityAlignment
P53990 IST1 homolog4.2e-1026.99Show/hide
Query:  IKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISSLIFASARF-GDF
        ++  + RLKLL KK+  + +  R+++ + +  G  + A  RVE +I+++ L+EA +++E +C+++L +F  I+  K     + E++S+LI+A+ R   + 
Subjt:  IKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISSLIFASARF-GDF

Query:  PELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYP
         ELK V      ++ K +         G  VN ++  KL +        +R + EIA++   P
Subjt:  PELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYP

Q3ZBV1 IST1 homolog4.2e-1026.99Show/hide
Query:  IKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISSLIFASARF-GDF
        ++  + RLKLL KK+  + +  R+++ + +  G  + A  RVE +I+++ L+EA +++E +C+++L +F  I+  K     + E++S+LI+A+ R   + 
Subjt:  IKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISSLIFASARF-GDF

Query:  PELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYP
         ELK V      ++ K +         G  VN ++  KL +        +R + EIA++   P
Subjt:  PELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYP

Q568Z6 IST1 homolog4.2e-1026.99Show/hide
Query:  IKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISSLIFASARF-GDF
        ++  + RLKLL KK+  + +  R+++ + +  G  + A  RVE +I+++ L+EA +++E +C+++L +F  I+  K     + E++S+LI+A+ R   + 
Subjt:  IKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISSLIFASARF-GDF

Query:  PELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYP
         ELK V      ++ K +         G  VN ++  KL +        +R + EIA++   P
Subjt:  PELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYP

Q5R6G8 IST1 homolog4.2e-1026.99Show/hide
Query:  IKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISSLIFASARF-GDF
        ++  + RLKLL KK+  + +  R+++ + +  G  + A  RVE +I+++ L+EA +++E +C+++L +F  I+  K     + E++S+LI+A+ R   + 
Subjt:  IKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISSLIFASARF-GDF

Query:  PELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYP
         ELK V      ++ K +         G  VN ++  KL +        +R + EIA++   P
Subjt:  PELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYP

Q9CX00 IST1 homolog4.2e-1026.99Show/hide
Query:  IKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISSLIFASARF-GDF
        ++  + RLKLL KK+  + +  R+++ + +  G  + A  RVE +I+++ L+EA +++E +C+++L +F  I+  K     + E++S+LI+A+ R   + 
Subjt:  IKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISSLIFASARF-GDF

Query:  PELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYP
         ELK V      ++ K +         G  VN ++  KL +        +R + EIA++   P
Subjt:  PELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYP

Arabidopsis top hitse value%identityAlignment
AT1G34220.2 Regulator of Vps4 activity in the MVB pathway protein3.3e-2637.57Show/hide
Query:  SYFLNLFQGS--KKLIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEA
        S+F   F+ +  K L+K  + R+KL+R +R + +K +R ++ +L+  G +  A  RVE +I++E +M A +++E FCE+I V+   I   + CP D+ EA
Subjt:  SYFLNLFQGS--KKLIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEA

Query:  ISSLIFASARFGDFPELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARD
        ISS+ FA+ R  D  EL+ V+ LF  ++ K F+ AA EL P + VN+++ E L +R  S   K +L+ EIA +
Subjt:  ISSLIFASARFGDFPELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARD

AT2G14830.1 Regulator of Vps4 activity in the MVB pathway protein1.8e-3541.71Show/hide
Query:  KKLIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISSLIFASARFG
        K ++K   CRL LL+ K+Y+I   LR D+ +L+  G +  A  R +QL  DE LM  Y L+ +F +IIL+  S+IR+ +  PD I EA+S+L+FASAR G
Subjt:  KKLIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISSLIFASARFG

Query:  DFPELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDC-FYPAVLALENCPDWHKKQVLQNGDQTTVKGKVESEEL
        D PEL+ +R LF +R+   F+  A+ L PGN VN Q+ EKL +  VS+  K +L+ EI  +      VLA+E  P++H KQVL          K E  E 
Subjt:  DFPELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDC-FYPAVLALENCPDWHKKQVLQNGDQTTVKGKVESEEL

Query:  ERVVISLDSTR
        E+ V+S +S +
Subjt:  ERVVISLDSTR

AT2G19710.1 Regulator of Vps4 activity in the MVB pathway protein5.9e-2335.4Show/hide
Query:  KKLIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISSLIFASARFG
        K  ++ A  RLK+L+ K+   +K LR +L +L+ +G    A  RVE ++++E  + AY+LI  +CE+++V+   I   K CP D+ EA++S++FAS R  
Subjt:  KKLIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISSLIFASARFG

Query:  DFPELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARD
        D PEL  + K F  ++ K F  +AVEL P + V++ + EKL  +      K +++  IA +
Subjt:  DFPELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARD

AT4G29440.1 Regulator of Vps4 activity in the MVB pathway protein4.5e-2334.59Show/hide
Query:  KKLIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISSLIFASARFG
        K  ++ A  RLK+L+ K+ + +K LR +L  L+ +G  Q A  RVE ++++E  + AY+L+  +CE+++ +   I   KTCP+D+ EA++S+++AS R  
Subjt:  KKLIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISSLIFASARFG

Query:  DFPELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIA
        D  EL  + K F  ++ K F+ AA+ L P + V++ + EKL ++      K +++ EIA
Subjt:  DFPELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIA

AT4G35730.1 Regulator of Vps4 activity in the MVB pathway protein2.5e-2637.36Show/hide
Query:  ISYFLNLFQGS--KKLIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITE
        +S F   F  S  K   K A+ R+KL+R KR  +VK +R D+  L+ +G    A  RVE +I+++ +  A ++IE FCE+I+ + + I K K CP D+ E
Subjt:  ISYFLNLFQGS--KKLIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITE

Query:  AISSLIFASARFGDFPELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARD
         I+SLIFA+ R  + PEL  +R +F +++ K F+ AA +L P   VN+ + +KL +R      K +++ EIA++
Subjt:  AISSLIFASARFGDFPELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGATGACAAAGATTGTAGGATTAGTTACTTTCTCAATCTATTCCAAGGCAGTAAGAAGCTGATCAAAACAGCTCTATGCCGGCTTAAGTTGCTCAGGAAGAAGAG
ATATTCAATTGTTAAGCTATTACGTGAAGATTTGTGTGAGCTTATTAACAATGGCTATCAACAAATTGCTTTCAAAAGGGTGGAGCAGCTTATTCAAGATGAAATATTGA
TGGAGGCTTATGATTTGATTGAAAATTTCTGTGAAATCATCTTGGTGAAGTTTTCCCATATCAGGAAACACAAGACCTGCCCTGATGATATTACTGAAGCAATTTCAAGT
CTCATATTTGCCTCGGCAAGATTTGGGGATTTCCCTGAACTTAAATGTGTTCGAAAGCTGTTCGAGGAGCGTTTCAGAAAGAGTTTCATTGTGGCTGCTGTTGAATTAAG
TCCTGGAAATCTTGTGAACAAACAGATAAAAGAGAAGCTTGTAATGCGGCCTGTCTCAAATCATGAGAAACAAAGACTAATCAATGAAATAGCAAGAGATTGCTTTTATC
CAGCAGTTTTGGCTCTTGAAAACTGTCCGGATTGGCATAAAAAACAGGTACTGCAAAATGGAGATCAAACTACTGTTAAGGGAAAAGTAGAATCAGAAGAGTTGGAGAGA
GTTGTAATATCTCTCGATTCAACACGAGACTGCAACTGTGGCGTCTTGGATTCTTGGTGTCATTCTACTTCTTCCAGCCCTTCAGTTTCTCAATCTTTTCCAAATGATGC
ATCCACTTCTGAATTCCTTCCATTTTGTGAAGAAGCTATGGTTTATTTCGATGATGTTGTTGAGCTTTCTGACCCCTCAACAGAATATGGAGACTTGTTGGATCAGAGAT
TCTTTAAGTTCAAATCACTTATTACAAGCACGAGAGAAGAAAATGTTGGAGATGGTGACGATCAAAGCCTCATAGAAAATCATTATGATTCCAATAAAAAAGCAGTTTCG
GGAAGATCTAATCAGATTATAAATGGATCTCCGAAAGAACCGAGTAGAAGGTCGATGCGTCGAGAGCAAGAGAATCATAGCCTAAATCATACAAAGAAAAAACTCATGAA
GTGTTATTGCTTATCATGCCACAGCTTGTCATCTGAACTGCAAAACTACTGCATGGAGAAGCAATGCTATGTCTATTCTGAGTGTAGAACAGATTGTTCTGTTATTCAGT
CTTCAGAAGCTAATAATTCGGATTATGATTTGTCTAGCAAAAATTTATGGAATGCAGAATCAAATGAAGAGATTGAATTCATTACGTTTTCGAGGGCAAAGGATCAAAGA
AATTATGGGATTGGAACAGTGGTTTACGATGTTTTTGTTTACTCTCAGTGTCAGCCAGATGAAAACAAGGAAACTAATACAAAACTAAAAGAACTCAGTACAAATGGTAA
GCATGAACCTTCAAAGTTCACAAAATGTATGAAAGGTGTCGATAAGTATCCAAGCCATGTTCATCCAAAGTTGCCCAATTACGAAGAAATGGCAGCCAAATTCATATGTT
TGAAGAGAGAGTATTTGGAAAGATCAATGAAAGGACGCAACTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAAGATGACAAAGATTGTAGGATTAGTTACTTTCTCAATCTATTCCAAGGCAGTAAGAAGCTGATCAAAACAGCTCTATGCCGGCTTAAGTTGCTCAGGAAGAAGAG
ATATTCAATTGTTAAGCTATTACGTGAAGATTTGTGTGAGCTTATTAACAATGGCTATCAACAAATTGCTTTCAAAAGGGTGGAGCAGCTTATTCAAGATGAAATATTGA
TGGAGGCTTATGATTTGATTGAAAATTTCTGTGAAATCATCTTGGTGAAGTTTTCCCATATCAGGAAACACAAGACCTGCCCTGATGATATTACTGAAGCAATTTCAAGT
CTCATATTTGCCTCGGCAAGATTTGGGGATTTCCCTGAACTTAAATGTGTTCGAAAGCTGTTCGAGGAGCGTTTCAGAAAGAGTTTCATTGTGGCTGCTGTTGAATTAAG
TCCTGGAAATCTTGTGAACAAACAGATAAAAGAGAAGCTTGTAATGCGGCCTGTCTCAAATCATGAGAAACAAAGACTAATCAATGAAATAGCAAGAGATTGCTTTTATC
CAGCAGTTTTGGCTCTTGAAAACTGTCCGGATTGGCATAAAAAACAGGTACTGCAAAATGGAGATCAAACTACTGTTAAGGGAAAAGTAGAATCAGAAGAGTTGGAGAGA
GTTGTAATATCTCTCGATTCAACACGAGACTGCAACTGTGGCGTCTTGGATTCTTGGTGTCATTCTACTTCTTCCAGCCCTTCAGTTTCTCAATCTTTTCCAAATGATGC
ATCCACTTCTGAATTCCTTCCATTTTGTGAAGAAGCTATGGTTTATTTCGATGATGTTGTTGAGCTTTCTGACCCCTCAACAGAATATGGAGACTTGTTGGATCAGAGAT
TCTTTAAGTTCAAATCACTTATTACAAGCACGAGAGAAGAAAATGTTGGAGATGGTGACGATCAAAGCCTCATAGAAAATCATTATGATTCCAATAAAAAAGCAGTTTCG
GGAAGATCTAATCAGATTATAAATGGATCTCCGAAAGAACCGAGTAGAAGGTCGATGCGTCGAGAGCAAGAGAATCATAGCCTAAATCATACAAAGAAAAAACTCATGAA
GTGTTATTGCTTATCATGCCACAGCTTGTCATCTGAACTGCAAAACTACTGCATGGAGAAGCAATGCTATGTCTATTCTGAGTGTAGAACAGATTGTTCTGTTATTCAGT
CTTCAGAAGCTAATAATTCGGATTATGATTTGTCTAGCAAAAATTTATGGAATGCAGAATCAAATGAAGAGATTGAATTCATTACGTTTTCGAGGGCAAAGGATCAAAGA
AATTATGGGATTGGAACAGTGGTTTACGATGTTTTTGTTTACTCTCAGTGTCAGCCAGATGAAAACAAGGAAACTAATACAAAACTAAAAGAACTCAGTACAAATGGTAA
GCATGAACCTTCAAAGTTCACAAAATGTATGAAAGGTGTCGATAAGTATCCAAGCCATGTTCATCCAAAGTTGCCCAATTACGAAGAAATGGCAGCCAAATTCATATGTT
TGAAGAGAGAGTATTTGGAAAGATCAATGAAAGGACGCAACTAG
Protein sequenceShow/hide protein sequence
MKDDKDCRISYFLNLFQGSKKLIKTALCRLKLLRKKRYSIVKLLREDLCELINNGYQQIAFKRVEQLIQDEILMEAYDLIENFCEIILVKFSHIRKHKTCPDDITEAISS
LIFASARFGDFPELKCVRKLFEERFRKSFIVAAVELSPGNLVNKQIKEKLVMRPVSNHEKQRLINEIARDCFYPAVLALENCPDWHKKQVLQNGDQTTVKGKVESEELER
VVISLDSTRDCNCGVLDSWCHSTSSSPSVSQSFPNDASTSEFLPFCEEAMVYFDDVVELSDPSTEYGDLLDQRFFKFKSLITSTREENVGDGDDQSLIENHYDSNKKAVS
GRSNQIINGSPKEPSRRSMRREQENHSLNHTKKKLMKCYCLSCHSLSSELQNYCMEKQCYVYSECRTDCSVIQSSEANNSDYDLSSKNLWNAESNEEIEFITFSRAKDQR
NYGIGTVVYDVFVYSQCQPDENKETNTKLKELSTNGKHEPSKFTKCMKGVDKYPSHVHPKLPNYEEMAAKFICLKREYLERSMKGRN