; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr015646 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr015646
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionRegulator of Vps4 activity in the MVB pathway protein, putative
Genome locationtig00004836:607945..610310
RNA-Seq ExpressionSgr015646
SyntenySgr015646
Gene Ontology termsGO:0015031 - protein transport (biological process)
InterPro domainsIPR005061 - Vacuolar protein sorting-associated protein Ist1
IPR042277 - Vacuolar protein sorting-associated protein IST1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575541.1 IST1-like protein, partial [Cucurbita argyrosperma subsp. sororia]6.2e-16769.41Show/hide
Query:  MFDMFFKPKFYTKCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECR
        MF MFFKPKFYTKCKSCVK+TKTRLD IRKKK  VLK+LKNDI +LL+S LDYNAY RAEG L E+N+L+CYDLIDEFCGTIF  +SVL+K+SECPDEC+
Subjt:  MFDMFFKPKFYTKCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECR

Query:  EAVASLIYAAARFADLPELRELRSLFTEKYGSSFESFTNKELVERSRAMAPTKEMKLQLLQEIAQESTIQWNSKAFEQQLYTPPQNEPDGDRS-ADKRNK
        EAVASLIYAAARF+DLPELR LRSLFT +YG+SF SFTNKELVE+ RA+A TKE KLQLLQEIAQES I WNSKA EQQLY PPQNE D +RS +  RNK
Subjt:  EAVASLIYAAARFADLPELRELRSLFTEKYGSSFESFTNKELVERSRAMAPTKEMKLQLLQEIAQESTIQWNSKAFEQQLYTPPQNEPDGDRS-ADKRNK

Query:  TKVASVPIYERKTNSLGKKDNSDDDSIFDSRSEDNATETSTGDSSSTDQDVQKG-ISEDEVEDQRPFYLRFIPPPYLKSKPNKKEASLEEPRKVTDHVNK
        +K+ SVP+Y R TNS  KK+NSDDDSIFDSRSE N TETS GD SSTDQDV KG +SEDEVEDQ+PFYLRFI PPYLK+KP KKE ++EEP+ VT     
Subjt:  TKVASVPIYERKTNSLGKKDNSDDDSIFDSRSEDNATETSTGDSSSTDQDVQKG-ISEDEVEDQRPFYLRFIPPPYLKSKPNKKEASLEEPRKVTDHVNK

Query:  DESNKQDHKSVSEEKPKPRSVRRRNVKPQPARDININDVGNSTNDGDEKISSSLIKGKETIIGEEKGAGDDDEERMLDGLLMHYSNKKSSHESKEAKGNL
         E N+ D K+  EEKPKPRSVRRR+ KPQPAR INI+D G S  DG  KISS   KGKE + GEEKG  +DDEERMLDGLL  YS KK+           
Subjt:  DESNKQDHKSVSEEKPKPRSVRRRNVKPQPARDININDVGNSTNDGDEKISSSLIKGKETIIGEEKGAGDDDEERMLDGLLMHYSNKKSSHESKEAKGNL

Query:  KSQRQQEKDSAEPQRTKHSKAEDVFAPARAVSVPTDPTEPTEPAKRHARTASFVHPKLPDYDQLAARFAALKGK
           RQ++KD+AEPQR K+++ ED + P +++S+P   TEP EP K+HAR  S+VHPKLPDYDQLAARFAALK K
Subjt:  KSQRQQEKDSAEPQRTKHSKAEDVFAPARAVSVPTDPTEPTEPAKRHARTASFVHPKLPDYDQLAARFAALKGK

KAG7014085.1 IST1-like protein [Cucurbita argyrosperma subsp. argyrosperma]8.0e-16769.41Show/hide
Query:  MFDMFFKPKFYTKCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECR
        MF MFFKPKFYTKCKSCVK+TKTRLD IRKKK  VLK+LKNDI +LL+S LDYNAY RAEG L E+N+L+CYDLIDEFCGTIF  +SVL+K+SECPDEC+
Subjt:  MFDMFFKPKFYTKCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECR

Query:  EAVASLIYAAARFADLPELRELRSLFTEKYGSSFESFTNKELVERSRAMAPTKEMKLQLLQEIAQESTIQWNSKAFEQQLYTPPQNEPDGDRS-ADKRNK
        EAVASLIYAAARF+DLPELR LRSLFT +YG+SF SFTNKELVE+ RA+A TKE KLQLLQEIAQES I WNSKA EQQLY PPQNE D +RS +  R+K
Subjt:  EAVASLIYAAARFADLPELRELRSLFTEKYGSSFESFTNKELVERSRAMAPTKEMKLQLLQEIAQESTIQWNSKAFEQQLYTPPQNEPDGDRS-ADKRNK

Query:  TKVASVPIYERKTNSLGKKDNSDDDSIFDSRSEDNATETSTGDSSSTDQDVQKG-ISEDEVEDQRPFYLRFIPPPYLKSKPNKKEASLEEPRKVTDHVNK
        +K+ SVP+Y R TNS  KKDNSDDDSIFDSRSE N TETS GD SSTDQDV KG +SEDEVEDQ+PFYLRFI PPYLK+KP KKE ++EEP+ VT     
Subjt:  TKVASVPIYERKTNSLGKKDNSDDDSIFDSRSEDNATETSTGDSSSTDQDVQKG-ISEDEVEDQRPFYLRFIPPPYLKSKPNKKEASLEEPRKVTDHVNK

Query:  DESNKQDHKSVSEEKPKPRSVRRRNVKPQPARDININDVGNSTNDGDEKISSSLIKGKETIIGEEKGAGDDDEERMLDGLLMHYSNKKSSHESKEAKGNL
         E N+ D K+  EEKPKPRSVRRR+ KPQPAR INI+D G S  DG  KISS   KGKE + GEEKG  +DDEERMLDGLL  YS KK+           
Subjt:  DESNKQDHKSVSEEKPKPRSVRRRNVKPQPARDININDVGNSTNDGDEKISSSLIKGKETIIGEEKGAGDDDEERMLDGLLMHYSNKKSSHESKEAKGNL

Query:  KSQRQQEKDSAEPQRTKHSKAEDVFAPARAVSVPTDPTEPTEPAKRHARTASFVHPKLPDYDQLAARFAALKGK
           RQ++KD+AEPQR K+++ ED + P +++S+P   TEP EP K+HAR  S+VHPKLPDYDQLAARFAALK K
Subjt:  KSQRQQEKDSAEPQRTKHSKAEDVFAPARAVSVPTDPTEPTEPAKRHARTASFVHPKLPDYDQLAARFAALKGK

XP_008462153.1 PREDICTED: uncharacterized protein LOC103500577 [Cucumis melo]2.0e-17073.05Show/hide
Query:  MFDMFFKPKFYTKCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECR
        MFDMF KPKFYTKCKSCVK+TKTRLD IRKKKNAVLKYLKNDI +LL+S LDYNA+ RAEG L ERN+L+CY+LIDEFCGTI NQ+ VL+K+SECPDEC+
Subjt:  MFDMFFKPKFYTKCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECR

Query:  EAVASLIYAAARFADLPELRELRSLFTEKYGSSFESFTNKELVERSRAMAPTKEMKLQLLQEIAQESTIQWNSKAFEQQLYT-PPQNEPDGDRS-ADKRN
        EAVA+LIYAAARFADLPELRELR+LFTEKYGSSF SFTNKE +E+SRA   TKEMK+QLLQEIAQE+ I WNSKA EQQLYT PPQNE DG+RS A KRN
Subjt:  EAVASLIYAAARFADLPELRELRSLFTEKYGSSFESFTNKELVERSRAMAPTKEMKLQLLQEIAQESTIQWNSKAFEQQLYT-PPQNEPDGDRS-ADKRN

Query:  KTKVASVPIYERKTNSLGKKDNSDDDSIFDSRSEDNATETSTGDSSSTDQDVQKGISEDEVEDQRPFYLRFIPPPYLKSKPNKKEASLEEPRKVTDHVNK
        KTKV SVP+YERK NS   K+NSD++SIFDSRSE N TETSTGD  STDQD+ KG+SEDEVEDQ+PF  RF+ PPYLK+KP K EA+ +EPRKVT  +  
Subjt:  KTKVASVPIYERKTNSLGKKDNSDDDSIFDSRSEDNATETSTGDSSSTDQDVQKGISEDEVEDQRPFYLRFIPPPYLKSKPNKKEASLEEPRKVTDHVNK

Query:  DESNKQDHKSVSEEKPKPRSVRRRNVKPQPARDININDVGNSTNDGDEKISSSLIKGKETIIGEEKGAGDDDEERMLDGLLMHYSNKKSSHESK-EAKGN
        DESN  + KS +EEKPKPRSVRRRNVKPQPARDINI+DVG+ST D   KISSS  KGKET+IGEEKGA  DDEER+LDGLLM YS KK++ ESK   K N
Subjt:  DESNKQDHKSVSEEKPKPRSVRRRNVKPQPARDININDVGNSTNDGDEKISSSLIKGKETIIGEEKGAGDDDEERMLDGLLMHYSNKKSSHESK-EAKGN

Query:  LKSQRQQEKDSAEPQRTKHSKAEDVFAPARAVSVPTDPTEPTEPAKRHARTASFVHPKLPDYDQLAARFAALKGK
        LK QRQQEKD+ E QR             RAVS+PTD  E T   K+H RT SFVHPKLP+YDQLAAR AALK K
Subjt:  LKSQRQQEKDSAEPQRTKHSKAEDVFAPARAVSVPTDPTEPTEPAKRHARTASFVHPKLPDYDQLAARFAALKGK

XP_022146255.1 vacuolar protein sorting-associated protein IST1 [Momordica charantia]1.4e-18275.69Show/hide
Query:  MFDMFFKPKFYTKCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECR
        MFDMFFKPKF+TKCKSCVKL K RLD IRKKKNAVLK+LKNDIA+LLRS LD NAY RAEGLL ERN+LKCYDLIDEF GTI+NQLSVLSKQSECPDECR
Subjt:  MFDMFFKPKFYTKCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECR

Query:  EAVASLIYAAARFADLPELRELRSLFTEKYGSSFESFTNKELVERSRAMAPTKEMKLQLLQEIAQESTIQWNSKAFEQQLYTPPQNEPDGDRSADKRNKT
        EAVASLIYAAARFADLPELRELRSLFT KYG SFE FTNKEL+E+  AMA TKE+KLQLLQEIAQES I+WNSKA EQQLYTPPQN PD + +AD+RNK 
Subjt:  EAVASLIYAAARFADLPELRELRSLFTEKYGSSFESFTNKELVERSRAMAPTKEMKLQLLQEIAQESTIQWNSKAFEQQLYTPPQNEPDGDRSADKRNKT

Query:  KVASVPIYERKTNSLGKKDNSDDDSIFDSRSEDNATETSTGDSSSTDQDVQKGISEDEVEDQRPFYLRFIPPPYLKSKPNKKEASLEEPRKVTDHVNKDE
        KV S+P++ERKT+ L KKDNSDDDSIFDSRSEDN TETSTGD SSTDQDVQKG+ EDEVED++PFYLRFIPPPYLKSKP+KKEAS E+P           
Subjt:  KVASVPIYERKTNSLGKKDNSDDDSIFDSRSEDNATETSTGDSSSTDQDVQKGISEDEVEDQRPFYLRFIPPPYLKSKPNKKEASLEEPRKVTDHVNKDE

Query:  SNKQDHKSVSEEKPKPRSVRRRNVKPQPARDININDVGNSTNDGDEKISSSLIKGKETIIGEEKGAGDDDEERMLDGLLMHYSNKKSSHESKEAKGNLKS
                  EEKPKPRSVRRRN K +P RDINI +V NS NDG EK SSS  KGKET++GEEKGAG  +EERMLDGLLMHYS KKS+ ES  AKGNLKS
Subjt:  SNKQDHKSVSEEKPKPRSVRRRNVKPQPARDININDVGNSTNDGDEKISSSLIKGKETIIGEEKGAGDDDEERMLDGLLMHYSNKKSSHESKEAKGNLKS

Query:  QRQQEKDSAEPQRTKHSKA-EDVFAPARAVSVPTDPTEPTEPAKRHARTASFVHPKLPDYDQLAARFAALKGK
        Q+ +E+D  EPQRTK+SKA ED+  P R+VS+P D  EP  P KRH R  SFVHPKLPDYDQLAARFAALK K
Subjt:  QRQQEKDSAEPQRTKHSKA-EDVFAPARAVSVPTDPTEPTEPAKRHARTASFVHPKLPDYDQLAARFAALKGK

XP_038897244.1 uncharacterized protein LOC120085370 [Benincasa hispida]5.0e-18576.58Show/hide
Query:  MFDMFFKPKFYTKCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECR
        MFDMF KPKFYTKCKSCVK+TKTRLD+IRKKK AVLKYLKNDI +LL+S LDYNAY RAEG L ERN+L+CY LIDEFCGTI +Q+ VLSK+SECP+EC+
Subjt:  MFDMFFKPKFYTKCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECR

Query:  EAVASLIYAAARFADLPELRELRSLFTEKYGSSFESFTNKELVERSRAMAPTKEMKLQLLQEIAQESTIQWNSKAFEQQLYT-PPQNEPDGDRSAD-KRN
        EAVA+LIYAAARFADLPELRELRSLFTEKYGSSF SFTNKEL+E+SRA A TKEMK+QLLQEIAQ+S I WNSKA EQQLYT PPQNEP G+RSA  KRN
Subjt:  EAVASLIYAAARFADLPELRELRSLFTEKYGSSFESFTNKELVERSRAMAPTKEMKLQLLQEIAQESTIQWNSKAFEQQLYT-PPQNEPDGDRSAD-KRN

Query:  KTKVASVPIYERKTNSLGKKDNSDDDSIFDSRSEDNATETSTGDSSSTDQDVQKGISEDEVEDQRPFYLRFIPPPYLKSKPNKKEASLEEPRKVTDHVNK
        KTK+ +VP++ERKT S  KKDNSD++SIFDSRSE N TETSTGD  STDQDV KGISEDEVEDQ+PFYLRF+PPPYLK+KP K EA++EE +KVT   N 
Subjt:  KTKVASVPIYERKTNSLGKKDNSDDDSIFDSRSEDNATETSTGDSSSTDQDVQKGISEDEVEDQRPFYLRFIPPPYLKSKPNKKEASLEEPRKVTDHVNK

Query:  DESNKQDHKSVSEEKPKPRSVRRRNVKPQPARDININDVGNSTNDGDEKISSSLIKGKETIIGEEKGAGDDDEERMLDGLLMHYSNKKSSHESKEAKGNL
        DESNKQ  KS +EEKPKPRSVRRRNVKPQPARDINI+D G+ST DG EK S S  KGKET+IGEE+GAG DDEER+LDGLLM YS KK+S ES   KGNL
Subjt:  DESNKQDHKSVSEEKPKPRSVRRRNVKPQPARDININDVGNSTNDGDEKISSSLIKGKETIIGEEKGAGDDDEERMLDGLLMHYSNKKSSHESKEAKGNL

Query:  KSQRQQEKDSAEPQRTKHSKAEDVFAPARAVSVPTDPTEPTEPAKRHARTASFVHPKLPDYDQLAARFAALKGK
        K QRQQEKD+AEPQ+ K +K ED F PARAVS+PTD    +EP K+H RT SFVHPKLP+YDQLAAR AALK K
Subjt:  KSQRQQEKDSAEPQRTKHSKAEDVFAPARAVSVPTDPTEPTEPAKRHARTASFVHPKLPDYDQLAARFAALKGK

TrEMBL top hitse value%identityAlignment
A0A0A0K926 Uncharacterized protein1.2e-16371.43Show/hide
Query:  MFDMFFKPKFYTKCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECR
        MFDMF KPKFYTKCKSCVK+TKTRLD  RKKKNAVLKYLKNDI +LL+S LDYNAY RAEG L ERN+L+CY+LIDEFCGTI NQ+ VL+K+SECPDEC+
Subjt:  MFDMFFKPKFYTKCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECR

Query:  EAVASLIYAAARFADLPELRELRSLFTEKYGSSFESFTNKELVERSRAMAPTKEMKLQLLQEIAQESTIQWNSKAFEQQLYT-PPQNEPDGDRS-ADKRN
        E+VA+LIYAAARFADLPELRELR+LFTEKYGSSF SFTNKE +E+SR    TKEMK+QLLQEIAQE+ I WNSKA EQQLYT PP+NE DG+RS A KRN
Subjt:  EAVASLIYAAARFADLPELRELRSLFTEKYGSSFESFTNKELVERSRAMAPTKEMKLQLLQEIAQESTIQWNSKAFEQQLYT-PPQNEPDGDRS-ADKRN

Query:  KTKVASVPIYERKTNSLGKKDNSDDDSIFDSRSEDNATETSTGDSSSTDQDVQKGISEDEVEDQRPFYLRFIPPPYLKSKPNKKEASLEE-PRKVTDHVN
        KTKV SVP+YE+K NS   K+NSD++SIFDSRSE N TETSTGD  STDQDV KG+S DEV DQ+PF  RF+ PPYLK+KP K EA+ EE PRKVT  + 
Subjt:  KTKVASVPIYERKTNSLGKKDNSDDDSIFDSRSEDNATETSTGDSSSTDQDVQKGISEDEVEDQRPFYLRFIPPPYLKSKPNKKEASLEE-PRKVTDHVN

Query:  KDESNKQDHKSVSEEKPKPRSVRRRNVKPQPARDININDVGNSTNDGDEKISSSLIKGKETIIGEEKGAGDDDEERMLDGLLMHYSNKKSSHESK-EAKG
         DESN    KS +EEKPKPRSVRRR VKPQPARDINI+DVG+ST D  +KISS   KGKET+IGEEKGA  DDEER+LDGLLM YS KK++ ESK   K 
Subjt:  KDESNKQDHKSVSEEKPKPRSVRRRNVKPQPARDININDVGNSTNDGDEKISSSLIKGKETIIGEEKGAGDDDEERMLDGLLMHYSNKKSSHESK-EAKG

Query:  NLKSQRQQEKDSAEPQRTKHSKAEDVFAPARAVSVPTDPTEPTEPAKRHARTASFVHPKLPDYDQLAARFAALKGK
        NLK QRQQEKD+ E QR            +RAVS P D     EP K+H RT SFVHPKLP+YDQLAAR AALK K
Subjt:  NLKSQRQQEKDSAEPQRTKHSKAEDVFAPARAVSVPTDPTEPTEPAKRHARTASFVHPKLPDYDQLAARFAALKGK

A0A1S3CGT5 uncharacterized protein LOC1035005779.9e-17173.05Show/hide
Query:  MFDMFFKPKFYTKCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECR
        MFDMF KPKFYTKCKSCVK+TKTRLD IRKKKNAVLKYLKNDI +LL+S LDYNA+ RAEG L ERN+L+CY+LIDEFCGTI NQ+ VL+K+SECPDEC+
Subjt:  MFDMFFKPKFYTKCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECR

Query:  EAVASLIYAAARFADLPELRELRSLFTEKYGSSFESFTNKELVERSRAMAPTKEMKLQLLQEIAQESTIQWNSKAFEQQLYT-PPQNEPDGDRS-ADKRN
        EAVA+LIYAAARFADLPELRELR+LFTEKYGSSF SFTNKE +E+SRA   TKEMK+QLLQEIAQE+ I WNSKA EQQLYT PPQNE DG+RS A KRN
Subjt:  EAVASLIYAAARFADLPELRELRSLFTEKYGSSFESFTNKELVERSRAMAPTKEMKLQLLQEIAQESTIQWNSKAFEQQLYT-PPQNEPDGDRS-ADKRN

Query:  KTKVASVPIYERKTNSLGKKDNSDDDSIFDSRSEDNATETSTGDSSSTDQDVQKGISEDEVEDQRPFYLRFIPPPYLKSKPNKKEASLEEPRKVTDHVNK
        KTKV SVP+YERK NS   K+NSD++SIFDSRSE N TETSTGD  STDQD+ KG+SEDEVEDQ+PF  RF+ PPYLK+KP K EA+ +EPRKVT  +  
Subjt:  KTKVASVPIYERKTNSLGKKDNSDDDSIFDSRSEDNATETSTGDSSSTDQDVQKGISEDEVEDQRPFYLRFIPPPYLKSKPNKKEASLEEPRKVTDHVNK

Query:  DESNKQDHKSVSEEKPKPRSVRRRNVKPQPARDININDVGNSTNDGDEKISSSLIKGKETIIGEEKGAGDDDEERMLDGLLMHYSNKKSSHESK-EAKGN
        DESN  + KS +EEKPKPRSVRRRNVKPQPARDINI+DVG+ST D   KISSS  KGKET+IGEEKGA  DDEER+LDGLLM YS KK++ ESK   K N
Subjt:  DESNKQDHKSVSEEKPKPRSVRRRNVKPQPARDININDVGNSTNDGDEKISSSLIKGKETIIGEEKGAGDDDEERMLDGLLMHYSNKKSSHESK-EAKGN

Query:  LKSQRQQEKDSAEPQRTKHSKAEDVFAPARAVSVPTDPTEPTEPAKRHARTASFVHPKLPDYDQLAARFAALKGK
        LK QRQQEKD+ E QR             RAVS+PTD  E T   K+H RT SFVHPKLP+YDQLAAR AALK K
Subjt:  LKSQRQQEKDSAEPQRTKHSKAEDVFAPARAVSVPTDPTEPTEPAKRHARTASFVHPKLPDYDQLAARFAALKGK

A0A6J1CWS8 vacuolar protein sorting-associated protein IST16.6e-18375.69Show/hide
Query:  MFDMFFKPKFYTKCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECR
        MFDMFFKPKF+TKCKSCVKL K RLD IRKKKNAVLK+LKNDIA+LLRS LD NAY RAEGLL ERN+LKCYDLIDEF GTI+NQLSVLSKQSECPDECR
Subjt:  MFDMFFKPKFYTKCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECR

Query:  EAVASLIYAAARFADLPELRELRSLFTEKYGSSFESFTNKELVERSRAMAPTKEMKLQLLQEIAQESTIQWNSKAFEQQLYTPPQNEPDGDRSADKRNKT
        EAVASLIYAAARFADLPELRELRSLFT KYG SFE FTNKEL+E+  AMA TKE+KLQLLQEIAQES I+WNSKA EQQLYTPPQN PD + +AD+RNK 
Subjt:  EAVASLIYAAARFADLPELRELRSLFTEKYGSSFESFTNKELVERSRAMAPTKEMKLQLLQEIAQESTIQWNSKAFEQQLYTPPQNEPDGDRSADKRNKT

Query:  KVASVPIYERKTNSLGKKDNSDDDSIFDSRSEDNATETSTGDSSSTDQDVQKGISEDEVEDQRPFYLRFIPPPYLKSKPNKKEASLEEPRKVTDHVNKDE
        KV S+P++ERKT+ L KKDNSDDDSIFDSRSEDN TETSTGD SSTDQDVQKG+ EDEVED++PFYLRFIPPPYLKSKP+KKEAS E+P           
Subjt:  KVASVPIYERKTNSLGKKDNSDDDSIFDSRSEDNATETSTGDSSSTDQDVQKGISEDEVEDQRPFYLRFIPPPYLKSKPNKKEASLEEPRKVTDHVNKDE

Query:  SNKQDHKSVSEEKPKPRSVRRRNVKPQPARDININDVGNSTNDGDEKISSSLIKGKETIIGEEKGAGDDDEERMLDGLLMHYSNKKSSHESKEAKGNLKS
                  EEKPKPRSVRRRN K +P RDINI +V NS NDG EK SSS  KGKET++GEEKGAG  +EERMLDGLLMHYS KKS+ ES  AKGNLKS
Subjt:  SNKQDHKSVSEEKPKPRSVRRRNVKPQPARDININDVGNSTNDGDEKISSSLIKGKETIIGEEKGAGDDDEERMLDGLLMHYSNKKSSHESKEAKGNLKS

Query:  QRQQEKDSAEPQRTKHSKA-EDVFAPARAVSVPTDPTEPTEPAKRHARTASFVHPKLPDYDQLAARFAALKGK
        Q+ +E+D  EPQRTK+SKA ED+  P R+VS+P D  EP  P KRH R  SFVHPKLPDYDQLAARFAALK K
Subjt:  QRQQEKDSAEPQRTKHSKA-EDVFAPARAVSVPTDPTEPTEPAKRHARTASFVHPKLPDYDQLAARFAALKGK

A0A6J1GMM1 uncharacterized protein LOC1114558256.6e-16769.41Show/hide
Query:  MFDMFFKPKFYTKCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECR
        MF MFFKPKFYTKCKSCVK+TKTRLD IRKKK  VLK+LKNDI +LL+S LDYNAY RAEG L E+N+L+CYDLIDEFCGTIF  +SVL+K+SECPDEC+
Subjt:  MFDMFFKPKFYTKCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECR

Query:  EAVASLIYAAARFADLPELRELRSLFTEKYGSSFESFTNKELVERSRAMAPTKEMKLQLLQEIAQESTIQWNSKAFEQQLYTPPQNEPDGDRSADK-RNK
        EAVASLIYAAARF+DLPELR LRSLFTE+YG+SF SFTNKELVE+ RA+A TKE KLQLLQEIAQES I WNSKA EQQLY PPQNE D +RS    R+K
Subjt:  EAVASLIYAAARFADLPELRELRSLFTEKYGSSFESFTNKELVERSRAMAPTKEMKLQLLQEIAQESTIQWNSKAFEQQLYTPPQNEPDGDRSADK-RNK

Query:  TKVASVPIYERKTNSLGKKDNSDDDSIFDSRSEDNATETSTGDSSSTDQDVQKG-ISEDEVEDQRPFYLRFIPPPYLKSKPNKKEASLEEPRKVTDHVNK
        +K+ SVP+Y R TNS  KKDNSDDDSIFDSRSE N TETS GD SSTDQDV KG +SEDEVEDQ+PFYLRFI PPYLK+KP KKE ++EEP+      N 
Subjt:  TKVASVPIYERKTNSLGKKDNSDDDSIFDSRSEDNATETSTGDSSSTDQDVQKG-ISEDEVEDQRPFYLRFIPPPYLKSKPNKKEASLEEPRKVTDHVNK

Query:  DESNKQDHKSVSEEKPKPRSVRRRNVKPQPARDININDVGNSTNDGDEKISSSLIKGKETIIGEEKGAGDDDEERMLDGLLMHYSNKKSSHESKEAKGNL
         E N+ D K+  EEKPKPRSVRRR+ KPQPAR INI+D G S  DG  KISSS  KGKE + GEEKG  +DDEERMLDGLL  YS KK+           
Subjt:  DESNKQDHKSVSEEKPKPRSVRRRNVKPQPARDININDVGNSTNDGDEKISSSLIKGKETIIGEEKGAGDDDEERMLDGLLMHYSNKKSSHESKEAKGNL

Query:  KSQRQQEKDSAEPQRTKHSKAEDVFAPARAVSVPTDPTEPTEPAKRHARTASFVHPKLPDYDQLAARFAALKGK
           RQ++KD+AEPQR K+++ ED + P +++S+P   TEP EP K+HAR  S+VHPKLPDYDQLAARFA+LK K
Subjt:  KSQRQQEKDSAEPQRTKHSKAEDVFAPARAVSVPTDPTEPTEPAKRHARTASFVHPKLPDYDQLAARFAALKGK

A0A6J1JV69 uncharacterized protein LOC1114886296.6e-16769.62Show/hide
Query:  MFDMFFKPKFYTKCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECR
        MF MFFKPKFYTKCKSCVK+TKTRLD IRKKK  VLK+LKNDI +LL+S LDYNAY RAEG L E+N+L+CYDLIDEFCGTIFN +SVL+K+SECPDEC+
Subjt:  MFDMFFKPKFYTKCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECR

Query:  EAVASLIYAAARFADLPELRELRSLFTEKYGSSFESFTNKELVERSRAMAPTKEMKLQLLQEIAQESTIQWNSKAFEQQLYTPPQNEPDGDRS-ADKRNK
        EAVASLIYAAARF+DLPELR LRSLFTE+YG+SF SFTNKELVE+ RA+A TKE KLQLLQEIAQES I WNSKA EQQLY PPQNE   +RS +  R+K
Subjt:  EAVASLIYAAARFADLPELRELRSLFTEKYGSSFESFTNKELVERSRAMAPTKEMKLQLLQEIAQESTIQWNSKAFEQQLYTPPQNEPDGDRS-ADKRNK

Query:  TKVASVPIYERKTNSLGKKDNSDDDSIFDSRSEDNATETSTGDSSSTDQDVQKGI-SEDEVEDQRPFYLRFIPPPYLKSKPNKKEASLEEPRKVTDHVNK
        +K+ SVP+Y R TNS  KKDNSDDDSIFDSRSE N TETS GD SSTDQDV KGI SEDEVEDQ+PFYLRFI PPYLK+KP KKE ++EEP+ VT     
Subjt:  TKVASVPIYERKTNSLGKKDNSDDDSIFDSRSEDNATETSTGDSSSTDQDVQKGI-SEDEVEDQRPFYLRFIPPPYLKSKPNKKEASLEEPRKVTDHVNK

Query:  DESNKQDHKSVSEEKPKPRSVRRRNVKPQPARDININDVGNSTNDGDEKISSSLIKGKETIIGEEKGAGDDDEERMLDGLLMHYSNKKSSHESKEAKGNL
         E  + + K+ +EEKPKPRSVRRR+ KPQPAR INI+D G S  DG  KISS   KGKE + GEEKG  +DDEERMLDGLL  YS K +           
Subjt:  DESNKQDHKSVSEEKPKPRSVRRRNVKPQPARDININDVGNSTNDGDEKISSSLIKGKETIIGEEKGAGDDDEERMLDGLLMHYSNKKSSHESKEAKGNL

Query:  KSQRQQEKDSAEPQRTKHSKAEDVFAPARAVSVPTDPTEPTEPAKRHARTASFVHPKLPDYDQLAARFAALKGK
           RQ++KD+AEPQR K+++ ED + PA+++S+PT  TEP EP K+HAR  S+VHPKLPDYDQLAARFAALK K
Subjt:  KSQRQQEKDSAEPQRTKHSKAEDVFAPARAVSVPTDPTEPTEPAKRHARTASFVHPKLPDYDQLAARFAALKGK

SwissProt top hitse value%identityAlignment
Q54I39 IST1-like protein2.1e-1325.15Show/hide
Query:  KCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECREAVASLIYAAAR
        K K  +KL  +R+  ++ KK  +++  K ++A+LLR   + +A  R E ++ +  L++C+ +I+  C  +  ++++++  +E P E +E++ +L+Y++ R
Subjt:  KCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECREAVASLIYAAAR

Query:  FADLPELRELRSLFTEKYGSSFE--------SFTNKELVERSRAMAPTKEMKLQLLQEIAQESTIQW
           +PEL ++++    KYG   E        +  N ++V +     P   +  Q L EIA++  + W
Subjt:  FADLPELRELRSLFTEKYGSSFE--------SFTNKELVERSRAMAPTKEMKLQLLQEIAQESTIQW

Arabidopsis top hitse value%identityAlignment
AT1G52315.1 Regulator of Vps4 activity in the MVB pathway protein1.1e-3633.33Show/hide
Query:  MFDMFFKPKFYTKCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECR
        MF   FKPKFY K KS     K R+D +R+K+ A+++  K DI + L++  D  AY RAE LL E  ++ CYDLI+ FC  I   LS++ K+ ECP+ECR
Subjt:  MFDMFFKPKFYTKCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECR

Query:  EAVASLIYAAARFADLPELRELRSLFTEKYGSSFESFTNKELVERSRAM-APTKEMKLQLLQEIAQESTIQWNSKAFEQQLYTPPQNEPDGDRSADKRNK
        EAV+SLIYA A   D+PEL++LR++FT+++G+   S  N ELVE++  +  P++E+K+Q ++++A E +I W+    +  L            ++  + K
Subjt:  EAVASLIYAAARFADLPELRELRSLFTEKYGSSFESFTNKELVERSRAM-APTKEMKLQLLQEIAQESTIQWNSKAFEQQLYTPPQNEPDGDRSADKRNK

Query:  TKVASVPIYERK-TNSLGKKDNSDDDSIF-DSRSEDNATETSTGDSSSTDQDVQKGISEDEVEDQRPFYLRFIPPPYLK---SKPNKKEASLEEPRKVTD
         KV +   Y  K T      ++SDD+S+  +S   D+ ++ S   SSS+     + +S  + + ++      I  P  K       K +   EE RK+ D
Subjt:  TKVASVPIYERK-TNSLGKKDNSDDDSIF-DSRSEDNATETSTGDSSSTDQDVQKGISEDEVEDQRPFYLRFIPPPYLK---SKPNKKEASLEEPRKVTD

Query:  HVNKDESNKQDHKSVSEEKPKPRSVRRRNVKPQPARDININD
          N      Q  +S++       S  R N +    +  N+ D
Subjt:  HVNKDESNKQDHKSVSEEKPKPRSVRRRNVKPQPARDININD

AT1G79910.1 Regulator of Vps4 activity in the MVB pathway protein2.1e-7247.59Show/hide
Query:  MFDMFFKPKFYTKCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECR
        MFD  FKPKFYTKCKS VK+TKTR+D +++KKN+V KYLKNDI DLL+++LDYNAY RAEGL+ E+  L CY+ +++FC  + + +S+L K   CPDECR
Subjt:  MFDMFFKPKFYTKCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECR

Query:  EAVASLIYAAARFADLPELRELRSLFTEKYGSSFESFTNKELVERSRAMAPTKEMKLQLLQEIAQESTIQWNSKAFEQQLYTPP-QNEPDGDRSADKRNK
        EA++SL+YAAAR +++PELR+LRSLF E+YG++ + F N E VER +A  P+KEMK++LLQEIA+E +I+W++K+ EQ+LYTPP  +    +    K  K
Subjt:  EAVASLIYAAARFADLPELRELRSLFTEKYGSSFESFTNKELVERSRAMAPTKEMKLQLLQEIAQESTIQWNSKAFEQQLYTPP-QNEPDGDRSADKRNK

Query:  TKVASVPIYERKTNSL--GKKDNSDDDSIFDSRSEDNATETSTGDSSSTDQDVQKGISEDEVEDQRPFYLRFI-PPPYLKSKPNKKEASLEEPRKVTDHV
        T        E++++    G++D+ DD       S  +   TS  DS ST      G  +D   + +PFY RF+ P PY K K  K+E+    P K+T   
Subjt:  TKVASVPIYERKTNSL--GKKDNSDDDSIFDSRSEDNATETSTGDSSSTDQDVQKGISEDEVEDQRPFYLRFI-PPPYLKSKPNKKEASLEEPRKVTDHV

Query:  NKDESNKQDHKSVSEEKPKPRSVRRRNVKPQP
            S+  D +S +  KPKPRSVRRR   P P
Subjt:  NKDESNKQDHKSVSEEKPKPRSVRRRNVKPQP

AT1G79910.2 Regulator of Vps4 activity in the MVB pathway protein7.5e-4642.12Show/hide
Query:  EGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECREAVASLIYAAARFADLPELRELRSLFTEKYGSSFESFTNKELVERSRAMAPTKEMKLQL
        EGL+ E+  L CY+ +++FC  + + +S+L K   CPDECREA++SL+YAAAR +++PELR+LRSLF E+YG++ + F N E VER +A  P+KEMK++L
Subjt:  EGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECREAVASLIYAAARFADLPELRELRSLFTEKYGSSFESFTNKELVERSRAMAPTKEMKLQL

Query:  LQEIAQESTIQWNSKAFEQQLYTPP-QNEPDGDRSADKRNKTKVASVPIYERKTNSL--GKKDNSDDDSIFDSRSEDNATETSTGDSSSTDQDVQKGISE
        LQEIA+E +I+W++K+ EQ+LYTPP  +    +    K  KT        E++++    G++D+ DD       S  +   TS  DS ST      G  +
Subjt:  LQEIAQESTIQWNSKAFEQQLYTPP-QNEPDGDRSADKRNKTKVASVPIYERKTNSL--GKKDNSDDDSIFDSRSEDNATETSTGDSSSTDQDVQKGISE

Query:  DEVEDQRPFYLRFI-PPPYLKSKPNKKEASLEEPRKVTDHVNKDESNKQDHKSVSEEKPKPRSVRRRNVKPQP
        D   + +PFY RF+ P PY K K  K+E+    P K+T       S+  D +S +  KPKPRSVRRR   P P
Subjt:  DEVEDQRPFYLRFI-PPPYLKSKPNKKEASLEEPRKVTDHVNKDESNKQDHKSVSEEKPKPRSVRRRNVKPQP

AT4G32350.1 Regulator of Vps4 activity in the MVB pathway protein7.8e-3526.24Show/hide
Query:  MFDMFFKPKFYTKCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECR
        MFD F    F  K K  +KLTK R+D +R+K+NA +K+LK D+ADL+ +  DYNA+ RA GLL E   L   D +++ C  ++ QLS + K  ECP++CR
Subjt:  MFDMFFKPKFYTKCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECR

Query:  EAVASLIYAAARFADLPELRELRSLFTEKYGSSFESFTNKELVERSRAMAPTKEMKLQLLQEIAQESTIQWNSKAFEQQLY-------------------
        EA++SL++AA+ F++LPELRELR +F EKY  S   F N+ELVE   +   + E K++L++++A E +I+W+SK FE+++                    
Subjt:  EAVASLIYAAARFADLPELRELRSLFTEKYGSSFESFTNKELVERSRAMAPTKEMKLQLLQEIAQESTIQWNSKAFEQQLY-------------------

Query:  -------TPPQNEPDGDRSADKRN-KTKVASV---PIY----------------------------------------ERKTNSLGKKDN----------
                P + E +G  +    N KT  AS    P++                                        ERK   L  K N          
Subjt:  -------TPPQNEPDGDRSADKRN-KTKVASV---PIY----------------------------------------ERKTNSLGKKDN----------

Query:  -----------------------------------------------------------SDDDSI---------------------FDSRSEDNATETST
                                                                   +D D+I                        + E NA+E   
Subjt:  -----------------------------------------------------------SDDDSI---------------------FDSRSEDNATETST

Query:  GDSSSTDQDVQKGISED-----------------EVED--QRPFYLRFIPPPYLKSK---------------PNKKEASL------EEP-----------
          SS     +  G  ++                 +VED   RP      P P  KSK                +K+ A L      E+P           
Subjt:  GDSSSTDQDVQKGISED-----------------EVED--QRPFYLRFIPPPYLKSK---------------PNKKEASL------EEP-----------

Query:  -------RKVTD-----------------HVNKDESNKQDHKSVSEEKPKPRSVRRRNVKPQPARDI------------NINDVGNST------------
               RKV +                 +V K  +  +  K+ + + PK R        P   +++             +ND+ N++            
Subjt:  -------RKVTD-----------------HVNKDESNKQDHKSVSEEKPKPRSVRRRNVKPQPARDI------------NINDVGNST------------

Query:  -NDGDEKISS----SLIKGKETIIGEEKGAGDDDEERMLDGLLMHYSNKKSSHESKEAKGNLKSQRQQEKDSAEPQRTKHSKAEDVFAPARAVSVPTDP-
         + GD+ ISS    +  KG + +I E++    D EE+M+D LLMHYS K SS+E    +   KS+R   K     +         +  PAR+ S+P +  
Subjt:  -NDGDEKISS----SLIKGKETIIGEEKGAGDDDEERMLDGLLMHYSNKKSSHESKEAKGNLKSQRQQEKDSAEPQRTKHSKAEDVFAPARAVSVPTDP-

Query:  TEPTEPAKRHARTASF----------VHPKLPDYDQLAARFAALKGK
          P+EPAK  AR ASF          VHPKLP+YD LAARFA LKG+
Subjt:  TEPTEPAKRHARTASF----------VHPKLPDYDQLAARFAALKGK

AT4G35730.1 Regulator of Vps4 activity in the MVB pathway protein5.1e-2625.68Show/hide
Query:  TKCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECREAVASLIYAAA
        +KCK+  K+   R+  IR K+  V+K ++ DIA LL+S  D  A  R E ++ E+N+    ++I+ FC  I ++L++++KQ +CP + +E +ASLI+AA 
Subjt:  TKCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECREAVASLIYAAA

Query:  RFADLPELRELRSLFTEKYGSSFESF---------TNKELVERSRAMAPTKEMKLQLLQEIAQESTIQWNSKAFEQQLYTPPQNEPDGDRSADKRNKTKV
        R +++PEL +LR +F +KYG  F S           N+ L+++     P  E KL++++EIA+E  + W++   EQ+L  P +   DG R          
Subjt:  RFADLPELRELRSLFTEKYGSSFESF---------TNKELVERSRAMAPTKEMKLQLLQEIAQESTIQWNSKAFEQQLYTPPQNEPDGDRSADKRNKTKV

Query:  ASVPIYERKTNSLGKKDNSDDDSIFDSRSEDNATETSTGDSSSTDQDVQKGISEDEVEDQRPFYLRFIPPPYLKSKPNKKEASLEEPRKVTDHV--NKDE
        +S+P+     N     +  D        +   +  T   D+ S  +   +   +     Q    L             ++++S +E    +DH    KD 
Subjt:  ASVPIYERKTNSLGKKDNSDDDSIFDSRSEDNATETSTGDSSSTDQDVQKGISEDEVEDQRPFYLRFIPPPYLKSKPNKKEASLEEPRKVTDHV--NKDE

Query:  SNKQDHKSVSEEKPKPRSVRRRNVKPQP-------ARDININDVGNSTNDGDEKISSSLIKGKETIIGEEKGAGDDDEERMLDGLLMHYSNKKS--SHES
             H      + + R     +   +P        R  + N+ G + +D +E+ +++  + KET+                     H  N +S     +
Subjt:  SNKQDHKSVSEEKPKPRSVRRRNVKPQP-------ARDININDVGNSTNDGDEKISSSLIKGKETIIGEEKGAGDDDEERMLDGLLMHYSNKKS--SHES

Query:  KEAKGNLKSQRQQEKDSAEPQRTKHSKAEDVFAPARAVSVPTDPTEPTEPAKRHARTASFVHPKLPDYDQLAARFAALK
         E K +     ++E +  EP + + S      AP         P  P     R   +   VHPKLPDYD LAARF A++
Subjt:  KEAKGNLKSQRQQEKDSAEPQRTKHSKAEDVFAPARAVSVPTDPTEPTEPAKRHARTASFVHPKLPDYDQLAARFAALK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCGACATGTTCTTCAAGCCTAAGTTCTACACAAAATGCAAATCTTGCGTAAAGCTGACGAAGACGCGGCTGGATGCAATCAGGAAGAAGAAGAATGCCGTCCTCAA
GTATTTGAAGAACGACATTGCTGATCTTCTCAGGAGTGCTCTTGACTACAATGCTTATTGCAGGGCTGAAGGGCTTCTTGCTGAGCGAAATCTTTTAAAATGTTATGATC
TGATTGACGAGTTTTGTGGGACAATCTTCAATCAACTCTCTGTTTTGAGCAAACAGAGCGAGTGTCCCGACGAATGCAGAGAAGCAGTTGCGTCGTTGATCTATGCTGCA
GCGAGATTCGCTGACTTGCCTGAATTGCGAGAGCTCCGAAGTCTCTTTACTGAGAAATATGGAAGTTCTTTTGAATCATTTACCAACAAAGAGCTTGTCGAGAGGTCGAG
GGCAATGGCTCCAACTAAAGAGATGAAGCTTCAGCTACTCCAGGAGATAGCACAGGAATCGACCATTCAATGGAATTCCAAGGCTTTTGAACAGCAACTGTATACTCCTC
CCCAAAACGAACCTGATGGTGACAGGAGTGCAGACAAAAGAAACAAAACCAAGGTTGCTTCTGTTCCAATATATGAAAGAAAGACGAATTCACTCGGAAAAAAGGACAAT
AGCGACGACGATTCCATCTTCGATAGTAGAAGTGAAGACAACGCAACTGAAACCTCAACAGGAGACAGCAGTAGTACTGATCAAGATGTTCAAAAAGGTATTTCTGAGGA
TGAAGTAGAAGATCAGAGACCCTTCTACCTTAGATTTATCCCACCTCCCTACCTCAAATCTAAACCTAACAAAAAGGAGGCGAGTTTGGAGGAGCCTCGAAAAGTTACGG
ATCATGTCAACAAGGATGAAAGTAATAAGCAAGATCACAAGTCAGTTTCAGAAGAGAAGCCGAAACCCAGATCTGTGAGGCGAAGAAATGTGAAACCGCAGCCTGCAAGG
GACATCAACATCAATGATGTTGGCAATTCTACAAACGATGGTGATGAAAAGATTAGTTCGAGTCTAATCAAAGGGAAAGAAACCATTATAGGAGAAGAGAAAGGTGCTGG
GGATGATGATGAGGAGAGGATGTTAGATGGGCTTTTGATGCATTACAGTAATAAAAAATCAAGTCATGAATCAAAGGAAGCAAAAGGCAACCTCAAATCCCAAAGACAAC
AAGAGAAAGACAGTGCTGAACCTCAAAGGACCAAACATTCAAAAGCTGAGGATGTTTTTGCCCCAGCCAGAGCGGTGTCAGTTCCCACAGACCCAACTGAGCCAACTGAG
CCAGCGAAGAGGCATGCTAGAACAGCTTCCTTTGTGCATCCTAAGCTCCCGGACTACGACCAACTCGCAGCTCGGTTTGCGGCTCTCAAGGGGAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTCGACATGTTCTTCAAGCCTAAGTTCTACACAAAATGCAAATCTTGCGTAAAGCTGACGAAGACGCGGCTGGATGCAATCAGGAAGAAGAAGAATGCCGTCCTCAA
GTATTTGAAGAACGACATTGCTGATCTTCTCAGGAGTGCTCTTGACTACAATGCTTATTGCAGGGCTGAAGGGCTTCTTGCTGAGCGAAATCTTTTAAAATGTTATGATC
TGATTGACGAGTTTTGTGGGACAATCTTCAATCAACTCTCTGTTTTGAGCAAACAGAGCGAGTGTCCCGACGAATGCAGAGAAGCAGTTGCGTCGTTGATCTATGCTGCA
GCGAGATTCGCTGACTTGCCTGAATTGCGAGAGCTCCGAAGTCTCTTTACTGAGAAATATGGAAGTTCTTTTGAATCATTTACCAACAAAGAGCTTGTCGAGAGGTCGAG
GGCAATGGCTCCAACTAAAGAGATGAAGCTTCAGCTACTCCAGGAGATAGCACAGGAATCGACCATTCAATGGAATTCCAAGGCTTTTGAACAGCAACTGTATACTCCTC
CCCAAAACGAACCTGATGGTGACAGGAGTGCAGACAAAAGAAACAAAACCAAGGTTGCTTCTGTTCCAATATATGAAAGAAAGACGAATTCACTCGGAAAAAAGGACAAT
AGCGACGACGATTCCATCTTCGATAGTAGAAGTGAAGACAACGCAACTGAAACCTCAACAGGAGACAGCAGTAGTACTGATCAAGATGTTCAAAAAGGTATTTCTGAGGA
TGAAGTAGAAGATCAGAGACCCTTCTACCTTAGATTTATCCCACCTCCCTACCTCAAATCTAAACCTAACAAAAAGGAGGCGAGTTTGGAGGAGCCTCGAAAAGTTACGG
ATCATGTCAACAAGGATGAAAGTAATAAGCAAGATCACAAGTCAGTTTCAGAAGAGAAGCCGAAACCCAGATCTGTGAGGCGAAGAAATGTGAAACCGCAGCCTGCAAGG
GACATCAACATCAATGATGTTGGCAATTCTACAAACGATGGTGATGAAAAGATTAGTTCGAGTCTAATCAAAGGGAAAGAAACCATTATAGGAGAAGAGAAAGGTGCTGG
GGATGATGATGAGGAGAGGATGTTAGATGGGCTTTTGATGCATTACAGTAATAAAAAATCAAGTCATGAATCAAAGGAAGCAAAAGGCAACCTCAAATCCCAAAGACAAC
AAGAGAAAGACAGTGCTGAACCTCAAAGGACCAAACATTCAAAAGCTGAGGATGTTTTTGCCCCAGCCAGAGCGGTGTCAGTTCCCACAGACCCAACTGAGCCAACTGAG
CCAGCGAAGAGGCATGCTAGAACAGCTTCCTTTGTGCATCCTAAGCTCCCGGACTACGACCAACTCGCAGCTCGGTTTGCGGCTCTCAAGGGGAAGTAA
Protein sequenceShow/hide protein sequence
MFDMFFKPKFYTKCKSCVKLTKTRLDAIRKKKNAVLKYLKNDIADLLRSALDYNAYCRAEGLLAERNLLKCYDLIDEFCGTIFNQLSVLSKQSECPDECREAVASLIYAA
ARFADLPELRELRSLFTEKYGSSFESFTNKELVERSRAMAPTKEMKLQLLQEIAQESTIQWNSKAFEQQLYTPPQNEPDGDRSADKRNKTKVASVPIYERKTNSLGKKDN
SDDDSIFDSRSEDNATETSTGDSSSTDQDVQKGISEDEVEDQRPFYLRFIPPPYLKSKPNKKEASLEEPRKVTDHVNKDESNKQDHKSVSEEKPKPRSVRRRNVKPQPAR
DININDVGNSTNDGDEKISSSLIKGKETIIGEEKGAGDDDEERMLDGLLMHYSNKKSSHESKEAKGNLKSQRQQEKDSAEPQRTKHSKAEDVFAPARAVSVPTDPTEPTE
PAKRHARTASFVHPKLPDYDQLAARFAALKGK