; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC09g0291 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC09g0291
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionRegulator of Vps4 activity in the MVB pathway protein, putative
Genome locationMC09:2609493..2612958
RNA-Seq ExpressionMC09g0291
SyntenyMC09g0291
Gene Ontology termsGO:0015031 - protein transport (biological process)
InterPro domainsIPR005061 - Vacuolar protein sorting-associated protein Ist1
IPR042277 - Vacuolar protein sorting-associated protein IST1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575541.1 IST1-like protein, partial [Cucurbita argyrosperma subsp. sororia]8.68e-19467.74Show/hide
Query:  MFDMFFKPKFFTKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECR
        MF MFFKPKF+TKCKSCVK+ K RLDTIRKKK  VLKFLKNDI ELL+S LD NAYNRAEG LVE+NVL+CYDLIDEF GTI+  +SVL+K+SECPDEC+
Subjt:  MFDMFFKPKFFTKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECR

Query:  EAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEWNSKALEQQLYTPPQNTPDEEPNAD-RRNK
        EAVASLIYAAARF+DLPELR LRSLFT +YG SF  FTNK L+EK  A+A TKE KLQLLQEIAQES I+WNSKALEQQLY PPQN  D E +    RNK
Subjt:  EAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEWNSKALEQQLYTPPQNTPDEEPNAD-RRNK

Query:  AKVVSIPVFERKTSLLQKKDNSDDDSIFDSRSEDNGTETSTGDSSTDQDVQKGVF-EDEVEDRKPFYLRFIPPPYLKSKPSKKEASFEKP----------
        +K+VS+PV+ R T+  +KK+NSDDDSIFDSRSE N TETS GDSSTDQDV KGV  EDEVED+KPFYLRFI PPYLK+KP+KKE + E+P          
Subjt:  AKVVSIPVFERKTSLLQKKDNSDDDSIFDSRSEDNGTETSTGDSSTDQDVQKGVF-EDEVEDRKPFYLRFIPPPYLKSKPSKKEASFEKP----------

Query:  ---AVEEKPKPRSVRRRNSKLEPTRDINIGNVDNSANDGTEKASSSRNKGKETMMGEEKGA-GEEERMLDGLLMHYSKKKSTQESNRAKGNLKSQKHKEQ
           A+EEKPKPRSVRRR++K +P R INI +   SA DG  K SS RNKGKE M GEEKG   +EERMLDGLL  YSKKK+ QE             K+ 
Subjt:  ---AVEEKPKPRSVRRRNSKLEPTRDINIGNVDNSANDGTEKASSSRNKGKETMMGEEKGA-GEEERMLDGLLMHYSKKKSTQESNRAKGNLKSQKHKEQ

Query:  DEPQRTKNSKAEEDLSVPGRSVSLPRDTNELIVPMKRHTRNNSFVHPKLPDYDQLAARFAALKEK
         EPQR KN++ E D  VP +S+SLP    E I P+K+H R NS+VHPKLPDYDQLAARFAALKEK
Subjt:  DEPQRTKNSKAEEDLSVPGRSVSLPRDTNELIVPMKRHTRNNSFVHPKLPDYDQLAARFAALKEK

KAG7014085.1 IST1-like protein [Cucurbita argyrosperma subsp. argyrosperma]1.23e-19367.74Show/hide
Query:  MFDMFFKPKFFTKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECR
        MF MFFKPKF+TKCKSCVK+ K RLDTIRKKK  VLKFLKNDI ELL+S LD NAYNRAEG LVE+NVL+CYDLIDEF GTI+  +SVL+K+SECPDEC+
Subjt:  MFDMFFKPKFFTKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECR

Query:  EAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEWNSKALEQQLYTPPQNTPDEEPNAD-RRNK
        EAVASLIYAAARF+DLPELR LRSLFT +YG SF  FTNK L+EK  A+A TKE KLQLLQEIAQES I+WNSKALEQQLY PPQN  D E +    R+K
Subjt:  EAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEWNSKALEQQLYTPPQNTPDEEPNAD-RRNK

Query:  AKVVSIPVFERKTSLLQKKDNSDDDSIFDSRSEDNGTETSTGDSSTDQDVQKGVF-EDEVEDRKPFYLRFIPPPYLKSKPSKKEASFEKP----------
        +K+VS+PV+ R T+  +KKDNSDDDSIFDSRSE N TETS GDSSTDQDV KGV  EDEVED+KPFYLRFI PPYLK+KP+KKE + E+P          
Subjt:  AKVVSIPVFERKTSLLQKKDNSDDDSIFDSRSEDNGTETSTGDSSTDQDVQKGVF-EDEVEDRKPFYLRFIPPPYLKSKPSKKEASFEKP----------

Query:  ---AVEEKPKPRSVRRRNSKLEPTRDINIGNVDNSANDGTEKASSSRNKGKETMMGEEKGA-GEEERMLDGLLMHYSKKKSTQESNRAKGNLKSQKHKEQ
           A+EEKPKPRSVRRR++K +P R INI +   SA DG  K SS RNKGKE M GEEKG   +EERMLDGLL  YSKKK+ QE             K+ 
Subjt:  ---AVEEKPKPRSVRRRNSKLEPTRDINIGNVDNSANDGTEKASSSRNKGKETMMGEEKGA-GEEERMLDGLLMHYSKKKSTQESNRAKGNLKSQKHKEQ

Query:  DEPQRTKNSKAEEDLSVPGRSVSLPRDTNELIVPMKRHTRNNSFVHPKLPDYDQLAARFAALKEK
         EPQR KN++ E D  VP +S+SLP    E I P+K+H R NS+VHPKLPDYDQLAARFAALKEK
Subjt:  DEPQRTKNSKAEEDLSVPGRSVSLPRDTNELIVPMKRHTRNNSFVHPKLPDYDQLAARFAALKEK

XP_022146255.1 vacuolar protein sorting-associated protein IST1 [Momordica charantia]1.12e-30799.55Show/hide
Query:  MFDMFFKPKFFTKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECR
        MFDMFFKPKFFTKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECR
Subjt:  MFDMFFKPKFFTKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECR

Query:  EAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEWNSKALEQQLYTPPQNTPDEEPNADRRNKA
        EAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNK LIEKPSAMAMTKEVKLQLLQEIAQESGIEWNSKALEQQLYTPPQNTPDEEPNADRRNKA
Subjt:  EAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEWNSKALEQQLYTPPQNTPDEEPNADRRNKA

Query:  KVVSIPVFERKTSLLQKKDNSDDDSIFDSRSEDNGTETSTGDSSTDQDVQKGVFEDEVEDRKPFYLRFIPPPYLKSKPSKKEASFEKPAVEEKPKPRSVR
        KVVSIPVFERKTSLLQKKDNSDDDSIFDSRSEDNGTETSTGDSSTDQDVQKGVFEDEVEDRKPFYLRFIPPPYLKSKPSKKEASFEKPAVEEKPKPRSVR
Subjt:  KVVSIPVFERKTSLLQKKDNSDDDSIFDSRSEDNGTETSTGDSSTDQDVQKGVFEDEVEDRKPFYLRFIPPPYLKSKPSKKEASFEKPAVEEKPKPRSVR

Query:  RRNSKLEPTRDINIGNVDNSANDGTEKASSSRNKGKETMMGEEKGAGEEERMLDGLLMHYSKKKSTQESNRAKGNLKSQKHKEQDEPQRTKNSKAEEDLS
        RRNSKLEPTRDINIGNVDNSANDGTEKASSSRNKGKETMMGEEKGAGEEERMLDGLLMHYSKKKSTQESNRAKGNLKSQKHKEQDEPQRTKNSKAEEDLS
Subjt:  RRNSKLEPTRDINIGNVDNSANDGTEKASSSRNKGKETMMGEEKGAGEEERMLDGLLMHYSKKKSTQESNRAKGNLKSQKHKEQDEPQRTKNSKAEEDLS

Query:  VPGRSVSLPRDTNELIVPMKRHTRNNSFVHPKLPDYDQLAARFAALKEK
        VPGRSVSLPRDTNE IVPMKRHTRNNSFVHPKLPDYDQLAARFAALKEK
Subjt:  VPGRSVSLPRDTNELIVPMKRHTRNNSFVHPKLPDYDQLAARFAALKEK

XP_022953212.1 uncharacterized protein LOC111455825 [Cucurbita moschata]3.52e-19367.74Show/hide
Query:  MFDMFFKPKFFTKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECR
        MF MFFKPKF+TKCKSCVK+ K RLDTIRKKK  VLKFLKNDI ELL+S LD NAYNRAEG LVE+NVL+CYDLIDEF GTI+  +SVL+K+SECPDEC+
Subjt:  MFDMFFKPKFFTKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECR

Query:  EAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEWNSKALEQQLYTPPQNTPD-EEPNADRRNK
        EAVASLIYAAARF+DLPELR LRSLFT +YG SF  FTNK L+EK  A+A TKE KLQLLQEIAQES I+WNSKALEQQLY PPQN  D E   +  R+K
Subjt:  EAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEWNSKALEQQLYTPPQNTPD-EEPNADRRNK

Query:  AKVVSIPVFERKTSLLQKKDNSDDDSIFDSRSEDNGTETSTGDSSTDQDVQKGVF-EDEVEDRKPFYLRFIPPPYLKSKPSKKEASFEKP----------
        +K+VS+PV+ R T+  +KKDNSDDDSIFDSRSE N TETS GDSSTDQDV KGV  EDEVED+KPFYLRFI PPYLK+KP+KKE + E+P          
Subjt:  AKVVSIPVFERKTSLLQKKDNSDDDSIFDSRSEDNGTETSTGDSSTDQDVQKGVF-EDEVEDRKPFYLRFIPPPYLKSKPSKKEASFEKP----------

Query:  ---AVEEKPKPRSVRRRNSKLEPTRDINIGNVDNSANDGTEKASSSRNKGKETMMGEEKGA-GEEERMLDGLLMHYSKKKSTQESNRAKGNLKSQKHKEQ
           A+EEKPKPRSVRRR++K +P R INI +   SA DG  K SSSRNKGKE M GEEKG   +EERMLDGLL  YSKKK+ QE             K+ 
Subjt:  ---AVEEKPKPRSVRRRNSKLEPTRDINIGNVDNSANDGTEKASSSRNKGKETMMGEEKGA-GEEERMLDGLLMHYSKKKSTQESNRAKGNLKSQKHKEQ

Query:  DEPQRTKNSKAEEDLSVPGRSVSLPRDTNELIVPMKRHTRNNSFVHPKLPDYDQLAARFAALKEK
         EPQR KN++ E D  VP +S+SLP    E I P+K+H R NS+VHPKLPDYDQLAARFA+LKEK
Subjt:  DEPQRTKNSKAEEDLSVPGRSVSLPRDTNELIVPMKRHTRNNSFVHPKLPDYDQLAARFAALKEK

XP_038897244.1 uncharacterized protein LOC120085370 [Benincasa hispida]1.87e-20169.28Show/hide
Query:  MFDMFFKPKFFTKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECR
        MFDMF KPKF+TKCKSCVK+ K RLD+IRKKK AVLK+LKNDI ELL+S LD NAYNRAEG LVERNVL+CY LIDEF GTI +Q+ VLSK+SECP+EC+
Subjt:  MFDMFFKPKFFTKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECR

Query:  EAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEWNSKALEQQLYTPP-QNTPDEEPNAD-RRN
        EAVA+LIYAAARFADLPELRELRSLFT KYG SF  FTNK LIEK  A A TKE+K+QLLQEIAQ+S I+WNSKALEQQLYTPP QN P  E +A  +RN
Subjt:  EAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEWNSKALEQQLYTPP-QNTPDEEPNAD-RRN

Query:  KAKVVSIPVFERKTSLLQKKDNSDDDSIFDSRSEDNGTETSTGDSSTDQDVQKGVFEDEVEDRKPFYLRFIPPPYLKSKPSKKEASFE------------
        K K+V++PV ERKT   +KKDNSD++SIFDSRSE N TETSTGD STDQDV KG+ EDEVED+KPFYLRF+PPPYLK+KP K EA+ E            
Subjt:  KAKVVSIPVFERKTSLLQKKDNSDDDSIFDSRSEDNGTETSTGDSSTDQDVQKGVFEDEVEDRKPFYLRFIPPPYLKSKPSKKEASFE------------

Query:  ------KPAVEEKPKPRSVRRRNSKLEPTRDINIGNVDNSANDGTEKASSSRNKGKETMMGEEKGAGE-EERMLDGLLMHYSKKKSTQESNRAKGNLKSQ
              K   EEKPKPRSVRRRN K +P RDINI +  +S  DG EK S S+NKGKETM+GEE+GAG+ EER+LDGLLM YS+KK++QESNR KGNLK Q
Subjt:  ------KPAVEEKPKPRSVRRRNSKLEPTRDINIGNVDNSANDGTEKASSSRNKGKETMMGEEKGAGE-EERMLDGLLMHYSKKKSTQESNRAKGNLKSQ

Query:  KHKEQD--EPQRTKNSKAEEDLSVPGRSVSLPRDTNELIVPMKRHTRNNSFVHPKLPDYDQLAARFAALKEK
        + +E+D  EPQ+ K++K E D  VP R+VSLP D +E   P+K+HTR NSFVHPKLP+YDQLAAR AALKEK
Subjt:  KHKEQD--EPQRTKNSKAEEDLSVPGRSVSLPRDTNELIVPMKRHTRNNSFVHPKLPDYDQLAARFAALKEK

TrEMBL top hitse value%identityAlignment
A0A0A0K926 Uncharacterized protein1.56e-18265.81Show/hide
Query:  MFDMFFKPKFFTKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECR
        MFDMF KPKF+TKCKSCVK+ K RLDT RKKKNAVLK+LKNDI ELL+S LD NAYNRAEG LVERNVL+CY+LIDEF GTI NQ+ VL+K+SECPDEC+
Subjt:  MFDMFFKPKFFTKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECR

Query:  EAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEWNSKALEQQLYTPP-QNTPD-EEPNADRRN
        E+VA+LIYAAARFADLPELRELR+LFT KYG SF  FTNK  IEK      TKE+K+QLLQEIAQE+ I+WNSKALEQQLYTPP +N  D E   A +RN
Subjt:  EAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEWNSKALEQQLYTPP-QNTPD-EEPNADRRN

Query:  KAKVVSIPVFERKTSLLQKKDNSDDDSIFDSRSEDNGTETSTGDSSTDQDVQKGVFEDEVEDRKPFYLRFIPPPYLKSKPSKKEASFE------------
        K KVVS+PV+E+K +  + K+NSD++SIFDSRSE N TETSTGD STDQDV KGV  DEV D+KPF  RF+ PPYLK+KP K EA+ E            
Subjt:  KAKVVSIPVFERKTSLLQKKDNSDDDSIFDSRSEDNGTETSTGDSSTDQDVQKGVFEDEVEDRKPFYLRFIPPPYLKSKPSKKEASFE------------

Query:  ---KPAVEEKPKPRSVRRRNSKLEPTRDINIGNVDNSANDGTEKASSSRNKGKETMMGEEKGA-GEEERMLDGLLMHYSKKKSTQES-NRAKGNLKSQKH
           K   EEKPKPRSVRRR  K +P RDINI +V +S  D T+K SS RNKGKETM+GEEKGA  +EER+LDGLLM YSKKK+ QES +R K NLK Q+ 
Subjt:  ---KPAVEEKPKPRSVRRRNSKLEPTRDINIGNVDNSANDGTEKASSSRNKGKETMMGEEKGA-GEEERMLDGLLMHYSKKKSTQES-NRAKGNLKSQKH

Query:  KEQDEPQRTKNSKAEEDLSVPGRSVSLPRDTNELIVPMKRHTRNNSFVHPKLPDYDQLAARFAALKEK
        +E+D  +  + +          R+VS P D NE   PMK+HTR NSFVHPKLP+YDQLAAR AALKEK
Subjt:  KEQDEPQRTKNSKAEEDLSVPGRSVSLPRDTNELIVPMKRHTRNNSFVHPKLPDYDQLAARFAALKEK

A0A1S3CGT5 uncharacterized protein LOC1035005773.91e-18866.88Show/hide
Query:  MFDMFFKPKFFTKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECR
        MFDMF KPKF+TKCKSCVK+ K RLDTIRKKKNAVLK+LKNDI ELL+S LD NA+NRAEG LVERNVL+CY+LIDEF GTI NQ+ VL+K+SECPDEC+
Subjt:  MFDMFFKPKFFTKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECR

Query:  EAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEWNSKALEQQLYTPP-QNTPD-EEPNADRRN
        EAVA+LIYAAARFADLPELRELR+LFT KYG SF  FTNK  IEK  A   TKE+K+QLLQEIAQE+ I+WNSKALEQQLYTPP QN  D E   A +RN
Subjt:  EAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEWNSKALEQQLYTPP-QNTPD-EEPNADRRN

Query:  KAKVVSIPVFERKTSLLQKKDNSDDDSIFDSRSEDNGTETSTGDSSTDQDVQKGVFEDEVEDRKPFYLRFIPPPYLKSKPSKKEASFEKPA---------
        K KVVS+PV+ERK +  + K+NSD++SIFDSRSE N TETSTGD STDQD+ KGV EDEVED+KPF  RF+ PPYLK+KP K EA+ ++P          
Subjt:  KAKVVSIPVFERKTSLLQKKDNSDDDSIFDSRSEDNGTETSTGDSSTDQDVQKGVFEDEVEDRKPFYLRFIPPPYLKSKPSKKEASFEKPA---------

Query:  ------VEEKPKPRSVRRRNSKLEPTRDINIGNVDNSANDGTEKASSSRNKGKETMMGEEKGA-GEEERMLDGLLMHYSKKKSTQES-NRAKGNLKSQKH
               EEKPKPRSVRRRN K +P RDINI +V +S  D T K SSSRNKGKETM+GEEKGA  +EER+LDGLLM YSKKK+ QES +R K NLK Q+ 
Subjt:  ------VEEKPKPRSVRRRNSKLEPTRDINIGNVDNSANDGTEKASSSRNKGKETMMGEEKGA-GEEERMLDGLLMHYSKKKSTQES-NRAKGNLKSQKH

Query:  KEQDEPQRTKNSKAEEDLSVPGRSVSLPRDTNELIVPMKRHTRNNSFVHPKLPDYDQLAARFAALKEK
        +E+D  +  + +          R+VSLP D NE     K+HTR NSFVHPKLP+YDQLAAR AALKEK
Subjt:  KEQDEPQRTKNSKAEEDLSVPGRSVSLPRDTNELIVPMKRHTRNNSFVHPKLPDYDQLAARFAALKEK

A0A6J1CWS8 vacuolar protein sorting-associated protein IST15.44e-30899.55Show/hide
Query:  MFDMFFKPKFFTKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECR
        MFDMFFKPKFFTKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECR
Subjt:  MFDMFFKPKFFTKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECR

Query:  EAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEWNSKALEQQLYTPPQNTPDEEPNADRRNKA
        EAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNK LIEKPSAMAMTKEVKLQLLQEIAQESGIEWNSKALEQQLYTPPQNTPDEEPNADRRNKA
Subjt:  EAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEWNSKALEQQLYTPPQNTPDEEPNADRRNKA

Query:  KVVSIPVFERKTSLLQKKDNSDDDSIFDSRSEDNGTETSTGDSSTDQDVQKGVFEDEVEDRKPFYLRFIPPPYLKSKPSKKEASFEKPAVEEKPKPRSVR
        KVVSIPVFERKTSLLQKKDNSDDDSIFDSRSEDNGTETSTGDSSTDQDVQKGVFEDEVEDRKPFYLRFIPPPYLKSKPSKKEASFEKPAVEEKPKPRSVR
Subjt:  KVVSIPVFERKTSLLQKKDNSDDDSIFDSRSEDNGTETSTGDSSTDQDVQKGVFEDEVEDRKPFYLRFIPPPYLKSKPSKKEASFEKPAVEEKPKPRSVR

Query:  RRNSKLEPTRDINIGNVDNSANDGTEKASSSRNKGKETMMGEEKGAGEEERMLDGLLMHYSKKKSTQESNRAKGNLKSQKHKEQDEPQRTKNSKAEEDLS
        RRNSKLEPTRDINIGNVDNSANDGTEKASSSRNKGKETMMGEEKGAGEEERMLDGLLMHYSKKKSTQESNRAKGNLKSQKHKEQDEPQRTKNSKAEEDLS
Subjt:  RRNSKLEPTRDINIGNVDNSANDGTEKASSSRNKGKETMMGEEKGAGEEERMLDGLLMHYSKKKSTQESNRAKGNLKSQKHKEQDEPQRTKNSKAEEDLS

Query:  VPGRSVSLPRDTNELIVPMKRHTRNNSFVHPKLPDYDQLAARFAALKEK
        VPGRSVSLPRDTNE IVPMKRHTRNNSFVHPKLPDYDQLAARFAALKEK
Subjt:  VPGRSVSLPRDTNELIVPMKRHTRNNSFVHPKLPDYDQLAARFAALKEK

A0A6J1GMM1 uncharacterized protein LOC1114558251.70e-19367.74Show/hide
Query:  MFDMFFKPKFFTKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECR
        MF MFFKPKF+TKCKSCVK+ K RLDTIRKKK  VLKFLKNDI ELL+S LD NAYNRAEG LVE+NVL+CYDLIDEF GTI+  +SVL+K+SECPDEC+
Subjt:  MFDMFFKPKFFTKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECR

Query:  EAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEWNSKALEQQLYTPPQNTPD-EEPNADRRNK
        EAVASLIYAAARF+DLPELR LRSLFT +YG SF  FTNK L+EK  A+A TKE KLQLLQEIAQES I+WNSKALEQQLY PPQN  D E   +  R+K
Subjt:  EAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEWNSKALEQQLYTPPQNTPD-EEPNADRRNK

Query:  AKVVSIPVFERKTSLLQKKDNSDDDSIFDSRSEDNGTETSTGDSSTDQDVQKGVF-EDEVEDRKPFYLRFIPPPYLKSKPSKKEASFEKP----------
        +K+VS+PV+ R T+  +KKDNSDDDSIFDSRSE N TETS GDSSTDQDV KGV  EDEVED+KPFYLRFI PPYLK+KP+KKE + E+P          
Subjt:  AKVVSIPVFERKTSLLQKKDNSDDDSIFDSRSEDNGTETSTGDSSTDQDVQKGVF-EDEVEDRKPFYLRFIPPPYLKSKPSKKEASFEKP----------

Query:  ---AVEEKPKPRSVRRRNSKLEPTRDINIGNVDNSANDGTEKASSSRNKGKETMMGEEKGA-GEEERMLDGLLMHYSKKKSTQESNRAKGNLKSQKHKEQ
           A+EEKPKPRSVRRR++K +P R INI +   SA DG  K SSSRNKGKE M GEEKG   +EERMLDGLL  YSKKK+ QE             K+ 
Subjt:  ---AVEEKPKPRSVRRRNSKLEPTRDINIGNVDNSANDGTEKASSSRNKGKETMMGEEKGA-GEEERMLDGLLMHYSKKKSTQESNRAKGNLKSQKHKEQ

Query:  DEPQRTKNSKAEEDLSVPGRSVSLPRDTNELIVPMKRHTRNNSFVHPKLPDYDQLAARFAALKEK
         EPQR KN++ E D  VP +S+SLP    E I P+K+H R NS+VHPKLPDYDQLAARFA+LKEK
Subjt:  DEPQRTKNSKAEEDLSVPGRSVSLPRDTNELIVPMKRHTRNNSFVHPKLPDYDQLAARFAALKEK

A0A6J1JV69 uncharacterized protein LOC1114886296.50e-19166.88Show/hide
Query:  MFDMFFKPKFFTKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECR
        MF MFFKPKF+TKCKSCVK+ K RLDTIRKKK  VLKFLKNDI ELL+S LD NAYNRAEG LVE+NVL+CYDLIDEF GTI+N +SVL+K+SECPDEC+
Subjt:  MFDMFFKPKFFTKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECR

Query:  EAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEWNSKALEQQLYTPPQNT-PDEEPNADRRNK
        EAVASLIYAAARF+DLPELR LRSLFT +YG SF  FTNK L+EK  A+A TKE KLQLLQEIAQES I+WNSKALEQQLY PPQN    E   +  R+K
Subjt:  EAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEWNSKALEQQLYTPPQNT-PDEEPNADRRNK

Query:  AKVVSIPVFERKTSLLQKKDNSDDDSIFDSRSEDNGTETSTGDSSTDQDVQKGVF-EDEVEDRKPFYLRFIPPPYLKSKPSKKEASFEKP----------
        +K+VS+PV+ R T+  +KKDNSDDDSIFDSRSE N TETS GDSSTDQDV KG+  EDEVED+KPFYLRFI PPYLK+KP+KKE + E+P          
Subjt:  AKVVSIPVFERKTSLLQKKDNSDDDSIFDSRSEDNGTETSTGDSSTDQDVQKGVF-EDEVEDRKPFYLRFIPPPYLKSKPSKKEASFEKP----------

Query:  ---AVEEKPKPRSVRRRNSKLEPTRDINIGNVDNSANDGTEKASSSRNKGKETMMGEEKGA-GEEERMLDGLLMHYSKKKSTQESNRAKGNLKSQKHKEQ
             EEKPKPRSVRRR++K +P R INI +   SA DG  K SS +NKGKE M GEEKG   +EERMLDGLL  YSKK + QE             K+ 
Subjt:  ---AVEEKPKPRSVRRRNSKLEPTRDINIGNVDNSANDGTEKASSSRNKGKETMMGEEKGA-GEEERMLDGLLMHYSKKKSTQESNRAKGNLKSQKHKEQ

Query:  DEPQRTKNSKAEEDLSVPGRSVSLPRDTNELIVPMKRHTRNNSFVHPKLPDYDQLAARFAALKEK
         EPQR KN++ E D  VP +S+SLP    E I P+K+H R NS+VHPKLPDYDQLAARFAALKEK
Subjt:  DEPQRTKNSKAEEDLSVPGRSVSLPRDTNELIVPMKRHTRNNSFVHPKLPDYDQLAARFAALKEK

SwissProt top hitse value%identityAlignment
Q3ZBV1 IST1 homolog3.5e-0523.31Show/hide
Query:  VKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECREAVASLIYAAARF-ADL
        ++L+  RL  + KKK  + +  + +IA+ L +G D+ A  R E ++ E  +++  ++++ +   +  +  ++    E      E+V++LI+AA R  +++
Subjt:  VKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECREAVASLIYAAARF-ADL

Query:  PELRELRSLFTAKYGRSFEQF--------TNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEW
         EL+ +     AKY + + +          N  L+ K S  A  K +  + L EIA+   + +
Subjt:  PELRELRSLFTAKYGRSFEQF--------TNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEW

Q54I39 IST1-like protein5.9e-1325.15Show/hide
Query:  KCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECREAVASLIYAAAR
        K K  +KL   R+  ++ KK  +++  K ++AELLR   +++A  R E ++ +  +++C+ +I+     ++ ++++++  +E P E +E++ +L+Y++ R
Subjt:  KCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECREAVASLIYAAAR

Query:  FADLPELRELRSLFTAKYGRSFEQ--------FTNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEW
           +PEL ++++   AKYG+  E           N  ++ K S       +  Q L EIA++  ++W
Subjt:  FADLPELRELRSLFTAKYGRSFEQ--------FTNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEW

Arabidopsis top hitse value%identityAlignment
AT1G52315.1 Regulator of Vps4 activity in the MVB pathway protein7.4e-3534.7Show/hide
Query:  MFDMFFKPKFFTKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECR
        MF   FKPKF+ K KS    +K+R+D +R+K+ A+++  K DI   L++G D  AY RAE LL E  ++ CYDLI+ F   I   LS++ K+ ECP+ECR
Subjt:  MFDMFFKPKFFTKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECR

Query:  EAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNKGLIEKPSAM-AMTKEVKLQLLQEIAQESGIEWNSKALEQQLY--TPPQNTPDE-EPNADR
        EAV+SLIYA A   D+PEL++LR++FT ++G       N  L+EK   +   ++E+K+Q ++++A E  I W+   L+  L   T      D+ E  AD 
Subjt:  EAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNKGLIEKPSAM-AMTKEVKLQLLQEIAQESGIEWNSKALEQQLY--TPPQNTPDE-EPNADR

Query:  RNKAKVVSIPVFERKTSLLQKKDNSDDDSI------FDSRSEDNGTETSTGDSSTDQDVQKGVFEDEVEDRKPFYLRFIPPPYLKSKPSKKEASFEKPAV
         NK         + + S +   ++SDD+S+       DS S+ + + +S+  SS+ +          V  RK    + +P   + S  +K  A  ++ A 
Subjt:  RNKAKVVSIPVFERKTSLLQKKDNSDDDSI------FDSRSEDNGTETSTGDSSTDQDVQKGVFEDEVEDRKPFYLRFIPPPYLKSKPSKKEASFEKPAV

Query:  EEKPKPRSVR-RRNSKL
        EEK + R +  + NS+L
Subjt:  EEKPKPRSVR-RRNSKL

AT1G79910.1 Regulator of Vps4 activity in the MVB pathway protein1.7e-6345.71Show/hide
Query:  MFDMFFKPKFFTKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECR
        MFD  FKPKF+TKCKS VK+ K R+DT+++KKN+V K+LKNDI +LL++ LD NAY RAEGL+ E+  L CY+ +++F   + + +S+L K   CPDECR
Subjt:  MFDMFFKPKFFTKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECR

Query:  EAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEWNSKALEQQLYTPPQ----NTPDEEPNADR
        EA++SL+YAAAR +++PELR+LRSLF  +YG + +QF N   +E+  A   +KE+K++LLQEIA+E  I+W++K+LEQ+LYTPP     +T  EEP  ++
Subjt:  EAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEWNSKALEQQLYTPPQ----NTPDEEPNADR

Query:  RNKAKVVSIPVFERKTSL-LQKKDNSDDDS--IFDSRSEDNGTETSTGDSSTDQDVQKGVFEDEVEDRKPFYLRFI-PPPYLKSKPSKKEASFEKPAVEE
         N     ++   E+++SL    +++S DDS     S SE++   TS G   T         ED+ E  KPFY RF+ P PY K K  K+E+  EK     
Subjt:  RNKAKVVSIPVFERKTSL-LQKKDNSDDDS--IFDSRSEDNGTETSTGDSSTDQDVQKGVFEDEVEDRKPFYLRFI-PPPYLKSKPSKKEASFEKPAVEE

Query:  ----------KPKPRSVRRRNSKLEP
                  KPKPRSVRRR +   P
Subjt:  ----------KPKPRSVRRRNSKLEP

AT1G79910.2 Regulator of Vps4 activity in the MVB pathway protein4.5e-4041.57Show/hide
Query:  EGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECREAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNKGLIEKPSAMAMTKEVKLQL
        EGL+ E+  L CY+ +++F   + + +S+L K   CPDECREA++SL+YAAAR +++PELR+LRSLF  +YG + +QF N   +E+  A   +KE+K++L
Subjt:  EGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECREAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNKGLIEKPSAMAMTKEVKLQL

Query:  LQEIAQESGIEWNSKALEQQLYTPPQ----NTPDEEPNADRRNKAKVVSIPVFERKTSL-LQKKDNSDDDS--IFDSRSEDNGTETSTGDSSTDQDVQKG
        LQEIA+E  I+W++K+LEQ+LYTPP     +T  EEP  ++ N     ++   E+++SL    +++S DDS     S SE++   TS G   T       
Subjt:  LQEIAQESGIEWNSKALEQQLYTPPQ----NTPDEEPNADRRNKAKVVSIPVFERKTSL-LQKKDNSDDDS--IFDSRSEDNGTETSTGDSSTDQDVQKG

Query:  VFEDEVEDRKPFYLRFI-PPPYLKSKPSKKEASFEKPAVEE----------KPKPRSVRRRNSKLEP
          ED+ E  KPFY RF+ P PY K K  K+E+  EK               KPKPRSVRRR +   P
Subjt:  VFEDEVEDRKPFYLRFI-PPPYLKSKPSKKEASFEKPAVEE----------KPKPRSVRRRNSKLEP

AT4G32350.1 Regulator of Vps4 activity in the MVB pathway protein1.2e-2929.18Show/hide
Query:  MFDMFFKPKFFTKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECR
        MFD F    F  K K  +KL K R+D +R+K+NA +KFLK D+A+L+ +G D NA++RA GLL E   L   D +++    +Y QLS + K  ECP++CR
Subjt:  MFDMFFKPKFFTKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECR

Query:  EAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEWNSKALEQQL--------YTPPQNTPDEEP
        EA++SL++AA+ F++LPELRELR +F  KY  S   F N+ L+E  S+   + E K++L++++A E  I W+SK  E+++           P++T D+  
Subjt:  EAVASLIYAAARFADLPELRELRSLFTAKYGRSFEQFTNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEWNSKALEQQL--------YTPPQNTPDEEP

Query:  NADRRNK-AKVVSIPVFERKTSLLQKKDNSDD--DSIF--DSRSEDNGTE------TSTGDSSTDQDVQKGVFEDEVEDRKPFYLRFIPPPYLKSKPSKK
          DR     K       E   SL +K   + +  D +F  D  S  NG        T    S       +   +D   +RK FYL      + K  P++ 
Subjt:  NADRRNK-AKVVSIPVFERKTSLLQKKDNSDD--DSIF--DSRSEDNGTE------TSTGDSSTDQDVQKGVFEDEVEDRKPFYLRFIPPPYLKSKPSKK

Query:  EASFEKPAVEEKPKPRSVRRRNSKLEPTRDINIGNVDN---------SANDGTEKASSSRNKGKETMMGEEKGAGEEERMLDG--LLMHYSKKKSTQESN
                  EK +P      N        +N GN+            A+  TE  +S R +       E      +    DG  ++M    +   Q + 
Subjt:  EASFEKPAVEEKPKPRSVRRRNSKLEPTRDINIGNVDN---------SANDGTEKASSSRNKGKETMMGEEKGAGEEERMLDG--LLMHYSKKKSTQESN

Query:  RAKGNLKSQKHKEQDEPQRTKNSKAEE-DLSVPGRSVSLPRDTNELIVPMKRHTRNNSFVHPKLPD
           G +   K  E +  ++ K+S ++  D  V G          E      +H +N    H K+ D
Subjt:  RAKGNLKSQKHKEQDEPQRTKNSKAEE-DLSVPGRSVSLPRDTNELIVPMKRHTRNNSFVHPKLPD

AT4G35730.1 Regulator of Vps4 activity in the MVB pathway protein5.3e-2526.86Show/hide
Query:  TKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECREAVASLIYAAA
        +KCK+  K+   R+  IR K+  V+K ++ DIA LL+SG D  A  R E ++ E+N+    ++I+ F   I ++L++++KQ +CP + +E +ASLI+AA 
Subjt:  TKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECREAVASLIYAAA

Query:  RFADLPELRELRSLFTAKYGRSFEQF---------TNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEWNSKALEQQLYTPPQNTPDEEPNADRRNKAKV
        R +++PEL +LR +F  KYG+ F             N+ LI+K S      E KL++++EIA+E  ++W++   EQ+L  P + + D       R     
Subjt:  RFADLPELRELRSLFTAKYGRSFEQF---------TNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEWNSKALEQQLYTPPQNTPDEEPNADRRNKAKV

Query:  VSIPV--------------FERKTSLLQKKDNSDDD---------------------SIFDSRSEDNGTETS-TGDSSTDQDVQKGVFEDEVEDRKPFYL
         S+PV                R TS +    +  D                      S+  +R + +  E S + D ST Q       + +  D    + 
Subjt:  VSIPV--------------FERKTSLLQKKDNSDDD---------------------SIFDSRSEDNGTETS-TGDSSTDQDVQKGVFEDEVEDRKPFYL

Query:  RFIPPPYLKSKPSKKEASFEKPAVEEKPKPRSVRRRNSKLEPTRDINIGNVDNSANDGTEKASSSRNKGKETMMGEEKGAGEEERMLDGLLMHYSKKKST
           P    +S+ S+  + + KP  E     R + RR+S           N   + +D  E+ +++  + KETM                    Y+ +   
Subjt:  RFIPPPYLKSKPSKKEASFEKPAVEEKPKPRSVRRRNSKLEPTRDINIGNVDNSANDGTEKASSSRNKGKETMMGEEKGAGEEERMLDGLLMHYSKKKST

Query:  QESNRAKGNLKSQKHKEQDEPQRTKNSKAEEDLSVPGRSVSLPRDTNELIVPMKRHTRNNSF---VHPKLPDYDQLAARFAALK
          +       +S  ++E+ EP          D    GR  SLP +      P    +R +S    VHPKLPDYD LAARF A++
Subjt:  QESNRAKGNLKSQKHKEQDEPQRTKNSKAEEDLSVPGRSVSLPRDTNELIVPMKRHTRNNSF---VHPKLPDYDQLAARFAALK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCGACATGTTCTTCAAGCCTAAGTTCTTCACAAAATGCAAATCTTGCGTGAAGCTTTTGAAGATGCGGCTGGATACAATCCGGAAGAAGAAGAATGCGGTTCTCAA
GTTTTTGAAGAATGACATTGCTGAGCTTCTCAGAAGTGGTCTCGACCAAAACGCCTATAATAGGGCTGAAGGGCTTCTTGTTGAGCGGAATGTATTGAAATGTTATGATC
TGATTGATGAGTTTTCTGGGACGATCTATAATCAACTTTCTGTCTTGAGTAAGCAGAGTGAGTGTCCCGACGAATGCCGAGAAGCAGTCGCGTCGCTGATTTACGCTGCA
GCGAGATTCGCTGACTTGCCTGAATTGCGGGAGCTAAGGAGTCTCTTTACTGCGAAATATGGAAGATCCTTTGAACAATTTACCAACAAAGGGCTCATCGAGAAGCCGAG
TGCGATGGCAATGACTAAAGAGGTGAAGCTCCAGCTACTCCAGGAGATAGCACAGGAATCAGGCATTGAATGGAATTCCAAGGCTTTGGAACAACAACTGTATACTCCTC
CCCAAAATACACCTGATGAGGAGCCAAATGCAGACAGAAGAAACAAAGCCAAGGTTGTTTCTATTCCAGTGTTTGAAAGAAAGACGAGCTTGCTTCAAAAAAAGGACAAC
AGCGACGACGATTCCATCTTCGACAGTAGAAGCGAAGACAATGGAACCGAAACCTCAACAGGAGATAGCAGTACTGATCAAGATGTTCAGAAAGGTGTATTTGAGGATGA
AGTAGAAGATCGAAAACCTTTCTACCTTAGATTTATCCCACCTCCCTACCTCAAATCCAAGCCTAGCAAAAAGGAAGCGAGTTTTGAGAAGCCAGCCGTGGAAGAGAAGC
CGAAACCCAGATCTGTGAGGCGAAGAAATAGTAAACTTGAGCCTACAAGGGACATCAACATTGGCAATGTCGACAACTCCGCGAACGATGGTACAGAAAAGGCTAGTTCG
AGTCGAAACAAAGGAAAAGAAACCATGATGGGAGAAGAGAAAGGTGCAGGGGAGGAAGAGAGGATGTTAGATGGGCTTTTGATGCATTATAGTAAGAAAAAATCAACTCA
AGAATCAAACAGAGCAAAAGGCAACCTTAAATCCCAAAAACACAAAGAACAAGATGAACCTCAAAGGACCAAAAATTCAAAAGCTGAGGAGGACCTTTCTGTCCCAGGAA
GATCAGTTTCACTTCCCAGAGACACAAATGAGCTAATTGTTCCAATGAAGAGGCATACTAGAAACAATTCCTTTGTGCATCCTAAGCTCCCGGATTACGACCAACTCGCA
GCTCGGTTTGCAGCTCTCAAAGAGAAGTAA
mRNA sequenceShow/hide mRNA sequence
AAATAAAATTCATTCCGATATTAATTACAATCTGTATTATCTTATATATATATATATATATATATAAAATCAGATGAAGAGATTGTGGCCGGAATTTTAAACGAGGGTAC
AACTGTCAAAATGGAAAAACTCCAAACAAATCCGTTAAGTAAGGCCAGCTATTAACGGACGGCTCCACGTTTTATCTCCCCTCGCTAAATTTCCGTCCCTTCGCCTTTTT
TACTTTTCTTTTTTTTCGGAGTTTCCATAAATTTTAACCGACAACGTTTTATCCATTTTATTACGGCCTAGATTTACCCCTCCAGATCCCTTGGCCGAACAAGGCCTACA
AATCACCACCAAATTTTCACTGCCAAAAAGTAAAATTAAAAATTAAAAAAAGAAAAAAGAGAACTTTTTGGTTAATAATTTCAAGATCCTTTCTCATGGATCCGCGCGTT
TTTATTTTTAGTGGGAGAGACAGAGTCCGACGTGGCACTTCCCTCTTCCACTCCCCGTCGTCGTCTCTCTTCAAAACACAGCTGGGATCATCTTTTTCTTCCTTAGACCG
CACTTTTTCGGCGGATTACACTCCTCTGCTTCTCTTGCTCTGTCTTTTCTGTGGAGATGATTTCATGATGTTCTTCCATTTGCCGTCCTCCTTTTCTTCTTTCAATCCTT
CAATGGTGGGACAGATTTTTTGACGTCTGGTTTTTGCTCTTCTGGGTTTCTTGCCATGTGGGTTTCTCGGGTTTTGGATTCTTGATTGATTGATGCGTTTACTGGTTCCA
ATCGCGGAGTTCTTCTTTGCCTTGTTATTGGGTTTTGGATTCTCCCCATTTTCCTGATCCCTCGGTTTTCTTCCTTATTTTAACCCCTCAACTTCCGTTTCCCATCGAGT
TTGCTTCCATTTCGAAATGTTCGACATGTTCTTCAAGCCTAAGTTCTTCACAAAATGCAAATCTTGCGTGAAGCTTTTGAAGATGCGGCTGGATACAATCCGGAAGAAGA
AGAATGCGGTTCTCAAGTTTTTGAAGAATGACATTGCTGAGCTTCTCAGAAGTGGTCTCGACCAAAACGCCTATAATAGGGCTGAAGGGCTTCTTGTTGAGCGGAATGTA
TTGAAATGTTATGATCTGATTGATGAGTTTTCTGGGACGATCTATAATCAACTTTCTGTCTTGAGTAAGCAGAGTGAGTGTCCCGACGAATGCCGAGAAGCAGTCGCGTC
GCTGATTTACGCTGCAGCGAGATTCGCTGACTTGCCTGAATTGCGGGAGCTAAGGAGTCTCTTTACTGCGAAATATGGAAGATCCTTTGAACAATTTACCAACAAAGGGC
TCATCGAGAAGCCGAGTGCGATGGCAATGACTAAAGAGGTGAAGCTCCAGCTACTCCAGGAGATAGCACAGGAATCAGGCATTGAATGGAATTCCAAGGCTTTGGAACAA
CAACTGTATACTCCTCCCCAAAATACACCTGATGAGGAGCCAAATGCAGACAGAAGAAACAAAGCCAAGGTTGTTTCTATTCCAGTGTTTGAAAGAAAGACGAGCTTGCT
TCAAAAAAAGGACAACAGCGACGACGATTCCATCTTCGACAGTAGAAGCGAAGACAATGGAACCGAAACCTCAACAGGAGATAGCAGTACTGATCAAGATGTTCAGAAAG
GTGTATTTGAGGATGAAGTAGAAGATCGAAAACCTTTCTACCTTAGATTTATCCCACCTCCCTACCTCAAATCCAAGCCTAGCAAAAAGGAAGCGAGTTTTGAGAAGCCA
GCCGTGGAAGAGAAGCCGAAACCCAGATCTGTGAGGCGAAGAAATAGTAAACTTGAGCCTACAAGGGACATCAACATTGGCAATGTCGACAACTCCGCGAACGATGGTAC
AGAAAAGGCTAGTTCGAGTCGAAACAAAGGAAAAGAAACCATGATGGGAGAAGAGAAAGGTGCAGGGGAGGAAGAGAGGATGTTAGATGGGCTTTTGATGCATTATAGTA
AGAAAAAATCAACTCAAGAATCAAACAGAGCAAAAGGCAACCTTAAATCCCAAAAACACAAAGAACAAGATGAACCTCAAAGGACCAAAAATTCAAAAGCTGAGGAGGAC
CTTTCTGTCCCAGGAAGATCAGTTTCACTTCCCAGAGACACAAATGAGCTAATTGTTCCAATGAAGAGGCATACTAGAAACAATTCCTTTGTGCATCCTAAGCTCCCGGA
TTACGACCAACTCGCAGCTCGGTTTGCAGCTCTCAAAGAGAAGTAAAAGTCATTGATATCGTTCCAGTATTATCGAGTAATTATTGAATGTGACTGACACAAGATGGAAA
AGTCTCCTCCTCGGAGATTCTCTTTTTTAGAAACCGGCCCAATTGCACACTTGCAATTCTATCTAGGAAATTCTAATTCTCCCTTGGCAATGTCTAAATACTTTATAGCA
TGTAGTGTGCAGTGCTTTACAAGTGCAGTAAAATTTCTCTAAAAAGTGGTACATAATTCACATTTACAGATATTTTCAGTGCCATTTTACTCTTGTCTTGAGC
Protein sequenceShow/hide protein sequence
MFDMFFKPKFFTKCKSCVKLLKMRLDTIRKKKNAVLKFLKNDIAELLRSGLDQNAYNRAEGLLVERNVLKCYDLIDEFSGTIYNQLSVLSKQSECPDECREAVASLIYAA
ARFADLPELRELRSLFTAKYGRSFEQFTNKGLIEKPSAMAMTKEVKLQLLQEIAQESGIEWNSKALEQQLYTPPQNTPDEEPNADRRNKAKVVSIPVFERKTSLLQKKDN
SDDDSIFDSRSEDNGTETSTGDSSTDQDVQKGVFEDEVEDRKPFYLRFIPPPYLKSKPSKKEASFEKPAVEEKPKPRSVRRRNSKLEPTRDINIGNVDNSANDGTEKASS
SRNKGKETMMGEEKGAGEEERMLDGLLMHYSKKKSTQESNRAKGNLKSQKHKEQDEPQRTKNSKAEEDLSVPGRSVSLPRDTNELIVPMKRHTRNNSFVHPKLPDYDQLA
ARFAALKEK