; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G04980 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G04980
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRegulator of Vps4 activity in the MVB pathway protein
Genome locationClcChr08:15326254..15328411
RNA-Seq ExpressionClc08G04980
SyntenyClc08G04980
Gene Ontology termsGO:0015031 - protein transport (biological process)
InterPro domainsIPR005061 - Vacuolar protein sorting-associated protein Ist1
IPR042277 - Vacuolar protein sorting-associated protein IST1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601794.1 IST1-like protein, partial [Cucurbita argyrosperma subsp. sororia]3.1e-12971.65Show/hide
Query:  MGKKLDAFLGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+ LDA LGRNFRASKFRPLL+LAV+RL+ILT QRRLR SQA SDVLQLLQL H  RALLRVE+VI DQNALDAYVLIEGYLNLL+ER +LLEQ+ ECP
Subjt:  MGKKLDAFLGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASENGIVLQIDQPYPSNEEKLVRN
        EELKEA+AGL+FAASRCGDFPELHEIKSVLT+RFGKEFTARAVELRNNCGVNHLLMQKLSTR P+LESRM++LK IASENGI LQ+D+   SNE    RN
Subjt:  EELKEAVAGLIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASENGIVLQIDQPYPSNEEKLVRN

Query:  TRQNQAEAHSEVKEKFQFFTEVPSGSQQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSPGSGTTPHNKQEVEEPKVESKPKPKME
        +RQNQ+E H  V E  +F TEVPSGS+QKYKDVADAAQAAFESAAQAAAAARAAMELSRS     DGPSSP  PGS TT ++KQE ++ +VE+KPK ++E
Subjt:  TRQNQAEAHSEVKEKFQFFTEVPSGSQQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSPGSGTTPHNKQEVEEPKVESKPKPKME

Query:  YGIRREGEGEG-EGEEEVKNSVPGASSPSREFCYGEKEKSD------------METKLEKTELSEKASFRLNLEKQPISVRTRRVRGY
        Y   +E EG     E EVKNS+ G SS +RE    E  + D            ME  LEKTE++EK SFRLNLEK+PISVRTRRVRGY
Subjt:  YGIRREGEGEG-EGEEEVKNSVPGASSPSREFCYGEKEKSD------------METKLEKTELSEKASFRLNLEKQPISVRTRRVRGY

XP_011656519.1 uncharacterized protein LOC101211044 [Cucumis sativus]2.1e-13375.81Show/hide
Query:  MGKKLDAFLGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+KLDA LGRNFRASKFRPLL+L+++RLSILT QRR+ CSQA+SDVLQLLQL HHHRALLRVEKVI DQNALDAYVLIEGYLNLLLERTTLLEQQSECP
Subjt:  MGKKLDAFLGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASENGIVLQIDQPYPSNEEKLVRN
        EELKEAVAGL+FAASRCGDFPELHEIKSVLT RFGKEFTARAVELRNNCGVN  LMQKLSTRQP+LE+RM+ LK+IASENGIVLQIDQ   S +EK+ RN
Subjt:  EELKEAVAGLIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASENGIVLQIDQPYPSNEEKLVRN

Query:  TRQNQAEAHSEVKEKFQFFTEVPSGSQQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSPGSGTTPHNKQEVEEPKVESKPKPKM-EYGI
         RQ++AE  SE     +F  EV SGS+  YKDVADAAQAAFESAAQAAAAARAAMELSRSH+GPSSP+ PGSGTT  NKQ+ E+ +VESK K +M EYG 
Subjt:  TRQNQAEAHSEVKEKFQFFTEVPSGSQQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSPGSGTTPHNKQEVEEPKVESKPKPKM-EYGI

Query:  RREGEGEGEGEEEVKNSVPGASSPSREFCYGEKEKSDMETKLEKTELSEKASFRLNLEKQPISVRTRRVRGY
         R+GEGEGE EE       G          G K    METK+EKTE+SEK SFRLNLEK+PISVRTRRV GY
Subjt:  RREGEGEGEGEEEVKNSVPGASSPSREFCYGEKEKSDMETKLEKTELSEKASFRLNLEKQPISVRTRRVRGY

XP_022921602.1 uncharacterized protein LOC111429814 [Cucurbita moschata]1.1e-12971.65Show/hide
Query:  MGKKLDAFLGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+ LDA LGRNFRASKFRPLL+LA++RL+ILT QRRLR SQA SDVLQLLQL H  RALLRVE+VI DQNALDAYVLIEGYLNLL+ERT+LLEQQ +CP
Subjt:  MGKKLDAFLGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASENGIVLQIDQPYPSNEEKLVRN
        EELKEA+AGL+FAASRCGDFPELHEIKSVLT+RFGKEFTARAVELRNNCGVNHLLMQKLSTR P+LESRM++LK IASENGI LQ+D+   SNE    RN
Subjt:  EELKEAVAGLIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASENGIVLQIDQPYPSNEEKLVRN

Query:  TRQNQAEAHSEVKEKFQFFTEVPSGSQQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSPGSGTTPHNKQEVEEPKVESKPKPKME
        +RQNQ+E H  V E  +F TEVPSGS+QKYKDVADAAQAAFESAAQAAAAARAAMELSRS     DGPSSP  PGS TT ++KQE ++ +VE+KPK ++E
Subjt:  TRQNQAEAHSEVKEKFQFFTEVPSGSQQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSPGSGTTPHNKQEVEEPKVESKPKPKME

Query:  YGIRREGEGEG-EGEEEVKNSVPGASSPSREFCYGEKEKSD------------METKLEKTELSEKASFRLNLEKQPISVRTRRVRGY
        Y   +E EG     E EVKNS+ G SS +RE    E  + D            ME  LEKTE++EK SFRLNLEK+PISVRTRRVRGY
Subjt:  YGIRREGEGEG-EGEEEVKNSVPGASSPSREFCYGEKEKSD------------METKLEKTELSEKASFRLNLEKQPISVRTRRVRGY

XP_023538782.1 uncharacterized protein LOC111799606 [Cucurbita pepo subsp. pepo]4.4e-12871.39Show/hide
Query:  MGKKLDAFLGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+ LDA LGRNFRASKFRPLL+LA++RL+ILT QRRLR SQA SDVLQLLQL H  RALLRVE+VI DQNALDAYVLIEGYLNLL+ERT+LLEQ+ ECP
Subjt:  MGKKLDAFLGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASENGIVLQIDQPYPSNEEKLVRN
        EELKEA+AGL+FAASRCGDFPELHEIKSVLT+RFGKEFTARAVELRNNCGVNHLLMQKLSTRQP+LESRM++LK IASENGI LQ+D+   SNE    RN
Subjt:  EELKEAVAGLIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASENGIVLQIDQPYPSNEEKLVRN

Query:  TRQNQAEAHSEVKEKFQFFTEVPSGSQQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSPGSGTTPHNKQEVEEPKVESKPKPKME
        + QNQ E H  V E  +  TEVPSGS+QKYKDVADAAQAAFESAAQAAAAARAAMELSR      DGPSSP  PGSGTT ++KQE ++  VE+KPK ++E
Subjt:  TRQNQAEAHSEVKEKFQFFTEVPSGSQQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSPGSGTTPHNKQEVEEPKVESKPKPKME

Query:  YGIRREGEGEG-EGEEEVKNSVPGASSPSREFCYGEKEKSD------------METKLEKTELSEKASFRLNLEKQPISVRTRRVRGY
        Y   RE EG     E EVKNS+ G SS +RE    E  + D            ME  LEKTE++EK SFRLNLEK+PISVRTRRVRG+
Subjt:  YGIRREGEGEG-EGEEEVKNSVPGASSPSREFCYGEKEKSD------------METKLEKTELSEKASFRLNLEKQPISVRTRRVRGY

XP_038884116.1 uncharacterized protein LOC120075040 [Benincasa hispida]1.7e-14880.59Show/hide
Query:  MGKKLDAFLGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+KLDA LGRNFRASKFRPLL+LA++RLSILT QRR+RCSQA+SDVLQLLQLP+HHRALLRVEKVI DQNALDAYVLIEGYLNLLLERTTLLEQQSECP
Subjt:  MGKKLDAFLGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASENGIVLQIDQPYPSNEEKLVRN
        EELKEAVAGL+FAASRCGDFPELHEIKSVLT RFGKEFTARAVELRNNCGVNHLLMQKLSTRQP+LESRMEVLKAIASENGIVLQID   PSNEEKL RN
Subjt:  EELKEAVAGLIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASENGIVLQIDQPYPSNEEKLVRN

Query:  TRQNQAEAHSEVKEKFQFFTEVPSGSQQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSPGSGTTPHNKQEVEEPKVESKPKPKMEYGIR
        TRQNQ E  SEV E  QF TEV SGS+QKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSP SPGSGTTPH+KQE+E+ +VESKP P+ E   R
Subjt:  TRQNQAEAHSEVKEKFQFFTEVPSGSQQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSPGSGTTPHNKQEVEEPKVESKPKPKMEYGIR

Query:  REGEGEGEGEEEVKNSVPGASSPSREFCYGEKEKSDMETKLEKTELSEKASFRLNLEKQPISVRTRRVRGY
        R G G  E EEE + +         E     K   ++E K+EKTE+SEK SFRLNLEK+PISVRTRRVRG+
Subjt:  REGEGEGEGEEEVKNSVPGASSPSREFCYGEKEKSDMETKLEKTELSEKASFRLNLEKQPISVRTRRVRGY

TrEMBL top hitse value%identityAlignment
A0A0A0KAU0 Uncharacterized protein9.9e-13475.81Show/hide
Query:  MGKKLDAFLGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+KLDA LGRNFRASKFRPLL+L+++RLSILT QRR+ CSQA+SDVLQLLQL HHHRALLRVEKVI DQNALDAYVLIEGYLNLLLERTTLLEQQSECP
Subjt:  MGKKLDAFLGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASENGIVLQIDQPYPSNEEKLVRN
        EELKEAVAGL+FAASRCGDFPELHEIKSVLT RFGKEFTARAVELRNNCGVN  LMQKLSTRQP+LE+RM+ LK+IASENGIVLQIDQ   S +EK+ RN
Subjt:  EELKEAVAGLIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASENGIVLQIDQPYPSNEEKLVRN

Query:  TRQNQAEAHSEVKEKFQFFTEVPSGSQQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSPGSGTTPHNKQEVEEPKVESKPKPKM-EYGI
         RQ++AE  SE     +F  EV SGS+  YKDVADAAQAAFESAAQAAAAARAAMELSRSH+GPSSP+ PGSGTT  NKQ+ E+ +VESK K +M EYG 
Subjt:  TRQNQAEAHSEVKEKFQFFTEVPSGSQQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSPGSGTTPHNKQEVEEPKVESKPKPKM-EYGI

Query:  RREGEGEGEGEEEVKNSVPGASSPSREFCYGEKEKSDMETKLEKTELSEKASFRLNLEKQPISVRTRRVRGY
         R+GEGEGE EE       G          G K    METK+EKTE+SEK SFRLNLEK+PISVRTRRV GY
Subjt:  RREGEGEGEGEEEVKNSVPGASSPSREFCYGEKEKSDMETKLEKTELSEKASFRLNLEKQPISVRTRRVRGY

A0A1S3BD83 uncharacterized protein LOC1034886061.1e-12481.15Show/hide
Query:  MGKKLDAFLGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+KLDA LGRNFRASKFRPLL+L+++RLSILT+QRRLRCSQA+SDVLQLLQL HHHRALLRVEKVI DQNALDAYVLIEGYLNLLLERTTLLEQQSECP
Subjt:  MGKKLDAFLGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASENGIVLQIDQPYPSNEEKLVRN
        EELKEAVAGL+FAASRCGDFPELHEIKSVLT RFGKEFTARAVELRNNCGVN  LMQKLSTRQP+LE+RMEVLK+IASENGIVLQ DQ   SNEEK+ R+
Subjt:  EELKEAVAGLIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASENGIVLQIDQPYPSNEEKLVRN

Query:  TRQNQAEAHSEVKEKFQFFTEVPSGSQQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSPGSGTTPHNKQEVEEPKVESKPKPKM-EYGI
         RQ++AE  SE     +F  EV S S+++YKDVADAAQAAFESAAQAAAAARAAMELSRSH+GPSSP+ PGSGTTPH+KQ+ E+ +VESK K +M EYG 
Subjt:  TRQNQAEAHSEVKEKFQFFTEVPSGSQQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSPGSGTTPHNKQEVEEPKVESKPKPKM-EYGI

Query:  RREGEGEGEGEEE
         R GEGEGEGE E
Subjt:  RREGEGEGEGEEE

A0A6J1CFG6 uncharacterized protein LOC1110103256.0e-12369.15Show/hide
Query:  MGKKLDAFLGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+KLDA LGRNFRASKFRPLL+LA++RL++LT QR +R SQA SD LQLLQL HHHRALLRVE+VI +QNALDAYVLIEGYLNLL+ERT LLEQ+ ECP
Subjt:  MGKKLDAFLGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASENGIVLQIDQPYPSNEEKLVRN
        EELKEAV+GLIFAASRCGDFPEL EIKSVLT RFGKEFTARAVELRNNCGV+H +MQKLSTRQP+LESRM VL+AIASEN IVLQ+D+   SNEEK  RN
Subjt:  EELKEAVAGLIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASENGIVLQIDQPYPSNEEKLVRN

Query:  TRQN-QAEAHSEVKEKFQFFTEVPSGS---QQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSPGSGTTPHNKQEVEEPKVESKPK
         RQN Q E  +EV +  QF  +VPSGS   +QKYKDVADAAQAAFESAAQAAAAARAAMELSRS     D PS+P  PGSG+T + +QE ++ +VESKPK
Subjt:  TRQN-QAEAHSEVKEKFQFFTEVPSGS---QQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSPGSGTTPHNKQEVEEPKVESKPK

Query:  PKMEYGIRREGEGEGEG-EEEVKNSVPGASSPSREFCYGEKEKS---------DMETKLEKTELSEKASFRLNLEKQPISVRTRRVRGY
         ++EY  RRE E  G     EVKNS+P + SPSRE    E++++         + ETK E TE ++K SF LNLEK+P+SVRTRRVRG+
Subjt:  PKMEYGIRREGEGEGEG-EEEVKNSVPGASSPSREFCYGEKEKS---------DMETKLEKTELSEKASFRLNLEKQPISVRTRRVRGY

A0A6J1E0Z0 uncharacterized protein LOC1114298145.1e-13071.65Show/hide
Query:  MGKKLDAFLGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+ LDA LGRNFRASKFRPLL+LA++RL+ILT QRRLR SQA SDVLQLLQL H  RALLRVE+VI DQNALDAYVLIEGYLNLL+ERT+LLEQQ +CP
Subjt:  MGKKLDAFLGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASENGIVLQIDQPYPSNEEKLVRN
        EELKEA+AGL+FAASRCGDFPELHEIKSVLT+RFGKEFTARAVELRNNCGVNHLLMQKLSTR P+LESRM++LK IASENGI LQ+D+   SNE    RN
Subjt:  EELKEAVAGLIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASENGIVLQIDQPYPSNEEKLVRN

Query:  TRQNQAEAHSEVKEKFQFFTEVPSGSQQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSPGSGTTPHNKQEVEEPKVESKPKPKME
        +RQNQ+E H  V E  +F TEVPSGS+QKYKDVADAAQAAFESAAQAAAAARAAMELSRS     DGPSSP  PGS TT ++KQE ++ +VE+KPK ++E
Subjt:  TRQNQAEAHSEVKEKFQFFTEVPSGSQQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSPGSGTTPHNKQEVEEPKVESKPKPKME

Query:  YGIRREGEGEG-EGEEEVKNSVPGASSPSREFCYGEKEKSD------------METKLEKTELSEKASFRLNLEKQPISVRTRRVRGY
        Y   +E EG     E EVKNS+ G SS +RE    E  + D            ME  LEKTE++EK SFRLNLEK+PISVRTRRVRGY
Subjt:  YGIRREGEGEG-EGEEEVKNSVPGASSPSREFCYGEKEKSD------------METKLEKTELSEKASFRLNLEKQPISVRTRRVRGY

A0A6J1I2V3 uncharacterized protein LOC1114704475.3e-12770.62Show/hide
Query:  MGKKLDAFLGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+ LDA LGRNFRASKFRPLL+LAV+RL+ILT QRRLR +QA SDVLQLLQL H  RALLRVE+VI DQNALDAYVL+EGYLNLL+ERT+LLEQ+ ECP
Subjt:  MGKKLDAFLGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASENGIVLQIDQPYPSNEEKLVRN
        EELKEA+AGL+FAASRCGDFPELHEIKSVLT  FGKEFTARAVELRNNCGVNHLLMQKLSTRQP+LESRM++LK IASENGI LQ+D+   SNE    RN
Subjt:  EELKEAVAGLIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASENGIVLQIDQPYPSNEEKLVRN

Query:  TRQNQAEAHSEVKEKFQFFTEVPSGSQQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSPGSGTTPHNKQEVEEPKVESKPKPKME
        +RQNQ+E H  V E  +  TEVPSGS+QKYKDVADAAQAAFESAAQAAAAARAAMELSRS     DGPSSP  PGS T+ ++KQE ++  VE+KPK ++E
Subjt:  TRQNQAEAHSEVKEKFQFFTEVPSGSQQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSPGSGTTPHNKQEVEEPKVESKPKPKME

Query:  YGIRREGEGEG-EGEEEVKNSVPGASSPSREFCYGEKEKSD------------METKLEKTELSEKASFRLNLEKQPISVRTRRVRGY
        Y   RE EG     + EVKNS+ G SS +RE    E  + D            ME  LEKT ++EK SFRLNLEK+PISVRTRRVRGY
Subjt:  YGIRREGEGEG-EGEEEVKNSVPGASSPSREFCYGEKEKSD------------METKLEKTELSEKASFRLNLEKQPISVRTRRVRGY

SwissProt top hitse value%identityAlignment
P53990 IST1 homolog6.6e-1030.57Show/hide
Query:  LGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAVA
        LG  F+A + R  L L + RL +L  ++     +A  ++   L      RA +RVE +I +   ++A  ++E Y +LLL R  L++   E    L E+V+
Subjt:  LGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAVA

Query:  GLIFAASRC-GDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQP
         LI+AA R   +  EL  +   L  ++ KE+  +         VN  LM KLS   P
Subjt:  GLIFAASRC-GDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQP

Q3ZBV1 IST1 homolog3.3e-0929.94Show/hide
Query:  LGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAVA
        LG   +A + R  L L + RL +L  ++     +A  ++   L      RA +RVE +I +   ++A  ++E Y +LLL R  L++   E    L E+V+
Subjt:  LGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAVA

Query:  GLIFAASRC-GDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQP
         LI+AA R   +  EL  +   L  ++ KE+  +         VN  LM KLS   P
Subjt:  GLIFAASRC-GDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQP

Q54I39 IST1-like protein7.8e-1129.31Show/hide
Query:  GRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAVAG
        G ++ + K +  L LAV+R+ IL  ++         +V +LL+  +   A +RVE +I D+  ++ + +IE    LL  R  L+   +E P E+KE++  
Subjt:  GRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAVAG

Query:  LIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNC----GVNHLLMQKLSTRQPSLESRMEVLKAIASE
        L++++ R    PEL +IK+ L  ++GK      +E   NC     VN  ++ KLS   P      + L  IA +
Subjt:  LIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNC----GVNHLLMQKLSTRQPSLESRMEVLKAIASE

Q568Z6 IST1 homolog6.6e-1030.57Show/hide
Query:  LGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAVA
        LG  F+A + R  L L + RL +L  ++     +A  ++   L      RA +RVE +I +   ++A  ++E Y +LLL R  L++   E    L E+V+
Subjt:  LGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAVA

Query:  GLIFAASRC-GDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQP
         LI+AA R   +  EL  +   L  ++ KE+  +         VN  LM KLS   P
Subjt:  GLIFAASRC-GDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQP

Q9CX00 IST1 homolog6.6e-1030.57Show/hide
Query:  LGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAVA
        LG  F+A + R  L L + RL +L  ++     +A  ++   L      RA +RVE +I +   ++A  ++E Y +LLL R  L++   E    L E+V+
Subjt:  LGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAVA

Query:  GLIFAASRC-GDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQP
         LI+AA R   +  EL  +   L  ++ KE+  +         VN  LM KLS   P
Subjt:  GLIFAASRC-GDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQP

Arabidopsis top hitse value%identityAlignment
AT1G13340.1 Regulator of Vps4 activity in the MVB pathway protein2.8e-6449.07Show/hide
Query:  MGKKLDAFLGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MGKKLDA LGR+F+ +KF+ L++LA+TRLSIL  QR+ R SQA SDV +LL+L  H  A  RV++V+ DQN LD    I GY  L L+R  L E   +CP
Subjt:  MGKKLDAFLGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASENGIVLQIDQPYPSNEEKLVRN
        EEL EAV+GL+FAASR G+FPEL EI++VL +RFGK+  AR++ELR+NCGV+  ++QKLSTR P  E RM+ LK IA+EN IVL++DQ   S E      
Subjt:  EELKEAVAGLIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASENGIVLQIDQPYPSNEEKLVRN

Query:  TRQNQAEAHSEVK----EKFQFFTEVPSGSQQKYKDVADAAQAAFESAAQAAAAARAAMELSR-SHDGPSSPNSPGSGTTPHNKQEVEEPKVESKPKPKM
           + ++     K    E +     V  G ++KYKDVADAAQAAFESAA AA AA+AA+ELS+ S  G  SP + G   + H          E+K   + 
Subjt:  TRQNQAEAHSEVK----EKFQFFTEVPSGSQQKYKDVADAAQAAFESAAQAAAAARAAMELSR-SHDGPSSPNSPGSGTTPHNKQEVEEPKVESKPKPKM

Query:  EYGIRREGEGEGEGEEEVKNSV
        + G     EGEG+   E K S+
Subjt:  EYGIRREGEGEGEGEEEVKNSV

AT1G25420.1 Regulator of Vps4 activity in the MVB pathway protein4.7e-2729.62Show/hide
Query:  LDAFLGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELK
        L+    R    +K +  L+LA+ R+ +L  +R ++      ++   LQ      A +RVE VI + N   AY ++E +   +L R  +LE + ECP EL+
Subjt:  LDAFLGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELK

Query:  EAVAGLIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASE---NGIVLQIDQPYPSNEEKLVRNT
        EA+A +IFAA RC + P+L +IK++   ++GKEF   A ELR + GVN  +++KLS   PS  +R+++LK IA E   N      +  +  + E L+   
Subjt:  EAVAGLIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASE---NGIVLQIDQPYPSNEEKLVRNT

Query:  RQ-----------------NQAEAHSEVK-------EKFQFFT------------------EVPSGSQQKYKDVADAAQAAFESAAQAAAAARAAMELSR
        +Q                  Q+    EV+       ++FQ                     + P  +++   DV + A+AA  SA +A AAARAA +L  
Subjt:  RQ-----------------NQAEAHSEVK-------EKFQFFT------------------EVPSGSQQKYKDVADAAQAAFESAAQAAAAARAAMELSR

Query:  SHDGPSSPNSPGSG
           G ++P     G
Subjt:  SHDGPSSPNSPGSG

AT1G34220.2 Regulator of Vps4 activity in the MVB pathway protein5.2e-2629.1Show/hide
Query:  LDAFLGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELK
        LD+F  + F+A+K + LL L + R+ ++  +R  +  Q   ++ +LL+      A +RVE +I ++  + A  ++E +  L+  R  ++E Q ECP +LK
Subjt:  LDAFLGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELK

Query:  EAVAGLIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASENGIVLQIDQPYPSNE-------EKL
        EA++ + FAA RC D  EL +++ +  +++GKEF A A EL+ + GVN  L++ LS R PS E+++++LK IA E+    ++D    S E       E L
Subjt:  EAVAGLIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASENGIVLQIDQPYPSNE-------EKL

Query:  VRNTRQNQAEAHSEVKEKFQFFTEVPSGSQQKYKDVADAAQAAFE---------------SAAQAAAAARAAMELSRSHDGPSSPNSPGSGTTPHNKQE
        +   +Q    +   + E+    T + S S  K K  +D+     +               ++  A  AA++A     SHD P    + G   T   + E
Subjt:  VRNTRQNQAEAHSEVKEKFQFFTEVPSGSQQKYKDVADAAQAAFE---------------SAAQAAAAARAAMELSRSHDGPSSPNSPGSGTTPHNKQE

AT2G19710.1 Regulator of Vps4 activity in the MVB pathway protein6.6e-2938.86Show/hide
Query:  LGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAVA
        L R F+ +K +  L +A +RL IL  ++ ++  Q   ++ QLL+      A +RVE V+ ++  + AY LI  Y  LL+ R  ++E Q  CP +LKEAV 
Subjt:  LGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAVA

Query:  GLIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASENGIV
         ++FA+ R  D PEL EI    T ++GK+F+  AVELR + GV+ LL++KLS + P   +++++L AIA E+ +V
Subjt:  GLIFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASENGIV

AT4G35730.1 Regulator of Vps4 activity in the MVB pathway protein2.0e-3030.21Show/hide
Query:  RNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAVAGL
        R F +SK +    +AV R+ ++  +R +   Q   D+  LLQ      A +RVE VI +QN   A  +IE +  L++ R T++ +Q +CP +LKE +A L
Subjt:  RNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAVAGL

Query:  IFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASENGIVLQIDQPYPSNEEKLVRNTRQN------
        IFAA RC + PEL +++ +   ++GK+F + A +LR +CGVN +L+ KLS R P  E +++++K IA E     Q+D      E++L++   ++      
Subjt:  IFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASENGIVLQIDQPYPSNEEKLVRNTRQN------

Query:  --QAEA----HSEVKEKFQFFTEVPSGSQQ-----KYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSPGSGTTPHNKQEVEEPKVESKPKP
           A +     + + E       VP  +        Y D   AA+AA E A QA AAA+ A  L+   D  +   S  S  + H K           P  
Subjt:  --QAEA----HSEVKEKFQFFTEVPSGSQQ-----KYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSPGSGTTPHNKQEVEEPKVESKPKP

Query:  KMEYGIRREGEGEGEGEEEVKNSVPGASS--PSREFCYGEK--EKSDMETKLEKTELSEKASFRLNLEKQPISV
            G RR+     + E     + PGA +    R   Y      +SD E +   TE   K + R      P SV
Subjt:  KMEYGIRREGEGEGEGEEEVKNSVPGASS--PSREFCYGEK--EKSDMETKLEKTELSEKASFRLNLEKQPISV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAAAAAGCTTGACGCTTTTCTTGGTAGGAATTTCAGAGCCTCCAAATTCCGTCCCCTTCTCAGTCTTGCCGTCACTCGCCTTTCCATCCTCACTACCCAACGCCG
TCTCAGATGCTCTCAGGCTCATTCCGACGTCCTTCAACTCCTCCAACTTCCCCACCACCACCGTGCTCTTCTTCGAGTCGAGAAAGTGATTGTGGATCAGAATGCTCTGG
ATGCCTATGTTTTGATCGAAGGGTATCTCAACCTCTTGCTTGAAAGAACCACCCTCCTCGAACAACAAAGTGAGTGCCCTGAGGAATTGAAAGAAGCGGTTGCAGGGTTG
ATATTTGCTGCTTCCAGATGTGGAGATTTCCCTGAACTTCATGAGATCAAATCAGTTTTGACCAATCGTTTTGGCAAGGAATTCACTGCTCGTGCCGTTGAATTACGCAA
CAATTGTGGAGTCAATCATTTGTTAATGCAAAAACTTTCCACAAGGCAGCCCAGTTTGGAGAGTAGAATGGAGGTCCTCAAAGCCATTGCTTCTGAGAATGGGATTGTAT
TGCAGATTGACCAACCTTACCCTTCCAATGAGGAAAAACTAGTCAGAAACACAAGGCAAAACCAGGCAGAGGCACACAGTGAAGTGAAAGAGAAGTTTCAATTTTTCACT
GAAGTGCCATCTGGTTCTCAACAGAAGTACAAAGATGTGGCGGATGCAGCGCAAGCCGCTTTCGAATCCGCAGCTCAAGCAGCAGCTGCTGCCCGAGCTGCCATGGAGCT
CTCCCGCTCCCATGATGGTCCAAGCAGCCCGAATAGCCCTGGCTCTGGAACAACTCCACATAACAAACAAGAGGTAGAGGAACCAAAGGTTGAATCAAAACCAAAACCAA
AAATGGAATATGGAATTCGGAGAGAAGGAGAAGGAGAAGGAGAAGGAGAAGAAGAAGTGAAAAATTCAGTGCCTGGTGCTTCTTCTCCAAGTAGAGAGTTCTGTTATGGG
GAAAAGGAAAAGAGTGACATGGAAACAAAGTTAGAGAAGACAGAGTTAAGTGAGAAGGCAAGCTTTCGTTTAAATCTGGAGAAGCAGCCAATTTCAGTGAGAACGAGAAG
AGTGCGAGGATACTGA
mRNA sequenceShow/hide mRNA sequence
GCGTGTTGGAGGGTTTCCACACAAAAAGGTTTGGGCCAACGAAGACTTCGTCTATAAGCAATACACAGAAATCAAACCCTTCCAAGTTCCCACGCGCTTCATCTTCAACC
CTTCTTCACACTGCCCTGTTCTCCACTCTGAAACAACCGTATATAAACAAATCAATTGAAATTCCACATAATCTGAATTAACCCATTTCGCCATGGGAAAAAAGCTTGAC
GCTTTTCTTGGTAGGAATTTCAGAGCCTCCAAATTCCGTCCCCTTCTCAGTCTTGCCGTCACTCGCCTTTCCATCCTCACTACCCAACGCCGTCTCAGATGCTCTCAGGC
TCATTCCGACGTCCTTCAACTCCTCCAACTTCCCCACCACCACCGTGCTCTTCTTCGAGTCGAGAAAGTGATTGTGGATCAGAATGCTCTGGATGCCTATGTTTTGATCG
AAGGGTATCTCAACCTCTTGCTTGAAAGAACCACCCTCCTCGAACAACAAAGTGAGTGCCCTGAGGAATTGAAAGAAGCGGTTGCAGGGTTGATATTTGCTGCTTCCAGA
TGTGGAGATTTCCCTGAACTTCATGAGATCAAATCAGTTTTGACCAATCGTTTTGGCAAGGAATTCACTGCTCGTGCCGTTGAATTACGCAACAATTGTGGAGTCAATCA
TTTGTTAATGCAAAAACTTTCCACAAGGCAGCCCAGTTTGGAGAGTAGAATGGAGGTCCTCAAAGCCATTGCTTCTGAGAATGGGATTGTATTGCAGATTGACCAACCTT
ACCCTTCCAATGAGGAAAAACTAGTCAGAAACACAAGGCAAAACCAGGCAGAGGCACACAGTGAAGTGAAAGAGAAGTTTCAATTTTTCACTGAAGTGCCATCTGGTTCT
CAACAGAAGTACAAAGATGTGGCGGATGCAGCGCAAGCCGCTTTCGAATCCGCAGCTCAAGCAGCAGCTGCTGCCCGAGCTGCCATGGAGCTCTCCCGCTCCCATGATGG
TCCAAGCAGCCCGAATAGCCCTGGCTCTGGAACAACTCCACATAACAAACAAGAGGTAGAGGAACCAAAGGTTGAATCAAAACCAAAACCAAAAATGGAATATGGAATTC
GGAGAGAAGGAGAAGGAGAAGGAGAAGGAGAAGAAGAAGTGAAAAATTCAGTGCCTGGTGCTTCTTCTCCAAGTAGAGAGTTCTGTTATGGGGAAAAGGAAAAGAGTGAC
ATGGAAACAAAGTTAGAGAAGACAGAGTTAAGTGAGAAGGCAAGCTTTCGTTTAAATCTGGAGAAGCAGCCAATTTCAGTGAGAACGAGAAGAGTGCGAGGATACTGAAA
AATATATGTAGTTCAGAATTGGGTTTGCTTTTGTATTCCTTCTACATATGTATAATTTTATTTTTGGTTTGGAAAGATCATGAAAATTGTGTAATTATGTGTGTGCAAAT
ATTGGAATTTTATTGTTAAATTCGTTACTTTACAATGAAAACAATACGTGGGTTAATATACAAAATGTAGAATGAACAACGAAAGTCAAGTCTAGCCTCTAAATTAGCCT
AAATTAAGAATAATTTACAACTTAAGAAATTCAAATATATATGGTACCACCCTTCCTTAAACTCAAGGTGGTAGAATAGAAACCAACTTGAGTTAGTAAAGCAATTGAGT
AAACGACTAGGTAAATGGTTTCGTGAGGATATCAACAGGTTGTTCGATAGTAGAGATGGATTGTAAGATGAGAGTGTTGCTATGGAGATGATGACGAA
Protein sequenceShow/hide protein sequence
MGKKLDAFLGRNFRASKFRPLLSLAVTRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIVDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAVAGL
IFAASRCGDFPELHEIKSVLTNRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPSLESRMEVLKAIASENGIVLQIDQPYPSNEEKLVRNTRQNQAEAHSEVKEKFQFFT
EVPSGSQQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSPGSGTTPHNKQEVEEPKVESKPKPKMEYGIRREGEGEGEGEEEVKNSVPGASSPSREFCYG
EKEKSDMETKLEKTELSEKASFRLNLEKQPISVRTRRVRGY