; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10003859 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10003859
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionRegulator of Vps4 activity in the MVB pathway protein
Genome locationChr08:10541255..10542837
RNA-Seq ExpressionHG10003859
SyntenyHG10003859
Gene Ontology termsGO:0015031 - protein transport (biological process)
InterPro domainsIPR005061 - Vacuolar protein sorting-associated protein Ist1
IPR042277 - Vacuolar protein sorting-associated protein IST1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601794.1 IST1-like protein, partial [Cucurbita argyrosperma subsp. sororia]2.6e-13172.91Show/hide
Query:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+ LDALLGRNFRASKFRPLLNLA+SRL+ILT QRRLR SQA SDVLQLLQL H  RALLRVE+VIKDQNALDAYVLIEGYLNLL+ER +LLEQ+ ECP
Subjt:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN
        EELKEA+AGLLFAASRCGDFPELHEIKSVLT+RFGKEFTARAVELRNNCGVNHLLMQKLSTR P LE+RM++LK IASENGI L++D+   SNE    RN
Subjt:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN

Query:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIR
         RQ Q+E    V ENL+FSTEV SGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRS     DGPSSP   GS T+ ++KQE +K+EVE+KPK EI 
Subjt:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIR

Query:  YRNRREGDEEEGSKKR--GEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIE-----TKVSEKASFRLNLEKKPISVRTRRVRGY
        Y     G EEEGSK R   EVKNS+ G     C    E   KE  EMDE+RA+N++ G ++E     T+++EK SFRLNLEKKPISVRTRRVRGY
Subjt:  YRNRREGDEEEGSKKR--GEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIE-----TKVSEKASFRLNLEKKPISVRTRRVRGY

XP_011656519.1 uncharacterized protein LOC101211044 [Cucumis sativus]1.8e-13274.55Show/hide
Query:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+KLDALLGRNFRASKFRPLLNL+LSRLSILT QRR+ CSQA+SDVLQLLQL HHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
Subjt:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN
        EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVN  LMQKLSTRQPTLE RM+ LK+IASENGIVL+IDQ   S + K+ RN
Subjt:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN

Query:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIRYRNR
         RQ++AE Q E     +FS EVASGSK  YKDVADAAQAAFESAAQAAAAARAAMELSRSH+GPSSP+  GSGT+  NKQ+ EK EVESK K E+     
Subjt:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIRYRNR

Query:  REGDEEEGSKKRGEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIETK-----VSEKASFRLNLEKKPISVRTRRVRGY
            EE G+ ++GE                 E +E+E   MDEER S   NGL +ETK     VSEK SFRLNLEKKPISVRTRRV GY
Subjt:  REGDEEEGSKKRGEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIETK-----VSEKASFRLNLEKKPISVRTRRVRGY

XP_022921602.1 uncharacterized protein LOC111429814 [Cucurbita moschata]1.8e-13273.42Show/hide
Query:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+ LDALLGRNFRASKFRPLLNLALSRL+ILT QRRLR SQA SDVLQLLQL H  RALLRVE+VIKDQNALDAYVLIEGYLNLL+ERT+LLEQQ +CP
Subjt:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN
        EELKEA+AGLLFAASRCGDFPELHEIKSVLT+RFGKEFTARAVELRNNCGVNHLLMQKLSTR P LE+RM++LK IASENGI L++D+   SNE    RN
Subjt:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN

Query:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIR
         RQ Q+E    V ENL+FSTEV SGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRS     DGPSSP   GS T+ ++KQE +K+EVE+KPK EI 
Subjt:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIR

Query:  YRNRREGDEEEGSKKR--GEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIE-----TKVSEKASFRLNLEKKPISVRTRRVRGY
        Y     G EEEGSK R   EVKNS+ G     C    E   KE  EMDE+RA+N++ G ++E     T+++EK SFRLNLEKKPISVRTRRVRGY
Subjt:  YRNRREGDEEEGSKKR--GEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIE-----TKVSEKASFRLNLEKKPISVRTRRVRGY

XP_023538782.1 uncharacterized protein LOC111799606 [Cucurbita pepo subsp. pepo]1.3e-13072.91Show/hide
Query:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+ LDALLGRNFRASKFRPLLNLALSRL+ILT QRRLR SQA SDVLQLLQL H  RALLRVE+VIKDQNALDAYVLIEGYLNLL+ERT+LLEQ+ ECP
Subjt:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN
        EELKEA+AGLLFAASRCGDFPELHEIKSVLT+RFGKEFTARAVELRNNCGVNHLLMQKLSTRQP LE+RM++LK IASENGI L++D+   SNE    RN
Subjt:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN

Query:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIR
          Q Q E    V ENL+ STEV SGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSR      DGPSSP   GSGT+ ++KQE +K++VE+KPK EI 
Subjt:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIR

Query:  YRNRREGDEEEGSKKR--GEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIE-----TKVSEKASFRLNLEKKPISVRTRRVRGY
        Y     G EEEGSK R   EVKNS+ G     C    E   KE  EMDE+RASN++ G ++E     T+++EK SFRLNLEKKPISVRTRRVRG+
Subjt:  YRNRREGDEEEGSKKR--GEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIE-----TKVSEKASFRLNLEKKPISVRTRRVRGY

XP_038884116.1 uncharacterized protein LOC120075040 [Benincasa hispida]2.6e-15580.56Show/hide
Query:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+KLDALLGRNFRASKFRPLLNLALSRLSILT QRR+RCSQA+SDVLQLLQLP+HHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
Subjt:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN
        EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLE+RMEVLKAIASENGIVL+ID   PSNE KLTRN
Subjt:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN

Query:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIRYRNR
         RQ Q E + EV ENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSP S GSGT+PH+KQE EKAEVESKP PE   RNR
Subjt:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIRYRNR

Query:  REG-------DEEEGSKKRGEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIE-----TKVSEKASFRLNLEKKPISVRTRRVRGY
        R G       +EEEG+K RG+                         EMDEER S +KNGL++E     T+VSEK SFRLNLEKKPISVRTRRVRG+
Subjt:  REG-------DEEEGSKKRGEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIE-----TKVSEKASFRLNLEKKPISVRTRRVRGY

TrEMBL top hitse value%identityAlignment
A0A0A0KAU0 Uncharacterized protein8.7e-13374.55Show/hide
Query:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+KLDALLGRNFRASKFRPLLNL+LSRLSILT QRR+ CSQA+SDVLQLLQL HHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
Subjt:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN
        EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVN  LMQKLSTRQPTLE RM+ LK+IASENGIVL+IDQ   S + K+ RN
Subjt:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN

Query:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIRYRNR
         RQ++AE Q E     +FS EVASGSK  YKDVADAAQAAFESAAQAAAAARAAMELSRSH+GPSSP+  GSGT+  NKQ+ EK EVESK K E+     
Subjt:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIRYRNR

Query:  REGDEEEGSKKRGEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIETK-----VSEKASFRLNLEKKPISVRTRRVRGY
            EE G+ ++GE                 E +E+E   MDEER S   NGL +ETK     VSEK SFRLNLEKKPISVRTRRV GY
Subjt:  REGDEEEGSKKRGEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIETK-----VSEKASFRLNLEKKPISVRTRRVRGY

A0A6J1CFG6 uncharacterized protein LOC1110103253.4e-12970.93Show/hide
Query:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+KLDALLGRNFRASKFRPLLNLALSRL++LT QR +R SQA SD LQLLQL HHHRALLRVE+VIK+QNALDAYVLIEGYLNLL+ERT LLEQ+ ECP
Subjt:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN
        EELKEAV+GL+FAASRCGDFPEL EIKSVLTTRFGKEFTARAVELRNNCGV+H +MQKLSTRQP LE+RM VL+AIASEN IVL++D+   SNE K  RN
Subjt:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN

Query:  VRQT-QAEAQIEVAENLQFSTEVASGS---KQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSHGSGTSPHNKQEAEKAEVESKPK
         RQ  Q E + EV +NLQFS +V SGS   KQKYKDVADAAQAAFESAAQAAAAARAAMELSRS     D PS+P   GSG S + +QE +K+EVESKPK
Subjt:  VRQT-QAEAQIEVAENLQFSTEVASGS---KQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSHGSGTSPHNKQEAEKAEVESKPK

Query:  PEIRYRNRREGDEEEGSKKRGEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIETKV-------SEKASFRLNLEKKPISVRTRRVRGY
         EI Y+NRRE +EE GSK   EVKNS+    SPS         +E SEM+E+RA N++ GLD+ET+        ++K SF LNLEKKP+SVRTRRVRG+
Subjt:  PEIRYRNRREGDEEEGSKKRGEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIETKV-------SEKASFRLNLEKKPISVRTRRVRGY

A0A6J1E0Z0 uncharacterized protein LOC1114298148.7e-13373.42Show/hide
Query:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+ LDALLGRNFRASKFRPLLNLALSRL+ILT QRRLR SQA SDVLQLLQL H  RALLRVE+VIKDQNALDAYVLIEGYLNLL+ERT+LLEQQ +CP
Subjt:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN
        EELKEA+AGLLFAASRCGDFPELHEIKSVLT+RFGKEFTARAVELRNNCGVNHLLMQKLSTR P LE+RM++LK IASENGI L++D+   SNE    RN
Subjt:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN

Query:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIR
         RQ Q+E    V ENL+FSTEV SGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRS     DGPSSP   GS T+ ++KQE +K+EVE+KPK EI 
Subjt:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIR

Query:  YRNRREGDEEEGSKKR--GEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIE-----TKVSEKASFRLNLEKKPISVRTRRVRGY
        Y     G EEEGSK R   EVKNS+ G     C    E   KE  EMDE+RA+N++ G ++E     T+++EK SFRLNLEKKPISVRTRRVRGY
Subjt:  YRNRREGDEEEGSKKR--GEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIE-----TKVSEKASFRLNLEKKPISVRTRRVRGY

A0A6J1E4E5 uncharacterized protein LOC1114298233.9e-12570.38Show/hide
Query:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+ LDALLGRNFRASKFRPLLNL LSRL+ILT QRRLR SQA SDVLQLLQL H  RALLRVE+VIKDQNALDAYVLIEGYLNLL+ER +LLEQ+ ECP
Subjt:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN
        EELKEA+AGLLFAASRCGDFPELHEIKS LT+RFGKEFTARAVELRNNCGVNHLLMQKLSTR P LE+RM++LK IASENGI L++D+   SNE    RN
Subjt:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN

Query:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIR
         RQ Q E    V ENL+FSTEV SGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSR      DGPSSP   GS T+ ++KQ  ++++VE+KPK EI 
Subjt:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIR

Query:  YRNRREGDEEEGSK--KRGEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIE-----TKVSEKASFRLNLEKKPISVRTRRVRGY
        Y     G EEEGSK   + EVKNS+ G     C    E   KE  EMDE+RA+N++ G ++E      +++EK SF LNLEKKPI VRT RVRGY
Subjt:  YRNRREGDEEEGSK--KRGEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIE-----TKVSEKASFRLNLEKKPISVRTRRVRGY

A0A6J1I2V3 uncharacterized protein LOC1114704478.1e-13172.91Show/hide
Query:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+ LDALLGRNFRASKFRPLLNLA+SRL+ILT QRRLR +QA SDVLQLLQL H  RALLRVE+VIKDQNALDAYVL+EGYLNLL+ERT+LLEQ+ ECP
Subjt:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN
        EELKEA+AGLLFAASRCGDFPELHEIKSVLTT FGKEFTARAVELRNNCGVNHLLMQKLSTRQP LE+RM++LK IASENGI L++D+   SNE    RN
Subjt:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN

Query:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIR
         RQ Q+E    V ENL+ STEV SGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRS     DGPSSP   GS TS ++KQE +K+ VE+KPK EI 
Subjt:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIR

Query:  YRNRREGDEEEGSKKR--GEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIE-----TKVSEKASFRLNLEKKPISVRTRRVRGY
        Y     G EEEGSK R   EVKNS+ G     C    E   KE  EMDE+RASN++ G ++E     T ++EK SFRLNLEKKPISVRTRRVRGY
Subjt:  YRNRREGDEEEGSKKR--GEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIE-----TKVSEKASFRLNLEKKPISVRTRRVRGY

SwissProt top hitse value%identityAlignment
P53990 IST1 homolog1.1e-1029.75Show/hide
Query:  LLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAV
        +LG  F+A + R  L L ++RL +L  ++     +A  ++   L      RA +RVE +I++   ++A  ++E Y +LLL R  L++   E    L E+V
Subjt:  LLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAV

Query:  AGLLFAASRC-GDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQP
        + L++AA R   +  EL  +   L  ++ KE+  +         VN  LM KLS   P
Subjt:  AGLLFAASRC-GDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQP

Q3ZBV1 IST1 homolog5.3e-1029.11Show/hide
Query:  LLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAV
        +LG   +A + R  L L ++RL +L  ++     +A  ++   L      RA +RVE +I++   ++A  ++E Y +LLL R  L++   E    L E+V
Subjt:  LLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAV

Query:  AGLLFAASRC-GDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQP
        + L++AA R   +  EL  +   L  ++ KE+  +         VN  LM KLS   P
Subjt:  AGLLFAASRC-GDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQP

Q54I39 IST1-like protein3.6e-1129.31Show/hide
Query:  GRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAVAG
        G ++ + K +  L LA+SR+ IL  ++         +V +LL+  +   A +RVE +I+D+  ++ + +IE    LL  R  L+   +E P E+KE++  
Subjt:  GRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAVAG

Query:  LLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNC----GVNHLLMQKLSTRQPTLENRMEVLKAIASE
        L++++ R    PEL +IK+ L  ++GK      +E   NC     VN  ++ KLS   P      + L  IA +
Subjt:  LLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNC----GVNHLLMQKLSTRQPTLENRMEVLKAIASE

Q568Z6 IST1 homolog1.1e-1029.75Show/hide
Query:  LLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAV
        +LG  F+A + R  L L ++RL +L  ++     +A  ++   L      RA +RVE +I++   ++A  ++E Y +LLL R  L++   E    L E+V
Subjt:  LLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAV

Query:  AGLLFAASRC-GDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQP
        + L++AA R   +  EL  +   L  ++ KE+  +         VN  LM KLS   P
Subjt:  AGLLFAASRC-GDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQP

Q9CX00 IST1 homolog1.1e-1029.75Show/hide
Query:  LLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAV
        +LG  F+A + R  L L ++RL +L  ++     +A  ++   L      RA +RVE +I++   ++A  ++E Y +LLL R  L++   E    L E+V
Subjt:  LLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAV

Query:  AGLLFAASRC-GDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQP
        + L++AA R   +  EL  +   L  ++ KE+  +         VN  LM KLS   P
Subjt:  AGLLFAASRC-GDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQP

Arabidopsis top hitse value%identityAlignment
AT1G13340.1 Regulator of Vps4 activity in the MVB pathway protein2.3e-6945.99Show/hide
Query:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MGKKLDALLGR+F+ +KF+ L+ LAL+RLSIL  QR+ R SQA SDV +LL+L  H  A  RV++V+KDQN LD    I GY  L L+R  L E   +CP
Subjt:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKL---
        EEL EAV+GLLFAASR G+FPEL EI++VL +RFGK+  AR++ELR+NCGV+  ++QKLSTR P  E RM+ LK IA+EN IVL++DQ S S EG     
Subjt:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKL---

Query:  -TRNVRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELS----RSHDGPSS---PNSHGSGTSPHNKQEAEKAEVES
         T +V +T+  ++    E    S  V  G K+KYKDVADAAQAAFESAA AA AA+AA+ELS    R HD P +    NS     +  ++QE E  +  S
Subjt:  -TRNVRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELS----RSHDGPSS---PNSHGSGTSPHNKQEAEKAEVES

Query:  KPKPEIRYRNRREGDEEEG-------SKKRGEV----KNSVAGDS----SPSCELCDEEKEK-EKSEMDEERASNLKNGLDIETKVSEKASFRLNLEKKP
        + + ++   ++R   + E        S +   V    K+++  DS     PS E     K K E++ M     ++ ++ +D   +  E    R    K P
Subjt:  KPKPEIRYRNRREGDEEEG-------SKKRGEV----KNSVAGDS----SPSCELCDEEKEK-EKSEMDEERASNLKNGLDIETKVSEKASFRLNLEKKP

Query:  ISVRTRRVRGY
        +SVRTR+VRGY
Subjt:  ISVRTRRVRGY

AT1G25420.1 Regulator of Vps4 activity in the MVB pathway protein5.2e-2930.57Show/hide
Query:  LDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELK
        L+ L  R    +K +  LNLA++R+ +L  +R ++      ++   LQ      A +RVE VI++ N   AY ++E +   +L R  +LE + ECP EL+
Subjt:  LDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELK

Query:  EAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLE-------------------
        EA+A ++FAA RC + P+L +IK++  T++GKEF   A ELR + GVN  +++KLS   P+   R+++LK IA E  +  +                   
Subjt:  EAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLE-------------------

Query:  --------IDQPSPSNEG----KLTRNVRQTQAE---------AQIEVAENLQFST-----EVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSR
                I +  PS +G     ++R V    AE         AQ  V++++  S      +    +++   DV + A+AA  SA +A AAARAA +L  
Subjt:  --------IDQPSPSNEG----KLTRNVRQTQAE---------AQIEVAENLQFST-----EVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSR

Query:  SHDGPSSPNSHGSG
           G ++P     G
Subjt:  SHDGPSSPNSHGSG

AT1G34220.2 Regulator of Vps4 activity in the MVB pathway protein3.5e-2528.08Show/hide
Query:  LDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELK
        LD+   + F+A+K + LL L + R+ ++  +R  +  Q   ++ +LL+      A +RVE +I+++  + A  ++E +  L+  R  ++E Q ECP +LK
Subjt:  LDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELK

Query:  EAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN----
        EA++ + FAA RC D  EL +++ +  +++GKEF A A EL+ + GVN  L++ LS R P+ E ++++LK IA E+    E+D    S E  L ++    
Subjt:  EAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN----

Query:  ---VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFE---------------SAAQAAAAARAAMELSRSHDGPSSPNSHGSGTSPHNKQE-
            +Q    +++ + E     T + S S  K K  +D+     +               ++  A  AA++A     SHD P    + G   +   + E 
Subjt:  ---VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFE---------------SAAQAAAAARAAMELSRSHDGPSSPNSHGSGTSPHNKQE-

Query:  ---AEKAEVESKPKPEI
           A K  VE +    I
Subjt:  ---AEKAEVESKPKPEI

AT2G19710.1 Regulator of Vps4 activity in the MVB pathway protein6.6e-2428.42Show/hide
Query:  LLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAV
        +L R F+ +K +  L +A SRL IL  ++ ++  Q   ++ QLL+      A +RVE V++++  + AY LI  Y  LL+ R  ++E Q  CP +LKEAV
Subjt:  LLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAV

Query:  AGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRNVRQTQAE
          +LFA+ R  D PEL EI    TT++GK+F+  AVELR + GV+ LL++KLS + P    ++++L AIA E+ +V E      S+     ++       
Subjt:  AGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRNVRQTQAE

Query:  AQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIRYRNRREGDEEE
           + A ++   + + S  +Q     A A   A   +++       + E S ++ G SS  S+   +   +     KA        E   RN   G E  
Subjt:  AQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIRYRNRREGDEEE

Query:  GSKKRGEVKNSVAGDSSPSCELCDEEKEKEK-SEMDEERASNLKNGLDIETKVSEKASFRLNLEKKPISVRTR
         S+ + + +     DS+ +     E  E+   +       SN +     ++  S  +S  +NL  +P   R R
Subjt:  GSKKRGEVKNSVAGDSSPSCELCDEEKEKEK-SEMDEERASNLKNGLDIETKVSEKASFRLNLEKKPISVRTR

AT4G35730.1 Regulator of Vps4 activity in the MVB pathway protein7.5e-2829.05Show/hide
Query:  ALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEA
        +L  R F +SK +    +A++R+ ++  +R +   Q   D+  LLQ      A +RVE VI++QN   A  +IE +  L++ R T++ +Q +CP +LKE 
Subjt:  ALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEA

Query:  VAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTR-------
        +A L+FAA RC + PEL +++ +   ++GK+F + A +LR +CGVN +L+ KLS R P  E +++++K IA E     ++D  +   E +L +       
Subjt:  VAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTR-------

Query:  -----------NVRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSHGSGTSPHNKQEAEKAEVE
                    V +      I+  + +  ST   S     Y D   AA+AA E A QA AAA+ A  L+   D  +   S  S  S H K         
Subjt:  -----------NVRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSHGSGTSPHNKQEAEKAEVE

Query:  SKPKPEIRYRN-------RREGDEEEGSKKRGEVKNSVAGDSSPSCELCDEEKEKEKS
          P    + R+        + G E  G  +R    N    +S    E  + E E +++
Subjt:  SKPKPEIRYRN-------RREGDEEEGSKKRGEVKNSVAGDSSPSCELCDEEKEKEKS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAAAAAGCTGGATGCTCTTCTTGGTAGGAATTTCAGAGCCTCCAAATTCCGTCCCCTTCTCAATCTTGCTCTCTCTCGCCTTTCCATCCTCACCACCCAACGCCG
TCTCAGATGCTCTCAAGCCCATTCCGACGTCCTCCAACTCCTCCAACTTCCCCACCATCACCGCGCTCTCCTTCGAGTTGAGAAAGTGATTAAGGATCAGAATGCTCTGG
ATGCTTATGTTTTGATTGAAGGCTACCTCAATCTCTTGCTTGAAAGAACCACCCTCCTCGAACAACAAAGCGAGTGCCCTGAGGAACTGAAAGAGGCGGTTGCAGGGCTG
CTATTTGCTGCTTCCAGATGTGGGGATTTCCCTGAACTTCATGAGATCAAATCAGTTTTGACCACTCGTTTTGGCAAGGAATTCACTGCTCGTGCTGTTGAATTACGCAA
CAATTGTGGAGTTAATCATTTGTTAATGCAAAAACTTTCCACAAGGCAGCCCACGTTGGAGAATAGAATGGAAGTCCTCAAAGCCATTGCTTCTGAGAATGGGATTGTAT
TGGAAATTGACCAACCTTCTCCTTCCAATGAGGGAAAACTAACAAGAAACGTAAGACAAACCCAGGCAGAGGCACAGATTGAAGTGGCAGAGAATTTGCAATTCTCCACT
GAAGTCGCATCTGGTTCTAAACAGAAGTACAAAGATGTGGCGGATGCAGCACAAGCTGCATTTGAATCAGCAGCTCAAGCAGCAGCTGCTGCACGAGCTGCCATGGAGCT
CTCCCGCTCCCATGACGGTCCAAGCAGCCCGAATAGCCATGGCTCTGGAACAAGTCCACATAACAAACAAGAGGCAGAGAAAGCAGAGGTCGAATCGAAACCAAAACCGG
AAATCAGATATCGAAATCGGAGAGAAGGAGATGAAGAAGAAGGCAGCAAAAAGAGAGGAGAAGTGAAGAATTCAGTGGCTGGTGATTCTTCTCCAAGTTGTGAGTTATGT
GATGAGGAAAAGGAAAAGGAAAAGAGTGAAATGGATGAAGAGAGAGCTTCAAATCTTAAAAATGGGTTGGACATTGAAACAAAGGTAAGTGAGAAGGCAAGCTTTCGTTT
AAATCTGGAGAAGAAGCCAATTTCAGTGAGAACAAGAAGAGTGCGAGGATACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAAAAAGCTGGATGCTCTTCTTGGTAGGAATTTCAGAGCCTCCAAATTCCGTCCCCTTCTCAATCTTGCTCTCTCTCGCCTTTCCATCCTCACCACCCAACGCCG
TCTCAGATGCTCTCAAGCCCATTCCGACGTCCTCCAACTCCTCCAACTTCCCCACCATCACCGCGCTCTCCTTCGAGTTGAGAAAGTGATTAAGGATCAGAATGCTCTGG
ATGCTTATGTTTTGATTGAAGGCTACCTCAATCTCTTGCTTGAAAGAACCACCCTCCTCGAACAACAAAGCGAGTGCCCTGAGGAACTGAAAGAGGCGGTTGCAGGGCTG
CTATTTGCTGCTTCCAGATGTGGGGATTTCCCTGAACTTCATGAGATCAAATCAGTTTTGACCACTCGTTTTGGCAAGGAATTCACTGCTCGTGCTGTTGAATTACGCAA
CAATTGTGGAGTTAATCATTTGTTAATGCAAAAACTTTCCACAAGGCAGCCCACGTTGGAGAATAGAATGGAAGTCCTCAAAGCCATTGCTTCTGAGAATGGGATTGTAT
TGGAAATTGACCAACCTTCTCCTTCCAATGAGGGAAAACTAACAAGAAACGTAAGACAAACCCAGGCAGAGGCACAGATTGAAGTGGCAGAGAATTTGCAATTCTCCACT
GAAGTCGCATCTGGTTCTAAACAGAAGTACAAAGATGTGGCGGATGCAGCACAAGCTGCATTTGAATCAGCAGCTCAAGCAGCAGCTGCTGCACGAGCTGCCATGGAGCT
CTCCCGCTCCCATGACGGTCCAAGCAGCCCGAATAGCCATGGCTCTGGAACAAGTCCACATAACAAACAAGAGGCAGAGAAAGCAGAGGTCGAATCGAAACCAAAACCGG
AAATCAGATATCGAAATCGGAGAGAAGGAGATGAAGAAGAAGGCAGCAAAAAGAGAGGAGAAGTGAAGAATTCAGTGGCTGGTGATTCTTCTCCAAGTTGTGAGTTATGT
GATGAGGAAAAGGAAAAGGAAAAGAGTGAAATGGATGAAGAGAGAGCTTCAAATCTTAAAAATGGGTTGGACATTGAAACAAAGGTAAGTGAGAAGGCAAGCTTTCGTTT
AAATCTGGAGAAGAAGCCAATTTCAGTGAGAACAAGAAGAGTGCGAGGATACTGA
Protein sequenceShow/hide protein sequence
MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAVAGL
LFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRNVRQTQAEAQIEVAENLQFST
EVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIRYRNRREGDEEEGSKKRGEVKNSVAGDSSPSCELC
DEEKEKEKSEMDEERASNLKNGLDIETKVSEKASFRLNLEKKPISVRTRRVRGY