CuGenDBv2

Lsi08G003990 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi08G003990
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Regulator of Vps4 activity in the MVB pathway protein
Genome location: chr08:11030842..11034328
RNA-Seq Expression: Lsi08G003990
Synteny: Lsi08G003990
Gene Ontology terms: GO:0015031 - protein transport (biological process)
InterPro domains: IPR005061 - Vacuolar protein sorting-associated protein Ist1
                  IPR042277 - Vacuolar protein sorting-associated protein IST1-like
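
The genome location in this record, chr08:11030842..11034328, follows a "chromosome:start..end" pattern. The following is a minimal sketch of how such a string could be split and the gene span computed. The function names `parse_location` and `span_length` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr08:11030842..11034328")
gene_span = span_length(start, end)
print(chrom, start, end, gene_span)
```

Under the inclusive-coordinate assumption the span is 3487 bp; with half-open coordinates it would be one base shorter.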


Homology
GenBank top hits
KAG6601794.1 IST1-like protein, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.6e-131; % identity: 72.91)
Query:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+ LDALLGRNFRASKFRPLLNLA+SRL+ILT QRRLR SQA SDVLQLLQL H  RALLRVE+VIKDQNALDAYVLIEGYLNLL+ER +LLEQ+ ECP
Subjt:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN
        EELKEA+AGLLFAASRCGDFPELHEIKSVLT+RFGKEFTARAVELRNNCGVNHLLMQKLSTR P LE+RM++LK IASENGI L++D+   SNE    RN
Subjt:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN

Query:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIR
         RQ Q+E    V ENL+FSTEV SGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRS     DGPSSP   GS T+ ++KQE +K+EVE+KPK EI 
Subjt:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIR

Query:  YRNRREGDEEEGSKKR--GEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIE-----TKVSEKASFRLNLEKKPISVRTRRVRGY
        Y     G EEEGSK R   EVKNS+ G     C    E   KE  EMDE+RA+N++ G ++E     T+++EK SFRLNLEKKPISVRTRRVRGY
Subjt:  YRNRREGDEEEGSKKR--GEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIE-----TKVSEKASFRLNLEKKPISVRTRRVRGY

XP_011656519.1 uncharacterized protein LOC101211044 [Cucumis sativus] (e value: 1.8e-132; % identity: 74.55)
Query:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+KLDALLGRNFRASKFRPLLNL+LSRLSILT QRR+ CSQA+SDVLQLLQL HHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
Subjt:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN
        EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVN  LMQKLSTRQPTLE RM+ LK+IASENGIVL+IDQ   S + K+ RN
Subjt:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN

Query:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIRYRNR
         RQ++AE Q E     +FS EVASGSK  YKDVADAAQAAFESAAQAAAAARAAMELSRSH+GPSSP+  GSGT+  NKQ+ EK EVESK K E+     
Subjt:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIRYRNR

Query:  REGDEEEGSKKRGEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIETK-----VSEKASFRLNLEKKPISVRTRRVRGY
            EE G+ ++GE                 E +E+E   MDEER S   NGL +ETK     VSEK SFRLNLEKKPISVRTRRV GY
Subjt:  REGDEEEGSKKRGEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIETK-----VSEKASFRLNLEKKPISVRTRRVRGY

XP_022921602.1 uncharacterized protein LOC111429814 [Cucurbita moschata] (e value: 1.8e-132; % identity: 73.42)
Query:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+ LDALLGRNFRASKFRPLLNLALSRL+ILT QRRLR SQA SDVLQLLQL H  RALLRVE+VIKDQNALDAYVLIEGYLNLL+ERT+LLEQQ +CP
Subjt:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN
        EELKEA+AGLLFAASRCGDFPELHEIKSVLT+RFGKEFTARAVELRNNCGVNHLLMQKLSTR P LE+RM++LK IASENGI L++D+   SNE    RN
Subjt:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN

Query:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIR
         RQ Q+E    V ENL+FSTEV SGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRS     DGPSSP   GS T+ ++KQE +K+EVE+KPK EI 
Subjt:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIR

Query:  YRNRREGDEEEGSKKR--GEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIE-----TKVSEKASFRLNLEKKPISVRTRRVRGY
        Y     G EEEGSK R   EVKNS+ G     C    E   KE  EMDE+RA+N++ G ++E     T+++EK SFRLNLEKKPISVRTRRVRGY
Subjt:  YRNRREGDEEEGSKKR--GEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIE-----TKVSEKASFRLNLEKKPISVRTRRVRGY

XP_023538782.1 uncharacterized protein LOC111799606 [Cucurbita pepo subsp. pepo] (e value: 1.3e-130; % identity: 72.91)
Query:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+ LDALLGRNFRASKFRPLLNLALSRL+ILT QRRLR SQA SDVLQLLQL H  RALLRVE+VIKDQNALDAYVLIEGYLNLL+ERT+LLEQ+ ECP
Subjt:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN
        EELKEA+AGLLFAASRCGDFPELHEIKSVLT+RFGKEFTARAVELRNNCGVNHLLMQKLSTRQP LE+RM++LK IASENGI L++D+   SNE    RN
Subjt:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN

Query:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIR
          Q Q E    V ENL+ STEV SGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSR      DGPSSP   GSGT+ ++KQE +K++VE+KPK EI 
Subjt:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIR

Query:  YRNRREGDEEEGSKKR--GEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIE-----TKVSEKASFRLNLEKKPISVRTRRVRGY
        Y     G EEEGSK R   EVKNS+ G     C    E   KE  EMDE+RASN++ G ++E     T+++EK SFRLNLEKKPISVRTRRVRG+
Subjt:  YRNRREGDEEEGSKKR--GEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIE-----TKVSEKASFRLNLEKKPISVRTRRVRGY

XP_038884116.1 uncharacterized protein LOC120075040 [Benincasa hispida] (e value: 2.6e-155; % identity: 80.56)
Query:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+KLDALLGRNFRASKFRPLLNLALSRLSILT QRR+RCSQA+SDVLQLLQLP+HHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
Subjt:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN
        EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLE+RMEVLKAIASENGIVL+ID   PSNE KLTRN
Subjt:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN

Query:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIRYRNR
         RQ Q E + EV ENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSP S GSGT+PH+KQE EKAEVESKP PE   RNR
Subjt:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIRYRNR

Query:  REG-------DEEEGSKKRGEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIE-----TKVSEKASFRLNLEKKPISVRTRRVRGY
        R G       +EEEG+K RG+                         EMDEER S +KNGL++E     T+VSEK SFRLNLEKKPISVRTRRVRG+
Subjt:  REG-------DEEEGSKKRGEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIE-----TKVSEKASFRLNLEKKPISVRTRRVRGY
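
In the alignments above, the middle line of each block marks each column: a letter for an identical residue, "+" for a conservative substitution, and a space for a mismatch or gap. This is the usual pairwise-alignment convention and is assumed here rather than stated by the page. The sketch below counts identities and positives for a single midline string. The function name `midline_stats` is hypothetical, and the input is only the first ten columns of the Cucumis sativus alignment, so the result is not comparable to the reported % identity, which the search tool computes over the full alignment.

```python
def midline_stats(midline: str) -> tuple[int, int, float]:
    """Return (identities, positives, percent identity) for one midline.

    Assumed convention: letter = identical residue, '+' = conservative
    substitution, space = mismatch or gap.
    """
    if not midline:
        raise ValueError("midline must not be empty")
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    percent_identity = 100.0 * identities / len(midline)
    return identities, positives, percent_identity


# First ten alignment columns of the XP_011656519.1 hit (query MGKKLDALLG).
identities, positives, percent_identity = midline_stats("MG+KLDALLG")
print(identities, positives, percent_identity)
```

For a whole hit, the midline segments of every block would need to be concatenated with their leading and trailing spaces preserved before counting.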

TrEMBL top hits
A0A0A0KAU0 Uncharacterized protein (e value: 8.7e-133; % identity: 74.55)
Query:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+KLDALLGRNFRASKFRPLLNL+LSRLSILT QRR+ CSQA+SDVLQLLQL HHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
Subjt:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN
        EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVN  LMQKLSTRQPTLE RM+ LK+IASENGIVL+IDQ   S + K+ RN
Subjt:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN

Query:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIRYRNR
         RQ++AE Q E     +FS EVASGSK  YKDVADAAQAAFESAAQAAAAARAAMELSRSH+GPSSP+  GSGT+  NKQ+ EK EVESK K E+     
Subjt:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIRYRNR

Query:  REGDEEEGSKKRGEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIETK-----VSEKASFRLNLEKKPISVRTRRVRGY
            EE G+ ++GE                 E +E+E   MDEER S   NGL +ETK     VSEK SFRLNLEKKPISVRTRRV GY
Subjt:  REGDEEEGSKKRGEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIETK-----VSEKASFRLNLEKKPISVRTRRVRGY

A0A6J1CFG6 uncharacterized protein LOC111010325 (e value: 3.4e-129; % identity: 70.93)
Query:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+KLDALLGRNFRASKFRPLLNLALSRL++LT QR +R SQA SD LQLLQL HHHRALLRVE+VIK+QNALDAYVLIEGYLNLL+ERT LLEQ+ ECP
Subjt:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN
        EELKEAV+GL+FAASRCGDFPEL EIKSVLTTRFGKEFTARAVELRNNCGV+H +MQKLSTRQP LE+RM VL+AIASEN IVL++D+   SNE K  RN
Subjt:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN

Query:  VRQT-QAEAQIEVAENLQFSTEVASGS---KQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSHGSGTSPHNKQEAEKAEVESKPK
         RQ  Q E + EV +NLQFS +V SGS   KQKYKDVADAAQAAFESAAQAAAAARAAMELSRS     D PS+P   GSG S + +QE +K+EVESKPK
Subjt:  VRQT-QAEAQIEVAENLQFSTEVASGS---KQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSHGSGTSPHNKQEAEKAEVESKPK

Query:  PEIRYRNRREGDEEEGSKKRGEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIETKV-------SEKASFRLNLEKKPISVRTRRVRGY
         EI Y+NRRE +EE GSK   EVKNS+    SPS         +E SEM+E+RA N++ GLD+ET+        ++K SF LNLEKKP+SVRTRRVRG+
Subjt:  PEIRYRNRREGDEEEGSKKRGEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIETKV-------SEKASFRLNLEKKPISVRTRRVRGY

A0A6J1E0Z0 uncharacterized protein LOC111429814 (e value: 8.7e-133; % identity: 73.42)
Query:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+ LDALLGRNFRASKFRPLLNLALSRL+ILT QRRLR SQA SDVLQLLQL H  RALLRVE+VIKDQNALDAYVLIEGYLNLL+ERT+LLEQQ +CP
Subjt:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN
        EELKEA+AGLLFAASRCGDFPELHEIKSVLT+RFGKEFTARAVELRNNCGVNHLLMQKLSTR P LE+RM++LK IASENGI L++D+   SNE    RN
Subjt:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN

Query:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIR
         RQ Q+E    V ENL+FSTEV SGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRS     DGPSSP   GS T+ ++KQE +K+EVE+KPK EI 
Subjt:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIR

Query:  YRNRREGDEEEGSKKR--GEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIE-----TKVSEKASFRLNLEKKPISVRTRRVRGY
        Y     G EEEGSK R   EVKNS+ G     C    E   KE  EMDE+RA+N++ G ++E     T+++EK SFRLNLEKKPISVRTRRVRGY
Subjt:  YRNRREGDEEEGSKKR--GEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIE-----TKVSEKASFRLNLEKKPISVRTRRVRGY

A0A6J1E4E5 uncharacterized protein LOC111429823 (e value: 3.9e-125; % identity: 70.38)
Query:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+ LDALLGRNFRASKFRPLLNL LSRL+ILT QRRLR SQA SDVLQLLQL H  RALLRVE+VIKDQNALDAYVLIEGYLNLL+ER +LLEQ+ ECP
Subjt:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN
        EELKEA+AGLLFAASRCGDFPELHEIKS LT+RFGKEFTARAVELRNNCGVNHLLMQKLSTR P LE+RM++LK IASENGI L++D+   SNE    RN
Subjt:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN

Query:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIR
         RQ Q E    V ENL+FSTEV SGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSR      DGPSSP   GS T+ ++KQ  ++++VE+KPK EI 
Subjt:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIR

Query:  YRNRREGDEEEGSK--KRGEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIE-----TKVSEKASFRLNLEKKPISVRTRRVRGY
        Y     G EEEGSK   + EVKNS+ G     C    E   KE  EMDE+RA+N++ G ++E      +++EK SF LNLEKKPI VRT RVRGY
Subjt:  YRNRREGDEEEGSK--KRGEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIE-----TKVSEKASFRLNLEKKPISVRTRRVRGY

A0A6J1I2V3 uncharacterized protein LOC111470447 (e value: 8.1e-131; % identity: 72.91)
Query:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MG+ LDALLGRNFRASKFRPLLNLA+SRL+ILT QRRLR +QA SDVLQLLQL H  RALLRVE+VIKDQNALDAYVL+EGYLNLL+ERT+LLEQ+ ECP
Subjt:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN
        EELKEA+AGLLFAASRCGDFPELHEIKSVLTT FGKEFTARAVELRNNCGVNHLLMQKLSTRQP LE+RM++LK IASENGI L++D+   SNE    RN
Subjt:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN

Query:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIR
         RQ Q+E    V ENL+ STEV SGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRS     DGPSSP   GS TS ++KQE +K+ VE+KPK EI 
Subjt:  VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSH----DGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIR

Query:  YRNRREGDEEEGSKKR--GEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIE-----TKVSEKASFRLNLEKKPISVRTRRVRGY
        Y     G EEEGSK R   EVKNS+ G     C    E   KE  EMDE+RASN++ G ++E     T ++EK SFRLNLEKKPISVRTRRVRGY
Subjt:  YRNRREGDEEEGSKKR--GEVKNSVAGDSSPSCELCDEEKEKEKSEMDEERASNLKNGLDIE-----TKVSEKASFRLNLEKKPISVRTRRVRGY

SwissProt top hits
P53990 IST1 homolog (e value: 1.1e-10; % identity: 29.75)
Query:  LLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAV
        +LG  F+A + R  L L ++RL +L  ++     +A  ++   L      RA +RVE +I++   ++A  ++E Y +LLL R  L++   E    L E+V
Subjt:  LLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAV

Query:  AGLLFAASRC-GDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQP
        + L++AA R   +  EL  +   L  ++ KE+  +         VN  LM KLS   P
Subjt:  AGLLFAASRC-GDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQP

Q3ZBV1 IST1 homolog (e value: 5.3e-10; % identity: 29.11)
Query:  LLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAV
        +LG   +A + R  L L ++RL +L  ++     +A  ++   L      RA +RVE +I++   ++A  ++E Y +LLL R  L++   E    L E+V
Subjt:  LLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAV

Query:  AGLLFAASRC-GDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQP
        + L++AA R   +  EL  +   L  ++ KE+  +         VN  LM KLS   P
Subjt:  AGLLFAASRC-GDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQP

Q54I39 IST1-like protein (e value: 3.6e-11; % identity: 29.31)
Query:  GRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAVAG
        G ++ + K +  L LA+SR+ IL  ++         +V +LL+  +   A +RVE +I+D+  ++ + +IE    LL  R  L+   +E P E+KE++  
Subjt:  GRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAVAG

Query:  LLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNC----GVNHLLMQKLSTRQPTLENRMEVLKAIASE
        L++++ R    PEL +IK+ L  ++GK      +E   NC     VN  ++ KLS   P      + L  IA +
Subjt:  LLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNC----GVNHLLMQKLSTRQPTLENRMEVLKAIASE

Q568Z6 IST1 homolog (e value: 1.1e-10; % identity: 29.75)
Query:  LLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAV
        +LG  F+A + R  L L ++RL +L  ++     +A  ++   L      RA +RVE +I++   ++A  ++E Y +LLL R  L++   E    L E+V
Subjt:  LLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAV

Query:  AGLLFAASRC-GDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQP
        + L++AA R   +  EL  +   L  ++ KE+  +         VN  LM KLS   P
Subjt:  AGLLFAASRC-GDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQP

Q9CX00 IST1 homolog (e value: 1.1e-10; % identity: 29.75)
Query:  LLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAV
        +LG  F+A + R  L L ++RL +L  ++     +A  ++   L      RA +RVE +I++   ++A  ++E Y +LLL R  L++   E    L E+V
Subjt:  LLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAV

Query:  AGLLFAASRC-GDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQP
        + L++AA R   +  EL  +   L  ++ KE+  +         VN  LM KLS   P
Subjt:  AGLLFAASRC-GDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQP

Arabidopsis top hits
AT1G13340.1 Regulator of Vps4 activity in the MVB pathway protein (e value: 2.3e-69; % identity: 45.99)
Query:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP
        MGKKLDALLGR+F+ +KF+ L+ LAL+RLSIL  QR+ R SQA SDV +LL+L  H  A  RV++V+KDQN LD    I GY  L L+R  L E   +CP
Subjt:  MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECP

Query:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKL---
        EEL EAV+GLLFAASR G+FPEL EI++VL +RFGK+  AR++ELR+NCGV+  ++QKLSTR P  E RM+ LK IA+EN IVL++DQ S S EG     
Subjt:  EELKEAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKL---

Query:  -TRNVRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELS----RSHDGPSS---PNSHGSGTSPHNKQEAEKAEVES
         T +V +T+  ++    E    S  V  G K+KYKDVADAAQAAFESAA AA AA+AA+ELS    R HD P +    NS     +  ++QE E  +  S
Subjt:  -TRNVRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELS----RSHDGPSS---PNSHGSGTSPHNKQEAEKAEVES

Query:  KPKPEIRYRNRREGDEEEG-------SKKRGEV----KNSVAGDS----SPSCELCDEEKEK-EKSEMDEERASNLKNGLDIETKVSEKASFRLNLEKKP
        + + ++   ++R   + E        S +   V    K+++  DS     PS E     K K E++ M     ++ ++ +D   +  E    R    K P
Subjt:  KPKPEIRYRNRREGDEEEG-------SKKRGEV----KNSVAGDS----SPSCELCDEEKEK-EKSEMDEERASNLKNGLDIETKVSEKASFRLNLEKKP

Query:  ISVRTRRVRGY
        +SVRTR+VRGY
Subjt:  ISVRTRRVRGY

AT1G25420.1 Regulator of Vps4 activity in the MVB pathway protein (e value: 5.2e-29; % identity: 30.57)
Query:  LDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELK
        L+ L  R    +K +  LNLA++R+ +L  +R ++      ++   LQ      A +RVE VI++ N   AY ++E +   +L R  +LE + ECP EL+
Subjt:  LDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELK

Query:  EAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLE-------------------
        EA+A ++FAA RC + P+L +IK++  T++GKEF   A ELR + GVN  +++KLS   P+   R+++LK IA E  +  +                   
Subjt:  EAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLE-------------------

Query:  --------IDQPSPSNEG----KLTRNVRQTQAE---------AQIEVAENLQFST-----EVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSR
                I +  PS +G     ++R V    AE         AQ  V++++  S      +    +++   DV + A+AA  SA +A AAARAA +L  
Subjt:  --------IDQPSPSNEG----KLTRNVRQTQAE---------AQIEVAENLQFST-----EVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSR

Query:  SHDGPSSPNSHGSG
           G ++P     G
Subjt:  SHDGPSSPNSHGSG

AT1G34220.2 Regulator of Vps4 activity in the MVB pathway protein (e value: 3.5e-25; % identity: 28.08)
Query:  LDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELK
        LD+   + F+A+K + LL L + R+ ++  +R  +  Q   ++ +LL+      A +RVE +I+++  + A  ++E +  L+  R  ++E Q ECP +LK
Subjt:  LDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELK

Query:  EAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN----
        EA++ + FAA RC D  EL +++ +  +++GKEF A A EL+ + GVN  L++ LS R P+ E ++++LK IA E+    E+D    S E  L ++    
Subjt:  EAVAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRN----

Query:  ---VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFE---------------SAAQAAAAARAAMELSRSHDGPSSPNSHGSGTSPHNKQE-
            +Q    +++ + E     T + S S  K K  +D+     +               ++  A  AA++A     SHD P    + G   +   + E 
Subjt:  ---VRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFE---------------SAAQAAAAARAAMELSRSHDGPSSPNSHGSGTSPHNKQE-

Query:  ---AEKAEVESKPKPEI
           A K  VE +    I
Subjt:  ---AEKAEVESKPKPEI

AT2G19710.1 Regulator of Vps4 activity in the MVB pathway protein (e value: 6.6e-24; % identity: 28.42)
Query:  LLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAV
        +L R F+ +K +  L +A SRL IL  ++ ++  Q   ++ QLL+      A +RVE V++++  + AY LI  Y  LL+ R  ++E Q  CP +LKEAV
Subjt:  LLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAV

Query:  AGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRNVRQTQAE
          +LFA+ R  D PEL EI    TT++GK+F+  AVELR + GV+ LL++KLS + P    ++++L AIA E+ +V E      S+     ++       
Subjt:  AGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRNVRQTQAE

Query:  AQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIRYRNRREGDEEE
           + A ++   + + S  +Q     A A   A   +++       + E S ++ G SS  S+   +   +     KA        E   RN   G E  
Subjt:  AQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIRYRNRREGDEEE

Query:  GSKKRGEVKNSVAGDSSPSCELCDEEKEKEK-SEMDEERASNLKNGLDIETKVSEKASFRLNLEKKPISVRTR
         S+ + + +     DS+ +     E  E+   +       SN +     ++  S  +S  +NL  +P   R R
Subjt:  GSKKRGEVKNSVAGDSSPSCELCDEEKEKEK-SEMDEERASNLKNGLDIETKVSEKASFRLNLEKKPISVRTR

AT4G35730.1 Regulator of Vps4 activity in the MVB pathway protein (e value: 7.5e-28; % identity: 29.05)
Query:  ALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEA
        +L  R F +SK +    +A++R+ ++  +R +   Q   D+  LLQ      A +RVE VI++QN   A  +IE +  L++ R T++ +Q +CP +LKE 
Subjt:  ALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEA

Query:  VAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTR-------
        +A L+FAA RC + PEL +++ +   ++GK+F + A +LR +CGVN +L+ KLS R P  E +++++K IA E     ++D  +   E +L +       
Subjt:  VAGLLFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTR-------

Query:  -----------NVRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSHGSGTSPHNKQEAEKAEVE
                    V +      I+  + +  ST   S     Y D   AA+AA E A QA AAA+ A  L+   D  +   S  S  S H K         
Subjt:  -----------NVRQTQAEAQIEVAENLQFSTEVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSHGSGTSPHNKQEAEKAEVE

Query:  SKPKPEIRYRN-------RREGDEEEGSKKRGEVKNSVAGDSSPSCELCDEEKEKEKS
          P    + R+        + G E  G  +R    N    +S    E  + E E +++
Subjt:  SKPKPEIRYRN-------RREGDEEEGSKKRGEVKNSVAGDSSPSCELCDEEKEKEKS


Sequences
CDS sequence
ATGGGAAAAAAGCTGGATGCTCTTCTTGGTAGGAATTTCAGAGCCTCCAAATTCCGTCCCCTTCTCAATCTTGCTCTCTCTCGCCTTTCCATCCTCACCACCCAACGCCG
TCTCAGATGCTCTCAAGCCCATTCCGACGTCCTCCAACTCCTCCAACTTCCCCACCATCACCGCGCTCTCCTTCGAGTTGAGAAAGTGATTAAGGATCAGAATGCTCTGG
ATGCTTATGTTTTGATTGAAGGCTACCTCAATCTCTTGCTTGAAAGAACCACCCTCCTCGAACAACAAAGCGAGTGCCCTGAGGAACTGAAAGAGGCGGTTGCAGGGCTG
CTATTTGCTGCTTCCAGATGTGGGGATTTCCCTGAACTTCATGAGATCAAATCAGTTTTGACCACTCGTTTTGGCAAGGAATTCACTGCTCGTGCTGTTGAATTACGCAA
CAATTGTGGAGTTAATCATTTGTTAATGCAAAAACTTTCCACAAGGCAGCCCACGTTGGAGAATAGAATGGAAGTCCTCAAAGCCATTGCTTCTGAGAATGGGATTGTAT
TGGAAATTGACCAACCTTCTCCTTCCAATGAGGGAAAACTAACAAGAAACGTAAGACAAACCCAGGCAGAGGCACAGATTGAAGTGGCAGAGAATTTGCAATTCTCCACT
GAAGTCGCATCTGGTTCTAAACAGAAGTACAAAGATGTGGCGGATGCAGCACAAGCTGCATTTGAATCAGCAGCTCAAGCAGCAGCTGCTGCACGAGCTGCCATGGAGCT
CTCCCGCTCCCATGACGGTCCAAGCAGCCCGAATAGCCATGGCTCTGGAACAAGTCCACATAACAAACAAGAGGCAGAGAAAGCAGAGGTCGAATCGAAACCAAAACCGG
AAATCAGATATCGAAATCGGAGAGAAGGAGATGAAGAAGAAGGCAGCAAAAAGAGAGGAGAAGTGAAGAATTCAGTGGCTGGTGATTCTTCTCCAAGTTGTGAGTTATGT
GATGAGGAAAAGGAAAAGGAAAAGAGTGAAATGGATGAAGAGAGAGCTTCAAATCTTAAAAATGGGTTGGACATTGAAACAAAGGTAAGTGAGAAGGCAAGCTTTCGTTT
AAATCTGGAGAAGAAGCCAATTTCAGTGAGAACAAGAAGAGTGCGAGGATACTGA
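
The CDS above should translate to the protein sequence listed for this gene. The following is a minimal sketch of that check using the standard genetic code, built here without external libraries. The names `CODON_TABLE` and `translate` are hypothetical, and only the first 30 and last 18 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace(" ", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last six codons of the CDS shown above.
cds_head = "ATG GGA AAA AAG CTG GAT GCT CTT CTT GGT"
cds_tail = "AGA GTG CGA GGA TAC TGA"
print(translate(cds_head))
print(translate(cds_tail))
```

Passing the full CDS (with line breaks removed) to `translate` and comparing the result with the protein sequence would extend the same check to the whole gene.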
mRNA sequence
TTTTCACACAACAATCAGAAATGTTTCAGCGAAGACTTCGTCTATAAGCAATACACAGAAATCCAATCCTTCCCACGCGCTTCTTCTTCTTCAACCCTTCTTCACACTGC
CTTGCCCCTGCCCTCCCCTGCTCTCCACTCTCAAACAACATATATAAACAAATCAATTCAATTTCCATATAATCTGAATTAACCCATTTCGCCATGGGAAAAAAGCTGGA
TGCTCTTCTTGGTAGGAATTTCAGAGCCTCCAAATTCCGTCCCCTTCTCAATCTTGCTCTCTCTCGCCTTTCCATCCTCACCACCCAACGCCGTCTCAGATGCTCTCAAG
CCCATTCCGACGTCCTCCAACTCCTCCAACTTCCCCACCATCACCGCGCTCTCCTTCGAGTTGAGAAAGTGATTAAGGATCAGAATGCTCTGGATGCTTATGTTTTGATT
GAAGGCTACCTCAATCTCTTGCTTGAAAGAACCACCCTCCTCGAACAACAAAGCGAGTGCCCTGAGGAACTGAAAGAGGCGGTTGCAGGGCTGCTATTTGCTGCTTCCAG
ATGTGGGGATTTCCCTGAACTTCATGAGATCAAATCAGTTTTGACCACTCGTTTTGGCAAGGAATTCACTGCTCGTGCTGTTGAATTACGCAACAATTGTGGAGTTAATC
ATTTGTTAATGCAAAAACTTTCCACAAGGCAGCCCACGTTGGAGAATAGAATGGAAGTCCTCAAAGCCATTGCTTCTGAGAATGGGATTGTATTGGAAATTGACCAACCT
TCTCCTTCCAATGAGGGAAAACTAACAAGAAACGTAAGACAAACCCAGGCAGAGGCACAGATTGAAGTGGCAGAGAATTTGCAATTCTCCACTGAAGTCGCATCTGGTTC
TAAACAGAAGTACAAAGATGTGGCGGATGCAGCACAAGCTGCATTTGAATCAGCAGCTCAAGCAGCAGCTGCTGCACGAGCTGCCATGGAGCTCTCCCGCTCCCATGACG
GTCCAAGCAGCCCGAATAGCCATGGCTCTGGAACAAGTCCACATAACAAACAAGAGGCAGAGAAAGCAGAGGTCGAATCGAAACCAAAACCGGAAATCAGATATCGAAAT
CGGAGAGAAGGAGATGAAGAAGAAGGCAGCAAAAAGAGAGGAGAAGTGAAGAATTCAGTGGCTGGTGATTCTTCTCCAAGTTGTGAGTTATGTGATGAGGAAAAGGAAAA
GGAAAAGAGTGAAATGGATGAAGAGAGAGCTTCAAATCTTAAAAATGGGTTGGACATTGAAACAAAGGTAAGTGAGAAGGCAAGCTTTCGTTTAAATCTGGAGAAGAAGC
CAATTTCAGTGAGAACAAGAAGAGTGCGAGGATACTGAAGAACATATGTATTTGGATTCTTTCTACACAAACATATATAGTTTTATTTTTGGTTTGGAAAGATCATGGAG
AGTGTAATTATGTGGTTTGGGAATTTTGACAATGTGTGAAAAATACAACATTTGTGTTTGTTAGACAATGGTATATTGGTATATTCACATTAATTTGACATATTCACGCT
TGATTTTTGGACATCCCTATTATTTAATCAAGTAAACTGTTTTAATTGTTAAATATTATCTTAGTATTAAGATGTATATATGCACTAAAAAAAAAAGAAAAAGATGTGTA
TGGTCGATTTGAATATGTCAAATATTATTTGATCATTTTGTCTATGACTAAAGTTTAAAGTAGAATGAAAACTTATT
Protein sequence
MGKKLDALLGRNFRASKFRPLLNLALSRLSILTTQRRLRCSQAHSDVLQLLQLPHHHRALLRVEKVIKDQNALDAYVLIEGYLNLLLERTTLLEQQSECPEELKEAVAGL
LFAASRCGDFPELHEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLSTRQPTLENRMEVLKAIASENGIVLEIDQPSPSNEGKLTRNVRQTQAEAQIEVAENLQFST
EVASGSKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSHDGPSSPNSHGSGTSPHNKQEAEKAEVESKPKPEIRYRNRREGDEEEGSKKRGEVKNSVAGDSSPSCELC
DEEKEKEKSEMDEERASNLKNGLDIETKVSEKASFRLNLEKKPISVRTRRVRGY