; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr025981 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr025981
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionRegulator of Vps4 activity in the MVB pathway protein
Genome locationtig00153017:2238298..2240139
RNA-Seq ExpressionSgr025981
SyntenySgr025981
Gene Ontology termsGO:0015031 - protein transport (biological process)
InterPro domainsIPR005061 - Vacuolar protein sorting-associated protein Ist1
IPR042277 - Vacuolar protein sorting-associated protein IST1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601794.1 IST1-like protein, partial [Cucurbita argyrosperma subsp. sororia]2.2e-13775.44Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECP
        MGR LDALLGRNFRASKFRPLLNLA SRLAILTNQR +RRSQA+SDVLQLLQL H  RALLRVEQVIK+QN LDAYVLIEGYLNLLIER +LLEQERECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECP

Query:  EELKEAVSGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASEDGIALQLDEPPLPNEEKLGGN
        EELKEA++GLLFAASRCGDFPEL EIKSVLT+RFGKEFTARAVELRNNCGVNHLLMQKL+TR PNLESRM +LK IASE+GI LQLDE PL NE +   N
Subjt:  EELKEAVSGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASEDGIALQLDEPPLPNEEKLGGN

Query:  RGQNQQEPHAEVGENLQFSSDVPSGSIQGRQKYKDVADAAQAAFESAAQAAAAARAAMELSRSESQDPDDPSTP-----PNPHDKQEQKKSKVESKPKRE
          QNQ EPH  VGENL+FS++VPSGS   +QKYKDVADAAQAAFESAAQAAAAARAAMELSRSESQDPD PS+P        + KQEQKKS+VE+KPK E
Subjt:  RGQNQQEPHAEVGENLQFSSDVPSGSIQGRQKYKDVADAAQAAFESAAQAAAAARAAMELSRSESQDPDDPSTP-----PNPHDKQEQKKSKVESKPKRE

Query:  TEYQNRREEEEEEEGCKI--TAEVKNSMPASSSSSSRELSDEENSQMEEQRPSNLQTGLDMETKPEHTEVNQKQSFRLNLEKKPMSVRTRRVRGY
         EY    E  +EEEG KI   AEVKNSM     SS+REL  +EN +M+EQR +N++ G +ME   E TE+ +K SFRLNLEKKP+SVRTRRVRGY
Subjt:  TEYQNRREEEEEEEGCKI--TAEVKNSMPASSSSSSRELSDEENSQMEEQRPSNLQTGLDMETKPEHTEVNQKQSFRLNLEKKPMSVRTRRVRGY

XP_022139373.1 uncharacterized protein LOC111010325 [Momordica charantia]3.5e-15682.28Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECP
        MGRKLDALLGRNFRASKFRPLLNLA SRLA+LTNQRHVRRSQARSD LQLLQLGHHHRALLRVEQVIKEQN LDAYVLIEGYLNLLIER  LLEQERECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECP

Query:  EELKEAVSGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASEDGIALQLDEPPLPNEEKLGGN
        EELKEAVSGL+FAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGV+H +MQKL+TRQPNLESRM+VL+AIASE+ I LQLDE PL NEEK   N
Subjt:  EELKEAVSGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASEDGIALQLDEPPLPNEEKLGGN

Query:  RGQN-QQEPHAEVGENLQFSSDVPSGSIQGRQKYKDVADAAQAAFESAAQAAAAARAAMELSRSESQDPDDPSTPP----NPHDKQEQKKSKVESKPKRE
        R QN QQEP AEVG+NLQFS+DVPSGS+Q +QKYKDVADAAQAAFESAAQAAAAARAAMELSRS+SQDPDDPSTP     + + +QEQKKS+VESKPK+E
Subjt:  RGQN-QQEPHAEVGENLQFSSDVPSGSIQGRQKYKDVADAAQAAFESAAQAAAAARAAMELSRSESQDPDDPSTPP----NPHDKQEQKKSKVESKPKRE

Query:  TEYQNRREEEEEEEGCKITAEVKNSMPASSSSSSRELSDEENSQMEEQRPSNLQTGLDM--ETKPEHTEVNQKQSFRLNLEKKPMSVRTRRVRGY
         EYQNRR  EEEE G KITAEVKNSMPAS S S       E+S+MEEQR  N++TGLDM  ETKPE TE  QK SF LNLEKKPMSVRTRRVRG+
Subjt:  TEYQNRREEEEEEEGCKITAEVKNSMPASSSSSSRELSDEENSQMEEQRPSNLQTGLDM--ETKPEHTEVNQKQSFRLNLEKKPMSVRTRRVRGY

XP_022921602.1 uncharacterized protein LOC111429814 [Cucurbita moschata]4.8e-13774.94Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECP
        MGR LDALLGRNFRASKFRPLLNLA SRLAILTNQR +RRSQA+SDVLQLLQL H  RALLRVEQVIK+QN LDAYVLIEGYLNLLIER +LLEQ+R+CP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECP

Query:  EELKEAVSGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASEDGIALQLDEPPLPNEEKLGGN
        EELKEA++GLLFAASRCGDFPEL EIKSVLT+RFGKEFTARAVELRNNCGVNHLLMQKL+TR PNLESRM +LK IASE+GI LQLDE PL NE +   N
Subjt:  EELKEAVSGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASEDGIALQLDEPPLPNEEKLGGN

Query:  RGQNQQEPHAEVGENLQFSSDVPSGSIQGRQKYKDVADAAQAAFESAAQAAAAARAAMELSRSESQDPDDPSTP-----PNPHDKQEQKKSKVESKPKRE
          QNQ EPH  VGENL+FS++VPSGS   +QKYKDVADAAQAAFESAAQAAAAARAAMELSRSESQDPD PS+P        + KQEQKKS+VE+KPK E
Subjt:  RGQNQQEPHAEVGENLQFSSDVPSGSIQGRQKYKDVADAAQAAFESAAQAAAAARAAMELSRSESQDPDDPSTP-----PNPHDKQEQKKSKVESKPKRE

Query:  TEYQNRREEEEEEEGCKI--TAEVKNSMPASSSSSSRELSDEENSQMEEQRPSNLQTGLDMETKPEHTEVNQKQSFRLNLEKKPMSVRTRRVRGY
         EY    E  +EEEG KI   AEVKNSM     SS+REL  +EN +M+EQR +N++ G +ME   E TE+ +K SFRLNLEKKP+SVRTRRVRGY
Subjt:  TEYQNRREEEEEEEGCKI--TAEVKNSMPASSSSSSRELSDEENSQMEEQRPSNLQTGLDMETKPEHTEVNQKQSFRLNLEKKPMSVRTRRVRGY

XP_022971762.1 uncharacterized protein LOC111470447 [Cucurbita maxima]6.3e-13774.94Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECP
        MGR LDALLGRNFRASKFRPLLNLA SRLAILTNQR +RR+QA+SDVLQLLQL H  RALLRVEQVIK+QN LDAYVL+EGYLNLLIER +LLEQERECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECP

Query:  EELKEAVSGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASEDGIALQLDEPPLPNEEKLGGN
        EELKEA++GLLFAASRCGDFPEL EIKSVLTT FGKEFTARAVELRNNCGVNHLLMQKL+TRQPNLESRM +LK IASE+GI LQLDE PL NE +   N
Subjt:  EELKEAVSGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASEDGIALQLDEPPLPNEEKLGGN

Query:  RGQNQQEPHAEVGENLQFSSDVPSGSIQGRQKYKDVADAAQAAFESAAQAAAAARAAMELSRSESQDPDDPSTP-----PNPHDKQEQKKSKVESKPKRE
          QNQ EPH  VGENL+ S++VPSGS   +QKYKDVADAAQAAFESAAQAAAAARAAMELSRSESQDPD PS+P      + +DKQEQKKS VE+KPK E
Subjt:  RGQNQQEPHAEVGENLQFSSDVPSGSIQGRQKYKDVADAAQAAFESAAQAAAAARAAMELSRSESQDPDDPSTP-----PNPHDKQEQKKSKVESKPKRE

Query:  TEYQNRREEEEEEEGCKI--TAEVKNSMPASSSSSSRELSDEENSQMEEQRPSNLQTGLDMETKPEHTEVNQKQSFRLNLEKKPMSVRTRRVRGY
         EY+  R    EEEG KI    EVKNSM     SS+REL  +EN +M+EQR SN++ G +ME   E T + +K SFRLNLEKKP+SVRTRRVRGY
Subjt:  TEYQNRREEEEEEEGCKI--TAEVKNSMPASSSSSSRELSDEENSQMEEQRPSNLQTGLDMETKPEHTEVNQKQSFRLNLEKKPMSVRTRRVRGY

XP_023538782.1 uncharacterized protein LOC111799606 [Cucurbita pepo subsp. pepo]2.8e-13775.06Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECP
        MGR LDALLGRNFRASKFRPLLNLA SRLAILTNQR +RRSQA+SDVLQLLQL H  RALLRVEQVIK+QN LDAYVLIEGYLNLLIER +LLEQERECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECP

Query:  EELKEAVSGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASEDGIALQLDEPPLPNEEKLGGN
        EELKEA++GLLFAASRCGDFPEL EIKSVLT+RFGKEFTARAVELRNNCGVNHLLMQKL+TRQPNLESRM +LK IASE+GI LQLDE PL NE +   N
Subjt:  EELKEAVSGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASEDGIALQLDEPPLPNEEKLGGN

Query:  RGQNQQEPHAEVGENLQFSSDVPSGSIQGRQKYKDVADAAQAAFESAAQAAAAARAAMELSRSESQDPDDPSTP-----PNPHDKQEQKKSKVESKPKRE
          QNQ EPH  VGENL+ S++VPSGS   +QKYKDVADAAQAAFESAAQAAAAARAAMELSR ESQDPD PS+P        +DKQEQKKS VE+KPK E
Subjt:  RGQNQQEPHAEVGENLQFSSDVPSGSIQGRQKYKDVADAAQAAFESAAQAAAAARAAMELSRSESQDPDDPSTP-----PNPHDKQEQKKSKVESKPKRE

Query:  TEYQNRREEEEEEEGCKITAEVKNSMPASSSSSSRELSDEENSQMEEQRPSNLQTGLDMETKPEHTEVNQKQSFRLNLEKKPMSVRTRRVRGY
         EY+  REEE  +   +  AEVKNSM     SS+REL  +EN +M+EQR SN++ G +ME   E TE+ +K SFRLNLEKKP+SVRTRRVRG+
Subjt:  TEYQNRREEEEEEEGCKITAEVKNSMPASSSSSSRELSDEENSQMEEQRPSNLQTGLDMETKPEHTEVNQKQSFRLNLEKKPMSVRTRRVRGY

TrEMBL top hitse value%identityAlignment
A0A0A0KAU0 Uncharacterized protein3.7e-11165.98Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECP
        MGRKLDALLGRNFRASKFRPLLNL+ SRL+ILT QR V  SQA SDVLQLLQL HHHRALLRVE+VIK+QN LDAYVLIEGYLNLL+ER TLLEQ+ ECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECP

Query:  EELKEAVSGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASEDGIALQLDEPPLPNEEKLGGN
        EELKEAV+GLLFAASRCGDFPEL EIKSVLTTRFGKEFTARAVELRNNCGVN  LMQKL+TRQP LE+RM  LK+IASE+GI LQ+D+ P   +EK+G N
Subjt:  EELKEAVSGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASEDGIALQLDEPPLPNEEKLGGN

Query:  RGQNQQEPHAEVGENLQFSSDVPSGSIQGRQKYKDVADAAQAAFESAAQAAAAARAAMELSRSES--QDPDDPSTPPNPHDKQEQKKSKVESKPKRE-TE
          Q++ E     G++ +FS++V SGS   +  YKDVADAAQAAFESAAQAAAAARAAMELSRS      P  P +     +KQ+++K +VESK K+E  E
Subjt:  RGQNQQEPHAEVGENLQFSSDVPSGSIQGRQKYKDVADAAQAAFESAAQAAAAARAAMELSRSES--QDPDDPSTPPNPHDKQEQKKSKVESKPKRE-TE

Query:  YQNRREEEEEEEGCKITAEVKNSMPASSSSSSRELSDEENSQMEEQRPSNLQTGLDMETKPEHTEVNQKQSFRLNLEKKPMSVRTRRVRGY
        Y N R+ E E E                        +EE   M+E+R SN   GL METK E TEV++K SFRLNLEKKP+SVRTRRV GY
Subjt:  YQNRREEEEEEEGCKITAEVKNSMPASSSSSSRELSDEENSQMEEQRPSNLQTGLDMETKPEHTEVNQKQSFRLNLEKKPMSVRTRRVRGY

A0A6J1CFG6 uncharacterized protein LOC1110103251.7e-15682.28Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECP
        MGRKLDALLGRNFRASKFRPLLNLA SRLA+LTNQRHVRRSQARSD LQLLQLGHHHRALLRVEQVIKEQN LDAYVLIEGYLNLLIER  LLEQERECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECP

Query:  EELKEAVSGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASEDGIALQLDEPPLPNEEKLGGN
        EELKEAVSGL+FAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGV+H +MQKL+TRQPNLESRM+VL+AIASE+ I LQLDE PL NEEK   N
Subjt:  EELKEAVSGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASEDGIALQLDEPPLPNEEKLGGN

Query:  RGQN-QQEPHAEVGENLQFSSDVPSGSIQGRQKYKDVADAAQAAFESAAQAAAAARAAMELSRSESQDPDDPSTPP----NPHDKQEQKKSKVESKPKRE
        R QN QQEP AEVG+NLQFS+DVPSGS+Q +QKYKDVADAAQAAFESAAQAAAAARAAMELSRS+SQDPDDPSTP     + + +QEQKKS+VESKPK+E
Subjt:  RGQN-QQEPHAEVGENLQFSSDVPSGSIQGRQKYKDVADAAQAAFESAAQAAAAARAAMELSRSESQDPDDPSTPP----NPHDKQEQKKSKVESKPKRE

Query:  TEYQNRREEEEEEEGCKITAEVKNSMPASSSSSSRELSDEENSQMEEQRPSNLQTGLDM--ETKPEHTEVNQKQSFRLNLEKKPMSVRTRRVRGY
         EYQNRR  EEEE G KITAEVKNSMPAS S S       E+S+MEEQR  N++TGLDM  ETKPE TE  QK SF LNLEKKPMSVRTRRVRG+
Subjt:  TEYQNRREEEEEEEGCKITAEVKNSMPASSSSSSRELSDEENSQMEEQRPSNLQTGLDM--ETKPEHTEVNQKQSFRLNLEKKPMSVRTRRVRGY

A0A6J1E0Z0 uncharacterized protein LOC1114298142.3e-13774.94Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECP
        MGR LDALLGRNFRASKFRPLLNLA SRLAILTNQR +RRSQA+SDVLQLLQL H  RALLRVEQVIK+QN LDAYVLIEGYLNLLIER +LLEQ+R+CP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECP

Query:  EELKEAVSGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASEDGIALQLDEPPLPNEEKLGGN
        EELKEA++GLLFAASRCGDFPEL EIKSVLT+RFGKEFTARAVELRNNCGVNHLLMQKL+TR PNLESRM +LK IASE+GI LQLDE PL NE +   N
Subjt:  EELKEAVSGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASEDGIALQLDEPPLPNEEKLGGN

Query:  RGQNQQEPHAEVGENLQFSSDVPSGSIQGRQKYKDVADAAQAAFESAAQAAAAARAAMELSRSESQDPDDPSTP-----PNPHDKQEQKKSKVESKPKRE
          QNQ EPH  VGENL+FS++VPSGS   +QKYKDVADAAQAAFESAAQAAAAARAAMELSRSESQDPD PS+P        + KQEQKKS+VE+KPK E
Subjt:  RGQNQQEPHAEVGENLQFSSDVPSGSIQGRQKYKDVADAAQAAFESAAQAAAAARAAMELSRSESQDPDDPSTP-----PNPHDKQEQKKSKVESKPKRE

Query:  TEYQNRREEEEEEEGCKI--TAEVKNSMPASSSSSSRELSDEENSQMEEQRPSNLQTGLDMETKPEHTEVNQKQSFRLNLEKKPMSVRTRRVRGY
         EY    E  +EEEG KI   AEVKNSM     SS+REL  +EN +M+EQR +N++ G +ME   E TE+ +K SFRLNLEKKP+SVRTRRVRGY
Subjt:  TEYQNRREEEEEEEGCKI--TAEVKNSMPASSSSSSRELSDEENSQMEEQRPSNLQTGLDMETKPEHTEVNQKQSFRLNLEKKPMSVRTRRVRGY

A0A6J1E4E5 uncharacterized protein LOC1114298233.6e-13072.41Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECP
        MGR LDALLGRNFRASKFRPLLNL  SRLAILTNQR +RRSQA+SDVLQLLQL H  RALLRVEQVIK+QN LDAYVLIEGYLNLLIER +LLEQERECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECP

Query:  EELKEAVSGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASEDGIALQLDEPPLPNEEKLGGN
        EELKEA++GLLFAASRCGDFPEL EIKS LT+RFGKEFTARAVELRNNCGVNHLLMQKL+TR PNLESRM +LK IASE+GI LQLDE PL NE +   N
Subjt:  EELKEAVSGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASEDGIALQLDEPPLPNEEKLGGN

Query:  RGQNQQEPHAEVGENLQFSSDVPSGSIQGRQKYKDVADAAQAAFESAAQAAAAARAAMELSRSESQDPDDPSTP-----PNPHDKQEQKKSKVESKPKRE
          QNQ E    VGENL+FS++VPSGS   +QKYKDVADAAQAAFESAAQAAAAARAAMELSR ESQDPD PS+P        + KQ QK+S VE+KPK E
Subjt:  RGQNQQEPHAEVGENLQFSSDVPSGSIQGRQKYKDVADAAQAAFESAAQAAAAARAAMELSRSESQDPDDPSTP-----PNPHDKQEQKKSKVESKPKRE

Query:  TEYQNRREEEEEEEGCKIT--AEVKNSMPASSSSSSRELSDEENSQMEEQRPSNLQTGLDMETKPEHTEVNQKQSFRLNLEKKPMSVRTRRVRGY
         EY+  R    EEEG K+T   EVKNSM     SS+REL  +EN +M+EQR +N++ G +ME   E  E+ +K SF LNLEKKP+ VRT RVRGY
Subjt:  TEYQNRREEEEEEEGCKIT--AEVKNSMPASSSSSSRELSDEENSQMEEQRPSNLQTGLDMETKPEHTEVNQKQSFRLNLEKKPMSVRTRRVRGY

A0A6J1I2V3 uncharacterized protein LOC1114704473.0e-13774.94Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECP
        MGR LDALLGRNFRASKFRPLLNLA SRLAILTNQR +RR+QA+SDVLQLLQL H  RALLRVEQVIK+QN LDAYVL+EGYLNLLIER +LLEQERECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECP

Query:  EELKEAVSGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASEDGIALQLDEPPLPNEEKLGGN
        EELKEA++GLLFAASRCGDFPEL EIKSVLTT FGKEFTARAVELRNNCGVNHLLMQKL+TRQPNLESRM +LK IASE+GI LQLDE PL NE +   N
Subjt:  EELKEAVSGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASEDGIALQLDEPPLPNEEKLGGN

Query:  RGQNQQEPHAEVGENLQFSSDVPSGSIQGRQKYKDVADAAQAAFESAAQAAAAARAAMELSRSESQDPDDPSTP-----PNPHDKQEQKKSKVESKPKRE
          QNQ EPH  VGENL+ S++VPSGS   +QKYKDVADAAQAAFESAAQAAAAARAAMELSRSESQDPD PS+P      + +DKQEQKKS VE+KPK E
Subjt:  RGQNQQEPHAEVGENLQFSSDVPSGSIQGRQKYKDVADAAQAAFESAAQAAAAARAAMELSRSESQDPDDPSTP-----PNPHDKQEQKKSKVESKPKRE

Query:  TEYQNRREEEEEEEGCKI--TAEVKNSMPASSSSSSRELSDEENSQMEEQRPSNLQTGLDMETKPEHTEVNQKQSFRLNLEKKPMSVRTRRVRGY
         EY+  R    EEEG KI    EVKNSM     SS+REL  +EN +M+EQR SN++ G +ME   E T + +K SFRLNLEKKP+SVRTRRVRGY
Subjt:  TEYQNRREEEEEEEGCKI--TAEVKNSMPASSSSSSRELSDEENSQMEEQRPSNLQTGLDMETKPEHTEVNQKQSFRLNLEKKPMSVRTRRVRGY

SwissProt top hitse value%identityAlignment
P53990 IST1 homolog1.0e-1231.65Show/hide
Query:  LLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECPEELKEAV
        +LG  F+A + R  L L  +RL +L  ++     +AR ++   L  G   RA +RVE +I+E  L++A  ++E Y +LL+ R  L++  +E    L E+V
Subjt:  LLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECPEELKEAV

Query:  SGLLFAASRC-GDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQP
        S L++AA R   +  EL+ +   L  ++ KE+  +         VN  LM KL+   P
Subjt:  SGLLFAASRC-GDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQP

Q3ZBV1 IST1 homolog5.0e-1231.01Show/hide
Query:  LLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECPEELKEAV
        +LG   +A + R  L L  +RL +L  ++     +AR ++   L  G   RA +RVE +I+E  L++A  ++E Y +LL+ R  L++  +E    L E+V
Subjt:  LLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECPEELKEAV

Query:  SGLLFAASRC-GDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQP
        S L++AA R   +  EL+ +   L  ++ KE+  +         VN  LM KL+   P
Subjt:  SGLLFAASRC-GDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQP

Q568Z6 IST1 homolog1.0e-1231.65Show/hide
Query:  LLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECPEELKEAV
        +LG  F+A + R  L L  +RL +L  ++     +AR ++   L  G   RA +RVE +I+E  L++A  ++E Y +LL+ R  L++  +E    L E+V
Subjt:  LLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECPEELKEAV

Query:  SGLLFAASRC-GDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQP
        S L++AA R   +  EL+ +   L  ++ KE+  +         VN  LM KL+   P
Subjt:  SGLLFAASRC-GDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQP

Q5R6G8 IST1 homolog6.5e-1231.01Show/hide
Query:  LLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECPEELKEAV
        +LG  F+A + R  L L  +RL +L  ++     +AR ++   L  G   RA +RVE +I+E  L++A  ++E Y +LL+ R  L++  +E    L E+V
Subjt:  LLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECPEELKEAV

Query:  SGLLFAASRC-GDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQP
        S L++AA R   +  EL+ +   L  ++ K +  +         VN  LM KL+   P
Subjt:  SGLLFAASRC-GDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQP

Q9CX00 IST1 homolog1.0e-1231.65Show/hide
Query:  LLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECPEELKEAV
        +LG  F+A + R  L L  +RL +L  ++     +AR ++   L  G   RA +RVE +I+E  L++A  ++E Y +LL+ R  L++  +E    L E+V
Subjt:  LLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECPEELKEAV

Query:  SGLLFAASRC-GDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQP
        S L++AA R   +  EL+ +   L  ++ KE+  +         VN  LM KL+   P
Subjt:  SGLLFAASRC-GDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQP

Arabidopsis top hitse value%identityAlignment
AT1G13340.1 Regulator of Vps4 activity in the MVB pathway protein1.1e-6240.95Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECP
        MG+KLDALLGR+F+ +KF+ L+ LA +RL+IL NQR  R SQA SDV +LL+LG H  A  RV+QV+K+QN LD    I GY  L ++RI L E  R+CP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECP

Query:  EELKEAVSGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASEDGIALQLDEPPLPNEEKLGGN
        EEL EAVSGLLFAASR G+FPELQEI++VL +RFGK+  AR++ELR+NCGV+  ++QKL+TR P  E RM  LK IA+E+ I L+LD+     E      
Subjt:  EELKEAVSGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASEDGIALQLDEPPLPNEEKLGGN

Query:  RGQNQQEPHAEVGENLQFSSD-------VPSGSIQGRQKYKDVADAAQAAFESAAQAAAAARAAMELSRSESQDPDDPST---------PPNPHDKQEQK
         G    +  ++V +    S D       +     +G++KYKDVADAAQAAFESAA AA AA+AA+ELS+   +  D P             N   +QEQ+
Subjt:  RGQNQQEPHAEVGENLQFSSD-------VPSGSIQGRQKYKDVADAAQAAFESAAQAAAAARAAMELSRSESQDPDDPST---------PPNPHDKQEQK

Query:  KSKVESKPKRETEYQNRREEEEEEEGCKITAEVKNSMPASSSSSSRELSDEENSQMEEQRPSNLQTGLDMETKPEHTEVNQKQSFRLNLE----------
         +   S+ + +   +++R   + E+   +        P         + D E    EE +PS     +      ++  V   ++   +++          
Subjt:  KSKVESKPKRETEYQNRREEEEEEEGCKITAEVKNSMPASSSSSSRELSDEENSQMEEQRPSNLQTGLDMETKPEHTEVNQKQSFRLNLE----------

Query:  ------KKPMSVRTRRVRGY
              K P+SVRTR+VRGY
Subjt:  ------KKPMSVRTRRVRGY

AT1G25420.1 Regulator of Vps4 activity in the MVB pathway protein9.9e-3233.11Show/hide
Query:  LDALLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECPEELK
        L+ L  R    +K +  LNLA +R+ +L N+R ++    + ++   LQ G    A +RVE VI+E NL  AY ++E +   ++ R+ +LE E+ECP EL+
Subjt:  LDALLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECPEELK

Query:  EAVSGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASE-----DGIALQLDEPPLPNEEKLGG
        EA++ ++FAA RC + P+L +IK++  T++GKEF   A ELR + GVN  +++KL+   P+  +R+ +LK IA E     D  A +  E    +E+ LGG
Subjt:  EAVSGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASE-----DGIALQLDEPPLPNEEKLGG

Query:  NR------GQNQQEPH-----------------AEVGENLQ-------FSSDVPSGSIQG--------RQKYKDVADAAQAAFESAAQAAAAARAAMEL
         +      G ++  P                  AE  +  Q        S  +PS  +          R+   DV + A+AA  SA +A AAARAA +L
Subjt:  NR------GQNQQEPH-----------------AEVGENLQ-------FSSDVPSGSIQG--------RQKYKDVADAAQAAFESAAQAAAAARAAMEL

AT1G34220.2 Regulator of Vps4 activity in the MVB pathway protein2.0e-3236.79Show/hide
Query:  LDALLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECPEELK
        LD+   + F+A+K + LL L   R+ ++ N+R  +  Q R ++ +LL+ G    A +RVE +I+E+ ++ A  ++E +  L+  R+ ++E +RECP +LK
Subjt:  LDALLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECPEELK

Query:  EAVSGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASEDGIALQLDEPPLPNEEKL
        EA+S + FAA RC D  ELQ+++ +  +++GKEF A A EL+ + GVN  L++ L+ R P+ E+++ +LK IA E     +LD  P   E  L
Subjt:  EAVSGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASEDGIALQLDEPPLPNEEKL

AT2G19710.1 Regulator of Vps4 activity in the MVB pathway protein1.2e-2427.62Show/hide
Query:  LLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECPEELKEAV
        +L R F+ +K +  L +A SRL IL N++ ++  Q R ++ QLL+ G    A +RVE V++E+  + AY LI  Y  LL+ R+ ++E ++ CP +LKEAV
Subjt:  LLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECPEELKEAV

Query:  SGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASEDGIALQ----LDEPPLPNEEKLGGNRGQ
        + +LFA+ R  D PEL EI    TT++GK+F+  AVELR + GV+ LL++KL+ + P+  +++ +L AIA E  +  +    ++  P   E   G N  Q
Subjt:  SGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASEDGIALQ----LDEPPLPNEEKLGGNRGQ

Query:  -------------NQQEP-------------------------HAEVGENLQFSSDVPSG--------------------------------SIQGRQKY
                     N+++P                         +A  G +   S++V SG                                S + +QK+
Subjt:  -------------NQQEP-------------------------HAEVGENLQFSSDVPSG--------------------------------SIQGRQKY

Query:  K----DVADAAQAAFESAAQAAAAARAAMELSRSESQDPDD----------------PSTPPNPHDKQEQKKSKVESKPKRETEYQNRREEEEEEEGCKI
        +    D  DAA+AA E+A +A+ AARAA ELS  E     D                PS   +  + Q +  S+    P+R      R + E+ +   + 
Subjt:  K----DVADAAQAAFESAAQAAAAARAAMELSRSESQDPDD----------------PSTPPNPHDKQEQKKSKVESKPKRETEYQNRREEEEEEEGCKI

Query:  TAEVKNSMPASSSSSSRELSDEENSQMEEQRPSNLQTGLDMETKPEHTEVNQKQSFRLNLEKKPMSV
          +    +P     S R   D  NS+      +N   G + +   + T++N   S  ++L K+   V
Subjt:  TAEVKNSMPASSSSSSRELSDEENSQMEEQRPSNLQTGLDMETKPEHTEVNQKQSFRLNLEKKPMSV

AT4G35730.1 Regulator of Vps4 activity in the MVB pathway protein5.5e-3030.24Show/hide
Query:  ALLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECPEELKEA
        +L  R F +SK +    +A +R+ ++ N+R V   Q R D+  LLQ G    A +RVE VI+EQN+  A  +IE +  L++ R+T++ ++++CP +LKE 
Subjt:  ALLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQLGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECPEELKEA

Query:  VSGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASE---DGIALQLDEPPL-PNEEKLGGNR-
        ++ L+FAA RC + PEL +++ +   ++GK+F + A +LR +CGVN +L+ KL+ R P  E ++ ++K IA E   D    + ++  L P EE + G R 
Subjt:  VSGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTRQPNLESRMHVLKAIASE---DGIALQLDEPPL-PNEEKLGGNR-

Query:  ---GQNQQEPHAEVGENLQFSSDVP--SGSIQGRQKYKDVADAAQAAFESAAQAAAAARAAMEL------SRSESQDPDDPST--------------PPN
             +     A + E +  +  VP  + S+     Y D   AA+AA E A QA AAA+ A  L      S  E     D ST              P +
Subjt:  ---GQNQQEPHAEVGENLQFSSDVP--SGSIQGRQKYKDVADAAQAAFESAAQAAAAARAAMEL------SRSESQDPDDPST--------------PPN

Query:  PHDKQEQKKSKVESKPKRETEYQNRR---------EEEEEEEGCKITAEVKNSMPASSSSSSRELS---------DEENSQMEEQRPSNLQTGLDMETKP
            ++ + S   +KP  E     RR         E + EEE     AE K +M    S + R +          DE +   EE  P     G      P
Subjt:  PHDKQEQKKSKVESKPKRETEYQNRR---------EEEEEEEGCKITAEVKNSMPASSSSSSRELS---------DEENSQMEEQRPSNLQTGLDMETKP

Query:  EHTEVNQKQS
                QS
Subjt:  EHTEVNQKQS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCGGCGAGAAGCGTGTTGTCGATCCATTTCATACAAATTCTTACGGCGAAGTGAGACTTCGTTCCGAAGCAACCCCACCCACGCGCTTCAACACACATATATAAC
CAACTTCTTCTCTCTCTCTCTCTGCAACTTCTTCTTGAACCCATTTCGATTCTGCAGCCTCGCCATGGGAAGGAAGCTCGACGCACTTCTCGGAAGGAATTTCAGGGCCT
CCAAGTTCCGCCCGCTTCTCAATCTCGCTTTCTCTCGCCTCGCTATTCTCACCAACCAGCGCCATGTCAGGCGCTCTCAAGCTCGCTCCGACGTCCTCCAACTCCTCCAG
CTCGGCCACCACCACCGAGCTCTCCTTCGAGTCGAGCAAGTAATTAAGGAGCAGAATCTCTTGGATGCCTATGTTCTGATTGAAGGCTATCTCAATCTCTTGATCGAAAG
GATCACCCTCCTCGAACAAGAAAGAGAATGCCCTGAAGAATTGAAAGAGGCAGTATCAGGGTTGCTATTTGCAGCTTCAAGATGTGGGGATTTCCCAGAGCTTCAAGAGA
TCAAATCGGTTTTGACCACTCGTTTCGGCAAAGAGTTCACTGCTCGCGCTGTTGAATTACGCAACAACTGTGGAGTCAATCATTTGTTAATGCAGAAACTGACCACAAGG
CAGCCAAATTTGGAGAGTAGAATGCATGTGCTGAAAGCCATTGCTTCTGAAGACGGCATTGCTTTGCAACTCGACGAACCTCCTCTTCCCAATGAGGAAAAACTGGGCGG
AAACAGAGGGCAGAACCAGCAGGAGCCACATGCCGAAGTGGGAGAGAATTTGCAATTCTCTTCTGATGTCCCATCTGGTTCTATTCAAGGTAGACAAAAGTACAAAGATG
TGGCGGATGCAGCCCAAGCCGCATTCGAATCAGCAGCTCAAGCCGCAGCTGCTGCCAGAGCTGCCATGGAGCTCTCCCGCTCTGAATCACAAGACCCTGATGATCCAAGC
ACCCCACCGAACCCACATGACAAACAAGAGCAAAAGAAATCCAAGGTAGAATCTAAACCGAAACGGGAAACCGAATATCAAAATCGAAGAGAAGAAGAAGAAGAAGAAGA
AGGCTGCAAAATCACAGCAGAAGTGAAGAACTCAATGCCTGCTTCTTCTTCTTCTTCAAGTAGAGAGTTATCAGATGAAGAAAACAGTCAAATGGAGGAGCAGAGGCCTT
CAAATCTTCAAACTGGCTTGGACATGGAAACAAAGCCAGAACACACAGAGGTAAATCAGAAGCAAAGCTTTCGCTTAAACCTGGAGAAGAAGCCAATGTCGGTGAGAACA
AGAAGGGTGCGAGGATACTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGCGGCGAGAAGCGTGTTGTCGATCCATTTCATACAAATTCTTACGGCGAAGTGAGACTTCGTTCCGAAGCAACCCCACCCACGCGCTTCAACACACATATATAAC
CAACTTCTTCTCTCTCTCTCTCTGCAACTTCTTCTTGAACCCATTTCGATTCTGCAGCCTCGCCATGGGAAGGAAGCTCGACGCACTTCTCGGAAGGAATTTCAGGGCCT
CCAAGTTCCGCCCGCTTCTCAATCTCGCTTTCTCTCGCCTCGCTATTCTCACCAACCAGCGCCATGTCAGGCGCTCTCAAGCTCGCTCCGACGTCCTCCAACTCCTCCAG
CTCGGCCACCACCACCGAGCTCTCCTTCGAGTCGAGCAAGTAATTAAGGAGCAGAATCTCTTGGATGCCTATGTTCTGATTGAAGGCTATCTCAATCTCTTGATCGAAAG
GATCACCCTCCTCGAACAAGAAAGAGAATGCCCTGAAGAATTGAAAGAGGCAGTATCAGGGTTGCTATTTGCAGCTTCAAGATGTGGGGATTTCCCAGAGCTTCAAGAGA
TCAAATCGGTTTTGACCACTCGTTTCGGCAAAGAGTTCACTGCTCGCGCTGTTGAATTACGCAACAACTGTGGAGTCAATCATTTGTTAATGCAGAAACTGACCACAAGG
CAGCCAAATTTGGAGAGTAGAATGCATGTGCTGAAAGCCATTGCTTCTGAAGACGGCATTGCTTTGCAACTCGACGAACCTCCTCTTCCCAATGAGGAAAAACTGGGCGG
AAACAGAGGGCAGAACCAGCAGGAGCCACATGCCGAAGTGGGAGAGAATTTGCAATTCTCTTCTGATGTCCCATCTGGTTCTATTCAAGGTAGACAAAAGTACAAAGATG
TGGCGGATGCAGCCCAAGCCGCATTCGAATCAGCAGCTCAAGCCGCAGCTGCTGCCAGAGCTGCCATGGAGCTCTCCCGCTCTGAATCACAAGACCCTGATGATCCAAGC
ACCCCACCGAACCCACATGACAAACAAGAGCAAAAGAAATCCAAGGTAGAATCTAAACCGAAACGGGAAACCGAATATCAAAATCGAAGAGAAGAAGAAGAAGAAGAAGA
AGGCTGCAAAATCACAGCAGAAGTGAAGAACTCAATGCCTGCTTCTTCTTCTTCTTCAAGTAGAGAGTTATCAGATGAAGAAAACAGTCAAATGGAGGAGCAGAGGCCTT
CAAATCTTCAAACTGGCTTGGACATGGAAACAAAGCCAGAACACACAGAGGTAAATCAGAAGCAAAGCTTTCGCTTAAACCTGGAGAAGAAGCCAATGTCGGTGAGAACA
AGAAGGGTGCGAGGATACTGA
Protein sequenceShow/hide protein sequence
MKRREACCRSISYKFLRRSETSFRSNPTHALQHTYITNFFSLSLCNFFLNPFRFCSLAMGRKLDALLGRNFRASKFRPLLNLAFSRLAILTNQRHVRRSQARSDVLQLLQ
LGHHHRALLRVEQVIKEQNLLDAYVLIEGYLNLLIERITLLEQERECPEELKEAVSGLLFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHLLMQKLTTR
QPNLESRMHVLKAIASEDGIALQLDEPPLPNEEKLGGNRGQNQQEPHAEVGENLQFSSDVPSGSIQGRQKYKDVADAAQAAFESAAQAAAAARAAMELSRSESQDPDDPS
TPPNPHDKQEQKKSKVESKPKRETEYQNRREEEEEEEGCKITAEVKNSMPASSSSSSRELSDEENSQMEEQRPSNLQTGLDMETKPEHTEVNQKQSFRLNLEKKPMSVRT
RRVRGY