CuGenDBv2

MC04g0900 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC04g0900
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Regulator of Vps4 activity in the MVB pathway protein
Genome location: MC04:16337673..16339906
RNA-Seq Expression: MC04g0900
Synteny: MC04g0900
Gene Ontology terms: GO:0015031 - protein transport (biological process)
InterPro domains: IPR005061 - Vacuolar protein sorting-associated protein Ist1
                  IPR042277 - Vacuolar protein sorting-associated protein IST1-like


Homology
GenBank top hits (e value, % identity, alignment)
KAG6601794.1 IST1-like protein, partial [Cucurbita argyrosperma subsp. sororia] | e value: 5.37e-174 | % identity: 75
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP
        MGR LDALLGRNFRASKFRPLLNLA+SRLA+LTNQR +RRSQA+SD LQLLQL H  RALLRVEQVIK+QNALDAYVLIEGYLNLLIER  LLEQERECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP

Query:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN
        EELKEA++GL+FAASRCGDFPEL EIKSVLT+RFGKEFTARAVELRNNCGV+H +MQKLSTR PNLESRM++L+ IASEN I LQLDE PLSNE   ARN
Subjt:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN

Query:  RRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSTN-KRQEQKKSEVESKPKQ
         RQN Q EP   VG+NL+FS +VPSGS   KQKYKDVADAAQAAFESAAQAAAAARAAMELSRS+SQDPD PS+P PGS +T+  +QEQKKSEVE+KPK 
Subjt:  RRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSTN-KRQEQKKSEVESKPKQ

Query:  EIEYQNRREEEEAGSKITAEVKNSMPASFSPS----REDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF
        EIEY+  +EEE +  +  AEVKNSM    S      +E+ EM+EQRA N+E G +ME     E TE T+KPSF LNLEKKP+SVRTRRVRG+
Subjt:  EIEYQNRREEEEAGSKITAEVKNSMPASFSPS----REDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF

XP_022139373.1 uncharacterized protein LOC111010325 [Momordica charantia] | e value: 1.18e-257 | % identity: 100
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP
        MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP

Query:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN
        EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN
Subjt:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN

Query:  RRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSTNKRQEQKKSEVESKPKQE
        RRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSTNKRQEQKKSEVESKPKQE
Subjt:  RRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSTNKRQEQKKSEVESKPKQE

Query:  IEYQNRREEEEAGSKITAEVKNSMPASFSPSREDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF
        IEYQNRREEEEAGSKITAEVKNSMPASFSPSREDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF
Subjt:  IEYQNRREEEEAGSKITAEVKNSMPASFSPSREDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF

XP_022921602.1 uncharacterized protein LOC111429814 [Cucurbita moschata] | e value: 1.32e-174 | % identity: 75
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP
        MGR LDALLGRNFRASKFRPLLNLALSRLA+LTNQR +RRSQA+SD LQLLQL H  RALLRVEQVIK+QNALDAYVLIEGYLNLLIERT LLEQ+R+CP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP

Query:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN
        EELKEA++GL+FAASRCGDFPEL EIKSVLT+RFGKEFTARAVELRNNCGV+H +MQKLSTR PNLESRM++L+ IASEN I LQLDE PLSNE   ARN
Subjt:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN

Query:  RRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSTN-KRQEQKKSEVESKPKQ
         RQN Q EP   VG+NL+FS +VPSGS   KQKYKDVADAAQAAFESAAQAAAAARAAMELSRS+SQDPD PS+P PGS +T+  +QEQKKSEVE+KPK 
Subjt:  RRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSTN-KRQEQKKSEVESKPKQ

Query:  EIEYQNRREEEEAGSKITAEVKNSMPASFSPS----REDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF
        EIEY+  +EEE +  +  AEVKNSM    S      +E+ EM+EQRA N+E G +ME     E TE T+KPSF LNLEKKP+SVRTRRVRG+
Subjt:  EIEYQNRREEEEAGSKITAEVKNSMPASFSPS----REDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF

XP_022971762.1 uncharacterized protein LOC111470447 [Cucurbita maxima] | e value: 1.18e-170 | % identity: 74.62
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP
        MGR LDALLGRNFRASKFRPLLNLA+SRLA+LTNQR +RR+QA+SD LQLLQL H  RALLRVEQVIK+QNALDAYVL+EGYLNLLIERT LLEQERECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP

Query:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN
        EELKEA++GL+FAASRCGDFPEL EIKSVLTT FGKEFTARAVELRNNCGV+H +MQKLSTRQPNLESRM++L+ IASEN I LQLDE PLSNE   ARN
Subjt:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN

Query:  RRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSTN-KRQEQKKSEVESKPKQ
         RQN Q EP   VG+NL+ S +VPSGS   KQKYKDVADAAQAAFESAAQAAAAARAAMELSRS+SQDPD PS+P PGS +++  +QEQKKS VE+KPK 
Subjt:  RRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSTN-KRQEQKKSEVESKPKQ

Query:  EIEYQNRREEEEAGSKITA--EVKNSMPASFSPS----REDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF
        EIEY+  REEE  GSKI    EVKNSM    S      +E+ EM+EQRA N+E G +ME     E T  T+KPSF LNLEKKP+SVRTRRVRG+
Subjt:  EIEYQNRREEEEAGSKITA--EVKNSMPASFSPS----REDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF

XP_023538782.1 uncharacterized protein LOC111799606 [Cucurbita pepo subsp. pepo] | e value: 1.08e-173 | % identity: 75.45
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP
        MGR LDALLGRNFRASKFRPLLNLALSRLA+LTNQR +RRSQA+SD LQLLQL H  RALLRVEQVIK+QNALDAYVLIEGYLNLLIERT LLEQERECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP

Query:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN
        EELKEA++GL+FAASRCGDFPEL EIKSVLT+RFGKEFTARAVELRNNCGV+H +MQKLSTRQPNLESRM++L+ IASEN I LQLDE PLSNE   ARN
Subjt:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN

Query:  RRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSTN-KRQEQKKSEVESKPKQ
          QN Q EP   VG+NL+ S +VPSGS   KQKYKDVADAAQAAFESAAQAAAAARAAMELSR +SQDPD PS+P PGSG+T+  +QEQKKS+VE+KPK 
Subjt:  RRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSTN-KRQEQKKSEVESKPKQ

Query:  EIEYQNRREEEEAGSKITAEVKNSMPASFSPS----REDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRG
        EIEY+  REEE +  +  AEVKNSM    S      +E+ EM+EQRA N+E G +ME     E TE T+KPSF LNLEKKP+SVRTRRVRG
Subjt:  EIEYQNRREEEEAGSKITAEVKNSMPASFSPS----REDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KAU0 Uncharacterized protein | e value: 3.69e-141 | % identity: 67.44
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP
        MGRKLDALLGRNFRASKFRPLLNL+LSRL++LT QR V  SQA SD LQLLQL HHHRALLRVE+VIK+QNALDAYVLIEGYLNLL+ERT LLEQ+ ECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP

Query:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN
        EELKEAV+GL+FAASRCGDFPEL EIKSVLTTRFGKEFTARAVELRNNCGV+ S+MQKLSTRQP LE+RM+ L++IASEN IVLQ+D+ P S +EK  RN
Subjt:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN

Query:  RRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTP-KPGSGSTN-KRQEQKKSEVESKPK
         RQ+     +AE G + +FS +V SGS   K  YKDVADAAQAAFESAAQAAAAARAAMELSRS     + PS+P KPGSG+T+  +Q+++K EVESK K
Subjt:  RRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTP-KPGSGSTN-KRQEQKKSEVESKPK

Query:  QEIE-YQNRREEEEAGSKITAEVKNSMPASFSPSREDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF
        QE+E Y N R+ E  G +                 E   M+E+R  N   GL MET  K E TE ++K SF LNLEKKP+SVRTRRV G+
Subjt:  QEIE-YQNRREEEEAGSKITAEVKNSMPASFSPSREDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF

A0A6J1CFG6 uncharacterized protein LOC111010325 | e value: 5.69e-258 | % identity: 100
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP
        MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP

Query:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN
        EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN
Subjt:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN

Query:  RRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSTNKRQEQKKSEVESKPKQE
        RRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSTNKRQEQKKSEVESKPKQE
Subjt:  RRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSTNKRQEQKKSEVESKPKQE

Query:  IEYQNRREEEEAGSKITAEVKNSMPASFSPSREDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF
        IEYQNRREEEEAGSKITAEVKNSMPASFSPSREDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF
Subjt:  IEYQNRREEEEAGSKITAEVKNSMPASFSPSREDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF

A0A6J1E0Z0 uncharacterized protein LOC111429814 | e value: 6.41e-175 | % identity: 75
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP
        MGR LDALLGRNFRASKFRPLLNLALSRLA+LTNQR +RRSQA+SD LQLLQL H  RALLRVEQVIK+QNALDAYVLIEGYLNLLIERT LLEQ+R+CP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP

Query:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN
        EELKEA++GL+FAASRCGDFPEL EIKSVLT+RFGKEFTARAVELRNNCGV+H +MQKLSTR PNLESRM++L+ IASEN I LQLDE PLSNE   ARN
Subjt:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN

Query:  RRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSTN-KRQEQKKSEVESKPKQ
         RQN Q EP   VG+NL+FS +VPSGS   KQKYKDVADAAQAAFESAAQAAAAARAAMELSRS+SQDPD PS+P PGS +T+  +QEQKKSEVE+KPK 
Subjt:  RRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSTN-KRQEQKKSEVESKPKQ

Query:  EIEYQNRREEEEAGSKITAEVKNSMPASFSPS----REDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF
        EIEY+  +EEE +  +  AEVKNSM    S      +E+ EM+EQRA N+E G +ME     E TE T+KPSF LNLEKKP+SVRTRRVRG+
Subjt:  EIEYQNRREEEEAGSKITAEVKNSMPASFSPS----REDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF

A0A6J1E4E5 uncharacterized protein LOC111429823 | e value: 1.26e-167 | % identity: 73.6
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP
        MGR LDALLGRNFRASKFRPLLNL LSRLA+LTNQR +RRSQA+SD LQLLQL H  RALLRVEQVIK+QNALDAYVLIEGYLNLLIER  LLEQERECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP

Query:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN
        EELKEA++GL+FAASRCGDFPEL EIKS LT+RFGKEFTARAVELRNNCGV+H +MQKLSTR PNLESRM++L+ IASEN I LQLDE PLSNE   ARN
Subjt:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN

Query:  RRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSTN-KRQEQKKSEVESKPKQ
         RQN Q E    VG+NL+FS +VPSGS   KQKYKDVADAAQAAFESAAQAAAAARAAMELSR +SQDPD PS+P PGS +T+  +Q QK+S+VE+KPK 
Subjt:  RRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSTN-KRQEQKKSEVESKPKQ

Query:  EIEYQNRREEEEAGSKITA--EVKNSMPASFSPS----REDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF
        EIEY+  REEE  GSK+T   EVKNSM    S      +E+ EM+EQRA N+E G +ME     E  E T+KPSFGLNLEKKP+ VRT RVRG+
Subjt:  EIEYQNRREEEEAGSKITA--EVKNSMPASFSPS----REDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF

A0A6J1I2V3 uncharacterized protein LOC111470447 | e value: 5.74e-171 | % identity: 74.62
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP
        MGR LDALLGRNFRASKFRPLLNLA+SRLA+LTNQR +RR+QA+SD LQLLQL H  RALLRVEQVIK+QNALDAYVL+EGYLNLLIERT LLEQERECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP

Query:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN
        EELKEA++GL+FAASRCGDFPEL EIKSVLTT FGKEFTARAVELRNNCGV+H +MQKLSTRQPNLESRM++L+ IASEN I LQLDE PLSNE   ARN
Subjt:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN

Query:  RRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSTN-KRQEQKKSEVESKPKQ
         RQN Q EP   VG+NL+ S +VPSGS   KQKYKDVADAAQAAFESAAQAAAAARAAMELSRS+SQDPD PS+P PGS +++  +QEQKKS VE+KPK 
Subjt:  RRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSTN-KRQEQKKSEVESKPKQ

Query:  EIEYQNRREEEEAGSKITA--EVKNSMPASFSPS----REDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF
        EIEY+  REEE  GSKI    EVKNSM    S      +E+ EM+EQRA N+E G +ME     E T  T+KPSF LNLEKKP+SVRTRRVRG+
Subjt:  EIEYQNRREEEEAGSKITA--EVKNSMPASFSPS----REDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF

SwissProt top hits (e value, % identity, alignment)
P53990 IST1 homolog | e value: 5.7e-12 | % identity: 29.83
Query:  LLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEAV
        +LG  F+A + R  L L ++RL +L  ++     +AR +    L  G   RA +RVE +I+E   ++A  ++E Y +LL+ R  L++  +E    L E+V
Subjt:  LLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEAV

Query:  SGLIFAASRC-GDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLD
        S LI+AA R   +  EL+ +   L  ++ KE+  +         V+  +M KLS   P        L  IA   N+  + D
Subjt:  SGLIFAASRC-GDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLD

Q3ZBV1 IST1 homolog | e value: 2.8e-11 | % identity: 29.28
Query:  LLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEAV
        +LG   +A + R  L L ++RL +L  ++     +AR +    L  G   RA +RVE +I+E   ++A  ++E Y +LL+ R  L++  +E    L E+V
Subjt:  LLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEAV

Query:  SGLIFAASRC-GDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLD
        S LI+AA R   +  EL+ +   L  ++ KE+  +         V+  +M KLS   P        L  IA   N+  + D
Subjt:  SGLIFAASRC-GDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLD

Q54I39 IST1-like protein | e value: 9.7e-12 | % identity: 29.78
Query:  GRNFRASKFRPLLNLALSRLAVLTNQR-HVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEAVS
        G ++ + K +  L LA+SR+ +L N++ ++ R + R+ A +LL+  +   A +RVE +I+++  ++ + +IE    LL  R +L+    E P E+KE++ 
Subjt:  GRNFRASKFRPLLNLALSRLAVLTNQR-HVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEAVS

Query:  GLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNC----GVDHSIMQKLSTRQPNLESRMNVLEAIASENNI
         L++++ R    PEL++IK+ L  ++GK      +E   NC     V+  I+ KLS   P+       L  IA + N+
Subjt:  GLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNC----GVDHSIMQKLSTRQPNLESRMNVLEAIASENNI

Q568Z6 IST1 homolog | e value: 5.7e-12 | % identity: 29.83
Query:  LLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEAV
        +LG  F+A + R  L L ++RL +L  ++     +AR +    L  G   RA +RVE +I+E   ++A  ++E Y +LL+ R  L++  +E    L E+V
Subjt:  LLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEAV

Query:  SGLIFAASRC-GDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLD
        S LI+AA R   +  EL+ +   L  ++ KE+  +         V+  +M KLS   P        L  IA   N+  + D
Subjt:  SGLIFAASRC-GDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLD

Q9CX00 IST1 homolog | e value: 5.7e-12 | % identity: 29.83
Query:  LLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEAV
        +LG  F+A + R  L L ++RL +L  ++     +AR +    L  G   RA +RVE +I+E   ++A  ++E Y +LL+ R  L++  +E    L E+V
Subjt:  LLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEAV

Query:  SGLIFAASRC-GDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLD
        S LI+AA R   +  EL+ +   L  ++ KE+  +         V+  +M KLS   P        L  IA   N+  + D
Subjt:  SGLIFAASRC-GDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLD

Arabidopsis top hits (e value, % identity, alignment)
AT1G13340.1 Regulator of Vps4 activity in the MVB pathway protein | e value: 1.3e-67 | % identity: 43.51
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP
        MG+KLDALLGR+F+ +KF+ L+ LAL+RL++L NQR  R SQA SD  +LL+LG H  A  RV+QV+K+QN LD    I GY  L ++R  L E  R+CP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP

Query:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN
        EEL EAVSGL+FAASR G+FPELQEI++VL +RFGK+  AR++ELR+NCGVD  I+QKLSTR P  E RM  L+ IA+ENNIVL+LD+   S E      
Subjt:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN

Query:  RRQNYQQEPKAEV------GDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPST---PKPGSGSTNK--RQEQK
          Q      K ++      G+    S  V  G    K+KYKDVADAAQAAFESAA AA AA+AA+ELS+   +  D P          GS NK   QEQ+
Subjt:  RRQNYQQEPKAEV------GDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPST---PKPGSGSTNK--RQEQK

Query:  KSEVESKPKQEIEYQNRREEEEAGSKITAEVKNSMPASFSPSREDS---EMEEQRAPNVET---------------GLDMETETKPELTEETQKPSFGLN
         ++  S+ + ++  +++R   ++   I   V +          +D+   + EE+  P+VET                   +T     +    + P     
Subjt:  KSEVESKPKQEIEYQNRREEEEAGSKITAEVKNSMPASFSPSREDS---EMEEQRAPNVET---------------GLDMETETKPELTEETQKPSFGLN

Query:  LEKKPMSVRTRRVRGF
          K P+SVRTR+VRG+
Subjt:  LEKKPMSVRTRRVRGF

AT1G25420.1 Regulator of Vps4 activity in the MVB pathway protein | e value: 1.8e-29 | % identity: 31.21
Query:  LDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELK
        L+ L  R    +K +  LNLA++R+ +L N+R ++    + +    LQ G    A +RVE VI+E N   AY ++E +   ++ R  +LE E+ECP EL+
Subjt:  LDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELK

Query:  EAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNI----------VLQLDEHPLSNE
        EA++ +IFAA RC + P+L +IK++  T++GKEF   A ELR + GV+ +I++KLS   P+  +R+ +L+ IA E ++           ++  E  L   
Subjt:  EAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNI----------VLQLDEHPLSNE

Query:  EKF--------ARNRRQNYQQEPKAEVGDNL---------------QFSADVPSGSM--------QAKQKYKDVADAAQAAFESAAQAAAAARAAMEL
        ++         +R  +Q Y Q   +   ++L                 S  +PS  +          ++   DV + A+AA  SA +A AAARAA +L
Subjt:  EKF--------ARNRRQNYQQEPKAEVGDNL---------------QFSADVPSGSM--------QAKQKYKDVADAAQAAFESAAQAAAAARAAMEL

AT1G34220.2 Regulator of Vps4 activity in the MVB pathway protein | e value: 1.9e-31 | % identity: 36.32
Query:  LDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELK
        LD+   + F+A+K + LL L + R+ ++ N+R  +  Q R +  +LL+ G    A +RVE +I+E+  + A  ++E +  L+  R  ++E +RECP +LK
Subjt:  LDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELK

Query:  EAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNE
        EA+S + FAA RC D  ELQ+++ +  +++GKEF A A EL+ + GV+  +++ LS R P+ E+++ +L+ IA E+    +LD  P S E
Subjt:  EAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNE

AT2G19710.1 Regulator of Vps4 activity in the MVB pathway protein | e value: 4.0e-29 | % identity: 30
Query:  LLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEAV
        +L R F+ +K +  L +A SRL +L N++ ++  Q R +  QLL+ G    A +RVE V++E+  + AY LI  Y  LL+ R  ++E ++ CP +LKEAV
Subjt:  LLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEAV

Query:  SGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIV------------------------
        + ++FA+ R  D PEL EI    TT++GK+F+  AVELR + GV   +++KLS + P+  +++ +L AIA E+N+V                        
Subjt:  SGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIV------------------------

Query:  ----LQLDEHPLSNEEK-------------FARNRRQNYQQEPKAEVGDNLQFSADVPSG--------------------------------SMQAKQKY
            + +D    SN+E+                + R +  +   A  G +   S +V SG                                S + KQK+
Subjt:  ----LQLDEHPLSNEEK-------------FARNRRQNYQQEPKAEVGDNLQFSADVPSG--------------------------------SMQAKQKY

Query:  K----DVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSTNKRQEQKKSEVESKPKQE
        +    D  DAA+AA E+A +A+ AARAA ELS  +     D +     S S N R E       S  ++E
Subjt:  K----DVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSTNKRQEQKKSEVESKPKQE

AT4G35730.1 Regulator of Vps4 activity in the MVB pathway protein | e value: 1.5e-28 | % identity: 29.89
Query:  ALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEA
        +L  R F +SK +    +A++R+ ++ N+R V   Q R D   LLQ G    A +RVE VI+EQN   A  +IE +  L++ R  ++ ++++CP +LKE 
Subjt:  ALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEA

Query:  VSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARNRRQNYQ
        ++ LIFAA RC + PEL +++ +   ++GK+F + A +LR +CGV+  ++ KLS R P  E ++ +++ IA E     Q+D      E++  + + ++  
Subjt:  VSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARNRRQNYQ

Query:  QEPK-----------AEVGDNLQFSADVP--SGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMEL------SRSQSQDPDDPSTPKPGS--------
           K           A + + +  +  VP  + SM     Y D   AA+AA E A QA AAA+ A  L      S  +     D ST +  S        
Subjt:  QEPK-----------AEVGDNLQFSADVP--SGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMEL------SRSQSQDPDDPSTPKPGS--------

Query:  --GSTNKRQEQKKSEVESKPKQEIEYQNRR-----------EEEEAGSKITAEVKNSM
          GS  + ++ + S   +KP  E     RR           + EE  +   AE K +M
Subjt:  --GSTNKRQEQKKSEVESKPKQEIEYQNRR-----------EEEEAGSKITAEVKNSM


Sequences
CDS sequence
ATGGGAAGGAAGCTCGACGCTCTTCTCGGAAGGAATTTCAGGGCCTCCAAGTTCCGTCCGCTCCTCAATCTCGCCCTCTCTCGCCTCGCCGTCCTCACCAACCAGCGCCA
TGTGAGGCGCTCTCAAGCTCGCTCTGATGCCCTCCAACTTCTCCAATTAGGCCACCACCACCGCGCTCTCCTTCGAGTTGAGCAAGTGATTAAGGAGCAGAATGCTTTGG
ATGCTTATGTTCTGATTGAAGGCTATCTCAATCTCTTGATCGAAAGGACCGACCTCCTCGAACAAGAAAGAGAATGCCCTGAGGAATTGAAAGAGGCAGTATCGGGACTG
ATTTTTGCGGCTTCAAGATGTGGGGATTTTCCAGAACTTCAAGAGATCAAATCGGTTTTGACCACTCGCTTCGGCAAAGAGTTCACTGCTCGTGCTGTTGAATTACGGAA
CAACTGTGGAGTCGATCATTCGATAATGCAAAAACTGTCTACCAGGCAGCCAAATTTGGAGAGCAGAATGAATGTGCTGGAAGCCATTGCTTCTGAGAATAACATTGTTC
TGCAACTCGATGAACATCCTCTTTCCAATGAGGAAAAGTTTGCCCGAAACAGAAGGCAGAATTACCAGCAGGAGCCTAAGGCTGAAGTGGGGGACAATTTGCAATTCTCT
GCTGATGTTCCATCTGGTTCTATGCAAGCTAAACAGAAGTACAAAGATGTGGCGGATGCAGCTCAAGCTGCTTTCGAATCAGCCGCTCAGGCGGCAGCGGCTGCCAGAGC
TGCCATGGAGCTCTCCCGCTCTCAATCACAAGATCCTGATGATCCAAGCACCCCAAAACCTGGCTCTGGAAGTACAAATAAGAGACAAGAGCAAAAGAAATCCGAGGTAG
AATCGAAACCGAAACAGGAAATCGAATACCAAAATCGAAGAGAAGAAGAAGAAGCAGGCAGCAAAATCACAGCAGAAGTGAAGAACTCAATGCCTGCTTCGTTTTCTCCA
AGTAGAGAAGACAGTGAAATGGAGGAGCAGAGAGCTCCAAATGTTGAAACTGGCTTGGACATGGAAACAGAAACAAAGCCAGAACTCACAGAGGAAACTCAGAAGCCAAG
CTTTGGCTTAAATCTGGAGAAGAAGCCAATGTCAGTGAGAACAAGGAGAGTTCGTGGATTCTGA
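
The CDS above can be checked against the protein sequence listed on this page by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; the names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB. Only the first 33 bases of the CDS are used here to keep the example short.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions (assumption: standard nuclear code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 bases of the CDS sequence shown above.
cds_start = "ATGGGAAGGAAGCTCGACGCTCTTCTCGGAAGG"
print(translate(cds_start))  # MGRKLDALLGR
```

Passing the full CDS (with its line breaks) to `translate` should give a sequence that can be compared directly with the protein sequence on this page; the final TGA codon is treated as the stop and is not emitted.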
mRNA sequence
GTTCAATAGAAACTCCCAACGACCTTCTAATGGAAGATATATGCTAATTAACGTACTATCACGATATTGGAAGTTAATCATAATTATAGTATATATATATATATATTGGT
TGTAAATGATAAATAAACCCAAAATTTGTCTCCATGGACCTGAGATTCTGAGAAGGCCGTAGTGACGTTCGGGAAGCGCGTTGGACAATTTTCCCAACAAAATTTTACGA
GGCGAAGAGAGACTTCGGCCGTAAGAAAACTGAAACACACACAAAGCCCCGTTCCCACGCGCTTCAACAACATATATAAACCCCCACCGGCGCCGATCCCCTCACTCTTT
TTCTTCCATTTCTCCGGCCGACGACGCCATGGGAAGGAAGCTCGACGCTCTTCTCGGAAGGAATTTCAGGGCCTCCAAGTTCCGTCCGCTCCTCAATCTCGCCCTCTCTC
GCCTCGCCGTCCTCACCAACCAGCGCCATGTGAGGCGCTCTCAAGCTCGCTCTGATGCCCTCCAACTTCTCCAATTAGGCCACCACCACCGCGCTCTCCTTCGAGTTGAG
CAAGTGATTAAGGAGCAGAATGCTTTGGATGCTTATGTTCTGATTGAAGGCTATCTCAATCTCTTGATCGAAAGGACCGACCTCCTCGAACAAGAAAGAGAATGCCCTGA
GGAATTGAAAGAGGCAGTATCGGGACTGATTTTTGCGGCTTCAAGATGTGGGGATTTTCCAGAACTTCAAGAGATCAAATCGGTTTTGACCACTCGCTTCGGCAAAGAGT
TCACTGCTCGTGCTGTTGAATTACGGAACAACTGTGGAGTCGATCATTCGATAATGCAAAAACTGTCTACCAGGCAGCCAAATTTGGAGAGCAGAATGAATGTGCTGGAA
GCCATTGCTTCTGAGAATAACATTGTTCTGCAACTCGATGAACATCCTCTTTCCAATGAGGAAAAGTTTGCCCGAAACAGAAGGCAGAATTACCAGCAGGAGCCTAAGGC
TGAAGTGGGGGACAATTTGCAATTCTCTGCTGATGTTCCATCTGGTTCTATGCAAGCTAAACAGAAGTACAAAGATGTGGCGGATGCAGCTCAAGCTGCTTTCGAATCAG
CCGCTCAGGCGGCAGCGGCTGCCAGAGCTGCCATGGAGCTCTCCCGCTCTCAATCACAAGATCCTGATGATCCAAGCACCCCAAAACCTGGCTCTGGAAGTACAAATAAG
AGACAAGAGCAAAAGAAATCCGAGGTAGAATCGAAACCGAAACAGGAAATCGAATACCAAAATCGAAGAGAAGAAGAAGAAGCAGGCAGCAAAATCACAGCAGAAGTGAA
GAACTCAATGCCTGCTTCGTTTTCTCCAAGTAGAGAAGACAGTGAAATGGAGGAGCAGAGAGCTCCAAATGTTGAAACTGGCTTGGACATGGAAACAGAAACAAAGCCAG
AACTCACAGAGGAAACTCAGAAGCCAAGCTTTGGCTTAAATCTGGAGAAGAAGCCAATGTCAGTGAGAACAAGGAGAGTTCGTGGATTCTGAAGAACATATGTAAATTTG
TAATTGAAAATTGGGTTTTACGTGCCCATTTGGTTTTGGTTTTGGTTTCTTTCTAAATCTTTTTTTTTCCTTCTTTTTTGGTTTGGAAAGATCGTGGAAAGAGTGTGTAA
TTAAATGATTTAGGAATTTTGGTAATGTATGGGAATTCAACATTTTTGTTTGTTTACATGATGTATTGTATTCACAGGAATTTGACATATTGACCTTTTTATTTTTTATT
TTTCCATCCCA
Protein sequence
MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEAVSGL
IFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVDHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARNRRQNYQQEPKAEVGDNLQFS
ADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSTNKRQEQKKSEVESKPKQEIEYQNRREEEEAGSKITAEVKNSMPASFSP
SREDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF