; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS017459 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS017459
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionRegulator of Vps4 activity in the MVB pathway protein
Genome locationscaffold435:599070..600694
RNA-Seq ExpressionMS017459
SyntenyMS017459
Gene Ontology termsGO:0015031 - protein transport (biological process)
InterPro domainsIPR005061 - Vacuolar protein sorting-associated protein Ist1
IPR042277 - Vacuolar protein sorting-associated protein IST1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601794.1 IST1-like protein, partial [Cucurbita argyrosperma subsp. sororia]2.4e-13775Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP
        MGR LDALLGRNFRASKFRPLLNLA+SRLA+LTNQR +RRSQA+SD LQLLQL H  RALLRVEQVIK+QNALDAYVLIEGYLNLLIER  LLEQERECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP

Query:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN
        EELKEA++GL+FAASRCGDFPEL EIKSVLT+RFGKEFTARAVELRNNCGVNH +MQKLSTR PNLESRM++L+ IASEN I LQLDE PLSNE   ARN
Subjt:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN

Query:  KRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSAN-KSQEQKKSEVESKPKQ
         RQN Q EP   VG+NL+FS +VPSGS   KQKYKDVADAAQAAFESAAQAAAAARAAMELSRS+SQDPD PS+P PGS + +   QEQKKSEVE+KPK 
Subjt:  KRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSAN-KSQEQKKSEVESKPKQ

Query:  EIEYQNRREEEEAGSKITAEVKNSMPASFSPS----REDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF
        EIEY+  +EEE +  +  AEVKNSM    S      +E+ EM+EQRA N+E G +ME     E TE T+KPSF LNLEKKP+SVRTRRVRG+
Subjt:  EIEYQNRREEEEAGSKITAEVKNSMPASFSPS----REDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF

XP_022139373.1 uncharacterized protein LOC111010325 [Momordica charantia]3.8e-19998.97Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP
        MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP

Query:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN
        EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGV+HSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN
Subjt:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN

Query:  KRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSANKSQEQKKSEVESKPKQE
        +RQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGS NK QEQKKSEVESKPKQE
Subjt:  KRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSANKSQEQKKSEVESKPKQE

Query:  IEYQNRREEEEAGSKITAEVKNSMPASFSPSREDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF
        IEYQNRREEEEAGSKITAEVKNSMPASFSPSREDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF
Subjt:  IEYQNRREEEEAGSKITAEVKNSMPASFSPSREDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF

XP_022921602.1 uncharacterized protein LOC111429814 [Cucurbita moschata]8.4e-13875Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP
        MGR LDALLGRNFRASKFRPLLNLALSRLA+LTNQR +RRSQA+SD LQLLQL H  RALLRVEQVIK+QNALDAYVLIEGYLNLLIERT LLEQ+R+CP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP

Query:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN
        EELKEA++GL+FAASRCGDFPEL EIKSVLT+RFGKEFTARAVELRNNCGVNH +MQKLSTR PNLESRM++L+ IASEN I LQLDE PLSNE   ARN
Subjt:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN

Query:  KRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSAN-KSQEQKKSEVESKPKQ
         RQN Q EP   VG+NL+FS +VPSGS   KQKYKDVADAAQAAFESAAQAAAAARAAMELSRS+SQDPD PS+P PGS + +   QEQKKSEVE+KPK 
Subjt:  KRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSAN-KSQEQKKSEVESKPKQ

Query:  EIEYQNRREEEEAGSKITAEVKNSMPASFSPS----REDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF
        EIEY+  +EEE +  +  AEVKNSM    S      +E+ EM+EQRA N+E G +ME     E TE T+KPSF LNLEKKP+SVRTRRVRG+
Subjt:  EIEYQNRREEEEAGSKITAEVKNSMPASFSPS----REDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF

XP_022971762.1 uncharacterized protein LOC111470447 [Cucurbita maxima]2.3e-13574.49Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP
        MGR LDALLGRNFRASKFRPLLNLA+SRLA+LTNQR +RR+QA+SD LQLLQL H  RALLRVEQVIK+QNALDAYVL+EGYLNLLIERT LLEQERECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP

Query:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN
        EELKEA++GL+FAASRCGDFPEL EIKSVLTT FGKEFTARAVELRNNCGVNH +MQKLSTRQPNLESRM++L+ IASEN I LQLDE PLSNE   ARN
Subjt:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN

Query:  KRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSG-SANKSQEQKKSEVESKPKQ
         RQN Q EP   VG+NL+ S +VPSGS   KQKYKDVADAAQAAFESAAQAAAAARAAMELSRS+SQDPD PS+P PGS  S+   QEQKKS VE+KPK 
Subjt:  KRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSG-SANKSQEQKKSEVESKPKQ

Query:  EIEYQNRREEEEAGSKITAEVKNSMPASFSPS----REDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF
        EIEY+  REEE +  +   EVKNSM    S      +E+ EM+EQRA N+E G +ME     E T  T+KPSF LNLEKKP+SVRTRRVRG+
Subjt:  EIEYQNRREEEEAGSKITAEVKNSMPASFSPS----REDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF

XP_023538782.1 uncharacterized protein LOC111799606 [Cucurbita pepo subsp. pepo]4.2e-13775.45Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP
        MGR LDALLGRNFRASKFRPLLNLALSRLA+LTNQR +RRSQA+SD LQLLQL H  RALLRVEQVIK+QNALDAYVLIEGYLNLLIERT LLEQERECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP

Query:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN
        EELKEA++GL+FAASRCGDFPEL EIKSVLT+RFGKEFTARAVELRNNCGVNH +MQKLSTRQPNLESRM++L+ IASEN I LQLDE PLSNE   ARN
Subjt:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN

Query:  KRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSAN-KSQEQKKSEVESKPKQ
          QN Q EP   VG+NL+ S +VPSGS   KQKYKDVADAAQAAFESAAQAAAAARAAMELSR +SQDPD PS+P PGSG+ +   QEQKKS+VE+KPK 
Subjt:  KRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSAN-KSQEQKKSEVESKPKQ

Query:  EIEYQNRREEEEAGSKITAEVKNSMPASFSPS----REDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRG
        EIEY+  REEE +  +  AEVKNSM    S      +E+ EM+EQRA N+E G +ME     E TE T+KPSF LNLEKKP+SVRTRRVRG
Subjt:  EIEYQNRREEEEAGSKITAEVKNSMPASFSPS----REDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRG

TrEMBL top hitse value%identityAlignment
A0A0A0KAU0 Uncharacterized protein2.2e-11267.78Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP
        MGRKLDALLGRNFRASKFRPLLNL+LSRL++LT QR V  SQA SD LQLLQL HHHRALLRVE+VIK+QNALDAYVLIEGYLNLL+ERT LLEQ+ ECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP

Query:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN
        EELKEAV+GL+FAASRCGDFPEL EIKSVLTTRFGKEFTARAVELRNNCGVN S+MQKLSTRQP LE+RM+ L++IASEN IVLQ+D+ P S +EK  RN
Subjt:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN

Query:  KRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSANKSQEQKKSEVESKPKQE
         RQ+     +AE G + +FS +V SGS   K  YKDVADAAQAAFESAAQAAAAARAAMELSRS  + P  PS P  G+ S NK Q+++K EVESK KQE
Subjt:  KRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSANKSQEQKKSEVESKPKQE

Query:  I-EYQNRREEEEAGSKITAEVKNSMPASFSPSREDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF
        + EY N R+ E  G +                 E   M+E+R  N   GL M  ETK E TE ++K SF LNLEKKP+SVRTRRV G+
Subjt:  I-EYQNRREEEEAGSKITAEVKNSMPASFSPSREDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF

A0A6J1CFG6 uncharacterized protein LOC1110103251.9e-19998.97Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP
        MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP

Query:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN
        EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGV+HSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN
Subjt:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN

Query:  KRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSANKSQEQKKSEVESKPKQE
        +RQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGS NK QEQKKSEVESKPKQE
Subjt:  KRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSANKSQEQKKSEVESKPKQE

Query:  IEYQNRREEEEAGSKITAEVKNSMPASFSPSREDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF
        IEYQNRREEEEAGSKITAEVKNSMPASFSPSREDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF
Subjt:  IEYQNRREEEEAGSKITAEVKNSMPASFSPSREDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF

A0A6J1E0Z0 uncharacterized protein LOC1114298144.1e-13875Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP
        MGR LDALLGRNFRASKFRPLLNLALSRLA+LTNQR +RRSQA+SD LQLLQL H  RALLRVEQVIK+QNALDAYVLIEGYLNLLIERT LLEQ+R+CP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP

Query:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN
        EELKEA++GL+FAASRCGDFPEL EIKSVLT+RFGKEFTARAVELRNNCGVNH +MQKLSTR PNLESRM++L+ IASEN I LQLDE PLSNE   ARN
Subjt:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN

Query:  KRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSAN-KSQEQKKSEVESKPKQ
         RQN Q EP   VG+NL+FS +VPSGS   KQKYKDVADAAQAAFESAAQAAAAARAAMELSRS+SQDPD PS+P PGS + +   QEQKKSEVE+KPK 
Subjt:  KRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSAN-KSQEQKKSEVESKPKQ

Query:  EIEYQNRREEEEAGSKITAEVKNSMPASFSPS----REDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF
        EIEY+  +EEE +  +  AEVKNSM    S      +E+ EM+EQRA N+E G +ME     E TE T+KPSF LNLEKKP+SVRTRRVRG+
Subjt:  EIEYQNRREEEEAGSKITAEVKNSMPASFSPS----REDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF

A0A6J1E4E5 uncharacterized protein LOC1114298231.5e-13273.6Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP
        MGR LDALLGRNFRASKFRPLLNL LSRLA+LTNQR +RRSQA+SD LQLLQL H  RALLRVEQVIK+QNALDAYVLIEGYLNLLIER  LLEQERECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP

Query:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN
        EELKEA++GL+FAASRCGDFPEL EIKS LT+RFGKEFTARAVELRNNCGVNH +MQKLSTR PNLESRM++L+ IASEN I LQLDE PLSNE   ARN
Subjt:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN

Query:  KRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSAN-KSQEQKKSEVESKPKQ
         RQN Q E    VG+NL+FS +VPSGS   KQKYKDVADAAQAAFESAAQAAAAARAAMELSR +SQDPD PS+P PGS + +   Q QK+S+VE+KPK 
Subjt:  KRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSAN-KSQEQKKSEVESKPKQ

Query:  EIEYQNRREEEEAGSKIT--AEVKNSMPASFSPS----REDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF
        EIEY+  REEE  GSK+T   EVKNSM    S      +E+ EM+EQRA N+E G +ME     E  E T+KPSFGLNLEKKP+ VRT RVRG+
Subjt:  EIEYQNRREEEEAGSKIT--AEVKNSMPASFSPS----REDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF

A0A6J1I2V3 uncharacterized protein LOC1114704471.1e-13574.49Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP
        MGR LDALLGRNFRASKFRPLLNLA+SRLA+LTNQR +RR+QA+SD LQLLQL H  RALLRVEQVIK+QNALDAYVL+EGYLNLLIERT LLEQERECP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP

Query:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN
        EELKEA++GL+FAASRCGDFPEL EIKSVLTT FGKEFTARAVELRNNCGVNH +MQKLSTRQPNLESRM++L+ IASEN I LQLDE PLSNE   ARN
Subjt:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN

Query:  KRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSG-SANKSQEQKKSEVESKPKQ
         RQN Q EP   VG+NL+ S +VPSGS   KQKYKDVADAAQAAFESAAQAAAAARAAMELSRS+SQDPD PS+P PGS  S+   QEQKKS VE+KPK 
Subjt:  KRQNYQQEPKAEVGDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSG-SANKSQEQKKSEVESKPKQ

Query:  EIEYQNRREEEEAGSKITAEVKNSMPASFSPS----REDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF
        EIEY+  REEE +  +   EVKNSM    S      +E+ EM+EQRA N+E G +ME     E T  T+KPSF LNLEKKP+SVRTRRVRG+
Subjt:  EIEYQNRREEEEAGSKITAEVKNSMPASFSPS----REDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF

SwissProt top hitse value%identityAlignment
P53990 IST1 homolog2.5e-1230.39Show/hide
Query:  LLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEAV
        +LG  F+A + R  L L ++RL +L  ++     +AR +    L  G   RA +RVE +I+E   ++A  ++E Y +LL+ R  L++  +E    L E+V
Subjt:  LLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEAV

Query:  SGLIFAASRC-GDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLD
        S LI+AA R   +  EL+ +   L  ++ KE+  +         VN  +M KLS   P        L  IA   N+  + D
Subjt:  SGLIFAASRC-GDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLD

Q3ZBV1 IST1 homolog1.3e-1129.83Show/hide
Query:  LLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEAV
        +LG   +A + R  L L ++RL +L  ++     +AR +    L  G   RA +RVE +I+E   ++A  ++E Y +LL+ R  L++  +E    L E+V
Subjt:  LLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEAV

Query:  SGLIFAASRC-GDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLD
        S LI+AA R   +  EL+ +   L  ++ KE+  +         VN  +M KLS   P        L  IA   N+  + D
Subjt:  SGLIFAASRC-GDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLD

Q54I39 IST1-like protein4.3e-1230.34Show/hide
Query:  GRNFRASKFRPLLNLALSRLAVLTNQR-HVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEAVS
        G ++ + K +  L LA+SR+ +L N++ ++ R + R+ A +LL+  +   A +RVE +I+++  ++ + +IE    LL  R +L+    E P E+KE++ 
Subjt:  GRNFRASKFRPLLNLALSRLAVLTNQR-HVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEAVS

Query:  GLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNC----GVNHSIMQKLSTRQPNLESRMNVLEAIASENNI
         L++++ R    PEL++IK+ L  ++GK      +E   NC     VN  I+ KLS   P+       L  IA + N+
Subjt:  GLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNC----GVNHSIMQKLSTRQPNLESRMNVLEAIASENNI

Q568Z6 IST1 homolog2.5e-1230.39Show/hide
Query:  LLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEAV
        +LG  F+A + R  L L ++RL +L  ++     +AR +    L  G   RA +RVE +I+E   ++A  ++E Y +LL+ R  L++  +E    L E+V
Subjt:  LLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEAV

Query:  SGLIFAASRC-GDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLD
        S LI+AA R   +  EL+ +   L  ++ KE+  +         VN  +M KLS   P        L  IA   N+  + D
Subjt:  SGLIFAASRC-GDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLD

Q9CX00 IST1 homolog2.5e-1230.39Show/hide
Query:  LLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEAV
        +LG  F+A + R  L L ++RL +L  ++     +AR +    L  G   RA +RVE +I+E   ++A  ++E Y +LL+ R  L++  +E    L E+V
Subjt:  LLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEAV

Query:  SGLIFAASRC-GDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLD
        S LI+AA R   +  EL+ +   L  ++ KE+  +         VN  +M KLS   P        L  IA   N+  + D
Subjt:  SGLIFAASRC-GDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLD

Arabidopsis top hitse value%identityAlignment
AT1G13340.1 Regulator of Vps4 activity in the MVB pathway protein8.3e-6743.27Show/hide
Query:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP
        MG+KLDALLGR+F+ +KF+ L+ LAL+RL++L NQR  R SQA SD  +LL+LG H  A  RV+QV+K+QN LD    I GY  L ++R  L E  R+CP
Subjt:  MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECP

Query:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN
        EEL EAVSGL+FAASR G+FPELQEI++VL +RFGK+  AR++ELR+NCGV+  I+QKLSTR P  E RM  L+ IA+ENNIVL+LD+   S E      
Subjt:  EELKEAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARN

Query:  KRQNYQQEPKAEV------GDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPST---PKPGSGSANK--SQEQK
          Q      K ++      G+    S  V  G    K+KYKDVADAAQAAFESAA AA AA+AA+ELS+   +  D P          GS NK   QEQ+
Subjt:  KRQNYQQEPKAEV------GDNLQFSADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPST---PKPGSGSANK--SQEQK

Query:  KSEVESKPKQEIEYQNRREEEEAGSKITAEVKNSMPASFSPSREDS---EMEEQRAPNVET---------------GLDMETETKPELTEETQKPSFGLN
         ++  S+ + ++  +++R   ++   I   V +          +D+   + EE+  P+VET                   +T     +    + P     
Subjt:  KSEVESKPKQEIEYQNRREEEEAGSKITAEVKNSMPASFSPSREDS---EMEEQRAPNVET---------------GLDMETETKPELTEETQKPSFGLN

Query:  LEKKPMSVRTRRVRGF
          K P+SVRTR+VRG+
Subjt:  LEKKPMSVRTRRVRGF

AT1G25420.1 Regulator of Vps4 activity in the MVB pathway protein6.2e-3031.54Show/hide
Query:  LDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELK
        L+ L  R    +K +  LNLA++R+ +L N+R ++    + +    LQ G    A +RVE VI+E N   AY ++E +   ++ R  +LE E+ECP EL+
Subjt:  LDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELK

Query:  EAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNI----------VLQLDEHPLSNE
        EA++ +IFAA RC + P+L +IK++  T++GKEF   A ELR + GVN +I++KLS   P+  +R+ +L+ IA E ++           ++  E  L   
Subjt:  EAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNI----------VLQLDEHPLSNE

Query:  EKF--------ARNKRQNYQQEPKAEVGDNL---------------QFSADVPSGSM--------QAKQKYKDVADAAQAAFESAAQAAAAARAAMEL
        ++         +R  +Q Y Q   +   ++L                 S  +PS  +          ++   DV + A+AA  SA +A AAARAA +L
Subjt:  EKF--------ARNKRQNYQQEPKAEVGDNL---------------QFSADVPSGSM--------QAKQKYKDVADAAQAAFESAAQAAAAARAAMEL

AT1G34220.2 Regulator of Vps4 activity in the MVB pathway protein6.6e-3236.84Show/hide
Query:  LDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELK
        LD+   + F+A+K + LL L + R+ ++ N+R  +  Q R +  +LL+ G    A +RVE +I+E+  + A  ++E +  L+  R  ++E +RECP +LK
Subjt:  LDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELK

Query:  EAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNE
        EA+S + FAA RC D  ELQ+++ +  +++GKEF A A EL+ + GVN  +++ LS R P+ E+++ +L+ IA E+    +LD  P S E
Subjt:  EAVSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNE

AT2G19710.1 Regulator of Vps4 activity in the MVB pathway protein1.6e-2526.97Show/hide
Query:  LLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEAV
        +L R F+ +K +  L +A SRL +L N++ ++  Q R +  QLL+ G    A +RVE V++E+  + AY LI  Y  LL+ R  ++E ++ CP +LKEAV
Subjt:  LLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEAV

Query:  SGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIV------------------------
        + ++FA+ R  D PEL EI    TT++GK+F+  AVELR + GV+  +++KLS + P+  +++ +L AIA E+N+V                        
Subjt:  SGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIV------------------------

Query:  ----LQLDEHPLSNEEK-------------FARNKRQNYQQEPKAEVGDNLQFSADVPSG--------------------------------SMQAKQKY
            + +D    SN+E+                ++R +  +   A  G +   S +V SG                                S + KQK+
Subjt:  ----LQLDEHPLSNEEK-------------FARNKRQNYQQEPKAEVGDNLQFSADVPSG--------------------------------SMQAKQKY

Query:  K----DVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSAN------------KSQEQKKSEVESKPKQEIEYQ------NRREEEEA
        +    D  DAA+AA E+A +A+ AARAA ELS  +     D +     S S N             +Q +  SE    P++ +  Q       R++  + 
Subjt:  K----DVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSAN------------KSQEQKKSEVESKPKQEIEYQ------NRREEEEA

Query:  GSKITAEVKNSMPASFSPSREDSEMEEQRAPNV-ETGLDMETETKPELTEETQKPS
          +I    + S   S   SR +     ++ P+  ET +++       L +++ + S
Subjt:  GSKITAEVKNSMPASFSPSREDSEMEEQRAPNV-ETGLDMETETKPELTEETQKPS

AT4G35730.1 Regulator of Vps4 activity in the MVB pathway protein2.4e-2930.45Show/hide
Query:  ALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEA
        +L  R F +SK +    +A++R+ ++ N+R V   Q R D   LLQ G    A +RVE VI+EQN   A  +IE +  L++ R  ++ ++++CP +LKE 
Subjt:  ALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEA

Query:  VSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARNKRQNYQ
        ++ LIFAA RC + PEL +++ +   ++GK+F + A +LR +CGVN  ++ KLS R P  E ++ +++ IA E     Q+D      E++  + + ++  
Subjt:  VSGLIFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARNKRQNYQ

Query:  QEPK-----------AEVGDNLQFSADVP--SGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMEL------SRSQSQDPDDPSTPKPGS--------
           K           A + + +  +  VP  + SM     Y D   AA+AA E A QA AAA+ A  L      S  +     D ST +  S        
Subjt:  QEPK-----------AEVGDNLQFSADVP--SGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMEL------SRSQSQDPDDPSTPKPGS--------

Query:  --GSANKSQEQKKSEVESKPKQEIEYQNRR-----------EEEEAGSKITAEVKNSM
          GS  +S++ + S   +KP  E     RR           + EE  +   AE K +M
Subjt:  --GSANKSQEQKKSEVESKPKQEIEYQNRR-----------EEEEAGSKITAEVKNSM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGGAAGCTCGACGCTCTTCTCGGAAGGAATTTCAGGGCCTCCAAGTTCCGTCCGCTCCTCAATCTCGCCCTCTCTCGCCTCGCCGTCCTCACCAACCAGCGCCA
TGTGAGGCGCTCTCAAGCTCGCTCTGATGCCCTCCAACTTCTCCAATTAGGCCACCACCACCGCGCTCTCCTTCGAGTCGAGCAAGTGATTAAGGAGCAGAATGCTTTGG
ATGCGTATGTTCTGATTGAAGGCTATCTCAATCTCTTGATCGAAAGGACCGACCTCCTCGAACAAGAAAGAGAATGCCCTGAGGAATTGAAAGAGGCAGTATCGGGACTG
ATTTTTGCGGCTTCAAGATGTGGGGATTTTCCAGAACTTCAAGAGATCAAATCGGTTTTGACCACTCGCTTCGGCAAAGAGTTCACTGCTCGTGCTGTTGAATTACGGAA
CAACTGTGGAGTCAATCATTCGATAATGCAAAAACTGTCTACCAGGCAGCCAAATTTGGAGAGCAGAATGAATGTGCTGGAAGCCATTGCTTCTGAGAATAACATTGTTC
TGCAACTCGATGAACATCCTCTTTCCAATGAGGAAAAATTTGCCCGAAACAAAAGGCAGAATTACCAGCAGGAGCCTAAGGCTGAAGTGGGGGACAATTTGCAATTCTCT
GCTGATGTTCCATCTGGTTCTATGCAAGCTAAACAGAAGTACAAAGATGTGGCGGATGCAGCTCAAGCCGCTTTCGAATCAGCCGCTCAGGCGGCAGCGGCTGCCAGAGC
TGCCATGGAGCTCTCCCGCTCTCAATCACAAGATCCTGATGATCCAAGCACCCCAAAACCTGGCTCTGGAAGTGCAAATAAGAGCCAAGAGCAAAAGAAATCCGAGGTAG
AATCGAAACCGAAACAGGAAATCGAATACCAAAATCGAAGAGAAGAAGAAGAAGCAGGCAGCAAAATCACAGCAGAAGTGAAGAACTCAATGCCTGCTTCGTTTTCTCCA
AGTAGAGAAGACAGTGAAATGGAGGAGCAGAGAGCTCCAAATGTTGAAACTGGCTTGGACATGGAAACAGAAACAAAGCCAGAACTCACAGAGGAAACTCAGAAGCCAAG
CTTTGGCTTAAATCTGGAGAAGAAGCCAATGTCAGTGAGAACAAGGAGAGTTCGTGGATTC
mRNA sequenceShow/hide mRNA sequence
ATGGGAAGGAAGCTCGACGCTCTTCTCGGAAGGAATTTCAGGGCCTCCAAGTTCCGTCCGCTCCTCAATCTCGCCCTCTCTCGCCTCGCCGTCCTCACCAACCAGCGCCA
TGTGAGGCGCTCTCAAGCTCGCTCTGATGCCCTCCAACTTCTCCAATTAGGCCACCACCACCGCGCTCTCCTTCGAGTCGAGCAAGTGATTAAGGAGCAGAATGCTTTGG
ATGCGTATGTTCTGATTGAAGGCTATCTCAATCTCTTGATCGAAAGGACCGACCTCCTCGAACAAGAAAGAGAATGCCCTGAGGAATTGAAAGAGGCAGTATCGGGACTG
ATTTTTGCGGCTTCAAGATGTGGGGATTTTCCAGAACTTCAAGAGATCAAATCGGTTTTGACCACTCGCTTCGGCAAAGAGTTCACTGCTCGTGCTGTTGAATTACGGAA
CAACTGTGGAGTCAATCATTCGATAATGCAAAAACTGTCTACCAGGCAGCCAAATTTGGAGAGCAGAATGAATGTGCTGGAAGCCATTGCTTCTGAGAATAACATTGTTC
TGCAACTCGATGAACATCCTCTTTCCAATGAGGAAAAATTTGCCCGAAACAAAAGGCAGAATTACCAGCAGGAGCCTAAGGCTGAAGTGGGGGACAATTTGCAATTCTCT
GCTGATGTTCCATCTGGTTCTATGCAAGCTAAACAGAAGTACAAAGATGTGGCGGATGCAGCTCAAGCCGCTTTCGAATCAGCCGCTCAGGCGGCAGCGGCTGCCAGAGC
TGCCATGGAGCTCTCCCGCTCTCAATCACAAGATCCTGATGATCCAAGCACCCCAAAACCTGGCTCTGGAAGTGCAAATAAGAGCCAAGAGCAAAAGAAATCCGAGGTAG
AATCGAAACCGAAACAGGAAATCGAATACCAAAATCGAAGAGAAGAAGAAGAAGCAGGCAGCAAAATCACAGCAGAAGTGAAGAACTCAATGCCTGCTTCGTTTTCTCCA
AGTAGAGAAGACAGTGAAATGGAGGAGCAGAGAGCTCCAAATGTTGAAACTGGCTTGGACATGGAAACAGAAACAAAGCCAGAACTCACAGAGGAAACTCAGAAGCCAAG
CTTTGGCTTAAATCTGGAGAAGAAGCCAATGTCAGTGAGAACAAGGAGAGTTCGTGGATTC
Protein sequenceShow/hide protein sequence
MGRKLDALLGRNFRASKFRPLLNLALSRLAVLTNQRHVRRSQARSDALQLLQLGHHHRALLRVEQVIKEQNALDAYVLIEGYLNLLIERTDLLEQERECPEELKEAVSGL
IFAASRCGDFPELQEIKSVLTTRFGKEFTARAVELRNNCGVNHSIMQKLSTRQPNLESRMNVLEAIASENNIVLQLDEHPLSNEEKFARNKRQNYQQEPKAEVGDNLQFS
ADVPSGSMQAKQKYKDVADAAQAAFESAAQAAAAARAAMELSRSQSQDPDDPSTPKPGSGSANKSQEQKKSEVESKPKQEIEYQNRREEEEAGSKITAEVKNSMPASFSP
SREDSEMEEQRAPNVETGLDMETETKPELTEETQKPSFGLNLEKKPMSVRTRRVRGF