CuGenDBv2

Lag0038648 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0038648
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Unknown protein
Genome location: chr2:22292696..22295064
RNA-Seq Expression: Lag0038648
Synteny: Lag0038648
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_022136208.1 uncharacterized protein LOC111007960 [Momordica charantia]; e value: 2.0e-222; % identity: 83.99
Query:  MEFYQTDSQALLLQIRRQEDLLKSKRRWLLGLPTSI-GQKHSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGALNVETRHLGIRADQILDTCKVR
        MEF QTDS+A+ LQIRRQE+LLK KRRWLLGLPTS  GQK SDHSDFLNKRNLPE LLREDDVFYETVKTR+EEAFGALNVETRH GI+ D+I DTCK+ 
Subjt:  MEFYQTDSQALLLQIRRQEDLLKSKRRWLLGLPTSI-GQKHSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGALNVETRHLGIRADQILDTCKVR

Query:  KHILSCLNDLNTRGLYLLAIVLTEDSVKLEKTRWKLKRVIREFIPRVLRRKSQDCRQLEIVKQLSQFFNDPKNFRRRCSTTSTSSSPSFHDAASQVLYRL
        K ILSCL+DL+TRGLYLLAI+LT+DSVKLEKTRWKLKRV+REFIP VLRRKSQDC QLE+VK+LSQ  NDP NFRRRCS T TSSSPS+HDAASQVLYRL
Subjt:  KHILSCLNDLNTRGLYLLAIVLTEDSVKLEKTRWKLKRVIREFIPRVLRRKSQDCRQLEIVKQLSQFFNDPKNFRRRCSTTSTSSSPSFHDAASQVLYRL

Query:  GDLPTQGLLAMHRKLEGVRVMPQIKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYPFSPQIKTLHNEI
        GDLPTQGLLAM RKLEGVRVMPQIKRHRHGWGRDRLIN+LT+ S KMLSS GEGDELQESLAKAMAVADLSLKLVPG HNSS IEFYPF PQIKTLHNEI
Subjt:  GDLPTQGLLAMHRKLEGVRVMPQIKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYPFSPQIKTLHNEI

Query:  VKAIWFVRNKCNIQKLKQLKSLLDPDAKVSNRSLRTAIKKMLIDYLFECSDMDSMPKSLLKALAIISVDSRSAPHSFSSQDEIEEEVECVFTLSAQMKQV
        VKAIW VR K N QKLKQLKSLLDPDAKVSNR LRTAIKKMLIDYLFECSDMD++PKSLLKALAII+ DSR+A +SF S +EIE+EVECVF+LSAQMKQV
Subjt:  VKAIWFVRNKCNIQKLKQLKSLLDPDAKVSNRSLRTAIKKMLIDYLFECSDMDSMPKSLLKALAIISVDSRSAPHSFSSQDEIEEEVECVFTLSAQMKQV

Query:  VWDLLPNCDFEHDFADAYMEELEESDDDFDDNDDDSCDGLPREDNGS------VYVEGMGESMPANLDHSSAGNAMSPSQA
        VWDLLPNCDFEHDFADAYMEELEESDDDFDDND+D+CDGLP +DNGS       +VEGMGESMPANL+HSS GN +SPS A
Subjt:  VWDLLPNCDFEHDFADAYMEELEESDDDFDDNDDDSCDGLPREDNGS------VYVEGMGESMPANLDHSSAGNAMSPSQA

XP_022969011.1 uncharacterized protein LOC111468137 isoform X1 [Cucurbita maxima]; e value: 9.1e-220; % identity: 81.71
Query:  MEFYQTDSQALLLQIRRQEDLLKSKRRWLLGLPTSI-GQKHSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGALNVETRHLGIRADQILDTCKVR
        MEFYQTDSQALL Q+RRQE+LLKSKRRWLLGLPTS+ G K+SDHSD LNKRNLPESLLREDDVF+ TVKTR+EEAFG LN+ETRHLGIRADQILDTCKVR
Subjt:  MEFYQTDSQALLLQIRRQEDLLKSKRRWLLGLPTSI-GQKHSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGALNVETRHLGIRADQILDTCKVR

Query:  KHILSCLNDLNTRGLYLLAIVLTEDSVKLEKTRWKLKRVIREFIPRVLRRKSQDCRQLEIVKQLSQFFNDPKNFRRRCSTTSTSSSPSFHDAASQVLYRL
        K ILSCL+DL+TRGLY LA +L+EDSVKLEKTRWKLKRVIREFIP+VL RKSQDC QLE  K+LSQ  ND  NFRR  S TSTSS+ SFHDAASQVLY L
Subjt:  KHILSCLNDLNTRGLYLLAIVLTEDSVKLEKTRWKLKRVIREFIPRVLRRKSQDCRQLEIVKQLSQFFNDPKNFRRRCSTTSTSSSPSFHDAASQVLYRL

Query:  GDLPTQGLLAMHRKLEGVRVMPQIKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYPFSPQIKTLHNEI
        GD+PTQ LLAM RKLEGVR +PQIK  + GWGRDRLINLLTKISKKMLSSLGEG ELQESLAKAMAVADLSLKLVPGRHN S IEFYPFSPQIKTLHNEI
Subjt:  GDLPTQGLLAMHRKLEGVRVMPQIKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYPFSPQIKTLHNEI

Query:  VKAIWFVRNKCNIQKLKQLKSLLDPDAKVSNRSLRTAIKKMLIDYLFECSDMDSMPKSLLKALAIISVDSRSAPHSFSSQDEIEEEVECVFTLSAQMKQV
        VKAIWFVR  CNI+KLK+LK LLDPDA+VS+R LR  IK+ML DYLFECSDMD++PKSLLKALA+I  DSRSAPHS  SQDEI EEVE VF+LSAQMKQV
Subjt:  VKAIWFVRNKCNIQKLKQLKSLLDPDAKVSNRSLRTAIKKMLIDYLFECSDMDSMPKSLLKALAIISVDSRSAPHSFSSQDEIEEEVECVFTLSAQMKQV

Query:  VWDLLPNCDFEHDFADAYMEELEESDDDFDDNDDDSCDGLPREDNG--SVYVEGMGESMPANLDHSSAGNAMSPSQASMKNEDMEPFQCSEP
        VWDLLPNCDFEHDFADAYMEELEESDDD+ DNDD+  DGLP+ED+G  SV+VEGMGESMPANLD++S GN +SPSQAS+KN D+EPF+CS+P
Subjt:  VWDLLPNCDFEHDFADAYMEELEESDDDFDDNDDDSCDGLPREDNG--SVYVEGMGESMPANLDHSSAGNAMSPSQASMKNEDMEPFQCSEP

XP_038890245.1 uncharacterized protein LOC120079870 isoform X1 [Benincasa hispida]; e value: 7.2e-225; % identity: 83.16
Query:  MEFYQTDSQALLLQIRRQEDLLKSKRRWLLGLPTSIGQ-KHSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGALNVETRHLGIRADQILDTCKVR
        MEFYQTDSQALL+QIRRQE LLKSKRRWLLGLPTS+ + K+SDHSDFLNKRNLPESLLREDDVFYETVKTR+EEAFG LNVETRHLGIRA++ILDTCKV 
Subjt:  MEFYQTDSQALLLQIRRQEDLLKSKRRWLLGLPTSIGQ-KHSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGALNVETRHLGIRADQILDTCKVR

Query:  KHILSCLNDLNTRGLYLLAIVLTEDSVKLEKTRWKLKRVIREFIPRVLRRKSQDCRQLEIVKQLSQFFNDPKNFRRRCSTTSTSSSPSFHDAASQVLYRL
        K I+SCLNDL+ RGLYLLAI+LTEDSV+LEKTRWKLKR I+EFIP+VLRRKS+DCRQLE+VK LSQ FND KNFRRRCS T TSSS S HDA SQVLY L
Subjt:  KHILSCLNDLNTRGLYLLAIVLTEDSVKLEKTRWKLKRVIREFIPRVLRRKSQDCRQLEIVKQLSQFFNDPKNFRRRCSTTSTSSSPSFHDAASQVLYRL

Query:  GDLPTQGLLAMHRKLEGVRVMPQIKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYPFSPQIKTLHNEI
        GDLPTQ LLAM RKLEGVR MPQ+KRHRHGWGRDRLINLLTKIS+KMLSS+GEGDELQESLAKAMAVADLS KLVPGRHNSS IEFYPFSPQIKTLHNEI
Subjt:  GDLPTQGLLAMHRKLEGVRVMPQIKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYPFSPQIKTLHNEI

Query:  VKAIWFVRNKCNIQKLKQLKSLLDPDAKVSNRSLRTAIKKMLIDYLFECSDMDSMPKSLLKALAIISVDSRSAPHSFSSQDEIEEEVECVFTLSAQMKQV
        VKAIWFVR K N QKLKQLKSLLDPDAKVS+R+LR +IK MLIDYLFECSDMD++PKSLLKALA+++ DSRSA  S  SQDEIEE+ ECVF+LSAQMKQV
Subjt:  VKAIWFVRNKCNIQKLKQLKSLLDPDAKVSNRSLRTAIKKMLIDYLFECSDMDSMPKSLLKALAIISVDSRSAPHSFSSQDEIEEEVECVFTLSAQMKQV

Query:  VWDLLPNCDFEHDFADAYMEELEESDDDFDDNDDDSCDGLPREDN--GSVYVEGMGESMPANLDHSSAGNAMSPSQASMKNEDMEPFQCSEPM
        VWDLLPNCDFEHDFADAYMEELEESDDDF++ +DDSCDG P+ED    SVYVEGMGESMPANLDHSS GN ++PSQAS+ N D+E  Q S PM
Subjt:  VWDLLPNCDFEHDFADAYMEELEESDDDFDDNDDDSCDGLPREDN--GSVYVEGMGESMPANLDHSSAGNAMSPSQASMKNEDMEPFQCSEPM

XP_038890247.1 uncharacterized protein LOC120079870 isoform X2 [Benincasa hispida]; e value: 7.2e-225; % identity: 83.16
Query:  MEFYQTDSQALLLQIRRQEDLLKSKRRWLLGLPTSIGQ-KHSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGALNVETRHLGIRADQILDTCKVR
        MEFYQTDSQALL+QIRRQE LLKSKRRWLLGLPTS+ + K+SDHSDFLNKRNLPESLLREDDVFYETVKTR+EEAFG LNVETRHLGIRA++ILDTCKV 
Subjt:  MEFYQTDSQALLLQIRRQEDLLKSKRRWLLGLPTSIGQ-KHSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGALNVETRHLGIRADQILDTCKVR

Query:  KHILSCLNDLNTRGLYLLAIVLTEDSVKLEKTRWKLKRVIREFIPRVLRRKSQDCRQLEIVKQLSQFFNDPKNFRRRCSTTSTSSSPSFHDAASQVLYRL
        K I+SCLNDL+ RGLYLLAI+LTEDSV+LEKTRWKLKR I+EFIP+VLRRKS+DCRQLE+VK LSQ FND KNFRRRCS T TSSS S HDA SQVLY L
Subjt:  KHILSCLNDLNTRGLYLLAIVLTEDSVKLEKTRWKLKRVIREFIPRVLRRKSQDCRQLEIVKQLSQFFNDPKNFRRRCSTTSTSSSPSFHDAASQVLYRL

Query:  GDLPTQGLLAMHRKLEGVRVMPQIKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYPFSPQIKTLHNEI
        GDLPTQ LLAM RKLEGVR MPQ+KRHRHGWGRDRLINLLTKIS+KMLSS+GEGDELQESLAKAMAVADLS KLVPGRHNSS IEFYPFSPQIKTLHNEI
Subjt:  GDLPTQGLLAMHRKLEGVRVMPQIKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYPFSPQIKTLHNEI

Query:  VKAIWFVRNKCNIQKLKQLKSLLDPDAKVSNRSLRTAIKKMLIDYLFECSDMDSMPKSLLKALAIISVDSRSAPHSFSSQDEIEEEVECVFTLSAQMKQV
        VKAIWFVR K N QKLKQLKSLLDPDAKVS+R+LR +IK MLIDYLFECSDMD++PKSLLKALA+++ DSRSA  S  SQDEIEE+ ECVF+LSAQMKQV
Subjt:  VKAIWFVRNKCNIQKLKQLKSLLDPDAKVSNRSLRTAIKKMLIDYLFECSDMDSMPKSLLKALAIISVDSRSAPHSFSSQDEIEEEVECVFTLSAQMKQV

Query:  VWDLLPNCDFEHDFADAYMEELEESDDDFDDNDDDSCDGLPREDN--GSVYVEGMGESMPANLDHSSAGNAMSPSQASMKNEDMEPFQCSEPM
        VWDLLPNCDFEHDFADAYMEELEESDDDF++ +DDSCDG P+ED    SVYVEGMGESMPANLDHSS GN ++PSQAS+ N D+E  Q S PM
Subjt:  VWDLLPNCDFEHDFADAYMEELEESDDDFDDNDDDSCDGLPREDN--GSVYVEGMGESMPANLDHSSAGNAMSPSQASMKNEDMEPFQCSEPM

XP_038890248.1 uncharacterized protein LOC120079870 isoform X3 [Benincasa hispida]; e value: 7.2e-225; % identity: 83.16
Query:  MEFYQTDSQALLLQIRRQEDLLKSKRRWLLGLPTSIGQ-KHSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGALNVETRHLGIRADQILDTCKVR
        MEFYQTDSQALL+QIRRQE LLKSKRRWLLGLPTS+ + K+SDHSDFLNKRNLPESLLREDDVFYETVKTR+EEAFG LNVETRHLGIRA++ILDTCKV 
Subjt:  MEFYQTDSQALLLQIRRQEDLLKSKRRWLLGLPTSIGQ-KHSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGALNVETRHLGIRADQILDTCKVR

Query:  KHILSCLNDLNTRGLYLLAIVLTEDSVKLEKTRWKLKRVIREFIPRVLRRKSQDCRQLEIVKQLSQFFNDPKNFRRRCSTTSTSSSPSFHDAASQVLYRL
        K I+SCLNDL+ RGLYLLAI+LTEDSV+LEKTRWKLKR I+EFIP+VLRRKS+DCRQLE+VK LSQ FND KNFRRRCS T TSSS S HDA SQVLY L
Subjt:  KHILSCLNDLNTRGLYLLAIVLTEDSVKLEKTRWKLKRVIREFIPRVLRRKSQDCRQLEIVKQLSQFFNDPKNFRRRCSTTSTSSSPSFHDAASQVLYRL

Query:  GDLPTQGLLAMHRKLEGVRVMPQIKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYPFSPQIKTLHNEI
        GDLPTQ LLAM RKLEGVR MPQ+KRHRHGWGRDRLINLLTKIS+KMLSS+GEGDELQESLAKAMAVADLS KLVPGRHNSS IEFYPFSPQIKTLHNEI
Subjt:  GDLPTQGLLAMHRKLEGVRVMPQIKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYPFSPQIKTLHNEI

Query:  VKAIWFVRNKCNIQKLKQLKSLLDPDAKVSNRSLRTAIKKMLIDYLFECSDMDSMPKSLLKALAIISVDSRSAPHSFSSQDEIEEEVECVFTLSAQMKQV
        VKAIWFVR K N QKLKQLKSLLDPDAKVS+R+LR +IK MLIDYLFECSDMD++PKSLLKALA+++ DSRSA  S  SQDEIEE+ ECVF+LSAQMKQV
Subjt:  VKAIWFVRNKCNIQKLKQLKSLLDPDAKVSNRSLRTAIKKMLIDYLFECSDMDSMPKSLLKALAIISVDSRSAPHSFSSQDEIEEEVECVFTLSAQMKQV

Query:  VWDLLPNCDFEHDFADAYMEELEESDDDFDDNDDDSCDGLPREDN--GSVYVEGMGESMPANLDHSSAGNAMSPSQASMKNEDMEPFQCSEPM
        VWDLLPNCDFEHDFADAYMEELEESDDDF++ +DDSCDG P+ED    SVYVEGMGESMPANLDHSS GN ++PSQAS+ N D+E  Q S PM
Subjt:  VWDLLPNCDFEHDFADAYMEELEESDDDFDDNDDDSCDGLPREDN--GSVYVEGMGESMPANLDHSSAGNAMSPSQASMKNEDMEPFQCSEPM

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C3N7 uncharacterized protein LOC111007960; e value: 9.5e-223; % identity: 83.99
Query:  MEFYQTDSQALLLQIRRQEDLLKSKRRWLLGLPTSI-GQKHSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGALNVETRHLGIRADQILDTCKVR
        MEF QTDS+A+ LQIRRQE+LLK KRRWLLGLPTS  GQK SDHSDFLNKRNLPE LLREDDVFYETVKTR+EEAFGALNVETRH GI+ D+I DTCK+ 
Subjt:  MEFYQTDSQALLLQIRRQEDLLKSKRRWLLGLPTSI-GQKHSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGALNVETRHLGIRADQILDTCKVR

Query:  KHILSCLNDLNTRGLYLLAIVLTEDSVKLEKTRWKLKRVIREFIPRVLRRKSQDCRQLEIVKQLSQFFNDPKNFRRRCSTTSTSSSPSFHDAASQVLYRL
        K ILSCL+DL+TRGLYLLAI+LT+DSVKLEKTRWKLKRV+REFIP VLRRKSQDC QLE+VK+LSQ  NDP NFRRRCS T TSSSPS+HDAASQVLYRL
Subjt:  KHILSCLNDLNTRGLYLLAIVLTEDSVKLEKTRWKLKRVIREFIPRVLRRKSQDCRQLEIVKQLSQFFNDPKNFRRRCSTTSTSSSPSFHDAASQVLYRL

Query:  GDLPTQGLLAMHRKLEGVRVMPQIKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYPFSPQIKTLHNEI
        GDLPTQGLLAM RKLEGVRVMPQIKRHRHGWGRDRLIN+LT+ S KMLSS GEGDELQESLAKAMAVADLSLKLVPG HNSS IEFYPF PQIKTLHNEI
Subjt:  GDLPTQGLLAMHRKLEGVRVMPQIKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYPFSPQIKTLHNEI

Query:  VKAIWFVRNKCNIQKLKQLKSLLDPDAKVSNRSLRTAIKKMLIDYLFECSDMDSMPKSLLKALAIISVDSRSAPHSFSSQDEIEEEVECVFTLSAQMKQV
        VKAIW VR K N QKLKQLKSLLDPDAKVSNR LRTAIKKMLIDYLFECSDMD++PKSLLKALAII+ DSR+A +SF S +EIE+EVECVF+LSAQMKQV
Subjt:  VKAIWFVRNKCNIQKLKQLKSLLDPDAKVSNRSLRTAIKKMLIDYLFECSDMDSMPKSLLKALAIISVDSRSAPHSFSSQDEIEEEVECVFTLSAQMKQV

Query:  VWDLLPNCDFEHDFADAYMEELEESDDDFDDNDDDSCDGLPREDNGS------VYVEGMGESMPANLDHSSAGNAMSPSQA
        VWDLLPNCDFEHDFADAYMEELEESDDDFDDND+D+CDGLP +DNGS       +VEGMGESMPANL+HSS GN +SPS A
Subjt:  VWDLLPNCDFEHDFADAYMEELEESDDDFDDNDDDSCDGLPREDNGS------VYVEGMGESMPANLDHSSAGNAMSPSQA

A0A6J1GJE0 uncharacterized protein LOC111454828 isoform X1; e value: 5.4e-218; % identity: 81.3
Query:  MEFYQTDSQALLLQIRRQEDLLKSKRRWLLGLPTSI-GQKHSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGALNVETRHLGIRADQILDTCKVR
        MEFYQTDSQALL Q+RRQE+LLKSKRRWLLGLPTSI G K+SDHSD LNKRNLPESLLREDDVF+ TVKTR+EEAFG LN+ETRHLGIRADQILDTCKVR
Subjt:  MEFYQTDSQALLLQIRRQEDLLKSKRRWLLGLPTSI-GQKHSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGALNVETRHLGIRADQILDTCKVR

Query:  KHILSCLNDLNTRGLYLLAIVLTEDSVKLEKTRWKLKRVIREFIPRVLRRKSQDCRQLEIVKQLSQFFNDPKNFRRRCSTTSTSSSPSFHDAASQVLYRL
        K ILSCL+DL+TRGLY LA +L+EDSVKLEKTRWKLKRVIREFIP+VL RKSQDC QLE  K+LSQ  ND  NFRR  S TSTSS+ SFHDAASQVLY L
Subjt:  KHILSCLNDLNTRGLYLLAIVLTEDSVKLEKTRWKLKRVIREFIPRVLRRKSQDCRQLEIVKQLSQFFNDPKNFRRRCSTTSTSSSPSFHDAASQVLYRL

Query:  GDLPTQGLLAMHRKLEGVRVMPQIKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYPFSPQIKTLHNEI
        GD+PTQ LLAM RKLEGVR +PQIK  + GWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSS IEFYPFSPQIKTLHNEI
Subjt:  GDLPTQGLLAMHRKLEGVRVMPQIKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYPFSPQIKTLHNEI

Query:  VKAIWFVRNKCNIQKLKQLKSLLDPDAKVSNRSLRTAIKKMLIDYLFECSDMDSMPKSLLKALAIISVDSRSAPHSFSSQDEIEEEVECVFTLSAQMKQV
        VKAIWFV   CNI+KLK+LK LLDPDA+VS+R LR  IK+ML DYLFECSDMD++PKSLLKALA+I  DSRSAPHS   QDEI +EVE VF+LSAQMKQV
Subjt:  VKAIWFVRNKCNIQKLKQLKSLLDPDAKVSNRSLRTAIKKMLIDYLFECSDMDSMPKSLLKALAIISVDSRSAPHSFSSQDEIEEEVECVFTLSAQMKQV

Query:  VWDLLPNCDFEHDFADAYMEELEESDDDFDDNDDDSCDGLPREDNG--SVYVEGMGESMPANLDHSSAGNAMSPSQASMKNEDMEPFQCSEP
        VWDLLPNCDFEHDF DAYMEELEESDDD+DDNDDD  DGLP+ED+G  SV+VEGMGESMPANLD++S GN +SPSQAS+KN D++  +CS+P
Subjt:  VWDLLPNCDFEHDFADAYMEELEESDDDFDDNDDDSCDGLPREDNG--SVYVEGMGESMPANLDHSSAGNAMSPSQASMKNEDMEPFQCSEP

A0A6J1GJF1 uncharacterized protein LOC111454828 isoform X2; e value: 5.4e-218; % identity: 81.3
Query:  MEFYQTDSQALLLQIRRQEDLLKSKRRWLLGLPTSI-GQKHSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGALNVETRHLGIRADQILDTCKVR
        MEFYQTDSQALL Q+RRQE+LLKSKRRWLLGLPTSI G K+SDHSD LNKRNLPESLLREDDVF+ TVKTR+EEAFG LN+ETRHLGIRADQILDTCKVR
Subjt:  MEFYQTDSQALLLQIRRQEDLLKSKRRWLLGLPTSI-GQKHSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGALNVETRHLGIRADQILDTCKVR

Query:  KHILSCLNDLNTRGLYLLAIVLTEDSVKLEKTRWKLKRVIREFIPRVLRRKSQDCRQLEIVKQLSQFFNDPKNFRRRCSTTSTSSSPSFHDAASQVLYRL
        K ILSCL+DL+TRGLY LA +L+EDSVKLEKTRWKLKRVIREFIP+VL RKSQDC QLE  K+LSQ  ND  NFRR  S TSTSS+ SFHDAASQVLY L
Subjt:  KHILSCLNDLNTRGLYLLAIVLTEDSVKLEKTRWKLKRVIREFIPRVLRRKSQDCRQLEIVKQLSQFFNDPKNFRRRCSTTSTSSSPSFHDAASQVLYRL

Query:  GDLPTQGLLAMHRKLEGVRVMPQIKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYPFSPQIKTLHNEI
        GD+PTQ LLAM RKLEGVR +PQIK  + GWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSS IEFYPFSPQIKTLHNEI
Subjt:  GDLPTQGLLAMHRKLEGVRVMPQIKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYPFSPQIKTLHNEI

Query:  VKAIWFVRNKCNIQKLKQLKSLLDPDAKVSNRSLRTAIKKMLIDYLFECSDMDSMPKSLLKALAIISVDSRSAPHSFSSQDEIEEEVECVFTLSAQMKQV
        VKAIWFV   CNI+KLK+LK LLDPDA+VS+R LR  IK+ML DYLFECSDMD++PKSLLKALA+I  DSRSAPHS   QDEI +EVE VF+LSAQMKQV
Subjt:  VKAIWFVRNKCNIQKLKQLKSLLDPDAKVSNRSLRTAIKKMLIDYLFECSDMDSMPKSLLKALAIISVDSRSAPHSFSSQDEIEEEVECVFTLSAQMKQV

Query:  VWDLLPNCDFEHDFADAYMEELEESDDDFDDNDDDSCDGLPREDNG--SVYVEGMGESMPANLDHSSAGNAMSPSQASMKNEDMEPFQCSEP
        VWDLLPNCDFEHDF DAYMEELEESDDD+DDNDDD  DGLP+ED+G  SV+VEGMGESMPANLD++S GN +SPSQAS+KN D++  +CS+P
Subjt:  VWDLLPNCDFEHDFADAYMEELEESDDDFDDNDDDSCDGLPREDNG--SVYVEGMGESMPANLDHSSAGNAMSPSQASMKNEDMEPFQCSEP

A0A6J1HWI3 uncharacterized protein LOC111468137 isoform X1; e value: 4.4e-220; % identity: 81.71
Query:  MEFYQTDSQALLLQIRRQEDLLKSKRRWLLGLPTSI-GQKHSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGALNVETRHLGIRADQILDTCKVR
        MEFYQTDSQALL Q+RRQE+LLKSKRRWLLGLPTS+ G K+SDHSD LNKRNLPESLLREDDVF+ TVKTR+EEAFG LN+ETRHLGIRADQILDTCKVR
Subjt:  MEFYQTDSQALLLQIRRQEDLLKSKRRWLLGLPTSI-GQKHSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGALNVETRHLGIRADQILDTCKVR

Query:  KHILSCLNDLNTRGLYLLAIVLTEDSVKLEKTRWKLKRVIREFIPRVLRRKSQDCRQLEIVKQLSQFFNDPKNFRRRCSTTSTSSSPSFHDAASQVLYRL
        K ILSCL+DL+TRGLY LA +L+EDSVKLEKTRWKLKRVIREFIP+VL RKSQDC QLE  K+LSQ  ND  NFRR  S TSTSS+ SFHDAASQVLY L
Subjt:  KHILSCLNDLNTRGLYLLAIVLTEDSVKLEKTRWKLKRVIREFIPRVLRRKSQDCRQLEIVKQLSQFFNDPKNFRRRCSTTSTSSSPSFHDAASQVLYRL

Query:  GDLPTQGLLAMHRKLEGVRVMPQIKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYPFSPQIKTLHNEI
        GD+PTQ LLAM RKLEGVR +PQIK  + GWGRDRLINLLTKISKKMLSSLGEG ELQESLAKAMAVADLSLKLVPGRHN S IEFYPFSPQIKTLHNEI
Subjt:  GDLPTQGLLAMHRKLEGVRVMPQIKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYPFSPQIKTLHNEI

Query:  VKAIWFVRNKCNIQKLKQLKSLLDPDAKVSNRSLRTAIKKMLIDYLFECSDMDSMPKSLLKALAIISVDSRSAPHSFSSQDEIEEEVECVFTLSAQMKQV
        VKAIWFVR  CNI+KLK+LK LLDPDA+VS+R LR  IK+ML DYLFECSDMD++PKSLLKALA+I  DSRSAPHS  SQDEI EEVE VF+LSAQMKQV
Subjt:  VKAIWFVRNKCNIQKLKQLKSLLDPDAKVSNRSLRTAIKKMLIDYLFECSDMDSMPKSLLKALAIISVDSRSAPHSFSSQDEIEEEVECVFTLSAQMKQV

Query:  VWDLLPNCDFEHDFADAYMEELEESDDDFDDNDDDSCDGLPREDNG--SVYVEGMGESMPANLDHSSAGNAMSPSQASMKNEDMEPFQCSEP
        VWDLLPNCDFEHDFADAYMEELEESDDD+ DNDD+  DGLP+ED+G  SV+VEGMGESMPANLD++S GN +SPSQAS+KN D+EPF+CS+P
Subjt:  VWDLLPNCDFEHDFADAYMEELEESDDDFDDNDDDSCDGLPREDNG--SVYVEGMGESMPANLDHSSAGNAMSPSQASMKNEDMEPFQCSEP

A0A6J1HZR8 uncharacterized protein LOC111468137 isoform X2; e value: 4.4e-220; % identity: 81.71
Query:  MEFYQTDSQALLLQIRRQEDLLKSKRRWLLGLPTSI-GQKHSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGALNVETRHLGIRADQILDTCKVR
        MEFYQTDSQALL Q+RRQE+LLKSKRRWLLGLPTS+ G K+SDHSD LNKRNLPESLLREDDVF+ TVKTR+EEAFG LN+ETRHLGIRADQILDTCKVR
Subjt:  MEFYQTDSQALLLQIRRQEDLLKSKRRWLLGLPTSI-GQKHSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGALNVETRHLGIRADQILDTCKVR

Query:  KHILSCLNDLNTRGLYLLAIVLTEDSVKLEKTRWKLKRVIREFIPRVLRRKSQDCRQLEIVKQLSQFFNDPKNFRRRCSTTSTSSSPSFHDAASQVLYRL
        K ILSCL+DL+TRGLY LA +L+EDSVKLEKTRWKLKRVIREFIP+VL RKSQDC QLE  K+LSQ  ND  NFRR  S TSTSS+ SFHDAASQVLY L
Subjt:  KHILSCLNDLNTRGLYLLAIVLTEDSVKLEKTRWKLKRVIREFIPRVLRRKSQDCRQLEIVKQLSQFFNDPKNFRRRCSTTSTSSSPSFHDAASQVLYRL

Query:  GDLPTQGLLAMHRKLEGVRVMPQIKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYPFSPQIKTLHNEI
        GD+PTQ LLAM RKLEGVR +PQIK  + GWGRDRLINLLTKISKKMLSSLGEG ELQESLAKAMAVADLSLKLVPGRHN S IEFYPFSPQIKTLHNEI
Subjt:  GDLPTQGLLAMHRKLEGVRVMPQIKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYPFSPQIKTLHNEI

Query:  VKAIWFVRNKCNIQKLKQLKSLLDPDAKVSNRSLRTAIKKMLIDYLFECSDMDSMPKSLLKALAIISVDSRSAPHSFSSQDEIEEEVECVFTLSAQMKQV
        VKAIWFVR  CNI+KLK+LK LLDPDA+VS+R LR  IK+ML DYLFECSDMD++PKSLLKALA+I  DSRSAPHS  SQDEI EEVE VF+LSAQMKQV
Subjt:  VKAIWFVRNKCNIQKLKQLKSLLDPDAKVSNRSLRTAIKKMLIDYLFECSDMDSMPKSLLKALAIISVDSRSAPHSFSSQDEIEEEVECVFTLSAQMKQV

Query:  VWDLLPNCDFEHDFADAYMEELEESDDDFDDNDDDSCDGLPREDNG--SVYVEGMGESMPANLDHSSAGNAMSPSQASMKNEDMEPFQCSEP
        VWDLLPNCDFEHDFADAYMEELEESDDD+ DNDD+  DGLP+ED+G  SV+VEGMGESMPANLD++S GN +SPSQAS+KN D+EPF+CS+P
Subjt:  VWDLLPNCDFEHDFADAYMEELEESDDDFDDNDDDSCDGLPREDNG--SVYVEGMGESMPANLDHSSAGNAMSPSQASMKNEDMEPFQCSEP

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G40520.1 unknown protein; e value: 2.9e-75; % identity: 47.35
Query:  IVLTEDSVKLEKTRWKLKRVIREFIPRVLRRKSQDCRQLEIVKQLSQFFNDPKNFRRRC--STTSTSSSPSFHDAASQVLYRLGDLPTQGLLAMHRKLEG
        ++LT  S   +KTR K+K +IR+ + R   +  +   + EI+ QL Q  +DP NFR  C  +   T +  S  DAA +VL  L  L TQ L AM RKL+G
Subjt:  IVLTEDSVKLEKTRWKLKRVIREFIPRVLRRKSQDCRQLEIVKQLSQFFNDPKNFRRRC--STTSTSSSPSFHDAASQVLYRLGDLPTQGLLAMHRKLEG

Query:  VRVMPQIKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYPFSPQIKTLHNEIVKAIWFVRNKCNIQKLK
         R++PQ+K  R G  R  LIN + + S+KMLS L  GD+LQE LAKA++V DLSLKL PG   ++  +F+ FSP+ K L NEIVKA+W +R K   ++LK
Subjt:  VRVMPQIKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYPFSPQIKTLHNEIVKAIWFVRNKCNIQKLK

Query:  QLKSLLDPDAKVSNRSLRTAIKKMLIDYLFECSDMDSMPKSLLKALAIISVDSRSAPHSFSSQDEIEEEVECVFTLSAQMKQVVWDLLPNCDFEHDFADA
        +L   LDP+A+VSN SLR+A++K LI+YLFECSD+D++PKSL++AL++++  + +  H    ++ IEEE EC+  +SAQ+KQ+    +PN + + DF DA
Subjt:  QLKSLLDPDAKVSNRSLRTAIKKMLIDYLFECSDMDSMPKSLLKALAIISVDSRSAPHSFSSQDEIEEEVECVFTLSAQMKQVVWDLLPNCDFEHDFADA

Query:  YMEELEESDDDFDDNDDDSCD
        YME+LE+SDD+ DD+DDD  D
Subjt:  YMEELEESDDDFDDNDDDSCD

AT5G40520.2 unknown protein; e value: 2.1e-97; % identity: 45.54
Query:  FYQTDSQALLLQIRRQEDLLKSKRRWLLGLPTSIGQKHSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGALNVETRHLGIRADQILDTCKVRKHI
        F++TD  ++L QI+ Q++ ++ KRRWLLG   S   K  DH+       +PESLLREDD+FYET+K+R+EEAFG    +         + L  C +   +
Subjt:  FYQTDSQALLLQIRRQEDLLKSKRRWLLGLPTSIGQKHSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGALNVETRHLGIRADQILDTCKVRKHI

Query:  LSCLNDLNTRGLYLLAIVLTEDSVKLEKTRWKLKRVIREFIPRVLRRKSQDCRQLEIVKQLSQFFNDPKNFRRRC--STTSTSSSPSFHDAASQVLYRLG
        +  L+ L  +GLYL+A++LT  S   +KTR K+K +IR+ + R   +  +   + EI+ QL Q  +DP NFR  C  +   T +  S  DAA +VL  L 
Subjt:  LSCLNDLNTRGLYLLAIVLTEDSVKLEKTRWKLKRVIREFIPRVLRRKSQDCRQLEIVKQLSQFFNDPKNFRRRC--STTSTSSSPSFHDAASQVLYRLG

Query:  DLPTQGLLAMHRKLEGVRVMPQIKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYPFSPQIKTLHNEIV
         L TQ L AM RKL+G R++PQ+K  R G  R  LIN + + S+KMLS L  GD+LQE LAKA++V DLSLKL PG   ++  +F+ FSP+ K L NEIV
Subjt:  DLPTQGLLAMHRKLEGVRVMPQIKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYPFSPQIKTLHNEIV

Query:  KAIWFVRNKCNIQKLKQLKSLLDPDAKVSNRSLRTAIKKMLIDYLFECSDMDSMPKSLLKALAIISVDSRSAPHSFSSQDEIEEEVECVFTLSAQMKQVV
        KA+W +R K   ++LK+L   LDP+A+VSN SLR+A++K LI+YLFECSD+D++PKSL++AL++++  + +  H    ++ IEEE EC+  +SAQ+KQ+ 
Subjt:  KAIWFVRNKCNIQKLKQLKSLLDPDAKVSNRSLRTAIKKMLIDYLFECSDMDSMPKSLLKALAIISVDSRSAPHSFSSQDEIEEEVECVFTLSAQMKQVV

Query:  WDLLPNCDFEHDFADAYMEELEESDDDFDDNDDDSCD
           +PN + + DF DAYME+LE+SDD+ DD+DDD  D
Subjt:  WDLLPNCDFEHDFADAYMEELEESDDDFDDNDDDSCD


Sequences
CDS sequence
ATGGAGTTTTATCAGACGGATTCGCAAGCACTTTTGTTACAGATTCGACGCCAAGAGGATCTTTTGAAATCTAAAAGAAGATGGCTGCTGGGCCTTCCTACATCT
ATTGGACAGAAGCATTCAGATCATTCAGACTTTTTGAATAAACGAAACTTGCCTGAATCGTTGCTACGGGAAGATGATGTTTTCTATGAGACTGTCAAAACAAGA
ATTGAAGAAGCTTTTGGAGCGTTAAATGTTGAAACAAGGCATCTTGGTATTCGAGCTGATCAAATATTAGATACTTGCAAAGTTAGAAAACACATCTTGTCATGT
CTTAATGATCTGAACACCAGAGGACTTTACCTTCTTGCTATAGTACTTACGGAAGACTCTGTCAAATTGGAAAAAACTCGCTGGAAGCTGAAAAGGGTCATCAGA
GAATTTATTCCAAGAGTTCTGAGAAGGAAAAGTCAAGATTGCCGTCAATTAGAGATTGTTAAACAATTGTCTCAATTTTTCAACGACCCAAAAAATTTCCGAAGA
AGATGCTCAACAACTTCGACATCAAGTTCGCCATCTTTCCATGATGCAGCATCACAGGTACTCTATAGATTAGGAGACCTGCCCACCCAAGGTCTCTTAGCTATG
CATCGAAAACTTGAAGGAGTTCGAGTTATGCCTCAGATAAAACGCCACAGGCATGGGTGGGGTCGTGATCGTCTTATTAATCTTCTTACCAAAATTAGTAAGAAG
ATGCTTTCATCACTTGGTGAAGGAGATGAATTGCAAGAATCACTAGCAAAAGCCATGGCGGTGGCTGATTTATCACTTAAACTAGTACCAGGTCGCCATAATTCA
TCCGAAATTGAGTTTTATCCCTTCTCACCCCAAATAAAAACCTTGCACAATGAAATAGTAAAAGCCATATGGTTTGTTAGAAACAAGTGTAACATTCAGAAGCTC
AAACAGTTAAAGTCTTTGTTGGATCCTGATGCTAAAGTGTCGAATAGGAGTCTAAGAACGGCTATTAAGAAGATGTTAATAGACTATCTTTTTGAGTGTAGTGAT
ATGGACAGTATGCCAAAGTCTCTTTTGAAAGCTTTAGCTATAATAAGTGTAGATTCTCGAAGTGCACCACATTCATTTTCCTCACAAGATGAAATTGAGGAGGAG
GTTGAATGTGTATTTACTTTGAGTGCTCAGATGAAACAAGTAGTTTGGGATTTACTACCTAATTGTGACTTTGAACACGACTTTGCTGATGCATATATGGAAGAG
TTAGAAGAAAGTGATGATGATTTTGATGATAATGACGATGATAGTTGTGATGGTTTGCCTCGAGAAGACAATGGCTCCGTTTATGTTGAAGGTATGGGGGAATCA
ATGCCAGCCAATCTGGATCATTCATCAGCGGGGAATGCCATGTCCCCTAGTCAGGCATCCATGAAAAATGAAGATATGGAGCCTTTTCAATGTTCTGAGCCTATG
CNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNATTT
TACTAG
mRNA sequence
ATGGAGTTTTATCAGACGGATTCGCAAGCACTTTTGTTACAGATTCGACGCCAAGAGGATCTTTTGAAATCTAAAAGAAGATGGCTGCTGGGCCTTCCTACATCT
ATTGGACAGAAGCATTCAGATCATTCAGACTTTTTGAATAAACGAAACTTGCCTGAATCGTTGCTACGGGAAGATGATGTTTTCTATGAGACTGTCAAAACAAGA
ATTGAAGAAGCTTTTGGAGCGTTAAATGTTGAAACAAGGCATCTTGGTATTCGAGCTGATCAAATATTAGATACTTGCAAAGTTAGAAAACACATCTTGTCATGT
CTTAATGATCTGAACACCAGAGGACTTTACCTTCTTGCTATAGTACTTACGGAAGACTCTGTCAAATTGGAAAAAACTCGCTGGAAGCTGAAAAGGGTCATCAGA
GAATTTATTCCAAGAGTTCTGAGAAGGAAAAGTCAAGATTGCCGTCAATTAGAGATTGTTAAACAATTGTCTCAATTTTTCAACGACCCAAAAAATTTCCGAAGA
AGATGCTCAACAACTTCGACATCAAGTTCGCCATCTTTCCATGATGCAGCATCACAGGTACTCTATAGATTAGGAGACCTGCCCACCCAAGGTCTCTTAGCTATG
CATCGAAAACTTGAAGGAGTTCGAGTTATGCCTCAGATAAAACGCCACAGGCATGGGTGGGGTCGTGATCGTCTTATTAATCTTCTTACCAAAATTAGTAAGAAG
ATGCTTTCATCACTTGGTGAAGGAGATGAATTGCAAGAATCACTAGCAAAAGCCATGGCGGTGGCTGATTTATCACTTAAACTAGTACCAGGTCGCCATAATTCA
TCCGAAATTGAGTTTTATCCCTTCTCACCCCAAATAAAAACCTTGCACAATGAAATAGTAAAAGCCATATGGTTTGTTAGAAACAAGTGTAACATTCAGAAGCTC
AAACAGTTAAAGTCTTTGTTGGATCCTGATGCTAAAGTGTCGAATAGGAGTCTAAGAACGGCTATTAAGAAGATGTTAATAGACTATCTTTTTGAGTGTAGTGAT
ATGGACAGTATGCCAAAGTCTCTTTTGAAAGCTTTAGCTATAATAAGTGTAGATTCTCGAAGTGCACCACATTCATTTTCCTCACAAGATGAAATTGAGGAGGAG
GTTGAATGTGTATTTACTTTGAGTGCTCAGATGAAACAAGTAGTTTGGGATTTACTACCTAATTGTGACTTTGAACACGACTTTGCTGATGCATATATGGAAGAG
TTAGAAGAAAGTGATGATGATTTTGATGATAATGACGATGATAGTTGTGATGGTTTGCCTCGAGAAGACAATGGCTCCGTTTATGTTGAAGGTATGGGGGAATCA
ATGCCAGCCAATCTGGATCATTCATCAGCGGGGAATGCCATGTCCCCTAGTCAGGCATCCATGAAAAATGAAGATATGGAGCCTTTTCAATGTTCTGAGCCTATG
CNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNATTT
TACTAG
Protein sequence
MEFYQTDSQALLLQIRRQEDLLKSKRRWLLGLPTSIGQKHSDHSDFLNKRNLPESLLREDDVFYETVKTRIEEAFGALNVETRHLGIRADQILDTCKVRKHILSC
LNDLNTRGLYLLAIVLTEDSVKLEKTRWKLKRVIREFIPRVLRRKSQDCRQLEIVKQLSQFFNDPKNFRRRCSTTSTSSSPSFHDAASQVLYRLGDLPTQGLLAM
HRKLEGVRVMPQIKRHRHGWGRDRLINLLTKISKKMLSSLGEGDELQESLAKAMAVADLSLKLVPGRHNSSEIEFYPFSPQIKTLHNEIVKAIWFVRNKCNIQKL
KQLKSLLDPDAKVSNRSLRTAIKKMLIDYLFECSDMDSMPKSLLKALAIISVDSRSAPHSFSSQDEIEEEVECVFTLSAQMKQVVWDLLPNCDFEHDFADAYMEE
LEESDDDFDDNDDDSCDGLPREDNGSVYVEGMGESMPANLDHSSAGNAMSPSQASMKNEDMEPFQCSEPMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXF
Y