; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr015259 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr015259
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionUncharacterised conserved protein (UCP030210)
Genome locationtig00003412:146058..160903
RNA-Seq ExpressionSgr015259
SyntenySgr015259
Gene Ontology termsNA
InterPro domainsIPR002791 - Damage-control phosphatase ARMT1-like, metal-binding domain
IPR035073 - At2g17340, three-helix bundle domain
IPR036075 - AF1104-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022136208.1 uncharacterized protein LOC111007960 [Momordica charantia]1.1e-24171.71Show/hide
Query:  MEFYQTDSEAILLQIRRQEEKLKSKRRWLLGLPTSVSGQKYSDHSDFLNKRNLPESLLREDDVFYETVKTRVEEAFGALNVETRHLGIQADRLFDTCKII
        MEF QTDSEAI LQIRRQEE LK KRRWLLGLPTS SGQK SDHSDFLNKRNLPE LLREDDVFYETVKTRVEEAFGALNVETRH GIQ DR+FDTCKI 
Subjt:  MEFYQTDSEAILLQIRRQEEKLKSKRRWLLGLPTSVSGQKYSDHSDFLNKRNLPESLLREDDVFYETVKTRVEEAFGALNVETRHLGIQADRLFDTCKII

Query:  KLILSYLDDLSTRGLYLLAIILTEDSVKFEKTRWKLKRVIREFLPDLLRRKSQDCHQLEIVKQLSQLVNDPKNFRRRWPVTLTSSSPSYYDAASQVLNRL
        KLILS LDDLSTRGLYLLAIILT+DSVK EKTRWKLKRV+REF+P++LRRKSQDCHQLE+VK+LSQL+NDP NFRRR  VTLTSSSPSY+DAASQVL RL
Subjt:  KLILSYLDDLSTRGLYLLAIILTEDSVKFEKTRWKLKRVIREFLPDLLRRKSQDCHQLEIVKQLSQLVNDPKNFRRRWPVTLTSSSPSYYDAASQVLNRL

Query:  GDLPTQGLLAMRRKLEGVRVMPQIKRHRHGWGRDRLINLLIETSKKMLSSLGEGDELQESLAKAMAVADLSLKLVP------------------------
        GDLPTQGLLAMRRKLEGVRVMPQIKRHRHGWGRDRLIN+L ETS KMLSS GEGDELQESLAKAMAVADLSLKLVP                        
Subjt:  GDLPTQGLLAMRRKLEGVRVMPQIKRHRHGWGRDRLINLLIETSKKMLSSLGEGDELQESLAKAMAVADLSLKLVP------------------------

Query:  --------------EAQTVKSLLDPDAKVSNRCLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPQSFFSQEEIEKEVECVFSLSAQMKQV
                      + + +KSLLDPDAKVSNRCLRTAIKKMLIDYLFECSDMDTVPKSLLKALA+INADSR+A  SFFS EEIEKEVECVFSLSAQMKQV
Subjt:  --------------EAQTVKSLLDPDAKVSNRCLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPQSFFSQEEIEKEVECVFSLSAQMKQV

Query:  VWDLLPNCDFEHDFADAYMEELEESDDDFDGNDDDTCDGLPREDNGSHSVNLHHLVEGMGESMPTNLEHSSVGNVLSPSLASLRNVDVEPFQSSEPMHFT
        VWDLLPNCDFEHDFADAYMEELEESDDDFD ND+D CDGLP +DNGSHS N+ H VEGMGESMP NLEHSSVGN LSPS A                 FT
Subjt:  VWDLLPNCDFEHDFADAYMEELEESDDDFDGNDDDTCDGLPREDNGSHSVNLHHLVEGMGESMPTNLEHSSVGNVLSPSLASLRNVDVEPFQSSEPMHFT

Query:  GEGSLDSSFNYPPPFMESKFQKDTYNLSSNQ----------QVGNKDTISVS--------------HHRDESTFRNQYLVIQEACDATSMIAYNFIGRLL
        GE  LDSSF Y P FMESKFQKDT++   +              + DT + +                   STF+NQYLVIQEACD TS+IAYN IGRLL
Subjt:  GEGSLDSSFNYPPPFMESKFQKDTYNLSSNQ----------QVGNKDTISVS--------------HHRDESTFRNQYLVIQEACDATSMIAYNFIGRLL

Query:  EEFAKSEGVELD-CA-----------------EDEQTHMKENANDSVIIQVCEELIPSLSK
        EEFAKSEGVELD CA                 EDEQTH+K N N    IQVC+ELIPSLSK
Subjt:  EEFAKSEGVELD-CA-----------------EDEQTHMKENANDSVIIQVCEELIPSLSK

XP_022952061.1 uncharacterized protein LOC111454828 isoform X1 [Cucurbita moschata]2.5e-23065.14Show/hide
Query:  MEFYQTDSEAILLQIRRQEEKLKSKRRWLLGLPTSVSGQKYSDHSDFLNKRNLPESLLREDDVFYETVKTRVEEAFGALNVETRHLGIQADRLFDTCKII
        MEFYQTDS+A+L Q+RRQEE LKSKRRWLLGLPTS+SG KYSDHSD LNKRNLPESLLREDDVF+ TVKTRVEEAFG LN+ETRHLGI+AD++ DTCK+ 
Subjt:  MEFYQTDSEAILLQIRRQEEKLKSKRRWLLGLPTSVSGQKYSDHSDFLNKRNLPESLLREDDVFYETVKTRVEEAFGALNVETRHLGIQADRLFDTCKII

Query:  KLILSYLDDLSTRGLYLLAIILTEDSVKFEKTRWKLKRVIREFLPDLLRRKSQDCHQLEIVKQLSQLVNDPKNFRRRWPVTLTSSSPSYYDAASQVLNRL
        KLILS LDDLSTRGLY LA IL+EDSVK EKTRWKLKRVIREF+P +L RKSQDC+QLE  K+LSQL+ND  NFRR    T TSS+ S++DAASQVL  L
Subjt:  KLILSYLDDLSTRGLYLLAIILTEDSVKFEKTRWKLKRVIREFLPDLLRRKSQDCHQLEIVKQLSQLVNDPKNFRRRWPVTLTSSSPSYYDAASQVLNRL

Query:  GDLPTQGLLAMRRKLEGVRVMPQIKRHRHGWGRDRLINLLIETSKKMLSSLGEGDELQESLAKAMAVADLSLKLVP------------------------
        GD+PTQ LLAMRRKLEGVR +PQIK  + GWGRDRLINLL + SKKMLSSLGEGDELQESLAKAMAVADLSLKLVP                        
Subjt:  GDLPTQGLLAMRRKLEGVRVMPQIKRHRHGWGRDRLINLLIETSKKMLSSLGEGDELQESLAKAMAVADLSLKLVP------------------------

Query:  --------------EAQTVKSLLDPDAKVSNRCLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPQSFFSQEEIEKEVECVFSLSAQMKQV
                      + + +K LLDPDA+VS+RCLR  IK+ML DYLFECSDMDTVPKSLLKALAMI  DSRSAP S   Q+EI  EVE VFSLSAQMKQV
Subjt:  --------------EAQTVKSLLDPDAKVSNRCLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPQSFFSQEEIEKEVECVFSLSAQMKQV

Query:  VWDLLPNCDFEHDFADAYMEELEESDDDFDGNDDDTCDGLPREDNGSHSVNLHHLVEGMGESMPTNLEHSSVGNVLSPSLASLRNVDVEPFQSSEPMHFT
        VWDLLPNCDFEHDF DAYMEELEESDDD+D NDDD  DGLP+ED+G HSV     VEGMGESMP NL+++SVGN+LSPS ASL+N DV+  + S+P  FT
Subjt:  VWDLLPNCDFEHDFADAYMEELEESDDDFDGNDDDTCDGLPREDNGSHSVNLHHLVEGMGESMPTNLEHSSVGNVLSPSLASLRNVDVEPFQSSEPMHFT

Query:  GEGSLDSSFNYPPPFMESKFQKDTYNLSSNQQVGNKDT---------------------------ISVSHHRDESTFRNQYLVIQEACDATSMIAYNFIG
         EGSLDSSF Y P FMESK Q DTYNLSSNQQVG+KDT                           I        STF+NQYLV+QEACD TSMIAYNFIG
Subjt:  GEGSLDSSFNYPPPFMESKFQKDTYNLSSNQQVGNKDT---------------------------ISVSHHRDESTFRNQYLVIQEACDATSMIAYNFIG

Query:  RLLEEFAKSEGVEL----------------DCAEDEQTHMKENANDSVIIQVCEELIPSLS----KRYDDLMWNKETPAGAGIVILRGFWWSELAVDLWA
        RLLEE+AKSEG+EL                D  E EQT +K    DSVIIQVCEELIPSLS    KR + L+  +           +GF  SEL V LWA
Subjt:  RLLEEFAKSEGVEL----------------DCAEDEQTHMKENANDSVIIQVCEELIPSLS----KRYDDLMWNKETPAGAGIVILRGFWWSELAVDLWA

XP_022969011.1 uncharacterized protein LOC111468137 isoform X1 [Cucurbita maxima]1.0e-23165.43Show/hide
Query:  MEFYQTDSEAILLQIRRQEEKLKSKRRWLLGLPTSVSGQKYSDHSDFLNKRNLPESLLREDDVFYETVKTRVEEAFGALNVETRHLGIQADRLFDTCKII
        MEFYQTDS+A+L Q+RRQEE LKSKRRWLLGLPTSVSG KYSDHSD LNKRNLPESLLREDDVF+ TVKTRVEEAFG LN+ETRHLGI+AD++ DTCK+ 
Subjt:  MEFYQTDSEAILLQIRRQEEKLKSKRRWLLGLPTSVSGQKYSDHSDFLNKRNLPESLLREDDVFYETVKTRVEEAFGALNVETRHLGIQADRLFDTCKII

Query:  KLILSYLDDLSTRGLYLLAIILTEDSVKFEKTRWKLKRVIREFLPDLLRRKSQDCHQLEIVKQLSQLVNDPKNFRRRWPVTLTSSSPSYYDAASQVLNRL
        K ILS LDDLSTRGLY LA IL+EDSVK EKTRWKLKRVIREF+P +L RKSQDC+QLE  K+LSQL+ND  NFRR    T TSS+ S++DAASQVL  L
Subjt:  KLILSYLDDLSTRGLYLLAIILTEDSVKFEKTRWKLKRVIREFLPDLLRRKSQDCHQLEIVKQLSQLVNDPKNFRRRWPVTLTSSSPSYYDAASQVLNRL

Query:  GDLPTQGLLAMRRKLEGVRVMPQIKRHRHGWGRDRLINLLIETSKKMLSSLGEGDELQESLAKAMAVADLSLKLVP------------------------
        GD+PTQ LLAMRRKLEGVR +PQIK  + GWGRDRLINLL + SKKMLSSLGEG ELQESLAKAMAVADLSLKLVP                        
Subjt:  GDLPTQGLLAMRRKLEGVRVMPQIKRHRHGWGRDRLINLLIETSKKMLSSLGEGDELQESLAKAMAVADLSLKLVP------------------------

Query:  --------------EAQTVKSLLDPDAKVSNRCLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPQSFFSQEEIEKEVECVFSLSAQMKQV
                      + + +K LLDPDA+VS+RCLR  IK+ML DYLFECSDMDTVPKSLLKALAMI  DSRSAP S  SQ+EI +EVE VFSLSAQMKQV
Subjt:  --------------EAQTVKSLLDPDAKVSNRCLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPQSFFSQEEIEKEVECVFSLSAQMKQV

Query:  VWDLLPNCDFEHDFADAYMEELEESDDDFDGNDDDTCDGLPREDNGSHSVNLHHLVEGMGESMPTNLEHSSVGNVLSPSLASLRNVDVEPFQSSEPMHFT
        VWDLLPNCDFEHDFADAYMEELEESDDD+  NDD+  DGLP+ED+G HSV     VEGMGESMP NL+++SVGN+LSPS ASL+N DVEPF+ S+P  FT
Subjt:  VWDLLPNCDFEHDFADAYMEELEESDDDFDGNDDDTCDGLPREDNGSHSVNLHHLVEGMGESMPTNLEHSSVGNVLSPSLASLRNVDVEPFQSSEPMHFT

Query:  GEGSLDSSFNYPPPFMESKFQKDTYNLSSNQQVGNKDT---------------------------ISVSHHRDESTFRNQYLVIQEACDATSMIAYNFIG
         EGSLDSSFNY P FMESK Q +TYNLSSNQQVGN+DT                           I        STF+NQYLV+QEACD TSMI YNFIG
Subjt:  GEGSLDSSFNYPPPFMESKFQKDTYNLSSNQQVGNKDT---------------------------ISVSHHRDESTFRNQYLVIQEACDATSMIAYNFIG

Query:  RLLEEFAKSEGVELD-CA---------------EDEQTHMKENANDSVIIQVCEELIPSLS----KRYDDLMWNKETPAGAGIVILRGFWWSELAVDLWA
        RLLEE+AKSEG+ELD CA               E EQT +K     SVIIQVCEELIPSLS    KR + L+  +           +GF  SEL V LWA
Subjt:  RLLEEFAKSEGVELD-CA---------------EDEQTHMKENANDSVIIQVCEELIPSLS----KRYDDLMWNKETPAGAGIVILRGFWWSELAVDLWA

XP_022969013.1 uncharacterized protein LOC111468137 isoform X2 [Cucurbita maxima]1.4e-23165.67Show/hide
Query:  MEFYQTDSEAILLQIRRQEEKLKSKRRWLLGLPTSVSGQKYSDHSDFLNKRNLPESLLREDDVFYETVKTRVEEAFGALNVETRHLGIQADRLFDTCKII
        MEFYQTDS+A+L Q+RRQEE LKSKRRWLLGLPTSVSG KYSDHSD LNKRNLPESLLREDDVF+ TVKTRVEEAFG LN+ETRHLGI+AD++ DTCK+ 
Subjt:  MEFYQTDSEAILLQIRRQEEKLKSKRRWLLGLPTSVSGQKYSDHSDFLNKRNLPESLLREDDVFYETVKTRVEEAFGALNVETRHLGIQADRLFDTCKII

Query:  KLILSYLDDLSTRGLYLLAIILTEDSVKFEKTRWKLKRVIREFLPDLLRRKSQDCHQLEIVKQLSQLVNDPKNFRRRWPVTLTSSSPSYYDAASQVLNRL
        K ILS LDDLSTRGLY LA IL+EDSVK EKTRWKLKRVIREF+P +L RKSQDC+QLE  K+LSQL+ND  NFRR    T TSS+ S++DAASQVL  L
Subjt:  KLILSYLDDLSTRGLYLLAIILTEDSVKFEKTRWKLKRVIREFLPDLLRRKSQDCHQLEIVKQLSQLVNDPKNFRRRWPVTLTSSSPSYYDAASQVLNRL

Query:  GDLPTQGLLAMRRKLEGVRVMPQIKRHRHGWGRDRLINLLIETSKKMLSSLGEGDELQESLAKAMAVADLSLKLVP------------------------
        GD+PTQ LLAMRRKLEGVR +PQIK  + GWGRDRLINLL + SKKMLSSLGEG ELQESLAKAMAVADLSLKLVP                        
Subjt:  GDLPTQGLLAMRRKLEGVRVMPQIKRHRHGWGRDRLINLLIETSKKMLSSLGEGDELQESLAKAMAVADLSLKLVP------------------------

Query:  --------------EAQTVKSLLDPDAKVSNRCLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPQSFFSQEEIEKEVECVFSLSAQMKQV
                      + + +K LLDPDA+VS+RCLR  IK+ML DYLFECSDMDTVPKSLLKALAMI  DSRSAP S  SQ+EI +EVE VFSLSAQMKQV
Subjt:  --------------EAQTVKSLLDPDAKVSNRCLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPQSFFSQEEIEKEVECVFSLSAQMKQV

Query:  VWDLLPNCDFEHDFADAYMEELEESDDDFDGNDDDTCDGLPREDNGSHSVNLHHLVEGMGESMPTNLEHSSVGNVLSPSLASLRNVDVEPFQSSEPMHFT
        VWDLLPNCDFEHDFADAYMEELEESDDD+  NDD+  DGLP+ED+G HSV     VEGMGESMP NL+++SVGN+LSPS ASL+N DVEPF+ S+P  FT
Subjt:  VWDLLPNCDFEHDFADAYMEELEESDDDFDGNDDDTCDGLPREDNGSHSVNLHHLVEGMGESMPTNLEHSSVGNVLSPSLASLRNVDVEPFQSSEPMHFT

Query:  GEGSLDSSFNYPPPFMESKFQKDTYNLSSNQQVGNKDT---------------------------ISVSHHRDESTFRNQYLVIQEACDATSMIAYNFIG
         EGSLDSSFNY P FMESK Q +TYNLSSNQQVGN+DT                           I        STF+NQYLV+QEACD TSMI YNFIG
Subjt:  GEGSLDSSFNYPPPFMESKFQKDTYNLSSNQQVGNKDT---------------------------ISVSHHRDESTFRNQYLVIQEACDATSMIAYNFIG

Query:  RLLEEFAKSEGVELD-CA-----------ED---EQTHMKENANDSVIIQVCEELIPSLS----KRYDDLMWNKETPAGAGIVILRGFWWSELAVDLWA
        RLLEE+AKSEG+ELD CA           ED   EQT +K     SVIIQVCEELIPSLS    KR + L+  +           +GF  SEL V LWA
Subjt:  RLLEEFAKSEGVELD-CA-----------ED---EQTHMKENANDSVIIQVCEELIPSLS----KRYDDLMWNKETPAGAGIVILRGFWWSELAVDLWA

XP_038890245.1 uncharacterized protein LOC120079870 isoform X1 [Benincasa hispida]3.3e-23067.98Show/hide
Query:  MEFYQTDSEAILLQIRRQEEKLKSKRRWLLGLPTSVSGQKYSDHSDFLNKRNLPESLLREDDVFYETVKTRVEEAFGALNVETRHLGIQADRLFDTCKII
        MEFYQTDS+A+L+QIRRQE  LKSKRRWLLGLPTSVS  KYSDHSDFLNKRNLPESLLREDDVFYETVKTRVEEAFG LNVETRHLGI+A+R+ DTCK+ 
Subjt:  MEFYQTDSEAILLQIRRQEEKLKSKRRWLLGLPTSVSGQKYSDHSDFLNKRNLPESLLREDDVFYETVKTRVEEAFGALNVETRHLGIQADRLFDTCKII

Query:  KLILSYLDDLSTRGLYLLAIILTEDSVKFEKTRWKLKRVIREFLPDLLRRKSQDCHQLEIVKQLSQLVNDPKNFRRRWPVTLTSSSPSYYDAASQVLNRL
        KLI+S L+DLS RGLYLLAIILTEDSV+ EKTRWKLKR I+EF+P +LRRKS+DC QLE+VK LSQL ND KNFRRR   TLTSSS S +DA SQVL  L
Subjt:  KLILSYLDDLSTRGLYLLAIILTEDSVKFEKTRWKLKRVIREFLPDLLRRKSQDCHQLEIVKQLSQLVNDPKNFRRRWPVTLTSSSPSYYDAASQVLNRL

Query:  GDLPTQGLLAMRRKLEGVRVMPQIKRHRHGWGRDRLINLLIETSKKMLSSLGEGDELQESLAKAMAVADLSLKLVP------------------------
        GDLPTQ LLAMRRKLEGVR MPQ+KRHRHGWGRDRLINLL + S+KMLSS+GEGDELQESLAKAMAVADLS KLVP                        
Subjt:  GDLPTQGLLAMRRKLEGVRVMPQIKRHRHGWGRDRLINLLIETSKKMLSSLGEGDELQESLAKAMAVADLSLKLVP------------------------

Query:  --------------EAQTVKSLLDPDAKVSNRCLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPQSFFSQEEIEKEVECVFSLSAQMKQV
                      + + +KSLLDPDAKVS+R LR +IK MLIDYLFECSDMDTVPKSLLKALA++NADSRSA  S FSQ+EIE++ ECVFSLSAQMKQV
Subjt:  --------------EAQTVKSLLDPDAKVSNRCLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPQSFFSQEEIEKEVECVFSLSAQMKQV

Query:  VWDLLPNCDFEHDFADAYMEELEESDDDFDGNDDDTCDGLPREDNGSHSVNLHHLVEGMGESMPTNLEHSSVGNVLSPSLASLRNVDVEPFQSSEPMHFT
        VWDLLPNCDFEHDFADAYMEELEESDDDF+  +DD+CDG P+ED    SV     VEGMGESMP NL+HSSVGN+L+PS ASL N DVE  Q S PMH  
Subjt:  VWDLLPNCDFEHDFADAYMEELEESDDDFDGNDDDTCDGLPREDNGSHSVNLHHLVEGMGESMPTNLEHSSVGNVLSPSLASLRNVDVEPFQSSEPMHFT

Query:  GEGSLDSSFNYPPPFMESKFQKDTYNLSSNQQVGNKDTISVSHHRDE---------------------------STFRNQYLVIQEACDATSMIAYNFIG
         EGSLDSSF+    FMESK Q D  NLSSNQQVGN+ T ++   ++                            STF+NQYL++QEACD TSMIAYNFIG
Subjt:  GEGSLDSSFNYPPPFMESKFQKDTYNLSSNQQVGNKDTISVSHHRDE---------------------------STFRNQYLVIQEACDATSMIAYNFIG

Query:  RLLEEFAKSEGVELD-CA---------------EDEQTHMKENANDSVIIQVCEELIPSLSK
        RLLEEFA+SEGVELD CA               E EQTH+K   NDSVII+VC+ELIP LSK
Subjt:  RLLEEFAKSEGVELD-CA---------------EDEQTHMKENANDSVIIQVCEELIPSLSK

TrEMBL top hitse value%identityAlignment
A0A6J1C3N7 uncharacterized protein LOC1110079605.3e-24271.71Show/hide
Query:  MEFYQTDSEAILLQIRRQEEKLKSKRRWLLGLPTSVSGQKYSDHSDFLNKRNLPESLLREDDVFYETVKTRVEEAFGALNVETRHLGIQADRLFDTCKII
        MEF QTDSEAI LQIRRQEE LK KRRWLLGLPTS SGQK SDHSDFLNKRNLPE LLREDDVFYETVKTRVEEAFGALNVETRH GIQ DR+FDTCKI 
Subjt:  MEFYQTDSEAILLQIRRQEEKLKSKRRWLLGLPTSVSGQKYSDHSDFLNKRNLPESLLREDDVFYETVKTRVEEAFGALNVETRHLGIQADRLFDTCKII

Query:  KLILSYLDDLSTRGLYLLAIILTEDSVKFEKTRWKLKRVIREFLPDLLRRKSQDCHQLEIVKQLSQLVNDPKNFRRRWPVTLTSSSPSYYDAASQVLNRL
        KLILS LDDLSTRGLYLLAIILT+DSVK EKTRWKLKRV+REF+P++LRRKSQDCHQLE+VK+LSQL+NDP NFRRR  VTLTSSSPSY+DAASQVL RL
Subjt:  KLILSYLDDLSTRGLYLLAIILTEDSVKFEKTRWKLKRVIREFLPDLLRRKSQDCHQLEIVKQLSQLVNDPKNFRRRWPVTLTSSSPSYYDAASQVLNRL

Query:  GDLPTQGLLAMRRKLEGVRVMPQIKRHRHGWGRDRLINLLIETSKKMLSSLGEGDELQESLAKAMAVADLSLKLVP------------------------
        GDLPTQGLLAMRRKLEGVRVMPQIKRHRHGWGRDRLIN+L ETS KMLSS GEGDELQESLAKAMAVADLSLKLVP                        
Subjt:  GDLPTQGLLAMRRKLEGVRVMPQIKRHRHGWGRDRLINLLIETSKKMLSSLGEGDELQESLAKAMAVADLSLKLVP------------------------

Query:  --------------EAQTVKSLLDPDAKVSNRCLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPQSFFSQEEIEKEVECVFSLSAQMKQV
                      + + +KSLLDPDAKVSNRCLRTAIKKMLIDYLFECSDMDTVPKSLLKALA+INADSR+A  SFFS EEIEKEVECVFSLSAQMKQV
Subjt:  --------------EAQTVKSLLDPDAKVSNRCLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPQSFFSQEEIEKEVECVFSLSAQMKQV

Query:  VWDLLPNCDFEHDFADAYMEELEESDDDFDGNDDDTCDGLPREDNGSHSVNLHHLVEGMGESMPTNLEHSSVGNVLSPSLASLRNVDVEPFQSSEPMHFT
        VWDLLPNCDFEHDFADAYMEELEESDDDFD ND+D CDGLP +DNGSHS N+ H VEGMGESMP NLEHSSVGN LSPS A                 FT
Subjt:  VWDLLPNCDFEHDFADAYMEELEESDDDFDGNDDDTCDGLPREDNGSHSVNLHHLVEGMGESMPTNLEHSSVGNVLSPSLASLRNVDVEPFQSSEPMHFT

Query:  GEGSLDSSFNYPPPFMESKFQKDTYNLSSNQ----------QVGNKDTISVS--------------HHRDESTFRNQYLVIQEACDATSMIAYNFIGRLL
        GE  LDSSF Y P FMESKFQKDT++   +              + DT + +                   STF+NQYLVIQEACD TS+IAYN IGRLL
Subjt:  GEGSLDSSFNYPPPFMESKFQKDTYNLSSNQ----------QVGNKDTISVS--------------HHRDESTFRNQYLVIQEACDATSMIAYNFIGRLL

Query:  EEFAKSEGVELD-CA-----------------EDEQTHMKENANDSVIIQVCEELIPSLSK
        EEFAKSEGVELD CA                 EDEQTH+K N N    IQVC+ELIPSLSK
Subjt:  EEFAKSEGVELD-CA-----------------EDEQTHMKENANDSVIIQVCEELIPSLSK

A0A6J1GJE0 uncharacterized protein LOC111454828 isoform X11.2e-23065.14Show/hide
Query:  MEFYQTDSEAILLQIRRQEEKLKSKRRWLLGLPTSVSGQKYSDHSDFLNKRNLPESLLREDDVFYETVKTRVEEAFGALNVETRHLGIQADRLFDTCKII
        MEFYQTDS+A+L Q+RRQEE LKSKRRWLLGLPTS+SG KYSDHSD LNKRNLPESLLREDDVF+ TVKTRVEEAFG LN+ETRHLGI+AD++ DTCK+ 
Subjt:  MEFYQTDSEAILLQIRRQEEKLKSKRRWLLGLPTSVSGQKYSDHSDFLNKRNLPESLLREDDVFYETVKTRVEEAFGALNVETRHLGIQADRLFDTCKII

Query:  KLILSYLDDLSTRGLYLLAIILTEDSVKFEKTRWKLKRVIREFLPDLLRRKSQDCHQLEIVKQLSQLVNDPKNFRRRWPVTLTSSSPSYYDAASQVLNRL
        KLILS LDDLSTRGLY LA IL+EDSVK EKTRWKLKRVIREF+P +L RKSQDC+QLE  K+LSQL+ND  NFRR    T TSS+ S++DAASQVL  L
Subjt:  KLILSYLDDLSTRGLYLLAIILTEDSVKFEKTRWKLKRVIREFLPDLLRRKSQDCHQLEIVKQLSQLVNDPKNFRRRWPVTLTSSSPSYYDAASQVLNRL

Query:  GDLPTQGLLAMRRKLEGVRVMPQIKRHRHGWGRDRLINLLIETSKKMLSSLGEGDELQESLAKAMAVADLSLKLVP------------------------
        GD+PTQ LLAMRRKLEGVR +PQIK  + GWGRDRLINLL + SKKMLSSLGEGDELQESLAKAMAVADLSLKLVP                        
Subjt:  GDLPTQGLLAMRRKLEGVRVMPQIKRHRHGWGRDRLINLLIETSKKMLSSLGEGDELQESLAKAMAVADLSLKLVP------------------------

Query:  --------------EAQTVKSLLDPDAKVSNRCLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPQSFFSQEEIEKEVECVFSLSAQMKQV
                      + + +K LLDPDA+VS+RCLR  IK+ML DYLFECSDMDTVPKSLLKALAMI  DSRSAP S   Q+EI  EVE VFSLSAQMKQV
Subjt:  --------------EAQTVKSLLDPDAKVSNRCLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPQSFFSQEEIEKEVECVFSLSAQMKQV

Query:  VWDLLPNCDFEHDFADAYMEELEESDDDFDGNDDDTCDGLPREDNGSHSVNLHHLVEGMGESMPTNLEHSSVGNVLSPSLASLRNVDVEPFQSSEPMHFT
        VWDLLPNCDFEHDF DAYMEELEESDDD+D NDDD  DGLP+ED+G HSV     VEGMGESMP NL+++SVGN+LSPS ASL+N DV+  + S+P  FT
Subjt:  VWDLLPNCDFEHDFADAYMEELEESDDDFDGNDDDTCDGLPREDNGSHSVNLHHLVEGMGESMPTNLEHSSVGNVLSPSLASLRNVDVEPFQSSEPMHFT

Query:  GEGSLDSSFNYPPPFMESKFQKDTYNLSSNQQVGNKDT---------------------------ISVSHHRDESTFRNQYLVIQEACDATSMIAYNFIG
         EGSLDSSF Y P FMESK Q DTYNLSSNQQVG+KDT                           I        STF+NQYLV+QEACD TSMIAYNFIG
Subjt:  GEGSLDSSFNYPPPFMESKFQKDTYNLSSNQQVGNKDT---------------------------ISVSHHRDESTFRNQYLVIQEACDATSMIAYNFIG

Query:  RLLEEFAKSEGVEL----------------DCAEDEQTHMKENANDSVIIQVCEELIPSLS----KRYDDLMWNKETPAGAGIVILRGFWWSELAVDLWA
        RLLEE+AKSEG+EL                D  E EQT +K    DSVIIQVCEELIPSLS    KR + L+  +           +GF  SEL V LWA
Subjt:  RLLEEFAKSEGVEL----------------DCAEDEQTHMKENANDSVIIQVCEELIPSLS----KRYDDLMWNKETPAGAGIVILRGFWWSELAVDLWA

A0A6J1GJF1 uncharacterized protein LOC111454828 isoform X22.1e-23065.52Show/hide
Query:  MEFYQTDSEAILLQIRRQEEKLKSKRRWLLGLPTSVSGQKYSDHSDFLNKRNLPESLLREDDVFYETVKTRVEEAFGALNVETRHLGIQADRLFDTCKII
        MEFYQTDS+A+L Q+RRQEE LKSKRRWLLGLPTS+SG KYSDHSD LNKRNLPESLLREDDVF+ TVKTRVEEAFG LN+ETRHLGI+AD++ DTCK+ 
Subjt:  MEFYQTDSEAILLQIRRQEEKLKSKRRWLLGLPTSVSGQKYSDHSDFLNKRNLPESLLREDDVFYETVKTRVEEAFGALNVETRHLGIQADRLFDTCKII

Query:  KLILSYLDDLSTRGLYLLAIILTEDSVKFEKTRWKLKRVIREFLPDLLRRKSQDCHQLEIVKQLSQLVNDPKNFRRRWPVTLTSSSPSYYDAASQVLNRL
        KLILS LDDLSTRGLY LA IL+EDSVK EKTRWKLKRVIREF+P +L RKSQDC+QLE  K+LSQL+ND  NFRR    T TSS+ S++DAASQVL  L
Subjt:  KLILSYLDDLSTRGLYLLAIILTEDSVKFEKTRWKLKRVIREFLPDLLRRKSQDCHQLEIVKQLSQLVNDPKNFRRRWPVTLTSSSPSYYDAASQVLNRL

Query:  GDLPTQGLLAMRRKLEGVRVMPQIKRHRHGWGRDRLINLLIETSKKMLSSLGEGDELQESLAKAMAVADLSLKLVP------------------------
        GD+PTQ LLAMRRKLEGVR +PQIK  + GWGRDRLINLL + SKKMLSSLGEGDELQESLAKAMAVADLSLKLVP                        
Subjt:  GDLPTQGLLAMRRKLEGVRVMPQIKRHRHGWGRDRLINLLIETSKKMLSSLGEGDELQESLAKAMAVADLSLKLVP------------------------

Query:  --------------EAQTVKSLLDPDAKVSNRCLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPQSFFSQEEIEKEVECVFSLSAQMKQV
                      + + +K LLDPDA+VS+RCLR  IK+ML DYLFECSDMDTVPKSLLKALAMI  DSRSAP S   Q+EI  EVE VFSLSAQMKQV
Subjt:  --------------EAQTVKSLLDPDAKVSNRCLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPQSFFSQEEIEKEVECVFSLSAQMKQV

Query:  VWDLLPNCDFEHDFADAYMEELEESDDDFDGNDDDTCDGLPREDNGSHSVNLHHLVEGMGESMPTNLEHSSVGNVLSPSLASLRNVDVEPFQSSEPMHFT
        VWDLLPNCDFEHDF DAYMEELEESDDD+D NDDD  DGLP+ED+G HSV     VEGMGESMP NL+++SVGN+LSPS ASL+N DV+  + S+P  FT
Subjt:  VWDLLPNCDFEHDFADAYMEELEESDDDFDGNDDDTCDGLPREDNGSHSVNLHHLVEGMGESMPTNLEHSSVGNVLSPSLASLRNVDVEPFQSSEPMHFT

Query:  GEGSLDSSFNYPPPFMESKFQKDTYNLSSNQQVGNKDT---------------------------ISVSHHRDESTFRNQYLVIQEACDATSMIAYNFIG
         EGSLDSSF Y P FMESK Q DTYNLSSNQQVG+KDT                           I        STF+NQYLV+QEACD TSMIAYNFIG
Subjt:  GEGSLDSSFNYPPPFMESKFQKDTYNLSSNQQVGNKDT---------------------------ISVSHHRDESTFRNQYLVIQEACDATSMIAYNFIG

Query:  RLLEEFAKSEGVELD-CA-----------ED---EQTHMKENANDSVIIQVCEELIPSLS----KRYDDLMWNKETPAGAGIVILRGFWWSELAVDLWA
        RLLEE+AKSEG+EL+ CA           ED   EQT +K    DSVIIQVCEELIPSLS    KR + L+  +           +GF  SEL V LWA
Subjt:  RLLEEFAKSEGVELD-CA-----------ED---EQTHMKENANDSVIIQVCEELIPSLS----KRYDDLMWNKETPAGAGIVILRGFWWSELAVDLWA

A0A6J1HWI3 uncharacterized protein LOC111468137 isoform X15.0e-23265.43Show/hide
Query:  MEFYQTDSEAILLQIRRQEEKLKSKRRWLLGLPTSVSGQKYSDHSDFLNKRNLPESLLREDDVFYETVKTRVEEAFGALNVETRHLGIQADRLFDTCKII
        MEFYQTDS+A+L Q+RRQEE LKSKRRWLLGLPTSVSG KYSDHSD LNKRNLPESLLREDDVF+ TVKTRVEEAFG LN+ETRHLGI+AD++ DTCK+ 
Subjt:  MEFYQTDSEAILLQIRRQEEKLKSKRRWLLGLPTSVSGQKYSDHSDFLNKRNLPESLLREDDVFYETVKTRVEEAFGALNVETRHLGIQADRLFDTCKII

Query:  KLILSYLDDLSTRGLYLLAIILTEDSVKFEKTRWKLKRVIREFLPDLLRRKSQDCHQLEIVKQLSQLVNDPKNFRRRWPVTLTSSSPSYYDAASQVLNRL
        K ILS LDDLSTRGLY LA IL+EDSVK EKTRWKLKRVIREF+P +L RKSQDC+QLE  K+LSQL+ND  NFRR    T TSS+ S++DAASQVL  L
Subjt:  KLILSYLDDLSTRGLYLLAIILTEDSVKFEKTRWKLKRVIREFLPDLLRRKSQDCHQLEIVKQLSQLVNDPKNFRRRWPVTLTSSSPSYYDAASQVLNRL

Query:  GDLPTQGLLAMRRKLEGVRVMPQIKRHRHGWGRDRLINLLIETSKKMLSSLGEGDELQESLAKAMAVADLSLKLVP------------------------
        GD+PTQ LLAMRRKLEGVR +PQIK  + GWGRDRLINLL + SKKMLSSLGEG ELQESLAKAMAVADLSLKLVP                        
Subjt:  GDLPTQGLLAMRRKLEGVRVMPQIKRHRHGWGRDRLINLLIETSKKMLSSLGEGDELQESLAKAMAVADLSLKLVP------------------------

Query:  --------------EAQTVKSLLDPDAKVSNRCLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPQSFFSQEEIEKEVECVFSLSAQMKQV
                      + + +K LLDPDA+VS+RCLR  IK+ML DYLFECSDMDTVPKSLLKALAMI  DSRSAP S  SQ+EI +EVE VFSLSAQMKQV
Subjt:  --------------EAQTVKSLLDPDAKVSNRCLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPQSFFSQEEIEKEVECVFSLSAQMKQV

Query:  VWDLLPNCDFEHDFADAYMEELEESDDDFDGNDDDTCDGLPREDNGSHSVNLHHLVEGMGESMPTNLEHSSVGNVLSPSLASLRNVDVEPFQSSEPMHFT
        VWDLLPNCDFEHDFADAYMEELEESDDD+  NDD+  DGLP+ED+G HSV     VEGMGESMP NL+++SVGN+LSPS ASL+N DVEPF+ S+P  FT
Subjt:  VWDLLPNCDFEHDFADAYMEELEESDDDFDGNDDDTCDGLPREDNGSHSVNLHHLVEGMGESMPTNLEHSSVGNVLSPSLASLRNVDVEPFQSSEPMHFT

Query:  GEGSLDSSFNYPPPFMESKFQKDTYNLSSNQQVGNKDT---------------------------ISVSHHRDESTFRNQYLVIQEACDATSMIAYNFIG
         EGSLDSSFNY P FMESK Q +TYNLSSNQQVGN+DT                           I        STF+NQYLV+QEACD TSMI YNFIG
Subjt:  GEGSLDSSFNYPPPFMESKFQKDTYNLSSNQQVGNKDT---------------------------ISVSHHRDESTFRNQYLVIQEACDATSMIAYNFIG

Query:  RLLEEFAKSEGVELD-CA---------------EDEQTHMKENANDSVIIQVCEELIPSLS----KRYDDLMWNKETPAGAGIVILRGFWWSELAVDLWA
        RLLEE+AKSEG+ELD CA               E EQT +K     SVIIQVCEELIPSLS    KR + L+  +           +GF  SEL V LWA
Subjt:  RLLEEFAKSEGVELD-CA---------------EDEQTHMKENANDSVIIQVCEELIPSLS----KRYDDLMWNKETPAGAGIVILRGFWWSELAVDLWA

A0A6J1HZR8 uncharacterized protein LOC111468137 isoform X26.5e-23265.67Show/hide
Query:  MEFYQTDSEAILLQIRRQEEKLKSKRRWLLGLPTSVSGQKYSDHSDFLNKRNLPESLLREDDVFYETVKTRVEEAFGALNVETRHLGIQADRLFDTCKII
        MEFYQTDS+A+L Q+RRQEE LKSKRRWLLGLPTSVSG KYSDHSD LNKRNLPESLLREDDVF+ TVKTRVEEAFG LN+ETRHLGI+AD++ DTCK+ 
Subjt:  MEFYQTDSEAILLQIRRQEEKLKSKRRWLLGLPTSVSGQKYSDHSDFLNKRNLPESLLREDDVFYETVKTRVEEAFGALNVETRHLGIQADRLFDTCKII

Query:  KLILSYLDDLSTRGLYLLAIILTEDSVKFEKTRWKLKRVIREFLPDLLRRKSQDCHQLEIVKQLSQLVNDPKNFRRRWPVTLTSSSPSYYDAASQVLNRL
        K ILS LDDLSTRGLY LA IL+EDSVK EKTRWKLKRVIREF+P +L RKSQDC+QLE  K+LSQL+ND  NFRR    T TSS+ S++DAASQVL  L
Subjt:  KLILSYLDDLSTRGLYLLAIILTEDSVKFEKTRWKLKRVIREFLPDLLRRKSQDCHQLEIVKQLSQLVNDPKNFRRRWPVTLTSSSPSYYDAASQVLNRL

Query:  GDLPTQGLLAMRRKLEGVRVMPQIKRHRHGWGRDRLINLLIETSKKMLSSLGEGDELQESLAKAMAVADLSLKLVP------------------------
        GD+PTQ LLAMRRKLEGVR +PQIK  + GWGRDRLINLL + SKKMLSSLGEG ELQESLAKAMAVADLSLKLVP                        
Subjt:  GDLPTQGLLAMRRKLEGVRVMPQIKRHRHGWGRDRLINLLIETSKKMLSSLGEGDELQESLAKAMAVADLSLKLVP------------------------

Query:  --------------EAQTVKSLLDPDAKVSNRCLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPQSFFSQEEIEKEVECVFSLSAQMKQV
                      + + +K LLDPDA+VS+RCLR  IK+ML DYLFECSDMDTVPKSLLKALAMI  DSRSAP S  SQ+EI +EVE VFSLSAQMKQV
Subjt:  --------------EAQTVKSLLDPDAKVSNRCLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPQSFFSQEEIEKEVECVFSLSAQMKQV

Query:  VWDLLPNCDFEHDFADAYMEELEESDDDFDGNDDDTCDGLPREDNGSHSVNLHHLVEGMGESMPTNLEHSSVGNVLSPSLASLRNVDVEPFQSSEPMHFT
        VWDLLPNCDFEHDFADAYMEELEESDDD+  NDD+  DGLP+ED+G HSV     VEGMGESMP NL+++SVGN+LSPS ASL+N DVEPF+ S+P  FT
Subjt:  VWDLLPNCDFEHDFADAYMEELEESDDDFDGNDDDTCDGLPREDNGSHSVNLHHLVEGMGESMPTNLEHSSVGNVLSPSLASLRNVDVEPFQSSEPMHFT

Query:  GEGSLDSSFNYPPPFMESKFQKDTYNLSSNQQVGNKDT---------------------------ISVSHHRDESTFRNQYLVIQEACDATSMIAYNFIG
         EGSLDSSFNY P FMESK Q +TYNLSSNQQVGN+DT                           I        STF+NQYLV+QEACD TSMI YNFIG
Subjt:  GEGSLDSSFNYPPPFMESKFQKDTYNLSSNQQVGNKDT---------------------------ISVSHHRDESTFRNQYLVIQEACDATSMIAYNFIG

Query:  RLLEEFAKSEGVELD-CA-----------ED---EQTHMKENANDSVIIQVCEELIPSLS----KRYDDLMWNKETPAGAGIVILRGFWWSELAVDLWA
        RLLEE+AKSEG+ELD CA           ED   EQT +K     SVIIQVCEELIPSLS    KR + L+  +           +GF  SEL V LWA
Subjt:  RLLEEFAKSEGVELD-CA-----------ED---EQTHMKENANDSVIIQVCEELIPSLS----KRYDDLMWNKETPAGAGIVILRGFWWSELAVDLWA

SwissProt top hitse value%identityAlignment
Q0J035 Pantothenate kinase 21.1e-4240.07Show/hide
Query:  RTSRKTLKVMEDHLMASRLNDAIEDDGKRLENLVRGIFAGNIFDLGSAQLAEVFSRDG-MSFLASCQHIVPRPWVIDDLDIFKLKW--SKKS--WKKVII
        R +  +L V+ D LM     D++ ++  RL  L+ G+ A NIFD GS    +++ +   +      +  + RPW IDD D+FK +    KK   +K+ ++
Subjt:  RTSRKTLKVMEDHLMASRLNDAIEDDGKRLENLVRGIFAGNIFDLGSAQLAEVFSRDG-MSFLASCQHIVPRPWVIDDLDIFKLKW--SKKS--WKKVII

Query:  FVDNSGADIILGILPFARELLRHGSQVVLAANDLPSINDVTYHELIEILSQLKDDHGKLMGVDTSN----------------------LLVANSGNDLPV
        FVDNSGAD++LG++P ARELLR+G++VVL AN LP++NDVT +EL  I+++     G L     +                       L+V  +G   P 
Subjt:  FVDNSGADIILGILPFARELLRHGSQVVLAANDLPSINDVTYHELIEILSQLKDDHGKLMGVDTSN----------------------LLVANSGNDLPV

Query:  IDLTQVSQELSYLATDADLVILEGMGRGIETNLYAQFKCDSLKIGMVKHLEVAQFL-GGRLYDCVFK
        ID  QVS EL+  A DADL+ILEGMGR + TNL A+FKCD+LK+ MVK+  +A+ L  G +YDC+ K
Subjt:  IDLTQVSQELSYLATDADLVILEGMGRGIETNLYAQFKCDSLKIGMVKHLEVAQFL-GGRLYDCVFK

Q5R5F8 4'-phosphopantetheine phosphatase5.6e-3932.4Show/hide
Query:  WINVFLNAIPSFKKRA-ESDPTVPDAEVKAEKFAQRYAEYLRTSR-----------KTLKVMEDHLM--------------------------ASRLNDA
        W+  F  A+    KRA  S P   DA  +AEKF Q+Y   L+T R           ++L    +H +                            R  DA
Subjt:  WINVFLNAIPSFKKRA-ESDPTVPDAEVKAEKFAQRYAEYLRTSR-----------KTLKVMEDHLM--------------------------ASRLNDA

Query:  IEDDGKRLENLVRGIFAGNIFDLGSAQLAEVFSRDG-MSFLASCQHIVPRPWVIDDLDIFKLKWSKKSWKKVIIFVDNSGADIILGILPFARELLRHGSQ
        +  + ++L  LV+G+ AGN+FD G+  +++V   D    F  + + +  RPW++D    +  +      K  +IF DNSG DIILG+ PF RELL  G++
Subjt:  IEDDGKRLENLVRGIFAGNIFDLGSAQLAEVFSRDG-MSFLASCQHIVPRPWVIDDLDIFKLKWSKKSWKKVIIFVDNSGADIILGILPFARELLRHGSQ

Query:  VVLAANDLPSINDVTYHELIEILSQLKD-DHGKLMGVDTSNLLVANSGNDLPVIDLTQVSQELSYLATD--ADLVILEGMGRGIETNLYAQFKCDSLKIG
        V+LA N  P++NDVT+ E + +  ++   D      +    LL+  +G+  P +DL+++ + L+ L  +  ADLV++EGMGR + TN +A  +C+SLK+ 
Subjt:  VVLAANDLPSINDVTYHELIEILSQLKD-DHGKLMGVDTSNLLVANSGNDLPVIDLTQVSQELSYLATD--ADLVILEGMGRGIETNLYAQFKCDSLKIG

Query:  MVKHLEVAQFLGGRLYDCVFK
        ++K+  +A+ LGGRL+  +FK
Subjt:  MVKHLEVAQFLGGRLYDCVFK

Q8L5Y9 Pantothenate kinase 24.6e-4137.31Show/hide
Query:  KTLKVMEDHLMASRLNDAIED-----DGKRLENLVRGIFAGNIFDLGSAQLAEVFSRDG-MSFLASCQHIVPRPWVIDDLDIFKLKW------SKKSWKK
        +++K  E+    + L D +E+     +  RL  L+ G+ A NIFD GS    +++ +   +      ++ + RPW +DD D FK +            K+
Subjt:  KTLKVMEDHLMASRLNDAIED-----DGKRLENLVRGIFAGNIFDLGSAQLAEVFSRDG-MSFLASCQHIVPRPWVIDDLDIFKLKW------SKKSWKK

Query:  VIIFVDNSGADIILGILPFARELLRHGSQVVLAANDLPSINDVTYHELIEIL----------------------SQLKDDHGKLMGVDTSNLLVANSGND
         ++FVDNSGAD+ILG+LP ARE LR G++VVL AN LP++NDVT  EL +I+                      + +    G      ++ L+V  +G  
Subjt:  VIIFVDNSGADIILGILPFARELLRHGSQVVLAANDLPSINDVTYHELIEIL----------------------SQLKDDHGKLMGVDTSNLLVANSGND

Query:  LPVIDLTQVSQELSYLATDADLVILEGMGRGIETNLYAQFKCDSLKIGMVKHLEVAQ-FLGGRLYDCV
         P IDL QVS EL+  A DADLV+LEGMGR + TN  AQF+C++LK+ MVK+  +A+  + G +YDCV
Subjt:  LPVIDLTQVSQELSYLATDADLVILEGMGRGIETNLYAQFKCDSLKIGMVKHLEVAQ-FLGGRLYDCV

Q949P3 Damage-control phosphatase At2g173401.5e-14071.07Show/hide
Query:  MESASELVPFPLLLTPIESNYRACTIPYRFPSDNPRKPTPIELSWINVFLNAIPSFKKRAESDPTVPDAEVKAEKFAQRYAEYLRTSRK-----------
        MES SE+VPFP L  PIE+NYRACTIPYRFPSD+P+K TP E+SWINVF N+IPSFKKRAESD TVPDA  +AEKFA+RYA  L   +K           
Subjt:  MESASELVPFPLLLTPIESNYRACTIPYRFPSDNPRKPTPIELSWINVFLNAIPSFKKRAESDPTVPDAEVKAEKFAQRYAEYLRTSRK-----------

Query:  ------------------TLKVMEDHLMAS---------RLNDAIEDDGKRLENLVRGIFAGNIFDLGSAQLAEVFSRDGMSFLASCQHIVPRPWVIDDL
                            K ++D   A           L+DAIEDDGKRLENLVRGIFAGNIFDLGSAQLAEVFSRDGMSFLASCQ++VPRPWVIDDL
Subjt:  ------------------TLKVMEDHLMAS---------RLNDAIEDDGKRLENLVRGIFAGNIFDLGSAQLAEVFSRDGMSFLASCQHIVPRPWVIDDL

Query:  DIFKLKWSKKSWKKVIIFVDNSGADIILGILPFARELLRHGSQVVLAANDLPSINDVTYHELIEILSQLKDDHGKLMGVDTSNLLVANSGNDLPVIDLTQ
        + F+ KW  KSWKK +IFVDNSGADIILGILPFARELLR G+QVVLAAN+LPSIND+T  EL EILSQLKD++G+L+GVDTS LL+ANSGNDLPVIDL++
Subjt:  DIFKLKWSKKSWKKVIIFVDNSGADIILGILPFARELLRHGSQVVLAANDLPSINDVTYHELIEILSQLKDDHGKLMGVDTSNLLVANSGNDLPVIDLTQ

Query:  VSQELSYLATDADLVILEGMGRGIETNLYAQFKCDSLKIGMVKHLEVAQFLGGRLYDCVFKLS
        VSQEL+YL++DADLVI+EGMGRGIETNLYAQFKCDSLKIGMVKHLEVA+FLGGRLYDCVFK +
Subjt:  VSQELSYLATDADLVILEGMGRGIETNLYAQFKCDSLKIGMVKHLEVAQFLGGRLYDCVFKLS

Q9NVE7 4'-phosphopantetheine phosphatase7.4e-3932.4Show/hide
Query:  WINVFLNAIPSFKKRA-ESDPTVPDAEVKAEKFAQRYAEYLRTSR-----------KTLKVMEDHLM--------------------------ASRLNDA
        W+  F  A+    KRA  S P   DA  +AEKF Q+Y   L+T R           ++L    +H +                            R  DA
Subjt:  WINVFLNAIPSFKKRA-ESDPTVPDAEVKAEKFAQRYAEYLRTSR-----------KTLKVMEDHLM--------------------------ASRLNDA

Query:  IEDDGKRLENLVRGIFAGNIFDLGSAQLAEVFSRDG-MSFLASCQHIVPRPWVIDDLDIFKLKWSKKSWKKVIIFVDNSGADIILGILPFARELLRHGSQ
        +  + ++L  LV+G+ AGN+FD G+  ++ V   D    F  + + +  RPW++D    +  +      K  +IF DNSG DIILG+ PF RELL  G++
Subjt:  IEDDGKRLENLVRGIFAGNIFDLGSAQLAEVFSRDG-MSFLASCQHIVPRPWVIDDLDIFKLKWSKKSWKKVIIFVDNSGADIILGILPFARELLRHGSQ

Query:  VVLAANDLPSINDVTYHELIEILSQLKD-DHGKLMGVDTSNLLVANSGNDLPVIDLTQVSQELSYLATD--ADLVILEGMGRGIETNLYAQFKCDSLKIG
        V+LA N  P++NDVT+ E + +  ++   D      +    LL+  +G+  P +DL+++ + L+ L  +  ADLV++EGMGR + TN +A  +C+SLK+ 
Subjt:  VVLAANDLPSINDVTYHELIEILSQLKD-DHGKLMGVDTSNLLVANSGNDLPVIDLTQVSQELSYLATD--ADLVILEGMGRGIETNLYAQFKCDSLKIG

Query:  MVKHLEVAQFLGGRLYDCVFK
        ++K+  +A+ LGGRL+  +FK
Subjt:  MVKHLEVAQFLGGRLYDCVFK

Arabidopsis top hitse value%identityAlignment
AT2G17320.1 Uncharacterised conserved protein (UCP030210)1.3e-13468.63Show/hide
Query:  LVPFPLLLTPIESNYRACTIPYRFPSDNPRKPTPIELSWINVFLNAIPSFKKRAESDPTVPDAEVKAEKFAQRYAEYLRTSRK------------TLKVM
        ++PFPLL TPIE+NYRACTIPYRFPSDN +K TP E+SWINVF N+IPSFKKRAESD +VPDA  +A+ F QRYA +L   +K             L  +
Subjt:  LVPFPLLLTPIESNYRACTIPYRFPSDNPRKPTPIELSWINVFLNAIPSFKKRAESDPTVPDAEVKAEKFAQRYAEYLRTSRK------------TLKVM

Query:  EDHLMAS--------------------------RLNDAIEDDGKRLENLVRGIFAGNIFDLGSAQLAEVFSRDGMSFLASCQHIVPRPWVIDDLDIFKLK
         +HL+                             L+DAIEDDG+RLENLVRGIFAGNIFDLGSAQLAEVFSRDG+SFLA+ Q++VPRPWVIDDL+ F+ K
Subjt:  EDHLMAS--------------------------RLNDAIEDDGKRLENLVRGIFAGNIFDLGSAQLAEVFSRDGMSFLASCQHIVPRPWVIDDLDIFKLK

Query:  WSKKSWKKVIIFVDNSGADIILGILPFARELLRHGSQVVLAANDLPSINDVTYHELIEILSQLKDDHGKLMGVDTSNLLVANSGNDLPVIDLTQVSQELS
        W KK WKK +IFVDNSGADIILGILPFARELLR G QVVLAAN+LP+INDVTY EL +I+SQLKD +G+L+GVDTS LL+ANSGNDL VIDL++VS EL+
Subjt:  WSKKSWKKVIIFVDNSGADIILGILPFARELLRHGSQVVLAANDLPSINDVTYHELIEILSQLKDDHGKLMGVDTSNLLVANSGNDLPVIDLTQVSQELS

Query:  YLATDADLVILEGMGRGIETNLYAQFKCDSLKIGMVKHLEVAQFLGGRLYDCVFKLS
        YL++DADLVILEGMGRGIETNLYAQFKCDSLKIGMVKH+EVAQFLGGRLYDCVFK +
Subjt:  YLATDADLVILEGMGRGIETNLYAQFKCDSLKIGMVKHLEVAQFLGGRLYDCVFKLS

AT2G17340.1 Uncharacterised conserved protein (UCP030210)1.1e-14171.07Show/hide
Query:  MESASELVPFPLLLTPIESNYRACTIPYRFPSDNPRKPTPIELSWINVFLNAIPSFKKRAESDPTVPDAEVKAEKFAQRYAEYLRTSRK-----------
        MES SE+VPFP L  PIE+NYRACTIPYRFPSD+P+K TP E+SWINVF N+IPSFKKRAESD TVPDA  +AEKFA+RYA  L   +K           
Subjt:  MESASELVPFPLLLTPIESNYRACTIPYRFPSDNPRKPTPIELSWINVFLNAIPSFKKRAESDPTVPDAEVKAEKFAQRYAEYLRTSRK-----------

Query:  ------------------TLKVMEDHLMAS---------RLNDAIEDDGKRLENLVRGIFAGNIFDLGSAQLAEVFSRDGMSFLASCQHIVPRPWVIDDL
                            K ++D   A           L+DAIEDDGKRLENLVRGIFAGNIFDLGSAQLAEVFSRDGMSFLASCQ++VPRPWVIDDL
Subjt:  ------------------TLKVMEDHLMAS---------RLNDAIEDDGKRLENLVRGIFAGNIFDLGSAQLAEVFSRDGMSFLASCQHIVPRPWVIDDL

Query:  DIFKLKWSKKSWKKVIIFVDNSGADIILGILPFARELLRHGSQVVLAANDLPSINDVTYHELIEILSQLKDDHGKLMGVDTSNLLVANSGNDLPVIDLTQ
        + F+ KW  KSWKK +IFVDNSGADIILGILPFARELLR G+QVVLAAN+LPSIND+T  EL EILSQLKD++G+L+GVDTS LL+ANSGNDLPVIDL++
Subjt:  DIFKLKWSKKSWKKVIIFVDNSGADIILGILPFARELLRHGSQVVLAANDLPSINDVTYHELIEILSQLKDDHGKLMGVDTSNLLVANSGNDLPVIDLTQ

Query:  VSQELSYLATDADLVILEGMGRGIETNLYAQFKCDSLKIGMVKHLEVAQFLGGRLYDCVFKLS
        VSQEL+YL++DADLVI+EGMGRGIETNLYAQFKCDSLKIGMVKHLEVA+FLGGRLYDCVFK +
Subjt:  VSQELSYLATDADLVILEGMGRGIETNLYAQFKCDSLKIGMVKHLEVAQFLGGRLYDCVFKLS

AT4G35360.1 Uncharacterised conserved protein (UCP030210)7.6e-14070.91Show/hide
Query:  MESASELVPFPLLLTPIESNYRACTIPYRFPSDNPRKPTPIELSWINVFLNAIPSFKKRAESDPTVPDAEVKAEKFAQRYAEYLRTSRK-----------
        MES SE+V  PLL TPIESNYRACTIPYRFPSDNPRK TP E+SWI++F N+IPSFK+RAESD TVPDA V+AEKFA+RYAE L   +K           
Subjt:  MESASELVPFPLLLTPIESNYRACTIPYRFPSDNPRKPTPIELSWINVFLNAIPSFKKRAESDPTVPDAEVKAEKFAQRYAEYLRTSRK-----------

Query:  ------------------TLKVMEDHLMAS---------RLNDAIEDDGKRLENLVRGIFAGNIFDLGSAQLAEVFSRDGMSFLASCQHIVPRPWVIDDL
                            K ++D   A          RL+DAI D+GKR+ENLVRGIFAGNIFDLGSAQLAEVFS+DGMSFLASCQ++V RPWVIDDL
Subjt:  ------------------TLKVMEDHLMAS---------RLNDAIEDDGKRLENLVRGIFAGNIFDLGSAQLAEVFSRDGMSFLASCQHIVPRPWVIDDL

Query:  DIFKLKWSKKSWKKVIIFVDNSGADIILGILPFARELLRHGSQVVLAANDLPSINDVTYHELIEILSQLKDDHGKLMGVDTSNLLVANSGNDLPVIDLTQ
        D F+ +W KK WKK +IFVDNSGADIILGILPFARE+LR G QVVLAAN+LPSINDVTY EL EILS+L D++G+LMGVDTSNLL+ANSGNDLPVIDL +
Subjt:  DIFKLKWSKKSWKKVIIFVDNSGADIILGILPFARELLRHGSQVVLAANDLPSINDVTYHELIEILSQLKDDHGKLMGVDTSNLLVANSGNDLPVIDLTQ

Query:  VSQELSYLATDADLVILEGMGRGIETNLYAQFKCDSLKIGMVKHLEVAQFLGGRLYDCVFK
        VSQE++YL++DADLVILEGMGRGIETNLYAQFKCDSLKIGMVKH EVAQFLGGRLYDCV K
Subjt:  VSQELSYLATDADLVILEGMGRGIETNLYAQFKCDSLKIGMVKHLEVAQFLGGRLYDCVFK

AT5G40520.1 unknown protein1.1e-6632.8Show/hide
Query:  IILTEDSVKFEKTRWKLKRVIREFLPDLLRRKSQDCHQLEIVKQLSQLVNDPKNFRR--RWPVTLTSSSPSYYDAASQVLNRLGDLPTQGLLAMRRKLEG
        +ILT  S  F+KTR K+K +IR+ +     +  +   + EI+ QL Q+++DP NFR   R  +  T +  S+ DAA +VLN L  L TQ L AM+RKL+G
Subjt:  IILTEDSVKFEKTRWKLKRVIREFLPDLLRRKSQDCHQLEIVKQLSQLVNDPKNFRR--RWPVTLTSSSPSYYDAASQVLNRLGDLPTQGLLAMRRKLEG

Query:  VRVMPQIKRHRHGWGRDRLINLLIETSKKMLSSLGEGDELQESLAKAMAVADLSLKLVPEAQT--------------------VKSL-------------
         R++PQ+K  R G  R  LIN + + S+KMLS L  GD+LQE LAKA++V DLSLKL P  +T                    VK++             
Subjt:  VRVMPQIKRHRHGWGRDRLINLLIETSKKMLSSLGEGDELQESLAKAMAVADLSLKLVPEAQT--------------------VKSL-------------

Query:  ----LDPDAKVSNRCLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPQSFFSQEEIEKEVECVFSLSAQMKQVVWDLLPNCDFEHDFADAY
            LDP+A+VSN  LR+A++K LI+YLFECSD+DT+PKSL++AL+++N+ + +       +E IE+E EC+ ++SAQ+KQ+    +PN + + DF DAY
Subjt:  ----LDPDAKVSNRCLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPQSFFSQEEIEKEVECVFSLSAQMKQVVWDLLPNCDFEHDFADAY

Query:  MEELEES-----DDDFDGNDDDTCDGLPREDNGSH--------SVNLHHLVEGMGESMPTNLEHSSVGNVLSPSLASLRNVDVEPFQSS-----------
        ME+LE+S     DDD D +DD+ C+    E  G+          + +    +   ES  ++ E S    ++     S    +   F SS           
Subjt:  MEELEES-----DDDFDGNDDDTCDGLPREDNGSH--------SVNLHHLVEGMGESMPTNLEHSSVGNVLSPSLASLRNVDVEPFQSS-----------

Query:  -------EPMHFTGEGSL--------DSSFNYPPPFMESKFQKDTYNLSSNQQVGNKDTISVSHHRDESTFR---NQYLVIQEACDATSMIAYNFIGRLL
                 ++ T   +         D + +     +E   + +  N  S++ + + + I    H ++   R   NQYL IQE  D TS++A+N IGRLL
Subjt:  -------EPMHFTGEGSL--------DSSFNYPPPFMESKFQKDTYNLSSNQQVGNKDTISVSHHRDESTFR---NQYLVIQEACDATSMIAYNFIGRLL

Query:  EEFAKSEGVEL------------------DCAEDEQTHMKENANDSVIIQVCEELIPSLSK
        E+FA  + + L                  + +E +Q   +  +++++I+ V +E I SL +
Subjt:  EEFAKSEGVEL------------------DCAEDEQTHMKENANDSVIIQVCEELIPSLSK

AT5G40520.2 unknown protein7.2e-9034.51Show/hide
Query:  FYQTDSEAILLQIRRQEEKLKSKRRWLLGLPTSVSGQKYSDHSDFLNKRNLPESLLREDDVFYETVKTRVEEAFGALNVETRHLGIQADRLFDTCKIIKL
        F++TD  +IL QI+ Q+E+++ KRRWLLG   S S +   DH+       +PESLLREDD+FYET+K+RVEEAFG    +         +    C +   
Subjt:  FYQTDSEAILLQIRRQEEKLKSKRRWLLGLPTSVSGQKYSDHSDFLNKRNLPESLLREDDVFYETVKTRVEEAFGALNVETRHLGIQADRLFDTCKIIKL

Query:  ILSYLDDLSTRGLYLLAIILTEDSVKFEKTRWKLKRVIREFLPDLLRRKSQDCHQLEIVKQLSQLVNDPKNFRR--RWPVTLTSSSPSYYDAASQVLNRL
        ++  LD L+ +GLYL+A+ILT  S  F+KTR K+K +IR+ +     +  +   + EI+ QL Q+++DP NFR   R  +  T +  S+ DAA +VLN L
Subjt:  ILSYLDDLSTRGLYLLAIILTEDSVKFEKTRWKLKRVIREFLPDLLRRKSQDCHQLEIVKQLSQLVNDPKNFRR--RWPVTLTSSSPSYYDAASQVLNRL

Query:  GDLPTQGLLAMRRKLEGVRVMPQIKRHRHGWGRDRLINLLIETSKKMLSSLGEGDELQESLAKAMAVADLSLKLVPEAQT--------------------
          L TQ L AM+RKL+G R++PQ+K  R G  R  LIN + + S+KMLS L  GD+LQE LAKA++V DLSLKL P  +T                    
Subjt:  GDLPTQGLLAMRRKLEGVRVMPQIKRHRHGWGRDRLINLLIETSKKMLSSLGEGDELQESLAKAMAVADLSLKLVPEAQT--------------------

Query:  VKSL-----------------LDPDAKVSNRCLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPQSFFSQEEIEKEVECVFSLSAQMKQVV
        VK++                 LDP+A+VSN  LR+A++K LI+YLFECSD+DT+PKSL++AL+++N+ + +       +E IE+E EC+ ++SAQ+KQ+ 
Subjt:  VKSL-----------------LDPDAKVSNRCLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPQSFFSQEEIEKEVECVFSLSAQMKQVV

Query:  WDLLPNCDFEHDFADAYMEELEES-----DDDFDGNDDDTCDGLPREDNGSH--------SVNLHHLVEGMGESMPTNLEHSSVGNVLSPSLASLRNVDV
           +PN + + DF DAYME+LE+S     DDD D +DD+ C+    E  G+          + +    +   ES  ++ E S    ++     S    + 
Subjt:  WDLLPNCDFEHDFADAYMEELEES-----DDDFDGNDDDTCDGLPREDNGSH--------SVNLHHLVEGMGESMPTNLEHSSVGNVLSPSLASLRNVDV

Query:  EPFQSS------------------EPMHFTGEGSL--------DSSFNYPPPFMESKFQKDTYNLSSNQQVGNKDTISVSHHRDESTFR---NQYLVIQE
          F SS                    ++ T   +         D + +     +E   + +  N  S++ + + + I    H ++   R   NQYL IQE
Subjt:  EPFQSS------------------EPMHFTGEGSL--------DSSFNYPPPFMESKFQKDTYNLSSNQQVGNKDTISVSHHRDESTFR---NQYLVIQE

Query:  ACDATSMIAYNFIGRLLEEFAKSEGVEL------------------DCAEDEQTHMKENANDSVIIQVCEELIPSLSK
          D TS++A+N IGRLLE+FA  + + L                  + +E +Q   +  +++++I+ V +E I SL +
Subjt:  ACDATSMIAYNFIGRLLEEFAKSEGVEL------------------DCAEDEQTHMKENANDSVIIQVCEELIPSLSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAGCGCGTCGGAGCTGGTGCCGTTTCCGCTGCTGCTCACACCGATTGAGTCCAATTACAGAGCCTGCACCATTCCCTACAGATTCCCTTCCGACAATCCTCGCAA
GCCCACCCCCATTGAGCTGTCTTGGATCAACGTCTTCCTCAATGCCATCCCTTCTTTCAAGAAACGGGCAGAGAGTGATCCCACAGTTCCAGATGCTGAGGTGAAAGCTG
AAAAATTTGCTCAGAGGTATGCTGAATACTTGAGGACTTCAAGAAAGACCCTGAAAGTCATGGAGGACCACCTGATGGCATCTCGTCTTAATGATGCCATTGAAGACGAT
GGTAAACGTTTGGAGAATCTGGTTAGAGGAATATTTGCAGGGAACATTTTTGATCTTGGTTCTGCTCAGCTAGCTGAAGTTTTCTCAAGGGATGGTATGTCTTTCTTAGC
GAGTTGTCAACATATTGTTCCTCGTCCTTGGGTGATTGACGATTTGGACATATTCAAACTCAAATGGAGTAAAAAATCATGGAAGAAGGTCATAATATTTGTTGATAATT
CTGGTGCAGATATTATTTTGGGCATTTTGCCATTTGCTAGAGAGTTACTCCGGCATGGATCTCAGGTTGTCTTAGCAGCTAATGACTTGCCCTCAATTAACGACGTAACT
TATCATGAACTAATTGAAATTTTATCGCAGCTGAAGGATGACCATGGAAAACTCATGGGAGTTGATACTTCCAACCTTTTAGTTGCCAATTCTGGCAATGATTTGCCGGT
CATTGACCTTACCCAAGTATCCCAAGAGCTTAGCTACCTGGCAACTGATGCAGATCTAGTTATCTTGGAAGGAATGGGTCGTGGAATAGAGACAAACCTTTACGCTCAAT
TTAAGTGTGATTCGCTAAAGATTGGCATGGTAAAGCATCTGGAAGTTGCACAGTTTCTTGGAGGAAGGCTCTACGACTGTGTATTCAAGCTTTCTCACCAAGTTATGTTC
ATTATTATTATTATTTTTTTTGCATTAGGTAAATGGCCAACAGGGTTTGTCGCACACCCCTCCTCCTTTGCTTCCTCTCTTGCCTCTTGCCTCGCCCCGCCGCCGCCGCC
GCCGCCGCCGCGCCGCCGCCGCCGCCGCCTCCACATCCCTCTCTCTCGCCGCCTCGCGCCAACACCTCTCTTCTCTCTCGCAGTCGCCGCCTGGCGCCAGCTTAAGCCAC
CCCTTTTCTCTGGTCGGTTCTCTGCACCGAACCAGAACGGGTGGCAAAAGAAGATGGAGTTTTATCAGACGGATTCGGAAGCAATTTTGTTACAGATTCGACGTCAAGAG
GAGAAGTTGAAATCCAAAAGAAGATGGTTGCTGGGGCTTCCTACATCTGTATCTGGACAAAAGTATTCTGATCATTCAGACTTTTTAAATAAACGAAACTTGCCTGAATC
TTTGCTGAGAGAAGATGATGTTTTCTATGAGACCGTCAAAACAAGAGTTGAAGAAGCTTTTGGAGCGCTAAATGTTGAAACAAGGCATCTTGGTATTCAAGCTGATCGAT
TATTTGATACTTGCAAAATTATAAAACTCATTTTGTCATATCTTGATGATCTGAGCACTAGGGGGCTTTACCTTCTTGCTATAATACTTACAGAAGACTCTGTCAAATTT
GAAAAAACTCGTTGGAAGCTGAAAAGGGTCATTAGAGAATTTCTTCCAGACCTTCTGAGAAGGAAGAGTCAAGATTGCCATCAATTGGAAATTGTTAAACAATTGTCTCA
ACTTGTCAATGACCCAAAAAATTTCCGAAGAAGATGGCCAGTAACTTTGACATCAAGTTCACCATCTTACTATGATGCTGCATCACAGGTACTGAATAGATTAGGAGACC
TGCCCACCCAAGGTCTCTTAGCTATGCGTCGAAAGCTTGAAGGAGTTCGAGTTATGCCTCAGATAAAACGTCATAGGCATGGGTGGGGTCGTGATCGTCTTATTAATCTT
CTTATAGAAACTAGTAAGAAGATGCTTTCATCGCTTGGTGAAGGAGATGAATTGCAAGAATCACTAGCAAAAGCCATGGCGGTGGCCGATTTATCTCTTAAACTGGTACC
AGAAGCTCAAACAGTTAAGTCTTTGTTGGATCCCGATGCTAAAGTGTCCAATAGGTGTCTAAGAACAGCTATTAAGAAAATGTTAATTGACTATCTTTTTGAGTGCAGTG
ATATGGACACTGTGCCAAAGTCTCTTTTGAAAGCTCTAGCTATGATAAATGCAGATTCTCGAAGTGCACCACAGTCATTTTTCTCCCAGGAGGAAATTGAAAAGGAGGTT
GAGTGTGTATTCAGTCTGAGTGCTCAGATGAAACAAGTAGTTTGGGATTTACTGCCTAACTGTGACTTTGAACATGACTTTGCTGATGCATATATGGAAGAGTTAGAAGA
AAGTGATGATGATTTTGATGGTAATGACGATGATACTTGTGATGGCTTGCCTCGAGAAGACAATGGGTCCCACTCTGTTAATTTGCATCACCTTGTTGAAGGTATGGGGG
AATCAATGCCAACCAATCTGGAACATTCATCAGTGGGAAATGTCTTGTCCCCTAGTCTTGCGTCCTTGAGAAATGTAGATGTGGAGCCTTTCCAAAGTTCTGAACCTATG
CATTTTACTGGGGAAGGTTCTTTAGATTCTTCTTTTAATTACCCCCCTCCGTTCATGGAATCCAAATTTCAGAAAGATACATATAACTTATCTTCTAACCAACAAGTGGG
GAACAAAGACACCATCTCTGTTTCACATCATAGAGATGAAAGCACGTTTAGGAATCAGTACCTTGTGATTCAAGAAGCTTGTGATGCGACCAGCATGATTGCTTATAATT
TCATTGGCCGCTTGCTTGAAGAGTTTGCAAAGAGTGAAGGCGTTGAATTAGATTGTGCAGAAGATGAACAGACTCATATGAAGGAGAATGCGAATGATTCAGTAATTATT
CAAGTTTGTGAAGAGTTAATACCTTCTCTTTCTAAAAGGTATGATGATCTAATGTGGAACAAAGAGACTCCAGCAGGTGCTGGGATTGTGATTCTCAGGGGATTTTGGTG
GTCTGAGTTAGCAGTTGATCTTTGGGCAGGAATTGTCTCACTTGGGATCATCGCTGGGCAGAATGAAGCAGTTGGTGGAGGGTTTGAGGAAGTTCATTTGAGACATTTTG
GTAAGACTACGTACCTTGAGAAAAGGGAGCACAAGCCAGCTGAAGACTGGGTATCCTTTAGAGAGCATGATGCAACTCAGGCTTTCAATGGCTCCTTCTTCCCAGTGCTG
GCCAACATATGCCTCAACACCTTGAGGTACTTGGCTCTTGAAACCTGGAGGGAAAGAAACGGGGGGATTCTTGGGAATCAGAAGTCCTCCTGCTTTACTTTTTATACTCG
TTTTTTCCTATTAGGATCTGTGAAGTTCAAAGCTTTTGAGGTTTCAGGTTGTAATTGTTGTGCTTCTTTAGATACTTGTAATGACGATTTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAGCGCGTCGGAGCTGGTGCCGTTTCCGCTGCTGCTCACACCGATTGAGTCCAATTACAGAGCCTGCACCATTCCCTACAGATTCCCTTCCGACAATCCTCGCAA
GCCCACCCCCATTGAGCTGTCTTGGATCAACGTCTTCCTCAATGCCATCCCTTCTTTCAAGAAACGGGCAGAGAGTGATCCCACAGTTCCAGATGCTGAGGTGAAAGCTG
AAAAATTTGCTCAGAGGTATGCTGAATACTTGAGGACTTCAAGAAAGACCCTGAAAGTCATGGAGGACCACCTGATGGCATCTCGTCTTAATGATGCCATTGAAGACGAT
GGTAAACGTTTGGAGAATCTGGTTAGAGGAATATTTGCAGGGAACATTTTTGATCTTGGTTCTGCTCAGCTAGCTGAAGTTTTCTCAAGGGATGGTATGTCTTTCTTAGC
GAGTTGTCAACATATTGTTCCTCGTCCTTGGGTGATTGACGATTTGGACATATTCAAACTCAAATGGAGTAAAAAATCATGGAAGAAGGTCATAATATTTGTTGATAATT
CTGGTGCAGATATTATTTTGGGCATTTTGCCATTTGCTAGAGAGTTACTCCGGCATGGATCTCAGGTTGTCTTAGCAGCTAATGACTTGCCCTCAATTAACGACGTAACT
TATCATGAACTAATTGAAATTTTATCGCAGCTGAAGGATGACCATGGAAAACTCATGGGAGTTGATACTTCCAACCTTTTAGTTGCCAATTCTGGCAATGATTTGCCGGT
CATTGACCTTACCCAAGTATCCCAAGAGCTTAGCTACCTGGCAACTGATGCAGATCTAGTTATCTTGGAAGGAATGGGTCGTGGAATAGAGACAAACCTTTACGCTCAAT
TTAAGTGTGATTCGCTAAAGATTGGCATGGTAAAGCATCTGGAAGTTGCACAGTTTCTTGGAGGAAGGCTCTACGACTGTGTATTCAAGCTTTCTCACCAAGTTATGTTC
ATTATTATTATTATTTTTTTTGCATTAGGTAAATGGCCAACAGGGTTTGTCGCACACCCCTCCTCCTTTGCTTCCTCTCTTGCCTCTTGCCTCGCCCCGCCGCCGCCGCC
GCCGCCGCCGCGCCGCCGCCGCCGCCGCCTCCACATCCCTCTCTCTCGCCGCCTCGCGCCAACACCTCTCTTCTCTCTCGCAGTCGCCGCCTGGCGCCAGCTTAAGCCAC
CCCTTTTCTCTGGTCGGTTCTCTGCACCGAACCAGAACGGGTGGCAAAAGAAGATGGAGTTTTATCAGACGGATTCGGAAGCAATTTTGTTACAGATTCGACGTCAAGAG
GAGAAGTTGAAATCCAAAAGAAGATGGTTGCTGGGGCTTCCTACATCTGTATCTGGACAAAAGTATTCTGATCATTCAGACTTTTTAAATAAACGAAACTTGCCTGAATC
TTTGCTGAGAGAAGATGATGTTTTCTATGAGACCGTCAAAACAAGAGTTGAAGAAGCTTTTGGAGCGCTAAATGTTGAAACAAGGCATCTTGGTATTCAAGCTGATCGAT
TATTTGATACTTGCAAAATTATAAAACTCATTTTGTCATATCTTGATGATCTGAGCACTAGGGGGCTTTACCTTCTTGCTATAATACTTACAGAAGACTCTGTCAAATTT
GAAAAAACTCGTTGGAAGCTGAAAAGGGTCATTAGAGAATTTCTTCCAGACCTTCTGAGAAGGAAGAGTCAAGATTGCCATCAATTGGAAATTGTTAAACAATTGTCTCA
ACTTGTCAATGACCCAAAAAATTTCCGAAGAAGATGGCCAGTAACTTTGACATCAAGTTCACCATCTTACTATGATGCTGCATCACAGGTACTGAATAGATTAGGAGACC
TGCCCACCCAAGGTCTCTTAGCTATGCGTCGAAAGCTTGAAGGAGTTCGAGTTATGCCTCAGATAAAACGTCATAGGCATGGGTGGGGTCGTGATCGTCTTATTAATCTT
CTTATAGAAACTAGTAAGAAGATGCTTTCATCGCTTGGTGAAGGAGATGAATTGCAAGAATCACTAGCAAAAGCCATGGCGGTGGCCGATTTATCTCTTAAACTGGTACC
AGAAGCTCAAACAGTTAAGTCTTTGTTGGATCCCGATGCTAAAGTGTCCAATAGGTGTCTAAGAACAGCTATTAAGAAAATGTTAATTGACTATCTTTTTGAGTGCAGTG
ATATGGACACTGTGCCAAAGTCTCTTTTGAAAGCTCTAGCTATGATAAATGCAGATTCTCGAAGTGCACCACAGTCATTTTTCTCCCAGGAGGAAATTGAAAAGGAGGTT
GAGTGTGTATTCAGTCTGAGTGCTCAGATGAAACAAGTAGTTTGGGATTTACTGCCTAACTGTGACTTTGAACATGACTTTGCTGATGCATATATGGAAGAGTTAGAAGA
AAGTGATGATGATTTTGATGGTAATGACGATGATACTTGTGATGGCTTGCCTCGAGAAGACAATGGGTCCCACTCTGTTAATTTGCATCACCTTGTTGAAGGTATGGGGG
AATCAATGCCAACCAATCTGGAACATTCATCAGTGGGAAATGTCTTGTCCCCTAGTCTTGCGTCCTTGAGAAATGTAGATGTGGAGCCTTTCCAAAGTTCTGAACCTATG
CATTTTACTGGGGAAGGTTCTTTAGATTCTTCTTTTAATTACCCCCCTCCGTTCATGGAATCCAAATTTCAGAAAGATACATATAACTTATCTTCTAACCAACAAGTGGG
GAACAAAGACACCATCTCTGTTTCACATCATAGAGATGAAAGCACGTTTAGGAATCAGTACCTTGTGATTCAAGAAGCTTGTGATGCGACCAGCATGATTGCTTATAATT
TCATTGGCCGCTTGCTTGAAGAGTTTGCAAAGAGTGAAGGCGTTGAATTAGATTGTGCAGAAGATGAACAGACTCATATGAAGGAGAATGCGAATGATTCAGTAATTATT
CAAGTTTGTGAAGAGTTAATACCTTCTCTTTCTAAAAGGTATGATGATCTAATGTGGAACAAAGAGACTCCAGCAGGTGCTGGGATTGTGATTCTCAGGGGATTTTGGTG
GTCTGAGTTAGCAGTTGATCTTTGGGCAGGAATTGTCTCACTTGGGATCATCGCTGGGCAGAATGAAGCAGTTGGTGGAGGGTTTGAGGAAGTTCATTTGAGACATTTTG
GTAAGACTACGTACCTTGAGAAAAGGGAGCACAAGCCAGCTGAAGACTGGGTATCCTTTAGAGAGCATGATGCAACTCAGGCTTTCAATGGCTCCTTCTTCCCAGTGCTG
GCCAACATATGCCTCAACACCTTGAGGTACTTGGCTCTTGAAACCTGGAGGGAAAGAAACGGGGGGATTCTTGGGAATCAGAAGTCCTCCTGCTTTACTTTTTATACTCG
TTTTTTCCTATTAGGATCTGTGAAGTTCAAAGCTTTTGAGGTTTCAGGTTGTAATTGTTGTGCTTCTTTAGATACTTGTAATGACGATTTCTGA
Protein sequenceShow/hide protein sequence
MESASELVPFPLLLTPIESNYRACTIPYRFPSDNPRKPTPIELSWINVFLNAIPSFKKRAESDPTVPDAEVKAEKFAQRYAEYLRTSRKTLKVMEDHLMASRLNDAIEDD
GKRLENLVRGIFAGNIFDLGSAQLAEVFSRDGMSFLASCQHIVPRPWVIDDLDIFKLKWSKKSWKKVIIFVDNSGADIILGILPFARELLRHGSQVVLAANDLPSINDVT
YHELIEILSQLKDDHGKLMGVDTSNLLVANSGNDLPVIDLTQVSQELSYLATDADLVILEGMGRGIETNLYAQFKCDSLKIGMVKHLEVAQFLGGRLYDCVFKLSHQVMF
IIIIIFFALGKWPTGFVAHPSSFASSLASCLAPPPPPPPPRRRRRRLHIPLSRRLAPTPLFSLAVAAWRQLKPPLFSGRFSAPNQNGWQKKMEFYQTDSEAILLQIRRQE
EKLKSKRRWLLGLPTSVSGQKYSDHSDFLNKRNLPESLLREDDVFYETVKTRVEEAFGALNVETRHLGIQADRLFDTCKIIKLILSYLDDLSTRGLYLLAIILTEDSVKF
EKTRWKLKRVIREFLPDLLRRKSQDCHQLEIVKQLSQLVNDPKNFRRRWPVTLTSSSPSYYDAASQVLNRLGDLPTQGLLAMRRKLEGVRVMPQIKRHRHGWGRDRLINL
LIETSKKMLSSLGEGDELQESLAKAMAVADLSLKLVPEAQTVKSLLDPDAKVSNRCLRTAIKKMLIDYLFECSDMDTVPKSLLKALAMINADSRSAPQSFFSQEEIEKEV
ECVFSLSAQMKQVVWDLLPNCDFEHDFADAYMEELEESDDDFDGNDDDTCDGLPREDNGSHSVNLHHLVEGMGESMPTNLEHSSVGNVLSPSLASLRNVDVEPFQSSEPM
HFTGEGSLDSSFNYPPPFMESKFQKDTYNLSSNQQVGNKDTISVSHHRDESTFRNQYLVIQEACDATSMIAYNFIGRLLEEFAKSEGVELDCAEDEQTHMKENANDSVII
QVCEELIPSLSKRYDDLMWNKETPAGAGIVILRGFWWSELAVDLWAGIVSLGIIAGQNEAVGGGFEEVHLRHFGKTTYLEKREHKPAEDWVSFREHDATQAFNGSFFPVL
ANICLNTLRYLALETWRERNGGILGNQKSSCFTFYTRFFLLGSVKFKAFEVSGCNCCASLDTCNDDF