; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0099791 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0099791
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionUDP-N-acetylglucosamine pyrophosphorylase
Genome locationCMiso1.1chr04:15657875..15661484
RNA-Seq ExpressionCmc04g0099791
SyntenyCmc04g0099791
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK08503.1 uncharacterized protein E5676_scaffold323G00310 [Cucumis melo var. makuwa]1.1e-28399Show/hide
Query:  MPVMGKSSVKKPLRDVSNHRYPRTSSKSLTTATRREDFNPSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQS
        MPV GKSSVKKPLRDVSNHRYPRTSSKSLTTATRREDFNPSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQS
Subjt:  MPVMGKSSVKKPLRDVSNHRYPRTSSKSLTTATRREDFNPSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQS

Query:  SLKPWVPRFQKMFSHPSKDSDDGIGQSLANEGNALVNVKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLNG
        SLKPWVPRFQKMFSHPSKDSDDGIGQSLANEGNALV+VKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLNG
Subjt:  SLKPWVPRFQKMFSHPSKDSDDGIGQSLANEGNALVNVKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLNG

Query:  MKSGILKSTQPCFIACGDLNENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLSK
        MKSGILKSTQPCFIACGDLNENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLSK
Subjt:  MKSGILKSTQPCFIACGDLNENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLSK

Query:  KNCSMLVMTPCFKMSPPKSCVLLEPISESSHKDKKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLLE
        KNCSMLVMTPCFKMSPPKSCVLLEPISESSHK+KKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLLE
Subjt:  KNCSMLVMTPCFKMSPPKSCVLLEPISESSHKDKKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLLE

Query:  PSDSHSIESATSSGCHEATKSFSHQ--VGLSLPHIDNTPMLKECESVFRVGKCAGEETLKKELWMKFEAASTNPFPCDQALQKTSKKGFLDLLDEVSCD
        PSDSHSIESATSSGCHEATKSFSHQ  VGLSLPHIDNTPMLKECESVFRVGKCAGEETLKKELWMKFEAASTNPFPCDQALQKTSKKGFLDLLDEVSCD
Subjt:  PSDSHSIESATSSGCHEATKSFSHQ--VGLSLPHIDNTPMLKECESVFRVGKCAGEETLKKELWMKFEAASTNPFPCDQALQKTSKKGFLDLLDEVSCD

XP_008447555.1 PREDICTED: uncharacterized protein LOC103489972 isoform X1 [Cucumis melo]2.7e-28298.8Show/hide
Query:  MPVMGKSSVKKPLRDVSNHRYPRTSSKSLTTATRREDFNPSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQS
        MPV GKSSVKKPLRDVSNHRYPRTSSKSLTTATRREDFNPSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQS
Subjt:  MPVMGKSSVKKPLRDVSNHRYPRTSSKSLTTATRREDFNPSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQS

Query:  SLKPWVPRFQKMFSHPSKDSDDGIGQSLANEGNALVNVKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLNG
        SLKPWVPRFQKMFSHPSKDSDDGIGQSLANEGNALV+VKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLNG
Subjt:  SLKPWVPRFQKMFSHPSKDSDDGIGQSLANEGNALVNVKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLNG

Query:  MKSGILKSTQPCFIACGDLNENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLSK
        MKSGILKSTQPCFIACGDLNENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLSK
Subjt:  MKSGILKSTQPCFIACGDLNENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLSK

Query:  KNCSMLVMTPCFKMSPPKSCVLLEPISESSHKDKKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLLE
        KNCSMLVMTPCFKMSPPKSCVLLEPISESSHK+KKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLLE
Subjt:  KNCSMLVMTPCFKMSPPKSCVLLEPISESSHKDKKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLLE

Query:  PSDSHSIESATSSGCHEATKSFSHQ--VGLSLPHIDNTPMLKECESVFRVGKCAGEETLKKELWMKFEAASTNPFPCDQALQKTSKKGFLDLLDEVSCD
        PSDSHSIESATSSGCHEATKSFSHQ  VGLSLPHIDNTPMLKECESVFRVGK AGEETLKKELWMKFEAASTNPFPCDQALQKTSKKGFLDLLDEVSCD
Subjt:  PSDSHSIESATSSGCHEATKSFSHQ--VGLSLPHIDNTPMLKECESVFRVGKCAGEETLKKELWMKFEAASTNPFPCDQALQKTSKKGFLDLLDEVSCD

XP_008447557.1 PREDICTED: uncharacterized protein LOC103489972 isoform X2 [Cucumis melo]8.5e-28499.2Show/hide
Query:  MPVMGKSSVKKPLRDVSNHRYPRTSSKSLTTATRREDFNPSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQS
        MPV GKSSVKKPLRDVSNHRYPRTSSKSLTTATRREDFNPSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQS
Subjt:  MPVMGKSSVKKPLRDVSNHRYPRTSSKSLTTATRREDFNPSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQS

Query:  SLKPWVPRFQKMFSHPSKDSDDGIGQSLANEGNALVNVKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLNG
        SLKPWVPRFQKMFSHPSKDSDDGIGQSLANEGNALV+VKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLNG
Subjt:  SLKPWVPRFQKMFSHPSKDSDDGIGQSLANEGNALVNVKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLNG

Query:  MKSGILKSTQPCFIACGDLNENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLSK
        MKSGILKSTQPCFIACGDLNENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLSK
Subjt:  MKSGILKSTQPCFIACGDLNENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLSK

Query:  KNCSMLVMTPCFKMSPPKSCVLLEPISESSHKDKKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLLE
        KNCSMLVMTPCFKMSPPKSCVLLEPISESSHK+KKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLLE
Subjt:  KNCSMLVMTPCFKMSPPKSCVLLEPISESSHKDKKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLLE

Query:  PSDSHSIESATSSGCHEATKSFSHQVGLSLPHIDNTPMLKECESVFRVGKCAGEETLKKELWMKFEAASTNPFPCDQALQKTSKKGFLDLLDEVSCD
        PSDSHSIESATSSGCHEATKSFSHQVGLSLPHIDNTPMLKECESVFRVGK AGEETLKKELWMKFEAASTNPFPCDQALQKTSKKGFLDLLDEVSCD
Subjt:  PSDSHSIESATSSGCHEATKSFSHQVGLSLPHIDNTPMLKECESVFRVGKCAGEETLKKELWMKFEAASTNPFPCDQALQKTSKKGFLDLLDEVSCD

XP_011651484.1 uncharacterized protein LOC105434902 isoform X1 [Cucumis sativus]1.8e-25790.4Show/hide
Query:  MPVMGKSSVKKPLRDVSNHRYPRTSSKSLTTA-TRREDFNPSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQ
        MPV  KSS+KKP++DVSN +YP TSSKSLTT  T  +DFN SK EDLDDSLDRLLL+QSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTH LSD+ 
Subjt:  MPVMGKSSVKKPLRDVSNHRYPRTSSKSLTTA-TRREDFNPSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQ

Query:  SSLKPWVPRFQKMFSHPSKDSDDGIGQSLANEGNALVNVKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLN
        SSLKPW+PRFQK+FSHPSKDSDDGIGQSLAN GN LVN  ENNVADSPDH EAQDLVSPSPLVSWRAGCNIERGRQ+FLLTPLPISKS SSKHVAKSVLN
Subjt:  SSLKPWVPRFQKMFSHPSKDSDDGIGQSLANEGNALVNVKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLN

Query:  GMKSGILKSTQPCFIACGDLNENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLS
        GMKSGILKSTQPCFIACGDLNENPLECNVIEPSV KPSG DLSTLGENLLECNGTE SVV ++L+EGN LEPSGAEPSGSDLTQA IIHQRGFASPPLLS
Subjt:  GMKSGILKSTQPCFIACGDLNENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLS

Query:  KKNCSMLVMTPCFKMSPPKSCVLLEPISESSHKDKKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLL
        KKNCSML+MTPCFKMSPPKSCVLLEPISESSHKDKKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLL
Subjt:  KKNCSMLVMTPCFKMSPPKSCVLLEPISESSHKDKKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLL

Query:  EPSDSHSIESATSSGCHEATKSFSHQ--VGLSLPHIDNTPMLKECESVFRVGKCAGEETLKKELWMKFEAASTNPFPCDQALQKTSKKGFLDLLDEVSCD
        EPSDSHS++SATSSGCHEA KSFSHQ  VG+SLPHIDNTPMLK CESVFRVGK AGEETLKKELWMKFEAAS NPFPCD+ALQKTSKKGFLDLLDEVSCD
Subjt:  EPSDSHSIESATSSGCHEATKSFSHQ--VGLSLPHIDNTPMLKECESVFRVGKCAGEETLKKELWMKFEAASTNPFPCDQALQKTSKKGFLDLLDEVSCD

XP_011651485.1 uncharacterized protein LOC105434902 isoform X2 [Cucumis sativus]5.5e-25990.76Show/hide
Query:  MPVMGKSSVKKPLRDVSNHRYPRTSSKSLTTA-TRREDFNPSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQ
        MPV  KSS+KKP++DVSN +YP TSSKSLTT  T  +DFN SK EDLDDSLDRLLL+QSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTH LSD+ 
Subjt:  MPVMGKSSVKKPLRDVSNHRYPRTSSKSLTTA-TRREDFNPSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQ

Query:  SSLKPWVPRFQKMFSHPSKDSDDGIGQSLANEGNALVNVKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLN
        SSLKPW+PRFQK+FSHPSKDSDDGIGQSLAN GN LVN  ENNVADSPDH EAQDLVSPSPLVSWRAGCNIERGRQ+FLLTPLPISKS SSKHVAKSVLN
Subjt:  SSLKPWVPRFQKMFSHPSKDSDDGIGQSLANEGNALVNVKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLN

Query:  GMKSGILKSTQPCFIACGDLNENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLS
        GMKSGILKSTQPCFIACGDLNENPLECNVIEPSV KPSG DLSTLGENLLECNGTE SVV ++L+EGN LEPSGAEPSGSDLTQA IIHQRGFASPPLLS
Subjt:  GMKSGILKSTQPCFIACGDLNENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLS

Query:  KKNCSMLVMTPCFKMSPPKSCVLLEPISESSHKDKKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLL
        KKNCSML+MTPCFKMSPPKSCVLLEPISESSHKDKKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLL
Subjt:  KKNCSMLVMTPCFKMSPPKSCVLLEPISESSHKDKKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLL

Query:  EPSDSHSIESATSSGCHEATKSFSHQVGLSLPHIDNTPMLKECESVFRVGKCAGEETLKKELWMKFEAASTNPFPCDQALQKTSKKGFLDLLDEVSCD
        EPSDSHS++SATSSGCHEA KSFSHQVG+SLPHIDNTPMLK CESVFRVGK AGEETLKKELWMKFEAAS NPFPCD+ALQKTSKKGFLDLLDEVSCD
Subjt:  EPSDSHSIESATSSGCHEATKSFSHQVGLSLPHIDNTPMLKECESVFRVGKCAGEETLKKELWMKFEAASTNPFPCDQALQKTSKKGFLDLLDEVSCD

TrEMBL top hitse value%identityAlignment
A0A0A0LBM5 Uncharacterized protein8.6e-25890.4Show/hide
Query:  MPVMGKSSVKKPLRDVSNHRYPRTSSKSLTTA-TRREDFNPSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQ
        MPV  KSS+KKP++DVSN +YP TSSKSLTT  T  +DFN SK EDLDDSLDRLLL+QSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTH LSD+ 
Subjt:  MPVMGKSSVKKPLRDVSNHRYPRTSSKSLTTA-TRREDFNPSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQ

Query:  SSLKPWVPRFQKMFSHPSKDSDDGIGQSLANEGNALVNVKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLN
        SSLKPW+PRFQK+FSHPSKDSDDGIGQSLAN GN LVN  ENNVADSPDH EAQDLVSPSPLVSWRAGCNIERGRQ+FLLTPLPISKS SSKHVAKSVLN
Subjt:  SSLKPWVPRFQKMFSHPSKDSDDGIGQSLANEGNALVNVKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLN

Query:  GMKSGILKSTQPCFIACGDLNENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLS
        GMKSGILKSTQPCFIACGDLNENPLECNVIEPSV KPSG DLSTLGENLLECNGTE SVV ++L+EGN LEPSGAEPSGSDLTQA IIHQRGFASPPLLS
Subjt:  GMKSGILKSTQPCFIACGDLNENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLS

Query:  KKNCSMLVMTPCFKMSPPKSCVLLEPISESSHKDKKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLL
        KKNCSML+MTPCFKMSPPKSCVLLEPISESSHKDKKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLL
Subjt:  KKNCSMLVMTPCFKMSPPKSCVLLEPISESSHKDKKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLL

Query:  EPSDSHSIESATSSGCHEATKSFSHQ--VGLSLPHIDNTPMLKECESVFRVGKCAGEETLKKELWMKFEAASTNPFPCDQALQKTSKKGFLDLLDEVSCD
        EPSDSHS++SATSSGCHEA KSFSHQ  VG+SLPHIDNTPMLK CESVFRVGK AGEETLKKELWMKFEAAS NPFPCD+ALQKTSKKGFLDLLDEVSCD
Subjt:  EPSDSHSIESATSSGCHEATKSFSHQ--VGLSLPHIDNTPMLKECESVFRVGKCAGEETLKKELWMKFEAASTNPFPCDQALQKTSKKGFLDLLDEVSCD

A0A1S3BH47 uncharacterized protein LOC103489972 isoform X11.3e-28298.8Show/hide
Query:  MPVMGKSSVKKPLRDVSNHRYPRTSSKSLTTATRREDFNPSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQS
        MPV GKSSVKKPLRDVSNHRYPRTSSKSLTTATRREDFNPSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQS
Subjt:  MPVMGKSSVKKPLRDVSNHRYPRTSSKSLTTATRREDFNPSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQS

Query:  SLKPWVPRFQKMFSHPSKDSDDGIGQSLANEGNALVNVKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLNG
        SLKPWVPRFQKMFSHPSKDSDDGIGQSLANEGNALV+VKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLNG
Subjt:  SLKPWVPRFQKMFSHPSKDSDDGIGQSLANEGNALVNVKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLNG

Query:  MKSGILKSTQPCFIACGDLNENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLSK
        MKSGILKSTQPCFIACGDLNENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLSK
Subjt:  MKSGILKSTQPCFIACGDLNENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLSK

Query:  KNCSMLVMTPCFKMSPPKSCVLLEPISESSHKDKKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLLE
        KNCSMLVMTPCFKMSPPKSCVLLEPISESSHK+KKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLLE
Subjt:  KNCSMLVMTPCFKMSPPKSCVLLEPISESSHKDKKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLLE

Query:  PSDSHSIESATSSGCHEATKSFSHQ--VGLSLPHIDNTPMLKECESVFRVGKCAGEETLKKELWMKFEAASTNPFPCDQALQKTSKKGFLDLLDEVSCD
        PSDSHSIESATSSGCHEATKSFSHQ  VGLSLPHIDNTPMLKECESVFRVGK AGEETLKKELWMKFEAASTNPFPCDQALQKTSKKGFLDLLDEVSCD
Subjt:  PSDSHSIESATSSGCHEATKSFSHQ--VGLSLPHIDNTPMLKECESVFRVGKCAGEETLKKELWMKFEAASTNPFPCDQALQKTSKKGFLDLLDEVSCD

A0A1S3BHQ1 uncharacterized protein LOC103489972 isoform X24.1e-28499.2Show/hide
Query:  MPVMGKSSVKKPLRDVSNHRYPRTSSKSLTTATRREDFNPSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQS
        MPV GKSSVKKPLRDVSNHRYPRTSSKSLTTATRREDFNPSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQS
Subjt:  MPVMGKSSVKKPLRDVSNHRYPRTSSKSLTTATRREDFNPSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQS

Query:  SLKPWVPRFQKMFSHPSKDSDDGIGQSLANEGNALVNVKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLNG
        SLKPWVPRFQKMFSHPSKDSDDGIGQSLANEGNALV+VKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLNG
Subjt:  SLKPWVPRFQKMFSHPSKDSDDGIGQSLANEGNALVNVKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLNG

Query:  MKSGILKSTQPCFIACGDLNENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLSK
        MKSGILKSTQPCFIACGDLNENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLSK
Subjt:  MKSGILKSTQPCFIACGDLNENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLSK

Query:  KNCSMLVMTPCFKMSPPKSCVLLEPISESSHKDKKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLLE
        KNCSMLVMTPCFKMSPPKSCVLLEPISESSHK+KKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLLE
Subjt:  KNCSMLVMTPCFKMSPPKSCVLLEPISESSHKDKKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLLE

Query:  PSDSHSIESATSSGCHEATKSFSHQVGLSLPHIDNTPMLKECESVFRVGKCAGEETLKKELWMKFEAASTNPFPCDQALQKTSKKGFLDLLDEVSCD
        PSDSHSIESATSSGCHEATKSFSHQVGLSLPHIDNTPMLKECESVFRVGK AGEETLKKELWMKFEAASTNPFPCDQALQKTSKKGFLDLLDEVSCD
Subjt:  PSDSHSIESATSSGCHEATKSFSHQVGLSLPHIDNTPMLKECESVFRVGKCAGEETLKKELWMKFEAASTNPFPCDQALQKTSKKGFLDLLDEVSCD

A0A5A7U6G3 Uncharacterized protein1.3e-28298.8Show/hide
Query:  MPVMGKSSVKKPLRDVSNHRYPRTSSKSLTTATRREDFNPSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQS
        MPV GKSSVKKPLRDVSNHRYPRTSSKSLTTATRREDFNPSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQS
Subjt:  MPVMGKSSVKKPLRDVSNHRYPRTSSKSLTTATRREDFNPSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQS

Query:  SLKPWVPRFQKMFSHPSKDSDDGIGQSLANEGNALVNVKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLNG
        SLKPWVPRFQKMFSHPSKDSDDGIGQSLANEGNALV+VKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLNG
Subjt:  SLKPWVPRFQKMFSHPSKDSDDGIGQSLANEGNALVNVKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLNG

Query:  MKSGILKSTQPCFIACGDLNENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLSK
        MKSGILKSTQPCFIACGDLNENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLSK
Subjt:  MKSGILKSTQPCFIACGDLNENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLSK

Query:  KNCSMLVMTPCFKMSPPKSCVLLEPISESSHKDKKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLLE
        KNCSMLVMTPCFKMSPPKSCVLLEPISESSHK+KKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLLE
Subjt:  KNCSMLVMTPCFKMSPPKSCVLLEPISESSHKDKKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLLE

Query:  PSDSHSIESATSSGCHEATKSFSHQ--VGLSLPHIDNTPMLKECESVFRVGKCAGEETLKKELWMKFEAASTNPFPCDQALQKTSKKGFLDLLDEVSCD
        PSDSHSIESATSSGCHEATKSFSHQ  VGLSLPHIDNTPMLKECESVFRVGK AGEETLKKELWMKFEAASTNPFPCDQALQKTSKKGFLDLLDEVSCD
Subjt:  PSDSHSIESATSSGCHEATKSFSHQ--VGLSLPHIDNTPMLKECESVFRVGKCAGEETLKKELWMKFEAASTNPFPCDQALQKTSKKGFLDLLDEVSCD

A0A5D3C9D6 Uncharacterized protein5.3e-28499Show/hide
Query:  MPVMGKSSVKKPLRDVSNHRYPRTSSKSLTTATRREDFNPSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQS
        MPV GKSSVKKPLRDVSNHRYPRTSSKSLTTATRREDFNPSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQS
Subjt:  MPVMGKSSVKKPLRDVSNHRYPRTSSKSLTTATRREDFNPSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQS

Query:  SLKPWVPRFQKMFSHPSKDSDDGIGQSLANEGNALVNVKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLNG
        SLKPWVPRFQKMFSHPSKDSDDGIGQSLANEGNALV+VKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLNG
Subjt:  SLKPWVPRFQKMFSHPSKDSDDGIGQSLANEGNALVNVKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLNG

Query:  MKSGILKSTQPCFIACGDLNENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLSK
        MKSGILKSTQPCFIACGDLNENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLSK
Subjt:  MKSGILKSTQPCFIACGDLNENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLSK

Query:  KNCSMLVMTPCFKMSPPKSCVLLEPISESSHKDKKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLLE
        KNCSMLVMTPCFKMSPPKSCVLLEPISESSHK+KKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLLE
Subjt:  KNCSMLVMTPCFKMSPPKSCVLLEPISESSHKDKKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLLE

Query:  PSDSHSIESATSSGCHEATKSFSHQ--VGLSLPHIDNTPMLKECESVFRVGKCAGEETLKKELWMKFEAASTNPFPCDQALQKTSKKGFLDLLDEVSCD
        PSDSHSIESATSSGCHEATKSFSHQ  VGLSLPHIDNTPMLKECESVFRVGKCAGEETLKKELWMKFEAASTNPFPCDQALQKTSKKGFLDLLDEVSCD
Subjt:  PSDSHSIESATSSGCHEATKSFSHQ--VGLSLPHIDNTPMLKECESVFRVGKCAGEETLKKELWMKFEAASTNPFPCDQALQKTSKKGFLDLLDEVSCD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G12540.1 unknown protein1.9e-6037.3Show/hide
Query:  TRREDFN-----PSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQSSLK-----------------PWVPRFQ
        TRR+  N       + E  D  LD+L LV SD+ ++  QIDELVV+A + K + K G  E+ESF  VLSDM SSLK                 PW PR Q
Subjt:  TRREDFN-----PSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQSSLK-----------------PWVPRFQ

Query:  KMFSHPSKDSDDGIGQSL--ANEGNALVNVKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLNGMKSGILKS
        +  S      +D   QSL   NE   L +V      +SP+ T+ + LVSPSPLV WR   N ++GRQLFLLTPLP+ KS   KH   S L        K 
Subjt:  KMFSHPSKDSDDGIGQSL--ANEGNALVNVKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLNGMKSGILKS

Query:  TQPCFIACGDLNENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLSKKNCSMLVM
          P  +A       PLE +       K +  D+  LG   L+  G   S+V       N +E    +P                 S P+L +K  S L+M
Subjt:  TQPCFIACGDLNENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLSKKNCSMLVM

Query:  TPCFKMSPPKSCVLLEPISESSHKDKKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLLEPSDSHS--
        TPC K+SPPKSC + +P+ ESS   K+   K+T   +G    SSG + +D L  KYPELLGIQ A  T  +  +E+SP W+ SPPKTCVL+EP +     
Subjt:  TPCFKMSPPKSCVLLEPISESSHKDKKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLLEPSDSHS--

Query:  IESATSSGCHEATKSFSHQVGLSLPH-IDNTPMLKECESVFRVGKC-AGEETLKKELWMKFEAAST-----NPFPCDQALQKTSKKGFLDLLDEVS
         E+  S           H    S+   +++TP+ KE ES+    +  AGE TLKKELW +FE A+      N       ++  +KK F+++L+EVS
Subjt:  IESATSSGCHEATKSFSHQVGLSLPH-IDNTPMLKECESVFRVGKC-AGEETLKKELWMKFEAAST-----NPFPCDQALQKTSKKGFLDLLDEVS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGGTCATGGGAAAATCATCTGTGAAGAAGCCATTGAGGGACGTATCCAACCACAGATACCCTCGAACTTCCTCTAAATCTCTCACTACAGCCACCAGAAGGGAAGA
TTTCAACCCGTCTAAGGTTGAAGACCTAGACGATTCTCTCGATCGCCTCCTTCTAGTTCAGTCCGATCTCTCCGCCCTCACTCACCAGATCGATGAACTCGTTGTGAAAG
CACTGGAATTGAAGGAAATAGACAAACAGGGGAGGAAAGAGATCGAATCTTTCACTCATGTCTTATCCGATATGCAATCTTCTTTGAAGCCCTGGGTTCCTAGGTTTCAG
AAGATGTTCTCTCACCCATCAAAAGATTCTGATGATGGTATAGGACAATCATTGGCTAATGAAGGCAATGCATTGGTTAACGTTAAGGAAAATAACGTTGCTGACAGCCC
AGACCATACTGAAGCTCAAGATTTGGTCTCTCCTTCACCTCTCGTATCATGGCGTGCTGGTTGCAATATTGAGAGAGGAAGACAATTGTTTTTACTCACTCCTCTTCCTA
TTTCTAAATCACTTTCATCGAAACATGTGGCTAAATCTGTTCTCAACGGAATGAAGTCGGGTATACTTAAGAGTACACAACCATGTTTTATTGCATGTGGAGATTTAAAT
GAAAATCCACTTGAATGTAATGTAATTGAGCCTAGTGTTGGCAAGCCTTCTGGGTCTGATTTATCAACACTTGGTGAGAATCTGCTTGAATGCAATGGAACGGAGGGTAG
TGTTGTAGAGAATAGTTTGCTAGAAGGTAATGCACTTGAGCCTAGCGGTGCTGAGCCTTCTGGGTCTGATTTAACACAGGCAGAGATAATTCATCAGCGTGGATTTGCTT
CCCCGCCATTGTTATCAAAGAAGAATTGCTCTATGTTAGTTATGACACCGTGCTTTAAAATGTCTCCTCCAAAATCTTGTGTGCTGCTTGAACCCATTTCAGAGTCATCA
CATAAAGATAAAAAAAGGTTTTACAAGGCCACGCCTTTTCCTGTTGGAGTTCATGATTGCTCTTCTGGAAGTGACGCTTCTGATGGGTTGGCTTTAAAGTATCCAGAACT
CTTAGGTATTCAACAAGCTCACAAAACAGGAATCAGAAAGAAAGTTGAAGCCTCACCAGACTGGTATATGTCACCTCCAAAAACATGCGTTTTACTGGAACCATCTGATT
CTCATTCCATTGAAAGTGCTACTTCCAGTGGATGTCACGAAGCCACTAAATCTTTCAGCCACCAAGTTGGGTTGAGCTTGCCGCACATAGATAACACTCCCATGTTGAAG
GAATGTGAAAGTGTATTCCGGGTTGGGAAATGTGCTGGGGAGGAGACTCTTAAGAAAGAACTATGGATGAAATTTGAAGCAGCCTCAACCAATCCATTTCCTTGTGATCA
AGCTCTTCAAAAGACATCGAAGAAAGGCTTCTTGGATTTGTTGGATGAGGTTTCATGTGATTAG
mRNA sequenceShow/hide mRNA sequence
TAGGGTGAGTGTGTTGGCCCATTCCATTGTCTGATTTGTAGAACATTTGATTTTACAACAAAAAGCCACTTCAAAATCCCGCCAACGGACACAATTTCAAAAAGTAATCA
GGCCATCCTTTGCAGGTTGAACAAGGGCATCATACCGGCGCCGATCGACATGCCGGTCATGGGAAAATCATCTGTGAAGAAGCCATTGAGGGACGTATCCAACCACAGAT
ACCCTCGAACTTCCTCTAAATCTCTCACTACAGCCACCAGAAGGGAAGATTTCAACCCGTCTAAGGTTGAAGACCTAGACGATTCTCTCGATCGCCTCCTTCTAGTTCAG
TCCGATCTCTCCGCCCTCACTCACCAGATCGATGAACTCGTTGTGAAAGCACTGGAATTGAAGGAAATAGACAAACAGGGGAGGAAAGAGATCGAATCTTTCACTCATGT
CTTATCCGATATGCAATCTTCTTTGAAGCCCTGGGTTCCTAGGTTTCAGAAGATGTTCTCTCACCCATCAAAAGATTCTGATGATGGTATAGGACAATCATTGGCTAATG
AAGGCAATGCATTGGTTAACGTTAAGGAAAATAACGTTGCTGACAGCCCAGACCATACTGAAGCTCAAGATTTGGTCTCTCCTTCACCTCTCGTATCATGGCGTGCTGGT
TGCAATATTGAGAGAGGAAGACAATTGTTTTTACTCACTCCTCTTCCTATTTCTAAATCACTTTCATCGAAACATGTGGCTAAATCTGTTCTCAACGGAATGAAGTCGGG
TATACTTAAGAGTACACAACCATGTTTTATTGCATGTGGAGATTTAAATGAAAATCCACTTGAATGTAATGTAATTGAGCCTAGTGTTGGCAAGCCTTCTGGGTCTGATT
TATCAACACTTGGTGAGAATCTGCTTGAATGCAATGGAACGGAGGGTAGTGTTGTAGAGAATAGTTTGCTAGAAGGTAATGCACTTGAGCCTAGCGGTGCTGAGCCTTCT
GGGTCTGATTTAACACAGGCAGAGATAATTCATCAGCGTGGATTTGCTTCCCCGCCATTGTTATCAAAGAAGAATTGCTCTATGTTAGTTATGACACCGTGCTTTAAAAT
GTCTCCTCCAAAATCTTGTGTGCTGCTTGAACCCATTTCAGAGTCATCACATAAAGATAAAAAAAGGTTTTACAAGGCCACGCCTTTTCCTGTTGGAGTTCATGATTGCT
CTTCTGGAAGTGACGCTTCTGATGGGTTGGCTTTAAAGTATCCAGAACTCTTAGGTATTCAACAAGCTCACAAAACAGGAATCAGAAAGAAAGTTGAAGCCTCACCAGAC
TGGTATATGTCACCTCCAAAAACATGCGTTTTACTGGAACCATCTGATTCTCATTCCATTGAAAGTGCTACTTCCAGTGGATGTCACGAAGCCACTAAATCTTTCAGCCA
CCAAGTTGGGTTGAGCTTGCCGCACATAGATAACACTCCCATGTTGAAGGAATGTGAAAGTGTATTCCGGGTTGGGAAATGTGCTGGGGAGGAGACTCTTAAGAAAGAAC
TATGGATGAAATTTGAAGCAGCCTCAACCAATCCATTTCCTTGTGATCAAGCTCTTCAAAAGACATCGAAGAAAGGCTTCTTGGATTTGTTGGATGAGGTTTCATGTGAT
TAGATGTGGAACTGATACACCTTTGAAATGCATACCCCGAAAGGCATTGCAGCATATGTTGTGGTAGTGCAGAAAGAATCTATGAAGCTGTACTTCTCATAATGATGGTG
GAAGCAGATTTGATGTAAAGAATGCCTGTGTTGTACTATTAGGCTACATTTTGTCCTGTAAGCTTGCAAAAGTTGTATATTCTCAATGATGCATACAACTTTCCACAGAT
ATTTTGAGCCTTCAGTGATAAATTGGTTATCCAATTTAATTAAATTTTCTTTGAAGGATATTCATGAGTGAACTCCAGCAAGCCAAAGTCCATTGTACATCTGATCATCT
GGCCGATGAAGGCGAAGCACAACATTTGCCGGCATGAATCCGTTGGTGCCATTCATTAAATATTTCGACTTCTTCAATACAGTAACATGGATAAGCAGAGATGGTGTAAT
TGATTTGTTTCGAAGAGTTTTATTGTTGAATTTAGACGTCTGTAGTATGAATTGAAAATTTTATAAGTTTTGAAGGAAAAAAAGGAAGAGTAGGTTGCTTCTAAAACACC
TTAGCCTCTTGTTTAAGTTAGCAATTGTGAAGAAATTAGAGATAAATCACTCATCTAATGTACAAAATGCAACTCAAGGACCAAAAGTTGTTGAAACCTTCAATCTATCA
AATACAATTGGGTAGTATTAGTAACAAAACTGAGTCAATCCACAGCGAGCTATGTGTTTTCTTCTAATCAATGTTCAAATTTCTTACC
Protein sequenceShow/hide protein sequence
MPVMGKSSVKKPLRDVSNHRYPRTSSKSLTTATRREDFNPSKVEDLDDSLDRLLLVQSDLSALTHQIDELVVKALELKEIDKQGRKEIESFTHVLSDMQSSLKPWVPRFQ
KMFSHPSKDSDDGIGQSLANEGNALVNVKENNVADSPDHTEAQDLVSPSPLVSWRAGCNIERGRQLFLLTPLPISKSLSSKHVAKSVLNGMKSGILKSTQPCFIACGDLN
ENPLECNVIEPSVGKPSGSDLSTLGENLLECNGTEGSVVENSLLEGNALEPSGAEPSGSDLTQAEIIHQRGFASPPLLSKKNCSMLVMTPCFKMSPPKSCVLLEPISESS
HKDKKRFYKATPFPVGVHDCSSGSDASDGLALKYPELLGIQQAHKTGIRKKVEASPDWYMSPPKTCVLLEPSDSHSIESATSSGCHEATKSFSHQVGLSLPHIDNTPMLK
ECESVFRVGKCAGEETLKKELWMKFEAASTNPFPCDQALQKTSKKGFLDLLDEVSCD