; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g0570 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g0570
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionSASA domain-containing protein
Genome locationMC04:4978624..4981314
RNA-Seq ExpressionMC04g0570
SyntenyMC04g0570
Gene Ontology termsNA
InterPro domainsIPR005181 - Sialate O-acetylesterase domain
IPR036514 - SGNH hydrolase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7011640.1 putative carbohydrate esterase [Cucurbita argyrosperma subsp. argyrosperma]5.04e-14482.33Show/hide
Query:  MATTSDPDP--------DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRE
        MA T+D DP         P NKQIFILSGQSNMAGRGGV KK  RWDGVVPPE+QPHPSI RLSAKL WE A EPLHADIDTKKTCGVGPGM+FANGVRE
Subjt:  MATTSDPDP--------DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRE

Query:  RVGVVALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIE
        RVG VALVPCAVGGTAIKEWARGEKLYEDMVKRAR SV+ GGEI+A+LWFQGESDTSTE DA AYQGNME FVANVRRDL LPSLPIIQVALASG +YIE
Subjt:  RVGVVALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIE

Query:  KVREAQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS
        +VREAQLGM++ENVVCVDAKGL+L+EDNLHL+TQ+QV+LGQMLA AYL+
Subjt:  KVREAQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS

XP_008455001.1 PREDICTED: probable carbohydrate esterase At4g34215 [Cucumis melo]2.40e-14483.95Show/hide
Query:  MATTSDPDP----DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGV
        MATT+D DP     P NKQIFILSGQSNM+GRGGV KK ++WDGVVPPE+QPHPSI RLSAK  WEAAREPLHADIDTKKTCGVGPGM FANGVRERVG 
Subjt:  MATTSDPDP----DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGV

Query:  VALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVRE
        VALVPCAVGGTAI+EWARGEKLYE+MVKRARESV+GGGEIKA+LWFQGESDT+TE DA AY+GNME FVANVRRDLALPSLPIIQVALASG +YIEKVRE
Subjt:  VALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVRE

Query:  AQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAY
        AQLGMK+EN+VCVDA GL+LQEDNLHL+T SQV+LGQML  AY
Subjt:  AQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAY

XP_022148694.1 probable carbohydrate esterase At4g34215 [Momordica charantia]3.65e-177100Show/hide
Query:  MATTSDPDPDPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGVVALV
        MATTSDPDPDPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGVVALV
Subjt:  MATTSDPDPDPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGVVALV

Query:  PCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVREAQLG
        PCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVREAQLG
Subjt:  PCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVREAQLG

Query:  MKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLSPTHGVL
        MKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLSPTHGVL
Subjt:  MKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLSPTHGVL

XP_022952956.1 probable carbohydrate esterase At4g34215 [Cucurbita moschata]1.02e-14382.33Show/hide
Query:  MATTSDPDP--------DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRE
        MA T+D DP         P NKQIFILSGQSNMAGRGGV KK  RWDGVVPPE+QPHPSI RL AKL WE A EPLHADIDTKKTCGVGPGM+FANGVRE
Subjt:  MATTSDPDP--------DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRE

Query:  RVGVVALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIE
        RVG VALVPCAVGGTAIKEWARGEKLYEDMVKRAR SV+ GGEI+A+LWFQGESDTSTE DA AYQGNME FVANVRRDLALPSLPIIQVALASG +YIE
Subjt:  RVGVVALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIE

Query:  KVREAQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS
        +VREAQLGM++ENVVCVDAKGL+L+EDNLHL+TQ+QV+LGQMLA AYL+
Subjt:  KVREAQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS

XP_023553837.1 probable carbohydrate esterase At4g34215 [Cucurbita pepo subsp. pepo]1.06e-14583.53Show/hide
Query:  MATTSDPDP--------DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRE
        MA T+D DP         P NKQIFILSGQSNMAGRGGV KK  RWDGVVPPE+QPHPSI RLSAKL WE A EPLHADIDTKKTCGVGPGM+FANGVRE
Subjt:  MATTSDPDP--------DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRE

Query:  RVGVVALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIE
        RVG VALVPCAVGGTAIKEWARGEKLYEDMVKRAR SV+ GGEI+A+LWFQGESDTSTE DA AYQGNME FVANVRRDLALPSLPIIQVALASG +YIE
Subjt:  RVGVVALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIE

Query:  KVREAQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS
        KVREAQLGM++ENVVCVDAKGL+LQEDNLHL+TQ+QV+LGQMLA AYL+
Subjt:  KVREAQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS

TrEMBL top hitse value%identityAlignment
A0A0A0K303 SASA domain-containing protein3.20e-14283.13Show/hide
Query:  MATTSDPDP----DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGV
        MATT+D DP     P NKQIFILSGQSNMAGRGGV KK +RWDGVVPPE+ PHPSI RLSAK  WEAA EPLHADIDTKKTCGVGPGM FANGVRERVG 
Subjt:  MATTSDPDP----DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGV

Query:  VALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVRE
        VALVPCAVGGTAI+EWARGEKLYE+MVKRAR+SV+GGGEIKA+LWFQGESDTSTE DA AYQGNME  VANVRRDLALPSLPIIQVALASG +Y +KVRE
Subjt:  VALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVRE

Query:  AQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAY
        AQLGMK+EN+VCVDA GL+LQEDNLHL+T SQV+LGQML  AY
Subjt:  AQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAY

A0A1S3BZF3 probable carbohydrate esterase At4g342151.16e-14483.95Show/hide
Query:  MATTSDPDP----DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGV
        MATT+D DP     P NKQIFILSGQSNM+GRGGV KK ++WDGVVPPE+QPHPSI RLSAK  WEAAREPLHADIDTKKTCGVGPGM FANGVRERVG 
Subjt:  MATTSDPDP----DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGV

Query:  VALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVRE
        VALVPCAVGGTAI+EWARGEKLYE+MVKRARESV+GGGEIKA+LWFQGESDT+TE DA AY+GNME FVANVRRDLALPSLPIIQVALASG +YIEKVRE
Subjt:  VALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVRE

Query:  AQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAY
        AQLGMK+EN+VCVDA GL+LQEDNLHL+T SQV+LGQML  AY
Subjt:  AQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAY

A0A6J1D4Q9 probable carbohydrate esterase At4g342151.77e-177100Show/hide
Query:  MATTSDPDPDPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGVVALV
        MATTSDPDPDPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGVVALV
Subjt:  MATTSDPDPDPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGVVALV

Query:  PCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVREAQLG
        PCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVREAQLG
Subjt:  PCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVREAQLG

Query:  MKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLSPTHGVL
        MKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLSPTHGVL
Subjt:  MKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLSPTHGVL

A0A6J1GLP0 probable carbohydrate esterase At4g342154.92e-14482.33Show/hide
Query:  MATTSDPDP--------DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRE
        MA T+D DP         P NKQIFILSGQSNMAGRGGV KK  RWDGVVPPE+QPHPSI RL AKL WE A EPLHADIDTKKTCGVGPGM+FANGVRE
Subjt:  MATTSDPDP--------DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRE

Query:  RVGVVALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIE
        RVG VALVPCAVGGTAIKEWARGEKLYEDMVKRAR SV+ GGEI+A+LWFQGESDTSTE DA AYQGNME FVANVRRDLALPSLPIIQVALASG +YIE
Subjt:  RVGVVALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIE

Query:  KVREAQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS
        +VREAQLGM++ENVVCVDAKGL+L+EDNLHL+TQ+QV+LGQMLA AYL+
Subjt:  KVREAQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS

A0A6J1I4P5 probable carbohydrate esterase At4g342152.00e-14381.93Show/hide
Query:  MATTSDPDP--------DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRE
        MA T+D  P         P NKQIFILSGQSNMAGRGGV KK  RWDGVVPPE+QPHPSI RLSAKL WE A EPLHADID+KKTCGVGPGM+FANGVRE
Subjt:  MATTSDPDP--------DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRE

Query:  RVGVVALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIE
        RVG VALVPCAVGGTAI+EWARGEKLYEDMVKRAR SV+ GGEI+A+LWFQGESDTSTE DA AYQGNME FVANVRRDLALPSLPIIQVALASG +YIE
Subjt:  RVGVVALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIE

Query:  KVREAQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS
        KVREAQLGM++ENVVCVDAKGL+L+EDNLHL+TQ+QV+LGQMLA AYL+
Subjt:  KVREAQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS

SwissProt top hitse value%identityAlignment
Q8L9J9 Probable carbohydrate esterase At4g342151.1e-8065.56Show/hide
Query:  PDPQNKQIFILSGQSNMAGRGGVSKKPKR----WDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERV----GVVALV
        P P N QIFILSGQSNMAGRGGV K        WD ++PPE  P+ SILRLSA L WE A EPLH DIDT K CGVGPGM+FAN V+ R+     V+ LV
Subjt:  PDPQNKQIFILSGQSNMAGRGGVSKKPKR----WDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERV----GVVALV

Query:  PCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVREAQLG
        PCA GGTAIKEW RG  LYE MVKR  ES + GGEIKA+LW+QGESD     DA +Y  NM+  + N+R DL LPSLPIIQVA+ASG  YI+KVREAQLG
Subjt:  PCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVREAQLG

Query:  MKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS
        +K+ NVVCVDAKGL L+ DNLHL+T++QV LG  LAQAYLS
Subjt:  MKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS

Arabidopsis top hitse value%identityAlignment
AT3G53010.1 Domain of unknown function (DUF303)8.7e-7055.7Show/hide
Query:  QNKQIFILSGQSNMAGRGGV----SKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGVVALVPCAVGGT
        +N  IFIL+GQSNMAGRGGV    +     WDGV+PPE + +PSILRL++KLEW+ A+EPLH DID  KT GVGPGM FAN V  R G V LVPC++GGT
Subjt:  QNKQIFILSGQSNMAGRGGV----SKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGVVALVPCAVGGT

Query:  AIKEWARGEKLYEDMVKRARESVR--GGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGF-EYIEKVREAQLGMKIE
         + +W +GE LYE+ VKRA+ ++   GGG  +A+LW+QGESDT    DA  Y+  + +F +++R DL  P+LPIIQVALA+G   Y++ VR+AQL   +E
Subjt:  AIKEWARGEKLYEDMVKRARESVR--GGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGF-EYIEKVREAQLGMKIE

Query:  NVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS
        NV CVDA+GL L+ D LHL+T SQV LG M+A+++L+
Subjt:  NVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS

AT4G34215.1 Domain of unknown function (DUF303)7.6e-8265.56Show/hide
Query:  PDPQNKQIFILSGQSNMAGRGGVSKKPKR----WDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERV----GVVALV
        P P N QIFILSGQSNMAGRGGV K        WD ++PPE  P+ SILRLSA L WE A EPLH DIDT K CGVGPGM+FAN V+ R+     V+ LV
Subjt:  PDPQNKQIFILSGQSNMAGRGGVSKKPKR----WDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERV----GVVALV

Query:  PCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVREAQLG
        PCA GGTAIKEW RG  LYE MVKR  ES + GGEIKA+LW+QGESD     DA +Y  NM+  + N+R DL LPSLPIIQVA+ASG  YI+KVREAQLG
Subjt:  PCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVREAQLG

Query:  MKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS
        +K+ NVVCVDAKGL L+ DNLHL+T++QV LG  LAQAYLS
Subjt:  MKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS

AT4G34215.2 Domain of unknown function (DUF303)7.6e-8265.56Show/hide
Query:  PDPQNKQIFILSGQSNMAGRGGVSKKPKR----WDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERV----GVVALV
        P P N QIFILSGQSNMAGRGGV K        WD ++PPE  P+ SILRLSA L WE A EPLH DIDT K CGVGPGM+FAN V+ R+     V+ LV
Subjt:  PDPQNKQIFILSGQSNMAGRGGVSKKPKR----WDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERV----GVVALV

Query:  PCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVREAQLG
        PCA GGTAIKEW RG  LYE MVKR  ES + GGEIKA+LW+QGESD     DA +Y  NM+  + N+R DL LPSLPIIQVA+ASG  YI+KVREAQLG
Subjt:  PCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVREAQLG

Query:  MKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS
        +K+ NVVCVDAKGL L+ DNLHL+T++QV LG  LAQAYLS
Subjt:  MKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGACGACGAGCGATCCAGATCCAGATCCCCAAAACAAGCAGATCTTCATCTTATCAGGGCAGAGCAACATGGCCGGGCGCGGCGGCGTGTCGAAGAAGCCCAAGCG
CTGGGACGGCGTCGTTCCGCCGGAGTCGCAGCCGCACCCCTCGATACTCCGGCTGAGCGCGAAGCTGGAGTGGGAGGCGGCGCGTGAGCCCCTCCACGCGGACATCGACA
CGAAGAAGACGTGCGGGGTGGGGCCGGGGATGTCGTTCGCAAACGGCGTGAGGGAGCGCGTGGGAGTGGTGGCGCTGGTGCCGTGCGCGGTGGGGGGGACGGCCATAAAG
GAGTGGGCGCGTGGAGAGAAGCTGTACGAGGACATGGTGAAGAGGGCGAGGGAGAGCGTGAGGGGCGGCGGAGAGATCAAGGCGCTTCTGTGGTTCCAAGGGGAGAGCGA
CACCAGTACGGAAGCTGATGCGGTCGCGTACCAGGGGAATATGGAGGAGTTCGTTGCTAATGTGCGCCGCGACCTCGCCTTGCCTTCCCTCCCCATAATTCAGGTGGCAC
TTGCATCTGGATTTGAGTACATTGAAAAAGTGAGAGAAGCACAATTGGGAATGAAAATAGAGAATGTGGTGTGTGTGGATGCAAAGGGGCTGCAACTCCAAGAAGACAAC
CTCCACCTCTCAACTCAGTCTCAGGTTGTTTTGGGTCAAATGCTGGCTCAGGCTTACCTCTCCCCAACCCATGGTGTCCTCTAA
mRNA sequenceShow/hide mRNA sequence
CATAACTAAATTATCAATTTAAAATTTAGAAGGCTGATATAAAATAATTTATAACTTAAAATTATATAATTTATTTTTATTAGTTATCAATCGATGTTCACCCTGACATT
AAATATATACTTTTAAAGTAATGACACGAAATAAACGTATATATAATTTATCTAATCCTAGATATCACTGCCTTTTTCGTAACCCTCTACCAAACAGCCACCGATGGTTT
AAAACTCTCATGAATCGCGTGTTTTGGGCGGTACAAGTTAATAAGTAGATGAAGAGAAAAGCGTCCATCCAAAACGCACGTTTCGACAAAAGCATTTATAATTATCGCCG
AGACGAGACCCCACCGGACGGAATTATATAAACCAACCACCACCACCACCACCGTCCCACAAGTCCTCGAATCGGTATCATGGCGACGACGAGCGATCCAGATCCAGATC
CCCAAAACAAGCAGATCTTCATCTTATCAGGGCAGAGCAACATGGCCGGGCGCGGCGGCGTGTCGAAGAAGCCCAAGCGCTGGGACGGCGTCGTTCCGCCGGAGTCGCAG
CCGCACCCCTCGATACTCCGGCTGAGCGCGAAGCTGGAGTGGGAGGCGGCGCGTGAGCCCCTCCACGCGGACATCGACACGAAGAAGACGTGCGGGGTGGGGCCGGGGAT
GTCGTTCGCAAACGGCGTGAGGGAGCGCGTGGGAGTGGTGGCGCTGGTGCCGTGCGCGGTGGGGGGGACGGCCATAAAGGAGTGGGCGCGTGGAGAGAAGCTGTACGAGG
ACATGGTGAAGAGGGCGAGGGAGAGCGTGAGGGGCGGCGGAGAGATCAAGGCGCTTCTGTGGTTCCAAGGGGAGAGCGACACCAGTACGGAAGCTGATGCGGTCGCGTAC
CAGGGGAATATGGAGGAGTTCGTTGCTAATGTGCGCCGCGACCTCGCCTTGCCTTCCCTCCCCATAATTCAGGTGGCACTTGCATCTGGATTTGAGTACATTGAAAAAGT
GAGAGAAGCACAATTGGGAATGAAAATAGAGAATGTGGTGTGTGTGGATGCAAAGGGGCTGCAACTCCAAGAAGACAACCTCCACCTCTCAACTCAGTCTCAGGTTGTTT
TGGGTCAAATGCTGGCTCAGGCTTACCTCTCCCCAACCCATGGTGTCCTCTAAGCCTAATCCTAATATTGTTTGGTGTTTGCCACTTGTGTGTTTTTTTGTTATAAATGT
AATGTACCTACATGTTGTAGGTCTTATCCCATCCTTTATTACGACTAGTGCGTAACACGTGATATGCACGTGGTATTTTTAGTTTTATTTTAGTATATGGA
Protein sequenceShow/hide protein sequence
MATTSDPDPDPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGVVALVPCAVGGTAIK
EWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVREAQLGMKIENVVCVDAKGLQLQEDN
LHLSTQSQVVLGQMLAQAYLSPTHGVL