; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS012226 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS012226
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionSASA domain-containing protein
Genome locationscaffold797:604394..606519
RNA-Seq ExpressionMS012226
SyntenyMS012226
Gene Ontology termsNA
InterPro domainsIPR005181 - Sialate O-acetylesterase domain
IPR036514 - SGNH hydrolase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7011640.1 putative carbohydrate esterase [Cucurbita argyrosperma subsp. argyrosperma]9.5e-11182.33Show/hide
Query:  MATTSDPDP--------DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRE
        MA T+D DP         P NKQIFILSGQSNMAGRGGV KK  RWDGVVPPE+QPHPSI RLSAKL WE A EPLHADIDTKKTCGVGPGM+FANGVRE
Subjt:  MATTSDPDP--------DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRE

Query:  RVGVVALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIE
        RVG VALVPCAVGGTAIKEWARGEKLYEDMVKRAR SV+ GGEI+A+LWFQGESDTSTE DA AYQGNME FVANVRRDL LPSLPIIQVALASG +YIE
Subjt:  RVGVVALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIE

Query:  KVREAQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS
        +VREAQLGM++ENVVCVDAKGL+L+EDNLHL+TQ+QV+LGQMLA AYL+
Subjt:  KVREAQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS

XP_008455001.1 PREDICTED: probable carbohydrate esterase At4g34215 [Cucumis melo]7.3e-11183.27Show/hide
Query:  MATTSDPDP----DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGV
        MATT+D DP     P NKQIFILSGQSNM+GRGGV KK ++WDGVVPPE+QPHPSI RLSAK  WEAAREPLHADIDTKKTCGVGPGM FANGVRERVG 
Subjt:  MATTSDPDP----DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGV

Query:  VALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVRE
        VALVPCAVGGTAI+EWARGEKLYE+MVKRARESV+GGGEIKA+LWFQGESDT+TE DA AY+GNME FVANVRRDLALPSLPIIQVALASG +YIEKVRE
Subjt:  VALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVRE

Query:  AQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS
        AQLGMK+EN+VCVDA GL+LQEDNLHL+T SQV+LGQML  AY +
Subjt:  AQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS

XP_022148694.1 probable carbohydrate esterase At4g34215 [Momordica charantia]2.7e-134100Show/hide
Query:  MATTSDPDPDPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGVVALV
        MATTSDPDPDPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGVVALV
Subjt:  MATTSDPDPDPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGVVALV

Query:  PCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVREAQLG
        PCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVREAQLG
Subjt:  PCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVREAQLG

Query:  MKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLSPTH
        MKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLSPTH
Subjt:  MKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLSPTH

XP_022952956.1 probable carbohydrate esterase At4g34215 [Cucurbita moschata]1.6e-11082.33Show/hide
Query:  MATTSDPDP--------DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRE
        MA T+D DP         P NKQIFILSGQSNMAGRGGV KK  RWDGVVPPE+QPHPSI RL AKL WE A EPLHADIDTKKTCGVGPGM+FANGVRE
Subjt:  MATTSDPDP--------DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRE

Query:  RVGVVALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIE
        RVG VALVPCAVGGTAIKEWARGEKLYEDMVKRAR SV+ GGEI+A+LWFQGESDTSTE DA AYQGNME FVANVRRDLALPSLPIIQVALASG +YIE
Subjt:  RVGVVALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIE

Query:  KVREAQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS
        +VREAQLGM++ENVVCVDAKGL+L+EDNLHL+TQ+QV+LGQMLA AYL+
Subjt:  KVREAQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS

XP_023553837.1 probable carbohydrate esterase At4g34215 [Cucurbita pepo subsp. pepo]5.0e-11283.53Show/hide
Query:  MATTSDPDP--------DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRE
        MA T+D DP         P NKQIFILSGQSNMAGRGGV KK  RWDGVVPPE+QPHPSI RLSAKL WE A EPLHADIDTKKTCGVGPGM+FANGVRE
Subjt:  MATTSDPDP--------DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRE

Query:  RVGVVALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIE
        RVG VALVPCAVGGTAIKEWARGEKLYEDMVKRAR SV+ GGEI+A+LWFQGESDTSTE DA AYQGNME FVANVRRDLALPSLPIIQVALASG +YIE
Subjt:  RVGVVALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIE

Query:  KVREAQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS
        KVREAQLGM++ENVVCVDAKGL+LQEDNLHL+TQ+QV+LGQMLA AYL+
Subjt:  KVREAQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS

TrEMBL top hitse value%identityAlignment
A0A0A0K303 SASA domain-containing protein1.9e-10982.45Show/hide
Query:  MATTSDPDP----DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGV
        MATT+D DP     P NKQIFILSGQSNMAGRGGV KK +RWDGVVPPE+ PHPSI RLSAK  WEAA EPLHADIDTKKTCGVGPGM FANGVRERVG 
Subjt:  MATTSDPDP----DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGV

Query:  VALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVRE
        VALVPCAVGGTAI+EWARGEKLYE+MVKRAR+SV+GGGEIKA+LWFQGESDTSTE DA AYQGNME  VANVRRDLALPSLPIIQVALASG +Y +KVRE
Subjt:  VALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVRE

Query:  AQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS
        AQLGMK+EN+VCVDA GL+LQEDNLHL+T SQV+LGQML  AY +
Subjt:  AQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS

A0A1S3BZF3 probable carbohydrate esterase At4g342153.5e-11183.27Show/hide
Query:  MATTSDPDP----DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGV
        MATT+D DP     P NKQIFILSGQSNM+GRGGV KK ++WDGVVPPE+QPHPSI RLSAK  WEAAREPLHADIDTKKTCGVGPGM FANGVRERVG 
Subjt:  MATTSDPDP----DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGV

Query:  VALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVRE
        VALVPCAVGGTAI+EWARGEKLYE+MVKRARESV+GGGEIKA+LWFQGESDT+TE DA AY+GNME FVANVRRDLALPSLPIIQVALASG +YIEKVRE
Subjt:  VALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVRE

Query:  AQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS
        AQLGMK+EN+VCVDA GL+LQEDNLHL+T SQV+LGQML  AY +
Subjt:  AQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS

A0A6J1D4Q9 probable carbohydrate esterase At4g342151.3e-134100Show/hide
Query:  MATTSDPDPDPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGVVALV
        MATTSDPDPDPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGVVALV
Subjt:  MATTSDPDPDPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGVVALV

Query:  PCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVREAQLG
        PCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVREAQLG
Subjt:  PCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVREAQLG

Query:  MKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLSPTH
        MKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLSPTH
Subjt:  MKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLSPTH

A0A6J1GLP0 probable carbohydrate esterase At4g342157.8e-11182.33Show/hide
Query:  MATTSDPDP--------DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRE
        MA T+D DP         P NKQIFILSGQSNMAGRGGV KK  RWDGVVPPE+QPHPSI RL AKL WE A EPLHADIDTKKTCGVGPGM+FANGVRE
Subjt:  MATTSDPDP--------DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRE

Query:  RVGVVALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIE
        RVG VALVPCAVGGTAIKEWARGEKLYEDMVKRAR SV+ GGEI+A+LWFQGESDTSTE DA AYQGNME FVANVRRDLALPSLPIIQVALASG +YIE
Subjt:  RVGVVALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIE

Query:  KVREAQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS
        +VREAQLGM++ENVVCVDAKGL+L+EDNLHL+TQ+QV+LGQMLA AYL+
Subjt:  KVREAQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS

A0A6J1I4P5 probable carbohydrate esterase At4g342152.3e-11081.93Show/hide
Query:  MATTSDPDP--------DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRE
        MA T+D  P         P NKQIFILSGQSNMAGRGGV KK  RWDGVVPPE+QPHPSI RLSAKL WE A EPLHADID+KKTCGVGPGM+FANGVRE
Subjt:  MATTSDPDP--------DPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRE

Query:  RVGVVALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIE
        RVG VALVPCAVGGTAI+EWARGEKLYEDMVKRAR SV+ GGEI+A+LWFQGESDTSTE DA AYQGNME FVANVRRDLALPSLPIIQVALASG +YIE
Subjt:  RVGVVALVPCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIE

Query:  KVREAQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS
        KVREAQLGM++ENVVCVDAKGL+L+EDNLHL+TQ+QV+LGQMLA AYL+
Subjt:  KVREAQLGMKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS

SwissProt top hitse value%identityAlignment
Q8L9J9 Probable carbohydrate esterase At4g342151.1e-8065.56Show/hide
Query:  PDPQNKQIFILSGQSNMAGRGGVSKKPKR----WDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERV----GVVALV
        P P N QIFILSGQSNMAGRGGV K        WD ++PPE  P+ SILRLSA L WE A EPLH DIDT K CGVGPGM+FAN V+ R+     V+ LV
Subjt:  PDPQNKQIFILSGQSNMAGRGGVSKKPKR----WDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERV----GVVALV

Query:  PCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVREAQLG
        PCA GGTAIKEW RG  LYE MVKR  ES + GGEIKA+LW+QGESD     DA +Y  NM+  + N+R DL LPSLPIIQVA+ASG  YI+KVREAQLG
Subjt:  PCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVREAQLG

Query:  MKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS
        +K+ NVVCVDAKGL L+ DNLHL+T++QV LG  LAQAYLS
Subjt:  MKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS

Arabidopsis top hitse value%identityAlignment
AT3G53010.1 Domain of unknown function (DUF303)8.7e-7055.7Show/hide
Query:  QNKQIFILSGQSNMAGRGGV----SKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGVVALVPCAVGGT
        +N  IFIL+GQSNMAGRGGV    +     WDGV+PPE + +PSILRL++KLEW+ A+EPLH DID  KT GVGPGM FAN V  R G V LVPC++GGT
Subjt:  QNKQIFILSGQSNMAGRGGV----SKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGVVALVPCAVGGT

Query:  AIKEWARGEKLYEDMVKRARESVR--GGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGF-EYIEKVREAQLGMKIE
         + +W +GE LYE+ VKRA+ ++   GGG  +A+LW+QGESDT    DA  Y+  + +F +++R DL  P+LPIIQVALA+G   Y++ VR+AQL   +E
Subjt:  AIKEWARGEKLYEDMVKRARESVR--GGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGF-EYIEKVREAQLGMKIE

Query:  NVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS
        NV CVDA+GL L+ D LHL+T SQV LG M+A+++L+
Subjt:  NVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS

AT4G34215.1 Domain of unknown function (DUF303)7.6e-8265.56Show/hide
Query:  PDPQNKQIFILSGQSNMAGRGGVSKKPKR----WDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERV----GVVALV
        P P N QIFILSGQSNMAGRGGV K        WD ++PPE  P+ SILRLSA L WE A EPLH DIDT K CGVGPGM+FAN V+ R+     V+ LV
Subjt:  PDPQNKQIFILSGQSNMAGRGGVSKKPKR----WDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERV----GVVALV

Query:  PCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVREAQLG
        PCA GGTAIKEW RG  LYE MVKR  ES + GGEIKA+LW+QGESD     DA +Y  NM+  + N+R DL LPSLPIIQVA+ASG  YI+KVREAQLG
Subjt:  PCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVREAQLG

Query:  MKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS
        +K+ NVVCVDAKGL L+ DNLHL+T++QV LG  LAQAYLS
Subjt:  MKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS

AT4G34215.2 Domain of unknown function (DUF303)7.6e-8265.56Show/hide
Query:  PDPQNKQIFILSGQSNMAGRGGVSKKPKR----WDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERV----GVVALV
        P P N QIFILSGQSNMAGRGGV K        WD ++PPE  P+ SILRLSA L WE A EPLH DIDT K CGVGPGM+FAN V+ R+     V+ LV
Subjt:  PDPQNKQIFILSGQSNMAGRGGVSKKPKR----WDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERV----GVVALV

Query:  PCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVREAQLG
        PCA GGTAIKEW RG  LYE MVKR  ES + GGEIKA+LW+QGESD     DA +Y  NM+  + N+R DL LPSLPIIQVA+ASG  YI+KVREAQLG
Subjt:  PCAVGGTAIKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVREAQLG

Query:  MKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS
        +K+ NVVCVDAKGL L+ DNLHL+T++QV LG  LAQAYLS
Subjt:  MKIENVVCVDAKGLQLQEDNLHLSTQSQVVLGQMLAQAYLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GGTATCATGGCGACGACGAGCGATCCAGATCCAGATCCCCAAAACAAGCAGATCTTCATCTTATCAGGGCAGAGCAACATGGCCGGGCGCGGCGGCGTGTCGAAGAAGCC
CAAGCGCTGGGACGGCGTCGTTCCGCCGGAGTCGCAGCCGCACCCCTCGATACTCCGGCTGAGCGCGAAGCTGGAGTGGGAGGCGGCGCGTGAGCCCCTCCACGCGGACA
TCGACACGAAGAAGACGTGCGGGGTGGGGCCGGGGATGTCGTTCGCAAACGGCGTGAGGGAGCGCGTGGGAGTGGTGGCGCTGGTGCCGTGCGCGGTGGGGGGGACGGCC
ATCAAGGAGTGGGCGCGTGGAGAGAAGCTGTACGAGGACATGGTGAAGAGGGCGAGGGAGAGCGTGAGGGGCGGCGGAGAGATCAAGGCGCTTCTGTGGTTCCAAGGGGA
GAGCGACACCAGTACGGAAGCTGATGCGGTCGCGTACCAGGGGAATATGGAGGAGTTCGTTGCTAATGTGCGCCGCGACCTCGCCTTGCCTTCCCTCCCCATAATTCAGG
TGGCACTTGCATCTGGATTTGAGTACATTGAAAAAGTGAGAGAAGCACAATTGGGAATGAAAATAGAGAATGTGGTGTGTGTGGATGCAAAAGGGCTGCAACTCCAAGAA
GACAACCTCCACCTCTCAACTCAGTCTCAGGTTGTTTTGGGTCAAATGCTGGCTCAGGCTTACCTCTCCCCAACCCAT
mRNA sequenceShow/hide mRNA sequence
GGTATCATGGCGACGACGAGCGATCCAGATCCAGATCCCCAAAACAAGCAGATCTTCATCTTATCAGGGCAGAGCAACATGGCCGGGCGCGGCGGCGTGTCGAAGAAGCC
CAAGCGCTGGGACGGCGTCGTTCCGCCGGAGTCGCAGCCGCACCCCTCGATACTCCGGCTGAGCGCGAAGCTGGAGTGGGAGGCGGCGCGTGAGCCCCTCCACGCGGACA
TCGACACGAAGAAGACGTGCGGGGTGGGGCCGGGGATGTCGTTCGCAAACGGCGTGAGGGAGCGCGTGGGAGTGGTGGCGCTGGTGCCGTGCGCGGTGGGGGGGACGGCC
ATCAAGGAGTGGGCGCGTGGAGAGAAGCTGTACGAGGACATGGTGAAGAGGGCGAGGGAGAGCGTGAGGGGCGGCGGAGAGATCAAGGCGCTTCTGTGGTTCCAAGGGGA
GAGCGACACCAGTACGGAAGCTGATGCGGTCGCGTACCAGGGGAATATGGAGGAGTTCGTTGCTAATGTGCGCCGCGACCTCGCCTTGCCTTCCCTCCCCATAATTCAGG
TGGCACTTGCATCTGGATTTGAGTACATTGAAAAAGTGAGAGAAGCACAATTGGGAATGAAAATAGAGAATGTGGTGTGTGTGGATGCAAAAGGGCTGCAACTCCAAGAA
GACAACCTCCACCTCTCAACTCAGTCTCAGGTTGTTTTGGGTCAAATGCTGGCTCAGGCTTACCTCTCCCCAACCCAT
Protein sequenceShow/hide protein sequence
GIMATTSDPDPDPQNKQIFILSGQSNMAGRGGVSKKPKRWDGVVPPESQPHPSILRLSAKLEWEAAREPLHADIDTKKTCGVGPGMSFANGVRERVGVVALVPCAVGGTA
IKEWARGEKLYEDMVKRARESVRGGGEIKALLWFQGESDTSTEADAVAYQGNMEEFVANVRRDLALPSLPIIQVALASGFEYIEKVREAQLGMKIENVVCVDAKGLQLQE
DNLHLSTQSQVVLGQMLAQAYLSPTH