CuGenDBv2

MS021316 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS021316
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: CBS domain-containing protein CBSX5
Genome location: scaffold358:973523..975134
RNA-Seq Expression: MS021316
Synteny: MS021316
Gene Ontology terms:
GO:0006468 - protein phosphorylation (biological process)
GO:0042149 - cellular response to glucose starvation (biological process)
GO:0050790 - regulation of catalytic activity (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0031588 - nucleotide-activated protein kinase complex (cellular component)
GO:0016208 - AMP binding (molecular function)
GO:0019887 - protein kinase regulator activity (molecular function)
GO:0019901 - protein kinase binding (molecular function)
InterPro domains: IPR000644 - CBS domain
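
The genome location in this record is written as `scaffold:start..end`. As a minimal sketch, the snippet below splits such a string into its parts and reports the span of the locus. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("scaffold358:973523..975134")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the locus spans 1612 bp on scaffold358.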


Homology
GenBank top hits (e value, % identity, alignment)
XP_004141365.1 CBS domain-containing protein CBSX5 [Cucumis sativus] (e value: 3.7e-189; identity: 88.56%)
Query:  MAVSFFSHDVSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGG--GGDFGCCRCIGKLCMVDVICYLCREENLLSPSAA
        MAVS FSHDVSDLCLGKP LRPLSLSAT+ADALLAL+FS DYFVSVW CRL K GC  G   GG  GGDF CCRC+GKLCMVDVICYLC+EENLLSPS+A
Subjt:  MAVSFFSHDVSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGG--GGDFGCCRCIGKLCMVDVICYLCREENLLSPSAA

Query:  LSASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKL----RRKQLKNSTNVIHGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTLG
        L ASVSEILPQIP IVMHLEPSASLLEAIDLVLQGAQNLVVPIKT+L    RRKQLKNSTN IHGG EFCWLTQEDIIRYLL SIGLFSP+AALSLD+LG
Subjt:  LSASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKL----RRKQLKNSTNVIHGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTLG

Query:  IICTNPLSVNYHSPASSAIRAISCSIANQTSVAVVDGEGILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLE
        IICTN LSVNYHSPASSAI AIS SI NQTSVAV+DG+GILIGEISP ALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLK+S LEGMLE
Subjt:  IICTNPLSVNYHSPASSAIRAISCSIANQTSVAVVDGEGILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLE

Query:  EFTNSPSSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET
        EFTNSPSS+GS SFTSSSSDEEFSPSPSSR+YRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAI HRVNYVWVIEDDCSLIG+VTFLDMLKVFREHLET
Subjt:  EFTNSPSSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET

Query:  TE
        +E
Subjt:  TE

XP_022139716.1 CBS domain-containing protein CBSX5 [Momordica charantia] (e value: 8.1e-213; identity: 98.99%)
Query:  MAVSFFSHDVSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGGGGDFGCCRCIGKLCMVDVICYLCREENLLSPSAALS
        MAVSFFSHDVSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTC GGGGDFGCCRCIGKLCMVDVICYLCREENLLSPSAALS
Subjt:  MAVSFFSHDVSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGGGGDFGCCRCIGKLCMVDVICYLCREENLLSPSAALS

Query:  ASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLRRKQLKNSTNVIHGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTLGIICTNP
        ASV+EILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLRRKQLKNSTNVIHGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTLGIICTNP
Subjt:  ASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLRRKQLKNSTNVIHGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTLGIICTNP

Query:  LSVNYHSPASSAIRAISCSIANQTSVAVVDGEGILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLEEFTNSP
        LSVNYHSPASSAI AIS SIANQTSVAVVDGEGILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLEEFTNSP
Subjt:  LSVNYHSPASSAIRAISCSIANQTSVAVVDGEGILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLEEFTNSP

Query:  SSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLETTE
        SSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLETTE
Subjt:  SSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLETTE

XP_022940315.1 CBS domain-containing protein CBSX5-like [Cucurbita moschata] (e value: 1.5e-187; identity: 87.56%)
Query:  MAVSFFSHDVSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGG--GGDFGCCRCIGKLCMVDVICYLCREENLLSPSAA
        MAVS FSHDVSDLCLGKPALR LSLSATVADAL ALKFS+DYFVSVW CRL KNGCG G   G   GG+F CCRC+GKLCMVDVICYLCR++NLLSPSAA
Subjt:  MAVSFFSHDVSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGG--GGDFGCCRCIGKLCMVDVICYLCREENLLSPSAA

Query:  LSASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKL----RRKQLKNSTNVIHGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTLG
        L A +SEILPQIP IVMHLEPSASLL+AIDLVLQGAQNLVVPIK KL    RRKQLK   N IHGGREFCWLTQEDIIRYLLSSIG FSP+AALSLD LG
Subjt:  LSASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKL----RRKQLKNSTNVIHGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTLG

Query:  IICTNPLSVNYHSPASSAIRAISCSIANQTSVAVVDGEGILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLE
        IICTNPLSVNYHSPASSAI AIS  IANQTSVAVVD +GILIGEISP ALA CDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKE+NLEGML+
Subjt:  IICTNPLSVNYHSPASSAIRAISCSIANQTSVAVVDGEGILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLE

Query:  EFTNSPSSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET
        EFTNSPSS+GS SFTSSSSDEEFSPSPSSRKYRRSSSYSARITRR+EAIVCHPRSSL+AV+IQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET
Subjt:  EFTNSPSSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET

Query:  TE
        +E
Subjt:  TE

XP_022981296.1 CBS domain-containing protein CBSX5-like [Cucurbita maxima] (e value: 3.7e-189; identity: 88.31%)
Query:  MAVSFFSHDVSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGG--GGDFGCCRCIGKLCMVDVICYLCREENLLSPSAA
        MAVS FSHDVSDLCLGKPALR LSLSATVADAL ALKFS+DYFVSVW CRLAKNGCG G   G   GGDF CCRC+GKLCMVDVICYLCR+ENLLSPSAA
Subjt:  MAVSFFSHDVSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGG--GGDFGCCRCIGKLCMVDVICYLCREENLLSPSAA

Query:  LSASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKL----RRKQLKNSTNVIHGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTLG
        L A +SEILPQIP IVMHLEPSASLL+AIDLVLQGAQNLVVPIK KL    RRKQLKN TN IHGGREFCWLTQEDIIRYLLSSIG FSP+AALSLD LG
Subjt:  LSASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKL----RRKQLKNSTNVIHGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTLG

Query:  IICTNPLSVNYHSPASSAIRAISCSIANQTSVAVVDGEGILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLE
        IICTN LS+NYHSPASSAI AIS  IANQTSVAVVD +GILIGEISP ALA CDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKE+NLEGML+
Subjt:  IICTNPLSVNYHSPASSAIRAISCSIANQTSVAVVDGEGILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLE

Query:  EFTNSPSSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET
        EFTNSPSS+GS SFTSSSSDEEFSPSPSSRKYRRSSSYSARITRR+EAIVCHPRSSL+AV+IQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET
Subjt:  EFTNSPSSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET

Query:  TE
        +E
Subjt:  TE

XP_038898999.1 CBS domain-containing protein CBSX5 [Benincasa hispida] (e value: 7.2e-193; identity: 89.78%)
Query:  MAVSFFSHDVSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGC-GDGTCAGGGGDFGCCRCIGKLCMVDVICYLCREENLLSPSAAL
        MAVS FSHDVSDLCLGKPALRPLSLSA VADAL+AL+FS+DYFVSVW CRLAK+GC G+G    GGGDF CCRC+GKLCMVDVICYLC++ENLLSPSAAL
Subjt:  MAVSFFSHDVSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGC-GDGTCAGGGGDFGCCRCIGKLCMVDVICYLCREENLLSPSAAL

Query:  SASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKL----RRKQLKNSTNVIHGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTLGI
         ASVSEILPQIP IVMHLEP+ASLLEAIDLVLQGAQNLVVPIKTKL    RRKQLKN TN IHGG EFCWLTQEDIIRYLLSSIGLFS +AALSLD+LGI
Subjt:  SASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKL----RRKQLKNSTNVIHGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTLGI

Query:  ICTNPLSVNYHSPASSAIRAISCSIANQTSVAVVDGEGILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLEE
        ICTNPLSVNYHSPASSAI AIS  IANQTSVAV+DGEGILIGEISP ALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLK+S LEGMLEE
Subjt:  ICTNPLSVNYHSPASSAIRAISCSIANQTSVAVVDGEGILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLEE

Query:  FTNSPSSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLETT
        FTNSPSS+GSVSFTSSSSDEEFSPSPSSR+YRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIG+VTFLDMLKVFREHLE T
Subjt:  FTNSPSSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLETT

Query:  E
        E
Subjt:  E

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L3G0 Uncharacterized protein (e value: 1.8e-189; identity: 88.56%)
Query:  MAVSFFSHDVSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGG--GGDFGCCRCIGKLCMVDVICYLCREENLLSPSAA
        MAVS FSHDVSDLCLGKP LRPLSLSAT+ADALLAL+FS DYFVSVW CRL K GC  G   GG  GGDF CCRC+GKLCMVDVICYLC+EENLLSPS+A
Subjt:  MAVSFFSHDVSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGG--GGDFGCCRCIGKLCMVDVICYLCREENLLSPSAA

Query:  LSASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKL----RRKQLKNSTNVIHGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTLG
        L ASVSEILPQIP IVMHLEPSASLLEAIDLVLQGAQNLVVPIKT+L    RRKQLKNSTN IHGG EFCWLTQEDIIRYLL SIGLFSP+AALSLD+LG
Subjt:  LSASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKL----RRKQLKNSTNVIHGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTLG

Query:  IICTNPLSVNYHSPASSAIRAISCSIANQTSVAVVDGEGILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLE
        IICTN LSVNYHSPASSAI AIS SI NQTSVAV+DG+GILIGEISP ALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLK+S LEGMLE
Subjt:  IICTNPLSVNYHSPASSAIRAISCSIANQTSVAVVDGEGILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLE

Query:  EFTNSPSSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET
        EFTNSPSS+GS SFTSSSSDEEFSPSPSSR+YRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAI HRVNYVWVIEDDCSLIG+VTFLDMLKVFREHLET
Subjt:  EFTNSPSSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET

Query:  TE
        +E
Subjt:  TE

A0A5A7VF56 CBS domain-containing protein CBSX5 (e value: 7.5e-188; identity: 87.78%)
Query:  MAVSFFSHDVSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAG-GGGDFGCCRCIGKLCMVDVICYLCREENLLSPSAAL
        MAVS FSHDVSDLCLGKPALRPLSLSAT+ADALLAL+FS DYFVSVW C L K+GC  G   G  GGDF CCRC+GKLCMVDV+CYLC+EENLLSPSAAL
Subjt:  MAVSFFSHDVSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAG-GGGDFGCCRCIGKLCMVDVICYLCREENLLSPSAAL

Query:  SASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKL----RRKQLKNSTNVIHGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTLGI
         ASVSEILPQIP IVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKL    RRKQLKNS N IHGGREFCWLTQEDIIRYLLSSIG FSP+AALSLD+LGI
Subjt:  SASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKL----RRKQLKNSTNVIHGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTLGI

Query:  ICTNPLSVNYHSPASSAIRAISCSIANQTSVAVVDGEGILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLEE
        I TN LSV+YHSPASSAI AIS SI NQTSVAV+DG+GILIGEISP ALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVK+VKARLK+S LEGMLEE
Subjt:  ICTNPLSVNYHSPASSAIRAISCSIANQTSVAVVDGEGILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLEE

Query:  FTNSPSSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLETT
        FTNSPSS+GS SFTSSSSDEEFSPSP SR+ RRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIG+VTFLDMLKVFREHLET+
Subjt:  FTNSPSSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLETT

Query:  E
        E
Subjt:  E

A0A6J1CDJ1 CBS domain-containing protein CBSX5 (e value: 3.9e-213; identity: 98.99%)
Query:  MAVSFFSHDVSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGGGGDFGCCRCIGKLCMVDVICYLCREENLLSPSAALS
        MAVSFFSHDVSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTC GGGGDFGCCRCIGKLCMVDVICYLCREENLLSPSAALS
Subjt:  MAVSFFSHDVSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGGGGDFGCCRCIGKLCMVDVICYLCREENLLSPSAALS

Query:  ASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLRRKQLKNSTNVIHGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTLGIICTNP
        ASV+EILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLRRKQLKNSTNVIHGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTLGIICTNP
Subjt:  ASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLRRKQLKNSTNVIHGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTLGIICTNP

Query:  LSVNYHSPASSAIRAISCSIANQTSVAVVDGEGILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLEEFTNSP
        LSVNYHSPASSAI AIS SIANQTSVAVVDGEGILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLEEFTNSP
Subjt:  LSVNYHSPASSAIRAISCSIANQTSVAVVDGEGILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLEEFTNSP

Query:  SSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLETTE
        SSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLETTE
Subjt:  SSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLETTE

A0A6J1FQ76 CBS domain-containing protein CBSX5-like (e value: 7.5e-188; identity: 87.56%)
Query:  MAVSFFSHDVSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGG--GGDFGCCRCIGKLCMVDVICYLCREENLLSPSAA
        MAVS FSHDVSDLCLGKPALR LSLSATVADAL ALKFS+DYFVSVW CRL KNGCG G   G   GG+F CCRC+GKLCMVDVICYLCR++NLLSPSAA
Subjt:  MAVSFFSHDVSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGG--GGDFGCCRCIGKLCMVDVICYLCREENLLSPSAA

Query:  LSASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKL----RRKQLKNSTNVIHGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTLG
        L A +SEILPQIP IVMHLEPSASLL+AIDLVLQGAQNLVVPIK KL    RRKQLK   N IHGGREFCWLTQEDIIRYLLSSIG FSP+AALSLD LG
Subjt:  LSASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKL----RRKQLKNSTNVIHGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTLG

Query:  IICTNPLSVNYHSPASSAIRAISCSIANQTSVAVVDGEGILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLE
        IICTNPLSVNYHSPASSAI AIS  IANQTSVAVVD +GILIGEISP ALA CDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKE+NLEGML+
Subjt:  IICTNPLSVNYHSPASSAIRAISCSIANQTSVAVVDGEGILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLE

Query:  EFTNSPSSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET
        EFTNSPSS+GS SFTSSSSDEEFSPSPSSRKYRRSSSYSARITRR+EAIVCHPRSSL+AV+IQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET
Subjt:  EFTNSPSSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET

Query:  TE
        +E
Subjt:  TE

A0A6J1IW58 CBS domain-containing protein CBSX5-like (e value: 1.8e-189; identity: 88.31%)
Query:  MAVSFFSHDVSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGG--GGDFGCCRCIGKLCMVDVICYLCREENLLSPSAA
        MAVS FSHDVSDLCLGKPALR LSLSATVADAL ALKFS+DYFVSVW CRLAKNGCG G   G   GGDF CCRC+GKLCMVDVICYLCR+ENLLSPSAA
Subjt:  MAVSFFSHDVSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGG--GGDFGCCRCIGKLCMVDVICYLCREENLLSPSAA

Query:  LSASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKL----RRKQLKNSTNVIHGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTLG
        L A +SEILPQIP IVMHLEPSASLL+AIDLVLQGAQNLVVPIK KL    RRKQLKN TN IHGGREFCWLTQEDIIRYLLSSIG FSP+AALSLD LG
Subjt:  LSASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKL----RRKQLKNSTNVIHGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTLG

Query:  IICTNPLSVNYHSPASSAIRAISCSIANQTSVAVVDGEGILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLE
        IICTN LS+NYHSPASSAI AIS  IANQTSVAVVD +GILIGEISP ALA CDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKE+NLEGML+
Subjt:  IICTNPLSVNYHSPASSAIRAISCSIANQTSVAVVDGEGILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLE

Query:  EFTNSPSSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET
        EFTNSPSS+GS SFTSSSSDEEFSPSPSSRKYRRSSSYSARITRR+EAIVCHPRSSL+AV+IQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET
Subjt:  EFTNSPSSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET

Query:  TE
        +E
Subjt:  TE

SwissProt top hits (e value, % identity, alignment)
Q84WQ5 CBS domain-containing protein CBSX5 (e value: 4.6e-94; identity: 50.61%)
Query:  MAVSFFSHDVSDLCLGKPALRPL-SLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGGGGDFGCCRCIGKLCMVDVICYLCREENLLSPSAAL
        MA+S  S++VSDLCLGKP LR L S S++V+DA+ ALK SED F+SVW+C    N   D            C C+GK+ M DVIC+L ++ +      AL
Subjt:  MAVSFFSHDVSDLCLGKPALRPL-SLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGGGGDFGCCRCIGKLCMVDVICYLCREENLLSPSAAL

Query:  SASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLRRKQLKNSTNVI------HGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTL
        ++SVS +LP+  +IV+H++PS SL+EAIDL+++GAQNL+VPI TK   K+ +++ NV         G+ FCW+TQEDII++LL  I  FSP+ A+SL  L
Subjt:  SASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLRRKQLKNSTNVI------HGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTL

Query:  GIICT--NPLSVNYHSPASSAIRAISCSIANQTSVAVVDGEG-----ILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKE
        G+I +    ++V+YHS AS+ + A+S ++A QTSVAVVDGEG      LIGEISP+ L  CD+  AAA+ TLS+GDLMAYID   PPE LV++V+ RL++
Subjt:  GIICT--NPLSVNYHSPASSAIRAISCSIANQTSVAVVDGEG-----ILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKE

Query:  SNLEGMLEEFTNSPSSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLK
          L G++  F     S+ S S +S  S EE +P  ++  Y RS S SAR+ R++EAIVC+P+SSL+AVMIQA+AHRVNY WV+E D   +G+VTF+D+LK
Subjt:  SNLEGMLEEFTNSPSSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLK

Query:  VFREHLE
        VFR+ LE
Subjt:  VFREHLE

Q8GXI9 SNF1-related protein kinase regulatory subunit gamma-like PV42b (e value: 4.0e-05; identity: 19.55%)
Query:  VSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGGGGDFGCCR--CIGKLCMVDVICYLCREENLLSPSAALSASVSEIL
        V DL + K  L  +  +AT+ DAL  +  +    V V +      G G           G  R   IG + M+DV+ ++  ++        ++A VS I+
Subjt:  VSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGGGGDFGCCR--CIGKLCMVDVICYLCREENLLSPSAALSASVSEIL

Query:  PQIP--AIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLRRKQLKNSTNVIHGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTLGIICTNPLSVNY
           P    +  L P+ S+++ ++++ +G   ++VP+ +            ++     +  L+Q D+I +          + + ++  L  I    L++  
Subjt:  PQIP--AIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLRRKQLKNSTNVIHGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTLGIICTNPLSVNY

Query:  HSPASSAIRAISCSIANQTSVAVVDGEG------------ILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGML
         +    AI+ +S ++ N   +    GEG             ++G  S   L GC  A   + + L++ + +  I     P  L+                
Subjt:  HSPASSAIRAISCSIANQTSVAVVDGEG------------ILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGML

Query:  EEFTNSPSSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHL
                      FT+++S                       T   E + CH  S+L  V+      RV+ VWV++ +  L G+V+  D++ V R  L
Subjt:  EEFTNSPSSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHL

Q8GZA4 CBS domain-containing protein CBSX6 (e value: 5.8e-20; identity: 26.3%)
Query:  MAVSFFSHDVSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGGGGDFGCCRCIGKLCMVDVICYLCREENLLSPSAALS
        MA  F  H V DL +GKP +     + TV  A+ A+  S +  + VW  R   +  G         +    R +G L  +D++ +L + E  L    A+ 
Subjt:  MAVSFFSHDVSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGGGGDFGCCRCIGKLCMVDVICYLCREENLLSPSAALS

Query:  ASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLR-----------RKQLKNS------------------TNVIHGGREFCWLTQEDI
          VSE++     ++  ++P   L++A++++ QG + L+VP     R            K LKNS                  T++     +FC L++ED+
Subjt:  ASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLR-----------RKQLKNS------------------TNVIHGGREFCWLTQEDI

Query:  IRYLLSSIGLFSPMAALSLDTLGIICTNPLSVNYHSPASSAIRAISCSIANQTSVAVV-----DGEGILIGEISPLALAGCDKAVAA-AIMTLSSGDLMA
        IR+L+  +G  +P+   S+ TLGII  N    N+   +  AI A    + + +++AV+     + +  +IGEIS   L  CD   AA A+  L +G  + 
Subjt:  IRYLLSSIGLFSPMAALSLDTLGIICTNPLSVNYHSPASSAIRAISCSIANQTSVAVV-----DGEGILIGEISPLALAGCDKAVAA-AIMTLSSGDLMA

Query:  YIDCGGPPEDLVKVVKARLKESNLEGMLEEFTNSPSSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNY
         ++           + +R     L+        + ++  +  F+S S    F+P+  +R     S Y      R+  + C   SSL AVM Q ++HR  +
Subjt:  YIDCGGPPEDLVKVVKARLKESNLEGMLEEFTNSPSSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNY

Query:  VWVIEDDCS--LIGIVTFLDML
        VWV E D    L+G+V + ++L
Subjt:  VWVIEDDCS--LIGIVTFLDML

Arabidopsis top hits (e value, % identity, alignment)
AT1G65320.1 Cystathionine beta-synthase (CBS) family protein (e value: 4.1e-21; identity: 26.3%)
Query:  MAVSFFSHDVSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGGGGDFGCCRCIGKLCMVDVICYLCREENLLSPSAALS
        MA  F  H V DL +GKP +     + TV  A+ A+  S +  + VW  R   +  G         +    R +G L  +D++ +L + E  L    A+ 
Subjt:  MAVSFFSHDVSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGGGGDFGCCRCIGKLCMVDVICYLCREENLLSPSAALS

Query:  ASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLR-----------RKQLKNS------------------TNVIHGGREFCWLTQEDI
          VSE++     ++  ++P   L++A++++ QG + L+VP     R            K LKNS                  T++     +FC L++ED+
Subjt:  ASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLR-----------RKQLKNS------------------TNVIHGGREFCWLTQEDI

Query:  IRYLLSSIGLFSPMAALSLDTLGIICTNPLSVNYHSPASSAIRAISCSIANQTSVAVV-----DGEGILIGEISPLALAGCDKAVAA-AIMTLSSGDLMA
        IR+L+  +G  +P+   S+ TLGII  N    N+   +  AI A    + + +++AV+     + +  +IGEIS   L  CD   AA A+  L +G  + 
Subjt:  IRYLLSSIGLFSPMAALSLDTLGIICTNPLSVNYHSPASSAIRAISCSIANQTSVAVV-----DGEGILIGEISPLALAGCDKAVAA-AIMTLSSGDLMA

Query:  YIDCGGPPEDLVKVVKARLKESNLEGMLEEFTNSPSSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNY
         ++           + +R     L+        + ++  +  F+S S    F+P+  +R     S Y      R+  + C   SSL AVM Q ++HR  +
Subjt:  YIDCGGPPEDLVKVVKARLKESNLEGMLEEFTNSPSSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNY

Query:  VWVIEDDCS--LIGIVTFLDML
        VWV E D    L+G+V + ++L
Subjt:  VWVIEDDCS--LIGIVTFLDML

AT1G80090.1 Cystathionine beta-synthase (CBS) family protein (e value: 1.1e-05; identity: 19.66%)
Query:  VSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGGGGDF--------GCCR--CIGKLCMVDVICYLCREENLLSPSAAL
        V DL + K  L  +  +AT+ DAL  +       V+    R        G   G GG          G  R   IG + M+DV+ ++  ++        +
Subjt:  VSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGGGGDF--------GCCR--CIGKLCMVDVICYLCREENLLSPSAAL

Query:  SASVSEILPQIP--AIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLRRKQLKNSTNVIHGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTLGIIC
        +A VS I+   P    +  L P+ S+++ ++++ +G   ++VP+ +            ++     +  L+Q D+I +          + + ++  L  I 
Subjt:  SASVSEILPQIP--AIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLRRKQLKNSTNVIHGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTLGIIC

Query:  TNPLSVNYHSPASSAIRAISCSIANQTSVAVVDGEG------------ILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLK
           L++   +    AI+ +S ++ N   +    GEG             ++G  S   L GC  A   + + L++ + +  I     P  L+        
Subjt:  TNPLSVNYHSPASSAIRAISCSIANQTSVAVVDGEG------------ILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLK

Query:  ESNLEGMLEEFTNSPSSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDML
                              FT+++S                       T   E + CH  S+L  V+      RV+ VWV++ +  L G+V+  D++
Subjt:  ESNLEGMLEEFTNSPSSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDML

Query:  KVFREHL
         V R  L
Subjt:  KVFREHL

AT4G27460.1 Cystathionine beta-synthase (CBS) family protein (e value: 3.3e-95; identity: 50.61%)
Query:  MAVSFFSHDVSDLCLGKPALRPL-SLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGGGGDFGCCRCIGKLCMVDVICYLCREENLLSPSAAL
        MA+S  S++VSDLCLGKP LR L S S++V+DA+ ALK SED F+SVW+C    N   D            C C+GK+ M DVIC+L ++ +      AL
Subjt:  MAVSFFSHDVSDLCLGKPALRPL-SLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGGGGDFGCCRCIGKLCMVDVICYLCREENLLSPSAAL

Query:  SASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLRRKQLKNSTNVI------HGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTL
        ++SVS +LP+  +IV+H++PS SL+EAIDL+++GAQNL+VPI TK   K+ +++ NV         G+ FCW+TQEDII++LL  I  FSP+ A+SL  L
Subjt:  SASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLRRKQLKNSTNVI------HGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTL

Query:  GIICT--NPLSVNYHSPASSAIRAISCSIANQTSVAVVDGEG-----ILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKE
        G+I +    ++V+YHS AS+ + A+S ++A QTSVAVVDGEG      LIGEISP+ L  CD+  AAA+ TLS+GDLMAYID   PPE LV++V+ RL++
Subjt:  GIICT--NPLSVNYHSPASSAIRAISCSIANQTSVAVVDGEG-----ILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKE

Query:  SNLEGMLEEFTNSPSSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLK
          L G++  F     S+ S S +S  S EE +P  ++  Y RS S SAR+ R++EAIVC+P+SSL+AVMIQA+AHRVNY WV+E D   +G+VTF+D+LK
Subjt:  SNLEGMLEEFTNSPSSVGSVSFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLK

Query:  VFREHLE
        VFR+ LE
Subjt:  VFREHLE

AT5G53750.1 CBS domain-containing protein (e value: 6.6e-96; identity: 50.12%)
Query:  MAVSFFSHDVSDLCLGKPALRPLSL-SATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGGGGDFGCCRCIGKLCMVDVICYLCR-EENLLSPSAA
        MA++  SH++SDLC+GKP LR LS+ +ATVADA+ ALK S++ F++VWSC   +             D   C C+GK+CM DVICYL + + N+LS S+A
Subjt:  MAVSFFSHDVSDLCLGKPALRPLSL-SATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGGGGDFGCCRCIGKLCMVDVICYLCR-EENLLSPSAA

Query:  LSASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTK---LRRKQ-----------LKNSTNVIH-GGREFCWLTQEDIIRYLLSSIGLFS
          ASVS +LP+  A+V+H++ S SL+EAIDL+++GAQNL+VPI TK    RR+Q           L N+T+  H   REFCW+TQEDIIR+LL SI +FS
Subjt:  LSASVSEILPQIPAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTK---LRRKQ-----------LKNSTNVIH-GGREFCWLTQEDIIRYLLSSIGLFS

Query:  PMAALSLDTLGIICTNP--LSVNYHSPASSAIRAISCSIANQTSVAVVDGEG--------ILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPP
        P+ +LS+  LG+I +    L+V+Y+S A+SA+ AIS +I +  SVAVV G+G        +LIGEISP+ LA CD+   AA+ TLS+GDLM+YID  GPP
Subjt:  PMAALSLDTLGIICTNP--LSVNYHSPASSAIRAISCSIANQTSVAVVDGEG--------ILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPP

Query:  EDLVKVVKARLKESNLEGMLEEFTNSPSSVGSVSFTSSSSDEEFSPSPSSR---KYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIE
        E LV VV+ RL++  + G++       S + S+S +S SS +E SP+  +R    Y RS S +AR+ R++ AIVC+ +SSL+AVMIQAIAHRV+YVWVI+
Subjt:  EDLVKVVKARLKESNLEGMLEEFTNSPSSVGSVSFTSSSSDEEFSPSPSSR---KYRRSSSYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIE

Query:  DDCSLIGIVTFLDMLKVFREHLE
        +D  LIG+VTF+D+LK+FRE L+
Subjt:  DDCSLIGIVTFLDMLKVFREHLE


Sequences
CDS sequence
ATGGCAGTGAGCTTTTTTTCGCATGACGTTTCTGATCTGTGTCTCGGCAAGCCCGCGCTGAGGCCGCTCTCCCTCTCCGCCACCGTCGCCGACGCCTTGCTCGCTCTCAA
ATTCTCCGAAGATTACTTCGTCAGCGTCTGGAGTTGCCGTCTCGCGAAGAACGGTTGCGGCGATGGTACTTGTGCTGGTGGCGGTGGCGATTTCGGGTGCTGCCGATGCA
TCGGGAAGCTGTGTATGGTGGATGTGATTTGCTATCTCTGTAGGGAGGAGAATTTGTTATCTCCTTCGGCCGCATTGAGCGCTTCTGTCTCTGAAATTTTGCCTCAAATT
CCTGCAATCGTGATGCACTTGGAGCCCTCTGCTAGCTTGTTAGAGGCCATTGATCTGGTTCTCCAAGGTGCTCAAAATCTTGTTGTCCCAATCAAGACTAAGCTAAGAAG
AAAACAGCTCAAAAACTCCACAAATGTGATCCATGGCGGCCGTGAATTCTGCTGGCTGACTCAGGAAGATATAATTAGATATCTTCTCAGCTCTATTGGCCTCTTTTCCC
CCATGGCTGCCCTCTCCCTTGACACCCTTGGCATCATCTGCACCAATCCCTTGTCAGTGAACTACCACTCCCCTGCCTCGTCTGCCATCAGAGCCATTTCCTGCTCCATC
GCCAACCAAACCTCGGTTGCAGTCGTCGATGGCGAAGGAATTTTGATCGGTGAGATCTCGCCCTTGGCCCTTGCTGGGTGTGACAAGGCTGTTGCAGCTGCGATTATGAC
CTTGTCATCGGGAGACCTGATGGCCTACATAGACTGCGGTGGCCCTCCAGAGGATCTCGTCAAGGTAGTGAAGGCTAGGCTGAAAGAGAGCAACTTGGAAGGAATGCTGG
AGGAATTCACCAATTCACCATCCTCAGTTGGGTCTGTATCTTTCACTTCTTCATCATCAGATGAGGAATTCTCGCCATCGCCGAGCTCAAGAAAGTACAGAAGATCATCA
AGCTACTCAGCACGGATCACCCGTCGTGCAGAGGCCATAGTTTGCCATCCAAGGAGCTCCTTGGTAGCTGTAATGATCCAGGCGATCGCACACCGTGTTAACTACGTCTG
GGTTATCGAAGACGACTGCAGTTTGATCGGAATCGTCACATTCCTTGATATGTTAAAAGTTTTCAGAGAACATTTAGAGACAACTGAG
mRNA sequence
ATGGCAGTGAGCTTTTTTTCGCATGACGTTTCTGATCTGTGTCTCGGCAAGCCCGCGCTGAGGCCGCTCTCCCTCTCCGCCACCGTCGCCGACGCCTTGCTCGCTCTCAA
ATTCTCCGAAGATTACTTCGTCAGCGTCTGGAGTTGCCGTCTCGCGAAGAACGGTTGCGGCGATGGTACTTGTGCTGGTGGCGGTGGCGATTTCGGGTGCTGCCGATGCA
TCGGGAAGCTGTGTATGGTGGATGTGATTTGCTATCTCTGTAGGGAGGAGAATTTGTTATCTCCTTCGGCCGCATTGAGCGCTTCTGTCTCTGAAATTTTGCCTCAAATT
CCTGCAATCGTGATGCACTTGGAGCCCTCTGCTAGCTTGTTAGAGGCCATTGATCTGGTTCTCCAAGGTGCTCAAAATCTTGTTGTCCCAATCAAGACTAAGCTAAGAAG
AAAACAGCTCAAAAACTCCACAAATGTGATCCATGGCGGCCGTGAATTCTGCTGGCTGACTCAGGAAGATATAATTAGATATCTTCTCAGCTCTATTGGCCTCTTTTCCC
CCATGGCTGCCCTCTCCCTTGACACCCTTGGCATCATCTGCACCAATCCCTTGTCAGTGAACTACCACTCCCCTGCCTCGTCTGCCATCAGAGCCATTTCCTGCTCCATC
GCCAACCAAACCTCGGTTGCAGTCGTCGATGGCGAAGGAATTTTGATCGGTGAGATCTCGCCCTTGGCCCTTGCTGGGTGTGACAAGGCTGTTGCAGCTGCGATTATGAC
CTTGTCATCGGGAGACCTGATGGCCTACATAGACTGCGGTGGCCCTCCAGAGGATCTCGTCAAGGTAGTGAAGGCTAGGCTGAAAGAGAGCAACTTGGAAGGAATGCTGG
AGGAATTCACCAATTCACCATCCTCAGTTGGGTCTGTATCTTTCACTTCTTCATCATCAGATGAGGAATTCTCGCCATCGCCGAGCTCAAGAAAGTACAGAAGATCATCA
AGCTACTCAGCACGGATCACCCGTCGTGCAGAGGCCATAGTTTGCCATCCAAGGAGCTCCTTGGTAGCTGTAATGATCCAGGCGATCGCACACCGTGTTAACTACGTCTG
GGTTATCGAAGACGACTGCAGTTTGATCGGAATCGTCACATTCCTTGATATGTTAAAAGTTTTCAGAGAACATTTAGAGACAACTGAG
Protein sequence
MAVSFFSHDVSDLCLGKPALRPLSLSATVADALLALKFSEDYFVSVWSCRLAKNGCGDGTCAGGGGDFGCCRCIGKLCMVDVICYLCREENLLSPSAALSASVSEILPQI
PAIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLRRKQLKNSTNVIHGGREFCWLTQEDIIRYLLSSIGLFSPMAALSLDTLGIICTNPLSVNYHSPASSAIRAISCSI
ANQTSVAVVDGEGILIGEISPLALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLEEFTNSPSSVGSVSFTSSSSDEEFSPSPSSRKYRRSS
SYSARITRRAEAIVCHPRSSLVAVMIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLETTE
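
The CDS and protein sequences in this record can be related to each other by translating the CDS codon by codon. The snippet below is a minimal sketch, assuming the standard genetic code applies; the function name `translate` is hypothetical and is not part of any CuGenDB tooling. It builds the codon table from the conventional TCAG ordering and translates the first 30 bases of the CDS shown in this record.

```python
# Standard genetic code, with codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS (A/C/G/T only) into one-letter amino acids; '*' marks a stop."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Codons containing other symbols (for example N) raise KeyError.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 bases of the CDS listed in this record.
print(translate("ATGGCAGTGAGCTTTTTTTCGCATGACGTT"))  # MAVSFFSHDV
```

The printed peptide matches the first ten residues of the protein sequence in this record. Passing the full CDS, with its line breaks joined, to the same function should let the whole translation be compared with the protein sequence in the same way.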