CuGenDBv2

Tan0017764 (gene) of Snake gourd v1 genome

Gene ID: Tan0017764
Organism: Trichosanthes anguina (Snake gourd v1)
Description: CBS domain-containing protein CBSX5-like
Genome location: LG01:109265375..109267085
RNA-Seq Expression: Tan0017764
Synteny: Tan0017764
Gene Ontology terms:
GO:0006468 - protein phosphorylation (biological process)
GO:0042149 - cellular response to glucose starvation (biological process)
GO:0050790 - regulation of catalytic activity (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0031588 - nucleotide-activated protein kinase complex (cellular component)
GO:0016208 - AMP binding (molecular function)
GO:0019887 - protein kinase regulator activity (molecular function)
GO:0019901 - protein kinase binding (molecular function)
InterPro domains: IPR000644 - CBS domain
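
The genome location field packs the sequence name and both coordinates into one string. A minimal sketch of splitting it apart follows. The `GenomeLocation` type and `parse_location` helper are hypothetical names introduced here for illustration, and the span length assumes 1-based inclusive coordinates, which this page does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'seqid:start..end' location string."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a string such as 'LG01:109265375..109267085'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end = match.groups()
    return GenomeLocation(seqid, int(start), int(end))


loc = parse_location("LG01:109265375..109267085")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 1711 bases on LG01.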


Homology
GenBank top hits | e value | % identity | Alignment (shown below each hit)
XP_004141365.1 CBS domain-containing protein CBSX5 [Cucumis sativus] | 1.2e-195 | 90.52
Query:  MAVSLFSHDVSDLCLGKPALRPLFLSATVADALLALKFSDDYFVSVWDCCYSKNGCGG--DGCAGGGDFECCRCVGKLCMVDVICYLCRDENLLSPSAAL
        MAVSLFSHDVSDLCLGKP LRPL LSAT+ADALLAL+FS DYFVSVWDC   K GC G  DG A GGDFECCRCVGKLCMVDVICYLC++ENLLSPS+AL
Subjt:  MAVSLFSHDVSDLCLGKPALRPLFLSATVADALLALKFSDDYFVSVWDCCYSKNGCGG--DGCAGGGDFECCRCVGKLCMVDVICYLCRDENLLSPSAAL

Query:  RASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQLKNPTNAIHGGREFCWLTQEDIIRYLLSSIGLFSRISALSLDSLGI
        +ASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKT+LGSNSRRKQLKN TN IHGG EFCWLTQEDIIRYLL SIGLFS I+ALSLDSLGI
Subjt:  RASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQLKNPTNAIHGGREFCWLTQEDIIRYLLSSIGLFSRISALSLDSLGI

Query:  ICTNALSVNYHSPASSAIGAISRCIANQTSVAVVDADGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLEE
        ICTNALSVNYHSPASSAIGAIS  I NQTSVAV+D DGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLK+S LEGMLEE
Subjt:  ICTNALSVNYHSPASSAIGAISRCIANQTSVAVVDADGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLEE

Query:  FTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLETS
        FTNSPSSIG  SFTSSSSDEEFSPSPSSR+YRRSSSYSARITRRAEAIVCHPRSSLVAV+IQAI HRVNYVWVIEDDCSLIG+VTFLDMLKVFREHLETS
Subjt:  FTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLETS

Query:  E
        E
Subjt:  E

XP_022940315.1 CBS domain-containing protein CBSX5-like [Cucurbita moschata] | 1.5e-198 | 91.79
Query:  MAVSLFSHDVSDLCLGKPALRPLFLSATVADALLALKFSDDYFVSVWDCCYSKNGCGG---DGCAGGGDFECCRCVGKLCMVDVICYLCRDENLLSPSAA
        MAVSLFSHDVSDLCLGKPALR L LSATVADAL ALKFSDDYFVSVWDC  +KNGCGG   DG A GG+FECCRCVGKLCMVDVICYLCRD+NLLSPSAA
Subjt:  MAVSLFSHDVSDLCLGKPALRPLFLSATVADALLALKFSDDYFVSVWDCCYSKNGCGG---DGCAGGGDFECCRCVGKLCMVDVICYLCRDENLLSPSAA

Query:  LRASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQLKNPTNAIHGGREFCWLTQEDIIRYLLSSIGLFSRISALSLDSLG
        L A +SEILPQIPGIVMHLEPSASLL+AIDLVLQGAQNLVVPIK KLGSNSRRKQLK P NAIHGGREFCWLTQEDIIRYLLSSIG FS I+ALSLD+LG
Subjt:  LRASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQLKNPTNAIHGGREFCWLTQEDIIRYLLSSIGLFSRISALSLDSLG

Query:  IICTNALSVNYHSPASSAIGAISRCIANQTSVAVVDADGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLE
        IICTN LSVNYHSPASSAIGAIS CIANQTSVAVVDADGILIGEISPFALA CDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKE+NLEGML+
Subjt:  IICTNALSVNYHSPASSAIGAISRCIANQTSVAVVDADGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLE

Query:  EFTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET
        EFTNSPSSIG  SFTSSSSDEEFSPSPSSRKYRRSSSYSARITRR+EAIVCHPRSSL+AVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET
Subjt:  EFTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET

Query:  SE
        SE
Subjt:  SE

XP_022981296.1 CBS domain-containing protein CBSX5-like [Cucurbita maxima] | 8.5e-202 | 93.03
Query:  MAVSLFSHDVSDLCLGKPALRPLFLSATVADALLALKFSDDYFVSVWDCCYSKNGCGG---DGCAGGGDFECCRCVGKLCMVDVICYLCRDENLLSPSAA
        MAVSLFSHDVSDLCLGKPALR L LSATVADAL ALKFSDDYFVSVWDC  +KNGCGG   DG A GGDFECCRCVGKLCMVDVICYLCRDENLLSPSAA
Subjt:  MAVSLFSHDVSDLCLGKPALRPLFLSATVADALLALKFSDDYFVSVWDCCYSKNGCGG---DGCAGGGDFECCRCVGKLCMVDVICYLCRDENLLSPSAA

Query:  LRASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQLKNPTNAIHGGREFCWLTQEDIIRYLLSSIGLFSRISALSLDSLG
        L A +SEILPQIPGIVMHLEPSASLL+AIDLVLQGAQNLVVPIK KLGSNSRRKQLKNPTNAIHGGREFCWLTQEDIIRYLLSSIG FS I+ALSLD+LG
Subjt:  LRASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQLKNPTNAIHGGREFCWLTQEDIIRYLLSSIGLFSRISALSLDSLG

Query:  IICTNALSVNYHSPASSAIGAISRCIANQTSVAVVDADGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLE
        IICTNALS+NYHSPASSAIGAISRCIANQTSVAVVDADGILIGEISPFALA CDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKE+NLEGML+
Subjt:  IICTNALSVNYHSPASSAIGAISRCIANQTSVAVVDADGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLE

Query:  EFTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET
        EFTNSPSSIG  SFTSSSSDEEFSPSPSSRKYRRSSSYSARITRR+EAIVCHPRSSL+AVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET
Subjt:  EFTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET

Query:  SE
        SE
Subjt:  SE

XP_023525010.1 CBS domain-containing protein CBSX5-like [Cucurbita pepo subsp. pepo] | 3.3e-198 | 91.79
Query:  MAVSLFSHDVSDLCLGKPALRPLFLSATVADALLALKFSDDYFVSVWDCCYSKNGCGG---DGCAGGGDFECCRCVGKLCMVDVICYLCRDENLLSPSAA
        MAVSLFSHDVSDLCLGKPALR L LSATVADAL ALKFSD YFVSVWDC  +KNGCGG   DG A GG+FECCRCVGKLCMVDVICYLCRDENLLSPSAA
Subjt:  MAVSLFSHDVSDLCLGKPALRPLFLSATVADALLALKFSDDYFVSVWDCCYSKNGCGG---DGCAGGGDFECCRCVGKLCMVDVICYLCRDENLLSPSAA

Query:  LRASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQLKNPTNAIHGGREFCWLTQEDIIRYLLSSIGLFSRISALSLDSLG
        L A +SEILPQIPGIVMHLEPSASLL+AIDLVLQGAQNLVVPIK KLGSNSRRKQLK PTNAI+GGREFCWLTQEDIIRYLLSSIG FS I+ALSLD+LG
Subjt:  LRASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQLKNPTNAIHGGREFCWLTQEDIIRYLLSSIGLFSRISALSLDSLG

Query:  IICTNALSVNYHSPASSAIGAISRCIANQTSVAVVDADGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLE
        IICTN LSVNYHSPASSAIGAISRCIANQTSVAVVDADGILIGEISPF LA CDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKE+NLEGML+
Subjt:  IICTNALSVNYHSPASSAIGAISRCIANQTSVAVVDADGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLE

Query:  EFTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET
        EFTNSPSSIG  SFTSSSSDEEFSPSPSSRKYRRSSSYSARITRR+EAIVCHPRSSL+AVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET
Subjt:  EFTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET

Query:  SE
        SE
Subjt:  SE

XP_038898999.1 CBS domain-containing protein CBSX5 [Benincasa hispida] | 4.2e-201 | 92.27
Query:  MAVSLFSHDVSDLCLGKPALRPLFLSATVADALLALKFSDDYFVSVWDCCYSKNGCG--GDGCAGGGDFECCRCVGKLCMVDVICYLCRDENLLSPSAAL
        MAVSLFSHDVSDLCLGKPALRPL LSA VADAL+AL+FSDDYFVSVWDC  +K+GC   GDG AGGGDF+CCRCVGKLCMVDVICYLC+DENLLSPSAAL
Subjt:  MAVSLFSHDVSDLCLGKPALRPLFLSATVADALLALKFSDDYFVSVWDCCYSKNGCG--GDGCAGGGDFECCRCVGKLCMVDVICYLCRDENLLSPSAAL

Query:  RASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQLKNPTNAIHGGREFCWLTQEDIIRYLLSSIGLFSRISALSLDSLGI
        RASVSEILPQIPGIVMHLEP+ASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQLKNPTNAIHGG EFCWLTQEDIIRYLLSSIGLFS I+ALSLDSLGI
Subjt:  RASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQLKNPTNAIHGGREFCWLTQEDIIRYLLSSIGLFSRISALSLDSLGI

Query:  ICTNALSVNYHSPASSAIGAISRCIANQTSVAVVDADGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLEE
        ICTN LSVNYHSPASSAIGAISRCIANQTSVAV+D +GILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLK+S LEGMLEE
Subjt:  ICTNALSVNYHSPASSAIGAISRCIANQTSVAVVDADGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLEE

Query:  FTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLETS
        FTNSPSSIG  SFTSSSSDEEFSPSPSSR+YRRSSSYSARITRRAEAIVCHPRSSLVAV+IQAIAHRVNYVWVIEDDCSLIG+VTFLDMLKVFREHLE +
Subjt:  FTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLETS

Query:  E
        E
Subjt:  E

TrEMBL top hits | e value | % identity | Alignment (shown below each hit)
A0A0A0L3G0 Uncharacterized protein | 5.8e-196 | 90.52
Query:  MAVSLFSHDVSDLCLGKPALRPLFLSATVADALLALKFSDDYFVSVWDCCYSKNGCGG--DGCAGGGDFECCRCVGKLCMVDVICYLCRDENLLSPSAAL
        MAVSLFSHDVSDLCLGKP LRPL LSAT+ADALLAL+FS DYFVSVWDC   K GC G  DG A GGDFECCRCVGKLCMVDVICYLC++ENLLSPS+AL
Subjt:  MAVSLFSHDVSDLCLGKPALRPLFLSATVADALLALKFSDDYFVSVWDCCYSKNGCGG--DGCAGGGDFECCRCVGKLCMVDVICYLCRDENLLSPSAAL

Query:  RASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQLKNPTNAIHGGREFCWLTQEDIIRYLLSSIGLFSRISALSLDSLGI
        +ASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKT+LGSNSRRKQLKN TN IHGG EFCWLTQEDIIRYLL SIGLFS I+ALSLDSLGI
Subjt:  RASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQLKNPTNAIHGGREFCWLTQEDIIRYLLSSIGLFSRISALSLDSLGI

Query:  ICTNALSVNYHSPASSAIGAISRCIANQTSVAVVDADGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLEE
        ICTNALSVNYHSPASSAIGAIS  I NQTSVAV+D DGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLK+S LEGMLEE
Subjt:  ICTNALSVNYHSPASSAIGAISRCIANQTSVAVVDADGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLEE

Query:  FTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLETS
        FTNSPSSIG  SFTSSSSDEEFSPSPSSR+YRRSSSYSARITRRAEAIVCHPRSSLVAV+IQAI HRVNYVWVIEDDCSLIG+VTFLDMLKVFREHLETS
Subjt:  FTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLETS

Query:  E
        E
Subjt:  E

A0A1S3BTV5 CBS domain-containing protein CBSX5 | 1.7e-195 | 90.77
Query:  MAVSLFSHDVSDLCLGKPALRPLFLSATVADALLALKFSDDYFVSVWDCCYSKNGCGG--DGCAGGGDFECCRCVGKLCMVDVICYLCRDENLLSPSAAL
        MAVSLFSHDVSDLCLGKPALRPL LSAT+ADALLAL+FS DYFVSVWDC   K+GC G  DG A GGDFECCRCVGKLCMVDV+CYLC++ENLLSPSAAL
Subjt:  MAVSLFSHDVSDLCLGKPALRPLFLSATVADALLALKFSDDYFVSVWDCCYSKNGCGG--DGCAGGGDFECCRCVGKLCMVDVICYLCRDENLLSPSAAL

Query:  RASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQLKNPTNAIHGGREFCWLTQEDIIRYLLSSIGLFSRISALSLDSLGI
        +ASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQLKN  NAIHGGREFCWLTQEDIIRYLLSSIG FS I+ALSLDSLGI
Subjt:  RASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQLKNPTNAIHGGREFCWLTQEDIIRYLLSSIGLFSRISALSLDSLGI

Query:  ICTNALSVNYHSPASSAIGAISRCIANQTSVAVVDADGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLEE
        I TNALSV+YHSPASSAIGAISR I NQTSVAV+D DGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVK+VKARLK+S LEGMLEE
Subjt:  ICTNALSVNYHSPASSAIGAISRCIANQTSVAVVDADGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLEE

Query:  FTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLETS
        FTNSPSSIG ASFTSSSSDEEFSPSP SR+ RRSSSYSARITRRAEAIVCHPRSSLVAV+IQAIAHRVNYVWVIEDDCSLIG+VTFLDMLKVFREHLETS
Subjt:  FTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLETS

Query:  E
        E
Subjt:  E

A0A5A7VF56 CBS domain-containing protein CBSX5 | 1.7e-195 | 90.77
Query:  MAVSLFSHDVSDLCLGKPALRPLFLSATVADALLALKFSDDYFVSVWDCCYSKNGCGG--DGCAGGGDFECCRCVGKLCMVDVICYLCRDENLLSPSAAL
        MAVSLFSHDVSDLCLGKPALRPL LSAT+ADALLAL+FS DYFVSVWDC   K+GC G  DG A GGDFECCRCVGKLCMVDV+CYLC++ENLLSPSAAL
Subjt:  MAVSLFSHDVSDLCLGKPALRPLFLSATVADALLALKFSDDYFVSVWDCCYSKNGCGG--DGCAGGGDFECCRCVGKLCMVDVICYLCRDENLLSPSAAL

Query:  RASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQLKNPTNAIHGGREFCWLTQEDIIRYLLSSIGLFSRISALSLDSLGI
        +ASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQLKN  NAIHGGREFCWLTQEDIIRYLLSSIG FS I+ALSLDSLGI
Subjt:  RASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQLKNPTNAIHGGREFCWLTQEDIIRYLLSSIGLFSRISALSLDSLGI

Query:  ICTNALSVNYHSPASSAIGAISRCIANQTSVAVVDADGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLEE
        I TNALSV+YHSPASSAIGAISR I NQTSVAV+D DGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVK+VKARLK+S LEGMLEE
Subjt:  ICTNALSVNYHSPASSAIGAISRCIANQTSVAVVDADGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLEE

Query:  FTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLETS
        FTNSPSSIG ASFTSSSSDEEFSPSP SR+ RRSSSYSARITRRAEAIVCHPRSSLVAV+IQAIAHRVNYVWVIEDDCSLIG+VTFLDMLKVFREHLETS
Subjt:  FTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLETS

Query:  E
        E
Subjt:  E

A0A6J1FQ76 CBS domain-containing protein CBSX5-like | 7.3e-199 | 91.79
Query:  MAVSLFSHDVSDLCLGKPALRPLFLSATVADALLALKFSDDYFVSVWDCCYSKNGCGG---DGCAGGGDFECCRCVGKLCMVDVICYLCRDENLLSPSAA
        MAVSLFSHDVSDLCLGKPALR L LSATVADAL ALKFSDDYFVSVWDC  +KNGCGG   DG A GG+FECCRCVGKLCMVDVICYLCRD+NLLSPSAA
Subjt:  MAVSLFSHDVSDLCLGKPALRPLFLSATVADALLALKFSDDYFVSVWDCCYSKNGCGG---DGCAGGGDFECCRCVGKLCMVDVICYLCRDENLLSPSAA

Query:  LRASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQLKNPTNAIHGGREFCWLTQEDIIRYLLSSIGLFSRISALSLDSLG
        L A +SEILPQIPGIVMHLEPSASLL+AIDLVLQGAQNLVVPIK KLGSNSRRKQLK P NAIHGGREFCWLTQEDIIRYLLSSIG FS I+ALSLD+LG
Subjt:  LRASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQLKNPTNAIHGGREFCWLTQEDIIRYLLSSIGLFSRISALSLDSLG

Query:  IICTNALSVNYHSPASSAIGAISRCIANQTSVAVVDADGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLE
        IICTN LSVNYHSPASSAIGAIS CIANQTSVAVVDADGILIGEISPFALA CDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKE+NLEGML+
Subjt:  IICTNALSVNYHSPASSAIGAISRCIANQTSVAVVDADGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLE

Query:  EFTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET
        EFTNSPSSIG  SFTSSSSDEEFSPSPSSRKYRRSSSYSARITRR+EAIVCHPRSSL+AVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET
Subjt:  EFTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET

Query:  SE
        SE
Subjt:  SE

A0A6J1IW58 CBS domain-containing protein CBSX5-like | 4.1e-202 | 93.03
Query:  MAVSLFSHDVSDLCLGKPALRPLFLSATVADALLALKFSDDYFVSVWDCCYSKNGCGG---DGCAGGGDFECCRCVGKLCMVDVICYLCRDENLLSPSAA
        MAVSLFSHDVSDLCLGKPALR L LSATVADAL ALKFSDDYFVSVWDC  +KNGCGG   DG A GGDFECCRCVGKLCMVDVICYLCRDENLLSPSAA
Subjt:  MAVSLFSHDVSDLCLGKPALRPLFLSATVADALLALKFSDDYFVSVWDCCYSKNGCGG---DGCAGGGDFECCRCVGKLCMVDVICYLCRDENLLSPSAA

Query:  LRASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQLKNPTNAIHGGREFCWLTQEDIIRYLLSSIGLFSRISALSLDSLG
        L A +SEILPQIPGIVMHLEPSASLL+AIDLVLQGAQNLVVPIK KLGSNSRRKQLKNPTNAIHGGREFCWLTQEDIIRYLLSSIG FS I+ALSLD+LG
Subjt:  LRASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQLKNPTNAIHGGREFCWLTQEDIIRYLLSSIGLFSRISALSLDSLG

Query:  IICTNALSVNYHSPASSAIGAISRCIANQTSVAVVDADGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLE
        IICTNALS+NYHSPASSAIGAISRCIANQTSVAVVDADGILIGEISPFALA CDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKE+NLEGML+
Subjt:  IICTNALSVNYHSPASSAIGAISRCIANQTSVAVVDADGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLE

Query:  EFTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET
        EFTNSPSSIG  SFTSSSSDEEFSPSPSSRKYRRSSSYSARITRR+EAIVCHPRSSL+AVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET
Subjt:  EFTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLET

Query:  SE
        SE
Subjt:  SE

SwissProt top hits | e value | % identity | Alignment (shown below each hit)
Q84WQ5 CBS domain-containing protein CBSX5 | 1.1e-90 | 48.52
Query:  MAVSLFSHDVSDLCLGKPALRPL-FLSATVADALLALKFSDDYFVSVWDCCYSKNGCGGDGCAGGGDFECCRCVGKLCMVDVICYLCRDENLLSPSAALR
        MA+SL S++VSDLCLGKP LR L   S++V+DA+ ALK S+D F+SVW+C +  +           +   C C+GK+ M DVIC+L +D +      AL 
Subjt:  MAVSLFSHDVSDLCLGKPALRPL-FLSATVADALLALKFSDDYFVSVWDCCYSKNGCGGDGCAGGGDFECCRCVGKLCMVDVICYLCRDENLLSPSAALR

Query:  ASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRR--KQLKNPTNAIHGGREFCWLTQEDIIRYLLSSIGLFSRISALSLDSLG
        +SVS +LP+   IV+H++PS SL+EAIDL+++GAQNL+VPI TK  +  ++    +   T     G+ FCW+TQEDII++LL  I  FS + A+SL  LG
Subjt:  ASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRR--KQLKNPTNAIHGGREFCWLTQEDIIRYLLSSIGLFSRISALSLDSLG

Query:  IICT--NALSVNYHSPASSAIGAISRCIANQTSVAVVDADG-----ILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKES
        +I +    ++V+YHS AS+ + A+S  +A QTSVAVVD +G      LIGEISP  L  CD+  AAA+ TLS+GDLMAYID   PPE LV++V+ RL++ 
Subjt:  IICT--NALSVNYHSPASSAIGAISRCIANQTSVAVVDADG-----ILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKES

Query:  NLEGMLEEFTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKV
         L G++  F     S+   S +S  S EE +P  ++  Y RS S SAR+ R++EAIVC+P+SSL+AV+IQA+AHRVNY WV+E D   +G+VTF+D+LKV
Subjt:  NLEGMLEEFTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKV

Query:  FREHLE
        FR+ LE
Subjt:  FREHLE

Q8GXI9 SNF1-related protein kinase regulatory subunit gamma-like PV42b | 1.6e-06 | 22.15
Query:  VGKLCMVDVICYLCRDENLLSPSAALRASVSEILPQIP-GI-VMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQLKNPTNAIHGGREFCWLT
        +G + M+DV+ ++  D+        + A VS I+   P G+ +  L P+ S+++ ++++ +G   ++VP+      +S  + +  P   +     +  L+
Subjt:  VGKLCMVDVICYLCRDENLLSPSAALRASVSEILPQIP-GI-VMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQLKNPTNAIHGGREFCWLT

Query:  QEDIIRYLLSSIGLFSRISALSLDSLGIICTNALSVNYHSPASSAIGAISRCIANQTSVAVVDADGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYI
        Q D+I +          I + ++  L  I    L++   +    AI  +S  IA   +V +V+A G   GE     + G ++ V   + T S+ DL    
Subjt:  QEDIIRYLLSSIGLFSRISALSLDSLGIICTNALSVNYHSPASSAIGAISRCIANQTSVAVVDADGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYI

Query:  DCGGPPEDLVKVVKARLKESNLEGMLEEFTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVW
                      A L+       LE     P ++ F + TS                          T   E + CH  S+L  VI      RV+ VW
Subjt:  DCGGPPEDLVKVVKARLKESNLEGMLEEFTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVW

Query:  VIEDDCSLIGIVTFLDMLKVFREHL
        V++ +  L G+V+  D++ V R  L
Subjt:  VIEDDCSLIGIVTFLDMLKVFREHL

Q8GZA4 CBS domain-containing protein CBSX6 | 9.9e-20 | 26.13
Query:  MAVSLFSHDVSDLCLGKPALRPLFLSATVADALLALKFSDDYFVSVWDCCYSKNGCGGDGCAGGGDFECCRCVGKLCMVDVICYLCRDENLLSPSAALRA
        MA     H V DL +GKP +   + + TV  A+ A+  S +  + VW     +      G     +    R VG L  +D++ +L + E  L    A++ 
Subjt:  MAVSLFSHDVSDLCLGKPALRPLFLSATVADALLALKFSDDYFVSVWDCCYSKNGCGGDGCAGGGDFECCRCVGKLCMVDVICYLCRDENLLSPSAALRA

Query:  SVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRR-------KQLKN-----------------PTNAIHGGRE-FCWLTQEDII
         VSE++     ++  ++P   L++A++++ QG + L+VP        S+R       K LKN                 PT ++   R+ FC L++ED+I
Subjt:  SVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRR-------KQLKN-----------------PTNAIHGGRE-FCWLTQEDII

Query:  RYLLSSIGLFSRISALSLDSLGIICTNALSVNYHSPASSAIGAISRCIANQTSVAVV-----DADGILIGEISPFALAGCDKAVAA-AIMTLSSGDLMAY
        R+L+  +G  + +   S+ +LGII  N    N+   +  AI A  R + + +++AV+     +    +IGEIS   L  CD   AA A+  L +G     
Subjt:  RYLLSSIGLFSRISALSLDSLGIICTNALSVNYHSPASSAIGAISRCIANQTSVAVV-----DADGILIGEISPFALAGCDKAVAA-AIMTLSSGDLMAY

Query:  IDCGGPPEDLVKVVKARLKESNLEGMLEEFTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYV
                  V  V+  +   +    L+         G A+     S      +P+S    R S   +    R+  + C   SSL AV+ Q ++HR  +V
Subjt:  IDCGGPPEDLVKVVKARLKESNLEGMLEEFTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYV

Query:  WVIEDDCS--LIGIVTFLDML
        WV E D    L+G+V + ++L
Subjt:  WVIEDDCS--LIGIVTFLDML

Arabidopsis top hits | e value | % identity | Alignment (shown below each hit)
AT1G65320.1 Cystathionine beta-synthase (CBS) family protein | 7.1e-21 | 26.13
Query:  MAVSLFSHDVSDLCLGKPALRPLFLSATVADALLALKFSDDYFVSVWDCCYSKNGCGGDGCAGGGDFECCRCVGKLCMVDVICYLCRDENLLSPSAALRA
        MA     H V DL +GKP +   + + TV  A+ A+  S +  + VW     +      G     +    R VG L  +D++ +L + E  L    A++ 
Subjt:  MAVSLFSHDVSDLCLGKPALRPLFLSATVADALLALKFSDDYFVSVWDCCYSKNGCGGDGCAGGGDFECCRCVGKLCMVDVICYLCRDENLLSPSAALRA

Query:  SVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRR-------KQLKN-----------------PTNAIHGGRE-FCWLTQEDII
         VSE++     ++  ++P   L++A++++ QG + L+VP        S+R       K LKN                 PT ++   R+ FC L++ED+I
Subjt:  SVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRR-------KQLKN-----------------PTNAIHGGRE-FCWLTQEDII

Query:  RYLLSSIGLFSRISALSLDSLGIICTNALSVNYHSPASSAIGAISRCIANQTSVAVV-----DADGILIGEISPFALAGCDKAVAA-AIMTLSSGDLMAY
        R+L+  +G  + +   S+ +LGII  N    N+   +  AI A  R + + +++AV+     +    +IGEIS   L  CD   AA A+  L +G     
Subjt:  RYLLSSIGLFSRISALSLDSLGIICTNALSVNYHSPASSAIGAISRCIANQTSVAVV-----DADGILIGEISPFALAGCDKAVAA-AIMTLSSGDLMAY

Query:  IDCGGPPEDLVKVVKARLKESNLEGMLEEFTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYV
                  V  V+  +   +    L+         G A+     S      +P+S    R S   +    R+  + C   SSL AV+ Q ++HR  +V
Subjt:  IDCGGPPEDLVKVVKARLKESNLEGMLEEFTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYV

Query:  WVIEDDCS--LIGIVTFLDML
        WV E D    L+G+V + ++L
Subjt:  WVIEDDCS--LIGIVTFLDML

AT1G80090.1 Cystathionine beta-synthase (CBS) family protein | 1.2e-07 | 22.15
Query:  VGKLCMVDVICYLCRDENLLSPSAALRASVSEILPQIP-GI-VMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQLKNPTNAIHGGREFCWLT
        +G + M+DV+ ++  D+        + A VS I+   P G+ +  L P+ S+++ ++++ +G   ++VP+      +S  + +  P   +     +  L+
Subjt:  VGKLCMVDVICYLCRDENLLSPSAALRASVSEILPQIP-GI-VMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQLKNPTNAIHGGREFCWLT

Query:  QEDIIRYLLSSIGLFSRISALSLDSLGIICTNALSVNYHSPASSAIGAISRCIANQTSVAVVDADGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYI
        Q D+I +          I + ++  L  I    L++   +    AI  +S  IA   +V +V+A G   GE     + G ++ V   + T S+ DL    
Subjt:  QEDIIRYLLSSIGLFSRISALSLDSLGIICTNALSVNYHSPASSAIGAISRCIANQTSVAVVDADGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYI

Query:  DCGGPPEDLVKVVKARLKESNLEGMLEEFTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVW
                      A L+       LE     P ++ F + TS                          T   E + CH  S+L  VI      RV+ VW
Subjt:  DCGGPPEDLVKVVKARLKESNLEGMLEEFTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVW

Query:  VIEDDCSLIGIVTFLDMLKVFREHL
        V++ +  L G+V+  D++ V R  L
Subjt:  VIEDDCSLIGIVTFLDMLKVFREHL

AT4G27460.1 Cystathionine beta-synthase (CBS) family protein | 7.7e-92 | 48.52
Query:  MAVSLFSHDVSDLCLGKPALRPL-FLSATVADALLALKFSDDYFVSVWDCCYSKNGCGGDGCAGGGDFECCRCVGKLCMVDVICYLCRDENLLSPSAALR
        MA+SL S++VSDLCLGKP LR L   S++V+DA+ ALK S+D F+SVW+C +  +           +   C C+GK+ M DVIC+L +D +      AL 
Subjt:  MAVSLFSHDVSDLCLGKPALRPL-FLSATVADALLALKFSDDYFVSVWDCCYSKNGCGGDGCAGGGDFECCRCVGKLCMVDVICYLCRDENLLSPSAALR

Query:  ASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRR--KQLKNPTNAIHGGREFCWLTQEDIIRYLLSSIGLFSRISALSLDSLG
        +SVS +LP+   IV+H++PS SL+EAIDL+++GAQNL+VPI TK  +  ++    +   T     G+ FCW+TQEDII++LL  I  FS + A+SL  LG
Subjt:  ASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRR--KQLKNPTNAIHGGREFCWLTQEDIIRYLLSSIGLFSRISALSLDSLG

Query:  IICT--NALSVNYHSPASSAIGAISRCIANQTSVAVVDADG-----ILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKES
        +I +    ++V+YHS AS+ + A+S  +A QTSVAVVD +G      LIGEISP  L  CD+  AAA+ TLS+GDLMAYID   PPE LV++V+ RL++ 
Subjt:  IICT--NALSVNYHSPASSAIGAISRCIANQTSVAVVDADG-----ILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKES

Query:  NLEGMLEEFTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKV
         L G++  F     S+   S +S  S EE +P  ++  Y RS S SAR+ R++EAIVC+P+SSL+AV+IQA+AHRVNY WV+E D   +G+VTF+D+LKV
Subjt:  NLEGMLEEFTNSPSSIGFASFTSSSSDEEFSPSPSSRKYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKV

Query:  FREHLE
        FR+ LE
Subjt:  FREHLE

AT5G53750.1 CBS domain-containing protein | 1.5e-95 | 49.64
Query:  MAVSLFSHDVSDLCLGKPALRPLFL-SATVADALLALKFSDDYFVSVWDCCYSKNGCGGDGCAGGGDFECCRCVGKLCMVDVICYLCR-DENLLSPSAAL
        MA++L SH++SDLC+GKP LR L + +ATVADA+ ALK SD+ F++VW C + +            D + C C+GK+CM DVICYL + D N+LS S+A 
Subjt:  MAVSLFSHDVSDLCLGKPALRPLFL-SATVADALLALKFSDDYFVSVWDCCYSKNGCGGDGCAGGGDFECCRCVGKLCMVDVICYLCR-DENLLSPSAAL

Query:  RASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQ----------LKNPTNAIH-GGREFCWLTQEDIIRYLLSSIGLFSR
         ASVS +LP+   +V+H++ S SL+EAIDL+++GAQNL+VPI TK  +  R++Q          L N T+  H   REFCW+TQEDIIR+LL SI +FS 
Subjt:  RASVSEILPQIPGIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQ----------LKNPTNAIH-GGREFCWLTQEDIIRYLLSSIGLFSR

Query:  ISALSLDSLGIICT--NALSVNYHSPASSAIGAISRCIANQTSVAVV-------DADGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPED
        + +LS+  LG+I +    L+V+Y+S A+SA+ AISR I +  SVAVV       D   +LIGEISP  LA CD+   AA+ TLS+GDLM+YID  GPPE 
Subjt:  ISALSLDSLGIICT--NALSVNYHSPASSAIGAISRCIANQTSVAVV-------DADGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPED

Query:  LVKVVKARLKESNLEGMLEEFTNSPSSIGFASFTSSSSDEEFSPSPSSR---KYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVWVIEDD
        LV VV+ RL++  + G++       S I   S +S SS +E SP+  +R    Y RS S +AR+ R++ AIVC+ +SSL+AV+IQAIAHRV+YVWVI++D
Subjt:  LVKVVKARLKESNLEGMLEEFTNSPSSIGFASFTSSSSDEEFSPSPSSR---KYRRSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVWVIEDD

Query:  CSLIGIVTFLDMLKVFREHLE
          LIG+VTF+D+LK+FRE L+
Subjt:  CSLIGIVTFLDMLKVFREHLE


Sequences
CDS sequence
ATGGCAGTGAGCTTGTTTTCGCATGACGTTTCTGACCTCTGTCTCGGCAAGCCTGCGCTGAGGCCGCTGTTCTTGTCCGCAACCGTCGCCGACGCCTTGCTTGCTCTCAA
ATTCTCCGACGATTACTTCGTCAGTGTTTGGGATTGCTGTTACTCGAAGAACGGTTGCGGCGGCGATGGTTGTGCTGGCGGCGGCGATTTTGAGTGCTGCCGATGCGTCG
GGAAGCTGTGTATGGTAGATGTGATTTGCTATCTCTGTAGGGACGAGAATCTGTTGTCACCTTCGGCTGCATTGCGTGCTTCTGTCTCTGAGATTTTGCCTCAAATTCCT
GGAATCGTGATGCATTTGGAGCCTTCTGCTAGCTTGTTAGAGGCCATTGATCTGGTTCTCCAAGGTGCTCAAAATCTTGTAGTCCCAATCAAGACTAAGCTAGGAAGCAA
TTCAAGAAGGAAGCAGCTCAAAAACCCCACAAATGCCATCCATGGCGGCCGTGAATTCTGCTGGCTGACTCAGGAAGATATAATCAGATATCTCCTCAGCTCAATTGGCC
TTTTTTCCCGCATTTCTGCCCTCTCCCTTGACAGCCTTGGCATCATCTGCACCAATGCTTTATCAGTAAACTACCACTCCCCTGCCTCCTCTGCCATTGGAGCCATTTCC
CGCTGCATCGCCAACCAAACCTCGGTTGCAGTCGTCGATGCTGATGGAATTCTGATCGGTGAGATCTCACCCTTTGCCCTTGCCGGGTGCGACAAGGCCGTTGCAGCTGC
GATTATGACCTTATCGTCCGGAGACCTGATGGCCTACATTGACTGTGGGGGGCCTCCGGAGGATCTTGTTAAGGTGGTGAAGGCTAGGCTGAAAGAGAGCAACTTGGAAG
GAATGCTGGAGGAATTCACTAATTCACCATCCTCAATTGGATTTGCATCTTTCACTTCTTCATCATCAGATGAGGAGTTCTCACCATCTCCAAGCTCGAGAAAGTATAGG
AGATCATCAAGCTACTCGGCACGGATCACTCGTCGTGCTGAGGCCATAGTTTGTCATCCAAGGAGCTCCTTGGTAGCTGTTATCATCCAGGCAATCGCACACCGTGTGAA
CTACGTCTGGGTTATCGAAGACGACTGCAGTTTGATTGGAATTGTCACATTCCTTGATATGTTAAAAGTTTTCAGAGAACATTTAGAGACAAGTGAGTAG
mRNA sequence
GCCATTGCCGGCCACCGTCAACCTCCTCCTCTCCGCCGCTCTCCTATATCCATTTCCGTCGGGGTTTCTTCAAACTCTTCCCTCTTAGCTTTTGTATGGCAGTGAGCTTG
TTTTCGCATGACGTTTCTGACCTCTGTCTCGGCAAGCCTGCGCTGAGGCCGCTGTTCTTGTCCGCAACCGTCGCCGACGCCTTGCTTGCTCTCAAATTCTCCGACGATTA
CTTCGTCAGTGTTTGGGATTGCTGTTACTCGAAGAACGGTTGCGGCGGCGATGGTTGTGCTGGCGGCGGCGATTTTGAGTGCTGCCGATGCGTCGGGAAGCTGTGTATGG
TAGATGTGATTTGCTATCTCTGTAGGGACGAGAATCTGTTGTCACCTTCGGCTGCATTGCGTGCTTCTGTCTCTGAGATTTTGCCTCAAATTCCTGGAATCGTGATGCAT
TTGGAGCCTTCTGCTAGCTTGTTAGAGGCCATTGATCTGGTTCTCCAAGGTGCTCAAAATCTTGTAGTCCCAATCAAGACTAAGCTAGGAAGCAATTCAAGAAGGAAGCA
GCTCAAAAACCCCACAAATGCCATCCATGGCGGCCGTGAATTCTGCTGGCTGACTCAGGAAGATATAATCAGATATCTCCTCAGCTCAATTGGCCTTTTTTCCCGCATTT
CTGCCCTCTCCCTTGACAGCCTTGGCATCATCTGCACCAATGCTTTATCAGTAAACTACCACTCCCCTGCCTCCTCTGCCATTGGAGCCATTTCCCGCTGCATCGCCAAC
CAAACCTCGGTTGCAGTCGTCGATGCTGATGGAATTCTGATCGGTGAGATCTCACCCTTTGCCCTTGCCGGGTGCGACAAGGCCGTTGCAGCTGCGATTATGACCTTATC
GTCCGGAGACCTGATGGCCTACATTGACTGTGGGGGGCCTCCGGAGGATCTTGTTAAGGTGGTGAAGGCTAGGCTGAAAGAGAGCAACTTGGAAGGAATGCTGGAGGAAT
TCACTAATTCACCATCCTCAATTGGATTTGCATCTTTCACTTCTTCATCATCAGATGAGGAGTTCTCACCATCTCCAAGCTCGAGAAAGTATAGGAGATCATCAAGCTAC
TCGGCACGGATCACTCGTCGTGCTGAGGCCATAGTTTGTCATCCAAGGAGCTCCTTGGTAGCTGTTATCATCCAGGCAATCGCACACCGTGTGAACTACGTCTGGGTTAT
CGAAGACGACTGCAGTTTGATTGGAATTGTCACATTCCTTGATATGTTAAAAGTTTTCAGAGAACATTTAGAGACAAGTGAGTAGACTTTCAAGCCATTTGGCTTAGAAA
TTTTGCACCCCCTCTCCCC
Protein sequence
MAVSLFSHDVSDLCLGKPALRPLFLSATVADALLALKFSDDYFVSVWDCCYSKNGCGGDGCAGGGDFECCRCVGKLCMVDVICYLCRDENLLSPSAALRASVSEILPQIP
GIVMHLEPSASLLEAIDLVLQGAQNLVVPIKTKLGSNSRRKQLKNPTNAIHGGREFCWLTQEDIIRYLLSSIGLFSRISALSLDSLGIICTNALSVNYHSPASSAIGAIS
RCIANQTSVAVVDADGILIGEISPFALAGCDKAVAAAIMTLSSGDLMAYIDCGGPPEDLVKVVKARLKESNLEGMLEEFTNSPSSIGFASFTSSSSDEEFSPSPSSRKYR
RSSSYSARITRRAEAIVCHPRSSLVAVIIQAIAHRVNYVWVIEDDCSLIGIVTFLDMLKVFREHLETSE
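
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below assumes the standard nuclear genetic code and uses only the first 30 and last 21 bases of the CDS as copied from this record. The `translate` helper is a hypothetical name for illustration, not part of any CuGenDB tooling.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the CDS; the protein sequence starts with MAVSLFSHDV.
cds_start = "ATGGCAGTGAGCTTGTTTTCGCATGACGTT"
# Last 21 bases of the CDS, including the TAG stop; the protein ends with HLETSE.
cds_end = "CATTTAGAGACAAGTGAGTAG"

print(translate(cds_start))
print(translate(cds_end))
```

Passing the full CDS to the same function should give the complete protein sequence, provided the record is internally consistent.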