; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC04G077610 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC04G077610
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionCBS domain-containing protein CBSX5-like
Genome locationCicolChr04:32371268..32373634
RNA-Seq ExpressionCcUC04G077610
SyntenyCcUC04G077610
Gene Ontology termsGO:0006468 - protein phosphorylation (biological process)
GO:0042149 - cellular response to glucose starvation (biological process)
GO:0050790 - regulation of catalytic activity (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0031588 - nucleotide-activated protein kinase complex (cellular component)
GO:0016208 - AMP binding (molecular function)
GO:0019887 - protein kinase regulator activity (molecular function)
GO:0019901 - protein kinase binding (molecular function)
InterPro domainsIPR000644 - CBS domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6594611.1 CBS domain-containing protein CBSX5, partial [Cucurbita argyrosperma subsp. sororia]1.6e-18183.8Show/hide
Query:  MAVRLFDHHISDICLGKPALRSISLSATLADALSALKKLGENYISVWSCASHSSKSASHHDCRCVAKISVLDVILFLCNEDNLSQPAVALQSSVSVLIPQ
        MAVRL  H +SDICLGKPALRSIS+SATLADALSALK+LGENYISVW+CA HSSKS S  DCRCV KISV+DV+ FLC E+NLSQPAVAL S +SV+I +
Subjt:  MAVRLFDHHISDICLGKPALRSISLSATLADALSALKKLGENYISVWSCASHSSKSASHHDCRCVAKISVLDVILFLCNEDNLSQPAVALQSSVSVLIPQ

Query:  GHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRERVLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSPTSITPINSLDAIDTANIL
          V+VRHL+PHASLVEAI+LLLEG QNLVVPIQTRT  KSR++VL+EVAPFDC L+N+LEYCWLTQEDIIRYLLNSIGLFSPTSITPINSL+AIDTANIL
Subjt:  GHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRERVLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSPTSITPINSLDAIDTANIL

Query:  AVHYDDPALSALPLLSQAIIHQTSVAIVDSEGKLIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGDPPEDLVQLVKDRLEERNLGALLEWVEEESA
        A+HYDDPALSALPLLSQAIIHQ+++AIVDS+GKLIGEISPLTLNSCDET+ AAIVTLSAGELMAYVDCGDPPEDL+QLVKDRLEERNLGALLEWVEE+SA
Subjt:  AVHYDDPALSALPLLSQAIIHQTSVAIVDSEGKLIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGDPPEDLVQLVKDRLEERNLGALLEWVEEESA

Query:  SVLSSCSSFYSSSSDDDSGSTWGRSGKLGRCSTRQVRSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEEDGSLVGIITFASMLKVFRERLKSMC
          +SSCSSFYS+SSDDDSGS+ GRSGK G+ S RQVRSSE A CNPRSSLVAVMIQALAHRVPY+WV+E+DG LVGIITF SMLKVFRERLKSMC
Subjt:  SVLSSCSSFYSSSSDDDSGSTWGRSGKLGRCSTRQVRSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEEDGSLVGIITFASMLKVFRERLKSMC

XP_008439875.1 PREDICTED: CBS domain-containing protein CBSX5-like [Cucumis melo]9.7e-18286.11Show/hide
Query:  MAVRLFDHHISDICLGKPALRSISLSATLADALSALKKLGENYISVWSCASHSSKSASHHDCRCVAKISVLDVILFLCNEDNLSQPAVALQSSVSVLIPQ
        MAVRL DHH+SDICLGKPAL SISLSATLADALSALKKLGENYISVW+C+SH SKS+SH+DC+C+ KISVLDV+LFLC E+NLSQPAVALQSSVSVLIP 
Subjt:  MAVRLFDHHISDICLGKPALRSISLSATLADALSALKKLGENYISVWSCASHSSKSASHHDCRCVAKISVLDVILFLCNEDNLSQPAVALQSSVSVLIPQ

Query:  GHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRERVLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSPTSITPINSLDAIDTANIL
          VLV HLEPHASLVE I+LLLEG QNLVVPIQTRTS KSRE+VLE VAPFDCPL+N LEYCW+TQEDIIRYLLNSIGLFSPTSITPINSL+AIDTANIL
Subjt:  GHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRERVLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSPTSITPINSLDAIDTANIL

Query:  AVHYDDPALSALPLLSQAIIHQTSVAIVDSEGKLIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGDPPEDLVQLVKDRLEERNLGALLEWVEEESA
        AVHYDDPALSALPL+SQAIIHQ+SVAIV+S+GKLIGEISPLTLNS DETI AAIVTLSAGELMAYV C DPPEDLVQLVKDRLEERNL  LLEWVEEESA
Subjt:  AVHYDDPALSALPLLSQAIIHQTSVAIVDSEGKLIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGDPPEDLVQLVKDRLEERNLGALLEWVEEESA

Query:  SVLSSCSSFYSSSSDDDSGSTWGRSGKLGRCSTRQV-RSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEEDGSLVGIITFASMLKVFRERLKSMC
          +S+CSSF SSSSDDDSGS WGRSGKL +CSTRQV RSSE AVCNP+SSLVAVMIQALA RVPYMWV EEDG LVGI TF SMLKVF ERLKSMC
Subjt:  SVLSSCSSFYSSSSDDDSGSTWGRSGKLGRCSTRQV-RSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEEDGSLVGIITFASMLKVFRERLKSMC

XP_022926754.1 CBS domain-containing protein CBSX5-like [Cucurbita moschata]5.1e-18384.3Show/hide
Query:  MAVRLFDHHISDICLGKPALRSISLSATLADALSALKKLGENYISVWSCASHSSKSASHHDCRCVAKISVLDVILFLCNEDNLSQPAVALQSSVSVLIPQ
        MAVRL  H +SDICLGKPALRSIS+SATLADALSALK+LGENYISVW+CA HSSKS S  DCRCV KISV+DV+ FLC E+NLSQPAVAL SS+SV+IP+
Subjt:  MAVRLFDHHISDICLGKPALRSISLSATLADALSALKKLGENYISVWSCASHSSKSASHHDCRCVAKISVLDVILFLCNEDNLSQPAVALQSSVSVLIPQ

Query:  GHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRERVLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSPTSITPINSLDAIDTANIL
          V+VRHL+PHASLVEAI+LLLEG QNLVVPIQTRT  KSR++VL+EVAPFDC L+N+LEYCWLTQEDIIRYLLNSIGLFSPTSITPINSL+AIDTANIL
Subjt:  GHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRERVLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSPTSITPINSLDAIDTANIL

Query:  AVHYDDPALSALPLLSQAIIHQTSVAIVDSEGKLIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGDPPEDLVQLVKDRLEERNLGALLEWVEEESA
        A+HYDDPALSALPLLSQAIIHQ+++AIVDS+GKLIGEISPLTLNSCDET+ AAIVTLSAGELMAYVDCGDPPEDL+QLVKDRLEERNLGALLEWVEE+SA
Subjt:  AVHYDDPALSALPLLSQAIIHQTSVAIVDSEGKLIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGDPPEDLVQLVKDRLEERNLGALLEWVEEESA

Query:  SVLSSCSSFYSSSSDDDSGSTWGRSGKLGRCSTRQVRSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEEDGSLVGIITFASMLKVFRERLKSMC
          +SSCSSFYS+SSDDDSGS+ GRSGK G+ S RQVRSSE A CNPRSSLVAVMIQALAHRVPY+WV+E+DG LVGIITF SMLKVFRERLKSMC
Subjt:  SVLSSCSSFYSSSSDDDSGSTWGRSGKLGRCSTRQVRSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEEDGSLVGIITFASMLKVFRERLKSMC

XP_023003177.1 CBS domain-containing protein CBSX5-like [Cucurbita maxima]1.6e-18184.05Show/hide
Query:  MAVRLFDHHISDICLGKPALRSISLSATLADALSALKKLGENYISVWSCASHSSKSASHHDCRCVAKISVLDVILFLCNEDNLSQPAVALQSSVSVLIPQ
        MAVRL  H +SDICLGKPALRSIS+SATLADALSALK+LGENYISVW+CA HSSKS S  DCRCV KISV+DV+ FLC E+NL QPAVAL SS+SV+IP+
Subjt:  MAVRLFDHHISDICLGKPALRSISLSATLADALSALKKLGENYISVWSCASHSSKSASHHDCRCVAKISVLDVILFLCNEDNLSQPAVALQSSVSVLIPQ

Query:  GHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRERVLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSPTSITPINSLDAIDTANIL
          V+VRHL+PHASLVEAI+LLLEG QNLVVPIQTRT  KSR++VL+EVAPFDC L+N+LEYCWLTQEDIIRYLLNSIGLFSPTSITPINSL+AIDTANIL
Subjt:  GHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRERVLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSPTSITPINSLDAIDTANIL

Query:  AVHYDDPALSALPLLSQAIIHQTSVAIVDSEGKLIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGDPPEDLVQLVKDRLEERNLGALLEWVEEESA
        A+HYDDPALSALPLLSQAIIHQ+++AIVDS+GKLIGEISP TLNSC+ET+ AAI TLSAGELMAYVDCGDPPEDLVQLVKDRLEERNLGALLEWVEE+SA
Subjt:  AVHYDDPALSALPLLSQAIIHQTSVAIVDSEGKLIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGDPPEDLVQLVKDRLEERNLGALLEWVEEESA

Query:  SVLSSCSSFYSSSSDDDSGSTWGRSGKLGRCSTRQVRSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEEDGSLVGIITFASMLKVFRERLKSMC
          +SSCSSFYS+SSDDDSGS+ GRSGK GR S RQVRSSEAAVCNPRSSLVAVMIQALAHRVPY+WV+E+DG LVGIITF SMLKVFRERL+SMC
Subjt:  SVLSSCSSFYSSSSDDDSGSTWGRSGKLGRCSTRQVRSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEEDGSLVGIITFASMLKVFRERLKSMC

XP_038881934.1 CBS domain-containing protein CBSX5-like [Benincasa hispida]1.0e-18688.35Show/hide
Query:  MAVRLFDHHISDICLGKPALRSISLSATLADALSALKKLGENYISVWSCASHSSKSASHHDCRCVAKISVLDVILFLCNEDNLSQPAVALQSSVSVLIPQ
        MAVRL DHH+SDICLGKPAL SISLSATLADALSAL KLGE +ISVW+C  H SKSASH DCRC+ KISVLDVILFLC E+NLSQPAVALQS VSVLIPQ
Subjt:  MAVRLFDHHISDICLGKPALRSISLSATLADALSALKKLGENYISVWSCASHSSKSASHHDCRCVAKISVLDVILFLCNEDNLSQPAVALQSSVSVLIPQ

Query:  GHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRERVLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSPTSITPINSLDAIDTANIL
          VLVRHLEPHASLVEAI+LLLEG QNLVVPIQTRTSVKSRE+VLEE A  DCPLYND  YCWLTQEDIIRYL NSI LFSPTSITPINSL+A+DTANIL
Subjt:  GHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRERVLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSPTSITPINSLDAIDTANIL

Query:  AVHYDDPALSALPLLSQAIIHQTSVAIVDSEGKLIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGDPPEDLVQLVKDRLEERNLGALLEWVEEESA
        AVHYDDPALSALPLLSQAIIHQ+SVAIVDS+GKLIGEISPLTLNSC ETI AAIVTLSAGELMAY+DCGDPPEDLVQL+KDRLEERNLGALLEWVEEESA
Subjt:  AVHYDDPALSALPLLSQAIIHQTSVAIVDSEGKLIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGDPPEDLVQLVKDRLEERNLGALLEWVEEESA

Query:  SVLSSCSSFYSSSSDDDSGSTWGRSGKLGRCSTRQVRSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEEDGSLVGIITFASMLKVFRERLKSMC
          +SSCSSFYSSSSDDDSG +WGRSGKL +CSTRQVRSSEAAVC+PRSSLVAVMIQALA RVPYMWVIEEDG+LVGIITFASMLKVFRERLKSMC
Subjt:  SVLSSCSSFYSSSSDDDSGSTWGRSGKLGRCSTRQVRSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEEDGSLVGIITFASMLKVFRERLKSMC

TrEMBL top hitse value%identityAlignment
A0A1S3B0E8 CBS domain-containing protein CBSX5-like4.7e-18286.11Show/hide
Query:  MAVRLFDHHISDICLGKPALRSISLSATLADALSALKKLGENYISVWSCASHSSKSASHHDCRCVAKISVLDVILFLCNEDNLSQPAVALQSSVSVLIPQ
        MAVRL DHH+SDICLGKPAL SISLSATLADALSALKKLGENYISVW+C+SH SKS+SH+DC+C+ KISVLDV+LFLC E+NLSQPAVALQSSVSVLIP 
Subjt:  MAVRLFDHHISDICLGKPALRSISLSATLADALSALKKLGENYISVWSCASHSSKSASHHDCRCVAKISVLDVILFLCNEDNLSQPAVALQSSVSVLIPQ

Query:  GHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRERVLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSPTSITPINSLDAIDTANIL
          VLV HLEPHASLVE I+LLLEG QNLVVPIQTRTS KSRE+VLE VAPFDCPL+N LEYCW+TQEDIIRYLLNSIGLFSPTSITPINSL+AIDTANIL
Subjt:  GHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRERVLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSPTSITPINSLDAIDTANIL

Query:  AVHYDDPALSALPLLSQAIIHQTSVAIVDSEGKLIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGDPPEDLVQLVKDRLEERNLGALLEWVEEESA
        AVHYDDPALSALPL+SQAIIHQ+SVAIV+S+GKLIGEISPLTLNS DETI AAIVTLSAGELMAYV C DPPEDLVQLVKDRLEERNL  LLEWVEEESA
Subjt:  AVHYDDPALSALPLLSQAIIHQTSVAIVDSEGKLIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGDPPEDLVQLVKDRLEERNLGALLEWVEEESA

Query:  SVLSSCSSFYSSSSDDDSGSTWGRSGKLGRCSTRQV-RSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEEDGSLVGIITFASMLKVFRERLKSMC
          +S+CSSF SSSSDDDSGS WGRSGKL +CSTRQV RSSE AVCNP+SSLVAVMIQALA RVPYMWV EEDG LVGI TF SMLKVF ERLKSMC
Subjt:  SVLSSCSSFYSSSSDDDSGSTWGRSGKLGRCSTRQV-RSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEEDGSLVGIITFASMLKVFRERLKSMC

A0A5A7UBW3 CBS domain-containing protein CBSX5-like4.7e-18286.11Show/hide
Query:  MAVRLFDHHISDICLGKPALRSISLSATLADALSALKKLGENYISVWSCASHSSKSASHHDCRCVAKISVLDVILFLCNEDNLSQPAVALQSSVSVLIPQ
        MAVRL DHH+SDICLGKPAL SISLSATLADALSALKKLGENYISVW+C+SH SKS+SH+DC+C+ KISVLDV+LFLC E+NLSQPAVALQSSVSVLIP 
Subjt:  MAVRLFDHHISDICLGKPALRSISLSATLADALSALKKLGENYISVWSCASHSSKSASHHDCRCVAKISVLDVILFLCNEDNLSQPAVALQSSVSVLIPQ

Query:  GHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRERVLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSPTSITPINSLDAIDTANIL
          VLV HLEPHASLVE I+LLLEG QNLVVPIQTRTS KSRE+VLE VAPFDCPL+N LEYCW+TQEDIIRYLLNSIGLFSPTSITPINSL+AIDTANIL
Subjt:  GHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRERVLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSPTSITPINSLDAIDTANIL

Query:  AVHYDDPALSALPLLSQAIIHQTSVAIVDSEGKLIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGDPPEDLVQLVKDRLEERNLGALLEWVEEESA
        AVHYDDPALSALPL+SQAIIHQ+SVAIV+S+GKLIGEISPLTLNS DETI AAIVTLSAGELMAYV C DPPEDLVQLVKDRLEERNL  LLEWVEEESA
Subjt:  AVHYDDPALSALPLLSQAIIHQTSVAIVDSEGKLIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGDPPEDLVQLVKDRLEERNLGALLEWVEEESA

Query:  SVLSSCSSFYSSSSDDDSGSTWGRSGKLGRCSTRQV-RSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEEDGSLVGIITFASMLKVFRERLKSMC
          +S+CSSF SSSSDDDSGS WGRSGKL +CSTRQV RSSE AVCNP+SSLVAVMIQALA RVPYMWV EEDG LVGI TF SMLKVF ERLKSMC
Subjt:  SVLSSCSSFYSSSSDDDSGSTWGRSGKLGRCSTRQV-RSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEEDGSLVGIITFASMLKVFRERLKSMC

A0A5D3CRU7 CBS domain-containing protein CBSX5-like1.4e-18185.86Show/hide
Query:  MAVRLFDHHISDICLGKPALRSISLSATLADALSALKKLGENYISVWSCASHSSKSASHHDCRCVAKISVLDVILFLCNEDNLSQPAVALQSSVSVLIPQ
        MAVRL DHH+SDICLGKPAL SISLSATLADALSALKKLGENYISVW+C+SH SKS+SH+DC+C+ KISVLDV+LFLC E+NLSQPAVALQSSVSVLIP 
Subjt:  MAVRLFDHHISDICLGKPALRSISLSATLADALSALKKLGENYISVWSCASHSSKSASHHDCRCVAKISVLDVILFLCNEDNLSQPAVALQSSVSVLIPQ

Query:  GHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRERVLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSPTSITPINSLDAIDTANIL
          VLV HLEPHASLVE I+LLLEG QNLVVPIQTRTS KSRE+VLE VAPFDCPL+N LEYCW+TQEDIIRYLLNSIGLFSPTSITPINSL+AIDTANIL
Subjt:  GHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRERVLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSPTSITPINSLDAIDTANIL

Query:  AVHYDDPALSALPLLSQAIIHQTSVAIVDSEGKLIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGDPPEDLVQLVKDRLEERNLGALLEWVEEESA
        AVHYDDPALSALPL+SQAIIHQ+SVAIV+S+GKLIGEISPLTLNS DETI AAIVTLSAGELMAYV C DPPEDLVQLVKDRLEERNL  LLEWVEEE A
Subjt:  AVHYDDPALSALPLLSQAIIHQTSVAIVDSEGKLIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGDPPEDLVQLVKDRLEERNLGALLEWVEEESA

Query:  SVLSSCSSFYSSSSDDDSGSTWGRSGKLGRCSTRQV-RSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEEDGSLVGIITFASMLKVFRERLKSMC
          +S+CSSF SSSSDDDSGS WGRSGKL +CSTRQV RSSE AVCNP+SSLVAVMIQALA RVPYMWV EEDG LVGI TF SMLKVF ERLKSMC
Subjt:  SVLSSCSSFYSSSSDDDSGSTWGRSGKLGRCSTRQV-RSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEEDGSLVGIITFASMLKVFRERLKSMC

A0A6J1EG28 CBS domain-containing protein CBSX5-like2.5e-18384.3Show/hide
Query:  MAVRLFDHHISDICLGKPALRSISLSATLADALSALKKLGENYISVWSCASHSSKSASHHDCRCVAKISVLDVILFLCNEDNLSQPAVALQSSVSVLIPQ
        MAVRL  H +SDICLGKPALRSIS+SATLADALSALK+LGENYISVW+CA HSSKS S  DCRCV KISV+DV+ FLC E+NLSQPAVAL SS+SV+IP+
Subjt:  MAVRLFDHHISDICLGKPALRSISLSATLADALSALKKLGENYISVWSCASHSSKSASHHDCRCVAKISVLDVILFLCNEDNLSQPAVALQSSVSVLIPQ

Query:  GHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRERVLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSPTSITPINSLDAIDTANIL
          V+VRHL+PHASLVEAI+LLLEG QNLVVPIQTRT  KSR++VL+EVAPFDC L+N+LEYCWLTQEDIIRYLLNSIGLFSPTSITPINSL+AIDTANIL
Subjt:  GHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRERVLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSPTSITPINSLDAIDTANIL

Query:  AVHYDDPALSALPLLSQAIIHQTSVAIVDSEGKLIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGDPPEDLVQLVKDRLEERNLGALLEWVEEESA
        A+HYDDPALSALPLLSQAIIHQ+++AIVDS+GKLIGEISPLTLNSCDET+ AAIVTLSAGELMAYVDCGDPPEDL+QLVKDRLEERNLGALLEWVEE+SA
Subjt:  AVHYDDPALSALPLLSQAIIHQTSVAIVDSEGKLIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGDPPEDLVQLVKDRLEERNLGALLEWVEEESA

Query:  SVLSSCSSFYSSSSDDDSGSTWGRSGKLGRCSTRQVRSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEEDGSLVGIITFASMLKVFRERLKSMC
          +SSCSSFYS+SSDDDSGS+ GRSGK G+ S RQVRSSE A CNPRSSLVAVMIQALAHRVPY+WV+E+DG LVGIITF SMLKVFRERLKSMC
Subjt:  SVLSSCSSFYSSSSDDDSGSTWGRSGKLGRCSTRQVRSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEEDGSLVGIITFASMLKVFRERLKSMC

A0A6J1KLQ5 CBS domain-containing protein CBSX5-like8.0e-18284.05Show/hide
Query:  MAVRLFDHHISDICLGKPALRSISLSATLADALSALKKLGENYISVWSCASHSSKSASHHDCRCVAKISVLDVILFLCNEDNLSQPAVALQSSVSVLIPQ
        MAVRL  H +SDICLGKPALRSIS+SATLADALSALK+LGENYISVW+CA HSSKS S  DCRCV KISV+DV+ FLC E+NL QPAVAL SS+SV+IP+
Subjt:  MAVRLFDHHISDICLGKPALRSISLSATLADALSALKKLGENYISVWSCASHSSKSASHHDCRCVAKISVLDVILFLCNEDNLSQPAVALQSSVSVLIPQ

Query:  GHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRERVLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSPTSITPINSLDAIDTANIL
          V+VRHL+PHASLVEAI+LLLEG QNLVVPIQTRT  KSR++VL+EVAPFDC L+N+LEYCWLTQEDIIRYLLNSIGLFSPTSITPINSL+AIDTANIL
Subjt:  GHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRERVLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSPTSITPINSLDAIDTANIL

Query:  AVHYDDPALSALPLLSQAIIHQTSVAIVDSEGKLIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGDPPEDLVQLVKDRLEERNLGALLEWVEEESA
        A+HYDDPALSALPLLSQAIIHQ+++AIVDS+GKLIGEISP TLNSC+ET+ AAI TLSAGELMAYVDCGDPPEDLVQLVKDRLEERNLGALLEWVEE+SA
Subjt:  AVHYDDPALSALPLLSQAIIHQTSVAIVDSEGKLIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGDPPEDLVQLVKDRLEERNLGALLEWVEEESA

Query:  SVLSSCSSFYSSSSDDDSGSTWGRSGKLGRCSTRQVRSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEEDGSLVGIITFASMLKVFRERLKSMC
          +SSCSSFYS+SSDDDSGS+ GRSGK GR S RQVRSSEAAVCNPRSSLVAVMIQALAHRVPY+WV+E+DG LVGIITF SMLKVFRERL+SMC
Subjt:  SVLSSCSSFYSSSSDDDSGSTWGRSGKLGRCSTRQVRSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEEDGSLVGIITFASMLKVFRERLKSMC

SwissProt top hitse value%identityAlignment
Q84WQ5 CBS domain-containing protein CBSX53.1e-8244.5Show/hide
Query:  MAVRLFDHHISDICLGKPALRSI-SLSATLADALSALKKLGENYISVWSCASHSSKSASHHDCRCVAKISVLDVILFLCNEDNLSQPAVALQSSVSVLIP
        MA+ L  +++SD+CLGKP LR + S S++++DA++ALK   + ++SVW+C   +    ++ +C C+ KIS+ DVI  L  + + S    AL SSVSVL+P
Subjt:  MAVRLFDHHISDICLGKPALRSI-SLSATLADALSALKKLGENYISVWSCASHSSKSASHHDCRCVAKISVLDVILFLCNEDNLSQPAVALQSSVSVLIP

Query:  QGHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRERVLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSPTSITPINSLDAID-TAN
        +   +V H++P  SL+EAI+L+++G QNL+VPI T+   K ++   + V+       N   +CW+TQEDII++LL  I  FSP     ++ L  I+ T  
Subjt:  QGHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRERVLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSPTSITPINSLDAID-TAN

Query:  ILAVHYDDPALSALPLLSQAIIHQTSVAIVDSEG-----KLIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGDPPEDLVQLVKDRLEERNLGALLE
        ++AV Y   A + +  +S A+  QTSVA+VD EG      LIGEISP+TL  CDET AAA+ TLSAG+LMAY+D  +PPE LVQ+V++RLE++ L  L+ 
Subjt:  ILAVHYDDPALSALPLLSQAIIHQTSVAIVDSEG-----KLIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGDPPEDLVQLVKDRLEERNLGALLE

Query:  WVEEESASVLSSCSSFYSSSSDDDSGSTWGRSGKLGRCSTRQVRSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEEDGSLVGIITFASMLKVFRERLKS
          +      LSS S+    SS++++       G+    S R  R SEA VCNP+SSL+AVMIQA+AHRV Y WV+E+DG  VG++TF  +LKVFR+ L++
Subjt:  WVEEESASVLSSCSSFYSSSSDDDSGSTWGRSGKLGRCSTRQVRSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEEDGSLVGIITFASMLKVFRERLKS

Q8GXI9 SNF1-related protein kinase regulatory subunit gamma-like PV42b1.1e-0519.47Show/hide
Query:  RLFDHHISDICLGKPALRSISLSATLADALSALKKLGENYISV------WSCASHS-----SKSASHHDCRCVAKISVLDVILFLCNED---NLSQPAVA
        RL    + D+ + K  L  +  +ATL DAL+ +       + V      W  A  S      K +     + +  +++LDV+  +  +D    L +   A
Subjt:  RLFDHHISDICLGKPALRSISLSATLADALSALKKLGENYISV------WSCASHS-----SKSASHHDCRCVAKISVLDVILFLCNED---NLSQPAVA

Query:  LQSSVSVLIPQGHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRERVLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSPTSITPIN
          SS+    P+G + +  L P+ S+++ + +L +G+  ++VP+ + T   +   ++E  +           Y  L+Q D+I +  +            + 
Subjt:  LQSSVSVLIPQGHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRERVLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSPTSITPIN

Query:  SLDAIDTANILAVHYDDPALSALPLLSQAIIHQTSVAIVDSEG------------KLIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGDPPEDLVQ
         L AI    +LA+        A+  +S A+++   +     EG            +++G  S   L  C      + + L+A                  
Subjt:  SLDAIDTANILAVHYDDPALSALPLLSQAIIHQTSVAIVDSEG------------KLIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGDPPEDLVQ

Query:  LVKDRLEERNLGALLEWVEEESASVLSSCSSFYSSSSDDDSGSTWGRSGKLGRCSTRQVRSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEEDGSLVGI
                      LE+VE+   ++L + ++           ST GR               E   C+  S+L  V+      RV  +WV++++G L G+
Subjt:  LVKDRLEERNLGALLEWVEEESASVLSSCSSFYSSSSDDDSGSTWGRSGKLGRCSTRQVRSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEEDGSLVGI

Query:  ITFASMLKVFRERLKS
        ++   ++ V R  L S
Subjt:  ITFASMLKVFRERLKS

Q8GZA4 CBS domain-containing protein CBSX68.3e-1925.83Show/hide
Query:  MAVRLFDHHISDICLGKPALRSISLSATLADALSALKKLGENYISVWSCASHSS-----KSASHHDCRCVAKISVLDVILFLCNEDNLSQPAVALQSSVS
        MA     H + D+ +GKP +     + T+  A+ A+ +  E  I VW   +  S     +++     R V  ++ LD++ FL   + L Q   A++  VS
Subjt:  MAVRLFDHHISDICLGKPALRSISLSATLADALSALKKLGENYISVWSCASHSS-----KSASHHDCRCVAKISVLDVILFLCNEDNLSQPAVALQSSVS

Query:  VLIPQGHVLVRHLEPHASLVEAINLLLEGVQNLVVP-------IQTRTSVKSRERVLEEV---------------APFDCPLYNDLEYCWLTQEDIIRYL
         ++   + L++ ++P   L++A+ ++ +GV+ L+VP       +  R S+    + L+                  P      +  ++C L++ED+IR+L
Subjt:  VLIPQGHVLVRHLEPHASLVEAINLLLEGVQNLVVP-------IQTRTSVKSRERVLEEV---------------APFDCPLYNDLEYCWLTQEDIIRYL

Query:  LNSIGLFSPTSITPINSLDAIDTANILAVHYDDPALSAL--PLLSQAIIHQTSVAIVDSEGKLIGEISPLTLNSCDETIAA-AIVTLSAGELMAYVDCGD
        +  +G  +P  +T I++L  I+  N   +    PA+ A   PL   + I        + + K+IGEIS   L  CD   AA A+  L AG+         
Subjt:  LNSIGLFSPTSITPINSLDAIDTANILAVHYDDPALSAL--PLLSQAIIHQTSVAIVDSEGKLIGEISPLTLNSCDETIAA-AIVTLSAGELMAYVDCGD

Query:  PPEDLVQLVKDRLEERNLGALLE--WVEEESASVLSSCSSFYSSSSDDDSGS----TWGRSGKLGRCSTRQVRSSEAAVCNPRSSLVAVMIQALAHRVPY
             V  V+D +  R+    L+  +   E     ++   F S S   +  S    + GRS   GR        S    C   SSL AVM Q L+HR  +
Subjt:  PPEDLVQLVKDRLEERNLGALLE--WVEEESASVLSSCSSFYSSSSDDDSGS----TWGRSGKLGRCSTRQVRSSEAAVCNPRSSLVAVMIQALAHRVPY

Query:  MWVIEEDGS--LVGIITFASML
        +WV E D    LVG++ +  +L
Subjt:  MWVIEEDGS--LVGIITFASML

Arabidopsis top hitse value%identityAlignment
AT1G65320.1 Cystathionine beta-synthase (CBS) family protein5.9e-2025.83Show/hide
Query:  MAVRLFDHHISDICLGKPALRSISLSATLADALSALKKLGENYISVWSCASHSS-----KSASHHDCRCVAKISVLDVILFLCNEDNLSQPAVALQSSVS
        MA     H + D+ +GKP +     + T+  A+ A+ +  E  I VW   +  S     +++     R V  ++ LD++ FL   + L Q   A++  VS
Subjt:  MAVRLFDHHISDICLGKPALRSISLSATLADALSALKKLGENYISVWSCASHSS-----KSASHHDCRCVAKISVLDVILFLCNEDNLSQPAVALQSSVS

Query:  VLIPQGHVLVRHLEPHASLVEAINLLLEGVQNLVVP-------IQTRTSVKSRERVLEEV---------------APFDCPLYNDLEYCWLTQEDIIRYL
         ++   + L++ ++P   L++A+ ++ +GV+ L+VP       +  R S+    + L+                  P      +  ++C L++ED+IR+L
Subjt:  VLIPQGHVLVRHLEPHASLVEAINLLLEGVQNLVVP-------IQTRTSVKSRERVLEEV---------------APFDCPLYNDLEYCWLTQEDIIRYL

Query:  LNSIGLFSPTSITPINSLDAIDTANILAVHYDDPALSAL--PLLSQAIIHQTSVAIVDSEGKLIGEISPLTLNSCDETIAA-AIVTLSAGELMAYVDCGD
        +  +G  +P  +T I++L  I+  N   +    PA+ A   PL   + I        + + K+IGEIS   L  CD   AA A+  L AG+         
Subjt:  LNSIGLFSPTSITPINSLDAIDTANILAVHYDDPALSAL--PLLSQAIIHQTSVAIVDSEGKLIGEISPLTLNSCDETIAA-AIVTLSAGELMAYVDCGD

Query:  PPEDLVQLVKDRLEERNLGALLE--WVEEESASVLSSCSSFYSSSSDDDSGS----TWGRSGKLGRCSTRQVRSSEAAVCNPRSSLVAVMIQALAHRVPY
             V  V+D +  R+    L+  +   E     ++   F S S   +  S    + GRS   GR        S    C   SSL AVM Q L+HR  +
Subjt:  PPEDLVQLVKDRLEERNLGALLE--WVEEESASVLSSCSSFYSSSSDDDSGS----TWGRSGKLGRCSTRQVRSSEAAVCNPRSSLVAVMIQALAHRVPY

Query:  MWVIEEDGS--LVGIITFASML
        +WV E D    LVG++ +  +L
Subjt:  MWVIEEDGS--LVGIITFASML

AT1G80090.1 Cystathionine beta-synthase (CBS) family protein5.8e-0719.39Show/hide
Query:  RLFDHHISDICLGKPALRSISLSATLADALSALKKLGENYIS-------------VWSCASHS-----SKSASHHDCRCVAKISVLDVILFLCNED---N
        RL    + D+ + K  L  +  +ATL DAL+ +  LG   ++              W  A  S      K +     + +  +++LDV+  +  +D    
Subjt:  RLFDHHISDICLGKPALRSISLSATLADALSALKKLGENYIS-------------VWSCASHS-----SKSASHHDCRCVAKISVLDVILFLCNED---N

Query:  LSQPAVALQSSVSVLIPQGHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRERVLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSP
        L +   A  SS+    P+G + +  L P+ S+++ + +L +G+  ++VP+ + T   +   ++E  +           Y  L+Q D+I +  +       
Subjt:  LSQPAVALQSSVSVLIPQGHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRERVLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSP

Query:  TSITPINSLDAIDTANILAVHYDDPALSALPLLSQAIIHQTSVAIVDSEG------------KLIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGD
             +  L AI    +LA+        A+  +S A+++   +     EG            +++G  S   L  C      + + L+A           
Subjt:  TSITPINSLDAIDTANILAVHYDDPALSALPLLSQAIIHQTSVAIVDSEG------------KLIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGD

Query:  PPEDLVQLVKDRLEERNLGALLEWVEEESASVLSSCSSFYSSSSDDDSGSTWGRSGKLGRCSTRQVRSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEE
                             LE+VE+   ++L + ++           ST GR               E   C+  S+L  V+      RV  +WV+++
Subjt:  PPEDLVQLVKDRLEERNLGALLEWVEEESASVLSSCSSFYSSSSDDDSGSTWGRSGKLGRCSTRQVRSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEE

Query:  DGSLVGIITFASMLKVFRERLKS
        +G L G+++   ++ V R  L S
Subjt:  DGSLVGIITFASMLKVFRERLKS

AT4G27460.1 Cystathionine beta-synthase (CBS) family protein2.2e-8344.5Show/hide
Query:  MAVRLFDHHISDICLGKPALRSI-SLSATLADALSALKKLGENYISVWSCASHSSKSASHHDCRCVAKISVLDVILFLCNEDNLSQPAVALQSSVSVLIP
        MA+ L  +++SD+CLGKP LR + S S++++DA++ALK   + ++SVW+C   +    ++ +C C+ KIS+ DVI  L  + + S    AL SSVSVL+P
Subjt:  MAVRLFDHHISDICLGKPALRSI-SLSATLADALSALKKLGENYISVWSCASHSSKSASHHDCRCVAKISVLDVILFLCNEDNLSQPAVALQSSVSVLIP

Query:  QGHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRERVLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSPTSITPINSLDAID-TAN
        +   +V H++P  SL+EAI+L+++G QNL+VPI T+   K ++   + V+       N   +CW+TQEDII++LL  I  FSP     ++ L  I+ T  
Subjt:  QGHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRERVLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSPTSITPINSLDAID-TAN

Query:  ILAVHYDDPALSALPLLSQAIIHQTSVAIVDSEG-----KLIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGDPPEDLVQLVKDRLEERNLGALLE
        ++AV Y   A + +  +S A+  QTSVA+VD EG      LIGEISP+TL  CDET AAA+ TLSAG+LMAY+D  +PPE LVQ+V++RLE++ L  L+ 
Subjt:  ILAVHYDDPALSALPLLSQAIIHQTSVAIVDSEG-----KLIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGDPPEDLVQLVKDRLEERNLGALLE

Query:  WVEEESASVLSSCSSFYSSSSDDDSGSTWGRSGKLGRCSTRQVRSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEEDGSLVGIITFASMLKVFRERLKS
          +      LSS S+    SS++++       G+    S R  R SEA VCNP+SSL+AVMIQA+AHRV Y WV+E+DG  VG++TF  +LKVFR+ L++
Subjt:  WVEEESASVLSSCSSFYSSSSDDDSGSTWGRSGKLGRCSTRQVRSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEEDGSLVGIITFASMLKVFRERLKS

AT5G53750.1 CBS domain-containing protein2.9e-8344.71Show/hide
Query:  MAVRLFDHHISDICLGKPALRSISL-SATLADALSALKKLGENYISVWSCASHSSKSASHHDCRCVAKISVLDVILFLCN-EDNLSQPAVALQSSVSVLI
        MA+ L  H +SD+C+GKP LR +S+ +AT+ADA++ALK   E +++VWSC +H  K+  +  C C+ KI + DVI +L   ++N+   + A  +SVSVL+
Subjt:  MAVRLFDHHISDICLGKPALRSISL-SATLADALSALKKLGENYISVWSCASHSSKSASHHDCRCVAKISVLDVILFLCN-EDNLSQPAVALQSSVSVLI

Query:  PQGHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRER--------VLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSPTSITPINS
        P+   LV H++   SL+EAI+L+++G QNL+VPI T++  K R++        V+           N  E+CW+TQEDIIR+LL+SI +FSP     I+ 
Subjt:  PQGHVLVRHLEPHASLVEAINLLLEGVQNLVVPIQTRTSVKSRER--------VLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSPTSITPINS

Query:  LDAID-TANILAVHYDDPALSALPLLSQAIIHQTSVAIV----DSEGK---LIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGDPPEDLVQLVKDR
        L  I+ T  ILAV Y   A SA+  +S+AI+   SVA+V    D E     LIGEISP+TL  CDET  AA+ TLSAG+LM+Y+D   PPE LV +V++R
Subjt:  LDAID-TANILAVHYDDPALSALPLLSQAIIHQTSVAIV----DSEGK---LIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGDPPEDLVQLVKDR

Query:  LEERNLGALLEWVEEESASVLSSCSSFYSSSSDDDS-------GSTWGRSGKLGRCSTRQVRSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEEDGSLV
        LE++ +  L+        S++ S S    SSSD++S        S++GRS      + R  R S A VCN +SSL+AVMIQA+AHRV Y+WVI+EDG L+
Subjt:  LEERNLGALLEWVEEESASVLSSCSSFYSSSSDDDS-------GSTWGRSGKLGRCSTRQVRSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEEDGSLV

Query:  GIITFASMLKVFRERL
        G++TF  +LK+FRE L
Subjt:  GIITFASMLKVFRERL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGTGAGGTTGTTTGATCATCATATCTCTGACATATGCCTTGGAAAGCCTGCGTTGAGGTCCATCTCTCTTTCCGCCACACTCGCCGACGCCCTCTCTGCTCTCAA
AAAGCTCGGTGAAAACTACATCAGCGTCTGGAGTTGCGCCTCCCACTCCTCCAAATCCGCCTCCCACCACGATTGTCGATGTGTTGCCAAGATTTCCGTTCTTGATGTTA
TCTTATTCTTATGTAACGAAGACAATCTCTCCCAACCTGCCGTCGCGCTTCAATCTTCTGTTTCGGTTCTTATTCCTCAGGGTCATGTTCTCGTTAGACATTTAGAGCCT
CATGCTAGTTTGGTGGAAGCCATAAATCTCCTCCTCGAAGGTGTGCAAAACCTCGTAGTTCCAATTCAAACTAGAACCTCTGTGAAGTCCAGAGAGAGGGTTCTTGAAGA
AGTTGCTCCCTTCGACTGTCCGCTTTATAATGACCTTGAATATTGTTGGCTCACCCAAGAGGACATAATCCGTTACCTCCTCAACTCTATTGGGCTTTTCAGCCCCACCT
CCATCACTCCCATTAATTCCCTCGACGCCATCGACACGGCCAACATTCTTGCTGTACACTACGACGATCCTGCACTTTCTGCCCTGCCCCTCCTTTCTCAAGCCATCATC
CACCAAACCTCCGTTGCAATTGTTGACTCAGAAGGGAAGTTGATCGGAGAAATCTCACCCCTCACTTTGAATTCTTGCGATGAAACCATTGCGGCTGCGATTGTAACACT
CTCAGCTGGTGAGCTAATGGCGTATGTGGACTGTGGTGACCCGCCGGAGGATTTGGTTCAGTTGGTCAAAGACAGACTGGAAGAGAGAAACCTAGGGGCACTATTGGAGT
GGGTAGAAGAAGAGTCGGCATCAGTATTGTCATCATGTTCTTCATTTTACTCATCTTCTTCAGATGATGATTCTGGCTCTACCTGGGGAAGGAGTGGAAAGTTAGGAAGA
TGTTCAACCAGGCAAGTGAGGAGCTCAGAGGCTGCTGTGTGTAATCCCCGGAGTTCATTGGTGGCGGTGATGATTCAAGCTCTTGCGCATCGTGTTCCATATATGTGGGT
GATCGAAGAAGATGGAAGCTTGGTTGGCATTATCACATTTGCGTCGATGTTGAAGGTTTTCCGCGAACGTTTGAAATCAATGTGTTAA
mRNA sequenceShow/hide mRNA sequence
AGGAACTAACTCACGAGCTCAAAATCTCTCTCTAGAATCGTCTTCAACCTCAGTTTCGCATCCATTTTCTCTCTCTTCCGTGTTCTTCTTTCCTCTTTGTGCATGGCAGT
GAGGTTGTTTGATCATCATATCTCTGACATATGCCTTGGAAAGCCTGCGTTGAGGTCCATCTCTCTTTCCGCCACACTCGCCGACGCCCTCTCTGCTCTCAAAAAGCTCG
GTGAAAACTACATCAGCGTCTGGAGTTGCGCCTCCCACTCCTCCAAATCCGCCTCCCACCACGATTGTCGATGTGTTGCCAAGATTTCCGTTCTTGATGTTATCTTATTC
TTATGTAACGAAGACAATCTCTCCCAACCTGCCGTCGCGCTTCAATCTTCTGTTTCGGTTCTTATTCCTCAGGGTCATGTTCTCGTTAGACATTTAGAGCCTCATGCTAG
TTTGGTGGAAGCCATAAATCTCCTCCTCGAAGGTGTGCAAAACCTCGTAGTTCCAATTCAAACTAGAACCTCTGTGAAGTCCAGAGAGAGGGTTCTTGAAGAAGTTGCTC
CCTTCGACTGTCCGCTTTATAATGACCTTGAATATTGTTGGCTCACCCAAGAGGACATAATCCGTTACCTCCTCAACTCTATTGGGCTTTTCAGCCCCACCTCCATCACT
CCCATTAATTCCCTCGACGCCATCGACACGGCCAACATTCTTGCTGTACACTACGACGATCCTGCACTTTCTGCCCTGCCCCTCCTTTCTCAAGCCATCATCCACCAAAC
CTCCGTTGCAATTGTTGACTCAGAAGGGAAGTTGATCGGAGAAATCTCACCCCTCACTTTGAATTCTTGCGATGAAACCATTGCGGCTGCGATTGTAACACTCTCAGCTG
GTGAGCTAATGGCGTATGTGGACTGTGGTGACCCGCCGGAGGATTTGGTTCAGTTGGTCAAAGACAGACTGGAAGAGAGAAACCTAGGGGCACTATTGGAGTGGGTAGAA
GAAGAGTCGGCATCAGTATTGTCATCATGTTCTTCATTTTACTCATCTTCTTCAGATGATGATTCTGGCTCTACCTGGGGAAGGAGTGGAAAGTTAGGAAGATGTTCAAC
CAGGCAAGTGAGGAGCTCAGAGGCTGCTGTGTGTAATCCCCGGAGTTCATTGGTGGCGGTGATGATTCAAGCTCTTGCGCATCGTGTTCCATATATGTGGGTGATCGAAG
AAGATGGAAGCTTGGTTGGCATTATCACATTTGCGTCGATGTTGAAGGTTTTCCGCGAACGTTTGAAATCAATGTGTTAAAAGGACATCAGCTAGAGCTTTGCCCTCATA
TTAAGCTGAAACTATAATAATGTCTACATTGCGTTATTTTGCTCACTTTATTTTGGAGTATAATGTAAATGTGTAACCAGTAGAGACTATCTTCTTTTGCAAGTTCAAT
Protein sequenceShow/hide protein sequence
MAVRLFDHHISDICLGKPALRSISLSATLADALSALKKLGENYISVWSCASHSSKSASHHDCRCVAKISVLDVILFLCNEDNLSQPAVALQSSVSVLIPQGHVLVRHLEP
HASLVEAINLLLEGVQNLVVPIQTRTSVKSRERVLEEVAPFDCPLYNDLEYCWLTQEDIIRYLLNSIGLFSPTSITPINSLDAIDTANILAVHYDDPALSALPLLSQAII
HQTSVAIVDSEGKLIGEISPLTLNSCDETIAAAIVTLSAGELMAYVDCGDPPEDLVQLVKDRLEERNLGALLEWVEEESASVLSSCSSFYSSSSDDDSGSTWGRSGKLGR
CSTRQVRSSEAAVCNPRSSLVAVMIQALAHRVPYMWVIEEDGSLVGIITFASMLKVFRERLKSMC