; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi02G007420 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi02G007420
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionCBS domain-containing protein CBSX5-like
Genome locationchr02:6814670..6817396
RNA-Seq ExpressionLsi02G007420
SyntenyLsi02G007420
Gene Ontology termsGO:0006468 - protein phosphorylation (biological process)
GO:0042149 - cellular response to glucose starvation (biological process)
GO:0050790 - regulation of catalytic activity (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0031588 - nucleotide-activated protein kinase complex (cellular component)
GO:0016208 - AMP binding (molecular function)
GO:0019887 - protein kinase regulator activity (molecular function)
GO:0019901 - protein kinase binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004146648.1 CBS domain-containing protein CBSX5 [Cucumis sativus]5.1e-19189.95Show/hide
Query:  MAASLLACEVSDLCLGKPALRSISISATVADALSALTRIDEGYISVWSCGDHSSFKADLDLQCRCVGKVCMVDIICFLCRQENLLQPAIALRSPISVLIH
        MAASLLACEVSDLCLGKPALRSISISAT+ADALS LT+IDEGYISVWSCGDHSS KAD DL CRCVGKVCMVDIICFLCRQENLLQPAI L+SPISVLI 
Subjt:  MAASLLACEVSDLCLGKPALRSISISATVADALSALTRIDEGYISVWSCGDHSSFKADLDLQCRCVGKVCMVDIICFLCRQENLLQPAIALRSPISVLIH

Query:  EGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKKSSTNSISSLHNDREYCWLAPEDIIRYLLNSIGLFSPTAASPINSFNIIDTNNVL
        EG  LVRHLEPHASLMEAIDLIHDGVHNLVIPIK S SK+KN LKKS  NSISSLHND+EYCWLAPEDIIRYLLNSIGLFS TAA+PINSFNIIDTNN+L
Subjt:  EGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKKSSTNSISSLHNDREYCWLAPEDIIRYLLNSIGLFSPTAASPINSFNIIDTNNVL

Query:  AVHYDESALSILPLISQAQIHQSSVAIVDLDEKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEESS
        AV YDESALSILPLISQA IHQSSVAIVDLD+KLIGEISPFTLNFCDETV AAIATLTAGELM YIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEES 
Subjt:  AVHYDESALSILPLISQAQIHQSSVAIVDLDEKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEESS

Query:  TI-SSSSSICSSSDDEFG-----SGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLAVFRDRLKSL
        TI SSSSSICSSSDDEFG     SGSGRSG+I GYSARV+RRSEAIVCYPWNSLVAVMIQALAHRVSY+WVIQEDGTLAGTVTF S+LAVFRDRLK L
Subjt:  TI-SSSSSICSSSDDEFG-----SGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLAVFRDRLKSL

XP_008442367.1 PREDICTED: CBS domain-containing protein CBSX5-like [Cucumis melo]9.6e-19088.94Show/hide
Query:  MAASLLACEVSDLCLGKPALRSISISATVADALSALTRIDEGYISVWSCGDHSSFKADLDLQCRCVGKVCMVDIICFLCRQENLLQPAIALRSPISVLIH
        MAA+LL CEVSDLCLGKPALRSISISATVADALS LT+IDEGYISVWSCGDHSS KAD DL CRCVGKVCMVDIICFLCRQENLLQPAI L+SPISVLI 
Subjt:  MAASLLACEVSDLCLGKPALRSISISATVADALSALTRIDEGYISVWSCGDHSSFKADLDLQCRCVGKVCMVDIICFLCRQENLLQPAIALRSPISVLIH

Query:  EGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKKSSTNSISSLHNDREYCWLAPEDIIRYLLNSIGLFSPTAASPINSFNIIDTNNVL
        EG  LV+HLEPHASLMEAIDLIHDG+HNLVIPIK STS++KN LKKS  NSISSLHND+EYCWLAPEDIIRYLLNSIGLFS TAA+PINS NIIDTNN+L
Subjt:  EGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKKSSTNSISSLHNDREYCWLAPEDIIRYLLNSIGLFSPTAASPINSFNIIDTNNVL

Query:  AVHYDESALSILPLISQAQIHQSSVAIVDLDEKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEESS
        AV YDESALSILPLISQA IHQSSVAIVDLD+KLIGEISP TLNFCDETV AAIATLTAGE MAYIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEES 
Subjt:  AVHYDESALSILPLISQAQIHQSSVAIVDLDEKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEESS

Query:  TI-SSSSSICSSSDDEFG-----SGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLAVFRDRLKSL
        TI SSSSSICSSSDDEFG     +GSGRSG+I GYSARV+RRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQ DGTLAGTVTFAS+LAVFRDRLKSL
Subjt:  TI-SSSSSICSSSDDEFG-----SGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLAVFRDRLKSL

XP_022139821.1 CBS domain-containing protein CBSX5-like [Momordica charantia]3.6e-18991.09Show/hide
Query:  MAASLLACEVSDLCLGKPALRSISISATVADALSALTRIDEGYISVWSCGDHSSFKADLDLQCRCVGKVCMVDIICFLCRQENLLQPAIALRSPISVLIH
        MA SLLA EVSDLCLGKPALRSISISATVADALS LTRIDEGYISVWSCGDHSS KA+ +L CRCVGKVCMVDIICFLCRQENLLQPA ALRSPIS LIH
Subjt:  MAASLLACEVSDLCLGKPALRSISISATVADALSALTRIDEGYISVWSCGDHSSFKADLDLQCRCVGKVCMVDIICFLCRQENLLQPAIALRSPISVLIH

Query:  EGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKKSSTNSISSLHNDREYCWLAPEDIIRYLLNSIGLFSPTAASPINSFNIIDTNNVL
        E +GLVRHLEPH+SLMEAIDLI DG HNLVIPIK+STSK+KNFLKK+S  SISSLHNDREYCWLAPEDIIRYLLNSIGLFS  AASPINSFNIIDTNNVL
Subjt:  EGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKKSSTNSISSLHNDREYCWLAPEDIIRYLLNSIGLFSPTAASPINSFNIIDTNNVL

Query:  AVHYDESALSILPLISQAQIHQSSVAIVDLDEKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEESS
        AVHYDESALSILPLISQA IHQSSVAIVDLD+ LIGEISPFTLNFCDETV AAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEESS
Subjt:  AVHYDESALSILPLISQAQIHQSSVAIVDLDEKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEESS

Query:  TI-SSSSSICSSSDDEFGSGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLAVFRDRLKSL
        T  SSSSSICSSSDDEFGSGSGRSGKIGGYSARV RRSEAIVCYPWNSLVAVMIQALAHRV YVWVI+EDG LAG VTFASML+VFRDRLKS+
Subjt:  TI-SSSSSICSSSDDEFGSGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLAVFRDRLKSL

XP_022954486.1 CBS domain-containing protein CBSX5-like [Cucurbita moschata]2.4e-18889.03Show/hide
Query:  MAASLLACEVSDLCLGKPALRSISISATVADALSALTRIDEGYISVWSCGDHSSFKADLDLQCRCVGKVCMVDIICFLCRQENLLQPAIALRSPISVLIH
        MA SLL+ ++SDLCLGKPA+RSISISATVADALS LTRIDEG ISVWSCGDHSS +AD DL CRCVGKVCMVDIICFLCRQENLLQPAIALRSPISVLI 
Subjt:  MAASLLACEVSDLCLGKPALRSISISATVADALSALTRIDEGYISVWSCGDHSSFKADLDLQCRCVGKVCMVDIICFLCRQENLLQPAIALRSPISVLIH

Query:  EGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKKSSTNSISSLHNDREYCWLAPEDIIRYLLNSIGLFSPTAASPINSFNIIDTNNVL
        EGL LVRHLEPHASLMEAIDLI DGVHNLVIPI  S SK++NFL+ +++NSISSLHNDR+YCWLAPEDIIRYLLNSIGLFSPTAASPINS NII+TNN+L
Subjt:  EGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKKSSTNSISSLHNDREYCWLAPEDIIRYLLNSIGLFSPTAASPINSFNIIDTNNVL

Query:  AVHYDESALSILPLISQAQIHQSSVAIVDLDEKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEESS
         VHYD+SALSILPLISQA IHQSSVAIVDLD+KL+GEISPFTLNFCDET+ AAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLE+VEEESS
Subjt:  AVHYDESALSILPLISQAQIHQSSVAIVDLDEKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEESS

Query:  TISSSSSICSSSDDEFGSGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLAVFRDRLKSL
         ISSSSSICS SDDEFG GSGRS KIGGYSARVLRRSEAIVC+PWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLAVFRDRLKS+
Subjt:  TISSSSSICSSSDDEFGSGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLAVFRDRLKSL

XP_038893760.1 CBS domain-containing protein CBSX5-like [Benincasa hispida]3.0e-19992.68Show/hide
Query:  MAASLLACEVSDLCLGKPALRSISISATVADALSALTRIDEGYISVWSCGDHSSFKADLDLQCRCVGKVCMVDIICFLCRQENLLQPAIALRSPISVLIH
        MAASLLACEVSDLCLGKPALRSISISATVADALS LTRIDEGYIS+WSCGDHSS K D DL CRCVGKVCMVDIICFLCRQENLLQPA+ALRSPISVLI 
Subjt:  MAASLLACEVSDLCLGKPALRSISISATVADALSALTRIDEGYISVWSCGDHSSFKADLDLQCRCVGKVCMVDIICFLCRQENLLQPAIALRSPISVLIH

Query:  EGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKKSSTNSISSLHNDREYCWLAPEDIIRYLLNSIGLFSPTAASPINSFNIIDTNNVL
        EGLGLVRHLEPH SLMEAIDLIH+GVHNLVIPIK STSK+K+FLKKSS+NSISSLHND+EYCWLAPEDIIRYLLNSIGLFSP AA+PI+SFNIIDT N+L
Subjt:  EGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKKSSTNSISSLHNDREYCWLAPEDIIRYLLNSIGLFSPTAASPINSFNIIDTNNVL

Query:  AVHYDESALSILPLISQAQIHQSSVAIVDLDEKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEESS
        AVHYDESALSILPLISQA IHQSSVAIVDLD+KLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLEW EEESS
Subjt:  AVHYDESALSILPLISQAQIHQSSVAIVDLDEKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEESS

Query:  TISSSSSICSSSDDEFGS----GSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLAVFRDRLKSL
        TISSSSSICSSSDDEFGS    GSGRSGK+GGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWV QEDGTLAGTVTFASMLAVFRDRLKSL
Subjt:  TISSSSSICSSSDDEFGS----GSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLAVFRDRLKSL

TrEMBL top hitse value%identityAlignment
A0A0A0LV40 Uncharacterized protein2.5e-19189.95Show/hide
Query:  MAASLLACEVSDLCLGKPALRSISISATVADALSALTRIDEGYISVWSCGDHSSFKADLDLQCRCVGKVCMVDIICFLCRQENLLQPAIALRSPISVLIH
        MAASLLACEVSDLCLGKPALRSISISAT+ADALS LT+IDEGYISVWSCGDHSS KAD DL CRCVGKVCMVDIICFLCRQENLLQPAI L+SPISVLI 
Subjt:  MAASLLACEVSDLCLGKPALRSISISATVADALSALTRIDEGYISVWSCGDHSSFKADLDLQCRCVGKVCMVDIICFLCRQENLLQPAIALRSPISVLIH

Query:  EGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKKSSTNSISSLHNDREYCWLAPEDIIRYLLNSIGLFSPTAASPINSFNIIDTNNVL
        EG  LVRHLEPHASLMEAIDLIHDGVHNLVIPIK S SK+KN LKKS  NSISSLHND+EYCWLAPEDIIRYLLNSIGLFS TAA+PINSFNIIDTNN+L
Subjt:  EGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKKSSTNSISSLHNDREYCWLAPEDIIRYLLNSIGLFSPTAASPINSFNIIDTNNVL

Query:  AVHYDESALSILPLISQAQIHQSSVAIVDLDEKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEESS
        AV YDESALSILPLISQA IHQSSVAIVDLD+KLIGEISPFTLNFCDETV AAIATLTAGELM YIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEES 
Subjt:  AVHYDESALSILPLISQAQIHQSSVAIVDLDEKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEESS

Query:  TI-SSSSSICSSSDDEFG-----SGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLAVFRDRLKSL
        TI SSSSSICSSSDDEFG     SGSGRSG+I GYSARV+RRSEAIVCYPWNSLVAVMIQALAHRVSY+WVIQEDGTLAGTVTF S+LAVFRDRLK L
Subjt:  TI-SSSSSICSSSDDEFG-----SGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLAVFRDRLKSL

A0A1S3B5J3 CBS domain-containing protein CBSX5-like4.6e-19088.94Show/hide
Query:  MAASLLACEVSDLCLGKPALRSISISATVADALSALTRIDEGYISVWSCGDHSSFKADLDLQCRCVGKVCMVDIICFLCRQENLLQPAIALRSPISVLIH
        MAA+LL CEVSDLCLGKPALRSISISATVADALS LT+IDEGYISVWSCGDHSS KAD DL CRCVGKVCMVDIICFLCRQENLLQPAI L+SPISVLI 
Subjt:  MAASLLACEVSDLCLGKPALRSISISATVADALSALTRIDEGYISVWSCGDHSSFKADLDLQCRCVGKVCMVDIICFLCRQENLLQPAIALRSPISVLIH

Query:  EGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKKSSTNSISSLHNDREYCWLAPEDIIRYLLNSIGLFSPTAASPINSFNIIDTNNVL
        EG  LV+HLEPHASLMEAIDLIHDG+HNLVIPIK STS++KN LKKS  NSISSLHND+EYCWLAPEDIIRYLLNSIGLFS TAA+PINS NIIDTNN+L
Subjt:  EGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKKSSTNSISSLHNDREYCWLAPEDIIRYLLNSIGLFSPTAASPINSFNIIDTNNVL

Query:  AVHYDESALSILPLISQAQIHQSSVAIVDLDEKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEESS
        AV YDESALSILPLISQA IHQSSVAIVDLD+KLIGEISP TLNFCDETV AAIATLTAGE MAYIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEES 
Subjt:  AVHYDESALSILPLISQAQIHQSSVAIVDLDEKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEESS

Query:  TI-SSSSSICSSSDDEFG-----SGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLAVFRDRLKSL
        TI SSSSSICSSSDDEFG     +GSGRSG+I GYSARV+RRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQ DGTLAGTVTFAS+LAVFRDRLKSL
Subjt:  TI-SSSSSICSSSDDEFG-----SGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLAVFRDRLKSL

A0A5D3DS85 CBS domain-containing protein CBSX5-like4.6e-19088.94Show/hide
Query:  MAASLLACEVSDLCLGKPALRSISISATVADALSALTRIDEGYISVWSCGDHSSFKADLDLQCRCVGKVCMVDIICFLCRQENLLQPAIALRSPISVLIH
        MAA+LL CEVSDLCLGKPALRSISISATVADALS LT+IDEGYISVWSCGDHSS KAD DL CRCVGKVCMVDIICFLCRQENLLQPAI L+SPISVLI 
Subjt:  MAASLLACEVSDLCLGKPALRSISISATVADALSALTRIDEGYISVWSCGDHSSFKADLDLQCRCVGKVCMVDIICFLCRQENLLQPAIALRSPISVLIH

Query:  EGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKKSSTNSISSLHNDREYCWLAPEDIIRYLLNSIGLFSPTAASPINSFNIIDTNNVL
        EG  LV+HLEPHASLMEAIDLIHDG+HNLVIPIK STS++KN LKKS  NSISSLHND+EYCWLAPEDIIRYLLNSIGLFS TAA+PINS NIIDTNN+L
Subjt:  EGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKKSSTNSISSLHNDREYCWLAPEDIIRYLLNSIGLFSPTAASPINSFNIIDTNNVL

Query:  AVHYDESALSILPLISQAQIHQSSVAIVDLDEKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEESS
        AV YDESALSILPLISQA IHQSSVAIVDLD+KLIGEISP TLNFCDETV AAIATLTAGE MAYIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEES 
Subjt:  AVHYDESALSILPLISQAQIHQSSVAIVDLDEKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEESS

Query:  TI-SSSSSICSSSDDEFG-----SGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLAVFRDRLKSL
        TI SSSSSICSSSDDEFG     +GSGRSG+I GYSARV+RRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQ DGTLAGTVTFAS+LAVFRDRLKSL
Subjt:  TI-SSSSSICSSSDDEFG-----SGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLAVFRDRLKSL

A0A6J1CDV2 CBS domain-containing protein CBSX5-like1.8e-18991.09Show/hide
Query:  MAASLLACEVSDLCLGKPALRSISISATVADALSALTRIDEGYISVWSCGDHSSFKADLDLQCRCVGKVCMVDIICFLCRQENLLQPAIALRSPISVLIH
        MA SLLA EVSDLCLGKPALRSISISATVADALS LTRIDEGYISVWSCGDHSS KA+ +L CRCVGKVCMVDIICFLCRQENLLQPA ALRSPIS LIH
Subjt:  MAASLLACEVSDLCLGKPALRSISISATVADALSALTRIDEGYISVWSCGDHSSFKADLDLQCRCVGKVCMVDIICFLCRQENLLQPAIALRSPISVLIH

Query:  EGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKKSSTNSISSLHNDREYCWLAPEDIIRYLLNSIGLFSPTAASPINSFNIIDTNNVL
        E +GLVRHLEPH+SLMEAIDLI DG HNLVIPIK+STSK+KNFLKK+S  SISSLHNDREYCWLAPEDIIRYLLNSIGLFS  AASPINSFNIIDTNNVL
Subjt:  EGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKKSSTNSISSLHNDREYCWLAPEDIIRYLLNSIGLFSPTAASPINSFNIIDTNNVL

Query:  AVHYDESALSILPLISQAQIHQSSVAIVDLDEKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEESS
        AVHYDESALSILPLISQA IHQSSVAIVDLD+ LIGEISPFTLNFCDETV AAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEESS
Subjt:  AVHYDESALSILPLISQAQIHQSSVAIVDLDEKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEESS

Query:  TI-SSSSSICSSSDDEFGSGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLAVFRDRLKSL
        T  SSSSSICSSSDDEFGSGSGRSGKIGGYSARV RRSEAIVCYPWNSLVAVMIQALAHRV YVWVI+EDG LAG VTFASML+VFRDRLKS+
Subjt:  TI-SSSSSICSSSDDEFGSGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLAVFRDRLKSL

A0A6J1GR12 CBS domain-containing protein CBSX5-like1.1e-18889.03Show/hide
Query:  MAASLLACEVSDLCLGKPALRSISISATVADALSALTRIDEGYISVWSCGDHSSFKADLDLQCRCVGKVCMVDIICFLCRQENLLQPAIALRSPISVLIH
        MA SLL+ ++SDLCLGKPA+RSISISATVADALS LTRIDEG ISVWSCGDHSS +AD DL CRCVGKVCMVDIICFLCRQENLLQPAIALRSPISVLI 
Subjt:  MAASLLACEVSDLCLGKPALRSISISATVADALSALTRIDEGYISVWSCGDHSSFKADLDLQCRCVGKVCMVDIICFLCRQENLLQPAIALRSPISVLIH

Query:  EGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKKSSTNSISSLHNDREYCWLAPEDIIRYLLNSIGLFSPTAASPINSFNIIDTNNVL
        EGL LVRHLEPHASLMEAIDLI DGVHNLVIPI  S SK++NFL+ +++NSISSLHNDR+YCWLAPEDIIRYLLNSIGLFSPTAASPINS NII+TNN+L
Subjt:  EGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKKSSTNSISSLHNDREYCWLAPEDIIRYLLNSIGLFSPTAASPINSFNIIDTNNVL

Query:  AVHYDESALSILPLISQAQIHQSSVAIVDLDEKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEESS
         VHYD+SALSILPLISQA IHQSSVAIVDLD+KL+GEISPFTLNFCDET+ AAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLE+VEEESS
Subjt:  AVHYDESALSILPLISQAQIHQSSVAIVDLDEKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEESS

Query:  TISSSSSICSSSDDEFGSGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLAVFRDRLKSL
         ISSSSSICS SDDEFG GSGRS KIGGYSARVLRRSEAIVC+PWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLAVFRDRLKS+
Subjt:  TISSSSSICSSSDDEFGSGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLAVFRDRLKSL

SwissProt top hitse value%identityAlignment
Q84WQ5 CBS domain-containing protein CBSX54.6e-7844.22Show/hide
Query:  MAASLLACEVSDLCLGKPALRSI-SISATVADALSALTRIDEGYISVWSCGDHSSFKADLDLQCRCVGKVCMVDIICFLCRQENLLQPAIALRSPISVLI
        MA SLL+  VSDLCLGKP LR + S S++V+DA++AL   ++ ++SVW+C +H     D + +C C+GK+ M D+IC L +  +      AL S +SVL+
Subjt:  MAASLLACEVSDLCLGKPALRSI-SISATVADALSALTRIDEGYISVWSCGDHSSFKADLDLQCRCVGKVCMVDIICFLCRQENLLQPAIALRSPISVLI

Query:  HEGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKKSSTNSISSLHNDREYCWLAPEDIIRYLLNSIGLFSPTAASPINSFNIID-TNN
         +   +V H++P  SL+EAIDLI  G  NL++PI T    KK     + + + ++  N + +CW+  EDII++LL  I  FSP  A  ++   +I+ T+ 
Subjt:  HEGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKKSSTNSISSLHNDREYCWLAPEDIIRYLLNSIGLFSPTAASPINSFNIID-TNN

Query:  VLAVHYDESALSILPLISQAQIHQSSVAIVDLD-----EKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLE
        V+AV Y  SA +++  +S A   Q+SVA+VD +       LIGEISP TL  CDET AAA+ATL+AG+LMAYID   PP+ LVQ+V+ RLE+K L  ++ 
Subjt:  VLAVHYDESALSILPLISQAQIHQSSVAIVDLD-----EKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLE

Query:  WVEEESSTISSSSSICSSSDDEFGSGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLAVFRDRLKS
          +  SS  +SS     SS++E    +   G+    SAR+ R+SEAIVC P +SL+AVMIQA+AHRV+Y WV+++DG   G VTF  +L VFR  L++
Subjt:  WVEEESSTISSSSSICSSSDDEFGSGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLAVFRDRLKS

Q8GXI9 SNF1-related protein kinase regulatory subunit gamma-like PV42b3.4e-0421.2Show/hide
Query:  LLACEVSDLCLGKPALRSISISATVADALSALTRIDEGYISV------WSCGDHSSFKADLDLQC-----RCVGKVCMVDIICFLCRQENLLQPAIALRS
        L+  +V DL + K  L  +  +AT+ DAL+ +       + V      W  G   S   +LD Q      + +G V M+D++  +   +        + +
Subjt:  LLACEVSDLCLGKPALRSISISATVADALSALTRIDEGYISV------WSCGDHSSFKADLDLQC-----RCVGKVCMVDIICFLCRQENLLQPAIALRS

Query:  PISVLI---HEGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKKSSTNSISSLHNDREYCWLAPEDIIRYLLNSIGLFSPTAASPINS
        P+S +I    EGL L   L P+ S+M+ ++++  G+H +++P+ ++T   +N        S S+      Y  L+  D+I +  +         +S ++ 
Subjt:  PISVLI---HEGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKKSSTNSISSLHNDREYCWLAPEDIIRYLLNSIGLFSPTAASPINS

Query:  FNIIDTNNVLAVHYDESALSILPLISQAQIHQSSVAIVDLDEKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEA
               ++ A+H      ++L L SQA++                          D     +IA L A   +  ++  G  +D  QLV  +   + +  
Subjt:  FNIIDTNNVLAVHYDESALSILPLISQAQIHQSSVAIVDLDEKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEA

Query:  VLEWVEEESSTISSSSSICSSSDDEFGSGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLAVFRDRLK
             + +   +++  S    +  EF     R+      ++   R  E + C+  ++L  V+      RV  VWV+ ++G L G V+   ++AV R  L 
Subjt:  VLEWVEEESSTISSSSSICSSSDDEFGSGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLAVFRDRLK

Query:  S
        S
Subjt:  S

Q8GZA4 CBS domain-containing protein CBSX61.5e-2026.97Show/hide
Query:  MAASLLACEVSDLCLGKPALRSISISATVADALSALTRIDEGYISVWSCGDHSSFKADLD----LQCRCVGKVCMVDIICFLCRQENLLQPAIALRSPIS
        MA+  L   V DL +GKP +     + TV  A+ A+    E  I VW      S    ++     Q R VG +  +DI+ FL + E  LQ   A++ P+S
Subjt:  MAASLLACEVSDLCLGKPALRSISISATVADALSALTRIDEGYISVWSCGDHSSFKADLD----LQCRCVGKVCMVDIICFLCRQENLLQPAIALRSPIS

Query:  VLIHEGLGLVRHLEPHASLMEAIDLIHDGVHNLVIP---IKTSTSKK----------KNFLKKSSTNSISSLHNDR----------EYCWLAPEDIIRYL
         ++     L++ ++P   L++A++++  GV  L++P   +    SK+          KN    SS++ +S+   +R          ++C L+ ED+IR+L
Subjt:  VLIHEGLGLVRHLEPHASLMEAIDLIHDGVHNLVIP---IKTSTSKK----------KNFLKKSSTNSISSLHNDR----------EYCWLAPEDIIRYL

Query:  LNSIGLFSPTAASPINSFNIIDTNNVLAVHYDESALSILPLISQAQIHQSSVAIVDLDE-----KLIGEISPFTLNFCDETVAA-AIATLTAGELMAYID
        +  +G  +P   + I++  II+ N     ++ E++L  +    +     S++A+++  E     K+IGEIS   L  CD   AA A+A L AG+      
Subjt:  LNSIGLFSPTAASPINSFNIIDTNNVLAVHYDESALSILPLISQAQIHQSSVAIVDLDE-----KLIGEISPFTLNFCDETVAA-AIATLTAGELMAYID

Query:  CGGPPDDLVQLVKERLEEKNLEAVLE--WVEEESSTISSSSSICSSSDDEFGSGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWV
                V  V++ +  ++    L+  +   E +  ++++   SS    F   S     IG    R   RS  + C   +SL AVM Q L+HR ++VWV
Subjt:  CGGPPDDLVQLVKERLEEKNLEAVLE--WVEEESSTISSSSSICSSSDDEFGSGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWV

Query:  IQ--EDGTLAGTVTFASML
         +   D  L G V +  +L
Subjt:  IQ--EDGTLAGTVTFASML

Arabidopsis top hitse value%identityAlignment
AT1G65320.1 Cystathionine beta-synthase (CBS) family protein1.1e-2126.97Show/hide
Query:  MAASLLACEVSDLCLGKPALRSISISATVADALSALTRIDEGYISVWSCGDHSSFKADLD----LQCRCVGKVCMVDIICFLCRQENLLQPAIALRSPIS
        MA+  L   V DL +GKP +     + TV  A+ A+    E  I VW      S    ++     Q R VG +  +DI+ FL + E  LQ   A++ P+S
Subjt:  MAASLLACEVSDLCLGKPALRSISISATVADALSALTRIDEGYISVWSCGDHSSFKADLD----LQCRCVGKVCMVDIICFLCRQENLLQPAIALRSPIS

Query:  VLIHEGLGLVRHLEPHASLMEAIDLIHDGVHNLVIP---IKTSTSKK----------KNFLKKSSTNSISSLHNDR----------EYCWLAPEDIIRYL
         ++     L++ ++P   L++A++++  GV  L++P   +    SK+          KN    SS++ +S+   +R          ++C L+ ED+IR+L
Subjt:  VLIHEGLGLVRHLEPHASLMEAIDLIHDGVHNLVIP---IKTSTSKK----------KNFLKKSSTNSISSLHNDR----------EYCWLAPEDIIRYL

Query:  LNSIGLFSPTAASPINSFNIIDTNNVLAVHYDESALSILPLISQAQIHQSSVAIVDLDE-----KLIGEISPFTLNFCDETVAA-AIATLTAGELMAYID
        +  +G  +P   + I++  II+ N     ++ E++L  +    +     S++A+++  E     K+IGEIS   L  CD   AA A+A L AG+      
Subjt:  LNSIGLFSPTAASPINSFNIIDTNNVLAVHYDESALSILPLISQAQIHQSSVAIVDLDE-----KLIGEISPFTLNFCDETVAA-AIATLTAGELMAYID

Query:  CGGPPDDLVQLVKERLEEKNLEAVLE--WVEEESSTISSSSSICSSSDDEFGSGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWV
                V  V++ +  ++    L+  +   E +  ++++   SS    F   S     IG    R   RS  + C   +SL AVM Q L+HR ++VWV
Subjt:  CGGPPDDLVQLVKERLEEKNLEAVLE--WVEEESSTISSSSSICSSSDDEFGSGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWV

Query:  IQ--EDGTLAGTVTFASML
         +   D  L G V +  +L
Subjt:  IQ--EDGTLAGTVTFASML

AT1G80090.1 Cystathionine beta-synthase (CBS) family protein7.0e-0520.83Show/hide
Query:  LLACEVSDLCLGKPALRSISISATVADALSALTRIDEGYIS-------------VWSCGDHSSFKADLDLQC-----RCVGKVCMVDIICFLCRQENLLQ
        L+  +V DL + K  L  +  +AT+ DAL+ +T +    ++              W  G   S   +LD Q      + +G V M+D++  +   +    
Subjt:  LLACEVSDLCLGKPALRSISISATVADALSALTRIDEGYIS-------------VWSCGDHSSFKADLDLQC-----RCVGKVCMVDIICFLCRQENLLQ

Query:  PAIALRSPISVLI---HEGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKKSSTNSISSLHNDREYCWLAPEDIIRYLLNSIGLFSPT
            + +P+S +I    EGL L   L P+ S+M+ ++++  G+H +++P+ ++T   +N        S S+      Y  L+  D+I +  +        
Subjt:  PAIALRSPISVLI---HEGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKKSSTNSISSLHNDREYCWLAPEDIIRYLLNSIGLFSPT

Query:  AASPINSFNIIDTNNVLAVHYDESALSILPLISQAQIHQSSVAIVDLDEKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERL
         +S ++        ++ A+H      ++L L SQA++                          D     +IA L A   +  ++  G  +D  QLV  + 
Subjt:  AASPINSFNIIDTNNVLAVHYDESALSILPLISQAQIHQSSVAIVDLDEKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERL

Query:  EEKNLEAVLEWVEEESSTISSSSSICSSSDDEFGSGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLA
          + +       + +   +++  S    +  EF     R+      ++   R  E + C+  ++L  V+      RV  VWV+ ++G L G V+   ++A
Subjt:  EEKNLEAVLEWVEEESSTISSSSSICSSSDDEFGSGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLA

Query:  VFRDRLKS
        V R  L S
Subjt:  VFRDRLKS

AT4G27460.1 Cystathionine beta-synthase (CBS) family protein3.3e-7944.22Show/hide
Query:  MAASLLACEVSDLCLGKPALRSI-SISATVADALSALTRIDEGYISVWSCGDHSSFKADLDLQCRCVGKVCMVDIICFLCRQENLLQPAIALRSPISVLI
        MA SLL+  VSDLCLGKP LR + S S++V+DA++AL   ++ ++SVW+C +H     D + +C C+GK+ M D+IC L +  +      AL S +SVL+
Subjt:  MAASLLACEVSDLCLGKPALRSI-SISATVADALSALTRIDEGYISVWSCGDHSSFKADLDLQCRCVGKVCMVDIICFLCRQENLLQPAIALRSPISVLI

Query:  HEGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKKSSTNSISSLHNDREYCWLAPEDIIRYLLNSIGLFSPTAASPINSFNIID-TNN
         +   +V H++P  SL+EAIDLI  G  NL++PI T    KK     + + + ++  N + +CW+  EDII++LL  I  FSP  A  ++   +I+ T+ 
Subjt:  HEGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKKSSTNSISSLHNDREYCWLAPEDIIRYLLNSIGLFSPTAASPINSFNIID-TNN

Query:  VLAVHYDESALSILPLISQAQIHQSSVAIVDLD-----EKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLE
        V+AV Y  SA +++  +S A   Q+SVA+VD +       LIGEISP TL  CDET AAA+ATL+AG+LMAYID   PP+ LVQ+V+ RLE+K L  ++ 
Subjt:  VLAVHYDESALSILPLISQAQIHQSSVAIVDLD-----EKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLE

Query:  WVEEESSTISSSSSICSSSDDEFGSGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLAVFRDRLKS
          +  SS  +SS     SS++E    +   G+    SAR+ R+SEAIVC P +SL+AVMIQA+AHRV+Y WV+++DG   G VTF  +L VFR  L++
Subjt:  WVEEESSTISSSSSICSSSDDEFGSGSGRSGKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLAVFRDRLKS

AT5G53750.1 CBS domain-containing protein6.4e-8346.36Show/hide
Query:  MAASLLACEVSDLCLGKPALRSISI-SATVADALSALTRIDEGYISVWSCGDHSSFKADLDLQCRCVGKVCMVDIICFLCR-QENLLQPAIALRSPISVL
        MA +LL+ E+SDLC+GKP LR +S+ +ATVADA++AL   DE +++VWSC +H   K D + +C C+GK+CM D+IC+L +   N+L  + A  + +SVL
Subjt:  MAASLLACEVSDLCLGKPALRSISI-SATVADALSALTRIDEGYISVWSCGDHSSFKADLDLQCRCVGKVCMVDIICFLCR-QENLLQPAIALRSPISVL

Query:  IHEGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKK--------SSTNSISSLH-NDREYCWLAPEDIIRYLLNSIGLFSPTAASPIN
        + +   LV H++   SL+EAIDLI  G  NL++PI T +  K+   +K        S TN+ S+ H N RE+CW+  EDIIR+LL+SI +FSP  +  I+
Subjt:  IHEGLGLVRHLEPHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKK--------SSTNSISSLH-NDREYCWLAPEDIIRYLLNSIGLFSPTAASPIN

Query:  SFNIID-TNNVLAVHYDESALSILPLISQAQIHQSSVAIV-------DLDEKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKE
           +I+ T+ +LAV Y  SA S +  IS+A +   SVA+V       D    LIGEISP TL  CDET  AA+ATL+AG+LM+YID  GPP+ LV +V+ 
Subjt:  SFNIID-TNNVLAVHYDESALSILPLISQAQIHQSSVAIV-------DLDEKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKE

Query:  RLEEKNLEAVLEWVEEESSTISSSSSICSSSDDEFGSGSGRS----GKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVT
        RLE+K +  ++  ++      S S S  SSSD+E  +G  R     G+    +AR+ R+S AIVC   +SL+AVMIQA+AHRVSYVWVI EDG L G VT
Subjt:  RLEEKNLEAVLEWVEEESSTISSSSSICSSSDDEFGSGSGRS----GKIGGYSARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVT

Query:  FASMLAVFRDRL
        F  +L +FR+ L
Subjt:  FASMLAVFRDRL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGCAAGCTTGTTGGCCTGCGAGGTATCTGACTTATGCCTTGGAAAACCTGCGCTTCGATCCATATCCATCTCCGCCACTGTCGCCGATGCGTTATCAGCCTTGAC
GAGGATTGACGAAGGCTACATAAGCGTCTGGAGCTGTGGAGATCACTCCTCCTTTAAAGCGGATTTGGATTTACAATGTCGCTGCGTTGGTAAAGTATGCATGGTGGATA
TAATCTGTTTTCTTTGTAGACAAGAGAATTTGTTGCAACCGGCGATTGCGCTTCGATCTCCGATCTCTGTTTTGATTCATGAGGGCCTTGGACTCGTTAGGCATTTGGAA
CCTCATGCCAGTTTGATGGAGGCGATTGATCTTATACACGATGGAGTACATAACCTTGTTATCCCAATCAAAACGAGCACAAGTAAGAAGAAAAACTTTCTCAAGAAATC
ATCGACTAATTCCATCTCCTCTCTCCATAACGATAGAGAATATTGCTGGCTCGCTCCGGAGGATATAATCCGTTACCTCCTCAACTCAATCGGACTTTTCAGCCCTACCG
CCGCGAGTCCTATCAATTCCTTCAACATAATCGACACTAACAACGTTCTCGCCGTCCACTACGACGAATCTGCGTTATCTATTCTGCCTCTAATCTCGCAAGCGCAGATT
CACCAATCTTCTGTCGCCATTGTTGACTTAGACGAAAAATTGATTGGCGAAATTTCACCGTTCACGCTCAATTTCTGCGACGAGACTGTGGCGGCGGCCATAGCAACGCT
TACAGCCGGTGAACTCATGGCCTATATAGACTGTGGCGGCCCGCCAGATGATTTAGTGCAATTAGTGAAAGAAAGATTGGAAGAGAAAAACTTAGAGGCGGTTCTGGAGT
GGGTCGAGGAAGAATCATCAACAATTTCATCATCTTCTTCAATTTGCTCTTCTTCAGATGATGAATTTGGATCTGGATCAGGGAGAAGTGGAAAGATCGGTGGGTATTCA
GCGAGAGTATTGAGAAGATCGGAGGCGATTGTTTGTTATCCATGGAACTCTTTGGTTGCAGTGATGATTCAAGCTTTAGCTCATCGAGTCAGCTATGTGTGGGTCATTCA
AGAAGACGGAACTCTCGCCGGAACTGTGACGTTCGCTTCAATGTTGGCCGTTTTTCGGGATAGATTAAAGTCTCTATAG
mRNA sequenceShow/hide mRNA sequence
CTAAAAACGCGTGCGCCACCTCTCGGATAATCACATAACACGCCCTATAAATTTCAGACAATTCCCCCTCAGTCTTTCGTATTGGAAATCCGCCCGTCTCTCTTTATCTC
GACCCGACCCGACCCGGATTTGTTCTTCTTTCTCTCTCAGAACCGCCATTTCTGTACCTTCATCGATTGTTTTCTTCCATTTTCAATCCTCTCTTGCTTTTTTTGATCTC
AATTGCTCGTGTATGGCAGCAAGCTTGTTGGCCTGCGAGGTATCTGACTTATGCCTTGGAAAACCTGCGCTTCGATCCATATCCATCTCCGCCACTGTCGCCGATGCGTT
ATCAGCCTTGACGAGGATTGACGAAGGCTACATAAGCGTCTGGAGCTGTGGAGATCACTCCTCCTTTAAAGCGGATTTGGATTTACAATGTCGCTGCGTTGGTAAAGTAT
GCATGGTGGATATAATCTGTTTTCTTTGTAGACAAGAGAATTTGTTGCAACCGGCGATTGCGCTTCGATCTCCGATCTCTGTTTTGATTCATGAGGGCCTTGGACTCGTT
AGGCATTTGGAACCTCATGCCAGTTTGATGGAGGCGATTGATCTTATACACGATGGAGTACATAACCTTGTTATCCCAATCAAAACGAGCACAAGTAAGAAGAAAAACTT
TCTCAAGAAATCATCGACTAATTCCATCTCCTCTCTCCATAACGATAGAGAATATTGCTGGCTCGCTCCGGAGGATATAATCCGTTACCTCCTCAACTCAATCGGACTTT
TCAGCCCTACCGCCGCGAGTCCTATCAATTCCTTCAACATAATCGACACTAACAACGTTCTCGCCGTCCACTACGACGAATCTGCGTTATCTATTCTGCCTCTAATCTCG
CAAGCGCAGATTCACCAATCTTCTGTCGCCATTGTTGACTTAGACGAAAAATTGATTGGCGAAATTTCACCGTTCACGCTCAATTTCTGCGACGAGACTGTGGCGGCGGC
CATAGCAACGCTTACAGCCGGTGAACTCATGGCCTATATAGACTGTGGCGGCCCGCCAGATGATTTAGTGCAATTAGTGAAAGAAAGATTGGAAGAGAAAAACTTAGAGG
CGGTTCTGGAGTGGGTCGAGGAAGAATCATCAACAATTTCATCATCTTCTTCAATTTGCTCTTCTTCAGATGATGAATTTGGATCTGGATCAGGGAGAAGTGGAAAGATC
GGTGGGTATTCAGCGAGAGTATTGAGAAGATCGGAGGCGATTGTTTGTTATCCATGGAACTCTTTGGTTGCAGTGATGATTCAAGCTTTAGCTCATCGAGTCAGCTATGT
GTGGGTCATTCAAGAAGACGGAACTCTCGCCGGAACTGTGACGTTCGCTTCAATGTTGGCCGTTTTTCGGGATAGATTAAAGTCTCTATAGGATTTAACGGTTCAAAAAA
TTTCAAAGCTTATGGAGTAATCTATTGATCTGAAACCCAAAAATGGTTTTGGCTTTTGGAGATTTGAACAGGAAAATGGCTAAAAACCAAATTCCAAATTAAAAATTCAA
TGCAAAAATCTGAAACTGATTTGGCGAAAACCTAATTAGGGAAGGAAATAACAACTTTACCAATTGAAAAAAGCATTCAAACATATAGGAAGTGTTTTTGTAAACTATTT
TGAAAAGAAAAATCTGAACTTTTAACACTTTTCCTTCCTAAGAATGGAAACTATTTTCCAAAATATTTGGGTGCAACTTGGATTATATTTTTGTGCATTCTATCATAATT
TGTCATATTTATAGACTCTTCTTACTTCTAAACGAAAAAAATTTAAGTGTTGTACTCATCTATTGTGGAGTTCCTACTATACCTAATTAAAGATCGACACTTAAGTGTAG
CTAAATGGTATCTAGGTATTGCTTATTTTGAACCCACATCCAACTAACATTCAATAATTTTTTGAGGGTTAGGTTGAGTCAATTGTTCTTTTATTGAGATTTTATTTATT
TAAACTCTTGTGTTTGCCCTTTGATAACTACAGGCACGTACACAATTTTGACTTTAGTTTCAAAAATATTTTAGGTGGAAATGATATTTATAACCTTAATTTTCGAAAAT
TAAAAACTAAAAATGTTAAGATGAAAATTATATAATAATAATAATAATAATAATTATTATTATTGGATAGGATATATTGAAACAATGTAAATGGAGGGAAAAAGCTTCCT
AGTACCCATTGTATTCCCATGAGAAATGTTGAGGGTGGAGTGAAGTATAATAGGGTCTAGGTCTTGAATTTGGTCAATTTAGGGCTTGATATTTAAGGGTCAAATATTTA
CATATTTGCAGGTTTGGTGAGGCTCAGCTGCTACTTGTTATTTTGATTTTGGGCATTGAATTTTACTTAAACATTCTCCTCCACATTACAGTATAAGAGACTCATATTGT
CTATGAAAAAATTTTACGTTCTGTATCTGTAGAATTATACAAAAAAATCTCCAACATTATTTGGCAGCTATTTGGATTGGAAAATCCTACTATTTACTTACCACTAAAAT
TTCTATCAACATTTTTATATTCATATATGTGGATTGACATTTTTATTATA
Protein sequenceShow/hide protein sequence
MAASLLACEVSDLCLGKPALRSISISATVADALSALTRIDEGYISVWSCGDHSSFKADLDLQCRCVGKVCMVDIICFLCRQENLLQPAIALRSPISVLIHEGLGLVRHLE
PHASLMEAIDLIHDGVHNLVIPIKTSTSKKKNFLKKSSTNSISSLHNDREYCWLAPEDIIRYLLNSIGLFSPTAASPINSFNIIDTNNVLAVHYDESALSILPLISQAQI
HQSSVAIVDLDEKLIGEISPFTLNFCDETVAAAIATLTAGELMAYIDCGGPPDDLVQLVKERLEEKNLEAVLEWVEEESSTISSSSSICSSSDDEFGSGSGRSGKIGGYS
ARVLRRSEAIVCYPWNSLVAVMIQALAHRVSYVWVIQEDGTLAGTVTFASMLAVFRDRLKSL