; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy4G013490 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy4G013490
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Descriptionchaperonin-like RbcX protein 2, chloroplastic
Genome locationGy14Chr4:18552088..18555443
RNA-Seq ExpressionCsGy4G013490
SyntenyCsGy4G013490
Gene Ontology termsGO:0006457 - protein folding (biological process)
GO:0015977 - carbon fixation (biological process)
GO:0015979 - photosynthesis (biological process)
GO:0110102 - chloroplast ribulose bisphosphate carboxylase complex assembly (biological process)
GO:0044183 - protein folding chaperone (molecular function)
InterPro domainsIPR003435 - Chaperonin-like RbcX
IPR038052 - Chaperonin-like RbcX superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0041415.1 RcbX domain-containing protein [Cucumis melo var. makuwa]1.60e-11894.44Show/hide
Query:  MVGALFVVGAPVVDSLPPCLSFDSSPVTNMSLRSGGDLVLQRKSKAKSYLTVSKSVDLRSSFVNLGDEWQLSAGGGRQSKRNQRRNRRLVVVNEFAGQYE
        MVGALFVVGAPV+DS PPC SFDSSPVTNMSLRSGGDLVLQRKSK KSY TVSKSVDLRSSFVN GDEWQLSAGG R+S+RNQRRNRRLV+VNEFAGQYE
Subjt:  MVGALFVVGAPVVDSLPPCLSFDSSPVTNMSLRSGGDLVLQRKSKAKSYLTVSKSVDLRSSFVNLGDEWQLSAGGGRQSKRNQRRNRRLVVVNEFAGQYE

Query:  DSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK
        DSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK
Subjt:  DSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK

XP_004142113.1 chaperonin-like RbcX protein 2, chloroplastic [Cucumis sativus]4.48e-125100Show/hide
Query:  MVGALFVVGAPVVDSLPPCLSFDSSPVTNMSLRSGGDLVLQRKSKAKSYLTVSKSVDLRSSFVNLGDEWQLSAGGGRQSKRNQRRNRRLVVVNEFAGQYE
        MVGALFVVGAPVVDSLPPCLSFDSSPVTNMSLRSGGDLVLQRKSKAKSYLTVSKSVDLRSSFVNLGDEWQLSAGGGRQSKRNQRRNRRLVVVNEFAGQYE
Subjt:  MVGALFVVGAPVVDSLPPCLSFDSSPVTNMSLRSGGDLVLQRKSKAKSYLTVSKSVDLRSSFVNLGDEWQLSAGGGRQSKRNQRRNRRLVVVNEFAGQYE

Query:  DSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK
        DSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK
Subjt:  DSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK

XP_008449714.1 PREDICTED: uncharacterized protein LOC103491510 [Cucumis melo]3.94e-11995Show/hide
Query:  MVGALFVVGAPVVDSLPPCLSFDSSPVTNMSLRSGGDLVLQRKSKAKSYLTVSKSVDLRSSFVNLGDEWQLSAGGGRQSKRNQRRNRRLVVVNEFAGQYE
        MVGALFVVGAPV+DS PPC SFDSSPVTNMSLRSGGDLVLQRKSK KSY TVSKSVDLRSSFVN GDEWQLSAGG RQS+RNQRRNRRLV+VNEFAGQYE
Subjt:  MVGALFVVGAPVVDSLPPCLSFDSSPVTNMSLRSGGDLVLQRKSKAKSYLTVSKSVDLRSSFVNLGDEWQLSAGGGRQSKRNQRRNRRLVVVNEFAGQYE

Query:  DSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK
        DSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK
Subjt:  DSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK

XP_023512052.1 chaperonin-like RbcX protein 2, chloroplastic [Cucurbita pepo subsp. pepo]9.48e-10586.19Show/hide
Query:  MVGALFVVGAPVVDSLP-PCLSFDSSPVTNMSLRSGGDLVLQRKSKAKSYLTVSKSVDLRSSFVNLGDEWQLSAGGGRQSKRNQRRNRRLVVVNEFAGQY
        MVGALFV GAPV+DS P PCL  DSSPVTNMSLRS GDLVL R+SKAKS+L VS+ VDLRSSFVN G EWQLSA GG + +RNQRRNRRLVVVNEFAGQY
Subjt:  MVGALFVVGAPVVDSLP-PCLSFDSSPVTNMSLRSGGDLVLQRKSKAKSYLTVSKSVDLRSSFVNLGDEWQLSAGGGRQSKRNQRRNRRLVVVNEFAGQY

Query:  EDSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK
        E+SFDDVKMQI NYFTYKAVKTVLNQLYEMNPTQYRWFY+FVVNHKP +GKRFIRTLVKEKQDLAERVMITRLHLYNKWVK
Subjt:  EDSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK

XP_038902700.1 chaperonin-like RbcX protein 2, chloroplastic [Benincasa hispida]2.21e-10489.01Show/hide
Query:  MVGALFVVGAPVVDSLPPCLSF--DSSPVTNMSLRSGGDLVLQRKSKAKSYLTVSKSVDLRSSFVNLGDEWQLSAGGGRQSKRNQRRNRRLVVVNEFAGQ
        MVGALFVVGAPVVDS PPC S   DSSPVT  SLRSGGDLVLQRKS   S+L VSKSVDLRSSFVN   EW+LSA GG +S+RNQRRNRRLVVVNEFAGQ
Subjt:  MVGALFVVGAPVVDSLPPCLSF--DSSPVTNMSLRSGGDLVLQRKSKAKSYLTVSKSVDLRSSFVNLGDEWQLSAGGGRQSKRNQRRNRRLVVVNEFAGQ

Query:  YEDSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK
        YEDSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK
Subjt:  YEDSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK

TrEMBL top hitse value%identityAlignment
A0A0A0KX19 Uncharacterized protein2.17e-125100Show/hide
Query:  MVGALFVVGAPVVDSLPPCLSFDSSPVTNMSLRSGGDLVLQRKSKAKSYLTVSKSVDLRSSFVNLGDEWQLSAGGGRQSKRNQRRNRRLVVVNEFAGQYE
        MVGALFVVGAPVVDSLPPCLSFDSSPVTNMSLRSGGDLVLQRKSKAKSYLTVSKSVDLRSSFVNLGDEWQLSAGGGRQSKRNQRRNRRLVVVNEFAGQYE
Subjt:  MVGALFVVGAPVVDSLPPCLSFDSSPVTNMSLRSGGDLVLQRKSKAKSYLTVSKSVDLRSSFVNLGDEWQLSAGGGRQSKRNQRRNRRLVVVNEFAGQYE

Query:  DSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK
        DSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK
Subjt:  DSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK

A0A1S3BNA7 uncharacterized protein LOC1034915101.91e-11995Show/hide
Query:  MVGALFVVGAPVVDSLPPCLSFDSSPVTNMSLRSGGDLVLQRKSKAKSYLTVSKSVDLRSSFVNLGDEWQLSAGGGRQSKRNQRRNRRLVVVNEFAGQYE
        MVGALFVVGAPV+DS PPC SFDSSPVTNMSLRSGGDLVLQRKSK KSY TVSKSVDLRSSFVN GDEWQLSAGG RQS+RNQRRNRRLV+VNEFAGQYE
Subjt:  MVGALFVVGAPVVDSLPPCLSFDSSPVTNMSLRSGGDLVLQRKSKAKSYLTVSKSVDLRSSFVNLGDEWQLSAGGGRQSKRNQRRNRRLVVVNEFAGQYE

Query:  DSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK
        DSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK
Subjt:  DSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK

A0A5A7TJ10 RcbX domain-containing protein7.76e-11994.44Show/hide
Query:  MVGALFVVGAPVVDSLPPCLSFDSSPVTNMSLRSGGDLVLQRKSKAKSYLTVSKSVDLRSSFVNLGDEWQLSAGGGRQSKRNQRRNRRLVVVNEFAGQYE
        MVGALFVVGAPV+DS PPC SFDSSPVTNMSLRSGGDLVLQRKSK KSY TVSKSVDLRSSFVN GDEWQLSAGG R+S+RNQRRNRRLV+VNEFAGQYE
Subjt:  MVGALFVVGAPVVDSLPPCLSFDSSPVTNMSLRSGGDLVLQRKSKAKSYLTVSKSVDLRSSFVNLGDEWQLSAGGGRQSKRNQRRNRRLVVVNEFAGQYE

Query:  DSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK
        DSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK
Subjt:  DSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK

A0A5D3BAI6 RcbX domain-containing protein1.91e-11995Show/hide
Query:  MVGALFVVGAPVVDSLPPCLSFDSSPVTNMSLRSGGDLVLQRKSKAKSYLTVSKSVDLRSSFVNLGDEWQLSAGGGRQSKRNQRRNRRLVVVNEFAGQYE
        MVGALFVVGAPV+DS PPC SFDSSPVTNMSLRSGGDLVLQRKSK KSY TVSKSVDLRSSFVN GDEWQLSAGG RQS+RNQRRNRRLV+VNEFAGQYE
Subjt:  MVGALFVVGAPVVDSLPPCLSFDSSPVTNMSLRSGGDLVLQRKSKAKSYLTVSKSVDLRSSFVNLGDEWQLSAGGGRQSKRNQRRNRRLVVVNEFAGQYE

Query:  DSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK
        DSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK
Subjt:  DSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK

A0A6J1FUS7 chaperonin-like RbcX protein 2, chloroplastic7.59e-10485.64Show/hide
Query:  MVGALFVVGAPVVDSLP-PCLSFDSSPVTNMSLRSGGDLVLQRKSKAKSYLTVSKSVDLRSSFVNLGDEWQLSAGGGRQSKRNQRRNRRLVVVNEFAGQY
        MVGAL V GAPV+DS P PCL  DSSPVTNMSLRS GDLVL R+SKAKS+L VS+ VDLRSSFVN G EWQLSA GG + +RNQRRNRRLVVVNEFAGQY
Subjt:  MVGALFVVGAPVVDSLP-PCLSFDSSPVTNMSLRSGGDLVLQRKSKAKSYLTVSKSVDLRSSFVNLGDEWQLSAGGGRQSKRNQRRNRRLVVVNEFAGQY

Query:  EDSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK
        E+SFDDVKMQI NYFTYKAVKTVLNQLYEMNPTQYRWFY+FVVNHKP +GKRFIRTLVKEKQDLAERVMITRLHLYNKWVK
Subjt:  EDSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK

SwissProt top hitse value%identityAlignment
O86418 RuBisCO chaperone RbcX2.3e-0640Show/hide
Query:  DVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHL
        D    +Q+Y TY+A+ TVL QL E NP    W + F V  K  +G+ +++ L +E+ DLA R+M  R H+
Subjt:  DVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHL

Q44212 RuBisCO chaperone RbcX3.5e-0742.86Show/hide
Query:  DVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHL
        D    +Q+Y TY+A++TVL QL E NP    W +NF    K  +G+++I  L  EK DLA R+M  R H+
Subjt:  DVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHL

Q8DIS6 RuBisCO chaperone RbcX5.0e-0636.51Show/hide
Query:  NYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHL
        +Y TY+AV+TV+ QL E +P +  W + F       +G+R++  L +E+ DL  R++  R HL
Subjt:  NYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHL

Q8L9X2 Chaperonin-like RbcX protein 2, chloroplastic3.8e-3848.9Show/hide
Query:  MVGALFVVGAPVVD-SLPPCLSFDSSPVTNMSLRSGGDLVLQRKSKAKSYLTVSKSVDLRSSFVNLGDEWQLSAGGGRQSKRNQRRNRRLVVVNE-FAGQ
        MV A FVVG+PV+D S  PCL  D+     +              + K  L  +++++L SSF      ++LS    R S+   R++++L++VNE  AG 
Subjt:  MVGALFVVGAPVVD-SLPPCLSFDSSPVTNMSLRSGGDLVLQRKSKAKSYLTVSKSVDLRSSFVNLGDEWQLSAGGGRQSKRNQRRNRRLVVVNE-FAGQ

Query:  YEDSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK
        Y+D+F DV+ QI NYFTYKAV+TVL+QLYEMNP QY WFYN ++ ++P +GKRF+R L KE Q+LAERVMITRLHLY KW+K
Subjt:  YEDSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK

Arabidopsis top hitse value%identityAlignment
AT5G19855.1 Chaperonin-like RbcX protein2.7e-3948.9Show/hide
Query:  MVGALFVVGAPVVD-SLPPCLSFDSSPVTNMSLRSGGDLVLQRKSKAKSYLTVSKSVDLRSSFVNLGDEWQLSAGGGRQSKRNQRRNRRLVVVNE-FAGQ
        MV A FVVG+PV+D S  PCL  D+     +              + K  L  +++++L SSF      ++LS    R S+   R++++L++VNE  AG 
Subjt:  MVGALFVVGAPVVD-SLPPCLSFDSSPVTNMSLRSGGDLVLQRKSKAKSYLTVSKSVDLRSSFVNLGDEWQLSAGGGRQSKRNQRRNRRLVVVNE-FAGQ

Query:  YEDSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK
        Y+D+F DV+ QI NYFTYKAV+TVL+QLYEMNP QY WFYN ++ ++P +GKRF+R L KE Q+LAERVMITRLHLY KW+K
Subjt:  YEDSFDDVKMQIQNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGGAGCTTTGTTTGTTGTTGGTGCACCTGTGGTCGACTCCCTGCCGCCGTGTTTGTCTTTCGACTCGTCGCCGGTTACCAATATGAGTCTTAGAAGTGGTGGGGA
TTTGGTTTTGCAAAGGAAATCCAAGGCTAAGAGTTATTTGACCGTGTCTAAATCGGTGGACTTGAGAAGTTCGTTTGTCAATCTTGGTGACGAGTGGCAGCTCTCGGCCG
GTGGTGGCCGTCAGAGTAAGAGGAACCAACGAAGGAATCGGAGGCTTGTTGTTGTCAATGAATTTGCAGGGCAGTATGAGGATAGTTTTGATGATGTGAAAATGCAAATA
CAGAATTATTTCACATACAAAGCTGTGAAGACAGTTTTGAATCAGCTTTATGAGATGAACCCAACACAGTACAGATGGTTTTATAACTTTGTTGTAAATCACAAGCCTGG
GGAAGGGAAGCGTTTCATTCGAACTCTTGTAAAGGAGAAGCAAGATTTGGCTGAGAGGGTGATGATAACAAGGCTTCATCTCTATAACAAATGGGTTAAGGTAAGAACAA
ACTTGATATTTTGA
mRNA sequenceShow/hide mRNA sequence
GCAGTAAGAATATTTAATTTAAATATGGAAAAAACAACCACAATTGGTTTTGAGAGTTGGGGAATGTTGCATGGATGATTTAAGACCTCAGAAAAATGTTTGGTTTACGT
TGGCAGTTGACATGCATTTTCTTTTTGTACCAAAGAAAAAAGGAAATTAACGACAAACTTCCAACAATATGAAACCTTTCTTTCATCCTCTTTCTTGTACAATTCTAAAT
TGAATTCTCAAAGTTTGATCAGTTTCTTACGATAGCTGAATCTTTGACCCAATCTCCTATTCTCTCCCACTTTTCCTTTATATTCACTTCTTTCTCTTTCAATTCCTTTA
TTCCCACTTCATATATCCTCCTTCTCTCAACGGTTGGCTTGTGATCTTGGAATCTTTTTTTCCAAAATGGTGGGAGCTTTGTTTGTTGTTGGTGCACCTGTGGTCGACTC
CCTGCCGCCGTGTTTGTCTTTCGACTCGTCGCCGGTTACCAATATGAGTCTTAGAAGTGGTGGGGATTTGGTTTTGCAAAGGAAATCCAAGGCTAAGAGTTATTTGACCG
TGTCTAAATCGGTGGACTTGAGAAGTTCGTTTGTCAATCTTGGTGACGAGTGGCAGCTCTCGGCCGGTGGTGGCCGTCAGAGTAAGAGGAACCAACGAAGGAATCGGAGG
CTTGTTGTTGTCAATGAATTTGCAGGGCAGTATGAGGATAGTTTTGATGATGTGAAAATGCAAATACAGAATTATTTCACATACAAAGCTGTGAAGACAGTTTTGAATCA
GCTTTATGAGATGAACCCAACACAGTACAGATGGTTTTATAACTTTGTTGTAAATCACAAGCCTGGGGAAGGGAAGCGTTTCATTCGAACTCTTGTAAAGGAGAAGCAAG
ATTTGGCTGAGAGGGTGATGATAACAAGGCTTCATCTCTATAACAAATGGGTTAAGGTAAGAACAAACTTGATATTTTGAGGGGTTTTTATTATTATTATTATTATATTG
AATTGTGATTTGGTGTTGGATAGAAATGTGATCATGCTGAAATATACAAAGGGATATCTGATGAGAACTTGGAGTTGATGCGAGAGAGGCTAATGGAGACTGTGATATGG
CCTTCCGACGACTAGAGCTTTTAGGAGATTGGCAGACTAGGAGTTAGAAGATTTTTTTATCTTATTTATTTTCCTTTAGTGTCGACCAATCTGGTCCTGTTCCTAATTAG
TTGTCATTCACATTCATATTATTTTCCCCCAACTAATAACAATTCTTACATCCACTACTACAAAAAAAATCAATGTAAATACTTTCAACATCTTATACCATGTTAAAAGT
ATCAATGTCATTGAATATGTTTCTATAAACAAAAGGCATTTTCGAGATTAGTCCAACCGTTTCATATCTGTTGTAATTGCAGTAATCAAACCTCACAGGATATCTCATTT
TGTGCTAATCTAGGTACGATTCTATTGGTTTAGGCCTAAACCTTTGACTAAAAGGTCATAGGTTCGAAAATCACGAATCCCATATGTTGTTGAACTCAAAAACATATAAT
AATGTTTCACTTTCTGTGTCAACACCTTTCCAGTTGAAAGAGTAATAATTTCTCAATTTCTAAAGCAAAATCACGAAGGTGACATTTACTATCTTCAAGAGAAATTTATG
CAGATCCACATCCACCACCATCATGAAGATGATAAAGGCGGTAATCTTTAGTAAAAGGTGTTGGTTCATGGTTGTGATTTACAGCTCTTTCAATAAAATAACATGAGAAA
TACATATTTGAAATGCACGTCATATATCTAATCATAATTTGTACCAGCAAGTAGAATGGTTTGTTTATGCATGATAAACCAACGCACTCATACATTAGAAAGGATAATGA
GAAAATCAGTACAAAGTATACGTACTGATTCGGTTCTCGGAGGAATCTAGATCAAGTTGGGGGAACCAGGCAATTGACCAAAAGACCGCTGCAGATCACAAAGAAGAATG
CTGCAAGAAGTGGATCTCTTGTAAGATATATCGGACAAGAACAGAGAATATCGATCTGGGCTTCATCCCTTTCATTTTACAATTCTCTCACAAAGCACATCTCAATTCAG
TTTTGAATAGCAGGAGTGAATCATGAGCATAATGCAAGGAGTGAACAATGACTTAATTTTTTTCAAGAAAAGCTCTTTTCATAACACGCACAAAATCACGTCTGGCTGGA
TGACAGAAAAATCCTTAATGTCAGCCCAAATCTCGGATCAACGATGTCAAAGAATTCAAATTTTCGTAAATTCTTTCTGATTGAGGGTGTAATACATCAGTAGCAGTGAA
ATGGTGAACCTGATTGCCAATTACAATCCAGCTAGTTCCTGGAGGCTTTCCTAGACCTCTTTCTCGAAGCCACTTTCTTATCTGTACAACTTCTGACCAAAGGCCTGCTG
CTGCATAAATATTGGAAAGCAAAATAAAATTGCCTGGATTGCTTGGTTCTAATTCAACAAGCTTTCCAAAGATGGTCTCCAATAGCTTGTTATTGCTTTTAATCTGACAT
GAACTGAGCAAAGCTCTCCAAATTGATGCATCAGGTTCAATGGGCATTGAGTTGATGAAAGCTATAGCTTCAGAAAAATGGCCCCCACGACCAAGCAGATCAACCATACA
ACCATAGTGAGTAAGTTGAGGAGCAATACCAAAATCCCGCACCATGGAATGGAAAAGTTGCAAACCTGTCACGGTCAAACCAGAATGGCTGCAGGCAGATAAAACAGATG
C
Protein sequenceShow/hide protein sequence
MVGALFVVGAPVVDSLPPCLSFDSSPVTNMSLRSGGDLVLQRKSKAKSYLTVSKSVDLRSSFVNLGDEWQLSAGGGRQSKRNQRRNRRLVVVNEFAGQYEDSFDDVKMQI
QNYFTYKAVKTVLNQLYEMNPTQYRWFYNFVVNHKPGEGKRFIRTLVKEKQDLAERVMITRLHLYNKWVKVRTNLIF