; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G17494 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G17494
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionGag protease polyprotein
Genome locationctg278:292186..295809
RNA-Seq ExpressionCucsat.G17494
SyntenyCucsat.G17494
Gene Ontology termsGO:0006810 - transport (biological process)
GO:0006885 - regulation of pH (biological process)
GO:0012505 - endomembrane system (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036632.1 gag protease polyprotein [Cucumis melo var. makuwa]2.04e-10553.67Show/hide
Query:  MTARKVASRGGQGGREAGHNQV--DEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQA----------------APQDLP
        M  R+ A RGG+GGR  G  +V  + QP  Q  +P APV+ ADL AM    EQ F+D + ++  Q + A   P+   A                APQ +P
Subjt:  MTARKVASRGGQGGREAGHNQV--DEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQA----------------APQDLP

Query:  NQLSAEAKHLRDFRIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASL
        +QLSAEAKHLRDFR Y+P TF+G  EDP    +WLSS+E IF YMKCP+NQKVQCAVF+L +R   WW + ERMLGG+V+Q TW QF ESFYAKFF ASL
Subjt:  NQLSAEAKHLRDFRIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASL

Query:  RDDKRQEFIDLKQGQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKA
        RD KRQEF++L+QG MT+++YD EFD+LS FAPE++ TEAARA +FV GL+ D++G VRA +PAT  +ALR+A+DLS  +  +  K +  G +SGQKRKA
Subjt:  RDDKRQEFIDLKQGQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKA

Query:  EQKPIDVSWRNLR
        EQ+P+ V  RN R
Subjt:  EQKPIDVSWRNLR

KGN57866.2 hypothetical protein Csa_011500 [Cucumis sativus]5.46e-10557.67Show/hide
Query:  RKVASRGGQG-GREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQ-----GQAAP--QDLPNQLSAEAKHLRDF
        R+   RG  G GR  G NQ  E  AE    P APV+  +  A+S  +EQ F + +T + AQ+Q A A P         AAP  Q+LPNQLSAEAKHLRDF
Subjt:  RKVASRGGQG-GREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQ-----GQAAP--QDLPNQLSAEAKHLRDF

Query:  RIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQ
        R YDPQTF+G  EDP    +WLSSVE IF+YM+CP+  +VQCA FLLR+R  IWW +  RMLGG+V Q TWDQFK+ FY KFF A+LRD K QEF++LKQ
Subjt:  RIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQ

Query:  GQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKPIDVSWRNLR
        G MT+EEYD EFD+LS FAPELV  E ARA RFV GL++++RGFVRALKP TQ EALR+A+D+S  KD+  P+    G SSGQKRK EQ+ + V  RN+R
Subjt:  GQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKPIDVSWRNLR

XP_008456947.1 PREDICTED: uncharacterized protein LOC103496742 [Cucumis melo]2.89e-14576.87Show/hide
Query:  MTARKVASRGGQGGREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP
        MT R+ A RG QGGR  G N    QPAEQVANPVAP++ ADLT    +LEQ F DTVT+VLA+HQLAQAA +QGQ A QDLP+QLS EAKHLRDFRIYDP
Subjt:  MTARKVASRGGQGGREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP

Query:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL
        QTFNG  EDPI+  LWLSSVE IF +M+CPD+QKVQCAVFLLRERAAIWW SVERMLGGNVNQ TWDQFKESFYAKF P+SLRD KRQEFI+LKQGQMT+
Subjt:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL

Query:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRK
        EEYDYEFD+LSLFAPELVETEAARA+ FVWGL+ DL+GFVRA KPATQTEAL +A+DLS  KDDD  KVSR GP  GQKR+
Subjt:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRK

XP_011655042.1 uncharacterized protein LOC101209878 [Cucumis sativus]1.11e-21199.66Show/hide
Query:  MTARKVASRGGQGGREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP
        MTARKVASRGGQGGREAGHNQVDEQPAEQV NPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP
Subjt:  MTARKVASRGGQGGREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP

Query:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL
        QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL
Subjt:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL

Query:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKPIDVSWRNLR
        EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKPIDVSWRNLR
Subjt:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKPIDVSWRNLR

XP_031744976.1 uncharacterized protein LOC116405198 [Cucumis sativus]6.85e-10758.33Show/hide
Query:  RKVASRGGQG-GREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQ-----GQAAP--QDLPNQLSAEAKHLRDF
        R+   RG  G GR AG NQ  E  AEQ   P APV+  +  A+S  +EQ F + +T + AQ+Q A A P         AAP  Q+LPNQLSAEAKHLRDF
Subjt:  RKVASRGGQG-GREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQ-----GQAAP--QDLPNQLSAEAKHLRDF

Query:  RIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQ
        R YDPQTF+G  EDP    +WLSSVE IF+YM+CP+  +VQCA FLLR+R  IWW +  RMLGG+V Q TWDQFK+ FY KFF A+LRD K QEF++LKQ
Subjt:  RIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQ

Query:  GQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKPIDVSWRNLR
        G MT+EEYD EFD+LS FAPELV  E ARA RFV GL++++RGFVRALKP TQ EALR+A+D+S  KD+  P+    G SSGQKRK EQ+ + V  RN+R
Subjt:  GQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKPIDVSWRNLR

TrEMBL top hitse value%identityAlignment
A0A1S3C5M7 uncharacterized protein LOC1034967421.40e-14576.87Show/hide
Query:  MTARKVASRGGQGGREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP
        MT R+ A RG QGGR  G N    QPAEQVANPVAP++ ADLT    +LEQ F DTVT+VLA+HQLAQAA +QGQ A QDLP+QLS EAKHLRDFRIYDP
Subjt:  MTARKVASRGGQGGREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP

Query:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL
        QTFNG  EDPI+  LWLSSVE IF +M+CPD+QKVQCAVFLLRERAAIWW SVERMLGGNVNQ TWDQFKESFYAKF P+SLRD KRQEFI+LKQGQMT+
Subjt:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL

Query:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRK
        EEYDYEFD+LSLFAPELVETEAARA+ FVWGL+ DL+GFVRA KPATQTEAL +A+DLS  KDDD  KVSR GP  GQKR+
Subjt:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRK

A0A5A7SZN4 Gag protease polyprotein9.88e-10653.67Show/hide
Query:  MTARKVASRGGQGGREAGHNQV--DEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQA----------------APQDLP
        M  R+ A RGG+GGR  G  +V  + QP  Q  +P APV+ ADL AM    EQ F+D + ++  Q + A   P+   A                APQ +P
Subjt:  MTARKVASRGGQGGREAGHNQV--DEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQA----------------APQDLP

Query:  NQLSAEAKHLRDFRIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASL
        +QLSAEAKHLRDFR Y+P TF+G  EDP    +WLSS+E IF YMKCP+NQKVQCAVF+L +R   WW + ERMLGG+V+Q TW QF ESFYAKFF ASL
Subjt:  NQLSAEAKHLRDFRIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASL

Query:  RDDKRQEFIDLKQGQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKA
        RD KRQEF++L+QG MT+++YD EFD+LS FAPE++ TEAARA +FV GL+ D++G VRA +PAT  +ALR+A+DLS  +  +  K +  G +SGQKRKA
Subjt:  RDDKRQEFIDLKQGQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKA

Query:  EQKPIDVSWRNLR
        EQ+P+ V  RN R
Subjt:  EQKPIDVSWRNLR

A0A5A7VA59 Gag protease polyprotein5.95e-10554.69Show/hide
Query:  MTARKVASRGGQGGRE--AGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQA------------APQDLPNQLS
        M  R+ A RGG+GGR   AG  Q + QP  Q  +P APV+ ADL AM    EQ F+D + ++  Q + A   P+   A            APQ +P+QLS
Subjt:  MTARKVASRGGQGGRE--AGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQA------------APQDLPNQLS

Query:  AEAKHLRDFRIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDK
        AEAKHLRDFR Y+P TF+G  EDP    +WLSS+E IF YMKCP++QKVQCAVF+L +R   WW + ERMLGG+V+Q TW QFKESFYAKFF ASLRD K
Subjt:  AEAKHLRDFRIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDK

Query:  RQEFIDLKQGQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKP
        RQEF++L+QG MT+E+YD EFD+LS FAPE++ TEAARA +FV GL+ D++G VRA + AT  +ALR+A+DLS  +  +  K +  G +SGQKRKAEQ+P
Subjt:  RQEFIDLKQGQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKP

Query:  IDVSWRNLR
        + V  RN R
Subjt:  IDVSWRNLR

A0A5D3BCD2 Gag protease polyprotein1.40e-14576.87Show/hide
Query:  MTARKVASRGGQGGREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP
        MT R+ A RG QGGR  G N    QPAEQVANPVAP++ ADLT    +LEQ F DTVT+VLA+HQLAQAA +QGQ A QDLP+QLS EAKHLRDFRIYDP
Subjt:  MTARKVASRGGQGGREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP

Query:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL
        QTFNG  EDPI+  LWLSSVE IF +M+CPD+QKVQCAVFLLRERAAIWW SVERMLGGNVNQ TWDQFKESFYAKF P+SLRD KRQEFI+LKQGQMT+
Subjt:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL

Query:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRK
        EEYDYEFD+LSLFAPELVETEAARA+ FVWGL+ DL+GFVRA KPATQTEAL +A+DLS  KDDD  KVSR GP  GQKR+
Subjt:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRK

E5GB72 Ty3-gypsy retrotransposon protein1.40e-14576.87Show/hide
Query:  MTARKVASRGGQGGREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP
        MT R+ A RG QGGR  G N    QPAEQVANPVAP++ ADLT    +LEQ F DTVT+VLA+HQLAQAA +QGQ A QDLP+QLS EAKHLRDFRIYDP
Subjt:  MTARKVASRGGQGGREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP

Query:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL
        QTFNG  EDPI+  LWLSSVE IF +M+CPD+QKVQCAVFLLRERAAIWW SVERMLGGNVNQ TWDQFKESFYAKF P+SLRD KRQEFI+LKQGQMT+
Subjt:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL

Query:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRK
        EEYDYEFD+LSLFAPELVETEAARA+ FVWGL+ DL+GFVRA KPATQTEAL +A+DLS  KDDD  KVSR GP  GQKR+
Subjt:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGGCACGTAAAGTTGCAAGTAGGGGTGGTCAAGGAGGCAGAGAAGCAGGGCACAATCAAGTTGATGAACAACCTGCTGAACAAGTTGCCAATCCTGTTGCACCCGT
TTCCCAAGCTGATCTTACTGCCATGTCGACTCGGCTAGAGCAGATGTTTAAAGACACCGTGACTAAAGTGTTGGCACAACATCAGCTAGCCCAAGCAGCCCCTAGTCAGG
GTCAAGCCGCACCACAGGATCTGCCGAATCAGTTGTCAGCTGAGGCTAAGCATTTGAGGGATTTCAGAATATATGATCCACAAACTTTTAATGGATTGTCGGAGGATCCC
ATTAACGTGATGTTATGGTTATCATCTGTGGAGAGAATCTTTCATTACATGAAATGTCCTGACAACCAGAAGGTTCAGTGTGCTGTGTTCCTGCTGAGGGAGAGAGCTGC
CATTTGGTGGCTTTCAGTAGAAAGGATGCTTGGTGGTAATGTTAACCAGTTTACGTGGGACCAGTTCAAGGAGAGTTTTTACGCTAAGTTTTTTCCTGCTAGCCTCAGGG
ATGACAAACGTCAAGAGTTCATAGACCTAAAGCAAGGTCAAATGACACTTGAGGAGTACGATTATGAGTTTGACATACTATCCCTCTTTGCCCCTGAGTTGGTTGAGACC
GAGGCTGCCAGAGCTCAGAGGTTTGTTTGGGGTCTGAAAAATGATCTTCGAGGTTTTGTTCGAGCCTTAAAACCAGCCACCCAAACTGAAGCATTACGGATAGCAATGGA
CTTGAGTGCCCATAAGGATGATGATCCACCGAAGGTATCAAGGAATGGACCATCTTCAGGTCAGAAGAGAAAGGCTGAGCAGAAGCCTATCGATGTTTCATGGAGAAACT
TGAGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGACGGCACGTAAAGTTGCAAGTAGGGGTGGTCAAGGAGGCAGAGAAGCAGGGCACAATCAAGTTGATGAACAACCTGCTGAACAAGTTGCCAATCCTGTTGCACCCGT
TTCCCAAGCTGATCTTACTGCCATGTCGACTCGGCTAGAGCAGATGTTTAAAGACACCGTGACTAAAGTGTTGGCACAACATCAGCTAGCCCAAGCAGCCCCTAGTCAGG
GTCAAGCCGCACCACAGGATCTGCCGAATCAGTTGTCAGCTGAGGCTAAGCATTTGAGGGATTTCAGAATATATGATCCACAAACTTTTAATGGATTGTCGGAGGATCCC
ATTAACGTGATGTTATGGTTATCATCTGTGGAGAGAATCTTTCATTACATGAAATGTCCTGACAACCAGAAGGTTCAGTGTGCTGTGTTCCTGCTGAGGGAGAGAGCTGC
CATTTGGTGGCTTTCAGTAGAAAGGATGCTTGGTGGTAATGTTAACCAGTTTACGTGGGACCAGTTCAAGGAGAGTTTTTACGCTAAGTTTTTTCCTGCTAGCCTCAGGG
ATGACAAACGTCAAGAGTTCATAGACCTAAAGCAAGGTCAAATGACACTTGAGGAGTACGATTATGAGTTTGACATACTATCCCTCTTTGCCCCTGAGTTGGTTGAGACC
GAGGCTGCCAGAGCTCAGAGGTTTGTTTGGGGTCTGAAAAATGATCTTCGAGGTTTTGTTCGAGCCTTAAAACCAGCCACCCAAACTGAAGCATTACGGATAGCAATGGA
CTTGAGTGCCCATAAGGATGATGATCCACCGAAGGTATCAAGGAATGGACCATCTTCAGGTCAGAAGAGAAAGGCTGAGCAGAAGCCTATCGATGTTTCATGGAGAAACT
TGAGGTAA
Protein sequenceShow/hide protein sequence
MTARKVASRGGQGGREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDPQTFNGLSEDP
INVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTLEEYDYEFDILSLFAPELVET
EAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKPIDVSWRNLR