; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy5G008757 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy5G008757
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionGag protease polyprotein
Genome locationGy14Chr5:6790893..6794016
RNA-Seq ExpressionCsGy5G008757
SyntenyCsGy5G008757
Gene Ontology termsGO:0006810 - transport (biological process)
GO:0006885 - regulation of pH (biological process)
GO:0012505 - endomembrane system (cellular component)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036632.1 gag protease polyprotein [Cucumis melo var. makuwa]7.43e-10653.67Show/hide
Query:  MTARKVASRGGQGGREAGHNQV--DEQPAEQVTNPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQA----------------APQDLP
        M  R+ A RGG+GGR  G  +V  + QP  Q  +P APV+ ADL AM    EQ F+D + ++  Q + A   P+   A                APQ +P
Subjt:  MTARKVASRGGQGGREAGHNQV--DEQPAEQVTNPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQA----------------APQDLP

Query:  NQLSAEAKHLRDFRIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASL
        +QLSAEAKHLRDFR Y+P TF+G  EDP    +WLSS+E IF YMKCP+NQKVQCAVF+L +R   WW + ERMLGG+V+Q TW QF ESFYAKFF ASL
Subjt:  NQLSAEAKHLRDFRIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASL

Query:  RDDKRQEFIDLKQGQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKA
        RD KRQEF++L+QG MT+++YD EFD+LS FAPE++ TEAARA +FV GL+ D++G VRA +PAT  +ALR+A+DLS  +  +  K +  G +SGQKRKA
Subjt:  RDDKRQEFIDLKQGQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKA

Query:  EQKPIDVSWRNLR
        EQ+P+ V  RN R
Subjt:  EQKPIDVSWRNLR

KGN57866.2 hypothetical protein Csa_011500 [Cucumis sativus]1.99e-10557.67Show/hide
Query:  RKVASRGGQG-GREAGHNQVDEQPAEQVTNPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQ-----GQAAP--QDLPNQLSAEAKHLRDF
        R+   RG  G GR  G NQ  E  AE    P APV+  +  A+S  +EQ F + +T + AQ+Q A A P         AAP  Q+LPNQLSAEAKHLRDF
Subjt:  RKVASRGGQG-GREAGHNQVDEQPAEQVTNPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQ-----GQAAP--QDLPNQLSAEAKHLRDF

Query:  RIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQ
        R YDPQTF+G  EDP    +WLSSVE IF+YM+CP+  +VQCA FLLR+R  IWW +  RMLGG+V Q TWDQFK+ FY KFF A+LRD K QEF++LKQ
Subjt:  RIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQ

Query:  GQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKPIDVSWRNLR
        G MT+EEYD EFD+LS FAPELV  E ARA RFV GL++++RGFVRALKP TQ EALR+A+D+S  KD+  P+    G SSGQKRK EQ+ + V  RN+R
Subjt:  GQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKPIDVSWRNLR

XP_008456947.1 PREDICTED: uncharacterized protein LOC103496742 [Cucumis melo]6.05e-14575.7Show/hide
Query:  MTARKVASRGGQGGREAGHNQVDEQPAEQVTNPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP
        MT R+ A RG QGGR  G N    QPAEQV NPVAP++ ADLT    +LEQ F DTVT+VLA+HQLAQAA +QGQ A QDLP+QLS EAKHLRDFRIYDP
Subjt:  MTARKVASRGGQGGREAGHNQVDEQPAEQVTNPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP

Query:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL
        QTFNG  EDPI+  LWLSSVE IF +M+CPD+QKVQCAVFLLRERAAIWW SVERMLGGNVNQ TWDQFKESFYAKF P+SLRD KRQEFI+LKQGQMT+
Subjt:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL

Query:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQ
        EEYDYEFD+LSLFAPELVETEAARA+ FVWGL+ DL+GFVRA KPATQTEAL +A+DLS  KDDD  KVSR GP  GQKR+  +
Subjt:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQ

XP_011655042.1 uncharacterized protein LOC101209878 [Cucumis sativus]6.97e-213100Show/hide
Query:  MTARKVASRGGQGGREAGHNQVDEQPAEQVTNPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP
        MTARKVASRGGQGGREAGHNQVDEQPAEQVTNPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP
Subjt:  MTARKVASRGGQGGREAGHNQVDEQPAEQVTNPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP

Query:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL
        QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL
Subjt:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL

Query:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKPIDVSWRNLR
        EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKPIDVSWRNLR
Subjt:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKPIDVSWRNLR

XP_031744976.1 uncharacterized protein LOC116405198 [Cucumis sativus]2.50e-10758.33Show/hide
Query:  RKVASRGGQG-GREAGHNQVDEQPAEQVTNPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQ-----GQAAP--QDLPNQLSAEAKHLRDF
        R+   RG  G GR AG NQ  E  AEQ   P APV+  +  A+S  +EQ F + +T + AQ+Q A A P         AAP  Q+LPNQLSAEAKHLRDF
Subjt:  RKVASRGGQG-GREAGHNQVDEQPAEQVTNPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQ-----GQAAP--QDLPNQLSAEAKHLRDF

Query:  RIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQ
        R YDPQTF+G  EDP    +WLSSVE IF+YM+CP+  +VQCA FLLR+R  IWW +  RMLGG+V Q TWDQFK+ FY KFF A+LRD K QEF++LKQ
Subjt:  RIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQ

Query:  GQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKPIDVSWRNLR
        G MT+EEYD EFD+LS FAPELV  E ARA RFV GL++++RGFVRALKP TQ EALR+A+D+S  KD+  P+    G SSGQKRK EQ+ + V  RN+R
Subjt:  GQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKPIDVSWRNLR

TrEMBL top hitse value%identityAlignment
A0A1S3C5M7 uncharacterized protein LOC1034967422.93e-14575.7Show/hide
Query:  MTARKVASRGGQGGREAGHNQVDEQPAEQVTNPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP
        MT R+ A RG QGGR  G N    QPAEQV NPVAP++ ADLT    +LEQ F DTVT+VLA+HQLAQAA +QGQ A QDLP+QLS EAKHLRDFRIYDP
Subjt:  MTARKVASRGGQGGREAGHNQVDEQPAEQVTNPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP

Query:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL
        QTFNG  EDPI+  LWLSSVE IF +M+CPD+QKVQCAVFLLRERAAIWW SVERMLGGNVNQ TWDQFKESFYAKF P+SLRD KRQEFI+LKQGQMT+
Subjt:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL

Query:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQ
        EEYDYEFD+LSLFAPELVETEAARA+ FVWGL+ DL+GFVRA KPATQTEAL +A+DLS  KDDD  KVSR GP  GQKR+  +
Subjt:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQ

A0A5A7SZN4 Gag protease polyprotein3.60e-10653.67Show/hide
Query:  MTARKVASRGGQGGREAGHNQV--DEQPAEQVTNPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQA----------------APQDLP
        M  R+ A RGG+GGR  G  +V  + QP  Q  +P APV+ ADL AM    EQ F+D + ++  Q + A   P+   A                APQ +P
Subjt:  MTARKVASRGGQGGREAGHNQV--DEQPAEQVTNPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQA----------------APQDLP

Query:  NQLSAEAKHLRDFRIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASL
        +QLSAEAKHLRDFR Y+P TF+G  EDP    +WLSS+E IF YMKCP+NQKVQCAVF+L +R   WW + ERMLGG+V+Q TW QF ESFYAKFF ASL
Subjt:  NQLSAEAKHLRDFRIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASL

Query:  RDDKRQEFIDLKQGQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKA
        RD KRQEF++L+QG MT+++YD EFD+LS FAPE++ TEAARA +FV GL+ D++G VRA +PAT  +ALR+A+DLS  +  +  K +  G +SGQKRKA
Subjt:  RDDKRQEFIDLKQGQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKA

Query:  EQKPIDVSWRNLR
        EQ+P+ V  RN R
Subjt:  EQKPIDVSWRNLR

A0A5A7VA59 Gag protease polyprotein2.17e-10554.69Show/hide
Query:  MTARKVASRGGQGGRE--AGHNQVDEQPAEQVTNPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQA------------APQDLPNQLS
        M  R+ A RGG+GGR   AG  Q + QP  Q  +P APV+ ADL AM    EQ F+D + ++  Q + A   P+   A            APQ +P+QLS
Subjt:  MTARKVASRGGQGGRE--AGHNQVDEQPAEQVTNPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQA------------APQDLPNQLS

Query:  AEAKHLRDFRIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDK
        AEAKHLRDFR Y+P TF+G  EDP    +WLSS+E IF YMKCP++QKVQCAVF+L +R   WW + ERMLGG+V+Q TW QFKESFYAKFF ASLRD K
Subjt:  AEAKHLRDFRIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDK

Query:  RQEFIDLKQGQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKP
        RQEF++L+QG MT+E+YD EFD+LS FAPE++ TEAARA +FV GL+ D++G VRA + AT  +ALR+A+DLS  +  +  K +  G +SGQKRKAEQ+P
Subjt:  RQEFIDLKQGQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKP

Query:  IDVSWRNLR
        + V  RN R
Subjt:  IDVSWRNLR

A0A5D3BCD2 Gag protease polyprotein2.93e-14575.7Show/hide
Query:  MTARKVASRGGQGGREAGHNQVDEQPAEQVTNPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP
        MT R+ A RG QGGR  G N    QPAEQV NPVAP++ ADLT    +LEQ F DTVT+VLA+HQLAQAA +QGQ A QDLP+QLS EAKHLRDFRIYDP
Subjt:  MTARKVASRGGQGGREAGHNQVDEQPAEQVTNPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP

Query:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL
        QTFNG  EDPI+  LWLSSVE IF +M+CPD+QKVQCAVFLLRERAAIWW SVERMLGGNVNQ TWDQFKESFYAKF P+SLRD KRQEFI+LKQGQMT+
Subjt:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL

Query:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQ
        EEYDYEFD+LSLFAPELVETEAARA+ FVWGL+ DL+GFVRA KPATQTEAL +A+DLS  KDDD  KVSR GP  GQKR+  +
Subjt:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQ

E5GB72 Ty3-gypsy retrotransposon protein2.93e-14575.7Show/hide
Query:  MTARKVASRGGQGGREAGHNQVDEQPAEQVTNPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP
        MT R+ A RG QGGR  G N    QPAEQV NPVAP++ ADLT    +LEQ F DTVT+VLA+HQLAQAA +QGQ A QDLP+QLS EAKHLRDFRIYDP
Subjt:  MTARKVASRGGQGGREAGHNQVDEQPAEQVTNPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP

Query:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL
        QTFNG  EDPI+  LWLSSVE IF +M+CPD+QKVQCAVFLLRERAAIWW SVERMLGGNVNQ TWDQFKESFYAKF P+SLRD KRQEFI+LKQGQMT+
Subjt:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL

Query:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQ
        EEYDYEFD+LSLFAPELVETEAARA+ FVWGL+ DL+GFVRA KPATQTEAL +A+DLS  KDDD  KVSR GP  GQKR+  +
Subjt:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGGCACGTAAAGTTGCAAGTAGGGGTGGTCAAGGAGGCAGAGAAGCAGGGCACAATCAAGTTGATGAACAACCTGCTGAACAAGTTACCAATCCTGTTGCACCCGT
TTCCCAAGCTGATCTTACTGCCATGTCGACTCGGCTAGAGCAGATGTTTAAAGACACCGTGACTAAAGTGTTGGCACAACATCAGCTAGCCCAAGCAGCCCCTAGTCAGG
GTCAAGCCGCACCACAGGATCTGCCGAATCAGTTGTCAGCTGAGGCTAAGCATTTGAGGGATTTCAGAATATATGATCCACAAACTTTTAATGGATTGTCGGAGGATCCC
ATTAACGTGATGTTATGGTTATCATCTGTGGAGAGAATCTTTCATTACATGAAATGTCCTGACAACCAGAAGGTTCAGTGTGCTGTGTTCCTGCTGAGGGAGAGAGCTGC
CATTTGGTGGCTTTCAGTAGAAAGGATGCTTGGTGGTAATGTTAACCAGTTTACGTGGGACCAGTTCAAGGAGAGTTTTTACGCTAAGTTTTTTCCTGCTAGCCTCAGGG
ATGACAAACGTCAAGAGTTCATAGACCTAAAGCAAGGTCAAATGACACTTGAGGAGTACGATTATGAGTTTGACATACTATCCCTCTTTGCCCCTGAGTTGGTTGAGACC
GAGGCTGCCAGAGCTCAGAGGTTTGTTTGGGGTCTGAAAAATGATCTTCGAGGTTTTGTTCGAGCCTTAAAACCAGCCACCCAAACTGAAGCATTACGGATAGCAATGGA
CTTGAGTGCCCATAAGGATGATGATCCACCGAAGGTATCAAGGAATGGACCATCTTCAGGTCAGAAGAGAAAGGCTGAGCAGAAGCCTATCGATGTTTCATGGAGAAACT
TGAGGTAA
mRNA sequenceShow/hide mRNA sequence
CTTTTTATAACTACGAGAATCTTGCTCATTTTTTTTCCATTTTTTTATTAAAGCTATTTTATTAATGAAATCTAACTAAAGGGAAGAAAATGGAAATTATTTGGATTAAA
ATTTTGCTAGAAAGAAAAGAAAAAGGAAATTTGTTTCATCTTCTTCTCCATCAAGCCCACGCCTCCCTCCTCTCCACTTTGCCGCCGCTCACCGCGGTGCAATTGGAACC
CTCCTGATTCATTTCTTGATTTTCCATCAAGTTTGAGATTTAGTTCAACATCCAATTTGGATTGAATTTGGGAATTAGTTTGCAATAGGAAATTTTTATTAAGTTAAGGA
AAATGACGGCACGTAAAGTTGCAAGTAGGGGTGGTCAAGGAGGCAGAGAAGCAGGGCACAATCAAGTTGATGAACAACCTGCTGAACAAGTTACCAATCCTGTTGCACCC
GTTTCCCAAGCTGATCTTACTGCCATGTCGACTCGGCTAGAGCAGATGTTTAAAGACACCGTGACTAAAGTGTTGGCACAACATCAGCTAGCCCAAGCAGCCCCTAGTCA
GGGTCAAGCCGCACCACAGGATCTGCCGAATCAGTTGTCAGCTGAGGCTAAGCATTTGAGGGATTTCAGAATATATGATCCACAAACTTTTAATGGATTGTCGGAGGATC
CCATTAACGTGATGTTATGGTTATCATCTGTGGAGAGAATCTTTCATTACATGAAATGTCCTGACAACCAGAAGGTTCAGTGTGCTGTGTTCCTGCTGAGGGAGAGAGCT
GCCATTTGGTGGCTTTCAGTAGAAAGGATGCTTGGTGGTAATGTTAACCAGTTTACGTGGGACCAGTTCAAGGAGAGTTTTTACGCTAAGTTTTTTCCTGCTAGCCTCAG
GGATGACAAACGTCAAGAGTTCATAGACCTAAAGCAAGGTCAAATGACACTTGAGGAGTACGATTATGAGTTTGACATACTATCCCTCTTTGCCCCTGAGTTGGTTGAGA
CCGAGGCTGCCAGAGCTCAGAGGTTTGTTTGGGGTCTGAAAAATGATCTTCGAGGTTTTGTTCGAGCCTTAAAACCAGCCACCCAAACTGAAGCATTACGGATAGCAATG
GACTTGAGTGCCCATAAGGATGATGATCCACCGAAGGTATCAAGGAATGGACCATCTTCAGGTCAGAAGAGAAAGGCTGAGCAGAAGCCTATCGATGTTTCATGGAGAAA
CTTGAGGTAAGGTGCGGCACTTCGCTGCTATCAACAGGAGTTTGTTGAGGCGGATAGAACTTTGTGAGAGAGATACCTTATTGTAGTACCATTTAGGTTGCTGTGTGTCG
GAGACTATGGCTTTCTTCAAACTCAAGGGGAAAGATTTGAAAATTCTATAAATAAGCTTTTGAGACGCAGAAGATGAAAAACTCTAGCAATTTTCTTTGACTCACTCACA
CATATACACTTTTCAGCCTAAATTTTTTTTCTCCTCCCTCAAAATATTTTTCTAAATTCACACGGAGAAACGTAGCCATTATCTCTTTGGAAATTTCTTGTTTTCTCATT
ATTCCATAAATCTCCCAAAACCACAACTAATTTCTTGTCTTCTACTTACATTCTCTTTCACCATACATCCATTTTTCTCCCTTTCACTCCCCAACAAACCTCCAGGTTTT
CTCTCTTCTCCATATTTTCTGGTTCAAGTAACTTGCAGTTTATTTCTTACTAGTTTTGCTTCCTGGTATGATTAGCATTTTTCAATCTAAGATGATATGTTCACATCTTC
ATATAACATGTATTATTTGTCTACTAATCATAAACATATTTTCCTTATAAGAGTTCTGTTT
Protein sequenceShow/hide protein sequence
MTARKVASRGGQGGREAGHNQVDEQPAEQVTNPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDPQTFNGLSEDP
INVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTLEEYDYEFDILSLFAPELVET
EAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKPIDVSWRNLR