; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G11500 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G11500
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionGag protease polyprotein
Genome locationChr5:10128925..10131506
RNA-Seq ExpressionCSPI05G11500
SyntenyCSPI05G11500
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004518 - nuclease activity (molecular function)
GO:0008233 - peptidase activity (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025469.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]6.7e-8756.72Show/hide
Query:  MTARKVASRGGQG--GREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQA--------APQDLPNQLSAEAK
        M  R+ A RGG+G  GR AG  Q + QP  Q  +P APV+ ADL AM    EQ F+D + ++  Q Q A  AP+   A        APQ +P+QLSAEAK
Subjt:  MTARKVASRGGQG--GREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQA--------APQDLPNQLSAEAK

Query:  HLRDFRIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEF
        HLRDFR Y+P TF+G  +DP    LWLSS+E IF YMKCP++QKVQCAVF+L +R   WW + ERMLGG+V+Q TW QFKESFYAKFF ASLRD KRQEF
Subjt:  HLRDFRIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEF

Query:  IDLKQGQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKPIDVS
        ++L+QG MT+E+YD EFD+LS FAPE++ TEAARA +FV GL+ D++GFVRA +PAT  +ALR+A+DLS  +  +  K +  G +SGQKRKAEQ+P+ V 
Subjt:  IDLKQGQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKPIDVS

Query:  WRNLR
         RN R
Subjt:  WRNLR

KAA0031931.1 pol protein [Cucumis melo var. makuwa]5.7e-8655.99Show/hide
Query:  MTARKVASRGGQG--GREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQA------------APQDLPNQLS
        M  R+ A RGGQG  GR AG  Q + QP  Q  +P APV+ ADL AM    EQ F+D + ++  Q Q    AP+   A            APQ +P+QLS
Subjt:  MTARKVASRGGQG--GREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQA------------APQDLPNQLS

Query:  AEAKHLRDFRIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDK
        AEAKHLRDFR Y+P TF+G  EDP    LWLSS+E IF YMKCP++QKVQCAVF+L +R   WW + ERMLGG+V+Q TW QFKESFYAKFF ASLRD K
Subjt:  AEAKHLRDFRIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDK

Query:  RQEFIDLKQGQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKP
        RQEF++L+QG MT+E+YD EFD+LS FAPE++ TEAARA +FV GL+ D++G VRA +PAT  +ALR+A+DLS  +  +  K +  G +SGQKRKAEQ+P
Subjt:  RQEFIDLKQGQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKP

Query:  IDVSWRNLR
        + V  RN R
Subjt:  IDVSWRNLR

KAA0065602.1 pol protein [Cucumis melo var. makuwa]3.3e-8656.77Show/hide
Query:  MTARKVASRGGQG--GREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQA------APQDLPNQLSAEAKHL
        M  R+ A RGGQG  GR AG  Q + QP  Q  +P APV+ ADL AM    EQ F+D + ++  Q Q A  AP+   A      APQ +P+QLSAEAKHL
Subjt:  MTARKVASRGGQG--GREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQA------APQDLPNQLSAEAKHL

Query:  RDFRIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFID
        RDFR Y+P TF+G  EDP    LWLSS+E IF YMKCP++QKVQCAVF+L +R   WW + ERMLGG+V+Q TW QFKESFYAKFF ASLRD KRQ+F++
Subjt:  RDFRIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFID

Query:  LKQGQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKPIDVSWR
        L+QG MT+E+YD EFD+LS FAPE++ TE ARA +FV GL+ D++G VRA +PAT  +ALR+A+DLS  +  +  K +  G +SGQKRKAEQ+P+ V  R
Subjt:  LKQGQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKPIDVSWR

Query:  NLR
        N R
Subjt:  NLR

XP_008456947.1 PREDICTED: uncharacterized protein LOC103496742 [Cucumis melo]1.4e-11376.87Show/hide
Query:  MTARKVASRGGQGGREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP
        MT R+ A RG QGGR  G N    QPAEQVANPVAP++ ADL    T+LEQ F DTVT+VLA+HQLAQAA +QGQ A QDLP+QLS EAKHLRDFRIYDP
Subjt:  MTARKVASRGGQGGREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP

Query:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL
        QTFNG  EDPI+  LWLSSVE IF +M+CPD+QKVQCAVFLLRERAAIWW SVERMLGGNVNQ TWDQFKESFYAKF P+SLRD KRQEFI+LKQGQMT+
Subjt:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL

Query:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRK
        EEYDYEFD+LSLFAPELVETEAARA+ FVWGL+ DL+GFVRA KPATQTEAL +A+DLS  KDDD  KVSR GP  GQKR+
Subjt:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRK

XP_011655042.1 uncharacterized protein LOC101209878 [Cucumis sativus]2.3e-16499.66Show/hide
Query:  MTARKVASRGGQGGREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP
        MTARKVASRGGQGGREAGHNQVDEQPAEQV NPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP
Subjt:  MTARKVASRGGQGGREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP

Query:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL
        QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL
Subjt:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL

Query:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKPIDVSWRNLR
        EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKPIDVSWRNLR
Subjt:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKPIDVSWRNLR

TrEMBL top hitse value%identityAlignment
A0A1S3C5M7 uncharacterized protein LOC1034967427.0e-11476.87Show/hide
Query:  MTARKVASRGGQGGREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP
        MT R+ A RG QGGR  G N    QPAEQVANPVAP++ ADL    T+LEQ F DTVT+VLA+HQLAQAA +QGQ A QDLP+QLS EAKHLRDFRIYDP
Subjt:  MTARKVASRGGQGGREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP

Query:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL
        QTFNG  EDPI+  LWLSSVE IF +M+CPD+QKVQCAVFLLRERAAIWW SVERMLGGNVNQ TWDQFKESFYAKF P+SLRD KRQEFI+LKQGQMT+
Subjt:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL

Query:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRK
        EEYDYEFD+LSLFAPELVETEAARA+ FVWGL+ DL+GFVRA KPATQTEAL +A+DLS  KDDD  KVSR GP  GQKR+
Subjt:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRK

A0A5A7SJH3 Reverse transcriptase3.3e-8756.72Show/hide
Query:  MTARKVASRGGQG--GREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQA--------APQDLPNQLSAEAK
        M  R+ A RGG+G  GR AG  Q + QP  Q  +P APV+ ADL AM    EQ F+D + ++  Q Q A  AP+   A        APQ +P+QLSAEAK
Subjt:  MTARKVASRGGQG--GREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQA--------APQDLPNQLSAEAK

Query:  HLRDFRIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEF
        HLRDFR Y+P TF+G  +DP    LWLSS+E IF YMKCP++QKVQCAVF+L +R   WW + ERMLGG+V+Q TW QFKESFYAKFF ASLRD KRQEF
Subjt:  HLRDFRIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEF

Query:  IDLKQGQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKPIDVS
        ++L+QG MT+E+YD EFD+LS FAPE++ TEAARA +FV GL+ D++GFVRA +PAT  +ALR+A+DLS  +  +  K +  G +SGQKRKAEQ+P+ V 
Subjt:  IDLKQGQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKPIDVS

Query:  WRNLR
         RN R
Subjt:  WRNLR

A0A5A7VBD0 Reverse transcriptase1.6e-8656.77Show/hide
Query:  MTARKVASRGGQG--GREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQA------APQDLPNQLSAEAKHL
        M  R+ A RGGQG  GR AG  Q + QP  Q  +P APV+ ADL AM    EQ F+D + ++  Q Q A  AP+   A      APQ +P+QLSAEAKHL
Subjt:  MTARKVASRGGQG--GREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQA------APQDLPNQLSAEAKHL

Query:  RDFRIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFID
        RDFR Y+P TF+G  EDP    LWLSS+E IF YMKCP++QKVQCAVF+L +R   WW + ERMLGG+V+Q TW QFKESFYAKFF ASLRD KRQ+F++
Subjt:  RDFRIYDPQTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFID

Query:  LKQGQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKPIDVSWR
        L+QG MT+E+YD EFD+LS FAPE++ TE ARA +FV GL+ D++G VRA +PAT  +ALR+A+DLS  +  +  K +  G +SGQKRKAEQ+P+ V  R
Subjt:  LKQGQMTLEEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKPIDVSWR

Query:  NLR
        N R
Subjt:  NLR

A0A5D3BCD2 Gag protease polyprotein7.0e-11476.87Show/hide
Query:  MTARKVASRGGQGGREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP
        MT R+ A RG QGGR  G N    QPAEQVANPVAP++ ADL    T+LEQ F DTVT+VLA+HQLAQAA +QGQ A QDLP+QLS EAKHLRDFRIYDP
Subjt:  MTARKVASRGGQGGREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP

Query:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL
        QTFNG  EDPI+  LWLSSVE IF +M+CPD+QKVQCAVFLLRERAAIWW SVERMLGGNVNQ TWDQFKESFYAKF P+SLRD KRQEFI+LKQGQMT+
Subjt:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL

Query:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRK
        EEYDYEFD+LSLFAPELVETEAARA+ FVWGL+ DL+GFVRA KPATQTEAL +A+DLS  KDDD  KVSR GP  GQKR+
Subjt:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRK

E5GB72 Ty3-gypsy retrotransposon protein7.0e-11476.87Show/hide
Query:  MTARKVASRGGQGGREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP
        MT R+ A RG QGGR  G N    QPAEQVANPVAP++ ADL    T+LEQ F DTVT+VLA+HQLAQAA +QGQ A QDLP+QLS EAKHLRDFRIYDP
Subjt:  MTARKVASRGGQGGREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDP

Query:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL
        QTFNG  EDPI+  LWLSSVE IF +M+CPD+QKVQCAVFLLRERAAIWW SVERMLGGNVNQ TWDQFKESFYAKF P+SLRD KRQEFI+LKQGQMT+
Subjt:  QTFNGLSEDPINVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTL

Query:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRK
        EEYDYEFD+LSLFAPELVETEAARA+ FVWGL+ DL+GFVRA KPATQTEAL +A+DLS  KDDD  KVSR GP  GQKR+
Subjt:  EEYDYEFDILSLFAPELVETEAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGGCACGTAAAGTTGCAAGTAGGGGTGGTCAAGGAGGCAGAGAAGCAGGGCACAATCAAGTTGATGAACAACCTGCTGAACAAGTTGCCAATCCTGTTGCACCCGT
TTCCCAAGCTGATCTTACTGCCATGTCGACTCGGCTAGAGCAGATGTTTAAAGACACCGTGACTAAAGTGTTGGCACAACATCAGCTAGCCCAAGCAGCCCCTAGTCAGG
GTCAAGCCGCACCACAGGATCTGCCGAATCAGTTGTCAGCTGAGGCTAAGCATTTGAGGGATTTCAGAATATATGATCCACAAACTTTTAATGGATTGTCGGAGGATCCC
ATTAACGTGATGTTATGGTTATCATCTGTGGAGAGAATCTTTCATTACATGAAATGTCCTGACAACCAGAAGGTTCAGTGTGCTGTGTTCCTGCTGAGGGAGAGAGCTGC
CATTTGGTGGCTTTCAGTAGAAAGGATGCTTGGTGGTAATGTTAACCAGTTTACGTGGGACCAGTTCAAGGAGAGTTTTTACGCTAAGTTTTTTCCTGCTAGCCTCAGGG
ATGACAAACGTCAAGAGTTCATAGACCTAAAGCAAGGTCAAATGACACTTGAGGAGTACGATTATGAGTTTGACATACTATCCCTCTTTGCCCCTGAGTTGGTTGAGACC
GAGGCTGCCAGAGCTCAGAGGTTTGTTTGGGGTCTGAAAAATGATCTTCGAGGTTTTGTTCGAGCCTTAAAACCAGCCACCCAAACTGAAGCATTACGGATAGCAATGGA
CTTGAGTGCCCATAAGGATGATGATCCACCGAAGGTATCAAGGAATGGACCATCTTCAGGTCAGAAGAGAAAGGCTGAGCAGAAGCCTATCGATGTTTCATGGAGAAACT
TGAGGTAA
mRNA sequenceShow/hide mRNA sequence
TAAAATTTTGCTAGAAAGAAAAGAAAAAGGAAATTTGTTTCATCTTCTTCTCCATCAAGCCCACGCCTCCCTCCTCTCCACTTTGCCGCCGCTCACCGCGGTAAGTTTGG
GCGAGTGTTTTTGGATCTTTATGCTTACGGTTGGTTTTGAATTACCCATTAAAGGTTGGGTGCAATTGGAACCCTCCTGATTCATTTCTTGATTTTCCATCAAGTTTGAG
ATTTAGTTCAACATCCAATTTGGGAAAATGACGGCACGTAAAGTTGCAAGTAGGGGTGGTCAAGGAGGCAGAGAAGCAGGGCACAATCAAGTTGATGAACAACCTGCTGA
ACAAGTTGCCAATCCTGTTGCACCCGTTTCCCAAGCTGATCTTACTGCCATGTCGACTCGGCTAGAGCAGATGTTTAAAGACACCGTGACTAAAGTGTTGGCACAACATC
AGCTAGCCCAAGCAGCCCCTAGTCAGGGTCAAGCCGCACCACAGGATCTGCCGAATCAGTTGTCAGCTGAGGCTAAGCATTTGAGGGATTTCAGAATATATGATCCACAA
ACTTTTAATGGATTGTCGGAGGATCCCATTAACGTGATGTTATGGTTATCATCTGTGGAGAGAATCTTTCATTACATGAAATGTCCTGACAACCAGAAGGTTCAGTGTGC
TGTGTTCCTGCTGAGGGAGAGAGCTGCCATTTGGTGGCTTTCAGTAGAAAGGATGCTTGGTGGTAATGTTAACCAGTTTACGTGGGACCAGTTCAAGGAGAGTTTTTACG
CTAAGTTTTTTCCTGCTAGCCTCAGGGATGACAAACGTCAAGAGTTCATAGACCTAAAGCAAGGTCAAATGACACTTGAGGAGTACGATTATGAGTTTGACATACTATCC
CTCTTTGCCCCTGAGTTGGTTGAGACCGAGGCTGCCAGAGCTCAGAGGTTTGTTTGGGGTCTGAAAAATGATCTTCGAGGTTTTGTTCGAGCCTTAAAACCAGCCACCCA
AACTGAAGCATTACGGATAGCAATGGACTTGAGTGCCCATAAGGATGATGATCCACCGAAGGTATCAAGGAATGGACCATCTTCAGGTCAGAAGAGAAAGGCTGAGCAGA
AGCCTATCGATGTTTCATGGAGAAACTTGAGGTAAGGTGCGGCACTTCGCTGCTATCAACAGGAGTTTGTTGAGGCGGATAGAACTTTGTGAGAGAGATACCTTATTGTA
GTACCATTTAGGTTGCTGTGTGTCGGAGACTATGGCTTTCTTCAAACTCAAGGGGAAAGATTTGAAAATTCTATAAATAAGCTTTTGAGACGCAGAAGATGAAAAACTCT
AGCAATTTTCTTTGACTCACTCACACATATACACTTTTCAGCC
Protein sequenceShow/hide protein sequence
MTARKVASRGGQGGREAGHNQVDEQPAEQVANPVAPVSQADLTAMSTRLEQMFKDTVTKVLAQHQLAQAAPSQGQAAPQDLPNQLSAEAKHLRDFRIYDPQTFNGLSEDP
INVMLWLSSVERIFHYMKCPDNQKVQCAVFLLRERAAIWWLSVERMLGGNVNQFTWDQFKESFYAKFFPASLRDDKRQEFIDLKQGQMTLEEYDYEFDILSLFAPELVET
EAARAQRFVWGLKNDLRGFVRALKPATQTEALRIAMDLSAHKDDDPPKVSRNGPSSGQKRKAEQKPIDVSWRNLR