CuGenDBv2

Moc08g31450 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g31450
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Gag/pol protein
Genome location: chr8:22643552..22652965
RNA-Seq Expression: Moc08g31450
Synteny: Moc08g31450
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008234 - cysteine-type peptidase activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001878 - Zinc finger, CCHC-type
  IPR036875 - Zinc finger, CCHC-type superfamily


Homology

GenBank top hits (e value, % identity, alignment)
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 1.5e-91 | % identity: 71.54
Query:  EANDKAKVYILASISDVLSKKHEDTITAKEIMDLLHSMFGQPSSQARNEALKFIYNSLMKEGSLEREHVLNLMVHFNVAESNEAVIDEQSQVSFILEPLP
        +AN+KA+ YILAS+S+VL+KKHE  +TA+EIMD L  MFGQ S Q +++ALK+IYN+ M EG+  REHVLN+MVHFNVAE N AVIDE SQVSFILE LP
Subjt:  EANDKAKVYILASISDVLSKKHEDTITAKEIMDLLHSMFGQPSSQARNEALKFIYNSLMKEGSLEREHVLNLMVHFNVAESNEAVIDEQSQVSFILEPLP

Query:  KSFLPFCSNAVINKLEYTLTTLLNELQTYQSLMKSKGQEGEANVATS-KRFHRGSSSRIKSASSSSRSKTFKKKKAAGKGSKPDSAAAAAKKGKAKVADK
        +SFL F SNAV+NK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++FHRGS+S  KS  SSS +K +KKKK  G+G+K + AAA   K KAK A K
Subjt:  KSFLPFCSNAVINKLEYTLTTLLNELQTYQSLMKSKGQEGEANVATS-KRFHRGSSSRIKSASSSSRSKTFKKKKAAGKGSKPDSAAAAAKKGKAKVADK

Query:  GKCFHCNMDGHWKHNCPKYLAEKKKANEGKYDLLVLETWLVENDDYAWILDSGATNHVCSSFQGISS
        G CFHCN +GHWK NCPKYLAEKKKA +GKYDLLVLET LVENDD AWI+DSGATNHVCSSFQGISS
Subjt:  GKCFHCNMDGHWKHNCPKYLAEKKKANEGKYDLLVLETWLVENDDYAWILDSGATNHVCSSFQGISS

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 1.5e-91 | % identity: 71.54
Query:  EANDKAKVYILASISDVLSKKHEDTITAKEIMDLLHSMFGQPSSQARNEALKFIYNSLMKEGSLEREHVLNLMVHFNVAESNEAVIDEQSQVSFILEPLP
        +AN+KA+ YILAS+S+VL+KKHE  +TA+EIMD L  MFGQ S Q +++ALK+IYN+ M EG+  REHVLN+MVHFNVAE N AVIDE SQVSFILE LP
Subjt:  EANDKAKVYILASISDVLSKKHEDTITAKEIMDLLHSMFGQPSSQARNEALKFIYNSLMKEGSLEREHVLNLMVHFNVAESNEAVIDEQSQVSFILEPLP

Query:  KSFLPFCSNAVINKLEYTLTTLLNELQTYQSLMKSKGQEGEANVATS-KRFHRGSSSRIKSASSSSRSKTFKKKKAAGKGSKPDSAAAAAKKGKAKVADK
        +SFL F SNAV+NK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++FHRGS+S  KS  SSS +K +KKKK  G+G+K + AAA   K KAK A K
Subjt:  KSFLPFCSNAVINKLEYTLTTLLNELQTYQSLMKSKGQEGEANVATS-KRFHRGSSSRIKSASSSSRSKTFKKKKAAGKGSKPDSAAAAAKKGKAKVADK

Query:  GKCFHCNMDGHWKHNCPKYLAEKKKANEGKYDLLVLETWLVENDDYAWILDSGATNHVCSSFQGISS
        G CFHCN +GHWK NCPKYLAEKKKA +GKYDLLVLET LVENDD AWI+DSGATNHVCSSFQGISS
Subjt:  GKCFHCNMDGHWKHNCPKYLAEKKKANEGKYDLLVLETWLVENDDYAWILDSGATNHVCSSFQGISS

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 1.5e-91 | % identity: 71.54
Query:  EANDKAKVYILASISDVLSKKHEDTITAKEIMDLLHSMFGQPSSQARNEALKFIYNSLMKEGSLEREHVLNLMVHFNVAESNEAVIDEQSQVSFILEPLP
        +AN+KA+ YILAS+S+VL+KKHE  +TA+EIMD L  MFGQ S Q +++ALK+IYN+ M EG+  REHVLN+MVHFNVAE N AVIDE SQVSFILE LP
Subjt:  EANDKAKVYILASISDVLSKKHEDTITAKEIMDLLHSMFGQPSSQARNEALKFIYNSLMKEGSLEREHVLNLMVHFNVAESNEAVIDEQSQVSFILEPLP

Query:  KSFLPFCSNAVINKLEYTLTTLLNELQTYQSLMKSKGQEGEANVATS-KRFHRGSSSRIKSASSSSRSKTFKKKKAAGKGSKPDSAAAAAKKGKAKVADK
        +SFL F SNAV+NK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++FHRGS+S  KS  SSS +K +KKKK  G+G+K + AAA   K KAK A K
Subjt:  KSFLPFCSNAVINKLEYTLTTLLNELQTYQSLMKSKGQEGEANVATS-KRFHRGSSSRIKSASSSSRSKTFKKKKAAGKGSKPDSAAAAAKKGKAKVADK

Query:  GKCFHCNMDGHWKHNCPKYLAEKKKANEGKYDLLVLETWLVENDDYAWILDSGATNHVCSSFQGISS
        G CFHCN +GHWK NCPKYLAEKKKA +GKYDLLVLET LVENDD AWI+DSGATNHVCSSFQGISS
Subjt:  GKCFHCNMDGHWKHNCPKYLAEKKKANEGKYDLLVLETWLVENDDYAWILDSGATNHVCSSFQGISS

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 1.5e-91 | % identity: 71.54
Query:  EANDKAKVYILASISDVLSKKHEDTITAKEIMDLLHSMFGQPSSQARNEALKFIYNSLMKEGSLEREHVLNLMVHFNVAESNEAVIDEQSQVSFILEPLP
        +AN+KA+ YILAS+S+VL+KKHE  +TA+EIMD L  MFGQ S Q +++ALK+IYN+ M EG+  REHVLN+MVHFNVAE N AVIDE SQVSFILE LP
Subjt:  EANDKAKVYILASISDVLSKKHEDTITAKEIMDLLHSMFGQPSSQARNEALKFIYNSLMKEGSLEREHVLNLMVHFNVAESNEAVIDEQSQVSFILEPLP

Query:  KSFLPFCSNAVINKLEYTLTTLLNELQTYQSLMKSKGQEGEANVATS-KRFHRGSSSRIKSASSSSRSKTFKKKKAAGKGSKPDSAAAAAKKGKAKVADK
        +SFL F SNAV+NK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++FHRGS+S  KS  SSS +K +KKKK  G+G+K + AAA   K KAK A K
Subjt:  KSFLPFCSNAVINKLEYTLTTLLNELQTYQSLMKSKGQEGEANVATS-KRFHRGSSSRIKSASSSSRSKTFKKKKAAGKGSKPDSAAAAAKKGKAKVADK

Query:  GKCFHCNMDGHWKHNCPKYLAEKKKANEGKYDLLVLETWLVENDDYAWILDSGATNHVCSSFQGISS
        G CFHCN +GHWK NCPKYLAEKKKA +GKYDLLVLET LVENDD AWI+DSGATNHVCSSFQGISS
Subjt:  GKCFHCNMDGHWKHNCPKYLAEKKKANEGKYDLLVLETWLVENDDYAWILDSGATNHVCSSFQGISS

KAA0062993.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 1.5e-91 | % identity: 71.54
Query:  EANDKAKVYILASISDVLSKKHEDTITAKEIMDLLHSMFGQPSSQARNEALKFIYNSLMKEGSLEREHVLNLMVHFNVAESNEAVIDEQSQVSFILEPLP
        +AN+KA+ YILAS+S+VL+KKHE  +TA+EIMD L  MFGQ S Q +++ALK+IYN+ M EG+  REHVLN+MVHFNVAE N AVIDE SQVSFILE LP
Subjt:  EANDKAKVYILASISDVLSKKHEDTITAKEIMDLLHSMFGQPSSQARNEALKFIYNSLMKEGSLEREHVLNLMVHFNVAESNEAVIDEQSQVSFILEPLP

Query:  KSFLPFCSNAVINKLEYTLTTLLNELQTYQSLMKSKGQEGEANVATS-KRFHRGSSSRIKSASSSSRSKTFKKKKAAGKGSKPDSAAAAAKKGKAKVADK
        +SFL F SNAV+NK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++FHRGS+S  KS  SSS +K +KKKK  G+G+K + AAA   K KAK A K
Subjt:  KSFLPFCSNAVINKLEYTLTTLLNELQTYQSLMKSKGQEGEANVATS-KRFHRGSSSRIKSASSSSRSKTFKKKKAAGKGSKPDSAAAAAKKGKAKVADK

Query:  GKCFHCNMDGHWKHNCPKYLAEKKKANEGKYDLLVLETWLVENDDYAWILDSGATNHVCSSFQGISS
        G CFHCN +GHWK NCPKYLAEKKKA +GKYDLLVLET LVENDD AWI+DSGATNHVCSSFQGISS
Subjt:  GKCFHCNMDGHWKHNCPKYLAEKKKANEGKYDLLVLETWLVENDDYAWILDSGATNHVCSSFQGISS

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein | e value: 7.4e-92 | % identity: 71.54
Query:  EANDKAKVYILASISDVLSKKHEDTITAKEIMDLLHSMFGQPSSQARNEALKFIYNSLMKEGSLEREHVLNLMVHFNVAESNEAVIDEQSQVSFILEPLP
        +AN+KA+ YILAS+S+VL+KKHE  +TA+EIMD L  MFGQ S Q +++ALK+IYN+ M EG+  REHVLN+MVHFNVAE N AVIDE SQVSFILE LP
Subjt:  EANDKAKVYILASISDVLSKKHEDTITAKEIMDLLHSMFGQPSSQARNEALKFIYNSLMKEGSLEREHVLNLMVHFNVAESNEAVIDEQSQVSFILEPLP

Query:  KSFLPFCSNAVINKLEYTLTTLLNELQTYQSLMKSKGQEGEANVATS-KRFHRGSSSRIKSASSSSRSKTFKKKKAAGKGSKPDSAAAAAKKGKAKVADK
        +SFL F SNAV+NK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++FHRGS+S  KS  SSS +K +KKKK  G+G+K + AAA   K KAK A K
Subjt:  KSFLPFCSNAVINKLEYTLTTLLNELQTYQSLMKSKGQEGEANVATS-KRFHRGSSSRIKSASSSSRSKTFKKKKAAGKGSKPDSAAAAAKKGKAKVADK

Query:  GKCFHCNMDGHWKHNCPKYLAEKKKANEGKYDLLVLETWLVENDDYAWILDSGATNHVCSSFQGISS
        G CFHCN +GHWK NCPKYLAEKKKA +GKYDLLVLET LVENDD AWI+DSGATNHVCSSFQGISS
Subjt:  GKCFHCNMDGHWKHNCPKYLAEKKKANEGKYDLLVLETWLVENDDYAWILDSGATNHVCSSFQGISS

A0A5A7TU93 Gag/pol protein | e value: 7.4e-92 | % identity: 71.54
Query:  EANDKAKVYILASISDVLSKKHEDTITAKEIMDLLHSMFGQPSSQARNEALKFIYNSLMKEGSLEREHVLNLMVHFNVAESNEAVIDEQSQVSFILEPLP
        +AN+KA+ YILAS+S+VL+KKHE  +TA+EIMD L  MFGQ S Q +++ALK+IYN+ M EG+  REHVLN+MVHFNVAE N AVIDE SQVSFILE LP
Subjt:  EANDKAKVYILASISDVLSKKHEDTITAKEIMDLLHSMFGQPSSQARNEALKFIYNSLMKEGSLEREHVLNLMVHFNVAESNEAVIDEQSQVSFILEPLP

Query:  KSFLPFCSNAVINKLEYTLTTLLNELQTYQSLMKSKGQEGEANVATS-KRFHRGSSSRIKSASSSSRSKTFKKKKAAGKGSKPDSAAAAAKKGKAKVADK
        +SFL F SNAV+NK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++FHRGS+S  KS  SSS +K +KKKK  G+G+K + AAA   K KAK A K
Subjt:  KSFLPFCSNAVINKLEYTLTTLLNELQTYQSLMKSKGQEGEANVATS-KRFHRGSSSRIKSASSSSRSKTFKKKKAAGKGSKPDSAAAAAKKGKAKVADK

Query:  GKCFHCNMDGHWKHNCPKYLAEKKKANEGKYDLLVLETWLVENDDYAWILDSGATNHVCSSFQGISS
        G CFHCN +GHWK NCPKYLAEKKKA +GKYDLLVLET LVENDD AWI+DSGATNHVCSSFQGISS
Subjt:  GKCFHCNMDGHWKHNCPKYLAEKKKANEGKYDLLVLETWLVENDDYAWILDSGATNHVCSSFQGISS

A0A5A7TWB9 Gag/pol protein | e value: 7.4e-92 | % identity: 71.54
Query:  EANDKAKVYILASISDVLSKKHEDTITAKEIMDLLHSMFGQPSSQARNEALKFIYNSLMKEGSLEREHVLNLMVHFNVAESNEAVIDEQSQVSFILEPLP
        +AN+KA+ YILAS+S+VL+KKHE  +TA+EIMD L  MFGQ S Q +++ALK+IYN+ M EG+  REHVLN+MVHFNVAE N AVIDE SQVSFILE LP
Subjt:  EANDKAKVYILASISDVLSKKHEDTITAKEIMDLLHSMFGQPSSQARNEALKFIYNSLMKEGSLEREHVLNLMVHFNVAESNEAVIDEQSQVSFILEPLP

Query:  KSFLPFCSNAVINKLEYTLTTLLNELQTYQSLMKSKGQEGEANVATS-KRFHRGSSSRIKSASSSSRSKTFKKKKAAGKGSKPDSAAAAAKKGKAKVADK
        +SFL F SNAV+NK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++FHRGS+S  KS  SSS +K +KKKK  G+G+K + AAA   K KAK A K
Subjt:  KSFLPFCSNAVINKLEYTLTTLLNELQTYQSLMKSKGQEGEANVATS-KRFHRGSSSRIKSASSSSRSKTFKKKKAAGKGSKPDSAAAAAKKGKAKVADK

Query:  GKCFHCNMDGHWKHNCPKYLAEKKKANEGKYDLLVLETWLVENDDYAWILDSGATNHVCSSFQGISS
        G CFHCN +GHWK NCPKYLAEKKKA +GKYDLLVLET LVENDD AWI+DSGATNHVCSSFQGISS
Subjt:  GKCFHCNMDGHWKHNCPKYLAEKKKANEGKYDLLVLETWLVENDDYAWILDSGATNHVCSSFQGISS

A0A5A7TZD7 Gag/pol protein | e value: 7.4e-92 | % identity: 71.54
Query:  EANDKAKVYILASISDVLSKKHEDTITAKEIMDLLHSMFGQPSSQARNEALKFIYNSLMKEGSLEREHVLNLMVHFNVAESNEAVIDEQSQVSFILEPLP
        +AN+KA+ YILAS+S+VL+KKHE  +TA+EIMD L  MFGQ S Q +++ALK+IYN+ M EG+  REHVLN+MVHFNVAE N AVIDE SQVSFILE LP
Subjt:  EANDKAKVYILASISDVLSKKHEDTITAKEIMDLLHSMFGQPSSQARNEALKFIYNSLMKEGSLEREHVLNLMVHFNVAESNEAVIDEQSQVSFILEPLP

Query:  KSFLPFCSNAVINKLEYTLTTLLNELQTYQSLMKSKGQEGEANVATS-KRFHRGSSSRIKSASSSSRSKTFKKKKAAGKGSKPDSAAAAAKKGKAKVADK
        +SFL F SNAV+NK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++FHRGS+S  KS  SSS +K +KKKK  G+G+K + AAA   K KAK A K
Subjt:  KSFLPFCSNAVINKLEYTLTTLLNELQTYQSLMKSKGQEGEANVATS-KRFHRGSSSRIKSASSSSRSKTFKKKKAAGKGSKPDSAAAAAKKGKAKVADK

Query:  GKCFHCNMDGHWKHNCPKYLAEKKKANEGKYDLLVLETWLVENDDYAWILDSGATNHVCSSFQGISS
        G CFHCN +GHWK NCPKYLAEKKKA +GKYDLLVLET LVENDD AWI+DSGATNHVCSSFQGISS
Subjt:  GKCFHCNMDGHWKHNCPKYLAEKKKANEGKYDLLVLETWLVENDDYAWILDSGATNHVCSSFQGISS

A0A5A7V4M1 Gag/pol protein | e value: 7.4e-92 | % identity: 71.54
Query:  EANDKAKVYILASISDVLSKKHEDTITAKEIMDLLHSMFGQPSSQARNEALKFIYNSLMKEGSLEREHVLNLMVHFNVAESNEAVIDEQSQVSFILEPLP
        +AN+KA+ YILAS+S+VL+KKHE  +TA+EIMD L  MFGQ S Q +++ALK+IYN+ M EG+  REHVLN+MVHFNVAE N AVIDE SQVSFILE LP
Subjt:  EANDKAKVYILASISDVLSKKHEDTITAKEIMDLLHSMFGQPSSQARNEALKFIYNSLMKEGSLEREHVLNLMVHFNVAESNEAVIDEQSQVSFILEPLP

Query:  KSFLPFCSNAVINKLEYTLTTLLNELQTYQSLMKSKGQEGEANVATS-KRFHRGSSSRIKSASSSSRSKTFKKKKAAGKGSKPDSAAAAAKKGKAKVADK
        +SFL F SNAV+NK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++FHRGS+S  KS  SSS +K +KKKK  G+G+K + AAA   K KAK A K
Subjt:  KSFLPFCSNAVINKLEYTLTTLLNELQTYQSLMKSKGQEGEANVATS-KRFHRGSSSRIKSASSSSRSKTFKKKKAAGKGSKPDSAAAAAKKGKAKVADK

Query:  GKCFHCNMDGHWKHNCPKYLAEKKKANEGKYDLLVLETWLVENDDYAWILDSGATNHVCSSFQGISS
        G CFHCN +GHWK NCPKYLAEKKKA +GKYDLLVLET LVENDD AWI+DSGATNHVCSSFQGISS
Subjt:  GKCFHCNMDGHWKHNCPKYLAEKKKANEGKYDLLVLETWLVENDDYAWILDSGATNHVCSSFQGISS

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found
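
The % identity reported for the hits above can be related to the alignment midlines. The sketch below assumes the usual BLAST display convention (a letter in the midline marks an identical column, `+` a conservative substitution, a space a mismatch or gap) and assumes the three displayed blocks are the complete alignment (100 + 100 + 67 columns). The function names are illustrative and not part of CuGenDB.

```python
# Midlines copied from the first GenBank hit (KAA0044955.1); the same
# midlines are shown for every hit in this record.
MIDLINES = [
    "+AN+KA+ YILAS+S+VL+KKHE  +TA+EIMD L  MFGQ S Q +++ALK+IYN+ M EG+  REHVLN+MVHFNVAE N AVIDE SQVSFILE LP",
    "+SFL F SNAV+NK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++FHRGS+S  KS  SSS +K +KKKK  G+G+K + AAA   K KAK A K",
    "G CFHCN +GHWK NCPKYLAEKKKA +GKYDLLVLET LVENDD AWI+DSGATNHVCSSFQGISS",
]

# Assumption: the alignment length is the sum of the displayed query block widths.
ALIGNED_COLUMNS = 100 + 100 + 67


def count_identical(midlines):
    """Count identical columns: every letter in a midline is one identity."""
    return sum(1 for line in midlines for ch in line if ch.isalpha())


def percent_identity(midlines, aligned_columns):
    """Percent identity over the aligned columns, rounded to two decimals."""
    return round(100.0 * count_identical(midlines) / aligned_columns, 2)


if __name__ == "__main__":
    print(count_identical(MIDLINES), "identical columns of", ALIGNED_COLUMNS)
    print(percent_identity(MIDLINES, ALIGNED_COLUMNS), "% identity")
```

Counting letters rather than positions keeps the result independent of any whitespace lost when the page was copied.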

Sequences
CDS sequence
ATGGGGGAGTCGAGGAGTCGTCGGTCGTCGGTCGTCAATGCCGGATCTGATGTGGGCGTCGTCGGATCTGATGTAGAGAGTCGTTGGATTTCAGATTCGTGGCGGATGGC
TGTAGTTGGAAGGATACATGAGTCCGATTTAGAGGGGAGACGACGGGATGGTGGCTACCAGGAGCTCTCCGAATCTGTGATGGATGCGGTAGGACAAAACTCCACAGCGG
ACGAGGAATGGTTAGTTAATGAGGAAGCCATGTTAGTTGAACCTGACACGAAGGTGGACGTGGACATGTTGACGGCACCAGGATTAGTTACTTCATGTACGCCAGTTTGT
GACCAGTTAACTGTGGATCATGACATTTCAAGGACAATTCCACAATCTCATTTTGATATTGGCACTGGTGACGCCCGGGTTATTGCTCACTCTCATAGAGATGCACCAGA
AGGGGAGGCTGGAGCCTTCACAGACGCCTTCAGGCATATGGAGAATGGGTTGAATTGCATGTGGTCAAATACGGAGAAGGAGTTCACGTTGATGAGAACGGAGCTGGTTG
TGAGATTAGGCTTTCCATCCACACGTACCCAATGTAGAACAATCAACCGCCAGCATAATAGGGGCATGGATGTGCTGTGTGGCGACATAGGCACCTTACGAGTAACAACC
AAGCCCAAGAAAGATACAGCATTCTCTCAATCACACTTTCTCAATTCTCCCTCACGTTCCAAACCGACGCTCCCACAAGAGAGATCGTCGCACCCTGAGGATAGTAAGGA
AGATCGTTTGATGGTGTTCGGAGGATTCATTGAAGAAGAGTTCTTAAAGTCCGGGTCGCTCGAAAGCTGTTTGTGGCGTCGCCATGGTGCTGTCGAAGAACAGCATACAG
CGCCGCAGCGCAGCACTATAGTGCTGCGGCGCTGTGCTGTGCTGCAGCAGCACCGCAGCGCTGCCCTTAGGCTCCGCGACGCTGTCTTGGGTGTTCTCCGACCTGGCTCT
AGTTCGCAGTTCGAGGGGCGATTTCGGATGGAACCCTGTCTAAGCAAGGTTTTGCCTAAGGTTGAGGTACTTAAACTGACCTTCCTTGCTCGGCCTCGTGTCGCACTGAG
CGCGACCTCCCTACGGAAAGTGTTTGCATGGTTCAATATCGAGGCCAATGACAAGGCTAAGGTATACATCTTGGCGAGCATATCTGATGTGCTTTCTAAAAAGCACGAGG
ACACGATCACTGCTAAGGAGATAATGGACTTGCTGCACAGCATGTTTGGACAACCGTCCTCACAAGCTCGAAACGAAGCCCTTAAGTTCATTTACAACTCCCTTATGAAG
GAGGGCTCCTTAGAGCGAGAACACGTTCTCAACCTGATGGTCCACTTCAATGTAGCAGAGTCGAACGAGGCTGTCATAGACGAACAGAGTCAGGTCAGCTTCATTCTGGA
ACCTCTTCCGAAGAGTTTCCTACCATTTTGCAGTAATGCGGTTATTAATAAGTTGGAATACACTCTTACCACACTCTTAAACGAGTTGCAGACCTACCAGTCTCTTATGA
AAAGTAAGGGACAAGAAGGGGAGGCAAATGTTGCCACCTCAAAGAGGTTCCATCGAGGTTCGTCCTCTAGAATAAAGTCTGCGTCCTCTTCTTCTAGAAGTAAGACTTTC
AAGAAGAAGAAGGCTGCTGGTAAGGGGTCTAAACCTGATTCCGCTGCTGCCGCTGCTAAGAAAGGCAAGGCCAAGGTTGCAGACAAAGGAAAATGTTTCCACTGCAACAT
GGACGGGCATTGGAAGCATAACTGCCCGAAATACCTGGCCGAAAAGAAGAAAGCCAACGAAGGTAAATATGATTTACTTGTTTTGGAAACATGGTTAGTGGAGAACGATG
ACTACGCTTGGATATTGGATTCAGGAGCCACTAATCACGTTTGTTCTTCGTTTCAGGGAATTAGTTCCTAG
mRNA sequence
ATGGGGGAGTCGAGGAGTCGTCGGTCGTCGGTCGTCAATGCCGGATCTGATGTGGGCGTCGTCGGATCTGATGTAGAGAGTCGTTGGATTTCAGATTCGTGGCGGATGGC
TGTAGTTGGAAGGATACATGAGTCCGATTTAGAGGGGAGACGACGGGATGGTGGCTACCAGGAGCTCTCCGAATCTGTGATGGATGCGGTAGGACAAAACTCCACAGCGG
ACGAGGAATGGTTAGTTAATGAGGAAGCCATGTTAGTTGAACCTGACACGAAGGTGGACGTGGACATGTTGACGGCACCAGGATTAGTTACTTCATGTACGCCAGTTTGT
GACCAGTTAACTGTGGATCATGACATTTCAAGGACAATTCCACAATCTCATTTTGATATTGGCACTGGTGACGCCCGGGTTATTGCTCACTCTCATAGAGATGCACCAGA
AGGGGAGGCTGGAGCCTTCACAGACGCCTTCAGGCATATGGAGAATGGGTTGAATTGCATGTGGTCAAATACGGAGAAGGAGTTCACGTTGATGAGAACGGAGCTGGTTG
TGAGATTAGGCTTTCCATCCACACGTACCCAATGTAGAACAATCAACCGCCAGCATAATAGGGGCATGGATGTGCTGTGTGGCGACATAGGCACCTTACGAGTAACAACC
AAGCCCAAGAAAGATACAGCATTCTCTCAATCACACTTTCTCAATTCTCCCTCACGTTCCAAACCGACGCTCCCACAAGAGAGATCGTCGCACCCTGAGGATAGTAAGGA
AGATCGTTTGATGGTGTTCGGAGGATTCATTGAAGAAGAGTTCTTAAAGTCCGGGTCGCTCGAAAGCTGTTTGTGGCGTCGCCATGGTGCTGTCGAAGAACAGCATACAG
CGCCGCAGCGCAGCACTATAGTGCTGCGGCGCTGTGCTGTGCTGCAGCAGCACCGCAGCGCTGCCCTTAGGCTCCGCGACGCTGTCTTGGGTGTTCTCCGACCTGGCTCT
AGTTCGCAGTTCGAGGGGCGATTTCGGATGGAACCCTGTCTAAGCAAGGTTTTGCCTAAGGTTGAGGTACTTAAACTGACCTTCCTTGCTCGGCCTCGTGTCGCACTGAG
CGCGACCTCCCTACGGAAAGTGTTTGCATGGTTCAATATCGAGGCCAATGACAAGGCTAAGGTATACATCTTGGCGAGCATATCTGATGTGCTTTCTAAAAAGCACGAGG
ACACGATCACTGCTAAGGAGATAATGGACTTGCTGCACAGCATGTTTGGACAACCGTCCTCACAAGCTCGAAACGAAGCCCTTAAGTTCATTTACAACTCCCTTATGAAG
GAGGGCTCCTTAGAGCGAGAACACGTTCTCAACCTGATGGTCCACTTCAATGTAGCAGAGTCGAACGAGGCTGTCATAGACGAACAGAGTCAGGTCAGCTTCATTCTGGA
ACCTCTTCCGAAGAGTTTCCTACCATTTTGCAGTAATGCGGTTATTAATAAGTTGGAATACACTCTTACCACACTCTTAAACGAGTTGCAGACCTACCAGTCTCTTATGA
AAAGTAAGGGACAAGAAGGGGAGGCAAATGTTGCCACCTCAAAGAGGTTCCATCGAGGTTCGTCCTCTAGAATAAAGTCTGCGTCCTCTTCTTCTAGAAGTAAGACTTTC
AAGAAGAAGAAGGCTGCTGGTAAGGGGTCTAAACCTGATTCCGCTGCTGCCGCTGCTAAGAAAGGCAAGGCCAAGGTTGCAGACAAAGGAAAATGTTTCCACTGCAACAT
GGACGGGCATTGGAAGCATAACTGCCCGAAATACCTGGCCGAAAAGAAGAAAGCCAACGAAGGTAAATATGATTTACTTGTTTTGGAAACATGGTTAGTGGAGAACGATG
ACTACGCTTGGATATTGGATTCAGGAGCCACTAATCACGTTTGTTCTTCGTTTCAGGGAATTAGTTCCTAG
Protein sequence
MGESRSRRSSVVNAGSDVGVVGSDVESRWISDSWRMAVVGRIHESDLEGRRRDGGYQELSESVMDAVGQNSTADEEWLVNEEAMLVEPDTKVDVDMLTAPGLVTSCTPVC
DQLTVDHDISRTIPQSHFDIGTGDARVIAHSHRDAPEGEAGAFTDAFRHMENGLNCMWSNTEKEFTLMRTELVVRLGFPSTRTQCRTINRQHNRGMDVLCGDIGTLRVTT
KPKKDTAFSQSHFLNSPSRSKPTLPQERSSHPEDSKEDRLMVFGGFIEEEFLKSGSLESCLWRRHGAVEEQHTAPQRSTIVLRRCAVLQQHRSAALRLRDAVLGVLRPGS
SSQFEGRFRMEPCLSKVLPKVEVLKLTFLARPRVALSATSLRKVFAWFNIEANDKAKVYILASISDVLSKKHEDTITAKEIMDLLHSMFGQPSSQARNEALKFIYNSLMK
EGSLEREHVLNLMVHFNVAESNEAVIDEQSQVSFILEPLPKSFLPFCSNAVINKLEYTLTTLLNELQTYQSLMKSKGQEGEANVATSKRFHRGSSSRIKSASSSSRSKTF
KKKKAAGKGSKPDSAAAAAKKGKAKVADKGKCFHCNMDGHWKHNCPKYLAEKKKANEGKYDLLVLETWLVENDDYAWILDSGATNHVCSSFQGISS
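
As a consistency check between the CDS and protein sequences listed above, the CDS can be translated and compared with the protein. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1). `translate` is an illustrative helper rather than a CuGenDB tool, and only the first 24 and last 21 nucleotides of the CDS are used here.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS and drop any trailing stop codon symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.rstrip("*")


if __name__ == "__main__":
    cds_start = "ATGGGGGAGTCGAGGAGTCGTCGG"  # first 24 nt of the CDS above
    cds_end = "TTTCAGGGAATTAGTTCCTAG"       # last 21 nt, including the stop codon
    print(translate(cds_start))  # MGESRSRR, the start of the protein sequence
    print(translate(cds_end))    # FQGISS, the end of the protein sequence
```

Passing the full CDS (with line breaks removed) to `translate` should give the 646-residue protein if the two records agree. The CDS is 1941 nt long, which is 647 codons including the stop.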