; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh05G011710 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh05G011710
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionMog1/PsbP/DUF1795-like photosystem II reaction center PsbP family protein
Genome locationCmo_Chr05:9241107..9247353
RNA-Seq ExpressionCmoCh05G011710
SyntenyCmoCh05G011710
Gene Ontology termsNA
InterPro domainsIPR016123 - Mog1/PsbP, alpha/beta/alpha sandwich


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022946289.1 uncharacterized protein LOC111450417 isoform X1 [Cucurbita moschata]9.5e-13596.12Show/hide
Query:  MALLLSLCLHPPNLQQNPTAAVPHPTTVGFHEPTTLNSNSKRHFILKTASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPPQ
        MALLLSLCLHPPNLQQNPTAAVPHPTTVGFHEPTTLNSNSKRHFILKTASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPPQ
Subjt:  MALLLSLCLHPPNLQQNPTAAVPHPTTVGFHEPTTLNSNSKRHFILKTASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPPQ

Query:  FEDLTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLE----------AKDITDIGSLREAAKIFVPGGATLFSARTFKIKDD
        FEDLTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLE          AKDITDIGSLREAAKIFVPGGATLFSARTFKIKDD
Subjt:  FEDLTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLE----------AKDITDIGSLREAAKIFVPGGATLFSARTFKIKDD

Query:  EGFRTYYFYEFGKNEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL
        EGFRTYYFYEFGKNEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL
Subjt:  EGFRTYYFYEFGKNEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL

XP_022946291.1 uncharacterized protein LOC111450417 isoform X2 [Cucurbita moschata]3.5e-137100Show/hide
Query:  MALLLSLCLHPPNLQQNPTAAVPHPTTVGFHEPTTLNSNSKRHFILKTASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPPQ
        MALLLSLCLHPPNLQQNPTAAVPHPTTVGFHEPTTLNSNSKRHFILKTASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPPQ
Subjt:  MALLLSLCLHPPNLQQNPTAAVPHPTTVGFHEPTTLNSNSKRHFILKTASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPPQ

Query:  FEDLTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLEAKDITDIGSLREAAKIFVPGGATLFSARTFKIKDDEGFRTYYFYE
        FEDLTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLEAKDITDIGSLREAAKIFVPGGATLFSARTFKIKDDEGFRTYYFYE
Subjt:  FEDLTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLEAKDITDIGSLREAAKIFVPGGATLFSARTFKIKDDEGFRTYYFYE

Query:  FGKNEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL
        FGKNEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL
Subjt:  FGKNEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL

XP_022999127.1 uncharacterized protein LOC111493605 isoform X1 [Cucurbita maxima]5.2e-13396.06Show/hide
Query:  LLSLCLHPPNLQQNPTAAVPHPTTVGFHEPTTLNSNSKRHFILKTASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPPQFED
        LLSLCLHPPNLQQNPTAAVPHPTTVGFHEPTTLNSNSKRHFILK+ASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPPQFED
Subjt:  LLSLCLHPPNLQQNPTAAVPHPTTVGFHEPTTLNSNSKRHFILKTASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPPQFED

Query:  LTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLE---------AKDITDIGSLREAAKIFVPGGATLFSARTFKIKDDEGFR
        LTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLE         AKDITDIGSLREAAKIFVPGGATLFSARTFKIKDDEGFR
Subjt:  LTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLE---------AKDITDIGSLREAAKIFVPGGATLFSARTFKIKDDEGFR

Query:  TYYFYEFGKNEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL
        TYYFYEFGKNEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL
Subjt:  TYYFYEFGKNEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL

XP_022999130.1 uncharacterized protein LOC111493605 isoform X2 [Cucurbita maxima]2.5e-13599.59Show/hide
Query:  LLSLCLHPPNLQQNPTAAVPHPTTVGFHEPTTLNSNSKRHFILKTASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPPQFED
        LLSLCLHPPNLQQNPTAAVPHPTTVGFHEPTTLNSNSKRHFILK+ASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPPQFED
Subjt:  LLSLCLHPPNLQQNPTAAVPHPTTVGFHEPTTLNSNSKRHFILKTASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPPQFED

Query:  LTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLEAKDITDIGSLREAAKIFVPGGATLFSARTFKIKDDEGFRTYYFYEFGK
        LTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLEAKDITDIGSLREAAKIFVPGGATLFSARTFKIKDDEGFRTYYFYEFGK
Subjt:  LTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLEAKDITDIGSLREAAKIFVPGGATLFSARTFKIKDDEGFRTYYFYEFGK

Query:  NEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL
        NEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL
Subjt:  NEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL

XP_023521934.1 uncharacterized protein LOC111785779 [Cucurbita pepo subsp. pepo]1.0e-13699.6Show/hide
Query:  MALLLSLCLHPPNLQQNPTAAVPHPTTVGFHEPTTLNSNSKRHFILKTASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPPQ
        MALLLSLCLHPPNLQQNPTAAVPHPTTVGFHEPT+LNSNSKRHFILKTASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPPQ
Subjt:  MALLLSLCLHPPNLQQNPTAAVPHPTTVGFHEPTTLNSNSKRHFILKTASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPPQ

Query:  FEDLTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLEAKDITDIGSLREAAKIFVPGGATLFSARTFKIKDDEGFRTYYFYE
        FEDLTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLEAKDITDIGSLREAAKIFVPGGATLFSARTFKIKDDEGFRTYYFYE
Subjt:  FEDLTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLEAKDITDIGSLREAAKIFVPGGATLFSARTFKIKDDEGFRTYYFYE

Query:  FGKNEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL
        FGKNEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL
Subjt:  FGKNEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL

TrEMBL top hitse value%identityAlignment
A0A0A0LJH6 Uncharacterized protein3.2e-12088.35Show/hide
Query:  MALLLSLCL-HPPNLQQNPTAAVPHPTTVGFHEPTTLNSNSKRHFILKTASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPP
        MA+ LSLC+ HPPN  QNPT+ VP PTT+ F  PT+LNS SKRHFILKTASLCLIS IPKCPV QSS+ SPTSK GLP +ANTKSWFQFYGDGFSIRVPP
Subjt:  MALLLSLCL-HPPNLQQNPTAAVPHPTTVGFHEPTTLNSNSKRHFILKTASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPP

Query:  QFEDLTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLEAKDITDIGSLREAAKIFVPGGATLFSARTFKIKDDEGFRTYYFY
        QFEDLTEPED++AGLSLYGDK KTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLEAKDITDIGSLREAAKIFVPGG+TLFSARTFKIK+DEGFRTYYFY
Subjt:  QFEDLTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLEAKDITDIGSLREAAKIFVPGGATLFSARTFKIKDDEGFRTYYFY

Query:  EFGKNEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL
        EFGKNE+HVALVATVNSGQ FVAGATAPLSKWDEDGIKLRSAAISLTVL
Subjt:  EFGKNEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL

A0A6J1G382 uncharacterized protein LOC111450417 isoform X21.7e-137100Show/hide
Query:  MALLLSLCLHPPNLQQNPTAAVPHPTTVGFHEPTTLNSNSKRHFILKTASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPPQ
        MALLLSLCLHPPNLQQNPTAAVPHPTTVGFHEPTTLNSNSKRHFILKTASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPPQ
Subjt:  MALLLSLCLHPPNLQQNPTAAVPHPTTVGFHEPTTLNSNSKRHFILKTASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPPQ

Query:  FEDLTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLEAKDITDIGSLREAAKIFVPGGATLFSARTFKIKDDEGFRTYYFYE
        FEDLTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLEAKDITDIGSLREAAKIFVPGGATLFSARTFKIKDDEGFRTYYFYE
Subjt:  FEDLTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLEAKDITDIGSLREAAKIFVPGGATLFSARTFKIKDDEGFRTYYFYE

Query:  FGKNEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL
        FGKNEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL
Subjt:  FGKNEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL

A0A6J1G3F8 uncharacterized protein LOC111450417 isoform X14.6e-13596.12Show/hide
Query:  MALLLSLCLHPPNLQQNPTAAVPHPTTVGFHEPTTLNSNSKRHFILKTASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPPQ
        MALLLSLCLHPPNLQQNPTAAVPHPTTVGFHEPTTLNSNSKRHFILKTASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPPQ
Subjt:  MALLLSLCLHPPNLQQNPTAAVPHPTTVGFHEPTTLNSNSKRHFILKTASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPPQ

Query:  FEDLTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLE----------AKDITDIGSLREAAKIFVPGGATLFSARTFKIKDD
        FEDLTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLE          AKDITDIGSLREAAKIFVPGGATLFSARTFKIKDD
Subjt:  FEDLTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLE----------AKDITDIGSLREAAKIFVPGGATLFSARTFKIKDD

Query:  EGFRTYYFYEFGKNEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL
        EGFRTYYFYEFGKNEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL
Subjt:  EGFRTYYFYEFGKNEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL

A0A6J1KC82 uncharacterized protein LOC111493605 isoform X21.2e-13599.59Show/hide
Query:  LLSLCLHPPNLQQNPTAAVPHPTTVGFHEPTTLNSNSKRHFILKTASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPPQFED
        LLSLCLHPPNLQQNPTAAVPHPTTVGFHEPTTLNSNSKRHFILK+ASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPPQFED
Subjt:  LLSLCLHPPNLQQNPTAAVPHPTTVGFHEPTTLNSNSKRHFILKTASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPPQFED

Query:  LTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLEAKDITDIGSLREAAKIFVPGGATLFSARTFKIKDDEGFRTYYFYEFGK
        LTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLEAKDITDIGSLREAAKIFVPGGATLFSARTFKIKDDEGFRTYYFYEFGK
Subjt:  LTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLEAKDITDIGSLREAAKIFVPGGATLFSARTFKIKDDEGFRTYYFYEFGK

Query:  NEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL
        NEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL
Subjt:  NEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL

A0A6J1KIQ9 uncharacterized protein LOC111493605 isoform X12.5e-13396.06Show/hide
Query:  LLSLCLHPPNLQQNPTAAVPHPTTVGFHEPTTLNSNSKRHFILKTASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPPQFED
        LLSLCLHPPNLQQNPTAAVPHPTTVGFHEPTTLNSNSKRHFILK+ASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPPQFED
Subjt:  LLSLCLHPPNLQQNPTAAVPHPTTVGFHEPTTLNSNSKRHFILKTASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPPQFED

Query:  LTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLE---------AKDITDIGSLREAAKIFVPGGATLFSARTFKIKDDEGFR
        LTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLE         AKDITDIGSLREAAKIFVPGGATLFSARTFKIKDDEGFR
Subjt:  LTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLE---------AKDITDIGSLREAAKIFVPGGATLFSARTFKIKDDEGFR

Query:  TYYFYEFGKNEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL
        TYYFYEFGKNEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL
Subjt:  TYYFYEFGKNEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G27390.1 Mog1/PsbP/DUF1795-like photosystem II reaction center PsbP family protein3.0e-7859.06Show/hide
Query:  MALLL-SLCLHPPNLQ-QNPTAAVPHPTTVGFHEPTTLNSNS--KRHFILKTASLCLISLI--PKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFS
        MA+LL SL LHPPN + QNP            ++P  L+S++  +R  +L+TASLC +S I   + P + +  +  T    L G+ANTKSWFQ++G GF+
Subjt:  MALLL-SLCLHPPNLQ-QNPTAAVPHPTTVGFHEPTTLNSNS--KRHFILKTASLCLISLI--PKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFS

Query:  IRVPPQFEDLTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLEAKDITDIGSLREAAKIFVPGGATLFSARTFKIKDDEGFR
        IRVPP FED+ EPED++AGLSLYGDK K +TFAARF +PDGSEVLSVV RP+NQLKITFLEAKDI+D+GSL+ AA++FVPG AT++SART K+K++EG R
Subjt:  IRVPPQFEDLTEPEDFNAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLEAKDITDIGSLREAAKIFVPGGATLFSARTFKIKDDEGFR

Query:  TYYFYEFGKNEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL
         YYFYEFG++EE +ALVA+VN G+ ++AGA AP SKW +D +KLRSAAIS T+L
Subjt:  TYYFYEFGKNEEHVALVATVNSGQAFVAGATAPLSKWDEDGIKLRSAAISLTVL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCCTTCTTCTTTCTTTATGCCTCCACCCTCCAAACCTTCAGCAAAACCCCACCGCCGCCGTCCCCCACCCGACCACCGTCGGATTTCACGAGCCTACAACGCTGAA
TTCCAATTCCAAGAGACATTTCATCCTCAAAACGGCTTCACTCTGTCTAATTTCATTGATTCCAAAATGCCCAGTTGCCCAATCTTCCCAAATTTCGCCTACTTCGAAAC
AAGGCCTTCCTGGCCTTGCAAATACAAAGTCTTGGTTTCAGTTTTATGGTGATGGGTTCTCCATCCGTGTGCCGCCTCAGTTTGAAGACCTCACGGAGCCTGAGGATTTC
AATGCTGGACTCTCTCTTTATGGAGATAAGGTAAAAACGAAAACGTTCGCAGCTCGGTTTGGATCTCCTGATGGCTCTGAAGTTCTAAGCGTTGTCACTCGTCCAACGAA
TCAGCTTAAGATCACCTTTCTAGAGGCGAAAGATATAACTGATATAGGTTCCTTGAGGGAGGCTGCAAAAATCTTTGTTCCAGGTGGTGCAACGTTATTTTCGGCCCGAA
CGTTTAAAATTAAGGACGATGAAGGTTTCAGGACGTATTACTTCTACGAGTTTGGGAAGAACGAAGAACACGTCGCGTTAGTAGCAACTGTTAATAGTGGACAGGCGTTT
GTTGCCGGAGCAACCGCACCATTGTCCAAATGGGATGAAGATGGCATAAAACTCCGTTCTGCTGCTATATCTCTAACCGTGCTATAA
mRNA sequenceShow/hide mRNA sequence
GGATATGAAAGCTATCCTAATTTCCCTATGGCCCTTCTTCTTTCTTTATGCCTCCACCCTCCAAACCTTCAGCAAAACCCCACCGCCGCCGTCCCCCACCCGACCACCGT
CGGATTTCACGAGCCTACAACGCTGAATTCCAATTCCAAGAGACATTTCATCCTCAAAACGGCTTCACTCTGTCTAATTTCATTGATTCCAAAATGCCCAGTTGCCCAAT
CTTCCCAAATTTCGCCTACTTCGAAACAAGGCCTTCCTGGCCTTGCAAATACAAAGTCTTGGTTTCAGTTTTATGGTGATGGGTTCTCCATCCGTGTGCCGCCTCAGTTT
GAAGACCTCACGGAGCCTGAGGATTTCAATGCTGGACTCTCTCTTTATGGAGATAAGGTAAAAACGAAAACGTTCGCAGCTCGGTTTGGATCTCCTGATGGCTCTGAAGT
TCTAAGCGTTGTCACTCGTCCAACGAATCAGCTTAAGATCACCTTTCTAGAGGCGAAAGATATAACTGATATAGGTTCCTTGAGGGAGGCTGCAAAAATCTTTGTTCCAG
GTGGTGCAACGTTATTTTCGGCCCGAACGTTTAAAATTAAGGACGATGAAGGTTTCAGGACGTATTACTTCTACGAGTTTGGGAAGAACGAAGAACACGTCGCGTTAGTA
GCAACTGTTAATAGTGGACAGGCGTTTGTTGCCGGAGCAACCGCACCATTGTCCAAATGGGATGAAGATGGCATAAAACTCCGTTCTGCTGCTATATCTCTAACCGTGCT
ATAATCAATTGCTGATCAAGTTATTTTCATTAGCTTGAGGAAGCTTTTGGTAAGTCCCCCAGTTGGCAGAGAGACAGCTAGATTTGTGTATACATTTTTGTAAGAATTGT
ACTTTGAAATGGGTATGCTGAGTGATGAGTGGTGCTTGATTAATGTGGTACCCTTTGTTCTTATATGATTGTTGTCAAACTTGAACTCGTTTTTTCTCTTTTGATAAGGT
ATATAGAGATGTCCACGGGATGGAGAGTGGAGACCGGGAAGCTTTCCTTGTCTCTGTCTACATCTCTTGTTTATGTAACAGTTTAAGTTCACCGCTAATGGATATTGTCT
ACTTTGGTCTATTACGTATCGCTATCAGCCTTATGGTTTTAAAACGTGTCTACTAAGGAGAGGTTTCCATACCCTTGTAAGGAATGTCTCATTCCCCTCTCCAATCCACC
CCCTTCGGGCCCAGCGCCCTTGCTGGC
Protein sequenceShow/hide protein sequence
MALLLSLCLHPPNLQQNPTAAVPHPTTVGFHEPTTLNSNSKRHFILKTASLCLISLIPKCPVAQSSQISPTSKQGLPGLANTKSWFQFYGDGFSIRVPPQFEDLTEPEDF
NAGLSLYGDKVKTKTFAARFGSPDGSEVLSVVTRPTNQLKITFLEAKDITDIGSLREAAKIFVPGGATLFSARTFKIKDDEGFRTYYFYEFGKNEEHVALVATVNSGQAF
VAGATAPLSKWDEDGIKLRSAAISLTVL