CuGenDBv2

CmoCh14G013400 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh14G013400
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: photosystem I reaction center subunit VI, chloroplastic-like
Genome location: Cmo_Chr14:11159583..11162624
RNA-Seq Expression: CmoCh14G013400
Synteny: CmoCh14G013400
Gene Ontology terms:
    GO:0015979 - photosynthesis (biological process)
    GO:0009535 - chloroplast thylakoid membrane (cellular component)
    GO:0009538 - photosystem I reaction center (cellular component)
    GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR004928 - Photosystem I PsaH, reaction centre subunit VI
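
The Genome location field packs a chromosome name and a start..end coordinate pair into one string. A rough sketch of how such a string might be split and the locus span derived is shown here. It assumes the coordinates are 1-based and inclusive, which this record does not state, and `parse_location` is a hypothetical helper name.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'chrom:start..end' into its parts (hypothetical helper)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return chrom, start, end

chrom, start, end = parse_location("Cmo_Chr14:11159583..11162624")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption the locus spans 3042 bp, which is longer than the mRNA sequence listed further down and would be consistent with the gene containing intronic sequence.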


Homology
GenBank top hits | e value | % identity
KAG6581890.1 Photosystem I reaction center subunit VI, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] | 1.1e-141 | 97.79
Query:  MWILQDSSTNSSESMVVYSGVDVTGMQSVMTGCDSSSLTILPSGFSILPDGAVSRPPLLITRQKDDKTADTNGGVLLTVAVQILTDASPSAKPTMESVEI
        MWILQDSSTNS ESMVVYSGVDVTGMQSVMTGCDSSSLTILPSGFSILPDGAVSRPPLLITRQKDDKTA+TNGGVLLT AVQILTDASPSAKPTMESVEI
Subjt:  MWILQDSSTNSSESMVVYSGVDVTGMQSVMTGCDSSSLTILPSGFSILPDGAVSRPPLLITRQKDDKTADTNGGVLLTVAVQILTDASPSAKPTMESVEI

Query:  EIPFEASPPPSACNSKFVPFQSKTQTTMASLATLAAVQPVTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLY
        EIPFEAS PPSACNSKFVP QSKTQTTMASLATLAAVQPVTVKGLGGSSLAGTKLPLRP+RQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLY
Subjt:  EIPFEASPPPSACNSKFVPFQSKTQTTMASLATLAAVQPVTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLY

Query:  GSDAPSPYNSLQSKFFETFAAPFTKRGLLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI
        GSDAPSPYNSLQSKFFETFAAPFTKRGLLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI
Subjt:  GSDAPSPYNSLQSKFFETFAAPFTKRGLLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI

KAG7018324.1 Photosystem I reaction center subunit VI, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma] | 3.9e-73 | 99.31
Query:  MASLATLAAVQPVTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG
        MASLATLAAVQPVTVKGLGGSSLAGTKLPLRP+RQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG
Subjt:  MASLATLAAVQPVTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG

Query:  LLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI
        LLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI
Subjt:  LLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI

XP_008447512.1 PREDICTED: photosystem I reaction center subunit VI, chloroplastic [Cucumis melo] | 4.3e-72 | 97.92
Query:  MASLATLAAVQPVTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG
        MASLATLAAVQP TVKGLGGSSLAG KLPLRPSRQSFRPK+FKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG
Subjt:  MASLATLAAVQPVTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG

Query:  LLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI
        LLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI
Subjt:  LLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI

XP_022955723.1 photosystem I reaction center subunit VI, chloroplastic-like [Cucurbita moschata] | 1.7e-73 | 100
Query:  MASLATLAAVQPVTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG
        MASLATLAAVQPVTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG
Subjt:  MASLATLAAVQPVTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG

Query:  LLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI
        LLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI
Subjt:  LLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI

XP_023529177.1 photosystem I reaction center subunit VI, chloroplastic-like [Cucurbita pepo subsp. pepo] | 1.5e-72 | 99.31
Query:  MASLATLAAVQPVTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG
        MASLATLAAVQPVTVKGLGGSSLAGTKLPLRPSRQS RPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG
Subjt:  MASLATLAAVQPVTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG

Query:  LLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI
        LLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI
Subjt:  LLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI

TrEMBL top hits | e value | % identity
A0A0A0LAW4 Uncharacterized protein | 4.6e-72 | 97.22
Query:  MASLATLAAVQPVTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG
        MASLATLAAVQPVTVKGLGGSSLAG KLPLRPSRQ+FRPK+FKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG
Subjt:  MASLATLAAVQPVTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG

Query:  LLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI
        LLLKFLLLGGGATLAY+SATAPDDVLPIKKGPQLPPKLGPRGKI
Subjt:  LLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI

A0A1S3BI80 photosystem I reaction center subunit VI, chloroplastic | 2.1e-72 | 97.92
Query:  MASLATLAAVQPVTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG
        MASLATLAAVQP TVKGLGGSSLAG KLPLRPSRQSFRPK+FKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG
Subjt:  MASLATLAAVQPVTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG

Query:  LLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI
        LLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI
Subjt:  LLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI

A0A5D3C9C3 Photosystem I reaction center subunit VI | 2.1e-72 | 97.92
Query:  MASLATLAAVQPVTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG
        MASLATLAAVQP TVKGLGGSSLAG KLPLRPSRQSFRPK+FKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG
Subjt:  MASLATLAAVQPVTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG

Query:  LLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI
        LLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI
Subjt:  LLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI

A0A6J1GUS6 photosystem I reaction center subunit VI, chloroplastic-like | 8.4e-74 | 100
Query:  MASLATLAAVQPVTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG
        MASLATLAAVQPVTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG
Subjt:  MASLATLAAVQPVTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG

Query:  LLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI
        LLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI
Subjt:  LLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI

A0A6J1IVS0 photosystem I reaction center subunit VI, chloroplastic-like | 8.4e-74 | 100
Query:  MASLATLAAVQPVTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG
        MASLATLAAVQPVTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG
Subjt:  MASLATLAAVQPVTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG

Query:  LLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI
        LLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI
Subjt:  LLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI

SwissProt top hits | e value | % identity
A2Y7D9 Photosystem I reaction center subunit VI, chloroplastic | 4.6e-53 | 73.97
Query:  LATLAAVQPVTVKGLGGSSLAGTKLPLRPS-----RQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTK
        +A+L AVQPV VKGL GSS++G KL +RPS     R + RP++    AVVAKYG+KSVYFDLED+GNTTGQWDLYGSDAPSPYN LQSKFFETFA PFTK
Subjt:  LATLAAVQPVTVKGLGGSSLAGTKLPLRPS-----RQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTK

Query:  RGLLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI
        RGLLLKFLLLGGG+ +AY SA+A  D+LPIKKGPQLPP  GPRGKI
Subjt:  RGLLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI

O04006 Photosystem I reaction center subunit VI, chloroplastic | 1.3e-60 | 81.38
Query:  MASLATLAAVQPVT-VKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKR
        MAS AT+AAVQP + VKGLGGSSL G KL ++PSRQSF+PKS +AGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYN LQSKFFETFAAPFTKR
Subjt:  MASLATLAAVQPVT-VKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKR

Query:  GLLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI
        GLLLKFL+LGGG+ L Y SA++  DVLPIK+GPQ  PKLGPRGK+
Subjt:  GLLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI

P22179 Photosystem I reaction center subunit VI, chloroplastic | 1.2e-61 | 79.86
Query:  MASLATLAAVQPVTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG
        MASLATLAAVQP T+KGL GSS+AGTKL ++P+RQSF+  + ++GA+VAKYGDKSVYFDLED+ NTTGQWD+YGSDAPSPYNSLQSKFFETFAAPFTKRG
Subjt:  MASLATLAAVQPVTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG

Query:  LLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI
        LLLKFL+LGGG+ L Y SA AP DVLPI +GPQ PPKLGPRGKI
Subjt:  LLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI

Q9SUI6 Photosystem I reaction center subunit VI-2, chloroplastic | 1.3e-60 | 81.38
Query:  MASLATLAAVQP-VTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKR
        MAS AT+AAVQP   VKGLGGSSLAG KL ++PSRQSF+ KS +AGAVVAKYGDKSVYFDLEDLGNTTGQWD+YGSDAPSPYN LQSKFFETFAAPFTKR
Subjt:  MASLATLAAVQP-VTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKR

Query:  GLLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI
        GLLLKFL+LGGG+ L Y SA +  DVLPIK+GPQ PPKLGPRGK+
Subjt:  GLLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI

Q9SUI7 Photosystem I reaction center subunit VI-1, chloroplastic | 5.1e-60 | 79.31
Query:  MASLATLAAVQP-VTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKR
        MASLAT+AAV+P   +KGLGGSSLAG KL ++PSR SF+PKS +A  VVAKYGDKSVYFDLEDLGNTTGQWD+YGSDAPSPYN LQSKFFETFAAPFTKR
Subjt:  MASLATLAAVQP-VTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKR

Query:  GLLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI
        GLLLKFL+LGGG+ L Y SAT+  +VLPIK+GPQ PPKLGPRGK+
Subjt:  GLLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI

Arabidopsis top hits | e value | % identity
AT1G52230.1 photosystem I subunit H2 | 9.6e-62 | 81.38
Query:  MASLATLAAVQP-VTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKR
        MAS AT+AAVQP   VKGLGGSSLAG KL ++PSRQSF+ KS +AGAVVAKYGDKSVYFDLEDLGNTTGQWD+YGSDAPSPYN LQSKFFETFAAPFTKR
Subjt:  MASLATLAAVQP-VTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKR

Query:  GLLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI
        GLLLKFL+LGGG+ L Y SA +  DVLPIK+GPQ PPKLGPRGK+
Subjt:  GLLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI

AT1G79840.1 HD-ZIP IV family of homeobox-leucine zipper protein with lipid-binding START domain | 7.4e-22 | 57.58
Query:  MWILQDSSTNSSESMVVYSGVDVTGMQSVMTGCDSSSLTILPSGFSILPDGAVSRPPLLITRQKDDKTADTNGGVLLTVAVQILTDASPSAKPTMESVE
        +W+LQDSSTNS ES+VVY+ VD+   Q V+ G D S++ ILPSGFSI+PDG  SR PL+IT  +DD+  ++ GG LLT+A+Q L + SP+AK  MESVE
Subjt:  MWILQDSSTNSSESMVVYSGVDVTGMQSVMTGCDSSSLTILPSGFSILPDGAVSRPPLLITRQKDDKTADTNGGVLLTVAVQILTDASPSAKPTMESVE

AT1G79840.2 HD-ZIP IV family of homeobox-leucine zipper protein with lipid-binding START domain | 7.4e-22 | 57.58
Query:  MWILQDSSTNSSESMVVYSGVDVTGMQSVMTGCDSSSLTILPSGFSILPDGAVSRPPLLITRQKDDKTADTNGGVLLTVAVQILTDASPSAKPTMESVE
        +W+LQDSSTNS ES+VVY+ VD+   Q V+ G D S++ ILPSGFSI+PDG  SR PL+IT  +DD+  ++ GG LLT+A+Q L + SP+AK  MESVE
Subjt:  MWILQDSSTNSSESMVVYSGVDVTGMQSVMTGCDSSSLTILPSGFSILPDGAVSRPPLLITRQKDDKTADTNGGVLLTVAVQILTDASPSAKPTMESVE

AT3G16140.1 photosystem I subunit H-1 | 3.6e-61 | 79.31
Query:  MASLATLAAVQP-VTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKR
        MASLAT+AAV+P   +KGLGGSSLAG KL ++PSR SF+PKS +A  VVAKYGDKSVYFDLEDLGNTTGQWD+YGSDAPSPYN LQSKFFETFAAPFTKR
Subjt:  MASLATLAAVQP-VTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKR

Query:  GLLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI
        GLLLKFL+LGGG+ L Y SAT+  +VLPIK+GPQ PPKLGPRGK+
Subjt:  GLLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI

AT4G00730.1 Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein | 1.3e-13 | 45.45
Query:  MWILQDSSTNSSESMVVYSGVDVTGMQSVMTGCDSSSLTILPSGFSILPDGAVSRPPLLITRQKDDKTADTNGGVLLTVAVQILTDASPSAKPTMESVE
        M ILQ++  ++S ++VVY+ VD+  M  VM G DSS + +LPSGF++LPDG +           D       GG LLTVA QIL +  P+AK T+ESVE
Subjt:  MWILQDSSTNSSESMVVYSGVDVTGMQSVMTGCDSSSLTILPSGFSILPDGAVSRPPLLITRQKDDKTADTNGGVLLTVAVQILTDASPSAKPTMESVE
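
The % identity values can be cross-checked against the alignment midlines, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The sketch below assumes identity is reported as identical columns divided by total alignment columns (the usual BLAST convention, assumed here rather than stated on this page). `percent_identity` is a hypothetical helper, and the midline strings are copied from the A0A0A0LAW4 hit in the TrEMBL list.

```python
def percent_identity(midline_blocks: list[str]) -> float:
    """Percent of alignment columns whose midline character is a letter."""
    midline = "".join(midline_blocks)
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)

# Midline rows of the A0A0A0LAW4 alignment (two blocks, 100 + 44 columns).
blocks = [
    "MASLATLAAVQPVTVKGLGGSSLAG KLPLRPSRQ+FRPK+FKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFAAPFTKRG",
    "LLLKFLLLGGGATLAY+SATAPDDVLPIKKGPQLPPKLGPRGKI",
]
pct = percent_identity(blocks)
print(f"{pct:.2f}")
```

For this hit, 140 of 144 columns are identical, which rounds to the listed 97.22. The midline is used rather than a Query/Subjt comparison because the Subjt rows in this record repeat the query text.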


Sequences
CDS sequence
ATGTGGATCCTCCAAGACAGCTCCACAAACTCGTCGGAATCCATGGTGGTTTACTCCGGAGTAGACGTTACCGGCATGCAGTCGGTGATGACAGGCTGCGATTCCAGCAG
CCTCACCATTCTCCCTTCTGGCTTTTCAATTCTCCCCGACGGCGCTGTGTCCAGGCCGCCCCTACTCATCACTCGACAGAAAGACGACAAGACCGCCGACACCAATGGCG
GCGTTCTGCTGACTGTCGCCGTTCAAATCCTCACCGACGCCTCTCCCTCTGCAAAACCCACCATGGAATCCGTTGAGATCGAAATCCCATTTGAAGCCTCACCTCCTCCC
TCAGCCTGCAACTCCAAGTTTGTTCCTTTTCAATCCAAAACGCAGACAACAATGGCTTCCTTAGCAACATTAGCCGCCGTTCAGCCGGTCACCGTAAAGGGCCTTGGTGG
AAGCTCCCTTGCCGGAACTAAGCTCCCTCTCAGGCCCTCTCGCCAGAGCTTCAGACCAAAAAGCTTCAAGGCTGGTGCTGTGGTGGCTAAGTACGGTGACAAAAGTGTTT
ACTTCGATTTGGAGGATTTGGGCAACACTACTGGACAGTGGGATTTGTATGGATCTGATGCTCCTTCACCATACAATTCTCTTCAGAGCAAATTCTTTGAGACCTTTGCC
GCTCCATTCACCAAGAGAGGATTGTTGCTCAAGTTCTTGCTTCTAGGCGGTGGAGCCACATTAGCTTATTACAGTGCCACTGCCCCAGATGATGTTCTTCCCATCAAGAA
AGGACCTCAACTTCCACCAAAGCTTGGGCCTCGTGGCAAGATCTAA
mRNA sequence
ATGTGGATCCTCCAAGACAGCTCCACAAACTCGTCGGAATCCATGGTGGTTTACTCCGGAGTAGACGTTACCGGCATGCAGTCGGTGATGACAGGCTGCGATTCCAGCAG
CCTCACCATTCTCCCTTCTGGCTTTTCAATTCTCCCCGACGGCGCTGTGTCCAGGCCGCCCCTACTCATCACTCGACAGAAAGACGACAAGACCGCCGACACCAATGGCG
GCGTTCTGCTGACTGTCGCCGTTCAAATCCTCACCGACGCCTCTCCCTCTGCAAAACCCACCATGGAATCCGTTGAGATCGAAATCCCATTTGAAGCCTCACCTCCTCCC
TCAGCCTGCAACTCCAAGTTTGTTCCTTTTCAATCCAAAACGCAGACAACAATGGCTTCCTTAGCAACATTAGCCGCCGTTCAGCCGGTCACCGTAAAGGGCCTTGGTGG
AAGCTCCCTTGCCGGAACTAAGCTCCCTCTCAGGCCCTCTCGCCAGAGCTTCAGACCAAAAAGCTTCAAGGCTGGTGCTGTGGTGGCTAAGTACGGTGACAAAAGTGTTT
ACTTCGATTTGGAGGATTTGGGCAACACTACTGGACAGTGGGATTTGTATGGATCTGATGCTCCTTCACCATACAATTCTCTTCAGAGCAAATTCTTTGAGACCTTTGCC
GCTCCATTCACCAAGAGAGGATTGTTGCTCAAGTTCTTGCTTCTAGGCGGTGGAGCCACATTAGCTTATTACAGTGCCACTGCCCCAGATGATGTTCTTCCCATCAAGAA
AGGACCTCAACTTCCACCAAAGCTTGGGCCTCGTGGCAAGATCTAATTCGCTTTCAAATCCTTTTGCAGTATGTAAATTTTCTCTCTTATCCTCCCAGTTATGTTTCAAT
TGAGAAATTGTTATTATGTAATGAATGTACTTTTACAAAGTGTTCTTGCCAGTTTCTTTGAAAAACTGATGAAGATATGGGTTAAAAAGACTTATTCTTTCTAACAAAAT
TGACTAAAACTTACCCCTGC
Protein sequence
MWILQDSSTNSSESMVVYSGVDVTGMQSVMTGCDSSSLTILPSGFSILPDGAVSRPPLLITRQKDDKTADTNGGVLLTVAVQILTDASPSAKPTMESVEIEIPFEASPPP
SACNSKFVPFQSKTQTTMASLATLAAVQPVTVKGLGGSSLAGTKLPLRPSRQSFRPKSFKAGAVVAKYGDKSVYFDLEDLGNTTGQWDLYGSDAPSPYNSLQSKFFETFA
APFTKRGLLLKFLLLGGGATLAYYSATAPDDVLPIKKGPQLPPKLGPRGKI
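
The relationship between the CDS and the protein sequence can be spot-checked by translating the CDS in frame. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene. `translate` and `CODON_TABLE` are illustrative names rather than part of any database tooling, and only the first and last few codons of the CDS are used to keep the example short.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

cds_start = "ATGTGGATCCTCCAAGACAGCTCCACAAAC"  # first 30 nt of the CDS
cds_end = "GGCAAGATCTAA"                      # last 12 nt of the CDS
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MWILQDSSTN and GKI, which agree with the start and end of the protein sequence. Passing the full CDS, with its line breaks, to `translate` should work the same way because whitespace is removed first.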