; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0007122 (gene) of Chayote v1 genome

Gene IDSed0007122
OrganismSechium edule (Chayote v1)
Descriptionzinc finger protein CONSTANS-LIKE 14-like
Genome locationLG05:40113104..40114952
RNA-Seq ExpressionSed0007122
SyntenySed0007122
Gene Ontology termsGO:0009909 - regulation of flower development (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR010402 - CCT domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6570739.1 Zinc finger protein CONSTANS-LIKE 3, partial [Cucurbita argyrosperma subsp. sororia]1.5e-7569.17Show/hide
Query:  MAFSPLPSA-----ADLAGVDFDWLPNSSIKPNWKSELS---GGSPPPSRPQPQTTRRPPNRFLLHQTFGFLKPSLETNDLSKRT--CSASSPPPSAADN
        MAF P  SA     ADLA VDFDWLPNSSIK  W SE+S   G   P S PQP T RRP NRFLL Q+F  L PSL+T ++  RT   S+SSPPP  A++
Subjt:  MAFSPLPSA-----ADLAGVDFDWLPNSSIKPNWKSELS---GGSPPPSRPQPQTTRRPPNRFLLHQTFGFLKPSLETNDLSKRT--CSASSPPPSAADN

Query:  LLNRPPFPTGDF-----QKNYKRGESPVSCESSITIEGMSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDDAVKN
        L++RP FP GDF     Q+N +R ESPVSCESSITIEG+SRACRYSPEEKK+RIERY+SKRN+RNFNKKIKYACRKSLADSRPRIRGRFARYND+ A KN
Subjt:  LLNRPPFPTGDF-----QKNYKRGESPVSCESSITIEGMSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDDAVKN

Query:  YPVQWSHGQDEEEEA----NGGDNWLKYFIDAY-SANLIP
         P+QWSHGQ EEEE     NGG+NW+KYFIDAY SANLIP
Subjt:  YPVQWSHGQDEEEEA----NGGDNWLKYFIDAY-SANLIP

XP_022944273.1 uncharacterized protein LOC111448769 isoform X1 [Cucurbita moschata]4.0e-7670.54Show/hide
Query:  MAFSPLPSA-----ADLAGVDFDWLPNSSIKPNWKSELS---GGSPPPSRPQPQTTRRPPNRFLLHQTFGFLKPSLE-TNDLSKRT--CSASSPPPSAAD
        MAFSP  SA     ADLA VDFDWLP+SSIK  W SE+S   G   P S PQP T RRP NRFLL Q+FG L PSL+ TN+L  RT   S+SSPPP AA+
Subjt:  MAFSPLPSA-----ADLAGVDFDWLPNSSIKPNWKSELS---GGSPPPSRPQPQTTRRPPNRFLLHQTFGFLKPSLE-TNDLSKRT--CSASSPPPSAAD

Query:  NLLNRPPFPTGDF-----QKNYKRGESPVSCESSITIEGMSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDDAVK
        +L++RP FP GDF     Q+N +R ESPVSCESSITIEG+SRACRYSPEEKK+RIERY+SKRN+RNFNKKIKYACRKSLADSRPRIRGRFARYND+ A K
Subjt:  NLLNRPPFPTGDF-----QKNYKRGESPVSCESSITIEGMSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDDAVK

Query:  NYPVQWSHGQDEEEEA----NGGDNWLKYFIDAY-SANLIP
        N P+QWSHGQ EEEE     NGG+NW+KYFIDAY SANLIP
Subjt:  NYPVQWSHGQDEEEEA----NGGDNWLKYFIDAY-SANLIP

XP_023511996.1 uncharacterized protein LOC111776840 [Cucurbita pepo subsp. pepo]2.9e-7469.04Show/hide
Query:  MAFSPL-----PSAADLAGVDFDWLPNSSIKPNWKSELS---GGSPPPSRPQPQTTRRPPNRFLLHQTFGFLKPSLETNDLSKRT--CSASSPPPSAADN
        MAFSP       S+ADLAGVDFDWLP+SSIK  W SE+S   G   P S PQP T RRP NRFLL Q+F  L PSL+T     RT   S+SSPPP  A++
Subjt:  MAFSPL-----PSAADLAGVDFDWLPNSSIKPNWKSELS---GGSPPPSRPQPQTTRRPPNRFLLHQTFGFLKPSLETNDLSKRT--CSASSPPPSAADN

Query:  LLNRPPFPTGDF-----QKNYKRGESPVSCESSITIEGMSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDDAVKN
        L +RP FP GDF     Q+N +R ESPVSCESSITIEG+SRACRYSPEEKK+RIERY+SKRN+RNFNKKIKYACRKSLADSRPRIRGRFARYND+ A KN
Subjt:  LLNRPPFPTGDF-----QKNYKRGESPVSCESSITIEGMSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDDAVKN

Query:  YPVQWSHGQDEEEEA---NGGDNWLKYFIDAY-SANLIP
         P+QWSH Q EEEE    NGG+NW+KYFIDAY SANLIP
Subjt:  YPVQWSHGQDEEEEA---NGGDNWLKYFIDAY-SANLIP

XP_023532290.1 two-component response regulator-like PRR1 isoform X1 [Cucurbita pepo subsp. pepo]5.4e-7368.3Show/hide
Query:  SAADLAGVDFDWLPNSSIKPNWKSELS---GGSPPPSRPQPQTTRRPPNRFLLHQTFGFLKPSLETNDLSKRTCSASSPPPS--AADNLLNRPPFPTGDF
        S+ D AGVDFDWLP SSIK  W SEL    G     S PQPQT   P N  L H TF  L P L+TN L ++T S+ SPPP   ++DN+L++   P GDF
Subjt:  SAADLAGVDFDWLPNSSIKPNWKSELS---GGSPPPSRPQPQTTRRPPNRFLLHQTFGFLKPSLETNDLSKRTCSASSPPPS--AADNLLNRPPFPTGDF

Query:  QKNYKRGESPVSCESSITIEGMSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDDAVKNYPVQWSHGQ------DE
        Q+N +R ESPVSCESSI IEGMSRACRYSPEEKK+RI+RYRSKRN+RNFNKKIKYACRKSLADSRPRIRGRFARYNDD A KNYPVQWSHGQ      +E
Subjt:  QKNYKRGESPVSCESSITIEGMSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDDAVKNYPVQWSHGQ------DE

Query:  EEEANGGDNWLKYFIDAYSANLIP
        EEEAN  DNW+KYF+DAYS NLIP
Subjt:  EEEANGGDNWLKYFIDAYSANLIP

XP_038902180.1 zinc finger protein CONSTANS-LIKE 5-like [Benincasa hispida]7.0e-8172.34Show/hide
Query:  MAFSPLPSA-----ADLAGVDFDWLPNSSIKPNWKSELS---GGSPPPSRPQPQTTRRPPNRFLLHQTFGFLKPSLETNDL-SKRTCSASSPPPS-----
        MAFSP  SA     ADLAGVDFDWLP+SSI   W SE S   G   P S PQ  T R   N FLLHQ+F  L PSL++NDL  +RT S+SSPPPS     
Subjt:  MAFSPLPSA-----ADLAGVDFDWLPNSSIKPNWKSELS---GGSPPPSRPQPQTTRRPPNRFLLHQTFGFLKPSLETNDL-SKRTCSASSPPPS-----

Query:  AADNLLNRPPFPTGDFQKNYKRGESPVSCESSITIEGMSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDDAVKNY
        AA+ L+NRP FP GDFQ+N +R ESPVSCESSITIEGMSRACRYSPEEKK+RIERYR+KRN+RNFNKKIKYACRKSLADSRPRIRGRFARYN DD VKNY
Subjt:  AADNLLNRPPFPTGDFQKNYKRGESPVSCESSITIEGMSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDDAVKNY

Query:  PVQWSHGQDEEEEANG-GDNWLKYFIDAYSANLIP
        P+QWSHGQ+EE+EAN  GDNW+KYFIDAYS NLIP
Subjt:  PVQWSHGQDEEEEANG-GDNWLKYFIDAYSANLIP

TrEMBL top hitse value%identityAlignment
A0A0A0KG14 CCT domain-containing protein4.8e-6766.4Show/hide
Query:  MAFSPLPSA-----ADLAGVDFDWLPNSSIK---PNWKSELSGGSPPPSRPQPQTTRRPPNRFLLH-QTFGFLKP-SLETNDLS---KRTCSASSPPPSA
        MAFSP  SA     ADLAGVDFDWLP+SSIK    +  S+  G   PPS P+P   R   N FLLH   F  L P SL++ND     +RT S+SSPPPSA
Subjt:  MAFSPLPSA-----ADLAGVDFDWLPNSSIK---PNWKSELSGGSPPPSRPQPQTTRRPPNRFLLH-QTFGFLKP-SLETNDLS---KRTCSASSPPPSA

Query:  A-DNLLNRPPFPTGDFQK---NYKRG-ESPVSCESSIT-IEGMSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDD
        A D  +NRP FP GDFQK   N +RG ESPVSCESSIT IEGMSRACRYSPEEKK+RIERYR+KR++RNFNKKIKYACRKSLADSRPRIRGRFARYN D+
Subjt:  A-DNLLNRPPFPTGDFQK---NYKRG-ESPVSCESSIT-IEGMSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDD

Query:  AVKNYPVQWS-HGQDEEE------EANG-GDNWLKYFIDAYSANLIP
         VKNYPVQW+ H ++EEE      EAN  GDNW+KYFID YS NLIP
Subjt:  AVKNYPVQWS-HGQDEEE------EANG-GDNWLKYFIDAYSANLIP

A0A6J1FXW4 uncharacterized protein LOC111448769 isoform X11.9e-7670.54Show/hide
Query:  MAFSPLPSA-----ADLAGVDFDWLPNSSIKPNWKSELS---GGSPPPSRPQPQTTRRPPNRFLLHQTFGFLKPSLE-TNDLSKRT--CSASSPPPSAAD
        MAFSP  SA     ADLA VDFDWLP+SSIK  W SE+S   G   P S PQP T RRP NRFLL Q+FG L PSL+ TN+L  RT   S+SSPPP AA+
Subjt:  MAFSPLPSA-----ADLAGVDFDWLPNSSIKPNWKSELS---GGSPPPSRPQPQTTRRPPNRFLLHQTFGFLKPSLE-TNDLSKRT--CSASSPPPSAAD

Query:  NLLNRPPFPTGDF-----QKNYKRGESPVSCESSITIEGMSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDDAVK
        +L++RP FP GDF     Q+N +R ESPVSCESSITIEG+SRACRYSPEEKK+RIERY+SKRN+RNFNKKIKYACRKSLADSRPRIRGRFARYND+ A K
Subjt:  NLLNRPPFPTGDF-----QKNYKRGESPVSCESSITIEGMSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDDAVK

Query:  NYPVQWSHGQDEEEEA----NGGDNWLKYFIDAY-SANLIP
        N P+QWSHGQ EEEE     NGG+NW+KYFIDAY SANLIP
Subjt:  NYPVQWSHGQDEEEEA----NGGDNWLKYFIDAY-SANLIP

A0A6J1G5H7 zinc finger protein CONSTANS-LIKE 14-like1.2e-7067.71Show/hide
Query:  SAADLAGVDFDWLPNSSIKPNWKSELS---GGSPPPSRPQPQTTRRPPNRFLLHQTFGFLKPSLETNDLSKRTCSASSPPPS-AADNLLNRPPFPTGDFQ
        S+ D AGVDFDWLP SSIK  W SEL    G     S PQPQT   P N  L H TF  L PSL+TN L ++T   SSP PS ++DN+L++   P  DFQ
Subjt:  SAADLAGVDFDWLPNSSIKPNWKSELS---GGSPPPSRPQPQTTRRPPNRFLLHQTFGFLKPSLETNDLSKRTCSASSPPPS-AADNLLNRPPFPTGDFQ

Query:  KNYKRGESPVSCESSITIEGMSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDDAVKNYPVQWSHGQ------DEE
        +N +R ESPVSCESSI IEGMSRACRYSPEEKK+RI+RYRSKRN+RNFNKKIKYACRKSLADSRPRIRGRFA+YNDD A KNYP QWSHGQ      +EE
Subjt:  KNYKRGESPVSCESSITIEGMSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDDAVKNYPVQWSHGQ------DEE

Query:  EEANGGDNWLKYFIDAYSANLIP
        EEAN  DNW+KYF+DAYS NLIP
Subjt:  EEANGGDNWLKYFIDAYSANLIP

A0A6J1JHZ7 uncharacterized protein LOC111484524 isoform X17.6e-7368.05Show/hide
Query:  MAFSPLPSA-----ADLAGVDFDWLPNSSIKPNWKSELS---GGSPPPSRPQPQTTRRPPNRFLLHQTFGFLKPSLE-TNDLSKRT--CSASSPPPSAAD
        MAFSP  SA     ADLA VDFDWLP+SS+K  WKSE+S   G     S PQP T RRP NRFLL Q+F  L PSL  TN+L  RT   S+SS PP  A+
Subjt:  MAFSPLPSA-----ADLAGVDFDWLPNSSIKPNWKSELS---GGSPPPSRPQPQTTRRPPNRFLLHQTFGFLKPSLE-TNDLSKRT--CSASSPPPSAAD

Query:  NLLNRPPFPTGDF-----QKNYKRGESPVSCESSITIEGMSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDDAVK
        +L++RP F  GDF     Q+N +R ESPVSCESSITIEG+SRACRYSPEEKK+RIERY+SKRN+RNFNKKIKYACRKSLADSRPRIRGRFARYND+ A K
Subjt:  NLLNRPPFPTGDF-----QKNYKRGESPVSCESSITIEGMSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDDAVK

Query:  NYPVQWSHGQDEEEEA----NGGDNWLKYFIDAY-SANLIP
        N+P++WSHGQ EEEE     NGG+NW+KYFIDAY SANLIP
Subjt:  NYPVQWSHGQDEEEEA----NGGDNWLKYFIDAY-SANLIP

A0A6J1L4M8 two-component response regulator-like PRR18.4e-7268.33Show/hide
Query:  SAADLAGVDFDWLPNSSIKPNWKSELS---GGSPPPSRPQPQTTRRPPNRFLLHQTFGFLKPSLETNDLSKRTCSASSPPPSAADNLLNRPPFPTGDFQK
        S+ D AGVDFDWLP SSIK  W SEL    G     S PQPQT R P N  L H TF  L PSL+TN L ++T S+  PPP   D L ++   P GDFQ+
Subjt:  SAADLAGVDFDWLPNSSIKPNWKSELS---GGSPPPSRPQPQTTRRPPNRFLLHQTFGFLKPSLETNDLSKRTCSASSPPPSAADNLLNRPPFPTGDFQK

Query:  NYKRGESPVSCESSITIEGMSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDDAVKNYPVQWSHGQ-----DEEEE
        N +R ESPVSCESSI I+GMSRACRYSPEEKK+RI+RYRSKRN+RNFNKKIKYACRKSLADSRPRIRGRFARYN D A KNYPVQWSHGQ     +EEEE
Subjt:  NYKRGESPVSCESSITIEGMSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDDAVKNYPVQWSHGQ-----DEEEE

Query:  ANGGDNWLKYFIDAYSANLIP
        AN  DNW+KYF+DAYS NLIP
Subjt:  ANGGDNWLKYFIDAYSANLIP

SwissProt top hitse value%identityAlignment
E5RQA1 Transcription factor GHD71.7e-0550Show/hide
Query:  EKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDDAV
        E++ ++ RY+ KR KR + K+I+YA RK+ A+ RPR+RGRFA+  D +AV
Subjt:  EKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDDAV

O50055 Zinc finger protein CONSTANS-LIKE 17.7e-0649.09Show/hide
Query:  SPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDDAVKN
        SP +++ R+ RYR K+  R F K I+YA RK+ A+ RPRI+GRFA+  D D   N
Subjt:  SPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDDAVKN

O82117 Zinc finger protein CO31.7e-0552.08Show/hide
Query:  EKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDD
        +++ R+ RYR KR  R F K I+YA RK+ A++RPRI+GRFA+ +D D
Subjt:  EKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDD

Q940T9 Zinc finger protein CONSTANS-LIKE 47.7e-0649.12Show/hide
Query:  GMSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYND
        G  RA   +  E++ R+ RYR KR  R F K I+YA RK+ A+ RPRI+GRFA+  D
Subjt:  GMSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYND

Q9SK53 Zinc finger protein CONSTANS-LIKE 32.6e-0652.83Show/hide
Query:  ACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYND
        A + SP E++ R+ RYR KR  R F K I+YA RK+ A+ RPRI+GRFA+  D
Subjt:  ACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYND

Arabidopsis top hitse value%identityAlignment
AT1G63820.1 CCT motif family protein4.2e-1538.51Show/hide
Query:  FLKPSLETNDLSKRTCSASSPPPSAADNLLNRPPFPTGDFQK------NYKRGESPVSCESSITI-----EGMSRACRYSPEEKKQRIERYRSKRNKRNF
        F  P +E +   +     +SP  S   + + R  + TGD Q         +   SP++ ESS T      E   R  RYS EE+K++I +YR+KR +RNF
Subjt:  FLKPSLETNDLSKRTCSASSPPPSAADNLLNRPPFPTGDFQK------NYKRGESPVSCESSITI-----EGMSRACRYSPEEKKQRIERYRSKRNKRNF

Query:  NKKIKYACRKSLADSRPRIRGRFARYNDDDAVKNYPVQWSHGQDEEEE
         K IKYACRK+LAD+RPR+RGRFAR  +D+  +N  +  S  + E ++
Subjt:  NKKIKYACRKSLADSRPRIRGRFARYNDDDAVKNYPVQWSHGQDEEEE

AT2G33350.1 CCT motif family protein2.9e-1655.7Show/hide
Query:  MSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDDAVKNYPVQWSHGQDEEEEANG
        +++  + SPE++K++I RY  KRN+RNFNKKIKYACRK+LADSRPR+RGRFA+ ND+    N     SH  DE+E+  G
Subjt:  MSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDDAVKNYPVQWSHGQDEEEEANG

AT2G33350.2 CCT motif family protein2.9e-1655.7Show/hide
Query:  MSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDDAVKNYPVQWSHGQDEEEEANG
        +++  + SPE++K++I RY  KRN+RNFNKKIKYACRK+LADSRPR+RGRFA+ ND+    N     SH  DE+E+  G
Subjt:  MSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDDAVKNYPVQWSHGQDEEEEANG

AT5G41380.1 CCT motif family protein2.6e-1741.13Show/hide
Query:  PSLETNDLSKRTCSASSPPPSAADNLLNRPPFPTGDFQKNYKRGESPVSCESSITIEGMSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLAD
        P +++++LS    S+   P +A  +   R  + TGD Q N +   S  +     + E   +  RYS EE+K++I +YR+KRN+RNF K IKYACRK+LAD
Subjt:  PSLETNDLSKRTCSASSPPPSAADNLLNRPPFPTGDFQKNYKRGESPVSCESSITIEGMSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLAD

Query:  SRPRIRGRFARYNDDDAVKNYPVQWSH-----GQDEEEEAN
        SRPRIRGRFAR ++   + N     S      G  E+EEA+
Subjt:  SRPRIRGRFARYNDDDAVKNYPVQWSH-----GQDEEEEAN

AT5G59990.1 CCT motif family protein8.4e-2450.81Show/hide
Query:  RPPFPTGDFQKNYKRGESPVSCESSITIEGMSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDDAVKNYPVQWSHG
        R     GD  ++ +R  S V  ES+  IEGMS+A +YSPEEKK++IE+YRSKRN RNFNK+IKY CRK+LADSRPRIRGRFAR ++    +   V     
Subjt:  RPPFPTGDFQKNYKRGESPVSCESSITIEGMSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDDAVKNYPVQWSHG

Query:  QDEEEEANGGDNWLKYFIDAYSAN
           E      D W   F+D++SAN
Subjt:  QDEEEEANGGDNWLKYFIDAYSAN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTTTCACCATTGCCCTCCGCCGCCGACCTCGCCGGCGTTGATTTTGACTGGCTGCCCAATTCCTCCATCAAGCCTAATTGGAAATCTGAACTTTCCGGCGGTTC
GCCGCCGCCGTCTCGGCCGCAACCCCAAACAACTCGTCGTCCACCGAACAGATTTTTGTTACATCAAACTTTTGGTTTCTTAAAACCTTCTCTCGAAACCAATGACCTTA
GTAAACGTACTTGTTCGGCGTCGTCGCCGCCGCCCTCCGCCGCAGATAATCTCCTCAACAGGCCGCCCTTTCCCACCGGTGATTTTCAGAAGAATTACAAGAGAGGCGAG
AGCCCAGTTTCATGTGAAAGTAGCATCACAATTGAAGGGATGAGCAGAGCTTGTAGGTACAGCCCAGAAGAGAAGAAGCAAAGAATTGAAAGGTATAGGTCAAAGAGAAA
CAAGAGGAACTTCAACAAGAAGATTAAGTATGCTTGTAGGAAATCATTGGCGGATAGTCGACCACGGATCAGAGGACGATTTGCAAGGTACAATGACGACGATGCTGTGA
AGAATTATCCAGTGCAATGGAGTCATGGGCAAGATGAAGAAGAAGAGGCAAATGGTGGCGATAATTGGCTGAAGTATTTTATTGATGCATATTCTGCGAACCTTATTCCA
TGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTTTTCACCATTGCCCTCCGCCGCCGACCTCGCCGGCGTTGATTTTGACTGGCTGCCCAATTCCTCCATCAAGCCTAATTGGAAATCTGAACTTTCCGGCGGTTC
GCCGCCGCCGTCTCGGCCGCAACCCCAAACAACTCGTCGTCCACCGAACAGATTTTTGTTACATCAAACTTTTGGTTTCTTAAAACCTTCTCTCGAAACCAATGACCTTA
GTAAACGTACTTGTTCGGCGTCGTCGCCGCCGCCCTCCGCCGCAGATAATCTCCTCAACAGGCCGCCCTTTCCCACCGGTGATTTTCAGAAGAATTACAAGAGAGGCGAG
AGCCCAGTTTCATGTGAAAGTAGCATCACAATTGAAGGGATGAGCAGAGCTTGTAGGTACAGCCCAGAAGAGAAGAAGCAAAGAATTGAAAGGTATAGGTCAAAGAGAAA
CAAGAGGAACTTCAACAAGAAGATTAAGTATGCTTGTAGGAAATCATTGGCGGATAGTCGACCACGGATCAGAGGACGATTTGCAAGGTACAATGACGACGATGCTGTGA
AGAATTATCCAGTGCAATGGAGTCATGGGCAAGATGAAGAAGAAGAGGCAAATGGTGGCGATAATTGGCTGAAGTATTTTATTGATGCATATTCTGCGAACCTTATTCCA
TGA
Protein sequenceShow/hide protein sequence
MAFSPLPSAADLAGVDFDWLPNSSIKPNWKSELSGGSPPPSRPQPQTTRRPPNRFLLHQTFGFLKPSLETNDLSKRTCSASSPPPSAADNLLNRPPFPTGDFQKNYKRGE
SPVSCESSITIEGMSRACRYSPEEKKQRIERYRSKRNKRNFNKKIKYACRKSLADSRPRIRGRFARYNDDDAVKNYPVQWSHGQDEEEEANGGDNWLKYFIDAYSANLIP