CuGenDBv2

Sed0011473 (gene) of Chayote v1 genome

Gene ID: Sed0011473
Organism: Sechium edule (Chayote v1)
Description: homeobox protein 6
Genome location: LG05:43363302..43365027
RNA-Seq Expression: Sed0011473
Synteny: Sed0011473
Gene Ontology terms: NA
InterPro domains: NA
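
The genome location field packs the sequence name and the start and end coordinates into one string. A minimal Python sketch for splitting it, assuming the `name:start..end` layout used in this record and inclusive coordinates (the function name `parse_location` is hypothetical and not part of CuGenDB):

```python
import re

def parse_location(location):
    """Split a string such as 'LG05:43363302..43365027' into (name, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    name = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    return name, start, end

name, start, end = parse_location("LG05:43363302..43365027")
# Span length, assuming both coordinates are inclusive.
span = end - start + 1
print(name, start, end, span)
```

Under those assumptions the gene spans 1,726 bp on LG05.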


Homology
GenBank top hits
KAG6605199.1 hypothetical protein SDJN03_02516, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.0e-88; % identity: 69.59)
Query:  EEEELSLCDLPVKEKQQ------NPINKSTAAVETEDFDFNHWPPPP-PPAMCAADDIFFQGHMLPLCLSFSSDHTHNNQFFSKNLSIRSESMDHNMLKF
        EEE LSLCDLPVKEKQQ      NPI        TEDFDFNHWPPPP PP MCAADDIFFQGH+LPL LS SSD+THN+ FFSK LS RSESMDHNML+F
Subjt:  EEEELSLCDLPVKEKQQ------NPINKSTAAVETEDFDFNHWPPPP-PPAMCAADDIFFQGHMLPLCLSFSSDHTHNNQFFSKNLSIRSESMDHNMLKF

Query:  RNGSSSSS--SSTSYYSRSSSISNNSISIPTNSKPRSQKNVFHSHPSPSPQIRSLSTSSRRSRSRASSRWDFFRLGLLRTPRMELDDLKTRNTTSKAPAT
        RNGSSSSS  SS S+YSR SSISNNSISIPTNSKPR+Q NVFHSHPSP+PQIRS ST  RRSRSR+SSRWDFFR+GLLRTP MEL DLKTR T + A  T
Subjt:  RNGSSSSS--SSTSYYSRSSSISNNSISIPTNSKPRSQKNVFHSHPSPSPQIRSLSTSSRRSRSRASSRWDFFRLGLLRTPRMELDDLKTRNTTSKAPAT

Query:  VVQKTIPSFLGVVSCKKSVDTISPPPPPPLVKRLQEHDQKKSCNNN-------------NNNVEIREKEKEKETRLSNRRTFEWLKQLSHA-SFAD
        V QKT  SFLGVVSCKKSV+TI      P  K+++          N             N+NVEIREKEKEK TRLS+RRTFEWLKQLSHA +FAD
Subjt:  VVQKTIPSFLGVVSCKKSVDTISPPPPPPLVKRLQEHDQKKSCNNN-------------NNNVEIREKEKEKETRLSNRRTFEWLKQLSHA-SFAD

XP_022947585.1 uncharacterized protein LOC111451408 [Cucurbita moschata] (e value: 2.2e-90; % identity: 69.59)
Query:  GRSNNGDEIWEEEELSLCDLPVKEKQQNPINKSTAAVETEDFDFNHWPPPP-PPAMCAADDIFFQGHMLPLCLSFSSDHTHNNQFFSKNLSIRSESMDHN
        G  ++  +  EEE LSLCDLPVKEKQQ  +        TEDFDFNHWPPPP PP MCAADDIFFQGH+LPL LS SSD+THN+ FFSK LS RSESMDHN
Subjt:  GRSNNGDEIWEEEELSLCDLPVKEKQQNPINKSTAAVETEDFDFNHWPPPP-PPAMCAADDIFFQGHMLPLCLSFSSDHTHNNQFFSKNLSIRSESMDHN

Query:  MLKFRNGSSSSS--SSTSYYSRSSSISNNSISIPTNSKPRSQKNVFHSHPSPSPQIRSLSTSSRRSRSRASSRWDFFRLGLLRTPRMELDDLKTRNTTSK
        ML+FRNGSSSSS  SS S+YSR SSISNNSISIPTNSKPR+Q NVFHSHPSP+PQIRS ST  RRSRSR+SSRWDFFR+GLLRTP MEL DLKTR T S 
Subjt:  MLKFRNGSSSSS--SSTSYYSRSSSISNNSISIPTNSKPRSQKNVFHSHPSPSPQIRSLSTSSRRSRSRASSRWDFFRLGLLRTPRMELDDLKTRNTTSK

Query:  APATVVQKTIPSFLGVVSCKKSVDTISPPPPPPLVKRLQEHDQKKSCNNN----------NNNVEIREKEKEKETRLSNRRTFEWLKQLSHASFAD
        A  TV QKT  SFLGVVSCKKSV+TI   P    +K    +  KK               N+NVEIREKEKEK TRLS+RRTFEWLKQLSHA+FAD
Subjt:  APATVVQKTIPSFLGVVSCKKSVDTISPPPPPPLVKRLQEHDQKKSCNNN----------NNNVEIREKEKEKETRLSNRRTFEWLKQLSHASFAD

XP_023006931.1 uncharacterized protein LOC111499574 [Cucurbita maxima] (e value: 2.0e-83; % identity: 68.71)
Query:  DEIWEEEELSLCDLPVKEKQQ------NPINKSTAAVETEDFDFNHWPPPPPPAMCAADDIFFQGHMLPLCLSFSSDHTHNNQFFSKNLSIRSESMDHNM
        +E  EEE LS+CDLPVKEKQQ      NPI        TEDFDFNHWPPPP   MCAADDIFFQGH+LPL LS SSD+THN+ FFSK LS RSESMDHNM
Subjt:  DEIWEEEELSLCDLPVKEKQQ------NPINKSTAAVETEDFDFNHWPPPPPPAMCAADDIFFQGHMLPLCLSFSSDHTHNNQFFSKNLSIRSESMDHNM

Query:  LKFRNGSSSSS-SSTSYYSRSSSISNNSISIPTNSKPRSQKNVFHSHPSPSPQIRSLSTSSRRSRSRASSRWDFFRLGLLRTPRMELDDLKTRNTTS---
        L+FRNGSSSSS SS S+YSR SSISNNSISIPTNSKPR+Q NVFHSHPSP+PQIRS ST  RRS    SSRWDFFR+GLLRTP MEL DLKTR T     
Subjt:  LKFRNGSSSSS-SSTSYYSRSSSISNNSISIPTNSKPRSQKNVFHSHPSPSPQIRSLSTSSRRSRSRASSRWDFFRLGLLRTPRMELDDLKTRNTTS---

Query:  KAPATVVQKTIPSFLGVVSCKKSVDTISPPP-----PPPLVKRLQEHDQKKSCNNN--NNNVEIREKEKEKETRLSNRRTFEWLKQLSHASFAD
        +A  TV QKT  +FLGVVSCKKSVDTI            +VK+  E  +      N  N+NVEIREKEKEK TRLS+RRTFEWLKQLSHA+FAD
Subjt:  KAPATVVQKTIPSFLGVVSCKKSVDTISPPP-----PPPLVKRLQEHDQKKSCNNN--NNNVEIREKEKEKETRLSNRRTFEWLKQLSHASFAD

XP_023534091.1 uncharacterized protein LOC111795757 [Cucurbita pepo subsp. pepo] (e value: 1.9e-86; % identity: 69.07)
Query:  EEEELSLCDLPVKEKQQNPINKSTAAVETEDFDFNHWPPPP-PPAMCAADDIFFQGHMLPLCLSFSSDHTHNNQFFSKNLSIRSESMDHNMLKFRNGSSS
        EEE LSLCDLPVKEKQQ  +       E  DF+FNHWPPPP PP MCAADDIFFQGH+LPL LS SSD+TH NQFFSK LS RSESMDHNML+FRNGSSS
Subjt:  EEEELSLCDLPVKEKQQNPINKSTAAVETEDFDFNHWPPPP-PPAMCAADDIFFQGHMLPLCLSFSSDHTHNNQFFSKNLSIRSESMDHNMLKFRNGSSS

Query:  SS-SSTSYYSRSSSISNNSISIPTNSKPRSQKNVFHSHPSPSPQIRSLSTSSRRSRSRASSRWDFFRLGLLRTPRMELDDLKTRNTTS---KAPATVVQK
        SS SS S+YSR SSISNNSISIPTNSKPR+Q NVFHSHPSP+PQIRS ST  RRSRSR+SSRWDFFR+G+LRTP MEL DLKTR T +   +A  TV QK
Subjt:  SS-SSTSYYSRSSSISNNSISIPTNSKPRSQKNVFHSHPSPSPQIRSLSTSSRRSRSRASSRWDFFRLGLLRTPRMELDDLKTRNTTS---KAPATVVQK

Query:  TIPSFLGVVSCKKSVDTISPPPPPPLVKRLQEHDQKKSCNNN------------NNNVEIREKEKEKETRLSNRRTFEWLKQLSHA-SFAD
        T  SFLGVVSCKKSV+TI   P    ++    +  KK                 N+NVEIREKEKEK TRLS+RRTFEWLKQLSHA +FAD
Subjt:  TIPSFLGVVSCKKSVDTISPPPPPPLVKRLQEHDQKKSCNNN------------NNNVEIREKEKEKETRLSNRRTFEWLKQLSHA-SFAD

XP_038902148.1 uncharacterized protein LOC120088781 [Benincasa hispida] (e value: 1.0e-84; % identity: 65.03)
Query:  MGRSNNGDEIW------------EEEELSLCDLPVKEKQQNPINKSTAAVETEDFDFNHWPPPPPPAMCAADDIFFQGHMLPLCLSFSSDHTHNN---QF
        MGRS +GDE W            EEE LS CDLPVKEKQQ P+  ++AAVETEDFDFNHW PPPPP M AAD++FFQG MLPL LSFSS++++NN     
Subjt:  MGRSNNGDEIW------------EEEELSLCDLPVKEKQQNPINKSTAAVETEDFDFNHWPPPPPPAMCAADDIFFQGHMLPLCLSFSSDHTHNN---QF

Query:  FSKNLSIRSESMDHNMLKFRNGSSSSSSSTSYYSRSSSISNNSISIPTNSKPRSQKNVFHSHPSPSPQIRSLSTSSRRSRSRASSRWDFFRLGLLRTPRM
        F  NL  RSESMDHNML+F NGSSSSSSS S+YSRSSS+SNNS+SIPTNSK R QKNVFHSHPSP+PQIRS S SS RSRS  SSRW+FFRLGLLRTP M
Subjt:  FSKNLSIRSESMDHNMLKFRNGSSSSSSSTSYYSRSSSISNNSISIPTNSKPRSQKNVFHSHPSPSPQIRSLSTSSRRSRSRASSRWDFFRLGLLRTPRM

Query:  ELDDLKTRNTTSKAPATVVQKTIPSFLGVVSCKKSVDTISPPPPPPLVKRLQEHDQKKSCNNNNNNVEIR------EKEKEKETRLSNRRTFEWLKQLSH
        EL DLKTR TT+ A AT   KT  S LGVVSCK+SVDT++      + K  +  + KK    NN  VEIR      EKEKEKE R+S+RRTFEWLKQLSH
Subjt:  ELDDLKTRNTTSKAPATVVQKTIPSFLGVVSCKKSVDTISPPPPPPLVKRLQEHDQKKSCNNNNNNVEIR------EKEKEKETRLSNRRTFEWLKQLSH

Query:  ASFADD
        A+F ++
Subjt:  ASFADD

TrEMBL top hits
A0A0A0LPT6 Uncharacterized protein (e value: 7.8e-78; % identity: 65.49)
Query:  DEIWEEEELSLCDLPVKEKQQNPINKSTAAVET-EDFDFNHWPPPPPPAMCAADDIFFQGHMLPLCLSFSSDHTHNNQFFSKNLSIRSESMD-HNMLKFR
        +E  EEE LSLCDLPVKEKQQ   + ST  VET +DFDFNHW PPP P M  ADD+FFQGHMLPL LSFSS+++ NN   + NL  RSESMD +NML+FR
Subjt:  DEIWEEEELSLCDLPVKEKQQNPINKSTAAVET-EDFDFNHWPPPPPPAMCAADDIFFQGHMLPLCLSFSSDHTHNNQFFSKNLSIRSESMD-HNMLKFR

Query:  NGSSSSSSSTSYYSRSSSISNNSISIPTNSKPR-SQKNVFHSHPSPSPQIRSLSTSSRRSRSRASSRWDFFRLGLLRTPRMELDDLKTRNTTSKAPAT--
        N S+SSSSS S+YSRSSS+SNNSISIPTNSKPR S  NVFHSHPSP+PQIRS STSS RSRS  SSRW+FFRLGLLRTP MEL DLKTR TT+    T  
Subjt:  NGSSSSSSSTSYYSRSSSISNNSISIPTNSKPR-SQKNVFHSHPSPSPQIRSLSTSSRRSRSRASSRWDFFRLGLLRTPRMELDDLKTRNTTSKAPAT--

Query:  -VVQKTIPSFLGVVSCKKSVDTISPPPPPPLVKRLQEHDQKKSCNNNNNNVEIREKEKEKETRLSNRRTFEWLKQLSHASFADD
            KT  S LGVVSCK+SV+T+          R +   +    NN++N VEIREKEKEKE R+S+RRTFEWLKQLSHA+F ++
Subjt:  -VVQKTIPSFLGVVSCKKSVDTISPPPPPPLVKRLQEHDQKKSCNNNNNNVEIREKEKEKETRLSNRRTFEWLKQLSHASFADD

A0A6J1F3H1 probable membrane-associated kinase regulator 1 (e value: 1.0e-77; % identity: 61.31)
Query:  MGRSNNGDEIWE----------EEELSLCDLPVKEKQQNPIN------KSTAAVETEDFDFNHWPPPPPPAMCAADDIFFQGHMLPLCLSFSSDHTHNNQ
        MGRS +GDE WE          EE LS CDLP+KE Q  P+N      +S+AAV++EDFDFNH PPP P  MCAAD++FFQGH+LPL  SFSS+++HNN 
Subjt:  MGRSNNGDEIWE----------EEELSLCDLPVKEKQQNPIN------KSTAAVETEDFDFNHWPPPPPPAMCAADDIFFQGHMLPLCLSFSSDHTHNNQ

Query:  FFSKNLSIRSESMDHNMLKFRNGSSSSSSSTSYYSRSSSISNNSISIPTNSKPRSQKNVFHSHPSPSPQIRSLSTSSRRSRSRASSRWDFFRLGLLRTPR
        FF +N S RSES D  ML+FRNGS+SSSSS S+YSRSSS+SNNSISIPTNSKPR   NVFHSHPSP+PQIRS STS RRSRS  SSRWDFFRLGLLRTP 
Subjt:  FFSKNLSIRSESMDHNMLKFRNGSSSSSSSTSYYSRSSSISNNSISIPTNSKPRSQKNVFHSHPSPSPQIRSLSTSSRRSRSRASSRWDFFRLGLLRTPR

Query:  MELDDLKTR-NTTSKAPAT---VVQKTIPSFLGVVSCKKSVDTISPPPPPPLVKRLQEHDQKKSCNNNNNNVEIREKEKEKETRLSNRRTFEWLKQLSHA
        MEL DLKTR N++S A A        T  SFLGVVSCKKSVDT++               +KK   N       +E EKE+ TR+S+RRTFEW+KQLSHA
Subjt:  MELDDLKTR-NTTSKAPAT---VVQKTIPSFLGVVSCKKSVDTISPPPPPPLVKRLQEHDQKKSCNNNNNNVEIREKEKEKETRLSNRRTFEWLKQLSHA

Query:  SFADD
        S  D+
Subjt:  SFADD

A0A6J1G7B7 uncharacterized protein LOC111451408 (e value: 1.0e-90; % identity: 69.59)
Query:  GRSNNGDEIWEEEELSLCDLPVKEKQQNPINKSTAAVETEDFDFNHWPPPP-PPAMCAADDIFFQGHMLPLCLSFSSDHTHNNQFFSKNLSIRSESMDHN
        G  ++  +  EEE LSLCDLPVKEKQQ  +        TEDFDFNHWPPPP PP MCAADDIFFQGH+LPL LS SSD+THN+ FFSK LS RSESMDHN
Subjt:  GRSNNGDEIWEEEELSLCDLPVKEKQQNPINKSTAAVETEDFDFNHWPPPP-PPAMCAADDIFFQGHMLPLCLSFSSDHTHNNQFFSKNLSIRSESMDHN

Query:  MLKFRNGSSSSS--SSTSYYSRSSSISNNSISIPTNSKPRSQKNVFHSHPSPSPQIRSLSTSSRRSRSRASSRWDFFRLGLLRTPRMELDDLKTRNTTSK
        ML+FRNGSSSSS  SS S+YSR SSISNNSISIPTNSKPR+Q NVFHSHPSP+PQIRS ST  RRSRSR+SSRWDFFR+GLLRTP MEL DLKTR T S 
Subjt:  MLKFRNGSSSSS--SSTSYYSRSSSISNNSISIPTNSKPRSQKNVFHSHPSPSPQIRSLSTSSRRSRSRASSRWDFFRLGLLRTPRMELDDLKTRNTTSK

Query:  APATVVQKTIPSFLGVVSCKKSVDTISPPPPPPLVKRLQEHDQKKSCNNN----------NNNVEIREKEKEKETRLSNRRTFEWLKQLSHASFAD
        A  TV QKT  SFLGVVSCKKSV+TI   P    +K    +  KK               N+NVEIREKEKEK TRLS+RRTFEWLKQLSHA+FAD
Subjt:  APATVVQKTIPSFLGVVSCKKSVDTISPPPPPPLVKRLQEHDQKKSCNNN----------NNNVEIREKEKEKETRLSNRRTFEWLKQLSHASFAD

A0A6J1JCH9 probable membrane-associated kinase regulator 1 (e value: 1.2e-81; % identity: 63.7)
Query:  MGRSNNGDEIWE----------EEELSLCDLPVKEKQQNPIN------KSTAAVETEDFDFNHWPPPPPPAMCAADDIFFQGHMLPLCLSFSSDHTHNNQ
        MGRS +GDE WE          EE LS CDLP+KE Q  P+N      +S+AAV++EDFDFNH PPP P  MCAAD++FFQGH+LPLC SFSS+++HNN 
Subjt:  MGRSNNGDEIWE----------EEELSLCDLPVKEKQQNPIN------KSTAAVETEDFDFNHWPPPPPPAMCAADDIFFQGHMLPLCLSFSSDHTHNNQ

Query:  FFSKNLSIRSESMDHNMLKFRNGSSSSSSSTSYYSRSSSISNNSISIPTNSKPRSQKNVFHSHPSPSPQIRSLSTSSRRSRSRASSRWDFFRLGLLRTPR
        FF +N S RSES D  ML+FRNGS+SSSSS S+YSRSSSISNNSISIPTNSKPR   NVFHSHPSP+PQIRS STS RRSRS  SSRWDFFRLGLLRTP 
Subjt:  FFSKNLSIRSESMDHNMLKFRNGSSSSSSSTSYYSRSSSISNNSISIPTNSKPRSQKNVFHSHPSPSPQIRSLSTSSRRSRSRASSRWDFFRLGLLRTPR

Query:  MELDDLKTR-NTTSKAPAT-VVQKTIPSFLGVVSCKKSVDTISPPPPPPLVKRLQEHDQKKSCNNNNNNVEIREKEKEKETRLSNRRTFEWLKQLSHASF
        MEL DLKTR N++S A AT     T  SFLGVVSCKKSVDT++                KK  + N    E +EKEKE+ETR+S+RRTFEW+KQLSHAS 
Subjt:  MELDDLKTR-NTTSKAPAT-VVQKTIPSFLGVVSCKKSVDTISPPPPPPLVKRLQEHDQKKSCNNNNNNVEIREKEKEKETRLSNRRTFEWLKQLSHASF

Query:  ADD
         D+
Subjt:  ADD

A0A6J1KZ54 uncharacterized protein LOC111499574 (e value: 9.5e-84; % identity: 68.71)
Query:  DEIWEEEELSLCDLPVKEKQQ------NPINKSTAAVETEDFDFNHWPPPPPPAMCAADDIFFQGHMLPLCLSFSSDHTHNNQFFSKNLSIRSESMDHNM
        +E  EEE LS+CDLPVKEKQQ      NPI        TEDFDFNHWPPPP   MCAADDIFFQGH+LPL LS SSD+THN+ FFSK LS RSESMDHNM
Subjt:  DEIWEEEELSLCDLPVKEKQQ------NPINKSTAAVETEDFDFNHWPPPPPPAMCAADDIFFQGHMLPLCLSFSSDHTHNNQFFSKNLSIRSESMDHNM

Query:  LKFRNGSSSSS-SSTSYYSRSSSISNNSISIPTNSKPRSQKNVFHSHPSPSPQIRSLSTSSRRSRSRASSRWDFFRLGLLRTPRMELDDLKTRNTTS---
        L+FRNGSSSSS SS S+YSR SSISNNSISIPTNSKPR+Q NVFHSHPSP+PQIRS ST  RRS    SSRWDFFR+GLLRTP MEL DLKTR T     
Subjt:  LKFRNGSSSSS-SSTSYYSRSSSISNNSISIPTNSKPRSQKNVFHSHPSPSPQIRSLSTSSRRSRSRASSRWDFFRLGLLRTPRMELDDLKTRNTTS---

Query:  KAPATVVQKTIPSFLGVVSCKKSVDTISPPP-----PPPLVKRLQEHDQKKSCNNN--NNNVEIREKEKEKETRLSNRRTFEWLKQLSHASFAD
        +A  TV QKT  +FLGVVSCKKSVDTI            +VK+  E  +      N  N+NVEIREKEKEK TRLS+RRTFEWLKQLSHA+FAD
Subjt:  KAPATVVQKTIPSFLGVVSCKKSVDTISPPP-----PPPLVKRLQEHDQKKSCNNN--NNNVEIREKEKEKETRLSNRRTFEWLKQLSHASFAD

SwissProt top hits
No hits found
Arabidopsis top hits
AT5G67350.1 unknown protein (e value: 1.6e-11; % identity: 29.84)
Query:  DEIWEEEELSLCDLPVKEKQQNPINKSTAAVETEDFDFNHWPP--------PPPPAMCAADDIFFQGHMLPLCLSFSSDHTHNNQFFSKNLSIRSESMDH
        +E  EEE LSLCDLP ++ +   I K         F+F              P P M  AD++FF+G +LPL  S S D   N       L  RSES++ 
Subjt:  DEIWEEEELSLCDLPVKEKQQNPINKSTAAVETEDFDFNHWPP--------PPPPAMCAADDIFFQGHMLPLCLSFSSDHTHNNQFFSKNLSIRSESMDH

Query:  NMLKFRNGSSSSSSSTSYYSRSSSISNNSISIPTNSKPRSQKNVFHSHPSPSPQIRSLSTSSRRSRS----RASSRWDFFRLGLLRTPRMELDDLKTRNT
            FR         T        I NN I               +S PSP PQIR  S+ + R  S    ++SS WDF RLGL+RTP +EL     R T
Subjt:  NMLKFRNGSSSSSSSTSYYSRSSSISNNSISIPTNSKPRSQKNVFHSHPSPSPQIRSLSTSSRRSRS----RASSRWDFFRLGLLRTPRMELDDLKTRNT

Query:  TSKAPATVVQKTIPSFLGVVS-----------------------CKKSVDTISPPPPPPLVKRLQEHDQKKSCNNNNNNVEIREKEKEKETRLSNRRTFE
           A  +V + +  S     S                       CK SV T +   P  +     E ++K+        +E +  +KE+++ ++ +RTFE
Subjt:  TSKAPATVVQKTIPSFLGVVS-----------------------CKKSVDTISPPPPPPLVKRLQEHDQKKSCNNNNNNVEIREKEKEKETRLSNRRTFE

Query:  WLKQL
        WL Q+
Subjt:  WLKQL


Sequences
CDS sequence
ATGGGAAGAAGCAACAATGGAGATGAAATATGGGAAGAAGAAGAATTATCTTTGTGTGATCTTCCTGTGAAAGAAAAGCAACAAAATCCCATTAATAAATCCACCGCCGC
CGTGGAAACGGAGGATTTTGATTTCAACCACTGGCCGCCGCCGCCGCCGCCGGCCATGTGCGCCGCCGATGATATCTTCTTTCAAGGCCATATGCTTCCTCTTTGTCTTT
CTTTCAGCTCCGATCATACTCACAATAATCAGTTCTTTTCTAAGAATTTGTCCATCAGGTCGGAGTCTATGGATCATAATATGCTGAAGTTTAGAAATGGAAGCAGTAGT
AGCAGCAGTAGTACAAGTTATTATTCCAGGTCTTCAAGTATAAGTAACAATTCAATTTCCATTCCAACGAATTCAAAGCCAAGATCTCAAAAGAACGTTTTTCACTCTCA
TCCAAGTCCCTCTCCCCAAATCAGATCCTTATCAACTTCCAGCCGTCGGAGCCGGAGCCGAGCCTCCTCCCGCTGGGACTTTTTCCGTCTCGGCCTTCTCCGAACACCCA
GAATGGAACTCGACGACCTCAAAACTCGAAACACGACCAGCAAAGCACCCGCGACAGTAGTGCAGAAAACAATCCCCTCGTTTTTGGGCGTCGTCAGTTGCAAGAAATCC
GTAGATACAATATCACCGCCACCGCCACCGCCGCTCGTGAAGAGGTTACAAGAACATGATCAAAAGAAAAGTTGTAATAATAATAATAATAATGTTGAAATTAGAGAAAA
GGAAAAGGAAAAGGAAACGAGGCTGTCAAATCGTCGAACATTTGAATGGCTAAAGCAGCTCTCGCATGCAAGCTTTGCTGACGACTAG
mRNA sequence
ATGGGAAGAAGCAACAATGGAGATGAAATATGGGAAGAAGAAGAATTATCTTTGTGTGATCTTCCTGTGAAAGAAAAGCAACAAAATCCCATTAATAAATCCACCGCCGC
CGTGGAAACGGAGGATTTTGATTTCAACCACTGGCCGCCGCCGCCGCCGCCGGCCATGTGCGCCGCCGATGATATCTTCTTTCAAGGCCATATGCTTCCTCTTTGTCTTT
CTTTCAGCTCCGATCATACTCACAATAATCAGTTCTTTTCTAAGAATTTGTCCATCAGGTCGGAGTCTATGGATCATAATATGCTGAAGTTTAGAAATGGAAGCAGTAGT
AGCAGCAGTAGTACAAGTTATTATTCCAGGTCTTCAAGTATAAGTAACAATTCAATTTCCATTCCAACGAATTCAAAGCCAAGATCTCAAAAGAACGTTTTTCACTCTCA
TCCAAGTCCCTCTCCCCAAATCAGATCCTTATCAACTTCCAGCCGTCGGAGCCGGAGCCGAGCCTCCTCCCGCTGGGACTTTTTCCGTCTCGGCCTTCTCCGAACACCCA
GAATGGAACTCGACGACCTCAAAACTCGAAACACGACCAGCAAAGCACCCGCGACAGTAGTGCAGAAAACAATCCCCTCGTTTTTGGGCGTCGTCAGTTGCAAGAAATCC
GTAGATACAATATCACCGCCACCGCCACCGCCGCTCGTGAAGAGGTTACAAGAACATGATCAAAAGAAAAGTTGTAATAATAATAATAATAATGTTGAAATTAGAGAAAA
GGAAAAGGAAAAGGAAACGAGGCTGTCAAATCGTCGAACATTTGAATGGCTAAAGCAGCTCTCGCATGCAAGCTTTGCTGACGACTAG
Protein sequence
MGRSNNGDEIWEEEELSLCDLPVKEKQQNPINKSTAAVETEDFDFNHWPPPPPPAMCAADDIFFQGHMLPLCLSFSSDHTHNNQFFSKNLSIRSESMDHNMLKFRNGSSS
SSSSTSYYSRSSSISNNSISIPTNSKPRSQKNVFHSHPSPSPQIRSLSTSSRRSRSRASSRWDFFRLGLLRTPRMELDDLKTRNTTSKAPATVVQKTIPSFLGVVSCKKS
VDTISPPPPPPLVKRLQEHDQKKSCNNNNNNVEIREKEKEKETRLSNRRTFEWLKQLSHASFADD
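
The protein sequence should be recoverable from the CDS by translating it codon by codon. A minimal Python sketch of that check, assuming the standard genetic code (NCBI translation table 1). The `translate` helper is hypothetical, and only the first ten and last four codons of the CDS in this record are used for illustration:

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons and last four codons (including the stop) of the CDS.
print(translate("ATGGGAAGAAGCAACAATGGAGATGAAATA"))
print(translate("GCTGACGACTAG"))
```

The two fragments should translate to `MGRSNNGDEI` and `ADD`, matching the start and end of the protein sequence. Passing the full CDS, joined into one string, to `translate` would check the remaining residues the same way.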