; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0027603 (gene) of Chayote v1 genome

Gene IDSed0027603
OrganismSechium edule (Chayote v1)
DescriptionHolliday junction resolvase MOC1, chloroplastic
Genome locationLG01:60007284..60018835
RNA-Seq ExpressionSed0027603
SyntenySed0027603
Gene Ontology termsGO:0006950 - response to stress (biological process)
GO:0009987 - cellular process (biological process)
GO:0016020 - membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7010304.1 hypothetical protein SDJN02_27097 [Cucurbita argyrosperma subsp. argyrosperma]4.6e-10078.82Show/hide
Query:  MESL-----TLQSQSHSSMNSLLSSNLKPRLHGFRFLCT-SSSSSLHSPQIPTNPSSSVRKERNGGANLKVAHAQLKDNWLASLSSPFPRGHDL------
        MESL      LQ+QSH  M S LSS LK  LH FRFLCT SSSSS+ S  IPT PSSSVRKE +GG  LK+AH QLKDNWLASLS PFP GHD       
Subjt:  MESL-----TLQSQSHSSMNSLLSSNLKPRLHGFRFLCT-SSSSSLHSPQIPTNPSSSVRKERNGGANLKVAHAQLKDNWLASLSSPFPRGHDL------

Query:  ---NASSDCVIGVDPDVSGAVALLRTLDSICEPQVYDSPHLQVLVGGRSRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKHGWWSGGFGYGLWI
           NA+SDCVIGVDPDVSGAVALLRT +SI   QVYDSPHLQVLVGG+ RKRLDAKSIVQLLHSFNAPIGTTAYLEQSTP+PQDGK GWWSGGFGYGLWI
Subjt:  ---NASSDCVIGVDPDVSGAVALLRTLDSICEPQVYDSPHLQVLVGGRSRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKHGWWSGGFGYGLWI

Query:  GVLVGLGFSVVPVSSLAWKNKFELSGKDTSKDDSRRLASELFPSLSPLLKRKKDH
        GVLVGLGFSVVPV SLAWKNKF+LSGKDTSKDDSRR+AS LFP+LSPLLKRKKDH
Subjt:  GVLVGLGFSVVPVSSLAWKNKFELSGKDTSKDDSRRLASELFPSLSPLLKRKKDH

XP_008449639.1 PREDICTED: uncharacterized protein LOC103491461 isoform X1 [Cucumis melo]7.8e-10077.51Show/hide
Query:  MESLTLQSQSHSSMNSLLSSNLKPRLHGFRFLCTSSSSSLHSPQIPTNPSSSVRKERNGGANLKVAHAQLKDNWLASLSSPFPRGHDL---------NAS
        ME L LQ+ S+  M SL SS LKP+LH FRFLC+SS SSL SP+I T  SSS+RK+ +G A L +AHAQLKDNWLASLS PFP GHD          NA+
Subjt:  MESLTLQSQSHSSMNSLLSSNLKPRLHGFRFLCTSSSSSLHSPQIPTNPSSSVRKERNGGANLKVAHAQLKDNWLASLSSPFPRGHDL---------NAS

Query:  SDCVIGVDPDVSGAVALLRTLDSICEPQVYDSPHLQVLVGGRSRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKHGWWSGGFGYGLWIGVLVGL
        S+CVIGVDPDVSGAVALLRT +SI   QVYDSPH+QVLVGGR RKRLDAKSIVQLLHSFNAPIGTTAYLEQS P+P+DGK GWW GGFGYGLWIGVLVGL
Subjt:  SDCVIGVDPDVSGAVALLRTLDSICEPQVYDSPHLQVLVGGRSRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKHGWWSGGFGYGLWIGVLVGL

Query:  GFSVVPVSSLAWKNKFELSGKDTSKDDSRRLASELFPSLSPLLKRKKDH
        GFSVVPV  LAWKNKFELSGKDTSKDDSRR+ASELFPSLSPLLKRKKDH
Subjt:  GFSVVPVSSLAWKNKFELSGKDTSKDDSRRLASELFPSLSPLLKRKKDH

XP_022153090.1 uncharacterized protein LOC111020674 isoform X1 [Momordica charantia]4.4e-10381.3Show/hide
Query:  LTLQSQSHSSMNSLLSSNLKPRLHGFRFLCTSSSSSLHSPQIPTNPSSSVRKERNGGANLKVAHAQLKDNWLASLSSPFPRGH---------DLNASSDC
        L LQ QSH  MNS LSS LKP  + FR LCT  SSS+H PQIP  PSSS RKE  G A LK+AH+QLKDNWLASLSSPFP GH         DLNASSDC
Subjt:  LTLQSQSHSSMNSLLSSNLKPRLHGFRFLCTSSSSSLHSPQIPTNPSSSVRKERNGGANLKVAHAQLKDNWLASLSSPFPRGH---------DLNASSDC

Query:  VIGVDPDVSGAVALLRTLDSICEPQVYDSPHLQVLVGGRSRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKHGWWSGGFGYGLWIGVLVGLGFS
        VIGVDPDVSGAVALLRT  S+C  QVYDSPHLQVLVGGR RKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGK GWWSGGFGYGLWIG+LVGLGFS
Subjt:  VIGVDPDVSGAVALLRTLDSICEPQVYDSPHLQVLVGGRSRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKHGWWSGGFGYGLWIGVLVGLGFS

Query:  VVPVSSLAWKNKFELSGKDTSKDDSRRLASELFPSLSPLLKRKKDH
        V+PV SLAWKNKFELSGKDTSKDDSRR+AS LFPSLSPLLKRKKDH
Subjt:  VVPVSSLAWKNKFELSGKDTSKDDSRRLASELFPSLSPLLKRKKDH

XP_023512747.1 Holliday junction resolvase MOC1, chloroplastic [Cucurbita pepo subsp. pepo]4.6e-10078.82Show/hide
Query:  MESL-----TLQSQSHSSMNSLLSSNLKPRLHGFRFLCT-SSSSSLHSPQIPTNPSSSVRKERNGGANLKVAHAQLKDNWLASLSSPFPRGHDL------
        MESL      LQ+QSH  M S LSS LK  LH FRFLCT SSSSS+ S  IPT PSSSVRKE +GG  LK+AH QLKDNWLASLS PFP GHD       
Subjt:  MESL-----TLQSQSHSSMNSLLSSNLKPRLHGFRFLCT-SSSSSLHSPQIPTNPSSSVRKERNGGANLKVAHAQLKDNWLASLSSPFPRGHDL------

Query:  ---NASSDCVIGVDPDVSGAVALLRTLDSICEPQVYDSPHLQVLVGGRSRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKHGWWSGGFGYGLWI
           NA+SDCVIGVDPDVSGAVALLRT +SI   QVYDSPHLQVLVGG+ RKRLDAKSIVQLLHSFNAPIGTTAYLEQSTP+PQDGK GWWSGGFGYGLWI
Subjt:  ---NASSDCVIGVDPDVSGAVALLRTLDSICEPQVYDSPHLQVLVGGRSRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKHGWWSGGFGYGLWI

Query:  GVLVGLGFSVVPVSSLAWKNKFELSGKDTSKDDSRRLASELFPSLSPLLKRKKDH
        GVLVGLGFSVVPV SLAWKNKF+LSGKDTSKDDSRR+AS LFP+LSPLLKRKKDH
Subjt:  GVLVGLGFSVVPVSSLAWKNKFELSGKDTSKDDSRRLASELFPSLSPLLKRKKDH

XP_038902489.1 Holliday junction resolvase MOC1, chloroplastic [Benincasa hispida]1.0e-9976.81Show/hide
Query:  MESL-----TLQSQSHSSMNSLLSSNLKPRLHGFRFLCT---------SSSSSLHSPQIPTNPSSSVRKERNGGANLKVAHAQLKDNWLASLSSPFPRGH
        MESL      LQ+QS+S+MNS LSS L+P LH FR LCT         SSSSSL SP+IPT  SSSVRK+ +GGA LK+AH QLKDNWLASLS PFP G+
Subjt:  MESL-----TLQSQSHSSMNSLLSSNLKPRLHGFRFLCT---------SSSSSLHSPQIPTNPSSSVRKERNGGANLKVAHAQLKDNWLASLSSPFPRGH

Query:  ---------DLNASSDCVIGVDPDVSGAVALLRTLDSICEPQVYDSPHLQVLVGGRSRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKHGWWSG
                 DLN +SDCVIGVDPDVSGAVALLRT +SI   QVYDSPHLQVLVG R RKRLDAKSIVQLLHSFNAPIGT AYLEQS P+PQDGK GWWSG
Subjt:  ---------DLNASSDCVIGVDPDVSGAVALLRTLDSICEPQVYDSPHLQVLVGGRSRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKHGWWSG

Query:  GFGYGLWIGVLVGLGFSVVPVSSLAWKNKFELSGKDTSKDDSRRLASELFPSLSPLLKRKKDH
        GFGYGLWIGVLVGLGFSVVPV SLAWKNKFELSGKDTSKDDSRR+ASELFPSLSPLLKRKKDH
Subjt:  GFGYGLWIGVLVGLGFSVVPVSSLAWKNKFELSGKDTSKDDSRRLASELFPSLSPLLKRKKDH

TrEMBL top hitse value%identityAlignment
A0A0A0KH00 Uncharacterized protein1.1e-9977.51Show/hide
Query:  MESLTLQSQSHSSMNSLLSSNLKPRLHGFRFLCTSSSSSLHSPQIPTNPSSSVRKERNGGANLKVAHAQLKDNWLASLSSPFPRGHDL---------NAS
        ME L LQS S+S MNSL SS LK +LH FRFLCTSS SS+ S +I T  SSSVRK+ +G A L +AHAQLKDNWLASLS PFP GHD          NA+
Subjt:  MESLTLQSQSHSSMNSLLSSNLKPRLHGFRFLCTSSSSSLHSPQIPTNPSSSVRKERNGGANLKVAHAQLKDNWLASLSSPFPRGHDL---------NAS

Query:  SDCVIGVDPDVSGAVALLRTLDSICEPQVYDSPHLQVLVGGRSRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKHGWWSGGFGYGLWIGVLVGL
        S+CVIGVDPDVSGAVALLRT +SI   QVYDSPH+Q+LVGGR RKRLDAKSIVQLLHSFNAPIGTTAYLEQS P+P+DGK GWW GGFGYGLWIGVLVGL
Subjt:  SDCVIGVDPDVSGAVALLRTLDSICEPQVYDSPHLQVLVGGRSRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKHGWWSGGFGYGLWIGVLVGL

Query:  GFSVVPVSSLAWKNKFELSGKDTSKDDSRRLASELFPSLSPLLKRKKDH
        GFSVVPV  LAWKNKFELSGKDTSKDDSRR+ASELFPSL+PLLKRKKDH
Subjt:  GFSVVPVSSLAWKNKFELSGKDTSKDDSRRLASELFPSLSPLLKRKKDH

A0A1S3BMH5 uncharacterized protein LOC103491461 isoform X13.8e-10077.51Show/hide
Query:  MESLTLQSQSHSSMNSLLSSNLKPRLHGFRFLCTSSSSSLHSPQIPTNPSSSVRKERNGGANLKVAHAQLKDNWLASLSSPFPRGHDL---------NAS
        ME L LQ+ S+  M SL SS LKP+LH FRFLC+SS SSL SP+I T  SSS+RK+ +G A L +AHAQLKDNWLASLS PFP GHD          NA+
Subjt:  MESLTLQSQSHSSMNSLLSSNLKPRLHGFRFLCTSSSSSLHSPQIPTNPSSSVRKERNGGANLKVAHAQLKDNWLASLSSPFPRGHDL---------NAS

Query:  SDCVIGVDPDVSGAVALLRTLDSICEPQVYDSPHLQVLVGGRSRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKHGWWSGGFGYGLWIGVLVGL
        S+CVIGVDPDVSGAVALLRT +SI   QVYDSPH+QVLVGGR RKRLDAKSIVQLLHSFNAPIGTTAYLEQS P+P+DGK GWW GGFGYGLWIGVLVGL
Subjt:  SDCVIGVDPDVSGAVALLRTLDSICEPQVYDSPHLQVLVGGRSRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKHGWWSGGFGYGLWIGVLVGL

Query:  GFSVVPVSSLAWKNKFELSGKDTSKDDSRRLASELFPSLSPLLKRKKDH
        GFSVVPV  LAWKNKFELSGKDTSKDDSRR+ASELFPSLSPLLKRKKDH
Subjt:  GFSVVPVSSLAWKNKFELSGKDTSKDDSRRLASELFPSLSPLLKRKKDH

A0A5D3B8B8 Uncharacterized protein3.8e-10077.51Show/hide
Query:  MESLTLQSQSHSSMNSLLSSNLKPRLHGFRFLCTSSSSSLHSPQIPTNPSSSVRKERNGGANLKVAHAQLKDNWLASLSSPFPRGHDL---------NAS
        ME L LQ+ S+  M SL SS LKP+LH FRFLC+SS SSL SP+I T  SSS+RK+ +G A L +AHAQLKDNWLASLS PFP GHD          NA+
Subjt:  MESLTLQSQSHSSMNSLLSSNLKPRLHGFRFLCTSSSSSLHSPQIPTNPSSSVRKERNGGANLKVAHAQLKDNWLASLSSPFPRGHDL---------NAS

Query:  SDCVIGVDPDVSGAVALLRTLDSICEPQVYDSPHLQVLVGGRSRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKHGWWSGGFGYGLWIGVLVGL
        S+CVIGVDPDVSGAVALLRT +SI   QVYDSPH+QVLVGGR RKRLDAKSIVQLLHSFNAPIGTTAYLEQS P+P+DGK GWW GGFGYGLWIGVLVGL
Subjt:  SDCVIGVDPDVSGAVALLRTLDSICEPQVYDSPHLQVLVGGRSRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKHGWWSGGFGYGLWIGVLVGL

Query:  GFSVVPVSSLAWKNKFELSGKDTSKDDSRRLASELFPSLSPLLKRKKDH
        GFSVVPV  LAWKNKFELSGKDTSKDDSRR+ASELFPSLSPLLKRKKDH
Subjt:  GFSVVPVSSLAWKNKFELSGKDTSKDDSRRLASELFPSLSPLLKRKKDH

A0A6J1DFT4 uncharacterized protein LOC111020674 isoform X12.1e-10381.3Show/hide
Query:  LTLQSQSHSSMNSLLSSNLKPRLHGFRFLCTSSSSSLHSPQIPTNPSSSVRKERNGGANLKVAHAQLKDNWLASLSSPFPRGH---------DLNASSDC
        L LQ QSH  MNS LSS LKP  + FR LCT  SSS+H PQIP  PSSS RKE  G A LK+AH+QLKDNWLASLSSPFP GH         DLNASSDC
Subjt:  LTLQSQSHSSMNSLLSSNLKPRLHGFRFLCTSSSSSLHSPQIPTNPSSSVRKERNGGANLKVAHAQLKDNWLASLSSPFPRGH---------DLNASSDC

Query:  VIGVDPDVSGAVALLRTLDSICEPQVYDSPHLQVLVGGRSRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKHGWWSGGFGYGLWIGVLVGLGFS
        VIGVDPDVSGAVALLRT  S+C  QVYDSPHLQVLVGGR RKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGK GWWSGGFGYGLWIG+LVGLGFS
Subjt:  VIGVDPDVSGAVALLRTLDSICEPQVYDSPHLQVLVGGRSRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKHGWWSGGFGYGLWIGVLVGLGFS

Query:  VVPVSSLAWKNKFELSGKDTSKDDSRRLASELFPSLSPLLKRKKDH
        V+PV SLAWKNKFELSGKDTSKDDSRR+AS LFPSLSPLLKRKKDH
Subjt:  VVPVSSLAWKNKFELSGKDTSKDDSRRLASELFPSLSPLLKRKKDH

A0A6J1FUK2 Holliday junction resolvase MOC1, chloroplastic4.9e-10078.74Show/hide
Query:  MESL-----TLQSQSHSSMNSLLSSNLKPRLHGFRFLCTSSSSSLHSPQIPTNPSSSVRKERNGGANLKVAHAQLKDNWLASLSSPFPRGHDL-------
        MESL      LQ+QSH  M S LSS LK  LH FRFLCT SSSS+ S  IPT PSSSVRKE +GG  LK+AH QLKDNWLASLS PFP GHD        
Subjt:  MESL-----TLQSQSHSSMNSLLSSNLKPRLHGFRFLCTSSSSSLHSPQIPTNPSSSVRKERNGGANLKVAHAQLKDNWLASLSSPFPRGHDL-------

Query:  --NASSDCVIGVDPDVSGAVALLRTLDSICEPQVYDSPHLQVLVGGRSRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKHGWWSGGFGYGLWIG
          NA+SDCVIGVDPDVSGAVALLRT +SI   QVYDSPHLQVLVGG+ RKRLDAKSIVQLLHSFNAPIGTTAYLEQSTP+PQDGK GWWSGGFGYGLWIG
Subjt:  --NASSDCVIGVDPDVSGAVALLRTLDSICEPQVYDSPHLQVLVGGRSRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKHGWWSGGFGYGLWIG

Query:  VLVGLGFSVVPVSSLAWKNKFELSGKDTSKDDSRRLASELFPSLSPLLKRKKDH
        VLVGLGFSVVPV SLAWKNKF+LSGKDTSKDDSRR+AS LFP+LSPLLKRKKDH
Subjt:  VLVGLGFSVVPVSSLAWKNKFELSGKDTSKDDSRRLASELFPSLSPLLKRKKDH

SwissProt top hitse value%identityAlignment
Q8GWA2 Holliday junction resolvase MOC1, chloroplastic2.2e-5259.89Show/hide
Query:  AQLKDNWLASLS--SPFPRGHDLNASSDCVIGVDPDVSGAVALLR--TLDSICEPQVYDSPHLQVLVGGRSRKRLDAKSIVQLLHSFNAPIGTTAYLEQS
        A +K+ WL SLS  S        NA S C+IG+DPD+SGA+ALL+   L S    QVYD+PH+ VLVG R RKRLDAKSIVQL+ S + P G+  Y+EQS
Subjt:  AQLKDNWLASLS--SPFPRGHDLNASSDCVIGVDPDVSGAVALLR--TLDSICEPQVYDSPHLQVLVGGRSRKRLDAKSIVQLLHSFNAPIGTTAYLEQS

Query:  TPYPQDGKHGWWSGGFGYGLWIGVLVGLGFSVVPVSSLAWKNKFELSGKDTSKDDSRRLASELFPSLSPLLKRKKDH
         P+P+DGK GW+SGGFGYGLWIG LV  GF V+PVS+  WK  F+L+    +KDDSRR+A+ELFPSLS  LKRKKDH
Subjt:  TPYPQDGKHGWWSGGFGYGLWIGVLVGLGFSVVPVSSLAWKNKFELSGKDTSKDDSRRLASELFPSLSPLLKRKKDH

Arabidopsis top hitse value%identityAlignment
AT2G26840.1 unknown protein1.5e-5359.89Show/hide
Query:  AQLKDNWLASLS--SPFPRGHDLNASSDCVIGVDPDVSGAVALLR--TLDSICEPQVYDSPHLQVLVGGRSRKRLDAKSIVQLLHSFNAPIGTTAYLEQS
        A +K+ WL SLS  S        NA S C+IG+DPD+SGA+ALL+   L S    QVYD+PH+ VLVG R RKRLDAKSIVQL+ S + P G+  Y+EQS
Subjt:  AQLKDNWLASLS--SPFPRGHDLNASSDCVIGVDPDVSGAVALLR--TLDSICEPQVYDSPHLQVLVGGRSRKRLDAKSIVQLLHSFNAPIGTTAYLEQS

Query:  TPYPQDGKHGWWSGGFGYGLWIGVLVGLGFSVVPVSSLAWKNKFELSGKDTSKDDSRRLASELFPSLSPLLKRKKDH
         P+P+DGK GW+SGGFGYGLWIG LV  GF V+PVS+  WK  F+L+    +KDDSRR+A+ELFPSLS  LKRKKDH
Subjt:  TPYPQDGKHGWWSGGFGYGLWIGVLVGLGFSVVPVSSLAWKNKFELSGKDTSKDDSRRLASELFPSLSPLLKRKKDH

AT2G26840.2 unknown protein7.4e-4846.09Show/hide
Query:  AQLKDNWLASLS--SPFPRGHDLNASSDCVIGVDPDVSGAVALL----------------------------------------------RTLDSICEP-
        A +K+ WL SLS  S        NA S C+IG+DPD+SGA+ALL                                              R  +  CE  
Subjt:  AQLKDNWLASLS--SPFPRGHDLNASSDCVIGVDPDVSGAVALL----------------------------------------------RTLDSICEP-

Query:  --------QVYDSPHLQVLVGGRSRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKHGWWSGGFGYGLWIGVLVGLGFSVVPVSSLAWKNKFELS
                +VYD+PH+ VLVG R RKRLDAKSIVQL+ S + P G+  Y+EQS P+P+DGK GW+SGGFGYGLWIG LV  GF V+PVS+  WK  F+L+
Subjt:  --------QVYDSPHLQVLVGGRSRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKHGWWSGGFGYGLWIGVLVGLGFSVVPVSSLAWKNKFELS

Query:  GKDTSKDDSRRLASELFPSLSPLLKRKKDH
            +KDDSRR+A+ELFPSLS  LKRKKDH
Subjt:  GKDTSKDDSRRLASELFPSLSPLLKRKKDH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATCACTTACATTGCAATCTCAGTCCCACTCTTCCATGAACTCCCTCCTCTCTTCCAACCTCAAACCCAGACTTCACGGCTTCAGATTTCTCTGCACTTCGTCTTC
TTCGTCACTTCATTCTCCACAAATCCCTACAAACCCTTCTTCTTCTGTTCGCAAAGAGCGCAACGGCGGAGCCAATTTGAAAGTCGCCCACGCTCAGCTCAAGGACAACT
GGTTGGCTTCTCTTTCCTCTCCTTTTCCTCGAGGTCACGACCTGAATGCGAGCTCCGACTGCGTTATTGGCGTCGATCCCGACGTTTCTGGTGCTGTCGCGCTCTTGAGA
ACCCTCGATTCCATTTGTGAGCCTCAGGTATATGATTCTCCACACCTCCAAGTACTAGTTGGTGGAAGAAGCAGGAAACGCTTAGATGCTAAATCGATTGTCCAACTTCT
TCACAGCTTTAATGCTCCAATTGGAACTACTGCATATCTGGAGCAGTCAACCCCATATCCACAGGATGGAAAACATGGGTGGTGGAGTGGAGGATTTGGATATGGATTGT
GGATTGGCGTATTAGTTGGGCTGGGATTTTCTGTTGTTCCCGTGTCATCTCTTGCGTGGAAAAACAAATTTGAGCTGTCAGGAAAGGATACTTCTAAGGACGACAGCCGG
AGGCTTGCATCAGAATTATTTCCATCTCTAAGCCCCCTTCTGAAAAGGAAAAAAGATCATGAGATGGATCAAGAAAATGTTCGCTTATTTCAAGAGATGTGGCAAAATTT
TTACAATGACATGAAAAGTTATTTTAGAGAGATAAGAGAGGAAAGTGCAAGGGAGCAGGAACAAAGAAGATTAGAAAGGGAGGAGCAGTCCAAGGCAAGGGAACAATGGT
ACTCAGAGATGGAAGCTAGGATTCGAGGAAGAAGGGAGGTTGTGGAAGAAACAGTGACGGTTAGGGTTTTGACACCGCCACCTCCGCTGCCACCACTGCTGCCGCTGCCG
CCGTCACCACCTCCACCACCACCATCGCCGCCACCACCTCCGACATCGCCTCTGCCACCTCCGTTGCCACCACCGCTACCGCCATCGCCACCACCTCTGACATCGCCGCC
GCCACCACCGCTGCCACCTCCGTCGCCACCACCGCTGCCACCTTCGCCACCACCTCCGCCACCAACGCCGCCGTTTCCGCCACCACCGACGCGCCGCTGCCGCAATCGCC
GCTGCTACCGCTGGAAAAAGAAAAAGAAAAAAAAGAAAAGAAAAGAAGTTTTCATTTTTTTATTTTTCTATTTTTGTGTATGCCCACCTTGA
mRNA sequenceShow/hide mRNA sequence
ACGAAGAATAAGTGAGATCTAGTGAATGGGTCGGTGACGCGATTAATTATCTAGTTTGATGTTAATCCATTTCTATTCCACGATCAAAGATTCGGTTTTCGGAAGTGCTC
CTAATTATCCATTAGAATTTAGAAGGTGGGTGTGATTTCTATTGGTGCGAAGTTTATGAATTGATGGAATCACTTACATTGCAATCTCAGTCCCACTCTTCCATGAACTC
CCTCCTCTCTTCCAACCTCAAACCCAGACTTCACGGCTTCAGATTTCTCTGCACTTCGTCTTCTTCGTCACTTCATTCTCCACAAATCCCTACAAACCCTTCTTCTTCTG
TTCGCAAAGAGCGCAACGGCGGAGCCAATTTGAAAGTCGCCCACGCTCAGCTCAAGGACAACTGGTTGGCTTCTCTTTCCTCTCCTTTTCCTCGAGGTCACGACCTGAAT
GCGAGCTCCGACTGCGTTATTGGCGTCGATCCCGACGTTTCTGGTGCTGTCGCGCTCTTGAGAACCCTCGATTCCATTTGTGAGCCTCAGGTATATGATTCTCCACACCT
CCAAGTACTAGTTGGTGGAAGAAGCAGGAAACGCTTAGATGCTAAATCGATTGTCCAACTTCTTCACAGCTTTAATGCTCCAATTGGAACTACTGCATATCTGGAGCAGT
CAACCCCATATCCACAGGATGGAAAACATGGGTGGTGGAGTGGAGGATTTGGATATGGATTGTGGATTGGCGTATTAGTTGGGCTGGGATTTTCTGTTGTTCCCGTGTCA
TCTCTTGCGTGGAAAAACAAATTTGAGCTGTCAGGAAAGGATACTTCTAAGGACGACAGCCGGAGGCTTGCATCAGAATTATTTCCATCTCTAAGCCCCCTTCTGAAAAG
GAAAAAAGATCATGAGATGGATCAAGAAAATGTTCGCTTATTTCAAGAGATGTGGCAAAATTTTTACAATGACATGAAAAGTTATTTTAGAGAGATAAGAGAGGAAAGTG
CAAGGGAGCAGGAACAAAGAAGATTAGAAAGGGAGGAGCAGTCCAAGGCAAGGGAACAATGGTACTCAGAGATGGAAGCTAGGATTCGAGGAAGAAGGGAGGTTGTGGAA
GAAACAGTGACGGTTAGGGTTTTGACACCGCCACCTCCGCTGCCACCACTGCTGCCGCTGCCGCCGTCACCACCTCCACCACCACCATCGCCGCCACCACCTCCGACATC
GCCTCTGCCACCTCCGTTGCCACCACCGCTACCGCCATCGCCACCACCTCTGACATCGCCGCCGCCACCACCGCTGCCACCTCCGTCGCCACCACCGCTGCCACCTTCGC
CACCACCTCCGCCACCAACGCCGCCGTTTCCGCCACCACCGACGCGCCGCTGCCGCAATCGCCGCTGCTACCGCTGGAAAAAGAAAAAGAAAAAAAAGAAAAGAAAAGAA
GTTTTCATTTTTTTATTTTTCTATTTTTGTGTATGCCCACCTTGAGGACAAGGTGCTTTAAAGGGGCGGGTATTGATAAGGTTGGGCCTAGGGTTCTATTTAGCTATTAC
TTGGAGAATAAGTTAGTATTTTGTTATAAATAGAGTGGGTTAGGTTAGGGTTTAGGCATCCAATTGTTAAATGAGTTATTGAGTGATTTATTATAGGGCTCAAGAGATTC
TCTTGAGAAGGGAGATTCCAAGTTCCTATCAAAACTTGGTTTATCTTTATTCTTTTATCTTTTGATATAATCTAGATCTTGGATCATATCACATACCTTATGAATTACAC
ACAACTTGGCATTTTAAATATGCAAGTCTTGGGCCTTTTAACTCGTCTATGCCAAGTTAGAGCCAAGGTTGAAGATAAAGACTTGTTTTGTAAACGCTTTTTGTTCATTT
TGATTAATTCACTATGATTTATTCCTTCGGCTGTGGGGTTGTGAAATTATTATACCTAAGTTATGAATTTTGTGATACAACCCGAGATCTATAATGTTATCAAAACATAA
AGAAACTAAATAAAAGATAAACCAAGTTCTGATGGAAACTTGACTCCCATTCTTTAGAGAATACTCTCTATGAACCCACAAGATTTCACTTAATATTTAGATGCCTCTAC
CATTACCCCCGTTGCATCTATTTATAACCAAATGGCTAACAAACTAATTGTCTAATAACTTAACTACCCCTACATTTCTCCCTTATATTCTAATGTAGGTACGTATCATT
TTGTTTTTCGTTATAGATCTGCAGAAGAATGTAAGAAATGTGAGAATGTGACTTAGCGGATGGTTCTCTACCATCAGAAACGAATGTGAGAAATGTCCACTGATTTGGAA
CCGTCTTTAATGGCCGACTGCATGCTGCCTTGAATTACTAGATCCAGTTAATTTCAGTTCTTTTCTAAACCATCACGGCCATGCCTAGTGGTTAAGAAAAGTCTTTAAGT
AACTAAGATGTGATGAATTCAATATTTAGTGACCGCCCATCTAGAATTTAATTTCTTACAAATTTTCTTGACACCTAAATGTTGTGGAATCAGACGGTCTGTCCCATGAA
AATAGGCAATGTGTGTGTAAGTTGGTATGGACACTCATAGATATAAAAAAAAAGTTCTTTCAATTTCCCGGCTGCAGAGGACTAAAAGCCACGGTCTTCTCTCAACATTG
TATCTTCATTTCATTGCATAAACTCATGGATATAAAAAAAGGGGTTCTTTTCTACAATTTTCCTGCTGCTGGAGACTAAAATCTATGGTCTTCTCTCAACATTATATCTT
CATTTCATTGTATAAACCAGGAAAAACGAGTTCGGGTTGATACACAAATGCTTGAATGTGTCCGACTTGCAGAAGAAATGTCTATGAACTGGTTATGACCGCTTCTGAGT
TTCTCGTATTTGTAGGGAGGGCTGAGGCTTTACTCATTGCTGCTTATGGAAAAGGCTTGGGCATGAAGTTGGAAACACCTTCCATCTTAGAGTCGATAGCATCATAGCAA
TGGCAGCTGGTCCTAGCTTTCTTCTTCCCCAAATTAGATTCAAGAATGGTGTTAAATTGAAAGAACAAACGAAATCGGTGAGTGCTCGAGCCAACTTATGTGCAACTGTC
AGATATATTCCCCATAGATGGGGTTCCTTTCTTTAGATTACTTTACTTTAGACAACCATTTATCTCGCTAAGTTAGAAATGAGCAGTGTAGCAATGTTTCTCTGTAACCA
AAAAATGGATACAGGCTCTTAATATCGTTATAGAGATTGGAAAATTTTTAAGGCCTCAATAACCGTTTGTGTTTTTGGTTTTTGAAATTGAAGCTTATTTTTACTTATAT
TTTCTACGATATGCTTCATCTTTCATACAATGTACCTAACTTTTTTGAGAAAGTACGTGAATACTGAC
Protein sequenceShow/hide protein sequence
MESLTLQSQSHSSMNSLLSSNLKPRLHGFRFLCTSSSSSLHSPQIPTNPSSSVRKERNGGANLKVAHAQLKDNWLASLSSPFPRGHDLNASSDCVIGVDPDVSGAVALLR
TLDSICEPQVYDSPHLQVLVGGRSRKRLDAKSIVQLLHSFNAPIGTTAYLEQSTPYPQDGKHGWWSGGFGYGLWIGVLVGLGFSVVPVSSLAWKNKFELSGKDTSKDDSR
RLASELFPSLSPLLKRKKDHEMDQENVRLFQEMWQNFYNDMKSYFREIREESAREQEQRRLEREEQSKAREQWYSEMEARIRGRREVVEETVTVRVLTPPPPLPPLLPLP
PSPPPPPPSPPPPPTSPLPPPLPPPLPPSPPPLTSPPPPPLPPPSPPPLPPSPPPPPPTPPFPPPPTRRCRNRRCYRWKKKKKKKKRKEVFIFLFFYFCVCPP