CuGenDBv2

CmoCh05G012110 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh05G012110
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Zeta-carotene desaturase
Genome location: Cmo_Chr05:9489849..9498653
RNA-Seq Expression: CmoCh05G012110
Synteny: CmoCh05G012110
Gene Ontology terms: NA
InterPro domains: IPR018790 - Protein of unknown function DUF2358


Homology
GenBank top hits (e value, % identity, alignment)
KAG6599294.1 hypothetical protein SDJN03_09072, partial [Cucurbita argyrosperma subsp. sororia] | e value: 3.2e-103 | % identity: 83
Query:  MTGICYCPPLPPSLSPKMVTAIRCFSSSPENSNSSRKKEASAIVKIAVSGVTELLRLFSSPISKSCINSFLEHARERYEFDSGSCLCCRVDGVGDNQGEE
        MTGICYCPPLPPSLSPKMVTAIRCFSSSPENSNSSRKKEASAIVKIAVSGVTELLRLFSSPISK                        RVDGVGDNQGEE
Subjt:  MTGICYCPPLPPSLSPKMVTAIRCFSSSPENSNSSRKKEASAIVKIAVSGVTELLRLFSSPISKSCINSFLEHARERYEFDSGSCLCCRVDGVGDNQGEE

Query:  FVVAGIDDVLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDKGMYMAAKNGTS--------ETYLKLP
        FVVAG+DDVLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDKG     K+G           TYLKLP
Subjt:  FVVAGIDDVLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDKGMYMAAKNGTS--------ETYLKLP

Query:  WRPLISIDGSTLYELDEELKIARHAESWSVSALEAITQIFIPSFKGD
        WRPLISIDGSTLYELDEELKIARHAESWSVSALEAITQIFIPSFKGD
Subjt:  WRPLISIDGSTLYELDEELKIARHAESWSVSALEAITQIFIPSFKGD

XP_004139222.1 uncharacterized protein LOC101220191 isoform X1 [Cucumis sativus] | e value: 1.1e-92 | % identity: 75.52
Query:  MTGICYCPPLPPSLSPKMVTAIRCFSSSPENSNSSRKKEASAIVKIAVSGVTELLRLFSSPISKSCINSFLEHARERYEFDSGSCLCCRVDGVGDNQGEE
        M GICYCPPLPPSLSPK V+ IRCFSSSPENSN SRKKEASAIVKI VSG+TELLRLFSSPISK                        RVD + DNQGEE
Subjt:  MTGICYCPPLPPSLSPKMVTAIRCFSSSPENSNSSRKKEASAIVKIAVSGVTELLRLFSSPISKSCINSFLEHARERYEFDSGSCLCCRVDGVGDNQGEE

Query:  FVVAGIDDVLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDK----GMYMAAKNGTSETYLKLPWRPL
        FVV G+D+V+NILKSDYENAYFVTGIFTSAIY DDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTI+K    G+          TYLKLPWRPL
Subjt:  FVVAGIDDVLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDK----GMYMAAKNGTSETYLKLPWRPL

Query:  ISIDGSTLYELDEELKIARHAESWSVSALEAITQIFIPSFK
        ISIDG+TLYELDEE KI RHAESWSVSALEAITQIFIPSF+
Subjt:  ISIDGSTLYELDEELKIARHAESWSVSALEAITQIFIPSFK

XP_022946926.1 uncharacterized protein LOC111450860 [Cucurbita moschata] | e value: 1.4e-103 | % identity: 83.4
Query:  MTGICYCPPLPPSLSPKMVTAIRCFSSSPENSNSSRKKEASAIVKIAVSGVTELLRLFSSPISKSCINSFLEHARERYEFDSGSCLCCRVDGVGDNQGEE
        MTGICYCPPLPPSLSPKMVTAIRCFSSSPENSNSSRKKEASAIVKIAVSGVTELLRLFSSPISK                        RVDGVGDNQGEE
Subjt:  MTGICYCPPLPPSLSPKMVTAIRCFSSSPENSNSSRKKEASAIVKIAVSGVTELLRLFSSPISKSCINSFLEHARERYEFDSGSCLCCRVDGVGDNQGEE

Query:  FVVAGIDDVLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDKGMYMAAKNGTS--------ETYLKLP
        FVVAGIDDVLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDKG     K+G           TYLKLP
Subjt:  FVVAGIDDVLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDKGMYMAAKNGTS--------ETYLKLP

Query:  WRPLISIDGSTLYELDEELKIARHAESWSVSALEAITQIFIPSFKGD
        WRPLISIDGSTLYELDEELKIARHAESWSVSALEAITQIFIPSFKGD
Subjt:  WRPLISIDGSTLYELDEELKIARHAESWSVSALEAITQIFIPSFKGD

XP_022999247.1 uncharacterized protein LOC111493675 [Cucurbita maxima] | e value: 1.7e-101 | % identity: 81.78
Query:  MTGICYCPPLPPSLSPKMVTAIRCFSSSPENSNSSRKKEASAIVKIAVSGVTELLRLFSSPISKSCINSFLEHARERYEFDSGSCLCCRVDGVGDNQGEE
        MTGICYCPPLPPSLSPKMVTAIRCFSSSPENSN SRKKEASAIVKIAVSG TELLRLFSSPISK                        RV+GVGDNQGEE
Subjt:  MTGICYCPPLPPSLSPKMVTAIRCFSSSPENSNSSRKKEASAIVKIAVSGVTELLRLFSSPISKSCINSFLEHARERYEFDSGSCLCCRVDGVGDNQGEE

Query:  FVVAGIDDVLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDKGMYMAAKNGTS--------ETYLKLP
        FVVAGIDDVLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTI+KG     K+G           TYLKLP
Subjt:  FVVAGIDDVLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDKGMYMAAKNGTS--------ETYLKLP

Query:  WRPLISIDGSTLYELDEELKIARHAESWSVSALEAITQIFIPSFKGD
        WRPLISIDGSTLYELDEELKIARHAESWSVSALEAITQIFIPSFKGD
Subjt:  WRPLISIDGSTLYELDEELKIARHAESWSVSALEAITQIFIPSFKGD

XP_023545588.1 uncharacterized protein LOC111804973 [Cucurbita pepo subsp. pepo] | e value: 3.5e-102 | % identity: 82.19
Query:  MTGICYCPPLPPSLSPKMVTAIRCFSSSPENSNSSRKKEASAIVKIAVSGVTELLRLFSSPISKSCINSFLEHARERYEFDSGSCLCCRVDGVGDNQGEE
        MTGICYCPPLPPSLSPKMVTAIRCFSSSPENSN SR+KEASAIVKIAVSGVTELLRLFSSPISK                        RVDGVGDNQGEE
Subjt:  MTGICYCPPLPPSLSPKMVTAIRCFSSSPENSNSSRKKEASAIVKIAVSGVTELLRLFSSPISKSCINSFLEHARERYEFDSGSCLCCRVDGVGDNQGEE

Query:  FVVAGIDDVLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDKGMYMAAKNGTS--------ETYLKLP
        FVVAGIDDVLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTI+KG     K+G           TYLKLP
Subjt:  FVVAGIDDVLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDKGMYMAAKNGTS--------ETYLKLP

Query:  WRPLISIDGSTLYELDEELKIARHAESWSVSALEAITQIFIPSFKGD
        WRPLISIDGSTLYELDEELKIARHAESWSVSALEAITQIFIPSFKGD
Subjt:  WRPLISIDGSTLYELDEELKIARHAESWSVSALEAITQIFIPSFKGD

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LJD8 Uncharacterized protein | e value: 5.5e-93 | % identity: 75.52
Query:  MTGICYCPPLPPSLSPKMVTAIRCFSSSPENSNSSRKKEASAIVKIAVSGVTELLRLFSSPISKSCINSFLEHARERYEFDSGSCLCCRVDGVGDNQGEE
        M GICYCPPLPPSLSPK V+ IRCFSSSPENSN SRKKEASAIVKI VSG+TELLRLFSSPISK                        RVD + DNQGEE
Subjt:  MTGICYCPPLPPSLSPKMVTAIRCFSSSPENSNSSRKKEASAIVKIAVSGVTELLRLFSSPISKSCINSFLEHARERYEFDSGSCLCCRVDGVGDNQGEE

Query:  FVVAGIDDVLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDK----GMYMAAKNGTSETYLKLPWRPL
        FVV G+D+V+NILKSDYENAYFVTGIFTSAIY DDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTI+K    G+          TYLKLPWRPL
Subjt:  FVVAGIDDVLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDK----GMYMAAKNGTSETYLKLPWRPL

Query:  ISIDGSTLYELDEELKIARHAESWSVSALEAITQIFIPSFK
        ISIDG+TLYELDEE KI RHAESWSVSALEAITQIFIPSF+
Subjt:  ISIDGSTLYELDEELKIARHAESWSVSALEAITQIFIPSFK

A0A1S3C2Z9 uncharacterized protein LOC103495870 | e value: 1.8e-88 | % identity: 72.02
Query:  MTGICYCPPLPPSLSPKMVTAIRCFSSSPENSNSSRKKEASAIVKIAVSGVTELLRLFSSPISKSCINSFLEHARERYEFDSGSCLCCRVDGVGDNQGEE
        M GI YCPPLPPSLSPK V+AIRC SSSPENSN SRKKEAS I+KI VSG+TELLRLFSSPISK                        RVDG+ DNQ EE
Subjt:  MTGICYCPPLPPSLSPKMVTAIRCFSSSPENSNSSRKKEASAIVKIAVSGVTELLRLFSSPISKSCINSFLEHARERYEFDSGSCLCCRVDGVGDNQGEE

Query:  FVVAGIDDVLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDK----GMYMAAKNGTSETYLKLPWRPL
        FVV G+D+V++ILKSDYENAYFVTG FTSAIY D CLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQL+TI+K    G+          TYLKLPWRPL
Subjt:  FVVAGIDDVLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDK----GMYMAAKNGTSETYLKLPWRPL

Query:  ISIDGSTLYELDEELKIARHAESWSVSALEAITQIFIPSFKGD
        ISIDG+TLYELD+E KI RHAESWSVSALEAITQIFIPSF+ D
Subjt:  ISIDGSTLYELDEELKIARHAESWSVSALEAITQIFIPSFKGD

A0A6J1CMH1 uncharacterized protein LOC111012984 isoform X5 | e value: 4.8e-89 | % identity: 73.47
Query:  MTGICYCPPLPPSLSPKMVTAIRCFSSSPENSNSSRKKEASAIVKIAVSGVTELLRLFSSPISKSCINSFLEHARERY--EFDSGSCLCCRVDGVGDNQG
        M GIC  PPLP SLSPK    +RCFSSSP+NS   RKKEAS IVKIAVSGVTELLRLFSS  SKS    +    R  +      G CL CR DGVGDNQ 
Subjt:  MTGICYCPPLPPSLSPKMVTAIRCFSSSPENSNSSRKKEASAIVKIAVSGVTELLRLFSSPISKSCINSFLEHARERY--EFDSGSCLCCRVDGVGDNQG

Query:  EEFVVAGIDDVLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDK----GMYMAAKNGTSETYLKLPWR
        EEFVVAG+DDVL+ILKSDY+NAYFVTGIFTS IYADDCLFEDPTI+FRGKELYSRNLKLLVPFFDCPSIQLQ I+K    G+          TYLKLPWR
Subjt:  EEFVVAGIDDVLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDK----GMYMAAKNGTSETYLKLPWR

Query:  PLISIDGSTLYELDEELKIARHAESWSVSALEAITQIFIPSFKGD
        PLI+I GSTLYELDEELKI RHAESWSVSALEAI QIFIPSF+GD
Subjt:  PLISIDGSTLYELDEELKIARHAESWSVSALEAITQIFIPSFKGD

A0A6J1G4Y1 uncharacterized protein LOC111450860 | e value: 6.9e-104 | % identity: 83.4
Query:  MTGICYCPPLPPSLSPKMVTAIRCFSSSPENSNSSRKKEASAIVKIAVSGVTELLRLFSSPISKSCINSFLEHARERYEFDSGSCLCCRVDGVGDNQGEE
        MTGICYCPPLPPSLSPKMVTAIRCFSSSPENSNSSRKKEASAIVKIAVSGVTELLRLFSSPISK                        RVDGVGDNQGEE
Subjt:  MTGICYCPPLPPSLSPKMVTAIRCFSSSPENSNSSRKKEASAIVKIAVSGVTELLRLFSSPISKSCINSFLEHARERYEFDSGSCLCCRVDGVGDNQGEE

Query:  FVVAGIDDVLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDKGMYMAAKNGTS--------ETYLKLP
        FVVAGIDDVLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDKG     K+G           TYLKLP
Subjt:  FVVAGIDDVLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDKGMYMAAKNGTS--------ETYLKLP

Query:  WRPLISIDGSTLYELDEELKIARHAESWSVSALEAITQIFIPSFKGD
        WRPLISIDGSTLYELDEELKIARHAESWSVSALEAITQIFIPSFKGD
Subjt:  WRPLISIDGSTLYELDEELKIARHAESWSVSALEAITQIFIPSFKGD

A0A6J1KAD4 uncharacterized protein LOC111493675 | e value: 8.4e-102 | % identity: 81.78
Query:  MTGICYCPPLPPSLSPKMVTAIRCFSSSPENSNSSRKKEASAIVKIAVSGVTELLRLFSSPISKSCINSFLEHARERYEFDSGSCLCCRVDGVGDNQGEE
        MTGICYCPPLPPSLSPKMVTAIRCFSSSPENSN SRKKEASAIVKIAVSG TELLRLFSSPISK                        RV+GVGDNQGEE
Subjt:  MTGICYCPPLPPSLSPKMVTAIRCFSSSPENSNSSRKKEASAIVKIAVSGVTELLRLFSSPISKSCINSFLEHARERYEFDSGSCLCCRVDGVGDNQGEE

Query:  FVVAGIDDVLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDKGMYMAAKNGTS--------ETYLKLP
        FVVAGIDDVLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTI+KG     K+G           TYLKLP
Subjt:  FVVAGIDDVLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDKGMYMAAKNGTS--------ETYLKLP

Query:  WRPLISIDGSTLYELDEELKIARHAESWSVSALEAITQIFIPSFKGD
        WRPLISIDGSTLYELDEELKIARHAESWSVSALEAITQIFIPSFKGD
Subjt:  WRPLISIDGSTLYELDEELKIARHAESWSVSALEAITQIFIPSFKGD

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G46100.1 Nuclear transport factor 2 (NTF2) family protein | e value: 6.5e-14 | % identity: 29.69
Query:  VLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDKGMYMAAKNGTSETYLKLPWRPLISIDGSTLYELD
        V++ +K D++ +YFVTG  T  +Y + C F DP   F+G   + RN        +  +++L   +        +      +  PW+P++S  G T Y  D
Subjt:  VLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDKGMYMAAKNGTSETYLKLPWRPLISIDGSTLYELD

Query:  EEL-KIARHAESWSVSALEAITQIFIPS
         E  KI RH E W+V  +    Q+  PS
Subjt:  EEL-KIARHAESWSVSALEAITQIFIPS

AT2G46100.2 Nuclear transport factor 2 (NTF2) family protein | e value: 7.3e-05 | % identity: 39.13
Query:  VLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRN
        V++ +K D++ +YFVTG  T  +Y + C F DP   F+G   + RN
Subjt:  VLNILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRN

AT3G04890.1 Uncharacterized conserved protein (DUF2358) | e value: 1.3e-49 | % identity: 50.23
Query:  RCFSSSPENSNSSRKKEASAIVKIAVSGVTELLRLFSSPISKSCINSFLEHARERYEFDSGSCLCCRVDGVGDNQGEEFVVAGIDDVLNILKSDYENAYF
        RCFS SPE       K+  A++K AVSGVTE LRL S   S + I +  + ++                        E     +DDV+ IL+SDY N YF
Subjt:  RCFSSSPENSNSSRKKEASAIVKIAVSGVTELLRLFSSPISKSCINSFLEHARERYEFDSGSCLCCRVDGVGDNQGEEFVVAGIDDVLNILKSDYENAYF

Query:  VTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDKG-----MYMAAKNGTSETYLKLPWRPLISIDGSTLYELDEELKIARHA
        VTG+ TSAIY+DDC+FEDPTI F+G ELY RNLKLLVPF +  SI+LQ ++K       Y+ A      TYLKLPWRPLISI+G+T+Y+LD++ KI RH 
Subjt:  VTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDKG-----MYMAAKNGTSETYLKLPWRPLISIDGSTLYELDEELKIARHA

Query:  ESWSVSALEAITQIF
        ESW+VSALEAI QIF
Subjt:  ESWSVSALEAITQIF

AT3G04890.2 Uncharacterized conserved protein (DUF2358) | e value: 2.4e-40 | % identity: 47.18
Query:  RCFSSSPENSNSSRKKEASAIVKIAVSGVTELLRLFSSPISKSCINSFLEHARERYEFDSGSCLCCRVDGVGDNQGEEFVVAGIDDVLNILKSDYENAYF
        RCFS SPE       K+  A++K AVSGVTE LRL S   S + I +  + ++                        E     +DDV+ IL+SDY N YF
Subjt:  RCFSSSPENSNSSRKKEASAIVKIAVSGVTELLRLFSSPISKSCINSFLEHARERYEFDSGSCLCCRVDGVGDNQGEEFVVAGIDDVLNILKSDYENAYF

Query:  VTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDKG-----MYMAAKNGTSETYLKLPWRPLISIDGSTLYELDEELK
        VTG+ TSAIY+DDC+FEDPTI F+G ELY RNLKLLVPF +  SI+LQ ++K       Y+ A      TYLKLPWRPLISI+G+T+Y+LD++ K
Subjt:  VTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDKG-----MYMAAKNGTSETYLKLPWRPLISIDGSTLYELDEELK


Sequences
CDS sequence
ATGACCGGAATCTGTTACTGCCCACCGCTTCCCCCATCTCTATCCCCCAAGATGGTAACCGCCATACGATGCTTCTCAAGCTCGCCGGAGAATTCCAATAGTAGCCGGAA
GAAAGAGGCATCTGCGATTGTGAAAATCGCTGTGAGTGGAGTCACAGAGCTTCTTAGGCTTTTCTCGTCTCCGATCAGTAAGAGTTGCATAAATAGTTTTCTTGAACATG
CGAGGGAGAGATATGAATTCGATTCTGGTTCGTGTCTGTGTTGCAGAGTTGATGGAGTTGGGGACAATCAGGGAGAGGAATTTGTAGTTGCAGGCATCGATGATGTTCTA
AATATCCTGAAATCAGATTATGAGAATGCTTACTTTGTTACTGGGATATTCACTTCTGCCATTTATGCTGATGATTGTCTCTTTGAAGATCCAACCATCAGGTTTCGAGG
TAAGGAGCTGTATTCACGGAATCTAAAGTTACTTGTTCCCTTTTTTGACTGTCCCTCAATCCAGTTACAAACGATCGATAAGGGGATGTATATGGCAGCTAAGAACGGGA
CGAGTGAGACATACCTGAAACTTCCTTGGAGGCCACTCATTTCTATTGATGGAAGTACTCTCTATGAACTTGATGAAGAGTTGAAAATAGCAAGGCATGCTGAGAGCTGG
AGTGTTTCTGCACTTGAAGCAATAACTCAGATATTCATCCCTAGTTTTAAAGGAGACTGA
mRNA sequence
TGACATGGCAACTTTCGCGGGACTGGCTGGACGAAATCGGAGAGGAGCTGAGCGTCGCCGGAGGCACTGTCTCCGAGAGAACAATGACCGGAATCTGTTACTGCCCACCG
CTTCCCCCATCTCTATCCCCCAAGATGGTAACCGCCATACGATGCTTCTCAAGCTCGCCGGAGAATTCCAATAGTAGCCGGAAGAAAGAGGCATCTGCGATTGTGAAAAT
CGCTGTGAGTGGAGTCACAGAGCTTCTTAGGCTTTTCTCGTCTCCGATCAGTAAGAGTTGCATAAATAGTTTTCTTGAACATGCGAGGGAGAGATATGAATTCGATTCTG
GTTCGTGTCTGTGTTGCAGAGTTGATGGAGTTGGGGACAATCAGGGAGAGGAATTTGTAGTTGCAGGCATCGATGATGTTCTAAATATCCTGAAATCAGATTATGAGAAT
GCTTACTTTGTTACTGGGATATTCACTTCTGCCATTTATGCTGATGATTGTCTCTTTGAAGATCCAACCATCAGGTTTCGAGGTAAGGAGCTGTATTCACGGAATCTAAA
GTTACTTGTTCCCTTTTTTGACTGTCCCTCAATCCAGTTACAAACGATCGATAAGGGGATGTATATGGCAGCTAAGAACGGGACGAGTGAGACATACCTGAAACTTCCTT
GGAGGCCACTCATTTCTATTGATGGAAGTACTCTCTATGAACTTGATGAAGAGTTGAAAATAGCAAGGCATGCTGAGAGCTGGAGTGTTTCTGCACTTGAAGCAATAACT
CAGATATTCATCCCTAGTTTTAAAGGAGACTGAGAATTGTTTGCTAGGATGAACACAAACTGCTCCATTCAATTGATTTTATGTTGTGTTTTAAATATTTATGTATAAAC
TTTATTTTGATATAATGTAAATATAATGTATGGTATGTTTTAAATTCCCGTTTGTTTTATTTCGCAAAAAATATACCCATCTTTTTGGAGCCATCGAGTCAGCCCGGTCC
TTACATAACATGTCAATACTACAATGCTGATATAGCATCATAGCACTGTCACCCATCTTTCTTTTTTCGTCTTTTTTTTTACTATGTCGAGTCGCTATTCTCTCGGGGTC
TGGACTTCCTCATTTGCCTTACAGAACTTTTTGGTTGTTTGTGTTTTTGTCCAAATTGAACGACACACCTCTTATCCATTATCAAGTTTGCTCTTTGACGAAGCTTTTCG
AGTTTAAGCCTCGCCATCTCCAAGTTGCCTCTTAAACTTATGCCAGTTACATACCATCCATTAGTCCAAGTTCCATGACGGATCATCTGATTGTAGACATATATAATGAA
CAACAAGCTTTCATCAATTATATTTATAACCTTCTAATGTGGATGGATCATTCAAAATTTCCATCATATATCGAATTCTGTTCTCTCATTTTTGTTTTCAGGCAGTTTCT
CTCAGAAATTAATTGCATGTCTTTGCTTTCCCAATATGTTGTACACTCATACTTCATAACGCTCGCCCTTCCTCTGGTTACATGACAGGGTTAACTGCCACCAATTGATG
GAAACGCCTGCAGTAGAGACTCCACACAAGCAGAGAGCACTTAGAAGAATGAATTTCTACTAAGAATTTATAAGCACCCTGGCCTCTGTCTTGCCATCTACCACAACCGC
CGGTGGAGATGAAGAAATGGCTGATGACAGACAAGCGAGTGGACGTGATTCCAACCATGATACCATTGCCCGGAAAAGCCAGTGGTGAAGATGCTTGGATTGACAAAGAT
GGAAACTGAGGTGAAATTGCTGACATGAAAGGGCGGAAGGATGGGTTTCATGTGCACCAGCAAGTGTCCTAGCTCATCGCTAGTAGATATTGTCCTCTCCAACCAATGTA
GGGTCTTACAATCCACCCCCTTGGGGGTCCAGCGACCTCGTTGGCACACCGTTTGGTATCTAGCTCTGTTACCATGTGTAACAGCCCAAACCCACCACTAACAGATATTA
TCCCTTTCAGCCCATAACGCATCACTGTCAGCTTCACGGCTTTAAAACGTGTCTGCTAAGGAGAGGTTTCGACATCCTTATAAAGAATGTTTCATACTCCTCTCCAAGTG
GGGATCTCACAATAAGTTATTTTGTTCATCACCAAAGTCGTTTCTTTTCGACACAAGTTGTCGAAGATCGTCCTATAAGCGAAAAAATGAATCTGAATCAGATGCAGATA
ATGGGGAAATTGATGAGTCCGGGATTGATGCGTCAATCATATGCTTGGTTTTGTGATTTGGGGTGTGGAAAGGCACAGAATTGAGGTATGATGATCTGGTCCACCATCCC
TCATAATATAAGGTACCTCTTTTGTTTGTTTCCAACAAAATGATTGCTGTTCATTACCCACAAAAAAGAAAAAGAAAATGAAGAGAAAGTAGAGTGTTTAGTCAGTTTCA
AGTGGTGGTGTGATTTGATGATCTGAAGCAGACCCTTTACAGAGCTTCTTTTGACTGCTAGTTTAAGGGTACATAGAGAAATTCAAGAATTTGAATGCATCTGCCATTTG
GCCCATTTCATTCTTCATGGTCACATCATAAATCTTTTATATAAATTCTCACATCAAAACCTTGGAGGACACCAATCAGCTCCTATCTACTTTTATTCATTTTATAGCTA
AATTCAACTTTTATATGAAACCGCTACTATTTTATTTCAAAAATATCTTTAC
Protein sequence
MTGICYCPPLPPSLSPKMVTAIRCFSSSPENSNSSRKKEASAIVKIAVSGVTELLRLFSSPISKSCINSFLEHARERYEFDSGSCLCCRVDGVGDNQGEEFVVAGIDDVL
NILKSDYENAYFVTGIFTSAIYADDCLFEDPTIRFRGKELYSRNLKLLVPFFDCPSIQLQTIDKGMYMAAKNGTSETYLKLPWRPLISIDGSTLYELDEELKIARHAESW
SVSALEAITQIFIPSFKGD