CuGenDBv2

MC05g0499 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC05g0499
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: early light-induced protein 1, chloroplastic-like
Genome location: MC05:3734655..3736022
RNA-Seq Expression: MC05g0499
Synteny: MC05g0499
Gene Ontology terms: GO:0009535 - chloroplast thylakoid membrane (cellular component)
InterPro domains: IPR022796 - Chlorophyll A-B binding protein
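
The genome location above is given as a `seqid:start..end` string. A minimal sketch of splitting it into its parts follows. The names `Location` and `parse_location` are hypothetical helpers introduced here for illustration, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location string such as 'MC05:3734655..3736022'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(seqid, start, end)


loc = parse_location("MC05:3734655..3736022")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption the gene spans 1368 bp on MC05.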


Homology
GenBank top hits (e value, % identity, alignment)
KAG6582332.1 Early light-induced protein 1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]; e value 3.51e-93; identity 65.23%
Query:  MMTWNLNIHHSPSGFPIL--RYINPAQGFARIVINNSQFTRR--FSLDRQFSFFPTSFNSSA---MAASAVMQSFLASSATRGTCGNPTNRRLPAANLLC
        M+TW+LNIHHSP   P+   R  +P         ++ QFT R   +L  QF     SF +S    MAASAVMQS LASSA+R  C  P NR LPAA    
Subjt:  MMTWNLNIHHSPSGFPIL--RYINPAQGFARIVINNSQFTRR--FSLDRQFSFFPTSFNSSA---MAASAVMQSFLASSATRGTCGNPTNRRLPAANLLC

Query:  SRRMASFRVRCAAEEEQREQPIPSSI--PSPSPAP-PSPAPKVSTKFTDILAFSGPAPERINGRLAMVGFVAAMAVEASKGQDVFEQIANGGIPWFVGTS
        SRR A+FRVRC AEE+QRE   P S+  PSP P P P+P+PK+STK TDILAFSGPAPERINGRLAMVGFVAA+AVEA+KGQDV EQI NGGIPWFVGTS
Subjt:  SRRMASFRVRCAAEEEQREQPIPSSI--PSPSPAP-PSPAPKVSTKFTDILAFSGPAPERINGRLAMVGFVAAMAVEASKGQDVFEQIANGGIPWFVGTS

Query:  VVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV
         +L+LASLIP ++GVS ES S+G MTS AELWNGRFAMLGL+ALAFTEYV GGSLV
Subjt:  VVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV

XP_004134120.1 early light-induced protein 1, chloroplastic [Cucumis sativus]; e value 1.09e-89; identity 77.72%
Query:  MAASAVMQSFLASSATRGTCGNPTNRRLPAANLLCSRRMASFRVRCAAEEEQREQPIPSSIPSPSPAPPSPAPK-----VSTKFTDILAFSGPAPERING
        MAASAVMQS L+++ATRG     TNR LPAA   CSR  A+FRVRC AEE+QREQ IPSSIP PSP  P P+P       STK TDILAFSGP PERING
Subjt:  MAASAVMQSFLASSATRGTCGNPTNRRLPAANLLCSRRMASFRVRCAAEEEQREQPIPSSIPSPSPAPPSPAPK-----VSTKFTDILAFSGPAPERING

Query:  RLAMVGFVAAMAVEASKGQDVFEQIANGGIPWFVGTSVVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV
        RLAM+GFVAA+AVE SKGQDVFEQIANGGIPWFVGTSVVL+LASLIPL KGVSAES S+G+M+S+AELWNGRFAMLGLVALAFTEYVKGGSLV
Subjt:  RLAMVGFVAAMAVEASKGQDVFEQIANGGIPWFVGTSVVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV

XP_022939396.1 early light-induced protein 1, chloroplastic-like [Cucurbita moschata]; e value 2.38e-89; identity 77.49%
Query:  MAASAVMQSFLASSATRGTCGNPTNRRLPAANLLCSRRMASFRVRCAAEEEQREQPIPSSIP---SPSPAPPSPAPKVSTKFTDILAFSGPAPERINGRL
        MA SA++Q+  A SA+R T   PTNR       LCSRR ASFRVRC AEEEQREQ IP+  P   +PSP PPSP PKVSTKF+D+LAFSGPAPERINGRL
Subjt:  MAASAVMQSFLASSATRGTCGNPTNRRLPAANLLCSRRMASFRVRCAAEEEQREQPIPSSIP---SPSPAPPSPAPKVSTKFTDILAFSGPAPERINGRL

Query:  AMVGFVAAMAVEASKGQDVFEQIANGGIPWFVGTSVVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV
        AM+GFVAA+AVEA KGQDV EQI NGGIPWFVGTSVVL+LASLIPL KGVS ESKS+GLMTSDAELWNGRFAMLGLVALAFTE+VKGGSLV
Subjt:  AMVGFVAAMAVEASKGQDVFEQIANGGIPWFVGTSVVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV

XP_022974396.1 early light-induced protein 1, chloroplastic-like [Cucurbita maxima]; e value 2.38e-89; identity 77.49%
Query:  MAASAVMQSFLASSATRGTCGNPTNRRLPAANLLCSRRMASFRVRCAAEEEQREQPIPSSIP---SPSPAPPSPAPKVSTKFTDILAFSGPAPERINGRL
        MA SA++Q+   SSA+R T   PTNR       LCSRR ASFRVRC AEEEQREQ IP+  P   +PSP PPSP PKVSTKF+D+LAFSGPAPERINGRL
Subjt:  MAASAVMQSFLASSATRGTCGNPTNRRLPAANLLCSRRMASFRVRCAAEEEQREQPIPSSIP---SPSPAPPSPAPKVSTKFTDILAFSGPAPERINGRL

Query:  AMVGFVAAMAVEASKGQDVFEQIANGGIPWFVGTSVVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV
        AM+GFVAA+AVEA KGQDV EQI NGGIPWFVGTSVVL+LASLIPL KGVS ESKS+GLMTSDAELWNGRFAMLGLVALAFTE+VKGGSLV
Subjt:  AMVGFVAAMAVEASKGQDVFEQIANGGIPWFVGTSVVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV

XP_038880321.1 early light-induced protein 1, chloroplastic [Benincasa hispida]; e value 1.69e-91; identity 79.79%
Query:  MAASAVMQSFLASSATRGTCGNPTNRRLPAANLLCSRRMASFRVRCAAEEEQREQPIPSSIP----SPSPAP-PSPAPKVSTKFTDILAFSGPAPERING
        MAASAVMQS  ASSA RG     TNR LPAA    SRRMASFRVRC AEE+QREQPI +SIP     P P P PSP PKVSTKFTDILAFSGP PERING
Subjt:  MAASAVMQSFLASSATRGTCGNPTNRRLPAANLLCSRRMASFRVRCAAEEEQREQPIPSSIP----SPSPAP-PSPAPKVSTKFTDILAFSGPAPERING

Query:  RLAMVGFVAAMAVEASKGQDVFEQIANGGIPWFVGTSVVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV
        RLAM+GFVAA+AVE  KGQDVFEQIANGGIPWFVGTSVVL+LASLIPL +GVSAES S+G+M S+AELWNGRFAMLGLVALAFTEYVKGGSLV
Subjt:  RLAMVGFVAAMAVEASKGQDVFEQIANGGIPWFVGTSVVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LA49 Uncharacterized protein; e value 5.29e-90; identity 77.72%
Query:  MAASAVMQSFLASSATRGTCGNPTNRRLPAANLLCSRRMASFRVRCAAEEEQREQPIPSSIPSPSPAPPSPAPK-----VSTKFTDILAFSGPAPERING
        MAASAVMQS L+++ATRG     TNR LPAA   CSR  A+FRVRC AEE+QREQ IPSSIP PSP  P P+P       STK TDILAFSGP PERING
Subjt:  MAASAVMQSFLASSATRGTCGNPTNRRLPAANLLCSRRMASFRVRCAAEEEQREQPIPSSIPSPSPAPPSPAPK-----VSTKFTDILAFSGPAPERING

Query:  RLAMVGFVAAMAVEASKGQDVFEQIANGGIPWFVGTSVVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV
        RLAM+GFVAA+AVE SKGQDVFEQIANGGIPWFVGTSVVL+LASLIPL KGVSAES S+G+M+S+AELWNGRFAMLGLVALAFTEYVKGGSLV
Subjt:  RLAMVGFVAAMAVEASKGQDVFEQIANGGIPWFVGTSVVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV

A0A1S3AXH6 early light-induced protein 1, chloroplastic; e value 8.70e-89; identity 77.2%
Query:  MAASAVMQSFLASSATRGTCGNPTNRRLPAANLLCSRRMASFRVRCAAEEEQREQPIPSSIPSPSPAPPSPAPK-----VSTKFTDILAFSGPAPERING
        MAASAVMQS  +S+A+RG     TNR LPAA  LCSR   +FRVRC  EE+QREQ IPSSIP PSP    PAP       STKFTDILAFSGP PERING
Subjt:  MAASAVMQSFLASSATRGTCGNPTNRRLPAANLLCSRRMASFRVRCAAEEEQREQPIPSSIPSPSPAPPSPAPK-----VSTKFTDILAFSGPAPERING

Query:  RLAMVGFVAAMAVEASKGQDVFEQIANGGIPWFVGTSVVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV
        RLAM+GFVAA+AVE SKGQDVFEQIANGGIPWFVGTSVVL+LASLIPL KGVSAES S+G+M+S+AELWNGRFAMLGLVALAFTEYVKGGSLV
Subjt:  RLAMVGFVAAMAVEASKGQDVFEQIANGGIPWFVGTSVVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV

A0A5D3D3B2 Early light-induced protein 1; e value 2.15e-89; identity 77.72%
Query:  MAASAVMQSFLASSATRGTCGNPTNRRLPAANLLCSRRMASFRVRCAAEEEQREQPIPSSIPSPSPAPPSPAPK-----VSTKFTDILAFSGPAPERING
        MAASAVMQS  +S+A+RG     TNR LPAA  LCSR   +FRVRC AEE+QREQ IPSSIP PSP    PAP       STKFTDILAFSGP PERING
Subjt:  MAASAVMQSFLASSATRGTCGNPTNRRLPAANLLCSRRMASFRVRCAAEEEQREQPIPSSIPSPSPAPPSPAPK-----VSTKFTDILAFSGPAPERING

Query:  RLAMVGFVAAMAVEASKGQDVFEQIANGGIPWFVGTSVVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV
        RLAM+GFVAA+AVE SKGQDVFEQIANGGIPWFVGTSVVL+LASLIPL KGVSAES S+G+M+S+AELWNGRFAMLGLVALAFTEYVKGGSLV
Subjt:  RLAMVGFVAAMAVEASKGQDVFEQIANGGIPWFVGTSVVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV

A0A6J1FFS2 early light-induced protein 1, chloroplastic-like; e value 1.15e-89; identity 77.49%
Query:  MAASAVMQSFLASSATRGTCGNPTNRRLPAANLLCSRRMASFRVRCAAEEEQREQPIPSSIP---SPSPAPPSPAPKVSTKFTDILAFSGPAPERINGRL
        MA SA++Q+  A SA+R T   PTNR       LCSRR ASFRVRC AEEEQREQ IP+  P   +PSP PPSP PKVSTKF+D+LAFSGPAPERINGRL
Subjt:  MAASAVMQSFLASSATRGTCGNPTNRRLPAANLLCSRRMASFRVRCAAEEEQREQPIPSSIP---SPSPAPPSPAPKVSTKFTDILAFSGPAPERINGRL

Query:  AMVGFVAAMAVEASKGQDVFEQIANGGIPWFVGTSVVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV
        AM+GFVAA+AVEA KGQDV EQI NGGIPWFVGTSVVL+LASLIPL KGVS ESKS+GLMTSDAELWNGRFAMLGLVALAFTE+VKGGSLV
Subjt:  AMVGFVAAMAVEASKGQDVFEQIANGGIPWFVGTSVVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV

A0A6J1IB88 early light-induced protein 1, chloroplastic-like; e value 1.15e-89; identity 77.49%
Query:  MAASAVMQSFLASSATRGTCGNPTNRRLPAANLLCSRRMASFRVRCAAEEEQREQPIPSSIP---SPSPAPPSPAPKVSTKFTDILAFSGPAPERINGRL
        MA SA++Q+   SSA+R T   PTNR       LCSRR ASFRVRC AEEEQREQ IP+  P   +PSP PPSP PKVSTKF+D+LAFSGPAPERINGRL
Subjt:  MAASAVMQSFLASSATRGTCGNPTNRRLPAANLLCSRRMASFRVRCAAEEEQREQPIPSSIP---SPSPAPPSPAPKVSTKFTDILAFSGPAPERINGRL

Query:  AMVGFVAAMAVEASKGQDVFEQIANGGIPWFVGTSVVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV
        AM+GFVAA+AVEA KGQDV EQI NGGIPWFVGTSVVL+LASLIPL KGVS ESKS+GLMTSDAELWNGRFAMLGLVALAFTE+VKGGSLV
Subjt:  AMVGFVAAMAVEASKGQDVFEQIANGGIPWFVGTSVVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV

SwissProt top hits (e value, % identity, alignment)
P11432 Early light-induced protein, chloroplastic; e value 1.7e-46; identity 55.84%
Query:  MAASAVMQSFLASSATRGTCGNPTNR--RLPAANLLCSRRMASFRVRCAAEEEQREQ---PIPSSIPSPSPAPPSPA----PKVSTKFTDILAFSGPAPE
        MA S+  QS +++S T  +  +  N+   +P+  +   RR  S +VR  AE E +EQ    +  + P+ S   P PA    PK+STKF+D++AFSGPAPE
Subjt:  MAASAVMQSFLASSATRGTCGNPTNR--RLPAANLLCSRRMASFRVRCAAEEEQREQ---PIPSSIPSPSPAPPSPA----PKVSTKFTDILAFSGPAPE

Query:  RINGRLAMVGFVAAMAVEASKGQDVFEQIANGGIPWFVGTSVVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV
        RINGRLAM+GFVAAM VE +KGQ + EQ++ GG+ WF+GTSV+LSLASLIP  +GVS ESKS+ +M+SDAE WNGR AMLGLVALAFTE+VKG SLV
Subjt:  RINGRLAMVGFVAAMAVEASKGQDVFEQIANGGIPWFVGTSVVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV

P14895 High molecular mass early light-inducible protein HV58, chloroplastic; e value 5.7e-34; identity 57.86%
Query:  PIPSSIPSPSPAPPSPAPKVSTK-----FTDILAFSGPAPERINGRLAMVGFVAAMAVEASKGQDVFEQIA--NGGIPWFVGTSVVLSLASLIPLVKGVS
        P PS   SPSP     APK  TK       D LAFSGPAPERINGRLAMVGFVAA++VEA++G  + +Q+   + G+ WF+ T+ V S+ASL+PL++G S
Subjt:  PIPSSIPSPSPAPPSPAPKVSTK-----FTDILAFSGPAPERINGRLAMVGFVAAMAVEASKGQDVFEQIA--NGGIPWFVGTSVVLSLASLIPLVKGVS

Query:  AESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV
         ESKS G+ ++DAELWNGRFAMLGLVALA TE++ G   V
Subjt:  AESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV

P93735 Early light-induced protein 1, chloroplastic; e value 1.7e-49; identity 60.82%
Query:  ASAVMQSFLASSATRGTCGNPTNRRLPAANLLCSRRMASFRVRCAAE---EEQREQPIPSSI---PSP-SPAPPSP-APKVSTKFTDILAFSGPAPERIN
        AS  MQS  A   T  T    TN+   A +    +R     VRC AE     +   P PS+    P P SP+PP P  PKVSTKF+D+LAFSGPAPERIN
Subjt:  ASAVMQSFLASSATRGTCGNPTNRRLPAANLLCSRRMASFRVRCAAE---EEQREQPIPSSI---PSP-SPAPPSP-APKVSTKFTDILAFSGPAPERIN

Query:  GRLAMVGFVAAMAVEASKGQDVFEQIANGGIPWFVGTSVVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV
        GRLAMVGFVAA+AVE SKG++V  QI++GG+ WF+GT+ +L+LASL+PL KG+S ESKS+G+MTSDAELWNGRFAMLGLVALAFTE+VKGG+LV
Subjt:  GRLAMVGFVAAMAVEASKGQDVFEQIANGGIPWFVGTSVVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV

Q01931 Desiccation stress protein DSP-22, chloroplastic; e value 6.1e-36; identity 53.85%
Query:  RRMASFRVRCAAEEEQREQPIPSSIPSPSPAPPSPAP---KVSTKFT-DILAFSGPAPERINGRLAMVGFVAAMAVEASKGQDVFEQIANGGIPWFVGTS
        RR A F VR   E+ ++E+             P   P   +V+TK T D+ +F G APERINGR AM+GFVAA+ VE + G+DVF Q+ NGG+ WF+ TS
Subjt:  RRMASFRVRCAAEEEQREQPIPSSIPSPSPAPPSPAP---KVSTKFT-DILAFSGPAPERINGRLAMVGFVAAMAVEASKGQDVFEQIANGGIPWFVGTS

Query:  VVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV
         VL LA+LIP+ +G+S E+K+ G   SDAE+WNGRFAM+GLVALAFTEYVKGG L+
Subjt:  VVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV

Q94K66 Early light-induced protein 2, chloroplastic; e value 4.8e-49; identity 59.26%
Query:  SFLASSATRGTCGNPTNRRLPAANLLCSRRMASFRVRCAAEEEQ-REQP-IPSSIPS-------PSPAPPSPAPKVSTKFTDILAFSGPAPERINGRLAM
        SF   S      G  T R +   N L  +R+A   VRC A+ +  +E P +PS+  S        SP PP   PKVSTKF D+LAFSGPAPERINGRLAM
Subjt:  SFLASSATRGTCGNPTNRRLPAANLLCSRRMASFRVRCAAEEEQ-REQP-IPSSIPS-------PSPAPPSPAPKVSTKFTDILAFSGPAPERINGRLAM

Query:  VGFVAAMAVEASKGQDVFEQIANGGIPWFVGTSVVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV
        VGFVAA+A+E SKG++VF QI++GG+ WF+GT+ +L+LAS++PL KG+ AE+KS+G MTSDAELWNGRFAMLGLVALAFTEYV GG+LV
Subjt:  VGFVAAMAVEASKGQDVFEQIANGGIPWFVGTSVVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV

Arabidopsis top hits (e value, % identity, alignment)
AT3G22840.1 Chlorophyll A-B binding family protein; e value 1.2e-50; identity 60.82%
Query:  ASAVMQSFLASSATRGTCGNPTNRRLPAANLLCSRRMASFRVRCAAE---EEQREQPIPSSI---PSP-SPAPPSP-APKVSTKFTDILAFSGPAPERIN
        AS  MQS  A   T  T    TN+   A +    +R     VRC AE     +   P PS+    P P SP+PP P  PKVSTKF+D+LAFSGPAPERIN
Subjt:  ASAVMQSFLASSATRGTCGNPTNRRLPAANLLCSRRMASFRVRCAAE---EEQREQPIPSSI---PSP-SPAPPSP-APKVSTKFTDILAFSGPAPERIN

Query:  GRLAMVGFVAAMAVEASKGQDVFEQIANGGIPWFVGTSVVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV
        GRLAMVGFVAA+AVE SKG++V  QI++GG+ WF+GT+ +L+LASL+PL KG+S ESKS+G+MTSDAELWNGRFAMLGLVALAFTE+VKGG+LV
Subjt:  GRLAMVGFVAAMAVEASKGQDVFEQIANGGIPWFVGTSVVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV

AT4G14690.1 Chlorophyll A-B binding family protein; e value 3.4e-50; identity 59.26%
Query:  SFLASSATRGTCGNPTNRRLPAANLLCSRRMASFRVRCAAEEEQ-REQP-IPSSIPS-------PSPAPPSPAPKVSTKFTDILAFSGPAPERINGRLAM
        SF   S      G  T R +   N L  +R+A   VRC A+ +  +E P +PS+  S        SP PP   PKVSTKF D+LAFSGPAPERINGRLAM
Subjt:  SFLASSATRGTCGNPTNRRLPAANLLCSRRMASFRVRCAAEEEQ-REQP-IPSSIPS-------PSPAPPSPAPKVSTKFTDILAFSGPAPERINGRLAM

Query:  VGFVAAMAVEASKGQDVFEQIANGGIPWFVGTSVVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV
        VGFVAA+A+E SKG++VF QI++GG+ WF+GT+ +L+LAS++PL KG+ AE+KS+G MTSDAELWNGRFAMLGLVALAFTEYV GG+LV
Subjt:  VGFVAAMAVEASKGQDVFEQIANGGIPWFVGTSVVLSLASLIPLVKGVSAESKSEGLMTSDAELWNGRFAMLGLVALAFTEYVKGGSLV


Sequences
CDS sequence
ATGATGACGTGGAATTTAAATATCCACCACTCTCCTTCCGGCTTCCCCATCCTCCGCTATATAAACCCCGCCCAGGGTTTTGCAAGAATCGTCATAAACAATTCACAGTT
CACTCGCCGTTTCAGCCTCGATAGACAATTTTCCTTCTTTCCGACGTCGTTTAATTCCTCCGCGATGGCCGCCTCAGCCGTAATGCAATCCTTTTTGGCGAGCTCGGCAA
CTCGAGGAACCTGTGGAAACCCGACGAACCGCCGTCTCCCGGCGGCGAATCTGCTGTGTTCTCGGAGGATGGCCTCCTTTCGAGTGCGATGCGCGGCTGAGGAGGAACAG
AGAGAGCAACCGATTCCTTCCTCGATTCCGTCACCATCCCCCGCGCCTCCATCGCCTGCGCCGAAGGTGAGTACGAAGTTTACAGATATCTTGGCATTCAGTGGACCGGC
GCCGGAGAGGATCAATGGGCGGCTAGCGATGGTCGGATTCGTGGCCGCAATGGCGGTGGAAGCATCGAAGGGCCAGGACGTGTTCGAGCAGATAGCGAACGGAGGGATCC
CATGGTTCGTGGGGACGAGCGTGGTGCTATCGCTGGCGTCGCTGATTCCTCTGGTTAAAGGGGTGAGTGCGGAGTCCAAATCGGAGGGGTTGATGACTTCCGATGCAGAG
CTGTGGAATGGAAGGTTCGCTATGTTGGGTCTCGTGGCTTTGGCCTTCACTGAGTATGTGAAGGGTGGTAGCCTTGTGTAA
mRNA sequence
TCCTACGACCACGTCACTGCAGGAAAGTTGCCTTTGTTTCTTCTGGTGAATTATGAAGCCACGTCATTGCCCTTTTCATGTGTGGACAAACCAACCCACGTGACTGTCCT
ATTTGGCATTAAATAAAAATGATGACGTGGAATTTAAATATCCACCACTCTCCTTCCGGCTTCCCCATCCTCCGCTATATAAACCCCGCCCAGGGTTTTGCAAGAATCGT
CATAAACAATTCACAGTTCACTCGCCGTTTCAGCCTCGATAGACAATTTTCCTTCTTTCCGACGTCGTTTAATTCCTCCGCGATGGCCGCCTCAGCCGTAATGCAATCCT
TTTTGGCGAGCTCGGCAACTCGAGGAACCTGTGGAAACCCGACGAACCGCCGTCTCCCGGCGGCGAATCTGCTGTGTTCTCGGAGGATGGCCTCCTTTCGAGTGCGATGC
GCGGCTGAGGAGGAACAGAGAGAGCAACCGATTCCTTCCTCGATTCCGTCACCATCCCCCGCGCCTCCATCGCCTGCGCCGAAGGTGAGTACGAAGTTTACAGATATCTT
GGCATTCAGTGGACCGGCGCCGGAGAGGATCAATGGGCGGCTAGCGATGGTCGGATTCGTGGCCGCAATGGCGGTGGAAGCATCGAAGGGCCAGGACGTGTTCGAGCAGA
TAGCGAACGGAGGGATCCCATGGTTCGTGGGGACGAGCGTGGTGCTATCGCTGGCGTCGCTGATTCCTCTGGTTAAAGGGGTGAGTGCGGAGTCCAAATCGGAGGGGTTG
ATGACTTCCGATGCAGAGCTGTGGAATGGAAGGTTCGCTATGTTGGGTCTCGTGGCTTTGGCCTTCACTGAGTATGTGAAGGGTGGTAGCCTTGTGTAAATTGAAATTTT
AAGATGCAAAATTGAGGCGTCTTCTGTCGCTTGGTGGAGTTGATGAAGAGAGTGGTATTAATTGGATGTTGAATGTATAGACTCATGTGCAAACAAGCTCTGAGATTTCA
CTCTCATTTATGGATAATTAAGAAATGTTTTTAGATCATATAATTCAAATTTTCTTCATTCTGTCTGGAATGAGAAAATTCAGTATCAACATCCGTAAAATTTCTTGTTA
GAATAACCATTGAAAAAAAAAATTATATTTTTGCTGGGTGATGTTCAATTGAGATGAGTGACCGTTTATAACTATTTATCAATTTTAAGGAACTAAC
Protein sequence
MMTWNLNIHHSPSGFPILRYINPAQGFARIVINNSQFTRRFSLDRQFSFFPTSFNSSAMAASAVMQSFLASSATRGTCGNPTNRRLPAANLLCSRRMASFRVRCAAEEEQ
REQPIPSSIPSPSPAPPSPAPKVSTKFTDILAFSGPAPERINGRLAMVGFVAAMAVEASKGQDVFEQIANGGIPWFVGTSVVLSLASLIPLVKGVSAESKSEGLMTSDAE
LWNGRFAMLGLVALAFTEYVKGGSLV
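
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the record does not say which table was used, and `translate` is a hypothetical helper name introduced here for illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence; whitespace and line breaks are ignored."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        # Codons containing anything other than A/C/G/T become 'X'.
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First six and last six codons of the CDS listed above.
print(translate("ATGATGACGTGGAATTTA"))   # MMTWNL
print(translate("GGTGGTAGCCTTGTGTAA"))   # GGSLV
```

The two fragments reproduce the first residues (MMTWNL) and the last residues (GGSLV) of the listed protein sequence; passing the full CDS text, line breaks included, to `translate` would allow the same comparison over the whole sequence.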