; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0026870 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0026870
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionLEA_2 domain-containing protein
Genome locationchr12:19605807..19607306
RNA-Seq ExpressionPI0026870
SyntenyPI0026870
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139454.1 uncharacterized protein LOC101218532 [Cucumis sativus]1.6e-9085.35Show/hide
Query:  SSSSAKEISYVQKHGAAKRTRVLRITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQVIPHSLTDRKLNATIAFTVVSYNPNQRASIRMDSMRMTV
        SSSS+KE+SYV KHG AKRTRVLRITGRTLLGLMILV IAMI+CWL+VFP+ PD+IVETGQVIPHSLTDRKLNATIAFTV SYNPN++ASIRMDSMRM V
Subjt:  SSSSAKEISYVQKHGAAKRTRVLRITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQVIPHSLTDRKLNATIAFTVVSYNPNQRASIRMDSMRMTV

Query:  SDMGLSFWSDIPSFTQPPKNKTVLTSAIQGNFINPFGRMKENMKLDGISPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFNDSTTFDNKKCDVDI
        SDMGLSFWSDIPSFTQPPKNKTVLTS IQGNFI PFG MKE MKL+GISP+L FSAKVSYIMERW SR R+VE+YC SLRLKFNDST FDNKKC VD+
Subjt:  SDMGLSFWSDIPSFTQPPKNKTVLTSAIQGNFINPFGRMKENMKLDGISPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFNDSTTFDNKKCDVDI

XP_008462326.1 PREDICTED: uncharacterized protein LOC103500702 [Cucumis melo]1.4e-9485.1Show/hide
Query:  MRSIATQGAGSSSSAKEISYVQKHGAAKRTRVLRITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQVIPHSLTDRKLNATIAFTVVSYNPNQRAS
        MRSIATQG  SSSSAKEISYV+KHGAAKRTRVLRITGRTLLGLMILV IAMI+CWL+VFP+ PDLIVETG+VIPHSLTDRKLNATIAFTV SYNPN++AS
Subjt:  MRSIATQGAGSSSSAKEISYVQKHGAAKRTRVLRITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQVIPHSLTDRKLNATIAFTVVSYNPNQRAS

Query:  IRMDSMRMTVSDMGLSFWSDIPSFTQPPKNKTVLTSAIQGNFINPFGRMKENMKLDGISPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFNDSTTFD
        IRMDSMRM V+DMGLSFWSDIPSFTQPPKNKTVL S IQGNFI PFG MKE +KL+GISP L FSAKVSYIMERW SRGR++E+YC SLRLKFNDST FD
Subjt:  IRMDSMRMTVSDMGLSFWSDIPSFTQPPKNKTVLTSAIQGNFINPFGRMKENMKLDGISPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFNDSTTFD

Query:  NKKCDVDI
        NKKC VD+
Subjt:  NKKCDVDI

XP_022964992.1 uncharacterized protein At1g08160-like [Cucurbita moschata]6.0e-6660.09Show/hide
Query:  MRSIATQGAGSSSSA-----KEISYVQKHGAAKRTRVLRITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQVIPHSLTDRKLNATIAFTVVSYNP
        MRS   + A SS+++     K  SY Q+HG AKRT+++RI GR+LL +M LVG+A+++CWLVVFPKTP+LI+E G V PHSLTDRKLNA+I+FT+ SYNP
Subjt:  MRSIATQGAGSSSSA-----KEISYVQKHGAAKRTRVLRITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQVIPHSLTDRKLNATIAFTVVSYNP

Query:  NQRASIRMDSMRMTVSDMGLSFWSDIPSFTQPPKNKTVLTSAIQGNFINPFGRMKENMKLDGISPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFND
        N+RASI MDSM+MT+ DMG +F + IP+FTQPP N+T L   ++ NFI PFG+MK+ M  DG++P+LHFSA VSYI+E+WAS+ R +EIYC  +RLK N 
Subjt:  NQRASIRMDSMRMTVSDMGLSFWSDIPSFTQPPKNKTVLTSAIQGNFINPFGRMKENMKLDGISPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFND

Query:  STTFDNKKCDVDI
        ST FDN KC VD+
Subjt:  STTFDNKKCDVDI

XP_023520091.1 uncharacterized protein At1g08160-like [Cucurbita pepo subsp. pepo]7.9e-6660.77Show/hide
Query:  TQGAGSSSSA------KEISYVQKHGAAKRTRVLRITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQVIPHSLTDRKLNATIAFTVVSYNPNQRA
        TQG  SSS+       K  SY  +HG AKRT+++RI GR+LL +M LVG+A+++CWLVVFPKTP+LI+E G V PHSLTDRKLNA+I+FT+ SYNPN+RA
Subjt:  TQGAGSSSSA------KEISYVQKHGAAKRTRVLRITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQVIPHSLTDRKLNATIAFTVVSYNPNQRA

Query:  SIRMDSMRMTVSDMGLSFWSDIPSFTQPPKNKTVLTSAIQGNFINPFGRMKENMKLDGISPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFNDSTTF
        SI MDSM+MT+ DMG +F + IP+FTQPP N+T L   ++ NFI PFG+MK+ M  DG++P+LHFSA VSYI+E+WAS+ R +EIYC  +RLK N ST F
Subjt:  SIRMDSMRMTVSDMGLSFWSDIPSFTQPPKNKTVLTSAIQGNFINPFGRMKENMKLDGISPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFNDSTTF

Query:  DNKKCDVDI
        DN KC VD+
Subjt:  DNKKCDVDI

XP_038895440.1 uncharacterized protein LOC120083674 [Benincasa hispida]1.8e-7873.08Show/hide
Query:  MRSIATQGAGSSSSAKEISYVQKHGAAKRTRVLRITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQVIPHSLTDRKLNATIAFTVVSYNPNQRAS
        MR    QG  SSSS KE SYVQ+HG AKRT++LRITGR+LLG+MILVGI +I+CWL+VFPKTP+L VE+GQVIPH LTDRKL ATIAFTV SYNPN+RA+
Subjt:  MRSIATQGAGSSSSAKEISYVQKHGAAKRTRVLRITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQVIPHSLTDRKLNATIAFTVVSYNPNQRAS

Query:  IRMDSMRMTVSDMGLSFWSDIPSFTQPPKNKTVLTSAIQGNFINPFGRMKENMKLDGISPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFNDSTTFD
        I MDSM M V+DMG +F S IP+FTQPP N+TV TSAIQGNFI PFG MKE +K +GISP L FSAKVSYIM+RW S+ R++EIYCG LRLKFNDST FD
Subjt:  IRMDSMRMTVSDMGLSFWSDIPSFTQPPKNKTVLTSAIQGNFINPFGRMKENMKLDGISPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFNDSTTFD

Query:  NKKCDVDI
        NKKC VD+
Subjt:  NKKCDVDI

TrEMBL top hitse value%identityAlignment
A0A0A0LSV0 Uncharacterized protein7.6e-9185.35Show/hide
Query:  SSSSAKEISYVQKHGAAKRTRVLRITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQVIPHSLTDRKLNATIAFTVVSYNPNQRASIRMDSMRMTV
        SSSS+KE+SYV KHG AKRTRVLRITGRTLLGLMILV IAMI+CWL+VFP+ PD+IVETGQVIPHSLTDRKLNATIAFTV SYNPN++ASIRMDSMRM V
Subjt:  SSSSAKEISYVQKHGAAKRTRVLRITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQVIPHSLTDRKLNATIAFTVVSYNPNQRASIRMDSMRMTV

Query:  SDMGLSFWSDIPSFTQPPKNKTVLTSAIQGNFINPFGRMKENMKLDGISPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFNDSTTFDNKKCDVDI
        SDMGLSFWSDIPSFTQPPKNKTVLTS IQGNFI PFG MKE MKL+GISP+L FSAKVSYIMERW SR R+VE+YC SLRLKFNDST FDNKKC VD+
Subjt:  SDMGLSFWSDIPSFTQPPKNKTVLTSAIQGNFINPFGRMKENMKLDGISPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFNDSTTFDNKKCDVDI

A0A1S3CGQ5 uncharacterized protein LOC1035007026.7e-9585.1Show/hide
Query:  MRSIATQGAGSSSSAKEISYVQKHGAAKRTRVLRITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQVIPHSLTDRKLNATIAFTVVSYNPNQRAS
        MRSIATQG  SSSSAKEISYV+KHGAAKRTRVLRITGRTLLGLMILV IAMI+CWL+VFP+ PDLIVETG+VIPHSLTDRKLNATIAFTV SYNPN++AS
Subjt:  MRSIATQGAGSSSSAKEISYVQKHGAAKRTRVLRITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQVIPHSLTDRKLNATIAFTVVSYNPNQRAS

Query:  IRMDSMRMTVSDMGLSFWSDIPSFTQPPKNKTVLTSAIQGNFINPFGRMKENMKLDGISPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFNDSTTFD
        IRMDSMRM V+DMGLSFWSDIPSFTQPPKNKTVL S IQGNFI PFG MKE +KL+GISP L FSAKVSYIMERW SRGR++E+YC SLRLKFNDST FD
Subjt:  IRMDSMRMTVSDMGLSFWSDIPSFTQPPKNKTVLTSAIQGNFINPFGRMKENMKLDGISPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFNDSTTFD

Query:  NKKCDVDI
        NKKC VD+
Subjt:  NKKCDVDI

A0A5D3CC54 Protein YLS9-like6.7e-9585.1Show/hide
Query:  MRSIATQGAGSSSSAKEISYVQKHGAAKRTRVLRITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQVIPHSLTDRKLNATIAFTVVSYNPNQRAS
        MRSIATQG  SSSSAKEISYV+KHGAAKRTRVLRITGRTLLGLMILV IAMI+CWL+VFP+ PDLIVETG+VIPHSLTDRKLNATIAFTV SYNPN++AS
Subjt:  MRSIATQGAGSSSSAKEISYVQKHGAAKRTRVLRITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQVIPHSLTDRKLNATIAFTVVSYNPNQRAS

Query:  IRMDSMRMTVSDMGLSFWSDIPSFTQPPKNKTVLTSAIQGNFINPFGRMKENMKLDGISPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFNDSTTFD
        IRMDSMRM V+DMGLSFWSDIPSFTQPPKNKTVL S IQGNFI PFG MKE +KL+GISP L FSAKVSYIMERW SRGR++E+YC SLRLKFNDST FD
Subjt:  IRMDSMRMTVSDMGLSFWSDIPSFTQPPKNKTVLTSAIQGNFINPFGRMKENMKLDGISPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFNDSTTFD

Query:  NKKCDVDI
        NKKC VD+
Subjt:  NKKCDVDI

A0A6J1HJ52 uncharacterized protein At1g08160-like2.9e-6660.09Show/hide
Query:  MRSIATQGAGSSSSA-----KEISYVQKHGAAKRTRVLRITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQVIPHSLTDRKLNATIAFTVVSYNP
        MRS   + A SS+++     K  SY Q+HG AKRT+++RI GR+LL +M LVG+A+++CWLVVFPKTP+LI+E G V PHSLTDRKLNA+I+FT+ SYNP
Subjt:  MRSIATQGAGSSSSA-----KEISYVQKHGAAKRTRVLRITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQVIPHSLTDRKLNATIAFTVVSYNP

Query:  NQRASIRMDSMRMTVSDMGLSFWSDIPSFTQPPKNKTVLTSAIQGNFINPFGRMKENMKLDGISPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFND
        N+RASI MDSM+MT+ DMG +F + IP+FTQPP N+T L   ++ NFI PFG+MK+ M  DG++P+LHFSA VSYI+E+WAS+ R +EIYC  +RLK N 
Subjt:  NQRASIRMDSMRMTVSDMGLSFWSDIPSFTQPPKNKTVLTSAIQGNFINPFGRMKENMKLDGISPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFND

Query:  STTFDNKKCDVDI
        ST FDN KC VD+
Subjt:  STTFDNKKCDVDI

A0A6J1I0B9 uncharacterized protein At1g08160-like1.4e-6560.77Show/hide
Query:  TQGAGSSSSAKEIS------YVQKHGAAKRTRVLRITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQVIPHSLTDRKLNATIAFTVVSYNPNQRA
        TQG  SSSS   I       Y Q+HG AKRT+++RI GR+LL +M LVG+A+++CWLVVFPKTP+LI+E G V PHSLTDRKLNA+I+FT+ SYNPN+RA
Subjt:  TQGAGSSSSAKEIS------YVQKHGAAKRTRVLRITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQVIPHSLTDRKLNATIAFTVVSYNPNQRA

Query:  SIRMDSMRMTVSDMGLSFWSDIPSFTQPPKNKTVLTSAIQGNFINPFGRMKENMKLDGISPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFNDSTTF
        SI MDSM+MT+ DMG +F + IP+FTQ P N+T LT  ++ NFI PFG+MK+ M  +G++P+LHFSA VSYI+E+WAS+ R +EIYC  +RLK N ST F
Subjt:  SIRMDSMRMTVSDMGLSFWSDIPSFTQPPKNKTVLTSAIQGNFINPFGRMKENMKLDGISPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFNDSTTF

Query:  DNKKCDVDI
        DN KC VD+
Subjt:  DNKKCDVDI

SwissProt top hitse value%identityAlignment
Q8VZ13 Uncharacterized protein At1g081602.7e-0828.24Show/hide
Query:  ITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQVIPHSL--TDRKLNATIAFTVVSYNPNQRASIRMDSMRMTVSDMGLSF-WSDIPSFTQPPKNK
        I    LLGL  LVG+A+++ +L + PK     VE   V   ++   D  +NA  ++ + SYNP +  S+R  SMR++ +    S    +I  F Q PKN+
Subjt:  ITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQVIPHSL--TDRKLNATIAFTVVSYNPNQRASIRMDSMRMTVSDMGLSF-WSDIPSFTQPPKNK

Query:  T-VLTSAIQGNF-INPFGR--MKENMKLDGISPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFNDST
        T + T  +  N  ++ F    ++       I  +++ +A+VSY    + SR R ++  C  + +    S+
Subjt:  T-VLTSAIQGNF-INPFGR--MKENMKLDGISPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFNDST

Q9SJ52 NDR1/HIN1-like protein 106.6e-0725.25Show/hide
Query:  YVQKHGAAKRTRVLRITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQV--IPHSLTDRKLNATIAFTVVSYNPNQRASIRMDSMRMTVSDMGLSF
        Y + HG      +L +  + ++ L++++G+A ++ WL+V P+     V    +    H+  D  L   +A TV   NPN+R  +  D +       G  F
Subjt:  YVQKHGAAKRTRVLRITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQV--IPHSLTDRKLNATIAFTVVSYNPNQRASIRMDSMRMTVSDMGLSF

Query:  WS-DIPSFTQPPKNKTVLTSAIQGNFINPF----GRMKENMKLDGI-SPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFNDS------TTFDNKKCD
         +  +  F Q  KN TVLT   QG  +  F     R     ++ G+ + ++ F  +V + +     R    ++ C  LRL  + S      +T    KCD
Subjt:  WS-DIPSFTQPPKNKTVLTSAIQGNFINPF----GRMKENMKLDGI-SPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFNDS------TTFDNKKCD

Query:  VD
         D
Subjt:  VD

Arabidopsis top hitse value%identityAlignment
AT1G08160.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.9e-0928.24Show/hide
Query:  ITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQVIPHSL--TDRKLNATIAFTVVSYNPNQRASIRMDSMRMTVSDMGLSF-WSDIPSFTQPPKNK
        I    LLGL  LVG+A+++ +L + PK     VE   V   ++   D  +NA  ++ + SYNP +  S+R  SMR++ +    S    +I  F Q PKN+
Subjt:  ITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQVIPHSL--TDRKLNATIAFTVVSYNPNQRASIRMDSMRMTVSDMGLSF-WSDIPSFTQPPKNK

Query:  T-VLTSAIQGNF-INPFGR--MKENMKLDGISPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFNDST
        T + T  +  N  ++ F    ++       I  +++ +A+VSY    + SR R ++  C  + +    S+
Subjt:  T-VLTSAIQGNF-INPFGR--MKENMKLDGISPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFNDST

AT2G35980.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family4.7e-0825.25Show/hide
Query:  YVQKHGAAKRTRVLRITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQV--IPHSLTDRKLNATIAFTVVSYNPNQRASIRMDSMRMTVSDMGLSF
        Y + HG      +L +  + ++ L++++G+A ++ WL+V P+     V    +    H+  D  L   +A TV   NPN+R  +  D +       G  F
Subjt:  YVQKHGAAKRTRVLRITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQV--IPHSLTDRKLNATIAFTVVSYNPNQRASIRMDSMRMTVSDMGLSF

Query:  WS-DIPSFTQPPKNKTVLTSAIQGNFINPF----GRMKENMKLDGI-SPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFNDS------TTFDNKKCD
         +  +  F Q  KN TVLT   QG  +  F     R     ++ G+ + ++ F  +V + +     R    ++ C  LRL  + S      +T    KCD
Subjt:  WS-DIPSFTQPPKNKTVLTSAIQGNFINPF----GRMKENMKLDGI-SPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFNDS------TTFDNKKCD

Query:  VD
         D
Subjt:  VD

AT4G01410.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family8.2e-0528.1Show/hide
Query:  QGAGSSSSAKEISYVQKHGAAKRTRVLRITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQVIPHSLTDRKLNAT-IAFTVVSYNPNQRASIRMD-
        + A S+SS  E SY ++ G         I G  +  +++++GI  ++ WLV  P  P L V    +   + T   L +T + F+V++ NPN+R SI  D 
Subjt:  QGAGSSSSAKEISYVQKHGAAKRTRVLRITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQVIPHSLTDRKLNAT-IAFTVVSYNPNQRASIRMD-

Query:  -SMRMTVSDMGLSFWSDIPSFTQPPKNKTVLTSAIQGNFINPFGRMKENMKLD
         SM +T  D  ++    +P      K+  V+   + GN I     +   +K D
Subjt:  -SMRMTVSDMGLSFWSDIPSFTQPPKNKTVLTSAIQGNFINPFGRMKENMKLD

AT5G22870.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family2.2e-1027.12Show/hide
Query:  LLGLMILVGIAMIVCWLVVFPKTPDLIVETGQVIPHSLT-DRKLNATIAFTVVSYNPNQRASIRMDSMRMTV--SDMGLSFWSDIPSFTQPPKN-----K
        +L L+ +  +  ++ WL   PK     VE   V   +LT D  ++AT  FT+ S+NPN R S+   S+ + V   D  L+F   +  F QP  N     +
Subjt:  LLGLMILVGIAMIVCWLVVFPKTPDLIVETGQVIPHSLT-DRKLNATIAFTVVSYNPNQRASIRMDSMRMTV--SDMGLSFWSDIPSFTQPPKN-----K

Query:  TVLTSAIQGNFINPFGRMKENMKLDGISPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFNDSTTFDNKKCDVDI
        T++   +  +  N      +N  L  I  ++   A+V + +  W S  R  +I C  + +  +      N  CD DI
Subjt:  TVLTSAIQGNFINPFGRMKENMKLDGISPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFNDSTTFDNKKCDVDI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGAGCATTGCTACCCAAGGAGCAGGATCATCTTCATCAGCAAAAGAAATATCCTATGTACAAAAACATGGTGCAGCAAAAAGAACAAGGGTGTTGAGAATCACAGG
AAGAACCTTATTGGGTTTAATGATTCTTGTGGGTATTGCAATGATTGTATGTTGGCTTGTTGTGTTTCCCAAAACCCCAGATCTCATTGTGGAAACTGGCCAAGTTATAC
CCCATAGTTTAACTGATAGAAAGTTGAATGCTACCATAGCTTTTACTGTTGTAAGCTATAACCCTAACCAAAGAGCCTCCATTCGTATGGATTCTATGAGGATGACAGTT
AGTGATATGGGGTTGTCGTTTTGGTCCGACATCCCCAGCTTCACCCAGCCACCGAAAAACAAAACCGTCTTGACCTCTGCCATCCAAGGCAACTTCATAAACCCATTTGG
GCGCATGAAAGAAAATATGAAGTTGGATGGGATCAGTCCGAAGCTTCACTTCTCGGCCAAAGTCAGTTACATTATGGAGAGATGGGCATCGAGAGGTCGGGTGGTGGAGA
TCTATTGTGGTAGCCTTAGGCTTAAGTTCAATGATTCTACAACTTTTGATAATAAAAAGTGCGATGTTGATATTTGA
mRNA sequenceShow/hide mRNA sequence
CAAAAACCCTAATTCAAAATAGAGAACATATTCCTCTACCCATCATCAAGCAAACCAAAATAGTACACATACACACAGTTCTTATACTCATCAATATAATATATCATCCC
TTCACTTAAACATCAAAAAACAAATATCAAAACCACCAAAAGTAAAAAAGTGGTGAAGAAAAAACAGATGAGGAGCATTGCTACCCAAGGAGCAGGATCATCTTCATCAG
CAAAAGAAATATCCTATGTACAAAAACATGGTGCAGCAAAAAGAACAAGGGTGTTGAGAATCACAGGAAGAACCTTATTGGGTTTAATGATTCTTGTGGGTATTGCAATG
ATTGTATGTTGGCTTGTTGTGTTTCCCAAAACCCCAGATCTCATTGTGGAAACTGGCCAAGTTATACCCCATAGTTTAACTGATAGAAAGTTGAATGCTACCATAGCTTT
TACTGTTGTAAGCTATAACCCTAACCAAAGAGCCTCCATTCGTATGGATTCTATGAGGATGACAGTTAGTGATATGGGGTTGTCGTTTTGGTCCGACATCCCCAGCTTCA
CCCAGCCACCGAAAAACAAAACCGTCTTGACCTCTGCCATCCAAGGCAACTTCATAAACCCATTTGGGCGCATGAAAGAAAATATGAAGTTGGATGGGATCAGTCCGAAG
CTTCACTTCTCGGCCAAAGTCAGTTACATTATGGAGAGATGGGCATCGAGAGGTCGGGTGGTGGAGATCTATTGTGGTAGCCTTAGGCTTAAGTTCAATGATTCTACAAC
TTTTGATAATAAAAAGTGCGATGTTGATATTTGAGATTTTCCATGTTGTAATTGTGTTCTTTCTCAATAATGATCCCAAAGATTTGTAACACAAAGTTTTAATTGTGTTA
TTTTTATGAATAGAAAGAGTTCTTGTAATTA
Protein sequenceShow/hide protein sequence
MRSIATQGAGSSSSAKEISYVQKHGAAKRTRVLRITGRTLLGLMILVGIAMIVCWLVVFPKTPDLIVETGQVIPHSLTDRKLNATIAFTVVSYNPNQRASIRMDSMRMTV
SDMGLSFWSDIPSFTQPPKNKTVLTSAIQGNFINPFGRMKENMKLDGISPKLHFSAKVSYIMERWASRGRVVEIYCGSLRLKFNDSTTFDNKKCDVDI