CuGenDBv2

Cmc12g0321631 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc12g0321631
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family, putative
Genome location: CMiso1.1chr12:8711169..8712742
RNA-Seq Expression: Cmc12g0321631
Synteny: Cmc12g0321631
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_008462337.1 PREDICTED: NDR1/HIN1-like protein 12 [Cucumis melo] (e value: 2.6e-106; % identity: 99.5)
Query:  MRSITTTKGEASSSRKSSKGQNETTKKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHNSTHNMLNATIAFTVKCYNPNKRASIH
        MRSITTTKGEASSSRKSSKGQNETTKKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHNSTHNMLNATIAFTVKCYNPNKRASIH
Subjt:  MRSITTTKGEASSSRKSSKGQNETTKKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHNSTHNMLNATIAFTVKCYNPNKRASIH

Query:  LHSMRMIITSMGQAFPSAMPMFMQTPGNQTVLSPAVDVNFDYPFGHREEINPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRTFENTKCNVDL
        LHSMRMIITSMGQAF SAMPMFMQTPGNQTVLSPAVDVNFDYPFGHREEINPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRTFENTKCNVDL
Subjt:  LHSMRMIITSMGQAFPSAMPMFMQTPGNQTVLSPAVDVNFDYPFGHREEINPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRTFENTKCNVDL

XP_008462348.1 PREDICTED: protein YLS9-like [Cucumis melo] (e value: 4.1e-64; % identity: 70.45)
Query:  TKKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHNSTHNMLNATIAFTVKCYNPNKRASIHLHSMRMIITSMGQAFPSAMPMFMQ
        TK+TR+IR++GR LL VIFLV L M+ICWLVV PK PR +VETGKV+ H+ST +MLNATIAFTVK YNPNKRASIH+  MRMI+ +MG  F SA+P F  
Subjt:  TKKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHNSTHNMLNATIAFTVKCYNPNKRASIHLHSMRMIITSMGQAFPSAMPMFMQ

Query:  TPGNQTVLSPAVDVNFDYPFGHREEINPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRTFENTKCNVDL
        TP NQTVL  AV VNF+YPFG+ EEINPEL FSAE+SYSV  W S+PRLL+IYCN+LLL+IND+ TF+NTKC VDL
Subjt:  TPGNQTVLSPAVDVNFDYPFGHREEINPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRTFENTKCNVDL

XP_011648454.1 uncharacterized protein LOC105434469 [Cucumis sativus] (e value: 6.3e-97; % identity: 91)
Query:  MRSITTTKGEASSSRKSSKGQNETTKKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHNSTHNMLNATIAFTVKCYNPNKRASIH
        MRS TT KGEASSS+KSSKGQNETTKKTRIIRIIGR LLSVIFLVGLAMVICWLVVFPKNPR  VETG+VIAHNSTHNMLNATI FTVKCYNPNKRAS+H
Subjt:  MRSITTTKGEASSSRKSSKGQNETTKKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHNSTHNMLNATIAFTVKCYNPNKRASIH

Query:  LHSMRMIITSMGQAFPSAMPMFMQTPGNQTVLSPAVDVNFDYPFGHREEINPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRTFENTKCNVDL
        LHSMRMI+TSMGQAF S +P FMQTPGNQTVLSPAV+VNFDYPFGH+EEINPELHFSAEISYSV HWTSRPRLL IYCNNLLLRINDTRTFENTKCNVDL
Subjt:  LHSMRMIITSMGQAFPSAMPMFMQTPGNQTVLSPAVDVNFDYPFGHREEINPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRTFENTKCNVDL

XP_038895359.1 NDR1/HIN1-like protein 10 [Benincasa hispida] (e value: 2.9e-65; % identity: 67)
Query:  MRSITTTKGEAS--SSRKSS-KGQNETTKKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHNSTHNMLNATIAFTVKCYNPNKRA
        MRS T T+GE S  SS++SS + Q+ T K+TRIIRIIGR LL VIFLVGLA++ICWLVV PK PR +VETG VI H+STHNML ATI FT + YNPNKRA
Subjt:  MRSITTTKGEAS--SSRKSS-KGQNETTKKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHNSTHNMLNATIAFTVKCYNPNKRA

Query:  SIHLHSMRMIITSMGQAFPSAMPMFMQTPGNQTVLSPAVDVNFDYPFGHREEINPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRTFENTKCN
        +IH++SMRMI+ S+G+ F S +P F  TP NQTVLS AV VNF+YPFG  EEI+PEL FSAE+SYS+  W S+PRLL+IYCN+LLL+IN + TF+NTKC 
Subjt:  SIHLHSMRMIITSMGQAFPSAMPMFMQTPGNQTVLSPAVDVNFDYPFGHREEINPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRTFENTKCN

Query:  VDL
        VDL
Subjt:  VDL

XP_038895526.1 NDR1/HIN1-like protein 26 isoform X1 [Benincasa hispida] (e value: 3.2e-64; % identity: 65.8)
Query:  KGEASSSRKSSKGQNETTKKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHNSTHNMLNATIAFTVKCYNPNKRASIHLHSMRMI
        +G   +S++ S  ++ TTK+TRIIRIIGR LLSVI L+GLA++ CW+VVFPK PRF+VETG+VIA +ST  MLNATIA+TVK YNPNKRASIH+ SMRMI
Subjt:  KGEASSSRKSSKGQNETTKKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHNSTHNMLNATIAFTVKCYNPNKRASIHLHSMRMI

Query:  ITSMGQAFPSAMPMFMQTPGNQTVLSPAVDVNFDYPFGHREEINPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRTFENTKCNVDL
        +T+MGQ F S +P F Q PGN TVLS AV  NF+YPFG +EEI+P L F A++SYSV  W S+PRLL+IYC+ L L IND+RTFENT+C VDL
Subjt:  ITSMGQAFPSAMPMFMQTPGNQTVLSPAVDVNFDYPFGHREEINPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRTFENTKCNVDL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LVK9 LEA_2 domain-containing protein (e value: 3.1e-97; % identity: 91)
Query:  MRSITTTKGEASSSRKSSKGQNETTKKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHNSTHNMLNATIAFTVKCYNPNKRASIH
        MRS TT KGEASSS+KSSKGQNETTKKTRIIRIIGR LLSVIFLVGLAMVICWLVVFPKNPR  VETG+VIAHNSTHNMLNATI FTVKCYNPNKRAS+H
Subjt:  MRSITTTKGEASSSRKSSKGQNETTKKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHNSTHNMLNATIAFTVKCYNPNKRASIH

Query:  LHSMRMIITSMGQAFPSAMPMFMQTPGNQTVLSPAVDVNFDYPFGHREEINPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRTFENTKCNVDL
        LHSMRMI+TSMGQAF S +P FMQTPGNQTVLSPAV+VNFDYPFGH+EEINPELHFSAEISYSV HWTSRPRLL IYCNNLLLRINDTRTFENTKCNVDL
Subjt:  LHSMRMIITSMGQAFPSAMPMFMQTPGNQTVLSPAVDVNFDYPFGHREEINPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRTFENTKCNVDL

A0A0A0LY09 Uncharacterized protein (e value: 4.5e-64; % identity: 64.82)
Query:  MRSITTTKGEASSSRKSSKGQNETTKKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHNSTHNMLNATIAFTVKCYNPNKRASIH
        MRS TTT              +  TK+TR+IR++GR LL VIFLV L M+ICWLVV PK+PR +VETGKVIAH+ST +MLNATIAFTVK YNPNKRASIH
Subjt:  MRSITTTKGEASSSRKSSKGQNETTKKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHNSTHNMLNATIAFTVKCYNPNKRASIH

Query:  LHSMRMIITSMGQAFPSAMPMFMQTPGNQTVLSPAVDVNFDYPFGHREEINPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRTFENTKCNVD
        +  MRMI+ +MG  F SA+P F  TP NQTVLS AV VNF+YPFG+ EEINPEL FSAE+SYS+  W S+PRLL+IYCN++LL+IND+  F+NTKC VD
Subjt:  LHSMRMIITSMGQAFPSAMPMFMQTPGNQTVLSPAVDVNFDYPFGHREEINPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRTFENTKCNVD

A0A1S3CGR5 NDR1/HIN1-like protein 12 (e value: 1.2e-106; % identity: 99.5)
Query:  MRSITTTKGEASSSRKSSKGQNETTKKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHNSTHNMLNATIAFTVKCYNPNKRASIH
        MRSITTTKGEASSSRKSSKGQNETTKKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHNSTHNMLNATIAFTVKCYNPNKRASIH
Subjt:  MRSITTTKGEASSSRKSSKGQNETTKKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHNSTHNMLNATIAFTVKCYNPNKRASIH

Query:  LHSMRMIITSMGQAFPSAMPMFMQTPGNQTVLSPAVDVNFDYPFGHREEINPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRTFENTKCNVDL
        LHSMRMIITSMGQAF SAMPMFMQTPGNQTVLSPAVDVNFDYPFGHREEINPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRTFENTKCNVDL
Subjt:  LHSMRMIITSMGQAFPSAMPMFMQTPGNQTVLSPAVDVNFDYPFGHREEINPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRTFENTKCNVDL

A0A1S3CGS4 protein YLS9-like (e value: 2.0e-64; % identity: 70.45)
Query:  TKKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHNSTHNMLNATIAFTVKCYNPNKRASIHLHSMRMIITSMGQAFPSAMPMFMQ
        TK+TR+IR++GR LL VIFLV L M+ICWLVV PK PR +VETGKV+ H+ST +MLNATIAFTVK YNPNKRASIH+  MRMI+ +MG  F SA+P F  
Subjt:  TKKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHNSTHNMLNATIAFTVKCYNPNKRASIHLHSMRMIITSMGQAFPSAMPMFMQ

Query:  TPGNQTVLSPAVDVNFDYPFGHREEINPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRTFENTKCNVDL
        TP NQTVL  AV VNF+YPFG+ EEINPEL FSAE+SYSV  W S+PRLL+IYCN+LLL+IND+ TF+NTKC VDL
Subjt:  TPGNQTVLSPAVDVNFDYPFGHREEINPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRTFENTKCNVDL

A0A5A7TF76 NDR1/HIN1-like protein 12 (e value: 1.2e-106; % identity: 99.5)
Query:  MRSITTTKGEASSSRKSSKGQNETTKKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHNSTHNMLNATIAFTVKCYNPNKRASIH
        MRSITTTKGEASSSRKSSKGQNETTKKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHNSTHNMLNATIAFTVKCYNPNKRASIH
Subjt:  MRSITTTKGEASSSRKSSKGQNETTKKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHNSTHNMLNATIAFTVKCYNPNKRASIH

Query:  LHSMRMIITSMGQAFPSAMPMFMQTPGNQTVLSPAVDVNFDYPFGHREEINPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRTFENTKCNVDL
        LHSMRMIITSMGQAF SAMPMFMQTPGNQTVLSPAVDVNFDYPFGHREEINPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRTFENTKCNVDL
Subjt:  LHSMRMIITSMGQAFPSAMPMFMQTPGNQTVLSPAVDVNFDYPFGHREEINPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRTFENTKCNVDL

SwissProt top hits (e value, % identity, alignment)
Q8VZ13 Uncharacterized protein At1g08160 (e value: 2.6e-08; % identity: 28.74)
Query:  LVGLAMVICWLVVFPKNPRFVVETGKV----IAHNSTHNMLNATIAFTVKCYNPNKRASIHLHSMRMIITSMGQAFP-SAMPMFMQTPGNQT-VLSPAVD
        LVGLA++I +L + PK   + VE   V    I +N  H  +NA  ++ +K YNP K  S+  HSMR+      Q+     +  F Q P N+T + +  V 
Subjt:  LVGLAMVICWLVVFPKNPRFVVETGKV----IAHNSTHNMLNATIAFTVKCYNPNKRASIHLHSMRMIITSMGQAFP-SAMPMFMQTPGNQT-VLSPAVD

Query:  VNFDY-PFGHRE--------EINPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRT--FENTKCNVDL
         N     F  R+         I  E++ +A +SY    + SR R L+  C  +++ +  +    F+   C   L
Subjt:  VNFDY-PFGHRE--------EINPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRT--FENTKCNVDL

Q9SJ52 NDR1/HIN1-like protein 10 (e value: 4.5e-05; % identity: 25.26)
Query:  IIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKV--IAHNSTHNMLNATIAFTVKCYNPNKRASIHLHSMRMIITSMGQAFPS-AMPMFMQTP
        ++ +  + ++S+I ++G+A +I WL+V P+  +F V    +    H S  N+L   +A TV   NPNKR  ++   +       G+ F +  +  F Q  
Subjt:  IIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKV--IAHNSTHNMLNATIAFTVKCYNPNKRASIHLHSMRMIITSMGQAFPS-AMPMFMQTP

Query:  GNQTVLSPAVDVNFDYPFG-------HREEI----NPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRIN------DTRTFENTKCNVD
         N TVL+P         F        + E I    N E+ F   + + +G    R    ++ C++L L ++       T T    KC+ D
Subjt:  GNQTVLSPAVDVNFDYPFG-------HREEI----NPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRIN------DTRTFENTKCNVD

Q9SRN0 NDR1/HIN1-like protein 1 (e value: 5.9e-05; % identity: 25.62)
Query:  QNETTKKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHN---STHNMLNATIAFTVKCYNPNKRASIHLHSMRMIITSMGQ--AF
        +N    + ++IR I   ++ V+F++ L +++ W ++ P  PRF+++   V A N   +  N+L +    T+   NPN +  I+   + +  T   Q   F
Subjt:  QNETTKKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHN---STHNMLNATIAFTVKCYNPNKRASIHLHSMRMIITSMGQ--AF

Query:  PSAMPMFMQTPGNQTVLSPAV
        P+++P   Q   +  + SP V
Subjt:  PSAMPMFMQTPGNQTVLSPAV

Arabidopsis top hits (e value, % identity, alignment)
AT1G08160.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 1.8e-09; % identity: 28.74)
Query:  LVGLAMVICWLVVFPKNPRFVVETGKV----IAHNSTHNMLNATIAFTVKCYNPNKRASIHLHSMRMIITSMGQAFP-SAMPMFMQTPGNQT-VLSPAVD
        LVGLA++I +L + PK   + VE   V    I +N  H  +NA  ++ +K YNP K  S+  HSMR+      Q+     +  F Q P N+T + +  V 
Subjt:  LVGLAMVICWLVVFPKNPRFVVETGKV----IAHNSTHNMLNATIAFTVKCYNPNKRASIHLHSMRMIITSMGQAFP-SAMPMFMQTPGNQT-VLSPAVD

Query:  VNFDY-PFGHRE--------EINPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRT--FENTKCNVDL
         N     F  R+         I  E++ +A +SY    + SR R L+  C  +++ +  +    F+   C   L
Subjt:  VNFDY-PFGHRE--------EINPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRT--FENTKCNVDL

AT2G35980.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 3.2e-06; % identity: 25.26)
Query:  IIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKV--IAHNSTHNMLNATIAFTVKCYNPNKRASIHLHSMRMIITSMGQAFPS-AMPMFMQTP
        ++ +  + ++S+I ++G+A +I WL+V P+  +F V    +    H S  N+L   +A TV   NPNKR  ++   +       G+ F +  +  F Q  
Subjt:  IIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKV--IAHNSTHNMLNATIAFTVKCYNPNKRASIHLHSMRMIITSMGQAFPS-AMPMFMQTP

Query:  GNQTVLSPAVDVNFDYPFG-------HREEI----NPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRIN------DTRTFENTKCNVD
         N TVL+P         F        + E I    N E+ F   + + +G    R    ++ C++L L ++       T T    KC+ D
Subjt:  GNQTVLSPAVDVNFDYPFG-------HREEI----NPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRIN------DTRTFENTKCNVD

AT3G11660.1 NDR1/HIN1-like 1 (e value: 4.2e-06; % identity: 25.62)
Query:  QNETTKKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHN---STHNMLNATIAFTVKCYNPNKRASIHLHSMRMIITSMGQ--AF
        +N    + ++IR I   ++ V+F++ L +++ W ++ P  PRF+++   V A N   +  N+L +    T+   NPN +  I+   + +  T   Q   F
Subjt:  QNETTKKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHN---STHNMLNATIAFTVKCYNPNKRASIHLHSMRMIITSMGQ--AF

Query:  PSAMPMFMQTPGNQTVLSPAV
        P+++P   Q   +  + SP V
Subjt:  PSAMPMFMQTPGNQTVLSPAV

AT4G01410.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 2.1e-05; % identity: 27.03)
Query:  RIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHNST-HNMLNATIAFTVKCYNPNKRASIHLHSMRMIITSMGQAFPSAMPMFMQTPGNQT
        R I   + +++ ++G+  +I WLV  P  PR  V    +   N T   +++ ++ F+V   NPN+R SIH   + M +T   Q     +P+     G+++
Subjt:  RIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHNST-HNMLNATIAFTVKCYNPNKRASIHLHSMRMIITSMGQAFPSAMPMFMQTPGNQT

Query:  --VLSPAVDVN
          V++P +  N
Subjt:  --VLSPAVDVN

AT5G22870.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 1.1e-11; % identity: 25.13)
Query:  KKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHNSTH-NMLNATIAFTVKCYNPNKRASIHLHSMRMIITSMGQAF------PSA
        ++  +I  I   +L++IF+  +  +I WL   PK  R+ VE   V   N T+ N ++AT  FT++ +NPN R S++  S+ + +    Q        P  
Subjt:  KKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHNSTH-NMLNATIAFTVKCYNPNKRASIHLHSMRMIITSMGQAF------PSA

Query:  MPMFMQTPGNQTVLSPAVDVNFDYPFGHREE-----INPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRTFENTKCNVDL
         P       ++T+++  V V+       R +     I  E+   A + + VG W S  R  +I C+++ + ++     +N+ C+ D+
Subjt:  MPMFMQTPGNQTVLSPAVDVNFDYPFGHREE-----INPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRTFENTKCNVDL


Sequences
CDS sequence
ATGAGGAGCATTACTACAACAAAAGGAGAAGCATCATCGTCAAGAAAAAGCTCTAAAGGGCAAAATGAGACAACAAAAAAGACAAGAATTATAAGAATCATAGGAAGATG
TTTGTTGAGTGTAATATTTCTTGTTGGTCTTGCAATGGTCATATGTTGGCTTGTCGTGTTCCCCAAAAATCCTCGTTTCGTTGTGGAAACTGGCAAAGTAATAGCCCATA
ATTCAACTCATAATATGCTCAATGCCACCATAGCTTTCACTGTTAAATGCTACAACCCCAACAAAAGAGCCTCCATTCATTTGCACTCTATGAGGATGATAATTACTAGT
ATGGGCCAGGCATTTCCGTCCGCCATGCCGATGTTTATGCAGACTCCTGGAAACCAAACCGTCTTGTCCCCTGCTGTCGATGTCAACTTCGACTACCCATTTGGGCACCG
GGAAGAGATAAATCCCGAGCTTCACTTCTCTGCTGAAATCAGCTATAGTGTCGGGCACTGGACGTCGAGACCTCGGTTGCTCCAGATCTATTGTAATAATCTCTTGCTGA
GGATCAATGATACTAGAACTTTTGAAAATACCAAATGCAATGTGGATCTTTGA
mRNA sequence
CCTAATGGTGAATAGTGGCATATATTCTTGTTCCTCTTCATTAAGGAAACCAAAACACACCTTCTTATACTTCCCATCAACATCTCCCTTCACTTATAGCATGAGGGAGA
AAAAAAAAATCAAAACCACTCAAAAAAGAAAAAAGGTAAGAATTTTGTGAAGAACAACAAATGAGGAGCATTACTACAACAAAAGGAGAAGCATCATCGTCAAGAAAAAG
CTCTAAAGGGCAAAATGAGACAACAAAAAAGACAAGAATTATAAGAATCATAGGAAGATGTTTGTTGAGTGTAATATTTCTTGTTGGTCTTGCAATGGTCATATGTTGGC
TTGTCGTGTTCCCCAAAAATCCTCGTTTCGTTGTGGAAACTGGCAAAGTAATAGCCCATAATTCAACTCATAATATGCTCAATGCCACCATAGCTTTCACTGTTAAATGC
TACAACCCCAACAAAAGAGCCTCCATTCATTTGCACTCTATGAGGATGATAATTACTAGTATGGGCCAGGCATTTCCGTCCGCCATGCCGATGTTTATGCAGACTCCTGG
AAACCAAACCGTCTTGTCCCCTGCTGTCGATGTCAACTTCGACTACCCATTTGGGCACCGGGAAGAGATAAATCCCGAGCTTCACTTCTCTGCTGAAATCAGCTATAGTG
TCGGGCACTGGACGTCGAGACCTCGGTTGCTCCAGATCTATTGTAATAATCTCTTGCTGAGGATCAATGATACTAGAACTTTTGAAAATACCAAATGCAATGTGGATCTT
TGAGATCTGATATATTTGTTTATGTTATTCTTGTTGAATAATCCAACAGATTTGTAACCTAAAAAGTTCATTTTTCTTTTCGGTTTTTTTTCTAATAATCAAGAAAAATG
TTATGGAACATGTGAGAGGTTCAGACAGGTTAGAATATATTATAGAATTTGTTATAGGTGTGGTGTAAAAAAAATAGAAAAAGCATATAAATTTTAGCTTAATTGCTTAA
AAAGATCCATATAACTTGGAACCTCTAAAGGTAATTTTTTAT
Protein sequence
MRSITTTKGEASSSRKSSKGQNETTKKTRIIRIIGRCLLSVIFLVGLAMVICWLVVFPKNPRFVVETGKVIAHNSTHNMLNATIAFTVKCYNPNKRASIHLHSMRMIITS
MGQAFPSAMPMFMQTPGNQTVLSPAVDVNFDYPFGHREEINPELHFSAEISYSVGHWTSRPRLLQIYCNNLLLRINDTRTFENTKCNVDL