CuGenDBv2

Clc01G02410 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc01G02410
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: LEA_2 domain-containing protein
Genome location: ClcChr01:2105689..2107015
RNA-Seq Expression: Clc01G02410
Synteny: Clc01G02410
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup
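
The genome location field above packs the chromosome and both coordinates into one string. A minimal sketch of how it could be split is shown here; the function name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(loc):
    """Split 'chrom:start..end' into (chrom, start, end, span_length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


print(parse_location("ClcChr01:2105689..2107015"))
```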


Homology
GenBank top hits | e value | % identity
KAA0042716.1 late embryogenesis abundant protein [Cucumis melo var. makuwa] | 5.8e-85 | 82.08
Query:  MVEDSQSFPLAHYQAHHKSDQEQQLATFKTLRKERSNKCFIYIFSAFVFLSVAVLIFALIVLRLNSPSIRLSSVSIPKFSITNA-NSSSP-SLNLTLIAE
        M EDSQSFPLAHYQAHHK+D+EQQLATFKTL KERSNKCFIYIFS FVFLSVA+LIFALIVLR+NSPSI LS+VSIPKFS++NA NSSSP SL+L+  A 
Subjt:  MVEDSQSFPLAHYQAHHKSDQEQQLATFKTLRKERSNKCFIYIFSAFVFLSVAVLIFALIVLRLNSPSIRLSSVSIPKFSITNA-NSSSP-SLNLTLIAE

Query:  FTVDNSNFGPFDFDNGTVGLMYGSAIIGERSTGAGRAEAKGSMRMNVTVEASAKNISGDSNNLGILNLSSFAKLRGRVRLIHIFRRRISSEVSCSMNLDL
        F VDNSNFGPF+FDNGTVGL+YG  I GERSTG GRAEAKGS RMNVTVE SAKN+SG +   GIL+LSSF KLRGRVRLIH+FRRR+SSE+SCSMNLDL
Subjt:  FTVDNSNFGPFDFDNGTVGLMYGSAIIGERSTGAGRAEAKGSMRMNVTVEASAKNISGDSNNLGILNLSSFAKLRGRVRLIHIFRRRISSEVSCSMNLDL

Query:  NTHQIQHNWVCE
        NTHQIQHNWVCE
Subjt:  NTHQIQHNWVCE

KAE8647845.1 hypothetical protein Csa_000600 [Cucumis sativus] | 3.9e-81 | 80.19
Query:  MVEDSQSFPLAHYQAHHKSDQEQQLATFKTLRKERSNKCFIYIFSAFVFLSVAVLIFALIVLRLNSPSIRLSSVSIPKFSIT-NANSSSP-SLNLTLIAE
        M EDSQSFPLAHYQAHHK ++EQQLATFK LRKERSNKCFIYIFS FVFLSVA+LIFALIVLR+NSPSI LSS+S P+ S++ N NSSSP SLNL+  AE
Subjt:  MVEDSQSFPLAHYQAHHKSDQEQQLATFKTLRKERSNKCFIYIFSAFVFLSVAVLIFALIVLRLNSPSIRLSSVSIPKFSIT-NANSSSP-SLNLTLIAE

Query:  FTVDNSNFGPFDFDNGTVGLMYGSAIIGERSTGAGRAEAKGSMRMNVTVEASAKNISGDSNNLGILNLSSFAKLRGRVRLIHIFRRRISSEVSCSMNLDL
        FTVDNSNFGPF+FDNGTVGL+YG  I GERSTG GRA AKGS RMNVTVE SAKN+SG +   GILN SSF KLRGRVRLIHIFRRR+SSE+SCSMNLDL
Subjt:  FTVDNSNFGPFDFDNGTVGLMYGSAIIGERSTGAGRAEAKGSMRMNVTVEASAKNISGDSNNLGILNLSSFAKLRGRVRLIHIFRRRISSEVSCSMNLDL

Query:  NTHQIQHNWVCE
        NTHQIQHNW  E
Subjt:  NTHQIQHNWVCE

XP_004143966.1 late embryogenesis abundant protein At1g64065 [Cucumis sativus] | 4.2e-83 | 81.13
Query:  MVEDSQSFPLAHYQAHHKSDQEQQLATFKTLRKERSNKCFIYIFSAFVFLSVAVLIFALIVLRLNSPSIRLSSVSIPKFSIT-NANSSSP-SLNLTLIAE
        M EDSQSFPLAHYQAHHK ++EQQLATFK LRKERSNKCFIYIFS FVFLSVA+LIFALIVLR+NSPSI LSS+S P+ S++ N NSSSP SLNL+  AE
Subjt:  MVEDSQSFPLAHYQAHHKSDQEQQLATFKTLRKERSNKCFIYIFSAFVFLSVAVLIFALIVLRLNSPSIRLSSVSIPKFSIT-NANSSSP-SLNLTLIAE

Query:  FTVDNSNFGPFDFDNGTVGLMYGSAIIGERSTGAGRAEAKGSMRMNVTVEASAKNISGDSNNLGILNLSSFAKLRGRVRLIHIFRRRISSEVSCSMNLDL
        FTVDNSNFGPF+FDNGTVGL+YG  I GERSTG GRA AKGS RMNVTVE SAKN+SG +   GILN SSF KLRGRVRLIHIFRRR+SSE+SCSMNLDL
Subjt:  FTVDNSNFGPFDFDNGTVGLMYGSAIIGERSTGAGRAEAKGSMRMNVTVEASAKNISGDSNNLGILNLSSFAKLRGRVRLIHIFRRRISSEVSCSMNLDL

Query:  NTHQIQHNWVCE
        NTHQIQHNWVCE
Subjt:  NTHQIQHNWVCE

XP_008437349.1 PREDICTED: late embryogenesis abundant protein At1g64065 [Cucumis melo] | 1.2e-85 | 82.55
Query:  MVEDSQSFPLAHYQAHHKSDQEQQLATFKTLRKERSNKCFIYIFSAFVFLSVAVLIFALIVLRLNSPSIRLSSVSIPKFSITNA-NSSSP-SLNLTLIAE
        M EDSQSFPLAHYQAHHK+D+EQQLATFKTL KERSNKCFIYIFS FVFLSVA+LIFALIVLR+NSPSI LS+VSIPKFS++NA NSSSP SL+L+  A 
Subjt:  MVEDSQSFPLAHYQAHHKSDQEQQLATFKTLRKERSNKCFIYIFSAFVFLSVAVLIFALIVLRLNSPSIRLSSVSIPKFSITNA-NSSSP-SLNLTLIAE

Query:  FTVDNSNFGPFDFDNGTVGLMYGSAIIGERSTGAGRAEAKGSMRMNVTVEASAKNISGDSNNLGILNLSSFAKLRGRVRLIHIFRRRISSEVSCSMNLDL
        FTVDNSNFGPF+FDNGTVGL+YG  I GERSTG GRAEAKGS RMNVTVE SAKN+SG +   GIL+LSSF KLRGRVRLIH+FRRR+SSE+SCSMNLDL
Subjt:  FTVDNSNFGPFDFDNGTVGLMYGSAIIGERSTGAGRAEAKGSMRMNVTVEASAKNISGDSNNLGILNLSSFAKLRGRVRLIHIFRRRISSEVSCSMNLDL

Query:  NTHQIQHNWVCE
        NTHQIQHNWVCE
Subjt:  NTHQIQHNWVCE

XP_038875090.1 late embryogenesis abundant protein At1g64065 [Benincasa hispida] | 2.7e-98 | 90.48
Query:  MVEDSQSFPLAHYQAHHKSDQEQQLATFKTLRKERSNKCFIYIFSAFVFLSVAVLIFALIVLRLNSPSIRLSSVSIPKFSITNANSSSPSLNLTLIAEFT
        MVEDSQSFPLAHYQAHHKSD+EQQLATFKTLRKERSNKCFIY+FS FVFLSVAVLIFALIVLR+NSPSI+LSSVSIPKFSITNANSSSPSLNLT+IAEFT
Subjt:  MVEDSQSFPLAHYQAHHKSDQEQQLATFKTLRKERSNKCFIYIFSAFVFLSVAVLIFALIVLRLNSPSIRLSSVSIPKFSITNANSSSPSLNLTLIAEFT

Query:  VDNSNFGPFDFDNGTVGLMYGSAIIGERSTGAGRAEAKGSMRMNVTVEASAKNISGDSNNLGILNLSSFAKLRGRVRLIHIFRRRISSEVSCSMNLDLNT
        VDNSNFGPF+FDNGTVGLMYG AI+GE+STGAGRAEAKGS RMNVT+EASAKNIS DSNNLGILNL+SF KLRGRVRLIHIFRRR SSE++CSMNLD+NT
Subjt:  VDNSNFGPFDFDNGTVGLMYGSAIIGERSTGAGRAEAKGSMRMNVTVEASAKNISGDSNNLGILNLSSFAKLRGRVRLIHIFRRRISSEVSCSMNLDLNT

Query:  HQIQHNWVCE
        HQIQ+NWVCE
Subjt:  HQIQHNWVCE

TrEMBL top hits | e value | % identity
A0A0A0KQT7 LEA_2 domain-containing protein | 2.0e-83 | 81.13
Query:  MVEDSQSFPLAHYQAHHKSDQEQQLATFKTLRKERSNKCFIYIFSAFVFLSVAVLIFALIVLRLNSPSIRLSSVSIPKFSIT-NANSSSP-SLNLTLIAE
        M EDSQSFPLAHYQAHHK ++EQQLATFK LRKERSNKCFIYIFS FVFLSVA+LIFALIVLR+NSPSI LSS+S P+ S++ N NSSSP SLNL+  AE
Subjt:  MVEDSQSFPLAHYQAHHKSDQEQQLATFKTLRKERSNKCFIYIFSAFVFLSVAVLIFALIVLRLNSPSIRLSSVSIPKFSIT-NANSSSP-SLNLTLIAE

Query:  FTVDNSNFGPFDFDNGTVGLMYGSAIIGERSTGAGRAEAKGSMRMNVTVEASAKNISGDSNNLGILNLSSFAKLRGRVRLIHIFRRRISSEVSCSMNLDL
        FTVDNSNFGPF+FDNGTVGL+YG  I GERSTG GRA AKGS RMNVTVE SAKN+SG +   GILN SSF KLRGRVRLIHIFRRR+SSE+SCSMNLDL
Subjt:  FTVDNSNFGPFDFDNGTVGLMYGSAIIGERSTGAGRAEAKGSMRMNVTVEASAKNISGDSNNLGILNLSSFAKLRGRVRLIHIFRRRISSEVSCSMNLDL

Query:  NTHQIQHNWVCE
        NTHQIQHNWVCE
Subjt:  NTHQIQHNWVCE

A0A1S3ATY3 late embryogenesis abundant protein At1g64065 | 5.7e-86 | 82.55
Query:  MVEDSQSFPLAHYQAHHKSDQEQQLATFKTLRKERSNKCFIYIFSAFVFLSVAVLIFALIVLRLNSPSIRLSSVSIPKFSITNA-NSSSP-SLNLTLIAE
        M EDSQSFPLAHYQAHHK+D+EQQLATFKTL KERSNKCFIYIFS FVFLSVA+LIFALIVLR+NSPSI LS+VSIPKFS++NA NSSSP SL+L+  A 
Subjt:  MVEDSQSFPLAHYQAHHKSDQEQQLATFKTLRKERSNKCFIYIFSAFVFLSVAVLIFALIVLRLNSPSIRLSSVSIPKFSITNA-NSSSP-SLNLTLIAE

Query:  FTVDNSNFGPFDFDNGTVGLMYGSAIIGERSTGAGRAEAKGSMRMNVTVEASAKNISGDSNNLGILNLSSFAKLRGRVRLIHIFRRRISSEVSCSMNLDL
        FTVDNSNFGPF+FDNGTVGL+YG  I GERSTG GRAEAKGS RMNVTVE SAKN+SG +   GIL+LSSF KLRGRVRLIH+FRRR+SSE+SCSMNLDL
Subjt:  FTVDNSNFGPFDFDNGTVGLMYGSAIIGERSTGAGRAEAKGSMRMNVTVEASAKNISGDSNNLGILNLSSFAKLRGRVRLIHIFRRRISSEVSCSMNLDL

Query:  NTHQIQHNWVCE
        NTHQIQHNWVCE
Subjt:  NTHQIQHNWVCE

A0A5A7TL68 Late embryogenesis abundant protein | 2.8e-85 | 82.08
Query:  MVEDSQSFPLAHYQAHHKSDQEQQLATFKTLRKERSNKCFIYIFSAFVFLSVAVLIFALIVLRLNSPSIRLSSVSIPKFSITNA-NSSSP-SLNLTLIAE
        M EDSQSFPLAHYQAHHK+D+EQQLATFKTL KERSNKCFIYIFS FVFLSVA+LIFALIVLR+NSPSI LS+VSIPKFS++NA NSSSP SL+L+  A 
Subjt:  MVEDSQSFPLAHYQAHHKSDQEQQLATFKTLRKERSNKCFIYIFSAFVFLSVAVLIFALIVLRLNSPSIRLSSVSIPKFSITNA-NSSSP-SLNLTLIAE

Query:  FTVDNSNFGPFDFDNGTVGLMYGSAIIGERSTGAGRAEAKGSMRMNVTVEASAKNISGDSNNLGILNLSSFAKLRGRVRLIHIFRRRISSEVSCSMNLDL
        F VDNSNFGPF+FDNGTVGL+YG  I GERSTG GRAEAKGS RMNVTVE SAKN+SG +   GIL+LSSF KLRGRVRLIH+FRRR+SSE+SCSMNLDL
Subjt:  FTVDNSNFGPFDFDNGTVGLMYGSAIIGERSTGAGRAEAKGSMRMNVTVEASAKNISGDSNNLGILNLSSFAKLRGRVRLIHIFRRRISSEVSCSMNLDL

Query:  NTHQIQHNWVCE
        NTHQIQHNWVCE
Subjt:  NTHQIQHNWVCE

A0A6J1H2C0 uncharacterized protein LOC111459739 | 2.3e-66 | 67.62
Query:  MVEDSQSFPLAHYQAHHKSDQEQQLATFKTLRKERSNKCFIYIFSAFVFLSVAVLIFALIVLRLNSPSIRLSSVSIPKFSITNANSSSPSLNLTLIAEFT
        M E S SFPL H QAHH           KT + E SNKCFIYIFS+FVFL VA+LIF+LIVLR+NSP+I LSS+S+ KFSI+N NSSS SLNLTLIAEF+
Subjt:  MVEDSQSFPLAHYQAHHKSDQEQQLATFKTLRKERSNKCFIYIFSAFVFLSVAVLIFALIVLRLNSPSIRLSSVSIPKFSITNANSSSPSLNLTLIAEFT

Query:  VDNSNFGPFDFDNGTVGLMYGSAIIGERSTGAGRAEAKGSMRMNVTVEASAKNISGDSNNLGILNLSSFAKLRGRVRLIHIFRRRISSEVSCSMNLDLNT
        +DNSNFGPF FD  TV  MYG  I+GERSTG GRAEAKG+ RMNV+VEAS +N+S D N  GILN+SSFAK  GR+ LIH+ R+RI SE+SCS+NLDLNT
Subjt:  VDNSNFGPFDFDNGTVGLMYGSAIIGERSTGAGRAEAKGSMRMNVTVEASAKNISGDSNNLGILNLSSFAKLRGRVRLIHIFRRRISSEVSCSMNLDLNT

Query:  HQIQHNWVCE
        HQIQ  WVC+
Subjt:  HQIQHNWVCE

A0A6J1I3M2 uncharacterized protein LOC111468875 | 4.0e-79 | 74.06
Query:  MVEDSQSFPLAHYQAHHKSDQEQQ--LATFKTLRKERSNKCFIYIFSAFVFLSVAVLIFALIVLRLNSPSIRLSSVSIPKFSITNANSSSPSLNLTLIAE
        M +DSQSFP+AHY+AHHKSD+EQ+  L TFK L+KERSNKCFIY+FSAFVFLSVAVLIFALIVLR+NSP++  SS+S+ KFS++N NSSSPSLNLT+ A+
Subjt:  MVEDSQSFPLAHYQAHHKSDQEQQ--LATFKTLRKERSNKCFIYIFSAFVFLSVAVLIFALIVLRLNSPSIRLSSVSIPKFSITNANSSSPSLNLTLIAE

Query:  FTVDNSNFGPFDFDNGTVGLMYGSAIIGERSTGAGRAEAKGSMRMNVTVEASAKNISGDSNNLGILNLSSFAKLRGRVRLIHIFRRRISSEVSCSMNLDL
          VDNSNFGPF+FD  +VG +Y  AI+G+ +TGAGRA+AKG+  MNVTV+ASA NIS D NN  +LNLSSFA LRGRVRLIHIFRRR SSE+SCSM LDL
Subjt:  FTVDNSNFGPFDFDNGTVGLMYGSAIIGERSTGAGRAEAKGSMRMNVTVEASAKNISGDSNNLGILNLSSFAKLRGRVRLIHIFRRRISSEVSCSMNLDL

Query:  NTHQIQHNWVCE
        NTHQIQHNWVCE
Subjt:  NTHQIQHNWVCE

SwissProt top hits | e value | % identity
Q6DST1 Late embryogenesis abundant protein At1g64065 | 6.6e-15 | 31.48
Query:  DSQSFPLAHYQAHHKSDQEQQ-LATFKTLRKERSNKCFIYIFSAFVFLSVAVLIFALIVLRLNSPSIRLSSVSIPKFSITNANSSSPSLNLTLIAEFTVD
        D     LA  + + +SD+EQ     ++   +E   KC +Y  +  V +    LI + I LR++ P I   S+S      +  NS++P  N TL+++ ++ 
Subjt:  DSQSFPLAHYQAHHKSDQEQQ-LATFKTLRKERSNKCFIYIFSAFVFLSVAVLIFALIVLRLNSPSIRLSSVSIPKFSITNANSSSPSLNLTLIAEFTVD

Query:  NSNFGPFDFDNGTVGLMYGS-AIIGERSTGAGRAEAKGSMRM-NVTVEASA------KNISGDSNNLGILNLSSFAKLRGRVRLIHIFRRRISSEVSCSM
        NSNFG F+F++ T+ ++Y    ++GE      R EA  ++R+  V VE  +      K++  D   LG L L S A++RGR++++   R ++ S +SC+M
Subjt:  NSNFGPFDFDNGTVGLMYGS-AIIGERSTGAGRAEAKGSMRM-NVTVEASA------KNISGDSNNLGILNLSSFAKLRGRVRLIHIFRRRISSEVSCSM

Query:  NLDLNTHQIQHNWVCE
         L+L    IQ N +CE
Subjt:  NLDLNTHQIQHNWVCE

Arabidopsis top hits | e value | % identity
AT1G64065.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 4.7e-16 | 31.48
Query:  DSQSFPLAHYQAHHKSDQEQQ-LATFKTLRKERSNKCFIYIFSAFVFLSVAVLIFALIVLRLNSPSIRLSSVSIPKFSITNANSSSPSLNLTLIAEFTVD
        D     LA  + + +SD+EQ     ++   +E   KC +Y  +  V +    LI + I LR++ P I   S+S      +  NS++P  N TL+++ ++ 
Subjt:  DSQSFPLAHYQAHHKSDQEQQ-LATFKTLRKERSNKCFIYIFSAFVFLSVAVLIFALIVLRLNSPSIRLSSVSIPKFSITNANSSSPSLNLTLIAEFTVD

Query:  NSNFGPFDFDNGTVGLMYGS-AIIGERSTGAGRAEAKGSMRM-NVTVEASA------KNISGDSNNLGILNLSSFAKLRGRVRLIHIFRRRISSEVSCSM
        NSNFG F+F++ T+ ++Y    ++GE      R EA  ++R+  V VE  +      K++  D   LG L L S A++RGR++++   R ++ S +SC+M
Subjt:  NSNFGPFDFDNGTVGLMYGS-AIIGERSTGAGRAEAKGSMRM-NVTVEASA------KNISGDSNNLGILNLSSFAKLRGRVRLIHIFRRRISSEVSCSM

Query:  NLDLNTHQIQHNWVCE
         L+L    IQ N +CE
Subjt:  NLDLNTHQIQHNWVCE

AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 6.0e-11 | 24.21
Query:  QEQQLATFKTLRKERSNK-CFIYIFSAFVFLSVAVLIFALIVLRLNSPSIRLSSVSIPKFSIT-NANSSSPSLNLTLIAEFTVDNSNFGPFDFDNGTVGL
        Q     T K LR++R+ K C  +     + +++ ++I A  + +   P+  + SV++ +   + N       LNLTL  + ++ N N   F +D+ +  L
Subjt:  QEQQLATFKTLRKERSNK-CFIYIFSAFVFLSVAVLIFALIVLRLNSPSIRLSSVSIPKFSIT-NANSSSPSLNLTLIAEFTVDNSNFGPFDFDNGTVGL

Query:  MYGSAIIGERSTGAGRAEAKGSMRMNVTVEASAKNISGDSNNL-----GILNLSSFAKLRGRVRLIHIFRRRISSEVSCSMNLDLNTHQI
         Y   +IGE    A R  A+ ++ +N+T+   A  +  ++  L     G++ L++F K+ G+V ++ IF+ ++ S  SC +++ ++   +
Subjt:  MYGSAIIGERSTGAGRAEAKGSMRMNVTVEASAKNISGDSNNL-----GILNLSSFAKLRGRVRLIHIFRRRISSEVSCSMNLDLNTHQI

AT4G23610.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 6.2e-08 | 27.23
Query:  VEDSQSFPLAHYQAHHKSDQ---EQQLATFKTLRKERSNK---CFIYIFSAFVFLSVAVLIFALIVLRLNSPSIRLSSVSI-PKFSITNANSSSPSLNLT
        + + Q+ PLA      +SDQ   E Q    +T       K   C  +I S  + ++V  ++ +L V  L+SP++ + S+S   +F   N   ++ + N T
Subjt:  VEDSQSFPLAHYQAHHKSDQ---EQQLATFKTLRKERSNK---CFIYIFSAFVFLSVAVLIFALIVLRLNSPSIRLSSVSI-PKFSITNANSSSPSLNLT

Query:  LIAEFTVDNSNFGPFDFDNGTVGLMYGS-AIIGERSTGAGRAEAKGSMRMNVTVE-------ASAKNISGDSNNLGILNLSSFAKLRGRVRLIHIFRRRI
        +  E ++ N N   F   N  V   +G   ++GE    +    AK +++MN+T E       AS   +  D N  G+ +L S  ++RGRV+ + IFR+ +
Subjt:  LIAEFTVDNSNFGPFDFDNGTVGLMYGS-AIIGERSTGAGRAEAKGSMRMNVTVE-------ASAKNISGDSNNLGILNLSSFAKLRGRVRLIHIFRRRI

Query:  SSEVSCSMNLDLN
          +  C M +  N
Subjt:  SSEVSCSMNLDLN

AT4G23930.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 9.8e-06 | 27.27
Query:  LRKERSNKCFIYIFSAF-VFLSVAVLIFALIVLRLNSPSIRLSSVSIPKFSITNANSSSPSLNLTLIAEFTVDNSNFGPFDFDNGTVGLMYGSAIIGERS
        + K  SN     + + F VFL +A L   L V R   P I ++SV +P FS+ N+     S++ T      V N N   F   N  + L Y    IG   
Subjt:  LRKERSNKCFIYIFSAF-VFLSVAVLIFALIVLRLNSPSIRLSSVSIPKFSITNANSSSPSLNLTLIAEFTVDNSNFGPFDFDNGTVGLMYGSAIIGERS

Query:  TGAGRAEAKGSMRMNVTV-------------EASAKNISGDSNNLGILNLSSFAKLRGRVRLIHIFRRRISSEVSC
          AG  E+  + RM  T              + SA        +   + + S  ++ GRVR++ +F  RI+++ +C
Subjt:  TGAGRAEAKGSMRMNVTV-------------EASAKNISGDSNNLGILNLSSFAKLRGRVRLIHIFRRRISSEVSC
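
In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, identities and positives for one block could be tallied from that line as follows. The helper name `midline_identity` is hypothetical, and the counts for a single block will not necessarily reproduce the % identity column, which covers the whole alignment.

```python
def midline_identity(query, midline):
    """Count identities and positives in one alignment block.

    query   -- the aligned query segment (gaps included), used only
               for its width, since trailing spaces of the midline
               may have been stripped
    midline -- the match line between the Query and Subjt rows
    Returns (identical, positives, aligned_columns).
    """
    width = len(query)
    mid = midline.ljust(width)[:width]  # pad or trim to the block width
    identical = sum(ch.isalpha() for ch in mid)
    positives = identical + mid.count("+")
    return identical, positives, width


# Final block of the KAE8647845.1 alignment shown above.
print(midline_identity("NTHQIQHNWVCE", "NTHQIQHNW  E"))
```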


Sequences
CDS sequence
ATGGTGGAAGACAGCCAGAGCTTTCCATTAGCGCACTACCAAGCTCACCACAAATCCGACCAAGAACAACAACTCGCCACTTTCAAAACTCTCCGAAAAGAACGATCCAA
CAAATGTTTCATCTACATCTTCTCCGCCTTCGTCTTCCTCAGCGTCGCTGTTCTAATCTTCGCTCTCATCGTCCTCCGCCTCAATTCCCCTTCCATCCGCCTCTCTTCCG
TCTCAATCCCTAAGTTTTCCATTACTAACGCCAATTCCTCTTCTCCTTCGCTTAATCTCACCTTAATCGCCGAATTCACCGTCGATAATTCCAACTTCGGTCCTTTCGAT
TTCGACAACGGCACCGTGGGTCTCATGTATGGCAGCGCCATCATCGGTGAGAGGAGTACCGGCGCTGGAAGAGCCGAGGCCAAGGGGAGTATGAGGATGAATGTTACTGT
GGAAGCTTCGGCGAAGAATATCAGCGGTGATTCGAATAATTTGGGGATTTTGAATCTGAGTAGCTTTGCGAAACTGAGAGGCAGAGTTCGTTTGATTCATATTTTTAGGA
GGAGGATTTCGTCGGAGGTTAGCTGTTCTATGAATCTCGATTTGAATACTCATCAAATTCAGCATAATTGGGTTTGTGAGTAG
mRNA sequence
AAAAGCGTGGGGTGAAAACAAAACAGAGCTTCGCCGGCGGCGCATTCCAAAACTGACCCCTTCGTCGTTTCCCACGAATTTCCCATTCTCCCCCTCACTCACTAACCGCT
TTCTTCCACTTTCCCTATAAATTCTCTCTCTCTCTCTCTCTAAAACCAAAAACCCTCTCCTTCTTCTTCCTCTCTCCTCTGCTACCAACAAAACAGAAAAGAAAATTAAA
GAAAAATGGTGGAAGACAGCCAGAGCTTTCCATTAGCGCACTACCAAGCTCACCACAAATCCGACCAAGAACAACAACTCGCCACTTTCAAAACTCTCCGAAAAGAACGA
TCCAACAAATGTTTCATCTACATCTTCTCCGCCTTCGTCTTCCTCAGCGTCGCTGTTCTAATCTTCGCTCTCATCGTCCTCCGCCTCAATTCCCCTTCCATCCGCCTCTC
TTCCGTCTCAATCCCTAAGTTTTCCATTACTAACGCCAATTCCTCTTCTCCTTCGCTTAATCTCACCTTAATCGCCGAATTCACCGTCGATAATTCCAACTTCGGTCCTT
TCGATTTCGACAACGGCACCGTGGGTCTCATGTATGGCAGCGCCATCATCGGTGAGAGGAGTACCGGCGCTGGAAGAGCCGAGGCCAAGGGGAGTATGAGGATGAATGTT
ACTGTGGAAGCTTCGGCGAAGAATATCAGCGGTGATTCGAATAATTTGGGGATTTTGAATCTGAGTAGCTTTGCGAAACTGAGAGGCAGAGTTCGTTTGATTCATATTTT
TAGGAGGAGGATTTCGTCGGAGGTTAGCTGTTCTATGAATCTCGATTTGAATACTCATCAAATTCAGCATAATTGGGTTTGTGAGTAGTGATGATCTGAATCAGAATCCA
TCAACTCCACACAGCCCTAATTAGTTTATTTTTCTTTTCAAATTCGTTTATAATTATTATTATTTGTTATTCTTTTTTTACAATTTTGGTGGGTTTCTTCTTTTTCCTGC
ACACGATGTAAATATAATATATGTTTTCTTTTTCTTTTCAATATACACTCTTTCTTTATTTTTATATTTTTAATGGATTATTTATAAGGAGTTAGAGTTAGGATAGGCTG
ACAAGGTAAAAATACTCCTCTATTATTTCATATTTGTAATGGCACTTTCTTAATTGTTTTAGACGATTTTTCTTCGTTACGTACAATAAAATTTTGAACTATTTGGCTTA
GTTTACCATTTTAATGATGAATATATTATTTTGTGGACAACTTGTATGGGTGTTGGCTATAAATGTTATATTGAATGATAATGGATATTGGATAACTTTATGGAATATGA
ACGGAAG
Protein sequence
MVEDSQSFPLAHYQAHHKSDQEQQLATFKTLRKERSNKCFIYIFSAFVFLSVAVLIFALIVLRLNSPSIRLSSVSIPKFSITNANSSSPSLNLTLIAEFTVDNSNFGPFD
FDNGTVGLMYGSAIIGERSTGAGRAEAKGSMRMNVTVEASAKNISGDSNNLGILNLSSFAKLRGRVRLIHIFRRRISSEVSCSMNLDLNTHQIQHNWVCE
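
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The sketch below is one way to do that using only the Python standard library; the function name `translate` is hypothetical, and it assumes the CDS starts in frame at its first base. Only the first 30 bases of the CDS are used in the example call.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in TCAG order and the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a CDS (frame 0) up to, but not including, the first stop.

    Whitespace and line breaks are ignored; codons containing
    characters other than A, C, G, T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS listed above.
print(translate("ATGGTGGAAGACAGCCAGAGCTTTCCATTA"))
```

Passing the full CDS with its line breaks intact should work as well, since whitespace is removed before translation; the result can then be compared with the protein sequence listed on the page.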