; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG04G006090 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG04G006090
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionLEA_2 domain-containing protein
Genome locationCG_Chr04:20436099..20436731
RNA-Seq ExpressionClCG04G006090
SyntenyClCG04G006090
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004142665.1 uncharacterized protein LOC101208230 [Cucumis sativus]2.2e-9287.44Show/hide
Query:  MEIASSSS--VKDPKSTQS---ATARSRRRRNTCIGVSIAIVLLLVILIIILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNP
        MEIASSSS  +KDPKSTQS   A  RSR+RRNTCIG+SIAI+LLL+I+IIILAFTVFKAKRPIT +NSVALADLDVSLNLA V+VDINVTLIAD+AITNP
Subjt:  MEIASSSS--VKDPKSTQS---ATARSRRRRNTCIGVSIAIVLLLVILIIILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNP

Query:  NKVGFSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSTVFSDVVAGSMPLNTYTRISGKVRILGIFNIHVVSTTSCDFNVDIS
        NKVGFSY NSTAFLNYRGELVGEAPI AG+IDAG+RKEMNITLTIMADRLLKT+TVF+D VAGSMPLNTYTRISGKV+ILGIFNIHVVS+TSCDFNVDIS
Subjt:  NKVGFSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSTVFSDVVAGSMPLNTYTRISGKVRILGIFNIHVVSTTSCDFNVDIS

Query:  ERKIGDQQCNYHTKI
        ERKIGDQQCNYHTKI
Subjt:  ERKIGDQQCNYHTKI

XP_008463309.1 PREDICTED: uncharacterized protein LOC103501497 [Cucumis melo]1.5e-9388.43Show/hide
Query:  MEIA--SSSSVKDPKSTQS----ATARSRRRRNTCIGVSIAIVLLLVILIIILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITN
        MEIA  SSSS+KDPKSTQS    A ARSR+RRNTCIG+SIAI+LLL+ILIIILAFTVFKAKRPIT +NSVALADLDVSLNLARV+VDINVTLIA +AITN
Subjt:  MEIA--SSSSVKDPKSTQS----ATARSRRRRNTCIGVSIAIVLLLVILIIILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITN

Query:  PNKVGFSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSTVFSDVVAGSMPLNTYTRISGKVRILGIFNIHVVSTTSCDFNVDI
        PNKVGFSY NSTAFLNYRGELVGEAPI AG+IDAG+RKEMNITLTIMADRLLKT+TVFSDVVAGSMPLNTY RISGKV+ILGIFNIHVVSTTSCDFNVDI
Subjt:  PNKVGFSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSTVFSDVVAGSMPLNTYTRISGKVRILGIFNIHVVSTTSCDFNVDI

Query:  SERKIGDQQCNYHTKI
        SERK+GDQQCNYHTKI
Subjt:  SERKIGDQQCNYHTKI

XP_022966458.1 uncharacterized protein LOC111466106 [Cucurbita maxima]2.3e-8180.48Show/hide
Query:  MEIASSSSVKDPKSTQSATARSRRRRNTCIGVSIAIVLLLVILIIILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVGF
        MEIASS++ KDPKS      RSRRRRNTCIGVSIA VLLL++LI+ILAFTVFKAKRPIT INSVALADLD+SLN+AR AV +N+TLI DV+ITNPNKVGF
Subjt:  MEIASSSSVKDPKSTQSATARSRRRRNTCIGVSIAIVLLLVILIIILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVGF

Query:  SYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSTVFSDVVAGSMPLNTYTRISGKVRILGIFNIHVVSTTSCDFNVDISERKIG
        SYSNSTA LNYRGEL+GEAPI +GRI+A Q K MNIT+TIMADRLL++STV SDVVAGSMPLNTYTRISGKVRILGIF I VVS+TSCDF +DIS+RKIG
Subjt:  SYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSTVFSDVVAGSMPLNTYTRISGKVRILGIFNIHVVSTTSCDFNVDISERKIG

Query:  DQQCNYHTKI
        DQQC+YHTKI
Subjt:  DQQCNYHTKI

XP_023518218.1 uncharacterized protein LOC111781758 [Cucurbita pepo subsp. pepo]1.9e-8381.43Show/hide
Query:  MEIASSSSVKDPKSTQSATARSRRRRNTCIGVSIAIVLLLVILIIILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVGF
        MEIASSSS KDPKS      RSRRRRNTCIGVSIA VLLL++LI+ILAFTVFKAKRPIT INSVALADLD+SLN+AR AV +N+TLI DV+ITNPNKVGF
Subjt:  MEIASSSSVKDPKSTQSATARSRRRRNTCIGVSIAIVLLLVILIIILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVGF

Query:  SYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSTVFSDVVAGSMPLNTYTRISGKVRILGIFNIHVVSTTSCDFNVDISERKIG
        SYSNSTA LNYRGEL+GEAPI +GRI+A Q K MNIT+TIMADRLL++STV SDVVAGSMPLNTYTRISGKVRILGIF I VVS+TSCDF +DIS+RKIG
Subjt:  SYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSTVFSDVVAGSMPLNTYTRISGKVRILGIFNIHVVSTTSCDFNVDISERKIG

Query:  DQQCNYHTKI
        DQQC+YHTKI
Subjt:  DQQCNYHTKI

XP_038882665.1 uncharacterized protein LOC120073854 [Benincasa hispida]9.6e-9692.89Show/hide
Query:  MEIASSSSVKDPKSTQS-ATARSRRRRNTCIGVSIAIVLLLVILIIILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVG
        MEIASSS+ KDPKSTQS A ARSRRRRNTCIG+SIA V+LLV+LI+ILAFTVFKAKRPITAINSV LADLDVSLNLARV+VDINVTLIADVAITNPNKVG
Subjt:  MEIASSSSVKDPKSTQS-ATARSRRRRNTCIGVSIAIVLLLVILIIILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVG

Query:  FSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSTVFSDVVAGSMPLNTYTRISGKVRILGIFNIHVVSTTSCDFNVDISERKI
        FSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKT+TVFSDVVAG+MPLNTYTRISGKVRILGIFNIHVVSTTSCDFNV ISERK+
Subjt:  FSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSTVFSDVVAGSMPLNTYTRISGKVRILGIFNIHVVSTTSCDFNVDISERKI

Query:  GDQQCNYHTKI
        GDQQCNYHTKI
Subjt:  GDQQCNYHTKI

TrEMBL top hitse value%identityAlignment
A0A0A0L094 LEA_2 domain-containing protein1.1e-9287.44Show/hide
Query:  MEIASSSS--VKDPKSTQS---ATARSRRRRNTCIGVSIAIVLLLVILIIILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNP
        MEIASSSS  +KDPKSTQS   A  RSR+RRNTCIG+SIAI+LLL+I+IIILAFTVFKAKRPIT +NSVALADLDVSLNLA V+VDINVTLIAD+AITNP
Subjt:  MEIASSSS--VKDPKSTQS---ATARSRRRRNTCIGVSIAIVLLLVILIIILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNP

Query:  NKVGFSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSTVFSDVVAGSMPLNTYTRISGKVRILGIFNIHVVSTTSCDFNVDIS
        NKVGFSY NSTAFLNYRGELVGEAPI AG+IDAG+RKEMNITLTIMADRLLKT+TVF+D VAGSMPLNTYTRISGKV+ILGIFNIHVVS+TSCDFNVDIS
Subjt:  NKVGFSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSTVFSDVVAGSMPLNTYTRISGKVRILGIFNIHVVSTTSCDFNVDIS

Query:  ERKIGDQQCNYHTKI
        ERKIGDQQCNYHTKI
Subjt:  ERKIGDQQCNYHTKI

A0A1S3CJB5 uncharacterized protein LOC1035014977.4e-9488.43Show/hide
Query:  MEIA--SSSSVKDPKSTQS----ATARSRRRRNTCIGVSIAIVLLLVILIIILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITN
        MEIA  SSSS+KDPKSTQS    A ARSR+RRNTCIG+SIAI+LLL+ILIIILAFTVFKAKRPIT +NSVALADLDVSLNLARV+VDINVTLIA +AITN
Subjt:  MEIA--SSSSVKDPKSTQS----ATARSRRRRNTCIGVSIAIVLLLVILIIILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITN

Query:  PNKVGFSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSTVFSDVVAGSMPLNTYTRISGKVRILGIFNIHVVSTTSCDFNVDI
        PNKVGFSY NSTAFLNYRGELVGEAPI AG+IDAG+RKEMNITLTIMADRLLKT+TVFSDVVAGSMPLNTY RISGKV+ILGIFNIHVVSTTSCDFNVDI
Subjt:  PNKVGFSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSTVFSDVVAGSMPLNTYTRISGKVRILGIFNIHVVSTTSCDFNVDI

Query:  SERKIGDQQCNYHTKI
        SERK+GDQQCNYHTKI
Subjt:  SERKIGDQQCNYHTKI

A0A6J1DQ35 uncharacterized protein LOC1110231751.5e-8182.94Show/hide
Query:  MEIASSSSVKDPKSTQSATARSRRRRNTCIGVSIAIVLLLVILIIILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVGF
        MEIASS++ KD KST +A ARSRRRRN CIG S+  +LLLVILI+ILAFTVFKA+RPITAINSVALADL VSL++ARVAVDINVTLIA VA+TNPNKVGF
Subjt:  MEIASSSSVKDPKSTQSATARSRRRRNTCIGVSIAIVLLLVILIIILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVGF

Query:  SYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLL-KTSTVFSDVVAGSMPLNTYTRISGKVRILGIFNIHVVSTTSCDFNVDISERKI
        SYSNSTA LNYRGELVGEAPI AGRIDA Q K+MNITLTIMADRLL K++ VFSDVVAGSMPLNTYTRISG+V+ILGIF IHVVSTTSCD  +DIS RKI
Subjt:  SYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLL-KTSTVFSDVVAGSMPLNTYTRISGKVRILGIFNIHVVSTTSCDFNVDISERKI

Query:  GDQQCNYHTKI
        GDQQCNYHTKI
Subjt:  GDQQCNYHTKI

A0A6J1EAI5 uncharacterized protein LOC1114323071.5e-8180.48Show/hide
Query:  MEIASSSSVKDPKSTQSATARSRRRRNTCIGVSIAIVLLLVILIIILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVGF
        MEIASS+  KDPKS      RSRRRRNTCIGVSIA VLLL++LI+ILAFTVFKAKRPITAINSVALADLD+SLN+AR AV +N+TLI DV+ITNPNKVGF
Subjt:  MEIASSSSVKDPKSTQSATARSRRRRNTCIGVSIAIVLLLVILIIILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVGF

Query:  SYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSTVFSDVVAGSMPLNTYTRISGKVRILGIFNIHVVSTTSCDFNVDISERKIG
        SYSNSTA LNYRGEL+GEAPI +GRI+A Q K MNIT+TIMADRLL++STV SDVVAGS+PLNTYTRISGKVRILGIF I VVS+TSCDF +DIS+RKIG
Subjt:  SYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSTVFSDVVAGSMPLNTYTRISGKVRILGIFNIHVVSTTSCDFNVDISERKIG

Query:  DQQCNYHTKI
        DQQC+YHTKI
Subjt:  DQQCNYHTKI

A0A6J1HS80 uncharacterized protein LOC1114661061.1e-8180.48Show/hide
Query:  MEIASSSSVKDPKSTQSATARSRRRRNTCIGVSIAIVLLLVILIIILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVGF
        MEIASS++ KDPKS      RSRRRRNTCIGVSIA VLLL++LI+ILAFTVFKAKRPIT INSVALADLD+SLN+AR AV +N+TLI DV+ITNPNKVGF
Subjt:  MEIASSSSVKDPKSTQSATARSRRRRNTCIGVSIAIVLLLVILIIILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVGF

Query:  SYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSTVFSDVVAGSMPLNTYTRISGKVRILGIFNIHVVSTTSCDFNVDISERKIG
        SYSNSTA LNYRGEL+GEAPI +GRI+A Q K MNIT+TIMADRLL++STV SDVVAGSMPLNTYTRISGKVRILGIF I VVS+TSCDF +DIS+RKIG
Subjt:  SYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSTVFSDVVAGSMPLNTYTRISGKVRILGIFNIHVVSTTSCDFNVDISERKIG

Query:  DQQCNYHTKI
        DQQC+YHTKI
Subjt:  DQQCNYHTKI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G64450.1 Glycine-rich protein family1.5e-0935.33Show/hide
Query:  ATARSRRR---RNTCIGVSIAIVLLLVILIIILA--FTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVGFSYSNSTAFLNYR
        A    RRR   R      ++A V LL++L+++L   FTVFK K P  ++N+V L    VS N A      N +    VA+ NPN+  FS+ +S+  L Y 
Subjt:  ATARSRRR---RNTCIGVSIAIVLLLVILIIILA--FTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVGFSYSNSTAFLNYR

Query:  GELVGEAPITAGRIDAGQRKEMNITLTIMADRLL-KTSTVFSDVVAGSMP
        G  VG   I AG+ID+G+ + M  T T+ +  +   +S+  S V A  +P
Subjt:  GELVGEAPITAGRIDAGQRKEMNITLTIMADRLL-KTSTVFSDVVAGSMP

AT2G01080.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family2.2e-0522.17Show/hide
Query:  IASSSSVKDPKSTQSATARSRRRRNTCIGVSIAIVLLLVILIIILAFTVFKAKRPITAINSVALADLDV---SLNLARVAVDINVTLIADVAITNPNKVG
        IA+ +     +S  S+++ S +    C+ +  A + LLV+ ++++     K K+P   +  VA+  + +   S  L      +++T+       NPNKVG
Subjt:  IASSSSVKDPKSTQSATARSRRRRNTCIGVSIAIVLLLVILIIILAFTVFKAKRPITAINSVALADLDV---SLNLARVAVDINVTLIADVAITNPNKVG

Query:  FSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSTVFSDVVAGS-----MPLNTYTRISGKVRILGIFNIHVVSTTSCDFNVDI
          Y  S+  + Y+G  +G A +     DA   K  N+  TI  DR+       +D+V  +     + L     +  K+R++   +  V  + +C   +  
Subjt:  FSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSTVFSDVVAGS-----MPLNTYTRISGKVRILGIFNIHVVSTTSCDFNVDI

Query:  SERKIGDQQCNY
         ++ +  +QC +
Subjt:  SERKIGDQQCNY

AT2G46150.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.2e-2236.5Show/hide
Query:  PKSTQSA-----TARSRRRRNTCIGVSIAIVLLLVILIIILAFTVFKAKRPITAINSVALADLDVSLNLARV-AVDINVTLIADVAITNPNKVGFSYSNS
        P S +SA     T RSR R    I V+ A  L+L  +++ L FTVF+ K PI  +N V +  LD      +V  +  N+++I DV++ NPN   F YSN+
Subjt:  PKSTQSA-----TARSRRRRNTCIGVSIAIVLLLVILIIILAFTVFKAKRPITAINSVALADLDVSLNLARV-AVDINVTLIADVAITNPNKVGFSYSNS

Query:  TAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSTVFSDVV-AGSMPLNTYTRISGKVRILGIFNIHVVSTTSCDFNVDISERKIGDQQC
        T  + Y+G LVGEA    G+    +   MN+T+ IM DR+L    +  ++  +G + + +YTR+ GKV+I+GI   HV    +C   V+I+ + I D  C
Subjt:  TAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSTVFSDVV-AGSMPLNTYTRISGKVRILGIFNIHVVSTTSCDFNVDISERKIGDQQC

AT3G05975.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family8.3e-1326.6Show/hide
Query:  RRRNTCIGVSIAIVLLLVILIIILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVGFSYSNSTAFLNYRGELVGEAPITA
        +RR  CI   I  VL ++ +  ++   VFK K PI    S  +  +  +++L    V +N TL  ++ + NPN   F Y      + YR  LVG   + +
Subjt:  RRRNTCIGVSIAIVLLLVILIIILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVGFSYSNSTAFLNYRGELVGEAPITA

Query:  GRIDAGQRKEMNITLTIMADRLL-KTSTVFSDVVAGSMPLNTYTRISGKVRILGIFNIHVVSTTSCDFNVDISERKIGDQQCNYHTKI
          + A     +   L +  D+ +     +  DV+ G + + T  ++ GK+ +LGIF I + S + C+  +      + DQ C+  TK+
Subjt:  GRIDAGQRKEMNITLTIMADRLL-KTSTVFSDVVAGSMPLNTYTRISGKVRILGIFNIHVVSTTSCDFNVDISERKIGDQQCNYHTKI

AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.5e-4646.15Show/hide
Query:  SSSSVKDPKSTQSATARSRRRRN--TCIGVSIAIVLLLVILIIILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVGFSY
        ++SS++   +      + RR+RN   CI  +I ++LL+ I+I+ILAFT+FK KRP T I+SV +  L  S+N   + V +N+TL  D+++ NPN++GFSY
Subjt:  SSSSVKDPKSTQSATARSRRRRN--TCIGVSIAIVLLLVILIIILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVGFSY

Query:  SNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSTVFSDVVAGSMPLNTYTRISGKVRILGIFNIHVVSTTSCDFNVDISERKIGDQ
         +S+A LNYRG+++GEAP+ A RI A +   +NITLT+MADRLL  + + SDV+AG +PLNT+ +++GKV +L IF I V S++SCD ++ +S+R +  Q
Subjt:  SNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSTVFSDVVAGSMPLNTYTRISGKVRILGIFNIHVVSTTSCDFNVDISERKIGDQ

Query:  QCNYHTKI
         C Y TK+
Subjt:  QCNYHTKI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAATCGCTTCCTCCTCCTCAGTCAAAGATCCCAAATCCACTCAATCCGCCACCGCCCGCTCCCGGAGACGCCGCAACACCTGCATCGGAGTCTCCATCGCCATCGT
CCTCCTCCTCGTAATTCTAATAATCATTTTAGCCTTCACAGTATTCAAAGCCAAACGCCCTATCACCGCTATCAATTCCGTTGCCCTAGCCGACCTCGATGTGTCGCTAA
ACCTAGCCAGAGTCGCTGTCGACATCAACGTCACTCTAATTGCCGACGTCGCAATCACGAACCCTAACAAGGTCGGATTCAGCTACTCGAATAGCACCGCGTTTCTGAAT
TACAGAGGGGAATTGGTCGGAGAGGCGCCAATTACGGCTGGGCGGATCGATGCGGGACAGAGGAAGGAGATGAATATCACGCTCACGATTATGGCGGATCGGCTACTGAA
GACGTCGACGGTGTTTTCCGACGTGGTGGCGGGATCGATGCCGTTGAATACGTATACGAGAATTTCAGGCAAGGTGAGGATTTTGGGGATTTTCAATATTCATGTGGTTT
CAACTACTTCGTGTGATTTCAATGTCGATATATCGGAGAGGAAAATTGGAGATCAACAGTGTAATTATCATACTAAGATCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAATCGCTTCCTCCTCCTCAGTCAAAGATCCCAAATCCACTCAATCCGCCACCGCCCGCTCCCGGAGACGCCGCAACACCTGCATCGGAGTCTCCATCGCCATCGT
CCTCCTCCTCGTAATTCTAATAATCATTTTAGCCTTCACAGTATTCAAAGCCAAACGCCCTATCACCGCTATCAATTCCGTTGCCCTAGCCGACCTCGATGTGTCGCTAA
ACCTAGCCAGAGTCGCTGTCGACATCAACGTCACTCTAATTGCCGACGTCGCAATCACGAACCCTAACAAGGTCGGATTCAGCTACTCGAATAGCACCGCGTTTCTGAAT
TACAGAGGGGAATTGGTCGGAGAGGCGCCAATTACGGCTGGGCGGATCGATGCGGGACAGAGGAAGGAGATGAATATCACGCTCACGATTATGGCGGATCGGCTACTGAA
GACGTCGACGGTGTTTTCCGACGTGGTGGCGGGATCGATGCCGTTGAATACGTATACGAGAATTTCAGGCAAGGTGAGGATTTTGGGGATTTTCAATATTCATGTGGTTT
CAACTACTTCGTGTGATTTCAATGTCGATATATCGGAGAGGAAAATTGGAGATCAACAGTGTAATTATCATACTAAGATCTGA
Protein sequenceShow/hide protein sequence
MEIASSSSVKDPKSTQSATARSRRRRNTCIGVSIAIVLLLVILIIILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVGFSYSNSTAFLN
YRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSTVFSDVVAGSMPLNTYTRISGKVRILGIFNIHVVSTTSCDFNVDISERKIGDQQCNYHTKI