CuGenDBv2

Lsi01G016430 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi01G016430
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: LEA_2 domain-containing protein
Genome location: chr01:14888268..14889743
RNA-Seq Expression: Lsi01G016430
Synteny: Lsi01G016430
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup
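
The genome location field follows a chromosome:start..end pattern. A minimal sketch of how such a string could be parsed and its span computed is shown below. The function names are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re

# Hypothetical helper: split a "chrom:start..end" string into its parts.
def parse_location(location):
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location format: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
def span_length(location):
    _, start, end = parse_location(location)
    return end - start + 1

if __name__ == "__main__":
    loc = "chr01:14888268..14889743"
    print(parse_location(loc))
    print(span_length(loc))
```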


Homology
GenBank top hits | e value | % identity
XP_004142665.1 uncharacterized protein LOC101208230 [Cucumis sativus] | 1.9e-92 | 87.91
Query:  MEIASSSS--IKDPKSTQSTAAA--RSRRRRNTCIGVSIATVVLLVIIIVILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNP
        MEIASSSS  IKDPKSTQSTAAA  RSR+RRNTCIG+SIA ++LL+III+ILAFTVFKAKRPIT +NSVALADLDVSLNLA V+VDINVTLIAD+AITNP
Subjt:  MEIASSSS--IKDPKSTQSTAAA--RSRRRRNTCIGVSIATVVLLVIIIVILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNP

Query:  NKVGFSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSKVFSDVVAGSMPLNTYTRISGKVKILGIFNIHVVSTTSCDFNVEIS
        NKVGFSY NSTAFLNYRGELVGEAPI AG+IDAG+RKEMNITLTIMADRLLKT+ VF+D VAGSMPLNTYTRISGKVKILGIFNIHVVS+TSCDFNV+IS
Subjt:  NKVGFSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSKVFSDVVAGSMPLNTYTRISGKVKILGIFNIHVVSTTSCDFNVEIS

Query:  ERKIGDQQCNYHTKI
        ERKIGDQQCNYHTKI
Subjt:  ERKIGDQQCNYHTKI

XP_008463309.1 PREDICTED: uncharacterized protein LOC103501497 [Cucumis melo] | 4.2e-92 | 87.04
Query:  MEIA--SSSSIKDPKSTQS---TAAARSRRRRNTCIGVSIATVVLLVIIIVILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITN
        MEIA  SSSSIKDPKSTQS    AAARSR+RRNTCIG+SIA ++LL+I+I+ILAFTVFKAKRPIT +NSVALADLDVSLNLARV+VDINVTLIA +AITN
Subjt:  MEIA--SSSSIKDPKSTQS---TAAARSRRRRNTCIGVSIATVVLLVIIIVILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITN

Query:  PNKVGFSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSKVFSDVVAGSMPLNTYTRISGKVKILGIFNIHVVSTTSCDFNVEI
        PNKVGFSY NSTAFLNYRGELVGEAPI AG+IDAG+RKEMNITLTIMADRLLKT+ VFSDVVAGSMPLNTY RISGKVKILGIFNIHVVSTTSCDFNV+I
Subjt:  PNKVGFSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSKVFSDVVAGSMPLNTYTRISGKVKILGIFNIHVVSTTSCDFNVEI

Query:  SERKIGDQQCNYHTKI
        SERK+GDQQCNYHTKI
Subjt:  SERKIGDQQCNYHTKI

XP_022156243.1 uncharacterized protein LOC111023175 [Momordica charantia] | 3.3e-81 | 82.55
Query:  MEIASSSSIKDPKSTQSTAAARSRRRRNTCIGVSIATVVLLVIIIVILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVG
        MEIASS++ KD KST  TAAARSRRRRN CIG S+  ++LLVI+I+ILAFTVFKA+RPITAINSVALADL VSL++ARVAVDINVTLIA VA+TNPNKVG
Subjt:  MEIASSSSIKDPKSTQSTAAARSRRRRNTCIGVSIATVVLLVIIIVILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVG

Query:  FSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLL-KTSKVFSDVVAGSMPLNTYTRISGKVKILGIFNIHVVSTTSCDFNVEISERK
        FSYSNSTA LNYRGELVGEAPI AGRIDA Q K+MNITLTIMADRLL K++ VFSDVVAGSMPLNTYTRISG+VKILGIF IHVVSTTSCD  ++IS RK
Subjt:  FSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLL-KTSKVFSDVVAGSMPLNTYTRISGKVKILGIFNIHVVSTTSCDFNVEISERK

Query:  IGDQQCNYHTKI
        IGDQQCNYHTKI
Subjt:  IGDQQCNYHTKI

XP_023518218.1 uncharacterized protein LOC111781758 [Cucurbita pepo subsp. pepo] | 6.1e-83 | 75.76
Query:  IPNFNASIKIPILSQNPNTAMEIASSSSIKDPKSTQSTAAARSRRRRNTCIGVSIATVVLLVIIIVILAFTVFKAKRPITAINSVALADLDVSLNLARVA
        +P   +S KI    Q P +AMEIASSSS KDPKS       RSRRRRNTCIGVSIATV+LL+++IVILAFTVFKAKRPIT INSVALADLD+SLN+AR A
Subjt:  IPNFNASIKIPILSQNPNTAMEIASSSSIKDPKSTQSTAAARSRRRRNTCIGVSIATVVLLVIIIVILAFTVFKAKRPITAINSVALADLDVSLNLARVA

Query:  VDINVTLIADVAITNPNKVGFSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSKVFSDVVAGSMPLNTYTRISGKVKILGIFN
        V +N+TLI DV+ITNPNKVGFSYSNSTA LNYRGEL+GEAPI +GRI+A Q K MNIT+TIMADRLL++S V SDVVAGSMPLNTYTRISGKV+ILGIF 
Subjt:  VDINVTLIADVAITNPNKVGFSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSKVFSDVVAGSMPLNTYTRISGKVKILGIFN

Query:  IHVVSTTSCDFNVEISERKIGDQQCNYHTKI
        I VVS+TSCDF ++IS+RKIGDQQC+YHTKI
Subjt:  IHVVSTTSCDFNVEISERKIGDQQCNYHTKI

XP_038882665.1 uncharacterized protein LOC120073854 [Benincasa hispida] | 2.5e-97 | 93.36
Query:  MEIASSSSIKDPKSTQSTAAARSRRRRNTCIGVSIATVVLLVIIIVILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVG
        MEIASSS+ KDPKSTQS AAARSRRRRNTCIG+SIATVVLLV++IVILAFTVFKAKRPITAINSV LADLDVSLNLARV+VDINVTLIADVAITNPNKVG
Subjt:  MEIASSSSIKDPKSTQSTAAARSRRRRNTCIGVSIATVVLLVIIIVILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVG

Query:  FSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSKVFSDVVAGSMPLNTYTRISGKVKILGIFNIHVVSTTSCDFNVEISERKI
        FSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKT+ VFSDVVAG+MPLNTYTRISGKV+ILGIFNIHVVSTTSCDFNV ISERK+
Subjt:  FSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSKVFSDVVAGSMPLNTYTRISGKVKILGIFNIHVVSTTSCDFNVEISERKI

Query:  GDQQCNYHTKI
        GDQQCNYHTKI
Subjt:  GDQQCNYHTKI

TrEMBL top hits | e value | % identity
A0A0A0L094 LEA_2 domain-containing protein | 9.2e-93 | 87.91
Query:  MEIASSSS--IKDPKSTQSTAAA--RSRRRRNTCIGVSIATVVLLVIIIVILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNP
        MEIASSSS  IKDPKSTQSTAAA  RSR+RRNTCIG+SIA ++LL+III+ILAFTVFKAKRPIT +NSVALADLDVSLNLA V+VDINVTLIAD+AITNP
Subjt:  MEIASSSS--IKDPKSTQSTAAA--RSRRRRNTCIGVSIATVVLLVIIIVILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNP

Query:  NKVGFSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSKVFSDVVAGSMPLNTYTRISGKVKILGIFNIHVVSTTSCDFNVEIS
        NKVGFSY NSTAFLNYRGELVGEAPI AG+IDAG+RKEMNITLTIMADRLLKT+ VF+D VAGSMPLNTYTRISGKVKILGIFNIHVVS+TSCDFNV+IS
Subjt:  NKVGFSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSKVFSDVVAGSMPLNTYTRISGKVKILGIFNIHVVSTTSCDFNVEIS

Query:  ERKIGDQQCNYHTKI
        ERKIGDQQCNYHTKI
Subjt:  ERKIGDQQCNYHTKI

A0A1S3CJB5 uncharacterized protein LOC103501497 | 2.0e-92 | 87.04
Query:  MEIA--SSSSIKDPKSTQS---TAAARSRRRRNTCIGVSIATVVLLVIIIVILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITN
        MEIA  SSSSIKDPKSTQS    AAARSR+RRNTCIG+SIA ++LL+I+I+ILAFTVFKAKRPIT +NSVALADLDVSLNLARV+VDINVTLIA +AITN
Subjt:  MEIA--SSSSIKDPKSTQS---TAAARSRRRRNTCIGVSIATVVLLVIIIVILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITN

Query:  PNKVGFSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSKVFSDVVAGSMPLNTYTRISGKVKILGIFNIHVVSTTSCDFNVEI
        PNKVGFSY NSTAFLNYRGELVGEAPI AG+IDAG+RKEMNITLTIMADRLLKT+ VFSDVVAGSMPLNTY RISGKVKILGIFNIHVVSTTSCDFNV+I
Subjt:  PNKVGFSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSKVFSDVVAGSMPLNTYTRISGKVKILGIFNIHVVSTTSCDFNVEI

Query:  SERKIGDQQCNYHTKI
        SERK+GDQQCNYHTKI
Subjt:  SERKIGDQQCNYHTKI

A0A6J1DQ35 uncharacterized protein LOC111023175 | 1.6e-81 | 82.55
Query:  MEIASSSSIKDPKSTQSTAAARSRRRRNTCIGVSIATVVLLVIIIVILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVG
        MEIASS++ KD KST  TAAARSRRRRN CIG S+  ++LLVI+I+ILAFTVFKA+RPITAINSVALADL VSL++ARVAVDINVTLIA VA+TNPNKVG
Subjt:  MEIASSSSIKDPKSTQSTAAARSRRRRNTCIGVSIATVVLLVIIIVILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVG

Query:  FSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLL-KTSKVFSDVVAGSMPLNTYTRISGKVKILGIFNIHVVSTTSCDFNVEISERK
        FSYSNSTA LNYRGELVGEAPI AGRIDA Q K+MNITLTIMADRLL K++ VFSDVVAGSMPLNTYTRISG+VKILGIF IHVVSTTSCD  ++IS RK
Subjt:  FSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLL-KTSKVFSDVVAGSMPLNTYTRISGKVKILGIFNIHVVSTTSCDFNVEISERK

Query:  IGDQQCNYHTKI
        IGDQQCNYHTKI
Subjt:  IGDQQCNYHTKI

A0A6J1EAI5 uncharacterized protein LOC111432307 | 3.6e-81 | 74.46
Query:  IPNFNASIKIPILSQNPNTAMEIASSSSIKDPKSTQSTAAARSRRRRNTCIGVSIATVVLLVIIIVILAFTVFKAKRPITAINSVALADLDVSLNLARVA
        +P   +S+KI   S     AMEIASS+  KDPKS       RSRRRRNTCIGVSIATV+LL+++IVILAFTVFKAKRPITAINSVALADLD+SLN+AR A
Subjt:  IPNFNASIKIPILSQNPNTAMEIASSSSIKDPKSTQSTAAARSRRRRNTCIGVSIATVVLLVIIIVILAFTVFKAKRPITAINSVALADLDVSLNLARVA

Query:  VDINVTLIADVAITNPNKVGFSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSKVFSDVVAGSMPLNTYTRISGKVKILGIFN
        V +N+TLI DV+ITNPNKVGFSYSNSTA LNYRGEL+GEAPI +GRI+A Q K MNIT+TIMADRLL++S V SDVVAGS+PLNTYTRISGKV+ILGIF 
Subjt:  VDINVTLIADVAITNPNKVGFSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSKVFSDVVAGSMPLNTYTRISGKVKILGIFN

Query:  IHVVSTTSCDFNVEISERKIGDQQCNYHTKI
        I VVS+TSCDF ++IS+RKIGDQQC+YHTKI
Subjt:  IHVVSTTSCDFNVEISERKIGDQQCNYHTKI

A0A6J1HS80 uncharacterized protein LOC111466106 | 2.3e-80 | 78.67
Query:  MEIASSSSIKDPKSTQSTAAARSRRRRNTCIGVSIATVVLLVIIIVILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVG
        MEIASS++ KDPKS       RSRRRRNTCIGVSIATV+LL+++IVILAFTVFKAKRPIT INSVALADLD+SLN+AR AV +N+TLI DV+ITNPNKVG
Subjt:  MEIASSSSIKDPKSTQSTAAARSRRRRNTCIGVSIATVVLLVIIIVILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVG

Query:  FSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSKVFSDVVAGSMPLNTYTRISGKVKILGIFNIHVVSTTSCDFNVEISERKI
        FSYSNSTA LNYRGEL+GEAPI +GRI+A Q K MNIT+TIMADRLL++S V SDVVAGSMPLNTYTRISGKV+ILGIF I VVS+TSCDF ++IS+RKI
Subjt:  FSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSKVFSDVVAGSMPLNTYTRISGKVKILGIFNIHVVSTTSCDFNVEISERKI

Query:  GDQQCNYHTKI
        GDQQC+YHTKI
Subjt:  GDQQCNYHTKI

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT1G64450.1 Glycine-rich protein family | 7.3e-10 | 36
Query:  AAARSRRR---RNTCIGVSIATVVLLVIIIVILA--FTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVGFSYSNSTAFLNYR
        A    RRR   R      ++ATV LL++++V+L   FTVFK K P  ++N+V L    VS N A      N +    VA+ NPN+  FS+ +S+  L Y 
Subjt:  AAARSRRR---RNTCIGVSIATVVLLVIIIVILA--FTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVGFSYSNSTAFLNYR

Query:  GELVGEAPITAGRIDAGQRKEMNITLTIMADRLL-KTSKVFSDVVAGSMP
        G  VG   I AG+ID+G+ + M  T T+ +  +   +S   S V A  +P
Subjt:  GELVGEAPITAGRIDAGQRKEMNITLTIMADRLL-KTSKVFSDVVAGSMP

AT2G01080.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 5.4e-05 | 22.11
Query:  STAAARSRRRRNTCIGVSIATVVLLVIIIVILAFTVFKAKRPITAINSVALADLDV---SLNLARVAVDINVTLIADVAITNPNKVGFSYSNSTAFLNYR
        S++++ S +    C+ +  A + LLV+ +V++     K K+P   +  VA+  + +   S  L      +++T+       NPNKVG  Y  S+  + Y+
Subjt:  STAAARSRRRRNTCIGVSIATVVLLVIIIVILAFTVFKAKRPITAINSVALADLDV---SLNLARVAVDINVTLIADVAITNPNKVGFSYSNSTAFLNYR

Query:  GELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSKVFSDVVAGS-----MPLNTYTRISGKVKILGIFNIHVVSTTSCDFNVEISERKIGDQQCNY
        G  +G A +     DA   K  N+  TI  DR+       +D+V  +     + L     +  K++++   +  V  + +C   +   ++ +  +QC +
Subjt:  GELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSKVFSDVVAGS-----MPLNTYTRISGKVKILGIFNIHVVSTTSCDFNVEISERKIGDQQCNY

AT2G46150.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 1.7e-22 | 33.65
Query:  PNTAMEIASSSSIKDPKSTQSTAAARSRRRRNTCIGVSIATVVLLVIIIVILAFTVFKAKRPITAINSVALADLDVSLNLARV-AVDINVTLIADVAITN
        P T + ++  S+     + ++T  +R+R + + C+    AT ++L  I++ L FTVF+ K PI  +N V +  LD      +V  +  N+++I DV++ N
Subjt:  PNTAMEIASSSSIKDPKSTQSTAAARSRRRRNTCIGVSIATVVLLVIIIVILAFTVFKAKRPITAINSVALADLDVSLNLARV-AVDINVTLIADVAITN

Query:  PNKVGFSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSKVFSDVV-AGSMPLNTYTRISGKVKILGIFNIHVVSTTSCDFNVE
        PN   F YSN+T  + Y+G LVGEA    G+    +   MN+T+ IM DR+L    +  ++  +G + + +YTR+ GKVKI+GI   HV    +C   V 
Subjt:  PNKVGFSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSKVFSDVV-AGSMPLNTYTRISGKVKILGIFNIHVVSTTSCDFNVE

Query:  ISERKIGDQQC
        I+ + I D  C
Subjt:  ISERKIGDQQC

AT3G05975.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 3.5e-12 | 26.06
Query:  RRRNTCIGVSIATVVLLVIIIVILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVGFSYSNSTAFLNYRGELVGEAPITA
        +RR  CI   I  V+ ++ +  ++   VFK K PI    S  +  +  +++L    V +N TL  ++ + NPN   F Y      + YR  LVG   + +
Subjt:  RRRNTCIGVSIATVVLLVIIIVILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIADVAITNPNKVGFSYSNSTAFLNYRGELVGEAPITA

Query:  GRIDAGQRKEMNITLTIMADRLL-KTSKVFSDVVAGSMPLNTYTRISGKVKILGIFNIHVVSTTSCDFNVEISERKIGDQQCNYHTKI
          + A     +   L +  D+ +     +  DV+ G + + T  ++ GK+ +LGIF I + S + C+  +      + DQ C+  TK+
Subjt:  GRIDAGQRKEMNITLTIMADRLL-KTSKVFSDVVAGSMPLNTYTRISGKVKILGIFNIHVVSTTSCDFNVEISERKIGDQQCNYHTKI

AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 1.3e-46 | 45.7
Query:  QNPNTAM--EIASSSSIKDPKSTQSTAAARSRRRRN--TCIGVSIATVVLLVIIIVILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIAD
        + P TAM      ++S  + +S  +  A + RR+RN   CI  +I  ++L+ I+IVILAFT+FK KRP T I+SV +  L  S+N   + V +N+TL  D
Subjt:  QNPNTAM--EIASSSSIKDPKSTQSTAAARSRRRRN--TCIGVSIATVVLLVIIIVILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTLIAD

Query:  VAITNPNKVGFSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSKVFSDVVAGSMPLNTYTRISGKVKILGIFNIHVVSTTSCD
        +++ NPN++GFSY +S+A LNYRG+++GEAP+ A RI A +   +NITLT+MADRLL  +++ SDV+AG +PLNT+ +++GKV +L IF I V S++SCD
Subjt:  VAITNPNKVGFSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSKVFSDVVAGSMPLNTYTRISGKVKILGIFNIHVVSTTSCD

Query:  FNVEISERKIGDQQCNYHTKI
         ++ +S+R +  Q C Y TK+
Subjt:  FNVEISERKIGDQQCNYHTKI
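
In the alignments above, the middle line of each block marks identical positions with the residue letter, conservative substitutions with `+`, and mismatches or gaps with a space. A minimal sketch of how identities and positives could be tallied from a query line and its midline follows. The function name is hypothetical, and the tally covers only the block passed in, so it will not necessarily match the whole-alignment % identity values listed for each hit.

```python
# Hypothetical helper: count identities and positives in one alignment block.
# query and midline are the aligned strings with their line labels stripped.
def tally_midline(query, midline):
    # Pad the midline in case trailing spaces were trimmed during extraction.
    midline = midline.ljust(len(query))
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    length = len(query)
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "pct_identity": 100.0 * identities / length if length else 0.0,
    }

if __name__ == "__main__":
    # Last block of the Cucumis melo GenBank hit shown above.
    block = tally_midline("SERKIGDQQCNYHTKI", "SERK+GDQQCNYHTKI")
    print(block)
```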


Sequences
CDS sequence
ATGAGAGTAATCCCAAATTTCAATGCCTCCATAAAAATCCCAATTCTCTCTCAAAATCCCAACACCGCCATGGAAATCGCTTCCTCTTCCTCAATCAAGGATCCCAAATC
CACTCAATCCACCGCCGCCGCCCGCTCCCGGAGACGCCGCAACACCTGCATCGGAGTCTCCATCGCCACCGTCGTCCTTCTCGTAATCATAATCGTCATTTTAGCCTTCA
CAGTATTCAAAGCTAAACGCCCTATCACCGCCATCAATTCCGTTGCCCTAGCCGACCTCGATGTATCGCTAAACCTAGCTAGAGTCGCCGTCGACATCAACGTCACTCTA
ATTGCCGACGTCGCAATCACGAACCCCAACAAGGTCGGATTCAGCTACTCGAACAGTACCGCGTTTCTGAATTACAGAGGGGAATTGGTCGGAGAAGCGCCGATTACGGC
TGGGCGGATCGATGCGGGACAGAGGAAGGAGATGAATATCACGCTGACAATTATGGCGGATCGGCTACTGAAGACGTCGAAGGTGTTCTCCGACGTGGTGGCCGGATCGA
TGCCGTTGAATACGTATACGAGAATTTCAGGTAAGGTGAAGATTTTGGGGATTTTCAATATTCATGTGGTTTCAACTACGTCCTGTGATTTCAATGTCGAGATATCGGAG
AGGAAAATTGGAGATCAACAGTGTAATTATCATACTAAGATCTGA
mRNA sequence
ATGAGAGTAATCCCAAATTTCAATGCCTCCATAAAAATCCCAATTCTCTCTCAAAATCCCAACACCGCCATGGAAATCGCTTCCTCTTCCTCAATCAAGGATCCCAAATC
CACTCAATCCACCGCCGCCGCCCGCTCCCGGAGACGCCGCAACACCTGCATCGGAGTCTCCATCGCCACCGTCGTCCTTCTCGTAATCATAATCGTCATTTTAGCCTTCA
CAGTATTCAAAGCTAAACGCCCTATCACCGCCATCAATTCCGTTGCCCTAGCCGACCTCGATGTATCGCTAAACCTAGCTAGAGTCGCCGTCGACATCAACGTCACTCTA
ATTGCCGACGTCGCAATCACGAACCCCAACAAGGTCGGATTCAGCTACTCGAACAGTACCGCGTTTCTGAATTACAGAGGGGAATTGGTCGGAGAAGCGCCGATTACGGC
TGGGCGGATCGATGCGGGACAGAGGAAGGAGATGAATATCACGCTGACAATTATGGCGGATCGGCTACTGAAGACGTCGAAGGTGTTCTCCGACGTGGTGGCCGGATCGA
TGCCGTTGAATACGTATACGAGAATTTCAGGTAAGGTGAAGATTTTGGGGATTTTCAATATTCATGTGGTTTCAACTACGTCCTGTGATTTCAATGTCGAGATATCGGAG
AGGAAAATTGGAGATCAACAGTGTAATTATCATACTAAGATCTGA
Protein sequence
MRVIPNFNASIKIPILSQNPNTAMEIASSSSIKDPKSTQSTAAARSRRRRNTCIGVSIATVVLLVIIIVILAFTVFKAKRPITAINSVALADLDVSLNLARVAVDINVTL
IADVAITNPNKVGFSYSNSTAFLNYRGELVGEAPITAGRIDAGQRKEMNITLTIMADRLLKTSKVFSDVVAGSMPLNTYTRISGKVKILGIFNIHVVSTTSCDFNVEISE
RKIGDQQCNYHTKI
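
The protein sequence is consistent with a translation of the CDS under the standard genetic code. A minimal sketch of that translation is given below. The function name is hypothetical, and the sketch assumes the CDS starts in frame and that the line breaks in the listing have been removed before the sequence is passed in.

```python
# Standard genetic code, built in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)

# Hypothetical helper: translate an in-frame CDS, stopping at the first stop codon.
def translate(cds):
    cds = "".join(cds.split()).upper()  # drop whitespace and line breaks
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

if __name__ == "__main__":
    # First 30 bases of the CDS listed above.
    print(translate("ATGAGAGTAATCCCAAATTTCAATGCCTCC"))
```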