CuGenDBv2

Tan0002045 (gene) of Snake gourd v1 genome

Gene ID: Tan0002045
Organism: Trichosanthes anguina (Snake gourd v1)
Description: LEA_2 domain-containing protein
Genome location: LG01:10543061..10544446
RNA-Seq Expression: Tan0002045
Synteny: Tan0002045
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup
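The genome location field follows a `scaffold:start..end` pattern. A minimal Python sketch for splitting it into parts is shown here; the names `Region` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings such as "LG01:10543061..10544446".
_LOCATION = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Region:
    """Split 'seqid:start..end' into its three parts."""
    match = _LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Region(match["seqid"], int(match["start"]), int(match["end"]))


region = parse_location("LG01:10543061..10544446")
print(region.seqid, region.start, region.end, region.length)
```

Under the inclusive assumption, the span covers 1386 positions on LG01.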


Homology
GenBank top hits | e value | % identity | Alignment
XP_004148992.1 uncharacterized protein LOC101209064 [Cucumis sativus] | 4.8e-97 | 91
Query:  MSKPHGH--HPPSGRTNLASCIVATIFLIFVVIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQ
        M  PHGH  HPPSGRTNLASC+VAT+FLIF++IV+LIVFFTVFKPQDPKIAVSAVQLPSFSVANGT+NFTFSQYVSV+NPNKASFSHYDSSLQLLYSGSQ
Subjt:  MSKPHGH--HPPSGRTNLASCIVATIFLIFVVIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQ

Query:  IGFMFIPAGKIDAGQTQYMAATFSVQSFPLTAPVPAFGAGPTFSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC
        IGFMFIPAGKIDAGQTQYMAATFSVQSFPL APV + GAGPTFSEGMNGYR+GP LEIESKMDMAGRVRVLHFFTHHVEAT SCRVAIAVSDGSVLGFHC
Subjt:  IGFMFIPAGKIDAGQTQYMAATFSVQSFPLTAPVPAFGAGPTFSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC

XP_022941606.1 uncharacterized protein LOC111446913 [Cucurbita moschata] | 1.4e-99 | 95.45
Query:  MSKPHGHHPPSGRTNLASCIVATIFLIFVVIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIG
        M K H HHPPSGRTNLASCIVATIFLIFV+IVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIG
Subjt:  MSKPHGHHPPSGRTNLASCIVATIFLIFVVIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIG

Query:  FMFIPAGKIDAGQTQYMAATFSVQSFPLTAPVPAFGAGPTFSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC
        FMFIPAGKID+GQTQYMAATFSVQSFPL APV A GAGPT+SEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC
Subjt:  FMFIPAGKIDAGQTQYMAATFSVQSFPLTAPVPAFGAGPTFSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC
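In these alignments the middle line appears to follow the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch. Assuming that convention holds here, identical positions can be counted from the query line and the middle line alone. The function name `midline_identity` is hypothetical, and the strings are the first 20 columns of the Cucurbita moschata alignment above.

```python
def midline_identity(query: str, midline: str) -> tuple[int, int]:
    """Return (identical_columns, total_columns) for one alignment row.

    Assumption: a letter in the midline that equals the query residue
    marks an identity; '+' and ' ' are not identities.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must cover the same columns")
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return identical, len(query)


# First 20 alignment columns of the XP_022941606.1 hit.
query_excerpt = "MSKPHGHHPPSGRTNLASCI"
midline_excerpt = "M K H HHPPSGRTNLASCI"

identical, total = midline_identity(query_excerpt, midline_excerpt)
print(f"{identical}/{total} identical ({100 * identical / total:.1f}%)")
```

The excerpt covers only 20 columns, so its percentage is not comparable to the % identity reported for the full alignment. Whitespace in scraped alignment text may also have been altered, so the midline should be checked for column drift before counting.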

XP_022982486.1 uncharacterized protein LOC111481290 [Cucurbita maxima] | 1.4e-99 | 95.45
Query:  MSKPHGHHPPSGRTNLASCIVATIFLIFVVIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIG
        M K H HHPPSGRTNLASCIVATIFLIFV+IVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIG
Subjt:  MSKPHGHHPPSGRTNLASCIVATIFLIFVVIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIG

Query:  FMFIPAGKIDAGQTQYMAATFSVQSFPLTAPVPAFGAGPTFSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC
        FMFIPAGKID+GQTQYMAATFSVQSFPL APV A GAGPT+SEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC
Subjt:  FMFIPAGKIDAGQTQYMAATFSVQSFPLTAPVPAFGAGPTFSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC

XP_023525568.1 uncharacterized protein LOC111789137 [Cucurbita pepo subsp. pepo] | 3.9e-99 | 94.95
Query:  MSKPHGHHPPSGRTNLASCIVATIFLIFVVIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIG
        M K H HHPPSGRTNLASCIVATIFLIFV+IVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIG
Subjt:  MSKPHGHHPPSGRTNLASCIVATIFLIFVVIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIG

Query:  FMFIPAGKIDAGQTQYMAATFSVQSFPLTAPVPAFGAGPTFSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC
        FMFIPAGKID+GQTQYMAATFSVQSFPL AP  A GAGPT+SEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC
Subjt:  FMFIPAGKIDAGQTQYMAATFSVQSFPLTAPVPAFGAGPTFSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC

XP_038899581.1 uncharacterized protein LOC120086842 [Benincasa hispida] | 5.1e-99 | 93.94
Query:  MSKPHGHHPPSGRTNLASCIVATIFLIFVVIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIG
        MSKPHGHHPPSGRTNLASCIVAT+FLIF+VI+VLIVFFTVFKPQDPKIAVSAVQLPSFSVANG++NFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIG
Subjt:  MSKPHGHHPPSGRTNLASCIVATIFLIFVVIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIG

Query:  FMFIPAGKIDAGQTQYMAATFSVQSFPLTAPVPAFGAGPTFSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC
        FMFIPAGKIDAGQTQYMAATFSVQSF L APV A GAGPTFSEGMNGYRIGPTLEIESKMDM GRVRVLHFFTHHVE T SCRVAIAVSDGSVLGFHC
Subjt:  FMFIPAGKIDAGQTQYMAATFSVQSFPLTAPVPAFGAGPTFSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KX26 LEA_2 domain-containing protein | 2.3e-97 | 91
Query:  MSKPHGH--HPPSGRTNLASCIVATIFLIFVVIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQ
        M  PHGH  HPPSGRTNLASC+VAT+FLIF++IV+LIVFFTVFKPQDPKIAVSAVQLPSFSVANGT+NFTFSQYVSV+NPNKASFSHYDSSLQLLYSGSQ
Subjt:  MSKPHGH--HPPSGRTNLASCIVATIFLIFVVIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQ

Query:  IGFMFIPAGKIDAGQTQYMAATFSVQSFPLTAPVPAFGAGPTFSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC
        IGFMFIPAGKIDAGQTQYMAATFSVQSFPL APV + GAGPTFSEGMNGYR+GP LEIESKMDMAGRVRVLHFFTHHVEAT SCRVAIAVSDGSVLGFHC
Subjt:  IGFMFIPAGKIDAGQTQYMAATFSVQSFPLTAPVPAFGAGPTFSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC

A0A1S3BSP1 uncharacterized protein LOC103493106 | 3.8e-92 | 87.5
Query:  MSKPH--GHHPPSGRTNLASCIVATIFLIFVVIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQ
        M  PH  G  PPSGRTNLASC+VAT+FLIF++IV+LIVFFTVFKPQDPKIAVSAVQLPSFSV NGT+NFTFSQYVSV+NPNKASFSHYDSSLQLLYSGSQ
Subjt:  MSKPH--GHHPPSGRTNLASCIVATIFLIFVVIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQ

Query:  IGFMFIPAGKIDAGQTQYMAATFSVQSFPLTAPVPAFGAGPTFSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC
        IGFMFIPAGKI+AGQTQYMAATFSVQSFPL +PV A GAGPTFS GMNGYR+GP LEIESKMDMAGRVRVL+FFTHHVEA  SCRVAIAVSDGSVLGFHC
Subjt:  IGFMFIPAGKIDAGQTQYMAATFSVQSFPLTAPVPAFGAGPTFSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC

A0A5D3D1L2 Proline-rich receptor-like protein kinase PERK3 | 3.8e-92 | 87.5
Query:  MSKPH--GHHPPSGRTNLASCIVATIFLIFVVIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQ
        M  PH  G  PPSGRTNLASC+VAT+FLIF++IV+LIVFFTVFKPQDPKIAVSAVQLPSFSV NGT+NFTFSQYVSV+NPNKASFSHYDSSLQLLYSGSQ
Subjt:  MSKPH--GHHPPSGRTNLASCIVATIFLIFVVIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQ

Query:  IGFMFIPAGKIDAGQTQYMAATFSVQSFPLTAPVPAFGAGPTFSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC
        IGFMFIPAGKI+AGQTQYMAATFSVQSFPL +PV A GAGPTFS GMNGYR+GP LEIESKMDMAGRVRVL+FFTHHVEA  SCRVAIAVSDGSVLGFHC
Subjt:  IGFMFIPAGKIDAGQTQYMAATFSVQSFPLTAPVPAFGAGPTFSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC

A0A6J1FLK4 uncharacterized protein LOC111446913 | 6.5e-100 | 95.45
Query:  MSKPHGHHPPSGRTNLASCIVATIFLIFVVIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIG
        M K H HHPPSGRTNLASCIVATIFLIFV+IVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIG
Subjt:  MSKPHGHHPPSGRTNLASCIVATIFLIFVVIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIG

Query:  FMFIPAGKIDAGQTQYMAATFSVQSFPLTAPVPAFGAGPTFSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC
        FMFIPAGKID+GQTQYMAATFSVQSFPL APV A GAGPT+SEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC
Subjt:  FMFIPAGKIDAGQTQYMAATFSVQSFPLTAPVPAFGAGPTFSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC

A0A6J1IZG6 uncharacterized protein LOC111481290 | 6.5e-100 | 95.45
Query:  MSKPHGHHPPSGRTNLASCIVATIFLIFVVIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIG
        M K H HHPPSGRTNLASCIVATIFLIFV+IVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIG
Subjt:  MSKPHGHHPPSGRTNLASCIVATIFLIFVVIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIG

Query:  FMFIPAGKIDAGQTQYMAATFSVQSFPLTAPVPAFGAGPTFSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC
        FMFIPAGKID+GQTQYMAATFSVQSFPL APV A GAGPT+SEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC
Subjt:  FMFIPAGKIDAGQTQYMAATFSVQSFPLTAPVPAFGAGPTFSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT1G64450.1 Glycine-rich protein family | 5.5e-59 | 39.77
Query:  MSKPHGHHPPSGRTNLASCIVATIFLIFVVIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIG
        M+KPH     SGRTNLASC VAT+FL+ +++V+L+V+FTVFKP+DPKI+V+AVQLPSF+V+N T NF+FSQYV+VRNPN+A FSHYDSS+QLLYSG+Q+G
Subjt:  MSKPHGHHPPSGRTNLASCIVATIFLIFVVIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIG

Query:  FMFIPAGKIDAGQTQYMAATFSVQSFPLTAP------------------------------------------------------------VPAF-----
        FMFIPAGKID+G+ QYMAATF+V SFP++ P                                                             P+F     
Subjt:  FMFIPAGKIDAGQTQYMAATFSVQSFPLTAP------------------------------------------------------------VPAF-----

Query:  -------------------------------------------------------------------------GAGPTFSEGM------NGYRIGPTLEI
                                                                                 G GPT  +G        G R+GPT+EI
Subjt:  -------------------------------------------------------------------------GAGPTFSEGM------NGYRIGPTLEI

Query:  ESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC
        ESKM++AGRV+VLH FTHHV A   CRV ++++DGSVLGFHC
Subjt:  ESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC

AT2G46150.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 9.0e-09 | 23.23
Query:  HPPSGRTNLASCIVATIFLIFVVIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGT-------VNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIG
        H    R   + C+ AT  ++  +++ L+  FTVF+ +DP I ++ V +       GT        N +    VSV+NPN ASF + +++  + Y G+ +G
Subjt:  HPPSGRTNLASCIVATIFLIFVVIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGT-------VNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIG

Query:  FMFIPAGKIDAGQTQYMAATFSVQSFPLTAPVPAFGAGPTFSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC
              GK    +T  M  T  +    + +  P  G   + S  +N         + S   + G+V+++     HV   ++C +A+ ++  ++    C
Subjt:  FMFIPAGKIDAGQTQYMAATFSVQSFPLTAPVPAFGAGPTFSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC

AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 2.7e-13 | 29.53
Query:  RTNLASCIVATIFLIFVV-IVVLIVFFTVFKPQDPKIAVSAVQLPSFSVA------NGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIP
        + N   CI  TI LI ++ IV++I+ FT+FKP+ P   + +V +     +         +N T +  +S++NPN+  FS+  SS  L Y G  IG   +P
Subjt:  RTNLASCIVATIFLIFVV-IVVLIVFFTVFKPQDPKIAVSAVQLPSFSVA------NGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIP

Query:  AGKIDAGQTQYMAATFSVQSFPLTAPVPAFGAGPTFSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC
        A +I A +T  +  T ++ +  L +           S+ M G      + + + + + G+V VL  F   V+++ SC ++I+VSD +V   HC
Subjt:  AGKIDAGQTQYMAATFSVQSFPLTAPVPAFGAGPTFSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC

AT4G23930.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 3.5e-53 | 54.59
Query:  TNLASCIVATIFLIFVVIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDAGQ
        +NLASC VAT+F++F++I  L V+ TVF+P+DP+I+V++V++PSFSVAN +V+FTFSQ+ +VRNPN+A+FSHY++ +QL Y G++IG+ F+PAG+I++G+
Subjt:  TNLASCIVATIFLIFVVIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDAGQ

Query:  TQYMAATFSVQSFPLTAPVPAFGAGPTFSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC
        T+ M ATFSVQSFPL A   A  +  + ++  N  R G T+EIESK++MAGRVRVL  FTH + A  +CR+AI+ SDGS++   C
Subjt:  TQYMAATFSVQSFPLTAPVPAFGAGPTFSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC

AT4G23930.2 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 1.6e-37 | 45.41
Query:  TNLASCIVATIFLIFVVIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDAGQ
        +NLASC VAT+F++F++I  L V+ TVF+P+DP+I+V++V++PSFSVAN +                           L Y G++IG+ F+PAG+I++G+
Subjt:  TNLASCIVATIFLIFVVIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKIDAGQ

Query:  TQYMAATFSVQSFPLTAPVPAFGAGPTFSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC
        T+ M ATFSVQSFPL A   A  +  + ++  N  R G T+EIESK++MAGRVRVL  FTH + A  +CR+AI+ SDGS++   C
Subjt:  TQYMAATFSVQSFPLTAPVPAFGAGPTFSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC


Sequences
CDS sequence
ATGAGCAAACCCCACGGCCACCACCCGCCGTCCGGCCGAACGAACTTGGCGTCCTGTATAGTCGCCACGATCTTCTTAATCTTCGTCGTCATCGTCGTCCTCATCGTCTT
CTTCACCGTCTTCAAGCCTCAGGATCCGAAGATCGCCGTCTCCGCCGTCCAGTTGCCGTCCTTCTCCGTTGCCAACGGCACTGTCAATTTCACTTTCTCTCAGTACGTCT
CCGTCAGAAACCCTAACAAAGCTTCTTTCTCTCACTACGACAGCTCACTCCAGCTCCTCTACTCCGGTTCTCAAATTGGATTCATGTTCATTCCCGCCGGTAAAATCGAC
GCCGGCCAGACGCAGTACATGGCGGCGACCTTCTCCGTCCAGTCATTCCCCTTGACCGCTCCGGTCCCCGCCTTCGGAGCGGGGCCAACCTTCTCGGAAGGAATGAACGG
GTACAGAATCGGACCGACGCTGGAGATCGAATCGAAAATGGATATGGCCGGTAGGGTTAGGGTATTGCACTTCTTCACTCACCATGTGGAGGCCACATTGAGTTGCAGAG
TCGCCATTGCTGTAAGTGATGGATCTGTGTTAGGTTTCCACTGCTAA
mRNA sequence
CTCATTCAATCATTTTTTTCACTCATTTTCCTATAGAACCAAACATTTAAAACCCAAAATCCAAGAAGACAAAGACAACATACAAACTCCACTTTTCAAATTTAGAAACG
CAAAACAATAAATTTATTTATAATAATAATAAACTTATATTCATAATTTCCACACTTTGTCTTTCCCTTTCCTCCACCATTTTATATCCCAATTTTTTTGCATCCAAACA
CACCGCTATTCTCTCTCCTCTGAAGCAAAAACCAGATCCGTTCCACCTCCGCCGTAAGCACCGCCATGAGCAAACCCCACGGCCACCACCCGCCGTCCGGCCGAACGAAC
TTGGCGTCCTGTATAGTCGCCACGATCTTCTTAATCTTCGTCGTCATCGTCGTCCTCATCGTCTTCTTCACCGTCTTCAAGCCTCAGGATCCGAAGATCGCCGTCTCCGC
CGTCCAGTTGCCGTCCTTCTCCGTTGCCAACGGCACTGTCAATTTCACTTTCTCTCAGTACGTCTCCGTCAGAAACCCTAACAAAGCTTCTTTCTCTCACTACGACAGCT
CACTCCAGCTCCTCTACTCCGGTTCTCAAATTGGATTCATGTTCATTCCCGCCGGTAAAATCGACGCCGGCCAGACGCAGTACATGGCGGCGACCTTCTCCGTCCAGTCA
TTCCCCTTGACCGCTCCGGTCCCCGCCTTCGGAGCGGGGCCAACCTTCTCGGAAGGAATGAACGGGTACAGAATCGGACCGACGCTGGAGATCGAATCGAAAATGGATAT
GGCCGGTAGGGTTAGGGTATTGCACTTCTTCACTCACCATGTGGAGGCCACATTGAGTTGCAGAGTCGCCATTGCTGTAAGTGATGGATCTGTGTTAGGTTTCCACTGCT
AATTCTTCATCTTCATCTTCTTCTTCTTCTTCTTCTTGTTTAGTTCTGGAAAATTTGGTCAGTTGAGTGTTCAAATTCTCATACTCAAAGTGTAAAAAAAAAATCCAAAA
AGATTTTGAGCTCTCTCCCTTGAGTAATTTGTGGAATTAAATTGCAGAGCGATCTGATCTTAGATTTTGCTTAAGCTCTGTTTCTGAACTGAAGTTCATCAGAAATCTCT
TTTTATTTTTGTGAATTTGTGATTTGTTTTTTAGGCAATGAATCTAATTTTGATTCTTAGGGATAAATCTATGGGTTTGAATCATTAAGGAATGTTAGATGTGAACAAAG
GATGATTGCAATTGAAACAAAGAACAAAGTGGGATTGTACTCATTTTCTTTGGACTTCTTTTTTTTTTTTTCCAGCTTTTTTGAAGTTAAATAAATTGGGCACTAAGCAA
AGTTTATTACTTTATAGTTGCTTTTGGCTGAGGATTGAATATTAATGTTAGGTAGTAGGATTTATT
Protein sequence
MSKPHGHHPPSGRTNLASCIVATIFLIFVVIVVLIVFFTVFKPQDPKIAVSAVQLPSFSVANGTVNFTFSQYVSVRNPNKASFSHYDSSLQLLYSGSQIGFMFIPAGKID
AGQTQYMAATFSVQSFPLTAPVPAFGAGPTFSEGMNGYRIGPTLEIESKMDMAGRVRVLHFFTHHVEATLSCRVAIAVSDGSVLGFHC
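
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal Python sketch under the assumption that the standard nuclear genetic code applies; the function name `translate` is hypothetical, and only the first 30 and last 15 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nt and last 15 nt of the CDS shown above.
print(translate("ATGAGCAAACCCCACGGCCACCACCCGCCG"))  # start of the protein
print(translate("GGTTTCCACTGCTAA"))                 # end, including stop
```

The first call yields `MSKPHGHHPP` and the second `GFHC*`, which agree with the start and end of the protein sequence listed on this page. Translating the full CDS the same way would need the wrapped lines joined into one string first.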