; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC07g0323 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC07g0323
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionLate embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
Genome locationMC07:10944650..10945288
RNA-Seq ExpressionMC07g0323
SyntenyMC07g0323
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575200.1 Late embryogenesis abundant protein, partial [Cucurbita argyrosperma subsp. sororia]2.31e-7258.56Show/hide
Query:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGVLVV-ILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSL
        M + +QARPLA T + R S DD       K IQRRR I   C  ++ LL+ L V+V+ IL FT+F++KDPIIQMN IS+       G IP+PGS   VSL
Subjt:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGVLVV-ILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSL

Query:  TADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNC
        TADVSVKNPN A FKYSNTTTTLYI ETV+GEARGP GQA+  RT +MN+T+NI+ + LL N+N  +S+G L+LRSFSR+PGRVK+LH +R++IVVKMNC
Subjt:  TADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNC

Query:  TVVINVLSRSIEDQKCMKRVKL
        T  IN+ ++SIEDQ C ++VK+
Subjt:  TVVINVLSRSIEDQKCMKRVKL

KAG7013763.1 Late embryogenesis abundant protein, partial [Cucurbita argyrosperma subsp. argyrosperma]5.71e-7358.56Show/hide
Query:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGVLVV-ILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSL
        M + +QARPLA T++ R S DD       K IQRRR I   C  ++ LL+ L V+V+ IL FT+F++KDPIIQMN IS+       G IP+PGS   VSL
Subjt:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGVLVV-ILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSL

Query:  TADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNC
        TADVSVKNPN A FKYSNTTTTLYI ETV+GEARGP GQA+  RT +MN+T+NI+ + LL N+N  +S+G L+LRSFSR+PGRVK+LH +R++IVVKMNC
Subjt:  TADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNC

Query:  TVVINVLSRSIEDQKCMKRVKL
        T  IN+ ++SIEDQ C ++VK+
Subjt:  TVVINVLSRSIEDQKCMKRVKL

XP_011656360.1 uncharacterized protein LOC105435724 [Cucumis sativus]2.00e-7357.66Show/hide
Query:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALL-VTLGVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSL
        MV+ +QA+PL   +  R S D+  T    K IQR+R  +KCCS +VALL +   V+++IL FT+F+IKDPIIQMN +S+         IP+PGS   VSL
Subjt:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALL-VTLGVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSL

Query:  TADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNC
        TADVSVKNPN A FKYSNTTTTL+I ETV+GE RGPSG+A+  +T RMN+T++I+A+ +LSN+N  VS G ++LRSFSRIPG+VK+LHFI +++VVKMNC
Subjt:  TADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNC

Query:  TVVINVLSRSIEDQKCMKRVKL
        T VIN+ S+SIEDQKC +++K+
Subjt:  TVVINVLSRSIEDQKCMKRVKL

XP_022147195.1 late embryogenesis abundant protein At1g64065 [Momordica charantia]1.28e-89100Show/hide
Query:  MNGISLAGAIPEPGSKSTVSLTADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNINVSAGTLQLRSFSRIP
        MNGISLAGAIPEPGSKSTVSLTADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNINVSAGTLQLRSFSRIP
Subjt:  MNGISLAGAIPEPGSKSTVSLTADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNINVSAGTLQLRSFSRIP

Query:  GRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKCMKRVKL
        GRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKCMKRVKL
Subjt:  GRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKCMKRVKL

XP_038875202.1 uncharacterized protein LOC120067718 [Benincasa hispida]2.35e-7256.31Show/hide
Query:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGVLVVI-LTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSL
        MV+ +QA+PLA  ++ RSS D+  T+   K IQRRR  +KCC  +V  L+   ++++I L FT+F+IKDP+I+MN +S+       GAIP+PGS   +SL
Subjt:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGVLVVI-LTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSL

Query:  TADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNC
        TADVSVKNPN A FKYSNTTTTL+I ETV+GEARGP G+A+  RT RMN+T++I+A+ +LSN++  VS G ++LRSFSRIPGRVK+LH I +++VVKMNC
Subjt:  TADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNC

Query:  TVVINVLSRSIEDQKCMKRVKL
        T +IN+ +RSIEDQ+C ++VK+
Subjt:  TVVINVLSRSIEDQKCMKRVKL

TrEMBL top hitse value%identityAlignment
A0A0A0KD33 LEA_2 domain-containing protein9.68e-7457.66Show/hide
Query:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALL-VTLGVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSL
        MV+ +QA+PL   +  R S D+  T    K IQR+R  +KCCS +VALL +   V+++IL FT+F+IKDPIIQMN +S+         IP+PGS   VSL
Subjt:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALL-VTLGVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSL

Query:  TADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNC
        TADVSVKNPN A FKYSNTTTTL+I ETV+GE RGPSG+A+  +T RMN+T++I+A+ +LSN+N  VS G ++LRSFSRIPG+VK+LHFI +++VVKMNC
Subjt:  TADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNC

Query:  TVVINVLSRSIEDQKCMKRVKL
        T VIN+ S+SIEDQKC +++K+
Subjt:  TVVINVLSRSIEDQKCMKRVKL

A0A2I4GMG7 uncharacterized protein LOC1090091711.29e-7155.16Show/hide
Query:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSLT
        MVE +QARPLA +++  SS +D+   H+QK   RR+R VK C C+ ALL+   V+++IL FTVF++KDP+I+MNGI++       G  P+PG+   +SLT
Subjt:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSLT

Query:  ADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNINVSAGT----LQLRSFSRIPGRVKMLHFIRKHIVVKMN
        ADVSVKNPN A FKY NTTTTL+   T+VGEARGP GQA+P RT RMNITV+II + LLSN N++A      L + S+SRIPGRVKM+  I+KH+VVKMN
Subjt:  ADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNINVSAGT----LQLRSFSRIPGRVKMLHFIRKHIVVKMN

Query:  CTVVINVLSRSIEDQKCMKRVKL
        CT  +N+ S++I+ QKC ++V L
Subjt:  CTVVINVLSRSIEDQKCMKRVKL

A0A6J1CZG8 late embryogenesis abundant protein At1g640656.22e-90100Show/hide
Query:  MNGISLAGAIPEPGSKSTVSLTADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNINVSAGTLQLRSFSRIP
        MNGISLAGAIPEPGSKSTVSLTADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNINVSAGTLQLRSFSRIP
Subjt:  MNGISLAGAIPEPGSKSTVSLTADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNINVSAGTLQLRSFSRIP

Query:  GRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKCMKRVKL
        GRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKCMKRVKL
Subjt:  GRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKCMKRVKL

A0A6J1H4K3 uncharacterized protein LOC1114603391.59e-7258.11Show/hide
Query:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGV-LVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSL
        M + +QARPLA  ++ R S DD       K IQRRR I   C  ++ LL+ L V +++IL FT+F++KDPIIQMN IS+       G IP+PGS   VSL
Subjt:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGV-LVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSL

Query:  TADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNC
        TADVSVKNPN A FKYSNTTTTLYI ETV+GEARGP GQA+  RT RMN+T+NI+ + LL N+N  +S+G L+LRSFSR+PGRVK+LH +R++IVVKMNC
Subjt:  TADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNC

Query:  TVVINVLSRSIEDQKCMKRVKL
        T  IN+ ++SIEDQ C ++VK+
Subjt:  TVVINVLSRSIEDQKCMKRVKL

A0A6J1L0R6 uncharacterized protein LOC1114993186.41e-7258.11Show/hide
Query:  MVETEQARPLASTSNGRSSGDD-DPTSHLQKGIQRRRRIVKCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSL
        M + +QARPLA  ++ R S DD     HL+K IQR R I   C  +  L++   V+++IL FT+F++KDPIIQMN IS+       G IP+PGS   VSL
Subjt:  MVETEQARPLASTSNGRSSGDD-DPTSHLQKGIQRRRRIVKCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSL

Query:  TADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNC
        TADVSVKNPN A FKYSNTTTTLYI ETV+GEARGP GQA+  RT RMN+T+NI+ + LL N+N  +S+G L+LRSFSR+PGRVK+LH IR++IVVKMNC
Subjt:  TADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNC

Query:  TVVINVLSRSIEDQKCMKRVKL
        T  IN+ ++SIEDQ C ++VK+
Subjt:  TVVINVLSRSIEDQKCMKRVKL

SwissProt top hitse value%identityAlignment
Q6DST1 Late embryogenesis abundant protein At1g640653.0e-0728.49Show/hide
Query:  KCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLAGAIPEPGSKST-----VSLTADVSVKNPNTAMFKYSNTT-TTLYIGETVVGEARGPSGQA
        KC    + ++V +  L +IL+    +I  P I+   IS        G  ST      +L +D+S++N N   F++ ++T   +Y    VVGE +    + 
Subjt:  KCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLAGAIPEPGSKST-----VSLTADVSVKNPNTAMFKYSNTT-TTLYIGETVVGEARGPSGQA

Query:  RPHRTARM-NITVNIIAELLLS----NINVSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKC
          H+T R+  + V I +  LL     + ++  G L+LRS + + GR+K+L   R  + V M+CT+ +N+  R I++  C
Subjt:  RPHRTARM-NITVNIIAELLLS----NINVSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKC

Arabidopsis top hitse value%identityAlignment
AT1G64065.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family2.1e-0828.49Show/hide
Query:  KCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLAGAIPEPGSKST-----VSLTADVSVKNPNTAMFKYSNTT-TTLYIGETVVGEARGPSGQA
        KC    + ++V +  L +IL+    +I  P I+   IS        G  ST      +L +D+S++N N   F++ ++T   +Y    VVGE +    + 
Subjt:  KCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLAGAIPEPGSKST-----VSLTADVSVKNPNTAMFKYSNTT-TTLYIGETVVGEARGPSGQA

Query:  RPHRTARM-NITVNIIAELLLS----NINVSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKC
          H+T R+  + V I +  LL     + ++  G L+LRS + + GR+K+L   R  + V M+CT+ +N+  R I++  C
Subjt:  RPHRTARM-NITVNIIAELLLS----NINVSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKC

AT2G46150.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family3.9e-4241.96Show/hide
Query:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLAGAIPEPGSK------STVSLT
        M ++E  RPLA  +    S  D+  S++ K   R R  +KC  C+ A  + L  +V+ L FTVF++KDPII+MNG+ + G     G+       + +S+ 
Subjt:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLAGAIPEPGSK------STVSLT

Query:  ADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNINVS-----AGTLQLRSFSRIPGRVKMLHFIRKHIVVKM
         DVSVKNPNTA FKYSNTTT +Y   T+VGEA G  G+ARPHRT+RMN+TV+I+ + +LS+  +      +G + + S++R+ G+VK++  ++KH+ VKM
Subjt:  ADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNINVS-----AGTLQLRSFSRIPGRVKMLHFIRKHIVVKM

Query:  NCTVVINVLSRSIEDQKCMKRVKL
        NCT+ +N+  ++I+D  C K++ L
Subjt:  NCTVVINVLSRSIEDQKCMKRVKL

AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family5.6e-1731.58Show/hide
Query:  RRRRIVKCCSCMVALLVTL-GVLVVILTFTVFKIKDPIIQMNGIS---LAGAIPEPGSKSTVSLT--ADVSVKNPNTAMFKYSNTTTTLYIGETVVGEAR
        RR+R  K C C   LL+ L  +++VIL FT+FK K P   ++ ++   L  ++     K  ++LT   D+S+KNPN   F Y +++  L     V+GEA 
Subjt:  RRRRIVKCCSCMVALLVTL-GVLVVILTFTVFKIKDPIIQMNGIS---LAGAIPEPGSKSTVSLT--ADVSVKNPNTAMFKYSNTTTTLYIGETVVGEAR

Query:  GPSGQARPHRTARMNITVNIIAELLLSNI----NVSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKCMKRVKL
         P+ +    +T  +NIT+ ++A+ LLS      +V AG + L +F ++ G+V +L   +  +    +C + I+V  R++  Q C    KL
Subjt:  GPSGQARPHRTARMNITVNIIAELLLSNI----NVSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKCMKRVKL

AT4G23610.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family6.5e-1325Show/hide
Query:  VETEQARPLA----STSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLAGAIP----EPGSKSTVSL
        +  +QA+PLA    +T + +   +D       K +  + +++ CC  + +L + + V  ++L+ TVF +  P + ++ IS          +  +    ++
Subjt:  VETEQARPLA----STSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLAGAIP----EPGSKSTVSL

Query:  TADVSVKNPNTAMFKYSNTTTTLYIGE-TVVGEARGPSGQARPHRTARMNITVNIIAELLLSNI-----NVSAGTLQLRSFSRIPGRVKMLHFIRKHIVV
        + ++S+ NPN A+F   N   + Y GE  VVGE+   S      RT +MN+T  I+   LL+++     +++   + L+S   + GRVK +   RK + +
Subjt:  TADVSVKNPNTAMFKYSNTTTTLYIGE-TVVGEARGPSGQARPHRTARMNITVNIIAELLLSNI-----NVSAGTLQLRSFSRIPGRVKMLHFIRKHIVV

Query:  KMNC
        + +C
Subjt:  KMNC

AT4G23930.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family3.7e-0826.26Show/hide
Query:  SCMVALLVTLGVLVVILT--FTVFKIKDPIIQMNGISLAGAIPEPGSKSTVSLTADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTAR
        SC VA L  + +++  LT   TVF+ +DP I +  + +  +     S  + + +   +V+NPN A F + N    L+     +G    P+G+    RT R
Subjt:  SCMVALLVTLGVLVVILT--FTVFKIKDPIIQMNGISLAGAIPEPGSKSTVSLTADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTAR

Query:  MNITVNI------------IAELLLSNINVSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKC
        M  T ++            I+     N + S  T+++ S   + GRV++L      I  K NC + I+    SI   +C
Subjt:  MNITVNI------------IAELLLSNINVSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGAAACCGAGCAAGCTCGACCGTTAGCCTCAACCTCCAATGGTCGGAGTAGCGGCGACGACGACCCAACATCACACCTACAAAAAGGAATCCAACGAAGAAGAAG
AATCGTAAAATGTTGCAGCTGCATGGTCGCCCTTCTCGTAACATTAGGAGTACTGGTCGTCATCTTGACATTCACCGTGTTTAAAATCAAGGATCCAATAATCCAAATGA
ATGGAATTTCGCTCGCCGGGGCCATCCCGGAGCCAGGATCCAAATCGACCGTGTCGCTGACGGCGGACGTGTCGGTGAAAAACCCCAACACGGCGATGTTCAAGTACAGC
AACACGACGACGACTTTGTACATCGGGGAGACGGTGGTCGGGGAGGCGAGAGGACCGTCGGGGCAGGCCAGGCCGCATCGGACGGCACGGATGAACATAACCGTCAACAT
CATTGCCGAACTGCTGCTGTCGAACATCAACGTCAGCGCCGGGACGCTGCAGTTGAGAAGCTTTTCGAGGATTCCGGGGAGGGTGAAGATGTTGCATTTTATAAGGAAAC
ATATTGTTGTGAAGATGAACTGTACGGTGGTTATCAATGTGCTGAGCCGATCGATTGAGGATCAGAAATGCATGAAGAGGGTGAAGCTA
mRNA sequenceShow/hide mRNA sequence
ATGGTGGAAACCGAGCAAGCTCGACCGTTAGCCTCAACCTCCAATGGTCGGAGTAGCGGCGACGACGACCCAACATCACACCTACAAAAAGGAATCCAACGAAGAAGAAG
AATCGTAAAATGTTGCAGCTGCATGGTCGCCCTTCTCGTAACATTAGGAGTACTGGTCGTCATCTTGACATTCACCGTGTTTAAAATCAAGGATCCAATAATCCAAATGA
ATGGAATTTCGCTCGCCGGGGCCATCCCGGAGCCAGGATCCAAATCGACCGTGTCGCTGACGGCGGACGTGTCGGTGAAAAACCCCAACACGGCGATGTTCAAGTACAGC
AACACGACGACGACTTTGTACATCGGGGAGACGGTGGTCGGGGAGGCGAGAGGACCGTCGGGGCAGGCCAGGCCGCATCGGACGGCACGGATGAACATAACCGTCAACAT
CATTGCCGAACTGCTGCTGTCGAACATCAACGTCAGCGCCGGGACGCTGCAGTTGAGAAGCTTTTCGAGGATTCCGGGGAGGGTGAAGATGTTGCATTTTATAAGGAAAC
ATATTGTTGTGAAGATGAACTGTACGGTGGTTATCAATGTGCTGAGCCGATCGATTGAGGATCAGAAATGCATGAAGAGGGTGAAGCTA
Protein sequenceShow/hide protein sequence
MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLAGAIPEPGSKSTVSLTADVSVKNPNTAMFKYS
NTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNINVSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKCMKRVKL