; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g21110 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g21110
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLate embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
Genome locationchr7:15393386..15394027
RNA-Seq ExpressionMoc07g21110
SyntenyMoc07g21110
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575200.1 Late embryogenesis abundant protein, partial [Cucurbita argyrosperma subsp. sororia]7.6e-5657.01Show/hide
Query:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSLT
        M + +QARPLA T + R S DD       K IQRRR I   C  +  L++   ++++IL FT+F++KDPIIQMN IS+       G IP+PG  S VSLT
Subjt:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSLT

Query:  ADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNCT
        ADVSVKNPN A FKYSNTTTTLYI ETV+GEARGP GQA+  RT +MN+T+NI+ + LL N+N  +S+G L+LRSFSR+PGRVK+LH +R++IVVKMNCT
Subjt:  ADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNCT

Query:  VVINVLSRSIEDQKCMKRVKL
          IN+ ++SIEDQ C ++VK+
Subjt:  VVINVLSRSIEDQKCMKRVKL

KAG7013763.1 Late embryogenesis abundant protein, partial [Cucurbita argyrosperma subsp. argyrosperma]2.6e-5657.01Show/hide
Query:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSLT
        M + +QARPLA T++ R S DD       K IQRRR I   C  +  L++   ++++IL FT+F++KDPIIQMN IS+       G IP+PG  S VSLT
Subjt:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSLT

Query:  ADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNCT
        ADVSVKNPN A FKYSNTTTTLYI ETV+GEARGP GQA+  RT +MN+T+NI+ + LL N+N  +S+G L+LRSFSR+PGRVK+LH +R++IVVKMNCT
Subjt:  ADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNCT

Query:  VVINVLSRSIEDQKCMKRVKL
          IN+ ++SIEDQ C ++VK+
Subjt:  VVINVLSRSIEDQKCMKRVKL

XP_011656360.1 uncharacterized protein LOC105435724 [Cucumis sativus]8.9e-5757.66Show/hide
Query:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALL-VTLGVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSL
        MV+ +QA+PL   +  R S D+  T    K IQ R+R +KCCS +VALL +   V+++IL FT+F+IKDPIIQMN +S+         IP+PG  S VSL
Subjt:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALL-VTLGVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSL

Query:  TADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNC
        TADVSVKNPN A FKYSNTTTTL+I ETV+GE RGPSG+A+  +T RMN+T++I+A+ +LSN+N  VS G ++LRSFSRIPG+VK+LHFI +++VVKMNC
Subjt:  TADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNC

Query:  TVVINVLSRSIEDQKCMKRVKL
        T VIN+ S+SIEDQKC +++K+
Subjt:  TVVINVLSRSIEDQKCMKRVKL

XP_022147195.1 late embryogenesis abundant protein At1g64065 [Momordica charantia]1.3e-68100Show/hide
Query:  MNGISLAGAIPEPGSKSTVSLTADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNINVSAGTLQLRSFSRIP
        MNGISLAGAIPEPGSKSTVSLTADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNINVSAGTLQLRSFSRIP
Subjt:  MNGISLAGAIPEPGSKSTVSLTADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNINVSAGTLQLRSFSRIP

Query:  GRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKCMKRVKL
        GRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKCMKRVKL
Subjt:  GRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKCMKRVKL

XP_038875202.1 uncharacterized protein LOC120067718 [Benincasa hispida]2.0e-5656.31Show/hide
Query:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVA-LLVTLGVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSL
        MV+ +QA+PLA  ++ RSS D+  T+   K IQ RRR +KCC  +V  L++   ++++IL FT+F+IKDP+I+MN +S+       GAIP+PG  S +SL
Subjt:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVA-LLVTLGVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSL

Query:  TADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNI--NVSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNC
        TADVSVKNPN A FKYSNTTTTL+I ETV+GEARGP G+A+  RT RMN+T++I+A+ +LSN+  +VS G ++LRSFSRIPGRVK+LH I +++VVKMNC
Subjt:  TADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNI--NVSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNC

Query:  TVVINVLSRSIEDQKCMKRVKL
        T +IN+ +RSIEDQ+C ++VK+
Subjt:  TVVINVLSRSIEDQKCMKRVKL

TrEMBL top hitse value%identityAlignment
A0A0A0KD33 LEA_2 domain-containing protein4.3e-5757.66Show/hide
Query:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALL-VTLGVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSL
        MV+ +QA+PL   +  R S D+  T    K IQ R+R +KCCS +VALL +   V+++IL FT+F+IKDPIIQMN +S+         IP+PG  S VSL
Subjt:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALL-VTLGVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSL

Query:  TADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNC
        TADVSVKNPN A FKYSNTTTTL+I ETV+GE RGPSG+A+  +T RMN+T++I+A+ +LSN+N  VS G ++LRSFSRIPG+VK+LHFI +++VVKMNC
Subjt:  TADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNC

Query:  TVVINVLSRSIEDQKCMKRVKL
        T VIN+ S+SIEDQKC +++K+
Subjt:  TVVINVLSRSIEDQKCMKRVKL

A0A2I4GMG7 uncharacterized protein LOC1090091711.8e-5555.16Show/hide
Query:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSLT
        MVE +QARPLA +++  SS +D+   H+QK   RR+R VK C C+ ALL+   V+++IL FTVF++KDP+I+MNGI++       G  P+PG  + +SLT
Subjt:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSLT

Query:  ADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNINVSAG----TLQLRSFSRIPGRVKMLHFIRKHIVVKMN
        ADVSVKNPN A FKY NTTTTL+   T+VGEARGP GQA+P RT RMNITV+II + LLSN N++A      L + S+SRIPGRVKM+  I+KH+VVKMN
Subjt:  ADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNINVSAG----TLQLRSFSRIPGRVKMLHFIRKHIVVKMN

Query:  CTVVINVLSRSIEDQKCMKRVKL
        CT  +N+ S++I+ QKC ++V L
Subjt:  CTVVINVLSRSIEDQKCMKRVKL

A0A6J1CZG8 late embryogenesis abundant protein At1g640656.4e-69100Show/hide
Query:  MNGISLAGAIPEPGSKSTVSLTADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNINVSAGTLQLRSFSRIP
        MNGISLAGAIPEPGSKSTVSLTADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNINVSAGTLQLRSFSRIP
Subjt:  MNGISLAGAIPEPGSKSTVSLTADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNINVSAGTLQLRSFSRIP

Query:  GRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKCMKRVKL
        GRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKCMKRVKL
Subjt:  GRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKCMKRVKL

A0A6J1H4K3 uncharacterized protein LOC1114603394.8e-5657.66Show/hide
Query:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGV-LVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSL
        M + +QARPLA  ++ R S DD       K IQ RRR +K    ++ LL+ L V +++IL FT+F++KDPIIQMN IS+       G IP+PG  S VSL
Subjt:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGV-LVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSL

Query:  TADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNC
        TADVSVKNPN A FKYSNTTTTLYI ETV+GEARGP GQA+  RT RMN+T+NI+ + LL N+N  +S+G L+LRSFSR+PGRVK+LH +R++IVVKMNC
Subjt:  TADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNC

Query:  TVVINVLSRSIEDQKCMKRVKL
        T  IN+ ++SIEDQ C ++VK+
Subjt:  TVVINVLSRSIEDQKCMKRVKL

A0A6J1L0R6 uncharacterized protein LOC1114993181.1e-5557.47Show/hide
Query:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSLT
        M + +QARPLA  ++ R S DD       K IQR R I   C  +  L++   V+++IL FT+F++KDPIIQMN IS+       G IP+PG  S VSLT
Subjt:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSLT

Query:  ADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNCT
        ADVSVKNPN A FKYSNTTTTLYI ETV+GEARGP GQA+  RT RMN+T+NI+ + LL N+N  +S+G L+LRSFSR+PGRVK+LH IR++IVVKMNCT
Subjt:  ADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNCT

Query:  VVINVLSRSIEDQKCMKRVKL
          IN+ ++SIEDQ C ++VK+
Subjt:  VVINVLSRSIEDQKCMKRVKL

SwissProt top hitse value%identityAlignment
Q6DST1 Late embryogenesis abundant protein At1g640653.0e-0728.49Show/hide
Query:  KCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLAGAIPEPGSKST-----VSLTADVSVKNPNTAMFKYSNTT-TTLYIGETVVGEARGPSGQA
        KC    + ++V +  L +IL+    +I  P I+   IS        G  ST      +L +D+S++N N   F++ ++T   +Y    VVGE +    + 
Subjt:  KCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLAGAIPEPGSKST-----VSLTADVSVKNPNTAMFKYSNTT-TTLYIGETVVGEARGPSGQA

Query:  RPHRTARM-NITVNIIAELLLS----NINVSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKC
          H+T R+  + V I +  LL     + ++  G L+LRS + + GR+K+L   R  + V M+CT+ +N+  R I++  C
Subjt:  RPHRTARM-NITVNIIAELLLS----NINVSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKC

Arabidopsis top hitse value%identityAlignment
AT1G64065.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family2.1e-0828.49Show/hide
Query:  KCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLAGAIPEPGSKST-----VSLTADVSVKNPNTAMFKYSNTT-TTLYIGETVVGEARGPSGQA
        KC    + ++V +  L +IL+    +I  P I+   IS        G  ST      +L +D+S++N N   F++ ++T   +Y    VVGE +    + 
Subjt:  KCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLAGAIPEPGSKST-----VSLTADVSVKNPNTAMFKYSNTT-TTLYIGETVVGEARGPSGQA

Query:  RPHRTARM-NITVNIIAELLLS----NINVSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKC
          H+T R+  + V I +  LL     + ++  G L+LRS + + GR+K+L   R  + V M+CT+ +N+  R I++  C
Subjt:  RPHRTARM-NITVNIIAELLLS----NINVSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKC

AT2G46150.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family3.9e-4241.96Show/hide
Query:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLAGAIPEPGSK------STVSLT
        M ++E  RPLA  +    S  D+  S++ K   R R  +KC  C+ A  + L  +V+ L FTVF++KDPII+MNG+ + G     G+       + +S+ 
Subjt:  MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLAGAIPEPGSK------STVSLT

Query:  ADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNINVS-----AGTLQLRSFSRIPGRVKMLHFIRKHIVVKM
         DVSVKNPNTA FKYSNTTT +Y   T+VGEA G  G+ARPHRT+RMN+TV+I+ + +LS+  +      +G + + S++R+ G+VK++  ++KH+ VKM
Subjt:  ADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNINVS-----AGTLQLRSFSRIPGRVKMLHFIRKHIVVKM

Query:  NCTVVINVLSRSIEDQKCMKRVKL
        NCT+ +N+  ++I+D  C K++ L
Subjt:  NCTVVINVLSRSIEDQKCMKRVKL

AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family5.6e-1731.58Show/hide
Query:  RRRRIVKCCSCMVALLVTL-GVLVVILTFTVFKIKDPIIQMNGIS---LAGAIPEPGSKSTVSLT--ADVSVKNPNTAMFKYSNTTTTLYIGETVVGEAR
        RR+R  K C C   LL+ L  +++VIL FT+FK K P   ++ ++   L  ++     K  ++LT   D+S+KNPN   F Y +++  L     V+GEA 
Subjt:  RRRRIVKCCSCMVALLVTL-GVLVVILTFTVFKIKDPIIQMNGIS---LAGAIPEPGSKSTVSLT--ADVSVKNPNTAMFKYSNTTTTLYIGETVVGEAR

Query:  GPSGQARPHRTARMNITVNIIAELLLSNI----NVSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKCMKRVKL
         P+ +    +T  +NIT+ ++A+ LLS      +V AG + L +F ++ G+V +L   +  +    +C + I+V  R++  Q C    KL
Subjt:  GPSGQARPHRTARMNITVNIIAELLLSNI----NVSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKCMKRVKL

AT4G23610.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family6.5e-1325Show/hide
Query:  VETEQARPLA----STSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLAGAIP----EPGSKSTVSL
        +  +QA+PLA    +T + +   +D       K +  + +++ CC  + +L + + V  ++L+ TVF +  P + ++ IS          +  +    ++
Subjt:  VETEQARPLA----STSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLAGAIP----EPGSKSTVSL

Query:  TADVSVKNPNTAMFKYSNTTTTLYIGE-TVVGEARGPSGQARPHRTARMNITVNIIAELLLSNI-----NVSAGTLQLRSFSRIPGRVKMLHFIRKHIVV
        + ++S+ NPN A+F   N   + Y GE  VVGE+   S      RT +MN+T  I+   LL+++     +++   + L+S   + GRVK +   RK + +
Subjt:  TADVSVKNPNTAMFKYSNTTTTLYIGE-TVVGEARGPSGQARPHRTARMNITVNIIAELLLSNI-----NVSAGTLQLRSFSRIPGRVKMLHFIRKHIVV

Query:  KMNC
        + +C
Subjt:  KMNC

AT4G23930.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family3.7e-0826.26Show/hide
Query:  SCMVALLVTLGVLVVILT--FTVFKIKDPIIQMNGISLAGAIPEPGSKSTVSLTADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTAR
        SC VA L  + +++  LT   TVF+ +DP I +  + +  +     S  + + +   +V+NPN A F + N    L+     +G    P+G+    RT R
Subjt:  SCMVALLVTLGVLVVILT--FTVFKIKDPIIQMNGISLAGAIPEPGSKSTVSLTADVSVKNPNTAMFKYSNTTTTLYIGETVVGEARGPSGQARPHRTAR

Query:  MNITVNI------------IAELLLSNINVSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKC
        M  T ++            I+     N + S  T+++ S   + GRV++L      I  K NC + I+    SI   +C
Subjt:  MNITVNI------------IAELLLSNINVSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGAAACCGAGCAAGCTCGACCGTTAGCCTCGACCTCCAATGGTCGGAGTAGCGGCGACGACGACCCAACATCACACCTACAAAAAGGAATCCAACGAAGAAGAAG
AATCGTAAAATGTTGCAGCTGCATGGTCGCCCTTCTCGTAACATTAGGAGTACTGGTCGTCATCTTGACATTCACCGTGTTTAAAATCAAGGATCCAATAATCCAAATGA
ATGGAATTTCGCTCGCCGGGGCCATCCCGGAGCCAGGATCCAAATCGACCGTGTCGCTGACGGCGGACGTGTCGGTGAAAAACCCCAACACGGCGATGTTCAAGTACAGC
AACACGACGACGACTTTGTACATCGGGGAGACGGTGGTCGGGGAGGCGAGAGGACCGTCGGGGCAGGCCAGGCCGCATCGGACGGCACGGATGAACATAACCGTCAACAT
CATTGCCGAACTGCTGCTGTCGAACATCAACGTCAGCGCCGGGACGCTGCAGTTGAGAAGCTTTTCGAGGATTCCGGGGAGGGTGAAGATGTTGCATTTTATAAGGAAAC
ATATTGTTGTGAAGATGAACTGTACGGTGGTTATCAATGTGCTGAGCCGATCGATTGAGGATCAGAAATGCATGAAGAGGGTGAAGCTATAG
mRNA sequenceShow/hide mRNA sequence
ATGGTGGAAACCGAGCAAGCTCGACCGTTAGCCTCGACCTCCAATGGTCGGAGTAGCGGCGACGACGACCCAACATCACACCTACAAAAAGGAATCCAACGAAGAAGAAG
AATCGTAAAATGTTGCAGCTGCATGGTCGCCCTTCTCGTAACATTAGGAGTACTGGTCGTCATCTTGACATTCACCGTGTTTAAAATCAAGGATCCAATAATCCAAATGA
ATGGAATTTCGCTCGCCGGGGCCATCCCGGAGCCAGGATCCAAATCGACCGTGTCGCTGACGGCGGACGTGTCGGTGAAAAACCCCAACACGGCGATGTTCAAGTACAGC
AACACGACGACGACTTTGTACATCGGGGAGACGGTGGTCGGGGAGGCGAGAGGACCGTCGGGGCAGGCCAGGCCGCATCGGACGGCACGGATGAACATAACCGTCAACAT
CATTGCCGAACTGCTGCTGTCGAACATCAACGTCAGCGCCGGGACGCTGCAGTTGAGAAGCTTTTCGAGGATTCCGGGGAGGGTGAAGATGTTGCATTTTATAAGGAAAC
ATATTGTTGTGAAGATGAACTGTACGGTGGTTATCAATGTGCTGAGCCGATCGATTGAGGATCAGAAATGCATGAAGAGGGTGAAGCTATAG
Protein sequenceShow/hide protein sequence
MVETEQARPLASTSNGRSSGDDDPTSHLQKGIQRRRRIVKCCSCMVALLVTLGVLVVILTFTVFKIKDPIIQMNGISLAGAIPEPGSKSTVSLTADVSVKNPNTAMFKYS
NTTTTLYIGETVVGEARGPSGQARPHRTARMNITVNIIAELLLSNINVSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKCMKRVKL