CuGenDBv2

MS022902 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS022902
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Late embryogenesis abundant protein
Genome location: scaffold635:119632..120270
RNA-Seq Expression: MS022902
Synteny: MS022902
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: IPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup
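
The genome location in this record has the form `seqid:start..end`. As a minimal sketch, the string can be split into its parts and the span length computed. The function name `parse_location` is hypothetical, and the arithmetic assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("scaffold635:119632..120270")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_length = end - start + 1
print(seqid, span_length)  # scaffold635 639
```

Under that assumption the span is 639 bp, which is the same length as the CDS listed for this gene.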


Homology
GenBank top hits (e value, % identity, alignment)
KAG2725396.1 hypothetical protein I3760_01G064000 [Carya illinoinensis] (e value: 6.2e-58; % identity: 56.95)
Query:  MVETEQARPLASTSNGRSSDDDDPTSHLQKGIRRRRRIVKCCSCMVALLVTLAVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSLT
        MVE +QARPLA  ++  SSD+D+   H+QK   RR+R VK C C+ ALL+  AV+++IL FTVF++KDP+I+MNGI++       G IP+PG  + +SLT
Subjt:  MVETEQARPLASTSNGRSSDDDDPTSHLQKGIRRRRRIVKCCSCMVALLVTLAVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSLT

Query:  ADVSVKNPNTATFKYSNTTTTLYIGETVVGEARGPPGQARPHRTARMNITVNIIADRLLSNINVSAGT----LQLRSFSRIPGRVKMLHFIRKHIVVKMN
        AD+SVKNPN A+FKY NTTT L+   TVVGEARGPPGQA+P RT+RMNITV+II D+LLSN N++A      L + S+SRIPGRVKM+  I+KH+VVKMN
Subjt:  ADVSVKNPNTATFKYSNTTTTLYIGETVVGEARGPPGQARPHRTARMNITVNIIADRLLSNINVSAGT----LQLRSFSRIPGRVKMLHFIRKHIVVKMN

Query:  CTVVINVLSRSIEDQKCMKRVKL
        CT+ +N+ S++I+ QKC ++V L
Subjt:  CTVVINVLSRSIEDQKCMKRVKL

KAG7013763.1 Late embryogenesis abundant protein, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 4.7e-58; % identity: 59.19)
Query:  MVETEQARPLASTSNGR-SSDDDDPTSHLQKGIRRRRRIVKCCSCMVALLVTLAVLVV-ILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVS
        M + +QARPLA T++ R SSDD     HL++   +RRR +K    ++ LL+ L+V+V+ IL FT+F++KDPIIQMN IS+       G IP+PG  S VS
Subjt:  MVETEQARPLASTSNGR-SSDDDDPTSHLQKGIRRRRRIVKCCSCMVALLVTLAVLVV-ILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVS

Query:  LTADVSVKNPNTATFKYSNTTTTLYIGETVVGEARGPPGQARPHRTARMNITVNIIADRLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMN
        LTADVSVKNPN A+FKYSNTTTTLYI ETV+GEARGPPGQA+  RT +MN+T+NI+ DRLL N+N  +S+G L+LRSFSR+PGRVK+LH +R++IVVKMN
Subjt:  LTADVSVKNPNTATFKYSNTTTTLYIGETVVGEARGPPGQARPHRTARMNITVNIIADRLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMN

Query:  CTVVINVLSRSIEDQKCMKRVKL
        CT  IN+ ++SIEDQ C ++VK+
Subjt:  CTVVINVLSRSIEDQKCMKRVKL

PON32698.1 Immunoglobulin-like fold containing protein [Parasponia andersonii] (e value: 8.1e-58; % identity: 56.95)
Query:  MVETEQARPLASTSNGRSSDDDDPTSHLQKGIRRRRRIVKCCSCMVALLVTLAVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSLT
        M E EQARPLA  ++  SSDDDD T+ L+K   RRR+ +KCC C+ AL++  AV+++IL FTVF++KDP+I+MN I++          P+PG  + +SLT
Subjt:  MVETEQARPLASTSNGRSSDDDDPTSHLQKGIRRRRRIVKCCSCMVALLVTLAVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSLT

Query:  ADVSVKNPNTATFKYSNTTTTLYIGETVVGEARGPPGQARPHRTARMNITVNIIADRLLSNIN----VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMN
        ADVSVKNPN A+FKY NTTTTLY    VVGEARGPPGQA+P RT RMNITV+II DRL+S+ N    V +G L + S+SRIPGRVKML+ I++H+VVKMN
Subjt:  ADVSVKNPNTATFKYSNTTTTLYIGETVVGEARGPPGQARPHRTARMNITVNIIADRLLSNIN----VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMN

Query:  CTVVINVLSRSIEDQKCMKRVKL
        CT+ +N+ S++I++QKC ++V L
Subjt:  CTVVINVLSRSIEDQKCMKRVKL

XP_022147195.1 late embryogenesis abundant protein At1g64065 [Momordica charantia] (e value: 3.6e-66; % identity: 97.16)
Query:  MNGISLAGAIPEPGSKSTVSLTADVSVKNPNTATFKYSNTTTTLYIGETVVGEARGPPGQARPHRTARMNITVNIIADRLLSNINVSAGTLQLRSFSRIP
        MNGISLAGAIPEPGSKSTVSLTADVSVKNPNTA FKYSNTTTTLYIGETVVGEARGP GQARPHRTARMNITVNIIA+ LLSNINVSAGTLQLRSFSRIP
Subjt:  MNGISLAGAIPEPGSKSTVSLTADVSVKNPNTATFKYSNTTTTLYIGETVVGEARGPPGQARPHRTARMNITVNIIADRLLSNINVSAGTLQLRSFSRIP

Query:  GRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKCMKRVKL
        GRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKCMKRVKL
Subjt:  GRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKCMKRVKL

XP_038875202.1 uncharacterized protein LOC120067718 [Benincasa hispida] (e value: 4.7e-58; % identity: 56.95)
Query:  MVETEQARPLASTSNGRSSDDDDPTS-HLQKGIRRRRRIVKCCSCMVA-LLVTLAVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVS
        MV+ +QA+PLA  ++ RSS D+  T+ HL++   +RRR +KCC  +V  L++   ++++IL FT+F+IKDP+I+MN +S+       GAIP+PG  S +S
Subjt:  MVETEQARPLASTSNGRSSDDDDPTS-HLQKGIRRRRRIVKCCSCMVA-LLVTLAVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVS

Query:  LTADVSVKNPNTATFKYSNTTTTLYIGETVVGEARGPPGQARPHRTARMNITVNIIADRLLSNI--NVSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMN
        LTADVSVKNPN A+FKYSNTTTTL+I ETV+GEARGPPG+A+  RT RMN+T++I+ADR+LSN+  +VS G ++LRSFSRIPGRVK+LH I +++VVKMN
Subjt:  LTADVSVKNPNTATFKYSNTTTTLYIGETVVGEARGPPGQARPHRTARMNITVNIIADRLLSNI--NVSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMN

Query:  CTVVINVLSRSIEDQKCMKRVKL
        CT +IN+ +RSIEDQ+C ++VK+
Subjt:  CTVVINVLSRSIEDQKCMKRVKL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KD33 LEA_2 domain-containing protein (e value: 8.7e-58; % identity: 57.85)
Query:  MVETEQARPLA-STSNGRSSDDDDPTSHLQKGIRRRRRIVKCCSCMVALL-VTLAVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVS
        MV+ +QA+PL  +T N  SSD+ +   HL++   +R+R +KCCS +VALL +   V+++IL FT+F+IKDPIIQMN +S+         IP+PG  S VS
Subjt:  MVETEQARPLA-STSNGRSSDDDDPTSHLQKGIRRRRRIVKCCSCMVALL-VTLAVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVS

Query:  LTADVSVKNPNTATFKYSNTTTTLYIGETVVGEARGPPGQARPHRTARMNITVNIIADRLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMN
        LTADVSVKNPN A+FKYSNTTTTL+I ETV+GE RGP G+A+  +T RMN+T++I+ADR+LSN+N  VS G ++LRSFSRIPG+VK+LHFI +++VVKMN
Subjt:  LTADVSVKNPNTATFKYSNTTTTLYIGETVVGEARGPPGQARPHRTARMNITVNIIADRLLSNIN--VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMN

Query:  CTVVINVLSRSIEDQKCMKRVKL
        CT VIN+ S+SIEDQKC +++K+
Subjt:  CTVVINVLSRSIEDQKCMKRVKL

A0A2I4GMG7 uncharacterized protein LOC109009171 (e value: 5.1e-58; % identity: 56.95)
Query:  MVETEQARPLASTSNGRSSDDDDPTSHLQKGIRRRRRIVKCCSCMVALLVTLAVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSLT
        MVE +QARPLA +++  SSD+D+   H+QK   RR+R VK C C+ ALL+  AV+++IL FTVF++KDP+I+MNGI++       G  P+PG  + +SLT
Subjt:  MVETEQARPLASTSNGRSSDDDDPTSHLQKGIRRRRRIVKCCSCMVALLVTLAVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSLT

Query:  ADVSVKNPNTATFKYSNTTTTLYIGETVVGEARGPPGQARPHRTARMNITVNIIADRLLSNINVSAG----TLQLRSFSRIPGRVKMLHFIRKHIVVKMN
        ADVSVKNPN A+FKY NTTTTL+   T+VGEARGPPGQA+P RT RMNITV+II D+LLSN N++A      L + S+SRIPGRVKM+  I+KH+VVKMN
Subjt:  ADVSVKNPNTATFKYSNTTTTLYIGETVVGEARGPPGQARPHRTARMNITVNIIADRLLSNINVSAG----TLQLRSFSRIPGRVKMLHFIRKHIVVKMN

Query:  CTVVINVLSRSIEDQKCMKRVKL
        CT  +N+ S++I+ QKC ++V L
Subjt:  CTVVINVLSRSIEDQKCMKRVKL

A0A2P5A832 Immunoglobulin-like fold containing protein (e value: 3.9e-58; % identity: 56.95)
Query:  MVETEQARPLASTSNGRSSDDDDPTSHLQKGIRRRRRIVKCCSCMVALLVTLAVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSLT
        M E EQARPLA  ++  SSDDDD T+ L+K   RRR+ +KCC C+ AL++  AV+++IL FTVF++KDP+I+MN I++          P+PG  + +SLT
Subjt:  MVETEQARPLASTSNGRSSDDDDPTSHLQKGIRRRRRIVKCCSCMVALLVTLAVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSLT

Query:  ADVSVKNPNTATFKYSNTTTTLYIGETVVGEARGPPGQARPHRTARMNITVNIIADRLLSNIN----VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMN
        ADVSVKNPN A+FKY NTTTTLY    VVGEARGPPGQA+P RT RMNITV+II DRL+S+ N    V +G L + S+SRIPGRVKML+ I++H+VVKMN
Subjt:  ADVSVKNPNTATFKYSNTTTTLYIGETVVGEARGPPGQARPHRTARMNITVNIIADRLLSNIN----VSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMN

Query:  CTVVINVLSRSIEDQKCMKRVKL
        CT+ +N+ S++I++QKC ++V L
Subjt:  CTVVINVLSRSIEDQKCMKRVKL

A0A2P5C6S3 Immunoglobulin-like fold containing protein (e value: 6.7e-58; % identity: 56.95)
Query:  MVETEQARPLASTSNGRSSDDDDPTSHLQKGIRRRRRIVKCCSCMVALLVTLAVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSLT
        M E EQARPLA  ++  SSDDDD  + L+K   RRR+ +KCC C+ AL++  AV+++IL FTVF++KDP+I+MN I++          P+PG  + +SLT
Subjt:  MVETEQARPLASTSNGRSSDDDDPTSHLQKGIRRRRRIVKCCSCMVALLVTLAVLVVILTFTVFKIKDPIIQMNGISLA------GAIPEPGSKSTVSLT

Query:  ADVSVKNPNTATFKYSNTTTTLYIGETVVGEARGPPGQARPHRTARMNITVNIIADRLLSNINVSA----GTLQLRSFSRIPGRVKMLHFIRKHIVVKMN
        ADVSVKNPN A+FKY NTTTTLY    VVGEARGPPGQA+P RT RMNITV+II DRLLS+ N++A    G L + S+SRIPGRVKML+ I++H+VVKMN
Subjt:  ADVSVKNPNTATFKYSNTTTTLYIGETVVGEARGPPGQARPHRTARMNITVNIIADRLLSNINVSA----GTLQLRSFSRIPGRVKMLHFIRKHIVVKMN

Query:  CTVVINVLSRSIEDQKCMKRVKL
        CT+ +N+ S++I++QKC ++V L
Subjt:  CTVVINVLSRSIEDQKCMKRVKL

A0A6J1CZG8 late embryogenesis abundant protein At1g64065 (e value: 1.7e-66; % identity: 97.16)
Query:  MNGISLAGAIPEPGSKSTVSLTADVSVKNPNTATFKYSNTTTTLYIGETVVGEARGPPGQARPHRTARMNITVNIIADRLLSNINVSAGTLQLRSFSRIP
        MNGISLAGAIPEPGSKSTVSLTADVSVKNPNTA FKYSNTTTTLYIGETVVGEARGP GQARPHRTARMNITVNIIA+ LLSNINVSAGTLQLRSFSRIP
Subjt:  MNGISLAGAIPEPGSKSTVSLTADVSVKNPNTATFKYSNTTTTLYIGETVVGEARGPPGQARPHRTARMNITVNIIADRLLSNINVSAGTLQLRSFSRIP

Query:  GRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKCMKRVKL
        GRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKCMKRVKL
Subjt:  GRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKCMKRVKL

SwissProt top hits (e value, % identity, alignment)
Q6DST1 Late embryogenesis abundant protein At1g64065 (e value: 5.2e-07; % identity: 27.8)
Query:  MVETEQARPLASTSNGRSSDDDDPTSHLQKGIRRRRRIV-----KCCSCMVALLVTLAVLVVILTFTVFKIKDPIIQMNGISLAGAIPEPGSKST-----
        MV+ ++     +   GRS ++       Q G R  RR       KC    + ++V +  L +IL+    +I  P I+   IS        G  ST     
Subjt:  MVETEQARPLASTSNGRSSDDDDPTSHLQKGIRRRRRIV-----KCCSCMVALLVTLAVLVVILTFTVFKIKDPIIQMNGISLAGAIPEPGSKST-----

Query:  VSLTADVSVKNPNTATFKYSNTT-TTLYIGETVVGEARGPPGQARPHRTARM-NITVNIIADRLLS----NINVSAGTLQLRSFSRIPGRVKMLHFIRKH
         +L +D+S++N N   F++ ++T   +Y    VVGE +    +   H+T R+  + V I + RLL     + ++  G L+LRS + + GR+K+L   R  
Subjt:  VSLTADVSVKNPNTATFKYSNTT-TTLYIGETVVGEARGPPGQARPHRTARM-NITVNIIADRLLS----NINVSAGTLQLRSFSRIPGRVKMLHFIRKH

Query:  IVVKMNCTVVINVLSRSIEDQKC
        + V M+CT+ +N+  R I++  C
Subjt:  IVVKMNCTVVINVLSRSIEDQKC

Arabidopsis top hits (e value, % identity, alignment)
AT1G64065.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 3.7e-08; % identity: 27.8)
Query:  MVETEQARPLASTSNGRSSDDDDPTSHLQKGIRRRRRIV-----KCCSCMVALLVTLAVLVVILTFTVFKIKDPIIQMNGISLAGAIPEPGSKST-----
        MV+ ++     +   GRS ++       Q G R  RR       KC    + ++V +  L +IL+    +I  P I+   IS        G  ST     
Subjt:  MVETEQARPLASTSNGRSSDDDDPTSHLQKGIRRRRRIV-----KCCSCMVALLVTLAVLVVILTFTVFKIKDPIIQMNGISLAGAIPEPGSKST-----

Query:  VSLTADVSVKNPNTATFKYSNTT-TTLYIGETVVGEARGPPGQARPHRTARM-NITVNIIADRLLS----NINVSAGTLQLRSFSRIPGRVKMLHFIRKH
         +L +D+S++N N   F++ ++T   +Y    VVGE +    +   H+T R+  + V I + RLL     + ++  G L+LRS + + GR+K+L   R  
Subjt:  VSLTADVSVKNPNTATFKYSNTT-TTLYIGETVVGEARGPPGQARPHRTARM-NITVNIIADRLLS----NINVSAGTLQLRSFSRIPGRVKMLHFIRKH

Query:  IVVKMNCTVVINVLSRSIEDQKC
        + V M+CT+ +N+  R I++  C
Subjt:  IVVKMNCTVVINVLSRSIEDQKC

AT2G46150.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 1.9e-44; % identity: 43.75)
Query:  MVETEQARPLASTSNGRSSDDDDPTSHLQKGIRRRRRIVKCCSCMVALLVTLAVLVVILTFTVFKIKDPIIQMNGISLAGAIPEPGSK------STVSLT
        M ++E  RPLA  +    SD+    S+++   R R RI KC  C+ A  + L  +V+ L FTVF++KDPII+MNG+ + G     G+       + +S+ 
Subjt:  MVETEQARPLASTSNGRSSDDDDPTSHLQKGIRRRRRIVKCCSCMVALLVTLAVLVVILTFTVFKIKDPIIQMNGISLAGAIPEPGSK------STVSLT

Query:  ADVSVKNPNTATFKYSNTTTTLYIGETVVGEARGPPGQARPHRTARMNITVNIIADRLLSNINVS-----AGTLQLRSFSRIPGRVKMLHFIRKHIVVKM
         DVSVKNPNTA+FKYSNTTT +Y   T+VGEA G PG+ARPHRT+RMN+TV+I+ DR+LS+  +      +G + + S++R+ G+VK++  ++KH+ VKM
Subjt:  ADVSVKNPNTATFKYSNTTTTLYIGETVVGEARGPPGQARPHRTARMNITVNIIADRLLSNINVS-----AGTLQLRSFSRIPGRVKMLHFIRKHIVVKM

Query:  NCTVVINVLSRSIEDQKCMKRVKL
        NCT+ +N+  ++I+D  C K++ L
Subjt:  NCTVVINVLSRSIEDQKCMKRVKL

AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 8.7e-18; % identity: 29.81)
Query:  NGRSSDDDDPTSHLQKGIRRRRRIVKCCSCMVALLVTLAVLVVILTFTVFKIKDPIIQMNGIS---LAGAIPEPGSKSTVSLT--ADVSVKNPNTATFKY
        N  S +     +   K +RR+R    C    + L++ +A+++VIL FT+FK K P   ++ ++   L  ++     K  ++LT   D+S+KNPN   F Y
Subjt:  NGRSSDDDDPTSHLQKGIRRRRRIVKCCSCMVALLVTLAVLVVILTFTVFKIKDPIIQMNGIS---LAGAIPEPGSKSTVSLT--ADVSVKNPNTATFKY

Query:  SNTTTTLYIGETVVGEARGPPGQARPHRTARMNITVNIIADRLLSNI----NVSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQ
         +++  L     V+GEA  P  +    +T  +NIT+ ++ADRLLS      +V AG + L +F ++ G+V +L   +  +    +C + I+V  R++  Q
Subjt:  SNTTTTLYIGETVVGEARGPPGQARPHRTARMNITVNIIADRLLSNI----NVSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQ

Query:  KCMKRVKL
         C    KL
Subjt:  KCMKRVKL

AT4G23610.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 4.9e-13; % identity: 25.98)
Query:  VETEQARPLAS---TSNGRSSDDDDPTSH-LQKGIRRRRRIVKCCSCMVALLVTLAVLVVILTFTVFKIKDPIIQMNGISLAGAIP----EPGSKSTVSL
        +  +QA+PLA    T+     D++D   H   K +  + +++ CC  + +L + +AV  ++L+ TVF +  P + ++ IS          +  +    ++
Subjt:  VETEQARPLAS---TSNGRSSDDDDPTSH-LQKGIRRRRRIVKCCSCMVALLVTLAVLVVILTFTVFKIKDPIIQMNGISLAGAIP----EPGSKSTVSL

Query:  TADVSVKNPNTATFKYSNTTTTLYIGE-TVVGEARGPPGQARPHRTARMNITVNIIADRLLSNI-----NVSAGTLQLRSFSRIPGRVKMLHFIRKHIVV
        + ++S+ NPN A F   N   + Y GE  VVGE+          RT +MN+T  I+  +LL+++     +++   + L+S   + GRVK +   RK + +
Subjt:  TADVSVKNPNTATFKYSNTTTTLYIGE-TVVGEARGPPGQARPHRTARMNITVNIIADRLLSNI-----NVSAGTLQLRSFSRIPGRVKMLHFIRKHIVV

Query:  KMNC
        + +C
Subjt:  KMNC

AT4G23930.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 1.4e-07; % identity: 26.26)
Query:  SCMVALLVTLAVLVVILT--FTVFKIKDPIIQMNGISLAGAIPEPGSKSTVSLTADVSVKNPNTATFKYSNTTTTLYIGETVVGEARGPPGQARPHRTAR
        SC VA L  + +++  LT   TVF+ +DP I +  + +  +     S  + + +   +V+NPN A F + N    L+     +G    P G+    RT R
Subjt:  SCMVALLVTLAVLVVILT--FTVFKIKDPIIQMNGISLAGAIPEPGSKSTVSLTADVSVKNPNTATFKYSNTTTTLYIGETVVGEARGPPGQARPHRTAR

Query:  MNITVNI------------IADRLLSNINVSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKC
        M  T ++            I+     N + S  T+++ S   + GRV++L      I  K NC + I+    SI   +C
Subjt:  MNITVNI------------IADRLLSNINVSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKC


Sequences
CDS sequence
ATGGTGGAAACCGAGCAAGCTCGACCGTTAGCCTCGACCTCCAACGGTCGGAGTAGCGACGACGACGACCCAACATCACACCTACAAAAAGGAATCCGACGAAGAAGAAG
AATCGTAAAATGTTGCAGCTGCATGGTCGCCCTTCTCGTAACATTAGCAGTACTGGTCGTCATCTTGACATTCACCGTGTTTAAAATCAAGGATCCAATAATCCAAATGA
ATGGAATTTCGCTCGCCGGGGCCATCCCGGAGCCAGGATCCAAATCGACCGTGTCGCTGACGGCGGACGTGTCGGTGAAAAACCCCAACACGGCGACGTTCAAGTACAGC
AACACGACGACGACTTTGTACATCGGGGAGACGGTGGTCGGGGAGGCGAGAGGACCGCCGGGGCAGGCCAGGCCGCATCGGACGGCACGGATGAACATCACCGTCAACAT
CATTGCCGACCGGCTGCTGTCGAACATCAACGTCAGCGCCGGGACGCTGCAGTTGAGAAGCTTTTCGAGGATTCCGGGGAGGGTGAAGATGTTGCATTTTATAAGGAAAC
ATATTGTTGTGAAGATGAACTGTACGGTGGTTATCAATGTGCTGAGCCGATCGATTGAGGATCAGAAATGCATGAAGAGGGTGAAGCTA
mRNA sequence
ATGGTGGAAACCGAGCAAGCTCGACCGTTAGCCTCGACCTCCAACGGTCGGAGTAGCGACGACGACGACCCAACATCACACCTACAAAAAGGAATCCGACGAAGAAGAAG
AATCGTAAAATGTTGCAGCTGCATGGTCGCCCTTCTCGTAACATTAGCAGTACTGGTCGTCATCTTGACATTCACCGTGTTTAAAATCAAGGATCCAATAATCCAAATGA
ATGGAATTTCGCTCGCCGGGGCCATCCCGGAGCCAGGATCCAAATCGACCGTGTCGCTGACGGCGGACGTGTCGGTGAAAAACCCCAACACGGCGACGTTCAAGTACAGC
AACACGACGACGACTTTGTACATCGGGGAGACGGTGGTCGGGGAGGCGAGAGGACCGCCGGGGCAGGCCAGGCCGCATCGGACGGCACGGATGAACATCACCGTCAACAT
CATTGCCGACCGGCTGCTGTCGAACATCAACGTCAGCGCCGGGACGCTGCAGTTGAGAAGCTTTTCGAGGATTCCGGGGAGGGTGAAGATGTTGCATTTTATAAGGAAAC
ATATTGTTGTGAAGATGAACTGTACGGTGGTTATCAATGTGCTGAGCCGATCGATTGAGGATCAGAAATGCATGAAGAGGGTGAAGCTA
Protein sequence
MVETEQARPLASTSNGRSSDDDDPTSHLQKGIRRRRRIVKCCSCMVALLVTLAVLVVILTFTVFKIKDPIIQMNGISLAGAIPEPGSKSTVSLTADVSVKNPNTATFKYS
NTTTTLYIGETVVGEARGPPGQARPHRTARMNITVNIIADRLLSNINVSAGTLQLRSFSRIPGRVKMLHFIRKHIVVKMNCTVVINVLSRSIEDQKCMKRVKL
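
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code and an unambiguous A/C/G/T sequence. The helper `translate` and the table construction are illustrative only and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon (frame 0, no ambiguity codes)."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequences
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First ten codons of the CDS sequence listed in this record.
first_codons = "ATGGTGGAAACCGAGCAAGCTCGACCGTTA"
print(translate(first_codons))  # MVETEQARPL
```

The output matches the first ten residues of the protein sequence above. Passing the full CDS, with its line breaks, to the same function would allow the whole 213-residue protein to be compared in the same way.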