CuGenDBv2

MS007557 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS007557
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: LEA_2 domain-containing protein
Genome location: scaffold401:144229..144924
RNA-Seq Expression: MS007557
Synteny: MS007557
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup
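
As a quick sanity check on the genome location field, the following minimal sketch parses the location string and reports the length of the span. The helper name `parse_location` is hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


scaffold, start, end = parse_location("scaffold401:144229..144924")
# Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
span = end - start + 1
print(scaffold, span)
```

Under that assumption the span is 696 bp, which equals the length of the CDS sequence listed for this gene.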


Homology
GenBank top hits (e value, % identity, alignment)
KAG6607382.1 hypothetical protein SDJN03_00724, partial [Cucurbita argyrosperma subsp. sororia] (e value: 3.1e-55; % identity: 51.77)
Query:  DKGERRVYFSESLPTHRATSDSGTK-CRRRLFAYCGRICIGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLDARVDISM
        DKG RRV FS+SLP HR+TS   TK    RLFA+C  IC+  FGI + LLI+ VIF+SFLQSGLPEI++K L LSK +I +STNQ  N AVL+ +V +++
Subjt:  DKGERRVYFSESLPTHRATSDSGTK-CRRRLFAYCGRICIGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLDARVDISM

Query:  TVRNKNDKIELSYGDIVVNVASDDVKLGKSVIGGFSHGPGNTTYLNVTTNVVGDGVDRENALEIQEEKKRVEMVAQVRMEATIGFHAGIFSIEKVPIHVR
         ++NKN+K+ELSY D+ + + S++++LG++VI  FS  PGNTT LNVT NV  D +DR++   +++++K+ ++V ++ M  ++GFH GIF + KVPIHV 
Subjt:  TVRNKNDKIELSYGDIVVNVASDDVKLGKSVIGGFSHGPGNTTYLNVTTNVVGDGVDRENALEIQEEKKRVEMVAQVRMEATIGFHAGIFSIEKVPIHVR

Query:  CDDVQQFLLVNRIKEASCNIRMFPFR
        C + QQ+LL+ R+KE  C+I MFP R
Subjt:  CDDVQQFLLVNRIKEASCNIRMFPFR

XP_008457557.1 PREDICTED: uncharacterized protein LOC103497223 [Cucumis melo] (e value: 2.1e-67; % identity: 59.57)
Query:  NEDKGERRVYFSESLPTHRA-----TSDSGTKCRRRLFAYCGRICIGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLDA
        +++KG RRV FS+SLP HRA      S+   KC  RLFA C  IC+G FGI++A+LI+ VIF+SFLQSGLPEI+++ L LS FEI +STNQN NNA+L+A
Subjt:  NEDKGERRVYFSESLPTHRA-----TSDSGTKCRRRLFAYCGRICIGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLDA

Query:  RVDISMTVRNKNDKIELSYGDIVVNVASDDVKLGKSVIGGFSHGPGNTTYLNVTTNVVGDGVDRENALEIQEEKKRVEMVAQVRMEATIGFHAGIFSIEK
        ++D+S+ +RNKN+KIELSY  IVVN+ S+DVKLG+SVI  FSH PGNTTYLNVT NV     D++N  ++++++K+V+M  QV+MEA +GFH GIF+++ 
Subjt:  RVDISMTVRNKNDKIELSYGDIVVNVASDDVKLGKSVIGGFSHGPGNTTYLNVTTNVVGDGVDRENALEIQEEKKRVEMVAQVRMEATIGFHAGIFSIEK

Query:  VPIHVRCDDVQQFLLVNRIKEASCNIRMFP
        VPIHV C D QQ LLV RI E  CNIRMFP
Subjt:  VPIHVRCDDVQQFLLVNRIKEASCNIRMFP

XP_022158431.1 uncharacterized protein LOC111024923 [Momordica charantia] (e value: 7.8e-123; % identity: 99.57)
Query:  MNNNYNEDKGERRVYFSESLPTHRATSDSGTKCRRRLFAYCGRICIGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLDA
        MNNNYNEDKGERRVYFSESLPTHRATSDSGTKCRRRLFAYCGRICIGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLDA
Subjt:  MNNNYNEDKGERRVYFSESLPTHRATSDSGTKCRRRLFAYCGRICIGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLDA

Query:  RVDISMTVRNKNDKIELSYGDIVVNVASDDVKLGKSVIGGFSHGPGNTTYLNVTTNVVGDGVDRENALEIQEEKKRVEMVAQVRMEATIGFHAGIFSIEK
        RVDISMTVRNKNDKIELSYGDIVVNVASDDVKLGKSVIGGFSHGPGNTTYLNVTTNVVGDGVDRENALEIQEEKKRVEMVAQVRMEA IGFHAGIFSIEK
Subjt:  RVDISMTVRNKNDKIELSYGDIVVNVASDDVKLGKSVIGGFSHGPGNTTYLNVTTNVVGDGVDRENALEIQEEKKRVEMVAQVRMEATIGFHAGIFSIEK

Query:  VPIHVRCDDVQQFLLVNRIKEASCNIRMFPFR
        VPIHVRCDDVQQFLLVNRIKEASCNIRMFPFR
Subjt:  VPIHVRCDDVQQFLLVNRIKEASCNIRMFPFR

XP_022949169.1 uncharacterized protein LOC111452600 [Cucurbita moschata] (e value: 5.9e-54; % identity: 50.88)
Query:  DKGERRVYFSESLPTHRATSDSGTK-CRRRLFAYCGRICIGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLDARVDISM
        DKG RRV FS+SLP HR+TS   TK     L A+C  IC+  FGI + LLI+ VIF+SFLQSGLPEI++K L LSK +I +STNQ  N AVL+ +V +++
Subjt:  DKGERRVYFSESLPTHRATSDSGTK-CRRRLFAYCGRICIGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLDARVDISM

Query:  TVRNKNDKIELSYGDIVVNVASDDVKLGKSVIGGFSHGPGNTTYLNVTTNVVGDGVDRENALEIQEEKKRVEMVAQVRMEATIGFHAGIFSIEKVPIHVR
         ++NKN+K+ELSY D+ + + S++++LG++VI  FS  PGNTT LNVT NV  D +DR++   +++++K+ ++V ++ M  ++GFH GIF + KVPIHV 
Subjt:  TVRNKNDKIELSYGDIVVNVASDDVKLGKSVIGGFSHGPGNTTYLNVTTNVVGDGVDRENALEIQEEKKRVEMVAQVRMEATIGFHAGIFSIEKVPIHVR

Query:  CDDVQQFLLVNRIKEASCNIRMFPFR
        C + QQ+LL+ R+KE  C+I MFP R
Subjt:  CDDVQQFLLVNRIKEASCNIRMFPFR

XP_038895624.1 uncharacterized protein LOC120083816 [Benincasa hispida] (e value: 2.2e-69; % identity: 61.74)
Query:  NEDKGERRVYFSESLPTHRATSDSG-----TKCRRRLFAYCGRICIGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLDA
        ++++G RRV FSESLP HRA S  G     +KC  RLFA C  IC+  FGI+LA+LI+ VIFMSFLQSGLP+I++K L LSKFE ++STNQ  NN +L+A
Subjt:  NEDKGERRVYFSESLPTHRATSDSG-----TKCRRRLFAYCGRICIGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLDA

Query:  RVDISMTVRNKNDKIELSYGDIVVNVASDDVKLGKSVIGGFSHGPGNTTYLNVTTNVVGDGVDRENALEIQEEKKRVEMVAQVRMEATIGFHAGIFSIEK
        +VD+S+ +RNKNDKIELSY +IVVN+ASDDVKLG+SVI GF+H PGNTTY NVT NVVG   D++N  ++++++KRV+M  QV ME+T+GFH GIF++  
Subjt:  RVDISMTVRNKNDKIELSYGDIVVNVASDDVKLGKSVIGGFSHGPGNTTYLNVTTNVVGDGVDRENALEIQEEKKRVEMVAQVRMEATIGFHAGIFSIEK

Query:  VPIHVRCDDVQQFLLVNRIKEASCNIRMFP
        VPIHV C D +QFLL+ RI E  CNIRMFP
Subjt:  VPIHVRCDDVQQFLLVNRIKEASCNIRMFP

TrEMBL top hits (e value, % identity, alignment)
A0A1S3C5S1 uncharacterized protein LOC103497223 (e value: 1.0e-67; % identity: 59.57)
Query:  NEDKGERRVYFSESLPTHRA-----TSDSGTKCRRRLFAYCGRICIGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLDA
        +++KG RRV FS+SLP HRA      S+   KC  RLFA C  IC+G FGI++A+LI+ VIF+SFLQSGLPEI+++ L LS FEI +STNQN NNA+L+A
Subjt:  NEDKGERRVYFSESLPTHRA-----TSDSGTKCRRRLFAYCGRICIGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLDA

Query:  RVDISMTVRNKNDKIELSYGDIVVNVASDDVKLGKSVIGGFSHGPGNTTYLNVTTNVVGDGVDRENALEIQEEKKRVEMVAQVRMEATIGFHAGIFSIEK
        ++D+S+ +RNKN+KIELSY  IVVN+ S+DVKLG+SVI  FSH PGNTTYLNVT NV     D++N  ++++++K+V+M  QV+MEA +GFH GIF+++ 
Subjt:  RVDISMTVRNKNDKIELSYGDIVVNVASDDVKLGKSVIGGFSHGPGNTTYLNVTTNVVGDGVDRENALEIQEEKKRVEMVAQVRMEATIGFHAGIFSIEK

Query:  VPIHVRCDDVQQFLLVNRIKEASCNIRMFP
        VPIHV C D QQ LLV RI E  CNIRMFP
Subjt:  VPIHVRCDDVQQFLLVNRIKEASCNIRMFP

A0A5A7V2C7 Putative transmembrane protein (e value: 1.0e-67; % identity: 59.57)
Query:  NEDKGERRVYFSESLPTHRA-----TSDSGTKCRRRLFAYCGRICIGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLDA
        +++KG RRV FS+SLP HRA      S+   KC  RLFA C  IC+G FGI++A+LI+ VIF+SFLQSGLPEI+++ L LS FEI +STNQN NNA+L+A
Subjt:  NEDKGERRVYFSESLPTHRA-----TSDSGTKCRRRLFAYCGRICIGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLDA

Query:  RVDISMTVRNKNDKIELSYGDIVVNVASDDVKLGKSVIGGFSHGPGNTTYLNVTTNVVGDGVDRENALEIQEEKKRVEMVAQVRMEATIGFHAGIFSIEK
        ++D+S+ +RNKN+KIELSY  IVVN+ S+DVKLG+SVI  FSH PGNTTYLNVT NV     D++N  ++++++K+V+M  QV+MEA +GFH GIF+++ 
Subjt:  RVDISMTVRNKNDKIELSYGDIVVNVASDDVKLGKSVIGGFSHGPGNTTYLNVTTNVVGDGVDRENALEIQEEKKRVEMVAQVRMEATIGFHAGIFSIEK

Query:  VPIHVRCDDVQQFLLVNRIKEASCNIRMFP
        VPIHV C D QQ LLV RI E  CNIRMFP
Subjt:  VPIHVRCDDVQQFLLVNRIKEASCNIRMFP

A0A6J1E0W1 uncharacterized protein LOC111024923 (e value: 3.8e-123; % identity: 99.57)
Query:  MNNNYNEDKGERRVYFSESLPTHRATSDSGTKCRRRLFAYCGRICIGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLDA
        MNNNYNEDKGERRVYFSESLPTHRATSDSGTKCRRRLFAYCGRICIGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLDA
Subjt:  MNNNYNEDKGERRVYFSESLPTHRATSDSGTKCRRRLFAYCGRICIGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLDA

Query:  RVDISMTVRNKNDKIELSYGDIVVNVASDDVKLGKSVIGGFSHGPGNTTYLNVTTNVVGDGVDRENALEIQEEKKRVEMVAQVRMEATIGFHAGIFSIEK
        RVDISMTVRNKNDKIELSYGDIVVNVASDDVKLGKSVIGGFSHGPGNTTYLNVTTNVVGDGVDRENALEIQEEKKRVEMVAQVRMEA IGFHAGIFSIEK
Subjt:  RVDISMTVRNKNDKIELSYGDIVVNVASDDVKLGKSVIGGFSHGPGNTTYLNVTTNVVGDGVDRENALEIQEEKKRVEMVAQVRMEATIGFHAGIFSIEK

Query:  VPIHVRCDDVQQFLLVNRIKEASCNIRMFPFR
        VPIHVRCDDVQQFLLVNRIKEASCNIRMFPFR
Subjt:  VPIHVRCDDVQQFLLVNRIKEASCNIRMFPFR

A0A6J1GC15 uncharacterized protein LOC111452600 (e value: 2.9e-54; % identity: 50.88)
Query:  DKGERRVYFSESLPTHRATSDSGTK-CRRRLFAYCGRICIGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLDARVDISM
        DKG RRV FS+SLP HR+TS   TK     L A+C  IC+  FGI + LLI+ VIF+SFLQSGLPEI++K L LSK +I +STNQ  N AVL+ +V +++
Subjt:  DKGERRVYFSESLPTHRATSDSGTK-CRRRLFAYCGRICIGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLDARVDISM

Query:  TVRNKNDKIELSYGDIVVNVASDDVKLGKSVIGGFSHGPGNTTYLNVTTNVVGDGVDRENALEIQEEKKRVEMVAQVRMEATIGFHAGIFSIEKVPIHVR
         ++NKN+K+ELSY D+ + + S++++LG++VI  FS  PGNTT LNVT NV  D +DR++   +++++K+ ++V ++ M  ++GFH GIF + KVPIHV 
Subjt:  TVRNKNDKIELSYGDIVVNVASDDVKLGKSVIGGFSHGPGNTTYLNVTTNVVGDGVDRENALEIQEEKKRVEMVAQVRMEATIGFHAGIFSIEKVPIHVR

Query:  CDDVQQFLLVNRIKEASCNIRMFPFR
        C + QQ+LL+ R+KE  C+I MFP R
Subjt:  CDDVQQFLLVNRIKEASCNIRMFPFR

A0A6J1K8Y2 uncharacterized protein LOC111493353 (e value: 2.4e-53; % identity: 50.44)
Query:  DKGERRVYFSESLPTHRATSDSGTK-CRRRLFAYCGRICIGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLDARVDISM
        DKG RRV FS+SLP HR+ S   TK     LFA+C  IC+  FGI + LLI+ VIF+SFLQS LPEI++K L LSK +I +STNQ  N AVL+ +V +++
Subjt:  DKGERRVYFSESLPTHRATSDSGTK-CRRRLFAYCGRICIGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLDARVDISM

Query:  TVRNKNDKIELSYGDIVVNVASDDVKLGKSVIGGFSHGPGNTTYLNVTTNVVGDGVDRENALEIQEEKKRVEMVAQVRMEATIGFHAGIFSIEKVPIHVR
         +RNKN+K+ELSY D+ + + S++++LG++VI  FS  PGNTT LNVT  V  D +DR++   +++++K+ ++V ++ M  ++GFH GIF + KVPIHV 
Subjt:  TVRNKNDKIELSYGDIVVNVASDDVKLGKSVIGGFSHGPGNTTYLNVTTNVVGDGVDRENALEIQEEKKRVEMVAQVRMEATIGFHAGIFSIEKVPIHVR

Query:  CDDVQQFLLVNRIKEASCNIRMFPFR
        C + QQ+LL+ R+KE  C+I MFP R
Subjt:  CDDVQQFLLVNRIKEASCNIRMFPFR

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G30505.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 6.4e-14; % identity: 24.35)
Query:  NNYNEDKGERRVYFSESLPTHRATSDS---GTKCRRRLFAYCGRICIGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLD
        + + E++   R   S S   H    D     + C R+    C   C+    +L+ +L++ +   S ++S LP++ +  L+ S+ +I  S+     + +++
Subjt:  NNYNEDKGERRVYFSESLPTHRATSDS---GTKCRRRLFAYCGRICIGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLD

Query:  ARVDISMTVRNKNDKIELSYGDIVVNVASDDVKLGKSVIGGFSHGPGNTTYLNVTTNVVGDGVDRENALEIQEEKKRVEMVAQVRMEATIGFHAGIFSIE
        A ++  + + N NDK  L Y  +  +++S+++ LGK  + GF   PGN T L + T +    V   +A  +  ++K +E +  V +   +      F + 
Subjt:  ARVDISMTVRNKNDKIELSYGDIVVNVASDDVKLGKSVIGGFSHGPGNTTYLNVTTNVVGDGVDRENALEIQEEKKRVEMVAQVRMEATIGFHAGIFSIE

Query:  KVPIHVRCDDVQQFLLVNRIKEASCNIRMF
         +PI + C+ V+Q  ++N +K A C++R+F
Subjt:  KVPIHVRCDDVQQFLLVNRIKEASCNIRMF

AT2G46300.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 1.2e-09; % identity: 26.44)
Query:  CGRICIGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLDARVDISMTVRNKNDKIELSYG----DIVVNVASDDVKLGKS
        C  ICI     +  LL+   +F  +    LP  S+ + +L  F++ D  +    +A   ARV+    ++N N K+   YG    D+ V   +D+  +G++
Subjt:  CGRICIGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLDARVDISMTVRNKNDKIELSYG----DIVVNVASDDVKLGKS

Query:  VIGGFSHGPGNTTYLNVTTNVVGDGVDRENALEIQEEKKRVEMVAQVRMEATIGFHAGIFSIEKVPIHVRCDDV
         + GF  GP N+T + V T V    V+R  A  +  + +  ++V  V  +  +G   G   I  + +++RC  V
Subjt:  VIGGFSHGPGNTTYLNVTTNVVGDGVDRENALEIQEEKKRVEMVAQVRMEATIGFHAGIFSIEKVPIHVRCDDV
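
The % identity values reported for the hits can be related to the middle row of each alignment. The sketch below assumes the usual BLAST convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and assumes % identity is identical positions divided by alignment length; the function name `percent_identity` is hypothetical. It uses the middle rows of the Momordica charantia hit (XP_022158431.1 / A0A6J1E0W1) shown above.

```python
def percent_identity(midline):
    """Return % identity from a BLAST-style alignment midline.

    Letters count as identical positions; '+' and spaces do not.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(midline), 2)


# Middle rows of the XP_022158431.1 alignment, joined in order.
MIDLINE = (
    "MNNNYNEDKGERRVYFSESLPTHRATSDSGTKCRRRLFAYCGRICIGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLDA"
    "RVDISMTVRNKNDKIELSYGDIVVNVASDDVKLGKSVIGGFSHGPGNTTYLNVTTNVVGDGVDRENALEIQEEKKRVEMVAQVRMEA IGFHAGIFSIEK"
    "VPIHVRCDDVQQFLLVNRIKEASCNIRMFPFR"
)

print(percent_identity(MIDLINE))
```

With a single mismatch over the aligned length, this reproduces the 99.57 reported for that hit. Whether the page computes the other hits' values over the same alignment length is not stated, so the function may not reproduce every listed percentage.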


Sequences
CDS sequence
ATGAATAATAATTATAATGAAGACAAAGGAGAGCGTCGCGTTTACTTCTCCGAATCCCTTCCTACACATCGCGCAACATCCGACTCCGGCACAAAATGTCGTCGTCGTTT
ATTTGCTTATTGCGGAAGGATATGTATTGGGGCTTTCGGAATTCTTCTCGCCCTACTAATCATTGCCGTAATATTCATGTCCTTCCTTCAGTCGGGCCTGCCAGAAATCA
GCATAAAAACGTTGCAACTCTCCAAATTTGAGATCCACGACTCCACAAATCAGAATCATAACAATGCCGTCCTAGACGCACGAGTAGATATATCAATGACGGTGAGGAAC
AAGAATGACAAAATAGAGTTGAGTTACGGCGATATTGTGGTGAATGTGGCGTCGGATGATGTGAAATTGGGGAAGAGCGTGATTGGCGGTTTCTCTCACGGCCCTGGAAA
CACCACGTACTTGAACGTAACCACCAATGTGGTGGGAGATGGTGTGGATAGAGAGAACGCGTTGGAAATACAAGAGGAGAAGAAAAGAGTGGAAATGGTGGCGCAGGTGA
GAATGGAAGCCACAATTGGTTTCCACGCTGGGATATTCAGCATCGAGAAGGTGCCAATCCATGTACGGTGCGACGATGTTCAACAATTTCTTCTTGTAAACCGCATAAAG
GAGGCTTCCTGTAACATTAGAATGTTTCCTTTCAGA
mRNA sequence
ATGAATAATAATTATAATGAAGACAAAGGAGAGCGTCGCGTTTACTTCTCCGAATCCCTTCCTACACATCGCGCAACATCCGACTCCGGCACAAAATGTCGTCGTCGTTT
ATTTGCTTATTGCGGAAGGATATGTATTGGGGCTTTCGGAATTCTTCTCGCCCTACTAATCATTGCCGTAATATTCATGTCCTTCCTTCAGTCGGGCCTGCCAGAAATCA
GCATAAAAACGTTGCAACTCTCCAAATTTGAGATCCACGACTCCACAAATCAGAATCATAACAATGCCGTCCTAGACGCACGAGTAGATATATCAATGACGGTGAGGAAC
AAGAATGACAAAATAGAGTTGAGTTACGGCGATATTGTGGTGAATGTGGCGTCGGATGATGTGAAATTGGGGAAGAGCGTGATTGGCGGTTTCTCTCACGGCCCTGGAAA
CACCACGTACTTGAACGTAACCACCAATGTGGTGGGAGATGGTGTGGATAGAGAGAACGCGTTGGAAATACAAGAGGAGAAGAAAAGAGTGGAAATGGTGGCGCAGGTGA
GAATGGAAGCCACAATTGGTTTCCACGCTGGGATATTCAGCATCGAGAAGGTGCCAATCCATGTACGGTGCGACGATGTTCAACAATTTCTTCTTGTAAACCGCATAAAG
GAGGCTTCCTGTAACATTAGAATGTTTCCTTTCAGA
Protein sequence
MNNNYNEDKGERRVYFSESLPTHRATSDSGTKCRRRLFAYCGRICIGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLDARVDISMTVRN
KNDKIELSYGDIVVNVASDDVKLGKSVIGGFSHGPGNTTYLNVTTNVVGDGVDRENALEIQEEKKRVEMVAQVRMEATIGFHAGIFSIEKVPIHVRCDDVQQFLLVNRIK
EASCNIRMFPFR
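
The CDS and protein sequences above can be checked against each other by translating the CDS. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) and a CDS that starts in frame; the names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate an in-frame DNA coding sequence into one-letter amino acids."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# CDS of MS007557, copied from the record (line breaks removed).
CDS = (
    "ATGAATAATAATTATAATGAAGACAAAGGAGAGCGTCGCGTTTACTTCTCCGAATCCCTTCCTACACATCGCGCAACATCCGACTCCGGCACAAAATGTCGTCGTCGTTT"
    "ATTTGCTTATTGCGGAAGGATATGTATTGGGGCTTTCGGAATTCTTCTCGCCCTACTAATCATTGCCGTAATATTCATGTCCTTCCTTCAGTCGGGCCTGCCAGAAATCA"
    "GCATAAAAACGTTGCAACTCTCCAAATTTGAGATCCACGACTCCACAAATCAGAATCATAACAATGCCGTCCTAGACGCACGAGTAGATATATCAATGACGGTGAGGAAC"
    "AAGAATGACAAAATAGAGTTGAGTTACGGCGATATTGTGGTGAATGTGGCGTCGGATGATGTGAAATTGGGGAAGAGCGTGATTGGCGGTTTCTCTCACGGCCCTGGAAA"
    "CACCACGTACTTGAACGTAACCACCAATGTGGTGGGAGATGGTGTGGATAGAGAGAACGCGTTGGAAATACAAGAGGAGAAGAAAAGAGTGGAAATGGTGGCGCAGGTGA"
    "GAATGGAAGCCACAATTGGTTTCCACGCTGGGATATTCAGCATCGAGAAGGTGCCAATCCATGTACGGTGCGACGATGTTCAACAATTTCTTCTTGTAAACCGCATAAAG"
    "GAGGCTTCCTGTAACATTAGAATGTTTCCTTTCAGA"
)

# Protein sequence of MS007557, copied from the record (line breaks removed).
PROTEIN = (
    "MNNNYNEDKGERRVYFSESLPTHRATSDSGTKCRRRLFAYCGRICIGAFGILLALLIIAVIFMSFLQSGLPEISIKTLQLSKFEIHDSTNQNHNNAVLDARVDISMTVRN"
    "KNDKIELSYGDIVVNVASDDVKLGKSVIGGFSHGPGNTTYLNVTTNVVGDGVDRENALEIQEEKKRVEMVAQVRMEATIGFHAGIFSIEKVPIHVRCDDVQQFLLVNRIK"
    "EASCNIRMFPFR"
)

print(len(CDS), len(PROTEIN), translate(CDS) == PROTEIN)
```

The CDS as listed ends on an arginine codon (AGA) rather than a stop codon, so the translation contains no trailing `*`.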