; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g04800 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g04800
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionlate embryogenesis abundant protein M17-like isoform X2
Genome locationchr2:3543589..3549233
RNA-Seq ExpressionMoc02g04800
SyntenyMoc02g04800
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAN52106.1 variant-specific surface protein AS3 [Giardia intestinalis]7.3e-2922.68Show/hide
Query:  IASAVSQGTQEQHNTIEVDNENSEEKLRIHCPYGCCGFDGG---HSCIRCCGLHEAKNENSIEELKIHCPYGCCGFDASHSCTRCCGLHEAKNENPIEEL
        ++   +Q T++   + +VDN        +HC     GFD       C+R C +   +  N+     +HC     GFD +   T+C    +  N       
Subjt:  IASAVSQGTQEQHNTIEVDNENSEEKLRIHCPYGCCGFDGG---HSCIRCCGLHEAKNENSIEELKIHCPYGCCGFDASHSCTRCCGLHEAKNENPIEEL

Query:  KIHCPYGCCGFDASHSCTRCCGLRKVEKENPIEELKIHCPYGCCGFDASHSCTRCCGLHEIERENPVEELKIHCPYGCCGFDASHSCTRCCGLHEVEKEN
         +HC     GFD +   T+C    +V+         +HC     GFD +   T+C    +++         +HC     GFD +   T+C    +V+   
Subjt:  KIHCPYGCCGFDASHSCTRCCGLRKVEKENPIEELKIHCPYGCCGFDASHSCTRCCGLHEIERENPVEELKIHCPYGCCGFDASHSCTRCCGLHEVEKEN

Query:  LVERLKIHCPYGCCSFDASHSCTRCCGLRKVKKENSVEELKIHCPYGCCGFDASHSCTRCCGLHEVEKENPIEELKIHCPYGCCGFDASHSCTRCCGLHK
              +HC      FD +   T+C    +V          +HC     GFD +   T+C    +V+         +HC     GFD +   T+C    +
Subjt:  LVERLKIHCPYGCCSFDASHSCTRCCGLRKVKKENSVEELKIHCPYGCCGFDASHSCTRCCGLHEVEKENPIEELKIHCPYGCCGFDASHSCTRCCGLHK

Query:  VEKENPVEELKIHCPYGCCGFDVSHSCTRCCGLHEVEKENPVERLKIHCPYGCCSFD---DHYNCIRCCGLSAEEHAIIQNGFNLISHQNEANVEKGQRE
        V+        KIHC     GFD +   T+C    +V+         +HC      FD   D   C+R C +   E     N  N+   Q  A  +    +
Subjt:  VEKENPVEELKIHCPYGCCGFDVSHSCTRCCGLHEVEKENPVERLKIHCPYGCCSFD---DHYNCIRCCGLSAEEHAIIQNGFNLISHQNEANVEKGQRE

Query:  RNAVKGNEVE
           V+  +V+
Subjt:  RNAVKGNEVE

XP_022140307.1 uncharacterized protein LOC111011010 isoform X1 [Momordica charantia]1.4e-2443.13Show/hide
Query:  MTSKAVVFSCLLFFVSTVTIASAVSQGTQEQHNTIEVDNENSEEKLRIHCPYG-CC---GFDGGHSCIRCCGLHE----------AKNENSIEELKIHCP
        M S AVVF CLL F+STV IASAVS+GTQEQH  +EV+N NS E+ +  CP G CC    F G   C+RCC   E           +NENS+EE K  CP
Subjt:  MTSKAVVFSCLLFFVSTVTIASAVSQGTQEQHNTIEVDNENSEEKLRIHCPYG-CC---GFDGGHSCIRCCGLHE----------AKNENSIEELKIHCP

Query:  YG-CCGFDA-SHSCTRCCGL----------HEAKNENPIEELKIHCPYG-CCGFDA-SHSCTRCCGLRKVE---------KENPIEELKIHCPYG-CCGF
         G CC  +  +  C RCC               +NEN +EE K  CP G CC  +  +  C RCC  R  E          EN IEE K  CP G CC  
Subjt:  YG-CCGFDA-SHSCTRCCGL----------HEAKNENPIEELKIHCPYG-CCGFDA-SHSCTRCCGLRKVE---------KENPIEELKIHCPYG-CCGF

Query:  DA-SHSCTRCC
        +  +  C RCC
Subjt:  DA-SHSCTRCC

XP_022140308.1 uncharacterized protein LOC111011010 isoform X2 [Momordica charantia]4.5e-1845.4Show/hide
Query:  MTSKAVVFSCLLFFVSTVTIASAVSQGTQEQHNTIEVDNENSEEKLRIHCPYG-CC---GFDGGHSCIRCCGLHE----------AKNENSIEELKIHCP
        M S AVVF CLL F+STV IASAVS+GTQEQH  +EV+N NS E+ +  CP G CC    F G   C+RCC   E           +NENS+EE K  CP
Subjt:  MTSKAVVFSCLLFFVSTVTIASAVSQGTQEQHNTIEVDNENSEEKLRIHCPYG-CC---GFDGGHSCIRCCGLHE----------AKNENSIEELKIHCP

Query:  YG-CCGFDA-SHSCTRCC--------GLHEAKNENPIEELKIHCPYG-CCGFDA-SHSCTRCC
         G CC  +  +  C RCC         +     EN IEE K  CP G CC  +  +  C RCC
Subjt:  YG-CCGFDA-SHSCTRCC--------GLHEAKNENPIEELKIHCPYG-CCGFDA-SHSCTRCC

XP_022140309.1 uncharacterized protein LOC111011011 [Momordica charantia]2.0e-2638.73Show/hide
Query:  MTSKAVVFSCLLFFVSTVTIASAVSQGTQEQHNTIEVDNENSEEKLRIHCPYGCCGFDG-GHSCIRCCGLHEAKNENSIEELKIHCPYGCCGFDASHSCT
        MTSK++VF+CLL FVS +   S V++ TQ+QH  +EV+N N  E+ +  C +GCCGF      CI CC    AK + +IE L +                
Subjt:  MTSKAVVFSCLLFFVSTVTIASAVSQGTQEQHNTIEVDNENSEEKLRIHCPYGCCGFDG-GHSCIRCCGLHEAKNENSIEELKIHCPYGCCGFDASHSCT

Query:  RCCGLHEAKNENPIEELKIHCPYGCCGF-DASHSCTRCCGLR-----------KVEKENPIEELKIHCPYGCCGF-DASHSCTRCCGLH------EIERE
                +N NPIEE K  C +GCCGF   S  C  CC  R            VE  NP+EE K  C +GCCGF   S  C RCC          I+ E
Subjt:  RCCGLHEAKNENPIEELKIHCPYGCCGF-DASHSCTRCCGLR-----------KVEKENPIEELKIHCPYGCCGF-DASHSCTRCCGLH------EIERE

Query:  NPVE
        N VE
Subjt:  NPVE

XP_022140624.1 late embryogenesis abundant protein M17-like [Momordica charantia]1.6e-2044.85Show/hide
Query:  MTSKAVVFSCLLFFVSTVTIASAVSQGTQEQHNTIEVDNENSEEKLRIHCPYGCCGFDG-GHSCIRCCGLHEA----------KNENSIEELKIHCPYGC
        MTSK ++F+CLL FVS + IAS V++GTQEQH T+EV+N NS E+ +  C +GCCGF      CI CC    A          +NEN +EE K  C YGC
Subjt:  MTSKAVVFSCLLFFVSTVTIASAVSQGTQEQHNTIEVDNENSEEKLRIHCPYGCCGFDG-GHSCIRCCGLHEA----------KNENSIEELKIHCPYGC

Query:  CGFDASHSCTRCC-------GLHEAKNENPIEELKI
        C FD    C RCC          + +NEN +EE ++
Subjt:  CGFDASHSCTRCC-------GLHEAKNENPIEELKI

TrEMBL top hitse value%identityAlignment
A0A6J1CFC0 uncharacterized protein LOC1110110119.7e-2738.73Show/hide
Query:  MTSKAVVFSCLLFFVSTVTIASAVSQGTQEQHNTIEVDNENSEEKLRIHCPYGCCGFDG-GHSCIRCCGLHEAKNENSIEELKIHCPYGCCGFDASHSCT
        MTSK++VF+CLL FVS +   S V++ TQ+QH  +EV+N N  E+ +  C +GCCGF      CI CC    AK + +IE L +                
Subjt:  MTSKAVVFSCLLFFVSTVTIASAVSQGTQEQHNTIEVDNENSEEKLRIHCPYGCCGFDG-GHSCIRCCGLHEAKNENSIEELKIHCPYGCCGFDASHSCT

Query:  RCCGLHEAKNENPIEELKIHCPYGCCGF-DASHSCTRCCGLR-----------KVEKENPIEELKIHCPYGCCGF-DASHSCTRCCGLH------EIERE
                +N NPIEE K  C +GCCGF   S  C  CC  R            VE  NP+EE K  C +GCCGF   S  C RCC          I+ E
Subjt:  RCCGLHEAKNENPIEELKIHCPYGCCGF-DASHSCTRCCGLR-----------KVEKENPIEELKIHCPYGCCGF-DASHSCTRCCGLH------EIERE

Query:  NPVE
        N VE
Subjt:  NPVE

A0A6J1CG70 late embryogenesis abundant protein M17-like7.9e-2144.85Show/hide
Query:  MTSKAVVFSCLLFFVSTVTIASAVSQGTQEQHNTIEVDNENSEEKLRIHCPYGCCGFDG-GHSCIRCCGLHEA----------KNENSIEELKIHCPYGC
        MTSK ++F+CLL FVS + IAS V++GTQEQH T+EV+N NS E+ +  C +GCCGF      CI CC    A          +NEN +EE K  C YGC
Subjt:  MTSKAVVFSCLLFFVSTVTIASAVSQGTQEQHNTIEVDNENSEEKLRIHCPYGCCGFDG-GHSCIRCCGLHEA----------KNENSIEELKIHCPYGC

Query:  CGFDASHSCTRCC-------GLHEAKNENPIEELKI
        C FD    C RCC          + +NEN +EE ++
Subjt:  CGFDASHSCTRCC-------GLHEAKNENPIEELKI

A0A6J1CGG8 uncharacterized protein LOC111011010 isoform X16.9e-2543.13Show/hide
Query:  MTSKAVVFSCLLFFVSTVTIASAVSQGTQEQHNTIEVDNENSEEKLRIHCPYG-CC---GFDGGHSCIRCCGLHE----------AKNENSIEELKIHCP
        M S AVVF CLL F+STV IASAVS+GTQEQH  +EV+N NS E+ +  CP G CC    F G   C+RCC   E           +NENS+EE K  CP
Subjt:  MTSKAVVFSCLLFFVSTVTIASAVSQGTQEQHNTIEVDNENSEEKLRIHCPYG-CC---GFDGGHSCIRCCGLHE----------AKNENSIEELKIHCP

Query:  YG-CCGFDA-SHSCTRCCGL----------HEAKNENPIEELKIHCPYG-CCGFDA-SHSCTRCCGLRKVE---------KENPIEELKIHCPYG-CCGF
         G CC  +  +  C RCC               +NEN +EE K  CP G CC  +  +  C RCC  R  E          EN IEE K  CP G CC  
Subjt:  YG-CCGFDA-SHSCTRCCGL----------HEAKNENPIEELKIHCPYG-CCGFDA-SHSCTRCCGLRKVE---------KENPIEELKIHCPYG-CCGF

Query:  DA-SHSCTRCC
        +  +  C RCC
Subjt:  DA-SHSCTRCC

A0A6J1CHQ6 uncharacterized protein LOC111011010 isoform X22.2e-1845.4Show/hide
Query:  MTSKAVVFSCLLFFVSTVTIASAVSQGTQEQHNTIEVDNENSEEKLRIHCPYG-CC---GFDGGHSCIRCCGLHE----------AKNENSIEELKIHCP
        M S AVVF CLL F+STV IASAVS+GTQEQH  +EV+N NS E+ +  CP G CC    F G   C+RCC   E           +NENS+EE K  CP
Subjt:  MTSKAVVFSCLLFFVSTVTIASAVSQGTQEQHNTIEVDNENSEEKLRIHCPYG-CC---GFDGGHSCIRCCGLHE----------AKNENSIEELKIHCP

Query:  YG-CCGFDA-SHSCTRCC--------GLHEAKNENPIEELKIHCPYG-CCGFDA-SHSCTRCC
         G CC  +  +  C RCC         +     EN IEE K  CP G CC  +  +  C RCC
Subjt:  YG-CCGFDA-SHSCTRCC--------GLHEAKNENPIEELKIHCPYG-CCGFDA-SHSCTRCC

Q8I8V9 Variant-specific surface protein AS33.5e-2922.68Show/hide
Query:  IASAVSQGTQEQHNTIEVDNENSEEKLRIHCPYGCCGFDGG---HSCIRCCGLHEAKNENSIEELKIHCPYGCCGFDASHSCTRCCGLHEAKNENPIEEL
        ++   +Q T++   + +VDN        +HC     GFD       C+R C +   +  N+     +HC     GFD +   T+C    +  N       
Subjt:  IASAVSQGTQEQHNTIEVDNENSEEKLRIHCPYGCCGFDGG---HSCIRCCGLHEAKNENSIEELKIHCPYGCCGFDASHSCTRCCGLHEAKNENPIEEL

Query:  KIHCPYGCCGFDASHSCTRCCGLRKVEKENPIEELKIHCPYGCCGFDASHSCTRCCGLHEIERENPVEELKIHCPYGCCGFDASHSCTRCCGLHEVEKEN
         +HC     GFD +   T+C    +V+         +HC     GFD +   T+C    +++         +HC     GFD +   T+C    +V+   
Subjt:  KIHCPYGCCGFDASHSCTRCCGLRKVEKENPIEELKIHCPYGCCGFDASHSCTRCCGLHEIERENPVEELKIHCPYGCCGFDASHSCTRCCGLHEVEKEN

Query:  LVERLKIHCPYGCCSFDASHSCTRCCGLRKVKKENSVEELKIHCPYGCCGFDASHSCTRCCGLHEVEKENPIEELKIHCPYGCCGFDASHSCTRCCGLHK
              +HC      FD +   T+C    +V          +HC     GFD +   T+C    +V+         +HC     GFD +   T+C    +
Subjt:  LVERLKIHCPYGCCSFDASHSCTRCCGLRKVKKENSVEELKIHCPYGCCGFDASHSCTRCCGLHEVEKENPIEELKIHCPYGCCGFDASHSCTRCCGLHK

Query:  VEKENPVEELKIHCPYGCCGFDVSHSCTRCCGLHEVEKENPVERLKIHCPYGCCSFD---DHYNCIRCCGLSAEEHAIIQNGFNLISHQNEANVEKGQRE
        V+        KIHC     GFD +   T+C    +V+         +HC      FD   D   C+R C +   E     N  N+   Q  A  +    +
Subjt:  VEKENPVEELKIHCPYGCCGFDVSHSCTRCCGLHEVEKENPVERLKIHCPYGCCSFD---DHYNCIRCCGLSAEEHAIIQNGFNLISHQNEANVEKGQRE

Query:  RNAVKGNEVE
           V+  +V+
Subjt:  RNAVKGNEVE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGTCCAAAGCTGTTGTATTTTCGTGTTTGTTGTTTTTTGTCTCTACTGTAACAATTGCATCGGCAGTATCACAAGGAACTCAAGAACAGCACAACACAATTGAAGT
TGACAATGAAAATTCAGAAGAAAAGTTAAGGATTCATTGCCCCTATGGATGTTGCGGCTTTGACGGCGGTCATAGTTGCATAAGATGTTGTGGTCTTCATGAAGCTAAAA
ATGAAAATTCAATAGAAGAACTAAAGATTCACTGCCCCTACGGATGTTGCGGCTTTGATGCTAGTCATAGTTGCACTAGATGTTGTGGTCTTCACGAAGCCAAAAATGAA
AATCCAATAGAAGAACTAAAGATTCACTGTCCCTATGGGTGTTGCGGCTTCGATGCCAGTCATAGTTGCACTAGATGTTGTGGTCTTCGCAAGGTTGAAAAAGAAAATCC
AATAGAAGAACTAAAGATTCACTGCCCCTACGGATGCTGCGGTTTTGATGCCAGCCATAGTTGCACCAGATGCTGTGGCCTTCACGAGATTGAAAGAGAAAATCCAGTAG
AAGAACTAAAGATTCATTGCCCCTATGGATGCTGCGGTTTTGATGCCAGTCATAGTTGCACCAGATGTTGTGGTCTTCACGAAGTTGAAAAAGAAAATCTAGTAGAAAGA
TTAAAGATTCACTGCCCCTATGGGTGTTGCAGCTTTGATGCCAGTCATAGTTGCACTAGATGTTGTGGTCTTCGCAAGGTTAAAAAAGAAAATTCAGTAGAAGAACTAAA
GATTCACTGCCCCTACGGATGCTGCGGTTTTGATGCCAGTCATAGTTGCACTAGATGTTGTGGCCTTCACGAGGTTGAAAAAGAAAATCCAATAGAAGAACTAAAGATTC
ACTGCCCCTATGGATGTTGCGGTTTTGATGCCAGTCATAGTTGCACTAGATGTTGTGGCCTTCACAAGGTTGAAAAAGAAAATCCAGTAGAAGAACTAAAGATTCATTGC
CCCTATGGATGCTGCGGTTTTGATGTCAGTCATAGTTGCACCAGATGTTGTGGTCTTCACGAAGTTGAAAAAGAAAATCCAGTAGAAAGATTAAAGATTCACTGCCCCTA
TGGATGTTGTAGCTTTGATGACCATTACAATTGCATCAGATGCTGTGGTCTTTCTGCAGAAGAGCATGCCATAATCCAAAATGGTTTCAACTTGATCTCTCATCAAAATG
AGGCAAATGTGGAAAAAGGGCAACGGGAAAGGAATGCAGTGAAAGGGAATGAAGTGGAGACCAACGGAGTTGTCGGCAACAGAAAGGGAGCGGGTGAAGACGGAAAAAGA
GTCGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGACGTCCAAAGCTGTTGTATTTTCGTGTTTGTTGTTTTTTGTCTCTACTGTAACAATTGCATCGGCAGTATCACAAGGAACTCAAGAACAGCACAACACAATTGAAGT
TGACAATGAAAATTCAGAAGAAAAGTTAAGGATTCATTGCCCCTATGGATGTTGCGGCTTTGACGGCGGTCATAGTTGCATAAGATGTTGTGGTCTTCATGAAGCTAAAA
ATGAAAATTCAATAGAAGAACTAAAGATTCACTGCCCCTACGGATGTTGCGGCTTTGATGCTAGTCATAGTTGCACTAGATGTTGTGGTCTTCACGAAGCCAAAAATGAA
AATCCAATAGAAGAACTAAAGATTCACTGTCCCTATGGGTGTTGCGGCTTCGATGCCAGTCATAGTTGCACTAGATGTTGTGGTCTTCGCAAGGTTGAAAAAGAAAATCC
AATAGAAGAACTAAAGATTCACTGCCCCTACGGATGCTGCGGTTTTGATGCCAGCCATAGTTGCACCAGATGCTGTGGCCTTCACGAGATTGAAAGAGAAAATCCAGTAG
AAGAACTAAAGATTCATTGCCCCTATGGATGCTGCGGTTTTGATGCCAGTCATAGTTGCACCAGATGTTGTGGTCTTCACGAAGTTGAAAAAGAAAATCTAGTAGAAAGA
TTAAAGATTCACTGCCCCTATGGGTGTTGCAGCTTTGATGCCAGTCATAGTTGCACTAGATGTTGTGGTCTTCGCAAGGTTAAAAAAGAAAATTCAGTAGAAGAACTAAA
GATTCACTGCCCCTACGGATGCTGCGGTTTTGATGCCAGTCATAGTTGCACTAGATGTTGTGGCCTTCACGAGGTTGAAAAAGAAAATCCAATAGAAGAACTAAAGATTC
ACTGCCCCTATGGATGTTGCGGTTTTGATGCCAGTCATAGTTGCACTAGATGTTGTGGCCTTCACAAGGTTGAAAAAGAAAATCCAGTAGAAGAACTAAAGATTCATTGC
CCCTATGGATGCTGCGGTTTTGATGTCAGTCATAGTTGCACCAGATGTTGTGGTCTTCACGAAGTTGAAAAAGAAAATCCAGTAGAAAGATTAAAGATTCACTGCCCCTA
TGGATGTTGTAGCTTTGATGACCATTACAATTGCATCAGATGCTGTGGTCTTTCTGCAGAAGAGCATGCCATAATCCAAAATGGTTTCAACTTGATCTCTCATCAAAATG
AGGCAAATGTGGAAAAAGGGCAACGGGAAAGGAATGCAGTGAAAGGGAATGAAGTGGAGACCAACGGAGTTGTCGGCAACAGAAAGGGAGCGGGTGAAGACGGAAAAAGA
GTCGAATGA
Protein sequenceShow/hide protein sequence
MTSKAVVFSCLLFFVSTVTIASAVSQGTQEQHNTIEVDNENSEEKLRIHCPYGCCGFDGGHSCIRCCGLHEAKNENSIEELKIHCPYGCCGFDASHSCTRCCGLHEAKNE
NPIEELKIHCPYGCCGFDASHSCTRCCGLRKVEKENPIEELKIHCPYGCCGFDASHSCTRCCGLHEIERENPVEELKIHCPYGCCGFDASHSCTRCCGLHEVEKENLVER
LKIHCPYGCCSFDASHSCTRCCGLRKVKKENSVEELKIHCPYGCCGFDASHSCTRCCGLHEVEKENPIEELKIHCPYGCCGFDASHSCTRCCGLHKVEKENPVEELKIHC
PYGCCGFDVSHSCTRCCGLHEVEKENPVERLKIHCPYGCCSFDDHYNCIRCCGLSAEEHAIIQNGFNLISHQNEANVEKGQRERNAVKGNEVETNGVVGNRKGAGEDGKR
VE