; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS005180 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS005180
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPollen Ole e 1 allergen and extensin family protein, putative
Genome locationscaffold176:3294930..3295714
RNA-Seq ExpressionMS005180
SyntenyMS005180
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140163.2 uncharacterized protein LOC101219078 [Cucumis sativus]1.6e-5163.02Show/hide
Query:  MALASFVTALSFLILLPTVELSTA-----------HFLKGNVSCLDCHASYDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAGGP
        MA+ S V A +F++ +  V  +TA           + LKG V CLDCHASYDLSG VVM KC+KV KVVTATT  DG FEAEL PS    +CEARLAGG 
Subjt:  MALASFVTALSFLILLPTVELSTA-----------HFLKGNVSCLDCHASYDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAGGP

Query:  NQIYAATNTIVAGIVRGDGG---FYGISTPLAFCTACR-----SISSEAVKYCKAAGRKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP
        NQ+YA+   IVAGIV+G GG    YGISTPLAFC++CR     + S+EA KYCKA   KFGSSKTFNLPLPPEWG+APSSYYFPFFPIIG+P
Subjt:  NQIYAATNTIVAGIVRGDGG---FYGISTPLAFCTACR-----SISSEAVKYCKAAGRKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP

XP_008449582.1 PREDICTED: uncharacterized protein LOC103491422 [Cucumis melo]2.8e-5370.37Show/hide
Query:  LSTAHF-LKGNVSCLDCHASYDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAGGPNQIYAATNTIVAGIVRGDGG---FYGISTP
        ++T H+ LKG V CLDCHASYDL+G VVM KC+KV KVVTATT KDG FEAEL PS    +CEARLAGG NQ+YAAT  +VAGIV+G GG    YGISTP
Subjt:  LSTAHF-LKGNVSCLDCHASYDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAGGPNQIYAATNTIVAGIVRGDGG---FYGISTP

Query:  LAFCTACR-----SISSEAVKYCKAAGRKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP
        LAFC++CR     + S+EA KYCK    KFGSSKTFNLPLPPEWG+APSSYYFPFFPIIG+P
Subjt:  LAFCTACR-----SISSEAVKYCKAAGRKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP

XP_022152985.1 uncharacterized protein LOC111020592 isoform X1 [Momordica charantia]1.3e-9098.27Show/hide
Query:  MALASFVTALSFLILLPTVELSTAHFLKGNVSCLDCHASYDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAGGPNQIYAATNTIV
        MALASFVTALSFLILLPTVELSTAHFLKGNVSCLDCHASYDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAG PNQIY+ATNTIV
Subjt:  MALASFVTALSFLILLPTVELSTAHFLKGNVSCLDCHASYDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAGGPNQIYAATNTIV

Query:  AGIVRGDGGFYGISTPLAFCTACRSISSEAVKYCKAAGRKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP
        AGIVRGDGG YGISTPLAFCTACRSISSEAVKYCKAAGRKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP
Subjt:  AGIVRGDGGFYGISTPLAFCTACRSISSEAVKYCKAAGRKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP

XP_022152986.1 uncharacterized protein LOC111020592 isoform X2 [Momordica charantia]1.6e-6795.52Show/hide
Query:  YDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAGGPNQIYAATNTIVAGIVRGDGGFYGISTPLAFCTACRSISSEAVKYCKAAGR
        + L+GTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAG PNQIY+ATNTIVAGIVRGDGG YGISTPLAFCTACRSISSEAVKYCKAAGR
Subjt:  YDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAGGPNQIYAATNTIVAGIVRGDGGFYGISTPLAFCTACRSISSEAVKYCKAAGR

Query:  KFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP
        KFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP
Subjt:  KFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP

XP_038902068.1 uncharacterized protein LOC120088711 [Benincasa hispida]1.8e-5569.4Show/hide
Query:  MALASFVTA---LSFLILLPTVELS-TAHFLKGNVSCLDCHASYDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAGGPNQIYAAT
        MA+ S VTA   L  ++    VELS T H LKG V CLDC A+YDLSG VVM KC+KV KVVTATT KDG FEAEL PS    DCEARL GG NQ+YAA 
Subjt:  MALASFVTA---LSFLILLPTVELS-TAHFLKGNVSCLDCHASYDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAGGPNQIYAAT

Query:  NTIVAGIVRGDGG---FYGISTPLAFCTACRSIS---SEAVKYCKAAGRKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP
          +VAGIVRG GG    YGI+TPLAFC++CR  S   SEA KYCKAAG KFGSSKTFNLPLPPEWG+APSSYYFPFFPIIG+P
Subjt:  NTIVAGIVRGDGG---FYGISTPLAFCTACRSIS---SEAVKYCKAAGRKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP

TrEMBL top hitse value%identityAlignment
A0A0A0KGW8 Uncharacterized protein7.6e-5263.02Show/hide
Query:  MALASFVTALSFLILLPTVELSTA-----------HFLKGNVSCLDCHASYDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAGGP
        MA+ S V A +F++ +  V  +TA           + LKG V CLDCHASYDLSG VVM KC+KV KVVTATT  DG FEAEL PS    +CEARLAGG 
Subjt:  MALASFVTALSFLILLPTVELSTA-----------HFLKGNVSCLDCHASYDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAGGP

Query:  NQIYAATNTIVAGIVRGDGG---FYGISTPLAFCTACR-----SISSEAVKYCKAAGRKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP
        NQ+YA+   IVAGIV+G GG    YGISTPLAFC++CR     + S+EA KYCKA   KFGSSKTFNLPLPPEWG+APSSYYFPFFPIIG+P
Subjt:  NQIYAATNTIVAGIVRGDGG---FYGISTPLAFCTACR-----SISSEAVKYCKAAGRKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP

A0A1S3BLR1 uncharacterized protein LOC1034914221.4e-5370.37Show/hide
Query:  LSTAHF-LKGNVSCLDCHASYDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAGGPNQIYAATNTIVAGIVRGDGG---FYGISTP
        ++T H+ LKG V CLDCHASYDL+G VVM KC+KV KVVTATT KDG FEAEL PS    +CEARLAGG NQ+YAAT  +VAGIV+G GG    YGISTP
Subjt:  LSTAHF-LKGNVSCLDCHASYDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAGGPNQIYAATNTIVAGIVRGDGG---FYGISTP

Query:  LAFCTACR-----SISSEAVKYCKAAGRKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP
        LAFC++CR     + S+EA KYCK    KFGSSKTFNLPLPPEWG+APSSYYFPFFPIIG+P
Subjt:  LAFCTACR-----SISSEAVKYCKAAGRKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP

A0A5D3BC61 Bile acid-inducible operon CD1.4e-5370.37Show/hide
Query:  LSTAHF-LKGNVSCLDCHASYDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAGGPNQIYAATNTIVAGIVRGDGG---FYGISTP
        ++T H+ LKG V CLDCHASYDL+G VVM KC+KV KVVTATT KDG FEAEL PS    +CEARLAGG NQ+YAAT  +VAGIV+G GG    YGISTP
Subjt:  LSTAHF-LKGNVSCLDCHASYDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAGGPNQIYAATNTIVAGIVRGDGG---FYGISTP

Query:  LAFCTACR-----SISSEAVKYCKAAGRKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP
        LAFC++CR     + S+EA KYCK    KFGSSKTFNLPLPPEWG+APSSYYFPFFPIIG+P
Subjt:  LAFCTACR-----SISSEAVKYCKAAGRKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP

A0A6J1DFG9 uncharacterized protein LOC111020592 isoform X16.3e-9198.27Show/hide
Query:  MALASFVTALSFLILLPTVELSTAHFLKGNVSCLDCHASYDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAGGPNQIYAATNTIV
        MALASFVTALSFLILLPTVELSTAHFLKGNVSCLDCHASYDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAG PNQIY+ATNTIV
Subjt:  MALASFVTALSFLILLPTVELSTAHFLKGNVSCLDCHASYDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAGGPNQIYAATNTIV

Query:  AGIVRGDGGFYGISTPLAFCTACRSISSEAVKYCKAAGRKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP
        AGIVRGDGG YGISTPLAFCTACRSISSEAVKYCKAAGRKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP
Subjt:  AGIVRGDGGFYGISTPLAFCTACRSISSEAVKYCKAAGRKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP

A0A6J1DJC9 uncharacterized protein LOC111020592 isoform X27.5e-6895.52Show/hide
Query:  YDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAGGPNQIYAATNTIVAGIVRGDGGFYGISTPLAFCTACRSISSEAVKYCKAAGR
        + L+GTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAG PNQIY+ATNTIVAGIVRGDGG YGISTPLAFCTACRSISSEAVKYCKAAGR
Subjt:  YDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAGGPNQIYAATNTIVAGIVRGDGGFYGISTPLAFCTACRSISSEAVKYCKAAGR

Query:  KFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP
        KFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP
Subjt:  KFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G27385.1 Pollen Ole e 1 allergen and extensin family protein8.9e-2944Show/hide
Query:  MALAS-FVTALSFLILLPTVELSTAHFLKGNVSCLDCHASYDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAGGPNQIYAATNTI
        MAL S F+    F   L +  + +A  ++G VSC DC   YD SG  V V C   +   T TT K G F +EL PS    +CEA L G   Q+YA+ N +
Subjt:  MALAS-FVTALSFLILLPTVELSTAHFLKGNVSCLDCHASYDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAGGPNQIYAATNTI

Query:  VAGIVRGDGGFYGISTPLAFCTAC-RSISSEAVKYCKAAGRKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP
         + IV+  G  YG+S+ L F  +C RS  S            F SSKT +LP+PPEWGLAP+SYY PF PIIG+P
Subjt:  VAGIVRGDGGFYGISTPLAFCTAC-RSISSEAVKYCKAAGRKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP

AT2G27385.2 Pollen Ole e 1 allergen and extensin family protein8.9e-2944Show/hide
Query:  MALAS-FVTALSFLILLPTVELSTAHFLKGNVSCLDCHASYDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAGGPNQIYAATNTI
        MAL S F+    F   L +  + +A  ++G VSC DC   YD SG  V V C   +   T TT K G F +EL PS    +CEA L G   Q+YA+ N +
Subjt:  MALAS-FVTALSFLILLPTVELSTAHFLKGNVSCLDCHASYDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAGGPNQIYAATNTI

Query:  VAGIVRGDGGFYGISTPLAFCTAC-RSISSEAVKYCKAAGRKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP
         + IV+  G  YG+S+ L F  +C RS  S            F SSKT +LP+PPEWGLAP+SYY PF PIIG+P
Subjt:  VAGIVRGDGGFYGISTPLAFCTAC-RSISSEAVKYCKAAGRKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP

AT2G27385.3 Pollen Ole e 1 allergen and extensin family protein8.9e-2944Show/hide
Query:  MALAS-FVTALSFLILLPTVELSTAHFLKGNVSCLDCHASYDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAGGPNQIYAATNTI
        MAL S F+    F   L +  + +A  ++G VSC DC   YD SG  V V C   +   T TT K G F +EL PS    +CEA L G   Q+YA+ N +
Subjt:  MALAS-FVTALSFLILLPTVELSTAHFLKGNVSCLDCHASYDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAGGPNQIYAATNTI

Query:  VAGIVRGDGGFYGISTPLAFCTAC-RSISSEAVKYCKAAGRKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP
         + IV+  G  YG+S+ L F  +C RS  S            F SSKT +LP+PPEWGLAP+SYY PF PIIG+P
Subjt:  VAGIVRGDGGFYGISTPLAFCTAC-RSISSEAVKYCKAAGRKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP

AT5G22430.1 Pollen Ole e 1 allergen and extensin family protein3.9e-2438.42Show/hide
Query:  ALASFVTALSFLILLPTVELSTAHFLKGNVSCLDCHASYDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGR---DCEARLAGGPNQIYAATNT
        A A F+ AL+   +   +ELS +  + G +SCLDCH  +D SG  V++KCD   K +TA    DG F + L P++D +   +C A+L GGP Q+YA  + 
Subjt:  ALASFVTALSFLILLPTVELSTAHFLKGNVSCLDCHASYDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGR---DCEARLAGGPNQIYAATNT

Query:  IVAGIVRG--DGGFYGISTPLAFCTACRSISSEAVKYCKAAGRKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP
        +V+ +V+   D      S PLAF  +C   S + +      G   G SKT N P    +G  P+S +FPF PIIG+P
Subjt:  IVAGIVRG--DGGFYGISTPLAFCTACRSISSEAVKYCKAAGRKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCTTGCAAGTTTTGTTACAGCATTGTCATTTCTGATCCTTCTTCCCACAGTTGAGCTTTCAACCGCTCATTTTCTGAAAGGGAACGTCTCCTGCCTTGACTGCCA
CGCCTCCTATGATCTCTCAGGGACGGTGGTGATGGTGAAGTGCGATAAGGTAGACAAGGTGGTTACAGCAACTACGGGAAAGGATGGGTTTTTCGAGGCGGAGCTGCACC
CCAGTTCAGATGGCCGCGACTGCGAGGCCAGGCTCGCCGGAGGGCCCAACCAGATCTACGCCGCCACAAACACCATTGTCGCCGGAATCGTAAGGGGTGACGGCGGCTTC
TACGGCATCTCCACTCCGCTGGCGTTCTGCACTGCTTGCCGCTCCATCAGCAGTGAAGCCGTCAAATATTGCAAGGCCGCCGGAAGGAAATTCGGTTCCTCCAAGACCTT
CAACCTTCCTCTGCCGCCGGAGTGGGGCCTGGCGCCGTCCAGCTACTATTTCCCTTTCTTCCCCATCATTGGCGTTCCT
mRNA sequenceShow/hide mRNA sequence
ATGGCTCTTGCAAGTTTTGTTACAGCATTGTCATTTCTGATCCTTCTTCCCACAGTTGAGCTTTCAACCGCTCATTTTCTGAAAGGGAACGTCTCCTGCCTTGACTGCCA
CGCCTCCTATGATCTCTCAGGGACGGTGGTGATGGTGAAGTGCGATAAGGTAGACAAGGTGGTTACAGCAACTACGGGAAAGGATGGGTTTTTCGAGGCGGAGCTGCACC
CCAGTTCAGATGGCCGCGACTGCGAGGCCAGGCTCGCCGGAGGGCCCAACCAGATCTACGCCGCCACAAACACCATTGTCGCCGGAATCGTAAGGGGTGACGGCGGCTTC
TACGGCATCTCCACTCCGCTGGCGTTCTGCACTGCTTGCCGCTCCATCAGCAGTGAAGCCGTCAAATATTGCAAGGCCGCCGGAAGGAAATTCGGTTCCTCCAAGACCTT
CAACCTTCCTCTGCCGCCGGAGTGGGGCCTGGCGCCGTCCAGCTACTATTTCCCTTTCTTCCCCATCATTGGCGTTCCT
Protein sequenceShow/hide protein sequence
MALASFVTALSFLILLPTVELSTAHFLKGNVSCLDCHASYDLSGTVVMVKCDKVDKVVTATTGKDGFFEAELHPSSDGRDCEARLAGGPNQIYAATNTIVAGIVRGDGGF
YGISTPLAFCTACRSISSEAVKYCKAAGRKFGSSKTFNLPLPPEWGLAPSSYYFPFFPIIGVP