; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC03g0434 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC03g0434
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionPollen Ole e 1 allergen and extensin family protein
Genome locationMC03:11219239..11221080
RNA-Seq ExpressionMC03g0434
SyntenyMC03g0434
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573632.1 hypothetical protein SDJN03_27519, partial [Cucurbita argyrosperma subsp. sororia]1.75e-7668.54Show/hide
Query:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASN--SQSPPSNCIAKLVGGPHQ
        MASLKL + + FLL +VLA TQ AQC TL+A ISCLDC+SNYDFSGN I V C+ VKNL++AIT  +GSF+TTLPS+A++  S++ PSNCIAKL+GGPHQ
Subjt:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASN--SQSPPSNCIAKLVGGPHQ

Query:  LYASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
        LYASRK MAS VIK TNS FFT+A AL FSTCK + KC ++K+ F+ADSKT+DLPLP EWG  P+SYY P LPIIGIP
Subjt:  LYASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP

XP_008461260.1 PREDICTED: uncharacterized protein LOC103499896 [Cucumis melo]2.29e-7669.23Show/hide
Query:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSQS----PPSNCIAKLVGGP
        MASLKL +  P  L MVLA+TQ AQC TL+AKISCLDC+SNYDFSGN I+VKCE+VKNL +AIT+ DGSFET LPSD ++  S    PP  CIAKLVGG 
Subjt:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSQS----PPSNCIAKLVGGP

Query:  HQLYASRKDMASIVIKATNSKFFTIATALKFSTCKQHN-KCQAMKDQFIADSKTVDLP-LPREWGLAPSSYYLPELPIIGIP
        HQL+ASRK++ S +IK TNSKFFTIATAL+FSTCK+ N KC+A+K + I DSKT DLP LP EWG  P+SYYLP LPIIGIP
Subjt:  HQLYASRKDMASIVIKATNSKFFTIATALKFSTCKQHN-KCQAMKDQFIADSKTVDLP-LPREWGLAPSSYYLPELPIIGIP

XP_022142556.1 uncharacterized protein LOC111012639 [Momordica charantia]6.57e-120100Show/hide
Query:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSQSPPSNCIAKLVGGPHQLY
        MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSQSPPSNCIAKLVGGPHQLY
Subjt:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSQSPPSNCIAKLVGGPHQLY

Query:  ASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
        ASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
Subjt:  ASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP

XP_022967108.1 uncharacterized protein LOC111466611 [Cucurbita maxima]1.30e-7870.22Show/hide
Query:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSD--ASNSQSPPSNCIAKLVGGPHQ
        MASLKL + + FLLLM+LA TQ AQC TL+A ISCLDC+SNYDFSGN IVV C+ VKNL+VAIT  +GSF+TTLPSD  + +S++ PSNCIAKL+GGPHQ
Subjt:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSD--ASNSQSPPSNCIAKLVGGPHQ

Query:  LYASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
        LYASRK MAS VIK TNS FFT+A AL FSTCK + KC ++K+ F+ADSKT+DLPLP EWG  P+SYY P LPIIGIP
Subjt:  LYASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP

XP_023542373.1 uncharacterized protein LOC111802294 [Cucurbita pepo subsp. pepo]3.72e-7870.22Show/hide
Query:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSD--ASNSQSPPSNCIAKLVGGPHQ
        MASLKL + + FLLLMVLA TQ AQC TL+A ISCLDC+SNYDFSGN IVV C+ VKNL+VAIT  +GSF+TTLPSD  + +S++ PSNCIAKL GGPHQ
Subjt:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSD--ASNSQSPPSNCIAKLVGGPHQ

Query:  LYASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
        LYASRK M+S VIK TNS FFT+A AL FSTCK + KC ++K+ F+ADSKT+DLPLP EWG  P+SYY P LPIIGIP
Subjt:  LYASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP

TrEMBL top hitse value%identityAlignment
A0A0A0LYA1 Uncharacterized protein8.64e-7564.61Show/hide
Query:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSQSPPS-NCIAKLVGGPHQL
        MASLKL ++  F+ ++++A+TQ AQC TL+AKISCLDC+SNYDFSGN I+VKCE+ KNL +AIT+ DGSFET+LPS+ ++  +P S  CIAKL+GG HQL
Subjt:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSQSPPS-NCIAKLVGGPHQL

Query:  YASRKDMASIVIKATNSKFFTIATALKFSTCKQHNK-CQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
        +ASRK+M S +IK TNSKFFTIATALKFSTCK+ ++ C+A+K + + DSKT D PLP EWG  P+SYY+P LPIIGIP
Subjt:  YASRKDMASIVIKATNSKFFTIATALKFSTCKQHNK-CQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP

A0A1S3CDU1 uncharacterized protein LOC1034998961.11e-7669.23Show/hide
Query:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSQS----PPSNCIAKLVGGP
        MASLKL +  P  L MVLA+TQ AQC TL+AKISCLDC+SNYDFSGN I+VKCE+VKNL +AIT+ DGSFET LPSD ++  S    PP  CIAKLVGG 
Subjt:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSQS----PPSNCIAKLVGGP

Query:  HQLYASRKDMASIVIKATNSKFFTIATALKFSTCKQHN-KCQAMKDQFIADSKTVDLP-LPREWGLAPSSYYLPELPIIGIP
        HQL+ASRK++ S +IK TNSKFFTIATAL+FSTCK+ N KC+A+K + I DSKT DLP LP EWG  P+SYYLP LPIIGIP
Subjt:  HQLYASRKDMASIVIKATNSKFFTIATALKFSTCKQHN-KCQAMKDQFIADSKTVDLP-LPREWGLAPSSYYLPELPIIGIP

A0A6J1CMJ0 uncharacterized protein LOC1110126393.18e-120100Show/hide
Query:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSQSPPSNCIAKLVGGPHQLY
        MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSQSPPSNCIAKLVGGPHQLY
Subjt:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSQSPPSNCIAKLVGGPHQLY

Query:  ASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
        ASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
Subjt:  ASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP

A0A6J1G179 uncharacterized protein LOC1114497451.21e-7667.98Show/hide
Query:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASN--SQSPPSNCIAKLVGGPHQ
        MASLKL + + FLL +VLA TQ AQC TL+A ISCLDC+SNYDFSGN + V C+ VKNL++AIT  +GSF+TTLPS+A++  S++ PSNCIAKL+GGPHQ
Subjt:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASN--SQSPPSNCIAKLVGGPHQ

Query:  LYASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
        LYASRK MAS VIK TNS FFT+A AL FSTCK + KC ++K+ F+ADSKT+DLPLP EWG  P+SYY P LPIIGIP
Subjt:  LYASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP

A0A6J1HU61 uncharacterized protein LOC1114666116.29e-7970.22Show/hide
Query:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSD--ASNSQSPPSNCIAKLVGGPHQ
        MASLKL + + FLLLM+LA TQ AQC TL+A ISCLDC+SNYDFSGN IVV C+ VKNL+VAIT  +GSF+TTLPSD  + +S++ PSNCIAKL+GGPHQ
Subjt:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSD--ASNSQSPPSNCIAKLVGGPHQ

Query:  LYASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
        LYASRK MAS VIK TNS FFT+A AL FSTCK + KC ++K+ F+ADSKT+DLPLP EWG  P+SYY P LPIIGIP
Subjt:  LYASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G27385.1 Pollen Ole e 1 allergen and extensin family protein4.8e-2236.36Show/hide
Query:  FLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSQSPPSNCIAKLVGGPHQLYASRKDMASIVI
        FL    L++        +  K+SC DC ++YD+SG  + V C          T   G F + LPS         SNC A+L G   QLYAS+ ++ S ++
Subjt:  FLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSQSPPSNCIAKLVGGPHQLYASRKDMASIVI

Query:  KATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
        K    K+   +      +C +            + SKTVDLP+P EWGLAP+SYY+P LPIIGIP
Subjt:  KATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP

AT2G27385.2 Pollen Ole e 1 allergen and extensin family protein4.8e-2236.36Show/hide
Query:  FLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSQSPPSNCIAKLVGGPHQLYASRKDMASIVI
        FL    L++        +  K+SC DC ++YD+SG  + V C          T   G F + LPS         SNC A+L G   QLYAS+ ++ S ++
Subjt:  FLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSQSPPSNCIAKLVGGPHQLYASRKDMASIVI

Query:  KATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
        K    K+   +      +C +            + SKTVDLP+P EWGLAP+SYY+P LPIIGIP
Subjt:  KATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP

AT2G27385.3 Pollen Ole e 1 allergen and extensin family protein4.8e-2236.36Show/hide
Query:  FLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSQSPPSNCIAKLVGGPHQLYASRKDMASIVI
        FL    L++        +  K+SC DC ++YD+SG  + V C          T   G F + LPS         SNC A+L G   QLYAS+ ++ S ++
Subjt:  FLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSQSPPSNCIAKLVGGPHQLYASRKDMASIVI

Query:  KATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
        K    K+   +      +C +            + SKTVDLP+P EWGLAP+SYY+P LPIIGIP
Subjt:  KATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP

AT5G22430.1 Pollen Ole e 1 allergen and extensin family protein5.7e-2337.8Show/hide
Query:  LMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSQSPPSNCIAKLVGGPHQLYASRKDMASIVIKAT
        L V +  + +    +  KISCLDC  ++DFSG K+++KC+  K    A+   DGSF + LP   +  +    NC+AKL+GGP QLYA + ++ S ++K+ 
Subjt:  LMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSQSPPSNCIAKLVGGPHQLYASRKDMASIVIKAT

Query:  -NSKFFTIATALKFS-TCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
         +SK  T +  L FS +C + ++        I DSKT++ P    +G  P+S++ P LPIIGIP
Subjt:  -NSKFFTIATALKFS-TCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTCTTAAGCTGCTCCTCACTCTTCCTTTTCTCTTGCTCATGGTTCTTGCAACAACTCAAGCTGCCCAATGCTACACTTTGAGGGCCAAGATCTCCTGCCTCGA
CTGTCGATCTAACTACGACTTCTCTGGAAACAAGATCGTGGTGAAGTGCGAGAAAGTGAAAAACCTAGCCGTAGCAATCACCAGAACCGATGGATCATTCGAAACTACGC
TTCCTTCGGACGCCTCCAACTCTCAGTCGCCTCCTTCCAACTGCATTGCCAAACTTGTTGGGGGACCTCACCAACTGTATGCTTCAAGGAAAGACATGGCTTCCATTGTC
ATCAAGGCAACCAACTCCAAATTCTTCACCATTGCCACAGCTCTCAAGTTCTCCACTTGCAAACAACACAACAAATGCCAAGCCATGAAAGATCAGTTTATTGCAGATTC
AAAGACCGTTGATTTGCCTCTGCCACGTGAGTGGGGGCTGGCACCTTCCAGCTACTATCTTCCTGAGCTCCCGATCATAGGCATTCCTTGA
mRNA sequenceShow/hide mRNA sequence
TAACCACGCAATCTCTATTCCTGCCATGTAGTATTTATTGCAACTTTACTTTGGAGAAGACTTTCTCCCATCATATTAAGAAATTTGTCAGCTGGATGGTAACTAAAACC
AAATGGCATTTATGCCACCTACTTAATGAGGCCATGTGACCTTTTACATTAACTTCTTTGTCCTCTACTTTATGCTTCATTTATAGGGCTGCATACTGATTTTGAGTAGC
TAAACACTTTGATTCAACTGTTGCTATCTTATCTATGGCTTCTCTTAAGCTGCTCCTCACTCTTCCTTTTCTCTTGCTCATGGTTCTTGCAACAACTCAAGCTGCCCAAT
GCTACACTTTGAGGGCCAAGATCTCCTGCCTCGACTGTCGATCTAACTACGACTTCTCTGGAAACAAGATCGTGGTGAAGTGCGAGAAAGTGAAAAACCTAGCCGTAGCA
ATCACCAGAACCGATGGATCATTCGAAACTACGCTTCCTTCGGACGCCTCCAACTCTCAGTCGCCTCCTTCCAACTGCATTGCCAAACTTGTTGGGGGACCTCACCAACT
GTATGCTTCAAGGAAAGACATGGCTTCCATTGTCATCAAGGCAACCAACTCCAAATTCTTCACCATTGCCACAGCTCTCAAGTTCTCCACTTGCAAACAACACAACAAAT
GCCAAGCCATGAAAGATCAGTTTATTGCAGATTCAAAGACCGTTGATTTGCCTCTGCCACGTGAGTGGGGGCTGGCACCTTCCAGCTACTATCTTCCTGAGCTCCCGATC
ATAGGCATTCCTTGAATGTTATATCATAGCTGCTAAGTTTGCAATAATCTGCTTTGAGAACTTTAAGGTTTGTGTATTTGTGCTGAAAATGTGTTGTGTTGTTGTGTTGT
ATGTTGTTGGGAGTGGATTAATCATATGGCTAAGTGTCGTCGGCTATATAATCTAGTTTGAAATATGTTTGTAACTATTGGTATTATGAAGTGAAGTTTCTAAGGCTGGT
TTCGTATTGTCTTAGCTAGCTCTTTTTACAGAGATCAAGATATGTATATTGATTAGAGAGTTTCGTTAGATGAACGAGTAGACTAAATTAGATAGATGTATATGAAAATT
TACTCTTAC
Protein sequenceShow/hide protein sequence
MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSQSPPSNCIAKLVGGPHQLYASRKDMASIV
IKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP