CuGenDBv2

MS002834 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS002834
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Pollen Ole e 1 allergen and extensin family protein
Genome location: scaffold359:71999..73264
RNA-Seq Expression: MS002834
Synteny: MS002834
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAG6573632.1 hypothetical protein SDJN03_27519, partial [Cucurbita argyrosperma subsp. sororia] | e value: 4.0e-58 | % identity: 68.54
Query:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDAS--NSRSPPSNCIAKLVGGPHQ
        MASLKL + + FLL +VLA TQ AQC TL+A ISCLDC+SNYDFSGN I V C+ VKNL++AIT  +GSF+TTLPS+A+  +S + PSNCIAKL+GGPHQ
Subjt:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDAS--NSRSPPSNCIAKLVGGPHQ

Query:  LYASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
        LYASRK MAS VIK TNS FFT+A AL FSTCK + KC ++K+ F+ADSKT+DLPLP EWG  P+SYY P LPIIGIP
Subjt:  LYASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP

XP_008461260.1 PREDICTED: uncharacterized protein LOC103499896 [Cucumis melo] | e value: 3.0e-58 | % identity: 69.23
Query:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDA----SNSRSPPSNCIAKLVGGP
        MASLKL +  P  L MVLA+TQ AQC TL+AKISCLDC+SNYDFSGN I+VKCE+VKNL +AIT+ DGSFET LPSD     S +  PP  CIAKLVGG 
Subjt:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDA----SNSRSPPSNCIAKLVGGP

Query:  HQLYASRKDMASIVIKATNSKFFTIATALKFSTCKQHN-KCQAMKDQFIADSKTVDLP-LPREWGLAPSSYYLPELPIIGIP
        HQL+ASRK++ S +IK TNSKFFTIATAL+FSTCK+ N KC+A+K + I DSKT DLP LP EWG  P+SYYLP LPIIGIP
Subjt:  HQLYASRKDMASIVIKATNSKFFTIATALKFSTCKQHN-KCQAMKDQFIADSKTVDLP-LPREWGLAPSSYYLPELPIIGIP

XP_022142556.1 uncharacterized protein LOC111012639 [Momordica charantia] | e value: 6.1e-91 | % identity: 99.43
Query:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSRSPPSNCIAKLVGGPHQLY
        MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNS+SPPSNCIAKLVGGPHQLY
Subjt:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSRSPPSNCIAKLVGGPHQLY

Query:  ASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
        ASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
Subjt:  ASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP

XP_022967108.1 uncharacterized protein LOC111466611 [Cucurbita maxima] | e value: 4.2e-60 | % identity: 70.22
Query:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSD--ASNSRSPPSNCIAKLVGGPHQ
        MASLKL + + FLLLM+LA TQ AQC TL+A ISCLDC+SNYDFSGN IVV C+ VKNL+VAIT  +GSF+TTLPSD  + +S++ PSNCIAKL+GGPHQ
Subjt:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSD--ASNSRSPPSNCIAKLVGGPHQ

Query:  LYASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
        LYASRK MAS VIK TNS FFT+A AL FSTCK + KC ++K+ F+ADSKT+DLPLP EWG  P+SYY P LPIIGIP
Subjt:  LYASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP

XP_023542373.1 uncharacterized protein LOC111802294 [Cucurbita pepo subsp. pepo] | e value: 2.1e-59 | % identity: 70.22
Query:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSD--ASNSRSPPSNCIAKLVGGPHQ
        MASLKL + + FLLLMVLA TQ AQC TL+A ISCLDC+SNYDFSGN IVV C+ VKNL+VAIT  +GSF+TTLPSD  + +S + PSNCIAKL GGPHQ
Subjt:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSD--ASNSRSPPSNCIAKLVGGPHQ

Query:  LYASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
        LYASRK M+S VIK TNS FFT+A AL FSTCK + KC ++K+ F+ADSKT+DLPLP EWG  P+SYY P LPIIGIP
Subjt:  LYASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LYA1 Uncharacterized protein | e value: 3.6e-57 | % identity: 64.61
Query:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSRSPPS-NCIAKLVGGPHQL
        MASLKL ++  F+ ++++A+TQ AQC TL+AKISCLDC+SNYDFSGN I+VKCE+ KNL +AIT+ DGSFET+LPS+ ++  +P S  CIAKL+GG HQL
Subjt:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSRSPPS-NCIAKLVGGPHQL

Query:  YASRKDMASIVIKATNSKFFTIATALKFSTCKQHNK-CQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
        +ASRK+M S +IK TNSKFFTIATALKFSTCK+ ++ C+A+K + + DSKT D PLP EWG  P+SYY+P LPIIGIP
Subjt:  YASRKDMASIVIKATNSKFFTIATALKFSTCKQHNK-CQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP

A0A1S3CDU1 uncharacterized protein LOC103499896 | e value: 1.5e-58 | % identity: 69.23
Query:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDA----SNSRSPPSNCIAKLVGGP
        MASLKL +  P  L MVLA+TQ AQC TL+AKISCLDC+SNYDFSGN I+VKCE+VKNL +AIT+ DGSFET LPSD     S +  PP  CIAKLVGG 
Subjt:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDA----SNSRSPPSNCIAKLVGGP

Query:  HQLYASRKDMASIVIKATNSKFFTIATALKFSTCKQHN-KCQAMKDQFIADSKTVDLP-LPREWGLAPSSYYLPELPIIGIP
        HQL+ASRK++ S +IK TNSKFFTIATAL+FSTCK+ N KC+A+K + I DSKT DLP LP EWG  P+SYYLP LPIIGIP
Subjt:  HQLYASRKDMASIVIKATNSKFFTIATALKFSTCKQHN-KCQAMKDQFIADSKTVDLP-LPREWGLAPSSYYLPELPIIGIP

A0A6J1CMJ0 uncharacterized protein LOC111012639 | e value: 2.9e-91 | % identity: 99.43
Query:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSRSPPSNCIAKLVGGPHQLY
        MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNS+SPPSNCIAKLVGGPHQLY
Subjt:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSRSPPSNCIAKLVGGPHQLY

Query:  ASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
        ASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
Subjt:  ASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP

A0A6J1G179 uncharacterized protein LOC111449745 | e value: 2.5e-58 | % identity: 67.98
Query:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDAS--NSRSPPSNCIAKLVGGPHQ
        MASLKL + + FLL +VLA TQ AQC TL+A ISCLDC+SNYDFSGN + V C+ VKNL++AIT  +GSF+TTLPS+A+  +S + PSNCIAKL+GGPHQ
Subjt:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDAS--NSRSPPSNCIAKLVGGPHQ

Query:  LYASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
        LYASRK MAS VIK TNS FFT+A AL FSTCK + KC ++K+ F+ADSKT+DLPLP EWG  P+SYY P LPIIGIP
Subjt:  LYASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP

A0A6J1HU61 uncharacterized protein LOC111466611 | e value: 2.1e-60 | % identity: 70.22
Query:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSD--ASNSRSPPSNCIAKLVGGPHQ
        MASLKL + + FLLLM+LA TQ AQC TL+A ISCLDC+SNYDFSGN IVV C+ VKNL+VAIT  +GSF+TTLPSD  + +S++ PSNCIAKL+GGPHQ
Subjt:  MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSD--ASNSRSPPSNCIAKLVGGPHQ

Query:  LYASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
        LYASRK MAS VIK TNS FFT+A AL FSTCK + KC ++K+ F+ADSKT+DLPLP EWG  P+SYY P LPIIGIP
Subjt:  LYASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G27385.1 Pollen Ole e 1 allergen and extensin family protein | e value: 4.9e-22 | % identity: 36.36
Query:  FLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSRSPPSNCIAKLVGGPHQLYASRKDMASIVI
        FL    L++        +  K+SC DC ++YD+SG  + V C          T   G F + LPS         SNC A+L G   QLYAS+ ++ S ++
Subjt:  FLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSRSPPSNCIAKLVGGPHQLYASRKDMASIVI

Query:  KATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
        K    K+   +      +C +            + SKTVDLP+P EWGLAP+SYY+P LPIIGIP
Subjt:  KATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP

AT2G27385.2 Pollen Ole e 1 allergen and extensin family protein | e value: 4.9e-22 | % identity: 36.36
Query:  FLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSRSPPSNCIAKLVGGPHQLYASRKDMASIVI
        FL    L++        +  K+SC DC ++YD+SG  + V C          T   G F + LPS         SNC A+L G   QLYAS+ ++ S ++
Subjt:  FLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSRSPPSNCIAKLVGGPHQLYASRKDMASIVI

Query:  KATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
        K    K+   +      +C +            + SKTVDLP+P EWGLAP+SYY+P LPIIGIP
Subjt:  KATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP

AT2G27385.3 Pollen Ole e 1 allergen and extensin family protein | e value: 4.9e-22 | % identity: 36.36
Query:  FLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSRSPPSNCIAKLVGGPHQLYASRKDMASIVI
        FL    L++        +  K+SC DC ++YD+SG  + V C          T   G F + LPS         SNC A+L G   QLYAS+ ++ S ++
Subjt:  FLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSRSPPSNCIAKLVGGPHQLYASRKDMASIVI

Query:  KATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
        K    K+   +      +C +            + SKTVDLP+P EWGLAP+SYY+P LPIIGIP
Subjt:  KATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP

AT5G22430.1 Pollen Ole e 1 allergen and extensin family protein | e value: 4.4e-23 | % identity: 37.8
Query:  LMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSRSPPSNCIAKLVGGPHQLYASRKDMASIVIKAT
        L V +  + +    +  KISCLDC  ++DFSG K+++KC+  K    A+   DGSF + LP   +  +    NC+AKL+GGP QLYA + ++ S ++K+ 
Subjt:  LMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSRSPPSNCIAKLVGGPHQLYASRKDMASIVIKAT

Query:  -NSKFFTIATALKFS-TCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
         +SK  T +  L FS +C + ++        I DSKT++ P    +G  P+S++ P LPIIGIP
Subjt:  -NSKFFTIATALKFS-TCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
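
The % identity values reported for the hits above can be cross-checked from the alignment match lines (the unlabeled middle line of each block). The sketch below assumes the usual BLAST-style convention: a letter on the match line marks an identical residue, "+" marks a conservative substitution, a space marks a mismatch or gap, and identity is the number of identical positions divided by the total alignment length. The function name `percent_identity` is hypothetical, and the match lines are copied from the XP_022142556.1 hit with their leading indentation removed.

```python
def percent_identity(midline_blocks):
    """Percent identity from BLAST-style match lines (assumed convention).

    Each block must keep its full width: spaces inside or at the end of a
    match line are aligned positions and count toward the alignment length.
    """
    midline = "".join(midline_blocks)
    # Letters mark identical residues; '+' and ' ' are non-identical columns.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# Match lines of the XP_022142556.1 alignment (two blocks, 100 + 76 columns).
MIDLINE_BLOCKS = [
    "MASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNS+SPPSNCIAKLVGGPHQLY",
    "ASRKDMASIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP",
]

if __name__ == "__main__":
    print(f"{percent_identity(MIDLINE_BLOCKS):.2f}")
```

For this hit the match line has 176 columns with a single "+" position, so the computed value agrees with the 99.43 listed in the record.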


Sequences
CDS sequence
ATCTTATCTATGGCTTCTCTTAAGCTGCTCCTCACTCTTCCTTTTCTCTTGCTCATGGTTCTTGCAACAACTCAAGCTGCCCAATGCTACACTTTGAGGGCCAAGATCTC
CTGCCTCGACTGTCGATCTAACTACGACTTCTCAGGAAACAAGATCGTGGTGAAGTGCGAGAAAGTGAAAAACCTAGCCGTAGCAATCACCAGAACCGATGGATCATTCG
AAACTACGCTTCCTTCGGACGCCTCCAACTCTCGGTCGCCTCCTTCCAACTGCATTGCCAAGCTTGTTGGGGGACCTCACCAACTGTATGCTTCAAGGAAAGACATGGCT
TCCATTGTCATCAAGGCAACCAACTCCAAATTCTTCACCATTGCCACAGCTCTCAAGTTCTCCACTTGCAAACAACACAACAAATGCCAAGCCATGAAAGATCAGTTTAT
TGCAGATTCAAAGACCGTTGATTTGCCTCTGCCACGTGAGTGGGGGCTGGCACCTTCCAGCTACTATCTTCCTGAGCTCCCGATCATAGGCATTCCT
mRNA sequence
ATCTTATCTATGGCTTCTCTTAAGCTGCTCCTCACTCTTCCTTTTCTCTTGCTCATGGTTCTTGCAACAACTCAAGCTGCCCAATGCTACACTTTGAGGGCCAAGATCTC
CTGCCTCGACTGTCGATCTAACTACGACTTCTCAGGAAACAAGATCGTGGTGAAGTGCGAGAAAGTGAAAAACCTAGCCGTAGCAATCACCAGAACCGATGGATCATTCG
AAACTACGCTTCCTTCGGACGCCTCCAACTCTCGGTCGCCTCCTTCCAACTGCATTGCCAAGCTTGTTGGGGGACCTCACCAACTGTATGCTTCAAGGAAAGACATGGCT
TCCATTGTCATCAAGGCAACCAACTCCAAATTCTTCACCATTGCCACAGCTCTCAAGTTCTCCACTTGCAAACAACACAACAAATGCCAAGCCATGAAAGATCAGTTTAT
TGCAGATTCAAAGACCGTTGATTTGCCTCTGCCACGTGAGTGGGGGCTGGCACCTTCCAGCTACTATCTTCCTGAGCTCCCGATCATAGGCATTCCT
Protein sequence
ILSMASLKLLLTLPFLLLMVLATTQAAQCYTLRAKISCLDCRSNYDFSGNKIVVKCEKVKNLAVAITRTDGSFETTLPSDASNSRSPPSNCIAKLVGGPHQLYASRKDMA
SIVIKATNSKFFTIATALKFSTCKQHNKCQAMKDQFIADSKTVDLPLPREWGLAPSSYYLPELPIIGIP
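
The listed protein sequence can be checked against the CDS by translating the CDS in frame 1. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the names `CODON_TABLE`, `translate`, and `CDS_PREFIX` are hypothetical. To keep the example short, only the first 108 nucleotides of the CDS above are included; the complete sequence can be pasted in the same way.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a nucleotide string in frame 1; trailing partial codons are ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    # Unknown codons (for example ones containing N) are rendered as 'X'.
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))


# First 108 nucleotides (36 codons) of the CDS sequence in this record.
CDS_PREFIX = (
    "ATCTTATCTATGGCTTCTCTTAAGCTGCTCCTCACTCTTCCTTTTCTCTTGCTCATGGTT"
    "CTTGCAACAACTCAAGCTGCCCAATGCTACACTTTGAGGGCCAAGATC"
)

if __name__ == "__main__":
    print(translate(CDS_PREFIX))
```

The translated prefix begins with ILS followed by MASLK..., matching the start of the protein sequence in the record; the alignments in the homology section start at the methionine, three residues in.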