; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10018115 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10018115
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPollen Ole e 1 allergen and extensin family protein
Genome locationChr04:527162..528809
RNA-Seq ExpressionHG10018115
SyntenyHG10018115
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139493.1 uncharacterized protein LOC101207654 [Cucumis sativus]2.5e-6878.69Show/hide
Query:  MASLKLFIIPIFLLMVL-APTQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKCKRVKNLTIAITEADGSFETSLPSDVASIDFEAAAISSPKCIAKLVGG
        MASLKLFI P F+ +VL A TQ AQCNTLKAKISCLDCQSNYDFSGNLIMVKC+R KNLTIAIT+ADGSFETSLPS++AS     AA SSPKCIAKL+GG
Subjt:  MASLKLFIIPIFLLMVL-APTQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKCKRVKNLTIAITEADGSFETSLPSDVASIDFEAAAISSPKCIAKLVGG

Query:  SHQLFASRKDVASTVIKETNSNFFTIATALKFLTCKQ-NTKCKAMKKEFIIADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP
        SHQLFASRK++ ST+IKETNS FFTIATALKF TCK+ +  CKA+KKE  + DSKT D PLPPEWGFPPTSYY+PVLPIIGIP
Subjt:  SHQLFASRKDVASTVIKETNSNFFTIATALKFLTCKQ-NTKCKAMKKEFIIADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP

XP_008461260.1 PREDICTED: uncharacterized protein LOC103499896 [Cucumis melo]8.9e-7484.15Show/hide
Query:  MASLKLFIIPIFLLMVLAPTQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKCKRVKNLTIAITEADGSFETSLPSDVASIDFEAAAISSPKCIAKLVGGS
        MASLKLFI PIFL MVLA TQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKC+RVKNLTIAIT+ADGSFET LPSD+AS+D EAA    PKCIAKLVGGS
Subjt:  MASLKLFIIPIFLLMVLAPTQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKCKRVKNLTIAITEADGSFETSLPSDVASIDFEAAAISSPKCIAKLVGGS

Query:  HQLFASRKDVASTVIKETNSNFFTIATALKFLTCKQ-NTKCKAMKKEFIIADSKTIDLP-LPPEWGFPPTSYYLPVLPIIGIP
        HQLFASRK++ ST+IKETNS FFTIATAL+F TCK+ N KCKA+KKE  I DSKT DLP LPPEWGFPPTSYYLPVLPIIGIP
Subjt:  HQLFASRKDVASTVIKETNSNFFTIATALKFLTCKQ-NTKCKAMKKEFIIADSKTIDLP-LPPEWGFPPTSYYLPVLPIIGIP

XP_022967108.1 uncharacterized protein LOC111466611 [Cucurbita maxima]9.9e-6574.03Show/hide
Query:  MASLKLFIIPIFLLMVLAPTQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKCKRVKNLTIAITEADGSFETSLPSDVASIDFEAAAISSPKCIAKLVGGS
        MASLKLFII   LLM+LA TQLAQCNTLKA ISCLDCQSNYDFSGN+I+V CK VKNL++AITEA+GSF+T+LPSD  S D +AA      CIAKL+GG 
Subjt:  MASLKLFIIPIFLLMVLAPTQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKCKRVKNLTIAITEADGSFETSLPSDVASIDFEAAAISSPKCIAKLVGGS

Query:  HQLFASRKDVASTVIKETNSNFFTIATALKFLTCKQNTKCKAMKKEFIIADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP
        HQL+ASRK +AS VIK TNSNFFT+A AL F TCK NTKC ++K    +ADSKTIDLPLPPEWGFPPTSYY PVLPIIGIP
Subjt:  HQLFASRKDVASTVIKETNSNFFTIATALKFLTCKQNTKCKAMKKEFIIADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP

XP_023542373.1 uncharacterized protein LOC111802294 [Cucurbita pepo subsp. pepo]2.4e-6675.69Show/hide
Query:  MASLKLFIIPIFLLMVLAPTQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKCKRVKNLTIAITEADGSFETSLPSDVASIDFEAAAISSPKCIAKLVGGS
        MASLKLFII   LLMVLA TQLAQCNTLKA ISCLDCQSNYDFSGN+I+V CK VKNL++AITEA+GSF+T+LPSD+AS D EAA      CIAKL GG 
Subjt:  MASLKLFIIPIFLLMVLAPTQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKCKRVKNLTIAITEADGSFETSLPSDVASIDFEAAAISSPKCIAKLVGGS

Query:  HQLFASRKDVASTVIKETNSNFFTIATALKFLTCKQNTKCKAMKKEFIIADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP
        HQL+ASRK ++STVIK TNSNFFT+A AL F TCK NTKC ++K    +ADSKTIDLPLPPEWGFPPTSYY PVLPIIGIP
Subjt:  HQLFASRKDVASTVIKETNSNFFTIATALKFLTCKQNTKCKAMKKEFIIADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP

XP_038895206.1 uncharacterized protein LOC120083500 [Benincasa hispida]8.6e-7785.08Show/hide
Query:  MASLKLFIIPIFLLMVLAPTQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKCKRVKNLTIAITEADGSFETSLPSDVASIDFEAAAISSPKCIAKLVGGS
        MASL LFIIPIFLL+VLAPT LAQ NTLKAKISCLDCQSNYD SGNLIMV C+RVKNLT+AIT+ADGSFET LPSDVAS+DFEAA  SSPKCIAKLVGG+
Subjt:  MASLKLFIIPIFLLMVLAPTQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKCKRVKNLTIAITEADGSFETSLPSDVASIDFEAAAISSPKCIAKLVGGS

Query:  HQLFASRKDVASTVIKETNSNFFTIATALKFLTCKQNTKCKAMKKEFIIADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP
        HQLFASRKD+AST+IK+TN  FFTIAT LKF TCK+N KCKAMKK+F I DSKTIDLPLPPEWGFPPTSYY PVLPIIGIP
Subjt:  HQLFASRKDVASTVIKETNSNFFTIATALKFLTCKQNTKCKAMKKEFIIADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP

TrEMBL top hitse value%identityAlignment
A0A0A0LYA1 Uncharacterized protein1.2e-6878.69Show/hide
Query:  MASLKLFIIPIFLLMVL-APTQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKCKRVKNLTIAITEADGSFETSLPSDVASIDFEAAAISSPKCIAKLVGG
        MASLKLFI P F+ +VL A TQ AQCNTLKAKISCLDCQSNYDFSGNLIMVKC+R KNLTIAIT+ADGSFETSLPS++AS     AA SSPKCIAKL+GG
Subjt:  MASLKLFIIPIFLLMVL-APTQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKCKRVKNLTIAITEADGSFETSLPSDVASIDFEAAAISSPKCIAKLVGG

Query:  SHQLFASRKDVASTVIKETNSNFFTIATALKFLTCKQ-NTKCKAMKKEFIIADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP
        SHQLFASRK++ ST+IKETNS FFTIATALKF TCK+ +  CKA+KKE  + DSKT D PLPPEWGFPPTSYY+PVLPIIGIP
Subjt:  SHQLFASRKDVASTVIKETNSNFFTIATALKFLTCKQ-NTKCKAMKKEFIIADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP

A0A1S3CDU1 uncharacterized protein LOC1034998964.3e-7484.15Show/hide
Query:  MASLKLFIIPIFLLMVLAPTQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKCKRVKNLTIAITEADGSFETSLPSDVASIDFEAAAISSPKCIAKLVGGS
        MASLKLFI PIFL MVLA TQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKC+RVKNLTIAIT+ADGSFET LPSD+AS+D EAA    PKCIAKLVGGS
Subjt:  MASLKLFIIPIFLLMVLAPTQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKCKRVKNLTIAITEADGSFETSLPSDVASIDFEAAAISSPKCIAKLVGGS

Query:  HQLFASRKDVASTVIKETNSNFFTIATALKFLTCKQ-NTKCKAMKKEFIIADSKTIDLP-LPPEWGFPPTSYYLPVLPIIGIP
        HQLFASRK++ ST+IKETNS FFTIATAL+F TCK+ N KCKA+KKE  I DSKT DLP LPPEWGFPPTSYYLPVLPIIGIP
Subjt:  HQLFASRKDVASTVIKETNSNFFTIATALKFLTCKQ-NTKCKAMKKEFIIADSKTIDLP-LPPEWGFPPTSYYLPVLPIIGIP

A0A6J1CMJ0 uncharacterized protein LOC1110126396.0e-6070.33Show/hide
Query:  MASLKLFI-IPIFLLMVLAPTQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKCKRVKNLTIAITEADGSFETSLPSDVASIDFEAAAISSPKCIAKLVGG
        MASLKL + +P  LLMVLA TQ AQC TL+AKISCLDC+SNYDFSGN I+VKC++VKNL +AIT  DGSFET+LPSD ++     +      CIAKLVGG
Subjt:  MASLKLFI-IPIFLLMVLAPTQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKCKRVKNLTIAITEADGSFETSLPSDVASIDFEAAAISSPKCIAKLVGG

Query:  SHQLFASRKDVASTVIKETNSNFFTIATALKFLTCKQNTKCKAMKKEFIIADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP
         HQL+ASRKD+AS VIK TNS FFTIATALKF TCKQ+ KC+AMK +F IADSKT+DLPLP EWG  P+SYYLP LPIIGIP
Subjt:  SHQLFASRKDVASTVIKETNSNFFTIATALKFLTCKQNTKCKAMKKEFIIADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP

A0A6J1G179 uncharacterized protein LOC1114497455.3e-6473.48Show/hide
Query:  MASLKLFIIPIFLLMVLAPTQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKCKRVKNLTIAITEADGSFETSLPSDVASIDFEAAAISSPKCIAKLVGGS
        MASLKLFII   L +VLA TQLAQCNTLKA ISCLDCQSNYDFSGN++ V CK VKNL+IAITEA+GSF+T+LPS+ AS D E A      CIAKL+GG 
Subjt:  MASLKLFIIPIFLLMVLAPTQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKCKRVKNLTIAITEADGSFETSLPSDVASIDFEAAAISSPKCIAKLVGGS

Query:  HQLFASRKDVASTVIKETNSNFFTIATALKFLTCKQNTKCKAMKKEFIIADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP
        HQL+ASRK +AS VIK TNSNFFT+A AL F TCK NTKC ++K    +ADSKTIDLPLPPEWGFPPTSYY PVLPIIGIP
Subjt:  HQLFASRKDVASTVIKETNSNFFTIATALKFLTCKQNTKCKAMKKEFIIADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP

A0A6J1HU61 uncharacterized protein LOC1114666114.8e-6574.03Show/hide
Query:  MASLKLFIIPIFLLMVLAPTQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKCKRVKNLTIAITEADGSFETSLPSDVASIDFEAAAISSPKCIAKLVGGS
        MASLKLFII   LLM+LA TQLAQCNTLKA ISCLDCQSNYDFSGN+I+V CK VKNL++AITEA+GSF+T+LPSD  S D +AA      CIAKL+GG 
Subjt:  MASLKLFIIPIFLLMVLAPTQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKCKRVKNLTIAITEADGSFETSLPSDVASIDFEAAAISSPKCIAKLVGGS

Query:  HQLFASRKDVASTVIKETNSNFFTIATALKFLTCKQNTKCKAMKKEFIIADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP
        HQL+ASRK +AS VIK TNSNFFT+A AL F TCK NTKC ++K    +ADSKTIDLPLPPEWGFPPTSYY PVLPIIGIP
Subjt:  HQLFASRKDVASTVIKETNSNFFTIATALKFLTCKQNTKCKAMKKEFIIADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G27385.1 Pollen Ole e 1 allergen and extensin family protein1.9e-2134.25Show/hide
Query:  SLKLFIIPIFLL-MVLAPTQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKCKRVKNLTIAITEADGSFETSLPSDVASIDFEAAAISSPKCIAKLVGGSH
        +L+   + +FL    L+   +   + ++ K+SC DC ++YD+SG  + V C          T+  G F + LPS + S            C A+L G   
Subjt:  SLKLFIIPIFLL-MVLAPTQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKCKRVKNLTIAITEADGSFETSLPSDVASIDFEAAAISSPKCIAKLVGGSH

Query:  QLFASRKDVASTVIKETNSNFFTIATALKFLTCKQNTKCKAMKKEF-IIADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP
        QL+AS+ +V S ++K    + + +++ L FL        K+  + F   + SKT+DLP+PPEWG  PTSYY+P LPIIGIP
Subjt:  QLFASRKDVASTVIKETNSNFFTIATALKFLTCKQNTKCKAMKKEF-IIADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP

AT2G27385.2 Pollen Ole e 1 allergen and extensin family protein1.9e-2134.25Show/hide
Query:  SLKLFIIPIFLL-MVLAPTQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKCKRVKNLTIAITEADGSFETSLPSDVASIDFEAAAISSPKCIAKLVGGSH
        +L+   + +FL    L+   +   + ++ K+SC DC ++YD+SG  + V C          T+  G F + LPS + S            C A+L G   
Subjt:  SLKLFIIPIFLL-MVLAPTQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKCKRVKNLTIAITEADGSFETSLPSDVASIDFEAAAISSPKCIAKLVGGSH

Query:  QLFASRKDVASTVIKETNSNFFTIATALKFLTCKQNTKCKAMKKEF-IIADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP
        QL+AS+ +V S ++K    + + +++ L FL        K+  + F   + SKT+DLP+PPEWG  PTSYY+P LPIIGIP
Subjt:  QLFASRKDVASTVIKETNSNFFTIATALKFLTCKQNTKCKAMKKEF-IIADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP

AT2G27385.3 Pollen Ole e 1 allergen and extensin family protein1.9e-2134.25Show/hide
Query:  SLKLFIIPIFLL-MVLAPTQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKCKRVKNLTIAITEADGSFETSLPSDVASIDFEAAAISSPKCIAKLVGGSH
        +L+   + +FL    L+   +   + ++ K+SC DC ++YD+SG  + V C          T+  G F + LPS + S            C A+L G   
Subjt:  SLKLFIIPIFLL-MVLAPTQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKCKRVKNLTIAITEADGSFETSLPSDVASIDFEAAAISSPKCIAKLVGGSH

Query:  QLFASRKDVASTVIKETNSNFFTIATALKFLTCKQNTKCKAMKKEF-IIADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP
        QL+AS+ +V S ++K    + + +++ L FL        K+  + F   + SKT+DLP+PPEWG  PTSYY+P LPIIGIP
Subjt:  QLFASRKDVASTVIKETNSNFFTIATALKFLTCKQNTKCKAMKKEF-IIADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP

AT5G22430.1 Pollen Ole e 1 allergen and extensin family protein2.5e-2137.57Show/hide
Query:  IFLLMVLAPTQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKCKRVKNLTIAITEADGSFETSLPSDVASIDFEAAAISSPKCIAKLVGGSHQLFASRKDV
        +F L V +  +L+  + +  KISCLDC  ++DFSG  +++KC   K    A+  ADGSF + LP+        A    S  C+AKL+GG  QL+A + ++
Subjt:  IFLLMVLAPTQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKCKRVKNLTIAITEADGSFETSLPSDVASIDFEAAAISSPKCIAKLVGGSHQLFASRKDV

Query:  ASTVIK-ETNSNFFTIATALKF-LTCKQNTKCKAMKKEFIIADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP
         S ++K + +S   T +  L F L+C + ++        +I DSKTI+ P    +GFPP S++ P LPIIGIP
Subjt:  ASTVIK-ETNSNFFTIATALKF-LTCKQNTKCKAMKKEFIIADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTCTTAAGCTTTTCATTATTCCCATTTTCTTGCTTATGGTTCTTGCACCAACTCAACTTGCACAATGCAACACTTTGAAGGCAAAGATCTCTTGCCTTGACTG
TCAATCCAACTATGACTTCTCAGGAAACTTGATTATGGTGAAGTGCAAGAGAGTAAAAAACCTAACCATAGCAATTACCGAAGCCGATGGATCATTCGAAACTTCACTTC
CTTCGGACGTGGCGTCTATCGACTTCGAAGCAGCTGCTATTTCTTCTCCCAAGTGCATAGCCAAGCTTGTTGGGGGATCTCACCAGCTCTTTGCTTCAAGAAAAGACGTG
GCTTCTACTGTCATCAAGGAAACCAACTCCAACTTCTTCACAATTGCTACAGCTCTCAAGTTCTTAACATGCAAACAAAACACAAAATGCAAAGCCATGAAGAAAGAGTT
TATTATTGCAGATTCAAAGACCATTGATTTGCCTTTGCCACCTGAGTGGGGCTTCCCACCCACAAGCTACTATCTCCCTGTGCTTCCTATCATAGGCATTCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTCTTAAGCTTTTCATTATTCCCATTTTCTTGCTTATGGTTCTTGCACCAACTCAACTTGCACAATGCAACACTTTGAAGGCAAAGATCTCTTGCCTTGACTG
TCAATCCAACTATGACTTCTCAGGAAACTTGATTATGGTGAAGTGCAAGAGAGTAAAAAACCTAACCATAGCAATTACCGAAGCCGATGGATCATTCGAAACTTCACTTC
CTTCGGACGTGGCGTCTATCGACTTCGAAGCAGCTGCTATTTCTTCTCCCAAGTGCATAGCCAAGCTTGTTGGGGGATCTCACCAGCTCTTTGCTTCAAGAAAAGACGTG
GCTTCTACTGTCATCAAGGAAACCAACTCCAACTTCTTCACAATTGCTACAGCTCTCAAGTTCTTAACATGCAAACAAAACACAAAATGCAAAGCCATGAAGAAAGAGTT
TATTATTGCAGATTCAAAGACCATTGATTTGCCTTTGCCACCTGAGTGGGGCTTCCCACCCACAAGCTACTATCTCCCTGTGCTTCCTATCATAGGCATTCCTTGA
Protein sequenceShow/hide protein sequence
MASLKLFIIPIFLLMVLAPTQLAQCNTLKAKISCLDCQSNYDFSGNLIMVKCKRVKNLTIAITEADGSFETSLPSDVASIDFEAAAISSPKCIAKLVGGSHQLFASRKDV
ASTVIKETNSNFFTIATALKFLTCKQNTKCKAMKKEFIIADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP