; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G12140 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G12140
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionPollen Ole e 1 allergen and extensin family protein
Genome locationClcChr01:20777250..20779588
RNA-Seq ExpressionClc01G12140
SyntenyClc01G12140
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139493.1 uncharacterized protein LOC101207654 [Cucumis sativus]1.4e-6777.9Show/hide
Query:  MAFLKLFITPIFLLMVL-APTQLAQCNSLKAKISCLDCQSNYDFSGNLIMVKCKSVKNLTIAITEADGSFETSLPSDVASIDFEAASSSPKCIAKLVGGS
        MA LKLFI+P F+ +VL A TQ AQCN+LKAKISCLDCQSNYDFSGNLIMVKC+  KNLTIAIT+ADGSFETSLPS++AS   EAA SSPKCIAKL+GGS
Subjt:  MAFLKLFITPIFLLMVL-APTQLAQCNSLKAKISCLDCQSNYDFSGNLIMVKCKSVKNLTIAITEADGSFETSLPSDVASIDFEAASSSPKCIAKLVGGS

Query:  HQLFASRKDMASTVIKETNSNFFTISSPLKFSTCKQ-NIKCKAMKKEFAADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP
        HQLFASRK+M ST+IKETNS FFTI++ LKFSTCK+ +  CKA+KKE   DSKT D PLPPEWGFPPTSYY+PVLPIIGIP
Subjt:  HQLFASRKDMASTVIKETNSNFFTISSPLKFSTCKQ-NIKCKAMKKEFAADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP

XP_008461260.1 PREDICTED: uncharacterized protein LOC103499896 [Cucumis melo]3.2e-7281.77Show/hide
Query:  MAFLKLFITPIFLLMVLAPTQLAQCNSLKAKISCLDCQSNYDFSGNLIMVKCKSVKNLTIAITEADGSFETSLPSDVASIDFEAASSSPKCIAKLVGGSH
        MA LKLFI PIFL MVLA TQLAQCN+LKAKISCLDCQSNYDFSGNLIMVKC+ VKNLTIAIT+ADGSFET LPSD+AS+D EAA   PKCIAKLVGGSH
Subjt:  MAFLKLFITPIFLLMVLAPTQLAQCNSLKAKISCLDCQSNYDFSGNLIMVKCKSVKNLTIAITEADGSFETSLPSDVASIDFEAASSSPKCIAKLVGGSH

Query:  QLFASRKDMASTVIKETNSNFFTISSPLKFSTCKQ-NIKCKAMKKEFAADSKTIDLP-LPPEWGFPPTSYYLPVLPIIGIP
        QLFASRK++ ST+IKETNS FFTI++ L+FSTCK+ N KCKA+KKE   DSKT DLP LPPEWGFPPTSYYLPVLPIIGIP
Subjt:  QLFASRKDMASTVIKETNSNFFTISSPLKFSTCKQ-NIKCKAMKKEFAADSKTIDLP-LPPEWGFPPTSYYLPVLPIIGIP

XP_023542373.1 uncharacterized protein LOC111802294 [Cucurbita pepo subsp. pepo]8.5e-6575.42Show/hide
Query:  MAFLKLFITPIFLLMVLAPTQLAQCNSLKAKISCLDCQSNYDFSGNLIMVKCKSVKNLTIAITEADGSFETSLPSDVASIDFEAASSSPKCIAKLVGGSH
        MA LKLFI    LLMVLA TQLAQCN+LKA ISCLDCQSNYDFSGN+I+V CK+VKNL++AITEA+GSF+T+LPSD+AS D EAA S+  CIAKL GG H
Subjt:  MAFLKLFITPIFLLMVLAPTQLAQCNSLKAKISCLDCQSNYDFSGNLIMVKCKSVKNLTIAITEADGSFETSLPSDVASIDFEAASSSPKCIAKLVGGSH

Query:  QLFASRKDMASTVIKETNSNFFTISSPLKFSTCKQNIKCKAMKKEFAADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP
        QL+ASRK M+STVIK TNSNFFT+++ L FSTCK N KC ++ K F ADSKTIDLPLPPEWGFPPTSYY PVLPIIGIP
Subjt:  QLFASRKDMASTVIKETNSNFFTISSPLKFSTCKQNIKCKAMKKEFAADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP

XP_031737339.1 uncharacterized protein LOC116402231 [Cucumis sativus]2.5e-6477.84Show/hide
Query:  LMVLAPTQLAQCNSLKAKISCLDCQSNYDFSGNLIMVKCKSVKNLTIAITEADGSFETSLPSDVASIDFEAASSSPKCIAKLVGGSHQLFASRKDMASTV
        ++++A TQ AQCN+LKAKISCLDCQSNYDFSGNLIMVKC+  KNLTIAIT+ADGSFETSLPS++AS   EAA SSPKCIAKL+GGSHQLFASRK+M ST+
Subjt:  LMVLAPTQLAQCNSLKAKISCLDCQSNYDFSGNLIMVKCKSVKNLTIAITEADGSFETSLPSDVASIDFEAASSSPKCIAKLVGGSHQLFASRKDMASTV

Query:  IKETNSNFFTISSPLKFSTCKQ-NIKCKAMKKEFAADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP
        IKETNS FFTI++ LKFSTCK+ +  CKA+KKE   DSKT D PLPPEWGFPPTSYY+PVLPIIGIP
Subjt:  IKETNSNFFTISSPLKFSTCKQ-NIKCKAMKKEFAADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP

XP_038895206.1 uncharacterized protein LOC120083500 [Benincasa hispida]9.7e-7783.8Show/hide
Query:  MAFLKLFITPIFLLMVLAPTQLAQCNSLKAKISCLDCQSNYDFSGNLIMVKCKSVKNLTIAITEADGSFETSLPSDVASIDFEAASSSPKCIAKLVGGSH
        MA L LFI PIFLL+VLAPT LAQ N+LKAKISCLDCQSNYD SGNLIMV C+ VKNLT+AIT+ADGSFET LPSDVAS+DFEAA SSPKCIAKLVGG+H
Subjt:  MAFLKLFITPIFLLMVLAPTQLAQCNSLKAKISCLDCQSNYDFSGNLIMVKCKSVKNLTIAITEADGSFETSLPSDVASIDFEAASSSPKCIAKLVGGSH

Query:  QLFASRKDMASTVIKETNSNFFTISSPLKFSTCKQNIKCKAMKKEFAADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP
        QLFASRKDMAST+IK+TN  FFTI++PLKFSTCK+N KCKAMKK+F  DSKTIDLPLPPEWGFPPTSYY PVLPIIGIP
Subjt:  QLFASRKDMASTVIKETNSNFFTISSPLKFSTCKQNIKCKAMKKEFAADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP

TrEMBL top hitse value%identityAlignment
A0A0A0LYA1 Uncharacterized protein6.8e-6877.9Show/hide
Query:  MAFLKLFITPIFLLMVL-APTQLAQCNSLKAKISCLDCQSNYDFSGNLIMVKCKSVKNLTIAITEADGSFETSLPSDVASIDFEAASSSPKCIAKLVGGS
        MA LKLFI+P F+ +VL A TQ AQCN+LKAKISCLDCQSNYDFSGNLIMVKC+  KNLTIAIT+ADGSFETSLPS++AS   EAA SSPKCIAKL+GGS
Subjt:  MAFLKLFITPIFLLMVL-APTQLAQCNSLKAKISCLDCQSNYDFSGNLIMVKCKSVKNLTIAITEADGSFETSLPSDVASIDFEAASSSPKCIAKLVGGS

Query:  HQLFASRKDMASTVIKETNSNFFTISSPLKFSTCKQ-NIKCKAMKKEFAADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP
        HQLFASRK+M ST+IKETNS FFTI++ LKFSTCK+ +  CKA+KKE   DSKT D PLPPEWGFPPTSYY+PVLPIIGIP
Subjt:  HQLFASRKDMASTVIKETNSNFFTISSPLKFSTCKQ-NIKCKAMKKEFAADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP

A0A1S3CDU1 uncharacterized protein LOC1034998961.6e-7281.77Show/hide
Query:  MAFLKLFITPIFLLMVLAPTQLAQCNSLKAKISCLDCQSNYDFSGNLIMVKCKSVKNLTIAITEADGSFETSLPSDVASIDFEAASSSPKCIAKLVGGSH
        MA LKLFI PIFL MVLA TQLAQCN+LKAKISCLDCQSNYDFSGNLIMVKC+ VKNLTIAIT+ADGSFET LPSD+AS+D EAA   PKCIAKLVGGSH
Subjt:  MAFLKLFITPIFLLMVLAPTQLAQCNSLKAKISCLDCQSNYDFSGNLIMVKCKSVKNLTIAITEADGSFETSLPSDVASIDFEAASSSPKCIAKLVGGSH

Query:  QLFASRKDMASTVIKETNSNFFTISSPLKFSTCKQ-NIKCKAMKKEFAADSKTIDLP-LPPEWGFPPTSYYLPVLPIIGIP
        QLFASRK++ ST+IKETNS FFTI++ L+FSTCK+ N KCKA+KKE   DSKT DLP LPPEWGFPPTSYYLPVLPIIGIP
Subjt:  QLFASRKDMASTVIKETNSNFFTISSPLKFSTCKQ-NIKCKAMKKEFAADSKTIDLP-LPPEWGFPPTSYYLPVLPIIGIP

A0A6J1CMJ0 uncharacterized protein LOC1110126394.4e-5970Show/hide
Query:  MAFLKLFIT-PIFLLMVLAPTQLAQCNSLKAKISCLDCQSNYDFSGNLIMVKCKSVKNLTIAITEADGSFETSLPSDVASIDFEAASSSPKCIAKLVGGS
        MA LKL +T P  LLMVLA TQ AQC +L+AKISCLDC+SNYDFSGN I+VKC+ VKNL +AIT  DGSFET+LPSD ++    + S    CIAKLVGG 
Subjt:  MAFLKLFIT-PIFLLMVLAPTQLAQCNSLKAKISCLDCQSNYDFSGNLIMVKCKSVKNLTIAITEADGSFETSLPSDVASIDFEAASSSPKCIAKLVGGS

Query:  HQLFASRKDMASTVIKETNSNFFTISSPLKFSTCKQNIKCKAMKKEFAADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP
        HQL+ASRKDMAS VIK TNS FFTI++ LKFSTCKQ+ KC+AMK +F ADSKT+DLPLP EWG  P+SYYLP LPIIGIP
Subjt:  HQLFASRKDMASTVIKETNSNFFTISSPLKFSTCKQNIKCKAMKKEFAADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP

A0A6J1G179 uncharacterized protein LOC1114497451.9e-6273.18Show/hide
Query:  MAFLKLFITPIFLLMVLAPTQLAQCNSLKAKISCLDCQSNYDFSGNLIMVKCKSVKNLTIAITEADGSFETSLPSDVASIDFEAASSSPKCIAKLVGGSH
        MA LKLFI    L +VLA TQLAQCN+LKA ISCLDCQSNYDFSGN++ V CK+VKNL+IAITEA+GSF+T+LPS+ AS D E A S+  CIAKL+GG H
Subjt:  MAFLKLFITPIFLLMVLAPTQLAQCNSLKAKISCLDCQSNYDFSGNLIMVKCKSVKNLTIAITEADGSFETSLPSDVASIDFEAASSSPKCIAKLVGGSH

Query:  QLFASRKDMASTVIKETNSNFFTISSPLKFSTCKQNIKCKAMKKEFAADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP
        QL+ASRK MAS VIK TNSNFFT+++ L FSTCK N KC ++ K F ADSKTIDLPLPPEWGFPPTSYY PVLPIIGIP
Subjt:  QLFASRKDMASTVIKETNSNFFTISSPLKFSTCKQNIKCKAMKKEFAADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP

A0A6J1HU61 uncharacterized protein LOC1114666111.7e-6373.74Show/hide
Query:  MAFLKLFITPIFLLMVLAPTQLAQCNSLKAKISCLDCQSNYDFSGNLIMVKCKSVKNLTIAITEADGSFETSLPSDVASIDFEAASSSPKCIAKLVGGSH
        MA LKLFI    LLM+LA TQLAQCN+LKA ISCLDCQSNYDFSGN+I+V CK+VKNL++AITEA+GSF+T+LPSD  S D +AA S+  CIAKL+GG H
Subjt:  MAFLKLFITPIFLLMVLAPTQLAQCNSLKAKISCLDCQSNYDFSGNLIMVKCKSVKNLTIAITEADGSFETSLPSDVASIDFEAASSSPKCIAKLVGGSH

Query:  QLFASRKDMASTVIKETNSNFFTISSPLKFSTCKQNIKCKAMKKEFAADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP
        QL+ASRK MAS VIK TNSNFFT+++ L FSTCK N KC ++ K F ADSKTIDLPLPPEWGFPPTSYY PVLPIIGIP
Subjt:  QLFASRKDMASTVIKETNSNFFTISSPLKFSTCKQNIKCKAMKKEFAADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G27385.1 Pollen Ole e 1 allergen and extensin family protein3.0e-2033.69Show/hide
Query:  NLLLLSMAFLKLFITPIFLLMVLAPTQLAQCNSLKAKISCLDCQSNYDFSGNLIMVKCKSVKNLTIAITEADGSFETSLPSDVASIDFEAASSSPKCIAK
        +L    MA    F+        L+   +   + ++ K+SC DC ++YD+SG  + V C          T+  G F + LPS + S           C A+
Subjt:  NLLLLSMAFLKLFITPIFLLMVLAPTQLAQCNSLKAKISCLDCQSNYDFSGNLIMVKCKSVKNLTIAITEADGSFETSLPSDVASIDFEAASSSPKCIAK

Query:  LVGGSHQLFASRKDMASTVIKETNSNFFTISSPLKFSTCKQNIKCKAMKKEFA--ADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP
        L G   QL+AS+ ++ S ++K    + + +SS L F         K+  + F   + SKT+DLP+PPEWG  PTSYY+P LPIIGIP
Subjt:  LVGGSHQLFASRKDMASTVIKETNSNFFTISSPLKFSTCKQNIKCKAMKKEFA--ADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP

AT2G27385.2 Pollen Ole e 1 allergen and extensin family protein1.0e-2034.25Show/hide
Query:  MAFLKLFITPIFLLMVLAPTQLAQCNSLKAKISCLDCQSNYDFSGNLIMVKCKSVKNLTIAITEADGSFETSLPSDVASIDFEAASSSPKCIAKLVGGSH
        MA    F+        L+   +   + ++ K+SC DC ++YD+SG  + V C          T+  G F + LPS + S           C A+L G   
Subjt:  MAFLKLFITPIFLLMVLAPTQLAQCNSLKAKISCLDCQSNYDFSGNLIMVKCKSVKNLTIAITEADGSFETSLPSDVASIDFEAASSSPKCIAKLVGGSH

Query:  QLFASRKDMASTVIKETNSNFFTISSPLKFSTCKQNIKCKAMKKEFA--ADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP
        QL+AS+ ++ S ++K    + + +SS L F         K+  + F   + SKT+DLP+PPEWG  PTSYY+P LPIIGIP
Subjt:  QLFASRKDMASTVIKETNSNFFTISSPLKFSTCKQNIKCKAMKKEFA--ADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP

AT2G27385.3 Pollen Ole e 1 allergen and extensin family protein1.0e-2034.25Show/hide
Query:  MAFLKLFITPIFLLMVLAPTQLAQCNSLKAKISCLDCQSNYDFSGNLIMVKCKSVKNLTIAITEADGSFETSLPSDVASIDFEAASSSPKCIAKLVGGSH
        MA    F+        L+   +   + ++ K+SC DC ++YD+SG  + V C          T+  G F + LPS + S           C A+L G   
Subjt:  MAFLKLFITPIFLLMVLAPTQLAQCNSLKAKISCLDCQSNYDFSGNLIMVKCKSVKNLTIAITEADGSFETSLPSDVASIDFEAASSSPKCIAKLVGGSH

Query:  QLFASRKDMASTVIKETNSNFFTISSPLKFSTCKQNIKCKAMKKEFA--ADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP
        QL+AS+ ++ S ++K    + + +SS L F         K+  + F   + SKT+DLP+PPEWG  PTSYY+P LPIIGIP
Subjt:  QLFASRKDMASTVIKETNSNFFTISSPLKFSTCKQNIKCKAMKKEFA--ADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP

AT5G22430.1 Pollen Ole e 1 allergen and extensin family protein5.5e-2237.36Show/hide
Query:  IFLLMVLAPTQLAQCNSLKAKISCLDCQSNYDFSGNLIMVKCKSVKNLTIAITEADGSFETSLPSDVASIDFEAASSSPKCIAKLVGGSHQLFASRKDMA
        +F L V +  +L+  + +  KISCLDC  ++DFSG  +++KC   K    A+  ADGSF + LP+            S  C+AKL+GG  QL+A + ++ 
Subjt:  IFLLMVLAPTQLAQCNSLKAKISCLDCQSNYDFSGNLIMVKCKSVKNLTIAITEADGSFETSLPSDVASIDFEAASSSPKCIAKLVGGSHQLFASRKDMA

Query:  STVIK-ETNSNFFTISSPLKFSTCKQNIKCKAMKKE----FAADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP
        S ++K + +S   T S+PL FS     + C    ++       DSKTI+ P    +GFPP S++ P LPIIGIP
Subjt:  STVIK-ETNSNFFTISSPLKFSTCKQNIKCKAMKKE----FAADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGGCTTCTTCACTTCACATGTCTTTCAGTTTGAGCACCAAAAGTTTGATAAATCTATTATTGCTTTCCATGGCTTTTCTTAAGCTTTTTATTACTCCCATTTTCTT
GCTTATGGTTCTTGCACCAACTCAACTTGCACAATGCAACAGTTTGAAGGCAAAGATCTCTTGCCTTGACTGTCAATCCAACTATGACTTCTCAGGAAACTTGATTATGG
TGAAGTGTAAGAGTGTGAAAAACCTAACGATAGCAATTACCGAAGCCGATGGATCGTTTGAAACCTCACTTCCTTCCGACGTGGCCTCCATCGACTTCGAAGCAGCTTCT
TCTTCTCCCAAGTGCATAGCCAAGCTTGTTGGGGGATCTCACCAGCTCTTTGCTTCAAGGAAAGATATGGCTTCTACTGTCATCAAGGAAACCAACTCCAACTTCTTCAC
AATTTCTAGTCCTCTCAAGTTCTCCACATGCAAACAAAACATAAAATGCAAAGCCATGAAAAAAGAGTTTGCTGCAGATTCAAAGACCATCGATTTGCCTTTGCCACCTG
AGTGGGGCTTCCCACCCACAAGCTACTATCTCCCAGTGCTTCCCATTATAGGCATTCCTTGA
mRNA sequenceShow/hide mRNA sequence
TATAGGGTTGGCATATGCTGGCTTCTTCACTTCACATGTCTTTCAGTTTGAGCACCAAAAGTTTGATAAATCTATTATTGCTTTCCATGGCTTTTCTTAAGCTTTTTATT
ACTCCCATTTTCTTGCTTATGGTTCTTGCACCAACTCAACTTGCACAATGCAACAGTTTGAAGGCAAAGATCTCTTGCCTTGACTGTCAATCCAACTATGACTTCTCAGG
AAACTTGATTATGGTGAAGTGTAAGAGTGTGAAAAACCTAACGATAGCAATTACCGAAGCCGATGGATCGTTTGAAACCTCACTTCCTTCCGACGTGGCCTCCATCGACT
TCGAAGCAGCTTCTTCTTCTCCCAAGTGCATAGCCAAGCTTGTTGGGGGATCTCACCAGCTCTTTGCTTCAAGGAAAGATATGGCTTCTACTGTCATCAAGGAAACCAAC
TCCAACTTCTTCACAATTTCTAGTCCTCTCAAGTTCTCCACATGCAAACAAAACATAAAATGCAAAGCCATGAAAAAAGAGTTTGCTGCAGATTCAAAGACCATCGATTT
GCCTTTGCCACCTGAGTGGGGCTTCCCACCCACAAGCTACTATCTCCCAGTGCTTCCCATTATAGGCATTCCTTGAGCATTAACTCTATGTTTGAGGAGCATATAATGCA
TGGCTAAGTGTTACAATAAATGTAGCGTGTCTGGGATTTGTATCTTCAAAGCTCATGGCCAAGATAAGGGCTTTAAGCAAGTTGGCTTAATATAGTTATTTTATGCTGCC
TATTAATTCTCTATTGTATGGTTTAAATATGTAGGTAAGTTAAAATGGTAAATTGTAACTTTTAAGGGGTAGTTTCTTTTATGAAAATGAAGATGGTCAAAATCTTCAAA
TGCCTAAATATATTTGAAAATTATGTAGATGTGCAACGTTTTTAAATAACCATGTTCGTAATATACAAAAATTTATAAAAGATTCAAGGAATATTAGATAGTCA
Protein sequenceShow/hide protein sequence
MLASSLHMSFSLSTKSLINLLLLSMAFLKLFITPIFLLMVLAPTQLAQCNSLKAKISCLDCQSNYDFSGNLIMVKCKSVKNLTIAITEADGSFETSLPSDVASIDFEAAS
SSPKCIAKLVGGSHQLFASRKDMASTVIKETNSNFFTISSPLKFSTCKQNIKCKAMKKEFAADSKTIDLPLPPEWGFPPTSYYLPVLPIIGIP