; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10015703 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10015703
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionYae1_N domain-containing protein
Genome locationChr02:29115494..29123688
RNA-Seq ExpressionHG10015703
SyntenyHG10015703
Gene Ontology termsGO:0000398 - mRNA splicing, via spliceosome (biological process)
GO:0005681 - spliceosomal complex (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0004386 - helicase activity (molecular function)
InterPro domainsIPR019191 - Essential protein Yae1, N-terminal
IPR038881 - Yae1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADN34203.1 aquarius [Cucumis melo subsp. melo]1.8e-8588.42Show/hide
Query:  MSQGLVAGSGVDDSCEDDGSLWGGSDEGLEETTDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLASFPDDVKE
        MSQGLVAGSGVDDSC+DDGSLWGGSDEGLEET DLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFN+GFKQSVSVGYKLGLVRGVSSVLAS PDD+KE
Subjt:  MSQGLVAGSGVDDSCEDDGSLWGGSDEGLEETTDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLASFPDDVKE

Query:  KLAGIEENRSKFQSLYESVNSLSTVDALRLFNDDITAQRTKEERVNANTNSQSKDLLKKNSDYGRLRKFYGELEALLHKSPALNVHLHEE
        KLAG  EN+SKFQSLYESVNSLSTVDALRLFN DIT Q TKEE V+ANTNSQ+ DLLKKN DYGRL KFY EL A L +SPALNVHLHEE
Subjt:  KLAGIEENRSKFQSLYESVNSLSTVDALRLFNDDITAQRTKEERVNANTNSQSKDLLKKNSDYGRLRKFYGELEALLHKSPALNVHLHEE

KAA0034742.1 aquarius [Cucumis melo var. makuwa]1.4e-8587.96Show/hide
Query:  MSQGLVAGSGVDDSCEDDGSLWGGSDEGLEETTDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLASFPDDVKE
        MSQGLVAGSGVDDSC+DDGSLWGGSDEGLEET DLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFN+GFKQSVSVGYKLGLVRGVSSVLAS PDD+KE
Subjt:  MSQGLVAGSGVDDSCEDDGSLWGGSDEGLEETTDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLASFPDDVKE

Query:  KLAGIEENRSKFQSLYESVNSLSTVDALRLFNDDITAQRTKEERVNANTNSQSKDLLKKNSDYGRLRKFYGELEALLHKSPALNVHLHEEQ
        KLAG  EN+SKFQSLYESVNSLSTVDALRLFN DIT Q TKEE V+ANTNSQ+ DLLKKN DYGRL KFY EL A L +SPALNVHLHEE+
Subjt:  KLAGIEENRSKFQSLYESVNSLSTVDALRLFNDDITAQRTKEERVNANTNSQSKDLLKKNSDYGRLRKFYGELEALLHKSPALNVHLHEEQ

XP_008446921.1 PREDICTED: uncharacterized protein LOC103489487 [Cucumis melo]7.2e-8287.03Show/hide
Query:  AGSGVDDSCEDDGSLWGGSDEGLEETTDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLASFPDDVKEKLAGIE
        +GSGVDDSC+DDGSLWGGSDEGLEET DLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFN+GFKQSVSVGYKLGLVRGVSSVLAS PDD+KEKLAG  
Subjt:  AGSGVDDSCEDDGSLWGGSDEGLEETTDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLASFPDDVKEKLAGIE

Query:  ENRSKFQSLYESVNSLSTVDALRLFNDDITAQRTKEERVNANTNSQSKDLLKKNSDYGRLRKFYGELEALLHKSPALNVHLHEEQ
        EN+SKFQSLYESVNSLSTVDALRLFN DIT Q TKEE V+ANTNSQ+ DLLKKN DYGRL KFY EL A L +SPALNVHLHEE+
Subjt:  ENRSKFQSLYESVNSLSTVDALRLFNDDITAQRTKEERVNANTNSQSKDLLKKNSDYGRLRKFYGELEALLHKSPALNVHLHEEQ

XP_022990784.1 uncharacterized protein LOC111487537 isoform X1 [Cucurbita maxima]1.6e-8184.13Show/hide
Query:  QGLVAGSGVDDSCEDDGSLWGGSDEGLEETTDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLASFPDDVKEKL
        Q   +GS V+DSCEDDGSLWGGSDEGLEET+DLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFKQSVS+GYKLGLVRGVSSVLAS PDD+KEKL
Subjt:  QGLVAGSGVDDSCEDDGSLWGGSDEGLEETTDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLASFPDDVKEKL

Query:  AGIEENRSKFQSLYESVNSLSTVDALRLFNDDITAQRTKEERVNANTNSQSKDLLKKNSDYGRLRKFYGELEALLHKSPALNVHLHEEQ
         GIEENRSKFQSLYESVNSLST DALRLF+DDI AQ TKEE V+ANT+SQ+ DLLK+NSDY RL +FYGEL+ALL  SPALN+HLHEEQ
Subjt:  AGIEENRSKFQSLYESVNSLSTVDALRLFNDDITAQRTKEERVNANTNSQSKDLLKKNSDYGRLRKFYGELEALLHKSPALNVHLHEEQ

XP_038891531.1 protein YAE1-like [Benincasa hispida]1.9e-8286.17Show/hide
Query:  QGLVAGSGVDDSCEDDGSLWGGSDEGLEETTDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLASFPDDVKEKL
        Q   +GSGVDDSCEDDGSLWGGSDEGLE TTDLDREWQRRH QFHTIGYRDG+IAGKEAAAQEGFNVGFKQSVS+GYK GLVRGVSSVLAS PDD+KEKL
Subjt:  QGLVAGSGVDDSCEDDGSLWGGSDEGLEETTDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLASFPDDVKEKL

Query:  AGIEENRSKFQSLYESVNSLSTVDALRLFNDDITAQRTKEERVNANTNSQSKDLLKKNSDYGRLRKFYGELEALLHKSPALNVHLHEE
        AGIEENRSKFQSLYESVNSLSTVDALRLFND+IT QR KEE +NANTN++S DLLK+ SDY RL KFY ELEALL KSPALNVHLHE+
Subjt:  AGIEENRSKFQSLYESVNSLSTVDALRLFNDDITAQRTKEERVNANTNSQSKDLLKKNSDYGRLRKFYGELEALLHKSPALNVHLHEE

TrEMBL top hitse value%identityAlignment
A0A1S4DX60 uncharacterized protein LOC1034894873.5e-8287.03Show/hide
Query:  AGSGVDDSCEDDGSLWGGSDEGLEETTDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLASFPDDVKEKLAGIE
        +GSGVDDSC+DDGSLWGGSDEGLEET DLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFN+GFKQSVSVGYKLGLVRGVSSVLAS PDD+KEKLAG  
Subjt:  AGSGVDDSCEDDGSLWGGSDEGLEETTDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLASFPDDVKEKLAGIE

Query:  ENRSKFQSLYESVNSLSTVDALRLFNDDITAQRTKEERVNANTNSQSKDLLKKNSDYGRLRKFYGELEALLHKSPALNVHLHEEQ
        EN+SKFQSLYESVNSLSTVDALRLFN DIT Q TKEE V+ANTNSQ+ DLLKKN DYGRL KFY EL A L +SPALNVHLHEE+
Subjt:  ENRSKFQSLYESVNSLSTVDALRLFNDDITAQRTKEERVNANTNSQSKDLLKKNSDYGRLRKFYGELEALLHKSPALNVHLHEEQ

A0A5A7SUA5 Aquarius6.8e-8687.96Show/hide
Query:  MSQGLVAGSGVDDSCEDDGSLWGGSDEGLEETTDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLASFPDDVKE
        MSQGLVAGSGVDDSC+DDGSLWGGSDEGLEET DLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFN+GFKQSVSVGYKLGLVRGVSSVLAS PDD+KE
Subjt:  MSQGLVAGSGVDDSCEDDGSLWGGSDEGLEETTDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLASFPDDVKE

Query:  KLAGIEENRSKFQSLYESVNSLSTVDALRLFNDDITAQRTKEERVNANTNSQSKDLLKKNSDYGRLRKFYGELEALLHKSPALNVHLHEEQ
        KLAG  EN+SKFQSLYESVNSLSTVDALRLFN DIT Q TKEE V+ANTNSQ+ DLLKKN DYGRL KFY EL A L +SPALNVHLHEE+
Subjt:  KLAGIEENRSKFQSLYESVNSLSTVDALRLFNDDITAQRTKEERVNANTNSQSKDLLKKNSDYGRLRKFYGELEALLHKSPALNVHLHEEQ

A0A6J1JR95 uncharacterized protein LOC111487537 isoform X27.7e-8284.13Show/hide
Query:  QGLVAGSGVDDSCEDDGSLWGGSDEGLEETTDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLASFPDDVKEKL
        Q   +GS V+DSCEDDGSLWGGSDEGLEET+DLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFKQSVS+GYKLGLVRGVSSVLAS PDD+KEKL
Subjt:  QGLVAGSGVDDSCEDDGSLWGGSDEGLEETTDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLASFPDDVKEKL

Query:  AGIEENRSKFQSLYESVNSLSTVDALRLFNDDITAQRTKEERVNANTNSQSKDLLKKNSDYGRLRKFYGELEALLHKSPALNVHLHEEQ
         GIEENRSKFQSLYESVNSLST DALRLF+DDI AQ TKEE V+ANT+SQ+ DLLK+NSDY RL +FYGEL+ALL  SPALN+HLHEEQ
Subjt:  AGIEENRSKFQSLYESVNSLSTVDALRLFNDDITAQRTKEERVNANTNSQSKDLLKKNSDYGRLRKFYGELEALLHKSPALNVHLHEEQ

A0A6J1JRH1 uncharacterized protein LOC111487537 isoform X37.7e-8284.13Show/hide
Query:  QGLVAGSGVDDSCEDDGSLWGGSDEGLEETTDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLASFPDDVKEKL
        Q   +GS V+DSCEDDGSLWGGSDEGLEET+DLDREW RRHDQFHTIGYRDGLIAGKEAA+QEGFNVGFKQSVS+GYKLGLVRGVSSVLAS PDD+KEKL
Subjt:  QGLVAGSGVDDSCEDDGSLWGGSDEGLEETTDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLASFPDDVKEKL

Query:  AGIEENRSKFQSLYESVNSLSTVDALRLFNDDITAQRTKEERVNANTNSQSKDLLKKNSDYGRLRKFYGELEALLHKSPALNVHLHEEQ
         GIEENRSKFQSLYESVNSLST DALRLF+DDI AQ TKEE V+ANT+SQ+ DLLK+NSDY RL +FYGEL+ALL  SPALN+HLHEEQ
Subjt:  AGIEENRSKFQSLYESVNSLSTVDALRLFNDDITAQRTKEERVNANTNSQSKDLLKKNSDYGRLRKFYGELEALLHKSPALNVHLHEEQ

E5GCK6 Aquarius8.8e-8688.42Show/hide
Query:  MSQGLVAGSGVDDSCEDDGSLWGGSDEGLEETTDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLASFPDDVKE
        MSQGLVAGSGVDDSC+DDGSLWGGSDEGLEET DLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFN+GFKQSVSVGYKLGLVRGVSSVLAS PDD+KE
Subjt:  MSQGLVAGSGVDDSCEDDGSLWGGSDEGLEETTDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLASFPDDVKE

Query:  KLAGIEENRSKFQSLYESVNSLSTVDALRLFNDDITAQRTKEERVNANTNSQSKDLLKKNSDYGRLRKFYGELEALLHKSPALNVHLHEE
        KLAG  EN+SKFQSLYESVNSLSTVDALRLFN DIT Q TKEE V+ANTNSQ+ DLLKKN DYGRL KFY EL A L +SPALNVHLHEE
Subjt:  KLAGIEENRSKFQSLYESVNSLSTVDALRLFNDDITAQRTKEERVNANTNSQSKDLLKKNSDYGRLRKFYGELEALLHKSPALNVHLHEE

SwissProt top hitse value%identityAlignment
Q9NRH1 Protein YAE1 homolog5.1e-0640.58Show/hide
Query:  DEGLEETTDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLASF
        DE  +E+    REWQ    +    GYRDG+ AGK    Q+GFN G+K+   V    G +RG  S L S+
Subjt:  DEGLEETTDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLASF

Arabidopsis top hitse value%identityAlignment
AT1G34550.1 Protein of unknown function (DUF616)8.9e-0636.84Show/hide
Query:  SVLASFPDDVKEKLAGIEENRSKFQSLYESVNSLSTVDALRLFNDDITAQRTKEERVNANTNSQSKDLLKKNSDYGRLRKFYGELEALLHKSPAL
        +VLA  PD+++EKL   +E R KFQ L+  V++LST  A++ F   +T   TKE    +  N+         SD+G    +  EL +LL KSP +
Subjt:  SVLASFPDDVKEKLAGIEENRSKFQSLYESVNSLSTVDALRLFNDDITAQRTKEERVNANTNSQSKDLLKKNSDYGRLRKFYGELEALLHKSPAL

AT1G34570.1 Essential protein Yae1, N-terminal2.0e-2943.02Show/hide
Query:  DGSLWGGSDEGLEETTDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLASFPDDVKEKLAGIEENRSKFQSLYE
        D   +G SDE   E   LD E ++R  +FH+ GYRDG++ GKEA AQEG+N G+K+SV  GYK G+VRGVSS LA  P + +EKL   +E R KFQ L+ 
Subjt:  DGSLWGGSDEGLEETTDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLASFPDDVKEKLAGIEENRSKFQSLYE

Query:  SVNSLSTVDALRLFNDDITA----QRTKEERVNANTNSQSKDLLKKNSDYGRLRKFYGELEALLHKSPALNV
        SV++LST  A++ F + +T     +++ EE  ++ + S S   +   +D G    +  EL +LL KSP + V
Subjt:  SVNSLSTVDALRLFNDDITA----QRTKEERVNANTNSQSKDLLKKNSDYGRLRKFYGELEALLHKSPALNV

AT3G15750.1 Essential protein Yae1, N-terminal1.3e-2841.76Show/hide
Query:  GSGVDDSCEDDGSLWGGSDEGLEETTDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLASFPDDVKEKLAGIEE
        G  V  S  DD       +E   E   L  E ++R  +FH+ GYRDG++AGKEA AQEG+N G+K+SV  GY+ GLVRGVSS LA  PD+++EKL   +E
Subjt:  GSGVDDSCEDDGSLWGGSDEGLEETTDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLASFPDDVKEKLAGIEE

Query:  NRSKFQSLYESVNSLSTVDALRLFNDDITA----QRTKEERVNANTNSQSKDLLKKNSDYGRLRKFYGELEALLHKSPALNV
         R KFQ L+ SV++LST  A++ F + +T     +++ EE  ++ ++S S   +   +D G    +  EL +LL KSP + V
Subjt:  NRSKFQSLYESVNSLSTVDALRLFNDDITA----QRTKEERVNANTNSQSKDLLKKNSDYGRLRKFYGELEALLHKSPALNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTCAAGGTCTGGTTGCAGGTTCTGGTGTGGATGACTCGTGCGAAGATGATGGATCTTTATGGGGTGGTTCTGATGAAGGCTTGGAGGAAACAACTGATTTGGACAG
GGAGTGGCAGAGAAGACATGACCAATTTCATACGATTGGATACCGCGATGGCTTAATTGCTGGGAAAGAAGCTGCAGCTCAAGAGGGATTTAATGTTGGCTTCAAGCAGT
CAGTCTCTGTCGGATATAAATTGGGTCTTGTCAGAGGTGTTAGCAGTGTGCTTGCTTCCTTTCCTGATGACGTGAAAGAGAAGCTTGCAGGAATTGAAGAGAACCGAAGT
AAATTTCAAAGCTTGTACGAATCTGTGAACTCTCTTTCGACAGTAGATGCACTTAGGCTATTCAATGACGATATTACGGCACAACGCACGAAAGAAGAGCGTGTCAATGC
AAATACAAATTCCCAATCAAAGGATTTGTTAAAGAAAAATTCAGATTATGGTCGTCTAAGAAAGTTCTATGGAGAGCTTGAAGCACTTTTACACAAATCACCTGCTCTGA
ATGTTCATTTACACGAAGAACAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTCAAGGTCTGGTTGCAGGTTCTGGTGTGGATGACTCGTGCGAAGATGATGGATCTTTATGGGGTGGTTCTGATGAAGGCTTGGAGGAAACAACTGATTTGGACAG
GGAGTGGCAGAGAAGACATGACCAATTTCATACGATTGGATACCGCGATGGCTTAATTGCTGGGAAAGAAGCTGCAGCTCAAGAGGGATTTAATGTTGGCTTCAAGCAGT
CAGTCTCTGTCGGATATAAATTGGGTCTTGTCAGAGGTGTTAGCAGTGTGCTTGCTTCCTTTCCTGATGACGTGAAAGAGAAGCTTGCAGGAATTGAAGAGAACCGAAGT
AAATTTCAAAGCTTGTACGAATCTGTGAACTCTCTTTCGACAGTAGATGCACTTAGGCTATTCAATGACGATATTACGGCACAACGCACGAAAGAAGAGCGTGTCAATGC
AAATACAAATTCCCAATCAAAGGATTTGTTAAAGAAAAATTCAGATTATGGTCGTCTAAGAAAGTTCTATGGAGAGCTTGAAGCACTTTTACACAAATCACCTGCTCTGA
ATGTTCATTTACACGAAGAACAGTAG
Protein sequenceShow/hide protein sequence
MSQGLVAGSGVDDSCEDDGSLWGGSDEGLEETTDLDREWQRRHDQFHTIGYRDGLIAGKEAAAQEGFNVGFKQSVSVGYKLGLVRGVSSVLASFPDDVKEKLAGIEENRS
KFQSLYESVNSLSTVDALRLFNDDITAQRTKEERVNANTNSQSKDLLKKNSDYGRLRKFYGELEALLHKSPALNVHLHEEQ