; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10020788 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10020788
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionUnknown protein
Genome locationChr05:2447128..2448628
RNA-Seq ExpressionHG10020788
SyntenyHG10020788
Gene Ontology termsGO:0006606 - protein import into nucleus (biological process)
GO:0005634 - nucleus (cellular component)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR028156 - RPA-interacting protein
IPR028159 - RPA-interacting protein, C-terminal domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYJ99094.1 uncharacterized protein E5676_scaffold248G002980 [Cucumis melo var. makuwa]3.6e-4948.65Show/hide
Query:  MEDDDSNSDIETHRSSIKTHPRYNNQQSWKQK--------------------------------------------------------------------
        MEDD+ N  I+T RSSIKTHPRYNNQQSWKQK                                                                    
Subjt:  MEDDDSNSDIETHRSSIKTHPRYNNQQSWKQK--------------------------------------------------------------------

Query:  -----------------DAYEGEGEEILLEMQRIFYEDLNVDLRQK----------------------------AVKITILSRMFTQLFAESEGPILTWE
                         DAYEG+GEEILLEMQRIFYEDLNVD+RQK                            A K  ILSRMFTQ FAESEGPI+TWE
Subjt:  -----------------DAYEGEGEEILLEMQRIFYEDLNVDLRQK----------------------------AVKITILSRMFTQLFAESEGPILTWE

Query:  DEEDEFLARAVYEHMQLSSEKDLEKFWCPICKQGELQENNHLMIHCTHCGLRLNKGNEV
        DEEDEFLARAVYEHMQLS+EK LEKFWCP+CKQGELQEN+H  IHCT CGLRLNKGNEV
Subjt:  DEEDEFLARAVYEHMQLSSEKDLEKFWCPICKQGELQENNHLMIHCTHCGLRLNKGNEV

XP_008437393.1 PREDICTED: uncharacterized protein LOC103482820 [Cucumis melo]2.1e-4449.35Show/hide
Query:  MEDDDSNSDIETHRSSIKTHPRYNNQQSWKQK--------------------------------------------------------------------
        MEDD+ N  I+T RSSIKTHPRYNNQQSWKQK                                                                    
Subjt:  MEDDDSNSDIETHRSSIKTHPRYNNQQSWKQK--------------------------------------------------------------------

Query:  -----------------DAYEGEGEEILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPILTWEDEEDEFLARAVYEHMQLSSEKDLEKFWC
                         DAYEG+GEEILLEMQRIFYEDLNVD+RQK                ESEGPI+TWEDEEDEFLARAVYEHMQLS+EK LEKFWC
Subjt:  -----------------DAYEGEGEEILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPILTWEDEEDEFLARAVYEHMQLSSEKDLEKFWC

Query:  PICKQGELQENNHLMIHCTHCGLRLNKGNEV
        P+CKQGELQEN+H  IHCT CGLRLNKGNEV
Subjt:  PICKQGELQENNHLMIHCTHCGLRLNKGNEV

XP_011651131.1 uncharacterized protein LOC101216926 isoform X1 [Cucumis sativus]9.3e-4549.78Show/hide
Query:  MEDDDSNSDIETHRSSIKTHPRYNNQQSWKQK--------------------------------------------------------------------
        ME+D  N  I+T RSSIKTHPRYNNQQSWKQK                                                                    
Subjt:  MEDDDSNSDIETHRSSIKTHPRYNNQQSWKQK--------------------------------------------------------------------

Query:  -----------------DAYEGEGEEILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPILTWEDEEDEFLARAVYEHMQLSSEKDLEKFWC
                         DAYEG+GEEILLEMQRIFYEDLNVDLRQK                ESE PI+TWEDEEDEFLARAVYEHMQLS+EK LEKFWC
Subjt:  -----------------DAYEGEGEEILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPILTWEDEEDEFLARAVYEHMQLSSEKDLEKFWC

Query:  PICKQGELQENNHLMIHCTHCGLRLNKGNEV
        P+CKQGELQENNH  IHCTHCGLRLNKGNEV
Subjt:  PICKQGELQENNHLMIHCTHCGLRLNKGNEV

XP_031737335.1 uncharacterized protein LOC101216926 isoform X3 [Cucumis sativus]2.9e-4650.21Show/hide
Query:  MEDDDSNSDIETHRSSIKTHPRYNNQQSWKQK--------------------------------------------------------------------
        ME+D  N  I+T RSSIKTHPRYNNQQSWKQK                                                                    
Subjt:  MEDDDSNSDIETHRSSIKTHPRYNNQQSWKQK--------------------------------------------------------------------

Query:  -----------------DAYEGEGEEILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPILTWEDEEDEFLARAVYEHMQLSSEKDLEKFWC
                         DAYEG+GEEILLEMQRIFYEDLNVDLRQK                ESE PI+TWEDEEDEFLARAVYEHMQLS+EK LEKFWC
Subjt:  -----------------DAYEGEGEEILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPILTWEDEEDEFLARAVYEHMQLSSEKDLEKFWC

Query:  PICKQGELQENNHLMIHCTHCGLRLNKGNEVFH
        P+CKQGELQENNH  IHCTHCGLRLNKGNEVFH
Subjt:  PICKQGELQENNHLMIHCTHCGLRLNKGNEVFH

XP_038893884.1 uncharacterized protein LOC120082685 isoform X1 [Benincasa hispida]4.5e-4751.08Show/hide
Query:  MEDDDSNSDIETHRSSIKTHPRYNNQQSWKQK--------------------------------------------------------------------
        MEDDD NS I+THRSSIKTHPRYNNQQSWKQK                                                                    
Subjt:  MEDDDSNSDIETHRSSIKTHPRYNNQQSWKQK--------------------------------------------------------------------

Query:  -----------------DAYEGEGEEILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPILTWEDEEDEFLARAVYEHMQLSSEKDLEKFWC
                         DAYEGEGEEILLEMQRIFYEDLN+DLR K                ESEGPI+TWEDEEDEFLARAVYEHM+L+SEK LEKFWC
Subjt:  -----------------DAYEGEGEEILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPILTWEDEEDEFLARAVYEHMQLSSEKDLEKFWC

Query:  PICKQGELQENNHLMIHCTHCGLRLNKGNEV
        PICKQGELQENNH  IHCTHCGLRL KGNEV
Subjt:  PICKQGELQENNHLMIHCTHCGLRLNKGNEV

TrEMBL top hitse value%identityAlignment
A0A0A0LTY4 Uncharacterized protein4.5e-4549.78Show/hide
Query:  MEDDDSNSDIETHRSSIKTHPRYNNQQSWKQK--------------------------------------------------------------------
        ME+D  N  I+T RSSIKTHPRYNNQQSWKQK                                                                    
Subjt:  MEDDDSNSDIETHRSSIKTHPRYNNQQSWKQK--------------------------------------------------------------------

Query:  -----------------DAYEGEGEEILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPILTWEDEEDEFLARAVYEHMQLSSEKDLEKFWC
                         DAYEG+GEEILLEMQRIFYEDLNVDLRQK                ESE PI+TWEDEEDEFLARAVYEHMQLS+EK LEKFWC
Subjt:  -----------------DAYEGEGEEILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPILTWEDEEDEFLARAVYEHMQLSSEKDLEKFWC

Query:  PICKQGELQENNHLMIHCTHCGLRLNKGNEV
        P+CKQGELQENNH  IHCTHCGLRLNKGNEV
Subjt:  PICKQGELQENNHLMIHCTHCGLRLNKGNEV

A0A1S3AU18 uncharacterized protein LOC1034828201.0e-4449.35Show/hide
Query:  MEDDDSNSDIETHRSSIKTHPRYNNQQSWKQK--------------------------------------------------------------------
        MEDD+ N  I+T RSSIKTHPRYNNQQSWKQK                                                                    
Subjt:  MEDDDSNSDIETHRSSIKTHPRYNNQQSWKQK--------------------------------------------------------------------

Query:  -----------------DAYEGEGEEILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPILTWEDEEDEFLARAVYEHMQLSSEKDLEKFWC
                         DAYEG+GEEILLEMQRIFYEDLNVD+RQK                ESEGPI+TWEDEEDEFLARAVYEHMQLS+EK LEKFWC
Subjt:  -----------------DAYEGEGEEILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPILTWEDEEDEFLARAVYEHMQLSSEKDLEKFWC

Query:  PICKQGELQENNHLMIHCTHCGLRLNKGNEV
        P+CKQGELQEN+H  IHCT CGLRLNKGNEV
Subjt:  PICKQGELQENNHLMIHCTHCGLRLNKGNEV

A0A5D3BJ40 Uncharacterized protein1.8e-4948.65Show/hide
Query:  MEDDDSNSDIETHRSSIKTHPRYNNQQSWKQK--------------------------------------------------------------------
        MEDD+ N  I+T RSSIKTHPRYNNQQSWKQK                                                                    
Subjt:  MEDDDSNSDIETHRSSIKTHPRYNNQQSWKQK--------------------------------------------------------------------

Query:  -----------------DAYEGEGEEILLEMQRIFYEDLNVDLRQK----------------------------AVKITILSRMFTQLFAESEGPILTWE
                         DAYEG+GEEILLEMQRIFYEDLNVD+RQK                            A K  ILSRMFTQ FAESEGPI+TWE
Subjt:  -----------------DAYEGEGEEILLEMQRIFYEDLNVDLRQK----------------------------AVKITILSRMFTQLFAESEGPILTWE

Query:  DEEDEFLARAVYEHMQLSSEKDLEKFWCPICKQGELQENNHLMIHCTHCGLRLNKGNEV
        DEEDEFLARAVYEHMQLS+EK LEKFWCP+CKQGELQEN+H  IHCT CGLRLNKGNEV
Subjt:  DEEDEFLARAVYEHMQLSSEKDLEKFWCPICKQGELQENNHLMIHCTHCGLRLNKGNEV

A0A6J1GT42 uncharacterized protein LOC111457192 isoform X17.2e-4348.02Show/hide
Query:  MEDDDSNSDIETHRSSIKTHPRYNNQQSWKQK--------------------------------------------------------------------
        ME+ D +S  +THRSSIKTHPRYNN QSWK+K                                                                    
Subjt:  MEDDDSNSDIETHRSSIKTHPRYNNQQSWKQK--------------------------------------------------------------------

Query:  -------------DAYEGEGEEILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPILTWEDEEDEFLARAVYEHMQLSSEKDLEKFWCPICK
                     DAY GEGEEILLEMQRIFYEDLNVDL+QK                ESEGPI+TWEDEEDEFLARAVYEHMQL+SEKDL K WCPICK
Subjt:  -------------DAYEGEGEEILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPILTWEDEEDEFLARAVYEHMQLSSEKDLEKFWCPICK

Query:  QGELQENNHLMIHCTHCGLRLNKGNEV
        QG+LQEN H  IHCTHCG++LNKGNEV
Subjt:  QGELQENNHLMIHCTHCGLRLNKGNEV

A0A6J1JWH1 uncharacterized protein LOC111490304 isoform X11.6e-4247.58Show/hide
Query:  MEDDDSNSDIETHRSSIKTHPRYNNQQSWKQK--------------------------------------------------------------------
        ME+ D +S  +THRSSIKTHPRYNN QSWK+K                                                                    
Subjt:  MEDDDSNSDIETHRSSIKTHPRYNNQQSWKQK--------------------------------------------------------------------

Query:  -------------DAYEGEGEEILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPILTWEDEEDEFLARAVYEHMQLSSEKDLEKFWCPICK
                     DAY GEGEEILLEMQRIFYEDLNVDL+QK                ESEGPI+TWEDEED+FLARAVYEHMQL+SEKDL K WCPICK
Subjt:  -------------DAYEGEGEEILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPILTWEDEEDEFLARAVYEHMQLSSEKDLEKFWCPICK

Query:  QGELQENNHLMIHCTHCGLRLNKGNEV
        QG+LQEN H  IHCTHCG++LNKGNEV
Subjt:  QGELQENNHLMIHCTHCGLRLNKGNEV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G12760.1 unknown protein9.0e-2247.83Show/hide
Query:  KDAYEGEGEEILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPILTWEDEEDEFLARAVYEHMQLSSEKDLEKFWCPICKQGELQENNHLMI
        K  YEG+ EEILLEMQ+IFY+DL        +  T ++  F Q        + TWEDEED++LA  V ++M L+SE++  + WCPICK+GEL E NH  I
Subjt:  KDAYEGEGEEILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPILTWEDEEDEFLARAVYEHMQLSSEKDLEKFWCPICKQGELQENNHLMI

Query:  HCTHCGLRLNKGNEV
         C  C ++LNKG EV
Subjt:  HCTHCGLRLNKGNEV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGACGATGATTCAAATTCCGACATTGAAACCCATCGTTCTTCGATTAAGACCCATCCTCGCTACAACAATCAACAGTCATGGAAGCAGAAGGATGCTTATGAAGG
TGAAGGTGAAGAAATATTGTTGGAAATGCAAAGGATTTTTTATGAAGATCTGAATGTTGATCTGAGACAAAAAGCTGTCAAAATCACTATTTTGAGTAGGATGTTTACTC
AGCTCTTTGCAGAATCTGAAGGCCCTATTCTAACATGGGAAGATGAAGAAGACGAGTTCTTAGCCCGTGCAGTTTACGAGCATATGCAACTTAGTAGTGAGAAGGATCTT
GAGAAGTTTTGGTGTCCTATATGTAAACAAGGAGAGCTGCAAGAGAACAACCACTTGATGATACATTGCACTCATTGTGGACTTCGGCTTAACAAAGGCAATGAGGTTTT
CCACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGACGATGATTCAAATTCCGACATTGAAACCCATCGTTCTTCGATTAAGACCCATCCTCGCTACAACAATCAACAGTCATGGAAGCAGAAGGATGCTTATGAAGG
TGAAGGTGAAGAAATATTGTTGGAAATGCAAAGGATTTTTTATGAAGATCTGAATGTTGATCTGAGACAAAAAGCTGTCAAAATCACTATTTTGAGTAGGATGTTTACTC
AGCTCTTTGCAGAATCTGAAGGCCCTATTCTAACATGGGAAGATGAAGAAGACGAGTTCTTAGCCCGTGCAGTTTACGAGCATATGCAACTTAGTAGTGAGAAGGATCTT
GAGAAGTTTTGGTGTCCTATATGTAAACAAGGAGAGCTGCAAGAGAACAACCACTTGATGATACATTGCACTCATTGTGGACTTCGGCTTAACAAAGGCAATGAGGTTTT
CCACTAA
Protein sequenceShow/hide protein sequence
MEDDDSNSDIETHRSSIKTHPRYNNQQSWKQKDAYEGEGEEILLEMQRIFYEDLNVDLRQKAVKITILSRMFTQLFAESEGPILTWEDEEDEFLARAVYEHMQLSSEKDL
EKFWCPICKQGELQENNHLMIHCTHCGLRLNKGNEVFH