; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10015031 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10015031
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionGag-Pol polyprotein/retrotransposon
Genome locationChr02:23161317..23165210
RNA-Seq ExpressionHG10015031
SyntenyHG10015031
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8648632.1 hypothetical protein Csa_009176 [Cucumis sativus]3.8e-7977.83Show/hide
Query:  FFRIRTNCSRKQFPELPRHFAP-QNRISFFVSRNPSFRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENSEKHPWKSNEYYTTCLAYIWVLDKLRRY
        F RIR+NCSR Q PELPRHFAP Q++I F VSRNPS R CLSNA+ISANDPLKSED FSNHEMEGSMEKNEN +KHP KSNE          VLDKLRRY
Subjt:  FFRIRTNCSRKQFPELPRHFAP-QNRISFFVSRNPSFRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENSEKHPWKSNEYYTTCLAYIWVLDKLRRY

Query:  GISGILSYGLLNTVYYLTTFLIVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCL
        G+SGILSYGLLNTVYYLTTFL+VWFYIAPAP KMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTV YNFESQGK         +
Subjt:  GISGILSYGLLNTVYYLTTFLIVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCL

Query:  GLALLLFIAVTL
           L  FI V L
Subjt:  GLALLLFIAVTL

KAG6601085.1 hypothetical protein SDJN03_06318, partial [Cucurbita argyrosperma subsp. sororia]1.9e-7882.56Show/hide
Query:  FAPQNRISFFVSRNPSFRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENSEKHPWKSNEYYTTCLAYIWVLDKLRRYGISGILSYGLLNTVYYLTTF
        F     I+ FVSRNPS R CL NAEISANDPLKSE+GFSNHE EGSMEKNEN +KHP KS E          VLDKLRRYG+SGILSYGLLNTVYYLTTF
Subjt:  FAPQNRISFFVSRNPSFRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENSEKHPWKSNEYYTTCLAYIWVLDKLRRYGISGILSYGLLNTVYYLTTF

Query:  LIVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCLGLALLLFIAVTLLSA
        L+VWFYIAP PAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALA+APFVDRGLSWFTVKYNF+SQGKA +AIVGFCLGL+LLLFIAVTLLSA
Subjt:  LIVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCLGLALLLFIAVTLLSA

XP_008445925.1 PREDICTED: uncharacterized protein LOC103488806 isoform X5 [Cucumis melo]4.4e-9185.92Show/hide
Query:  RIRTNCSRKQFPELPRHFA-PQNRISFFVSRNPSFRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENSEKHPWKSNEYYTTCLAYIWVLDKLRRYGI
        RIR+NCSR QFPELPR F+ PQ+RI F VSRNPS RLCLSNA+ISANDPLKSED FSNHE EGSMEKNEN +KHP KSNE          VLDKLRRYG+
Subjt:  RIRTNCSRKQFPELPRHFA-PQNRISFFVSRNPSFRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENSEKHPWKSNEYYTTCLAYIWVLDKLRRYGI

Query:  SGILSYGLLNTVYYLTTFLIVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCLGL
        SGILSYGLLNT YYLTTFL+VWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTV YNFESQGKAFMAIVGFCLGL
Subjt:  SGILSYGLLNTVYYLTTFLIVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCLGL

Query:  ALLLFIAVTLLSA
        ALLLFI VTLLSA
Subjt:  ALLLFIAVTLLSA

XP_022983937.1 uncharacterized protein LOC111482401 isoform X2 [Cucurbita maxima]4.2e-7884.57Show/hide
Query:  SFFVSRNPSFRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENSEKHPWKSNEYYTTCLAYIWVLDKLRRYGISGILSYGLLNTVYYLTTFLIVWFYI
        + FVSRNPS R CL+NAEISANDPLKSE GFSNHE EGSMEKNEN +KHP KS E          VLDKLRRYG+SGILSYGLLNTVYYLTTFL+VWFYI
Subjt:  SFFVSRNPSFRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENSEKHPWKSNEYYTTCLAYIWVLDKLRRYGISGILSYGLLNTVYYLTTFLIVWFYI

Query:  APAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCLGLALLLFIAVTLLSA
        APAPAKMGYVAAAGRFLKIMAT+WAGSQVTKLARAAGALA+APFVDRGLSWFTVKYNF+SQGKA +AIVGFCLGL+LLLFIAVTLLSA
Subjt:  APAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCLGLALLLFIAVTLLSA

XP_023534257.1 uncharacterized protein LOC111795864 isoform X2 [Cucurbita pepo subsp. pepo]4.2e-7878.57Show/hide
Query:  SRKQFPELPRHFAPQNRIS----FFVSRNPSFRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENSEKHPWKSNEYYTTCLAYIWVLDKLRRYGISGI
        S K  P+     +P  R+S     FVSRNPS R CL+NAEISANDPLKSE+GFSNHE EGSMEKNEN +KHP KS E          VLDKLRRYG+SGI
Subjt:  SRKQFPELPRHFAPQNRIS----FFVSRNPSFRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENSEKHPWKSNEYYTTCLAYIWVLDKLRRYGISGI

Query:  LSYGLLNTVYYLTTFLIVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCLGLALL
        LSYGLLNTVYYLTTFL+VWFYIAP PAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALA+APFVDRGLSWFTVKYNF+SQGKA +AIVGFCLGL+LL
Subjt:  LSYGLLNTVYYLTTFLIVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCLGLALL

Query:  LFIAVTLLSA
        LFIAVTLLSA
Subjt:  LFIAVTLLSA

TrEMBL top hitse value%identityAlignment
A0A1S3BDD7 uncharacterized protein LOC103488806 isoform X52.1e-9185.92Show/hide
Query:  RIRTNCSRKQFPELPRHFA-PQNRISFFVSRNPSFRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENSEKHPWKSNEYYTTCLAYIWVLDKLRRYGI
        RIR+NCSR QFPELPR F+ PQ+RI F VSRNPS RLCLSNA+ISANDPLKSED FSNHE EGSMEKNEN +KHP KSNE          VLDKLRRYG+
Subjt:  RIRTNCSRKQFPELPRHFA-PQNRISFFVSRNPSFRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENSEKHPWKSNEYYTTCLAYIWVLDKLRRYGI

Query:  SGILSYGLLNTVYYLTTFLIVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCLGL
        SGILSYGLLNT YYLTTFL+VWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTV YNFESQGKAFMAIVGFCLGL
Subjt:  SGILSYGLLNTVYYLTTFLIVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCLGL

Query:  ALLLFIAVTLLSA
        ALLLFI VTLLSA
Subjt:  ALLLFIAVTLLSA

A0A5A7SW01 Uncharacterized protein1.9e-7188.1Show/hide
Query:  ANDPLKSEDGFSNHEMEGSMEKNENSEKHPWKSNEYYTTCLAYIWVLDKLRRYGISGILSYGLLNTVYYLTTFLIVWFYIAPAPAKMGYVAAAGRFLKIM
        ANDPLKSED FSNHE EGSMEKNEN +KHP KSNE          VLDKLRRYG+SGILSYGLLNT YYLTTFL+VWFYIAPAPAKMGYVAAAGRFLKIM
Subjt:  ANDPLKSEDGFSNHEMEGSMEKNENSEKHPWKSNEYYTTCLAYIWVLDKLRRYGISGILSYGLLNTVYYLTTFLIVWFYIAPAPAKMGYVAAAGRFLKIM

Query:  ATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCLGLALLLFIAVTLLSA
        ATVWAGSQVTKLARAAGALALAPFVDRGLSWFTV YNFESQGKAFMAIVGFCLGLALLLFI VTLLSA
Subjt:  ATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCLGLALLLFIAVTLLSA

A0A6J1GY06 uncharacterized protein LOC1114582383.5e-7882.05Show/hide
Query:  FAPQNRISFFVSRNPSFRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENSEKHPWKSNEYYTTCLAYIWVLDKLRRYGISGILSYGLLNTVYYLTTF
        F     I+ FVSRNPS R CL+NAEISANDPLKSE+GFSNHE EGSMEKNEN +KHP KS E          VLDKLRRYG+SGILSYGLLNTVYYLTTF
Subjt:  FAPQNRISFFVSRNPSFRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENSEKHPWKSNEYYTTCLAYIWVLDKLRRYGISGILSYGLLNTVYYLTTF

Query:  LIVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCLGLALLLFIAVTLLSA
        L+VWFYIAP PAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALA+APFVDR LSWFTVKYNF+SQGKA +AIVGFCLGL+LLLFIAVTLLSA
Subjt:  LIVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCLGLALLLFIAVTLLSA

A0A6J1J0R3 uncharacterized protein LOC111482401 isoform X22.1e-7884.57Show/hide
Query:  SFFVSRNPSFRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENSEKHPWKSNEYYTTCLAYIWVLDKLRRYGISGILSYGLLNTVYYLTTFLIVWFYI
        + FVSRNPS R CL+NAEISANDPLKSE GFSNHE EGSMEKNEN +KHP KS E          VLDKLRRYG+SGILSYGLLNTVYYLTTFL+VWFYI
Subjt:  SFFVSRNPSFRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENSEKHPWKSNEYYTTCLAYIWVLDKLRRYGISGILSYGLLNTVYYLTTFLIVWFYI

Query:  APAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCLGLALLLFIAVTLLSA
        APAPAKMGYVAAAGRFLKIMAT+WAGSQVTKLARAAGALA+APFVDRGLSWFTVKYNF+SQGKA +AIVGFCLGL+LLLFIAVTLLSA
Subjt:  APAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCLGLALLLFIAVTLLSA

A0A6J1J900 uncharacterized protein LOC111482401 isoform X14.3e-6884.05Show/hide
Query:  SFFVSRNPSFRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENSEKHPWKSNEYYTTCLAYIWVLDKLRRYGISGILSYGLLNTVYYLTTFLIVWFYI
        + FVSRNPS R CL+NAEISANDPLKSE GFSNHE EGSMEKNEN +KHP KS E          VLDKLRRYG+SGILSYGLLNTVYYLTTFL+VWFYI
Subjt:  SFFVSRNPSFRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENSEKHPWKSNEYYTTCLAYIWVLDKLRRYGISGILSYGLLNTVYYLTTFLIVWFYI

Query:  APAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGK
        APAPAKMGYVAAAGRFLKIMAT+WAGSQVTKLARAAGALA+APFVDRGLSWFTVKYNF+SQGK
Subjt:  APAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G38695.1 unknown protein4.7e-5158.88Show/hide
Query:  FRIRTNCSRKQFPELPRHFAPQNRISFFVSRNPSFRLCLSNAEISANDPLKSEDGFSNHEMEGSM-EKNENSEKHPWKSNEYYTTCLAYIWVLDKLRRYG
        F +  N  R Q   LP HF      S F   N S RL  S   +S N   KS+        EG M +KN  S+K+P+ S E          +L KL+RYG
Subjt:  FRIRTNCSRKQFPELPRHFAPQNRISFFVSRNPSFRLCLSNAEISANDPLKSEDGFSNHEMEGSM-EKNENSEKHPWKSNEYYTTCLAYIWVLDKLRRYG

Query:  ISGILSYGLLNTVYYLTTFLIVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCLG
        +SGILSYGLLNTVYY T FL+VWFY+APAP KMGY+AAA RFLK+MA VWAGSQVTKL R  GA+ALAP VDRGLSWFTVK NFESQGKAF A+VG CLG
Subjt:  ISGILSYGLLNTVYYLTTFLIVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCLG

Query:  LALLLFIAVTLLSA
        +AL+LFI VTLL A
Subjt:  LALLLFIAVTLLSA

AT2G38695.2 unknown protein1.5e-2852.15Show/hide
Query:  FRIRTNCSRKQFPELPRHFAPQNRISFFVSRNPSFRLCLSNAEISANDPLKSEDGFSNHEMEGSM-EKNENSEKHPWKSNEYYTTCLAYIWVLDKLRRYG
        F +  N  R Q   LP HF      S F   N S RL  S   +S N   KS+        EG M +KN  S+K+P+ S E          +L KL+RYG
Subjt:  FRIRTNCSRKQFPELPRHFAPQNRISFFVSRNPSFRLCLSNAEISANDPLKSEDGFSNHEMEGSM-EKNENSEKHPWKSNEYYTTCLAYIWVLDKLRRYG

Query:  ISGILSYGLLNTVYYLTTFLIVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAG
        +SGILSYGLLNTVYY T FL+VWFY+APAP KMGY+AAA RFLK+MA VWAGSQVTKL R  G
Subjt:  ISGILSYGLLNTVYYLTTFLIVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAG

AT2G38695.3 unknown protein3.3e-4448.09Show/hide
Query:  FRIRTNCSRKQFPELPRHFAPQNRISFFVSRNPSFRLCLSNAEISANDPLKSEDGFSNHEMEGSM-EKNENSEKHPWKSNEYYTTCLAYIWVLDKLRRYG
        F +  N  R Q   LP HF      S F   N S RL  S   +S N   KS+        EG M +KN  S+K+P+ S E          +L KL+RYG
Subjt:  FRIRTNCSRKQFPELPRHFAPQNRISFFVSRNPSFRLCLSNAEISANDPLKSEDGFSNHEMEGSM-EKNENSEKHPWKSNEYYTTCLAYIWVLDKLRRYG

Query:  ISGILSYGLLNTVYYLTTFLIVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAG-------------------------------------
        +SGILSYGLLNTVYY T FL+VWFY+APAP KMGY+AAA RFLK+MA VWAGSQVTKL R  G                                     
Subjt:  ISGILSYGLLNTVYYLTTFLIVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAG-------------------------------------

Query:  -----------ALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCLGLALLLFIAVTLLSA
                   A+ALAP VDRGLSWFTVK NFESQGKAF A+VG CLG+AL+LFI VTLL A
Subjt:  -----------ALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCLGLALLLFIAVTLLSA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTAACCATTTTTGTTTCTTCAGGATTCGAACGAATTGCAGTCGAAAACAATTTCCAGAGCTTCCCCGCCATTTCGCGCCCCAAAACCGGATAAGCTTTTTTGTTAG
TCGGAACCCTAGCTTCCGACTCTGCCTCAGCAATGCCGAAATTAGTGCCAACGATCCATTGAAATCTGAGGATGGCTTTTCCAATCATGAAATGGAAGGTTCAATGGAAA
AGAATGAGAATAGTGAGAAACATCCCTGGAAATCAAATGAGTACTACACTACTTGCCTTGCATACATTTGGGTACTGGATAAATTGAGGAGATATGGAATTTCTGGAATA
TTGTCTTACGGATTGTTGAATACAGTCTACTATCTTACAACATTTCTCATTGTGTGGTTCTATATTGCACCAGCACCTGCGAAAATGGGCTATGTTGCGGCTGCTGGAAG
ATTTCTCAAAATAATGGCTACAGTATGGGCTGGAAGCCAAGTTACCAAGCTTGCAAGAGCTGCAGGGGCTCTTGCTCTGGCACCATTTGTCGACAGAGGATTGTCGTGGT
TCACTGTCAAATACAACTTTGAGTCTCAGGGGAAGGCATTTATGGCGATTGTTGGGTTCTGCTTAGGATTGGCTCTCTTGTTATTCATTGCTGTTACTCTGCTTTCAGCA
TAA
mRNA sequenceShow/hide mRNA sequence
ATGCGTAACCATTTTTGTTTCTTCAGGATTCGAACGAATTGCAGTCGAAAACAATTTCCAGAGCTTCCCCGCCATTTCGCGCCCCAAAACCGGATAAGCTTTTTTGTTAG
TCGGAACCCTAGCTTCCGACTCTGCCTCAGCAATGCCGAAATTAGTGCCAACGATCCATTGAAATCTGAGGATGGCTTTTCCAATCATGAAATGGAAGGTTCAATGGAAA
AGAATGAGAATAGTGAGAAACATCCCTGGAAATCAAATGAGTACTACACTACTTGCCTTGCATACATTTGGGTACTGGATAAATTGAGGAGATATGGAATTTCTGGAATA
TTGTCTTACGGATTGTTGAATACAGTCTACTATCTTACAACATTTCTCATTGTGTGGTTCTATATTGCACCAGCACCTGCGAAAATGGGCTATGTTGCGGCTGCTGGAAG
ATTTCTCAAAATAATGGCTACAGTATGGGCTGGAAGCCAAGTTACCAAGCTTGCAAGAGCTGCAGGGGCTCTTGCTCTGGCACCATTTGTCGACAGAGGATTGTCGTGGT
TCACTGTCAAATACAACTTTGAGTCTCAGGGGAAGGCATTTATGGCGATTGTTGGGTTCTGCTTAGGATTGGCTCTCTTGTTATTCATTGCTGTTACTCTGCTTTCAGCA
TAA
Protein sequenceShow/hide protein sequence
MRNHFCFFRIRTNCSRKQFPELPRHFAPQNRISFFVSRNPSFRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENSEKHPWKSNEYYTTCLAYIWVLDKLRRYGISGI
LSYGLLNTVYYLTTFLIVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCLGLALLLFIAVTLLSA