CuGenDBv2

CsGy5G020080 (gene) of Cucumber (Gy14) v2.1 genome

Gene ID: CsGy5G020080
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: Gag-Pol polyprotein/retrotransposon
Genome location: Gy14Chr5:26253233..26262128
RNA-Seq Expression: CsGy5G020080
Synteny: CsGy5G020080
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA


Homology
GenBank top hits (e-value, % identity, alignment)
KAE8648632.1 hypothetical protein Csa_009176 [Cucumis sativus] (e-value: 7.13e-191; identity: 100%)
Query:  MAISLASFSRIRSNCSRSQLPELPRHFAPLQSQIRFSVSRNPSVRFCLSNAKISANDPLKSEDDFSNHEMEGSMEKNENQQKHPQKSNEVLDKLRRYGLS
        MAISLASFSRIRSNCSRSQLPELPRHFAPLQSQIRFSVSRNPSVRFCLSNAKISANDPLKSEDDFSNHEMEGSMEKNENQQKHPQKSNEVLDKLRRYGLS
Subjt:  MAISLASFSRIRSNCSRSQLPELPRHFAPLQSQIRFSVSRNPSVRFCLSNAKISANDPLKSEDDFSNHEMEGSMEKNENQQKHPQKSNEVLDKLRRYGLS

Query:  GILSYGLLNTVYYLTTFLVVWFYIAPAPGKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVNYNFESQGKVARKETSIHVFCF
        GILSYGLLNTVYYLTTFLVVWFYIAPAPGKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVNYNFESQGKVARKETSIHVFCF
Subjt:  GILSYGLLNTVYYLTTFLVVWFYIAPAPGKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVNYNFESQGKVARKETSIHVFCF

Query:  LFHFIDVKLKVKPFTGIYGDCWVLLRIGSLVIHCCYSAFSLRQPWKVELVSSPNRNQNLSCSISG
        LFHFIDVKLKVKPFTGIYGDCWVLLRIGSLVIHCCYSAFSLRQPWKVELVSSPNRNQNLSCSISG
Subjt:  LFHFIDVKLKVKPFTGIYGDCWVLLRIGSLVIHCCYSAFSLRQPWKVELVSSPNRNQNLSCSISG

XP_008445925.1 PREDICTED: uncharacterized protein LOC103488806 isoform X5 [Cucumis melo] (e-value: 2.37e-117; identity: 94.12%)
Query:  MAISLASFSRIRSNCSRSQLPELPRHFAPLQSQIRFSVSRNPSVRFCLSNAKISANDPLKSEDDFSNHEMEGSMEKNENQQKHPQKSNEVLDKLRRYGLS
        MAISLAS SRIRSNCSRSQ PELPR F+P QS+IRFSVSRNPSVR CLSNAKISANDPLKSEDDFSNHE EGSMEKNEN+QKHPQKSNEVLDKLRRYGLS
Subjt:  MAISLASFSRIRSNCSRSQLPELPRHFAPLQSQIRFSVSRNPSVRFCLSNAKISANDPLKSEDDFSNHEMEGSMEKNENQQKHPQKSNEVLDKLRRYGLS

Query:  GILSYGLLNTVYYLTTFLVVWFYIAPAPGKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVNYNFESQGK
        GILSYGLLNT YYLTTFLVVWFYIAPAP KMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVNYNFESQGK
Subjt:  GILSYGLLNTVYYLTTFLVVWFYIAPAPGKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVNYNFESQGK

XP_022983929.1 uncharacterized protein LOC111482401 isoform X1 [Cucurbita maxima] (e-value: 1.53e-119; identity: 84.47%)
Query:  VSRNPSVRFCLSNAKISANDPLKSEDDFSNHEMEGSMEKNENQQKHPQKSNEVLDKLRRYGLSGILSYGLLNTVYYLTTFLVVWFYIAPAPGKMGYVAAA
        VSRNPS+R CL+NA+ISANDPLKSE  FSNHE EGSMEKNEN +KHP+KS EVLDKLRRYG+SGILSYGLLNTVYYLTTFLVVWFYIAPAP KMGYVAAA
Subjt:  VSRNPSVRFCLSNAKISANDPLKSEDDFSNHEMEGSMEKNENQQKHPQKSNEVLDKLRRYGLSGILSYGLLNTVYYLTTFLVVWFYIAPAPGKMGYVAAA

Query:  GRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVNYNFESQGKVARKETSIHVFCFLFHFIDVKLKVKPFTGIYGDCWVLLRIGSLVIHCCYS
        GRFLKIMAT+WAGSQVTKLARAAGALA+APFVDRGLSWFTV YNF+SQGKV R ETSIH   FLFHFIDV LKV P  G  GDCW+LLR+ SLVIHCCYS
Subjt:  GRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVNYNFESQGKVARKETSIHVFCFLFHFIDVKLKVKPFTGIYGDCWVLLRIGSLVIHCCYS

Query:  AFSLRQ
        AFS+RQ
Subjt:  AFSLRQ

XP_023534249.1 uncharacterized protein LOC111795864 isoform X1 [Cucurbita pepo subsp. pepo] (e-value: 1.35e-120; identity: 82.94%)
Query:  VSRNPSVRFCLSNAKISANDPLKSEDDFSNHEMEGSMEKNENQQKHPQKSNEVLDKLRRYGLSGILSYGLLNTVYYLTTFLVVWFYIAPAPGKMGYVAAA
        VSRNPS+R CL+NA+ISANDPLKSE+ FSNHE EGSMEKNEN +KHP+KS EVLDKLRRYG+SGILSYGLLNTVYYLTTFLVVWFYIAP P KMGYVAAA
Subjt:  VSRNPSVRFCLSNAKISANDPLKSEDDFSNHEMEGSMEKNENQQKHPQKSNEVLDKLRRYGLSGILSYGLLNTVYYLTTFLVVWFYIAPAPGKMGYVAAA

Query:  GRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVNYNFESQGKVARKETSIHVFCFLFHFIDVKLKVKPFTGIYGDCWVLLRIGSLVIHCCYS
        GRFLKIMATVWAGSQVTKLARAAGALA+APFVDRGLSWFTV YNF+SQGKV R ETSIH  CFLFHFID+ LK+ P  G  GDCW+LLRI SLVIHCCYS
Subjt:  GRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVNYNFESQGKVARKETSIHVFCFLFHFIDVKLKVKPFTGIYGDCWVLLRIGSLVIHCCYS

Query:  AFSLRQPWKVE
        AFS+RQ   +E
Subjt:  AFSLRQPWKVE

XP_023534266.1 uncharacterized protein LOC111795864 isoform X3 [Cucurbita pepo subsp. pepo] (e-value: 9.43e-92; identity: 84.28%)
Query:  VLDKLRRYGLSGILSYGLLNTVYYLTTFLVVWFYIAPAPGKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVNYNFESQGKVA
        VLDKLRRYG+SGILSYGLLNTVYYLTTFLVVWFYIAP P KMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALA+APFVDRGLSWFTV YNF+SQGKV 
Subjt:  VLDKLRRYGLSGILSYGLLNTVYYLTTFLVVWFYIAPAPGKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVNYNFESQGKVA

Query:  RKETSIHVFCFLFHFIDVKLKVKPFTGIYGDCWVLLRIGSLVIHCCYSAFSLRQPWKVE
        R ETSIH  CFLFHFID+ LK+ P  G  GDCW+LLRI SLVIHCCYSAFS+RQ   +E
Subjt:  RKETSIHVFCFLFHFIDVKLKVKPFTGIYGDCWVLLRIGSLVIHCCYSAFSLRQPWKVE

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KRT8 Uncharacterized protein (e-value: 4.04e-144; identity: 82.64%)
Query:  MAISLASFSRIRSNCSRSQLPELPRHFAPLQSQIRFSVSRNPSVRFCLSNAKISANDPLKSEDDFSNHEMEGSMEKNENQQKHPQKSNEVLDKLRRYGLS
        MAISLASFSRIRSNCSRSQLPELPRHFAPLQSQIRFSVSRNPSVRFCLSNAKISANDPLKSEDDFSNHEMEGSMEKNENQQKHPQKSNEVLDKLRRYGLS
Subjt:  MAISLASFSRIRSNCSRSQLPELPRHFAPLQSQIRFSVSRNPSVRFCLSNAKISANDPLKSEDDFSNHEMEGSMEKNENQQKHPQKSNEVLDKLRRYGLS

Query:  GILSYGLLNTVYYLTTFLVVWFYIAPAPGKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVNYNFESQGKVARKETSIHVFCF
        GILSYGLLNTVYYLTTFLVVWFYIAPAPGKMGYVAAAGRFLKIMATVWAGSQVTKLARAAG +    F  +G      N    S     R++  + V   
Subjt:  GILSYGLLNTVYYLTTFLVVWFYIAPAPGKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVNYNFESQGKVARKETSIHVFCF

Query:  LFHFIDVKLKVKPFTGIYGDCWVLLRIGSLVIHCCYSAFSLRQPWKVELVSSPNRNQNLSCSISG
              ++L+V    GIYGDCWVLLRIGSLVIHCCYSAFSLRQPWKVELVSSPNRNQNLSCSISG
Subjt:  LFHFIDVKLKVKPFTGIYGDCWVLLRIGSLVIHCCYSAFSLRQPWKVELVSSPNRNQNLSCSISG

A0A1S3BDD7 uncharacterized protein LOC103488806 isoform X5 (e-value: 1.15e-117; identity: 94.12%)
Query:  MAISLASFSRIRSNCSRSQLPELPRHFAPLQSQIRFSVSRNPSVRFCLSNAKISANDPLKSEDDFSNHEMEGSMEKNENQQKHPQKSNEVLDKLRRYGLS
        MAISLAS SRIRSNCSRSQ PELPR F+P QS+IRFSVSRNPSVR CLSNAKISANDPLKSEDDFSNHE EGSMEKNEN+QKHPQKSNEVLDKLRRYGLS
Subjt:  MAISLASFSRIRSNCSRSQLPELPRHFAPLQSQIRFSVSRNPSVRFCLSNAKISANDPLKSEDDFSNHEMEGSMEKNENQQKHPQKSNEVLDKLRRYGLS

Query:  GILSYGLLNTVYYLTTFLVVWFYIAPAPGKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVNYNFESQGK
        GILSYGLLNT YYLTTFLVVWFYIAPAP KMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVNYNFESQGK
Subjt:  GILSYGLLNTVYYLTTFLVVWFYIAPAPGKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVNYNFESQGK

A0A6J1GY06 uncharacterized protein LOC111458238 (e-value: 1.13e-86; identity: 86.54%)
Query:  IRFSVSRNPSVRFCLSNAKISANDPLKSEDDFSNHEMEGSMEKNENQQKHPQKSNEVLDKLRRYGLSGILSYGLLNTVYYLTTFLVVWFYIAPAPGKMGY
        I   VSRNPS+R CL+NA+ISANDPLKSE+ FSNHE EGSMEKNEN QKHPQKS EVLDKLRRYG+SGILSYGLLNTVYYLTTFLVVWFYIAP P KMGY
Subjt:  IRFSVSRNPSVRFCLSNAKISANDPLKSEDDFSNHEMEGSMEKNENQQKHPQKSNEVLDKLRRYGLSGILSYGLLNTVYYLTTFLVVWFYIAPAPGKMGY

Query:  VAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVNYNFESQGKVA
        VAAAGRFLKIMATVWAGSQVTKLARAAGALA+APFVDR LSWFTV YNF+SQGK  
Subjt:  VAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVNYNFESQGKVA

A0A6J1J0R3 uncharacterized protein LOC111482401 isoform X2 (e-value: 1.61e-86; identity: 87.5%)
Query:  VSRNPSVRFCLSNAKISANDPLKSEDDFSNHEMEGSMEKNENQQKHPQKSNEVLDKLRRYGLSGILSYGLLNTVYYLTTFLVVWFYIAPAPGKMGYVAAA
        VSRNPS+R CL+NA+ISANDPLKSE  FSNHE EGSMEKNEN +KHP+KS EVLDKLRRYG+SGILSYGLLNTVYYLTTFLVVWFYIAPAP KMGYVAAA
Subjt:  VSRNPSVRFCLSNAKISANDPLKSEDDFSNHEMEGSMEKNENQQKHPQKSNEVLDKLRRYGLSGILSYGLLNTVYYLTTFLVVWFYIAPAPGKMGYVAAA

Query:  GRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVNYNFESQGKVA
        GRFLKIMAT+WAGSQVTKLARAAGALA+APFVDRGLSWFTV YNF+SQGK  
Subjt:  GRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVNYNFESQGKVA

A0A6J1J900 uncharacterized protein LOC111482401 isoform X1 (e-value: 7.40e-120; identity: 84.47%)
Query:  VSRNPSVRFCLSNAKISANDPLKSEDDFSNHEMEGSMEKNENQQKHPQKSNEVLDKLRRYGLSGILSYGLLNTVYYLTTFLVVWFYIAPAPGKMGYVAAA
        VSRNPS+R CL+NA+ISANDPLKSE  FSNHE EGSMEKNEN +KHP+KS EVLDKLRRYG+SGILSYGLLNTVYYLTTFLVVWFYIAPAP KMGYVAAA
Subjt:  VSRNPSVRFCLSNAKISANDPLKSEDDFSNHEMEGSMEKNENQQKHPQKSNEVLDKLRRYGLSGILSYGLLNTVYYLTTFLVVWFYIAPAPGKMGYVAAA

Query:  GRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVNYNFESQGKVARKETSIHVFCFLFHFIDVKLKVKPFTGIYGDCWVLLRIGSLVIHCCYS
        GRFLKIMAT+WAGSQVTKLARAAGALA+APFVDRGLSWFTV YNF+SQGKV R ETSIH   FLFHFIDV LKV P  G  GDCW+LLR+ SLVIHCCYS
Subjt:  GRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVNYNFESQGKVARKETSIHVFCFLFHFIDVKLKVKPFTGIYGDCWVLLRIGSLVIHCCYS

Query:  AFSLRQ
        AFS+RQ
Subjt:  AFSLRQ

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT2G38695.1 unknown protein (e-value: 4.0e-44; identity: 54.11%)
Query:  SLASFSRIRSNCSRSQLPELPRHFAPLQSQIRFSVSRNPSVRFCLSNAKISANDPLKSEDDFSNHEMEGSM-EKNENQQKHPQKSNEVLDKLRRYGLSGI
        SL +FS + +N  R+Q   LP HF+      R   S             +S N   KS+ +      EG M +KN   +K+P  S E+L KL+RYGLSGI
Subjt:  SLASFSRIRSNCSRSQLPELPRHFAPLQSQIRFSVSRNPSVRFCLSNAKISANDPLKSEDDFSNHEMEGSM-EKNENQQKHPQKSNEVLDKLRRYGLSGI

Query:  LSYGLLNTVYYLTTFLVVWFYIAPAPGKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVNYNFESQGKVARKETSIHVFCFLF
        LSYGLLNTVYY T FL+VWFY+APAPGKMGY+AAA RFLK+MA VWAGSQVTKL R  GA+ALAP VDRGLSWFTV  NFESQGK       I +   L 
Subjt:  LSYGLLNTVYYLTTFLVVWFYIAPAPGKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVNYNFESQGKVARKETSIHVFCFLF

Query:  HFIDVKL
         FI V L
Subjt:  HFIDVKL

AT2G38695.2 unknown protein (e-value: 2.5e-30; identity: 52.83%)
Query:  SLASFSRIRSNCSRSQLPELPRHFAPLQSQIRFSVSRNPSVRFCLSNAKISANDPLKSEDDFSNHEMEGSM-EKNENQQKHPQKSNEVLDKLRRYGLSGI
        SL +FS + +N  R+Q   LP HF+      R   S             +S N   KS+ +      EG M +KN   +K+P  S E+L KL+RYGLSGI
Subjt:  SLASFSRIRSNCSRSQLPELPRHFAPLQSQIRFSVSRNPSVRFCLSNAKISANDPLKSEDDFSNHEMEGSM-EKNENQQKHPQKSNEVLDKLRRYGLSGI

Query:  LSYGLLNTVYYLTTFLVVWFYIAPAPGKMGYVAAAGRFLKIMATVWAGSQVTKLARAAG
        LSYGLLNTVYY T FL+VWFY+APAPGKMGY+AAA RFLK+MA VWAGSQVTKL R  G
Subjt:  LSYGLLNTVYYLTTFLVVWFYIAPAPGKMGYVAAAGRFLKIMATVWAGSQVTKLARAAG

AT2G38695.3 unknown protein (e-value: 2.7e-37; identity: 43.92%)
Query:  SLASFSRIRSNCSRSQLPELPRHFAPLQSQIRFSVSRNPSVRFCLSNAKISANDPLKSEDDFSNHEMEGSM-EKNENQQKHPQKSNEVLDKLRRYGLSGI
        SL +FS + +N  R+Q   LP HF+      R   S             +S N   KS+ +      EG M +KN   +K+P  S E+L KL+RYGLSGI
Subjt:  SLASFSRIRSNCSRSQLPELPRHFAPLQSQIRFSVSRNPSVRFCLSNAKISANDPLKSEDDFSNHEMEGSM-EKNENQQKHPQKSNEVLDKLRRYGLSGI

Query:  LSYGLLNTVYYLTTFLVVWFYIAPAPGKMGYVAAAGRFLKIMATVWAGSQVTKLARAAG-----------------------------------------
        LSYGLLNTVYY T FL+VWFY+APAPGKMGY+AAA RFLK+MA VWAGSQVTKL R  G                                         
Subjt:  LSYGLLNTVYYLTTFLVVWFYIAPAPGKMGYVAAAGRFLKIMATVWAGSQVTKLARAAG-----------------------------------------

Query:  -------ALALAPFVDRGLSWFTVNYNFESQGKVARKETSIHVFCFLFHFIDVKL
               A+ALAP VDRGLSWFTV  NFESQGK       I +   L  FI V L
Subjt:  -------ALALAPFVDRGLSWFTVNYNFESQGKVARKETSIHVFCFLFHFIDVKL


Sequences
CDS sequence
ATGGCAATATCCCTTGCGTCGTTTTCCCGGATTCGATCGAATTGCAGTCGAAGTCAACTTCCAGAGCTTCCCCGCCATTTTGCGCCCCTCCAGAGCCAAATAAGATTTTC
TGTTAGTCGGAACCCTAGCGTCCGATTCTGCCTCAGCAATGCCAAAATTAGCGCCAACGATCCATTGAAATCTGAGGATGACTTTTCCAATCATGAAATGGAAGGTTCAA
TGGAAAAGAATGAGAATCAGCAGAAACATCCCCAGAAATCAAATGAGGTACTGGATAAACTGAGGAGATATGGACTTTCTGGAATATTGTCTTACGGATTGTTGAATACA
GTCTACTATCTTACAACGTTTCTCGTTGTGTGGTTCTACATTGCACCAGCACCTGGGAAAATGGGCTATGTTGCAGCTGCTGGAAGATTTCTCAAAATAATGGCTACAGT
ATGGGCTGGAAGCCAAGTTACAAAGCTTGCAAGAGCTGCAGGGGCTCTTGCTTTGGCGCCATTCGTAGACAGAGGGTTGTCATGGTTCACCGTCAACTACAACTTCGAGT
CTCAGGGGAAGGTTGCTAGAAAAGAAACTTCTATTCATGTATTTTGTTTCTTGTTTCATTTCATTGATGTTAAGCTTAAAGTTAAACCATTTACAGGCATTTATGGCGAT
TGTTGGGTTCTGCTTAGGATTGGCTCTCTTGTTATTCATTGTTGTTACTCTGCTTTCAGCTTAAGACAACCTTGGAAAGTTGAATTGGTTTCGTCACCAAATCGAAACCA
AAACCTTTCTTGTAGCATATCTGGATAA
mRNA sequence
ATGGCAATATCCCTTGCGTCGTTTTCCCGGATTCGATCGAATTGCAGTCGAAGTCAACTTCCAGAGCTTCCCCGCCATTTTGCGCCCCTCCAGAGCCAAATAAGATTTTC
TGTTAGTCGGAACCCTAGCGTCCGATTCTGCCTCAGCAATGCCAAAATTAGCGCCAACGATCCATTGAAATCTGAGGATGACTTTTCCAATCATGAAATGGAAGGTTCAA
TGGAAAAGAATGAGAATCAGCAGAAACATCCCCAGAAATCAAATGAGGTACTGGATAAACTGAGGAGATATGGACTTTCTGGAATATTGTCTTACGGATTGTTGAATACA
GTCTACTATCTTACAACGTTTCTCGTTGTGTGGTTCTACATTGCACCAGCACCTGGGAAAATGGGCTATGTTGCAGCTGCTGGAAGATTTCTCAAAATAATGGCTACAGT
ATGGGCTGGAAGCCAAGTTACAAAGCTTGCAAGAGCTGCAGGGGCTCTTGCTTTGGCGCCATTCGTAGACAGAGGGTTGTCATGGTTCACCGTCAACTACAACTTCGAGT
CTCAGGGGAAGGTTGCTAGAAAAGAAACTTCTATTCATGTATTTTGTTTCTTGTTTCATTTCATTGATGTTAAGCTTAAAGTTAAACCATTTACAGGCATTTATGGCGAT
TGTTGGGTTCTGCTTAGGATTGGCTCTCTTGTTATTCATTGTTGTTACTCTGCTTTCAGCTTAAGACAACCTTGGAAAGTTGAATTGGTTTCGTCACCAAATCGAAACCA
AAACCTTTCTTGTAGCATATCTGGATAA
Protein sequence
MAISLASFSRIRSNCSRSQLPELPRHFAPLQSQIRFSVSRNPSVRFCLSNAKISANDPLKSEDDFSNHEMEGSMEKNENQQKHPQKSNEVLDKLRRYGLSGILSYGLLNT
VYYLTTFLVVWFYIAPAPGKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVNYNFESQGKVARKETSIHVFCFLFHFIDVKLKVKPFTGIYGD
CWVLLRIGSLVIHCCYSAFSLRQPWKVELVSSPNRNQNLSCSISG