; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy4G012115 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy4G012115
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionGag-pol polyprotein
Genome locationGy14Chr4:14063516..14064910
RNA-Seq ExpressionCsGy4G012115
SyntenyCsGy4G012115
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046617.1 gag-pol polyprotein [Cucumis melo var. makuwa]5.71e-1735.29Show/hide
Query:  SNGQAKSSAYAKHRGSSSC--------------------------------------------LELHKARHEYDSLSKNVKMLTSSTQSLQNMSDDEKSR
        S+GQ KSSAYA+H+GSSSC                                             EL +A HE++SLSK VKMLTS TQ+L+N+ +D KS 
Subjt:  SNGQAKSSAYAKHRGSSSC--------------------------------------------LELHKARHEYDSLSKNVKMLTSSTQSLQNMSDDEKSR

Query:  PNKMGLGYSISSSVGTSTTVFIKASSKIDQCTNPLPVERENHIIKKRWVCHFWGKHGHILSFCYRLHGFP
         NKM LG+S                                  +KK+WVCH+ G+ GH+  FCY LHGFP
Subjt:  PNKMGLGYSISSSVGTSTTVFIKASSKIDQCTNPLPVERENHIIKKRWVCHFWGKHGHILSFCYRLHGFP

KAA0063069.1 putative leucine-rich repeat-containing protein [Cucumis melo var. makuwa]4.38e-1669.23Show/hide
Query:  DGGLTTRSVRCHECEVFSGSEFKEFRRALISIVTIDESIEDGCSNGQAKSSAYAKHRGSSSCLEL
        DGG TT+ VRCHECEV S +EF+EF+RALI  V  DES E GCS+GQ KS AYA+HRGSSSC+ L
Subjt:  DGGLTTRSVRCHECEVFSGSEFKEFRRALISIVTIDESIEDGCSNGQAKSSAYAKHRGSSSCLEL

KAE8649447.1 hypothetical protein Csa_018892 [Cucumis sativus]1.25e-13067.87Show/hide
Query:  MALIRDGGLTTRSVRCHECEVFSGSEFKEFRRALISIVTIDESIEDGCSNGQAKSSAYAKHRGSSSCLELHKARHEYDSLSKNVKMLTSSTQSLQNMSDD
        MALIRDGGLTTRSVRCHECEVFSGSEFKEFRRALISIVTIDESIEDGCSNGQAKSSAYAKHRGSSSCLELHKARHEYDSLSKNVKMLTSSTQSLQNMSDD
Subjt:  MALIRDGGLTTRSVRCHECEVFSGSEFKEFRRALISIVTIDESIEDGCSNGQAKSSAYAKHRGSSSCLELHKARHEYDSLSKNVKMLTSSTQSLQNMSDD

Query:  EKSRPNKMGLGYSISSSVGTSTTVFIKASSKIDQCTNPLPVERENHIIKKRWVCHFWGKHGHILSFCYRLHGFPSNRK----------------------
        EKSRPNKMGLGYSISSSVGTSTTVFIKASSKIDQCTNPLPVERENHIIKKRWVCHFWGKHGHILSFCYRLHGFPSNRK                      
Subjt:  EKSRPNKMGLGYSISSSVGTSTTVFIKASSKIDQCTNPLPVERENHIIKKRWVCHFWGKHGHILSFCYRLHGFPSNRK----------------------

Query:  ----------------------------------------------------------------------------PKFILQEAYCKALDRGFSTTPFNT
                                                                                    PKFILQEAYCKALDRGFSTTPFNT
Subjt:  ----------------------------------------------------------------------------PKFILQEAYCKALDRGFSTTPFNT

Query:  NLCGN
        NLCGN
Subjt:  NLCGN

KGN46724.1 hypothetical protein Csa_020938 [Cucumis sativus]5.38e-1868.18Show/hide
Query:  MALIRDGGLTTRSVRCHECEVFSGSEFKEFRRALISIVTIDESIEDGCSNGQAKSSAYAKHRGSSS
        MALI DGGL TR VR HECEV  GSEF++F RALISIVT+DE IED  S+ + K  AY +HRGSSS
Subjt:  MALIRDGGLTTRSVRCHECEVFSGSEFKEFRRALISIVTIDESIEDGCSNGQAKSSAYAKHRGSSS

TYK16282.1 hypothetical protein E5676_scaffold21G00530 [Cucumis melo var. makuwa]9.36e-1772.31Show/hide
Query:  MALIRDGGLTTRSVRCHECEVFSGSEFKEFRRALISIVTIDESIEDGCSNGQAKSSAYAKHRGSS
        MA IRDGG TT+SVRCHECEV S SEF+EF+RALI  V  DES E GCS+GQ KS AY +HRGSS
Subjt:  MALIRDGGLTTRSVRCHECEVFSGSEFKEFRRALISIVTIDESIEDGCSNGQAKSSAYAKHRGSS

TrEMBL top hitse value%identityAlignment
A0A0A0KCZ9 Uncharacterized protein2.60e-1868.18Show/hide
Query:  MALIRDGGLTTRSVRCHECEVFSGSEFKEFRRALISIVTIDESIEDGCSNGQAKSSAYAKHRGSSS
        MALI DGGL TR VR HECEV  GSEF++F RALISIVT+DE IED  S+ + K  AY +HRGSSS
Subjt:  MALIRDGGLTTRSVRCHECEVFSGSEFKEFRRALISIVTIDESIEDGCSNGQAKSSAYAKHRGSSS

A0A0A0L8S2 Uncharacterized protein1.31e-1866.67Show/hide
Query:  MALIRDGGLTTRSVRCHECEVFSGSEFKEFRRALISIVTIDESIEDGCSNGQAKSSAYAKHRG
        MA I+D G  T SVRC ECEV SGS+F+EF RAL S+V  DES EDGCS+GQ KS AY +HRG
Subjt:  MALIRDGGLTTRSVRCHECEVFSGSEFKEFRRALISIVTIDESIEDGCSNGQAKSSAYAKHRG

A0A5A7TXF9 Gag-pol polyprotein2.76e-1735.29Show/hide
Query:  SNGQAKSSAYAKHRGSSSC--------------------------------------------LELHKARHEYDSLSKNVKMLTSSTQSLQNMSDDEKSR
        S+GQ KSSAYA+H+GSSSC                                             EL +A HE++SLSK VKMLTS TQ+L+N+ +D KS 
Subjt:  SNGQAKSSAYAKHRGSSSC--------------------------------------------LELHKARHEYDSLSKNVKMLTSSTQSLQNMSDDEKSR

Query:  PNKMGLGYSISSSVGTSTTVFIKASSKIDQCTNPLPVERENHIIKKRWVCHFWGKHGHILSFCYRLHGFP
         NKM LG+S                                  +KK+WVCH+ G+ GH+  FCY LHGFP
Subjt:  PNKMGLGYSISSSVGTSTTVFIKASSKIDQCTNPLPVERENHIIKKRWVCHFWGKHGHILSFCYRLHGFP

A0A5A7VBT6 Putative leucine-rich repeat-containing protein2.12e-1669.23Show/hide
Query:  DGGLTTRSVRCHECEVFSGSEFKEFRRALISIVTIDESIEDGCSNGQAKSSAYAKHRGSSSCLEL
        DGG TT+ VRCHECEV S +EF+EF+RALI  V  DES E GCS+GQ KS AYA+HRGSSSC+ L
Subjt:  DGGLTTRSVRCHECEVFSGSEFKEFRRALISIVTIDESIEDGCSNGQAKSSAYAKHRGSSSCLEL

A0A5D3CXM9 Uncharacterized protein4.53e-1772.31Show/hide
Query:  MALIRDGGLTTRSVRCHECEVFSGSEFKEFRRALISIVTIDESIEDGCSNGQAKSSAYAKHRGSS
        MA IRDGG TT+SVRCHECEV S SEF+EF+RALI  V  DES E GCS+GQ KS AY +HRGSS
Subjt:  MALIRDGGLTTRSVRCHECEVFSGSEFKEFRRALISIVTIDESIEDGCSNGQAKSSAYAKHRGSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTTGATCAGAGACGGTGGTTTAACAACTCGCAGCGTAAGGTGTCATGAATGTGAAGTATTTTCTGGTAGTGAGTTCAAGGAATTTCGGAGAGCCCTCATTAGTAT
TGTTACTATAGATGAATCTATTGAAGATGGTTGTTCCAATGGGCAAGCGAAATCTAGTGCATATGCTAAGCATCGAGGATCATCATCTTGTTTAGAATTACACAAGGCTC
GCCATGAGTATGATTCCTTGTCCAAGAATGTCAAGATGCTCACATCAAGTACTCAAAGTCTTCAAAATATGTCAGACGACGAAAAATCGAGACCAAATAAAATGGGTCTG
GGTTACTCTATTAGTAGCTCCGTTGGTACTTCCACAACAGTATTTATAAAAGCATCCAGTAAAATTGATCAATGTACAAACCCATTACCTGTTGAAAGAGAAAATCATAT
TATCAAGAAAAGATGGGTCTGTCATTTCTGGGGCAAGCATGGTCACATACTATCATTTTGTTACCGTCTACATGGTTTTCCCTCAAATAGAAAACCAAAGTTCATTTTGC
AGGAGGCATATTGTAAAGCATTGGATAGGGGCTTCTCCACTACTCCATTCAACACCAATCTTTGTGGTAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTTGATCAGAGACGGTGGTTTAACAACTCGCAGCGTAAGGTGTCATGAATGTGAAGTATTTTCTGGTAGTGAGTTCAAGGAATTTCGGAGAGCCCTCATTAGTAT
TGTTACTATAGATGAATCTATTGAAGATGGTTGTTCCAATGGGCAAGCGAAATCTAGTGCATATGCTAAGCATCGAGGATCATCATCTTGTTTAGAATTACACAAGGCTC
GCCATGAGTATGATTCCTTGTCCAAGAATGTCAAGATGCTCACATCAAGTACTCAAAGTCTTCAAAATATGTCAGACGACGAAAAATCGAGACCAAATAAAATGGGTCTG
GGTTACTCTATTAGTAGCTCCGTTGGTACTTCCACAACAGTATTTATAAAAGCATCCAGTAAAATTGATCAATGTACAAACCCATTACCTGTTGAAAGAGAAAATCATAT
TATCAAGAAAAGATGGGTCTGTCATTTCTGGGGCAAGCATGGTCACATACTATCATTTTGTTACCGTCTACATGGTTTTCCCTCAAATAGAAAACCAAAGTTCATTTTGC
AGGAGGCATATTGTAAAGCATTGGATAGGGGCTTCTCCACTACTCCATTCAACACCAATCTTTGTGGTAATTAA
Protein sequenceShow/hide protein sequence
MALIRDGGLTTRSVRCHECEVFSGSEFKEFRRALISIVTIDESIEDGCSNGQAKSSAYAKHRGSSSCLELHKARHEYDSLSKNVKMLTSSTQSLQNMSDDEKSRPNKMGL
GYSISSSVGTSTTVFIKASSKIDQCTNPLPVERENHIIKKRWVCHFWGKHGHILSFCYRLHGFPSNRKPKFILQEAYCKALDRGFSTTPFNTNLCGN