; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG11G012910 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG11G012910
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionGag-Pol polyprotein/retrotransposon
Genome locationCG_Chr11:26182511..26186483
RNA-Seq ExpressionClCG11G012910
SyntenyClCG11G012910
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601085.1 hypothetical protein SDJN03_06318, partial [Cucurbita argyrosperma subsp. sororia]4.2e-8679.28Show/hide
Query:  MRTLLKLTNGNIRCVVFPDSNCSRRQFPELPRHFAPRTRIS----FFVSRNPSVRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEV
        M T LKLTNG I CVVFP +  ++ Q P+     +P  R+S     FVSRNPS+R CL NAEISANDPLKSE+GFSNHE EGSMEKNEN +KHP+KS EV
Subjt:  MRTLLKLTNGNIRCVVFPDSNCSRRQFPELPRHFAPRTRIS----FFVSRNPSVRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEV

Query:  LDKLRRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFMVKYNFESQGKAFM
        LDKLRRYG+SGILSYGLLNT+YYLTTFLVVWFYIAP PAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALA+APFVDRGLSWF VKYNF+SQGKA +
Subjt:  LDKLRRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFMVKYNFESQGKAFM

Query:  AIVGFCLGLALLLFVAVTLLSA
        AIVGFCLGL+LLLF+AVTLLSA
Subjt:  AIVGFCLGLALLLFVAVTLLSA

XP_008445925.1 PREDICTED: uncharacterized protein LOC103488806 isoform X5 [Cucumis melo]6.7e-9290.5Show/hide
Query:  SNCSRRQFPELPRHFA-PRTRISFFVSRNPSVRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEVLDKLRRYGISGILSYGLLNTLY
        SNCSR QFPELPR F+ P++RI F VSRNPSVRLCLSNA+ISANDPLKSED FSNHE EGSMEKNENR+KHP+KSNEVLDKLRRYG+SGILSYGLLNT Y
Subjt:  SNCSRRQFPELPRHFA-PRTRISFFVSRNPSVRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEVLDKLRRYGISGILSYGLLNTLY

Query:  YLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFMVKYNFESQGKAFMAIVGFCLGLALLLFVAVTLLSA
        YLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWF V YNFESQGKAFMAIVGFCLGLALLLF+ VTLLSA
Subjt:  YLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFMVKYNFESQGKAFMAIVGFCLGLALLLFVAVTLLSA

XP_022956520.1 uncharacterized protein LOC111458238 [Cucurbita moschata]3.6e-8578.83Show/hide
Query:  MRTLLKLTNGNIRCVVFPDSNCSRRQFPELPRHFAPRTRIS----FFVSRNPSVRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEV
        M T LKLTNG I CVVFP +  ++ Q P+     +P  R+S     FVSRNPS+R CL+NAEISANDPLKSE+GFSNHE EGSMEKNEN +KHP+KS EV
Subjt:  MRTLLKLTNGNIRCVVFPDSNCSRRQFPELPRHFAPRTRIS----FFVSRNPSVRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEV

Query:  LDKLRRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFMVKYNFESQGKAFM
        LDKLRRYG+SGILSYGLLNT+YYLTTFLVVWFYIAP PAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALA+APFVDR LSWF VKYNF+SQGKA +
Subjt:  LDKLRRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFMVKYNFESQGKAFM

Query:  AIVGFCLGLALLLFVAVTLLSA
        AIVGFCLGL+LLLF+AVTLLSA
Subjt:  AIVGFCLGLALLLFVAVTLLSA

XP_022983937.1 uncharacterized protein LOC111482401 isoform X2 [Cucurbita maxima]1.1e-8679.73Show/hide
Query:  MRTLLKLTNGNIRCVVFPDSNCSRRQFPELPRHFAPRTRIS----FFVSRNPSVRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEV
        M T LKLTNG I CVVFP +  +  Q P+     +P  R+S     FVSRNPS+R CL+NAEISANDPLKSE GFSNHE EGSMEKNEN +KHPRKS EV
Subjt:  MRTLLKLTNGNIRCVVFPDSNCSRRQFPELPRHFAPRTRIS----FFVSRNPSVRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEV

Query:  LDKLRRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFMVKYNFESQGKAFM
        LDKLRRYG+SGILSYGLLNT+YYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMAT+WAGSQVTKLARAAGALA+APFVDRGLSWF VKYNF+SQGKA +
Subjt:  LDKLRRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFMVKYNFESQGKAFM

Query:  AIVGFCLGLALLLFVAVTLLSA
        AIVGFCLGL+LLLF+AVTLLSA
Subjt:  AIVGFCLGLALLLFVAVTLLSA

XP_023534257.1 uncharacterized protein LOC111795864 isoform X2 [Cucurbita pepo subsp. pepo]1.4e-8679.73Show/hide
Query:  MRTLLKLTNGNIRCVVFPDSNCSRRQFPELPRHFAPRTRIS----FFVSRNPSVRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEV
        M T LKLTNG I CVVFP +  ++ Q P+     +P  R+S     FVSRNPS+R CL+NAEISANDPLKSE+GFSNHE EGSMEKNEN +KHPRKS EV
Subjt:  MRTLLKLTNGNIRCVVFPDSNCSRRQFPELPRHFAPRTRIS----FFVSRNPSVRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEV

Query:  LDKLRRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFMVKYNFESQGKAFM
        LDKLRRYG+SGILSYGLLNT+YYLTTFLVVWFYIAP PAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALA+APFVDRGLSWF VKYNF+SQGKA +
Subjt:  LDKLRRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFMVKYNFESQGKAFM

Query:  AIVGFCLGLALLLFVAVTLLSA
        AIVGFCLGL+LLLF+AVTLLSA
Subjt:  AIVGFCLGLALLLFVAVTLLSA

TrEMBL top hitse value%identityAlignment
A0A1S3BDD7 uncharacterized protein LOC103488806 isoform X53.2e-9290.5Show/hide
Query:  SNCSRRQFPELPRHFA-PRTRISFFVSRNPSVRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEVLDKLRRYGISGILSYGLLNTLY
        SNCSR QFPELPR F+ P++RI F VSRNPSVRLCLSNA+ISANDPLKSED FSNHE EGSMEKNENR+KHP+KSNEVLDKLRRYG+SGILSYGLLNT Y
Subjt:  SNCSRRQFPELPRHFA-PRTRISFFVSRNPSVRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEVLDKLRRYGISGILSYGLLNTLY

Query:  YLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFMVKYNFESQGKAFMAIVGFCLGLALLLFVAVTLLSA
        YLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWF V YNFESQGKAFMAIVGFCLGLALLLF+ VTLLSA
Subjt:  YLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFMVKYNFESQGKAFMAIVGFCLGLALLLFVAVTLLSA

A0A5A7SW01 Uncharacterized protein1.3e-7776.61Show/hide
Query:  MRTLLKLTNGNIRCVVFPDSNCSRRQFPELPRHFAPRTRISFFVSRNPSVRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEVLDKL
        M T  KLTNGNI CV FPDS        EL      R          P+           ANDPLKSED FSNHE EGSMEKNENR+KHP+KSNEVLDKL
Subjt:  MRTLLKLTNGNIRCVVFPDSNCSRRQFPELPRHFAPRTRISFFVSRNPSVRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEVLDKL

Query:  RRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFMVKYNFESQGKAFMAIVG
        RRYG+SGILSYGLLNT YYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWF V YNFESQGKAFMAIVG
Subjt:  RRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFMVKYNFESQGKAFMAIVG

Query:  FCLGLALLLFVAVTLLSA
        FCLGLALLLF+ VTLLSA
Subjt:  FCLGLALLLFVAVTLLSA

A0A6J1GY06 uncharacterized protein LOC1114582381.7e-8578.83Show/hide
Query:  MRTLLKLTNGNIRCVVFPDSNCSRRQFPELPRHFAPRTRIS----FFVSRNPSVRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEV
        M T LKLTNG I CVVFP +  ++ Q P+     +P  R+S     FVSRNPS+R CL+NAEISANDPLKSE+GFSNHE EGSMEKNEN +KHP+KS EV
Subjt:  MRTLLKLTNGNIRCVVFPDSNCSRRQFPELPRHFAPRTRIS----FFVSRNPSVRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEV

Query:  LDKLRRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFMVKYNFESQGKAFM
        LDKLRRYG+SGILSYGLLNT+YYLTTFLVVWFYIAP PAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALA+APFVDR LSWF VKYNF+SQGKA +
Subjt:  LDKLRRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFMVKYNFESQGKAFM

Query:  AIVGFCLGLALLLFVAVTLLSA
        AIVGFCLGL+LLLF+AVTLLSA
Subjt:  AIVGFCLGLALLLFVAVTLLSA

A0A6J1J0R3 uncharacterized protein LOC111482401 isoform X25.4e-8779.73Show/hide
Query:  MRTLLKLTNGNIRCVVFPDSNCSRRQFPELPRHFAPRTRIS----FFVSRNPSVRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEV
        M T LKLTNG I CVVFP +  +  Q P+     +P  R+S     FVSRNPS+R CL+NAEISANDPLKSE GFSNHE EGSMEKNEN +KHPRKS EV
Subjt:  MRTLLKLTNGNIRCVVFPDSNCSRRQFPELPRHFAPRTRIS----FFVSRNPSVRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEV

Query:  LDKLRRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFMVKYNFESQGKAFM
        LDKLRRYG+SGILSYGLLNT+YYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMAT+WAGSQVTKLARAAGALA+APFVDRGLSWF VKYNF+SQGKA +
Subjt:  LDKLRRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFMVKYNFESQGKAFM

Query:  AIVGFCLGLALLLFVAVTLLSA
        AIVGFCLGL+LLLF+AVTLLSA
Subjt:  AIVGFCLGLALLLFVAVTLLSA

A0A6J1J900 uncharacterized protein LOC111482401 isoform X16.6e-7779.19Show/hide
Query:  MRTLLKLTNGNIRCVVFPDSNCSRRQFPELPRHFAPRTRIS----FFVSRNPSVRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEV
        M T LKLTNG I CVVFP +  +  Q P+     +P  R+S     FVSRNPS+R CL+NAEISANDPLKSE GFSNHE EGSMEKNEN +KHPRKS EV
Subjt:  MRTLLKLTNGNIRCVVFPDSNCSRRQFPELPRHFAPRTRIS----FFVSRNPSVRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEV

Query:  LDKLRRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFMVKYNFESQGK
        LDKLRRYG+SGILSYGLLNT+YYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMAT+WAGSQVTKLARAAGALA+APFVDRGLSWF VKYNF+SQGK
Subjt:  LDKLRRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFMVKYNFESQGK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G38695.1 unknown protein5.2e-5062.11Show/hide
Query:  LPRHFAPRTRISFFVSRNPSVRLCLSNAEISANDPLKSEDGFSNHEMEGSM-EKNENREKHPRKSNEVLDKLRRYGISGILSYGLLNTLYYLTTFLVVWF
        LP HF      S F   N S RL  S   +S N   KS+        EG M +KN   +K+P  S E+L KL+RYG+SGILSYGLLNT+YY T FL+VWF
Subjt:  LPRHFAPRTRISFFVSRNPSVRLCLSNAEISANDPLKSEDGFSNHEMEGSM-EKNENREKHPRKSNEVLDKLRRYGISGILSYGLLNTLYYLTTFLVVWF

Query:  YIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFMVKYNFESQGKAFMAIVGFCLGLALLLFVAVTLLSA
        Y+APAP KMGY+AAA RFLK+MA VWAGSQVTKL R  GA+ALAP VDRGLSWF VK NFESQGKAF A+VG CLG+AL+LF+ VTLL A
Subjt:  YIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFMVKYNFESQGKAFMAIVGFCLGLALLLFVAVTLLSA

AT2G38695.2 unknown protein1.9e-2856.83Show/hide
Query:  LPRHFAPRTRISFFVSRNPSVRLCLSNAEISANDPLKSEDGFSNHEMEGSM-EKNENREKHPRKSNEVLDKLRRYGISGILSYGLLNTLYYLTTFLVVWF
        LP HF      S F   N S RL  S   +S N   KS+        EG M +KN   +K+P  S E+L KL+RYG+SGILSYGLLNT+YY T FL+VWF
Subjt:  LPRHFAPRTRISFFVSRNPSVRLCLSNAEISANDPLKSEDGFSNHEMEGSM-EKNENREKHPRKSNEVLDKLRRYGISGILSYGLLNTLYYLTTFLVVWF

Query:  YIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAG
        Y+APAP KMGY+AAA RFLK+MA VWAGSQVTKL R  G
Subjt:  YIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAG

AT2G38695.3 unknown protein3.6e-4349.58Show/hide
Query:  LPRHFAPRTRISFFVSRNPSVRLCLSNAEISANDPLKSEDGFSNHEMEGSM-EKNENREKHPRKSNEVLDKLRRYGISGILSYGLLNTLYYLTTFLVVWF
        LP HF      S F   N S RL  S   +S N   KS+        EG M +KN   +K+P  S E+L KL+RYG+SGILSYGLLNT+YY T FL+VWF
Subjt:  LPRHFAPRTRISFFVSRNPSVRLCLSNAEISANDPLKSEDGFSNHEMEGSM-EKNENREKHPRKSNEVLDKLRRYGISGILSYGLLNTLYYLTTFLVVWF

Query:  YIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAG------------------------------------------------ALALAPFVDRGLS
        Y+APAP KMGY+AAA RFLK+MA VWAGSQVTKL R  G                                                A+ALAP VDRGLS
Subjt:  YIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAG------------------------------------------------ALALAPFVDRGLS

Query:  WFMVKYNFESQGKAFMAIVGFCLGLALLLFVAVTLLSA
        WF VK NFESQGKAF A+VG CLG+AL+LF+ VTLL A
Subjt:  WFMVKYNFESQGKAFMAIVGFCLGLALLLFVAVTLLSA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGAACTCTACTCAAGCTCACCAACGGCAACATCCGTTGCGTCGTTTTCCCGGATTCGAATTGCAGTCGAAGACAATTTCCAGAGCTTCCCCGCCATTTCGCGCCCCG
AACCCGGATAAGCTTTTTTGTTAGTCGGAACCCTAGCGTCCGACTCTGCCTCAGCAATGCCGAAATTAGCGCCAATGATCCATTGAAATCTGAGGATGGCTTTTCCAATC
ACGAAATGGAAGGTTCAATGGAAAAGAATGAGAATCGTGAGAAACATCCCCGGAAATCAAATGAGGTACTGGATAAATTGAGGAGATATGGAATTTCCGGAATATTGTCT
TACGGATTGCTGAATACACTCTACTATCTTACAACATTTCTCGTGGTGTGGTTCTACATTGCACCAGCACCTGCGAAAATGGGCTATGTTGCGGCTGCTGGAAGATTTCT
CAAAATAATGGCTACAGTGTGGGCTGGAAGCCAAGTTACTAAGCTTGCAAGAGCTGCAGGGGCTCTTGCTCTGGCGCCATTTGTCGACAGAGGATTGTCGTGGTTCATGG
TCAAATACAACTTTGAGTCTCAGGGGAAGGCATTTATGGCGATTGTTGGGTTCTGCTTAGGATTGGCTCTCTTGTTATTCGTTGCTGTTACTCTGCTTTCAGCATAA
mRNA sequenceShow/hide mRNA sequence
ATGCGAACTCTACTCAAGCTCACCAACGGCAACATCCGTTGCGTCGTTTTCCCGGATTCGAATTGCAGTCGAAGACAATTTCCAGAGCTTCCCCGCCATTTCGCGCCCCG
AACCCGGATAAGCTTTTTTGTTAGTCGGAACCCTAGCGTCCGACTCTGCCTCAGCAATGCCGAAATTAGCGCCAATGATCCATTGAAATCTGAGGATGGCTTTTCCAATC
ACGAAATGGAAGGTTCAATGGAAAAGAATGAGAATCGTGAGAAACATCCCCGGAAATCAAATGAGGTACTGGATAAATTGAGGAGATATGGAATTTCCGGAATATTGTCT
TACGGATTGCTGAATACACTCTACTATCTTACAACATTTCTCGTGGTGTGGTTCTACATTGCACCAGCACCTGCGAAAATGGGCTATGTTGCGGCTGCTGGAAGATTTCT
CAAAATAATGGCTACAGTGTGGGCTGGAAGCCAAGTTACTAAGCTTGCAAGAGCTGCAGGGGCTCTTGCTCTGGCGCCATTTGTCGACAGAGGATTGTCGTGGTTCATGG
TCAAATACAACTTTGAGTCTCAGGGGAAGGCATTTATGGCGATTGTTGGGTTCTGCTTAGGATTGGCTCTCTTGTTATTCGTTGCTGTTACTCTGCTTTCAGCATAA
Protein sequenceShow/hide protein sequence
MRTLLKLTNGNIRCVVFPDSNCSRRQFPELPRHFAPRTRISFFVSRNPSVRLCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEVLDKLRRYGISGILS
YGLLNTLYYLTTFLVVWFYIAPAPAKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFMVKYNFESQGKAFMAIVGFCLGLALLLFVAVTLLSA