; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC11G220340 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC11G220340
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionGag-Pol polyprotein/retrotransposon
Genome locationCicolChr11:24797601..24801520
RNA-Seq ExpressionCcUC11G220340
SyntenyCcUC11G220340
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601085.1 hypothetical protein SDJN03_06318, partial [Cucurbita argyrosperma subsp. sororia]1.0e-8478.38Show/hide
Query:  MRTLLKLTNGNIRCVVSPDSNCSRRQFPELPRHFAPRSRIS----FFVSRNPSVRFCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEV
        M T LKLTNG I CVV P +  ++ Q P+     +P  R+S     FVSRNPS+R CL NAEISANDPLKSE+GFSNHE EGSMEKNEN +KHP+KS EV
Subjt:  MRTLLKLTNGNIRCVVSPDSNCSRRQFPELPRHFAPRSRIS----FFVSRNPSVRFCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEV

Query:  LDKLRRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFM
        LDKLRRYG+SGILSYGLLNT+YYLTTFLVVWFYIAP P KMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALA+APFVDRGLSWFTVKYNF+SQGKA +
Subjt:  LDKLRRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFM

Query:  AIVGFCLGLALLLFVAVTLFSA
        AIVGFCLGL+LLLF+AVTL SA
Subjt:  AIVGFCLGLALLLFVAVTLFSA

XP_008445925.1 PREDICTED: uncharacterized protein LOC103488806 isoform X5 [Cucumis melo]1.5e-9190Show/hide
Query:  SNCSRRQFPELPRHFA-PRSRISFFVSRNPSVRFCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEVLDKLRRYGISGILSYGLLNTLY
        SNCSR QFPELPR F+ P+SRI F VSRNPSVR CLSNA+ISANDPLKSED FSNHE EGSMEKNENR+KHP+KSNEVLDKLRRYG+SGILSYGLLNT Y
Subjt:  SNCSRRQFPELPRHFA-PRSRISFFVSRNPSVRFCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEVLDKLRRYGISGILSYGLLNTLY

Query:  YLTTFLVVWFYIAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCLGLALLLFVAVTLFSA
        YLTTFLVVWFYIAPAP KMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTV YNFESQGKAFMAIVGFCLGLALLLF+ VTL SA
Subjt:  YLTTFLVVWFYIAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCLGLALLLFVAVTLFSA

XP_022956520.1 uncharacterized protein LOC111458238 [Cucurbita moschata]8.8e-8477.93Show/hide
Query:  MRTLLKLTNGNIRCVVSPDSNCSRRQFPELPRHFAPRSRIS----FFVSRNPSVRFCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEV
        M T LKLTNG I CVV P +  ++ Q P+     +P  R+S     FVSRNPS+R CL+NAEISANDPLKSE+GFSNHE EGSMEKNEN +KHP+KS EV
Subjt:  MRTLLKLTNGNIRCVVSPDSNCSRRQFPELPRHFAPRSRIS----FFVSRNPSVRFCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEV

Query:  LDKLRRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFM
        LDKLRRYG+SGILSYGLLNT+YYLTTFLVVWFYIAP P KMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALA+APFVDR LSWFTVKYNF+SQGKA +
Subjt:  LDKLRRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFM

Query:  AIVGFCLGLALLLFVAVTLFSA
        AIVGFCLGL+LLLF+AVTL SA
Subjt:  AIVGFCLGLALLLFVAVTLFSA

XP_022983937.1 uncharacterized protein LOC111482401 isoform X2 [Cucurbita maxima]2.7e-8578.83Show/hide
Query:  MRTLLKLTNGNIRCVVSPDSNCSRRQFPELPRHFAPRSRIS----FFVSRNPSVRFCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEV
        M T LKLTNG I CVV P +  +  Q P+     +P  R+S     FVSRNPS+R CL+NAEISANDPLKSE GFSNHE EGSMEKNEN +KHPRKS EV
Subjt:  MRTLLKLTNGNIRCVVSPDSNCSRRQFPELPRHFAPRSRIS----FFVSRNPSVRFCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEV

Query:  LDKLRRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFM
        LDKLRRYG+SGILSYGLLNT+YYLTTFLVVWFYIAPAP KMGYVAAAGRFLKIMAT+WAGSQVTKLARAAGALA+APFVDRGLSWFTVKYNF+SQGKA +
Subjt:  LDKLRRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFM

Query:  AIVGFCLGLALLLFVAVTLFSA
        AIVGFCLGL+LLLF+AVTL SA
Subjt:  AIVGFCLGLALLLFVAVTLFSA

XP_023534257.1 uncharacterized protein LOC111795864 isoform X2 [Cucurbita pepo subsp. pepo]3.6e-8578.83Show/hide
Query:  MRTLLKLTNGNIRCVVSPDSNCSRRQFPELPRHFAPRSRIS----FFVSRNPSVRFCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEV
        M T LKLTNG I CVV P +  ++ Q P+     +P  R+S     FVSRNPS+R CL+NAEISANDPLKSE+GFSNHE EGSMEKNEN +KHPRKS EV
Subjt:  MRTLLKLTNGNIRCVVSPDSNCSRRQFPELPRHFAPRSRIS----FFVSRNPSVRFCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEV

Query:  LDKLRRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFM
        LDKLRRYG+SGILSYGLLNT+YYLTTFLVVWFYIAP P KMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALA+APFVDRGLSWFTVKYNF+SQGKA +
Subjt:  LDKLRRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFM

Query:  AIVGFCLGLALLLFVAVTLFSA
        AIVGFCLGL+LLLF+AVTL SA
Subjt:  AIVGFCLGLALLLFVAVTLFSA

TrEMBL top hitse value%identityAlignment
A0A1S3BDD7 uncharacterized protein LOC103488806 isoform X57.2e-9290Show/hide
Query:  SNCSRRQFPELPRHFA-PRSRISFFVSRNPSVRFCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEVLDKLRRYGISGILSYGLLNTLY
        SNCSR QFPELPR F+ P+SRI F VSRNPSVR CLSNA+ISANDPLKSED FSNHE EGSMEKNENR+KHP+KSNEVLDKLRRYG+SGILSYGLLNT Y
Subjt:  SNCSRRQFPELPRHFA-PRSRISFFVSRNPSVRFCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEVLDKLRRYGISGILSYGLLNTLY

Query:  YLTTFLVVWFYIAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCLGLALLLFVAVTLFSA
        YLTTFLVVWFYIAPAP KMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTV YNFESQGKAFMAIVGFCLGLALLLF+ VTL SA
Subjt:  YLTTFLVVWFYIAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCLGLALLLFVAVTLFSA

A0A5A7SW01 Uncharacterized protein1.5e-7675.69Show/hide
Query:  MRTLLKLTNGNIRCVVSPDSNCSRRQFPELPRHFAPRSRISFFVSRNPSVRFCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEVLDKL
        M T  KLTNGNI CV  PDS        EL      R+         P+           ANDPLKSED FSNHE EGSMEKNENR+KHP+KSNEVLDKL
Subjt:  MRTLLKLTNGNIRCVVSPDSNCSRRQFPELPRHFAPRSRISFFVSRNPSVRFCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEVLDKL

Query:  RRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVG
        RRYG+SGILSYGLLNT YYLTTFLVVWFYIAPAP KMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTV YNFESQGKAFMAIVG
Subjt:  RRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVG

Query:  FCLGLALLLFVAVTLFSA
        FCLGLALLLF+ VTL SA
Subjt:  FCLGLALLLFVAVTLFSA

A0A6J1GY06 uncharacterized protein LOC1114582384.2e-8477.93Show/hide
Query:  MRTLLKLTNGNIRCVVSPDSNCSRRQFPELPRHFAPRSRIS----FFVSRNPSVRFCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEV
        M T LKLTNG I CVV P +  ++ Q P+     +P  R+S     FVSRNPS+R CL+NAEISANDPLKSE+GFSNHE EGSMEKNEN +KHP+KS EV
Subjt:  MRTLLKLTNGNIRCVVSPDSNCSRRQFPELPRHFAPRSRIS----FFVSRNPSVRFCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEV

Query:  LDKLRRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFM
        LDKLRRYG+SGILSYGLLNT+YYLTTFLVVWFYIAP P KMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALA+APFVDR LSWFTVKYNF+SQGKA +
Subjt:  LDKLRRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFM

Query:  AIVGFCLGLALLLFVAVTLFSA
        AIVGFCLGL+LLLF+AVTL SA
Subjt:  AIVGFCLGLALLLFVAVTLFSA

A0A6J1J0R3 uncharacterized protein LOC111482401 isoform X21.3e-8578.83Show/hide
Query:  MRTLLKLTNGNIRCVVSPDSNCSRRQFPELPRHFAPRSRIS----FFVSRNPSVRFCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEV
        M T LKLTNG I CVV P +  +  Q P+     +P  R+S     FVSRNPS+R CL+NAEISANDPLKSE GFSNHE EGSMEKNEN +KHPRKS EV
Subjt:  MRTLLKLTNGNIRCVVSPDSNCSRRQFPELPRHFAPRSRIS----FFVSRNPSVRFCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEV

Query:  LDKLRRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFM
        LDKLRRYG+SGILSYGLLNT+YYLTTFLVVWFYIAPAP KMGYVAAAGRFLKIMAT+WAGSQVTKLARAAGALA+APFVDRGLSWFTVKYNF+SQGKA +
Subjt:  LDKLRRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFM

Query:  AIVGFCLGLALLLFVAVTLFSA
        AIVGFCLGL+LLLF+AVTL SA
Subjt:  AIVGFCLGLALLLFVAVTLFSA

A0A6J1J900 uncharacterized protein LOC111482401 isoform X19.5e-7678.68Show/hide
Query:  MRTLLKLTNGNIRCVVSPDSNCSRRQFPELPRHFAPRSRIS----FFVSRNPSVRFCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEV
        M T LKLTNG I CVV P +  +  Q P+     +P  R+S     FVSRNPS+R CL+NAEISANDPLKSE GFSNHE EGSMEKNEN +KHPRKS EV
Subjt:  MRTLLKLTNGNIRCVVSPDSNCSRRQFPELPRHFAPRSRIS----FFVSRNPSVRFCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEV

Query:  LDKLRRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGK
        LDKLRRYG+SGILSYGLLNT+YYLTTFLVVWFYIAPAP KMGYVAAAGRFLKIMAT+WAGSQVTKLARAAGALA+APFVDRGLSWFTVKYNF+SQGK
Subjt:  LDKLRRYGISGILSYGLLNTLYYLTTFLVVWFYIAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G38695.1 unknown protein1.2e-4959.3Show/hide
Query:  CSRRQFPELPRHFAPRSRI-SFFVSRNPSVRFCLSNAEISANDPLKSEDGFSNHEMEGSM-EKNENREKHPRKSNEVLDKLRRYGISGILSYGLLNTLYY
        CS R F  L      +S +   F   N S R   S   +S N   KS+        EG M +KN   +K+P  S E+L KL+RYG+SGILSYGLLNT+YY
Subjt:  CSRRQFPELPRHFAPRSRI-SFFVSRNPSVRFCLSNAEISANDPLKSEDGFSNHEMEGSM-EKNENREKHPRKSNEVLDKLRRYGISGILSYGLLNTLYY

Query:  LTTFLVVWFYIAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCLGLALLLFVAVTLFSA
         T FL+VWFY+APAP KMGY+AAA RFLK+MA VWAGSQVTKL R  GA+ALAP VDRGLSWFTVK NFESQGKAF A+VG CLG+AL+LF+ VTL  A
Subjt:  LTTFLVVWFYIAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCLGLALLLFVAVTLFSA

AT2G38695.2 unknown protein1.6e-2753.38Show/hide
Query:  CSRRQFPELPRHFAPRSRI-SFFVSRNPSVRFCLSNAEISANDPLKSEDGFSNHEMEGSM-EKNENREKHPRKSNEVLDKLRRYGISGILSYGLLNTLYY
        CS R F  L      +S +   F   N S R   S   +S N   KS+        EG M +KN   +K+P  S E+L KL+RYG+SGILSYGLLNT+YY
Subjt:  CSRRQFPELPRHFAPRSRI-SFFVSRNPSVRFCLSNAEISANDPLKSEDGFSNHEMEGSM-EKNENREKHPRKSNEVLDKLRRYGISGILSYGLLNTLYY

Query:  LTTFLVVWFYIAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAG
         T FL+VWFY+APAP KMGY+AAA RFLK+MA VWAGSQVTKL R  G
Subjt:  LTTFLVVWFYIAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAG

AT2G38695.3 unknown protein8.0e-4347.77Show/hide
Query:  CSRRQFPELPRHFAPRSRI-SFFVSRNPSVRFCLSNAEISANDPLKSEDGFSNHEMEGSM-EKNENREKHPRKSNEVLDKLRRYGISGILSYGLLNTLYY
        CS R F  L      +S +   F   N S R   S   +S N   KS+        EG M +KN   +K+P  S E+L KL+RYG+SGILSYGLLNT+YY
Subjt:  CSRRQFPELPRHFAPRSRI-SFFVSRNPSVRFCLSNAEISANDPLKSEDGFSNHEMEGSM-EKNENREKHPRKSNEVLDKLRRYGISGILSYGLLNTLYY

Query:  LTTFLVVWFYIAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAG------------------------------------------------ALAL
         T FL+VWFY+APAP KMGY+AAA RFLK+MA VWAGSQVTKL R  G                                                A+AL
Subjt:  LTTFLVVWFYIAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAG------------------------------------------------ALAL

Query:  APFVDRGLSWFTVKYNFESQGKAFMAIVGFCLGLALLLFVAVTLFSA
        AP VDRGLSWFTVK NFESQGKAF A+VG CLG+AL+LF+ VTL  A
Subjt:  APFVDRGLSWFTVKYNFESQGKAFMAIVGFCLGLALLLFVAVTLFSA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGAACTCTACTCAAGCTCACGAACGGCAACATCCGTTGCGTCGTTTCCCCGGATTCGAATTGCAGTCGAAGACAATTTCCAGAGCTTCCCCGCCATTTCGCG
CCCCGAAGCCGGATTAGCTTTTTTGTTAGTCGGAACCCTAGCGTCCGATTCTGCCTCAGCAATGCCGAAATTAGCGCCAACGATCCATTGAAATCTGAGGATGGC
TTTTCCAATCACGAAATGGAAGGTTCAATGGAAAAGAATGAGAATCGTGAGAAACATCCCCGGAAATCAAATGAGGTACTGGATAAATTGAGGAGATATGGAATT
TCCGGAATATTGTCTTACGGATTGCTGAATACACTCTACTATCTTACAACATTTCTCGTGGTGTGGTTCTACATTGCACCAGCACCTGTGAAAATGGGCTATGTT
GCGGCTGCTGGAAGATTTCTCAAAATAATGGCTACAGTGTGGGCTGGAAGCCAAGTTACGAAGCTTGCAAGAGCTGCAGGGGCTCTTGCTCTGGCGCCATTTGTC
GACAGAGGATTGTCGTGGTTCACGGTTAAATACAACTTTGAGTCTCAGGGGAAGGCATTTATGGCGATTGTTGGGTTCTGCTTAGGATTGGCTCTCTTGTTATTC
GTTGCTGTTACTCTGTTTTCAGCATAA
mRNA sequenceShow/hide mRNA sequence
ATGCGAACTCTACTCAAGCTCACGAACGGCAACATCCGTTGCGTCGTTTCCCCGGATTCGAATTGCAGTCGAAGACAATTTCCAGAGCTTCCCCGCCATTTCGCG
CCCCGAAGCCGGATTAGCTTTTTTGTTAGTCGGAACCCTAGCGTCCGATTCTGCCTCAGCAATGCCGAAATTAGCGCCAACGATCCATTGAAATCTGAGGATGGC
TTTTCCAATCACGAAATGGAAGGTTCAATGGAAAAGAATGAGAATCGTGAGAAACATCCCCGGAAATCAAATGAGGTACTGGATAAATTGAGGAGATATGGAATT
TCCGGAATATTGTCTTACGGATTGCTGAATACACTCTACTATCTTACAACATTTCTCGTGGTGTGGTTCTACATTGCACCAGCACCTGTGAAAATGGGCTATGTT
GCGGCTGCTGGAAGATTTCTCAAAATAATGGCTACAGTGTGGGCTGGAAGCCAAGTTACGAAGCTTGCAAGAGCTGCAGGGGCTCTTGCTCTGGCGCCATTTGTC
GACAGAGGATTGTCGTGGTTCACGGTTAAATACAACTTTGAGTCTCAGGGGAAGGCATTTATGGCGATTGTTGGGTTCTGCTTAGGATTGGCTCTCTTGTTATTC
GTTGCTGTTACTCTGTTTTCAGCATAA
Protein sequenceShow/hide protein sequence
MRTLLKLTNGNIRCVVSPDSNCSRRQFPELPRHFAPRSRISFFVSRNPSVRFCLSNAEISANDPLKSEDGFSNHEMEGSMEKNENREKHPRKSNEVLDKLRRYGI
SGILSYGLLNTLYYLTTFLVVWFYIAPAPVKMGYVAAAGRFLKIMATVWAGSQVTKLARAAGALALAPFVDRGLSWFTVKYNFESQGKAFMAIVGFCLGLALLLF
VAVTLFSA