CuGenDBv2

ClCG01G018480 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG01G018480
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: CG_Chr01:32984586..32986259
RNA-Seq Expression: ClCG01G018480
Synteny: ClCG01G018480
Gene Ontology terms: NA
InterPro domains: NA
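The genome location string above can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re

def parse_location(loc):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

seqid, start, end = parse_location("CG_Chr01:32984586..32986259")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene occupies 1674 bp on CG_Chr01.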


Homology
GenBank top hits: e value, % identity, alignment
XP_004143553.1 uncharacterized protein LOC101209623 isoform X1 [Cucumis sativus]  e value: 9.2e-50  % identity: 82.01
Query:  VNEEEKGLKH-AISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQDRKNSGHIKPPVKTKKDLLEVNISIKMDGGTMMEIRETKKEAPPPPES
        V  EEKGLKH  ISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLY  RKNS HI P  K KK+LLEVNISIKMDGGT MEI+ETKKEA P PES
Subjt:  VNEEEKGLKH-AISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQDRKNSGHIKPPVKTKKDLLEVNISIKMDGGTMMEIRETKKEAPPPPES

Query:  LRPRTSRSDTLARAVPEMKRLDWAKSLRSSPAQLPFVGK
        LRPRTSRS+T +R+VPEMKRLDWAKSLRSS A +P +GK
Subjt:  LRPRTSRSDTLARAVPEMKRLDWAKSLRSSPAQLPFVGK

XP_008440564.1 PREDICTED: uncharacterized protein LOC103484942 [Cucumis melo]  e value: 6.8e-53  % identity: 85.61
Query:  VNEEEKGLKH-AISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQDRKNSGHIKPPVKTKKDLLEVNISIKMDGGTMMEIRETKKEAPPPPES
        V  EEKGLKH  ISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLY  RKNS HIKP  K  K+LLEVNISIKMDGGT MEIRETKKEA PPPES
Subjt:  VNEEEKGLKH-AISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQDRKNSGHIKPPVKTKKDLLEVNISIKMDGGTMMEIRETKKEAPPPPES

Query:  LRPRTSRSDTLARAVPEMKRLDWAKSLRSSPAQLPFVGK
        LRPRTSRS+TL+R+VPEMKRLDWAKSLRSS A +PFVGK
Subjt:  LRPRTSRSDTLARAVPEMKRLDWAKSLRSSPAQLPFVGK

XP_022962743.1 uncharacterized protein LOC111463142 isoform X1 [Cucurbita moschata]  e value: 2.1e-49  % identity: 73.01
Query:  EKGLKHAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQDRKNSGHIKPPVKTKKDLLEVNISIKMDGGTMMEIRETKKEAPPPPESLRPRT
        EKGL HAISEEEMKIRREIE EIERDLEEEIKGGIYQQALRL RLYQ R  SG I PPVKTKK+LLEVNISIKM+GGT MEI++     PPPP S R RT
Subjt:  EKGLKHAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQDRKNSGHIKPPVKTKKDLLEVNISIKMDGGTMMEIRETKKEAPPPPESLRPRT

Query:  SRSDTLARAVPEMKRLDWAKSLRSSPAQLPFVGKNVRFDRNKM-AVPGNDEFQCYKPNYSLKR
        +RSDT +R VPE+KRLDW KSLRSS A  P VGKNV F RNKM  VP +     Y+PNYS KR
Subjt:  SRSDTLARAVPEMKRLDWAKSLRSSPAQLPFVGKNVRFDRNKM-AVPGNDEFQCYKPNYSLKR

XP_031742667.1 uncharacterized protein LOC101209623 isoform X2 [Cucumis sativus]  e value: 1.6e-49  % identity: 83.09
Query:  EEKGLKH-AISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQDRKNSGHIKPPVKTKKDLLEVNISIKMDGGTMMEIRETKKEAPPPPESLRP
        EEKGLKH  ISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLY  RKNS HI P  K KK+LLEVNISIKMDGGT MEI+ETKKEA P PESLRP
Subjt:  EEKGLKH-AISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQDRKNSGHIKPPVKTKKDLLEVNISIKMDGGTMMEIRETKKEAPPPPESLRP

Query:  RTSRSDTLARAVPEMKRLDWAKSLRSSPAQLPFVGK
        RTSRS+T +R+VPEMKRLDWAKSLRSS A +P +GK
Subjt:  RTSRSDTLARAVPEMKRLDWAKSLRSSPAQLPFVGK

XP_038883628.1 uncharacterized protein LOC120074545 [Benincasa hispida]  e value: 3.2e-58  % identity: 88
Query:  VNEEEKGLKHAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQDRKNSGHIKPPVKTKKDLLEVNISIKMDGGTMMEIRETKKEAPPPPESL
        V  E+KGL+HAISEEEMKIRREIENEIERDLEEEIKGGIYQ ALRL+RLYQ RK SGHIK P+KTKKDLLEVNISIKMDGGTMMEIRETKK A PPPESL
Subjt:  VNEEEKGLKHAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQDRKNSGHIKPPVKTKKDLLEVNISIKMDGGTMMEIRETKKEAPPPPESL

Query:  RPRTSRSDTLARAVPEMKRLDWAKSLRSSPAQLPFVGKNVRFDRNKMAVP
        RPRTSRSDTLA AVPEMKRLDWAKSLRSS A  PFV +NVRFDRNKMAVP
Subjt:  RPRTSRSDTLARAVPEMKRLDWAKSLRSSPAQLPFVGKNVRFDRNKMAVP

TrEMBL top hits: e value, % identity, alignment
A0A0A0KJZ6 Uncharacterized protein  e value: 4.5e-50  % identity: 82.01
Query:  VNEEEKGLKH-AISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQDRKNSGHIKPPVKTKKDLLEVNISIKMDGGTMMEIRETKKEAPPPPES
        V  EEKGLKH  ISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLY  RKNS HI P  K KK+LLEVNISIKMDGGT MEI+ETKKEA P PES
Subjt:  VNEEEKGLKH-AISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQDRKNSGHIKPPVKTKKDLLEVNISIKMDGGTMMEIRETKKEAPPPPES

Query:  LRPRTSRSDTLARAVPEMKRLDWAKSLRSSPAQLPFVGK
        LRPRTSRS+T +R+VPEMKRLDWAKSLRSS A +P +GK
Subjt:  LRPRTSRSDTLARAVPEMKRLDWAKSLRSSPAQLPFVGK

A0A1S3B1F7 uncharacterized protein LOC103484942  e value: 3.3e-53  % identity: 85.61
Query:  VNEEEKGLKH-AISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQDRKNSGHIKPPVKTKKDLLEVNISIKMDGGTMMEIRETKKEAPPPPES
        V  EEKGLKH  ISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLY  RKNS HIKP  K  K+LLEVNISIKMDGGT MEIRETKKEA PPPES
Subjt:  VNEEEKGLKH-AISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQDRKNSGHIKPPVKTKKDLLEVNISIKMDGGTMMEIRETKKEAPPPPES

Query:  LRPRTSRSDTLARAVPEMKRLDWAKSLRSSPAQLPFVGK
        LRPRTSRS+TL+R+VPEMKRLDWAKSLRSS A +PFVGK
Subjt:  LRPRTSRSDTLARAVPEMKRLDWAKSLRSSPAQLPFVGK

A0A6J1HFP1 uncharacterized protein LOC111463142 isoform X1  e value: 9.9e-50  % identity: 73.01
Query:  EKGLKHAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQDRKNSGHIKPPVKTKKDLLEVNISIKMDGGTMMEIRETKKEAPPPPESLRPRT
        EKGL HAISEEEMKIRREIE EIERDLEEEIKGGIYQQALRL RLYQ R  SG I PPVKTKK+LLEVNISIKM+GGT MEI++     PPPP S R RT
Subjt:  EKGLKHAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQDRKNSGHIKPPVKTKKDLLEVNISIKMDGGTMMEIRETKKEAPPPPESLRPRT

Query:  SRSDTLARAVPEMKRLDWAKSLRSSPAQLPFVGKNVRFDRNKM-AVPGNDEFQCYKPNYSLKR
        +RSDT +R VPE+KRLDW KSLRSS A  P VGKNV F RNKM  VP +     Y+PNYS KR
Subjt:  SRSDTLARAVPEMKRLDWAKSLRSSPAQLPFVGKNVRFDRNKM-AVPGNDEFQCYKPNYSLKR

A0A6J1HHY9 uncharacterized protein LOC111463142 isoform X2  e value: 9.9e-50  % identity: 73.01
Query:  EKGLKHAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQDRKNSGHIKPPVKTKKDLLEVNISIKMDGGTMMEIRETKKEAPPPPESLRPRT
        EKGL HAISEEEMKIRREIE EIERDLEEEIKGGIYQQALRL RLYQ R  SG I PPVKTKK+LLEVNISIKM+GGT MEI++     PPPP S R RT
Subjt:  EKGLKHAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQDRKNSGHIKPPVKTKKDLLEVNISIKMDGGTMMEIRETKKEAPPPPESLRPRT

Query:  SRSDTLARAVPEMKRLDWAKSLRSSPAQLPFVGKNVRFDRNKM-AVPGNDEFQCYKPNYSLKR
        +RSDT +R VPE+KRLDW KSLRSS A  P VGKNV F RNKM  VP +     Y+PNYS KR
Subjt:  SRSDTLARAVPEMKRLDWAKSLRSSPAQLPFVGKNVRFDRNKM-AVPGNDEFQCYKPNYSLKR

A0A6J1KUF8 uncharacterized protein LOC111497292  e value: 1.4e-48  % identity: 71.78
Query:  EKGLKHAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQDRKNSGHIKPPVKTKKDLLEVNISIKMDGGTMMEIRETKKEAPPPPESLRPRT
        EKGL HAISEEEMKIRREIE EIERDLEEEIKGGIYQQALRL RLYQ R  SG I PP+KTKK+LLEVNISIKM+GGT MEI+E     PPPP +   RT
Subjt:  EKGLKHAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQDRKNSGHIKPPVKTKKDLLEVNISIKMDGGTMMEIRETKKEAPPPPESLRPRT

Query:  SRSDTLARAVPEMKRLDWAKSLRSSPAQLPFVGKNVRFDRNKM-AVPGNDEFQCYKPNYSLKR
        +RSDT +R VPE+KRLDW KSLRSS A  P VGKNV F RNKM  VP +     Y+PNYS KR
Subjt:  SRSDTLARAVPEMKRLDWAKSLRSSPAQLPFVGKNVRFDRNKM-AVPGNDEFQCYKPNYSLKR

SwissProt top hits: e value, % identity, alignment
No hits found

Arabidopsis top hits: e value, % identity, alignment
AT1G26290.1 unknown protein  e value: 1.3e-12  % identity: 50
Query:  ISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQDRKNSGHI-KPPVKTKKDLLEVNISIKMDGGTMMEIRETKKE
        +S++E +IR E+E EIER+LE E K GIY  AL+L RLY+ R+    +    ++  K +LEVNI+IKM+G T +EI E KKE
Subjt:  ISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQDRKNSGHI-KPPVKTKKDLLEVNISIKMDGGTMMEIRETKKE


Sequences
CDS sequence
ATGTGTCACTTTTGCCATCGAATCGATCTCCTCATAATAATGCTCATGCAAAAGGTAAATGAGGAGGAGAAGGGTTTAAAGCATGCAATTTCAGAAGAAGAGATGAAGAT
CCGGCGAGAAATTGAGAATGAAATAGAGAGAGATTTGGAGGAAGAAATTAAGGGAGGAATTTACCAACAAGCTCTCCGCCTGCACCGCCTTTATCAAGACCGGAAAAATT
CTGGTCATATTAAGCCGCCGGTGAAAACCAAGAAAGACCTTTTGGAGGTAAATATCAGCATCAAAATGGATGGAGGGACGATGATGGAAATAAGAGAGACCAAAAAGGAG
GCGCCGCCGCCGCCAGAAAGCCTCCGGCCGAGGACTTCCCGGTCCGACACCCTTGCTCGAGCAGTCCCAGAGATGAAAAGGTTGGATTGGGCGAAGTCTCTCCGGTCGAG
TCCGGCGCAGCTGCCATTTGTCGGGAAAAACGTCAGGTTTGATCGGAATAAAATGGCGGTGCCGGGCAATGATGAATTTCAGTGCTATAAACCTAATTATTCGTTAAAAA
GGCGAACTTGA
mRNA sequence
GTAAAAACCTCCAAAAGCCATAAAGTCCTTCACCCATGGACCAGCTCCACAATTGCTTCAACCCCAACATCCTAGGGTTGACTGATGTGTCACTTTTGCCATCGAATCGA
TCTCCTCATAATAATGCTCATGCAAAAGGTAAATGAGGAGGAGAAGGGTTTAAAGCATGCAATTTCAGAAGAAGAGATGAAGATCCGGCGAGAAATTGAGAATGAAATAG
AGAGAGATTTGGAGGAAGAAATTAAGGGAGGAATTTACCAACAAGCTCTCCGCCTGCACCGCCTTTATCAAGACCGGAAAAATTCTGGTCATATTAAGCCGCCGGTGAAA
ACCAAGAAAGACCTTTTGGAGGTAAATATCAGCATCAAAATGGATGGAGGGACGATGATGGAAATAAGAGAGACCAAAAAGGAGGCGCCGCCGCCGCCAGAAAGCCTCCG
GCCGAGGACTTCCCGGTCCGACACCCTTGCTCGAGCAGTCCCAGAGATGAAAAGGTTGGATTGGGCGAAGTCTCTCCGGTCGAGTCCGGCGCAGCTGCCATTTGTCGGGA
AAAACGTCAGGTTTGATCGGAATAAAATGGCGGTGCCGGGCAATGATGAATTTCAGTGCTATAAACCTAATTATTCGTTAAAAAGGCGAACTTGAAATTTTATTTAATTT
GTTAATTTTGATGTTGGCAATGGCATGCAGCTGTGGTTTCATTTGTTGGTATTTTTGGATTTCTTTCCTTTTTCAGTTAAGAGCAAAGTTTTATACCAATTGAGTTACCT
TTTAAGATATATTTGGAAGTGTTGAGTATTGTATAAAGTATATACATGATTTCTCTCTATGCAACTTATTAATTATCGTGCAGGAGATAATTCTCTTA
Protein sequence
MCHFCHRIDLLIIMLMQKVNEEEKGLKHAISEEEMKIRREIENEIERDLEEEIKGGIYQQALRLHRLYQDRKNSGHIKPPVKTKKDLLEVNISIKMDGGTMMEIRETKKE
APPPPESLRPRTSRSDTLARAVPEMKRLDWAKSLRSSPAQLPFVGKNVRFDRNKMAVPGNDEFQCYKPNYSLKRRT
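
The relationship between the CDS and the protein sequence listed above can be checked with a short translation routine. This is a minimal sketch that assumes the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative and do not come from the database. To keep the example short it translates only the first 45 and the last 21 nucleotides of the CDS, the latter read in the frame that ends on the stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate DNA codon by codon; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 45 nt and last 21 nt of the CDS sequence shown above.
cds_start = "ATGTGTCACTTTTGCCATCGAATCGATCTCCTCATAATAATGCTC"
cds_end = "TCGTTAAAAAGGCGAACTTGA"

print(translate(cds_start))  # MCHFCHRIDLLIIML
print(translate(cds_end))    # SLKRRT*
```

Both fragments agree with the start (MCHFCHRIDLLIIML) and the end (SLKRRT) of the listed protein sequence, with the final TGA codon read as the stop.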