CuGenDBv2

ClCG06G010690 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG06G010690
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: CG_Chr06:22999582..23000112
RNA-Seq Expression: ClCG06G010690
Synteny: ClCG06G010690
Gene Ontology terms: NA
InterPro domains: NA
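The genome location string can be split into a chromosome name and a coordinate pair. The sketch below is illustrative only: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re


def parse_location(location: str):
    """Split 'chrom:start..end' into its parts and compute the span length.

    Assumes 1-based, inclusive coordinates (not stated in the record).
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


if __name__ == "__main__":
    print(parse_location("CG_Chr06:22999582..23000112"))
```

Under the inclusive-coordinate assumption, the locus spans 531 bp.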


Homology
GenBank top hits (e-value, % identity, alignment)
CAN77126.1 hypothetical protein VITISV_013628 [Vitis vinifera]; e-value 5.8e-21; identity 44.53%
Query:  PAPSRFLDPQQLQPNPDFLVWER--------------------KDGLSVSQYLAQIKEVADKFSAIGELISYGDHIAHILDGLGSEYNAFVTSIQNQSDQ
        P P++FLD  QLQ NP F+ WER                    K+G+++S+YLA+IKEV DK+SA+GE +SY D + + L+GL  EY+ FVTSI N+SD+
Subjt:  PAPSRFLDPQQLQPNPDFLVWER--------------------KDGLSVSQYLAQIKEVADKFSAIGELISYGDHIAHILDGLGSEYNAFVTSIQNQSDQ

Query:  PSLEDVRSLLLAYESRLEKQNLVNQLNVAQANFANLS
         SL++V SLL  Y   LE++N   QL  +Q N    S
Subjt:  PSLEDVRSLLLAYESRLEKQNLVNQLNVAQANFANLS

XP_022148871.1 uncharacterized protein LOC111017438 [Momordica charantia]; e-value 6.4e-28; identity 71.88%
Query:  RKDGLSVSQYLAQIKEVADKFSAIGELISYGDHIAHILDGLGSEYNAFVTSIQNQSDQPSLEDVRSLLLAYESRLEKQNLVNQLNVAQANFANLSL
        +KDGLSVSQYLA+IKE+  K S+IGE IS  DHI++I++GLG EYNAFVTSIQN+SD  +LEDVR+LLLAY+ RLEKQN V+QLNV QAN ANL L
Subjt:  RKDGLSVSQYLAQIKEVADKFSAIGELISYGDHIAHILDGLGSEYNAFVTSIQNQSDQPSLEDVRSLLLAYESRLEKQNLVNQLNVAQANFANLSL

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]; e-value 3.5e-34; identity 50.29%
Query:  PSRFLDPQQLQPNPDFLVWE---------------------------------------------------------RKDGLSVSQYLAQIKEVADKFSA
        P +FLD  QLQPNP +  WE                                                         RKDG SVSQYLA+IKE+ADKF+A
Subjt:  PSRFLDPQQLQPNPDFLVWE---------------------------------------------------------RKDGLSVSQYLAQIKEVADKFSA

Query:  IGELISYGDHIAHILDGLGSEYNAFVTSIQNQSDQPSLEDVRSLLLAYESRLEKQNLVNQLNVAQANFANLSL
        +GE +SY DH+AH+LDGLGSEYNAFVTSI N++D PSLEDVRSLLLAYE+RL+KQN V+QLN+AQAN  NLSL
Subjt:  IGELISYGDHIAHILDGLGSEYNAFVTSIQNQSDQPSLEDVRSLLLAYESRLEKQNLVNQLNVAQANFANLSL

XP_038887133.1 uncharacterized protein LOC120077323 [Benincasa hispida]; e-value 8.7e-25; identity 61.46%
Query:  RKDGLSVSQYLAQIKEVADKFSAIGELISYGDHIAHILDGLGSEYNAFVTSIQNQSDQPSLEDVRSLLLAYESRLEKQNLVNQLNVAQANFANLSL
        +KDGL+VSQYLAQIK+V D F+AIGE +SY DH+++IL+GLGSEYN FV+SI N++++PS+ DVR+LL+ Y+SRLEKQ   + L + QAN A+LS+
Subjt:  RKDGLSVSQYLAQIKEVADKFSAIGELISYGDHIAHILDGLGSEYNAFVTSIQNQSDQPSLEDVRSLLLAYESRLEKQNLVNQLNVAQANFANLSL

XP_038891713.1 uncharacterized protein LOC120081111 [Benincasa hispida]; e-value 8.4e-28; identity 70.53%
Query:  RKDGLSVSQYLAQIKEVADKFSAIGELISYGDHIAHILDGLGSEYNAFVTSIQNQSDQPSLEDVRSLLLAYESRLEKQNLVNQLNVAQANFANLS
        RKD LS+SQYL+QIK+VADKFS +GE ISY DH+ HILDGLGSEYNAFVTSIQN  D  S+EDV SLLL+YE++LEKQN ++ LN+AQA  + LS
Subjt:  RKDGLSVSQYLAQIKEVADKFSAIGELISYGDHIAHILDGLGSEYNAFVTSIQNQSDQPSLEDVRSLLLAYESRLEKQNLVNQLNVAQANFANLS

TrEMBL top hits (e-value, % identity, alignment)
A0A438FTV3 Uncharacterized protein; e-value 2.9e-18; identity 50.53%
Query:  RKDGLSVSQYLAQIKEVADKFSAIGELISYGDHIAHILDGLGSEYNAFVTSIQNQSDQPSLEDVRSLLLAYESRLEKQNLVNQLNVAQANFANLS
        +KDGLS+ +Y+ ++K + +  +AIGE +S  DH+ ++  GL  EYN FVTSIQN+SDQP++E + SLLL+Y+ RLE+QN V+ LN AQ + A+L+
Subjt:  RKDGLSVSQYLAQIKEVADKFSAIGELISYGDHIAHILDGLGSEYNAFVTSIQNQSDQPSLEDVRSLLLAYESRLEKQNLVNQLNVAQANFANLS

A0A5C7IHH0 Uncharacterized protein; e-value 2.0e-19; identity 58.75%
Query:  RKDGLSVSQYLAQIKEVADKFSAIGELISYGDHIAHILDGLGSEYNAFVTSIQNQSDQPSLEDVRSLLLAYESRLEKQNL
        +K+G +++QYL Q KE+ DKF+AIGE +SY DH+ ++L+GLG EY+AFVTSI+N+ D+PS+EDV SLLL++E RL K+ L
Subjt:  RKDGLSVSQYLAQIKEVADKFSAIGELISYGDHIAHILDGLGSEYNAFVTSIQNQSDQPSLEDVRSLLLAYESRLEKQNL

A0A6J1D6N7 uncharacterized protein LOC111017438; e-value 3.1e-28; identity 71.88%
Query:  RKDGLSVSQYLAQIKEVADKFSAIGELISYGDHIAHILDGLGSEYNAFVTSIQNQSDQPSLEDVRSLLLAYESRLEKQNLVNQLNVAQANFANLSL
        +KDGLSVSQYLA+IKE+  K S+IGE IS  DHI++I++GLG EYNAFVTSIQN+SD  +LEDVR+LLLAY+ RLEKQN V+QLNV QAN ANL L
Subjt:  RKDGLSVSQYLAQIKEVADKFSAIGELISYGDHIAHILDGLGSEYNAFVTSIQNQSDQPSLEDVRSLLLAYESRLEKQNLVNQLNVAQANFANLSL

A0A6J1DQX7 uncharacterized protein LOC111022315; e-value 1.7e-34; identity 50.29%
Query:  PSRFLDPQQLQPNPDFLVWE---------------------------------------------------------RKDGLSVSQYLAQIKEVADKFSA
        P +FLD  QLQPNP +  WE                                                         RKDG SVSQYLA+IKE+ADKF+A
Subjt:  PSRFLDPQQLQPNPDFLVWE---------------------------------------------------------RKDGLSVSQYLAQIKEVADKFSA

Query:  IGELISYGDHIAHILDGLGSEYNAFVTSIQNQSDQPSLEDVRSLLLAYESRLEKQNLVNQLNVAQANFANLSL
        +GE +SY DH+AH+LDGLGSEYNAFVTSI N++D PSLEDVRSLLLAYE+RL+KQN V+QLN+AQAN  NLSL
Subjt:  IGELISYGDHIAHILDGLGSEYNAFVTSIQNQSDQPSLEDVRSLLLAYESRLEKQNLVNQLNVAQANFANLSL

A5BPS3 Uncharacterized protein; e-value 2.8e-21; identity 44.53%
Query:  PAPSRFLDPQQLQPNPDFLVWER--------------------KDGLSVSQYLAQIKEVADKFSAIGELISYGDHIAHILDGLGSEYNAFVTSIQNQSDQ
        P P++FLD  QLQ NP F+ WER                    K+G+++S+YLA+IKEV DK+SA+GE +SY D + + L+GL  EY+ FVTSI N+SD+
Subjt:  PAPSRFLDPQQLQPNPDFLVWER--------------------KDGLSVSQYLAQIKEVADKFSAIGELISYGDHIAHILDGLGSEYNAFVTSIQNQSDQ

Query:  PSLEDVRSLLLAYESRLEKQNLVNQLNVAQANFANLS
         SL++V SLL  Y   LE++N   QL  +Q N    S
Subjt:  PSLEDVRSLLLAYESRLEKQNLVNQLNVAQANFANLS

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); e-value 2.5e-06; identity 31.18%
Query:  DGLSVSQYLAQIKEVADKFSAIGELISYGDHIAHILDGLGSEYNAFVTSIQNQSDQPSLEDVRSLLLAYESRLEKQNLVNQLNVAQANFANLS
        D LSV +Y  ++K ++D  + +   IS    + H+L+GL  +Y+  +  I+++S  PS  + RS+LL  ESRL  +   ++ +++  N  +LS
Subjt:  DGLSVSQYLAQIKEVADKFSAIGELISYGDHIAHILDGLGSEYNAFVTSIQNQSDQPSLEDVRSLLLAYESRLEKQNLVNQLNVAQANFANLS


Sequences
CDS sequence
ATGGTCCCTGCTCCATCAAGATTCTTGGATCCACAACAGTTGCAGCCTAATCCTGATTTTCTGGTATGGGAAAGGAAGGATGGATTATCTGTTAGTCAATATTTAGCTCA
AATTAAAGAAGTTGCTGATAAATTTTCTGCTATTGGTGAACTTATTTCTTATGGAGATCATATTGCCCATATTTTGGATGGTTTAGGAAGTGAATATAATGCATTTGTCA
CATCTATTCAAAATCAGTCTGACCAACCATCCCTTGAGGATGTTAGGAGCTTACTTTTAGCTTATGAAAGTCGTTTGGAGAAACAAAACTTAGTTAATCAGCTCAATGTT
GCTCAGGCTAATTTTGCAAATCTCTCTCTTTAA
mRNA sequence
ATGGTCCCTGCTCCATCAAGATTCTTGGATCCACAACAGTTGCAGCCTAATCCTGATTTTCTGGTATGGGAAAGGAAGGATGGATTATCTGTTAGTCAATATTTAGCTCA
AATTAAAGAAGTTGCTGATAAATTTTCTGCTATTGGTGAACTTATTTCTTATGGAGATCATATTGCCCATATTTTGGATGGTTTAGGAAGTGAATATAATGCATTTGTCA
CATCTATTCAAAATCAGTCTGACCAACCATCCCTTGAGGATGTTAGGAGCTTACTTTTAGCTTATGAAAGTCGTTTGGAGAAACAAAACTTAGTTAATCAGCTCAATGTT
GCTCAGGCTAATTTTGCAAATCTCTCTCTTTAA
Protein sequence
MVPAPSRFLDPQQLQPNPDFLVWERKDGLSVSQYLAQIKEVADKFSAIGELISYGDHIAHILDGLGSEYNAFVTSIQNQSDQPSLEDVRSLLLAYESRLEKQNLVNQLNV
AQANFANLSL
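
The CDS and protein sequences above can be checked against each other by translating the CDS in frame 1. The following is a minimal sketch, assuming the standard genetic code (the record does not say which translation table was used); the function name `translate` and the constants are illustrative, not part of the database.

```python
# Standard genetic code with codons enumerated in TCAG order.
# Assumption: the record does not state which translation table applies.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# CDS and protein copied from the record for ClCG06G010690.
CDS = (
    "ATGGTCCCTGCTCCATCAAGATTCTTGGATCCACAACAGTTGCAGCCTAATCCTGATTTTCTGGTATGGGAAAGGAAGGATGGATTATCTGTTAGTCAATATTTAGCTCA"
    "AATTAAAGAAGTTGCTGATAAATTTTCTGCTATTGGTGAACTTATTTCTTATGGAGATCATATTGCCCATATTTTGGATGGTTTAGGAAGTGAATATAATGCATTTGTCA"
    "CATCTATTCAAAATCAGTCTGACCAACCATCCCTTGAGGATGTTAGGAGCTTACTTTTAGCTTATGAAAGTCGTTTGGAGAAACAAAACTTAGTTAATCAGCTCAATGTT"
    "GCTCAGGCTAATTTTGCAAATCTCTCTCTTTAA"
)
PROTEIN = (
    "MVPAPSRFLDPQQLQPNPDFLVWERKDGLSVSQYLAQIKEVADKFSAIGELISYGDHIAHILDGLGSEYNAFVTSIQNQSDQPSLEDVRSLLLAYESRLEKQNLVNQLNV"
    "AQANFANLSL"
)


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    residues = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    protein = translate(CDS)
    print(len(CDS), len(protein), protein == PROTEIN)
```

The CDS is 363 nt (120 codons plus a TAA stop), which corresponds to the 120-residue protein listed in the record.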