; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG04G001030 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG04G001030
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionGag/pol protein
Genome locationCG_Chr04:3256272..3273477
RNA-Seq ExpressionClCG04G001030
SyntenyClCG04G001030
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK29682.1 gag/pol protein [Cucumis melo var. makuwa]3.0e-4861.73Show/hide
Query:  LGRPKSSGDNYGTWKSNLNTILVMDDLRFALTEECPPIPSSTTNQNVQDACDRWIRVNDKARVCILVGISDVLFKKHENMVAAKEIMDLLQAMLGQLSSS
        L   K +GDNY  WKSNLNTILV+DDLRF LTEEC   P+   N+  + A DRWI+ N+KARV IL  +SDVL KK+E++  AKEIMD L+ + GQ   S
Subjt:  LGRPKSSGDNYGTWKSNLNTILVMDDLRFALTEECPPIPSSTTNQNVQDACDRWIRVNDKARVCILVGISDVLFKKHENMVAAKEIMDLLQAMLGQLSSS

Query:  IRHDTIKYVYNSCMKEGTSVREHVLDMMVYFNIAEVNGVIIDEKSQETSSWRKLREGEIALK
        +RH+ IKY+Y   MKEGTSVREHVLDMM++FNIAE+NG  IDE +QE SSW++L EGEI LK
Subjt:  IRHDTIKYVYNSCMKEGTSVREHVLDMMVYFNIAEVNGVIIDEKSQETSSWRKLREGEIALK

XP_022157844.1 uncharacterized protein LOC111024457 [Momordica charantia]4.3e-4762.67Show/hide
Query:  LGRPKSSGDNYGTWKSNLNTILVMDDLRFALTEECPPIPSSTTNQNVQDACDRWIRVNDKARVCILVGISDVLFKKHENMVAAKEIMDLLQAMLGQLSSS
        L   K +G NY TWK+NLNTILV+DDLRF LTEECP  P++  N+NV++A DRW++ NDKARV IL  ++DVL KKHE ++ AKEIMD L+AM G+ SS+
Subjt:  LGRPKSSGDNYGTWKSNLNTILVMDDLRFALTEECPPIPSSTTNQNVQDACDRWIRVNDKARVCILVGISDVLFKKHENMVAAKEIMDLLQAMLGQLSSS

Query:  IRHDTIKYVYNSCMKEGTSVREHVLDMMVYFNIAEVNGVIIDEKSQETSS
        +RH+ +KYVYN  MKEGTSVREHVLDMMV+FN AEVNG  IDE +++ ++
Subjt:  IRHDTIKYVYNSCMKEGTSVREHVLDMMVYFNIAEVNGVIIDEKSQETSS

XP_038880476.1 uncharacterized protein LOC120072136 [Benincasa hispida]7.9e-4967.12Show/hide
Query:  LGRPKSSGDNYGTWKSNLNTILVMDDLRFALTEECPPIPSSTTNQNVQDACDRWIRVNDKARVCILVGISDVLFKKHENMVAAKEIMDLLQAMLGQLSSS
        L   K + DNYGTWKSNLNTILV+DDL+F LTEECPP+P+   N+ + DA DRW + N+KA+V IL  ISD+L KKHE MV AKEIMD LQA+ GQ SSS
Subjt:  LGRPKSSGDNYGTWKSNLNTILVMDDLRFALTEECPPIPSSTTNQNVQDACDRWIRVNDKARVCILVGISDVLFKKHENMVAAKEIMDLLQAMLGQLSSS

Query:  IRHDTIKYVYNSCMKEGTSVREHVLDMMVYFNIAEVNGVIIDEKSQ
          HD IKYVYN  MKEGT+VREHVLDMMV+FNI EVNG +++EK+Q
Subjt:  IRHDTIKYVYNSCMKEGTSVREHVLDMMVYFNIAEVNGVIIDEKSQ

XP_038885834.1 uncharacterized protein LOC120076130 [Benincasa hispida]1.5e-4767.61Show/hide
Query:  KSSGDNYGTWKSNLNTILVMDDLRFALTEECPPIPSSTTNQNVQDACDRWIRVNDKARVCILVGISDVLFKKHENMVAAKEIMDLLQAMLGQLSSSIRHD
        K  G+NY TWK+NLNTILV+DDL+F LTEECPPIPSS  N+ V+DA +RWIRVNDK    IL  ISDVL KKHE+M   K+IM+ L+ M GQ S S+RHD
Subjt:  KSSGDNYGTWKSNLNTILVMDDLRFALTEECPPIPSSTTNQNVQDACDRWIRVNDKARVCILVGISDVLFKKHENMVAAKEIMDLLQAMLGQLSSSIRHD

Query:  TIKYVYNSCMKEGTSVREHVLDMMVYFNIAEVNGVIIDEKSQ
        +IKY+YN  MKEG SVREHVL+MMV+FN+AEVN V++DEKSQ
Subjt:  TIKYVYNSCMKEGTSVREHVLDMMVYFNIAEVNGVIIDEKSQ

XP_038895830.1 uncharacterized protein LOC120083997 [Benincasa hispida]7.4e-4763.7Show/hide
Query:  LGRPKSSGDNYGTWKSNLNTILVMDDLRFALTEECPPIPSSTTNQNVQDACDRWIRVNDKARVCILVGISDVLFKKHENMVAAKEIMDLLQAMLGQLSSS
        L   K  GDNYGTWKSN+NTILV+DDLRF LTEECPP P    N+ V+DA DRW++ N+KARV IL  ISDVL KKHE +   +EIMD LQ + G+ S+S
Subjt:  LGRPKSSGDNYGTWKSNLNTILVMDDLRFALTEECPPIPSSTTNQNVQDACDRWIRVNDKARVCILVGISDVLFKKHENMVAAKEIMDLLQAMLGQLSSS

Query:  IRHDTIKYVYNSCMKEGTSVREHVLDMMVYFNIAEVNGVIIDEKSQ
          HDTIK+VYN  MKEGTS++EHVL+MMV F++AE+NGV+++EKSQ
Subjt:  IRHDTIKYVYNSCMKEGTSVREHVLDMMVYFNIAEVNGVIIDEKSQ

TrEMBL top hitse value%identityAlignment
A0A5D3E173 Gag/pol protein1.5e-4861.73Show/hide
Query:  LGRPKSSGDNYGTWKSNLNTILVMDDLRFALTEECPPIPSSTTNQNVQDACDRWIRVNDKARVCILVGISDVLFKKHENMVAAKEIMDLLQAMLGQLSSS
        L   K +GDNY  WKSNLNTILV+DDLRF LTEEC   P+   N+  + A DRWI+ N+KARV IL  +SDVL KK+E++  AKEIMD L+ + GQ   S
Subjt:  LGRPKSSGDNYGTWKSNLNTILVMDDLRFALTEECPPIPSSTTNQNVQDACDRWIRVNDKARVCILVGISDVLFKKHENMVAAKEIMDLLQAMLGQLSSS

Query:  IRHDTIKYVYNSCMKEGTSVREHVLDMMVYFNIAEVNGVIIDEKSQETSSWRKLREGEIALK
        +RH+ IKY+Y   MKEGTSVREHVLDMM++FNIAE+NG  IDE +QE SSW++L EGEI LK
Subjt:  IRHDTIKYVYNSCMKEGTSVREHVLDMMVYFNIAEVNGVIIDEKSQETSSWRKLREGEIALK

A0A5D3E3F1 Gag/pol protein6.1e-4758.64Show/hide
Query:  LGRPKSSGDNYGTWKSNLNTILVMDDLRFALTEECPPIPSSTTNQNVQDACDRWIRVNDKARVCILVGISDVLFKKHENMVAAKEIMDLLQAMLGQLSSS
        L   K +GDNY  WK  LNTILV+DDLRF LTEECP  P+S  N+  +   DRWI+ ++KA V IL  +SDVL KKHE++   K+I+D L+ M GQ   S
Subjt:  LGRPKSSGDNYGTWKSNLNTILVMDDLRFALTEECPPIPSSTTNQNVQDACDRWIRVNDKARVCILVGISDVLFKKHENMVAAKEIMDLLQAMLGQLSSS

Query:  IRHDTIKYVYNSCMKEGTSVREHVLDMMVYFNIAEVNGVIIDEKSQETSSWRKLREGEIALK
        +RH+TIKY+Y   MKE TS+REHVLDMM++ NIAEVNG  IDE +QE SSW+KL EGE+ LK
Subjt:  IRHDTIKYVYNSCMKEGTSVREHVLDMMVYFNIAEVNGVIIDEKSQETSSWRKLREGEIALK

A0A6J1DUZ9 uncharacterized protein LOC1110242948.0e-4762Show/hide
Query:  LGRPKSSGDNYGTWKSNLNTILVMDDLRFALTEECPPIPSSTTNQNVQDACDRWIRVNDKARVCILVGISDVLFKKHENMVAAKEIMDLLQAMLGQLSSS
        L   K +G NY TWK+NLNTILV+DDL+F LTEECP  P+   N+NV++A DRW++ NDKARV IL  ++DVL KKHE ++ AKEIMD L+AM G+ SS+
Subjt:  LGRPKSSGDNYGTWKSNLNTILVMDDLRFALTEECPPIPSSTTNQNVQDACDRWIRVNDKARVCILVGISDVLFKKHENMVAAKEIMDLLQAMLGQLSSS

Query:  IRHDTIKYVYNSCMKEGTSVREHVLDMMVYFNIAEVNGVIIDEKSQETSS
        +RH+ +KYVYN  MKEGTSVREHVLDMMV+FN AEVNG  IDE +++ ++
Subjt:  IRHDTIKYVYNSCMKEGTSVREHVLDMMVYFNIAEVNGVIIDEKSQETSS

A0A6J1DXQ5 uncharacterized protein LOC1110244572.1e-4762.67Show/hide
Query:  LGRPKSSGDNYGTWKSNLNTILVMDDLRFALTEECPPIPSSTTNQNVQDACDRWIRVNDKARVCILVGISDVLFKKHENMVAAKEIMDLLQAMLGQLSSS
        L   K +G NY TWK+NLNTILV+DDLRF LTEECP  P++  N+NV++A DRW++ NDKARV IL  ++DVL KKHE ++ AKEIMD L+AM G+ SS+
Subjt:  LGRPKSSGDNYGTWKSNLNTILVMDDLRFALTEECPPIPSSTTNQNVQDACDRWIRVNDKARVCILVGISDVLFKKHENMVAAKEIMDLLQAMLGQLSSS

Query:  IRHDTIKYVYNSCMKEGTSVREHVLDMMVYFNIAEVNGVIIDEKSQETSS
        +RH+ +KYVYN  MKEGTSVREHVLDMMV+FN AEVNG  IDE +++ ++
Subjt:  IRHDTIKYVYNSCMKEGTSVREHVLDMMVYFNIAEVNGVIIDEKSQETSS

A0A6J1E205 uncharacterized protein LOC1110252588.0e-4766.44Show/hide
Query:  LGRPKSSGDNYGTWKSNLNTILVMDDLRFALTEECPPIPSSTTNQNVQDACDRWIRVNDKARVCILVGISDVLFKKHENMVAAKEIMDLLQAMLGQLSSS
        L   K +GDNYG WKSNLNTILV+DDLRF LTEECPP  +  +NQ V+DA DRW + N+KARV IL  ISDVL KKHE +  A+EIMD LQA+ GQ S+S
Subjt:  LGRPKSSGDNYGTWKSNLNTILVMDDLRFALTEECPPIPSSTTNQNVQDACDRWIRVNDKARVCILVGISDVLFKKHENMVAAKEIMDLLQAMLGQLSSS

Query:  IRHDTIKYVYNSCMKEGTSVREHVLDMMVYFNIAEVNGVIIDEKSQ
        I HD IKYVYN  MKEG+SVREHVL+MMV+FN+AEVN  +++E SQ
Subjt:  IRHDTIKYVYNSCMKEGTSVREHVLDMMVYFNIAEVNGVIIDEKSQ

SwissProt top hitse value%identityAlignment
Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.6e-0441.67Show/hide
Query:  DHESLKYFFTQKKLNMRHRRWLELAKDYDCEILYHPSKVNVVVDALSR
        DH+ L +    +  N + +RW    ++Y+CE++Y P K NVV DALSR
Subjt:  DHESLKYFFTQKKLNMRHRRWLELAKDYDCEILYHPSKVNVVVDALSR

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGGCATCACAGTTGACACTCCAAAAGTAGAAGCAGTTTCTAATTGGCCAGATTTTCTTTACTACGCATCTTGTGATTATGTCGTATTTGGTATGGCCAATTCCTT
GGTTGCACGTATGGAGATTTCTCTTTCCCATGTTTTCCAAGGGCAAGTTTTAGTTGAAGTAGGTGATCTTTCAAGGACATGTAGTCTCAAAGGATTGGATTTTAGTAGAC
CTTGCAAAGATAAAGACCATGAGAGCTTGAAGTACTTTTTCACTCAAAAGAAGTTGAATATGAGACATCGTAGATGGCTAGAGTTAGCTAAAGATTACGATTGTGAGATT
TTGTACCACCCAAGCAAGGTGAATGTAGTGGTAGATGCTCTTAGTAGAAAGGCAGCTCATTCAACAACTCTTGTTACCAGGCAAACTCATTTATACGATGACTTTAACCG
TGCAGGAATTGCAGTGGCAGTAGGAGAAGTTTCTACACTACTGGCACAGTTGACAGTACAACCTACCTTAGGACAGAAAATCATTGATGCACAACAGGGTGGTCCTTACC
TTGTTGAGAAGTTTCACCAGGGTAAGAAGGGTAGAGAAAATACCTCGAGCCCATGTTCGAGTACCAAGCACTACGGTGCTGGAGTAACCCATGTCATACTCCCAGGAAAA
GCTCGAACGACCTTCAATAAAAGGGTACTTGTACCTAAAACTGACACAGGTGGGGTGCCTCCTCACAAAGAGGGTCGTAGTGACCAAGCTCAGGTGACTGTTTACAAAAA
CACAGGGGAGCCGGCGACCGAAGCCTCAATGAATGGCAGTTGTAACTATAATGGTCCTAAGGGTCCTCTTATCTCTTCCTTCTCCGAGCTCGGGTCATGGAAGAAGGAGC
CGAGAAGAAAGTATCAGCAACAACAGGGGATGGATTCTCACTCTCTGAGAGGTGACTCTGAAAATCCTCAGAATCTTCTCACAGAAGAGCTCGAAACGGAGGGTGGCTAT
GAAGCTGTCGCACAGGCTATTATTGCGAAAATCGGCGCACACAACCAACTATTGTTAAGAATTTCATCTTTGTACCAAAAACAGGCAAGGGGCGGAATACCACAAAGAGA
AAGGCGCCGTGGCAGCGCCGTCTCAGCGCCGCGACGCTGTCTTGCGTTCTTCAAGGATGCGTCGAGGCAGTGCCTTTCCAGCGCCGCGTTGCTACTCTGCGTTCATGCTG
ATGCGTTTCTGCTTCAGCGTCATGGCAGCGCTGCCTTAGCGCCGCGGCGCACCATGCGTTTCAAGCAAGGATGCGTTTTTCAGCACTTTTTCTCCATATTCTTGCTCGAA
ACACGCACGAGGAAGAAGAATTGGTCCAATTGGACTAGTGGCGATCTGGTTCGATGTTTTCCAGCCGATTTAGCCCTTATTTCAGCTTTGGAGGTTCGGTTTGGGGGGTT
TCGGCTTGGTTTGGGCCGGCCAAAGAGTTCGGGAGATAATTATGGTACATGGAAATCAAACTTGAACACTATACTAGTCATGGATGATTTACGGTTCGCTTTAACGGAGG
AGTGTCCTCCAATTCCTAGCTCAACTACAAATCAAAATGTTCAGGATGCATGTGATAGATGGATTAGGGTTAACGATAAAGCTCGAGTTTGCATCTTAGTAGGCATATCA
GATGTTTTGTTTAAGAAGCATGAGAACATGGTGGCCGCTAAGGAAATTATGGACTTGCTGCAGGCGATGTTGGGACAACTGTCCTCATCGATCAGGCACGATACTATTAA
ATATGTTTACAATTCCTGCATGAAAGAAGGAACCTCTGTTAGAGAACATGTTCTGGATATGATGGTCTACTTTAATATTGCGGAAGTAAACGGCGTTATCATAGACGAGA
AGAGTCAAGAAACTAGTTCTTGGAGAAAGCTACGAGAAGGTGAGATAGCTCTCAAGGGTAAAGGTTCTCACTCACTGGGAGACGACTCTGAAAATTCTCATAATCTTTCT
CTCAGAAGAGCTCGAAATGGAGGCTGGTTATGGCGTCTGTATCTACTTTTTCTCAATGCATGGTCTCAACGCATAGTCTCGACGCATGGGTTTCCACTTGTTCTCGCAGT
AGCTTTGCCTCTAGTACTTGGGTTATTTGTGCTTCTTTTCTTCTTAATCCTGCGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGGCATCACAGTTGACACTCCAAAAGTAGAAGCAGTTTCTAATTGGCCAGATTTTCTTTACTACGCATCTTGTGATTATGTCGTATTTGGTATGGCCAATTCCTT
GGTTGCACGTATGGAGATTTCTCTTTCCCATGTTTTCCAAGGGCAAGTTTTAGTTGAAGTAGGTGATCTTTCAAGGACATGTAGTCTCAAAGGATTGGATTTTAGTAGAC
CTTGCAAAGATAAAGACCATGAGAGCTTGAAGTACTTTTTCACTCAAAAGAAGTTGAATATGAGACATCGTAGATGGCTAGAGTTAGCTAAAGATTACGATTGTGAGATT
TTGTACCACCCAAGCAAGGTGAATGTAGTGGTAGATGCTCTTAGTAGAAAGGCAGCTCATTCAACAACTCTTGTTACCAGGCAAACTCATTTATACGATGACTTTAACCG
TGCAGGAATTGCAGTGGCAGTAGGAGAAGTTTCTACACTACTGGCACAGTTGACAGTACAACCTACCTTAGGACAGAAAATCATTGATGCACAACAGGGTGGTCCTTACC
TTGTTGAGAAGTTTCACCAGGGTAAGAAGGGTAGAGAAAATACCTCGAGCCCATGTTCGAGTACCAAGCACTACGGTGCTGGAGTAACCCATGTCATACTCCCAGGAAAA
GCTCGAACGACCTTCAATAAAAGGGTACTTGTACCTAAAACTGACACAGGTGGGGTGCCTCCTCACAAAGAGGGTCGTAGTGACCAAGCTCAGGTGACTGTTTACAAAAA
CACAGGGGAGCCGGCGACCGAAGCCTCAATGAATGGCAGTTGTAACTATAATGGTCCTAAGGGTCCTCTTATCTCTTCCTTCTCCGAGCTCGGGTCATGGAAGAAGGAGC
CGAGAAGAAAGTATCAGCAACAACAGGGGATGGATTCTCACTCTCTGAGAGGTGACTCTGAAAATCCTCAGAATCTTCTCACAGAAGAGCTCGAAACGGAGGGTGGCTAT
GAAGCTGTCGCACAGGCTATTATTGCGAAAATCGGCGCACACAACCAACTATTGTTAAGAATTTCATCTTTGTACCAAAAACAGGCAAGGGGCGGAATACCACAAAGAGA
AAGGCGCCGTGGCAGCGCCGTCTCAGCGCCGCGACGCTGTCTTGCGTTCTTCAAGGATGCGTCGAGGCAGTGCCTTTCCAGCGCCGCGTTGCTACTCTGCGTTCATGCTG
ATGCGTTTCTGCTTCAGCGTCATGGCAGCGCTGCCTTAGCGCCGCGGCGCACCATGCGTTTCAAGCAAGGATGCGTTTTTCAGCACTTTTTCTCCATATTCTTGCTCGAA
ACACGCACGAGGAAGAAGAATTGGTCCAATTGGACTAGTGGCGATCTGGTTCGATGTTTTCCAGCCGATTTAGCCCTTATTTCAGCTTTGGAGGTTCGGTTTGGGGGGTT
TCGGCTTGGTTTGGGCCGGCCAAAGAGTTCGGGAGATAATTATGGTACATGGAAATCAAACTTGAACACTATACTAGTCATGGATGATTTACGGTTCGCTTTAACGGAGG
AGTGTCCTCCAATTCCTAGCTCAACTACAAATCAAAATGTTCAGGATGCATGTGATAGATGGATTAGGGTTAACGATAAAGCTCGAGTTTGCATCTTAGTAGGCATATCA
GATGTTTTGTTTAAGAAGCATGAGAACATGGTGGCCGCTAAGGAAATTATGGACTTGCTGCAGGCGATGTTGGGACAACTGTCCTCATCGATCAGGCACGATACTATTAA
ATATGTTTACAATTCCTGCATGAAAGAAGGAACCTCTGTTAGAGAACATGTTCTGGATATGATGGTCTACTTTAATATTGCGGAAGTAAACGGCGTTATCATAGACGAGA
AGAGTCAAGAAACTAGTTCTTGGAGAAAGCTACGAGAAGGTGAGATAGCTCTCAAGGGTAAAGGTTCTCACTCACTGGGAGACGACTCTGAAAATTCTCATAATCTTTCT
CTCAGAAGAGCTCGAAATGGAGGCTGGTTATGGCGTCTGTATCTACTTTTTCTCAATGCATGGTCTCAACGCATAGTCTCGACGCATGGGTTTCCACTTGTTCTCGCAGT
AGCTTTGCCTCTAGTACTTGGGTTATTTGTGCTTCTTTTCTTCTTAATCCTGCGTTAA
Protein sequenceShow/hide protein sequence
MKGITVDTPKVEAVSNWPDFLYYASCDYVVFGMANSLVARMEISLSHVFQGQVLVEVGDLSRTCSLKGLDFSRPCKDKDHESLKYFFTQKKLNMRHRRWLELAKDYDCEI
LYHPSKVNVVVDALSRKAAHSTTLVTRQTHLYDDFNRAGIAVAVGEVSTLLAQLTVQPTLGQKIIDAQQGGPYLVEKFHQGKKGRENTSSPCSSTKHYGAGVTHVILPGK
ARTTFNKRVLVPKTDTGGVPPHKEGRSDQAQVTVYKNTGEPATEASMNGSCNYNGPKGPLISSFSELGSWKKEPRRKYQQQQGMDSHSLRGDSENPQNLLTEELETEGGY
EAVAQAIIAKIGAHNQLLLRISSLYQKQARGGIPQRERRRGSAVSAPRRCLAFFKDASRQCLSSAALLLCVHADAFLLQRHGSAALAPRRTMRFKQGCVFQHFFSIFLLE
TRTRKKNWSNWTSGDLVRCFPADLALISALEVRFGGFRLGLGRPKSSGDNYGTWKSNLNTILVMDDLRFALTEECPPIPSSTTNQNVQDACDRWIRVNDKARVCILVGIS
DVLFKKHENMVAAKEIMDLLQAMLGQLSSSIRHDTIKYVYNSCMKEGTSVREHVLDMMVYFNIAEVNGVIIDEKSQETSSWRKLREGEIALKGKGSHSLGDDSENSHNLS
LRRARNGGWLWRLYLLFLNAWSQRIVSTHGFPLVLAVALPLVLGLFVLLFFLILR