; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG01G008460 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG01G008460
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCG_Chr01:10032438..10033333
RNA-Seq ExpressionClCG01G008460
SyntenyClCG01G008460
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026100.1 uncharacterized protein E6C27_scaffold19G00360 [Cucumis melo var. makuwa]2.7e-3438.75Show/hide
Query:  SSPTFSIPSLNQLLNQITSIKFDRGDFLLWKNLALPTID--------------TNAFAIGASSSQTVVSD--------IETSKIEEVLNPVYEVWVAVQF
        SS  FS P LNQ+LNQ+ ++K DR ++LLWK LALP +                + F + ASSS T V++          +S    ++N ++E WV    
Subjt:  SSPTFSIPSLNQLLNQITSIKFDRGDFLLWKNLALPTID--------------TNAFAIGASSSQTVVSD--------IETSKIEEVLNPVYEVWVAVQF

Query:  DDA----KSGNPSYGLQTI----------QRLTLFGLQSRAEEDYLRQIFQQTRKGSNRMSEYLRLMKLHFDNLGLVRSPVPTRAL---VLLKLDEDYNP
                S  P   +Q +               FG+QSRAEED+LRQ+ Q TRKG+ +M EYL +MK + DNLG V SPVP RAL   VLL LDE YN 
Subjt:  DDA----KSGNPSYGLQTI----------QRLTLFGLQSRAEEDYLRQIFQQTRKGSNRMSEYLRLMKLHFDNLGLVRSPVPTRAL---VLLKLDEDYNP

Query:  VVATLQGKEDVSWLEMQSELLSYEKRLDFQNLQRVGIVNNH---NPFVNL-----VNGRGNGSQNKLKPVIMTLGGHQFGNQ-FSGYRG
        V+  +QGK D+SWL+MQS+LL +EK L  QN Q+      +   +P +N+     +NG+ N S  K          + +  Q FSG RG
Subjt:  VVATLQGKEDVSWLEMQSELLSYEKRLDFQNLQRVGIVNNH---NPFVNL-----VNGRGNGSQNKLKPVIMTLGGHQFGNQ-FSGYRG

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia]1.3e-3941.87Show/hide
Query:  PTFNTSSPTFSIPSLNQLLNQITSIKFDRGDFLLWKNLALPTIDTNAF-------------AIGASSSQTVVSDIETSKIEEVLNPVYEVWVAV------
        P    S   F+ P LNQLLNQITSIK DRG+FLLW+NLALP + +                 +  + + T +    +S+    LNP YE W+ V      
Subjt:  PTFNTSSPTFSIPSLNQLLNQITSIKFDRGDFLLWKNLALPTIDTNAF-------------AIGASSSQTVVSDIETSKIEEVLNPVYEVWVAV------

Query:  ---QFDDAKSGNPSYGLQTIQRL-----TLFGLQSRAEEDYLRQIFQQTRKGSNRMSEYLRLMKLHFDNLGLVRSPVPTRAL---VLLKLDEDYNPVVAT
               A       G  T + L      LFG+QSRAE DYL+Q+FQQT KGS +M EYL+LMK H DNL L  S V  R L   VL  LDE+YNP+V  
Subjt:  ---QFDDAKSGNPSYGLQTIQRL-----TLFGLQSRAEEDYLRQIFQQTRKGSNRMSEYLRLMKLHFDNLGLVRSPVPTRAL---VLLKLDEDYNPVVAT

Query:  LQGKEDVSWLEMQSELLSYEKRLDFQNLQRVGIVNN--HNPFVNLVNGRG-------NGSQNKLKPVIMTLGGHQFGNQFSGYRGCGKG
        +QGK ++SW EM +ELL+YEKRL++QN  + GI  N    P VN V+GR        N   N         GG+Q G+   G R  G+G
Subjt:  LQGKEDVSWLEMQSELLSYEKRLDFQNLQRVGIVNN--HNPFVNLVNGRG-------NGSQNKLKPVIMTLGGHQFGNQFSGYRGCGKG

XP_038902487.1 uncharacterized protein LOC120089143 [Benincasa hispida]1.0e-2839.58Show/hide
Query:  TSIKFDRGDFLLWKNLALPTIDT--------------NAFAIGASSS-------------------------QTVVSDIETSKIEEVLNPVYEVWVAV--
        T+IK D+ ++LLW+NLALP + +                F++    S                         Q + +   +S + +V NP YE    V  
Subjt:  TSIKFDRGDFLLWKNLALPTIDT--------------NAFAIGASSS-------------------------QTVVSDIETSKIEEVLNPVYEVWVAV--

Query:  -------QFDDAKSGNPSYGLQTIQRL-----TLFGLQSRAEEDYLRQIFQQTRKGSNRMSEYLRLMKLHFDNLGLVRSPVPTRAL---VLLKLDEDYNP
                F  A+      G +  + L      LFGLQSRA EDYLRQ+FQQT KG+ +M EYLR+MK H DNLGL  SPVPTRAL   VLL LDE++NP
Subjt:  -------QFDDAKSGNPSYGLQTIQRL-----TLFGLQSRAEEDYLRQIFQQTRKGSNRMSEYLRLMKLHFDNLGLVRSPVPTRAL---VLLKLDEDYNP

Query:  VVATLQGKEDVSWLEMQSELLSYEKRLDFQNLQRVGIVNN
         VAT+QG+ ++SW  MQ+ELL++EKR    N QR G   N
Subjt:  VVATLQGKEDVSWLEMQSELLSYEKRLDFQNLQRVGIVNN

XP_038905161.1 uncharacterized protein LOC120091275 isoform X1 [Benincasa hispida]4.1e-3041.95Show/hide
Query:  ALPTIDTNAFAIGASSSQTVVSDIETSKIEEVLNPVYEVWVAVQFDDAKSG------NPSYGLQTI----------QRLTLFGLQSRAEEDYLRQIFQQT
        ++P   T A A   S   +  S   +S     +NP YE W+AV  D    G       P   +Q +              LFG+QSR EEDYLR +FQ T
Subjt:  ALPTIDTNAFAIGASSSQTVVSDIETSKIEEVLNPVYEVWVAVQFDDAKSG------NPSYGLQTI----------QRLTLFGLQSRAEEDYLRQIFQQT

Query:  RKGSNRMSEYLRLMKLHFDNLGLVRSPVPTRAL---VLLKLDEDYNPVVATLQGKEDVSWLEMQSELLSYEKRLDFQNLQR--VGIVNNHNPFVNLVNGR
        RKG+ +M EYL+ MK++ DNL    SP+P R L   VLL LDE+YN +VA +QG+ D+SWL+MQSELL YE+RL+ Q+ Q+  VG     N  VN+ N R
Subjt:  RKGSNRMSEYLRLMKLHFDNLGLVRSPVPTRAL---VLLKLDEDYNPVVATLQGKEDVSWLEMQSELLSYEKRLDFQNLQR--VGIVNNHNPFVNLVNGR

Query:  GNGSQNKLKPVIMTLGGHQFGNQFSGYRGCGKGRNN
             NK      ++GG Q G    G RG G+GRNN
Subjt:  GNGSQNKLKPVIMTLGGHQFGNQFSGYRGCGKGRNN

XP_038905164.1 uncharacterized protein LOC120091275 isoform X4 [Benincasa hispida]4.1e-3041.95Show/hide
Query:  ALPTIDTNAFAIGASSSQTVVSDIETSKIEEVLNPVYEVWVAVQFDDAKSG------NPSYGLQTI----------QRLTLFGLQSRAEEDYLRQIFQQT
        ++P   T A A   S   +  S   +S     +NP YE W+AV  D    G       P   +Q +              LFG+QSR EEDYLR +FQ T
Subjt:  ALPTIDTNAFAIGASSSQTVVSDIETSKIEEVLNPVYEVWVAVQFDDAKSG------NPSYGLQTI----------QRLTLFGLQSRAEEDYLRQIFQQT

Query:  RKGSNRMSEYLRLMKLHFDNLGLVRSPVPTRAL---VLLKLDEDYNPVVATLQGKEDVSWLEMQSELLSYEKRLDFQNLQR--VGIVNNHNPFVNLVNGR
        RKG+ +M EYL+ MK++ DNL    SP+P R L   VLL LDE+YN +VA +QG+ D+SWL+MQSELL YE+RL+ Q+ Q+  VG     N  VN+ N R
Subjt:  RKGSNRMSEYLRLMKLHFDNLGLVRSPVPTRAL---VLLKLDEDYNPVVATLQGKEDVSWLEMQSELLSYEKRLDFQNLQR--VGIVNNHNPFVNLVNGR

Query:  GNGSQNKLKPVIMTLGGHQFGNQFSGYRGCGKGRNN
             NK      ++GG Q G    G RG G+GRNN
Subjt:  GNGSQNKLKPVIMTLGGHQFGNQFSGYRGCGKGRNN

TrEMBL top hitse value%identityAlignment
A0A0A0LXB7 Uncharacterized protein3.5e-2747.93Show/hide
Query:  FGLQSRAEEDYLRQIFQQTRKGSNRMSEYLRLMKLHFDNLGLVRSPVPTRAL---VLLKLDEDYNPVVATLQGKEDVSWLEMQSELLSYEKRLDFQNLQR
        FG++SRAEED+LRQ FQ TRKG++ M +YLR+MK + DNLG   SP+P RAL   VLL LDE YNPV+  +QGK ++SWL+MQS+LL +EKRL  QN Q+
Subjt:  FGLQSRAEEDYLRQIFQQTRKGSNRMSEYLRLMKLHFDNLGLVRSPVPTRAL---VLLKLDEDYNPVVATLQGKEDVSWLEMQSELLSYEKRLDFQNLQR

Query:  -VGIVNNHNPFVNLVNGRGN----GSQNKLKPVIMTLGGHQF----GNQFSGY-------RGCGKGRNN
         +G +   N  +N+   R N    GS+N           HQF     N F G+       RGCG+GR +
Subjt:  -VGIVNNHNPFVNLVNGRGN----GSQNKLKPVIMTLGGHQF----GNQFSGY-------RGCGKGRNN

A0A5A7SIT7 Uncharacterized protein1.3e-3438.75Show/hide
Query:  SSPTFSIPSLNQLLNQITSIKFDRGDFLLWKNLALPTID--------------TNAFAIGASSSQTVVSD--------IETSKIEEVLNPVYEVWVAVQF
        SS  FS P LNQ+LNQ+ ++K DR ++LLWK LALP +                + F + ASSS T V++          +S    ++N ++E WV    
Subjt:  SSPTFSIPSLNQLLNQITSIKFDRGDFLLWKNLALPTID--------------TNAFAIGASSSQTVVSD--------IETSKIEEVLNPVYEVWVAVQF

Query:  DDA----KSGNPSYGLQTI----------QRLTLFGLQSRAEEDYLRQIFQQTRKGSNRMSEYLRLMKLHFDNLGLVRSPVPTRAL---VLLKLDEDYNP
                S  P   +Q +               FG+QSRAEED+LRQ+ Q TRKG+ +M EYL +MK + DNLG V SPVP RAL   VLL LDE YN 
Subjt:  DDA----KSGNPSYGLQTI----------QRLTLFGLQSRAEEDYLRQIFQQTRKGSNRMSEYLRLMKLHFDNLGLVRSPVPTRAL---VLLKLDEDYNP

Query:  VVATLQGKEDVSWLEMQSELLSYEKRLDFQNLQRVGIVNNH---NPFVNL-----VNGRGNGSQNKLKPVIMTLGGHQFGNQ-FSGYRG
        V+  +QGK D+SWL+MQS+LL +EK L  QN Q+      +   +P +N+     +NG+ N S  K          + +  Q FSG RG
Subjt:  VVATLQGKEDVSWLEMQSELLSYEKRLDFQNLQRVGIVNNH---NPFVNL-----VNGRGNGSQNKLKPVIMTLGGHQFGNQ-FSGYRG

A0A5D3C373 Retrovirus-related Pol polyprotein from transposon TNT 1-943.3e-2547.88Show/hide
Query:  LFGLQSRAEEDYLRQIFQQTRKGSNRMSEYLRLMKLHFDNLGLVRSPVPTRALV---LLKLDEDYNPVVATLQGKEDVSWLEMQSELLSYEKRLDFQNLQ
        LFG+QSRAEED+LRQ+FQ TRK      +YLR+MK + D LG   SPVP RA +   LL LDE YNPV+A +QGK ++SW++MQSELL++EKRL+ Q+ Q
Subjt:  LFGLQSRAEEDYLRQIFQQTRKGSNRMSEYLRLMKLHFDNLGLVRSPVPTRALV---LLKLDEDYNPVVATLQGKEDVSWLEMQSELLSYEKRLDFQNLQ

Query:  RVGIVNNHNPFVNLVNGRGNGSQNKLKPVIMTLGGHQF-GNQ----------FSGYRGCGKGRNN
        +    N  N   N+V    N +QN+          HQF GN           F+  RG GKGR N
Subjt:  RVGIVNNHNPFVNLVNGRGNGSQNKLKPVIMTLGGHQF-GNQ----------FSGYRGCGKGRNN

A0A6J1D5J0 uncharacterized protein LOC1110175018.6e-2646.28Show/hide
Query:  SSQTVVSDIETSKI--EEVLNPVYEVWVAVQFDDAKSG------NPSYGLQTI----------QRLTLFGLQSRAEEDYLRQIFQQTRKGSNRMSEYLRL
        +SQT +S   +S I  E  +NP+YE WV    D    G       P    Q +              LFG+QS+AEEDYLRQ+FQQTRKGS +M+++LR+
Subjt:  SSQTVVSDIETSKI--EEVLNPVYEVWVAVQFDDAKSG------NPSYGLQTI----------QRLTLFGLQSRAEEDYLRQIFQQTRKGSNRMSEYLRL

Query:  MKLHFDNLGLVRSPVPTRAL---VLLKLDEDYNPVVATLQGKEDVSWLEMQSELLSYEKRLDFQNLQRVGIVNNHNPFVNLVNGRGNG
        MK H DNLG   SPVPTR+L   VLL LDE+YNPVVAT+QGK  +SW EMQ+E  S       QN       N+  PF    N RG G
Subjt:  MKLHFDNLGLVRSPVPTRAL---VLLKLDEDYNPVVATLQGKEDVSWLEMQSELLSYEKRLDFQNLQRVGIVNNHNPFVNLVNGRGNG

A0A6J1DCW4 uncharacterized protein LOC1110195986.1e-4041.87Show/hide
Query:  PTFNTSSPTFSIPSLNQLLNQITSIKFDRGDFLLWKNLALPTIDTNAF-------------AIGASSSQTVVSDIETSKIEEVLNPVYEVWVAV------
        P    S   F+ P LNQLLNQITSIK DRG+FLLW+NLALP + +                 +  + + T +    +S+    LNP YE W+ V      
Subjt:  PTFNTSSPTFSIPSLNQLLNQITSIKFDRGDFLLWKNLALPTIDTNAF-------------AIGASSSQTVVSDIETSKIEEVLNPVYEVWVAV------

Query:  ---QFDDAKSGNPSYGLQTIQRL-----TLFGLQSRAEEDYLRQIFQQTRKGSNRMSEYLRLMKLHFDNLGLVRSPVPTRAL---VLLKLDEDYNPVVAT
               A       G  T + L      LFG+QSRAE DYL+Q+FQQT KGS +M EYL+LMK H DNL L  S V  R L   VL  LDE+YNP+V  
Subjt:  ---QFDDAKSGNPSYGLQTIQRL-----TLFGLQSRAEEDYLRQIFQQTRKGSNRMSEYLRLMKLHFDNLGLVRSPVPTRAL---VLLKLDEDYNPVVAT

Query:  LQGKEDVSWLEMQSELLSYEKRLDFQNLQRVGIVNN--HNPFVNLVNGRG-------NGSQNKLKPVIMTLGGHQFGNQFSGYRGCGKG
        +QGK ++SW EM +ELL+YEKRL++QN  + GI  N    P VN V+GR        N   N         GG+Q G+   G R  G+G
Subjt:  LQGKEDVSWLEMQSELLSYEKRLDFQNLQRVGIVNN--HNPFVNLVNGRG-------NGSQNKLKPVIMTLGGHQFGNQFSGYRGCGKG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CATAGTGTTATGGCAAATGCCTCTTCTTCGTCTGGGAATCAACTAGGAAATGGTGGAATGCCGACCTTCAACACTTCCTCTCCGACCTTTAGCATTCCTTCTCTCAATCA
ATTGCTGAATCAGATAACATCTATCAAGTTTGACCGTGGGGATTTCCTATTATGGAAAAATCTGGCTCTTCCCACCATAGACACCAATGCCTTTGCTATTGGAGCCTCAA
GCTCACAAACTGTGGTGAGCGATATCGAAACCTCTAAGATAGAGGAAGTTCTCAACCCTGTGTATGAAGTATGGGTGGCTGTACAATTCGATGACGCCAAAAGTGGCAAC
CCAAGTTATGGGTTGCAAACAATCCAAAGACTTACACTGTTTGGATTACAATCAAGAGCAGAAGAAGACTACCTTCGCCAAATTTTTCAGCAAACTCGAAAAGGTAGTAA
TCGTATGTCAGAATATTTGAGATTGATGAAGCTTCATTTTGACAATCTAGGTCTTGTCAGAAGCCCTGTTCCAACAAGAGCACTGGTCCTTCTCAAACTTGATGAAGACT
ACAACCCGGTTGTTGCAACCTTACAAGGAAAGGAGGATGTGAGCTGGCTGGAAATGCAATCAGAGCTTCTGTCCTATGAAAAGCGGTTAGATTTTCAAAACTTACAAAGA
GTTGGTATCGTCAATAACCATAATCCATTTGTTAATCTTGTCAATGGAAGAGGAAATGGGAGTCAAAATAAACTCAAGCCAGTAATTATGACATTAGGTGGTCATCAGTT
TGGAAATCAATTCTCTGGTTACCGAGGTTGTGGTAAAGGACGAAACAATTGA
mRNA sequenceShow/hide mRNA sequence
CATAGTGTTATGGCAAATGCCTCTTCTTCGTCTGGGAATCAACTAGGAAATGGTGGAATGCCGACCTTCAACACTTCCTCTCCGACCTTTAGCATTCCTTCTCTCAATCA
ATTGCTGAATCAGATAACATCTATCAAGTTTGACCGTGGGGATTTCCTATTATGGAAAAATCTGGCTCTTCCCACCATAGACACCAATGCCTTTGCTATTGGAGCCTCAA
GCTCACAAACTGTGGTGAGCGATATCGAAACCTCTAAGATAGAGGAAGTTCTCAACCCTGTGTATGAAGTATGGGTGGCTGTACAATTCGATGACGCCAAAAGTGGCAAC
CCAAGTTATGGGTTGCAAACAATCCAAAGACTTACACTGTTTGGATTACAATCAAGAGCAGAAGAAGACTACCTTCGCCAAATTTTTCAGCAAACTCGAAAAGGTAGTAA
TCGTATGTCAGAATATTTGAGATTGATGAAGCTTCATTTTGACAATCTAGGTCTTGTCAGAAGCCCTGTTCCAACAAGAGCACTGGTCCTTCTCAAACTTGATGAAGACT
ACAACCCGGTTGTTGCAACCTTACAAGGAAAGGAGGATGTGAGCTGGCTGGAAATGCAATCAGAGCTTCTGTCCTATGAAAAGCGGTTAGATTTTCAAAACTTACAAAGA
GTTGGTATCGTCAATAACCATAATCCATTTGTTAATCTTGTCAATGGAAGAGGAAATGGGAGTCAAAATAAACTCAAGCCAGTAATTATGACATTAGGTGGTCATCAGTT
TGGAAATCAATTCTCTGGTTACCGAGGTTGTGGTAAAGGACGAAACAATTGA
Protein sequenceShow/hide protein sequence
HSVMANASSSSGNQLGNGGMPTFNTSSPTFSIPSLNQLLNQITSIKFDRGDFLLWKNLALPTIDTNAFAIGASSSQTVVSDIETSKIEEVLNPVYEVWVAVQFDDAKSGN
PSYGLQTIQRLTLFGLQSRAEEDYLRQIFQQTRKGSNRMSEYLRLMKLHFDNLGLVRSPVPTRALVLLKLDEDYNPVVATLQGKEDVSWLEMQSELLSYEKRLDFQNLQR
VGIVNNHNPFVNLVNGRGNGSQNKLKPVIMTLGGHQFGNQFSGYRGCGKGRNN