CuGenDBv2

ClCG10G009120 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG10G009120
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: CG_Chr10:19974504..19975660
RNA-Seq Expression: ClCG10G009120
Synteny: ClCG10G009120
Gene Ontology terms: NA
InterPro domains: NA
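
The genome location field packs the sequence name and both coordinates into one string. A minimal sketch of splitting it apart is given here; the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("CG_Chr10:19974504..19975660")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 1157 bp on CG_Chr10.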


Homology
GenBank top hits | e value | %identity | Alignment
KAA0026100.1 uncharacterized protein E6C27_scaffold19G00360 [Cucumis melo var. makuwa] | 1.0e-43 | 37.83
Query:  VSSATSTRSP---SSTNFSTFPLNQLLNQITS------NFMLRKTLVLPILRSYKLEGHLFGAKPCPAMFLQPTTTPFENVGEESNSAMMSKGASSSTTA
        +++A  T +P   SS  FS  PLNQ+LNQ+ +      N++L KTL LPIL+ YKLEGHL G  PCP+ F+   ++    V EE   A +  GASSS T 
Subjt:  VSSATSTRSP---SSTNFSTFPLNQLLNQITS------NFMLRKTLVLPILRSYKLEGHLFGAKPCPAMFLQPTTTPFENVGEESNSAMMSKGASSSTTA

Query:  NSINPPMDHSRST---LIGMVVQFHAIRGCSSAYGIQEFKGTMVGHQDLFGVQSRVEEGYLRQIFQQTCKGNSKMSEYLKLMKTHSDNLAQAGSPITTRA
          +N   +   +T   L+G +             G    +      QD FGVQSR EE +LRQ+ Q T KGN+KM EYL +MKT+ DNL Q GSP+  RA
Subjt:  NSINPPMDHSRST---LIGMVVQFHAIRGCSSAYGIQEFKGTMVGHQDLFGVQSRVEEGYLRQIFQQTCKGNSKMSEYLKLMKTHSDNLAQAGSPITTRA

Query:  LISQVLLGLDKEYNPIVVGIQGKPSISWLDTSIRTTI----------DNSSITKGAITTSTTI-------------EAEAEVMVIISQHVS---------
        LISQVLLGLD+ YN ++V IQGKP ISWLD   +  I                KG IT S  +              +  +      QH S         
Subjt:  LISQVLLGLDKEYNPIVVGIQGKPSISWLDTSIRTTI----------DNSSITKGAITTSTTI-------------EAEAEVMVIISQHVS---------

Query:  ----YVAKLG-TLLLFVIRFNKKYAP--APNHSKEVVTPQLGSNPWAFVATQSNNLFLATPEIVSDPNWNADNERTNH
               K G + L+   RFNK+++     + ++      +  NP  FV+TQ+   F ATP+ V DPNW  D+  TNH
Subjt:  ----YVAKLG-TLLLFVIRFNKKYAP--APNHSKEVVTPQLGSNPWAFVATQSNNLFLATPEIVSDPNWNADNERTNH

TYJ96311.1 uncharacterized protein E5676_scaffold1970G00140 [Cucumis melo var. makuwa] | 1.0e-27 | 43.63
Query:  VSSATSTRSP---SSTNFSTFPLNQLLNQITS------NFMLRKTLVLPILRSYKLEGHLFGAKPCPAMFLQPTTTPFENVGEESNSAMMSKGASSSTTA
        +++A  T +P   SS  FS  PLNQ+LNQ+ +      N++L KTL LPIL+ YKLEGHL G  PCP+ F+   ++    V EE   A +  GASSS T 
Subjt:  VSSATSTRSP---SSTNFSTFPLNQLLNQITS------NFMLRKTLVLPILRSYKLEGHLFGAKPCPAMFLQPTTTPFENVGEESNSAMMSKGASSSTTA

Query:  NSINPPMDHSRST---LIGMVVQFHAIRGCSSAYGIQEFKGTMVGHQDLFGVQSRVEEGYLRQIFQQTCKGNSKMSEYLKLMKTHSDNLAQAGSPITTRA
          +N   +   +T   L+G +             G    +      QD FGVQSR EE +LRQ+ Q T KGN+KM EYL +MKT+ DNL Q GSP+  RA
Subjt:  NSINPPMDHSRST---LIGMVVQFHAIRGCSSAYGIQEFKGTMVGHQDLFGVQSRVEEGYLRQIFQQTCKGNSKMSEYLKLMKTHSDNLAQAGSPITTRA

Query:  LISQ
        LISQ
Subjt:  LISQ

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia] | 4.1e-40 | 37.21
Query:  SSTNFSTFPLNQLLNQITS------NFMLRKTLVLPILRSYKLEGHLFGAKPCPAMFLQPTTTPFENVGEESNSAMMSKGASSSTTANSINPPMDH---S
        S   F++ PLNQLLNQITS      NF+L + L LPILRSYKL  +L G KPCP   L PT TP  N+          +G++SS ++ ++NP  +     
Subjt:  SSTNFSTFPLNQLLNQITS------NFMLRKTLVLPILRSYKLEGHLFGAKPCPAMFLQPTTTPFENVGEESNSAMMSKGASSSTTANSINPPMDH---S

Query:  RSTLIGMVVQFHAIRGCSSAYGIQEFKGTMVGHQDLFGVQSRVEEGYLRQIFQQTCKGNSKMSEYLKLMKTHSDNLAQAGSPITTRALISQVLLGLDKEY
           L+G +    A        G    +      Q+LFGVQSR E  YL+Q+FQQTCKG+ +M EYLKLMK+H+DNLA AGS ++ R L+SQVL GLD+EY
Subjt:  RSTLIGMVVQFHAIRGCSSAYGIQEFKGTMVGHQDLFGVQSRVEEGYLRQIFQQTCKGNSKMSEYLKLMKTHSDNLAQAGSPITTRALISQVLLGLDKEY

Query:  NPIVVGIQGKPSISWLD-----TSIRTTIDNSSITKGAITTSTT------------IEAEAEVMVIISQHVSYVAKLGTLLLFVI-RFNKKYAPAPNHSK
        NPIVV +QGK ++SW +      +    ++  +  K  I  + T             +         + H S   + G        + N+   P P   K
Subjt:  NPIVVGIQGKPSISWLD-----TSIRTTIDNSSITKGAITTSTT------------IEAEAEVMVIISQHVSYVAKLGTLLLFVI-RFNKKYAPAPNHSK

Query:  EVVTPQLGSNPWAFVATQSNNLFLATPEIVSDPNWNADNERTNH
               G N +A   T +    + TPE V DP+W AD+  T+H
Subjt:  EVVTPQLGSNPWAFVATQSNNLFLATPEIVSDPNWNADNERTNH

XP_038896600.1 uncharacterized protein LOC120084860 [Benincasa hispida] | 1.0e-27 | 78.57
Query:  QDLFGVQSRVEEGYLRQIFQQTCKGNSKMSEYLKLMKTHSDNLAQAGSPITTRALISQVLLGLDKEYNPIVVGIQGKPSISWLD
        Q+LFGVQSR EE YLRQ+FQQT KG  KMS YLKLMK HSDNLAQ  SP++TR LISQVLLGLD+EYN +VVGIQGKP ISWLD
Subjt:  QDLFGVQSRVEEGYLRQIFQQTCKGNSKMSEYLKLMKTHSDNLAQAGSPITTRALISQVLLGLDKEYNPIVVGIQGKPSISWLD
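
The %identity value of a hit can be cross-checked against its alignment. The sketch below assumes the unlabeled middle line follows the usual BLAST convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and that identity is taken over the full alignment length; the function name `percent_identity` is hypothetical. It uses the Query and middle lines of the XP_038896600.1 hit above.

```python
def percent_identity(query_line, match_line):
    """Percent of alignment columns whose match-line character is a letter."""
    identical = sum(1 for ch in match_line if ch.isalpha())
    # Assumption: the denominator is the alignment length, gaps included.
    return 100.0 * identical / len(query_line)


query = ("QDLFGVQSRVEEGYLRQIFQQTCKGNSKMSEYLKLMKTHSDNLAQAGSPITTRALISQVLL"
         "GLDKEYNPIVVGIQGKPSISWLD")
match = ("Q+LFGVQSR EE YLRQ+FQQT KG  KMS YLKLMK HSDNLAQ  SP++TR LISQVLL"
         "GLD+EYN +VVGIQGKP ISWLD")

print(round(percent_identity(query, match), 2))
```

Here 66 of the 84 alignment columns are identical, which agrees with the 78.57 listed for this hit.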

XP_038902487.1 uncharacterized protein LOC120089143 [Benincasa hispida] | 2.1e-36 | 45.54
Query:  NFMLRKTLVLPILRSYKLEGHLFGAKPCPAMFLQPTTTPFENV--GEESNSAMMSKGASS------STTANSINP-----PMDHSRST----LIGMVVQF
        N++L + L LPILRSY+LEGHL G  PCP  F   T      V  G+E+       G +S       TTA++ +P     P   SR+     L+G +  F
Subjt:  NFMLRKTLVLPILRSYKLEGHLFGAKPCPAMFLQPTTTPFENV--GEESNSAMMSKGASS------STTANSINP-----PMDHSRST----LIGMVVQF

Query:  HAIRGCSSAYGIQEFKGTMVGHQDLFGVQSRVEEGYLRQIFQQTCKGNSKMSEYLKLMKTHSDNLAQAGSPITTRALISQVLLGLDKEYNPIVVGIQGKP
                  G + +K      Q+LFG+QSR  E YLRQ+FQQTCKG  KM EYL++MKTHSDNL   GSP+ TRAL+SQVLLGLD+E+NP V  IQG+ 
Subjt:  HAIRGCSSAYGIQEFKGTMVGHQDLFGVQSRVEEGYLRQIFQQTCKGNSKMSEYLKLMKTHSDNLAQAGSPITTRALISQVLLGLDKEYNPIVVGIQGKP

Query:  SISWLDTSIRTTI
         ISW  T+++T +
Subjt:  SISWLDTSIRTTI

TrEMBL top hits | e value | %identity | Alignment
A0A5A7SIT7 Uncharacterized protein | 5.0e-44 | 37.83
Query:  VSSATSTRSP---SSTNFSTFPLNQLLNQITS------NFMLRKTLVLPILRSYKLEGHLFGAKPCPAMFLQPTTTPFENVGEESNSAMMSKGASSSTTA
        +++A  T +P   SS  FS  PLNQ+LNQ+ +      N++L KTL LPIL+ YKLEGHL G  PCP+ F+   ++    V EE   A +  GASSS T 
Subjt:  VSSATSTRSP---SSTNFSTFPLNQLLNQITS------NFMLRKTLVLPILRSYKLEGHLFGAKPCPAMFLQPTTTPFENVGEESNSAMMSKGASSSTTA

Query:  NSINPPMDHSRST---LIGMVVQFHAIRGCSSAYGIQEFKGTMVGHQDLFGVQSRVEEGYLRQIFQQTCKGNSKMSEYLKLMKTHSDNLAQAGSPITTRA
          +N   +   +T   L+G +             G    +      QD FGVQSR EE +LRQ+ Q T KGN+KM EYL +MKT+ DNL Q GSP+  RA
Subjt:  NSINPPMDHSRST---LIGMVVQFHAIRGCSSAYGIQEFKGTMVGHQDLFGVQSRVEEGYLRQIFQQTCKGNSKMSEYLKLMKTHSDNLAQAGSPITTRA

Query:  LISQVLLGLDKEYNPIVVGIQGKPSISWLDTSIRTTI----------DNSSITKGAITTSTTI-------------EAEAEVMVIISQHVS---------
        LISQVLLGLD+ YN ++V IQGKP ISWLD   +  I                KG IT S  +              +  +      QH S         
Subjt:  LISQVLLGLDKEYNPIVVGIQGKPSISWLDTSIRTTI----------DNSSITKGAITTSTTI-------------EAEAEVMVIISQHVS---------

Query:  ----YVAKLG-TLLLFVIRFNKKYAP--APNHSKEVVTPQLGSNPWAFVATQSNNLFLATPEIVSDPNWNADNERTNH
               K G + L+   RFNK+++     + ++      +  NP  FV+TQ+   F ATP+ V DPNW  D+  TNH
Subjt:  ----YVAKLG-TLLLFVIRFNKKYAP--APNHSKEVVTPQLGSNPWAFVATQSNNLFLATPEIVSDPNWNADNERTNH

A0A5D3BCH9 Uncharacterized protein | 5.0e-28 | 43.63
Query:  VSSATSTRSP---SSTNFSTFPLNQLLNQITS------NFMLRKTLVLPILRSYKLEGHLFGAKPCPAMFLQPTTTPFENVGEESNSAMMSKGASSSTTA
        +++A  T +P   SS  FS  PLNQ+LNQ+ +      N++L KTL LPIL+ YKLEGHL G  PCP+ F+   ++    V EE   A +  GASSS T 
Subjt:  VSSATSTRSP---SSTNFSTFPLNQLLNQITS------NFMLRKTLVLPILRSYKLEGHLFGAKPCPAMFLQPTTTPFENVGEESNSAMMSKGASSSTTA

Query:  NSINPPMDHSRST---LIGMVVQFHAIRGCSSAYGIQEFKGTMVGHQDLFGVQSRVEEGYLRQIFQQTCKGNSKMSEYLKLMKTHSDNLAQAGSPITTRA
          +N   +   +T   L+G +             G    +      QD FGVQSR EE +LRQ+ Q T KGN+KM EYL +MKT+ DNL Q GSP+  RA
Subjt:  NSINPPMDHSRST---LIGMVVQFHAIRGCSSAYGIQEFKGTMVGHQDLFGVQSRVEEGYLRQIFQQTCKGNSKMSEYLKLMKTHSDNLAQAGSPITTRA

Query:  LISQ
        LISQ
Subjt:  LISQ

A0A5D3E3L7 Uncharacterized protein | 9.5e-27 | 35.47
Query:  PILRSYKLEGHLFGAKPCPAMFLQPTT-TPFENVGEESNSAMMSKG--ASSSTTANS--INPPMDH---SRSTLIGMVVQFHAIRGCSSAYGIQEFKGTM
        P     ++ G   G    P  FL        E VG E+ S+++S+G  ASSST+ NS  +NP  +    S   L+G++             G    K   
Subjt:  PILRSYKLEGHLFGAKPCPAMFLQPTT-TPFENVGEESNSAMMSKG--ASSSTTANS--INPPMDH---SRSTLIGMVVQFHAIRGCSSAYGIQEFKGTM

Query:  VGHQDLFGVQSRVEEGYLRQIFQQTCKGNSKMSEYLKLMKTHSDNLAQAGSPITTRALISQVLLGLDKEYNPIVVGIQGKPSISWLDTSIRTTIDNSSIT
           Q+LFG++SR EE +LR  FQ T +GN KM +YL++MK ++DNL QAGSP+  R LISQVLLGLD+ YNP+   IQGKP ISWLD      I  +   
Subjt:  VGHQDLFGVQSRVEEGYLRQIFQQTCKGNSKMSEYLKLMKTHSDNLAQAGSPITTRALISQVLLGLDKEYNPIVVGIQGKPSISWLDTSIRTTIDNSSIT

Query:  KGAITTSTTIEAEAEVMVIISQHVSYVAKLGTLLLFVIRFNKKYAPAPNHSKEVVTPQLGSNPWAFVATQSNNLFLATPEIVSDPNWNADNERTNH
           +     I+ E+E +++ +  V                N+ + P  N  K++          AF+ TQ ++  LATPE V D N   D+  TNH
Subjt:  KGAITTSTTIEAEAEVMVIISQHVSYVAKLGTLLLFVIRFNKKYAPAPNHSKEVVTPQLGSNPWAFVATQSNNLFLATPEIVSDPNWNADNERTNH

A0A6J1D5J0 uncharacterized protein LOC111017501 | 1.8e-25 | 47.26
Query:  EESNSAMMSKGASSSTTANSINPPMDHSRST---LIGMVVQFHAIRGCSSAYGIQEFKGTMVGHQDLFGVQSRVEEGYLRQIFQQTCKGNSKMSEYLKLM
        E S + + +  +SS  T  +INP  +   +T   L+G +         +   G +         Q+LFGVQS+ EE YLRQ+FQQT KG+ KM+++L++M
Subjt:  EESNSAMMSKGASSSTTANSINPPMDHSRST---LIGMVVQFHAIRGCSSAYGIQEFKGTMVGHQDLFGVQSRVEEGYLRQIFQQTCKGNSKMSEYLKLM

Query:  KTHSDNLAQAGSPITTRALISQVLLGLDKEYNPIVVGIQGKPSISW
        K+H+DNL QAGSP+ TR+LISQVLLGLD+EYNP+V  IQGK  ISW
Subjt:  KTHSDNLAQAGSPITTRALISQVLLGLDKEYNPIVVGIQGKPSISW

A0A6J1DCW4 uncharacterized protein LOC111019598 | 2.0e-40 | 37.21
Query:  SSTNFSTFPLNQLLNQITS------NFMLRKTLVLPILRSYKLEGHLFGAKPCPAMFLQPTTTPFENVGEESNSAMMSKGASSSTTANSINPPMDH---S
        S   F++ PLNQLLNQITS      NF+L + L LPILRSYKL  +L G KPCP   L PT TP  N+          +G++SS ++ ++NP  +     
Subjt:  SSTNFSTFPLNQLLNQITS------NFMLRKTLVLPILRSYKLEGHLFGAKPCPAMFLQPTTTPFENVGEESNSAMMSKGASSSTTANSINPPMDH---S

Query:  RSTLIGMVVQFHAIRGCSSAYGIQEFKGTMVGHQDLFGVQSRVEEGYLRQIFQQTCKGNSKMSEYLKLMKTHSDNLAQAGSPITTRALISQVLLGLDKEY
           L+G +    A        G    +      Q+LFGVQSR E  YL+Q+FQQTCKG+ +M EYLKLMK+H+DNLA AGS ++ R L+SQVL GLD+EY
Subjt:  RSTLIGMVVQFHAIRGCSSAYGIQEFKGTMVGHQDLFGVQSRVEEGYLRQIFQQTCKGNSKMSEYLKLMKTHSDNLAQAGSPITTRALISQVLLGLDKEY

Query:  NPIVVGIQGKPSISWLD-----TSIRTTIDNSSITKGAITTSTT------------IEAEAEVMVIISQHVSYVAKLGTLLLFVI-RFNKKYAPAPNHSK
        NPIVV +QGK ++SW +      +    ++  +  K  I  + T             +         + H S   + G        + N+   P P   K
Subjt:  NPIVVGIQGKPSISWLD-----TSIRTTIDNSSITKGAITTSTT------------IEAEAEVMVIISQHVSYVAKLGTLLLFVI-RFNKKYAPAPNHSK

Query:  EVVTPQLGSNPWAFVATQSNNLFLATPEIVSDPNWNADNERTNH
               G N +A   T +    + TPE V DP+W AD+  T+H
Subjt:  EVVTPQLGSNPWAFVATQSNNLFLATPEIVSDPNWNADNERTNH

SwissProt top hits | e value | %identity | Alignment
No hits found

Arabidopsis top hits | e value | %identity | Alignment
No hits found

Sequences
CDS sequence
ATGACCAATTTCGTTTCGTCTGCAACCTCCACTAGAAGTCCTAGTTCCACAAACTTCAGCACTTTTCCCCTCAATCAGCTCCTTAATCAAATCACGAGTAATTTT
ATGCTTCGAAAAACCCTAGTGTTACCCATTCTTCGTAGCTACAAATTGGAAGGGCACTTGTTTGGAGCCAAACCTTGTCCTGCAATGTTCCTTCAACCCACCACC
ACACCATTTGAAAATGTGGGTGAAGAGTCAAATTCAGCCATGATGTCTAAAGGAGCTTCAAGTTCTACTACTGCCAATTCCATCAATCCACCCATGGATCACAGT
AGATCAACTCTTATTGGGATGGTTGTACAATTCCATGCCATCAGAGGTTGTAGTTCAGCTTATGGGATTCAAGAATTCAAAGGAACTATGGTAGGCCATCAAGAT
CTGTTTGGCGTGCAATCAAGGGTTGAAGAGGGTTACCTTCGACAAATATTCCAACAAACTTGCAAAGGTAATTCGAAAATGAGTGAATATTTGAAACTAATGAAA
ACTCACTCTGATAATCTAGCACAGGCTGGGAGCCCAATTACCACTCGAGCTCTAATCTCACAAGTCCTCTTGGGGTTAGATAAGGAGTACAACCCGATTGTAGTC
GGGATTCAAGGTAAACCAAGCATATCATGGTTGGATACCTCAATCAGAACAACAATTGACAACAGTTCAATAACCAAAGGGGCAATAACAACTTCAACAACAATA
GAGGCTGAGGCAGAAGTAATGGTAATAATAAGCCAACATGTCAGCTATGTGGCAAAGTTGGGCACTCTACTATTGTTTGTTATCAGGTTTAACAAGAAATATGCT
CCTGCTCCAAATCACTCAAAGGAAGTTGTCACACCTCAGTTAGGAAGCAATCCATGGGCGTTTGTTGCTACTCAAAGCAATAATCTATTTCTTGCCACACCTGAA
ATAGTCTCTGATCCTAACTGGAATGCTGACAACGAAAGAACTAATCAC
mRNA sequence
ATGACCAATTTCGTTTCGTCTGCAACCTCCACTAGAAGTCCTAGTTCCACAAACTTCAGCACTTTTCCCCTCAATCAGCTCCTTAATCAAATCACGAGTAATTTT
ATGCTTCGAAAAACCCTAGTGTTACCCATTCTTCGTAGCTACAAATTGGAAGGGCACTTGTTTGGAGCCAAACCTTGTCCTGCAATGTTCCTTCAACCCACCACC
ACACCATTTGAAAATGTGGGTGAAGAGTCAAATTCAGCCATGATGTCTAAAGGAGCTTCAAGTTCTACTACTGCCAATTCCATCAATCCACCCATGGATCACAGT
AGATCAACTCTTATTGGGATGGTTGTACAATTCCATGCCATCAGAGGTTGTAGTTCAGCTTATGGGATTCAAGAATTCAAAGGAACTATGGTAGGCCATCAAGAT
CTGTTTGGCGTGCAATCAAGGGTTGAAGAGGGTTACCTTCGACAAATATTCCAACAAACTTGCAAAGGTAATTCGAAAATGAGTGAATATTTGAAACTAATGAAA
ACTCACTCTGATAATCTAGCACAGGCTGGGAGCCCAATTACCACTCGAGCTCTAATCTCACAAGTCCTCTTGGGGTTAGATAAGGAGTACAACCCGATTGTAGTC
GGGATTCAAGGTAAACCAAGCATATCATGGTTGGATACCTCAATCAGAACAACAATTGACAACAGTTCAATAACCAAAGGGGCAATAACAACTTCAACAACAATA
GAGGCTGAGGCAGAAGTAATGGTAATAATAAGCCAACATGTCAGCTATGTGGCAAAGTTGGGCACTCTACTATTGTTTGTTATCAGGTTTAACAAGAAATATGCT
CCTGCTCCAAATCACTCAAAGGAAGTTGTCACACCTCAGTTAGGAAGCAATCCATGGGCGTTTGTTGCTACTCAAAGCAATAATCTATTTCTTGCCACACCTGAA
ATAGTCTCTGATCCTAACTGGAATGCTGACAACGAAAGAACTAATCAC
Protein sequence
MTNFVSSATSTRSPSSTNFSTFPLNQLLNQITSNFMLRKTLVLPILRSYKLEGHLFGAKPCPAMFLQPTTTPFENVGEESNSAMMSKGASSSTTANSINPPMDHS
RSTLIGMVVQFHAIRGCSSAYGIQEFKGTMVGHQDLFGVQSRVEEGYLRQIFQQTCKGNSKMSEYLKLMKTHSDNLAQAGSPITTRALISQVLLGLDKEYNPIVV
GIQGKPSISWLDTSIRTTIDNSSITKGAITTSTTIEAEAEVMVIISQHVSYVAKLGTLLLFVIRFNKKYAPAPNHSKEVVTPQLGSNPWAFVATQSNNLFLATPE
IVSDPNWNADNERTNH
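
The protein sequence can be checked against the CDS by translating it codon by codon. This is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; the names `CODON_TABLE` and `translate` are hypothetical, and only the first line of the CDS is used as sample input.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First line of the CDS sequence listed above (105 nt, 35 codons).
first_line = (
    "ATGACCAATTTCGTTTCGTCTGCAACCTCCACTAGAAGTCCTAGTTCCACAAACTTCAGC"
    "ACTTTTCCCCTCAATCAGCTCCTTAATCAAATCACGAGTAATTTT"
)
print(translate(first_line))
```

The result, MTNFVSSATSTRSPSSTNFSTFPLNQLLNQITSNF, matches the first 35 residues of the protein sequence. Passing the full CDS (with its line breaks) to `translate` would extend the same check to the whole record.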