; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG11G005155 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG11G005155
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionCCHC-type domain-containing protein
Genome locationCG_Chr11:5562058..5562705
RNA-Seq ExpressionClCG11G005155
SyntenyClCG11G005155
Gene Ontology termsGO:0006468 - protein phosphorylation (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004497 - monooxygenase activity (molecular function)
GO:0004672 - protein kinase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
GO:0020037 - heme binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BAD34493.1 Gag-Pol [Ipomoea batatas]5.8e-8073.49Show/hide
Query:  MAEKFEIEKFNITNFSLWKLKMNAFLRKDNCLAAIDGRPTKITDDSEWNKMEGNAIANTHLALADNVLSSIEEKKTVKEIWDHFTKLYEAKSLHNKIFLK
        MA KFEIEKFN  NFSLWKLK+ A LRKDNCLAAI  RP   TDD +W++M  +A+A+ +L++AD VLSSIEEKKT  EIWDH  +LYEAKSLHNKIFLK
Subjt:  MAEKFEIEKFNITNFSLWKLKMNAFLRKDNCLAAIDGRPTKITDDSEWNKMEGNAIANTHLALADNVLSSIEEKKTVKEIWDHFTKLYEAKSLHNKIFLK

Query:  RKLYTLRMSESTSMTEHMNIMNSLFSQITSLGHKIKSNEHVELLLQSLPDSYDQLVINLKNNVLTDYLNFDDIAAAVLEEENRRKNKEDKLVSSQQAEAL
        RKLYTLRMSESTS+TEH+N +N+LFSQ+TSL  KI+  E  ELLLQSLPDSYDQL+INL NN+LTDYL FDD+AAAVLEEE+RRKNKED+ V+ QQAEAL
Subjt:  RKLYTLRMSESTSMTEHMNIMNSLFSQITSLGHKIKSNEHVELLLQSLPDSYDQLVINLKNNVLTDYLNFDDIAAAVLEEENRRKNKEDKLVSSQQAEAL

Query:  TVTRGRSAERGSSEG
        TV RGRS ERG S G
Subjt:  TVTRGRSAERGSSEG

KAA0026163.1 Gag-Pol [Cucumis melo var. makuwa]3.3e-8377.51Show/hide
Query:  MAEKFEIEKFNITNFSLWKLKMNAFLRKDNCLAAIDGRPTKITDDSEWNKMEGNAIANTHLALADNVLSSIEEKKTVKEIWDHFTKLYEAKSLHNKIFLK
        MA KFEIEKFN TNFSLW LKM   LR DNCL AID  P +ITDD++WN+M+GNA+ N HLALADNVLSSI+EKK  KEIWDH TKLYE KSLHNKIFLK
Subjt:  MAEKFEIEKFNITNFSLWKLKMNAFLRKDNCLAAIDGRPTKITDDSEWNKMEGNAIANTHLALADNVLSSIEEKKTVKEIWDHFTKLYEAKSLHNKIFLK

Query:  RKLYTLRMSESTSMTEHMNIMNSLFSQITSLGHKIKSNEHVELLLQSLPDSYDQLVINLKNNVLTDYLNFDDIAAAVLEEENRRKNKEDKLVSSQQAEAL
        RKLYTLRMSESTSMTEHMN +N+LFSQ+  LG+KI+ NE  ELLLQSL DSYDQLVINLKNN+L DYL+FDD+ +AVLEEENRRKNKEDKL++ QQAEAL
Subjt:  RKLYTLRMSESTSMTEHMNIMNSLFSQITSLGHKIKSNEHVELLLQSLPDSYDQLVINLKNNVLTDYLNFDDIAAAVLEEENRRKNKEDKLVSSQQAEAL

Query:  TVTRGRSAE
        TVTRGR  E
Subjt:  TVTRGRSAE

KAA0044949.1 hypothetical protein E6C27_scaffold74G002510 [Cucumis melo var. makuwa]6.0e-8578.87Show/hide
Query:  MAEKFEIEKFNITNFSLWKLKMNAFLRKDNCLAAIDGRPTKITDDSEWNKMEGNAIANTHLALADNVLSSIEEKKTVKEIWDHFTKLYEAKSLHNKIFLK
        M  KF+IEKFN TNFSLWKLKM A  RKDNCL AID RP +ITDD++WN+M+GNA+AN HLAL DNVLSSIEEKK  KEIWDH  KLYEAKSLHNKIFLK
Subjt:  MAEKFEIEKFNITNFSLWKLKMNAFLRKDNCLAAIDGRPTKITDDSEWNKMEGNAIANTHLALADNVLSSIEEKKTVKEIWDHFTKLYEAKSLHNKIFLK

Query:  RKLYTLRMSESTSMTEHMNIMNSLFSQITSLGHKIKSNEHVELLLQSLPDSYDQLVINLKNNVLTDYLNFDDIAAAVLEEENRRKNKEDKLVSSQQAEAL
        RKLYTLRMS ST MTEHMN +N+LFSQ+T LG+KI+ N+H ELLLQSLPDSYDQLVINL NN+LTDYL+FDD+AA VLEEENR KNKEDKLVSSQQAEAL
Subjt:  RKLYTLRMSESTSMTEHMNIMNSLFSQITSLGHKIKSNEHVELLLQSLPDSYDQLVINLKNNVLTDYLNFDDIAAAVLEEENRRKNKEDKLVSSQQAEAL

Query:  TVTRGRSAERGSS
        TVTR R  E  SS
Subjt:  TVTRGRSAERGSS

KAA0061179.1 Gag-Pol [Cucumis melo var. makuwa]2.0e-8880.75Show/hide
Query:  MAEKFEIEKFNITNFSLWKLKMNAFLRKDNCLAAIDGRPTKITDDSEWNKMEGNAIANTHLALADNVLSSIEEKKTVKEIWDHFTKLYEAKSLHNKIFLK
        MA  FEIEKFN TNFSLWKLKM   LRKDNCL  +D RP +I DDS+WN+M+GNA AN HLALADNVLSSIEEKKT KEIWDH TKLY+AKSLHNKIFLK
Subjt:  MAEKFEIEKFNITNFSLWKLKMNAFLRKDNCLAAIDGRPTKITDDSEWNKMEGNAIANTHLALADNVLSSIEEKKTVKEIWDHFTKLYEAKSLHNKIFLK

Query:  RKLYTLRMSESTSMTEHMNIMNSLFSQITSLGHKIKSNEHVELLLQSLPDSYDQLVINLKNNVLTDYLNFDDIAAAVLEEENRRKNKEDKLVSSQQAEAL
        RKLYTLRMSESTSMTEHMNI+N+LFSQ+T L +KI+ NE  ELLLQSLPDSYDQLVINL NN+LTDYL+FDD+ +AVLEEENRRKNKEDKLVSSQQAEAL
Subjt:  RKLYTLRMSESTSMTEHMNIMNSLFSQITSLGHKIKSNEHVELLLQSLPDSYDQLVINLKNNVLTDYLNFDDIAAAVLEEENRRKNKEDKLVSSQQAEAL

Query:  TVTRGRSAERGSS
         VTRGRS+E  SS
Subjt:  TVTRGRSAERGSS

TYK16527.1 hypothetical protein E5676_scaffold21G003420 [Cucumis melo var. makuwa]3.3e-8377.93Show/hide
Query:  MAEKFEIEKFNITNFSLWKLKMNAFLRKDNCLAAIDGRPTKITDDSEWNKMEGNAIANTHLALADNVLSSIEEKKTVKEIWDHFTKLYEAKSLHNKIFLK
        M  KF+IEKFN TNFSLWKLKM A  RKDNCL AID RP +ITDD++WN+M+GNA+AN HLALADNVLSSIEEKK  KEIWDH  KLYEAKSLHNKIFLK
Subjt:  MAEKFEIEKFNITNFSLWKLKMNAFLRKDNCLAAIDGRPTKITDDSEWNKMEGNAIANTHLALADNVLSSIEEKKTVKEIWDHFTKLYEAKSLHNKIFLK

Query:  RKLYTLRMSESTSMTEHMNIMNSLFSQITSLGHKIKSNEHVELLLQSLPDSYDQLVINLKNNVLTDYLNFDDIAAAVLEEENRRKNKEDKLVSSQQAEAL
        RKLYTLRMS ST MTEHMN +N+LFSQ+T LG+KI+ N+  ELLLQSLPDSYDQLVINL NN+LTDYL+FDD+ A VLEEENR KNKEDKLVSSQQAE L
Subjt:  RKLYTLRMSESTSMTEHMNIMNSLFSQITSLGHKIKSNEHVELLLQSLPDSYDQLVINLKNNVLTDYLNFDDIAAAVLEEENRRKNKEDKLVSSQQAEAL

Query:  TVTRGRSAERGSS
        TVTR R  E  SS
Subjt:  TVTRGRSAERGSS

TrEMBL top hitse value%identityAlignment
A0A5A7SNG9 Gag-Pol1.6e-8377.51Show/hide
Query:  MAEKFEIEKFNITNFSLWKLKMNAFLRKDNCLAAIDGRPTKITDDSEWNKMEGNAIANTHLALADNVLSSIEEKKTVKEIWDHFTKLYEAKSLHNKIFLK
        MA KFEIEKFN TNFSLW LKM   LR DNCL AID  P +ITDD++WN+M+GNA+ N HLALADNVLSSI+EKK  KEIWDH TKLYE KSLHNKIFLK
Subjt:  MAEKFEIEKFNITNFSLWKLKMNAFLRKDNCLAAIDGRPTKITDDSEWNKMEGNAIANTHLALADNVLSSIEEKKTVKEIWDHFTKLYEAKSLHNKIFLK

Query:  RKLYTLRMSESTSMTEHMNIMNSLFSQITSLGHKIKSNEHVELLLQSLPDSYDQLVINLKNNVLTDYLNFDDIAAAVLEEENRRKNKEDKLVSSQQAEAL
        RKLYTLRMSESTSMTEHMN +N+LFSQ+  LG+KI+ NE  ELLLQSL DSYDQLVINLKNN+L DYL+FDD+ +AVLEEENRRKNKEDKL++ QQAEAL
Subjt:  RKLYTLRMSESTSMTEHMNIMNSLFSQITSLGHKIKSNEHVELLLQSLPDSYDQLVINLKNNVLTDYLNFDDIAAAVLEEENRRKNKEDKLVSSQQAEAL

Query:  TVTRGRSAE
        TVTRGR  E
Subjt:  TVTRGRSAE

A0A5A7TUN0 Uncharacterized protein2.9e-8578.87Show/hide
Query:  MAEKFEIEKFNITNFSLWKLKMNAFLRKDNCLAAIDGRPTKITDDSEWNKMEGNAIANTHLALADNVLSSIEEKKTVKEIWDHFTKLYEAKSLHNKIFLK
        M  KF+IEKFN TNFSLWKLKM A  RKDNCL AID RP +ITDD++WN+M+GNA+AN HLAL DNVLSSIEEKK  KEIWDH  KLYEAKSLHNKIFLK
Subjt:  MAEKFEIEKFNITNFSLWKLKMNAFLRKDNCLAAIDGRPTKITDDSEWNKMEGNAIANTHLALADNVLSSIEEKKTVKEIWDHFTKLYEAKSLHNKIFLK

Query:  RKLYTLRMSESTSMTEHMNIMNSLFSQITSLGHKIKSNEHVELLLQSLPDSYDQLVINLKNNVLTDYLNFDDIAAAVLEEENRRKNKEDKLVSSQQAEAL
        RKLYTLRMS ST MTEHMN +N+LFSQ+T LG+KI+ N+H ELLLQSLPDSYDQLVINL NN+LTDYL+FDD+AA VLEEENR KNKEDKLVSSQQAEAL
Subjt:  RKLYTLRMSESTSMTEHMNIMNSLFSQITSLGHKIKSNEHVELLLQSLPDSYDQLVINLKNNVLTDYLNFDDIAAAVLEEENRRKNKEDKLVSSQQAEAL

Query:  TVTRGRSAERGSS
        TVTR R  E  SS
Subjt:  TVTRGRSAERGSS

A0A5A7V644 Gag-Pol9.6e-8980.75Show/hide
Query:  MAEKFEIEKFNITNFSLWKLKMNAFLRKDNCLAAIDGRPTKITDDSEWNKMEGNAIANTHLALADNVLSSIEEKKTVKEIWDHFTKLYEAKSLHNKIFLK
        MA  FEIEKFN TNFSLWKLKM   LRKDNCL  +D RP +I DDS+WN+M+GNA AN HLALADNVLSSIEEKKT KEIWDH TKLY+AKSLHNKIFLK
Subjt:  MAEKFEIEKFNITNFSLWKLKMNAFLRKDNCLAAIDGRPTKITDDSEWNKMEGNAIANTHLALADNVLSSIEEKKTVKEIWDHFTKLYEAKSLHNKIFLK

Query:  RKLYTLRMSESTSMTEHMNIMNSLFSQITSLGHKIKSNEHVELLLQSLPDSYDQLVINLKNNVLTDYLNFDDIAAAVLEEENRRKNKEDKLVSSQQAEAL
        RKLYTLRMSESTSMTEHMNI+N+LFSQ+T L +KI+ NE  ELLLQSLPDSYDQLVINL NN+LTDYL+FDD+ +AVLEEENRRKNKEDKLVSSQQAEAL
Subjt:  RKLYTLRMSESTSMTEHMNIMNSLFSQITSLGHKIKSNEHVELLLQSLPDSYDQLVINLKNNVLTDYLNFDDIAAAVLEEENRRKNKEDKLVSSQQAEAL

Query:  TVTRGRSAERGSS
         VTRGRS+E  SS
Subjt:  TVTRGRSAERGSS

A0A5D3CXA6 Uncharacterized protein1.6e-8377.93Show/hide
Query:  MAEKFEIEKFNITNFSLWKLKMNAFLRKDNCLAAIDGRPTKITDDSEWNKMEGNAIANTHLALADNVLSSIEEKKTVKEIWDHFTKLYEAKSLHNKIFLK
        M  KF+IEKFN TNFSLWKLKM A  RKDNCL AID RP +ITDD++WN+M+GNA+AN HLALADNVLSSIEEKK  KEIWDH  KLYEAKSLHNKIFLK
Subjt:  MAEKFEIEKFNITNFSLWKLKMNAFLRKDNCLAAIDGRPTKITDDSEWNKMEGNAIANTHLALADNVLSSIEEKKTVKEIWDHFTKLYEAKSLHNKIFLK

Query:  RKLYTLRMSESTSMTEHMNIMNSLFSQITSLGHKIKSNEHVELLLQSLPDSYDQLVINLKNNVLTDYLNFDDIAAAVLEEENRRKNKEDKLVSSQQAEAL
        RKLYTLRMS ST MTEHMN +N+LFSQ+T LG+KI+ N+  ELLLQSLPDSYDQLVINL NN+LTDYL+FDD+ A VLEEENR KNKEDKLVSSQQAE L
Subjt:  RKLYTLRMSESTSMTEHMNIMNSLFSQITSLGHKIKSNEHVELLLQSLPDSYDQLVINLKNNVLTDYLNFDDIAAAVLEEENRRKNKEDKLVSSQQAEAL

Query:  TVTRGRSAERGSS
        TVTR R  E  SS
Subjt:  TVTRGRSAERGSS

Q6BCY1 Gag-Pol2.8e-8073.49Show/hide
Query:  MAEKFEIEKFNITNFSLWKLKMNAFLRKDNCLAAIDGRPTKITDDSEWNKMEGNAIANTHLALADNVLSSIEEKKTVKEIWDHFTKLYEAKSLHNKIFLK
        MA KFEIEKFN  NFSLWKLK+ A LRKDNCLAAI  RP   TDD +W++M  +A+A+ +L++AD VLSSIEEKKT  EIWDH  +LYEAKSLHNKIFLK
Subjt:  MAEKFEIEKFNITNFSLWKLKMNAFLRKDNCLAAIDGRPTKITDDSEWNKMEGNAIANTHLALADNVLSSIEEKKTVKEIWDHFTKLYEAKSLHNKIFLK

Query:  RKLYTLRMSESTSMTEHMNIMNSLFSQITSLGHKIKSNEHVELLLQSLPDSYDQLVINLKNNVLTDYLNFDDIAAAVLEEENRRKNKEDKLVSSQQAEAL
        RKLYTLRMSESTS+TEH+N +N+LFSQ+TSL  KI+  E  ELLLQSLPDSYDQL+INL NN+LTDYL FDD+AAAVLEEE+RRKNKED+ V+ QQAEAL
Subjt:  RKLYTLRMSESTSMTEHMNIMNSLFSQITSLGHKIKSNEHVELLLQSLPDSYDQLVINLKNNVLTDYLNFDDIAAAVLEEENRRKNKEDKLVSSQQAEAL

Query:  TVTRGRSAERGSSEG
        TV RGRS ERG S G
Subjt:  TVTRGRSAERGSSEG

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.0e-1427.32Show/hide
Query:  KFEIEKFNITNFSLWKLKMNAFLRKDNCLAAIDGRPTKITDDSEWNKMEGNAIANTHLALADNVLSSIEEKKTVKEIWDHFTKLYEAKSLHNKIFLKRKL
        K  I+ F+   +++WK ++ A L + + L  +DG      DDS W K E  A +     L+D+ L+      T ++I ++   +YE KSL +++ L+++L
Subjt:  KFEIEKFNITNFSLWKLKMNAFLRKDNCLAAIDGRPTKITDDSEWNKMEGNAIANTHLALADNVLSSIEEKKTVKEIWDHFTKLYEAKSLHNKIFLKRKL

Query:  YTLRMSESTSMTEHMNIMNSLFSQITSLGHKIKSNEHVELLLQSLPDSYDQLVINLKNNVLTDYLNFDDIAAAVLEEENRRKN
         +L++S   S+  H +I + L S++ + G KI+  + +  LL +LP  YD ++  ++  +  + L    +   +L++E + KN
Subjt:  YTLRMSESTSMTEHMNIMNSLFSQITSLGHKIKSNEHVELLLQSLPDSYDQLVINLKNNVLTDYLNFDDIAAAVLEEENRRKN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.2e-2533.8Show/hide
Query:  KFEIEKFNITN-FSLWKLKMNAFLRKDNC--LAAIDGRPTKITDDSEWNKMEGNAIANTHLALADNVLSSIEEKKTVKEIWDHFTKLYEAKSLHNKIFLK
        K+E+ KFN  N FS W+ +M   L +     +  +D +        +W  ++  A +   L L+D+V+++I ++ T + IW     LY +K+L NK++LK
Subjt:  KFEIEKFNITN-FSLWKLKMNAFLRKDNC--LAAIDGRPTKITDDSEWNKMEGNAIANTHLALADNVLSSIEEKKTVKEIWDHFTKLYEAKSLHNKIFLK

Query:  RKLYTLRMSESTSMTEHMNIMNSLFSQITSLGHKIKSNEHVELLLQSLPDSYDQLVINLKNNVLTDYLNFDDIAAAVLEEENRRKNKEDKLVSSQQAEAL
        ++LY L MSE T+   H+N+ N L +Q+ +LG KI+  +   LLL SLP SYD L   + +   T  +   D+ +A+L  E  RK  E     +Q    +
Subjt:  RKLYTLRMSESTSMTEHMNIMNSLFSQITSLGHKIKSNEHVELLLQSLPDSYDQLVINLKNNVLTDYLNFDDIAAAVLEEENRRKNKEDKLVSSQQAEAL

Query:  TVTRGRSAERGSS
        T  RGRS +R S+
Subjt:  TVTRGRSAERGSS

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGAAAAGTTCGAGATTGAGAAGTTCAACATAACTAATTTCTCGTTGTGGAAGTTGAAGATGAATGCTTTCTTGAGAAAAGACAATTGCCTTGCAGCCATCGATGG
GAGGCCAACGAAGATCACAGATGATAGCGAGTGGAACAAGATGGAAGGGAATGCTATTGCAAACACTCATCTAGCATTGGCAGATAATGTATTGTCAAGCATAGAGGAGA
AGAAAACTGTAAAGGAGATTTGGGATCATTTCACCAAATTGTATGAGGCTAAATCACTTCACAACAAGATTTTCCTTAAGAGGAAGTTGTATACTCTTCGGATGTCAGAG
TCCACATCAATGACAGAGCACATGAACATAATGAATAGTCTATTTTCTCAAATCACATCATTGGGTCATAAAATAAAGTCAAATGAACATGTTGAACTTCTACTTCAAAG
TCTTCCTGATTCGTATGATCAACTTGTCATCAATTTAAAAAATAATGTTCTCACCGACTATCTAAACTTTGATGATATTGCAGCTGCTGTTCTAGAAGAGGAAAATCGGC
GCAAGAATAAAGAAGATAAGTTGGTAAGTTCACAACAAGCAGAAGCATTGACGGTGACAAGAGGCAGATCAGCGGAGCGTGGCTCAAGTGAGGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGAAAAGTTCGAGATTGAGAAGTTCAACATAACTAATTTCTCGTTGTGGAAGTTGAAGATGAATGCTTTCTTGAGAAAAGACAATTGCCTTGCAGCCATCGATGG
GAGGCCAACGAAGATCACAGATGATAGCGAGTGGAACAAGATGGAAGGGAATGCTATTGCAAACACTCATCTAGCATTGGCAGATAATGTATTGTCAAGCATAGAGGAGA
AGAAAACTGTAAAGGAGATTTGGGATCATTTCACCAAATTGTATGAGGCTAAATCACTTCACAACAAGATTTTCCTTAAGAGGAAGTTGTATACTCTTCGGATGTCAGAG
TCCACATCAATGACAGAGCACATGAACATAATGAATAGTCTATTTTCTCAAATCACATCATTGGGTCATAAAATAAAGTCAAATGAACATGTTGAACTTCTACTTCAAAG
TCTTCCTGATTCGTATGATCAACTTGTCATCAATTTAAAAAATAATGTTCTCACCGACTATCTAAACTTTGATGATATTGCAGCTGCTGTTCTAGAAGAGGAAAATCGGC
GCAAGAATAAAGAAGATAAGTTGGTAAGTTCACAACAAGCAGAAGCATTGACGGTGACAAGAGGCAGATCAGCGGAGCGTGGCTCAAGTGAGGGCTGA
Protein sequenceShow/hide protein sequence
MAEKFEIEKFNITNFSLWKLKMNAFLRKDNCLAAIDGRPTKITDDSEWNKMEGNAIANTHLALADNVLSSIEEKKTVKEIWDHFTKLYEAKSLHNKIFLKRKLYTLRMSE
STSMTEHMNIMNSLFSQITSLGHKIKSNEHVELLLQSLPDSYDQLVINLKNNVLTDYLNFDDIAAAVLEEENRRKNKEDKLVSSQQAEALTVTRGRSAERGSSEG