; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG04G012793 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG04G012793
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionReverse transcriptase domain-containing protein
Genome locationCG_Chr04:27623888..27625164
RNA-Seq ExpressionClCG04G012793
SyntenyClCG04G012793
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0004518 - nuclease activity (molecular function)
GO:0140097 - catalytic activity, acting on DNA (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0045287.1 uncharacterized protein E6C27_scaffold316G00450 [Cucumis melo var. makuwa]2.0e-2261.36Show/hide
Query:  IKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEMLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAELSSLVAYCVEA
        IK LWS  DIG  F+E+IGRS G+LTMWDES+ISV E++KG ++LSVKC  I KK CWI+NVYGP  ++ERK +W ELS   A C+ A
Subjt:  IKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEMLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAELSSLVAYCVEA

KAA0063088.1 uncharacterized protein E6C27_scaffold623G00050 [Cucumis melo var. makuwa]7.2e-3360.71Show/hide
Query:  LKRLNLDLVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEMLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAE
        L++++LD+VLIQE+KK+  DI  IK LWSSKD GW   E  G S G+LT+WD SK+ VIE LKGGYSLS+  + + KKSCWITNVYGPND++ER+ +W E
Subjt:  LKRLNLDLVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEMLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAE

Query:  LSSLVAYCVEAW
        L SL  YC +AW
Subjt:  LSSLVAYCVEAW

TYJ98683.1 hypothetical protein E5676_scaffold429G00120 [Cucumis melo var. makuwa]5.3e-2859.05Show/hide
Query:  LVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEMLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAELSSLVAY
        LV+    +   +DI  IK LWSSKDIGW  VE+ GR  G+LTMWD SKI V+E LKGGYSLS+  +   KKSCWITNVYGP DY ER+ +W  L SL  Y
Subjt:  LVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEMLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAELSSLVAY

Query:  CVEAW
        C  AW
Subjt:  CVEAW

XP_031739979.1 uncharacterized protein LOC116403332 [Cucumis sativus]1.5e-1741.07Show/hide
Query:  LKRLNLDLVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEMLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAE
        L + N D+V++Q++K  +++ + +K +WSS  +GWA +EA G S G+L +W E  I+V++ ++G +S+S+        S WIT VYGP+ YR R   W E
Subjt:  LKRLNLDLVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEMLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAE

Query:  LSSLVAYCVEAW
        LSSL   C E W
Subjt:  LSSLVAYCVEAW

XP_038876676.1 uncharacterized protein LOC120069076 [Benincasa hispida]2.3e-3161.61Show/hide
Query:  LKRLNLDLVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEMLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAE
        LK++N D+VLIQETKKD ++ + IK LWSSK++G AFVEA G+S G+LT+WD+SKI V  + K  +SLS+KC  INKK CWITNVYGP DY+ER+ LWAE
Subjt:  LKRLNLDLVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEMLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAE

Query:  LSSLVAYCVEAW
        LSSL     + W
Subjt:  LSSLVAYCVEAW

TrEMBL top hitse value%identityAlignment
A0A1U8B190 uncharacterized protein LOC1046062231.0e-1640.95Show/hide
Query:  LKRLNLDLVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEMLKGGYSLSVKCLIINKKSCWI-TNVYGPNDYRERKHLWA
        L+R   D+VL+QE+K   LD   ++  W S+ +GW+   + G S G++T+W E  + V+E L G +S+S+KC  +     W+ TNVYGPN YRER  +W 
Subjt:  LKRLNLDLVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEMLKGGYSLSVKCLIINKKSCWI-TNVYGPNDYRERKHLWA

Query:  ELSSL
        EL ++
Subjt:  ELSSL

A0A5A7TTX5 Uncharacterized protein9.5e-2361.36Show/hide
Query:  IKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEMLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAELSSLVAYCVEA
        IK LWS  DIG  F+E+IGRS G+LTMWDES+ISV E++KG ++LSVKC  I KK CWI+NVYGP  ++ERK +W ELS   A C+ A
Subjt:  IKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEMLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAELSSLVAYCVEA

A0A5A7V639 Uncharacterized protein3.5e-3360.71Show/hide
Query:  LKRLNLDLVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEMLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAE
        L++++LD+VLIQE+KK+  DI  IK LWSSKD GW   E  G S G+LT+WD SK+ VIE LKGGYSLS+  + + KKSCWITNVYGPND++ER+ +W E
Subjt:  LKRLNLDLVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEMLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAE

Query:  LSSLVAYCVEAW
        L SL  YC +AW
Subjt:  LSSLVAYCVEAW

A0A5D3BHE3 Uncharacterized protein2.6e-2859.05Show/hide
Query:  LVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEMLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAELSSLVAY
        LV+    +   +DI  IK LWSSKDIGW  VE+ GR  G+LTMWD SKI V+E LKGGYSLS+  +   KKSCWITNVYGP DY ER+ +W  L SL  Y
Subjt:  LVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEMLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAELSSLVAY

Query:  CVEAW
        C  AW
Subjt:  CVEAW

A0A803QI00 Uncharacterized protein1.1e-1528.63Show/hide
Query:  NKEEDED----SIVSASSDDLDYLGSEEDLE------EEALLSNNGSALKNLFQSMENQDLDIVKVINCKLIGKDIIP-QNLISIVEDCDLVLG------
        ++E DED      +  SS+D D  G  ED E       E +L N     K   +    Q+L + K+ +   + KD +  + +I  +++ D+ +       
Subjt:  NKEEDED----SIVSASSDDLDYLGSEEDLE------EEALLSNNGSALKNLFQSMENQDLDIVKVINCKLIGKDIIP-QNLISIVEDCDLVLG------

Query:  ----------------------LKRLNLDLVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEMLKGGYSLSVKCLIINKK
                              + + N DLV++QE K+ S+D   I  +W S+   W  + AIGRS G L +WD   I+V++ L G +S+SV      K 
Subjt:  ----------------------LKRLNLDLVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEMLKGGYSLSVKCLIINKK

Query:  SCWITNVYGPNDYRERKHLWAELSSLVAYCVEAW
          W + VYGP  Y+ R   W EL+ L A C ++W
Subjt:  SCWITNVYGPNDYRERKHLWAELSSLVAYCVEAW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATTCTTCAGTGTAATCAGCAATAGAATGATCCCTTTGCCTACATTGTTGGTATGTGCTATAAAGGATTTGCTGGTCTTCTTGGGTTTCTTGGAGGCCATTGTGAA
TTCGGATTTGTTTGAAGAATGTTGTGTCAGAATTTACTCTCAAGGTTTATCCCATATCAGGTCAGATCATCATATCCCCAAGAATAATTCCTTTTCCCAAGCGGAATTTA
AAATCCCTGGCTCTAACTCACCCTTTATTCGAGGTATCCCAAGCCCTGATAATCGTGGGGTGCAAATCAACAAAGAAGAAGATGAAGATTCAATTGTCAGCGCTAGTAGT
GATGATTTGGACTACTTAGGCTCTGAGGAAGATCTGGAAGAGGAGGCCCTTTTATCCAACAATGGAAGTGCTTTGAAGAATCTGTTCCAATCTATGGAAAATCAAGACCT
TGACATTGTGAAAGTTATAAACTGCAAACTGATTGGGAAGGATATAATCCCTCAAAATCTAATCTCAATTGTTGAGGATTGTGACTTGGTCCTTGGTTTAAAGCGACTGA
ATCTGGATTTAGTTTTAATACAAGAAACAAAGAAGGATAGTCTTGACATCAATACTATCAAAGAACTATGGAGCTCCAAGGATATTGGATGGGCGTTTGTGGAGGCAATT
GGAAGGTCGAGAGGTATGTTAACCATGTGGGATGAAAGTAAGATTTCAGTCATTGAAATGCTAAAAGGTGGATACTCACTTTCAGTCAAATGCCTTATAATCAACAAAAA
GAGCTGCTGGATAACAAATGTATATGGCCCTAATGATTACCGCGAGAGGAAGCATCTGTGGGCCGAACTGTCTTCTTTGGTGGCATACTGTGTAGAGGCGTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAATTCTTCAGTGTAATCAGCAATAGAATGATCCCTTTGCCTACATTGTTGGTATGTGCTATAAAGGATTTGCTGGTCTTCTTGGGTTTCTTGGAGGCCATTGTGAA
TTCGGATTTGTTTGAAGAATGTTGTGTCAGAATTTACTCTCAAGGTTTATCCCATATCAGGTCAGATCATCATATCCCCAAGAATAATTCCTTTTCCCAAGCGGAATTTA
AAATCCCTGGCTCTAACTCACCCTTTATTCGAGGTATCCCAAGCCCTGATAATCGTGGGGTGCAAATCAACAAAGAAGAAGATGAAGATTCAATTGTCAGCGCTAGTAGT
GATGATTTGGACTACTTAGGCTCTGAGGAAGATCTGGAAGAGGAGGCCCTTTTATCCAACAATGGAAGTGCTTTGAAGAATCTGTTCCAATCTATGGAAAATCAAGACCT
TGACATTGTGAAAGTTATAAACTGCAAACTGATTGGGAAGGATATAATCCCTCAAAATCTAATCTCAATTGTTGAGGATTGTGACTTGGTCCTTGGTTTAAAGCGACTGA
ATCTGGATTTAGTTTTAATACAAGAAACAAAGAAGGATAGTCTTGACATCAATACTATCAAAGAACTATGGAGCTCCAAGGATATTGGATGGGCGTTTGTGGAGGCAATT
GGAAGGTCGAGAGGTATGTTAACCATGTGGGATGAAAGTAAGATTTCAGTCATTGAAATGCTAAAAGGTGGATACTCACTTTCAGTCAAATGCCTTATAATCAACAAAAA
GAGCTGCTGGATAACAAATGTATATGGCCCTAATGATTACCGCGAGAGGAAGCATCTGTGGGCCGAACTGTCTTCTTTGGTGGCATACTGTGTAGAGGCGTGGTGA
Protein sequenceShow/hide protein sequence
MKFFSVISNRMIPLPTLLVCAIKDLLVFLGFLEAIVNSDLFEECCVRIYSQGLSHIRSDHHIPKNNSFSQAEFKIPGSNSPFIRGIPSPDNRGVQINKEEDEDSIVSASS
DDLDYLGSEEDLEEEALLSNNGSALKNLFQSMENQDLDIVKVINCKLIGKDIIPQNLISIVEDCDLVLGLKRLNLDLVLIQETKKDSLDINTIKELWSSKDIGWAFVEAI
GRSRGMLTMWDESKISVIEMLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAELSSLVAYCVEAW