CuGenDBv2

CmUC04G080090 (gene) of Watermelon (USVL531) v1 genome

Gene ID: CmUC04G080090
Organism: Citrullus mucosospermus (Watermelon (USVL531) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: CmU531Chr04:26288842..26290118
RNA-Seq Expression: CmUC04G080090
Synteny: CmUC04G080090
Gene Ontology terms:
  GO:0006281 - DNA repair (biological process)
  GO:0110165 - cellular anatomical structure (cellular component)
  GO:0004518 - nuclease activity (molecular function)
  GO:0140097 - catalytic activity, acting on DNA (molecular function)
InterPro domains:
  IPR004808 - AP endonuclease 1
  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
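The genome location field follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page itself does not say), the field can be parsed and the span length computed as follows. The names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tooling.

```python
import re

# Hypothetical helper: matches strings of the form "seqid:start..end".
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a location string into (seqid, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return match.group("seqid"), int(match.group("start")), int(match.group("end"))


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("CmU531Chr04:26288842..26290118")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the locus spans 1,277 bp on CmU531Chr04.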


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0045287.1 uncharacterized protein E6C27_scaffold316G00450 [Cucumis melo var. makuwa] (e-value: 6.7e-23; identity: 62.5%)
Query:  IKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEVLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAELSSLVAYCVEA
        IK LWS  DIG  F+E+IGRS G+LTMWDES+ISV EV+KG ++LSVKC  I KK CWI+NVYGP  ++ERK +W ELS   A C+ A
Subjt:  IKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEVLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAELSSLVAYCVEA

KAA0063088.1 uncharacterized protein E6C27_scaffold623G00050 [Cucumis melo var. makuwa] (e-value: 4.2e-33; identity: 60.71%)
Query:  LKRLNLDLVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEVLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAE
        L++++LD+VLIQE+KK+  DI  IK LWSSKD GW   E  G S G+LT+WD SK+ VIE LKGGYSLS+  + + KKSCWITNVYGPND++ER+ +W E
Subjt:  LKRLNLDLVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEVLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAE

Query:  LSSLVAYCVEAW
        L SL  YC +AW
Subjt:  LSSLVAYCVEAW

TYJ98683.1 hypothetical protein E5676_scaffold429G00120 [Cucumis melo var. makuwa] (e-value: 3.1e-28; identity: 59.05%)
Query:  LVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEVLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAELSSLVAY
        LV+    +   +DI  IK LWSSKDIGW  VE+ GR  G+LTMWD SKI V+E LKGGYSLS+  +   KKSCWITNVYGP DY ER+ +W  L SL  Y
Subjt:  LVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEVLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAELSSLVAY

Query:  CVEAW
        C  AW
Subjt:  CVEAW

XP_031739979.1 uncharacterized protein LOC116403332 [Cucumis sativus] (e-value: 1.5e-17; identity: 41.07%)
Query:  LKRLNLDLVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEVLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAE
        L + N D+V++Q++K  +++ + +K +WSS  +GWA +EA G S G+L +W E  I+V++ ++G +S+S+        S WIT VYGP+ YR R   W E
Subjt:  LKRLNLDLVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEVLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAE

Query:  LSSLVAYCVEAW
        LSSL   C E W
Subjt:  LSSLVAYCVEAW

XP_038876676.1 uncharacterized protein LOC120069076 [Benincasa hispida] (e-value: 1.0e-31; identity: 61.61%)
Query:  LKRLNLDLVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEVLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAE
        LK++N D+VLIQETKKD ++ + IK LWSSK++G AFVEA G+S G+LT+WD+SKI V  + K  +SLS+KC  INKK CWITNVYGP DY+ER+ LWAE
Subjt:  LKRLNLDLVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEVLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAE

Query:  LSSLVAYCVEAW
        LSSL     + W
Subjt:  LSSLVAYCVEAW

TrEMBL top hits (e-value, % identity, alignment)
A0A1U8B190 uncharacterized protein LOC104606223 (e-value: 1.0e-16; identity: 40.95%)
Query:  LKRLNLDLVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEVLKGGYSLSVKCLIINKKSCWI-TNVYGPNDYRERKHLWA
        L+R   D+VL+QE+K   LD   ++  W S+ +GW+   + G S G++T+W E  + V+E L G +S+S+KC  +     W+ TNVYGPN YRER  +W 
Subjt:  LKRLNLDLVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEVLKGGYSLSVKCLIINKKSCWI-TNVYGPNDYRERKHLWA

Query:  ELSSL
        EL ++
Subjt:  ELSSL

A0A438DN31 Uncharacterized protein (e-value: 8.6e-16; identity: 39.62%)
Query:  DLVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEVLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAELSSLVA
        D+V+IQETKK+  D   +  +W++++  WA + A G S G+L +WD  K+S  EV+ G +S+S+K  +   +S W++ VYGPN+   RK  W ELS +V 
Subjt:  DLVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEVLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAELSSLVA

Query:  YCVEAW
             W
Subjt:  YCVEAW

A0A5A7TTX5 Uncharacterized protein (e-value: 3.3e-23; identity: 62.5%)
Query:  IKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEVLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAELSSLVAYCVEA
        IK LWS  DIG  F+E+IGRS G+LTMWDES+ISV EV+KG ++LSVKC  I KK CWI+NVYGP  ++ERK +W ELS   A C+ A
Subjt:  IKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEVLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAELSSLVAYCVEA

A0A5A7V639 Uncharacterized protein (e-value: 2.0e-33; identity: 60.71%)
Query:  LKRLNLDLVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEVLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAE
        L++++LD+VLIQE+KK+  DI  IK LWSSKD GW   E  G S G+LT+WD SK+ VIE LKGGYSLS+  + + KKSCWITNVYGPND++ER+ +W E
Subjt:  LKRLNLDLVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEVLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAE

Query:  LSSLVAYCVEAW
        L SL  YC +AW
Subjt:  LSSLVAYCVEAW

A0A5D3BHE3 Uncharacterized protein (e-value: 1.5e-28; identity: 59.05%)
Query:  LVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEVLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAELSSLVAY
        LV+    +   +DI  IK LWSSKDIGW  VE+ GR  G+LTMWD SKI V+E LKGGYSLS+  +   KKSCWITNVYGP DY ER+ +W  L SL  Y
Subjt:  LVLIQETKKDSLDINTIKELWSSKDIGWAFVEAIGRSRGMLTMWDESKISVIEVLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAELSSLVAY

Query:  CVEAW
        C  AW
Subjt:  CVEAW

SwissProt top hits: no hits found
Arabidopsis top hits: no hits found
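The alignment blocks above carry an unlabeled middle line between each Query and Subjt row. As a rough sketch, assuming that line follows the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap), identities and positives for one block can be tallied as below. The function name `midline_stats` is hypothetical.

```python
def midline_stats(query, midline):
    """Count identities and positives in one alignment block.

    ASSUMES the BLAST midline convention: a letter equal to the query
    residue is an identity, '+' is a conservative substitution, and a
    space is a mismatch or gap.
    """
    # Trailing spaces are often stripped from scraped text; pad them back.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query row")
    identities = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Final block of the KAA0063088.1 alignment, prefixes removed.
identities, positives, columns = midline_stats("LSSLVAYCVEAW", "L SL  YC +AW")
print(identities, positives, columns)
```

The counts apply to a single displayed block only. The % identity values listed with each hit are reported for the hit as a whole, so a per-block ratio need not match them.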

Sequences
CDS sequence
ATGAAATTCTTCAGTGTAATCAGCAATAGAATGATCCCTTTGCCTACATTGTTGGTATGTGCTATAAAGGATTTGCTGGTCTTCTTGGGTTTCTTGGAGGCCATT
GTGAATTCGGATTTGTTTGAAGAATGTTGTGTCAGAATTTACTCTCAAGGTTTATCCCATATCAGGTCAGATCATCATATCCCCAAGAATAATTCCTTTTCCCAA
GCGGAATTTAAAATCCCTGGCTCTAACTCACCCTTTATTCGAGGTATCCCAAGCCCTGATAATCGTGGGGTGCAAATCAACAAAGAAGAAGATGAAGATTCAATT
GTCAGCGCTAGTAGTGATGATTTGGACTACTTAGGCTCTGAGGAAGATCTGGAAGAGGAGGCCCTTTTATCCAACAATGGAAGTGCTTTGAAGAATCTGTTCCAA
TCTATGGAAAATCAAGACCTTGACATTGTGAAAGTTATAAACCGCAAACTGATTGGGAAGGATATAATCCCTCAAAATCTAATCTCAATTGTTGAGGATTGTGAC
TTGGTCCTTGGTTTAAAGCGACTGAATCTGGATTTAGTTTTAATACAAGAAACAAAGAAGGATAGTCTTGACATCAATACTATCAAAGAACTATGGAGCTCCAAG
GATATTGGATGGGCGTTTGTGGAGGCAATTGGAAGGTCGAGAGGTATGTTAACCATGTGGGATGAAAGTAAGATTTCAGTCATTGAAGTGCTAAAAGGTGGATAC
TCACTTTCAGTCAAATGCCTTATAATCAACAAAAAGAGCTGCTGGATAACAAATGTATATGGCCCTAATGATTACCGCGAGAGGAAGCATCTGTGGGCCGAACTG
TCTTCTTTGGTGGCATACTGTGTAGAGGCGTGGTGA
mRNA sequence
ATGAAATTCTTCAGTGTAATCAGCAATAGAATGATCCCTTTGCCTACATTGTTGGTATGTGCTATAAAGGATTTGCTGGTCTTCTTGGGTTTCTTGGAGGCCATT
GTGAATTCGGATTTGTTTGAAGAATGTTGTGTCAGAATTTACTCTCAAGGTTTATCCCATATCAGGTCAGATCATCATATCCCCAAGAATAATTCCTTTTCCCAA
GCGGAATTTAAAATCCCTGGCTCTAACTCACCCTTTATTCGAGGTATCCCAAGCCCTGATAATCGTGGGGTGCAAATCAACAAAGAAGAAGATGAAGATTCAATT
GTCAGCGCTAGTAGTGATGATTTGGACTACTTAGGCTCTGAGGAAGATCTGGAAGAGGAGGCCCTTTTATCCAACAATGGAAGTGCTTTGAAGAATCTGTTCCAA
TCTATGGAAAATCAAGACCTTGACATTGTGAAAGTTATAAACCGCAAACTGATTGGGAAGGATATAATCCCTCAAAATCTAATCTCAATTGTTGAGGATTGTGAC
TTGGTCCTTGGTTTAAAGCGACTGAATCTGGATTTAGTTTTAATACAAGAAACAAAGAAGGATAGTCTTGACATCAATACTATCAAAGAACTATGGAGCTCCAAG
GATATTGGATGGGCGTTTGTGGAGGCAATTGGAAGGTCGAGAGGTATGTTAACCATGTGGGATGAAAGTAAGATTTCAGTCATTGAAGTGCTAAAAGGTGGATAC
TCACTTTCAGTCAAATGCCTTATAATCAACAAAAAGAGCTGCTGGATAACAAATGTATATGGCCCTAATGATTACCGCGAGAGGAAGCATCTGTGGGCCGAACTG
TCTTCTTTGGTGGCATACTGTGTAGAGGCGTGGTGA
Protein sequence
MKFFSVISNRMIPLPTLLVCAIKDLLVFLGFLEAIVNSDLFEECCVRIYSQGLSHIRSDHHIPKNNSFSQAEFKIPGSNSPFIRGIPSPDNRGVQINKEEDEDSI
VSASSDDLDYLGSEEDLEEEALLSNNGSALKNLFQSMENQDLDIVKVINRKLIGKDIIPQNLISIVEDCDLVLGLKRLNLDLVLIQETKKDSLDINTIKELWSSK
DIGWAFVEAIGRSRGMLTMWDESKISVIEVLKGGYSLSVKCLIINKKSCWITNVYGPNDYRERKHLWAELSSLVAYCVEAW
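The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code and reading frame 1. The `translate` helper is hypothetical and uses only the Python standard library, and only the final line of the CDS is shown as input.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1; '*' marks a stop, 'X' an unknown codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# Last line of the CDS sequence listed above.
cds_tail = "TCTTCTTTGGTGGCATACTGTGTAGAGGCGTGGTGA"
print(translate(cds_tail))  # SSLVAYCVEAW*
```

The translated tail, without the terminal stop, matches the final residues of the protein sequence (`...SSLVAYCVEAW`). Passing the full CDS to the same function would allow the whole record to be compared.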