; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG02G011592 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG02G011592
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionReverse transcriptase domain-containing protein
Genome locationCG_Chr02:24470111..24473422
RNA-Seq ExpressionClCG02G011592
SyntenyClCG02G011592
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0016020 - membrane (cellular component)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0063979.1 hypothetical protein E6C27_scaffold616G001380 [Cucumis melo var. makuwa]5.5e-3136.82Show/hide
Query:  MEVVSCKINGIFFCTWFEDGNFIVEDMESNSSVFAPDK-------------QSEYHCWIRKE-------NEVFKEDFANLWVV-----------SRLFAF
        MEV S KI    +C W +  +F +ED++   ++                  Q E HC+  K          + K   +  W+V             LF F
Subjt:  MEVVSCKINGIFFCTWFEDGNFIVEDMESNSSVFAPDK-------------QSEYHCWIRKE-------NEVFKEDFANLWVV-----------SRLFAF

Query:  DEWKDIVRFLEDFYQIKVSINPLFADKALIKVSQCNLKEIIETPGKWFDYRKFHLLFEKWNSIQDSRPTCIKGYGGWLVIRNLPLEYWNRATFEAIGSHF
        D+W +I  FLED +Q+K   NPLFADKA++K+ Q N++++    GKW+DY K+HLLFEKW   + S P+ IKG+ GWL I+N PL+ W     E IG+HF
Subjt:  DEWKDIVRFLEDFYQIKVSINPLFADKALIKVSQCNLKEIIETPGKWFDYRKFHLLFEKWNSIQDSRPTCIKGYGGWLVIRNLPLEYWNRATFEAIGSHF

Query:  G
        G
Subjt:  G

TYJ98683.1 hypothetical protein E5676_scaffold429G00120 [Cucumis melo var. makuwa]2.9e-4039.83Show/hide
Query:  KTKKEAIELDIIKAFWSSKDIGWIDVEAYGKSGGMLTMWEESEVSVLESLKGGYSLSVKCKTLSKKVCWVTNIYRPTPTDYKERRFIWPELASLSAYCT-
        + + + I++ +IK+ WSSKDIGW  VE++G+ GG+LTMW+ S++ V+E+LKGGYSLS+   T  KK CW+TN+Y   P DY+ERRF+W  L SLS YCT 
Subjt:  KTKKEAIELDIIKAFWSSKDIGWIDVEAYGKSGGMLTMWEESEVSVLESLKGGYSLSVKCKTLSKKVCWVTNIYRPTPTDYKERRFIWPELASLSAYCT-

Query:  --------------------EDQ-----------------------------REGRGMSCSLLDTFLVSNNWEEALEDTRVTSQARLYSDHFPILLTASV
                            E Q                             REG  +S SLLD F +   W+E  E++RV  +A   SDHFP+LL A  
Subjt:  --------------------EDQ-----------------------------REGRGMSCSLLDTFLVSNNWEEALEDTRVTSQARLYSDHFPILLTASV

Query:  FEMGPSPFRFCHSWLLNHQNCRIIERSIVMSNHHGWAGFII
         + GPSPFRF +SWL   +  RII+    +++   WAGF++
Subjt:  FEMGPSPFRFCHSWLLNHQNCRIIERSIVMSNHHGWAGFII

XP_038884535.1 DEAD-box ATP-dependent RNA helicase FANCM isoform X1 [Benincasa hispida]3.9e-3750.62Show/hide
Query:  CSLLDTFLVSNNWEEALEDTRVTSQARLYSDHFPILLTASVFEMGPSPFRFCHSWLLNHQNCRIIERSIVMSNHHGWAGFIIFSKLRSIKESVKKRQTEY
        CS   + LVS NW+E  ED+RV+ QAR  SDHFP+L  A  FE GPSPFRFC+SWL N + CRIIE S ++     WAGF ++S+LR +K+SVK+   E+
Subjt:  CSLLDTFLVSNNWEEALEDTRVTSQARLYSDHFPILLTASVFEMGPSPFRFCHSWLLNHQNCRIIERSIVMSNHHGWAGFIIFSKLRSIKESVKKRQTEY

Query:  RGKQRMQEEKILEFLGQEESRAESIDTPSIEMDMRSSLKAELMSIYRVEERNLIQKSKLN
           Q+++EE +L+ + +++ +A++++  S E D+R SLKA+L+S+Y+ EER+LIQKSKLN
Subjt:  RGKQRMQEEKILEFLGQEESRAESIDTPSIEMDMRSSLKAELMSIYRVEERNLIQKSKLN

XP_038884536.1 DEAD-box ATP-dependent RNA helicase FANCM isoform X2 [Benincasa hispida]3.9e-3750.62Show/hide
Query:  CSLLDTFLVSNNWEEALEDTRVTSQARLYSDHFPILLTASVFEMGPSPFRFCHSWLLNHQNCRIIERSIVMSNHHGWAGFIIFSKLRSIKESVKKRQTEY
        CS   + LVS NW+E  ED+RV+ QAR  SDHFP+L  A  FE GPSPFRFC+SWL N + CRIIE S ++     WAGF ++S+LR +K+SVK+   E+
Subjt:  CSLLDTFLVSNNWEEALEDTRVTSQARLYSDHFPILLTASVFEMGPSPFRFCHSWLLNHQNCRIIERSIVMSNHHGWAGFIIFSKLRSIKESVKKRQTEY

Query:  RGKQRMQEEKILEFLGQEESRAESIDTPSIEMDMRSSLKAELMSIYRVEERNLIQKSKLN
           Q+++EE +L+ + +++ +A++++  S E D+R SLKA+L+S+Y+ EER+LIQKSKLN
Subjt:  RGKQRMQEEKILEFLGQEESRAESIDTPSIEMDMRSSLKAELMSIYRVEERNLIQKSKLN

XP_038884537.1 DEAD-box ATP-dependent RNA helicase FANCM isoform X3 [Benincasa hispida]3.9e-3750.62Show/hide
Query:  CSLLDTFLVSNNWEEALEDTRVTSQARLYSDHFPILLTASVFEMGPSPFRFCHSWLLNHQNCRIIERSIVMSNHHGWAGFIIFSKLRSIKESVKKRQTEY
        CS   + LVS NW+E  ED+RV+ QAR  SDHFP+L  A  FE GPSPFRFC+SWL N + CRIIE S ++     WAGF ++S+LR +K+SVK+   E+
Subjt:  CSLLDTFLVSNNWEEALEDTRVTSQARLYSDHFPILLTASVFEMGPSPFRFCHSWLLNHQNCRIIERSIVMSNHHGWAGFIIFSKLRSIKESVKKRQTEY

Query:  RGKQRMQEEKILEFLGQEESRAESIDTPSIEMDMRSSLKAELMSIYRVEERNLIQKSKLN
           Q+++EE +L+ + +++ +A++++  S E D+R SLKA+L+S+Y+ EER+LIQKSKLN
Subjt:  RGKQRMQEEKILEFLGQEESRAESIDTPSIEMDMRSSLKAELMSIYRVEERNLIQKSKLN

TrEMBL top hitse value%identityAlignment
A0A5A7T996 Ulp1-like peptidase9.7e-2653.12Show/hide
Query:  IVRFLEDFYQIKVSINPLFADKALIKVSQCNLKEIIETPGKWFDYRKFHLLFEKWNSIQDSRPTCIKGYGGWLVIRNLPLEYWNRATFEAIGSHFG
        I+ FLED +++K++INP F ++A  K++   ++ ++ T  KWF+Y KFHLLFEKW+ I  SRP+ IKG+ GWL I+NLPL+ W RA FE IG+HFG
Subjt:  IVRFLEDFYQIKVSINPLFADKALIKVSQCNLKEIIETPGKWFDYRKFHLLFEKWNSIQDSRPTCIKGYGGWLVIRNLPLEYWNRATFEAIGSHFG

A0A5A7UR38 Transposon TX1 uncharacterized8.2e-2556.44Show/hide
Query:  REGRGMSCSLLDTFLVSNNWEEALEDTRVTSQARLYSDHFPILLTASVFEMGPSPFRFCHSWLLNHQNCRIIERSIVMSNHHGWAGFIIFSKLRSIKESV
        REGR +S SLLD FLV+ +W+E+  DTR + + RL SDH PILL A  FE GPSPFRFC+SWLL+ +  +II  S+   NH  W GF+I+SKLRS+K ++
Subjt:  REGRGMSCSLLDTFLVSNNWEEALEDTRVTSQARLYSDHFPILLTASVFEMGPSPFRFCHSWLLNHQNCRIIERSIVMSNHHGWAGFIIFSKLRSIKESV

Query:  K
        K
Subjt:  K

A0A5A7VEL7 Uncharacterized protein2.6e-3136.82Show/hide
Query:  MEVVSCKINGIFFCTWFEDGNFIVEDMESNSSVFAPDK-------------QSEYHCWIRKE-------NEVFKEDFANLWVV-----------SRLFAF
        MEV S KI    +C W +  +F +ED++   ++                  Q E HC+  K          + K   +  W+V             LF F
Subjt:  MEVVSCKINGIFFCTWFEDGNFIVEDMESNSSVFAPDK-------------QSEYHCWIRKE-------NEVFKEDFANLWVV-----------SRLFAF

Query:  DEWKDIVRFLEDFYQIKVSINPLFADKALIKVSQCNLKEIIETPGKWFDYRKFHLLFEKWNSIQDSRPTCIKGYGGWLVIRNLPLEYWNRATFEAIGSHF
        D+W +I  FLED +Q+K   NPLFADKA++K+ Q N++++    GKW+DY K+HLLFEKW   + S P+ IKG+ GWL I+N PL+ W     E IG+HF
Subjt:  DEWKDIVRFLEDFYQIKVSINPLFADKALIKVSQCNLKEIIETPGKWFDYRKFHLLFEKWNSIQDSRPTCIKGYGGWLVIRNLPLEYWNRATFEAIGSHF

Query:  G
        G
Subjt:  G

A0A5D3BHE3 Uncharacterized protein1.4e-4039.83Show/hide
Query:  KTKKEAIELDIIKAFWSSKDIGWIDVEAYGKSGGMLTMWEESEVSVLESLKGGYSLSVKCKTLSKKVCWVTNIYRPTPTDYKERRFIWPELASLSAYCT-
        + + + I++ +IK+ WSSKDIGW  VE++G+ GG+LTMW+ S++ V+E+LKGGYSLS+   T  KK CW+TN+Y   P DY+ERRF+W  L SLS YCT 
Subjt:  KTKKEAIELDIIKAFWSSKDIGWIDVEAYGKSGGMLTMWEESEVSVLESLKGGYSLSVKCKTLSKKVCWVTNIYRPTPTDYKERRFIWPELASLSAYCT-

Query:  --------------------EDQ-----------------------------REGRGMSCSLLDTFLVSNNWEEALEDTRVTSQARLYSDHFPILLTASV
                            E Q                             REG  +S SLLD F +   W+E  E++RV  +A   SDHFP+LL A  
Subjt:  --------------------EDQ-----------------------------REGRGMSCSLLDTFLVSNNWEEALEDTRVTSQARLYSDHFPILLTASV

Query:  FEMGPSPFRFCHSWLLNHQNCRIIERSIVMSNHHGWAGFII
         + GPSPFRF +SWL   +  RII+    +++   WAGF++
Subjt:  FEMGPSPFRFCHSWLLNHQNCRIIERSIVMSNHHGWAGFII

A0A5D3E3A5 Ulp1-like peptidase9.7e-2653.12Show/hide
Query:  IVRFLEDFYQIKVSINPLFADKALIKVSQCNLKEIIETPGKWFDYRKFHLLFEKWNSIQDSRPTCIKGYGGWLVIRNLPLEYWNRATFEAIGSHFG
        I+ FLED +++K++INP F ++A  K++   ++ ++ T  KWF+Y KFHLLFEKW+ I  SRP+ IKG+ GWL I+NLPL+ W RA FE IG+HFG
Subjt:  IVRFLEDFYQIKVSINPLFADKALIKVSQCNLKEIIETPGKWFDYRKFHLLFEKWNSIQDSRPTCIKGYGGWLVIRNLPLEYWNRATFEAIGSHFG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGTAGTTAGTTGCAAGATCAATGGAATTTTCTTCTGCACTTGGTTCGAAGATGGGAACTTCATAGTCGAAGATATGGAATCGAATTCTTCTGTTTTTGCA
CCAGATAAACAGAGCGAGTACCATTGCTGGATCCGAAAGGAAAATGAAGTGTTTAAGGAAGATTTTGCCAACTTGTGGGTTGTATCAAGACTATTCGCGTTCGAT
GAATGGAAAGATATTGTCAGATTTTTGGAAGATTTTTATCAAATCAAGGTCAGTATTAACCCTCTGTTTGCCGATAAGGCTCTGATTAAAGTATCTCAATGTAAT
TTAAAAGAGATAATCGAAACGCCTGGAAAATGGTTTGACTATCGTAAATTCCATCTTCTATTTGAAAAATGGAATTCAATTCAAGATAGTAGGCCTACTTGTATC
AAAGGTTATGGTGGATGGTTGGTTATAAGAAATCTGCCGTTGGAATATTGGAACAGAGCCACCTTTGAAGCCATTGGCTCCCACTTTGGAGTAGCAAAGGATGAA
GATGTTAGTTGTATTTTATCAAGCTGCAACAGAAAAGACCGAAATGTGATCTTATCGTCGGAGAAGATATCAAAAAGGTCTGAGCTCGACGAAGATGCTGAGCCT
GCGTCAAAGAAGACTGAGGAGACTCAGCTAAACGAGCCTGAAACAGAGGACTCACTTAATGAGGATCTCAACAGGCTGTTTCAAGCCACGGAAGATTTTGTTAGC
AGAGCATCCGAAGAAGCTGTCGTTTCCCAACGTCCAAAGACCAAGAAAGAAGCAATTGAGTTGGACATTATCAAAGCTTTTTGGAGCTCTAAGGACATTGGATGG
ATTGATGTTGAAGCCTATGGAAAATCAGGGGGAATGCTCACGATGTGGGAAGAAAGTGAAGTATCAGTCCTAGAATCCCTTAAAGGAGGATACTCACTCTCGGTT
AAATGCAAAACATTAAGCAAAAAAGTGTGTTGGGTGACTAATATCTACAGACCTACACCTACCGATTACAAAGAAAGAAGATTCATATGGCCGGAACTAGCTTCT
CTCTCAGCTTACTGCACAGAGGACCAAAGAGAAGGAAGAGGTATGTCTTGTTCACTGTTGGACACATTTCTTGTATCCAACAACTGGGAGGAGGCTTTGGAAGAC
ACAAGAGTGACAAGCCAAGCAAGATTATACTCAGATCATTTTCCCATCCTATTAACAGCAAGTGTATTTGAAATGGGACCCTCTCCATTTCGATTTTGTCATAGT
TGGCTACTAAATCATCAGAATTGTAGAATCATTGAAAGATCCATAGTCATGAGCAATCATCATGGATGGGCAGGCTTCATTATATTCTCAAAATTAAGATCAATA
AAAGAATCAGTGAAAAAGAGGCAAACGGAGTATCGTGGAAAGCAACGGATGCAGGAAGAGAAAATATTAGAATTTCTTGGCCAAGAAGAATCTAGAGCTGAATCA
ATAGATACCCCCTCAATAGAGATGGATATGAGATCCTCCCTCAAGGCTGAGTTGATGAGCATCTATAGAGTTGAGGAAAGGAACCTTATTCAAAAAAGTAAACTC
AACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGTAGTTAGTTGCAAGATCAATGGAATTTTCTTCTGCACTTGGTTCGAAGATGGGAACTTCATAGTCGAAGATATGGAATCGAATTCTTCTGTTTTTGCA
CCAGATAAACAGAGCGAGTACCATTGCTGGATCCGAAAGGAAAATGAAGTGTTTAAGGAAGATTTTGCCAACTTGTGGGTTGTATCAAGACTATTCGCGTTCGAT
GAATGGAAAGATATTGTCAGATTTTTGGAAGATTTTTATCAAATCAAGGTCAGTATTAACCCTCTGTTTGCCGATAAGGCTCTGATTAAAGTATCTCAATGTAAT
TTAAAAGAGATAATCGAAACGCCTGGAAAATGGTTTGACTATCGTAAATTCCATCTTCTATTTGAAAAATGGAATTCAATTCAAGATAGTAGGCCTACTTGTATC
AAAGGTTATGGTGGATGGTTGGTTATAAGAAATCTGCCGTTGGAATATTGGAACAGAGCCACCTTTGAAGCCATTGGCTCCCACTTTGGAGTAGCAAAGGATGAA
GATGTTAGTTGTATTTTATCAAGCTGCAACAGAAAAGACCGAAATGTGATCTTATCGTCGGAGAAGATATCAAAAAGGTCTGAGCTCGACGAAGATGCTGAGCCT
GCGTCAAAGAAGACTGAGGAGACTCAGCTAAACGAGCCTGAAACAGAGGACTCACTTAATGAGGATCTCAACAGGCTGTTTCAAGCCACGGAAGATTTTGTTAGC
AGAGCATCCGAAGAAGCTGTCGTTTCCCAACGTCCAAAGACCAAGAAAGAAGCAATTGAGTTGGACATTATCAAAGCTTTTTGGAGCTCTAAGGACATTGGATGG
ATTGATGTTGAAGCCTATGGAAAATCAGGGGGAATGCTCACGATGTGGGAAGAAAGTGAAGTATCAGTCCTAGAATCCCTTAAAGGAGGATACTCACTCTCGGTT
AAATGCAAAACATTAAGCAAAAAAGTGTGTTGGGTGACTAATATCTACAGACCTACACCTACCGATTACAAAGAAAGAAGATTCATATGGCCGGAACTAGCTTCT
CTCTCAGCTTACTGCACAGAGGACCAAAGAGAAGGAAGAGGTATGTCTTGTTCACTGTTGGACACATTTCTTGTATCCAACAACTGGGAGGAGGCTTTGGAAGAC
ACAAGAGTGACAAGCCAAGCAAGATTATACTCAGATCATTTTCCCATCCTATTAACAGCAAGTGTATTTGAAATGGGACCCTCTCCATTTCGATTTTGTCATAGT
TGGCTACTAAATCATCAGAATTGTAGAATCATTGAAAGATCCATAGTCATGAGCAATCATCATGGATGGGCAGGCTTCATTATATTCTCAAAATTAAGATCAATA
AAAGAATCAGTGAAAAAGAGGCAAACGGAGTATCGTGGAAAGCAACGGATGCAGGAAGAGAAAATATTAGAATTTCTTGGCCAAGAAGAATCTAGAGCTGAATCA
ATAGATACCCCCTCAATAGAGATGGATATGAGATCCTCCCTCAAGGCTGAGTTGATGAGCATCTATAGAGTTGAGGAAAGGAACCTTATTCAAAAAAGTAAACTC
AACTAG
Protein sequenceShow/hide protein sequence
MEVVSCKINGIFFCTWFEDGNFIVEDMESNSSVFAPDKQSEYHCWIRKENEVFKEDFANLWVVSRLFAFDEWKDIVRFLEDFYQIKVSINPLFADKALIKVSQCN
LKEIIETPGKWFDYRKFHLLFEKWNSIQDSRPTCIKGYGGWLVIRNLPLEYWNRATFEAIGSHFGVAKDEDVSCILSSCNRKDRNVILSSEKISKRSELDEDAEP
ASKKTEETQLNEPETEDSLNEDLNRLFQATEDFVSRASEEAVVSQRPKTKKEAIELDIIKAFWSSKDIGWIDVEAYGKSGGMLTMWEESEVSVLESLKGGYSLSV
KCKTLSKKVCWVTNIYRPTPTDYKERRFIWPELASLSAYCTEDQREGRGMSCSLLDTFLVSNNWEEALEDTRVTSQARLYSDHFPILLTASVFEMGPSPFRFCHS
WLLNHQNCRIIERSIVMSNHHGWAGFIIFSKLRSIKESVKKRQTEYRGKQRMQEEKILEFLGQEESRAESIDTPSIEMDMRSSLKAELMSIYRVEERNLIQKSKL
N