CuGenDBv2

Clc02G12180 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc02G12180
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Reverse transcriptase domain-containing protein
Genome location: ClcChr02:23871427..23874738
RNA-Seq Expression: Clc02G12180
Synteny: Clc02G12180
Gene Ontology terms:
GO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0016020 - membrane (cellular component)
GO:0016787 - hydrolase activity (molecular function)
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
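
The genome location string follows a chromosome:start..end pattern. A minimal Python sketch for splitting it into its parts and computing the locus span is given here. The function name `parse_location` is hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the page does not state explicitly.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end, length)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    chrom, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    # Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
    return chrom, start, end, end - start + 1

chrom, start, end, length = parse_location("ClcChr02:23871427..23874738")
print(chrom, start, end, length)
```

Under that assumption the locus spans 3312 bp on ClcChr02.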


Homology
GenBank top hits (e value, % identity, alignment)
KAA0063979.1 hypothetical protein E6C27_scaffold616G001380 [Cucumis melo var. makuwa] (e value: 5.5e-31; % identity: 36.82)
Query:  MEVVSCKINGIFFCTWFEDGNFIVEDMESNSSVFAPDK-------------QSEYHCWIRKE-------NEVFKEDFANLWVV-----------SRLFAF
        MEV S KI    +C W +  +F +ED++   ++                  Q E HC+  K          + K   +  W+V             LF F
Subjt:  MEVVSCKINGIFFCTWFEDGNFIVEDMESNSSVFAPDK-------------QSEYHCWIRKE-------NEVFKEDFANLWVV-----------SRLFAF

Query:  DEWKDIVRFLEDFYQIKVSINPLFADKALIKVSQCNLKEIIETPGKWFDYRKFHLLFEKWNSIQDSRPTCIKGYGGWLVIRNLPLEYWNRATFEAIGSHF
        D+W +I  FLED +Q+K   NPLFADKA++K+ Q N++++    GKW+DY K+HLLFEKW   + S P+ IKG+ GWL I+N PL+ W     E IG+HF
Subjt:  DEWKDIVRFLEDFYQIKVSINPLFADKALIKVSQCNLKEIIETPGKWFDYRKFHLLFEKWNSIQDSRPTCIKGYGGWLVIRNLPLEYWNRATFEAIGSHF

Query:  G
        G
Subjt:  G

TYJ98683.1 hypothetical protein E5676_scaffold429G00120 [Cucumis melo var. makuwa] (e value: 4.5e-41; % identity: 40.25)
Query:  KTKKEAIELDIIKAFWSSKDIGWIDVEAYGKSGGMLTMWEESEVSVLESLKGGYSLSVKCKTLSKKVCWVTNIYRPTPTDYKERRFIWPELASLSAYCT-
        + + + I++ +IK+ WSSKDIGW  VE++G+ GG+LTMW+ S++ V+E+LKGGYSLS+   T  KK CW+TN+Y   P DY+ERRF+W  L SLS YCT 
Subjt:  KTKKEAIELDIIKAFWSSKDIGWIDVEAYGKSGGMLTMWEESEVSVLESLKGGYSLSVKCKTLSKKVCWVTNIYRPTPTDYKERRFIWPELASLSAYCT-

Query:  --------------------EDQ-----------------------------REGRGMSCSLLDTFLVSNNWEEALEDTRVTSQARLYSDHFPILLTAGV
                            E Q                             REG  +S SLLD F +   W+E  E++RV  +A   SDHFP+LL AG 
Subjt:  --------------------EDQ-----------------------------REGRGMSCSLLDTFLVSNNWEEALEDTRVTSQARLYSDHFPILLTAGV

Query:  FEMGPSPFRFCHSWLLNHQNCRIIERSIVMSNHHGWAGFII
         + GPSPFRF +SWL   +  RII+    +++   WAGF++
Subjt:  FEMGPSPFRFCHSWLLNHQNCRIIERSIVMSNHHGWAGFII

XP_038884535.1 DEAD-box ATP-dependent RNA helicase FANCM isoform X1 [Benincasa hispida] (e value: 4.6e-38; % identity: 51.25)
Query:  CSLLDTFLVSNNWEEALEDTRVTSQARLYSDHFPILLTAGVFEMGPSPFRFCHSWLLNHQNCRIIERSIVMSNHHGWAGFIIFSKLRSIKESVKKRQTEY
        CS   + LVS NW+E  ED+RV+ QAR  SDHFP+L  AG FE GPSPFRFC+SWL N + CRIIE S ++     WAGF ++S+LR +K+SVK+   E+
Subjt:  CSLLDTFLVSNNWEEALEDTRVTSQARLYSDHFPILLTAGVFEMGPSPFRFCHSWLLNHQNCRIIERSIVMSNHHGWAGFIIFSKLRSIKESVKKRQTEY

Query:  RGKQRMQEEKILEFLGQEESRAESIDTPSIEMDMRSSLKAELMSIYRVEERNLIQKSKLN
           Q+++EE +L+ + +++ +A++++  S E D+R SLKA+L+S+Y+ EER+LIQKSKLN
Subjt:  RGKQRMQEEKILEFLGQEESRAESIDTPSIEMDMRSSLKAELMSIYRVEERNLIQKSKLN

XP_038884536.1 DEAD-box ATP-dependent RNA helicase FANCM isoform X2 [Benincasa hispida] (e value: 4.6e-38; % identity: 51.25)
Query:  CSLLDTFLVSNNWEEALEDTRVTSQARLYSDHFPILLTAGVFEMGPSPFRFCHSWLLNHQNCRIIERSIVMSNHHGWAGFIIFSKLRSIKESVKKRQTEY
        CS   + LVS NW+E  ED+RV+ QAR  SDHFP+L  AG FE GPSPFRFC+SWL N + CRIIE S ++     WAGF ++S+LR +K+SVK+   E+
Subjt:  CSLLDTFLVSNNWEEALEDTRVTSQARLYSDHFPILLTAGVFEMGPSPFRFCHSWLLNHQNCRIIERSIVMSNHHGWAGFIIFSKLRSIKESVKKRQTEY

Query:  RGKQRMQEEKILEFLGQEESRAESIDTPSIEMDMRSSLKAELMSIYRVEERNLIQKSKLN
           Q+++EE +L+ + +++ +A++++  S E D+R SLKA+L+S+Y+ EER+LIQKSKLN
Subjt:  RGKQRMQEEKILEFLGQEESRAESIDTPSIEMDMRSSLKAELMSIYRVEERNLIQKSKLN

XP_038884537.1 DEAD-box ATP-dependent RNA helicase FANCM isoform X3 [Benincasa hispida] (e value: 4.6e-38; % identity: 51.25)
Query:  CSLLDTFLVSNNWEEALEDTRVTSQARLYSDHFPILLTAGVFEMGPSPFRFCHSWLLNHQNCRIIERSIVMSNHHGWAGFIIFSKLRSIKESVKKRQTEY
        CS   + LVS NW+E  ED+RV+ QAR  SDHFP+L  AG FE GPSPFRFC+SWL N + CRIIE S ++     WAGF ++S+LR +K+SVK+   E+
Subjt:  CSLLDTFLVSNNWEEALEDTRVTSQARLYSDHFPILLTAGVFEMGPSPFRFCHSWLLNHQNCRIIERSIVMSNHHGWAGFIIFSKLRSIKESVKKRQTEY

Query:  RGKQRMQEEKILEFLGQEESRAESIDTPSIEMDMRSSLKAELMSIYRVEERNLIQKSKLN
           Q+++EE +L+ + +++ +A++++  S E D+R SLKA+L+S+Y+ EER+LIQKSKLN
Subjt:  RGKQRMQEEKILEFLGQEESRAESIDTPSIEMDMRSSLKAELMSIYRVEERNLIQKSKLN

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T996 Ulp1-like peptidase (e value: 9.7e-26; % identity: 53.12)
Query:  IVRFLEDFYQIKVSINPLFADKALIKVSQCNLKEIIETPGKWFDYRKFHLLFEKWNSIQDSRPTCIKGYGGWLVIRNLPLEYWNRATFEAIGSHFG
        I+ FLED +++K++INP F ++A  K++   ++ ++ T  KWF+Y KFHLLFEKW+ I  SRP+ IKG+ GWL I+NLPL+ W RA FE IG+HFG
Subjt:  IVRFLEDFYQIKVSINPLFADKALIKVSQCNLKEIIETPGKWFDYRKFHLLFEKWNSIQDSRPTCIKGYGGWLVIRNLPLEYWNRATFEAIGSHFG

A0A5A7UR38 Transposon TX1 uncharacterized (e value: 9.7e-26; % identity: 57.43)
Query:  REGRGMSCSLLDTFLVSNNWEEALEDTRVTSQARLYSDHFPILLTAGVFEMGPSPFRFCHSWLLNHQNCRIIERSIVMSNHHGWAGFIIFSKLRSIKESV
        REGR +S SLLD FLV+ +W+E+  DTR + + RL SDH PILL AG FE GPSPFRFC+SWLL+ +  +II  S+   NH  W GF+I+SKLRS+K ++
Subjt:  REGRGMSCSLLDTFLVSNNWEEALEDTRVTSQARLYSDHFPILLTAGVFEMGPSPFRFCHSWLLNHQNCRIIERSIVMSNHHGWAGFIIFSKLRSIKESV

Query:  K
        K
Subjt:  K

A0A5A7VEL7 Uncharacterized protein (e value: 2.6e-31; % identity: 36.82)
Query:  MEVVSCKINGIFFCTWFEDGNFIVEDMESNSSVFAPDK-------------QSEYHCWIRKE-------NEVFKEDFANLWVV-----------SRLFAF
        MEV S KI    +C W +  +F +ED++   ++                  Q E HC+  K          + K   +  W+V             LF F
Subjt:  MEVVSCKINGIFFCTWFEDGNFIVEDMESNSSVFAPDK-------------QSEYHCWIRKE-------NEVFKEDFANLWVV-----------SRLFAF

Query:  DEWKDIVRFLEDFYQIKVSINPLFADKALIKVSQCNLKEIIETPGKWFDYRKFHLLFEKWNSIQDSRPTCIKGYGGWLVIRNLPLEYWNRATFEAIGSHF
        D+W +I  FLED +Q+K   NPLFADKA++K+ Q N++++    GKW+DY K+HLLFEKW   + S P+ IKG+ GWL I+N PL+ W     E IG+HF
Subjt:  DEWKDIVRFLEDFYQIKVSINPLFADKALIKVSQCNLKEIIETPGKWFDYRKFHLLFEKWNSIQDSRPTCIKGYGGWLVIRNLPLEYWNRATFEAIGSHF

Query:  G
        G
Subjt:  G

A0A5D3BHE3 Uncharacterized protein (e value: 2.2e-41; % identity: 40.25)
Query:  KTKKEAIELDIIKAFWSSKDIGWIDVEAYGKSGGMLTMWEESEVSVLESLKGGYSLSVKCKTLSKKVCWVTNIYRPTPTDYKERRFIWPELASLSAYCT-
        + + + I++ +IK+ WSSKDIGW  VE++G+ GG+LTMW+ S++ V+E+LKGGYSLS+   T  KK CW+TN+Y   P DY+ERRF+W  L SLS YCT 
Subjt:  KTKKEAIELDIIKAFWSSKDIGWIDVEAYGKSGGMLTMWEESEVSVLESLKGGYSLSVKCKTLSKKVCWVTNIYRPTPTDYKERRFIWPELASLSAYCT-

Query:  --------------------EDQ-----------------------------REGRGMSCSLLDTFLVSNNWEEALEDTRVTSQARLYSDHFPILLTAGV
                            E Q                             REG  +S SLLD F +   W+E  E++RV  +A   SDHFP+LL AG 
Subjt:  --------------------EDQ-----------------------------REGRGMSCSLLDTFLVSNNWEEALEDTRVTSQARLYSDHFPILLTAGV

Query:  FEMGPSPFRFCHSWLLNHQNCRIIERSIVMSNHHGWAGFII
         + GPSPFRF +SWL   +  RII+    +++   WAGF++
Subjt:  FEMGPSPFRFCHSWLLNHQNCRIIERSIVMSNHHGWAGFII

A0A5D3E3A5 Ulp1-like peptidase (e value: 9.7e-26; % identity: 53.12)
Query:  IVRFLEDFYQIKVSINPLFADKALIKVSQCNLKEIIETPGKWFDYRKFHLLFEKWNSIQDSRPTCIKGYGGWLVIRNLPLEYWNRATFEAIGSHFG
        I+ FLED +++K++INP F ++A  K++   ++ ++ T  KWF+Y KFHLLFEKW+ I  SRP+ IKG+ GWL I+NLPL+ W RA FE IG+HFG
Subjt:  IVRFLEDFYQIKVSINPLFADKALIKVSQCNLKEIIETPGKWFDYRKFHLLFEKWNSIQDSRPTCIKGYGGWLVIRNLPLEYWNRATFEAIGSHFG
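
The % identity values can be cross-checked against an alignment's middle line. The Python sketch below does this for the Ulp1-like peptidase alignment above. It assumes the usual BLAST-style convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and assumes identity is taken over the full alignment length; `midline_identity` is a hypothetical helper, not part of CuGenDB.

```python
def midline_identity(midline):
    """Return (identical positions, alignment length, percent identity)."""
    # Assumption: letters in the midline mark identical residues;
    # '+' and spaces are counted as non-identical positions.
    identical = sum(ch.isalpha() for ch in midline)
    length = len(midline)
    return identical, length, 100.0 * identical / length

query = "IVRFLEDFYQIKVSINPLFADKALIKVSQCNLKEIIETPGKWFDYRKFHLLFEKWNSIQDSRPTCIKGYGGWLVIRNLPLEYWNRATFEAIGSHFG"
midline = "I+ FLED +++K++INP F ++A  K++   ++ ++ T  KWF+Y KFHLLFEKW+ I  SRP+ IKG+ GWL I+NLPL+ W RA FE IG+HFG"

# The midline must be exactly as long as the aligned query for the count to be meaningful.
assert len(midline) == len(query)
identical, length, pct = midline_identity(midline)
print(f"{identical}/{length} identical = {pct:.2f}%")
```

For this alignment the count comes to 51 identical positions out of 96, in line with the 53.12 reported for the hit.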

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGAAGTAGTTAGTTGCAAGATCAATGGAATTTTCTTCTGCACTTGGTTCGAAGATGGGAACTTCATAGTCGAAGATATGGAATCGAATTCTTCTGTTTTTGCACCAGA
TAAACAGAGCGAGTACCATTGCTGGATCCGAAAGGAAAATGAAGTGTTTAAGGAAGATTTTGCCAACTTGTGGGTTGTATCAAGACTATTCGCGTTCGATGAATGGAAAG
ATATTGTCAGATTTTTGGAAGATTTTTATCAAATCAAGGTCAGTATTAACCCTCTGTTTGCCGATAAGGCTCTGATTAAAGTATCTCAATGTAATTTAAAAGAGATAATC
GAAACGCCTGGAAAATGGTTTGACTATCGTAAATTCCATCTTCTATTTGAAAAATGGAATTCAATTCAAGATAGTAGGCCTACTTGTATCAAAGGTTATGGTGGATGGTT
GGTTATAAGAAATCTGCCGTTGGAATATTGGAACAGAGCCACCTTTGAAGCCATCGGCTCCCACTTTGGAGTAGCAAAGGATGAAGATGTTAGTTGTATTTTATCAAGCT
GCAACAGAAAAGACCGAAATGTGATCTTATCGTCGGAGAAGATATCAAAAAGGTCTGAGCTCGACGAAGATGCTGAGCCTGCGTCAAAGAAGACTGAGGAGACTCAGCTA
AACGAGCCTGAAACAGAGGACTCACTTAATGAGGATCTCAACAGGCTGTTTCAAGCCACGGAAGATTTTGTTAGCAGAGCATCCGAAGAAGCTGTCGTTTCCCAACGTCC
AAAGACCAAGAAAGAAGCAATTGAGTTGGACATTATCAAAGCTTTTTGGAGCTCTAAGGACATTGGATGGATTGATGTTGAAGCCTATGGAAAATCAGGGGGAATGCTCA
CGATGTGGGAAGAAAGTGAAGTATCAGTCCTAGAATCCCTTAAAGGAGGATACTCACTCTCGGTTAAATGCAAAACATTAAGCAAAAAAGTGTGTTGGGTGACTAATATC
TACAGACCTACACCTACCGATTACAAAGAAAGAAGATTCATATGGCCGGAACTAGCTTCTCTCTCAGCTTACTGCACAGAGGACCAAAGAGAAGGAAGAGGTATGTCTTG
TTCACTGTTGGACACATTTCTTGTATCCAACAACTGGGAGGAGGCTTTGGAAGACACAAGAGTGACAAGCCAAGCAAGATTATACTCAGATCATTTTCCCATCCTATTAA
CAGCAGGTGTATTTGAAATGGGACCCTCTCCATTTCGATTTTGTCATAGTTGGCTACTAAATCATCAGAATTGTAGAATCATTGAAAGATCCATAGTCATGAGCAATCAT
CATGGATGGGCAGGCTTCATTATATTCTCAAAATTAAGATCAATAAAAGAATCAGTGAAAAAGAGGCAAACGGAGTATCGTGGAAAGCAACGGATGCAGGAAGAGAAAAT
ATTAGAATTTCTTGGCCAAGAAGAATCTAGAGCTGAATCAATAGATACCCCCTCAATAGAGATGGATATGAGATCCTCCCTCAAGGCTGAGTTGATGAGCATCTATAGAG
TTGAGGAAAGGAACCTTATTCAAAAAAGTAAACTCAACTAG
mRNA sequence
ATGGAAGTAGTTAGTTGCAAGATCAATGGAATTTTCTTCTGCACTTGGTTCGAAGATGGGAACTTCATAGTCGAAGATATGGAATCGAATTCTTCTGTTTTTGCACCAGA
TAAACAGAGCGAGTACCATTGCTGGATCCGAAAGGAAAATGAAGTGTTTAAGGAAGATTTTGCCAACTTGTGGGTTGTATCAAGACTATTCGCGTTCGATGAATGGAAAG
ATATTGTCAGATTTTTGGAAGATTTTTATCAAATCAAGGTCAGTATTAACCCTCTGTTTGCCGATAAGGCTCTGATTAAAGTATCTCAATGTAATTTAAAAGAGATAATC
GAAACGCCTGGAAAATGGTTTGACTATCGTAAATTCCATCTTCTATTTGAAAAATGGAATTCAATTCAAGATAGTAGGCCTACTTGTATCAAAGGTTATGGTGGATGGTT
GGTTATAAGAAATCTGCCGTTGGAATATTGGAACAGAGCCACCTTTGAAGCCATCGGCTCCCACTTTGGAGTAGCAAAGGATGAAGATGTTAGTTGTATTTTATCAAGCT
GCAACAGAAAAGACCGAAATGTGATCTTATCGTCGGAGAAGATATCAAAAAGGTCTGAGCTCGACGAAGATGCTGAGCCTGCGTCAAAGAAGACTGAGGAGACTCAGCTA
AACGAGCCTGAAACAGAGGACTCACTTAATGAGGATCTCAACAGGCTGTTTCAAGCCACGGAAGATTTTGTTAGCAGAGCATCCGAAGAAGCTGTCGTTTCCCAACGTCC
AAAGACCAAGAAAGAAGCAATTGAGTTGGACATTATCAAAGCTTTTTGGAGCTCTAAGGACATTGGATGGATTGATGTTGAAGCCTATGGAAAATCAGGGGGAATGCTCA
CGATGTGGGAAGAAAGTGAAGTATCAGTCCTAGAATCCCTTAAAGGAGGATACTCACTCTCGGTTAAATGCAAAACATTAAGCAAAAAAGTGTGTTGGGTGACTAATATC
TACAGACCTACACCTACCGATTACAAAGAAAGAAGATTCATATGGCCGGAACTAGCTTCTCTCTCAGCTTACTGCACAGAGGACCAAAGAGAAGGAAGAGGTATGTCTTG
TTCACTGTTGGACACATTTCTTGTATCCAACAACTGGGAGGAGGCTTTGGAAGACACAAGAGTGACAAGCCAAGCAAGATTATACTCAGATCATTTTCCCATCCTATTAA
CAGCAGGTGTATTTGAAATGGGACCCTCTCCATTTCGATTTTGTCATAGTTGGCTACTAAATCATCAGAATTGTAGAATCATTGAAAGATCCATAGTCATGAGCAATCAT
CATGGATGGGCAGGCTTCATTATATTCTCAAAATTAAGATCAATAAAAGAATCAGTGAAAAAGAGGCAAACGGAGTATCGTGGAAAGCAACGGATGCAGGAAGAGAAAAT
ATTAGAATTTCTTGGCCAAGAAGAATCTAGAGCTGAATCAATAGATACCCCCTCAATAGAGATGGATATGAGATCCTCCCTCAAGGCTGAGTTGATGAGCATCTATAGAG
TTGAGGAAAGGAACCTTATTCAAAAAAGTAAACTCAACTAG
Protein sequence
MEVVSCKINGIFFCTWFEDGNFIVEDMESNSSVFAPDKQSEYHCWIRKENEVFKEDFANLWVVSRLFAFDEWKDIVRFLEDFYQIKVSINPLFADKALIKVSQCNLKEII
ETPGKWFDYRKFHLLFEKWNSIQDSRPTCIKGYGGWLVIRNLPLEYWNRATFEAIGSHFGVAKDEDVSCILSSCNRKDRNVILSSEKISKRSELDEDAEPASKKTEETQL
NEPETEDSLNEDLNRLFQATEDFVSRASEEAVVSQRPKTKKEAIELDIIKAFWSSKDIGWIDVEAYGKSGGMLTMWEESEVSVLESLKGGYSLSVKCKTLSKKVCWVTNI
YRPTPTDYKERRFIWPELASLSAYCTEDQREGRGMSCSLLDTFLVSNNWEEALEDTRVTSQARLYSDHFPILLTAGVFEMGPSPFRFCHSWLLNHQNCRIIERSIVMSNH
HGWAGFIIFSKLRSIKESVKKRQTEYRGKQRMQEEKILEFLGQEESRAESIDTPSIEMDMRSSLKAELMSIYRVEERNLIQKSKLN
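
As a consistency check, the CDS can be translated and compared with the protein sequence. The Python sketch below builds the standard genetic code (NCBI translation table 1) and translates the first 30 nucleotides of the CDS above. `translate` is a hypothetical helper, and applying the standard nuclear code to this gene is an assumption.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGAAGTAGTTAGTTGCAAGATCAATGGA"  # first 30 nt of the CDS sequence above
print(translate(cds_start))
```

The result can be compared with the first ten residues of the protein sequence (MEVVSCKING). Passing the full CDS with line breaks removed would check the whole record in the same way.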