CuGenDBv2

Clc07G07260 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc07G07260
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Reverse transcriptase
Genome location: ClcChr07:19427543..19430281
RNA-Seq Expression: Clc07G07260
Synteny: Clc07G07260
Gene Ontology terms:
GO:0006259 - DNA metabolic process (biological process)
GO:0016779 - nucleotidyltransferase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
GO:0140640 - catalytic activity, acting on a nucleic acid (molecular function)
InterPro domains: NA
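
The genome location field follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state its convention), the span of the locus could be computed as follows. `parse_location` and `span_length` are hypothetical helper names used only for illustration, not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(location):
    """Length of the locus in base pairs."""
    # Assumption: coordinates are 1-based and inclusive on both ends.
    _, start, end = parse_location(location)
    return end - start + 1


location = "ClcChr07:19427543..19430281"
print(parse_location(location))
print(span_length(location))
```

Under that assumption the locus spans 2,739 bp on ClcChr07.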


Homology
GenBank top hits | e value | % identity
ERN04558.1 hypothetical protein AMTR_s04509p00000770, partial [Amborella trichopoda] | 1.8e-24 | 64.29
Query:  ESQEPWYADIVNYLVCKQWPEDFNAQQRKRLQHESKFYCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHEEPYGGHFGGQRT
        +++ PWYADIVNYLV    P DF  QQ K+  H+SKFY WDEPYLY+  PD I+RRCVP  E  SIL  CH  PYGGHFGGQRT
Subjt:  ESQEPWYADIVNYLVCKQWPEDFNAQQRKRLQHESKFYCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHEEPYGGHFGGQRT

WP_217833161.1 DDE-type integrase/transposase/recombinase, partial [Synechococcus sp. PCC 7002] | 1.2e-41 | 81.72
Query:  MNAESQEPWYADIVNYLVCKQWPEDFNAQQRKRLQHESKFYCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHEEPYGGHFGGQRTWCRELK
        M A+SQEPWY DIVNYLVC QWPE+FNA Q+K+L+HESKFYCWDEPYLYRLG DHILRRCVPEYETHSIL+SCHE PYGGHFGGQRT  + L+
Subjt:  MNAESQEPWYADIVNYLVCKQWPEDFNAQQRKRLQHESKFYCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHEEPYGGHFGGQRTWCRELK

XP_038888126.1 uncharacterized protein LOC120078022 [Benincasa hispida] | 1.9e-26 | 62.79
Query:  PWYADIVNYLVCKQWPEDFNAQQRKRLQHESKFYCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHEEPYGGHFGGQRTWCRELK
        PWY++IVN++VC Q PE +  QQ+K+L+H+ KFY WD P+LYRL PDHILRRCVPE+E   IL  CH+ PYGGHFGGQRT  + L+
Subjt:  PWYADIVNYLVCKQWPEDFNAQQRKRLQHESKFYCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHEEPYGGHFGGQRTWCRELK

XP_038891742.1 uncharacterized protein LOC120081139 [Benincasa hispida] | 2.5e-26 | 60
Query:  ESQEPWYADIVNYLVCKQWPEDFNAQQRKRLQHESKFYCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHEEPYGGHFGGQRTWCRELK
        E +EPWY DIVNYL  KQ+P +FN+QQ+KRL H++KFY W E +LY+ GPD I+RRC+PE ET   L  CH+ PYG HFGGQRT  + L+
Subjt:  ESQEPWYADIVNYLVCKQWPEDFNAQQRKRLQHESKFYCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHEEPYGGHFGGQRTWCRELK

XP_038902405.1 uncharacterized protein LOC120089044 [Benincasa hispida] | 4.7e-25 | 66.25
Query:  PWYADIVNYLVCKQWPEDFNAQQRKRLQHESKFYCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHEEPYGGHFGGQRT
        PWY DIVN+LVCKQ+PE++ + Q+KRL HE KFY WD+P LY+ GPD ILR  VP+   H IL  CHE PYGGHFGGQRT
Subjt:  PWYADIVNYLVCKQWPEDFNAQQRKRLQHESKFYCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHEEPYGGHFGGQRT

TrEMBL top hits | e value | % identity
A0A2G9FWY3 Reverse transcriptase | 2.4e-22 | 55.06
Query:  SQEPWYADIVNYLVCKQWPEDFNAQQRKRLQHESKFYCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHEEPYGGHFGGQRTWCRELK
        S  PWYADIVNYL C   P D +AQQ+K+   +++ Y WD+P+L++ GPD+ILRRCVPE E + IL  CH  PYGGHF G RT  + L+
Subjt:  SQEPWYADIVNYLVCKQWPEDFNAQQRKRLQHESKFYCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHEEPYGGHFGGQRTWCRELK

A0A2G9HWC5 DNA-directed DNA polymerase | 2.4e-22 | 55.06
Query:  SQEPWYADIVNYLVCKQWPEDFNAQQRKRLQHESKFYCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHEEPYGGHFGGQRTWCRELK
        S  PWYADIVNYL C   P D +AQQ+K+   +++ Y WD+P+L++ GPD+ILRRCVPE E + IL  CH  PYGGHF G RT  + L+
Subjt:  SQEPWYADIVNYLVCKQWPEDFNAQQRKRLQHESKFYCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHEEPYGGHFGGQRTWCRELK

A0A2G9HYA0 Reverse transcriptase | 4.0e-22 | 53.93
Query:  SQEPWYADIVNYLVCKQWPEDFNAQQRKRLQHESKFYCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHEEPYGGHFGGQRTWCRELK
        S+ PWYADIVNYL C   P D + QQ+K+   +++ Y WD+P+L++ GPD+ILRRCVPE E + IL  CH  PYGGHF G RT  + L+
Subjt:  SQEPWYADIVNYLVCKQWPEDFNAQQRKRLQHESKFYCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHEEPYGGHFGGQRTWCRELK

A0A2K3NJZ5 Integrase catalytic domain-containing protein (Fragment) | 1.8e-22 | 44.09
Query:  PWYADIVNYLVCKQWPEDFNAQQRKRLQHESKFYCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHEEPYGGHFGGQRTWCRELKQSPALKEKLIIGGG
        PW+AD  NY+V K  P DF +QQRK+  H+ KFY WDEP+LY+ G D +LRRCVPE E   +L  CH+  YGGHF G RT  + L+              
Subjt:  PWYADIVNYLVCKQWPEDFNAQQRKRLQHESKFYCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHEEPYGGHFGGQRTWCRELKQSPALKEKLIIGGG

Query:  RNVLGLPCVGKVVMLYVSTDLRCQALG
           L  P + K    YV    RCQ  G
Subjt:  RNVLGLPCVGKVVMLYVSTDLRCQALG

U5CXV9 RT_RNaseH domain-containing protein (Fragment) | 8.7e-25 | 64.29
Query:  ESQEPWYADIVNYLVCKQWPEDFNAQQRKRLQHESKFYCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHEEPYGGHFGGQRT
        +++ PWYADIVNYLV    P DF  QQ K+  H+SKFY WDEPYLY+  PD I+RRCVP  E  SIL  CH  PYGGHFGGQRT
Subjt:  ESQEPWYADIVNYLVCKQWPEDFNAQQRKRLQHESKFYCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHEEPYGGHFGGQRT

SwissProt top hits: No hits found

Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGAATGCAGAGAGTCAGGAACCATGGTATGCAGACATAGTGAATTACTTGGTCTGCAAACAATGGCCTGAAGATTTCAACGCTCAACAAAGGAAGAGGCTCCAA
CACGAAAGTAAGTTCTACTGTTGGGACGAGCCATATCTATACAGACTTGGCCCGGACCACATCCTGCGTCGATGCGTTCCAGAATATGAAACGCATAGCATTCTG
AGAAGTTGTCATGAAGAACCTTACGGAGGACACTTTGGGGGACAGAGAACATGGTGCCGTGAGCTGAAACAGTCACCCGCGTTGAAGGAGAAGTTGATCATTGGC
GGGGGAAGAAACGTACTTGGCTTACCATGCGTTGGTAAGGTTGTCATGCTTTATGTAAGCACTGATTTGCGTTGCCAAGCGTTGGGGTGCAGCCATGCGTTTGGG
ACTATGCGTTTCAAAGGAAGAAGTGGCCGGGAGCGAGAACAGAATGGTCTTGACGCGACGAAAGTAGTATTGAGTGACGGAAGGGTATGCGTCAAAAATAGGAAG
CGTGGAGGATGGCACTATGCGTTGGGTGTGTGCCTGATAATTGGAGTAGAACGTGAAGAGGAAGCCGAGTTTAGAGCTGAAAATTTGAGAAAAACAGAGTTTACA
AGATTATCTCGCAATCCGGAACAGCAGGAGATGATTCCGGAATACCGTGAGAACGAGGAGGAGTTAAGCCTTGAAACTTTGAGTGCTCAAAGACACGAGAAGGCA
TGTGAGTTTCCCCGGGACCCAGTACTCAAGGACATGAGAAGGTACATGGGCAACCCGTCATATTCCTATGTGCACATAGAGGTTGATTTGGCTGGTGCGTTGAGT
TTGCTCCGCATTGAGCCAGGGATGTTTTCGACGTTGGGAGTGTTAAGCTTTGCTCCACCCCAACTAAGGCTGTTAGCTTATGTTTTCCAATGTTTTCCTTTTCTA
TCCCCAGGTAGCAAGGACGTTCCCGGCGCTTAG
mRNA sequence
ATGAATGCAGAGAGTCAGGAACCATGGTATGCAGACATAGTGAATTACTTGGTCTGCAAACAATGGCCTGAAGATTTCAACGCTCAACAAAGGAAGAGGCTCCAA
CACGAAAGTAAGTTCTACTGTTGGGACGAGCCATATCTATACAGACTTGGCCCGGACCACATCCTGCGTCGATGCGTTCCAGAATATGAAACGCATAGCATTCTG
AGAAGTTGTCATGAAGAACCTTACGGAGGACACTTTGGGGGACAGAGAACATGGTGCCGTGAGCTGAAACAGTCACCCGCGTTGAAGGAGAAGTTGATCATTGGC
GGGGGAAGAAACGTACTTGGCTTACCATGCGTTGGTAAGGTTGTCATGCTTTATGTAAGCACTGATTTGCGTTGCCAAGCGTTGGGGTGCAGCCATGCGTTTGGG
ACTATGCGTTTCAAAGGAAGAAGTGGCCGGGAGCGAGAACAGAATGGTCTTGACGCGACGAAAGTAGTATTGAGTGACGGAAGGGTATGCGTCAAAAATAGGAAG
CGTGGAGGATGGCACTATGCGTTGGGTGTGTGCCTGATAATTGGAGTAGAACGTGAAGAGGAAGCCGAGTTTAGAGCTGAAAATTTGAGAAAAACAGAGTTTACA
AGATTATCTCGCAATCCGGAACAGCAGGAGATGATTCCGGAATACCGTGAGAACGAGGAGGAGTTAAGCCTTGAAACTTTGAGTGCTCAAAGACACGAGAAGGCA
TGTGAGTTTCCCCGGGACCCAGTACTCAAGGACATGAGAAGGTACATGGGCAACCCGTCATATTCCTATGTGCACATAGAGGTTGATTTGGCTGGTGCGTTGAGT
TTGCTCCGCATTGAGCCAGGGATGTTTTCGACGTTGGGAGTGTTAAGCTTTGCTCCACCCCAACTAAGGCTGTTAGCTTATGTTTTCCAATGTTTTCCTTTTCTA
TCCCCAGGTAGCAAGGACGTTCCCGGCGCTTAG
Protein sequence
MNAESQEPWYADIVNYLVCKQWPEDFNAQQRKRLQHESKFYCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHEEPYGGHFGGQRTWCRELKQSPALKEKLIIG
GGRNVLGLPCVGKVVMLYVSTDLRCQALGCSHAFGTMRFKGRSGREREQNGLDATKVVLSDGRVCVKNRKRGGWHYALGVCLIIGVEREEEAEFRAENLRKTEFT
RLSRNPEQQEMIPEYRENEEELSLETLSAQRHEKACEFPRDPVLKDMRRYMGNPSYSYVHIEVDLAGALSLLRIEPGMFSTLGVLSFAPPQLRLLAYVFQCFPFL
SPGSKDVPGA
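
The protein sequence above should correspond to a codon-by-codon translation of the CDS. The sketch below illustrates that relationship on the first 30 nt and the final 33 nt of the CDS, assuming the standard genetic code (NCBI translation table 1); the record does not say which table the database used, and `translate` is a hypothetical helper written for this example.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), assumed here.
# Codons are enumerated with bases in T, C, A, G order, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Fragments copied from the CDS sequence in this record.
cds_start = "ATGAATGCAGAGAGTCAGGAACCATGGTAT"
cds_end = "TCCCCAGGTAGCAAGGACGTTCCCGGCGCTTAG"

print(translate(cds_start))  # MNAESQEPWY
print(translate(cds_end))    # SPGSKDVPGA
```

Both outputs match the first and last ten residues of the protein sequence listed in the record.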