; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G16610 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G16610
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionReverse transcriptase
Genome locationClcChr09:26288290..26289105
RNA-Seq ExpressionClc09G16610
SyntenyClc09G16610
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037220.1 reverse transcriptase [Cucumis melo var. makuwa]1.8e-6463.59Show/hide
Query:  MSTEKQLTKSQVDRLVEIEEQMLYLREVLDAIRFLEKRVQEISEKADGIDAVAGRLDGMPVKELMLRVETLETKAT--RPSSFKLGDSSSGFVAHMEERV
        MS+     K+Q DRLVE+EEQMLYL EV D+IR+LE R++EISEK + IDAVAGR++G P++ELM RV+ LET     R  +++ GDSS+G VAH+EERV
Subjt:  MSTEKQLTKSQVDRLVEIEEQMLYLREVLDAIRFLEKRVQEISEKADGIDAVAGRLDGMPVKELMLRVETLETKAT--RPSSFKLGDSSSGFVAHMEERV

Query:  KKLDSSQKAIIQMVTDLSEDFRAALDVVRTEIVDVSTRVNLTMRVVGNRTPAGGAVQLNRVKIFEPKPFCGARDAKALENFIFDMEQYFRTTNTTVEESK
        ++LDSSQK +++M+  +SEDFRA LDVVR EI DV+ R++LTMR + N+ PAGGA+ ++RVKI EPKPFCGARDAKALEN+IFD+EQYFR TNT  EE+K
Subjt:  KKLDSSQKAIIQMVTDLSEDFRAALDVVRTEIVDVSTRVNLTMRVVGNRTPAGGAVQLNRVKIFEPKPFCGARDAKALENFIFDMEQYFRTTNTTVEESK

Query:  ITLATM
        +TLATM
Subjt:  ITLATM

KAA0037573.1 reverse transcriptase [Cucumis melo var. makuwa]1.4e-6464.08Show/hide
Query:  MSTEKQLTKSQVDRLVEIEEQMLYLREVLDAIRFLEKRVQEISEKADGIDAVAGRLDGMPVKELMLRVETLETKAT--RPSSFKLGDSSSGFVAHMEERV
        MS+   L K+Q DRLVE+EEQMLYL EV D+IR+LE R++EISEK + IDAVAGR++G P++ELM RV+ LET     R  +++ GDSS+G VAH+EERV
Subjt:  MSTEKQLTKSQVDRLVEIEEQMLYLREVLDAIRFLEKRVQEISEKADGIDAVAGRLDGMPVKELMLRVETLETKAT--RPSSFKLGDSSSGFVAHMEERV

Query:  KKLDSSQKAIIQMVTDLSEDFRAALDVVRTEIVDVSTRVNLTMRVVGNRTPAGGAVQLNRVKIFEPKPFCGARDAKALENFIFDMEQYFRTTNTTVEESK
        ++LDSSQK  ++M+  +SEDFRA LDVVR EI DV+ R++LTMR + N+ PAGGA+ ++RVKI EPKPFCGARDAKALEN+IFD+EQYFR TNT  EE+K
Subjt:  KKLDSSQKAIIQMVTDLSEDFRAALDVVRTEIVDVSTRVNLTMRVVGNRTPAGGAVQLNRVKIFEPKPFCGARDAKALENFIFDMEQYFRTTNTTVEESK

Query:  ITLATM
        +TLATM
Subjt:  ITLATM

TYK02461.1 uncharacterized protein E5676_scaffold1738G00820 [Cucumis melo var. makuwa]8.0e-6562.75Show/hide
Query:  MSTEKQLTKSQVDRLVEIEEQMLYLREVLDAIRFLEKRVQEISEKADGIDAVAGRLDGMPVKELMLRVETLETKATRPSSFKLGDSSSGFVAHMEERVKK
        MS+     K+Q DRLVE+EEQMLYL EV D+I +LE R+ EISEK D IDAVAG ++G+P++EL+ RV+TLE    R  +++  DSSSGFVAHME RV +
Subjt:  MSTEKQLTKSQVDRLVEIEEQMLYLREVLDAIRFLEKRVQEISEKADGIDAVAGRLDGMPVKELMLRVETLETKATRPSSFKLGDSSSGFVAHMEERVKK

Query:  LDSSQKAIIQMVTDLSEDFRAALDVVRTEIVDVSTRVNLTMRVVGNRTPAGGAVQLNRVKIFEPKPFCGARDAKALENFIFDMEQYFRTTNTTVEESKIT
        LDSSQK +++M+ D+SEDFRA LDV+R EIVD +TR+NLTMR + N+ P GGAV + +VK+ EPKPFCG RDAKALENFIFD+EQYF+ TNT  EE+K+T
Subjt:  LDSSQKAIIQMVTDLSEDFRAALDVVRTEIVDVSTRVNLTMRVVGNRTPAGGAVQLNRVKIFEPKPFCGARDAKALENFIFDMEQYFRTTNTTVEESKIT

Query:  LATM
        L TM
Subjt:  LATM

TYK03099.1 reverse transcriptase [Cucumis melo var. makuwa]3.6e-6564.08Show/hide
Query:  MSTEKQLTKSQVDRLVEIEEQMLYLREVLDAIRFLEKRVQEISEKADGIDAVAGRLDGMPVKELMLRVETLETKAT--RPSSFKLGDSSSGFVAHMEERV
        MS+   L K+Q DRLVE+EEQMLYL EV D+IR+LE R++EISEK + IDAVAGR++G P++ELM RV+ LET     R  +++ GDSS+G VAH+EERV
Subjt:  MSTEKQLTKSQVDRLVEIEEQMLYLREVLDAIRFLEKRVQEISEKADGIDAVAGRLDGMPVKELMLRVETLETKAT--RPSSFKLGDSSSGFVAHMEERV

Query:  KKLDSSQKAIIQMVTDLSEDFRAALDVVRTEIVDVSTRVNLTMRVVGNRTPAGGAVQLNRVKIFEPKPFCGARDAKALENFIFDMEQYFRTTNTTVEESK
        ++LDSSQK +++M+  +SEDFRA LDVVR EI DV+ R++LTMR + N+ PAGGA+ ++RVKI EPKPFCGARDAKALEN+IFD+EQYFR TNT  EE+K
Subjt:  KKLDSSQKAIIQMVTDLSEDFRAALDVVRTEIVDVSTRVNLTMRVVGNRTPAGGAVQLNRVKIFEPKPFCGARDAKALENFIFDMEQYFRTTNTTVEESK

Query:  ITLATM
        +TLATM
Subjt:  ITLATM

TYK22948.1 uncharacterized protein E5676_scaffold386G00340 [Cucumis melo var. makuwa]1.0e-6463.11Show/hide
Query:  MSTEKQLTKSQVDRLVEIEEQMLYLREVLDAIRFLEKRVQEISEKADGIDAVAGRLDGMPVKELMLRVETLE--TKATRPSSFKLGDSSSGFVAHMEERV
        MS+     K+Q DRLVEIEEQMLYL EV D+IR+LE RV+EISEKA+ IDAVAGR++G+P++EL+ RV+ LE  T A R  +++ G+SSSGF AHMEERV
Subjt:  MSTEKQLTKSQVDRLVEIEEQMLYLREVLDAIRFLEKRVQEISEKADGIDAVAGRLDGMPVKELMLRVETLE--TKATRPSSFKLGDSSSGFVAHMEERV

Query:  KKLDSSQKAIIQMVTDLSEDFRAALDVVRTEIVDVSTRVNLTMRVVGNRTPAGGAVQLNRVKIFEPKPFCGARDAKALENFIFDMEQYFRTTNTTVEESK
         +LD++QK +++M+  +SEDFRA LDVVR EI DV+ R++LTMR + N+ PAGGA+ +++VK+ EPKPFCGARDAKALEN+IFD+EQYF+ TNT  EE+K
Subjt:  KKLDSSQKAIIQMVTDLSEDFRAALDVVRTEIVDVSTRVNLTMRVVGNRTPAGGAVQLNRVKIFEPKPFCGARDAKALENFIFDMEQYFRTTNTTVEESK

Query:  ITLATM
        +TLATM
Subjt:  ITLATM

TrEMBL top hitse value%identityAlignment
A0A5A7T3M3 Reverse transcriptase6.6e-6564.08Show/hide
Query:  MSTEKQLTKSQVDRLVEIEEQMLYLREVLDAIRFLEKRVQEISEKADGIDAVAGRLDGMPVKELMLRVETLETKAT--RPSSFKLGDSSSGFVAHMEERV
        MS+   L K+Q DRLVE+EEQMLYL EV D+IR+LE R++EISEK + IDAVAGR++G P++ELM RV+ LET     R  +++ GDSS+G VAH+EERV
Subjt:  MSTEKQLTKSQVDRLVEIEEQMLYLREVLDAIRFLEKRVQEISEKADGIDAVAGRLDGMPVKELMLRVETLETKAT--RPSSFKLGDSSSGFVAHMEERV

Query:  KKLDSSQKAIIQMVTDLSEDFRAALDVVRTEIVDVSTRVNLTMRVVGNRTPAGGAVQLNRVKIFEPKPFCGARDAKALENFIFDMEQYFRTTNTTVEESK
        ++LDSSQK  ++M+  +SEDFRA LDVVR EI DV+ R++LTMR + N+ PAGGA+ ++RVKI EPKPFCGARDAKALEN+IFD+EQYFR TNT  EE+K
Subjt:  KKLDSSQKAIIQMVTDLSEDFRAALDVVRTEIVDVSTRVNLTMRVVGNRTPAGGAVQLNRVKIFEPKPFCGARDAKALENFIFDMEQYFRTTNTTVEESK

Query:  ITLATM
        +TLATM
Subjt:  ITLATM

A0A5D3BRU5 Uncharacterized protein3.9e-6562.75Show/hide
Query:  MSTEKQLTKSQVDRLVEIEEQMLYLREVLDAIRFLEKRVQEISEKADGIDAVAGRLDGMPVKELMLRVETLETKATRPSSFKLGDSSSGFVAHMEERVKK
        MS+     K+Q DRLVE+EEQMLYL EV D+I +LE R+ EISEK D IDAVAG ++G+P++EL+ RV+TLE    R  +++  DSSSGFVAHME RV +
Subjt:  MSTEKQLTKSQVDRLVEIEEQMLYLREVLDAIRFLEKRVQEISEKADGIDAVAGRLDGMPVKELMLRVETLETKATRPSSFKLGDSSSGFVAHMEERVKK

Query:  LDSSQKAIIQMVTDLSEDFRAALDVVRTEIVDVSTRVNLTMRVVGNRTPAGGAVQLNRVKIFEPKPFCGARDAKALENFIFDMEQYFRTTNTTVEESKIT
        LDSSQK +++M+ D+SEDFRA LDV+R EIVD +TR+NLTMR + N+ P GGAV + +VK+ EPKPFCG RDAKALENFIFD+EQYF+ TNT  EE+K+T
Subjt:  LDSSQKAIIQMVTDLSEDFRAALDVVRTEIVDVSTRVNLTMRVVGNRTPAGGAVQLNRVKIFEPKPFCGARDAKALENFIFDMEQYFRTTNTTVEESKIT

Query:  LATM
        L TM
Subjt:  LATM

A0A5D3BYE6 Reverse transcriptase1.7e-6564.08Show/hide
Query:  MSTEKQLTKSQVDRLVEIEEQMLYLREVLDAIRFLEKRVQEISEKADGIDAVAGRLDGMPVKELMLRVETLETKAT--RPSSFKLGDSSSGFVAHMEERV
        MS+   L K+Q DRLVE+EEQMLYL EV D+IR+LE R++EISEK + IDAVAGR++G P++ELM RV+ LET     R  +++ GDSS+G VAH+EERV
Subjt:  MSTEKQLTKSQVDRLVEIEEQMLYLREVLDAIRFLEKRVQEISEKADGIDAVAGRLDGMPVKELMLRVETLETKAT--RPSSFKLGDSSSGFVAHMEERV

Query:  KKLDSSQKAIIQMVTDLSEDFRAALDVVRTEIVDVSTRVNLTMRVVGNRTPAGGAVQLNRVKIFEPKPFCGARDAKALENFIFDMEQYFRTTNTTVEESK
        ++LDSSQK +++M+  +SEDFRA LDVVR EI DV+ R++LTMR + N+ PAGGA+ ++RVKI EPKPFCGARDAKALEN+IFD+EQYFR TNT  EE+K
Subjt:  KKLDSSQKAIIQMVTDLSEDFRAALDVVRTEIVDVSTRVNLTMRVVGNRTPAGGAVQLNRVKIFEPKPFCGARDAKALENFIFDMEQYFRTTNTTVEESK

Query:  ITLATM
        +TLATM
Subjt:  ITLATM

A0A5D3C4R1 Reverse transcriptase8.7e-6563.59Show/hide
Query:  MSTEKQLTKSQVDRLVEIEEQMLYLREVLDAIRFLEKRVQEISEKADGIDAVAGRLDGMPVKELMLRVETLETKAT--RPSSFKLGDSSSGFVAHMEERV
        MS+     K+Q DRLVE+EEQMLYL EV D+IR+LE R++EISEK + IDAVAGR++G P++ELM RV+ LET     R  +++ GDSS+G VAH+EERV
Subjt:  MSTEKQLTKSQVDRLVEIEEQMLYLREVLDAIRFLEKRVQEISEKADGIDAVAGRLDGMPVKELMLRVETLETKAT--RPSSFKLGDSSSGFVAHMEERV

Query:  KKLDSSQKAIIQMVTDLSEDFRAALDVVRTEIVDVSTRVNLTMRVVGNRTPAGGAVQLNRVKIFEPKPFCGARDAKALENFIFDMEQYFRTTNTTVEESK
        ++LDSSQK +++M+  +SEDFRA LDVVR EI DV+ R++LTMR + N+ PAGGA+ ++RVKI EPKPFCGARDAKALEN+IFD+EQYFR TNT  EE+K
Subjt:  KKLDSSQKAIIQMVTDLSEDFRAALDVVRTEIVDVSTRVNLTMRVVGNRTPAGGAVQLNRVKIFEPKPFCGARDAKALENFIFDMEQYFRTTNTTVEESK

Query:  ITLATM
        +TLATM
Subjt:  ITLATM

A0A5D3DHI2 Retrotrans_gag domain-containing protein5.1e-6563.11Show/hide
Query:  MSTEKQLTKSQVDRLVEIEEQMLYLREVLDAIRFLEKRVQEISEKADGIDAVAGRLDGMPVKELMLRVETLE--TKATRPSSFKLGDSSSGFVAHMEERV
        MS+     K+Q DRLVEIEEQMLYL EV D+IR+LE RV+EISEKA+ IDAVAGR++G+P++EL+ RV+ LE  T A R  +++ G+SSSGF AHMEERV
Subjt:  MSTEKQLTKSQVDRLVEIEEQMLYLREVLDAIRFLEKRVQEISEKADGIDAVAGRLDGMPVKELMLRVETLE--TKATRPSSFKLGDSSSGFVAHMEERV

Query:  KKLDSSQKAIIQMVTDLSEDFRAALDVVRTEIVDVSTRVNLTMRVVGNRTPAGGAVQLNRVKIFEPKPFCGARDAKALENFIFDMEQYFRTTNTTVEESK
         +LD++QK +++M+  +SEDFRA LDVVR EI DV+ R++LTMR + N+ PAGGA+ +++VK+ EPKPFCGARDAKALEN+IFD+EQYF+ TNT  EE+K
Subjt:  KKLDSSQKAIIQMVTDLSEDFRAALDVVRTEIVDVSTRVNLTMRVVGNRTPAGGAVQLNRVKIFEPKPFCGARDAKALENFIFDMEQYFRTTNTTVEESK

Query:  ITLATM
        +TLATM
Subjt:  ITLATM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGGCGGAGGTTGGGAGAGGCACTCGGTGCCGTAGGCGATCAGAATTTGATCAGGCTGACTTGTTCTTGGTATCAGAGTCAAGTCAGCTCATAAGAGAGAAGGCCAG
AATCATGTCGACCGAAAAACAGTTAACCAAATCCCAAGTGGATCGACTGGTAGAGATAGAAGAGCAGATGCTCTACCTGAGAGAAGTTCTCGATGCCATCCGTTTCCTGG
AAAAGCGAGTACAAGAAATCTCTGAGAAGGCTGATGGGATTGACGCAGTTGCTGGCCGCTTAGATGGGATGCCCGTCAAAGAGTTGATGTTAAGGGTTGAGACCCTAGAG
ACGAAAGCTACAAGACCTAGTAGCTTCAAGCTTGGGGATAGCTCATCGGGCTTTGTAGCCCATATGGAAGAAAGAGTCAAGAAGCTGGATAGCTCTCAAAAAGCAATAAT
CCAGATGGTCACCGACTTGTCAGAAGACTTTCGAGCCGCACTTGATGTCGTCAGGACAGAGATTGTGGATGTCAGTACAAGGGTAAATCTCACCATGAGAGTGGTGGGAA
ACCGAACCCCAGCTGGGGGTGCGGTTCAATTGAATCGAGTGAAAATTTTCGAACCCAAGCCCTTCTGTGGGGCTCGAGACGCAAAAGCTTTGGAGAATTTCATATTTGAC
ATGGAACAATACTTCAGGACAACAAACACGACTGTTGAAGAGTCGAAGATTACTTTGGCGACGATGCCAAACTATGGTGGAGGTCCCGTTACATAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGGCGGAGGTTGGGAGAGGCACTCGGTGCCGTAGGCGATCAGAATTTGATCAGGCTGACTTGTTCTTGGTATCAGAGTCAAGTCAGCTCATAAGAGAGAAGGCCAG
AATCATGTCGACCGAAAAACAGTTAACCAAATCCCAAGTGGATCGACTGGTAGAGATAGAAGAGCAGATGCTCTACCTGAGAGAAGTTCTCGATGCCATCCGTTTCCTGG
AAAAGCGAGTACAAGAAATCTCTGAGAAGGCTGATGGGATTGACGCAGTTGCTGGCCGCTTAGATGGGATGCCCGTCAAAGAGTTGATGTTAAGGGTTGAGACCCTAGAG
ACGAAAGCTACAAGACCTAGTAGCTTCAAGCTTGGGGATAGCTCATCGGGCTTTGTAGCCCATATGGAAGAAAGAGTCAAGAAGCTGGATAGCTCTCAAAAAGCAATAAT
CCAGATGGTCACCGACTTGTCAGAAGACTTTCGAGCCGCACTTGATGTCGTCAGGACAGAGATTGTGGATGTCAGTACAAGGGTAAATCTCACCATGAGAGTGGTGGGAA
ACCGAACCCCAGCTGGGGGTGCGGTTCAATTGAATCGAGTGAAAATTTTCGAACCCAAGCCCTTCTGTGGGGCTCGAGACGCAAAAGCTTTGGAGAATTTCATATTTGAC
ATGGAACAATACTTCAGGACAACAAACACGACTGTTGAAGAGTCGAAGATTACTTTGGCGACGATGCCAAACTATGGTGGAGGTCCCGTTACATAA
Protein sequenceShow/hide protein sequence
MGAEVGRGTRCRRRSEFDQADLFLVSESSQLIREKARIMSTEKQLTKSQVDRLVEIEEQMLYLREVLDAIRFLEKRVQEISEKADGIDAVAGRLDGMPVKELMLRVETLE
TKATRPSSFKLGDSSSGFVAHMEERVKKLDSSQKAIIQMVTDLSEDFRAALDVVRTEIVDVSTRVNLTMRVVGNRTPAGGAVQLNRVKIFEPKPFCGARDAKALENFIFD
MEQYFRTTNTTVEESKITLATMPNYGGGPVT