; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C01G016560 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C01G016560
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCla97Chr01:30245484..30249529
RNA-Seq ExpressionCla97C01G016560
SyntenyCla97C01G016560
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7034303.1 hypothetical protein SDJN02_04030 [Cucurbita argyrosperma subsp. argyrosperma]7.9e-6583.54Show/hide
Query:  MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNPESWDKRNEICFRQIYEQDAKNYYRS-THTTKLDMEQPTEDEPEATNGAVQIINENEL
        MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSR+SGWNP+SWDKRNEICFRQIYEQDAKNY RS T TTKLD+EQP EDE EA NGA+QIINENEL
Subjt:  MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNPESWDKRNEICFRQIYEQDAKNYYRS-THTTKLDMEQPTEDEPEATNGAVQIINENEL

Query:  ELTLGPSSYNTSDSGVTTHSSSSTGSSHEGRRCTDTKKVKGQEMAVFGGTENSSGYQNEKSQNQ
        ELTLGPSSYNTSDSG+TTHSSSSTGSSHEGRR  D K+V+GQEMAV G  EN SG+QN   + +
Subjt:  ELTLGPSSYNTSDSGVTTHSSSSTGSSHEGRRCTDTKKVKGQEMAVFGGTENSSGYQNEKSQNQ

XP_008440360.1 PREDICTED: uncharacterized protein LOC103484836 [Cucumis melo]1.4e-6685.89Show/hide
Query:  MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNPESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPTEDEPEATNGAVQIINENELE
        MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRE+    ESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQP EDEPE  NGA QIINE ELE
Subjt:  MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNPESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPTEDEPEATNGAVQIINENELE

Query:  LTLGPSSYNTSDSGVTTHSSSSTGSSHEGRRCTDTKKVKGQEMAVFGGTENSSGYQNEKSQNQ
        LTLGPSSYNTSDSG TT+SSSSTGSSHEGRRCTDTK+VKGQEMA  G TENSSG QN  ++ +
Subjt:  LTLGPSSYNTSDSGVTTHSSSSTGSSHEGRRCTDTKKVKGQEMAVFGGTENSSGYQNEKSQNQ

XP_022949937.1 uncharacterized protein LOC111453182 [Cucurbita moschata]7.9e-6583.54Show/hide
Query:  MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNPESWDKRNEICFRQIYEQDAKNYYRS-THTTKLDMEQPTEDEPEATNGAVQIINENEL
        MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSR+SGWNP+SWDKRNEICFRQIYEQDAKNY RS T TTKLD+EQP EDE EA NGA+QIINENEL
Subjt:  MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNPESWDKRNEICFRQIYEQDAKNYYRS-THTTKLDMEQPTEDEPEATNGAVQIINENEL

Query:  ELTLGPSSYNTSDSGVTTHSSSSTGSSHEGRRCTDTKKVKGQEMAVFGGTENSSGYQNEKSQNQ
        ELTLGPSSYNTSDSG+TTHSSSSTGSSHEGRR  D K+V+GQEMAV G  EN SG+QN   + +
Subjt:  ELTLGPSSYNTSDSGVTTHSSSSTGSSHEGRRCTDTKKVKGQEMAVFGGTENSSGYQNEKSQNQ

XP_038883305.1 uncharacterized protein LOC120074293 isoform X1 [Benincasa hispida]1.0e-7288.34Show/hide
Query:  MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNPESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPTEDEPEATNGAVQIINENELE
        MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNPESWDKRNEICFRQIYEQDAKNYYRSTH TKLDMEQP E EPEA NGA+QIINENELE
Subjt:  MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNPESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPTEDEPEATNGAVQIINENELE

Query:  LTLGPSSYNTSDSGVTTHSSSSTGSSHEGRRCTDTKKVKGQEMAVFGGTENSSGYQNEKSQNQ
        LTLGPSSYNTSDSG+TTHSSSSTGSSHEGRRCTD+++VKGQEM V G TENSSGY+N  ++ +
Subjt:  LTLGPSSYNTSDSGVTTHSSSSTGSSHEGRRCTDTKKVKGQEMAVFGGTENSSGYQNEKSQNQ

XP_038883306.1 uncharacterized protein LOC120074293 isoform X2 [Benincasa hispida]1.0e-7288.34Show/hide
Query:  MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNPESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPTEDEPEATNGAVQIINENELE
        MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNPESWDKRNEICFRQIYEQDAKNYYRSTH TKLDMEQP E EPEA NGA+QIINENELE
Subjt:  MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNPESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPTEDEPEATNGAVQIINENELE

Query:  LTLGPSSYNTSDSGVTTHSSSSTGSSHEGRRCTDTKKVKGQEMAVFGGTENSSGYQNEKSQNQ
        LTLGPSSYNTSDSG+TTHSSSSTGSSHEGRRCTD+++VKGQEM V G TENSSGY+N  ++ +
Subjt:  LTLGPSSYNTSDSGVTTHSSSSTGSSHEGRRCTDTKKVKGQEMAVFGGTENSSGYQNEKSQNQ

TrEMBL top hitse value%identityAlignment
A0A0A0KIH1 Uncharacterized protein3.3e-6182.93Show/hide
Query:  MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNPESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPTEDEPEA-TNGAVQIINENEL
        MRMAMLKHEETFKQQV ELHRLYRTQKTLMKNVEKSRE+    ESWDKRNEICFRQIYEQDAKNYYRST TTKLDMEQP EDEPE   NGA QIINE EL
Subjt:  MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNPESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPTEDEPEA-TNGAVQIINENEL

Query:  ELTLGPSSYNTSDSGVTTHSSSSTGSSHEGRRCTDTKKVKGQEMAVFGGTENSSGYQNEKSQNQ
        ELTLGPSSYNTSDSG TT+SSSSTGSSH+ RRCTDTK+VKGQEMA  G TENSSG QN  ++ +
Subjt:  ELTLGPSSYNTSDSGVTTHSSSSTGSSHEGRRCTDTKKVKGQEMAVFGGTENSSGYQNEKSQNQ

A0A1S3B1P5 uncharacterized protein LOC1034848367.0e-6785.89Show/hide
Query:  MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNPESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPTEDEPEATNGAVQIINENELE
        MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRE+    ESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQP EDEPE  NGA QIINE ELE
Subjt:  MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNPESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPTEDEPEATNGAVQIINENELE

Query:  LTLGPSSYNTSDSGVTTHSSSSTGSSHEGRRCTDTKKVKGQEMAVFGGTENSSGYQNEKSQNQ
        LTLGPSSYNTSDSG TT+SSSSTGSSHEGRRCTDTK+VKGQEMA  G TENSSG QN  ++ +
Subjt:  LTLGPSSYNTSDSGVTTHSSSSTGSSHEGRRCTDTKKVKGQEMAVFGGTENSSGYQNEKSQNQ

A0A6J1BU22 uncharacterized protein LOC1110053597.5e-6181.44Show/hide
Query:  MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSR--ESGWNPESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPTED-EPE-ATNGAVQIINE
        MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSR  ++GWN ESWDKRNEICFRQIYEQDAK+YY+STHTTKLD+EQP ED EPE   NG +QIINE
Subjt:  MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSR--ESGWNPESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPTED-EPE-ATNGAVQIINE

Query:  NELELTLGPSSYNTSDSGVTTHSSSSTGSSHEGRRCTDTKKVKGQEMAVFGGTENSSGYQNEKSQNQ
        NE+ELTLGPSSYNTSDSGVT  SSSSTGSSHEGRR  D+K+VKGQEM V G TE SSGYQN  S  +
Subjt:  NELELTLGPSSYNTSDSGVTTHSSSSTGSSHEGRRCTDTKKVKGQEMAVFGGTENSSGYQNEKSQNQ

A0A6J1GEC8 uncharacterized protein LOC1114531823.8e-6583.54Show/hide
Query:  MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNPESWDKRNEICFRQIYEQDAKNYYRS-THTTKLDMEQPTEDEPEATNGAVQIINENEL
        MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSR+SGWNP+SWDKRNEICFRQIYEQDAKNY RS T TTKLD+EQP EDE EA NGA+QIINENEL
Subjt:  MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNPESWDKRNEICFRQIYEQDAKNYYRS-THTTKLDMEQPTEDEPEATNGAVQIINENEL

Query:  ELTLGPSSYNTSDSGVTTHSSSSTGSSHEGRRCTDTKKVKGQEMAVFGGTENSSGYQNEKSQNQ
        ELTLGPSSYNTSDSG+TTHSSSSTGSSHEGRR  D K+V+GQEMAV G  EN SG+QN   + +
Subjt:  ELTLGPSSYNTSDSGVTTHSSSSTGSSHEGRRCTDTKKVKGQEMAVFGGTENSSGYQNEKSQNQ

A0A6J1IUL1 uncharacterized protein LOC1114787141.4e-6281.71Show/hide
Query:  MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNPESWDKRNEICFRQIYEQDAKNYYRS-THTTKLDMEQPTEDEPEATNGAVQIINENEL
        MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSR+SGWN +SWDKRNEICFRQIYEQDAKNY RS T TTKLD+EQP EDE EA NGA+QIINENEL
Subjt:  MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNPESWDKRNEICFRQIYEQDAKNYYRS-THTTKLDMEQPTEDEPEATNGAVQIINENEL

Query:  ELTLGPSSYNTSDSGVTTHSSSSTGSSHEGRRCTDTKKVKGQEMAVFGGTENSSGYQNEKSQNQ
        ELTLGPSSYNTSDSG+TTHSSSSTGSSHEGRR  D  +V+ QEMAV G  EN SG+QN   + +
Subjt:  ELTLGPSSYNTSDSGVTTHSSSSTGSSHEGRRCTDTKKVKGQEMAVFGGTENSSGYQNEKSQNQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G26620.1 Plant protein of unknown function (DUF863)1.7e-0461.76Show/hide
Query:  MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVE
        M+  ML+HE  FK QVHELHRLYR QK L++ V+
Subjt:  MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVE

AT1G69360.1 Plant protein of unknown function (DUF863)1.3e-0461.76Show/hide
Query:  MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVE
        ++  ML+HE  FK QV+ELHRLYRTQK+LM  V+
Subjt:  MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVE

AT5G67390.1 unknown protein1.8e-1137.58Show/hide
Query:  MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNPESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPTEDEPEATNGAVQIINENELE
        M+MAMLKHEETFKQQV+ELHRLY+ QK LMKN+E ++ +        K N +           N    T   ++D E          N  ++I++E+E+E
Subjt:  MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNPESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPTEDEPEATNGAVQIINENELE

Query:  LTLGPSSY----------------------NTSDSGVTTHSSSSTGSSH
        LTLGPS Y                       + +SG  + SSSSTGSS+
Subjt:  LTLGPSSY----------------------NTSDSGVTTHSSSSTGSSH

AT5G67390.2 unknown protein1.8e-1137.58Show/hide
Query:  MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNPESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPTEDEPEATNGAVQIINENELE
        M+MAMLKHEETFKQQV+ELHRLY+ QK LMKN+E ++ +        K N +           N    T   ++D E          N  ++I++E+E+E
Subjt:  MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNPESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPTEDEPEATNGAVQIINENELE

Query:  LTLGPSSY----------------------NTSDSGVTTHSSSSTGSSH
        LTLGPS Y                       + +SG  + SSSSTGSS+
Subjt:  LTLGPSSY----------------------NTSDSGVTTHSSSSTGSSH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGATGGCCATGTTAAAGCATGAAGAAACATTCAAACAACAGGTACATGAACTGCATCGACTTTATCGAACCCAAAAGACATTAATGAAAAACGTAGAGAAA
AGCAGGGAAAGTGGATGGAATCCAGAAAGTTGGGACAAAAGGAATGAGATATGTTTCAGACAAATCTATGAACAGGATGCAAAAAATTATTACAGATCAACTCAT
ACGACCAAACTAGACATGGAACAACCCACTGAAGATGAACCGGAAGCCACTAATGGAGCTGTGCAGATCATAAACGAGAATGAACTCGAACTAACTCTAGGGCCT
TCAAGTTACAATACTTCAGATTCAGGAGTAACCACTCACTCTTCTTCTTCAACAGGGTCCAGCCATGAGGGAAGAAGATGTACGGATACAAAGAAAGTTAAAGGT
CAAGAAATGGCGGTTTTTGGGGGAACTGAAAATTCCTCAGGCTACCAAAATGAAAAAAGCCAAAATCAGAGCCTTACTCGGCAACAAAAGGCTCACAATGCCCTC
ATTGATCCATCAAAACTTCCAGACTCTGTAACTGCACAAGAAAGAGAAGATATGGAATTAACAGCATTTGAGACACTAATTCTAAACCTCATTGACAATGTCCTA
AGAGAAGTCAATGGAGAGTCAATTTACATGAGGAGTAATGGAGCATGTCAGATCATTGAGATTGGATCAATTACATTGAAACTAAAGGATGAAACTGTGGAACTC
CTAAGGAATGTTAGGCATGTTCCACATCTTAAAAGAAACCTAATTTCCATAGGGATGCTTGACTCAATAGGATGTGAATACAGGAGAAAAGGTGGATGTTTAAAG
GTCCTGAAGGATTCCAAAGTTATCTTGGTTGGAGAAAAGGTGAATGATTTGTTCATTGTAAGAGGGGTTGAAAGGTTGAATGGAGCTTATACAGTAACTTCACCA
AACTTGACTGAAGCAGACCTATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGGATGGCCATGTTAAAGCATGAAGAAACATTCAAACAACAGGTACATGAACTGCATCGACTTTATCGAACCCAAAAGACATTAATGAAAAACGTAGAGAAA
AGCAGGGAAAGTGGATGGAATCCAGAAAGTTGGGACAAAAGGAATGAGATATGTTTCAGACAAATCTATGAACAGGATGCAAAAAATTATTACAGATCAACTCAT
ACGACCAAACTAGACATGGAACAACCCACTGAAGATGAACCGGAAGCCACTAATGGAGCTGTGCAGATCATAAACGAGAATGAACTCGAACTAACTCTAGGGCCT
TCAAGTTACAATACTTCAGATTCAGGAGTAACCACTCACTCTTCTTCTTCAACAGGGTCCAGCCATGAGGGAAGAAGATGTACGGATACAAAGAAAGTTAAAGGT
CAAGAAATGGCGGTTTTTGGGGGAACTGAAAATTCCTCAGGCTACCAAAATGAAAAAAGCCAAAATCAGAGCCTTACTCGGCAACAAAAGGCTCACAATGCCCTC
ATTGATCCATCAAAACTTCCAGACTCTGTAACTGCACAAGAAAGAGAAGATATGGAATTAACAGCATTTGAGACACTAATTCTAAACCTCATTGACAATGTCCTA
AGAGAAGTCAATGGAGAGTCAATTTACATGAGGAGTAATGGAGCATGTCAGATCATTGAGATTGGATCAATTACATTGAAACTAAAGGATGAAACTGTGGAACTC
CTAAGGAATGTTAGGCATGTTCCACATCTTAAAAGAAACCTAATTTCCATAGGGATGCTTGACTCAATAGGATGTGAATACAGGAGAAAAGGTGGATGTTTAAAG
GTCCTGAAGGATTCCAAAGTTATCTTGGTTGGAGAAAAGGTGAATGATTTGTTCATTGTAAGAGGGGTTGAAAGGTTGAATGGAGCTTATACAGTAACTTCACCA
AACTTGACTGAAGCAGACCTATGA
Protein sequenceShow/hide protein sequence
MRMAMLKHEETFKQQVHELHRLYRTQKTLMKNVEKSRESGWNPESWDKRNEICFRQIYEQDAKNYYRSTHTTKLDMEQPTEDEPEATNGAVQIINENELELTLGP
SSYNTSDSGVTTHSSSSTGSSHEGRRCTDTKKVKGQEMAVFGGTENSSGYQNEKSQNQSLTRQQKAHNALIDPSKLPDSVTAQEREDMELTAFETLILNLIDNVL
REVNGESIYMRSNGACQIIEIGSITLKLKDETVELLRNVRHVPHLKRNLISIGMLDSIGCEYRRKGGCLKVLKDSKVILVGEKVNDLFIVRGVERLNGAYTVTSP
NLTEADL