; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G18724 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G18724
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationctg3412:110034..113436
RNA-Seq ExpressionCucsat.G18724
SyntenyCucsat.G18724
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040613.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]5.42e-5870.34Show/hide
Query:  MKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLH
        MKEE+TTIEKNGTWKMV+  +GK+AIDLKWV+KTKF ADG LEK+KARLVAKG+ QQHG +FE+TFS +A FE V++VLALAAQ+QWSVYQFDVK  FL+
Subjt:  MKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLH

Query:  GELQEEVYVGQPEGFVIEGSKEKVYKLTKALYGFETYEKVLNMLC
         EL+EEVYV QP+GFV + S+EKVYKLTKALYG +   +   M C
Subjt:  GELQEEVYVGQPEGFVIEGSKEKVYKLTKALYGFETYEKVLNMLC

KAA0050371.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.11e-5972.79Show/hide
Query:  PMKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFL
        P + +  TIEKNGTWKMV+ SE K+AI LKWV+KTKF   G LEK+KARLVAKGY QQHG DFE+ FS +A FE V+IVLALAAQ+QWS+YQFDVK AFL
Subjt:  PMKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFL

Query:  HGELQEEVYVGQPEGFVIEGSKEKVYKLTKALYGFE
        +GELQEEVYV QPEGFV + S+EKVYKLTKALYG +
Subjt:  HGELQEEVYVGQPEGFVIEGSKEKVYKLTKALYGFE

KAA0054939.1 putative gag-pol polyprotein, identical [Cucumis melo var. makuwa]7.62e-5974.64Show/hide
Query:  QQPMKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLA
        Q+ MKEE+  IEKNGTWKMV+  EGK+AI LKWV+K+KF ADG LEK+KA LVAKG+ QQHG DFE+TFS +A FE V+IVLALAAQ+QWSVYQFDVK A
Subjt:  QQPMKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLA

Query:  FLHGELQEEVYVGQPEGFVIEGSKEKVYKLTKALYGFE
        FL+GELQEEVYV QPEGFV + S+EKVYKLTKALYG +
Subjt:  FLHGELQEEVYVGQPEGFVIEGSKEKVYKLTKALYGFE

KAA0066378.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]3.23e-6971.6Show/hide
Query:  KHWMSCHLGGSDPWRTSIILLNLPLWFLTRCVMMRQQPMKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDF
        +H M+ HL G  PW+ SIILLNL LWFLT+C+MMRQQ MKEE+  IEKNGTWKMV+  EGK+AI LKWV+K+KF ADG LEK+KA LVAKGY QQHG DF
Subjt:  KHWMSCHLGGSDPWRTSIILLNLPLWFLTRCVMMRQQPMKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDF

Query:  EKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGELQEEVYVGQPEGFVIEGSKEKV
        ++T S IA FE VKIVLAL A +QW VYQFDVK AFL+GELQEEVYV QPEGFV + S+EKV
Subjt:  EKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGELQEEVYVGQPEGFVIEGSKEKV

TYK00906.1 putative gag-pol polyprotein, identical [Cucumis melo var. makuwa]1.66e-6871.6Show/hide
Query:  KHWMSCHLGGSDPWRTSIILLNLPLWFLTRCVMMRQQPMKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDF
        +H M+ HL G  PW+ SIILLNL LWFLT+C+MMRQQ MKEE+  IEKNGTWKMV+  EGK+AI LKWV+K+KF ADG LEK+KA LVAKGY QQHG DF
Subjt:  KHWMSCHLGGSDPWRTSIILLNLPLWFLTRCVMMRQQPMKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDF

Query:  EKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGELQEEVYVGQPEGFVIEGSKEKV
        ++T S IA FE VKIVLAL A +QW VYQFDVK AFL+GELQEEVYV QPEGFV + S+EKV
Subjt:  EKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGELQEEVYVGQPEGFVIEGSKEKV

TrEMBL top hitse value%identityAlignment
A0A5A7TC06 Retrovirus-related Pol polyprotein from transposon TNT 1-942.62e-5870.34Show/hide
Query:  MKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLH
        MKEE+TTIEKNGTWKMV+  +GK+AIDLKWV+KTKF ADG LEK+KARLVAKG+ QQHG +FE+TFS +A FE V++VLALAAQ+QWSVYQFDVK  FL+
Subjt:  MKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLH

Query:  GELQEEVYVGQPEGFVIEGSKEKVYKLTKALYGFETYEKVLNMLC
         EL+EEVYV QP+GFV + S+EKVYKLTKALYG +   +   M C
Subjt:  GELQEEVYVGQPEGFVIEGSKEKVYKLTKALYGFETYEKVLNMLC

A0A5A7UN91 Putative gag-pol polyprotein, identical3.69e-5974.64Show/hide
Query:  QQPMKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLA
        Q+ MKEE+  IEKNGTWKMV+  EGK+AI LKWV+K+KF ADG LEK+KA LVAKG+ QQHG DFE+TFS +A FE V+IVLALAAQ+QWSVYQFDVK A
Subjt:  QQPMKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLA

Query:  FLHGELQEEVYVGQPEGFVIEGSKEKVYKLTKALYGFE
        FL+GELQEEVYV QPEGFV + S+EKVYKLTKALYG +
Subjt:  FLHGELQEEVYVGQPEGFVIEGSKEKVYKLTKALYGFE

A0A5A7VF84 Retrovirus-related Pol polyprotein from transposon TNT 1-941.56e-6971.6Show/hide
Query:  KHWMSCHLGGSDPWRTSIILLNLPLWFLTRCVMMRQQPMKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDF
        +H M+ HL G  PW+ SIILLNL LWFLT+C+MMRQQ MKEE+  IEKNGTWKMV+  EGK+AI LKWV+K+KF ADG LEK+KA LVAKGY QQHG DF
Subjt:  KHWMSCHLGGSDPWRTSIILLNLPLWFLTRCVMMRQQPMKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDF

Query:  EKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGELQEEVYVGQPEGFVIEGSKEKV
        ++T S IA FE VKIVLAL A +QW VYQFDVK AFL+GELQEEVYV QPEGFV + S+EKV
Subjt:  EKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGELQEEVYVGQPEGFVIEGSKEKV

A0A5D3BRM6 Putative gag-pol polyprotein, identical8.05e-6971.6Show/hide
Query:  KHWMSCHLGGSDPWRTSIILLNLPLWFLTRCVMMRQQPMKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDF
        +H M+ HL G  PW+ SIILLNL LWFLT+C+MMRQQ MKEE+  IEKNGTWKMV+  EGK+AI LKWV+K+KF ADG LEK+KA LVAKGY QQHG DF
Subjt:  KHWMSCHLGGSDPWRTSIILLNLPLWFLTRCVMMRQQPMKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDF

Query:  EKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGELQEEVYVGQPEGFVIEGSKEKV
        ++T S IA FE VKIVLAL A +QW VYQFDVK AFL+GELQEEVYV QPEGFV + S+EKV
Subjt:  EKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGELQEEVYVGQPEGFVIEGSKEKV

A0A5D3BWT3 Retrovirus-related Pol polyprotein from transposon TNT 1-945.39e-6072.79Show/hide
Query:  PMKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFL
        P + +  TIEKNGTWKMV+ SE K+AI LKWV+KTKF   G LEK+KARLVAKGY QQHG DFE+ FS +A FE V+IVLALAAQ+QWS+YQFDVK AFL
Subjt:  PMKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFL

Query:  HGELQEEVYVGQPEGFVIEGSKEKVYKLTKALYGFE
        +GELQEEVYV QPEGFV + S+EKVYKLTKALYG +
Subjt:  HGELQEEVYVGQPEGFVIEGSKEKVYKLTKALYGFE

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.3e-2136.84Show/hide
Query:  QQPMKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLA
        ++ +  E+   + N TW + +  E K+ +D +WVF  K+   G   +YKARLVA+G+ Q++  D+E+TF+ +A   + + +L+L  Q    V+Q DVK A
Subjt:  QQPMKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLA

Query:  FLHGELQEEVYVGQPEGFVIEGSKEKVYKLTKALYG--------FETYEKVL
        FL+G L+EE+Y+  P+G  I  + + V KL KA+YG        FE +E+ L
Subjt:  FLHGELQEEVYVGQPEGFVIEGSKEKVYKLTKALYG--------FETYEKVL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.1e-3048.89Show/hide
Query:  MKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLH
        M+EE+ +++KNGT+K+VE  +GK  +  KWVFK K   D  L +YKARLV KG+ Q+ G DF++ FS +    +++ +L+LAA     V Q DVK AFLH
Subjt:  MKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLH

Query:  GELQEEVYVGQPEGFVIEGSKEKVYKLTKALYGFE
        G+L+EE+Y+ QPEGF + G K  V KL K+LYG +
Subjt:  GELQEEVYVGQPEGFVIEGSKEKVYKLTKALYGFE

P92520 Uncharacterized mitochondrial protein AtMg008204.9e-1139.53Show/hide
Query:  QPMKEEITTIEKNGTWKMVESE-GKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQ
        Q M+EE+  + +N TW +V     ++ +  KWVFKTK  +DG L++ KARLVAKG+ Q+ G  F +T+S +     ++ +L +A Q
Subjt:  QPMKEEITTIEKNGTWKMVESE-GKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQ

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.6e-2436.78Show/hide
Query:  KHWMSCHLGGSDPWRTSIILLNLPLWFLTRCVMMRQQPMKEEITTIEKNGTWKMVESEGK--SAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSD
        K+ ++  L      RT+I  L    W         +  M  EI     N TW +V       + +  +W+F  K+ +DG L +YKARLVAKGY Q+ G D
Subjt:  KHWMSCHLGGSDPWRTSIILLNLPLWFLTRCVMMRQQPMKEEITTIEKNGTWKMVESEGK--SAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSD

Query:  FEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGELQEEVYVGQPEGFVIEGSKEKVYKLTKALYGFE
        + +TFS +    +++IVL +A  R W + Q DV  AFL G L ++VY+ QP GF+ +     V KL KALYG +
Subjt:  FEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLHGELQEEVYVGQPEGFVIEGSKEKVYKLTKALYGFE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.7e-2443.17Show/hide
Query:  QQPMKEEITTIEKNGTWKMVESEGKSA--IDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKL
        +Q M  EI     N TW +V     S   +  +W+F  KF +DG L +YKARLVAKGY Q+ G D+ +TFS +    +++IVL +A  R W + Q DV  
Subjt:  QQPMKEEITTIEKNGTWKMVESEGKSA--IDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKL

Query:  AFLHGELQEEVYVGQPEGFVIEGSKEKVYKLTKALYGFE
        AFL G L +EVY+ QP GFV +   + V +L KA+YG +
Subjt:  AFLHGELQEEVYVGQPEGFVIEGSKEKVYKLTKALYGFE

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.2e-2340.29Show/hide
Query:  MKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLH
        M +EI  +E   TW++      K  I  KWV+K K+ +DG +E+YKARLVAKGY QQ G DF +TFS +    +VK++LA++A   ++++Q D+  AFL+
Subjt:  MKEEITTIEKNGTWKMVE-SEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQRQWSVYQFDVKLAFLH

Query:  GELQEEVYVGQPEGFVIEGS----KEKVYKLTKALYGFE
        G+L EE+Y+  P G+            V  L K++YG +
Subjt:  GELQEEVYVGQPEGFVIEGS----KEKVYKLTKALYGFE

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.5e-1239.53Show/hide
Query:  QPMKEEITTIEKNGTWKMVESE-GKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQ
        Q M+EE+  + +N TW +V     ++ +  KWVFKTK  +DG L++ KARLVAKG+ Q+ G  F +T+S +     ++ +L +A Q
Subjt:  QPMKEEITTIEKNGTWKMVESE-GKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAHFENVKIVLALAAQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGAAACATTGGATGAGTTGCCACCTTGGAGGTTCTGATCCATGGAGGACATCTATAATTCTTCTCAATTTGCCCTTATGGTTTCTGACCCGGTGTGTTATGATGAG
GCAGCAACCAATGAAGGAAGAAATAACAACGATTGAGAAGAATGGGACGTGGAAAATGGTAGAATCGGAGGGAAAAAGTGCAATCGACTTGAAGTGGGTCTTTAAGACGA
AATTTGTTGCGGATGGAATTTTAGAGAAGTACAAAGCTCGACTCGTGGCGAAAGGATACGTGCAGCAACACGGTAGTGATTTTGAGAAAACTTTCTCTTCAATAGCTCAT
TTTGAAAACGTGAAGATTGTTCTAGCATTGGCAGCACAACGACAATGGTCGGTTTATCAATTTGATGTCAAGTTAGCCTTTCTCCATGGAGAATTGCAAGAAGAAGTCTA
TGTTGGACAACCAGAAGGTTTTGTCATAGAAGGCAGCAAAGAAAAGGTGTATAAGTTGACAAAGGCTTTGTACGGGTTTGAAACTTATGAGAAGGTTTTAAATATGTTAT
GCGTTACACTGTTGGAGCAATGTAGTATGGCATTTTGTACTCTAAATTTTCCAATTTCAAGCTATGCGGGTTCACGGACAGCGATTGGGCGAGCTCATTGGATGATAGGC
AGAGTGTTTCAGCAAATGTATTCACACTCGAGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGATGAAACATTGGATGAGTTGCCACCTTGGAGGTTCTGATCCATGGAGGACATCTATAATTCTTCTCAATTTGCCCTTATGGTTTCTGACCCGGTGTGTTATGATGAG
GCAGCAACCAATGAAGGAAGAAATAACAACGATTGAGAAGAATGGGACGTGGAAAATGGTAGAATCGGAGGGAAAAAGTGCAATCGACTTGAAGTGGGTCTTTAAGACGA
AATTTGTTGCGGATGGAATTTTAGAGAAGTACAAAGCTCGACTCGTGGCGAAAGGATACGTGCAGCAACACGGTAGTGATTTTGAGAAAACTTTCTCTTCAATAGCTCAT
TTTGAAAACGTGAAGATTGTTCTAGCATTGGCAGCACAACGACAATGGTCGGTTTATCAATTTGATGTCAAGTTAGCCTTTCTCCATGGAGAATTGCAAGAAGAAGTCTA
TGTTGGACAACCAGAAGGTTTTGTCATAGAAGGCAGCAAAGAAAAGGTGTATAAGTTGACAAAGGCTTTGTACGGGTTTGAAACTTATGAGAAGGTTTTAAATATGTTAT
GCGTTACACTGTTGGAGCAATGTAGTATGGCATTTTGTACTCTAAATTTTCCAATTTCAAGCTATGCGGGTTCACGGACAGCGATTGGGCGAGCTCATTGGATGATAGGC
AGAGTGTTTCAGCAAATGTATTCACACTCGAGTTAG
Protein sequenceShow/hide protein sequence
MMKHWMSCHLGGSDPWRTSIILLNLPLWFLTRCVMMRQQPMKEEITTIEKNGTWKMVESEGKSAIDLKWVFKTKFVADGILEKYKARLVAKGYVQQHGSDFEKTFSSIAH
FENVKIVLALAAQRQWSVYQFDVKLAFLHGELQEEVYVGQPEGFVIEGSKEKVYKLTKALYGFETYEKVLNMLCVTLLEQCSMAFCTLNFPISSYAGSRTAIGRAHWMIG
RVFQQMYSHSS