CuGenDBv2

CmoCh13G003200 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh13G003200
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Reverse transcriptase
Genome location: Cmo_Chr13:3524170..3525276
RNA-Seq Expression: CmoCh13G003200
Synteny: CmoCh13G003200
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
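
As a quick consistency check on the genome location given above, the coordinate string can be parsed and its span computed. This is a minimal sketch that assumes the `chrom:start..end` layout seen in this record and 1-based inclusive coordinates; the function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("Cmo_Chr13:3524170..3525276")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, span)  # Cmo_Chr13 1107
```

Under that assumption the span is 1107 bp, that is 369 codons, which is the length expected for an uninterrupted coding sequence of 368 residues plus a stop codon.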


Homology
GenBank top hits | e value | % identity
AAO45752.1 pol protein [Cucumis melo subsp. melo] | 4.1e-150 | 73.26
Query:  MGLMNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSF
        M LMN+VFREFLD F+IVFIDDIL+YSKT  +HEEHLR VL T R NKL+AKFSKCEFW +QV FLGHVV   G++VDP KIEAVT W RP+TV+EVRSF
Subjt:  MGLMNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSF

Query:  LGLVGYYRRFVQDFAKIATPLTELTKKGKSFTWNDQCEVSFKELKQRLVSDAVLTISSGDGGFVVYSDASKRGLGCVLMQRGKVIAYVSRQLKEYERNYP
        LGL GYYRRFV++F++IATPLT+LT+KG  F W+  CE SF+ LKQ+LV+  VLT+  G G FV+YSDASK+GLGCVLMQ+GKV+AY SRQLK +E+NYP
Subjt:  LGLVGYYRRFVQDFAKIATPLTELTKKGKSFTWNDQCEVSFKELKQRLVSDAVLTISSGDGGFVVYSDASKRGLGCVLMQRGKVIAYVSRQLKEYERNYP

Query:  THDLELAAVVFALKIWRHYLYRERTQIFTDHKSLKYFFTQKELNVRQRRWLELVKDYDVEILYHPGKANVVADALSQKLAHTSALITTQKAIQQDLERAE
        THDLELAAVVFALKIWRHYLY E+ QIFTDHKSLKYFFTQKELN+RQRRWLELVKDYD EILYHPGKANVVADALS+K++H++ALIT Q  + +DLERAE
Subjt:  THDLELAAVVFALKIWRHYLYRERTQIFTDHKSLKYFFTQKELNVRQRRWLELVKDYDVEILYHPGKANVVADALSQKLAHTSALITTQKAIQQDLERAE

Query:  VAVAMGEVAAQLAQLTIQPTLRQRLIDKQHGDPDLVGKRRLIDANQTEEFSLTADGGAL
        +AV +G V  QLAQLT+QPTLRQR+ID Q  DP LV KR L +A QT EFSL++DGG L
Subjt:  VAVAMGEVAAQLAQLTIQPTLRQRLIDKQHGDPDLVGKRRLIDANQTEEFSLTADGGAL

KAA0046094.1 gag protease polyprotein [Cucumis melo var. makuwa] | 1.8e-150 | 72.98
Query:  MGLMNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSF
        M LMN+VFREFLD F+IVFIDDIL+YSKT  +HEEHLR VL T R NKL+AKFSKCEFW +QV FLGHVV   G++VDP KIEAVT W RP+TV+EVRSF
Subjt:  MGLMNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSF

Query:  LGLVGYYRRFVQDFAKIATPLTELTKKGKSFTWNDQCEVSFKELKQRLVSDAVLTISSGDGGFVVYSDASKRGLGCVLMQRGKVIAYVSRQLKEYERNYP
        LGL GYYRRFV++F++IATPLT+LT+KG  F W+  CE SF+ LKQ+LV+  VLT+  G G FV+YSDASK+GLGCVLMQ+GKV+AY SRQLK +E+NYP
Subjt:  LGLVGYYRRFVQDFAKIATPLTELTKKGKSFTWNDQCEVSFKELKQRLVSDAVLTISSGDGGFVVYSDASKRGLGCVLMQRGKVIAYVSRQLKEYERNYP

Query:  THDLELAAVVFALKIWRHYLYRERTQIFTDHKSLKYFFTQKELNVRQRRWLELVKDYDVEILYHPGKANVVADALSQKLAHTSALITTQKAIQQDLERAE
        THDLELAAVVFALKIWRHYLY E+ QIFTDHKSLKYFFTQKELN+RQRRWLELVKDYD EILYHPGKANVVADALS+K++H++ALIT Q  + +DLER E
Subjt:  THDLELAAVVFALKIWRHYLYRERTQIFTDHKSLKYFFTQKELNVRQRRWLELVKDYDVEILYHPGKANVVADALSQKLAHTSALITTQKAIQQDLERAE

Query:  VAVAMGEVAAQLAQLTIQPTLRQRLIDKQHGDPDLVGKRRLIDANQTEEFSLTADGGAL
        +AV++G V  QLAQLT+QPTLRQR+ID Q  DP LV KR L +A QT+EFSL++DGG L
Subjt:  VAVAMGEVAAQLAQLTIQPTLRQRLIDKQHGDPDLVGKRRLIDANQTEEFSLTADGGAL

KAA0058399.1 pol protein [Cucumis melo var. makuwa] | 9.1e-150 | 72.98
Query:  MGLMNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSF
        M LMN+VFREFLD F+IVFIDDIL+YSKT  +HEEHLR VL T R NKL+AKFSKCEFW +QV FLGHVV   G++VDP KIEAVT W RP+TV+EVRSF
Subjt:  MGLMNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSF

Query:  LGLVGYYRRFVQDFAKIATPLTELTKKGKSFTWNDQCEVSFKELKQRLVSDAVLTISSGDGGFVVYSDASKRGLGCVLMQRGKVIAYVSRQLKEYERNYP
        LGL GYYRRFV++F++IATPLT+LT+KG  F W+  CE SF+ LKQ+LV+  VLT+  G G FV+YSDASK+GLGCVLMQ+GKV+AY SRQLK +E+NYP
Subjt:  LGLVGYYRRFVQDFAKIATPLTELTKKGKSFTWNDQCEVSFKELKQRLVSDAVLTISSGDGGFVVYSDASKRGLGCVLMQRGKVIAYVSRQLKEYERNYP

Query:  THDLELAAVVFALKIWRHYLYRERTQIFTDHKSLKYFFTQKELNVRQRRWLELVKDYDVEILYHPGKANVVADALSQKLAHTSALITTQKAIQQDLERAE
        THDLELAAVVFALKIWRHYLY E  QIFTDHKSLKYFFTQKELN+RQRRWLELVKDYD EILYHPGKANVVADALS+K++H++ALIT Q  + +DLERAE
Subjt:  THDLELAAVVFALKIWRHYLYRERTQIFTDHKSLKYFFTQKELNVRQRRWLELVKDYDVEILYHPGKANVVADALSQKLAHTSALITTQKAIQQDLERAE

Query:  VAVAMGEVAAQLAQLTIQPTLRQRLIDKQHGDPDLVGKRRLIDANQTEEFSLTADGGAL
        +AV++G V  QLAQLT+QPTLRQR+ID Q  DP LV KR L +A Q  EFSL++DGG L
Subjt:  VAVAMGEVAAQLAQLTIQPTLRQRLIDKQHGDPDLVGKRRLIDANQTEEFSLTADGGAL

KAA0059792.1 pol protein [Cucumis melo var. makuwa] | 5.3e-150 | 72.7
Query:  MGLMNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSF
        M LMN+VFREFLD F+IVFIDDIL+YSKT  +HEEHLRKVL T R NKL+AKFSKCEFW +QV FLGHV+  +G++VDP KIEAVT W RP+TV+EVRSF
Subjt:  MGLMNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSF

Query:  LGLVGYYRRFVQDFAKIATPLTELTKKGKSFTWNDQCEVSFKELKQRLVSDAVLTISSGDGGFVVYSDASKRGLGCVLMQRGKVIAYVSRQLKEYERNYP
        LGL GYYRRFV++F++IATPLT+LT+KG  F W+  CE SF+ LKQ+LV+  VLT+  G G FV+YSDASK+GLGCVLM +GKV+AY SRQLK +E+NYP
Subjt:  LGLVGYYRRFVQDFAKIATPLTELTKKGKSFTWNDQCEVSFKELKQRLVSDAVLTISSGDGGFVVYSDASKRGLGCVLMQRGKVIAYVSRQLKEYERNYP

Query:  THDLELAAVVFALKIWRHYLYRERTQIFTDHKSLKYFFTQKELNVRQRRWLELVKDYDVEILYHPGKANVVADALSQKLAHTSALITTQKAIQQDLERAE
        THDLELAAVVFALKIWRHYLY E+ QIFTDHKSLKYFFTQKELN+RQRRWLELVKDYD EILYHPGKANVVADALS+K++H++ALIT Q  + +DLERAE
Subjt:  THDLELAAVVFALKIWRHYLYRERTQIFTDHKSLKYFFTQKELNVRQRRWLELVKDYDVEILYHPGKANVVADALSQKLAHTSALITTQKAIQQDLERAE

Query:  VAVAMGEVAAQLAQLTIQPTLRQRLIDKQHGDPDLVGKRRLIDANQTEEFSLTADGGAL
        +AV++G V  QLAQLT+QPTLRQR+ID Q  DP LV KR L +A Q  EFSL++DGG L
Subjt:  VAVAMGEVAAQLAQLTIQPTLRQRLIDKQHGDPDLVGKRRLIDANQTEEFSLTADGGAL

TYK01613.1 pol protein [Cucumis melo var. makuwa] | 2.4e-150 | 73.26
Query:  MGLMNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSF
        M LMN+VFREFLD F+IVFIDDIL+YSKT  +HEEHLR VL T R NKL+AKFSKCEFW +QV FLGHVV   G++VDP KIEAVT W RP+TV+EVRSF
Subjt:  MGLMNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSF

Query:  LGLVGYYRRFVQDFAKIATPLTELTKKGKSFTWNDQCEVSFKELKQRLVSDAVLTISSGDGGFVVYSDASKRGLGCVLMQRGKVIAYVSRQLKEYERNYP
        LGL GYYRRFV++F++IATPLT+LT+KG  F W+  CE SF+ LKQ+LV+  VLT+  G G FV+YSDASK+GLGCVLMQ+GKV+AY SRQLK +E+NYP
Subjt:  LGLVGYYRRFVQDFAKIATPLTELTKKGKSFTWNDQCEVSFKELKQRLVSDAVLTISSGDGGFVVYSDASKRGLGCVLMQRGKVIAYVSRQLKEYERNYP

Query:  THDLELAAVVFALKIWRHYLYRERTQIFTDHKSLKYFFTQKELNVRQRRWLELVKDYDVEILYHPGKANVVADALSQKLAHTSALITTQKAIQQDLERAE
        THDLELAAVVFALKIWRHYLY E+ QIFTDHKSLKYFFTQKELN+RQRRWLELVKDYD EILYHPGKANVVADALS+K++H++ALIT Q  + +DLERAE
Subjt:  THDLELAAVVFALKIWRHYLYRERTQIFTDHKSLKYFFTQKELNVRQRRWLELVKDYDVEILYHPGKANVVADALSQKLAHTSALITTQKAIQQDLERAE

Query:  VAVAMGEVAAQLAQLTIQPTLRQRLIDKQHGDPDLVGKRRLIDANQTEEFSLTADGGAL
        +AV++G V  QLAQLT+QPTLRQR+ID Q  DP LV KR L +A QT EFSL++DGG L
Subjt:  VAVAMGEVAAQLAQLTIQPTLRQRLIDKQHGDPDLVGKRRLIDANQTEEFSLTADGGAL

TrEMBL top hits | e value | % identity
A0A5A7TQ36 Reverse transcriptase | 4.4e-150 | 72.98
Query:  MGLMNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSF
        M LMN+VFREFLDNF+IVFIDDIL+YSKT  +HEEHLR VL T R NKL+AKFSKCEFW +QV FLGHVV   G++VDP KIEAVT W RP+TV+EVRSF
Subjt:  MGLMNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSF

Query:  LGLVGYYRRFVQDFAKIATPLTELTKKGKSFTWNDQCEVSFKELKQRLVSDAVLTISSGDGGFVVYSDASKRGLGCVLMQRGKVIAYVSRQLKEYERNYP
        LGL GYYRRFV++F++IATPLT+LT+KG  F W+  CE SF+ LKQ+LV+  VLT+  G   FV+YSDASK+GLGCVLMQ+GKV+AY SRQLK +E+NYP
Subjt:  LGLVGYYRRFVQDFAKIATPLTELTKKGKSFTWNDQCEVSFKELKQRLVSDAVLTISSGDGGFVVYSDASKRGLGCVLMQRGKVIAYVSRQLKEYERNYP

Query:  THDLELAAVVFALKIWRHYLYRERTQIFTDHKSLKYFFTQKELNVRQRRWLELVKDYDVEILYHPGKANVVADALSQKLAHTSALITTQKAIQQDLERAE
        THDLELAAVVFALKIWRHYLY E+ QIFTDHKSLKYFFTQKELN+RQRRWLELVKDYD EILYHPGKANVVADALS+K++H++ALIT Q  + +DLERAE
Subjt:  THDLELAAVVFALKIWRHYLYRERTQIFTDHKSLKYFFTQKELNVRQRRWLELVKDYDVEILYHPGKANVVADALSQKLAHTSALITTQKAIQQDLERAE

Query:  VAVAMGEVAAQLAQLTIQPTLRQRLIDKQHGDPDLVGKRRLIDANQTEEFSLTADGGAL
        +AV++G V  QLAQLT+QPTLRQR+ID Q  DP LV KR L +A Q  EFSL++DGG L
Subjt:  VAVAMGEVAAQLAQLTIQPTLRQRLIDKQHGDPDLVGKRRLIDANQTEEFSLTADGGAL

A0A5A7TSQ8 Reverse transcriptase | 8.9e-151 | 72.98
Query:  MGLMNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSF
        M LMN+VFREFLD F+IVFIDDIL+YSKT  +HEEHLR VL T R NKL+AKFSKCEFW +QV FLGHVV   G++VDP KIEAVT W RP+TV+EVRSF
Subjt:  MGLMNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSF

Query:  LGLVGYYRRFVQDFAKIATPLTELTKKGKSFTWNDQCEVSFKELKQRLVSDAVLTISSGDGGFVVYSDASKRGLGCVLMQRGKVIAYVSRQLKEYERNYP
        LGL GYYRRFV++F++IATPLT+LT+KG  F W+  CE SF+ LKQ+LV+  VLT+  G G FV+YSDASK+GLGCVLMQ+GKV+AY SRQLK +E+NYP
Subjt:  LGLVGYYRRFVQDFAKIATPLTELTKKGKSFTWNDQCEVSFKELKQRLVSDAVLTISSGDGGFVVYSDASKRGLGCVLMQRGKVIAYVSRQLKEYERNYP

Query:  THDLELAAVVFALKIWRHYLYRERTQIFTDHKSLKYFFTQKELNVRQRRWLELVKDYDVEILYHPGKANVVADALSQKLAHTSALITTQKAIQQDLERAE
        THDLELAAVVFALKIWRHYLY E+ QIFTDHKSLKYFFTQKELN+RQRRWLELVKDYD EILYHPGKANVVADALS+K++H++ALIT Q  + +DLER E
Subjt:  THDLELAAVVFALKIWRHYLYRERTQIFTDHKSLKYFFTQKELNVRQRRWLELVKDYDVEILYHPGKANVVADALSQKLAHTSALITTQKAIQQDLERAE

Query:  VAVAMGEVAAQLAQLTIQPTLRQRLIDKQHGDPDLVGKRRLIDANQTEEFSLTADGGAL
        +AV++G V  QLAQLT+QPTLRQR+ID Q  DP LV KR L +A QT+EFSL++DGG L
Subjt:  VAVAMGEVAAQLAQLTIQPTLRQRLIDKQHGDPDLVGKRRLIDANQTEEFSLTADGGAL

A0A5A7UV42 Reverse transcriptase | 2.6e-150 | 72.7
Query:  MGLMNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSF
        M LMN+VFREFLD F+IVFIDDIL+YSKT  +HEEHLRKVL T R NKL+AKFSKCEFW +QV FLGHV+  +G++VDP KIEAVT W RP+TV+EVRSF
Subjt:  MGLMNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSF

Query:  LGLVGYYRRFVQDFAKIATPLTELTKKGKSFTWNDQCEVSFKELKQRLVSDAVLTISSGDGGFVVYSDASKRGLGCVLMQRGKVIAYVSRQLKEYERNYP
        LGL GYYRRFV++F++IATPLT+LT+KG  F W+  CE SF+ LKQ+LV+  VLT+  G G FV+YSDASK+GLGCVLM +GKV+AY SRQLK +E+NYP
Subjt:  LGLVGYYRRFVQDFAKIATPLTELTKKGKSFTWNDQCEVSFKELKQRLVSDAVLTISSGDGGFVVYSDASKRGLGCVLMQRGKVIAYVSRQLKEYERNYP

Query:  THDLELAAVVFALKIWRHYLYRERTQIFTDHKSLKYFFTQKELNVRQRRWLELVKDYDVEILYHPGKANVVADALSQKLAHTSALITTQKAIQQDLERAE
        THDLELAAVVFALKIWRHYLY E+ QIFTDHKSLKYFFTQKELN+RQRRWLELVKDYD EILYHPGKANVVADALS+K++H++ALIT Q  + +DLERAE
Subjt:  THDLELAAVVFALKIWRHYLYRERTQIFTDHKSLKYFFTQKELNVRQRRWLELVKDYDVEILYHPGKANVVADALSQKLAHTSALITTQKAIQQDLERAE

Query:  VAVAMGEVAAQLAQLTIQPTLRQRLIDKQHGDPDLVGKRRLIDANQTEEFSLTADGGAL
        +AV++G V  QLAQLT+QPTLRQR+ID Q  DP LV KR L +A Q  EFSL++DGG L
Subjt:  VAVAMGEVAAQLAQLTIQPTLRQRLIDKQHGDPDLVGKRRLIDANQTEEFSLTADGGAL

A0A5D3BPI1 Reverse transcriptase | 1.2e-150 | 73.26
Query:  MGLMNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSF
        M LMN+VFREFLD F+IVFIDDIL+YSKT  +HEEHLR VL T R NKL+AKFSKCEFW +QV FLGHVV   G++VDP KIEAVT W RP+TV+EVRSF
Subjt:  MGLMNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSF

Query:  LGLVGYYRRFVQDFAKIATPLTELTKKGKSFTWNDQCEVSFKELKQRLVSDAVLTISSGDGGFVVYSDASKRGLGCVLMQRGKVIAYVSRQLKEYERNYP
        LGL GYYRRFV++F++IATPLT+LT+KG  F W+  CE SF+ LKQ+LV+  VLT+  G G FV+YSDASK+GLGCVLMQ+GKV+AY SRQLK +E+NYP
Subjt:  LGLVGYYRRFVQDFAKIATPLTELTKKGKSFTWNDQCEVSFKELKQRLVSDAVLTISSGDGGFVVYSDASKRGLGCVLMQRGKVIAYVSRQLKEYERNYP

Query:  THDLELAAVVFALKIWRHYLYRERTQIFTDHKSLKYFFTQKELNVRQRRWLELVKDYDVEILYHPGKANVVADALSQKLAHTSALITTQKAIQQDLERAE
        THDLELAAVVFALKIWRHYLY E+ QIFTDHKSLKYFFTQKELN+RQRRWLELVKDYD EILYHPGKANVVADALS+K++H++ALIT Q  + +DLERAE
Subjt:  THDLELAAVVFALKIWRHYLYRERTQIFTDHKSLKYFFTQKELNVRQRRWLELVKDYDVEILYHPGKANVVADALSQKLAHTSALITTQKAIQQDLERAE

Query:  VAVAMGEVAAQLAQLTIQPTLRQRLIDKQHGDPDLVGKRRLIDANQTEEFSLTADGGAL
        +AV++G V  QLAQLT+QPTLRQR+ID Q  DP LV KR L +A QT EFSL++DGG L
Subjt:  VAVAMGEVAAQLAQLTIQPTLRQRLIDKQHGDPDLVGKRRLIDANQTEEFSLTADGGAL

Q84KB0 Pol protein | 2.0e-150 | 73.26
Query:  MGLMNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSF
        M LMN+VFREFLD F+IVFIDDIL+YSKT  +HEEHLR VL T R NKL+AKFSKCEFW +QV FLGHVV   G++VDP KIEAVT W RP+TV+EVRSF
Subjt:  MGLMNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSF

Query:  LGLVGYYRRFVQDFAKIATPLTELTKKGKSFTWNDQCEVSFKELKQRLVSDAVLTISSGDGGFVVYSDASKRGLGCVLMQRGKVIAYVSRQLKEYERNYP
        LGL GYYRRFV++F++IATPLT+LT+KG  F W+  CE SF+ LKQ+LV+  VLT+  G G FV+YSDASK+GLGCVLMQ+GKV+AY SRQLK +E+NYP
Subjt:  LGLVGYYRRFVQDFAKIATPLTELTKKGKSFTWNDQCEVSFKELKQRLVSDAVLTISSGDGGFVVYSDASKRGLGCVLMQRGKVIAYVSRQLKEYERNYP

Query:  THDLELAAVVFALKIWRHYLYRERTQIFTDHKSLKYFFTQKELNVRQRRWLELVKDYDVEILYHPGKANVVADALSQKLAHTSALITTQKAIQQDLERAE
        THDLELAAVVFALKIWRHYLY E+ QIFTDHKSLKYFFTQKELN+RQRRWLELVKDYD EILYHPGKANVVADALS+K++H++ALIT Q  + +DLERAE
Subjt:  THDLELAAVVFALKIWRHYLYRERTQIFTDHKSLKYFFTQKELNVRQRRWLELVKDYDVEILYHPGKANVVADALSQKLAHTSALITTQKAIQQDLERAE

Query:  VAVAMGEVAAQLAQLTIQPTLRQRLIDKQHGDPDLVGKRRLIDANQTEEFSLTADGGAL
        +AV +G V  QLAQLT+QPTLRQR+ID Q  DP LV KR L +A QT EFSL++DGG L
Subjt:  VAVAMGEVAAQLAQLTIQPTLRQRLIDKQHGDPDLVGKRRLIDANQTEEFSLTADGGAL

SwissProt top hits | e value | % identity
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 | 1.7e-53 | 39.12
Query:  MNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSFLGL
        MN + R  L+   +V++DDI+V+S + ++H + L  V        L  +  KCEF  ++  FLGHV+   GI  +P KIEA+  +P PT   E+++FLGL
Subjt:  MNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSFLGL

Query:  VGYYRRFVQDFAKIATPLTELTKKG-KSFTWNDQCEVSFKELKQRLVSDAVLTISSGDGGFVVYSDASKRGLGCVLMQRGKVIAYVSRQLKEYERNYPTH
         GYYR+F+ +FA IA P+T+  KK  K  T N + + +FK+LK  +  D +L +      F + +DAS   LG VL Q G  ++Y+SR L E+E NY T 
Subjt:  VGYYRRFVQDFAKIATPLTELTKKG-KSFTWNDQCEVSFKELKQRLVSDAVLTISSGDGGFVVYSDASKRGLGCVLMQRGKVIAYVSRQLKEYERNYPTH

Query:  DLELAAVVFALKIWRHYLYRERTQIFTDHKSLKYFFTQKELNVRQRRWLELVKDYDVEILYHPGKANVVADALSQ-KLAHTSALITTQKAIQQD
        + EL A+V+A K +RHYL     +I +DH+ L + +  K+ N +  RW   + ++D +I Y  GK N VADALS+ KL  T     TQ + ++D
Subjt:  DLELAAVVFALKIWRHYLYRERTQIFTDHKSLKYFFTQKELNVRQRRWLELVKDYDVEILYHPGKANVVADALSQ-KLAHTSALITTQKAIQQD

P0CT41 Transposon Tf2-12 polyprotein | 2.3e-39 | 32.65
Query:  MNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSFLGL
        +N +  E  ++ ++ ++DDIL++SK+  +H +H++ VL   +   L    +KCEF   QV F+G+ +   G T     I+ V  W +P    E+R FLG 
Subjt:  MNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSFLGL

Query:  VGYYRRFVQDFAKIATPLTELTKKGKSFTWNDQCEVSFKELKQRLVSDAVLTISSGDGGFVVYSDASKRGLGCVLMQRGK-----VIAYVSRQLKEYERN
        V Y R+F+   +++  PL  L KK   + W      + + +KQ LVS  VL         ++ +DAS   +G VL Q+        + Y S ++ + + N
Subjt:  VGYYRRFVQDFAKIATPLTELTKKGKSFTWNDQCEVSFKELKQRLVSDAVLTISSGDGGFVVYSDASKRGLGCVLMQRGK-----VIAYVSRQLKEYERN

Query:  YPTHDLELAAVVFALKIWRHYLYR--ERTQIFTDHKSLKYFFTQKE--LNVRQRRWLELVKDYDVEILYHPGKANVVADALSQKLAHTSAL
        Y   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALS+ +  T  +
Subjt:  YPTHDLELAAVVFALKIWRHYLYR--ERTQIFTDHKSLKYFFTQKE--LNVRQRRWLELVKDYDVEILYHPGKANVVADALSQKLAHTSAL

P10401 Retrovirus-related Pol polyprotein from transposon gypsy | 3.7e-45 | 35.07
Query:  MNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSFLGL
        ++ V RE +     V++DD++++S+    H  H+  VL       +     K  F+ E V +LG +V   G   DP K++A+  +P P  V +VRSFLGL
Subjt:  MNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSFLGL

Query:  VGYYRRFVQDFAKIATPLTELTK-----------KGKSFTWNDQCEVSFKELKQRLVS-DAVLTISSGDGGFVVYSDASKRGLGCVLMQRGKVIAYVSRQ
          YYR F++DFA IA P+T++ K           K     +N+    +F+ L+  L S D +L        F + +DAS  G+G VL Q G+ I  +SR 
Subjt:  VGYYRRFVQDFAKIATPLTELTK-----------KGKSFTWNDQCEVSFKELKQRLVS-DAVLTISSGDGGFVVYSDASKRGLGCVLMQRGKVIAYVSRQ

Query:  LKEYERNYPTHDLELAAVVFALKIWRHYLYRER-TQIFTDHKSLKYFFTQKELNVRQRRWLELVKDYDVEILYHPGKANVVADALSQK
        LK+ E+NY T++ EL A+V+AL   +++LY  R   IFTDH+ L +    +  N + +RW   +  ++ ++ Y PGK N VADALS++
Subjt:  LKEYERNYPTHDLELAAVVFALKIWRHYLYRER-TQIFTDHKSLKYFFTQKELNVRQRRWLELVKDYDVEILYHPGKANVVADALSQK

P20825 Retrovirus-related Pol polyprotein from transposon 297 | 6.5e-50 | 36.39
Query:  MNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSFLGL
        MN + R  L+   +V++DDI+++S +  +H   ++ V T      L  +  KCEF  ++  FLGH+V   GI  +P K++A+ ++P PT   E+R+FLGL
Subjt:  MNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSFLGL

Query:  VGYYRRFVQDFAKIATPLTE-LTKKGKSFTWNDQCEVSFKELKQRLVSDAVLTISSGDGGFVVYSDASKRGLGCVLMQRGKVIAYVSRQLKEYERNYPTH
         GYYR+F+ ++A IA P+T  L K+ K  T   +   +F++LK  ++ D +L +   +  FV+ +DAS   LG VL Q G  I+++SR L ++E NY   
Subjt:  VGYYRRFVQDFAKIATPLTE-LTKKGKSFTWNDQCEVSFKELKQRLVSDAVLTISSGDGGFVVYSDASKRGLGCVLMQRGKVIAYVSRQLKEYERNYPTH

Query:  DLELAAVVFALKIWRHYLYRERTQIFTDHKSLKYFFTQKELNVRQRRWLELVKDYDVEILYHPGKANVVADALSQ-KLAHTSALITTQKAIQQD
        + EL A+V+A K +RHYL   +  I +DH+ L++    KE   +  RW   + +Y  +I Y  GK N VADALS+ K+        TQ + ++D
Subjt:  DLELAAVVFALKIWRHYLYRERTQIFTDHKSLKYFFTQKELNVRQRRWLELVKDYDVEILYHPGKANVVADALSQ-KLAHTSALITTQKAIQQD

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus | 1.9e-49 | 36.57
Query:  LMNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSFLG
        +++ + RE +     V+IDDI+V+S+  + H ++LR VL +     L     K  F   QV FLG++V + GI  DP K+ A++  P PT+V E++ FLG
Subjt:  LMNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSFLG

Query:  LVGYYRRFVQDFAKIATPLTELT-------KKGKS----FTWNDQCEVSFKELKQRLVSDAVLTISSGDGGFVVYSDASKRGLGCVLMQ----RGKVIAY
        +  YYR+F+QD+AK+A PLT LT       K  +S     T ++    SF +LK  L S  +L        F + +DAS   +G VL Q    R + IAY
Subjt:  LVGYYRRFVQDFAKIATPLTELT-------KKGKS----FTWNDQCEVSFKELKQRLVSDAVLTISSGDGGFVVYSDASKRGLGCVLMQ----RGKVIAY

Query:  VSRQLKEYERNYPTHDLELAAVVFALKIWRHYLYRERT-QIFTDHKSLKYFFTQKELNVRQRRWLELVKDYDVEILYHPGKANVVADALSQKLAHTSALI
        +SR L + E NY T + E+ A++++L   R YLY   T +++TDH+ L +    +  N + +RW   +++Y+ E++Y PGK+NVVADALS+     + L 
Subjt:  VSRQLKEYERNYPTHDLELAAVVFALKIWRHYLYRERT-QIFTDHKSLKYFFTQKELNVRQRRWLELVKDYDVEILYHPGKANVVADALSQKLAHTSALI

Query:  TTQKAIQQD
        T   A  +D
Subjt:  TTQKAIQQD

Arabidopsis top hits | e value | % identity
ATMG00860.1 DNA/RNA polymerases superfamily protein | 8.2e-24 | 42.28
Query:  HLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLG--HVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSFLGLVGYYRRFVQDFAKIATPLTELTKKGKSFTW
        HL  VL     ++ +A   KC F   Q+ +LG  H++   G++ DP K+EA+  WP P   TE+R FLGL GYYRRFV+++ KI  PLTEL KK  S  W
Subjt:  HLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLG--HVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSFLGLVGYYRRFVQDFAKIATPLTELTKKGKSFTW

Query:  NDQCEVSFKELKQRLVSDAVLTI
         +   ++FK LK  + +  VL +
Subjt:  NDQCEVSFKELKQRLVSDAVLTI
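
The % identity values in the hit tables are derived from the alignment midlines, where a letter marks an identical residue and `+` marks a conservative substitution. The sketch below counts both for a single alignment block. It is an illustration only: the function name `block_identity` is hypothetical, and it pads the midline to the query length on the assumption that trailing blanks may have been trimmed.

```python
def block_identity(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment block."""
    columns = len(query)
    # Pad in case trailing spaces of the midline were stripped.
    midline = midline.ljust(columns)[:columns]
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, columns


# Final block of the Arabidopsis hit, copied from the record above.
query = "NDQCEVSFKELKQRLVSDAVLTI"
midline = " +   ++FK LK  + +  VL +"

ident, pos, cols = block_identity(query, midline)
print(f"{ident}/{cols} identical ({100 * ident / cols:.2f}%), {pos}/{cols} positive")
```

The reported % identity for a hit is computed over all blocks of its alignment together, so a single block such as this one will generally give a different percentage from the table value.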


Sequences
CDS sequence
ATGGGGCTGATGAACAAAGTGTTTAGAGAGTTTTTAGACAACTTCATGATCGTCTTCATCGATGATATCCTGGTGTACTCCAAGACGGGGGAACAACACGAGGAACATCT
CCGGAAAGTGCTGACCACTCCGAGAACGAACAAGTTGTTTGCAAAGTTCTCCAAGTGTGAATTCTGGGCAGAACAGGTGGGATTTCTTGGGCATGTAGTTTTAAGCATGG
GAATCACCGTAGACCCAACAAAGATTGAAGCGGTGACGAATTGGCCTCGCCCGACCACAGTGACGGAAGTACGAAGTTTCCTTGGACTAGTAGGGTATTATCGGCGGTTT
GTGCAGGATTTCGCGAAGATCGCCACCCCCCTCACTGAGTTAACCAAGAAAGGAAAGTCGTTCACCTGGAATGATCAGTGTGAGGTTAGTTTCAAAGAACTCAAACAGAG
GTTAGTATCTGACGCGGTACTCACTATATCGTCAGGAGATGGAGGCTTTGTAGTATACAGCGATGCTTCAAAAAGGGGGTTAGGATGTGTCCTTATGCAACGTGGGAAGG
TTATTGCCTATGTTTCACGTCAACTAAAAGAATACGAGCGAAACTATCCCACCCACGACCTAGAACTCGCAGCAGTGGTATTTGCATTGAAGATTTGGAGACACTACCTT
TATAGAGAAAGGACCCAAATTTTCACTGACCACAAGAGCCTTAAATACTTCTTTACTCAGAAGGAGTTGAACGTGAGACAACGCAGGTGGTTGGAGTTGGTTAAAGATTA
TGATGTGGAGATACTCTATCACCCGGGAAAGGCGAATGTGGTAGCTGATGCGTTGAGCCAAAAATTAGCACACACGTCTGCCCTAATCACTACACAAAAAGCGATCCAGC
AGGATCTAGAACGCGCCGAGGTAGCAGTGGCCATGGGGGAAGTCGCCGCTCAATTGGCCCAGTTAACGATACAACCAACCTTGAGGCAACGTCTTATTGATAAACAGCAT
GGTGATCCGGATTTGGTTGGAAAAAGACGTTTGATAGATGCTAATCAGACAGAGGAGTTTTCATTGACAGCTGATGGGGGGGCTCTTGTACCATGGGCGGTTGTGTGTGC
CAAATGA
mRNA sequence
ATGGGGCTGATGAACAAAGTGTTTAGAGAGTTTTTAGACAACTTCATGATCGTCTTCATCGATGATATCCTGGTGTACTCCAAGACGGGGGAACAACACGAGGAACATCT
CCGGAAAGTGCTGACCACTCCGAGAACGAACAAGTTGTTTGCAAAGTTCTCCAAGTGTGAATTCTGGGCAGAACAGGTGGGATTTCTTGGGCATGTAGTTTTAAGCATGG
GAATCACCGTAGACCCAACAAAGATTGAAGCGGTGACGAATTGGCCTCGCCCGACCACAGTGACGGAAGTACGAAGTTTCCTTGGACTAGTAGGGTATTATCGGCGGTTT
GTGCAGGATTTCGCGAAGATCGCCACCCCCCTCACTGAGTTAACCAAGAAAGGAAAGTCGTTCACCTGGAATGATCAGTGTGAGGTTAGTTTCAAAGAACTCAAACAGAG
GTTAGTATCTGACGCGGTACTCACTATATCGTCAGGAGATGGAGGCTTTGTAGTATACAGCGATGCTTCAAAAAGGGGGTTAGGATGTGTCCTTATGCAACGTGGGAAGG
TTATTGCCTATGTTTCACGTCAACTAAAAGAATACGAGCGAAACTATCCCACCCACGACCTAGAACTCGCAGCAGTGGTATTTGCATTGAAGATTTGGAGACACTACCTT
TATAGAGAAAGGACCCAAATTTTCACTGACCACAAGAGCCTTAAATACTTCTTTACTCAGAAGGAGTTGAACGTGAGACAACGCAGGTGGTTGGAGTTGGTTAAAGATTA
TGATGTGGAGATACTCTATCACCCGGGAAAGGCGAATGTGGTAGCTGATGCGTTGAGCCAAAAATTAGCACACACGTCTGCCCTAATCACTACACAAAAAGCGATCCAGC
AGGATCTAGAACGCGCCGAGGTAGCAGTGGCCATGGGGGAAGTCGCCGCTCAATTGGCCCAGTTAACGATACAACCAACCTTGAGGCAACGTCTTATTGATAAACAGCAT
GGTGATCCGGATTTGGTTGGAAAAAGACGTTTGATAGATGCTAATCAGACAGAGGAGTTTTCATTGACAGCTGATGGGGGGGCTCTTGTACCATGGGCGGTTGTGTGTGC
CAAATGA
Protein sequence
MGLMNKVFREFLDNFMIVFIDDILVYSKTGEQHEEHLRKVLTTPRTNKLFAKFSKCEFWAEQVGFLGHVVLSMGITVDPTKIEAVTNWPRPTTVTEVRSFLGLVGYYRRF
VQDFAKIATPLTELTKKGKSFTWNDQCEVSFKELKQRLVSDAVLTISSGDGGFVVYSDASKRGLGCVLMQRGKVIAYVSRQLKEYERNYPTHDLELAAVVFALKIWRHYL
YRERTQIFTDHKSLKYFFTQKELNVRQRRWLELVKDYDVEILYHPGKANVVADALSQKLAHTSALITTQKAIQQDLERAEVAVAMGEVAAQLAQLTIQPTLRQRLIDKQH
GDPDLVGKRRLIDANQTEEFSLTADGGALVPWAVVCAK
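
The relationship between the CDS and protein sequences above can be checked by translating the CDS with the standard genetic code. The following is a minimal sketch, assuming the standard code (NCBI translation table 1) applies to this nuclear gene; the function name `translate` and the short test fragments are chosen for illustration, with the fragments copied from the start and end of the CDS listed in this record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt and last 21 nt of the CDS in this record.
print(translate("ATGGGGCTGATGAACAAAGTGTTTAGAGAG"))  # MGLMNKVFRE
print(translate("GCGGTTGTGTGTGCCAAATGA"))           # AVVCAK
```

Both fragments reproduce the corresponding ends of the protein sequence listed above. Joining the wrapped CDS lines into one string and passing it to `translate` would allow the same comparison over the full length.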