CuGenDBv2

Cmc03g0074461 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc03g0074461
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr03:21965484..21966537
RNA-Seq Expression: Cmc03g0074461
Synteny: Cmc03g0074461
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0043227 - membrane-bounded organelle (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0031931.1 pol protein [Cucumis melo var. makuwa] (e value: 4.3e-156; % identity: 81.48)
Query:  MAPVELKELKVQLEELLDKGFIRPSVSSWGAPVLYVKKNDGSMRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQK
        MAP ELKELKVQL+ELLDKGFIRPS+S WGAPVL+VKK DGSMRLCI+YRELNKVTVKN+YPL RIDDLFDQLQGATV SKIDLRS YHQLRIKD DV K
Subjt:  MAPVELKELKVQLEELLDKGFIRPSVSSWGAPVLYVKKNDGSMRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQK

Query:  TTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAG
        T F SRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHL +VL+T R NKLYAKFSKCEFW KQVSFLGHVVSKAG
Subjt:  TTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAG

Query:  VSIDPTKIEAVTSWSRPSTVSE------------------------------EGTPFVWSKACEDSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDL
        VS+DP KIEAVT W+RPSTVSE                              +G PFVWSKACEDSFQNLKQKLVT LVLT P GSGSFVIYSD SKK L
Subjt:  VSIDPTKIEAVTSWSRPSTVSE------------------------------EGTPFVWSKACEDSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDL

Query:  GCVLMQQGKVVSYASRQLKSHDQSYPTHDLELAAVVFALKIWRHYLYGEKI
        GCVLMQQGKVV+YASRQLKSH+Q+YPTHDLELAAVVFALKIWRHYLYGEKI
Subjt:  GCVLMQQGKVVSYASRQLKSHDQSYPTHDLELAAVVFALKIWRHYLYGEKI

KAA0036671.1 gag protease polyprotein [Cucumis melo var. makuwa] (e value: 3.3e-156; % identity: 81.77)
Query:  MAPVELKELKVQLEELLDKGFIRPSVSSWGAPVLYVKKNDGSMRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQK
        MAP ELKELKVQL+ELLDKGFIRPSVS WGAPVL+VKK DGSMRLCI+YRELNKVTVKN+YPL RIDDLFDQLQGATV SKIDLRS YHQLRIKD DV K
Subjt:  MAPVELKELKVQLEELLDKGFIRPSVSSWGAPVLYVKKNDGSMRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQK

Query:  TTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAG
        T F SRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHL MVL+T R NKLYAKFSKCEFW KQVSFLGHVVSKAG
Subjt:  TTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAG

Query:  VSIDPTKIEAVTSWSRPSTVSE------------------------------EGTPFVWSKACEDSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDL
        VS+DP KIEAVT W+RPSTVSE                              +GTPFVWSKACEDSFQNLKQKLVT  VLT P GSGSFVIYSD SKK L
Subjt:  VSIDPTKIEAVTSWSRPSTVSE------------------------------EGTPFVWSKACEDSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDL

Query:  GCVLMQQGKVVSYASRQLKSHDQSYPTHDLELAAVVFALKIWRHYLYGEKI
        GCVLMQQGKVV+YASRQLK H+Q+YPTHDLELAAVVFALKIWRHYLYGEKI
Subjt:  GCVLMQQGKVVSYASRQLKSHDQSYPTHDLELAAVVFALKIWRHYLYGEKI

KAA0048687.1 pol protein [Cucumis melo var. makuwa] (e value: 3.3e-156; % identity: 81.77)
Query:  MAPVELKELKVQLEELLDKGFIRPSVSSWGAPVLYVKKNDGSMRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQK
        MAP ELKELKVQL+ELLDKGFIRPSVS WGAPVL+VKK DGSMRLCI+YRELNKVTVKN+YPL RIDDLFDQLQGATV SKIDLRS YHQLRIKD DV K
Subjt:  MAPVELKELKVQLEELLDKGFIRPSVSSWGAPVLYVKKNDGSMRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQK

Query:  TTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAG
        T F SRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHL MVL+T R NKLYAKFSKCEFW KQVSFLGHVVSKAG
Subjt:  TTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAG

Query:  VSIDPTKIEAVTSWSRPSTVSE------------------------------EGTPFVWSKACEDSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDL
        VS+DP KIEAVT W+RPSTVSE                              +G PFVWSKACEDSFQNLKQKLVT  VLT P GSGSFVIYSD SKK L
Subjt:  VSIDPTKIEAVTSWSRPSTVSE------------------------------EGTPFVWSKACEDSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDL

Query:  GCVLMQQGKVVSYASRQLKSHDQSYPTHDLELAAVVFALKIWRHYLYGEKI
        GCVLMQQGKVV+YASRQLKSH+Q+YPTHDLELAAVVFALKIWRHYLYGEKI
Subjt:  GCVLMQQGKVVSYASRQLKSHDQSYPTHDLELAAVVFALKIWRHYLYGEKI

KAA0059069.1 pol protein [Cucumis melo var. makuwa] (e value: 1.9e-156; % identity: 87.35)
Query:  MAPVELKELKVQLEELLDKGFIRPSVSSWGAPVLYVKKNDGSMRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQK
        MAP ELKELKVQL+ELLDKGFIRPSVS WGAPVL+VKK DGSMRLCI+YRELNKVTVKN+YPL RIDDLFDQLQGATV SKIDLRS YHQLRIKD DV K
Subjt:  MAPVELKELKVQLEELLDKGFIRPSVSSWGAPVLYVKKNDGSMRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQK

Query:  TTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAG
        TTF SRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFL TFVIVFIDDILIYSKTEAEHEEHL +VL+T R NKLYAKFSKCEFW KQVSFLGHVVSKAG
Subjt:  TTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAG

Query:  VSIDPTKIEAVTSWSRPSTVSE---EGTPFVWSKACEDSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDLGCVLMQQGKVVSYASRQLKSHDQSYPT
        VS+DP  IEAVT+W+RPSTVSE   +G PFVWSKACEDSFQNLK KLVT  VLT P GSGSFVIYSD SKK LGCVLMQQ KVV+YASRQLKSH+Q+YPT
Subjt:  VSIDPTKIEAVTSWSRPSTVSE---EGTPFVWSKACEDSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDLGCVLMQQGKVVSYASRQLKSHDQSYPT

Query:  HDLELAAVVFALKIWRHYLYGEKI
        HDLELAAVVFALKIWRHYLYGEKI
Subjt:  HDLELAAVVFALKIWRHYLYGEKI

TYK11727.1 putative polyprotein [Cucumis melo var. makuwa] (e value: 4.6e-166; % identity: 93.15)
Query:  MAPVELKELKVQLEELLDKGFIRPSVSSWGAPVLYVKKNDGSMRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQK
        MAPVELKELKVQLEELLDKGFIRPSVS WGAPVLYVKK DGSMRLCI+YRELNKV VKNKYPLLR+DDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQK
Subjt:  MAPVELKELKVQLEELLDKGFIRPSVSSWGAPVLYVKKNDGSMRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQK

Query:  TTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAG
        TTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEH+EHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAG
Subjt:  TTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAG

Query:  VSIDPTKIEAVTSWSRPSTVSEEGTPFVWSKACEDSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDLGCVLMQQGKVVSYASRQLKSHDQSYPTHDL
        VSIDPTKIEAVTSWSRPSTVSE         ACEDSFQNLKQKLVTT VLTRP GSGSFVIYSDTSKKDLGCVLMQQGKVVSYASRQLKSH+QSYPTHDL
Subjt:  VSIDPTKIEAVTSWSRPSTVSEEGTPFVWSKACEDSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDLGCVLMQQGKVVSYASRQLKSHDQSYPTHDL

Query:  ELAAVVFALKIWRHYLYGEKI
        ELAAVVFALKIWRHYL+ E +
Subjt:  ELAAVVFALKIWRHYLYGEKI

TrEMBL top hits (columns: e value, % identity, alignment)
A0A5A7SQU8 Reverse transcriptase (e value: 2.1e-156; % identity: 81.48)
Query:  MAPVELKELKVQLEELLDKGFIRPSVSSWGAPVLYVKKNDGSMRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQK
        MAP ELKELKVQL+ELLDKGFIRPS+S WGAPVL+VKK DGSMRLCI+YRELNKVTVKN+YPL RIDDLFDQLQGATV SKIDLRS YHQLRIKD DV K
Subjt:  MAPVELKELKVQLEELLDKGFIRPSVSSWGAPVLYVKKNDGSMRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQK

Query:  TTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAG
        T F SRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHL +VL+T R NKLYAKFSKCEFW KQVSFLGHVVSKAG
Subjt:  TTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAG

Query:  VSIDPTKIEAVTSWSRPSTVSE------------------------------EGTPFVWSKACEDSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDL
        VS+DP KIEAVT W+RPSTVSE                              +G PFVWSKACEDSFQNLKQKLVT LVLT P GSGSFVIYSD SKK L
Subjt:  VSIDPTKIEAVTSWSRPSTVSE------------------------------EGTPFVWSKACEDSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDL

Query:  GCVLMQQGKVVSYASRQLKSHDQSYPTHDLELAAVVFALKIWRHYLYGEKI
        GCVLMQQGKVV+YASRQLKSH+Q+YPTHDLELAAVVFALKIWRHYLYGEKI
Subjt:  GCVLMQQGKVVSYASRQLKSHDQSYPTHDLELAAVVFALKIWRHYLYGEKI

A0A5A7T538 Reverse transcriptase (e value: 1.6e-156; % identity: 81.77)
Query:  MAPVELKELKVQLEELLDKGFIRPSVSSWGAPVLYVKKNDGSMRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQK
        MAP ELKELKVQL+ELLDKGFIRPSVS WGAPVL+VKK DGSMRLCI+YRELNKVTVKN+YPL RIDDLFDQLQGATV SKIDLRS YHQLRIKD DV K
Subjt:  MAPVELKELKVQLEELLDKGFIRPSVSSWGAPVLYVKKNDGSMRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQK

Query:  TTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAG
        T F SRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHL MVL+T R NKLYAKFSKCEFW KQVSFLGHVVSKAG
Subjt:  TTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAG

Query:  VSIDPTKIEAVTSWSRPSTVSE------------------------------EGTPFVWSKACEDSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDL
        VS+DP KIEAVT W+RPSTVSE                              +GTPFVWSKACEDSFQNLKQKLVT  VLT P GSGSFVIYSD SKK L
Subjt:  VSIDPTKIEAVTSWSRPSTVSE------------------------------EGTPFVWSKACEDSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDL

Query:  GCVLMQQGKVVSYASRQLKSHDQSYPTHDLELAAVVFALKIWRHYLYGEKI
        GCVLMQQGKVV+YASRQLK H+Q+YPTHDLELAAVVFALKIWRHYLYGEKI
Subjt:  GCVLMQQGKVVSYASRQLKSHDQSYPTHDLELAAVVFALKIWRHYLYGEKI

A0A5A7U330 Reverse transcriptase (e value: 1.6e-156; % identity: 81.77)
Query:  MAPVELKELKVQLEELLDKGFIRPSVSSWGAPVLYVKKNDGSMRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQK
        MAP ELKELKVQL+ELLDKGFIRPSVS WGAPVL+VKK DGSMRLCI+YRELNKVTVKN+YPL RIDDLFDQLQGATV SKIDLRS YHQLRIKD DV K
Subjt:  MAPVELKELKVQLEELLDKGFIRPSVSSWGAPVLYVKKNDGSMRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQK

Query:  TTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAG
        T F SRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHL MVL+T R NKLYAKFSKCEFW KQVSFLGHVVSKAG
Subjt:  TTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAG

Query:  VSIDPTKIEAVTSWSRPSTVSE------------------------------EGTPFVWSKACEDSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDL
        VS+DP KIEAVT W+RPSTVSE                              +G PFVWSKACEDSFQNLKQKLVT  VLT P GSGSFVIYSD SKK L
Subjt:  VSIDPTKIEAVTSWSRPSTVSE------------------------------EGTPFVWSKACEDSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDL

Query:  GCVLMQQGKVVSYASRQLKSHDQSYPTHDLELAAVVFALKIWRHYLYGEKI
        GCVLMQQGKVV+YASRQLKSH+Q+YPTHDLELAAVVFALKIWRHYLYGEKI
Subjt:  GCVLMQQGKVVSYASRQLKSHDQSYPTHDLELAAVVFALKIWRHYLYGEKI

A0A5A7UZV8 Reverse transcriptase (e value: 9.4e-157; % identity: 87.35)
Query:  MAPVELKELKVQLEELLDKGFIRPSVSSWGAPVLYVKKNDGSMRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQK
        MAP ELKELKVQL+ELLDKGFIRPSVS WGAPVL+VKK DGSMRLCI+YRELNKVTVKN+YPL RIDDLFDQLQGATV SKIDLRS YHQLRIKD DV K
Subjt:  MAPVELKELKVQLEELLDKGFIRPSVSSWGAPVLYVKKNDGSMRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQK

Query:  TTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAG
        TTF SRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFL TFVIVFIDDILIYSKTEAEHEEHL +VL+T R NKLYAKFSKCEFW KQVSFLGHVVSKAG
Subjt:  TTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAG

Query:  VSIDPTKIEAVTSWSRPSTVSE---EGTPFVWSKACEDSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDLGCVLMQQGKVVSYASRQLKSHDQSYPT
        VS+DP  IEAVT+W+RPSTVSE   +G PFVWSKACEDSFQNLK KLVT  VLT P GSGSFVIYSD SKK LGCVLMQQ KVV+YASRQLKSH+Q+YPT
Subjt:  VSIDPTKIEAVTSWSRPSTVSE---EGTPFVWSKACEDSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDLGCVLMQQGKVVSYASRQLKSHDQSYPT

Query:  HDLELAAVVFALKIWRHYLYGEKI
        HDLELAAVVFALKIWRHYLYGEKI
Subjt:  HDLELAAVVFALKIWRHYLYGEKI

A0A5D3CIH0 Putative polyprotein (e value: 2.2e-166; % identity: 93.15)
Query:  MAPVELKELKVQLEELLDKGFIRPSVSSWGAPVLYVKKNDGSMRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQK
        MAPVELKELKVQLEELLDKGFIRPSVS WGAPVLYVKK DGSMRLCI+YRELNKV VKNKYPLLR+DDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQK
Subjt:  MAPVELKELKVQLEELLDKGFIRPSVSSWGAPVLYVKKNDGSMRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQK

Query:  TTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAG
        TTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEH+EHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAG
Subjt:  TTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAG

Query:  VSIDPTKIEAVTSWSRPSTVSEEGTPFVWSKACEDSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDLGCVLMQQGKVVSYASRQLKSHDQSYPTHDL
        VSIDPTKIEAVTSWSRPSTVSE         ACEDSFQNLKQKLVTT VLTRP GSGSFVIYSDTSKKDLGCVLMQQGKVVSYASRQLKSH+QSYPTHDL
Subjt:  VSIDPTKIEAVTSWSRPSTVSEEGTPFVWSKACEDSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDLGCVLMQQGKVVSYASRQLKSHDQSYPTHDL

Query:  ELAAVVFALKIWRHYLYGEKI
        ELAAVVFALKIWRHYL+ E +
Subjt:  ELAAVVFALKIWRHYLYGEKI

SwissProt top hits (columns: e value, % identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 (e value: 8.2e-49; % identity: 32.76)
Query:  KELKVQLEELLDKGFIRPSVSSWGAPVLYV-KKNDGS----MRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQKT
        +E++ Q++++L++G IR S S + +P+  V KK D S     R+ I+YR+LN++TV +++P+  +D++  +L      + IDL   +HQ+ +    V KT
Subjt:  KELKVQLEELLDKGFIRPSVSSWGAPVLYV-KKNDGS----MRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQKT

Query:  TFHSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAGV
         F +++GHYE++ M FGL NAPA F   MN + R  L+   +V++DDI+++S +  EH + L +V E      L  +  KCEF  ++ +FLGHV++  G+
Subjt:  TFHSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAGV

Query:  SIDPTKIEAVTSWSRPSTVSE----EGTPFVWSK---------------------------ACEDSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDL
          +P KIEA+  +  P+   E     G    + K                             + +F+ LK  +    +L  P  +  F + +D S   L
Subjt:  SIDPTKIEAVTSWSRPSTVSE----EGTPFVWSK---------------------------ACEDSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDL

Query:  GCVLMQQGKVVSYASRQLKSHDQSYPTHDLELAAVVFALKIWRHYLYG
        G VL Q G  +SY SR L  H+ +Y T + EL A+V+A K +RHYL G
Subjt:  GCVLMQQGKVVSYASRQLKSHDQSYPTHDLELAAVVFALKIWRHYLYG

P0CT41 Transposon Tf2-12 polyprotein (e value: 2.5e-45; % identity: 29.91)
Query:  MAPVELKELKVQLEELLDKGFIRPSVSSWGAPVLYVKKNDGSMRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQK
        + P +++ +  ++ + L  G IR S +    PV++V K +G++R+ ++Y+ LNK    N YPL  I+ L  ++QG+T+ +K+DL+S YH +R++  D  K
Subjt:  MAPVELKELKVQLEELLDKGFIRPSVSSWGAPVLYVKKNDGSMRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQK

Query:  TTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAG
          F    G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H+  VL+  +   L    +KCEF   QV F+G+ +S+ G
Subjt:  TTFHSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAG

Query:  VSIDPTKIEAVTSWSRPSTVSE------------------------------EGTPFVWSKACEDSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDL
         +     I+ V  W +P    E                              +   + W+     + +N+KQ LV+  VL     S   ++ +D S   +
Subjt:  VSIDPTKIEAVTSWSRPSTVSE------------------------------EGTPFVWSKACEDSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDL

Query:  GCVLMQQGK-----VVSYASRQLKSHDQSYPTHDLELAAVVFALKIWRHYL
        G VL Q+        V Y S ++     +Y   D E+ A++ +LK WRHYL
Subjt:  GCVLMQQGK-----VVSYASRQLKSHDQSYPTHDLELAAVVFALKIWRHYL

P20825 Retrovirus-related Pol polyprotein from transposon 297 (e value: 4.8e-49; % identity: 31.81)
Query:  ELKVQLEELLDKGFIRPSVSSWGAPVLYV-KKNDGS----MRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQKTT
        E++ Q++E+L++G IR S S + +P   V KK D S     R+ I+YR+LN++T+ ++YP+  +D++  +L      + IDL   +HQ+ + +  + KT 
Subjt:  ELKVQLEELLDKGFIRPSVSSWGAPVLYV-KKNDGS----MRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQKTT

Query:  FHSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAGVS
        F ++ GHYE++ M FGL NAPA F   MN + R  L+   +V++DDI+I+S +  EH   + +V        L  +  KCEF  K+ +FLGH+V+  G+ 
Subjt:  FHSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAGVS

Query:  IDPTKIEAVTSW-----------------------------SRPSTVSEEGTPFVWSKACE--DSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDLG
         +P K++A+ S+                             ++P T   +    + ++  E  ++F+ LK  ++   +L  P     FV+ +D S   LG
Subjt:  IDPTKIEAVTSW-----------------------------SRPSTVSEEGTPFVWSKACE--DSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDLG

Query:  CVLMQQGKVVSYASRQLKSHDQSYPTHDLELAAVVFALKIWRHYLYGEK
         VL Q G  +S+ SR L  H+ +Y   + EL A+V+A K +RHYL G +
Subjt:  CVLMQQGKVVSYASRQLKSHDQSYPTHDLELAAVVFALKIWRHYLYGEK

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein (e value: 2.2e-46; % identity: 33.14)
Query:  KELKVQLEELLDKGFIRPSVSSWGAPVLYVKKNDGSMRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQKTTFHSR
        +E+   +++LLD  FI PS S   +PV+ V K DG+ RLC++YR LNK T+ + +PL RID+L  ++  A + + +DL S YHQ+ ++  D  KT F + 
Subjt:  KELKVQLEELLDKGFIRPSVSSWGAPVLYVKKNDGSMRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQKTTFHSR

Query:  YGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAGVSIDPT
         G YE+ VM FGL NAP+ F   M   FR+    FV V++DDILI+S++  EH +HL  VLE  +   L  K  KC+F  ++  FLG+ +    ++    
Subjt:  YGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAGVSIDPT

Query:  KIEAVTSWSRPSTV--------------------SEEGTPF--------VWSKACEDSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDLGCVLMQQG
        K  A+  +  P TV                    S+   P          W++  + + + LK  L  + VL       ++ + +D SK  +G VL +  
Subjt:  KIEAVTSWSRPSTV--------------------SEEGTPF--------VWSKACEDSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDLGCVLMQQG

Query:  K------VVSYASRQLKSHDQSYPTHDLELAAVVFALKIWRHYLYGE
               VV Y S+ L+S  ++YP  +LEL  ++ AL  +R+ L+G+
Subjt:  K------VVSYASRQLKSHDQSYPTHDLELAAVVFALKIWRHYLYGE

Q99315 Transposon Ty3-G Gag-Pol polyprotein (e value: 2.9e-46; % identity: 33.14)
Query:  KELKVQLEELLDKGFIRPSVSSWGAPVLYVKKNDGSMRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQKTTFHSR
        +E+   +++LLD  FI PS S   +PV+ V K DG+ RLC++YR LNK T+ + +PL RID+L  ++  A + + +DL S YHQ+ ++  D  KT F + 
Subjt:  KELKVQLEELLDKGFIRPSVSSWGAPVLYVKKNDGSMRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQKTTFHSR

Query:  YGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAGVSIDPT
         G YE+ VM FGL NAP+ F   M   FR+    FV V++DDILI+S++  EH +HL  VLE  +   L  K  KC+F  ++  FLG+ +    ++    
Subjt:  YGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAGVSIDPT

Query:  KIEAVTSWSRPSTV--------------------SEEGTPF--------VWSKACEDSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDLGCVLMQQG
        K  A+  +  P TV                    S+   P          W++  + +   LK  L  + VL       ++ + +D SK  +G VL +  
Subjt:  KIEAVTSWSRPSTV--------------------SEEGTPF--------VWSKACEDSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDLGCVLMQQG

Query:  K------VVSYASRQLKSHDQSYPTHDLELAAVVFALKIWRHYLYGE
               VV Y S+ L+S  ++YP  +LEL  ++ AL  +R+ L+G+
Subjt:  K------VVSYASRQLKSHDQSYPTHDLELAAVVFALKIWRHYLYGE

Arabidopsis top hits (columns: e value, % identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein (e value: 1.2e-07; % identity: 41.27)
Query:  HLCMVLETHRANKLYAKFSKCEFWFKQVSFLG--HVVSKAGVSIDPTKIEAVTSWSRPSTVSE
        HL MVL+    ++ YA   KC F   Q+++LG  H++S  GVS DP K+EA+  W  P   +E
Subjt:  HLCMVLETHRANKLYAKFSKCEFWFKQVSFLG--HVVSKAGVSIDPTKIEAVTSWSRPSTVSE


Sequences
CDS sequence
ATGGCCCCAGTAGAGTTGAAAGAGCTGAAAGTGCAGTTAGAGGAGTTGCTTGATAAAGGCTTCATTCGACCGAGTGTGTCATCTTGGGGTGCACCAGTTTTATATGTTAA
AAAGAATGATGGATCGATGCGCTTATGTATTGAATACAGAGAGTTGAATAAGGTAACCGTTAAGAACAAATATCCCTTGCTCAGGATCGACGATCTATTTGACCAATTAC
AGGGAGCTACAGTCTTATCTAAGATCGACCTTCGGTCAAGATATCATCAGTTGAGGATTAAGGATATTGATGTACAGAAGACAACCTTTCATTCCAGATATGGACACTAT
GAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCAGCAGTGTTTATGGATTTGATGAACAGAGTGTTCAGGGAGTTCCTAGACACTTTTGTTATCGTGTTCATTGA
CGATATTTTGATATATTCCAAGACAGAGGCAGAGCATGAGGAGCATTTATGCATGGTTCTCGAAACCCATCGAGCTAATAAATTGTATGCAAAGTTCTCAAAATGTGAGT
TTTGGTTTAAGCAGGTATCTTTTCTAGGCCACGTGGTTTCTAAAGCTGGTGTTTCTATAGATCCAACTAAGATAGAGGCAGTCACCAGTTGGTCCCGACCTTCCACAGTT
AGTGAGGAAGGGACTCCTTTTGTTTGGAGTAAGGCCTGTGAAGACAGTTTTCAGAACCTTAAACAGAAACTCGTTACTACATTGGTTCTTACTAGACCTTATGGTTCAGG
GAGTTTTGTGATTTACAGTGATACTTCTAAGAAAGATTTGGGTTGTGTTTTGATGCAGCAAGGTAAGGTAGTCTCTTATGCTTCTCGTCAGTTGAAGAGTCATGATCAAA
GTTACCCTACCCATGATTTAGAGTTGGCAGCAGTAGTCTTTGCACTAAAGATTTGGAGGCATTACTTGTATGGTGAAAAGATATAA
mRNA sequence
ATGGCCCCAGTAGAGTTGAAAGAGCTGAAAGTGCAGTTAGAGGAGTTGCTTGATAAAGGCTTCATTCGACCGAGTGTGTCATCTTGGGGTGCACCAGTTTTATATGTTAA
AAAGAATGATGGATCGATGCGCTTATGTATTGAATACAGAGAGTTGAATAAGGTAACCGTTAAGAACAAATATCCCTTGCTCAGGATCGACGATCTATTTGACCAATTAC
AGGGAGCTACAGTCTTATCTAAGATCGACCTTCGGTCAAGATATCATCAGTTGAGGATTAAGGATATTGATGTACAGAAGACAACCTTTCATTCCAGATATGGACACTAT
GAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCAGCAGTGTTTATGGATTTGATGAACAGAGTGTTCAGGGAGTTCCTAGACACTTTTGTTATCGTGTTCATTGA
CGATATTTTGATATATTCCAAGACAGAGGCAGAGCATGAGGAGCATTTATGCATGGTTCTCGAAACCCATCGAGCTAATAAATTGTATGCAAAGTTCTCAAAATGTGAGT
TTTGGTTTAAGCAGGTATCTTTTCTAGGCCACGTGGTTTCTAAAGCTGGTGTTTCTATAGATCCAACTAAGATAGAGGCAGTCACCAGTTGGTCCCGACCTTCCACAGTT
AGTGAGGAAGGGACTCCTTTTGTTTGGAGTAAGGCCTGTGAAGACAGTTTTCAGAACCTTAAACAGAAACTCGTTACTACATTGGTTCTTACTAGACCTTATGGTTCAGG
GAGTTTTGTGATTTACAGTGATACTTCTAAGAAAGATTTGGGTTGTGTTTTGATGCAGCAAGGTAAGGTAGTCTCTTATGCTTCTCGTCAGTTGAAGAGTCATGATCAAA
GTTACCCTACCCATGATTTAGAGTTGGCAGCAGTAGTCTTTGCACTAAAGATTTGGAGGCATTACTTGTATGGTGAAAAGATATAA
Protein sequence
MAPVELKELKVQLEELLDKGFIRPSVSSWGAPVLYVKKNDGSMRLCIEYRELNKVTVKNKYPLLRIDDLFDQLQGATVLSKIDLRSRYHQLRIKDIDVQKTTFHSRYGHY
EFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLCMVLETHRANKLYAKFSKCEFWFKQVSFLGHVVSKAGVSIDPTKIEAVTSWSRPSTV
SEEGTPFVWSKACEDSFQNLKQKLVTTLVLTRPYGSGSFVIYSDTSKKDLGCVLMQQGKVVSYASRQLKSHDQSYPTHDLELAAVVFALKIWRHYLYGEKI
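
The CDS and protein sequences listed in this record can be checked against each other by translating the CDS. The following is a minimal sketch, not part of CuGenDB: it assumes the standard nuclear genetic code (NCBI translation table 1), and the helper name `translate` is hypothetical. It builds the codon table from the conventional TCAG ordering and translates up to the first stop codon.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS (hypothetical helper); whitespace and line breaks are ignored."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT characters
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 60 nt and last 18 nt of the CDS sequence listed in this record.
cds_start = "ATGGCCCCAGTAGAGTTGAAAGAGCTGAAAGTGCAGTTAGAGGAGTTGCTTGATAAAGGC"
cds_end = "TATGGTGAAAAGATATAA"
print(translate(cds_start))
print(translate(cds_end))
```

Pasting the full CDS block (line breaks included) into `translate` and comparing the result with the protein sequence is one way to confirm that the two listings are consistent.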