; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc03g0062421 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc03g0062421
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr03:2644480..2645400
RNA-Seq ExpressionCmc03g0062421
SyntenyCmc03g0062421
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032541.1 pol protein [Cucumis melo var. makuwa]5.6e-16192.16Show/hide
Query:  MAPTELKELKVQLQKLLDKGFIRPSVSPWGVPVLFVKKKDGSMRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRDSDIPK
        MAP ELKELKVQLQ+LLDKGFIRPSVSPWG PVLFVKKKDGSMRL IDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQL+IRD DIPK
Subjt:  MAPTELKELKVQLQKLLDKGFIRPSVSPWGVPVLFVKKKDGSMRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRDSDIPK

Query:  TAFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHVVSNEG
        TAFRSRYGHY+F++MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFI DIL+YSKTEAEHEEHL QVLETLRAN+LYAKFSKCEFWL+KV FLGHVVS+EG
Subjt:  TAFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHVVSNEG

Query:  VSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPFVWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTNWP+PSTVSEIRSFLGLAGYYRRFVEDFSRI SPLTQLTRKGTPFVWSP CE SFQELKQKLVT PVLTV DGSG+FVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPFVWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSDASKKGL

Query:  GCVMMQ
        GCV+MQ
Subjt:  GCVMMQ

KAA0040547.1 pol protein [Cucumis melo var. makuwa]2.5e-16192.48Show/hide
Query:  MAPTELKELKVQLQKLLDKGFIRPSVSPWGVPVLFVKKKDGSMRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRDSDIPK
        MAP ELKELKVQLQ+LLDKGFIRPSVSPWG PVLFVKKKDGSMRL IDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQL+IRD DIPK
Subjt:  MAPTELKELKVQLQKLLDKGFIRPSVSPWGVPVLFVKKKDGSMRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRDSDIPK

Query:  TAFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHVVSNEG
        TAFRSRYGHYEF++MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFI DIL+YSKTEAEHEEHL QVLETLRAN LYAKFSKCEFWL+KV FLGHVVS+EG
Subjt:  TAFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHVVSNEG

Query:  VSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPFVWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTNWP+PSTVSEI+SFLGLAGYYRRFVEDFSRI SPLTQLTRKGTPFVWSP CESSFQELKQKLVT PVLTV DGSG+FVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPFVWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSDASKKGL

Query:  GCVMMQ
        GCV+MQ
Subjt:  GCVMMQ

KAA0058812.1 pol protein [Cucumis melo var. makuwa]4.3e-16191.83Show/hide
Query:  MAPTELKELKVQLQKLLDKGFIRPSVSPWGVPVLFVKKKDGSMRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRDSDIPK
        MAP ELKELKVQLQ+LLDKGFIRPSVSPWG PVLFVKKKDGSMRL IDYRELNKVT+KNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQL+IRD DIPK
Subjt:  MAPTELKELKVQLQKLLDKGFIRPSVSPWGVPVLFVKKKDGSMRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRDSDIPK

Query:  TAFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHVVSNEG
        TAF SRYGHYEF++MSFGLTNAPAVFMDLMNRVFKDF+DSFVIVFI DIL+YSKTEAEHEEHL QVLETLRAN+LYAKFSKCEFWL+KV FLGHVVS+EG
Subjt:  TAFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHVVSNEG

Query:  VSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPFVWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTNWP+PSTVSEIRSFLGLAGYYRRFVEDFSRI SPLTQLTRKGTPFVWSP CESSFQELKQKLVT PVLTV DGSG+FVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPFVWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSDASKKGL

Query:  GCVMMQ
        GCV+MQ
Subjt:  GCVMMQ

KAA0063793.1 pol protein [Cucumis melo var. makuwa]1.9e-16192.48Show/hide
Query:  MAPTELKELKVQLQKLLDKGFIRPSVSPWGVPVLFVKKKDGSMRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRDSDIPK
        MAP ELKELKVQLQ+LLDKGFIRPSVSPWG PVLFVKKKDGSMRL IDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQL+IRD DIPK
Subjt:  MAPTELKELKVQLQKLLDKGFIRPSVSPWGVPVLFVKKKDGSMRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRDSDIPK

Query:  TAFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHVVSNEG
        TAFRSRYGHYEF++MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFI DIL+YSKTEAEHEEHL QVLETLRAN+LYAKFSKCEFWL+KV FLGHVVS+EG
Subjt:  TAFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHVVSNEG

Query:  VSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPFVWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTNWP+PSTVSEIRSFLGLAGYYRRFVEDFSRI SPLTQLTRKGTPFVWSP CE SFQELKQKLVT PVLTV DGSG+FVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPFVWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSDASKKGL

Query:  GCVMMQ
        GCV+MQ
Subjt:  GCVMMQ

TYK01306.1 pol protein [Cucumis melo var. makuwa]6.6e-16293.14Show/hide
Query:  MAPTELKELKVQLQKLLDKGFIRPSVSPWGVPVLFVKKKDGSMRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRDSDIPK
        MAP ELKELKVQLQ+LLDKGFIRPSVSPWG PVLFVKKKDGSMRL IDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQL+IRDSDIPK
Subjt:  MAPTELKELKVQLQKLLDKGFIRPSVSPWGVPVLFVKKKDGSMRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRDSDIPK

Query:  TAFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHVVSNEG
        TAFR RYGHYEFI+MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFI DIL+YSKTEAEHEEHL QVLETLRAN+LYAKFSKCEFWL+KV FLGHVVS+EG
Subjt:  TAFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHVVSNEG

Query:  VSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPFVWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTNWP+PSTVSEIRSFLGLAGYYRRFVEDFSRI SPLTQLTR+GTPFVWSPTCESSFQ+LKQKLVT PVLTV DGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPFVWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSDASKKGL

Query:  GCVMMQ
        GCV+MQ
Subjt:  GCVMMQ

TrEMBL top hitse value%identityAlignment
A0A5A7TG62 Reverse transcriptase1.2e-16192.48Show/hide
Query:  MAPTELKELKVQLQKLLDKGFIRPSVSPWGVPVLFVKKKDGSMRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRDSDIPK
        MAP ELKELKVQLQ+LLDKGFIRPSVSPWG PVLFVKKKDGSMRL IDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQL+IRD DIPK
Subjt:  MAPTELKELKVQLQKLLDKGFIRPSVSPWGVPVLFVKKKDGSMRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRDSDIPK

Query:  TAFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHVVSNEG
        TAFRSRYGHYEF++MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFI DIL+YSKTEAEHEEHL QVLETLRAN LYAKFSKCEFWL+KV FLGHVVS+EG
Subjt:  TAFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHVVSNEG

Query:  VSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPFVWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTNWP+PSTVSEI+SFLGLAGYYRRFVEDFSRI SPLTQLTRKGTPFVWSP CESSFQELKQKLVT PVLTV DGSG+FVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPFVWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSDASKKGL

Query:  GCVMMQ
        GCV+MQ
Subjt:  GCVMMQ

A0A5A7USG7 Reverse transcriptase2.1e-16191.83Show/hide
Query:  MAPTELKELKVQLQKLLDKGFIRPSVSPWGVPVLFVKKKDGSMRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRDSDIPK
        MAP ELKELKVQLQ+LLDKGFIRPSVSPWG PVLFVKKKDGSMRL IDYRELNKVT+KNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQL+IRD DIPK
Subjt:  MAPTELKELKVQLQKLLDKGFIRPSVSPWGVPVLFVKKKDGSMRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRDSDIPK

Query:  TAFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHVVSNEG
        TAF SRYGHYEF++MSFGLTNAPAVFMDLMNRVFKDF+DSFVIVFI DIL+YSKTEAEHEEHL QVLETLRAN+LYAKFSKCEFWL+KV FLGHVVS+EG
Subjt:  TAFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHVVSNEG

Query:  VSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPFVWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTNWP+PSTVSEIRSFLGLAGYYRRFVEDFSRI SPLTQLTRKGTPFVWSP CESSFQELKQKLVT PVLTV DGSG+FVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPFVWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSDASKKGL

Query:  GCVMMQ
        GCV+MQ
Subjt:  GCVMMQ

A0A5A7V4E4 Reverse transcriptase2.7e-16192.16Show/hide
Query:  MAPTELKELKVQLQKLLDKGFIRPSVSPWGVPVLFVKKKDGSMRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRDSDIPK
        MAP ELKELKVQLQ+LLDKGFIRPSVSPWG PVLFVKKKDGSMRL IDYRELNKVTVKNRYPLP+IDDLFDQLQGATVFSKIDLRSGYHQL+IRD DIPK
Subjt:  MAPTELKELKVQLQKLLDKGFIRPSVSPWGVPVLFVKKKDGSMRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRDSDIPK

Query:  TAFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHVVSNEG
        TAFRSRYGHYEF++MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFI DIL+YSKTEAEHEEHL QVLETLRAN+LYAKFSKCEFWL+KV FLGHVVS+EG
Subjt:  TAFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHVVSNEG

Query:  VSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPFVWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTNWP+PSTVSEIRSFLGLAGYYRRFVEDFSRI SPLTQLTRKGTPFVWSP CESSFQELKQKLV  PVLTV DGSG+FVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPFVWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSDASKKGL

Query:  GCVMMQ
        GCV+MQ
Subjt:  GCVMMQ

A0A5A7V6R2 Reverse transcriptase9.3e-16292.48Show/hide
Query:  MAPTELKELKVQLQKLLDKGFIRPSVSPWGVPVLFVKKKDGSMRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRDSDIPK
        MAP ELKELKVQLQ+LLDKGFIRPSVSPWG PVLFVKKKDGSMRL IDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQL+IRD DIPK
Subjt:  MAPTELKELKVQLQKLLDKGFIRPSVSPWGVPVLFVKKKDGSMRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRDSDIPK

Query:  TAFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHVVSNEG
        TAFRSRYGHYEF++MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFI DIL+YSKTEAEHEEHL QVLETLRAN+LYAKFSKCEFWL+KV FLGHVVS+EG
Subjt:  TAFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHVVSNEG

Query:  VSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPFVWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTNWP+PSTVSEIRSFLGLAGYYRRFVEDFSRI SPLTQLTRKGTPFVWSP CE SFQELKQKLVT PVLTV DGSG+FVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPFVWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSDASKKGL

Query:  GCVMMQ
        GCV+MQ
Subjt:  GCVMMQ

A0A5D3BSV9 Reverse transcriptase3.2e-16293.14Show/hide
Query:  MAPTELKELKVQLQKLLDKGFIRPSVSPWGVPVLFVKKKDGSMRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRDSDIPK
        MAP ELKELKVQLQ+LLDKGFIRPSVSPWG PVLFVKKKDGSMRL IDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQL+IRDSDIPK
Subjt:  MAPTELKELKVQLQKLLDKGFIRPSVSPWGVPVLFVKKKDGSMRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRDSDIPK

Query:  TAFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHVVSNEG
        TAFR RYGHYEFI+MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFI DIL+YSKTEAEHEEHL QVLETLRAN+LYAKFSKCEFWL+KV FLGHVVS+EG
Subjt:  TAFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHVVSNEG

Query:  VSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPFVWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTNWP+PSTVSEIRSFLGLAGYYRRFVEDFSRI SPLTQLTR+GTPFVWSPTCESSFQ+LKQKLVT PVLTV DGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPFVWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSDASKKGL

Query:  GCVMMQ
        GCV+MQ
Subjt:  GCVMMQ

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.68.1e-6238.89Show/hide
Query:  KELKVQLQKLLDKGFIRPSVSPWGVPVLFV-KKKDGS----MRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRDSDIPKT
        +E++ Q+Q +L++G IR S SP+  P+  V KK+D S     R+ IDYR+LN++TV +R+P+P +D++  +L     F+ IDL  G+HQ+++    + KT
Subjt:  KELKVQLQKLLDKGFIRPSVSPWGVPVLFV-KKKDGS----MRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRDSDIPKT

Query:  AFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHVVSNEGV
        AF +++GHYE++ M FGL NAPA F   MN + +  L+   +V++ DI+V+S +  EH + L  V E L    L  +  KCEF  ++  FLGHV++ +G+
Subjt:  AFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHVVSNEGV

Query:  SVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPF-VWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSDASKKGL
          +P KIEA+  +P P+   EI++FLGL GYYR+F+ +F+ I  P+T+  +K       +P  +S+F++LK  +   P+L V D +  F + +DAS   L
Subjt:  SVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPF-VWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSDASKKGL

Query:  GCVMMQ
        G V+ Q
Subjt:  GCVMMQ

P0CT34 Transposon Tf2-1 polyprotein4.4e-6036.6Show/hide
Query:  MAPTELKELKVQLQKLLDKGFIRPSVSPWGVPVLFVKKKDGSMRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRDSDIPK
        + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +++R  D  K
Subjt:  MAPTELKELKVQLQKLLDKGFIRPSVSPWGVPVLFVKKKDGSMRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRDSDIPK

Query:  TAFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHVVSNEG
         AFR   G +E+++M +G++ APA F   +N +  +  +S V+ ++ DIL++SK+E+EH +H++ VL+ L+   L    +KCEF   +V F+G+ +S +G
Subjt:  TAFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHVVSNEG

Query:  VSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPFVWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSDASKKGL
         +     I+ V  W QP    E+R FLG   Y R+F+   S++T PL  L +K   + W+PT   + + +KQ LV+ PVL   D S   ++ +DAS   +
Subjt:  VSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPFVWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSDASKKGL

Query:  GCVMMQ
        G V+ Q
Subjt:  GCVMMQ

P0CT35 Transposon Tf2-2 polyprotein4.4e-6036.6Show/hide
Query:  MAPTELKELKVQLQKLLDKGFIRPSVSPWGVPVLFVKKKDGSMRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRDSDIPK
        + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +++R  D  K
Subjt:  MAPTELKELKVQLQKLLDKGFIRPSVSPWGVPVLFVKKKDGSMRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRDSDIPK

Query:  TAFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHVVSNEG
         AFR   G +E+++M +G++ APA F   +N +  +  +S V+ ++ DIL++SK+E+EH +H++ VL+ L+   L    +KCEF   +V F+G+ +S +G
Subjt:  TAFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHVVSNEG

Query:  VSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPFVWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSDASKKGL
         +     I+ V  W QP    E+R FLG   Y R+F+   S++T PL  L +K   + W+PT   + + +KQ LV+ PVL   D S   ++ +DAS   +
Subjt:  VSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPFVWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSDASKKGL

Query:  GCVMMQ
        G V+ Q
Subjt:  GCVMMQ

P0CT41 Transposon Tf2-12 polyprotein4.4e-6036.6Show/hide
Query:  MAPTELKELKVQLQKLLDKGFIRPSVSPWGVPVLFVKKKDGSMRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRDSDIPK
        + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +++R  D  K
Subjt:  MAPTELKELKVQLQKLLDKGFIRPSVSPWGVPVLFVKKKDGSMRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRDSDIPK

Query:  TAFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHVVSNEG
         AFR   G +E+++M +G++ APA F   +N +  +  +S V+ ++ DIL++SK+E+EH +H++ VL+ L+   L    +KCEF   +V F+G+ +S +G
Subjt:  TAFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHVVSNEG

Query:  VSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPFVWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSDASKKGL
         +     I+ V  W QP    E+R FLG   Y R+F+   S++T PL  L +K   + W+PT   + + +KQ LV+ PVL   D S   ++ +DAS   +
Subjt:  VSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPFVWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSDASKKGL

Query:  GCVMMQ
        G V+ Q
Subjt:  GCVMMQ

P20825 Retrovirus-related Pol polyprotein from transposon 2972.6e-6036.86Show/hide
Query:  MAPTELKELKVQLQKLLDKGFIRPSVSPWGVPVLFVKKKD-----GSMRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRD
        +A T   E++ Q+Q++L++G IR S SP+  P   V KK         R+ IDYR+LN++T+ +RYP+P +D++  +L     F+ IDL  G+HQ+++ +
Subjt:  MAPTELKELKVQLQKLLDKGFIRPSVSPWGVPVLFVKKKD-----GSMRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRD

Query:  SDIPKTAFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHV
          I KTAF ++ GHYE++ M FGL NAPA F   MN + +  L+   +V++ DI+++S +  EH   ++ V   L    L  +  KCEF  K+  FLGH+
Subjt:  SDIPKTAFRSRYGHYEFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHV

Query:  VSNEGVSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPF-VWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSD
        V+ +G+  +P K++A+ ++P P+   EIR+FLGL GYYR+F+ +++ I  P+T   +K T           +F++LK  ++  P+L + D    FV+ +D
Subjt:  VSNEGVSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPF-VWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSD

Query:  ASKKGLGCVMMQ
        AS   LG V+ Q
Subjt:  ASKKGLGCVMMQ

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein5.6e-2642.75Show/hide
Query:  HLRQVLETLRANRLYAKFSKCEFWLKKVFFLG--HVVSNEGVSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPFVW
        HL  VL+    ++ YA   KC F   ++ +LG  H++S EGVS DPAK+EA+  WP+P   +E+R FLGL GYYRRFV+++ +I  PLT+L +K +   W
Subjt:  HLRQVLETLRANRLYAKFSKCEFWLKKVFFLG--HVVSNEGVSVDPAKIEAVTNWPQPSTVSEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPFVW

Query:  SPTCESSFQELKQKLVTTPVLTVSDGSGSFV
        +     +F+ LK  + T PVL + D    FV
Subjt:  SPTCESSFQELKQKLVTTPVLTVSDGSGSFV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCCAACCGAGTTAAAGGAGCTGAAGGTACAGCTACAGAAGTTGCTGGACAAGGGTTTCATTCGACCCAGTGTGTCACCTTGGGGAGTCCCAGTGTTGTTTGTGAA
GAAGAAGGATGGGTCGATGCGCCTTGGCATTGACTACCGAGAGCTGAACAAGGTGACAGTTAAGAACCGCTACCCCTTGCCAAGGATTGATGACTTGTTCGACCAGTTGC
AGGGAGCCACTGTCTTTTCCAAGATCGACCTACGATCAGGCTACCACCAGTTGAAGATCAGGGACAGTGACATTCCGAAGACGGCCTTTCGTTCGAGATACGGACATTAC
GAGTTCATTTTGATGTCTTTTGGGTTGACTAATGCCCCTGCGGTGTTCATGGACTTGATGAACAGGGTGTTTAAGGACTTCCTAGACTCGTTCGTCATAGTTTTCATTGT
TGACATCTTGGTTTACTCCAAGACTGAGGCTGAACATGAGGAGCATTTACGCCAAGTTTTGGAGACTCTTCGAGCCAATAGACTGTATGCCAAGTTCTCCAAGTGCGAGT
TCTGGCTGAAGAAGGTATTTTTCCTTGGACATGTGGTGTCCAACGAGGGAGTTTCTGTGGATCCAGCAAAGATCGAAGCGGTGACCAACTGGCCTCAACCGTCTACAGTT
AGTGAGATTCGAAGTTTTCTGGGCTTGGCAGGTTACTACAGGAGGTTCGTGGAAGACTTCTCACGTATAACCAGCCCGTTGACCCAGTTGACCAGGAAGGGAACCCCTTT
TGTCTGGAGCCCAACATGCGAGAGTAGTTTCCAGGAGCTTAAGCAGAAGCTGGTGACTACACCAGTCCTGACAGTGTCCGATGGGTCGGGAAGCTTTGTGATCTATAGTG
ATGCCTCCAAGAAGGGACTGGGCTGTGTTATGATGCAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCCAACCGAGTTAAAGGAGCTGAAGGTACAGCTACAGAAGTTGCTGGACAAGGGTTTCATTCGACCCAGTGTGTCACCTTGGGGAGTCCCAGTGTTGTTTGTGAA
GAAGAAGGATGGGTCGATGCGCCTTGGCATTGACTACCGAGAGCTGAACAAGGTGACAGTTAAGAACCGCTACCCCTTGCCAAGGATTGATGACTTGTTCGACCAGTTGC
AGGGAGCCACTGTCTTTTCCAAGATCGACCTACGATCAGGCTACCACCAGTTGAAGATCAGGGACAGTGACATTCCGAAGACGGCCTTTCGTTCGAGATACGGACATTAC
GAGTTCATTTTGATGTCTTTTGGGTTGACTAATGCCCCTGCGGTGTTCATGGACTTGATGAACAGGGTGTTTAAGGACTTCCTAGACTCGTTCGTCATAGTTTTCATTGT
TGACATCTTGGTTTACTCCAAGACTGAGGCTGAACATGAGGAGCATTTACGCCAAGTTTTGGAGACTCTTCGAGCCAATAGACTGTATGCCAAGTTCTCCAAGTGCGAGT
TCTGGCTGAAGAAGGTATTTTTCCTTGGACATGTGGTGTCCAACGAGGGAGTTTCTGTGGATCCAGCAAAGATCGAAGCGGTGACCAACTGGCCTCAACCGTCTACAGTT
AGTGAGATTCGAAGTTTTCTGGGCTTGGCAGGTTACTACAGGAGGTTCGTGGAAGACTTCTCACGTATAACCAGCCCGTTGACCCAGTTGACCAGGAAGGGAACCCCTTT
TGTCTGGAGCCCAACATGCGAGAGTAGTTTCCAGGAGCTTAAGCAGAAGCTGGTGACTACACCAGTCCTGACAGTGTCCGATGGGTCGGGAAGCTTTGTGATCTATAGTG
ATGCCTCCAAGAAGGGACTGGGCTGTGTTATGATGCAGTAA
Protein sequenceShow/hide protein sequence
MAPTELKELKVQLQKLLDKGFIRPSVSPWGVPVLFVKKKDGSMRLGIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLKIRDSDIPKTAFRSRYGHY
EFILMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILVYSKTEAEHEEHLRQVLETLRANRLYAKFSKCEFWLKKVFFLGHVVSNEGVSVDPAKIEAVTNWPQPSTV
SEIRSFLGLAGYYRRFVEDFSRITSPLTQLTRKGTPFVWSPTCESSFQELKQKLVTTPVLTVSDGSGSFVIYSDASKKGLGCVMMQ