; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0226031 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0226031
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr08:17584540..17585378
RNA-Seq ExpressionCmc08g0226031
SyntenyCmc08g0226031
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0043227 - membrane-bounded organelle (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031895.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]3.1e-11188.89Show/hide
Query:  MRLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFREFLDTFV
        MRLCIDYRELNKVTVKN+YPLP IDDLFDQLQGAT+FSKIDLRS YHQLRIKD +VP TAFRSRYGHYEFIVMSFGL NA  VFMDLMNRVFREFLDTF 
Subjt:  MRLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFREFLDTFV

Query:  IVSFLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRIPTPFTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPYGSG
         VSFLGHVVSKAGV +DPAKIE VT W R STVSEVRSFL LAGYYRRFV NFSRI TP TQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVP GSG
Subjt:  IVSFLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRIPTPFTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPYGSG

Query:  SFVIYSHASKNGLGCVLMQQGKVVPYASRQLKSH
        SFVIYS ASK GLGCVLMQQGKVV YASRQLKSH
Subjt:  SFVIYSHASKNGLGCVLMQQGKVVPYASRQLKSH

KAA0032794.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]7.6e-11081.64Show/hide
Query:  MRLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFREFLDTFV
        MRLCIDYRELNKVTVKN+YPLP IDDLFDQLQGAT+FSKIDLRS YHQLRIKD +VP TAFRSRYGHYEFIVMSFGL NA TVFMDLMNRVFREFLDTFV
Subjt:  MRLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFREFLDTFV

Query:  I----------------------VSFLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRIPTPFTQLTRKGAPFVWSKACED
        I                      VSFLGHVVSKAGV +DPAKIE VTSWPR STVSEVRSFL L GYYRRFVENFSRI TP TQLTRKGAPFVWSKACED
Subjt:  I----------------------VSFLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRIPTPFTQLTRKGAPFVWSKACED

Query:  SFQNLKQKLVTAPVLTVPYGSGSFVIYSHASKNGLGCVLMQQGKVVPYASRQLKSH
        SFQNLK+KLVTAPV TVP GSGSF+IY  ASK GLGCVLMQQGKVV YASRQLKSH
Subjt:  SFQNLKQKLVTAPVLTVPYGSGSFVIYSHASKNGLGCVLMQQGKVVPYASRQLKSH

KAA0036553.1 pol protein [Cucumis melo var. makuwa]4.5e-11081.64Show/hide
Query:  MRLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFREFLDTFV
        MRLCIDYRELNKVTVKN+YPLPWIDDLFDQLQ AT+FSKIDLRS YHQLRIKD +VP TAFRSRYGHYEFIVMSFGL NA  VFMDLMNRVFREFLDTFV
Subjt:  MRLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFREFLDTFV

Query:  ----------------------IVSFLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRIPTPFTQLTRKGAPFVWSKACED
                              IVSFLGHVVSKAGV +DPAKIE VT W R STVSE RSFL LAGYYRRFVENFS I TP TQLTRKGAPFVWSKACED
Subjt:  ----------------------IVSFLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRIPTPFTQLTRKGAPFVWSKACED

Query:  SFQNLKQKLVTAPVLTVPYGSGSFVIYSHASKNGLGCVLMQQGKVVPYASRQLKSH
        SFQNLKQKLVTAPVLTVP GSGSFVIYS ASK GLGCVLMQQGKVV YASRQLKSH
Subjt:  SFQNLKQKLVTAPVLTVPYGSGSFVIYSHASKNGLGCVLMQQGKVVPYASRQLKSH

KAA0051368.1 pol protein [Cucumis melo var. makuwa]6.5e-10981.25Show/hide
Query:  MRLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFREFLDTFV
        MRLCIDYRELNKVTVKN+YPLP IDDLFDQLQGA +FSK DLRS YHQLRIKD +VP TAFRSRYGHYEFIVMSFGL NA  VFMDLMNRVFR+FLDTFV
Subjt:  MRLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFREFLDTFV

Query:  I----------------------VSFLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRIPTPFTQLTRKGAPFVWSKACED
        I                      VSFLGHVVSKAGV +DPAKIE VT W R STVSEVRSFL LAGYYRRFVENFSRI TP TQLTRKGAPFVWSKACED
Subjt:  I----------------------VSFLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRIPTPFTQLTRKGAPFVWSKACED

Query:  SFQNLKQKLVTAPVLTVPYGSGSFVIYSHASKNGLGCVLMQQGKVVPYASRQLKSH
        SFQNLKQKLVTAPVLTVP GSGSFVIYS ASK GLGCVLMQQGKVV YASRQLKSH
Subjt:  SFQNLKQKLVTAPVLTVPYGSGSFVIYSHASKNGLGCVLMQQGKVVPYASRQLKSH

TYK21025.1 pol protein [Cucumis melo var. makuwa]7.6e-11082.03Show/hide
Query:  MRLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFREFLDTFV
        MRLCIDYRELNKVTVKNKYPLP IDDLFD LQGAT+FSKIDLRSRYHQLRIKD +VP  AFRSRYGHYEFIVMSFGL NA TVFMDLMNRVFREFLDTFV
Subjt:  MRLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFREFLDTFV

Query:  I----------------------VSFLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRIPTPFTQLTRKGAPFVWSKACED
        I                      VSFLGHVVSKAGV +DPAKI+ VTSWPR STVSEVRSFL LAGYYRRFVENFSRI TP TQLTRK APFVWSK CED
Subjt:  I----------------------VSFLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRIPTPFTQLTRKGAPFVWSKACED

Query:  SFQNLKQKLVTAPVLTVPYGSGSFVIYSHASKNGLGCVLMQQGKVVPYASRQLKSH
        SFQNLKQKLVTAPVLTVP GSGSFVIYS+ASK GLGCVLMQQ KVV YASRQLKSH
Subjt:  SFQNLKQKLVTAPVLTVPYGSGSFVIYSHASKNGLGCVLMQQGKVVPYASRQLKSH

TrEMBL top hitse value%identityAlignment
A0A5A7SRV9 Reverse transcriptase1.5e-11188.89Show/hide
Query:  MRLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFREFLDTFV
        MRLCIDYRELNKVTVKN+YPLP IDDLFDQLQGAT+FSKIDLRS YHQLRIKD +VP TAFRSRYGHYEFIVMSFGL NA  VFMDLMNRVFREFLDTF 
Subjt:  MRLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFREFLDTFV

Query:  IVSFLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRIPTPFTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPYGSG
         VSFLGHVVSKAGV +DPAKIE VT W R STVSEVRSFL LAGYYRRFV NFSRI TP TQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVP GSG
Subjt:  IVSFLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRIPTPFTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPYGSG

Query:  SFVIYSHASKNGLGCVLMQQGKVVPYASRQLKSH
        SFVIYS ASK GLGCVLMQQGKVV YASRQLKSH
Subjt:  SFVIYSHASKNGLGCVLMQQGKVVPYASRQLKSH

A0A5A7T0Y9 Reverse transcriptase2.2e-11081.64Show/hide
Query:  MRLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFREFLDTFV
        MRLCIDYRELNKVTVKN+YPLPWIDDLFDQLQ AT+FSKIDLRS YHQLRIKD +VP TAFRSRYGHYEFIVMSFGL NA  VFMDLMNRVFREFLDTFV
Subjt:  MRLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFREFLDTFV

Query:  ----------------------IVSFLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRIPTPFTQLTRKGAPFVWSKACED
                              IVSFLGHVVSKAGV +DPAKIE VT W R STVSE RSFL LAGYYRRFVENFS I TP TQLTRKGAPFVWSKACED
Subjt:  ----------------------IVSFLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRIPTPFTQLTRKGAPFVWSKACED

Query:  SFQNLKQKLVTAPVLTVPYGSGSFVIYSHASKNGLGCVLMQQGKVVPYASRQLKSH
        SFQNLKQKLVTAPVLTVP GSGSFVIYS ASK GLGCVLMQQGKVV YASRQLKSH
Subjt:  SFQNLKQKLVTAPVLTVPYGSGSFVIYSHASKNGLGCVLMQQGKVVPYASRQLKSH

A0A5A7U7V9 Reverse transcriptase3.1e-10981.25Show/hide
Query:  MRLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFREFLDTFV
        MRLCIDYRELNKVTVKN+YPLP IDDLFDQLQGA +FSK DLRS YHQLRIKD +VP TAFRSRYGHYEFIVMSFGL NA  VFMDLMNRVFR+FLDTFV
Subjt:  MRLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFREFLDTFV

Query:  I----------------------VSFLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRIPTPFTQLTRKGAPFVWSKACED
        I                      VSFLGHVVSKAGV +DPAKIE VT W R STVSEVRSFL LAGYYRRFVENFSRI TP TQLTRKGAPFVWSKACED
Subjt:  I----------------------VSFLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRIPTPFTQLTRKGAPFVWSKACED

Query:  SFQNLKQKLVTAPVLTVPYGSGSFVIYSHASKNGLGCVLMQQGKVVPYASRQLKSH
        SFQNLKQKLVTAPVLTVP GSGSFVIYS ASK GLGCVLMQQGKVV YASRQLKSH
Subjt:  SFQNLKQKLVTAPVLTVPYGSGSFVIYSHASKNGLGCVLMQQGKVVPYASRQLKSH

A0A5D3DBS9 Pol protein3.7e-11082.03Show/hide
Query:  MRLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFREFLDTFV
        MRLCIDYRELNKVTVKNKYPLP IDDLFD LQGAT+FSKIDLRSRYHQLRIKD +VP  AFRSRYGHYEFIVMSFGL NA TVFMDLMNRVFREFLDTFV
Subjt:  MRLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFREFLDTFV

Query:  I----------------------VSFLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRIPTPFTQLTRKGAPFVWSKACED
        I                      VSFLGHVVSKAGV +DPAKI+ VTSWPR STVSEVRSFL LAGYYRRFVENFSRI TP TQLTRK APFVWSK CED
Subjt:  I----------------------VSFLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRIPTPFTQLTRKGAPFVWSKACED

Query:  SFQNLKQKLVTAPVLTVPYGSGSFVIYSHASKNGLGCVLMQQGKVVPYASRQLKSH
        SFQNLKQKLVTAPVLTVP GSGSFVIYS+ASK GLGCVLMQQ KVV YASRQLKSH
Subjt:  SFQNLKQKLVTAPVLTVPYGSGSFVIYSHASKNGLGCVLMQQGKVVPYASRQLKSH

A0A5D3E456 Reverse transcriptase3.7e-11081.64Show/hide
Query:  MRLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFREFLDTFV
        MRLCIDYRELNKVTVKN+YPLP IDDLFDQLQGAT+FSKIDLRS YHQLRIKD +VP TAFRSRYGHYEFIVMSFGL NA TVFMDLMNRVFREFLDTFV
Subjt:  MRLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFREFLDTFV

Query:  I----------------------VSFLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRIPTPFTQLTRKGAPFVWSKACED
        I                      VSFLGHVVSKAGV +DPAKIE VTSWPR STVSEVRSFL L GYYRRFVENFSRI TP TQLTRKGAPFVWSKACED
Subjt:  I----------------------VSFLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRIPTPFTQLTRKGAPFVWSKACED

Query:  SFQNLKQKLVTAPVLTVPYGSGSFVIYSHASKNGLGCVLMQQGKVVPYASRQLKSH
        SFQNLK+KLVTAPV TVP GSGSF+IY  ASK GLGCVLMQQGKVV YASRQLKSH
Subjt:  SFQNLKQKLVTAPVLTVPYGSGSFVIYSHASKNGLGCVLMQQGKVVPYASRQLKSH

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.65.3e-3731.9Show/hide
Query:  RLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFRE-------
        R+ IDYR+LN++TV +++P+P +D++  +L     F+ IDL   +HQ+ +   +V  TAF +++GHYE++ M FGL NA   F   MN + R        
Subjt:  RLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFRE-------

Query:  -FLDTFVIVS-------------------------------------FLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRI
         +LD  ++ S                                     FLGHV++  G+  +P KIE +  +P  +   E+++FL L GYYR+F+ NF+ I
Subjt:  -FLDTFVIVS-------------------------------------FLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRI

Query:  PTPFTQLTRKGAPFVWSKACEDS-FQNLKQKLVTAPVLTVPYGSGSFVIYSHASKNGLGCVLMQQGKVVPYASRQLKSH
          P T+  +K      +    DS F+ LK  +   P+L VP  +  F + + AS   LG VL Q G  + Y SR L  H
Subjt:  PTPFTQLTRKGAPFVWSKACEDS-FQNLKQKLVTAPVLTVPYGSGSFVIYSHASKNGLGCVLMQQGKVVPYASRQLKSH

P10394 Retrovirus-related Pol polyprotein from transposon 4121.7e-3231.05Show/hide
Query:  RLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFRE-------
        RL IDYR++NK  + +K+PLP IDD+ DQL  A  FS +DL S +HQ+ + + +   T+F +  G Y F  + FGL  A   F  +M   F         
Subjt:  RLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFRE-------

Query:  -FLDTFVI-------------------------------------VSFLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRI
         ++D  ++                                     V+FLGH  +  G+  D  K + + ++P        R F+    YYRRF++NF+  
Subjt:  -FLDTFVI-------------------------------------VSFLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRI

Query:  PTPFTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPYGSGSFVIYSHASKNGLGCVLMQ--QGKVVP--YASR
            T+L +K  PF W+  C+ +F +LK +L+   +L  P  S  F I + ASK   G VL Q   G  +P  YASR
Subjt:  PTPFTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPYGSGSFVIYSHASKNGLGCVLMQ--QGKVVP--YASR

P10401 Retrovirus-related Pol polyprotein from transposon gypsy1.1e-3130.56Show/hide
Query:  RLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFRE-------
        RL ID+R+LN+ T+ ++YP+P I  +   L  A  F+ +DL+S YHQ+ + + +   T+F    G YEF  + FGL NAS++F   ++ V RE       
Subjt:  RLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFRE-------

Query:  -FLDTFVI-------------------------------------VSFLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRI
         ++D  +I                                     V +LG +VSK G   DP K++ +  +P    V +VRSFL LA YYR F+++F+ I
Subjt:  -FLDTFVI-------------------------------------VSFLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRI

Query:  PTPFTQLTR-----------KGAPFVWSKACEDSFQNLKQKLVTAPV-LTVPYGSGSFVIYSHASKNGLGCVLMQQGKVVPYASRQLK
          P T + +           K  P  +++   ++FQ L+  L +  V L  P     F + + AS +G+G VL Q+G+ +   SR LK
Subjt:  PTPFTQLTR-----------KGAPFVWSKACEDSFQNLKQKLVTAPV-LTVPYGSGSFVIYSHASKNGLGCVLMQQGKVVPYASRQLK

P20825 Retrovirus-related Pol polyprotein from transposon 2971.1e-3730.82Show/hide
Query:  RLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFRE-------
        R+ IDYR+LN++T+ ++YP+P +D++  +L     F+ IDL   +HQ+ + + ++  TAF ++ GHYE++ M FGL NA   F   MN + R        
Subjt:  RLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFRE-------

Query:  -FLDTFVIVS-------------------------------------FLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRI
         +LD  +I S                                     FLGH+V+  G+  +P K++ + S+P  +   E+R+FL L GYYR+F+ N++ I
Subjt:  -FLDTFVIVS-------------------------------------FLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRI

Query:  PTPFTQLTRKGAPFVWSK-ACEDSFQNLKQKLVTAPVLTVPYGSGSFVIYSHASKNGLGCVLMQQGKVVPYASRQLKSH
          P T   +K       K    ++F+ LK  ++  P+L +P     FV+ + AS   LG VL Q G  + + SR L  H
Subjt:  PTPFTQLTRKGAPFVWSK-ACEDSFQNLKQKLVTAPVLTVPYGSGSFVIYSHASKNGLGCVLMQQGKVVPYASRQLKSH

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus5.1e-3228.28Show/hide
Query:  RLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFRE-------
        R+ +D++ LN VT+ + YP+P I+     L  A  F+ +DL S +HQ+ +K+S++P TAF +  G YEF+ + FGL NA  +F  +++ + RE       
Subjt:  RLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFRE-------

Query:  -FLDTFVI-------------------------------------VSFLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRI
         ++D  ++                                     V FLG++V+  G+  DP K+  ++  P  ++V E++ FL +  YYR+F+++++++
Subjt:  -FLDTFVI-------------------------------------VSFLGHVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRI

Query:  PTPFTQLTR-----------KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPYGSGSFVIYSHASKNGLGCVLMQ--QGKVVP--YASRQL
          P T LTR              P    +    SF +LK  L ++ +L  P  +  F + + AS   +G VL Q  QG+  P  Y SR L
Subjt:  PTPFTQLTR-----------KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPYGSGSFVIYSHASKNGLGCVLMQ--QGKVVP--YASRQL

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein2.1e-1741.24Show/hide
Query:  VSFLG--HVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRIPTPFTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVP
        +++LG  H++S  GV  DPAK+E +  WP     +E+R FL L GYYRRFV+N+ +I  P T+L +K +   W++    +F+ LK  + T PVL +P
Subjt:  VSFLG--HVVSKAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRIPTPFTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGCCTATGTATTGACTACAGGGAGTTGAATAAGGTAACCGTTAAGAATAAATATCCCTTGCCCTGGATCGACGATCTGTTTGACCAGTTACAAGGAGCTACAATGTT
CTCTAAGATCGACCTTCGGTCGAGATATCATCAACTGAGAATTAAGGATAGCAATGTACCAAATACAGCATTTCGTTCCAGATATGGGCACTATGAGTTTATTGTGATGT
CTTTTGGTTTGATGAATGCTTCGACAGTGTTTATGGATTTGATGAACAGAGTGTTTAGAGAGTTCCTAGACACTTTTGTGATCGTATCCTTTCTAGGCCATGTGGTTTCT
AAAGCTGGTGTTTTCATGGATCCAGCTAAGATAGAGGAGGTCACCAGTTGGCCCCGATCTTCCACAGTCAGTGAGGTTCGTAGCTTTCTATGTTTAGCAGGTTATTATCG
ACGATTTGTGGAGAACTTTTCCCGTATACCTACTCCTTTTACTCAGTTGACCAGGAAGGGAGCTCCTTTCGTTTGGAGCAAAGCCTGTGAGGACAGTTTTCAGAACCTTA
AACAGAAGCTAGTTACTGCACCAGTTCTTACCGTACCTTATGGTTCTGGGAGTTTTGTGATTTACAGTCATGCTTCTAAGAATGGTTTGGGTTGTGTTTTGATGCAACAA
GGTAAGGTAGTCCCTTATGCTTCTCGTCAGTTGAAGAGTCATTAG
mRNA sequenceShow/hide mRNA sequence
ATGCGCCTATGTATTGACTACAGGGAGTTGAATAAGGTAACCGTTAAGAATAAATATCCCTTGCCCTGGATCGACGATCTGTTTGACCAGTTACAAGGAGCTACAATGTT
CTCTAAGATCGACCTTCGGTCGAGATATCATCAACTGAGAATTAAGGATAGCAATGTACCAAATACAGCATTTCGTTCCAGATATGGGCACTATGAGTTTATTGTGATGT
CTTTTGGTTTGATGAATGCTTCGACAGTGTTTATGGATTTGATGAACAGAGTGTTTAGAGAGTTCCTAGACACTTTTGTGATCGTATCCTTTCTAGGCCATGTGGTTTCT
AAAGCTGGTGTTTTCATGGATCCAGCTAAGATAGAGGAGGTCACCAGTTGGCCCCGATCTTCCACAGTCAGTGAGGTTCGTAGCTTTCTATGTTTAGCAGGTTATTATCG
ACGATTTGTGGAGAACTTTTCCCGTATACCTACTCCTTTTACTCAGTTGACCAGGAAGGGAGCTCCTTTCGTTTGGAGCAAAGCCTGTGAGGACAGTTTTCAGAACCTTA
AACAGAAGCTAGTTACTGCACCAGTTCTTACCGTACCTTATGGTTCTGGGAGTTTTGTGATTTACAGTCATGCTTCTAAGAATGGTTTGGGTTGTGTTTTGATGCAACAA
GGTAAGGTAGTCCCTTATGCTTCTCGTCAGTTGAAGAGTCATTAG
Protein sequenceShow/hide protein sequence
MRLCIDYRELNKVTVKNKYPLPWIDDLFDQLQGATMFSKIDLRSRYHQLRIKDSNVPNTAFRSRYGHYEFIVMSFGLMNASTVFMDLMNRVFREFLDTFVIVSFLGHVVS
KAGVFMDPAKIEEVTSWPRSSTVSEVRSFLCLAGYYRRFVENFSRIPTPFTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPYGSGSFVIYSHASKNGLGCVLMQQ
GKVVPYASRQLKSH