CuGenDBv2

CmoCh18G005660 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh18G005660
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Reverse transcriptase
Genome location: Cmo_Chr18:4714142..4715110
RNA-Seq Expression: CmoCh18G005660
Synteny: CmoCh18G005660
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
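
The genome location field in this record uses a `sequence:start..end` notation. The following is a minimal sketch of how such a string could be split into its parts and its span computed. It assumes the coordinates are 1-based and inclusive, which the page does not state; the function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str):
    """Split 'seqid:start..end' into (seqid, start, end) with integer coordinates."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return seqid, start, end


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("Cmo_Chr18:4714142..4715110")
print(seqid, start, end, span_length(start, end))  # Cmo_Chr18 4714142 4715110 969
```

Under that assumption the span is 969 bp, which is the length expected for a single uninterrupted coding sequence of 322 codons plus a stop codon.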


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022150099.1 uncharacterized protein LOC111018360 [Momordica charantia], e-value 1.2e-158, identity 84.78%
Query:  MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
        MAPPELAELRKQLDELL AGFIRPAKAP+GAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPII+DLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
Subjt:  MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK

Query:  TTCVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQ
        TTCVTRYGAFEFLVM FGLTNAPATFCT++NQVF+EYLDQFV+VYLDDIVVYS TL+EH++HL+LVFDKLRQNQLYVKKEKCAFAQ  I FLGHV+  GQ
Subjt:  TTCVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQ

Query:  ISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSNDCQMAFENMKTTMTRGSVLGLVDVTKPFEVETNASDFAL
        ISMD+DK+KAIQEW+VPTSV ELRSFLGLANYYRRF+EGFSRRA P+TELLKK   W WS + Q AFE++K  M +G VLGL DVTKPFEVET+ASD+AL
Subjt:  ISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSNDCQMAFENMKTTMTRGSVLGLVDVTKPFEVETNASDFAL

Query:  GGVLIQEGHPIAYESRKLNDAE
        GGVL+Q+ HPI YESRKLN+AE
Subjt:  GGVLIQEGHPIAYESRKLNDAE

XP_022975516.1 uncharacterized protein LOC111474945, partial [Cucurbita maxima], e-value 1.8e-162, identity 89.75%
Query:  MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
        MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
Subjt:  MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK

Query:  TTCVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQ
        TTCVTRYGAFEFLVMPFGLTNAPATFCTL+NQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQL                      CGQ
Subjt:  TTCVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQ

Query:  ISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSNDCQMAFENMKTTMTRGSVLGLVDVTKPFEVETNASDFAL
        ISMDSDKIKAIQEWKVPTSVS+LRSFLGLANYYRRFVEGFSRRAAPLTELLKKD+ WSWS+DCQMAFE++KTTMTRG VLGLVDVTKPFE+ET+ASDFAL
Subjt:  ISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSNDCQMAFENMKTTMTRGSVLGLVDVTKPFEVETNASDFAL

Query:  GGVLIQEGHPIAYESRKLNDAE
        GGVLIQEGHPIA+ESRKLNDAE
Subjt:  GGVLIQEGHPIAYESRKLNDAE

XP_023524533.1 uncharacterized protein LOC111788429 [Cucurbita pepo subsp. pepo], e-value 1.3e-181, identity 97.83%
Query:  MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
        MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRI EGDEPK
Subjt:  MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK

Query:  TTCVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQ
        TTCVTRYGAFEFLVMPFGLTNAPATF TL+NQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQ
Subjt:  TTCVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQ

Query:  ISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSNDCQMAFENMKTTMTRGSVLGLVDVTKPFEVETNASDFAL
        ISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKD+PWSWSNDCQMAFEN+KTTMTRG VLGLVDVTKPFEVET+ASDFAL
Subjt:  ISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSNDCQMAFENMKTTMTRGSVLGLVDVTKPFEVETNASDFAL

Query:  GGVLIQEGHPIAYESRKLNDAE
        GGVLIQEGHPIAYESRKLNDAE
Subjt:  GGVLIQEGHPIAYESRKLNDAE

XP_023526180.1 uncharacterized protein LOC111789739 [Cucurbita pepo subsp. pepo], e-value 4.3e-164, identity 97.59%
Query:  MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
        MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
Subjt:  MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK

Query:  TTCVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQ
        TTCVTRYGAFEFLVMPFGLTNAPATFCTL+NQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCI+FLGHVVRCGQ
Subjt:  TTCVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQ

Query:  ISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSNDCQMAFENMKTTMTRGSVLGLVDVTKPFEV
        ISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKD+PWSWSNDCQMAFEN+KTTMTRG VLGLVD+TKPFE+
Subjt:  ISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSNDCQMAFENMKTTMTRGSVLGLVDVTKPFEV

XP_023537907.1 uncharacterized protein LOC111798805 [Cucurbita pepo subsp. pepo], e-value 9.3e-183, identity 98.14%
Query:  MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
        MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
Subjt:  MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK

Query:  TTCVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQ
        TTCVTRYGAFEFLVMPFGLTNAPATFCTL+NQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCI+FLGHVVRCGQ
Subjt:  TTCVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQ

Query:  ISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSNDCQMAFENMKTTMTRGSVLGLVDVTKPFEVETNASDFAL
        ISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKD+PWSWSNDCQMAFEN+KTTMTRG VLGLVDVTKPFEVET+ASDFAL
Subjt:  ISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSNDCQMAFENMKTTMTRGSVLGLVDVTKPFEVETNASDFAL

Query:  GGVLIQEGHPIAYESRKLNDAE
        GGVLIQEGHPIAYESRKLNDAE
Subjt:  GGVLIQEGHPIAYESRKLNDAE

TrEMBL top hits (e-value, % identity, alignment)
A0A5D3C4R1 Reverse transcriptase, e-value 4.4e-154, identity 80.75%
Query:  MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
        MAPPELAELRKQLDELL AGFIRPAKAPYGAPVLFQ+KKDG+LRLCIDYRALNK+TVRNKYPLPII+DLFD+LHGAKYF+KLDLRSGYYQVRIAEGDEPK
Subjt:  MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK

Query:  TTCVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQ
        TTCVTRYGAFEFLVMPFGLTNAPATFCTL+NQVF+EYLD+FV+VYLDDIVVYSTT+EEH+ HL+ VF KL++NQLYVK+EKC+FAQ  INFLGHV+ CG+
Subjt:  TTCVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQ

Query:  ISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSNDCQMAFENMKTTMTRGSVLGLVDVTKPFEVETNASDFAL
        I M+  KI AI++W +P SVSELRSFLGLANYYRRFVEGFS+RA+PLTELLKKD  W+W  +CQ AF+ +K  +  G +LG+ DVTKPFEVET+ASD+AL
Subjt:  ISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSNDCQMAFENMKTTMTRGSVLGLVDVTKPFEVETNASDFAL

Query:  GGVLIQEGHPIAYESRKLNDAE
        GGVL+Q GHPIAYESRKLN AE
Subjt:  GGVLIQEGHPIAYESRKLNDAE

A0A5D3C9P8 Reverse transcriptase, e-value 2.0e-154, identity 81.06%
Query:  MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
        MAPPELAELRKQLDELL AGFIRPAKAPYGAPVLFQKKKDG+LRLCIDYRALNK+TVRNKYPLPII+DLFD+LHGAKYF+KLDLRSGYYQVRIAEGDEPK
Subjt:  MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK

Query:  TTCVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQ
        TTCVTRYGAFEFLVMPFGLTNAPATFCTL+NQVF+EYLD+FV+VYLDDIVVYSTT+EEH+ HL+ VF KL++NQLYVK+EKC+FAQ  INFLGHV+ CG+
Subjt:  TTCVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQ

Query:  ISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSNDCQMAFENMKTTMTRGSVLGLVDVTKPFEVETNASDFAL
        I M+  KI AI++W +P SVSELRSFLGLANYYRRFVEGFS+RA+PLTELLKKD  W+W  +CQ AF+ +K  +  G +LG+ DVTKPFEVET+ASD+AL
Subjt:  ISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSNDCQMAFENMKTTMTRGSVLGLVDVTKPFEVETNASDFAL

Query:  GGVLIQEGHPIAYESRKLNDAE
        GGVL+Q GHPIAYESRKLN AE
Subjt:  GGVLIQEGHPIAYESRKLNDAE

A0A6J1D906 Reverse transcriptase, e-value 5.9e-159, identity 84.78%
Query:  MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
        MAPPELAELRKQLDELL AGFIRPAKAP+GAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPII+DLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
Subjt:  MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK

Query:  TTCVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQ
        TTCVTRYGAFEFLVM FGLTNAPATFCT++NQVF+EYLDQFV+VYLDDIVVYS TL+EH++HL+LVFDKLRQNQLYVKKEKCAFAQ  I FLGHV+  GQ
Subjt:  TTCVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQ

Query:  ISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSNDCQMAFENMKTTMTRGSVLGLVDVTKPFEVETNASDFAL
        ISMD+DK+KAIQEW+VPTSV ELRSFLGLANYYRRF+EGFSRRA P+TELLKK   W WS + Q AFE++K  M +G VLGL DVTKPFEVET+ASD+AL
Subjt:  ISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSNDCQMAFENMKTTMTRGSVLGLVDVTKPFEVETNASDFAL

Query:  GGVLIQEGHPIAYESRKLNDAE
        GGVL+Q+ HPI YESRKLN+AE
Subjt:  GGVLIQEGHPIAYESRKLNDAE

A0A6J1DK29 uncharacterized protein LOC111021829, e-value 4.7e-156, identity 83.23%
Query:  MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
        MAPP+LAELRKQLDEL   GFIRPAKAP+GA VLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPII+DLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
Subjt:  MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK

Query:  TTCVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQ
        TTCVTRYGAFEFLVMPFGLTNAPATFCT++NQVF+EYLDQFV+VYLDDIVVYS TL+EH++HL+LVFDKLRQNQLYVKKEKCAFAQ  I FLGHV+  GQ
Subjt:  TTCVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQ

Query:  ISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSNDCQMAFENMKTTMTRGSVLGLVDVTKPFEVETNASDFAL
        ISMD+DK+K IQEW+VPTSV+ELRSFLGLANYYRRF+EGFSRRA P+TELL K     WS + Q AFE++K  M +G VLGL DVTKPFEVET+ASD+AL
Subjt:  ISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSNDCQMAFENMKTTMTRGSVLGLVDVTKPFEVETNASDFAL

Query:  GGVLIQEGHPIAYESRKLNDAE
        GGVL+Q+ HPIAYESRKLN+AE
Subjt:  GGVLIQEGHPIAYESRKLNDAE

A0A6J1IEF9 uncharacterized protein LOC111474945, e-value 8.8e-163, identity 89.75%
Query:  MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
        MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
Subjt:  MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK

Query:  TTCVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQ
        TTCVTRYGAFEFLVMPFGLTNAPATFCTL+NQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQL                      CGQ
Subjt:  TTCVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQ

Query:  ISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSNDCQMAFENMKTTMTRGSVLGLVDVTKPFEVETNASDFAL
        ISMDSDKIKAIQEWKVPTSVS+LRSFLGLANYYRRFVEGFSRRAAPLTELLKKD+ WSWS+DCQMAFE++KTTMTRG VLGLVDVTKPFE+ET+ASDFAL
Subjt:  ISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSNDCQMAFENMKTTMTRGSVLGLVDVTKPFEVETNASDFAL

Query:  GGVLIQEGHPIAYESRKLNDAE
        GGVLIQEGHPIA+ESRKLNDAE
Subjt:  GGVLIQEGHPIAYESRKLNDAE

SwissProt top hits (e-value, % identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6, e-value 6.1e-68, identity 40.19%
Query:  ELRKQLDELLKAGFIRPAKAPYGAPV-LFQKKKDGT----LRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTT
        E+  Q+ ++L  G IR + +PY +P+ +  KK+D +     R+ IDYR LN++TV +++P+P + ++  +L    YFT +DL  G++Q+ +      KT 
Subjt:  ELRKQLDELLKAGFIRPAKAPYGAPV-LFQKKKDGT----LRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTT

Query:  CVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQIS
          T++G +E+L MPFGL NAPATF   +N +    L++  +VYLDDI+V+ST+L+EH   L LVF+KL +  L ++ +KC F +    FLGHV+    I 
Subjt:  CVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQIS

Query:  MDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSN-DCQMAFENMKTTMTRGSVLGLVDVTKPFEVETNASDFALG
         + +KI+AIQ++ +PT   E+++FLGL  YYR+F+  F+  A P+T+ LKK+     +N +   AF+ +K  ++   +L + D TK F + T+ASD ALG
Subjt:  MDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSN-DCQMAFENMKTTMTRGSVLGLVDVTKPFEVETNASDFALG

Query:  GVLIQEGHPIAYESRKLNDAE
         VL Q+GHP++Y SR LN+ E
Subjt:  GVLIQEGHPIAYESRKLNDAE

P0CT34 Transposon Tf2-1 polyprotein, e-value 1.8e-64, identity 38.23%
Query:  MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
        + P ++  +  ++++ LK+G IR +KA    PV+F  KK+GTLR+ +DY+ LNK    N YPLP+I  L  ++ G+  FTKLDL+S Y+ +R+ +GDE K
Subjt:  MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK

Query:  TTCVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQ
               G FE+LVMP+G++ APA F   IN +  E  +  V+ Y+DDI+++S +  EH  H+K V  KL+   L + + KC F Q+ + F+G+ +    
Subjt:  TTCVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQ

Query:  ISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSNDCQMAFENMKTTMTRGSVLGLVDVTKPFEVETNASDFAL
         +   + I  + +WK P +  ELR FLG  NY R+F+   S+   PL  LLKKD  W W+     A EN+K  +    VL   D +K   +ET+ASD A+
Subjt:  ISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSNDCQMAFENMKTTMTRGSVLGLVDVTKPFEVETNASDFAL

Query:  GGVLIQEG-----HPIAYESRKLNDAE
        G VL Q+      +P+ Y S K++ A+
Subjt:  GGVLIQEG-----HPIAYESRKLNDAE

P0CT35 Transposon Tf2-2 polyprotein, e-value 1.8e-64, identity 38.23%
Query:  MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
        + P ++  +  ++++ LK+G IR +KA    PV+F  KK+GTLR+ +DY+ LNK    N YPLP+I  L  ++ G+  FTKLDL+S Y+ +R+ +GDE K
Subjt:  MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK

Query:  TTCVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQ
               G FE+LVMP+G++ APA F   IN +  E  +  V+ Y+DDI+++S +  EH  H+K V  KL+   L + + KC F Q+ + F+G+ +    
Subjt:  TTCVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQ

Query:  ISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSNDCQMAFENMKTTMTRGSVLGLVDVTKPFEVETNASDFAL
         +   + I  + +WK P +  ELR FLG  NY R+F+   S+   PL  LLKKD  W W+     A EN+K  +    VL   D +K   +ET+ASD A+
Subjt:  ISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSNDCQMAFENMKTTMTRGSVLGLVDVTKPFEVETNASDFAL

Query:  GGVLIQEG-----HPIAYESRKLNDAE
        G VL Q+      +P+ Y S K++ A+
Subjt:  GGVLIQEG-----HPIAYESRKLNDAE

P0CT41 Transposon Tf2-12 polyprotein, e-value 1.8e-64, identity 38.23%
Query:  MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK
        + P ++  +  ++++ LK+G IR +KA    PV+F  KK+GTLR+ +DY+ LNK    N YPLP+I  L  ++ G+  FTKLDL+S Y+ +R+ +GDE K
Subjt:  MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPK

Query:  TTCVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQ
               G FE+LVMP+G++ APA F   IN +  E  +  V+ Y+DDI+++S +  EH  H+K V  KL+   L + + KC F Q+ + F+G+ +    
Subjt:  TTCVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQ

Query:  ISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSNDCQMAFENMKTTMTRGSVLGLVDVTKPFEVETNASDFAL
         +   + I  + +WK P +  ELR FLG  NY R+F+   S+   PL  LLKKD  W W+     A EN+K  +    VL   D +K   +ET+ASD A+
Subjt:  ISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSNDCQMAFENMKTTMTRGSVLGLVDVTKPFEVETNASDFAL

Query:  GGVLIQEG-----HPIAYESRKLNDAE
        G VL Q+      +P+ Y S K++ A+
Subjt:  GGVLIQEG-----HPIAYESRKLNDAE

P20825 Retrovirus-related Pol polyprotein from transposon 297, e-value 4.8e-65, identity 40.5%
Query:  ELRKQLDELLKAGFIRPAKAPYGAPV-LFQKKKDGT----LRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTT
        E+  Q+ E+L  G IR + +PY +P  +  KK D +     R+ IDYR LN++T+ ++YP+P + ++  +L   +YFT +DL  G++Q+ + E    KT 
Subjt:  ELRKQLDELLKAGFIRPAKAPYGAPV-LFQKKKDGT----LRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTT

Query:  CVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQIS
          T+ G +E+L MPFGL NAPATF   +N +    L++  +VYLDDI+++ST+L EH   ++LVF KL    L ++ +KC F +   NFLGH+V    I 
Subjt:  CVTRYGAFEFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQIS

Query:  MDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSN-DCQMAFENMKTTMTRGSVLGLVDVTKPFEVETNASDFALG
         +  K+KAI  + +PT   E+R+FLGL  YYR+F+  ++  A P+T  LKK         +   AFE +K  + R  +L L D  K F + T+AS+ ALG
Subjt:  MDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSN-DCQMAFENMKTTMTRGSVLGLVDVTKPFEVETNASDFALG

Query:  GVLIQEGHPIAYESRKLNDAE
         VL Q GHPI++ SR LND E
Subjt:  GVLIQEGHPIAYESRKLNDAE

Arabidopsis top hits (e-value, % identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein, e-value 1.5e-21, identity 40.77%
Query:  HLKLVFDKLRQNQLYVKKEKCAFAQTCINFLG--HVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSW
        HL +V     Q+Q Y  ++KCAF Q  I +LG  H++    +S D  K++A+  W  P + +ELR FLGL  YYRRFV+ + +   PLTELLKK+    W
Subjt:  HLKLVFDKLRQNQLYVKKEKCAFAQTCINFLG--HVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSW

Query:  SNDCQMAFENMKTTMTRGSVLGLVDVTKPF
        +    +AF+ +K  +T   VL L D+  PF
Subjt:  SNDCQMAFENMKTTMTRGSVLGLVDVTKPF
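
Each alignment above carries an unlabelled match line between the Query and Subjt rows. The sketch below shows one way to summarise such a line. It assumes the usual BLAST-style convention (a letter marks an identical residue, `+` a conservative substitution, a blank a mismatch or gap) and that the line has already been trimmed to the same columns as the query block. The function name `match_line_stats` is a hypothetical helper.

```python
def match_line_stats(match_line: str) -> dict:
    """Count identical and positive columns in one BLAST-style match line.

    ASSUMPTION: letters = identities, '+' = positives, anything else = mismatch or gap.
    """
    length = len(match_line)
    if length == 0:
        raise ValueError("empty match line")
    identical = sum(ch.isalpha() for ch in match_line)
    positives = identical + match_line.count("+")
    return {
        "length": length,
        "identical": identical,
        "positives": positives,
        "identity_pct": round(100 * identical / length, 2),
    }


# Last block of the XP_022150099.1 alignment (22 columns).
stats = match_line_stats("GGVL+Q+ HPI YESRKLN+AE")
print(stats)
# {'length': 22, 'identical': 17, 'positives': 20, 'identity_pct': 77.27}
```

This figure covers a single 22-column block only; the % identity reported for each hit is computed over the whole alignment, so summing the counts across all blocks of a hit would be needed to compare against it.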


Sequences
CDS sequence
ATGGCTCCGCCCGAGCTAGCCGAATTGAGGAAACAACTGGACGAGTTGCTGAAGGCGGGATTCATTCGCCCGGCAAAGGCACCTTATGGAGCCCCCGTATTGTTCCAGAA
GAAGAAGGATGGGACGTTGCGTCTGTGCATAGACTATAGAGCCTTAAACAAGGTGACAGTACGCAACAAATATCCTCTGCCGATAATATCCGACTTGTTCGACCAACTTC
ACGGGGCCAAGTACTTCACGAAGTTGGACTTACGATCAGGATATTATCAAGTACGTATTGCCGAAGGGGACGAACCCAAGACGACGTGCGTAACAAGATATGGGGCCTTT
GAATTCCTAGTAATGCCCTTTGGCTTGACGAACGCCCCAGCTACGTTCTGCACGTTGATCAACCAAGTTTTCTATGAATACCTGGATCAATTCGTCATAGTATACCTCGA
CGACATTGTTGTTTACAGCACAACCCTCGAGGAACACAAAGTACACTTGAAGCTAGTGTTTGACAAGCTCCGGCAGAACCAGTTGTATGTGAAAAAGGAGAAATGTGCAT
TCGCACAAACATGCATCAACTTCCTTGGACATGTCGTCAGATGTGGACAAATTAGTATGGATAGCGACAAGATAAAAGCTATCCAAGAATGGAAGGTTCCTACTTCCGTA
TCTGAGTTGCGGTCCTTCCTAGGACTAGCCAACTACTATAGGCGGTTCGTCGAAGGGTTTTCACGACGAGCCGCCCCATTGACAGAGCTGTTGAAGAAAGACTACCCTTG
GTCGTGGTCGAATGATTGTCAAATGGCCTTTGAAAATATGAAAACAACCATGACGAGGGGTTCTGTCCTCGGGTTGGTAGACGTCACAAAGCCATTTGAAGTAGAAACCA
ATGCTTCCGACTTTGCTCTCGGTGGGGTCCTTATTCAAGAAGGCCACCCCATCGCTTACGAAAGTCGAAAGCTCAACGATGCCGAATGA
mRNA sequence
ATGGCTCCGCCCGAGCTAGCCGAATTGAGGAAACAACTGGACGAGTTGCTGAAGGCGGGATTCATTCGCCCGGCAAAGGCACCTTATGGAGCCCCCGTATTGTTCCAGAA
GAAGAAGGATGGGACGTTGCGTCTGTGCATAGACTATAGAGCCTTAAACAAGGTGACAGTACGCAACAAATATCCTCTGCCGATAATATCCGACTTGTTCGACCAACTTC
ACGGGGCCAAGTACTTCACGAAGTTGGACTTACGATCAGGATATTATCAAGTACGTATTGCCGAAGGGGACGAACCCAAGACGACGTGCGTAACAAGATATGGGGCCTTT
GAATTCCTAGTAATGCCCTTTGGCTTGACGAACGCCCCAGCTACGTTCTGCACGTTGATCAACCAAGTTTTCTATGAATACCTGGATCAATTCGTCATAGTATACCTCGA
CGACATTGTTGTTTACAGCACAACCCTCGAGGAACACAAAGTACACTTGAAGCTAGTGTTTGACAAGCTCCGGCAGAACCAGTTGTATGTGAAAAAGGAGAAATGTGCAT
TCGCACAAACATGCATCAACTTCCTTGGACATGTCGTCAGATGTGGACAAATTAGTATGGATAGCGACAAGATAAAAGCTATCCAAGAATGGAAGGTTCCTACTTCCGTA
TCTGAGTTGCGGTCCTTCCTAGGACTAGCCAACTACTATAGGCGGTTCGTCGAAGGGTTTTCACGACGAGCCGCCCCATTGACAGAGCTGTTGAAGAAAGACTACCCTTG
GTCGTGGTCGAATGATTGTCAAATGGCCTTTGAAAATATGAAAACAACCATGACGAGGGGTTCTGTCCTCGGGTTGGTAGACGTCACAAAGCCATTTGAAGTAGAAACCA
ATGCTTCCGACTTTGCTCTCGGTGGGGTCCTTATTCAAGAAGGCCACCCCATCGCTTACGAAAGTCGAAAGCTCAACGATGCCGAATGA
Protein sequence
MAPPELAELRKQLDELLKAGFIRPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCVTRYGAF
EFLVMPFGLTNAPATFCTLINQVFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSV
SELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDYPWSWSNDCQMAFENMKTTMTRGSVLGLVDVTKPFEVETNASDFALGGVLIQEGHPIAYESRKLNDAE
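
The protein sequence above should correspond to a translation of the CDS. As a rough check, the sketch below translates a nucleotide string with the standard genetic code using only the Python standard library. It assumes the standard nuclear code applies to this gene and that the input starts in frame; `translate` is a hypothetical helper, not the method CuGenDB used.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt and last 15 nt of the CDS listed above.
print(translate("ATGGCTCCGCCCGAGCTAGCCGAATTGAGG"))  # MAPPELAELR
print(translate("AACGATGCCGAATGA"))                 # NDAE
```

Both fragments agree with the start (MAPPELAELR) and end (NDAE) of the listed protein; passing the full CDS, line breaks included, would extend the same check to the whole sequence.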