; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh16G008750 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh16G008750
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionReverse transcriptase
Genome locationCmo_Chr16:4698131..4700405
RNA-Seq ExpressionCmoCh16G008750
SyntenyCmoCh16G008750
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0005488 - binding (molecular function)
GO:0016740 - transferase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK07954.1 reverse transcriptase [Cucumis melo var. makuwa]1.6e-23757.59Show/hide
Query:  MGTLKFLSALQRKANPKEI-IKKWLMFVDAIINSRPSKSTLIDSGATHNFIADQEARRLGLTIEKDPGKMKAA------IVGVSKRAPFKLGTWTGEMDL
        MG LKFLS+LQ+K       +++ LM+VD  IN +P+KST++DSGATHNFI + EA+RL L  EKD G+MKA       I+G+ KR   +LG W+G +D 
Subjt:  MGTLKFLSALQRKANPKEI-IKKWLMFVDAIINSRPSKSTLIDSGATHNFIADQEARRLGLTIEKDPGKMKAA------IVGVSKRAPFKLGTWTGEMDL

Query:  VVVCMDDFDVVLGMDFLLEHEVIPMPLAKCLVIVDHNPTVIPASIKQPGNLRKISAIQLKRGLAQEEPTFMAIPLMEEMTTEGTVPSKIKEVLDSYADII
        VVV MDDFDVVLGM+FLLEH+VIPMPLAKCLVI    P+V+   ++QP  L+ ISA+QLK+GL+++EPTFMAIPL     +  TVP +I  VL+ Y D++
Subjt:  VVVCMDDFDVVLGMDFLLEHEVIPMPLAKCLVIVDHNPTVIPASIKQPGNLRKISAIQLKRGLAQEEPTFMAIPLMEEMTTEGTVPSKIKEVLDSYADII

Query:  PESLPQTLPPRRGIDHEIELLSGVKPPTKNAYRMAPPELAELRKQLDELLMAGFIPPTKEPYGAPVLFQKKKNGVLRLCIDYRALNKVTVRNKYPLPIIS
        P+SLP++LPPRR IDHEIEL+ G KPP KNAYRMAPPELAELRKQLDELL AGFI P K PYGAPVLFQKKK+G LRLCIDYRALNK+TVRNKYPLPII+
Subjt:  PESLPQTLPPRRGIDHEIELLSGVKPPTKNAYRMAPPELAELRKQLDELLMAGFIPPTKEPYGAPVLFQKKKNGVLRLCIDYRALNKVTVRNKYPLPIIS

Query:  NLFDQLHGAKYFTKLD----YDQ---------------------------GTTN----------------------------------------------
        +LFD+LHGAKYF+KLD    Y Q                           G TN                                              
Subjt:  NLFDQLHGAKYFTKLD----YDQ---------------------------GTTN----------------------------------------------

Query:  ------------------------------CGQIGIDSNKNKAIQEWKIPTSVSYVRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSNECQMVF
                                      CG+IG++  K  AI++W +P SVS +RSFLGLANYYR FVEGFS+ A+PLTELLKKD  W+W  ECQ  F
Subjt:  ------------------------------CGQIGIDSNKNKAIQEWKIPTSVSYVRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSNECQMVF

Query:  EDLKATMTKGLVFELVDVTKPFEIETDASDFALDGVLVQEGHPIAYESRKLNDAERRYTVSEKEMLDVAHCLRVWRQYLLGSQFIVKTDNSAICHFFDQP
        + LK  + +G +  + DVTKPFE+ETDASD+AL GVL+Q GHPIAYESRKLN AERRYTVSEKEML V HCLR WRQYLLGS F+VKTDNSA CHFF QP
Subjt:  EDLKATMTKGLVFELVDVTKPFEIETDASDFALDGVLVQEGHPIAYESRKLNDAERRYTVSEKEMLDVAHCLRVWRQYLLGSQFIVKTDNSAICHFFDQP

Query:  KLTAKQARWQESLAEFDFKFEHKAGKSNQATDVLSRKGEHAALCMIAHIHSSKIDGSMRDIIKEHLHKNPSVKAVVELAKAGKTQQFWIKGDLLMTKGNR
        KLT+KQARWQE LAEFDF+FEHK G SNQA D LSRK EHAA+C++AH+  S+I GS+RD ++E L K+ + + V+ LAKAGKT+QFW++ DLL+TKGNR
Subjt:  KLTAKQARWQESLAEFDFKFEHKAGKSNQATDVLSRKGEHAALCMIAHIHSSKIDGSMRDIIKEHLHKNPSVKAVVELAKAGKTQQFWIKGDLLMTKGNR

Query:  LYIPRTGELRKKLIQECHDTLWVGHPRWQRTYALIKRG
        LY+PR G LRKKL+ ECHDTLW GHP WQRTYAL+K+G
Subjt:  LYIPRTGELRKKLIQECHDTLWVGHPRWQRTYALIKRG

XP_022150099.1 uncharacterized protein LOC111018360 [Momordica charantia]1.0e-25762.06Show/hide
Query:  MGTLKFLSALQRKAN-PKEIIKKWLMFVDAIINSRPSKSTLIDSGATHNFIADQEARRLGLTIEKDPGKMKAA------IVGVSKRAPFKLGTWTGEMDL
        M  LKFLSA+Q++ N PK   +K LMFVDA IN   +KST++DSGATHNFI++QEARRL LTIEKD GKMK        IVGVSKR   KLGTWTG +D 
Subjt:  MGTLKFLSALQRKAN-PKEIIKKWLMFVDAIINSRPSKSTLIDSGATHNFIADQEARRLGLTIEKDPGKMKAA------IVGVSKRAPFKLGTWTGEMDL

Query:  VVVCMDDFDVVLGMDFLLEHEVIPMPLAKCLVIVDHNPTVIPASIKQPGNLRKISAIQLKRGLAQEEPTFMAIPLMEEMTTEGTVPSKIKEVLDSYADII
        VVV MDDFDVVLGM+FL+EH+VIPMPLAKC+++  ++PTV+  SIKQPG +R ISA+QLK+GL +EEPTFMAIP++E+      VP +I+ V+  Y DI+
Subjt:  VVVCMDDFDVVLGMDFLLEHEVIPMPLAKCLVIVDHNPTVIPASIKQPGNLRKISAIQLKRGLAQEEPTFMAIPLMEEMTTEGTVPSKIKEVLDSYADII

Query:  PESLPQTLPPRRGIDHEIELLSGVKPPTKNAYRMAPPELAELRKQLDELLMAGFIPPTKEPYGAPVLFQKKKNGVLRLCIDYRALNKVTVRNKYPLPIIS
        P+SLP+TLPPRRGIDHEIEL+ G KPP KNAYRMAPPELAELRKQLDELL AGFI P K P+GAPVLFQKKK+G LRLCIDYRALNKVTVRNKYPLPII+
Subjt:  PESLPQTLPPRRGIDHEIELLSGVKPPTKNAYRMAPPELAELRKQLDELLMAGFIPPTKEPYGAPVLFQKKKNGVLRLCIDYRALNKVTVRNKYPLPIIS

Query:  NLFDQLHGAKYFTKLDY------------DQGTTNC----------------------------------------------------------------
        +LFDQLHGAKYFTKLD             D+  T C                                                                
Subjt:  NLFDQLHGAKYFTKLDY------------DQGTTNC----------------------------------------------------------------

Query:  -------------------------------GQIGIDSNKNKAIQEWKIPTSVSYVRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSNECQMVF
                                       GQI +D++K KAIQEW++PTSV  +RSFLGLANYYR F+EGFS+ ATP+TELLKK   W WS E Q  F
Subjt:  -------------------------------GQIGIDSNKNKAIQEWKIPTSVSYVRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSNECQMVF

Query:  EDLKATMTKGLVFELVDVTKPFEIETDASDFALDGVLVQEGHPIAYESRKLNDAERRYTVSEKEMLDVAHCLRVWRQYLLGSQFIVKTDNSAICHFFDQP
        EDLKA M KG V  L DVTKPFE+ETDASD+AL GVL+Q+ HPI YESRKLN+AERRYTVSEKEML V HCLR WRQYLLGS F+VKTDNSAICHFF+QP
Subjt:  EDLKATMTKGLVFELVDVTKPFEIETDASDFALDGVLVQEGHPIAYESRKLNDAERRYTVSEKEMLDVAHCLRVWRQYLLGSQFIVKTDNSAICHFFDQP

Query:  KLTAKQARWQESLAEFDFKFEHKAGKSNQATDVLSRKGEHAALCMIAHIHSSKIDGSMRDIIKEHLHKNPSVKAVVELAKAGKTQQFWIKGDLLMTKGNR
        KLT+KQARWQE LAEFDFKFEHKAGKSNQA D LSRKGEHA LCM+AHIH+SK DGS+RD+I E+L   PS + VVELAK  KT+QFW++GDLL T+GN 
Subjt:  KLTAKQARWQESLAEFDFKFEHKAGKSNQATDVLSRKGEHAALCMIAHIHSSKIDGSMRDIIKEHLHKNPSVKAVVELAKAGKTQQFWIKGDLLMTKGNR

Query:  LYIPRTGELRKKLIQECHDTLWVGHPRWQRTYALIKRG
        LY+PRTG LRKKL+ ECHDTLW GHP WQRTYAL+K+G
Subjt:  LYIPRTGELRKKLIQECHDTLWVGHPRWQRTYALIKRG

XP_022155185.1 uncharacterized protein LOC111022320 [Momordica charantia]2.8e-24259.63Show/hide
Query:  MGTLKFLSALQRKANP-KEIIKKWLMFVDAIINSRPSKSTLIDSGATHNFIADQEARRLGLTIEKDPGKMKAA------IVGVSKRAPFKLGTWTGEMDL
        MG LKFLSALQ+KA   KE +++ LM+V+A +N + +KST++DSGATHNF+ + EARRL L  +KDPGKMKA       I+GV+KR   KLGTW+G +D 
Subjt:  MGTLKFLSALQRKANP-KEIIKKWLMFVDAIINSRPSKSTLIDSGATHNFIADQEARRLGLTIEKDPGKMKAA------IVGVSKRAPFKLGTWTGEMDL

Query:  VVVCMDDFDVVLGMDFLLEHEVIPMPLAKCLVIVDHNPTVIPASIKQPGNLRKISAIQLKRGLAQEEPTFMAIPLMEEMTTEGTVPSKIKEVLDSYADII
        V+V MDDFDVVLG+ FLLEH+VIPMPLAKCLV+   +P V+  SIKQP  ++ ISA+QLK+G+AQ+EPTFMAIP+ E   +E  VP +I+ VL++YAD++
Subjt:  VVVCMDDFDVVLGMDFLLEHEVIPMPLAKCLVIVDHNPTVIPASIKQPGNLRKISAIQLKRGLAQEEPTFMAIPLMEEMTTEGTVPSKIKEVLDSYADII

Query:  PESLPQTLPPRRGIDHEIELLSGVKPPTKNAYRMAPPELAELRKQLDELLMAGFIPPTKEPYGAPVLFQKKKNGVLRLCIDYRALNKVTVRNKYPLPIIS
        P++LP+ LPPR GIDHEIELL G KPP KNAYRMAPPELAELRKQLDELL AGFI P K PYGAPVLFQKKK+  LRLCIDYRALNK+ VRNK PLPII+
Subjt:  PESLPQTLPPRRGIDHEIELLSGVKPPTKNAYRMAPPELAELRKQLDELLMAGFIPPTKEPYGAPVLFQKKKNGVLRLCIDYRALNKVTVRNKYPLPIIS

Query:  NLFDQLHGAKYFTKLDYDQG-----------------------------------------------------------------TTNCGQIGIDSNKNK
        +LFDQLHGAKYF+KLD   G                                                                    CG+IG++  K K
Subjt:  NLFDQLHGAKYFTKLDYDQG-----------------------------------------------------------------TTNCGQIGIDSNKNK

Query:  AIQEWKIPTSVSYVRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSNECQMVFEDLKATMTKGLVFELVDVTKPFEIETDASDFALDGVLVQEGH
        AIQEWKIPTS++ +RSFLGLANYYR FVEGFS+   PLT+LLKK+  W+W+ EC   FE LK  M +G V  + DVT+PFE+ETDASDFAL GVL+Q+GH
Subjt:  AIQEWKIPTSVSYVRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSNECQMVFEDLKATMTKGLVFELVDVTKPFEIETDASDFALDGVLVQEGH

Query:  PIAYESRKLNDAERRYTVSEKEMLDVAHCLRVWRQYLLGSQFIVKTDNSAICHFFDQPKLTAKQARWQESLAEFDFKFEHKAGKSNQATDVLSRKGEHAA
        PIAYES+KLNDAERRY  SEKEML V HCLR WRQYLLG++F+VKTDNS++CHFF+QPKL++KQARWQE LAEFDF+FEHK G++NQA D LSRK E AA
Subjt:  PIAYESRKLNDAERRYTVSEKEMLDVAHCLRVWRQYLLGSQFIVKTDNSAICHFFDQPKLTAKQARWQESLAEFDFKFEHKAGKSNQATDVLSRKGEHAA

Query:  LCMIAHIHSSKIDGSMRDIIKEHLHKNPSVKAVVELAKAGKTQQFWIKGDLLMTKGNRLYIPRTGELRKKLIQECHDTLWVGHPRWQRTYALIKRG
        LCM+AH+ +SK+ GS+R+ I+E+L  +P+ +A+++LA  G T+QF ++ DL  TKGN LY+PR+G LRK L+ ECHDT+W GH  WQRTYAL+K+G
Subjt:  LCMIAHIHSSKIDGSMRDIIKEHLHKNPSVKAVVELAKAGKTQQFWIKGDLLMTKGNRLYIPRTGELRKKLIQECHDTLWVGHPRWQRTYALIKRG

XP_023524533.1 uncharacterized protein LOC111788429 [Cucurbita pepo subsp. pepo]3.9e-28972.79Show/hide
Query:  MGTLKFLSALQRKANPKEIIKKWLMFVDAIINSRPSKSTLIDSGATHNFIADQEARRLGLTIEKDPGKMKAA------IVGVSKRAPFKLGTWTGEMDLV
        MG LKFLSALQRK  PKEII+K LMFVDA INSR SKSTLIDSGATHNFIADQEARRLGLTI KDPGKMKA       IVGVSK  PFK+G WTGE+DLV
Subjt:  MGTLKFLSALQRKANPKEIIKKWLMFVDAIINSRPSKSTLIDSGATHNFIADQEARRLGLTIEKDPGKMKAA------IVGVSKRAPFKLGTWTGEMDLV

Query:  VVCMDDFDVVLGMDFLLEHEVIPMPLAKCLVIVDHNPTVIPASIKQPGNLRKISAIQLKRGLAQEEPTFMAIPLMEEMTTEGTVPSKIKEVLDSYADIIP
        VV MDDFDVVLGM+FLLEH+VIPMPLAKCLVI D NPTVIPASIKQPGNLR ISAIQLKRGLA+EEPTFMAIPL+EE TTE TVP +IKEVLD+Y DI+P
Subjt:  VVCMDDFDVVLGMDFLLEHEVIPMPLAKCLVIVDHNPTVIPASIKQPGNLRKISAIQLKRGLAQEEPTFMAIPLMEEMTTEGTVPSKIKEVLDSYADIIP

Query:  ESLPQTLPPRRGIDHEIELLSGVKPPTKNAYRMAPPELAELRKQLDELLMAGFIPPTKEPYGAPVLFQKKKNGVLRLCIDYRALNKVTVRNKYPLPIISN
        ESLPQTLPPRRGIDHEIELL GVKPP KNAYRMAPPELAELRKQLDELL AGFI P K PYGAPVLFQKKK+G LRLCIDYRALNKVTVRNKYPLPIIS+
Subjt:  ESLPQTLPPRRGIDHEIELLSGVKPPTKNAYRMAPPELAELRKQLDELLMAGFIPPTKEPYGAPVLFQKKKNGVLRLCIDYRALNKVTVRNKYPLPIISN

Query:  LFDQLHGAKYFTKLDYDQG---------------------------------------------------------------------------------
        LFDQLHGAKYFTKLD   G                                                                                 
Subjt:  LFDQLHGAKYFTKLDYDQG---------------------------------------------------------------------------------

Query:  --------------------------TTNCGQIGIDSNKNKAIQEWKIPTSVSYVRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSNECQMVFE
                                     CGQI +DS+K KAIQEWK+PTSVS +RSFLGLANYYR FVEGFS+ A PLTELLKKDHPWSWSN+CQM FE
Subjt:  --------------------------TTNCGQIGIDSNKNKAIQEWKIPTSVSYVRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSNECQMVFE

Query:  DLKATMTKGLVFELVDVTKPFEIETDASDFALDGVLVQEGHPIAYESRKLNDAERRYTVSEKEMLDVAHCLRVWRQYLLGSQFIVKTDNSAICHFFDQPK
        +LK TMT+G V  LVDVTKPFE+ETDASDFAL GVL+QEGHPIAYESRKLNDAERRYTVSEKEML V HCLRVWRQYLLGSQF+VKTDNSA CHFFDQPK
Subjt:  DLKATMTKGLVFELVDVTKPFEIETDASDFALDGVLVQEGHPIAYESRKLNDAERRYTVSEKEMLDVAHCLRVWRQYLLGSQFIVKTDNSAICHFFDQPK

Query:  LTAKQARWQESLAEFDFKFEHKAGKSNQATDVLSRKGEHAALCMIAHIHSSKIDGSMRDIIKEHLHKNPSVKAVVELAKAGKTQQFWIKGDLLMTKGNRL
        LTAKQARWQESLAEFDFKFEHKAGKSNQA D LSRKGEHAALCM+AHIHSSKIDGSMRDIIKEHLHK+PS K VVELAKAGKT+QFW++GDLL+TKGNRL
Subjt:  LTAKQARWQESLAEFDFKFEHKAGKSNQATDVLSRKGEHAALCMIAHIHSSKIDGSMRDIIKEHLHKNPSVKAVVELAKAGKTQQFWIKGDLLMTKGNRL

Query:  YIPRTGELRKKLI
        Y+PRTGELRKKLI
Subjt:  YIPRTGELRKKLI

XP_023537907.1 uncharacterized protein LOC111798805 [Cucurbita pepo subsp. pepo]3.9e-30573.41Show/hide
Query:  MGTLKFLSALQRKANPKEIIKKWLMFVDAIINSRPSKSTLIDSGATHNFIADQEARRLGLTIEKDPGKMKAA------IVGVSKRAPFKLGTWTGEMDLV
        MG LKFLSALQRK  PKEII+K LMFVDA INSR SKSTLIDSGATHNFIADQEARRLGLTI KDPGKMKA       IVGVSK  PFK+G WTGE+DLV
Subjt:  MGTLKFLSALQRKANPKEIIKKWLMFVDAIINSRPSKSTLIDSGATHNFIADQEARRLGLTIEKDPGKMKAA------IVGVSKRAPFKLGTWTGEMDLV

Query:  VVCMDDFDVVLGMDFLLEHEVIPMPLAKCLVIVDHNPTVIPASIKQPGNLRKISAIQLKRGLAQEEPTFMAIPLMEEMTTEGTVPSKIKEVLDSYADIIP
        VV MDDFDVVLGM+FLLEH+VIPMPLAKCLVI D NPTVIPASIKQPGNLR ISAIQLKRGLA+EEPTFMAIPL+EE TTE TVP +IK+VLDSY DI+P
Subjt:  VVCMDDFDVVLGMDFLLEHEVIPMPLAKCLVIVDHNPTVIPASIKQPGNLRKISAIQLKRGLAQEEPTFMAIPLMEEMTTEGTVPSKIKEVLDSYADIIP

Query:  ESLPQTLPPRRGIDHEIELLSGVKPPTKNAYRMAPPELAELRKQLDELLMAGFIPPTKEPYGAPVLFQKKKNGVLRLCIDYRALNKVTVRNKYPLPIISN
        ESLPQTLPPRRGIDHEIELL GVKPP KNAYRMAPPELAELRKQLDELL AGFI P K PYGAPVLFQKKK+G LRLCIDYRALNKVTVRNKYPLPIIS+
Subjt:  ESLPQTLPPRRGIDHEIELLSGVKPPTKNAYRMAPPELAELRKQLDELLMAGFIPPTKEPYGAPVLFQKKKNGVLRLCIDYRALNKVTVRNKYPLPIISN

Query:  LFDQLHGAKYFTKLDYDQG---------------------------------------------------------------------------------
        LFDQLHGAKYFTKLD   G                                                                                 
Subjt:  LFDQLHGAKYFTKLDYDQG---------------------------------------------------------------------------------

Query:  --------------------------TTNCGQIGIDSNKNKAIQEWKIPTSVSYVRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSNECQMVFE
                                     CGQI +DS+K KAIQEWK+PTSVS +RSFLGLANYYR FVEGFS+ A PLTELLKKDHPWSWSN+CQM FE
Subjt:  --------------------------TTNCGQIGIDSNKNKAIQEWKIPTSVSYVRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSNECQMVFE

Query:  DLKATMTKGLVFELVDVTKPFEIETDASDFALDGVLVQEGHPIAYESRKLNDAERRYTVSEKEMLDVAHCLRVWRQYLLGSQFIVKTDNSAICHFFDQPK
        +LK TMT+G V  LVDVTKPFE+ETDASDFAL GVL+QEGHPIAYESRKLNDAERRYTVSEKEML V HCLRVWRQYLLGSQF+VKTDNSA CHFFDQPK
Subjt:  DLKATMTKGLVFELVDVTKPFEIETDASDFALDGVLVQEGHPIAYESRKLNDAERRYTVSEKEMLDVAHCLRVWRQYLLGSQFIVKTDNSAICHFFDQPK

Query:  LTAKQARWQESLAEFDFKFEHKAGKSNQATDVLSRKGEHAALCMIAHIHSSKIDGSMRDIIKEHLHKNPSVKAVVELAKAGKTQQFWIKGDLLMTKGNRL
        LTAKQARWQESLAEFDFKFEHKAGKSNQA D LSRKGEHAALCM+AHIHSSKIDGSMRDIIKEHLHK+PS KAVVELAKAGKT+QFW++GDLL+TKGNRL
Subjt:  LTAKQARWQESLAEFDFKFEHKAGKSNQATDVLSRKGEHAALCMIAHIHSSKIDGSMRDIIKEHLHKNPSVKAVVELAKAGKTQQFWIKGDLLMTKGNRL

Query:  YIPRTGELRKKLIQECHDTLWVGHPRWQRTYALIKRG
        Y+PRTGELRKKLIQECHDTLW GHP WQRTYALIK+G
Subjt:  YIPRTGELRKKLIQECHDTLWVGHPRWQRTYALIKRG

TrEMBL top hitse value%identityAlignment
A0A5D3BRZ6 Reverse transcriptase1.3e-23757.45Show/hide
Query:  MGTLKFLSALQRKANPKEI-IKKWLMFVDAIINSRPSKSTLIDSGATHNFIADQEARRLGLTIEKDPGKMKAA------IVGVSKRAPFKLGTWTGEMDL
        MG LKFLS+LQ+K       +++ LM+VD  IN +P+KST++DSGATHNFI + EA+RL L  EKD G+MKA       I+G+ KR   +LG W+G +D 
Subjt:  MGTLKFLSALQRKANPKEI-IKKWLMFVDAIINSRPSKSTLIDSGATHNFIADQEARRLGLTIEKDPGKMKAA------IVGVSKRAPFKLGTWTGEMDL

Query:  VVVCMDDFDVVLGMDFLLEHEVIPMPLAKCLVIVDHNPTVIPASIKQPGNLRKISAIQLKRGLAQEEPTFMAIPLMEEMTTEGTVPSKIKEVLDSYADII
        VVV MDDFDVVLGM+FLLEH+VIPMPLAKCLVI    P+V+   ++QP  L+ ISA+QLK+GL+++EPTFMAIPL     +  TVP +I  VL+ Y D++
Subjt:  VVVCMDDFDVVLGMDFLLEHEVIPMPLAKCLVIVDHNPTVIPASIKQPGNLRKISAIQLKRGLAQEEPTFMAIPLMEEMTTEGTVPSKIKEVLDSYADII

Query:  PESLPQTLPPRRGIDHEIELLSGVKPPTKNAYRMAPPELAELRKQLDELLMAGFIPPTKEPYGAPVLFQKKKNGVLRLCIDYRALNKVTVRNKYPLPIIS
        P+SLP++LPPRR IDHEIEL+ G KPP KNAYRMAPPELAELRKQLDELL AGFI P K PYGAPVLFQ+KK+G LRLCIDYRALNK+TVRNKYPLPII+
Subjt:  PESLPQTLPPRRGIDHEIELLSGVKPPTKNAYRMAPPELAELRKQLDELLMAGFIPPTKEPYGAPVLFQKKKNGVLRLCIDYRALNKVTVRNKYPLPIIS

Query:  NLFDQLHGAKYFTKLD----YDQ---------------------------GTTN----------------------------------------------
        +LFD+LHGAKYF+KLD    Y Q                           G TN                                              
Subjt:  NLFDQLHGAKYFTKLD----YDQ---------------------------GTTN----------------------------------------------

Query:  ------------------------------CGQIGIDSNKNKAIQEWKIPTSVSYVRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSNECQMVF
                                      CG+IG++  K  AI++W +P SVS +RSFLGLANYYR FVEGFS+ A+PLTELLKKD  W+W  ECQ  F
Subjt:  ------------------------------CGQIGIDSNKNKAIQEWKIPTSVSYVRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSNECQMVF

Query:  EDLKATMTKGLVFELVDVTKPFEIETDASDFALDGVLVQEGHPIAYESRKLNDAERRYTVSEKEMLDVAHCLRVWRQYLLGSQFIVKTDNSAICHFFDQP
        + LK  + +G +  + DVTKPFE+ETDASD+AL GVL+Q GHPIAYESRKLN AERRYTVSEKEML V HCLR WRQYLLGS F+VKTDNSA CHFF QP
Subjt:  EDLKATMTKGLVFELVDVTKPFEIETDASDFALDGVLVQEGHPIAYESRKLNDAERRYTVSEKEMLDVAHCLRVWRQYLLGSQFIVKTDNSAICHFFDQP

Query:  KLTAKQARWQESLAEFDFKFEHKAGKSNQATDVLSRKGEHAALCMIAHIHSSKIDGSMRDIIKEHLHKNPSVKAVVELAKAGKTQQFWIKGDLLMTKGNR
        KLT+KQARWQE LAEFDF+FEHK G SNQA D LSRK EHAA+C++AH+  S+I GS+RD ++E L K+ + + V+ LAKAGKT+QFW++ DLL+TKGNR
Subjt:  KLTAKQARWQESLAEFDFKFEHKAGKSNQATDVLSRKGEHAALCMIAHIHSSKIDGSMRDIIKEHLHKNPSVKAVVELAKAGKTQQFWIKGDLLMTKGNR

Query:  LYIPRTGELRKKLIQECHDTLWVGHPRWQRTYALIKRG
        LY+PR G LRKKL+ ECHDTLW GHP WQRTYAL+K+G
Subjt:  LYIPRTGELRKKLIQECHDTLWVGHPRWQRTYALIKRG

A0A5D3C4R1 Reverse transcriptase1.3e-23757.45Show/hide
Query:  MGTLKFLSALQRKANPKEI-IKKWLMFVDAIINSRPSKSTLIDSGATHNFIADQEARRLGLTIEKDPGKMKAA------IVGVSKRAPFKLGTWTGEMDL
        MG LKFLS+LQ+K       +++ LM+VD  IN +P+KST++DSGATHNFI + EA+RL L  EKD G+MKA       I+G+ KR   +LG W+G +D 
Subjt:  MGTLKFLSALQRKANPKEI-IKKWLMFVDAIINSRPSKSTLIDSGATHNFIADQEARRLGLTIEKDPGKMKAA------IVGVSKRAPFKLGTWTGEMDL

Query:  VVVCMDDFDVVLGMDFLLEHEVIPMPLAKCLVIVDHNPTVIPASIKQPGNLRKISAIQLKRGLAQEEPTFMAIPLMEEMTTEGTVPSKIKEVLDSYADII
        VVV MDDFDVVLGM+FLLEH+VIPMPLAKCLVI    P+V+   ++QP  L+ ISA+QLK+GL+++EPTFMAIPL     +  TVP +I  VL+ Y D++
Subjt:  VVVCMDDFDVVLGMDFLLEHEVIPMPLAKCLVIVDHNPTVIPASIKQPGNLRKISAIQLKRGLAQEEPTFMAIPLMEEMTTEGTVPSKIKEVLDSYADII

Query:  PESLPQTLPPRRGIDHEIELLSGVKPPTKNAYRMAPPELAELRKQLDELLMAGFIPPTKEPYGAPVLFQKKKNGVLRLCIDYRALNKVTVRNKYPLPIIS
        P+SLP++LPPRR IDHEIEL+ G KPP KNAYRMAPPELAELRKQLDELL AGFI P K PYGAPVLFQ+KK+G LRLCIDYRALNK+TVRNKYPLPII+
Subjt:  PESLPQTLPPRRGIDHEIELLSGVKPPTKNAYRMAPPELAELRKQLDELLMAGFIPPTKEPYGAPVLFQKKKNGVLRLCIDYRALNKVTVRNKYPLPIIS

Query:  NLFDQLHGAKYFTKLD----YDQ---------------------------GTTN----------------------------------------------
        +LFD+LHGAKYF+KLD    Y Q                           G TN                                              
Subjt:  NLFDQLHGAKYFTKLD----YDQ---------------------------GTTN----------------------------------------------

Query:  ------------------------------CGQIGIDSNKNKAIQEWKIPTSVSYVRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSNECQMVF
                                      CG+IG++  K  AI++W +P SVS +RSFLGLANYYR FVEGFS+ A+PLTELLKKD  W+W  ECQ  F
Subjt:  ------------------------------CGQIGIDSNKNKAIQEWKIPTSVSYVRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSNECQMVF

Query:  EDLKATMTKGLVFELVDVTKPFEIETDASDFALDGVLVQEGHPIAYESRKLNDAERRYTVSEKEMLDVAHCLRVWRQYLLGSQFIVKTDNSAICHFFDQP
        + LK  + +G +  + DVTKPFE+ETDASD+AL GVL+Q GHPIAYESRKLN AERRYTVSEKEML V HCLR WRQYLLGS F+VKTDNSA CHFF QP
Subjt:  EDLKATMTKGLVFELVDVTKPFEIETDASDFALDGVLVQEGHPIAYESRKLNDAERRYTVSEKEMLDVAHCLRVWRQYLLGSQFIVKTDNSAICHFFDQP

Query:  KLTAKQARWQESLAEFDFKFEHKAGKSNQATDVLSRKGEHAALCMIAHIHSSKIDGSMRDIIKEHLHKNPSVKAVVELAKAGKTQQFWIKGDLLMTKGNR
        KLT+KQARWQE LAEFDF+FEHK G SNQA D LSRK EHAA+C++AH+  S+I GS+RD ++E L K+ + + V+ LAKAGKT+QFW++ DLL+TKGNR
Subjt:  KLTAKQARWQESLAEFDFKFEHKAGKSNQATDVLSRKGEHAALCMIAHIHSSKIDGSMRDIIKEHLHKNPSVKAVVELAKAGKTQQFWIKGDLLMTKGNR

Query:  LYIPRTGELRKKLIQECHDTLWVGHPRWQRTYALIKRG
        LY+PR G LRKKL+ ECHDTLW GHP WQRTYAL+K+G
Subjt:  LYIPRTGELRKKLIQECHDTLWVGHPRWQRTYALIKRG

A0A5D3C9P8 Reverse transcriptase7.6e-23857.59Show/hide
Query:  MGTLKFLSALQRKANPKEI-IKKWLMFVDAIINSRPSKSTLIDSGATHNFIADQEARRLGLTIEKDPGKMKAA------IVGVSKRAPFKLGTWTGEMDL
        MG LKFLS+LQ+K       +++ LM+VD  IN +P+KST++DSGATHNFI + EA+RL L  EKD G+MKA       I+G+ KR   +LG W+G +D 
Subjt:  MGTLKFLSALQRKANPKEI-IKKWLMFVDAIINSRPSKSTLIDSGATHNFIADQEARRLGLTIEKDPGKMKAA------IVGVSKRAPFKLGTWTGEMDL

Query:  VVVCMDDFDVVLGMDFLLEHEVIPMPLAKCLVIVDHNPTVIPASIKQPGNLRKISAIQLKRGLAQEEPTFMAIPLMEEMTTEGTVPSKIKEVLDSYADII
        VVV MDDFDVVLGM+FLLEH+VIPMPLAKCLVI    P+V+   ++QP  L+ ISA+QLK+GL+++EPTFMAIPL     +  TVP +I  VL+ Y D++
Subjt:  VVVCMDDFDVVLGMDFLLEHEVIPMPLAKCLVIVDHNPTVIPASIKQPGNLRKISAIQLKRGLAQEEPTFMAIPLMEEMTTEGTVPSKIKEVLDSYADII

Query:  PESLPQTLPPRRGIDHEIELLSGVKPPTKNAYRMAPPELAELRKQLDELLMAGFIPPTKEPYGAPVLFQKKKNGVLRLCIDYRALNKVTVRNKYPLPIIS
        P+SLP++LPPRR IDHEIEL+ G KPP KNAYRMAPPELAELRKQLDELL AGFI P K PYGAPVLFQKKK+G LRLCIDYRALNK+TVRNKYPLPII+
Subjt:  PESLPQTLPPRRGIDHEIELLSGVKPPTKNAYRMAPPELAELRKQLDELLMAGFIPPTKEPYGAPVLFQKKKNGVLRLCIDYRALNKVTVRNKYPLPIIS

Query:  NLFDQLHGAKYFTKLD----YDQ---------------------------GTTN----------------------------------------------
        +LFD+LHGAKYF+KLD    Y Q                           G TN                                              
Subjt:  NLFDQLHGAKYFTKLD----YDQ---------------------------GTTN----------------------------------------------

Query:  ------------------------------CGQIGIDSNKNKAIQEWKIPTSVSYVRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSNECQMVF
                                      CG+IG++  K  AI++W +P SVS +RSFLGLANYYR FVEGFS+ A+PLTELLKKD  W+W  ECQ  F
Subjt:  ------------------------------CGQIGIDSNKNKAIQEWKIPTSVSYVRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSNECQMVF

Query:  EDLKATMTKGLVFELVDVTKPFEIETDASDFALDGVLVQEGHPIAYESRKLNDAERRYTVSEKEMLDVAHCLRVWRQYLLGSQFIVKTDNSAICHFFDQP
        + LK  + +G +  + DVTKPFE+ETDASD+AL GVL+Q GHPIAYESRKLN AERRYTVSEKEML V HCLR WRQYLLGS F+VKTDNSA CHFF QP
Subjt:  EDLKATMTKGLVFELVDVTKPFEIETDASDFALDGVLVQEGHPIAYESRKLNDAERRYTVSEKEMLDVAHCLRVWRQYLLGSQFIVKTDNSAICHFFDQP

Query:  KLTAKQARWQESLAEFDFKFEHKAGKSNQATDVLSRKGEHAALCMIAHIHSSKIDGSMRDIIKEHLHKNPSVKAVVELAKAGKTQQFWIKGDLLMTKGNR
        KLT+KQARWQE LAEFDF+FEHK G SNQA D LSRK EHAA+C++AH+  S+I GS+RD ++E L K+ + + V+ LAKAGKT+QFW++ DLL+TKGNR
Subjt:  KLTAKQARWQESLAEFDFKFEHKAGKSNQATDVLSRKGEHAALCMIAHIHSSKIDGSMRDIIKEHLHKNPSVKAVVELAKAGKTQQFWIKGDLLMTKGNR

Query:  LYIPRTGELRKKLIQECHDTLWVGHPRWQRTYALIKRG
        LY+PR G LRKKL+ ECHDTLW GHP WQRTYAL+K+G
Subjt:  LYIPRTGELRKKLIQECHDTLWVGHPRWQRTYALIKRG

A0A6J1D906 Reverse transcriptase5.1e-25862.06Show/hide
Query:  MGTLKFLSALQRKAN-PKEIIKKWLMFVDAIINSRPSKSTLIDSGATHNFIADQEARRLGLTIEKDPGKMKAA------IVGVSKRAPFKLGTWTGEMDL
        M  LKFLSA+Q++ N PK   +K LMFVDA IN   +KST++DSGATHNFI++QEARRL LTIEKD GKMK        IVGVSKR   KLGTWTG +D 
Subjt:  MGTLKFLSALQRKAN-PKEIIKKWLMFVDAIINSRPSKSTLIDSGATHNFIADQEARRLGLTIEKDPGKMKAA------IVGVSKRAPFKLGTWTGEMDL

Query:  VVVCMDDFDVVLGMDFLLEHEVIPMPLAKCLVIVDHNPTVIPASIKQPGNLRKISAIQLKRGLAQEEPTFMAIPLMEEMTTEGTVPSKIKEVLDSYADII
        VVV MDDFDVVLGM+FL+EH+VIPMPLAKC+++  ++PTV+  SIKQPG +R ISA+QLK+GL +EEPTFMAIP++E+      VP +I+ V+  Y DI+
Subjt:  VVVCMDDFDVVLGMDFLLEHEVIPMPLAKCLVIVDHNPTVIPASIKQPGNLRKISAIQLKRGLAQEEPTFMAIPLMEEMTTEGTVPSKIKEVLDSYADII

Query:  PESLPQTLPPRRGIDHEIELLSGVKPPTKNAYRMAPPELAELRKQLDELLMAGFIPPTKEPYGAPVLFQKKKNGVLRLCIDYRALNKVTVRNKYPLPIIS
        P+SLP+TLPPRRGIDHEIEL+ G KPP KNAYRMAPPELAELRKQLDELL AGFI P K P+GAPVLFQKKK+G LRLCIDYRALNKVTVRNKYPLPII+
Subjt:  PESLPQTLPPRRGIDHEIELLSGVKPPTKNAYRMAPPELAELRKQLDELLMAGFIPPTKEPYGAPVLFQKKKNGVLRLCIDYRALNKVTVRNKYPLPIIS

Query:  NLFDQLHGAKYFTKLDY------------DQGTTNC----------------------------------------------------------------
        +LFDQLHGAKYFTKLD             D+  T C                                                                
Subjt:  NLFDQLHGAKYFTKLDY------------DQGTTNC----------------------------------------------------------------

Query:  -------------------------------GQIGIDSNKNKAIQEWKIPTSVSYVRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSNECQMVF
                                       GQI +D++K KAIQEW++PTSV  +RSFLGLANYYR F+EGFS+ ATP+TELLKK   W WS E Q  F
Subjt:  -------------------------------GQIGIDSNKNKAIQEWKIPTSVSYVRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSNECQMVF

Query:  EDLKATMTKGLVFELVDVTKPFEIETDASDFALDGVLVQEGHPIAYESRKLNDAERRYTVSEKEMLDVAHCLRVWRQYLLGSQFIVKTDNSAICHFFDQP
        EDLKA M KG V  L DVTKPFE+ETDASD+AL GVL+Q+ HPI YESRKLN+AERRYTVSEKEML V HCLR WRQYLLGS F+VKTDNSAICHFF+QP
Subjt:  EDLKATMTKGLVFELVDVTKPFEIETDASDFALDGVLVQEGHPIAYESRKLNDAERRYTVSEKEMLDVAHCLRVWRQYLLGSQFIVKTDNSAICHFFDQP

Query:  KLTAKQARWQESLAEFDFKFEHKAGKSNQATDVLSRKGEHAALCMIAHIHSSKIDGSMRDIIKEHLHKNPSVKAVVELAKAGKTQQFWIKGDLLMTKGNR
        KLT+KQARWQE LAEFDFKFEHKAGKSNQA D LSRKGEHA LCM+AHIH+SK DGS+RD+I E+L   PS + VVELAK  KT+QFW++GDLL T+GN 
Subjt:  KLTAKQARWQESLAEFDFKFEHKAGKSNQATDVLSRKGEHAALCMIAHIHSSKIDGSMRDIIKEHLHKNPSVKAVVELAKAGKTQQFWIKGDLLMTKGNR

Query:  LYIPRTGELRKKLIQECHDTLWVGHPRWQRTYALIKRG
        LY+PRTG LRKKL+ ECHDTLW GHP WQRTYAL+K+G
Subjt:  LYIPRTGELRKKLIQECHDTLWVGHPRWQRTYALIKRG

A0A6J1DLQ6 uncharacterized protein LOC1110223201.3e-24259.63Show/hide
Query:  MGTLKFLSALQRKANP-KEIIKKWLMFVDAIINSRPSKSTLIDSGATHNFIADQEARRLGLTIEKDPGKMKAA------IVGVSKRAPFKLGTWTGEMDL
        MG LKFLSALQ+KA   KE +++ LM+V+A +N + +KST++DSGATHNF+ + EARRL L  +KDPGKMKA       I+GV+KR   KLGTW+G +D 
Subjt:  MGTLKFLSALQRKANP-KEIIKKWLMFVDAIINSRPSKSTLIDSGATHNFIADQEARRLGLTIEKDPGKMKAA------IVGVSKRAPFKLGTWTGEMDL

Query:  VVVCMDDFDVVLGMDFLLEHEVIPMPLAKCLVIVDHNPTVIPASIKQPGNLRKISAIQLKRGLAQEEPTFMAIPLMEEMTTEGTVPSKIKEVLDSYADII
        V+V MDDFDVVLG+ FLLEH+VIPMPLAKCLV+   +P V+  SIKQP  ++ ISA+QLK+G+AQ+EPTFMAIP+ E   +E  VP +I+ VL++YAD++
Subjt:  VVVCMDDFDVVLGMDFLLEHEVIPMPLAKCLVIVDHNPTVIPASIKQPGNLRKISAIQLKRGLAQEEPTFMAIPLMEEMTTEGTVPSKIKEVLDSYADII

Query:  PESLPQTLPPRRGIDHEIELLSGVKPPTKNAYRMAPPELAELRKQLDELLMAGFIPPTKEPYGAPVLFQKKKNGVLRLCIDYRALNKVTVRNKYPLPIIS
        P++LP+ LPPR GIDHEIELL G KPP KNAYRMAPPELAELRKQLDELL AGFI P K PYGAPVLFQKKK+  LRLCIDYRALNK+ VRNK PLPII+
Subjt:  PESLPQTLPPRRGIDHEIELLSGVKPPTKNAYRMAPPELAELRKQLDELLMAGFIPPTKEPYGAPVLFQKKKNGVLRLCIDYRALNKVTVRNKYPLPIIS

Query:  NLFDQLHGAKYFTKLDYDQG-----------------------------------------------------------------TTNCGQIGIDSNKNK
        +LFDQLHGAKYF+KLD   G                                                                    CG+IG++  K K
Subjt:  NLFDQLHGAKYFTKLDYDQG-----------------------------------------------------------------TTNCGQIGIDSNKNK

Query:  AIQEWKIPTSVSYVRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSNECQMVFEDLKATMTKGLVFELVDVTKPFEIETDASDFALDGVLVQEGH
        AIQEWKIPTS++ +RSFLGLANYYR FVEGFS+   PLT+LLKK+  W+W+ EC   FE LK  M +G V  + DVT+PFE+ETDASDFAL GVL+Q+GH
Subjt:  AIQEWKIPTSVSYVRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSNECQMVFEDLKATMTKGLVFELVDVTKPFEIETDASDFALDGVLVQEGH

Query:  PIAYESRKLNDAERRYTVSEKEMLDVAHCLRVWRQYLLGSQFIVKTDNSAICHFFDQPKLTAKQARWQESLAEFDFKFEHKAGKSNQATDVLSRKGEHAA
        PIAYES+KLNDAERRY  SEKEML V HCLR WRQYLLG++F+VKTDNS++CHFF+QPKL++KQARWQE LAEFDF+FEHK G++NQA D LSRK E AA
Subjt:  PIAYESRKLNDAERRYTVSEKEMLDVAHCLRVWRQYLLGSQFIVKTDNSAICHFFDQPKLTAKQARWQESLAEFDFKFEHKAGKSNQATDVLSRKGEHAA

Query:  LCMIAHIHSSKIDGSMRDIIKEHLHKNPSVKAVVELAKAGKTQQFWIKGDLLMTKGNRLYIPRTGELRKKLIQECHDTLWVGHPRWQRTYALIKRG
        LCM+AH+ +SK+ GS+R+ I+E+L  +P+ +A+++LA  G T+QF ++ DL  TKGN LY+PR+G LRK L+ ECHDT+W GH  WQRTYAL+K+G
Subjt:  LCMIAHIHSSKIDGSMRDIIKEHLHKNPSVKAVVELAKAGKTQQFWIKGDLLMTKGNRLYIPRTGELRKKLIQECHDTLWVGHPRWQRTYALIKRG

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.64.5e-3834.6Show/hide
Query:  PYG---APVLFQKKKNGVLRLCIDYRALNKV------TVRNKYPLPIISNLFDQLHGAKYFTKLD----YDQGTTNCGQI----GIDSNKNK--AIQEWK
        P+G   AP  FQ+  N +LR  ++   L  +      +      L  +  +F++L  A    +LD      Q TT  G +    GI  N  K  AIQ++ 
Subjt:  PYG---APVLFQKKKNGVLRLCIDYRALNKV------TVRNKYPLPIISNLFDQLHGAKYFTKLD----YDQGTTNCGQI----GIDSNKNK--AIQEWK

Query:  IPTSVSYVRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSN-ECQMVFEDLKATMTKGLVFELVDVTKPFEIETDASDFALDGVLVQEGHPIAYE
        IPT    +++FLGL  YYR F+  F+  A P+T+ LKK+     +N E    F+ LK  +++  + ++ D TK F + TDASD AL  VL Q+GHP++Y 
Subjt:  IPTSVSYVRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSN-ECQMVFEDLKATMTKGLVFELVDVTKPFEIETDASDFALDGVLVQEGHPIAYE

Query:  SRKLNDAERRYTVSEKEMLDVAHCLRVWRQYLLGSQFIVKTDNSAICHFFDQPKLTAKQARWQESLAEFDFKFEHKAGKSNQATDVLSR
        SR LN+ E  Y+  EKE+L +    + +R YLLG  F + +D+  +   +      +K  RW+  L+EFDF  ++  GK N   D LSR
Subjt:  SRKLNDAERRYTVSEKEMLDVAHCLRVWRQYLLGSQFIVKTDNSAICHFFDQPKLTAKQARWQESLAEFDFKFEHKAGKSNQATDVLSR

P0CT42 Transposon Tf2-7 polyprotein4.1e-3924.07Show/hide
Query:  LAQEEPTFMAIPLMEEMTTEGTVPSKIKEVLDSYADIIPESLPQTLPPRRGIDHEIELL-SGVKPPTKNAYRMAPPELAELRKQLDELLMAGFIPPTKEP
        ++  + T   +  +  +  E  +P   KE  D  A+   E LP+   P +G++ E+EL     + P +N Y + P ++  +  ++++ L +G I  +K  
Subjt:  LAQEEPTFMAIPLMEEMTTEGTVPSKIKEVLDSYADIIPESLPQTLPPRRGIDHEIELL-SGVKPPTKNAYRMAPPELAELRKQLDELLMAGFIPPTKEP

Query:  YGAPVLFQKKKNGVLRLCIDYRALNKVTVRNKYPLPIISNLFDQLHGAKYFTKLDY--------------------------------------------
           PV+F  KK G LR+ +DY+ LNK    N YPLP+I  L  ++ G+  FTKLD                                             
Subjt:  YGAPVLFQKKKNGVLRLCIDYRALNKVTVRNKYPLPIISNLFDQLHGAKYFTKLDY--------------------------------------------

Query:  ---------------------------------------------------------------------DQGTTNCGQIGIDSNKNKAIQEWKIPTSVSY
                                                                             ++G T C       N +K +Q WK P +   
Subjt:  ---------------------------------------------------------------------DQGTTNCGQIGIDSNKNKAIQEWKIPTSVSY

Query:  VRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSNECQMVFEDLKATMTKGLVFELVDVTKPFEIETDASDFALDGVLVQEG-----HPIAYESRK
        +R FLG  NY R F+   SQ   PL  LLKKD  W W+       E++K  +    V    D +K   +ETDASD A+  VL Q+      +P+ Y S K
Subjt:  VRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSNECQMVFEDLKATMTKGLVFELVDVTKPFEIETDASDFALDGVLVQEG-----HPIAYESRK

Query:  LNDAERRYTVSEKEMLDVAHCLRVWRQYLLGS--QFIVKTDN-SAICHFFDQPKLTAKQ-ARWQESLAEFDFKFEHKAGKSNQATDVLSR----------
        ++ A+  Y+VS+KEML +   L+ WR YL  +   F + TD+ + I    ++ +   K+ ARWQ  L +F+F+  ++ G +N   D LSR          
Subjt:  LNDAERRYTVSEKEMLDVAHCLRVWRQYLLGS--QFIVKTDN-SAICHFFDQPKLTAKQ-ARWQESLAEFDFKFEHKAGKSNQATDVLSR----------

Query:  KGEHAALCMIAHIHSSKIDGSMRDIIKEHLHKNPSVKAVVELAKAGKTQQFWIKGDLLMTKGNRLYIPRTGELRKKLIQECHDTLWVGHP
          E  ++  +  I  S  D     ++ E+ +    +  +    K  + +   +K  LL+   +++ +P   +L + +I++ H+   + HP
Subjt:  KGEHAALCMIAHIHSSKIDGSMRDIIKEHLHKNPSVKAVVELAKAGKTQQFWIKGDLLMTKGNRLYIPRTGELRKKLIQECHDTLWVGHP

P0CT43 Transposon Tf2-8 polyprotein4.1e-3924.07Show/hide
Query:  LAQEEPTFMAIPLMEEMTTEGTVPSKIKEVLDSYADIIPESLPQTLPPRRGIDHEIELL-SGVKPPTKNAYRMAPPELAELRKQLDELLMAGFIPPTKEP
        ++  + T   +  +  +  E  +P   KE  D  A+   E LP+   P +G++ E+EL     + P +N Y + P ++  +  ++++ L +G I  +K  
Subjt:  LAQEEPTFMAIPLMEEMTTEGTVPSKIKEVLDSYADIIPESLPQTLPPRRGIDHEIELL-SGVKPPTKNAYRMAPPELAELRKQLDELLMAGFIPPTKEP

Query:  YGAPVLFQKKKNGVLRLCIDYRALNKVTVRNKYPLPIISNLFDQLHGAKYFTKLDY--------------------------------------------
           PV+F  KK G LR+ +DY+ LNK    N YPLP+I  L  ++ G+  FTKLD                                             
Subjt:  YGAPVLFQKKKNGVLRLCIDYRALNKVTVRNKYPLPIISNLFDQLHGAKYFTKLDY--------------------------------------------

Query:  ---------------------------------------------------------------------DQGTTNCGQIGIDSNKNKAIQEWKIPTSVSY
                                                                             ++G T C       N +K +Q WK P +   
Subjt:  ---------------------------------------------------------------------DQGTTNCGQIGIDSNKNKAIQEWKIPTSVSY

Query:  VRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSNECQMVFEDLKATMTKGLVFELVDVTKPFEIETDASDFALDGVLVQEG-----HPIAYESRK
        +R FLG  NY R F+   SQ   PL  LLKKD  W W+       E++K  +    V    D +K   +ETDASD A+  VL Q+      +P+ Y S K
Subjt:  VRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSNECQMVFEDLKATMTKGLVFELVDVTKPFEIETDASDFALDGVLVQEG-----HPIAYESRK

Query:  LNDAERRYTVSEKEMLDVAHCLRVWRQYLLGS--QFIVKTDN-SAICHFFDQPKLTAKQ-ARWQESLAEFDFKFEHKAGKSNQATDVLSR----------
        ++ A+  Y+VS+KEML +   L+ WR YL  +   F + TD+ + I    ++ +   K+ ARWQ  L +F+F+  ++ G +N   D LSR          
Subjt:  LNDAERRYTVSEKEMLDVAHCLRVWRQYLLGS--QFIVKTDN-SAICHFFDQPKLTAKQ-ARWQESLAEFDFKFEHKAGKSNQATDVLSR----------

Query:  KGEHAALCMIAHIHSSKIDGSMRDIIKEHLHKNPSVKAVVELAKAGKTQQFWIKGDLLMTKGNRLYIPRTGELRKKLIQECHDTLWVGHP
          E  ++  +  I  S  D     ++ E+ +    +  +    K  + +   +K  LL+   +++ +P   +L + +I++ H+   + HP
Subjt:  KGEHAALCMIAHIHSSKIDGSMRDIIKEHLHKNPSVKAVVELAKAGKTQQFWIKGDLLMTKGNRLYIPRTGELRKKLIQECHDTLWVGHP

P20825 Retrovirus-related Pol polyprotein from transposon 2971.0e-3727.58Show/hide
Query:  PPTKNAYRMAPPELAELRKQLDELLMAGFIPPTKEPYGAPVLFQKKKNGV-----LRLCIDYRALNKVTVRNKYPLPIISNLFDQLHGAKYFTKLDYDQG
        P     Y +A     E+  Q+ E+L  G I  +  PY +P     KK         R+ IDYR LN++T+ ++YP+P +  +  +L   +YFT +D  +G
Subjt:  PPTKNAYRMAPPELAELRKQLDELLMAGFIPPTKEPYGAPVLFQKKKNGV-----LRLCIDYRALNKVTVRNKYPLPIISNLFDQLHGAKYFTKLDYDQG

Query:  ----------------TTNCGQI-----------------------------------------------------------------------------
                        +T  G                                                                               
Subjt:  ----------------TTNCGQI-----------------------------------------------------------------------------

Query:  ------------GIDSN--KNKAIQEWKIPTSVSYVRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSN-ECQMVFEDLKATMTKGLVFELVDVT
                    GI  N  K KAI  + IPT    +R+FLGL  YYR F+  ++  A P+T  LKK         E    FE LKA + +  + +L D  
Subjt:  ------------GIDSN--KNKAIQEWKIPTSVSYVRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSN-ECQMVFEDLKATMTKGLVFELVDVT

Query:  KPFEIETDASDFALDGVLVQEGHPIAYESRKLNDAERRYTVSEKEMLDVAHCLRVWRQYLLGSQFIVKTDNSAICHFFDQPKLTAKQARWQESLAEFDFK
        K F + TDAS+ AL  VL Q GHPI++ SR LND E  Y+  EKE+L +    + +R YLLG QF++ +D+  +    +  +  AK  RW+  L+E+ FK
Subjt:  KPFEIETDASDFALDGVLVQEGHPIAYESRKLNDAERRYTVSEKEMLDVAHCLRVWRQYLLGSQFIVKTDNSAICHFFDQPKLTAKQARWQESLAEFDFK

Query:  FEHKAGKSNQATDVLSR
         ++  GK N   D LSR
Subjt:  FEHKAGKSNQATDVLSR

Q9UR07 Transposon Tf2-11 polyprotein4.1e-3924.07Show/hide
Query:  LAQEEPTFMAIPLMEEMTTEGTVPSKIKEVLDSYADIIPESLPQTLPPRRGIDHEIELL-SGVKPPTKNAYRMAPPELAELRKQLDELLMAGFIPPTKEP
        ++  + T   +  +  +  E  +P   KE  D  A+   E LP+   P +G++ E+EL     + P +N Y + P ++  +  ++++ L +G I  +K  
Subjt:  LAQEEPTFMAIPLMEEMTTEGTVPSKIKEVLDSYADIIPESLPQTLPPRRGIDHEIELL-SGVKPPTKNAYRMAPPELAELRKQLDELLMAGFIPPTKEP

Query:  YGAPVLFQKKKNGVLRLCIDYRALNKVTVRNKYPLPIISNLFDQLHGAKYFTKLDY--------------------------------------------
           PV+F  KK G LR+ +DY+ LNK    N YPLP+I  L  ++ G+  FTKLD                                             
Subjt:  YGAPVLFQKKKNGVLRLCIDYRALNKVTVRNKYPLPIISNLFDQLHGAKYFTKLDY--------------------------------------------

Query:  ---------------------------------------------------------------------DQGTTNCGQIGIDSNKNKAIQEWKIPTSVSY
                                                                             ++G T C       N +K +Q WK P +   
Subjt:  ---------------------------------------------------------------------DQGTTNCGQIGIDSNKNKAIQEWKIPTSVSY

Query:  VRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSNECQMVFEDLKATMTKGLVFELVDVTKPFEIETDASDFALDGVLVQEG-----HPIAYESRK
        +R FLG  NY R F+   SQ   PL  LLKKD  W W+       E++K  +    V    D +K   +ETDASD A+  VL Q+      +P+ Y S K
Subjt:  VRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSNECQMVFEDLKATMTKGLVFELVDVTKPFEIETDASDFALDGVLVQEG-----HPIAYESRK

Query:  LNDAERRYTVSEKEMLDVAHCLRVWRQYLLGS--QFIVKTDN-SAICHFFDQPKLTAKQ-ARWQESLAEFDFKFEHKAGKSNQATDVLSR----------
        ++ A+  Y+VS+KEML +   L+ WR YL  +   F + TD+ + I    ++ +   K+ ARWQ  L +F+F+  ++ G +N   D LSR          
Subjt:  LNDAERRYTVSEKEMLDVAHCLRVWRQYLLGS--QFIVKTDN-SAICHFFDQPKLTAKQ-ARWQESLAEFDFKFEHKAGKSNQATDVLSR----------

Query:  KGEHAALCMIAHIHSSKIDGSMRDIIKEHLHKNPSVKAVVELAKAGKTQQFWIKGDLLMTKGNRLYIPRTGELRKKLIQECHDTLWVGHP
          E  ++  +  I  S  D     ++ E+ +    +  +    K  + +   +K  LL+   +++ +P   +L + +I++ H+   + HP
Subjt:  KGEHAALCMIAHIHSSKIDGSMRDIIKEHLHKNPSVKAVVELAKAGKTQQFWIKGDLLMTKGNRLYIPRTGELRKKLIQECHDTLWVGHP

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.2e-0937.08Show/hide
Query:  IGIDSNKNKAIQEWKIPTSVSYVRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSNECQMVFEDLKATMTKGLVFELVDVTKPF
        +  D  K +A+  W  P + + +R FLGL  YYR FV+ + +   PLTELLKK +   W+    + F+ LK  +T   V  L D+  PF
Subjt:  IGIDSNKNKAIQEWKIPTSVSYVRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSNECQMVFEDLKATMTKGLVFELVDVTKPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTACGCTTAAATTCTTGTCAGCCCTCCAACGAAAGGCCAACCCAAAGGAGATAATAAAGAAATGGCTCATGTTCGTAGATGCAATAATAAACTCCCGACCA
AGCAAGAGCACCTTGATAGACTCAGGTGCGACCCACAACTTCATCGCCGACCAAGAAGCACGAAGATTGGGACTCACCATAGAAAAGGACCCTGGAAAAATGAAA
GCTGCTATTGTGGGAGTTTCTAAAAGAGCCCCCTTCAAATTAGGGACCTGGACAGGAGAAATGGACCTTGTCGTGGTCTGCATGGATGACTTCGACGTGGTACTT
GGGATGGACTTCCTCCTAGAACACGAAGTTATCCCAATGCCATTGGCGAAATGCTTGGTGATTGTCGACCACAACCCCACAGTGATACCTGCAAGCATCAAGCAA
CCAGGTAATCTTAGAAAGATCTCGGCCATACAATTGAAAAGAGGACTCGCACAAGAGGAACCTACATTTATGGCTATACCCCTAATGGAAGAAATGACCACTGAG
GGAACCGTTCCAAGCAAAATCAAGGAAGTACTAGACAGTTATGCTGACATAATACCAGAGAGCTTACCACAAACACTACCACCCCGTCGAGGCATTGATCACGAA
ATTGAACTCCTCTCCGGGGTTAAACCCCCAACAAAGAATGCATACCGAATGGCTCCCCCTGAGCTAGCCGAATTGAGGAAACAACTAGATGAGTTGTTGATGGCA
GGATTCATCCCCCCGACGAAGGAACCTTACGGAGCCCCCGTACTATTTCAGAAAAAGAAGAATGGGGTGTTGCGTCTTTGCATAGACTATAGAGCCTTAAACAAG
GTAACGGTACGCAACAAATACCCACTGCCGATAATATCCAACTTGTTTGACCAGCTTCACGGAGCCAAATACTTCACGAAGTTGGACTACGATCAAGGTACTACC
AATTGTGGACAGATCGGTATAGATAGCAATAAGAATAAAGCTATCCAAGAGTGGAAAATCCCTACTTCAGTATCCTATGTGCGATCCTTCTTAGGATTGGCAAAC
TACTATAGGTGGTTTGTCGAAGGGTTTTCACAAGGAGCCACCCCATTAACAGAACTCCTAAAGAAAGACCACCCATGGTCGTGGTCAAATGAATGTCAAATGGTC
TTTGAAGATCTGAAGGCAACCATGACGAAGGGTCTTGTCTTCGAATTAGTAGATGTTACTAAACCATTTGAAATAGAGACGGATGCTTCCGACTTTGCCCTAGAT
GGTGTCCTTGTTCAAGAAGGCCACCCCATCGCTTATGAAAGTCGGAAGCTCAATGATGCTGAAAGGAGGTACACTGTCTCCGAAAAAGAAATGTTGGATGTGGCT
CATTGCCTACGAGTCTGGAGACAGTACCTCTTGGGATCACAGTTCATAGTGAAGACGGATAACAGCGCCATTTGCCATTTCTTTGATCAACCAAAATTGACGGCA
AAACAAGCCCGATGGCAGGAGTCATTGGCTGAATTCGACTTCAAATTCGAACACAAGGCAGGAAAGAGCAACCAAGCAACTGACGTGCTGAGTCGGAAGGGCGAA
CATGCAGCCCTGTGCATGATAGCCCATATTCACTCAAGTAAGATCGATGGATCGATGCGCGACATCATCAAAGAACATTTACATAAAAACCCATCTGTCAAAGCC
GTCGTTGAACTAGCCAAAGCTGGGAAGACACAACAGTTTTGGATCAAGGGAGACCTTCTAATGACAAAGGGAAACAGATTATACATTCCAAGAACAGGAGAACTG
AGGAAGAAGCTCATTCAGGAATGTCATGATACCTTATGGGTCGGACACCCTAGGTGGCAACGAACATACGCCTTAATAAAAAGGGGTACTTCTGGCCAAATATGC
GAGACGACATCATGTAGTACACCAAGACATGCCTCATTTGCCAACAGGACAAAGTCGAGAAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTACGCTTAAATTCTTGTCAGCCCTCCAACGAAAGGCCAACCCAAAGGAGATAATAAAGAAATGGCTCATGTTCGTAGATGCAATAATAAACTCCCGACCA
AGCAAGAGCACCTTGATAGACTCAGGTGCGACCCACAACTTCATCGCCGACCAAGAAGCACGAAGATTGGGACTCACCATAGAAAAGGACCCTGGAAAAATGAAA
GCTGCTATTGTGGGAGTTTCTAAAAGAGCCCCCTTCAAATTAGGGACCTGGACAGGAGAAATGGACCTTGTCGTGGTCTGCATGGATGACTTCGACGTGGTACTT
GGGATGGACTTCCTCCTAGAACACGAAGTTATCCCAATGCCATTGGCGAAATGCTTGGTGATTGTCGACCACAACCCCACAGTGATACCTGCAAGCATCAAGCAA
CCAGGTAATCTTAGAAAGATCTCGGCCATACAATTGAAAAGAGGACTCGCACAAGAGGAACCTACATTTATGGCTATACCCCTAATGGAAGAAATGACCACTGAG
GGAACCGTTCCAAGCAAAATCAAGGAAGTACTAGACAGTTATGCTGACATAATACCAGAGAGCTTACCACAAACACTACCACCCCGTCGAGGCATTGATCACGAA
ATTGAACTCCTCTCCGGGGTTAAACCCCCAACAAAGAATGCATACCGAATGGCTCCCCCTGAGCTAGCCGAATTGAGGAAACAACTAGATGAGTTGTTGATGGCA
GGATTCATCCCCCCGACGAAGGAACCTTACGGAGCCCCCGTACTATTTCAGAAAAAGAAGAATGGGGTGTTGCGTCTTTGCATAGACTATAGAGCCTTAAACAAG
GTAACGGTACGCAACAAATACCCACTGCCGATAATATCCAACTTGTTTGACCAGCTTCACGGAGCCAAATACTTCACGAAGTTGGACTACGATCAAGGTACTACC
AATTGTGGACAGATCGGTATAGATAGCAATAAGAATAAAGCTATCCAAGAGTGGAAAATCCCTACTTCAGTATCCTATGTGCGATCCTTCTTAGGATTGGCAAAC
TACTATAGGTGGTTTGTCGAAGGGTTTTCACAAGGAGCCACCCCATTAACAGAACTCCTAAAGAAAGACCACCCATGGTCGTGGTCAAATGAATGTCAAATGGTC
TTTGAAGATCTGAAGGCAACCATGACGAAGGGTCTTGTCTTCGAATTAGTAGATGTTACTAAACCATTTGAAATAGAGACGGATGCTTCCGACTTTGCCCTAGAT
GGTGTCCTTGTTCAAGAAGGCCACCCCATCGCTTATGAAAGTCGGAAGCTCAATGATGCTGAAAGGAGGTACACTGTCTCCGAAAAAGAAATGTTGGATGTGGCT
CATTGCCTACGAGTCTGGAGACAGTACCTCTTGGGATCACAGTTCATAGTGAAGACGGATAACAGCGCCATTTGCCATTTCTTTGATCAACCAAAATTGACGGCA
AAACAAGCCCGATGGCAGGAGTCATTGGCTGAATTCGACTTCAAATTCGAACACAAGGCAGGAAAGAGCAACCAAGCAACTGACGTGCTGAGTCGGAAGGGCGAA
CATGCAGCCCTGTGCATGATAGCCCATATTCACTCAAGTAAGATCGATGGATCGATGCGCGACATCATCAAAGAACATTTACATAAAAACCCATCTGTCAAAGCC
GTCGTTGAACTAGCCAAAGCTGGGAAGACACAACAGTTTTGGATCAAGGGAGACCTTCTAATGACAAAGGGAAACAGATTATACATTCCAAGAACAGGAGAACTG
AGGAAGAAGCTCATTCAGGAATGTCATGATACCTTATGGGTCGGACACCCTAGGTGGCAACGAACATACGCCTTAATAAAAAGGGGTACTTCTGGCCAAATATGC
GAGACGACATCATGTAGTACACCAAGACATGCCTCATTTGCCAACAGGACAAAGTCGAGAAAGTAA
Protein sequenceShow/hide protein sequence
MGTLKFLSALQRKANPKEIIKKWLMFVDAIINSRPSKSTLIDSGATHNFIADQEARRLGLTIEKDPGKMKAAIVGVSKRAPFKLGTWTGEMDLVVVCMDDFDVVL
GMDFLLEHEVIPMPLAKCLVIVDHNPTVIPASIKQPGNLRKISAIQLKRGLAQEEPTFMAIPLMEEMTTEGTVPSKIKEVLDSYADIIPESLPQTLPPRRGIDHE
IELLSGVKPPTKNAYRMAPPELAELRKQLDELLMAGFIPPTKEPYGAPVLFQKKKNGVLRLCIDYRALNKVTVRNKYPLPIISNLFDQLHGAKYFTKLDYDQGTT
NCGQIGIDSNKNKAIQEWKIPTSVSYVRSFLGLANYYRWFVEGFSQGATPLTELLKKDHPWSWSNECQMVFEDLKATMTKGLVFELVDVTKPFEIETDASDFALD
GVLVQEGHPIAYESRKLNDAERRYTVSEKEMLDVAHCLRVWRQYLLGSQFIVKTDNSAICHFFDQPKLTAKQARWQESLAEFDFKFEHKAGKSNQATDVLSRKGE
HAALCMIAHIHSSKIDGSMRDIIKEHLHKNPSVKAVVELAKAGKTQQFWIKGDLLMTKGNRLYIPRTGELRKKLIQECHDTLWVGHPRWQRTYALIKRGTSGQIC
ETTSCSTPRHASFANRTKSRK