; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G21040 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G21040
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr3:17113097..17115553
RNA-Seq ExpressionCSPI03G21040
SyntenyCSPI03G21040
Gene Ontology termsNA
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBG96384.1 ADP glucose pyrophosphorylase large subunit 1 [Prunus dulcis]6.5e-19247.19Show/hide
Query:  RDPFPKRSIWRARQRLEFIHADICGPTTATSNSNK------------------------------------------------------------SDFCK
        RDP PK+S WRA Q+LE IHADICGP T TSNSNK                                                            +DFCK
Subjt:  RDPFPKRSIWRARQRLEFIHADICGPTTATSNSNK------------------------------------------------------------SDFCK

Query:  QSGIKRQLMAAYTPQQNG-----------------------------AVNWTIHVLNKCPTLAVQDVTPEEAWSGIKPSVDHFRIFGCIAHARVSDGRRT
        QSGIKRQL  AYTPQQNG                             AVNWTI+VLN+CPTLAV+DVTPEE WSG+KPSVDHFR+FGCIAH  V + RRT
Subjt:  QSGIKRQLMAAYTPQQNG-----------------------------AVNWTIHVLNKCPTLAVQDVTPEEAWSGIKPSVDHFRIFGCIAHARVSDGRRT

Query:  KLDDKSISCVLLG------------------------------------------VIHLDLSDGDGANEEEDINASANEIDNDEDI---RDEEVTEGR-G
        KLD++SI+CVLLG                                          V+ L+  DGDG N EE ++ + N  + D ++   RD EV E   G
Subjt:  KLDDKSISCVLLG------------------------------------------VIHLDLSDGDGANEEEDINASANEIDNDEDI---RDEEVTEGR-G

Query:  FSEGEQSVRDVRQSRESHPPTWMGEYVSGEGV-----------STDPICFEDAVHNESWRLAMDNEIKSIERNKTWTLIELP------------------
         SEGE+ V ++RQ RE  PPTWMG+YVSGEG+           STDP+ FE+AV + +WRLAM++EIKSIE+N+TWTL ELP                  
Subjt:  FSEGEQSVRDVRQSRESHPPTWMGEYVSGEGV-----------STDPICFEDAVHNESWRLAMDNEIKSIERNKTWTLIELP------------------

Query:  -------------------------------------------TAQKDWKIFQLDVKSAFLQGELNEDVYVEQPK-------------------------
                                                    AQK+W IFQLDVKSAFL GEL+EDVYVEQP+                         
Subjt:  -------------------------------------------TAQKDWKIFQLDVKSAFLQGELNEDVYVEQPK-------------------------

Query:  ----DRIEAHFLNEGFQRCNSEQTLFTKGSMEGKIIIVTVYVDDLTFTGDGEAMLIEFKNSMIREFDVL-LGKNEIF-----------------------
             R+EAHF+ EGF+RC+SEQTLFTK S EGKI+IV+VYVDDL FTG+ E M+ EFK+SM+REFD+  LGK   F                       
Subjt:  ----DRIEAHFLNEGFQRCNSEQTLFTKGSMEGKIIIVTVYVDDLTFTGDGEAMLIEFKNSMIREFDVL-LGKNEIF-----------------------

Query:  -------SCNPVNSPMVPGFKLNKNGNGVTVDDTHHKQLI---------------------------------HGERVLRYLKEIVNYGIHYRKGGKGQL
                 N V SP+VPGFK++K+ NG +VD+T++KQL+                                   +R LRYLK  VNYGIHY+KGG G+L
Subjt:  -------SCNPVNSPMVPGFKLNKNGNGVTVDDTHHKQLI---------------------------------HGERVLRYLKEIVNYGIHYRKGGKGQL

Query:  LAYTDSDYAGDMEDRKSTSGYVFLMSSGA--------PIVTLSTTEAEFVAAAVCACQGVWMKRILKEMGFCDEGCTTILCDNSSTIKLSKNPIMHGRSK
        LA+TDSDYAGDMEDRKSTSGYVFLMSSGA        PIVTLSTTEAEFVAAAVCACQGVWMKRILKE+G  D  CTT++CDNSSTIKLSKNP+MHGRSK
Subjt:  LAYTDSDYAGDMEDRKSTSGYVFLMSSGA--------PIVTLSTTEAEFVAAAVCACQGVWMKRILKEMGFCDEGCTTILCDNSSTIKLSKNPIMHGRSK

Query:  HIDVRFHFLRNLTKDGVVELIHCGSQEQVADIMTKPLKLEVFQRPRRLLGVHEI
        HIDVRFHFLRNLTK+G +EL+HCGSQ+QVADIMTKPLKLEVFQ+ R+LLGV EI
Subjt:  HIDVRFHFLRNLTKDGVVELIHCGSQEQVADIMTKPLKLEVFQRPRRLLGVHEI

BBH05194.1 transposable element gene [Prunus dulcis]8.9e-16545.14Show/hide
Query:  RDPFPKRSIWRARQRLEFIHADICGPTTATSNSNKS--------------------------------------------------DFCKQSGIKRQLMA
        RD  PK S WRA Q LE IHADICGP +  SNS KS                                                  DFCKQSGIKRQL  
Subjt:  RDPFPKRSIWRARQRLEFIHADICGPTTATSNSNKS--------------------------------------------------DFCKQSGIKRQLMA

Query:  AYTPQQNG-----------------------------AVNWTIHVLNKCPTLAVQDVTPEEAWSGIKPSVDHFRIFGCIAHARVSDGRRTKLDDKSISCV
        AYTPQQNG                             AVNWT +VLN+CPTL V++VTP+EAWSG+KPSV+HFR+FGC+AH  + D RR           
Subjt:  AYTPQQNG-----------------------------AVNWTIHVLNKCPTLAVQDVTPEEAWSGIKPSVDHFRIFGCIAHARVSDGRRTKLDDKSISCV

Query:  LLGVIHLDLSDGDGANEEEDINASANEIDNDEDIRDEEVTEGRGFSEGEQSVRDVRQSRESHPPTWMGEYVSGEGVS------------TDPICFEDAVH
              L+  + +  NE E+  +   + D + D       E RG   G  +  D  + R   PP ++ +Y+SGEG+S             DP  FE+AV 
Subjt:  LLGVIHLDLSDGDGANEEEDINASANEIDNDEDIRDEEVTEGRGFSEGEQSVRDVRQSRESHPPTWMGEYVSGEGVS------------TDPICFEDAVH

Query:  NESWRLAMDNEIKSIERNKTWTLIELP-------------------------------------------------------------TAQKDWKIFQLD
        N  WR AMD+EIKSIE+NKTWTL ELP                                                              AQ  WKIFQLD
Subjt:  NESWRLAMDNEIKSIERNKTWTLIELP-------------------------------------------------------------TAQKDWKIFQLD

Query:  VKSAFLQGELNEDVYVEQPK-----------------------------DRIEAHFLNEGFQRCNSEQTLFTKGSMEGKIIIVTVYVDDLTFTGDGEAML
        VKSAFL GEL+E+VYVEQP+                              RIEAHF+NEGFQRC+SEQTLFTK + EGKIIIV++YVDDL FTGD E M+
Subjt:  VKSAFLQGELNEDVYVEQPK-----------------------------DRIEAHFLNEGFQRCNSEQTLFTKGSMEGKIIIVTVYVDDLTFTGDGEAML

Query:  IEFKNSMIREFDVL-LGKNEIFSCNPVNSPMVPGFKLNKNGNGVTVDDTHHKQLI---------------------------------HGERVLRYLKEI
         EFK+SM+REFD+  LG    F    +   ++     +K+ +G+TVD+T+ KQL+                                   +R LRYLK  
Subjt:  IEFKNSMIREFDVL-LGKNEIFSCNPVNSPMVPGFKLNKNGNGVTVDDTHHKQLI---------------------------------HGERVLRYLKEI

Query:  VNYGIHYRKGGKGQLLAYTDSDYAGDMEDRKSTSGYVFLMSSGA--------PIVTLSTTEAEFVAAAVCACQGVWMKRILKEMGFCDEGCTTILCDNSS
        VNYGIHY++GG G+LLA+TDSDYAGDMEDRKSTSGYVFL+SSGA        PIVTLSTTEAEFVAAAVCACQ +WMKR+LKE+G  DE CT I CDNSS
Subjt:  VNYGIHYRKGGKGQLLAYTDSDYAGDMEDRKSTSGYVFLMSSGA--------PIVTLSTTEAEFVAAAVCACQGVWMKRILKEMGFCDEGCTTILCDNSS

Query:  TIKLSKNPIMHGRSKHIDVRFHFLRNLTKDGVVELIHCGSQEQVADIMTKPLKLEVFQRPRRLLGVHEILD
        TIKLSKNP+MHGRSKHIDVR+HFLRNLTK+G + L+HCGS +QVAD+MTKPLK++ FQ+ R LLGV EI D
Subjt:  TIKLSKNPIMHGRSKHIDVRFHFLRNLTKDGVVELIHCGSQEQVADIMTKPLKLEVFQRPRRLLGVHEILD

PNX96091.1 retrotransposon-related protein [Trifolium pratense]5.4e-17845.09Show/hide
Query:  RDPFPKRSIWRARQRLEFIHADICGPTTATSNSNK------------------------------------------------------------SDFCK
        RDP PK+S WRA Q+LE IHAD+CGP + +SNSNK                                                            +DFCK
Subjt:  RDPFPKRSIWRARQRLEFIHADICGPTTATSNSNK------------------------------------------------------------SDFCK

Query:  QSGIKRQLMAAYTPQQNG-----------------------------AVNWTIHVLNKCPTLAVQDVTPEEAWSGIKPSVDHFRIFGCIAHARVSDGRRT
        QSG+KRQL  AYTPQQNG                             AVNWTI+VLN+CPTLAV+DVTPEEAWSG+KPSV+HFR+FGCIAH  V + +RT
Subjt:  QSGIKRQLMAAYTPQQNG-----------------------------AVNWTIHVLNKCPTLAVQDVTPEEAWSGIKPSVDHFRIFGCIAHARVSDGRRT

Query:  KLDDKSISCVLLGVIH-------------------------------------LDLSDGDGANE----EEDINASANEIDNDEDIRDEEVT---EGRGFS
        KLD +SI+CVLLGV                                        DL   DG NE    E D     +E DN  ++  E      E    +
Subjt:  KLDDKSISCVLLGVIH-------------------------------------LDLSDGDGANE----EEDINASANEIDNDEDIRDEEVT---EGRGFS

Query:  EGEQSVRDVRQSRESHPPTWMGEYVSGEG-----------VSTDPICFEDAVHNESWRLAMDNEIKSIERNKTWTLIELP--------------------
        EGE+ V++ R+ R   PP WM ++ SGEG           VS DP+CFE+AV +E+WRLAM+ EIKSIE+N+TWTL ELP                    
Subjt:  EGEQSVRDVRQSRESHPPTWMGEYVSGEG-----------VSTDPICFEDAVHNESWRLAMDNEIKSIERNKTWTLIELP--------------------

Query:  -----------------------------------------TAQKDWKIFQLDVKSAFLQGELNEDVYVEQPK---------------------------
                                                  A K+W+IFQLDVKSAFL G+L+EDVYVEQP+                           
Subjt:  -----------------------------------------TAQKDWKIFQLDVKSAFLQGELNEDVYVEQPK---------------------------

Query:  --DRIEAHFLNEGFQRCNSEQTLFTKGSMEGKIIIVTVYVDDLTFTGDGEAMLIEFKNSMIREFDVL-LGKNEIF-------------------------
           RIEAHF+ EGFQ C+SEQTLFTK S EGKIIIV+VYVDDL FTG+ E M+ EFKNSM+REFD+  LGK   F                         
Subjt:  --DRIEAHFLNEGFQRCNSEQTLFTKGSMEGKIIIVTVYVDDLTFTGDGEAMLIEFKNSMIREFDVL-LGKNEIF-------------------------

Query:  -----SCNPVNSPMVPGFKLNKNGNGVTVDDTHHKQLI---------------------------------HGERVLRYLKEIVNYGIHYRKGGKGQLLA
               N V SP+VPGFK+++NG+G  VD+T++KQL+                                   +R+LRYLK  VNYGI+Y+KGG G+LLA
Subjt:  -----SCNPVNSPMVPGFKLNKNGNGVTVDDTHHKQLI---------------------------------HGERVLRYLKEIVNYGIHYRKGGKGQLLA

Query:  YTDSDYAGDMEDRKSTSGYVFLMSSGA--------PIVTLSTTEAEFVAAAVCACQGVWMKRILKEMGFCDEGCTTILCDNSSTIKLSKNPIMHGRSKHI
        +TDSDYAGD EDRKSTSGYVFLMSSGA        PIVTLSTTEAEFVAAAVCACQGVWMKRIL+E+G     C +I+CDNSSTIKLSKNP+MHGRSKHI
Subjt:  YTDSDYAGDMEDRKSTSGYVFLMSSGA--------PIVTLSTTEAEFVAAAVCACQGVWMKRILKEMGFCDEGCTTILCDNSSTIKLSKNPIMHGRSKHI

Query:  DVRFHFLRNLTKDGVVELIHCGSQEQVADIMTKPLKLEVFQRPRRLLGVHEILDKN
        DVRFHFLRNL+K G +ELIHC SQ+QVADIMTKPLKLEVFQ+ R+LLGV EI + N
Subjt:  DVRFHFLRNLTKDGVVELIHCGSQEQVADIMTKPLKLEVFQRPRRLLGVHEILDKN

RVW51356.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.9e-14638.66Show/hide
Query:  MIGGLTRDPFPKRSIWRARQRLEFIHADICGPTTATSNSNK-----------------------------------------------------------
        M+G   RD  PKRS+WRA QRL+ +HADIC P    SNS K                                                           
Subjt:  MIGGLTRDPFPKRSIWRARQRLEFIHADICGPTTATSNSNK-----------------------------------------------------------

Query:  -SDFCKQSGIKRQLMAAYTPQQNG-----------------------------AVNWTIHVLNKCPTLAVQDVTPEEAWSGIKPSVDHFRIFGCIAHARV
         + FCK +GI RQL AAYTPQQNG                             AVNWT HVLN+ PTLAV+ VTPEEAWSG+KP+VD+FR+FGCI H  V
Subjt:  -SDFCKQSGIKRQLMAAYTPQQNG-----------------------------AVNWTIHVLNKCPTLAVQDVTPEEAWSGIKPSVDHFRIFGCIAHARV

Query:  SDGRRTKLDDKSISCVLLGV---------------------------------------IHLDL-----SDGDGANEEEDINASANEIDNDEDIRDEE--
         D +R KLDDKS  CVLLGV                                       + LD+     S+ +G+ ++ +   +    + ++++   E  
Subjt:  SDGRRTKLDDKSISCVLLGV---------------------------------------IHLDL-----SDGDGANEEEDINASANEIDNDEDIRDEE--

Query:  --VTEGRGFSEG--EQSVRDVRQSRESHPPTWMGEYVSG----EG----------VSTDPICFEDAVHNESWRLAMDNEIKSIERNKTWTLIELP-----
          ++E  G + G   +S    +Q R    P WM +YVSG    EG           + DP  FE+AV +  WR AMD EI++IERN+TW L +LP     
Subjt:  --VTEGRGFSEG--EQSVRDVRQSRESHPPTWMGEYVSG----EG----------VSTDPICFEDAVHNESWRLAMDNEIKSIERNKTWTLIELP-----

Query:  --------------------------------------------------------TAQKDWKIFQLDVKSAFLQGELNEDVYVEQPK------------
                                                                 A+  W ++QLDVKSAFL GELNE V++EQP+            
Subjt:  --------------------------------------------------------TAQKDWKIFQLDVKSAFLQGELNEDVYVEQPK------------

Query:  -----------------DRIEAHFLNEGFQRCNSEQTLFTKGSMEGKIIIVTVYVDDLTFTGDGEAMLIEFKNSMIREFDVL-LGKNEIF----------
                          RIEA+F+ EGF+RC+ + TLF K    GKI+IV++YVDDL FTG+ E+M ++FKNSM  EFD+  LGK + F          
Subjt:  -----------------DRIEAHFLNEGFQRCNSEQTLFTKGSMEGKIIIVTVYVDDLTFTGDGEAMLIEFKNSMIREFDVL-LGKNEIF----------

Query:  --------------------SCNPVNSPMVPGFKLNKNGNGVTVDDTHHKQLI---------------------------------HGERVLRYLKEIVN
                              N V +P+VPG +L KN  GV VD T +KQL+                                   +RVLRYLK  V+
Subjt:  --------------------SCNPVNSPMVPGFKLNKNGNGVTVDDTHHKQLI---------------------------------HGERVLRYLKEIVN

Query:  YGIHYRKGGKGQLLAYTDSDYAGDMEDRKSTSGYVFLMSSGA--------PIVTLSTTEAEFVAAAVCACQGVWMKRILKEMGFCDEGCTTILCDNSSTI
         G+ Y+  G G+L+AYTDSDYAGD++DRKSTSGYVFL+S GA        P+VTLSTT+AEFVAAA CACQGVWM+R+L+++G     CTT+LCDN+STI
Subjt:  YGIHYRKGGKGQLLAYTDSDYAGDMEDRKSTSGYVFLMSSGA--------PIVTLSTTEAEFVAAAVCACQGVWMKRILKEMGFCDEGCTTILCDNSSTI

Query:  KLSKNPIMHGRSKHIDVRFHFLRNLTKDGVVELIHCGSQEQVADIMTKPLKLEVFQRPRRLLGV
        KLSKNP+MHGRSKHIDVRFHFLR+LT++GVVEL HCG+QEQVADIMTKPLKL+VF +   LLGV
Subjt:  KLSKNPIMHGRSKHIDVRFHFLRNLTKDGVVELIHCGSQEQVADIMTKPLKLEVFQRPRRLLGV

RVX07197.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.1e-14940.83Show/hide
Query:  MIGGLTRDPFPKRSIWRARQRLEFIHADICGPTTATSNSNK-----------------------------------------------------------
        M+G   RD  PKRS+WRA QRL+ +HADICGP    SNS K                                                           
Subjt:  MIGGLTRDPFPKRSIWRARQRLEFIHADICGPTTATSNSNK-----------------------------------------------------------

Query:  -SDFCKQSGIKRQLMAAYTPQQNG-----------------------------AVNWTIHVLNKCPTLAVQDVTPEEAWSGIKPSVDHFRIFGCIAHARV
         + FCK +GI RQL  AYTPQQNG                              VNWT HVLN+ PTLAV+ VTPEEAWSG+KP+VD+FR+FGCI H  V
Subjt:  -SDFCKQSGIKRQLMAAYTPQQNG-----------------------------AVNWTIHVLNKCPTLAVQDVTPEEAWSGIKPSVDHFRIFGCIAHARV

Query:  SDGRRTKLDDKSISCVLLG--VIHLDLSDGDGANEEE---------DINASANEIDNDEDIRDEEV-TEGR--------GFSEGE------QSVRDVRQS
         D +R KLDDKS  CVLLG  V   D     G + EE         D N   +E +  ED  +E V  EGR          S GE      +S    +Q 
Subjt:  SDGRRTKLDDKSISCVLLG--VIHLDLSDGDGANEEE---------DINASANEIDNDEDIRDEEV-TEGR--------GFSEGE------QSVRDVRQS

Query:  RESHPPTWMGEYVSG----EG----------VSTDPICFEDAVHNESWRLAMDNEIKSIERNKTWTLIELP-----------------------------
        R    P WM +YVSG    EG           + DP  FE+AV +  WR AMD EI++IERN TW L +LP                             
Subjt:  RESHPPTWMGEYVSG----EG----------VSTDPICFEDAVHNESWRLAMDNEIKSIERNKTWTLIELP-----------------------------

Query:  --------------------------------TAQKDWKIFQLDVKSAFLQGELNEDVYVEQPK-----------------------------DRIEAHF
                                         A+  W ++QLDVKSAFL GELNE V++EQP+                              RIEA+F
Subjt:  --------------------------------TAQKDWKIFQLDVKSAFLQGELNEDVYVEQPK-----------------------------DRIEAHF

Query:  LNEGFQRCNSEQTLFTKGSMEGKIIIVTVYVDDLTFTGDGEAMLIEFKNSMIREFDVL-LGKNEIF------------------------------SCNP
        + EGF+RC+ + TLF K    GKI+IV++YVDDL FTG+ E+M ++FKNSM  EFD+  LGK + F                                N 
Subjt:  LNEGFQRCNSEQTLFTKGSMEGKIIIVTVYVDDLTFTGDGEAMLIEFKNSMIREFDVL-LGKNEIF------------------------------SCNP

Query:  VNSPMVPGFKLNKNGNGVTVDDTHHKQLI---------------------------------HGERVLRYLKEIVNYGIHYRKGGKGQLLAYTDSDYAGD
        V +P+VPG +L KN  GV VD T +KQL+                                   +RVLRYLK  V+ G+ Y+K G G+L+AYTDSDYAGD
Subjt:  VNSPMVPGFKLNKNGNGVTVDDTHHKQLI---------------------------------HGERVLRYLKEIVNYGIHYRKGGKGQLLAYTDSDYAGD

Query:  MEDRKSTSGYVFLMSSGA--------PIVTLSTTEAEFVAAAVCACQGVWMKRILKEMGFCDEGCTTILCDNSSTIKLSKNPIMHGRSKHIDVRFHFLRN
        ++DRKSTSGYVFL+S GA        P+VTLSTTEAEFVA A CACQGVWM+R+L+++G     CTT+LCDN+STIKLSKNP+MHGRSKHIDVRFHFL +
Subjt:  MEDRKSTSGYVFLMSSGA--------PIVTLSTTEAEFVAAAVCACQGVWMKRILKEMGFCDEGCTTILCDNSSTIKLSKNPIMHGRSKHIDVRFHFLRN

Query:  LTKDGVVELIHCGSQEQVADIMTKPLKLEVFQRPRRLLGV
        LT++GVVEL HCG+QEQVADIMTKPLKL+VF + R LLGV
Subjt:  LTKDGVVELIHCGSQEQVADIMTKPLKLEVFQRPRRLLGV

TrEMBL top hitse value%identityAlignment
A0A2K3MZ63 Retrotransposon-related protein2.6e-17845.09Show/hide
Query:  RDPFPKRSIWRARQRLEFIHADICGPTTATSNSNK------------------------------------------------------------SDFCK
        RDP PK+S WRA Q+LE IHAD+CGP + +SNSNK                                                            +DFCK
Subjt:  RDPFPKRSIWRARQRLEFIHADICGPTTATSNSNK------------------------------------------------------------SDFCK

Query:  QSGIKRQLMAAYTPQQNG-----------------------------AVNWTIHVLNKCPTLAVQDVTPEEAWSGIKPSVDHFRIFGCIAHARVSDGRRT
        QSG+KRQL  AYTPQQNG                             AVNWTI+VLN+CPTLAV+DVTPEEAWSG+KPSV+HFR+FGCIAH  V + +RT
Subjt:  QSGIKRQLMAAYTPQQNG-----------------------------AVNWTIHVLNKCPTLAVQDVTPEEAWSGIKPSVDHFRIFGCIAHARVSDGRRT

Query:  KLDDKSISCVLLGVIH-------------------------------------LDLSDGDGANE----EEDINASANEIDNDEDIRDEEVT---EGRGFS
        KLD +SI+CVLLGV                                        DL   DG NE    E D     +E DN  ++  E      E    +
Subjt:  KLDDKSISCVLLGVIH-------------------------------------LDLSDGDGANE----EEDINASANEIDNDEDIRDEEVT---EGRGFS

Query:  EGEQSVRDVRQSRESHPPTWMGEYVSGEG-----------VSTDPICFEDAVHNESWRLAMDNEIKSIERNKTWTLIELP--------------------
        EGE+ V++ R+ R   PP WM ++ SGEG           VS DP+CFE+AV +E+WRLAM+ EIKSIE+N+TWTL ELP                    
Subjt:  EGEQSVRDVRQSRESHPPTWMGEYVSGEG-----------VSTDPICFEDAVHNESWRLAMDNEIKSIERNKTWTLIELP--------------------

Query:  -----------------------------------------TAQKDWKIFQLDVKSAFLQGELNEDVYVEQPK---------------------------
                                                  A K+W+IFQLDVKSAFL G+L+EDVYVEQP+                           
Subjt:  -----------------------------------------TAQKDWKIFQLDVKSAFLQGELNEDVYVEQPK---------------------------

Query:  --DRIEAHFLNEGFQRCNSEQTLFTKGSMEGKIIIVTVYVDDLTFTGDGEAMLIEFKNSMIREFDVL-LGKNEIF-------------------------
           RIEAHF+ EGFQ C+SEQTLFTK S EGKIIIV+VYVDDL FTG+ E M+ EFKNSM+REFD+  LGK   F                         
Subjt:  --DRIEAHFLNEGFQRCNSEQTLFTKGSMEGKIIIVTVYVDDLTFTGDGEAMLIEFKNSMIREFDVL-LGKNEIF-------------------------

Query:  -----SCNPVNSPMVPGFKLNKNGNGVTVDDTHHKQLI---------------------------------HGERVLRYLKEIVNYGIHYRKGGKGQLLA
               N V SP+VPGFK+++NG+G  VD+T++KQL+                                   +R+LRYLK  VNYGI+Y+KGG G+LLA
Subjt:  -----SCNPVNSPMVPGFKLNKNGNGVTVDDTHHKQLI---------------------------------HGERVLRYLKEIVNYGIHYRKGGKGQLLA

Query:  YTDSDYAGDMEDRKSTSGYVFLMSSGA--------PIVTLSTTEAEFVAAAVCACQGVWMKRILKEMGFCDEGCTTILCDNSSTIKLSKNPIMHGRSKHI
        +TDSDYAGD EDRKSTSGYVFLMSSGA        PIVTLSTTEAEFVAAAVCACQGVWMKRIL+E+G     C +I+CDNSSTIKLSKNP+MHGRSKHI
Subjt:  YTDSDYAGDMEDRKSTSGYVFLMSSGA--------PIVTLSTTEAEFVAAAVCACQGVWMKRILKEMGFCDEGCTTILCDNSSTIKLSKNPIMHGRSKHI

Query:  DVRFHFLRNLTKDGVVELIHCGSQEQVADIMTKPLKLEVFQRPRRLLGVHEILDKN
        DVRFHFLRNL+K G +ELIHC SQ+QVADIMTKPLKLEVFQ+ R+LLGV EI + N
Subjt:  DVRFHFLRNLTKDGVVELIHCGSQEQVADIMTKPLKLEVFQRPRRLLGVHEILDKN

A0A2N9GWV7 Integrase catalytic domain-containing protein1.6e-16445.43Show/hide
Query:  RDPFPKRSIWRARQRLEFIHADICGPTTATSNSNK------------------------------------------------------------SDFCK
        RDP PK+S WRA Q+LE IHADICGP T TSNSNK                                                            +DFCK
Subjt:  RDPFPKRSIWRARQRLEFIHADICGPTTATSNSNK------------------------------------------------------------SDFCK

Query:  QSGIKRQLMAAYTPQQNG-----------------------------AVNWTIHVLNKCPTLAVQDVTPEEAWSGIKPSVDHFRIFGCIAHARVSDGRRT
        QSGIKRQL AAYTPQQNG                              VNWTI+VLN+CPTLAV+DVTPEE WSG+KPS+DHFR+FGCIAH    + RRT
Subjt:  QSGIKRQLMAAYTPQQNG-----------------------------AVNWTIHVLNKCPTLAVQDVTPEEAWSGIKPSVDHFRIFGCIAHARVSDGRRT

Query:  KLDDKSISCVLLG------------------------------------------VIHLDLSDGDGANEEE-DINASANEIDND-EDIRDEEVTEGRGFS
        KLD++SI+CVLLG                                          V+ L+  DGDG  E E  ++ + N  + D E +R+EE     G S
Subjt:  KLDDKSISCVLLG------------------------------------------VIHLDLSDGDGANEEE-DINASANEIDND-EDIRDEEVTEGRGFS

Query:  EGEQSVRDVRQSRESHPPTWMGEYVSGEGV-----------STDPICFEDAVHNESWRLAMDNEIKSIERNKTWTLIELP--------------------
        EGE+ VR++RQSRE  PPTWMG+YVSGEG+           STDP+ FE+AV + +WRLAM++EIKSIE+N+TWTL ELP                    
Subjt:  EGEQSVRDVRQSRESHPPTWMGEYVSGEGV-----------STDPICFEDAVHNESWRLAMDNEIKSIERNKTWTLIELP--------------------

Query:  -----------------------------------------TAQKDWKIFQLDVKSAFLQGELNEDVYVEQPKDRIEAHFLNEGFQRCNSEQTLFTKGSM
                                                  AQK+W IFQLDVKSAFL GEL+E+VYVEQP+          G+++  SE  ++    +
Subjt:  -----------------------------------------TAQKDWKIFQLDVKSAFLQGELNEDVYVEQPKDRIEAHFLNEGFQRCNSEQTLFTKGSM

Query:  EGKIIIVTVYVDDLTFTGD-----GEAMLIEFKNSMI--REFDV-LLGKNEIFSCNPVNSPMVPGFKLNKNGNGVTVDDTHHKQLIHGERVL--------
           + ++  +  D++  G      G  +L +     I  R++ + +L +  +   N V SP+VPGFK++++ NG  VD+T++KQL+     L        
Subjt:  EGKIIIVTVYVDDLTFTGD-----GEAMLIEFKNSMI--REFDV-LLGKNEIFSCNPVNSPMVPGFKLNKNGNGVTVDDTHHKQLIHGERVL--------

Query:  --RYLKEIVNYGIHYRKGGKGQLLAYTDSDYAGDMEDRKSTSGYVFLMSSGA--------PIVTLSTTEAEFVAAAVCACQGVWMKRILKEMGFCDEGCT
           +L   +NYGIHY+KGG G+LLA+TDSDYAGDME+RKSTSGYVFLMSS A        PIVTLSTTEAEFVAAAVCACQGVWMKRILKE+G  D GCT
Subjt:  --RYLKEIVNYGIHYRKGGKGQLLAYTDSDYAGDMEDRKSTSGYVFLMSSGA--------PIVTLSTTEAEFVAAAVCACQGVWMKRILKEMGFCDEGCT

Query:  TILCDNSSTIKLSKNPIMHGRSKHIDVRFHFLRNLTKDGVVELIHCGSQEQVADIMTKPLKLEVFQRPRRLLGVHEI
        T++CDNSSTIKLSKNP+MHGRSKHIDVRFHFLRNLTK+G +EL+HCGSQ+QVADIMTKPLKLE FQ+ R+LLGV EI
Subjt:  TILCDNSSTIKLSKNPIMHGRSKHIDVRFHFLRNLTKDGVVELIHCGSQEQVADIMTKPLKLEVFQRPRRLLGVHEI

A0A2N9H9R9 Uncharacterized protein2.4e-19247.3Show/hide
Query:  RDPFPKRSIWRARQRLEFIHADICGPTTATSNSNK------------------------------------------------------------SDFCK
        RDP PK+S WRA Q+LE IHADICGP T TSNSNK                                                            +DFCK
Subjt:  RDPFPKRSIWRARQRLEFIHADICGPTTATSNSNK------------------------------------------------------------SDFCK

Query:  QSGIKRQLMAAYTPQQNG-----------------------------AVNWTIHVLNKCPTLAVQDVTPEEAWSGIKPSVDHFRIFGCIAHARVSDGRRT
        QSGIKRQL AAYTPQQNG                             AVNWTI+VLN+CPTLAV+DVTPEEAWSG+KPS+DHFR+FGCIAH  V + RRT
Subjt:  QSGIKRQLMAAYTPQQNG-----------------------------AVNWTIHVLNKCPTLAVQDVTPEEAWSGIKPSVDHFRIFGCIAHARVSDGRRT

Query:  KLDDKSISCVLLG------------------------------------------VIHLDLSDGDGANEEE-DINASANEIDND-EDIRDEEVTEGRGFS
        KLD++SI+CVLLG                                          V+ L+  DGDG  E E  ++ + N  + D E +R+EE     G S
Subjt:  KLDDKSISCVLLG------------------------------------------VIHLDLSDGDGANEEE-DINASANEIDND-EDIRDEEVTEGRGFS

Query:  EGEQSVRDVRQSRESHPPTWMGEYVSGEGV-----------STDPICFEDAVHNESWRLAMDNEIKSIERNKTWTLIELP--------------------
        EGE+ VR++RQSRE  PPTWMG+YVSGEG+           STDP+ FE+AV + +WRLAM++EIKSIE+N+TWTL ELP                    
Subjt:  EGEQSVRDVRQSRESHPPTWMGEYVSGEGV-----------STDPICFEDAVHNESWRLAMDNEIKSIERNKTWTLIELP--------------------

Query:  -----------------------------------------TAQKDWKIFQLDVKSAFLQGELNEDVYVEQPK---------------------------
                                                  AQK+W IFQLDVKSAFL GEL+E+VYVEQP+                           
Subjt:  -----------------------------------------TAQKDWKIFQLDVKSAFLQGELNEDVYVEQPK---------------------------

Query:  --DRIEAHFLNEGFQRCNSEQTLFTKGSMEGKIIIVTVYVDDLTFTGDGEAMLIEFKNSMIREFDVL-LGKNEIF-------------------------
           RIEAHF+ EGF+RC+SEQTLFTK S EGKIIIV+VYVDDL FTG+ E M+ EFK+SM+REFD+  LGK   F                         
Subjt:  --DRIEAHFLNEGFQRCNSEQTLFTKGSMEGKIIIVTVYVDDLTFTGDGEAMLIEFKNSMIREFDVL-LGKNEIF-------------------------

Query:  -----SCNPVNSPMVPGFKLNKNGNGVTVDDTHHKQLI---------------------------------HGERVLRYLKEIVNYGIHYRKGGKGQLLA
               N V SP+VPGFK++++ NG  VD+T++KQL+                                   +R LRYLK  VNYGIHY+KGG G+LLA
Subjt:  -----SCNPVNSPMVPGFKLNKNGNGVTVDDTHHKQLI---------------------------------HGERVLRYLKEIVNYGIHYRKGGKGQLLA

Query:  YTDSDYAGDMEDRKSTSGYVFLMSSGA--------PIVTLSTTEAEFVAAAVCACQGVWMKRILKEMGFCDEGCTTILCDNSSTIKLSKNPIMHGRSKHI
        +TDSDYAGDMEDRKSTSGYVFLMSS A        PIVTLSTTEAEFVAAAVCACQGVWMKRILKE+G  D GCTT++CDNSSTIKLSKNP+MHGRSKHI
Subjt:  YTDSDYAGDMEDRKSTSGYVFLMSSGA--------PIVTLSTTEAEFVAAAVCACQGVWMKRILKEMGFCDEGCTTILCDNSSTIKLSKNPIMHGRSKHI

Query:  DVRFHFLRNLTKDGVVELIHCGSQEQVADIMTKPLKLEVFQRPRRLLGVHEI
        DVRFHFLRNLTK+G +EL+HCGSQ+QVADIMTKPLKLE FQ+ R+LLGV EI
Subjt:  DVRFHFLRNLTKDGVVELIHCGSQEQVADIMTKPLKLEVFQRPRRLLGVHEI

A0A4Y1QX25 ADP glucose pyrophosphorylase large subunit 13.2e-19247.19Show/hide
Query:  RDPFPKRSIWRARQRLEFIHADICGPTTATSNSNK------------------------------------------------------------SDFCK
        RDP PK+S WRA Q+LE IHADICGP T TSNSNK                                                            +DFCK
Subjt:  RDPFPKRSIWRARQRLEFIHADICGPTTATSNSNK------------------------------------------------------------SDFCK

Query:  QSGIKRQLMAAYTPQQNG-----------------------------AVNWTIHVLNKCPTLAVQDVTPEEAWSGIKPSVDHFRIFGCIAHARVSDGRRT
        QSGIKRQL  AYTPQQNG                             AVNWTI+VLN+CPTLAV+DVTPEE WSG+KPSVDHFR+FGCIAH  V + RRT
Subjt:  QSGIKRQLMAAYTPQQNG-----------------------------AVNWTIHVLNKCPTLAVQDVTPEEAWSGIKPSVDHFRIFGCIAHARVSDGRRT

Query:  KLDDKSISCVLLG------------------------------------------VIHLDLSDGDGANEEEDINASANEIDNDEDI---RDEEVTEGR-G
        KLD++SI+CVLLG                                          V+ L+  DGDG N EE ++ + N  + D ++   RD EV E   G
Subjt:  KLDDKSISCVLLG------------------------------------------VIHLDLSDGDGANEEEDINASANEIDNDEDI---RDEEVTEGR-G

Query:  FSEGEQSVRDVRQSRESHPPTWMGEYVSGEGV-----------STDPICFEDAVHNESWRLAMDNEIKSIERNKTWTLIELP------------------
         SEGE+ V ++RQ RE  PPTWMG+YVSGEG+           STDP+ FE+AV + +WRLAM++EIKSIE+N+TWTL ELP                  
Subjt:  FSEGEQSVRDVRQSRESHPPTWMGEYVSGEGV-----------STDPICFEDAVHNESWRLAMDNEIKSIERNKTWTLIELP------------------

Query:  -------------------------------------------TAQKDWKIFQLDVKSAFLQGELNEDVYVEQPK-------------------------
                                                    AQK+W IFQLDVKSAFL GEL+EDVYVEQP+                         
Subjt:  -------------------------------------------TAQKDWKIFQLDVKSAFLQGELNEDVYVEQPK-------------------------

Query:  ----DRIEAHFLNEGFQRCNSEQTLFTKGSMEGKIIIVTVYVDDLTFTGDGEAMLIEFKNSMIREFDVL-LGKNEIF-----------------------
             R+EAHF+ EGF+RC+SEQTLFTK S EGKI+IV+VYVDDL FTG+ E M+ EFK+SM+REFD+  LGK   F                       
Subjt:  ----DRIEAHFLNEGFQRCNSEQTLFTKGSMEGKIIIVTVYVDDLTFTGDGEAMLIEFKNSMIREFDVL-LGKNEIF-----------------------

Query:  -------SCNPVNSPMVPGFKLNKNGNGVTVDDTHHKQLI---------------------------------HGERVLRYLKEIVNYGIHYRKGGKGQL
                 N V SP+VPGFK++K+ NG +VD+T++KQL+                                   +R LRYLK  VNYGIHY+KGG G+L
Subjt:  -------SCNPVNSPMVPGFKLNKNGNGVTVDDTHHKQLI---------------------------------HGERVLRYLKEIVNYGIHYRKGGKGQL

Query:  LAYTDSDYAGDMEDRKSTSGYVFLMSSGA--------PIVTLSTTEAEFVAAAVCACQGVWMKRILKEMGFCDEGCTTILCDNSSTIKLSKNPIMHGRSK
        LA+TDSDYAGDMEDRKSTSGYVFLMSSGA        PIVTLSTTEAEFVAAAVCACQGVWMKRILKE+G  D  CTT++CDNSSTIKLSKNP+MHGRSK
Subjt:  LAYTDSDYAGDMEDRKSTSGYVFLMSSGA--------PIVTLSTTEAEFVAAAVCACQGVWMKRILKEMGFCDEGCTTILCDNSSTIKLSKNPIMHGRSK

Query:  HIDVRFHFLRNLTKDGVVELIHCGSQEQVADIMTKPLKLEVFQRPRRLLGVHEI
        HIDVRFHFLRNLTK+G +EL+HCGSQ+QVADIMTKPLKLEVFQ+ R+LLGV EI
Subjt:  HIDVRFHFLRNLTKDGVVELIHCGSQEQVADIMTKPLKLEVFQRPRRLLGVHEI

A0A4Y1RLR1 Transposable element protein4.3e-16545.14Show/hide
Query:  RDPFPKRSIWRARQRLEFIHADICGPTTATSNSNKS--------------------------------------------------DFCKQSGIKRQLMA
        RD  PK S WRA Q LE IHADICGP +  SNS KS                                                  DFCKQSGIKRQL  
Subjt:  RDPFPKRSIWRARQRLEFIHADICGPTTATSNSNKS--------------------------------------------------DFCKQSGIKRQLMA

Query:  AYTPQQNG-----------------------------AVNWTIHVLNKCPTLAVQDVTPEEAWSGIKPSVDHFRIFGCIAHARVSDGRRTKLDDKSISCV
        AYTPQQNG                             AVNWT +VLN+CPTL V++VTP+EAWSG+KPSV+HFR+FGC+AH  + D RR           
Subjt:  AYTPQQNG-----------------------------AVNWTIHVLNKCPTLAVQDVTPEEAWSGIKPSVDHFRIFGCIAHARVSDGRRTKLDDKSISCV

Query:  LLGVIHLDLSDGDGANEEEDINASANEIDNDEDIRDEEVTEGRGFSEGEQSVRDVRQSRESHPPTWMGEYVSGEGVS------------TDPICFEDAVH
              L+  + +  NE E+  +   + D + D       E RG   G  +  D  + R   PP ++ +Y+SGEG+S             DP  FE+AV 
Subjt:  LLGVIHLDLSDGDGANEEEDINASANEIDNDEDIRDEEVTEGRGFSEGEQSVRDVRQSRESHPPTWMGEYVSGEGVS------------TDPICFEDAVH

Query:  NESWRLAMDNEIKSIERNKTWTLIELP-------------------------------------------------------------TAQKDWKIFQLD
        N  WR AMD+EIKSIE+NKTWTL ELP                                                              AQ  WKIFQLD
Subjt:  NESWRLAMDNEIKSIERNKTWTLIELP-------------------------------------------------------------TAQKDWKIFQLD

Query:  VKSAFLQGELNEDVYVEQPK-----------------------------DRIEAHFLNEGFQRCNSEQTLFTKGSMEGKIIIVTVYVDDLTFTGDGEAML
        VKSAFL GEL+E+VYVEQP+                              RIEAHF+NEGFQRC+SEQTLFTK + EGKIIIV++YVDDL FTGD E M+
Subjt:  VKSAFLQGELNEDVYVEQPK-----------------------------DRIEAHFLNEGFQRCNSEQTLFTKGSMEGKIIIVTVYVDDLTFTGDGEAML

Query:  IEFKNSMIREFDVL-LGKNEIFSCNPVNSPMVPGFKLNKNGNGVTVDDTHHKQLI---------------------------------HGERVLRYLKEI
         EFK+SM+REFD+  LG    F    +   ++     +K+ +G+TVD+T+ KQL+                                   +R LRYLK  
Subjt:  IEFKNSMIREFDVL-LGKNEIFSCNPVNSPMVPGFKLNKNGNGVTVDDTHHKQLI---------------------------------HGERVLRYLKEI

Query:  VNYGIHYRKGGKGQLLAYTDSDYAGDMEDRKSTSGYVFLMSSGA--------PIVTLSTTEAEFVAAAVCACQGVWMKRILKEMGFCDEGCTTILCDNSS
        VNYGIHY++GG G+LLA+TDSDYAGDMEDRKSTSGYVFL+SSGA        PIVTLSTTEAEFVAAAVCACQ +WMKR+LKE+G  DE CT I CDNSS
Subjt:  VNYGIHYRKGGKGQLLAYTDSDYAGDMEDRKSTSGYVFLMSSGA--------PIVTLSTTEAEFVAAAVCACQGVWMKRILKEMGFCDEGCTTILCDNSS

Query:  TIKLSKNPIMHGRSKHIDVRFHFLRNLTKDGVVELIHCGSQEQVADIMTKPLKLEVFQRPRRLLGVHEILD
        TIKLSKNP+MHGRSKHIDVR+HFLRNLTK+G + L+HCGS +QVAD+MTKPLK++ FQ+ R LLGV EI D
Subjt:  TIKLSKNPIMHGRSKHIDVRFHFLRNLTKDGVVELIHCGSQEQVADIMTKPLKLEVFQRPRRLLGVHEILD

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.2e-2623.86Show/hide
Query:  ESWRLAMDNEIKSIERNKTWTLIELPTAQKDWKIFQLDVKSAFLQGELNEDVYVEQPK------DRI--------------------------EAHFLNE
        + +++  +     + R  ++  I     Q + K+ Q+DVK+AFL G L E++Y+  P+      D +                          E  F+N 
Subjt:  ESWRLAMDNEIKSIERNKTWTLIELPTAQKDWKIFQLDVKSAFLQGELNEDVYVEQPK------DRI--------------------------EAHFLNE

Query:  GFQRCNSEQTLFTKGSMEGKIIIVTVYVDDLTFTGDGEAMLIEFKNSMIREFDV-------------------------------LLGKNEIFSCNPVNS
           RC     +  KG++   I ++ +YVDD+         +  FK  ++ +F +                               +L K  + +CN V++
Subjt:  GFQRCNSEQTLFTKGSMEGKIIIVTVYVDDLTFTGDGEAMLIEFKNSMIREFDV-------------------------------LLGKNEIFSCNPVNS

Query:  PMVPGFK---LNKNGNGVT-----------------------------VDDTHHKQLIHG-ERVLRYLKEIVNYGIHYRK--GGKGQLLAYTDSDYAGDM
        P+        LN + +  T                                 ++ +L    +RVLRYLK  ++  + ++K    + +++ Y DSD+AG  
Subjt:  PMVPGFK---LNKNGNGVT-----------------------------VDDTHHKQLIHG-ERVLRYLKEIVNYGIHYRK--GGKGQLLAYTDSDYAGDM

Query:  EDRKSTSGYVFLM---------SSGAPIVTLSTTEAEFVAAAVCACQGVWMKRILKEMGFCDEGCTTILCDNSSTIKLSKNPIMHGRSKHIDVRFHFLRN
         DRKST+GY+F M         +     V  S+TEAE++A      + +W+K +L  +    E    I  DN   I ++ NP  H R+KHID+++HF R 
Subjt:  EDRKSTSGYVFLM---------SSGAPIVTLSTTEAEFVAAAVCACQGVWMKRILKEMGFCDEGCTTILCDNSSTIKLSKNPIMHGRSKHIDVRFHFLRN

Query:  LTKDGVVELIHCGSQEQVADIMTKPLKLEVFQRPRRLLGV
          ++ V+ L +  ++ Q+ADI TKPL    F   R  LG+
Subjt:  LTKDGVVELIHCGSQEQVADIMTKPLKLEVFQRPRRLLGV

P0CV72 Secreted RxLR effector protein 1614.1e-1138.54Show/hide
Query:  THHKQLIHGERVLRYLKEIVNYGIHYRKGGKGQLLAYTDSDYAGDMEDRKSTSGYVFLMSSGA--------PIVTLSTTEAEFVAAAVCACQGVWM
        TH + L   +RVLRYL+    YG+ + + G  +L+ Y+D+D+AGD+E R+STSGY+F ++ G           V LS+TE E++A +    + VW+
Subjt:  THHKQLIHGERVLRYLKEIVNYGIHYRKGGKGQLLAYTDSDYAGDMEDRKSTSGYVFLMSSGA--------PIVTLSTTEAEFVAAAVCACQGVWM

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.9e-4922.85Show/hide
Query:  DFCKQSGIKRQLMAAYTPQQNG-----------------------------AVNWTIHVLNKCPTLAVQDVTPEEAWSGIKPSVDHFRIFGCIAHARVSD
        ++C   GI+ +     TPQ NG                             AV    +++N+ P++ +    PE  W+  + S  H ++FGC A A V  
Subjt:  DFCKQSGIKRQLMAAYTPQQNG-----------------------------AVNWTIHVLNKCPTLAVQDVTPEEAWSGIKPSVDHFRIFGCIAHARVSD

Query:  GRRTKLDDKSISCVLLG-----------------VIH--------------LDLSD-------------GDGANEEEDINASANEIDNDEDIRDEEVTEG
         +RTKLDDKSI C+ +G                 VI                D+S+                +N      ++ +E+    +   E + +G
Subjt:  GRRTKLDDKSISCVLLG-----------------VIH--------------LDLSD-------------GDGANEEEDINASANEIDNDEDIRDEEVTEG

Query:  RGFSEGEQSVRDVRQSRESHPPTWMGEYVSGEG---VSTDPICFEDAVHNESWR------------LAMDNEIKSIERNKTWTLIELP------------
            EG + V    Q  E H P    E    E     ST+ +   D    ES +             AM  E++S+++N T+ L+ELP            
Subjt:  RGFSEGEQSVRDVRQSRESHPPTWMGEYVSGEG---VSTDPICFEDAVHNESWR------------LAMDNEIKSIERNKTWTLIELP------------

Query:  -------------------------------------------------TAQKDWKIFQLDVKSAFLQGELNEDVYVEQPKD------------------
                                                          A  D ++ QLDVK+AFL G+L E++Y+EQP+                   
Subjt:  -------------------------------------------------TAQKDWKIFQLDVKSAFLQGELNEDVYVEQPKD------------------

Query:  -----------RIEAHFLNEGFQRCNSEQTLFTKGSMEGKIIIVTVYVDDLTFTGDGEAMLIEFKNSMIREFDV--------------------------
                   + ++   ++ + +  S+  ++ K   E   II+ +YVDD+   G  + ++ + K  + + FD+                          
Subjt:  -----------RIEAHFLNEGFQRCNSEQTLFTKGSMEGKIIIVTVYVDDLTFTGDGEAMLIEFKNSMIREFDV--------------------------

Query:  -------LLGKNEIFSCNPVNSPMVPGFKLNKNGNGVTVDDTHH---------------------KQLIHGERVL-RYLK-------EIVNYGIHYRKGG
               +L +  + +  PV++P+    KL+K     TV++  +                       + H   V+ R+L+       E V + + Y +G 
Subjt:  -------LLGKNEIFSCNPVNSPMVPGFKLNKNGNGVTVDDTHH---------------------KQLIHGERVL-RYLK-------EIVNYGIHYRKGG

Query:  KGQLL----------AYTDSDYAGDMEDRKSTSGYVFLMSSGA--------PIVTLSTTEAEFVAAAVCACQGVWMKRILKEMGFCDEGCTTILCDNSST
         G  L           YTD+D AGD+++RKS++GY+F  S GA          V LSTTEAE++AA     + +W+KR L+E+G   +    + CD+ S 
Subjt:  KGQLL----------AYTDSDYAGDMEDRKSTSGYVFLMSSGA--------PIVTLSTTEAEFVAAAVCACQGVWMKRILKEMGFCDEGCTTILCDNSST

Query:  IKLSKNPIMHGRSKHIDVRFHFLRNLTKDGVVELIHCGSQEQVADIMTKPLKLEVFQRPRRLLGVH
        I LSKN + H R+KHIDVR+H++R +  D  ++++   + E  AD++TK +    F+  + L+G+H
Subjt:  IKLSKNPIMHGRSKHIDVRFHFLRNLTKDGVVELIHCGSQEQVADIMTKPLKLEVFQRPRRLLGVH

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.5e-3727.87Show/hide
Query:  KDWKIFQLDVKSAFLQGELNEDVYVEQP-----KDR------------------------IEAHFLNEGFQRCNSEQTLFTKGSMEGK-IIIVTVYVDDL
        + W I QLDV +AFLQG L +DVY+ QP     KDR                        +  + L  GF    S+ +LF      GK I+ + VYVDD+
Subjt:  KDWKIFQLDVKSAFLQGELNEDVYVEQP-----KDR------------------------IEAHFLNEGFQRCNSEQTLFTKGSMEGK-IIIVTVYVDDL

Query:  TFTGDGEAMLIEFKNSMIREFDV-------------------------------LLGKNEIFSCNPVNSPMVPGFKLNKNGNGVTVDDTHHKQLIHG---
          TG+   +L    +++ + F V                               LL +  + +  PV +PM P  KL+        D T ++ ++     
Subjt:  TFTGDGEAMLIEFKNSMIREFDV-------------------------------LLGKNEIFSCNPVNSPMVPGFKLNKNGNGVTVDDTHHKQLIHG---

Query:  ------------------------------ERVLRYLKEIVNYGIHYRKGGKGQLLAYTDSDYAGDMEDRKSTSGYVFLM--------SSGAPIVTLSTT
                                      +R+LRYL    N+GI  +KG    L AY+D+D+AGD +D  ST+GY+  +        S     V  S+T
Subjt:  ------------------------------ERVLRYLKEIVNYGIHYRKGGKGQLLAYTDSDYAGDMEDRKSTSGYVFLM--------SSGAPIVTLSTT

Query:  EAEFVAAAVCACQGVWMKRILKEMGFCDEGCTTILCDNSSTIKLSKNPIMHGRSKHIDVRFHFLRNLTKDGVVELIHCGSQEQVADIMTKPLKLEVFQRP
        EAE+ + A  + +  W+  +L E+G        I CDN     L  NP+ H R KHI + +HF+RN  + G + ++H  + +Q+AD +TKPL    FQ  
Subjt:  EAEFVAAAVCACQGVWMKRILKEMGFCDEGCTTILCDNSSTIKLSKNPIMHGRSKHIDVRFHFLRNLTKDGVVELIHCGSQEQVADIMTKPLKLEVFQRP

Query:  RRLLGVHEI
           +GV  +
Subjt:  RRLLGVHEI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.2e-3627.45Show/hide
Query:  KDWKIFQLDVKSAFLQGELNEDVYVEQP-----KDR------------------------IEAHFLNEGFQRCNSEQTLFTKGSMEGKIIIVTVYVDDLT
        + W I QLDV +AFLQG L ++VY+ QP     KDR                        +  + L  GF    S+ +LF        II + VYVDD+ 
Subjt:  KDWKIFQLDVKSAFLQGELNEDVYVEQP-----KDR------------------------IEAHFLNEGFQRCNSEQTLFTKGSMEGKIIIVTVYVDDLT

Query:  FTGDGEAMLIEFKNSMIREFDV-------------------------------LLGKNEIFSCNPVNSPMVPGFKLNKNGNGVTVDDTHHKQLIHG----
         TG+   +L    +++ + F V                               LL +  + +  PV +PM    KL  +      D T ++ ++      
Subjt:  FTGDGEAMLIEFKNSMIREFDV-------------------------------LLGKNEIFSCNPVNSPMVPGFKLNKNGNGVTVDDTHHKQLIHG----

Query:  -----------------------------ERVLRYLKEIVNYGIHYRKGGKGQLLAYTDSDYAGDMEDRKSTSGYVFLM--------SSGAPIVTLSTTE
                                     +RVLRYL    ++GI  +KG    L AY+D+D+AGD +D  ST+GY+  +        S     V  S+TE
Subjt:  -----------------------------ERVLRYLKEIVNYGIHYRKGGKGQLLAYTDSDYAGDMEDRKSTSGYVFLM--------SSGAPIVTLSTTE

Query:  AEFVAAAVCACQGVWMKRILKEMGFCDEGCTTILCDNSSTIKLSKNPIMHGRSKHIDVRFHFLRNLTKDGVVELIHCGSQEQVADIMTKPLKLEVFQRPR
        AE+ + A  + +  W+  +L E+G        I CDN     L  NP+ H R KHI + +HF+RN  + G + ++H  + +Q+AD +TKPL    FQ   
Subjt:  AEFVAAAVCACQGVWMKRILKEMGFCDEGCTTILCDNSSTIKLSKNPIMHGRSKHIDVRFHFLRNLTKDGVVELIHCGSQEQVADIMTKPLKLEVFQRPR

Query:  RLLGVHEI
        R +GV ++
Subjt:  RLLGVHEI

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.9e-0419.2Show/hide
Query:  LIELPTAQKDWKIFQLDVKSAFLQGELNEDVYVEQPKD---------------------------------RIEAHFLNEGFQRCNSEQTLFTKGSMEGK
        LI   +A  ++ + QLD+ +AFL G+L+E++Y++ P                                   +     +  GF + +S+ T F K +    
Subjt:  LIELPTAQKDWKIFQLDVKSAFLQGELNEDVYVEQPKD---------------------------------RIEAHFLNEGFQRCNSEQTLFTKGSMEGK

Query:  IIIVTVYVDDLTFTGDGEAMLIEFKNSMIREFDV-------------------------------LLGKNEIFSCNPVNSPMVPGFKLNKNGNGVTVDDT
         + V VYVDD+    + +A + E K+ +   F +                               LL +  +  C P + PM P    + +  G  VD  
Subjt:  IIIVTVYVDDLTFTGDGEAMLIEFKNSMIREFDV-------------------------------LLGKNEIFSCNPVNSPMVPGFKLNKNGNGVTVDDT

Query:  HHKQLIHGERVLRYLKEIVNYGIH
         +++LI     L+  +  +++ ++
Subjt:  HHKQLIHGERVLRYLKEIVNYGIH

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein9.0e-0627.5Show/hide
Query:  AVNWTIHVLNKCPTLAVQDVTPEEAWSGIKPSVDHFRIFGCIAHARVSDGR---RTKLDDKSISCVLLGVIHLDLSDGDG
        A N  +H++NK P+ A+    P+E W    P+  + R FGC+A+    +G+   R K  ++  S ++  ++ +  + G G
Subjt:  AVNWTIHVLNKCPTLAVQDVTPEEAWSGIKPSVDHFRIFGCIAHARVSDGR---RTKLDDKSISCVLLGVIHLDLSDGDG

ATMG00810.1 DNA/RNA polymerases superfamily protein1.8e-0633.72Show/hide
Query:  ERVLRYLKEIVNYGIHYRKGGKGQLLAYTDSDYAGDMEDRKSTSGYVFLM--------SSGAPIVTLSTTEAEFVAAAVCACQGVW
        +RVLRY+K  + +G++  K  K  + A+ DSD+AG    R+ST+G+   +        +   P V+ S+TE E+ A A+ A +  W
Subjt:  ERVLRYLKEIVNYGIHYRKGGKGQLLAYTDSDYAGDMEDRKSTSGYVFLM--------SSGAPIVTLSTTEAEFVAAAVCACQGVW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTGGGGGTCTTACACGAGACCCATTTCCAAAGAGGAGCATTTGGAGAGCTAGACAAAGATTAGAATTTATTCATGCAGATATTTGTGGCCCGACCACTGCTACTTC
CAATAGCAATAAGAGTGATTTTTGCAAACAAAGTGGGATCAAAAGGCAACTAATGGCCGCTTATACTCCGCAACAAAATGGGGCAGTAAATTGGACTATTCATGTCCTGA
ATAAATGTCCAACCTTGGCTGTTCAAGATGTCACTCCAGAGGAGGCTTGGAGTGGAATAAAACCATCAGTGGATCATTTTAGAATTTTTGGTTGCATAGCACATGCACGT
GTCTCGGATGGAAGAAGAACGAAGCTTGATGATAAAAGCATTTCTTGTGTATTGCTTGGAGTTATACACTTAGATTTAAGTGATGGAGATGGTGCTAATGAAGAAGAAGA
TATAAATGCGAGTGCAAACGAGATTGATAACGATGAAGATATAAGAGATGAGGAAGTTACTGAAGGTCGTGGTTTTAGTGAAGGTGAACAAAGTGTTAGAGATGTAAGAC
AATCTCGTGAAAGCCATCCTCCTACATGGATGGGTGAATATGTTAGTGGAGAAGGGGTATCAACTGATCCAATATGTTTTGAGGATGCTGTACATAACGAAAGTTGGAGA
TTGGCAATGGATAATGAAATTAAATCCATTGAGAGGAACAAAACATGGACACTCATTGAGCTACCAACTGCACAAAAGGATTGGAAGATTTTTCAACTTGATGTCAAGTC
GGCTTTCCTTCAAGGTGAATTGAATGAAGATGTCTACGTAGAGCAGCCAAAAGATCGAATTGAAGCACATTTCCTTAATGAAGGATTTCAAAGGTGCAATAGCGAACAAA
CATTATTCACCAAGGGAAGCATGGAAGGAAAAATAATCATTGTAACTGTCTATGTTGATGATTTAACTTTTACTGGTGACGGTGAAGCTATGTTGATTGAATTTAAAAAT
TCCATGATACGAGAATTTGACGTCTTACTTGGGAAAAATGAGATTTTTTCTTGTAATCCAGTGAATAGTCCAATGGTTCCTGGCTTCAAATTGAATAAGAATGGAAATGG
GGTCACCGTTGATGATACACACCACAAGCAACTGATACATGGCGAAAGAGTTCTTCGATACTTAAAAGAAATTGTGAATTATGGAATCCATTACAGGAAAGGAGGAAAAG
GTCAATTGTTGGCATATACAGATAGTGACTATGCTGGAGATATGGAAGATCGAAAAAGTACATCAGGATATGTATTCTTAATGAGTTCAGGTGCTCCAATTGTGACTTTA
TCTACCACTGAAGCTGAATTTGTAGCTGCTGCAGTTTGTGCCTGTCAAGGAGTTTGGATGAAGAGAATTTTGAAGGAGATGGGTTTCTGTGATGAAGGTTGTACAACTAT
ACTATGTGACAATAGTTCAACTATCAAATTGTCTAAAAATCCCATTATGCATGGACGAAGCAAGCATATTGATGTGAGGTTTCATTTTCTAAGAAATCTTACTAAAGATG
GTGTAGTTGAATTGATTCATTGTGGAAGCCAAGAACAAGTTGCAGACATCATGACAAAACCATTAAAACTGGAAGTTTTTCAAAGGCCTCGAAGGTTGCTGGGAGTTCAT
GAGATTTTAGATAAAAACTAA
mRNA sequenceShow/hide mRNA sequence
ATGATTGGGGGTCTTACACGAGACCCATTTCCAAAGAGGAGCATTTGGAGAGCTAGACAAAGATTAGAATTTATTCATGCAGATATTTGTGGCCCGACCACTGCTACTTC
CAATAGCAATAAGAGTGATTTTTGCAAACAAAGTGGGATCAAAAGGCAACTAATGGCCGCTTATACTCCGCAACAAAATGGGGCAGTAAATTGGACTATTCATGTCCTGA
ATAAATGTCCAACCTTGGCTGTTCAAGATGTCACTCCAGAGGAGGCTTGGAGTGGAATAAAACCATCAGTGGATCATTTTAGAATTTTTGGTTGCATAGCACATGCACGT
GTCTCGGATGGAAGAAGAACGAAGCTTGATGATAAAAGCATTTCTTGTGTATTGCTTGGAGTTATACACTTAGATTTAAGTGATGGAGATGGTGCTAATGAAGAAGAAGA
TATAAATGCGAGTGCAAACGAGATTGATAACGATGAAGATATAAGAGATGAGGAAGTTACTGAAGGTCGTGGTTTTAGTGAAGGTGAACAAAGTGTTAGAGATGTAAGAC
AATCTCGTGAAAGCCATCCTCCTACATGGATGGGTGAATATGTTAGTGGAGAAGGGGTATCAACTGATCCAATATGTTTTGAGGATGCTGTACATAACGAAAGTTGGAGA
TTGGCAATGGATAATGAAATTAAATCCATTGAGAGGAACAAAACATGGACACTCATTGAGCTACCAACTGCACAAAAGGATTGGAAGATTTTTCAACTTGATGTCAAGTC
GGCTTTCCTTCAAGGTGAATTGAATGAAGATGTCTACGTAGAGCAGCCAAAAGATCGAATTGAAGCACATTTCCTTAATGAAGGATTTCAAAGGTGCAATAGCGAACAAA
CATTATTCACCAAGGGAAGCATGGAAGGAAAAATAATCATTGTAACTGTCTATGTTGATGATTTAACTTTTACTGGTGACGGTGAAGCTATGTTGATTGAATTTAAAAAT
TCCATGATACGAGAATTTGACGTCTTACTTGGGAAAAATGAGATTTTTTCTTGTAATCCAGTGAATAGTCCAATGGTTCCTGGCTTCAAATTGAATAAGAATGGAAATGG
GGTCACCGTTGATGATACACACCACAAGCAACTGATACATGGCGAAAGAGTTCTTCGATACTTAAAAGAAATTGTGAATTATGGAATCCATTACAGGAAAGGAGGAAAAG
GTCAATTGTTGGCATATACAGATAGTGACTATGCTGGAGATATGGAAGATCGAAAAAGTACATCAGGATATGTATTCTTAATGAGTTCAGGTGCTCCAATTGTGACTTTA
TCTACCACTGAAGCTGAATTTGTAGCTGCTGCAGTTTGTGCCTGTCAAGGAGTTTGGATGAAGAGAATTTTGAAGGAGATGGGTTTCTGTGATGAAGGTTGTACAACTAT
ACTATGTGACAATAGTTCAACTATCAAATTGTCTAAAAATCCCATTATGCATGGACGAAGCAAGCATATTGATGTGAGGTTTCATTTTCTAAGAAATCTTACTAAAGATG
GTGTAGTTGAATTGATTCATTGTGGAAGCCAAGAACAAGTTGCAGACATCATGACAAAACCATTAAAACTGGAAGTTTTTCAAAGGCCTCGAAGGTTGCTGGGAGTTCAT
GAGATTTTAGATAAAAACTAA
Protein sequenceShow/hide protein sequence
MIGGLTRDPFPKRSIWRARQRLEFIHADICGPTTATSNSNKSDFCKQSGIKRQLMAAYTPQQNGAVNWTIHVLNKCPTLAVQDVTPEEAWSGIKPSVDHFRIFGCIAHAR
VSDGRRTKLDDKSISCVLLGVIHLDLSDGDGANEEEDINASANEIDNDEDIRDEEVTEGRGFSEGEQSVRDVRQSRESHPPTWMGEYVSGEGVSTDPICFEDAVHNESWR
LAMDNEIKSIERNKTWTLIELPTAQKDWKIFQLDVKSAFLQGELNEDVYVEQPKDRIEAHFLNEGFQRCNSEQTLFTKGSMEGKIIIVTVYVDDLTFTGDGEAMLIEFKN
SMIREFDVLLGKNEIFSCNPVNSPMVPGFKLNKNGNGVTVDDTHHKQLIHGERVLRYLKEIVNYGIHYRKGGKGQLLAYTDSDYAGDMEDRKSTSGYVFLMSSGAPIVTL
STTEAEFVAAAVCACQGVWMKRILKEMGFCDEGCTTILCDNSSTIKLSKNPIMHGRSKHIDVRFHFLRNLTKDGVVELIHCGSQEQVADIMTKPLKLEVFQRPRRLLGVH
EILDKN