; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G01690 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G01690
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr5:2328459..2330406
RNA-Seq ExpressionCSPI05G01690
SyntenyCSPI05G01690
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003677 - DNA binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0061073.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]6.3e-17853.73Show/hide
Query:  MLQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVM
        MLQ  VIRP+ SPYSSPVLLVKKKDGGWRFCVDYRKLNQ T SDKFPIPVIEELLDEL+GA +FSKLDLKSGYHQIR+K  D+EKT F+THEGHY+F+VM
Subjt:  MLQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVM

Query:  PFGLTNAPAIFQLLMNQ-------------------------------------------------------------------GVE-DEEKIQDMVNWP
        PFGLTNAPA FQ LMNQ                                                                   GVE DE+KI+ MVNWP
Subjt:  PFGLTNAPAIFQLLMNQ-------------------------------------------------------------------GVE-DEEKIQDMVNWP

Query:  LPKDVTSLRGFLGLTRYYRRLVKGL-NIA-----------------------------ITILVLELPDWNLSFIVETGASGIGLEAVLSQKGHLIAFFTQ
         P DVT LRGFLGLT YYRR VKG  NIA                              TI VL LPDW+L F +ET ASG GL AVLSQ+GH IAF++Q
Subjt:  LPKDVTSLRGFLGLTRYYRRLVKGL-NIA-----------------------------ITILVLELPDWNLSFIVETGASGIGLEAVLSQKGHLIAFFTQ

Query:  KLSPRAQTKLVYERELMAVVLSVQKWTR-----------EEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQIEQLIELSTMTTT
        KLS RAQ K +YERELMAVVLSVQ+W             ++     + E  E+  + ++                G  NKAA A S++E  IEL  MTT+
Subjt:  KLSPRAQTKLVYERELMAVVLSVQKWTR-----------EEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQIEQLIELSTMTTT

Query:  GIVNMELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTYKRMSGELHWKGMKT-------
        GIV++ +V EEV +DE L+KI+ + K+  +   K+ W NGRL YK  +VL + SS+I  LLHTFH+SVLGGHS FL+TYKRMSGEL+W+GMK        
Subjt:  GIVNMELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTYKRMSGELHWKGMKT-------

Query:  -------------------------DKMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFIDRIISKHGIPKSIISDRDKIF
                                 DK+LE+W+MDFIEGLPKAGGMNVIMVV+D + KY+YFITLKHPF+AK+VA  FID+I+ +HGIP SIISDRDKIF
Subjt:  -------------------------DKMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFIDRIISKHGIPKSIISDRDKIF

Query:  ISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKWDKFIP
        +SNFW+ELF +M T+LKRS AFHPQT+GQ ERVNQ  ET LRCFCNEQPHKWD+FIP
Subjt:  ISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKWDKFIP

KAE8637561.1 hypothetical protein CSA_017659 [Cucumis sativus]2.7e-18153.96Show/hide
Query:  MLQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVM
        MLQA VIRP+HSPYSSPVLLVKKKDGGWRFCVDYRKLNQVT SDKFPIPVIEELLDELHGAT+FSKLD+KS YHQIR++  D+EKT F+THEGHY+F+VM
Subjt:  MLQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVM

Query:  PFGLTNAPAIFQLLMNQ-------------------------------------------------------------------GVE-DEEKIQDMVNWP
        PFGLTNAPA FQ LMNQ                                                                   GV+ DEEKI+DMV WP
Subjt:  PFGLTNAPAIFQLLMNQ-------------------------------------------------------------------GVE-DEEKIQDMVNWP

Query:  LPKDVTSLRGFLGLTRYYRRLVKG-----------------------------LNIAI-TILVLELPDWNLSFIVETGASGIGLEAVLSQKGHLIAFFTQ
         PKDVT LRGFLGL+ YYRR VKG                             L  A+ TI VL LP+W+L F++ET ASG GL AVLSQ GH IAFF+Q
Subjt:  LPKDVTSLRGFLGLTRYYRRLVKG-----------------------------LNIAI-TILVLELPDWNLSFIVETGASGIGLEAVLSQKGHLIAFFTQ

Query:  KLSPRAQTKLVYERELMAVVLSVQKWTR-----------EEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQIEQLIELSTMTTT
        KLS RAQ K +YERELM VVLSVQKW             ++     + E  E+  + ++                GLQNKAA A S++E  +E++++TT 
Subjt:  KLSPRAQTKLVYERELMAVVLSVQKWTR-----------EEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQIEQLIELSTMTTT

Query:  GIVNMELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTYKRMSGELHWKGMKT-------
        GIV+ME++++EV QDE+L+K I+E K+N    SK+ W NG+L YK  +VLSK+SS+I  LLHTFH+S+LGGHS FL+TYKRMSGEL+W+GMK        
Subjt:  GIVNMELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTYKRMSGELHWKGMKT-------

Query:  -------------------------DKMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFIDRIISKHGIPKSIISDRDKIF
                                 D +LEEWSMDFIEGLPKAGGMNVIMVV+D + KY+YFIT+KHPFTAK+VA  FI++I+SKHG+PKSI+SDRD++F
Subjt:  -------------------------DKMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFIDRIISKHGIPKSIISDRDKIF

Query:  ISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKWDKFI
        IS+FW ELF TM T LKRS AFHPQT+GQ ERVNQ  ET LRCFCNEQP KW +FI
Subjt:  ISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKWDKFI

KAE8637598.1 hypothetical protein CSA_022681 [Cucumis sativus]1.7e-18354.03Show/hide
Query:  MLQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVM
        MLQA VIRP+ SPYSSPVLLVKKKDGGWRFCVDYRKLNQVT +DKFPIPVIEELLDELHGAT FSKLDLKSGYHQIR++  D+EKT F THEGHY+F+VM
Subjt:  MLQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVM

Query:  PFGLTNAPAIFQLLMNQ-------------------------------------------------------------------GVE-DEEKIQDMVNWP
        PFGLTNAPA FQ LMN+                                                                   GVE DE+KI+ MVNWP
Subjt:  PFGLTNAPAIFQLLMNQ-------------------------------------------------------------------GVE-DEEKIQDMVNWP

Query:  LPKDVTSLRGFLGLTRYYRRLVKG-----------------------------LNIAITIL-VLELPDWNLSFIVETGASGIGLEAVLSQKGHLIAFFTQ
         PKD+T LRGFLGLT YYRR VK                              L +A+T L VL LPDW+  F +ET ASG+GL AVLSQ GH IAFF+Q
Subjt:  LPKDVTSLRGFLGLTRYYRRLVKG-----------------------------LNIAITIL-VLELPDWNLSFIVETGASGIGLEAVLSQKGHLIAFFTQ

Query:  KLSPRAQTKLVYERELMAVVLSVQKW-----------TREEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQIEQLIELSTMTTT
        KLSPRAQ K +YERELMAVVLSVQKW             ++     + E  E+  + ++                GLQNK A A S+ +  +EL+TMTTT
Subjt:  KLSPRAQTKLVYERELMAVVLSVQKW-----------TREEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQIEQLIELSTMTTT

Query:  GIVNMELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTYKRMSGELHWKGMKT-------
        GIV++E++E+EV+ D++L+KII E K   D+  KY+W NGRL YK  +VL ++SS+I +LLHTFH+S+LGGHS FL+TYKRMSGEL WKGMK        
Subjt:  GIVNMELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTYKRMSGELHWKGMKT-------

Query:  -------------------------DKMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFIDRIISKHGIPKSIISDRDKIF
                                 DK+LE+W+MDFIEGLP AGG NVIMVV+D + KYSYF+ LKHP+TAK+VA +F+++++SKHGIPKSII+DRDKIF
Subjt:  -------------------------DKMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFIDRIISKHGIPKSIISDRDKIF

Query:  ISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKWDKFIP
        +SNFWKELFTTM T+LKRS AFHPQT+GQ ERVN+  ET LRCFCNEQP KWDK IP
Subjt:  ISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKWDKFIP

KGN62557.2 hypothetical protein Csa_018739 [Cucumis sativus]1.2e-19256.47Show/hide
Query:  MLQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVM
        MLQ  +IRP+HSPYSSPVLLV+KKDGGWRFCVDYRKLNQVT SDKFPIPVIEELLDELHGAT+FSKLDLKSGYHQIR+K  D+EKT F+THEGHY+F+VM
Subjt:  MLQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVM

Query:  PFGLTNAPAIFQLLMN-------------------------------------------------------------------QGVE-DEEKIQDMVNWP
        PFGLTNAPA FQ LMN                                                                   +GVE D +KI+DMVNWP
Subjt:  PFGLTNAPAIFQLLMN-------------------------------------------------------------------QGVE-DEEKIQDMVNWP

Query:  LPKDVTSLRGFLGLTRYYRRLVKG-----------------------------LNIAITIL-VLELPDWNLSFIVETGASGIGLEAVLSQKGHLIAFFTQ
         PKDVT LRGFLGLT YYRR VKG                             L +A+T L VL LPDWNL FI+ET ASGI L AVLSQ GH IAFF+Q
Subjt:  LPKDVTSLRGFLGLTRYYRRLVKG-----------------------------LNIAITIL-VLELPDWNLSFIVETGASGIGLEAVLSQKGHLIAFFTQ

Query:  KLSPRAQTKLVYERELMAVVLSVQKWTR-----------EEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQIEQLIELSTMTTT
        KLS RA+TK +YERELMAVVLSVQKW             ++     + E  E+  + ++                GLQNKAA A S+IEQ +E+  M+TT
Subjt:  KLSPRAQTKLVYERELMAVVLSVQKWTR-----------EEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQIEQLIELSTMTTT

Query:  GIVNMELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTYKRMSGELHWKGMKTD------
        GIVNME+VE+EV+ DE+LK IIEE K+N DE SK++W NG LWYK  IVLSK S++I  LLHTFH+S+LGGHS FL+TYKRM GEL+WKGMK D      
Subjt:  GIVNMELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTYKRMSGELHWKGMKTD------

Query:  --------------------------KMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFIDRIISKHGIPKSIISDRDKIF
                                  ++LE+WSMDFIEGLPKAGGMNVIMV++D + KYSYFIT++HPF A++VAEVFIDR++S+HGIPKSIISDRDKIF
Subjt:  --------------------------KMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFIDRIISKHGIPKSIISDRDKIF

Query:  ISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKWDKFIP
        ISNFWKE+F +M T+LKRS AFHPQT+GQ ERVN+  ET LRCFCNEQP KW+KFIP
Subjt:  ISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKWDKFIP

TYK28944.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.8e-17752.51Show/hide
Query:  MLQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVM
        MLQ  +IRP+HSP+SSPVLLVKKKDGGWRFCVDYRKLN++T +DKFPIPVIEELLDELHGAT+FSKLDLKSGYHQIR++  DIEKT F+THEGHY+F+VM
Subjt:  MLQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVM

Query:  PFGLTNAPAIFQLLMNQ-------------------------------------------------------------------GVE-DEEKIQDMVNWP
        PFGLTNAPA FQ LMNQ                                                                   GVE D++K++ M+ WP
Subjt:  PFGLTNAPAIFQLLMNQ-------------------------------------------------------------------GVE-DEEKIQDMVNWP

Query:  LPKDVTSLRGFLGLTRYYRRLVKGL------------------------------NIAITILVLELPDWNLSFIVETGASGIGLEAVLSQKGHLIAFFTQ
         PKDVT LRGFLGLT YYRR VKG                               +   TI VL LPDW+L F++ET ASG GL AVLSQ  H IAFF+Q
Subjt:  LPKDVTSLRGFLGLTRYYRRLVKGL------------------------------NIAITILVLELPDWNLSFIVETGASGIGLEAVLSQKGHLIAFFTQ

Query:  KLSPRAQTKLVYERELMAVVLSVQKWTR-----------EEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQIEQLIELSTMTTT
        KLS RAQ K +YERELMAVVLSVQKW             ++     + E  E+  + ++                GLQNKAA A S+++  IEL  ++TT
Subjt:  KLSPRAQTKLVYERELMAVVLSVQKWTR-----------EEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQIEQLIELSTMTTT

Query:  GIVNMELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTYKRMSGELHWKGMK--------
        GIV+ME+V +EV++DE+L+ +I++ + N     KY   NG L YK  +VLSK SS+I +LLHTFH+S+LGGHS FL+TYKRMSGEL WKGMK        
Subjt:  GIVNMELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTYKRMSGELHWKGMK--------

Query:  ------------------------TDKMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFIDRIISKHGIPKSIISDRDKIF
                                 D++LE+W+MDFIEGLPKAGGMNVIMVV+D + KY+YF+T+KHPF+AK+VA  FID+I+ +HGIPKSIISDRDKIF
Subjt:  ------------------------TDKMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFIDRIISKHGIPKSIISDRDKIF

Query:  ISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKWDKFIP
        +SNFWKELF  M+T+LKRS AFHPQT+GQ ERVNQ  ET LRCFCNEQP+KW +FIP
Subjt:  ISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKWDKFIP

TrEMBL top hitse value%identityAlignment
A0A5A7UYM1 Ty3/gypsy retrotransposon protein3.0e-17853.73Show/hide
Query:  MLQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVM
        MLQ  VIRP+ SPYSSPVLLVKKKDGGWRFCVDYRKLNQ T SDKFPIPVIEELLDEL+GA +FSKLDLKSGYHQIR+K  D+EKT F+THEGHY+F+VM
Subjt:  MLQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVM

Query:  PFGLTNAPAIFQLLMNQ-------------------------------------------------------------------GVE-DEEKIQDMVNWP
        PFGLTNAPA FQ LMNQ                                                                   GVE DE+KI+ MVNWP
Subjt:  PFGLTNAPAIFQLLMNQ-------------------------------------------------------------------GVE-DEEKIQDMVNWP

Query:  LPKDVTSLRGFLGLTRYYRRLVKGL-NIA-----------------------------ITILVLELPDWNLSFIVETGASGIGLEAVLSQKGHLIAFFTQ
         P DVT LRGFLGLT YYRR VKG  NIA                              TI VL LPDW+L F +ET ASG GL AVLSQ+GH IAF++Q
Subjt:  LPKDVTSLRGFLGLTRYYRRLVKGL-NIA-----------------------------ITILVLELPDWNLSFIVETGASGIGLEAVLSQKGHLIAFFTQ

Query:  KLSPRAQTKLVYERELMAVVLSVQKWTR-----------EEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQIEQLIELSTMTTT
        KLS RAQ K +YERELMAVVLSVQ+W             ++     + E  E+  + ++                G  NKAA A S++E  IEL  MTT+
Subjt:  KLSPRAQTKLVYERELMAVVLSVQKWTR-----------EEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQIEQLIELSTMTTT

Query:  GIVNMELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTYKRMSGELHWKGMKT-------
        GIV++ +V EEV +DE L+KI+ + K+  +   K+ W NGRL YK  +VL + SS+I  LLHTFH+SVLGGHS FL+TYKRMSGEL+W+GMK        
Subjt:  GIVNMELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTYKRMSGELHWKGMKT-------

Query:  -------------------------DKMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFIDRIISKHGIPKSIISDRDKIF
                                 DK+LE+W+MDFIEGLPKAGGMNVIMVV+D + KY+YFITLKHPF+AK+VA  FID+I+ +HGIP SIISDRDKIF
Subjt:  -------------------------DKMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFIDRIISKHGIPKSIISDRDKIF

Query:  ISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKWDKFIP
        +SNFW+ELF +M T+LKRS AFHPQT+GQ ERVNQ  ET LRCFCNEQPHKWD+FIP
Subjt:  ISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKWDKFIP

A0A5D3BBH7 Ty3/gypsy retrotransposon protein8.9e-17852.51Show/hide
Query:  MLQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVM
        MLQ  +IRP+HSP+SSPVLLVKKKDGGWRFCVDYRKLN++T +DKFPIPVIEELLDELHGAT+FSKLDLKSGYHQIR++  DIEKT F+THEGHY+F+VM
Subjt:  MLQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVM

Query:  PFGLTNAPAIFQLLMNQ-------------------------------------------------------------------GVE-DEEKIQDMVNWP
        PFGLTNAPA FQ LMNQ                                                                   GVE D++K++ M+ WP
Subjt:  PFGLTNAPAIFQLLMNQ-------------------------------------------------------------------GVE-DEEKIQDMVNWP

Query:  LPKDVTSLRGFLGLTRYYRRLVKGL------------------------------NIAITILVLELPDWNLSFIVETGASGIGLEAVLSQKGHLIAFFTQ
         PKDVT LRGFLGLT YYRR VKG                               +   TI VL LPDW+L F++ET ASG GL AVLSQ  H IAFF+Q
Subjt:  LPKDVTSLRGFLGLTRYYRRLVKGL------------------------------NIAITILVLELPDWNLSFIVETGASGIGLEAVLSQKGHLIAFFTQ

Query:  KLSPRAQTKLVYERELMAVVLSVQKWTR-----------EEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQIEQLIELSTMTTT
        KLS RAQ K +YERELMAVVLSVQKW             ++     + E  E+  + ++                GLQNKAA A S+++  IEL  ++TT
Subjt:  KLSPRAQTKLVYERELMAVVLSVQKWTR-----------EEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQIEQLIELSTMTTT

Query:  GIVNMELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTYKRMSGELHWKGMK--------
        GIV+ME+V +EV++DE+L+ +I++ + N     KY   NG L YK  +VLSK SS+I +LLHTFH+S+LGGHS FL+TYKRMSGEL WKGMK        
Subjt:  GIVNMELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTYKRMSGELHWKGMK--------

Query:  ------------------------TDKMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFIDRIISKHGIPKSIISDRDKIF
                                 D++LE+W+MDFIEGLPKAGGMNVIMVV+D + KY+YF+T+KHPF+AK+VA  FID+I+ +HGIPKSIISDRDKIF
Subjt:  ------------------------TDKMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFIDRIISKHGIPKSIISDRDKIF

Query:  ISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKWDKFIP
        +SNFWKELF  M+T+LKRS AFHPQT+GQ ERVNQ  ET LRCFCNEQP+KW +FIP
Subjt:  ISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKWDKFIP

A0A5D3DWA9 Ty3/gypsy retrotransposon protein8.9e-17852.51Show/hide
Query:  MLQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVM
        MLQ  +IRP+HSP+SSPVLLVKKKDGGWRFCVDYRKLN++T +DKFPIPVIEELLDELHGAT+FSKLDLKSGYHQIR++  DIEKT F+THEGHY+F+VM
Subjt:  MLQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVM

Query:  PFGLTNAPAIFQLLMNQ-------------------------------------------------------------------GVE-DEEKIQDMVNWP
        PFGLTNAPA FQ LMNQ                                                                   GVE D++K++ M+ WP
Subjt:  PFGLTNAPAIFQLLMNQ-------------------------------------------------------------------GVE-DEEKIQDMVNWP

Query:  LPKDVTSLRGFLGLTRYYRRLVKGL------------------------------NIAITILVLELPDWNLSFIVETGASGIGLEAVLSQKGHLIAFFTQ
         PKDVT LRGFLGLT YYRR VKG                               +   TI VL LPDW+L F++ET ASG GL AVLSQ  H IAFF+Q
Subjt:  LPKDVTSLRGFLGLTRYYRRLVKGL------------------------------NIAITILVLELPDWNLSFIVETGASGIGLEAVLSQKGHLIAFFTQ

Query:  KLSPRAQTKLVYERELMAVVLSVQKWTR-----------EEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQIEQLIELSTMTTT
        KLS RAQ K +YERELMAVVLSVQKW             ++     + E  E+  + ++                GLQNKAA A S+++  IEL  ++TT
Subjt:  KLSPRAQTKLVYERELMAVVLSVQKWTR-----------EEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQIEQLIELSTMTTT

Query:  GIVNMELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTYKRMSGELHWKGMK--------
        GIV+ME+V +EV++DE+L+ +I++ + N     KY   NG L YK  +VLSK SS+I +LLHTFH+S+LGGHS FL+TYKRMSGEL WKGMK        
Subjt:  GIVNMELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTYKRMSGELHWKGMK--------

Query:  ------------------------TDKMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFIDRIISKHGIPKSIISDRDKIF
                                 D++LE+W+MDFIEGLPKAGGMNVIMVV+D + KY+YF+T+KHPF+AK+VA  FID+I+ +HGIPKSIISDRDKIF
Subjt:  ------------------------TDKMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFIDRIISKHGIPKSIISDRDKIF

Query:  ISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKWDKFIP
        +SNFWKELF  M+T+LKRS AFHPQT+GQ ERVNQ  ET LRCFCNEQP+KW +FIP
Subjt:  ISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKWDKFIP

A0A5D3DZK6 Ty3/gypsy retrotransposon protein8.9e-17852.51Show/hide
Query:  MLQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVM
        MLQ  +IRP+HSP+SSPVLLVKKKDGGWRFCVDYRKLN++T +DKFPIPVIEELLDELHGAT+FSKLDLKSGYHQIR++  DIEKT F+THEGHY+F+VM
Subjt:  MLQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVM

Query:  PFGLTNAPAIFQLLMNQ-------------------------------------------------------------------GVE-DEEKIQDMVNWP
        PFGLTNAPA FQ LMNQ                                                                   GVE D++K++ M+ WP
Subjt:  PFGLTNAPAIFQLLMNQ-------------------------------------------------------------------GVE-DEEKIQDMVNWP

Query:  LPKDVTSLRGFLGLTRYYRRLVKGL------------------------------NIAITILVLELPDWNLSFIVETGASGIGLEAVLSQKGHLIAFFTQ
         PKDVT LRGFLGLT YYRR VKG                               +   TI VL LPDW+L F++ET ASG GL AVLSQ  H IAFF+Q
Subjt:  LPKDVTSLRGFLGLTRYYRRLVKGL------------------------------NIAITILVLELPDWNLSFIVETGASGIGLEAVLSQKGHLIAFFTQ

Query:  KLSPRAQTKLVYERELMAVVLSVQKWTR-----------EEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQIEQLIELSTMTTT
        KLS RAQ K +YERELMAVVLSVQKW             ++     + E  E+  + ++                GLQNKAA A S+++  IEL  ++TT
Subjt:  KLSPRAQTKLVYERELMAVVLSVQKWTR-----------EEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQIEQLIELSTMTTT

Query:  GIVNMELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTYKRMSGELHWKGMK--------
        GIV+ME+V +EV++DE+L+ +I++ + N     KY   NG L YK  +VLSK SS+I +LLHTFH+S+LGGHS FL+TYKRMSGEL WKGMK        
Subjt:  GIVNMELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTYKRMSGELHWKGMK--------

Query:  ------------------------TDKMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFIDRIISKHGIPKSIISDRDKIF
                                 D++LE+W+MDFIEGLPKAGGMNVIMVV+D + KY+YF+T+KHPF+AK+VA  FID+I+ +HGIPKSIISDRDKIF
Subjt:  ------------------------TDKMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFIDRIISKHGIPKSIISDRDKIF

Query:  ISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKWDKFIP
        +SNFWKELF  M+T+LKRS AFHPQT+GQ ERVNQ  ET LRCFCNEQP+KW +FIP
Subjt:  ISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKWDKFIP

A0A5D3E325 Ty3/gypsy retrotransposon protein8.9e-17852.51Show/hide
Query:  MLQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVM
        MLQ  +IRP+HSP+SSPVLLVKKKDGGWRFCVDYRKLN++T +DKFPIPVIEELLDELHGAT+FSKLDLKSGYHQIR++  DIEKT F+THEGHY+F+VM
Subjt:  MLQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVM

Query:  PFGLTNAPAIFQLLMNQ-------------------------------------------------------------------GVE-DEEKIQDMVNWP
        PFGLTNAPA FQ LMNQ                                                                   GVE D++K++ M+ WP
Subjt:  PFGLTNAPAIFQLLMNQ-------------------------------------------------------------------GVE-DEEKIQDMVNWP

Query:  LPKDVTSLRGFLGLTRYYRRLVKGL------------------------------NIAITILVLELPDWNLSFIVETGASGIGLEAVLSQKGHLIAFFTQ
         PKDVT LRGFLGLT YYRR VKG                               +   TI VL LPDW+L F++ET ASG GL AVLSQ  H IAFF+Q
Subjt:  LPKDVTSLRGFLGLTRYYRRLVKGL------------------------------NIAITILVLELPDWNLSFIVETGASGIGLEAVLSQKGHLIAFFTQ

Query:  KLSPRAQTKLVYERELMAVVLSVQKWTR-----------EEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQIEQLIELSTMTTT
        KLS RAQ K +YERELMAVVLSVQKW             ++     + E  E+  + ++                GLQNKAA A S+++  IEL  ++TT
Subjt:  KLSPRAQTKLVYERELMAVVLSVQKWTR-----------EEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQIEQLIELSTMTTT

Query:  GIVNMELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTYKRMSGELHWKGMK--------
        GIV+ME+V +EV++DE+L+ +I++ + N     KY   NG L YK  +VLSK SS+I +LLHTFH+S+LGGHS FL+TYKRMSGEL WKGMK        
Subjt:  GIVNMELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTYKRMSGELHWKGMK--------

Query:  ------------------------TDKMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFIDRIISKHGIPKSIISDRDKIF
                                 D++LE+W+MDFIEGLPKAGGMNVIMVV+D + KY+YF+T+KHPF+AK+VA  FID+I+ +HGIPKSIISDRDKIF
Subjt:  ------------------------TDKMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFIDRIISKHGIPKSIISDRDKIF

Query:  ISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKWDKFIP
        +SNFWKELF  M+T+LKRS AFHPQT+GQ ERVNQ  ET LRCFCNEQP+KW +FIP
Subjt:  ISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKWDKFIP

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein4.9e-4824.96Show/hide
Query:  LQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVMP
        L++ +IR + +  + PV+ V KK+G  R  VDY+ LN+    + +P+P+IE+LL ++ G+TIF+KLDLKS YH IRV+ GD  K  F+   G ++++VMP
Subjt:  LQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVMP

Query:  FGLTNAPAIFQLLMN-----------------------------QGVED---------------------------------------EEKIQDMVNWPL
        +G++ APA FQ  +N                             + V+D                                       +E I  ++ W  
Subjt:  FGLTNAPAIFQLLMN-----------------------------QGVED---------------------------------------EEKIQDMVNWPL

Query:  PKDVTSLRGFLGLTRYYRR-------LVKGLN------------------------IAITILVLELPDWNLSFIVETGASGIGLEAVLSQKG-----HLI
        PK+   LR FLG   Y R+       L   LN                          ++  VL   D++   ++ET AS + + AVLSQK      + +
Subjt:  PKDVTSLRGFLGLTRYYRR-------LVKGLN------------------------IAITILVLELPDWNLSFIVETGASGIGLEAVLSQKG-----HLI

Query:  AFFTQKLSPRAQTKLVYERELMAVVLSVQKW---------------TREEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQI---
         +++ K+S       V ++E++A++ S++ W                   +   I +ES     R  R                G  N  A A S+I   
Subjt:  AFFTQKLSPRAQTKLVYERELMAVVLSVQKW---------------TREEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQI---

Query:  -EQLIELSTMTTTGIVNM--------ELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTY
         E + + S   +   VN           V  E   D  L  ++    +  +E  + + G   +  K+ I+L   + +   ++  +H      H       
Subjt:  -EQLIELSTMTTTGIVNM--------ELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTY

Query:  KRMSGELHWKGMK--------------------------------TDKMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFI
          +     WKG++                                +++  E  SMDFI  LP++ G N + VV+D   K +  +      TA++ A +F 
Subjt:  KRMSGELHWKGMK--------------------------------TDKMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFI

Query:  DRIISKHGIPKSIISDRDKIFISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKW
         R+I+  G PK II+D D IF S  WK+     + V+K S+ + PQT+GQ ER NQ  E  LRC C+  P+ W
Subjt:  DRIISKHGIPKSIISDRDKIFISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKW

P0CT35 Transposon Tf2-2 polyprotein4.9e-4824.96Show/hide
Query:  LQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVMP
        L++ +IR + +  + PV+ V KK+G  R  VDY+ LN+    + +P+P+IE+LL ++ G+TIF+KLDLKS YH IRV+ GD  K  F+   G ++++VMP
Subjt:  LQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVMP

Query:  FGLTNAPAIFQLLMN-----------------------------QGVED---------------------------------------EEKIQDMVNWPL
        +G++ APA FQ  +N                             + V+D                                       +E I  ++ W  
Subjt:  FGLTNAPAIFQLLMN-----------------------------QGVED---------------------------------------EEKIQDMVNWPL

Query:  PKDVTSLRGFLGLTRYYRR-------LVKGLN------------------------IAITILVLELPDWNLSFIVETGASGIGLEAVLSQKG-----HLI
        PK+   LR FLG   Y R+       L   LN                          ++  VL   D++   ++ET AS + + AVLSQK      + +
Subjt:  PKDVTSLRGFLGLTRYYRR-------LVKGLN------------------------IAITILVLELPDWNLSFIVETGASGIGLEAVLSQKG-----HLI

Query:  AFFTQKLSPRAQTKLVYERELMAVVLSVQKW---------------TREEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQI---
         +++ K+S       V ++E++A++ S++ W                   +   I +ES     R  R                G  N  A A S+I   
Subjt:  AFFTQKLSPRAQTKLVYERELMAVVLSVQKW---------------TREEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQI---

Query:  -EQLIELSTMTTTGIVNM--------ELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTY
         E + + S   +   VN           V  E   D  L  ++    +  +E  + + G   +  K+ I+L   + +   ++  +H      H       
Subjt:  -EQLIELSTMTTTGIVNM--------ELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTY

Query:  KRMSGELHWKGMK--------------------------------TDKMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFI
          +     WKG++                                +++  E  SMDFI  LP++ G N + VV+D   K +  +      TA++ A +F 
Subjt:  KRMSGELHWKGMK--------------------------------TDKMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFI

Query:  DRIISKHGIPKSIISDRDKIFISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKW
         R+I+  G PK II+D D IF S  WK+     + V+K S+ + PQT+GQ ER NQ  E  LRC C+  P+ W
Subjt:  DRIISKHGIPKSIISDRDKIFISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKW

P0CT36 Transposon Tf2-3 polyprotein4.9e-4824.96Show/hide
Query:  LQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVMP
        L++ +IR + +  + PV+ V KK+G  R  VDY+ LN+    + +P+P+IE+LL ++ G+TIF+KLDLKS YH IRV+ GD  K  F+   G ++++VMP
Subjt:  LQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVMP

Query:  FGLTNAPAIFQLLMN-----------------------------QGVED---------------------------------------EEKIQDMVNWPL
        +G++ APA FQ  +N                             + V+D                                       +E I  ++ W  
Subjt:  FGLTNAPAIFQLLMN-----------------------------QGVED---------------------------------------EEKIQDMVNWPL

Query:  PKDVTSLRGFLGLTRYYRR-------LVKGLN------------------------IAITILVLELPDWNLSFIVETGASGIGLEAVLSQKG-----HLI
        PK+   LR FLG   Y R+       L   LN                          ++  VL   D++   ++ET AS + + AVLSQK      + +
Subjt:  PKDVTSLRGFLGLTRYYRR-------LVKGLN------------------------IAITILVLELPDWNLSFIVETGASGIGLEAVLSQKG-----HLI

Query:  AFFTQKLSPRAQTKLVYERELMAVVLSVQKW---------------TREEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQI---
         +++ K+S       V ++E++A++ S++ W                   +   I +ES     R  R                G  N  A A S+I   
Subjt:  AFFTQKLSPRAQTKLVYERELMAVVLSVQKW---------------TREEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQI---

Query:  -EQLIELSTMTTTGIVNM--------ELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTY
         E + + S   +   VN           V  E   D  L  ++    +  +E  + + G   +  K+ I+L   + +   ++  +H      H       
Subjt:  -EQLIELSTMTTTGIVNM--------ELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTY

Query:  KRMSGELHWKGMK--------------------------------TDKMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFI
          +     WKG++                                +++  E  SMDFI  LP++ G N + VV+D   K +  +      TA++ A +F 
Subjt:  KRMSGELHWKGMK--------------------------------TDKMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFI

Query:  DRIISKHGIPKSIISDRDKIFISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKW
         R+I+  G PK II+D D IF S  WK+     + V+K S+ + PQT+GQ ER NQ  E  LRC C+  P+ W
Subjt:  DRIISKHGIPKSIISDRDKIFISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKW

P0CT37 Transposon Tf2-4 polyprotein4.9e-4824.96Show/hide
Query:  LQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVMP
        L++ +IR + +  + PV+ V KK+G  R  VDY+ LN+    + +P+P+IE+LL ++ G+TIF+KLDLKS YH IRV+ GD  K  F+   G ++++VMP
Subjt:  LQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVMP

Query:  FGLTNAPAIFQLLMN-----------------------------QGVED---------------------------------------EEKIQDMVNWPL
        +G++ APA FQ  +N                             + V+D                                       +E I  ++ W  
Subjt:  FGLTNAPAIFQLLMN-----------------------------QGVED---------------------------------------EEKIQDMVNWPL

Query:  PKDVTSLRGFLGLTRYYRR-------LVKGLN------------------------IAITILVLELPDWNLSFIVETGASGIGLEAVLSQKG-----HLI
        PK+   LR FLG   Y R+       L   LN                          ++  VL   D++   ++ET AS + + AVLSQK      + +
Subjt:  PKDVTSLRGFLGLTRYYRR-------LVKGLN------------------------IAITILVLELPDWNLSFIVETGASGIGLEAVLSQKG-----HLI

Query:  AFFTQKLSPRAQTKLVYERELMAVVLSVQKW---------------TREEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQI---
         +++ K+S       V ++E++A++ S++ W                   +   I +ES     R  R                G  N  A A S+I   
Subjt:  AFFTQKLSPRAQTKLVYERELMAVVLSVQKW---------------TREEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQI---

Query:  -EQLIELSTMTTTGIVNM--------ELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTY
         E + + S   +   VN           V  E   D  L  ++    +  +E  + + G   +  K+ I+L   + +   ++  +H      H       
Subjt:  -EQLIELSTMTTTGIVNM--------ELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTY

Query:  KRMSGELHWKGMK--------------------------------TDKMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFI
          +     WKG++                                +++  E  SMDFI  LP++ G N + VV+D   K +  +      TA++ A +F 
Subjt:  KRMSGELHWKGMK--------------------------------TDKMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFI

Query:  DRIISKHGIPKSIISDRDKIFISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKW
         R+I+  G PK II+D D IF S  WK+     + V+K S+ + PQT+GQ ER NQ  E  LRC C+  P+ W
Subjt:  DRIISKHGIPKSIISDRDKIFISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKW

P0CT41 Transposon Tf2-12 polyprotein4.9e-4824.96Show/hide
Query:  LQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVMP
        L++ +IR + +  + PV+ V KK+G  R  VDY+ LN+    + +P+P+IE+LL ++ G+TIF+KLDLKS YH IRV+ GD  K  F+   G ++++VMP
Subjt:  LQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVMP

Query:  FGLTNAPAIFQLLMN-----------------------------QGVED---------------------------------------EEKIQDMVNWPL
        +G++ APA FQ  +N                             + V+D                                       +E I  ++ W  
Subjt:  FGLTNAPAIFQLLMN-----------------------------QGVED---------------------------------------EEKIQDMVNWPL

Query:  PKDVTSLRGFLGLTRYYRR-------LVKGLN------------------------IAITILVLELPDWNLSFIVETGASGIGLEAVLSQKG-----HLI
        PK+   LR FLG   Y R+       L   LN                          ++  VL   D++   ++ET AS + + AVLSQK      + +
Subjt:  PKDVTSLRGFLGLTRYYRR-------LVKGLN------------------------IAITILVLELPDWNLSFIVETGASGIGLEAVLSQKG-----HLI

Query:  AFFTQKLSPRAQTKLVYERELMAVVLSVQKW---------------TREEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQI---
         +++ K+S       V ++E++A++ S++ W                   +   I +ES     R  R                G  N  A A S+I   
Subjt:  AFFTQKLSPRAQTKLVYERELMAVVLSVQKW---------------TREEIHNPIISESAEIPSRTER----------------GLQNKAAGAQSQI---

Query:  -EQLIELSTMTTTGIVNM--------ELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTY
         E + + S   +   VN           V  E   D  L  ++    +  +E  + + G   +  K+ I+L   + +   ++  +H      H       
Subjt:  -EQLIELSTMTTTGIVNM--------ELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSSMISNLLHTFHNSVLGGHSRFLKTY

Query:  KRMSGELHWKGMK--------------------------------TDKMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFI
          +     WKG++                                +++  E  SMDFI  LP++ G N + VV+D   K +  +      TA++ A +F 
Subjt:  KRMSGELHWKGMK--------------------------------TDKMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFI

Query:  DRIISKHGIPKSIISDRDKIFISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKW
         R+I+  G PK II+D D IF S  WK+     + V+K S+ + PQT+GQ ER NQ  E  LRC C+  P+ W
Subjt:  DRIISKHGIPKSIISDRDKIFISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKW

Arabidopsis top hitse value%identityAlignment
ATMG00850.1 DNA/RNA polymerases superfamily protein9.5e-0778.57Show/hide
Query:  MLQARVIRPNHSPYSSPVLLVKKKDGGW
        ML+AR+I+P+ SPYSSPVLLV+KKDGGW
Subjt:  MLQARVIRPNHSPYSSPVLLVKKKDGGW

ATMG00860.1 DNA/RNA polymerases superfamily protein1.5e-0737.37Show/hide
Query:  LLMNQGVE-DEEKIQDMVNWPLPKDVTSLRGFLGLTRYYRRLV-----------------------------KGLNIAITIL-VLELPDWNLSFIVETG
        ++  +GV  D  K++ MV WP PK+ T LRGFLGLT YYRR V                             K L  A+T L VL LPD  L F+   G
Subjt:  LLMNQGVE-DEEKIQDMVNWPLPKDVTSLRGFLGLTRYYRRLV-----------------------------KGLNIAITIL-VLELPDWNLSFIVETG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCCAAGCAAGAGTGATAAGACCCAACCATAGCCCTTATTCCAGCCCAGTCTTATTAGTGAAGAAAAAGGATGGAGGGTGGAGATTTTGTGTTGATTACCGAAAGCT
AAACCAGGTGACTACCTCTGACAAATTCCCAATACCGGTGATAGAAGAACTATTAGATGAGTTGCACGGAGCCACAATATTCTCAAAGCTGGACTTGAAGTCAGGTTATC
ACCAAATAAGGGTGAAGGTGGGAGATATTGAGAAGACAACATTCAAAACTCACGAAGGCCATTATAAATTCATAGTTATGCCCTTCGGCCTCACAAACGCACCTGCCATC
TTCCAGTTATTAATGAACCAGGGTGTAGAAGATGAAGAGAAAATCCAAGATATGGTGAACTGGCCACTACCAAAGGATGTCACCAGCTTGAGGGGATTCTTGGGTTTAAC
CAGATACTACCGAAGATTAGTGAAAGGATTGAACATAGCCATAACTATACTAGTGTTAGAATTGCCTGACTGGAACTTGTCTTTTATAGTAGAAACAGGCGCGTCTGGAA
TAGGATTAGAGGCTGTGTTATCTCAGAAAGGTCACCTCATTGCCTTCTTCACTCAAAAACTATCCCCTAGAGCACAAACCAAATTAGTATATGAGAGGGAACTTATGGCT
GTGGTGCTCTCGGTGCAGAAATGGACTAGGGAAGAAATTCACAATCCTATCATATCAGAAAGCGCTGAAATTCCTTCTAGAACAGAGAGAGGACTACAAAACAAGGCTGC
TGGTGCCCAGTCTCAGATAGAGCAACTCATAGAACTTAGTACCATGACTACCACTGGAATTGTAAACATGGAGTTAGTTGAAGAAGAGGTACAGCAAGATGAAGATCTTA
AGAAGATCATAGAGGAGAGGAAAAGGAACACAGATGAGACAAGCAAATATCGTTGGGGGAATGGGAGATTATGGTATAAAAACATAATAGTATTGTCGAAACACTCATCA
ATGATATCGAACCTGCTGCATACATTTCATAACTCAGTTCTAGGAGGCCACTCCAGATTTCTAAAGACATATAAGAGGATGAGTGGAGAACTACATTGGAAAGGGATGAA
AACCGACAAAATGCTTGAAGAATGGTCCATGGATTTCATTGAAGGGTTGCCCAAAGCTGGAGGGATGAATGTAATCATGGTGGTAATCGATAGTGTAGGCAAGTATTCGT
ATTTCATCACCCTTAAACATCCATTCACAGCCAAAAGAGTAGCGGAAGTCTTCATTGACAGAATAATCAGCAAGCATGGCATACCAAAATCAATCATCTCAGACAGAGAT
AAGATCTTCATCAGCAACTTTTGGAAGGAATTATTCACTACTATGGATACTGTCTTGAAGAGAAGTATGGCATTTCATCCCCAAACCAATGGGCAGATCGAAAGAGTGAA
TCAGTATTCGGAAACAGATTTGAGATGTTTCTGCAATGAACAACCCCATAAGTGGGATAAGTTCATTCCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTCCAAGCAAGAGTGATAAGACCCAACCATAGCCCTTATTCCAGCCCAGTCTTATTAGTGAAGAAAAAGGATGGAGGGTGGAGATTTTGTGTTGATTACCGAAAGCT
AAACCAGGTGACTACCTCTGACAAATTCCCAATACCGGTGATAGAAGAACTATTAGATGAGTTGCACGGAGCCACAATATTCTCAAAGCTGGACTTGAAGTCAGGTTATC
ACCAAATAAGGGTGAAGGTGGGAGATATTGAGAAGACAACATTCAAAACTCACGAAGGCCATTATAAATTCATAGTTATGCCCTTCGGCCTCACAAACGCACCTGCCATC
TTCCAGTTATTAATGAACCAGGGTGTAGAAGATGAAGAGAAAATCCAAGATATGGTGAACTGGCCACTACCAAAGGATGTCACCAGCTTGAGGGGATTCTTGGGTTTAAC
CAGATACTACCGAAGATTAGTGAAAGGATTGAACATAGCCATAACTATACTAGTGTTAGAATTGCCTGACTGGAACTTGTCTTTTATAGTAGAAACAGGCGCGTCTGGAA
TAGGATTAGAGGCTGTGTTATCTCAGAAAGGTCACCTCATTGCCTTCTTCACTCAAAAACTATCCCCTAGAGCACAAACCAAATTAGTATATGAGAGGGAACTTATGGCT
GTGGTGCTCTCGGTGCAGAAATGGACTAGGGAAGAAATTCACAATCCTATCATATCAGAAAGCGCTGAAATTCCTTCTAGAACAGAGAGAGGACTACAAAACAAGGCTGC
TGGTGCCCAGTCTCAGATAGAGCAACTCATAGAACTTAGTACCATGACTACCACTGGAATTGTAAACATGGAGTTAGTTGAAGAAGAGGTACAGCAAGATGAAGATCTTA
AGAAGATCATAGAGGAGAGGAAAAGGAACACAGATGAGACAAGCAAATATCGTTGGGGGAATGGGAGATTATGGTATAAAAACATAATAGTATTGTCGAAACACTCATCA
ATGATATCGAACCTGCTGCATACATTTCATAACTCAGTTCTAGGAGGCCACTCCAGATTTCTAAAGACATATAAGAGGATGAGTGGAGAACTACATTGGAAAGGGATGAA
AACCGACAAAATGCTTGAAGAATGGTCCATGGATTTCATTGAAGGGTTGCCCAAAGCTGGAGGGATGAATGTAATCATGGTGGTAATCGATAGTGTAGGCAAGTATTCGT
ATTTCATCACCCTTAAACATCCATTCACAGCCAAAAGAGTAGCGGAAGTCTTCATTGACAGAATAATCAGCAAGCATGGCATACCAAAATCAATCATCTCAGACAGAGAT
AAGATCTTCATCAGCAACTTTTGGAAGGAATTATTCACTACTATGGATACTGTCTTGAAGAGAAGTATGGCATTTCATCCCCAAACCAATGGGCAGATCGAAAGAGTGAA
TCAGTATTCGGAAACAGATTTGAGATGTTTCTGCAATGAACAACCCCATAAGTGGGATAAGTTCATTCCTTAG
Protein sequenceShow/hide protein sequence
MLQARVIRPNHSPYSSPVLLVKKKDGGWRFCVDYRKLNQVTTSDKFPIPVIEELLDELHGATIFSKLDLKSGYHQIRVKVGDIEKTTFKTHEGHYKFIVMPFGLTNAPAI
FQLLMNQGVEDEEKIQDMVNWPLPKDVTSLRGFLGLTRYYRRLVKGLNIAITILVLELPDWNLSFIVETGASGIGLEAVLSQKGHLIAFFTQKLSPRAQTKLVYERELMA
VVLSVQKWTREEIHNPIISESAEIPSRTERGLQNKAAGAQSQIEQLIELSTMTTTGIVNMELVEEEVQQDEDLKKIIEERKRNTDETSKYRWGNGRLWYKNIIVLSKHSS
MISNLLHTFHNSVLGGHSRFLKTYKRMSGELHWKGMKTDKMLEEWSMDFIEGLPKAGGMNVIMVVIDSVGKYSYFITLKHPFTAKRVAEVFIDRIISKHGIPKSIISDRD
KIFISNFWKELFTTMDTVLKRSMAFHPQTNGQIERVNQYSETDLRCFCNEQPHKWDKFIP