; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G15150 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G15150
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr5:15921895..15923660
RNA-Seq ExpressionCSPI05G15150
SyntenyCSPI05G15150
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0030154 - cell differentiation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0056659.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.5e-20666.37Show/hide
Query:  MLQAGVIRPSCSPYSSPVLLVKKNDRGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVM
        MLQ G+IRPS SP+SSPVLLVKK D GWRFCVDYRKLN +T+ADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRM+EED+EKTAFRTHEGHYEFLVM
Subjt:  MLQAGVIRPSCSPYSSPVLLVKKNDRGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVM

Query:  PFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG----------------------
        PFGLTNAPATFQSLMN+VFKPFLRRCVLVFF DILVYS DI EH KHLGMVFA LRD++L+ANR KCV AHSQ+ YLG                      
Subjt:  PFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG----------------------

Query:  -------------KNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVVLSVQ
                     KNAF W+E AT+AF+ LK AM+T+PVLALPDWS PF IETDASG GLGAVLSQ+ HPIAFFSQKLS RAQ KSIYERELMAVVLSVQ
Subjt:  -------------KNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVVLSVQ

Query:  KWRHYLLGRKFTIVSDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTSTGIVDIEIIEKEVRRIKNFRKLLPN
        KWRHYLLGR+FTI+SDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNK ADALSR DHS+EL  +++TGIVD+E++ KEV + +  + L   
Subjt:  KWRHYLLGRKFTIVSDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTSTGIVDIEIIEKEVRRIKNFRKLLPN

Query:  LKERWIKVGN---TSGT----------------------------------------MAGCYIKEGWQADVKRYVEECDICQRNKFKATKPVGVLQPIPI
        L+      G    T+GT                                        M+G    +G + D+K+YVE+C+ICQRNK +ATKP GVLQP+PI
Subjt:  LKERWIKVGN---TSGT----------------------------------------MAGCYIKEGWQADVKRYVEECDICQRNKFKATKPVGVLQPIPI

Query:  PDTTLEDWTMDFIEGLPIAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQ---QFLEGVVHYH
        PD  LEDWTMDFIEGLP AGG NVIMVVVDRLSKY+YF+ +KHP++AKQ   +F++ +V  H
Subjt:  PDTTLEDWTMDFIEGLPIAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQ---QFLEGVVHYH

KAE8637561.1 hypothetical protein CSA_017659 [Cucumis sativus]9.6e-20664.22Show/hide
Query:  MLQAGVIRPSCSPYSSPVLLVKKNDRGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVM
        MLQAGVIRPS SPYSSPVLLVKK D GWRFCVDYRKLN VT++DKFPIPVIEELLDELHGATVFSKLD+KS YHQIRM+EEDVEKTAFRTHEGHYEFLVM
Subjt:  MLQAGVIRPSCSPYSSPVLLVKKNDRGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVM

Query:  PFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG----------------------
        PFGLTNAPATFQSLMN+VFKPFLRRCVLVFF DILVYS DI EH KHLGMVFA+LRD+ LFAN+ KCVIAHS++QYLG                      
Subjt:  PFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG----------------------

Query:  -------------------------------------KNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQ
                                             KN+F W+E+AT+AF++LK AMTT+PVLALP+W  PF IETDASG GLGAVLSQ+GHPIAFFSQ
Subjt:  -------------------------------------KNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQ

Query:  KLSPRAQGKSIYERELMAVVLSVQKWRHYLLGRKFTIVSDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTST
        KLS RAQ KSIYERELM VVLSVQKWRHYLLGRKFTI+SDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNK ADALSR ++S+E+N++T+ 
Subjt:  KLSPRAQGKSIYERELMAVVLSVQKWRHYLLGRKFTIVSDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTST

Query:  GIVDIEIIEKEVRRIKNFRKLLPNLKER--------WIK-----------------------------VGNTSG------TMAGCYIKEGWQADVKRYVE
        GIVD+E+I+KEV + +  +K +  LK+         W                               +G  SG       M+G    EG +AD+K+YVE
Subjt:  GIVDIEIIEKEVRRIKNFRKLLPNLKER--------WIK-----------------------------VGNTSG------TMAGCYIKEGWQADVKRYVE

Query:  ECDICQRNKFKATKPVGVLQPIPIPDTTLEDWTMDFIEGLPIAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQ---QFLEGVVHYHG
        +C+ICQRNK++ATKP GVL PIP PD  LE+W+MDFIEGLP AGG NVIMVVVDRLSKY+YF+ +KHP+TAKQ    F+E +V  HG
Subjt:  ECDICQRNKFKATKPVGVLQPIPIPDTTLEDWTMDFIEGLPIAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQ---QFLEGVVHYHG

KAE8637598.1 hypothetical protein CSA_022681 [Cucumis sativus]3.4e-23574.45Show/hide
Query:  MLQAGVIRPSCSPYSSPVLLVKKNDRGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVM
        MLQAGVIRPS SPYSSPVLLVKK D GWRFCVDYRKLN VTVADKFPIPVIEELLDELHGAT FSKLDLKSGYHQIRMREEDVEKTAF THEGHYEFLVM
Subjt:  MLQAGVIRPSCSPYSSPVLLVKKNDRGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVM

Query:  PFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG----------------------
        PFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANR+KCVIAHSQVQYLG                      
Subjt:  PFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG----------------------

Query:  -------------------------------------KNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQ
                                             KNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQ
Subjt:  -------------------------------------KNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQ

Query:  KLSPRAQGKSIYERELMAVVLSVQKWRHYLLGRKFTIVSDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTST
        KLSPRAQGKSIYERELMAVVLSVQKWRHYLLGRKFTIVSDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMT+T
Subjt:  KLSPRAQGKSIYERELMAVVLSVQKWRHYLLGRKFTIVSDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTST

Query:  GIVDIEIIEKEVRRIKNFRKLLPNL--------KERWIK-----------------------------VGNTSG------TMAGCYIKEGWQADVKRYVE
        GIVDIEIIEKEV   +  +K++  L        K +W                               +G  SG       M+G    +G +AD+KRYVE
Subjt:  GIVDIEIIEKEVRRIKNFRKLLPNL--------KERWIK-----------------------------VGNTSG------TMAGCYIKEGWQADVKRYVE

Query:  ECDICQRNKFKATKPVGVLQPIPIPDTTLEDWTMDFIEGLPIAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQQ---FLEGVVHYHG
        ECD CQRNKF+ATKP GVLQPIPIPD  LEDWTMDFIEGLPIAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQ    FLE VV  HG
Subjt:  ECDICQRNKFKATKPVGVLQPIPIPDTTLEDWTMDFIEGLPIAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQQ---FLEGVVHYHG

TYJ96663.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.4e-20463.71Show/hide
Query:  MLQAGVIRPSCSPYSSPVLLVKKNDRGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVM
        MLQ G+IRPS SP+SSPVLLVKK D GWRFCVDYRKLN +T+ADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREED+EKTAFRTHEGHYEF+VM
Subjt:  MLQAGVIRPSCSPYSSPVLLVKKNDRGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVM

Query:  PFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG----------------------
        PFGLTNAPATFQSLMN+VFKPFLRRCVLVFF DILVYS DI EH KHLGMVFA LRD++L+ANR KCV AHSQ+ YLG                      
Subjt:  PFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG----------------------

Query:  -------------------------------------KNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQ
                                             KNAF W+E AT+AF+ LK AM+T+PVLALPDWS PF IETDASG GLGAVLSQ+ HPIAFFSQ
Subjt:  -------------------------------------KNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQ

Query:  KLSPRAQGKSIYERELMAVVLSVQKWRHYLLGRKFTIVSDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTST
        KLS RAQ KSIYERELMAVVLSVQKWRHYLLGR+FTI+SDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNK ADALSR DHS+EL  +++T
Subjt:  KLSPRAQGKSIYERELMAVVLSVQKWRHYLLGRKFTIVSDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTST

Query:  GIVDIEIIEKEVRRIKNFRKLLPNLKERWIKVGN---TSGT----------------------------------------MAGCYIKEGWQADVKRYVE
        GIVD+E++ KEV + +  + L+  L+      G    T+GT                                        M+G    +G + D+K+YVE
Subjt:  GIVDIEIIEKEVRRIKNFRKLLPNLKERWIKVGN---TSGT----------------------------------------MAGCYIKEGWQADVKRYVE

Query:  ECDICQRNKFKATKPVGVLQPIPIPDTTLEDWTMDFIEGLPIAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQ---QFLEGVVHYHG
        +C+ICQRNK +ATKP GVLQP+PIPD  LEDWTMDFIEGLP AGG NVIMVVVDRLSKY+YF+ +KHP++AKQ   +F++ +V  HG
Subjt:  ECDICQRNKFKATKPVGVLQPIPIPDTTLEDWTMDFIEGLPIAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQ---QFLEGVVHYHG

TYK28944.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.4e-20463.71Show/hide
Query:  MLQAGVIRPSCSPYSSPVLLVKKNDRGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVM
        MLQ G+IRPS SP+SSPVLLVKK D GWRFCVDYRKLN +T+ADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREED+EKTAFRTHEGHYEF+VM
Subjt:  MLQAGVIRPSCSPYSSPVLLVKKNDRGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVM

Query:  PFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG----------------------
        PFGLTNAPATFQSLMN+VFKPFLRRCVLVFF DILVYS DI EH KHLGMVFA LRD++L+ANR KCV AHSQ+ YLG                      
Subjt:  PFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG----------------------

Query:  -------------------------------------KNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQ
                                             KNAF W+E AT+AF+ LK AM+T+PVLALPDWS PF IETDASG GLGAVLSQ+ HPIAFFSQ
Subjt:  -------------------------------------KNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQ

Query:  KLSPRAQGKSIYERELMAVVLSVQKWRHYLLGRKFTIVSDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTST
        KLS RAQ KSIYERELMAVVLSVQKWRHYLLGR+FTI+SDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNK ADALSR DHS+EL  +++T
Subjt:  KLSPRAQGKSIYERELMAVVLSVQKWRHYLLGRKFTIVSDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTST

Query:  GIVDIEIIEKEVRRIKNFRKLLPNLKERWIKVGN---TSGT----------------------------------------MAGCYIKEGWQADVKRYVE
        GIVD+E++ KEV + +  + L+  L+      G    T+GT                                        M+G    +G + D+K+YVE
Subjt:  GIVDIEIIEKEVRRIKNFRKLLPNLKERWIKVGN---TSGT----------------------------------------MAGCYIKEGWQADVKRYVE

Query:  ECDICQRNKFKATKPVGVLQPIPIPDTTLEDWTMDFIEGLPIAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQ---QFLEGVVHYHG
        +C+ICQRNK +ATKP GVLQP+PIPD  LEDWTMDFIEGLP AGG NVIMVVVDRLSKY+YF+ +KHP++AKQ   +F++ +V  HG
Subjt:  ECDICQRNKFKATKPVGVLQPIPIPDTTLEDWTMDFIEGLPIAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQ---QFLEGVVHYHG

TrEMBL top hitse value%identityAlignment
A0A5A7UT21 Ty3/gypsy retrotransposon protein1.2e-20666.37Show/hide
Query:  MLQAGVIRPSCSPYSSPVLLVKKNDRGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVM
        MLQ G+IRPS SP+SSPVLLVKK D GWRFCVDYRKLN +T+ADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRM+EED+EKTAFRTHEGHYEFLVM
Subjt:  MLQAGVIRPSCSPYSSPVLLVKKNDRGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVM

Query:  PFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG----------------------
        PFGLTNAPATFQSLMN+VFKPFLRRCVLVFF DILVYS DI EH KHLGMVFA LRD++L+ANR KCV AHSQ+ YLG                      
Subjt:  PFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG----------------------

Query:  -------------KNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVVLSVQ
                     KNAF W+E AT+AF+ LK AM+T+PVLALPDWS PF IETDASG GLGAVLSQ+ HPIAFFSQKLS RAQ KSIYERELMAVVLSVQ
Subjt:  -------------KNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVVLSVQ

Query:  KWRHYLLGRKFTIVSDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTSTGIVDIEIIEKEVRRIKNFRKLLPN
        KWRHYLLGR+FTI+SDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNK ADALSR DHS+EL  +++TGIVD+E++ KEV + +  + L   
Subjt:  KWRHYLLGRKFTIVSDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTSTGIVDIEIIEKEVRRIKNFRKLLPN

Query:  LKERWIKVGN---TSGT----------------------------------------MAGCYIKEGWQADVKRYVEECDICQRNKFKATKPVGVLQPIPI
        L+      G    T+GT                                        M+G    +G + D+K+YVE+C+ICQRNK +ATKP GVLQP+PI
Subjt:  LKERWIKVGN---TSGT----------------------------------------MAGCYIKEGWQADVKRYVEECDICQRNKFKATKPVGVLQPIPI

Query:  PDTTLEDWTMDFIEGLPIAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQ---QFLEGVVHYH
        PD  LEDWTMDFIEGLP AGG NVIMVVVDRLSKY+YF+ +KHP++AKQ   +F++ +V  H
Subjt:  PDTTLEDWTMDFIEGLPIAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQ---QFLEGVVHYH

A0A5D3BBH7 Ty3/gypsy retrotransposon protein6.7e-20563.71Show/hide
Query:  MLQAGVIRPSCSPYSSPVLLVKKNDRGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVM
        MLQ G+IRPS SP+SSPVLLVKK D GWRFCVDYRKLN +T+ADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREED+EKTAFRTHEGHYEF+VM
Subjt:  MLQAGVIRPSCSPYSSPVLLVKKNDRGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVM

Query:  PFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG----------------------
        PFGLTNAPATFQSLMN+VFKPFLRRCVLVFF DILVYS DI EH KHLGMVFA LRD++L+ANR KCV AHSQ+ YLG                      
Subjt:  PFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG----------------------

Query:  -------------------------------------KNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQ
                                             KNAF W+E AT+AF+ LK AM+T+PVLALPDWS PF IETDASG GLGAVLSQ+ HPIAFFSQ
Subjt:  -------------------------------------KNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQ

Query:  KLSPRAQGKSIYERELMAVVLSVQKWRHYLLGRKFTIVSDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTST
        KLS RAQ KSIYERELMAVVLSVQKWRHYLLGR+FTI+SDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNK ADALSR DHS+EL  +++T
Subjt:  KLSPRAQGKSIYERELMAVVLSVQKWRHYLLGRKFTIVSDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTST

Query:  GIVDIEIIEKEVRRIKNFRKLLPNLKERWIKVGN---TSGT----------------------------------------MAGCYIKEGWQADVKRYVE
        GIVD+E++ KEV + +  + L+  L+      G    T+GT                                        M+G    +G + D+K+YVE
Subjt:  GIVDIEIIEKEVRRIKNFRKLLPNLKERWIKVGN---TSGT----------------------------------------MAGCYIKEGWQADVKRYVE

Query:  ECDICQRNKFKATKPVGVLQPIPIPDTTLEDWTMDFIEGLPIAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQ---QFLEGVVHYHG
        +C+ICQRNK +ATKP GVLQP+PIPD  LEDWTMDFIEGLP AGG NVIMVVVDRLSKY+YF+ +KHP++AKQ   +F++ +V  HG
Subjt:  ECDICQRNKFKATKPVGVLQPIPIPDTTLEDWTMDFIEGLPIAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQ---QFLEGVVHYHG

A0A5D3DWA9 Ty3/gypsy retrotransposon protein6.7e-20563.71Show/hide
Query:  MLQAGVIRPSCSPYSSPVLLVKKNDRGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVM
        MLQ G+IRPS SP+SSPVLLVKK D GWRFCVDYRKLN +T+ADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREED+EKTAFRTHEGHYEF+VM
Subjt:  MLQAGVIRPSCSPYSSPVLLVKKNDRGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVM

Query:  PFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG----------------------
        PFGLTNAPATFQSLMN+VFKPFLRRCVLVFF DILVYS DI EH KHLGMVFA LRD++L+ANR KCV AHSQ+ YLG                      
Subjt:  PFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG----------------------

Query:  -------------------------------------KNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQ
                                             KNAF W+E AT+AF+ LK AM+T+PVLALPDWS PF IETDASG GLGAVLSQ+ HPIAFFSQ
Subjt:  -------------------------------------KNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQ

Query:  KLSPRAQGKSIYERELMAVVLSVQKWRHYLLGRKFTIVSDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTST
        KLS RAQ KSIYERELMAVVLSVQKWRHYLLGR+FTI+SDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNK ADALSR DHS+EL  +++T
Subjt:  KLSPRAQGKSIYERELMAVVLSVQKWRHYLLGRKFTIVSDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTST

Query:  GIVDIEIIEKEVRRIKNFRKLLPNLKERWIKVGN---TSGT----------------------------------------MAGCYIKEGWQADVKRYVE
        GIVD+E++ KEV + +  + L+  L+      G    T+GT                                        M+G    +G + D+K+YVE
Subjt:  GIVDIEIIEKEVRRIKNFRKLLPNLKERWIKVGN---TSGT----------------------------------------MAGCYIKEGWQADVKRYVE

Query:  ECDICQRNKFKATKPVGVLQPIPIPDTTLEDWTMDFIEGLPIAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQ---QFLEGVVHYHG
        +C+ICQRNK +ATKP GVLQP+PIPD  LEDWTMDFIEGLP AGG NVIMVVVDRLSKY+YF+ +KHP++AKQ   +F++ +V  HG
Subjt:  ECDICQRNKFKATKPVGVLQPIPIPDTTLEDWTMDFIEGLPIAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQ---QFLEGVVHYHG

A0A5D3DZK6 Ty3/gypsy retrotransposon protein6.7e-20563.71Show/hide
Query:  MLQAGVIRPSCSPYSSPVLLVKKNDRGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVM
        MLQ G+IRPS SP+SSPVLLVKK D GWRFCVDYRKLN +T+ADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREED+EKTAFRTHEGHYEF+VM
Subjt:  MLQAGVIRPSCSPYSSPVLLVKKNDRGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVM

Query:  PFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG----------------------
        PFGLTNAPATFQSLMN+VFKPFLRRCVLVFF DILVYS DI EH KHLGMVFA LRD++L+ANR KCV AHSQ+ YLG                      
Subjt:  PFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG----------------------

Query:  -------------------------------------KNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQ
                                             KNAF W+E AT+AF+ LK AM+T+PVLALPDWS PF IETDASG GLGAVLSQ+ HPIAFFSQ
Subjt:  -------------------------------------KNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQ

Query:  KLSPRAQGKSIYERELMAVVLSVQKWRHYLLGRKFTIVSDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTST
        KLS RAQ KSIYERELMAVVLSVQKWRHYLLGR+FTI+SDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNK ADALSR DHS+EL  +++T
Subjt:  KLSPRAQGKSIYERELMAVVLSVQKWRHYLLGRKFTIVSDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTST

Query:  GIVDIEIIEKEVRRIKNFRKLLPNLKERWIKVGN---TSGT----------------------------------------MAGCYIKEGWQADVKRYVE
        GIVD+E++ KEV + +  + L+  L+      G    T+GT                                        M+G    +G + D+K+YVE
Subjt:  GIVDIEIIEKEVRRIKNFRKLLPNLKERWIKVGN---TSGT----------------------------------------MAGCYIKEGWQADVKRYVE

Query:  ECDICQRNKFKATKPVGVLQPIPIPDTTLEDWTMDFIEGLPIAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQ---QFLEGVVHYHG
        +C+ICQRNK +ATKP GVLQP+PIPD  LEDWTMDFIEGLP AGG NVIMVVVDRLSKY+YF+ +KHP++AKQ   +F++ +V  HG
Subjt:  ECDICQRNKFKATKPVGVLQPIPIPDTTLEDWTMDFIEGLPIAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQ---QFLEGVVHYHG

A0A5D3E325 Ty3/gypsy retrotransposon protein6.7e-20563.71Show/hide
Query:  MLQAGVIRPSCSPYSSPVLLVKKNDRGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVM
        MLQ G+IRPS SP+SSPVLLVKK D GWRFCVDYRKLN +T+ADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREED+EKTAFRTHEGHYEF+VM
Subjt:  MLQAGVIRPSCSPYSSPVLLVKKNDRGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVM

Query:  PFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG----------------------
        PFGLTNAPATFQSLMN+VFKPFLRRCVLVFF DILVYS DI EH KHLGMVFA LRD++L+ANR KCV AHSQ+ YLG                      
Subjt:  PFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG----------------------

Query:  -------------------------------------KNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQ
                                             KNAF W+E AT+AF+ LK AM+T+PVLALPDWS PF IETDASG GLGAVLSQ+ HPIAFFSQ
Subjt:  -------------------------------------KNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGHPIAFFSQ

Query:  KLSPRAQGKSIYERELMAVVLSVQKWRHYLLGRKFTIVSDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTST
        KLS RAQ KSIYERELMAVVLSVQKWRHYLLGR+FTI+SDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNK ADALSR DHS+EL  +++T
Subjt:  KLSPRAQGKSIYERELMAVVLSVQKWRHYLLGRKFTIVSDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHSVELNTMTST

Query:  GIVDIEIIEKEVRRIKNFRKLLPNLKERWIKVGN---TSGT----------------------------------------MAGCYIKEGWQADVKRYVE
        GIVD+E++ KEV + +  + L+  L+      G    T+GT                                        M+G    +G + D+K+YVE
Subjt:  GIVDIEIIEKEVRRIKNFRKLLPNLKERWIKVGN---TSGT----------------------------------------MAGCYIKEGWQADVKRYVE

Query:  ECDICQRNKFKATKPVGVLQPIPIPDTTLEDWTMDFIEGLPIAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQ---QFLEGVVHYHG
        +C+ICQRNK +ATKP GVLQP+PIPD  LEDWTMDFIEGLP AGG NVIMVVVDRLSKY+YF+ +KHP++AKQ   +F++ +V  HG
Subjt:  ECDICQRNKFKATKPVGVLQPIPIPDTTLEDWTMDFIEGLPIAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQ---QFLEGVVHYHG

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.63.6e-7038.83Show/hide
Query:  MLQAGVIRPSCSPYSSPVLLV-KKND----RGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHY
        ML  G+IR S SPY+SP+ +V KK D    + +R  +DYRKLN +TV D+ PIP ++E+L +L     F+ +DL  G+HQI M  E V KTAF T  GHY
Subjt:  MLQAGVIRPSCSPYSSPVLLV-KKND----RGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHY

Query:  EFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLGK----------------
        E+L MPFGL NAPATFQ  MN++ +P L +  LV+  DI+V+S  +DEH++ LG+VF  L    L     KC     +  +LG                 
Subjt:  EFLVMPFGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLGK----------------

Query:  -------------NAF--------------------------------HWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGH
                      AF                                  N E   AF +LK  ++  P+L +PD+++ FT+ TDAS V LGAVLSQDGH
Subjt:  -------------NAF--------------------------------HWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDGH

Query:  PIAFFSQKLSPRAQGKSIYERELMAVVLSVQKWRHYLLGRKFTIVSDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSR
        P+++ S+ L+      S  E+EL+A+V + + +RHYLLGR F I SD + L +L   ++   +  +W  KL  +DF+I Y  G +N VADALSR
Subjt:  PIAFFSQKLSPRAQGKSIYERELMAVVLSVQKWRHYLLGRKFTIVSDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSR

P0CT34 Transposon Tf2-1 polyprotein1.4e-6930.26Show/hide
Query:  LQAGVIRPSCSPYSSPVLLVKKNDRGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVMP
        L++G+IR S +  + PV+ V K +   R  VDY+ LN     + +P+P+IE+LL ++ G+T+F+KLDLKS YH IR+R+ D  K AFR   G +E+LVMP
Subjt:  LQAGVIRPSCSPYSSPVLLVKKNDRGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVMP

Query:  FGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG-----------------------
        +G++ APA FQ  +N +        V+ +  DIL++S    EH+KH+  V   L++  L  N+ KC    SQV+++G                       
Subjt:  FGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG-----------------------

Query:  -KN------------------------------------AFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDG-----HPI
         KN                                     + W    T A + +K  + + PVL   D+S+   +ETDAS V +GAVLSQ       +P+
Subjt:  -KN------------------------------------AFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDG-----HPI

Query:  AFFSQKLSPRAQGKSIYERELMAVVLSVQKWRHYLLG--RKFTIVSDQKAL--KFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHS
         ++S K+S      S+ ++E++A++ S++ WRHYL      F I++D + L  +   E      +  +W   L  ++FEI Y+PG  N +ADALSR    
Subjt:  AFFSQKLSPRAQGKSIYERELMAVVLSVQKWRHYLLG--RKFTIVSDQKAL--KFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHS

Query:  VE----------LNTMTSTGIVD----------------IEIIEKEVRRIKNFRK----LLPNLKERWIKVGNT--SGTMAGCYIKEG------------
         E          +N +    I D                + ++  E +R++   +    LL N K++ +   +T  + T+   Y +EG            
Subjt:  VE----------LNTMTSTGIVD----------------IEIIEKEVRRIKNFRK----LLPNLKERWIKVGNT--SGTMAGCYIKEG------------

Query:  -------WQA---DVKRYVEECDICQRNKFKATKPVGVLQPIPIPDTTLEDWTMDFIEGLPIAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQ---QFLE
               W+     ++ YV+ C  CQ NK +  KP G LQPIP  +   E  +MDFI  LP + GYN + VVVDR SK +  +P     TA+Q    F +
Subjt:  -------WQA---DVKRYVEECDICQRNKFKATKPVGVLQPIPIPDTTLEDWTMDFIEGLPIAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQ---QFLE

Query:  GVVHYHGH
         V+ Y G+
Subjt:  GVVHYHGH

P0CT35 Transposon Tf2-2 polyprotein1.4e-6930.26Show/hide
Query:  LQAGVIRPSCSPYSSPVLLVKKNDRGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVMP
        L++G+IR S +  + PV+ V K +   R  VDY+ LN     + +P+P+IE+LL ++ G+T+F+KLDLKS YH IR+R+ D  K AFR   G +E+LVMP
Subjt:  LQAGVIRPSCSPYSSPVLLVKKNDRGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVMP

Query:  FGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG-----------------------
        +G++ APA FQ  +N +        V+ +  DIL++S    EH+KH+  V   L++  L  N+ KC    SQV+++G                       
Subjt:  FGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG-----------------------

Query:  -KN------------------------------------AFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDG-----HPI
         KN                                     + W    T A + +K  + + PVL   D+S+   +ETDAS V +GAVLSQ       +P+
Subjt:  -KN------------------------------------AFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDG-----HPI

Query:  AFFSQKLSPRAQGKSIYERELMAVVLSVQKWRHYLLG--RKFTIVSDQKAL--KFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHS
         ++S K+S      S+ ++E++A++ S++ WRHYL      F I++D + L  +   E      +  +W   L  ++FEI Y+PG  N +ADALSR    
Subjt:  AFFSQKLSPRAQGKSIYERELMAVVLSVQKWRHYLLG--RKFTIVSDQKAL--KFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHS

Query:  VE----------LNTMTSTGIVD----------------IEIIEKEVRRIKNFRK----LLPNLKERWIKVGNT--SGTMAGCYIKEG------------
         E          +N +    I D                + ++  E +R++   +    LL N K++ +   +T  + T+   Y +EG            
Subjt:  VE----------LNTMTSTGIVD----------------IEIIEKEVRRIKNFRK----LLPNLKERWIKVGNT--SGTMAGCYIKEG------------

Query:  -------WQA---DVKRYVEECDICQRNKFKATKPVGVLQPIPIPDTTLEDWTMDFIEGLPIAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQ---QFLE
               W+     ++ YV+ C  CQ NK +  KP G LQPIP  +   E  +MDFI  LP + GYN + VVVDR SK +  +P     TA+Q    F +
Subjt:  -------WQA---DVKRYVEECDICQRNKFKATKPVGVLQPIPIPDTTLEDWTMDFIEGLPIAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQ---QFLE

Query:  GVVHYHGH
         V+ Y G+
Subjt:  GVVHYHGH

P0CT36 Transposon Tf2-3 polyprotein1.4e-6930.26Show/hide
Query:  LQAGVIRPSCSPYSSPVLLVKKNDRGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVMP
        L++G+IR S +  + PV+ V K +   R  VDY+ LN     + +P+P+IE+LL ++ G+T+F+KLDLKS YH IR+R+ D  K AFR   G +E+LVMP
Subjt:  LQAGVIRPSCSPYSSPVLLVKKNDRGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVMP

Query:  FGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG-----------------------
        +G++ APA FQ  +N +        V+ +  DIL++S    EH+KH+  V   L++  L  N+ KC    SQV+++G                       
Subjt:  FGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG-----------------------

Query:  -KN------------------------------------AFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDG-----HPI
         KN                                     + W    T A + +K  + + PVL   D+S+   +ETDAS V +GAVLSQ       +P+
Subjt:  -KN------------------------------------AFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDG-----HPI

Query:  AFFSQKLSPRAQGKSIYERELMAVVLSVQKWRHYLLG--RKFTIVSDQKAL--KFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHS
         ++S K+S      S+ ++E++A++ S++ WRHYL      F I++D + L  +   E      +  +W   L  ++FEI Y+PG  N +ADALSR    
Subjt:  AFFSQKLSPRAQGKSIYERELMAVVLSVQKWRHYLLG--RKFTIVSDQKAL--KFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHS

Query:  VE----------LNTMTSTGIVD----------------IEIIEKEVRRIKNFRK----LLPNLKERWIKVGNT--SGTMAGCYIKEG------------
         E          +N +    I D                + ++  E +R++   +    LL N K++ +   +T  + T+   Y +EG            
Subjt:  VE----------LNTMTSTGIVD----------------IEIIEKEVRRIKNFRK----LLPNLKERWIKVGNT--SGTMAGCYIKEG------------

Query:  -------WQA---DVKRYVEECDICQRNKFKATKPVGVLQPIPIPDTTLEDWTMDFIEGLPIAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQ---QFLE
               W+     ++ YV+ C  CQ NK +  KP G LQPIP  +   E  +MDFI  LP + GYN + VVVDR SK +  +P     TA+Q    F +
Subjt:  -------WQA---DVKRYVEECDICQRNKFKATKPVGVLQPIPIPDTTLEDWTMDFIEGLPIAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQ---QFLE

Query:  GVVHYHGH
         V+ Y G+
Subjt:  GVVHYHGH

P0CT41 Transposon Tf2-12 polyprotein1.4e-6930.26Show/hide
Query:  LQAGVIRPSCSPYSSPVLLVKKNDRGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVMP
        L++G+IR S +  + PV+ V K +   R  VDY+ LN     + +P+P+IE+LL ++ G+T+F+KLDLKS YH IR+R+ D  K AFR   G +E+LVMP
Subjt:  LQAGVIRPSCSPYSSPVLLVKKNDRGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVMP

Query:  FGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG-----------------------
        +G++ APA FQ  +N +        V+ +  DIL++S    EH+KH+  V   L++  L  N+ KC    SQV+++G                       
Subjt:  FGLTNAPATFQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG-----------------------

Query:  -KN------------------------------------AFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDG-----HPI
         KN                                     + W    T A + +K  + + PVL   D+S+   +ETDAS V +GAVLSQ       +P+
Subjt:  -KN------------------------------------AFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDASGVGLGAVLSQDG-----HPI

Query:  AFFSQKLSPRAQGKSIYERELMAVVLSVQKWRHYLLG--RKFTIVSDQKAL--KFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHS
         ++S K+S      S+ ++E++A++ S++ WRHYL      F I++D + L  +   E      +  +W   L  ++FEI Y+PG  N +ADALSR    
Subjt:  AFFSQKLSPRAQGKSIYERELMAVVLSVQKWRHYLLG--RKFTIVSDQKAL--KFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKDHS

Query:  VE----------LNTMTSTGIVD----------------IEIIEKEVRRIKNFRK----LLPNLKERWIKVGNT--SGTMAGCYIKEG------------
         E          +N +    I D                + ++  E +R++   +    LL N K++ +   +T  + T+   Y +EG            
Subjt:  VE----------LNTMTSTGIVD----------------IEIIEKEVRRIKNFRK----LLPNLKERWIKVGNT--SGTMAGCYIKEG------------

Query:  -------WQA---DVKRYVEECDICQRNKFKATKPVGVLQPIPIPDTTLEDWTMDFIEGLPIAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQ---QFLE
               W+     ++ YV+ C  CQ NK +  KP G LQPIP  +   E  +MDFI  LP + GYN + VVVDR SK +  +P     TA+Q    F +
Subjt:  -------WQA---DVKRYVEECDICQRNKFKATKPVGVLQPIPIPDTTLEDWTMDFIEGLPIAGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQ---QFLE

Query:  GVVHYHGH
         V+ Y G+
Subjt:  GVVHYHGH

Arabidopsis top hitse value%identityAlignment
ATMG00850.1 DNA/RNA polymerases superfamily protein4.1e-0571.43Show/hide
Query:  MLQAGVIRPSCSPYSSPVLLVKKNDRGW
        ML+A +I+PS SPYSSPVLLV+K D GW
Subjt:  MLQAGVIRPSCSPYSSPVLLVKKNDRGW

ATMG00860.1 DNA/RNA polymerases superfamily protein4.0e-0829.77Show/hide
Query:  MKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG-------------------------------------------------------------KNAFH
        M HLGMV  I   H+ +ANR KC     Q+ YLG                                                             KN+  
Subjt:  MKHLGMVFAILRDHELFANRTKCVIAHSQVQYLG-------------------------------------------------------------KNAFH

Query:  WNEEATIAFDQLKLAMTTLPVLALPDWSQPF
        W E A +AF  LK A+TTLPVLALPD   PF
Subjt:  WNEEATIAFDQLKLAMTTLPVLALPDWSQPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTACAAGCGGGAGTTATTCGTCCAAGCTGCAGCCCATATTCGAGCCCGGTCCTCTTAGTGAAGAAAAACGATAGGGGGTGGAGGTTCTGTGTAGACTACCGTAAACT
CAATCATGTAACTGTGGCTGACAAATTTCCAATCCCAGTGATTGAAGAACTCTTAGATGAACTTCATGGTGCGACAGTTTTCTCAAAGTTAGATCTTAAATCTGGATACC
ACCAGATTAGGATGAGGGAGGAAGATGTGGAGAAAACAGCTTTTCGCACGCATGAAGGACATTATGAGTTCTTGGTGATGCCTTTCGGCCTTACGAATGCTCCTGCCACC
TTCCAATCTCTCATGAACGAGGTGTTCAAACCATTCCTTCGAAGGTGTGTCCTGGTTTTTTTTTATGACATTCTAGTTTATAGTGTGGACATAGATGAGCACATGAAACA
TTTAGGAATGGTTTTTGCTATCTTGAGGGACCATGAATTGTTTGCAAATAGAACTAAATGTGTAATTGCTCATTCCCAAGTTCAATATTTGGGGAAAAATGCATTCCATT
GGAATGAGGAAGCTACAATAGCGTTTGACCAGCTGAAGCTAGCAATGACAACCTTACCGGTATTAGCATTGCCGGATTGGTCTCAGCCCTTCACAATCGAAACTGATGCT
TCAGGAGTAGGTTTAGGCGCAGTTTTATCACAGGATGGTCATCCCATCGCATTCTTCAGCCAGAAACTGTCCCCAAGAGCCCAGGGTAAGTCGATCTATGAAAGGGAATT
GATGGCGGTTGTCCTTTCGGTGCAAAAATGGAGACATTATCTCCTGGGCAGAAAGTTCACAATTGTTTCAGATCAGAAGGCTCTGAAATTTCTGTTAGAACAGAGGGAAG
TTCAGCCTCAATTCCAAAAGTGGCTCACAAAACTTTTGGGGTATGACTTTGAGATCTTATACCAGCCGGGTCTGCAGAATAAGGTTGCGGATGCTCTCTCAAGGAAGGAC
CATTCGGTGGAGTTAAATACAATGACAAGCACAGGCATAGTTGATATAGAGATAATAGAGAAGGAAGTGAGAAGGATCAAGAACTTCAGAAAATTATTGCCGAACTTAAA
GGAGAGGTGGATCAAGGTGGGAAATACCAGTGGAACAATGGCAGGCTGTTATATAAAGGAAGGATGGCAGGCTGATGTCAAGAGATATGTGGAAGAATGTGACATATGTC
AACGCAATAAATTCAAAGCGACCAAACCTGTTGGAGTTCTTCAACCCATTCCTATCCCCGACACGACATTGGAAGATTGGACCATGGACTTCATCGAAGGTTTGCCAATA
GCAGGAGGTTACAACGTGATTATGGTAGTCGTCGATCGCCTAAGTAAGTACTCCTATTTCTTGCCTCTGAAACACCCGTACACAGCCAAGCAACAATTTCTGGAAGGAGT
TGTTCACTACCATGGGCACCATTTTGAAGAGAAGCACGGCATTCCATCCCCAAACCGATGGACAAACCGAGAGGGTAAACAGATGCCTAGAGACCTATTTGAGATGCTTT
TGCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTACAAGCGGGAGTTATTCGTCCAAGCTGCAGCCCATATTCGAGCCCGGTCCTCTTAGTGAAGAAAAACGATAGGGGGTGGAGGTTCTGTGTAGACTACCGTAAACT
CAATCATGTAACTGTGGCTGACAAATTTCCAATCCCAGTGATTGAAGAACTCTTAGATGAACTTCATGGTGCGACAGTTTTCTCAAAGTTAGATCTTAAATCTGGATACC
ACCAGATTAGGATGAGGGAGGAAGATGTGGAGAAAACAGCTTTTCGCACGCATGAAGGACATTATGAGTTCTTGGTGATGCCTTTCGGCCTTACGAATGCTCCTGCCACC
TTCCAATCTCTCATGAACGAGGTGTTCAAACCATTCCTTCGAAGGTGTGTCCTGGTTTTTTTTTATGACATTCTAGTTTATAGTGTGGACATAGATGAGCACATGAAACA
TTTAGGAATGGTTTTTGCTATCTTGAGGGACCATGAATTGTTTGCAAATAGAACTAAATGTGTAATTGCTCATTCCCAAGTTCAATATTTGGGGAAAAATGCATTCCATT
GGAATGAGGAAGCTACAATAGCGTTTGACCAGCTGAAGCTAGCAATGACAACCTTACCGGTATTAGCATTGCCGGATTGGTCTCAGCCCTTCACAATCGAAACTGATGCT
TCAGGAGTAGGTTTAGGCGCAGTTTTATCACAGGATGGTCATCCCATCGCATTCTTCAGCCAGAAACTGTCCCCAAGAGCCCAGGGTAAGTCGATCTATGAAAGGGAATT
GATGGCGGTTGTCCTTTCGGTGCAAAAATGGAGACATTATCTCCTGGGCAGAAAGTTCACAATTGTTTCAGATCAGAAGGCTCTGAAATTTCTGTTAGAACAGAGGGAAG
TTCAGCCTCAATTCCAAAAGTGGCTCACAAAACTTTTGGGGTATGACTTTGAGATCTTATACCAGCCGGGTCTGCAGAATAAGGTTGCGGATGCTCTCTCAAGGAAGGAC
CATTCGGTGGAGTTAAATACAATGACAAGCACAGGCATAGTTGATATAGAGATAATAGAGAAGGAAGTGAGAAGGATCAAGAACTTCAGAAAATTATTGCCGAACTTAAA
GGAGAGGTGGATCAAGGTGGGAAATACCAGTGGAACAATGGCAGGCTGTTATATAAAGGAAGGATGGCAGGCTGATGTCAAGAGATATGTGGAAGAATGTGACATATGTC
AACGCAATAAATTCAAAGCGACCAAACCTGTTGGAGTTCTTCAACCCATTCCTATCCCCGACACGACATTGGAAGATTGGACCATGGACTTCATCGAAGGTTTGCCAATA
GCAGGAGGTTACAACGTGATTATGGTAGTCGTCGATCGCCTAAGTAAGTACTCCTATTTCTTGCCTCTGAAACACCCGTACACAGCCAAGCAACAATTTCTGGAAGGAGT
TGTTCACTACCATGGGCACCATTTTGAAGAGAAGCACGGCATTCCATCCCCAAACCGATGGACAAACCGAGAGGGTAAACAGATGCCTAGAGACCTATTTGAGATGCTTT
TGCAATGA
Protein sequenceShow/hide protein sequence
MLQAGVIRPSCSPYSSPVLLVKKNDRGWRFCVDYRKLNHVTVADKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMREEDVEKTAFRTHEGHYEFLVMPFGLTNAPAT
FQSLMNEVFKPFLRRCVLVFFYDILVYSVDIDEHMKHLGMVFAILRDHELFANRTKCVIAHSQVQYLGKNAFHWNEEATIAFDQLKLAMTTLPVLALPDWSQPFTIETDA
SGVGLGAVLSQDGHPIAFFSQKLSPRAQGKSIYERELMAVVLSVQKWRHYLLGRKFTIVSDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQPGLQNKVADALSRKD
HSVELNTMTSTGIVDIEIIEKEVRRIKNFRKLLPNLKERWIKVGNTSGTMAGCYIKEGWQADVKRYVEECDICQRNKFKATKPVGVLQPIPIPDTTLEDWTMDFIEGLPI
AGGYNVIMVVVDRLSKYSYFLPLKHPYTAKQQFLEGVVHYHGHHFEEKHGIPSPNRWTNREGKQMPRDLFEMLLQ