CuGenDBv2

CSPI05G18960 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI05G18960
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Ty3/gypsy retrotransposon protein
Genome location: Chr5:20133334..20138295
RNA-Seq Expression: CSPI05G18960
Synteny: CSPI05G18960
Gene Ontology terms:
GO:0006807 - nitrogen compound metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits    e value    % identity    Alignment
KAA0037196.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]    6.6e-209    53.29
Query:  FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVLTIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTD
        F+   KK+VIRGDPSLTKARVSLKNLMKSWG +DQGFLVECR +E     E E   E+     E +A +L+RF   FEWP TLPP+R I+HHI+LK G D
Subjt:  FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVLTIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTD

Query:  PVNVRPYRYAYQQKTEMERLVEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS------
        PVNVRPYRYA+ QK EMERLV+EML                                 ALNNVT+PDKFPIPVIEELFDEL GA++F+KIDLK+      
Subjt:  PVNVRPYRYAYQQKTEMERLVEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS------

Query:  ------------------------------------------RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL
                                                  +PYLR+FVLVFFDDIL+YS+G+E+H+ H+  +L +L++ ELY N +KCSFA+ R+ YL
Subjt:  ------------------------------------------RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL

Query:  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQT
        GH IS +G+E DPEKIRA+ EWP PANVREVRGFLGLTGYYR+FV+NYGTIAA L+QLLK G +KW EE + AF +L++AMM+LPVL +PDFS+PFEI++
Subjt:  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQT

Query:  DASRYGLGAVLVQNRRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQYQKWIAKLLGYSFEVVYKPG
        DAS +G+GAVL Q R+P+AY+S TL+MRDR++PVYEREL+AVV+AVQRWRPYLLG+KF VKTDQRSL+FLLEQ RV+QPQYQKW+AKLLGYSFEVVY+PG
Subjt:  DASRYGLGAVLVQNRRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQYQKWIAKLLGYSFEVVYKPG

Query:  LENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDEFLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL
        LENKAADALSR+   A +NQ+T   +ID+++IKEE   D  L+EI RL  + G E+ +Y+                                  GHSGFL
Subjt:  LENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDEFLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL

Query:  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDFIEGLPKSFGYEVIFVVVDRLSNYDHFLCLKHPFDAKTVAE
        RTYKRLT E++W GMK                              EIP  +W DISMDFIEGLPKS G++VI VVVDRLS Y HFL LKHPF AK VAE
Subjt:  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDFIEGLPKSFGYEVIFVVVDRLSNYDHFLCLKHPFDAKTVAE

Query:  LFVKEIAQNWTEARLITPKRMGRLRWLTDQWKFTYAVSVVKDQKS
         FVKE+ +     R I   R     +L+  WK  + ++  K  +S
Subjt:  LFVKEIAQNWTEARLITPKRMGRLRWLTDQWKFTYAVSVVKDQKS

KAA0050511.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]    4.3e-208    53.02
Query:  FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVLTIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTD
        F+   KK+VIRGDPSLTKARVSLKNLMKSWG +DQGFLVECR +E     E E   E      E +A +L+RF   FEWP TLPP+R I+HHI+LK G D
Subjt:  FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVLTIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTD

Query:  PVNVRPYRYAYQQKTEMERLVEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS------
        PVNVRPYRYA+ QK EMERLV+EML                                 ALNNVT+PDKFPIPVIEELFDEL GA++F+KIDLK+      
Subjt:  PVNVRPYRYAYQQKTEMERLVEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS------

Query:  ------------------------------------------RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL
                                                  +PYLR+FVLVFFDDIL+YS+G+E+HL H+  +L +L++ ELY N +KCSFA+ R+ YL
Subjt:  ------------------------------------------RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL

Query:  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQT
        GH IS +G+E DPEKIRA+ EWP PANVREVRGFLGLTGYYR+FV+NYGTIAA L+QLLK G +KW EE + AF +L++AMM+LPVL +PDFS+PFEI++
Subjt:  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQT

Query:  DASRYGLGAVLVQNRRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQYQKWIAKLLGYSFEVVYKPG
        DAS +G+GAVL Q R+P+AY+S TL++RDR++PVYEREL+AVV+AVQRWRPYLLG+KF VKTDQRSL++LLEQ RV+QPQYQKW+AKLLGYSFEVVY+PG
Subjt:  DASRYGLGAVLVQNRRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQYQKWIAKLLGYSFEVVYKPG

Query:  LENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDEFLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL
        LENKAADALSR+   A++NQ+T   LID++++KEE  +D  L+EI RL  + G E+ +Y+                                  GHSGFL
Subjt:  LENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDEFLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL

Query:  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDFIEGLPKSFGYEVIFVVVDRLSNYDHFLCLKHPFDAKTVAE
        RTYKRLT E++W GMK                              EIP  +W DISMDFIEGLPKS G++VI VVVDRLS Y HFL LKHPF AK VAE
Subjt:  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDFIEGLPKSFGYEVIFVVVDRLSNYDHFLCLKHPFDAKTVAE

Query:  LFVKEIAQNWTEARLITPKRMGRLRWLTDQWKFTYAVSVVKDQKS
         F+KE+ +     R I   R     +L+  WK  + ++  K  +S
Subjt:  LFVKEIAQNWTEARLITPKRMGRLRWLTDQWKFTYAVSVVKDQKS

TYK06572.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]    3.6e-207    52.75
Query:  FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVLTIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTD
        F+   KK+VIRGDPSLTKARVSLKNLMKSWG +DQGFLVECR +E     E E   E      E +A +L+RF   FEWP TLPP+R I+HHI++K G D
Subjt:  FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVLTIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTD

Query:  PVNVRPYRYAYQQKTEMERLVEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS------
        PVNVRPYRYA+ QK EMERLV+EML                                 ALNNVT+PDKFPIPVIEELFDEL GA++F+KIDLK+      
Subjt:  PVNVRPYRYAYQQKTEMERLVEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS------

Query:  ------------------------------------------RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL
                                                  +PYLR+FVLVFFDDIL+YS+G+E+H  H+  +L +L+  ELY N +KCSFA+ R+ YL
Subjt:  ------------------------------------------RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL

Query:  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQT
        GH IS +G+E DPEKIRA+ EWP PANVREVRGFLGLTGYYR+FV+NYGTIAA L+QLLK G +KW EE + AF +L++AMM+LPVL +PDFS+PFEI++
Subjt:  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQT

Query:  DASRYGLGAVLVQNRRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQYQKWIAKLLGYSFEVVYKPG
        DAS +G+GAVL Q R+P+AY+S TL++RDR++PVYEREL+AVV+AVQRWRPYLLG+KF VKTDQRSL++LLEQ RV+QPQYQKW+AKLLGYSFEVVY+PG
Subjt:  DASRYGLGAVLVQNRRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQYQKWIAKLLGYSFEVVYKPG

Query:  LENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDEFLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL
        LENKAADALSR+   A++NQ+T   LID++++KEE  +D  L+EI RL  + G E+ +Y+                                  GHSGFL
Subjt:  LENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDEFLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL

Query:  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDFIEGLPKSFGYEVIFVVVDRLSNYDHFLCLKHPFDAKTVAE
        RTYKRLT E++W GMK                              EIP  +W DISMDFIEGLPKS G++VI VVVDRLS Y HFL LKHPF AK VAE
Subjt:  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDFIEGLPKSFGYEVIFVVVDRLSNYDHFLCLKHPFDAKTVAE

Query:  LFVKEIAQNWTEARLITPKRMGRLRWLTDQWKFTYAVSVVKDQKS
         F+KE+ +     R I   R     +L+  WK  + ++  K  +S
Subjt:  LFVKEIAQNWTEARLITPKRMGRLRWLTDQWKFTYAVSVVKDQKS

TYK13876.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]    6.6e-209    53.29
Query:  FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVLTIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTD
        F+   KK+VIRGDPSLTKARVSLKNLMKSWG +DQGFLVECR +E     E E   E+     E +A +L+RF   FEWP TLPP+R I+HHI+LK G D
Subjt:  FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVLTIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTD

Query:  PVNVRPYRYAYQQKTEMERLVEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS------
        PVNVRPYRYA+ QK EMERLV+EML                                 ALNNVT+PDKFPIPVIEELFDEL GA++F+KIDLK+      
Subjt:  PVNVRPYRYAYQQKTEMERLVEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS------

Query:  ------------------------------------------RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL
                                                  +PYLR+FVLVFFDDIL+YS+G+E+H+ H+  +L +L++ ELY N +KCSFA+ R+ YL
Subjt:  ------------------------------------------RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL

Query:  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQT
        GH IS +G+E DPEKIRA+ EWP PANVREVRGFLGLTGYYR+FV+NYGTIAA L+QLLK G +KW EE + AF +L++AMM+LPVL +PDFS+PFEI++
Subjt:  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQT

Query:  DASRYGLGAVLVQNRRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQYQKWIAKLLGYSFEVVYKPG
        DAS +G+GAVL Q R+P+AY+S TL+MRDR++PVYEREL+AVV+AVQRWRPYLLG+KF VKTDQRSL+FLLEQ RV+QPQYQKW+AKLLGYSFEVVY+PG
Subjt:  DASRYGLGAVLVQNRRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQYQKWIAKLLGYSFEVVYKPG

Query:  LENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDEFLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL
        LENKAADALSR+   A +NQ+T   +ID+++IKEE   D  L+EI RL  + G E+ +Y+                                  GHSGFL
Subjt:  LENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDEFLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL

Query:  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDFIEGLPKSFGYEVIFVVVDRLSNYDHFLCLKHPFDAKTVAE
        RTYKRLT E++W GMK                              EIP  +W DISMDFIEGLPKS G++VI VVVDRLS Y HFL LKHPF AK VAE
Subjt:  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDFIEGLPKSFGYEVIFVVVDRLSNYDHFLCLKHPFDAKTVAE

Query:  LFVKEIAQNWTEARLITPKRMGRLRWLTDQWKFTYAVSVVKDQKS
         FVKE+ +     R I   R     +L+  WK  + ++  K  +S
Subjt:  LFVKEIAQNWTEARLITPKRMGRLRWLTDQWKFTYAVSVVKDQKS

TYK24654.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]    3.6e-207    52.75
Query:  FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVLTIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTD
        F+   KK+VIRGDPSLTKARVSLKNLMKSWG +DQGFLVECR +E     E E   E      E +A +L+RF   FEWP TLPP+R I+HHI++K G D
Subjt:  FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVLTIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTD

Query:  PVNVRPYRYAYQQKTEMERLVEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS------
        PVNVRPYRYA+ QK EMERLV+EML                                 ALNNVT+PDKFPIPVIEELFDEL GA++F+KIDLK+      
Subjt:  PVNVRPYRYAYQQKTEMERLVEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS------

Query:  ------------------------------------------RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL
                                                  +PYLR+FVLVFFDDIL+YS+G+E+H  H+  +L +L+  ELY N +KCSFA+ R+ YL
Subjt:  ------------------------------------------RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL

Query:  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQT
        GH IS +G+E DPEKIRA+ EWP PANVREVRGFLGLTGYYR+FV+NYGTIAA L+QLLK G +KW EE + AF +L++AMM+LPVL +PDFS+PFEI++
Subjt:  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQT

Query:  DASRYGLGAVLVQNRRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQYQKWIAKLLGYSFEVVYKPG
        DAS +G+GAVL Q R+P+AY+S TL++RDR++PVYEREL+AVV+AVQRWRPYLLG+KF VKTDQRSL++LLEQ RV+QPQYQKW+AKLLGYSFEVVY+PG
Subjt:  DASRYGLGAVLVQNRRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQYQKWIAKLLGYSFEVVYKPG

Query:  LENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDEFLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL
        LENKAADALSR+   A++NQ+T   LID++++KEE  +D  L+EI RL  + G E+ +Y+                                  GHSGFL
Subjt:  LENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDEFLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL

Query:  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDFIEGLPKSFGYEVIFVVVDRLSNYDHFLCLKHPFDAKTVAE
        RTYKRLT E++W GMK                              EIP  +W DISMDFIEGLPKS G++VI VVVDRLS Y HFL LKHPF AK VAE
Subjt:  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDFIEGLPKSFGYEVIFVVVDRLSNYDHFLCLKHPFDAKTVAE

Query:  LFVKEIAQNWTEARLITPKRMGRLRWLTDQWKFTYAVSVVKDQKS
         F+KE+ +     R I   R     +L+  WK  + ++  K  +S
Subjt:  LFVKEIAQNWTEARLITPKRMGRLRWLTDQWKFTYAVSVVKDQKS

TrEMBL top hits    e value    % identity    Alignment
A0A5A7T4Y0 Ty3/gypsy retrotransposon protein    3.2e-209    53.29
Query:  FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVLTIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTD
        F+   KK+VIRGDPSLTKARVSLKNLMKSWG +DQGFLVECR +E     E E   E+     E +A +L+RF   FEWP TLPP+R I+HHI+LK G D
Subjt:  FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVLTIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTD

Query:  PVNVRPYRYAYQQKTEMERLVEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS------
        PVNVRPYRYA+ QK EMERLV+EML                                 ALNNVT+PDKFPIPVIEELFDEL GA++F+KIDLK+      
Subjt:  PVNVRPYRYAYQQKTEMERLVEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS------

Query:  ------------------------------------------RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL
                                                  +PYLR+FVLVFFDDIL+YS+G+E+H+ H+  +L +L++ ELY N +KCSFA+ R+ YL
Subjt:  ------------------------------------------RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL

Query:  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQT
        GH IS +G+E DPEKIRA+ EWP PANVREVRGFLGLTGYYR+FV+NYGTIAA L+QLLK G +KW EE + AF +L++AMM+LPVL +PDFS+PFEI++
Subjt:  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQT

Query:  DASRYGLGAVLVQNRRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQYQKWIAKLLGYSFEVVYKPG
        DAS +G+GAVL Q R+P+AY+S TL+MRDR++PVYEREL+AVV+AVQRWRPYLLG+KF VKTDQRSL+FLLEQ RV+QPQYQKW+AKLLGYSFEVVY+PG
Subjt:  DASRYGLGAVLVQNRRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQYQKWIAKLLGYSFEVVYKPG

Query:  LENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDEFLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL
        LENKAADALSR+   A +NQ+T   +ID+++IKEE   D  L+EI RL  + G E+ +Y+                                  GHSGFL
Subjt:  LENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDEFLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL

Query:  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDFIEGLPKSFGYEVIFVVVDRLSNYDHFLCLKHPFDAKTVAE
        RTYKRLT E++W GMK                              EIP  +W DISMDFIEGLPKS G++VI VVVDRLS Y HFL LKHPF AK VAE
Subjt:  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDFIEGLPKSFGYEVIFVVVDRLSNYDHFLCLKHPFDAKTVAE

Query:  LFVKEIAQNWTEARLITPKRMGRLRWLTDQWKFTYAVSVVKDQKS
         FVKE+ +     R I   R     +L+  WK  + ++  K  +S
Subjt:  LFVKEIAQNWTEARLITPKRMGRLRWLTDQWKFTYAVSVVKDQKS

A0A5A7UAE4 Ty3/gypsy retrotransposon protein    2.1e-208    53.02
Query:  FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVLTIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTD
        F+   KK+VIRGDPSLTKARVSLKNLMKSWG +DQGFLVECR +E     E E   E      E +A +L+RF   FEWP TLPP+R I+HHI+LK G D
Subjt:  FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVLTIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTD

Query:  PVNVRPYRYAYQQKTEMERLVEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS------
        PVNVRPYRYA+ QK EMERLV+EML                                 ALNNVT+PDKFPIPVIEELFDEL GA++F+KIDLK+      
Subjt:  PVNVRPYRYAYQQKTEMERLVEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS------

Query:  ------------------------------------------RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL
                                                  +PYLR+FVLVFFDDIL+YS+G+E+HL H+  +L +L++ ELY N +KCSFA+ R+ YL
Subjt:  ------------------------------------------RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL

Query:  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQT
        GH IS +G+E DPEKIRA+ EWP PANVREVRGFLGLTGYYR+FV+NYGTIAA L+QLLK G +KW EE + AF +L++AMM+LPVL +PDFS+PFEI++
Subjt:  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQT

Query:  DASRYGLGAVLVQNRRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQYQKWIAKLLGYSFEVVYKPG
        DAS +G+GAVL Q R+P+AY+S TL++RDR++PVYEREL+AVV+AVQRWRPYLLG+KF VKTDQRSL++LLEQ RV+QPQYQKW+AKLLGYSFEVVY+PG
Subjt:  DASRYGLGAVLVQNRRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQYQKWIAKLLGYSFEVVYKPG

Query:  LENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDEFLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL
        LENKAADALSR+   A++NQ+T   LID++++KEE  +D  L+EI RL  + G E+ +Y+                                  GHSGFL
Subjt:  LENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDEFLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL

Query:  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDFIEGLPKSFGYEVIFVVVDRLSNYDHFLCLKHPFDAKTVAE
        RTYKRLT E++W GMK                              EIP  +W DISMDFIEGLPKS G++VI VVVDRLS Y HFL LKHPF AK VAE
Subjt:  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDFIEGLPKSFGYEVIFVVVDRLSNYDHFLCLKHPFDAKTVAE

Query:  LFVKEIAQNWTEARLITPKRMGRLRWLTDQWKFTYAVSVVKDQKS
         F+KE+ +     R I   R     +L+  WK  + ++  K  +S
Subjt:  LFVKEIAQNWTEARLITPKRMGRLRWLTDQWKFTYAVSVVKDQKS

A0A5D3C5N7 Ty3/gypsy retrotransposon protein    1.7e-207    52.75
Query:  FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVLTIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTD
        F+   KK+VIRGDPSLTKARVSLKNLMKSWG +DQGFLVECR +E     E E   E      E +A +L+RF   FEWP TLPP+R I+HHI++K G D
Subjt:  FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVLTIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTD

Query:  PVNVRPYRYAYQQKTEMERLVEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS------
        PVNVRPYRYA+ QK EMERLV+EML                                 ALNNVT+PDKFPIPVIEELFDEL GA++F+KIDLK+      
Subjt:  PVNVRPYRYAYQQKTEMERLVEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS------

Query:  ------------------------------------------RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL
                                                  +PYLR+FVLVFFDDIL+YS+G+E+H  H+  +L +L+  ELY N +KCSFA+ R+ YL
Subjt:  ------------------------------------------RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL

Query:  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQT
        GH IS +G+E DPEKIRA+ EWP PANVREVRGFLGLTGYYR+FV+NYGTIAA L+QLLK G +KW EE + AF +L++AMM+LPVL +PDFS+PFEI++
Subjt:  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQT

Query:  DASRYGLGAVLVQNRRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQYQKWIAKLLGYSFEVVYKPG
        DAS +G+GAVL Q R+P+AY+S TL++RDR++PVYEREL+AVV+AVQRWRPYLLG+KF VKTDQRSL++LLEQ RV+QPQYQKW+AKLLGYSFEVVY+PG
Subjt:  DASRYGLGAVLVQNRRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQYQKWIAKLLGYSFEVVYKPG

Query:  LENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDEFLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL
        LENKAADALSR+   A++NQ+T   LID++++KEE  +D  L+EI RL  + G E+ +Y+                                  GHSGFL
Subjt:  LENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDEFLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL

Query:  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDFIEGLPKSFGYEVIFVVVDRLSNYDHFLCLKHPFDAKTVAE
        RTYKRLT E++W GMK                              EIP  +W DISMDFIEGLPKS G++VI VVVDRLS Y HFL LKHPF AK VAE
Subjt:  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDFIEGLPKSFGYEVIFVVVDRLSNYDHFLCLKHPFDAKTVAE

Query:  LFVKEIAQNWTEARLITPKRMGRLRWLTDQWKFTYAVSVVKDQKS
         F+KE+ +     R I   R     +L+  WK  + ++  K  +S
Subjt:  LFVKEIAQNWTEARLITPKRMGRLRWLTDQWKFTYAVSVVKDQKS

A0A5D3CU05 Ty3/gypsy retrotransposon protein    3.2e-209    53.29
Query:  FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVLTIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTD
        F+   KK+VIRGDPSLTKARVSLKNLMKSWG +DQGFLVECR +E     E E   E+     E +A +L+RF   FEWP TLPP+R I+HHI+LK G D
Subjt:  FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVLTIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTD

Query:  PVNVRPYRYAYQQKTEMERLVEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS------
        PVNVRPYRYA+ QK EMERLV+EML                                 ALNNVT+PDKFPIPVIEELFDEL GA++F+KIDLK+      
Subjt:  PVNVRPYRYAYQQKTEMERLVEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS------

Query:  ------------------------------------------RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL
                                                  +PYLR+FVLVFFDDIL+YS+G+E+H+ H+  +L +L++ ELY N +KCSFA+ R+ YL
Subjt:  ------------------------------------------RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL

Query:  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQT
        GH IS +G+E DPEKIRA+ EWP PANVREVRGFLGLTGYYR+FV+NYGTIAA L+QLLK G +KW EE + AF +L++AMM+LPVL +PDFS+PFEI++
Subjt:  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQT

Query:  DASRYGLGAVLVQNRRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQYQKWIAKLLGYSFEVVYKPG
        DAS +G+GAVL Q R+P+AY+S TL+MRDR++PVYEREL+AVV+AVQRWRPYLLG+KF VKTDQRSL+FLLEQ RV+QPQYQKW+AKLLGYSFEVVY+PG
Subjt:  DASRYGLGAVLVQNRRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQYQKWIAKLLGYSFEVVYKPG

Query:  LENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDEFLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL
        LENKAADALSR+   A +NQ+T   +ID+++IKEE   D  L+EI RL  + G E+ +Y+                                  GHSGFL
Subjt:  LENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDEFLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL

Query:  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDFIEGLPKSFGYEVIFVVVDRLSNYDHFLCLKHPFDAKTVAE
        RTYKRLT E++W GMK                              EIP  +W DISMDFIEGLPKS G++VI VVVDRLS Y HFL LKHPF AK VAE
Subjt:  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDFIEGLPKSFGYEVIFVVVDRLSNYDHFLCLKHPFDAKTVAE

Query:  LFVKEIAQNWTEARLITPKRMGRLRWLTDQWKFTYAVSVVKDQKS
         FVKE+ +     R I   R     +L+  WK  + ++  K  +S
Subjt:  LFVKEIAQNWTEARLITPKRMGRLRWLTDQWKFTYAVSVVKDQKS

A0A5D3DM31 Ty3/gypsy retrotransposon protein    1.7e-207    52.75
Query:  FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVLTIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTD
        F+   KK+VIRGDPSLTKARVSLKNLMKSWG +DQGFLVECR +E     E E   E      E +A +L+RF   FEWP TLPP+R I+HHI++K G D
Subjt:  FYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVLTIEESMAVVLERFEDAFEWPETLPPRRLIEHHIHLKKGTD

Query:  PVNVRPYRYAYQQKTEMERLVEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS------
        PVNVRPYRYA+ QK EMERLV+EML                                 ALNNVT+PDKFPIPVIEELFDEL GA++F+KIDLK+      
Subjt:  PVNVRPYRYAYQQKTEMERLVEEMLQG-------------------------------ALNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS------

Query:  ------------------------------------------RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL
                                                  +PYLR+FVLVFFDDIL+YS+G+E+H  H+  +L +L+  ELY N +KCSFA+ R+ YL
Subjt:  ------------------------------------------RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYL

Query:  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQT
        GH IS +G+E DPEKIRA+ EWP PANVREVRGFLGLTGYYR+FV+NYGTIAA L+QLLK G +KW EE + AF +L++AMM+LPVL +PDFS+PFEI++
Subjt:  GHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQT

Query:  DASRYGLGAVLVQNRRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQYQKWIAKLLGYSFEVVYKPG
        DAS +G+GAVL Q R+P+AY+S TL++RDR++PVYEREL+AVV+AVQRWRPYLLG+KF VKTDQRSL++LLEQ RV+QPQYQKW+AKLLGYSFEVVY+PG
Subjt:  DASRYGLGAVLVQNRRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQYQKWIAKLLGYSFEVVYKPG

Query:  LENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDEFLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL
        LENKAADALSR+   A++NQ+T   LID++++KEE  +D  L+EI RL  + G E+ +Y+                                  GHSGFL
Subjt:  LENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDEFLKEICRL--QGGEEVKNYS---------------------------------WGHSGFL

Query:  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDFIEGLPKSFGYEVIFVVVDRLSNYDHFLCLKHPFDAKTVAE
        RTYKRLT E++W GMK                              EIP  +W DISMDFIEGLPKS G++VI VVVDRLS Y HFL LKHPF AK VAE
Subjt:  RTYKRLTEELFWVGMKS-----------------------------EIPQRMWEDISMDFIEGLPKSFGYEVIFVVVDRLSNYDHFLCLKHPFDAKTVAE

Query:  LFVKEIAQNWTEARLITPKRMGRLRWLTDQWKFTYAVSVVKDQKS
         F+KE+ +     R I   R     +L+  WK  + ++  K  +S
Subjt:  LFVKEIAQNWTEARLITPKRMGRLRWLTDQWKFTYAVSVVKDQKS

SwissProt top hits    e value    % identity    Alignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.6    1.5e-51    32.01
Query:  LNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS------------------------------------------------RPYLRKFVLVFFDDILI
        LN +TV D+ PIP ++E+  +L     FT IDL                                                  RP L K  LV+ DDI++
Subjt:  LNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS------------------------------------------------RPYLRKFVLVFFDDILI

Query:  YSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYLGHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLL
        +S  L++HL  +  + E L K  L     KC F +    +LGH+++ +G++ +PEKI AI+++PIP   +E++ FLGLTGYYRKF+ N+  IA  +++ L
Subjt:  YSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYLGHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLL

Query:  K--IGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQTDASRYGLGAVLVQNRRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKK
        K  +       E   AF +L+  +   P+L +PDF+  F + TDAS   LGAVL Q+  P++Y S TL   +      E+EL+A+V A + +R YLLG+ 
Subjt:  K--IGGFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQTDASRYGLGAVLVQNRRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKK

Query:  FLVKTDQRSLQFLLEQRRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAADALSRVPL-VAEINQLTVHTL----IDLKVIKEEVMKDEFLKEICRLQGGE
        F + +D + L +L   +     +  +W  KL  + F++ Y  G EN  ADALSR+ L    +++ T H+      DL  I E  + + F +++   +G  
Subjt:  FLVKTDQRSLQFLLEQRRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAADALSRVPL-VAEINQLTVHTL----IDLKVIKEEVMKDEFLKEICRLQGGE

Query:  EVK
        ++K
Subjt:  EVK

P0CT41 Transposon Tf2-12 polyprotein    1.0e-47    25.28
Query:  LPPRRLIEHHIHLKKG--------TDPVNVRPYRYAYQQKTEMERLVEEMLQGALNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKSRPYL-------
        LPP ++   +  + +G        +  +N  P  +  +++  +  +V+      LN    P+ +P+P+IE+L  ++ G+T+FTK+DLKS  +L       
Subjt:  LPPRRLIEHHIHLKKG--------TDPVNVRPYRYAYQQKTEMERLVEEMLQGALNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKSRPYL-------

Query:  -----------------------------------------RKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYLGHIIS
                                                    V+ + DDILI+SK   +H+ H++ +L+ L+   L  N+ KC F Q +V ++G+ IS
Subjt:  -----------------------------------------RKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYLGHIIS

Query:  REGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIG-GFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQTDASR
         +G     E I  + +W  P N +E+R FLG   Y RKF+     +   L+ LLK    +KWT     A   ++Q ++S PVL   DFS    ++TDAS 
Subjt:  REGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIG-GFKWTEEAQVAFNRLQQAMMSLPVLALPDFSVPFEIQTDASR

Query:  YGLGAVLVQNR-----RPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLG--KKFLVKTDQRSL-QFLLEQRRVIQPQYQKWIAKLLGYSFEVV
          +GAVL Q        P+ YYS  ++       V ++E++A++ +++ WR YL    + F + TD R+L   +  +      +  +W   L  ++FE+ 
Subjt:  YGLGAVLVQNR-----RPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLG--KKFLVKTDQRSL-QFLLEQRRVIQPQYQKWIAKLLGYSFEVV

Query:  YKPGLENKAADALSR----------------VPLVAEI-------NQLTVHTLIDLKVIKEEVMKDEFLKEICRLQGG-------------------EEV
        Y+PG  N  ADALSR                +  V +I       NQ+      D K++     +D+ ++E  +L+ G                     +
Subjt:  YKPGLENKAADALSR----------------VPLVAEI-------NQLTVHTLIDLKVIKEEVMKDEFLKEICRLQGG-------------------EEV

Query:  KNY----SWGHSGFLRTYKRLTEELFWVGMKSEI-----------------------------PQRMWEDISMDFIEGLPKSFGYEVIFVVVDRLSNYDH
        K Y       H G       +     W G++ +I                              +R WE +SMDFI  LP+S GY  +FVVVDR S    
Subjt:  KNY----SWGHSGFLRTYKRLTEELFWVGMKSEI-----------------------------PQRMWEDISMDFIEGLPKSFGYEVIFVVVDRLSNYDH

Query:  FLCLKHPFDAKTVAELFVKEI
         +       A+  A +F + +
Subjt:  FLCLKHPFDAKTVAELFVKEI

P20825 Retrovirus-related Pol polyprotein from transposon 297    3.3e-54    34.45
Query:  LNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS------------------------------------------------RPYLRKFVLVFFDDILI
        LN +T+PD++PIP ++E+  +L     FT IDL                                                  RP L K  LV+ DDI+I
Subjt:  LNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS------------------------------------------------RPYLRKFVLVFFDDILI

Query:  YSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYLGHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLL
        +S  L +HLN ++ +   L    L     KC F +   ++LGHI++ +G++ +P K++AI  +PIP   +E+R FLGLTGYYRKF+ NY  IA  ++  L
Subjt:  YSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQFRVDYLGHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLL

Query:  KIGGFKWTEEAQV--AFNRLQQAMMSLPVLALPDFSVPFEIQTDASRYGLGAVLVQNRRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKK
        K      T++ +   AF +L+  ++  P+L LPDF   F + TDAS   LGAVL QN  PI++ S TL   +      E+EL+A+V A + +R YLLG++
Subjt:  KIGGFKWTEEAQV--AFNRLQQAMMSLPVLALPDFSVPFEIQTDASRYGLGAVLVQNRRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKK

Query:  FLVKTDQRSLQFLLEQRRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAADALSRVPL
        FL+ +D + L++ L   +    + ++W  +L  Y F++ Y  G EN  ADALSR+ +
Subjt:  FLVKTDQRSLQFLLEQRRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAADALSRVPL

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus    1.8e-52    30.11
Query:  DPVNVRPYRYAYQQKTEMERLVEEMLQGA------------------------------------LNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS
        DP+  + Y Y    + E+ER ++E+LQ                                      LN VT+PD +PIP I      L  A  FT +DL S
Subjt:  DPVNVRPYRYAYQQKTEMERLVEEMLQGA------------------------------------LNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKS

Query:  ------------------------------------------------RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQ
                                                        R ++ K   V+ DDI+++S+  + H  ++R +L  L K  L  N +K  F  
Subjt:  ------------------------------------------------RPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKNELYANKKKCSFAQ

Query:  FRVDYLGHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLK--IGGFKWTEEAQV----------AFNRLQQAMM
         +V++LG+I++ +G++ DP+K+RAI E P P +V+E++ FLG+T YYRKF+Q+Y  +A  L+ L +      K ++ ++V          +FN L+  + 
Subjt:  FRVDYLGHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLK--IGGFKWTEEAQV----------AFNRLQQAMM

Query:  SLPVLALPDFSVPFEIQTDASRYGLGAVLVQN----RRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFL-VKTDQRSLQFLLEQRRVI
        S  +LA P F+ PF + TDAS + +GAVL Q+     RPIAY S +L   +      E+E++A++ ++   R YL G   + V TD + L F L  R   
Subjt:  SLPVLALPDFSVPFEIQTDASRYGLGAVLVQN----RRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFL-VKTDQRSLQFLLEQRRVI

Query:  QPQYQKWIAKLLGYSFEVVYKPGLENKAADALSRVPLVAEINQLT
          + ++W A++  Y+ E++YKPG  N  ADALSR+P   ++NQL+
Subjt:  QPQYQKWIAKLLGYSFEVVYKPGLENKAADALSRVPLVAEINQLT

Q99315 Transposon Ty3-G Gag-Pol polyprotein (e value: 9.3e-49; %identity: 25.74)
Query:  LPPRRL------IEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLQG-------------------------------ALNNVTVPDKFPIPVIEE
        LPPR        ++H I +K G     ++PY    + + E+ ++V+++L                                  LN  T+ D FP+P I+ 
Subjt:  LPPRRL------IEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLQG-------------------------------ALNNVTVPDKFPIPVIEE

Query:  LFDELHGATMFTKIDLKS-------RPYLR---------------------------------------KFVLVFFDDILIYSKGLEDHLNHMRALLEVL
        L   +  A +FT +DL S        P  R                                       +FV V+ DDILI+S+  E+H  H+  +LE L
Subjt:  LFDELHGATMFTKIDLKS-------RPYLR---------------------------------------KFVLVFFDDILIYSKGLEDHLNHMRALLEVL

Query:  RKNELYANKKKCSFAQFRVDYLGHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIGGFKWTEEAQVAFNRLQ
        +   L   KKKC FA    ++LG+ I  + +     K  AI+++P P  V++ + FLG+  YYR+F+ N   IA  + QL      +WTE+   A ++L+
Subjt:  RKNELYANKKKCSFAQFRVDYLGHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIGGFKWTEEAQVAFNRLQ

Query:  QAMMSLPVLALPDFSVPFEIQTDASRYGLGAVL--VQNRRP----IAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLE
         A+ + PVL   +    + + TDAS+ G+GAVL  V N+      + Y+S +L    +  P  E EL+ ++ A+  +R  L GK F ++TD  SL   L+
Subjt:  QAMMSLPVLALPDFSVPFEIQTDASRYGLGAVL--VQNRRP----IAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLE

Query:  QRRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAADALSRV----------PLVAE-------INQLTVHTLIDLKVIKEEVMKDEFLKEICRLQGGEEV-
         +     + Q+W+  L  Y F + Y  G +N  ADA+SR           P+  E        + L    LI +K + +  +  E +      Q   E+ 
Subjt:  QRRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAADALSRV----------PLVAE-------INQLTVHTLIDLKVIKEEVMKDEFLKEICRLQGGEEV-

Query:  ----KNYS--------------------------------WGHSGFLRTYKRLTEELFWVGMKSEIPQRM-----------------------------W
            KNYS                                 GH G   T  +++   +W  ++  I Q +                             W
Subjt:  ----KNYS--------------------------------WGHSGFLRTYKRLTEELFWVGMKSEIPQRM-----------------------------W

Query:  EDISMDFIEGL-PKSFGYEVIFVVVDRLSNYDHFLCLKHPFDAKTVAELFVKEIAQNWTEARLITPKRMGRL
         DISMDF+ GL P S    +I VVVDR S   HF+  +   DA  + +L  + I       R IT  R  R+
Subjt:  EDISMDFIEGL-PKSFGYEVIFVVVDRLSNYDHFLCLKHPFDAKTVAELFVKEIAQNWTEARLITPKRMGRL

Arabidopsis top hits
ATMG00860.1 DNA/RNA polymerases superfamily protein (e value: 3.2e-36; %identity: 53.44)
Query:  LNHMRALLEVLRKNELYANKKKCSFAQFRVDYLG--HIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIGGFK
        +NH+  +L++  +++ YAN+KKC+F Q ++ YLG  HIIS EGV  DP K+ A+  WP P N  E+RGFLGLTGYYR+FV+NYG I   L++LLK    K
Subjt:  LNHMRALLEVLRKNELYANKKKCSFAQFRVDYLG--HIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIGGFK

Query:  WTEEAQVAFNRLQQAMMSLPVLALPDFSVPF
        WTE A +AF  L+ A+ +LPVLALPD  +PF
Subjt:  WTEEAQVAFNRLQQAMMSLPVLALPDFSVPF


Sequences
CDS sequence
ATGGTTATACTCTCTAGGTGTAACCGAAGTGGACTGGAAGAATTTGATTCTGACTTTTACAGACCAACAAAGAAGATTGTTATAAGGGGGGATCCAAGCCTGACAAAAGC
AAGGGTCAGTTTAAAAAATCTCATGAAATCTTGGGGAGAAGAGGATCAGGGATTTCTAGTGGAATGTCGTATGTTGGAAAGAAGAGAGTCATCGGAGGAGGAAGATTCGA
TTGAGGAAGTGTTGACTATAGAAGAATCAATGGCAGTTGTGTTGGAAAGATTTGAGGATGCGTTCGAATGGCCCGAAACATTACCTCCACGTAGATTGATAGAACATCAT
ATCCATCTAAAGAAAGGAACCGACCCAGTAAATGTTCGTCCTTACCGCTATGCATATCAACAAAAGACAGAAATGGAGAGATTAGTGGAAGAGATGCTTCAGGGAGCTCT
AAATAATGTAACAGTACCAGACAAGTTCCCAATACCAGTGATTGAGGAGTTGTTTGATGAGTTACATGGAGCTACTATGTTTACTAAGATAGATCTTAAATCAAGGCCAT
ATCTCCGGAAATTTGTCTTAGTGTTTTTTGATGATATCTTGATCTATAGCAAAGGGTTGGAGGATCATTTAAATCATATGAGAGCTTTGTTAGAAGTGTTGAGGAAGAAT
GAATTATATGCAAATAAGAAGAAATGCAGCTTTGCTCAATTTCGGGTGGATTACTTGGGGCATATTATTTCAAGAGAGGGAGTGGAAGTGGATCCTGAGAAAATCAGAGC
TATAAAGGAATGGCCAATTCCAGCTAATGTGAGGGAGGTTCGAGGATTTCTTGGGTTGACCGGATATTATCGTAAATTTGTTCAAAATTATGGGACAATTGCAGCTTCTC
TATCACAGCTGTTGAAGATAGGGGGGTTTAAATGGACAGAGGAAGCTCAAGTGGCTTTTAATAGGCTACAACAAGCGATGATGTCTCTTCCTGTATTAGCTCTACCAGAT
TTCAGTGTGCCATTTGAAATCCAAACTGATGCCTCAAGGTATGGATTAGGAGCTGTTTTGGTGCAGAATCGGCGGCCAATTGCTTATTATAGCCATACATTGGCAATGAG
AGATAGGGCTAAACCTGTATATGAAAGGGAGCTGATGGCAGTAGTTATGGCTGTACAAAGGTGGCGTCCATACCTATTGGGGAAGAAGTTCCTAGTCAAAACTGATCAAC
GGTCTTTACAGTTTTTATTGGAGCAAAGGAGGGTAATACAACCACAATACCAAAAATGGATTGCAAAGTTGCTGGGCTATTCATTCGAGGTGGTCTATAAACCTGGCTTA
GAAAACAAGGCTGCTGATGCCTTATCCCGAGTACCCCTTGTGGCAGAAATTAACCAACTAACGGTCCATACGTTGATTGATCTGAAGGTTATAAAGGAGGAGGTAATGAA
AGACGAATTCTTGAAAGAGATCTGCAGATTGCAGGGAGGAGAGGAGGTAAAGAATTACTCATGGGGACACTCCGGGTTCTTAAGAACGTATAAGAGGCTAACAGAAGAAC
TCTTTTGGGTCGGCATGAAATCGGAAATACCACAGAGGATGTGGGAAGACATCTCTATGGACTTTATTGAGGGATTACCAAAATCTTTTGGGTATGAGGTAATTTTTGTG
GTGGTGGATCGGTTAAGTAATTACGACCACTTCTTATGTCTAAAGCACCCCTTTGATGCCAAGACCGTAGCTGAATTATTCGTTAAGGAAATAGCACAAAATTGGACAGA
AGCACGACTTATCACCCCCAAACGGATGGGTAGACTGAGGTGGCTAACAGATCAGTGGAAATTTACCTACGCTGTTTCTGTGGTGAAAGACCAAAAGAGTGGATGA
mRNA sequence
ATGGTTATACTCTCTAGGTGTAACCGAAGTGGACTGGAAGAATTTGATTCTGACTTTTACAGACCAACAAAGAAGATTGTTATAAGGGGGGATCCAAGCCTGACAAAAGC
AAGGGTCAGTTTAAAAAATCTCATGAAATCTTGGGGAGAAGAGGATCAGGGATTTCTAGTGGAATGTCGTATGTTGGAAAGAAGAGAGTCATCGGAGGAGGAAGATTCGA
TTGAGGAAGTGTTGACTATAGAAGAATCAATGGCAGTTGTGTTGGAAAGATTTGAGGATGCGTTCGAATGGCCCGAAACATTACCTCCACGTAGATTGATAGAACATCAT
ATCCATCTAAAGAAAGGAACCGACCCAGTAAATGTTCGTCCTTACCGCTATGCATATCAACAAAAGACAGAAATGGAGAGATTAGTGGAAGAGATGCTTCAGGGAGCTCT
AAATAATGTAACAGTACCAGACAAGTTCCCAATACCAGTGATTGAGGAGTTGTTTGATGAGTTACATGGAGCTACTATGTTTACTAAGATAGATCTTAAATCAAGGCCAT
ATCTCCGGAAATTTGTCTTAGTGTTTTTTGATGATATCTTGATCTATAGCAAAGGGTTGGAGGATCATTTAAATCATATGAGAGCTTTGTTAGAAGTGTTGAGGAAGAAT
GAATTATATGCAAATAAGAAGAAATGCAGCTTTGCTCAATTTCGGGTGGATTACTTGGGGCATATTATTTCAAGAGAGGGAGTGGAAGTGGATCCTGAGAAAATCAGAGC
TATAAAGGAATGGCCAATTCCAGCTAATGTGAGGGAGGTTCGAGGATTTCTTGGGTTGACCGGATATTATCGTAAATTTGTTCAAAATTATGGGACAATTGCAGCTTCTC
TATCACAGCTGTTGAAGATAGGGGGGTTTAAATGGACAGAGGAAGCTCAAGTGGCTTTTAATAGGCTACAACAAGCGATGATGTCTCTTCCTGTATTAGCTCTACCAGAT
TTCAGTGTGCCATTTGAAATCCAAACTGATGCCTCAAGGTATGGATTAGGAGCTGTTTTGGTGCAGAATCGGCGGCCAATTGCTTATTATAGCCATACATTGGCAATGAG
AGATAGGGCTAAACCTGTATATGAAAGGGAGCTGATGGCAGTAGTTATGGCTGTACAAAGGTGGCGTCCATACCTATTGGGGAAGAAGTTCCTAGTCAAAACTGATCAAC
GGTCTTTACAGTTTTTATTGGAGCAAAGGAGGGTAATACAACCACAATACCAAAAATGGATTGCAAAGTTGCTGGGCTATTCATTCGAGGTGGTCTATAAACCTGGCTTA
GAAAACAAGGCTGCTGATGCCTTATCCCGAGTACCCCTTGTGGCAGAAATTAACCAACTAACGGTCCATACGTTGATTGATCTGAAGGTTATAAAGGAGGAGGTAATGAA
AGACGAATTCTTGAAAGAGATCTGCAGATTGCAGGGAGGAGAGGAGGTAAAGAATTACTCATGGGGACACTCCGGGTTCTTAAGAACGTATAAGAGGCTAACAGAAGAAC
TCTTTTGGGTCGGCATGAAATCGGAAATACCACAGAGGATGTGGGAAGACATCTCTATGGACTTTATTGAGGGATTACCAAAATCTTTTGGGTATGAGGTAATTTTTGTG
GTGGTGGATCGGTTAAGTAATTACGACCACTTCTTATGTCTAAAGCACCCCTTTGATGCCAAGACCGTAGCTGAATTATTCGTTAAGGAAATAGCACAAAATTGGACAGA
AGCACGACTTATCACCCCCAAACGGATGGGTAGACTGAGGTGGCTAACAGATCAGTGGAAATTTACCTACGCTGTTTCTGTGGTGAAAGACCAAAAGAGTGGATGA
Protein sequence
MVILSRCNRSGLEEFDSDFYRPTKKIVIRGDPSLTKARVSLKNLMKSWGEEDQGFLVECRMLERRESSEEEDSIEEVLTIEESMAVVLERFEDAFEWPETLPPRRLIEHH
IHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLQGALNNVTVPDKFPIPVIEELFDELHGATMFTKIDLKSRPYLRKFVLVFFDDILIYSKGLEDHLNHMRALLEVLRKN
ELYANKKKCSFAQFRVDYLGHIISREGVEVDPEKIRAIKEWPIPANVREVRGFLGLTGYYRKFVQNYGTIAASLSQLLKIGGFKWTEEAQVAFNRLQQAMMSLPVLALPD
FSVPFEIQTDASRYGLGAVLVQNRRPIAYYSHTLAMRDRAKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLQFLLEQRRVIQPQYQKWIAKLLGYSFEVVYKPGL
ENKAADALSRVPLVAEINQLTVHTLIDLKVIKEEVMKDEFLKEICRLQGGEEVKNYSWGHSGFLRTYKRLTEELFWVGMKSEIPQRMWEDISMDFIEGLPKSFGYEVIFV
VVDRLSNYDHFLCLKHPFDAKTVAELFVKEIAQNWTEARLITPKRMGRLRWLTDQWKFTYAVSVVKDQKSG