; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036626 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036626
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
Genome locationchr3:49591664..49594351
RNA-Seq ExpressionLag0036626
SyntenyLag0036626
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN80093.1 hypothetical protein VITISV_010721 [Vitis vinifera]6.9e-10834.7Show/hide
Query:  LGRSISDHFPLLFEAGNFQWGPSPFRFYNSWLNQSDCVQIIESSLSQDSSYGWAGFVIACKLRNLKSTLKDWYADYESKRKSKEKGLLAEINRLDTLSDQ
        L R  SDH+P++ E   F+WGP+PF F N WL      +       +    GW G     KL+ +K+ LK+W      +   ++K +L+++   D+L  +
Subjt:  LGRSISDHFPLLFEAGNFQWGPSPFRFYNSWLNQSDCVQIIESSLSQDSSYGWAGFVIACKLRNLKSTLKDWYADYESKRKSKEKGLLAEINRLDTLSDQ

Query:  CPLSSLEQSLRAVARGELLELYMSEERNLIQRCKLQWLKAGDENTNFFHRFLAAKKRKLLITNLCSVDGDSLMAFRDIESEILGYFESLYTKIPGKRFIP
          LS      RA+ +GEL EL + EE +  Q+ +++W+K GD N+ FFH+    ++ +  I  L + +G  +     I+ EIL YFE LY    G+ +  
Subjt:  CPLSSLEQSLRAVARGELLELYMSEERNLIQRCKLQWLKAGDENTNFFHRFLAAKKRKLLITNLCSVDGDSLMAFRDIESEILGYFESLYTKIPGKRFIP

Query:  LNLHWLTVS------SAQNSLLVAPFPPDEI---------------------------------WKEILDPILIANEAVEEYRIKKKKGWILKLDFEKAF
          L W  +S          S  ++ F P  +                                  ++ILD +LIANE V+E R   ++G + K+DFEKA+
Subjt:  LNLHWLTVS------SAQNSLLVAPFPPDEI---------------------------------WKEILDPILIANEAVEEYRIKKKKGWILKLDFEKAF

Query:  DCVDWDFLDKVLELKAFSSKWIQWVQGCVRNPKFSIFINGRPRGRILASRGLRQGDPLSPFLFLLVSEVLSALIDEAYCKGLYQGFIVGSDNVHVSILQF
        D V WDFLD VLE+K F  +W +W++GC+ +  F++ +NG  +G + ASRGLRQGDPLSPFLF +V++VLS ++ +A  + + +GF VG +   VS LQF
Subjt:  DCVDWDFLDKVLELKAFSSKWIQWVQGCVRNPKFSIFINGRPRGRILASRGLRQGDPLSPFLFLLVSEVLSALIDEAYCKGLYQGFIVGSDNVHVSILQF

Query:  ADDTLLFCKFDDNMLDVLVQTIRFFEWCSGLKINWEKSSLSGVNVDESQLQQVSQRLNCKKEVLPFIYLGLPLGGYPKKAIFWQPVIDKIQKKLDRWKHF
        ADDT+ F    +  +  L   +  F   SGLK+N +KS++ G+N++++ L ++++ L+CK    P +YLGLPLGG PK   FW PVI++I ++LD W   
Subjt:  ADDTLLFCKFDDNMLDVLVQTIRFFEWCSGLKINWEKSSLSGVNVDESQLQQVSQRLNCKKEVLPFIYLGLPLGGYPKKAIFWQPVIDKIQKKLDRWKHF

Query:  NLSRGGRLTLCNSVLSSGKCSF-SLRSPWMSIST---------VWKRI-----EYLAHFKLGNGKRISFWNDPWIANRPLKLKFPQLFSLSSSPNGSVSD
         LS G R+TL  S L+   C F SL     S++          +W  +     ++L ++   NG RI  W+D W   +PL +++P+L S+ +  N  +S 
Subjt:  NLSRGGRLTLCNSVLSSGKCSF-SLRSPWMSIST---------VWKRI-----EYLAHFKLGNGKRISFWNDPWIANRPLKLKFPQLFSLSSSPNGSVSD

Query:  YWDASH-LSWNVIFRRFL-KEEIPDFQSLLESLSSISIAP-FDDNCIWSLEPSGCFSVKS
            +   SWN  F R L   EI D + L+ S   + I+P   D   WSL P G F+VKS
Subjt:  YWDASH-LSWNVIFRRFL-KEEIPDFQSLLESLSSISIAP-FDDNCIWSLEPSGCFSVKS

RVW39368.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]2.6e-10732.75Show/hide
Query:  LGRSISDHFPLLFEAGNFQWGPSPFRFYNSWLNQSDCVQIIESSLSQDSSYGWAGFVIACKLRNLKSTLKDWYADYESKRKSKEKGLLAEINRLDTLSDQ
        L R  SDH P+  E    +WGP+PFRF N WL   +  +       + +  GW G     KL+ +KS LK+W        K ++K +L +++R+D +  +
Subjt:  LGRSISDHFPLLFEAGNFQWGPSPFRFYNSWLNQSDCVQIIESSLSQDSSYGWAGFVIACKLRNLKSTLKDWYADYESKRKSKEKGLLAEINRLDTLSDQ

Query:  CPLSS---LEQSLRAVARGELLELYMSEERNLIQRCKLQWLKAGDENTNFFHRFLAAKKRKLLITNLCSVDGDSLMAFRDIESEILGYFESLYTKIPGKR
          L+S   LE++LR   R EL ++ + EE    Q+ +++W+K GD N+ FFHR    ++ +  I +L S  G++L    DI  EI+ +F +LY+K  G+ 
Subjt:  CPLSS---LEQSLRAVARGELLELYMSEERNLIQRCKLQWLKAGDENTNFFHRFLAAKKRKLLITNLCSVDGDSLMAFRDIESEILGYFESLYTKIPGKR

Query:  FIPLNLHWLTVSSAQNSLLVAPFPPDEI--------------------------W---------------------------------------KEILDP
        +    + W+ +S      L  PF  +E+                          W                                       + ILD 
Subjt:  FIPLNLHWLTVSSAQNSLLVAPFPPDEI--------------------------W---------------------------------------KEILDP

Query:  ILIANEAVEEYRIKKKKGWILKLDFEKAFDCVDWDFLDKVLELKAFSSKWIQWVQGCVRNPKFSIFINGRPRGRILASRGLRQGDPLSPFLFLLVSEVLS
        +LIANE V+E R   ++G + K+DFEKA+D VDW FLD VL+ K FS KW  W++GC+ +  F+I +NG  +G + ASRGLRQGDPLSPFLF LV++VLS
Subjt:  ILIANEAVEEYRIKKKKGWILKLDFEKAFDCVDWDFLDKVLELKAFSSKWIQWVQGCVRNPKFSIFINGRPRGRILASRGLRQGDPLSPFLFLLVSEVLS

Query:  ALIDEAYCKGLYQGFIVGSDNVHVSILQFADDTLLFCKFDDNMLDVLVQTIRFFEWCSGLKINWEKSSLSGVNVDESQLQQVSQRLNCKKEVLPFIYLGL
         ++  A   GL +GF VG D   VS+LQFADDT+ F K     L  L   +  F   SGLKIN EKS++SG+N  +  L  ++   +C+    P  YLGL
Subjt:  ALIDEAYCKGLYQGFIVGSDNVHVSILQFADDTLLFCKFDDNMLDVLVQTIRFFEWCSGLKINWEKSSLSGVNVDESQLQQVSQRLNCKKEVLPFIYLGL

Query:  PLGGYPKKAIFWQPVIDKIQKKLDRWKHFNLSRGGRLTLCNSVLSS------------------------------------------------------
        PLGG PK   FW PV+++I ++LD WK   LS GGR+TL  S LS                                                       
Subjt:  PLGGYPKKAIFWQPVIDKIQKKLDRWKHFNLSRGGRLTLCNSVLSS------------------------------------------------------

Query:  --GKCS--------------------------------------------FSLRSPWMSISTVWKRIEYLAHFKLGNGKRISFWNDPWIANRPLKLKFPQ
          GK S                                            +S R PW +I+ V++         +GNG+RI FW D W  N+ L  +F  
Subjt:  --GKCS--------------------------------------------FSLRSPWMSISTVWKRIEYLAHFKLGNGKRISFWNDPWIANRPLKLKFPQ

Query:  LFSLSSSPNGSVSDYWDASH-LSWNVIFRRFLKE-EIPDFQSLLESLSSISIAP-FDDNCIWSLEPSGCFSVKSLTRHLASSLWRKRFLMLCGKLRAREE
        L+ + S  N +VS+    S  L+WN+ FRR L + EI   Q L+ SLSS+  +P   D+  WSL  SG FSVKS    LA S      L L  K     +
Subjt:  LFSLSSSPNGSVSDYWDASH-LSWNVIFRRFLKE-EIPDFQSLLESLSSISIAP-FDDNCIWSLEPSGCFSVKSLTRHLASSLWRKRFLMLCGKLRAREE

Query:  LPS
        +PS
Subjt:  LPS

RVW90400.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]4.3e-11032.56Show/hide
Query:  LGRSISDHFPLLFEAGNFQWGPSPFRFYNSWLNQSDCVQIIESSLSQDSSYGWAGFVIACKLRNLKSTLKDWYADYESKRKSKEKGLLAEINRLDTLSDQ
        L R  SDH+P++ E   F+WGP+PFRF N WL  S   +      S+    GW G     KL+ +K+ LK+W      +   K+K +LA +   D+L  +
Subjt:  LGRSISDHFPLLFEAGNFQWGPSPFRFYNSWLNQSDCVQIIESSLSQDSSYGWAGFVIACKLRNLKSTLKDWYADYESKRKSKEKGLLAEINRLDTLSDQ

Query:  CPLSSLEQSLRAVARGELLELYMSEERNLIQRCKLQWLKAGDENTNFFHRFLAAKKRKLLITNLCSVDGDSLMAFRDIESEILGYFESLYTKIPGKRFIP
          LS      RA ++GEL EL + EE +  Q+ +++W+K GD N+ FFH+    ++ +  I  L +  G  L     I+ EIL YFE LY    G+ +  
Subjt:  CPLSSLEQSLRAVARGELLELYMSEERNLIQRCKLQWLKAGDENTNFFHRFLAAKKRKLLITNLCSVDGDSLMAFRDIESEILGYFESLYTKIPGKRFIP

Query:  LNLHWLTVSSAQNSLLVAPFPPDEIWK-------------------------------------------------------------------------
          L W  +     S L  PF  +EI+K                                                                         
Subjt:  LNLHWLTVSSAQNSLLVAPFPPDEIWK-------------------------------------------------------------------------

Query:  ------------------------------------EILDPILIANEAVEEYRIKKKKGWILKLDFEKAFDCVDWDFLDKVLELKAFSSKWIQWVQGCVR
                                            +ILD +LIANE V+E R   ++G + K+DFEKA+D V WDFLD VLE+K FS +W +W++GC+ 
Subjt:  ------------------------------------EILDPILIANEAVEEYRIKKKKGWILKLDFEKAFDCVDWDFLDKVLELKAFSSKWIQWVQGCVR

Query:  NPKFSIFINGRPRGRILASRGLRQGDPLSPFLFLLVSEVLSALIDEAYCKGLYQGFIVGSDNVHVSILQFADDTLLFCKFDDNMLDVLVQTIRFFEWCSG
        +  +++ +NG  +G + ASRGLRQGDPLSPFLF +V++VLS ++ +A  + + +GF VG +   VS LQFADDT+ F    +  L  L   +  F   SG
Subjt:  NPKFSIFINGRPRGRILASRGLRQGDPLSPFLFLLVSEVLSALIDEAYCKGLYQGFIVGSDNVHVSILQFADDTLLFCKFDDNMLDVLVQTIRFFEWCSG

Query:  LKINWEKSSLSGVNVDESQLQQVSQRLNCKKEVLPFIYLGLPLGGYPKKAIFWQPVIDKIQKKLDRWKHFNLSRGGR---------LTLCNSVLSS----
        LK+N +KS++ G+N++++ L +++  L+CK    P +YLGLPLGG PK + FW PVI++I ++LD W+   LS GG+           L + V+ S    
Subjt:  LKINWEKSSLSGVNVDESQLQQVSQRLNCKKEVLPFIYLGLPLGGYPKKAIFWQPVIDKIQKKLDRWKHFNLSRGGR---------LTLCNSVLSS----

Query:  --------GKCSFSLRSPWMSISTVWKRIEYLAHFKLGNGKRISFWNDPWIANRPLKLKFPQLFSLSSSPNGSVSDYWDASH-LSWNVIFRRFLKE-EIP
                    +S R PW +I+ V++       F +G+G RI FW+D W  ++ L  ++P+L S+ +  N  +S     S   SWN  FRR L + EI 
Subjt:  --------GKCSFSLRSPWMSISTVWKRIEYLAHFKLGNGKRISFWNDPWIANRPLKLKFPQLFSLSSSPNGSVSDYWDASH-LSWNVIFRRFLKE-EIP

Query:  DFQSLLESLSSISIAP-FDDNCIWSLEPSGCFSVKSLTRHLASSLWRKR--FLMLCGKLRAREEL-PSLCGSCL
        D +SL+ SL  + I+P   D   WS+ PSG F+VKS      + L R +   L L G L+  + L P++C  C+
Subjt:  DFQSLLESLSSISIAP-FDDNCIWSLEPSGCFSVKSLTRHLASSLWRKR--FLMLCGKLRAREEL-PSLCGSCL

RVW91038.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]2.2e-10632.62Show/hide
Query:  LGRSISDHFPLLFEAGNFQWGPSPFRFYNSWLNQSDCVQIIESSLSQDSSYGWAGFVIACKLRNLKSTLKDWYADYESKRKSKEKGLLAEINRLDTLSDQ
        L R  SDH+P++ E   F+WGP+PFRF N WL  S   +      S+    GW G     KL+ +K+ LK+W      +   K+K +LA +   D+L  +
Subjt:  LGRSISDHFPLLFEAGNFQWGPSPFRFYNSWLNQSDCVQIIESSLSQDSSYGWAGFVIACKLRNLKSTLKDWYADYESKRKSKEKGLLAEINRLDTLSDQ

Query:  CPLSSLEQSLRAVARGELLELYMSEERNLIQRCKLQWLKAGDENTNFFHRFLAAKKRKLLITNLCSVDGDSLMAFRDIESEILGYFESLYTKIPGKRFIP
          LS      RA ++GEL EL + EE +  Q+ +++W+K GD N+ FFH+    ++ +  I  L +  G  L     I+ EIL YFE LY     + +  
Subjt:  CPLSSLEQSLRAVARGELLELYMSEERNLIQRCKLQWLKAGDENTNFFHRFLAAKKRKLLITNLCSVDGDSLMAFRDIESEILGYFESLYTKIPGKRFIP

Query:  LNLHWLTVSSAQNSLLVAPFPPDEIWK---------------------------------------------------------------------EILD
          L W  +     S L +PF  +EI+K                                                                     +ILD
Subjt:  LNLHWLTVSSAQNSLLVAPFPPDEIWK---------------------------------------------------------------------EILD

Query:  PILIANEAVEEYRIKKKKGWILKLDFEKAFDCVDWDFLDKVLELKAFSSKWIQWVQGCVRNPKFSIFINGRPRGRILASRGLRQGDPLSPFLFLLVSEVL
         +LIANE V+E R   ++G + K+DFEKA+D + WDFLD VLE+K FS +W +W++GC+ +  +++ +NG  +G + ASRGLRQGDPLSPFLF +V++VL
Subjt:  PILIANEAVEEYRIKKKKGWILKLDFEKAFDCVDWDFLDKVLELKAFSSKWIQWVQGCVRNPKFSIFINGRPRGRILASRGLRQGDPLSPFLFLLVSEVL

Query:  SALIDEAYCKGLYQGFIVGSDNVHVSILQFADDTLLFCKFDDNMLDVLVQTIRFFEWCSGLKINWEKSSLSGVNVDESQLQQVSQRLNCKKEVLPFIYLG
        S ++ +A  + + +GF VG +   VS LQFADDT+ F    +  L  L   +  F   SGLK+N +KS++ G+N++++ L +++  L+CK    P +YLG
Subjt:  SALIDEAYCKGLYQGFIVGSDNVHVSILQFADDTLLFCKFDDNMLDVLVQTIRFFEWCSGLKINWEKSSLSGVNVDESQLQQVSQRLNCKKEVLPFIYLG

Query:  LPLGGYPKKAIFWQPVIDKIQKKLDRW---------------------------KHFNL----------SRG----GRLTLCNSVL--------------
        LPLGG PK + FW PVI++I ++LD W                           K  +L          SRG    G++++ N  L              
Subjt:  LPLGGYPKKAIFWQPVIDKIQKKLDRW---------------------------KHFNL----------SRG----GRLTLCNSVL--------------

Query:  -------------SSG-----KCSFSLRSPWMSISTVWKRIEYLAHFKLGNGKRISFWNDPWIANRPLKLKFPQLFSLSSSPNGSVSDYWDASH-LSWNV
                     S+G        +S R PW +I+ V++       F +G+G RI FW+D W  ++PL  ++P+L S+ +  N  +S     S   SWN 
Subjt:  -------------SSG-----KCSFSLRSPWMSISTVWKRIEYLAHFKLGNGKRISFWNDPWIANRPLKLKFPQLFSLSSSPNGSVSDYWDASH-LSWNV

Query:  IFRRFLKE-EIPDFQSLLESLSSISIAP-FDDNCIWSLEPSGCFSVKS
         FRR L + EI D +SL+ SL  + I+P   D   WS+ PSG F+VKS
Subjt:  IFRRFLKE-EIPDFQSLLESLSSISIAP-FDDNCIWSLEPSGCFSVKS

RVX19456.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]1.8e-10833.1Show/hide
Query:  LGRSISDHFPLLFEAGNFQWGPSPFRFYNSWLNQSDCVQIIESSLSQDSSYGWAGFVIACKLRNLKSTLKDWYADYESKRKSKEKGLLAEINRLDTLSDQ
        L R  SDH+P++ E   F+WGP+PFRF N WL+     +       +    GW G     +L+ LK+ LK+W  +       ++K +L +I   D++  +
Subjt:  LGRSISDHFPLLFEAGNFQWGPSPFRFYNSWLNQSDCVQIIESSLSQDSSYGWAGFVIACKLRNLKSTLKDWYADYESKRKSKEKGLLAEINRLDTLSDQ

Query:  CPLSSLEQSLRAVARGELLELYMSEERNLIQRCKLQWLKAGDENTNFFHRFLAAKKRKLLITNLCSVDGDSLMAFRDIESEILGYFESLYTKIPGKRFIP
          LS      RAV +GEL EL + EE +  Q+ +++W+K GD N+ FFH+    ++ +  I  L +  G  L     I+ EIL YFE LY    G+ +  
Subjt:  CPLSSLEQSLRAVARGELLELYMSEERNLIQRCKLQWLKAGDENTNFFHRFLAAKKRKLLITNLCSVDGDSLMAFRDIESEILGYFESLYTKIPGKRFIP

Query:  LNLHWLTVSSAQNSLLVAPFPPDEIWK------------------------------------------------------EILDPILIANEAVEEYRIK
          L W  +S    S L +PF  +EI+K                                                      +ILD +LIANE V+E +  
Subjt:  LNLHWLTVSSAQNSLLVAPFPPDEIWK------------------------------------------------------EILDPILIANEAVEEYRIK

Query:  KKKGWILKLDFEKAFDCVDWDFLDKVLELKAFSSKWIQWVQGCVRNPKFSIFINGRPRGRILASRGLRQGDPLSPFLFLLVSEVLSALIDEAYCKGLYQG
         ++G + K+DFEKA+D V WDFLD V+E K F+ KW +W++GC+ +  F+I +NG  +G + ASRGLRQGDPLSPFLF +V++VLS ++  A  + +++G
Subjt:  KKKGWILKLDFEKAFDCVDWDFLDKVLELKAFSSKWIQWVQGCVRNPKFSIFINGRPRGRILASRGLRQGDPLSPFLFLLVSEVLSALIDEAYCKGLYQG

Query:  FIVGSDNVHVSILQFADDTLLFCKFDDNMLDVLVQTIRFFEWCSGLKINWEKSSLSGVNVDESQLQQVSQRLNCKKEVLPFIYLGLPLGGYPKKAIFWQP
        F VG +   VS LQFADDT+ F    +  L  L   +  F   S LK+N +KS++ G+N+ +  L ++++ L+CK    P +YLGLPLGG PK   FW P
Subjt:  FIVGSDNVHVSILQFADDTLLFCKFDDNMLDVLVQTIRFFEWCSGLKINWEKSSLSGVNVDESQLQQVSQRLNCKKEVLPFIYLGLPLGGYPKKAIFWQP

Query:  VIDKIQKKLDRWKHFNLSRGGRLTLCNSVLSSGKCSF---------------------------------------------------------------
        VI++I  +LD W+   LS GGR+TL  S L+   C F                                                               
Subjt:  VIDKIQKKLDRWKHFNLSRGGRLTLCNSVLSSGKCSF---------------------------------------------------------------

Query:  -----SLRSPWMSISTVWKRIEYLAHFKLGNGKRISFWNDPWIANRPLKLKFPQLFSLSSSPNGSVSDYWDASH-LSWNVIFRRFLKE-EIPDFQSLLES
             S R PW +I+ V++       F +G+G RI FW D W  ++ L ++FP+L  +    N  +S    ++   SWN  FRR L + EI + +SL++S
Subjt:  -----SLRSPWMSISTVWKRIEYLAHFKLGNGKRISFWNDPWIANRPLKLKFPQLFSLSSSPNGSVSDYWDASH-LSWNVIFRRFLKE-EIPDFQSLLES

Query:  LSSISIAP-FDDNCIWSLEPSGCFSVKS
        L  I ++P   D   WSL  SG F+VKS
Subjt:  LSSISIAP-FDDNCIWSLEPSGCFSVKS

TrEMBL top hitse value%identityAlignment
A0A438DVQ8 LINE-1 retrotransposable element ORF2 protein1.3e-10732.75Show/hide
Query:  LGRSISDHFPLLFEAGNFQWGPSPFRFYNSWLNQSDCVQIIESSLSQDSSYGWAGFVIACKLRNLKSTLKDWYADYESKRKSKEKGLLAEINRLDTLSDQ
        L R  SDH P+  E    +WGP+PFRF N WL   +  +       + +  GW G     KL+ +KS LK+W        K ++K +L +++R+D +  +
Subjt:  LGRSISDHFPLLFEAGNFQWGPSPFRFYNSWLNQSDCVQIIESSLSQDSSYGWAGFVIACKLRNLKSTLKDWYADYESKRKSKEKGLLAEINRLDTLSDQ

Query:  CPLSS---LEQSLRAVARGELLELYMSEERNLIQRCKLQWLKAGDENTNFFHRFLAAKKRKLLITNLCSVDGDSLMAFRDIESEILGYFESLYTKIPGKR
          L+S   LE++LR   R EL ++ + EE    Q+ +++W+K GD N+ FFHR    ++ +  I +L S  G++L    DI  EI+ +F +LY+K  G+ 
Subjt:  CPLSS---LEQSLRAVARGELLELYMSEERNLIQRCKLQWLKAGDENTNFFHRFLAAKKRKLLITNLCSVDGDSLMAFRDIESEILGYFESLYTKIPGKR

Query:  FIPLNLHWLTVSSAQNSLLVAPFPPDEI--------------------------W---------------------------------------KEILDP
        +    + W+ +S      L  PF  +E+                          W                                       + ILD 
Subjt:  FIPLNLHWLTVSSAQNSLLVAPFPPDEI--------------------------W---------------------------------------KEILDP

Query:  ILIANEAVEEYRIKKKKGWILKLDFEKAFDCVDWDFLDKVLELKAFSSKWIQWVQGCVRNPKFSIFINGRPRGRILASRGLRQGDPLSPFLFLLVSEVLS
        +LIANE V+E R   ++G + K+DFEKA+D VDW FLD VL+ K FS KW  W++GC+ +  F+I +NG  +G + ASRGLRQGDPLSPFLF LV++VLS
Subjt:  ILIANEAVEEYRIKKKKGWILKLDFEKAFDCVDWDFLDKVLELKAFSSKWIQWVQGCVRNPKFSIFINGRPRGRILASRGLRQGDPLSPFLFLLVSEVLS

Query:  ALIDEAYCKGLYQGFIVGSDNVHVSILQFADDTLLFCKFDDNMLDVLVQTIRFFEWCSGLKINWEKSSLSGVNVDESQLQQVSQRLNCKKEVLPFIYLGL
         ++  A   GL +GF VG D   VS+LQFADDT+ F K     L  L   +  F   SGLKIN EKS++SG+N  +  L  ++   +C+    P  YLGL
Subjt:  ALIDEAYCKGLYQGFIVGSDNVHVSILQFADDTLLFCKFDDNMLDVLVQTIRFFEWCSGLKINWEKSSLSGVNVDESQLQQVSQRLNCKKEVLPFIYLGL

Query:  PLGGYPKKAIFWQPVIDKIQKKLDRWKHFNLSRGGRLTLCNSVLSS------------------------------------------------------
        PLGG PK   FW PV+++I ++LD WK   LS GGR+TL  S LS                                                       
Subjt:  PLGGYPKKAIFWQPVIDKIQKKLDRWKHFNLSRGGRLTLCNSVLSS------------------------------------------------------

Query:  --GKCS--------------------------------------------FSLRSPWMSISTVWKRIEYLAHFKLGNGKRISFWNDPWIANRPLKLKFPQ
          GK S                                            +S R PW +I+ V++         +GNG+RI FW D W  N+ L  +F  
Subjt:  --GKCS--------------------------------------------FSLRSPWMSISTVWKRIEYLAHFKLGNGKRISFWNDPWIANRPLKLKFPQ

Query:  LFSLSSSPNGSVSDYWDASH-LSWNVIFRRFLKE-EIPDFQSLLESLSSISIAP-FDDNCIWSLEPSGCFSVKSLTRHLASSLWRKRFLMLCGKLRAREE
        L+ + S  N +VS+    S  L+WN+ FRR L + EI   Q L+ SLSS+  +P   D+  WSL  SG FSVKS    LA S      L L  K     +
Subjt:  LFSLSSSPNGSVSDYWDASH-LSWNVIFRRFLKE-EIPDFQSLLESLSSISIAP-FDDNCIWSLEPSGCFSVKSLTRHLASSLWRKRFLMLCGKLRAREE

Query:  LPS
        +PS
Subjt:  LPS

A0A438I181 Transposon TX1 uncharacterized 149 kDa protein2.1e-11032.56Show/hide
Query:  LGRSISDHFPLLFEAGNFQWGPSPFRFYNSWLNQSDCVQIIESSLSQDSSYGWAGFVIACKLRNLKSTLKDWYADYESKRKSKEKGLLAEINRLDTLSDQ
        L R  SDH+P++ E   F+WGP+PFRF N WL  S   +      S+    GW G     KL+ +K+ LK+W      +   K+K +LA +   D+L  +
Subjt:  LGRSISDHFPLLFEAGNFQWGPSPFRFYNSWLNQSDCVQIIESSLSQDSSYGWAGFVIACKLRNLKSTLKDWYADYESKRKSKEKGLLAEINRLDTLSDQ

Query:  CPLSSLEQSLRAVARGELLELYMSEERNLIQRCKLQWLKAGDENTNFFHRFLAAKKRKLLITNLCSVDGDSLMAFRDIESEILGYFESLYTKIPGKRFIP
          LS      RA ++GEL EL + EE +  Q+ +++W+K GD N+ FFH+    ++ +  I  L +  G  L     I+ EIL YFE LY    G+ +  
Subjt:  CPLSSLEQSLRAVARGELLELYMSEERNLIQRCKLQWLKAGDENTNFFHRFLAAKKRKLLITNLCSVDGDSLMAFRDIESEILGYFESLYTKIPGKRFIP

Query:  LNLHWLTVSSAQNSLLVAPFPPDEIWK-------------------------------------------------------------------------
          L W  +     S L  PF  +EI+K                                                                         
Subjt:  LNLHWLTVSSAQNSLLVAPFPPDEIWK-------------------------------------------------------------------------

Query:  ------------------------------------EILDPILIANEAVEEYRIKKKKGWILKLDFEKAFDCVDWDFLDKVLELKAFSSKWIQWVQGCVR
                                            +ILD +LIANE V+E R   ++G + K+DFEKA+D V WDFLD VLE+K FS +W +W++GC+ 
Subjt:  ------------------------------------EILDPILIANEAVEEYRIKKKKGWILKLDFEKAFDCVDWDFLDKVLELKAFSSKWIQWVQGCVR

Query:  NPKFSIFINGRPRGRILASRGLRQGDPLSPFLFLLVSEVLSALIDEAYCKGLYQGFIVGSDNVHVSILQFADDTLLFCKFDDNMLDVLVQTIRFFEWCSG
        +  +++ +NG  +G + ASRGLRQGDPLSPFLF +V++VLS ++ +A  + + +GF VG +   VS LQFADDT+ F    +  L  L   +  F   SG
Subjt:  NPKFSIFINGRPRGRILASRGLRQGDPLSPFLFLLVSEVLSALIDEAYCKGLYQGFIVGSDNVHVSILQFADDTLLFCKFDDNMLDVLVQTIRFFEWCSG

Query:  LKINWEKSSLSGVNVDESQLQQVSQRLNCKKEVLPFIYLGLPLGGYPKKAIFWQPVIDKIQKKLDRWKHFNLSRGGR---------LTLCNSVLSS----
        LK+N +KS++ G+N++++ L +++  L+CK    P +YLGLPLGG PK + FW PVI++I ++LD W+   LS GG+           L + V+ S    
Subjt:  LKINWEKSSLSGVNVDESQLQQVSQRLNCKKEVLPFIYLGLPLGGYPKKAIFWQPVIDKIQKKLDRWKHFNLSRGGR---------LTLCNSVLSS----

Query:  --------GKCSFSLRSPWMSISTVWKRIEYLAHFKLGNGKRISFWNDPWIANRPLKLKFPQLFSLSSSPNGSVSDYWDASH-LSWNVIFRRFLKE-EIP
                    +S R PW +I+ V++       F +G+G RI FW+D W  ++ L  ++P+L S+ +  N  +S     S   SWN  FRR L + EI 
Subjt:  --------GKCSFSLRSPWMSISTVWKRIEYLAHFKLGNGKRISFWNDPWIANRPLKLKFPQLFSLSSSPNGSVSDYWDASH-LSWNVIFRRFLKE-EIP

Query:  DFQSLLESLSSISIAP-FDDNCIWSLEPSGCFSVKSLTRHLASSLWRKR--FLMLCGKLRAREEL-PSLCGSCL
        D +SL+ SL  + I+P   D   WS+ PSG F+VKS      + L R +   L L G L+  + L P++C  C+
Subjt:  DFQSLLESLSSISIAP-FDDNCIWSLEPSGCFSVKSLTRHLASSLWRKR--FLMLCGKLRAREEL-PSLCGSCL

A0A438I2T6 Transposon TX1 uncharacterized 149 kDa protein1.1e-10632.62Show/hide
Query:  LGRSISDHFPLLFEAGNFQWGPSPFRFYNSWLNQSDCVQIIESSLSQDSSYGWAGFVIACKLRNLKSTLKDWYADYESKRKSKEKGLLAEINRLDTLSDQ
        L R  SDH+P++ E   F+WGP+PFRF N WL  S   +      S+    GW G     KL+ +K+ LK+W      +   K+K +LA +   D+L  +
Subjt:  LGRSISDHFPLLFEAGNFQWGPSPFRFYNSWLNQSDCVQIIESSLSQDSSYGWAGFVIACKLRNLKSTLKDWYADYESKRKSKEKGLLAEINRLDTLSDQ

Query:  CPLSSLEQSLRAVARGELLELYMSEERNLIQRCKLQWLKAGDENTNFFHRFLAAKKRKLLITNLCSVDGDSLMAFRDIESEILGYFESLYTKIPGKRFIP
          LS      RA ++GEL EL + EE +  Q+ +++W+K GD N+ FFH+    ++ +  I  L +  G  L     I+ EIL YFE LY     + +  
Subjt:  CPLSSLEQSLRAVARGELLELYMSEERNLIQRCKLQWLKAGDENTNFFHRFLAAKKRKLLITNLCSVDGDSLMAFRDIESEILGYFESLYTKIPGKRFIP

Query:  LNLHWLTVSSAQNSLLVAPFPPDEIWK---------------------------------------------------------------------EILD
          L W  +     S L +PF  +EI+K                                                                     +ILD
Subjt:  LNLHWLTVSSAQNSLLVAPFPPDEIWK---------------------------------------------------------------------EILD

Query:  PILIANEAVEEYRIKKKKGWILKLDFEKAFDCVDWDFLDKVLELKAFSSKWIQWVQGCVRNPKFSIFINGRPRGRILASRGLRQGDPLSPFLFLLVSEVL
         +LIANE V+E R   ++G + K+DFEKA+D + WDFLD VLE+K FS +W +W++GC+ +  +++ +NG  +G + ASRGLRQGDPLSPFLF +V++VL
Subjt:  PILIANEAVEEYRIKKKKGWILKLDFEKAFDCVDWDFLDKVLELKAFSSKWIQWVQGCVRNPKFSIFINGRPRGRILASRGLRQGDPLSPFLFLLVSEVL

Query:  SALIDEAYCKGLYQGFIVGSDNVHVSILQFADDTLLFCKFDDNMLDVLVQTIRFFEWCSGLKINWEKSSLSGVNVDESQLQQVSQRLNCKKEVLPFIYLG
        S ++ +A  + + +GF VG +   VS LQFADDT+ F    +  L  L   +  F   SGLK+N +KS++ G+N++++ L +++  L+CK    P +YLG
Subjt:  SALIDEAYCKGLYQGFIVGSDNVHVSILQFADDTLLFCKFDDNMLDVLVQTIRFFEWCSGLKINWEKSSLSGVNVDESQLQQVSQRLNCKKEVLPFIYLG

Query:  LPLGGYPKKAIFWQPVIDKIQKKLDRW---------------------------KHFNL----------SRG----GRLTLCNSVL--------------
        LPLGG PK + FW PVI++I ++LD W                           K  +L          SRG    G++++ N  L              
Subjt:  LPLGGYPKKAIFWQPVIDKIQKKLDRW---------------------------KHFNL----------SRG----GRLTLCNSVL--------------

Query:  -------------SSG-----KCSFSLRSPWMSISTVWKRIEYLAHFKLGNGKRISFWNDPWIANRPLKLKFPQLFSLSSSPNGSVSDYWDASH-LSWNV
                     S+G        +S R PW +I+ V++       F +G+G RI FW+D W  ++PL  ++P+L S+ +  N  +S     S   SWN 
Subjt:  -------------SSG-----KCSFSLRSPWMSISTVWKRIEYLAHFKLGNGKRISFWNDPWIANRPLKLKFPQLFSLSSSPNGSVSDYWDASH-LSWNV

Query:  IFRRFLKE-EIPDFQSLLESLSSISIAP-FDDNCIWSLEPSGCFSVKS
         FRR L + EI D +SL+ SL  + I+P   D   WS+ PSG F+VKS
Subjt:  IFRRFLKE-EIPDFQSLLESLSSISIAP-FDDNCIWSLEPSGCFSVKS

A0A438KE15 LINE-1 retrotransposable element ORF2 protein8.8e-10933.1Show/hide
Query:  LGRSISDHFPLLFEAGNFQWGPSPFRFYNSWLNQSDCVQIIESSLSQDSSYGWAGFVIACKLRNLKSTLKDWYADYESKRKSKEKGLLAEINRLDTLSDQ
        L R  SDH+P++ E   F+WGP+PFRF N WL+     +       +    GW G     +L+ LK+ LK+W  +       ++K +L +I   D++  +
Subjt:  LGRSISDHFPLLFEAGNFQWGPSPFRFYNSWLNQSDCVQIIESSLSQDSSYGWAGFVIACKLRNLKSTLKDWYADYESKRKSKEKGLLAEINRLDTLSDQ

Query:  CPLSSLEQSLRAVARGELLELYMSEERNLIQRCKLQWLKAGDENTNFFHRFLAAKKRKLLITNLCSVDGDSLMAFRDIESEILGYFESLYTKIPGKRFIP
          LS      RAV +GEL EL + EE +  Q+ +++W+K GD N+ FFH+    ++ +  I  L +  G  L     I+ EIL YFE LY    G+ +  
Subjt:  CPLSSLEQSLRAVARGELLELYMSEERNLIQRCKLQWLKAGDENTNFFHRFLAAKKRKLLITNLCSVDGDSLMAFRDIESEILGYFESLYTKIPGKRFIP

Query:  LNLHWLTVSSAQNSLLVAPFPPDEIWK------------------------------------------------------EILDPILIANEAVEEYRIK
          L W  +S    S L +PF  +EI+K                                                      +ILD +LIANE V+E +  
Subjt:  LNLHWLTVSSAQNSLLVAPFPPDEIWK------------------------------------------------------EILDPILIANEAVEEYRIK

Query:  KKKGWILKLDFEKAFDCVDWDFLDKVLELKAFSSKWIQWVQGCVRNPKFSIFINGRPRGRILASRGLRQGDPLSPFLFLLVSEVLSALIDEAYCKGLYQG
         ++G + K+DFEKA+D V WDFLD V+E K F+ KW +W++GC+ +  F+I +NG  +G + ASRGLRQGDPLSPFLF +V++VLS ++  A  + +++G
Subjt:  KKKGWILKLDFEKAFDCVDWDFLDKVLELKAFSSKWIQWVQGCVRNPKFSIFINGRPRGRILASRGLRQGDPLSPFLFLLVSEVLSALIDEAYCKGLYQG

Query:  FIVGSDNVHVSILQFADDTLLFCKFDDNMLDVLVQTIRFFEWCSGLKINWEKSSLSGVNVDESQLQQVSQRLNCKKEVLPFIYLGLPLGGYPKKAIFWQP
        F VG +   VS LQFADDT+ F    +  L  L   +  F   S LK+N +KS++ G+N+ +  L ++++ L+CK    P +YLGLPLGG PK   FW P
Subjt:  FIVGSDNVHVSILQFADDTLLFCKFDDNMLDVLVQTIRFFEWCSGLKINWEKSSLSGVNVDESQLQQVSQRLNCKKEVLPFIYLGLPLGGYPKKAIFWQP

Query:  VIDKIQKKLDRWKHFNLSRGGRLTLCNSVLSSGKCSF---------------------------------------------------------------
        VI++I  +LD W+   LS GGR+TL  S L+   C F                                                               
Subjt:  VIDKIQKKLDRWKHFNLSRGGRLTLCNSVLSSGKCSF---------------------------------------------------------------

Query:  -----SLRSPWMSISTVWKRIEYLAHFKLGNGKRISFWNDPWIANRPLKLKFPQLFSLSSSPNGSVSDYWDASH-LSWNVIFRRFLKE-EIPDFQSLLES
             S R PW +I+ V++       F +G+G RI FW D W  ++ L ++FP+L  +    N  +S    ++   SWN  FRR L + EI + +SL++S
Subjt:  -----SLRSPWMSISTVWKRIEYLAHFKLGNGKRISFWNDPWIANRPLKLKFPQLFSLSSSPNGSVSDYWDASH-LSWNVIFRRFLKE-EIPDFQSLLES

Query:  LSSISIAP-FDDNCIWSLEPSGCFSVKS
        L  I ++P   D   WSL  SG F+VKS
Subjt:  LSSISIAP-FDDNCIWSLEPSGCFSVKS

A5AI05 Reverse transcriptase domain-containing protein3.3e-10834.7Show/hide
Query:  LGRSISDHFPLLFEAGNFQWGPSPFRFYNSWLNQSDCVQIIESSLSQDSSYGWAGFVIACKLRNLKSTLKDWYADYESKRKSKEKGLLAEINRLDTLSDQ
        L R  SDH+P++ E   F+WGP+PF F N WL      +       +    GW G     KL+ +K+ LK+W      +   ++K +L+++   D+L  +
Subjt:  LGRSISDHFPLLFEAGNFQWGPSPFRFYNSWLNQSDCVQIIESSLSQDSSYGWAGFVIACKLRNLKSTLKDWYADYESKRKSKEKGLLAEINRLDTLSDQ

Query:  CPLSSLEQSLRAVARGELLELYMSEERNLIQRCKLQWLKAGDENTNFFHRFLAAKKRKLLITNLCSVDGDSLMAFRDIESEILGYFESLYTKIPGKRFIP
          LS      RA+ +GEL EL + EE +  Q+ +++W+K GD N+ FFH+    ++ +  I  L + +G  +     I+ EIL YFE LY    G+ +  
Subjt:  CPLSSLEQSLRAVARGELLELYMSEERNLIQRCKLQWLKAGDENTNFFHRFLAAKKRKLLITNLCSVDGDSLMAFRDIESEILGYFESLYTKIPGKRFIP

Query:  LNLHWLTVS------SAQNSLLVAPFPPDEI---------------------------------WKEILDPILIANEAVEEYRIKKKKGWILKLDFEKAF
          L W  +S          S  ++ F P  +                                  ++ILD +LIANE V+E R   ++G + K+DFEKA+
Subjt:  LNLHWLTVS------SAQNSLLVAPFPPDEI---------------------------------WKEILDPILIANEAVEEYRIKKKKGWILKLDFEKAF

Query:  DCVDWDFLDKVLELKAFSSKWIQWVQGCVRNPKFSIFINGRPRGRILASRGLRQGDPLSPFLFLLVSEVLSALIDEAYCKGLYQGFIVGSDNVHVSILQF
        D V WDFLD VLE+K F  +W +W++GC+ +  F++ +NG  +G + ASRGLRQGDPLSPFLF +V++VLS ++ +A  + + +GF VG +   VS LQF
Subjt:  DCVDWDFLDKVLELKAFSSKWIQWVQGCVRNPKFSIFINGRPRGRILASRGLRQGDPLSPFLFLLVSEVLSALIDEAYCKGLYQGFIVGSDNVHVSILQF

Query:  ADDTLLFCKFDDNMLDVLVQTIRFFEWCSGLKINWEKSSLSGVNVDESQLQQVSQRLNCKKEVLPFIYLGLPLGGYPKKAIFWQPVIDKIQKKLDRWKHF
        ADDT+ F    +  +  L   +  F   SGLK+N +KS++ G+N++++ L ++++ L+CK    P +YLGLPLGG PK   FW PVI++I ++LD W   
Subjt:  ADDTLLFCKFDDNMLDVLVQTIRFFEWCSGLKINWEKSSLSGVNVDESQLQQVSQRLNCKKEVLPFIYLGLPLGGYPKKAIFWQPVIDKIQKKLDRWKHF

Query:  NLSRGGRLTLCNSVLSSGKCSF-SLRSPWMSIST---------VWKRI-----EYLAHFKLGNGKRISFWNDPWIANRPLKLKFPQLFSLSSSPNGSVSD
         LS G R+TL  S L+   C F SL     S++          +W  +     ++L ++   NG RI  W+D W   +PL +++P+L S+ +  N  +S 
Subjt:  NLSRGGRLTLCNSVLSSGKCSF-SLRSPWMSIST---------VWKRI-----EYLAHFKLGNGKRISFWNDPWIANRPLKLKFPQLFSLSSSPNGSVSD

Query:  YWDASH-LSWNVIFRRFL-KEEIPDFQSLLESLSSISIAP-FDDNCIWSLEPSGCFSVKS
            +   SWN  F R L   EI D + L+ S   + I+P   D   WSL P G F+VKS
Subjt:  YWDASH-LSWNVIFRRFL-KEEIPDFQSLLESLSSISIAP-FDDNCIWSLEPSGCFSVKS

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.3e-1629Show/hide
Query:  RIKKKKGWILKLDFEKAFDCVDWDFLDKVLELKAFSSKWIQWVQGCVRNPKFSIFINGRPRGRILASRGLRQGDPLSPFLFLLVSEVLS-ALIDEAYCKG
        R K K   I+ +D EKAFD +   F+ K L        +++ ++     P  +I +NG+         G RQG PLSP LF +V EVL+ A+  E   KG
Subjt:  RIKKKKGWILKLDFEKAFDCVDWDFLDKVLELKAFSSKWIQWVQGCVRNPKFSIFINGRPRGRILASRGLRQGDPLSPFLFLLVSEVLS-ALIDEAYCKG

Query:  LYQGFIVGSDNVHVSILQFADDTLLFCKFDDNMLDVLVQTIRFFEWCSGLKINWEKSSLSGVNVDESQLQQVSQRLNCKKEVLPFIYLGLPLGGYPKKAI
        +     +G + V +S+  FADD +++ +        L++ I  F   SG KIN +KS     N +     Q+   L          YLG+ L    K   
Subjt:  LYQGFIVGSDNVHVSILQFADDTLLFCKFDDNMLDVLVQTIRFFEWCSGLKINWEKSSLSGVNVDESQLQQVSQRLNCKKEVLPFIYLGLPLGGYPKKAI

Query:  --FWQPVIDKIQKKLDRWKHFNLSRGGRLTL
           ++P++ +I++  ++WK+   S  GR+ +
Subjt:  --FWQPVIDKIQKKLDRWKHFNLSRGGRLTL

P08548 LINE-1 reverse transcriptase homolog6.4e-1627.31Show/hide
Query:  RIKKKKGWILKLDFEKAFDCVDWDFLDKVLELKAFSSKWIQWVQGCVRNPKFSIFINGRPRGRILASRGLRQGDPLSPFLFLLVSEVLS-ALIDEAYCKG
        ++K K   IL +D EKAFD +   F+ + L+       +++ ++     P  +I +NG          G RQG PLSP LF +V EVL+ A+ +E   KG
Subjt:  RIKKKKGWILKLDFEKAFDCVDWDFLDKVLELKAFSSKWIQWVQGCVRNPKFSIFINGRPRGRILASRGLRQGDPLSPFLFLLVSEVLS-ALIDEAYCKG

Query:  LYQGFIVGSDNVHVSILQFADDTLLFCKFDDNMLDVLVQTIRFFEWCSGLKINWEKSSLSGVNVDESQLQQVSQRLNCKKEVLPFI-------YLGLPLG
        ++    +GS+ + +S+  FADD +++ +   +    L++ I+ +   SG KIN  K S++ +  + +Q ++        K+ +PF        YLG+ L 
Subjt:  LYQGFIVGSDNVHVSILQFADDTLLFCKFDDNMLDVLVQTIRFFEWCSGLKINWEKSSLSGVNVDESQLQQVSQRLNCKKEVLPFI-------YLGLPLG

Query:  GYPKKAI--FWQPVIDKIQKKLDRWKHFNLSRGGRLTL
           K      ++ +  +I + +++WK+   S  GR+ +
Subjt:  GYPKKAI--FWQPVIDKIQKKLDRWKHFNLSRGGRLTL

P11369 LINE-1 retrotransposable element ORF2 protein9.9e-1728.7Show/hide
Query:  RIKKKKGWILKLDFEKAFDCVDWDFLDKVLELKAFSSKWIQWVQGCVRNPKFSIFINGRPRGRILASRGLRQGDPLSPFLFLLVSEVLSALIDEAYCKGL
        ++K K   I+ LD EKAFD +   F+ KVLE       ++  ++     P  +I +NG     I    G RQG PLSP+LF +V EVL+  I +   +  
Subjt:  RIKKKKGWILKLDFEKAFDCVDWDFLDKVLELKAFSSKWIQWVQGCVRNPKFSIFINGRPRGRILASRGLRQGDPLSPFLFLLVSEVLSALIDEAYCKGL

Query:  YQGFIVGSDNVHVSILQFADDTLLFCKFDDNMLDVLVQTIRFFEWCSGLKINWEKSSLSGVNVDESQLQQVSQRLNCKKEVLPFIYLGLPLGGYPKKAI-
         +G  +G + V +S+L  ADD +++     N    L+  I  F    G KIN  KS       ++   +++ +            YLG+ L    K    
Subjt:  YQGFIVGSDNVHVSILQFADDTLLFCKFDDNMLDVLVQTIRFFEWCSGLKINWEKSSLSGVNVDESQLQQVSQRLNCKKEVLPFIYLGLPLGGYPKKAI-

Query:  -FWQPVIDKIQKKLDRWKHFNLSRGGRLTL
          ++ +  +I++ L RWK    S  GR+ +
Subjt:  -FWQPVIDKIQKKLDRWKHFNLSRGGRLTL

P92555 Uncharacterized mitochondrial protein AtMg012504.8e-1152.24Show/hide
Query:  INGRPRGRILASRGLRQGDPLSPFLFLLVSEVLSALIDEAYCKGLYQGFIVGSDNVHVSILQFADDT
        ING P+G +  SRGLRQGDPLSP+LF+L +EVLS L   A  +G   G  V +++  ++ L FADDT
Subjt:  INGRPRGRILASRGLRQGDPLSPFLFLLVSEVLSALIDEAYCKGLYQGFIVGSDNVHVSILQFADDT

Q03274 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)1.5e-0928.49Show/hide
Query:  LIANEAVEEYRIKKKKGWILKLDFEKAFDCVDWDFLDKVLELKAFSSKWIQWVQGCVRNPKFSIFIN-GRPRGRILASRGLRQGDPLSPFLFLLVSEVLS
        L+ +  +   R ++K   ++ LD  KAFD V    + + L+          ++ G + +   +I +  G    +I   RG++QGDPLSPFLF       +
Subjt:  LIANEAVEEYRIKKKKGWILKLDFEKAFDCVDWDFLDKVLELKAFSSKWIQWVQGCVRNPKFSIFIN-GRPRGRILASRGLRQGDPLSPFLFLLVSEVLS

Query:  ALIDEAYCK-----GLYQGFIVGSDNVHVSILQFADDTLLFCKFDDNMLDVLVQTIRFFEWCSGLKINWEKS
        A++DE  C      G+  G  +G +   + +L FADD LL  + +D +L   + T+  F    G+ +N +KS
Subjt:  ALIDEAYCK-----GLYQGFIVGSDNVHVSILQFADDTLLFCKFDDNMLDVLVQTIRFFEWCSGLKINWEKS

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases5.2e-0545.45Show/hide
Query:  DPILIANEAVEEYRIKK-KKGW-ILKLDFEKAFDCVDWDFLDKVLELKAFSSKWI
        D I+   EAV   R KK  KGW +LKLD EKA+D + WD+L+  L    F   W+
Subjt:  DPILIANEAVEEYRIKK-KKGW-ILKLDFEKAFDCVDWDFLDKVLELKAFSSKWI

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)3.4e-1252.24Show/hide
Query:  INGRPRGRILASRGLRQGDPLSPFLFLLVSEVLSALIDEAYCKGLYQGFIVGSDNVHVSILQFADDT
        ING P+G +  SRGLRQGDPLSP+LF+L +EVLS L   A  +G   G  V +++  ++ L FADDT
Subjt:  INGRPRGRILASRGLRQGDPLSPFLFLLVSEVLSALIDEAYCKGLYQGFIVGSDNVHVSILQFADDT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGAATCCGGTTCGGGTGGTTGACATTCTTGGTTGTGCAAGACTTGGTCGATCAATCTCTGACCATTTTCCTTTGCTTTTTGAAGCTGGAAATTTTCAGTGGGGACC
CTCGCCTTTTCGGTTTTACAATTCTTGGCTGAATCAATCTGATTGTGTGCAAATTATTGAGTCATCACTATCACAAGATTCTTCCTATGGGTGGGCAGGTTTTGTGATTG
CTTGTAAATTGAGAAACCTGAAATCTACCTTAAAAGATTGGTATGCAGACTATGAGTCGAAAAGAAAGAGTAAGGAGAAAGGGTTGCTGGCGGAAATTAATAGGCTTGAT
ACACTATCAGATCAGTGTCCCTTATCTTCTCTTGAGCAAAGTCTCCGTGCTGTGGCAAGGGGGGAATTATTAGAACTATATATGTCCGAAGAAAGAAATTTGATTCAAAG
ATGCAAGTTGCAATGGTTAAAAGCTGGGGATGAAAATACGAACTTTTTTCACAGATTCTTGGCTGCAAAAAAGAGAAAATTATTGATTACCAACCTGTGTTCAGTTGATG
GGGATTCTCTTATGGCTTTCAGAGATATTGAATCTGAAATTCTTGGTTATTTTGAGTCACTTTATACGAAGATACCAGGTAAGAGATTTATTCCTTTGAATCTTCATTGG
CTTACGGTTTCTTCAGCACAAAACTCTCTTTTAGTTGCTCCTTTTCCACCGGATGAGATATGGAAGGAAATCTTGGATCCCATTCTTATTGCAAATGAAGCTGTGGAAGA
ATACAGAATTAAGAAGAAAAAAGGTTGGATTCTCAAACTTGATTTTGAAAAAGCTTTTGATTGTGTAGATTGGGATTTTCTTGACAAAGTTTTAGAACTCAAAGCTTTCA
GCAGTAAATGGATTCAATGGGTTCAAGGTTGTGTTCGTAACCCAAAATTCTCTATTTTCATTAATGGTAGACCTCGTGGGCGTATTCTAGCTTCTCGTGGTTTGAGACAA
GGTGATCCATTGTCTCCATTTTTATTTCTTTTGGTTAGTGAAGTGCTTAGTGCCCTTATTGACGAGGCTTACTGCAAAGGATTATATCAAGGCTTCATCGTGGGTAGTGA
CAATGTTCACGTTTCTATTCTACAATTCGCAGATGACACGCTTTTATTTTGCAAATTTGATGATAATATGCTGGATGTTTTGGTTCAAACCATTCGTTTTTTTGAATGGT
GCTCGGGCTTGAAGATTAATTGGGAGAAATCATCATTAAGTGGGGTGAATGTGGATGAATCTCAACTTCAACAAGTTTCTCAACGGTTGAATTGTAAGAAAGAAGTTCTG
CCTTTCATTTATCTTGGTCTTCCTCTTGGTGGGTATCCTAAAAAAGCAATCTTTTGGCAGCCAGTGATTGATAAAATTCAGAAAAAATTGGATAGATGGAAGCATTTCAA
TTTGTCTCGAGGTGGTCGGCTAACCCTTTGTAACTCGGTTCTTTCTTCTGGAAAATGCAGTTTTAGTCTTAGAAGTCCTTGGATGAGTATCTCTACTGTTTGGAAGCGTA
TTGAATATTTAGCTCATTTCAAATTGGGAAATGGGAAAAGGATTTCATTTTGGAATGACCCGTGGATTGCAAATCGTCCTTTGAAGCTGAAATTCCCGCAGTTATTTTCT
CTATCTTCTTCTCCTAATGGTTCGGTTTCAGATTATTGGGATGCTTCTCACTTATCTTGGAATGTTATATTTCGTAGATTTTTGAAGGAGGAAATTCCAGATTTTCAATC
CTTATTGGAAAGTTTATCTTCTATTTCCATTGCTCCTTTTGATGATAATTGTATATGGTCTTTGGAACCTTCAGGGTGTTTTTCTGTGAAGTCTCTTACCCGTCATTTGG
CTTCCTCACTTTGGAGAAAGAGATTTTTAATGCTCTGTGGAAAACTAAGAGCCCGAGAAGAACTTCCATCCTTGTGTGGATCATGCTTAATGGTTCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGATGAATCCGGTTCGGGTGGTTGACATTCTTGGTTGTGCAAGACTTGGTCGATCAATCTCTGACCATTTTCCTTTGCTTTTTGAAGCTGGAAATTTTCAGTGGGGACC
CTCGCCTTTTCGGTTTTACAATTCTTGGCTGAATCAATCTGATTGTGTGCAAATTATTGAGTCATCACTATCACAAGATTCTTCCTATGGGTGGGCAGGTTTTGTGATTG
CTTGTAAATTGAGAAACCTGAAATCTACCTTAAAAGATTGGTATGCAGACTATGAGTCGAAAAGAAAGAGTAAGGAGAAAGGGTTGCTGGCGGAAATTAATAGGCTTGAT
ACACTATCAGATCAGTGTCCCTTATCTTCTCTTGAGCAAAGTCTCCGTGCTGTGGCAAGGGGGGAATTATTAGAACTATATATGTCCGAAGAAAGAAATTTGATTCAAAG
ATGCAAGTTGCAATGGTTAAAAGCTGGGGATGAAAATACGAACTTTTTTCACAGATTCTTGGCTGCAAAAAAGAGAAAATTATTGATTACCAACCTGTGTTCAGTTGATG
GGGATTCTCTTATGGCTTTCAGAGATATTGAATCTGAAATTCTTGGTTATTTTGAGTCACTTTATACGAAGATACCAGGTAAGAGATTTATTCCTTTGAATCTTCATTGG
CTTACGGTTTCTTCAGCACAAAACTCTCTTTTAGTTGCTCCTTTTCCACCGGATGAGATATGGAAGGAAATCTTGGATCCCATTCTTATTGCAAATGAAGCTGTGGAAGA
ATACAGAATTAAGAAGAAAAAAGGTTGGATTCTCAAACTTGATTTTGAAAAAGCTTTTGATTGTGTAGATTGGGATTTTCTTGACAAAGTTTTAGAACTCAAAGCTTTCA
GCAGTAAATGGATTCAATGGGTTCAAGGTTGTGTTCGTAACCCAAAATTCTCTATTTTCATTAATGGTAGACCTCGTGGGCGTATTCTAGCTTCTCGTGGTTTGAGACAA
GGTGATCCATTGTCTCCATTTTTATTTCTTTTGGTTAGTGAAGTGCTTAGTGCCCTTATTGACGAGGCTTACTGCAAAGGATTATATCAAGGCTTCATCGTGGGTAGTGA
CAATGTTCACGTTTCTATTCTACAATTCGCAGATGACACGCTTTTATTTTGCAAATTTGATGATAATATGCTGGATGTTTTGGTTCAAACCATTCGTTTTTTTGAATGGT
GCTCGGGCTTGAAGATTAATTGGGAGAAATCATCATTAAGTGGGGTGAATGTGGATGAATCTCAACTTCAACAAGTTTCTCAACGGTTGAATTGTAAGAAAGAAGTTCTG
CCTTTCATTTATCTTGGTCTTCCTCTTGGTGGGTATCCTAAAAAAGCAATCTTTTGGCAGCCAGTGATTGATAAAATTCAGAAAAAATTGGATAGATGGAAGCATTTCAA
TTTGTCTCGAGGTGGTCGGCTAACCCTTTGTAACTCGGTTCTTTCTTCTGGAAAATGCAGTTTTAGTCTTAGAAGTCCTTGGATGAGTATCTCTACTGTTTGGAAGCGTA
TTGAATATTTAGCTCATTTCAAATTGGGAAATGGGAAAAGGATTTCATTTTGGAATGACCCGTGGATTGCAAATCGTCCTTTGAAGCTGAAATTCCCGCAGTTATTTTCT
CTATCTTCTTCTCCTAATGGTTCGGTTTCAGATTATTGGGATGCTTCTCACTTATCTTGGAATGTTATATTTCGTAGATTTTTGAAGGAGGAAATTCCAGATTTTCAATC
CTTATTGGAAAGTTTATCTTCTATTTCCATTGCTCCTTTTGATGATAATTGTATATGGTCTTTGGAACCTTCAGGGTGTTTTTCTGTGAAGTCTCTTACCCGTCATTTGG
CTTCCTCACTTTGGAGAAAGAGATTTTTAATGCTCTGTGGAAAACTAAGAGCCCGAGAAGAACTTCCATCCTTGTGTGGATCATGCTTAATGGTTCCTTAA
Protein sequenceShow/hide protein sequence
MMNPVRVVDILGCARLGRSISDHFPLLFEAGNFQWGPSPFRFYNSWLNQSDCVQIIESSLSQDSSYGWAGFVIACKLRNLKSTLKDWYADYESKRKSKEKGLLAEINRLD
TLSDQCPLSSLEQSLRAVARGELLELYMSEERNLIQRCKLQWLKAGDENTNFFHRFLAAKKRKLLITNLCSVDGDSLMAFRDIESEILGYFESLYTKIPGKRFIPLNLHW
LTVSSAQNSLLVAPFPPDEIWKEILDPILIANEAVEEYRIKKKKGWILKLDFEKAFDCVDWDFLDKVLELKAFSSKWIQWVQGCVRNPKFSIFINGRPRGRILASRGLRQ
GDPLSPFLFLLVSEVLSALIDEAYCKGLYQGFIVGSDNVHVSILQFADDTLLFCKFDDNMLDVLVQTIRFFEWCSGLKINWEKSSLSGVNVDESQLQQVSQRLNCKKEVL
PFIYLGLPLGGYPKKAIFWQPVIDKIQKKLDRWKHFNLSRGGRLTLCNSVLSSGKCSFSLRSPWMSISTVWKRIEYLAHFKLGNGKRISFWNDPWIANRPLKLKFPQLFS
LSSSPNGSVSDYWDASHLSWNVIFRRFLKEEIPDFQSLLESLSSISIAPFDDNCIWSLEPSGCFSVKSLTRHLASSLWRKRFLMLCGKLRAREELPSLCGSCLMVP