; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028435 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028435
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr8:21708382..21711463
RNA-Seq ExpressionLag0028435
SyntenyLag0028435
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4277969.1 unnamed protein product [Prunus armeniaca]1.5e-10232.08Show/hide
Query:  MIVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWS
        ++ L+PK+ +P RVS++RPISLCN  YK+ISK + NR+K +LP++IS  QS FIP R ++DN +  FE +H L+RR     K   LKLDM+KAYDR+EW 
Subjt:  MIVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWS

Query:  FLRTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCFCSV----------------------------------------
        FL  ++  MGF  ++  LI+ CV++VS+S  + G   G +IPSRGLRQGDP+SPY F  V                                        
Subjt:  FLRTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCFCSV----------------------------------------

Query:  -WRASVMEAVTIWDLLIRYERASGQMINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHKQY--LPSFMPRNRLGTLKFIKDRVWKQIQSWKGKFFSLG
           A   EA+ +  +   YE ASGQ +N  KS + FSP+T    Q  I  +L+V+  PCH++Y  LP+ + +++    + +KDRVW ++  W+GK  S  
Subjt:  -WRASVMEAVTIWDLLIRYERASGQMINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHKQY--LPSFMPRNRLGTLKFIKDRVWKQIQSWKGKFFSLG

Query:  GKEVLLKSIIQAIPCYTMNCFRLPYCLIREIHWAMARFWWNGSEEVNRIHWSVVGL-------GAAGSRVQV---------EDWEWSVYP----------
        GKEVL+KS+ QAIP Y+M+ FRLP  L REI   +A+FWW    +   IHW            G  G R  +         + W    +P          
Subjt:  GKEVLLKSIIQAIPCYTMNCFRLPYCLIREIHWAMARFWWNGSEEVNRIHWSVVGL-------GAAGSRVQV---------EDWEWSVYP----------

Query:  ----------------------------------------------IYGSNWVVDNPSLRVQSAPSLPLSSRVCDLFSPSGQWDEVKIRTHFLGPECEAI
                                                      IYG  WV  +    +QS P+LP++SRVCDLF+ SG WD  K+   F  PE EAI
Subjt:  ----------------------------------------------IYGSNWVVDNPSLRVQSAPSLPLSSRVCDLFSPSGQWDEVKIRTHFLGPECEAI

Query:  LRIPLRSGLLDDRLIWHFEKHGMFSMKSGY-------RLAFSLVSQVCPSSSDSERWLIWWSRLWRLGVPNKHKVFLWRLSLERLP----------TKGN
        L IPL    L DR IW+F K+G +S+KSGY       RL       V   SS S   L  W  LW+L VP K    LWR++ + LP          T+G 
Subjt:  LRIPLRSGLLDDRLIWHFEKHGMFSMKSGY-------RLAFSLVSQVCPSSSDSERWLIWWSRLWRLGVPNKHKVFLWRLSLERLP----------TKGN

Query:  V-------------ALLEIFSVIPVVIPLDL---------------VDVIW--VLKEKLGALDFELVTVFWWSVWNLHNNLCWRGK--------SDGRDL
        V             AL+     + V   LD                +D +W  +  +K     F +     W +WN  N + +  +           +D 
Subjt:  V-------------ALLEIFSVIPVVIPLDL---------------VDVIW--VLKEKLGALDFELVTVFWWSVWNLHNNLCWRGK--------SDGRDL

Query:  WAWSEEYLRAYHGVVGQRESRCSLQPCPSRPVEQSSWTPPVGGGFKLNTDASVRPDT---------------VLLAACLDLPRCWSVDLAEGWALVKGVE
         A  + Y  A H  +             S PV    W PP G  FKLN D +   +T               ++ A  +  P   SV   E +AL  G+ 
Subjt:  WAWSEEYLRAYHGVVGQRESRCSLQPCPSRPVEQSSWTPPVGGGFKLNTDASVRPDT---------------VLLAACLDLPRCWSVDLAEGWALVKGVE

Query:  LVLQMGFFNFCVEVDSLRLVRILHGE
          L M      +E DSL+ V +++ E
Subjt:  LVLQMGFFNFCVEVDSLRLVRILHGE

XP_006487889.1 uncharacterized protein LOC102617714 [Citrus sinensis]1.3e-10131.34Show/hide
Query:  IVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWSF
        I L+PK   P+ VS+FRPISLCN  Y++I+K++ N +KHIL K++S NQS FI  R + DN I+G+E ++++R+  G K+   ALKLD+SKAYDR+EW+F
Subjt:  IVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWSF

Query:  LRTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCF---CSVWRASVMEA--------------VTIWDLLI--------
        LR  + ++GF+  W +L + C+++ SFS  +NG   G + P RGLRQG PLSPY F     V+   +++A              ++I  LL         
Subjt:  LRTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCF---CSVWRASVMEA--------------VTIWDLLI--------

Query:  ---------------RYERASGQMINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHKQY--LPSFMPRNRLGTLKFIKDRVWKQIQSWKGKFFSLGGK
                       RY  ASGQ+ NYEKS + FS N      + I  +  ++    H +Y  LPS + R ++     IK R+W ++ SW+ K FS GG+
Subjt:  ---------------RYERASGQMINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHKQY--LPSFMPRNRLGTLKFIKDRVWKQIQSWKGKFFSLGGK

Query:  EVLLKSIIQAIPCYTMNCFRLPYCLIREIHWAMARFWWNGSEEVNRIHWS-------VVGLGAAGSR---------VQVEDWEWSVYP-----------I
        EVL+K++ QA+P Y M+ F+LP  +  +I  A+ARFWW  S +   IHW+           G  G R         +  + W    +P           I
Subjt:  EVLLKSIIQAIPCYTMNCFRLPYCLIREIHWAMARFWWNGSEEVNRIHWS-------VVGLGAAGSR---------VQVEDWEWSVYP-----------I

Query:  YGSNWVVDNPSLRVQSAPSLPLSSRVCDLFSPSGQWDEVKIRTHFLGPECEAILRIPLRSGLLDDRLIWHFEKHGMFSMKSGYRLAFSLVSQVCPSSSDS
        Y SNW+    + +  S P+L +   V DL   +  W +  I  HF+  +   I+RI L      D+ +WH++K+G +S+KSGY++A  L     P SSDS
Subjt:  YGSNWVVDNPSLRVQSAPSLPLSSRVCDLFSPSGQWDEVKIRTHFLGPECEAILRIPLRSGLLDDRLIWHFEKHGMFSMKSGYRLAFSLVSQVCPSSSDS

Query:  ERWLIWWSRLWRLGVPNKHKVFLWRLSLERLPTKGNV-----------------------ALLE------------IFSVIPVVIPLDLVDVIWVLKEKL
           L  W+ +W   +P K K+F+WR     LPT  N+                       ALL+                I  ++  DL+ V+  ++ K 
Subjt:  ERWLIWWSRLWRLGVPNKHKVFLWRLSLERLPTKGNV-----------------------ALLE------------IFSVIPVVIPLDLVDVIWVLKEKL

Query:  GALDFELVTVFWWSVWNLHNNLCWRGKSDGRDL-WAWSEEYLRAYHGVVGQRESRCSLQPCPSRPVEQSSWTPPVGGGFKLNTDASVR------------
        G  + EL+ V  W++W   N   +  K +   L  A +E  + ++  +          Q    + V    W PP  G FK+N DA+V             
Subjt:  GALDFELVTVFWWSVWNLHNNLCWRGKSDGRDL-WAWSEEYLRAYHGVVGQRESRCSLQPCPSRPVEQSSWTPPVGGGFKLNTDASVR------------

Query:  ---PDTVLLAACLDLPRCWSVDLAEGWALVKGVELVLQMGFFNFCVEVDSLRLVRILHGELTDSSEVTV
              V+ AA        S    E  A++ G++   + GF    +E DS  +V     +LT S +V +
Subjt:  ---PDTVLLAACLDLPRCWSVDLAEGWALVKGVELVLQMGFFNFCVEVDSLRLVRILHGELTDSSEVTV

XP_030509188.1 uncharacterized protein LOC115723863 [Cannabis sativa]2.2e-10133.95Show/hide
Query:  MIVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWS
        +I L+PK+K P  ++ +RPISLCN  YKL+SKA+V R+K  LP++IS  QS FI  R + DN ++ FE +H L+ R  G   +AA+KLDMSKA+DR+EW+
Subjt:  MIVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWS

Query:  FLRTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCF--CS---------------------------------------
        F+  V+ +MGF     DLILRC+++V++SF LNG   G V PSRG+RQGDPLSPY F  C+                                       
Subjt:  FLRTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCF--CS---------------------------------------

Query:  VWRASVMEAVTIWDLLIRYERASGQMINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHKQY--LPSFMPRNRLGTLKFIKDRVWKQIQSWKGKFFSLG
          RA+   A +I   L  Y +ASGQ+IN +K V++FS NT    Q + + +L +   PCH+QY  LPSF  R++      I D++WK + SWK   FS G
Subjt:  VWRASVMEAVTIWDLLIRYERASGQMINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHKQY--LPSFMPRNRLGTLKFIKDRVWKQIQSWKGKFFSLG

Query:  GKEVLLKSIIQAIPCYTMNCFRLPYCLIREIHWAMARFWWNGSEEVNRIHWSVVGL-------GAAGSR---------VQVEDWEWSVYP----------
        GKE+LLK+++QAIP Y M+CFRLP  L  +I   MA FWW  S     IHW            G  G R         +  + W    +P          
Subjt:  GKEVLLKSIIQAIPCYTMNCFRLPYCLIREIHWAMARFWWNGSEEVNRIHWSVVGL-------GAAGSR---------VQVEDWEWSVYP----------

Query:  ------IYGSNWVVDNPSLRVQSAPSLPLSSRVCDLFSPSGQWDEVKIRTHFLGPECEAILRIPLRSGLLDDRLIWHFEKHGMFSMKSGYRLAFSLVSQV
               + S+ +  NPSL  +S  + P +  V DL +   QWD + +R +F  P+ + IL IPL     DD +IW     G++++KSGY+LA S   Q 
Subjt:  ------IYGSNWVVDNPSLRVQSAPSLPLSSRVCDLFSPSGQWDEVKIRTHFLGPECEAILRIPLRSGLLDDRLIWHFEKHGMFSMKSGYRLAFSLVSQV

Query:  CPSSSDSERWLIWWSRLWRLGVPNKHKVFLWRLSLERLPTKG-----------------------NVALLE------IFSVIPVVIPLDLV------DVI
          +SS S     WWS  W++ +P K ++F+W++    LP                          N AL +      ++ +  + I    +      D++
Subjt:  CPSSSDSERWLIWWSRLWRLGVPNKHKVFLWRLSLERLPTKG-----------------------NVALLE------IFSVIPVVIPLDLV------DVI

Query:  WVLKEKLGALDFELVTVFWWSVWNLHNNLCWRGKS--DGRDLWAWSEEYLRAYHGVVGQRE-----SRCSLQPCPSRPVEQS-SWTPPVGGGFKLNTDAS
          L   L   +FEL  V  W  W+   N  + G +    + + +++  YL  +     +R      S  +  P PS     +  WT P  G  KLNTDA+
Subjt:  WVLKEKLGALDFELVTVFWWSVWNLHNNLCWRGKS--DGRDLWAWSEEYLRAYHGVVGQRE-----SRCSLQPCPSRPVEQS-SWTPPVGGGFKLNTDAS

Query:  V
        +
Subjt:  V

XP_030941688.1 uncharacterized protein LOC115966628 [Quercus lobata]8.9e-10331.75Show/hide
Query:  IVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWSF
        IVL+PK+ AP ++SDFRPISLCN  YK+ISK + NR+K +LP +ISS QS F+PG  + DN ++  + +H +R R  GK+   ALKLD+SKAYDR+EW+F
Subjt:  IVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWSF

Query:  LRTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCF--CSVWRASVM---------------------------------
        L+ ++ ++GF   W + ++ CVS+ +FS  +NG+  G++ PSRGLRQGDPLSPY F  C+   +S++                                 
Subjt:  LRTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCF--CSVWRASVM---------------------------------

Query:  ------EAVTIWDLLIRYERASGQMINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHKQY--LPSFMPRNRLGTLKFIKDRVWKQIQSWKGKFFSLGG
              E   +  +L  Y  ASGQ IN EKS + FS NT E  +++    L V      + Y  LP+F+ R +  T  FIKDRVWK++Q WKGK  S  G
Subjt:  ------EAVTIWDLLIRYERASGQMINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHKQY--LPSFMPRNRLGTLKFIKDRVWKQIQSWKGKFFSLGG

Query:  KEVLLKSIIQAIPCYTMNCFRLPYCLIREIHWAMARFWWNGSEEVNRIHWSVVGL-------GAAGSR-----------------VQVED----------
        KEVL+K++ Q+IP YTM  F+LP  L  E+    ARFWW   E   +IHW   G+       G  G R                 +Q  D          
Subjt:  KEVLLKSIIQAIPCYTMNCFRLPYCLIREIHWAMARFWWNGSEEVNRIHWSVVGL-------GAAGSR-----------------VQVED----------

Query:  -------------------WE--------------WSV-----YPIYGSNWVVDNPSLRVQSAP-SLPLSSRVCDLF-SPSGQWDEVKIRTHFLGPECEA
                           W+              W V       +    W+++ P+ +V   P       RV DL    S  WD   + + F   + EA
Subjt:  -------------------WE--------------WSV-----YPIYGSNWVVDNPSLRVQSAP-SLPLSSRVCDLF-SPSGQWDEVKIRTHFLGPECEA

Query:  ILRIPLRSGLLDDRLIWHFEKHGMFSMKSGYRLAFSLVSQ----VCPSSSDSERWLIWWSRLWRLGVPNKHKVFLWRLSLERLPTKGNVALLEIFS----
        I RIPL    + D L+W   K G +S++SGY LA  ++       C SS  S R +  W +LW+L VPNK KV+ WR   + LPT+ N+A  +I      
Subjt:  ILRIPLRSGLLDDRLIWHFEKHGMFSMKSGYRLAFSLVSQ----VCPSSSDSERWLIWWSRLWRLGVPNKHKVFLWRLSLERLPTKGNVALLEIFS----

Query:  ---------------VIPVVIPL----------------DLVDVIWVLKEKLGALDFELVTVFWWSVWNLHNNLCWRGK-SDGRDLWAWSEEYLRAYHGV
                         PV   +                D++ +   L  +L   +FEL     W +WN  N +   GK  D   L   +E YL  Y   
Subjt:  ---------------VIPVVIPL----------------DLVDVIWVLKEKLGALDFELVTVFWWSVWNLHNNLCWRGK-SDGRDLWAWSEEYLRAYHGV

Query:  VGQRESRCSLQPCPSRPVEQSSWTPPVGGGFKLNTDASVRPD---------------TVLLAACLDLPRCWSVDLAEGWALVKGVELVLQMGFFNFCVEV
          Q + + S+   PSR    + W PP    +KLN DA++  +                V+ A  +  P  W+ + AE  A  + VE  +  GF    +E 
Subjt:  VGQRESRCSLQPCPSRPVEQSSWTPPVGGGFKLNTDASVRPD---------------TVLLAACLDLPRCWSVDLAEGWALVKGVELVLQMGFFNFCVEV

Query:  DSLRLVRILHGELTDSSEV
        D++ +++ +     D S +
Subjt:  DSLRLVRILHGELTDSSEV

XP_030963556.1 uncharacterized protein LOC115984674 [Quercus lobata]8.9e-10332.42Show/hide
Query:  IVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWSF
        I L+PKIK+P + +DFRPISLCN  YK++SK + NR+K +LPKL+S +QS F+  R + DN ++ FE  H L+ +T GK+ + A+KLDMSKAYDR+EW+F
Subjt:  IVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWSF

Query:  LRTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCF-------------------------CSV----------------
        L  V+ ++GF  +W  L+  C+ SVSFS  +NGE  GN  P+RGLRQGDPLSPY F                         CS                 
Subjt:  LRTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCF-------------------------CSV----------------

Query:  WRASVMEAVTIWDLLIRYERASGQMINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHKQY--LPSFMPRNRLGTLKFIKDRVWKQIQSWKGKFFSLGG
         RA+  EA +I ++L +YE ASGQ IN EK+ + FSPNT    Q+ I  +L V+    +++Y  LPSF+   +  +  +I++R+W +++ WK +  S GG
Subjt:  WRASVMEAVTIWDLLIRYERASGQMINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHKQY--LPSFMPRNRLGTLKFIKDRVWKQIQSWKGKFFSLGG

Query:  KEVLLKSIIQAIPCYTMNCFRLPYCLIREIHWAMARFWWNGSEEVNRIHW----------SVVGLG--------------------------------AA
        +EVL+K+++QA+P +TM CF+LP  L ++I   + +FWW    E  +IHW          S  GLG                                  
Subjt:  KEVLLKSIIQAIPCYTMNCFRLPYCLIREIHWAMARFWWNGSEEVNRIHW----------SVVGLG--------------------------------AA

Query:  GSRVQVEDWEWSVYPIYGSNWVVDNPSLRVQS-APSLPLSSRVCDLFSPSGQ-WDEVKIRTHFLGPECEAILRIPLRSGLLDDRLIWHFEKHGMFSMKSG
        GS+ ++ D       I G  W+ D  S RV S   + P ++RVC L     + W E +IR  FL  E EAIL +PL      DRLIW    +G ++ KS 
Subjt:  GSRVQVEDWEWSVYPIYGSNWVVDNPSLRVQS-APSLPLSSRVCDLFSPSGQ-WDEVKIRTHFLGPECEAILRIPLRSGLLDDRLIWHFEKHGMFSMKSG

Query:  YRLAFSLVSQVC-----PSSSDSERWLIWWSRLWRLGVPNKHKVFLWRLSLERLPTKGNVALLEIF----------SVIPVVIPL---DLVDVIW-----
        YRL    V         P +S+S     +W  LW L VPNK + FLWR + + LPTK N+    I            +   V P+    ++  +W     
Subjt:  YRLAFSLVSQVC-----PSSSDSERWLIWWSRLWRLGVPNKHKVFLWRLSLERLPTKGNVALLEIF----------SVIPVVIPL---DLVDVIW-----

Query:  ------------------VLKEKLGALDFELVTVFWWSVWNLHNNLCWRGKS-DGRDLWAWSEEYLRAYHGVVGQRESRCSLQPCPSRPVEQSSWTPPVG
                          +L +K+  L  EL     WS+W+  N       S     ++  + E LR +H V  Q E R  L          + W PP+ 
Subjt:  ------------------VLKEKLGALDFELVTVFWWSVWNLHNNLCWRGKS-DGRDLWAWSEEYLRAYHGVVGQRESRCSLQPCPSRPVEQSSWTPPVG

Query:  GGFKLNTDASVRPD-----------------TVLLAACLDLPRCWSVDLAEGWALVKGVELVLQMGFFNFCVEVDSLRLVRILHGELTDSSEVTVEFGH
          +K+N D +  PD                    L+  + LP   +V   E  A  + +   +++G  +   E DS  + ++    LT        FGH
Subjt:  GGFKLNTDASVRPD-----------------TVLLAACLDLPRCWSVDLAEGWALVKGVELVLQMGFFNFCVEVDSLRLVRILHGELTDSSEVTVEFGH

TrEMBL top hitse value%identityAlignment
A0A2N9EVW3 Uncharacterized protein1.2e-10531.91Show/hide
Query:  IVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWSF
        + L+PK K P  V+++RPISLCN  YKLISK + NR+K ILP +IS +QS F+PGR + DN ++ FE +H +  +  GK    ALKLDMSKAYDR+EW F
Subjt:  IVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWSF

Query:  LRTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCF--CS---------------------------------------V
        +  V++RMGF ++W  LIL C+SSVS+S  +NG   G++IP+RGLRQGDP+SPY F  C+                                        
Subjt:  LRTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCF--CS---------------------------------------V

Query:  WRASVMEAVTIWDLLIRYERASGQMINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHKQY--LPSFMPRNRLGTLKFIKDRVWKQIQSWKGKFFSLGG
         RAS++E   I ++L  YE+ASGQ +N  K+ + FS NT +  Q+ I  +L V     +++Y  LPS + + ++     IK+RVW +++ WK K  S  G
Subjt:  WRASVMEAVTIWDLLIRYERASGQMINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHKQY--LPSFMPRNRLGTLKFIKDRVWKQIQSWKGKFFSLGG

Query:  KEVLLKSIIQAIPCYTMNCFRLPYCLIREIHWAMARFWWNGSEEVNRIHW-------SVVGLGAAGSR---------VQVEDWEW---------------
        +EVL+K++IQAIP YTMNCF+LP  L +EI   + RFWW  + +  +IHW       +  G G  G R         +  + W +               
Subjt:  KEVLLKSIIQAIPCYTMNCFRLPYCLIREIHWAMARFWWNGSEEVNRIHW-------SVVGLGAAGSR---------VQVEDWEW---------------

Query:  --------SVYPIYGSNWVVDNPSLRVQS-APSLPLSSRVCDLFSPSGQ-WDEVKIRTHFLGPECEAILRIPLRSGLLDDRLIWHFEKHGMFSMKSGYRL
                +  PI GSNW+++    R+ S   +LP+ +RV +L   S   W+  KI+  FL  + +AIL+IPL     +DRL W   ++G +S++SGY+L
Subjt:  --------SVYPIYGSNWVVDNPSLRVQS-APSLPLSSRVCDLFSPSGQ-WDEVKIRTHFLGPECEAILRIPLRSGLLDDRLIWHFEKHGMFSMKSGYRL

Query:  AFSLVSQVCPSSSDSERWLIWWSRLWRLGVPNKHKVFLWRLSLERLPTKGNV-----------------------------ALLEIFSVIP------VVI
                 P SS        W R+WR  VP K K FLWR S + LPTK  +                             A+ +++S+ P       + 
Subjt:  AFSLVSQVCPSSSDSERWLIWWSRLWRLGVPNKHKVFLWRLSLERLPTKGNV-----------------------------ALLEIFSVIP------VVI

Query:  PLDLVDVIWVLKEKLGALDFELVTVFWWSVWNLHNNLCWRGKSDGRDLWAWSEEYLRAYHGVVGQRESRCSLQPCPSRPVEQSSWTPPVGGGFKLNTDAS
        P+   +++  + +    L FE      W +W+  N    R  SD           L + +  V    +   LQP       Q  W PP    FK+N D +
Subjt:  PLDLVDVIWVLKEKLGALDFELVTVFWWSVWNLHNNLCWRGKSDGRDLWAWSEEYLRAYHGVVGQRESRCSLQPCPSRPVEQSSWTPPVGGGFKLNTDAS

Query:  VRPDT---------------VLLAACLDLPRCWSVDLAEGWALVKGVELVLQMGFFNFCVEVDSLRLVRIL
        +  ++               V+      +    + ++ E  A  + +   +++G  N  +E D++ ++R L
Subjt:  VRPDT---------------VLLAACLDLPRCWSVDLAEGWALVKGVELVLQMGFFNFCVEVDSLRLVRIL

A0A2N9GB96 Uncharacterized protein2.5e-10331.05Show/hide
Query:  IVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWSF
        + L+PK+K P  V+++RPISLCN  YKLISK + NR+K +LP +I+  QS F+PGR + DN ++ FE +H +  +  G+    ALKLDMSKAYDR+EWSF
Subjt:  IVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWSF

Query:  LRTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCF--CS---------------------------------------V
        LR V+ +MGF  QW  L++ C+++VS+S  +NGE  G++ PSRGLRQGDP+SPY F  C+                                        
Subjt:  LRTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCF--CS---------------------------------------V

Query:  WRASVMEAVTIWDLLIRYERASGQMINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHKQY--LPSFMPRNRLGTLKFIKDRVWKQIQSWKGKFFSLGG
         RA+  E   I DLL  YE+ASGQ +N  K+ + FS NT + +Q  I ++L V     +++Y  LPS + + ++     IKDRVW +++ WK K  S  G
Subjt:  WRASVMEAVTIWDLLIRYERASGQMINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHKQY--LPSFMPRNRLGTLKFIKDRVWKQIQSWKGKFFSLGG

Query:  KEVLLKSIIQAIPCYTMNCFRLPYCLIREIHWAMARFWWNGSEEVNRIHW-------SVVGLGAAGSR---------VQVEDW-----------------
        +E+L+K++IQAIP YTMNCF+LP  L ++I   M RFWW   ++  ++HW          G G  G R         +  + W                 
Subjt:  KEVLLKSIIQAIPCYTMNCFRLPYCLIREIHWAMARFWWNGSEEVNRIHW-------SVVGLGAAGSR---------VQVEDW-----------------

Query:  ----------------------------------EWSV-----YPIYGSNWVVDNPSLRVQS-APSLPLSSRVCDLFSPSG-QWDEVKIRTHFLGPECEA
                                           W V      PI  +NW++D    RV S  P  P  S+V  L   S  +WD  KIR  FL  + EA
Subjt:  ----------------------------------EWSV-----YPIYGSNWVVDNPSLRVQS-APSLPLSSRVCDLFSPSG-QWDEVKIRTHFLGPECEA

Query:  ILRIPLRSGLLDDRLIWHFEKHGMFSMKSGYRLAFSLVSQVCPSSSDSERWLIWWSRLWRLGVPNKHKVFLWRLSLERLPTKGNVALLEIF---------
        IL+IP+ S    D+LIWH  + G +S++SGY +    V    P SS        W  +W +  P K + FLWR   E LP+K  ++  +I          
Subjt:  ILRIPLRSGLLDDRLIWHFEKHGMFSMKSGYRLAFSLVSQVCPSSSDSERWLIWWSRLWRLGVPNKHKVFLWRLSLERLPTKGNVALLEIF---------

Query:  ----SVIPVVIPLDLVDVIW--------VLKEKLG--------------ALDFELVTVFWWSVWNLHNNLCWRGKSDGRDLWAWSEEYLRAYHGVVGQRE
              +  +     ++  W        + K++ G              AL  E      W +W+  N       SD         E L   H  +  +E
Subjt:  ----SVIPVVIPLDLVDVIW--------VLKEKLG--------------ALDFELVTVFWWSVWNLHNNLCWRGKSDGRDLWAWSEEYLRAYHGVVGQRE

Query:  SRCSLQPCPSRPVEQSSWTPPVGGGFKLNTDAS-------------VRPDTVLLAACLD--LPRCWSVDLAEGWALVKGVELVLQMGFFNFCVEVDSLRL
                 S P  + SW PP    +K+N D +             +R    L+ A L   +  C S ++ E  A  + ++  L++G F+   E DS  +
Subjt:  SRCSLQPCPSRPVEQSSWTPPVGGGFKLNTDAS-------------VRPDTVLLAACLD--LPRCWSVDLAEGWALVKGVELVLQMGFFNFCVEVDSLRL

Query:  VR
        +R
Subjt:  VR

A0A2N9GLG8 Reverse transcriptase domain-containing protein2.5e-10331.05Show/hide
Query:  IVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWSF
        + L+PK+K P  V+++RPISLCN  YKLISK + NR+K +LP +I+  QS F+PGR + DN ++ FE +H +  +  G+    ALKLDMSKAYDR+EWSF
Subjt:  IVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWSF

Query:  LRTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCF--CS---------------------------------------V
        LR V+ +MGF  QW  L++ C+++VS+S  +NGE  G++ PSRGLRQGDP+SPY F  C+                                        
Subjt:  LRTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCF--CS---------------------------------------V

Query:  WRASVMEAVTIWDLLIRYERASGQMINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHKQY--LPSFMPRNRLGTLKFIKDRVWKQIQSWKGKFFSLGG
         RA+  E   I DLL  YE+ASGQ +N  K+ + FS NT + +Q  I ++L V     +++Y  LPS + + ++     IKDRVW +++ WK K  S  G
Subjt:  WRASVMEAVTIWDLLIRYERASGQMINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHKQY--LPSFMPRNRLGTLKFIKDRVWKQIQSWKGKFFSLGG

Query:  KEVLLKSIIQAIPCYTMNCFRLPYCLIREIHWAMARFWWNGSEEVNRIHW-------SVVGLGAAGSR---------VQVEDW-----------------
        +E+L+K++IQAIP YTMNCF+LP  L ++I   M RFWW   ++  ++HW          G G  G R         +  + W                 
Subjt:  KEVLLKSIIQAIPCYTMNCFRLPYCLIREIHWAMARFWWNGSEEVNRIHW-------SVVGLGAAGSR---------VQVEDW-----------------

Query:  ----------------------------------EWSV-----YPIYGSNWVVDNPSLRVQS-APSLPLSSRVCDLFSPSG-QWDEVKIRTHFLGPECEA
                                           W V      PI  +NW++D    RV S  P  P  S+V  L   S  +WD  KIR  FL  + EA
Subjt:  ----------------------------------EWSV-----YPIYGSNWVVDNPSLRVQS-APSLPLSSRVCDLFSPSG-QWDEVKIRTHFLGPECEA

Query:  ILRIPLRSGLLDDRLIWHFEKHGMFSMKSGYRLAFSLVSQVCPSSSDSERWLIWWSRLWRLGVPNKHKVFLWRLSLERLPTKGNVALLEIF---------
        IL+IP+ S    D+LIWH  + G +S++SGY +    V    P SS        W  +W +  P K + FLWR   E LP+K  ++  +I          
Subjt:  ILRIPLRSGLLDDRLIWHFEKHGMFSMKSGYRLAFSLVSQVCPSSSDSERWLIWWSRLWRLGVPNKHKVFLWRLSLERLPTKGNVALLEIF---------

Query:  ----SVIPVVIPLDLVDVIW--------VLKEKLG--------------ALDFELVTVFWWSVWNLHNNLCWRGKSDGRDLWAWSEEYLRAYHGVVGQRE
              +  +     ++  W        + K++ G              AL  E      W +W+  N       SD         E L   H  +  +E
Subjt:  ----SVIPVVIPLDLVDVIW--------VLKEKLG--------------ALDFELVTVFWWSVWNLHNNLCWRGKSDGRDLWAWSEEYLRAYHGVVGQRE

Query:  SRCSLQPCPSRPVEQSSWTPPVGGGFKLNTDAS-------------VRPDTVLLAACLD--LPRCWSVDLAEGWALVKGVELVLQMGFFNFCVEVDSLRL
                 S P  + SW PP    +K+N D +             +R    L+ A L   +  C S ++ E  A  + ++  L++G F+   E DS  +
Subjt:  SRCSLQPCPSRPVEQSSWTPPVGGGFKLNTDAS-------------VRPDTVLLAACLD--LPRCWSVDLAEGWALVKGVELVLQMGFFNFCVEVDSLRL

Query:  VR
        +R
Subjt:  VR

A0A2N9H0J9 Uncharacterized protein2.5e-10331.05Show/hide
Query:  IVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWSF
        + L+PK+K P  V+++RPISLCN  YKLISK + NR+K +LP +I+  QS F+PGR + DN ++ FE +H +  +  G+    ALKLDMSKAYDR+EWSF
Subjt:  IVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWSF

Query:  LRTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCF--CS---------------------------------------V
        LR V+ +MGF  QW  L++ C+++VS+S  +NGE  G++ PSRGLRQGDP+SPY F  C+                                        
Subjt:  LRTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCF--CS---------------------------------------V

Query:  WRASVMEAVTIWDLLIRYERASGQMINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHKQY--LPSFMPRNRLGTLKFIKDRVWKQIQSWKGKFFSLGG
         RA+  E   I DLL  YE+ASGQ +N  K+ + FS NT + +Q  I ++L V     +++Y  LPS + + ++     IKDRVW +++ WK K  S  G
Subjt:  WRASVMEAVTIWDLLIRYERASGQMINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHKQY--LPSFMPRNRLGTLKFIKDRVWKQIQSWKGKFFSLGG

Query:  KEVLLKSIIQAIPCYTMNCFRLPYCLIREIHWAMARFWWNGSEEVNRIHW-------SVVGLGAAGSR---------VQVEDW-----------------
        +E+L+K++IQAIP YTMNCF+LP  L ++I   M RFWW   ++  ++HW          G G  G R         +  + W                 
Subjt:  KEVLLKSIIQAIPCYTMNCFRLPYCLIREIHWAMARFWWNGSEEVNRIHW-------SVVGLGAAGSR---------VQVEDW-----------------

Query:  ----------------------------------EWSV-----YPIYGSNWVVDNPSLRVQS-APSLPLSSRVCDLFSPSG-QWDEVKIRTHFLGPECEA
                                           W V      PI  +NW++D    RV S  P  P  S+V  L   S  +WD  KIR  FL  + EA
Subjt:  ----------------------------------EWSV-----YPIYGSNWVVDNPSLRVQS-APSLPLSSRVCDLFSPSG-QWDEVKIRTHFLGPECEA

Query:  ILRIPLRSGLLDDRLIWHFEKHGMFSMKSGYRLAFSLVSQVCPSSSDSERWLIWWSRLWRLGVPNKHKVFLWRLSLERLPTKGNVALLEIF---------
        IL+IP+ S    D+LIWH  + G +S++SGY +    V    P SS        W  +W +  P K + FLWR   E LP+K  ++  +I          
Subjt:  ILRIPLRSGLLDDRLIWHFEKHGMFSMKSGYRLAFSLVSQVCPSSSDSERWLIWWSRLWRLGVPNKHKVFLWRLSLERLPTKGNVALLEIF---------

Query:  ----SVIPVVIPLDLVDVIW--------VLKEKLG--------------ALDFELVTVFWWSVWNLHNNLCWRGKSDGRDLWAWSEEYLRAYHGVVGQRE
              +  +     ++  W        + K++ G              AL  E      W +W+  N       SD         E L   H  +  +E
Subjt:  ----SVIPVVIPLDLVDVIW--------VLKEKLG--------------ALDFELVTVFWWSVWNLHNNLCWRGKSDGRDLWAWSEEYLRAYHGVVGQRE

Query:  SRCSLQPCPSRPVEQSSWTPPVGGGFKLNTDAS-------------VRPDTVLLAACLD--LPRCWSVDLAEGWALVKGVELVLQMGFFNFCVEVDSLRL
                 S P  + SW PP    +K+N D +             +R    L+ A L   +  C S ++ E  A  + ++  L++G F+   E DS  +
Subjt:  SRCSLQPCPSRPVEQSSWTPPVGGGFKLNTDAS-------------VRPDTVLLAACLD--LPRCWSVDLAEGWALVKGVELVLQMGFFNFCVEVDSLRL

Query:  VR
        +R
Subjt:  VR

A0A2N9IP69 Reverse transcriptase domain-containing protein6.7e-10431.6Show/hide
Query:  IVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWSF
        + L+PK+K P +V+D+RPISLCN  Y+LISK + NR K +LP +IS  QS F+PGR + DN ++ FE +H +  +  GK    ALKLDMSKAYDR+EW+F
Subjt:  IVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWSF

Query:  LRTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCF--CS---------------------------------------V
        L+ V+ +MGF   W  LI+ C+S+VS+S  +NGE  GN+IPSRGLRQGDP+SPY F  C+                                        
Subjt:  LRTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCF--CS---------------------------------------V

Query:  WRASVMEAVTIWDLLIRYERASGQMINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHKQY--LPSFMPRNRLGTLKFIKDRVWKQIQSWKGKFFSLGG
         RA+  E   I ++L  YER SGQ +N  K+ + FS NT + +Q  I  +L V     +++Y  LPS + + ++     IK+RVW +++ WK K  S  G
Subjt:  WRASVMEAVTIWDLLIRYERASGQMINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHKQY--LPSFMPRNRLGTLKFIKDRVWKQIQSWKGKFFSLGG

Query:  KEVLLKSIIQAIPCYTMNCFRLPYCLIREIHWAMARFWWNGSEEVNRIHW----------SVVGLG----------------------------------
        +E+L+K+++QAIP YTMNCF+LP  L ++I   + RFWW   E   +IHW           V GLG                                  
Subjt:  KEVLLKSIIQAIPCYTMNCFRLPYCLIREIHWAMARFWWNGSEEVNRIHW----------SVVGLG----------------------------------

Query:  ------------------------------AAGSRVQVEDWEWSVYPIYGSNWVVDNPSLRVQS-APSLPLSSRVCDLFSPS-GQWDEVKIRTHFLGPEC
                                       AGS  +V D +    PI  +NW+++    RV S  P+L   ++V DL   S   WDE KIR+ FL  + 
Subjt:  ------------------------------AAGSRVQVEDWEWSVYPIYGSNWVVDNPSLRVQS-APSLPLSSRVCDLFSPS-GQWDEVKIRTHFLGPEC

Query:  EAILRIPLRSGLLDDRLIWHFEKHGMFSMKSGYRLAFSLVSQVCPSSSDSERWLIWWSRLWRLGVPNKHKVFLWRLSLERLPTKGNV-------------
        +AIL+IPL      D+L WH   +G +S++SGY+L         P+SS+       W ++W L  P K K F+WR   E LPTK  +             
Subjt:  EAILRIPLRSGLLDDRLIWHFEKHGMFSMKSGYRLAFSLVSQVCPSSSDSERWLIWWSRLWRLGVPNKHKVFLWRLSLERLPTKGNV-------------

Query:  ----------ALL------EIFSVIPVVIP------LDLVDVIWVLKEKLGALDFELVTVFWWSVWNLHNNLCWR-GKSDGRDLWAWSEEYLRAYHGVVG
                  ALL      +++S++P +            D++  +  K   L  E   V  W +W+  N    R   +D   LW  +  YL  +   + 
Subjt:  ----------ALL------EIFSVIPVVIP------LDLVDVIWVLKEKLGALDFELVTVFWWSVWNLHNNLCWR-GKSDGRDLWAWSEEYLRAYHGVVG

Query:  QRESRCSLQPCPSRPVEQSSWTPPVGGGFKLNTDASVRPDT------VLLAACLDLPRCW---------SVDLAEGWALVKGVELVLQMGFFNFCVEVDS
          +   +++P P  P+ +  W+PP+  GFK+N D ++  D       V++  C  L              VDL E  A  + +   +++G  +   E DS
Subjt:  QRESRCSLQPCPSRPVEQSSWTPPVGGGFKLNTDASVRPDT------VLLAACLDLPRCW---------SVDLAEGWALVKGVELVLQMGFFNFCVEVDS

Query:  LRLVRIL
          +++ L
Subjt:  LRLVRIL

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein9.2e-1824.41Show/hide
Query:  IVLVPKI-KAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWS
        I+L+PK  +   +  +FRPISL N   K+++K + NR++  + KLI  +Q GFIPG     N       I  + R          + +D  KA+D+I+  
Subjt:  IVLVPKI-KAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWS

Query:  FLRTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCFCSVWRA------------------------------------S
        F+   +N++G    +  +I       + +  LNG++L       G RQG PLSP  F  V                                        
Subjt:  FLRTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCFCSVWRA------------------------------------S

Query:  VMEAVTIWDLLIRYERASGQMINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHKQYLPSFMPRNRLGTLKFIKDRVWKQIQ----SWKGKFFSLGGKE
        ++ A  +  L+  + + SG  IN +KS  AF  N    ++  I   L  +      +YL   + R+     K     + K+I+     WK    S  G+ 
Subjt:  VMEAVTIWDLLIRYERASGQMINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHKQYLPSFMPRNRLGTLKFIKDRVWKQIQ----SWKGKFFSLGGKE

Query:  VLLKSIIQAIPCYTMNC--FRLPYCLIREIHWAMARFWWN
         ++K  I     Y  N    +LP     E+     +F WN
Subjt:  VLLKSIIQAIPCYTMNC--FRLPYCLIREIHWAMARFWWN

P08548 LINE-1 reverse transcriptase homolog3.5e-1723.24Show/hide
Query:  IVLVPKI-KAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWS
        I L+PK  K P R  ++RPISL N   K+++K + NR++  + K+I  +Q GFIPG     N       I  + +          L +D  KA+D I+  
Subjt:  IVLVPKI-KAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWS

Query:  FLRTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCF-----------------------------------CSVWRASV
        F+   + ++G    +  LI    S  + +  LNG +L +     G RQG PLSP  F                                     V+  + 
Subjt:  FLRTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCF-----------------------------------CSVWRASV

Query:  MEAVT-IWDLLIRYERASGQMINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHKQYLPSFMPRNRLGTLK----FIKDRVWKQIQSWKGKFFSLGGKE
         ++ T + +++  Y   SG  IN  KS VAF       +++ +   +  +  P   +YL  ++ ++     K     ++  + + +  WK    S  G+ 
Subjt:  MEAVT-IWDLLIRYERASGQMINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHKQYLPSFMPRNRLGTLK----FIKDRVWKQIQSWKGKFFSLGGKE

Query:  VLLKSIIQAIPCYTMNC--FRLPYCLIREIHWAMARFWWN
         ++K  I     Y  N    + P    +++   +  F WN
Subjt:  VLLKSIIQAIPCYTMNC--FRLPYCLIREIHWAMARFWWN

P11369 LINE-1 retrotransposable element ORF2 protein1.6e-1733.76Show/hide
Query:  IVLVPK-IKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWS
        I L+PK  K P ++ +FRPISL N   K+++K + NR++  +  +I  +Q GFIPG     N       IH + +          + LD  KA+D+I+  
Subjt:  IVLVPK-IKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWS

Query:  FLRTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCF
        F+  V+ R G    + ++I    S    +  +NGE+L  +    G RQG PLSPY F
Subjt:  FLRTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCF

P14381 Transposon TX1 uncharacterized 149 kDa protein2.4e-1823.5Show/hide
Query:  MIVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWS
        ++ L+PK    R + ++RP+SL +  YK+++KA+  R+K +L ++I  +QS  +PGR + DN  L  + +H   RRTG     A L LD  KA+DR++  
Subjt:  MIVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWS

Query:  FLRTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCFC-------------------------SVWRASVMEAVTIWDLL
        +L   +    F  Q+   +    +S      +N      +   RG+RQG PLS   +                           V  A   + + +   L
Subjt:  FLRTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCFC-------------------------SVWRASVMEAVTIWDLL

Query:  IRYER----------ASGQMINYEKS--------VVAFSPNTGEDSQ------QYISHVLSVSRCPCHKQYLPSFMPRNRLGTLKFIKDRVWKQIQSWKG
        +  ER          AS   IN+ KS         V F P    D        +Y+   LS    P  + ++              +++ V  ++  WKG
Subjt:  IRYER----------ASGQMINYEKS--------VVAFSPNTGEDSQ------QYISHVLSVSRCPCHKQYLPSFMPRNRLGTLKFIKDRVWKQIQSWKG

Query:  --KFFSLGGKEVLLKSIIQAIPCYTMNCFRLPYCLIREIHWAMARFWWNGSEEVNRIHWSVVGLGA
          K  S+ G+ +++  ++ +   Y + C       I +I   +  F W G       HW   G+ +
Subjt:  --KFFSLGGKEVLLKSIIQAIPCYTMNCFRLPYCLIREIHWAMARFWWNGSEEVNRIHWSVVGLGA

P16423 Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM1.6e-0927.1Show/hide
Query:  VLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWSFL
        V +PK    +R  DFRPIS+ +   + ++  +  R+   +       Q GF+P     DNA +    +    +    +S + A  LD+SKA+D +  + +
Subjt:  VLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWSFL

Query:  RTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCF
           +   G  + + D +         S N +G      +P+RG++QGDPLSP  F
Subjt:  RTVINRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCF

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein3.4e-0728.99Show/hide
Query:  VQSAPSLPLSSR-------VCDLFSPSGQ---WDEVKIRTHFLGPECEAILRIPLRSGLLDDRLIWHFEKHGMFSMKSGYRLAFSLVSQVCPSSSDSERW
        V S P  PL++        + +LF   G    WD+ KI       +   I RI L      D++IW++   G ++++SGY L     S   P+ +     
Subjt:  VQSAPSLPLSSR-------VCDLFSPSGQ---WDEVKIRTHFLGPECEAILRIPLRSGLLDDRLIWHFEKHGMFSMKSGYRLAFSLVSQVCPSSSDSERW

Query:  LIWWSRLWRLGVPNKHKVFLWR------LSLERLPTKG
        +   +R+W L +  K K FLWR       + ERL T+G
Subjt:  LIWWSRLWRLGVPNKHKVFLWR------LSLERLPTKG

AT4G20520.1 RNA binding;RNA-directed DNA polymerases9.8e-1542.05Show/hide
Query:  VVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWSFLRTVINRMGFAQQWTDLILR
        +V R+K ++  LI   Q+ FIPGR   DN +   E +H +RR+ G K  W  LKLD+ KAYDRI W +L   +   GF + W   I R
Subjt:  VVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWSFLRTVINRMGFAQQWTDLILR

AT4G29090.1 Ribonuclease H-like superfamily protein1.6e-0442.5Show/hide
Query:  AIPCYTMNCFRLPYCLIREIHWAMARFWWNGSEEVNRIHW
        A+P YTM CF LP  + ++I   +A FWW   +E   +HW
Subjt:  AIPCYTMNCFRLPYCLIREIHWAMARFWWNGSEEVNRIHW

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)4.6e-0471.43Show/hide
Query:  FNLNGERLGNVIPSRGLRQGDPLSPYCF
        F +NG   G V PSRGLRQGDPLSPY F
Subjt:  FNLNGERLGNVIPSRGLRQGDPLSPYCF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTGTTCTTGTTCCGAAGATCAAGGCCCCTCGGCGAGTTTCTGATTTTCGGCCCATCTCTCTATGCAATTTTAGCTATAAGCTGATTTCAAAGGCAGTGGTTAATAG
GATGAAGCATATCCTTCCTAAACTTATTTCGTCCAACCAGAGTGGCTTTATTCCTGGGAGGTGTGTGGTGGATAATGCCATCTTAGGGTTTGAATGCATTCATGAGTTAA
GGAGGCGGACTGGAGGAAAGTCTAAATGGGCTGCTCTAAAACTTGACATGAGCAAAGCATACGACAGGATAGAGTGGTCGTTTCTGCGGACAGTTATAAATAGAATGGGT
TTCGCTCAACAGTGGACTGATTTGATTCTCCGGTGCGTTAGCTCGGTTTCCTTTTCGTTTAACCTAAATGGGGAGAGGTTGGGGAATGTGATTCCTTCTCGTGGGCTCAG
GCAGGGAGACCCGCTGTCTCCGTATTGTTTTTGCTCTGTGTGGAGGGCAAGCGTTATGGAAGCAGTGACTATCTGGGATCTGTTGATCCGCTATGAGCGAGCCTCGGGTC
AGATGATTAACTATGAGAAGTCAGTGGTTGCTTTCAGTCCGAACACTGGTGAGGACTCACAACAGTATATCAGTCATGTGCTCTCGGTATCTCGGTGTCCGTGTCATAAA
CAATACCTTCCCTCATTTATGCCTAGGAATCGCTTGGGTACGTTGAAGTTTATTAAGGACCGTGTCTGGAAGCAGATTCAGAGTTGGAAGGGTAAGTTCTTTTCCTTGGG
TGGTAAGGAAGTCCTTCTAAAGTCTATCATTCAGGCCATCCCTTGCTACACGATGAATTGCTTTCGTCTGCCCTATTGCCTGATTAGAGAAATCCATTGGGCCATGGCTA
GGTTCTGGTGGAATGGTTCTGAGGAGGTGAATAGGATCCATTGGTCTGTTGTGGGGTTGGGAGCTGCTGGATCGAGGGTGCAGGTGGAGGATTGGGAATGGTCGGTCTAC
CCCATATATGGTTCGAATTGGGTGGTGGATAATCCGTCTCTACGTGTGCAGTCTGCTCCTTCACTTCCTTTATCCAGTAGGGTCTGTGATCTGTTTTCTCCGTCGGGACA
GTGGGACGAGGTCAAGATTCGTACCCATTTTTTGGGGCCTGAGTGTGAGGCCATTCTAAGGATTCCCTTGCGCTCTGGTCTGCTAGACGATCGACTTATTTGGCATTTTG
AGAAGCATGGCATGTTCTCTATGAAGAGTGGGTATAGGTTGGCTTTCTCTTTGGTGTCTCAGGTGTGTCCGTCTTCTTCTGATTCTGAGCGATGGCTGATTTGGTGGTCT
AGGTTATGGAGGCTTGGGGTCCCGAATAAGCACAAGGTCTTTTTATGGCGTCTCTCCCTTGAGCGGCTGCCCACAAAGGGAAATGTGGCTCTGCTTGAAATTTTCTCAGT
TATACCAGTCGTTATACCATTGGATCTTGTTGATGTCATCTGGGTATTGAAGGAGAAGTTAGGTGCATTAGACTTCGAGCTTGTGACAGTGTTCTGGTGGTCAGTTTGGA
ATTTGCATAACAATTTGTGTTGGAGGGGAAAATCTGATGGTCGGGATTTGTGGGCATGGTCTGAAGAGTATCTGAGGGCGTATCATGGTGTTGTCGGGCAGCGGGAGTCT
CGCTGCAGTTTGCAGCCTTGCCCCAGTCGGCCGGTCGAGCAGTCTTCATGGACTCCCCCGGTGGGCGGTGGTTTCAAGCTGAACACCGATGCCTCTGTCAGGCCTGATAC
TGTGCTTCTAGCGGCATGCTTGGACTTGCCTAGGTGCTGGAGTGTGGATCTGGCTGAAGGTTGGGCATTGGTGAAGGGCGTGGAGTTAGTGTTACAGATGGGTTTCTTCA
ATTTCTGTGTGGAAGTGGATTCATTAAGACTGGTTCGAATTTTACATGGGGAGTTGACTGACTCTTCAGAGGTGACGGTCGAGTTTGGGCATTTCGAGAACGGGAGCGAC
CCAAGGAACCATCTATACTGGGAATGCGATCAGACTGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGATTGTTCTTGTTCCGAAGATCAAGGCCCCTCGGCGAGTTTCTGATTTTCGGCCCATCTCTCTATGCAATTTTAGCTATAAGCTGATTTCAAAGGCAGTGGTTAATAG
GATGAAGCATATCCTTCCTAAACTTATTTCGTCCAACCAGAGTGGCTTTATTCCTGGGAGGTGTGTGGTGGATAATGCCATCTTAGGGTTTGAATGCATTCATGAGTTAA
GGAGGCGGACTGGAGGAAAGTCTAAATGGGCTGCTCTAAAACTTGACATGAGCAAAGCATACGACAGGATAGAGTGGTCGTTTCTGCGGACAGTTATAAATAGAATGGGT
TTCGCTCAACAGTGGACTGATTTGATTCTCCGGTGCGTTAGCTCGGTTTCCTTTTCGTTTAACCTAAATGGGGAGAGGTTGGGGAATGTGATTCCTTCTCGTGGGCTCAG
GCAGGGAGACCCGCTGTCTCCGTATTGTTTTTGCTCTGTGTGGAGGGCAAGCGTTATGGAAGCAGTGACTATCTGGGATCTGTTGATCCGCTATGAGCGAGCCTCGGGTC
AGATGATTAACTATGAGAAGTCAGTGGTTGCTTTCAGTCCGAACACTGGTGAGGACTCACAACAGTATATCAGTCATGTGCTCTCGGTATCTCGGTGTCCGTGTCATAAA
CAATACCTTCCCTCATTTATGCCTAGGAATCGCTTGGGTACGTTGAAGTTTATTAAGGACCGTGTCTGGAAGCAGATTCAGAGTTGGAAGGGTAAGTTCTTTTCCTTGGG
TGGTAAGGAAGTCCTTCTAAAGTCTATCATTCAGGCCATCCCTTGCTACACGATGAATTGCTTTCGTCTGCCCTATTGCCTGATTAGAGAAATCCATTGGGCCATGGCTA
GGTTCTGGTGGAATGGTTCTGAGGAGGTGAATAGGATCCATTGGTCTGTTGTGGGGTTGGGAGCTGCTGGATCGAGGGTGCAGGTGGAGGATTGGGAATGGTCGGTCTAC
CCCATATATGGTTCGAATTGGGTGGTGGATAATCCGTCTCTACGTGTGCAGTCTGCTCCTTCACTTCCTTTATCCAGTAGGGTCTGTGATCTGTTTTCTCCGTCGGGACA
GTGGGACGAGGTCAAGATTCGTACCCATTTTTTGGGGCCTGAGTGTGAGGCCATTCTAAGGATTCCCTTGCGCTCTGGTCTGCTAGACGATCGACTTATTTGGCATTTTG
AGAAGCATGGCATGTTCTCTATGAAGAGTGGGTATAGGTTGGCTTTCTCTTTGGTGTCTCAGGTGTGTCCGTCTTCTTCTGATTCTGAGCGATGGCTGATTTGGTGGTCT
AGGTTATGGAGGCTTGGGGTCCCGAATAAGCACAAGGTCTTTTTATGGCGTCTCTCCCTTGAGCGGCTGCCCACAAAGGGAAATGTGGCTCTGCTTGAAATTTTCTCAGT
TATACCAGTCGTTATACCATTGGATCTTGTTGATGTCATCTGGGTATTGAAGGAGAAGTTAGGTGCATTAGACTTCGAGCTTGTGACAGTGTTCTGGTGGTCAGTTTGGA
ATTTGCATAACAATTTGTGTTGGAGGGGAAAATCTGATGGTCGGGATTTGTGGGCATGGTCTGAAGAGTATCTGAGGGCGTATCATGGTGTTGTCGGGCAGCGGGAGTCT
CGCTGCAGTTTGCAGCCTTGCCCCAGTCGGCCGGTCGAGCAGTCTTCATGGACTCCCCCGGTGGGCGGTGGTTTCAAGCTGAACACCGATGCCTCTGTCAGGCCTGATAC
TGTGCTTCTAGCGGCATGCTTGGACTTGCCTAGGTGCTGGAGTGTGGATCTGGCTGAAGGTTGGGCATTGGTGAAGGGCGTGGAGTTAGTGTTACAGATGGGTTTCTTCA
ATTTCTGTGTGGAAGTGGATTCATTAAGACTGGTTCGAATTTTACATGGGGAGTTGACTGACTCTTCAGAGGTGACGGTCGAGTTTGGGCATTTCGAGAACGGGAGCGAC
CCAAGGAACCATCTATACTGGGAATGCGATCAGACTGAGTAG
Protein sequenceShow/hide protein sequence
MIVLVPKIKAPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISSNQSGFIPGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWSFLRTVINRMG
FAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYCFCSVWRASVMEAVTIWDLLIRYERASGQMINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHK
QYLPSFMPRNRLGTLKFIKDRVWKQIQSWKGKFFSLGGKEVLLKSIIQAIPCYTMNCFRLPYCLIREIHWAMARFWWNGSEEVNRIHWSVVGLGAAGSRVQVEDWEWSVY
PIYGSNWVVDNPSLRVQSAPSLPLSSRVCDLFSPSGQWDEVKIRTHFLGPECEAILRIPLRSGLLDDRLIWHFEKHGMFSMKSGYRLAFSLVSQVCPSSSDSERWLIWWS
RLWRLGVPNKHKVFLWRLSLERLPTKGNVALLEIFSVIPVVIPLDLVDVIWVLKEKLGALDFELVTVFWWSVWNLHNNLCWRGKSDGRDLWAWSEEYLRAYHGVVGQRES
RCSLQPCPSRPVEQSSWTPPVGGGFKLNTDASVRPDTVLLAACLDLPRCWSVDLAEGWALVKGVELVLQMGFFNFCVEVDSLRLVRILHGELTDSSEVTVEFGHFENGSD
PRNHLYWECDQTE