; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036229 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036229
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr3:42042353..42047699
RNA-Seq ExpressionLag0036229
SyntenyLag0036229
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG2711776.1 hypothetical protein I3760_04G092800 [Carya illinoinensis]7.8e-11130.39Show/hide
Query:  MNSVLSRAPRKVTDEMNQMLLAPYTREEVTSAIRSFHPTKAPGPDDYPALFYQKYWDVVGHTTMSNCLAILNSDGLVLA---------------------
        M  +LS    +VT EMN+ LL PY  EEV  A++  HP+KAPGPD    LF+QKYW V+G++  +  L+ LNS G+V                       
Subjt:  MNSVLSRAPRKVTDEMNQMLLAPYTREEVTSAIRSFHPTKAPGPDDYPALFYQKYWDVVGHTTMSNCLAILNSDGLVLA---------------------

Query:  -NRLKIVLNEIIDECQSTFIPGRSITDNMILGHETLHYLQHKRKGKIGYAALKLYMSKAYDRVEWPYLGQIMEKLGFHVHWISLVMKCVTTTTFSILING
           + ++L ++I + QS F+PGR I+DN+++ +E LH+L+ KRKG+ G+ +LKL MS AYDRVEW +L +IME LGF    ISLVM CV T +FS+L+NG
Subjt:  -NRLKIVLNEIIDECQSTFIPGRSITDNMILGHETLHYLQHKRKGKIGYAALKLYMSKAYDRVEWPYLGQIMEKLGFHVHWISLVMKCVTTTTFSILING

Query:  ESFGFIKPSRGIRQGDLSSPYLFLLCPEGLSALLVSARLRSMV-GVSIARSCPKISHLFFADDSLIFLKATVEKFGQFQAILMDYERAYGQCVNFSKLMV
           G I PSRG+RQGD  SPYLFLLC EGL +LL     R  V G+ I R  P+I+HL FADDS+   KA V    + Q +L  YERA GQC+N  K  +
Subjt:  ESFGFIKPSRGIRQGDLSSPYLFLLCPEGLSALLVSARLRSMV-GVSIARSCPKISHLFFADDSLIFLKATVEKFGQFQAILMDYERAYGQCVNFSKLMV

Query:  FFSRNVPNDSRQHLSNILSMRVNESLGSYLGLPSTFHRGKTRDFKFLLDKVWRRM---------------------------------------------
         FS+NV +D ++ +  +      +    YLGLP    R K + F  +  +VW+++                                             
Subjt:  FFSRNVPNDSRQHLSNILSMRVNESLGSYLGLPSTFHRGKTRDFKFLLDKVWRRM---------------------------------------------

Query:  ---------------HWHRLENLCKPKEVGGLNFRDLVNFNQAMLAKQAWRVLTNSNLTVSKVLCGRYFPSSSVLTGTVSASSPFFWKGFVWGMDLLNCG
                       HW   E LC  K  GG+ FRDL  FN A+LAKQ WR+L N +  + KV   +YFPSSS+      A+S + WKG +  +D L  G
Subjt:  ---------------HWHRLENLCKPKEVGGLNFRDLVNFNQAMLAKQAWRVLTNSNLTVSKVLCGRYFPSSSVLTGTVSASSPFFWKGFVWGMDLLNCG

Query:  LRKNLGNGLDVNVIK---------------------------------------------------CLPISSSTPDNWIWHYDGKGEYSVKSGYKL----
         R  +GNG  + + K                                                    L IS ++ D+W W ++  G +SVKS Y+     
Subjt:  LRKNLGNGLDVNVIK---------------------------------------------------CLPISSSTPDNWIWHYDGKGEYSVKSGYKL----

Query:  -SMLKCQEASLSDVEKETTWWQRVWKMRVPSKVKIFIGKTFHNSIPTMVNLWNHHVPINENFSVCHEEMETTDHALFQCTRAREIPNQTIRCEWINYYLS
          +L  Q +S S    E  +W+ +W +++P K+KIF  +     +PT +NL   HV  +    +C++ ME   HAL+ C+  R +    + C  ++   S
Subjt:  -SMLKCQEASLSDVEKETTWWQRVWKMRVPSKVKIFIGKTFHNSIPTMVNLWNHHVPINENFSVCHEEMETTDHALFQCTRAREIPNQTIRCEWINYYLS

Query:  EFL------KANPKGGS----------------------------FVKTEEDILQLISDGEDI------------------------IIHTDASVMGTLS
        E         A  +G                              F  T  + L L  + E +                         ++ D +    LS
Subjt:  EFL------KANPKGGS----------------------------FVKTEEDILQLISDGEDI------------------------IIHTDASVMGTLS

Query:  NCSIGIVMRDKQG-LLKAVQNLSSQVSNSPLGVEVVAVLEGMHLARSLNVQRLTVLSDSLTLINSINENIQGEACIAATLWDIK----AIQNSFVKHDMN
           +G+V+R+ +G ++ A   +  +VS++   +E VA+L G+ L     + +L + +D L L+N++N N +     A  L DI+    A+Q   V H   
Subjt:  NCSIGIVMRDKQG-LLKAVQNLSSQVSNSPLGVEVVAVLEGMHLARSLNVQRLTVLSDSLTLINSINENIQGEACIAATLWDIK----AIQNSFVKHDMN

Query:  KGNTCYPDLGASSHVINDLSNFY-FVSEYQANSEFTDEKEV
         GN     L   + +I+D+  ++ F   + + + + D+ ++
Subjt:  KGNTCYPDLGASSHVINDLSNFY-FVSEYQANSEFTDEKEV

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]3.6e-13235.57Show/hide
Query:  MNSVLSRAPRKVTDEMNQMLLAPYTREEVTSAIRSFHPTKAPGPDDYPALFYQKYWDVVGHTTMSNCLAILNSDGLV-----------------------
        + ++++  P ++T E+N+ LLAPYT+EE+  AIR   PTKA GPD +PALFYQ YW VVG  T+  CL  LN+   +                       
Subjt:  MNSVLSRAPRKVTDEMNQMLLAPYTREEVTSAIRSFHPTKAPGPDDYPALFYQKYWDVVGHTTMSNCLAILNSDGLV-----------------------

Query:  ----------------LANRLKIVLNEIIDECQSTFIPGRSITDNMILGHETLHYLQHKRKGKIGYAALKLYMSKAYDRVEWPYLGQIMEKLGFHVHWIS
                        + NRLK V+  +I + QS F+P R+I+DN+I+GHE LH +   + G IG AALKL +SKA+DRVEW YL  IM K+GF+  WI 
Subjt:  ----------------LANRLKIVLNEIIDECQSTFIPGRSITDNMILGHETLHYLQHKRKGKIGYAALKLYMSKAYDRVEWPYLGQIMEKLGFHVHWIS

Query:  LVMKCVTTTTFSILINGESFGFIKPSRGIRQGDLSSPYLFLLCPEGLSALLVSARLRS-MVGVSIARSCPKISHLFFADDSLIFLKATVEKFGQFQAILM
         +++C++T  FSI +NG   G  +PSRGIRQGD  SPYLFLLC EGLSAL+        + G+    +   I+HL FADDSLIFL++   +    + +L 
Subjt:  LVMKCVTTTTFSILINGESFGFIKPSRGIRQGDLSSPYLFLLCPEGLSALLVSARLRS-MVGVSIARSCPKISHLFFADDSLIFLKATVEKFGQFQAILM

Query:  DYERAYGQCVNFSKLMVFFSRNVPNDSRQHLSNILSMRVNESLGSYLGLPSTFHRGKTRDFKFLLDKVWRRMHWHRLENLCKPKEVGGLNFRDLVNFNQA
         Y RA GQC+NFSK  + FS NV  + +Q+L  IL++++    G+YLGLPS F R +            R++HW +   +C PKE GGLNFRDL  FNQA
Subjt:  DYERAYGQCVNFSKLMVFFSRNVPNDSRQHLSNILSMRVNESLGSYLGLPSTFHRGKTRDFKFLLDKVWRRMHWHRLENLCKPKEVGGLNFRDLVNFNQA

Query:  MLAKQAWRVLTNSNLTVSKVLCGRYFPSSSVLTGTVSASSPFFWKGFVWGMDLLNCGLRKNLGNGL----------------------------------
        ++AK  WR L + NL VSKVL  +YF  +S+L  + ++ S +FWKGF+WG DLL  GLR  +GNG                                   
Subjt:  MLAKQAWRVLTNSNLTVSKVLCGRYFPSSSVLTGTVSASSPFFWKGFVWGMDLLNCGLRKNLGNGL----------------------------------

Query:  ------------------DVNVIKCLPISS-STPDNWIWHYDGKGEYSVKSGYKLSM-LKCQEASLSDVEKETTWWQRVWKMRVPSKVKIFIGKTFHNSI
                          D ++I  +PISS +  D+W+WHYD +G YSV+SGYKL M LKC   S S      T W  +WK+ VP+K+KIFI ++ H  I
Subjt:  ------------------DVNVIKCLPISS-STPDNWIWHYDGKGEYSVKSGYKLSM-LKCQEASLSDVEKETTWWQRVWKMRVPSKVKIFIGKTFHNSI

Query:  PTMVNLWNHHVPINENFSVCHEEMETTDHALFQCTRAREI-----PNQT---------------------------------------------------
        PT  NL    +      ++C +  E+  HA F C RAR+I     P  T                                                   
Subjt:  PTMVNLWNHHVPINENFSVCHEEMETTDHALFQCTRAREI-----PNQT---------------------------------------------------

Query:  --IRCEWINYYLSEFLKANPKGGSFVKTEEDILQLI-----SDGEDIIIHTDASVMGTLSNCSIGIVMRDKQGLLKAVQNLSSQVSNSPLGVEVVAVLEG
           +CEW+  +L    +A     S  +T+ +   ++     S    + ++TDA+  G  ++ S G ++RD    L A  ++      SPL  E+  +LEG
Subjt:  --IRCEWINYYLSEFLKANPKGGSFVKTEEDILQLI-----SDGEDIIIHTDASVMGTLSNCSIGIVMRDKQGLLKAVQNLSSQVSNSPLGVEVVAVLEG

Query:  MHLARSLNVQRLTVLSDSLTLINSINENIQGEACIAATLWDIKAIQNSF
        +  A + N   L V SDSL  I  I   I         + +I+A+   F
Subjt:  MHLARSLNVQRLTVLSDSLTLINSINENIQGEACIAATLWDIKAIQNSF

XP_030483769.1 uncharacterized protein LOC115700339 [Cannabis sativa]7.1e-11232.14Show/hide
Query:  MNSVLSRAPRKVTDEMNQMLLAPYTREEVTSAIRSFHPTKAPGPDDYPALFYQKYWDVVGHTTMSNCLAILNSDGLVLANRL---KIVLNEIIDECQSTF
        ++ VLS  P  ++ + N+ LL  +T  +V +A+ S    K+PG D   A+FYQ YW +VG +     L +LN       + +   K VL+ +I E QS F
Subjt:  MNSVLSRAPRKVTDEMNQMLLAPYTREEVTSAIRSFHPTKAPGPDDYPALFYQKYWDVVGHTTMSNCLAILNSDGLVLANRL---KIVLNEIIDECQSTF

Query:  IPGRSITDNMILGHETLHYLQHKRKGKIGYAALKLYMSKAYDRVEWPYLGQIMEKLGFHVHWISLVMKCVTTTTFSILINGESFGFIKPSRGIRQGDLSS
        +  R ITDN+++  E +H L+H+++G  GYAALKL MSKA+DRVEW +L  +M K+GF++ WI+L+M C+ T +FS  INGE  G + P RG+RQGD  S
Subjt:  IPGRSITDNMILGHETLHYLQHKRKGKIGYAALKLYMSKAYDRVEWPYLGQIMEKLGFHVHWISLVMKCVTTTTFSILINGESFGFIKPSRGIRQGDLSS

Query:  PYLFLLCPEGLSALL-VSARLRSMVGVSIARSCPKISHLFFADDSLIFLKATVEKFGQFQAILMDYERAYGQCVNFSKLMVFFSRNVPNDSRQHLSNILS
        PYLFL+C EGLS LL     +  + G++++R  P ISHL FADDSL+F +AT    G  +  L  Y RA GQ +N  K ++ FS N P   +     IL 
Subjt:  PYLFLLCPEGLSALL-VSARLRSMVGVSIARSCPKISHLFFADDSLIFLKATVEKFGQFQAILMDYERAYGQCVNFSKLMVFFSRNVPNDSRQHLSNILS

Query:  MRVNESLGSYLGLPSTFHRGKTRDFKFLLDKVW------------------------------------------------------------RRMHWHR
        M + E   +YLGLP+   R K++ F  + +K+W                                                            +++HW +
Subjt:  MRVNESLGSYLGLPSTFHRGKTRDFKFLLDKVW------------------------------------------------------------RRMHWHR

Query:  LENLCKPKEVGGLNFRDLVNFNQAMLAKQAWRVLTNSNLTVSKVLCGRYFPSSSVLTGTVSASSPFFWKGFVWGMDLLNCGLRKNLGNGL----------
         + LCK K  GG+ FR  V+FNQA+LAKQAWR+    N  +S+VL GRY+P S  +T + +      W+G VWG +LL  GLR  +G+GL          
Subjt:  LENLCKPKEVGGLNFRDLVNFNQAMLAKQAWRVLTNSNLTVSKVLCGRYFPSSSVLTGTVSASSPFFWKGFVWGMDLLNCGLRKNLGNGL----------

Query:  -----------------------------------------DVNVIKCLPIS-SSTPDNWIWHYDGKGEYSVKSGYKL-SMLKCQEASLSDVEKETTWWQ
                                                 DV+ I  +P+S ++  D WIWH+D  G+YSV +GY   S L+ +E S      + TWW+
Subjt:  -----------------------------------------DVNVIKCLPIS-SSTPDNWIWHYDGKGEYSVKSGYKL-SMLKCQEASLSDVEKETTWWQ

Query:  RVWKMRVPSKVKIFIGKTFHNSIPTMVNLWNHHVPINENFSVCHEEMETTDHALFQCTRAREIPNQTIRCEWINY----------YLSEFLK--------
          W   +P+KVKIF  +   +SIP   +L++  V  +   S+C    ET  HALF C  A+E+   +  C   N+          YL    K        
Subjt:  RVWKMRVPSKVKIFIGKTFHNSIPTMVNLWNHHVPINENFSVCHEEMETTDHALFQCTRAREIPNQTIRCEWINY----------YLSEFLK--------

Query:  ----ANPKGGSFVKTEEDILQLISDGED-IIIHTDASVMGTLSNCSIGIVMRDKQGLLKAVQNLSSQVSNSPLGVEVVAVLEGMHLARSLNVQRLTVLSD
            ++P  G    ++  +++  +  E+ I ++ DA++  + S   IG+++R+  G + A  +  ++ +     +E  A+  G+  A  L++    V +D
Subjt:  ----ANPKGGSFVKTEEDILQLISDGED-IIIHTDASVMGTLSNCSIGIVMRDKQGLLKAVQNLSSQVSNSPLGVEVVAVLEGMHLARSLNVQRLTVLSD

Query:  SLTLINSIN
         L L+N+++
Subjt:  SLTLINSIN

XP_042962672.1 uncharacterized protein LOC122296942 [Carya illinoinensis]1.2e-11632.3Show/hide
Query:  MNSVLSRAPRKVTDEMNQMLLAPYTREEVTSAIRSFHPTKAPGPDDYPALFYQKYWDVVGHTTMSNCLAILNS----DGL--------------------
        M  VLS    +VT EMN+ LL PY  EEV  A++  HP+KAPGPD  P LF+QKYW ++G++  +  L+ LNS     GL                    
Subjt:  MNSVLSRAPRKVTDEMNQMLLAPYTREEVTSAIRSFHPTKAPGPDDYPALFYQKYWDVVGHTTMSNCLAILNS----DGL--------------------

Query:  ---------------VLANRLKIVLNEIIDECQSTFIPGRSITDNMILGHETLHYLQHKRKGKIGYAALKLYMSKAYDRVEWPYLGQIMEKLGFHVHWIS
                       V+ANRLK VL +II   QS F+PGR I+DN+++ +E LH+L++KRKG+ G+ +LKL MSKAYDRV+W +L +IM  LGF    IS
Subjt:  ---------------VLANRLKIVLNEIIDECQSTFIPGRSITDNMILGHETLHYLQHKRKGKIGYAALKLYMSKAYDRVEWPYLGQIMEKLGFHVHWIS

Query:  LVMKCVTTTTFSILINGESFGFIKPSRGIRQGDLSSPYLFLLCPEGLSALLV-SARLRSMVGVSIARSCPKISHLFFADDSLIFLKATVEKFGQFQAILM
        L+M+CV T +FS+L+NG   G I PSRG+RQGD  SPYLFLLC EGL +LL  +A    + G+ I R  P+I+HL FADDS+IF KA V    + Q++L 
Subjt:  LVMKCVTTTTFSILINGESFGFIKPSRGIRQGDLSSPYLFLLCPEGLSALLV-SARLRSMVGVSIARSCPKISHLFFADDSLIFLKATVEKFGQFQAILM

Query:  DYERAYGQCVNFSKLMVFFSRNVPNDSRQHLSNILSMRVNESLGSYLGLPSTFHRGKTRDFKFLLDKVWRRM----------------------------
         YERA GQC+N  K  + FS+NV +D ++ +  +      +    YLG P    R K + F  +  +VW+++                            
Subjt:  DYERAYGQCVNFSKLMVFFSRNVPNDSRQHLSNILSMRVNESLGSYLGLPSTFHRGKTRDFKFLLDKVWRRM----------------------------

Query:  --------------------------------HWHRLENLCKPKEVGGLNFRDLVNFNQAMLAKQAWRVLTNSNLTVSKVLCGRYFPSSSVLTGTVSASS
                                        HW R E LC  K  GG+ FRDL  FN A+LAKQ WR+L N +    KV   +YFP+S +    V A+S
Subjt:  --------------------------------HWHRLENLCKPKEVGGLNFRDLVNFNQAMLAKQAWRVLTNSNLTVSKVLCGRYFPSSSVLTGTVSASS

Query:  PFFWKGFVWGMDLLNCGLRKNLGNGLDVNVIK------CLPI------------------SSSTPDNWIWHYDGKGEYSVKSGYKLSMLKCQEAS--LSD
         + WKG    +D L  G R  +GNG  V + K      C  +                  S+   D+  W ++  G +SVKS Y+      Q A+   S 
Subjt:  PFFWKGFVWGMDLLNCGLRKNLGNGLDVNVIK------CLPI------------------SSSTPDNWIWHYDGKGEYSVKSGYKLSMLKCQEAS--LSD

Query:  VEKETTWWQRVWKMRVPSKVKIFIGKTFHNSIPTMVNLWNHHVPINENFSVCHEEMETTDHALFQCTRAREIPN---------QTIRCEW----------
           E  +W+ +W +++P K+K+F  +     +PT +NL   HV  +    +C++ ME T HALF C+  R +           QT    W          
Subjt:  VEKETTWWQRVWKMRVPSKVKIFIGKTFHNSIPTMVNLWNHHVPINENFSVCHEEMETTDHALFQCTRAREIPN---------QTIRCEW----------

Query:  ----------------------------------INYYLS---EFLKANPKGGSFVKTEEDILQLISDGEDIIIHTDASVMGTLSNCSIGIVMRDKQG-L
                                          +N  LS   E+ +     GS  K  + +       + + ++ D +     S   IG+V+RD+ G +
Subjt:  ----------------------------------INYYLS---EFLKANPKGGSFVKTEEDILQLISDGEDIIIHTDASVMGTLSNCSIGIVMRDKQG-L

Query:  LKAVQNLSSQVSNSPLGVEVVAVLEGMHLARSLNVQRLTVLSDSLTLINSINENIQGEACIAATLWDIKAIQNSF----VKHDMNKGNTCYPDLGASSHV
        + A   +  +VS++   +E VA+L G+ L     V ++ + +D L L+N++NEN      IA  L DI+ +   F    V H    GN     L   + +
Subjt:  LKAVQNLSSQVSNSPLGVEVVAVLEGMHLARSLNVQRLTVLSDSLTLINSINENIQGEACIAATLWDIKAIQNSF----VKHDMNKGNTCYPDLGASSHV

Query:  INDL
        I+D+
Subjt:  INDL

XP_042974784.1 uncharacterized protein LOC122306423 [Carya illinoinensis]9.8e-11431.87Show/hide
Query:  MNSVLSRAPRKVTDEMNQMLLAPYTREEVTSAIRSFHPTKAPGPDDYPALFYQKYWDVVGHTTMSNCLAILNS----DGL--------------------
        M  +LS    +VT EMN+ LL PY  EEV  A++  HP+KAPGP+    LF+QKYW V+G++  +  L+ LNS     GL                    
Subjt:  MNSVLSRAPRKVTDEMNQMLLAPYTREEVTSAIRSFHPTKAPGPDDYPALFYQKYWDVVGHTTMSNCLAILNS----DGL--------------------

Query:  ---------------VLANRLKIVLNEIIDECQSTFIPGRSITDNMILGHETLHYLQHKRKGKIGYAALKLYMSKAYDRVEWPYLGQIMEKLGFHVHWIS
                       V+ANRLK +L ++I + QS F+PGR I+DN+++ +E LH+L+ KRKG+ G+ +LKL MSKAYDRVEW +L +IME LGF    IS
Subjt:  ---------------VLANRLKIVLNEIIDECQSTFIPGRSITDNMILGHETLHYLQHKRKGKIGYAALKLYMSKAYDRVEWPYLGQIMEKLGFHVHWIS

Query:  LVMKCVTTTTFSILINGESFGFIKPSRGIRQGDLSSPYLFLLCPEGLSALLVSARLRSMV-GVSIARSCPKISHLFFADDSLIFLKATVEKFGQFQAILM
        LVM CV T +FS+L+NG   G I PSRG+RQGD  SPYLFLLC EGL +LL     R  V G+ I R  P+I+HL FADDS+ F KA V    + Q +L 
Subjt:  LVMKCVTTTTFSILINGESFGFIKPSRGIRQGDLSSPYLFLLCPEGLSALLVSARLRSMV-GVSIARSCPKISHLFFADDSLIFLKATVEKFGQFQAILM

Query:  DYERAYGQCVNFSKLMVFFSRNVPNDSRQHLSNILSMRVNESLGSYLGLPSTFHRGKTRDFKFLLDKVWRRM----------------------------
         YERA GQC+N  K  + FS+NV +D ++ +  +      +    YLGLP    R K + F  +  +VW+++                            
Subjt:  DYERAYGQCVNFSKLMVFFSRNVPNDSRQHLSNILSMRVNESLGSYLGLPSTFHRGKTRDFKFLLDKVWRRM----------------------------

Query:  --------------------------------HWHRLENLCKPKEVGGLNFRDLVNFNQAMLAKQAWRVLTNSNLTVSKVLCGRYFPSSSVLTGTVSASS
                                        HW   E LC  K  G + FRDL  FN A+LAKQ WR+L N +  + KV   +YFPSSS+      A+S
Subjt:  --------------------------------HWHRLENLCKPKEVGGLNFRDLVNFNQAMLAKQAWRVLTNSNLTVSKVLCGRYFPSSSVLTGTVSASS

Query:  PFFWKGFVWGMDLLNCGLRKNLGNGLDVNVIK---------------------------------------------------CLPISSSTPDNWIWHYD
         + WKG +  +D L  G R  +GNG  + + K                                                    L IS ++ D+  W ++
Subjt:  PFFWKGFVWGMDLLNCGLRKNLGNGLDVNVIK---------------------------------------------------CLPISSSTPDNWIWHYD

Query:  GKGEYSVKSGYKL-----SMLKCQEASLSDVEKETTWWQRVWKMRVPSKVKIFIGKTFHNSIPTMVNLWNHHVPINENFSVCHEEMETTDHALFQCTRAR
          G +SVKS Y+       +L  Q +S S    E  +W+ +W +++P K+KIF  +     +PT +NL   HV  +    +C++ ME   HAL       
Subjt:  GKGEYSVKSGYKL-----SMLKCQEASLSDVEKETTWWQRVWKMRVPSKVKIFIGKTFHNSIPTMVNLWNHHVPINENFSVCHEEMETTDHALFQCTRAR

Query:  EIP-------NQTIRCEWINYYLSEFLKANPKGGSFVK---TEEDILQLISDGEDIIIHTDASVMGTLSNCSIGIVMRDKQG-LLKAVQNLSSQVSNSPL
         I        N  +  E     +  F  +NPK    V+      D L+L  DG         +    LS   +G+V+R+ +G ++ A   +  +VS++  
Subjt:  EIP-------NQTIRCEWINYYLSEFLKANPKGGSFVK---TEEDILQLISDGEDIIIHTDASVMGTLSNCSIGIVMRDKQG-LLKAVQNLSSQVSNSPL

Query:  GVEVVAVLEGMHLARSLNVQRLTVLSDSLTLINSINENIQGEACIAATLWDIKAIQNSF----VKHDMNKGNTCYPDLGASSHVINDLSNFY-FVSEYQA
         +E VA+L G+ L     + +L + +D L L+N++N N +     A  L DI+ +  +F    V H  + GN     L   + +I+D+  ++ F   + +
Subjt:  GVEVVAVLEGMHLARSLNVQRLTVLSDSLTLINSINENIQGEACIAATLWDIKAIQNSF----VKHDMNKGNTCYPDLGASSHVINDLSNFY-FVSEYQA

Query:  NSEFTDEKEV
         + + D+ ++
Subjt:  NSEFTDEKEV

TrEMBL top hitse value%identityAlignment
A0A2N9EMZ0 Reverse transcriptase domain-containing protein5.3e-11335.61Show/hide
Query:  MNQMLLAPYTREEVTSAIRSFHPTKAPGPDDYPALFYQKYWDVVGHTTMSNCLAILNSDGL---------------------------------------
        MN  L++ +T  EV  A++   P KAPGPD  P +FYQKYW ++G    +  L  LNS  +                                       
Subjt:  MNQMLLAPYTREEVTSAIRSFHPTKAPGPDDYPALFYQKYWDVVGHTTMSNCLAILNSDGL---------------------------------------

Query:  VLANRLKIVLNEIIDECQSTFIPGRSITDNMILGHETLHYLQHKRKGKIGYAALKLYMSKAYDRVEWPYLGQIMEKLGFHVHWISLVMKCVTTTTFSILI
        VLANRLKI+L  I+ E QS FIPGR ITDN+++  ETLH++QH++ GK G  ALKL MSKAYDRVEW YL  +MEK+GFH  W++L+M+C++T ++SIL+
Subjt:  VLANRLKIVLNEIIDECQSTFIPGRSITDNMILGHETLHYLQHKRKGKIGYAALKLYMSKAYDRVEWPYLGQIMEKLGFHVHWISLVMKCVTTTTFSILI

Query:  NGESFGFIKPSRGIRQGDLSSPYLFLLCPEGLSALLVSARLR-SMVGVSIARSCPKISHLFFADDSLIFLKATVEKFGQFQAILMDYERAYGQCVNFSKL
        NGE  G+IKPSRG+RQGD  SPYLFLLC EGL +L+   ++  ++ GVSI+RS PKI+HLFFADDSL+F KAT +   + Q IL  YE+A GQ VN  K 
Subjt:  NGESFGFIKPSRGIRQGDLSSPYLFLLCPEGLSALLVSARLR-SMVGVSIARSCPKISHLFFADDSLIFLKATVEKFGQFQAILMDYERAYGQCVNFSKL

Query:  MVFFSRNVPNDSRQHLSNILSMRVNESLGSYLGLPSTFHRGKTRDFKFLLDKVWR---------------------------------------------
         +FFS++ P  ++  + N+L +   +    YLGLPS   R K   F  + ++VW                                              
Subjt:  MVFFSRNVPNDSRQHLSNILSMRVNESLGSYLGLPSTFHRGKTRDFKFLLDKVWR---------------------------------------------

Query:  ---------------RMHWHRLENLCKPKEVGGLNFRDLVNFNQAMLAKQAWRVLTNSNLTVSKVLCGRYFPSSSVLTGTVSASSPFFWKGFVWGMDLLN
                       +MHW     LCK K  GG+  RDL  FN+A+LAKQ WR+L N +   SKV   +YFP  S+L     +   + WK  +   DL+ 
Subjt:  ---------------RMHWHRLENLCKPKEVGGLNFRDLVNFNQAMLAKQAWRVLTNSNLTVSKVLCGRYFPSSSVLTGTVSASSPFFWKGFVWGMDLLN

Query:  CGLRKNLGNGL------------------------------------------------------DVNVIKCLPISSSTP-DNWIWHYDGKGEYSVKSGY
         G    +G G                                                       + +VI  +P+S   P D+ +W     G Y+V+SGY
Subjt:  CGLRKNLGNGL------------------------------------------------------DVNVIKCLPISSSTP-DNWIWHYDGKGEYSVKSGY

Query:  KLSMLKCQEA--SLSDVEKETTWWQRVWKMRVPSKVKIFIGKTFHNSIPTMVNLWNHHVPINENFSVCHEEMETTDHALFQCTRAREI
         L + +C +A  S SD  K T  W  +W + VP K++ F+ +  HNS+PT  NL + H+  + + S C  ++E+T HAL+QC   + +
Subjt:  KLSMLKCQEA--SLSDVEKETTWWQRVWKMRVPSKVKIFIGKTFHNSIPTMVNLWNHHVPINENFSVCHEEMETTDHALFQCTRAREI

A0A2N9FFZ2 Reverse transcriptase domain-containing protein2.8e-11435.84Show/hide
Query:  VTDEMNQMLLAPYTREEVTSAIRSFHPTKAPGPDDYPALFYQKYWDVVGHTTMSNCLAILNSDGL-----------------------------------
        VT EMN  L++ +T  EV  A++   P KAPGPD  P +FYQKYW ++G    +  L  LNS  +                                   
Subjt:  VTDEMNQMLLAPYTREEVTSAIRSFHPTKAPGPDDYPALFYQKYWDVVGHTTMSNCLAILNSDGL-----------------------------------

Query:  ----VLANRLKIVLNEIIDECQSTFIPGRSITDNMILGHETLHYLQHKRKGKIGYAALKLYMSKAYDRVEWPYLGQIMEKLGFHVHWISLVMKCVTTTTF
            VLANRLKI+L  I+ E QS FIPGR ITDN+++  ETLH++QH++ GK G  ALKL MSKAYDRVEW YL  +MEK+GFH  W++L+M+C++T ++
Subjt:  ----VLANRLKIVLNEIIDECQSTFIPGRSITDNMILGHETLHYLQHKRKGKIGYAALKLYMSKAYDRVEWPYLGQIMEKLGFHVHWISLVMKCVTTTTF

Query:  SILINGESFGFIKPSRGIRQGDLSSPYLFLLCPEGLSALLVSARLR-SMVGVSIARSCPKISHLFFADDSLIFLKATVEKFGQFQAILMDYERAYGQCVN
        SIL+NGE  G+IKPSRG+RQGD  SPYLFLLC EGL +L+   ++  ++ GVSI+RS PKI+HLFFADDSL+F KAT +   + Q IL  YE+A GQ VN
Subjt:  SILINGESFGFIKPSRGIRQGDLSSPYLFLLCPEGLSALLVSARLR-SMVGVSIARSCPKISHLFFADDSLIFLKATVEKFGQFQAILMDYERAYGQCVN

Query:  FSKLMVFFSRNVPNDSRQHLSNILSMRVNESLGSYLGLPSTFHRGKTRDFKFLLDKVWR-----------------------------------------
          K  +FFS++ P  ++  + N+L +   +    YLGLPS   R K   F  + ++VW                                          
Subjt:  FSKLMVFFSRNVPNDSRQHLSNILSMRVNESLGSYLGLPSTFHRGKTRDFKFLLDKVWR-----------------------------------------

Query:  -------------------RMHWHRLENLCKPKEVGGLNFRDLVNFNQAMLAKQAWRVLTNSNLTVSKVLCGRYFPSSSVLTGTVSASSPFFWKGFVWGM
                           +MHW     LCK K  GG+  RDL  FN+A+LAKQ WR+L N +   SKV   +YFP  S+L     +   + WK  +   
Subjt:  -------------------RMHWHRLENLCKPKEVGGLNFRDLVNFNQAMLAKQAWRVLTNSNLTVSKVLCGRYFPSSSVLTGTVSASSPFFWKGFVWGM

Query:  DLLNCGLRKNLGNGL------------------------------------------------------DVNVIKCLPISSSTP-DNWIWHYDGKGEYSV
        DL+  G    +G G                                                       + +VI  +P+S   P D+ +W     G Y+V
Subjt:  DLLNCGLRKNLGNGL------------------------------------------------------DVNVIKCLPISSSTP-DNWIWHYDGKGEYSV

Query:  KSGYKLSMLKC--QEASLSDVEKETTWWQRVWKMRVPSKVKIFIGKTFHNSIPTMVNLWNHHVPINENFSVCHEEMETTDHALFQCTRAREI
        +SGY L + +C   E S SD  K T  W  +W + VP K++ F+ +  HNS+PT  NL + H+  + + S C  ++E+T HAL+QC   + +
Subjt:  KSGYKLSMLKC--QEASLSDVEKETTWWQRVWKMRVPSKVKIFIGKTFHNSIPTMVNLWNHHVPINENFSVCHEEMETTDHALFQCTRAREI

A0A2N9HG19 Uncharacterized protein5.8e-11232.4Show/hide
Query:  MNSVLSRAPRKVTDEMNQMLLAPYTREEVTSAIRSFHPTKAPGPDDYPALFYQKYWDVVGHTTMSNCLAILNSDGL------------------------
        +  V+   P+ VT EMN +L   +T +EVT+A++   P KAPGPD  P LFYQ+YW ++G       L  LNS  L                        
Subjt:  MNSVLSRAPRKVTDEMNQMLLAPYTREEVTSAIRSFHPTKAPGPDDYPALFYQKYWDVVGHTTMSNCLAILNSDGL------------------------

Query:  ---------------VLANRLKIVLNEIIDECQSTFIPGRSITDNMILGHETLHYLQHKRKGKIGYAALKLYMSKAYDRVEWPYLGQIMEKLGFHVHWIS
                       VLANRLK +L +I+ E QS F+PGR ITDN+++  ETLH++ H+R GK G  A KL MSKAYDRVEW YL ++MEK+GFH  W++
Subjt:  ---------------VLANRLKIVLNEIIDECQSTFIPGRSITDNMILGHETLHYLQHKRKGKIGYAALKLYMSKAYDRVEWPYLGQIMEKLGFHVHWIS

Query:  LVMKCVTTTTFSILINGESFGFIKPSRGIRQGDLSSPYLFLLCPEGLSALLVSAR-LRSMVGVSIARSCPKISHLFFADDSLIFLKATVEKFGQFQAILM
        L+M+C+TT ++SIL+NGE  GFIKP+RG+RQGD  SPYLFL C EGL++L+  A+    + GVSI+R  PKI+HLFFADDSL+F KAT       Q IL 
Subjt:  LVMKCVTTTTFSILINGESFGFIKPSRGIRQGDLSSPYLFLLCPEGLSALLVSAR-LRSMVGVSIARSCPKISHLFFADDSLIFLKATVEKFGQFQAILM

Query:  DYERAYGQCVNFSKLMVFFSRNVPNDSRQHLSNILSMRVNESLGSYLGLPSTFHRGKTRDFKFLLDKVWR-----------------------RMHWHRL
        +YE A GQ +N  K  +FFS++ P   +  +  +L +   +    YLGLPS   R K   F  + ++VW                        +MHW   
Subjt:  DYERAYGQCVNFSKLMVFFSRNVPNDSRQHLSNILSMRVNESLGSYLGLPSTFHRGKTRDFKFLLDKVWR-----------------------RMHWHRL

Query:  ENLCKPKEVGGLNFRDLVNFNQAMLAKQAWRVLTNSNLTVSKVLCGRYFPSSSVLTGTVSASSPFFWKGFVWGMDLLNCGLRKNLGNGLDVNV--IKCLP
          LCK K  GGL  R+L  FN+A+LAKQ WR++ N++    KV   +YFP  S+L    +  S F WK  +   +L+  GL   +G G  V +   K LP
Subjt:  ENLCKPKEVGGLNFRDLVNFNQAMLAKQAWRVLTNSNLTVSKVLCGRYFPSSSVLTGTVSASSPFFWKGFVWGMDLLNCGLRKNLGNGLDVNV--IKCLP

Query:  I-----------------------------------------------------SSSTPDNWIWHYDGKGEYSVKSGYKLSMLKC--QEASLSDVEKETT
        +                                                     SS+  D   W     G Y+V+SGY L + +   ++   SD    T 
Subjt:  I-----------------------------------------------------SSSTPDNWIWHYDGKGEYSVKSGYKLSMLKC--QEASLSDVEKETT

Query:  WWQRVWKMRVPSKVKIFIGKTFHNSIPTMVNLWNHHV---PINENFSVCH----EEMETTDHALFQCTRAREIPNQTI------------RCEWINY---
         W ++W +++P K++ F+ +  H S+PT  NL + HV   P+  +F   H    E+       +  C +     N  +             C W N    
Subjt:  WWQRVWKMRVPSKVKIFIGKTFHNSIPTMVNLWNHHV---PINENFSVCH----EEMETTDHALFQCTRAREIPNQTI------------RCEWINY---

Query:  -----------YLSEFLKANPKGGSFVKTEEDILQL---ISDGEDIIIHTDASVMGTLSNCSIGIVMRDKQGLLKAVQNLSSQVSNSPLGVEVVAVLEGM
                    L+EFL+A        +  E  +++           I+ D ++    +   IG+++R+ +G   A    S    +S   +E  A     
Subjt:  -----------YLSEFLKANPKGGSFVKTEEDILQL---ISDGEDIIIHTDASVMGTLSNCSIGIVMRDKQGLLKAVQNLSSQVSNSPLGVEVVAVLEGM

Query:  HLARSLNVQRLTVLSDSLTLINSI
         LA  L +  + +  DS T++N++
Subjt:  HLARSLNVQRLTVLSDSLTLINSI

A0A2N9HYE3 Reverse transcriptase domain-containing protein1.6e-11435.84Show/hide
Query:  VTDEMNQMLLAPYTREEVTSAIRSFHPTKAPGPDDYPALFYQKYWDVVGHTTMSNCLAILNSDGL-----------------------------------
        VT EMN  L++ +T  EV  A++   P KAPGPD  P +FYQKYW ++G    +  L  LNS  +                                   
Subjt:  VTDEMNQMLLAPYTREEVTSAIRSFHPTKAPGPDDYPALFYQKYWDVVGHTTMSNCLAILNSDGL-----------------------------------

Query:  ----VLANRLKIVLNEIIDECQSTFIPGRSITDNMILGHETLHYLQHKRKGKIGYAALKLYMSKAYDRVEWPYLGQIMEKLGFHVHWISLVMKCVTTTTF
            VLANRLKI+L  I+ E QS FIPGR ITDN+++  ETLH++QH++ GK G  ALKL MSKAYDRVEW YL  +MEK+GFH  W++L+M+C++T ++
Subjt:  ----VLANRLKIVLNEIIDECQSTFIPGRSITDNMILGHETLHYLQHKRKGKIGYAALKLYMSKAYDRVEWPYLGQIMEKLGFHVHWISLVMKCVTTTTF

Query:  SILINGESFGFIKPSRGIRQGDLSSPYLFLLCPEGLSALLVSARLR-SMVGVSIARSCPKISHLFFADDSLIFLKATVEKFGQFQAILMDYERAYGQCVN
        SIL+NGE  G+IKPSRG+RQGD  SPYLFLLC EGL +L+   ++  ++ GVSI+RS PKI+HLFFADDSL+F KAT +   + Q IL  YE+A GQ VN
Subjt:  SILINGESFGFIKPSRGIRQGDLSSPYLFLLCPEGLSALLVSARLR-SMVGVSIARSCPKISHLFFADDSLIFLKATVEKFGQFQAILMDYERAYGQCVN

Query:  FSKLMVFFSRNVPNDSRQHLSNILSMRVNESLGSYLGLPSTFHRGKTRDFKFLLDKVWR-----------------------------------------
          K  +FFS++ P  ++  + N+L +   +    YLGLPS   R K   F  + ++VW                                          
Subjt:  FSKLMVFFSRNVPNDSRQHLSNILSMRVNESLGSYLGLPSTFHRGKTRDFKFLLDKVWR-----------------------------------------

Query:  -------------------RMHWHRLENLCKPKEVGGLNFRDLVNFNQAMLAKQAWRVLTNSNLTVSKVLCGRYFPSSSVLTGTVSASSPFFWKGFVWGM
                           +MHW     LCK K  GG+  RDL  FN+A+LAKQ WR+L N +   SKV   +YFP  S+L     +   + WK  +   
Subjt:  -------------------RMHWHRLENLCKPKEVGGLNFRDLVNFNQAMLAKQAWRVLTNSNLTVSKVLCGRYFPSSSVLTGTVSASSPFFWKGFVWGM

Query:  DLLNCGLRKNLGNGL------------------------------------------------------DVNVIKCLPISSSTP-DNWIWHYDGKGEYSV
        DL+  G    +G G                                                       + +VI  +P+S   P D+ +W     G Y+V
Subjt:  DLLNCGLRKNLGNGL------------------------------------------------------DVNVIKCLPISSSTP-DNWIWHYDGKGEYSV

Query:  KSGYKLSMLKCQEA--SLSDVEKETTWWQRVWKMRVPSKVKIFIGKTFHNSIPTMVNLWNHHVPINENFSVCHEEMETTDHALFQCTRAREI
        +SGY L + +C +A  S SD  K T  W  +W + VP K++ F+ +  HNS+PT  NL + H+  + + S C  ++E+T HAL+QC   + +
Subjt:  KSGYKLSMLKCQEA--SLSDVEKETTWWQRVWKMRVPSKVKIFIGKTFHNSIPTMVNLWNHHVPINENFSVCHEEMETTDHALFQCTRAREI

A0A6J1DX30 uncharacterized protein LOC1110248741.7e-13235.57Show/hide
Query:  MNSVLSRAPRKVTDEMNQMLLAPYTREEVTSAIRSFHPTKAPGPDDYPALFYQKYWDVVGHTTMSNCLAILNSDGLV-----------------------
        + ++++  P ++T E+N+ LLAPYT+EE+  AIR   PTKA GPD +PALFYQ YW VVG  T+  CL  LN+   +                       
Subjt:  MNSVLSRAPRKVTDEMNQMLLAPYTREEVTSAIRSFHPTKAPGPDDYPALFYQKYWDVVGHTTMSNCLAILNSDGLV-----------------------

Query:  ----------------LANRLKIVLNEIIDECQSTFIPGRSITDNMILGHETLHYLQHKRKGKIGYAALKLYMSKAYDRVEWPYLGQIMEKLGFHVHWIS
                        + NRLK V+  +I + QS F+P R+I+DN+I+GHE LH +   + G IG AALKL +SKA+DRVEW YL  IM K+GF+  WI 
Subjt:  ----------------LANRLKIVLNEIIDECQSTFIPGRSITDNMILGHETLHYLQHKRKGKIGYAALKLYMSKAYDRVEWPYLGQIMEKLGFHVHWIS

Query:  LVMKCVTTTTFSILINGESFGFIKPSRGIRQGDLSSPYLFLLCPEGLSALLVSARLRS-MVGVSIARSCPKISHLFFADDSLIFLKATVEKFGQFQAILM
         +++C++T  FSI +NG   G  +PSRGIRQGD  SPYLFLLC EGLSAL+        + G+    +   I+HL FADDSLIFL++   +    + +L 
Subjt:  LVMKCVTTTTFSILINGESFGFIKPSRGIRQGDLSSPYLFLLCPEGLSALLVSARLRS-MVGVSIARSCPKISHLFFADDSLIFLKATVEKFGQFQAILM

Query:  DYERAYGQCVNFSKLMVFFSRNVPNDSRQHLSNILSMRVNESLGSYLGLPSTFHRGKTRDFKFLLDKVWRRMHWHRLENLCKPKEVGGLNFRDLVNFNQA
         Y RA GQC+NFSK  + FS NV  + +Q+L  IL++++    G+YLGLPS F R +            R++HW +   +C PKE GGLNFRDL  FNQA
Subjt:  DYERAYGQCVNFSKLMVFFSRNVPNDSRQHLSNILSMRVNESLGSYLGLPSTFHRGKTRDFKFLLDKVWRRMHWHRLENLCKPKEVGGLNFRDLVNFNQA

Query:  MLAKQAWRVLTNSNLTVSKVLCGRYFPSSSVLTGTVSASSPFFWKGFVWGMDLLNCGLRKNLGNGL----------------------------------
        ++AK  WR L + NL VSKVL  +YF  +S+L  + ++ S +FWKGF+WG DLL  GLR  +GNG                                   
Subjt:  MLAKQAWRVLTNSNLTVSKVLCGRYFPSSSVLTGTVSASSPFFWKGFVWGMDLLNCGLRKNLGNGL----------------------------------

Query:  ------------------DVNVIKCLPISS-STPDNWIWHYDGKGEYSVKSGYKLSM-LKCQEASLSDVEKETTWWQRVWKMRVPSKVKIFIGKTFHNSI
                          D ++I  +PISS +  D+W+WHYD +G YSV+SGYKL M LKC   S S      T W  +WK+ VP+K+KIFI ++ H  I
Subjt:  ------------------DVNVIKCLPISS-STPDNWIWHYDGKGEYSVKSGYKLSM-LKCQEASLSDVEKETTWWQRVWKMRVPSKVKIFIGKTFHNSI

Query:  PTMVNLWNHHVPINENFSVCHEEMETTDHALFQCTRAREI-----PNQT---------------------------------------------------
        PT  NL    +      ++C +  E+  HA F C RAR+I     P  T                                                   
Subjt:  PTMVNLWNHHVPINENFSVCHEEMETTDHALFQCTRAREI-----PNQT---------------------------------------------------

Query:  --IRCEWINYYLSEFLKANPKGGSFVKTEEDILQLI-----SDGEDIIIHTDASVMGTLSNCSIGIVMRDKQGLLKAVQNLSSQVSNSPLGVEVVAVLEG
           +CEW+  +L    +A     S  +T+ +   ++     S    + ++TDA+  G  ++ S G ++RD    L A  ++      SPL  E+  +LEG
Subjt:  --IRCEWINYYLSEFLKANPKGGSFVKTEEDILQLI-----SDGEDIIIHTDASVMGTLSNCSIGIVMRDKQGLLKAVQNLSSQVSNSPLGVEVVAVLEG

Query:  MHLARSLNVQRLTVLSDSLTLINSINENIQGEACIAATLWDIKAIQNSF
        +  A + N   L V SDSL  I  I   I         + +I+A+   F
Subjt:  MHLARSLNVQRLTVLSDSLTLINSINENIQGEACIAATLWDIKAIQNSF

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein2.0e-1622.54Show/hide
Query:  KVTDEMNQMLLAPYTREEVTSAIRSFHPTKAPGPDDYPALFYQKYWDVV--------------------------------GHTTMS----NCLAILNSD
        ++  E  + L  P T  E+ + I S    K+PGPD + A FYQ+Y + +                                G  T        ++++N D
Subjt:  KVTDEMNQMLLAPYTREEVTSAIRSFHPTKAPGPDDYPALFYQKYWDVV--------------------------------GHTTMS----NCLAILNSD

Query:  ----GLVLANRLKIVLNEIIDECQSTFIPGRSITDNMILGHETLHYLQHKRKGK-IGYAALKLYMSKAYDRVEWPYLGQIMEKLGFHVHWISLVMKCVTT
              +LANR++  + ++I   Q  FIPG     N+    ++++ +QH  + K   +  + +   KA+D+++ P++ + + KLG    ++ ++      
Subjt:  ----GLVLANRLKIVLNEIIDECQSTFIPGRSITDNMILGHETLHYLQHKRKGK-IGYAALKLYMSKAYDRVEWPYLGQIMEKLGFHVHWISLVMKCVTT

Query:  TTFSILINGESFGFIKPSRGIRQGDLSSPYLFLLCPEGLSALLVSARLRSMVGVSIARSCPKISHLFFADDSLIFLKATVEKFGQFQAILMDYERAYGQC
         T +I++NG+         G RQG   SP LF +  E L+  +   + + + G+ + +   K+S   FADD +++L+  +        ++ ++ +  G  
Subjt:  TTFSILINGESFGFIKPSRGIRQGDLSSPYLFLLCPEGLSALLVSARLRSMVGVSIARSCPKISHLFFADDSLIFLKATVEKFGQFQAILMDYERAYGQC

Query:  VNFSKLMVFFSRNVPNDSRQHLSNI---LSMRVNESLGSYLGLPST
        +N  K   F    + N++RQ  S I   L   +      YLG+  T
Subjt:  VNFSKLMVFFSRNVPNDSRQHLSNI---LSMRVNESLGSYLGLPST

P11369 LINE-1 retrotransposable element ORF2 protein3.1e-1723.08Show/hide
Query:  KVTDEMNQMLLAPYTREEVTSAIRSFHPTKAPGPDDYPALFYQKYWD------------VVGHTTMSNC------------------------LAILNSD
        K+  +    L +P + +E+ + I S    K+PGPD + A FYQ + +            +    T+ N                         ++++N D
Subjt:  KVTDEMNQMLLAPYTREEVTSAIRSFHPTKAPGPDDYPALFYQKYWD------------VVGHTTMSNC------------------------LAILNSD

Query:  ----GLVLANRLKIVLNEIIDECQSTFIPGRSITDNMILGHETLHYLQHKRKGKIGYAALKLYMSKAYDRVEWPYLGQIMEKLGFHVHWISLVMKCVTTT
              +LANR++  +  II   Q  FIPG     N+      +HY+ +K K K  +  + L   KA+D+++ P++ +++E+ G    +++++    +  
Subjt:  ----GLVLANRLKIVLNEIIDECQSTFIPGRSITDNMILGHETLHYLQHKRKGKIGYAALKLYMSKAYDRVEWPYLGQIMEKLGFHVHWISLVMKCVTTT

Query:  TFSILINGESFGFIKPSRGIRQGDLSSPYLFLLCPEGLSALLVSARLRSMVGVSIARSCPKISHLFFADDSLIFLKATVEKFGQFQAILMDYERAYGQCV
          +I +NGE    I    G RQG   SPYLF +  E L+  +   + + + G+ I +   KIS L  ADD ++++        +   ++  +    G  +
Subjt:  TFSILINGESFGFIKPSRGIRQGDLSSPYLFLLCPEGLSALLVSARLRSMVGVSIARSCPKISHLFFADDSLIFLKATVEKFGQFQAILMDYERAYGQCV

Query:  NFSKLMVF-FSRNVPNDSRQHLSNILSMRVNESLGSYLGLPSTFHRGKTRDFKFLLDKVWRRMHWHRLENLCKPKEV
        N +K M F +++N   +     +   S+  N     YLG+        T++ K L DK ++ +     E+L + K++
Subjt:  NFSKLMVF-FSRNVPNDSRQHLSNILSMRVNESLGSYLGLPSTFHRGKTRDFKFLLDKVWRRMHWHRLENLCKPKEV

P14381 Transposon TX1 uncharacterized 149 kDa protein1.2e-1324.5Show/hide
Query:  VTDEMNQMLLAPYTREEVTSAIRSFHPTKAPGPDDYPALFYQKYWDVVG---HTTMSNC--------------------------------LAILNSDGL
        V++   + L  P T +E++ A+R     K+PG D     F+Q +WD +G   H  ++                                  +++L++D  
Subjt:  VTDEMNQMLLAPYTREEVTSAIRSFHPTKAPGPDDYPALFYQKYWDVVG---HTTMSNC--------------------------------LAILNSDGL

Query:  VLAN----RLKIVLNEIIDECQSTFIPGRSITDNMILGHETLHYLQHKRKGKIGYAALKLYMSKAYDRVEWPYLGQIMEKLGFHVHWISLVMKCVTTTTF
        ++A     RLK VL E+I   QS  +PGR+I DN+ L  + LH+    R+  +  A L L   KA+DRV+  YL   ++   F   ++  +     +   
Subjt:  VLAN----RLKIVLNEIIDECQSTFIPGRSITDNMILGHETLHYLQHKRKGKIGYAALKLYMSKAYDRVEWPYLGQIMEKLGFHVHWISLVMKCVTTTTF

Query:  SILINGESFGFIKPSRGIRQGDLSSPYLFLLCPEGLSALLVSARLRSMVGVSIARSCPKISHLFFADDSLIFLKATVEKFGQFQAILMDYERAYGQCVNF
         + IN      +   RG+RQG   S  L+ L  E    LL     + + G+ +     ++    +ADD ++  +  V+   + Q     Y  A    +N+
Subjt:  SILINGESFGFIKPSRGIRQGDLSSPYLFLLCPEGLSALLVSARLRSMVGVSIARSCPKISHLFFADDSLIFLKATVEKFGQFQAILMDYERAYGQCVNF

Query:  SK
        SK
Subjt:  SK

P92555 Uncharacterized mitochondrial protein AtMg012503.6e-1048.53Show/hide
Query:  LINGESFGFIKPSRGIRQGDLSSPYLFLLCPEGLSALLVSARLRS-MVGVSIARSCPKISHLFFADDS
        +ING   G + PSRG+RQGD  SPYLF+LC E LS L   A+ +  + G+ ++ + P+I+HL FADD+
Subjt:  LINGESFGFIKPSRGIRQGDLSSPYLFLLCPEGLSALLVSARLRS-MVGVSIARSCPKISHLFFADDS

P93295 Uncharacterized mitochondrial protein AtMg003102.2e-1538.94Show/hide
Query:  RRMHWHRLENLCKPKE-VGGLNFRDLVNFNQAMLAKQAWRVLTNSNLTVSKVLCGRYFPSSSVLTGTVSASSPFFWKGFVWGMDLLNCGLRKNLGNGLDV
        R++ W   + LCK KE  GGL FRDL  FNQA+LAKQ++R++   +  +S++L  RYFP SS++  +V     + W+  + G +LL+ GL + +G+G+  
Subjt:  RRMHWHRLENLCKPKE-VGGLNFRDLVNFNQAMLAKQAWRVLTNSNLTVSKVLCGRYFPSSSVLTGTVSASSPFFWKGFVWGMDLLNCGLRKNLGNGLDV

Query:  NVIKCLPISSSTP
         V     I   TP
Subjt:  NVIKCLPISSSTP

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein3.5e-0829.36Show/hide
Query:  SSTPDNWIWHYDGKGEYSVKSGYKL------SMLKCQEASLSDVEKETTWWQRVWKMRVPSKVKIFIGKTFHNSIPTMVNLWNHHVPINENFSVCHEEME
        S  PD  IW+Y+  GEY+V+SGY L      + +         ++ +T    R+W + +  K+K F+ +    ++ T   L    + I+ +   CH E E
Subjt:  SSTPDNWIWHYDGKGEYSVKSGYKL------SMLKCQEASLSDVEKETTWWQRVWKMRVPSKVKIFIGKTFHNSIPTMVNLWNHHVPINENFSVCHEEME

Query:  TTDHALFQC
        + +HALF C
Subjt:  TTDHALFQC

AT4G20520.1 RNA binding;RNA-directed DNA polymerases6.8e-1231.3Show/hide
Query:  LANRLKIVLNEIIDECQSTFIPGRSITDNMILGHETLHYLQHKRKGKIGYAALKLYMSKAYDRVEWPYLGQIMEKLGFHVHWISLVMKCVTTTTFSILIN
        +  RLK ++  +I   Q++FIPGR  TDN++   E +H ++ K KG  G+  LKL + KAYDR+ W YL   +   GF   W+  + +    +TF     
Subjt:  LANRLKIVLNEIIDECQSTFIPGRSITDNMILGHETLHYLQHKRKGKIGYAALKLYMSKAYDRVEWPYLGQIMEKLGFHVHWISLVMKCVTTTTFSILIN

Query:  GESFGFIKPSR---------GIRQGDLSSPY
            G    S+         G R  D+++P+
Subjt:  GESFGFIKPSR---------GIRQGDLSSPY

AT4G29090.1 Ribonuclease H-like superfamily protein8.0e-1335.14Show/hide
Query:  LLDKVWRR------MHWHRLENLCKPKEVGGLNFRDLVNFNQAMLAKQAWRVLTNSNLTVSKVLCGRYFPSSSVLTGTVSASSPFFWKGFVWGMDLLNCG
        L D  WR       MHW   ++L   K  GG+ F+D+  FN A+L KQ WR+L+     ++KV   RYF  S  L   + +   F WK      ++L  G
Subjt:  LLDKVWRR------MHWHRLENLCKPKEVGGLNFRDLVNFNQAMLAKQAWRVLTNSNLTVSKVLCGRYFPSSSVLTGTVSASSPFFWKGFVWGMDLLNCG

Query:  LRKNLGNGLDV
         R  +GNG D+
Subjt:  LRKNLGNGLDV

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.6e-1638.94Show/hide
Query:  RRMHWHRLENLCKPKE-VGGLNFRDLVNFNQAMLAKQAWRVLTNSNLTVSKVLCGRYFPSSSVLTGTVSASSPFFWKGFVWGMDLLNCGLRKNLGNGLDV
        R++ W   + LCK KE  GGL FRDL  FNQA+LAKQ++R++   +  +S++L  RYFP SS++  +V     + W+  + G +LL+ GL + +G+G+  
Subjt:  RRMHWHRLENLCKPKE-VGGLNFRDLVNFNQAMLAKQAWRVLTNSNLTVSKVLCGRYFPSSSVLTGTVSASSPFFWKGFVWGMDLLNCGLRKNLGNGLDV

Query:  NVIKCLPISSSTP
         V     I   TP
Subjt:  NVIKCLPISSSTP

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)2.6e-1148.53Show/hide
Query:  LINGESFGFIKPSRGIRQGDLSSPYLFLLCPEGLSALLVSARLRS-MVGVSIARSCPKISHLFFADDS
        +ING   G + PSRG+RQGD  SPYLF+LC E LS L   A+ +  + G+ ++ + P+I+HL FADD+
Subjt:  LINGESFGFIKPSRGIRQGDLSSPYLFLLCPEGLSALLVSARLRS-MVGVSIARSCPKISHLFFADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAGTGTGCTTAGCCGTGCGCCGCGGAAAGTCACTGATGAGATGAATCAGATGCTATTGGCCCCATATACCCGGGAGGAGGTCACGTCAGCCATCAGGAGCTTCCA
CCCGACTAAGGCCCCTGGCCCAGACGACTACCCTGCATTATTTTATCAGAAATATTGGGATGTCGTAGGGCACACGACGATGTCCAATTGTCTGGCTATTTTGAACTCGG
ATGGGCTAGTACTGGCTAATAGACTTAAGATTGTCCTGAATGAGATTATCGATGAGTGCCAATCAACTTTTATTCCTGGTAGATCAATAACTGATAATATGATTTTGGGG
CATGAAACTTTACATTACCTTCAACATAAGCGAAAAGGAAAGATTGGGTATGCGGCGCTCAAACTTTACATGAGCAAAGCATACGACAGAGTGGAGTGGCCTTACTTGGG
CCAAATTATGGAGAAATTGGGTTTCCATGTGCATTGGATTTCATTAGTAATGAAGTGTGTAACAACGACTACATTTTCCATTCTTATAAATGGGGAATCTTTTGGTTTTA
TCAAACCATCCCGTGGGATTAGACAAGGTGATCTCTCATCTCCTTACCTATTCTTACTTTGTCCGGAAGGTCTTTCTGCCCTATTGGTGTCAGCTAGATTAAGATCCATG
GTAGGTGTATCCATAGCAAGATCCTGTCCAAAAATCTCCCACCTATTCTTTGCGGATGACAGTCTGATCTTCCTTAAAGCTACGGTTGAGAAGTTTGGGCAATTTCAGGC
TATTCTGATGGATTATGAAAGGGCATATGGACAGTGTGTTAATTTTTCTAAATTGATGGTGTTTTTCTCGAGAAATGTTCCTAATGATTCAAGGCAGCATCTTAGTAATA
TTCTATCAATGAGGGTGAATGAATCATTGGGATCATACCTAGGCTTGCCATCAACCTTTCATAGAGGTAAAACTCGAGATTTCAAATTCCTTCTCGATAAAGTTTGGCGT
AGAATGCATTGGCACCGATTGGAGAATTTGTGTAAGCCAAAAGAGGTAGGTGGTTTAAATTTTCGGGATTTAGTCAATTTCAATCAGGCAATGCTGGCGAAACAAGCATG
GAGAGTATTAACTAATTCGAATCTCACGGTTTCTAAAGTTCTATGTGGAAGGTATTTTCCTTCGTCGTCAGTATTAACTGGTACAGTTTCTGCCTCATCTCCCTTCTTTT
GGAAAGGATTTGTTTGGGGGATGGATCTCCTTAATTGTGGCTTAAGGAAAAATTTAGGGAACGGGCTTGATGTGAATGTGATAAAATGTTTACCTATTAGTAGTTCGACA
CCGGATAATTGGATATGGCATTATGATGGTAAAGGAGAGTACTCTGTTAAAAGTGGATACAAGCTCTCTATGTTGAAGTGCCAAGAGGCCTCTTTGTCAGACGTGGAAAA
AGAGACTACTTGGTGGCAGAGGGTGTGGAAGATGAGAGTACCTAGCAAAGTGAAAATTTTCATCGGGAAAACTTTTCACAACTCCATCCCAACCATGGTAAACCTATGGA
ATCATCATGTACCAATAAACGAGAATTTTTCGGTTTGCCACGAGGAGATGGAAACTACAGACCATGCCCTTTTTCAGTGTACGAGGGCTCGAGAGATCCCTAATCAAACG
ATTAGATGTGAATGGATTAATTACTATCTATCAGAGTTCTTGAAGGCCAACCCAAAAGGTGGTTCTTTTGTTAAGACGGAGGAAGATATTCTTCAACTTATTTCAGACGG
CGAAGATATTATTATTCACACTGATGCATCTGTTATGGGAACACTGAGTAATTGCAGTATTGGAATTGTTATGCGAGATAAACAGGGGCTTCTTAAGGCAGTGCAGAATC
TATCTTCTCAGGTGAGCAACTCTCCTTTGGGAGTGGAAGTGGTAGCAGTGCTGGAAGGGATGCATCTGGCTAGGTCTTTGAATGTGCAACGTCTCACTGTTCTGTCTGAT
TCTTTGACTCTGATAAATTCAATTAACGAGAACATACAGGGGGAGGCTTGTATTGCTGCGACTCTCTGGGATATCAAAGCAATTCAAAACTCCTTTGTGAAGCATGACAT
GAACAAGGGGAATACGTGCTATCCAGATTTAGGAGCTTCTAGCCATGTCATCAACGATTTATCAAACTTCTACTTTGTTTCAGAATATCAAGCCAACTCGGAGTTCACGG
ATGAAAAGGAGGTGAAAAGACCTGAGAAGATCATGGCGAGCTCGAGTATTGGTGCCAAGGAGTCAAAAGGTTCGAGTCTCGACGCTAGGCTGAAGATGCACCCTTCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATAGTGTGCTTAGCCGTGCGCCGCGGAAAGTCACTGATGAGATGAATCAGATGCTATTGGCCCCATATACCCGGGAGGAGGTCACGTCAGCCATCAGGAGCTTCCA
CCCGACTAAGGCCCCTGGCCCAGACGACTACCCTGCATTATTTTATCAGAAATATTGGGATGTCGTAGGGCACACGACGATGTCCAATTGTCTGGCTATTTTGAACTCGG
ATGGGCTAGTACTGGCTAATAGACTTAAGATTGTCCTGAATGAGATTATCGATGAGTGCCAATCAACTTTTATTCCTGGTAGATCAATAACTGATAATATGATTTTGGGG
CATGAAACTTTACATTACCTTCAACATAAGCGAAAAGGAAAGATTGGGTATGCGGCGCTCAAACTTTACATGAGCAAAGCATACGACAGAGTGGAGTGGCCTTACTTGGG
CCAAATTATGGAGAAATTGGGTTTCCATGTGCATTGGATTTCATTAGTAATGAAGTGTGTAACAACGACTACATTTTCCATTCTTATAAATGGGGAATCTTTTGGTTTTA
TCAAACCATCCCGTGGGATTAGACAAGGTGATCTCTCATCTCCTTACCTATTCTTACTTTGTCCGGAAGGTCTTTCTGCCCTATTGGTGTCAGCTAGATTAAGATCCATG
GTAGGTGTATCCATAGCAAGATCCTGTCCAAAAATCTCCCACCTATTCTTTGCGGATGACAGTCTGATCTTCCTTAAAGCTACGGTTGAGAAGTTTGGGCAATTTCAGGC
TATTCTGATGGATTATGAAAGGGCATATGGACAGTGTGTTAATTTTTCTAAATTGATGGTGTTTTTCTCGAGAAATGTTCCTAATGATTCAAGGCAGCATCTTAGTAATA
TTCTATCAATGAGGGTGAATGAATCATTGGGATCATACCTAGGCTTGCCATCAACCTTTCATAGAGGTAAAACTCGAGATTTCAAATTCCTTCTCGATAAAGTTTGGCGT
AGAATGCATTGGCACCGATTGGAGAATTTGTGTAAGCCAAAAGAGGTAGGTGGTTTAAATTTTCGGGATTTAGTCAATTTCAATCAGGCAATGCTGGCGAAACAAGCATG
GAGAGTATTAACTAATTCGAATCTCACGGTTTCTAAAGTTCTATGTGGAAGGTATTTTCCTTCGTCGTCAGTATTAACTGGTACAGTTTCTGCCTCATCTCCCTTCTTTT
GGAAAGGATTTGTTTGGGGGATGGATCTCCTTAATTGTGGCTTAAGGAAAAATTTAGGGAACGGGCTTGATGTGAATGTGATAAAATGTTTACCTATTAGTAGTTCGACA
CCGGATAATTGGATATGGCATTATGATGGTAAAGGAGAGTACTCTGTTAAAAGTGGATACAAGCTCTCTATGTTGAAGTGCCAAGAGGCCTCTTTGTCAGACGTGGAAAA
AGAGACTACTTGGTGGCAGAGGGTGTGGAAGATGAGAGTACCTAGCAAAGTGAAAATTTTCATCGGGAAAACTTTTCACAACTCCATCCCAACCATGGTAAACCTATGGA
ATCATCATGTACCAATAAACGAGAATTTTTCGGTTTGCCACGAGGAGATGGAAACTACAGACCATGCCCTTTTTCAGTGTACGAGGGCTCGAGAGATCCCTAATCAAACG
ATTAGATGTGAATGGATTAATTACTATCTATCAGAGTTCTTGAAGGCCAACCCAAAAGGTGGTTCTTTTGTTAAGACGGAGGAAGATATTCTTCAACTTATTTCAGACGG
CGAAGATATTATTATTCACACTGATGCATCTGTTATGGGAACACTGAGTAATTGCAGTATTGGAATTGTTATGCGAGATAAACAGGGGCTTCTTAAGGCAGTGCAGAATC
TATCTTCTCAGGTGAGCAACTCTCCTTTGGGAGTGGAAGTGGTAGCAGTGCTGGAAGGGATGCATCTGGCTAGGTCTTTGAATGTGCAACGTCTCACTGTTCTGTCTGAT
TCTTTGACTCTGATAAATTCAATTAACGAGAACATACAGGGGGAGGCTTGTATTGCTGCGACTCTCTGGGATATCAAAGCAATTCAAAACTCCTTTGTGAAGCATGACAT
GAACAAGGGGAATACGTGCTATCCAGATTTAGGAGCTTCTAGCCATGTCATCAACGATTTATCAAACTTCTACTTTGTTTCAGAATATCAAGCCAACTCGGAGTTCACGG
ATGAAAAGGAGGTGAAAAGACCTGAGAAGATCATGGCGAGCTCGAGTATTGGTGCCAAGGAGTCAAAAGGTTCGAGTCTCGACGCTAGGCTGAAGATGCACCCTTCCTAA
Protein sequenceShow/hide protein sequence
MNSVLSRAPRKVTDEMNQMLLAPYTREEVTSAIRSFHPTKAPGPDDYPALFYQKYWDVVGHTTMSNCLAILNSDGLVLANRLKIVLNEIIDECQSTFIPGRSITDNMILG
HETLHYLQHKRKGKIGYAALKLYMSKAYDRVEWPYLGQIMEKLGFHVHWISLVMKCVTTTTFSILINGESFGFIKPSRGIRQGDLSSPYLFLLCPEGLSALLVSARLRSM
VGVSIARSCPKISHLFFADDSLIFLKATVEKFGQFQAILMDYERAYGQCVNFSKLMVFFSRNVPNDSRQHLSNILSMRVNESLGSYLGLPSTFHRGKTRDFKFLLDKVWR
RMHWHRLENLCKPKEVGGLNFRDLVNFNQAMLAKQAWRVLTNSNLTVSKVLCGRYFPSSSVLTGTVSASSPFFWKGFVWGMDLLNCGLRKNLGNGLDVNVIKCLPISSST
PDNWIWHYDGKGEYSVKSGYKLSMLKCQEASLSDVEKETTWWQRVWKMRVPSKVKIFIGKTFHNSIPTMVNLWNHHVPINENFSVCHEEMETTDHALFQCTRAREIPNQT
IRCEWINYYLSEFLKANPKGGSFVKTEEDILQLISDGEDIIIHTDASVMGTLSNCSIGIVMRDKQGLLKAVQNLSSQVSNSPLGVEVVAVLEGMHLARSLNVQRLTVLSD
SLTLINSINENIQGEACIAATLWDIKAIQNSFVKHDMNKGNTCYPDLGASSHVINDLSNFYFVSEYQANSEFTDEKEVKRPEKIMASSSIGAKESKGSSLDARLKMHPS