CuGenDBv2

Lag0039295 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0039295
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase Ty1/copia-type domain-containing protein
Genome location: chr2:40831249..40838017
RNA-Seq Expression: Lag0039295
Synteny: Lag0039295
Gene Ontology terms:
GO:0006259 - DNA metabolic process (biological process)
GO:0005488 - binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domains:
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAB2607536.1 hypothetical protein D8674_007253 [Pyrus ussuriensis x Pyrus communis] | e value: 8.1e-128 | identity: 34.1%
Query:  DWTYRPPAHVVTVSQINPLYEDWVVKDQALMTLINATLSPAALAYVVGCTSSKQVWEVLEKHYSSSSRTNIVNLKPDFQSITKKPEDVQIY---------
        D T   P   +    +NP  E+W  KDQ L+  +N+TLS   + + VG +SS+++W  LE+ +   S  NI  L+   QS+ K    + +Y         
Subjt:  DWTYRPPAHVVTVSQINPLYEDWVVKDQALMTLINATLSPAALAYVVGCTSSKQVWEVLEKHYSSSSRTNIVNLKPDFQSITKKPEDVQIY---------

Query:  -----------------TLNRLPSEFNTFRTSMRTRSQSVSFDELHVLLKAEE----------AVIEK------QSKHDDVVTQPAAMFASQPSS---NS
                         TL+ LP EF +F  S+  R  S S DELH L   +E          +V+E       QS+   + T  AA F+   S+   NS
Subjt:  -----------------TLNRLPSEFNTFRTSMRTRSQSVSFDELHVLLKAEE----------AVIEK------QSKHDDVVTQPAAMFASQPSS---NS

Query:  SQRPNSSGNFGGGRAPSH--GNFGRESNGRGGYGYSSGRGNGNSGRSSFSSPQSSDGQ-----------------GRVVTLIL-----------------
        ++  +S G F   R   +  GNF   +  RG   Y   R   +SG S F +P    G                  GR+    L                 
Subjt:  SQRPNSSGNFGGGRAPSH--GNFGRESNGRGGYGYSSGRGNGNSGRSSFSSPQSSDGQ-----------------GRVVTLIL-----------------

Query:  ---------------------------YLANRFRLYAH--------------LASEYAGDDQ-----------VSEKSTGRILFQGPSVNGLYPLSSFST
                                   Y+ N   L  H               A ++  D+            V ++ +GR L +G   +G YPL S S 
Subjt:  ---------------------------YLANRFRLYAH--------------LASEYAGDDQ-----------VSEKSTGRILFQGPSVNGLYPLSSFST

Query:  SVSPSCYVAHVAANKTYSLWHCRLGHPSHTILNSV-----LRVIGLNTC--FVSPC--------------------------------------------
        S SP+ + A ++      +WH RLGHPS +I   V     L V G +T   F S C                                            
Subjt:  SVSPSCYVAHVAANKTYSLWHCRLGHPSHTILNSV-----LRVIGLNTC--FVSPC--------------------------------------------

Query:  -------------------DSDGISPT-VAAPEPVS----------FTNSPAPSSPRVVLADASDPIPSPAAPETI-----------PNVATITNDLPST
                           DS  I P+   +  PVS          +T+ P P++     A +S P PS + P+ +           P  + ++  +P  
Subjt:  -------------------DSDGISPT-VAAPEPVS----------FTNSPAPSSPRVVLADASDPIPSPAAPETI-----------PNVATITNDLPST

Query:  MPLSD------------------ASSASTPLSSDPETEPTSYTAASKFPVWLTAMRAKFDALQKQGTWQLTPLPPGKSEIGCKWVYRVKRHPDGSIARYK
        +P+SD                   ++   PL    +  P++Y  ASK   W  AM+ +++AL   GTW L P    ++ +GCKWV+R+K   DG++ RYK
Subjt:  MPLSD------------------ASSASTPLSSDPETEPTSYTAASKFPVWLTAMRAKFDALQKQGTWQLTPLPPGKSEIGCKWVYRVKRHPDGSIARYK

Query:  TRLVAKGYHQQEGIDFDETFSPVVKKPTVFIILSLAAQYGWQLRQLEVKNGFLHGDLHEDVYMQQPQGFIDSSQPTYVCKLLKSLYGLKPTPRAWFECFT
         RLVAKG+HQ+EG+DF +TFSPV K  T+ I+L+LA Q+ W L QL++ N FLHGDLHEDVYMQQP GF D   P++VCKL KSLYGLK  PRAWF+   
Subjt:  TRLVAKGYHQQEGIDFDETFSPVVKKPTVFIILSLAAQYGWQLRQLEVKNGFLHGDLHEDVYMQQPQGFIDSSQPTYVCKLLKSLYGLKPTPRAWFECFT

Query:  SYLLTLGFHASQADSSFFVLKLHGAVLYLLLYVDDIVITSSNTSLINSLISKLQSRFDMTDLGHLTYFLGLEVHHSSSGIQISQTKYARDVLHRFGITNA
          L  LGF  S +D+S FVL     V+ +L+YVDDI++T  N++L +  I++L +RF + DLG L YFLGLEV  S+ GI + QTKY  ++L R  +  A
Subjt:  SYLLTLGFHASQADSSFFVLKLHGAVLYLLLYVDDIVITSSNTSLINSLISKLQSRFDMTDLGHLTYFLGLEVHHSSSGIQISQTKYARDVLHRFGITNA

Query:  KTCSTPISLKPSAEKGSPCSSSDIQSFRPLVGALHYLTFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLRYISGTISYGLFFRKGNSLRLTAFSDSDWA
        K C TP+  +     GS  S+    ++R LVG L YLT++RPD+AF V+ L QFM +P   H  AA RVLR++ G++S GL+F K +S+ L+A+SD+DWA
Subjt:  KTCSTPISLKPSAEKGSPCSSSDIQSFRPLVGALHYLTFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLRYISGTISYGLFFRKGNSLRLTAFSDSDWA

Query:  GCSIDRRSTTGFVLFLGSNPVSWGAKKQSIVSRSSTEAEYRALASTAAEL
        GC+ DR ST+G+ +FLGSN +SW AKKQS V+RSSTEAEYR+LA TAAEL
Subjt:  GCSIDRRSTTGFVLFLGSNPVSWGAKKQSIVSRSSTEAEYRALASTAAEL

KAB2613850.1 hypothetical protein D8674_036166 [Pyrus ussuriensis x Pyrus communis] | e value: 1.4e-127 | identity: 34.29%
Query:  DWTYRPPAHVVTVSQINPLYEDWVVKDQALMTLINATLSPAALAYVVGCTSSKQVWEVLEKHYSSSSRTNIVNLKPDFQSITKKPEDVQIY---------
        D T   PA ++    INP +E W  KDQ L+   N+TLS   +   VG +SS+ +W  LE+ +   S  ++  L+   QSI K    +  Y         
Subjt:  DWTYRPPAHVVTVSQINPLYEDWVVKDQALMTLINATLSPAALAYVVGCTSSKQVWEVLEKHYSSSSRTNIVNLKPDFQSITKKPEDVQIY---------

Query:  -----------------TLNRLPSEFNTFRTSMRTRSQSVSFDELHVLLKAEEAVIEKQSK-----------------HDDVVTQPAAMFASQPSS----
                         TL  LP EF +F  S+  R  S S DELH LL  +E  + ++ K                    ++  P  ++ +Q       
Subjt:  -----------------TLNRLPSEFNTFRTSMRTRSQSVSFDELHVLLKAEEAVIEKQSK-----------------HDDVVTQPAAMFASQPSS----

Query:  ---NSSQRPNSSGNFGGGRAPS-HGNFGRESNGRGG--YGYSSGRGN-----------GNS---------------------------------------
           NS++   + G+ G  R  S HG F R ++  GG   G S  RG+           GNS                                       
Subjt:  ---NSSQRPNSSGNFGGGRAPS-HGNFGRESNGRGG--YGYSSGRGN-----------GNS---------------------------------------

Query:  -------------------------------GRSSFSSPQSSDGQGRVVTLILYLANRFRLYAHLASEY------AGDDQVSEKSTGRILFQGPSVNGLY
                                       G S+  +P SS     V+ +     N    +  L   Y      +    V ++STG++L +G   +G Y
Subjt:  -------------------------------GRSSFSSPQSSDGQGRVVTLILYLANRFRLYAHLASEY------AGDDQVSEKSTGRILFQGPSVNGLY

Query:  PLSSFSTSVSPSCYV-----AHVAANKTYSLWHCRLGHPSHTILNSVL---------------------------RVIGL--------------------
        PL S ++S S +  +     A V+      +WH RLGHPS  I   V+                           R  GL                    
Subjt:  PLSSFSTSVSPSCYV-----AHVAANKTYSLWHCRLGHPSHTILNSVL---------------------------RVIGL--------------------

Query:  -------------------------------NTCFVSPCDSDGISPTVAAPEPVSF---TNSPAPSSPRVVLADASDPIPSPAA---PETIP--NVA-TI
                                       ++  + P  +  +SP  +    +SF   T+  A   P V     SD +PSPAA   P   P  +VA ++
Subjt:  -------------------------------NTCFVSPCDSDGISPTVAAPEPVSF---TNSPAPSSPRVVLADASDPIPSPAA---PETIP--NVA-TI

Query:  TNDLPSTMPLSDASSAST-----------PLSSDPETEPTSYTAASKFPVWLTAMRAKFDALQKQGTWQLTPLPPGKSEIGCKWVYRVKRHPDGSIARYK
        +  L +T P+   S A             PLS   ++ P++Y  ASKFP W  AM+ +++AL   GTW L P    ++ +GCKWV+RVK++PDG++ RYK
Subjt:  TNDLPSTMPLSDASSAST-----------PLSSDPETEPTSYTAASKFPVWLTAMRAKFDALQKQGTWQLTPLPPGKSEIGCKWVYRVKRHPDGSIARYK

Query:  TRLVAKGYHQQEGIDFDETFSPVVKKPTVFIILSLAAQYGWQLRQLEVKNGFLHGDLHEDVYMQQPQGFIDSSQPTYVCKLLKSLYGLKPTPRAWFECFT
         RLVAKG+HQQ+GIDF ETFSPV K  T+ I+LSLA Q+ W L QL++ N FLHGDL EDVYM QP GF+D   P +VCKL KSLYGLK  PRAWF+   
Subjt:  TRLVAKGYHQQEGIDFDETFSPVVKKPTVFIILSLAAQYGWQLRQLEVKNGFLHGDLHEDVYMQQPQGFIDSSQPTYVCKLLKSLYGLKPTPRAWFECFT

Query:  SYLLTLGFHASQADSSFFVLKLHGAVLYLLLYVDDIVITSSNTSLINSLISKLQSRFDMTDLGHLTYFLGLEVHHSSSGIQISQTKYARDVLHRFGITNA
          L  LGF  S +D+S FVL     V+ +L+YVDDI++T  ++ L +  I +L +RF + DLG L YFLGLEV  SS GI + QTKY  D+L R  +  A
Subjt:  SYLLTLGFHASQADSSFFVLKLHGAVLYLLLYVDDIVITSSNTSLINSLISKLQSRFDMTDLGHLTYFLGLEVHHSSSGIQISQTKYARDVLHRFGITNA

Query:  KTCSTPISLKPSAEKGSPCSSSDIQSFRPLVGALHYLTFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLRYISGTISYGLFFRKGNSLRLTAFSDSDWA
        K C TP+        GS   +     +R +VG L YLT++RPD+AF V+ L QFM +P   H  AAKRVLR++ G++S GL+F KG  + LTA+SD+DWA
Subjt:  KTCSTPISLKPSAEKGSPCSSSDIQSFRPLVGALHYLTFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLRYISGTISYGLFFRKGNSLRLTAFSDSDWA

Query:  GCSIDRRSTTGFVLFLGSNPVSWGAKKQSIVSRSSTEAEYRALASTAAEL
        G + DRRST+G+ +FLG N +SW AKKQS V+RSSTEAEYR+LA TAAEL
Subjt:  GCSIDRRSTTGFVLFLGSNPVSWGAKKQSIVSRSSTEAEYRALASTAAEL

RVW58434.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | e value: 2.8e-128 | identity: 38.07%
Query:  INPLYEDWVVKDQALMTLINATLSPAALAYVVGCTSSKQVWEVLEKHYSSSSRTNIVNLKPDFQSITK-----------------------KP---EDVQ
        +NP Y  W  KDQ +++ INATLS   LA V G TS+++VW  L   ++S SRT + +LK   Q++ +                       KP   +D+ 
Subjt:  INPLYEDWVVKDQALMTLINATLSPAALAYVVGCTSSKQVWEVLEKHYSSSSRTNIVNLKPDFQSITK-----------------------KP---EDVQ

Query:  IYTLNRLPSEFNTFRTSM--RTRSQSVSF-----DELHVLLKAEEAVIEKQSKHDDVVTQPAAMFASQPSSNSSQRPNSSGNFGGGRAP-------SHGN
         Y ++ L   FN F TS+   TR ++VS         H+    +  ++    +   +++    +  +   +N  + P     + G   P       +H N
Subjt:  IYTLNRLPSEFNTFRTSM--RTRSQSVSF-----DELHVLLKAEEAVIEKQSKHDDVVTQPAAMFASQPSSNSSQRPNSSGNFGGGRAP-------SHGN

Query:  FGRESNGRGGYGYSSGRGNGNSGRSSFSSPQS-------SDGQGRVVTLILYLANRFRLYAHLASEYAGDDQVSEKSTGRILFQGPSVNGLYPLSSFSTS
          +E         ++     N    +   P +        +GQG      L +A+      H  +  A  +   +  TG  L +G S  GLYP+   S S
Subjt:  FGRESNGRGGYGYSSGRGNGNSGRSSFSSPQS-------SDGQGRVVTLILYLANRFRLYAHLASEYAGDDQVSEKSTGRILFQGPSVNGLYPLSSFSTS

Query:  VSPSCYVAHVAANK-TYSLWHCRLGHPSHTILNSVLRVIGL-------NTCFVSPCD--------------SDGISPTVAAPEPVSFTNSPAPSS-----
        ++ S  ++ V   K + S+WH RLGH S  I++ +L    L          F  PC                D ++  V     V F  +  P+      
Subjt:  VSPSCYVAHVAANK-TYSLWHCRLGHPSHTILNSVLRVIGL-------NTCFVSPCD--------------SDGISPTVAAPEPVSFTNSPAPSS-----

Query:  ----PRVVLADASD-PIPSPAAPETIPNVATITNDLP------STMPLSDASSASTPLSSDPETEPTSYTAASKFPVWLTAMRAKFDALQKQGTWQLTPL
            P V   D+ D P+ S   P T P+++T+ + +P      +  PL   SS  TPL+      PTSY  A+  P W  AMR ++DAL    TW+L P 
Subjt:  ----PRVVLADASD-PIPSPAAPETIPNVATITNDLP------STMPLSDASSASTPLSSDPETEPTSYTAASKFPVWLTAMRAKFDALQKQGTWQLTPL

Query:  PPGKSEIGCKWVYRVKRHPDGSIARYKTRLVAKGYHQQEGIDFDETFSPVVKKPTVFIILSLAAQYGWQLRQLEVKNGFLHGDLHEDVYMQQPQGFIDSS
        P   + +G KWVY+VK+   G + R+K RLVA G+ Q+EGIDF ETFSPV+K  TV ++L+L+ Q+ W++RQL+V N FLHG L EDVYM+QP+GF++S 
Subjt:  PPGKSEIGCKWVYRVKRHPDGSIARYKTRLVAKGYHQQEGIDFDETFSPVVKKPTVFIILSLAAQYGWQLRQLEVKNGFLHGDLHEDVYMQQPQGFIDSS

Query:  QPTYVCKLLKSLYGLKPTPRAWFECFTSYLLTLGFHASQADSSFFVLKLHGAVLYLLLYVDDIVITSSNTSLINSLISKLQSRFDMTDLGHLTYFLGLEV
         P YVCKL KSLYGLK  PRAWF   +  LL  GF +S  D+S FV       L++L+YVDDI++T ++ SLI SLI KLQ+ F M DLG L YFLG++ 
Subjt:  QPTYVCKLLKSLYGLKPTPRAWFECFTSYLLTLGFHASQADSSFFVLKLHGAVLYLLLYVDDIVITSSNTSLINSLISKLQSRFDMTDLGHLTYFLGLEV

Query:  HHSSSGIQISQTKYARDVLHRFGITNAKTCSTPIS--LKPSAEKGSPCSSSDIQSFRPLVGALHYLTFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLR
           SSG+ + Q+KY  D+LHR  +  AK  S+P +  LK S   G P ++S I  +R  VGAL Y T +RPDIAF+V+ L Q M  P+++H  AAKRVLR
Subjt:  HHSSSGIQISQTKYARDVLHRFGITNAKTCSTPIS--LKPSAEKGSPCSSSDIQSFRPLVGALHYLTFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLR

Query:  YISGTISYGLFFRKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSWGAKKQSIVSRSSTEAEYRALASTAAELF
        Y+ GTI  GL++ KG    L AF DSDWAG   DRRSTTG+ +F GS  +SW AKKQS+V+RSSTEAEYRALA T AEL+
Subjt:  YISGTISYGLFFRKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSWGAKKQSIVSRSSTEAEYRALASTAAELF

TQD93593.1 hypothetical protein C1H46_020801 [Malus baccata] | e value: 1.5e-126 | identity: 56.04%
Query:  EPTSYTAASKFPVWLTAMRAKFDALQKQGTWQLTPLPPGKSEIGCKWVYRVKRHPDGSIARYKTRLVAKGYHQQEGIDFDETFSPVVKKPTVFIILSLAA
        EP+SY+AA K   W  AM+ + DAL +Q TW L PLPP K+ +GCKW+Y++K+HPDG++ARYK RLVAKG+ Q+ G+D+ ETFSPVVK  TV ++LSLAA
Subjt:  EPTSYTAASKFPVWLTAMRAKFDALQKQGTWQLTPLPPGKSEIGCKWVYRVKRHPDGSIARYKTRLVAKGYHQQEGIDFDETFSPVVKKPTVFIILSLAA

Query:  QYGWQLRQLEVKNGFLHGDLHEDVYMQQPQGFIDSSQPTYVCKLLKSLYGLKPTPRAWFECFTSYLLTLGFHASQADSSFFVLKLHGAVLYLLLYVDDIV
          GWQL QL+VKN FLHG L E+VYM QPQGF+D   PT+VCKL +SLYGLK  PRAW E FT +LLTLGF +S AD S FV   + +++ LLLYVDDI+
Subjt:  QYGWQLRQLEVKNGFLHGDLHEDVYMQQPQGFIDSSQPTYVCKLLKSLYGLKPTPRAWFECFTSYLLTLGFHASQADSSFFVLKLHGAVLYLLLYVDDIV

Query:  ITSSNTSLINSLISKLQSRFDMTDLGHLTYFLGLEVHHSSSGIQISQTKYARDVLHRFGITNAKTCSTPISLKPS-AEKGSPCSSSDIQSFRPLVGALHY
        +T ++ + ++S+I +L + FDM +LG L YFLGL++ + SSG+ + Q+KY  D+LH+  +   K C TP          GSP S SD   +R +VGAL Y
Subjt:  ITSSNTSLINSLISKLQSRFDMTDLGHLTYFLGLEVHHSSSGIQISQTKYARDVLHRFGITNAKTCSTPISLKPS-AEKGSPCSSSDIQSFRPLVGALHY

Query:  LTFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLRYISGTISYGLFFRKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSWGAKKQSIVSRSST
        LTF+RPDIA++V+ + QFM +P   H +A KR+LRY+ GT+  G+ F  G SL + A++D+DWAG   DRRSTTGFV+FLGSNP+SW +KKQ  VSRSST
Subjt:  LTFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLRYISGTISYGLFFRKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSWGAKKQSIVSRSST

Query:  EAEYRALASTAAEL
        EAEYRA+A+T AE+
Subjt:  EAEYRALASTAAEL

TQD93593.1 hypothetical protein C1H46_020801 [Malus baccata] | e value: 3.4e-09 | identity: 48.61%
Query:  YEDWVVKDQALMTLINATLSPAALAYVVGCTSSKQVWEVLEKHYSSSSRTNIVNLKPDFQSITKKPEDVQIY
        Y+ W + D+ALMTLI ATLS AAL+ V+GC SS+ +W  L++ +S+ +RT+IV +K D Q+I K  E + +Y
Subjt:  YEDWVVKDQALMTLINATLSPAALAYVVGCTSSKQVWEVLEKHYSSSSRTNIVNLKPDFQSITKKPEDVQIY

TQE03277.1 hypothetical protein C1H46_011089 [Malus baccata] | e value: 3.1e-127 | identity: 35.78%
Query:  SQI-NPLYEDWVVKDQALMTLINATLSPAALAYVVGCTSSKQVWEVLEKHYSSSSRTNIVNLKPDFQSITK--------------------------KPE
        SQI N  Y  W + D+ALM LI ATLSPAA++  +G +S+  +W  L++H+S+ SRT++  +K + Q+I K                            E
Subjt:  SQI-NPLYEDWVVKDQALMTLINATLSPAALAYVVGCTSSKQVWEVLEKHYSSSSRTNIVNLKPDFQSITK--------------------------KPE

Query:  DVQIYTLNRLPSEFNTFRTSMRTRSQSVSFDELHVLLKAEEAVIEKQSK---------HDDVVTQPAAMFASQPSSNSSQRPNSSGNFGGGRAPSHGNFG
        D+ I  LN LP E+NTFR  +R R   +S  +    L AEE ++E             + + +++P +  +    S+    P SS N  G     +G F 
Subjt:  DVQIYTLNRLPSEFNTFRTSMRTRSQSVSFDELHVLLKAEEAVIEKQSK---------HDDVVTQPAAMFASQPSSNSSQRPNSSGNFGGGRAPSHGNFG

Query:  RESNGRGGYGYSSGRGNGNSGRSSFSSPQSSDGQGRVVT---------------LILYLANRF---RLYAHLASEYAGDDQVSEKSTGRILFQGPSVNGL
        + +N R       G+G  N G    +  Q    Q  V T               +   L N +     + H  +      Q+  K+     F   + NGL
Subjt:  RESNGRGGYGYSSGRGNGNSGRSSFSSPQSSDGQGRVVT---------------LILYLANRF---RLYAHLASEYAGDDQVSEKSTGRILFQGPSVNGL

Query:  YPLSSFSTSVSPSCYVAHVA------------------------ANKTYSLWHCRLGHPSHTI-------------LNSVLRVIGLNTCFVSPCDSD---
          +     S++P  Y AH A                        +  +  LW    G  +H                N  ++      C  S   SD   
Subjt:  YPLSSFSTSVSPSCYVAHVA------------------------ANKTYSLWHCRLGHPSHTI-------------LNSVLRVIGLNTCFVSPCDSD---

Query:  ------GISPTVAAPEPVSFTNSPA-----PSSPRVVLADASDPIPSPAAPETIPN-------VATITNDLPSTMPLSDASSASTPLSSDPET-------
                 P +  P+P+S  +S        SSP  +   AS     P   E  PN       +A++      T   +        LSS   +       
Subjt:  ------GISPTVAAPEPVSFTNSPA-----PSSPRVVLADASDPIPSPAAPETIPN-------VATITNDLPSTMPLSDASSASTPLSSDPET-------

Query:  -EPTSYTAASKFPVWLTAMRAKFDALQKQGTWQLTPLPPGKSEIGCKWVYRVKRHPDGSIARYKTRLVAKGYHQQEGIDFDETFSPVVKKPTVFIILSLA
         EP +Y +A K PVW  AM+ + +AL  Q TW L PL   K+ +GCKW++++K++ DGSI+R+K RLVAKG+ Q+ G+D+ ETFSPV+K  T+ +IL+LA
Subjt:  -EPTSYTAASKFPVWLTAMRAKFDALQKQGTWQLTPLPPGKSEIGCKWVYRVKRHPDGSIARYKTRLVAKGYHQQEGIDFDETFSPVVKKPTVFIILSLA

Query:  AQYGWQLRQLEVKNGFLHGDLHEDVYMQQPQGFIDSSQPTYVCKLLKSLYGLKPTPRAWFECFTSYLLTLGFHASQADSSFFVLKLHGAVLYLLLYVDDI
        A + W LRQL+VKN FLHG LHE+VYM QP GF D + P +VCKL KSLYGLK  PRAW E FT++L +LGF ++ ADSS FV  +   ++ LLLYVDDI
Subjt:  AQYGWQLRQLEVKNGFLHGDLHEDVYMQQPQGFIDSSQPTYVCKLLKSLYGLKPTPRAWFECFTSYLLTLGFHASQADSSFFVLKLHGAVLYLLLYVDDI

Query:  VITSSNTSLINSLISKLQSRFDMTDLGHLTYFLGLEVHHSSSGIQISQTKYARDVLHRFGITNAKTCSTPISLKPSAEKGSPCSSSDIQSFRPLVGALHY
        ++T + +S+I  +I  L   FD+ DLG+L YFLG+++    +   +SQTKY  ++L +  + + K C TP        K      ++  ++R +VGAL Y
Subjt:  VITSSNTSLINSLISKLQSRFDMTDLGHLTYFLGLEVHHSSSGIQISQTKYARDVLHRFGITNAKTCSTPISLKPSAEKGSPCSSSDIQSFRPLVGALHY

Query:  LTFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLRYISGTISYGLFFRKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSWGAKKQSIVSRSST
        LTF+RPDIAF+V  + QFMQ+P   H  A KR+LRY+ GT+++G+ +   + L+L AFSD+DWAG   DRRSTTG V+FLG NP+SW ++KQ+IVSRSST
Subjt:  LTFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLRYISGTISYGLFFRKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSWGAKKQSIVSRSST

Query:  EAEYRALASTAAEL
        EAEYRA++ T+AEL
Subjt:  EAEYRALASTAAEL

TrEMBL top hits (e value, % identity, alignment)
A0A2N9E3Q2 Reverse transcriptase Ty1/copia-type domain-containing protein | e value: 5.4e-154 | identity: 36.64%
Query:  QINPLYEDWVVKDQALMTLINATLSPAALAYVVGCTSSKQVWEVLEKHYSSSSRTNIVNLKPDFQSITKKPEDVQIY-----------------------
        ++NPLY  W  +D+AL +LI++TLSP+A++ V+G TS+  +W++L   Y+S SR+NIVNLK +  SI K  + V  Y                       
Subjt:  QINPLYEDWVVKDQALMTLINATLSPAALAYVVGCTSSKQVWEVLEKHYSSSSRTNIVNLKPDFQSITKKPEDVQIY-----------------------

Query:  ---TLNRLPSEFNTFRTSMRTRSQSVSFDELHVLLKAEEAVIEKQSKHDDVVTQPA--AMFASQPSSNSSQRPNSSGNFGGGRAPSHGNFGRESNGRGGY
            L  LPSEF++F ++M T++++V F+ELH L+K EE ++   + +   +   A  A  +SQP+ N+S    S+  FGG R    G    ++ GRGGY
Subjt:  ---TLNRLPSEFNTFRTSMRTRSQSVSFDELHVLLKAEEAVIEKQSKHDDVVTQPA--AMFASQPSSNSSQRPNSSGNFGGGRAPSHGNFGRESNGRGGY

Query:  G-----YSSGRGNGNSGRS-----SFS-------SPQ--SSDGQGRVVTLILYLANRFRL------------------------------YAH-------
        G      SSGRG G+S  S     +FS       +PQ  SS+   R +  I Y      L                              Y H       
Subjt:  G-----YSSGRGNGNSGRS-----SFS-------SPQ--SSDGQGRVVTLILYLANRFRL------------------------------YAH-------

Query:  ---------------------------------LASEYAGDDQVSEKS---------------------------------------------TGRILFQ
                                         L   ++G+ Q+   S                                             TG+ L++
Subjt:  ---------------------------------LASEYAGDDQVSEKS---------------------------------------------TGRILFQ

Query:  GPSVNGLYPLSSFS-------------TSVSPSCYVAHVAANKTYS----------LWHCRLGHPSHTILNSVL-------RVIGLNTCFVSPCDSD---
        G S +GLYP++  S             +S SP    A + A+ T S          LWH RLGHP   +L+ VL       ++   NT F+ P  S    
Subjt:  GPSVNGLYPLSSFS-------------TSVSPSCYVAHVAANKTYS----------LWHCRLGHPSHTILNSVL-------RVIGLNTCFVSPCDSD---

Query:  ----GISPTVAAPEPVSFT-----------NSPAPSSPRVVLADASDPIP----------SPA---------------------------------APET
              +P++  P P++F            N P P+S    +    +PIP          +PA                                 AP +
Subjt:  ----GISPTVAAPEPVSFT-----------NSPAPSSPRVVLADASDPIP----------SPA---------------------------------APET

Query:  IPNVATITNDLPSTMPLSDASSASTPLSSDP------------------------ETEPTSYTAASKFPVWLTAMRAKFDALQKQGTWQLTPLPPGKSEI
         PN  + T   P    L++  + +  LS+ P                        +TEP +YT ASKFP W  AM ++F ALQ+Q TWQL P  P ++ +
Subjt:  IPNVATITNDLPSTMPLSDASSASTPLSSDP------------------------ETEPTSYTAASKFPVWLTAMRAKFDALQKQGTWQLTPLPPGKSEI

Query:  GCKWVYRVKRHPDGSIARYKTRLVAKGYHQQEGIDFDETFSPVVKKPTVFIILSLAAQYGWQLRQLEVKNGFLHGDLHEDVYMQQPQGFIDSSQPTYVCK
        GC+WVY++KR+ DGS++RYK RLVAKG+HQQ G+DF ETFSPVVK PTV IILSLA Q  W LRQL+V N FLHG L E V+M QP GF++SS P++VC 
Subjt:  GCKWVYRVKRHPDGSIARYKTRLVAKGYHQQEGIDFDETFSPVVKKPTVFIILSLAAQYGWQLRQLEVKNGFLHGDLHEDVYMQQPQGFIDSSQPTYVCK

Query:  LLKSLYGLKPTPRAWFECFTSYLLTLGFHASQADSSFFVLKLHGAVLYLLLYVDDIVITSSNTSLINSLISKLQSRFDMTDLGHLTYFLGLEVHHSSSGI
        L KSLYGLK  PRAWFE FTS+LLTLGF AS AD+S F+L      +YLLLYVDDI+IT ++ S ++ +IS+L + F+  DLG L YFLGL++ +   G 
Subjt:  LLKSLYGLKPTPRAWFECFTSYLLTLGFHASQADSSFFVLKLHGAVLYLLLYVDDIVITSSNTSLINSLISKLQSRFDMTDLGHLTYFLGLEVHHSSSGI

Query:  QISQTKYARDVLHRFGITNAKTCSTPISLKPSAEKGSPCSSSDIQSFRPLVGALHYLTFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLRYISGTISYG
         + Q KY  D+LH+F +T+ +  STPI+  P     S    SD   +R LVGAL Y TF+RPDI F V+ + QFM  PS  H VAAKR+LRY+ GT+  G
Subjt:  QISQTKYARDVLHRFGITNAKTCSTPISLKPSAEKGSPCSSSDIQSFRPLVGALHYLTFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLRYISGTISYG

Query:  LFFRKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSWGAKKQSIVSRSSTEAEYRALASTAAEL
        + F+ G    LTAF+D+DWA    DRRST+G ++FLG+NP++W AKKQ  VSRSSTEAEYR+LAS AAEL
Subjt:  LFFRKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSWGAKKQSIVSRSSTEAEYRALASTAAEL

A0A2N9EN11 Reverse transcriptase Ty1/copia-type domain-containing protein | e value: 8.7e-152 | identity: 38.78%
Query:  QINPLYEDWVVKDQALMTLINATLSPAALAYVVGCTSSKQVWEVLEKHYSSSSRTNIVNLKPDFQSITKKPEDVQIY-----------------------
        +I+P++  W  +D+AL +LI+ATLSP+A++ V+G T++  +W+++   Y+S SR++IVNLK +  SI K  + V  Y                       
Subjt:  QINPLYEDWVVKDQALMTLINATLSPAALAYVVGCTSSKQVWEVLEKHYSSSSRTNIVNLKPDFQSITKKPEDVQIY-----------------------

Query:  ---TLNRLPSEFNTFRTSMRTRSQSVSFDELHVLLKAEEAVIEKQSKHDDVVTQPAAMFASQPSSNSSQRPNSSGNFGGGRAPSHGNFGRESNGRGGYG-
            L  L S+F++F ++M T+++ VSF+ELH L+K EE +++  + +   +T  A   A+  SS+S+   +S+  F        G  G  +N  GG   
Subjt:  ---TLNRLPSEFNTFRTSMRTRSQSVSFDELHVLLKAEEAVIEKQSKHDDVVTQPAAMFASQPSSNSSQRPNSSGNFGGGRAPSHGNFGRESNGRGGYG-

Query:  YSSGRGNGNSGRSSFS---SPQSSDG----------------------QGR--------VVTLILYLANR------------------------------
        YS GRGN NS ++  S   +P S                         QGR        + T + +  N+                              
Subjt:  YSSGRGNGNSGRSSFS---SPQSSDG----------------------QGR--------VVTLILYLANR------------------------------

Query:  --------------------------FRLYAHL-----------ASEYAGDD-----------QVSEKSTGRILFQGPSVNGLYP------------LSS
                                  FRL   L              +  D+           Q+ +  TG+ L++G S +GLYP            LSS
Subjt:  --------------------------FRLYAHL-----------ASEYAGDD-----------QVSEKSTGRILFQGPSVNGLYP------------LSS

Query:  FSTSVS------PSCYVAHVAA---NKTYS-LWHCRLGHPSHTILNSVLRVIGLNTCFVSPCDSDGISPTVAAPEPVSFTN---SPAPSSPRVVLADASD
         S++ S      P+ Y+A + +   N + S LWH RLGHP       +       T   +P      SP    PEP + T     P PS    +L     
Subjt:  FSTSVS------PSCYVAHVAA---NKTYS-LWHCRLGHPSHTILNSVLRVIGLNTCFVSPCDSDGISPTVAAPEPVSFTN---SPAPSSPRVVLADASD

Query:  P-IP-SPAAPETIPNVATITNDLPSTMP--------LSDASSASTPLSSD-PETEPTSYTAASKFPVWLTAMRAKFDALQKQGTWQLTPLPPGKSEIGCK
        P IP SP A   +P +  I N   S+ P        +S     ST  S D  +TEP +YT ASK P W   M ++F+ALQ+Q TW L P  P ++ +GC+
Subjt:  P-IP-SPAAPETIPNVATITNDLPSTMP--------LSDASSASTPLSSD-PETEPTSYTAASKFPVWLTAMRAKFDALQKQGTWQLTPLPPGKSEIGCK

Query:  WVYRVKRHPDGSIARYKTRLVAKGYHQQEGIDFDETFSPVVKKPTVFIILSLAAQYGWQLRQLEVKNGFLHGDLHEDVYMQQPQGFIDSSQPTYVCKLLK
        WVY++KR  DGS++RYK RLVAKG+HQQ G+D+ ETFSPVVK PTV IILSLAAQ  W LRQL+V N FLHG L E VYM QP GF+D  QP++VC L K
Subjt:  WVYRVKRHPDGSIARYKTRLVAKGYHQQEGIDFDETFSPVVKKPTVFIILSLAAQYGWQLRQLEVKNGFLHGDLHEDVYMQQPQGFIDSSQPTYVCKLLK

Query:  SLYGLKPTPRAWFECFTSYLLTLGFHASQADSSFFVLKLHGAVLYLLLYVDDIVITSSNTSLINSLISKLQSRFDMTDLGHLTYFLGLEVHHSSSGIQIS
        SLYGLK  PRAWFE FTS+LLTLGF AS AD+S FVLK    ++YLLLYVDDI+IT S++++++++IS+L + F++ DLG L YFLGL++ + ++G  + 
Subjt:  SLYGLKPTPRAWFECFTSYLLTLGFHASQADSSFFVLKLHGAVLYLLLYVDDIVITSSNTSLINSLISKLQSRFDMTDLGHLTYFLGLEVHHSSSGIQIS

Query:  QTKYARDVLHRFGITNAKTCSTPISLKPSAEKGSPCSSSDIQSFRPLVGALHYLTFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLRYISGTISYGLFF
        Q+KY  D+L +F +++ K  STPI+  P        S +D   +R LVGAL Y TF+RPDI F V+ + QFM +PS  HLVAAKR+LRY+ G++  G+ F
Subjt:  QTKYARDVLHRFGITNAKTCSTPISLKPSAEKGSPCSSSDIQSFRPLVGALHYLTFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLRYISGTISYGLFF

Query:  RKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSWGAKKQSIVSRSSTEAEYRALASTAAEL
        + G  L LTAF+D+DWAG  +DRRST+G  +FLG+NP++W +KKQ  VSRSSTEAEYR+LA+ A EL
Subjt:  RKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSWGAKKQSIVSRSSTEAEYRALASTAAEL

A0A2N9FGX9 Reverse transcriptase Ty1/copia-type domain-containing protein | e value: 6.4e-155 | identity: 42.27%
Query:  QINPLYEDWVVKDQALMTLINATLSPAALAYVVGCTSSKQVWEVLEKHYSSSSRTNIVNLKPDFQSITKKPEDVQIY-----------------------
        Q+NP Y  W  KDQAL+++I+ATLS +AL+ V+G  S+K VW+ LE+ Y+S SR+N++ LK D  +I K  + + +Y                       
Subjt:  QINPLYEDWVVKDQALMTLINATLSPAALAYVVGCTSSKQVWEVLEKHYSSSSRTNIVNLKPDFQSITKKPEDVQIY-----------------------

Query:  ---TLNRLPSEFNTFRTSMRTRSQSVSFDELHVLLKAEEAVIEKQSKHDDVVTQPAAMFASQPSSNSSQRPNSSGNFGGGRAPSHGNFGRESNGRGGYGY
            L+ LP  F  F ++MRTRS +VSF++L VLL AEE  ++  +   +   +P+ +   +  S +       G       P H           G+  
Subjt:  ---TLNRLPSEFNTFRTSMRTRSQSVSFDELHVLLKAEEAVIEKQSKHDDVVTQPAAMFASQPSSNSSQRPNSSGNFGGGRAPSHGNFGRESNGRGGYGY

Query:  SSGRGNGNSGRSSFSSPQSSDGQGRVVTLILYLANRFRLYAHLASEYAGDDQVSEKSTGRILFQGPSVNGLYPLSSFSTSVSPSCYVAHVAANKTYSLWH
           + +     ++   P    G   V  +     +    +   AS+++    + +  +G++L++G               +S +  V   A+N + S WH
Subjt:  SSGRGNGNSGRSSFSSPQSSDGQGRVVTLILYLANRFRLYAHLASEYAGDDQVSEKSTGRILFQGPSVNGLYPLSSFSTSVSPSCYVAHVAANKTYSLWH

Query:  CRLG-HPSH-TILNSVLRVIGLNTCFVSPCDSDGISPTVAAPEPVSFTNSPAPSSPRVVLADASDPIPSPAAPETIPNVATITNDLPSTMPLSDASSAST
         RLG  P+H +  +S  RV+    C  S      ++  +  PEP+  T+ P PS+       +S  +P P  P  +P+++T TN  P    L    +   
Subjt:  CRLG-HPSH-TILNSVLRVIGLNTCFVSPCDSDGISPTVAAPEPVSFTNSPAPSSPRVVLADASDPIPSPAAPETIPNVATITNDLPSTMPLSDASSAST

Query:  PLSSDP----ETEPTSYTAASKFPVWLTAMRAKFDALQKQGTWQLTPLPPGKSEIGCKWVYRVKRHPDGSIARYKTRLVAKGYHQQEGIDFDETFSPVVK
           +      +TEP S+T AS    W+TAM  +F AL +Q TW L P  P ++ +GCKWVY++KR+ DG+++RYK RLVAKG+HQQ GID+DETFSPVVK
Subjt:  PLSSDP----ETEPTSYTAASKFPVWLTAMRAKFDALQKQGTWQLTPLPPGKSEIGCKWVYRVKRHPDGSIARYKTRLVAKGYHQQEGIDFDETFSPVVK

Query:  KPTVFIILSLAAQYGWQLRQLEVKNGFLHGDLHEDVYMQQPQGFIDSSQPTYVCKLLKSLYGLKPTPRAWFECFTSYLLTLGFHASQADSSFFVLKLHGA
          TV IIL+LAAQ+ W LRQL++ N FLHG L EDV+M QPQGF+D ++P YVCKL KSLYGLK  PRAWF+ FT++LL+LGF AS AD S FV    GA
Subjt:  KPTVFIILSLAAQYGWQLRQLEVKNGFLHGDLHEDVYMQQPQGFIDSSQPTYVCKLLKSLYGLKPTPRAWFECFTSYLLTLGFHASQADSSFFVLKLHGA

Query:  VLYLLLYVDDIVITSSNTSLINSLISKLQSRFDMTDLGHLTYFLGLEVHHSSSGIQISQTKYARDVLHRFGITNAKTCSTPI--SLKPSAEKGSPCSSSD
        +LYLLLYVDDI++T ++ +LI  LIS LQS F++ DLG L YFLGL++ + +SG  + QTKYA D+L RF +T+ K  STP   S + +  +G P   SD
Subjt:  VLYLLLYVDDIVITSSNTSLINSLISKLQSRFDMTDLGHLTYFLGLEVHHSSSGIQISQTKYARDVLHRFGITNAKTCSTPI--SLKPSAEKGSPCSSSD

Query:  IQSFRPLVGALHYLTFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLRYISGTISYGLFFRKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSW
          SFR LVGAL YLTF+RPD++F V+ + QFM  P+  HL+AAKR+LRY+ GT+  GL FR  +SL L A++D+DWAG  +DRRST+G ++FLGS P++W
Subjt:  IQSFRPLVGALHYLTFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLRYISGTISYGLFFRKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSW

Query:  GAKKQSIVSRSSTEAEYRALASTAAELF
         +KKQ+ VSRSSTEAEYR+LAS  AELF
Subjt:  GAKKQSIVSRSSTEAEYRALASTAAELF

A0A2N9FJ85 Uncharacterized protein | e value: 3.9e-152 | identity: 37.17%
Query:  SQINPLYEDWVVKDQALMTLINATLSPAALAYVVGCTSSKQVWEVLEKHYSSSSRTNIVNLKPDFQSITKKPEDVQIY----------------------
        +QINP+Y  W  +D+ALM+LI+ATLSP+A + V+G +S+  +W +L K Y+S SR+NI+NLK     + K  + +  Y                      
Subjt:  SQINPLYEDWVVKDQALMTLINATLSPAALAYVVGCTSSKQVWEVLEKHYSSSSRTNIVNLKPDFQSITKKPEDVQIY----------------------

Query:  ----TLNRLPSEFNTFRTSMRTRSQSVSFDELHVLLKAEEAVIEKQSKHDDVVTQPAAMFASQPSSNSSQ---RPNSSGNF-GGGRAPSHGNFGRE--SN
             L  LPSE+ +F ++M T+++SVSF+ELHVL+ ++E +++    +    +  A    + P +NS +        G+F   GR  + G F R   + 
Subjt:  ----TLNRLPSEFNTFRTSMRTRSQSVSFDELHVLLKAEEAVIEKQSKHDDVVTQPAAMFASQPSSNSSQ---RPNSSGNF-GGGRAPSHGNFGRE--SN

Query:  GRGGYGYSSGRGNGN-----SGRSSFSSPQSS------------------------------------DGQGRVVTLI----------------------
        G   +GYS   G  +     +  +S  SP  S                                    +GQ   ++ I                      
Subjt:  GRGGYGYSSGRGNGN-----SGRSSFSSPQSS------------------------------------DGQGRVVTLI----------------------

Query:  ----LYLANRFRLYAHLASEYAGDD-QVSEKSTGRILFQGPSVNGLYPL-------SSFSTSVSPSCYVAHVAANKTYS-LWHCRLGHPSHTILNSVLRV
            L   N+F    H A  +  D  Q+ ++ +G+ L++G S +GLYPL        S S S SPS   A + + ++ S LWH R GHP   +L  +L  
Subjt:  ----LYLANRFRLYAHLASEYAGDD-QVSEKSTGRILFQGPSVNGLYPL-------SSFSTSVSPSCYVAHVAANKTYS-LWHCRLGHPSHTILNSVLRV

Query:  -----IGLNTCFVSPCDSDGIS-------------------PTVAAPEPVSFTNSPAPSS-----------PRVVLADAS----------DPIPSPAAPE
             +  ++ F   C    ++                     V  P P++  N P PS+           P  +L + S           P+ S + P 
Subjt:  -----IGLNTCFVSPCDSDGIS-------------------PTVAAPEPVSFTNSPAPSS-----------PRVVLADAS----------DPIPSPAAPE

Query:  TIPNVATITN------DLPS--------TMPLSD--------------------------ASSASTPLS---------------SDP-------------
         +P  +T+         LP+        T+P+                            +SS   PL                S P             
Subjt:  TIPNVATITN------DLPS--------TMPLSD--------------------------ASSASTPLS---------------SDP-------------

Query:  --ETEPTSYTAASKFPVWLTAMRAKFDALQKQGTWQLTPLPPGKSEIGCKWVYRVKRHPDGSIARYKTRLVAKGYHQQEGIDFDETFSPVVKKPTVFIIL
          +TEP + T AS+FP W  AM+A+FDAL +Q TW L P P G++ IGC+WVY++KRH DGSIARYK RLVAKG+HQQ G+DFDETFSPVVK PTV I+L
Subjt:  --ETEPTSYTAASKFPVWLTAMRAKFDALQKQGTWQLTPLPPGKSEIGCKWVYRVKRHPDGSIARYKTRLVAKGYHQQEGIDFDETFSPVVKKPTVFIIL

Query:  SLAAQYGWQLRQLEVKNGFLHGDLHEDVYMQQPQGFIDSSQPTYVCKLLKSLYGLKPTPRAWFECFTSYLLTLGFHASQADSSFFVLKLHGAVLYLLLYV
        SLAAQ  W LRQL++ N FLHG L EDVYM QP GF+DS+ P  VCKL KSLYGLK  PRAWFE FTS+LLT+GF AS AD S FV +    +LYLLLYV
Subjt:  SLAAQYGWQLRQLEVKNGFLHGDLHEDVYMQQPQGFIDSSQPTYVCKLLKSLYGLKPTPRAWFECFTSYLLTLGFHASQADSSFFVLKLHGAVLYLLLYV

Query:  DDIVITSSNTSLINSLISKLQSRFDMTDLGHLTYFLGLEVHHSSSGIQISQTKYARDVLHRFGITNAKTCSTPISLKPSAEKGSPCSSSDIQSFRPLVGA
        DDI++T ++ + I SLI++L S F++ DLG L YFLGL++ +    + ++Q KY  D+L +F +T  K  STP  +    +  S    SD   +R LVGA
Subjt:  DDIVITSSNTSLINSLISKLQSRFDMTDLGHLTYFLGLEVHHSSSGIQISQTKYARDVLHRFGITNAKTCSTPISLKPSAEKGSPCSSSDIQSFRPLVGA

Query:  LHYLTFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLRYISGTISYGLFFRKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSWGAKKQSIVSR
        L Y TF+RPDI + V+ + Q+M  P+  HL AAKR+LRY+ GT+  G+ F+ G+S  LTAF+DSDWAG   DRRSTTG  +FLG+NP++W +KKQ  VSR
Subjt:  LHYLTFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLRYISGTISYGLFFRKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSWGAKKQSIVSR

Query:  SSTEAEYRALASTAAEL
        SSTEAEYRALA+ AA+L
Subjt:  SSTEAEYRALASTAAEL

A0A2N9GEE1 Reverse transcriptase Ty1/copia-type domain-containing protein | e value: 3.3e-151 | identity: 40.41%
Query:  HVVTVSQINPLYEDWVVKDQALMTLINATLSPAALAYVVGCTSSKQVWEVLEKHYSSSSRTNIVNLKPDFQSITKKPEDVQIY-----------------
        H + V ++NPLY  W  +D+ L +LI++TLSP+A++ V+G T++  +W V+   Y+S SR+++VNLK +  SI K  + V  Y                 
Subjt:  HVVTVSQINPLYEDWVVKDQALMTLINATLSPAALAYVVGCTSSKQVWEVLEKHYSSSSRTNIVNLKPDFQSITKKPEDVQIY-----------------

Query:  ---------TLNRLPSEFNTFRTSMRTRSQSVSFDELHVLLKAEEAVIE------KQSKHDDVVTQPAAMFASQPSSNSSQRPNSSGNFGGGRAPSHGNF
                  L  LPS+F++F ++M T++++V F ELH L++ EE +++      K+  H  +   P A  A+     ++  P+S+         +  +F
Subjt:  ---------TLNRLPSEFNTFRTSMRTRSQSVSFDELHVLLKAEEAVIE------KQSKHDDVVTQPAAMFASQPSSNSSQRPNSSGNFGGGRAPSHGNF

Query:  GRE-SNGRGGYGYSSGR----GNGNS------GRSSFSSPQSSDGQGRVVTLILYLAN-----RFRLYAHLASEYAGDD-QVSEKSTGRILFQGPSVNGL
          +  N    + Y+  +    GNGN       G S   +        +V+ +    +N     RF    H +  +  D+ Q+ ++ TG+ L++G S  GL
Subjt:  GRE-SNGRGGYGYSSGR----GNGNS------GRSSFSSPQSSDGQGRVVTLILYLAN-----RFRLYAHLASEYAGDD-QVSEKSTGRILFQGPSVNGL

Query:  YPLSSF----------------STSVSPSCYVAHVAANK----------TYSLWHCRLGHPSHTILNSVLRVIGLNTCFVSPCDSDGISPTVAAPEPVSF
        YP+                   ST++ PS    H +               SLWH  LGHP                           SPT    +P + 
Subjt:  YPLSSF----------------STSVSPSCYVAHVAANK----------TYSLWHCRLGHPSHTILNSVLRVIGLNTCFVSPCDSDGISPTVAAPEPVSF

Query:  TNSPAPSSPRVVLADASDPIPSPAA--PETIPNVATITNDLPSTMPLSDASSASTPLSSDPETEPTSYTAASKFPVWLTAMRAKFDALQKQGTWQLTPLP
        T  P P+ P   L  +S PI +  A  P  +P+    T    S +      +AST +    +TEP S+T ASK P W  AM ++FDALQ+Q TW L P  
Subjt:  TNSPAPSSPRVVLADASDPIPSPAA--PETIPNVATITNDLPSTMPLSDASSASTPLSSDPETEPTSYTAASKFPVWLTAMRAKFDALQKQGTWQLTPLP

Query:  PGKSEIGCKWVYRVKRHPDGSIARYKTRLVAKGYHQQEGIDFDETFSPVVKKPTVFIILSLAAQYGWQLRQLEVKNGFLHGDLHEDVYMQQPQGFIDSSQ
          ++ +GC+WVY++KR+ DGS++RYK +LVAKG+HQ+ G+DFDETFSPVVK PTV IILSL AQ  W LRQL+V N FL G L E +YM QP GFIDS+ 
Subjt:  PGKSEIGCKWVYRVKRHPDGSIARYKTRLVAKGYHQQEGIDFDETFSPVVKKPTVFIILSLAAQYGWQLRQLEVKNGFLHGDLHEDVYMQQPQGFIDSSQ

Query:  PTYVCKLLKSLYGLKPTPRAWFECFTSYLLTLGFHASQADSSFFVLKLHGAVLYLLLYVDDIVITSSNTSLINSLISKLQSRFDMTDLGHLTYFLGLEVH
        P+ VCKL KSLYGLK  PRAWFE FTS+LLTLGF AS AD+S FVL      +YLLLYVDDI+IT +N++ I+ ++S+L + F++ DLG L YFLGL++ 
Subjt:  PTYVCKLLKSLYGLKPTPRAWFECFTSYLLTLGFHASQADSSFFVLKLHGAVLYLLLYVDDIVITSSNTSLINSLISKLQSRFDMTDLGHLTYFLGLEVH

Query:  HSSSGIQISQTKYARDVLHRFGITNAKTCSTPISLKPSAEKGSPCSSSDIQSFRPLVGALHYLTFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLRYIS
        +   G+ + Q KY  D+LH+F +T+ K   TPI+  P        + SD   FR LVGAL Y TF+RPDIAF V+ + QFM  PST+H VAAKR+LRY+ 
Subjt:  HSSSGIQISQTKYARDVLHRFGITNAKTCSTPISLKPSAEKGSPCSSSDIQSFRPLVGALHYLTFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLRYIS

Query:  GTISYGLFFRKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSWGAKKQSIVSRSSTEAEYRALASTAAEL
        GT+  G+ F+ G  L LTAF+D+DWA    DRRST+G ++FLG NP++W AKKQ  VSRSSTEAEYR+LA+ A EL
Subjt:  GTISYGLFFRKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSWGAKKQSIVSRSSTEAEYRALASTAAEL

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein | e value: 2.0e-68 | identity: 37.35%
Query:  WLTAMRAKFDALQKQGTWQLTPLPPGKSEIGCKWVYRVKRHPDGSIARYKTRLVAKGYHQQEGIDFDETFSPVVKKPTVFIILSLAAQYGWQLRQLEVKN
        W  A+  + +A +   TW +T  P  K+ +  +WV+ VK +  G+  RYK RLVA+G+ Q+  ID++ETF+PV +  +   ILSL  QY  ++ Q++VK 
Subjt:  WLTAMRAKFDALQKQGTWQLTPLPPGKSEIGCKWVYRVKRHPDGSIARYKTRLVAKGYHQQEGIDFDETFSPVVKKPTVFIILSLAAQYGWQLRQLEVKN

Query:  GFLHGDLHEDVYMQQPQGFIDSSQPTYVCKLLKSLYGLKPTPRAWFECFTSYLLTLGFHASQADSSFFVLKLHGAV---LYLLLYVDDIVITSSNTSLIN
         FL+G L E++YM+ PQG   S     VCKL K++YGLK   R WFE F   L    F  S  D   ++L   G +   +Y+LLYVDD+VI + + + +N
Subjt:  GFLHGDLHEDVYMQQPQGFIDSSQPTYVCKLLKSLYGLKPTPRAWFECFTSYLLTLGFHASQADSSFFVLKLHGAV---LYLLLYVDDIVITSSNTSLIN

Query:  SLISKLQSRFDMTDLGHLTYFLGLEVHHSSSGIQISQTKYARDVLHRFGITNAKTCSTPISLKPSAEKGSPCSSSDIQS-FRPLVGALHYLTF-SRPDIA
        +    L  +F MTDL  + +F+G+ +      I +SQ+ Y + +L +F + N    STP+  K + E  +  S  D  +  R L+G L Y+   +RPD+ 
Subjt:  SLISKLQSRFDMTDLGHLTYFLGLEVHHSSSGIQISQTKYARDVLHRFGITNAKTCSTPISLKPSAEKGSPCSSSDIQS-FRPLVGALHYLTF-SRPDIA

Query:  FTVSWLSQFMQSPSTIHLVAAKRVLRYISGTISYGLFFRKGNSL--RLTAFSDSDWAGCSIDRRSTTGFVL-FLGSNPVSWGAKKQSIVSRSSTEAEYRA
          V+ LS++    ++      KRVLRY+ GTI   L F+K  +   ++  + DSDWAG  IDR+STTG++      N + W  K+Q+ V+ SSTEAEY A
Subjt:  FTVSWLSQFMQSPSTIHLVAAKRVLRYISGTISYGLFFRKGNSL--RLTAFSDSDWAGCSIDRRSTTGFVL-FLGSNPVSWGAKKQSIVSRSSTEAEYRA

Query:  LASTAAE
        L     E
Subjt:  LASTAAE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 9.7e-76 | % identity: 40.44
Query:  LTAMRAKFDALQKQGTWQLTPLPPGKSEIGCKWVYRVKRHPDGSIARYKTRLVAKGYHQQEGIDFDETFSPVVKKPTVFIILSLAAQYGWQLRQLEVKNG
        + AM+ + ++LQK GT++L  LP GK  + CKWV+++K+  D  + RYK RLV KG+ Q++GIDFDE FSPVVK  ++  ILSLAA    ++ QL+VK  
Subjt:  LTAMRAKFDALQKQGTWQLTPLPPGKSEIGCKWVYRVKRHPDGSIARYKTRLVAKGYHQQEGIDFDETFSPVVKKPTVFIILSLAAQYGWQLRQLEVKNG

Query:  FLHGDLHEDVYMQQPQGFIDSSQPTYVCKLLKSLYGLKPTPRAWFECFTSYLLTLGFHASQADSSFFVLKL-HGAVLYLLLYVDDIVITSSNTSLINSLI
        FLHGDL E++YM+QP+GF  + +   VCKL KSLYGLK  PR W+  F S++ +  +  + +D   +  +      + LLLYVDD++I   +  LI  L 
Subjt:  FLHGDLHEDVYMQQPQGFIDSSQPTYVCKLLKSLYGLKPTPRAWFECFTSYLLTLGFHASQADSSFFVLKL-HGAVLYLLLYVDDIVITSSNTSLINSLI

Query:  SKLQSRFDMTDLGHLTYFLGLEV--HHSSSGIQISQTKYARDVLHRFGITNAKTCSTPIS--LKPS--------AEKGSPCSSSDIQSFRPLVGALHY-L
          L   FDM DLG     LG+++    +S  + +SQ KY   VL RF + NAK  STP++  LK S         EKG+         +   VG+L Y +
Subjt:  SKLQSRFDMTDLGHLTYFLGLEV--HHSSSGIQISQTKYARDVLHRFGITNAKTCSTPIS--LKPS--------AEKGSPCSSSDIQSFRPLVGALHY-L

Query:  TFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLRYISGTISYGLFFRKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSWGAKKQSIVSRSSTE
          +RPDIA  V  +S+F+++P   H  A K +LRY+ GT    L F  G+   L  ++D+D AG   +R+S+TG++       +SW +K Q  V+ S+TE
Subjt:  TFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLRYISGTISYGLFFRKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSWGAKKQSIVSRSSTE

Query:  AEYRALASTAAEL
        AEY A   T  E+
Subjt:  AEYRALASTAAEL

P92519 Uncharacterized mitochondrial protein AtMg00810 | e value: 6.6e-56 | % identity: 52.23
Query:  LYLLLYVDDIVITSSNTSLINSLISKLQSRFDMTDLGHLTYFLGLEVHHSSSGIQISQTKYARDVLHRFGITNAKTCSTPISLKPSAEKGSPCSSSDIQS
        +YLLLYVDDI++T S+ +L+N LI +L S F M DLG + YFLG+++    SG+ +SQTKYA  +L+  G+ + K  STP+ LK ++   S     D   
Subjt:  LYLLLYVDDIVITSSNTSLINSLISKLQSRFDMTDLGHLTYFLGLEVHHSSSGIQISQTKYARDVLHRFGITNAKTCSTPISLKPSAEKGSPCSSSDIQS

Query:  FRPLVGALHYLTFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLRYISGTISYGLFFRKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSWGAK
        FR +VGAL YLT +RPDI++ V+ + Q M  P+       KRVLRY+ GTI +GL+  K + L + AF DSDWAGC+  RRSTTGF  FLG N +SW AK
Subjt:  FRPLVGALHYLTFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLRYISGTISYGLFFRKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSWGAK

Query:  KQSIVSRSSTEAEYRALASTAAEL
        +Q  VSRSSTE EYRALA TAAEL
Subjt:  KQSIVSRSSTEAEYRALASTAAEL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 4.1e-106 | % identity: 42.48
Query:  SFTNSPAPSSPRVVLADASDPIPSPAA-----PETIPNVATITNDLP-STMPLSDASSA---------STPLSSDPETEPTSYTAASKFPVWLTAMRAKF
        S ++SP+P++     A +S   P+P +     P  +  +    N  P +T  +   + A         S  +S   E+EP +   A K   W  AM ++ 
Subjt:  SFTNSPAPSSPRVVLADASDPIPSPAA-----PETIPNVATITNDLP-STMPLSDASSA---------STPLSSDPETEPTSYTAASKFPVWLTAMRAKF

Query:  DALQKQGTWQLTPLPPGK-SEIGCKWVYRVKRHPDGSIARYKTRLVAKGYHQQEGIDFDETFSPVVKKPTVFIILSLAAQYGWQLRQLEVKNGFLHGDLH
        +A     TW L P PP   + +GC+W++  K + DGS+ RYK RLVAKGY+Q+ G+D+ ETFSPV+K  ++ I+L +A    W +RQL+V N FL G L 
Subjt:  DALQKQGTWQLTPLPPGK-SEIGCKWVYRVKRHPDGSIARYKTRLVAKGYHQQEGIDFDETFSPVVKKPTVFIILSLAAQYGWQLRQLEVKNGFLHGDLH

Query:  EDVYMQQPQGFIDSSQPTYVCKLLKSLYGLKPTPRAWFECFTSYLLTLGFHASQADSSFFVLKLHGAVLYLLLYVDDIVITSSNTSLINSLISKLQSRFD
        +DVYM QP GFID  +P YVCKL K+LYGLK  PRAW+    +YLLT+GF  S +D+S FVL+   +++Y+L+YVDDI+IT ++ +L+++ +  L  RF 
Subjt:  EDVYMQQPQGFIDSSQPTYVCKLLKSLYGLKPTPRAWFECFTSYLLTLGFHASQADSSFFVLKLHGAVLYLLLYVDDIVITSSNTSLINSLISKLQSRFD

Query:  MTDLGHLTYFLGLEVHHSSSGIQISQTKYARDVLHRFGITNAKTCSTPISLKPSAEKGSPCSSSDIQSFRPLVGALHYLTFSRPDIAFTVSWLSQFMQSP
        + D   L YFLG+E     +G+ +SQ +Y  D+L R  +  AK  +TP++  P     S    +D   +R +VG+L YL F+RPDI++ V+ LSQFM  P
Subjt:  MTDLGHLTYFLGLEVHHSSSGIQISQTKYARDVLHRFGITNAKTCSTPISLKPSAEKGSPCSSSDIQSFRPLVGALHYLTFSRPDIAFTVSWLSQFMQSP

Query:  STIHLVAAKRVLRYISGTISYGLFFRKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSWGAKKQSIVSRSSTEAEYRALASTAAEL
        +  HL A KR+LRY++GT ++G+F +KGN+L L A+SD+DWAG   D  ST G++++LG +P+SW +KKQ  V RSSTEAEYR++A+T++E+
Subjt:  STIHLVAAKRVLRYISGTISYGLFFRKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSWGAKKQSIVSRSSTEAEYRALASTAAEL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 1.6e-102 | % identity: 40.95
Query:  SPTVAAPEPVSFTNSPAPSSPRVVLADASDPIPSPAAPETIPNV--ATITNDLPSTMPLSDASSA--------------STPLSSDPETEPTSYTAASKF
        SP   +P P S  +SP   +P   +++ + P  S  +   +P V  A     + +  P++  S A              S   S    +EP +   A K 
Subjt:  SPTVAAPEPVSFTNSPAPSSPRVVLADASDPIPSPAAPETIPNV--ATITNDLPSTMPLSDASSA--------------STPLSSDPETEPTSYTAASKF

Query:  PVWLTAMRAKFDALQKQGTWQLT-PLPPGKSEIGCKWVYRVKRHPDGSIARYKTRLVAKGYHQQEGIDFDETFSPVVKKPTVFIILSLAAQYGWQLRQLE
          W  AM ++ +A     TW L  P PP  + +GC+W++  K + DGS+ RYK RLVAKGY+Q+ G+D+ ETFSPV+K  ++ I+L +A    W +RQL+
Subjt:  PVWLTAMRAKFDALQKQGTWQLT-PLPPGKSEIGCKWVYRVKRHPDGSIARYKTRLVAKGYHQQEGIDFDETFSPVVKKPTVFIILSLAAQYGWQLRQLE

Query:  VKNGFLHGDLHEDVYMQQPQGFIDSSQPTYVCKLLKSLYGLKPTPRAWFECFTSYLLTLGFHASQADSSFFVLKLHGAVLYLLLYVDDIVITSSNTSLIN
        V N FL G L ++VYM QP GF+D  +P YVC+L K++YGLK  PRAW+    +YLLT+GF  S +D+S FVL+   +++Y+L+YVDDI+IT ++T L+ 
Subjt:  VKNGFLHGDLHEDVYMQQPQGFIDSSQPTYVCKLLKSLYGLKPTPRAWFECFTSYLLTLGFHASQADSSFFVLKLHGAVLYLLLYVDDIVITSSNTSLIN

Query:  SLISKLQSRFDMTDLGHLTYFLGLEVHHSSSGIQISQTKYARDVLHRFGITNAKTCSTPISLKPSAEKGSPCSSSDIQSFRPLVGALHYLTFSRPDIAFT
          +  L  RF + +   L YFLG+E      G+ +SQ +Y  D+L R  +  AK  +TP++  P     S     D   +R +VG+L YL F+RPD+++ 
Subjt:  SLISKLQSRFDMTDLGHLTYFLGLEVHHSSSGIQISQTKYARDVLHRFGITNAKTCSTPISLKPSAEKGSPCSSSDIQSFRPLVGALHYLTFSRPDIAFT

Query:  VSWLSQFMQSPSTIHLVAAKRVLRYISGTISYGLFFRKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSWGAKKQSIVSRSSTEAEYRALASTA
        V+ LSQ+M  P+  H  A KRVLRY++GT  +G+F +KGN+L L A+SD+DWAG + D  ST G++++LG +P+SW +KKQ  V RSSTEAEYR++A+T+
Subjt:  VSWLSQFMQSPSTIHLVAAKRVLRYISGTISYGLFFRKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSWGAKKQSIVSRSSTEAEYRALASTA

Query:  AEL
        +EL
Subjt:  AEL

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | e value: 3.4e-100 | % identity: 45.32
Query:  EPTSYTAASKFPVWLTAMRAKFDALQKQGTWQLTPLPPGKSEIGCKWVYRVKRHPDGSIARYKTRLVAKGYHQQEGIDFDETFSPVVKKPTVFIILSLAA
        EP++Y  A +F VW  AM  +  A++   TW++  LPP K  IGCKWVY++K + DG+I RYK RLVAKGY QQEGIDF ETFSPV K  +V +IL+++A
Subjt:  EPTSYTAASKFPVWLTAMRAKFDALQKQGTWQLTPLPPGKSEIGCKWVYRVKRHPDGSIARYKTRLVAKGYHQQEGIDFDETFSPVVKKPTVFIILSLAA

Query:  QYGWQLRQLEVKNGFLHGDLHEDVYMQQPQGFI----DSSQPTYVCKLLKSLYGLKPTPRAWFECFTSYLLTLGFHASQADSSFFVLKLHGAVLYLLLYV
         Y + L QL++ N FL+GDL E++YM+ P G+     DS  P  VC L KS+YGLK   R WF  F+  L+  GF  S +D ++F+       L +L+YV
Subjt:  QYGWQLRQLEVKNGFLHGDLHEDVYMQQPQGFI----DSSQPTYVCKLLKSLYGLKPTPRAWFECFTSYLLTLGFHASQADSSFFVLKLHGAVLYLLLYV

Query:  DDIVITSSNTSLINSLISKLQSRFDMTDLGHLTYFLGLEVHHSSSGIQISQTKYARDVLHRFGITNAKTCSTPISLKPSAEKGSPCSSSDIQSFRPLVGA
        DDI+I S+N + ++ L S+L+S F + DLG L YFLGLE+  S++GI I Q KYA D+L   G+   K  S P+    +    S     D +++R L+G 
Subjt:  DDIVITSSNTSLINSLISKLQSRFDMTDLGHLTYFLGLEVHHSSSGIQISQTKYARDVLHRFGITNAKTCSTPISLKPSAEKGSPCSSSDIQSFRPLVGA

Query:  LHYLTFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLRYISGTISYGLFFRKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSWGAKKQSIVSR
        L YL  +R DI+F V+ LSQF ++P   H  A  ++L YI GT+  GLF+     ++L  FSD+ +  C   RRST G+ +FLG++ +SW +KKQ +VS+
Subjt:  LHYLTFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLRYISGTISYGLFFRKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSWGAKKQSIVSR

Query:  SSTEAEYRALASTAAEL
        SS EAEYRAL+    E+
Subjt:  SSTEAEYRALASTAAEL

ATMG00240.1 Gag-Pol-related retrotransposon family protein | e value: 1.5e-15 | % identity: 44.9
Query:  YLTFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLRYISGTISYGLFFRKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSW-GAKKQSIVS
        YLT +RPD+ F V+ LSQF  +  T  + A  +VL Y+ GT+  GLF+   + L+L AF+DSDWA C   RRS TGF   +   P+ + GA ++SI+S
Subjt:  YLTFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLRYISGTISYGLFFRKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSW-GAKKQSIVS

ATMG00810.1 DNA/RNA polymerases superfamily protein | e value: 4.7e-57 | % identity: 52.23
Query:  LYLLLYVDDIVITSSNTSLINSLISKLQSRFDMTDLGHLTYFLGLEVHHSSSGIQISQTKYARDVLHRFGITNAKTCSTPISLKPSAEKGSPCSSSDIQS
        +YLLLYVDDI++T S+ +L+N LI +L S F M DLG + YFLG+++    SG+ +SQTKYA  +L+  G+ + K  STP+ LK ++   S     D   
Subjt:  LYLLLYVDDIVITSSNTSLINSLISKLQSRFDMTDLGHLTYFLGLEVHHSSSGIQISQTKYARDVLHRFGITNAKTCSTPISLKPSAEKGSPCSSSDIQS

Query:  FRPLVGALHYLTFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLRYISGTISYGLFFRKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSWGAK
        FR +VGAL YLT +RPDI++ V+ + Q M  P+       KRVLRY+ GTI +GL+  K + L + AF DSDWAGC+  RRSTTGF  FLG N +SW AK
Subjt:  FRPLVGALHYLTFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLRYISGTISYGLFFRKGNSLRLTAFSDSDWAGCSIDRRSTTGFVLFLGSNPVSWGAK

Query:  KQSIVSRSSTEAEYRALASTAAEL
        +Q  VSRSSTE EYRALA TAAEL
Subjt:  KQSIVSRSSTEAEYRALASTAAEL

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | e value: 7.5e-23 | % identity: 46.85
Query:  STPLSSDPETEPTSYTAASKFPVWLTAMRAKFDALQKQGTWQLTPLPPGKSEIGCKWVYRVKRHPDGSIARYKTRLVAKGYHQQEGIDFDETFSPVVKKP
        S  +++  + EP S   A K P W  AM+ + DAL +  TW L P P  ++ +GCKWV++ K H DG++ R K RLVAKG+HQ+EGI F ET+SPVV+  
Subjt:  STPLSSDPETEPTSYTAASKFPVWLTAMRAKFDALQKQGTWQLTPLPPGKSEIGCKWVYRVKRHPDGSIARYKTRLVAKGYHQQEGIDFDETFSPVVKKP

Query:  TVFIILSLAAQ
        T+  IL++A Q
Subjt:  TVFIILSLAAQ
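
The unlabeled line between each Query and Subjt pair appears to follow the usual BLAST match-line convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. Under that assumption, the minimal Python sketch below counts identities and positives for one alignment block. The function name `midline_stats` is hypothetical, and the percentage it returns covers only the block passed in, so it need not equal the % identity reported for the whole hit.

```python
def midline_stats(midline: str, aligned_length: int) -> tuple[int, int, float]:
    """Count identities and positives in one BLAST-style match line.

    midline        -- the match line with the 8-character left margin removed
    aligned_length -- number of alignment columns in the block (gaps included)
    """
    # Letters on the match line mark positions where both residues are identical.
    identical = sum(ch.isalpha() for ch in midline)
    # '+' marks a conservative (positively scoring) substitution.
    positives = identical + midline.count("+")
    percent_identity = 100.0 * identical / aligned_length
    return identical, positives, percent_identity


# Final block of the ATMG00820.1 alignment (query segment TVFIILSLAAQ, 11 columns).
identical, positives, pct = midline_stats("T+  IL++A Q", 11)
print(identical, positives, round(pct, 2))
```

Summing the identity counts and column counts over every block of a hit, and then dividing, gives a figure comparable to the reported % identity.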


Sequences
CDS sequence
ATGTGTTTGTCGATGGATGTGGCAAGTCTTGTAGCCCATGAGACAACTGCAGTCAAGTTGATGGAAGCGCTTACAAACATGTATGAAAAACCCTCTGCAAATAATAAGGT
CTACCTAGTTAAGAAGTTTTTCAACATGCAAACGTTTGAGGATGCTTCAGTGAATTCCTATATTAATGAGTTGATAACGTCTTTACCTGATAGTTGGGAAACGATGAAGA
CAACAGTGTCTAATTCGACTGGAAATAACACTTTAAAATTTTCAGAAGTTTGTGATTTAGCCATAGCAGAGGAAATTCGTAGGCAGGGTAGTAATAAAGAGTCTACTGTA
GGGTCAGCTTTGGTTATGACTAAAGGAGGGCATCATGGTCTAGTGAGGATGGGGAATGGTAGAGCCTCCAAGACTAGAGGGATTGGAGATGTTGGTCTGAAGACAGAATG
TGCAGGTAGTGGCATTGGTCACAGGAAATCTACACTGTACAGATGTCAGTTGAATGTTGCCAAAGGTTCAAAGAGACAGTGGATGCCGGTTAAAGCTGCATATGGTTGTT
GTAGAGGTACAGTTGAGCCAGCAGCAAGGATAGCCAATTTCGATCAGTCCGATCAAGATCCTTCAGTTCATAAACAATTGGGAAGTCAAGTAGAGAAAGTTGATGGCTAT
CGTGAATCCCCAGTTGTCAGACGGTCGAATGAATTGAAGAAGTCGCTTAGGCGAGTTGAAGCATCAAGGTGGAAGGCCAGAGCAGTTGCTAAGGTCAAAGGTCAGGTCTC
TAGCTTGGTAATAGGTTTGAATAGAGGATTCAAGCCATTCTTAGAGTGTATCTTCTTCAGGAACAGTTGTTCGGGTTGGAAGAAGATGACAGTGGGAGACGAGATCGTTG
TTTTGTCTCCAAGTGGGAAATTGTCAGGATTGGTGGAGACAAACCATCACTGTTGTCGTGTGGAGAGAATAAAAGAAGGGTTTTGGGCCTTAAGGCCCATTTACATTTAC
CTTAGGTTTCCTGTAATTTCCCAAATTCTAGTAGAACCTGCAGTTGCAAAAGACTGGACCTACCGACCACCAGCTCATGTTGTTACTGTTTCACAAATTAATCCTCTCTA
TGAGGATTGGGTTGTGAAGGATCAAGCCCTGATGACGTTAATCAACGCCACACTTTCGCCCGCAGCTTTGGCATATGTTGTCGGATGCACCTCATCTAAGCAAGTTTGGG
AAGTACTCGAAAAGCATTATTCATCAAGCTCCAGAACCAATATTGTCAACCTGAAGCCTGACTTTCAATCTATTACCAAGAAACCGGAAGATGTTCAGATTTATACTCTA
AATAGACTGCCCTCTGAGTTTAATACTTTTCGTACATCCATGAGAACTCGTTCCCAATCAGTCTCTTTTGATGAGTTACATGTTTTGTTGAAAGCTGAGGAAGCTGTGAT
TGAGAAACAATCAAAACATGATGATGTTGTGACACAACCTGCTGCTATGTTTGCATCTCAACCTTCATCAAATTCTTCTCAACGCCCCAACTCGTCTGGAAATTTTGGTG
GAGGAAGGGCACCTAGTCATGGAAATTTTGGCCGCGAAAGTAATGGCAGAGGAGGTTATGGCTATTCCTCTGGTCGAGGGAATGGAAATTCAGGCAGGAGTTCTTTTTCT
TCTCCTCAATCCTCTGATGGACAAGGACGAGTTGTAACGCTCATCCTCTACTTGGCTAACCGATTCAGGTTGTACGCTCATCTCGCTTCAGAATATGCAGGTGATGATCA
AGTTTCGGAAAAGTCTACGGGCAGAATTTTGTTCCAAGGGCCTAGTGTCAATGGCCTTTATCCGCTGTCTTCTTTTTCTACCTCTGTTTCTCCATCATGTTATGTTGCTC
ATGTTGCTGCAAATAAAACTTATAGTCTATGGCATTGTCGTTTGGGGCACCCTAGCCATACCATTTTGAATTCTGTTCTTCGTGTTATAGGGTTAAATACTTGTTTTGTT
TCCCCTTGCGATTCTGATGGCATTTCTCCTACAGTTGCAGCACCCGAGCCTGTCTCCTTCACCAATTCCCCTGCTCCTTCTTCCCCACGGGTTGTTTTAGCTGATGCCAG
TGACCCTATTCCTTCTCCTGCAGCACCCGAGACTATTCCTAATGTTGCTACTATTACTAATGATTTACCTTCCACTATGCCTTTATCAGATGCTTCTTCAGCATCAACAC
CTTTGTCCTCTGATCCTGAAACTGAACCAACTTCATATACTGCTGCTTCCAAGTTTCCTGTCTGGCTGACGGCTATGCGTGCAAAATTCGATGCTTTACAAAAGCAAGGT
ACATGGCAACTCACTCCCTTGCCCCCTGGTAAAAGTGAAATTGGTTGTAAATGGGTATATCGGGTCAAGCGCCATCCAGATGGATCGATTGCTCGTTACAAAACCCGTCT
TGTGGCTAAGGGATATCACCAACAAGAGGGTATCGACTTTGATGAGACTTTCAGTCCGGTGGTCAAGAAACCTACTGTTTTTATAATTCTCTCTTTAGCAGCTCAGTATG
GTTGGCAACTTCGCCAGCTTGAAGTAAAGAATGGTTTCCTCCATGGTGACCTACATGAGGATGTTTATATGCAACAACCCCAAGGCTTCATTGACTCCTCTCAGCCAACC
TATGTTTGTAAATTGCTCAAGTCTCTCTATGGACTGAAACCGACTCCCCGTGCTTGGTTTGAGTGTTTTACCTCATACTTACTAACCCTTGGTTTTCATGCTTCTCAAGC
CGACTCTTCATTTTTTGTACTCAAGCTTCATGGTGCTGTTCTTTATCTTCTCCTATATGTTGACGATATTGTCATCACGAGCTCGAACACTTCCCTGATTAACTCTCTAA
TATCTAAGCTTCAGTCTCGCTTTGATATGACGGACCTTGGCCATCTCACATATTTTCTTGGCCTAGAGGTTCACCACTCGTCTTCAGGTATACAAATATCCCAAACAAAA
TATGCTCGTGATGTCCTTCACCGTTTTGGAATTACCAATGCCAAGACCTGTTCCACGCCTATTTCCCTTAAACCGTCTGCCGAGAAAGGCTCTCCTTGCTCCTCCTCTGA
TATTCAAAGCTTTCGGCCACTGGTTGGTGCTTTACATTATCTAACCTTCTCGCGGCCAGACATTGCTTTCACAGTAAGTTGGTTATCACAATTTATGCAGTCTCCCTCTA
CTATTCACTTGGTTGCAGCAAAGCGTGTGCTCAGATATATCAGTGGCACAATCTCTTATGGGTTATTCTTTCGCAAAGGGAATTCTTTGCGTTTAACTGCCTTCTCAGAC
TCGGATTGGGCAGGCTGTTCCATTGACAGGCGTTCAACAACAGGTTTTGTCCTCTTTCTCGGTTCCAATCCTGTTTCTTGGGGTGCAAAGAAACAATCCATTGTTTCTCG
AAGTTCCACGGAGGCTGAATATAGAGCTCTCGCATCTACCGCTGCTGAATTATTTTAG
mRNA sequence
ATGTGTTTGTCGATGGATGTGGCAAGTCTTGTAGCCCATGAGACAACTGCAGTCAAGTTGATGGAAGCGCTTACAAACATGTATGAAAAACCCTCTGCAAATAATAAGGT
CTACCTAGTTAAGAAGTTTTTCAACATGCAAACGTTTGAGGATGCTTCAGTGAATTCCTATATTAATGAGTTGATAACGTCTTTACCTGATAGTTGGGAAACGATGAAGA
CAACAGTGTCTAATTCGACTGGAAATAACACTTTAAAATTTTCAGAAGTTTGTGATTTAGCCATAGCAGAGGAAATTCGTAGGCAGGGTAGTAATAAAGAGTCTACTGTA
GGGTCAGCTTTGGTTATGACTAAAGGAGGGCATCATGGTCTAGTGAGGATGGGGAATGGTAGAGCCTCCAAGACTAGAGGGATTGGAGATGTTGGTCTGAAGACAGAATG
TGCAGGTAGTGGCATTGGTCACAGGAAATCTACACTGTACAGATGTCAGTTGAATGTTGCCAAAGGTTCAAAGAGACAGTGGATGCCGGTTAAAGCTGCATATGGTTGTT
GTAGAGGTACAGTTGAGCCAGCAGCAAGGATAGCCAATTTCGATCAGTCCGATCAAGATCCTTCAGTTCATAAACAATTGGGAAGTCAAGTAGAGAAAGTTGATGGCTAT
CGTGAATCCCCAGTTGTCAGACGGTCGAATGAATTGAAGAAGTCGCTTAGGCGAGTTGAAGCATCAAGGTGGAAGGCCAGAGCAGTTGCTAAGGTCAAAGGTCAGGTCTC
TAGCTTGGTAATAGGTTTGAATAGAGGATTCAAGCCATTCTTAGAGTGTATCTTCTTCAGGAACAGTTGTTCGGGTTGGAAGAAGATGACAGTGGGAGACGAGATCGTTG
TTTTGTCTCCAAGTGGGAAATTGTCAGGATTGGTGGAGACAAACCATCACTGTTGTCGTGTGGAGAGAATAAAAGAAGGGTTTTGGGCCTTAAGGCCCATTTACATTTAC
CTTAGGTTTCCTGTAATTTCCCAAATTCTAGTAGAACCTGCAGTTGCAAAAGACTGGACCTACCGACCACCAGCTCATGTTGTTACTGTTTCACAAATTAATCCTCTCTA
TGAGGATTGGGTTGTGAAGGATCAAGCCCTGATGACGTTAATCAACGCCACACTTTCGCCCGCAGCTTTGGCATATGTTGTCGGATGCACCTCATCTAAGCAAGTTTGGG
AAGTACTCGAAAAGCATTATTCATCAAGCTCCAGAACCAATATTGTCAACCTGAAGCCTGACTTTCAATCTATTACCAAGAAACCGGAAGATGTTCAGATTTATACTCTA
AATAGACTGCCCTCTGAGTTTAATACTTTTCGTACATCCATGAGAACTCGTTCCCAATCAGTCTCTTTTGATGAGTTACATGTTTTGTTGAAAGCTGAGGAAGCTGTGAT
TGAGAAACAATCAAAACATGATGATGTTGTGACACAACCTGCTGCTATGTTTGCATCTCAACCTTCATCAAATTCTTCTCAACGCCCCAACTCGTCTGGAAATTTTGGTG
GAGGAAGGGCACCTAGTCATGGAAATTTTGGCCGCGAAAGTAATGGCAGAGGAGGTTATGGCTATTCCTCTGGTCGAGGGAATGGAAATTCAGGCAGGAGTTCTTTTTCT
TCTCCTCAATCCTCTGATGGACAAGGACGAGTTGTAACGCTCATCCTCTACTTGGCTAACCGATTCAGGTTGTACGCTCATCTCGCTTCAGAATATGCAGGTGATGATCA
AGTTTCGGAAAAGTCTACGGGCAGAATTTTGTTCCAAGGGCCTAGTGTCAATGGCCTTTATCCGCTGTCTTCTTTTTCTACCTCTGTTTCTCCATCATGTTATGTTGCTC
ATGTTGCTGCAAATAAAACTTATAGTCTATGGCATTGTCGTTTGGGGCACCCTAGCCATACCATTTTGAATTCTGTTCTTCGTGTTATAGGGTTAAATACTTGTTTTGTT
TCCCCTTGCGATTCTGATGGCATTTCTCCTACAGTTGCAGCACCCGAGCCTGTCTCCTTCACCAATTCCCCTGCTCCTTCTTCCCCACGGGTTGTTTTAGCTGATGCCAG
TGACCCTATTCCTTCTCCTGCAGCACCCGAGACTATTCCTAATGTTGCTACTATTACTAATGATTTACCTTCCACTATGCCTTTATCAGATGCTTCTTCAGCATCAACAC
CTTTGTCCTCTGATCCTGAAACTGAACCAACTTCATATACTGCTGCTTCCAAGTTTCCTGTCTGGCTGACGGCTATGCGTGCAAAATTCGATGCTTTACAAAAGCAAGGT
ACATGGCAACTCACTCCCTTGCCCCCTGGTAAAAGTGAAATTGGTTGTAAATGGGTATATCGGGTCAAGCGCCATCCAGATGGATCGATTGCTCGTTACAAAACCCGTCT
TGTGGCTAAGGGATATCACCAACAAGAGGGTATCGACTTTGATGAGACTTTCAGTCCGGTGGTCAAGAAACCTACTGTTTTTATAATTCTCTCTTTAGCAGCTCAGTATG
GTTGGCAACTTCGCCAGCTTGAAGTAAAGAATGGTTTCCTCCATGGTGACCTACATGAGGATGTTTATATGCAACAACCCCAAGGCTTCATTGACTCCTCTCAGCCAACC
TATGTTTGTAAATTGCTCAAGTCTCTCTATGGACTGAAACCGACTCCCCGTGCTTGGTTTGAGTGTTTTACCTCATACTTACTAACCCTTGGTTTTCATGCTTCTCAAGC
CGACTCTTCATTTTTTGTACTCAAGCTTCATGGTGCTGTTCTTTATCTTCTCCTATATGTTGACGATATTGTCATCACGAGCTCGAACACTTCCCTGATTAACTCTCTAA
TATCTAAGCTTCAGTCTCGCTTTGATATGACGGACCTTGGCCATCTCACATATTTTCTTGGCCTAGAGGTTCACCACTCGTCTTCAGGTATACAAATATCCCAAACAAAA
TATGCTCGTGATGTCCTTCACCGTTTTGGAATTACCAATGCCAAGACCTGTTCCACGCCTATTTCCCTTAAACCGTCTGCCGAGAAAGGCTCTCCTTGCTCCTCCTCTGA
TATTCAAAGCTTTCGGCCACTGGTTGGTGCTTTACATTATCTAACCTTCTCGCGGCCAGACATTGCTTTCACAGTAAGTTGGTTATCACAATTTATGCAGTCTCCCTCTA
CTATTCACTTGGTTGCAGCAAAGCGTGTGCTCAGATATATCAGTGGCACAATCTCTTATGGGTTATTCTTTCGCAAAGGGAATTCTTTGCGTTTAACTGCCTTCTCAGAC
TCGGATTGGGCAGGCTGTTCCATTGACAGGCGTTCAACAACAGGTTTTGTCCTCTTTCTCGGTTCCAATCCTGTTTCTTGGGGTGCAAAGAAACAATCCATTGTTTCTCG
AAGTTCCACGGAGGCTGAATATAGAGCTCTCGCATCTACCGCTGCTGAATTATTTTAG
Protein sequence
MCLSMDVASLVAHETTAVKLMEALTNMYEKPSANNKVYLVKKFFNMQTFEDASVNSYINELITSLPDSWETMKTTVSNSTGNNTLKFSEVCDLAIAEEIRRQGSNKESTV
GSALVMTKGGHHGLVRMGNGRASKTRGIGDVGLKTECAGSGIGHRKSTLYRCQLNVAKGSKRQWMPVKAAYGCCRGTVEPAARIANFDQSDQDPSVHKQLGSQVEKVDGY
RESPVVRRSNELKKSLRRVEASRWKARAVAKVKGQVSSLVIGLNRGFKPFLECIFFRNSCSGWKKMTVGDEIVVLSPSGKLSGLVETNHHCCRVERIKEGFWALRPIYIY
LRFPVISQILVEPAVAKDWTYRPPAHVVTVSQINPLYEDWVVKDQALMTLINATLSPAALAYVVGCTSSKQVWEVLEKHYSSSSRTNIVNLKPDFQSITKKPEDVQIYTL
NRLPSEFNTFRTSMRTRSQSVSFDELHVLLKAEEAVIEKQSKHDDVVTQPAAMFASQPSSNSSQRPNSSGNFGGGRAPSHGNFGRESNGRGGYGYSSGRGNGNSGRSSFS
SPQSSDGQGRVVTLILYLANRFRLYAHLASEYAGDDQVSEKSTGRILFQGPSVNGLYPLSSFSTSVSPSCYVAHVAANKTYSLWHCRLGHPSHTILNSVLRVIGLNTCFV
SPCDSDGISPTVAAPEPVSFTNSPAPSSPRVVLADASDPIPSPAAPETIPNVATITNDLPSTMPLSDASSASTPLSSDPETEPTSYTAASKFPVWLTAMRAKFDALQKQG
TWQLTPLPPGKSEIGCKWVYRVKRHPDGSIARYKTRLVAKGYHQQEGIDFDETFSPVVKKPTVFIILSLAAQYGWQLRQLEVKNGFLHGDLHEDVYMQQPQGFIDSSQPT
YVCKLLKSLYGLKPTPRAWFECFTSYLLTLGFHASQADSSFFVLKLHGAVLYLLLYVDDIVITSSNTSLINSLISKLQSRFDMTDLGHLTYFLGLEVHHSSSGIQISQTK
YARDVLHRFGITNAKTCSTPISLKPSAEKGSPCSSSDIQSFRPLVGALHYLTFSRPDIAFTVSWLSQFMQSPSTIHLVAAKRVLRYISGTISYGLFFRKGNSLRLTAFSD
SDWAGCSIDRRSTTGFVLFLGSNPVSWGAKKQSIVSRSSTEAEYRALASTAAELF
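
The protein sequence above is consistent with a standard-genetic-code translation of the CDS. The Python sketch below is one way to check this; it is not the database's own pipeline. It assumes NCBI translation table 1, and the names `CODON_TABLE` and `translate` are illustrative. It translates the first 42 and last 18 nucleotides of the CDS as listed.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    # Drop whitespace so a line-wrapped sequence can be pasted in directly.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Raises KeyError on ambiguous bases such as N; this sketch does not handle them.
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 42 nt of the CDS and its final 18 nt (including the TAG stop codon).
print(translate("ATGTGTTTGTCGATGGATGTGGCAAGTCTTGTAGCCCATGAG"))
print(translate("GCTGCTGAATTATTTTAG"))
```

Passing the full CDS, with or without line breaks, to `translate` allows a residue-by-residue comparison against the protein sequence listed above.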