; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy06g013670 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy06g013670
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr06:21351142..21370897
RNA-Seq ExpressionLcy06g013670
SyntenyLcy06g013670
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]1.6e-18438.51Show/hide
Query:  KANSRKKVNKICGLYDASGGWVETDEDMERVANNYFQALFQSSSPHLDAIDDILEVTPVCISEVQNRKLLVPFTREEIYGVVKSMHPTKAPGPDGMQAIF
        KA++R++ NKI G+ D  G WV+  E +E     +FQ LF SS+P    I + L+     +S+  N  L  PFT E+I   +  M PTKAPGPDG+ A F
Subjt:  KANSRKKVNKICGLYDASGGWVETDEDMERVANNYFQALFQSSSPHLDAIDDILEVTPVCISEVQNRKLLVPFTREEIYGVVKSMHPTKAPGPDGMQAIF

Query:  YQKYWDVVGRDICDFCLQFLNGEGQIDRINKTYIVLIPKVKELKTMKDFRPISLCSVIYKIISKVLANRLKQVLNHIISPSQSAFVPGRLITDNALVGFE
        +QK+W +VG  +   CL  LN +G +D +N T+I LIPKV++ + + +FRPISLC+V+Y+I++K +ANRLK +LNHIISP+QSAF+P RLITDN ++G+E
Subjt:  YQKYWDVVGRDICDFCLQFLNGEGQIDRINKTYIVLIPKVKELKTMKDFRPISLCSVIYKIISKVLANRLKQVLNHIISPSQSAFVPGRLITDNALVGFE

Query:  CIHAVNSKKKGKSGVVALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSRLL
        C+H +   K  ++G+VALKLD+SKAYDRVEW +L + M+ LGF  +WI+LIM C+ +  F VL+NG P     P+RGLRQG PLSPYLFI+CAE  S LL
Subjt:  CIHAVNSKKKGKSGVVALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSRLL

Query:  NYSESKKLMTSLRINRYCPSVTHLFYADDSLLFFEASENDCINIKKTLETYEKVSGQIINFDKSNFMTSPNTNCDLARKIENILQVSHTESLGQYLGFPS
        N +E ++ +  L+  +   ++THL +ADDSL+F +AS  DC  +K   + Y K SGQI NF+KS+   S   + +    I++I Q+       +YLG P 
Subjt:  NYSESKKLMTSLRINRYCPSVTHLFYADDSLLFFEASENDCINIKKTLETYEKVSGQIINFDKSNFMTSPNTNCDLARKIENILQVSHTESLGQYLGFPS

Query:  QNARSQKEIFNHIKDRVWRVLQGWKGRFFSAAGREILIKSVAQAIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINKRLGGMGF
           R++   F  +K +V   +  W  + FSA G+EILIK+VAQA+P YAMS FK P  LC ++    ARFWWG +  +  IHW  W  +   KR GG+GF
Subjt:  QNARSQKEIFNHIKDRVWRVLQGWKGRFFSAAGREILIKSVAQAIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINKRLGGMGF

Query:  RDISVFNKAMLAKQSWRIIHNPNSLLTRVLRGRYFKNGNFLKAEMGNNPSFTWRSIVWGRELFRKGYRWRIGDGFNV------------DDGPLN-----
        RD+  FN+A++AKQ WR++  PNSL+ RV++ RY+KN  F  A++G+NPSF WRSI+WG ++ +KG RWRIGDG  V               P++     
Subjt:  RDISVFNKAMLAKQSWRIIHNPNSLLTRVLRGRYFKNGNFLKAEMGNNPSFTWRSIVWGRELFRKGYRWRIGDGFNV------------DDGPLN-----

Query:  -----------------------------------------KEDSIIWHHDTKGFFSVKSAYRLGIQVQASNEASISHNGKMEALWDIYWKTPVPPKIRV
                                                 +ED ++WH D KG +SVKS Y+L +     NE   S++     LW I W   +P K+++
Subjt:  -----------------------------------------KEDSIIWHHDTKGFFSVKSAYRLGIQVQASNEASISHNGKMEALWDIYWKTPVPPKIRV

Query:  CRWTIFHDILPTRTNLIKRGMEVNPLCILCKTKLETITHLLWECNQTK-----------------GDVKTDGQADWSEISNRPVDM--------------
          W    +ILPT  NL KR     P+C  CK ++ET++H+L EC   +                  D  +  Q  WS  S    ++              
Subjt:  CRWTIFHDILPTRTNLIKRGMEVNPLCILCKTKLETITHLLWECNQTK-----------------GDVKTDGQADWSEISNRPVDM--------------

Query:  ---DQATPEHREPAPTATNDRQA----------------TIASVRWMPPAMGLWKLNNDASWNEEKKLGGVGWVLRDEKGNLISTGFR
           +    + R  A  A +  +A                 I   +W PP+  + KLN DA+ + + +  G+G ++RD +G +++ G +
Subjt:  ---DQATPEHREPAPTATNDRQA----------------TIASVRWMPPAMGLWKLNNDASWNEEKKLGGVGWVLRDEKGNLISTGFR

XP_023874626.1 uncharacterized protein LOC111987155 [Quercus suber]2.2e-18439.39Show/hide
Query:  ANSRKKVNKICGLYDASGGWVETDEDMERVANNYFQALFQSSSPHLDAIDDILEVTPVCISEVQNRKLLVPFTREEIYGVVKSMHPTKAPGPDGMQAIFY
        A+ R++ N+I GL +  G WVE  E ++R+  +YF  +F+S +P   + D  L      +SE  N  LL  F  EE+   ++ MHPTKAPGPDGM  IFY
Subjt:  ANSRKKVNKICGLYDASGGWVETDEDMERVANNYFQALFQSSSPHLDAIDDILEVTPVCISEVQNRKLLVPFTREEIYGVVKSMHPTKAPGPDGMQAIFY

Query:  QKYWDVVGRDICDFCLQFLNGEGQIDRINKTYIVLIPKVKELKTMKDFRPISLCSVIYKIISKVLANRLKQVLNHIISPSQSAFVPGRLITDNALVGFEC
        QKYW++V  D+ +  L  LN       IN+TYI LIPKV   + + +FRPISLC+V+YKIISKVLANRLK VL  +I  SQSAFVPGR I DN LV FE 
Subjt:  QKYWDVVGRDICDFCLQFLNGEGQIDRINKTYIVLIPKVKELKTMKDFRPISLCSVIYKIISKVLANRLKQVLNHIISPSQSAFVPGRLITDNALVGFEC

Query:  IHAVNSKKKGKSGVVALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSRLLN
        +H +  +KKGK  ++A+KLDMSKAYDRVEW YL  ++ KLGF ++WIAL+M CV +V++ VL+NG P+ +  P RGLRQGDP+SPYLF++CAEGLS +L 
Subjt:  IHAVNSKKKGKSGVVALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSRLLN

Query:  YSESKKLMTSLRINRYCPSVTHLFYADDSLLFFEASENDCINIKKTLETYEKVSGQIINFDKSNFMTSPNTNCDLARKIENILQVSHTESLGQYLGFPSQ
          E    +  + ++R  P V+HL +ADDS++F  AS  DC  + + LE YE  SGQ +N +K++   S NT  ++ R ++        +   +YLG P  
Subjt:  YSESKKLMTSLRINRYCPSVTHLFYADDSLLFFEASENDCINIKKTLETYEKVSGQIINFDKSNFMTSPNTNCDLARKIENILQVSHTESLGQYLGFPSQ

Query:  NARSQKEIFNHIKDRVWRVLQGWKGRFFSAAGREILIKSVAQAIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINKRLGGMGFR
          + +++ FN IKD+V R + GWKG+  S AGREILIK+VAQA P Y MS FK P +LC ELNSM + FWWG ++K  K+ W SW+KLC+ K+ GG+GFR
Subjt:  NARSQKEIFNHIKDRVWRVLQGWKGRFFSAAGREILIKSVAQAIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINKRLGGMGFR

Query:  DISVFNKAMLAKQSWRIIHNPNSLLTRVLRGRYFKNGNFLKAEMGNNPSFTWRSIVWGRELFRKGYRWRIGDG---------------------------
        D+  FN A+LAKQ+WR+  NPNSL+ RV + +YF   +F +A++G  PSF WRSI+  R +  KG RW IG+G                           
Subjt:  DISVFNKAMLAKQSWRIIHNPNSLLTRVLRGRYFKNGNFLKAEMGNNPSFTWRSIVWGRELFRKGYRWRIGDG---------------------------

Query:  ---------------------------------FNVDDGPLNKEDSIIWHHDTKGFFSVKSAYRLGIQVQASNE-----ASISHNGKMEALWDIYWKTPV
                                         F++   P   +DS+IW     G FSV+SAY + ++    +       S S N KM+A+W + W+   
Subjt:  ---------------------------------FNVDDGPLNKEDSIIWHHDTKGFFSVKSAYRLGIQVQASNE-----ASISHNGKMEALWDIYWKTPV

Query:  PPKIRVCRWTIFHDILPTRTNLIKRGMEVNPLCILCKTKLETITHLLWECNQTK---GDVKTD------------------------------GQADWSE
        P KI+   W    DILPT+  L+ RG+     CILC    ET  H+LW C   +    D K                                    WS 
Subjt:  PPKIRVCRWTIFHDILPTRTNLIKRGMEVNPLCILCKTKLETITHLLWECNQTK---GDVKTD------------------------------GQADWSE

Query:  ISNRPV-----------DMDQATPEHREP----APTATNDRQATIASVRWMPPAMGLWKLNNDASWNEEKKLGGVGWVLRDEKGNLISTGFRVLHQQW--
         +NR              + +A  E+ E      P        ++ +  W+PP  G +K+N D +   E    G+G V+R+E+G L+    +++      
Subjt:  ISNRPV-----------DMDQATPEHREP----APTATNDRQATIASVRWMPPAMGLWKLNNDASWNEEKKLGGVGWVLRDEKGNLISTGFRVLHQQW--

Query:  --LEALAVSDGLRL
          +EA AV +G+RL
Subjt:  --LEALAVSDGLRL

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]1.0e-18639.27Show/hide
Query:  KANSRKKVNKICGLYDASGGWVETDEDMERVANNYFQALFQSSSPHLDAIDDILEVTPVCISEVQNRKLLVPFTREEIYGVVKSMHPTKAPGPDGMQAIF
        +A+ R+K N I G++D  G W + +E + + A +YF  ++ SS P    I+++ E  P  ++E  N  L+  FT+EE+   +K +HP KAPGPDGM A+F
Subjt:  KANSRKKVNKICGLYDASGGWVETDEDMERVANNYFQALFQSSSPHLDAIDDILEVTPVCISEVQNRKLLVPFTREEIYGVVKSMHPTKAPGPDGMQAIF

Query:  YQKYWDVVGRDICDFCLQFLNGEGQIDRINKTYIVLIPKVKELKTMKDFRPISLCSVIYKIISKVLANRLKQVLNHIISPSQSAFVPGRLITDNALVGFE
        +QKYW +VG ++ D  L  LN    I  +NKT I LIPK    K M DFRPISLC+V+YK+ISK+LANRLK +L HIIS +QSAF   RLITDN LV FE
Subjt:  YQKYWDVVGRDICDFCLQFLNGEGQIDRINKTYIVLIPKVKELKTMKDFRPISLCSVIYKIISKVLANRLKQVLNHIISPSQSAFVPGRLITDNALVGFE

Query:  CIHAVNSKKKGKSGVVALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSRLL
         +H ++ K  GK G +A+KLDMSKA+DRVEW ++ K+M ++GF +RW  L+MQC+ SVS+ +L+NGV      P RGLRQGDPLSP LF++CAEGLS L+
Subjt:  CIHAVNSKKKGKSGVVALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSRLL

Query:  NYSESKKLMTSLRINRYCPSVTHLFYADDSLLFFEASENDCINIKKTLETYEKVSGQIINFDKSNFMTSPNTNCDLARKIENILQVSHTESLGQYLGFPS
        N +   KL+T + INR CP VTHLF+ADDS+LF +A+  +C  ++  L  YE+ SGQ IN DKS+   SPNT  +   +I NIL         +YLG PS
Subjt:  NYSESKKLMTSLRINRYCPSVTHLFYADDSLLFFEASENDCINIKKTLETYEKVSGQIINFDKSNFMTSPNTNCDLARKIENILQVSHTESLGQYLGFPS

Query:  QNARSQKEIFNHIKDRVWRVLQGWKGRFFSAAGREILIKSVAQAIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINKRLGGMGF
           RS+ ++F  +K++V   L GWKG+  S  G+EILIK+VAQAIP Y MSCF  P  LC+++  M   FWWG  ++ +K+ W SWK++C +K  GG+GF
Subjt:  QNARSQKEIFNHIKDRVWRVLQGWKGRFFSAAGREILIKSVAQAIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINKRLGGMGF

Query:  RDISVFNKAMLAKQSWRIIHNPNSLLTRVLRGRYFKNGNFLKAEMGNNPSFTWRSIVWGRELFRKGYRWRIGDG--------------------------
        R++  FN AMLAKQ+WRI++NPNSL+ RVL+ RYF  G+ L A++G++PS++WRSI    E+ R+G RWR+G+G                          
Subjt:  RDISVFNKAMLAKQSWRIIHNPNSLLTRVLRGRYFKNGNFLKAEMGNNPSFTWRSIVWGRELFRKGYRWRIGDG--------------------------

Query:  ----------------------------FNVDD---GPLN---KEDSIIWHHDTKGFFSVKSAYRLGIQVQASNEASISHNG-KMEALWDIYWKTPVPPK
                                    F V+     PL+    ED +IW  + KG FSVKSAY +   +   NE     NG     LW   W   +P K
Subjt:  ----------------------------FNVDD---GPLN---KEDSIIWHHDTKGFFSVKSAYRLGIQVQASNEASISHNG-KMEALWDIYWKTPVPPK

Query:  IRVCRWTIFHDILPTRTNLIKRGMEVNPLCILCKTKLETITHLLWECNQTK------GDVKTDGQA---------------------------DWSEISN
        I++  W    D LPT  N+ KRG+  +  C +C    E + H L  C           D     Q+                            W+   N
Subjt:  IRVCRWTIFHDILPTRTNLIKRGMEVNPLCILCKTKLETITHLLWECNQTK------GDVKTDGQA---------------------------DWSEISN

Query:  RPVDMDQATPEHREPAPTATND------RQATI-------ASVRWMPPAMGLWKLNNDASWNEEKKLGGVGWVLRDEKGNLISTGFRVLHQQW----LEA
        R   +   +P          N+      + A++       + +RW  P +G++K+N D + +++ +   +G ++RD  G +++   + L  ++    +EA
Subjt:  RPVDMDQATPEHREPAPTATND------RQATI-------ASVRWMPPAMGLWKLNNDASWNEEKKLGGVGWVLRDEKGNLISTGFRVLHQQW----LEA

Query:  LAVSDGLRL
        LA+  G+ L
Subjt:  LAVSDGLRL

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]8.2e-19240.53Show/hide
Query:  KANSRKKVNKICGLYDASGGWVETDEDMERVANNYFQALFQSSSPHLDAIDDILEVTPVCISEVQNRKLLVPFTREEIYGVVKSMHPTKAPGPDGMQAIF
        KA+ R++ N I G+ D +G W ++ E + +VA +YFQ ++ SS P    I ++L+  P  ++E  N  L+  FTREEI   +  MHPTKAPGPDGM AIF
Subjt:  KANSRKKVNKICGLYDASGGWVETDEDMERVANNYFQALFQSSSPHLDAIDDILEVTPVCISEVQNRKLLVPFTREEIYGVVKSMHPTKAPGPDGMQAIF

Query:  YQKYWDVVGRDICDFCLQFLNGEGQIDRINKTYIVLIPKVKELKTMKDFRPISLCSVIYKIISKVLANRLKQVLNHIISPSQSAFVPGRLITDNALVGFE
        +QKYW++VG DI    L  LN    +  INKT I L+PK+K    M DFRPISLC+V+YK+ISKVLANRLK +L  IIS +QSAF+ GRLITDN LV FE
Subjt:  YQKYWDVVGRDICDFCLQFLNGEGQIDRINKTYIVLIPKVKELKTMKDFRPISLCSVIYKIISKVLANRLKQVLNHIISPSQSAFVPGRLITDNALVGFE

Query:  CIHAVNSKKKGKSGVVALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSRLL
         +H +  KK+GK G  A+KLDMSKAYDRVEW +++++M K+GF ++WI L+M C+ SVS+ +L+NG      +P RGLRQGDP+SPY+F++CA+G S LL
Subjt:  CIHAVNSKKKGKSGVVALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSRLL

Query:  NYSESKKLMTSLRINRYCPSVTHLFYADDSLLFFEASENDCINIKKTLETYEKVSGQIINFDKSNFMTSPNT----NCDLARKIENILQVSHTESLGQYL
        N    K  ++ + I R CP +THLF+ADDSLLF +A+  +C  +   L+ YE  SGQ IN DKS+   S NT     C++ R + ++    H     +YL
Subjt:  NYSESKKLMTSLRINRYCPSVTHLFYADDSLLFFEASENDCINIKKTLETYEKVSGQIINFDKSNFMTSPNT----NCDLARKIENILQVSHTESLGQYL

Query:  GFPSQNARSQKEIFNHIKDRVWRVLQGWKGRFFSAAGREILIKSVAQAIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINKRLG
        G PS   +S+ EIF  +K+RV R L GWK +  S  GREILIK+VAQAIP Y MSCF+ P +LC E+ +M  RFWWG   + SKI W SWKKLC  K+ G
Subjt:  GFPSQNARSQKEIFNHIKDRVWRVLQGWKGRFFSAAGREILIKSVAQAIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINKRLG

Query:  GMGFRDISVFNKAMLAKQSWRIIHNPNSLLTRVLRGRYFKNGNFLKAEMGNNPSFTWRSIVWGRELFRKGYRWRIGDGFNV-------------------
        GMGFR++  FN AMLAKQ WR+I NPNSL+ ++ + RY+ +G+  +A++G +PS+TWRSI  G E+ R+G RWR+G+G  +                   
Subjt:  GMGFRDISVFNKAMLAKQSWRIIHNPNSLLTRVLRGRYFKNGNFLKAEMGNNPSFTWRSIVWGRELFRKGYRWRIGDGFNV-------------------

Query:  ----DDGPLNK-------------------------------------EDSIIWHHDTKGFFSVKSAYRLGIQVQASNEASISHNGKMEA-LWDIYWKTP
            DD P                                        ED IIW  + KG FSVKSAY + + V  + E   S +G   + LW   W   
Subjt:  ----DDGPLNK-------------------------------------EDSIIWHHDTKGFFSVKSAYRLGIQVQASNEASISHNGKMEA-LWDIYWKTP

Query:  VPPKIRVCRWTIFHDILPTRTNLIKRGMEVNPLCILCKTKLETITHLLWECNQTKGDVK--TDGQADWSEISNRPVD-----MDQATPEHRE--------
        +PPK+R+  W +  + LPT  NL+++G+ +  +C  C  + E+  H+  +C   K   +   D  AD   ++   VD     +D  TP   E        
Subjt:  VPPKIRVCRWTIFHDILPTRTNLIKRGMEVNPLCILCKTKLETITHLLWECNQTKGDVK--TDGQADWSEISNRPVD-----MDQATPEHRE--------

Query:  -------------------------------PAPTATNDRQATIASVRWMPPAMGLWKLNNDASWNEEKKLGGVGWVLRDEKGNLISTGFRVLHQQW---
                                          +AT  +    +  +WM P  G++K+N D + +E  +   VG ++RD  G++ +     L  Q+   
Subjt:  -------------------------------PAPTATNDRQATIASVRWMPPAMGLWKLNNDASWNEEKKLGGVGWVLRDEKGNLISTGFRVLHQQW---

Query:  -LEALAVSDGLRL
         +EALA+  GL L
Subjt:  -LEALAVSDGLRL

XP_023901742.1 uncharacterized protein LOC112013579 [Quercus suber]3.3e-18540.09Show/hide
Query:  KANSRKKVNKICGLYDASGGWVETDEDMERVANNYFQALFQSSSPHLDAIDDILEVTPVCISEVQNRKLLVPFTREEIYGVVKSMHPTKAPGPDGMQAIF
        +A+ R+K N I  L++  G W ++ E +   A +YF+ ++ SSSP    I++++   P  +++  N +L   FT EE+   +K +HPTKAPGPDGM A F
Subjt:  KANSRKKVNKICGLYDASGGWVETDEDMERVANNYFQALFQSSSPHLDAIDDILEVTPVCISEVQNRKLLVPFTREEIYGVVKSMHPTKAPGPDGMQAIF

Query:  YQKYWDVVGRDICDFCLQFLNGEGQIDRINKTYIVLIPKVKELKTMKDFRPISLCSVIYKIISKVLANRLKQVLNHIISPSQSAFVPGRLITDNALVGFE
        +  YWD+VG  I +  L  LN    +  INKT I LIPK  +   M +FRPISLC+  YKIISKVLANR K +L +IIS +QSAF P RLITDN LV FE
Subjt:  YQKYWDVVGRDICDFCLQFLNGEGQIDRINKTYIVLIPKVKELKTMKDFRPISLCSVIYKIISKVLANRLKQVLNHIISPSQSAFVPGRLITDNALVGFE

Query:  CIHAVNSKKKGKSGVVALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSRLL
         +H +N K +GK   +++KLDMSKA+DRVEW +++ +M KLGF ++WI LIM CV SVS+ VL+NG      +P RG+RQGDPLSP LF++CAEGLS L+
Subjt:  CIHAVNSKKKGKSGVVALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSRLL

Query:  NYSESKKLMTSLRINRYCPSVTHLFYADDSLLFFEASENDCINIKKTLETYEKVSGQIINFDKSNFMTSPNTNCDLARKIENILQVSHTESLGQYLGFPS
        + +   + +  + I R CP +THLF+ADDSLLF +A E +C  +   L  YE+ SGQ IN DKS+   SPNT+ +L   I NIL         +YLG PS
Subjt:  NYSESKKLMTSLRINRYCPSVTHLFYADDSLLFFEASENDCINIKKTLETYEKVSGQIINFDKSNFMTSPNTNCDLARKIENILQVSHTESLGQYLGFPS

Query:  QNARSQKEIFNHIKDRVWRVLQGWKGRFFSAAGREILIKSVAQAIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINKRLGGMGF
           +S+ ++F  +KDRV + L GWKG+  S  GREILIK+VAQA+P Y MSCF+ P +LC +L S+   FWWG +DK +KI W SW+K+C +K  GGMGF
Subjt:  QNARSQKEIFNHIKDRVWRVLQGWKGRFFSAAGREILIKSVAQAIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINKRLGGMGF

Query:  RDISVFNKAMLAKQSWRIIHNPNSLLTRVLRGRYFKNGNFLKAEMGNNPSFTWRSIVWGRELFRKGYRWRIGDGF---------------------NVDD
        R+I  FN AMLAKQ WRI+ NPNSL+ RV + +YF   + L ++ G+NPS+ WRSI    ++ RKG RWR+G+G                       VD 
Subjt:  RDISVFNKAMLAKQSWRIIHNPNSLLTRVLRGRYFKNGNFLKAEMGNNPSFTWRSIVWGRELFRKGYRWRIGDGF---------------------NVDD

Query:  G------------------------------------PLN---KEDSIIWHHDTKGFFSVKSAYRLGIQVQASNEASISHNG-KMEALWDIYWKTPVPPK
        G                                    PL+    ED +IW  + +G F+VKSAY +   +  S E   S +G     LW   W+  VPPK
Subjt:  G------------------------------------PLN---KEDSIIWHHDTKGFFSVKSAYRLGIQVQASNEASISHNG-KMEALWDIYWKTPVPPK

Query:  IRVCRWTIFHDILPTRTNLIKRGMEVNPLCILCKTKLETITHLLWECNQTK--------GDVKTDGQADWSEIS--------------------------
        I++  W    + LPT  NL  RG+  +  C LC   +ETITH L  C   K          V      D  EI+                          
Subjt:  IRVCRWTIFHDILPTRTNLIKRGMEVNPLCILCKTKLETITHLLWECNQTK--------GDVKTDGQADWSEIS--------------------------

Query:  NRPVDMDQATP--EHREPAPTATNDRQA---------TIASVRWMPPAMGLWKLNNDASWNEEKKLGGVGWVLRDEKGNLISTGFRVLHQQW----LEAL
        N+ +  D  +P  +  E A     + +A          +   RW  P  G +K+N DA+  ++++   +G V+RD +G +++   +VL   +     EAL
Subjt:  NRPVDMDQATP--EHREPAPTATNDRQA---------TIASVRWMPPAMGLWKLNNDASWNEEKKLGGVGWVLRDEKGNLISTGFRVLHQQW----LEAL

Query:  AVSDGLRL
        A+ +G+ L
Subjt:  AVSDGLRL

TrEMBL top hitse value%identityAlignment
A0A2N9FNH6 Reverse transcriptase domain-containing protein2.3e-19245.83Show/hide
Query:  ANSRKKVNKICGLYDASGGWVETDEDMERVANNYFQALFQSSSPHLDAIDDILEVTPVCISEVQNRKLLVPFTREEIYGVVKSMHPTKAPGPDGMQAIFY
        A+ RKK N I GL DA+G        M  +   YF  +F++S+P   AI  ++      +++  N  LL PFT EEI   +  MHPTKAPGPDGM A+FY
Subjt:  ANSRKKVNKICGLYDASGGWVETDEDMERVANNYFQALFQSSSPHLDAIDDILEVTPVCISEVQNRKLLVPFTREEIYGVVKSMHPTKAPGPDGMQAIFY

Query:  QKYWDVVGRDICDFCLQFLNGEGQIDRINKTYIVLIPKVKELKTMKDFRPISLCSVIYKIISKVLANRLKQVLNHIISPSQSAFVPGRLITDNALVGFEC
        QK+W +VG D+ +  L+FL+    +  +N T+I LIPK+   + M  FRPISLC+V+YKIISKVLANRLK VL+HIIS +QSAFVPGRLITDN LV FE 
Subjt:  QKYWDVVGRDICDFCLQFLNGEGQIDRINKTYIVLIPKVKELKTMKDFRPISLCSVIYKIISKVLANRLKQVLNHIISPSQSAFVPGRLITDNALVGFEC

Query:  IHAVNSKKKGKSGVVALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSRLLN
        +H + +K+KG+S  +A+KLDMSKAYDRVEW +L  +M KLGF+ RW+ LIMQC+ SVS+ V+LNG P     P RG+RQGDPLSPYLF++CAEGL+ LL 
Subjt:  IHAVNSKKKGKSGVVALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSRLLN

Query:  YSESKKLMTSLRINRYCPSVTHLFYADDSLLFFEASENDCINIKKTLETYEKVSGQIINFDKSNFMTSPNTNCDLARKIENILQVSHTESLGQYLGFPSQ
         +E   ++  L I R  P ++HLF+ADDSLLF  A+  +C N+   L+TYE+ SGQ +N +K++   S NT+ DL   I  +L+ S T  LG+YLG P  
Subjt:  YSESKKLMTSLRINRYCPSVTHLFYADDSLLFFEASENDCINIKKTLETYEKVSGQIINFDKSNFMTSPNTNCDLARKIENILQVSHTESLGQYLGFPSQ

Query:  NARSQKEIFNHIKDRVWRVLQGWKGRFFSAAGREILIKSVAQAIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINKRLGGMGFR
          R +K+ F  IK ++ + L GWKG+  S AGREILIKSVAQAIP Y MSCF+ P +LC+E+NSM ++FWWG + +  KIHW+ W  +C  K  GGMGFR
Subjt:  NARSQKEIFNHIKDRVWRVLQGWKGRFFSAAGREILIKSVAQAIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINKRLGGMGFR

Query:  DISVFNKAMLAKQSWRIIHNPNSLLTRVLRGRYFKNGNFLKAEMGNNPSFTWRSIVWGRELFRKGYRWRIGDGFNVD------DGPLNKE----------
        D+++FN+A+LAKQ WR++ +PN+LL R+L+ +YF N +F++A++ ++ SF WRSI   R + RKG RWRIG+G  V+        P N            
Subjt:  DISVFNKAMLAKQSWRIIHNPNSLLTRVLRGRYFKNGNFLKAEMGNNPSFTWRSIVWGRELFRKGYRWRIGDGFNVD------DGPLNKE----------

Query:  ---------------------DSIIW--------------HH--------DTKGFFSVKSAYRLGIQVQASNEASISHNGKMEALWDIYWKTPVPPKIRV
                             DSI W              HH           G F+ +SAY L ++ +   + S S   ++ A W   W+  VP KI+ 
Subjt:  ---------------------DSIIW--------------HH--------DTKGFFSVKSAYRLGIQVQASNEASISHNGKMEALWDIYWKTPVPPKIRV

Query:  CRWTIFHDILPTRTNLIKRGMEVNPLCILCKTKLETITHLLWEC
          W     ILPT+TNL +RG+  +  C +C    ET+ H LW+C
Subjt:  CRWTIFHDILPTRTNLIKRGMEVNPLCILCKTKLETITHLLWEC

A0A2N9GPZ7 Reverse transcriptase domain-containing protein1.5e-18643.76Show/hide
Query:  KANSRKKVNKICGLYDASGGWVETDEDMERVANNYFQALFQSSSPHLDAIDDILEVTPVCISEVQNRKLLVPFTREEIYGVVKSMHPTKAPGPDGMQAIF
        + N R++ N I GL D  G W      +  +A +YFQ +F SS+P  ++I  +L+     ++   N +L   FT++E+   +K M+PTKAPGPDGM AIF
Subjt:  KANSRKKVNKICGLYDASGGWVETDEDMERVANNYFQALFQSSSPHLDAIDDILEVTPVCISEVQNRKLLVPFTREEIYGVVKSMHPTKAPGPDGMQAIF

Query:  YQKYWDVVGRDICDFCLQFLNGEGQIDRINKTYIVLIPKVKELKTMKDFRPISLCSVIYKIISKVLANRLKQVLNHIISPSQSAFVPGRLITDNALVGFE
        YQ YWD+VG ++    L  L+    + +IN T+I LIPKVK  + + DFRPISLC+VIYKI+SKVLANRLK+VL  +IS +QSAFVPGRLITDN LV FE
Subjt:  YQKYWDVVGRDICDFCLQFLNGEGQIDRINKTYIVLIPKVKELKTMKDFRPISLCSVIYKIISKVLANRLKQVLNHIISPSQSAFVPGRLITDNALVGFE

Query:  CIHAVNSKKKGKSGVVALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSRLL
         +H+++ K+KGK G +ALKLDMSKAYDRVEWV+L  +M  +GF   WI L+M C+ SVS+ VL+NG     F+  RG+RQGD LSPYLF++CAEGLS LL
Subjt:  CIHAVNSKKKGKSGVVALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSRLL

Query:  NYSESKKLMTSLRINRYCPSVTHLFYADDSLLFFEASENDCINIKKTLETYEKVSGQIINFDKSNFMTSPNTNCDLARKIENILQVSHTESLGQYLGFPS
          +   K +T +  +R  P +THLF+ADDSLLF +A+  +C  +   L+ YE  SGQ +N  K++   + +T+  + R+I++  QV   +S  +YLG PS
Subjt:  NYSESKKLMTSLRINRYCPSVTHLFYADDSLLFFEASENDCINIKKTLETYEKVSGQIINFDKSNFMTSPNTNCDLARKIENILQVSHTESLGQYLGFPS

Query:  QNARSQKEIFNHIKDRVWRVLQGWKGRFFSAAGREILIKSVAQAIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINKRLGGMGF
           RS+   F  IK RVWR + GWK +F S AGRE+LIK+VAQ+IP Y+MSCFK P SLCN+LN+M + FWWG  DK  K HW  W KLC +K  GG+GF
Subjt:  QNARSQKEIFNHIKDRVWRVLQGWKGRFFSAAGREILIKSVAQAIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINKRLGGMGF

Query:  RDISVFNKAMLAKQSWRIIHNPNSLLTRVLRGRYFKNGNFLKAEMGNNPSFTWRSIVWGRELFRKGYRWRIGDG--------------------------
        RD+  FN A+LAKQ WR + + NSL+ RV + +YF  G+F+ A +GN PS+ WRSI   R++ R G +W IGDG                          
Subjt:  RDISVFNKAMLAKQSWRIIHNPNSLLTRVLRGRYFKNGNFLKAEMGNNPSFTWRSIVWGRELFRKGYRWRIGDG--------------------------

Query:  ----------------FNVDD------------------GPLNKEDSIIWHHDTKGFFSVKSAYRLGIQVQASN-EASISHNGKMEALWDIYWKTPVPPK
                        +NVD                    P  K D + W+    G F+VKSAY L ++ +A+      S  GK    W   W   +PPK
Subjt:  ----------------FNVDD------------------GPLNKEDSIIWHHDTKGFFSVKSAYRLGIQVQASN-EASISHNGKMEALWDIYWKTPVPPK

Query:  IRVCRWTIFHDILPTRTNLIKRGMEVNPLCILCKTKLETITHLLWECNQTKGDVKTDGQADWSEISNRPVDMDQATP
        ++V  W     ILPT   L  R M  N LC  C   +E+  H LW C     DV       W+E S +    D+  P
Subjt:  IRVCRWTIFHDILPTRTNLIKRGMEVNPLCILCKTKLETITHLLWECNQTKGDVKTDGQADWSEISNRPVDMDQATP

A0A2N9IPS8 Reverse transcriptase domain-containing protein1.5e-18643.76Show/hide
Query:  KANSRKKVNKICGLYDASGGWVETDEDMERVANNYFQALFQSSSPHLDAIDDILEVTPVCISEVQNRKLLVPFTREEIYGVVKSMHPTKAPGPDGMQAIF
        + N R++ N I GL D  G W      +  +A +YFQ +F SS+P  ++I  +L+     ++   N +L   FT++E+   +K M+PTKAPGPDGM AIF
Subjt:  KANSRKKVNKICGLYDASGGWVETDEDMERVANNYFQALFQSSSPHLDAIDDILEVTPVCISEVQNRKLLVPFTREEIYGVVKSMHPTKAPGPDGMQAIF

Query:  YQKYWDVVGRDICDFCLQFLNGEGQIDRINKTYIVLIPKVKELKTMKDFRPISLCSVIYKIISKVLANRLKQVLNHIISPSQSAFVPGRLITDNALVGFE
        YQ YWD+VG ++    L  L+    + +IN T+I LIPKVK  + + DFRPISLC+VIYKI+SKVLANRLK+VL  +IS +QSAFVPGRLITDN LV FE
Subjt:  YQKYWDVVGRDICDFCLQFLNGEGQIDRINKTYIVLIPKVKELKTMKDFRPISLCSVIYKIISKVLANRLKQVLNHIISPSQSAFVPGRLITDNALVGFE

Query:  CIHAVNSKKKGKSGVVALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSRLL
         +H+++ K+KGK G +ALKLDMSKAYDRVEWV+L  +M  +GF   WI L+M C+ SVS+ VL+NG     F+  RG+RQGD LSPYLF++CAEGLS LL
Subjt:  CIHAVNSKKKGKSGVVALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSRLL

Query:  NYSESKKLMTSLRINRYCPSVTHLFYADDSLLFFEASENDCINIKKTLETYEKVSGQIINFDKSNFMTSPNTNCDLARKIENILQVSHTESLGQYLGFPS
          +   K +T +  +R  P +THLF+ADDSLLF +A+  +C  +   L+ YE  SGQ +N  K++   + +T+  + R+I++  QV   +S  +YLG PS
Subjt:  NYSESKKLMTSLRINRYCPSVTHLFYADDSLLFFEASENDCINIKKTLETYEKVSGQIINFDKSNFMTSPNTNCDLARKIENILQVSHTESLGQYLGFPS

Query:  QNARSQKEIFNHIKDRVWRVLQGWKGRFFSAAGREILIKSVAQAIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINKRLGGMGF
           RS+   F  IK RVWR + GWK +F S AGRE+LIK+VAQ+IP Y+MSCFK P SLCN+LN+M + FWWG  DK  K HW  W KLC +K  GG+GF
Subjt:  QNARSQKEIFNHIKDRVWRVLQGWKGRFFSAAGREILIKSVAQAIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINKRLGGMGF

Query:  RDISVFNKAMLAKQSWRIIHNPNSLLTRVLRGRYFKNGNFLKAEMGNNPSFTWRSIVWGRELFRKGYRWRIGDG--------------------------
        RD+  FN A+LAKQ WR + + NSL+ RV + +YF  G+F+ A +GN PS+ WRSI   R++ R G +W IGDG                          
Subjt:  RDISVFNKAMLAKQSWRIIHNPNSLLTRVLRGRYFKNGNFLKAEMGNNPSFTWRSIVWGRELFRKGYRWRIGDG--------------------------

Query:  ----------------FNVDD------------------GPLNKEDSIIWHHDTKGFFSVKSAYRLGIQVQASN-EASISHNGKMEALWDIYWKTPVPPK
                        +NVD                    P  K D + W+    G F+VKSAY L ++ +A+      S  GK    W   W   +PPK
Subjt:  ----------------FNVDD------------------GPLNKEDSIIWHHDTKGFFSVKSAYRLGIQVQASN-EASISHNGKMEALWDIYWKTPVPPK

Query:  IRVCRWTIFHDILPTRTNLIKRGMEVNPLCILCKTKLETITHLLWECNQTKGDVKTDGQADWSEISNRPVDMDQATP
        ++V  W     ILPT   L  R M  N LC  C   +E+  H LW C     DV       W+E S +    D+  P
Subjt:  IRVCRWTIFHDILPTRTNLIKRGMEVNPLCILCKTKLETITHLLWECNQTKGDVKTDGQADWSEISNRPVDMDQATP

A0A2N9J3U0 Reverse transcriptase domain-containing protein1.6e-19342.69Show/hide
Query:  ANSRKKVNKICGLYDASGGWVETDEDMERVANNYFQALFQSSSPHLDAIDDILEVTPVCISEVQNRKLLVPFTREEIYGVVKSMHPTKAPGPDGMQAIFY
        A+ RKK N I GL DA+G        M  +   YF  +F++S+P   AI  ++      +++  N  LL PFT EEI   +  MHPTKAPGPDGM A+FY
Subjt:  ANSRKKVNKICGLYDASGGWVETDEDMERVANNYFQALFQSSSPHLDAIDDILEVTPVCISEVQNRKLLVPFTREEIYGVVKSMHPTKAPGPDGMQAIFY

Query:  QKYWDVVGRDICDFCLQFLNGEGQIDRINKTYIVLIPKVKELKTMKDFRPISLCSVIYKIISKVLANRLKQVLNHIISPSQSAFVPGRLITDNALVGFEC
        QK+W +VG D+ +  L+FL+    +  +N T+I LIPK+   + M  FRPISLC+V+YKIISKVLANRLK VL+HIIS +QSAFVPGRLITDN LV FE 
Subjt:  QKYWDVVGRDICDFCLQFLNGEGQIDRINKTYIVLIPKVKELKTMKDFRPISLCSVIYKIISKVLANRLKQVLNHIISPSQSAFVPGRLITDNALVGFEC

Query:  IHAVNSKKKGKSGVVALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSRLLN
        +H + +K+KG+S  +A+KLDMSKAYDRVEW +L  +M KLGF+ RW+ LIMQC+ SVS+ V+LNG P     P RG+RQGDPLSPYLF++CAEGL+ LL 
Subjt:  IHAVNSKKKGKSGVVALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSRLLN

Query:  YSESKKLMTSLRINRYCPSVTHLFYADDSLLFFEASENDCINIKKTLETYEKVSGQIINFDKSNFMTSPNTNCDLARKIENILQVSHTESLGQYLGFPSQ
         +E   ++  L I R  P ++HLF+ADDSLLF  A+  +C N+   L+TYE+ SGQ +N +K++   S NT+ DL   I  +L+ S T  LG+YLG P  
Subjt:  YSESKKLMTSLRINRYCPSVTHLFYADDSLLFFEASENDCINIKKTLETYEKVSGQIINFDKSNFMTSPNTNCDLARKIENILQVSHTESLGQYLGFPSQ

Query:  NARSQKEIFNHIKDRVWRVLQGWKGRFFSAAGREILIKSVAQAIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINKRLGGMGFR
          R +K+ F  IK ++ + L GWKG+  S AGREILIKSVAQAIP Y MSCF+ P +LC+E+NSM ++FWWG + +  KIHW+ W  +C  K  GGMGFR
Subjt:  NARSQKEIFNHIKDRVWRVLQGWKGRFFSAAGREILIKSVAQAIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINKRLGGMGFR

Query:  DISVFNKAMLAKQSWRIIHNPNSLLTRVLRGRYFKNGNFLKAEMGNNPSFTWRSIVWGRELFRKGYRWRIGDGFNVD------DGPLNKE----------
        D+++FN+A+LAKQ WR++ +PN+LL R+L+ +YF N +F++A++ ++ SF WRSI   R + RKG RWRIG+G  V+        P N            
Subjt:  DISVFNKAMLAKQSWRIIHNPNSLLTRVLRGRYFKNGNFLKAEMGNNPSFTWRSIVWGRELFRKGYRWRIGDGFNVD------DGPLNKE----------

Query:  ---------------------DSIIW--------------HH--------DTKGFFSVKSAYRLGIQVQASNEASISHNGKMEALWDIYWKTPVPPKIRV
                             DSI W              HH           G F+ +SAY L ++ +   + S S   ++ A W   W+  VP KI+ 
Subjt:  ---------------------DSIIW--------------HH--------DTKGFFSVKSAYRLGIQVQASNEASISHNGKMEALWDIYWKTPVPPKIRV

Query:  CRWTIFHDILPTRTNLIKRGMEVNPLCILCKTKLETITHLLWECNQTKGDVKTDGQADWSEISNRPVDMDQATPEHREPAPTATNDRQATIASVRWMPPA
          W     ILPT+TNL +RG+  +  C +C    ET+ H LW+C           Q  W   S  P+ +    P   +           T     +   A
Subjt:  CRWTIFHDILPTRTNLIKRGMEVNPLCILCKTKLETITHLLWECNQTKGDVKTDGQADWSEISNRPVDMDQATPEHREPAPTATNDRQATIASVRWMPPA

Query:  MGLWKLNNDASWNEEKKLGGVGWVLRDEKGNLIS
          +W  + +A+W    +   +  ++RD KG L++
Subjt:  MGLWKLNNDASWNEEKKLGGVGWVLRDEKGNLIS

A0A7N2L6Z9 Reverse transcriptase domain-containing protein3.8e-18740.33Show/hide
Query:  KANSRKKVNKICGLYDASGGWVETDEDMERVANNYFQALFQSSSPHLDAIDDILEVTPVCISEVQNRKLLVPFTREEIYGVVKSMHPTKAPGPDGMQAIF
        +A+ R+K N I G++D  G W E  + +   A  YF+ ++ +S+P +  +D++    P  I+E  N +L   FTREEI   +K +HPTK+PGPDGM AIF
Subjt:  KANSRKKVNKICGLYDASGGWVETDEDMERVANNYFQALFQSSSPHLDAIDDILEVTPVCISEVQNRKLLVPFTREEIYGVVKSMHPTKAPGPDGMQAIF

Query:  YQKYWDVVGRDICDFCLQFLNGEGQIDRINKTYIVLIPKVKELKTMKDFRPISLCSVIYKIISKVLANRLKQVLNHIISPSQSAFVPGRLITDNALVGFE
        +QKYWD+VG ++ +  L  LN    +D INKT IVLIPK    K M DFRPISLC+VIYK+ISK LANRLK  L  II+ +QSAF   RLITDN L+ +E
Subjt:  YQKYWDVVGRDICDFCLQFLNGEGQIDRINKTYIVLIPKVKELKTMKDFRPISLCSVIYKIISKVLANRLKQVLNHIISPSQSAFVPGRLITDNALVGFE

Query:  CIHAVNSKKKGKSGVVALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSRLL
         +H +  KK GK   +A KLDMSKA+DRVEW ++ ++M K+GF + WI+LIM+C+ SVS+ V++NG       P RGLRQGDPLSPYLF++CAEGLS LL
Subjt:  CIHAVNSKKKGKSGVVALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSRLL

Query:  NYSESKKLMTSLRINRYCPSVTHLFYADDSLLFFEASENDCINIKKTLETYEKVSGQIINFDKSNFMTSPNTNCDLARKIENILQVSHTESLGQYLGFPS
        + +   +L+  + + R CP +THLF+ADDSLLF +A+  +C  +K+ LE YE  SGQ +N DKS+   SPNT  +L   I NIL         +YLG PS
Subjt:  NYSESKKLMTSLRINRYCPSVTHLFYADDSLLFFEASENDCINIKKTLETYEKVSGQIINFDKSNFMTSPNTNCDLARKIENILQVSHTESLGQYLGFPS

Query:  QNARSQKEIFNHIKDRVWRVLQGWKGRFFSAAGREILIKSVAQAIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINKRLGGMGF
           RS+K +F  IK+RV   L GWKG+  S+ G+EILIK+VAQAIP Y MSCF  P SLC+EL  M   FWWG +++ SK+ W SW+K+C  K LGG+GF
Subjt:  QNARSQKEIFNHIKDRVWRVLQGWKGRFFSAAGREILIKSVAQAIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINKRLGGMGF

Query:  RDISVFNKAMLAKQSWRIIHNPNSLLTRVLRGRYFKNGNFLKAEMGNNPSFTWRSIVWGRELFRKGYRWRIGDGFNV-----------------------
        R++  FN A+LAKQ+WRI+ NP SL  R+L+ +YF  G+ L A +G+NPS+TWRSI    E+ +KG RWR+G+G  +                       
Subjt:  RDISVFNKAMLAKQSWRIIHNPNSLLTRVLRGRYFKNGNFLKAEMGNNPSFTWRSIVWGRELFRKGYRWRIGDGFNV-----------------------

Query:  DDGPLNK-------------------------------------EDSIIWHHDTKGFFSVKSAYRLGIQVQASNE-ASISHNGKMEALWDIYWKTPVPPK
        +D P+                                       +D IIW  + KG FSVKSAY + + +  S E    S       LW   WK  +P K
Subjt:  DDGPLNK-------------------------------------EDSIIWHHDTKGFFSVKSAYRLGIQVQASNE-ASISHNGKMEALWDIYWKTPVPPK

Query:  IRVCRWTIFHDILPTRTNLIKRGMEVNPLCILCKTKLETITHLLWECN-----------------QTKGDVK-------------------TDGQADWSE
        +++  W    + LPT  N+  RG+  N  C +C  ++E + H L  C+                     D+K                       A W  
Subjt:  IRVCRWTIFHDILPTRTNLIKRGMEVNPLCILCKTKLETITHLLWECN-----------------QTKGDVK-------------------TDGQADWSE

Query:  ISNRPVDMDQATP-EHREPAPTATNDRQATI--------ASVR-WMPPAMGLWKLNNDASWN-EEKKLGGVGWVLRDEKGNLISTGFRVLHQ----QWLE
         + R  D D  +P +  E A    +D    I        +++R W  P  G++K+N D + + +     GVG V+RDE G +I+   ++L      +W E
Subjt:  ISNRPVDMDQATP-EHREPAPTATNDRQATI--------ASVR-WMPPAMGLWKLNNDASWN-EEKKLGGVGWVLRDEKGNLISTGFRVLHQ----QWLE

Query:  ALAVSDGLRL
          A+  GL L
Subjt:  ALAVSDGLRL

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein2.7e-4427.1Show/hide
Query:  RKKVNKICGLYDASGGWVETDEDMERVANNYFQALFQSSSPHLDAIDDILEV-TPVCISEVQNRKLLVPFTREEIYGVVKSMHPTKAPGPDGMQAIFYQK
        +++ N+I  + +  G       +++     Y++ L+ +   +L+ +D  L+  T   +++ +   L  P T  EI  ++ S+   K+PGPDG  A FYQ+
Subjt:  RKKVNKICGLYDASGGWVETDEDMERVANNYFQALFQSSSPHLDAIDDILEV-TPVCISEVQNRKLLVPFTREEIYGVVKSMHPTKAPGPDGMQAIFYQK

Query:  YWDVVGRDICDFCL---QFLNGEGQI-DRINKTYIVLIPKVKELKTMKD-FRPISLCSVIYKIISKVLANRLKQVLNHIISPSQSAFVPGRLITDNALVG
        Y +    ++  F L   Q +  EG + +   +  I+LIPK     T K+ FRPISL ++  KI++K+LANR++Q +  +I   Q  F+PG     N    
Subjt:  YWDVVGRDICDFCL---QFLNGEGQI-DRINKTYIVLIPKVKELKTMKD-FRPISLCSVIYKIISKVLANRLKQVLNHIISPSQSAFVPGRLITDNALVG

Query:  FECIHAVNSKKKGKSGVVALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSR
           I  +N + K K+ V+ + +D  KA+D+++  ++ K + KLG +  ++ +I    +  +  ++LNG     F  K G RQG PLSP LF +  E L+R
Subjt:  FECIHAVNSKKKGKSGVVALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSR

Query:  LLNYSESKKLMTSLRINRYCPSVTHLFYADDSLLFFEASENDCINIKKTLETYEKVSGQIINFDKSNFMTSPNTNCDLARKIENILQVSHTESLGQYLGF
         +     +K +  +++ +    V    +ADD +++ E       N+ K +  + KVSG  IN  KS      N N     +I   L  +      +YLG 
Subjt:  LLNYSESKKLMTSLRINRYCPSVTHLFYADDSLLFFEASENDCINIKKTLETYEKVSGQIINFDKSNFMTSPNTNCDLARKIENILQVSHTESLGQYLGF

Query:  PSQNARSQKEIFNH----IKDRVWRVLQGWKGRFFSAAGREILIKS--VAQAIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCIN
          Q  R  K++F      +   +      WK    S  GR  ++K   + + I  +     K PM+   EL     +F W    KR++I   +   L   
Subjt:  PSQNARSQKEIFNH----IKDRVWRVLQGWKGRFFSAAGREILIKS--VAQAIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCIN

Query:  KRLGGMGFRDISVFNKAMLAKQSW
         + GG+   D  ++ KA + K +W
Subjt:  KRLGGMGFRDISVFNKAMLAKQSW

P08548 LINE-1 reverse transcriptase homolog3.3e-4225.91Show/hide
Query:  RKKVNKICGLYDASGGWVETD-EDMERVANNYFQALFQSSSPHLDAIDDILEVTPV-CISEVQNRKLLVPFTREEIYGVVKSMHPTKAPGPDGMQAIFYQ
        +K+V  +          + TD  +++++ N Y++ L+     +L  ID  LE   +  +S+ +   L  P +  EI   ++++   K+PGPDG  + FYQ
Subjt:  RKKVNKICGLYDASGGWVETD-EDMERVANNYFQALFQSSSPHLDAIDDILEVTPV-CISEVQNRKLLVPFTREEIYGVVKSMHPTKAPGPDGMQAIFYQ

Query:  KYWDVVGRDICDFCLQFLNGEGQI-DRINKTYIVLIPKVKELKTMKD-FRPISLCSVIYKIISKVLANRLKQVLNHIISPSQSAFVPGRLITDNALVGFE
         + + +   + +   Q +  EG + +   +  I LIPK  +  T K+ +RPISL ++  KI++K+L NR++Q +  II   Q  F+PG     N      
Subjt:  KYWDVVGRDICDFCLQFLNGEGQI-DRINKTYIVLIPKVKELKTMKD-FRPISLCSVIYKIISKVLANRLKQVLNHIISPSQSAFVPGRLITDNALVGFE

Query:  CIHAVNSKKKGKSGVVALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSRLL
         I  +N K K K  ++ L +D  KA+D ++  ++ + + K+G E  ++ LI       +  ++LNGV    F  + G RQG PLSP LF +  E L+  +
Subjt:  CIHAVNSKKKGKSGVVALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSRLL

Query:  NYSESKKLMTSLRINRYCPSVTHLFYADDSLLFFEASENDCINIKKTLETYEKVSGQIINFDKSNFMTSPNTNCDLARKIENILQVSHTESLGQYLG-FP
             +K +  + I      +    +ADD +++ E + +    + + ++ Y  VSG  IN  KS      N N    + +++ +  +      +YLG + 
Subjt:  NYSESKKLMTSLRINRYCPSVTHLFYADDSLLFFEASENDCINIKKTLETYEKVSGQIINFDKSNFMTSPNTNCDLARKIENILQVSHTESLGQYLG-FP

Query:  SQNARS-QKEIFNHIKDRVWRVLQGWKGRFFSAAGREILIKS--VAQAIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINK-RL
        +++ +   KE +  ++  +   +  WK    S  GR  ++K   + +AI N+     K P+S   +L  +   F W    K+ +I     K L  NK + 
Subjt:  SQNARS-QKEIFNHIKDRVWRVLQGWKGRFFSAAGREILIKS--VAQAIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINK-RL

Query:  GGMGFRDISVFNKAMLAKQSW
        GG+   D+ ++ K+++ K +W
Subjt:  GGMGFRDISVFNKAMLAKQSW

P11369 LINE-1 retrotransposable element ORF2 protein2.1e-4127.31Show/hide
Query:  GWVETD-EDMERVANNYFQALFQSSSPHLDAIDDILEVTPV-CISEVQNRKLLVPFTREEIYGVVKSMHPTKAPGPDGMQAIFYQKYWDVVGRDICDFCL
        G + TD E+++    ++++ L+ +   +LD +D  L+   V  +++ Q   L  P + +EI  V+ S+   K+PGPDG  A FYQ + + +   I     
Subjt:  GWVETD-EDMERVANNYFQALFQSSSPHLDAIDDILEVTPV-CISEVQNRKLLVPFTREEIYGVVKSMHPTKAPGPDGMQAIFYQKYWDVVGRDICDFCL

Query:  QFLNGEGQI-DRINKTYIVLIPK-VKELKTMKDFRPISLCSVIYKIISKVLANRLKQVLNHIISPSQSAFVPGRLITDNALVGFECIHAVNSKKKGKSGV
          +  EG + +   +  I LIPK  K+   +++FRPISL ++  KI++K+LANR+++ +  II P Q  F+PG     N       IH +N K K K+ +
Subjt:  QFLNGEGQI-DRINKTYIVLIPK-VKELKTMKDFRPISLCSVIYKIISKVLANRLKQVLNHIISPSQSAFVPGRLITDNALVGFECIHAVNSKKKGKSGV

Query:  VALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSRLLNYSESKKLMTSLRIN
        + + LD  KA+D+++  ++ K++ + G +  ++ +I          + +NG        K G RQG PLSPYLF +  E L+R +     +K +  ++I 
Subjt:  VALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSRLLNYSESKKLMTSLRIN

Query:  RYCPSVTHLFYADDSLLFFEASENDCINIKKTLETYEKVSGQIINFDKS-NFMTSPNTNCDLARKIENILQVSHTESLGQYLGFPSQNARSQKEI----F
        +    ++ L  ADD +++    +N    +   + ++ +V G  IN +KS  F+ + N   +  ++I      S   +  +YLG      +  K++    F
Subjt:  RYCPSVTHLFYADDSLLFFEASENDCINIKKTLETYEKVSGQIINFDKS-NFMTSPNTNCDLARKIENILQVSHTESLGQYLGFPSQNARSQKEI----F

Query:  NHIKDRVWRVLQGWKGRFFSAAGREILIKS--VAQAIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINKRL-GGMGFRDISVFN
          +K  +   L+ WK    S  GR  ++K   + +AI  +     K P    NEL     +F W   +K+ +I     K L  +KR  GG+   D+ ++ 
Subjt:  NHIKDRVWRVLQGWKGRFFSAAGREILIKS--VAQAIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINKRL-GGMGFRDISVFN

Query:  KAMLAKQSW
        +A++ K +W
Subjt:  KAMLAKQSW

P14381 Transposon TX1 uncharacterized 149 kDa protein4.0e-4027.81Show/hide
Query:  ANSRKKVNK--ICGLYDASGGWVETDEDMERVANNYFQALFQSSSPHLDAIDDILEVTPVCISEVQNRKLLVPFTREEIYGVVKSMHPTKAPGPDGMQAI
        A  +KK N+  I  L+   G  +E  E +   A +++Q LF       DA +++ +  PV +SE +  +L  P T +E+   ++ M   K+PG DG+   
Subjt:  ANSRKKVNK--ICGLYDASGGWVETDEDMERVANNYFQALFQSSSPHLDAIDDILEVTPVCISEVQNRKLLVPFTREEIYGVVKSMHPTKAPGPDGMQAI

Query:  FYQKYWDVVGRDICDFCLQ-FLNGEGQIDRINKTYIVLIPKVKELKTMKDFRPISLCSVIYKIISKVLANRLKQVLNHIISPSQSAFVPGRLITDNALVG
        F+Q +WD +G D      + F  GE  +    +  + L+PK  +L+ +K++RP+SL S  YKI++K ++ RLK VL  +I P QS  VPGR I DN  + 
Subjt:  FYQKYWDVVGRDICDFCLQ-FLNGEGQIDRINKTYIVLIPKVKELKTMKDFRPISLCSVIYKIISKVLANRLKQVLNHIISPSQSAFVPGRLITDNALVG

Query:  FECIHAVNSKKKGKSGVVALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSR
         + +H   +++ G S +  L LD  KA+DRV+  YL   +    F  +++  +     S    V +N    A  +  RG+RQG PLS  L+ +  E    
Subjt:  FECIHAVNSKKKGKSGVVALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSR

Query:  LLNYSESKKLMTSLRINRYCPSVTHLFYADDSLLFFEASENDCINIKKT---LETYEKVSGQIINFDKSNFMTSPNTNCDLARKIENILQVSHTESLGQY
        LL     +K +T L +      V    YADD +L  +    D +++++     E Y   S   IN+ KS+ +   +   D          +S    + +Y
Subjt:  LLNYSESKKLMTSLRINRYCPSVTHLFYADDSLLFFEASENDCINIKKT---LETYEKVSGQIINFDKSNFMTSPNTNCDLARKIENILQVSHTESLGQY

Query:  LG-FPSQNARSQKEIFNHIKDRVWRVLQGWKG--RFFSAAGREILIKSVAQAIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCIN
        LG + S       + F  +++ V   L  WKG  +  S  GR ++I  +  +   Y + C         ++      F W  +      HW S     + 
Subjt:  LG-FPSQNARSQKEIFNHIKDRVWRVLQGWKG--RFFSAAGREILIKSVAQAIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCIN

Query:  KRLGGMG
         + GG G
Subjt:  KRLGGMG

P93295 Uncharacterized mitochondrial protein AtMg003107.3e-3449.25Show/hide
Query:  AIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINKR-LGGMGFRDISVFNKAMLAKQSWRIIHNPNSLLTRVLRGRYFKNGNFLK
        A+P YAMSCF+    LC +L S    FWW   + + KI W +W+KLC +K   GG+GFRD+  FN+A+LAKQS+RIIH P++LL+R+LR RYF + + ++
Subjt:  AIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINKR-LGGMGFRDISVFNKAMLAKQSWRIIHNPNSLLTRVLRGRYFKNGNFLK

Query:  AEMGNNPSFTWRSIVWGRELFRKGYRWRIGDGFN
          +G  PS+ WRSI+ GREL  +G    IGDG +
Subjt:  AEMGNNPSFTWRSIVWGRELFRKGYRWRIGDGFN

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein1.7e-1425.81Show/hide
Query:  LRGRYFKNGNFLKAEMGNNPSFTWRSIVWGRELFRKGYRWRIGDGFNVDDG-----------PLN-----------------------------------
        ++ RYFK+ + L A++    S+ W S++ G  L +KG R  IGDG N+  G           PLN                                   
Subjt:  LRGRYFKNGNFLKAEMGNNPSFTWRSIVWGRELFRKGYRWRIGDGFNVDDG-----------PLN-----------------------------------

Query:  --------------KEDSIIWHHDTKGFFSVKSAYRLGIQVQASNEASISHNGKMEALWDIYWKTPVPPKIRVCRWTIFHDILPTRTNLIKRGMEVNPLC
                      K D IIW+++T G ++V+S Y L     ++N  +I+       L    W  P+ PK++   W      L T   L  RGM ++P C
Subjt:  --------------KEDSIIWHHDTKGFFSVKSAYRLGIQVQASNEASISHNGKMEALWDIYWKTPVPPKIRVCRWTIFHDILPTRTNLIKRGMEVNPLC

Query:  ILCKTKLETITHLLWEC
          C  + E+I H L+ C
Subjt:  ILCKTKLETITHLLWEC

AT4G20520.1 RNA binding;RNA-directed DNA polymerases2.7e-1532.87Show/hide
Query:  LANRLKQVLNHIISPSQSAFVPGRLITDNALVGFECIHAVNSKKKGKSGVVALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLN
        +  RLK ++ ++I P+Q++F+PGR+ TDN +   E +H++  +KKG  G + LKLD+ KAYDR+ W YL   +   GF + W+  I +   +   + +  
Subjt:  LANRLKQVLNHIISPSQSAFVPGRLITDNALVGFECIHAVNSKKKGKSGVVALKLDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLN

Query:  GVPRAEFSPKR-------GLRQGDPLSPYL--FIMCAEGLSRL
         V RA+ S +        G R  D  +P+    + CAE L  +
Subjt:  GVPRAEFSPKR-------GLRQGDPLSPYL--FIMCAEGLSRL

AT4G29090.1 Ribonuclease H-like superfamily protein5.7e-4226.24Show/hide
Query:  AIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINKRLGGMGFRDISVFNKAMLAKQSWRIIHNPNSLLTRVLRGRYFKNGNFLKA
        A+P Y M+CF  P ++C ++ S+ A FWW  + +   +HW++W  L   K  GG+GF+DI  FN A+L KQ WR++  P SL+ +V + RYF   + L A
Subjt:  AIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINKRLGGMGFRDISVFNKAMLAKQSWRIIHNPNSLLTRVLRGRYFKNGNFLKA

Query:  EMGNNPSFTWRSIVWGRELFRKGYRWRIGDGFNV--------DDGPLNKE--------------------------------------------------
         +G+ PSF W+SI   +E+ R+G R  +G+G ++        D  P +                                                    
Subjt:  EMGNNPSFTWRSIVWGRELFRKGYRWRIGDGFNV--------DDGPLNKE--------------------------------------------------

Query:  ---------DSIIWHHDTKGFFSVKSAYRLGIQVQASNEASISHNGKMEALWDIY---WKTPVPPKIRVCRWTIFHDILPTRTNLIKRGMEVNPLCILCK
                 DS  W + + G ++VKS Y   +  Q  N+ S        +L  IY   WK+   PKI+   W    + LP    L  R +     CI C 
Subjt:  ---------DSIIWHHDTKGFFSVKSAYRLGIQVQASNEASISHNGKMEALWDIY---WKTPVPPKIRVCRWTIFHDILPTRTNLIKRGMEVNPLCILCK

Query:  TKLETITHLLWECNQTK------------GDVKTD--------------GQADWSEISN-----------------------------RPVDMDQATPEH
        +  ET+ HLL++C   +            G    D              G   W + S                              R  + D      
Subjt:  TKLETITHLLWECNQTK------------GDVKTD--------------GQADWSEISN-----------------------------RPVDMDQATPEH

Query:  REPAPTATNDRQATIASV-RWMPPAMGLWKLNNDASWNEEKKLGGVGWVLRDEKGNLISTGFRVL
        R  A +     Q   +S  RW PP     K N DA+WN + +  G+GWVLR+EKG +   G R L
Subjt:  REPAPTATNDRQATIASV-RWMPPAMGLWKLNNDASWNEEKKLGGVGWVLRDEKGNLISTGFRVL

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein5.2e-3549.25Show/hide
Query:  AIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINKR-LGGMGFRDISVFNKAMLAKQSWRIIHNPNSLLTRVLRGRYFKNGNFLK
        A+P YAMSCF+    LC +L S    FWW   + + KI W +W+KLC +K   GG+GFRD+  FN+A+LAKQS+RIIH P++LL+R+LR RYF + + ++
Subjt:  AIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINKR-LGGMGFRDISVFNKAMLAKQSWRIIHNPNSLLTRVLRGRYFKNGNFLK

Query:  AEMGNNPSFTWRSIVWGRELFRKGYRWRIGDGFN
          +G  PS+ WRSI+ GREL  +G    IGDG +
Subjt:  AEMGNNPSFTWRSIVWGRELFRKGYRWRIGDGFN

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)2.5e-1345.59Show/hide
Query:  LLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSRLLNYSESKKLMTSLRINRYCPSVTHLFYADDS
        ++NG P+   +P RGLRQGDPLSPYLFI+C E LS L   ++ +  +  +R++   P + HL +ADD+
Subjt:  LLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSRLLNYSESKKLMTSLRINRYCPSVTHLFYADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGCAAATAGCAGAAAGAAGGTCAACAAAATTTGCGGACTCTACGATGCAAGTGGGGGTTGGGTGGAGACCGATGAGGATATGGAGAGGGTTGCCAACAACTATTT
TCAAGCTCTGTTTCAATCGTCTAGCCCCCACTTGGATGCCATTGATGACATCCTGGAGGTGACCCCTGTTTGTATTTCAGAGGTCCAAAACAGGAAGCTCTTGGTTCCGT
TCACTAGGGAGGAGATTTATGGAGTAGTTAAGAGTATGCATCCTACTAAGGCTCCGGGCCCTGATGGGATGCAAGCTATTTTCTACCAGAAATACTGGGATGTGGTAGGA
CGAGATATTTGTGACTTCTGCCTGCAATTCCTGAATGGGGAAGGCCAGATAGATAGAATAAACAAGACATACATTGTTTTGATCCCGAAGGTAAAGGAGCTAAAAACCAT
GAAGGATTTCAGACCTATAAGTTTGTGCTCGGTGATCTACAAAATTATATCCAAGGTACTAGCGAATAGGCTAAAACAGGTGTTGAATCATATAATTTCCCCTAGTCAAT
CAGCGTTTGTCCCTGGAAGGCTCATAACTGATAATGCGCTTGTCGGGTTTGAATGCATTCATGCTGTAAATTCCAAAAAGAAAGGTAAATCAGGGGTTGTGGCTCTCAAA
TTGGATATGAGCAAGGCTTATGACCGGGTAGAATGGGTTTATCTTCGTAAGTTGATGGCTAAGTTGGGGTTCGAGGACAGATGGATTGCTCTTATCATGCAATGTGTGGA
GTCGGTCAGCTTCCAGGTGCTTTTGAATGGAGTTCCAAGAGCTGAGTTCTCTCCCAAGAGAGGGCTCCGTCAGGGTGACCCTCTATCCCCTTATTTGTTCATCATGTGCG
CAGAAGGCTTATCCAGGCTCCTCAACTACTCAGAATCCAAAAAACTGATGACAAGTTTGCGAATAAATAGGTACTGTCCTAGTGTGACTCATTTATTCTATGCAGATGAT
AGTTTACTATTTTTTGAGGCCTCTGAGAATGATTGCATTAATATTAAGAAAACTCTTGAAACTTATGAGAAGGTGTCTGGGCAGATTATAAATTTTGATAAATCCAACTT
CATGACTAGTCCTAATACTAACTGTGATCTAGCTAGAAAGATCGAGAATATTTTGCAGGTTTCACACACGGAGAGCTTGGGGCAATACCTGGGGTTCCCGTCACAGAATG
CTAGAAGCCAGAAGGAAATTTTCAACCACATCAAGGATCGAGTCTGGAGAGTACTACAAGGATGGAAAGGGAGGTTCTTCTCAGCTGCTGGAAGGGAGATTCTAATTAAA
TCAGTGGCCCAAGCCATTCCTAACTATGCAATGAGCTGTTTTAAATTCCCTATGTCTCTCTGTAATGAACTTAATTCCATGTGTGCCAGGTTTTGGTGGGGAGTGGAGGA
CAAAAGGAGCAAGATTCATTGGAGAAGTTGGAAAAAGTTGTGCATCAACAAAAGGCTTGGAGGCATGGGATTTAGAGATATTTCGGTGTTCAACAAGGCGATGTTAGCCA
AGCAGAGTTGGAGGATAATTCACAATCCGAATAGCCTCCTGACGAGAGTCCTCAGAGGCAGATATTTTAAGAATGGAAATTTCCTAAAGGCGGAGATGGGAAACAATCCT
TCTTTCACGTGGAGAAGCATTGTGTGGGGGAGGGAGTTGTTTCGGAAAGGCTACAGATGGCGAATAGGGGATGGATTCAATGTGGATGATGGTCCGCTGAACAAGGAGGA
TTCTATTATTTGGCATCATGATACAAAAGGATTCTTTTCTGTGAAAAGCGCGTATAGACTGGGGATCCAGGTTCAAGCATCCAATGAGGCGTCAATATCTCACAATGGGA
AGATGGAAGCTCTATGGGATATATATTGGAAAACCCCAGTTCCCCCAAAAATCAGAGTGTGCAGGTGGACAATTTTTCATGACATCCTCCCAACTCGCACAAATCTTATT
AAAAGGGGAATGGAAGTTAACCCACTGTGTATTTTGTGCAAGACAAAGCTGGAAACGATCACACATCTGCTGTGGGAGTGCAATCAAACAAAAGGCGATGTGAAGACCGA
TGGCCAGGCGGATTGGAGCGAGATCTCGAACAGACCCGTTGACATGGATCAAGCTACGCCGGAACACAGAGAGCCTGCCCCAACCGCGACAAATGATCGGCAAGCGACGA
TTGCGTCGGTTAGATGGATGCCGCCGGCGATGGGACTCTGGAAGTTGAACAATGACGCGTCCTGGAATGAGGAAAAGAAGTTAGGGGGAGTGGGCTGGGTGTTGCGAGAT
GAGAAAGGCAACCTGATTTCTACTGGTTTTCGGGTTCTTCATCAGCAGTGGCTGGAGGCCCTTGCCGTCTCAGATGGTTTGCGTTTGATTCCTATCGAGTCCCCTCTGGG
TGGAGCACGCACACCCCTGCGTTGGATGAGAGGAAGATCATCCTCATCATCCATGGGCGCCACGACCGGTGCTTCTTCGTTCTCAACGCGAGCATTGAGGTCAATTTGCA
CTGCGTTGGCCTCTGCGATCGTGGCTAGACTTGTGACAGACATTTCTTCGGCTGCTTTTTCTTGCGCTTCTTTGTCATCGCGTTCCTTTGCCGCCCTTTCCTCTTCTGCG
ATGTGCTCTGCAGCGGCTTTCCGGGAGATTTCATCTGCCGTAGCCTTGTCTTCTTTGTCAGCGCGCTCCTTTGCCTCCCTTTCTTCCCTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGGCAAATAGCAGAAAGAAGGTCAACAAAATTTGCGGACTCTACGATGCAAGTGGGGGTTGGGTGGAGACCGATGAGGATATGGAGAGGGTTGCCAACAACTATTT
TCAAGCTCTGTTTCAATCGTCTAGCCCCCACTTGGATGCCATTGATGACATCCTGGAGGTGACCCCTGTTTGTATTTCAGAGGTCCAAAACAGGAAGCTCTTGGTTCCGT
TCACTAGGGAGGAGATTTATGGAGTAGTTAAGAGTATGCATCCTACTAAGGCTCCGGGCCCTGATGGGATGCAAGCTATTTTCTACCAGAAATACTGGGATGTGGTAGGA
CGAGATATTTGTGACTTCTGCCTGCAATTCCTGAATGGGGAAGGCCAGATAGATAGAATAAACAAGACATACATTGTTTTGATCCCGAAGGTAAAGGAGCTAAAAACCAT
GAAGGATTTCAGACCTATAAGTTTGTGCTCGGTGATCTACAAAATTATATCCAAGGTACTAGCGAATAGGCTAAAACAGGTGTTGAATCATATAATTTCCCCTAGTCAAT
CAGCGTTTGTCCCTGGAAGGCTCATAACTGATAATGCGCTTGTCGGGTTTGAATGCATTCATGCTGTAAATTCCAAAAAGAAAGGTAAATCAGGGGTTGTGGCTCTCAAA
TTGGATATGAGCAAGGCTTATGACCGGGTAGAATGGGTTTATCTTCGTAAGTTGATGGCTAAGTTGGGGTTCGAGGACAGATGGATTGCTCTTATCATGCAATGTGTGGA
GTCGGTCAGCTTCCAGGTGCTTTTGAATGGAGTTCCAAGAGCTGAGTTCTCTCCCAAGAGAGGGCTCCGTCAGGGTGACCCTCTATCCCCTTATTTGTTCATCATGTGCG
CAGAAGGCTTATCCAGGCTCCTCAACTACTCAGAATCCAAAAAACTGATGACAAGTTTGCGAATAAATAGGTACTGTCCTAGTGTGACTCATTTATTCTATGCAGATGAT
AGTTTACTATTTTTTGAGGCCTCTGAGAATGATTGCATTAATATTAAGAAAACTCTTGAAACTTATGAGAAGGTGTCTGGGCAGATTATAAATTTTGATAAATCCAACTT
CATGACTAGTCCTAATACTAACTGTGATCTAGCTAGAAAGATCGAGAATATTTTGCAGGTTTCACACACGGAGAGCTTGGGGCAATACCTGGGGTTCCCGTCACAGAATG
CTAGAAGCCAGAAGGAAATTTTCAACCACATCAAGGATCGAGTCTGGAGAGTACTACAAGGATGGAAAGGGAGGTTCTTCTCAGCTGCTGGAAGGGAGATTCTAATTAAA
TCAGTGGCCCAAGCCATTCCTAACTATGCAATGAGCTGTTTTAAATTCCCTATGTCTCTCTGTAATGAACTTAATTCCATGTGTGCCAGGTTTTGGTGGGGAGTGGAGGA
CAAAAGGAGCAAGATTCATTGGAGAAGTTGGAAAAAGTTGTGCATCAACAAAAGGCTTGGAGGCATGGGATTTAGAGATATTTCGGTGTTCAACAAGGCGATGTTAGCCA
AGCAGAGTTGGAGGATAATTCACAATCCGAATAGCCTCCTGACGAGAGTCCTCAGAGGCAGATATTTTAAGAATGGAAATTTCCTAAAGGCGGAGATGGGAAACAATCCT
TCTTTCACGTGGAGAAGCATTGTGTGGGGGAGGGAGTTGTTTCGGAAAGGCTACAGATGGCGAATAGGGGATGGATTCAATGTGGATGATGGTCCGCTGAACAAGGAGGA
TTCTATTATTTGGCATCATGATACAAAAGGATTCTTTTCTGTGAAAAGCGCGTATAGACTGGGGATCCAGGTTCAAGCATCCAATGAGGCGTCAATATCTCACAATGGGA
AGATGGAAGCTCTATGGGATATATATTGGAAAACCCCAGTTCCCCCAAAAATCAGAGTGTGCAGGTGGACAATTTTTCATGACATCCTCCCAACTCGCACAAATCTTATT
AAAAGGGGAATGGAAGTTAACCCACTGTGTATTTTGTGCAAGACAAAGCTGGAAACGATCACACATCTGCTGTGGGAGTGCAATCAAACAAAAGGCGATGTGAAGACCGA
TGGCCAGGCGGATTGGAGCGAGATCTCGAACAGACCCGTTGACATGGATCAAGCTACGCCGGAACACAGAGAGCCTGCCCCAACCGCGACAAATGATCGGCAAGCGACGA
TTGCGTCGGTTAGATGGATGCCGCCGGCGATGGGACTCTGGAAGTTGAACAATGACGCGTCCTGGAATGAGGAAAAGAAGTTAGGGGGAGTGGGCTGGGTGTTGCGAGAT
GAGAAAGGCAACCTGATTTCTACTGGTTTTCGGGTTCTTCATCAGCAGTGGCTGGAGGCCCTTGCCGTCTCAGATGGTTTGCGTTTGATTCCTATCGAGTCCCCTCTGGG
TGGAGCACGCACACCCCTGCGTTGGATGAGAGGAAGATCATCCTCATCATCCATGGGCGCCACGACCGGTGCTTCTTCGTTCTCAACGCGAGCATTGAGGTCAATTTGCA
CTGCGTTGGCCTCTGCGATCGTGGCTAGACTTGTGACAGACATTTCTTCGGCTGCTTTTTCTTGCGCTTCTTTGTCATCGCGTTCCTTTGCCGCCCTTTCCTCTTCTGCG
ATGTGCTCTGCAGCGGCTTTCCGGGAGATTTCATCTGCCGTAGCCTTGTCTTCTTTGTCAGCGCGCTCCTTTGCCTCCCTTTCTTCCCTCTAG
Protein sequenceShow/hide protein sequence
MKANSRKKVNKICGLYDASGGWVETDEDMERVANNYFQALFQSSSPHLDAIDDILEVTPVCISEVQNRKLLVPFTREEIYGVVKSMHPTKAPGPDGMQAIFYQKYWDVVG
RDICDFCLQFLNGEGQIDRINKTYIVLIPKVKELKTMKDFRPISLCSVIYKIISKVLANRLKQVLNHIISPSQSAFVPGRLITDNALVGFECIHAVNSKKKGKSGVVALK
LDMSKAYDRVEWVYLRKLMAKLGFEDRWIALIMQCVESVSFQVLLNGVPRAEFSPKRGLRQGDPLSPYLFIMCAEGLSRLLNYSESKKLMTSLRINRYCPSVTHLFYADD
SLLFFEASENDCINIKKTLETYEKVSGQIINFDKSNFMTSPNTNCDLARKIENILQVSHTESLGQYLGFPSQNARSQKEIFNHIKDRVWRVLQGWKGRFFSAAGREILIK
SVAQAIPNYAMSCFKFPMSLCNELNSMCARFWWGVEDKRSKIHWRSWKKLCINKRLGGMGFRDISVFNKAMLAKQSWRIIHNPNSLLTRVLRGRYFKNGNFLKAEMGNNP
SFTWRSIVWGRELFRKGYRWRIGDGFNVDDGPLNKEDSIIWHHDTKGFFSVKSAYRLGIQVQASNEASISHNGKMEALWDIYWKTPVPPKIRVCRWTIFHDILPTRTNLI
KRGMEVNPLCILCKTKLETITHLLWECNQTKGDVKTDGQADWSEISNRPVDMDQATPEHREPAPTATNDRQATIASVRWMPPAMGLWKLNNDASWNEEKKLGGVGWVLRD
EKGNLISTGFRVLHQQWLEALAVSDGLRLIPIESPLGGARTPLRWMRGRSSSSSMGATTGASSFSTRALRSICTALASAIVARLVTDISSAAFSCASLSSRSFAALSSSA
MCSAAAFREISSAVALSSLSARSFASLSSL