CuGenDBv2

Lag0026563 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0026563
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr10:39088117..39090429
RNA-Seq Expression: Lag0026563
Synteny: Lag0026563
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type
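
The genome location field can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr10:39088117..39090429")
print(chrom, start, end, span_length(start, end))
```

Under that coordinate assumption, the gene spans 2313 bp on chr10.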


Homology
GenBank top hits (e value, % identity, alignment)
VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis] | e value: 1.1e-113 | % identity: 33.25
Query:  MSKAYDRVEWFFIEKFMIALGLDLNVVSLIMRCVSSVTYSVSINGRRLGCIRLSRGLRQGDPLSPYLFLLCADGLSRMLGWAR-----------------
        M+KAYDRVEW F+ + M+ LG     V+ +M C+S+ T+S+   G  +G I   RGLRQG PLSPYLFL+C +G S +L  A                  
Subjt:  MSKAYDRVEWFFIEKFMIALGLDLNVVSLIMRCVSSVTYSVSINGRRLGCIRLSRGLRQGDPLSPYLFLLCADGLSRMLGWAR-----------------

Query:  ---------------DNEVRLVARTL-SAFSRLSGQEINFMKSG----------------------LVGFHDRYLGLPAVFPGRKVASLKYIKDRVWAYI
                        NE      TL   +  +SGQ+IN+ KS                       +V  H++YLGLP +    +    +++KD++W +I
Subjt:  ---------------DNEVRLVARTL-SAFSRLSGQEINFMKSG----------------------LVGFHDRYLGLPAVFPGRKVASLKYIKDRVWAYI

Query:  HKWKHWRFSVGGKDVLIKFVLQAIPNYSMSCFQLPKSLIKDCNRIMARFWWGDAEEGRRVHWASWRKVCVSKFCGGLGFRDLELFNRALLAKQGWRLIEH
          WK    S  GK++L+K VLQAIP YSMSCF++PK L K+ N IMARFWW  A++ R +HW  W  +C SKF GGLGFRDLE FN+ALLAKQ WR++  
Subjt:  HKWKHWRFSVGGKDVLIKFVLQAIPNYSMSCFQLPKSLIKDCNRIMARFWWGDAEEGRRVHWASWRKVCVSKFCGGLGFRDLELFNRALLAKQGWRLIEH

Query:  PESLLCRVLKGKYFRECSFMEVEHKGNGSYVWKSLLWGRGLLKERARWRVGDGRSISILRDKWIPRQSTLRVIHGEGLHPEDRVEMLLSSNNTWDVELVQ
        PESL+ R+ + +Y     F+E E   N S++W+SL WG+ LL +  RWRVG+G SI +  DKW+P  S  +++    L     V  L +S+  W+V L++
Subjt:  PESLLCRVLKGKYFRECSFMEVEHKGNGSYVWKSLLWGRGLLKERARWRVGDGRSISILRDKWIPRQSTLRVIHGEGLHPEDRVEMLLSSNNTWDVELVQ

Query:  GLFCEEDARAILGIPRPRGQSQDKLIWHYEKNGFYTVKSGYKLAC----SLKGEAS---SLDSEKLKQWWK-----------------------------
         +F +++  A L IP       D LIWHYE+NG Y+VKSGY+LAC     + GE S    L+S+  K+ W                              
Subjt:  GLFCEEDARAILGIPRPRGQSQDKLIWHYEKNGFYTVKSGYKLAC----SLKGEAS---SLDSEKLKQWWK-----------------------------

Query:  ---------------------FVCSLFEE-------------------------------------FVVLCWWIWNRRNKELFGGRQGLEVEDGGWGWVS
                             ++C   +E                                     F  LCW +WNRRN  +F G+   E        ++
Subjt:  ---------------------FVCSLFEE-------------------------------------FVVLCWWIWNRRNKELFGGRQGLEVEDGGWGWVS

Query:  EYLTQF-------RAFQGRKESSGAGVREVAVWRPPIHPNYKLNTNAAVNKISQSSSVGAIVRNERGEVMLTVAKLVTLAQEVDILEALAIRDSLGLVRE
        +   +F           GR+ S  A +     WRPP    YK+N + AV        VG +VRN  GE M    + +  +      E +A  + L    +
Subjt:  EYLTQF-------RAFQGRKESSGAGVREVAVWRPPIHPNYKLNTNAAVNKISQSSSVGAIVRNERGEVMLTVAKLVTLAQEVDILEALAIRDSLGLVRE

Query:  MGFMRLEVESDSAKVI-SLLRSETSDVSDLGVLAEEIKQLARGFQFYSFRWCRRSANKVAHELARLTLELGRDGTWLEE
        MGF    +E D+   I S+L +E  +  D G+L EE+  L   F+    +W  RS NKVAH LA+         TW+EE
Subjt:  MGFMRLEVESDSAKVI-SLLRSETSDVSDLGVLAEEIKQLARGFQFYSFRWCRRSANKVAHELARLTLELGRDGTWLEE

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis] | e value: 9.5e-110 | % identity: 31.28
Query:  MSKAYDRVEWFFIEKFMIALGLDLNVVSLIMRCVSSVTYSVSINGRRLGCIRLSRGLRQGDPLSPYLFLLCADGLSRMLGWA-RDNEVR-----------
        +SKAYDRVEW F+E+ M  LG     +SLIM C+++  +SV ING  +G I+  RGLRQG PLSPYLF+LCA+  S +L  A R+ ++R           
Subjt:  MSKAYDRVEWFFIEKFMIALGLDLNVVSLIMRCVSSVTYSVSINGRRLGCIRLSRGLRQGDPLSPYLFLLCADGLSRMLGWA-RDNEVR-----------

Query:  --------------------LVARTLSAFSRLSGQEINFMKSGL----------------------VGFHDRYLGLPAVFPGRKVASLKYIKDRVWAYIH
                             +      +++ SGQ  NF KS +                      V  +++YLGLP +    K++  K +K +V + I 
Subjt:  --------------------LVARTLSAFSRLSGQEINFMKSGL----------------------VGFHDRYLGLPAVFPGRKVASLKYIKDRVWAYIH

Query:  KWKHWRFSVGGKDVLIKFVLQAIPNYSMSCFQLPKSLIKDCNRIMARFWWGDAEEGRRVHWASWRKVCVSKFCGGLGFRDLELFNRALLAKQGWRLIEHP
         W H  FS GGK++LIK V QA+P Y+MS F+LPK L +D  + +ARFWWG  ++   +HWA W  +  +K  GGLGFRDL  FN+AL+AKQGWRL+ +P
Subjt:  KWKHWRFSVGGKDVLIKFVLQAIPNYSMSCFQLPKSLIKDCNRIMARFWWGDAEEGRRVHWASWRKVCVSKFCGGLGFRDLELFNRALLAKQGWRLIEHP

Query:  ESLLCRVLKGKYFRECSFMEVEHKGNGSYVWKSLLWGRGLLKERARWRVGDGRSISILRDKWIPRQSTLRVIHGEGLHPEDRVEMLLSSNNTWDVELVQG
         SL+ RV+K +Y++  +F   +   N S++W+S+LWG  ++K+  RWR+GDG+ + + +DKWIPR +T + I  + L  E  V  L+ S N W V+ ++ 
Subjt:  ESLLCRVLKGKYFRECSFMEVEHKGNGSYVWKSLLWGRGLLKERARWRVGDGRSISILRDKWIPRQSTLRVIHGEGLHPEDRVEMLLSSNNTWDVELVQG

Query:  LFCEEDARAILGIPRPRGQSQDKLIWHYEKNGFYTVKSGYKLACSL----KGEASSLDS------------EKLK-------------------------
         F +ED  AIL I  P G+ +D+++WH++K G Y+VKSGY+LA +     + E+S+  S            EK+K                         
Subjt:  LFCEEDARAILGIPRPRGQSQDKLIWHYEKNGFYTVKSGYKLACSL----KGEASSLDS------------EKLK-------------------------

Query:  --------------------------------------------------QWWKFVCSLFEEFVVLCWWIWNRRNKELFGGRQGLEVEDGGWGWVSEYLT
                                                           W +   +  E  +V CW IW+ RNK +F G++     D  +   ++  +
Subjt:  --------------------------------------------------QWWKFVCSLFEEFVVLCWWIWNRRNKELFGGRQGLEVEDGGWGWVSEYLT

Query:  QFRAFQG-RKESSGAGVREVAV----WRPPIHPNYKLNTNAAVNKISQSSSVGAIVRNERGEVMLTVAKLVTLAQEVDILEALAIRDSLGLVREMGFMRL
          +A+Q   K  +  G ++  +    W+PP     KLN +AAV+   Q   +GAIVR+  G+++    K     + V + EA AI   L +  ++    L
Subjt:  QFRAFQG-RKESSGAGVREVAV----WRPPIHPNYKLNTNAAVNKISQSSSVGAIVRNERGEVMLTVAKLVTLAQEVDILEALAIRDSLGLVREMGFMRL

Query:  EVESDSAKVISLLRSETSDVSDLGVLAEEIKQLARGFQFYSFRWCRRSANKVAHELARLTLELGRDGTWLEEVLIEIEGV
         VESD  +V+ LL +     +++  +  ++++ ++ F+   F +  R+ N  AH LA+  L       W+     E++ V
Subjt:  EVESDSAKVISLLRSETSDVSDLGVLAEEIKQLARGFQFYSFRWCRRSANKVAHELARLTLELGRDGTWLEEVLIEIEGV

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia] | e value: 3.6e-109 | % identity: 32.79
Query:  MSKAYDRVEWFFIEKFMIALGLDLNVVSLIMRC-VSSVTYSVSINGRRLGCIRLSRGLRQGDPLSPYLFLLCADGLSRMLGWARDNEVRLVARTLSAFSR
        MSKAYDRVEW F+E  M+ +G D         C V  +   +     R+G + L         L+  L ++ +  L  + G     E+     T  A  +
Subjt:  MSKAYDRVEWFFIEKFMIALGLDLNVVSLIMRC-VSSVTYSVSINGRRLGCIRLSRGLRQGDPLSPYLFLLCADGLSRMLGWARDNEVRLVARTLSAFSR

Query:  L-------SGQEINFMKSGLVGFHDRYLGLPAVFPGRKVASLKYIKDRVWAYIHKWKHWRFSVGGKDVLIKFVLQAIPNYSMSCFQLPKSLIKDCNRIMA
        L       S    N +   +V    +YLGLP   P  +     YIKDRVW ++  WK   FS+GGK+VLIK V QAIP Y+MSCF+LPK LI++ + I A
Subjt:  L-------SGQEINFMKSGLVGFHDRYLGLPAVFPGRKVASLKYIKDRVWAYIHKWKHWRFSVGGKDVLIKFVLQAIPNYSMSCFQLPKSLIKDCNRIMA

Query:  RFWWGDAEEGRRVHWASWRKVCVSKFCGGLGFRDLELFNRALLAKQGWRLIEHPESLLCRVLKGKYFRECSFMEVEHKGNGSYVWKSLLWGRGLLKERAR
        RFWWG ++E +++HW +W  + + K  GG+GFRDLELFN+ALLAKQ WR++ HP S+L RVLKG+YF++CSFME +  GN SY+W+S+LWGR LLK+  R
Subjt:  RFWWGDAEEGRRVHWASWRKVCVSKFCGGLGFRDLELFNRALLAKQGWRLIEHPESLLCRVLKGKYFRECSFMEVEHKGNGSYVWKSLLWGRGLLKERAR

Query:  WRVGDGRSISILRDKWIPRQSTLRVIHGEGLHPEDRVEMLLS-SNNTWDVELVQGLFCEEDARAILGIPRPRGQSQDKLIWHYEKNGFYTVKSGYKLAC-
        WR+G+G S+ I  D W+P Q TL+++    L    RV  L+      W  ++V+  F  ++A+ IL IP  RG  +D+LIW+YEK G Y+V+SGYK+A  
Subjt:  WRVGDGRSISILRDKWIPRQSTLRVIHGEGLHPEDRVEMLLS-SNNTWDVELVQGLFCEEDARAILGIPRPRGQSQDKLIWHYEKNGFYTVKSGYKLAC-

Query:  --SLKGEASSLDSEKLKQWWK------------------------------------------------------FVCSL--------------------
                SS  SE+++ WW                                                       ++C                      
Subjt:  --SLKGEASSLDSEKLKQWWK------------------------------------------------------FVCSL--------------------

Query:  ----------FEEFVVLCWWIWNRRNKELFGG------RQGLEVEDGGWGWVSEYLTQFRAFQGRKESSGAGVREVAVWRPPIHPNYKLNTNAAVNKISQ
                  FEE  V+ W +WN+RN   F        + G+E+ +    W ++Y  +FR  +    +         +W+PP    YK+NT+A+     Q
Subjt:  ----------FEEFVVLCWWIWNRRNKELFGG------RQGLEVEDGGWGWVSEYLTQFRAFQGRKESSGAGVREVAVWRPPIHPNYKLNTNAAVNKISQ

Query:  SSSVGAIVRNERGEVMLTVAKLVTLAQEVDILEALAIRDSLGLVREMGFMRLEVESDSAKVISLLRSETSDVSDLGVLAEEIKQLARGFQFYSFRWCRRS
         + +G I+ N+RG+VM    K +   Q VD+ EA+A  + L L  E+G                +     D+S+ G +  + K         SF + +R 
Subjt:  SSSVGAIVRNERGEVMLTVAKLVTLAQEVDILEALAIRDSLGLVREMGFMRLEVESDSAKVISLLRSETSDVSDLGVLAEEIKQLARGFQFYSFRWCRRS

Query:  ANKVAHELARLTLELGRDGTWLEEVLIEIEGVYFSEVLDSV
         NK AH LAR  L L     W+E+  +E++     E L+ +
Subjt:  ANKVAHELARLTLELGRDGTWLEEVLIEIEGVYFSEVLDSV

XP_023919013.1 uncharacterized protein LOC112030568 [Quercus suber] | e value: 2.1e-117 | % identity: 36.43
Query:  MSKAYDRVEWFFIEKFMIALGLDLNVVSLIMRCVSSVTYSVSINGRRLGCIRLSRGLRQGDPLSPYLFLLCADGLSRMLG--------------------
        +SKAYDRVEW F++  MI LG     V  +M+CVS+ ++SV ING+  G IR SRG+RQGDPLSPYLFL+CA+G + +L                     
Subjt:  MSKAYDRVEWFFIEKFMIALGLDLNVVSLIMRCVSSVTYSVSINGRRLGCIRLSRGLRQGDPLSPYLFLLCADGLSRMLG--------------------

Query:  -------------WARDNEVRLVARTLSAFSRLSGQEINF----------------------MKSGLVGFHDRYLGLPAVFPGRKVASLKYIKDRVWAYI
                      A ++EV++V+ TL  ++  SGQ IN                       +K   V   D YLGLP +   RK  +  +IK++VW  I
Subjt:  -------------WARDNEVRLVARTLSAFSRLSGQEINF----------------------MKSGLVGFHDRYLGLPAVFPGRKVASLKYIKDRVWAYI

Query:  HKWKHWRFSVGGKDVLIKFVLQAIPNYSMSCFQLPKSLIKDCNRIMARFWWGDAEEGRRVHWASWRKVCVSKFCGGLGFRDLELFNRALLAKQGWRLIEH
          WK    S  GK+VLIK V Q+IP Y+M  FQLP  L  + + + ARFWWG   E R++HW SW  +  SK  GG+GFRD+  FN A+LAKQGW+L+++
Subjt:  HKWKHWRFSVGGKDVLIKFVLQAIPNYSMSCFQLPKSLIKDCNRIMARFWWGDAEEGRRVHWASWRKVCVSKFCGGLGFRDLELFNRALLAKQGWRLIEH

Query:  PESLLCRVLKGKYFRECSFMEVEHKGNGSYVWKSLLWGRGLLKERARWRVGDGRSISILRDKWIPRQSTLRVIHGEGLHPEDRV------EMLLSSNNTW
          SLL R LK KYF  C F++V+   N SYVWKSLL  + +L++   WRVG+G SI +L+D W+P Q T  V+      PE+ +      +++   N+ W
Subjt:  PESLLCRVLKGKYFRECSFMEVEHKGNGSYVWKSLLWGRGLLKERARWRVGDGRSISILRDKWIPRQSTLRVIHGEGLHPEDRV------EMLLSSNNTW

Query:  DVELVQGLFCEEDARAILGIPRPRGQSQDKLIWHYEKNGFYTVKSGYKLACSLK------GEASSLDSEKLKQW---WKFVCSL----------------
        D + V  LF   D  AIL +P  R   QD L W + KNG Y VKSGY +A  L+      GEA S+ S  +  W   WK  C                  
Subjt:  DVELVQGLFCEEDARAILGIPRPRGQSQDKLIWHYEKNGFYTVKSGYKLACSLK------GEASSLDSEKLKQW---WKFVCSL----------------

Query:  ---------------------FEEFVVLCWWIWNRRNKELFGGRQGLEVEDGGWGWVSEYLTQFRAFQGRKESSGA-GVREVAVWRPPIHPNYKLNTNAA
                             F+ F V+CW IW +RN  L GG    +     +    +YL++F   Q      G   V     W+PP    +KLN + A
Subjt:  ---------------------FEEFVVLCWWIWNRRNKELFGGRQGLEVEDGGWGWVSEYLTQFRAFQGRKESSGA-GVREVAVWRPPIHPNYKLNTNAA

Query:  VNKISQSSSVGAIVRNERGEVMLTVAKLVTLAQEVDILEALAIRDSLGLVREMGFMRLEVESDSAKVISLLRSETSDVSDLGVLAEEIKQLARGFQFYSF
               S  GA+VRN  GEVM  ++      ++ + +E LA R +L    + GFM + +E D+A+V+  +     D++ LG + E+I  L  GF+ +S 
Subjt:  VNKISQSSSVGAIVRNERGEVMLTVAKLVTLAQEVDILEALAIRDSLGLVREMGFMRLEVESDSAKVISLLRSETSDVSDLGVLAEEIKQLARGFQFYSF

Query:  RWCRRSANKVAHELARLTLELGRDGTWLEEVLI
           R SAN VAH LAR       +  WLEE L+
Subjt:  RWCRRSANKVAHELARLTLELGRDGTWLEEVLI

XP_030505068.1 uncharacterized protein LOC115720043 [Cannabis sativa] | e value: 1.9e-110 | % identity: 32.17
Query:  MSKAYDRVEWFFIEKFMIALGLDLNVVSLIMRCVSSVTYSVSINGRRLGCIRLSRGLRQGDPLSPYLFLLCADGLSRMLGW-------------------
        MSKA+DRVEW F++  + A+G  L +V LI+RC+SSVTYS S+NG+  G +  +RG+ QGDPLSPYLF++CA+GL R+L                     
Subjt:  MSKAYDRVEWFFIEKFMIALGLDLNVVSLIMRCVSSVTYSVSINGRRLGCIRLSRGLRQGDPLSPYLFLLCADGLSRMLGW-------------------

Query:  --------------ARDNEVRLVARTLSAFSRLSGQEINFMKSGL----------------------VGFHDRYLGLPAVFPGRKVASLKYIKDRVWAYI
                      A       +  +L+ + R SGQ +N  KS L                         H+RYLGLP+     K+     IK+++W  +
Subjt:  --------------ARDNEVRLVARTLSAFSRLSGQEINFMKSGL----------------------VGFHDRYLGLPAVFPGRKVASLKYIKDRVWAYI

Query:  HKWKHWRFSVGGKDVLIKFVLQAIPNYSMSCFQLPKSLIKDCNRIMARFWWGDAEEGRRVHWASWRKVCVSKFCGGLGFRDLELFNRALLAKQGWRLIEH
          W+   FS+GGK+VL+K V QAIP Y+MSCF+L KSL+     +M +FWWG       ++W +W+ +  SK  GG+GF+    FN+ALLAKQ WR+  +
Subjt:  HKWKHWRFSVGGKDVLIKFVLQAIPNYSMSCFQLPKSLIKDCNRIMARFWWGDAEEGRRVHWASWRKVCVSKFCGGLGFRDLELFNRALLAKQGWRLIEH

Query:  PESLLCRVLKGKYFRECSFMEVEHKGNGSYVWKSLLWGRGLLKERARWRVGDGRSISILRDKWIPRQSTLRVIHGEGLHPEDRVEMLLSSNNTWDVELVQ
        P SLLCRVLK ++F  CSF++       S  W+ ++WG+ LL +  RW+VGDG  I    + W+P  +T + +  +G     +V  L+  +  W+  L++
Subjt:  PESLLCRVLKGKYFRECSFMEVEHKGNGSYVWKSLLWGRGLLKERARWRVGDGRSISILRDKWIPRQSTLRVIHGEGLHPEDRVEMLLSSNNTWDVELVQ

Query:  GLFCEEDARAILGIPRPRGQSQDKLIWHYEKNGFYTVKSGYKLACSLKGEASSLDS------------------------------EKLKQWWK------
         LF + D   IL IP      QD L+WH+E +G Y+VKSGY LA SL+ ++ S+                                ++ +Q W+      
Subjt:  GLFCEEDARAILGIPRPRGQSQDKLIWHYEKNGFYTVKSGYKLACSLKGEASSLDS------------------------------EKLKQWWK------

Query:  --------------------FVCSLFEEFVVLCWWIWNRRNKELFGGRQGLEVEDGGWGWVSEYLTQFRAFQGRKE---------SSGAGVREVAVWRPP
                            +  S  E+F  L W IWN RN+E+ G +   +  D  + +   YL +F +   RK+         +  + V + + W  P
Subjt:  --------------------FVCSLFEEFVVLCWWIWNRRNKELFGGRQGLEVEDGGWGWVSEYLTQFRAFQGRKE---------SSGAGVREVAVWRPP

Query:  IHPNYKLNTNAAVNKISQSSSVGAIVRNERGEVMLTVAKLVTLAQEVDILEALAIRDSLGLVREMGFMRLEVESDSAKVISLLRSETSDVSDLGVLAEEI
             KLNT+AAV+K+   S  GAI+++  G+++ T+A       + +I+E +A+  SL  ++E+      +E+DS  V++ L+     VS+   L + I
Subjt:  IHPNYKLNTNAAVNKISQSSSVGAIVRNERGEVMLTVAKLVTLAQEVDILEALAIRDSLGLVREMGFMRLEVESDSAKVISLLRSETSDVSDLGVLAEEI

Query:  KQLARGFQFYSFRWCRRSANKVAHELARLTLELGRDGTWLEEV
          L   F         RSAN  AH LA+  L    +  W EE+
Subjt:  KQLARGFQFYSFRWCRRSANKVAHELARLTLELGRDGTWLEEV

TrEMBL top hits (e value, % identity, alignment)
A0A2N9ESZ1 Uncharacterized protein | e value: 2.1e-110 | % identity: 32.58
Query:  MSKAYDRVEWFFIEKFMIALGLDLNVVSLIMRCVSSVTYSVSINGRRLGCIRLSRGLRQGDPLSPYLFLLCADGLSRMLGWARDNEVRLVARTLSAFSRL
        MSKAYDRVEW F+ + MI +G     VSL+M C+++V+YS+ ING   G I  SRGLRQGDP+SPYLFL+CA+  S +   A   E + +   L A+ + 
Subjt:  MSKAYDRVEWFFIEKFMIALGLDLNVVSLIMRCVSSVTYSVSINGRRLGCIRLSRGLRQGDPLSPYLFLLCADGLSRMLGWARDNEVRLVARTLSAFSRL

Query:  SGQEINFMKSGL----------------------VGFHDRYLGLPAVFPGRKVASLKYIKDRVWAYIHKWKHWRFSVGGKDVLIKFVLQAIPNYSMSCFQ
        SGQ++N  K+ L                      +  +++YLGL ++    K+     IK+RVW  +  WK    S  G+++LIK V+QAIP Y+M+CF+
Subjt:  SGQEINFMKSGL----------------------VGFHDRYLGLPAVFPGRKVASLKYIKDRVWAYIHKWKHWRFSVGGKDVLIKFVLQAIPNYSMSCFQ

Query:  LPKSLIKDCNRIMARFWWGDAEEGRRVHWASWRKVCVSKFCGGLGFRDLELFNRALLAKQGWRLIEHPESLLCRVLKGKYFRECSFMEVEHKGNGSYVWK
        LP  L KD   I+ RFWWG  E+ R++HW  W+K+C +K  GGLGFR+L+ FN ALLAKQ WRL+    SL+ +V   K+F   + +E + K  GS+ W+
Subjt:  LPKSLIKDCNRIMARFWWGDAEEGRRVHWASWRKVCVSKFCGGLGFRDLELFNRALLAKQGWRLIEHPESLLCRVLKGKYFRECSFMEVEHKGNGSYVWK

Query:  SLLWGRGLLKERARWRVGDGRSISILRDKWIPRQSTLRVIHGEG-LHPEDRV-EMLLSSNNTWDVELVQGLFCEEDARAILGIPRPRGQSQDKLIWHYEK
        S++  + L+   + WRVG+G+ I I   KW+  +   R+I      HP   V E++  +   WDVE ++ +F   DA AIL IP       D+LIWH  K
Subjt:  SLLWGRGLLKERARWRVGDGRSISILRDKWIPRQSTLRVIHGEG-LHPEDRV-EMLLSSNNTWDVELVQGLFCEEDARAILGIPRPRGQSQDKLIWHYEK

Query:  NGFYTVKSGYKL----------------------------------------ACS-------------------------------------------LK
        NG YTV+SGY +                                        AC                                              
Subjt:  NGFYTVKSGYKL----------------------------------------ACS-------------------------------------------LK

Query:  GEASSLDSEKL--KQWWKFVCS--------LFEEFVVLCWWIWNRRNKELFGGRQGL-EVEDGGWGWVSEYLTQFRAFQGRKESSGAGVREVAVWRPPIH
         + + +D +K+    + + VC+        + E F V CW IW++RN++    R  L   E    G  ++ L Q  +    +E      +   +WRPP  
Subjt:  GEASSLDSEKL--KQWWKFVCS--------LFEEFVVLCWWIWNRRNKELFGGRQGL-EVEDGGWGWVSEYLTQFRAFQGRKESSGAGVREVAVWRPPIH

Query:  PNYKLNTNAAVNKISQSSSVGAIVRNERGEVMLTVAKLVTLAQEVDILEALAIRDSLGLVREMGFMRLEVESDSAKVISLLRSETSDVSDLGVLAEEIKQ
          YK+N + A+ K ++   +G +VR++ G V+ T+++ V  +   +++EA A R ++   RE+G + +  E DS  +I  L S+    +  G++ E+ K 
Subjt:  PNYKLNTNAAVNKISQSSSVGAIVRNERGEVMLTVAKLVTLAQEVDILEALAIRDSLGLVREMGFMRLEVESDSAKVISLLRSETSDVSDLGVLAEEIKQ

Query:  LARGFQFYSFRWCRRSANKVAHELARLTLELGRDGTWLEEVLIEIEGVYFSE
        L   FQ Y+F   RRS N VAH LAR  L++     W+E+   +I  + +S+
Subjt:  LARGFQFYSFRWCRRSANKVAHELARLTLELGRDGTWLEEVLIEIEGVYFSE

A0A2N9EYC3 Reverse transcriptase domain-containing protein | e value: 2.4e-111 | % identity: 32.73
Query:  MSKAYDRVEWFFIEKFMIALGLDLNVVSLIMRCVSSVTYSVSINGRRLGCIRLSRGLRQGDPLSPYLFLLCADGL-------------------------
        MSKAYDRVEW +++  M  +G     V+++M CVS+V+YS+ +NG     I+ SRGLRQGDPLSPYLFLLCA+G                          
Subjt:  MSKAYDRVEWFFIEKFMIALGLDLNVVSLIMRCVSSVTYSVSINGRRLGCIRLSRGLRQGDPLSPYLFLLCADGL-------------------------

Query:  --------SRMLGWARDNEVRLVARTLSAFSRLSGQEINFMKSGL----------------------VGFHDRYLGLPAVFPGRKVASLKYIKDRVWAYI
                S +   A  ++V  +   LS + + SGQ+IN  K+ L                      +  ++RYLGLP+     K +S   IK+RVW+ +
Subjt:  --------SRMLGWARDNEVRLVARTLSAFSRLSGQEINFMKSGL----------------------VGFHDRYLGLPAVFPGRKVASLKYIKDRVWAYI

Query:  HKWKHWRFSVGGKDVLIKFVLQAIPNYSMSCFQLPKSLIKDCNRIMARFWWGDAEEGRRVHWASWRKVCVSKFCGGLGFRDLELFNRALLAKQGWRLIEH
          WK    S  G+++LIK V QAIP Y+MSCF+LP  LIK+   ++ RFWWG   E  ++HW  W  +C SK  GG+G RDL +FN ALLAKQ WRL+ +
Subjt:  HKWKHWRFSVGGKDVLIKFVLQAIPNYSMSCFQLPKSLIKDCNRIMARFWWGDAEEGRRVHWASWRKVCVSKFCGGLGFRDLELFNRALLAKQGWRLIEH

Query:  PESLLCRVLKGKYFRECSFMEVEHKGNGSYVWKSLLWGRGLLKERARWRVGDGRSISILRDKWIPRQSTLRVIHGEGLHPE-DRVEMLLSSN-NTWDVEL
        P SL  +V K KYF  CS +EV+    GSY W+S+L  R L+ + + WRVG G+ I I  DKW+   +  R+I    L+     VE L+ S+  +W  EL
Subjt:  PESLLCRVLKGKYFRECSFMEVEHKGNGSYVWKSLLWGRGLLKERARWRVGDGRSISILRDKWIPRQSTLRVIHGEGLHPE-DRVEMLLSSN-NTWDVEL

Query:  VQGLFCEEDARAILGIPRPRGQSQDKLIWHYEKNGFYTVKSGYKLACSLKG--EASSLDSEKLKQWWKFVCSL---------------------------
        V+ LF  ++A  ILGIP       D L+W   K G YTV+SGY L  + +   E    D+ K+ Q WK + SL                           
Subjt:  VQGLFCEEDARAILGIPRPRGQSQDKLIWHYEKNGFYTVKSGYKLACSLKG--EASSLDSEKLKQWWKFVCSL---------------------------

Query:  ----------------------------------------------------------------FEEFVVLCWWIWNRRNKELFGGRQGLEVEDGGWGWV
                                                                        F+ F V+CW IW RRN+     +Q  +         
Subjt:  ----------------------------------------------------------------FEEFVVLCWWIWNRRNKELFGGRQGLEVEDGGWGWV

Query:  SEYLTQFRAFQ--GRKESSGAGVREVAVWRPPIHPNYKLNTNAAVNKISQSSSVGAIVRNERGEVMLTVAKLVTLAQEVDILEALAIRDSLGLVREMGFM
         E L +F   Q  G++ S  +    V  W+PP    YK+N + AV      + +G I+RN RGEVM  +++ +     V+ +EA A R ++   ++ GFM
Subjt:  SEYLTQFRAFQ--GRKESSGAGVREVAVWRPPIHPNYKLNTNAAVNKISQSSSVGAIVRNERGEVMLTVAKLVTLAQEVDILEALAIRDSLGLVREMGFM

Query:  RLEVESDSAKVISLLRSETSDVSDLGVLAEEIKQLARGFQFYSFRWCRRSANKVAHELARLTLELGRDGTWLEEVLIEI
         +++E DS  ++  +   T   +  G + E+I+Q+ARG Q   F   +R  N +AH LA+         TW+E V  E+
Subjt:  RLEVESDSAKVISLLRSETSDVSDLGVLAEEIKQLARGFQFYSFRWCRRSANKVAHELARLTLELGRDGTWLEEVLIEI

A0A2N9IYC8 Reverse transcriptase domain-containing protein | e value: 2.6e-113 | % identity: 35.47
Query:  MSKAYDRVEWFFIEKFMIALGLDLNVVSLIMRCVSSVTYSVSINGRRLGCIRLSRGLRQGDPLSPYLFLLCADGL-------------------------
        MSKAYDRVEW +++  M  +G     V+++M CVS+V+YS+ +NG   G I+ SRGLRQG+PLSPYLFLLCA+G                          
Subjt:  MSKAYDRVEWFFIEKFMIALGLDLNVVSLIMRCVSSVTYSVSINGRRLGCIRLSRGLRQGDPLSPYLFLLCADGL-------------------------

Query:  --------SRMLGWARDNEVRLVARTLSAFSRLSGQEINFMKSGL----------------------VGFHDRYLGLPAVFPGRKVASLKYIKDRVWAYI
                S +   A  ++V  +   LS + + SGQ+IN  K+ L                      +  ++RYLGLP+     K +S   IK+RVW+ +
Subjt:  --------SRMLGWARDNEVRLVARTLSAFSRLSGQEINFMKSGL----------------------VGFHDRYLGLPAVFPGRKVASLKYIKDRVWAYI

Query:  HKWKHWRFSVGGKDVLIKFVLQAIPNYSMSCFQLPKSLIKDCNRIMARFWWGDAEEGRRVHWASWRKVCVSKFCGGLGFRDLELFNRALLAKQGWRLIEH
          WK    S  G+++LIK V QAIP Y+MSCF+LP  LIK+   ++ RFWWG   E  ++HW  W  +C SK  GG+G RDL +FN ALLAKQ WRL+ +
Subjt:  HKWKHWRFSVGGKDVLIKFVLQAIPNYSMSCFQLPKSLIKDCNRIMARFWWGDAEEGRRVHWASWRKVCVSKFCGGLGFRDLELFNRALLAKQGWRLIEH

Query:  PESLLCRVLKGKYFRECSFMEVEHKGNGSYVWKSLLWGRGLLKERARWRVGDGRSISILRDKWIPRQSTLRVIHGEGLHPE-DRVEMLLSSN-NTWDVEL
        P SL  +V K KYF  CS +EV+    GSY W+S+L  R L+ +   WRVG G+ I I  DKW+   +  R+I    L+     VE L+ S+  +W  EL
Subjt:  PESLLCRVLKGKYFRECSFMEVEHKGNGSYVWKSLLWGRGLLKERARWRVGDGRSISILRDKWIPRQSTLRVIHGEGLHPE-DRVEMLLSSN-NTWDVEL

Query:  VQGLFCEEDARAILGIPRPRGQSQDKLIWHYEKNGFYTVKSGYKLACSLKG--EASSLDSEKLKQWWKFVCSLFEEFVVLCWWIWNRRNKELFGGRQGLE
        V+ LF  ++A  ILGIP       D L+W   K G YTV+SGY L  + +   E+   D+ K+ Q WK               IW  R            
Subjt:  VQGLFCEEDARAILGIPRPRGQSQDKLIWHYEKNGFYTVKSGYKLACSLKG--EASSLDSEKLKQWWKFVCSLFEEFVVLCWWIWNRRNKELFGGRQGLE

Query:  VEDGGWGWVSEYLTQFRAFQ--GRKESSGAGVREVAVWRPPIHPNYKLNTNAAVNKISQSSSVGAIVRNERGEVMLTVAKLVTLAQEVDILEALAIRDSL
                  E L +F   Q  G++ S  +    V  W+PP    YK+N + AV      + +G I+RN +GEVM  +++ +     V+ +EA A R ++
Subjt:  VEDGGWGWVSEYLTQFRAFQ--GRKESSGAGVREVAVWRPPIHPNYKLNTNAAVNKISQSSSVGAIVRNERGEVMLTVAKLVTLAQEVDILEALAIRDSL

Query:  GLVREMGFMRLEVESDSAKVISLLRSETSDVSDLGVLAEEIKQLARGFQFYSFRWCRRSANKVAHELARLTLELGRDGTWLEEVLIEI
           +++GFM +++E DS  ++  +   T   +  G + E+I+Q+ARG Q   F   +R  N +AH LA+         TW+E V  E+
Subjt:  GLVREMGFMRLEVESDSAKVISLLRSETSDVSDLGVLAEEIKQLARGFQFYSFRWCRRSANKVAHELARLTLELGRDGTWLEEVLIEI

A0A2N9J6I3 Uncharacterized protein | e value: 9.3e-111 | % identity: 34.73
Query:  MSKAYDRVEWFFIEKFMIALGLDLNVVSLIMRCVSSVTYSVSINGRRLGCIRLSRGLRQGDPLSPYLFLLCADGLSRMLGWARDNEVRL-----------
        MSKAYDRVEW +++K M+ LG     V+LIM CV+SV+YS+ +NG   G ++ SRGLRQGDPLSPYLFL+CA+GL+ +L  A    V             
Subjt:  MSKAYDRVEWFFIEKFMIALGLDLNVVSLIMRCVSSVTYSVSINGRRLGCIRLSRGLRQGDPLSPYLFLLCADGLSRMLGWARDNEVRL-----------

Query:  ----VARTLSAFSR-----LSGQEINFMKSGLVGFHDRYLGLPAVFPGRKVASLKYIKDRVWAYIHKWKHWRFSVGGKDVLIKFVLQAIPNYSMSCFQLP
             A+T   FS+     +    +NF  +      ++YLGLP V    K  +   IKDR+W  +  WK    S  GK VLIK V+QAIP Y+MSCF+ P
Subjt:  ----VARTLSAFSR-----LSGQEINFMKSGLVGFHDRYLGLPAVFPGRKVASLKYIKDRVWAYIHKWKHWRFSVGGKDVLIKFVLQAIPNYSMSCFQLP

Query:  KSLIKDCNRIMARFWWGDAEEGRRVHWASWRKVCVSKFCGGLGFRDLELFNRALLAKQGWRLIEHPESLLCRVLKGKYFRECSFMEVEHKGNGSYVWKSL
          L ++ + +  RFWWG  E GR++HW S +K+C +K  GG+GFRDL+ FN+ALLA+QGWRL+++P+SL+ R LK KYF   SFME + +GN SY+W+S+
Subjt:  KSLIKDCNRIMARFWWGDAEEGRRVHWASWRKVCVSKFCGGLGFRDLELFNRALLAKQGWRLIEHPESLLCRVLKGKYFRECSFMEVEHKGNGSYVWKSL

Query:  LWGRGLLKERARWRVGDGRSISILRDKWIPRQSTLRVIHG-EGLHPEDRVEMLLSSNN-TWDVELVQGLFCEEDARAILGIPRPRGQSQDKLIWHYEKNG
           + +L+   RWRVG G SI I +D+WIP  ST +++     L     V+ L++ ++ +W+V L+Q +F   D   I  IP    + +D LIW   K G
Subjt:  LWGRGLLKERARWRVGDGRSISILRDKWIPRQSTLRVIHG-EGLHPEDRVEMLLSSNN-TWDVELVQGLFCEEDARAILGIPRPRGQSQDKLIWHYEKNG

Query:  FYTVKSGYKLAC--SLKGEASSLDSEKLKQWWK-------------------------------------FVCS--------------------LFEEFV
         +TVKS Y +    S  GEA S  S +L  +WK                                     F CS                    L E  +
Subjt:  FYTVKSGYKLAC--SLKGEASSLDSEKLKQWWK-------------------------------------FVCS--------------------LFEEFV

Query:  VLCWWIWNRRNKELFGGRQGLEVEDGGWGWVSEYLTQFRAFQGRKESSGAGVREV---AVWRPPIHPNYKLNTNAAVNKISQSSSVGAIVRNERGEVMLT
           W +W  RN+ ++  +      D         +    A  G  E+     REV     W PP   ++KLN         + + +G ++RN  G+VM  
Subjt:  VLCWWIWNRRNKELFGGRQGLEVEDGGWGWVSEYLTQFRAFQGRKESSGAGVREV---AVWRPPIHPNYKLNTNAAVNKISQSSSVGAIVRNERGEVMLT

Query:  VAKLVTLAQEVDILEALAIRDSLGLVREMGFMRLEVESDSAKVISLLRSETSDVSDLGVLAEEIKQLARGFQFYSFRWCRRSANKVAHELA
        +   + +  E+D + A  +  ++   R++G MR+ +E     + +LL++    +   GVL ++I  L + FQF SF     + N+ A  LA
Subjt:  VAKLVTLAQEVDILEALAIRDSLGLVREMGFMRLEVESDSAKVISLLRSETSDVSDLGVLAEEIKQLARGFQFYSFRWCRRSANKVAHELA

A0A5E4FZN9 PREDICTED: retrotransposon | e value: 5.3e-114 | % identity: 33.25
Query:  MSKAYDRVEWFFIEKFMIALGLDLNVVSLIMRCVSSVTYSVSINGRRLGCIRLSRGLRQGDPLSPYLFLLCADGLSRMLGWAR-----------------
        M+KAYDRVEW F+ + M+ LG     V+ +M C+S+ T+S+   G  +G I   RGLRQG PLSPYLFL+C +G S +L  A                  
Subjt:  MSKAYDRVEWFFIEKFMIALGLDLNVVSLIMRCVSSVTYSVSINGRRLGCIRLSRGLRQGDPLSPYLFLLCADGLSRMLGWAR-----------------

Query:  ---------------DNEVRLVARTL-SAFSRLSGQEINFMKSG----------------------LVGFHDRYLGLPAVFPGRKVASLKYIKDRVWAYI
                        NE      TL   +  +SGQ+IN+ KS                       +V  H++YLGLP +    +    +++KD++W +I
Subjt:  ---------------DNEVRLVARTL-SAFSRLSGQEINFMKSG----------------------LVGFHDRYLGLPAVFPGRKVASLKYIKDRVWAYI

Query:  HKWKHWRFSVGGKDVLIKFVLQAIPNYSMSCFQLPKSLIKDCNRIMARFWWGDAEEGRRVHWASWRKVCVSKFCGGLGFRDLELFNRALLAKQGWRLIEH
          WK    S  GK++L+K VLQAIP YSMSCF++PK L K+ N IMARFWW  A++ R +HW  W  +C SKF GGLGFRDLE FN+ALLAKQ WR++  
Subjt:  HKWKHWRFSVGGKDVLIKFVLQAIPNYSMSCFQLPKSLIKDCNRIMARFWWGDAEEGRRVHWASWRKVCVSKFCGGLGFRDLELFNRALLAKQGWRLIEH

Query:  PESLLCRVLKGKYFRECSFMEVEHKGNGSYVWKSLLWGRGLLKERARWRVGDGRSISILRDKWIPRQSTLRVIHGEGLHPEDRVEMLLSSNNTWDVELVQ
        PESL+ R+ + +Y     F+E E   N S++W+SL WG+ LL +  RWRVG+G SI +  DKW+P  S  +++    L     V  L +S+  W+V L++
Subjt:  PESLLCRVLKGKYFRECSFMEVEHKGNGSYVWKSLLWGRGLLKERARWRVGDGRSISILRDKWIPRQSTLRVIHGEGLHPEDRVEMLLSSNNTWDVELVQ

Query:  GLFCEEDARAILGIPRPRGQSQDKLIWHYEKNGFYTVKSGYKLAC----SLKGEAS---SLDSEKLKQWWK-----------------------------
         +F +++  A L IP       D LIWHYE+NG Y+VKSGY+LAC     + GE S    L+S+  K+ W                              
Subjt:  GLFCEEDARAILGIPRPRGQSQDKLIWHYEKNGFYTVKSGYKLAC----SLKGEAS---SLDSEKLKQWWK-----------------------------

Query:  ---------------------FVCSLFEE-------------------------------------FVVLCWWIWNRRNKELFGGRQGLEVEDGGWGWVS
                             ++C   +E                                     F  LCW +WNRRN  +F G+   E        ++
Subjt:  ---------------------FVCSLFEE-------------------------------------FVVLCWWIWNRRNKELFGGRQGLEVEDGGWGWVS

Query:  EYLTQF-------RAFQGRKESSGAGVREVAVWRPPIHPNYKLNTNAAVNKISQSSSVGAIVRNERGEVMLTVAKLVTLAQEVDILEALAIRDSLGLVRE
        +   +F           GR+ S  A +     WRPP    YK+N + AV        VG +VRN  GE M    + +  +      E +A  + L    +
Subjt:  EYLTQF-------RAFQGRKESSGAGVREVAVWRPPIHPNYKLNTNAAVNKISQSSSVGAIVRNERGEVMLTVAKLVTLAQEVDILEALAIRDSLGLVRE

Query:  MGFMRLEVESDSAKVI-SLLRSETSDVSDLGVLAEEIKQLARGFQFYSFRWCRRSANKVAHELARLTLELGRDGTWLEE
        MGF    +E D+   I S+L +E  +  D G+L EE+  L   F+    +W  RS NKVAH LA+         TW+EE
Subjt:  MGFMRLEVESDSAKVI-SLLRSETSDVSDLGVLAEEIKQLARGFQFYSFRWCRRSANKVAHELARLTLELGRDGTWLEE

SwissProt top hits (e value, % identity, alignment)
P0C2F6 Putative ribonuclease H protein At1g65750 | e value: 3.8e-29 | % identity: 30.53
Query:  IKDRVWAYIHKWKHWRFSVGGKDVLIKFVLQAIPNYSMSCFQLPKSLIKDCNRIMARFWWGDAEEGRRVHWASWRKVCVSKFCGGLGFRDLELFNRALLA
        I +RV + +  W+    S  G+  L K VL ++P +SMS   LP+S++   +++   F WG   E ++ H   W KVC  K  GGLG R  +  NRAL++
Subjt:  IKDRVWAYIHKWKHWRFSVGGKDVLIKFVLQAIPNYSMSCFQLPKSLIKDCNRIMARFWWGDAEEGRRVHWASWRKVCVSKFCGGLGFRDLELFNRALLA

Query:  KQGWRLIEHPESLLCRVLKGKY----FRECSFMEVEHKGNGSYVWKSLLWG-RGLLKERARWRVGDGRSISILRDKWIPRQSTLRVIHGEGLHPEDRV--
        K GWRL++   SL   VL+ KY     R+  ++    KG+ S  W+S+  G R ++     W  GDG+ I    D+W+  +  L + +GE     D V  
Subjt:  KQGWRLIEHPESLLCRVLKGKY----FRECSFMEVEHKGNGSYVWKSLLWG-RGLLKERARWRVGDGRSISILRDKWIPRQSTLRVIHGEGLHPEDRV--

Query:  EMLLSSNNTWDVELVQGLFCEE---DARAILGIPRPRGQSQDKLIWHYEKNGFYTVKSGYKL
        + L      WD   +          + RA++ +    G ++D+L W + ++G ++V+S Y++
Subjt:  EMLLSSNNTWDVELVQGLFCEE---DARAILGIPRPRGQSQDKLIWHYEKNGFYTVKSGYKL

P11369 LINE-1 retrotransposable element ORF2 protein | e value: 1.8e-10 | % identity: 20.62
Query:  KAYDRVEWFFIEKFMIALGLDLNVVSLIMRCVSSVTYSVSINGRRLGCIRLSRGLRQGDPLSPYLFLLCADGLSRMLGWARD------------------
        KA+D+++  F+ K +   G+    +++I    S    ++ +NG +L  I L  G RQG PLSPYLF +  + L+R +   ++                  
Subjt:  KAYDRVEWFFIEKFMIALGLDLNVVSLIMRCVSSVTYSVSINGRRLGCIRLSRGLRQGDPLSPYLFLLCADGLSRMLGWARD------------------

Query:  ----------NEVRLVARTLSAFSRLSGQEINFMKS---------------------GLVGFHDRYLGLPAVFPGRKV--ASLKYIKDRVWAYIHKWKHW
                  N  R +   +++F  + G +IN  KS                      +V  + +YLG+      + +   + K +K  +   + +WK  
Subjt:  ----------NEVRLVARTLSAFSRLSGQEINFMKS---------------------GLVGFHDRYLGLPAVFPGRKV--ASLKYIKDRVWAYIHKWKHW

Query:  RFSVGGKDVLIKFVL--QAIPNYSMSCFQLPKSLIKDCNRIMARFWWGDAEEGRRVHWASWRKVCVSKFCGGLGFRDLELFNRALLAKQGW
          S  G+  ++K  +  +AI  ++    ++P     +    + +F W + +   R+  +  +     +  GG+   DL+L+ RA++ K  W
Subjt:  RFSVGGKDVLIKFVL--QAIPNYSMSCFQLPKSLIKDCNRIMARFWWGDAEEGRRVHWASWRKVCVSKFCGGLGFRDLELFNRALLAKQGW

P92555 Uncharacterized mitochondrial protein AtMg01250 | e value: 5.1e-05 | % identity: 57.14
Query:  INGRRLGCIRLSRGLRQGDPLSPYLFLLCADGLSRMLGWARD
        ING   G +  SRGLRQGDPLSPYLF+LC + LS +   A++
Subjt:  INGRRLGCIRLSRGLRQGDPLSPYLFLLCADGLSRMLGWARD
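
The %identity value for a hit can be cross-checked against its alignment. The sketch below is illustrative only and assumes the usual BLAST-style convention that the middle line shows a letter for an identical residue, `+` for a conservative substitution, and a space otherwise; the function name `percent_identity` is hypothetical. It uses the Query and middle lines of the P92555 alignment above.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline shows a letter (identical residue)."""
    # Pad the midline in case trailing spaces were trimmed from it.
    padded = midline.ljust(len(query))
    identical = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identical / len(query)


query = "INGRRLGCIRLSRGLRQGDPLSPYLFLLCADGLSRMLGWARD"
midline = "ING   G +  SRGLRQGDPLSPYLF+LC + LS +   A++"
print(round(percent_identity(query, midline), 2))
```

For this alignment the count is 24 identical columns out of 42, which agrees with the 57.14 reported for the hit.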

P93295 Uncharacterized mitochondrial protein AtMg00310 | e value: 4.8e-32 | % identity: 44.3
Query:  AIPNYSMSCFQLPKSLIKDCNRIMARFWWGDAEEGRRVHWASWRKVCVSK-FCGGLGFRDLELFNRALLAKQGWRLIEHPESLLCRVLKGKYFRECSFME
        A+P Y+MSCF+L K L K     M  FWW   E  R++ W +W+K+C SK   GGLGFRDL  FN+ALLAKQ +R+I  P +LL R+L+ +YF   S ME
Subjt:  AIPNYSMSCFQLPKSLIKDCNRIMARFWWGDAEEGRRVHWASWRKVCVSK-FCGGLGFRDLELFNRALLAKQGWRLIEHPESLLCRVLKGKYFRECSFME

Query:  VEHKGNGSYVWKSLLWGRGLLKERARWRVGDGRSISILRDKWIPRQSTL
               SY W+S++ GR LL       +GDG    +  D+WI  ++ L
Subjt:  VEHKGNGSYVWKSLLWGRGLLKERARWRVGDGRSISILRDKWIPRQSTL

Q05118 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment) | e value: 5.6e-04 | % identity: 35.62
Query:  KAYDRVEWFFIEKFMIALGLDLNVVSLIMRCVSSVTYSVSINGRRLGCIRLSRGLRQGDPLSPYLFLLCADGL
        KA+D V    I + M A G+D  +   IM  ++    ++ + GR    I +  G++QGDPLSP LF +  D L
Subjt:  KAYDRVEWFFIEKFMIALGLDLNVVSLIMRCVSSVTYSVSINGRRLGCIRLSRGLRQGDPLSPYLFLLCADGL

Arabidopsis top hits (e value, % identity, alignment)
AT2G22440.1 BEST Arabidopsis thaliana protein match is: Ribonuclease H-like superfamily protein (TAIR:AT4G29090.1) | e value: 4.2e-07 | % identity: 26.6
Query:  ILRDKWIPR-----QSTLRVIHGEGLHPEDRVEMLLSSNNTWDVELVQGLFCEEDARAILGIPRPRGQSQDKLIWHYEKNGFYTVKSGYKLACSLKGEAS
        + +D WIP        ++  I    L+  D ++    + N W ++ +Q L    D   ILGI   R    D   W + K+G YTVKSGY +A  L     
Subjt:  ILRDKWIPR-----QSTLRVIHGEGLHPEDRVEMLLSSNNTWDVELVQGLFCEEDARAILGIPRPRGQSQDKLIWHYEKNGFYTVKSGYKLACSLKGEAS

Query:  SLDSEKLKQWWKFVCSLFEEFVVLCWWIWNRRNKELFGGRQGLEVEDGGWGWVSEYLTQ------FRAF-QGRKESSGAGVREVAVWR
            +     +    SLF  F  L W     R++E   G + LE+    + W+  Y+ +      F  F +   E+    ++E AVW+
Subjt:  SLDSEKLKQWWKFVCSLFEEFVVLCWWIWNRRNKELFGGRQGLEVEDGGWGWVSEYLTQ------FRAF-QGRKESSGAGVREVAVWR

AT3G09510.1 Ribonuclease H-like superfamily protein | e value: 1.8e-13 | % identity: 31.16
Query:  LKGKYFRECSFMEVEHKGNGSYVWKSLLWGRGLLKERARWRVGDGRSISILRDKWIPRQSTLRVIHGEGLHPEDRVEMLLSSNNT---WDVELVQGLFCE
        +K +YF++ S ++ + +   SY W SLL G  LLK+  R  +GDG++I I  D  +      R ++ E  + E  +  L     +   WD   +     +
Subjt:  LKGKYFRECSFMEVEHKGNGSYVWKSLLWGRGLLKERARWRVGDGRSISILRDKWIPRQSTLRVIHGEGLHPEDRVEMLLSSNNT---WDVELVQGLFCE

Query:  EDARAILGIPRPRGQSQDKLIWHYEKNGFYTVKSGYKL
         D   I  I   + +  DK+IW+Y   G YTV+SGY L
Subjt:  EDARAILGIPRPRGQSQDKLIWHYEKNGFYTVKSGYKL

AT4G29090.1 Ribonuclease H-like superfamily protein | e value: 6.1e-46 | % identity: 25.4
Query:  AIPNYSMSCFQLPKSLIKDCNRIMARFWWGDAEEGRRVHWASWRKVCVSKFCGGLGFRDLELFNRALLAKQGWRLIEHPESLLCRVLKGKYFRECSFMEV
        A+P Y+M+CF LPK++ K    ++A FWW + +E + +HW +W  +   K  GG+GF+D+E FN ALL KQ WR++  PESL+ +V K +YF +   +  
Subjt:  AIPNYSMSCFQLPKSLIKDCNRIMARFWWGDAEEGRRVHWASWRKVCVSKFCGGLGFRDLELFNRALLAKQGWRLIEHPESLLCRVLKGKYFRECSFMEV

Query:  EHKGNGSYVWKSLLWGRGLLKERARWRVGDGRSISILRDKWIPRQSTLRVIHGEGLHPEDRV---------EMLLSSNNTWDVELVQGLFCEEDARAILG
              S+VWKS+   + +L++ AR  VG+G  I I R KW+  +     +  + + P++           +++  S   W  ++++ LF E + R ++G
Subjt:  EHKGNGSYVWKSLLWGRGLLKERARWRVGDGRSISILRDKWIPRQSTLRVIHGEGLHPEDRV---------EMLLSSNNTWDVELVQGLFCEEDARAILG

Query:  IPRPRGQS-QDKLIWHYEKNGFYTVKSGYKLACSLKGEAS--------SLDSEKLKQW------------WK----------------------------
          RP G+   D   W Y  +G YTVKSGY +   +  + S        SL+    K W            WK                            
Subjt:  IPRPRGQS-QDKLIWHYEKNGFYTVKSGYKLACSLKGEAS--------SLDSEKLKQW------------WK----------------------------

Query:  ---------FVC--------------------------SLFEEFVV----------------LCWWIWNRRNKELFGGRQGLEVEDGGWGWVSEYLTQFR
                 F C                          +L+  F +                L W +W  RN+ +F GR     E      +        
Subjt:  ---------FVC--------------------------SLFEEFVV----------------LCWWIWNRRNKELFGGRQGLEVEDGGWGWVSEYLTQFR

Query:  AFQGRKESSGAGVR------EVAVWRPPIHPNYKLNTNAAVNKISQSSSVGAIVRNERGEVMLTVAKLVTLAQEVDILEALAIRDSLGLVREMGFMRLEV
         ++ R E+   G +          WRPP H   K NT+A  N+ ++   +G ++RNE+GEV    A+ +   + V   E  A+R ++  +    +  +  
Subjt:  AFQGRKESSGAGVR------EVAVWRPPIHPNYKLNTNAAVNKISQSSSVGAIVRNERGEVMLTVAKLVTLAQEVDILEALAIRDSLGLVREMGFMRLEV

Query:  ESDSAKVISLLRSETSDVSDLGVLAEEIKQLARGFQFYSFRWCRRSANKVAHELARLTL
        ESDS  +I +L ++    S L    +++++L   F    F +  R  N +A  +AR +L
Subjt:  ESDSAKVISLLRSETSDVSDLGVLAEEIKQLARGFQFYSFRWCRRSANKVAHELARLTL

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 3.4e-33 | %identity: 44.3
Query:  AIPNYSMSCFQLPKSLIKDCNRIMARFWWGDAEEGRRVHWASWRKVCVSK-FCGGLGFRDLELFNRALLAKQGWRLIEHPESLLCRVLKGKYFRECSFME
        A+P Y+MSCF+L K L K     M  FWW   E  R++ W +W+K+C SK   GGLGFRDL  FN+ALLAKQ +R+I  P +LL R+L+ +YF   S ME
Subjt:  AIPNYSMSCFQLPKSLIKDCNRIMARFWWGDAEEGRRVHWASWRKVCVSK-FCGGLGFRDLELFNRALLAKQGWRLIEHPESLLCRVLKGKYFRECSFME

Query:  VEHKGNGSYVWKSLLWGRGLLKERARWRVGDGRSISILRDKWIPRQSTL
               SY W+S++ GR LL       +GDG    +  D+WI  ++ L
Subjt:  VEHKGNGSYVWKSLLWGRGLLKERARWRVGDGRSISILRDKWIPRQSTL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) | e value: 3.6e-06 | %identity: 57.14
Query:  INGRRLGCIRLSRGLRQGDPLSPYLFLLCADGLSRMLGWARD
        ING   G +  SRGLRQGDPLSPYLF+LC + LS +   A++
Subjt:  INGRRLGCIRLSRGLRQGDPLSPYLFLLCADGLSRMLGWARD


Sequences
CDS sequence
ATGAGCAAGGCTTACGACCGGGTGGAATGGTTCTTTATTGAGAAGTTTATGATTGCCCTTGGCCTTGATCTGAATGTGGTAAGCTTGATTATGAGGTGTGTGAGCTCGGT
GACATATTCGGTGTCGATTAACGGCAGGAGATTGGGGTGTATAAGACTGTCAAGGGGACTTCGTCAGGGTGATCCTTTATCACCGTATTTGTTTCTATTGTGTGCGGATG
GTTTGTCACGAATGTTGGGTTGGGCAAGAGATAATGAGGTGAGGTTGGTGGCAAGAACTCTGTCAGCTTTTTCTAGGCTATCTGGCCAAGAGATAAACTTTATGAAGTCT
GGGTTGGTGGGTTTCCATGACCGTTACTTGGGCCTTCCAGCTGTGTTCCCAGGGAGGAAAGTCGCTTCGTTGAAGTATATAAAGGATCGAGTCTGGGCATATATTCATAA
GTGGAAACACTGGAGGTTTTCGGTTGGAGGGAAAGATGTTCTCATAAAATTTGTTCTGCAAGCGATCCCTAACTACTCTATGTCATGCTTCCAACTCCCAAAGAGTTTGA
TTAAAGATTGTAACCGTATTATGGCGAGATTCTGGTGGGGGGATGCTGAGGAAGGGAGGAGAGTTCATTGGGCCTCTTGGAGGAAGGTGTGTGTGTCGAAATTTTGTGGG
GGCTTGGGTTTTAGGGACTTGGAGTTGTTTAACAGGGCGCTCCTAGCGAAACAAGGGTGGAGATTGATCGAGCACCCTGAATCTTTGCTATGTAGGGTGCTTAAGGGGAA
GTATTTTAGGGAGTGTTCTTTTATGGAAGTTGAGCATAAGGGGAATGGGTCTTATGTGTGGAAAAGTTTGTTGTGGGGGAGAGGGTTACTTAAGGAAAGGGCTAGATGGA
GGGTGGGGGATGGAAGATCTATTAGTATTCTAAGGGACAAGTGGATACCGCGACAATCTACGTTGAGAGTGATCCATGGGGAGGGTCTGCACCCGGAGGATAGGGTGGAG
ATGTTGTTGTCCTCAAACAACACATGGGATGTTGAACTAGTGCAAGGTCTGTTCTGTGAAGAGGATGCAAGGGCTATTCTTGGGATCCCGAGACCAAGGGGGCAATCTCA
AGATAAGCTTATTTGGCACTACGAAAAGAACGGTTTTTACACAGTCAAGAGTGGATACAAGCTGGCGTGTTCTTTGAAGGGAGAAGCCAGTAGTTTAGATTCAGAGAAAT
TAAAGCAGTGGTGGAAATTTGTATGTTCCCTTTTCGAGGAGTTCGTGGTCCTTTGCTGGTGGATCTGGAATAGACGGAACAAAGAGCTCTTTGGCGGAAGGCAGGGTTTG
GAGGTGGAAGATGGAGGGTGGGGTTGGGTTTCTGAGTATTTGACCCAATTTCGGGCTTTCCAAGGAAGGAAGGAGTCGTCAGGGGCGGGGGTGAGGGAGGTGGCGGTTTG
GAGGCCTCCGATTCATCCTAACTATAAGCTTAACACAAATGCAGCAGTAAACAAAATCTCGCAATCAAGTAGTGTGGGTGCGATTGTGAGGAATGAAAGAGGAGAAGTAA
TGCTTACAGTTGCAAAGTTGGTTACGCTTGCGCAAGAGGTGGACATCTTGGAAGCTTTGGCGATTCGGGATAGTCTGGGCCTTGTGAGGGAAATGGGATTCATGAGGTTG
GAGGTGGAGTCAGATTCGGCAAAGGTGATTTCGTTGCTTCGGTCGGAGACGAGTGATGTATCAGATCTGGGAGTGCTTGCAGAGGAGATTAAGCAGTTGGCGAGGGGCTT
TCAGTTCTACTCCTTCAGGTGGTGTAGGAGATCGGCGAATAAGGTGGCCCATGAGCTGGCGAGATTGACGTTGGAGCTGGGGCGTGATGGTACCTGGCTGGAGGAAGTGC
TGATAGAAATTGAAGGGGTGTATTTCTCTGAGGTTTTGGACAGCGTCTCTGTGCTGTCGGGAAGGGGTTGTTTTGTGTAA
mRNA sequence
ATGAGCAAGGCTTACGACCGGGTGGAATGGTTCTTTATTGAGAAGTTTATGATTGCCCTTGGCCTTGATCTGAATGTGGTAAGCTTGATTATGAGGTGTGTGAGCTCGGT
GACATATTCGGTGTCGATTAACGGCAGGAGATTGGGGTGTATAAGACTGTCAAGGGGACTTCGTCAGGGTGATCCTTTATCACCGTATTTGTTTCTATTGTGTGCGGATG
GTTTGTCACGAATGTTGGGTTGGGCAAGAGATAATGAGGTGAGGTTGGTGGCAAGAACTCTGTCAGCTTTTTCTAGGCTATCTGGCCAAGAGATAAACTTTATGAAGTCT
GGGTTGGTGGGTTTCCATGACCGTTACTTGGGCCTTCCAGCTGTGTTCCCAGGGAGGAAAGTCGCTTCGTTGAAGTATATAAAGGATCGAGTCTGGGCATATATTCATAA
GTGGAAACACTGGAGGTTTTCGGTTGGAGGGAAAGATGTTCTCATAAAATTTGTTCTGCAAGCGATCCCTAACTACTCTATGTCATGCTTCCAACTCCCAAAGAGTTTGA
TTAAAGATTGTAACCGTATTATGGCGAGATTCTGGTGGGGGGATGCTGAGGAAGGGAGGAGAGTTCATTGGGCCTCTTGGAGGAAGGTGTGTGTGTCGAAATTTTGTGGG
GGCTTGGGTTTTAGGGACTTGGAGTTGTTTAACAGGGCGCTCCTAGCGAAACAAGGGTGGAGATTGATCGAGCACCCTGAATCTTTGCTATGTAGGGTGCTTAAGGGGAA
GTATTTTAGGGAGTGTTCTTTTATGGAAGTTGAGCATAAGGGGAATGGGTCTTATGTGTGGAAAAGTTTGTTGTGGGGGAGAGGGTTACTTAAGGAAAGGGCTAGATGGA
GGGTGGGGGATGGAAGATCTATTAGTATTCTAAGGGACAAGTGGATACCGCGACAATCTACGTTGAGAGTGATCCATGGGGAGGGTCTGCACCCGGAGGATAGGGTGGAG
ATGTTGTTGTCCTCAAACAACACATGGGATGTTGAACTAGTGCAAGGTCTGTTCTGTGAAGAGGATGCAAGGGCTATTCTTGGGATCCCGAGACCAAGGGGGCAATCTCA
AGATAAGCTTATTTGGCACTACGAAAAGAACGGTTTTTACACAGTCAAGAGTGGATACAAGCTGGCGTGTTCTTTGAAGGGAGAAGCCAGTAGTTTAGATTCAGAGAAAT
TAAAGCAGTGGTGGAAATTTGTATGTTCCCTTTTCGAGGAGTTCGTGGTCCTTTGCTGGTGGATCTGGAATAGACGGAACAAAGAGCTCTTTGGCGGAAGGCAGGGTTTG
GAGGTGGAAGATGGAGGGTGGGGTTGGGTTTCTGAGTATTTGACCCAATTTCGGGCTTTCCAAGGAAGGAAGGAGTCGTCAGGGGCGGGGGTGAGGGAGGTGGCGGTTTG
GAGGCCTCCGATTCATCCTAACTATAAGCTTAACACAAATGCAGCAGTAAACAAAATCTCGCAATCAAGTAGTGTGGGTGCGATTGTGAGGAATGAAAGAGGAGAAGTAA
TGCTTACAGTTGCAAAGTTGGTTACGCTTGCGCAAGAGGTGGACATCTTGGAAGCTTTGGCGATTCGGGATAGTCTGGGCCTTGTGAGGGAAATGGGATTCATGAGGTTG
GAGGTGGAGTCAGATTCGGCAAAGGTGATTTCGTTGCTTCGGTCGGAGACGAGTGATGTATCAGATCTGGGAGTGCTTGCAGAGGAGATTAAGCAGTTGGCGAGGGGCTT
TCAGTTCTACTCCTTCAGGTGGTGTAGGAGATCGGCGAATAAGGTGGCCCATGAGCTGGCGAGATTGACGTTGGAGCTGGGGCGTGATGGTACCTGGCTGGAGGAAGTGC
TGATAGAAATTGAAGGGGTGTATTTCTCTGAGGTTTTGGACAGCGTCTCTGTGCTGTCGGGAAGGGGTTGTTTTGTGTAA
Protein sequence
MSKAYDRVEWFFIEKFMIALGLDLNVVSLIMRCVSSVTYSVSINGRRLGCIRLSRGLRQGDPLSPYLFLLCADGLSRMLGWARDNEVRLVARTLSAFSRLSGQEINFMKS
GLVGFHDRYLGLPAVFPGRKVASLKYIKDRVWAYIHKWKHWRFSVGGKDVLIKFVLQAIPNYSMSCFQLPKSLIKDCNRIMARFWWGDAEEGRRVHWASWRKVCVSKFCG
GLGFRDLELFNRALLAKQGWRLIEHPESLLCRVLKGKYFRECSFMEVEHKGNGSYVWKSLLWGRGLLKERARWRVGDGRSISILRDKWIPRQSTLRVIHGEGLHPEDRVE
MLLSSNNTWDVELVQGLFCEEDARAILGIPRPRGQSQDKLIWHYEKNGFYTVKSGYKLACSLKGEASSLDSEKLKQWWKFVCSLFEEFVVLCWWIWNRRNKELFGGRQGL
EVEDGGWGWVSEYLTQFRAFQGRKESSGAGVREVAVWRPPIHPNYKLNTNAAVNKISQSSSVGAIVRNERGEVMLTVAKLVTLAQEVDILEALAIRDSLGLVREMGFMRL
EVESDSAKVISLLRSETSDVSDLGVLAEEIKQLARGFQFYSFRWCRRSANKVAHELARLTLELGRDGTWLEEVLIEIEGVYFSEVLDSVSVLSGRGCFV
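
The CDS and protein sequences above can be cross-checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene and that the CDS starts in frame at its first base. The function name `translate` and the constants are hypothetical names chosen for illustration; they are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence lines
    from the page can be pasted in directly. Codons containing characters
    other than A, C, G, T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGAGCAAGGCTTACGACCGG"))
```

Translating the first 21 bases of the CDS gives MSKAYDR, which matches the start of the listed protein sequence. Passing the full CDS (with its line breaks) and comparing the result with the full protein sequence would extend the same check to the whole record.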