CuGenDBv2

Lag0018774 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0018774
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr5:34182897..34185230
RNA-Seq Expression: Lag0018774
Synteny: Lag0018774
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily
IPR044730 - Ribonuclease H-like domain, plant type
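
The genome location field in this record uses a `seqid:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this), the string can be split into its parts and the span of the gene computed as follows. The names `Location` and `parse_location` are hypothetical helpers introduced only for this illustration and are not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval on one sequence (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive,
        # so both endpoints count toward the length.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Location(seqid, start, end)


loc = parse_location("chr5:34182897..34185230")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the interval for Lag0018774 spans 2334 positions on chr5; with half-open coordinates the figure would be one less.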


Homology
GenBank top hits (e value, % identity, alignment)
XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber] (e value: 4.6e-110; identity: 33.87%)
Query:  MGFSSNWIQKVMSCISSVCYLVLVNGEHQTEFKPSRGLRQGDPLSPYLFLLCAEGFSALLERKESLSKISGFKINKLCPSLTHLFFADDSLVFLKAKYSD
        MGF + W   VM CI+SV Y +L+NG       PSRGLRQGDPLSP LFLLCAEG SAL+ +      I+G  IN+ CP +THLFFADDS++F KA Y +
Subjt:  MGFSSNWIQKVMSCISSVCYLVLVNGEHQTEFKPSRGLRQGDPLSPYLFLLCAEGFSALLERKESLSKISGFKINKLCPSLTHLFFADDSLVFLKAKYSD

Query:  LRAFKQVLKHYEGASGQTINLDKSTFMASKNVKDETLAKCEEVLGIKRSSSLGQYLGMPSQTGRNKGVVFKRIKDRVEKTLQGWKGNLFSMGGKEVLIKA
            + +L  YE ASGQ IN DKS+   S N   ET  +   +LG  ++S   +YLG+PS  GR+K  VF  +K++V   L GWKG L SMGGKE+LIKA
Subjt:  LRAFKQVLKHYEGASGQTINLDKSTFMASKNVKDETLAKCEEVLGIKRSSSLGQYLGMPSQTGRNKGVVFKRIKDRVEKTLQGWKGNLFSMGGKEVLIKA

Query:  VAQAIPVYTMSCFRLPTNICSFIDRLCAKLWWGSAGNKDKTHWISWKKMCKNKS-----------------PKQSWRLIRNPNSLLFKILRGRYFKGSNF
        VAQAIP YTMSCF LP  +C  ++R+    WWG    + K  WISWK+MC +K+                  KQ+WR++ NPNSL+ ++L+ RYF   + 
Subjt:  VAQAIPVYTMSCFRLPTNICSFIDRLCAKLWWGSAGNKDKTHWISWKKMCKNKS-----------------PKQSWRLIRNPNSLLFKILRGRYFKGSNF

Query:  LEAPIGNSPSLTCRE----------------GNPRVITL------------------SNSYVGLRVKDFLDDNNK-WKENAIREAFSPQDASDILNMHAG
        L A +G+SPS + R                 GN + I +                   +++    V   +D + K WK  A+R  F P +   IL +   
Subjt:  LEAPIGNSPSLTCRE----------------GNPRVITL------------------SNSYVGLRVKDFLDDNNK-WKENAIREAFSPQDASDILNMHAG

Query:  SKDSKDEIVWSFDKKGIFSVKSAYRLASKGLSTYEASH-SNQSKEAAFWNSIWKANVLPRSKVCAWRIINDIIPTKANALKKGVDLIPLCSFCGKYKESS
            +D+++W  +KKG FSVKSAY +A   +   E    SN       W  +W  N+  + K+ AWR   D +PT  N  K+G+     C  CG   E  
Subjt:  SKDSKDEIVWSFDKKGIFSVKSAYRLASKGLSTYEASH-SNQSKEAAFWNSIWKANVLPRSKVCAWRIINDIIPTKANALKKGVDLIPLCSFCGKYKESS

Query:  SHLIWECKMVKSIWSY---------------------------------FVPSSLTMW----SLCREDWKPKDYWGWMEANLNREDIDRSIIIMLRTPA-
        +H +  C+    +W +                                 F   S  +W     +   D        W+ AN   ED  ++  + +  P  
Subjt:  SHLIWECKMVKSIWSY---------------------------------FVPSSLTMW----SLCREDWKPKDYWGWMEANLNREDIDRSIIIMLRTPA-

Query:  SQAQWEKPRPNTWKLNSDATWMEKEGCRGLGWLVRDSEGSLICFGMKQVKQNWAIKNLEACAMLEGIKEVSDTCNRLGIHLEVETDANEVVRVLNGEEED
        SQ +WE P    +K+N D    ++     +G ++RDS G ++    K +   +A   +EA A+ +GI    D   +L   + +E DA  V++ LN +   
Subjt:  SQAQWEKPRPNTWKLNSDATWMEKEGCRGLGWLVRDSEGSLICFGMKQVKQNWAIKNLEACAMLEGIKEVSDTCNRLGIHLEVETDANEVVRVLNGEEED

Query:  LSDLKTFTDEIKALADRAFSMKFSHCNRLLNTAAHCVARSA
         ++L      I+++++      F H NR  N  AH +A+ A
Subjt:  LSDLKTFTDEIKALADRAFSMKFSHCNRLLNTAAHCVARSA

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber] (e value: 1.2e-110; identity: 33.42%)
Query:  MGFSSNWIQKVMSCISSVCYLVLVNGEHQTEFKPSRGLRQGDPLSPYLFLLCAEGFSALLERKESLSKISGFKINKLCPSLTHLFFADDSLVFLKAKYSD
        MGF   WI+ VM CI+SV Y +LVNG       P+RGLRQGDP+SPY+FLLCA+GFS+LL       +ISG  I + CP +THLFFADDSL+F KA   +
Subjt:  MGFSSNWIQKVMSCISSVCYLVLVNGEHQTEFKPSRGLRQGDPLSPYLFLLCAEGFSALLERKESLSKISGFKINKLCPSLTHLFFADDSLVFLKAKYSD

Query:  LRAFKQVLKHYEGASGQTINLDKSTFMASKNVKDETLAKCEEVLGIKRSSSLGQYLGMPSQTGRNKGVVFKRIKDRVEKTLQGWKGNLFSMGGKEVLIKA
         +    +L+ YE ASGQ IN+DKS+   S N  DE   +   +LG  + +   +YLG+PS  G++K  +F  +K+RVE+ L GWK  L S+GG+E+LIKA
Subjt:  LRAFKQVLKHYEGASGQTINLDKSTFMASKNVKDETLAKCEEVLGIKRSSSLGQYLGMPSQTGRNKGVVFKRIKDRVEKTLQGWKGNLFSMGGKEVLIKA

Query:  VAQAIPVYTMSCFRLPTNICSFIDRLCAKLWWGSAGNKDKTHWISWKKMCKNKS-----------------PKQSWRLIRNPNSLLFKILRGRYFKGSNF
        VAQAIP YTMSCF++P  +C  I+ +  + WWG  G + K  W+SWKK+CK K                   KQ WRLI NPNSL+ +I + RY+   + 
Subjt:  VAQAIPVYTMSCFRLPTNICSFIDRLCAKLWWGSAGNKDKTHWISWKKMCKNKS-----------------PKQSWRLIRNPNSLLFKILRGRYFKGSNF

Query:  LEAPIGNSPSLTCRE----------------GN-PRVITLSNSYVGL-----------------RVKDFLD-DNNKWKENAIREAFSPQDASDILNMHAG
         +A +G SPS T R                 GN  R++   + ++                   RV   +D +  +WK++ +R+ F P +A  IL++   
Subjt:  LEAPIGNSPSLTCRE----------------GN-PRVITLSNSYVGL-----------------RVKDFLD-DNNKWKENAIREAFSPQDASDILNMHAG

Query:  SKDSKDEIVWSFDKKGIFSVKSAYRLASKGLSTYEASHSNQ-SKEAAFWNSIWKANVLPRSKVCAWRIINDIIPTKANALKKGVDLIPLCSFCGKYKESS
            +D+I+W  ++KG FSVKSAY +A   +   E   S+     +  W  +W  N+ P+ ++ AW++  + +PT  N L+KGV++  +C  CG   ES+
Subjt:  SKDSKDEIVWSFDKKGIFSVKSAYRLASKGLSTYEASHSNQ-SKEAAFWNSIWKANVLPRSKVCAWRIINDIIPTKANALKKGVDLIPLCSFCGKYKESS

Query:  SHLIWECKMVKSIW-------------------------SYFVPSSLTM-----WSL-CRED--------WKPKDYWGWMEANLNREDIDRSIIIMLRTP
         H+  +C++ K +W                          +  PS L +     W++ C  +          P+  WG+    +  E  + S       P
Subjt:  SHLIWECKMVKSIW-------------------------SYFVPSSLTM-----WSL-CRED--------WKPKDYWGWMEANLNREDIDRSIIIMLRTP

Query:  ASQAQWEKPRPNTWKLNSDATWMEKEGCRGLGWLVRDSEGSLICFGMKQVKQNWAIKNLEACAMLEGIKEVSDTCNRLGIHLEVETDANEVVRVLNGEEE
         S  +W  P P  +K+N D    E      +G ++RD+ G +       ++  ++++ +EA AM  G+    +   +L   + +E+DA  VV  +N  E 
Subjt:  ASQAQWEKPRPNTWKLNSDATWMEKEGCRGLGWLVRDSEGSLICFGMKQVKQNWAIKNLEACAMLEGIKEVSDTCNRLGIHLEVETDANEVVRVLNGEEE

Query:  DLSDLKTFTDEIKALADRAFSMKFSHCNRLLNTAAHCVARSAASK
            L      I +L       K +H  R  N AAH +A+ A  K
Subjt:  DLSDLKTFTDEIKALADRAFSMKFSHCNRLLNTAAHCVARSAASK

XP_023901742.1 uncharacterized protein LOC112013579 [Quercus suber] (e value: 2.6e-105; identity: 32.84%)
Query:  MGFSSNWIQKVMSCISSVCYLVLVNGEHQTEFKPSRGLRQGDPLSPYLFLLCAEGFSALLERKESLSKISGFKINKLCPSLTHLFFADDSLVFLKAKYSD
        +GF   WI  +M+C+SSV Y VL+NGE      PSRG+RQGDPLSP LFLLCAEG SAL+       +I+G  I + CP +THLFFADDSL+F KAK  +
Subjt:  MGFSSNWIQKVMSCISSVCYLVLVNGEHQTEFKPSRGLRQGDPLSPYLFLLCAEGFSALLERKESLSKISGFKINKLCPSLTHLFFADDSLVFLKAKYSD

Query:  LRAFKQVLKHYEGASGQTINLDKSTFMASKNVKDETLAKCEEVLGIKRSSSLGQYLGMPSQTGRNKGVVFKRIKDRVEKTLQGWKGNLFSMGGKEVLIKA
          A   +L  YE ASGQ IN DKS+   S N   E       +LG  + S   +YLG+PS  G++K  VF  +KDRV K L GWKG L S+GG+E+LIKA
Subjt:  LRAFKQVLKHYEGASGQTINLDKSTFMASKNVKDETLAKCEEVLGIKRSSSLGQYLGMPSQTGRNKGVVFKRIKDRVEKTLQGWKGNLFSMGGKEVLIKA

Query:  VAQAIPVYTMSCFRLPTNICSFIDRLCAKLWWGSAGNKDKTHWISWKKMCKNK-----------------SPKQSWRLIRNPNSLLFKILRGRYFKGSNF
        VAQA+P YTMSCF+LP  +C  ++ L    WWG    ++K  W+SW+KMC++K                   KQ WR++ NPNSL+ ++ + +YF   + 
Subjt:  VAQAIPVYTMSCFRLPTNICSFIDRLCAKLWWGSAGNKDKTHWISWKKMCKNK-----------------SPKQSWRLIRNPNSLLFKILRGRYFKGSNF

Query:  LEAPIGNSPSLTCRE----------------GNPRVITL----------SNSYVGLRVKDFLD----------DNNKWKENAIREAFSPQDASDILNMHA
        L +  G++PS   R                 GN R I +          ++  V  RV D+ D          D  +WK + I   F P +A+ IL +  
Subjt:  LEAPIGNSPSLTCRE----------------GNPRVITL----------SNSYVGLRVKDFLD----------DNNKWKENAIREAFSPQDASDILNMHA

Query:  GSKDSKDEIVWSFDKKGIFSVKSAYRLASKGLSTYEASHSNQSKE-AAFWNSIWKANVLPRSKVCAWRIINDIIPTKANALKKGVDLIPLCSFCGKYKES
             +D ++W  +K+G F+VKSAY +AS  + + E   S      +  W  IW+  V P+ K+ AWR   + +PT  N   +GV     C  C K  E+
Subjt:  GSKDSKDEIVWSFDKKGIFSVKSAYRLASKGLSTYEASHSNQSKE-AAFWNSIWKANVLPRSKVCAWRIINDIIPTKANALKKGVDLIPLCSFCGKYKES

Query:  SSHLIWECKMVKSIWSYFVPSSLTM-----------------------------WSL-------CREDWKPKDYWGW-MEANLNREDIDRSIIIMLRTPA
         +H +  C+  K  W+++  S + +                             WS+         ED        W +   +  E     +   L    
Subjt:  SSHLIWECKMVKSIWSYFVPSSLTM-----------------------------WSL-------CREDWKPKDYWGW-MEANLNREDIDRSIIIMLRTPA

Query:  SQAQWEKPRPNTWKLNSDATWMEKEGCRGLGWLVRDSEGSLICFGMKQVKQNWAIKNLEACAMLEGI-----KEVSDTCNRLGIHLEVETDANEVVRVLN
          ++W+ P    +K+N+DA   + E C  +G ++RD  G ++    K +  ++  +  EA AM EG+      EVS        H   E+D+  +++ ++
Subjt:  SQAQWEKPRPNTWKLNSDATWMEKEGCRGLGWLVRDSEGSLICFGMKQVKQNWAIKNLEACAMLEGI-----KEVSDTCNRLGIHLEVETDANEVVRVLN

Query:  GEEEDLSDLKTFTDEIKALADRAFSMKFSHCNRLLNTAAHCVARSA
         E+    +       I  +A       F H  R  N  AH +AR A
Subjt:  GEEEDLSDLKTFTDEIKALADRAFSMKFSHCNRLLNTAAHCVARSA

XP_030939975.1 uncharacterized protein LOC115964883 [Quercus lobata] (e value: 1.7e-104; identity: 32.45%)
Query:  MGFSSNWIQKVMSCISSVCYLVLVNGEHQTEFKPSRGLRQGDPLSPYLFLLCAEGFSALLERKESLSKISGFKINKLCPSLTHLFFADDSLVFLKAKYSD
        +GF + WI  V SCI SV + VLVNGE    F P+RGLRQGDPLSPYLFLLCAEG  +L+++ E    I G  +    P ++HLFFADDSL+F +A   +
Subjt:  MGFSSNWIQKVMSCISSVCYLVLVNGEHQTEFKPSRGLRQGDPLSPYLFLLCAEGFSALLERKESLSKISGFKINKLCPSLTHLFFADDSLVFLKAKYSD

Query:  LRAFKQVLKHYEGASGQTINLDKSTFMASKNVKDETLAKCEEVLGIKRSSSLGQYLGMPSQTGRNKGVVFKRIKDRVEKTLQGWKGNLFSMGGKEVLIKA
          +  ++LK YE ASGQ IN +K+    S N       + + +LG+  +++  +YLG+PS  GR K   F  I++R+   +QGWK  L S GG+EVLIKA
Subjt:  LRAFKQVLKHYEGASGQTINLDKSTFMASKNVKDETLAKCEEVLGIKRSSSLGQYLGMPSQTGRNKGVVFKRIKDRVEKTLQGWKGNLFSMGGKEVLIKA

Query:  VAQAIPVYTMSCFRLPTNICSFIDRLCAKLWWGSAGNKDKTHWISWKKMCKNKS-----------------PKQSWRLIRNPNSLLFKILRGRYFKGSNF
        V QA+P +TM CF++P ++C  I+ L  K WWG  G   K HW+ WKK+CK+KS                  KQ WRLI N +SL +K+ + ++F   + 
Subjt:  VAQAIPVYTMSCFRLPTNICSFIDRLCAKLWWGSAGNKDKTHWISWKKMCKNKS-----------------PKQSWRLIRNPNSLLFKILRGRYFKGSNF

Query:  LEA--------------------------PIGNSPSLTCREG-------NPRVITLSNSYV-GLRVKDFLDDNNK-WKENAIREAFSPQDASDILNMHAG
        L+                            IG+  S+  R         + RV++   ++    RV   +D+ N+ W E+ IRE F P +A  IL++   
Subjt:  LEA--------------------------PIGNSPSLTCREG-------NPRVITLSNSYV-GLRVKDFLDDNNK-WKENAIREAFSPQDASDILNMHAG

Query:  SKDSKDEIVWSFDKKGIFSVKSAYRLASKGLSTYEASHSNQSKEAAFWNSIWKANVLPRSKVCAWRIINDIIPTKANALKKGVDLIPLCSFCGKYKESSS
            +D ++W+    G ++ KSAYRL  +         SN + +  FW  +W  NV  + +   WR  ND +PTK N LK+ +     C  CG   E   
Subjt:  SKDSKDEIVWSFDKKGIFSVKSAYRLASKGLSTYEASHSNQSKEAAFWNSIWKANVLPRSKVCAWRIINDIIPTKANALKKGVDLIPLCSFCGKYKESSS

Query:  HLIWECKMVKSIWSYFVPSSLTMWSL--CREDWKPK-----------------------DYWGWM-----------EANLNREDIDRSIIIMLR------
        H IW C+M+K +W          W L  CRE    K                        + GW              +L  E I R  +  LR      
Subjt:  HLIWECKMVKSIWSYFVPSSLTMWSL--CREDWKPK-----------------------DYWGWM-----------EANLNREDIDRSIIIMLR------

Query:  -TPASQ------AQWEKPRPNTWKLNSDATWMEKEGCRGLGWLVRDSEGSLICFGMKQVKQNWAIKNLEACAMLEGIKEVSDTCNRLGIH-LEVETDANE
          P  Q        W  P P+ +K+N D          GLG ++RDSEG +I    +++     +  LEA A    I         LG+  +  E D+  
Subjt:  -TPASQ------AQWEKPRPNTWKLNSDATWMEKEGCRGLGWLVRDSEGSLICFGMKQVKQNWAIKNLEACAMLEGIKEVSDTCNRLGIH-LEVETDANE

Query:  VVRVLNGEEEDLSDLKTFTDEIKALADRAFSMKFSHCNRLLNTAAHCVARSAASKFSP
        V ++L  E+  ++      DE ++LA    S  F+H  R  N  A  +A+ A + + P
Subjt:  VVRVLNGEEEDLSDLKTFTDEIKALADRAFSMKFSHCNRLLNTAAHCVARSAASKFSP

XP_030946812.1 uncharacterized protein LOC115971195 [Quercus lobata] (e value: 4.0e-106; identity: 32.35%)
Query:  MGFSSNWIQKVMSCISSVCYLVLVNGEHQTEFKPSRGLRQGDPLSPYLFLLCAEGFSALLERKESLSKISGFKINKLCPSLTHLFFADDSLVFLKAKYSD
        +GF + WI  V SCI SV + VLVNGE    F P+RGLRQGDPLSPYLFLLCAEG  +L+++ E    I G  +    P ++HLFFADDSL+F +A   +
Subjt:  MGFSSNWIQKVMSCISSVCYLVLVNGEHQTEFKPSRGLRQGDPLSPYLFLLCAEGFSALLERKESLSKISGFKINKLCPSLTHLFFADDSLVFLKAKYSD

Query:  LRAFKQVLKHYEGASGQTINLDKSTFMASKNVKDETLAKCEEVLGIKRSSSLGQYLGMPSQTGRNKGVVFKRIKDRVEKTLQGWKGNLFSMGGKEVLIKA
        + +  ++LK YE ASGQ IN +K+    S N       + + +LG+  +++  +YLG+PS  GR K   F  I++RV + +QGWK  L S GG+EVLIKA
Subjt:  LRAFKQVLKHYEGASGQTINLDKSTFMASKNVKDETLAKCEEVLGIKRSSSLGQYLGMPSQTGRNKGVVFKRIKDRVEKTLQGWKGNLFSMGGKEVLIKA

Query:  VAQAIPVYTMSCFRLPTNICSFIDRLCAKLWWGSAGNKDKTHWISWKKMCKNKS-----------------PKQSWRLIRNPNSLLFKILRGRYFKGSNF
        V QA+P +TM CF+LP ++C  I+ L  K WWG  G   K HW+ WKK+CK+KS                  KQ WRLI N +SL +K+ + +YF   + 
Subjt:  VAQAIPVYTMSCFRLPTNICSFIDRLCAKLWWGSAGNKDKTHWISWKKMCKNKS-----------------PKQSWRLIRNPNSLLFKILRGRYFKGSNF

Query:  LEA--------------------------PIGNSPSLTCREG-------NPRVITLSNSYV-GLRVKDFLDDNNK-WKENAIREAFSPQDASDILNMHAG
        L+                            IG+  S+  R         + RV++   ++    RV   +D+ N+ W E+ IRE F P +A  IL++   
Subjt:  LEA--------------------------PIGNSPSLTCREG-------NPRVITLSNSYV-GLRVKDFLDDNNK-WKENAIREAFSPQDASDILNMHAG

Query:  SKDSKDEIVWSFDKKGIFSVKSAYRLASKGLSTYEASHSNQSKEAAFWNSIWKANVLPRSKVCAWRIINDIIPTKANALKKGVDLIPLCSFCGKYKESSS
            +D ++W+    G ++ KSAYRL  +       S SN + E  FW  +W  NV  + +   WR  ND +P K N  K+ +    +C  CG   E   
Subjt:  SKDSKDEIVWSFDKKGIFSVKSAYRLASKGLSTYEASHSNQSKEAAFWNSIWKANVLPRSKVCAWRIINDIIPTKANALKKGVDLIPLCSFCGKYKESSS

Query:  HLIWECKMVKSIW------------SYFVPSSLTMWSLCREDWKPKDYWG------WMEANLNR--------EDIDRSIIIMLR-------------TPA
        H +W C+M+K +W            ++     L    L +++    + +G      W + N  R        E I R  +  LR             T  
Subjt:  HLIWECKMVKSIW------------SYFVPSSLTMWSLCREDWKPKDYWG------WMEANLNR--------EDIDRSIIIMLR-------------TPA

Query:  SQAQWEKPRPNTWKLNSDATWMEKEGCRGLGWLVRDSEGSLICFGMKQVKQNWAIKNLEACAMLEGIKEVSDTCNRLGIH-LEVETDANEVVRVLNGEEE
            W    P+ +K+N D          GLG +VRDSEG +I    +++     +  LEA A    I    +    LG+  +  E D+  + ++L  E+ 
Subjt:  SQAQWEKPRPNTWKLNSDATWMEKEGCRGLGWLVRDSEGSLICFGMKQVKQNWAIKNLEACAMLEGIKEVSDTCNRLGIH-LEVETDANEVVRVLNGEEE

Query:  DLSDLKTFTDEIKALADRAFSMKFSHCNRLLNTAAHCVARSAASKFSP
         +S      +E ++L+    S  F+H  R  N  A  +A+ A + + P
Subjt:  DLSDLKTFTDEIKALADRAFSMKFSHCNRLLNTAAHCVARSAASKFSP

TrEMBL top hits (e value, % identity, alignment)
A0A5E4GGB8 PREDICTED: reverse mRNAase (Fragment) (e value: 1.8e-104; identity: 39.53%)
Query:  MGFSSNWIQKVMSCISSVCYLVLVNGEHQTEFKPSRGLRQGDPLSPYLFLLCAEGFSALLERKESLSKISGFKINKLCPSLTHLFFADDSLVFLKAKYSD
        MGF   W++ VM C+++V Y  LVNGE      P+RGLRQGDPLSPYLFLLCAEGF+ LL + E   ++ G  I +  P+++HLFFADDS VF KA  ++
Subjt:  MGFSSNWIQKVMSCISSVCYLVLVNGEHQTEFKPSRGLRQGDPLSPYLFLLCAEGFSALLERKESLSKISGFKINKLCPSLTHLFFADDSLVFLKAKYSD

Query:  LRAFKQVLKHYEGASGQTINLDKSTFMASKNVKDETLAKCEEVLGIKRSSSLGQYLGMPSQTGRNKGVVFKRIKDRVEKTLQGWKGNLFSMGGKEVLIKA
            K + + YE ASGQ IN  KS    S N+  +T ++   VLG+ R  S   YLG+P   GRNK   F+ +K+RV K LQGW+    S+ GKEVL+K 
Subjt:  LRAFKQVLKHYEGASGQTINLDKSTFMASKNVKDETLAKCEEVLGIKRSSSLGQYLGMPSQTGRNKGVVFKRIKDRVEKTLQGWKGNLFSMGGKEVLIKA

Query:  VAQAIPVYTMSCFRLPTNICSFIDRLCAKLWWGSAGNKDKTHWISWKKMCKNKS-----------------PKQSWRLIRNPNSLLFKILRGRYFKGSNF
        VAQ+IP+Y MSCF LP  +C  I+++ A+ WWG  G   K HW+ W+++CK K+                  KQ WRL+ NP+SL  ++L+ +YF  +NF
Subjt:  VAQAIPVYTMSCFRLPTNICSFIDRLCAKLWWGSAGNKDKTHWISWKKMCKNKS-----------------PKQSWRLIRNPNSLLFKILRGRYFKGSNF

Query:  LEAPIGNSPSLTCR------------------EGN----------PRVITLS------NSYVGLRVKDFL--DDNNKWKENAIREAFSPQDASDILNMHA
         EA +G+ PS   +                  +G           PR  T +      +     +V + +  + + +W    +   F P D  DI+ +  
Subjt:  LEAPIGNSPSLTCR------------------EGN----------PRVITLS------NSYVGLRVKDFL--DDNNKWKENAIREAFSPQDASDILNMHA

Query:  GSKDSKDEIVWSFDKKGIFSVKSAYRLASKGLSTYE-ASHSNQSKEAAFWNSIWKANVLPRSKVCAWRIINDIIPTKANALKKGVDLIPLCSFCGKYKES
          +   D IVW++DK G+F+VKSAYR+A +  S  E  S S+ S  +  W  IW ANV  + K+ AWR+ +DI+PTKAN +KKGVD+  +C FCG   ES
Subjt:  GSKDSKDEIVWSFDKKGIFSVKSAYRLASKGLSTYE-ASHSNQSKEAAFWNSIWKANVLPRSKVCAWRIINDIIPTKANALKKGVDLIPLCSFCGKYKES

Query:  SSHLIWECKMVKSIWS
        + H++  C    + W+
Subjt:  SSHLIWECKMVKSIWS

A0A7N2LIH6 Uncharacterized protein (e value: 1.1e-104; identity: 32.71%)
Query:  MGFSSNWIQKVMSCISSVCYLVLVNGEHQTEFKPSRGLRQGDPLSPYLFLLCAEGFSALLERKESLSKISGFKINKLCPSLTHLFFADDSLVFLKAKYSD
        MGF   WI  +M C++SV + VL+NGE +  F PSRGLRQGDP+SPYLFLLC EG SA++++KE    I G    +  P ++HLFFADDS++F +A   +
Subjt:  MGFSSNWIQKVMSCISSVCYLVLVNGEHQTEFKPSRGLRQGDPLSPYLFLLCAEGFSALLERKESLSKISGFKINKLCPSLTHLFFADDSLVFLKAKYSD

Query:  LRAFKQVLKHYEGASGQTINLDKSTFMASKNVKDETLAKCEEVLGIKRSSSLGQYLGMPSQTGRNKGVVFKRIKDRVEKTLQGWKGNLFSMGGKEVLIKA
             +VL+ YE  SGQ +N DK++   S+N KDE     + + G +      +YLG+P   GR K   F RIKD+V + + GWKG L S  G+EVLIKA
Subjt:  LRAFKQVLKHYEGASGQTINLDKSTFMASKNVKDETLAKCEEVLGIKRSSSLGQYLGMPSQTGRNKGVVFKRIKDRVEKTLQGWKGNLFSMGGKEVLIKA

Query:  VAQAIPVYTMSCFRLPTNICSFIDRLCAKLWWGSAGNKDKTHWISWKKMCKNK-----------------SPKQSWRLIRNPNSLLFKILRGRYFKGSNF
        VAQA P YTM+ F+LP ++C+ ++ +    WWG  G + K  W+SWK +CK K                   KQ WRL +NPNSL  ++L+ +YF  S+F
Subjt:  VAQAIPVYTMSCFRLPTNICSFIDRLCAKLWWGSAGNKDKTHWISWKKMCKNK-----------------SPKQSWRLIRNPNSLLFKILRGRYFKGSNF

Query:  LEAPIGNSPSLTCR----------EGNPRVI------------------------TLSNSYVGLRVKDFL-DDNNKWKENAIREAFSPQDASDILNMHAG
        +EA +G  PS   R          EG+  V+                        T S S  G RV   +  +  +WK   +++ F P +A +IL++   
Subjt:  LEAPIGNSPSLTCR----------EGNPRVI------------------------TLSNSYVGLRVKDFL-DDNNKWKENAIREAFSPQDASDILNMHAG

Query:  SKDSKDEIVWSFDKKGIFSVKSAYRLASKGLSTYEASHSN-----QSKEAAFWNSIWKANVLPRSKVCAWRIINDIIPTKANALKKGVDLIPLCSFCGKY
        S +  D +VW+    G F+VKSAYR A K +       +N     +S+ +  W +IW      + K   WR    I+PTK   + + +     C FCG+ 
Subjt:  SKDSKDEIVWSFDKKGIFSVKSAYRLASKGLSTYEASHSN-----QSKEAAFWNSIWKANVLPRSKVCAWRIINDIIPTKANALKKGVDLIPLCSFCGKY

Query:  KESSSHLIWECKMVKSIW----------SYFVPSSLTMWSLCR----EDWKPKDYWGWM----------------------EANLNREDIDRSIIIMLRT
         E+S H +W C + K  W           + V     +W L      +DW+      W                       EA   RE++  ++    + 
Subjt:  KESSSHLIWECKMVKSIW----------SYFVPSSLTMWSLCR----EDWKPKDYWGWM----------------------EANLNREDIDRSIIIMLRT

Query:  P---ASQAQWEKPRPNTWKLNSDATWMEKEGCRGLGWLVRDSEGSLICFGMKQVKQNWAIKNLEACAML--EGIKEVSDTCNRLGI-HLEVETDANEVVR
        P       +W  P  + +K+N DA    ++G  G+G ++R+++G ++  G    K  + ++ LEA A     GI    D    LG+ ++ VE DA  V++
Subjt:  P---ASQAQWEKPRPNTWKLNSDATWMEKEGCRGLGWLVRDSEGSLICFGMKQVKQNWAIKNLEACAML--EGIKEVSDTCNRLGI-HLEVETDANEVVR

Query:  VLNGEEEDLSDLKTFTDEIKALADRAFSMKFSHCNRLLNTAAHCVARSA
         L G +     +K   +  +       S K  H NR  NTAAH +AR +
Subjt:  VLNGEEEDLSDLKTFTDEIKALADRAFSMKFSHCNRLLNTAAHCVARSA

A0A803P4U9 Uncharacterized protein (e value: 1.8e-104; identity: 32.51%)
Query:  MGFSSNWIQKVMSCISSVCYLVLVNGEHQTEFKPSRGLRQGDPLSPYLFLLCAEGFSALLERKESLSKISGFKINKLCPSLTHLFFADDSLVFLKAKYSD
        +G+   WI KVM+C+ SV + +L+NG  Q  F P RGLRQGDPLSP+LFLLC+EG + LL   E   KI G +   L  +L+HL FADDSLVFL A   +
Subjt:  MGFSSNWIQKVMSCISSVCYLVLVNGEHQTEFKPSRGLRQGDPLSPYLFLLCAEGFSALLERKESLSKISGFKINKLCPSLTHLFFADDSLVFLKAKYSD

Query:  LRAFKQVLKHYEGASGQTINLDKSTFMASKNVKDETLAKCEEVLGIKRSSSLGQYLGMPSQTGRNKGVVFKRIKDRVEKTLQGWKGNLFSMGGKEVLIKA
          A K+VL  Y   SGQ INLDKS     + + DE+       LG++   +  +YLGMP+  G+NK  +F +I+DRVE  LQGWK  LFS  GKE+LIKA
Subjt:  LRAFKQVLKHYEGASGQTINLDKSTFMASKNVKDETLAKCEEVLGIKRSSSLGQYLGMPSQTGRNKGVVFKRIKDRVEKTLQGWKGNLFSMGGKEVLIKA

Query:  VAQAIPVYTMSCFRLPTNICSFIDRLCAKLWWGSAGNKDKTHWISWKKMCKNKS-----------------PKQSWRLIRNPNSLLFKILRGRYFKGSNF
        V QA+P Y MSCFR+   I   I+ + A+ WWG    K K HW SW+KMCK K                   KQ W+++ NP+ LL ++L+  YF  +NF
Subjt:  VAQAIPVYTMSCFRLPTNICSFIDRLCAKLWWGSAGNKDKTHWISWKKMCKNKS-----------------PKQSWRLIRNPNSLLFKILRGRYFKGSNF

Query:  LEAPIGNSPS------LTCRE----------GNPRVITLS-------NSYVGLR----------VKDFLDDNNKWKENAIREAFSPQDASDILNMHAGSK
        +EA +G+  S      L  RE          GN + + ++        +   LR          V+  ++ N +WK   I   F+ +D   +L +    +
Subjt:  LEAPIGNSPS------LTCRE----------GNPRVITLS-------NSYVGLR----------VKDFLDDNNKWKENAIREAFSPQDASDILNMHAGSK

Query:  DSKDEIVWSFDKKGIFSVKSAYRLASKGLSTYEASHSNQSKEAAFWNSIWKANVLPRSKVCAWRIINDIIPTKANALKKGVDLIPLCSFCGKYKESSSHL
        +  D I W+    G+++V S Y+L  +  +  E S ++ S+  A+W  +W + + P+ K+  WR+ +  IP K    K+G+ +   C+ C    E   H 
Subjt:  DSKDEIVWSFDKKGIFSVKSAYRLASKGLSTYEASHSNQSKEAAFWNSIWKANVLPRSKVCAWRIINDIIPTKANALKKGVDLIPLCSFCGKYKESSSHL

Query:  IWECKMVKSIWSYF-----VPSSLTM---------WSLCREDWKPKDYWGWMEAN---LNREDIDRSIII-----MLRTPASQAQ-------------WE
        +W    +  +W +F      PSSL             L  E +    +  W + N      +DI+  I I     M+    + AQ             W 
Subjt:  IWECKMVKSIWSYF-----VPSSLTM---------WSLCREDWKPKDYWGWMEAN---LNREDIDRSIII-----MLRTPASQAQ-------------WE

Query:  KPRPNTWKLNSDATWMEKEGCRGLGWLVRDSEGSLICFGMKQVKQNWAIKNLEACAMLEGIKEVSDTCNRLGIHLEVETDANEVVRVLNGEEEDLSDLKT
         P P T+ +N+DA+ +E +   GLG ++RD  G+L+    + +    ++   EA A+   +K +++T  RL  ++ + +D+  V+  L G+    +D   
Subjt:  KPRPNTWKLNSDATWMEKEGCRGLGWLVRDSEGSLICFGMKQVKQNWAIKNLEACAMLEGIKEVSDTCNRLGIHLEVETDANEVVRVLNGEEEDLSDLKT

Query:  FTDEIKALADRAFSMKFSHCNRLLNTAAHCVA
          D+    + +  ++ F    R  N+ AHC+A
Subjt:  FTDEIKALADRAFSMKFSHCNRLLNTAAHCVA

A0A803QSN3 Uncharacterized protein (e value: 2.8e-105; identity: 31.41%)
Query:  MGFSSNWIQKVMSCISSVCYLVLVNGEHQTEFKPSRGLRQGDPLSPYLFLLCAEGFSALLERKESLSKISGFKINKLCPSLTHLFFADDSLVFLKAKYSD
        +G+   W+ K M+CI++V + VLVNGE     +P RGLRQGDPLSPY+FLLC+EGFS L++  E   KI G K  +    L+HLFFADDS VFL A  ++
Subjt:  MGFSSNWIQKVMSCISSVCYLVLVNGEHQTEFKPSRGLRQGDPLSPYLFLLCAEGFSALLERKESLSKISGFKINKLCPSLTHLFFADDSLVFLKAKYSD

Query:  LRAFKQVLKHYEGASGQTINLDKSTFMASKNVKDETLAKCEEVLGIKRSSSLGQYLGMPSQTGRNKGVVFKRIKDRVEKTLQGWKGNLFSMGGKEVLIKA
         R+ K +L+ Y   SGQ INL KS     + +             +K      +YLG+P+  GR K  VF+ I+ ++   LQGWK +LFS  G+EVL+KA
Subjt:  LRAFKQVLKHYEGASGQTINLDKSTFMASKNVKDETLAKCEEVLGIKRSSSLGQYLGMPSQTGRNKGVVFKRIKDRVEKTLQGWKGNLFSMGGKEVLIKA

Query:  VAQAIPVYTMSCFRLPTNICSFIDRLCAKLWWGSAGNKDKTHWISWKKMCKNKS-----------------PKQSWRLIRNPNSLLFKILRGRYFKGSNF
        V QAIP Y MSCFRLP  +   I  + A+ WWGS+ +K+K HW  W+K+CK K                   KQ W++I NP+SLL ++L+  YF  S F
Subjt:  VAQAIPVYTMSCFRLPTNICSFIDRLCAKLWWGSAGNKDKTHWISWKKMCKNKS-----------------PKQSWRLIRNPNSLLFKILRGRYFKGSNF

Query:  LEAPIGNSPSLTCRE------------------------------GNPRVITL---SNSYVGLRVKDFLDDNNKWKENAIREAFSPQDASDILNMHAGSK
        +EA +G   S   R                                 P   TL   +    G  +    D+  +W ++ I + F P D   +L M     
Subjt:  LEAPIGNSPSLTCRE------------------------------GNPRVITL---SNSYVGLRVKDFLDDNNKWKENAIREAFSPQDASDILNMHAGSK

Query:  DSKDEIVWSFDKKGIFSVKSAYRLASKGLSTYEASHSNQSKEAAFWNSIWKANVLPRSKVCAWRIINDIIPTKANALKKGVDLIPLCSFCGKYKESSSHL
        ++ D++VW +   G+++V S Y++A+   +   A  S++     +W  IWK  V P+ +  AWR+ N  IP       +G+ + P C  CG  +E+  H 
Subjt:  DSKDEIVWSFDKKGIFSVKSAYRLASKGLSTYEASHSNQSKEAAFWNSIWKANVLPRSKVCAWRIINDIIPTKANALKKGVDLIPLCSFCGKYKESSSHL

Query:  IWECKMVKSIWSY--------------------------------FVPSSLTMWSLCREDWK-------PKDYW-GW----MEANLNREDIDRSIIIMLR
        IW C  +K IW +                                F+  S  +W+  R + K       P + W  W    M+  ++        +  + 
Subjt:  IWECKMVKSIWSY--------------------------------FVPSSLTMWSLCREDWK-------PKDYW-GW----MEANLNREDIDRSIIIMLR

Query:  TPASQAQWEKPRPNTWKLNSDATWMEKEGCRGLGWLVRDSEGSLICFGMKQVKQNWAIKNLEACAMLEGIKEVSDTCNRLGIH-LEVETDANEVVRVLNG
        TPA    W+ P P T  +N+DA+ + KE   GLG ++R+ EG+++   +KQ +  ++++  E  A+  GI+      N++  +   ++TD  +    LNG
Subjt:  TPASQAQWEKPRPNTWKLNSDATWMEKEGCRGLGWLVRDSEGSLICFGMKQVKQNWAIKNLEACAMLEGIKEVSDTCNRLGIH-LEVETDANEVVRVLNG

Query:  EEEDLSDLKTFTDEIKALADRAFSMKFSHCNRLLNTAAHCVARSA
        +    +D     D+I+   + +      H  R  N AAH +A+ A
Subjt:  EEEDLSDLKTFTDEIKALADRAFSMKFSHCNRLLNTAAHCVARSA

M5VU98 Reverse transcriptase domain-containing protein (e value: 1.3e-113; identity: 34.84%)
Query:  MGFSSNWIQKVMSCISSVCYLVLVNGEHQTEFKPSRGLRQGDPLSPYLFLLCAEGFSALLERKESLSKISGFKINKLCPSLTHLFFADDSLVFLKAKYSD
        MGF   W++ VM C+++V Y  LVNGE      P+RGLRQGDPLSPYLFLLCAEGF+ LL + E   ++ G  I +  P+++HLFFADDS VF KA  ++
Subjt:  MGFSSNWIQKVMSCISSVCYLVLVNGEHQTEFKPSRGLRQGDPLSPYLFLLCAEGFSALLERKESLSKISGFKINKLCPSLTHLFFADDSLVFLKAKYSD

Query:  LRAFKQVLKHYEGASGQTINLDKSTFMASKNVKDETLAKCEEVLGIKRSSSLGQYLGMPSQTGRNKGVVFKRIKDRVEKTLQGWKGNLFSMGGKEVLIKA
            K + + YE ASGQ IN  KS    S N+  +T ++   VLG+ R  S   YLG+P   GRNK V F+ +K+RV K LQGW+    S+ GKEVL+K 
Subjt:  LRAFKQVLKHYEGASGQTINLDKSTFMASKNVKDETLAKCEEVLGIKRSSSLGQYLGMPSQTGRNKGVVFKRIKDRVEKTLQGWKGNLFSMGGKEVLIKA

Query:  VAQAIPVYTMSCFRLPTNICSFIDRLCAKLWWGSAGNKDKTHWISWKKMCKNKS-----------------PKQSWRLIRNPNSLLFKILRGRYFKGSNF
        VAQ+IP+Y MSCF LP  +C  I+++ A+ WWG  G   K HW+ W+++CK K+                  KQ WRL+ NP+SL  ++L+ +YF  +NF
Subjt:  VAQAIPVYTMSCFRLPTNICSFIDRLCAKLWWGSAGNKDKTHWISWKKMCKNKS-----------------PKQSWRLIRNPNSLLFKILRGRYFKGSNF

Query:  LEAPIGNSPSLTCR------------------EGN----------PRVITLS------NSYVGLRVKDFL--DDNNKWKENAIREAFSPQDASDILNMHA
         EA +G+ PS   +                  +G           PR  T +      +     +V + +  + + +W    +   F P D  DI+ +  
Subjt:  LEAPIGNSPSLTCR------------------EGN----------PRVITLS------NSYVGLRVKDFL--DDNNKWKENAIREAFSPQDASDILNMHA

Query:  GSKDSKDEIVWSFDKKGIFSVKSAYRLASKGLSTYE-ASHSNQSKEAAFWNSIWKANVLPRSKVCAWRIINDIIPTKANALKKGVDLIPLCSFCGKYKES
          +   D IVW++DK G+F+VKSAYR+A +  S  E  S S+ S     W  IW A V  + K+ AWR+ +DI+PTKAN +KKGVD+  +C FCG   ES
Subjt:  GSKDSKDEIVWSFDKKGIFSVKSAYRLASKGLSTYE-ASHSNQSKEAAFWNSIWKANVLPRSKVCAWRIINDIIPTKANALKKGVDLIPLCSFCGKYKES

Query:  SSHLIWECKMVKSIWSYFVPSSLTMWSLCREDWKPKDYWGWMEANLNR----EDIDRSIIIMLRTPASQAQWEKPRPNTWKLNSDATWMEKEGCRGLGWL
        + H++  C    + W+    S LT  +       P +  G+ +  ++      D    +   +R P    +W  P     K N D  +    G   +G +
Subjt:  SSHLIWECKMVKSIWSYFVPSSLTMWSLCREDWKPKDYWGWMEANLNR----EDIDRSIIIMLRTPASQAQWEKPRPNTWKLNSDATWMEKEGCRGLGWL

Query:  VRDSEGSLICFGMKQVKQNWAIKNLEACAMLEGIKEVSDTCNRLGIHLEVETDANEVVRVLNGEEEDLSDLKTFTDEIKALADRAFSMKFSHCNRLLNTA
         RD++G  +    K V +  + ++ E  A  EG+           I    E D+  VV  +    +D S++ T  +++K L  +  S  F    R  N  
Subjt:  VRDSEGSLICFGMKQVKQNWAIKNLEACAMLEGIKEVSDTCNRLGIHLEVETDANEVVRVLNGEEEDLSDLKTFTDEIKALADRAFSMKFSHCNRLLNTA

Query:  AHCVAR
        AH +AR
Subjt:  AHCVAR

SwissProt top hits (e value, % identity, alignment)
P11369 LINE-1 retrotransposable element ORF2 protein (e value: 2.8e-09; identity: 23.85%)
Query:  GFSSNWIQKVMSCISSVCYLVLVNGEHQTEFKPSRGLRQGDPLSPYLFLLCAEGFSALLERKESLSKISGFKINKLCPSLTHLFFADDSLVFLKAKYSDL
        G    ++  + +  S     + VNGE         G RQG PLSPYLF +  E  +  + +++   +I G +I K    ++ L  ADD +V++    +  
Subjt:  GFSSNWIQKVMSCISSVCYLVLVNGEHQTEFKPSRGLRQGDPLSPYLFLLCAEGFSALLERKESLSKISGFKINKLCPSLTHLFFADDSLVFLKAKYSDL

Query:  RAFKQVLKHYEGASGQTINLDKS-TFMASKNVKDETLAKCEEVLGIKRSSSLGQYLGMPSQTGRNKGVV---FKRIKDRVEKTLQGWKGNLFSMGGKEVL
        R    ++  +    G  IN +KS  F+ +KN + E   +  E       ++  +YLG+ + T   K +    FK +K  +++ L+ WK    S  G+  +
Subjt:  RAFKQVLKHYEGASGQTINLDKS-TFMASKNVKDETLAKCEEVLGIKRSSSLGQYLGMPSQTGRNKGVV---FKRIKDRVEKTLQGWKGNLFSMGGKEVL

Query:  IKAVAQAIPVYTMSC--FRLPTNICSFIDRLCAKLWWGS
        +K       +Y  +    ++PT   + ++    K  W +
Subjt:  IKAVAQAIPVYTMSC--FRLPTNICSFIDRLCAKLWWGS

P14381 Transposon TX1 uncharacterized 149 kDa protein (e value: 1.3e-06; identity: 25.4%)
Query:  FSSNWIQKVMSCISSVCYLVLVNGEHQTEFKPSRGLRQGDPLSPYLFLLCAEGFSALLERKESLSKISGFKINKLCPSLTHLFFADDSLVFLKAKYSDLR
        F   ++  + +  +S   LV +N          RG+RQG PLS  L+ L  E F  LL +     +++G  + +    +    +ADD ++ +     DL 
Subjt:  FSSNWIQKVMSCISSVCYLVLVNGEHQTEFKPSRGLRQGDPLSPYLFLLCAEGFSALLERKESLSKISGFKINKLCPSLTHLFFADDSLVFLKAKYSDLR

Query:  AFKQVLKHYEGASGQTINLDKSTFMASKNVKDETLAKCEEVLGIKRSSSLGQYLGM-PSQTGRNKGVVFKRIKDRVEKTLQGWKG--NLFSMGGKEVLIK
          ++  + Y  AS   IN  KS+ +   ++K + L        I   S + +YLG+  S         F  +++ V   L  WKG   + SM G+ ++I 
Subjt:  AFKQVLKHYEGASGQTINLDKSTFMASKNVKDETLAKCEEVLGIKRSSSLGQYLGM-PSQTGRNKGVVFKRIKDRVEKTLQGWKG--NLFSMGGKEVLIK

Query:  AVAQAIPVYTMSCFRLPTN--ICSFIDRLCAKLWWGSAGNKDKTHWIS
         +  +   Y + C   PT   I     RL   LW G        HW+S
Subjt:  AVAQAIPVYTMSCFRLPTN--ICSFIDRLCAKLWWGSAGNKDKTHWIS

P92555 Uncharacterized mitochondrial protein AtMg01250 (e value: 2.4e-13; identity: 47.37%)
Query:  VCY---LVLVNGEHQTEFKPSRGLRQGDPLSPYLFLLCAEGFSALLERKESLSKISGFKINKLCPSLTHLFFADDS
        VC+   L ++NG  Q    PSRGLRQGDPLSPYLF+LC E  S L  R +   ++ G +++   P + HL FADD+
Subjt:  VCY---LVLVNGEHQTEFKPSRGLRQGDPLSPYLFLLCAEGFSALLERKESLSKISGFKINKLCPSLTHLFFADDS

P93295 Uncharacterized mitochondrial protein AtMg00310 (e value: 1.4e-16; identity: 37.5%)
Query:  AIPVYTMSCFRLPTNICSFIDRLCAKLWWGSAGNKDKTHWISWKKMCKNKS------------------PKQSWRLIRNPNSLLFKILRGRYFKGSNFLE
        A+PVY MSCFRL   +C  +     + WW S  NK K  W++W+K+CK+K                    KQS+R+I  P++LL ++LR RYF  S+ +E
Subjt:  AIPVYTMSCFRLPTNICSFIDRLCAKLWWGSAGNKDKTHWISWKKMCKNKS------------------PKQSWRLIRNPNSLLFKILRGRYFKGSNFLE

Query:  APIGNSPSLTCR
          +G  PS   R
Subjt:  APIGNSPSLTCR

Arabidopsis top hits (e value, % identity, alignment)
AT2G02520.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 1.3e-06; identity: 30%)
Query:  HSNQSKEAAFW-NSIWKANVLPRSKVCAWRIINDIIPTKANALKKGVDLIPLCSFCGKYKESSSHLIWECKMVKSIWSYF
        H N   E   W  +IW    +P+    AW  +   + TK   +  G    PLC FC  + E+  HL ++C+  + +W YF
Subjt:  HSNQSKEAAFW-NSIWKANVLPRSKVCAWRIINDIIPTKANALKKGVDLIPLCSFCGKYKESSSHLIWECKMVKSIWSYF

AT3G09510.1 Ribonuclease H-like superfamily protein (e value: 1.1e-13; identity: 19.9%)
Query:  PRVITLSNSYVGLRVKDFLDDNNK---WKENAIREAFSPQDASDILNMHAGSKDSKDEIVWSFDKKGIFSVKSAYRLASKGLSTYEASHSNQSKEAAFWN
        PR +    +Y  + + +  +       W ++ I +     D   I  ++       D+I+W+++  G ++V+S Y L +   ST   + +          
Subjt:  PRVITLSNSYVGLRVKDFLDDNNK---WKENAIREAFSPQDASDILNMHAGSKDSKDEIVWSFDKKGIFSVKSAYRLASKGLSTYEASHSNQSKEAAFWN

Query:  SIWKANVLPRSKVCAWRIINDIIPTKANALKKGVDLIPLCSFCGKYKESSSHLIWECKMVKSIWSYFVPSSLTMWSLCREDWKP--------------KD
         IW   ++P+ K   WR ++  + T      +G+ + P C  C +  ES +H ++ C      W     SSL    L   D++                D
Subjt:  SIWKANVLPRSKVCAWRIINDIIPTKANALKKGVDLIPLCSFCGKYKESSSHLIWECKMVKSIWSYFVPSSLTMWSLCREDWKP--------------KD

Query:  Y---------WGWMEANLN------REDIDRSIIIM--------------LRTPA-------SQAQWEKPRPNTWKLNSDATW-MEKEGCRGLGWLVRDS
        +         W   +A  N      RE   ++++                 +TP+       ++ +W  P     K N DA + ++K    G GW++R+ 
Subjt:  Y---------WGWMEANLN------REDIDRSIIIM--------------LRTPA-------SQAQWEKPRPNTWKLNSDATW-MEKEGCRGLGWLVRDS

Query:  EGSLICFGMKQVKQNWAIKNLEACAMLEGIKEVSDTCNRLGIHLEVETDANEVVRVLNGEEEDLSDLKTFTDEIKALADRAFSMKFSHCNRLLNTAAHCV
         G+ I +G  ++         E  A+L  +++   T  R    + +E D   ++ ++NG     S L    ++I   A++  S++F    R  N  AH +
Subjt:  EGSLICFGMKQVKQNWAIKNLEACAMLEGIKEVSDTCNRLGIHLEVETDANEVVRVLNGEEEDLSDLKTFTDEIKALADRAFSMKFSHCNRLLNTAAHCV

Query:  AR
        A+
Subjt:  AR

AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 1.1e-32; identity: 22.38%)
Query:  AIPVYTMSCFRLPTNICSFIDRLCAKLWWGSAGNKDKTHWISWKKMCKNKSP-----------------KQSWRLIRNPNSLLFKILRGRYFKGSNFLEA
        A+P YTM+CF LP  +C  I  + A  WW +       HW +W  +   K+                  KQ WR++  P SL+ K+ + RYF  S+ L A
Subjt:  AIPVYTMSCFRLPTNICSFIDRLCAKLWWGSAGNKDKTHWISWKKMCKNKSP-----------------KQSWRLIRNPNSLLFKILRGRYFKGSNFLEA

Query:  PIGNSPSLT----------CREGNPRVITLSNSYV-------------------------------GLRVKDFLDDNNK-WKENAIREAFSPQDASDILN
        P+G+ PS             R+G   V+      +                                L+V D +D++ + W+++ I   F   +   I  
Subjt:  PIGNSPSLT----------CREGNPRVITLSNSYV-------------------------------GLRVKDFLDDNNK-WKENAIREAFSPQDASDILN

Query:  MHAGSKDSKDEIVWSFDKKGIFSVKSAY----RLASKGLSTYEASHSNQSKEAAFWNSIWKANVLPRSKVCAWRIINDIIPTKANALKKGVDLIPLCSFC
        +  G +   D   W +   G ++VKS Y    ++ +K  S  E S  + +     +  IWK+   P+ +   W+ +++ +P       + +     C  C
Subjt:  MHAGSKDSKDEIVWSFDKKGIFSVKSAY----RLASKGLSTYEASHSNQSKEAAFWNSIWKANVLPRSKVCAWRIINDIIPTKANALKKGVDLIPLCSFC

Query:  GKYKESSSHLIWECKMVKSIWS--------------------YFV------------PSSLTMWSLCREDWKPKDYWGWMEANLNREDI-----DRSIII
           KE+ +HL+++C   +  W+                    Y+V             S L  W L R  WK ++   +     N +++     D     
Subjt:  GKYKESSSHLIWECKMVKSIWS--------------------YFV------------PSSLTMWSLCREDWKPKDYWGWMEANLNREDI-----DRSIII

Query:  MLRTPA------------SQAQWEKPRPNTW-KLNSDATWMEKEGCRGLGWLVRDSEGSLICFGMKQVKQNWAIKNLEACAMLEGIKEVSDTCNRLGIHL
         +RT A            S  +W +P P+ W K N+DATW       G+GW++R+ +G +   G + + +  ++   E  AM   +  +S        ++
Subjt:  MLRTPA------------SQAQWEKPRPNTW-KLNSDATWMEKEGCRGLGWLVRDSEGSLICFGMKQVKQNWAIKNLEACAMLEGIKEVSDTCNRLGIHL

Query:  EVETDANEVVRVLNGEEEDLSDLKTFTDEIKALADRAFSMKFSHCNRLLNTAAHCVARSAASKFSPSPAMSS
          E+D+  ++ +LN  +E    LK    +++ L  +   +KF    R  NT A  VAR + S  +  P + S
Subjt:  EVETDANEVVRVLNGEEEDLSDLKTFTDEIKALADRAFSMKFSHCNRLLNTAAHCVARSAASKFSPSPAMSS

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 9.8e-18; identity: 37.5%)
Query:  AIPVYTMSCFRLPTNICSFIDRLCAKLWWGSAGNKDKTHWISWKKMCKNKS------------------PKQSWRLIRNPNSLLFKILRGRYFKGSNFLE
        A+PVY MSCFRL   +C  +     + WW S  NK K  W++W+K+CK+K                    KQS+R+I  P++LL ++LR RYF  S+ +E
Subjt:  AIPVYTMSCFRLPTNICSFIDRLCAKLWWGSAGNKDKTHWISWKKMCKNKS------------------PKQSWRLIRNPNSLLFKILRGRYFKGSNFLE

Query:  APIGNSPSLTCR
          +G  PS   R
Subjt:  APIGNSPSLTCR

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) | e value: 1.7e-14 | %identity: 47.37
Query:  VCY---LVLVNGEHQTEFKPSRGLRQGDPLSPYLFLLCAEGFSALLERKESLSKISGFKINKLCPSLTHLFFADDS
        VC+   L ++NG  Q    PSRGLRQGDPLSPYLF+LC E  S L  R +   ++ G +++   P + HL FADD+
Subjt:  VCY---LVLVNGEHQTEFKPSRGLRQGDPLSPYLFLLCAEGFSALLERKESLSKISGFKINKLCPSLTHLFFADDS


Sequences
CDS sequence
ATGGGTTTTAGCTCCAATTGGATCCAGAAGGTTATGAGTTGCATCTCCTCTGTATGCTACTTAGTTCTTGTTAATGGCGAGCACCAAACAGAGTTTAAGCCGAGCAGAGG
CCTCCGGCAGGGAGACCCCCTGTCGCCCTACCTGTTCCTGTTGTGTGCGGAAGGCTTTTCCGCTCTTTTGGAAAGGAAAGAATCTCTTTCTAAAATTTCTGGTTTTAAAA
TTAACAAGCTTTGTCCTTCTCTAACTCATCTCTTCTTTGCAGATGACAGCCTAGTCTTTCTTAAAGCAAAATACTCGGATCTTCGAGCCTTCAAACAAGTTTTGAAACAC
TACGAGGGAGCGTCGGGCCAGACTATCAACCTTGACAAGTCCACCTTTATGGCTAGTAAAAACGTTAAGGATGAGACTTTGGCCAAGTGTGAAGAAGTTCTAGGTATCAA
GAGATCGAGTTCGTTAGGCCAATATTTGGGGATGCCTTCGCAAACAGGGAGAAATAAAGGAGTGGTGTTCAAAAGAATCAAAGACAGGGTTGAGAAAACTCTTCAAGGAT
GGAAAGGAAATCTCTTTTCCATGGGAGGAAAGGAAGTGCTTATTAAGGCTGTGGCTCAGGCGATCCCAGTCTATACCATGAGTTGTTTTCGACTACCCACTAATATCTGT
TCTTTTATTGACAGGTTATGCGCTAAATTGTGGTGGGGCTCGGCTGGGAATAAAGACAAAACTCACTGGATAAGCTGGAAGAAGATGTGCAAAAACAAGAGTCCTAAACA
GAGCTGGAGGCTGATAAGGAACCCCAATAGCCTCCTGTTCAAGATTCTGAGAGGGCGATATTTCAAAGGAAGTAATTTCCTTGAAGCTCCCATTGGCAACTCTCCTTCCC
TCACTTGCCGAGAAGGCAATCCTAGAGTCATCACCCTGTCCAACTCCTATGTGGGCCTTAGAGTCAAAGATTTCCTTGATGACAACAACAAGTGGAAAGAAAATGCAATT
AGGGAAGCTTTCTCTCCTCAAGATGCGTCCGATATCCTGAATATGCATGCAGGGAGTAAGGATTCAAAGGACGAAATTGTGTGGAGCTTCGACAAAAAAGGCATTTTCTC
TGTGAAAAGTGCCTACCGACTAGCCTCGAAAGGCCTTAGCACTTATGAAGCCTCCCATTCTAACCAATCTAAGGAAGCAGCTTTCTGGAACAGTATTTGGAAGGCCAATG
TTCTCCCTAGATCCAAAGTGTGTGCGTGGAGGATTATCAATGACATCATCCCTACAAAGGCTAATGCTCTAAAAAAGGGAGTTGATCTTATTCCCTTATGCTCTTTTTGT
GGAAAATATAAAGAATCTTCTTCCCATCTAATATGGGAATGCAAAATGGTTAAATCTATATGGTCCTATTTTGTTCCAAGCTCTCTTACTATGTGGTCTTTGTGTAGGGA
GGATTGGAAGCCTAAGGACTACTGGGGATGGATGGAAGCGAATCTAAACAGGGAAGACATAGACAGAAGTATAATCATAATGCTGAGGACTCCGGCGAGTCAAGCTCAGT
GGGAGAAGCCACGGCCGAACACGTGGAAGCTCAACTCAGACGCAACCTGGATGGAAAAAGAGGGTTGCAGAGGTCTCGGATGGCTCGTGCGTGACTCGGAGGGTTCCTTG
ATCTGTTTCGGGATGAAGCAAGTTAAACAAAATTGGGCTATAAAGAATCTGGAAGCTTGCGCTATGTTGGAAGGTATCAAAGAAGTTTCAGATACCTGTAATCGCCTCGG
AATTCATCTGGAAGTCGAGACGGACGCCAACGAGGTCGTTCGGGTTCTCAACGGCGAGGAGGAAGATTTATCAGACCTGAAGACTTTCACCGATGAAATCAAGGCACTAG
CTGACCGTGCCTTCTCCATGAAATTCAGCCATTGTAATCGCCTTTTGAACACAGCTGCACACTGTGTTGCGAGGAGCGCTGCCAGCAAGTTCTCGCCGTCTCCGGCGATG
TCTTCTGGCGCTTTGTCTTCTTCGCGGGAACGGGAAGTATGTTTTTCGGCTCCCAACATTCCGGTTTGGGCTTTCCCTCTAATTAATGAGGGTGGTTGTACTAGACTGCT
TTTGTCTTAA
mRNA sequence
ATGGGTTTTAGCTCCAATTGGATCCAGAAGGTTATGAGTTGCATCTCCTCTGTATGCTACTTAGTTCTTGTTAATGGCGAGCACCAAACAGAGTTTAAGCCGAGCAGAGG
CCTCCGGCAGGGAGACCCCCTGTCGCCCTACCTGTTCCTGTTGTGTGCGGAAGGCTTTTCCGCTCTTTTGGAAAGGAAAGAATCTCTTTCTAAAATTTCTGGTTTTAAAA
TTAACAAGCTTTGTCCTTCTCTAACTCATCTCTTCTTTGCAGATGACAGCCTAGTCTTTCTTAAAGCAAAATACTCGGATCTTCGAGCCTTCAAACAAGTTTTGAAACAC
TACGAGGGAGCGTCGGGCCAGACTATCAACCTTGACAAGTCCACCTTTATGGCTAGTAAAAACGTTAAGGATGAGACTTTGGCCAAGTGTGAAGAAGTTCTAGGTATCAA
GAGATCGAGTTCGTTAGGCCAATATTTGGGGATGCCTTCGCAAACAGGGAGAAATAAAGGAGTGGTGTTCAAAAGAATCAAAGACAGGGTTGAGAAAACTCTTCAAGGAT
GGAAAGGAAATCTCTTTTCCATGGGAGGAAAGGAAGTGCTTATTAAGGCTGTGGCTCAGGCGATCCCAGTCTATACCATGAGTTGTTTTCGACTACCCACTAATATCTGT
TCTTTTATTGACAGGTTATGCGCTAAATTGTGGTGGGGCTCGGCTGGGAATAAAGACAAAACTCACTGGATAAGCTGGAAGAAGATGTGCAAAAACAAGAGTCCTAAACA
GAGCTGGAGGCTGATAAGGAACCCCAATAGCCTCCTGTTCAAGATTCTGAGAGGGCGATATTTCAAAGGAAGTAATTTCCTTGAAGCTCCCATTGGCAACTCTCCTTCCC
TCACTTGCCGAGAAGGCAATCCTAGAGTCATCACCCTGTCCAACTCCTATGTGGGCCTTAGAGTCAAAGATTTCCTTGATGACAACAACAAGTGGAAAGAAAATGCAATT
AGGGAAGCTTTCTCTCCTCAAGATGCGTCCGATATCCTGAATATGCATGCAGGGAGTAAGGATTCAAAGGACGAAATTGTGTGGAGCTTCGACAAAAAAGGCATTTTCTC
TGTGAAAAGTGCCTACCGACTAGCCTCGAAAGGCCTTAGCACTTATGAAGCCTCCCATTCTAACCAATCTAAGGAAGCAGCTTTCTGGAACAGTATTTGGAAGGCCAATG
TTCTCCCTAGATCCAAAGTGTGTGCGTGGAGGATTATCAATGACATCATCCCTACAAAGGCTAATGCTCTAAAAAAGGGAGTTGATCTTATTCCCTTATGCTCTTTTTGT
GGAAAATATAAAGAATCTTCTTCCCATCTAATATGGGAATGCAAAATGGTTAAATCTATATGGTCCTATTTTGTTCCAAGCTCTCTTACTATGTGGTCTTTGTGTAGGGA
GGATTGGAAGCCTAAGGACTACTGGGGATGGATGGAAGCGAATCTAAACAGGGAAGACATAGACAGAAGTATAATCATAATGCTGAGGACTCCGGCGAGTCAAGCTCAGT
GGGAGAAGCCACGGCCGAACACGTGGAAGCTCAACTCAGACGCAACCTGGATGGAAAAAGAGGGTTGCAGAGGTCTCGGATGGCTCGTGCGTGACTCGGAGGGTTCCTTG
ATCTGTTTCGGGATGAAGCAAGTTAAACAAAATTGGGCTATAAAGAATCTGGAAGCTTGCGCTATGTTGGAAGGTATCAAAGAAGTTTCAGATACCTGTAATCGCCTCGG
AATTCATCTGGAAGTCGAGACGGACGCCAACGAGGTCGTTCGGGTTCTCAACGGCGAGGAGGAAGATTTATCAGACCTGAAGACTTTCACCGATGAAATCAAGGCACTAG
CTGACCGTGCCTTCTCCATGAAATTCAGCCATTGTAATCGCCTTTTGAACACAGCTGCACACTGTGTTGCGAGGAGCGCTGCCAGCAAGTTCTCGCCGTCTCCGGCGATG
TCTTCTGGCGCTTTGTCTTCTTCGCGGGAACGGGAAGTATGTTTTTCGGCTCCCAACATTCCGGTTTGGGCTTTCCCTCTAATTAATGAGGGTGGTTGTACTAGACTGCT
TTTGTCTTAA
Protein sequence
MGFSSNWIQKVMSCISSVCYLVLVNGEHQTEFKPSRGLRQGDPLSPYLFLLCAEGFSALLERKESLSKISGFKINKLCPSLTHLFFADDSLVFLKAKYSDLRAFKQVLKH
YEGASGQTINLDKSTFMASKNVKDETLAKCEEVLGIKRSSSLGQYLGMPSQTGRNKGVVFKRIKDRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSCFRLPTNIC
SFIDRLCAKLWWGSAGNKDKTHWISWKKMCKNKSPKQSWRLIRNPNSLLFKILRGRYFKGSNFLEAPIGNSPSLTCREGNPRVITLSNSYVGLRVKDFLDDNNKWKENAI
REAFSPQDASDILNMHAGSKDSKDEIVWSFDKKGIFSVKSAYRLASKGLSTYEASHSNQSKEAAFWNSIWKANVLPRSKVCAWRIINDIIPTKANALKKGVDLIPLCSFC
GKYKESSSHLIWECKMVKSIWSYFVPSSLTMWSLCREDWKPKDYWGWMEANLNREDIDRSIIIMLRTPASQAQWEKPRPNTWKLNSDATWMEKEGCRGLGWLVRDSEGSL
ICFGMKQVKQNWAIKNLEACAMLEGIKEVSDTCNRLGIHLEVETDANEVVRVLNGEEEDLSDLKTFTDEIKALADRAFSMKFSHCNRLLNTAAHCVARSAASKFSPSPAM
SSGALSSSREREVCFSAPNIPVWAFPLINEGGCTRLLLS