; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001130 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001130
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr4:25005282..25007804
RNA-Seq ExpressionLag0001130
SyntenyLag0001130
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]5.5e-13742.55Show/hide
Query:  SPKNINSKIQSCIKSLAAWNQLRLKGSLKRAISRTEEEIN-LPQANFSSSNADVLLAKEKKLDVLLEEEENYWKIRSREEWLKWGDKNTKWL--------
        +P+ I+  ++ C   L+ W+   + G + + I      +N L         +  +    ++++ LL++EE YW  R++  WLK GD+NTK+         
Subjt:  SPKNINSKIQSCIKSLAAWNQLRLKGSLKRAISRTEEEIN-LPQANFSSSNADVLLAKEKKLDVLLEEEENYWKIRSREEWLKWGDKNTKWL--------

Query:  ------------------QPSISRGFFSSFD-----PKSSAIDEVMEAIKNRINENDSQQMDKVFTEAEIHRALKSMSPTKAPGPDGAHALFFQNYWSNI
                          + SI++   S F+        S I+EV EAI  ++ E  ++ + + FT+ E+  ALK + P KAPGPDG  A+FFQ YWS +
Subjt:  ------------------QPSISRGFFSSFD-----PKSSAIDEVMEAIKNRINENDSQQMDKVFTEAEIHRALKSMSPTKAPGPDGAHALFFQNYWSNI

Query:  GKEVTTVCLGVLNQGEDMSEINKTYISLIPKCNKPKYMGELRPISLCNVVYKLIAKALANRLKEVLNAIISPNQSAFVPGRLITDNVVLGFECIYAINSK
        G  VT + L VLN    + E+NKT ISLIPK N PK M + RPISLCNVVYKLI+K LANRLK +L  IIS NQSAF   RLITDNV++ FE ++ ++ K
Subjt:  GKEVTTVCLGVLNQGEDMSEINKTYISLIPKCNKPKYMGELRPISLCNVVYKLIAKALANRLKEVLNAIISPNQSAFVPGRLITDNVVLGFECIYAINSK

Query:  KQGKEWSISMKIDMSKAYDRVEWSFLDKIMDRMGFSEIWRRRVMNCITSVSYSVLVNGYPSEEFRPNRGIRQGDPLSPT-FSSCVPKIFQLFLTGRNPTL
          GKE  +++K+DMSKA+DRVEW F+ K+M++MGF   WR  VM CITSVSYS+L+NG       P+RG+RQGDPLSP+ F  C   +  L        L
Subjt:  KQGKEWSISMKIDMSKAYDRVEWSFLDKIMDRMGFSEIWRRRVMNCITSVSYSVLVNGYPSEEFRPNRGIRQGDPLSPT-FSSCVPKIFQLFLTGRNPTL

Query:  ISKVLKSIIVAPHYLICFFFADDSLIFFGALEKECTTIKKVLLTYEEALGQVVNFEKSAFMTSKNVSSSLETKCGRILGVVSTKSLGNYLGMPYTNNKSK
        I+ +  SI      +   FFADDS++F  A  +EC  ++ +L  YEEA GQ +N +KS+   S N +     +   ILG +       YLG+P    +SK
Subjt:  ISKVLKSIIVAPHYLICFFFADDSLIFFGALEKECTTIKKVLLTYEEALGQVVNFEKSAFMTSKNVSSSLETKCGRILGVVSTKSLGNYLGMPYTNNKSK

Query:  NRMFAKIKDSVWKVLQSWKEKLFLAGGKKVLIKAVAQAIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWKFLCKSKKMGGLGFRDLLSF
        +++FA +K+ V   L  WK KL   GGK++LIKAVAQAIP YTMSCF LP G+C ++  +   FWWG  + + K+ W SWK +C SK  GGLGFR+L +F
Subjt:  NRMFAKIKDSVWKVLQSWKEKLFLAGGKKVLIKAVAQAIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWKFLCKSKKMGGLGFRDLLSF

Query:  NQAMLAKLSWRVIKDPSSLLARTLKGKYFHDQSFLEASLEILPS
        N AMLAK +WR++ +P+SL+ R LK +YF     L A L   PS
Subjt:  NQAMLAKLSWRVIKDPSSLLARTLKGKYFHDQSFLEASLEILPS

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]7.4e-14243.25Show/hide
Query:  IIKYCWNSEGPF-SPKNINSKIQSCIKSLAAWNQLRLKGSLKRAISRTEEEIN-LPQANFSSSNADVLLAKEKKLDVLLEEEENYWKIRSREEWLKWGDK
        II+  WNS     SP+ I ++++ C ++L+ WN++ + G++ R I   +E +N L  ++ + S    +    K+++ LL+ EE  W+ RSR +WL  GD+
Subjt:  IIKYCWNSEGPF-SPKNINSKIQSCIKSLAAWNQLRLKGSLKRAISRTEEEIN-LPQANFSSSNADVLLAKEKKLDVLLEEEENYWKIRSREEWLKWGDK

Query:  NTKWLQPSIS---------------------------------RGFFSSFDPKSSAIDEVMEAIKNRINENDSQQMDKVFTEAEIHRALKSMSPTKAPGP
        NTK+     S                                 +  +SS  P  + I EV++AI   + E  +  + + FT  EI  AL  M PTKAPGP
Subjt:  NTKWLQPSIS---------------------------------RGFFSSFDPKSSAIDEVMEAIKNRINENDSQQMDKVFTEAEIHRALKSMSPTKAPGP

Query:  DGAHALFFQNYWSNIGKEVTTVCLGVLNQGEDMSEINKTYISLIPKCNKPKYMGELRPISLCNVVYKLIAKALANRLKEVLNAIISPNQSAFVPGRLITD
        DG  A+FFQ YW+ +G ++  + L VLN    M EINKT I+L+PK   P  M + RPISLCNVVYKLI+K LANRLK +L  IIS NQSAF+ GRLITD
Subjt:  DGAHALFFQNYWSNIGKEVTTVCLGVLNQGEDMSEINKTYISLIPKCNKPKYMGELRPISLCNVVYKLIAKALANRLKEVLNAIISPNQSAFVPGRLITD

Query:  NVVLGFECIYAINSKKQGKEWSISMKIDMSKAYDRVEWSFLDKIMDRMGFSEIWRRRVMNCITSVSYSVLVNGYPSEEFRPNRGIRQGDPLSP-TFSSCV
        NV++ FE ++ +  KK+GKE   ++K+DMSKAYDRVEW F+ ++M++MGF E W + VM+CITSVSYS+LVNG       P RG+RQGDP+SP  F  C 
Subjt:  NVVLGFECIYAINSKKQGKEWSISMKIDMSKAYDRVEWSFLDKIMDRMGFSEIWRRRVMNCITSVSYSVLVNGYPSEEFRPNRGIRQGDPLSP-TFSSCV

Query:  PKIFQLFLTGRNPTLISKVLKSIIVAPHYLICFFFADDSLIFFGALEKECTTIKKVLLTYEEALGQVVNFEKSAFMTSKNVSSSLETKCGRILGVVSTKS
             L         IS V  SI      +   FFADDSL+F  A  +EC T+  +L  YE+A GQ +N +KS+   S N       +  R+LG +    
Subjt:  PKIFQLFLTGRNPTLISKVLKSIIVAPHYLICFFFADDSLIFFGALEKECTTIKKVLLTYEEALGQVVNFEKSAFMTSKNVSSSLETKCGRILGVVSTKS

Query:  LGNYLGMPYTNNKSKNRMFAKIKDSVWKVLQSWKEKLFLAGGKKVLIKAVAQAIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWKFLCK
           YLG+P    KSK  +FA++K+ V + L  WKEKL   GG+++LIKAVAQAIP YTMSCF++P  +C+EI  +  +FWWG    + KI W SWK LCK
Subjt:  LGNYLGMPYTNNKSKNRMFAKIKDSVWKVLQSWKEKLFLAGGKKVLIKAVAQAIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWKFLCK

Query:  SKKMGGLGFRDLLSFNQAMLAKLSWRVIKDPSSLLARTLKGKYFHDQSFLEASLEILPS
        +KK GG+GFR+L +FN AMLAK  WR+I +P+SL+A+  K +Y+      +A L   PS
Subjt:  SKKMGGLGFRDLLSFNQAMLAKLSWRVIKDPSSLLARTLKGKYFHDQSFLEASLEILPS

XP_023901742.1 uncharacterized protein LOC112013579 [Quercus suber]1.1e-14243.16Show/hide
Query:  IIKYCWNSEGPFS--PKNINSKIQSCIKSLAAWNQLRLKGSLKRAISRTEEEIN-LPQANFSSSNADVLLAKEKKLDVLLEEEENYWKIRSREEWLKWGD
        +I+  W+S G FS  P+ I S +Q C  +L +WNQ  + G++ + I      +N +   +   +    +    K+L+ LL+ EE  W+ RS+  W + GD
Subjt:  IIKYCWNSEGPFS--PKNINSKIQSCIKSLAAWNQLRLKGSLKRAISRTEEEIN-LPQANFSSSNADVLLAKEKKLDVLLEEEENYWKIRSREEWLKWGD

Query:  KNTKWL---------QPSISR--------------------GFFSSFDPKSS--AIDEVMEAIKNRINENDSQQMDKVFTEAEIHRALKSMSPTKAPGPD
        +NTK+          + SISR                     +F +    SS   I+EV+ AI  R+ +  + ++ K FT  E+ +ALK + PTKAPGPD
Subjt:  KNTKWL---------QPSISR--------------------GFFSSFDPKSS--AIDEVMEAIKNRINENDSQQMDKVFTEAEIHRALKSMSPTKAPGPD

Query:  GAHALFFQNYWSNIGKEVTTVCLGVLNQGEDMSEINKTYISLIPKCNKPKYMGELRPISLCNVVYKLIAKALANRLKEVLNAIISPNQSAFVPGRLITDN
        G  A FF NYW  +G  +T + L VLN    M+EINKT ISLIPK N+P  M E RPISLCN  YK+I+K LANR K +L  IIS NQSAF P RLITDN
Subjt:  GAHALFFQNYWSNIGKEVTTVCLGVLNQGEDMSEINKTYISLIPKCNKPKYMGELRPISLCNVVYKLIAKALANRLKEVLNAIISPNQSAFVPGRLITDN

Query:  VVLGFECIYAINSKKQGKEWSISMKIDMSKAYDRVEWSFLDKIMDRMGFSEIWRRRVMNCITSVSYSVLVNGYPSEEFRPNRGIRQGDPLSP-TFSSCVP
        V++ FE ++ +N K +GKE  +S+K+DMSKA+DRVEWSF+  +M+++GF E W   +MNC++SVSYSVL+NG       P+RGIRQGDPLSP  F  C  
Subjt:  VVLGFECIYAINSKKQGKEWSISMKIDMSKAYDRVEWSFLDKIMDRMGFSEIWRRRVMNCITSVSYSVLVNGYPSEEFRPNRGIRQGDPLSP-TFSSCVP

Query:  KIFQLFLTGRNPTLISKVLKSIIVAPHYLICFFFADDSLIFFGALEKECTTIKKVLLTYEEALGQVVNFEKSAFMTSKNVSSSLETKCGRILGVVSTKSL
         +  L         I+ +  SI      +   FFADDSL+F  A E+EC  +  +L  YEEA GQ +N +KS+   S N S  L      ILG +     
Subjt:  KIFQLFLTGRNPTLISKVLKSIIVAPHYLICFFFADDSLIFFGALEKECTTIKKVLLTYEEALGQVVNFEKSAFMTSKNVSSSLETKCGRILGVVSTKSL

Query:  GNYLGMPYTNNKSKNRMFAKIKDSVWKVLQSWKEKLFLAGGKKVLIKAVAQAIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWKFLCKS
          YLG+P    KSK ++FA++KD V K L  WK KL   GG+++LIKAVAQA+P YTMSCF+LP  +C+++ ++   FWWG    + KI W SW+ +C+S
Subjt:  GNYLGMPYTNNKSKNRMFAKIKDSVWKVLQSWKEKLFLAGGKKVLIKAVAQAIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWKFLCKS

Query:  KKMGGLGFRDLLSFNQAMLAKLSWRVIKDPSSLLARTLKGKYFHDQSFLEASLEILPS
        K  GG+GFR++ +FN AMLAK  WR++ +P+SL+AR  K KYF     L +     PS
Subjt:  KKMGGLGFRDLLSFNQAMLAKLSWRVIKDPSSLLARTLKGKYFHDQSFLEASLEILPS

XP_028090832.1 uncharacterized protein LOC114291041 [Camellia sinensis]5.5e-13741.69Show/hide
Query:  WNSEGPFSPKNINSKIQSCIKSLAAWNQLRLKGSLKRAISRTEEEIN---LPQANFSSSNADVLLAKEKKLDVLLEEEENYWKIRSREEWLKWGDKNTKW
        W      + +++ S I     SL  W++  + G +KR +    +++       +NF    A++   +   +D LLE+E  +W  R+R  WLK GD+NT +
Subjt:  WNSEGPFSPKNINSKIQSCIKSLAAWNQLRLKGSLKRAISRTEEEIN---LPQANFSSSNADVLLAKEKKLDVLLEEEENYWKIRSREEWLKWGDKNTKW

Query:  L--------------------------QPSISRGFFSSFDPKSSA-----IDEVMEAIKNRINENDSQQMDKVFTEAEIHRALKSMSPTKAPGPDGAHAL
                                   Q  +   F + F    SA     +  +++ I+ R++++ + Q+ + F+E E+H AL  M PTKAPGPDG  AL
Subjt:  L--------------------------QPSISRGFFSSFDPKSSA-----IDEVMEAIKNRINENDSQQMDKVFTEAEIHRALKSMSPTKAPGPDGAHAL

Query:  FFQNYWSNIGKEVTTVCLGVLNQGEDMSEINKTYISLIPKCNKPKYMGELRPISLCNVVYKLIAKALANRLKEVLNAIISPNQSAFVPGRLITDNVVLGF
        FFQ +W  +G  V++V LGVLN  +++S IN T+I LIPK   PK M E RPISLCNVVYK+I+K LANRLKEVL ++IS  QSAF+PGRLITDN ++ F
Subjt:  FFQNYWSNIGKEVTTVCLGVLNQGEDMSEINKTYISLIPKCNKPKYMGELRPISLCNVVYKLIAKALANRLKEVLNAIISPNQSAFVPGRLITDNVVLGF

Query:  ECIYAINSKKQGKEWSISMKIDMSKAYDRVEWSFLDKIMDRMGFSEIWRRRVMNCITSVSYSVLVNGYPSEEFRPNRGIRQGDPLSP-TFSSCVPKIFQL
        E  + + +K+ GK+  + +K+DMSK YDRVEW F++++M  +GF + + + +MNC++++S+SVLVNG  S  F P RG+RQGDPLSP  F+ C   +  L
Subjt:  ECIYAINSKKQGKEWSISMKIDMSKAYDRVEWSFLDKIMDRMGFSEIWRRRVMNCITSVSYSVLVNGYPSEEFRPNRGIRQGDPLSP-TFSSCVPKIFQL

Query:  FLTGRNPTLISKVLKSIIVAPHYLICFFFADDSLIFFGALEKECTTIKKVLLTYEEALGQVVNFEKSAFMTSKNVSSSLETKCGRILGVVSTKSLGNYLG
                 +S +  SI      L   FFA+DSL+F  A E E   ++ +L  YE+A  Q +NF+KSA   SKN     +     ILGV S  S G YLG
Subjt:  FLTGRNPTLISKVLKSIIVAPHYLICFFFADDSLIFFGALEKECTTIKKVLLTYEEALGQVVNFEKSAFMTSKNVSSSLETKCGRILGVVSTKSLGNYLG

Query:  MPYTNNKSKNRMFAKIKDSVWKVLQSWKEKLFLAGGKKVLIKAVAQAIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWKFLCKSKKMGG
        +PY    SK  +   +K+ VW  L  WKEK+    G++VLIK+VAQAIP Y MSCFRLP+G+C EI ++   FWWG     RKIH  SWK L KSK  GG
Subjt:  MPYTNNKSKNRMFAKIKDSVWKVLQSWKEKLFLAGGKKVLIKAVAQAIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWKFLCKSKKMGG

Query:  LGFRDLLSFNQAMLAKLSWRVIKDPSSLLARTLKGKYFHDQSFLEASLEILPSLGGAS-CGEKTCSLKDIGGEW
        +GFR    FN A+LAK  WR+++ P SL+AR  K +YF   SFLEA +   PS    S  G +  SL ++G  W
Subjt:  LGFRDLLSFNQAMLAKLSWRVIKDPSSLLARTLKGKYFHDQSFLEASLEILPSLGGAS-CGEKTCSLKDIGGEW

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]6.5e-13842.84Show/hide
Query:  DSSTTIIKYCW-NSEGPFSP-KNINSKIQSCIKSLAAWNQLRLKGSLKRAISRTEEEIN-LPQANFSSSNADVLLAKEKKLDVLLEEEENYWKIRSREEW
        D    +I+  W N +G       +  KI++C   L AW    +      AI   +++++ L +   + ++    L   KK+D LL+++E YW  RSR  W
Subjt:  DSSTTIIKYCW-NSEGPFSP-KNINSKIQSCIKSLAAWNQLRLKGSLKRAISRTEEEIN-LPQANFSSSNADVLLAKEKKLDVLLEEEENYWKIRSREEW

Query:  LKWGDKNTKWLQPSIS--------RGFFSS------------------FDPKSSA-----IDEVMEAIKNRINENDSQQMDKVFTEAEIHRALKSMSPTK
        L+ GD+NTK+     S        RG  +S                  FD    A     ++E ++A+  ++ E+  + +   FT  E+  AL  M PTK
Subjt:  LKWGDKNTKWLQPSIS--------RGFFSS------------------FDPKSSA-----IDEVMEAIKNRINENDSQQMDKVFTEAEIHRALKSMSPTK

Query:  APGPDGAHALFFQNYWSNIGKEVTTVCLGVLNQGEDMSEINKTYISLIPKCNKPKYMGELRPISLCNVVYKLIAKALANRLKEVLNAIISPNQSAFVPGR
        APGPDG +ALF+Q +W  +G  V +  L  LN G  + EIN T I LIPK   P+ M E RPISLCNV+YK+I+K LANRLK+VL  IIS  QSAFVPGR
Subjt:  APGPDGAHALFFQNYWSNIGKEVTTVCLGVLNQGEDMSEINKTYISLIPKCNKPKYMGELRPISLCNVVYKLIAKALANRLKEVLNAIISPNQSAFVPGR

Query:  LITDNVVLGFECIYAINSKKQGKEWSISMKIDMSKAYDRVEWSFLDKIMDRMGFSEIWRRRVMNCITSVSYSVLVNGYPSEEFRPNRGIRQGDPLSP-TF
        LITDNV++ +E ++ ++++K+GK+  +++K+D+SKAYDRVEW FL  IM++MGF   W  RVM+C+T+ S+S+LVNG P E  +P+RGIRQGDP+SP  F
Subjt:  LITDNVVLGFECIYAINSKKQGKEWSISMKIDMSKAYDRVEWSFLDKIMDRMGFSEIWRRRVMNCITSVSYSVLVNGYPSEEFRPNRGIRQGDPLSP-TF

Query:  SSCVPKIFQLFLTGRNPTLISKVLKSIIVAPHYLICFFFADDSLIFFGALEKECTTIKKVLLTYEEALGQVVNFEKSAFMTSKNVSSSLETKCGRILGVV
          C   +  L        +I+ V  SI      +    FADDSL+F  A   E  TI ++L  YE A GQ +N EKS+   S N S   + +   ILGV 
Subjt:  SSCVPKIFQLFLTGRNPTLISKVLKSIIVAPHYLICFFFADDSLIFFGALEKECTTIKKVLLTYEEALGQVVNFEKSAFMTSKNVSSSLETKCGRILGVV

Query:  STKSLGNYLGMPYTNNKSKNRMFAKIKDSVWKVLQSWKEKLFLAGGKKVLIKAVAQAIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWK
               YLG+P    ++K   F+++KD VWK LQ WK  L    GK++LIKAVAQAIP YTMS F++P  +C E+  +CA+FWWG    +RKIHWKSW 
Subjt:  STKSLGNYLGMPYTNNKSKNRMFAKIKDSVWKVLQSWKEKLFLAGGKKVLIKAVAQAIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWK

Query:  FLCKSKKMGGLGFRDLLSFNQAMLAKLSWRVIKDPSSLLARTLKGKYFHDQSFLEA
         L   KK GG+GFRDL +FN AMLAK  WR+++   SLL R  K +YF   SFLEA
Subjt:  FLCKSKKMGGLGFRDLLSFNQAMLAKLSWRVIKDPSSLLARTLKGKYFHDQSFLEA

TrEMBL top hitse value%identityAlignment
A0A2K3NHG3 Ribonuclease H (Fragment)6.6e-13641.57Show/hide
Query:  NINSKIQSCIKSLAAWNQLRLKGSLKRAISRTEEEINLPQANFSSSNADV-LLAKEKKLDVLLEEEENYWKIRSREEWLKWGDKNTKW--LQPSISRG--
        +I  K+ + + SL  W   +  G + + I + ++++ L Q    + N    +  KE++LD +LE EE +WK RSRE WL+ GDKNTK+  ++ +I R   
Subjt:  NINSKIQSCIKSLAAWNQLRLKGSLKRAISRTEEEINLPQANFSSSNADV-LLAKEKKLDVLLEEEENYWKIRSREEWLKWGDKNTKW--LQPSISRG--

Query:  -------------------------FFSSFDPKSSA--IDEVMEAIKNRINENDSQQMDKVFTEAEIHRALKSMSPTKAPGPDGAHALFFQNYWSNIGKE
                                  F +   K     I E ++ ++ RI+++  Q++ K+FT+ E+ +A+K M    APGPDG  ALF+ NYW  IG++
Subjt:  -------------------------FFSSFDPKSSA--IDEVMEAIKNRINENDSQQMDKVFTEAEIHRALKSMSPTKAPGPDGAHALFFQNYWSNIGKE

Query:  VTTVCLGVLNQGEDMSEINKTYISLIPKCNKPKYMGELRPISLCNVVYKLIAKALANRLKEVLNAIISPNQSAFVPGRLITDNVVLGFECIYAINSKKQG
        VT + L +LN   D S +N TYI LIPK N P    + RPISLCNV  K++ K LANR+K++L  +ISPNQSAF+ GRLITDN ++  E  + +   K+ 
Subjt:  VTTVCLGVLNQGEDMSEINKTYISLIPKCNKPKYMGELRPISLCNVVYKLIAKALANRLKEVLNAIISPNQSAFVPGRLITDNVVLGFECIYAINSKKQG

Query:  KEWSISMKIDMSKAYDRVEWSFLDKIMDRMGFSEIWRRRVMNCITSVSYSVLVNGYPSEEFRPNRGIRQGDPLSP-TFSSCVPKIFQLFLTGRNPTLISK
        K   + +K DM+KAYDRVEW FL+  +  MGF +     +MNC+ +V +S+L+NG+PS+EF P RG+RQGDPLSP  F  C   +  L    +   LI  
Subjt:  KEWSISMKIDMSKAYDRVEWSFLDKIMDRMGFSEIWRRRVMNCITSVSYSVLVNGYPSEEFRPNRGIRQGDPLSP-TFSSCVPKIFQLFLTGRNPTLISK

Query:  VLKSIIVAPHYLICFFFADDSLIFFGALEKECTTIKKVLLTYEEALGQVVNFEKSAFMTSKNVSSSLETKCGRILGVVSTKSLGNYLGMPYTNNKSKNRM
        V+  I      +   FFADDSL+F  A  KE +TI+ ++  Y+ A GQ+VNF KS  + SK VS+ ++     IL +        YLGMP    +SK ++
Subjt:  VLKSIIVAPHYLICFFFADDSLIFFGALEKECTTIKKVLLTYEEALGQVVNFEKSAFMTSKNVSSSLETKCGRILGVVSTKSLGNYLGMPYTNNKSKNRM

Query:  FAKIKDSVWKVLQSWKEKLFLAGGKKVLIKAVAQAIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWKFLCKSKKMGGLGFRDLLSFNQA
        F  I++ +WK L+ WKEK     G+  LIKAVAQAIP Y MS F LP G+C ++  +   FWWGS++  RKIHW  W  +CK K  GG+GFRDL +FN+A
Subjt:  FAKIKDSVWKVLQSWKEKLFLAGGKKVLIKAVAQAIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWKFLCKSKKMGGLGFRDLLSFNQA

Query:  MLAKLSWRVIKDPSSLLARTLKGKYFHDQSFLEAS
        +LAK  WR+I  P SL+A+ LK KY+    FL+AS
Subjt:  MLAKLSWRVIKDPSSLLARTLKGKYFHDQSFLEAS

A0A2N9GQ35 Reverse transcriptase domain-containing protein7.8e-13741.77Show/hide
Query:  INSKIQSCIKSLAAWNQLR---LKGSLKRAISRTEEEINLPQANFSSSNADVLLAKEKKLDVLLEEEENYWKIRSREEWLKWGDKNTKWL--------QP
        +  K++ C  SL AW+Q R   L  S+K    + + E NL  +  SS     L+  + +L+ LLE+EE +W+ RSR  W+  GDKNTK+         Q 
Subjt:  INSKIQSCIKSLAAWNQLR---LKGSLKRAISRTEEEINLPQANFSSSNADVLLAKEKKLDVLLEEEENYWKIRSREEWLKWGDKNTKWL--------QP

Query:  SISRG-------------------------FFSSFDPKSSAIDEVMEAIKNRINENDSQQMDKVFTEAEIHRALKSMSPTKAPGPDGAHALFFQNYWSNI
        ++ RG                          F+S  P    I+  +E + + +  + +  +   FTE E+  AL+ M PTKAPGPDG  A+F+Q YW  +
Subjt:  SISRG-------------------------FFSSFDPKSSAIDEVMEAIKNRINENDSQQMDKVFTEAEIHRALKSMSPTKAPGPDGAHALFFQNYWSNI

Query:  GKEVTTVCLGVLNQGEDMSEINKTYISLIPKCNKPKYMGELRPISLCNVVYKLIAKALANRLKEVLNAIISPNQSAFVPGRLITDNVVLGFECIYAINSK
        G EVT   L +++ G  +S+IN T+I+L+PK   P+ + + RPI+LCNV+YK+I+K LANRLK++L  I+S +QSAFVPGRLITDNV++ FE +++++ K
Subjt:  GKEVTTVCLGVLNQGEDMSEINKTYISLIPKCNKPKYMGELRPISLCNVVYKLIAKALANRLKEVLNAIISPNQSAFVPGRLITDNVVLGFECIYAINSK

Query:  KQGKEWSISMKIDMSKAYDRVEWSFLDKIMDRMGFSEIWRRRVMNCITSVSYSVLVNGYPSEEFRPNRGIRQGDPLSP-TFSSCVPKIFQLFLTGRNPTL
        + G++  +++K+DMSKAYDRVEWSFL+ IM R+GF+E W   +M CI SVSYSVL+NG     F  +RGIRQGD LSP  F  C   +  L         
Subjt:  KQGKEWSISMKIDMSKAYDRVEWSFLDKIMDRMGFSEIWRRRVMNCITSVSYSVLVNGYPSEEFRPNRGIRQGDPLSP-TFSSCVPKIFQLFLTGRNPTL

Query:  ISKVLKSIIVAPHYLICFFFADDSLIFFGALEKECTTIKKVLLTYEEALGQVVNFEKSAFMTSKNVSSSLETKCGRILGVVSTKSLGNYLGMPYTNNKSK
        I+ V  S   +   L   FFADDSL+F  A    C T+  +L  YEEA GQ +N  K++   +KN + ++  +   +  V   KS   YLG+P    +SK
Subjt:  ISKVLKSIIVAPHYLICFFFADDSLIFFGALEKECTTIKKVLLTYEEALGQVVNFEKSAFMTSKNVSSSLETKCGRILGVVSTKSLGNYLGMPYTNNKSK

Query:  NRMFAKIKDSVWKVLQSWKEKLFLAGGKKVLIKAVAQAIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWKFLCKSKKMGGLGFRDLLSF
        +  F  +K  VW+ +  WKEK   + G+++L+KAVAQ+IP YTMSCF+LP  +C ++N++ + FWWG     RK HW  W  +CKSK  GGLGFRD+  F
Subjt:  NRMFAKIKDSVWKVLQSWKEKLFLAGGKKVLIKAVAQAIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWKFLCKSKKMGGLGFRDLLSF

Query:  NQAMLAKLSWRVIKDPSSLLARTLKGKYFHDQSFLEASLEILPS
        N+A+LAK  WR ++  +SLL+R  K KYF   SFLEA +   PS
Subjt:  NQAMLAKLSWRVIKDPSSLLARTLKGKYFHDQSFLEASLEILPS

A0A7N2LIH6 Uncharacterized protein6.0e-13740.36Show/hide
Query:  IDSSTTIIKYCWNSEGPFSPKNINSKIQSCIKSLAAWNQLRLKGSLKRAISRTEEEI-NLPQANFSSSNADVLLAKEKKLDVLLEEEENYWKIRSREEWL
        ++    I++  W+     S   +  +++ C K L  WNQ    G++ + I + +  +  L   N     A+ +   +K+++ L   EE  WK RSR  WL
Subjt:  IDSSTTIIKYCWNSEGPFSPKNINSKIQSCIKSLAAWNQLRLKGSLKRAISRTEEEI-NLPQANFSSSNADVLLAKEKKLDVLLEEEENYWKIRSREEWL

Query:  KWGDKNTKWLQPSIS---------------------------------RGFFSSFDPKSSAIDEVMEAIKNRINENDSQQMDKVFTEAEIHRALKSMSPT
        ++GDKN+K+   + S                                 +  +SS  P S   D  +EA+  R+    + ++ K F   E+ +AL+ M PT
Subjt:  KWGDKNTKWLQPSIS---------------------------------RGFFSSFDPKSSAIDEVMEAIKNRINENDSQQMDKVFTEAEIHRALKSMSPT

Query:  KAPGPDGAHALFFQNYWSNIGKEVTTVCLGVLNQGEDMSEINKTYISLIPKCNKPKYMGELRPISLCNVVYKLIAKALANRLKEVLNAIISPNQSAFVPG
        KAPGPDG   +F+Q YW  +G  VT   L  LN G    +INKTYI LIPK   P+ + E RPISLCNV+YK+I+K LANRLK+VL+ +I   QSAFVPG
Subjt:  KAPGPDGAHALFFQNYWSNIGKEVTTVCLGVLNQGEDMSEINKTYISLIPKCNKPKYMGELRPISLCNVVYKLIAKALANRLKEVLNAIISPNQSAFVPG

Query:  RLITDNVVLGFECIYAINSKKQGKEWSISMKIDMSKAYDRVEWSFLDKIMDRMGFSEIWRRRVMNCITSVSYSVLVNGYPSEEFRPNRGIRQGDPLSP-T
        R+ITDNV++ FE +++IN +++GKE  +++K+DMSKAYDRVEW++L+ +M +MGF + W   +M C+TSVS+SVL+NG P   F P+RG+RQGDP+SP  
Subjt:  RLITDNVVLGFECIYAINSKKQGKEWSISMKIDMSKAYDRVEWSFLDKIMDRMGFSEIWRRRVMNCITSVSYSVLVNGYPSEEFRPNRGIRQGDPLSP-T

Query:  FSSCVPKIFQLFLTGRNPTLISKVLKSIIVAPHYLICFFFADDSLIFFGALEKECTTIKKVLLTYEEALGQVVNFEKSAFMTSKNVSSSLETKCGRILGV
        F  C   +  +        LI  V+ +   AP  +   FFADDS+IF  A   EC  + KVL  YEE  GQ +N +K++   S+N    ++     I G 
Subjt:  FSSCVPKIFQLFLTGRNPTLISKVLKSIIVAPHYLICFFFADDSLIFFGALEKECTTIKKVLLTYEEALGQVVNFEKSAFMTSKNVSSSLETKCGRILGV

Query:  VSTKSLGNYLGMPYTNNKSKNRMFAKIKDSVWKVLQSWKEKLFLAGGKKVLIKAVAQAIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSW
           +    YLG+P    ++K + F +IKD V + +  WK KL    G++VLIKAVAQA P YTM+ F+LP+ +C E+N++   FWWG    ++K+ W SW
Subjt:  VSTKSLGNYLGMPYTNNKSKNRMFAKIKDSVWKVLQSWKEKLFLAGGKKVLIKAVAQAIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSW

Query:  KFLCKSKKMGGLGFRDLLSFNQAMLAKLSWRVIKDPSSLLARTLKGKYFHDQSFLEASLEILPS
        K LCK K  GG+GF+DL +FN A+LAK  WR+ ++P+SL  R LK KYF + SF+EA L   PS
Subjt:  KFLCKSKKMGGLGFRDLLSFNQAMLAKLSWRVIKDPSSLLARTLKGKYFHDQSFLEASLEILPS

A0A803NM27 Uncharacterized protein3.0e-13640.64Show/hide
Query:  IIKYCWNSEGPFSPKNINSKIQSCIKSLAAWNQLRLKGSLKRAISRTEEEIN-LPQANF-SSSNADVLLAKEKKLDVLLEEEENYWKIRSREEWLKWGDK
        II +CWNS      + +N  +  C   L +W++ +  G+ K+ I+  +  +  L   +  S+S+   +   E  LD LL  EE YWK R+R +WL+ GD+
Subjt:  IIKYCWNSEGPFSPKNINSKIQSCIKSLAAWNQLRLKGSLKRAISRTEEEIN-LPQANF-SSSNADVLLAKEKKLDVLLEEEENYWKIRSREEWLKWGDK

Query:  NTKWLQPSISRGF---------------------------------FSSFDPKSSAIDEVMEAIKNRINENDSQQMDKVFTEAEIHRALKSMSPTKAPGP
        NTK+     S  F                                 F +      A+  V+  I   + +  ++ + + FT A++  ALK+M+   +PG 
Subjt:  NTKWLQPSISRGF---------------------------------FSSFDPKSSAIDEVMEAIKNRINENDSQQMDKVFTEAEIHRALKSMSPTKAPGP

Query:  DGAHALFFQNYWSNIGKEVTTVCLGVLNQGEDMSEINKTYISLIPKCNKPKYMGELRPISLCNVVYKLIAKALANRLKEVLNAIISPNQSAFVPGRLITD
        DG  ALF+Q+ W  +G  VT   L VLN+G     +NKT I+LIPK  KPK M + RPISLCNVVYKLI+K+L  R K VL  +IS  QSAF+P RLITD
Subjt:  DGAHALFFQNYWSNIGKEVTTVCLGVLNQGEDMSEINKTYISLIPKCNKPKYMGELRPISLCNVVYKLIAKALANRLKEVLNAIISPNQSAFVPGRLITD

Query:  NVVLGFECIYAINSKKQGKEWSISMKIDMSKAYDRVEWSFLDKIMDRMGFSEIWRRRVMNCITSVSYSVLVNGYPSEEFRPNRGIRQGDPLSP-TFSSCV
        N+++ FE ++ +  K +GK+   ++K+DMSKA+DRVEWSFL  +M +MGFS  W   ++NC+ +   S ++NG  S   +P RG+RQGDPLSP  F  C 
Subjt:  NVVLGFECIYAINSKKQGKEWSISMKIDMSKAYDRVEWSFLDKIMDRMGFSEIWRRRVMNCITSVSYSVLVNGYPSEEFRPNRGIRQGDPLSP-TFSSCV

Query:  PKIFQLFLTGRNPTLISKVLKSIIVAPH--YLICFFFADDSLIFFGALEKECTTIKKVLLTYEEALGQVVNFEKSAFMTSKNVSSSLETKCGRILGVVST
          + +L     +       LK + V+ H   +    FADDSL+F  A ++ C  IK+VL TY +A GQ +N +KS    S N S + +     ILG+   
Subjt:  PKIFQLFLTGRNPTLISKVLKSIIVAPH--YLICFFFADDSLIFFGALEKECTTIKKVLLTYEEALGQVVNFEKSAFMTSKNVSSSLETKCGRILGVVST

Query:  KSLGNYLGMPYTNNKSKNRMFAKIKDSVWKVLQSWKEKLFLAGGKKVLIKAVAQAIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWKFL
        +   +YLG+P  + + K ++F +IK+ +WK+L +W +K+F  GGK+VL+KAV Q+IP Y MSCF+LP   C EI ++ + FWWGSTS K+KIHWK WKFL
Subjt:  KSLGNYLGMPYTNNKSKNRMFAKIKDSVWKVLQSWKEKLFLAGGKKVLIKAVAQAIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWKFL

Query:  CKSKKMGGLGFRDLLSFNQAMLAKLSWRVIKDPSSLLARTLKGKYFHDQSFLEASLEILPSLGGASCGEKTCSLKDIGGEWGME
        CKSK  GGLGFR+ + FNQA+LAK +WR+ ++P SLL R LKG+YF    FL A          A+CG    SL   G  WG E
Subjt:  CKSKKMGGLGFRDLLSFNQAMLAKLSWRVIKDPSSLLARTLKGKYFHDQSFLEASLEILPSLGGASCGEKTCSLKDIGGEWGME

A0A803PBM9 Uncharacterized protein5.0e-13640.24Show/hide
Query:  DSSTTIIKYCWNSEGPF-SPKNINSKIQSCIKSLAAWNQLRLKGSLKRAISRTEEEINLPQANFSSSNADVLLAKEKKLDVLLEEEENYWKIRSREEWLK
        +  T I+K  W+  G   +   +  K+  C K+L  WN+ R K  +K+ +   EE+I +   + ++ +   L   E+K +VLL++EE +W+ RSR  WLK
Subjt:  DSSTTIIKYCWNSEGPF-SPKNINSKIQSCIKSLAAWNQLRLKGSLKRAISRTEEEINLPQANFSSSNADVLLAKEKKLDVLLEEEENYWKIRSREEWLK

Query:  WGDKNTKWLQPSIS---------------------------------RGFFSSFDPKSSAIDEVMEAIKNRINENDSQQMDKVFTEAEIHRALKSMSPTK
         GD+NTK+     +                                 +  F+S     + ++E    + N+I+   ++ +   FT+ +++ A++++ P K
Subjt:  WGDKNTKWLQPSIS---------------------------------RGFFSSFDPKSSAIDEVMEAIKNRINENDSQQMDKVFTEAEIHRALKSMSPTK

Query:  APGPDGAHALFFQNYWSNIGKEVTTVCLGVLNQGEDMSEINKTYISLIPKCNKPKYMGELRPISLCNVVYKLIAKALANRLKEVLNAIISPNQSAFVPGR
        APG DG   LF++ YW  IG+EVT VCLG+LN+G  ++EIN T I LIPK  KP  M   RPISLCNV+YK++AK LA R K  L   IS  QSAFV GR
Subjt:  APGPDGAHALFFQNYWSNIGKEVTTVCLGVLNQGEDMSEINKTYISLIPKCNKPKYMGELRPISLCNVVYKLIAKALANRLKEVLNAIISPNQSAFVPGR

Query:  LITDNVVLGFECIYAINSKKQGKEWSISMKIDMSKAYDRVEWSFLDKIMDRMGFSEIWRRRVMNCITSVSYSVLVNGYPSEEFRPNRGIRQGDPLSP-TF
        LI DN ++GFE ++ +  ++ G    +++K+DMSKAYDRVEW FL  +M  +G+ E W  ++M C+TSVS+SVL+NG    +F P RG+RQGD LSP  F
Subjt:  LITDNVVLGFECIYAINSKKQGKEWSISMKIDMSKAYDRVEWSFLDKIMDRMGFSEIWRRRVMNCITSVSYSVLVNGYPSEEFRPNRGIRQGDPLSP-TF

Query:  SSCVPKIFQLFLTGRNPTLISKVL--KSIIVAPHYLICFFFADDSLIFFGALEKECTTIKKVLLTYEEALGQVVNFEKSAFMTSKNVSSSLETKCGRILG
          C   +  L         I+ V   K  +   H     FFADDS +F    E EC T+  +L  Y    GQ +N EKS       +SS L       LG
Subjt:  SSCVPKIFQLFLTGRNPTLISKVL--KSIIVAPHYLICFFFADDSLIFFGALEKECTTIKKVLLTYEEALGQVVNFEKSAFMTSKNVSSSLETKCGRILG

Query:  VVSTKSLGNYLGMPYTNNKSKNRMFAKIKDSVWKVLQSWKEKLFLAGGKKVLIKAVAQAIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKS
        V        YLG+P    + K  +F  IKD VW  L+SWK  +F   GK++LIKAV QAIP Y+MSCFRLP  +   ++++ A FWWG T   +KIHW +
Subjt:  VVSTKSLGNYLGMPYTNNKSKNRMFAKIKDSVWKVLQSWKEKLFLAGGKKVLIKAVAQAIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKS

Query:  WKFLCKSKKMGGLGFRDLLSFNQAMLAKLSWRVIKDPSSLLARTLKGKYFHDQSFLEASLE
        W  LCK K+ GGLGFR L  FNQA+LAK  WR+I  P SLLAR LK  Y+ + SFL+A  +
Subjt:  WKFLCKSKKMGGLGFRDLLSFNQAMLAKLSWRVIKDPSSLLARTLKGKYFHDQSFLEASLE

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein2.1e-3024.63Show/hide
Query:  RINENDSQQMDKVFTEAEIHRALKSMSPTKAPGPDGAHALFFQNYWSNIGKEVTTVCLGVLNQGEDMSEINKTYISLIPKCNKPKYMGE-LRPISLCNVV
        R+N+ + + +++  T +EI   + S+   K+PGPDG  A F+Q Y   +   +  +   +  +G   +   +  I LIPK  +     E  RPISL N+ 
Subjt:  RINENDSQQMDKVFTEAEIHRALKSMSPTKAPGPDGAHALFFQNYWSNIGKEVTTVCLGVLNQGEDMSEINKTYISLIPKCNKPKYMGE-LRPISLCNVV

Query:  YKLIAKALANRLKEVLNAIISPNQSAFVPGRLITDNVVLGFECIYAINSKKQGKEWSISMKIDMSKAYDRVEWSFLDKIMDRMGFSEIWRRRVMNCITSV
         K++ K LANR+++ +  +I  +Q  F+PG     N+      I  IN  K      IS  ID  KA+D+++  F+ K ++++G   ++ + +       
Subjt:  YKLIAKALANRLKEVLNAIISPNQSAFVPGRLITDNVVLGFECIYAINSKKQGKEWSISMKIDMSKAYDRVEWSFLDKIMDRMGFSEIWRRRVMNCITSV

Query:  SYSVLVNGYPSEEFRPNRGIRQGDPLSPTFSSCVPKIFQLFLTGRNPTLISKVLKSIIVAPHYLICFFFADDSLIFFGALEKECTTIKKVLLTYEEALGQ
        + ++++NG   E F    G RQG PLSP   + V ++    +         K +K I +    +    FADD +++          + K++  + +  G 
Subjt:  SYSVLVNGYPSEEFRPNRGIRQGDPLSPTFSSCVPKIFQLFLTGRNPTLISKVLKSIIVAPHYLICFFFADDSLIFFGALEKECTTIKKVLLTYEEALGQ

Query:  VVNFEKSAFMTSKNVSSSLETKCGRILGVVSTKSLGNYLGMPYTNN------KSKNRMFAKIKDSV--WKVLQ-SWKEKLFLAGGKKVLIKAVA---QAI
         +N +KS      N   +     G +   +++K +  YLG+  T +      ++   +  +IK+    WK +  SW  ++ +   K  ++  V     AI
Subjt:  VVNFEKSAFMTSKNVSSSLETKCGRILGVVSTKSLGNYLGMPYTNN------KSKNRMFAKIKDSV--WKVLQ-SWKEKLFLAGGKKVLIKAVA---QAI

Query:  PVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWKFLCKSKKMGGLGFRDLLSFNQAMLAKLSW
        P+      +LP     E+     KF W     KR    KS   L +  K GG+   D   + +A + K +W
Subjt:  PVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWKFLCKSKKMGGLGFRDLLSFNQAMLAKLSW

P0C2F6 Putative ribonuclease H protein At1g657501.3e-1935.04Show/hide
Query:  MPYTNNKSKNRMFAKIKDSVWKVLQSWKEKLFLAGGKKVLIKAVAQAIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWKFLCKSKKMGG
        MP    +     F +I + V   +  W+EK     G+  L KAV  ++PV++MS   LP  I   ++ +   F WGST+ K+K H   W  +C  KK GG
Subjt:  MPYTNNKSKNRMFAKIKDSVWKVLQSWKEKLFLAGGKKVLIKAVAQAIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWKFLCKSKKMGG

Query:  LGFRDLLSFNQAMLAKLSWRVIKDPSSLLARTLKGKY
        LG R   S N+A+++K+ WR++++ +SL    L+ KY
Subjt:  LGFRDLLSFNQAMLAKLSWRVIKDPSSLLARTLKGKY

P11369 LINE-1 retrotransposable element ORF2 protein4.4e-3623.95Show/hide
Query:  YIDSSTTIIKYCWNSEGPFSPKNINSKIQSCIKSLAAWNQLRLKGSLKRAISRTE-------EEINLPQANFSSSNADVLL---AKEKKLDVLLEEEENY
        +  S TT +K     E   SPK   S+ Q  IK     NQ+  + +++R I++T         +I+ P A  +  + D +L    + +K D+  + EE  
Subjt:  YIDSSTTIIKYCWNSEGPFSPKNINSKIQSCIKSLAAWNQLRLKGSLKRAISRTE-------EEINLPQANFSSSNADVLL---AKEKKLDVLLEEEENY

Query:  WKIRSREEWLKWGDKNTKWLQPSISRGFFSSFDPKSSAIDEVMEAIKN----RINENDSQQMDKVFTEAEIHRALKSMSPTKAPGPDGAHALFFQNYWSN
                           +Q +I   +   +  K   +DE+ + +      ++N++    ++   +  EI   + S+   K+PGPDG  A F+Q +  +
Subjt:  WKIRSREEWLKWGDKNTKWLQPSISRGFFSSFDPKSSAIDEVMEAIKN----RINENDSQQMDKVFTEAEIHRALKSMSPTKAPGPDGAHALFFQNYWSN

Query:  IGKEVTTVCLGVLNQGEDMSEINKTYISLIPKCNK-PKYMGELRPISLCNVVYKLIAKALANRLKEVLNAIISPNQSAFVPGRLITDNVVLGFECIYAIN
        +   +  +   +  +G   +   +  I+LIPK  K P  +   RPISL N+  K++ K LANR++E + AII P+Q  F+PG     N+      I+ IN
Subjt:  IGKEVTTVCLGVLNQGEDMSEINKTYISLIPKCNK-PKYMGELRPISLCNVVYKLIAKALANRLKEVLNAIISPNQSAFVPGRLITDNVVLGFECIYAIN

Query:  SKKQGKEWSISMKIDMSKAYDRVEWSFLDKIMDRMGFSEIWRRRVMNCITSVSYSVLVNGYPSEEFRPNRGIRQGDPLSPTFSSCVPKIFQLFLTGRNPT
          K      IS  +D  KA+D+++  F+ K+++R G    +   +    +    ++ VNG   E      G RQG PLSP   + V ++    +  +   
Subjt:  SKKQGKEWSISMKIDMSKAYDRVEWSFLDKIMDRMGFSEIWRRRVMNCITSVSYSVLVNGYPSEEFRPNRGIRQGDPLSPTFSSCVPKIFQLFLTGRNPT

Query:  LISKVLKSIIVAPHYLICFFFADDSLIFFGALEKECTTIKKVLLTYEEALGQVVNFEKS-AFMTSKNVSSSLETKCGRILGVVSTKSLGNYLGMPYTNNK
           K +K I +    +     ADD +++    +     +  ++ ++ E +G  +N  KS AF+ +KN  +  E +      +V+      YLG+  T   
Subjt:  LISKVLKSIIVAPHYLICFFFADDSLIFFGALEKECTTIKKVLLTYEEALGQVVNFEKS-AFMTSKNVSSSLETKCGRILGVVSTKSLGNYLGMPYTNNK

Query:  SK--NRMFAKIKDSVWKVLQSWKEKLFLAGGKKVLIKAVAQAIPVYTMSC--FRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWKFLCKSKK-MGGLG
            ++ F  +K  + + L+ WK+      G+  ++K       +Y  +    ++P     E+     KF W +   K +I     K L K K+  GG+ 
Subjt:  SK--NRMFAKIKDSVWKVLQSWKEKLFLAGGKKVLIKAVAQAIPVYTMSC--FRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWKFLCKSKK-MGGLG

Query:  FRDLLSFNQAMLAKLSWRVIKD
          DL  + +A++ K +W   +D
Subjt:  FRDLLSFNQAMLAKLSWRVIKD

P14381 Transposon TX1 uncharacterized 149 kDa protein9.8e-2825.84Show/hide
Query:  SISRGFFSSFDPKSSAIDEVMEAIKNRINENDSQQMDKVFTEAEIHRALKSMSPTKAPGPDGAHALFFQNYWSNIGKEVTTVCLGVLNQGEDMSEINKTY
        S  +  FS       A +E+ + +   ++E   ++++   T  E+ +AL+ M   K+PG DG    FFQ +W  +G +   V      +GE      +  
Subjt:  SISRGFFSSFDPKSSAIDEVMEAIKNRINENDSQQMDKVFTEAEIHRALKSMSPTKAPGPDGAHALFFQNYWSNIGKEVTTVCLGVLNQGEDMSEINKTY

Query:  ISLIPKCNKPKYMGELRPISLCNVVYKLIAKALANRLKEVLNAIISPNQSAFVPGRLITDNVVLGFECIYAINSKKQGKEWSISMKIDMSKAYDRVEWSF
        +SL+PK    + +   RP+SL +  YK++AKA++ RLK VL  +I P+QS  VPGR I DNV L  + ++   +++ G   +  + +D  KA+DRV+  +
Subjt:  ISLIPKCNKPKYMGELRPISLCNVVYKLIAKALANRLKEVLNAIISPNQSAFVPGRLITDNVVLGFECIYAINSKKQGKEWSISMKIDMSKAYDRVEWSF

Query:  LDKIMDRMGFSEIWRRRVMNCITSVSYSVLVNGYPSEEFRPNRGIRQGDPLSPTFSSCVPKIFQLFLTGRNPTLISKVLKSIIVAPHYLICFFFADDSLI
        L   +    F   +   +     S    V +N   +      RG+RQG PLS    S   + F   L  R   L+ K     +V   Y      ADD ++
Subjt:  LDKIMDRMGFSEIWRRRVMNCITSVSYSVLVNGYPSEEFRPNRGIRQGDPLSPTFSSCVPKIFQLFLTGRNPTLISKVLKSIIVAPHYLICFFFADDSLI

Query:  FFGAL-----EKECTTIKKVLLTYEEALGQVVNFEKSAFMTSKNVSSSLETKCGRILGVVS--TKSLGNYLGMPYTNNKSKNRMFAKIKDSVWKVLQSWK
            L      +EC  +      Y  A    +N+ KS+ +   ++         R +   S   K LG YL          ++ F ++++ V   L  WK
Subjt:  FFGAL-----EKECTTIKKVLLTYEEALGQVVNFEKSAFMTSKNVSSSLETKCGRILGVVS--TKSLGNYLGMPYTNNKSKNRMFAKIKDSVWKVLQSWK

Query:  --EKLFLAGGKKVLIKAVAQAIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWKFLCKSKKMGGLG
           K+    G+ ++I  +  +   Y + C         +I      F W    GK   HW S        K GG G
Subjt:  --EKLFLAGGKKVLIKAVAQAIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWKFLCKSKKMGGLG

P93295 Uncharacterized mitochondrial protein AtMg003109.1e-2652.78Show/hide
Query:  AIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWKFLCKSKK-MGGLGFRDLLSFNQAMLAKLSWRVIKDPSSLLARTLKGKYFHDQSFLE
        A+PVY MSCFRL   +CK++ +   +FWW S   KRKI W +W+ LCKSK+  GGLGFRDL  FNQA+LAK S+R+I  P +LL+R L+ +YF   S +E
Subjt:  AIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWKFLCKSKK-MGGLGFRDLLSFNQAMLAKLSWRVIKDPSSLLARTLKGKYFHDQSFLE

Query:  ASLEILPS
         S+   PS
Subjt:  ASLEILPS

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein4.7e-0923.79Show/hide
Query:  QSCIKSLAAWNQLRLKGSLKRAISRTEEEINLPQANFSSSNADVLLAKEKKLDVLLEEEENYWKIRSREEWLKWGDKNTKWLQPSISRGFFSSF------
        + C K L       ++   K A+   E   +    N S S   V     KK +      E++++ +SR +WL+ GD NT++    I      +       
Subjt:  QSCIKSLAAWNQLRLKGSLKRAISRTEEEINLPQANFSSSNADVLLAKEKKLDVLLEEEENYWKIRSREEWLKWGDKNTKWLQPSISRGFFSSF------

Query:  --DPKSSAIDEVMEAI---------------------------KNRINENDSQQMDKVFTEAEIHRALKSMSPTKAPGPDGAHALFFQNYWSNIGKEVTT
          D +   + +V E I                             R N+  + ++  + ++ EI  A+ +M   KAPGPD   A FF   W  +      
Subjt:  --DPKSSAIDEVMEAI---------------------------KNRINENDSQQMDKVFTEAEIHRALKSMSPTKAPGPDGAHALFFQNYWSNIGKEVTT

Query:  VCLGVLNQGEDMSEINKTYISLIPKCNKPKYMGELRPISLCNVVYKLI
                G  +   N T I+LIPK      +   RP+S C VVYK+I
Subjt:  VCLGVLNQGEDMSEINKTYISLIPKCNKPKYMGELRPISLCNVVYKLI

AT4G20520.1 RNA binding;RNA-directed DNA polymerases3.7e-1439.02Show/hide
Query:  LANRLKEVLNAIISPNQSAFVPGRLITDNVVLGFECIYAINSKKQGKEWSISMKIDMSKAYDRVEWSFLDKIMDRMGFSEIW
        +  RLK ++  +I P Q++F+PGR+ TDN+V   E ++++  KK  K W + +K+D+ KAYDR+ W +L+  +   GF E+W
Subjt:  LANRLKEVLNAIISPNQSAFVPGRLITDNVVLGFECIYAINSKKQGKEWSISMKIDMSKAYDRVEWSFLDKIMDRMGFSEIW

AT4G29090.1 Ribonuclease H-like superfamily protein6.7e-2443.93Show/hide
Query:  AIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWKFLCKSKKMGGLGFRDLLSFNQAMLAKLSWRVIKDPSSLLARTLKGKYFHDQSFLEA
        A+P YTM+CF LP  +CK+I ++ A FWW +    + +HWK+W  L   K  GG+GF+D+ +FN A+L K  WR++  P SL+A+  K +YFH    L A
Subjt:  AIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWKFLCKSKKMGGLGFRDLLSFNQAMLAKLSWRVIKDPSSLLARTLKGKYFHDQSFLEA

Query:  SLEILPS
         L   PS
Subjt:  SLEILPS

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein6.5e-2752.78Show/hide
Query:  AIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWKFLCKSKK-MGGLGFRDLLSFNQAMLAKLSWRVIKDPSSLLARTLKGKYFHDQSFLE
        A+PVY MSCFRL   +CK++ +   +FWW S   KRKI W +W+ LCKSK+  GGLGFRDL  FNQA+LAK S+R+I  P +LL+R L+ +YF   S +E
Subjt:  AIPVYTMSCFRLPNGICKEINNICAKFWWGSTSGKRKIHWKSWKFLCKSKK-MGGLGFRDLLSFNQAMLAKLSWRVIKDPSSLLARTLKGKYFHDQSFLE

Query:  ASLEILPS
         S+   PS
Subjt:  ASLEILPS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATATTGACAGTTCAACAACCATTATTAAATATTGCTGGAATTCTGAGGGTCCTTTCTCTCCTAAAAACATCAACTCCAAAATTCAAAGTTGCATCAAAAGTCTGGC
AGCTTGGAATCAGCTCAGGCTGAAAGGCTCCCTAAAAAGGGCTATTTCCAGAACCGAGGAAGAAATCAATCTCCCCCAAGCCAATTTCTCTTCTAGCAACGCTGATGTGC
TGCTGGCCAAGGAGAAGAAGCTAGATGTTTTACTTGAGGAGGAAGAGAATTATTGGAAAATTCGGTCTAGAGAAGAGTGGTTGAAGTGGGGGGACAAAAACACCAAGTGG
TTGCAGCCCAGCATTTCAAGAGGCTTTTTTTCTTCATTTGACCCCAAATCCAGTGCTATAGATGAAGTGATGGAGGCTATTAAGAATAGAATCAACGAGAATGATTCTCA
ACAGATGGACAAGGTGTTTACCGAAGCCGAGATCCATAGAGCTTTAAAAAGTATGAGTCCTACCAAAGCCCCGGGTCCTGACGGGGCTCATGCCTTATTCTTCCAAAATT
ATTGGTCGAACATAGGGAAGGAGGTGACGACTGTGTGTTTGGGGGTTCTAAATCAAGGGGAAGATATGTCTGAGATTAACAAGACTTATATTTCGCTTATCCCCAAATGC
AACAAGCCTAAGTACATGGGTGAGTTGAGGCCTATTAGCCTTTGTAATGTGGTGTATAAACTCATTGCTAAAGCCCTTGCTAATAGGCTCAAAGAAGTCCTCAATGCCAT
CATCTCCCCCAACCAGTCGGCTTTCGTCCCTGGGAGGCTCATAACTGATAATGTTGTGCTCGGGTTCGAGTGTATTTATGCCATTAATAGCAAAAAACAAGGTAAGGAGT
GGAGTATTTCTATGAAGATCGACATGAGCAAGGCCTACGATCGTGTCGAATGGAGTTTCTTGGATAAAATCATGGATCGTATGGGATTTAGCGAGATTTGGAGAAGAAGA
GTTATGAACTGCATCACCTCGGTCTCGTATTCTGTCTTGGTTAACGGGTACCCGAGTGAGGAGTTTCGCCCTAACAGGGGTATTAGGCAAGGGGACCCATTATCCCCTAC
CTTTTCCTCATGTGTGCCGAAGATTTTTCAGCTCTTCTTAACAGGGAGGAATCCAACTCTAATCTCAAAGGTTTTAAAATCAATAATCGTTGCCCCTCATTATCTCATTT
GTTTTTTTTTTGCAGATGATAGCCTCATTTTCTTCGGAGCGTTAGAAAAGGAGTGCACCACCATCAAAAAGGTGTTGTTAACGTATGAGGAAGCCTTGGGTCAAGTGGTG
AATTTCGAAAAATCAGCCTTCATGACGAGCAAGAATGTCAGTAGTTCTCTCGAGACTAAATGTGGGAGAATCCTTGGTGTTGTTTCCACAAAGTCTCTTGGAAATTACCT
TGGGATGCCTTATACTAACAACAAAAGCAAAAATAGGATGTTCGCTAAGATCAAAGATAGTGTTTGGAAAGTGCTCCAAAGCTGGAAGGAAAAGCTTTTCTTAGCGGGTG
GCAAAAAGGTTCTGATCAAAGCGGTGGCTCAAGCAATCCCGGTGTACACTATGAGTTGCTTCAGGCTCCCAAATGGCATTTGCAAAGAAATCAACAACATTTGCGCCAAG
TTTTGGTGGGGCTCCACAAGTGGGAAAAGGAAAATTCATTGGAAAAGCTGGAAGTTTTTGTGCAAGAGCAAGAAGATGGGCGGTTTAGGTTTTAGAGACCTCCTCTCTTT
TAACCAAGCCATGCTTGCTAAACTGAGTTGGCGTGTTATCAAAGACCCTTCCAGTCTTCTTGCTAGAACGTTGAAAGGAAAATATTTTCATGACCAATCGTTCCTAGAGG
CCTCTTTGGAAATCCTTCCCTCACTCGGAGGAGCATCATGTGGGGAAAAAACTTGTTCCTTAAAGGATATCGGTGGAGAGTGGGGAATGGAAAGTACATCGAGATTGATA
AAGACCCTTGGATCAATAGAGGGCAGGCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGTATATTGACAGTTCAACAACCATTATTAAATATTGCTGGAATTCTGAGGGTCCTTTCTCTCCTAAAAACATCAACTCCAAAATTCAAAGTTGCATCAAAAGTCTGGC
AGCTTGGAATCAGCTCAGGCTGAAAGGCTCCCTAAAAAGGGCTATTTCCAGAACCGAGGAAGAAATCAATCTCCCCCAAGCCAATTTCTCTTCTAGCAACGCTGATGTGC
TGCTGGCCAAGGAGAAGAAGCTAGATGTTTTACTTGAGGAGGAAGAGAATTATTGGAAAATTCGGTCTAGAGAAGAGTGGTTGAAGTGGGGGGACAAAAACACCAAGTGG
TTGCAGCCCAGCATTTCAAGAGGCTTTTTTTCTTCATTTGACCCCAAATCCAGTGCTATAGATGAAGTGATGGAGGCTATTAAGAATAGAATCAACGAGAATGATTCTCA
ACAGATGGACAAGGTGTTTACCGAAGCCGAGATCCATAGAGCTTTAAAAAGTATGAGTCCTACCAAAGCCCCGGGTCCTGACGGGGCTCATGCCTTATTCTTCCAAAATT
ATTGGTCGAACATAGGGAAGGAGGTGACGACTGTGTGTTTGGGGGTTCTAAATCAAGGGGAAGATATGTCTGAGATTAACAAGACTTATATTTCGCTTATCCCCAAATGC
AACAAGCCTAAGTACATGGGTGAGTTGAGGCCTATTAGCCTTTGTAATGTGGTGTATAAACTCATTGCTAAAGCCCTTGCTAATAGGCTCAAAGAAGTCCTCAATGCCAT
CATCTCCCCCAACCAGTCGGCTTTCGTCCCTGGGAGGCTCATAACTGATAATGTTGTGCTCGGGTTCGAGTGTATTTATGCCATTAATAGCAAAAAACAAGGTAAGGAGT
GGAGTATTTCTATGAAGATCGACATGAGCAAGGCCTACGATCGTGTCGAATGGAGTTTCTTGGATAAAATCATGGATCGTATGGGATTTAGCGAGATTTGGAGAAGAAGA
GTTATGAACTGCATCACCTCGGTCTCGTATTCTGTCTTGGTTAACGGGTACCCGAGTGAGGAGTTTCGCCCTAACAGGGGTATTAGGCAAGGGGACCCATTATCCCCTAC
CTTTTCCTCATGTGTGCCGAAGATTTTTCAGCTCTTCTTAACAGGGAGGAATCCAACTCTAATCTCAAAGGTTTTAAAATCAATAATCGTTGCCCCTCATTATCTCATTT
GTTTTTTTTTTGCAGATGATAGCCTCATTTTCTTCGGAGCGTTAGAAAAGGAGTGCACCACCATCAAAAAGGTGTTGTTAACGTATGAGGAAGCCTTGGGTCAAGTGGTG
AATTTCGAAAAATCAGCCTTCATGACGAGCAAGAATGTCAGTAGTTCTCTCGAGACTAAATGTGGGAGAATCCTTGGTGTTGTTTCCACAAAGTCTCTTGGAAATTACCT
TGGGATGCCTTATACTAACAACAAAAGCAAAAATAGGATGTTCGCTAAGATCAAAGATAGTGTTTGGAAAGTGCTCCAAAGCTGGAAGGAAAAGCTTTTCTTAGCGGGTG
GCAAAAAGGTTCTGATCAAAGCGGTGGCTCAAGCAATCCCGGTGTACACTATGAGTTGCTTCAGGCTCCCAAATGGCATTTGCAAAGAAATCAACAACATTTGCGCCAAG
TTTTGGTGGGGCTCCACAAGTGGGAAAAGGAAAATTCATTGGAAAAGCTGGAAGTTTTTGTGCAAGAGCAAGAAGATGGGCGGTTTAGGTTTTAGAGACCTCCTCTCTTT
TAACCAAGCCATGCTTGCTAAACTGAGTTGGCGTGTTATCAAAGACCCTTCCAGTCTTCTTGCTAGAACGTTGAAAGGAAAATATTTTCATGACCAATCGTTCCTAGAGG
CCTCTTTGGAAATCCTTCCCTCACTCGGAGGAGCATCATGTGGGGAAAAAACTTGTTCCTTAAAGGATATCGGTGGAGAGTGGGGAATGGAAAGTACATCGAGATTGATA
AAGACCCTTGGATCAATAGAGGGCAGGCCATGA
Protein sequenceShow/hide protein sequence
MYIDSSTTIIKYCWNSEGPFSPKNINSKIQSCIKSLAAWNQLRLKGSLKRAISRTEEEINLPQANFSSSNADVLLAKEKKLDVLLEEEENYWKIRSREEWLKWGDKNTKW
LQPSISRGFFSSFDPKSSAIDEVMEAIKNRINENDSQQMDKVFTEAEIHRALKSMSPTKAPGPDGAHALFFQNYWSNIGKEVTTVCLGVLNQGEDMSEINKTYISLIPKC
NKPKYMGELRPISLCNVVYKLIAKALANRLKEVLNAIISPNQSAFVPGRLITDNVVLGFECIYAINSKKQGKEWSISMKIDMSKAYDRVEWSFLDKIMDRMGFSEIWRRR
VMNCITSVSYSVLVNGYPSEEFRPNRGIRQGDPLSPTFSSCVPKIFQLFLTGRNPTLISKVLKSIIVAPHYLICFFFADDSLIFFGALEKECTTIKKVLLTYEEALGQVV
NFEKSAFMTSKNVSSSLETKCGRILGVVSTKSLGNYLGMPYTNNKSKNRMFAKIKDSVWKVLQSWKEKLFLAGGKKVLIKAVAQAIPVYTMSCFRLPNGICKEINNICAK
FWWGSTSGKRKIHWKSWKFLCKSKKMGGLGFRDLLSFNQAMLAKLSWRVIKDPSSLLARTLKGKYFHDQSFLEASLEILPSLGGASCGEKTCSLKDIGGEWGMESTSRLI
KTLGSIEGRP