; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036439 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036439
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr3:46513516..46518055
RNA-Seq ExpressionLag0036439
SyntenyLag0036439
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]6.9e-7128.05Show/hide
Query:  SQLENFRSVMDDCGLQDLDYVGDSFTWSNRQLDVEQINERLDRFIANEEFCQLFPQLSMTHLDWTHSDHRPIMLHTLPQQSG---QWG------------
        SQ+E FR++MD C L D+ + G  FTW N +   +Q+ +RLDRF+ N+ F  +FP  S     W+++ H      ++   S     WG            
Subjt:  SQLENFRSVMDDCGLQDLDYVGDSFTWSNRQLDVEQINERLDRFIANEEFCQLFPQLSMTHLDWTHSDHRPIMLHTLPQQSG---QWG------------

Query:  --------------------MRTMETP-----RLEECLAKCKFRLQ--KWGKEQYFTSIYPDEATFDAALKEIHCKVSEEMNDKLLASFDREEVIRAVKQ
                            +  +E        LEE   K + R    KWG          +    +A +  I  +++ E+N++LLA + +EE+  A++Q
Subjt:  --------------------MRTMETP-----RLEECLAKCKFRLQ--KWGKEQYFTSIYPDEATFDAALKEIHCKVSEEMNDKLLASFDREEVIRAVKQ

Query:  MHLSKAPG---------------------------------------------PMVKQPKHVSDFRLISLCNVAYKIVAKVIVNRMNWILQDIILENQLA
        M  +KA G                                             P +KQP+ +SDFR ISLCNV+YKI++K I NR+  ++  +I + Q A
Subjt:  MHLSKAPG---------------------------------------------PMVKQPKHVSDFRLISLCNVAYKIVAKVIVNRMNWILQDIILENQLA

Query:  FVPGRSIFDNKIISHECLHTIKSRHTGRKGWVAFKLDMSKAYDRVEWSFLKVLMVKLGFTLGATTLLI--------------------------------
        FVP R+I DN II HECLHTI S  +G  G  A KLD+SKA+DRVEW++L+ +M K+GF  G    +I                                
Subjt:  FVPGRSIFDNKIISHECLHTIKSRHTGRKGWVAFKLDMSKAYDRVEWSFLKVLMVKLGFTLGATTLLI--------------------------------

Query:  ----------------------------------------------LIFCQATSVQVSALRDILHAYEMASGQVINVTNSTMYFSPNVSSNDRSMIQSI-
                                                      LIF ++   +  ALR +L +Y  ASGQ IN + S + FSPNV    +  +Q I 
Subjt:  ----------------------------------------------LIFCQATSVQVSALRDILHAYEMASGQVINVTNSTMYFSPNVSSNDRSMIQSI-

Query:  -------------------------------------------------MVNFNKALLAKQVWRLFTNPNLLASRVFNAR----------SLDSQRNFF-
                                                         +  FN+AL+AK VWR   +PNLL S+V   +          S +S+ ++F 
Subjt:  -------------------------------------------------MVNFNKALLAKQVWRLFTNPNLLASRVFNAR----------SLDSQRNFF-

Query:  -------------------QTVSHKKFTSQW-----------------DRVVADFITPSKGRDILKLREQVRDEDVRLITTIPISSINVEDRWVWHYSKN
                              + K F+  W                 D  VA FIT     D+  +     +ED  LI ++PISS N++D W+WHY K 
Subjt:  -------------------QTVSHKKFTSQW-----------------DRVVADFITPSKGRDILKLREQVRDEDVRLITTIPISSINVEDRWVWHYSKN

Query:  GNYIVKSGYKLAMLSSRSGGESSRKAMKVWWQSLWKSGLPTKIKLFIWKAYHECLPTNYCLWKRKILGRPTCPMCNSEIETINHALCGCIRAKSI
        GNY V+SGYKL M    +   +S       W S+WK  +PTKIK+FIW++ HE +PT   L  R I   P C +C    E+I HA   C RA+ I
Subjt:  GNYIVKSGYKLAMLSSRSGGESSRKAMKVWWQSLWKSGLPTKIKLFIWKAYHECLPTNYCLWKRKILGRPTCPMCNSEIETINHALCGCIRAKSI

XP_024035599.1 uncharacterized protein LOC112096407 [Citrus clementina]3.7e-6427.69Show/hide
Query:  FIVDCKGKSGGLCLMWKDEVDVRIRSYTLFHID------ASRKWE----------------------------------SKCGDLLAFTVSPGGVPVRDS
        F VD +GKSGGL ++W  + +V+I SY   HID        + W                                       ++L      GG+     
Subjt:  FIVDCKGKSGGLCLMWKDEVDVRIRSYTLFHID------ASRKWE----------------------------------SKCGDLLAFTVSPGGVPVRDS

Query:  QLENFRSVMDDCGLQDLDYVGDSFTWSNRQLDVEQINERLDRFIANEEFCQLFPQLSMTHLDWTHSDHRPIMLHTLPQQSG-------------------
         +  FR  + DC L DL   G SFTWSNRQ   + I E+LDRF+ N+E+   F     T+L    SDH P+M+    +Q                     
Subjt:  QLENFRSVMDDCGLQDLDYVGDSFTWSNRQLDVEQINERLDRFIANEEFCQLFPQLSMTHLDWTHSDHRPIMLHTLPQQSG-------------------

Query:  -------------QWGMRTMETPR--LEECLAKCKFRLQKWGKEQY-----------------------------------------------FTSIYPD
                     QW +R  E+P    +    K   RL  W KE++                                               FT+  P 
Subjt:  -------------QWGMRTMETPR--LEECLAKCKFRLQKWGKEQY-----------------------------------------------FTSIYPD

Query:  EATFDAALKEIHCKVSEEMNDKLLASFDREEVIRAVKQMHLSKAPGPMVKQ---------------------------------------------PKHV
            +AAL+ +  +V   MN  L   F  EE+  A+ QM  +KAPGP   Q                                             P+ V
Subjt:  EATFDAALKEIHCKVSEEMNDKLLASFDREEVIRAVKQMHLSKAPGPMVKQ---------------------------------------------PKHV

Query:  SDFRLISLCNVAYKIVAKVIVNRMNWILQDIILENQLAFVPGRSIFDNKIISHECLHTIKSRHTGRKGWVAFKLDMSKAYDRVEWSFLKVLMVKLGFTLG
        +++R ISLCNV Y +VAK I NR+   L  II   Q AFVP R I DN II +ECLH I+     +   VA KLD+ KAYDRVEWSFLK ++ +LGF+  
Subjt:  SDFRLISLCNVAYKIVAKVIVNRMNWILQDIILENQLAFVPGRSIFDNKIISHECLHTIKSRHTGRKGWVAFKLDMSKAYDRVEWSFLKVLMVKLGFTLG

Query:  ATTLLILIFCQAT-SVQVS-ALRDILHAYE-MASGQVINVTNSTMYFSPNVSSNDRSMIQ------------------SIMVNFNKALLAKQVWRLFTNP
           L++      T SV ++ A   ++H    +  G  +     + Y     +  D+  I                     +  FN+ALLAKQ WRL  NP
Subjt:  ATTLLILIFCQAT-SVQVS-ALRDILHAYE-MASGQVINVTNSTMYFSPNVSSNDRSMIQ------------------SIMVNFNKALLAKQVWRLFTNP

Query:  NLLASRVFNA---------------------RSLDSQRNFFQTVSHKKF---------TSQW-----------------DRVVADFITPSKGRDILKLRE
          L ++V  A                     RS+   R   Q  +  +           S W                    V++ I P++  +   + +
Subjt:  NLLASRVFNA---------------------RSLDSQRNFFQTVSHKKF---------TSQW-----------------DRVVADFITPSKGRDILKLRE

Query:  QVRDEDVRLITTIPISSINVEDRWVWHYSKNGNYIVKSGYKLAMLSSRSGGESSRKAMKVWWQSLWKSGLPTKIKLFIWKAYHECLPTNYCLWKRKILGR
             D  +I +I +     ED  +WHY + G Y VKSGY+LA+        +S       WQ+LWK  LP KIK+F+WKA    LPT   LW+RKIL  
Subjt:  QVRDEDVRLITTIPISSINVEDRWVWHYSKNGNYIVKSGYKLAMLSSRSGGESSRKAMKVWWQSLWKSGLPTKIKLFIWKAYHECLPTNYCLWKRKILGR

Query:  PTCPMCNSEIETINHALCGCIRAKSICDCTITRIDDQIMRMDQQL
        P C +C    E + HAL  C  A+ I  CT      Q +R D  L
Subjt:  PTCPMCNSEIETINHALCGCIRAKSICDCTITRIDDQIMRMDQQL

XP_024044510.1 uncharacterized protein LOC112100177 [Citrus clementina]8.7e-6624.97Show/hide
Query:  CFIVDCKGKSGGLCLMWKDEVDVRIRSYTLFHIDA------------------------SRKWE----------------SKCGDLLAFTVSPGGVPVRD
        CF V+ +G  GGL L+W +EV V I+SY+  H+DA                           WE                    ++L+     GG   R 
Subjt:  CFIVDCKGKSGGLCLMWKDEVDVRIRSYTLFHIDA------------------------SRKWE----------------SKCGDLLAFTVSPGGVPVRD

Query:  SQLENFRSVMDDCGLQDLDYVGDSFTWSNRQLDVEQINERLDRFIANEEFCQLFPQLSMTHLDWTHSDHRPIMLHTLPQQSGQWGMRTMETPRLEECLAK
        +++  FR  + DC L DL   G  FTWSNR+     I E+LDRF+ N ++   + + + ++L    SDH PIM+        + G R  +    +    +
Subjt:  SQLENFRSVMDDCGLQDLDYVGDSFTWSNRQLDVEQINERLDRFIANEEFCQLFPQLSMTHLDWTHSDHRPIMLHTLPQQSGQWGMRTMETPRLEECLAK

Query:  CKFRLQ-------KWGKE---------QYFTSIY----PDEATFDAALKEIHCKVSEEMNDKLLASFDREEVIRAVKQMHLSKAPG--------------
         K R+Q        W ++          YFT+I+    P     +AAL+ +  KVS EMN++L   F +EE+  A+ QM  +KAPG              
Subjt:  CKFRLQ-------KWGKE---------QYFTSIY----PDEATFDAALKEIHCKVSEEMNDKLLASFDREEVIRAVKQMHLSKAPG--------------

Query:  -------------------------------PMVKQPKHVSDFRLISLCNVAYKIVAKVIVNRMNWILQDIILENQLAFVPGRSIFDNKIISHECLHTIK
                                       P V +P+ V++FR I+LCNV Y+IVAK I NR+  +L DII  NQ AFVP R I DN II +ECL+ I+
Subjt:  -------------------------------PMVKQPKHVSDFRLISLCNVAYKIVAKVIVNRMNWILQDIILENQLAFVPGRSIFDNKIISHECLHTIK

Query:  SRHTGRKGWVAFKLDMSKAYDRVEWSFLKVLMVKLGFTLGATTLLI------------------------------------------------------
             R G +A KLD+SKAYDRVEWSF+K  M KLGF++    L++                                                      
Subjt:  SRHTGRKGWVAFKLDMSKAYDRVEWSFLKVLMVKLGFTLGATTLLI------------------------------------------------------

Query:  -----------------------LIFCQATSVQVSALRDILHAYEMASGQVINVTNSTMYFSPNVSS---------------------------------
                               L+F +A++ +   L+ +   Y  ASGQ+ N   S+M+FS + S                                  
Subjt:  -----------------------LIFCQATSVQVSALRDILHAYEMASGQVINVTNSTMYFSPNVSS---------------------------------

Query:  ---ND--------------------------RSMIQSI---------------------------------------------------------MVNFN
           ND                          +++ Q++                                                         +  FN
Subjt:  ---ND--------------------------RSMIQSI---------------------------------------------------------MVNFN

Query:  KALLAKQVWRLFTNPNLLASRVFNAR----------SLDSQRNFF-------QTVSHKKFTSQWDRVVADFITPSKGRDILKLREQVRDEDVRLITTIPI
        +AL+AKQ WR+   P+ L +++  A+           L S  +F        + +  +  +   + VVAD I      +  ++ +   +     I  +P+
Subjt:  KALLAKQVWRLFTNPNLLASRVFNAR----------SLDSQRNFF-------QTVSHKKFTSQWDRVVADFITPSKGRDILKLREQVRDEDVRLITTIPI

Query:  SSINVEDRWVWHYSKNGNYIVKSGYKLAMLSSRSGGESSRKAMKVWWQSLWKSGLPTKIKLFIWKAYHECLPTNYCLWKRKILGRPTCPMCNSEIETINH
          I   D  +W + K+G Y  KSGY++A+      G S   + K  W  +W + LP K+K+F+W+A    LPT   LWKRKI+  P C  C    E + H
Subjt:  SSINVEDRWVWHYSKNGNYIVKSGYKLAMLSSRSGGESSRKAMKVWWQSLWKSGLPTKIKLFIWKAYHECLPTNYCLWKRKILGRPTCPMCNSEIETINH

Query:  ALCGC
        AL  C
Subjt:  ALCGC

XP_042942906.1 uncharacterized protein LOC122277092 [Carya illinoinensis]7.6e-6225.1Show/hide
Query:  LEWCFIVDCKGKSGGLCLMWKDEVDVRIRSYTLFHI---------------------DASRK---W-----------ESKC-----GDLLAFTVSPGGVP
        +E CF+VD  GK GGL L WK + +V I +Y+ +H+                     DA+++   W           E+ C      ++L+     GG  
Subjt:  LEWCFIVDCKGKSGGLCLMWKDEVDVRIRSYTLFHI---------------------DASRK---W-----------ESKC-----GDLLAFTVSPGGVP

Query:  VRDSQLENFRSVMDDCGLQDLDYVGDSFTWSNRQLDVEQINERLDRFIANEEFCQLFPQLSMTHLDWTHSDHRPIML-----------------------
          +SQ+  FR  ++  GL D+ YVG+ FTWSN   D+    ERLDRF+ N  +   F ++ +  L    SDH+P++L                       
Subjt:  VRDSQLENFRSVMDDCGLQDLDYVGDSFTWSNRQLDVEQINERLDRFIANEEFCQLFPQLSMTHLDWTHSDHRPIML-----------------------

Query:  -------------------------------HTLPQQSGQWGMRT---------METPRLEECLAKCKFR-----------LQKWGKEQ-------YFTS
                                         L Q+  +W  R            T     C ++ K +           + + G E+       +F+ 
Subjt:  -------------------------------HTLPQQSGQWGMRT---------METPRLEECLAKCKFR-----------LQKWGKEQ-------YFTS

Query:  IYPDE----ATFDAALKEIHCKVSEEMNDKLLASFDREEVIRAVKQMHLSKAPG---------------------------------------------P
        +Y  E    +  +  ++ I  +V+  M ++L   F REE+  A+KQM   K+PG                                             P
Subjt:  IYPDE----ATFDAALKEIHCKVSEEMNDKLLASFDREEVIRAVKQMHLSKAPG---------------------------------------------P

Query:  MVKQPKHVSDFRLISLCNVAYKIVAKVIVNRMNWILQDIILENQLAFVPGRSIFDNKIISHECLHTIKSRHTGRKGWVAFKLDMSKAYDRVEWSFLKVLM
         + +PK V DFR ISLCNV YK++AK + NR+  +LQ+II +NQ AF+PGR I DN I ++E LH++K+R  GR G +A KLD+SKAYD+VEW FLK +M
Subjt:  MVKQPKHVSDFRLISLCNVAYKIVAKVIVNRMNWILQDIILENQLAFVPGRSIFDNKIISHECLHTIKSRHTGRKGWVAFKLDMSKAYDRVEWSFLKVLM

Query:  VKLGFTLGATTLLI------------------------------------------------------------------------------LIFCQATS
         KLGF      L++                                                                              +IF +A  
Subjt:  VKLGFTLGATTLLI------------------------------------------------------------------------------LIFCQATS

Query:  VQVSALRDILHAYEMASGQVINVTNSTMYFSPN----------------VSSN-----------------------DR----------------------
         +   + DIL  YE ASGQ +N   +++ FS N                V +N                       DR                      
Subjt:  VQVSALRDILHAYEMASGQVINVTNSTMYFSPN----------------VSSN-----------------------DR----------------------

Query:  -SMIQSI---------------------MVNFNKALLAKQVWRLFTNPNLLASRVF------NARSLDSQRNFFQTV-----------------------
         +++Q+I                     + +FNKALLAKQVWR+ TNP+ +A+R+       N   L+++   + ++                       
Subjt:  -SMIQSI---------------------MVNFNKALLAKQVWRLFTNPNLLASRVF------NARSLDSQRNFFQTV-----------------------

Query:  -SHKKFTSQW------------------DRVVADFITPSKGR---DILKLREQVRDEDVRLITTIPISSINVEDRWVWHYSKNGNYIVKSGYKLAMLSSR
         S K +  +W                  +  V   I    GR   D++K  E   +E+  LI   PIS  N++D+ +W  SK G Y VKS Y   +   R
Subjt:  -SHKKFTSQW------------------DRVVADFITPSKGR---DILKLREQVRDEDVRLITTIPISSINVEDRWVWHYSKNGNYIVKSGYKLAMLSSR

Query:  -SGGESS-RKAMKVWWQSLWKSGLPTKIKLFIWKAYHECLPTNYCLWKRKILGRPTCPMCNSEIETINHALCGCIRAKSI
          GGESS        W+S+W+   P  IK+F+WKA + CLPT + L+KR++   P CP+C ++ ET+ H L  C  A  +
Subjt:  -SGGESS-RKAMKVWWQSLWKSGLPTKIKLFIWKAYHECLPTNYCLWKRKILGRPTCPMCNSEIETINHALCGCIRAKSI

XP_042950087.1 uncharacterized protein LOC122282202 [Carya illinoinensis]3.8e-6125.12Show/hide
Query:  CFIVDCKGKSGGLCLMWKDEVDVRIRSYTLFHIDASRK------------------------WE----------------SKCGDLLAFTVSPGGVPVRD
        CF VD  G+SGGL L WK+++++RI SY+  HI AS K                        W                     ++L  +   GG    D
Subjt:  CFIVDCKGKSGGLCLMWKDEVDVRIRSYTLFHIDASRK------------------------WE----------------SKCGDLLAFTVSPGGVPVRD

Query:  SQLENFRSVMDDCGLQDLDYVGDSFTWSNRQLDVEQINERLDRFIANEEFCQLFPQLSMTHLDWTHSDHRPIMLHT------------------------
         Q+  FR V+ DC L+DL YVG  FTWSNR+ + + + ERLDRF+AN  +C +FP L +TH    +SDH P+ L T                        
Subjt:  SQLENFRSVMDDCGLQDLDYVGDSFTWSNRQLDVEQINERLDRFIANEEFCQLFPQLSMTHLDWTHSDHRPIMLHT------------------------

Query:  -----------------LPQQSGQWGMRTMETPR--------LEECLAKCKFRL-----------------------QKWGKE-----------------
                         L Q  G++     E  R        +++ LA  K RL                       QKW +                  
Subjt:  -----------------LPQQSGQWGMRTMETPR--------LEECLAKCKFRL-----------------------QKWGKE-----------------

Query:  --QYFTSIY--PDEATFDAALKEIHCKVSEEMNDKLLASFDREEVIRAVKQMHLSKAPG-----------------------------------------
          +YF +++   D    +  L  +  +V+ EMN++LL  +  EEV  A+KQMH S+APG                                         
Subjt:  --QYFTSIY--PDEATFDAALKEIHCKVSEEMNDKLLASFDREEVIRAVKQMHLSKAPG-----------------------------------------

Query:  ----PMVKQPKHVSDFRLISLCNVAYKIVAKVIVNRMNWILQDIILENQLAFVPGRSIFDNKIISHECLHTIKSRHTGRKGWVAFKLDMSKAYDR-VEWS
            P    P  V+DFR ISLCNV YKI++KVI NR+  +L  II  +Q AFVPGR I DN +I++E LH ++++  GRKG+++ KLDMSKAYDR V   
Subjt:  ----PMVKQPKHVSDFRLISLCNVAYKIVAKVIVNRMNWILQDIILENQLAFVPGRSIFDNKIISHECLHTIKSRHTGRKGWVAFKLDMSKAYDR-VEWS

Query:  FLKVLM-------------VKLGFTLGATTLLI---------------------------------------LIFCQATSVQVSALRDILHAYEMASGQV
           VL+             ++ G  L     L+                                       +IFC+A  +    ++  L+ YE ASGQ 
Subjt:  FLKVLM-------------VKLGFTLGATTLLI---------------------------------------LIFCQATSVQVSALRDILHAYEMASGQV

Query:  INVTNSTMYFSPNVSSNDRSMIQSI-----------------MVN-------------------------------------------------------
        IN   ++M FS NV+ N +  I  +                 MV                                                        
Subjt:  INVTNSTMYFSPNVSSNDRSMIQSI-----------------MVN-------------------------------------------------------

Query:  -----FNKALLAKQVWRLFTNPNLLASRVFNARSLDSQRNFFQTV---SHKKFTSQWD-------------------RVVADFITPS--------KGRDI
             FN ALLAKQ WRL  N + L  +V+ A+   +   F   V   S   +   W+                   R+  D   P         +  + 
Subjt:  -----FNKALLAKQVWRLFTNPNLLASRVFNARSLDSQRNFFQTV---SHKKFTSQWD-------------------RVVADFITPS--------KGRDI

Query:  LKLREQVRDED----------------VRLITTIPISSINVEDRWVWHYSKNGNYIVKSGYKLAMLSSR--SGGESSRKAMKVWWQSLWKSGLPTKIKLF
        LK+   + D                  V+ I  + + SI+ ED   W + KNG++ V+S Y+    +    +G  S+ +   ++W+SLW   LP K+K+F
Subjt:  LKLREQVRDED----------------VRLITTIPISSINVEDRWVWHYSKNGNYIVKSGYKLAMLSSR--SGGESSRKAMKVWWQSLWKSGLPTKIKLF

Query:  IWKAYHECLPTNYCLWKRKILGRPTCPMCNSEIETINHALCGCIRAKSICDCTITRID---------DQIMRMDQQLLEGNNTE--EEFRWKYQHSLNSK
         W+A  E LPT   L K+ +L   TC +CN  +E   HAL  C   +S+     +++D         D       QL + +N +  +  RW         
Subjt:  IWKAYHECLPTNYCLWKRKILGRPTCPMCNSEIETINHALCGCIRAKSICDCTITRID---------DQIMRMDQQLLEGNNTE--EEFRWKYQHSLNSK

Query:  KSIVRVYTDATVIPNRSGAGYGVAIVGE-GNVLRCAMEMVEDVDLNPLAAEVQAILQALRLIQRLQYQEPK
           +++  D    P  S A  GV +  + G V+    ++ ++V     +AE    ++A+ L+   Q+  PK
Subjt:  KSIVRVYTDATVIPNRSGAGYGVAIVGE-GNVLRCAMEMVEDVDLNPLAAEVQAILQALRLIQRLQYQEPK

TrEMBL top hitse value%identityAlignment
A0A2N9FTN8 Uncharacterized protein3.1e-6928.18Show/hide
Query:  IVDCKGKSGGLCLMWKDEVDVRIRSYTLFHIDA------SRKW-------------ESKCGDLLAFTVSPGGVP----------VR-----------DSQ
        + + + K GGLCL WK E+++R+ S++  HIDA      +  W               +  DLL    S   +P          VR           D Q
Subjt:  IVDCKGKSGGLCLMWKDEVDVRIRSYTLFHIDA------SRKW-------------ESKCGDLLAFTVSPGGVP----------VR-----------DSQ

Query:  LENFRSVMDDCGLQDLDYVGDSFTWSNRQLDVEQINERLDRFIANEEFCQLFPQLSMTHLDWTHSDHRPI---------------------MLHTLPQQS
        ++ F+ V+DDCG  DL ++G  FTW+N +   +   ERLDR +A  E+   FP   + HLDW  SDH+PI                      +H   +  
Subjt:  LENFRSVMDDCGLQDLDYVGDSFTWSNRQLDVEQINERLDRFIANEEFCQLFPQLSMTHLDWTHSDHRPI---------------------MLHTLPQQS

Query:  GQWG-----------------MRTMETPRLE----ECLAKCKFRLQK-WGKE----------QYFTSIYPDEATFDAALKEIHCKVSEEMNDKLLASFDR
         +W                  ++T E   ++      +A  K RL   W K           ++  +  P+    +  ++ I   V+ EMN+ L   F  
Subjt:  GQWG-----------------MRTMETPRLE----ECLAKCKFRLQK-WGKE----------QYFTSIYPDEATFDAALKEIHCKVSEEMNDKLLASFDR

Query:  EEVIRAVKQMHLSKAPG---------------------------------------------PMVKQPKHVSDFRLISLCNVAYKIVAKVIVNRMNWILQ
        +EV  A+KQM   KAPG                                             P VK P+ V +FR ISLCNV YK+ +KV+ NR+  IL 
Subjt:  EEVIRAVKQMHLSKAPG---------------------------------------------PMVKQPKHVSDFRLISLCNVAYKIVAKVIVNRMNWILQ

Query:  DIILENQLAFVPGRSIFDNKIISHECLHTIKSRHTGRKGWVAFKLDMSKAYDRVEWSFLKVLMVKLGFTLGATTLL----------ILIF---------C
         I+ E+Q AFVPGR I DN +++ E LH +  +  G+ G +A KLDMSKAYDRVEW +LK +M K+GF     TL+          IL+           
Subjt:  DIILENQLAFVPGRSIFDNKIISHECLHTIKSRHTGRKGWVAFKLDMSKAYDRVEWSFLKVLMVKLGFTLGATTLL----------ILIF---------C

Query:  QATSVQVSALRDILHAYEMASGQVINVTNSTMYFSPNVSSNDRSMIQSI------------------------------------------------MVN
        +AT+  VS ++ IL  YE ASGQ IN   +T++FS + S  D++ I  +                                                +  
Subjt:  QATSVQVSALRDILHAYEMASGQVINVTNSTMYFSPNVSSNDRSMIQSI------------------------------------------------MVN

Query:  FNKALLAKQVWRLFTNPNLLASRVFNAR------SLDSQRNFFQTVSHKKFTSQ---------WD-------RVVADFITPSKGRDIL-----------K
        FN+ALLAKQVWRL  N + L  +VF A+       LD+Q N   + + K              W        R+  D   P +  +I+            
Subjt:  FNKALLAKQVWRLFTNPNLLASRVFNAR------SLDSQRNFFQTVSHKKFTSQ---------WD-------RVVADFITPSKGRDIL-----------K

Query:  LREQVRDE----------------DVRLITTIPISSINVEDRWVWHYSKNGNYIVKSGYKLAMLSS--RSGGESSRKAMKVWWQSLWKSGLPTKIKLFIW
        +++ + D+                +  +I  I +SS N +D   W  +KNG Y V+SGY L +  S     G S    +   W  +W   +P KI+ F+W
Subjt:  LREQVRDE----------------DVRLITTIPISSINVEDRWVWHYSKNGNYIVKSGYKLAMLSS--RSGGESSRKAMKVWWQSLWKSGLPTKIKLFIW

Query:  KAYHECLPTNYCLWKRKILGRPTCPMCNSEIETINHALCGC
        +A HE LPT   L  R +   P C  CN+ +E+  HAL  C
Subjt:  KAYHECLPTNYCLWKRKILGRPTCPMCNSEIETINHALCGC

A0A2N9G7B6 Uncharacterized protein1.2e-6826.57Show/hide
Query:  FIVDCKGKSGGLCLMWKDEVDVRIRSYTLFHIDA------------------------SRKWE----------------SKCGDLLAFTVSPGGVPVRDS
        FI   + K GGLCL+WK E+++R+ S++  HIDA                           W                     +L+      G     + 
Subjt:  FIVDCKGKSGGLCLMWKDEVDVRIRSYTLFHIDA------------------------SRKWE----------------SKCGDLLAFTVSPGGVPVRDS

Query:  QLENFRSVMDDCGLQDLDYVGDSFTWSNRQLDVEQINERLDRFIANEEFCQLFPQLSMTHLDWTHSDHRPIMLHTLPQ----------------------
        Q++ FR V+D+C   DL + G  FTW+N +   +   ERLDR +A  ++   FP   + HL+   SDH+P+ + T P                       
Subjt:  QLENFRSVMDDCGLQDLDYVGDSFTWSNRQLDVEQINERLDRFIANEEFCQLFPQLSMTHLDWTHSDHRPIMLHTLPQ----------------------

Query:  -QSGQWGMRTMETPRLEECLAKCKFRLQKWGKEQYFTSI-------------------YPDEAT-----FDAALKEIHCKVSEEMNDKLLASFDREEVIR
             W       P   E L K    +   G++ +  S+                   +P   T      D  L  I   V+EEMN++L+  F  EEV  
Subjt:  -QSGQWGMRTMETPRLEECLAKCKFRLQKWGKEQYFTSI-------------------YPDEAT-----FDAALKEIHCKVSEEMNDKLLASFDREEVIR

Query:  AVKQMHLSKAPG---------------------------------------------PMVKQPKHVSDFRLISLCNVAYKIVAKVIVNRMNWILQDIILE
        A+KQM   KAPG                                             P V+ P+ V +FR ISLCNV YKI++KV+ NR+  +L  I+ E
Subjt:  AVKQMHLSKAPG---------------------------------------------PMVKQPKHVSDFRLISLCNVAYKIVAKVIVNRMNWILQDIILE

Query:  NQLAFVPGRSIFDNKIISHECLHTIKSRHTGRKGWVAFKLDMSKAYDRVEWSFLKVLMVKLGFTLGATTLLI----------------------------
        +Q AF+PGR I DN +++ E LH ++ + TG+ G +A KLDMSKAYDRVEW +LK +M K+GF     T+L+                            
Subjt:  NQLAFVPGRSIFDNKIISHECLHTIKSRHTGRKGWVAFKLDMSKAYDRVEWSFLKVLMVKLGFTLGATTLLI----------------------------

Query:  ----------------------LIFCQATSVQVSALRDILHAYEMASGQVI---NVTNSTMYFSPNVSSNDRSMIQSIMV--------------------
                              L+FC+AT+  V+ ++DIL  YE ASG+ I   +V  +   ++ +       +I+ I +                    
Subjt:  ----------------------LIFCQATSVQVSALRDILHAYEMASGQVI---NVTNSTMYFSPNVSSNDRSMIQSIMV--------------------

Query:  -------------------NFNKALLAKQVWRLFTNPNLLASRVFNARSLDSQRNFFQTVSHKKFTSQWDRVVA--DFITPSK------GRDILKLR---
                            FN+ALLAKQVWRL  NP+ L  +VF A+   S+ + F+     K +  W  +++  D IT         G+DI   R   
Subjt:  -------------------NFNKALLAKQVWRLFTNPNLLASRVFNARSLDSQRNFFQTVSHKKFTSQWDRVVA--DFITPSK------GRDILKLR---

Query:  ----------------------EQVRDEDVR-----------------LITTIPISSINVEDRWVWHYSKNGNYIVKSGYKLAM--LSSRSGGESSRKAM
                              E + D D++                 +I  IP+S  N  D  +W  +K G Y V+SGY L +   +    G S    M
Subjt:  ----------------------EQVRDEDVR-----------------LITTIPISSINVEDRWVWHYSKNGNYIVKSGYKLAM--LSSRSGGESSRKAM

Query:  KVWWQSLWKSGLPTKIKLFIWKAYHECLPTNYCLWKRKILGRPTCPMCNSEIETINHALCGC
           W+++W   +P+KI+ F+W+A H  LPT   L  R +L  P C  C  +IE+  H L  C
Subjt:  KVWWQSLWKSGLPTKIKLFIWKAYHECLPTNYCLWKRKILGRPTCPMCNSEIETINHALCGC

A0A6J1DX30 uncharacterized protein LOC1110248743.3e-7128.05Show/hide
Query:  SQLENFRSVMDDCGLQDLDYVGDSFTWSNRQLDVEQINERLDRFIANEEFCQLFPQLSMTHLDWTHSDHRPIMLHTLPQQSG---QWG------------
        SQ+E FR++MD C L D+ + G  FTW N +   +Q+ +RLDRF+ N+ F  +FP  S     W+++ H      ++   S     WG            
Subjt:  SQLENFRSVMDDCGLQDLDYVGDSFTWSNRQLDVEQINERLDRFIANEEFCQLFPQLSMTHLDWTHSDHRPIMLHTLPQQSG---QWG------------

Query:  --------------------MRTMETP-----RLEECLAKCKFRLQ--KWGKEQYFTSIYPDEATFDAALKEIHCKVSEEMNDKLLASFDREEVIRAVKQ
                            +  +E        LEE   K + R    KWG          +    +A +  I  +++ E+N++LLA + +EE+  A++Q
Subjt:  --------------------MRTMETP-----RLEECLAKCKFRLQ--KWGKEQYFTSIYPDEATFDAALKEIHCKVSEEMNDKLLASFDREEVIRAVKQ

Query:  MHLSKAPG---------------------------------------------PMVKQPKHVSDFRLISLCNVAYKIVAKVIVNRMNWILQDIILENQLA
        M  +KA G                                             P +KQP+ +SDFR ISLCNV+YKI++K I NR+  ++  +I + Q A
Subjt:  MHLSKAPG---------------------------------------------PMVKQPKHVSDFRLISLCNVAYKIVAKVIVNRMNWILQDIILENQLA

Query:  FVPGRSIFDNKIISHECLHTIKSRHTGRKGWVAFKLDMSKAYDRVEWSFLKVLMVKLGFTLGATTLLI--------------------------------
        FVP R+I DN II HECLHTI S  +G  G  A KLD+SKA+DRVEW++L+ +M K+GF  G    +I                                
Subjt:  FVPGRSIFDNKIISHECLHTIKSRHTGRKGWVAFKLDMSKAYDRVEWSFLKVLMVKLGFTLGATTLLI--------------------------------

Query:  ----------------------------------------------LIFCQATSVQVSALRDILHAYEMASGQVINVTNSTMYFSPNVSSNDRSMIQSI-
                                                      LIF ++   +  ALR +L +Y  ASGQ IN + S + FSPNV    +  +Q I 
Subjt:  ----------------------------------------------LIFCQATSVQVSALRDILHAYEMASGQVINVTNSTMYFSPNVSSNDRSMIQSI-

Query:  -------------------------------------------------MVNFNKALLAKQVWRLFTNPNLLASRVFNAR----------SLDSQRNFF-
                                                         +  FN+AL+AK VWR   +PNLL S+V   +          S +S+ ++F 
Subjt:  -------------------------------------------------MVNFNKALLAKQVWRLFTNPNLLASRVFNAR----------SLDSQRNFF-

Query:  -------------------QTVSHKKFTSQW-----------------DRVVADFITPSKGRDILKLREQVRDEDVRLITTIPISSINVEDRWVWHYSKN
                              + K F+  W                 D  VA FIT     D+  +     +ED  LI ++PISS N++D W+WHY K 
Subjt:  -------------------QTVSHKKFTSQW-----------------DRVVADFITPSKGRDILKLREQVRDEDVRLITTIPISSINVEDRWVWHYSKN

Query:  GNYIVKSGYKLAMLSSRSGGESSRKAMKVWWQSLWKSGLPTKIKLFIWKAYHECLPTNYCLWKRKILGRPTCPMCNSEIETINHALCGCIRAKSI
        GNY V+SGYKL M    +   +S       W S+WK  +PTKIK+FIW++ HE +PT   L  R I   P C +C    E+I HA   C RA+ I
Subjt:  GNYIVKSGYKLAMLSSRSGGESSRKAMKVWWQSLWKSGLPTKIKLFIWKAYHECLPTNYCLWKRKILGRPTCPMCNSEIETINHALCGCIRAKSI

A0A803P8W5 Uncharacterized protein4.8e-6225.21Show/hide
Query:  GKSGGLCLMWKDEVDVRIRSYTLFHIDASRKWE-----------------------------------------SKCGDLLAFTVSPGGVPVRDSQLENF
        G  GGL L+WKD++DV + S+++ H D    ++                                             +LL+     GG   R+S +E F
Subjt:  GKSGGLCLMWKDEVDVRIRSYTLFHIDASRKWE-----------------------------------------SKCGDLLAFTVSPGGVPVRDSQLENF

Query:  RSVMDDCGLQDLDYVGDSFTWSNRQLDVEQINERLDRFIANEEF---CQLFPQLSMTHL------------------------DWTHSDHRPIMLHTLPQ
        R  +D C L  L Y GD +TW+NR      I ERLDR   N  +     +    ++T+L                        D   +  R   LH    
Subjt:  RSVMDDCGLQDLDYVGDSFTWSNRQLDVEQINERLDRFIANEEF---CQLFPQLSMTHL------------------------DWTHSDHRPIMLHTLPQ

Query:  QSGQWGMRTMETPRLEECL------------------------------AKCKFRLQKW-------GKEQY----------------FTSIYPDEATFDA
         S     + ++  ++ + L                              AK +++  K        G+  Y                FT+   D      
Subjt:  QSGQWGMRTMETPRLEECL------------------------------AKCKFRLQKW-------GKEQY----------------FTSIYPDEATFDA

Query:  ALKEIHCKVSEEMNDKLLASFDREEVIRAVKQMHLSKAPG-------------------------------------------PMVKQPKHVSDFRLISL
         +  I+  ++EEMN  LLA F R +V  A+  M   K+PG                                           P V QP+ + D+R ISL
Subjt:  ALKEIHCKVSEEMNDKLLASFDREEVIRAVKQMHLSKAPG-------------------------------------------PMVKQPKHVSDFRLISL

Query:  CNVAYKIVAKVIVNRMNWILQDIILENQLAFVPGRSIFDNKIISHECLHTIKSRHTGRKGWVAFKLDMSKAYDRVEWSFLKVLMVKLGFTLGATTLLI--
        C+V YK+V+K IVNR   +L   I +NQ AF+PGR I DN +++ E +H +K++  GR+ + A KLDMSKA+D+VEW FL+ +M+K+GF +    L++  
Subjt:  CNVAYKIVAKVIVNRMNWILQDIILENQLAFVPGRSIFDNKIISHECLHTIKSRHTGRKGWVAFKLDMSKAYDRVEWSFLKVLMVKLGFTLGATTLLI--

Query:  ---------------------------------------------------LIFCQATSVQVSALRDILHAYEMASGQVINVTNSTMYFSPNVSSNDRSM
                                                           L+FCQA      A+R +L  Y  ASGQ++N   + M FSPNV+   ++ 
Subjt:  ---------------------------------------------------LIFCQATSVQVSALRDILHAYEMASGQVINVTNSTMYFSPNVSSNDRSM

Query:  IQSIM---------------------------VNFNKALLAKQVWRLFTNPNLLASRVFNARSLDSQRNFFQTV------------SHKKFTSQWDRV--
          +++                           +N +K  L    W+     ++   R    + L  +  F   V            +   +T Q D +  
Subjt:  IQSIM---------------------------VNFNKALLAKQVWRLFTNPNLLASRVFNARSLDSQRNFFQTV------------SHKKFTSQWDRV--

Query:  VADFITPSKGRDILKLREQVRDEDVRLITTIPISSINVEDRWVWHYSKNGNYIVKSGYKLAMLSSRSGGESSRKAMKVWWQSLWKSGLPTKIKLFIWKAY
        V  +IT  +  D+  L++     D+  I  I IS    EDR +WH+S+ G+Y VKSGY LA    +    S+  +   WW+ LW   LP K+K F+W+  
Subjt:  VADFITPSKGRDILKLREQVRDEDVRLITTIPISSINVEDRWVWHYSKNGNYIVKSGYKLAMLSSRSGGESSRKAMKVWWQSLWKSGLPTKIKLFIWKAY

Query:  HECLPTNYCLWKRKILGRPTCPMCNSEIETINHALCGCIRA
        ++ LPT   L  RK+L    C +CN   ET++HAL  C RA
Subjt:  HECLPTNYCLWKRKILGRPTCPMCNSEIETINHALCGCIRA

A0A803Q852 Uncharacterized protein1.2e-6025.86Show/hide
Query:  CFIVDCKGKSGGLCLMWKDEVDVRIRSYTLFHIDA------SRKWESKC----------------------------------GDLLAFTVSPGGVPVRD
        CF+V+ KGKSGGL LMW D++   I S++ FHID+       + W   C                                   ++L      GG     
Subjt:  CFIVDCKGKSGGLCLMWKDEVDVRIRSYTLFHIDA------SRKWESKC----------------------------------GDLLAFTVSPGGVPVRD

Query:  SQLENFRSVMDDCGLQDLDYVGDSFTWSNRQLDVEQINERLDRFIANEEFCQLFPQLSMTHLDWTHSDHRPIMLHTLPQQ--------------------
          + NFR  +DDC L+++++ G+ FTW N + D   I ERLDR   N ++  +FP   + HL+  +SDH P++L    QQ                    
Subjt:  SQLENFRSVMDDCGLQDLDYVGDSFTWSNRQLDVEQINERLDRFIANEEFCQLFPQLSMTHLDWTHSDHRPIMLHTLPQQ--------------------

Query:  ---------SGQWGMRTMETPR--------LEECLAKCKFRLQKWGKE--QYFTSIYPDEATFDAALKEIHC----KVSEEMNDKLLASFDREEVIRAVK
                 +  WG+      R        L +  A+  +  +  G+    YF  ++  +      LKE       K+S E N+ L+A F +++V++A++
Subjt:  ---------SGQWGMRTMETPR--------LEECLAKCKFRLQKWGKE--QYFTSIYPDEATFDAALKEIHC----KVSEEMNDKLLASFDREEVIRAVK

Query:  QMHLSKAPG---------------------------------------------PMVKQPKHVSDFRLISLCNVAYKIVAKVIVNRMNWILQDIILENQL
         +H  KAPG                                             P + +P  +S+FR ISLCNV YKIVAK +  RM   L   I E Q 
Subjt:  QMHLSKAPG---------------------------------------------PMVKQPKHVSDFRLISLCNVAYKIVAKVIVNRMNWILQDIILENQL

Query:  AFVPGRSIFDNKIISHECLHTIKSRHTGRKGWVAFKLDMSKAYDRVEWSFLKVLMVKLGF----------TLGATTLLILIFCQATSVQVSALRDILHAY
        AFV GR I DN II +E LH++K +  G    +A KLDMSKAYD VEWSFL  +M  LG+           + + + L+LI  +         R +  A 
Subjt:  AFVPGRSIFDNKIISHECLHTIKSRHTGRKGWVAFKLDMSKAYDRVEWSFLKVLMVKLGF----------TLGATTLLILIFCQATSVQVSALRDILHAY

Query:  ----EMASGQVINVTNSTMYFSPNVSSNDRSMIQSIMVNFNKALLAKQVWRLFTNPNLLASRVF------------------------------------
            +    + ++ +       P     +  +    +  FN+ALLAKQ WRL   P+ L +RV                                     
Subjt:  ----EMASGQVINVTNSTMYFSPNVSSNDRSMIQSIMVNFNKALLAKQVWRLFTNPNLLASRVF------------------------------------

Query:  ------NARSLDSQRNFF-----QTVSHKKFTSQWDRVVADFITPSKGRDILKLREQVRDEDVRLITTIPISSINVEDRWVWHYSKNGNYIVKSGYKLAM
              N R++   ++ +      T++H+   +  +  V+  +   +  +   L +    EDV  I  IPI  I+ ED  VW ++K+G+YIVKSGY++A 
Subjt:  ------NARSLDSQRNFF-----QTVSHKKFTSQWDRVVADFITPSKGRDILKLREQVRDEDVRLITTIPISSINVEDRWVWHYSKNGNYIVKSGYKLAM

Query:  -LSSRSGGESSRKAMKVWWQSLWKSGLPTKIKLFIWKAYHECLPTNYCLWKRKILGRPTCPMCNSEIETINHALCGCIRAKSICD-CTITRIDDQIMRMD
         ++      S+   +  WW+  W   LP ++KLF WK     LP    L  R +    TC  C    E+++HAL  C + K +       ++  +     
Subjt:  -LSSRSGGESSRKAMKVWWQSLWKSGLPTKIKLFIWKAYHECLPTNYCLWKRKILGRPTCPMCNSEIETINHALCGCIRAKSICD-CTITRIDDQIMRMD

Query:  QQLLEGNNTEEEFRWKYQHSLNSKK-SIVRVYTDATVIPNRSGAGYG-VAIVGEGNVLRCAMEMVEDVDLNPLAAEVQAILQALRLIQRLQYQEPKCVQT
           L  +  ++  R +++  + +       V+ DA + P+++G G G +     GN++   M  +      P+ AE +A++ AL+        +PK +Q+
Subjt:  QQLLEGNNTEEEFRWKYQHSLNSKK-SIVRVYTDATVIPNRSGAGYG-VAIVGEGNVLRCAMEMVEDVDLNPLAAEVQAILQALRLIQRLQYQEPKCVQT

Query:  P
        P
Subjt:  P

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein3.4e-0426.49Show/hide
Query:  DAALKEIHCKVSEEMND---KLLASFDREEVI-RAVKQMHLSKAPGPMVKQPKHVSDFRLISLCNVAYKIVAKVIVNRMNWILQDIILENQLAFVPGRSI
        D    E + +  EE+     KL  S ++E ++  +  +  +   P P     K   +FR ISL N+  KI+ K++ NR+   ++ +I  +Q+ F+PG   
Subjt:  DAALKEIHCKVSEEMND---KLLASFDREEVI-RAVKQMHLSKAPGPMVKQPKHVSDFRLISLCNVAYKIVAKVIVNRMNWILQDIILENQLAFVPGRSI

Query:  FDNKIISHECLHTIKSRHTGRKGWVAFKLDMSKAYDRVEWSFLKVLMVKLG
        + N   S   +  I       K  V   +D  KA+D+++  F+   + KLG
Subjt:  FDNKIISHECLHTIKSRHTGRKGWVAFKLDMSKAYDRVEWSFLKVLMVKLG

P0C2F6 Putative ribonuclease H protein At1g657503.3e-0724.48Show/hide
Query:  DRVVA-DFITPSKGRDILKLREQVRDEDVRLITTIPISSI-NVEDRWVWHYSKNGNYIVKSGYKLAMLSSRSGGESSRKAMKVWWQSLWKSGLPTKIKLF
        D VVA D   P +G D  K+     +     +  + +  +    DR  W +S++G + V+S Y++  +      E  R  M  ++  LWK  +P ++K F
Subjt:  DRVVA-DFITPSKGRDILKLREQVRDEDVRLITTIPISSI-NVEDRWVWHYSKNGNYIVKSGYKLAMLSSRSGGESSRKAMKVWWQSLWKSGLPTKIKLF

Query:  IWKAYHECLPTNYCLWKRKILGRPTCPMCNSEIETINHALCGC
        +W   ++ + T     +R +     C +C   +E++ H L  C
Subjt:  IWKAYHECLPTNYCLWKRKILGRPTCPMCNSEIETINHALCGC

P11369 LINE-1 retrotransposable element ORF2 protein3.1e-0530.39Show/hide
Query:  KQPKHVSDFRLISLCNVAYKIVAKVIVNRMNWILQDIILENQLAFVPGRSIFDNKIISHECLHTIKSRHTGRKGWVAFKLDMSKAYDRVEWSFLKVLMVK
        K P  + +FR ISL N+  KI+ K++ NR+   ++ II  +Q+ F+PG   + N   S   +H I       K  +   LD  KA+D+++  F+  ++ +
Subjt:  KQPKHVSDFRLISLCNVAYKIVAKVIVNRMNWILQDIILENQLAFVPGRSIFDNKIISHECLHTIKSRHTGRKGWVAFKLDMSKAYDRVEWSFLKVLMVK

Query:  LG
         G
Subjt:  LG

P14381 Transposon TX1 uncharacterized 149 kDa protein1.1e-0741.11Show/hide
Query:  VSDFRLISLCNVAYKIVAKVIVNRMNWILQDIILENQLAFVPGRSIFDNKIISHECLHTIKSRHTGRKGWVAF-KLDMSKAYDRVEWSFL
        + ++R +SL +  YKIVAK I  R+  +L ++I  +Q   VPGR+IFDN  +  + LH   +R TG    +AF  LD  KA+DRV+  +L
Subjt:  VSDFRLISLCNVAYKIVAKVIVNRMNWILQDIILENQLAFVPGRSIFDNKIISHECLHTIKSRHTGRKGWVAF-KLDMSKAYDRVEWSFL

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein2.8e-0929.63Show/hide
Query:  DILKLREQVRDEDVRLITTIPISSINVEDRWVWHYSKNGNYIVKSGYKLAMLSSRSG--------GESSRKAMKVWWQSLWKSGLPTKIKLFIWKAYHEC
        D  K+ + V   D   I  I ++     D+ +W+Y+  G Y V+SGY L      +         G    K        +W   +  K+K F+W+A  + 
Subjt:  DILKLREQVRDEDVRLITTIPISSINVEDRWVWHYSKNGNYIVKSGYKLAMLSSRSG--------GESSRKAMKVWWQSLWKSGLPTKIKLFIWKAYHEC

Query:  LPTNYCLWKRKILGRPTCPMCNSEIETINHALCGC
        L T   L  R +   P+CP C+ E E+INHAL  C
Subjt:  LPTNYCLWKRKILGRPTCPMCNSEIETINHALCGC

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein7.0e-0540Show/hide
Query:  WWQSLWKSGLPTKIKLFIWKAYHECLPTNYCLWKRKILGRPTCPMCNSEIETINHALCGC
        W   +W   +  KIKL IWKA +  LP    L  R I   P C  C  + ETI H L  C
Subjt:  WWQSLWKSGLPTKIKLFIWKAYHECLPTNYCLWKRKILGRPTCPMCNSEIETINHALCGC

AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.9e-1037.18Show/hide
Query:  IVNRMNWILQDIILENQLAFVPGRSIFDNKIISHECLHTIKSRHTGRKGWVAFKLDMSKAYDRVEWSFLKVLMVKLGF
        +V R+  ++ ++I   Q +F+PGR   DN +   E +H+++ R  G KGW+  KLD+ KAYDR+ W +L+  ++  GF
Subjt:  IVNRMNWILQDIILENQLAFVPGRSIFDNKIISHECLHTIKSRHTGRKGWVAFKLDMSKAYDRVEWSFLKVLMVKLGF

AT4G29090.1 Ribonuclease H-like superfamily protein3.8e-1124.51Show/hide
Query:  FNKALLAKQVWRLFTNPNLLASRVFNAR----------SLDSQRNF--------------------------------------------FQTVSHKKFT
        FN ALL KQ+WR+ + P  L ++VF +R           L S+ +F                                             Q V  +++ 
Subjt:  FNKALLAKQVWRLFTNPNLLASRVFNAR----------SLDSQRNF--------------------------------------------FQTVSHKKFT

Query:  SQWDRVVADFITPSKGRDILK--LREQVRDEDVRLITTIPISSINVEDRWVWHYSKNGNYIVKSGYKL--AMLSSRSG-GESSRKAMKVWWQSLWKSGLP
        S    +    +    GR+  K  +     + + +LI  +      + D + W Y+ +G+Y VKSGY +   +++ RS   E S  ++   +Q +WKS   
Subjt:  SQWDRVVADFITPSKGRDILK--LREQVRDEDVRLITTIPISSINVEDRWVWHYSKNGNYIVKSGYKL--AMLSSRSG-GESSRKAMKVWWQSLWKSGLP

Query:  TKIKLFIWKAYHECLPTNYCLWKRKILGRPTCPMCNSEIETINHALCGCIRAK
         KI+ F+WK     LP    L  R +     C  C S  ET+NH L  C  A+
Subjt:  TKIKLFIWKAYHECLPTNYCLWKRKILGRPTCPMCNSEIETINHALCGCIRAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGAAATTGCTCTCAGCGGCGACAACTGGATCAATCCCTCGGAAAACAGACGGTTCGGACGTGGTGTTTTGGAAGCTCAAGAACAAGGTTCCGGTACTGTTCCGAT
GGAGATCTCGACGGCGATGAAAATGGGTTACATTCTCATAATGCCTATTAATCCAAGTATTATGATGGCAATAATGGTGAAGGCCACGACCCAGGATGATAATAAGCCAC
CTTTAAACCATTGTCTGTTGGTCCAACAACTCAAATGGCCTGGAATTGGTTGTGGGCTTGTCGAATTCAAAATCTTGGAAGAGGCGAGCAAGACTAATTTATCTGACACG
AAACTGCTACGTCGGATCCACTCAAGAAGCGGGTTGGAGATGGGCTGCAGGAGGGTCAAAAAAAGGCCATGCATGGAGAATGAAGATGTGATATCCACAATGAAGAATCG
GCGGAGGTTGTTGATCAGCCCCGCCGCGAGCCATGAAAACTATATGTTGGAATGGTGTTTCATAGTAGATTGTAAAGGTAAAAGCGGAGGTTTATGTTTGATGTGGAAGG
ATGAAGTTGATGTTCGCATTCGCTCCTACACTCTCTTTCATATTGACGCCTCAAGAAAATGGGAGTCTAAATGTGGCGATTTACTGGCATTTACGGTCTCCCCAGGTGGA
GTGCCAGTTCGCGATTCTCAGCTAGAGAACTTTCGTTCAGTAATGGATGATTGCGGGCTTCAGGATTTGGACTATGTGGGTGATTCGTTCACATGGTCCAATAGACAGCT
TGATGTGGAGCAAATAAATGAAAGACTGGATAGGTTCATAGCCAATGAGGAATTCTGCCAACTATTTCCTCAACTCTCTATGACCCATCTAGATTGGACTCATTCAGACC
ACCGTCCAATTATGCTGCATACTCTACCACAACAAAGCGGCCAATGGGGAATGCGCACCATGGAGACTCCTCGGCTAGAAGAGTGTTTGGCAAAATGTAAATTTCGTCTG
CAAAAATGGGGAAAAGAACAATATTTCACGTCCATATACCCAGACGAGGCGACTTTCGATGCTGCACTAAAGGAGATACATTGTAAGGTGTCTGAAGAAATGAATGATAA
ATTGCTAGCTTCATTTGATCGAGAAGAGGTGATTAGAGCTGTTAAGCAAATGCACCTGTCCAAAGCACCTGGCCCGATGGTGAAGCAACCCAAACATGTCTCTGATTTTC
GTCTTATTAGTCTATGTAATGTTGCCTACAAGATTGTGGCCAAAGTTATTGTTAACCGAATGAACTGGATCCTCCAAGATATAATCTTAGAGAATCAATTGGCTTTTGTA
CCTGGGAGATCCATTTTCGATAACAAAATTATTAGTCATGAGTGTTTGCATACCATAAAATCTAGGCACACAGGGCGTAAAGGCTGGGTGGCTTTTAAATTGGATATGAG
CAAAGCGTACGATCGTGTTGAATGGTCCTTCCTGAAGGTGTTAATGGTCAAGCTTGGTTTCACCCTCGGAGCAACCACTCTCCTCATATTGATTTTCTGTCAAGCAACGA
GTGTTCAAGTTAGTGCTTTGCGTGATATTCTACATGCATATGAGATGGCATCTGGACAAGTCATTAATGTGACAAATTCGACGATGTATTTCTCTCCAAATGTTAGTAGT
AACGATCGCTCAATGATTCAGTCTATCATGGTGAACTTCAATAAAGCGTTGTTGGCAAAACAGGTATGGCGATTATTTACTAATCCCAATCTTCTAGCCTCTAGAGTTTT
CAATGCCAGATCCCTGGATTCCCAAAGAAACTTCTTTCAAACCGTTTCCCATAAGAAGTTCACAAGTCAATGGGACAGAGTTGTAGCTGATTTCATCACCCCTTCCAAAG
GGCGGGATATTTTAAAGCTAAGGGAACAAGTACGGGATGAGGATGTGAGATTAATTACTACTATCCCCATTAGCTCTATAAATGTCGAAGATAGGTGGGTTTGGCATTAC
TCTAAAAATGGCAACTACATTGTTAAGAGTGGTTATAAACTAGCCATGCTATCTTCAAGATCTGGTGGTGAGTCAAGTAGGAAAGCCATGAAAGTTTGGTGGCAATCGTT
ATGGAAATCCGGTTTGCCGACAAAGATTAAGTTATTCATTTGGAAAGCCTATCATGAGTGTTTACCAACTAATTACTGCTTGTGGAAAAGGAAAATATTAGGCCGGCCTA
CTTGCCCCATGTGCAATTCCGAGATAGAGACAATCAACCATGCCTTATGCGGCTGTATTCGAGCAAAGTCAATTTGTGATTGTACAATCACACGGATTGATGATCAAATA
ATGCGAATGGATCAACAATTACTAGAAGGAAACAACACAGAAGAAGAGTTCAGATGGAAGTATCAGCATTCACTTAACAGTAAGAAGAGTATCGTCCGGGTTTATACAGA
TGCGACGGTGATTCCGAATCGTAGTGGTGCAGGATATGGAGTTGCTATAGTGGGGGAGGGAAATGTCCTTCGTTGTGCAATGGAGATGGTCGAAGATGTAGATCTTAATC
CACTAGCAGCAGAGGTACAAGCCATTCTCCAAGCACTTCGTCTCATACAACGATTGCAATATCAAGAACCCAAGTGTGTTCAGACTCCATTACTGCCATTCAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGGAAATTGCTCTCAGCGGCGACAACTGGATCAATCCCTCGGAAAACAGACGGTTCGGACGTGGTGTTTTGGAAGCTCAAGAACAAGGTTCCGGTACTGTTCCGAT
GGAGATCTCGACGGCGATGAAAATGGGTTACATTCTCATAATGCCTATTAATCCAAGTATTATGATGGCAATAATGGTGAAGGCCACGACCCAGGATGATAATAAGCCAC
CTTTAAACCATTGTCTGTTGGTCCAACAACTCAAATGGCCTGGAATTGGTTGTGGGCTTGTCGAATTCAAAATCTTGGAAGAGGCGAGCAAGACTAATTTATCTGACACG
AAACTGCTACGTCGGATCCACTCAAGAAGCGGGTTGGAGATGGGCTGCAGGAGGGTCAAAAAAAGGCCATGCATGGAGAATGAAGATGTGATATCCACAATGAAGAATCG
GCGGAGGTTGTTGATCAGCCCCGCCGCGAGCCATGAAAACTATATGTTGGAATGGTGTTTCATAGTAGATTGTAAAGGTAAAAGCGGAGGTTTATGTTTGATGTGGAAGG
ATGAAGTTGATGTTCGCATTCGCTCCTACACTCTCTTTCATATTGACGCCTCAAGAAAATGGGAGTCTAAATGTGGCGATTTACTGGCATTTACGGTCTCCCCAGGTGGA
GTGCCAGTTCGCGATTCTCAGCTAGAGAACTTTCGTTCAGTAATGGATGATTGCGGGCTTCAGGATTTGGACTATGTGGGTGATTCGTTCACATGGTCCAATAGACAGCT
TGATGTGGAGCAAATAAATGAAAGACTGGATAGGTTCATAGCCAATGAGGAATTCTGCCAACTATTTCCTCAACTCTCTATGACCCATCTAGATTGGACTCATTCAGACC
ACCGTCCAATTATGCTGCATACTCTACCACAACAAAGCGGCCAATGGGGAATGCGCACCATGGAGACTCCTCGGCTAGAAGAGTGTTTGGCAAAATGTAAATTTCGTCTG
CAAAAATGGGGAAAAGAACAATATTTCACGTCCATATACCCAGACGAGGCGACTTTCGATGCTGCACTAAAGGAGATACATTGTAAGGTGTCTGAAGAAATGAATGATAA
ATTGCTAGCTTCATTTGATCGAGAAGAGGTGATTAGAGCTGTTAAGCAAATGCACCTGTCCAAAGCACCTGGCCCGATGGTGAAGCAACCCAAACATGTCTCTGATTTTC
GTCTTATTAGTCTATGTAATGTTGCCTACAAGATTGTGGCCAAAGTTATTGTTAACCGAATGAACTGGATCCTCCAAGATATAATCTTAGAGAATCAATTGGCTTTTGTA
CCTGGGAGATCCATTTTCGATAACAAAATTATTAGTCATGAGTGTTTGCATACCATAAAATCTAGGCACACAGGGCGTAAAGGCTGGGTGGCTTTTAAATTGGATATGAG
CAAAGCGTACGATCGTGTTGAATGGTCCTTCCTGAAGGTGTTAATGGTCAAGCTTGGTTTCACCCTCGGAGCAACCACTCTCCTCATATTGATTTTCTGTCAAGCAACGA
GTGTTCAAGTTAGTGCTTTGCGTGATATTCTACATGCATATGAGATGGCATCTGGACAAGTCATTAATGTGACAAATTCGACGATGTATTTCTCTCCAAATGTTAGTAGT
AACGATCGCTCAATGATTCAGTCTATCATGGTGAACTTCAATAAAGCGTTGTTGGCAAAACAGGTATGGCGATTATTTACTAATCCCAATCTTCTAGCCTCTAGAGTTTT
CAATGCCAGATCCCTGGATTCCCAAAGAAACTTCTTTCAAACCGTTTCCCATAAGAAGTTCACAAGTCAATGGGACAGAGTTGTAGCTGATTTCATCACCCCTTCCAAAG
GGCGGGATATTTTAAAGCTAAGGGAACAAGTACGGGATGAGGATGTGAGATTAATTACTACTATCCCCATTAGCTCTATAAATGTCGAAGATAGGTGGGTTTGGCATTAC
TCTAAAAATGGCAACTACATTGTTAAGAGTGGTTATAAACTAGCCATGCTATCTTCAAGATCTGGTGGTGAGTCAAGTAGGAAAGCCATGAAAGTTTGGTGGCAATCGTT
ATGGAAATCCGGTTTGCCGACAAAGATTAAGTTATTCATTTGGAAAGCCTATCATGAGTGTTTACCAACTAATTACTGCTTGTGGAAAAGGAAAATATTAGGCCGGCCTA
CTTGCCCCATGTGCAATTCCGAGATAGAGACAATCAACCATGCCTTATGCGGCTGTATTCGAGCAAAGTCAATTTGTGATTGTACAATCACACGGATTGATGATCAAATA
ATGCGAATGGATCAACAATTACTAGAAGGAAACAACACAGAAGAAGAGTTCAGATGGAAGTATCAGCATTCACTTAACAGTAAGAAGAGTATCGTCCGGGTTTATACAGA
TGCGACGGTGATTCCGAATCGTAGTGGTGCAGGATATGGAGTTGCTATAGTGGGGGAGGGAAATGTCCTTCGTTGTGCAATGGAGATGGTCGAAGATGTAGATCTTAATC
CACTAGCAGCAGAGGTACAAGCCATTCTCCAAGCACTTCGTCTCATACAACGATTGCAATATCAAGAACCCAAGTGTGTTCAGACTCCATTACTGCCATTCAGATGA
Protein sequenceShow/hide protein sequence
MVEIALSGDNWINPSENRRFGRGVLEAQEQGSGTVPMEISTAMKMGYILIMPINPSIMMAIMVKATTQDDNKPPLNHCLLVQQLKWPGIGCGLVEFKILEEASKTNLSDT
KLLRRIHSRSGLEMGCRRVKKRPCMENEDVISTMKNRRRLLISPAASHENYMLEWCFIVDCKGKSGGLCLMWKDEVDVRIRSYTLFHIDASRKWESKCGDLLAFTVSPGG
VPVRDSQLENFRSVMDDCGLQDLDYVGDSFTWSNRQLDVEQINERLDRFIANEEFCQLFPQLSMTHLDWTHSDHRPIMLHTLPQQSGQWGMRTMETPRLEECLAKCKFRL
QKWGKEQYFTSIYPDEATFDAALKEIHCKVSEEMNDKLLASFDREEVIRAVKQMHLSKAPGPMVKQPKHVSDFRLISLCNVAYKIVAKVIVNRMNWILQDIILENQLAFV
PGRSIFDNKIISHECLHTIKSRHTGRKGWVAFKLDMSKAYDRVEWSFLKVLMVKLGFTLGATTLLILIFCQATSVQVSALRDILHAYEMASGQVINVTNSTMYFSPNVSS
NDRSMIQSIMVNFNKALLAKQVWRLFTNPNLLASRVFNARSLDSQRNFFQTVSHKKFTSQWDRVVADFITPSKGRDILKLREQVRDEDVRLITTIPISSINVEDRWVWHY
SKNGNYIVKSGYKLAMLSSRSGGESSRKAMKVWWQSLWKSGLPTKIKLFIWKAYHECLPTNYCLWKRKILGRPTCPMCNSEIETINHALCGCIRAKSICDCTITRIDDQI
MRMDQQLLEGNNTEEEFRWKYQHSLNSKKSIVRVYTDATVIPNRSGAGYGVAIVGEGNVLRCAMEMVEDVDLNPLAAEVQAILQALRLIQRLQYQEPKCVQTPLLPFR