; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036114 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036114
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr3:39248531..39253513
RNA-Seq ExpressionLag0036114
SyntenyLag0036114
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]4.5e-10632.09Show/hide
Query:  FTGFYGNPDQTLRSHSLNLIRRLHDHNNSAWVIGGDMNEILWQNEKLGGPDRENRKIMAFREVLDDFNLRDLGSSGGLFTWCNRRSLGEQVSLRLDRFIA
        FTGFYG+P    R  +  L+RR+ + + S W+IGGDMN ILW  E       +  +I AFR ++D  +L D+G  GG+FTWCN R  G+Q+  RLDRF+ 
Subjt:  FTGFYGNPDQTLRSHSLNLIRRLHDHNNSAWVIGGDMNEILWQNEKLGGPDRENRKIMAFREVLDDFNLRDLGSSGGLFTWCNRRSLGEQVSLRLDRFIA

Query:  NASFCDLF-----------------EKVQKESIRRRRNLIDEVEDINGNWLSEKCSIHDAF--------------------VIFLKEMFSSSKPDQG---
        N +F  +F                 + +Q  S   R      V D+     ++K +I DA+                    ++ L+E+F   +  +    
Subjt:  NASFCDLF-----------------EKVQKESIRRRRNLIDEVEDINGNWLSEKCSIHDAF--------------------VIFLKEMFSSSKPDQG---

Query:  ---------DIDRVLSYVPRKVSDEMNQMLIASFTSEEITTAIKSFQPTKAAGPNGFPAVFYQKYWDIA-------------------------------
                 DI+ +++ +P +++ E+N+ L+A +T EEI  AI+   PTKA GP+GFPA+FYQ YW +                                
Subjt:  ---------DIDRVLSYVPRKVSDEMNQMLIASFTSEEITTAIKSFQPTKAAGPNGFPAVFYQKYWDIA-------------------------------

Query:  ----RLVSDYHPISLCNVSYKIVTKVLTNRLKIFLNEIIDECQSTFIPGRSITDTMVLGHETLHYLQNKRKGKFGYASLKIDMSKAYDRVEWPYLCQIME
            R +SD+ PISLCNVSYKI++K +TNRLK  +  +I + QS F+P R+I+D +++GHE LH + + + G  G A+LK+D+SKA+DRVEW YL  IM 
Subjt:  ----RLVSDYHPISLCNVSYKIVTKVLTNRLKIFLNEIIDECQSTFIPGRSITDTMVLGHETLHYLQNKRKGKFGYASLKIDMSKAYDRVEWPYLCQIME

Query:  KLGFHEKWISLVMKCVTTPTFSIRMNNESFGFIKPFRGIRQDD---------------------------------------------------------
        K+GF+E WI  +++C++T  FSI +N    G  +P RGIRQ D                                                         
Subjt:  KLGFHEKWISLVMKCVTTPTFSIRMNNESFGFIKPFRGIRQDD---------------------------------------------------------

Query:  ------------------------------------TRQYLCNVLSMRATDSLGPYLGLPETF--HRGKTRDFKFLLDGVCC------------VQNFGG
                                             +QYL  +L+++     G YLGLP  F   RG++R   ++  G  C            ++ F  
Subjt:  ------------------------------------TRQYLCNVLSMRATDSLGPYLGLPETF--HRGKTRDFKFLLDGVCC------------VQNFGG

Query:  APMITRVECISSGGKIFAS---QKRYFPTSSVLNGTISSSSSFFWKGFVWGMDLLKGGLRKNLGNGQSIYKFRDPWIPRLSTFKVVTPPIPSMKDALVAD
        A +   V        +  S   + +YF  +S+L  + +S SS+FWKGF+WG DLL  GLR  +GNG +I  F DPW+PR +TFK +     ++ D  VA 
Subjt:  APMITRVECISSGGKIFAS---QKRYFPTSSVLNGTISSSSSFFWKGFVWGMDLLKGGLRKNLGNGQSIYKFRDPWIPRLSTFKVVTPPIPSMKDALVAD

Query:  FITPSIQWDVVKLNQLFIGFDVGAIQRLPISS-SAPDKWMWHFDRKGIYSVKSSYKFSM-MKNQETSLSRRDGNYR
        FIT    WDV  ++  F   D   I  +PISS +  D W+WH+D++G YSV+S YK  M +K   TS S    NYR
Subjt:  FITPSIQWDVVKLNQLFIGFDVGAIQRLPISS-SAPDKWMWHFDRKGIYSVKSSYKFSM-MKNQETSLSRRDGNYR

XP_024035599.1 uncharacterized protein LOC112096407 [Citrus clementina]1.0e-9431.85Show/hide
Query:  DSEKAGGGLCLLWKKEIDVRIQNYSIHHIHASI-VCNDKQWSFTGFYGNPDQTLRSHSLNLIRRLHDHNNSAWVIGGDMNEILWQNEKLGGPDRENRKIM
        DSE   GGL +LW  + +V+I +Y+ HHI   +     K W   G YG+P  + + H+  L+RRL   +   W+  GD NEIL  NEK GG +R    I 
Subjt:  DSEKAGGGLCLLWKKEIDVRIQNYSIHHIHASI-VCNDKQWSFTGFYGNPDQTLRSHSLNLIRRLHDHNNSAWVIGGDMNEILWQNEKLGGPDRENRKIM

Query:  AFREVLDDFNLRDLGSSGGLFTWCNRRSLGEQVSLRLDRFIAN----ASFCDLFE-----------KVQKESIRRRRNLIDEVEDINGNWLSEKCSIHDA
         FRE + D  L DLGS G  FTW NR+   + +  +LDRF+ N    A F D +             V  E   R+R+L+ E +  +     +  S ++A
Subjt:  AFREVLDDFNLRDLGSSGGLFTWCNRRSLGEQVSLRLDRFIAN----ASFCDLFE-----------KVQKESIRRRRNLIDEVEDINGNWLSEKCSIHDA

Query:  FVIFLKE----------------------------------------------------------------------------------MFSSSKPDQGD
            +KE                                                                                  +F++S P    
Subjt:  FVIFLKE----------------------------------------------------------------------------------MFSSSKPDQGD

Query:  IDRVLSYVPRKVSDEMNQMLIASFTSEEITTAIKSFQPTKAAGPNGFPAVFYQKYW------------DI-----------------------ARLVSDY
        I+  L  +  +V   MN  L   FT+EEI  A+    PTKA GP+G  AVF+QK+W            D+                        R V++Y
Subjt:  IDRVLSYVPRKVSDEMNQMLIASFTSEEITTAIKSFQPTKAAGPNGFPAVFYQKYW------------DI-----------------------ARLVSDY

Query:  HPISLCNVSYKIVTKVLTNRLKIFLNEIIDECQSTFIPGRSITDTMVLGHETLHYLQNKRKGKFGYASLKIDMSKAYDRVEWPYLCQIMEKLGFHEKWIS
         PISLCNV Y +V K + NRLK  L++II   QS F+P R ITD +++G+E LH +++ +  K    +LK+D+ KAYDRVEW +L  ++E+LGF  KWI+
Subjt:  HPISLCNVSYKIVTKVLTNRLKIFLNEIIDECQSTFIPGRSITDTMVLGHETLHYLQNKRKGKFGYASLKIDMSKAYDRVEWPYLCQIMEKLGFHEKWIS

Query:  LVMKCVTTPTFSIRMNNESFGFIKPFRGIRQD-DTRQYLCNVLSMRATDSLGPYLGLPETFHRGKTRDFKFLLDGVCCVQNF---GGAPMITRVECISSG
        L+M C+TTPTFS+ +N  + G I P RG+RQ      YL   L     D  G + G  E     K R      D  C  Q      G  ++   E + + 
Subjt:  LVMKCVTTPTFSIRMNNESFGFIKPFRGIRQD-DTRQYLCNVLSMRATDSLGPYLGLPETFHRGKTRDFKFLLDGVCCVQNF---GGAPMITRVECISSG

Query:  GKIFASQKRYFPTSSVLNGTISSSSSFFWKGFVWGMDLLKGGLRKNLGNGQSIYKFRDPWIPRLSTFK-VVTPPIPSMKDALVADFITPSIQWDVVKLNQ
         K+   + RY+  +  LN  + SS SF W+  +WG  +L+ G R  +GNG+ I      WIPR +TFK +V P +P+   A V++ I P+ QW+   +NQ
Subjt:  GKIFASQKRYFPTSSVLNGTISSSSSFFWKGFVWGMDLLKGGLRKNLGNGQSIYKFRDPWIPRLSTFK-VVTPPIPSMKDALVADFITPSIQWDVVKLNQ

Query:  LFIGFDVGAIQRLPI-SSSAPDKWMWHFDRKGIYSVKSSYKFSM
         F   D   I+ + +  +   D+ +WH+DRKG+YSVKS Y+ ++
Subjt:  LFIGFDVGAIQRLPI-SSSAPDKWMWHFDRKGIYSVKSSYKFSM

XP_024195790.1 uncharacterized protein LOC112198938 [Rosa chinensis]2.1e-8729.22Show/hide
Query:  GGLCLLWKKEIDVRIQNYSIHHIHASI--VCNDKQWSFTGFYGNPDQTLRSHSLNLIRRLHDHNNSAWVIGGDMNEILWQNEKLGGPDRENRKIMAFREV
        GGLCLLW  +I V +++YS  HI   +  +   + W FTG YG P    R  + NLI+ L    N  W++GGD NEIL   EK GGP R  R++  FRE 
Subjt:  GGLCLLWKKEIDVRIQNYSIHHIHASI--VCNDKQWSFTGFYGNPDQTLRSHSLNLIRRLHDHNNSAWVIGGDMNEILWQNEKLGGPDRENRKIMAFREV

Query:  LDDFNLRDLGSSGGLFTWCNRRSLGEQVSLRLDRFIANASFCDLFEKVQ--------------------KESIRRRRNLIDEVEDINGNWLSEKCSIHDA
        L+   L+DL  SG  FTW   R+ GE++ +RLDRF+AN S+  +F   +                    K + R++R      ED    WL E     D 
Subjt:  LDDFNLRDLGSSGGLFTWCNRRSLGEQVSLRLDRFIANASFCDLFEKVQ--------------------KESIRRRRNLIDEVEDINGNWLSEKCSIHDA

Query:  FVIFLKEM-----FSSSKPDQGDIDRVLSYVPRKVSDEMNQMLIASFTSEEITTAIKSFQPTKAAGPNGFPAVFYQKYWDIA------------------
             +       F+   P  G+ D  LS+V   +SDE N  L +    EE+  A+K   P+KA GP+GF   FYQ++WD+                   
Subjt:  FVIFLKEM-----FSSSKPDQGDIDRVLSYVPRKVSDEMNQMLIASFTSEEITTAIKSFQPTKAAGPNGFPAVFYQKYWDIA------------------

Query:  -----------------RLVSDYHPISLCNVSYKIVTKVLTNRLKIFLNEIIDECQSTFIPGRSITDTMVLGHETLHYLQNKRKGKFGYASLKIDMSKAY
                           V    PISLCNV YKI +KVL NRLK  L++II + QS F+PGR I+D  +L  E  H L+ +R+G  G+ +LK+DMSKAY
Subjt:  -----------------RLVSDYHPISLCNVSYKIVTKVLTNRLKIFLNEIIDECQSTFIPGRSITDTMVLGHETLHYLQNKRKGKFGYASLKIDMSKAY

Query:  DRVEWPYLCQIMEKLGFHEKWISLVMKCVTTPTFSIRMNNESFGFIKPFRGIRQDD--------------------------------------------
        DRVEW +L  +++KLGF   WI   M CV T ++S  +N +  G I P RG+RQ D                                            
Subjt:  DRVEWPYLCQIMEKLGFHEKWISLVMKCVTTPTFSIRMNNESFGFIKPFRGIRQDD--------------------------------------------

Query:  --------TRQYLCN--------------------------------------VLSMRATDSLGPYLGLPETFHRGKTRDFKFLLDGV------------
                T   LC                                       VL ++  D    YLGLP      K   F++L+D V            
Subjt:  --------TRQYLCN--------------------------------------VLSMRATDSLGPYLGLPETFHRGKTRDFKFLLDGV------------

Query:  ------------------------------------------------------------CCVQNFGG--------------APMITRVECISSGGKIFA
                                                                    C  +  GG              A    R+ C         
Subjt:  ------------------------------------------------------------CCVQNFGG--------------APMITRVECISSGGKIFA

Query:  SQKRYFPTSSVLNGTISSSSSFFWKGFVWGMDLLKGGLRKNLGNGQSIYKFRDPWIPRLSTFKVVTPPIPSMKDALVADFITPSI-QWDVVKLNQLFIGF
         + RYFP +  ++  +   +SF W+  + G DLLK GLR  +GNG+ I  + DPW+P    FK  + P+   +D  V D I      W    L++LF   
Subjt:  SQKRYFPTSSVLNGTISSSSSFFWKGFVWGMDLLKGGLRKNLGNGQSIYKFRDPWIPRLSTFKVVTPPIPSMKDALVADFITPSI-QWDVVKLNQLFIGF

Query:  DVGAIQRLPIS-SSAPDKWMWHFDRKGIYSVKSSY
        +V  I ++P+S S   D+ + HFD+KG YSVK+ Y
Subjt:  DVGAIQRLPIS-SSAPDKWMWHFDRKGIYSVKSSY

XP_030946032.1 uncharacterized protein LOC115970553 [Quercus lobata]1.7e-8429.34Show/hide
Query:  EKAGGGLCLLWKKEIDVRIQNYSIHHIHASI-VCNDKQWSFTGFYGNPDQTLRSHSLNLIRRLHDHNNSAWVIGGDMNEILWQNEKLGGPDRENRKIMAF
        E  GGG+ + WKK ID  +  +S +HI   +    + +W FTGFYG  +      S + +RRL + N   W+  GD NEI   +EKLGG  R  R++  F
Subjt:  EKAGGGLCLLWKKEIDVRIQNYSIHHIHASI-VCNDKQWSFTGFYGNPDQTLRSHSLNLIRRLHDHNNSAWVIGGDMNEILWQNEKLGGPDRENRKIMAF

Query:  REVLDDFNLRDLGSSGGLFTWCNRRSLGEQVSLRLDRFIANASF--------------------------------------------------------
        R VLD+   RDL   GG FTWCN    G  +  RLDR +A   +                                                        
Subjt:  REVLDDFNLRDLGSSGGLFTWCNRRSLGEQVSLRLDRFIANASF--------------------------------------------------------

Query:  --CDLFEKV--------------QKESIRRRRNLIDEVEDINGNWLSEKCSIHDAFVIFLKEMFSSSKPDQGDIDRVLSYVPRKVSDEMNQMLIASFTSE
          CD+  +                K S R RRN I+ +E+  G    ++  I +  V   + +F SS PDQ  I+  L   P  VS+EMNQ L+A F   
Subjt:  --CDLFEKV--------------QKESIRRRRNLIDEVEDINGNWLSEKCSIHDAFVIFLKEMFSSSKPDQGDIDRVLSYVPRKVSDEMNQMLIASFTSE

Query:  EITTAIKSFQPTKAAGPNGFPAVFYQKYW-----DIARL------------------------------VSDYHPISLCNVSYKIVTKVLTNRLKIFLNE
        E+  A+K   P KA GP+G P +F+Q +W     D+                                 VSDY P++LCN  YK+++KVL NRLK  L  
Subjt:  EITTAIKSFQPTKAAGPNGFPAVFYQKYW-----DIARL------------------------------VSDYHPISLCNVSYKIVTKVLTNRLKIFLNE

Query:  IIDECQSTFIPGRSITDTMVLGHETLHYLQNKRKGKFGYASLKIDMSKAYDRVEWPYLCQIMEKLGFHEKWISLVMKCVTTPTFSIRMNNESFGFIKPFR
         I + QS F   ++I+D +++  +TLH+++N++  K G+ +LK+DMSK YD V W YL +IMEKLGF +KW+SLV +C+++ ++SI +N E  G I+P R
Subjt:  IIDECQSTFIPGRSITDTMVLGHETLHYLQNKRKGKFGYASLKIDMSKAYDRVEWPYLCQIMEKLGFHEKWISLVMKCVTTPTFSIRMNNESFGFIKPFR

Query:  GIRQDDTRQYLCNVLSMRATDSLGPYL------GLPETFHRGKTRDFKFLLDGVCCVQNFGGAPMITRVE-------------------------CISSG
        G+RQ                D L PYL      GL     R    D   ++  +  +        + R +                          +   
Subjt:  GIRQDDTRQYLCNVLSMRATDSLGPYL------GLPETFHRGKTRDFKFLLDGVCCVQNFGGAPMITRVE-------------------------CISSG

Query:  GKIF--ASQKRYFPTSSVLNGTISSSSSFFWKGFVWGMDLLKGGLRKNLGNGQSIYKFRDPWIPRLSTFKVVTPPIPSMKDALVADFITPSIQ-WDVVKL
          +F    + +YFP  S+     ++  S+ W+  +    ++  G+R  +GNG+SI  + D W+P   + K+++P    +  A VA  I    +  D   L
Subjt:  GKIF--ASQKRYFPTSSVLNGTISSSSSFFWKGFVWGMDLLKGGLRKNLGNGQSIYKFRDPWIPRLSTFKVVTPPIPSMKDALVADFITPSIQ-WDVVKL

Query:  NQLFIGFDVGAIQRLPIS-SSAPDKWMWHFDRKGIYSVKSSYK
         Q F+ F+V  I+ +P+  +   D  +W   + G YSVK+ Y+
Subjt:  NQLFIGFDVGAIQRLPIS-SSAPDKWMWHFDRKGIYSVKSSYK

XP_039834390.1 uncharacterized protein LOC120695147 [Panicum virgatum]7.0e-8327.93Show/hide
Query:  DSEKAGGGLCLLWKKEIDVRIQNYSIHHIHASIVCNDKQ-WSFTGFYGNPDQTLRSHSLNLIRRLHDHNNSAWVIGGDMNEILWQNEKLGGPDRENRKIM
        DSE   G L L WKKE+++R+ ++S +HI   I   D+  W  TG YG      +  +  L+R L  H+   W+  GD NEIL+  EK GGP R  R  +
Subjt:  DSEKAGGGLCLLWKKEIDVRIQNYSIHHIHASIVCNDKQ-WSFTGFYGNPDQTLRSHSLNLIRRLHDHNNSAWVIGGDMNEILWQNEKLGGPDRENRKIM

Query:  AFREVLDDFNLRDLGSSGGLFTWCNRRS-LGEQVSLRLDRFIANASFCDLFEKVQ----------------KESIRRR---RNLIDEVEDINGNWL-SEK
         FRE L++ +L DLG  G +FTW N        V  RLDR +AN+++   F  V+                +  ++ +   R  ++ ++     WL  E+
Subjt:  AFREVLDDFNLRDLGSSGGLFTWCNRRS-LGEQVSLRLDRFIANASFCDLFEKVQ----------------KESIRRR---RNLIDEVEDINGNWL-SEK

Query:  CSIHDAFVIFLKEMFSSSKPDQGDIDRVLSYVPRKVSDEMNQMLIASFTSEEITTAIKSFQPTKAAGPNGFPAVFYQKYWDIA-----------------
        CS        +KE + +    Q  I+ VL  + +K++ +MN  L A F+S+E+  A+K     KA G +G P VFY+K+W +                  
Subjt:  CSIHDAFVIFLKEMFSSSKPDQGDIDRVLSYVPRKVSDEMNQMLIASFTSEEITTAIKSFQPTKAAGPNGFPAVFYQKYWDIA-----------------

Query:  ------------------RLVSDYHPISLCNVSYKIVTKVLTNRLKIFLNEIIDECQSTFIPGRSITDTMVLGHETLHYLQNKRKGKFGYASLKIDMSKA
                            + D  PISLCNV YK+++KVL NRLK+ L EII   QS F+PGR ITD ++L +E  HYL  +R GK G A++K+DMSKA
Subjt:  ------------------RLVSDYHPISLCNVSYKIVTKVLTNRLKIFLNEIIDECQSTFIPGRSITDTMVLGHETLHYLQNKRKGKFGYASLKIDMSKA

Query:  YDRVEWPYLCQIMEKLGFHEKWISLVMKCVTTPTFSIRMNNESFGFIKPFRGIRQ---------------------------------------------
        YDRVEW +L ++M KLGF  +W+  VMKCV+T ++ I++N +    I P RG+RQ                                             
Subjt:  YDRVEWPYLCQIMEKLGFHEKWISLVMKCVTTPTFSIRMNNESFGFIKPFRGIRQ---------------------------------------------

Query:  --DDT----------RQYLCNVLSMRATDS------------LGP------------------------YLGLPETFHRGKTRDFKF-------------
          DD+           Q L  +L +  T S              P                        YLGLP +  + + + F++             
Subjt:  --DDT----------RQYLCNVLSMRATDS------------LGP------------------------YLGLPETFHRGKTRDFKF-------------

Query:  -----------------------------LLDGVC------------------------------CVQNFGG----------APMITR------VECISS
                                     L  G+C                                +N GG            M+ R      V   + 
Subjt:  -----------------------------LLDGVC------------------------------CVQNFGG----------APMITR------VECISS

Query:  GGKIFASQKRYFPTSSVLNGTISSSSSFFWKGFVWGMDLLKGGLRKNLGNGQSIYKFRDPWIPRLSTFKVVTPPIPSMKDALVADFITP-SIQWDVVKLN
         G++   + +YFP  ++L        S+ W+  + G+DL+K G+   +GNG+S+  + DPWIP   T +  T    ++    V D + P S  WD   + 
Subjt:  GGKIFASQKRYFPTSSVLNGTISSSSSFFWKGFVWGMDLLKGGLRKNLGNGQSIYKFRDPWIPRLSTFKVVTPPIPSMKDALVADFITP-SIQWDVVKLN

Query:  QLFIGFDVGAIQRLPISSSAPDKWMWHFDRKGIYSVKSSYKFSMMKNQETSLSRRDGNYRSCP
          F  FD  AI +L ++    D+  WHFD+KG++SVKS+YK ++ + +E  + R   +  S P
Subjt:  QLFIGFDVGAIQRLPISSSAPDKWMWHFDRKGIYSVKSSYKFSMMKNQETSLSRRDGNYRSCP

TrEMBL top hitse value%identityAlignment
A0A2N9FI47 CCHC-type domain-containing protein4.6e-8828.48Show/hide
Query:  GDDSEKAGGGLCLLWKKEIDVRIQNYSIHHIHASIVCNDK-QWSFTGFYGNPDQTLRSHSLNLIRRLHDHNNSAWVIGGDMNEILWQNEKLGGPDRENRK
        G D    GGGL LLW+  + V IQ+YSI HI A +V  D+ +W  TGFYG+P++ LR  S  L+R+LHD  +  W++ GD NE++   E+ G  +R   +
Subjt:  GDDSEKAGGGLCLLWKKEIDVRIQNYSIHHIHASIVCNDK-QWSFTGFYGNPDQTLRSHSLNLIRRLHDHNNSAWVIGGDMNEILWQNEKLGGPDRENRK

Query:  IMAFREVLDDFNLRDLGSSGGLFTWCNRRSLGEQVSLRLDRFIANASFCDLFEKVQ-----------------------KESIRRRR-------------
        + AFR+ L D +L+DLG  G  F+W NRR  G  V  +LDR +AN ++  LF   Q                         SI R++             
Subjt:  IMAFREVLDDFNLRDLGSSGGLFTWCNRRSLGEQVSLRLDRFIANASFCDLFEKVQ-----------------------KESIRRRR-------------

Query:  -----------------------------------------------------------------------------NLIDEVEDI-------NGNWLSE
                                                                                     N++ E E+I       +  W +E
Subjt:  -----------------------------------------------------------------------------NLIDEVEDI-------NGNWLSE

Query:  KCSIHDAFVIFLKEMFSSSKPDQGDIDRVLSYVPRKVSDEMNQMLIASFTSEEITTAIKSFQPTKAAGPNGFPAVFYQKYWDIA----------------
          +I +  V +   +F SS P+   I  V+  V   V+  MN  L+   +S+EI  A+    P+KA GP+G  A+F+QKYW I                 
Subjt:  KCSIHDAFVIFLKEMFSSSKPDQGDIDRVLSYVPRKVSDEMNQMLIASFTSEEITTAIKSFQPTKAAGPNGFPAVFYQKYWDIA----------------

Query:  -------------------RLVSDYHPISLCNVSYKIVTKVLTNRLKIFLNEIIDECQSTFIPGRSITDTMVLGHETLHYLQNKRKGKFGYASLKIDMSK
                             +S +  ISLCNV YKI +KVL NR+K+ L  +I + QS F+PGR I+D +++  E LHYL+N   G     + K+DMSK
Subjt:  -------------------RLVSDYHPISLCNVSYKIVTKVLTNRLKIFLNEIIDECQSTFIPGRSITDTMVLGHETLHYLQNKRKGKFGYASLKIDMSK

Query:  AYDRVEWPYLCQIMEKLGFHEKWISLVMKCVTTPTFSIRMNNESFGFIKPFRGIRQDDTRQYLCNVLSMRATDSLGPYLGLPETFHRGKTRDFKF---LL
        AYDRVEW +L  I+ K GFH +W+ L+M  V+T ++++ +N   +G+IKP RG+RQ D       +L      +L       +         FK    L 
Subjt:  AYDRVEWPYLCQIMEKLGFHEKWISLVMKCVTTPTFSIRMNNESFGFIKPFRGIRQDDTRQYLCNVLSMRATDSLGPYLGLPETFHRGKTRDFKF---LL

Query:  DGVCCV-------QNFGGAPM--ITRVECI---SSGGKIFAS----------------------------QKRYFPTSSVLNGTISSSSSFFWKGFVWGM
        + +C +       Q  G   +  + + + +     GG  F                              + +YFP +S L   +S +  + W+      
Subjt:  DGVCCV-------QNFGGAPM--ITRVECI---SSGGKIFAS----------------------------QKRYFPTSSVLNGTISSSSSFFWKGFVWGM

Query:  DLLKGGLRKNLGNGQSIYKFRDPWIPRLSTFKVVTPPIPSMKDALVADFIT-PSIQWDVVKLNQLFIGFDVGAIQRLPISSSAP-DKWMWHFDRKGIYSV
        ++++ GLR  +G+G +I  ++D W+P  STF+ ++P   S  +A V   I   S+ WDV KL Q+F+  DV  I ++P+S   P DK +W   + G ++V
Subjt:  DLLKGGLRKNLGNGQSIYKFRDPWIPRLSTFKVVTPPIPSMKDALVADFIT-PSIQWDVVKLNQLFIGFDVGAIQRLPISSSAP-DKWMWHFDRKGIYSV

Query:  KSSYKFSMMKNQETSLSRRDGNYRS
        KS+Y  S++ +Q     R   N R+
Subjt:  KSSYKFSMMKNQETSLSRRDGNYRS

A0A2N9FTN8 Uncharacterized protein8.9e-9229.82Show/hide
Query:  ILFSPGNIKRPKTDGD------DSEKAGGGLCLLWKKEIDVRIQNYSIHHIHASIVCNDKQ-WSFTGFYGNPDQTLRSHSLNLIRRLHDHNNSAWVIGGD
        I   P NI  P  +        +S   GGGLCL WKKEI++R+ ++S  HI A I  N    W  TGFYG P+   R  S +L+RRL+      W   GD
Subjt:  ILFSPGNIKRPKTDGD------DSEKAGGGLCLLWKKEIDVRIQNYSIHHIHASIVCNDKQ-WSFTGFYGNPDQTLRSHSLNLIRRLHDHNNSAWVIGGD

Query:  MNEILWQNEKLGGPDRENRKIMAFREVLDDFNLRDLGSSGGLFTWCNRRSLGEQVSLRLDRFIANASFCDLF----------------------------
         NE++   EK G   R +R++  F++VLDD    DLG  G  FTW N R+ G+    RLDR +A   +   F                            
Subjt:  MNEILWQNEKLGGPDRENRKIMAFREVLDDFNLRDLGSSGGLFTWCNRRSLGEQVSLRLDRFIANASFCDLF----------------------------

Query:  --------------------EKVQKESIRRRRNLIDEVED--INGNWLSEKCSIHDAFVIFLK-------------------EMFSSSKPDQGDIDRVLS
                             K  + +    +N I E E        LS +   H    I  +                   E   +  P+   +++V+ 
Subjt:  --------------------EKVQKESIRRRRNLIDEVED--INGNWLSEKCSIHDAFVIFLK-------------------EMFSSSKPDQGDIDRVLS

Query:  YVPRKVSDEMNQMLIASFTSEEITTAIKSFQPTKAAGPNGFPAVFYQKYW-----DIARL------------------------------VSDYHPISLC
         +P+ V+ EMN +L   FT +E+TTA+K   P KA GP+G P +FYQ+YW     DI +                               V ++ PISLC
Subjt:  YVPRKVSDEMNQMLIASFTSEEITTAIKSFQPTKAAGPNGFPAVFYQKYW-----DIARL------------------------------VSDYHPISLC

Query:  NVSYKIVTKVLTNRLKIFLNEIIDECQSTFIPGRSITDTMVLGHETLHYLQNKRKGKFGYASLKIDMSKAYDRVEWPYLCQIMEKLGFHEKWISLVMKCV
        NV YK+ +KVL NRLK  L +I+ E QS F+PGR ITD +++  ETLH++ ++RKGKFG  +LK+DMSKAYDRVEW YL ++MEK+GFH KW++L+M+C+
Subjt:  NVSYKIVTKVLTNRLKIFLNEIIDECQSTFIPGRSITDTMVLGHETLHYLQNKRKGKFGYASLKIDMSKAYDRVEWPYLCQIMEKLGFHEKWISLVMKCV

Query:  TTPTFSIRMNNESFGFIKPFRGIRQDDT------RQY----------------------------LCNVLSMRATDSLGPYLGLPETFHRGKTRDFKFL-
        TT ++SI +N E  GFIKP R    D +      ++Y                            +  +L + A      YLGLP     G      +L 
Subjt:  TTPTFSIRMNNESFGFIKPFRGIRQDDT------RQY----------------------------LCNVLSMRATDSLGPYLGLPETFHRGKTRDFKFL-

Query:  LDGVCCVQNFGG--------------APMITRVECISSGGKIFASQKRYFPTSSVLNGTISSSSSFFWKGFVWGMDLLKGGLRKNLGNGQSIYKFRDPWI
         + +C  +  GG              A  + R+   +S       + +YFP  S+L+   ++ SSF WK  +   +L++ GL   +G G  +  + D W+
Subjt:  LDGVCCVQNFGG--------------APMITRVECISSGGKIFASQKRYFPTSSVLNGTISSSSSFFWKGFVWGMDLLKGGLRKNLGNGQSIYKFRDPWI

Query:  PRLSTFKVVTPPIPSMKDALVADFITPSIQ-WDVVKLNQLFIGFDVGAIQRLPISS-SAPDKWMWHFDRKGIYSVKSSYKFSMMKNQE
        P  +   +++P  P+     V   I    + W    +   F+  +   I  + +SS +  D   W   + G+Y+V+S Y   + ++ +
Subjt:  PRLSTFKVVTPPIPSMKDALVADFITPSIQ-WDVVKLNQLFIGFDVGAIQRLPISS-SAPDKWMWHFDRKGIYSVKSSYKFSMMKNQE

A0A2N9HB41 Uncharacterized protein7.6e-9126.69Show/hide
Query:  GDDSEKAGGGLCLLWKKEIDVRIQNYSIHHIHASIVCNDKQ-WSFTGFYGNPDQTLRSHSLNLIRRLHDHNNSAWVIGGDMNEILWQNEKLGGPDRENRK
        G D  + GGGL L W   + + +Q+YS +HI A ++ +D   W FTGFYG+P+  LR  S +L+RRLH  NN  W++ GD NEI+   EK G  DR  R+
Subjt:  GDDSEKAGGGLCLLWKKEIDVRIQNYSIHHIHASIVCNDKQ-WSFTGFYGNPDQTLRSHSLNLIRRLHDHNNSAWVIGGDMNEILWQNEKLGGPDRENRK

Query:  IMAFREVLDDFNLRDLGSSGGLFTWCNRRSLGEQVSLRLDRFIANASFCDLFEK----------------------------------------------
        +  FRE L D +L DLG  G  FTW N R   + V +RLDR +A+  + +LF +                                              
Subjt:  IMAFREVLDDFNLRDLGSSGGLFTWCNRRSLGEQVSLRLDRFIANASFCDLFEK----------------------------------------------

Query:  --------------------------------------------------------------------------------VQKESI--------------
                                                                                        ++KE I              
Subjt:  --------------------------------------------------------------------------------VQKESI--------------

Query:  -------------RRRRNLIDEVEDINGNWLSEKCSIHDAFVIFLKEMFSSSKPDQGDIDRVLSYVPRKVSDEMNQMLIASFTSEEITTAIKSFQPTKAA
                     RR+ N +  + D NG W S+   +    V + + +F SS P    I+ V   V  +VS EMN  L+A F+SEE+ +A+    P+KA 
Subjt:  -------------RRRRNLIDEVEDINGNWLSEKCSIHDAFVIFLKEMFSSSKPDQGDIDRVLSYVPRKVSDEMNQMLIASFTSEEITTAIKSFQPTKAA

Query:  GPNGFPAVFYQKYWDIARL-----------------------------------VSDYHPISLCNVSYKIVTKVLTNRLKIFLNEIIDECQSTFIPGRSI
        GP+G  A+F+Q+YW+I  L                                   +S + PISLCNV YKI++KVL NR+K  L  II +CQS F+PGR I
Subjt:  GPNGFPAVFYQKYWDIARL-----------------------------------VSDYHPISLCNVSYKIVTKVLTNRLKIFLNEIIDECQSTFIPGRSI

Query:  TDTMVLGHETLHYLQNKRKGKFGYASLKIDMSKAYDRVEWPYLCQIMEKLGFHEKWISLVMKCVTTPTFSIRMNNESFGFIKPFRGIRQ-DDTRQYL---
        TD +++  E LHYL+NK+ G+ G  + K+DMSKAYDRVEW YL  I+ KLGFHE+W+SL+M CV++ T+S+ +N E  GFIKP RG+RQ D    YL   
Subjt:  TDTMVLGHETLHYLQNKRKGKFGYASLKIDMSKAYDRVEWPYLCQIMEKLGFHEKWISLVMKCVTTPTFSIRMNNESFGFIKPFRGIRQ-DDTRQYL---

Query:  ------------------------------------------CNVLSMR-----------------------------------------------ATDS
                                                  CN  ++                                                AT  
Subjt:  ------------------------------------------CNVLSMR-----------------------------------------------ATDS

Query:  LGPYLGLPETFHRGKTRDFKFLLD------------------------------------------GVCCVQN-------FGGAPMITRVECISS-----
           YLGLP    R K R F  L D                                          G+C   +       +G   M  +V  ++      
Subjt:  LGPYLGLPETFHRGKTRDFKFLLD------------------------------------------GVCCVQN-------FGGAPMITRVECISS-----

Query:  ----GGKIFAS----------------------------QKRYFPTSSVLNGTISSSSSFFWKGFVWGMDLLKGGLRKNLGNGQSIYKFRDPWIPRLSTF
            GG  F                              + +Y+P  S L  +I  + SF W+       +L  GL   +G+G++I  ++D W+      
Subjt:  ----GGKIFAS----------------------------QKRYFPTSSVLNGTISSSSSFFWKGFVWGMDLLKGGLRKNLGNGQSIYKFRDPWIPRLSTF

Query:  KVVTPPIPSMKDALVADFITPSIQ-WDVVKLNQLFIGFDVGAIQRLPISSSAP-DKWMWHFDRKGIYSVKSSYKFSMMKNQETSLSRRDGNYR
        K+++ P     +A V++ I      W+   ++Q+F+  D  +I++LP+SS  P DK +W  +R G+YSVKS+YK  M K+   S S   G+ +
Subjt:  KVVTPPIPSMKDALVADFITPSIQ-WDVVKLNQLFIGFDVGAIQRLPISSSAP-DKWMWHFDRKGIYSVKSSYKFSMMKNQETSLSRRDGNYR

A0A2N9I5P9 Uncharacterized protein4.0e-9230.32Show/hide
Query:  GGGLCLLWKKEIDVRIQNYSIHHIHASIVCNDKQ-WSFTGFYGNPDQTLRSHSLNLIRRLHDHNNSAWVIGGDMNEILWQNEKLGGPDRENRKIMAFREV
        G GL LLW   I V IQ++S HHI A ++  D   W  TGFYG+P++ LRSHS +L+R LH   +  W++ GD NEI   +EK G  DR   +++AFRE 
Subjt:  GGGLCLLWKKEIDVRIQNYSIHHIHASIVCNDKQ-WSFTGFYGNPDQTLRSHSLNLIRRLHDHNNSAWVIGGDMNEILWQNEKLGGPDRENRKIMAFREV

Query:  LDDFNLRDLGSSGGLFTWCNRRSLGEQVSLRLDRFIANASFCDLFEK-----------------------------------------------------
        L D +LRDLG  G  FTW NRR  G  V +RLDR +AN ++  LF +                                                     
Subjt:  LDDFNLRDLGSSGGLFTWCNRRSLGEQVSLRLDRFIANASFCDLFEK-----------------------------------------------------

Query:  ---------------------------VQKESI---------------------------RRRRNLIDEVEDINGNWLSEKCSIHDAFVIFLKEMFSSSK
                                   V+KE +                           R++ N I  + D NG W S+  +I+   V +   +FSSSK
Subjt:  ---------------------------VQKESI---------------------------RRRRNLIDEVEDINGNWLSEKCSIHDAFVIFLKEMFSSSK

Query:  PDQGDIDRVLSYVPRKVSDEMNQMLIASFTSEEITTAIKSFQPTKAAGPNGFPAVFYQKYWDIA-----------------------------------R
        PD   ID V+  V   VS +MN  L+  F+SEEI  A+    P+KA GP+G  A+F+QKYW I                                     
Subjt:  PDQGDIDRVLSYVPRKVSDEMNQMLIASFTSEEITTAIKSFQPTKAAGPNGFPAVFYQKYWDIA-----------------------------------R

Query:  LVSDYHPISLCNVSYKIVTKVLTNRLKIFLNEIIDECQSTFIPGRSITDTMVLGHETLHYLQNKRKGKFGYASLKIDMSKAYDRVEWPYLCQIMEKLGFH
         ++ + PISLCNV YKIV+KVL NR+K  L  +I + QS F+PGR ITD +++  E LHYL+N R G     + K+DMSKAYDRVEW YL  I+ KLGFH
Subjt:  LVSDYHPISLCNVSYKIVTKVLTNRLKIFLNEIIDECQSTFIPGRSITDTMVLGHETLHYLQNKRKGKFGYASLKIDMSKAYDRVEWPYLCQIMEKLGFH

Query:  EKWISLVMKCVTTPTFSIRMNNESFGFIKPFRGIRQDDTRQYLCNVLSMRATDSLGP----------------------------------------YLG
         +W+ L+M CV T ++S+ +N E+ G+IKP RG+RQ D       ++      SL P                                        YLG
Subjt:  EKWISLVMKCVTTPTFSIRMNNESFGFIKPFRGIRQDDTRQYLCNVLSMRATDSLGP----------------------------------------YLG

Query:  LPETFHRGKTRDFKFLLDGV------CCVQNFGGAPMITRVECISSGGKIFASQKRYFPTSSVLNGTISSSSSFFWKG-------FVW------------
        LP    R K   F  + + +         +    A   T ++ +      +A     FP    L   I S ++ FW G         W            
Subjt:  LPETFHRGKTRDFKFLLDGV------CCVQNFGGAPMITRVECISSGGKIFASQKRYFPTSSVLNGTISSSSSFFWKG-------FVW------------

Query:  -GM---------------DLLKGGLRKNLGNGQSIYKFRDPWIPRLSTFKVVTPPIPSMKDALVADFITPSIQ-WDVVKLNQLFIGFDVGAIQRLPISSS
         GM                +L+ GLR  +G+G  I  ++D W+   +T+K+++P     ++A V   I   ++ W+   L Q+F+  DV  I+++P+S  
Subjt:  -GM---------------DLLKGGLRKNLGNGQSIYKFRDPWIPRLSTFKVVTPPIPSMKDALVADFITPSIQ-WDVVKLNQLFIGFDVGAIQRLPISSS

Query:  AP-DKWMW
        +P D+ +W
Subjt:  AP-DKWMW

A0A6J1DX30 uncharacterized protein LOC1110248742.2e-10632.09Show/hide
Query:  FTGFYGNPDQTLRSHSLNLIRRLHDHNNSAWVIGGDMNEILWQNEKLGGPDRENRKIMAFREVLDDFNLRDLGSSGGLFTWCNRRSLGEQVSLRLDRFIA
        FTGFYG+P    R  +  L+RR+ + + S W+IGGDMN ILW  E       +  +I AFR ++D  +L D+G  GG+FTWCN R  G+Q+  RLDRF+ 
Subjt:  FTGFYGNPDQTLRSHSLNLIRRLHDHNNSAWVIGGDMNEILWQNEKLGGPDRENRKIMAFREVLDDFNLRDLGSSGGLFTWCNRRSLGEQVSLRLDRFIA

Query:  NASFCDLF-----------------EKVQKESIRRRRNLIDEVEDINGNWLSEKCSIHDAF--------------------VIFLKEMFSSSKPDQG---
        N +F  +F                 + +Q  S   R      V D+     ++K +I DA+                    ++ L+E+F   +  +    
Subjt:  NASFCDLF-----------------EKVQKESIRRRRNLIDEVEDINGNWLSEKCSIHDAF--------------------VIFLKEMFSSSKPDQG---

Query:  ---------DIDRVLSYVPRKVSDEMNQMLIASFTSEEITTAIKSFQPTKAAGPNGFPAVFYQKYWDIA-------------------------------
                 DI+ +++ +P +++ E+N+ L+A +T EEI  AI+   PTKA GP+GFPA+FYQ YW +                                
Subjt:  ---------DIDRVLSYVPRKVSDEMNQMLIASFTSEEITTAIKSFQPTKAAGPNGFPAVFYQKYWDIA-------------------------------

Query:  ----RLVSDYHPISLCNVSYKIVTKVLTNRLKIFLNEIIDECQSTFIPGRSITDTMVLGHETLHYLQNKRKGKFGYASLKIDMSKAYDRVEWPYLCQIME
            R +SD+ PISLCNVSYKI++K +TNRLK  +  +I + QS F+P R+I+D +++GHE LH + + + G  G A+LK+D+SKA+DRVEW YL  IM 
Subjt:  ----RLVSDYHPISLCNVSYKIVTKVLTNRLKIFLNEIIDECQSTFIPGRSITDTMVLGHETLHYLQNKRKGKFGYASLKIDMSKAYDRVEWPYLCQIME

Query:  KLGFHEKWISLVMKCVTTPTFSIRMNNESFGFIKPFRGIRQDD---------------------------------------------------------
        K+GF+E WI  +++C++T  FSI +N    G  +P RGIRQ D                                                         
Subjt:  KLGFHEKWISLVMKCVTTPTFSIRMNNESFGFIKPFRGIRQDD---------------------------------------------------------

Query:  ------------------------------------TRQYLCNVLSMRATDSLGPYLGLPETF--HRGKTRDFKFLLDGVCC------------VQNFGG
                                             +QYL  +L+++     G YLGLP  F   RG++R   ++  G  C            ++ F  
Subjt:  ------------------------------------TRQYLCNVLSMRATDSLGPYLGLPETF--HRGKTRDFKFLLDGVCC------------VQNFGG

Query:  APMITRVECISSGGKIFAS---QKRYFPTSSVLNGTISSSSSFFWKGFVWGMDLLKGGLRKNLGNGQSIYKFRDPWIPRLSTFKVVTPPIPSMKDALVAD
        A +   V        +  S   + +YF  +S+L  + +S SS+FWKGF+WG DLL  GLR  +GNG +I  F DPW+PR +TFK +     ++ D  VA 
Subjt:  APMITRVECISSGGKIFAS---QKRYFPTSSVLNGTISSSSSFFWKGFVWGMDLLKGGLRKNLGNGQSIYKFRDPWIPRLSTFKVVTPPIPSMKDALVAD

Query:  FITPSIQWDVVKLNQLFIGFDVGAIQRLPISS-SAPDKWMWHFDRKGIYSVKSSYKFSM-MKNQETSLSRRDGNYR
        FIT    WDV  ++  F   D   I  +PISS +  D W+WH+D++G YSV+S YK  M +K   TS S    NYR
Subjt:  FITPSIQWDVVKLNQLFIGFDVGAIQRLPISS-SAPDKWMWHFDRKGIYSVKSSYKFSM-MKNQETSLSRRDGNYR

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein4.1e-1723.02Show/hide
Query:  RRRRNLIDEVEDINGNWLSEKCSIHDAFVIFLKEMFSSSKPDQGDIDRVL-SYVPRKVSDEMNQMLIASFTSEEITTAIKSFQPTKAAGPNGFPAVFYQK
        +R +N ID +++  G+  ++   I      + K ++++   +  ++D  L +Y   +++ E  + L    T  EI   I S    K+ GP+GF A FYQ+
Subjt:  RRRRNLIDEVEDINGNWLSEKCSIHDAFVIFLKEMFSSSKPDQGDIDRVL-SYVPRKVSDEMNQMLIASFTSEEITTAIKSFQPTKAAGPNGFPAVFYQK

Query:  YWD-----IARLV-------------------------------SDYHPISLCNVSYKIVTKVLTNRLKIFLNEIIDECQSTFIPGRSITDTMVLGHETL
        Y +     + +L                                 ++ PISL N+  KI+ K+L NR++  + ++I   Q  FIPG      +      +
Subjt:  YWD-----IARLV-------------------------------SDYHPISLCNVSYKIVTKVLTNRLKIFLNEIIDECQSTFIPGRSITDTMVLGHETL

Query:  HYLQNKRKGKFGYASLKIDMSKAYDRVEWPYLCQIMEKLGFHEKWISLVMKCVTTPTFSIRMNNESFGFIKPFRGIRQ
         ++ N+ K K  +  + ID  KA+D+++ P++ + + KLG    ++ ++      PT +I +N +         G RQ
Subjt:  HYLQNKRKGKFGYASLKIDMSKAYDRVEWPYLCQIMEKLGFHEKWISLVMKCVTTPTFSIRMNNESFGFIKPFRGIRQ

P08548 LINE-1 reverse transcriptase homolog3.1e-1725Show/hide
Query:  RRSLGEQVSLRLDRFIANASFCDLFEKVQK--------ESIRRRRNLIDEVEDINGNWLSEKCSIHDAFVIFLKEMFSSSKPDQGDIDRVLS--YVPRKV
        R  L E  + R+ + I N S    FEK+ K           +R ++LI  + + N    ++   I      + K+++S    +  +ID+ L   ++PR +
Subjt:  RRSLGEQVSLRLDRFIANASFCDLFEKVQK--------ESIRRRRNLIDEVEDINGNWLSEKCSIHDAFVIFLKEMFSSSKPDQGDIDRVLS--YVPRKV

Query:  SDEMNQMLIASFTSEEITTAIKSFQPTKAAGPNGFPAVFYQKYWD--IARLVS----------------------------------DYHPISLCNVSYK
        S +  +ML    +S EI + I++    K+ GP+GF + FYQ + +  +  L++                                  +Y PISL N+  K
Subjt:  SDEMNQMLIASFTSEEITTAIKSFQPTKAAGPNGFPAVFYQKYWD--IARLVS----------------------------------DYHPISLCNVSYK

Query:  IVTKVLTNRLKIFLNEIIDECQSTFIPGRSITDTMVLGHETLHYLQNKRKGKFGYASLKIDMSKAYDRVEWPYLCQIMEKLGFHEKWISLVMKCVTTPTF
        I+ K+LTNR++  + +II   Q  FIPG      +      + ++ NK K K  +  L ID  KA+D ++ P++ + ++K+G    ++ L+    + PT 
Subjt:  IVTKVLTNRLKIFLNEIIDECQSTFIPGRSITDTMVLGHETLHYLQNKRKGKFGYASLKIDMSKAYDRVEWPYLCQIMEKLGFHEKWISLVMKCVTTPTF

Query:  SIRMNNESFGFIKPFRGIRQ
        +I +N           G RQ
Subjt:  SIRMNNESFGFIKPFRGIRQ

P11369 LINE-1 retrotransposable element ORF2 protein3.3e-1923.7Show/hide
Query:  FCDLFEKVQKESIR-----RRRNLIDEVEDINGNWLSEKCSIHDAFVIFLKEMFSSSKPDQGDIDRVLS-YVPRKVSDEMNQMLIASFTSEEITTAIKSF
        F +   K+ K   R     R + LI+++ +  G+  ++   I +    F K ++S+   +  ++D+ L  Y   K++ +    L +  + +EI   I S 
Subjt:  FCDLFEKVQKESIR-----RRRNLIDEVEDINGNWLSEKCSIHDAFVIFLKEMFSSSKPDQGDIDRVLS-YVPRKVSDEMNQMLIASFTSEEITTAIKSF

Query:  QPTKAAGPNGFPAVFYQKYWD-----IARL-------------------------------VSDYHPISLCNVSYKIVTKVLTNRLKIFLNEIIDECQST
           K+ GP+GF A FYQ + +     + +L                               + ++ PISL N+  KI+ K+L NR++  +  II   Q  
Subjt:  QPTKAAGPNGFPAVFYQKYWD-----IARL-------------------------------VSDYHPISLCNVSYKIVTKVLTNRLKIFLNEIIDECQST

Query:  FIPGRSITDTMVLGHETLHYLQNKRKGKFGYASLKIDMSKAYDRVEWPYLCQIMEKLGFHEKWISLVMKCVTTPTFSIRMNNESFGFIKPFRGIRQD-DT
        FIPG      +      +HY+ NK K K  +  + +D  KA+D+++ P++ +++E+ G    +++++    + P  +I++N E    I    G RQ    
Subjt:  FIPGRSITDTMVLGHETLHYLQNKRKGKFGYASLKIDMSKAYDRVEWPYLCQIMEKLGFHEKWISLVMKCVTTPTFSIRMNNESFGFIKPFRGIRQD-DT

Query:  RQYLCNVL
          YL N++
Subjt:  RQYLCNVL

P14381 Transposon TX1 uncharacterized 149 kDa protein7.7e-1624.21Show/hide
Query:  FEKVQKESIRRRRNLIDEVEDINGNWLSEKCSIHDAFVIFLKEMFSSSKPDQGDIDRVLSYVPRKVSDEMNQMLIASFTSEEITTAIKSFQPTKAAGPNG
        F  ++K+   R++      ED  G  L +  +I D    F + +FS         + +   +P  VS+   + L    T +E++ A++     K+ G +G
Subjt:  FEKVQKESIRRRRNLIDEVEDINGNWLSEKCSIHDAFVIFLKEMFSSSKPDQGDIDRVLSYVPRKVSDEMNQMLIASFTSEEITTAIKSFQPTKAAGPNG

Query:  FPAVFYQKYWDI-----------------------------------ARLVSDYHPISLCNVSYKIVTKVLTNRLKIFLNEIIDECQSTFIPGRSITDTM
            F+Q +WD                                     RL+ ++ P+SL +  YKIV K ++ RLK  L E+I   QS  +PGR+I D +
Subjt:  FPAVFYQKYWDI-----------------------------------ARLVSDYHPISLCNVSYKIVTKVLTNRLKIFLNEIIDECQSTFIPGRSITDTM

Query:  VLGHETLHYLQNKRKGKFGYASLKIDMSKAYDRVEWPYLCQIMEKLGFHEKWISLVMKCVTTPTFSIRMNNESFGFIKPFRGIRQ
         L  + LH+    R+     A L +D  KA+DRV+  YL   ++   F  +++  +     +    +++N      +   RG+RQ
Subjt:  VLGHETLHYLQNKRKGKFGYASLKIDMSKAYDRVEWPYLCQIMEKLGFHEKWISLVMKCVTTPTFSIRMNNESFGFIKPFRGIRQ

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases3.1e-1240.96Show/hide
Query:  LTNRLKIFLNEIIDECQSTFIPGRSITDTMVLGHETLHYLQNKRKGKFGYASLKIDMSKAYDRVEWPYLCQIMEKLGFHEKWI
        +  RLK  +  +I   Q++FIPGR  TD +V   E +H ++ K KG  G+  LK+D+ KAYDR+ W YL   +   GF E W+
Subjt:  LTNRLKIFLNEIIDECQSTFIPGRSITDTMVLGHETLHYLQNKRKGKFGYASLKIDMSKAYDRVEWPYLCQIMEKLGFHEKWI

AT4G29090.1 Ribonuclease H-like superfamily protein4.6e-0830.38Show/hide
Query:  MITRVECISSGGKIFASQKRYFPTSSVLNGTISSSSSFFWKGFVWGMDLLKGGLRKNLGNGQSIYKFRDPWI---PRLSTFKVVTPP---IPSMKDAL-V
        M++R E + +  K+F S  RYF  S  LN  + S  SF WK      ++L+ G R  +GNG+ I  +R  W+   P  +  ++   P     S+   L V
Subjt:  MITRVECISSGGKIFASQKRYFPTSSVLNGTISSSSSFFWKGFVWGMDLLKGGLRKNLGNGQSIYKFRDPWI---PRLSTFKVVTPP---IPSMKDAL-V

Query:  ADFITPS-IQWDVVKLNQLFIGFDVGAIQRL-PISSSAPDKWMWHFDRKGIYSVKSSY
        +D I  S  +W    +  LF   +   I  L P      D + W +   G Y+VKS Y
Subjt:  ADFITPS-IQWDVVKLNQLFIGFDVGAIQRL-PISSSAPDKWMWHFDRKGIYSVKSSY

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein6.9e-0432.73Show/hide
Query:  QKRYFPTSSVLNGTISSSSSFFWKGFVWGMDLLKGGLRKNLGNGQSIYKFRDPWI
        + RYFP SS++  ++ +  S+ W+  + G +LL  GL + +G+G     + D WI
Subjt:  QKRYFPTSSVLNGTISSSSSFFWKGFVWGMDLLKGGLRKNLGNGQSIYKFRDPWI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTCTCAATCCAATTTTGGTTAGGGAAGATAACAATATGCCTCAATCAGATTTTCCCCAGAGCCAAAGACTTGCGATTGTGGTCTTGCCAGCGTTTAATCCGGTCCC
AACTGTCAATGAACTTCCTCGCCCTCATTACAATGGCATTCGGATTAGCGAACCAAGCGATGCACAGGTGCAGCAATTTCAACCTCCATTCTCGCCGGCATATTACTCTC
CCTCTGGTGCTCCGACGAGATTATCAAAAGGAAAAGAGAAGATCGTTGACTTGATGGCCAAACCCAAAGGGGCGCGTCAATGCTCTGTTCCACTGTGGCGGCCAATTTCT
CCGGCAATCTCAAACCAGCAATCTCCTCTGGCGAAGTCCCAGGTGGAGCAGCGGCCGGAGGGGGCGTCAAGCAATTCAATGGATGTTGGGCCAGAGTTTCCAAGCGATAT
GGGTTTAATCTATAAGCTACCATCTGGACCTCGTGAATCGAGCAAGCAGCATTCTGGGCCCACGAATTCAGATGGGATGGGTTGGACTGGTTTGAATGCTGAGAAATTGA
AGGAAACAGTGAATGTAGAGACTGCTTTTCGGCCCAACAAGAATGTGAATGTTGGAGCGCACGAAGGACACAATGGTGTGACTGATCCTATGATTTTTAATGCTAAATGT
AACAAAAATGATCATTCCAAAATACTTGGTAACATGTGGAAAAAAAGTGCCTGTGCAGGAATGGTACCTAGTGGCATAAATTTGAAGATTTTGGAGGAATTTCATAAGCG
AAAAGATGGGCCAATTTTGTTCTCTCCTGGAAACATTAAACGCCCAAAGACTGATGGTGATGATAGTGAGAAGGCGGGTGGTGGTCTATGTTTGCTATGGAAGAAAGAAA
TTGATGTTCGCATTCAAAATTATTCCATTCACCATATACATGCAAGTATTGTGTGCAATGACAAACAATGGAGTTTCACAGGATTTTATGGCAATCCAGATCAAACCCTT
CGATCTCATTCTTTGAATTTGATTCGGAGATTGCATGACCATAATAATTCGGCATGGGTTATTGGAGGAGACATGAACGAAATCCTGTGGCAAAATGAAAAACTAGGAGG
GCCAGATCGGGAAAATAGGAAAATTATGGCATTTAGGGAGGTCCTAGATGATTTCAATCTCCGAGATCTTGGCTCTTCTGGGGGTCTATTCACTTGGTGTAACAGGAGAA
GTTTGGGTGAGCAGGTTAGTTTGAGGTTGGACCGATTCATTGCAAATGCTAGTTTCTGTGATTTATTTGAGAAAGTTCAGAAGGAGTCTATAAGGAGAAGGCGAAATTTG
ATAGATGAGGTTGAAGATATTAATGGGAACTGGTTGTCAGAAAAGTGTAGCATTCATGATGCGTTCGTGATCTTTTTGAAGGAGATGTTCTCTTCTTCCAAACCAGACCA
AGGTGATATAGATAGGGTGCTTAGCTATGTTCCGAGGAAGGTCTCTGATGAGATGAATCAGATGTTAATAGCCTCATTTACTAGTGAGGAGATTACGACAGCAATCAAAA
GTTTCCAACCAACAAAGGCGGCAGGACCGAATGGTTTCCCTGCAGTATTCTATCAGAAATACTGGGATATAGCGAGGTTAGTATCTGATTATCACCCAATCAGTCTATGC
AATGTTTCGTATAAGATTGTTACAAAGGTTTTGACTAATAGACTCAAGATTTTTTTGAATGAGATTATTGATGAGTGTCAGTCGACTTTTATTCCTGGCAGATCAATAAC
TGATACTATGGTTTTGGGTCATGAAACGTTGCATTATCTTCAAAATAAGCGCAAGGGAAAATTTGGGTATGCATCTTTAAAAATTGACATGAGCAAAGCTTATGATAGAG
TAGAGTGGCCTTACTTGTGTCAAATTATGGAAAAACTAGGCTTCCATGAGAAGTGGATTTCATTAGTGATGAAGTGTGTTACAACCCCAACATTTTCCATCCGGATGAAT
AATGAGTCCTTTGGTTTTATAAAACCATTTCGTGGGATTAGGCAAGATGATACAAGGCAGTATCTATGTAATGTTCTGTCCATGAGGGCAACCGATTCATTGGGACCTTA
TCTTGGCTTACCAGAGACTTTTCACAGAGGCAAAACTCGAGACTTCAAATTCCTTTTAGACGGAGTATGTTGTGTGCAAAATTTTGGTGGGGCTCCAATGATAACAAGAG
TAGAATGCATTAGCAGCGGAGGGAAAATCTTTGCAAGCCAAAAGAGATATTTTCCTACGTCATCTGTTCTTAATGGGACAATCTCGTCCTCTTCTTCCTTTTTTTGGAAA
GGATTTGTTTGGGGGATGGATCTTCTTAAGGGTGGTTTGAGGAAAAATCTTGGAAACGGTCAGTCAATTTATAAATTCCGTGACCCTTGGATTCCTCGACTTTCTACCTT
TAAGGTGGTAACCCCTCCAATTCCATCGATGAAGGATGCTCTAGTTGCGGACTTTATTACACCGTCTATACAATGGGATGTGGTTAAATTAAACCAATTATTCATTGGTT
TTGATGTGGGAGCAATTCAACGCTTACCTATAAGCAGTTCAGCTCCGGATAAATGGATGTGGCATTTTGATAGAAAAGGGATATACTCTGTTAAAAGTAGTTATAAGTTT
TCTATGATGAAGAATCAAGAGACCTCGTTATCAAGACGAGATGGAAACTACAGATCATGCCCTTTTTCAATGTATGAGATCTCGGGAGAGTTCTTGAAGGCTAATCCGAA
AAGCGTGTCTACTACTCAAACGATGGAGGACGTTGGGAATATAATTTCTGATGGAGAAGATTTTATTATGTCAGATGCATTTGTCATGGGAGGGCAAAATACATGCGGTA
TTGGGATTGTGCTGCATGATAAACAAGGGAATTTAAAGAAGGTTCAGAATACATCTTCCCAGGCGGCTACCTCTCCTTTTGAAGCGGAAGCATTTGCGGTGCTTGAAGGG
ATGTGTTTGGCTACATTGTTAAATGTAAAGCGACTGACTGTCTTGTCTGATTCATTGACTGTGATAATTCAATCAACGAGAAGATACAGGTGGAGGCCTCCATTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGATTCTCAATCCAATTTTGGTTAGGGAAGATAACAATATGCCTCAATCAGATTTTCCCCAGAGCCAAAGACTTGCGATTGTGGTCTTGCCAGCGTTTAATCCGGTCCC
AACTGTCAATGAACTTCCTCGCCCTCATTACAATGGCATTCGGATTAGCGAACCAAGCGATGCACAGGTGCAGCAATTTCAACCTCCATTCTCGCCGGCATATTACTCTC
CCTCTGGTGCTCCGACGAGATTATCAAAAGGAAAAGAGAAGATCGTTGACTTGATGGCCAAACCCAAAGGGGCGCGTCAATGCTCTGTTCCACTGTGGCGGCCAATTTCT
CCGGCAATCTCAAACCAGCAATCTCCTCTGGCGAAGTCCCAGGTGGAGCAGCGGCCGGAGGGGGCGTCAAGCAATTCAATGGATGTTGGGCCAGAGTTTCCAAGCGATAT
GGGTTTAATCTATAAGCTACCATCTGGACCTCGTGAATCGAGCAAGCAGCATTCTGGGCCCACGAATTCAGATGGGATGGGTTGGACTGGTTTGAATGCTGAGAAATTGA
AGGAAACAGTGAATGTAGAGACTGCTTTTCGGCCCAACAAGAATGTGAATGTTGGAGCGCACGAAGGACACAATGGTGTGACTGATCCTATGATTTTTAATGCTAAATGT
AACAAAAATGATCATTCCAAAATACTTGGTAACATGTGGAAAAAAAGTGCCTGTGCAGGAATGGTACCTAGTGGCATAAATTTGAAGATTTTGGAGGAATTTCATAAGCG
AAAAGATGGGCCAATTTTGTTCTCTCCTGGAAACATTAAACGCCCAAAGACTGATGGTGATGATAGTGAGAAGGCGGGTGGTGGTCTATGTTTGCTATGGAAGAAAGAAA
TTGATGTTCGCATTCAAAATTATTCCATTCACCATATACATGCAAGTATTGTGTGCAATGACAAACAATGGAGTTTCACAGGATTTTATGGCAATCCAGATCAAACCCTT
CGATCTCATTCTTTGAATTTGATTCGGAGATTGCATGACCATAATAATTCGGCATGGGTTATTGGAGGAGACATGAACGAAATCCTGTGGCAAAATGAAAAACTAGGAGG
GCCAGATCGGGAAAATAGGAAAATTATGGCATTTAGGGAGGTCCTAGATGATTTCAATCTCCGAGATCTTGGCTCTTCTGGGGGTCTATTCACTTGGTGTAACAGGAGAA
GTTTGGGTGAGCAGGTTAGTTTGAGGTTGGACCGATTCATTGCAAATGCTAGTTTCTGTGATTTATTTGAGAAAGTTCAGAAGGAGTCTATAAGGAGAAGGCGAAATTTG
ATAGATGAGGTTGAAGATATTAATGGGAACTGGTTGTCAGAAAAGTGTAGCATTCATGATGCGTTCGTGATCTTTTTGAAGGAGATGTTCTCTTCTTCCAAACCAGACCA
AGGTGATATAGATAGGGTGCTTAGCTATGTTCCGAGGAAGGTCTCTGATGAGATGAATCAGATGTTAATAGCCTCATTTACTAGTGAGGAGATTACGACAGCAATCAAAA
GTTTCCAACCAACAAAGGCGGCAGGACCGAATGGTTTCCCTGCAGTATTCTATCAGAAATACTGGGATATAGCGAGGTTAGTATCTGATTATCACCCAATCAGTCTATGC
AATGTTTCGTATAAGATTGTTACAAAGGTTTTGACTAATAGACTCAAGATTTTTTTGAATGAGATTATTGATGAGTGTCAGTCGACTTTTATTCCTGGCAGATCAATAAC
TGATACTATGGTTTTGGGTCATGAAACGTTGCATTATCTTCAAAATAAGCGCAAGGGAAAATTTGGGTATGCATCTTTAAAAATTGACATGAGCAAAGCTTATGATAGAG
TAGAGTGGCCTTACTTGTGTCAAATTATGGAAAAACTAGGCTTCCATGAGAAGTGGATTTCATTAGTGATGAAGTGTGTTACAACCCCAACATTTTCCATCCGGATGAAT
AATGAGTCCTTTGGTTTTATAAAACCATTTCGTGGGATTAGGCAAGATGATACAAGGCAGTATCTATGTAATGTTCTGTCCATGAGGGCAACCGATTCATTGGGACCTTA
TCTTGGCTTACCAGAGACTTTTCACAGAGGCAAAACTCGAGACTTCAAATTCCTTTTAGACGGAGTATGTTGTGTGCAAAATTTTGGTGGGGCTCCAATGATAACAAGAG
TAGAATGCATTAGCAGCGGAGGGAAAATCTTTGCAAGCCAAAAGAGATATTTTCCTACGTCATCTGTTCTTAATGGGACAATCTCGTCCTCTTCTTCCTTTTTTTGGAAA
GGATTTGTTTGGGGGATGGATCTTCTTAAGGGTGGTTTGAGGAAAAATCTTGGAAACGGTCAGTCAATTTATAAATTCCGTGACCCTTGGATTCCTCGACTTTCTACCTT
TAAGGTGGTAACCCCTCCAATTCCATCGATGAAGGATGCTCTAGTTGCGGACTTTATTACACCGTCTATACAATGGGATGTGGTTAAATTAAACCAATTATTCATTGGTT
TTGATGTGGGAGCAATTCAACGCTTACCTATAAGCAGTTCAGCTCCGGATAAATGGATGTGGCATTTTGATAGAAAAGGGATATACTCTGTTAAAAGTAGTTATAAGTTT
TCTATGATGAAGAATCAAGAGACCTCGTTATCAAGACGAGATGGAAACTACAGATCATGCCCTTTTTCAATGTATGAGATCTCGGGAGAGTTCTTGAAGGCTAATCCGAA
AAGCGTGTCTACTACTCAAACGATGGAGGACGTTGGGAATATAATTTCTGATGGAGAAGATTTTATTATGTCAGATGCATTTGTCATGGGAGGGCAAAATACATGCGGTA
TTGGGATTGTGCTGCATGATAAACAAGGGAATTTAAAGAAGGTTCAGAATACATCTTCCCAGGCGGCTACCTCTCCTTTTGAAGCGGAAGCATTTGCGGTGCTTGAAGGG
ATGTGTTTGGCTACATTGTTAAATGTAAAGCGACTGACTGTCTTGTCTGATTCATTGACTGTGATAATTCAATCAACGAGAAGATACAGGTGGAGGCCTCCATTGTGA
Protein sequenceShow/hide protein sequence
MILNPILVREDNNMPQSDFPQSQRLAIVVLPAFNPVPTVNELPRPHYNGIRISEPSDAQVQQFQPPFSPAYYSPSGAPTRLSKGKEKIVDLMAKPKGARQCSVPLWRPIS
PAISNQQSPLAKSQVEQRPEGASSNSMDVGPEFPSDMGLIYKLPSGPRESSKQHSGPTNSDGMGWTGLNAEKLKETVNVETAFRPNKNVNVGAHEGHNGVTDPMIFNAKC
NKNDHSKILGNMWKKSACAGMVPSGINLKILEEFHKRKDGPILFSPGNIKRPKTDGDDSEKAGGGLCLLWKKEIDVRIQNYSIHHIHASIVCNDKQWSFTGFYGNPDQTL
RSHSLNLIRRLHDHNNSAWVIGGDMNEILWQNEKLGGPDRENRKIMAFREVLDDFNLRDLGSSGGLFTWCNRRSLGEQVSLRLDRFIANASFCDLFEKVQKESIRRRRNL
IDEVEDINGNWLSEKCSIHDAFVIFLKEMFSSSKPDQGDIDRVLSYVPRKVSDEMNQMLIASFTSEEITTAIKSFQPTKAAGPNGFPAVFYQKYWDIARLVSDYHPISLC
NVSYKIVTKVLTNRLKIFLNEIIDECQSTFIPGRSITDTMVLGHETLHYLQNKRKGKFGYASLKIDMSKAYDRVEWPYLCQIMEKLGFHEKWISLVMKCVTTPTFSIRMN
NESFGFIKPFRGIRQDDTRQYLCNVLSMRATDSLGPYLGLPETFHRGKTRDFKFLLDGVCCVQNFGGAPMITRVECISSGGKIFASQKRYFPTSSVLNGTISSSSSFFWK
GFVWGMDLLKGGLRKNLGNGQSIYKFRDPWIPRLSTFKVVTPPIPSMKDALVADFITPSIQWDVVKLNQLFIGFDVGAIQRLPISSSAPDKWMWHFDRKGIYSVKSSYKF
SMMKNQETSLSRRDGNYRSCPFSMYEISGEFLKANPKSVSTTQTMEDVGNIISDGEDFIMSDAFVMGGQNTCGIGIVLHDKQGNLKKVQNTSSQAATSPFEAEAFAVLEG
MCLATLLNVKRLTVLSDSLTVIIQSTRRYRWRPPL