; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG04G012890 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG04G012890
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationCG_Chr04:27740679..27745160
RNA-Seq ExpressionClCG04G012890
SyntenyClCG04G012890
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN68838.1 hypothetical protein VITISV_030956 [Vitis vinifera]1.4e-13630.35Show/hide
Query:  KHSMKIVSWNTRGLGDQSKQLAIKHLILKTNPELVLIQETKKEAIEIEAIKKLRSSRDIGWTFVEAYGKSGGMLTMWDESKILVIETLKGGYTLSVKCKT
        K  MKI+SWNTRGLG + K+  +K  +    P++V+ QETKKE  +   +  + ++R+  W  + A G SGG+L +WD  K+   E + G +++S+K   
Subjt:  KHSMKIVSWNTRGLGDQSKQLAIKHLILKTNPELVLIQETKKEAIEIEAIKKLRSSRDIGWTFVEAYGKSGGMLTMWDESKILVIETLKGGYTLSVKCKT

Query:  ICGKICWVTNVYGPTDYKERKRIWLELESLAAYCTEAWCLGGDFNITRAIHKRFSPGR---RRKNIIKIIAG---------------------------Q
           +  W++ VYGP +   RK +W+EL  +A   +  WC+GGDFN+ R   ++    R     K+    I+                             
Subjt:  ICGKICWVTNVYGPTDYKERKRIWLELESLAAYCTEAWCLGGDFNITRAIHKRFSPGR---RRKNIIKIIAG---------------------------Q

Query:  MFLVTNEWDEAF----EGTRLVVNSQCCNIINRT--------------------------------LDAGNHQG----------------WN------LR
         FL +NEW++ F    +G      S    I+  T                                      +G                WN      L 
Subjt:  MFLVTNEWDEAF----EGTRLVVNSQCCNIINRT--------------------------------LDAGNHQG----------------WN------LR

Query:  RQEVEIVENLEKDEQWAETDDPSSPLQDLRASLKAYLLDIYKKEERDLIRKRTLNWLHLGDENTKFFHRFLAAKKRRNLITELINNQGLPTNSFRDIEDL
        +++ +I+  L   +   +    S  L   RA  K  L ++  +EE    +K  + W+  GD N+KFFH+    ++ R  I EL N  G   N+   I++ 
Subjt:  RQEVEIVENLEKDEQWAETDDPSSPLQDLRASLKAYLLDIYKKEERDLIRKRTLNWLHLGDENTKFFHRFLAAKKRRNLITELINNQGLPTNSFRDIEDL

Query:  ILGFYKNLYTKIPSAGSFPA-NLEWQKVPDEQNRRLSSRFNRDEIKRDLRGMRHNKAP------------------NEFVALFNEFHENGKLNSCVKENF
        IL +++ LYT  PS  S+    L+W  +  E   RL S F  +EI + +  M  +KAP                   + V +F EFH +G +N     +F
Subjt:  ILGFYKNLYTKIPSAGSFPA-NLEWQKVPDEQNRRLSSRFNRDEIKRDLRGMRHNKAP------------------NEFVALFNEFHENGKLNSCVKENF

Query:  ICLIKKKEDAVMVKDFRPISLTTLIYKVIAK---------------PPPKVHIEGRQILDPILIANEMVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEK
        I L+ KK  +  + DFRPISL T +YK+IAK                     ++GRQILD +LIANE+V+E R   ++G + K+D EKA+D V W FL+ 
Subjt:  ICLIKKKEDAVMVKDFRPISLTTLIYKVIAK---------------PPPKVHIEGRQILDPILIANEMVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEK

Query:  VLQGKNFNSKWISWILGCIKNPKFSIFINGRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFADDTLLFCKY
        V++ K F  +W  W+ GC+ +  F++ +NG  +  ++ASRG+RQGDPLSPFLF ++  +LS ++ +   +   EGF VG+ +  V  LQFADDT+ F   
Subjt:  VLQGKNFNSKWISWILGCIKNPKFSIFINGRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFADDTLLFCKY

Query:  DIEMLEALRKTIELFEWCSGQKVNWEKSALCGLNIDDLEVSSTAARLNCKAEKLPLMYLNLPLGGYPKKTSFWQP-------------------------
          E +  L+  + +F   SG KVN +KS + G+N++   +S  A  L+CKA   P++YL LPLGG PK + FW P                         
Subjt:  DIEMLEALRKTIELFEWCSGQKVNWEKSALCGLNIDDLEVSSTAARLNCKAEKLPLMYLNLPLGGYPKKTSFWQP-------------------------

Query:  -----------SLTKFKVVLS----LERSMRNFFWEGHGGSKLNHLAHWDSVTRNQKDGGLGLENMRIRNLALLSKWGWRFMHESDSLWCQ-------RH
                    L+ FK+  S    +ER  R+F W G G  K +HL +WD V + +  GGLG   + IRN+ALL KW WR+  E  +LW Q        H
Subjt:  -----------SLTKFKVVLS----LERSMRNFFWEGHGGSKLNHLAHWDSVTRNQKDGGLGLENMRIRNLALLSKWGWRFMHESDSLWCQ-------RH

Query:  TNRKEVASL-----RSLWISISRQWQKIEALAVFKVGDGRKTAFWFDPSLENQPLKDRFPRLFKIAAKPNGSVAGHWDHNS--SWDLVFKRQLKDEDLGE
        +N  +V ++     R  W +I+  +Q+      F VG+G +  FW D     QPL  ++PRL ++    N  ++         SW+  F+R L D ++ +
Subjt:  TNRKEVASL-----RSLWISISRQWQKIEALAVFKVGDGRKTAFWFDPSLENQPLKDRFPRLFKIAAKPNGSVAGHWDHNS--SWDLVFKRQLKDEDLGE

Query:  FMSLSSSVANKRVLLQ-PDKRVWTFDGN
           L  S     +    PDKR W+   +
Subjt:  FMSLSSSVANKRVLLQ-PDKRVWTFDGN

RVW16209.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]3.3e-13531.67Show/hide
Query:  MKIVSWNTRGLGDQSKQLAIKHLILKTNPELVLIQETKKEAIEIEAIKKLRSSRDIGWTFVEAYGKSGGMLTMWDESKILVIETLKGGYTLSVKCKTI-C
        MKI+SWN RGLG ++K+  IK  +   NP++V+IQETKKE  +   +  + + R+  W  + A G SGG+L +WD   +   E + G +++SVK     C
Subjt:  MKIVSWNTRGLGDQSKQLAIKHLILKTNPELVLIQETKKEAIEIEAIKKLRSSRDIGWTFVEAYGKSGGMLTMWDESKILVIETLKGGYTLSVKCKTI-C

Query:  GKICWVTNVYGPTDYKERKRIWLELESLAAYCTEAWCLGGDFNITRAIHKR-----FSPGRR---------------------------RKNIIKIIAGQ
        G + W++ VYGP     RK  W+EL  +       WC+GGDFN+ R   ++      +P  R                              + K +   
Subjt:  GKICWVTNVYGPTDYKERKRIWLELESLAAYCTEAWCLGGDFNITRAIHKR-----FSPGRR---------------------------RKNIIKIIAGQ

Query:  MFLVTNEWDEAF-EGTRLVVNSQCCNIINRTLDAGNHQGW--------NLRRQEVEIVENLEKDEQWA-------ETDDPSSPLQDLRASLKAY------
         FL +NEW   F +G +  +  +  +     +D  N   W        N+  Q     EN    + W+       E       LQ ++A LK +      
Subjt:  MFLVTNEWDEAF-EGTRLVVNSQCCNIINRTLDAGNHQGW--------NLRRQEVEIVENLEKDEQWA-------ETDDPSSPLQDLRASLKAY------

Query:  -LLDIYKKEERDLI--------RKRTLNWLHLGDENTKFFHRFLAAKKRRNLITELINNQGLPTNSFRDIEDLILGFYKNLYTKIPSAGSFPANLEWQKV
         L +  K+E  +LI        +K  + W+  GD N+KF+H+    ++ R  I EL N +GL   +   I + IL +++ LYT           L+W  +
Subjt:  -LLDIYKKEERDLI--------RKRTLNWLHLGDENTKFFHRFLAAKKRRNLITELINNQGLPTNSFRDIEDLILGFYKNLYTKIPSAGSFPANLEWQKV

Query:  PDEQNRRLSSRFNRDEIKRDLRGMRHNKAP------------------NEFVALFNEFHENGKLNSCVKENFICLIKKKEDAVMVKDFRPISLTTLIYKV
         +E   RL S F  +EI + +  +  +KAP                   + V +F EFH +G +N     +FI LI KK  +  + DFRPISL T +YK+
Subjt:  PDEQNRRLSSRFNRDEIKRDLRGMRHNKAP------------------NEFVALFNEFHENGKLNSCVKENFICLIKKKEDAVMVKDFRPISLTTLIYKV

Query:  IAKP---------PPKVH------IEGRQILDPILIANEMVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFNSKWISWILGCIKNPKFSIFI
        IAK             +H      ++GRQILD +LIANE+V+E R   ++G + K+D EKA+D V W FL+ +L+ K F+ +W  W+ GC+ +  F+I +
Subjt:  IAKP---------PPKVH------IEGRQILDPILIANEMVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFNSKWISWILGCIKNPKFSIFI

Query:  NGRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFADDTLLFCKYDIEMLEALRKTIELFEWCSGQKVNWEKS
        NG  +  ++ASRG+RQGDPLSPFLF L+  +LS ++ R   +   EGF VG+ +  V  LQFADD + F     E L+ L+  + +F    G KVN  KS
Subjt:  NGRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFADDTLLFCKYDIEMLEALRKTIELFEWCSGQKVNWEKS

Query:  ALCGLNIDDLEVSSTAARLNCKAEKLPLMYLNLPLGGYPKKTSFWQP------------------------------------SLTKFK----VVLSLER
        ++ G+N+D   +S  A  L+CKA   P++YL LPLGG PK   FW P                                     L+ FK    V   +ER
Subjt:  ALCGLNIDDLEVSSTAARLNCKAEKLPLMYLNLPLGGYPKKTSFWQP------------------------------------SLTKFK----VVLSLER

Query:  SMRNFFWEGHGGSKLNHLAHWDSVTRNQKDGGLGLENMRIRNLALLSKWGWRFMHESDSLWCQ-------RHTNRKEVASL-----RSLWISISRQWQKI
          R+F W G G  K +HL  WD V R +  GGLGL N+  RNLALL KW WR+  E  +LW Q        H+N  +  ++     R  W +I++ +Q+ 
Subjt:  SMRNFFWEGHGGSKLNHLAHWDSVTRNQKDGGLGLENMRIRNLALLSKWGWRFMHESDSLWCQ-------RHTNRKEVASL-----RSLWISISRQWQKI

Query:  EALAVFKVGDGRKTAFWFDPSLENQPLKDRFPRLFKIAAKPNGSVAGHWDHNSS--WDLVFKRQLKDEDLGEFMSLSSSVANKRVLLQ-PDKRVWTFDGN
          +  +  G+G +  FW D    +QPL  ++PRLF++    N S++     +    W+L F+R L D ++ +   L  S+ +  +    PD ++W    +
Subjt:  EALAVFKVGDGRKTAFWFDPSLENQPLKDRFPRLFKIAAKPNGSVAGHWDHNSS--WDLVFKRQLKDEDLGEFMSLSSSVANKRVLLQ-PDKRVWTFDGN

Query:  G
        G
Subjt:  G

RVW64408.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]8.5e-13930.67Show/hide
Query:  MKIVSWNTRGLGDQSKQLAIKHLILKTNPELVLIQETKKEAIEIEAIKKLRSSRDIGWTFVEAYGKSGGMLTMWDESKILVIETLKGGYTLSVKCKTICG
        MKI+SWNTRGLG + K+  ++  +   NP++V++QETK+E  +   +  +   + + W  + A G SGG++ +WD SK+   E + G ++++VK  +   
Subjt:  MKIVSWNTRGLGDQSKQLAIKHLILKTNPELVLIQETKKEAIEIEAIKKLRSSRDIGWTFVEAYGKSGGMLTMWDESKILVIETLKGGYTLSVKCKTICG

Query:  KICWVTNVYGPTDYKERKRIWLELESLAAYCTEAWCLGGDFNITRAIHKRFSPGRRRKN--------------------------------IIKIIAGQM
           W+T+VYGP +   RK  WLEL+ L       WC+GGDFN+ R I ++    R   N                                I K +    
Subjt:  KICWVTNVYGPTDYKERKRIWLELESLAAYCTEAWCLGGDFNITRAIHKRFSPGRRRKN--------------------------------IIKIIAGQM

Query:  FLVTNEWDEAFEGT------RLVVNSQCCNIINRTLDAG---------------------------NHQGW-------------------------NLRR
        FL ++EWD  F  +      R   +     +    L  G                             +GW                         +L+ 
Subjt:  FLVTNEWDEAFEGT------RLVVNSQCCNIINRTLDAG---------------------------NHQGW-------------------------NLRR

Query:  QEVEIVENLEKDEQWAETDDPSSPLQDLRASLKAYLLDIYKKEERDLIRKRTLNWLHLGDENTKFFHRFLAAKKRRNLITELINNQGLPTNSFRDIEDLI
        ++  I+ +L + +   +  + +S L   R   +  L D+  KEE    +K  + W+  GD N+KFFHR    ++ R  I  LI+ +G   N+  DI + I
Subjt:  QEVEIVENLEKDEQWAETDDPSSPLQDLRASLKAYLLDIYKKEERDLIRKRTLNWLHLGDENTKFFHRFLAAKKRRNLITELINNQGLPTNSFRDIEDLI

Query:  LGFYKNLYTKIPSAGSFPANLEWQKVPDEQNRRLSSRFNRDEIKRDLRGMRHNKAP------------------NEFVALFNEFHENGKLNSCVKENFIC
        + F+ NLY+K          ++W  +  E    L   F  +E++R +  +   KAP                   + + +F EFH NG +N      FI 
Subjt:  LGFYKNLYTKIPSAGSFPANLEWQKVPDEQNRRLSSRFNRDEIKRDLRGMRHNKAP------------------NEFVALFNEFHENGKLNSCVKENFIC

Query:  LIKKKEDAVMVKDFRPISLTTLIYKVIAK---------------PPPKVHIEGRQILDPILIANEMVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVL
        L+ KK  +V + D+RPISL T +YK+IAK                     +EGR ILD +LIANE+V+E R   ++G + K+D EKA+D VDWGFL+ VL
Subjt:  LIKKKEDAVMVKDFRPISLTTLIYKVIAK---------------PPPKVHIEGRQILDPILIANEMVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVL

Query:  QGKNFNSKWISWILGCIKNPKFSIFINGRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFADDTLLFCKYDI
        Q K F+ KW  WI GC+ +  F+I +NG  +  ++ASRG+RQGDPLSPFLF L+  +LS ++ R       EGF VG+++  V +LQFADDT+ F K  +
Subjt:  QGKNFNSKWISWILGCIKNPKFSIFINGRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFADDTLLFCKYDI

Query:  EMLEALRKTIELFEWCSGQKVNWEKSALCGLNIDDLEVSSTAARLNCKAEKLPLMYLNLPLGGYPKKTSFWQP---------------------------
        E L+ L+  + +F   SG K+N EKS + G+N     +SS A+  +C+  + PL YL LPLGG PK   FW P                           
Subjt:  EMLEALRKTIELFEWCSGQKVNWEKSALCGLNIDDLEVSSTAARLNCKAEKLPLMYLNLPLGGYPKKTSFWQP---------------------------

Query:  ---------SLTKFKVVLS----LERSMRNFFWEGHGGSKLNHLAHWDSVTRNQKDGGLGLENMRIRNLALLSKWGWRFMHESDSLWCQ-------RHTN
                  L+ FK+  S    +E+  RNF W G G  K +HL  W+ V+R ++ GGLG   + +RN+ALL KW WRF  E   LW +        H N
Subjt:  ---------SLTKFKVVLS----LERSMRNFFWEGHGGSKLNHLAHWDSVTRNQKDGGLGLENMRIRNLALLSKWGWRFMHESDSLWCQ-------RHTN

Query:  RKEVA-----SLRSLWISISRQWQKIEALAVFKVGDGRKTAFWFDPSLENQPLKDRFPRLFKIAAKPNGSVAGHWDHN--SSWDLVFKRQLKDEDLGEFM
          +       S R  W +I++ +Q+        VG+G +  FW D    NQ L  +F  L+++ +  N +V+    ++   +W+L F+R L D ++    
Subjt:  RKEVA-----SLRSLWISISRQWQKIEALAVFKVGDGRKTAFWFDPSLENQPLKDRFPRLFKIAAKPNGSVAGHWDHN--SSWDLVFKRQLKDEDLGEFM

Query:  SLSSSVANKRVLLQ-PDKRVWTFDGNG
         L SS+++ R      D R W+   +G
Subjt:  SLSSSVANKRVLLQ-PDKRVWTFDGNG

RVW65579.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]6.1e-13730.51Show/hide
Query:  MKIVSWNTRGLGDQSKQLAIKHLILKTNPELVLIQETKKEAIEIEAIKKLRSSRDIGWTFVEAYGKSGGMLTMWDESKILVIETLKGGYTLSVKCKTICG
        MKI+SWNTRGLG + K+  +K+ +    P++V+IQETKKE  +   +  + S R+  W  + A G SGG+L +WD  K+   E + G +++S+K      
Subjt:  MKIVSWNTRGLGDQSKQLAIKHLILKTNPELVLIQETKKEAIEIEAIKKLRSSRDIGWTFVEAYGKSGGMLTMWDESKILVIETLKGGYTLSVKCKTICG

Query:  KICWVTNVYGPTDYKERKRIWLELESLAAYCTEAWCLGGDFNITRAIHKRFSPGRR---RKNIIKIIAG---------------------------QMFL
        +  W++ VYGP +   RK  W+EL  +A      WC+GGDFN+ R   ++    R     K+  + I                               FL
Subjt:  KICWVTNVYGPTDYKERKRIWLELESLAAYCTEAWCLGGDFNITRAIHKRFSPGRR---RKNIIKIIAG---------------------------QMFL

Query:  VTNEWDEAFEGTRLVV---------------------------------NSQCCNIINRTLDAGNHQGW-------------------------NLRRQE
         +NEW++ F  +   V                                 +S       R        GW                          L +++
Subjt:  VTNEWDEAFEGTRLVV---------------------------------NSQCCNIINRTLDAGNHQGW-------------------------NLRRQE

Query:  VEIVENLEKDEQWAETDDPSSPLQDLRASLKAYLLDIYKKEERDLIRKRTLNWLHLGDENTKFFHRFLAAKKRRNLITELINNQGLPTNSFRDIEDLILG
         +I+  L   +   +    S  L   RA  K  L ++  +EE    +K  + W+  GD N+KFFH+    ++ R  I EL N  GL  N+   I++ IL 
Subjt:  VEIVENLEKDEQWAETDDPSSPLQDLRASLKAYLLDIYKKEERDLIRKRTLNWLHLGDENTKFFHRFLAAKKRRNLITELINNQGLPTNSFRDIEDLILG

Query:  FYKNLYTKIPSAGSFPA-NLEWQKVPDEQNRRLSSRFNRDEIKRDLRGMRHNKAP------------------NEFVALFNEFHENGKLNSCVKENFICL
        +++ LY   PS  S+    L+W  +  E   RL S F  +EI + +  M  +KAP                   + V +F EFH +G +N     +FI L
Subjt:  FYKNLYTKIPSAGSFPA-NLEWQKVPDEQNRRLSSRFNRDEIKRDLRGMRHNKAP------------------NEFVALFNEFHENGKLNSCVKENFICL

Query:  IKKKEDAVMVKDFRPISLTTLIYKVIAK---------------PPPKVHIEGRQILDPILIANEMVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQ
        + KK  +  + DFRPISL T +YK+IAK                     ++GRQILD +LIANE+V+E R   ++G + K+D EKA+D V W FL+ VL+
Subjt:  IKKKEDAVMVKDFRPISLTTLIYKVIAK---------------PPPKVHIEGRQILDPILIANEMVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQ

Query:  GKNFNSKWISWILGCIKNPKFSIFINGRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFADDTLLFCKYDIE
         K F+ +W  W+ GC+ +  +++ +NG  +  ++ASRG+RQGDPLSPFLF ++  +LS ++ +   +   EGF VG+ +  V  LQFADDT+ F     E
Subjt:  GKNFNSKWISWILGCIKNPKFSIFINGRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFADDTLLFCKYDIE

Query:  MLEALRKTIELFEWCSGQKVNWEKSALCGLNIDDLEVSSTAARLNCKAEKLPLMYLNLPLGGYPKKTSFWQPSLTKF-----------------------
         L  L+  + +F   SG KVN +KS + G+NI+   +S  A  L+CKA   P++YL LPLGG PK + FW P + +                        
Subjt:  MLEALRKTIELFEWCSGQKVNWEKSALCGLNIDDLEVSSTAARLNCKAEKLPLMYLNLPLGGYPKKTSFWQPSLTKF-----------------------

Query:  -----------------KVVLSLERSMRNFFWEGHGGSKLNHLAHWDSVTRNQKDGGLGLENMRIRNLALLSKWGWRFMHESDSLWCQ-------RHTNR
                          V   +ER  R F W G G  K +HL +WD V + +  GGLG   + +RN+ALL KW WR+  E  +LW Q        H+N 
Subjt:  -----------------KVVLSLERSMRNFFWEGHGGSKLNHLAHWDSVTRNQKDGGLGLENMRIRNLALLSKWGWRFMHESDSLWCQ-------RHTNR

Query:  KEVA-----SLRSLWISISRQWQKIEALAVFKVGDGRKTAFWFDPSLENQPLKDRFPRLFKIAAKPNGSVAG--HWDHNSSWDLVFKRQLKDEDLGEFMS
         +V      S R  W +I+  +Q+      F VGDG +  FW D    +QPL  ++PRL  +    N  ++    +    SW+  F+R L D ++ +  S
Subjt:  KEVA-----SLRSLWISISRQWQKIEALAVFKVGDGRKTAFWFDPSLENQPLKDRFPRLFKIAAKPNGSVAG--HWDHNSSWDLVFKRQLKDEDLGEFMS

Query:  LSSSVANKRVLLQ-PDKRVWTFDGNG
        L  S+    +    PDKR W+   +G
Subjt:  LSSSVANKRVLLQ-PDKRVWTFDGNG

RVW70235.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]1.1e-13529.82Show/hide
Query:  ESLFSISSEDLEEPEEEEIHLSINLDDSIGED---LHKLFQSDQSREGMECTQDHHLSKRQKHSMKIVSWNTRGLGDQSKQLAIKHLILKTNPELVLIQE
        ES  S+S+ +L  P +E    +++L     E+   LH++ +       +  T+ H         MKI+SWNTRGLG + K+  +K  +    P++V+ QE
Subjt:  ESLFSISSEDLEEPEEEEIHLSINLDDSIGED---LHKLFQSDQSREGMECTQDHHLSKRQKHSMKIVSWNTRGLGDQSKQLAIKHLILKTNPELVLIQE

Query:  TKKEAIEIEAIKKLRSSRDIGWTFVEAYGKSGGMLTMWDESKILVIETLKGGYTLSVKCKTICGKICWVTNVYGPTDYKERKRIWLELESLAAYCTEAWC
        TKKE  +   +  + ++R+  W  + A G SGG+L +WD  K+   E + G +++S+K      +  W++ VYGP +   RK +W+EL  +A   +  WC
Subjt:  TKKEAIEIEAIKKLRSSRDIGWTFVEAYGKSGGMLTMWDESKILVIETLKGGYTLSVKCKTICGKICWVTNVYGPTDYKERKRIWLELESLAAYCTEAWC

Query:  LGGDFNITRAIHKRFSPGR---RRKNIIKIIAG---------------------------QMFLVTNEWDEAF----EGTRLVVNSQCCNIINRT-----
        +GGDFN+ R   ++    R     K+    I+                              FL +NEW++ F    +G      S    I+  T     
Subjt:  LGGDFNITRAIHKRFSPGR---RRKNIIKIIAG---------------------------QMFLVTNEWDEAF----EGTRLVVNSQCCNIINRT-----

Query:  ---------------------------LDAGNHQG----------------WN------LRRQEVEIVENLEKDEQWAETDDPSSPLQDLRASLKAYLLD
                                         +G                WN      L +++ +I+  L   +   +    S  L   RA  K  L +
Subjt:  ---------------------------LDAGNHQG----------------WN------LRRQEVEIVENLEKDEQWAETDDPSSPLQDLRASLKAYLLD

Query:  IYKKEERDLIRKRTLNWLHLGDENTKFFHRFLAAKKRRNLITELINNQGLPTNSFRDIEDLILGFYKNLYTKIPSAGSFPA-NLEWQKVPDEQNRRLSSR
        +  +EE    +K  + W+  GD N+KFFH+    ++ R  I EL N  G   N+   I++ IL +++ LYT  PS  S+    L+W  +  E   RL S 
Subjt:  IYKKEERDLIRKRTLNWLHLGDENTKFFHRFLAAKKRRNLITELINNQGLPTNSFRDIEDLILGFYKNLYTKIPSAGSFPA-NLEWQKVPDEQNRRLSSR

Query:  FNRDEIKRDLRGMRHNKAP------------------NEFVALFNEFHENGKLNSCVKENFICLIKKKEDAVMVKDFRPISLTTLIYKVIAK--------
        F  +EI + +  M  +KAP                   + V +F EFH +G +N     +FI L+ KK  +  + DFRPISL T +YK+IAK        
Subjt:  FNRDEIKRDLRGMRHNKAP------------------NEFVALFNEFHENGKLNSCVKENFICLIKKKEDAVMVKDFRPISLTTLIYKVIAK--------

Query:  -------PPPKVHIEGRQILDPILIANEMVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFNSKWISWILGCIKNPKFSIFINGRPRRRIQAS
                     ++GRQILD +LIANE+V+E R   ++G + K+D EKA+D V W FL+ V++ K F  +W  W+ GC+ +  F++ +NG  +  ++AS
Subjt:  -------PPPKVHIEGRQILDPILIANEMVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFNSKWISWILGCIKNPKFSIFINGRPRRRIQAS

Query:  RGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFADDTLLFCKYDIEMLEALRKTIELFEWCSGQKVNWEKSALCGLNIDDLE
        RG+RQGDPLSPFLF ++  +LS ++ +   +   EGF VG+ +  V  LQFADDT+ F     E +  L+  + +F   SG KVN +KS + G+N++   
Subjt:  RGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFADDTLLFCKYDIEMLEALRKTIELFEWCSGQKVNWEKSALCGLNIDDLE

Query:  VSSTAARLNCKAEKLPLMYLNLPLGGYPKKTSFWQP------------------------------------SLTKFKVVLS----LERSMRNFFWEGHG
        +S  A  L+CKA   P++YL LPLGG PK + FW P                                     L+ FK+  S    +ER  R+F W G G
Subjt:  VSSTAARLNCKAEKLPLMYLNLPLGGYPKKTSFWQP------------------------------------SLTKFKVVLS----LERSMRNFFWEGHG

Query:  GSKLNHLAHWDSVTRNQKDGGLGLENMRIRNLALLSKWGWRFMHESDSLWCQ-------RHTNRKEVASL-----RSLWISISRQWQKIEALAVFKVGDG
          K +HL +WD V + +  GGLG   + IRN+ALL KW WR+  E  +LW Q        H+N  +V ++     R  W +I+  +Q+      F VG+G
Subjt:  GSKLNHLAHWDSVTRNQKDGGLGLENMRIRNLALLSKWGWRFMHESDSLWCQ-------RHTNRKEVASL-----RSLWISISRQWQKIEALAVFKVGDG

Query:  RKTAFWFDPSLENQPLKDRFPRLFKIAAKPNGSVAGHWDHNS--SWDLVFKRQLKDEDLGEFMSLSSSVANKRVLLQ-PDKRVWTFDGNG
         +  FW D     QPL  ++PRL ++    N  ++         SW+  F+R L D ++ +   L  S     +    PDKR W+   +G
Subjt:  RKTAFWFDPSLENQPLKDRFPRLFKIAAKPNGSVAGHWDHNS--SWDLVFKRQLKDEDLGEFMSLSSSVANKRVLLQ-PDKRVWTFDGNG

TrEMBL top hitse value%identityAlignment
A0A438FWU5 LINE-1 retrotransposable element ORF2 protein4.1e-13930.67Show/hide
Query:  MKIVSWNTRGLGDQSKQLAIKHLILKTNPELVLIQETKKEAIEIEAIKKLRSSRDIGWTFVEAYGKSGGMLTMWDESKILVIETLKGGYTLSVKCKTICG
        MKI+SWNTRGLG + K+  ++  +   NP++V++QETK+E  +   +  +   + + W  + A G SGG++ +WD SK+   E + G ++++VK  +   
Subjt:  MKIVSWNTRGLGDQSKQLAIKHLILKTNPELVLIQETKKEAIEIEAIKKLRSSRDIGWTFVEAYGKSGGMLTMWDESKILVIETLKGGYTLSVKCKTICG

Query:  KICWVTNVYGPTDYKERKRIWLELESLAAYCTEAWCLGGDFNITRAIHKRFSPGRRRKN--------------------------------IIKIIAGQM
           W+T+VYGP +   RK  WLEL+ L       WC+GGDFN+ R I ++    R   N                                I K +    
Subjt:  KICWVTNVYGPTDYKERKRIWLELESLAAYCTEAWCLGGDFNITRAIHKRFSPGRRRKN--------------------------------IIKIIAGQM

Query:  FLVTNEWDEAFEGT------RLVVNSQCCNIINRTLDAG---------------------------NHQGW-------------------------NLRR
        FL ++EWD  F  +      R   +     +    L  G                             +GW                         +L+ 
Subjt:  FLVTNEWDEAFEGT------RLVVNSQCCNIINRTLDAG---------------------------NHQGW-------------------------NLRR

Query:  QEVEIVENLEKDEQWAETDDPSSPLQDLRASLKAYLLDIYKKEERDLIRKRTLNWLHLGDENTKFFHRFLAAKKRRNLITELINNQGLPTNSFRDIEDLI
        ++  I+ +L + +   +  + +S L   R   +  L D+  KEE    +K  + W+  GD N+KFFHR    ++ R  I  LI+ +G   N+  DI + I
Subjt:  QEVEIVENLEKDEQWAETDDPSSPLQDLRASLKAYLLDIYKKEERDLIRKRTLNWLHLGDENTKFFHRFLAAKKRRNLITELINNQGLPTNSFRDIEDLI

Query:  LGFYKNLYTKIPSAGSFPANLEWQKVPDEQNRRLSSRFNRDEIKRDLRGMRHNKAP------------------NEFVALFNEFHENGKLNSCVKENFIC
        + F+ NLY+K          ++W  +  E    L   F  +E++R +  +   KAP                   + + +F EFH NG +N      FI 
Subjt:  LGFYKNLYTKIPSAGSFPANLEWQKVPDEQNRRLSSRFNRDEIKRDLRGMRHNKAP------------------NEFVALFNEFHENGKLNSCVKENFIC

Query:  LIKKKEDAVMVKDFRPISLTTLIYKVIAK---------------PPPKVHIEGRQILDPILIANEMVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVL
        L+ KK  +V + D+RPISL T +YK+IAK                     +EGR ILD +LIANE+V+E R   ++G + K+D EKA+D VDWGFL+ VL
Subjt:  LIKKKEDAVMVKDFRPISLTTLIYKVIAK---------------PPPKVHIEGRQILDPILIANEMVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVL

Query:  QGKNFNSKWISWILGCIKNPKFSIFINGRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFADDTLLFCKYDI
        Q K F+ KW  WI GC+ +  F+I +NG  +  ++ASRG+RQGDPLSPFLF L+  +LS ++ R       EGF VG+++  V +LQFADDT+ F K  +
Subjt:  QGKNFNSKWISWILGCIKNPKFSIFINGRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFADDTLLFCKYDI

Query:  EMLEALRKTIELFEWCSGQKVNWEKSALCGLNIDDLEVSSTAARLNCKAEKLPLMYLNLPLGGYPKKTSFWQP---------------------------
        E L+ L+  + +F   SG K+N EKS + G+N     +SS A+  +C+  + PL YL LPLGG PK   FW P                           
Subjt:  EMLEALRKTIELFEWCSGQKVNWEKSALCGLNIDDLEVSSTAARLNCKAEKLPLMYLNLPLGGYPKKTSFWQP---------------------------

Query:  ---------SLTKFKVVLS----LERSMRNFFWEGHGGSKLNHLAHWDSVTRNQKDGGLGLENMRIRNLALLSKWGWRFMHESDSLWCQ-------RHTN
                  L+ FK+  S    +E+  RNF W G G  K +HL  W+ V+R ++ GGLG   + +RN+ALL KW WRF  E   LW +        H N
Subjt:  ---------SLTKFKVVLS----LERSMRNFFWEGHGGSKLNHLAHWDSVTRNQKDGGLGLENMRIRNLALLSKWGWRFMHESDSLWCQ-------RHTN

Query:  RKEVA-----SLRSLWISISRQWQKIEALAVFKVGDGRKTAFWFDPSLENQPLKDRFPRLFKIAAKPNGSVAGHWDHN--SSWDLVFKRQLKDEDLGEFM
          +       S R  W +I++ +Q+        VG+G +  FW D    NQ L  +F  L+++ +  N +V+    ++   +W+L F+R L D ++    
Subjt:  RKEVA-----SLRSLWISISRQWQKIEALAVFKVGDGRKTAFWFDPSLENQPLKDRFPRLFKIAAKPNGSVAGHWDHN--SSWDLVFKRQLKDEDLGEFM

Query:  SLSSSVANKRVLLQ-PDKRVWTFDGNG
         L SS+++ R      D R W+   +G
Subjt:  SLSSSVANKRVLLQ-PDKRVWTFDGNG

A0A803P8A0 Uncharacterized protein1.6e-15132.49Show/hide
Query:  MKIVSWNTRGLGDQSKQLAIKHLILKTNPELVLIQETKKEAIEIEAIKKLRSSRDIGWTFVEAYGKSGGMLTMWDESKILVIETLKGGYTLSVKCKTICG
        MKI++WN RG GD+ K+ AIK  I K NP++V++QE K+  ++   I  +  SR   W  + A G+SGG L +WD   I V+++L G +++SV       
Subjt:  MKIVSWNTRGLGDQSKQLAIKHLILKTNPELVLIQETKKEAIEIEAIKKLRSSRDIGWTFVEAYGKSGGMLTMWDESKILVIETLKGGYTLSVKCKTICG

Query:  KICWVTNVYGPTDYKERKRIWLELESLAAYCTEAWCLGGDFNITRAIHKRFSPGRRRKNIIKIIAG-------------------------------QMF
        +  W + VYGP  YK R   W EL  L++ C E+WC+GGDFN+TR + ++ +     ++ +K+  G                                 F
Subjt:  KICWVTNVYGPTDYKERKRIWLELESLAAYCTEAWCLGGDFNITRAIHKRFSPGRRRKNIIKIIAG-------------------------------QMF

Query:  LVTNEWDEAFEGTR-------------LVVNSQCCNIINRTLDAGNH--------------------QGWNLRRQEVEIVENLEKDEQWAE---------
        L  N W+  F   R             +V++S+            NH                     GW   +   ++     K ++W+          
Subjt:  LVTNEWDEAFEGTR-------------LVVNSQCCNIINRTLDAGNH--------------------QGWNLRRQEVEIVENLEKDEQWAE---------

Query:  ----------------TDDPSSPLQDLRASLKAYLLDIYKKEERDLIRKRTLNWLHLGDENTKFFHRFLAAKKRRNLITELINNQGLPTNSFRDIEDLIL
                        T   +  L D R  LK     +  +EER +  K    W   GD N++FFH  L A+K RN I+ +  + G   +S ++I + ++
Subjt:  ----------------TDDPSSPLQDLRASLKAYLLDIYKKEERDLIRKRTLNWLHLGDENTKFFHRFLAAKKRRNLITELINNQGLPTNSFRDIEDLIL

Query:  GFYKNLYTKIPSAGSFPANLEWQKVPDEQNRRLSSRFNRDEIKRDLRGMRHNKAP------------------NEFVALFNEFHENGKLNSCVKENFICL
         F+  LYT     G+    +EWQ + +   R+L   F  DE++  +     +KAP                  NE + +F  FH  G++   + + FICL
Subjt:  GFYKNLYTKIPSAGSFPANLEWQKVPDEQNRRLSSRFNRDEIKRDLRGMRHNKAP------------------NEFVALFNEFHENGKLNSCVKENFICL

Query:  IKKKEDAVMVKDFRPISLTTLIYKVIAK---------------PPPKVHIEGRQILDPILIANEMVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQ
        I K+ ++  VKDFRPISL T +YK+IAK                     +EGRQILD +L+ANE VE+YR + KKG++LK+D EKA+D+VDWGFL+ VL+
Subjt:  IKKKEDAVMVKDFRPISLTTLIYKVIAK---------------PPPKVHIEGRQILDPILIANEMVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQ

Query:  GKNFNSKWISWILGCIKNPKFSIFINGRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFADDTLLFCKYDIE
         K F  +W  WI GC+ +  FSIF+NGR R +   SRG+RQGDPLSPFLF L+  +L  ++ +    E F GF +GK+ I +  LQFADDTL F K D +
Subjt:  GKNFNSKWISWILGCIKNPKFSIFINGRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFADDTLLFCKYDIE

Query:  MLEALRKTIELFEWCSGQKVNWEKSALCGLNIDDLEVSSTAARLNCKAEKLPLMYLNLPLGGYPKKTSFWQPSLTK------------------------
         L+ L K +E F   SG KVN  KS L G+ + D  V+  A  + C+  K P+ YL +PLGG P+K +FW+P L K                        
Subjt:  MLEALRKTIELFEWCSGQKVNWEKSALCGLNIDDLEVSSTAARLNCKAEKLPLMYLNLPLGGYPKKTSFWQPSLTK------------------------

Query:  ------------FK----VVLSLERSMRNFFWEGHGGSKLNHLAHWDSVTRNQKDGGLGLENMRIRNLALLSKWGWRFMHESDSLWCQRHTNRKEVA---
                    FK    V+  LE+ MR+FFWEG   +  +HL  WD V + + +GGL +  + +RN  LL KW WRF  ES+SLW +   +R   A   
Subjt:  ------------FK----VVLSLERSMRNFFWEGHGGSKLNHLAHWDSVTRNQKDGGLGLENMRIRNLALLSKWGWRFMHESDSLWCQRHTNRKEVA---

Query:  ---------SLRSLWISISRQWQKIEALAVFKVGDGRKTAFWFDPSLENQPLKDRFPRLFKIAAKPNGSV------AGH-WDHNSSWDLVFKRQLKDEDL
                 S R  W+ I+  + +   +  FKVG+G    FW D  +    L+D+FP L  ++   N S+      AG   D  +SWD  F+R + D ++
Subjt:  ---------SLRSLWISISRQWQKIEALAVFKVGDGRKTAFWFDPSLENQPLKDRFPRLFKIAAKPNGSV------AGH-WDHNSSWDLVFKRQLKDEDL

Query:  GEFMSLSSSVANKRVL-LQPDKRVWTFDGNG
             L   + + RVL +  D R+W  D  G
Subjt:  GEFMSLSSSVANKRVL-LQPDKRVWTFDGNG

A0A803QEA6 Uncharacterized protein2.2e-14531.49Show/hide
Query:  QDFEEESLFSISSEDLEEPEEE------EIHLSINLDDSIGEDLHKLFQSDQSREGMECTQDHHLSKRQKHSMKIVSWNTRGLGDQSKQLAIKHLILKTN
        Q  EE  L   S + L+E E+E      EI +      +I E + ++        GME TQ++  S ++  + +I++WN RG GD+ K+ AIK  I K N
Subjt:  QDFEEESLFSISSEDLEEPEEE------EIHLSINLDDSIGEDLHKLFQSDQSREGMECTQDHHLSKRQKHSMKIVSWNTRGLGDQSKQLAIKHLILKTN

Query:  PELVLIQETKKEAIEIEAIKKLRSSRDIGWTFVEAYGKSGGMLTMWDESKILVIETLKGGYTLSVKCKTICGKICWVTNVYGPTDYKERKRIWLELESLA
        P+LV++QE K+  ++   I  +  SR   W  + A G+SGG L +WD   I V+++L G +++SV       +  W + VYGP  YK R   W EL  L+
Subjt:  PELVLIQETKKEAIEIEAIKKLRSSRDIGWTFVEAYGKSGGMLTMWDESKILVIETLKGGYTLSVKCKTICGKICWVTNVYGPTDYKERKRIWLELESLA

Query:  AYCTEAWCLGGDFNITRAIHKRFSPGRRRKNIIKIIAG-------------------------------QMFLVTNEWDEAFEGTR-----LVVNSQCCN
        + C ++WC+ GDFN+TR + ++ +     ++ +K+  G                                 FL TN W+  F   R      +V+     
Subjt:  AYCTEAWCLGGDFNITRAIHKRFSPGRRRKNIIKIIAG-------------------------------QMFLVTNEWDEAFEGTR-----LVVNSQCCN

Query:  II---------------NRTLDAGN-------------HQGWNLRR--QEVEIVENLEKDEQWAE-------------------------TDDPSSPLQD
        +I               N  LD  +             + GW   +  ++++I++   K ++W++                         T   +  L D
Subjt:  II---------------NRTLDAGN-------------HQGWNLRR--QEVEIVENLEKDEQWAE-------------------------TDDPSSPLQD

Query:  LRASLKAYLLDIYKKEERDLIRKRTLNWLHLGDENTKFFHRFLAAKKRRNLITELINNQGLPTNSFRDIEDLILGFYKNLYTKIPSAGSFPANLEWQKVP
         R  LK     +  +EER    K    W   GD N++FFH  L A+K RN I+ +    G   ++ ++I + ++ F+  LYT     GS    +EWQ++ 
Subjt:  LRASLKAYLLDIYKKEERDLIRKRTLNWLHLGDENTKFFHRFLAAKKRRNLITELINNQGLPTNSFRDIEDLILGFYKNLYTKIPSAGSFPANLEWQKVP

Query:  DEQNRRLSSRFNRDEIKRDLRGMRHNKAP------------------NEFVALFNEFHENGKLNSCVKENFICLIKKKEDAVMVKDFRPISLTTLIYKVI
        +    +L   F  +E++  +     NKAP                  ++ + +F  FH  G++   + + FICLI K+ ++  VKDFRPISL T +YK+I
Subjt:  DEQNRRLSSRFNRDEIKRDLRGMRHNKAP------------------NEFVALFNEFHENGKLNSCVKENFICLIKKKEDAVMVKDFRPISLTTLIYKVI

Query:  AK---------------PPPKVHIEGRQILDPILIANEMVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFNSKWISWILGCIKNPKFSIFIN
        AK                     +EGRQILD +L+ANE VE+YR + +KG++LK+D EKA+D+VDWGFL+ VL+ K F  +W  WI GC+ +  FSIFIN
Subjt:  AK---------------PPPKVHIEGRQILDPILIANEMVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFNSKWISWILGCIKNPKFSIFIN

Query:  GRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFADDTLLFCKYDIEMLEALRKTIELFEWCSGQKVNWEKSA
        GR R +   SRG+RQGDPLSPFLF +I  +L  ++ +    E   GF +GK+ I +  LQFADDTL F K ++  L+ L K ++ F   SG KVN  KS 
Subjt:  GRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFADDTLLFCKYDIEMLEALRKTIELFEWCSGQKVNWEKSA

Query:  LCGLNIDDLEVSSTAARLNCKAEKLPLMYLNLPLGGYPKKTSFWQPSLTKF---------------------------------------KVVL-SLERS
        L G+ +++  V+ +A  + C+  + P+ YL + LGG P+K SFW+P L K                                        KVVL  LE+ 
Subjt:  LCGLNIDDLEVSSTAARLNCKAEKLPLMYLNLPLGGYPKKTSFWQPSLTKF---------------------------------------KVVL-SLERS

Query:  MRNFFWEGHGGSKLNHLAHWDSVTRNQKDGGLGLENMRIRNLALLSKWGWRFMHESDSLWCQRHTNRKEVA------------SLRSLWISISRQWQKIE
        MR FFWEG   +  +HL  WD V + + +GGL +  + +RN  LL KW WRF  E +SLW +   +R   A            S R  W  IS  + +  
Subjt:  MRNFFWEGHGGSKLNHLAHWDSVTRNQKDGGLGLENMRIRNLALLSKWGWRFMHESDSLWCQRHTNRKEVA------------SLRSLWISISRQWQKIE

Query:  ALAVFKVGDGRKTAFWFDPSLENQPLKDRFPRLFKIAAKPNGSV-------AGHWDHNSSWDLVFKRQLKDEDLGEFMSLSSSVANKRVL-LQPDKRVWT
         L  FKVG+G +  FW D  +    L+D+FP L  I+   N S+           D  +SWDL F+R + D ++     L   + + RVL +  D R W 
Subjt:  ALAVFKVGDGRKTAFWFDPSLENQPLKDRFPRLFKIAAKPNGSV-------AGHWDHNSSWDLVFKRQLKDEDLGEFMSLSSSVANKRVL-LQPDKRVWT

Query:  FDGNG
         D  G
Subjt:  FDGNG

A0A803QI00 Uncharacterized protein2.3e-14231.31Show/hide
Query:  RGLGDQSKQLAIKHLILKTNPELVLIQETKKEAIEIEAIKKLRSSRDIGWTFVEAYGKSGGMLTMWDESKILVIETLKGGYTLSVKCKTICGKICWVTNV
        +G GD+ K+ AIK  I K NP+LV++QE K+ +++   I  +  SR   W  + A G+SGG L +WD   I V+++L G +++SV  K       W + V
Subjt:  RGLGDQSKQLAIKHLILKTNPELVLIQETKKEAIEIEAIKKLRSSRDIGWTFVEAYGKSGGMLTMWDESKILVIETLKGGYTLSVKCKTICGKICWVTNV

Query:  YGPTDYKERKRIWLELESLAAYCTEAWCLGGDFNITRAIHKRFSPGRRRKNI---------IKIIAGQM---------------------FLVTNEWDEA
        YGP  YK R   W EL  L+A C ++WC+GGDFN+TR   ++ +     +++         +++I  ++                     FL TN W+  
Subjt:  YGPTDYKERKRIWLELESLAAYCTEAWCLGGDFNITRAIHKRFSPGRRRKNI---------IKIIAGQM---------------------FLVTNEWDEA

Query:  FEGTR-------------LVVNS--------------------QCCNIINRTLDAGNHQGWNLRRQEVEIVENLEKDEQWAET-----------------
        +   R             +V++S                           R     +  GW   +   ++ +  EK ++W+ +                 
Subjt:  FEGTR-------------LVVNS--------------------QCCNIINRTLDAGNHQGWNLRRQEVEIVENLEKDEQWAET-----------------

Query:  -----DDPSSPLQDL---RASLKAYLLDIYKKEERDLIRKRTLNWLHLGDENTKFFHRFLAAKKRRNLITELINNQGLPTNSFRDIEDLILGFYKNLYTK
             +  +S +Q L   R  LK     +  +EER +  K    W   GD N++FFH  L A+K RN I+ +    G   +   +I + ++GF+  LYT 
Subjt:  -----DDPSSPLQDL---RASLKAYLLDIYKKEERDLIRKRTLNWLHLGDENTKFFHRFLAAKKRRNLITELINNQGLPTNSFRDIEDLILGFYKNLYTK

Query:  IPSAGSFPANLEWQKVPDEQNRRLSSRFNRDEIKRDLRGMRHNKAP------------------NEFVALFNEFHENGKLNSCVKENFICLIKKKEDAVM
            GS   ++EWQ++      +L S F  +E+KR +     +KAP                  ++ + +F  F + G++   + E FICLI K+ ++  
Subjt:  IPSAGSFPANLEWQKVPDEQNRRLSSRFNRDEIKRDLRGMRHNKAP------------------NEFVALFNEFHENGKLNSCVKENFICLIKKKEDAVM

Query:  VKDFRPISLTTLIYKVIAK---------------PPPKVHIEGRQILDPILIANEMVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFNSKWI
        VKDFRPISL T +YK++AK                     +EGRQILD +LIANE VE++R + KKG++ K+DLEKA+D+VDW FL+ VL+ K F   W 
Subjt:  VKDFRPISLTTLIYKVIAK---------------PPPKVHIEGRQILDPILIANEMVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFNSKWI

Query:  SWILGCIKNPKFSIFINGRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFADDTLLFCKYDIEMLEALRKTI
         WI GC+ +  FS+ INGR R + + SRG+RQGDPLSPFLF L+  +L  L+ +  + + F GF VGK+ I +  LQFADDTL F K D   L  L + +
Subjt:  SWILGCIKNPKFSIFINGRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFADDTLLFCKYDIEMLEALRKTI

Query:  ELFEWCSGQKVNWEKSALCGLNIDDLEVSSTAARLNCKAEKLPLMYLNLPLGGYPKKTSFWQPSLTK---------------------------------
        E F   SG KVN  KS L G+++++  V+  A  + C+    P+ YL +PLGG P+K +FW+P L K                                 
Subjt:  ELFEWCSGQKVNWEKSALCGLNIDDLEVSSTAARLNCKAEKLPLMYLNLPLGGYPKKTSFWQPSLTK---------------------------------

Query:  ---FK----VVLSLERSMRNFFWEGHGGSKLNHLAHWDSVTRNQKDGGLGLENMRIRNLALLSKWGWRFMHESDSLWCQRHTNR------------KEVA
           FK    V+ ++E+ MR+FFWEG   +  +HL  WD V + + +GGL +  + +RN  LL KW WR+  E +SLW +   +R               A
Subjt:  ---FK----VVLSLERSMRNFFWEGHGGSKLNHLAHWDSVTRNQKDGGLGLENMRIRNLALLSKWGWRFMHESDSLWCQRHTNR------------KEVA

Query:  SLRSLWISISRQWQKIEALAVFKVGDGRKTAFWFDPSLENQPLKDRFPRLFKIAAKPNGSV-------AGHWDHNSSWDLVFKRQLKDEDLGEFMSLSSS
        S R  W  IS  + +   L  FKVG+G    FW D  +    LK++FP +  I+   N S+           +  +SWD  F+R   + ++     L   
Subjt:  SLRSLWISISRQWQKIEALAVFKVGDGRKTAFWFDPSLENQPLKDRFPRLFKIAAKPNGSV-------AGHWDHNSSWDLVFKRQLKDEDLGEFMSLSSS

Query:  VANKRVL-LQPDKRVWTFDGNG
        + + RVL +  D R+W  D +G
Subjt:  VANKRVL-LQPDKRVWTFDGNG

A0A803QQM3 Uncharacterized protein2.8e-14030.71Show/hide
Query:  QDFEEESLFSISSEDLEEPEEEEIHLSI----------NLDDSIGEDLHKLFQSDQSREGMECTQDHHLSKRQKHSMKIVSWNTRGLGDQSKQLAIKHLI
        Q+  E  L  ++    E   EEE+ +++          N+ DS+ E   K+ Q  +         D  + + +K   K    N   LGD+ K+ AIK  I
Subjt:  QDFEEESLFSISSEDLEEPEEEEIHLSI----------NLDDSIGEDLHKLFQSDQSREGMECTQDHHLSKRQKHSMKIVSWNTRGLGDQSKQLAIKHLI

Query:  LKTNPELVLIQETKKEAIEIEAIKKLRSSRDIGWTFVEAYGKSGGMLTMWDESKILVIETLKGGYTLSVKCKTICGKICWVTNVYGPTDYKERKRIWLEL
         K NP+LV++QE K+  ++   I  +  SR   W  + A G+SGG L +WD   I V+++L G +++SV       +  W + VYGP  YK R   W EL
Subjt:  LKTNPELVLIQETKKEAIEIEAIKKLRSSRDIGWTFVEAYGKSGGMLTMWDESKILVIETLKGGYTLSVKCKTICGKICWVTNVYGPTDYKERKRIWLEL

Query:  ESLAAYCTEAWCLGGDFNITRAIHKRFSPGRRRKNIIKIIAG-------------------------------QMFLVTNEWDEAFEGTR----------
          L++ C E+WC+GGDFN+TR + ++ +     ++ +K+  G                                 FL +N W+  +   R          
Subjt:  ESLAAYCTEAWCLGGDFNITRAIHKRFSPGRRRKNIIKIIAG-------------------------------QMFLVTNEWDEAFEGTR----------

Query:  ---LVVNSQCCNIINRTLDAGNH--------------------QGWNLRRQEVEIVENLEKDEQWAE-------------------------TDDPSSPL
           +V++S             NH                     GW   +   ++     K ++W++                         T   +  +
Subjt:  ---LVVNSQCCNIINRTLDAGNH--------------------QGWNLRRQEVEIVENLEKDEQWAE-------------------------TDDPSSPL

Query:  QDLRASLKAYLLDIYKKEERDLIRKRTLNWLHLGDENTKFFHRFLAAKKRRNLITELINNQGLPTNSFRDIEDLILGFYKNLYTKIPSAGSFPANLEWQK
         D R  LK     ++ +EER +  K    W   GD N++ FH  L A+K +N I+ +  + G   ++ ++I + ++ F+  LYT    +G+    +EW K
Subjt:  QDLRASLKAYLLDIYKKEERDLIRKRTLNWLHLGDENTKFFHRFLAAKKRRNLITELINNQGLPTNSFRDIEDLILGFYKNLYTKIPSAGSFPANLEWQK

Query:  VPDEQNRRLSSRFNRDEIKRDLRGMRHNKAPN------------------EFVALFNEFHENGKLNSCVKENFICLIKKKEDAVMVKDFRPISLTTLIYK
        + +   R+L   F  +E++  +     NKAP                   + + +F  FH  G++   + + FICLI K+ ++  VKD+RPISL T +YK
Subjt:  VPDEQNRRLSSRFNRDEIKRDLRGMRHNKAPN------------------EFVALFNEFHENGKLNSCVKENFICLIKKKEDAVMVKDFRPISLTTLIYK

Query:  VIAK---------------PPPKVHIEGRQILDPILIANEMVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFNSKWISWILGCIKNPKFSIF
        +IAK                     +EGRQILD +L+ANE VE+YR + KKG +LK+D EKA+D+VDWGFL+ V++ K F  +W  WI GC+    FSIF
Subjt:  VIAK---------------PPPKVHIEGRQILDPILIANEMVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGKNFNSKWISWILGCIKNPKFSIF

Query:  INGRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFADDTLLFCKYDIEMLEALRKTIELFEWCSGQKVNWEK
        INGR R +   SRG+RQ DPLSPFLF LI  +L  ++ +    E   GF +GK+ I +  LQFADDTL F K D   L+ L K +E F   SG KVN  K
Subjt:  INGRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFADDTLLFCKYDIEMLEALRKTIELFEWCSGQKVNWEK

Query:  SALCGLNIDDLEVSSTAARLNCKAEKLPLMYLNLPLGGYPKKTSFWQPSLTKF---------------------------------------KVVL-SLE
        S L G+ +D+  V+ +A ++ C+  + P+ YL +PLGG P+K SFW+P L K                                        KVVL  LE
Subjt:  SALCGLNIDDLEVSSTAARLNCKAEKLPLMYLNLPLGGYPKKTSFWQPSLTKF---------------------------------------KVVL-SLE

Query:  RSMRNFFWEGHGGSKLNHLAHWDSVTRNQKDGGLGLENMRIRNLALLSKWGWRFMHESDSLWCQRHTNRKEVA------------SLRSLWISISRQWQK
        + MR+FFWEG   +  +HL  WD V + + +GGL +  + +RN  LL KW WRF  ES+SLW +   +R   A            S R  W  IS  + +
Subjt:  RSMRNFFWEGHGGSKLNHLAHWDSVTRNQKDGGLGLENMRIRNLALLSKWGWRFMHESDSLWCQRHTNRKEVA------------SLRSLWISISRQWQK

Query:  IEALAVFKVGDGRKTAFWFDPSLENQPLKDRFPRLFKIAAKPNGSV-------AGHWDHNSSWDLVFKRQLKDEDLGEFMSLSSSVANKRVL-LQPDKRV
           L  FKVG+G    FW D  +    L+D+F  L  I+   N S+           D  +SWDL F+R + D ++     L   + + RVL +  D R+
Subjt:  IEALAVFKVGDGRKTAFWFDPSLENQPLKDRFPRLFKIAAKPNGSV-------AGHWDHNSSWDLVFKRQLKDEDLGEFMSLSSSVANKRVL-LQPDKRV

Query:  WTFDGNG
        W  D  G
Subjt:  WTFDGNG

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein4.0e-2225.76Show/hide
Query:  VENLEKDEQWAETDDPSSPLQDLRASLKAYLLDIYKKEERDLIRKRTLNWLHLGDENTKFFHRFLAAKKRRNLITELINNQGLPTNSFRDIEDLILGFYK
        ++ LEK EQ   T   +S  Q++   ++A L +I  ++    I +    +    ++  +   R +  K+ +N I  + N++G  T    +I+  I  +YK
Subjt:  VENLEKDEQWAETDDPSSPLQDLRASLKAYLLDIYKKEERDLIRKRTLNWLHLGDENTKFFHRFLAAKKRRNLITELINNQGLPTNSFRDIEDLILGFYK

Query:  NLY-TKIPSAGSFPANLEWQKVP---DEQNRRLSSRFNRDEIKRDLRGMRHNKAPN------EF------------VALFNEFHENGKL-NSCVKENFIC
        +LY  K+ +       L+   +P    E+   L+      EI   +  +   K+P       EF            + LF    + G L NS  + + I 
Subjt:  NLY-TKIPSAGSFPANLEWQKVP---DEQNRRLSSRFNRDEIKRDLRGMRHNKAPN------EF------------VALFNEFHENGKL-NSCVKENFIC

Query:  LIKKKEDAVMVKDFRPISLTTLIYKVIAK---------PPPKVH------IEGRQILDPILIANEMVEEY-RIKKKKGWILKLDLEKAFDKVDWGFLEKV
        + K   D    ++FRPISL  +  K++ K             +H      I G Q    I  +  +++   R K K   I+ +D EKAFDK+   F+ K 
Subjt:  LIKKKEDAVMVKDFRPISLTTLIYKVIAK---------PPPKVH------IEGRQILDPILIANEMVEEY-RIKKKKGWILKLDLEKAFDKVDWGFLEKV

Query:  LQGKNFNSKWISWILGCIKNPKFSIFINGRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFADDTLLFCKYD
        L     +  ++  I      P  +I +NG+         G RQG PLSP LF   N +L  L   I ++++ +G  +GKE++ + +  FADD +++ +  
Subjt:  LQGKNFNSKWISWILGCIKNPKFSIFINGRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFADDTLLFCKYD

Query:  IEMLEALRKTIELFEWCSGQKVNWEKS
        I   + L K I  F   SG K+N +KS
Subjt:  IEMLEALRKTIELFEWCSGQKVNWEKS

P08548 LINE-1 reverse transcriptase homolog4.5e-1823.92Show/hide
Query:  LRRQEVEIVENLEKDEQWAETDDPSSPLQDLRASLKAYLLDIYKKEERDLIRK--RTLNWLHLGDENTKFFHRFLAAKKR-RNLITELINNQGLPTNSFR
        L++ E E V NL    +  E ++ S+P    R  +     ++ + E + +I++  ++ +W              L  KKR ++LI+ + N     T    
Subjt:  LRRQEVEIVENLEKDEQWAETDDPSSPLQDLRASLKAYLLDIYKKEERDLIRK--RTLNWLHLGDENTKFFHRFLAAKKR-RNLITELINNQGLPTNSFR

Query:  DIEDLILGFYKNLYT-KIPSAGSFPANLEWQKVP---DEQNRRLSSRFNRDEIKRDLRGMRHNKAP------NEF------------VALFNEFHENGKL
        +I+ ++  +YK LY+ K  +       LE   +P    ++   L+   +  EI   ++ +   K+P      +EF            + LF    + G L
Subjt:  DIEDLILGFYKNLYT-KIPSAGSFPANLEWQKVP---DEQNRRLSSRFNRDEIKRDLRGMRHNKAP------NEF------------VALFNEFHENGKL

Query:  -NSCVKENFICLIKKKEDAVMVKDFRPISLTTLIYKVIAKPPPK---------VH------IEGRQILDPILIANEMVEEY-RIKKKKGWILKLDLEKAF
         N+  + N   + K  +D    +++RPISL  +  K++ K             +H      I G Q    I  +  +++   ++K K   IL +D EKAF
Subjt:  -NSCVKENFICLIKKKEDAVMVKDFRPISLTTLIYKVIAKPPPK---------VH------IEGRQILDPILIANEMVEEY-RIKKKKGWILKLDLEKAF

Query:  DKVDWGFLEKVLQGKNFNSKWISWILGCIKNPKFSIFINGRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEK-FEGFIVGKEKIHVPILQ
        D +   F+ + L+       ++  I      P  +I +NG   +      G RQG PLSP LF ++ ++L+  I    R+EK  +G  +G E+I + +  
Subjt:  DKVDWGFLEKVLQGKNFNSKWISWILGCIKNPKFSIFINGRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEK-FEGFIVGKEKIHVPILQ

Query:  FADDTLLFCKYDIEMLEALRKTIELFEWCSGQKVNWEKS
        FADD +++ +   +    L + I+ +   SG K+N  KS
Subjt:  FADDTLLFCKYDIEMLEALRKTIELFEWCSGQKVNWEKS

P11369 LINE-1 retrotransposable element ORF2 protein1.9e-2426.54Show/hide
Query:  RRQEVEIVENLEKDEQWAETDDPSSPLQDLRASLKAYLLDIYKKEERDLIRK--RTLNW-LHLGDENTKFFHRFLAAKKRRNLITELINNQGLPTNSFRD
        +++E     +L    +  E  + +SP +  R  +     +I + E R  I++  +T +W     ++  K   R     + + LI ++ N +G  T    +
Subjt:  RRQEVEIVENLEKDEQWAETDDPSSPLQDLRASLKAYLLDIYKKEERDLIRK--RTLNW-LHLGDENTKFFHRFLAAKKRRNLITELINNQGLPTNSFRD

Query:  IEDLILGFYKNLY-TKIPSAGSFPANLEWQKVP---DEQNRRLSSRFNRDEIKRDLRGMRHNKAPN------EFVALFNE---------FHE---NGKLN
        I++ I  FYK LY TK+ +       L+  +VP    +Q   L+S  +  EI+  +  +   K+P       EF   F E         FH+    G L 
Subjt:  IEDLILGFYKNLY-TKIPSAGSFPANLEWQKVP---DEQNRRLSSRFNRDEIKRDLRGMRHNKAPN------EFVALFNE---------FHE---NGKLN

Query:  SCVKENFICLI-KKKEDAVMVKDFRPISLTTLIYKVIAK---------------PPPKVHIEGRQILDPILIANEMVEEY-RIKKKKGWILKLDLEKAFD
        +   E  I LI K ++D   +++FRPISL  +  K++ K               P     I G Q    I  +  ++    ++K K   I+ LD EKAFD
Subjt:  SCVKENFICLI-KKKEDAVMVKDFRPISLTTLIYKVIAK---------------PPPKVHIEGRQILDPILIANEMVEEY-RIKKKKGWILKLDLEKAFD

Query:  KVDWGFLEKVLQGKNFNSKWISWILGCIKNPKFSIFINGRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFA
        K+   F+ KVL+       +++ I      P  +I +NG     I    G RQG PLSP+LF   N +L  L   I ++++ +G  +GKE++ + +L  A
Subjt:  KVDWGFLEKVLQGKNFNSKWISWILGCIKNPKFSIFINGRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFA

Query:  DDTLLFCKYDIEMLEALRKTIELFEWCSGQKVNWEKS
        DD +++          L   I  F    G K+N  KS
Subjt:  DDTLLFCKYDIEMLEALRKTIELFEWCSGQKVNWEKS

P14381 Transposon TX1 uncharacterized 149 kDa protein7.4e-2925.12Show/hide
Query:  RRQEVEIV--ENLEKDEQWAETDDPSSPLQDLRASLKAYLLDIYKKEERDLIRKRTLNWLHLGDENTKFFHRFLAAKKRRNLITELINNQGLPTNSFRDI
        R  E+E +  E L+ +++ + ++D +  LQ      K  L ++ +++ R    +  +  L   D  ++FF+     K  R  IT L    G P      I
Subjt:  RRQEVEIV--ENLEKDEQWAETDDPSSPLQDLRASLKAYLLDIYKKEERDLIRKRTLNWLHLGDENTKFFHRFLAAKKRRNLITELINNQGLPTNSFRDI

Query:  EDLILGFYKNLYTKIPSAGSFPANLEWQKVP---DEQNRRLSSRFNRDEIKRDLRGMRHNKAPN------------------EFVALFNEFHENGKLNSC
         D    FY+NL++  P +      L W  +P   + +  RL +    DE+ + LR M HNK+P                   +F  +  E  + G+L   
Subjt:  EDLILGFYKNLYTKIPSAGSFPANLEWQKVP---DEQNRRLSSRFNRDEIKRDLRGMRHNKAPN------------------EFVALFNEFHENGKLNSC

Query:  VKENFICLIKKKEDAVMVKDFRPISLTTLIYKVIAK---------------PPPKVHIEGRQILDPILIANEMVEEYRIKKKKGWILKLDLEKAFDKVDW
         +   + L+ KK D  ++K++RP+SL +  YK++AK               P     + GR I D + +  +++   R        L LD EKAFD+VD 
Subjt:  VKENFICLIKKKEDAVMVKDFRPISLTTLIYKVIAK---------------PPPKVHIEGRQILDPILIANEMVEEYRIKKKKGWILKLDLEKAFDKVDW

Query:  GFLEKVLQGKNFNSKWISWILGCIKNPKFSIFINGRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFADDTL
         +L   LQ  +F  +++ ++     + +  + IN      +   RGVRQG PLS  L+ L  +    L+     +++  G ++ +  + V +  +ADD +
Subjt:  GFLEKVLQGKNFNSKWISWILGCIKNPKFSIFINGRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFADDTL

Query:  LFCKYDIEMLEALRKTIELFEWCSGQKVNWEKSA
        L  + D+  LE  ++  E++   S  ++NW KS+
Subjt:  LFCKYDIEMLEALRKTIELFEWCSGQKVNWEKSA

Q03274 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)3.0e-1430.22Show/hide
Query:  LIKKKEDAVMVKDFRPISLTT----LIYKVIAK---------PPPKVHIEGRQILDPILIANEMVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQG
        LI K  D     ++RPI++ +    L+++++AK         P  K +      L   L+ +  +   R ++K   ++ LD+ KAFD V    + + LQ 
Subjt:  LIKKKEDAVMVKDFRPISLTT----LIYKVIAK---------PPPKVHIEGRQILDPILIANEMVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQG

Query:  KNFNSKWISWILGCIKNPKFSIFIN-GRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFADDTLLFCKYDIE
           +    ++I G + +   +I +  G   R+I   RGV+QGDPLSPFLF   N +L  L+  +       G  +G+EKI  P+L FADD LL    D+ 
Subjt:  KNFNSKWISWILGCIKNPKFSIFIN-GRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFADDTLLFCKYDIE

Query:  MLEALRKTIELFEWCSGQKVNWEKS
        +L     T+  F    G  +N +KS
Subjt:  MLEALRKTIELFEWCSGQKVNWEKS

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.9e-1124.91Show/hide
Query:  GTRLVVNSQCCNIINRTLDAGNHQGW-NLRRQEVEIVENLEKDEQWAETDDPSSPLQDLRASLK------AYLLDIYKKEERDLIRKRTLNWLHLGDENT
        G  L    +CC ++NR       QG+ N++ +  E +++LE  +    T+   S  +    + K      A L   Y+++ R       + WL  GD NT
Subjt:  GTRLVVNSQCCNIINRTLDAGNHQGW-NLRRQEVEIVENLEKDEQWAETDDPSSPLQDLRASLK------AYLLDIYKKEERDLIRKRTLNWLHLGDENT

Query:  KFFHRFLAAKKRRNLITELINNQGLPTNSFRDIEDLILGFYKNLYTK-----IPSAGSFPANLEWQKVPDEQNRRLSSRFNRDEIKRDLRGMRHNKAPN-
        +FFH+ + A + +NLI  L  +  +   +   ++++I+ +Y +L         P +     ++   +  D    RLS+  +  EI   +  M  NKAP  
Subjt:  KFFHRFLAAKKRRNLITELINNQGLPTNSFRDIEDLILGFYKNLYTK-----IPSAGSFPANLEWQKVPDEQNRRLSSRFNRDEIKRDLRGMRHNKAPN-

Query:  -----EF------------VALFNEFHENGKLNSCVKENFICLIKKKEDAVMVKDFRPISLTTLIYKVI
             EF            +A   EF   G L        I LI K      +  FRP+S  T++YK+I
Subjt:  -----EF------------VALFNEFHENGKLNSCVKENFICLIKKKEDAVMVKDFRPISLTTLIYKVI

AT4G20520.1 RNA binding;RNA-directed DNA polymerases3.1e-0639.77Show/hide
Query:  MVKDFRPISLTTLIYKVIAKPPPKVHIEGRQILDPILIANEMVEEYRIKK-KKGW-ILKLDLEKAFDKVDWGFLEKVLQGKNFNSKWI
        MV+  +P+ +T LI      P     I GR   D I+   E V   R KK  KGW +LKLDLEKA+D++ W +LE  L    F   W+
Subjt:  MVKDFRPISLTTLIYKVIAKPPPKVHIEGRQILDPILIANEMVEEYRIKK-KKGW-ILKLDLEKAFDKVDWGFLEKVLQGKNFNSKWI

AT4G29090.1 Ribonuclease H-like superfamily protein2.6e-0526.83Show/hide
Query:  MRNFFWEGHGGSKLNHLAHWDSVTRNQKDGGLGLENMRIRNLALLSKWGWRFMHESDSLWCQ----RHTNRKEV--ASLRS----LWISISRQWQKIEAL
        + +F+W     +K  H   WD ++  + +GG+G +++   NLALL K  WR +   +SL  +    R+ ++ +   A L S    +W SI    + +   
Subjt:  MRNFFWEGHGGSKLNHLAHWDSVTRNQKDGGLGLENMRIRNLALLSKWGWRFMHESDSLWCQ----RHTNRKEV--ASLRS----LWISISRQWQKIEAL

Query:  AVFKVGDGRKTAFWFDPSLENQP
        A   VG+G     W    L+++P
Subjt:  AVFKVGDGRKTAFWFDPSLENQP

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)2.3e-0944.78Show/hide
Query:  INGRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFADDT
        ING P+  +  SRG+RQGDPLSP+LF+L  ++LS L  R   + +  G  V      +  L FADDT
Subjt:  INGRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFADDT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGTAGTTAGCTGCAAAATTAATGGGGTGTTCTTCTGCTCTTGGTCCGAAAAGGGAAGTTTCATAATAGAAGATGTTGAAGTTCGAAAATTAATTCGCCTCTCAAA
AATGCAGATGTCTTGGATGGTAGAACAAGTAGAGGACCTTCTCAATGGTTCAAGCGTCTGTTTCTTCTTAAGGGAATGGAGAGACGACTCAGGATTAATAAGAATATCTA
AATTGAAGATCAAGACAGAGTGGAGTTTAAGATGTGTAGTTTGGCCATTTACCGGAGGCCGGAAAGAAGTCGGCCGAATTAATGAACCCCTCGAAAATGATCTGACCCTT
GACATTCTATCTGGGGAAATCCTCCAAACAACTAAGCCCATTTTCCTTGTTTCTATCCAAAGAAGAAGCCGAAAAGTCTCCCCCAGCAGCCCGAAAGCAAAGCAAATAGA
AAGGATTTCGAATTTCAAAAGGAATGAAAAGAAGAAAGAGAATGCAGCCGGCAGATTTAGTGATGTCCTCGAAAACAAACAGCCCCCTGAAACTCCATCGGATTCCCTCC
AAAAATCCGAGCCCTTCAGTTTCGAATCCATCCATAGAAATCGAAAAGTCTCAGACGACCCGTTGGTGGTTGTTGAAGAAGCCGAAAAGTCACCCCCCAGCAGCCCGAAA
TCCAATCCAGAAGAGCATTCATTTCAGTCACTATCAACATTACATCAACAGCCTCGAGCAAGCACGGTAGCCTTGGACAAGAACTCATTTCTTCCAGCCGTTAGAAAGAG
ATCTAATCTCTTCAAGCCCTTCCCAAAGCACTTCAACAGAGGAAAGACTTCATTCCTCAATTTATGGTCCGCATTGACAAATTCAGATTTACTCAAAGAAAGTTGTGCAA
ATGCTCAAGCTCCACACAAAAAAAATCTCCAAGTCAATACTTCCCCTCTCTTCAACACAAAAGTTTCAGCTTTCAGAGTTTCTAATTCCGACTCCTCCCATCTCAGCCCA
AGTAAGAAGCCACAGGACTTTGAAGAAGAATCTCTATTCAGCATAAGTAGTGAAGATTTAGAAGAACCTGAAGAAGAAGAGATCCATCTGAGCATAAACTTGGATGATTC
AATTGGAGAAGATCTACACAAACTTTTCCAGTCTGACCAAAGCAGAGAAGGCATGGAGTGTACTCAGGACCACCACTTGTCAAAGAGGCAGAAGCATTCAATGAAAATTG
TTTCTTGGAACACTAGAGGTCTTGGAGATCAATCCAAACAATTGGCAATCAAGCATTTAATTTTGAAGACAAATCCAGAGCTGGTTTTAATTCAAGAAACCAAGAAAGAA
GCAATTGAAATTGAAGCAATAAAAAAATTAAGGAGTTCAAGAGACATTGGGTGGACTTTTGTGGAAGCATATGGCAAATCAGGGGGGATGCTAACCATGTGGGATGAAAG
CAAAATCTTAGTCATTGAAACACTAAAAGGAGGCTACACACTCTCAGTTAAATGCAAGACTATATGCGGAAAGATTTGTTGGGTAACGAACGTCTATGGTCCAACCGATT
ATAAGGAAAGAAAGCGCATTTGGCTGGAGTTAGAATCTTTAGCAGCCTACTGCACAGAAGCTTGGTGCTTGGGAGGAGATTTCAATATCACTAGAGCAATTCATAAAAGA
TTTTCACCTGGAAGAAGAAGGAAGAACATCATCAAGATCATTGCTGGACAGATGTTCCTTGTGACAAATGAATGGGATGAAGCCTTCGAAGGCACTCGACTGGTTGTTAA
TTCTCAGTGCTGTAACATTATAAATAGAACTCTTGATGCTGGAAATCATCAAGGATGGAATCTAAGAAGACAGGAAGTGGAAATAGTAGAGAATCTGGAAAAAGATGAGC
AGTGGGCAGAAACAGATGATCCATCCTCCCCTTTACAAGATCTAAGGGCATCCCTAAAAGCTTATCTGCTGGATATTTACAAAAAAGAGGAAAGAGACTTGATTCGGAAG
CGTACGCTAAATTGGCTACATTTAGGAGACGAAAACACAAAATTTTTCCATCGTTTTCTTGCTGCAAAAAAGAGAAGGAATCTTATAACAGAATTAATCAATAATCAAGG
CCTTCCAACCAATTCATTCCGCGATATAGAAGACCTAATTTTAGGATTCTACAAGAATCTTTACACAAAAATCCCAAGTGCCGGAAGCTTCCCTGCAAATTTGGAGTGGC
AAAAGGTACCTGATGAGCAAAACAGAAGGCTATCTTCAAGATTCAATAGGGATGAAATCAAAAGGGATTTAAGAGGAATGCGGCATAATAAAGCGCCTAATGAGTTTGTA
GCCCTATTTAATGAATTTCATGAAAATGGAAAGCTAAACTCTTGTGTAAAGGAAAACTTCATTTGCTTGATCAAGAAAAAAGAAGATGCTGTCATGGTAAAAGATTTCCG
GCCAATCAGCCTCACCACATTAATATATAAAGTAATTGCCAAACCCCCACCCAAAGTGCATATCGAAGGACGGCAGATTTTAGATCCCATCCTCATTGCAAATGAAATGG
TGGAGGAATATAGAATCAAGAAGAAAAAAGGGTGGATACTAAAGCTAGATTTAGAAAAAGCCTTTGACAAAGTGGATTGGGGATTTCTAGAGAAGGTTCTACAAGGAAAG
AACTTCAATTCCAAATGGATTTCTTGGATTTTGGGTTGCATCAAAAACCCAAAATTCTCAATCTTCATTAATGGAAGACCAAGAAGAAGAATCCAAGCTTCAAGAGGTGT
TAGACAAGGAGATCCTCTCTCACCCTTCTTATTTCTCCTAATTAATAAGCTGTTGAGTAGCCTCATCTCAAGAATTCATCGCAAAGAAAAATTTGAAGGATTTATAGTGG
GGAAGGAGAAAATACATGTTCCAATTCTTCAATTCGCAGATGATACATTATTATTTTGCAAGTATGATATTGAAATGTTGGAAGCCTTGAGAAAAACTATTGAGCTATTT
GAATGGTGCTCCGGCCAAAAGGTGAATTGGGAGAAATCAGCCTTGTGTGGGTTAAATATTGATGATCTAGAGGTTAGCTCGACTGCTGCTAGGTTGAATTGCAAGGCTGA
AAAACTTCCCTTAATGTATCTCAATTTGCCCTTAGGAGGATACCCAAAGAAGACTTCTTTTTGGCAGCCATCCTTGACAAAATTCAAGGTAGTCTTGTCATTAGAAAGAT
CCATGAGAAACTTCTTTTGGGAAGGTCACGGAGGGAGCAAACTGAACCACCTTGCTCATTGGGATTCAGTCACTAGAAATCAAAAAGATGGAGGTCTTGGGCTGGAAAAT
ATGAGAATCAGAAACTTGGCGCTGCTGTCCAAATGGGGGTGGCGTTTTATGCATGAATCGGACTCACTTTGGTGCCAACGGCACACTAACAGAAAGGAGGTTGCAAGTCT
TAGAAGCCTATGGATAAGCATTTCAAGGCAATGGCAGAAGATAGAAGCCTTGGCAGTCTTTAAAGTAGGAGATGGAAGAAAAACTGCTTTTTGGTTTGATCCTTCGCTTG
AGAATCAACCCCTAAAGGACAGATTTCCAAGACTTTTCAAGATAGCTGCCAAGCCAAATGGGTCAGTAGCAGGTCATTGGGACCATAACTCCTCATGGGATTTAGTGTTC
AAAAGGCAGCTAAAAGATGAAGATTTGGGTGAATTTATGTCTTTATCAAGCAGTGTGGCAAATAAAAGAGTGCTGTTACAGCCGGATAAAAGGGTATGGACATTTGATGG
AAATGGTGGTTTACCTTTCCTTGGCTTCCCCAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGTAGTTAGCTGCAAAATTAATGGGGTGTTCTTCTGCTCTTGGTCCGAAAAGGGAAGTTTCATAATAGAAGATGTTGAAGTTCGAAAATTAATTCGCCTCTCAAA
AATGCAGATGTCTTGGATGGTAGAACAAGTAGAGGACCTTCTCAATGGTTCAAGCGTCTGTTTCTTCTTAAGGGAATGGAGAGACGACTCAGGATTAATAAGAATATCTA
AATTGAAGATCAAGACAGAGTGGAGTTTAAGATGTGTAGTTTGGCCATTTACCGGAGGCCGGAAAGAAGTCGGCCGAATTAATGAACCCCTCGAAAATGATCTGACCCTT
GACATTCTATCTGGGGAAATCCTCCAAACAACTAAGCCCATTTTCCTTGTTTCTATCCAAAGAAGAAGCCGAAAAGTCTCCCCCAGCAGCCCGAAAGCAAAGCAAATAGA
AAGGATTTCGAATTTCAAAAGGAATGAAAAGAAGAAAGAGAATGCAGCCGGCAGATTTAGTGATGTCCTCGAAAACAAACAGCCCCCTGAAACTCCATCGGATTCCCTCC
AAAAATCCGAGCCCTTCAGTTTCGAATCCATCCATAGAAATCGAAAAGTCTCAGACGACCCGTTGGTGGTTGTTGAAGAAGCCGAAAAGTCACCCCCCAGCAGCCCGAAA
TCCAATCCAGAAGAGCATTCATTTCAGTCACTATCAACATTACATCAACAGCCTCGAGCAAGCACGGTAGCCTTGGACAAGAACTCATTTCTTCCAGCCGTTAGAAAGAG
ATCTAATCTCTTCAAGCCCTTCCCAAAGCACTTCAACAGAGGAAAGACTTCATTCCTCAATTTATGGTCCGCATTGACAAATTCAGATTTACTCAAAGAAAGTTGTGCAA
ATGCTCAAGCTCCACACAAAAAAAATCTCCAAGTCAATACTTCCCCTCTCTTCAACACAAAAGTTTCAGCTTTCAGAGTTTCTAATTCCGACTCCTCCCATCTCAGCCCA
AGTAAGAAGCCACAGGACTTTGAAGAAGAATCTCTATTCAGCATAAGTAGTGAAGATTTAGAAGAACCTGAAGAAGAAGAGATCCATCTGAGCATAAACTTGGATGATTC
AATTGGAGAAGATCTACACAAACTTTTCCAGTCTGACCAAAGCAGAGAAGGCATGGAGTGTACTCAGGACCACCACTTGTCAAAGAGGCAGAAGCATTCAATGAAAATTG
TTTCTTGGAACACTAGAGGTCTTGGAGATCAATCCAAACAATTGGCAATCAAGCATTTAATTTTGAAGACAAATCCAGAGCTGGTTTTAATTCAAGAAACCAAGAAAGAA
GCAATTGAAATTGAAGCAATAAAAAAATTAAGGAGTTCAAGAGACATTGGGTGGACTTTTGTGGAAGCATATGGCAAATCAGGGGGGATGCTAACCATGTGGGATGAAAG
CAAAATCTTAGTCATTGAAACACTAAAAGGAGGCTACACACTCTCAGTTAAATGCAAGACTATATGCGGAAAGATTTGTTGGGTAACGAACGTCTATGGTCCAACCGATT
ATAAGGAAAGAAAGCGCATTTGGCTGGAGTTAGAATCTTTAGCAGCCTACTGCACAGAAGCTTGGTGCTTGGGAGGAGATTTCAATATCACTAGAGCAATTCATAAAAGA
TTTTCACCTGGAAGAAGAAGGAAGAACATCATCAAGATCATTGCTGGACAGATGTTCCTTGTGACAAATGAATGGGATGAAGCCTTCGAAGGCACTCGACTGGTTGTTAA
TTCTCAGTGCTGTAACATTATAAATAGAACTCTTGATGCTGGAAATCATCAAGGATGGAATCTAAGAAGACAGGAAGTGGAAATAGTAGAGAATCTGGAAAAAGATGAGC
AGTGGGCAGAAACAGATGATCCATCCTCCCCTTTACAAGATCTAAGGGCATCCCTAAAAGCTTATCTGCTGGATATTTACAAAAAAGAGGAAAGAGACTTGATTCGGAAG
CGTACGCTAAATTGGCTACATTTAGGAGACGAAAACACAAAATTTTTCCATCGTTTTCTTGCTGCAAAAAAGAGAAGGAATCTTATAACAGAATTAATCAATAATCAAGG
CCTTCCAACCAATTCATTCCGCGATATAGAAGACCTAATTTTAGGATTCTACAAGAATCTTTACACAAAAATCCCAAGTGCCGGAAGCTTCCCTGCAAATTTGGAGTGGC
AAAAGGTACCTGATGAGCAAAACAGAAGGCTATCTTCAAGATTCAATAGGGATGAAATCAAAAGGGATTTAAGAGGAATGCGGCATAATAAAGCGCCTAATGAGTTTGTA
GCCCTATTTAATGAATTTCATGAAAATGGAAAGCTAAACTCTTGTGTAAAGGAAAACTTCATTTGCTTGATCAAGAAAAAAGAAGATGCTGTCATGGTAAAAGATTTCCG
GCCAATCAGCCTCACCACATTAATATATAAAGTAATTGCCAAACCCCCACCCAAAGTGCATATCGAAGGACGGCAGATTTTAGATCCCATCCTCATTGCAAATGAAATGG
TGGAGGAATATAGAATCAAGAAGAAAAAAGGGTGGATACTAAAGCTAGATTTAGAAAAAGCCTTTGACAAAGTGGATTGGGGATTTCTAGAGAAGGTTCTACAAGGAAAG
AACTTCAATTCCAAATGGATTTCTTGGATTTTGGGTTGCATCAAAAACCCAAAATTCTCAATCTTCATTAATGGAAGACCAAGAAGAAGAATCCAAGCTTCAAGAGGTGT
TAGACAAGGAGATCCTCTCTCACCCTTCTTATTTCTCCTAATTAATAAGCTGTTGAGTAGCCTCATCTCAAGAATTCATCGCAAAGAAAAATTTGAAGGATTTATAGTGG
GGAAGGAGAAAATACATGTTCCAATTCTTCAATTCGCAGATGATACATTATTATTTTGCAAGTATGATATTGAAATGTTGGAAGCCTTGAGAAAAACTATTGAGCTATTT
GAATGGTGCTCCGGCCAAAAGGTGAATTGGGAGAAATCAGCCTTGTGTGGGTTAAATATTGATGATCTAGAGGTTAGCTCGACTGCTGCTAGGTTGAATTGCAAGGCTGA
AAAACTTCCCTTAATGTATCTCAATTTGCCCTTAGGAGGATACCCAAAGAAGACTTCTTTTTGGCAGCCATCCTTGACAAAATTCAAGGTAGTCTTGTCATTAGAAAGAT
CCATGAGAAACTTCTTTTGGGAAGGTCACGGAGGGAGCAAACTGAACCACCTTGCTCATTGGGATTCAGTCACTAGAAATCAAAAAGATGGAGGTCTTGGGCTGGAAAAT
ATGAGAATCAGAAACTTGGCGCTGCTGTCCAAATGGGGGTGGCGTTTTATGCATGAATCGGACTCACTTTGGTGCCAACGGCACACTAACAGAAAGGAGGTTGCAAGTCT
TAGAAGCCTATGGATAAGCATTTCAAGGCAATGGCAGAAGATAGAAGCCTTGGCAGTCTTTAAAGTAGGAGATGGAAGAAAAACTGCTTTTTGGTTTGATCCTTCGCTTG
AGAATCAACCCCTAAAGGACAGATTTCCAAGACTTTTCAAGATAGCTGCCAAGCCAAATGGGTCAGTAGCAGGTCATTGGGACCATAACTCCTCATGGGATTTAGTGTTC
AAAAGGCAGCTAAAAGATGAAGATTTGGGTGAATTTATGTCTTTATCAAGCAGTGTGGCAAATAAAAGAGTGCTGTTACAGCCGGATAAAAGGGTATGGACATTTGATGG
AAATGGTGGTTTACCTTTCCTTGGCTTCCCCAATTGA
Protein sequenceShow/hide protein sequence
MEVVSCKINGVFFCSWSEKGSFIIEDVEVRKLIRLSKMQMSWMVEQVEDLLNGSSVCFFLREWRDDSGLIRISKLKIKTEWSLRCVVWPFTGGRKEVGRINEPLENDLTL
DILSGEILQTTKPIFLVSIQRRSRKVSPSSPKAKQIERISNFKRNEKKKENAAGRFSDVLENKQPPETPSDSLQKSEPFSFESIHRNRKVSDDPLVVVEEAEKSPPSSPK
SNPEEHSFQSLSTLHQQPRASTVALDKNSFLPAVRKRSNLFKPFPKHFNRGKTSFLNLWSALTNSDLLKESCANAQAPHKKNLQVNTSPLFNTKVSAFRVSNSDSSHLSP
SKKPQDFEEESLFSISSEDLEEPEEEEIHLSINLDDSIGEDLHKLFQSDQSREGMECTQDHHLSKRQKHSMKIVSWNTRGLGDQSKQLAIKHLILKTNPELVLIQETKKE
AIEIEAIKKLRSSRDIGWTFVEAYGKSGGMLTMWDESKILVIETLKGGYTLSVKCKTICGKICWVTNVYGPTDYKERKRIWLELESLAAYCTEAWCLGGDFNITRAIHKR
FSPGRRRKNIIKIIAGQMFLVTNEWDEAFEGTRLVVNSQCCNIINRTLDAGNHQGWNLRRQEVEIVENLEKDEQWAETDDPSSPLQDLRASLKAYLLDIYKKEERDLIRK
RTLNWLHLGDENTKFFHRFLAAKKRRNLITELINNQGLPTNSFRDIEDLILGFYKNLYTKIPSAGSFPANLEWQKVPDEQNRRLSSRFNRDEIKRDLRGMRHNKAPNEFV
ALFNEFHENGKLNSCVKENFICLIKKKEDAVMVKDFRPISLTTLIYKVIAKPPPKVHIEGRQILDPILIANEMVEEYRIKKKKGWILKLDLEKAFDKVDWGFLEKVLQGK
NFNSKWISWILGCIKNPKFSIFINGRPRRRIQASRGVRQGDPLSPFLFLLINKLLSSLISRIHRKEKFEGFIVGKEKIHVPILQFADDTLLFCKYDIEMLEALRKTIELF
EWCSGQKVNWEKSALCGLNIDDLEVSSTAARLNCKAEKLPLMYLNLPLGGYPKKTSFWQPSLTKFKVVLSLERSMRNFFWEGHGGSKLNHLAHWDSVTRNQKDGGLGLEN
MRIRNLALLSKWGWRFMHESDSLWCQRHTNRKEVASLRSLWISISRQWQKIEALAVFKVGDGRKTAFWFDPSLENQPLKDRFPRLFKIAAKPNGSVAGHWDHNSSWDLVF
KRQLKDEDLGEFMSLSSSVANKRVLLQPDKRVWTFDGNGGLPFLGFPN