; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI06G15800 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI06G15800
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr6:14149188..14151472
RNA-Seq ExpressionCSPI06G15800
SyntenyCSPI06G15800
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042317.1 non-LTR retroelement reverse transcriptase-like protein [Cucumis melo var. makuwa]1.1e-17147.55Show/hide
Query:  QKSRIRWLKLGNKNTAFFYRSVRSRQSSNVLQSFIDPDGNRLTNHDQEIVQFRWTEECCQALQAPIGREEVRRVLFSMDSGKAPSQMGIRLASSKELGRW
        QKSRIRWLKLG++N  FF+RS+      N L S  +     L+     IVQFRW++EC QALQ PI  EEVRRVLFSMDSGKAP   G  +   K +   
Subjt:  QKSRIRWLKLGNKNTAFFYRSVRSRQSSNVLQSFIDPDGNRLTNHDQEIVQFRWTEECCQALQAPIGREEVRRVLFSMDSGKAPSQMGIRLASSKELGRW

Query:  LERIYVMSSYASLSSITSLKGGIRLRLLLFLKGMVLIGWRISGLYLASAFIPGRSIIDNILICQEVVEGYHLNT-------------AYDSVNWDFLFGL
        +           ++ I    G  R+     +    +I   IS +      +   + I         V GYHLN+             AYDS+NWDFLFGL
Subjt:  LERIYVMSSYASLSSITSLKGGIRLRLLLFLKGMVLIGWRISGLYLASAFIPGRSIIDNILICQEVVEGYHLNT-------------AYDSVNWDFLFGL

Query:  LIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKRLRQGDPLSQFLFVMVMEVLSRMLISPPQNFKFHQFCEKVMLTHLTFANDLMIFCAADNYS
        LIAI TPL+FVSW++AC+TSPMFSIMINGSLEGFFHGRK +RQG+PLS F FVMVM+V SRML  PPQ F+FHQ CEKV LT LTFA+DLMIFC AD  S
Subjt:  LIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKRLRQGDPLSQFLFVMVMEVLSRMLISPPQNFKFHQFCEKVMLTHLTFANDLMIFCAADNYS

Query:  WSFIKETIKRFGDLSGLFANLDKNSIFLVGVNSAEASWLAANMGFTIGHLHVRYLGLPLLLGRLRSSDCDPLIQRITSRIRSWSARVLSFAGRLQLVRSV
         SF++ET+++FG+L GL+ANL K SIF+ G  +  AS LAANMGF +G+L VRYL LPLL GRLR SDC PLIQRITSRIRSW+ARVLSFAGR QLVRSV
Subjt:  WSFIKETIKRFGDLSGLFANLDKNSIFLVGVNSAEASWLAANMGFTIGHLHVRYLGLPLLLGRLRSSDCDPLIQRITSRIRSWSARVLSFAGRLQLVRSV

Query:  LMSLQKEGRGVAKVAWDEVY-LPFDEGGLA----------IRDGSSWNIASTLKILGLLLVKSGSLWVAWVEAYILKGRLSWEIDARVGRSWYFRAILRK
          SLQ        V W  V+ LP                 +RDG SWNI STLKIL LL   S SLWVAWVEAYILK +  W +D+ V RSW  RAILRK
Subjt:  LMSLQKEGRGVAKVAWDEVY-LPFDEGGLA----------IRDGSSWNIASTLKILGLLLVKSGSLWVAWVEAYILKGRLSWEIDARVGRSWYFRAILRK

Query:  RDILKAHVKMEI--------------QGGPIIQQFGERVIYDAGIRRDARLVDLMGRDGE----------------IQGLRSSPSVEDRWVW--------
        RD LK HV +E+              QG PI++Q GERV+YDA  RR+ARL + +G DGE                +Q +R   SV DRWVW        
Subjt:  RDILKAHVKMEI--------------QGGPIIQQFGERVIYDAGIRRDARLVDLMGRDGE----------------IQGLRSSPSVEDRWVW--------

Query:  ---------------------------------------------------------------------------------EIWSRILLIMSSSHRIGYW
                                                                                         ++WSR+L +M+SSHRI YW
Subjt:  ---------------------------------------------------------------------------------EIWSRILLIMSSSHRIGYW

Query:  GVQLSWICNQGIGKGVRKKLWRLLWCATIYFIWQERNHRLHGGAVWEPMVIFQLIRSCIKVRATSWFDGVHGLI
        GV+LSWIC+QGIG  VR+KLWR+L CAT YFIW+E NHRLHGG     ++IFQ I +CI+ R  SW +  H LI
Subjt:  GVQLSWICNQGIGKGVRKKLWRLLWCATIYFIWQERNHRLHGGAVWEPMVIFQLIRSCIKVRATSWFDGVHGLI

KAA0046851.1 uncharacterized protein E6C27_scaffold19358G00020 [Cucumis melo var. makuwa]1.5e-18947.41Show/hide
Query:  MRQKSRIRWLKLGNKNTAFFYRSVRSRQSSNVLQSFIDPDGNRLTNHD----------------------------QEIVQFRWTEECCQALQAPIGREE
        +RQKS++RWL LG++NTAFF+RSVRSR S N L S +D DG+R+++HD                             +IVQF+W+EECCQALQ PI REE
Subjt:  MRQKSRIRWLKLGNKNTAFFYRSVRSRQSSNVLQSFIDPDGNRLTNHD----------------------------QEIVQFRWTEECCQALQAPIGREE

Query:  VRRVLFSMDSGKAPSQMGIRLASSKELGRW--------------LERIY--VMSSYASLSSITSLKGGIRL----------------------RLLLFLK
        VRRVLFSMDSGKAP   G  +   K  G W               E  Y  +  +  +++ I    G  RL                      RL L+L 
Subjt:  VRRVLFSMDSGKAPSQMGIRLASSKELGRW--------------LERIY--VMSSYASLSSITSLKGGIRL----------------------RLLLFLK

Query:  GMVLIGWRISGLYLASAFIPGRSIIDNILICQEVVEGYHLNT-------------AYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLE
          +            SAFIPGRSII+NIL+CQE+V GYHLN+             AYDSVNWDFLFGLLIAIGTPL+FVSW+RACVTS MFSIMINGSLE
Subjt:  GMVLIGWRISGLYLASAFIPGRSIIDNILICQEVVEGYHLNT-------------AYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLE

Query:  GFFHGRKRLRQGDPLSQFLFVMVMEVLSRMLISPPQNFKFHQFCEKVMLTHLTFANDLMIFCAADNYSWSFIKETIKRFGDLSGLFANLDKNSIFLVGVN
        GFF+GRK LRQGDPLS FLFVMVMEVLSRML   PQ+F+FH  CEKV LTHLTFA+DLMIFCAAD  S SFI+E +++FG+ SGLFAN  K+SIF+VGVN
Subjt:  GFFHGRKRLRQGDPLSQFLFVMVMEVLSRMLISPPQNFKFHQFCEKVMLTHLTFANDLMIFCAADNYSWSFIKETIKRFGDLSGLFANLDKNSIFLVGVN

Query:  SAEASWLAANMGFTIGHLHVRYLGLPLLLG-------RLRSSDCDPLIQRITSRIRSWSARVLSFAGRLQLVRSVLMSLQ--------------------
        +  AS LAA +G +           P  L         LRS DC PLIQRITSRIRSW+ARVLSFAGRLQLVRSVL SLQ                    
Subjt:  SAEASWLAANMGFTIGHLHVRYLGLPLLLG-------RLRSSDCDPLIQRITSRIRSWSARVLSFAGRLQLVRSVLMSLQ--------------------

Query:  ---------KEGRGVAKVAWDEVYLPFDEGGLAIRDGSSWNIASTLKILGLLLVKSGSLWVAWVEAYILKGRLSWEIDARVGRSWYFRAILRKRDILKAH
                 +EGRG  KVAW +V LPF+EGGL IRDG SWNIA+TLKI   LL   GSLWVAW+EAYILKG+  W++D+RVGRSW  RAILRKR+ +K H
Subjt:  ---------KEGRGVAKVAWDEVYLPFDEGGLAIRDGSSWNIASTLKILGLLLVKSGSLWVAWVEAYILKGRLSWEIDARVGRSWYFRAILRKRDILKAH

Query:  VKMEIQGGPIIQQFGERVIYDAGIRRDARLVDLMGRDGE----------------IQGLRSSPSVEDRWVW-----------------------------
        V             GERV+YDA  RR+A+L D +  +GE                +Q +    SV D WVW                             
Subjt:  VKMEIQGGPIIQQFGERVIYDAGIRRDARLVDLMGRDGE----------------IQGLRSSPSVEDRWVW-----------------------------

Query:  ------------------------------------------------------------EIWSRILLIMSSSHRIGYWGVQLSWICNQGIGKGVRKKLW
                                                                    ++WSR+  IM SSHRIG+WGV+LSWIC++GIGKGVR+KLW
Subjt:  ------------------------------------------------------------EIWSRILLIMSSSHRIGYWGVQLSWICNQGIGKGVRKKLW

Query:  RLLWCATIYFIWQERNHRLHGGAVWEPMVIFQLIRSCIKVRATSWFDGVH
        R+LWCATIYFIW ERNHRLHGG   +P+++F LI + I+ RA SW +  H
Subjt:  RLLWCATIYFIWQERNHRLHGGAVWEPMVIFQLIRSCIKVRATSWFDGVH

KAA0059752.1 F17F8.5 [Cucumis melo var. makuwa]4.7e-17048.18Show/hide
Query:  MRQKSRIRWLKLGNKNTAFFYRSVRSRQSSNVLQSFIDPDGNRLTNHD----------------------------QEIVQFRWTEECCQALQAPIGREE
        +RQKSRIRWLKLG++NT FF+RSV SR S N L+S +D DG R+++HD                             +IVQF+W+EECCQALQ PI REE
Subjt:  MRQKSRIRWLKLGNKNTAFFYRSVRSRQSSNVLQSFIDPDGNRLTNHD----------------------------QEIVQFRWTEECCQALQAPIGREE

Query:  VRRVLFSMDSGKAPSQMGIRLASSKELGRW--------------LERIY--VMSSYASLSSITSLKGGIRLRLLLFLKGMVLIGWRISGLYL--------
        VRRVLFSMDSGKAP   G      K  G W               E  Y  V  +  +++ I    G  RL     +    ++   IS +          
Subjt:  VRRVLFSMDSGKAPSQMGIRLASSKELGRW--------------LERIY--VMSSYASLSSITSLKGGIRLRLLLFLKGMVLIGWRISGLYL--------

Query:  ------ASAFIPGRSIIDNILICQEVVEGYHLNT-------------AYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKR
               SAF+PGRSII+NIL+CQE+V GYHLN+             AYDSVNWDFLFGLLIAIGTPL+F                              
Subjt:  ------ASAFIPGRSIIDNILICQEVVEGYHLNT-------------AYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKR

Query:  LRQGDPLSQFLFVMVMEVLSRMLISPPQNFKFHQFCEKVMLTHLTFANDLMIFCAADNYSWSFIKETIKRFGDLSGLFANLDKNSIFLVGVNSAEASWLA
        +RQGDPLS FLFVMVMEVLSRML   PQ+F+FH  CEKV LT+LTFA+DLMIFCAAD  S  FI+E +++FG+LSGLFAN  K+SIF+ GVN+  AS LA
Subjt:  LRQGDPLSQFLFVMVMEVLSRMLISPPQNFKFHQFCEKVMLTHLTFANDLMIFCAADNYSWSFIKETIKRFGDLSGLFANLDKNSIFLVGVNSAEASWLA

Query:  ANMGFTIGHLHVRYLGLPLLLGRLRSSDCDPLIQRITSRIRSWSARVLSFAGRLQLVRSVLMSLQ-----------------------------KEGRGV
        A MGF  G+L +RYLGLPLL GRLRS+D  PLIQRITSRIRSW+ARVLSFAGRLQLVR VL SLQ                             +EGRG 
Subjt:  ANMGFTIGHLHVRYLGLPLLLGRLRSSDCDPLIQRITSRIRSWSARVLSFAGRLQLVRSVLMSLQ-----------------------------KEGRGV

Query:  AKVAWDEVYLPFDEGGLAIRDGSSWNIASTLKILGLLLVKSGSLWVAWVEAYILKGRLSWEIDARVGRSWYFRAILRKRDILKAHVKMEI----------
         KVAW +V LPF+E GL IRDG SWNIASTLKIL L+L  SGSLWVAWV+AYILKGR  W++D+RVGRSW  RAILRKR+ LK  V M++          
Subjt:  AKVAWDEVYLPFDEGGLAIRDGSSWNIASTLKILGLLLVKSGSLWVAWVEAYILKGRLSWEIDARVGRSWYFRAILRKRDILKAHVKMEI----------

Query:  ----QGGPIIQQFGERVIYDAGIRRDARLVDLMGRDGEIQGLRSSPSVEDRW----------VWE--IWS-------RILLIMSSSHRIG-------YWG
             G  I++Q GERV+YDA   R+ARL D +  DGE    R   S+   W          +W+  +W             ++   R+G       YW 
Subjt:  ----QGGPIIQQFGERVIYDAGIRRDARLVDLMGRDGEIQGLRSSPSVEDRW----------VWE--IWS-------RILLIMSSSHRIG-------YWG

Query:  VQLSWICNQGIGKGVRKKLWRLLWCATIYFIWQERNHRLHGGAVWEPMVIFQLIRSCIKVRATSWFDGVH
                    + VR+KLWR+LWCATIYFIW ERNHRLHGG   +P+VIF LI S I+ R  SW +  H
Subjt:  VQLSWICNQGIGKGVRKKLWRLLWCATIYFIWQERNHRLHGGAVWEPMVIFQLIRSCIKVRATSWFDGVH

KAA0062318.1 uncharacterized protein E6C27_scaffold154G00690 [Cucumis melo var. makuwa]1.2e-17853.89Show/hide
Query:  MRQKSRIRWLKLGNKNTAFFYRSVRSRQSSNVLQSFIDPDGNRLTNHDQ------EIVQFRWTEECCQALQAPIGREEVRRVLFSMDSGKAPSQMG----
        +RQKSRIRWLKLG++NTAFF+RSVRSR S N L+S +D DG R   + +      +IVQF+W+EECCQALQ PI REEVRRVLFSMDSGKAP   G    
Subjt:  MRQKSRIRWLKLGNKNTAFFYRSVRSRQSSNVLQSFIDPDGNRLTNHDQ------EIVQFRWTEECCQALQAPIGREEVRRVLFSMDSGKAPSQMG----

Query:  ------IRLASSKELGRWLERIYVMSSYASLSSITSLKGGIRLRLLLFLKGMVLIGWRISGLYL-ASAFIPGRSIIDNILICQEVVEGYHLNT-------
              I L         LE    +S   +L    S     RLR            W  S +    SAFIPGRSII+NIL+CQE+V GYHLN+       
Subjt:  ------IRLASSKELGRWLERIYVMSSYASLSSITSLKGGIRLRLLLFLKGMVLIGWRISGLYL-ASAFIPGRSIIDNILICQEVVEGYHLNT-------

Query:  ------AYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKRLRQGDPLSQFLFVMVMEVLSRMLISPPQNFKFHQFCEKVML
              AYDSVNWDFLFGLLIAIGTPL+FVSW+RACVTSPMFSIMINGSLEGFFHGRK +RQGDPLS FLFVMVMEVLSRML   PQ+F FH  CEKV L
Subjt:  ------AYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKRLRQGDPLSQFLFVMVMEVLSRMLISPPQNFKFHQFCEKVML

Query:  THLTFANDLMIFCAADNYSWSFIKETIKRFGDLSGLFANLDKNSIFLVGVNSAEASWLAANMGFTIGHLHVRYLGLPLLLGRLRSSDCDPLIQRITSRIR
        THLTFA+DLMIFCAA+  S  FI+E +++FG+LSGLFAN  K+SIF+ GVN+  AS LA  MGF  G+L VRYLGLPLL GRLRS+D  PLIQRITSRIR
Subjt:  THLTFANDLMIFCAADNYSWSFIKETIKRFGDLSGLFANLDKNSIFLVGVNSAEASWLAANMGFTIGHLHVRYLGLPLLLGRLRSSDCDPLIQRITSRIR

Query:  SWSARVLSFAGRLQLVRSVLMSLQ-----------------------------KEGRGVAKVAWDEVYLPFDEGGLAIRDGSSWNIASTLKILGLLLVKS
        SW+ARVLSFAGRLQLV SVL S Q                             +EGRG  KVAW +V LPF+EGGL IRDG SWNIASTLKIL L+L  S
Subjt:  SWSARVLSFAGRLQLVRSVLMSLQ-----------------------------KEGRGVAKVAWDEVYLPFDEGGLAIRDGSSWNIASTLKILGLLLVKS

Query:  GSLWVAWVEAYILKGRLSWEIDARVGRSWYFRAILRKRDILKAHVKMEIQG--------------GPIIQQFGERVIYDAGIRRDARLVDLMGRDGE---
        GSLWVAWVEAYILKGR  W++D+RVG+SW  RAILRKR+ LK  V+M++                G I++Q GERV+YDA  RR ARL D +  DGE   
Subjt:  GSLWVAWVEAYILKGRLSWEIDARVGRSWYFRAILRKRDILKAHVKMEIQG--------------GPIIQQFGERVIYDAGIRRDARLVDLMGRDGE---

Query:  -------------IQGLRSSPSVEDRWVWEIWSRILLIMSSSHRIGYWGVQLSWICNQGIGKGVRKKLWRLLWCATIYFIWQERNHRLHGGAVW
                     +Q +    SV D WVW              R G + +  +W       + VR +  R+LW      +W   N + H    W
Subjt:  -------------IQGLRSSPSVEDRWVWEIWSRILLIMSSSHRIGYWGVQLSWICNQGIGKGVRKKLWRLLWCATIYFIWQERNHRLHGGAVW

XP_031737043.1 uncharacterized protein LOC116402131 [Cucumis sativus]3.8e-21256.8Show/hide
Query:  RIRWLKLGNKNTAFFYR-----------SVRSRQSSNVLQSFIDPDGNRLTNHDQEIVQFRWTEECCQALQAPIGREEVRRVLFSMDSGKAPSQMGIRLA
        R+ W +L  +    FYR            V++RQS N L S IDPDGNRLTNHDQ            QALQ PIGREEVRR LFSMDSGKAP   G  + 
Subjt:  RIRWLKLGNKNTAFFYR-----------SVRSRQSSNVLQSFIDPDGNRLTNHDQEIVQFRWTEECCQALQAPIGREEVRRVLFSMDSGKAPSQMGIRLA

Query:  SSKELGRWLERIYVMSSY--ASLSSITSLKGGIRLRLLLFLKGMVLIGWRISGLYL--------------ASAFIPGRSIIDNILICQEVVEGYHLN---
          K  G W    Y   +    +++ I    G  RL     +    +I   IS +                 SAFIPGRSIIDNIL+CQE+V GYHL+   
Subjt:  SSKELGRWLERIYVMSSY--ASLSSITSLKGGIRLRLLLFLKGMVLIGWRISGLYL--------------ASAFIPGRSIIDNILICQEVVEGYHLN---

Query:  ----------TAYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKRLRQGDPLSQFLFVMVMEVLSRMLISPPQNFKFHQFC
                   AYDSVNWDFLFGLLIAIG  +RFVSWVRAC TS MFSI+INGSLEGFFHGRK LRQGDPLS FLFVMVMEVLSRML  PPQNF+FHQFC
Subjt:  ----------TAYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKRLRQGDPLSQFLFVMVMEVLSRMLISPPQNFKFHQFC

Query:  EKVMLTHLTFANDLMIFCAADNYSWSFIKETIKRFGDLSGLFANLDKNSIFLVGVNSAEASWLAANMGFTIGHLHVRYLGLPLLLGRLRSSDCDPLIQRI
        EKV LTHLTFA+DLMIFCAADNYS SFIKETIKRFG+LSGLFANL K+SIFLVGVNS++AS LAANMGF+IGHL VRYLGLPLL GRL+S DCDPLIQRI
Subjt:  EKVMLTHLTFANDLMIFCAADNYSWSFIKETIKRFGDLSGLFANLDKNSIFLVGVNSAEASWLAANMGFTIGHLHVRYLGLPLLLGRLRSSDCDPLIQRI

Query:  TSRIRSWSARVLSFAGRLQLVRSVLMSLQ-----------------------------KEGRGVAKVAWDEVYLPFDEGGLAIRDGSSWNIASTLKILGL
        TSRIRSWSARVLSFAGRLQLVRSVL SLQ                             +EGRG AKVAWDEV LPFDEGGLAIRDGSSWNIASTLKIL L
Subjt:  TSRIRSWSARVLSFAGRLQLVRSVLMSLQ-----------------------------KEGRGVAKVAWDEVYLPFDEGGLAIRDGSSWNIASTLKILGL

Query:  LLVKSGSLWVAWVEAYILKGR--LSWEIDARVGRSWY----FRAILRKRDILKAHVKME-IQGGPIIQQFGERVIYDAGIRRDARLVDLMGRDGE-----
        LLVKSGSLWVAWVEAYILKGR  L W     +G S       + ILR R  +   ++   IQGG IIQQFGERVIYDAG RRDARLVD M RDG+     
Subjt:  LLVKSGSLWVAWVEAYILKGR--LSWEIDARVGRSWY----FRAILRKRDILKAHVKME-IQGGPIIQQFGERVIYDAGIRRDARLVDLMGRDGE-----

Query:  -----------IQGLRSSPSVEDRWV--------------------------------------------------------------------------
                   IQG+R S SVEDRWV                                                                          
Subjt:  -----------IQGLRSSPSVEDRWV--------------------------------------------------------------------------

Query:  ---------------WEIWSRILLIMSSSHRIGYWGVQLSWICNQGIGKGVRKKLWRLLWCATIYFIWQERNHRLHGGAVWEPMVIF
                       WEIWSRILL MSSSHRIGYWGV+LSWICNQGIGKGVR+KLW LLWCATIYFIW+ERNH LHGGAV EPM+ F
Subjt:  ---------------WEIWSRILLIMSSSHRIGYWGVQLSWICNQGIGKGVRKKLWRLLWCATIYFIWQERNHRLHGGAVWEPMVIF

TrEMBL top hitse value%identityAlignment
A0A5A7TKU4 Non-LTR retroelement reverse transcriptase-like protein5.4e-17247.55Show/hide
Query:  QKSRIRWLKLGNKNTAFFYRSVRSRQSSNVLQSFIDPDGNRLTNHDQEIVQFRWTEECCQALQAPIGREEVRRVLFSMDSGKAPSQMGIRLASSKELGRW
        QKSRIRWLKLG++N  FF+RS+      N L S  +     L+     IVQFRW++EC QALQ PI  EEVRRVLFSMDSGKAP   G  +   K +   
Subjt:  QKSRIRWLKLGNKNTAFFYRSVRSRQSSNVLQSFIDPDGNRLTNHDQEIVQFRWTEECCQALQAPIGREEVRRVLFSMDSGKAPSQMGIRLASSKELGRW

Query:  LERIYVMSSYASLSSITSLKGGIRLRLLLFLKGMVLIGWRISGLYLASAFIPGRSIIDNILICQEVVEGYHLNT-------------AYDSVNWDFLFGL
        +           ++ I    G  R+     +    +I   IS +      +   + I         V GYHLN+             AYDS+NWDFLFGL
Subjt:  LERIYVMSSYASLSSITSLKGGIRLRLLLFLKGMVLIGWRISGLYLASAFIPGRSIIDNILICQEVVEGYHLNT-------------AYDSVNWDFLFGL

Query:  LIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKRLRQGDPLSQFLFVMVMEVLSRMLISPPQNFKFHQFCEKVMLTHLTFANDLMIFCAADNYS
        LIAI TPL+FVSW++AC+TSPMFSIMINGSLEGFFHGRK +RQG+PLS F FVMVM+V SRML  PPQ F+FHQ CEKV LT LTFA+DLMIFC AD  S
Subjt:  LIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKRLRQGDPLSQFLFVMVMEVLSRMLISPPQNFKFHQFCEKVMLTHLTFANDLMIFCAADNYS

Query:  WSFIKETIKRFGDLSGLFANLDKNSIFLVGVNSAEASWLAANMGFTIGHLHVRYLGLPLLLGRLRSSDCDPLIQRITSRIRSWSARVLSFAGRLQLVRSV
         SF++ET+++FG+L GL+ANL K SIF+ G  +  AS LAANMGF +G+L VRYL LPLL GRLR SDC PLIQRITSRIRSW+ARVLSFAGR QLVRSV
Subjt:  WSFIKETIKRFGDLSGLFANLDKNSIFLVGVNSAEASWLAANMGFTIGHLHVRYLGLPLLLGRLRSSDCDPLIQRITSRIRSWSARVLSFAGRLQLVRSV

Query:  LMSLQKEGRGVAKVAWDEVY-LPFDEGGLA----------IRDGSSWNIASTLKILGLLLVKSGSLWVAWVEAYILKGRLSWEIDARVGRSWYFRAILRK
          SLQ        V W  V+ LP                 +RDG SWNI STLKIL LL   S SLWVAWVEAYILK +  W +D+ V RSW  RAILRK
Subjt:  LMSLQKEGRGVAKVAWDEVY-LPFDEGGLA----------IRDGSSWNIASTLKILGLLLVKSGSLWVAWVEAYILKGRLSWEIDARVGRSWYFRAILRK

Query:  RDILKAHVKMEI--------------QGGPIIQQFGERVIYDAGIRRDARLVDLMGRDGE----------------IQGLRSSPSVEDRWVW--------
        RD LK HV +E+              QG PI++Q GERV+YDA  RR+ARL + +G DGE                +Q +R   SV DRWVW        
Subjt:  RDILKAHVKMEI--------------QGGPIIQQFGERVIYDAGIRRDARLVDLMGRDGE----------------IQGLRSSPSVEDRWVW--------

Query:  ---------------------------------------------------------------------------------EIWSRILLIMSSSHRIGYW
                                                                                         ++WSR+L +M+SSHRI YW
Subjt:  ---------------------------------------------------------------------------------EIWSRILLIMSSSHRIGYW

Query:  GVQLSWICNQGIGKGVRKKLWRLLWCATIYFIWQERNHRLHGGAVWEPMVIFQLIRSCIKVRATSWFDGVHGLI
        GV+LSWIC+QGIG  VR+KLWR+L CAT YFIW+E NHRLHGG     ++IFQ I +CI+ R  SW +  H LI
Subjt:  GVQLSWICNQGIGKGVRKKLWRLLWCATIYFIWQERNHRLHGGAVWEPMVIFQLIRSCIKVRATSWFDGVHGLI

A0A5A7TZS0 Reverse transcriptase domain-containing protein7.5e-19047.41Show/hide
Query:  MRQKSRIRWLKLGNKNTAFFYRSVRSRQSSNVLQSFIDPDGNRLTNHD----------------------------QEIVQFRWTEECCQALQAPIGREE
        +RQKS++RWL LG++NTAFF+RSVRSR S N L S +D DG+R+++HD                             +IVQF+W+EECCQALQ PI REE
Subjt:  MRQKSRIRWLKLGNKNTAFFYRSVRSRQSSNVLQSFIDPDGNRLTNHD----------------------------QEIVQFRWTEECCQALQAPIGREE

Query:  VRRVLFSMDSGKAPSQMGIRLASSKELGRW--------------LERIY--VMSSYASLSSITSLKGGIRL----------------------RLLLFLK
        VRRVLFSMDSGKAP   G  +   K  G W               E  Y  +  +  +++ I    G  RL                      RL L+L 
Subjt:  VRRVLFSMDSGKAPSQMGIRLASSKELGRW--------------LERIY--VMSSYASLSSITSLKGGIRL----------------------RLLLFLK

Query:  GMVLIGWRISGLYLASAFIPGRSIIDNILICQEVVEGYHLNT-------------AYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLE
          +            SAFIPGRSII+NIL+CQE+V GYHLN+             AYDSVNWDFLFGLLIAIGTPL+FVSW+RACVTS MFSIMINGSLE
Subjt:  GMVLIGWRISGLYLASAFIPGRSIIDNILICQEVVEGYHLNT-------------AYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLE

Query:  GFFHGRKRLRQGDPLSQFLFVMVMEVLSRMLISPPQNFKFHQFCEKVMLTHLTFANDLMIFCAADNYSWSFIKETIKRFGDLSGLFANLDKNSIFLVGVN
        GFF+GRK LRQGDPLS FLFVMVMEVLSRML   PQ+F+FH  CEKV LTHLTFA+DLMIFCAAD  S SFI+E +++FG+ SGLFAN  K+SIF+VGVN
Subjt:  GFFHGRKRLRQGDPLSQFLFVMVMEVLSRMLISPPQNFKFHQFCEKVMLTHLTFANDLMIFCAADNYSWSFIKETIKRFGDLSGLFANLDKNSIFLVGVN

Query:  SAEASWLAANMGFTIGHLHVRYLGLPLLLG-------RLRSSDCDPLIQRITSRIRSWSARVLSFAGRLQLVRSVLMSLQ--------------------
        +  AS LAA +G +           P  L         LRS DC PLIQRITSRIRSW+ARVLSFAGRLQLVRSVL SLQ                    
Subjt:  SAEASWLAANMGFTIGHLHVRYLGLPLLLG-------RLRSSDCDPLIQRITSRIRSWSARVLSFAGRLQLVRSVLMSLQ--------------------

Query:  ---------KEGRGVAKVAWDEVYLPFDEGGLAIRDGSSWNIASTLKILGLLLVKSGSLWVAWVEAYILKGRLSWEIDARVGRSWYFRAILRKRDILKAH
                 +EGRG  KVAW +V LPF+EGGL IRDG SWNIA+TLKI   LL   GSLWVAW+EAYILKG+  W++D+RVGRSW  RAILRKR+ +K H
Subjt:  ---------KEGRGVAKVAWDEVYLPFDEGGLAIRDGSSWNIASTLKILGLLLVKSGSLWVAWVEAYILKGRLSWEIDARVGRSWYFRAILRKRDILKAH

Query:  VKMEIQGGPIIQQFGERVIYDAGIRRDARLVDLMGRDGE----------------IQGLRSSPSVEDRWVW-----------------------------
        V             GERV+YDA  RR+A+L D +  +GE                +Q +    SV D WVW                             
Subjt:  VKMEIQGGPIIQQFGERVIYDAGIRRDARLVDLMGRDGE----------------IQGLRSSPSVEDRWVW-----------------------------

Query:  ------------------------------------------------------------EIWSRILLIMSSSHRIGYWGVQLSWICNQGIGKGVRKKLW
                                                                    ++WSR+  IM SSHRIG+WGV+LSWIC++GIGKGVR+KLW
Subjt:  ------------------------------------------------------------EIWSRILLIMSSSHRIGYWGVQLSWICNQGIGKGVRKKLW

Query:  RLLWCATIYFIWQERNHRLHGGAVWEPMVIFQLIRSCIKVRATSWFDGVH
        R+LWCATIYFIW ERNHRLHGG   +P+++F LI + I+ RA SW +  H
Subjt:  RLLWCATIYFIWQERNHRLHGGAVWEPMVIFQLIRSCIKVRATSWFDGVH

A0A5A7UP65 Reverse transcriptase1.5e-16948.68Show/hide
Query:  KNTAFFYRSVRSRQSSN--VLQSFIDPDGNRLTNHD----------QEIVQFRWTEECCQALQAPIGREEVRRVLFSMDSGKAPSQMGIRLASSKELGRW
        +N  F   SV SR SS+  V Q  ++   N L + +           +IVQF+W+EECCQALQ PI REEVRRVLFSMDSGKAP   G  +   K  G W
Subjt:  KNTAFFYRSVRSRQSSN--VLQSFIDPDGNRLTNHD----------QEIVQFRWTEECCQALQAPIGREEVRRVLFSMDSGKAPSQMGIRLASSKELGRW

Query:  --------------LERIY--VMSSYASLSSITSLKGGIRLRLLLFLKGMVLIGWRISGLYL--------------ASAFIPGRSIIDNILICQEVVEGY
                       E  Y  V  +  +++ I    G  RL     +    ++   IS +                 SAFI GRSII+NIL+CQE+V GY
Subjt:  --------------LERIY--VMSSYASLSSITSLKGGIRLRLLLFLKGMVLIGWRISGLYL--------------ASAFIPGRSIIDNILICQEVVEGY

Query:  HLNT-------------AYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKRLRQGDPLSQFLFVMVMEVLSRMLISPPQNF
        HLN+             AYDSVNWDFLFGL I+I TPL+FVSW+ ACVTSPMFSIMINGSLEGFFHGRK +RQGDPLS FLFVMVMEVLSRML   PQ+F
Subjt:  HLNT-------------AYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKRLRQGDPLSQFLFVMVMEVLSRMLISPPQNF

Query:  KFHQFCEKVMLTHLTFANDLMIFCAADNYSWSFIKETIKRFGDLSGLFANLDKNSIFLVGVNSAEASWLAANMGFTIGHLHVRYLGLPLLLGRLRSSDCD
        +FH  CE                               KRFG+LSGLFAN  K+SIF+ GVN+  AS LAA MGF  G+L VRYLGLPLL GRLRS+DC 
Subjt:  KFHQFCEKVMLTHLTFANDLMIFCAADNYSWSFIKETIKRFGDLSGLFANLDKNSIFLVGVNSAEASWLAANMGFTIGHLHVRYLGLPLLLGRLRSSDCD

Query:  PLIQRITSRIRSWSARVLSFAGRLQLVRSVLMSLQKEGRGVAKVAWDEV-----YLPFDEGGLAIRDGSSWNIASTLKILGLLLVKSGSLWVAWVEAYIL
        PLIQRITSRIRS SARVLSFAGRLQLV SVL SLQ        V W  V     Y+  +EGGL IRDG++W  ASTLKIL L+L  SGSLWVAWVEAY+L
Subjt:  PLIQRITSRIRSWSARVLSFAGRLQLVRSVLMSLQKEGRGVAKVAWDEV-----YLPFDEGGLAIRDGSSWNIASTLKILGLLLVKSGSLWVAWVEAYIL

Query:  KGRLSWEIDARVGRSWYFRAILRKRDILKAHVKMEI--------------QGGPIIQQFGERVIYDAGIRRDARLVDLMGRDGEIQGLRSSPSVEDRW--
        KGR  W++D+RVGRSW  RAILRK++ LK HV+M++              Q G I++Q GERV+YDA  RR+A L + +G DGE    R   S+   W  
Subjt:  KGRLSWEIDARVGRSWYFRAILRKRDILKAHVKMEI--------------QGGPIIQQFGERVIYDAGIRRDARLVDLMGRDGEIQGLRSSPSVEDRW--

Query:  -------------VW----------------------------------------------------------EIWSRILLIMSSSHRIGYWGVQLSWIC
                     +W                                                          ++WSR+L IM+SSHRIG+WGV+LSWIC
Subjt:  -------------VW----------------------------------------------------------EIWSRILLIMSSSHRIGYWGVQLSWIC

Query:  NQGIGKGVRKKLWRLLWCATIYFIWQERNHRLHGGAVWEPMVIFQLIRSCIKVRATSW
        +QGI KGVR+KLWR+LWCATIYFIW ERNHRLHGG   +P+VIF LI + I+ RA SW
Subjt:  NQGIGKGVRKKLWRLLWCATIYFIWQERNHRLHGGAVWEPMVIFQLIRSCIKVRATSW

A0A5A7UV01 F17F8.52.3e-17048.18Show/hide
Query:  MRQKSRIRWLKLGNKNTAFFYRSVRSRQSSNVLQSFIDPDGNRLTNHD----------------------------QEIVQFRWTEECCQALQAPIGREE
        +RQKSRIRWLKLG++NT FF+RSV SR S N L+S +D DG R+++HD                             +IVQF+W+EECCQALQ PI REE
Subjt:  MRQKSRIRWLKLGNKNTAFFYRSVRSRQSSNVLQSFIDPDGNRLTNHD----------------------------QEIVQFRWTEECCQALQAPIGREE

Query:  VRRVLFSMDSGKAPSQMGIRLASSKELGRW--------------LERIY--VMSSYASLSSITSLKGGIRLRLLLFLKGMVLIGWRISGLYL--------
        VRRVLFSMDSGKAP   G      K  G W               E  Y  V  +  +++ I    G  RL     +    ++   IS +          
Subjt:  VRRVLFSMDSGKAPSQMGIRLASSKELGRW--------------LERIY--VMSSYASLSSITSLKGGIRLRLLLFLKGMVLIGWRISGLYL--------

Query:  ------ASAFIPGRSIIDNILICQEVVEGYHLNT-------------AYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKR
               SAF+PGRSII+NIL+CQE+V GYHLN+             AYDSVNWDFLFGLLIAIGTPL+F                              
Subjt:  ------ASAFIPGRSIIDNILICQEVVEGYHLNT-------------AYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKR

Query:  LRQGDPLSQFLFVMVMEVLSRMLISPPQNFKFHQFCEKVMLTHLTFANDLMIFCAADNYSWSFIKETIKRFGDLSGLFANLDKNSIFLVGVNSAEASWLA
        +RQGDPLS FLFVMVMEVLSRML   PQ+F+FH  CEKV LT+LTFA+DLMIFCAAD  S  FI+E +++FG+LSGLFAN  K+SIF+ GVN+  AS LA
Subjt:  LRQGDPLSQFLFVMVMEVLSRMLISPPQNFKFHQFCEKVMLTHLTFANDLMIFCAADNYSWSFIKETIKRFGDLSGLFANLDKNSIFLVGVNSAEASWLA

Query:  ANMGFTIGHLHVRYLGLPLLLGRLRSSDCDPLIQRITSRIRSWSARVLSFAGRLQLVRSVLMSLQ-----------------------------KEGRGV
        A MGF  G+L +RYLGLPLL GRLRS+D  PLIQRITSRIRSW+ARVLSFAGRLQLVR VL SLQ                             +EGRG 
Subjt:  ANMGFTIGHLHVRYLGLPLLLGRLRSSDCDPLIQRITSRIRSWSARVLSFAGRLQLVRSVLMSLQ-----------------------------KEGRGV

Query:  AKVAWDEVYLPFDEGGLAIRDGSSWNIASTLKILGLLLVKSGSLWVAWVEAYILKGRLSWEIDARVGRSWYFRAILRKRDILKAHVKMEI----------
         KVAW +V LPF+E GL IRDG SWNIASTLKIL L+L  SGSLWVAWV+AYILKGR  W++D+RVGRSW  RAILRKR+ LK  V M++          
Subjt:  AKVAWDEVYLPFDEGGLAIRDGSSWNIASTLKILGLLLVKSGSLWVAWVEAYILKGRLSWEIDARVGRSWYFRAILRKRDILKAHVKMEI----------

Query:  ----QGGPIIQQFGERVIYDAGIRRDARLVDLMGRDGEIQGLRSSPSVEDRW----------VWE--IWS-------RILLIMSSSHRIG-------YWG
             G  I++Q GERV+YDA   R+ARL D +  DGE    R   S+   W          +W+  +W             ++   R+G       YW 
Subjt:  ----QGGPIIQQFGERVIYDAGIRRDARLVDLMGRDGEIQGLRSSPSVEDRW----------VWE--IWS-------RILLIMSSSHRIG-------YWG

Query:  VQLSWICNQGIGKGVRKKLWRLLWCATIYFIWQERNHRLHGGAVWEPMVIFQLIRSCIKVRATSWFDGVH
                    + VR+KLWR+LWCATIYFIW ERNHRLHGG   +P+VIF LI S I+ R  SW +  H
Subjt:  VQLSWICNQGIGKGVRKKLWRLLWCATIYFIWQERNHRLHGGAVWEPMVIFQLIRSCIKVRATSWFDGVH

A0A5A7V3Z0 Reverse transcriptase domain-containing protein6.0e-17953.89Show/hide
Query:  MRQKSRIRWLKLGNKNTAFFYRSVRSRQSSNVLQSFIDPDGNRLTNHDQ------EIVQFRWTEECCQALQAPIGREEVRRVLFSMDSGKAPSQMG----
        +RQKSRIRWLKLG++NTAFF+RSVRSR S N L+S +D DG R   + +      +IVQF+W+EECCQALQ PI REEVRRVLFSMDSGKAP   G    
Subjt:  MRQKSRIRWLKLGNKNTAFFYRSVRSRQSSNVLQSFIDPDGNRLTNHDQ------EIVQFRWTEECCQALQAPIGREEVRRVLFSMDSGKAPSQMG----

Query:  ------IRLASSKELGRWLERIYVMSSYASLSSITSLKGGIRLRLLLFLKGMVLIGWRISGLYL-ASAFIPGRSIIDNILICQEVVEGYHLNT-------
              I L         LE    +S   +L    S     RLR            W  S +    SAFIPGRSII+NIL+CQE+V GYHLN+       
Subjt:  ------IRLASSKELGRWLERIYVMSSYASLSSITSLKGGIRLRLLLFLKGMVLIGWRISGLYL-ASAFIPGRSIIDNILICQEVVEGYHLNT-------

Query:  ------AYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKRLRQGDPLSQFLFVMVMEVLSRMLISPPQNFKFHQFCEKVML
              AYDSVNWDFLFGLLIAIGTPL+FVSW+RACVTSPMFSIMINGSLEGFFHGRK +RQGDPLS FLFVMVMEVLSRML   PQ+F FH  CEKV L
Subjt:  ------AYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKRLRQGDPLSQFLFVMVMEVLSRMLISPPQNFKFHQFCEKVML

Query:  THLTFANDLMIFCAADNYSWSFIKETIKRFGDLSGLFANLDKNSIFLVGVNSAEASWLAANMGFTIGHLHVRYLGLPLLLGRLRSSDCDPLIQRITSRIR
        THLTFA+DLMIFCAA+  S  FI+E +++FG+LSGLFAN  K+SIF+ GVN+  AS LA  MGF  G+L VRYLGLPLL GRLRS+D  PLIQRITSRIR
Subjt:  THLTFANDLMIFCAADNYSWSFIKETIKRFGDLSGLFANLDKNSIFLVGVNSAEASWLAANMGFTIGHLHVRYLGLPLLLGRLRSSDCDPLIQRITSRIR

Query:  SWSARVLSFAGRLQLVRSVLMSLQ-----------------------------KEGRGVAKVAWDEVYLPFDEGGLAIRDGSSWNIASTLKILGLLLVKS
        SW+ARVLSFAGRLQLV SVL S Q                             +EGRG  KVAW +V LPF+EGGL IRDG SWNIASTLKIL L+L  S
Subjt:  SWSARVLSFAGRLQLVRSVLMSLQ-----------------------------KEGRGVAKVAWDEVYLPFDEGGLAIRDGSSWNIASTLKILGLLLVKS

Query:  GSLWVAWVEAYILKGRLSWEIDARVGRSWYFRAILRKRDILKAHVKMEIQG--------------GPIIQQFGERVIYDAGIRRDARLVDLMGRDGE---
        GSLWVAWVEAYILKGR  W++D+RVG+SW  RAILRKR+ LK  V+M++                G I++Q GERV+YDA  RR ARL D +  DGE   
Subjt:  GSLWVAWVEAYILKGRLSWEIDARVGRSWYFRAILRKRDILKAHVKMEIQG--------------GPIIQQFGERVIYDAGIRRDARLVDLMGRDGE---

Query:  -------------IQGLRSSPSVEDRWVWEIWSRILLIMSSSHRIGYWGVQLSWICNQGIGKGVRKKLWRLLWCATIYFIWQERNHRLHGGAVW
                     +Q +    SV D WVW              R G + +  +W       + VR +  R+LW      +W   N + H    W
Subjt:  -------------IQGLRSSPSVEDRWVWEIWSRILLIMSSSHRIGYWGVQLSWICNQGIGKGVRKKLWRLLWCATIYFIWQERNHRLHGGAVW

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein2.4e-1526.73Show/hide
Query:  AYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKRLRQGDPLSQFLFVMVMEVLSRMLISPPQNFKFHQFCEKVMLTHLTFA
        A+D +   F+   L  +G    ++  +RA    P  +I++NG     F  +   RQG PLS  LF +V+EVL+R +    +        E+V L+   FA
Subjt:  AYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKRLRQGDPLSQFLFVMVMEVLSRMLISPPQNFKFHQFCEKVMLTHLTFA

Query:  NDLMIFCAADNYSWSFIKETIKRFGDLSGLFANLDKNSIFLVGVNSAEASWLAANMGFTIGHLHVRYLGLPLL--LGRLRSSDCDPLIQRITSRIRSWSA
        +D++++      S   + + I  F  +SG   N+ K+  FL   N    S +   + FTI    ++YLG+ L   +  L   +  PL++ I      W  
Subjt:  NDLMIFCAADNYSWSFIKETIKRFGDLSGLFANLDKNSIFLVGVNSAEASWLAANMGFTIGHLHVRYLGLPLL--LGRLRSSDCDPLIQRITSRIRSWSA

Query:  RVLSFAGRLQLVRSVLM
           S+ GR+ +V+  ++
Subjt:  RVLSFAGRLQLVRSVLM

P08548 LINE-1 reverse transcriptase homolog9.9e-1426.27Show/hide
Query:  AYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKRLRQGDPLSQFLFVMVMEVLSRMLISPPQNFKFHQFCEKVMLTHLTFA
        A+D++   F+   L  IG    F+  + A  + P  +I++NG     F  R   RQG PLS  LF +VMEVL+  +         H   E++ L+   FA
Subjt:  AYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKRLRQGDPLSQFLFVMVMEVLSRMLISPPQNFKFHQFCEKVMLTHLTFA

Query:  NDLMIFCAADNYSWSFIKETIKRFGDLSGLFANLDKNSIFLVGVNSAEASWLAANMGFTIGHLHVRYLGLPLL--LGRLRSSDCDPLIQRITSRIRSWSA
        +D++++      S + + E IK + ++SG   N  K+  F+   N+     +  ++ FT+    ++YLG+ L   +  L   + + L + I   +  W  
Subjt:  NDLMIFCAADNYSWSFIKETIKRFGDLSGLFANLDKNSIFLVGVNSAEASWLAANMGFTIGHLHVRYLGLPLL--LGRLRSSDCDPLIQRITSRIRSWSA

Query:  RVLSFAGRLQLVRSVLM
           S+ GR+ +V+  ++
Subjt:  RVLSFAGRLQLVRSVLM

P0C2F6 Putative ribonuclease H protein At1g657505.8e-0628.68Show/hide
Query:  LPLLLGRLRSSDCDPLIQRITSRIRSWSARVLSFAGRLQLVRSVLMSL-----------------------------QKEGRGVAKVAWDEVYLPFDEGG
        +P+L  R+       +++R++SR+  W  + LSFAGRL L ++VL S+                               E +    V W +V  P  EGG
Subjt:  LPLLLGRLRSSDCDPLIQRITSRIRSWSARVLSFAGRLQLVRSVLMSL-----------------------------QKEGRGVAKVAWDEVYLPFDEGG

Query:  LAIRDGSSWNIASTLKILGLLLVKSGSLW
        L +R   S N A   K+   LL +  SLW
Subjt:  LAIRDGSSWNIASTLKILGLLLVKSGSLW

P11369 LINE-1 retrotransposable element ORF2 protein7.1e-1224.42Show/hide
Query:  AYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKRLRQGDPLSQFLFVMVMEVLSRMLISPPQNFKFHQFCEKVMLTHLTFA
        A+D +   F+  +L   G    +++ ++A  + P+ +I +NG        +   RQG PLS +LF +V+EVL+R +    +        E+V ++ L  A
Subjt:  AYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKRLRQGDPLSQFLFVMVMEVLSRMLISPPQNFKFHQFCEKVMLTHLTFA

Query:  NDLMIFCAADNYSWSFIKETIKRFGDLSGLFANLDKNSIFLVGVNSAEASWLAANMGFTIGHLHVRYLGLPLL--LGRLRSSDCDPLIQRITSRIRSWSA
        +D++++ +    S   +   I  FG++ G   N +K+  FL   N      +     F+I   +++YLG+ L   +  L   +   L + I   +R W  
Subjt:  NDLMIFCAADNYSWSFIKETIKRFGDLSGLFANLDKNSIFLVGVNSAEASWLAANMGFTIGHLHVRYLGLPLL--LGRLRSSDCDPLIQRITSRIRSWSA

Query:  RVLSFAGRLQLVRSVLM
           S+ GR+ +V+  ++
Subjt:  RVLSFAGRLQLVRSVLM

P16423 Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM2.0e-0625.62Show/hide
Query:  FIPGRSIIDNILICQEVVEGYH------------LNTAYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKRLRQGDPLSQF
        F+P     DN  I   V+   H            ++ A+DS++   ++  L A G P  FV +V+        S+  +G     F   + ++QGDPLS  
Subjt:  FIPGRSIIDNILICQEVVEGYH------------LNTAYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSLEGFFHGRKRLRQGDPLSQF

Query:  LFVMVMEVLSRMLISPPQNFKFHQFCEKVMLTHLTFANDLMIFCAADNYSWSFIKETIKRFGDLSGLFANLDKNSIFLVGVNS--AEASWLAANMGFTIG
        LF +VM+ L R L S     +        +     FA+DL++F A        + +    F  + GL  N DK   F VG+     +   +     F +G
Subjt:  LFVMVMEVLSRMLISPPQNFKFHQFCEKVMLTHLTFANDLMIFCAADNYSWSFIKETIKRFGDLSGLFANLDKNSIFLVGVNS--AEASWLAANMGFTIG

Query:  HLHV---------RYLGLPL-LLGRLR---SSDCDPLIQRIT
           +         +YLG+     GR+R   + D  P +QR+T
Subjt:  HLHV---------RYLGLPL-LLGRLR---SSDCDPLIQRIT

Arabidopsis top hitse value%identityAlignment
AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.3e-1633.01Show/hide
Query:  LVGVNSAEASWLAANMGFTIGHLHVRYLGLPLLLGRLRSSDCDPLIQRITSRIRSWSARVLSFAGRLQLVRSVLMSLQ--------------KEGRGV--
        + GV   + + +  +  F  G L VRYLGLPLL  ++ +SD  PL+++I  RI  W+AR LSFAGRLQL+ SV+ SL               KE   +  
Subjt:  LVGVNSAEASWLAANMGFTIGHLHVRYLGLPLLLGRLRSSDCDPLIQRITSRIRSWSARVLSFAGRLQLVRSVLMSLQ--------------KEGRGV--

Query:  -------------AKVAWDEVYLPFDEGGLAIRDGSSWNIASTLKILGLLLVKSGSLWVAWVEAYILKGRLSWEIDARVGRSWYFRAILRKRDILKAHVK
                     AKVAW +V  P DEGGL IR               L     GS W             S   +  +G SW ++ IL+ R +    VK
Subjt:  -------------AKVAWDEVYLPFDEGGLAIRDGSSWNIASTLKILGLLLVKSGSLWVAWVEAYILKGRLSWEIDARVGRSWYFRAILRKRDILKAHVK

Query:  MEIQGG
         +I  G
Subjt:  MEIQGG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTCAGAAGTCGCGAATCAGATGGCTGAAGCTAGGAAATAAGAATACTGCCTTTTTTTATCGATCTGTTCGATCAAGGCAGAGCAGCAATGTTTTGCAATCATTTAT
TGATCCGGATGGTAATCGATTGACTAATCATGATCAGGAGATTGTTCAATTTAGATGGACTGAGGAGTGCTGCCAAGCCCTTCAGGCGCCAATTGGCAGGGAGGAAGTGA
GACGTGTTCTGTTTTCCATGGATAGTGGAAAGGCTCCAAGCCAGATGGGTATTCGATTGGCTTCTTCAAAGGAGCTTGGACGGTGGTTGGAGAGGATTTATGTGATGTCG
TCATACGCTTCTTTGAGTTCAATTACTTCCCTTAAGGGGGGAATACGACTGCGATTACTCTTATTCCTAAAAGGAATGGTGCTGATCGGTTGGAGAATTTCAGGCCTATA
TCTTGCTTCAGCTTTCATCCCTGGGAGGAGTATTATTGACAATATTCTTATTTGTCAGGAGGTTGTAGAGGGTTACCATTTGAACACAGCTTATGATTCTGTTAATTGGG
ATTTCCTATTTGGCCTACTGATTGCCATTGGTACGCCTTTAAGATTTGTGAGTTGGGTTCGAGCTTGTGTGACCTCTCCGATGTTCTCCATTATGATTAATGGTTCGTTG
GAAGGTTTTTTCCATGGAAGGAAAAGACTTAGACAAGGTGATCCTCTATCCCAGTTCTTATTTGTGATGGTCATGGAGGTGCTATCTCGCATGTTGATTAGCCCGCCTCA
GAATTTTAAATTCCACCAGTTTTGTGAGAAGGTCATGTTAACTCATCTTACTTTTGCGAATGATCTGATGATCTTTTGTGCTGCTGATAATTATTCTTGGAGTTTCATAA
AAGAGACTATTAAGAGGTTTGGTGATCTATCGGGCTTGTTTGCTAATCTTGATAAAAACTCTATTTTCCTTGTAGGGGTTAATAGTGCGGAAGCTTCTTGGCTTGCTGCT
AACATGGGTTTTACCATTGGTCACCTCCATGTTCGTTATCTTGGTCTTCCTCTCCTCTTAGGAAGATTGCGGAGCTCTGATTGTGATCCCCTTATTCAGCGAATTACCAG
TCGTATTCGGTCTTGGTCTGCTAGAGTGTTATCTTTTGCAGGTAGACTTCAGCTTGTTCGCTCAGTCCTTATGAGCCTTCAGAAGGAGGGAAGAGGTGTTGCTAAAGTTG
CCTGGGATGAGGTTTATCTTCCTTTTGATGAAGGAGGTCTTGCTATTCGTGATGGATCGTCTTGGAATATAGCAAGCACGTTGAAGATCTTAGGGTTGCTACTAGTTAAA
TCTGGTAGCTTGTGGGTTGCTTGGGTGGAAGCTTACATCCTTAAAGGGAGATTGTCGTGGGAGATCGATGCTAGGGTGGGTCGATCTTGGTATTTTAGGGCTATCTTGCG
TAAGCGGGATATCCTTAAAGCTCATGTTAAGATGGAGATTCAGGGTGGTCCGATTATCCAGCAGTTTGGGGAGAGGGTGATCTATGATGCAGGTATTCGGCGGGATGCGA
GGCTTGTGGATTTAATGGGTCGGGATGGTGAGATTCAGGGACTGAGGTCGAGTCCGAGTGTTGAGGATAGGTGGGTATGGGAGATTTGGTCTAGAATCTTGTTGATTATG
TCATCTTCTCATAGGATAGGGTATTGGGGTGTTCAGTTGTCTTGGATCTGCAATCAAGGTATTGGGAAGGGTGTGAGGAAAAAACTGTGGCGCCTTCTTTGGTGTGCTAC
AATTTATTTCATTTGGCAGGAGCGAAATCATCGTCTTCATGGAGGTGCAGTTTGGGAGCCTATGGTTATATTCCAACTCATTCGATCATGTATTAAAGTGCGTGCTACTT
CTTGGTTCGATGGAGTTCATGGTCTTATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCGTCAGAAGTCGCGAATCAGATGGCTGAAGCTAGGAAATAAGAATACTGCCTTTTTTTATCGATCTGTTCGATCAAGGCAGAGCAGCAATGTTTTGCAATCATTTAT
TGATCCGGATGGTAATCGATTGACTAATCATGATCAGGAGATTGTTCAATTTAGATGGACTGAGGAGTGCTGCCAAGCCCTTCAGGCGCCAATTGGCAGGGAGGAAGTGA
GACGTGTTCTGTTTTCCATGGATAGTGGAAAGGCTCCAAGCCAGATGGGTATTCGATTGGCTTCTTCAAAGGAGCTTGGACGGTGGTTGGAGAGGATTTATGTGATGTCG
TCATACGCTTCTTTGAGTTCAATTACTTCCCTTAAGGGGGGAATACGACTGCGATTACTCTTATTCCTAAAAGGAATGGTGCTGATCGGTTGGAGAATTTCAGGCCTATA
TCTTGCTTCAGCTTTCATCCCTGGGAGGAGTATTATTGACAATATTCTTATTTGTCAGGAGGTTGTAGAGGGTTACCATTTGAACACAGCTTATGATTCTGTTAATTGGG
ATTTCCTATTTGGCCTACTGATTGCCATTGGTACGCCTTTAAGATTTGTGAGTTGGGTTCGAGCTTGTGTGACCTCTCCGATGTTCTCCATTATGATTAATGGTTCGTTG
GAAGGTTTTTTCCATGGAAGGAAAAGACTTAGACAAGGTGATCCTCTATCCCAGTTCTTATTTGTGATGGTCATGGAGGTGCTATCTCGCATGTTGATTAGCCCGCCTCA
GAATTTTAAATTCCACCAGTTTTGTGAGAAGGTCATGTTAACTCATCTTACTTTTGCGAATGATCTGATGATCTTTTGTGCTGCTGATAATTATTCTTGGAGTTTCATAA
AAGAGACTATTAAGAGGTTTGGTGATCTATCGGGCTTGTTTGCTAATCTTGATAAAAACTCTATTTTCCTTGTAGGGGTTAATAGTGCGGAAGCTTCTTGGCTTGCTGCT
AACATGGGTTTTACCATTGGTCACCTCCATGTTCGTTATCTTGGTCTTCCTCTCCTCTTAGGAAGATTGCGGAGCTCTGATTGTGATCCCCTTATTCAGCGAATTACCAG
TCGTATTCGGTCTTGGTCTGCTAGAGTGTTATCTTTTGCAGGTAGACTTCAGCTTGTTCGCTCAGTCCTTATGAGCCTTCAGAAGGAGGGAAGAGGTGTTGCTAAAGTTG
CCTGGGATGAGGTTTATCTTCCTTTTGATGAAGGAGGTCTTGCTATTCGTGATGGATCGTCTTGGAATATAGCAAGCACGTTGAAGATCTTAGGGTTGCTACTAGTTAAA
TCTGGTAGCTTGTGGGTTGCTTGGGTGGAAGCTTACATCCTTAAAGGGAGATTGTCGTGGGAGATCGATGCTAGGGTGGGTCGATCTTGGTATTTTAGGGCTATCTTGCG
TAAGCGGGATATCCTTAAAGCTCATGTTAAGATGGAGATTCAGGGTGGTCCGATTATCCAGCAGTTTGGGGAGAGGGTGATCTATGATGCAGGTATTCGGCGGGATGCGA
GGCTTGTGGATTTAATGGGTCGGGATGGTGAGATTCAGGGACTGAGGTCGAGTCCGAGTGTTGAGGATAGGTGGGTATGGGAGATTTGGTCTAGAATCTTGTTGATTATG
TCATCTTCTCATAGGATAGGGTATTGGGGTGTTCAGTTGTCTTGGATCTGCAATCAAGGTATTGGGAAGGGTGTGAGGAAAAAACTGTGGCGCCTTCTTTGGTGTGCTAC
AATTTATTTCATTTGGCAGGAGCGAAATCATCGTCTTCATGGAGGTGCAGTTTGGGAGCCTATGGTTATATTCCAACTCATTCGATCATGTATTAAAGTGCGTGCTACTT
CTTGGTTCGATGGAGTTCATGGTCTTATTTAG
Protein sequenceShow/hide protein sequence
MRQKSRIRWLKLGNKNTAFFYRSVRSRQSSNVLQSFIDPDGNRLTNHDQEIVQFRWTEECCQALQAPIGREEVRRVLFSMDSGKAPSQMGIRLASSKELGRWLERIYVMS
SYASLSSITSLKGGIRLRLLLFLKGMVLIGWRISGLYLASAFIPGRSIIDNILICQEVVEGYHLNTAYDSVNWDFLFGLLIAIGTPLRFVSWVRACVTSPMFSIMINGSL
EGFFHGRKRLRQGDPLSQFLFVMVMEVLSRMLISPPQNFKFHQFCEKVMLTHLTFANDLMIFCAADNYSWSFIKETIKRFGDLSGLFANLDKNSIFLVGVNSAEASWLAA
NMGFTIGHLHVRYLGLPLLLGRLRSSDCDPLIQRITSRIRSWSARVLSFAGRLQLVRSVLMSLQKEGRGVAKVAWDEVYLPFDEGGLAIRDGSSWNIASTLKILGLLLVK
SGSLWVAWVEAYILKGRLSWEIDARVGRSWYFRAILRKRDILKAHVKMEIQGGPIIQQFGERVIYDAGIRRDARLVDLMGRDGEIQGLRSSPSVEDRWVWEIWSRILLIM
SSSHRIGYWGVQLSWICNQGIGKGVRKKLWRLLWCATIYFIWQERNHRLHGGAVWEPMVIFQLIRSCIKVRATSWFDGVHGLI