; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G07720 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G07720
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationChr4:5579976..5581676
RNA-Seq ExpressionCSPI04G07720
SyntenyCSPI04G07720
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW43689.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]7.4e-6035.18Show/hide
Query:  RVSDFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWSSKGRKGVLLKLDLEKAYDKVDGSFLDMAMKLK------
        ++SDFRPISL+TSLYK+I+KVL+ RL+ VL   I+ +Q AFV+GRQI+DA+LIA+E+VD+    G +GV+ K+D EKAYD V   FLD  ++ K      
Subjt:  RVSDFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWSSKGRKGVLLKLDLEKAYDKVDGSFLDMAMKLK------

Query:  RLW-----------------PRGKILAKRGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEKLSDDLTHLQYVDDTSFFF------LGGWKLG
        R W                  +G + A RG+RQGDPL+PFLFT+V D LS ++    E+  L+GF   +    ++HLQ+ DDT FF       L   K  
Subjt:  RLW-----------------PRGKILAKRGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEKLSDDLTHLQYVDDTSFFF------LGGWKLG

Query:  DLVEGDQYLSSRSRLPFKYLGFSIGGGRNRKEMWNTLEERFRYKIDRWRNVSLSKGDRLTLVQS--------EGLLIQLPTSSNGIVIPL----------
         L  G       S  P  YLG  +GG       W+ + ER   ++D W+   LS G R+TL+QS           L +LP S    +  L          
Subjt:  DLVEGDQYLSSRSRLPFKYLGFSIGGGRNRKEMWNTLEERFRYKIDRWRNVSLSKGDRLTLVQS--------EGLLIQLPTSSNGIVIPL----------

Query:  ------------IC----YGGLGIGLFRQKNTALLTKWLWR-SEQGK------ILQIMG---------RILK-------------HKETFKFSAFVLRKG
                    +C     GGLG G    +N ALL KWLWR   +G       IL I G          I++              +E   F+ FV+  G
Subjt:  ------------IC----YGGLGIGLFRQKNTALLTKWLWR-SEQGK------ILQIMG---------RILK-------------HKETFKFSAFVLRKG

Query:  TKIRFWKDNWCGVEPLAEKFPNLFSLSLNKDAYVAECWCIV-THSWNLSLRRNMLNNEIVNVASILEIL
         +IRFW+D W G +PL  ++P+LF + L+K+  ++         SWNL+ RRN+ ++EI ++  ++  L
Subjt:  TKIRFWKDNWCGVEPLAEKFPNLFSLSLNKDAYVAECWCIV-THSWNLSLRRNMLNNEIVNVASILEIL

RVW56424.1 putative ribonuclease H protein [Vitis vinifera]1.3e-5933.74Show/hide
Query:  RVSDFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWSSKGRKGVLLKLDLEKAYDKVDGSFLDMAMKLK------
        R+SDFRPISL+TSLYK+I+KVLA R++ VL   I+ +Q AFV+GRQILDA+LIA+E+VD+    G +GV+ K+D EKAYD V   FLD  +++K      
Subjt:  RVSDFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWSSKGRKGVLLKLDLEKAYDKVDGSFLDMAMKLK------

Query:  RLWPRG-----------------KILAKRGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEKLSDDLTHLQYVDDTSFF--------------
        R W  G                  + A RG+RQGDPL+PFLFT+V D LS ++    E+  L+GF   +    ++HLQ+VDDT FF              
Subjt:  RLWPRG-----------------KILAKRGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEKLSDDLTHLQYVDDTSFF--------------

Query:  -----FLGGWKL--------GDLVEGDQYLSSRSRL--------PFKYLGFSIGGGRNRKEMWNTLEERFRYKIDRWRNVSLSKGDRLTLVQS-------
              + G K+        G  +E   +LS  + +        P  YLG  +GG       W+ + ER   ++D W+   LS G R+TL+QS       
Subjt:  -----FLGGWKL--------GDLVEGDQYLSSRSRL--------PFKYLGFSIGGGRNRKEMWNTLEERFRYKIDRWRNVSLSKGDRLTLVQS-------

Query:  -EGLLIQLPTS----------------------SNGIVIPLIC----YGGLGIGLFRQKNTALLTKWLWR-SEQGK------ILQIMGR-----------
            L ++PTS                       + +   ++C     GGLG G    +N ALL KWLWR   +G       IL I G            
Subjt:  -EGLLIQLPTS----------------------SNGIVIPLIC----YGGLGIGLFRQKNTALLTKWLWR-SEQGK------ILQIMGR-----------

Query:  -----------ILKHKETFKFSAFVLRKGTKIRFWKDNWCGVEPLAEKFPNLFSLSLNKDAYVAECWCIV-THSWNLSLRRNMLNNEIVNVASIL
                    L ++E  KF+ FV+  G +IRFW D W G +PL  ++P L S+  +K+A ++         SWN + RRN+ ++EI ++  ++
Subjt:  -----------ILKHKETFKFSAFVLRKGTKIRFWKDNWCGVEPLAEKFPNLFSLSLNKDAYVAECWCIV-THSWNLSLRRNMLNNEIVNVASIL

RVW65579.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]4.4e-6034.14Show/hide
Query:  RVSDFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWSSKGRKGVLLKLDLEKAYDKVDGSFLDMAMKLK------
        R+SDFRPISL+TSLYK+I+KVLA RL+ VL   I+ +Q AFV+GRQILDA+LIA+E+VD+    G +GV+ K+D EKAYD V   FLD  +++K      
Subjt:  RVSDFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWSSKGRKGVLLKLDLEKAYDKVDGSFLDMAMKLK------

Query:  RLWPR-----------------GKILAKRGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEKLSDDLTHLQYVDDTSFF--------------
        R W R                 G + A RG+RQGDPL+PFLFT+V D LS ++    E+  L+GF   +    ++HLQ+ DDT FF              
Subjt:  RLWPR-----------------GKILAKRGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEKLSDDLTHLQYVDDTSFF--------------

Query:  -----FLGGWKL--------GDLVEGDQYLS--------SRSRLPFKYLGFSIGGGRNRKEMWNTLEERFRYKIDRWRNVSLSKGDRLTLVQS-------
              + G K+        G  +E   +LS          S  P  YLG  +GG       W+ + ER   ++D W+   LS G R+TL+QS       
Subjt:  -----FLGGWKL--------GDLVEGDQYLS--------SRSRLPFKYLGFSIGGGRNRKEMWNTLEERFRYKIDRWRNVSLSKGDRLTLVQS-------

Query:  -EGLLIQLPTS----------------------SNGIVIPLIC----YGGLGIGLFRQKNTALLTKWLWR-SEQGK------ILQIMGR-----------
            L ++P S                       + +   ++C     GGLG G    +N ALL KWLWR   +G       IL I G            
Subjt:  -EGLLIQLPTS----------------------SNGIVIPLIC----YGGLGIGLFRQKNTALLTKWLWR-SEQGK------ILQIMGR-----------

Query:  -----------ILKHKETFKFSAFVLRKGTKIRFWKDNWCGVEPLAEKFPNLFSLSLNKDAYVAECWCIV-THSWNLSLRRNMLNNEIVNVASILEIL
                    L  +E  KF+ FV+  G +IRFW D W G +PL  ++P L S+  +K+A ++         SWN + RRN+ ++EI ++ S+++ L
Subjt:  -----------ILKHKETFKFSAFVLRKGTKIRFWKDNWCGVEPLAEKFPNLFSLSLNKDAYVAECWCIV-THSWNLSLRRNMLNNEIVNVASILEIL

RVX11275.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]3.6e-6235.74Show/hide
Query:  RVSDFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWSSKGRKGVLLKLDLEKAYDKVDGSFLDMAMKLK------
        R+SDFRPISL+TSLYK+I+KVLA RL+ VL   I+ +Q AFV+GRQILDA+LIA+E+VD+    G +GV+ K+D EKAYD V   FLD  +++K      
Subjt:  RVSDFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWSSKGRKGVLLKLDLEKAYDKVDGSFLDMAMKLK------

Query:  RLWPR-----------------GKILAKRGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEKLSDDLTHLQYVDDTSFFF------LGGWKLG
        R W R                 G + A RG+RQGDPL+PFLFT+V D LS ++    E+  L+GF   +    ++HLQ+ DDT FF       L   K  
Subjt:  RLWPR-----------------GKILAKRGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEKLSDDLTHLQYVDDTSFFF------LGGWKLG

Query:  DLVEGDQYLSSR-SRLPFKYLGFSIGGGRNRKEMWNTLEERFRYKIDRWRNVSLSKGDRLTLVQS--------EGLLIQLPTSSNG--------------
         L    + L  + S  P  YLG  +GG       W+ + ER   ++D W+   LS G R+TL+QS           L ++P S                 
Subjt:  DLVEGDQYLSSR-SRLPFKYLGFSIGGGRNRKEMWNTLEERFRYKIDRWRNVSLSKGDRLTLVQS--------EGLLIQLPTSSNG--------------

Query:  ------------IVIPLICYGGLGIGLFRQKNTALLTKWLWR-SEQGK------ILQIMGR----------------------ILKHKETFKFSAFVLRK
                    +V      GGLG G    +N ALL KWLWR   +G       IL I G                        L  +E  KF+ FV+  
Subjt:  ------------IVIPLICYGGLGIGLFRQKNTALLTKWLWR-SEQGK------ILQIMGR----------------------ILKHKETFKFSAFVLRK

Query:  GTKIRFWKDNWCGVEPLAEKFPNLFSLSLNKDAYVAECWCIV-THSWNLSLRRNMLNNEIVNVASILEIL
        G +IRFW D W G +PL  ++P L S+  +K+A ++         SWN + RRN+ ++EI ++ S++  L
Subjt:  GTKIRFWKDNWCGVEPLAEKFPNLFSLSLNKDAYVAECWCIV-THSWNLSLRRNMLNNEIVNVASILEIL

RVX17354.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]6.7e-6134.79Show/hide
Query:  RVSDFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWSSKGRKGVLLKLDLEKAYDKVDGSFLDMAMKLK------
        R+SDFRPISL+TSLYK+I+KVLA R+++VL   I+ +Q AFV+GRQILDA+LIA+E+VD+    G +GV+ K+D EKAYD V   FLD  M++K      
Subjt:  RVSDFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWSSKGRKGVLLKLDLEKAYDKVDGSFLDMAMKLK------

Query:  RLWPR-----------------GKILAKRGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEKLSDDLTHLQYVDDTSFF--------------
        R W R                 G +   RG+RQGDPL+PFLFT+V D LS ++    E+  L+GF   +    ++HLQ+ DDT FF              
Subjt:  RLWPR-----------------GKILAKRGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEKLSDDLTHLQYVDDTSFF--------------

Query:  -----FLGGWKLGDLVEGDQY-----LSSRSRL-----------PFKYLGFSIGGGRNRKEMWNTLEERFRYKIDRWRNVSLSKGDRLTLVQS-------
              + G K+ +L + + Y      +  SRL           P  YLG  +GG       W+ + ER   ++D W+   LS G R+TL+QS       
Subjt:  -----FLGGWKLGDLVEGDQY-----LSSRSRL-----------PFKYLGFSIGGGRNRKEMWNTLEERFRYKIDRWRNVSLSKGDRLTLVQS-------

Query:  -EGLLIQLPTS----------------------SNGIVIPLIC----YGGLGIGLFRQKNTALLTKWLWR-SEQGKIL--QIMGRI--------LKHKET
            L ++P S                       + +   ++C     GGLG G    +N ALL KWLWR   +G  L  Q+ G I        L ++E 
Subjt:  -EGLLIQLPTS----------------------SNGIVIPLIC----YGGLGIGLFRQKNTALLTKWLWR-SEQGKIL--QIMGRI--------LKHKET

Query:  FKFSAFVLRKGTKIRFWKDNWCGVEPLAEKFPNLFSLSLNKDAYVAE-CWCIVTHSWNLSLRRNMLNNEIVNVASILEIL
         KF+ FV+  G +IRFW D W G +PL  ++P L  +  +K+A ++         SWN + RRN+ ++EI ++  +++ L
Subjt:  FKFSAFVLRKGTKIRFWKDNWCGVEPLAEKFPNLFSLSLNKDAYVAE-CWCIVTHSWNLSLRRNMLNNEIVNVASILEIL

TrEMBL top hitse value%identityAlignment
A0A438E7Q5 Transposon TX1 uncharacterized 149 kDa protein3.6e-6035.18Show/hide
Query:  RVSDFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWSSKGRKGVLLKLDLEKAYDKVDGSFLDMAMKLK------
        ++SDFRPISL+TSLYK+I+KVL+ RL+ VL   I+ +Q AFV+GRQI+DA+LIA+E+VD+    G +GV+ K+D EKAYD V   FLD  ++ K      
Subjt:  RVSDFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWSSKGRKGVLLKLDLEKAYDKVDGSFLDMAMKLK------

Query:  RLW-----------------PRGKILAKRGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEKLSDDLTHLQYVDDTSFFF------LGGWKLG
        R W                  +G + A RG+RQGDPL+PFLFT+V D LS ++    E+  L+GF   +    ++HLQ+ DDT FF       L   K  
Subjt:  RLW-----------------PRGKILAKRGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEKLSDDLTHLQYVDDTSFFF------LGGWKLG

Query:  DLVEGDQYLSSRSRLPFKYLGFSIGGGRNRKEMWNTLEERFRYKIDRWRNVSLSKGDRLTLVQS--------EGLLIQLPTSSNGIVIPL----------
         L  G       S  P  YLG  +GG       W+ + ER   ++D W+   LS G R+TL+QS           L +LP S    +  L          
Subjt:  DLVEGDQYLSSRSRLPFKYLGFSIGGGRNRKEMWNTLEERFRYKIDRWRNVSLSKGDRLTLVQS--------EGLLIQLPTSSNGIVIPL----------

Query:  ------------IC----YGGLGIGLFRQKNTALLTKWLWR-SEQGK------ILQIMG---------RILK-------------HKETFKFSAFVLRKG
                    +C     GGLG G    +N ALL KWLWR   +G       IL I G          I++              +E   F+ FV+  G
Subjt:  ------------IC----YGGLGIGLFRQKNTALLTKWLWR-SEQGK------ILQIMG---------RILK-------------HKETFKFSAFVLRKG

Query:  TKIRFWKDNWCGVEPLAEKFPNLFSLSLNKDAYVAECWCIV-THSWNLSLRRNMLNNEIVNVASILEIL
         +IRFW+D W G +PL  ++P+LF + L+K+  ++         SWNL+ RRN+ ++EI ++  ++  L
Subjt:  TKIRFWKDNWCGVEPLAEKFPNLFSLSLNKDAYVAECWCIV-THSWNLSLRRNMLNNEIVNVASILEIL

A0A438G038 Transposon TX1 uncharacterized 149 kDa protein2.1e-6034.14Show/hide
Query:  RVSDFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWSSKGRKGVLLKLDLEKAYDKVDGSFLDMAMKLK------
        R+SDFRPISL+TSLYK+I+KVLA RL+ VL   I+ +Q AFV+GRQILDA+LIA+E+VD+    G +GV+ K+D EKAYD V   FLD  +++K      
Subjt:  RVSDFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWSSKGRKGVLLKLDLEKAYDKVDGSFLDMAMKLK------

Query:  RLWPR-----------------GKILAKRGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEKLSDDLTHLQYVDDTSFF--------------
        R W R                 G + A RG+RQGDPL+PFLFT+V D LS ++    E+  L+GF   +    ++HLQ+ DDT FF              
Subjt:  RLWPR-----------------GKILAKRGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEKLSDDLTHLQYVDDTSFF--------------

Query:  -----FLGGWKL--------GDLVEGDQYLS--------SRSRLPFKYLGFSIGGGRNRKEMWNTLEERFRYKIDRWRNVSLSKGDRLTLVQS-------
              + G K+        G  +E   +LS          S  P  YLG  +GG       W+ + ER   ++D W+   LS G R+TL+QS       
Subjt:  -----FLGGWKL--------GDLVEGDQYLS--------SRSRLPFKYLGFSIGGGRNRKEMWNTLEERFRYKIDRWRNVSLSKGDRLTLVQS-------

Query:  -EGLLIQLPTS----------------------SNGIVIPLIC----YGGLGIGLFRQKNTALLTKWLWR-SEQGK------ILQIMGR-----------
            L ++P S                       + +   ++C     GGLG G    +N ALL KWLWR   +G       IL I G            
Subjt:  -EGLLIQLPTS----------------------SNGIVIPLIC----YGGLGIGLFRQKNTALLTKWLWR-SEQGK------ILQIMGR-----------

Query:  -----------ILKHKETFKFSAFVLRKGTKIRFWKDNWCGVEPLAEKFPNLFSLSLNKDAYVAECWCIV-THSWNLSLRRNMLNNEIVNVASILEIL
                    L  +E  KF+ FV+  G +IRFW D W G +PL  ++P L S+  +K+A ++         SWN + RRN+ ++EI ++ S+++ L
Subjt:  -----------ILKHKETFKFSAFVLRKGTKIRFWKDNWCGVEPLAEKFPNLFSLSLNKDAYVAECWCIV-THSWNLSLRRNMLNNEIVNVASILEIL

A0A438JQQ0 LINE-1 retrotransposable element ORF2 protein1.7e-6235.74Show/hide
Query:  RVSDFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWSSKGRKGVLLKLDLEKAYDKVDGSFLDMAMKLK------
        R+SDFRPISL+TSLYK+I+KVLA RL+ VL   I+ +Q AFV+GRQILDA+LIA+E+VD+    G +GV+ K+D EKAYD V   FLD  +++K      
Subjt:  RVSDFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWSSKGRKGVLLKLDLEKAYDKVDGSFLDMAMKLK------

Query:  RLWPR-----------------GKILAKRGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEKLSDDLTHLQYVDDTSFFF------LGGWKLG
        R W R                 G + A RG+RQGDPL+PFLFT+V D LS ++    E+  L+GF   +    ++HLQ+ DDT FF       L   K  
Subjt:  RLWPR-----------------GKILAKRGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEKLSDDLTHLQYVDDTSFFF------LGGWKLG

Query:  DLVEGDQYLSSR-SRLPFKYLGFSIGGGRNRKEMWNTLEERFRYKIDRWRNVSLSKGDRLTLVQS--------EGLLIQLPTSSNG--------------
         L    + L  + S  P  YLG  +GG       W+ + ER   ++D W+   LS G R+TL+QS           L ++P S                 
Subjt:  DLVEGDQYLSSR-SRLPFKYLGFSIGGGRNRKEMWNTLEERFRYKIDRWRNVSLSKGDRLTLVQS--------EGLLIQLPTSSNG--------------

Query:  ------------IVIPLICYGGLGIGLFRQKNTALLTKWLWR-SEQGK------ILQIMGR----------------------ILKHKETFKFSAFVLRK
                    +V      GGLG G    +N ALL KWLWR   +G       IL I G                        L  +E  KF+ FV+  
Subjt:  ------------IVIPLICYGGLGIGLFRQKNTALLTKWLWR-SEQGK------ILQIMGR----------------------ILKHKETFKFSAFVLRK

Query:  GTKIRFWKDNWCGVEPLAEKFPNLFSLSLNKDAYVAECWCIV-THSWNLSLRRNMLNNEIVNVASILEIL
        G +IRFW D W G +PL  ++P L S+  +K+A ++         SWN + RRN+ ++EI ++ S++  L
Subjt:  GTKIRFWKDNWCGVEPLAEKFPNLFSLSLNKDAYVAECWCIV-THSWNLSLRRNMLNNEIVNVASILEIL

A0A438K828 LINE-1 retrotransposable element ORF2 protein3.3e-6134.79Show/hide
Query:  RVSDFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWSSKGRKGVLLKLDLEKAYDKVDGSFLDMAMKLK------
        R+SDFRPISL+TSLYK+I+KVLA R+++VL   I+ +Q AFV+GRQILDA+LIA+E+VD+    G +GV+ K+D EKAYD V   FLD  M++K      
Subjt:  RVSDFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWSSKGRKGVLLKLDLEKAYDKVDGSFLDMAMKLK------

Query:  RLWPR-----------------GKILAKRGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEKLSDDLTHLQYVDDTSFF--------------
        R W R                 G +   RG+RQGDPL+PFLFT+V D LS ++    E+  L+GF   +    ++HLQ+ DDT FF              
Subjt:  RLWPR-----------------GKILAKRGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEKLSDDLTHLQYVDDTSFF--------------

Query:  -----FLGGWKLGDLVEGDQY-----LSSRSRL-----------PFKYLGFSIGGGRNRKEMWNTLEERFRYKIDRWRNVSLSKGDRLTLVQS-------
              + G K+ +L + + Y      +  SRL           P  YLG  +GG       W+ + ER   ++D W+   LS G R+TL+QS       
Subjt:  -----FLGGWKLGDLVEGDQY-----LSSRSRL-----------PFKYLGFSIGGGRNRKEMWNTLEERFRYKIDRWRNVSLSKGDRLTLVQS-------

Query:  -EGLLIQLPTS----------------------SNGIVIPLIC----YGGLGIGLFRQKNTALLTKWLWR-SEQGKIL--QIMGRI--------LKHKET
            L ++P S                       + +   ++C     GGLG G    +N ALL KWLWR   +G  L  Q+ G I        L ++E 
Subjt:  -EGLLIQLPTS----------------------SNGIVIPLIC----YGGLGIGLFRQKNTALLTKWLWR-SEQGKIL--QIMGRI--------LKHKET

Query:  FKFSAFVLRKGTKIRFWKDNWCGVEPLAEKFPNLFSLSLNKDAYVAE-CWCIVTHSWNLSLRRNMLNNEIVNVASILEIL
         KF+ FV+  G +IRFW D W G +PL  ++P L  +  +K+A ++         SWN + RRN+ ++EI ++  +++ L
Subjt:  FKFSAFVLRKGTKIRFWKDNWCGVEPLAEKFPNLFSLSLNKDAYVAE-CWCIVTHSWNLSLRRNMLNNEIVNVASILEIL

A0A803QEA6 Uncharacterized protein9.5e-6133.27Show/hide
Query:  RVSDFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWSSKGRKGVLLKLDLEKAYDKVDGSFLDMAMKLK------
        +V DFRPISL+TS+YK+I+K LATRL+ VL   I+++Q AFVEGRQILD++L+A+E V+ + S+GRKG +LK+D EKAYD+VD  FLDM ++ K      
Subjt:  RVSDFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWSSKGRKGVLLKLDLEKAYDKVDGSFLDMAMKLK------

Query:  RLW-----------------PRGKILAKRGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEKLSDDLTHLQYVDDTSFFFLGGWKLGDLVE--
        R W                  RGK    RG+RQGDPL+PFLFTM+ D L  ++    E  SL GF   K    L+HLQ+ DDT FF      L  LV+  
Subjt:  RLW-----------------PRGKILAKRGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEKLSDDLTHLQYVDDTSFFFLGGWKLGDLVE--

Query:  -------GDQYLSSRSRL------------------------PFKYLGFSIGGGRNRKEMWNTLEERFRYKIDRWRNVSLSKGDRLTLVQSEGLLIQLPT
               G +   ++S+L                        P  YLG S+GG   ++  W  + ++   ++D W+   LS+G RLTL+QS  +L  LP 
Subjt:  -------GDQYLSSRSRL------------------------PFKYLGFSIGGGRNRKEMWNTLEERFRYKIDRWRNVSLSKGDRLTLVQSEGLLIQLPT

Query:  --------------------------------SSNGIVIPLIC----YGGLGIGLFRQKNTALLTKWLWR-------------------------SEQGK
                                          + +    +C     GGL IG    +N  LL KWLWR                         ++QG 
Subjt:  --------------------------------SSNGIVIPLIC----YGGLGIGLFRQKNTALLTKWLWR-------------------------SEQGK

Query:  ILQIMGRILK----HKETFKFSAFVLRKGTKIRFWKDNWCGVEPLAEKFPNLFSLSLNKDAYVAECW--------CIVTHSWNLSLRRNMLNNEIVNVAS
         +   G        + E  K   F +  G +IRFW+D W G   L ++FPNL  +S  K+  + E          C+   SW+L+ RRN+++ EI ++  
Subjt:  ILQIMGRILK----HKETFKFSAFVLRKGTKIRFWKDNWCGVEPLAEKFPNLFSLSLNKDAYVAECW--------CIVTHSWNLSLRRNMLNNEIVNVAS

Query:  ILEIL
        +L+ L
Subjt:  ILEIL

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein6.0e-1223.68Show/hide
Query:  DFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWS-SKGRKGVLLKLDLEKAYDKVDGSFL----------DMAMK
        +FRPISL+    K+++K+LA R+++ +  +I+  Q+ F+ G Q    I  +  V+   + +K +  V++ +D EKA+DK+   F+           M +K
Subjt:  DFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWS-SKGRKGVLLKLDLEKAYDKVDGSFL----------DMAMK

Query:  LKRL---WPRGKIL----------AKRGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEK-------LSDDLTHLQYVDD------------T
        + R     P   I+           K G RQG PL+P LF +V   L  L     +++ +KG    K        +DD+  + Y+++            +
Subjt:  LKRL---WPRGKIL----------AKRGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEK-------LSDDLTHLQYVDD------------T

Query:  SFFFLGGWKLGDLVEGDQYLSSRSR---------LPF-------KYLGFSIGGGRNR--KEMWNTLEERFRYKIDRWRNVSLSKGDRLTLVQSEGLLIQL
        +F  + G+K+ ++ +   +L + +R         LPF       KYLG  +        KE +  L +  +   ++W+N+  S   R+ +V+   +L ++
Subjt:  SFFFLGGWKLGDLVEGDQYLSSRSR---------LPF-------KYLGFSIGGGRNR--KEMWNTLEERFRYKIDRWRNVSLSKGDRLTLVQSEGLLIQL

Query:  PTSSNGIVIPLICYGGLGIGLFRQKNTALLTKWLWRSEQGKI
            N I I       L +  F +     L K++W  ++ +I
Subjt:  PTSSNGIVIPLICYGGLGIGLFRQKNTALLTKWLWRSEQGKI

P08548 LINE-1 reverse transcriptase homolog3.3e-1027.03Show/hide
Query:  RVSDFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWSS-KGRKGVLLKLDLEKAYDKVDGSFLDMAMKLKRLWPR
        R  ++RPISL+    K+++K+L  R+++ +  II+  Q+ F+ G Q    I  +  V+   +  K +  ++L +D EKA+D +   F  M   LK++   
Subjt:  RVSDFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWSS-KGRKGVLLKLDLEKAYDKVDGSFLDMAMKLKRLWPR

Query:  GKIL-------------------------AKRGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEKLSDDLTHLQYVDD
        G  L                          + G RQG PL+P LF +V + L+  I    E++++KG H    S+++    + DD
Subjt:  GKIL-------------------------AKRGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEKLSDDLTHLQYVDD

P11369 LINE-1 retrotransposable element ORF2 protein1.5e-1026.84Show/hide
Query:  RVSDFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWSS-KGRKGVLLKLDLEKAYDKVDGSFLDMAMK-------
        ++ +FRPISL+    K+++K+LA R+++ + +II+  Q+ F+ G Q    I  +  V+   +  K +  +++ LD EKA+DK+   F+   ++       
Subjt:  RVSDFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWSS-KGRKGVLLKLDLEKAYDKVDGSFLDMAMK-------

Query:  ----LKRLWPR------------GKILAKRGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEK-------LSDDLTHLQYVDD
            +K ++ +              I  K G RQG PL+P+LF +V   L  L     +++ +KG    K       L+DD+  + Y+ D
Subjt:  ----LKRLWPR------------GKILAKRGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEK-------LSDDLTHLQYVDD

P14381 Transposon TX1 uncharacterized 149 kDa protein3.5e-1227.07Show/hide
Query:  VSDFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWSSKGRKGVLLKLDLEKAYDKVDGSFLDMAMK---------
        + ++RP+SL+++ YK+++K ++ RLK VL  +I+  Q   V GR I D + +  +++      G     L LD EKA+D+VD  +L   ++         
Subjt:  VSDFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWSSKGRKGVLLKLDLEKAYDKVDGSFLDMAMK---------

Query:  --LKRLWPRGKILAK------------RGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEKLSDDLTHLQYVDD
          LK ++   + L K            RG+RQG PL+  L+++  +   CL+     ++ L G   ++    +    Y DD
Subjt:  --LKRLWPRGKILAK------------RGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEKLSDDLTHLQYVDD

Q03274 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)4.4e-0727.92Show/hide
Query:  SDFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWSSKGRKG----VLLKLDLEKAYDKVDGSFLDMAMK------
        S++RPI++ ++L +++ ++LA RL+  +   ++ +Q  +      +D  L+ S ++D + S  R+      ++ LD+ KA+D V  S +  A++      
Subjt:  SDFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWSSKGRKG----VLLKLDLEKAYDKVDGSFLDMAMK------

Query:  ---------------LKRLWPRG---KILAKRGIRQGDPLAPFLFTMVGDALSC
                         R+ P     KI  +RG++QGDPL+PFLF  V D L C
Subjt:  ---------------LKRLWPRG---KILAKRGIRQGDPLAPFLFTMVGDALSC

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases6.6e-0640.58Show/hide
Query:  LATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQW-SSKGRKG-VLLKLDLEKAYDKVDGSFLD
        +  RLK ++ ++I  +Q +F+ GR   D I+   E V      KG KG +LLKLDLEKAYD++   +L+
Subjt:  LATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQW-SSKGRKG-VLLKLDLEKAYDKVDGSFLD

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.1e-0840.62Show/hide
Query:  PRGKILAKRGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEKLSDDLTHLQYVDDTS
        P+G +   RG+RQGDPL+P+LF +  + LS L     E+  L G      S  + HL + DDTS
Subjt:  PRGKILAKRGIRQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEKLSDDLTHLQYVDDTS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCCGTGTCAGTGATTTCAGACCCATAAGCTTAGTTACCTCCTTGTATAAAGTTATCTCCAAGGTACTTGCAACAAGACTTAAAAAAGTCCTTCCTTCGATAATTAA
TGATTCTCAGATGGCTTTTGTGGAGGGAAGGCAAATTCTTGATGCTATTTTGATTGCTTCTGAGGTTGTTGACCAATGGTCTTCAAAAGGCAGAAAAGGCGTTCTTTTGA
AGCTTGATCTGGAGAAAGCTTATGACAAGGTGGATGGGTCTTTTCTTGATATGGCCATGAAACTTAAAAGGCTTTGGCCTAGAGGAAAGATTCTTGCTAAAAGGGGCATT
CGTCAAGGAGATCCTCTTGCCCCTTTTCTTTTTACAATGGTGGGAGATGCTCTAAGCTGCCTTATTCACTACTGTAATGAGAAAAGGAGTTTAAAAGGCTTCCATTTTGA
GAAATTGTCAGATGATTTAACCCACCTTCAGTATGTAGATGACACTTCTTTTTTCTTCCTGGGAGGATGGAAACTTGGAGACTTGGTGGAAGGTGATCAATATCTTTCTT
CTAGGAGCCGTCTTCCCTTTAAGTACTTGGGCTTTTCTATTGGAGGGGGTCGTAATAGAAAAGAGATGTGGAACACCCTTGAAGAGCGATTCAGATATAAAATCGACAGG
TGGAGGAATGTGTCCCTCTCTAAAGGGGATAGGCTAACTCTGGTGCAATCAGAGGGTCTACTAATTCAATTGCCCACCTCGTCAAATGGGATTGTAATTCCCCTAATCTG
TTATGGTGGTCTTGGGATTGGCTTGTTTAGGCAAAAGAACACTGCTCTTCTCACTAAATGGCTTTGGAGATCCGAACAGGGGAAAATCTTACAGATTATGGGCCGTATTT
TGAAGCACAAGGAGACCTTTAAATTTTCAGCTTTCGTGCTGAGAAAAGGAACCAAAATCAGATTTTGGAAGGACAATTGGTGTGGCGTGGAGCCACTTGCAGAAAAATTC
CCTAACTTGTTCTCCTTGTCATTGAATAAGGATGCTTATGTGGCTGAATGTTGGTGTATTGTTACTCATTCGTGGAACTTGAGCCTTAGAAGAAATATGCTCAATAACGA
GATTGTCAATGTGGCCTCAATTTTAGAAATTCTTCATTCTTGGGCCCCTCAGATGGGGATGATTGCCTTAAATGAACTCCTAATATTGATGGCAGCTTTACTACGAAGTC
TACTTTTCTCAAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCCGTGTCAGTGATTTCAGACCCATAAGCTTAGTTACCTCCTTGTATAAAGTTATCTCCAAGGTACTTGCAACAAGACTTAAAAAAGTCCTTCCTTCGATAATTAA
TGATTCTCAGATGGCTTTTGTGGAGGGAAGGCAAATTCTTGATGCTATTTTGATTGCTTCTGAGGTTGTTGACCAATGGTCTTCAAAAGGCAGAAAAGGCGTTCTTTTGA
AGCTTGATCTGGAGAAAGCTTATGACAAGGTGGATGGGTCTTTTCTTGATATGGCCATGAAACTTAAAAGGCTTTGGCCTAGAGGAAAGATTCTTGCTAAAAGGGGCATT
CGTCAAGGAGATCCTCTTGCCCCTTTTCTTTTTACAATGGTGGGAGATGCTCTAAGCTGCCTTATTCACTACTGTAATGAGAAAAGGAGTTTAAAAGGCTTCCATTTTGA
GAAATTGTCAGATGATTTAACCCACCTTCAGTATGTAGATGACACTTCTTTTTTCTTCCTGGGAGGATGGAAACTTGGAGACTTGGTGGAAGGTGATCAATATCTTTCTT
CTAGGAGCCGTCTTCCCTTTAAGTACTTGGGCTTTTCTATTGGAGGGGGTCGTAATAGAAAAGAGATGTGGAACACCCTTGAAGAGCGATTCAGATATAAAATCGACAGG
TGGAGGAATGTGTCCCTCTCTAAAGGGGATAGGCTAACTCTGGTGCAATCAGAGGGTCTACTAATTCAATTGCCCACCTCGTCAAATGGGATTGTAATTCCCCTAATCTG
TTATGGTGGTCTTGGGATTGGCTTGTTTAGGCAAAAGAACACTGCTCTTCTCACTAAATGGCTTTGGAGATCCGAACAGGGGAAAATCTTACAGATTATGGGCCGTATTT
TGAAGCACAAGGAGACCTTTAAATTTTCAGCTTTCGTGCTGAGAAAAGGAACCAAAATCAGATTTTGGAAGGACAATTGGTGTGGCGTGGAGCCACTTGCAGAAAAATTC
CCTAACTTGTTCTCCTTGTCATTGAATAAGGATGCTTATGTGGCTGAATGTTGGTGTATTGTTACTCATTCGTGGAACTTGAGCCTTAGAAGAAATATGCTCAATAACGA
GATTGTCAATGTGGCCTCAATTTTAGAAATTCTTCATTCTTGGGCCCCTCAGATGGGGATGATTGCCTTAAATGAACTCCTAATATTGATGGCAGCTTTACTACGAAGTC
TACTTTTCTCAAATTAACCAAGATCCCTTCCATTGTTGTTCATTTGATTCGTCATATTTGGAAGACTAAAATTCCAAAAAAATGTGAAATTTATCTTATGGTCGCTCGCT
TACAGAAGCCTTAACACTCCTGAGAAGCT
Protein sequenceShow/hide protein sequence
MARVSDFRPISLVTSLYKVISKVLATRLKKVLPSIINDSQMAFVEGRQILDAILIASEVVDQWSSKGRKGVLLKLDLEKAYDKVDGSFLDMAMKLKRLWPRGKILAKRGI
RQGDPLAPFLFTMVGDALSCLIHYCNEKRSLKGFHFEKLSDDLTHLQYVDDTSFFFLGGWKLGDLVEGDQYLSSRSRLPFKYLGFSIGGGRNRKEMWNTLEERFRYKIDR
WRNVSLSKGDRLTLVQSEGLLIQLPTSSNGIVIPLICYGGLGIGLFRQKNTALLTKWLWRSEQGKILQIMGRILKHKETFKFSAFVLRKGTKIRFWKDNWCGVEPLAEKF
PNLFSLSLNKDAYVAECWCIVTHSWNLSLRRNMLNNEIVNVASILEILHSWAPQMGMIALNELLILMAALLRSLLFSN