; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G23910 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G23910
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationChr3:21013498..21015270
RNA-Seq ExpressionCSPI03G23910
SyntenyCSPI03G23910
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN65484.1 hypothetical protein VITISV_029474 [Vitis vinifera]7.9e-6834Show/hide
Query:  RKLVWPELSSLADCSSVAWCIGGDFNITRWARERFPFGRSTRGMSLFNKFI----------------------------------DMMWDVCFENSRVSR
        RK  W ELS +A  +S  WC+GGDFN+ R + E+    R T  M  F+ FI                                     W+  F  S    
Subjt:  RKLVWPELSSLADCSSVAWCIGGDFNITRWARERFPFGRSTRGMSLFNKFI----------------------------------DMMWDVCFENSRVSR

Query:  KARIFSDHFPLFLEAGSFLWGPSPFRFCNSWLVSSDSNQIIVETVSNSNFQGWAGFILHEQLRSVKLAVKAWNIAQ------------AKLI--------
          R  SDH+P+ LE   F WGP+PFRF N WL      +            GW G     +L+ VK  +K WN A             + L+        
Subjt:  KARIFSDHFPLFLEAGSFLWGPSPFRFCNSWLVSSDSNQIIVETVSNSNFQGWAGFILHEQLRSVKLAVKAWNIAQ------------AKLI--------

Query:  -GLNQEELNSRAALQAELLSIYQNEERNFIQKSKLNWLSSGDENTGFFHRFLNAKKRRNLITELKNDDGVVSTSFRDIERLVLEFFESLYTRIPGDRFIP
         GL+ E L  RA  + EL  +   EE ++ QK+++ W+  GD N+ FFH+  N ++ R  I EL+N++G++  +   I+  +L +FE LYT   G+ +  
Subjt:  -GLNQEELNSRAALQAELLSIYQNEERNFIQKSKLNWLSSGDENTGFFHRFLNAKKRRNLITELKNDDGVVSTSFRDIERLVLEFFESLYTRIPGDRFIP

Query:  INSNWSCVSSIQNEALVTQFSVEEIFKALKALGNNKAPSPDGFTVKFLITQWSIFKDIFKSLMSDFHRNGRLNACIQENFICLVQKKENVTLVKDFRPIS
           +WS +S      L + F+ EEIFKA+  +  +KAP PDGFT+      W + K+    + ++FHR+G +N     +FI L+ KK     + DFRPIS
Subjt:  INSNWSCVSSIQNEALVTQFSVEEIFKALKALGNNKAPSPDGFTVKFLITQWSIFKDIFKSLMSDFHRNGRLNACIQENFICLVQKKENVTLVKDFRPIS

Query:  LTTLAYKVVAKVLSERLKQVMDAIISPTQSAFIEGRQILDPILIANEAVEDYR
        L T  YK++AKVL+ R++ V+   I  TQ AF++GRQILD +LIANE V++ R
Subjt:  LTTLAYKVVAKVLSERLKQVMDAIISPTQSAFIEGRQILDPILIANEAVEDYR

TYJ99326.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.3e-6734.28Show/hide
Query:  RKLVWPELSSLADCSSVAWCIGGDFNITRWARERFPFGRSTRGMSLFNKFI----------------------------------DMMWDVCFENSRVSR
        R L W EL +L       W +GGDFN+ RW  E      ++  M  FN FI                                     W+  F       
Subjt:  RKLVWPELSSLADCSSVAWCIGGDFNITRWARERFPFGRSTRGMSLFNKFI----------------------------------DMMWDVCFENSRVSR

Query:  KARIFSDHFPLFLEAGSFLWGPSPFRFCNSWLVSSDSNQIIVETVSNSNFQGWAGFILHEQLRSVKLAVKAWN--------------IAQAKLIGLNQEE
          R  SDHFP+ LE+ S  WGPSPFRF N++L   D  + I     N++  G+AG+    +L+ + + +KAW               I +  LI   + E
Subjt:  KARIFSDHFPLFLEAGSFLWGPSPFRFCNSWLVSSDSNQIIVETVSNSNFQGWAGFILHEQLRSVKLAVKAWN--------------IAQAKLIGLNQEE

Query:  LNS-------RAALQAELLSIYQNEERNFIQKSKLNWLSSGDENTGFFHRFLNAKKRRNLITELKNDDGVVSTSFRDIERLVLEFFESLYTRIPGDRFIP
          S       R AL+A+L  I   E + + QK K  W+  GDEN+ FFH+   A++++ LI+++ N  G    +  DI    ++ FE +YT         
Subjt:  LNS-------RAALQAELLSIYQNEERNFIQKSKLNWLSSGDENTGFFHRFLNAKKRRNLITELKNDDGVVSTSFRDIERLVLEFFESLYTRIPGDRFIP

Query:  INSNWSCVSSIQNEALVTQFSVEEIFKALKALGNNKAPSPDGFTVKFLITQWSIFKDIFKSLMSDFHRNGRLNACIQENFICLVQKKENVTLVKDFRPIS
         N +W  +S+  +  L   F+  EI+  LK+   NKAP PDGFT+ FL   WS  K     +  DFH N  +N  + E  I  + KKEN   V DFRPIS
Subjt:  INSNWSCVSSIQNEALVTQFSVEEIFKALKALGNNKAPSPDGFTVKFLITQWSIFKDIFKSLMSDFHRNGRLNACIQENFICLVQKKENVTLVKDFRPIS

Query:  LTTLAYKVVAKVLSERLKQVMDAIISPTQSAFIEGRQILDPILIANEAVEDYRAKRKK
        LTT  YK++AKVL++RLKQ +   IS +Q AF++GRQI + ILIANEA++ +R K+++
Subjt:  LTTLAYKVVAKVLSERLKQVMDAIISPTQSAFIEGRQILDPILIANEAVEDYRAKRKK

XP_038884535.1 DEAD-box ATP-dependent RNA helicase FANCM isoform X1 [Benincasa hispida]2.1e-7348.5Show/hide
Query:  GRSTRGMSLFNKFIDMMWDVCFENSRVSRKARIFSDHFPLFLEAGSFLWGPSPFRFCNSWLVSSDSNQIIVETVSNSNFQGWAGFILHEQLRSVKLAVKA
        G ST+   L    +   WD  FE+SRVSR+AR  SDHFPL  EAG+F WGPSPFRFCNSWL + +  +II  +      Q WAGF L+ +LR VK +VK 
Subjt:  GRSTRGMSLFNKFIDMMWDVCFENSRVSRKARIFSDHFPLFLEAGSFLWGPSPFRFCNSWLVSSDSNQIIVETVSNSNFQGWAGFILHEQLRSVKLAVKA

Query:  WNIAQAKLIGLNQEEL---------------------NSRAALQAELLSIYQNEERNFIQKSKLNWLSSGDENTGFFHRFLNAKKRRNLITELKNDDGVV
        W     K   + +E L                     + R +L+A+LLS+YQ EER+ IQKSKLNWL  GDENT FFHRFL AK+R+NLI EL N+ G+ 
Subjt:  WNIAQAKLIGLNQEEL---------------------NSRAALQAELLSIYQNEERNFIQKSKLNWLSSGDENTGFFHRFLNAKKRRNLITELKNDDGVV

Query:  STSFRDIERLVLEFFESLYTRIPGDRFIPINSNWSCVSSIQNEALVTQFSVEEIFKALKALGNNKAPSPDGFTVKFLITQWSIFKDIFKSLMSDFHRNGR
        + SFR+IE ++L+FF +LYT+  G R IP+N  WS VS+  N  L+ +FS  EI  A++ALG NKAP PDGFTV+F++  W++ KD FK +  +F+ NG+
Subjt:  STSFRDIERLVLEFFESLYTRIPGDRFIPINSNWSCVSSIQNEALVTQFSVEEIFKALKALGNNKAPSPDGFTVKFLITQWSIFKDIFKSLMSDFHRNGR

Query:  L
        +
Subjt:  L

XP_038884536.1 DEAD-box ATP-dependent RNA helicase FANCM isoform X2 [Benincasa hispida]2.1e-7348.5Show/hide
Query:  GRSTRGMSLFNKFIDMMWDVCFENSRVSRKARIFSDHFPLFLEAGSFLWGPSPFRFCNSWLVSSDSNQIIVETVSNSNFQGWAGFILHEQLRSVKLAVKA
        G ST+   L    +   WD  FE+SRVSR+AR  SDHFPL  EAG+F WGPSPFRFCNSWL + +  +II  +      Q WAGF L+ +LR VK +VK 
Subjt:  GRSTRGMSLFNKFIDMMWDVCFENSRVSRKARIFSDHFPLFLEAGSFLWGPSPFRFCNSWLVSSDSNQIIVETVSNSNFQGWAGFILHEQLRSVKLAVKA

Query:  WNIAQAKLIGLNQEEL---------------------NSRAALQAELLSIYQNEERNFIQKSKLNWLSSGDENTGFFHRFLNAKKRRNLITELKNDDGVV
        W     K   + +E L                     + R +L+A+LLS+YQ EER+ IQKSKLNWL  GDENT FFHRFL AK+R+NLI EL N+ G+ 
Subjt:  WNIAQAKLIGLNQEEL---------------------NSRAALQAELLSIYQNEERNFIQKSKLNWLSSGDENTGFFHRFLNAKKRRNLITELKNDDGVV

Query:  STSFRDIERLVLEFFESLYTRIPGDRFIPINSNWSCVSSIQNEALVTQFSVEEIFKALKALGNNKAPSPDGFTVKFLITQWSIFKDIFKSLMSDFHRNGR
        + SFR+IE ++L+FF +LYT+  G R IP+N  WS VS+  N  L+ +FS  EI  A++ALG NKAP PDGFTV+F++  W++ KD FK +  +F+ NG+
Subjt:  STSFRDIERLVLEFFESLYTRIPGDRFIPINSNWSCVSSIQNEALVTQFSVEEIFKALKALGNNKAPSPDGFTVKFLITQWSIFKDIFKSLMSDFHRNGR

Query:  L
        +
Subjt:  L

XP_038884537.1 DEAD-box ATP-dependent RNA helicase FANCM isoform X3 [Benincasa hispida]2.1e-7348.5Show/hide
Query:  GRSTRGMSLFNKFIDMMWDVCFENSRVSRKARIFSDHFPLFLEAGSFLWGPSPFRFCNSWLVSSDSNQIIVETVSNSNFQGWAGFILHEQLRSVKLAVKA
        G ST+   L    +   WD  FE+SRVSR+AR  SDHFPL  EAG+F WGPSPFRFCNSWL + +  +II  +      Q WAGF L+ +LR VK +VK 
Subjt:  GRSTRGMSLFNKFIDMMWDVCFENSRVSRKARIFSDHFPLFLEAGSFLWGPSPFRFCNSWLVSSDSNQIIVETVSNSNFQGWAGFILHEQLRSVKLAVKA

Query:  WNIAQAKLIGLNQEEL---------------------NSRAALQAELLSIYQNEERNFIQKSKLNWLSSGDENTGFFHRFLNAKKRRNLITELKNDDGVV
        W     K   + +E L                     + R +L+A+LLS+YQ EER+ IQKSKLNWL  GDENT FFHRFL AK+R+NLI EL N+ G+ 
Subjt:  WNIAQAKLIGLNQEEL---------------------NSRAALQAELLSIYQNEERNFIQKSKLNWLSSGDENTGFFHRFLNAKKRRNLITELKNDDGVV

Query:  STSFRDIERLVLEFFESLYTRIPGDRFIPINSNWSCVSSIQNEALVTQFSVEEIFKALKALGNNKAPSPDGFTVKFLITQWSIFKDIFKSLMSDFHRNGR
        + SFR+IE ++L+FF +LYT+  G R IP+N  WS VS+  N  L+ +FS  EI  A++ALG NKAP PDGFTV+F++  W++ KD FK +  +F+ NG+
Subjt:  STSFRDIERLVLEFFESLYTRIPGDRFIPINSNWSCVSSIQNEALVTQFSVEEIFKALKALGNNKAPSPDGFTVKFLITQWSIFKDIFKSLMSDFHRNGR

Query:  L
        +
Subjt:  L

TrEMBL top hitse value%identityAlignment
A0A5D3BJP3 LINE-1 retrotransposable element ORF2 protein1.1e-6734.28Show/hide
Query:  RKLVWPELSSLADCSSVAWCIGGDFNITRWARERFPFGRSTRGMSLFNKFI----------------------------------DMMWDVCFENSRVSR
        R L W EL +L       W +GGDFN+ RW  E      ++  M  FN FI                                     W+  F       
Subjt:  RKLVWPELSSLADCSSVAWCIGGDFNITRWARERFPFGRSTRGMSLFNKFI----------------------------------DMMWDVCFENSRVSR

Query:  KARIFSDHFPLFLEAGSFLWGPSPFRFCNSWLVSSDSNQIIVETVSNSNFQGWAGFILHEQLRSVKLAVKAWN--------------IAQAKLIGLNQEE
          R  SDHFP+ LE+ S  WGPSPFRF N++L   D  + I     N++  G+AG+    +L+ + + +KAW               I +  LI   + E
Subjt:  KARIFSDHFPLFLEAGSFLWGPSPFRFCNSWLVSSDSNQIIVETVSNSNFQGWAGFILHEQLRSVKLAVKAWN--------------IAQAKLIGLNQEE

Query:  LNS-------RAALQAELLSIYQNEERNFIQKSKLNWLSSGDENTGFFHRFLNAKKRRNLITELKNDDGVVSTSFRDIERLVLEFFESLYTRIPGDRFIP
          S       R AL+A+L  I   E + + QK K  W+  GDEN+ FFH+   A++++ LI+++ N  G    +  DI    ++ FE +YT         
Subjt:  LNS-------RAALQAELLSIYQNEERNFIQKSKLNWLSSGDENTGFFHRFLNAKKRRNLITELKNDDGVVSTSFRDIERLVLEFFESLYTRIPGDRFIP

Query:  INSNWSCVSSIQNEALVTQFSVEEIFKALKALGNNKAPSPDGFTVKFLITQWSIFKDIFKSLMSDFHRNGRLNACIQENFICLVQKKENVTLVKDFRPIS
         N +W  +S+  +  L   F+  EI+  LK+   NKAP PDGFT+ FL   WS  K     +  DFH N  +N  + E  I  + KKEN   V DFRPIS
Subjt:  INSNWSCVSSIQNEALVTQFSVEEIFKALKALGNNKAPSPDGFTVKFLITQWSIFKDIFKSLMSDFHRNGRLNACIQENFICLVQKKENVTLVKDFRPIS

Query:  LTTLAYKVVAKVLSERLKQVMDAIISPTQSAFIEGRQILDPILIANEAVEDYRAKRKK
        LTT  YK++AKVL++RLKQ +   IS +Q AF++GRQI + ILIANEA++ +R K+++
Subjt:  LTTLAYKVVAKVLSERLKQVMDAIISPTQSAFIEGRQILDPILIANEAVEDYRAKRKK

A0A5D3E0F6 LINE-1 retrotransposable element ORF2 protein4.2e-6733.41Show/hide
Query:  RKLVWPELSSLADCSSVAWCIGGDFNITRWARERFPFGRSTRGMSLFNKFI----------------------------------DMMWDVCFENSRVSR
        R L W EL +L       W +GGDFN+ RW  E      ++  M  FN FI                                     W+  F       
Subjt:  RKLVWPELSSLADCSSVAWCIGGDFNITRWARERFPFGRSTRGMSLFNKFI----------------------------------DMMWDVCFENSRVSR

Query:  KARIFSDHFPLFLEAGSFLWGPSPFRFCNSWLVSSDSNQIIVETVSNSNFQGWAGFILHEQLRSVKLAVKAWN--------------IAQAKLIGLNQEE
          R  SDHFP+ LE+ +  WGPSPFRF N++L   D  + I     N++  G+AG+    +L+ + L +KAW               I +  LI   + E
Subjt:  KARIFSDHFPLFLEAGSFLWGPSPFRFCNSWLVSSDSNQIIVETVSNSNFQGWAGFILHEQLRSVKLAVKAWN--------------IAQAKLIGLNQEE

Query:  -------LNSRAALQAELLSIYQNEERNFIQKSKLNWLSSGDENTGFFHRFLNAKKRRNLITELKNDDGVVSTSFRDIERLVLEFFESLYTRIPGDRFIP
                  R AL+A+L  I   E + + QK K  W+  GDEN+ FFH+   A++++ LI+++ N+ G    +  DI    ++ FE +YT         
Subjt:  -------LNSRAALQAELLSIYQNEERNFIQKSKLNWLSSGDENTGFFHRFLNAKKRRNLITELKNDDGVVSTSFRDIERLVLEFFESLYTRIPGDRFIP

Query:  INSNWSCVSSIQNEALVTQFSVEEIFKALKALGNNKAPSPDGFTVKFLITQWSIFKDIFKSLMSDFHRNGRLNACIQENFICLVQKKENVTLVKDFRPIS
         N +W  +S+  ++ L   F+  EI+  LK+   NKAP PDG+T+ FL   WS  K     +  DFH    +N  + E  I L+ KKEN   V DFRPIS
Subjt:  INSNWSCVSSIQNEALVTQFSVEEIFKALKALGNNKAPSPDGFTVKFLITQWSIFKDIFKSLMSDFHRNGRLNACIQENFICLVQKKENVTLVKDFRPIS

Query:  LTTLAYKVVAKVLSERLKQVMDAIISPTQSAFIEGRQILDPILIANEAVEDYRAKRKK
        LTT  YK++AK L++RLKQ +   IS +Q AF++GR+I + ILIANEA++ +R K+++
Subjt:  LTTLAYKVVAKVLSERLKQVMDAIISPTQSAFIEGRQILDPILIANEAVEDYRAKRKK

A5BCI7 Reverse transcriptase domain-containing protein3.8e-6834Show/hide
Query:  RKLVWPELSSLADCSSVAWCIGGDFNITRWARERFPFGRSTRGMSLFNKFI----------------------------------DMMWDVCFENSRVSR
        RK  W ELS +A  +S  WC+GGDFN+ R + E+    R T  M  F+ FI                                     W+  F  S    
Subjt:  RKLVWPELSSLADCSSVAWCIGGDFNITRWARERFPFGRSTRGMSLFNKFI----------------------------------DMMWDVCFENSRVSR

Query:  KARIFSDHFPLFLEAGSFLWGPSPFRFCNSWLVSSDSNQIIVETVSNSNFQGWAGFILHEQLRSVKLAVKAWNIAQ------------AKLI--------
          R  SDH+P+ LE   F WGP+PFRF N WL      +            GW G     +L+ VK  +K WN A             + L+        
Subjt:  KARIFSDHFPLFLEAGSFLWGPSPFRFCNSWLVSSDSNQIIVETVSNSNFQGWAGFILHEQLRSVKLAVKAWNIAQ------------AKLI--------

Query:  -GLNQEELNSRAALQAELLSIYQNEERNFIQKSKLNWLSSGDENTGFFHRFLNAKKRRNLITELKNDDGVVSTSFRDIERLVLEFFESLYTRIPGDRFIP
         GL+ E L  RA  + EL  +   EE ++ QK+++ W+  GD N+ FFH+  N ++ R  I EL+N++G++  +   I+  +L +FE LYT   G+ +  
Subjt:  -GLNQEELNSRAALQAELLSIYQNEERNFIQKSKLNWLSSGDENTGFFHRFLNAKKRRNLITELKNDDGVVSTSFRDIERLVLEFFESLYTRIPGDRFIP

Query:  INSNWSCVSSIQNEALVTQFSVEEIFKALKALGNNKAPSPDGFTVKFLITQWSIFKDIFKSLMSDFHRNGRLNACIQENFICLVQKKENVTLVKDFRPIS
           +WS +S      L + F+ EEIFKA+  +  +KAP PDGFT+      W + K+    + ++FHR+G +N     +FI L+ KK     + DFRPIS
Subjt:  INSNWSCVSSIQNEALVTQFSVEEIFKALKALGNNKAPSPDGFTVKFLITQWSIFKDIFKSLMSDFHRNGRLNACIQENFICLVQKKENVTLVKDFRPIS

Query:  LTTLAYKVVAKVLSERLKQVMDAIISPTQSAFIEGRQILDPILIANEAVEDYR
        L T  YK++AKVL+ R++ V+   I  TQ AF++GRQILD +LIANE V++ R
Subjt:  LTTLAYKVVAKVLSERLKQVMDAIISPTQSAFIEGRQILDPILIANEAVEDYR

M5VS59 Reverse transcriptase domain-containing protein (Fragment)8.5e-6834.2Show/hide
Query:  ERKLVWPELSSLADCSSVAWCIGGDFNITRWARERFPFGRSTRGMSLFNKFIDM----------------------------------MWDVCFENSRVS
        ER   W EL+ L       WC+GGDFN+ R++ E+   GR T+ M  FN FI                                     W+  F + R  
Subjt:  ERKLVWPELSSLADCSSVAWCIGGDFNITRWARERFPFGRSTRGMSLFNKFIDM----------------------------------MWDVCFENSRVS

Query:  RKARIFSDHFPLFLEAGSFLWGPSPFRFCNSWLVSSDSNQIIVETVSNSNFQGWAGFILHEQLRSVKLAVKAWN------------IAQAKLIGLNQEE-
           RI SDH P+ L+     WGPSPFRF N WL   D  + I          GW G+    +L+ +K  +K W+             A+A+L+ L+Q E 
Subjt:  RKARIFSDHFPLFLEAGSFLWGPSPFRFCNSWLVSSDSNQIIVETVSNSNFQGWAGFILHEQLRSVKLAVKAWN------------IAQAKLIGLNQEE-

Query:  --------LNSRAALQAELLSIYQNEERNFIQKSKLNWLSSGDENTGFFHRFLNAKKRRNLITELKNDDGVVSTSFRDIERLVLEFFESLYTRIPGDRFI
                 + R  L  ++  + Q EE  + Q+ K+ W   GD NT FFHR  N  ++RN I +L+ +D  V     +IER V+ FF+ LY+      + 
Subjt:  --------LNSRAALQAELLSIYQNEERNFIQKSKLNWLSSGDENTGFFHRFLNAKKRRNLITELKNDDGVVSTSFRDIERLVLEFFESLYTRIPGDRFI

Query:  PINSNWSCVSSIQNEALVTQFSVEEIFKALKALGNNKAPSPDGFTVKFLITQWSIFKDIFKSLMSDFHRNGRLNACIQENFICLVQKKENVTLVKDFRPI
            NW  +S ++ + L   F +EE+ KA+   G +K+P PDGF++ F  + W + K     +M DF ++G +N    E FICL+ KK N   V D RPI
Subjt:  PINSNWSCVSSIQNEALVTQFSVEEIFKALKALGNNKAPSPDGFTVKFLITQWSIFKDIFKSLMSDFHRNGRLNACIQENFICLVQKKENVTLVKDFRPI

Query:  SLTTLAYKVVAKVLSERLKQVMDAIISPTQSAFIEGRQILDPILIANEAVEDYRAKRKK
        SL T  YKV++KVL+ RL++V+   IS +Q AF++ RQILD +L+ANE VE+ R +++K
Subjt:  SLTTLAYKVVAKVLSERLKQVMDAIISPTQSAFIEGRQILDPILIANEAVEDYRAKRKK

M5WPQ5 Reverse transcriptase domain-containing protein1.5e-6934.42Show/hide
Query:  ERKLVWPELSSLADCSSVAWCIGGDFNITRWARERFPFGRSTRGMSLFNKFIDM----------------------------------MWDVCFENSRVS
        ER   W EL+ L       WC+GGDFN+ R++ E+   GR T+ M  FN FI                                     W+  F + R  
Subjt:  ERKLVWPELSSLADCSSVAWCIGGDFNITRWARERFPFGRSTRGMSLFNKFIDM----------------------------------MWDVCFENSRVS

Query:  RKARIFSDHFPLFLEAGSFLWGPSPFRFCNSWLVSSDSNQIIVETVSNSNFQGWAGFILHEQLRSVKLAVKAWN------------IAQAKLIGLNQEE-
           RI SDH P+ L++    WGPSPFRF N WL   D  + I          GW G+    +L+ +K  +K W+             A+A+L+ L+Q E 
Subjt:  RKARIFSDHFPLFLEAGSFLWGPSPFRFCNSWLVSSDSNQIIVETVSNSNFQGWAGFILHEQLRSVKLAVKAWN------------IAQAKLIGLNQEE-

Query:  --------LNSRAALQAELLSIYQNEERNFIQKSKLNWLSSGDENTGFFHRFLNAKKRRNLITELKNDDGVVSTSFRDIERLVLEFFESLYTRIPGDRFI
                 + R  L  ++  + Q EE  + Q+ K+ W   GD NT FFHR  N  ++RN I +L+ +D  V     +IER V+ FF+ LY+R     + 
Subjt:  --------LNSRAALQAELLSIYQNEERNFIQKSKLNWLSSGDENTGFFHRFLNAKKRRNLITELKNDDGVVSTSFRDIERLVLEFFESLYTRIPGDRFI

Query:  PINSNWSCVSSIQNEALVTQFSVEEIFKALKALGNNKAPSPDGFTVKFLITQWSIFKDIFKSLMSDFHRNGRLNACIQENFICLVQKKENVTLVKDFRPI
            NW  +S ++ + L   F +EE+ KA+   G +K+P PDGF++ F  + W + K     +M DF ++G +N    E FICL+ KK N   V D+RPI
Subjt:  PINSNWSCVSSIQNEALVTQFSVEEIFKALKALGNNKAPSPDGFTVKFLITQWSIFKDIFKSLMSDFHRNGRLNACIQENFICLVQKKENVTLVKDFRPI

Query:  SLTTLAYKVVAKVLSERLKQVMDAIISPTQSAFIEGRQILDPILIANEAVEDYRAKRKK
        SL T  YKV++KVL+ RL++V+   IS +Q AF++ RQILD +L+ANE VE+ R +++K
Subjt:  SLTTLAYKVVAKVLSERLKQVMDAIISPTQSAFIEGRQILDPILIANEAVEDYRAKRKK

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein5.8e-1325.61Show/hide
Query:  QEELNSRAA-------LQAELLSIYQNEERNFIQKSKLNWLSSGDENTGFFHRFLNAKKRRNLITELKNDDGVVSTSFRDIERLVLEFFESLYTR-----
        QE+ +S+A+       ++AEL  I   +    I +S+  +    ++      R +  K+ +N I  +KND G ++T   +I+  + E+++ LY       
Subjt:  QEELNSRAA-------LQAELLSIYQNEERNFIQKSKLNWLSSGDENTGFFHRFLNAKKRRNLITELKNDDGVVSTSFRDIERLVLEFFESLYTR-----

Query:  IPGDRFIPINSNWSCVSSIQNEALVTQFSVEEIFKALKALGNNKAPSPDGFTVKFLITQWSIFKDIFKSLMSDFHRNGRLNACIQENFICLVQKK-ENVT
           D F+        ++  + E+L    +  EI   + +L   K+P PDGFT +F              L     + G L     E  I L+ K   + T
Subjt:  IPGDRFIPINSNWSCVSSIQNEALVTQFSVEEIFKALKALGNNKAPSPDGFTVKFLITQWSIFKDIFKSLMSDFHRNGRLNACIQENFICLVQKK-ENVT

Query:  LVKDFRPISLTTLAYKVVAKVLSERLKQVMDAIISPTQSAFIEGRQ
          ++FRPISL  +  K++ K+L+ R++Q +  +I   Q  FI G Q
Subjt:  LVKDFRPISLTTLAYKVVAKVLSERLKQVMDAIISPTQSAFIEGRQ

P08548 LINE-1 reverse transcriptase homolog4.9e-1226.09Show/hide
Query:  LQAELLSIYQNEERNFIQKSKLNWLSSGDENTGFFHRFLNAKKRRNLITELKNDDGVVSTSFRDIERLVLEFFESLYTR-----IPGDRFIPINSNWSCV
        ++AEL  I        I KSK  +    ++           K+ ++LI+ ++N +  ++T   +I++++ E+++ LY+         D+++    +   +
Subjt:  LQAELLSIYQNEERNFIQKSKLNWLSSGDENTGFFHRFLNAKKRRNLITELKNDDGVVSTSFRDIERLVLEFFESLYTR-----IPGDRFIPINSNWSCV

Query:  SSIQNEALVTQFSVEEIFKALKALGNNKAPSPDGFTVKFLITQWSIFKDIFKSLMSDFHRNGRLNACIQENFICLVQKK-ENVTLVKDFRPISLTTLAYK
        S  + E L    S  EI   ++ L   K+P PDGFT +F  T       I  +L  +  + G L     E  I L+ K  ++ T  +++RPISL  +  K
Subjt:  SSIQNEALVTQFSVEEIFKALKALGNNKAPSPDGFTVKFLITQWSIFKDIFKSLMSDFHRNGRLNACIQENFICLVQKK-ENVTLVKDFRPISLTTLAYK

Query:  VVAKVLSERLKQVMDAIISPTQSAFIEGRQ
        ++ K+L+ R++Q +  II   Q  FI G Q
Subjt:  VVAKVLSERLKQVMDAIISPTQSAFIEGRQ

P11369 LINE-1 retrotransposable element ORF2 protein4.0e-1428.35Show/hide
Query:  RFLNAKKRRNLITELKNDDGVVSTSFRDIERLVLEFFESLY-TRIPG----DRFIPINSNWSCVSSIQNEALVTQFSVEEIFKALKALGNNKAPSPDGFT
        R     + + LI +++N+ G ++T   +I+  +  F++ LY T++      D+F+        ++  Q + L +  S +EI   + +L   K+P PDGF+
Subjt:  RFLNAKKRRNLITELKNDDGVVSTSFRDIERLVLEFFESLY-TRIPG----DRFIPINSNWSCVSSIQNEALVTQFSVEEIFKALKALGNNKAPSPDGFT

Query:  VKFLITQWSIFKDIFKSLMSDFHRNGRLNACIQENFICLVQK-KENVTLVKDFRPISLTTLAYKVVAKVLSERLKQVMDAIISPTQSAFIEGRQ
         +F  T       I   L       G L     E  I L+ K +++ T +++FRPISL  +  K++ K+L+ R+++ + AII P Q  FI G Q
Subjt:  VKFLITQWSIFKDIFKSLMSDFHRNGRLNACIQENFICLVQK-KENVTLVKDFRPISLTTLAYKVVAKVLSERLKQVMDAIISPTQSAFIEGRQ

P14381 Transposon TX1 uncharacterized 149 kDa protein2.3e-2227.56Show/hide
Query:  QAKLIGLNQEELNSRAALQAELL-SIYQNEERNFIQKSKLNWLSSGDENTGFFHRFLNAKKRRNLITELKNDDGVVSTSFRDIERLVLEFFESLYTRIP-
        + +L G   + L      + E L ++ Q + R    +S++  L   D  + FF+     K  R  IT L  +DG        I      F+++L++  P 
Subjt:  QAKLIGLNQEELNSRAALQAELL-SIYQNEERNFIQKSKLNWLSSGDENTGFFHRFLNAKKRRNLITELKNDDGVVSTSFRDIERLVLEFFESLYTRIP-

Query:  -GDRFIPINSNWSCVSSIQNEALVTQFSVEEIFKALKALGNNKAPSPDGFTVKFLITQWSIFKDIFKSLMSDFHRNGRLNACIQENFICLVQKKENVTLV
          D    +      VS  + E L T  +++E+ +AL+ + +NK+P  DG T++F    W      F  ++++  + G L    +   + L+ KK ++ L+
Subjt:  -GDRFIPINSNWSCVSSIQNEALVTQFSVEEIFKALKALGNNKAPSPDGFTVKFLITQWSIFKDIFKSLMSDFHRNGRLNACIQENFICLVQKKENVTLV

Query:  KDFRPISLTTLAYKVVAKVLSERLKQVMDAIISPTQSAFIEGRQILDPILIANE
        K++RP+SL +  YK+VAK +S RLK V+  +I P QS  + GR I D + +  +
Subjt:  KDFRPISLTTLAYKVVAKVLSERLKQVMDAIISPTQSAFIEGRQILDPILIANE

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein3.5e-2128.35Show/hide
Query:  FILHEQLRSVKLAVKAWN-IAQAKLIGLNQEELNSRAALQAELL-----SIYQNE--------------ERNFIQKSKLNWLSSGDENTGFFHRFLNAKK
        F L E L++ K   K  N      +    +E L+S  ++Q++LL     S+++ E              E  + QKS++ WL  GD NT FFH+ + A +
Subjt:  FILHEQLRSVKLAVKAWN-IAQAKLIGLNQEELNSRAALQAELL-----SIYQNE--------------ERNFIQKSKLNWLSSGDENTGFFHRFLNAKK

Query:  RRNLITELKNDDGVVSTSFRDIERLVLEFFESLYTRIPGDRFIPINSNWSCVSSIQ----NEALVTQFSV----EEIFKALKALGNNKAPSPDGFTVKFL
         +NLI  L+ DD V   +   ++ +++ ++  L   +  D  I    +   +  I     N+ L ++ S     +EI  A+ A+  NKAP PD FT +F 
Subjt:  RRNLITELKNDDGVVSTSFRDIERLVLEFFESLYTRIPGDRFIPINSNWSCVSSIQ----NEALVTQFSV----EEIFKALKALGNNKAPSPDGFTVKFL

Query:  ITQWSIFKDIFKSLMSDFHRNGRLNACIQENFICLVQKKENVTLVKDFRPISLTTLAYKVV
           W + KD   + + +F R G L        I L+ K   V  +  FRP+S  T+ YK++
Subjt:  ITQWSIFKDIFKSLMSDFHRNGRLNACIQENFICLVQKKENVTLVKDFRPISLTTLAYKVV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGTTAAAGTGGGACCAAGTATTCTGTTAGCTTTCTTAGTGTTTGCCTATTTAGGTATTGTGAATGTTGGAGGTGGAGAAAGGAAATTGGTTTGGCCCGAATTGTC
TTCCTTGGCTGATTGTTCGTCCGTGGCCTGGTGCATAGGTGGGGATTTTAATATCACTCGTTGGGCCCGGGAAAGATTTCCTTTTGGCAGAAGTACTAGAGGAATGTCAT
TATTCAATAAGTTTATTGACATGATGTGGGATGTTTGTTTTGAAAATTCTCGGGTTTCTCGAAAAGCACGCATCTTCTCGGATCACTTTCCTTTATTCTTGGAGGCTGGT
TCTTTTCTTTGGGGTCCCTCTCCTTTTCGGTTTTGTAACAGCTGGTTGGTGTCCAGTGATTCTAATCAGATTATTGTTGAAACAGTGAGCAACTCTAACTTCCAGGGATG
GGCTGGTTTCATTCTTCATGAGCAGTTGAGATCAGTTAAATTGGCAGTAAAAGCTTGGAATATTGCGCAAGCTAAGTTGATTGGATTAAATCAAGAAGAGTTGAATTCTA
GAGCTGCTTTGCAAGCGGAATTACTTAGTATTTATCAAAACGAAGAGCGTAATTTTATTCAGAAGAGTAAACTCAATTGGCTTTCTTCGGGTGATGAGAATACGGGCTTC
TTCCACCGGTTTCTAAATGCGAAAAAAAGAAGGAACCTCATAACTGAATTAAAAAATGATGATGGGGTTGTCTCGACTTCATTCCGCGACATTGAAAGGCTTGTGCTGGA
ATTTTTTGAGTCACTTTATACCAGAATTCCGGGGGACAGATTCATCCCTATTAACAGTAATTGGTCTTGTGTTTCCTCAATTCAGAATGAGGCTCTTGTTACCCAATTTT
CAGTGGAGGAGATCTTTAAGGCATTAAAGGCACTTGGCAACAATAAAGCTCCGAGCCCGGATGGCTTCACAGTGAAGTTCCTAATCACCCAATGGTCTATTTTCAAGGAT
ATCTTCAAATCACTAATGTCTGATTTCCACAGAAATGGAAGATTAAATGCTTGCATCCAAGAAAACTTCATTTGTTTAGTGCAGAAAAAAGAGAATGTGACTCTAGTCAA
GGATTTCCGTCCAATCAGCCTTACTACGTTAGCATACAAGGTTGTTGCAAAGGTTTTATCTGAACGTCTAAAACAAGTTATGGATGCAATTATAAGCCCCACTCAAAGCG
CCTTTATTGAAGGTAGGCAAATTCTTGACCCAATATTAATTGCTAACGAGGCCGTGGAAGATTATAGGGCAAAACGGAAAAAGGATGAATTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGACTGTTAAAGTGGGACCAAGTATTCTGTTAGCTTTCTTAGTGTTTGCCTATTTAGGTATTGTGAATGTTGGAGGTGGAGAAAGGAAATTGGTTTGGCCCGAATTGTC
TTCCTTGGCTGATTGTTCGTCCGTGGCCTGGTGCATAGGTGGGGATTTTAATATCACTCGTTGGGCCCGGGAAAGATTTCCTTTTGGCAGAAGTACTAGAGGAATGTCAT
TATTCAATAAGTTTATTGACATGATGTGGGATGTTTGTTTTGAAAATTCTCGGGTTTCTCGAAAAGCACGCATCTTCTCGGATCACTTTCCTTTATTCTTGGAGGCTGGT
TCTTTTCTTTGGGGTCCCTCTCCTTTTCGGTTTTGTAACAGCTGGTTGGTGTCCAGTGATTCTAATCAGATTATTGTTGAAACAGTGAGCAACTCTAACTTCCAGGGATG
GGCTGGTTTCATTCTTCATGAGCAGTTGAGATCAGTTAAATTGGCAGTAAAAGCTTGGAATATTGCGCAAGCTAAGTTGATTGGATTAAATCAAGAAGAGTTGAATTCTA
GAGCTGCTTTGCAAGCGGAATTACTTAGTATTTATCAAAACGAAGAGCGTAATTTTATTCAGAAGAGTAAACTCAATTGGCTTTCTTCGGGTGATGAGAATACGGGCTTC
TTCCACCGGTTTCTAAATGCGAAAAAAAGAAGGAACCTCATAACTGAATTAAAAAATGATGATGGGGTTGTCTCGACTTCATTCCGCGACATTGAAAGGCTTGTGCTGGA
ATTTTTTGAGTCACTTTATACCAGAATTCCGGGGGACAGATTCATCCCTATTAACAGTAATTGGTCTTGTGTTTCCTCAATTCAGAATGAGGCTCTTGTTACCCAATTTT
CAGTGGAGGAGATCTTTAAGGCATTAAAGGCACTTGGCAACAATAAAGCTCCGAGCCCGGATGGCTTCACAGTGAAGTTCCTAATCACCCAATGGTCTATTTTCAAGGAT
ATCTTCAAATCACTAATGTCTGATTTCCACAGAAATGGAAGATTAAATGCTTGCATCCAAGAAAACTTCATTTGTTTAGTGCAGAAAAAAGAGAATGTGACTCTAGTCAA
GGATTTCCGTCCAATCAGCCTTACTACGTTAGCATACAAGGTTGTTGCAAAGGTTTTATCTGAACGTCTAAAACAAGTTATGGATGCAATTATAAGCCCCACTCAAAGCG
CCTTTATTGAAGGTAGGCAAATTCTTGACCCAATATTAATTGCTAACGAGGCCGTGGAAGATTATAGGGCAAAACGGAAAAAGGATGAATTTTAA
Protein sequenceShow/hide protein sequence
MTVKVGPSILLAFLVFAYLGIVNVGGGERKLVWPELSSLADCSSVAWCIGGDFNITRWARERFPFGRSTRGMSLFNKFIDMMWDVCFENSRVSRKARIFSDHFPLFLEAG
SFLWGPSPFRFCNSWLVSSDSNQIIVETVSNSNFQGWAGFILHEQLRSVKLAVKAWNIAQAKLIGLNQEELNSRAALQAELLSIYQNEERNFIQKSKLNWLSSGDENTGF
FHRFLNAKKRRNLITELKNDDGVVSTSFRDIERLVLEFFESLYTRIPGDRFIPINSNWSCVSSIQNEALVTQFSVEEIFKALKALGNNKAPSPDGFTVKFLITQWSIFKD
IFKSLMSDFHRNGRLNACIQENFICLVQKKENVTLVKDFRPISLTTLAYKVVAKVLSERLKQVMDAIISPTQSAFIEGRQILDPILIANEAVEDYRAKRKKDEF