CuGenDBv2

Lag0032599 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0032599
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr11:35119279..35121871
RNA-Seq Expression: Lag0032599
Synteny: Lag0032599
Gene Ontology terms: NA
InterPro domains:
  IPR000477 - Reverse transcriptase domain
  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
  IPR043502 - DNA/RNA polymerase superfamily
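
The genome location above can be split into its parts and turned into a span length. The following is a minimal sketch, not part of the database record: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


chrom, start, end = parse_location("chr11:35119279..35121871")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 2593 bp on chr11, which is longer than the CDS listed further down in the record.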


Homology
GenBank top hits (e value, % identity, alignment)
CAN78865.1 hypothetical protein VITISV_013346 [Vitis vinifera] | e value: 1.2e-43 | % identity: 30.15
Query:  LVILQEMKLPSIDRKIVKSVWSSRNIAWASVDAIGASGGIIILWNESTFDVVEIVEGIFSLSIHLSLADGFSFWITRVYGPNSSRDRKSFWKKLSDLQTL
        +V++QE K  + DR+ V SVW+ RN  W ++ A GASGGI+I+W+       E+V G FS+SI  SL      WI+ VYGPNS   RK FW +L D+  L
Subjt:  LVILQEMKLPSIDRKIVKSVWSSRNIAWASVDAIGASGGIIILWNESTFDVVEIVEGIFSLSIHLSLADGFSFWITRVYGPNSSRDRKSFWKKLSDLQTL

Query:  CLPNWIMGEITILLDGLRKNLPSQPLLLL---ILGGRIIYRMVGRVMTLFIWNQQSFGQQKEIRHRLNRELSIIDNMEENEPIFEEESKRRTEIKAELII
          P W    + +     ++N              G + + R+      L  WN+ SFG+ KE +  +  +L+  D +E+   +  +   +R   K EL  
Subjt:  CLPNWIMGEITILLDGLRKNLPSQPLLLL---ILGGRIIYRMVGRVMTLFIWNQQSFGQQKEIRHRLNRELSIIDNMEENEPIFEEESKRRTEIKAELII

Query:  LSANEEIMWRQ-----------------------------------------------------------------------------------------
        L   EEI WRQ                                                                                         
Subjt:  LSANEEIMWRQ-----------------------------------------------------------------------------------------

Query:  -------------------------------------------RLGDFRPISLTTCLYEIISRVLLERLKKVLPHTITEFQYAFVEGRQIMDASLIANEI
                                                   R+ DF+PISL T LY+II++VL  RL+ VL  TI   Q AFV+GRQI+DA LIANEI
Subjt:  -------------------------------------------RLGDFRPISLTTCLYEIISRVLLERLKKVLPHTITEFQYAFVEGRQIMDASLIANEI

Query:  IDEWNRRKKKGVVIKLDIEKAFDMVDWDFLDDTLKAKGFGECWRRWIGGCISSANFSIIIN
        +DE  R  ++GVV K+D EK +D V WDFLD  L+ KGF   WR+W+ GC+SS +++I++N
Subjt:  IDEWNRRKKKGVVIKLDIEKAFDMVDWDFLDDTLKAKGFGECWRRWIGGCISSANFSIIIN

RVW29586.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] | e value: 4.7e-45 | % identity: 34.46
Query:  LVILQEMKLPSIDRKIVKSVWSSRNIAWASVDAIGASGGIIILWNESTFDVVEIVEGIFSLSIHLSLADGFSFWITRVYGPNSSRDRKSFWKKLSDLQTL
        +V++QE K    DR++V SVW+ RN  W  + A GASGGI+ +W+       E+V G FS+S+  +L      WI+ VYGPNS   RK FW +L D+  L
Subjt:  LVILQEMKLPSIDRKIVKSVWSSRNIAWASVDAIGASGGIIILWNESTFDVVEIVEGIFSLSIHLSLADGFSFWITRVYGPNSSRDRKSFWKKLSDLQTL

Query:  CLP--------------NWIMGEITILLDGLRKNLPSQPLLLLILGG---RIIYRM-----VGRVMTLFIWNQQSF----------GQQKEIR-------
          P               W   + + +   L + L S     L   G    +I R      +      F+W    F            ++  R       
Subjt:  CLP--------------NWIMGEITILLDGLRKNLPSQPLLLLILGG---RIIYRM-----VGRVMTLFIWNQQSF----------GQQKEIR-------

Query:  ------HRLNRELSIIDNMEENEPIFEEE----SKRRTEIKA------ELIILS-----ANEEIMWRQRLGDFRPISLTTCLYEIISRVLLERLKKVLPH
              H+  R L  +     N   F  E     K+    KA       + +         E++   +++ DFRPISL T LY+II++VL  RL+ VL  
Subjt:  ------HRLNRELSIIDNMEENEPIFEEE----SKRRTEIKA------ELIILS-----ANEEIMWRQRLGDFRPISLTTCLYEIISRVLLERLKKVLPH

Query:  TITEFQYAFVEGRQIMDASLIANEIIDEWNRRKKKGVVIKLDIEKAFDMVDWDFLDDTLKAKGFGECWRRWIGGCISSANFSIIIN
        TI   Q AFV+GRQIMDA LIANEI+DE  R  ++GVV K+D EKA+D V WDFLD  L+ KGF   WR+W+ GC+SS ++++++N
Subjt:  TITEFQYAFVEGRQIMDASLIANEIIDEWNRRKKKGVVIKLDIEKAFDMVDWDFLDDTLKAKGFGECWRRWIGGCISSANFSIIIN

RVW30566.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] | e value: 1.2e-43 | % identity: 34.29
Query:  LVILQEMKLPSIDRKIVKSVWSSRNIAWASVDAIGASGGIIILWNESTFDVVEIVEGIFSLSIHLSLADGFSFWITRVYGPNSSRDRKSFWKKLSDLQTL
        +V++QE K    DR+ V SVW+ RN  WA+  A GAS G +I+W+       E+V G FS+S+  +L      W++ VYGPNSS  RK FW ++ D+  L
Subjt:  LVILQEMKLPSIDRKIVKSVWSSRNIAWASVDAIGASGGIIILWNESTFDVVEIVEGIFSLSIHLSLADGFSFWITRVYGPNSSRDRKSFWKKLSDLQTL

Query:  CLPNW-IMGEITIL------LDGLRKNLPSQ---------PLLLLILGGRIIYRM------VGRVMTLFIWN---QQSFGQ--QKEIRHRLNRELSII--
          P W + G+  ++      L G R     +          LL L L      R       V + +  F+++   +  F Q  Q+ +    +   SI+  
Subjt:  CLPNW-IMGEITIL------LDGLRKNLPSQ---------PLLLLILGGRIIYRM------VGRVMTLFIWN---QQSFGQ--QKEIRHRLNRELSII--

Query:  -DNMEENEP---------------IFEEESKR-----------RTEIKAELIILSANEEIMWRQRLGDFRPISLTTCLYEIISRVLLERLKKVLPHTITE
         +  ++  P               + +E+  R                A  I+L   + +   +++ DFRPISL T LY+II++VL  RL+ VL  TI  
Subjt:  -DNMEENEP---------------IFEEESKR-----------RTEIKAELIILSANEEIMWRQRLGDFRPISLTTCLYEIISRVLLERLKKVLPHTITE

Query:  FQYAFVEGRQIMDASLIANEIIDEWNRRKKKGVVIKLDIEKAFDMVDWDFLDDTLKAKGFGECWRRWIGGCISSANFSIIIN
         Q AFV+GRQI+D  LIAN+I+DE  R KK+GVV K+D EKA++ V WDFLD  L+ KGF   WR+W+ GC+S+ +F +++N
Subjt:  FQYAFVEGRQIMDASLIANEIIDEWNRRKKKGVVIKLDIEKAFDMVDWDFLDDTLKAKGFGECWRRWIGGCISSANFSIIIN

RVW38710.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] | e value: 1.2e-51 | % identity: 36.77
Query:  LVILQEMKLPSIDRKIVKSVWSSRNIAWASVDAIGASGGIIILWNESTFDVVEIVEGIFSLSIHLSLADGFSFWITRVYGPNSSRDRKSFWKKLSDLQTL
        +V++QE K    DR+ V SVW++RN  W ++   GASGGI+I+W+       E+V G FS+S+  SL      WI+ VYGPNS   RK FW +L D+  L
Subjt:  LVILQEMKLPSIDRKIVKSVWSSRNIAWASVDAIGASGGIIILWNESTFDVVEIVEGIFSLSIHLSLADGFSFWITRVYGPNSSRDRKSFWKKLSDLQTL

Query:  CLPNWIMGEITILLDGLRKNLPSQPLLLLILGGRIIYRMVGRVMTLFIWNQQSFGQQKEIRHRLNRELSIIDNMEEN--------EPIFEEESKRRT---
          P W   EI  +  G + N           G + + R+      L  WN+ SFG+ KE +  +  +L+  D +E+         + +F EE   +    
Subjt:  CLPNWIMGEITILLDGLRKNLPSQPLLLLILGGRIIYRMVGRVMTLFIWNQQSFGQQKEIRHRLNRELSIIDNMEEN--------EPIFEEESKRRT---

Query:  ----------------------EIKAELIIL------------SANEEIM-------WRQRLGDFRPISLTTCLYEIISRVLLERLKKVLPHTITEFQYA
                               IK +L+ +            S N   +         +R+ DFRPISL T LY+II++VL  RL+ VL  TI   Q A
Subjt:  ----------------------EIKAELIIL------------SANEEIM-------WRQRLGDFRPISLTTCLYEIISRVLLERLKKVLPHTITEFQYA

Query:  FVEGRQIMDASLIANEIIDEWNRRKKKGVVIKLDIEKAFDMVDWDFLDDTLKAKGFGECWRRWIGGCISSANFSIIIN
        FV+GRQI+DA LIANEI+DE  R  ++GVV K+D EKA+D V WDFLD  L+ KGF   WR+W+ GC+SS +++I++N
Subjt:  FVEGRQIMDASLIANEIIDEWNRRKKKGVVIKLDIEKAFDMVDWDFLDDTLKAKGFGECWRRWIGGCISSANFSIIIN

RVW89552.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] | e value: 5.3e-49 | % identity: 33.33
Query:  LVILQEMKLPSIDRKIVKSVWSSRNIAWASVDAIGASGGIIILWNESTFDVVEIVEGIFSLSIHLSLADGFSFWITRVYGPNSSRDRKSFWKKLSDLQTL
        +V++QE K    DR++V SVWS RN  WA++ A GASGG +I+W+       E+V G FS+SI  ++    + W++ VYGPN+S  RK FW +LSD+  L
Subjt:  LVILQEMKLPSIDRKIVKSVWSSRNIAWASVDAIGASGGIIILWNESTFDVVEIVEGIFSLSIHLSLADGFSFWITRVYGPNSSRDRKSFWKKLSDLQTL

Query:  CLPN------------------WIMGEITILLDGLR--------KNLPSQPLLLLILGGRIIYRMVGR--------------VMTLFIWNQQSFGQQKEI
          P                    I+G + +  +  +        +N+  Q        GR      G                  L  WN+ SFG+  + 
Subjt:  CLPN------------------WIMGEITILLDGLR--------KNLPSQPLLLLILGGRIIYRMVGR--------------VMTLFIWNQQSFGQQKEI

Query:  RHRLNRELSIIDNMEENEPIFEEESKRRTEIKAELIILSANEEIMWRQ----------------------------------------------------
        +  +   L+  D++E+   +  E   +R   K EL  L   EEI WRQ                                                    
Subjt:  RHRLNRELSIIDNMEENEPIFEEESKRRTEIKAELIILSANEEIMWRQ----------------------------------------------------

Query:  -----RLGDFRPISLTTCLYEIISRVLLERLKKVLPHTITEFQYAFVEGRQIMDASLIANEIIDEWNRRKKKGVVIKLDIEKAFDMVDWDFLDDTLKAKG
             R+ DFRPISL T LY+II++VL  RL+ VL  TI   Q AFV+GRQI+DA LIANEI+DE  R  ++GVV K+D EKA+D V WDFLD  L+ KG
Subjt:  -----RLGDFRPISLTTCLYEIISRVLLERLKKVLPHTITEFQYAFVEGRQIMDASLIANEIIDEWNRRKKKGVVIKLDIEKAFDMVDWDFLDDTLKAKG

Query:  FGECWRRWIGGCISSANFSIIIN
        F   WR+W+ GC+SS ++++++N
Subjt:  FGECWRRWIGGCISSANFSIIIN

TrEMBL top hits (e value, % identity, alignment)
A0A438D2A9 Transposon TX1 uncharacterized 149 kDa protein | e value: 2.3e-45 | % identity: 34.46
Query:  LVILQEMKLPSIDRKIVKSVWSSRNIAWASVDAIGASGGIIILWNESTFDVVEIVEGIFSLSIHLSLADGFSFWITRVYGPNSSRDRKSFWKKLSDLQTL
        +V++QE K    DR++V SVW+ RN  W  + A GASGGI+ +W+       E+V G FS+S+  +L      WI+ VYGPNS   RK FW +L D+  L
Subjt:  LVILQEMKLPSIDRKIVKSVWSSRNIAWASVDAIGASGGIIILWNESTFDVVEIVEGIFSLSIHLSLADGFSFWITRVYGPNSSRDRKSFWKKLSDLQTL

Query:  CLP--------------NWIMGEITILLDGLRKNLPSQPLLLLILGG---RIIYRM-----VGRVMTLFIWNQQSF----------GQQKEIR-------
          P               W   + + +   L + L S     L   G    +I R      +      F+W    F            ++  R       
Subjt:  CLP--------------NWIMGEITILLDGLRKNLPSQPLLLLILGG---RIIYRM-----VGRVMTLFIWNQQSF----------GQQKEIR-------

Query:  ------HRLNRELSIIDNMEENEPIFEEE----SKRRTEIKA------ELIILS-----ANEEIMWRQRLGDFRPISLTTCLYEIISRVLLERLKKVLPH
              H+  R L  +     N   F  E     K+    KA       + +         E++   +++ DFRPISL T LY+II++VL  RL+ VL  
Subjt:  ------HRLNRELSIIDNMEENEPIFEEE----SKRRTEIKA------ELIILS-----ANEEIMWRQRLGDFRPISLTTCLYEIISRVLLERLKKVLPH

Query:  TITEFQYAFVEGRQIMDASLIANEIIDEWNRRKKKGVVIKLDIEKAFDMVDWDFLDDTLKAKGFGECWRRWIGGCISSANFSIIIN
        TI   Q AFV+GRQIMDA LIANEI+DE  R  ++GVV K+D EKA+D V WDFLD  L+ KGF   WR+W+ GC+SS ++++++N
Subjt:  TITEFQYAFVEGRQIMDASLIANEIIDEWNRRKKKGVVIKLDIEKAFDMVDWDFLDDTLKAKGFGECWRRWIGGCISSANFSIIIN

A0A438D563 LINE-1 retrotransposable element ORF2 protein | e value: 5.6e-44 | % identity: 34.29
Query:  LVILQEMKLPSIDRKIVKSVWSSRNIAWASVDAIGASGGIIILWNESTFDVVEIVEGIFSLSIHLSLADGFSFWITRVYGPNSSRDRKSFWKKLSDLQTL
        +V++QE K    DR+ V SVW+ RN  WA+  A GAS G +I+W+       E+V G FS+S+  +L      W++ VYGPNSS  RK FW ++ D+  L
Subjt:  LVILQEMKLPSIDRKIVKSVWSSRNIAWASVDAIGASGGIIILWNESTFDVVEIVEGIFSLSIHLSLADGFSFWITRVYGPNSSRDRKSFWKKLSDLQTL

Query:  CLPNW-IMGEITIL------LDGLRKNLPSQ---------PLLLLILGGRIIYRM------VGRVMTLFIWN---QQSFGQ--QKEIRHRLNRELSII--
          P W + G+  ++      L G R     +          LL L L      R       V + +  F+++   +  F Q  Q+ +    +   SI+  
Subjt:  CLPNW-IMGEITIL------LDGLRKNLPSQ---------PLLLLILGGRIIYRM------VGRVMTLFIWN---QQSFGQ--QKEIRHRLNRELSII--

Query:  -DNMEENEP---------------IFEEESKR-----------RTEIKAELIILSANEEIMWRQRLGDFRPISLTTCLYEIISRVLLERLKKVLPHTITE
         +  ++  P               + +E+  R                A  I+L   + +   +++ DFRPISL T LY+II++VL  RL+ VL  TI  
Subjt:  -DNMEENEP---------------IFEEESKR-----------RTEIKAELIILSANEEIMWRQRLGDFRPISLTTCLYEIISRVLLERLKKVLPHTITE

Query:  FQYAFVEGRQIMDASLIANEIIDEWNRRKKKGVVIKLDIEKAFDMVDWDFLDDTLKAKGFGECWRRWIGGCISSANFSIIIN
         Q AFV+GRQI+D  LIAN+I+DE  R KK+GVV K+D EKA++ V WDFLD  L+ KGF   WR+W+ GC+S+ +F +++N
Subjt:  FQYAFVEGRQIMDASLIANEIIDEWNRRKKKGVVIKLDIEKAFDMVDWDFLDDTLKAKGFGECWRRWIGGCISSANFSIIIN

A0A438DTG0 Transposon TX1 uncharacterized 149 kDa protein | e value: 5.6e-52 | % identity: 36.77
Query:  LVILQEMKLPSIDRKIVKSVWSSRNIAWASVDAIGASGGIIILWNESTFDVVEIVEGIFSLSIHLSLADGFSFWITRVYGPNSSRDRKSFWKKLSDLQTL
        +V++QE K    DR+ V SVW++RN  W ++   GASGGI+I+W+       E+V G FS+S+  SL      WI+ VYGPNS   RK FW +L D+  L
Subjt:  LVILQEMKLPSIDRKIVKSVWSSRNIAWASVDAIGASGGIIILWNESTFDVVEIVEGIFSLSIHLSLADGFSFWITRVYGPNSSRDRKSFWKKLSDLQTL

Query:  CLPNWIMGEITILLDGLRKNLPSQPLLLLILGGRIIYRMVGRVMTLFIWNQQSFGQQKEIRHRLNRELSIIDNMEEN--------EPIFEEESKRRT---
          P W   EI  +  G + N           G + + R+      L  WN+ SFG+ KE +  +  +L+  D +E+         + +F EE   +    
Subjt:  CLPNWIMGEITILLDGLRKNLPSQPLLLLILGGRIIYRMVGRVMTLFIWNQQSFGQQKEIRHRLNRELSIIDNMEEN--------EPIFEEESKRRT---

Query:  ----------------------EIKAELIIL------------SANEEIM-------WRQRLGDFRPISLTTCLYEIISRVLLERLKKVLPHTITEFQYA
                               IK +L+ +            S N   +         +R+ DFRPISL T LY+II++VL  RL+ VL  TI   Q A
Subjt:  ----------------------EIKAELIIL------------SANEEIM-------WRQRLGDFRPISLTTCLYEIISRVLLERLKKVLPHTITEFQYA

Query:  FVEGRQIMDASLIANEIIDEWNRRKKKGVVIKLDIEKAFDMVDWDFLDDTLKAKGFGECWRRWIGGCISSANFSIIIN
        FV+GRQI+DA LIANEI+DE  R  ++GVV K+D EKA+D V WDFLD  L+ KGF   WR+W+ GC+SS +++I++N
Subjt:  FVEGRQIMDASLIANEIIDEWNRRKKKGVVIKLDIEKAFDMVDWDFLDDTLKAKGFGECWRRWIGGCISSANFSIIIN

A0A438HYP2 LINE-1 retrotransposable element ORF2 protein | e value: 2.6e-49 | % identity: 33.33
Query:  LVILQEMKLPSIDRKIVKSVWSSRNIAWASVDAIGASGGIIILWNESTFDVVEIVEGIFSLSIHLSLADGFSFWITRVYGPNSSRDRKSFWKKLSDLQTL
        +V++QE K    DR++V SVWS RN  WA++ A GASGG +I+W+       E+V G FS+SI  ++    + W++ VYGPN+S  RK FW +LSD+  L
Subjt:  LVILQEMKLPSIDRKIVKSVWSSRNIAWASVDAIGASGGIIILWNESTFDVVEIVEGIFSLSIHLSLADGFSFWITRVYGPNSSRDRKSFWKKLSDLQTL

Query:  CLPN------------------WIMGEITILLDGLR--------KNLPSQPLLLLILGGRIIYRMVGR--------------VMTLFIWNQQSFGQQKEI
          P                    I+G + +  +  +        +N+  Q        GR      G                  L  WN+ SFG+  + 
Subjt:  CLPN------------------WIMGEITILLDGLR--------KNLPSQPLLLLILGGRIIYRMVGR--------------VMTLFIWNQQSFGQQKEI

Query:  RHRLNRELSIIDNMEENEPIFEEESKRRTEIKAELIILSANEEIMWRQ----------------------------------------------------
        +  +   L+  D++E+   +  E   +R   K EL  L   EEI WRQ                                                    
Subjt:  RHRLNRELSIIDNMEENEPIFEEESKRRTEIKAELIILSANEEIMWRQ----------------------------------------------------

Query:  -----RLGDFRPISLTTCLYEIISRVLLERLKKVLPHTITEFQYAFVEGRQIMDASLIANEIIDEWNRRKKKGVVIKLDIEKAFDMVDWDFLDDTLKAKG
             R+ DFRPISL T LY+II++VL  RL+ VL  TI   Q AFV+GRQI+DA LIANEI+DE  R  ++GVV K+D EKA+D V WDFLD  L+ KG
Subjt:  -----RLGDFRPISLTTCLYEIISRVLLERLKKVLPHTITEFQYAFVEGRQIMDASLIANEIIDEWNRRKKKGVVIKLDIEKAFDMVDWDFLDDTLKAKG

Query:  FGECWRRWIGGCISSANFSIIIN
        F   WR+W+ GC+SS ++++++N
Subjt:  FGECWRRWIGGCISSANFSIIIN

A5BCE8 Reverse transcriptase domain-containing protein | e value: 5.6e-44 | % identity: 30.15
Query:  LVILQEMKLPSIDRKIVKSVWSSRNIAWASVDAIGASGGIIILWNESTFDVVEIVEGIFSLSIHLSLADGFSFWITRVYGPNSSRDRKSFWKKLSDLQTL
        +V++QE K  + DR+ V SVW+ RN  W ++ A GASGGI+I+W+       E+V G FS+SI  SL      WI+ VYGPNS   RK FW +L D+  L
Subjt:  LVILQEMKLPSIDRKIVKSVWSSRNIAWASVDAIGASGGIIILWNESTFDVVEIVEGIFSLSIHLSLADGFSFWITRVYGPNSSRDRKSFWKKLSDLQTL

Query:  CLPNWIMGEITILLDGLRKNLPSQPLLLL---ILGGRIIYRMVGRVMTLFIWNQQSFGQQKEIRHRLNRELSIIDNMEENEPIFEEESKRRTEIKAELII
          P W    + +     ++N              G + + R+      L  WN+ SFG+ KE +  +  +L+  D +E+   +  +   +R   K EL  
Subjt:  CLPNWIMGEITILLDGLRKNLPSQPLLLL---ILGGRIIYRMVGRVMTLFIWNQQSFGQQKEIRHRLNRELSIIDNMEENEPIFEEESKRRTEIKAELII

Query:  LSANEEIMWRQ-----------------------------------------------------------------------------------------
        L   EEI WRQ                                                                                         
Subjt:  LSANEEIMWRQ-----------------------------------------------------------------------------------------

Query:  -------------------------------------------RLGDFRPISLTTCLYEIISRVLLERLKKVLPHTITEFQYAFVEGRQIMDASLIANEI
                                                   R+ DF+PISL T LY+II++VL  RL+ VL  TI   Q AFV+GRQI+DA LIANEI
Subjt:  -------------------------------------------RLGDFRPISLTTCLYEIISRVLLERLKKVLPHTITEFQYAFVEGRQIMDASLIANEI

Query:  IDEWNRRKKKGVVIKLDIEKAFDMVDWDFLDDTLKAKGFGECWRRWIGGCISSANFSIIIN
        +DE  R  ++GVV K+D EK +D V WDFLD  L+ KGF   WR+W+ GC+SS +++I++N
Subjt:  IDEWNRRKKKGVVIKLDIEKAFDMVDWDFLDDTLKAKGFGECWRRWIGGCISSANFSIIIN

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein | e value: 3.4e-06 | % identity: 30.17
Query:  DFRPISLTTCLYEIISRVLLERLKKVLPHTITEFQYAFVEGRQIMDASLIANEIIDEWNRRKKKG-VVIKLDIEKAFDMVDWDFLDDTLKAKGFGECWRR
        +FRPISL     +I++++L  R+++ +   I   Q  F+ G Q       +  +I   NR K K  V+I +D EKAFD +   F+  TL   G    + +
Subjt:  DFRPISLTTCLYEIISRVLLERLKKVLPHTITEFQYAFVEGRQIMDASLIANEIIDEWNRRKKKG-VVIKLDIEKAFDMVDWDFLDDTLKAKGFGECWRR

Query:  WIGGCISSANFSIIIN
         I         +II+N
Subjt:  WIGGCISSANFSIIIN

P08548 LINE-1 reverse transcriptase homolog | e value: 1.3e-05 | % identity: 28.57
Query:  RLGDFRPISLTTCLYEIISRVLLERLKKVLPHTITEFQYAFVEGRQIMDASLIANEIIDEWNRRKKKG-VVIKLDIEKAFDMVDWDFLDDTLKAKGFGEC
        R  ++RPISL     +I++++L  R+++ +   I   Q  F+ G Q       +  +I   N+ K K  +++ +D EKAFD +   F+  TLK  G    
Subjt:  RLGDFRPISLTTCLYEIISRVLLERLKKVLPHTITEFQYAFVEGRQIMDASLIANEIIDEWNRRKKKG-VVIKLDIEKAFDMVDWDFLDDTLKAKGFGEC

Query:  WRRWIGGCISSANFSIIIN
        + + I    S    +II+N
Subjt:  WRRWIGGCISSANFSIIIN

P11369 LINE-1 retrotransposable element ORF2 protein | e value: 3.2e-04 | % identity: 27.73
Query:  RLGDFRPISLTTCLYEIISRVLLERLKKVLPHTITEFQYAFVEGRQIMDASLIANEIIDEWNRRKKKG-VVIKLDIEKAFDMVDWDFLDDTLKAKGFGEC
        ++ +FRPISL     +I++++L  R+++ +   I   Q  F+ G Q       +  +I   N+ K K  ++I LD EKAFD +   F+   L+  G    
Subjt:  RLGDFRPISLTTCLYEIISRVLLERLKKVLPHTITEFQYAFVEGRQIMDASLIANEIIDEWNRRKKKG-VVIKLDIEKAFDMVDWDFLDDTLKAKGFGEC

Query:  WRRWIGGCISSANFSIIIN
        +   I    S    +I +N
Subjt:  WRRWIGGCISSANFSIIIN

P14381 Transposon TX1 uncharacterized 149 kDa protein | e value: 3.6e-08 | % identity: 33.04
Query:  DFRPISLTTCLYEIISRVLLERLKKVLPHTITEFQYAFVEGRQIMDASLIANEIIDEWNRRKKKGVVIKLDIEKAFDMVDWDFLDDTLKAKGFGECWRRW
        ++RP+SL +  Y+I+++ +  RLK VL   I   Q   V GR I D   +  +++    R       + LD EKAFD VD  +L  TL+A  FG  +  +
Subjt:  DFRPISLTTCLYEIISRVLLERLKKVLPHTITEFQYAFVEGRQIMDASLIANEIIDEWNRRKKKGVVIKLDIEKAFDMVDWDFLDDTLKAKGFGECWRRW

Query:  IGGCISSANFSIIIN
        +    +SA   + IN
Subjt:  IGGCISSANFSIIIN

Arabidopsis top hits (e value, % identity, alignment)
AT4G20520.1 RNA binding;RNA-directed DNA polymerases | e value: 3.6e-11 | % identity: 35.58
Query:  LLERLKKVLPHTITEFQYAFVEGRQIMDASLIANEIIDEWNRRK--KKGVVIKLDIEKAFDMVDWDFLDDTLKAKGFGECWRRWIGGCISSANFSIIINV
        ++ERLK ++ + I   Q +F+ GR   D  +   E +    R+K  K  +++KLD+EKA+D + WD+L+DTL + GF E W   I      A   +   V
Subjt:  LLERLKKVLPHTITEFQYAFVEGRQIMDASLIANEIIDEWNRRK--KKGVVIKLDIEKAFDMVDWDFLDDTLKAKGFGECWRRWIGGCISSANFSIIINV

Query:  GRGD
        GR D
Subjt:  GRGD


Sequences
CDS sequence
ATGCAAGAAGAGGATTCGGAAACAAAGCTTCCATATCAGGCAAGCGACCCTGCGGTTTTCTTGCCGTTTCTTTTCCCTTGGCTGGCTGAACATGCTCTAGTGATTCTTCA
AGAGATGAAGCTCCCATCCATTGATAGAAAAATAGTCAAATCGGTTTGGAGCTCAAGGAATATTGCTTGGGCTTCGGTTGATGCTATTGGTGCCTCGGGAGGGATTATCA
TCCTGTGGAATGAATCCACCTTTGATGTTGTAGAGATTGTCGAGGGTATCTTCTCTTTATCCATCCATCTCTCTCTTGCGGATGGTTTTTCCTTTTGGATCACAAGAGTA
TATGGCCCAAATTCTTCTCGTGATAGGAAATCTTTTTGGAAGAAGTTATCTGATTTGCAAACCCTCTGCCTTCCGAACTGGATTATGGGGGAGATTACAATATTACTAGA
TGGTCTACGAAAAAATCTACCTTCACAGCCCCTACTCTTGTTGATTCTTGGTGGAAGGATAATATATCGCATGGTTGGCCGGGTCATGACTTTATTCATCTGGAATCAAC
AATCTTTTGGGCAACAAAAGGAGATTAGGCACAGGCTGAACCGTGAGCTCTCTATCATAGACAACATGGAGGAAAATGAGCCAATTTTCGAGGAAGAATCCAAAAGAAGA
ACTGAAATCAAGGCGGAATTGATTATCTTATCAGCCAATGAAGAGATTATGTGGCGCCAAAGATTGGGAGATTTTAGGCCTATAAGCCTCACCACTTGCCTCTATGAGAT
CATTTCTAGAGTCCTCTTGGAAAGATTAAAGAAGGTCCTCCCACACACGATCACAGAATTCCAATATGCTTTTGTTGAAGGGCGTCAAATTATGGACGCTTCCTTAATAG
CTAACGAAATCATCGATGAATGGAATAGAAGAAAAAAGAAGGGTGTTGTTATTAAGCTCGACATTGAAAAAGCTTTCGATATGGTGGATTGGGACTTTCTCGACGACACT
CTTAAAGCAAAGGGGTTTGGTGAATGTTGGAGAAGATGGATTGGAGGATGCATATCTTCTGCGAATTTTTCAATCATTATCAATGTCGGAAGAGGTGACAAAACATTATT
CTGGGAGGATATTTGGCTCGGCACATCATCACTGCAATCCAAATACCCTGCTCTATACAATCTATCTTTAAAGAAAGAGGCCACTATTGCTGAGTTATGGAATCCAGAAA
ACGGGGCTTGGAATTTACATCTTAGGAGACATTTACGTGACTCTGAATCCCTGGAATGGGCAGTTATGTCTCACCATTTATCCACCTTTTCTATCCGGGATGTGGATGAC
TTATGGTCTTGGCAGCTTGGAGATAGAGACATTTCTCCACAGGATCCCTTACCAAAAGCTTGGCCTCCCTCCCTTTACCTAATTGTAGGACTTTTACAGCCTTCTATGGA
GAGGACCTATGCCTATAAAAGTTAA
mRNA sequence
ATGCAAGAAGAGGATTCGGAAACAAAGCTTCCATATCAGGCAAGCGACCCTGCGGTTTTCTTGCCGTTTCTTTTCCCTTGGCTGGCTGAACATGCTCTAGTGATTCTTCA
AGAGATGAAGCTCCCATCCATTGATAGAAAAATAGTCAAATCGGTTTGGAGCTCAAGGAATATTGCTTGGGCTTCGGTTGATGCTATTGGTGCCTCGGGAGGGATTATCA
TCCTGTGGAATGAATCCACCTTTGATGTTGTAGAGATTGTCGAGGGTATCTTCTCTTTATCCATCCATCTCTCTCTTGCGGATGGTTTTTCCTTTTGGATCACAAGAGTA
TATGGCCCAAATTCTTCTCGTGATAGGAAATCTTTTTGGAAGAAGTTATCTGATTTGCAAACCCTCTGCCTTCCGAACTGGATTATGGGGGAGATTACAATATTACTAGA
TGGTCTACGAAAAAATCTACCTTCACAGCCCCTACTCTTGTTGATTCTTGGTGGAAGGATAATATATCGCATGGTTGGCCGGGTCATGACTTTATTCATCTGGAATCAAC
AATCTTTTGGGCAACAAAAGGAGATTAGGCACAGGCTGAACCGTGAGCTCTCTATCATAGACAACATGGAGGAAAATGAGCCAATTTTCGAGGAAGAATCCAAAAGAAGA
ACTGAAATCAAGGCGGAATTGATTATCTTATCAGCCAATGAAGAGATTATGTGGCGCCAAAGATTGGGAGATTTTAGGCCTATAAGCCTCACCACTTGCCTCTATGAGAT
CATTTCTAGAGTCCTCTTGGAAAGATTAAAGAAGGTCCTCCCACACACGATCACAGAATTCCAATATGCTTTTGTTGAAGGGCGTCAAATTATGGACGCTTCCTTAATAG
CTAACGAAATCATCGATGAATGGAATAGAAGAAAAAAGAAGGGTGTTGTTATTAAGCTCGACATTGAAAAAGCTTTCGATATGGTGGATTGGGACTTTCTCGACGACACT
CTTAAAGCAAAGGGGTTTGGTGAATGTTGGAGAAGATGGATTGGAGGATGCATATCTTCTGCGAATTTTTCAATCATTATCAATGTCGGAAGAGGTGACAAAACATTATT
CTGGGAGGATATTTGGCTCGGCACATCATCACTGCAATCCAAATACCCTGCTCTATACAATCTATCTTTAAAGAAAGAGGCCACTATTGCTGAGTTATGGAATCCAGAAA
ACGGGGCTTGGAATTTACATCTTAGGAGACATTTACGTGACTCTGAATCCCTGGAATGGGCAGTTATGTCTCACCATTTATCCACCTTTTCTATCCGGGATGTGGATGAC
TTATGGTCTTGGCAGCTTGGAGATAGAGACATTTCTCCACAGGATCCCTTACCAAAAGCTTGGCCTCCCTCCCTTTACCTAATTGTAGGACTTTTACAGCCTTCTATGGA
GAGGACCTATGCCTATAAAAGTTAA
Protein sequence
MQEEDSETKLPYQASDPAVFLPFLFPWLAEHALVILQEMKLPSIDRKIVKSVWSSRNIAWASVDAIGASGGIIILWNESTFDVVEIVEGIFSLSIHLSLADGFSFWITRV
YGPNSSRDRKSFWKKLSDLQTLCLPNWIMGEITILLDGLRKNLPSQPLLLLILGGRIIYRMVGRVMTLFIWNQQSFGQQKEIRHRLNRELSIIDNMEENEPIFEEESKRR
TEIKAELIILSANEEIMWRQRLGDFRPISLTTCLYEIISRVLLERLKKVLPHTITEFQYAFVEGRQIMDASLIANEIIDEWNRRKKKGVVIKLDIEKAFDMVDWDFLDDT
LKAKGFGECWRRWIGGCISSANFSIIINVGRGDKTLFWEDIWLGTSSLQSKYPALYNLSLKKEATIAELWNPENGAWNLHLRRHLRDSESLEWAVMSHHLSTFSIRDVDD
LWSWQLGDRDISPQDPLPKAWPPSLYLIVGLLQPSMERTYAYKS
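
One way to check that the CDS and protein sequences above agree is to translate the CDS and compare the result. The following is a minimal sketch, not a tool provided by the database: it assumes the standard nuclear genetic code (NCBI translation table 1), and the `translate` helper is a hypothetical name introduced for illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the first codon position varying slowest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a CDS string, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 13 codons of the CDS listed above.
print(translate("ATGCAAGAAGAGGATTCGGAAACAAAGCTTCCATATCAG"))
```

Applied to the start of the CDS, this reproduces the first residues of the protein sequence (MQEEDSETKLPYQ). Passing the full CDS, with its line breaks left in, should allow the same comparison against the whole protein.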