CuGenDBv2

Spg002705 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg002705
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Transposon TX1 uncharacterized 149 kDa protein
Genome location: scaffold6:501253..504286
RNA-Seq Expression: Spg002705
Synteny: Spg002705
Gene Ontology terms: GO:0003824 - catalytic activity (molecular function)
InterPro domains:
    IPR005135 - Endonuclease/exonuclease/phosphatase
    IPR025558 - Domain of unknown function DUF4283
    IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology
GenBank top hits | e value | % identity | Alignment
KAA0056838.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | 3.6e-52 | 25.64
Query:  VPSYGGWIKIRNLPLDKWSIETFRKIGDDCGGYLETANKTLARMDMMEVCIKIKENSFGFIPAEIHLPSSSAS----PTIAKIDPFFMEDYNIGYIASIH
        +PSYGGW++ R +PL  W+  TF+ IG  CGG+L+ A +T+    +++  IK++ N  GF+PA I +  +        T+   +  ++ + N+     +H
Subjt:  VPSYGGWIKIRNLPLDKWSIETFRKIGDDCGGYLETANKTLARMDMMEVCIKIKENSFGFIPAEIHLPSSSAS----PTIAKIDPFFMEDYNIGYIASIH

Query:  GKIPATMVDREAAKEDDGHTRVV--------------PARLE---TIVEKVQTSYKLETQRSPNDEIYSVSNLTP---------------ALLLSDSDGG
        G         +AA E D H  +               P R     +I    + S    TQ   N+   S S   P                L+++D + G
Subjt:  GKIPATMVDREAAKEDDGHTRVV--------------PARLE---TIVEKVQTSYKLETQRSPNDEIYSVSNLTP---------------ALLLSDSDGG

Query:  VSSPCSTPLEQ---SPMIPRG-----SPPSVSPPSIGILFDEVND------------------------QQHQIDHPCPLRIENPDHRNTILSI------
          S  S  +     S + P G     S   ++     +    +ND                        Q+   DH   L+      +   LS+      
Subjt:  VSSPCSTPLEQ---SPMIPRG-----SPPSVSPPSIGILFDEVND------------------------QQHQIDHPCPLRIENPDHRNTILSI------

Query:  -----------------------------DEADQSLIDICVEE-----KDSDEFYTEAVHNDPATYLPLLFPWLAEHGMCIMPMPNRQKLSNTAKKKV--
                                     D A+   + + V+E     K +     E    D  T   +      +  + I    N  KLS      V  
Subjt:  -----------------------------DEADQSLIDICVEE-----KDSDEFYTEAVHNDPATYLPLLFPWLAEHGMCIMPMPNRQKLSNTAKKKV--

Query:  KKIIKSIWSSRNIAWTSLDSEGASGGIVILWNESSFDVKKIVEG-------------------VYGPNSSKERRLFWLELMDLQALCLPNWILGGDFNVT
              I S +N+        G  GGI++LW+++ F V  I  G                   VYGP    +R   W EL  LQ+LCLPNW++ GDFN+ 
Subjt:  KKIIKSIWSSRNIAWTSLDSEGASGGIVILWNESSFDVKKIVEG-------------------VYGPNSSKERRLFWLELMDLQALCLPNWILGGDFNVT

Query:  RWTWEKSTQSAPSRAMKKFNRFIEESDLLDIPLSNGKFTWSSFRPNPTMTLIDRYLLSDSIVAKFSAASARRLDRITSDHFPISLTLGKEKWGPAPFKFN
        RW  E + +S   R M  FN FI  ++L+D PL N  FTWS+ R NPT + +DR+LLS      F   ++R L+R  SDHFPI L   + KWGP PF+ N
Subjt:  RWTWEKSTQSAPSRAMKKFNRFIEESDLLDIPLSNGKFTWSSFRPNPTMTLIDRYLLSDSIVAKFSAASARRLDRITSDHFPISLTLGKEKWGPAPFKFN

Query:  NAWLSHHSFHNTVDIWWKNNLSQG-PGH---------KKELKQWNQSVFGNTKQQRYSLNSELSNLDMMEELGRLSEQDALRRTEIKAQLVLLSANEEIM
        N+ L    F      WW N+   G PG+          K +K+W  +        + +L  E+  +D +E  G +S     +R  +K+ L+ +  N+  +
Subjt:  NAWLSHHSFHNTVDIWWKNNLSQG-PGH---------KKELKQWNQSVFGNTKQQRYSLNSELSNLDMMEELGRLSEQDALRRTEIKAQLVLLSANEEIM

Query:  WRQKCK
        W Q+ +
Subjt:  WRQKCK

RVW12714.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] | 3.6e-52 | 37.83
Query:  PNRQKLSNTAKKKV-KKIIKSIWSSRNIAWTSLDSEGASGGIVILWNESSFDVKKIVEGVYGPNSSKERRLFWLELMDLQALCLPNWILGGDFNVTRWTW
        P+   +  T K++  ++++ S+WS RN  W +L + GASGGI+I+W+      +++V  VYGPN+S  R+ FW+EL D+  L  P W +GGDFNV R + 
Subjt:  PNRQKLSNTAKKKV-KKIIKSIWSSRNIAWTSLDSEGASGGIVILWNESSFDVKKIVEGVYGPNSSKERRLFWLELMDLQALCLPNWILGGDFNVTRWTW

Query:  EKSTQSAPSRAMKKFNRFIEESDLLDIPLSNGKFTWSSFRPNPTMTLIDRYLLSDSIVAKFSAASARRLDRITSDHFPISLTLGKEKWGPAPFKFNNAWL
        EK   S  +  MK F+ FI + +L+D PL +  +TWS+ + NP    +DR+L S+     F  +    L R TSDH+PI L     KWGP PF+F N WL
Subjt:  EKSTQSAPSRAMKKFNRFIEESDLLDIPLSNGKFTWSSFRPNPTMTLIDRYLLSDSIVAKFSAASARRLDRITSDHFPISLTLGKEKWGPAPFKFNNAWL

Query:  SHHSFHNTVDIWWKNNLSQG-PGH---------KKELKQWNQSVFGNTKQQRYSLNSELSNLDMMEELGRLSEQDALRRTEIKAQLVLLSANEEIMWRQK
         H +F      WW      G  GH         K +LK+WN++ FG   +++  + + L+N D +E+ G LS +  ++R   K +L  L   EEI WRQK
Subjt:  SHHSFHNTVDIWWKNNLSQG-PGH---------KKELKQWNQSVFGNTKQQRYSLNSELSNLDMMEELGRLSEQDALRRTEIKAQLVLLSANEEIMWRQK

Query:  CKLK
         ++K
Subjt:  CKLK

RVW77758.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] | 2.8e-52 | 38.51
Query:  KKIIKSIWSSRNIAWTSLDSEGASGGIVILWNES------------SFDVKKIVEG--------VYGPNSSKERRLFWLELMDLQALCLPNWILGGDFNV
        ++ + S+WS RN  W  L + GA GGI+I+W+              S  VK +++G        VYGPN+   R+ FW EL DL  L  P+W +GGDFNV
Subjt:  KKIIKSIWSSRNIAWTSLDSEGASGGIVILWNES------------SFDVKKIVEG--------VYGPNSSKERRLFWLELMDLQALCLPNWILGGDFNV

Query:  TRWTWEKSTQSAPSRAMKKFNRFIEESDLLDIPLSNGKFTWSSFRPNPTMTLIDRYLLSDSIVAKFSAASARRLDRITSDHFPISLTLGKEKWGPAPFKF
         R   EK   S  + +M+ F+ FI ES+L D PL N  FTWS+ + +P    +DR+L S+     F  +    L R TSDH+PI L     KWGP PF+F
Subjt:  TRWTWEKSTQSAPSRAMKKFNRFIEESDLLDIPLSNGKFTWSSFRPNPTMTLIDRYLLSDSIVAKFSAASARRLDRITSDHFPISLTLGKEKWGPAPFKF

Query:  NNAWLSHHSFHNTVDIWWKNNLSQG-PGH---------KKELKQWNQSVFGNTKQQRYSLNSELSNLDMMEELGRLSEQDALRRTEIKAQLVLLSANEEI
         N WL HH+F  +   WW+     G  GH         K +LK WN++ FG  K+++ S++ E++N+D +E+ G LS      R   K +L  L   EEI
Subjt:  NNAWLSHHSFHNTVDIWWKNNLSQG-PGH---------KKELKQWNQSVFGNTKQQRYSLNSELSNLDMMEELGRLSEQDALRRTEIKAQLVLLSANEEI

Query:  MWRQKCKLK
         W+QK K+K
Subjt:  MWRQKCKLK

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | 3.6e-52 | 25.64
Query:  VPSYGGWIKIRNLPLDKWSIETFRKIGDDCGGYLETANKTLARMDMMEVCIKIKENSFGFIPAEIHLPSSSAS----PTIAKIDPFFMEDYNIGYIASIH
        +PSYGGW++ R +PL  W+  TF+ IG  CGG+L+ A +T+    +++  IK++ N  GF+PA I +  +        T+   +  ++ + N+     +H
Subjt:  VPSYGGWIKIRNLPLDKWSIETFRKIGDDCGGYLETANKTLARMDMMEVCIKIKENSFGFIPAEIHLPSSSAS----PTIAKIDPFFMEDYNIGYIASIH

Query:  GKIPATMVDREAAKEDDGHTRVV--------------PARLE---TIVEKVQTSYKLETQRSPNDEIYSVSNLTP---------------ALLLSDSDGG
        G         +AA E D H  +               P R     +I    + S    TQ   N+   S S   P                L+++D D G
Subjt:  GKIPATMVDREAAKEDDGHTRVV--------------PARLE---TIVEKVQTSYKLETQRSPNDEIYSVSNLTP---------------ALLLSDSDGG

Query:  VSSPCSTPLEQ---SPMIPRG-----SPPSVSPPSIGILFDEVND------------------------QQHQIDHPCPLRIENPDHRNTILSI------
          S  S  +     S + P G     S   ++     +    +ND                        Q+   DH   L+      +   LS+      
Subjt:  VSSPCSTPLEQ---SPMIPRG-----SPPSVSPPSIGILFDEVND------------------------QQHQIDHPCPLRIENPDHRNTILSI------

Query:  -----------------------------DEADQSLIDICVEE-----KDSDEFYTEAVHNDPATYLPLLFPWLAEHGMCIMPMPNRQKLSNTAKKKV--
                                     D A+   + + V+E     K +     E    D  T   L      +  + I    N  KLS      V  
Subjt:  -----------------------------DEADQSLIDICVEE-----KDSDEFYTEAVHNDPATYLPLLFPWLAEHGMCIMPMPNRQKLSNTAKKKV--

Query:  KKIIKSIWSSRNIAWTSLDSEGASGGIVILWNESSFDVKKIVEG-------------------VYGPNSSKERRLFWLELMDLQALCLPNWILGGDFNVT
              I S +N+        G  GGI++LW++++F V  I  G                   VYGP    +R   W EL  LQ+LCLPNW++ GDFN+ 
Subjt:  KKIIKSIWSSRNIAWTSLDSEGASGGIVILWNESSFDVKKIVEG-------------------VYGPNSSKERRLFWLELMDLQALCLPNWILGGDFNVT

Query:  RWTWEKSTQSAPSRAMKKFNRFIEESDLLDIPLSNGKFTWSSFRPNPTMTLIDRYLLSDSIVAKFSAASARRLDRITSDHFPISLTLGKEKWGPAPFKFN
        RW  E + +S   R M  FN FI  ++L+D P  N  FTWS+ R NPT + +DR+LLS      F   ++R L+R  SDHFPI L   + KWGP PF+ N
Subjt:  RWTWEKSTQSAPSRAMKKFNRFIEESDLLDIPLSNGKFTWSSFRPNPTMTLIDRYLLSDSIVAKFSAASARRLDRITSDHFPISLTLGKEKWGPAPFKFN

Query:  NAWLSHHSFHNTVDIWWKNNLSQG-PGH---------KKELKQWNQSVFGNTKQQRYSLNSELSNLDMMEELGRLSEQDALRRTEIKAQLVLLSANEEIM
        N+ L    F      WW ++   G PG+          K +K+W  +        + +L  E+  +D +E  G +S     +R  +K+ L+ +  N+  +
Subjt:  NAWLSHHSFHNTVDIWWKNNLSQG-PGH---------KKELKQWNQSVFGNTKQQRYSLNSELSNLDMMEELGRLSEQDALRRTEIKAQLVLLSANEEIM

Query:  WRQKCK
        W Q+ +
Subjt:  WRQKCK

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia] | 6.3e-57 | 39.09
Query:  IIKSIWSSRNIAWTSLDSEGASGGIVILWNESSFDVKKIVEGV--------------------YGPNSSKERRLFWLELMDLQALCLPNWILGGDFNVTR
        I+KS+WS+  I W++LD+ G + GI+ILWN+      +++EGV                    YGP++++   LFW EL+DL  LC  +WIL GDFNVTR
Subjt:  IIKSIWSSRNIAWTSLDSEGASGGIVILWNESSFDVKKIVEGV--------------------YGPNSSKERRLFWLELMDLQALCLPNWILGGDFNVTR

Query:  WTWEKSTQSAPSRAMKKFNRFIEESDLLDIPLSNGKFTWSSFRPNPTMTLIDRYLLSDSIVAKFSAASARRLDRITSDHFPISLTLGKEKWGPAPFKFNN
        W+WEKS     +++M  FN FIE+S L+D+PL+NG+ TWS    N + +LID +LL++  + K     A+R+ R TSDHFPI L  G+  WG  PF+F N
Subjt:  WTWEKSTQSAPSRAMKKFNRFIEESDLLDIPLSNGKFTWSSFRPNPTMTLIDRYLLSDSIVAKFSAASARRLDRITSDHFPISLTLGKEKWGPAPFKFNN

Query:  AWLSHHSFHNTVDIWWKNNLSQG-PGH---------KKELKQWNQSVFGNTKQQRYSLNSELSNLDMMEELGRLSEQDALRRTEIKAQLVLLSANEEIMW
         WLSH +F   ++ WW N    G PGH         K  +K W    F     Q+  L + +++LD +E    ++   +  R + K  L+ + A EE  W
Subjt:  AWLSHHSFHNTVDIWWKNNLSQG-PGH---------KKELKQWNQSVFGNTKQQRYSLNSELSNLDMMEELGRLSEQDALRRTEIKAQLVLLSANEEIMW

Query:  RQKCKLK
        RQ+CK K
Subjt:  RQKCKLK

TrEMBL top hits | e value | % identity | Alignment
A0A438GZW0 LINE-1 retrotransposable element ORF2 protein | 1.3e-52 | 38.51
Query:  KKIIKSIWSSRNIAWTSLDSEGASGGIVILWNES------------SFDVKKIVEG--------VYGPNSSKERRLFWLELMDLQALCLPNWILGGDFNV
        ++ + S+WS RN  W  L + GA GGI+I+W+              S  VK +++G        VYGPN+   R+ FW EL DL  L  P+W +GGDFNV
Subjt:  KKIIKSIWSSRNIAWTSLDSEGASGGIVILWNES------------SFDVKKIVEG--------VYGPNSSKERRLFWLELMDLQALCLPNWILGGDFNV

Query:  TRWTWEKSTQSAPSRAMKKFNRFIEESDLLDIPLSNGKFTWSSFRPNPTMTLIDRYLLSDSIVAKFSAASARRLDRITSDHFPISLTLGKEKWGPAPFKF
         R   EK   S  + +M+ F+ FI ES+L D PL N  FTWS+ + +P    +DR+L S+     F  +    L R TSDH+PI L     KWGP PF+F
Subjt:  TRWTWEKSTQSAPSRAMKKFNRFIEESDLLDIPLSNGKFTWSSFRPNPTMTLIDRYLLSDSIVAKFSAASARRLDRITSDHFPISLTLGKEKWGPAPFKF

Query:  NNAWLSHHSFHNTVDIWWKNNLSQG-PGH---------KKELKQWNQSVFGNTKQQRYSLNSELSNLDMMEELGRLSEQDALRRTEIKAQLVLLSANEEI
         N WL HH+F  +   WW+     G  GH         K +LK WN++ FG  K+++ S++ E++N+D +E+ G LS      R   K +L  L   EEI
Subjt:  NNAWLSHHSFHNTVDIWWKNNLSQG-PGH---------KKELKQWNQSVFGNTKQQRYSLNSELSNLDMMEELGRLSEQDALRRTEIKAQLVLLSANEEI

Query:  MWRQKCKLK
         W+QK K+K
Subjt:  MWRQKCKLK

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein | 1.7e-52 | 25.64
Query:  VPSYGGWIKIRNLPLDKWSIETFRKIGDDCGGYLETANKTLARMDMMEVCIKIKENSFGFIPAEIHLPSSSAS----PTIAKIDPFFMEDYNIGYIASIH
        +PSYGGW++ R +PL  W+  TF+ IG  CGG+L+ A +T+    +++  IK++ N  GF+PA I +  +        T+   +  ++ + N+     +H
Subjt:  VPSYGGWIKIRNLPLDKWSIETFRKIGDDCGGYLETANKTLARMDMMEVCIKIKENSFGFIPAEIHLPSSSAS----PTIAKIDPFFMEDYNIGYIASIH

Query:  GKIPATMVDREAAKEDDGHTRVV--------------PARLE---TIVEKVQTSYKLETQRSPNDEIYSVSNLTP---------------ALLLSDSDGG
        G         +AA E D H  +               P R     +I    + S    TQ   N+   S S   P                L+++D D G
Subjt:  GKIPATMVDREAAKEDDGHTRVV--------------PARLE---TIVEKVQTSYKLETQRSPNDEIYSVSNLTP---------------ALLLSDSDGG

Query:  VSSPCSTPLEQ---SPMIPRG-----SPPSVSPPSIGILFDEVND------------------------QQHQIDHPCPLRIENPDHRNTILSI------
          S  S  +     S + P G     S   ++     +    +ND                        Q+   DH   L+      +   LS+      
Subjt:  VSSPCSTPLEQ---SPMIPRG-----SPPSVSPPSIGILFDEVND------------------------QQHQIDHPCPLRIENPDHRNTILSI------

Query:  -----------------------------DEADQSLIDICVEE-----KDSDEFYTEAVHNDPATYLPLLFPWLAEHGMCIMPMPNRQKLSNTAKKKV--
                                     D A+   + + V+E     K +     E    D  T   L      +  + I    N  KLS      V  
Subjt:  -----------------------------DEADQSLIDICVEE-----KDSDEFYTEAVHNDPATYLPLLFPWLAEHGMCIMPMPNRQKLSNTAKKKV--

Query:  KKIIKSIWSSRNIAWTSLDSEGASGGIVILWNESSFDVKKIVEG-------------------VYGPNSSKERRLFWLELMDLQALCLPNWILGGDFNVT
              I S +N+        G  GGI++LW++++F V  I  G                   VYGP    +R   W EL  LQ+LCLPNW++ GDFN+ 
Subjt:  KKIIKSIWSSRNIAWTSLDSEGASGGIVILWNESSFDVKKIVEG-------------------VYGPNSSKERRLFWLELMDLQALCLPNWILGGDFNVT

Query:  RWTWEKSTQSAPSRAMKKFNRFIEESDLLDIPLSNGKFTWSSFRPNPTMTLIDRYLLSDSIVAKFSAASARRLDRITSDHFPISLTLGKEKWGPAPFKFN
        RW  E + +S   R M  FN FI  ++L+D P  N  FTWS+ R NPT + +DR+LLS      F   ++R L+R  SDHFPI L   + KWGP PF+ N
Subjt:  RWTWEKSTQSAPSRAMKKFNRFIEESDLLDIPLSNGKFTWSSFRPNPTMTLIDRYLLSDSIVAKFSAASARRLDRITSDHFPISLTLGKEKWGPAPFKFN

Query:  NAWLSHHSFHNTVDIWWKNNLSQG-PGH---------KKELKQWNQSVFGNTKQQRYSLNSELSNLDMMEELGRLSEQDALRRTEIKAQLVLLSANEEIM
        N+ L    F      WW ++   G PG+          K +K+W  +        + +L  E+  +D +E  G +S     +R  +K+ L+ +  N+  +
Subjt:  NAWLSHHSFHNTVDIWWKNNLSQG-PGH---------KKELKQWNQSVFGNTKQQRYSLNSELSNLDMMEELGRLSEQDALRRTEIKAQLVLLSANEEIM

Query:  WRQKCK
        W Q+ +
Subjt:  WRQKCK

A0A5D3BKT8 LINE-1 retrotransposable element ORF2 protein | 1.7e-52 | 25.64
Query:  VPSYGGWIKIRNLPLDKWSIETFRKIGDDCGGYLETANKTLARMDMMEVCIKIKENSFGFIPAEIHLPSSSAS----PTIAKIDPFFMEDYNIGYIASIH
        +PSYGGW++ R +PL  W+  TF+ IG  CGG+L+ A +T+    +++  IK++ N  GF+PA I +  +        T+   +  ++ + N+     +H
Subjt:  VPSYGGWIKIRNLPLDKWSIETFRKIGDDCGGYLETANKTLARMDMMEVCIKIKENSFGFIPAEIHLPSSSAS----PTIAKIDPFFMEDYNIGYIASIH

Query:  GKIPATMVDREAAKEDDGHTRVV--------------PARLE---TIVEKVQTSYKLETQRSPNDEIYSVSNLTP---------------ALLLSDSDGG
        G         +AA E D H  +               P R     +I    + S    TQ   N+   S S   P                L+++D + G
Subjt:  GKIPATMVDREAAKEDDGHTRVV--------------PARLE---TIVEKVQTSYKLETQRSPNDEIYSVSNLTP---------------ALLLSDSDGG

Query:  VSSPCSTPLEQ---SPMIPRG-----SPPSVSPPSIGILFDEVND------------------------QQHQIDHPCPLRIENPDHRNTILSI------
          S  S  +     S + P G     S   ++     +    +ND                        Q+   DH   L+      +   LS+      
Subjt:  VSSPCSTPLEQ---SPMIPRG-----SPPSVSPPSIGILFDEVND------------------------QQHQIDHPCPLRIENPDHRNTILSI------

Query:  -----------------------------DEADQSLIDICVEE-----KDSDEFYTEAVHNDPATYLPLLFPWLAEHGMCIMPMPNRQKLSNTAKKKV--
                                     D A+   + + V+E     K +     E    D  T   +      +  + I    N  KLS      V  
Subjt:  -----------------------------DEADQSLIDICVEE-----KDSDEFYTEAVHNDPATYLPLLFPWLAEHGMCIMPMPNRQKLSNTAKKKV--

Query:  KKIIKSIWSSRNIAWTSLDSEGASGGIVILWNESSFDVKKIVEG-------------------VYGPNSSKERRLFWLELMDLQALCLPNWILGGDFNVT
              I S +N+        G  GGI++LW+++ F V  I  G                   VYGP    +R   W EL  LQ+LCLPNW++ GDFN+ 
Subjt:  KKIIKSIWSSRNIAWTSLDSEGASGGIVILWNESSFDVKKIVEG-------------------VYGPNSSKERRLFWLELMDLQALCLPNWILGGDFNVT

Query:  RWTWEKSTQSAPSRAMKKFNRFIEESDLLDIPLSNGKFTWSSFRPNPTMTLIDRYLLSDSIVAKFSAASARRLDRITSDHFPISLTLGKEKWGPAPFKFN
        RW  E + +S   R M  FN FI  ++L+D PL N  FTWS+ R NPT + +DR+LLS      F   ++R L+R  SDHFPI L   + KWGP PF+ N
Subjt:  RWTWEKSTQSAPSRAMKKFNRFIEESDLLDIPLSNGKFTWSSFRPNPTMTLIDRYLLSDSIVAKFSAASARRLDRITSDHFPISLTLGKEKWGPAPFKFN

Query:  NAWLSHHSFHNTVDIWWKNNLSQG-PGH---------KKELKQWNQSVFGNTKQQRYSLNSELSNLDMMEELGRLSEQDALRRTEIKAQLVLLSANEEIM
        N+ L    F      WW N+   G PG+          K +K+W  +        + +L  E+  +D +E  G +S     +R  +K+ L+ +  N+  +
Subjt:  NAWLSHHSFHNTVDIWWKNNLSQG-PGH---------KKELKQWNQSVFGNTKQQRYSLNSELSNLDMMEELGRLSEQDALRRTEIKAQLVLLSANEEIM

Query:  WRQKCK
        W Q+ +
Subjt:  WRQKCK

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein | 1.7e-52 | 25.64
Query:  VPSYGGWIKIRNLPLDKWSIETFRKIGDDCGGYLETANKTLARMDMMEVCIKIKENSFGFIPAEIHLPSSSAS----PTIAKIDPFFMEDYNIGYIASIH
        +PSYGGW++ R +PL  W+  TF+ IG  CGG+L+ A +T+    +++  IK++ N  GF+PA I +  +        T+   +  ++ + N+     +H
Subjt:  VPSYGGWIKIRNLPLDKWSIETFRKIGDDCGGYLETANKTLARMDMMEVCIKIKENSFGFIPAEIHLPSSSAS----PTIAKIDPFFMEDYNIGYIASIH

Query:  GKIPATMVDREAAKEDDGHTRVV--------------PARLE---TIVEKVQTSYKLETQRSPNDEIYSVSNLTP---------------ALLLSDSDGG
        G         +AA E D H  +               P R     +I    + S    TQ   N+   S S   P                L+++D D G
Subjt:  GKIPATMVDREAAKEDDGHTRVV--------------PARLE---TIVEKVQTSYKLETQRSPNDEIYSVSNLTP---------------ALLLSDSDGG

Query:  VSSPCSTPLEQ---SPMIPRG-----SPPSVSPPSIGILFDEVND------------------------QQHQIDHPCPLRIENPDHRNTILSI------
          S  S  +     S + P G     S   ++     +    +ND                        Q+   DH   L+      +   LS+      
Subjt:  VSSPCSTPLEQ---SPMIPRG-----SPPSVSPPSIGILFDEVND------------------------QQHQIDHPCPLRIENPDHRNTILSI------

Query:  -----------------------------DEADQSLIDICVEE-----KDSDEFYTEAVHNDPATYLPLLFPWLAEHGMCIMPMPNRQKLSNTAKKKV--
                                     D A+   + + V+E     K +     E    D  T   L      +  + I    N  KLS      V  
Subjt:  -----------------------------DEADQSLIDICVEE-----KDSDEFYTEAVHNDPATYLPLLFPWLAEHGMCIMPMPNRQKLSNTAKKKV--

Query:  KKIIKSIWSSRNIAWTSLDSEGASGGIVILWNESSFDVKKIVEG-------------------VYGPNSSKERRLFWLELMDLQALCLPNWILGGDFNVT
              I S +N+        G  GGI++LW++++F V  I  G                   VYGP    +R   W EL  LQ+LCLPNW++ GDFN+ 
Subjt:  KKIIKSIWSSRNIAWTSLDSEGASGGIVILWNESSFDVKKIVEG-------------------VYGPNSSKERRLFWLELMDLQALCLPNWILGGDFNVT

Query:  RWTWEKSTQSAPSRAMKKFNRFIEESDLLDIPLSNGKFTWSSFRPNPTMTLIDRYLLSDSIVAKFSAASARRLDRITSDHFPISLTLGKEKWGPAPFKFN
        RW  E + +S   R M  FN FI  ++L+D P  N  FTWS+ R NPT + +DR+LLS      F   ++R L+R  SDHFPI L   + KWGP PF+ N
Subjt:  RWTWEKSTQSAPSRAMKKFNRFIEESDLLDIPLSNGKFTWSSFRPNPTMTLIDRYLLSDSIVAKFSAASARRLDRITSDHFPISLTLGKEKWGPAPFKFN

Query:  NAWLSHHSFHNTVDIWWKNNLSQG-PGH---------KKELKQWNQSVFGNTKQQRYSLNSELSNLDMMEELGRLSEQDALRRTEIKAQLVLLSANEEIM
        N+ L    F      WW ++   G PG+          K +K+W  +        + +L  E+  +D +E  G +S     +R  +K+ L+ +  N+  +
Subjt:  NAWLSHHSFHNTVDIWWKNNLSQG-PGH---------KKELKQWNQSVFGNTKQQRYSLNSELSNLDMMEELGRLSEQDALRRTEIKAQLVLLSANEEIM

Query:  WRQKCK
        W Q+ +
Subjt:  WRQKCK

A0A6J1E2G6 uncharacterized protein LOC111025405 | 3.1e-57 | 39.09
Query:  IIKSIWSSRNIAWTSLDSEGASGGIVILWNESSFDVKKIVEGV--------------------YGPNSSKERRLFWLELMDLQALCLPNWILGGDFNVTR
        I+KS+WS+  I W++LD+ G + GI+ILWN+      +++EGV                    YGP++++   LFW EL+DL  LC  +WIL GDFNVTR
Subjt:  IIKSIWSSRNIAWTSLDSEGASGGIVILWNESSFDVKKIVEGV--------------------YGPNSSKERRLFWLELMDLQALCLPNWILGGDFNVTR

Query:  WTWEKSTQSAPSRAMKKFNRFIEESDLLDIPLSNGKFTWSSFRPNPTMTLIDRYLLSDSIVAKFSAASARRLDRITSDHFPISLTLGKEKWGPAPFKFNN
        W+WEKS     +++M  FN FIE+S L+D+PL+NG+ TWS    N + +LID +LL++  + K     A+R+ R TSDHFPI L  G+  WG  PF+F N
Subjt:  WTWEKSTQSAPSRAMKKFNRFIEESDLLDIPLSNGKFTWSSFRPNPTMTLIDRYLLSDSIVAKFSAASARRLDRITSDHFPISLTLGKEKWGPAPFKFNN

Query:  AWLSHHSFHNTVDIWWKNNLSQG-PGH---------KKELKQWNQSVFGNTKQQRYSLNSELSNLDMMEELGRLSEQDALRRTEIKAQLVLLSANEEIMW
         WLSH +F   ++ WW N    G PGH         K  +K W    F     Q+  L + +++LD +E    ++   +  R + K  L+ + A EE  W
Subjt:  AWLSHHSFHNTVDIWWKNNLSQG-PGH---------KKELKQWNQSVFGNTKQQRYSLNSELSNLDMMEELGRLSEQDALRRTEIKAQLVLLSANEEIMW

Query:  RQKCKLK
        RQ+CK K
Subjt:  RQKCKLK

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT1G43760.1 DNAse I-like superfamily protein | 1.3e-07 | 25
Query:  ILGGDFN---VTRWTWEKSTQSAPSRAMKKFNRFIEESDLLDIPLSNGKFTWSSFR-PNPTMTLIDRYLLSDSIVAKFSAASARRLDRITSDHFPISLTL
        IL GDF+    T   +     S P R +++F   + +SDL+DIP     +TWS+ +  NP +  +DR + +    + F +A A       SDH P  + L
Subjt:  ILGGDFN---VTRWTWEKSTQSAPSRAMKKFNRFIEESDLLDIPLSNGKFTWSSFR-PNPTMTLIDRYLLSDSIVAKFSAASARRLDRITSDHFPISLTL

Query:  -GKEKWGPAPFKFNNAWLSHHSFHNTVDIWWKNNLSQG----------PGHKKELKQWNQSVFGNTKQQRYSLNSELSNLDMMEELGRLSEQDALRRTE-
            K     F++ +   +H +F  ++ + W+  +  G             KK  K  N+  FGN    ++     L +L+ ++     +  D+L R E 
Subjt:  -GKEKWGPAPFKFNNAWLSHHSFHNTVDIWWKNNLSQG----------PGHKKELKQWNQSVFGNTKQQRYSLNSELSNLDMMEELGRLSEQDALRRTE-

Query:  -IKAQLVLLSANEEIMWRQKCKLK
          + +    +A  E  +RQK ++K
Subjt:  -IKAQLVLLSANEEIMWRQKCKLK


Sequences
CDS sequence
ATGAATAGCGAGCCCAAAGTTCCATCCTACGGAGGCTGGATTAAGATAAGAAACTTGCCCCTTGACAAATGGTCCATTGAGACCTTCAGGAAAATAGGCGATGATTGTGG
AGGATACCTGGAGACTGCCAATAAAACTCTAGCAAGAATGGACATGATGGAGGTTTGTATTAAGATCAAGGAGAATAGTTTCGGCTTTATCCCAGCGGAAATACACTTAC
CTTCTTCGTCCGCCAGCCCAACTATCGCCAAGATCGATCCCTTTTTCATGGAAGACTACAACATAGGTTATATTGCCAGCATTCATGGCAAAATACCAGCTACCATGGTG
GATCGAGAAGCTGCTAAGGAAGACGATGGCCACACCCGTGTCGTCCCTGCGCGTTTGGAAACTATTGTAGAAAAAGTACAGACCTCGTACAAGTTGGAAACCCAACGAAG
CCCGAATGATGAAATTTATTCAGTCTCGAACTTGACCCCAGCCCTTCTCTTATCAGATTCGGATGGAGGAGTCTCTTCCCCGTGTTCCACACCCTTGGAGCAATCTCCCA
TGATCCCTAGAGGCAGCCCCCCATCTGTTTCCCCACCATCTATCGGCATCCTTTTTGACGAAGTGAATGATCAACAACATCAGATAGACCATCCCTGTCCATTAAGAATC
GAGAATCCTGATCATAGAAACACAATTTTATCCATTGACGAGGCCGACCAATCTCTAATTGATATCTGCGTGGAAGAAAAAGACAGTGATGAATTTTACACAGAGGCTGT
GCACAACGATCCAGCGACATATCTTCCGTTATTATTTCCTTGGCTTGCCGAACATGGCATGTGCATTATGCCAATGCCCAACAGACAAAAGCTCTCCAATACAGCCAAGA
AGAAAGTCAAGAAGATTATTAAGTCCATTTGGAGTTCCCGGAATATTGCTTGGACCTCTTTAGACTCTGAAGGAGCATCTGGTGGCATTGTGATACTATGGAATGAATCT
TCCTTTGATGTCAAAAAGATTGTCGAAGGTGTTTACGGACCCAACTCCTCCAAGGAGAGACGGTTATTCTGGTTAGAGTTAATGGATCTTCAAGCCCTCTGTCTCCCAAA
TTGGATTTTGGGTGGTGATTTTAACGTGACTCGGTGGACTTGGGAGAAATCTACCCAATCGGCGCCATCTCGAGCTATGAAGAAATTCAATCGTTTTATAGAAGAATCTG
ATCTTCTAGACATTCCCCTGAGCAATGGAAAATTTACATGGTCTAGTTTTAGGCCTAATCCCACCATGACCCTCATTGATCGGTATCTCCTATCCGATAGCATTGTCGCC
AAATTCTCAGCTGCTTCTGCCCGTAGATTGGATAGAATTACGTCGGACCATTTCCCTATCAGTCTCACTTTGGGGAAGGAAAAATGGGGACCAGCCCCTTTCAAATTCAA
TAATGCCTGGCTTTCGCATCACTCCTTCCATAATACAGTCGATATTTGGTGGAAGAACAACCTATCACAAGGGCCAGGTCACAAAAAGGAATTAAAACAGTGGAATCAGT
CTGTTTTTGGTAATACTAAACAGCAGAGATATAGTTTGAATTCAGAGCTATCAAATCTGGACATGATGGAGGAACTCGGTCGACTATCTGAACAAGATGCCCTTAGAAGA
ACAGAGATAAAAGCCCAACTTGTTTTGCTATCAGCAAACGAAGAGATCATGTGGAGACAGAAATGTAAACTTAAATGA
mRNA sequence
ATGAATAGCGAGCCCAAAGTTCCATCCTACGGAGGCTGGATTAAGATAAGAAACTTGCCCCTTGACAAATGGTCCATTGAGACCTTCAGGAAAATAGGCGATGATTGTGG
AGGATACCTGGAGACTGCCAATAAAACTCTAGCAAGAATGGACATGATGGAGGTTTGTATTAAGATCAAGGAGAATAGTTTCGGCTTTATCCCAGCGGAAATACACTTAC
CTTCTTCGTCCGCCAGCCCAACTATCGCCAAGATCGATCCCTTTTTCATGGAAGACTACAACATAGGTTATATTGCCAGCATTCATGGCAAAATACCAGCTACCATGGTG
GATCGAGAAGCTGCTAAGGAAGACGATGGCCACACCCGTGTCGTCCCTGCGCGTTTGGAAACTATTGTAGAAAAAGTACAGACCTCGTACAAGTTGGAAACCCAACGAAG
CCCGAATGATGAAATTTATTCAGTCTCGAACTTGACCCCAGCCCTTCTCTTATCAGATTCGGATGGAGGAGTCTCTTCCCCGTGTTCCACACCCTTGGAGCAATCTCCCA
TGATCCCTAGAGGCAGCCCCCCATCTGTTTCCCCACCATCTATCGGCATCCTTTTTGACGAAGTGAATGATCAACAACATCAGATAGACCATCCCTGTCCATTAAGAATC
GAGAATCCTGATCATAGAAACACAATTTTATCCATTGACGAGGCCGACCAATCTCTAATTGATATCTGCGTGGAAGAAAAAGACAGTGATGAATTTTACACAGAGGCTGT
GCACAACGATCCAGCGACATATCTTCCGTTATTATTTCCTTGGCTTGCCGAACATGGCATGTGCATTATGCCAATGCCCAACAGACAAAAGCTCTCCAATACAGCCAAGA
AGAAAGTCAAGAAGATTATTAAGTCCATTTGGAGTTCCCGGAATATTGCTTGGACCTCTTTAGACTCTGAAGGAGCATCTGGTGGCATTGTGATACTATGGAATGAATCT
TCCTTTGATGTCAAAAAGATTGTCGAAGGTGTTTACGGACCCAACTCCTCCAAGGAGAGACGGTTATTCTGGTTAGAGTTAATGGATCTTCAAGCCCTCTGTCTCCCAAA
TTGGATTTTGGGTGGTGATTTTAACGTGACTCGGTGGACTTGGGAGAAATCTACCCAATCGGCGCCATCTCGAGCTATGAAGAAATTCAATCGTTTTATAGAAGAATCTG
ATCTTCTAGACATTCCCCTGAGCAATGGAAAATTTACATGGTCTAGTTTTAGGCCTAATCCCACCATGACCCTCATTGATCGGTATCTCCTATCCGATAGCATTGTCGCC
AAATTCTCAGCTGCTTCTGCCCGTAGATTGGATAGAATTACGTCGGACCATTTCCCTATCAGTCTCACTTTGGGGAAGGAAAAATGGGGACCAGCCCCTTTCAAATTCAA
TAATGCCTGGCTTTCGCATCACTCCTTCCATAATACAGTCGATATTTGGTGGAAGAACAACCTATCACAAGGGCCAGGTCACAAAAAGGAATTAAAACAGTGGAATCAGT
CTGTTTTTGGTAATACTAAACAGCAGAGATATAGTTTGAATTCAGAGCTATCAAATCTGGACATGATGGAGGAACTCGGTCGACTATCTGAACAAGATGCCCTTAGAAGA
ACAGAGATAAAAGCCCAACTTGTTTTGCTATCAGCAAACGAAGAGATCATGTGGAGACAGAAATGTAAACTTAAATGA
Protein sequence
MNSEPKVPSYGGWIKIRNLPLDKWSIETFRKIGDDCGGYLETANKTLARMDMMEVCIKIKENSFGFIPAEIHLPSSSASPTIAKIDPFFMEDYNIGYIASIHGKIPATMV
DREAAKEDDGHTRVVPARLETIVEKVQTSYKLETQRSPNDEIYSVSNLTPALLLSDSDGGVSSPCSTPLEQSPMIPRGSPPSVSPPSIGILFDEVNDQQHQIDHPCPLRI
ENPDHRNTILSIDEADQSLIDICVEEKDSDEFYTEAVHNDPATYLPLLFPWLAEHGMCIMPMPNRQKLSNTAKKKVKKIIKSIWSSRNIAWTSLDSEGASGGIVILWNES
SFDVKKIVEGVYGPNSSKERRLFWLELMDLQALCLPNWILGGDFNVTRWTWEKSTQSAPSRAMKKFNRFIEESDLLDIPLSNGKFTWSSFRPNPTMTLIDRYLLSDSIVA
KFSAASARRLDRITSDHFPISLTLGKEKWGPAPFKFNNAWLSHHSFHNTVDIWWKNNLSQGPGHKKELKQWNQSVFGNTKQQRYSLNSELSNLDMMEELGRLSEQDALRR
TEIKAQLVLLSANEEIMWRQKCKLK