; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036174 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036174
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr3:40995286..40997485
RNA-Seq ExpressionLag0036174
SyntenyLag0036174
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]2.1e-14138.16Show/hide
Query:  RSREEWLKWGDKNTKWFHSEASHRKKKNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFASSRPATVSIEENTRNISPKVTNDHKVDLNRPFSKIDIENA
        RSR +WLK GDKNTK+FHS+AS R++KN I  + +  G+WV+    IE     +F+ LF SS P+   I E  + + PKV+ +    L  PF+  DI  A
Subjt:  RSREEWLKWGDKNTKWFHSEASHRKKKNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFASSRPATVSIEENTRNISPKVTNDHKVDLNRPFSKIDIENA

Query:  LKDMRPTKAPGDD-----------------ISRVCLYILNNDGDLNPINSTLISLIPKIPNPKRMEDFWPISLCNVVYKMISKAIANRMKRILDQIISPT
        L +M PTKAPG D                 +++ CL+ILN  G L+ +N T I+LIPK+  P+++ +F PISLCNVVY++++KAIANR+K IL+ IISP 
Subjt:  LKDMRPTKAPGDD-----------------ISRVCLYILNNDGDLNPINSTLISLIPKIPNPKRMEDFWPISLCNVVYKMISKAIANRMKRILDQIISPT

Query:  QSSFIPGRLITDNVLLGFKCIHVINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMKCINSVSYSILINGSPTSEFRPGRGLRQG
        QS+FIP RLITDNV++G++C+H I   +  + G +A+KLD+SKAYDRVEW+F+ + M  LGF   WI  +M CI +  +S+LING+P    +P RGLRQG
Subjt:  QSSFIPGRLITDNVLLGFKCIHVINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMKCINSVSYSILINGSPTSEFRPGRGLRQG

Query:  DPLSPYLFLLCPE--------------------------------DDSLILLKANETNCVSIKEVLRRYEEVSGQAINLDKSACMVSKNIPKAKAEELSN
         PLSPYLF+LC E                                DDSL+  KA+  +C  +K +   Y + SGQ  N +KS+   S      +   + +
Subjt:  DPLSPYLFLLCPE--------------------------------DDSLILLKANETNCVSIKEVLRRYEEVSGQAINLDKSACMVSKNIPKAKAEELSN

Query:  LLGIKLSESLGYYLGMPSQTGRRKHKVF-------------W--KMENSPGLERRVVLYGRERDSYQSS---SPIG------------------DKRKAH
        +  +K+      YLG+P   GR K   F             W  K+ ++ G E  +    +   +Y  S    P G                  DK   H
Subjt:  LLGIKLSESLGYYLGMPSQTGRRKHKVF-------------W--KMENSPGLERRVVLYGRERDSYQSS---SPIG------------------DKRKAH

Query:  WVSWKRLCASKGEGGLGFREISLFNQAMLAKQRWRLLNNQTSLLFKVFRGRYFAHNNFMEASLGNNPSLTWRSILWGRELFKKGMKWRVGNDKSIIIKDD
        W  W  +  +K  GGLGFR++  FNQA++AKQ WRL+    SL+ +V + RY+ ++ F  A +G+NPS  WRSILWG ++ KKG++WR+G+ K +++  D
Subjt:  WVSWKRLCASKGEGGLGFREISLFNQAMLAKQRWRLLNNQTSLLFKVFRGRYFAHNNFMEASLGNNPSLTWRSILWGRELFKKGMKWRVGNDKSIIIKDD

Query:  PWLPKQGCYKPVWIKKEFQGDRVASLIDGSGSWKEEVIRDIFLPSNSEAILDIPLGSKEDIDKIIWRSDKKGMFSVKSAYHLAYNSSLADKSSSSDPTPV
         W+P+   ++P+  K       VA LID    W+ + +   F+  + EAIL I L S ++ D+++W  DKKG +SVKS Y LA N +  ++  SS+ +  
Subjt:  PWLPKQGCYKPVWIKKEFQGDRVASLIDGSGSWKEEVIRDIFLPSNSEAILDIPLGSKEDIDKIIWRSDKKGMFSVKSAYHLAYNSSLADKSSSSDPTPV

Query:  ASFWKKNIGKWMQPLEKK
        +  WK     WM  L +K
Subjt:  ASFWKKNIGKWMQPLEKK

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]2.0e-13942.11Show/hide
Query:  RSREEWLKWGDKNTKWFHSEASHRKKKNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFASSRPATVSIEENTRNISPKVTNDHKVDLNRPFSKIDIENA
        R++  WLK GD+NTK+FH++AS R+K+N+I  I +  G W +    I   A++YF  +++SS P+   IEE T  I  KVT +    L R F+K ++  A
Subjt:  RSREEWLKWGDKNTKWFHSEASHRKKKNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFASSRPATVSIEENTRNISPKVTNDHKVDLNRPFSKIDIENA

Query:  LKDMRPTKAPGDD-----------------ISRVCLYILNNDGDLNPINSTLISLIPKIPNPKRMEDFWPISLCNVVYKMISKAIANRMKRILDQIISPT
        LK + P KAPG D                 ++ + L +LN++  +  +N T ISLIPK  NPKRM DF PISLCNVVYK+ISK +ANR+K +L  IIS  
Subjt:  LKDMRPTKAPGDD-----------------ISRVCLYILNNDGDLNPINSTLISLIPKIPNPKRMEDFWPISLCNVVYKMISKAIANRMKRILDQIISPT

Query:  QSSFIPGRLITDNVLLGFKCIHVINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMKCINSVSYSILINGSPTSEFRPGRGLRQG
        QS+F   RLITDNVL+ F+ +H ++ K + K G+MAIKLDMSKA+DRVEW FI +VME++GF + W   VM+CI SVSYSILING       P RGLRQG
Subjt:  QSSFIPGRLITDNVLLGFKCIHVINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMKCINSVSYSILINGSPTSEFRPGRGLRQG

Query:  DPLSPYLFLLCPE---------------------------------DDSLILLKANETNCVSIKEVLRRYEEVSGQAINLDKSACMVSKNIPKAKAEELS
        DPLSP LFLLC E                                 DDS++  KA    C  ++ +L +YEE SGQ IN DKS+   S N  +   +E+ 
Subjt:  DPLSPYLFLLCPE---------------------------------DDSLILLKANETNCVSIKEVLRRYEEVSGQAINLDKSACMVSKNIPKAKAEELS

Query:  NLLGIKLSESLGYYLGMPSQTGRRKHKVF-------------W--KMENSPGLERRVVLYGRERDSYQSSS---PIG------------------DKRKA
        N+LG   +     YLG+PS  GR K +VF             W  K+ +  G E  +    +   +Y  S    P G                   + K 
Subjt:  NLLGIKLSESLGYYLGMPSQTGRRKHKVF-------------W--KMENSPGLERRVVLYGRERDSYQSSS---PIG------------------DKRKA

Query:  HWVSWKRLCASKGEGGLGFREISLFNQAMLAKQRWRLLNNQTSLLFKVFRGRYFAHNNFMEASLGNNPSLTWRSILWGRELFKKGMKWRVGNDKSIIIKD
         W+SWKR+C SK  GGLGFR +  FN AMLAKQ WR+L N  SL+ +V + RYF   + + A LG++PS +WRSI    E+ ++G +WRVGN K I I +
Subjt:  HWVSWKRLCASKGEGGLGFREISLFNQAMLAKQRWRLLNNQTSLLFKVFRGRYFAHNNFMEASLGNNPSLTWRSILWGRELFKKGMKWRVGNDKSIIIKD

Query:  DPWLPKQGCYKPVWIK-KEFQGDRVASLIDGSGS-WKEEVIRDIFLPSNSEAILDIPLGSKEDIDKIIWRSDKKGMFSVKSAYHLAYN-SSLADKSSSSD
        D WLP    YK +  +   F+   V+SLID     WK E +R IFLP   E IL IPL      DK+IW  +KKG FSVKSAYH+A++     ++   S+
Subjt:  DPWLPKQGCYKPVWIK-KEFQGDRVASLIDGSGS-WKEEVIRDIFLPSNSEAILDIPLGSKEDIDKIIWRSDKKGMFSVKSAYHLAYN-SSLADKSSSSD

Query:  PTPVASFWKK
          P    WKK
Subjt:  PTPVASFWKK

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]2.8e-14641.85Show/hide
Query:  RSREEWLKWGDKNTKWFHSEASHRKKKNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFASSRPATVSIEENTRNISPKVTNDHKVDLNRPFSKIDIENA
        RSR +WL  GD+NTK+FH++AS R+++N+I+ I + NG+W +    I  VA++YF+ +++SS P  +S  E    I   VT +    L + F++ +IE A
Subjt:  RSREEWLKWGDKNTKWFHSEASHRKKKNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFASSRPATVSIEENTRNISPKVTNDHKVDLNRPFSKIDIENA

Query:  LKDMRPTKAP-----------------GDDISRVCLYILNNDGDLNPINSTLISLIPKIPNPKRMEDFWPISLCNVVYKMISKAIANRMKRILDQIISPT
        L  M PTKAP                 G+DI  + L +LN++  +  IN T I+L+PKI NP +M DF PISLCNVVYK+ISK +ANR+K IL QIIS  
Subjt:  LKDMRPTKAP-----------------GDDISRVCLYILNNDGDLNPINSTLISLIPKIPNPKRMEDFWPISLCNVVYKMISKAIANRMKRILDQIISPT

Query:  QSSFIPGRLITDNVLLGFKCIHVINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMKCINSVSYSILINGSPTSEFRPGRGLRQG
        QS+F+ GRLITDNVL+ F+ +H +  K+  K G+ AIKLDMSKAYDRVEW FI++VMEK+GF + WIK VM CI SVSYSIL+NG       P RGLRQG
Subjt:  QSSFIPGRLITDNVLLGFKCIHVINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMKCINSVSYSILINGSPTSEFRPGRGLRQG

Query:  DPLSPYLFLLCPE---------------------------------DDSLILLKANETNCVSIKEVLRRYEEVSGQAINLDKSACMVSKNIPKAKAEELS
        DP+SPY+FLLC +                                 DDSL+  KAN   C ++ ++L+ YE+ SGQ IN+DKS+   S N P  K  E+ 
Subjt:  DPLSPYLFLLCPE---------------------------------DDSLILLKANETNCVSIKEVLRRYEEVSGQAINLDKSACMVSKNIPKAKAEELS

Query:  NLLGIKLSESLGYYLGMPSQTGRRKHKVFWKMEN------SPGLERRVVLYGRE--------------RDSYQSSSPI----------------GDKRKA
         +LG         YLG+PS  G+ K ++F +++       S   E+ + + GRE                 +Q    +                G + K 
Subjt:  NLLGIKLSESLGYYLGMPSQTGRRKHKVFWKMEN------SPGLERRVVLYGRE--------------RDSYQSSSPI----------------GDKRKA

Query:  HWVSWKRLCASKGEGGLGFREISLFNQAMLAKQRWRLLNNQTSLLFKVFRGRYFAHNNFMEASLGNNPSLTWRSILWGRELFKKGMKWRVGNDKSIIIKD
         WVSWK+LC +K  GG+GFR +  FN AMLAKQ WRL++N  SL+ ++++ RY+ H +  +A LG +PS TWRSI  G E+ ++G +WRVGN + I+I +
Subjt:  HWVSWKRLCASKGEGGLGFREISLFNQAMLAKQRWRLLNNQTSLLFKVFRGRYFAHNNFMEASLGNNPSLTWRSILWGRELFKKGMKWRVGNDKSIIIKD

Query:  DPWLPKQGCYKPVWIKKEFQG-DRVASLIDGSGS-WKEEVIRDIFLPSNSEAILDIPLGSKEDIDKIIWRSDKKGMFSVKSAYHLA---YNSSLADKSSS
        D WLP    YK +   K F    RV++LID     WK++V+RD+FLP  +  IL IPL      D+IIW  ++KG FSVKSAY++A    ++    +SSS
Subjt:  DPWLPKQGCYKPVWIKKEFQG-DRVASLIDGSGS-WKEEVIRDIFLPSNSEAILDIPLGSKEDIDKIIWRSDKKGMFSVKSAYHLA---YNSSLADKSSS

Query:  SDPTPVASFWKK
         D   +   W+K
Subjt:  SDPTPVASFWKK

XP_030936391.1 uncharacterized protein LOC115961572 [Quercus lobata]1.4e-14040.58Show/hide
Query:  RSREEWLKWGDKNTKWFHSEASHRKKKNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFASSRPATVSIEENTRNISPKVTNDHKVDLNRPFSKIDIENA
        RSR  W + GD+NT +FH++AS R +KN ID I +  G W E    IE+VA+ YF+ LF SS+P   S  +    + PKVT D  V+L R ++  ++  A
Subjt:  RSREEWLKWGDKNTKWFHSEASHRKKKNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFASSRPATVSIEENTRNISPKVTNDHKVDLNRPFSKIDIENA

Query:  LKDMRPTKAPGDD-----------------ISRVCLYILNNDGDLNPINSTLISLIPKIPNPKRMEDFWPISLCNVVYKMISKAIANRMKRILDQIISPT
        LK M P KAPG D                 ++   L  LN+       N T I LIPKI  PK + D+ PISLCNV YK+ SKAIANR+K+ L  IIS T
Subjt:  LKDMRPTKAPGDD-----------------ISRVCLYILNNDGDLNPINSTLISLIPKIPNPKRMEDFWPISLCNVVYKMISKAIANRMKRILDQIISPT

Query:  QSSFIPGRLITDNVLLGFKCIHVINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMKCINSVSYSILINGSPTSEFRPGRGLRQG
        QS+F+ GRLITDNVL+ F+ +H I+RK+  K G MAIKLDMSKAYDRVEW F+ ++MEKLGF+ N    +M+CI +VSY+I ING P     P RG+RQG
Subjt:  QSSFIPGRLITDNVLLGFKCIHVINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMKCINSVSYSILINGSPTSEFRPGRGLRQG

Query:  DPLSPYLFLLCPE---------------------------------DDSLILLKANETNCVSIKEVLRRYEEVSGQAINLDKSACMVSKNIPKAKAEELS
        DPLSPYLFLLC E                                 DDSLI  KA    C +++ VL  YE+ SGQ +N  K++   S N PK   EE+ 
Subjt:  DPLSPYLFLLCPE---------------------------------DDSLILLKANETNCVSIKEVLRRYEEVSGQAINLDKSACMVSKNIPKAKAEELS

Query:  NLLGIKLSESLGYYLGMPSQTGRRKHKVF-------------W--KMENSPGLERRVVLYGRERDSYQSS---------------------SPIGDKRKA
           G ++ +    YLG+PS  G+ K   F             W  K+ +  G E  +        +Y  S                       + ++ + 
Subjt:  NLLGIKLSESLGYYLGMPSQTGRRKHKVF-------------W--KMENSPGLERRVVLYGRERDSYQSS---------------------SPIGDKRKA

Query:  HWVSWKRLCASKGEGGLGFREISLFNQAMLAKQRWRLLNNQTSLLFKVFRGRYFAHNNFMEASLGNNPSLTWRSILWGRELFKKGMKWRVGNDKSIIIKD
         W+SW ++C SK  GG+GF+ + LFN A+LAKQ WRL   Q SL+++V + +YF    F+ ASLGNNPS +WRSI+  + L K+G+KWRVGN  SI + +
Subjt:  HWVSWKRLCASKGEGGLGFREISLFNQAMLAKQRWRLLNNQTSLLFKVFRGRYFAHNNFMEASLGNNPSLTWRSILWGRELFKKGMKWRVGNDKSIIIKD

Query:  DPWLPKQGCYKPVWIKKEFQGD-RVASLIDG-SGSWKEEVIRDIFLPSNSEAILDIPLGSKEDIDKIIWRSDKKGMFSVKSAYHLAYN-SSLADKSSSSD
        D WLP    +K +  +     D RVA L+D   G W+ EVI  +FLP  +++I  IP+ ++   DK+IW     G+F+V+SAY LA N  S+ +K + SD
Subjt:  DPWLPKQGCYKPVWIKKEFQGD-RVASLIDG-SGSWKEEVIRDIFLPSNSEAILDIPLGSKEDIDKIIWRSDKKGMFSVKSAYHLAYN-SSLADKSSSSD

Query:  PTPVASFWKKNIGKWMQPLEKK
         + + SFW++    W  P+  K
Subjt:  PTPVASFWKKNIGKWMQPLEKK

XP_030946812.1 uncharacterized protein LOC115971195 [Quercus lobata]6.3e-13840.37Show/hide
Query:  RSREEWLKWGDKNTKWFHSEASHRKKKNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFASSRPATVSIEENTRNISPKVTNDHKVDLNRPFSKIDIENA
        RSR +WLK GD NT +FHS A+ R K+N I K+   +G  V G   I +  + YFK +FAS+ P+  + ++  + I  KVT     DL R F+  ++E A
Subjt:  RSREEWLKWGDKNTKWFHSEASHRKKKNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFASSRPATVSIEENTRNISPKVTNDHKVDLNRPFSKIDIENA

Query:  LKDMRPTKAP-----------------GDDISRVCLYILNNDGDLNPINSTLISLIPKIPNPKRMEDFWPISLCNVVYKMISKAIANRMKRILDQIISPT
        LK M+P  AP                 G D+    L ILN+      +N T ISLIPKI +P++  DF PISLCNV+YK++SK IANR+K++L +++S +
Subjt:  LKDMRPTKAP-----------------GDDISRVCLYILNNDGDLNPINSTLISLIPKIPNPKRMEDFWPISLCNVVYKMISKAIANRMKRILDQIISPT

Query:  QSSFIPGRLITDNVLLGFKCIHVINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMKCINSVSYSILINGSPTSEFRPGRGLRQG
        QS+F+  RLI+DN+L+ F+ +H +  K   K G+MAIKLDMSKAYDRVEW+F+ +VMEKLGF++ WI  V  CI SVS+S+L+NG P   F P RGLRQG
Subjt:  QSSFIPGRLITDNVLLGFKCIHVINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMKCINSVSYSILINGSPTSEFRPGRGLRQG

Query:  DPLSPYLFLLCPE---------------------------------DDSLILLKANETNCVSIKEVLRRYEEVSGQAINLDKSACMVSKNIPKAKAEELS
        DPLSPYLFLLC E                                 DDSL+  +AN     SI E+L++YEE SGQ IN +K+    S N      EE+ 
Subjt:  DPLSPYLFLLCPE---------------------------------DDSLILLKANETNCVSIKEVLRRYEEVSGQAINLDKSACMVSKNIPKAKAEELS

Query:  NLLGIKLSESLGYYLGMPSQTGRRK--------HKVFWKMENSPGLERRVVLYGRE---RDSYQSSSPI---------------------------GDKR
         LLG+  + +   YLG+PS  GR K         +V+ KM+     ER +   GRE   +   Q+                               G+ R
Subjt:  NLLGIKLSESLGYYLGMPSQTGRRK--------HKVFWKMENSPGLERRVVLYGRE---RDSYQSSSPI---------------------------GDKR

Query:  KAHWVSWKRLCASKGEGGLGFREISLFNQAMLAKQRWRLLNNQTSLLFKVFRGRYFAHNNFMEASLGNNPSLTWRSILWGRELFKKGMKWRVGNDKSIII
        K HWV WK+LC SK  GGLGF++I LFN AML KQ WRL++N+ SL +KVF+ +YF + + ++  +  N S  W+SIL  R + + G KWR+G+  S+ I
Subjt:  KAHWVSWKRLCASKGEGGLGFREISLFNQAMLAKQRWRLLNNQTSLLFKVFRGRYFAHNNFMEASLGNNPSLTWRSILWGRELFKKGMKWRVGNDKSIII

Query:  KDDPWLPKQGCYKPVWIKKEFQGD-RVASLIDGSG-SWKEEVIRDIFLPSNSEAILDIPLGSKEDIDKIIWRSDKKGMFSVKSAYHLAYNSSLADKSSSS
        + D WLP     + V  +K F  + RV +LID     W E+ IR+ FLP  +EAIL +PL      D++IW     G ++ KSAY L   ++ A   SSS
Subjt:  KDDPWLPKQGCYKPVWIKKEFQGD-RVASLIDGSG-SWKEEVIRDIFLPSNSEAILDIPLGSKEDIDKIIWRSDKKGMFSVKSAYHLAYNSSLADKSSSS

Query:  DPTPVASFWKK
        +      FW++
Subjt:  DPTPVASFWKK

TrEMBL top hitse value%identityAlignment
A0A2N9FFZ2 Reverse transcriptase domain-containing protein7.6e-13738.47Show/hide
Query:  RSREEWLKWGDKNTKWFHSEASHRKKKNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFASSRPATVSIEENTRNISPKVTNDHKVDLNRPFSKIDIENA
        RSR EWL  GD+NT++FH  A+ RK+KN + ++   +G W      +  + + Y+K LF ++ P    +E+   +I   VT +    L   F+ +++E A
Subjt:  RSREEWLKWGDKNTKWFHSEASHRKKKNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFASSRPATVSIEENTRNISPKVTNDHKVDLNRPFSKIDIENA

Query:  LKDMRPTKAPGD-----------------DISRVCLYILNNDGDLNPINSTLISLIPKIPNPKRMEDFWPISLCNVVYKMISKAIANRMKRILDQIISPT
        LK M P KAPG                  D++   L  LN+   L  IN T I+LIPK+ NP+ + +F PISLCNV+YK+ISK +ANR+K +L  I+  +
Subjt:  LKDMRPTKAPGD-----------------DISRVCLYILNNDGDLNPINSTLISLIPKIPNPKRMEDFWPISLCNVVYKMISKAIANRMKRILDQIISPT

Query:  QSSFIPGRLITDNVLLGFKCIHVINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMKCINSVSYSILINGSPTSEFRPGRGLRQG
        QS+FIPGRLITDN+L+ F+ +H +  +++ K G MA+KLDMSKAYDRVEW +++ VMEK+GF + W+  +M+CI++VSYSIL+NG P    +P RGLRQG
Subjt:  QSSFIPGRLITDNVLLGFKCIHVINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMKCINSVSYSILINGSPTSEFRPGRGLRQG

Query:  DPLSPYLFLLCPE---------------------------------DDSLILLKANETNCVSIKEVLRRYEEVSGQAINLDKSACMVSKNIPKAKAEELS
        DPLSPYLFLLC E                                 DDSL+  KA   + + I+ +L +YE+ SGQ +N  K+    SK+ P A   ++ 
Subjt:  DPLSPYLFLLCPE---------------------------------DDSLILLKANETNCVSIKEVLRRYEEVSGQAINLDKSACMVSKNIPKAKAEELS

Query:  NLLGIKLSESLGYYLGMPSQTGRRKHKVF-------------W--KMENSPGLERRVVLYGRERDSYQSS---------------------SPIGDKRKA
        N+LG+   +    YLG+PS  GR K+  F             W  K+ +  G E  +    +   +Y  S                        GDK K 
Subjt:  NLLGIKLSESLGYYLGMPSQTGRRKHKVF-------------W--KMENSPGLERRVVLYGRERDSYQSS---------------------SPIGDKRKA

Query:  HWVSWKRLCASKGEGGLGFREISLFNQAMLAKQRWRLLNNQTSLLFKVFRGRYFAHNNFMEASLGNNPSLTWRSILWGRELFKKGMKWRVGNDKSIIIKD
        HW+ W+ LC SK  GG+G R++  FN+A+LAKQ WRLL+N +SL  KVF+ +YF H + +EA   +  S  W+SI+  R+L  KG  WRVG    I I  
Subjt:  HWVSWKRLCASKGEGGLGFREISLFNQAMLAKQRWRLLNNQTSLLFKVFRGRYFAHNNFMEASLGNNPSLTWRSILWGRELFKKGMKWRVGNDKSIIIKD

Query:  DPWLPKQGCYKPVW-IKKEFQGDRVASLIDGS-GSWKEEVIRDIFLPSNSEAILDIPLGSKEDIDKIIWRSDKKGMFSVKSAYHLAYNSSLADKSSSSDP
        D WLP    +  V           V  LID    SWK E+++ IFLP  +  IL IPL  ++ ID ++W++ K G ++V+S YHL  N    D+ SSSD 
Subjt:  DPWLPKQGCYKPVW-IKKEFQGDRVASLIDGS-GSWKEEVIRDIFLPSNSEAILDIPLGSKEDIDKIIWRSDKKGMFSVKSAYHLAYNSSLADKSSSSDP

Query:  TPVASFW
        T +   W
Subjt:  TPVASFW

A0A2N9FN80 Uncharacterized protein8.9e-13838.06Show/hide
Query:  RSREEWLKWGDKNTKWFHSEASHRKKKNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFASSRPATVSIEENTRNISPKVTNDHKVDLNRPFSKIDIENA
        RSR EWLK GD+NT++FH  A+ R+++N + ++ +  G W      +  + ++Y+  LF +  P    IE+   +I P VT     +L R F+  ++  A
Subjt:  RSREEWLKWGDKNTKWFHSEASHRKKKNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFASSRPATVSIEENTRNISPKVTNDHKVDLNRPFSKIDIENA

Query:  LKDMRPTKAP-----------------GDDISRVCLYILNNDGDLNPINSTLISLIPKIPNPKRMEDFWPISLCNVVYKMISKAIANRMKRILDQIISPT
        LK M P KAP                 GD++++  L  LN    L   N T I+LIPKI NP+ + DF PISLCNV+YK+ISK +ANR+K IL  I+S +
Subjt:  LKDMRPTKAP-----------------GDDISRVCLYILNNDGDLNPINSTLISLIPKIPNPKRMEDFWPISLCNVVYKMISKAIANRMKRILDQIISPT

Query:  QSSFIPGRLITDNVLLGFKCIHVINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMKCINSVSYSILINGSPTSEFRPGRGLRQG
        QS+F+PGRLITDN+L+ F+ +H +  +R+++   MA+KLDMSKAYD+VEW +++RVME++GF   W+  +M+CI+SVSYSIL+NG P    +P RGLRQG
Subjt:  QSSFIPGRLITDNVLLGFKCIHVINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMKCINSVSYSILINGSPTSEFRPGRGLRQG

Query:  DPLSPYLFLLCPE---------------------------------DDSLILLKANETNCVSIKEVLRRYEEVSGQAINLDKSACMVSKNIPKAKAEELS
        DPLSPYLFLLC E                                 DDSL+  +A   +   I+ +L +YE  SGQ IN  K+    SK+ P +    + 
Subjt:  DPLSPYLFLLCPE---------------------------------DDSLILLKANETNCVSIKEVLRRYEEVSGQAINLDKSACMVSKNIPKAKAEELS

Query:  NLLGIKLSESLGYYLGMPSQTGRRKHKVF-------------W--KMENSPGLERRVVLYGRERDSYQSS---------------------SPIGDKRKA
        N+LG+ + +    YLG+PS  GR K+  F             W  K+ +  G E  +    +   +Y  S                        GDK K 
Subjt:  NLLGIKLSESLGYYLGMPSQTGRRKHKVF-------------W--KMENSPGLERRVVLYGRERDSYQSS---------------------SPIGDKRKA

Query:  HWVSWKRLCASKGEGGLGFREISLFNQAMLAKQRWRLLNNQTSLLFKVFRGRYFAHNNFMEASLGNNPSLTWRSILWGRELFKKGMKWRVGNDKSIIIKD
        HW+ W  LC SK  GG+GFRE+  FN+A+LAKQ WRLL+N +SL +KVF+ +YF   + +EA L +  S  W+SI+  R+L KKG  WRVG+  +I I  
Subjt:  HWVSWKRLCASKGEGGLGFREISLFNQAMLAKQRWRLLNNQTSLLFKVFRGRYFAHNNFMEASLGNNPSLTWRSILWGRELFKKGMKWRVGNDKSIIIKD

Query:  DPWLPKQGCYKPVWIKKEFQG-DRVASLID-GSGSWKEEVIRDIFLPSNSEAILDIPLGSKEDIDKIIWRSDKKGMFSVKSAYHLAYNSSLADKSSSSDP
        D WLP    +     +   Q   +VA LID  S +WKEE+IR+IFLP ++ AIL +PL  +   D ++W   K G+++V+S YH          +  SD 
Subjt:  DPWLPKQGCYKPVWIKKEFQG-DRVASLID-GSGSWKEEVIRDIFLPSNSEAILDIPLGSKEDIDKIIWRSDKKGMFSVKSAYHLAYNSSLADKSSSSDP

Query:  TPVASFWKKNIGKWMQPLEKKFV---CGESSKT
        T ++  W       + P  + F+   C ES  T
Subjt:  TPVASFWKKNIGKWMQPLEKKFV---CGESSKT

A0A2N9IP69 Reverse transcriptase domain-containing protein4.7e-13939.63Show/hide
Query:  RSREEWLKWGDKNTKWFHSEASHRKKKNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFASSRPATVSIEENTRNISPKVTNDHKVDLNRPFSKIDIENA
        RSR  WLK GD+NTK+FHS A+HR+++N +  I N  GD +     I    + Y++ LF ++    V  E     I+  V+ +    L  PF++ +I  A
Subjt:  RSREEWLKWGDKNTKWFHSEASHRKKKNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFASSRPATVSIEENTRNISPKVTNDHKVDLNRPFSKIDIENA

Query:  LKDMRPTKAP-----------------GDDISRVCLYILNNDGDLNPINSTLISLIPKIPNPKRMEDFWPISLCNVVYKMISKAIANRMKRILDQIISPT
        +K M P KAP                 G D+ +  L  LN+   L  IN T ++LIPK+ NP+++ D+ PISLCNV+Y++ISK +ANR K++L  IIS T
Subjt:  LKDMRPTKAP-----------------GDDISRVCLYILNNDGDLNPINSTLISLIPKIPNPKRMEDFWPISLCNVVYKMISKAIANRMKRILDQIISPT

Query:  QSSFIPGRLITDNVLLGFKCIHVINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMKCINSVSYSILINGSPTSEFRPGRGLRQG
        QS+F+PGRLITDN+L+ F+ +H +N +RS K G MA+KLDMSKAYDRVEW+F+++VM K+GF  +W+  +M+CI++VSYS+LING PT    P RGLRQG
Subjt:  QSSFIPGRLITDNVLLGFKCIHVINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMKCINSVSYSILINGSPTSEFRPGRGLRQG

Query:  DPLSPYLFLLCPE---------------------------------DDSLILLKANETNCVSIKEVLRRYEEVSGQAINLDKSACMVSKNIPKAKAEELS
        DP+SPYLFLLC E                                 DDSL+  +A +  C  I+EVL+ YE VSGQ +N  K+    S+N P+A  +++ 
Subjt:  DPLSPYLFLLCPE---------------------------------DDSLILLKANETNCVSIKEVLRRYEEVSGQAINLDKSACMVSKNIPKAKAEELS

Query:  NLLGIKLSESLGYYLGMPSQTGRRKHKVF-------------W--KMENSPGLERRVVLYGRERDSYQSSS---PIG------------------DKRKA
        ++LG+   +    YL +PS  G++K   F             W  K+ +  G E  +    +   SY  +    P+G                  + RK 
Subjt:  NLLGIKLSESLGYYLGMPSQTGRRKHKVF-------------W--KMENSPGLERRVVLYGRERDSYQSSS---PIG------------------DKRKA

Query:  HWVSWKRLCASKGEGGLGFREISLFNQAMLAKQRWRLLNNQTSLLFKVFRGRYFAHNNFMEASLGNNPSLTWRSILWGRELFKKGMKWRVGNDKSIIIKD
        HW+ W++LC  KG GGLGFRE+  FN A+LAKQ WRL++ + SLLFKVF  ++F   N MEAS  N  S  WRSIL  ++L   G  WRVG+ K I IKD
Subjt:  HWVSWKRLCASKGEGGLGFREISLFNQAMLAKQRWRLLNNQTSLLFKVFRGRYFAHNNFMEASLGNNPSLTWRSILWGRELFKKGMKWRVGNDKSIIIKD

Query:  DPWLPKQGCYKPVWIKKEFQGD-RVASLIDGS-GSWKEEVIRDIFLPSNSEAILDIPLGSKEDIDKIIWRSDKKGMFSVKSAYHLAYNSSLADKSSSSDP
          WL ++G  + +        D +VA LI  S  +W E  IR +FLP +S+AIL IPL  +   DK+ W +   G +SV+S Y L     +    +SS+ 
Subjt:  DPWLPKQGCYKPVWIKKEFQGD-RVASLIDGS-GSWKEEVIRDIFLPSNSEAILDIPLGSKEDIDKIIWRSDKKGMFSVKSAYHLAYNSSLADKSSSSDP

Query:  TPVASFWKK
            + WK+
Subjt:  TPVASFWKK

A0A7N2L6Z9 Reverse transcriptase domain-containing protein1.2e-13941.7Show/hide
Query:  RSREEWLKWGDKNTKWFHSEASHRKKKNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFASSRPATVSIEENTRNISPKVTNDHKVDLNRPFSKIDIENA
        RS+  WLK GD+NTK+FH+ AS R+K+N+I  + +  G W E    I + A+ YF+ ++++S P+ V  +E T  I   +T +   +L+R F++ +I  A
Subjt:  RSREEWLKWGDKNTKWFHSEASHRKKKNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFASSRPATVSIEENTRNISPKVTNDHKVDLNRPFSKIDIENA

Query:  LKDMRPTKAPGDD-----------------ISRVCLYILNNDGDLNPINSTLISLIPKIPNPKRMEDFWPISLCNVVYKMISKAIANRMKRILDQIISPT
        LK + PTK+PG D                 +S + L +LN    L+ IN T I LIPK  NPKRM DF PISLCNV+YK+ISK +ANR+K  L  II+  
Subjt:  LKDMRPTKAPGDD-----------------ISRVCLYILNNDGDLNPINSTLISLIPKIPNPKRMEDFWPISLCNVVYKMISKAIANRMKRILDQIISPT

Query:  QSSFIPGRLITDNVLLGFKCIHVINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMKCINSVSYSILINGSPTSEFRPGRGLRQG
        QS+F   RLITDNVL+ ++ +H +  K+  K  +MA KLDMSKA+DRVEW FI RVM K+GF + WI  +M+CI+SVSYS++ING       P RGLRQG
Subjt:  QSSFIPGRLITDNVLLGFKCIHVINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMKCINSVSYSILINGSPTSEFRPGRGLRQG

Query:  DPLSPYLFLLCPE---------------------------------DDSLILLKANETNCVSIKEVLRRYEEVSGQAINLDKSACMVSKNIPKAKAEELS
        DPLSPYLFLLC E                                 DDSL+  KAN   C  +KE+L +YE  SGQ +N DKS+   S N      E + 
Subjt:  DPLSPYLFLLCPE---------------------------------DDSLILLKANETNCVSIKEVLRRYEEVSGQAINLDKSACMVSKNIPKAKAEELS

Query:  NLLGIKLSESLGYYLGMPSQTGRRKHKVF-------------W--KMENSPGLERRVVLYGRERDSYQSSSPI---------------------GDKRKA
        N+LG         YLG+PS  GR K  VF             W  K+ +S G E  +    +   +Y  S  +                       + K 
Subjt:  NLLGIKLSESLGYYLGMPSQTGRRKHKVF-------------W--KMENSPGLERRVVLYGRERDSYQSSSPI---------------------GDKRKA

Query:  HWVSWKRLCASKGEGGLGFREISLFNQAMLAKQRWRLLNNQTSLLFKVFRGRYFAHNNFMEASLGNNPSLTWRSILWGRELFKKGMKWRVGNDKSIIIKD
         W+SW+++C  K  GGLGFR +  FN A+LAKQ WR+L N  SL  ++ + +YF + + + ASLG+NPS TWRSI    E+ KKG +WRVGN + I I D
Subjt:  HWVSWKRLCASKGEGGLGFREISLFNQAMLAKQRWRLLNNQTSLLFKVFRGRYFAHNNFMEASLGNNPSLTWRSILWGRELFKKGMKWRVGNDKSIIIKD

Query:  DPWLPKQGCYK---PVWIKKEFQGDRVASLID-GSGSWKEEVIRDIFLPSNSEAILDIPLGSKEDIDKIIWRSDKKGMFSVKSAYHLAYN---SSLADKS
        D WLP    YK   P  I +++    V+SLID  +  WK + IR +FLP ++EAIL IPL      D+IIW  +KKG FSVKSAY +A N   S+   + 
Subjt:  DPWLPKQGCYK---PVWIKKEFQGDRVASLID-GSGSWKEEVIRDIFLPSNSEAILDIPLGSKEDIDKIIWRSDKKGMFSVKSAYHLAYN---SSLADKS

Query:  SSSDP
        SS DP
Subjt:  SSSDP

A0A7N2LIH6 Uncharacterized protein2.2e-13638.6Show/hide
Query:  RSREEWLKWGDKNTKWFHSEASHRKKKNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFASSRPAT--VSIEENTRNISPKVTNDHKVDLNRPFSKIDIE
        RSR  WL++GDKN+K+FH+ AS R++KN I  + +  G W E     E + ++YFK +++S++P +  VS+E     ++P++ ND   +L + F  +++ 
Subjt:  RSREEWLKWGDKNTKWFHSEASHRKKKNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFASSRPAT--VSIEENTRNISPKVTNDHKVDLNRPFSKIDIE

Query:  NALKDMRPTKAPGDD-----------------ISRVCLYILNNDGDLNPINSTLISLIPKIPNPKRMEDFWPISLCNVVYKMISKAIANRMKRILDQIIS
         AL+ M PTKAPG D                 ++   L  LN+      IN T I LIPK  NP+++ +F PISLCNV+YK+ISK +ANR+K++L  +I 
Subjt:  NALKDMRPTKAPGDD-----------------ISRVCLYILNNDGDLNPINSTLISLIPKIPNPKRMEDFWPISLCNVVYKMISKAIANRMKRILDQIIS

Query:  PTQSSFIPGRLITDNVLLGFKCIHVINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMKCINSVSYSILINGSPTSEFRPGRGLR
          QS+F+PGR+ITDNV++ F+ +H IN++R  K G MAIKLDMSKAYDRVEW+++  +M+K+GF D WI  +M C+ SVS+S+LING P   F P RGLR
Subjt:  PTQSSFIPGRLITDNVLLGFKCIHVINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMKCINSVSYSILINGSPTSEFRPGRGLR

Query:  QGDPLSPYLFLLCPE---------------------------------DDSLILLKANETNCVSIKEVLRRYEEVSGQAINLDKSACMVSKNIPKAKAEE
        QGDP+SPYLFLLC E                                 DDS+I  +A    C  + +VL  YEE SGQ +N DK++   S+N      E 
Subjt:  QGDPLSPYLFLLCPE---------------------------------DDSLILLKANETNCVSIKEVLRRYEEVSGQAINLDKSACMVSKNIPKAKAEE

Query:  LSNLLGIKLSESLGYYLGMPSQTGRRKHKVF-------------WKMENSPGLERRVVLYG--------------------RERDSYQSS---SPIGDKR
           + G ++ +    YLG+P   GR K K F             WK +      R V++                       E +S   S      G ++
Subjt:  LSNLLGIKLSESLGYYLGMPSQTGRRKHKVF-------------WKMENSPGLERRVVLYG--------------------RERDSYQSS---SPIGDKR

Query:  KAHWVSWKRLCASKGEGGLGFREISLFNQAMLAKQRWRLLNNQTSLLFKVFRGRYFAHNNFMEASLGNNPSLTWRSILWGRELFKKGMKWRVGNDKSIII
        K  WVSWK LC  K +GG+GF+++  FN A+LAKQ WRL  N  SL  +V + +YFA+++FMEA LG  PS  WRSI+  + + K+G +W VG+ +SI I
Subjt:  KAHWVSWKRLCASKGEGGLGFREISLFNQAMLAKQRWRLLNNQTSLLFKVFRGRYFAHNNFMEASLGNNPSLTWRSILWGRELFKKGMKWRVGNDKSIII

Query:  KDDPWLPKQGCYKPVWIKK-EFQGDRVASLIDGS-GSWKEEVIRDIFLPSNSEAILDIPLGSKEDIDKIIWRSDKKGMFSVKSAYHLAYNSSLADKSSS-
         D  WLP     K +  +    QG+RVASLI    G WK  +++  F+P  +E IL IPL S    D ++W     G F+VKSAY  A+   L  +    
Subjt:  KDDPWLPKQGCYKPVWIKK-EFQGDRVASLIDGS-GSWKEEVIRDIFLPSNSEAILDIPLGSKEDIDKIIWRSDKKGMFSVKSAYHLAYNSSLADKSSS-

Query:  ----SDPTPVASFWK
            SD + +++ WK
Subjt:  ----SDPTPVASFWK

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein3.1e-2624.47Show/hide
Query:  RKKKNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFASSRPATVSIEENTRNIS-PKVTNDHKVDLNRPFSKIDIENALKDMRPTKAPGDDISRVCLY--
        +++KN ID I N  GD      +I+     Y+K L+A+       ++      + P++  +    LNRP +  +I   +  +   K+PG D      Y  
Subjt:  RKKKNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFASSRPATVSIEENTRNIS-PKVTNDHKVDLNRPFSKIDIENALKDMRPTKAPGDDISRVCLY--

Query:  --------------ILNNDGDL-NPINSTLISLIPKI-PNPKRMEDFWPISLCNVVYKMISKAIANRMKRILDQIISPTQSSFIPGRLITDNVLLGFKCI
                       +  +G L N      I LIPK   +  + E+F PISL N+  K+++K +ANR+++ + ++I   Q  FIPG     N+      I
Subjt:  --------------ILNNDGDL-NPINSTLISLIPKI-PNPKRMEDFWPISLCNVVYKMISKAIANRMKRILDQIISPTQSSFIPGRLITDNVLLGFKCI

Query:  HVINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMKCINSVSYSILINGSPTSEFRPGRGLRQGDPLSPYLFLLCPE--------
          INR +     ++ I +D  KA+D+++  F+ + + KLG +  ++K +    +  + +I++NG     F    G RQG PLSP LF +  E        
Subjt:  HVINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMKCINSVSYSILINGSPTSEFRPGRGLRQGDPLSPYLFLLCPE--------

Query:  --------------------DDSLILLKANETNCVSIKEVLRRYEEVSGQAINLDKSACMVSKNIPKAKAEELSNL
                            DD ++ L+    +  ++ +++  + +VSG  IN+ KS   +  N  + +++ +  L
Subjt:  --------------------DDSLILLKANETNCVSIKEVLRRYEEVSGQAINLDKSACMVSKNIPKAKAEELSNL

P08548 LINE-1 reverse transcriptase homolog7.3e-2825.7Show/hide
Query:  DKNTKWFHSEASH---------RKK--KNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFASSRPATVSIEENTRNIS-PKVTNDHKVDLNRPFSKIDIE
        +K+  WF  + +          RKK  K+ I  I N N +      +I+ +   Y+K L++        I++       P+++      LNRP S  +I 
Subjt:  DKNTKWFHSEASH---------RKK--KNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFASSRPATVSIEENTRNIS-PKVTNDHKVDLNRPFSKIDIE

Query:  NALKDMRPTKAPGDD-------------ISRVCLYILNN---DGDL-NPINSTLISLIPKI-PNPKRMEDFWPISLCNVVYKMISKAIANRMKRILDQII
        + ++++   K+PG D             +  + L +  N   +G L N      I+LIPK   +P R E++ PISL N+  K+++K + NR+++ + +II
Subjt:  NALKDMRPTKAPGDD-------------ISRVCLYILNN---DGDL-NPINSTLISLIPKI-PNPKRMEDFWPISLCNVVYKMISKAIANRMKRILDQII

Query:  SPTQSSFIPGRLITDNVLLGFKCIHVINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMKCINSVSYSILINGSPTSEFRPGRGL
           Q  FIPG     N+      I  IN+ ++    +M + +D  KA+D ++  F+ R ++K+G E  ++K +    +  + +I++NG     F    G 
Subjt:  SPTQSSFIPGRLITDNVLLGFKCIHVINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMKCINSVSYSILINGSPTSEFRPGRGL

Query:  RQGDPLSPYLFLLCPE----------------------------DDSLILLKANETNCVSIKEVLRRYEEVSGQAINLDKSACMVSKNIPKAK
        RQG PLSP LF +  E                            DD ++ L+    +   + EV++ Y  VSG  IN  KS   +  N  +A+
Subjt:  RQGDPLSPYLFLLCPE----------------------------DDSLILLKANETNCVSIKEVLRRYEEVSGQAINLDKSACMVSKNIPKAK

P11369 LINE-1 retrotransposable element ORF2 protein5.6e-2828.49Show/hide
Query:  KKKNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFASSRPATVSIEE-NTRNISPKVTNDHKVDLNRPFSKIDIENALKDMRPTKAPGDD----------
        + K  I+KI N  GD      +I++   +++K L+++       +++   R   PK+  D    LN P S  +IE  +  +   K+PG D          
Subjt:  KKKNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFASSRPATVSIEE-NTRNISPKVTNDHKVDLNRPFSKIDIENALKDMRPTKAPGDD----------

Query:  ------ISRVCLYILNNDGDL-NPINSTLISLIPK-IPNPKRMEDFWPISLCNVVYKMISKAIANRMKRILDQIISPTQSSFIPGRLITDNVLLGFKCIH
              I     + +  +G L N      I+LIPK   +P ++E+F PISL N+  K+++K +ANR++  +  II P Q  FIPG     N+      IH
Subjt:  ------ISRVCLYILNNDGDL-NPINSTLISLIPK-IPNPKRMEDFWPISLCNVVYKMISKAIANRMKRILDQIISPTQSSFIPGRLITDNVLLGFKCIH

Query:  VINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMKCINSVSYSILINGSPTSEFRPGRGLRQGDPLSPYLFLLCPEDDSLILLKA
         IN+ +     +M I LD  KA+D+++  F+ +V+E+ G +  ++  +    +    +I +NG          G RQG PLSPYLF +  E  +  + + 
Subjt:  VINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMKCINSVSYSILINGSPTSEFRPGRGLRQGDPLSPYLFLLCPEDDSLILLKA

Query:  NETNCVSIKEVLRRYEEVSGQAINLDKSACMVSKNIPKAKAEELSNLLGIKLSESLGY
         E   + I +     EEV    I+L     +V  + PK    EL NL+     E +GY
Subjt:  NETNCVSIKEVLRRYEEVSGQAINLDKSACMVSKNIPKAKAEELSNLLGIKLSESLGY

P14381 Transposon TX1 uncharacterized 149 kDa protein1.6e-2227.64Show/hide
Query:  MRSREEWLKWGDKNTKWFHSEASHRKKKNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFASSRPATVSIEENTRNISPKVTNDHKVDLNRPFSKIDIEN
        +RSR + L   D+ +++F++    +  +  I  +   +G  +E    I D A ++++ LF S  P +    E   +  P V+   K  L  P +  ++  
Subjt:  MRSREEWLKWGDKNTKWFHSEASHRKKKNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFASSRPATVSIEENTRNISPKVTNDHKVDLNRPFSKIDIEN

Query:  ALKDMRPTKAP-----------------GDDISRVCLYILNNDGDLNPINSTLISLIPKIPNPKRMEDFWPISLCNVVYKMISKAIANRMKRILDQIISP
        AL+ M   K+P                 G D  RV                 ++SL+PK  + + ++++ P+SL +  YK+++KAI+ R+K +L ++I P
Subjt:  ALKDMRPTKAP-----------------GDDISRVCLYILNNDGDLNPINSTLISLIPKIPNPKRMEDFWPISLCNVVYKMISKAIANRMKRILDQIISP

Query:  TQSSFIPGRLITDNVLLGFKCIHVINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMKCINSVSYSILINGSPTSEFRPGRGLRQ
         QS  +PGR I DNV L    +H   R   S A    + LD  KA+DRV+  ++   ++   F   ++  +     S    + IN S T+    GRG+RQ
Subjt:  TQSSFIPGRLITDNVLLGFKCIHVINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMKCINSVSYSILINGSPTSEFRPGRGLRQ

Query:  GDPLSPYLFLLCPEDDSLILLK
        G PLS  L+ L  E    +L K
Subjt:  GDPLSPYLFLLCPEDDSLILLK

P93295 Uncharacterized mitochondrial protein AtMg003105.4e-2345.45Show/hide
Query:  DKRKAHWVSWKRLCASK-GEGGLGFREISLFNQAMLAKQRWRLLNNQTSLLFKVFRGRYFAHNNFMEASLGNNPSLTWRSILWGRELFKKGMKWRVGNDK
        +KRK  WV+W++LC SK  +GGLGFR++  FNQA+LAKQ +R+++   +LL ++ R RYF H++ ME S+G  PS  WRSI+ GREL  +G+   +G+  
Subjt:  DKRKAHWVSWKRLCASK-GEGGLGFREISLFNQAMLAKQRWRLLNNQTSLLFKVFRGRYFAHNNFMEASLGNNPSLTWRSILWGRELFKKGMKWRVGNDK

Query:  SIIIKDDPWL
           +  D W+
Subjt:  SIIIKDDPWL

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.7e-0828.26Show/hide
Query:  RSREEWLKWGDKNTKWFHSEASHRKKKNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFAS-SRPATVSIEENTRNISPKVTNDHKVD-LNRPFSKIDIE
        +SR +WL+ GD NT++FH      + KN I  +   +   VE V  ++++ + Y+  L  S S   T    +  ++I P   ND     L+   S  +I 
Subjt:  RSREEWLKWGDKNTKWFHSEASHRKKKNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFAS-SRPATVSIEENTRNISPKVTNDHKVD-LNRPFSKIDIE

Query:  NALKDMRPTKAPGDDISRVCLY-----------------ILNNDGDLNPINSTLISLIPKIPNPKRMEDFWPISLCNVVYKMIS
         A+  M   KAPG D      +                        L   N+T I+LIPK+    ++  F P+S C VVYK+I+
Subjt:  NALKDMRPTKAPGDDISRVCLY-----------------ILNNDGDLNPINSTLISLIPKIPNPKRMEDFWPISLCNVVYKMIS

AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.2e-1436.36Show/hide
Query:  IANRMKRILDQIISPTQSSFIPGRLITDNVLLGFKCIHVINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMK
        +  R+K ++  +I P Q+SFIPGR+ TDN++   + +H + RK+  K G+M +KLD+ KAYDR+ W ++   +   GF + W+ ++ +
Subjt:  IANRMKRILDQIISPTQSSFIPGRLITDNVLLGFKCIHVINRKRSSKAGYMAIKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMK

AT4G29090.1 Ribonuclease H-like superfamily protein2.0e-2834.74Show/hide
Query:  HWVSWKRLCASKGEGGLGFREISLFNQAMLAKQRWRLLNNQTSLLFKVFRGRYFAHNNFMEASLGNNPSLTWRSILWGRELFKKGMKWRVGNDKSIIIKD
        HW +W  L   K EGG+GF++I  FN A+L KQ WR+L+   SL+ KVF+ RYF  ++ + A LG+ PS  W+SI   +E+ ++G +  VGN + III  
Subjt:  HWVSWKRLCASKGEGGLGFREISLFNQAMLAKQRWRLLNNQTSLLFKVFRGRYFAHNNFMEASLGNNPSLTWRSILWGRELFKKGMKWRVGNDKSIIIKD

Query:  DPWLPKQGCYKPVWIKK----EFQGD----RVASLIDGSG-SWKEEVIRDIFLPSNSEAILDIPLGSKEDIDKIIWRSDKKGMFSVKSAY
          WL  +     + +++    E+       +V+ LID SG  W+++VI  +F     + I ++  G +  +D   W     G ++VKS Y
Subjt:  DPWLPKQGCYKPVWIKK----EFQGD----RVASLIDGSG-SWKEEVIRDIFLPSNSEAILDIPLGSKEDIDKIIWRSDKKGMFSVKSAY

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.8e-2445.45Show/hide
Query:  DKRKAHWVSWKRLCASK-GEGGLGFREISLFNQAMLAKQRWRLLNNQTSLLFKVFRGRYFAHNNFMEASLGNNPSLTWRSILWGRELFKKGMKWRVGNDK
        +KRK  WV+W++LC SK  +GGLGFR++  FNQA+LAKQ +R+++   +LL ++ R RYF H++ ME S+G  PS  WRSI+ GREL  +G+   +G+  
Subjt:  DKRKAHWVSWKRLCASK-GEGGLGFREISLFNQAMLAKQRWRLLNNQTSLLFKVFRGRYFAHNNFMEASLGNNPSLTWRSILWGRELFKKGMKWRVGNDK

Query:  SIIIKDDPWL
           +  D W+
Subjt:  SIIIKDDPWL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)3.3e-0758.14Show/hide
Query:  LINGSPTSEFRPGRGLRQGDPLSPYLFLLCPEDDSLILLKANE
        +ING+P     P RGLRQGDPLSPYLF+LC E  S +  +A E
Subjt:  LINGSPTSEFRPGRGLRQGDPLSPYLFLLCPEDDSLILLKANE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGATCTAGAGAGGAGTGGCTCAAATGGGGTGACAAGAATACCAAATGGTTCCATTCAGAAGCTTCTCACAGAAAGAAGAAGAACTCCATTGATAAAATCAATAACAT
AAACGGGGACTGGGTGGAAGGTGTTGGTGATATCGAAGATGTAGCTATGAACTATTTCAAGTTTTTGTTTGCTTCCTCGAGGCCTGCCACGGTCTCCATCGAAGAGAACA
CTAGGAACATCTCCCCCAAAGTCACTAATGATCATAAAGTCGATCTGAATAGACCTTTCTCGAAAATTGACATAGAAAATGCTTTGAAAGATATGAGGCCCACGAAAGCT
CCAGGGGACGATATATCGAGAGTGTGCTTGTACATTCTTAACAATGACGGGGATCTGAATCCTATCAACTCCACTTTGATATCTCTTATTCCCAAAATCCCTAACCCCAA
AAGGATGGAAGACTTTTGGCCCATTAGCCTCTGCAATGTGGTGTATAAGATGATTTCTAAAGCCATAGCCAATAGAATGAAAAGGATCCTCGATCAAATTATTTCCCCTA
CTCAGTCATCTTTCATTCCTGGAAGACTCATCACCGATAACGTTCTCTTAGGCTTCAAATGTATCCACGTTATCAACAGAAAAAGATCGAGTAAGGCAGGTTACATGGCC
ATAAAGCTCGACATGAGCAAGGCCTATGATCGGGTCGAGTGGAGCTTTATTCGCAGAGTGATGGAGAAGCTTGGTTTTGAGGATAATTGGATCAAGAAAGTCATGAAGTG
TATCAATTCAGTTAGCTACTCAATCCTTATTAATGGCTCTCCAACTTCAGAGTTCAGGCCAGGGAGAGGTTTGCGCCAAGGAGATCCCCTATCTCCTTATTTGTTTCTTC
TCTGCCCTGAAGACGACAGTCTCATTCTCCTGAAGGCCAACGAGACCAACTGTGTATCTATCAAGGAAGTCCTTAGAAGATATGAAGAGGTTTCTGGCCAAGCTATCAAT
CTTGATAAGTCTGCTTGCATGGTGAGCAAGAATATCCCCAAAGCCAAAGCCGAGGAGCTTAGCAATCTCCTTGGTATCAAGCTTTCGGAGTCCTTGGGATACTATCTTGG
CATGCCCTCTCAGACGGGCAGGAGGAAGCACAAAGTGTTTTGGAAAATGGAAAACTCTCCGGGGCTGGAAAGAAGGGTTGTTCTCTATGGGAGGGAAAGAGACTCTTATC
AAAGCAGTAGCCCAATTGGGGACAAGAGGAAAGCTCATTGGGTGAGTTGGAAGAGGCTTTGCGCTAGTAAAGGAGAAGGGGGGCTTGGTTTTAGGGAAATTAGTCTCTTT
AATCAAGCAATGCTTGCTAAGCAGAGATGGAGACTCCTTAATAATCAGACTAGTCTCTTATTCAAGGTGTTCCGAGGGAGATATTTTGCCCATAACAACTTCATGGAAGC
CTCTTTAGGGAACAATCCCTCTCTTACTTGGAGAAGCATCCTTTGGGGCAGGGAGTTATTTAAGAAGGGCATGAAATGGAGAGTTGGCAATGACAAGAGTATTATCATCA
AAGATGACCCTTGGCTCCCTAAGCAAGGCTGTTATAAGCCAGTGTGGATCAAGAAGGAATTTCAAGGAGATAGGGTGGCCAGTTTGATTGATGGATCGGGGTCCTGGAAA
GAAGAAGTGATCAGAGACATTTTTCTCCCTTCTAACTCAGAAGCAATTCTTGACATCCCTCTCGGCAGCAAGGAGGATATTGACAAAATTATCTGGAGGTCGGACAAAAA
AGGGATGTTCTCGGTCAAAAGTGCCTATCATCTAGCTTACAATTCCTCCTTGGCCGATAAGAGTTCTTCCTCTGACCCCACTCCAGTAGCCTCATTCTGGAAAAAAAATA
TTGGAAAGTGGATGCAACCCCTAGAGAAAAAATTTGTGTGTGGAGAGAGCTCCAAAACTCCCTTCCTACTCAAGCAAATATTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGATCTAGAGAGGAGTGGCTCAAATGGGGTGACAAGAATACCAAATGGTTCCATTCAGAAGCTTCTCACAGAAAGAAGAAGAACTCCATTGATAAAATCAATAACAT
AAACGGGGACTGGGTGGAAGGTGTTGGTGATATCGAAGATGTAGCTATGAACTATTTCAAGTTTTTGTTTGCTTCCTCGAGGCCTGCCACGGTCTCCATCGAAGAGAACA
CTAGGAACATCTCCCCCAAAGTCACTAATGATCATAAAGTCGATCTGAATAGACCTTTCTCGAAAATTGACATAGAAAATGCTTTGAAAGATATGAGGCCCACGAAAGCT
CCAGGGGACGATATATCGAGAGTGTGCTTGTACATTCTTAACAATGACGGGGATCTGAATCCTATCAACTCCACTTTGATATCTCTTATTCCCAAAATCCCTAACCCCAA
AAGGATGGAAGACTTTTGGCCCATTAGCCTCTGCAATGTGGTGTATAAGATGATTTCTAAAGCCATAGCCAATAGAATGAAAAGGATCCTCGATCAAATTATTTCCCCTA
CTCAGTCATCTTTCATTCCTGGAAGACTCATCACCGATAACGTTCTCTTAGGCTTCAAATGTATCCACGTTATCAACAGAAAAAGATCGAGTAAGGCAGGTTACATGGCC
ATAAAGCTCGACATGAGCAAGGCCTATGATCGGGTCGAGTGGAGCTTTATTCGCAGAGTGATGGAGAAGCTTGGTTTTGAGGATAATTGGATCAAGAAAGTCATGAAGTG
TATCAATTCAGTTAGCTACTCAATCCTTATTAATGGCTCTCCAACTTCAGAGTTCAGGCCAGGGAGAGGTTTGCGCCAAGGAGATCCCCTATCTCCTTATTTGTTTCTTC
TCTGCCCTGAAGACGACAGTCTCATTCTCCTGAAGGCCAACGAGACCAACTGTGTATCTATCAAGGAAGTCCTTAGAAGATATGAAGAGGTTTCTGGCCAAGCTATCAAT
CTTGATAAGTCTGCTTGCATGGTGAGCAAGAATATCCCCAAAGCCAAAGCCGAGGAGCTTAGCAATCTCCTTGGTATCAAGCTTTCGGAGTCCTTGGGATACTATCTTGG
CATGCCCTCTCAGACGGGCAGGAGGAAGCACAAAGTGTTTTGGAAAATGGAAAACTCTCCGGGGCTGGAAAGAAGGGTTGTTCTCTATGGGAGGGAAAGAGACTCTTATC
AAAGCAGTAGCCCAATTGGGGACAAGAGGAAAGCTCATTGGGTGAGTTGGAAGAGGCTTTGCGCTAGTAAAGGAGAAGGGGGGCTTGGTTTTAGGGAAATTAGTCTCTTT
AATCAAGCAATGCTTGCTAAGCAGAGATGGAGACTCCTTAATAATCAGACTAGTCTCTTATTCAAGGTGTTCCGAGGGAGATATTTTGCCCATAACAACTTCATGGAAGC
CTCTTTAGGGAACAATCCCTCTCTTACTTGGAGAAGCATCCTTTGGGGCAGGGAGTTATTTAAGAAGGGCATGAAATGGAGAGTTGGCAATGACAAGAGTATTATCATCA
AAGATGACCCTTGGCTCCCTAAGCAAGGCTGTTATAAGCCAGTGTGGATCAAGAAGGAATTTCAAGGAGATAGGGTGGCCAGTTTGATTGATGGATCGGGGTCCTGGAAA
GAAGAAGTGATCAGAGACATTTTTCTCCCTTCTAACTCAGAAGCAATTCTTGACATCCCTCTCGGCAGCAAGGAGGATATTGACAAAATTATCTGGAGGTCGGACAAAAA
AGGGATGTTCTCGGTCAAAAGTGCCTATCATCTAGCTTACAATTCCTCCTTGGCCGATAAGAGTTCTTCCTCTGACCCCACTCCAGTAGCCTCATTCTGGAAAAAAAATA
TTGGAAAGTGGATGCAACCCCTAGAGAAAAAATTTGTGTGTGGAGAGAGCTCCAAAACTCCCTTCCTACTCAAGCAAATATTCTAG
Protein sequenceShow/hide protein sequence
MRSREEWLKWGDKNTKWFHSEASHRKKKNSIDKINNINGDWVEGVGDIEDVAMNYFKFLFASSRPATVSIEENTRNISPKVTNDHKVDLNRPFSKIDIENALKDMRPTKA
PGDDISRVCLYILNNDGDLNPINSTLISLIPKIPNPKRMEDFWPISLCNVVYKMISKAIANRMKRILDQIISPTQSSFIPGRLITDNVLLGFKCIHVINRKRSSKAGYMA
IKLDMSKAYDRVEWSFIRRVMEKLGFEDNWIKKVMKCINSVSYSILINGSPTSEFRPGRGLRQGDPLSPYLFLLCPEDDSLILLKANETNCVSIKEVLRRYEEVSGQAIN
LDKSACMVSKNIPKAKAEELSNLLGIKLSESLGYYLGMPSQTGRRKHKVFWKMENSPGLERRVVLYGRERDSYQSSSPIGDKRKAHWVSWKRLCASKGEGGLGFREISLF
NQAMLAKQRWRLLNNQTSLLFKVFRGRYFAHNNFMEASLGNNPSLTWRSILWGRELFKKGMKWRVGNDKSIIIKDDPWLPKQGCYKPVWIKKEFQGDRVASLIDGSGSWK
EEVIRDIFLPSNSEAILDIPLGSKEDIDKIIWRSDKKGMFSVKSAYHLAYNSSLADKSSSSDPTPVASFWKKNIGKWMQPLEKKFVCGESSKTPFLLKQIF