; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0012568 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0012568
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
Genome locationchr1:42271749..42279116
RNA-Seq ExpressionLag0012568
SyntenyLag0012568
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN62051.1 hypothetical protein VITISV_016641 [Vitis vinifera]6.7e-17434.01Show/hide
Query:  GLCQGNWCIAGDFNMVRWSEERLNVTNSTRSMRKFNSFI-DSELIDPPMSNGPFSWSRLGTRMAASRIDRFLISHLWNDTFPELRVERLQRPTSDHFPIM
        GL    WC+ GDFN++R S E+L  +  T  M+ F+ FI D ELID P+ +  ++WS +       R+DRFL S+ W   FP+     L R TSDH+PI+
Subjt:  GLCQGNWCIAGDFNMVRWSEERLNVTNSTRSMRKFNSFI-DSELIDPPMSNGPFSWSRLGTRMAASRIDRFLISHLWNDTFPELRVERLQRPTSDHFPIM

Query:  MSIGVVKWGPTPFRFENLWIDMPDFKEKIEKWWGDLKPIGWAGFRMMEKLKGLKALIKDWNKELRTKAISKKTNILREIEIIDKLEDQNAIDASQIEERK
        +     KWGPTPFRFEN+W+    FKE   +WW + +  GW G + M KL+ +KA +K+WNK    +   KK +IL  +   D LE +  +    + +R 
Subjt:  MSIGVVKWGPTPFRFENLWIDMPDFKEKIEKWWGDLKPIGWAGFRMMEKLKGLKALIKDWNKELRTKAISKKTNILREIEIIDKLEDQNAIDASQIEERK

Query:  NLKAELLDLTIAEQRSINKKCKLKWVQEGDENSSFFHKWASAMRSKAFISMIEKKNGEILTNQNEIENEILNFYSKLYEKDSNLRFVIEGPDWAPIDATS
          K EL +L + E+    +K ++KWV+EGD NS FFHK A+  R++ FI  +E ++G +L N   I+ EIL ++ KLY   S   + +EG DW+PID  S
Subjt:  NLKAELLDLTIAEQRSINKKCKLKWVQEGDENSSFFHKWASAMRSKAFISMIEKKNGEILTNQNEIENEILNFYSKLYEKDSNLRFVIEGPDWAPIDATS

Query:  SSKLEDVFSVEEIYKAVQDMGSQKSPGPDGMTGEFWKNYWNILKSDIVEMFQEFFHNGIINKRTNETYVCVIPKKKKASKVRDYRPISLVTSPYKLIAKV
        +S+LE  F+ EEIYKA+  M   K+PGPDG T   +++ W+++K D+V +F EF  +GIIN+ TN +++ ++PKK  + ++ D+RPISL+TS YK+IAKV
Subjt:  SSKLEDVFSVEEIYKAVQDMGSQKSPGPDGMTGEFWKNYWNILKSDIVEMFQEFFHNGIINKRTNETYVCVIPKKKKASKVRDYRPISLVTSPYKLIAKV

Query:  LAERLKKVLPNTI--CQAAFVQGRQILDAILVATEAVEDIRLRKEKGFLLKLDLEKAYDMVGWDFLDGMLELKGFGNKWRRWIKGCLNNTNFSILINGRP
        LA RL+ VL  TI   Q AFVQGRQILDA+L+A E V++ R   E+G + K+D EKAYD + WDFLD +LE+KGF  +WR+W++GCL++ ++++L+NG  
Subjt:  LAERLKKVLPNTI--CQAAFVQGRQILDAILVATEAVEDIRLRKEKGFLLKLDLEKAYDMVGWDFLDGMLELKGFGNKWRRWIKGCLNNTNFSILINGRP

Query:  RGKIMASRGLRQGDPFPRS-CSRLWVTLSASQSIIDV--------------AKWWEILRMIMR------------GSGLTL-------------------
        +G +   +    G    ++  SRL V L    S   +                W  ++  I R            G  +TL                   
Subjt:  RGKIMASRGLRQGDPFPRS-CSRLWVTLSASQSIIDV--------------AKWWEILRMIMR------------GSGLTL-------------------

Query:  -----------------NLAKSAGSNLAKWEWSSLPRLHGGLGVSSLAQRNFSFMTKWLWRFTQDKNSLWRQIIISIYG--ASGWKTTPFRGKKGNKIWV
                          + +    +L  W+    P+  GGLG   ++ RN + + KWLWR+ ++ ++LW Q+I+SIYG  ++GW             W 
Subjt:  -----------------NLAKSAGSNLAKWEWSSLPRLHGGLGVSSLAQRNFSFMTKWLWRFTQDKNSLWRQIIISIYG--ASGWKTTPFRGKKGNKIWV

Query:  DIAANYPTFLKFTKFIVNNGSSIKFWKDRWCDNQPLQALFPDLYLISERKDAVVADCWEDNNQ-AWNLAFRRGLFDRELTSWMTLVEKLSLIRLN-NGHD
         IA  +  F KFT+F+V +G  I+FW D W  +QPL   +P L  +   K+A+++     +   +WN  FRR L D E+    +L+  L  + ++ +  D
Subjt:  DIAANYPTFLKFTKFIVNNGSSIKFWKDRWCDNQPLQALFPDLYLISERKDAVVADCWEDNNQ-AWNLAFRRGLFDRELTSWMTLVEKLSLIRLN-NGHD

Query:  EICWTLEGSGSYTASSMFQNIAQKNSEAGVYP-----------------------------------------PRPAGFVSSRGENLDHLFIHCEFAKKV
        +  W++  SG +T  S F  ++Q      V+P                                         P         GE +DHLF+HC   K +
Subjt:  EICWTLEGSGSYTASSMFQNIAQKNSEAGVYP-----------------------------------------PRPAGFVSSRGENLDHLFIHCEFAKKV

Query:  WFFVANLFGLSFYLPNKIDDWLMEGLAGWNLRKKAKIMVGCAFRAALWLLWKERNSRTFEDKSASLEFFCDNVQNTASCWILNFKIYE
        W  +  L  + +  P  I D       G+   K+  ++   A  A +W++W+ERN+R FEDK+ +     D+++  AS W    K+++
Subjt:  WFFVANLFGLSFYLPNKIDDWLMEGLAGWNLRKKAKIMVGCAFRAALWLLWKERNSRTFEDKSASLEFFCDNVQNTASCWILNFKIYE

CAN65484.1 hypothetical protein VITISV_029474 [Vitis vinifera]1.3e-17232.74Show/hide
Query:  GLCQGNWCIAGDFNMVRWSEERLNVTNSTRSMRKFNSFI-DSELIDPPMSNGPFSWSRLGTRMAASRIDRFLISHLWNDTFPELRVERLQRPTSDHFPIM
        GL    WC+ GDFN++R S E+L  +  T SM+ F+ FI D ELID P+ +  F+WS +       R+DRFL S+ W   FP+     L R TSDH+PI+
Subjt:  GLCQGNWCIAGDFNMVRWSEERLNVTNSTRSMRKFNSFI-DSELIDPPMSNGPFSWSRLGTRMAASRIDRFLISHLWNDTFPELRVERLQRPTSDHFPIM

Query:  MSIGVVKWGPTPFRFENLWIDMPDFKEKIEKWWGDLKPIGWAGFRMMEKLKGLKALIKDWNKELRTKAISKKTNILREIEIIDKLEDQNAIDASQIEERK
        +     KWGPTPFRFEN+W+  P FKE   +WW + +  GW G + M KL+ +KA +K WNK    +   +K +IL ++   D LE +  +    + +R 
Subjt:  MSIGVVKWGPTPFRFENLWIDMPDFKEKIEKWWGDLKPIGWAGFRMMEKLKGLKALIKDWNKELRTKAISKKTNILREIEIIDKLEDQNAIDASQIEERK

Query:  NLKAELLDLTIAEQRSINKKCKLKWVQEGDENSSFFHKWASAMRSKAFISMIEKKNGEILTNQNEIENEILNFYSKLYEKDSNLRFVIEGPDWAPIDATS
          K EL +L + E+    +K ++KWV+EGD NS FFHK A+  R++ FI  +E +NG ++ N   I+ EIL ++ KLY   S   + +EG DW+PI   S
Subjt:  NLKAELLDLTIAEQRSINKKCKLKWVQEGDENSSFFHKWASAMRSKAFISMIEKKNGEILTNQNEIENEILNFYSKLYEKDSNLRFVIEGPDWAPIDATS

Query:  SSKLEDVFSVEEIYKAVQDMGSQKSPGPDGMTGEFWKNYWNILKSDIVEMFQEFFHNGIINKRTNETYVCVIPKKKKASKVRDYRPISLVTSPYKLIAKV
        + +LE  F+ EEI+KA+  M   K+PGPDG T   +++ W ++K D+V++F EF  +GIIN+ TN +++ ++PKK  + ++ D+RPISL+TS YK+IAKV
Subjt:  SSKLEDVFSVEEIYKAVQDMGSQKSPGPDGMTGEFWKNYWNILKSDIVEMFQEFFHNGIINKRTNETYVCVIPKKKKASKVRDYRPISLVTSPYKLIAKV

Query:  LAERLKKVLPNTI--CQAAFVQGRQILDAILVATEAVEDIRLRKEKGFLLKLDLEKAYDMVGWDFLDGMLELKGFGNKWRRWIKGCLNNTNFSILINGRP
        LA R++ VL  TI   Q AFVQGRQILDA+L+A E V++ R   E+G + K+D EKAYD V WDFLD +LE+KGFG +WR+W++GCL++ +F++L+NG  
Subjt:  LAERLKKVLPNTI--CQAAFVQGRQILDAILVATEAVEDIRLRKEKGFLLKLDLEKAYDMVGWDFLDGMLELKGFGNKWRRWIKGCLNNTNFSILINGRP

Query:  RGKIMASRGLRQGDPFP------------------------------RSCSRLWVTLSASQSII-------DVAKWWEILRMIMRGSGLTLNLAKS----
        +G + ASRGLRQGDP                                R+ +R+     A  +I        D+     +L +    SGL +NL KS    
Subjt:  RGKIMASRGLRQGDPFP------------------------------RSCSRLWVTLSASQSII-------DVAKWWEILRMIMRGSGLTLNLAKS----

Query:  ------------------------------AGSN------------------------------------------------------------------
                                       G N                                                                  
Subjt:  ------------------------------AGSN------------------------------------------------------------------

Query:  -------------LAKWEWSSLPRLHGGLGVSSLAQRNFSFMTKWLWRFTQDKNSLWRQIIISIYG--ASGWKTTPFRGKKGNKIWVDIAANYPTFLKFT
                     L  W+    P+  GGLG   ++ RN + + KWLWR+ ++ ++LW Q+I+SIYG  ++GW             W  IA  Y  F KFT
Subjt:  -------------LAKWEWSSLPRLHGGLGVSSLAQRNFSFMTKWLWRFTQDKNSLWRQIIISIYG--ASGWKTTPFRGKKGNKIWVDIAANYPTFLKFT

Query:  KFIVNNGSSIKFWKDRWCDNQPLQALFPDLYLISERKDAVVADCWEDNNQ-AWNLAFRRGLFDRELTSWMTLVEKLSLIRLNNG-HDEICWTLEGSGSYT
        +F+V NG  I+FW D W   QPL   +P L  +   K+A ++         +WN  FRR L D E+     L++ L  + +++   D+  W L  SG +T
Subjt:  KFIVNNGSSIKFWKDRWCDNQPLQALFPDLYLISERKDAVVADCWEDNNQ-AWNLAFRRGLFDRELTSWMTLVEKLSLIRLNNG-HDEICWTLEGSGSYT

Query:  ASSMFQNIAQKNSEAGVYPP---------------------------------RPAGFVS--------SRGENLDHLFIHCEFAKKVWFFVANLFGLSFY
          S F  ++Q +    ++P                                  RP   +S          GE +DHLF+HC     +W  +     + + 
Subjt:  ASSMFQNIAQKNSEAGVYPP---------------------------------RPAGFVS--------SRGENLDHLFIHCEFAKKVWFFVANLFGLSFY

Query:  LPNKIDDWLMEGLAGWNLRKKAKIMVGCAFRAALWLLWKERNSRTFEDKSASLEFFCDNVQNTASCWILNFKIYE
         P  I D L     G+   K+  ++   A  A +W++W+ERN+R FEDK+ + E+  D++    S W    K+++
Subjt:  LPNKIDDWLMEGLAGWNLRKKAKIMVGCAFRAALWLLWKERNSRTFEDKSASLEFFCDNVQNTASCWILNFKIYE

CAN68165.1 hypothetical protein VITISV_008538 [Vitis vinifera]1.4e-16832.93Show/hide
Query:  GLCQGNWCIAGDFNMVRWSEERLNVTNSTRSMRKFNSFI-DSELIDPPMSNGPFSWSRLGTRMAASRIDRFLISHLWNDTFPELRVERLQRPTSDHFPIM
        GL    WC+ GDFN++R   E+L     T SM+  + FI ++ELIDPP+ +  F+WS +       R+DRFL S+ W   FP+   E L R TSDH+PI+
Subjt:  GLCQGNWCIAGDFNMVRWSEERLNVTNSTRSMRKFNSFI-DSELIDPPMSNGPFSWSRLGTRMAASRIDRFLISHLWNDTFPELRVERLQRPTSDHFPIM

Query:  MSIGVVKWGPTPFRFENLWIDMPDFKEKIEKWWGDLKPIGWAGFRMMEKLKGLKALIKDWNKELRTKAISKKTNILREIEIIDKLEDQNAIDASQIEERK
        +     KWGPTPFRFEN+W+  P FKE   +WW + +  GW G + M KL+ LKA +K+WNK      I +K  IL +I   D +E +  +    + +R 
Subjt:  MSIGVVKWGPTPFRFENLWIDMPDFKEKIEKWWGDLKPIGWAGFRMMEKLKGLKALIKDWNKELRTKAISKKTNILREIEIIDKLEDQNAIDASQIEERK

Query:  NLKAELLDLTIAEQRSINKKCKLKWVQEGDENSSFFHKWASAMRSKAFISMIEKKNGEILTNQNEIENEILNFYSKLYEKDSNLRFVIEGPDWAPIDATS
          K EL +L + E+    +K ++KWV+EGD NS  FHK A+  R++ FI ++E + G +L N + I+ EIL ++ KLY   S   + +EG DW+PI   S
Subjt:  NLKAELLDLTIAEQRSINKKCKLKWVQEGDENSSFFHKWASAMRSKAFISMIEKKNGEILTNQNEIENEILNFYSKLYEKDSNLRFVIEGPDWAPIDATS

Query:  SSKLEDVFSVEEIYKAVQDMGSQKSPGPDGMTGEFWKNYWNILKSDIVEMFQEFFHNGIINKRTNETYVCVIPKKKKASKVRDYRPISLVTSPYKLIAKV
        +S+LE  F+ EEIYKA+  M    +PGPDG T   +++ W+++K D+V +F EF  +GIIN+ TN +++ ++PKK  A K+ +YRPISL+TS YK+IAKV
Subjt:  SSKLEDVFSVEEIYKAVQDMGSQKSPGPDGMTGEFWKNYWNILKSDIVEMFQEFFHNGIINKRTNETYVCVIPKKKKASKVRDYRPISLVTSPYKLIAKV

Query:  LAERLKKVLPNTI--CQAAFVQGRQILDAILVATEAVEDIRLRKEKGFLLKLDLEKAYDMVGWDFLDGMLELKGFGNKWRRWIKGCLNNTNFSILINGRP
        LA RL+ +L  TI   Q AFVQGRQILDA+L+A E V++ +   E+G + K+D EKAYD V WDFLD ++E KGF    R+WI+ CL++ +F+IL+NG  
Subjt:  LAERLKKVLPNTI--CQAAFVQGRQILDAILVATEAVEDIRLRKEKGFLLKLDLEKAYDMVGWDFLDGMLELKGFGNKWRRWIKGCLNNTNFSILINGRP

Query:  RGKIMASRGLRQGDPFP------------------------------RSCSRLWVTLSASQSII-------DVAKWWEILRMIMRGSGLTLNLAKS----
        +G +   RGLRQGDP                                R+ +R+     A  +I        D+     +L +    SGL +NL KS    
Subjt:  RGKIMASRGLRQGDPFP------------------------------RSCSRLWVTLSASQSII-------DVAKWWEILRMIMRGSGLTLNLAKS----

Query:  -----------------------------------------------AGSNLAKWE--------------------------------------------
                                                         S L  W+                                            
Subjt:  -----------------------------------------------AGSNLAKWE--------------------------------------------

Query:  --WSSL----------------PRLHGGLGVSSLAQRNFSFMTKWLWRFTQDKNSLWRQIIISIYG--ASGWKTTPFRGKKGNKIWVDIAANYPTFLKFT
          WS +                 ++ GGLG+  ++ RN + + KWLWR+ ++ ++LW Q+I+SIYG  ++GW             W  IA  +  F KFT
Subjt:  --WSSL----------------PRLHGGLGVSSLAQRNFSFMTKWLWRFTQDKNSLWRQIIISIYG--ASGWKTTPFRGKKGNKIWVDIAANYPTFLKFT

Query:  KFIVNNGSSIKFWKDRWCDNQPLQALFPDLYLISERKDAVVADCWEDNNQ-AWNLAFRRGLFDRELTSWMTLVEKLSLIRLN-NGHDEICWTLEGSGSYT
        +FIV +G  I+FW+D W  +Q L   FP L  +   K+ +++         +WN  FRR L D E+    +L++ L  I L+ +  D+  W+L  SG +T
Subjt:  KFIVNNGSSIKFWKDRWCDNQPLQALFPDLYLISERKDAVVADCWEDNNQ-AWNLAFRRGLFDRELTSWMTLVEKLSLIRLN-NGHDEICWTLEGSGSYT

Query:  ASSMFQNIAQKNSEAGVYPP---------------------------------RPAGFVS--------SRGENLDHLFIHCEFAKKVWFFVANLFGLSFY
          S F  ++Q +    V+P                                  RP   +S         RGE +DHLF+HC     +W  +  L  + + 
Subjt:  ASSMFQNIAQKNSEAGVYPP---------------------------------RPAGFVS--------SRGENLDHLFIHCEFAKKVWFFVANLFGLSFY

Query:  LPNKIDDWLMEGLAGWNLRKKAKIMVGCAFRAALWLLWKERNSRTFEDKSASLEFFCDNVQNTASCWILNFKIYE
         P  + D +     G+   K+  ++   A  A LW++W+ERN+R FEDKS + E   D +   AS W+   K+++
Subjt:  LPNKIDDWLMEGLAGWNLRKKAKIMVGCAFRAALWLLWKERNSRTFEDKSASLEFFCDNVQNTASCWILNFKIYE

RVW19678.1 Structural maintenance of chromosomes protein 3, partial [Vitis vinifera]3.9e-16634.38Show/hide
Query:  TRSMRKFNSFI-DSELIDPPMSNGPFSWSRLGTRMAASRIDRFLISHLWNDTFPELRVERLQRPTSDHFPIMMSIGVVKWGPTPFRFENLWIDMPDFKEK
        T +MR+F+ FI +S L+DPP+ N  F+WS +       R+DRFL S  W+  F +   E L R TSDH PI +      WGPTPFRFEN+W+  P+FKEK
Subjt:  TRSMRKFNSFI-DSELIDPPMSNGPFSWSRLGTRMAASRIDRFLISHLWNDTFPELRVERLQRPTSDHFPIMMSIGVVKWGPTPFRFENLWIDMPDFKEK

Query:  IEKWWGDLKPIGWAGFRMMEKLKGLKALIKDWNKELRTKAISKKTNILREIEIIDKLEDQNAIDASQIEERKNLKAELLDLTIAEQRSINKKCKLKWVQE
           WW +    GW G + M KLK +K+ +K+WN  +      +K +IL ++  ID++E +  ++   + ER   + EL DL + E+    +K ++KW++E
Subjt:  IEKWWGDLKPIGWAGFRMMEKLKGLKALIKDWNKELRTKAISKKTNILREIEIIDKLEDQNAIDASQIEERKNLKAELLDLTIAEQRSINKKCKLKWVQE

Query:  GDENSSFFHKWASAMRSKAFISMIEKKNGEILTNQNEIENEILNFYSKLYEKDSNLRFVIEGPDWAPIDATSSSKLEDVFSVEEIYKAVQDMGSQKSPGP
        GD NS FFH+ A+  RS+ FI  +  + GE L N   I  EI+NF+  LY K     + IEG DWAPI   S+  L+  FS EE+  AV  +  +K+PGP
Subjt:  GDENSSFFHKWASAMRSKAFISMIEKKNGEILTNQNEIENEILNFYSKLYEKDSNLRFVIEGPDWAPIDATSSSKLEDVFSVEEIYKAVQDMGSQKSPGP

Query:  DGMTGEFWKNYWNILKSDIVEMFQEFFHNGIINKRTNETYVCVIPKKKKASKVRDYRPISLVTSPYKLIAKVLAERLKKVLPNTI--CQAAFVQGRQILD
        DG T   ++  W+++K D++ +F EF   G+IN+ TN T++ ++PKK +  K+ DYRPISLVTS YK+IAKVL+ RL+KVL  TI   Q AFV+GRQILD
Subjt:  DGMTGEFWKNYWNILKSDIVEMFQEFFHNGIINKRTNETYVCVIPKKKKASKVRDYRPISLVTSPYKLIAKVLAERLKKVLPNTI--CQAAFVQGRQILD

Query:  AILVATEAVEDIRLRKEKGFLLKLDLEKAYDMVGWDFLDGMLELKGFGNKWRRWIKGCLNNTNFSILINGRPRGKIMASRGLRQGDPFPR--------SC
        A+L+A E V++ R   E+G + K+D EKAYD V W FLD +L+ KGF  KWR W++GCL++++F+IL+NG  +G + ASRGLRQGDP             
Subjt:  AILVATEAVEDIRLRKEKGFLLKLDLEKAYDMVGWDFLDGMLELKGFGNKWRRWIKGCLNNTNFSILINGRPRGKIMASRGLRQGDPFPR--------SC

Query:  SRLWV---TLSASQSIIDVAKWWEILRMIM-RGSGLTLNLAKSAGS------------------------------------------------------
        SRL +       +++ +D+ +  +I+ M+  + SGL +NL KS  S                                                      
Subjt:  SRLWV---TLSASQSIIDVAKWWEILRMIM-RGSGLTLNLAKSAGS------------------------------------------------------

Query:  -------------------------------NLAKWEWSSLPRLHGGLGVSSLAQRNFSFMTKWLWRFTQDKNSLWRQIIISIYGA--SGWKTTPFRGKK
                                       +L +WE  S PR  GGLG    + RN + + KWLWRF ++++ LW ++I SIYG   +GW         
Subjt:  -------------------------------NLAKWEWSSLPRLHGGLGVSSLAQRNFSFMTKWLWRFTQDKNSLWRQIIISIYGA--SGWKTTPFRGKK

Query:  GNKIWVDIAANYPTFLKFTKFIVNNGSSIKFWKDRWCDNQPLQALFPDLYLISERKDAVVADCWEDN-NQAWNLAFRRGLFDRELTSWMTLVEKLSLIRL
            W  IA  +  F  F + +V NG  I+FW+D W  NQ L A F +LY +   ++  V++   ++   +WN  FRR L D E+     L+  L  + L
Subjt:  GNKIWVDIAANYPTFLKFTKFIVNNGSSIKFWKDRWCDNQPLQALFPDLYLISERKDAVVADCWEDN-NQAWNLAFRRGLFDRELTSWMTLVEKLSLIRL

Query:  N-NGHDEICWTLEGSGSYTASSMFQNIAQKNSEAGVYP-----------------------------------------PRPAGFVSSRGENLDHLFIHC
        + +  D   W+L  SGS++  S F  +++ ++     P                                         P+        GE++DHLF+HC
Subjt:  N-NGHDEICWTLEGSGSYTASSMFQNIAQKNSEAGVYP-----------------------------------------PRPAGFVSSRGENLDHLFIHC

Query:  EFAKKVWFFVANLFGLSFYLPNKIDDWLMEGLAGWNLRKKAKIMVGCAFRAALWLLWKERNSRTFEDKSASLEFFCDNVQNTASCW
             +W  + NL G+ +  P  I+D L+    G     + KI+   A    +W++W+ERN+R FEDK  + E   D ++  +S W
Subjt:  EFAKKVWFFVANLFGLSFYLPNKIDDWLMEGLAGWNLRKKAKIMVGCAFRAALWLLWKERNSRTFEDKSASLEFFCDNVQNTASCW

RVW70235.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]4.3e-17332.65Show/hide
Query:  GLCQGNWCIAGDFNMVRWSEERLNVTNSTRSMRKFNSFI-DSELIDPPMSNGPFSWSRLGTRMAASRIDRFLISHLWNDTFPELRVERLQRPTSDHFPIM
        GL    WC+ GDFN++R S E+L  +  T SM+ F+ FI D ELID P+ +  F+WS +       R+DRFL S+ W  TFP+     L R TSDH+PI+
Subjt:  GLCQGNWCIAGDFNMVRWSEERLNVTNSTRSMRKFNSFI-DSELIDPPMSNGPFSWSRLGTRMAASRIDRFLISHLWNDTFPELRVERLQRPTSDHFPIM

Query:  MSIGVVKWGPTPFRFENLWIDMPDFKEKIEKWWGDLKPIGWAGFRMMEKLKGLKALIKDWNKELRTKAISKKTNILREIEIIDKLEDQNAIDASQIEERK
        +     KWGPTPFRFEN+W+  P FKE   +WW + +  GW G + M KL+ +KA +K WNK    +   +K +IL  +   D LE +  +    + +R 
Subjt:  MSIGVVKWGPTPFRFENLWIDMPDFKEKIEKWWGDLKPIGWAGFRMMEKLKGLKALIKDWNKELRTKAISKKTNILREIEIIDKLEDQNAIDASQIEERK

Query:  NLKAELLDLTIAEQRSINKKCKLKWVQEGDENSSFFHKWASAMRSKAFISMIEKKNGEILTNQNEIENEILNFYSKLYEKDSNLRFVIEGPDWAPIDATS
          K EL +L + E+    +K ++KWV+EGD NS FFHK A+  R++ FI  +E +NG+++ N   I+ EIL ++ KLY   S   + +EG DW+PI   S
Subjt:  NLKAELLDLTIAEQRSINKKCKLKWVQEGDENSSFFHKWASAMRSKAFISMIEKKNGEILTNQNEIENEILNFYSKLYEKDSNLRFVIEGPDWAPIDATS

Query:  SSKLEDVFSVEEIYKAVQDMGSQKSPGPDGMTGEFWKNYWNILKSDIVEMFQEFFHNGIINKRTNETYVCVIPKKKKASKVRDYRPISLVTSPYKLIAKV
        + +LE  F+ EEI KA+  M   K+PGPDG T   +++ W ++K D+V++F EF  +GIIN+ TN +++ ++PKK  + ++ D+RPISL+TS YK+IAKV
Subjt:  SSKLEDVFSVEEIYKAVQDMGSQKSPGPDGMTGEFWKNYWNILKSDIVEMFQEFFHNGIINKRTNETYVCVIPKKKKASKVRDYRPISLVTSPYKLIAKV

Query:  LAERLKKVLPNTI--CQAAFVQGRQILDAILVATEAVEDIRLRKEKGFLLKLDLEKAYDMVGWDFLDGMLELKGFGNKWRRWIKGCLNNTNFSILINGRP
        LA R+++VL  TI   Q AFVQGRQILDA+L+A E V++ R   E+G + K+D EKAYD V WDFLD ++E+KGFG +WR+W++GCL++ +F++L+NG  
Subjt:  LAERLKKVLPNTI--CQAAFVQGRQILDAILVATEAVEDIRLRKEKGFLLKLDLEKAYDMVGWDFLDGMLELKGFGNKWRRWIKGCLNNTNFSILINGRP

Query:  RGKIMASRGLRQGDPFP------------------------------RSCSRLWVTLSASQSII-------DVAKWWEILRMIMRGSGLTLNLAKS----
        +G + ASRGLRQGDP                                R+ +R+     A  +I        D+     +L +    SGL +NL KS    
Subjt:  RGKIMASRGLRQGDPFP------------------------------RSCSRLWVTLSASQSII-------DVAKWWEILRMIMRGSGLTLNLAKS----

Query:  ------------------------------AGSN------------------------------------------------------------------
                                       G N                                                                  
Subjt:  ------------------------------AGSN------------------------------------------------------------------

Query:  -------------LAKWEWSSLPRLHGGLGVSSLAQRNFSFMTKWLWRFTQDKNSLWRQIIISIYG--ASGWKTTPFRGKKGNKIWVDIAANYPTFLKFT
                     L  W+    P+  GGLG   ++ RN + + KWLWR+ ++ ++LW Q+I+SIYG  ++GW             W  IA  Y  F KFT
Subjt:  -------------LAKWEWSSLPRLHGGLGVSSLAQRNFSFMTKWLWRFTQDKNSLWRQIIISIYG--ASGWKTTPFRGKKGNKIWVDIAANYPTFLKFT

Query:  KFIVNNGSSIKFWKDRWCDNQPLQALFPDLYLISERKDAVVADCWEDNNQ-AWNLAFRRGLFDRELTSWMTLVEKLSLIRLNNG-HDEICWTLEGSGSYT
        +F+V NG  I+FW D W   QPL   +P L  +   K+A ++         +WN  FRR L D E+     L++    + +++   D+  W+L  SG +T
Subjt:  KFIVNNGSSIKFWKDRWCDNQPLQALFPDLYLISERKDAVVADCWEDNNQ-AWNLAFRRGLFDRELTSWMTLVEKLSLIRLNNG-HDEICWTLEGSGSYT

Query:  ASSMFQNIAQKNSEAGVYPP---------------------------------RPAGFVS--------SRGENLDHLFIHCEFAKKVWFFVANLFGLSFY
          S F  ++Q +    ++P                                  RP   +S          GE +DHLF+HC     +W  +     + + 
Subjt:  ASSMFQNIAQKNSEAGVYPP---------------------------------RPAGFVS--------SRGENLDHLFIHCEFAKKVWFFVANLFGLSFY

Query:  LPNKIDDWLMEGLAGWNLRKKAKIMVGCAFRAALWLLWKERNSRTFEDKSASLEFFCDNVQNTASCWILNFKIYE
         P  I D L     G+   K+  ++   A  A +W++W+ERN+R FEDK+ + E+  D++    S W    K+++
Subjt:  LPNKIDDWLMEGLAGWNLRKKAKIMVGCAFRAALWLLWKERNSRTFEDKSASLEFFCDNVQNTASCWILNFKIYE

TrEMBL top hitse value%identityAlignment
A0A438GDE7 LINE-1 retrotransposable element ORF2 protein2.1e-17332.65Show/hide
Query:  GLCQGNWCIAGDFNMVRWSEERLNVTNSTRSMRKFNSFI-DSELIDPPMSNGPFSWSRLGTRMAASRIDRFLISHLWNDTFPELRVERLQRPTSDHFPIM
        GL    WC+ GDFN++R S E+L  +  T SM+ F+ FI D ELID P+ +  F+WS +       R+DRFL S+ W  TFP+     L R TSDH+PI+
Subjt:  GLCQGNWCIAGDFNMVRWSEERLNVTNSTRSMRKFNSFI-DSELIDPPMSNGPFSWSRLGTRMAASRIDRFLISHLWNDTFPELRVERLQRPTSDHFPIM

Query:  MSIGVVKWGPTPFRFENLWIDMPDFKEKIEKWWGDLKPIGWAGFRMMEKLKGLKALIKDWNKELRTKAISKKTNILREIEIIDKLEDQNAIDASQIEERK
        +     KWGPTPFRFEN+W+  P FKE   +WW + +  GW G + M KL+ +KA +K WNK    +   +K +IL  +   D LE +  +    + +R 
Subjt:  MSIGVVKWGPTPFRFENLWIDMPDFKEKIEKWWGDLKPIGWAGFRMMEKLKGLKALIKDWNKELRTKAISKKTNILREIEIIDKLEDQNAIDASQIEERK

Query:  NLKAELLDLTIAEQRSINKKCKLKWVQEGDENSSFFHKWASAMRSKAFISMIEKKNGEILTNQNEIENEILNFYSKLYEKDSNLRFVIEGPDWAPIDATS
          K EL +L + E+    +K ++KWV+EGD NS FFHK A+  R++ FI  +E +NG+++ N   I+ EIL ++ KLY   S   + +EG DW+PI   S
Subjt:  NLKAELLDLTIAEQRSINKKCKLKWVQEGDENSSFFHKWASAMRSKAFISMIEKKNGEILTNQNEIENEILNFYSKLYEKDSNLRFVIEGPDWAPIDATS

Query:  SSKLEDVFSVEEIYKAVQDMGSQKSPGPDGMTGEFWKNYWNILKSDIVEMFQEFFHNGIINKRTNETYVCVIPKKKKASKVRDYRPISLVTSPYKLIAKV
        + +LE  F+ EEI KA+  M   K+PGPDG T   +++ W ++K D+V++F EF  +GIIN+ TN +++ ++PKK  + ++ D+RPISL+TS YK+IAKV
Subjt:  SSKLEDVFSVEEIYKAVQDMGSQKSPGPDGMTGEFWKNYWNILKSDIVEMFQEFFHNGIINKRTNETYVCVIPKKKKASKVRDYRPISLVTSPYKLIAKV

Query:  LAERLKKVLPNTI--CQAAFVQGRQILDAILVATEAVEDIRLRKEKGFLLKLDLEKAYDMVGWDFLDGMLELKGFGNKWRRWIKGCLNNTNFSILINGRP
        LA R+++VL  TI   Q AFVQGRQILDA+L+A E V++ R   E+G + K+D EKAYD V WDFLD ++E+KGFG +WR+W++GCL++ +F++L+NG  
Subjt:  LAERLKKVLPNTI--CQAAFVQGRQILDAILVATEAVEDIRLRKEKGFLLKLDLEKAYDMVGWDFLDGMLELKGFGNKWRRWIKGCLNNTNFSILINGRP

Query:  RGKIMASRGLRQGDPFP------------------------------RSCSRLWVTLSASQSII-------DVAKWWEILRMIMRGSGLTLNLAKS----
        +G + ASRGLRQGDP                                R+ +R+     A  +I        D+     +L +    SGL +NL KS    
Subjt:  RGKIMASRGLRQGDPFP------------------------------RSCSRLWVTLSASQSII-------DVAKWWEILRMIMRGSGLTLNLAKS----

Query:  ------------------------------AGSN------------------------------------------------------------------
                                       G N                                                                  
Subjt:  ------------------------------AGSN------------------------------------------------------------------

Query:  -------------LAKWEWSSLPRLHGGLGVSSLAQRNFSFMTKWLWRFTQDKNSLWRQIIISIYG--ASGWKTTPFRGKKGNKIWVDIAANYPTFLKFT
                     L  W+    P+  GGLG   ++ RN + + KWLWR+ ++ ++LW Q+I+SIYG  ++GW             W  IA  Y  F KFT
Subjt:  -------------LAKWEWSSLPRLHGGLGVSSLAQRNFSFMTKWLWRFTQDKNSLWRQIIISIYG--ASGWKTTPFRGKKGNKIWVDIAANYPTFLKFT

Query:  KFIVNNGSSIKFWKDRWCDNQPLQALFPDLYLISERKDAVVADCWEDNNQ-AWNLAFRRGLFDRELTSWMTLVEKLSLIRLNNG-HDEICWTLEGSGSYT
        +F+V NG  I+FW D W   QPL   +P L  +   K+A ++         +WN  FRR L D E+     L++    + +++   D+  W+L  SG +T
Subjt:  KFIVNNGSSIKFWKDRWCDNQPLQALFPDLYLISERKDAVVADCWEDNNQ-AWNLAFRRGLFDRELTSWMTLVEKLSLIRLNNG-HDEICWTLEGSGSYT

Query:  ASSMFQNIAQKNSEAGVYPP---------------------------------RPAGFVS--------SRGENLDHLFIHCEFAKKVWFFVANLFGLSFY
          S F  ++Q +    ++P                                  RP   +S          GE +DHLF+HC     +W  +     + + 
Subjt:  ASSMFQNIAQKNSEAGVYPP---------------------------------RPAGFVS--------SRGENLDHLFIHCEFAKKVWFFVANLFGLSFY

Query:  LPNKIDDWLMEGLAGWNLRKKAKIMVGCAFRAALWLLWKERNSRTFEDKSASLEFFCDNVQNTASCWILNFKIYE
         P  I D L     G+   K+  ++   A  A +W++W+ERN+R FEDK+ + E+  D++    S W    K+++
Subjt:  LPNKIDDWLMEGLAGWNLRKKAKIMVGCAFRAALWLLWKERNSRTFEDKSASLEFFCDNVQNTASCWILNFKIYE

A5BCI7 Reverse transcriptase domain-containing protein6.1e-17332.74Show/hide
Query:  GLCQGNWCIAGDFNMVRWSEERLNVTNSTRSMRKFNSFI-DSELIDPPMSNGPFSWSRLGTRMAASRIDRFLISHLWNDTFPELRVERLQRPTSDHFPIM
        GL    WC+ GDFN++R S E+L  +  T SM+ F+ FI D ELID P+ +  F+WS +       R+DRFL S+ W   FP+     L R TSDH+PI+
Subjt:  GLCQGNWCIAGDFNMVRWSEERLNVTNSTRSMRKFNSFI-DSELIDPPMSNGPFSWSRLGTRMAASRIDRFLISHLWNDTFPELRVERLQRPTSDHFPIM

Query:  MSIGVVKWGPTPFRFENLWIDMPDFKEKIEKWWGDLKPIGWAGFRMMEKLKGLKALIKDWNKELRTKAISKKTNILREIEIIDKLEDQNAIDASQIEERK
        +     KWGPTPFRFEN+W+  P FKE   +WW + +  GW G + M KL+ +KA +K WNK    +   +K +IL ++   D LE +  +    + +R 
Subjt:  MSIGVVKWGPTPFRFENLWIDMPDFKEKIEKWWGDLKPIGWAGFRMMEKLKGLKALIKDWNKELRTKAISKKTNILREIEIIDKLEDQNAIDASQIEERK

Query:  NLKAELLDLTIAEQRSINKKCKLKWVQEGDENSSFFHKWASAMRSKAFISMIEKKNGEILTNQNEIENEILNFYSKLYEKDSNLRFVIEGPDWAPIDATS
          K EL +L + E+    +K ++KWV+EGD NS FFHK A+  R++ FI  +E +NG ++ N   I+ EIL ++ KLY   S   + +EG DW+PI   S
Subjt:  NLKAELLDLTIAEQRSINKKCKLKWVQEGDENSSFFHKWASAMRSKAFISMIEKKNGEILTNQNEIENEILNFYSKLYEKDSNLRFVIEGPDWAPIDATS

Query:  SSKLEDVFSVEEIYKAVQDMGSQKSPGPDGMTGEFWKNYWNILKSDIVEMFQEFFHNGIINKRTNETYVCVIPKKKKASKVRDYRPISLVTSPYKLIAKV
        + +LE  F+ EEI+KA+  M   K+PGPDG T   +++ W ++K D+V++F EF  +GIIN+ TN +++ ++PKK  + ++ D+RPISL+TS YK+IAKV
Subjt:  SSKLEDVFSVEEIYKAVQDMGSQKSPGPDGMTGEFWKNYWNILKSDIVEMFQEFFHNGIINKRTNETYVCVIPKKKKASKVRDYRPISLVTSPYKLIAKV

Query:  LAERLKKVLPNTI--CQAAFVQGRQILDAILVATEAVEDIRLRKEKGFLLKLDLEKAYDMVGWDFLDGMLELKGFGNKWRRWIKGCLNNTNFSILINGRP
        LA R++ VL  TI   Q AFVQGRQILDA+L+A E V++ R   E+G + K+D EKAYD V WDFLD +LE+KGFG +WR+W++GCL++ +F++L+NG  
Subjt:  LAERLKKVLPNTI--CQAAFVQGRQILDAILVATEAVEDIRLRKEKGFLLKLDLEKAYDMVGWDFLDGMLELKGFGNKWRRWIKGCLNNTNFSILINGRP

Query:  RGKIMASRGLRQGDPFP------------------------------RSCSRLWVTLSASQSII-------DVAKWWEILRMIMRGSGLTLNLAKS----
        +G + ASRGLRQGDP                                R+ +R+     A  +I        D+     +L +    SGL +NL KS    
Subjt:  RGKIMASRGLRQGDPFP------------------------------RSCSRLWVTLSASQSII-------DVAKWWEILRMIMRGSGLTLNLAKS----

Query:  ------------------------------AGSN------------------------------------------------------------------
                                       G N                                                                  
Subjt:  ------------------------------AGSN------------------------------------------------------------------

Query:  -------------LAKWEWSSLPRLHGGLGVSSLAQRNFSFMTKWLWRFTQDKNSLWRQIIISIYG--ASGWKTTPFRGKKGNKIWVDIAANYPTFLKFT
                     L  W+    P+  GGLG   ++ RN + + KWLWR+ ++ ++LW Q+I+SIYG  ++GW             W  IA  Y  F KFT
Subjt:  -------------LAKWEWSSLPRLHGGLGVSSLAQRNFSFMTKWLWRFTQDKNSLWRQIIISIYG--ASGWKTTPFRGKKGNKIWVDIAANYPTFLKFT

Query:  KFIVNNGSSIKFWKDRWCDNQPLQALFPDLYLISERKDAVVADCWEDNNQ-AWNLAFRRGLFDRELTSWMTLVEKLSLIRLNNG-HDEICWTLEGSGSYT
        +F+V NG  I+FW D W   QPL   +P L  +   K+A ++         +WN  FRR L D E+     L++ L  + +++   D+  W L  SG +T
Subjt:  KFIVNNGSSIKFWKDRWCDNQPLQALFPDLYLISERKDAVVADCWEDNNQ-AWNLAFRRGLFDRELTSWMTLVEKLSLIRLNNG-HDEICWTLEGSGSYT

Query:  ASSMFQNIAQKNSEAGVYPP---------------------------------RPAGFVS--------SRGENLDHLFIHCEFAKKVWFFVANLFGLSFY
          S F  ++Q +    ++P                                  RP   +S          GE +DHLF+HC     +W  +     + + 
Subjt:  ASSMFQNIAQKNSEAGVYPP---------------------------------RPAGFVS--------SRGENLDHLFIHCEFAKKVWFFVANLFGLSFY

Query:  LPNKIDDWLMEGLAGWNLRKKAKIMVGCAFRAALWLLWKERNSRTFEDKSASLEFFCDNVQNTASCWILNFKIYE
         P  I D L     G+   K+  ++   A  A +W++W+ERN+R FEDK+ + E+  D++    S W    K+++
Subjt:  LPNKIDDWLMEGLAGWNLRKKAKIMVGCAFRAALWLLWKERNSRTFEDKSASLEFFCDNVQNTASCWILNFKIYE

A5BPI6 Uncharacterized protein7.0e-16932.93Show/hide
Query:  GLCQGNWCIAGDFNMVRWSEERLNVTNSTRSMRKFNSFI-DSELIDPPMSNGPFSWSRLGTRMAASRIDRFLISHLWNDTFPELRVERLQRPTSDHFPIM
        GL    WC+ GDFN++R   E+L     T SM+  + FI ++ELIDPP+ +  F+WS +       R+DRFL S+ W   FP+   E L R TSDH+PI+
Subjt:  GLCQGNWCIAGDFNMVRWSEERLNVTNSTRSMRKFNSFI-DSELIDPPMSNGPFSWSRLGTRMAASRIDRFLISHLWNDTFPELRVERLQRPTSDHFPIM

Query:  MSIGVVKWGPTPFRFENLWIDMPDFKEKIEKWWGDLKPIGWAGFRMMEKLKGLKALIKDWNKELRTKAISKKTNILREIEIIDKLEDQNAIDASQIEERK
        +     KWGPTPFRFEN+W+  P FKE   +WW + +  GW G + M KL+ LKA +K+WNK      I +K  IL +I   D +E +  +    + +R 
Subjt:  MSIGVVKWGPTPFRFENLWIDMPDFKEKIEKWWGDLKPIGWAGFRMMEKLKGLKALIKDWNKELRTKAISKKTNILREIEIIDKLEDQNAIDASQIEERK

Query:  NLKAELLDLTIAEQRSINKKCKLKWVQEGDENSSFFHKWASAMRSKAFISMIEKKNGEILTNQNEIENEILNFYSKLYEKDSNLRFVIEGPDWAPIDATS
          K EL +L + E+    +K ++KWV+EGD NS  FHK A+  R++ FI ++E + G +L N + I+ EIL ++ KLY   S   + +EG DW+PI   S
Subjt:  NLKAELLDLTIAEQRSINKKCKLKWVQEGDENSSFFHKWASAMRSKAFISMIEKKNGEILTNQNEIENEILNFYSKLYEKDSNLRFVIEGPDWAPIDATS

Query:  SSKLEDVFSVEEIYKAVQDMGSQKSPGPDGMTGEFWKNYWNILKSDIVEMFQEFFHNGIINKRTNETYVCVIPKKKKASKVRDYRPISLVTSPYKLIAKV
        +S+LE  F+ EEIYKA+  M    +PGPDG T   +++ W+++K D+V +F EF  +GIIN+ TN +++ ++PKK  A K+ +YRPISL+TS YK+IAKV
Subjt:  SSKLEDVFSVEEIYKAVQDMGSQKSPGPDGMTGEFWKNYWNILKSDIVEMFQEFFHNGIINKRTNETYVCVIPKKKKASKVRDYRPISLVTSPYKLIAKV

Query:  LAERLKKVLPNTI--CQAAFVQGRQILDAILVATEAVEDIRLRKEKGFLLKLDLEKAYDMVGWDFLDGMLELKGFGNKWRRWIKGCLNNTNFSILINGRP
        LA RL+ +L  TI   Q AFVQGRQILDA+L+A E V++ +   E+G + K+D EKAYD V WDFLD ++E KGF    R+WI+ CL++ +F+IL+NG  
Subjt:  LAERLKKVLPNTI--CQAAFVQGRQILDAILVATEAVEDIRLRKEKGFLLKLDLEKAYDMVGWDFLDGMLELKGFGNKWRRWIKGCLNNTNFSILINGRP

Query:  RGKIMASRGLRQGDPFP------------------------------RSCSRLWVTLSASQSII-------DVAKWWEILRMIMRGSGLTLNLAKS----
        +G +   RGLRQGDP                                R+ +R+     A  +I        D+     +L +    SGL +NL KS    
Subjt:  RGKIMASRGLRQGDPFP------------------------------RSCSRLWVTLSASQSII-------DVAKWWEILRMIMRGSGLTLNLAKS----

Query:  -----------------------------------------------AGSNLAKWE--------------------------------------------
                                                         S L  W+                                            
Subjt:  -----------------------------------------------AGSNLAKWE--------------------------------------------

Query:  --WSSL----------------PRLHGGLGVSSLAQRNFSFMTKWLWRFTQDKNSLWRQIIISIYG--ASGWKTTPFRGKKGNKIWVDIAANYPTFLKFT
          WS +                 ++ GGLG+  ++ RN + + KWLWR+ ++ ++LW Q+I+SIYG  ++GW             W  IA  +  F KFT
Subjt:  --WSSL----------------PRLHGGLGVSSLAQRNFSFMTKWLWRFTQDKNSLWRQIIISIYG--ASGWKTTPFRGKKGNKIWVDIAANYPTFLKFT

Query:  KFIVNNGSSIKFWKDRWCDNQPLQALFPDLYLISERKDAVVADCWEDNNQ-AWNLAFRRGLFDRELTSWMTLVEKLSLIRLN-NGHDEICWTLEGSGSYT
        +FIV +G  I+FW+D W  +Q L   FP L  +   K+ +++         +WN  FRR L D E+    +L++ L  I L+ +  D+  W+L  SG +T
Subjt:  KFIVNNGSSIKFWKDRWCDNQPLQALFPDLYLISERKDAVVADCWEDNNQ-AWNLAFRRGLFDRELTSWMTLVEKLSLIRLN-NGHDEICWTLEGSGSYT

Query:  ASSMFQNIAQKNSEAGVYPP---------------------------------RPAGFVS--------SRGENLDHLFIHCEFAKKVWFFVANLFGLSFY
          S F  ++Q +    V+P                                  RP   +S         RGE +DHLF+HC     +W  +  L  + + 
Subjt:  ASSMFQNIAQKNSEAGVYPP---------------------------------RPAGFVS--------SRGENLDHLFIHCEFAKKVWFFVANLFGLSFY

Query:  LPNKIDDWLMEGLAGWNLRKKAKIMVGCAFRAALWLLWKERNSRTFEDKSASLEFFCDNVQNTASCWILNFKIYE
         P  + D +     G+   K+  ++   A  A LW++W+ERN+R FEDKS + E   D +   AS W+   K+++
Subjt:  LPNKIDDWLMEGLAGWNLRKKAKIMVGCAFRAALWLLWKERNSRTFEDKSASLEFFCDNVQNTASCWILNFKIYE

A5C377 Uncharacterized protein2.5e-17434.01Show/hide
Query:  GLCQGNWCIAGDFNMVRWSEERLNVTNSTRSMRKFNSFI-DSELIDPPMSNGPFSWSRLGTRMAASRIDRFLISHLWNDTFPELRVERLQRPTSDHFPIM
        GL    WC+ GDFN++R S E+L  +  T  M+ F+ FI D ELID P+ +  ++WS +       R+DRFL S+ W   FP+     L R TSDH+PI+
Subjt:  GLCQGNWCIAGDFNMVRWSEERLNVTNSTRSMRKFNSFI-DSELIDPPMSNGPFSWSRLGTRMAASRIDRFLISHLWNDTFPELRVERLQRPTSDHFPIM

Query:  MSIGVVKWGPTPFRFENLWIDMPDFKEKIEKWWGDLKPIGWAGFRMMEKLKGLKALIKDWNKELRTKAISKKTNILREIEIIDKLEDQNAIDASQIEERK
        +     KWGPTPFRFEN+W+    FKE   +WW + +  GW G + M KL+ +KA +K+WNK    +   KK +IL  +   D LE +  +    + +R 
Subjt:  MSIGVVKWGPTPFRFENLWIDMPDFKEKIEKWWGDLKPIGWAGFRMMEKLKGLKALIKDWNKELRTKAISKKTNILREIEIIDKLEDQNAIDASQIEERK

Query:  NLKAELLDLTIAEQRSINKKCKLKWVQEGDENSSFFHKWASAMRSKAFISMIEKKNGEILTNQNEIENEILNFYSKLYEKDSNLRFVIEGPDWAPIDATS
          K EL +L + E+    +K ++KWV+EGD NS FFHK A+  R++ FI  +E ++G +L N   I+ EIL ++ KLY   S   + +EG DW+PID  S
Subjt:  NLKAELLDLTIAEQRSINKKCKLKWVQEGDENSSFFHKWASAMRSKAFISMIEKKNGEILTNQNEIENEILNFYSKLYEKDSNLRFVIEGPDWAPIDATS

Query:  SSKLEDVFSVEEIYKAVQDMGSQKSPGPDGMTGEFWKNYWNILKSDIVEMFQEFFHNGIINKRTNETYVCVIPKKKKASKVRDYRPISLVTSPYKLIAKV
        +S+LE  F+ EEIYKA+  M   K+PGPDG T   +++ W+++K D+V +F EF  +GIIN+ TN +++ ++PKK  + ++ D+RPISL+TS YK+IAKV
Subjt:  SSKLEDVFSVEEIYKAVQDMGSQKSPGPDGMTGEFWKNYWNILKSDIVEMFQEFFHNGIINKRTNETYVCVIPKKKKASKVRDYRPISLVTSPYKLIAKV

Query:  LAERLKKVLPNTI--CQAAFVQGRQILDAILVATEAVEDIRLRKEKGFLLKLDLEKAYDMVGWDFLDGMLELKGFGNKWRRWIKGCLNNTNFSILINGRP
        LA RL+ VL  TI   Q AFVQGRQILDA+L+A E V++ R   E+G + K+D EKAYD + WDFLD +LE+KGF  +WR+W++GCL++ ++++L+NG  
Subjt:  LAERLKKVLPNTI--CQAAFVQGRQILDAILVATEAVEDIRLRKEKGFLLKLDLEKAYDMVGWDFLDGMLELKGFGNKWRRWIKGCLNNTNFSILINGRP

Query:  RGKIMASRGLRQGDPFPRS-CSRLWVTLSASQSIIDV--------------AKWWEILRMIMR------------GSGLTL-------------------
        +G +   +    G    ++  SRL V L    S   +                W  ++  I R            G  +TL                   
Subjt:  RGKIMASRGLRQGDPFPRS-CSRLWVTLSASQSIIDV--------------AKWWEILRMIMR------------GSGLTL-------------------

Query:  -----------------NLAKSAGSNLAKWEWSSLPRLHGGLGVSSLAQRNFSFMTKWLWRFTQDKNSLWRQIIISIYG--ASGWKTTPFRGKKGNKIWV
                          + +    +L  W+    P+  GGLG   ++ RN + + KWLWR+ ++ ++LW Q+I+SIYG  ++GW             W 
Subjt:  -----------------NLAKSAGSNLAKWEWSSLPRLHGGLGVSSLAQRNFSFMTKWLWRFTQDKNSLWRQIIISIYG--ASGWKTTPFRGKKGNKIWV

Query:  DIAANYPTFLKFTKFIVNNGSSIKFWKDRWCDNQPLQALFPDLYLISERKDAVVADCWEDNNQ-AWNLAFRRGLFDRELTSWMTLVEKLSLIRLN-NGHD
         IA  +  F KFT+F+V +G  I+FW D W  +QPL   +P L  +   K+A+++     +   +WN  FRR L D E+    +L+  L  + ++ +  D
Subjt:  DIAANYPTFLKFTKFIVNNGSSIKFWKDRWCDNQPLQALFPDLYLISERKDAVVADCWEDNNQ-AWNLAFRRGLFDRELTSWMTLVEKLSLIRLN-NGHD

Query:  EICWTLEGSGSYTASSMFQNIAQKNSEAGVYP-----------------------------------------PRPAGFVSSRGENLDHLFIHCEFAKKV
        +  W++  SG +T  S F  ++Q      V+P                                         P         GE +DHLF+HC   K +
Subjt:  EICWTLEGSGSYTASSMFQNIAQKNSEAGVYP-----------------------------------------PRPAGFVSSRGENLDHLFIHCEFAKKV

Query:  WFFVANLFGLSFYLPNKIDDWLMEGLAGWNLRKKAKIMVGCAFRAALWLLWKERNSRTFEDKSASLEFFCDNVQNTASCWILNFKIYE
        W  +  L  + +  P  I D       G+   K+  ++   A  A +W++W+ERN+R FEDK+ +     D+++  AS W    K+++
Subjt:  WFFVANLFGLSFYLPNKIDDWLMEGLAGWNLRKKAKIMVGCAFRAALWLLWKERNSRTFEDKSASLEFFCDNVQNTASCWILNFKIYE

M5WPQ5 Reverse transcriptase domain-containing protein2.9e-16735.58Show/hide
Query:  GLCQGNWCIAGDFNMVRWSEERLNVTNSTRSMRKFNSFI-DSELIDPPMSNGPFSWSRLGTRMAASRIDRFLISHLWNDTFPELRVERLQRPTSDHFPIM
        G C   WC+ GDFN+VR+S E+ N    T+SMR FN FI ++ L DP + N  F+WS L       R+DRFL+S  W + FP  R + L R TSDH PI 
Subjt:  GLCQGNWCIAGDFNMVRWSEERLNVTNSTRSMRKFNSFI-DSELIDPPMSNGPFSWSRLGTRMAASRIDRFLISHLWNDTFPELRVERLQRPTSDHFPIM

Query:  MSIGVVKWGPTPFRFENLWIDMPDFKEKIEKWWGDLKPIGWAGFRMMEKLKGLKALIKDWNKELRTKAISKKTNILREIE----IIDKLEDQNAIDASQI
        +    VKWGP+PFRFEN+W++ PDFK KI+ WWG+ +  GW G++ M +LK LK+ +K W+KE            LRE E    ++D+ E    +D    
Subjt:  MSIGVVKWGPTPFRFENLWIDMPDFKEKIEKWWGDLKPIGWAGFRMMEKLKGLKALIKDWNKELRTKAISKKTNILREIE----IIDKLEDQNAIDASQI

Query:  EERKNLKAELLDLTIAEQRSINKKCKLKWVQEGDENSSFFHKWASAMRSKAFISMIEKKNGEILTNQNEIENEILNFYSKLYEKDSNLRFVIEGPDWAPI
         ER NL  ++ DL   E+    ++ K+KW +EGD N+ FFH+ A+  R + +I  +E ++  ++     IE E++ F+  LY ++ N+ + +EG +W PI
Subjt:  EERKNLKAELLDLTIAEQRSINKKCKLKWVQEGDENSSFFHKWASAMRSKAFISMIEKKNGEILTNQNEIENEILNFYSKLYEKDSNLRFVIEGPDWAPI

Query:  DATSSSKLEDVFSVEEIYKAVQDMGSQKSPGPDGMTGEFWKNYWNILKSDIVEMFQEFFHNGIINKRTNETYVCVIPKKKKASKVRDYRPISLVTSPYKL
            +  LE  F +EE+ KAV D G  KSPGPDG +  F+++ W ++K D++++ Q+FF +GI+N  TNET++C+IPKK  + KV DYRPISLVTS YK+
Subjt:  DATSSSKLEDVFSVEEIYKAVQDMGSQKSPGPDGMTGEFWKNYWNILKSDIVEMFQEFFHNGIINKRTNETYVCVIPKKKKASKVRDYRPISLVTSPYKL

Query:  IAKVLAERLKKVLPNTICQA--AFVQGRQILDAILVATEAVEDIRLRKEKGFLLKLDLEKAYDMVGWDFLDGMLELKGFGNKWRRWIKGCLNNTNFSILI
        I+KVLA RL++VL NTI Q+  AFVQ RQILDA+LVA E VE++R +K KG + K+D EKAYD V W+F+D ++  KGFG KWR WI GCL + NFSI+I
Subjt:  IAKVLAERLKKVLPNTICQA--AFVQGRQILDAILVATEAVEDIRLRKEKGFLLKLDLEKAYDMVGWDFLDGMLELKGFGNKWRRWIKGCLNNTNFSILI

Query:  NGRPRGKIMASRGLRQGDPFPRSCSRLWVTLSASQSIIDV-------AKWW----EILRMIMRGSGLTLNLAKS---------------AGS--------
        NG+PRGK  ASRGLRQGDP       L + +S  Q   D         ++W    ++L++    SG+ +N AKS               AGS        
Subjt:  NGRPRGKIMASRGLRQGDPFPRSCSRLWVTLSASQSIIDV-------AKWW----EILRMIMRGSGLTLNLAKS---------------AGS--------

Query:  ------------------------------------------------------------------------------------------NLAKWEWSSL
                                                                                                  +L +WE  + 
Subjt:  ------------------------------------------------------------------------------------------NLAKWEWSSL

Query:  PRLHGGLGVSSLAQRNFSFMTKWLWRFTQDKNSLWRQIIISIYG--ASGWKTTPFRGKKGNKIWVDIAANYPTFLKFTKFIVNNGSSIKFWKDRWCDNQP
         +  GGLG+ SL +RN +   KWLWRF  + NSLW +II S YG  ++GW T           W +I+  Y +FL+  +F V NG  I+FW+D W     
Subjt:  PRLHGGLGVSSLAQRNFSFMTKWLWRFTQDKNSLWRQIIISIYG--ASGWKTTPFRGKKGNKIWVDIAANYPTFLKFTKFIVNNGSSIKFWKDRWCDNQP

Query:  LQALFPDLYLISERKDAVVADCWEDNNQA---WNLAFRRGLFDRELTSWMTLVEKLSLIRLNNGH-DEICWTLEGSGSYTASSMFQNIAQKNSEA-----
        L+ LFP L  +S RK+  +A C+ +N+     W+  FRR L + EL   + L++ L  +RL     D   W +E  GS++  S    +     +      
Subjt:  LQALFPDLYLISERKDAVVADCWEDNNQA---WNLAFRRGLFDRELTSWMTLVEKLSLIRLNNGH-DEICWTLEGSGSYTASSMFQNIAQKNSEA-----

Query:  ----GVYPPRPAGFV-------------------------------SSRGENLDHLFIHCEFAKKVWFFVANLFGLSF
               PP+   FV                                   EN+DHLFIHC ++ ++W+ +    G+ +
Subjt:  ----GVYPPRPAGFV-------------------------------SSRGENLDHLFIHCEFAKKVWFFVANLFGLSF

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein2.5e-2224.15Show/hide
Query:  WAGFRMMEKLKGLKALIKDWNKELRTKAISKKTNILREIEIIDKLEDQNAIDASQIEERKNLKAELLDLTIAEQRSINKKCKLKWVQEGDENSSFFHKWA
        W  F+ +   +G    +  + ++     I   T+ L+E+E     ++Q    AS+ +E   ++AEL ++   +      + +  + +  ++      +  
Subjt:  WAGFRMMEKLKGLKALIKDWNKELRTKAISKKTNILREIEIIDKLEDQNAIDASQIEERKNLKAELLDLTIAEQRSINKKCKLKWVQEGDENSSFFHKWA

Query:  SAMRSKAFISMIEKKNGEILTNQNEIENEILNFYSKLY-EKDSNLRFVIEGPDWAPIDATSSSKLEDV---FSVEEIYKAVQDMGSQKSPGPDGMTGEFW
           R K  I  I+   G+I T+  EI+  I  +Y  LY  K  NL  +    D   +   +  ++E +    +  EI   +  + ++KSPGPDG T EF+
Subjt:  SAMRSKAFISMIEKKNGEILTNQNEIENEILNFYSKLY-EKDSNLRFVIEGPDWAPIDATSSSKLEDV---FSVEEIYKAVQDMGSQKSPGPDGMTGEFW

Query:  KNYWNILKSDIVEMFQEFFHNGIINKRTNETYVCVIPKK-KKASKVRDYRPISLVTSPYKLIAKVLAERLKKVLPNTI--CQAAFVQGRQILDAILVATE
        + Y   L   ++++FQ     GI+     E  + +IPK  +  +K  ++RPISL+    K++ K+LA R+++ +   I   Q  F+ G Q    I  +  
Subjt:  KNYWNILKSDIVEMFQEFFHNGIINKRTNETYVCVIPKK-KKASKVRDYRPISLVTSPYKLIAKVLAERLKKVLPNTI--CQAAFVQGRQILDAILVATE

Query:  AVEDIRLRKEKG-FLLKLDLEKAYDMVGWDFLDGMLELKGFGNKWRRWIKGCLNNTNFSILINGRPRGKIMASRGLRQGDP
         ++ I   K+K   ++ +D EKA+D +   F+   L   G    + + I+   +    +I++NG+         G RQG P
Subjt:  AVEDIRLRKEKG-FLLKLDLEKAYDMVGWDFLDGMLELKGFGNKWRRWIKGCLNNTNFSILINGRPRGKIMASRGLRQGDP

P08548 LINE-1 reverse transcriptase homolog5.0e-2326.86Show/hide
Query:  KALIKDWNKELRTKAISKKT------NILREIEIIDKLEDQNAIDASQIEERKNLKAELLDLTIAEQRSINK--KCKLKWVQEGDENSSFFHKWASAMRS
        KA+++   K +  +A  KKT      N++  ++ ++K E  N    S+ +E   ++AEL +  I  +R I +  K K  + ++ ++            R 
Subjt:  KALIKDWNKELRTKAISKKT------NILREIEIIDKLEDQNAIDASQIEERKNLKAELLDLTIAEQRSINK--KCKLKWVQEGDENSSFFHKWASAMRS

Query:  KAFISMIEKKNGEILTNQNEIENEILNFYSKLY-EKDSNLRFVIEGPDWAPIDATSSSKLEDV---FSVEEIYKAVQDMGSQKSPGPDGMTGEFWKNYWN
        K+ IS I   N EI T+ +EI+  +  +Y KLY  K  NL+ + +  +   +   S  ++E +    S  EI   +Q++  +KSPGPDG T EF++ +  
Subjt:  KAFISMIEKKNGEILTNQNEIENEILNFYSKLY-EKDSNLRFVIEGPDWAPIDATSSSKLEDV---FSVEEIYKAVQDMGSQKSPGPDGMTGEFWKNYWN

Query:  ILKSDIVEMFQEFFHNGIINKRTNETYVCVIPKK-KKASKVRDYRPISLVTSPYKLIAKVLAERLKKVLPNTI--CQAAFVQGRQILDAILVATEAVEDI
         L   ++ +FQ     GI+     E  + +IPK  K  ++  +YRPISL+    K++ K+L  R+++ +   I   Q  F+ G Q    I  +   ++ I
Subjt:  ILKSDIVEMFQEFFHNGIINKRTNETYVCVIPKK-KKASKVRDYRPISLVTSPYKLIAKVLAERLKKVLPNTI--CQAAFVQGRQILDAILVATEAVEDI

Query:  -RLRKEKGFLLKLDLEKAYDMVGWDFLDGMLELKGFGNKWRRWIKGCLNNTNFSILINGRPRGKIMASRGLRQGDP
         +L+ +   +L +D EKA+D +   F+   L+  G    + + I+   +    +I++NG          G RQG P
Subjt:  -RLRKEKGFLLKLDLEKAYDMVGWDFLDGMLELKGFGNKWRRWIKGCLNNTNFSILINGRPRGKIMASRGLRQGDP

P0C2F6 Putative ribonuclease H protein At1g657504.4e-1127.36Show/hide
Query:  AKSAGSNLAKWEWSSLPRLHGGLGVSSLAQRNFSFMTKWLWRFTQDKNSLWRQIIISIYGASGWKTTPFRGKKG--NKIWVDIAANYPTFLKF-TKFIVN
        A+    +L KW     P+  GGLGV +    N + ++K  WR  Q+KNSLW  ++   Y     + + +   KG  +  W  IA      +     +I  
Subjt:  AKSAGSNLAKWEWSSLPRLHGGLGVSSLAQRNFSFMTKWLWRFTQDKNSLWRQIIISIYGASGWKTTPFRGKKG--NKIWVDIAANYPTFLKF-TKFIVN

Query:  NGSSIKFWKDRWCDNQPLQALFPDLYLISERKDAVVA-DCWEDNNQAWNLAFRRGLFDRELTSWMTLVEKLSLIRLNNG-HDEICWTLEGSGSYTASSMF
        +G  I+FW DRW   +PL  L  D        D VVA D W    + W+ A      D   T+   L  +  ++ L  G  D + W     G ++  S +
Subjt:  NGSSIKFWKDRWCDNQPLQALFPDLYLISERKDAVVA-DCWEDNNQAWNLAFRRGLFDRELTSWMTLVEKLSLIRLNNG-HDEICWTLEGSGSYTASSMF

Query:  Q
        +
Subjt:  Q

P11369 LINE-1 retrotransposable element ORF2 protein4.6e-2429.08Show/hide
Query:  RSKAFISMIEKKNGEILTNQNEIENEILNFYSKLYE-KDSNLRFVIEGPDWAPIDATSSSKLEDV---FSVEEIYKAVQDMGSQKSPGPDGMTGEFWKNY
        R K  I+ I  + G+I T+  EI+N I +FY +LY  K  NL  + +  D   +   +  +++ +    S +EI   +  + ++KSPGPDG + EF++ +
Subjt:  RSKAFISMIEKKNGEILTNQNEIENEILNFYSKLYE-KDSNLRFVIEGPDWAPIDATSSSKLEDV---FSVEEIYKAVQDMGSQKSPGPDGMTGEFWKNY

Query:  WNILKSDIVEMFQEFFH----NGIINKRTNETYVCVIPK-KKKASKVRDYRPISLVTSPYKLIAKVLAERLKKVLPNTI--CQAAFVQGRQILDAILVAT
            K D++ +  + FH     G +     E  + +IPK +K  +K+ ++RPISL+    K++ K+LA R+++ +   I   Q  F+ G Q    I  + 
Subjt:  WNILKSDIVEMFQEFFH----NGIINKRTNETYVCVIPK-KKKASKVRDYRPISLVTSPYKLIAKVLAERLKKVLPNTI--CQAAFVQGRQILDAILVAT

Query:  EAVEDIRLRKEKG-FLLKLDLEKAYDMVGWDFLDGMLELKGFGNKWRRWIKGCLNNTNFSILINGRPRGKIMASRGLRQGDP
          +  I   K+K   ++ LD EKA+D +   F+  +LE  G    +   IK   +    +I +NG     I    G RQG P
Subjt:  EAVEDIRLRKEKG-FLLKLDLEKAYDMVGWDFLDGMLELKGFGNKWRRWIKGCLNNTNFSILINGRPRGKIMASRGLRQGDP

P14381 Transposon TX1 uncharacterized 149 kDa protein8.9e-2825.09Show/hide
Query:  IAGDFNMVRWSEERLNVTNSTRSMRKF--NSFIDSELIDPPMSNGP----FSWSRL-GTRMAASRIDR-FLISHLWNDTFPELRVERLQRPTSDHFPIMM
        I GDFN    + +R NV     S             L+D      P    F++ R+    ++ SRIDR ++ SHL +    +    RL  P SDH  + +
Subjt:  IAGDFNMVRWSEERLNVTNSTRSMRKF--NSFIDSELIDPPMSNGP----FSWSRL-GTRMAASRIDR-FLISHLWNDTFPELRVERLQRPTSDHFPIMM

Query:  SIGVVKWGPTP--FRFENLWIDMPDFKEKIEKWWGDLKPIGWAGFR----------MMEKLKGLKALIKDWNKELRTKAISKKTNILREIEII--DKLED
         + +    P    + F N  ++   F + +   W      GW  F+           + K+  LK L +++     TK++S + N   EIE +  + L+ 
Subjt:  SIGVVKWGPTP--FRFENLWIDMPDFKEKIEKWWGDLKPIGWAGFR----------MMEKLKGLKALIKDWNKELRTKAISKKTNILREIEII--DKLED

Query:  QNAIDASQIE----ERKNLKAELLDLTIAEQRSINKKCKLKWVQEGDENSSFFHKWASAMRSKAFISMIEKKNGEILTNQNEIENEILNFYSKLYEKDSN
        +  +  S+ +    E    K  L ++   + R    + +++ + + D  S FF+       ++  I+ +  ++G  L +   I +   +FY  L+  D  
Subjt:  QNAIDASQIE----ERKNLKAELLDLTIAEQRSINKKCKLKWVQEGDENSSFFHKWASAMRSKAFISMIEKKNGEILTNQNEIENEILNFYSKLYEKDSN

Query:  LRFVIEGP-DWAP-IDATSSSKLEDVFSVEEIYKAVQDMGSQKSPGPDGMTGEFWKNYWNILKSDIVEMFQEFFHNGIINKRTNETYVCVIPKKKKASKV
             E   D  P +      +LE   +++E+ +A++ M   KSPG DG+T EF++ +W+ L  D   +  E F  G +        + ++PKK     +
Subjt:  LRFVIEGP-DWAP-IDATSSSKLEDVFSVEEIYKAVQDMGSQKSPGPDGMTGEFWKNYWNILKSDIVEMFQEFFHNGIINKRTNETYVCVIPKKKKASKV

Query:  RDYRPISLVTSPYKLIAKVLAERLKKVLPNTI--CQAAFVQGRQILDAILVATEAVEDIRLRKEKGFLLKLDLEKAYDMVGWDFLDGMLELKGFGNKWRR
        +++RP+SL+++ YK++AK ++ RLK VL   I   Q+  V GR I D + +  + +   R        L LD EKA+D V   +L G L+   FG ++  
Subjt:  RDYRPISLVTSPYKLIAKVLAERLKKVLPNTI--CQAAFVQGRQILDAILVATEAVEDIRLRKEKGFLLKLDLEKAYDMVGWDFLDGMLELKGFGNKWRR

Query:  WIKGCLNNTNFSILINGRPRGKIMASRGLRQGDP
        ++K    +    + IN      +   RG+RQG P
Subjt:  WIKGCLNNTNFSILINGRPRGKIMASRGLRQGDP

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein8.8e-3127Show/hide
Query:  IAGDFNMVRWSEERLNVTNSTRSMRKFNSF----IDSELIDPPMSNGPFSWS-RLGTRMAASRIDRFLISHLWNDTFPELRVERLQRPTSDHFP-IMMSI
        + GDF+ +  + +  +V  ++  MR    F     DS+L+D P     ++WS          ++DR + +  W  +FP           SDH P I++  
Subjt:  IAGDFNMVRWSEERLNVTNSTRSMRKFNSF----IDSELIDPPMSNGPFSWS-RLGTRMAASRIDRFLISHLWNDTFPELRVERLQRPTSDHFP-IMMSI

Query:  GVVKWGPTPFRFENLWIDMPDFKEKIEKWWGDLKPIGWAGFRMMEKLKGLKALIKDWNKELRTKAISKKTNILREIEIIDKLEDQNAIDA-SQIEERKNL
         + K     FR+ +     P F   +   W +  P+G   F + E LK  K   K  N++       K    L  +E I      N  D+  ++E     
Subjt:  GVVKWGPTPFRFENLWIDMPDFKEKIEKWWGDLKPIGWAGFRMMEKLKGLKALIKDWNKELRTKAISKKTNILREIEIIDKLEDQNAIDA-SQIEERKNL

Query:  KAELLDLTIAEQRSINKKCKLKWVQEGDENSSFFHKWASAMRSKAFISMIEKKNGEILTNQNEIENEILNFYSKLYEKDSNL---RFVIEGPDWAPI--D
        K        A +    +K ++KW+Q+GD N+ FFHK   A ++K  I  +   +   + N  +++  I+ +Y+ L   DS++     V    D  P   +
Subjt:  KAELLDLTIAEQRSINKKCKLKWVQEGDENSSFFHKWASAMRSKAFISMIEKKNGEILTNQNEIENEILNFYSKLYEKDSNL---RFVIEGPDWAPI--D

Query:  ATSSSKLEDVFSVEEIYKAVQDMGSQKSPGPDGMTGEFWKNYWNILKSDIVEMFQEFFHNGIINKRTNETYVCVIPKKKKASKVRDYRPISLVTSPYKLI
         T +S+L  + S +EI  AV  M   K+PGPD  T EF+   W ++K   +   +EFF  G + KR N T + +IPK     ++  +RP+S  T  YK+I
Subjt:  ATSSSKLEDVFSVEEIYKAVQDMGSQKSPGPDGMTGEFWKNYWNILKSDIVEMFQEFFHNGIINKRTNETYVCVIPKKKKASKVRDYRPISLVTSPYKLI

AT2G02520.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.1e-0423.85Show/hide
Query:  VYPPRPAGFVSSRGENLDHLFIHCEFAKKVWFFVANLFGLSFYLPNKIDD---WLMEGLAGWNLRKKAKIMVGCAFRAALWLLWKERNSRTFEDKSASLE
        ++PP    F ++  E   HLF  CEFA++VW +  +   +  + P   +D   WL          K    ++  +  A+++ +WKERN+R  +  S    
Subjt:  VYPPRPAGFVSSRGENLDHLFIHCEFAKKVWFFVANLFGLSFYLPNKIDD---WLMEGLAGWNLRKKAKIMVGCAFRAALWLLWKERNSRTFEDKSASLE

Query:  FFCDNVQNTASCWILNFKIYERDGSFPTSF
             +++   C +      +R G    SF
Subjt:  FFCDNVQNTASCWILNFKIYERDGSFPTSF

AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.8e-0424.26Show/hide
Query:  HDEICWTLEGSGSYTASSMFQNIAQKNSEAGVYPPRPAGFVSSRGENLDHLFIHCEFAKKVW-FFVANLFGLSFYLPNKIDDWLMEGLAGWNLRKKAKIM
        H  ICW +  +  +T   +           G+  P      ++  ++  HLF  C+F+  VW FF A+    +   P ++ D L   L   +  K   ++
Subjt:  HDEICWTLEGSGSYTASSMFQNIAQKNSEAGVYPPRPAGFVSSRGENLDHLFIHCEFAKKVW-FFVANLFGLSFYLPNKIDDWLMEGLAGWNLRKKAKIM

Query:  VGCAFRAALWLLWKERNSRTFEDKSASLEFFCDNVQ
        +  AF + ++ +W+ERN R     S S E    ++Q
Subjt:  VGCAFRAALWLLWKERNSRTFEDKSASLEFFCDNVQ

AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.5e-0946.91Show/hide
Query:  LAERLKKVLPNTI--CQAAFVQGRQILDAILVATEAVEDIRLRK-EKGF-LLKLDLEKAYDMVGWDFLDGMLELKGFGNKW
        + ERLK ++ N I   QA+F+ GR   D I+   EAV  +R +K  KG+ LLKLDLEKAYD + WD+L+  L   GF   W
Subjt:  LAERLKKVLPNTI--CQAAFVQGRQILDAILVATEAVEDIRLRK-EKGF-LLKLDLEKAYDMVGWDFLDGMLELKGFGNKW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCAAGCTCATGCAAGCTCTACTTTTCTTGGGAGGAGCATGGAGAGTCACTTGTAGATCTCTAATAGATAAGGAAATTGAGAAATGTGCTTTATTTGCAAAACTGGA
TCACGAATTGACAGATGAGATTGGAGAACTTGAGAAACAAAAGGATCATCTGGAAGCTGAATTGAAAAAGGGTCTGTGTCAAGGTAATTGGTGTATTGCCGGGGATTTTA
ACATGGTTAGATGGTCAGAGGAAAGGCTGAATGTCACCAATTCGACAAGAAGCATGAGAAAATTTAATAGCTTCATAGACTCTGAGCTTATTGATCCCCCTATGAGTAAT
GGGCCCTTTTCTTGGTCAAGATTGGGTACTAGAATGGCTGCCTCTAGAATTGATAGGTTCCTTATCTCTCATTTGTGGAATGATACTTTCCCGGAGCTTAGAGTTGAGAG
ACTGCAAAGACCCACGTCGGACCACTTCCCCATCATGATGTCCATTGGGGTGGTGAAATGGGGGCCTACTCCCTTCAGATTTGAAAATTTGTGGATTGATATGCCAGATT
TTAAAGAGAAGATAGAGAAGTGGTGGGGAGACTTAAAACCTATCGGCTGGGCTGGTTTCAGAATGATGGAAAAGCTAAAGGGGCTGAAAGCCTTAATTAAAGACTGGAAC
AAAGAGCTTAGGACTAAGGCTATAAGTAAGAAAACGAATATCTTGAGGGAGATTGAGATTATAGATAAGCTGGAGGATCAGAACGCTATAGATGCTTCCCAGATTGAGGA
AAGGAAAAATCTTAAGGCAGAGCTTTTAGATTTGACCATAGCTGAGCAGAGAAGTATTAATAAAAAATGCAAATTAAAATGGGTACAAGAGGGGGATGAAAACTCCAGCT
TCTTTCATAAATGGGCTTCGGCCATGAGATCCAAGGCTTTCATCTCTATGATTGAAAAAAAGAATGGTGAGATCCTTACCAATCAAAACGAGATTGAAAATGAGATTTTG
AACTTTTATAGCAAGCTTTATGAGAAAGACAGCAATCTTAGATTTGTGATTGAAGGTCCAGATTGGGCTCCTATTGATGCTACCAGCAGCAGCAAGCTGGAAGATGTATT
CAGTGTAGAGGAAATCTATAAAGCAGTTCAGGATATGGGTAGCCAAAAATCCCCGGGTCCGGATGGCATGACGGGGGAATTTTGGAAAAATTATTGGAACATTTTGAAGT
CCGATATAGTAGAGATGTTCCAAGAATTTTTTCACAATGGTATTATCAATAAAAGGACCAATGAAACGTATGTTTGCGTTATTCCCAAGAAGAAAAAAGCCTCCAAAGTC
AGAGATTATAGGCCTATAAGCCTAGTCACTTCCCCTTATAAATTGATAGCTAAAGTGCTCGCTGAGCGATTGAAGAAGGTCCTCCCAAATACTATCTGCCAAGCGGCTTT
TGTGCAAGGTAGGCAAATTCTCGATGCTATTTTAGTGGCTACCGAGGCCGTGGAAGACATAAGGTTGAGAAAAGAGAAAGGATTCCTCCTCAAGCTAGATTTAGAGAAGG
CCTACGATATGGTGGGATGGGATTTCCTTGATGGGATGTTAGAGTTGAAGGGATTTGGCAATAAATGGAGAAGGTGGATCAAAGGCTGCCTTAATAATACCAATTTCTCC
ATTCTCATTAATGGGCGTCCTAGGGGAAAGATCATGGCTTCTAGAGGTTTGAGGCAAGGGGATCCCTTTCCCCGTTCTTGTTCACGCTTGTGGGTGACGCTATCAGCAAG
TCAATCCATTATTGATGTGGCCAAATGGTGGGAAATTCTCAGGATGATAATGAGGGGCTCGGGTTTGACTCTGAATTTGGCCAAATCTGCGGGTTCTAATTTGGCTAAAT
GGGAGTGGTCCTCGTTGCCGCGTCTTCATGGTGGGTTGGGAGTTAGCTCTCTAGCTCAAAGAAACTTCTCCTTTATGACCAAATGGCTTTGGAGATTCACTCAAGACAAG
AATTCCTTGTGGAGACAAATCATCATTAGTATATATGGTGCTTCCGGGTGGAAAACTACTCCCTTTAGAGGCAAAAAAGGGAACAAAATCTGGGTTGATATAGCTGCAAA
TTATCCGACATTTCTCAAATTTACCAAGTTCATTGTTAATAATGGGTCGAGTATTAAGTTTTGGAAGGATAGGTGGTGTGACAACCAACCTCTCCAAGCCTTATTTCCAG
ATCTCTACTTGATTTCTGAGAGAAAAGATGCAGTTGTTGCTGATTGTTGGGAAGATAATAACCAAGCTTGGAATTTGGCTTTTAGAAGAGGCTTATTTGATAGGGAATTA
ACCAGCTGGATGACCCTTGTGGAGAAGCTTAGCTTGATTAGGCTGAATAATGGCCATGATGAAATTTGTTGGACTCTTGAAGGGTCGGGCAGCTATACAGCAAGTTCTAT
GTTCCAGAATATCGCCCAAAAAAACTCAGAAGCTGGTGTTTATCCCCCTCGGCCTGCAGGTTTTGTCTCCTCTCGGGGGGAAAACTTAGACCATTTATTCATCCATTGTG
AGTTTGCAAAGAAGGTCTGGTTTTTTGTGGCTAATCTCTTCGGTTTGTCTTTCTACCTTCCCAACAAGATTGATGATTGGCTCATGGAAGGGCTGGCAGGATGGAACCTG
AGGAAAAAAGCCAAGATTATGGTTGGTTGTGCTTTTAGGGCTGCTCTTTGGCTTTTGTGGAAGGAGAGGAACTCTAGAACCTTTGAAGACAAGTCGGCTAGCCTTGAGTT
CTTTTGTGATAATGTCCAAAATACTGCTTCGTGTTGGATTCTCAATTTCAAGATATATGAAAGAGATGGCAGTTTTCCCACCTCATTTAATCTTGATACTTGTTCTTGTC
CTGCTAGACGAGTACTTCAGGGGGTCGATGAAAAGATTATGTACGTTTCTCCATTTGCAAGACTTGCAGATTCAGTTGCTCAAGTTTGGAATTCAGAATCTAAAGCATGG
GATTTGGGCCTCCGTCGAAATTTAAATGAGGAGGAGATTATGGAATGGGCCAATCTCTCAGATCTCCTATCTGCAGTCATCATTAGAAATGTTAAGATTCTTGGTCATGG
CCTCTTGACTCTTCTAATGCTTTTACCGTCAAATCTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGACCAAGCTCATGCAAGCTCTACTTTTCTTGGGAGGAGCATGGAGAGTCACTTGTAGATCTCTAATAGATAAGGAAATTGAGAAATGTGCTTTATTTGCAAAACTGGA
TCACGAATTGACAGATGAGATTGGAGAACTTGAGAAACAAAAGGATCATCTGGAAGCTGAATTGAAAAAGGGTCTGTGTCAAGGTAATTGGTGTATTGCCGGGGATTTTA
ACATGGTTAGATGGTCAGAGGAAAGGCTGAATGTCACCAATTCGACAAGAAGCATGAGAAAATTTAATAGCTTCATAGACTCTGAGCTTATTGATCCCCCTATGAGTAAT
GGGCCCTTTTCTTGGTCAAGATTGGGTACTAGAATGGCTGCCTCTAGAATTGATAGGTTCCTTATCTCTCATTTGTGGAATGATACTTTCCCGGAGCTTAGAGTTGAGAG
ACTGCAAAGACCCACGTCGGACCACTTCCCCATCATGATGTCCATTGGGGTGGTGAAATGGGGGCCTACTCCCTTCAGATTTGAAAATTTGTGGATTGATATGCCAGATT
TTAAAGAGAAGATAGAGAAGTGGTGGGGAGACTTAAAACCTATCGGCTGGGCTGGTTTCAGAATGATGGAAAAGCTAAAGGGGCTGAAAGCCTTAATTAAAGACTGGAAC
AAAGAGCTTAGGACTAAGGCTATAAGTAAGAAAACGAATATCTTGAGGGAGATTGAGATTATAGATAAGCTGGAGGATCAGAACGCTATAGATGCTTCCCAGATTGAGGA
AAGGAAAAATCTTAAGGCAGAGCTTTTAGATTTGACCATAGCTGAGCAGAGAAGTATTAATAAAAAATGCAAATTAAAATGGGTACAAGAGGGGGATGAAAACTCCAGCT
TCTTTCATAAATGGGCTTCGGCCATGAGATCCAAGGCTTTCATCTCTATGATTGAAAAAAAGAATGGTGAGATCCTTACCAATCAAAACGAGATTGAAAATGAGATTTTG
AACTTTTATAGCAAGCTTTATGAGAAAGACAGCAATCTTAGATTTGTGATTGAAGGTCCAGATTGGGCTCCTATTGATGCTACCAGCAGCAGCAAGCTGGAAGATGTATT
CAGTGTAGAGGAAATCTATAAAGCAGTTCAGGATATGGGTAGCCAAAAATCCCCGGGTCCGGATGGCATGACGGGGGAATTTTGGAAAAATTATTGGAACATTTTGAAGT
CCGATATAGTAGAGATGTTCCAAGAATTTTTTCACAATGGTATTATCAATAAAAGGACCAATGAAACGTATGTTTGCGTTATTCCCAAGAAGAAAAAAGCCTCCAAAGTC
AGAGATTATAGGCCTATAAGCCTAGTCACTTCCCCTTATAAATTGATAGCTAAAGTGCTCGCTGAGCGATTGAAGAAGGTCCTCCCAAATACTATCTGCCAAGCGGCTTT
TGTGCAAGGTAGGCAAATTCTCGATGCTATTTTAGTGGCTACCGAGGCCGTGGAAGACATAAGGTTGAGAAAAGAGAAAGGATTCCTCCTCAAGCTAGATTTAGAGAAGG
CCTACGATATGGTGGGATGGGATTTCCTTGATGGGATGTTAGAGTTGAAGGGATTTGGCAATAAATGGAGAAGGTGGATCAAAGGCTGCCTTAATAATACCAATTTCTCC
ATTCTCATTAATGGGCGTCCTAGGGGAAAGATCATGGCTTCTAGAGGTTTGAGGCAAGGGGATCCCTTTCCCCGTTCTTGTTCACGCTTGTGGGTGACGCTATCAGCAAG
TCAATCCATTATTGATGTGGCCAAATGGTGGGAAATTCTCAGGATGATAATGAGGGGCTCGGGTTTGACTCTGAATTTGGCCAAATCTGCGGGTTCTAATTTGGCTAAAT
GGGAGTGGTCCTCGTTGCCGCGTCTTCATGGTGGGTTGGGAGTTAGCTCTCTAGCTCAAAGAAACTTCTCCTTTATGACCAAATGGCTTTGGAGATTCACTCAAGACAAG
AATTCCTTGTGGAGACAAATCATCATTAGTATATATGGTGCTTCCGGGTGGAAAACTACTCCCTTTAGAGGCAAAAAAGGGAACAAAATCTGGGTTGATATAGCTGCAAA
TTATCCGACATTTCTCAAATTTACCAAGTTCATTGTTAATAATGGGTCGAGTATTAAGTTTTGGAAGGATAGGTGGTGTGACAACCAACCTCTCCAAGCCTTATTTCCAG
ATCTCTACTTGATTTCTGAGAGAAAAGATGCAGTTGTTGCTGATTGTTGGGAAGATAATAACCAAGCTTGGAATTTGGCTTTTAGAAGAGGCTTATTTGATAGGGAATTA
ACCAGCTGGATGACCCTTGTGGAGAAGCTTAGCTTGATTAGGCTGAATAATGGCCATGATGAAATTTGTTGGACTCTTGAAGGGTCGGGCAGCTATACAGCAAGTTCTAT
GTTCCAGAATATCGCCCAAAAAAACTCAGAAGCTGGTGTTTATCCCCCTCGGCCTGCAGGTTTTGTCTCCTCTCGGGGGGAAAACTTAGACCATTTATTCATCCATTGTG
AGTTTGCAAAGAAGGTCTGGTTTTTTGTGGCTAATCTCTTCGGTTTGTCTTTCTACCTTCCCAACAAGATTGATGATTGGCTCATGGAAGGGCTGGCAGGATGGAACCTG
AGGAAAAAAGCCAAGATTATGGTTGGTTGTGCTTTTAGGGCTGCTCTTTGGCTTTTGTGGAAGGAGAGGAACTCTAGAACCTTTGAAGACAAGTCGGCTAGCCTTGAGTT
CTTTTGTGATAATGTCCAAAATACTGCTTCGTGTTGGATTCTCAATTTCAAGATATATGAAAGAGATGGCAGTTTTCCCACCTCATTTAATCTTGATACTTGTTCTTGTC
CTGCTAGACGAGTACTTCAGGGGGTCGATGAAAAGATTATGTACGTTTCTCCATTTGCAAGACTTGCAGATTCAGTTGCTCAAGTTTGGAATTCAGAATCTAAAGCATGG
GATTTGGGCCTCCGTCGAAATTTAAATGAGGAGGAGATTATGGAATGGGCCAATCTCTCAGATCTCCTATCTGCAGTCATCATTAGAAATGTTAAGATTCTTGGTCATGG
CCTCTTGACTCTTCTAATGCTTTTACCGTCAAATCTCTGA
Protein sequenceShow/hide protein sequence
MTKLMQALLFLGGAWRVTCRSLIDKEIEKCALFAKLDHELTDEIGELEKQKDHLEAELKKGLCQGNWCIAGDFNMVRWSEERLNVTNSTRSMRKFNSFIDSELIDPPMSN
GPFSWSRLGTRMAASRIDRFLISHLWNDTFPELRVERLQRPTSDHFPIMMSIGVVKWGPTPFRFENLWIDMPDFKEKIEKWWGDLKPIGWAGFRMMEKLKGLKALIKDWN
KELRTKAISKKTNILREIEIIDKLEDQNAIDASQIEERKNLKAELLDLTIAEQRSINKKCKLKWVQEGDENSSFFHKWASAMRSKAFISMIEKKNGEILTNQNEIENEIL
NFYSKLYEKDSNLRFVIEGPDWAPIDATSSSKLEDVFSVEEIYKAVQDMGSQKSPGPDGMTGEFWKNYWNILKSDIVEMFQEFFHNGIINKRTNETYVCVIPKKKKASKV
RDYRPISLVTSPYKLIAKVLAERLKKVLPNTICQAAFVQGRQILDAILVATEAVEDIRLRKEKGFLLKLDLEKAYDMVGWDFLDGMLELKGFGNKWRRWIKGCLNNTNFS
ILINGRPRGKIMASRGLRQGDPFPRSCSRLWVTLSASQSIIDVAKWWEILRMIMRGSGLTLNLAKSAGSNLAKWEWSSLPRLHGGLGVSSLAQRNFSFMTKWLWRFTQDK
NSLWRQIIISIYGASGWKTTPFRGKKGNKIWVDIAANYPTFLKFTKFIVNNGSSIKFWKDRWCDNQPLQALFPDLYLISERKDAVVADCWEDNNQAWNLAFRRGLFDREL
TSWMTLVEKLSLIRLNNGHDEICWTLEGSGSYTASSMFQNIAQKNSEAGVYPPRPAGFVSSRGENLDHLFIHCEFAKKVWFFVANLFGLSFYLPNKIDDWLMEGLAGWNL
RKKAKIMVGCAFRAALWLLWKERNSRTFEDKSASLEFFCDNVQNTASCWILNFKIYERDGSFPTSFNLDTCSCPARRVLQGVDEKIMYVSPFARLADSVAQVWNSESKAW
DLGLRRNLNEEEIMEWANLSDLLSAVIIRNVKILGHGLLTLLMLLPSNL