CuGenDBv2

Lag0017522 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0017522
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr5:4797420..4802963
RNA-Seq Expression: Lag0017522
Synteny: Lag0017522
Gene Ontology terms: GO:0050789 - regulation of biological process (biological process)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0039770.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]; e value: 7.9e-192; % identity: 37.32
Query:  VKSRLDRFLYSPNWALKFQDHHTRRLGRVTSDHFPIVLENPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWESTYQEGFPGYSFIRRLKQLVSSIKLWKRN
        V SR+DRFLY+ NW   F  H+++ L RVTSDHFPIVLE+    WGP PF+  N  L E  F  N+  WW++  QEG PG+SF+R+LKQL + I+  +R 
Subjt:  VKSRLDRFLYSPNWALKFQDHHTRRLGRVTSDHFPIVLENPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWESTYQEGFPGYSFIRRLKQLVSSIKLWKRN

Query:  NRDLLKSRKQIISDEIAKIDRLENLGGLTDQMCQKRTSLKSDLHQVSMQEIRLWKQRCKKTWLKEGDENTTFFHK-------------------------
        N+      K     EI  IDRLE  G L++++  +RT LK+D+     +E ++W Q+ K+ W+ EGDENT+FFHK                         
Subjt:  NRDLLKSRKQIISDEIAKIDRLENLGGLTDQMCQKRTSLKSDLHQVSMQEIRLWKQRCKKTWLKEGDENTTFFHK-------------------------

Query:  -----------------GPTSEGWMVSNLNWCPISGPEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFHDFFQK-----
                         G     W++ NLNW PIS  +A +L   F+E E+ + L A  +NKSPGPDGFT+EF+K +W  ++  ++ +F DF        
Subjt:  -----------------GPTSEGWMVSNLNWCPISGPEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFHDFFQK-----

Query:  --------------------DFRPISLTTVLYRLIAKTLAERLK-----------------------------TIDFWKCSCTRGFVIKLDIEKAFEKIC
                            D+RPISLTT +Y+LIAK +AERLK                              ID+W+    +GFVIKLDIEKAF+K+ 
Subjt:  --------------------DFRPISLTTVLYRLIAKTLAERLK-----------------------------TIDFWKCSCTRGFVIKLDIEKAFEKIC

Query:  WNFVDKILAFKGYPITW--------------------PKRQNQGRKRHSSRRSDLPFYFVLAMDYLSRILQSVEQKGLVKGCSL-NSVSVSHLLFADDIL
        W F+D +L  KGYP  W                    P+ + Q  +         PF FVLAMDY+SR+L SV +K  +KG  L  +++++HLLFADDIL
Subjt:  WNFVDKILAFKGYPITW--------------------PKRQNQGRKRHSSRRSDLPFYFVLAMDYLSRILQSVEQKGLVKGCSL-NSVSVSHLLFADDIL

Query:  LFVQDNDAMLGNLFNIIKVFELSSGLNINFNKSSITGINVEDSRVAQIAANWGCPTTQFPIPYLGSPLGGNPSSSPFWANTIDKIHRKLDSWRYSYISKG
        LFV+D++  + NL NII +F+L+SGL+IN NKS+I+ INV+ SR  QIA+ WG  T   PI YLG PLGG   +  FW N  +KI++KL SW+YS +SKG
Subjt:  LFVQDNDAMLGNLFNIIKVFELSSGLNINFNKSSITGINVEDSRVAQIAANWGCPTTQFPIPYLGSPLGGNPSSSPFWANTIDKIHRKLDSWRYSYISKG

Query:  GRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGQDQSSSIPLVSWDKVAAPIEAGGLGLFKTRVTNSAFQVKWLW------SSFMRRLL--
        G++TLI+++L+ +P Y LSIFKAP S C +I+K  R+FLW    ++  + LV+W K+ +  E GGLG+ + + TN A   KWLW      S   ++++  
Subjt:  GRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGQDQSSSIPLVSWDKVAAPIEAGGLGLFKTRVTNSAFQVKWLW------SSFMRRLL--

Query:  -------------CGSESFQQNIQPIDRELSQLRRDSLPPERHGHQFSSKPLFFWQIH-------------------RKNMTVSDAWDHTTSSWKLYPRR
                     C   S +     I + L   +R      ++G  FS     FW  H                    K  ++ D W++T   W L PRR
Subjt:  -------------CGSESFQQNIQPIDRELSQLRRDSLPPERHGHQFSSKPLFFWQIH-------------------RKNMTVSDAWDHTTSSWKLYPRR

Query:  PLFDREAFAQAECASSFPIPALSGGADHMLWNPDSKGRFSVASARQFYWNLSSPDR---PQPHSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQRI
         L + E    AE  +S        G D  +W  +S G ++VAS ++    L  P++          F NLWK  IPKK  FFIWT++Y  +NT ++L + 
Subjt:  PLFDREAFAQAECASSFPIPALSGGADHMLWNPDSKGRFSVASARQFYWNLSSPDR---PQPHSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQRI

Query:  FKSNALNPSCCPLCHAGSDELDHIFIHCGVASDIWRSF--HLATGINMPIPRQVNWICLETFAAKATSQREILIQSMAAAILWVIWGERNRRIFQNTSRT
          +    PS C +C    ++  H+FI C +A  IWRS   HL++ +N   P++   +C+   + K  +++ +++ +  A+ LW IW ERN RIF    +T
Subjt:  FKSNALNPSCCPLCHAGSDELDHIFIHCGVASDIWRSF--HLATGINMPIPRQVNWICLETFAAKATSQREILIQSMAAAILWVIWGERNRRIFQNTSRT

Query:  YINIWEDIITLASFWASSTKAFSNYQASSIALNWKAF
           IWEDI  LA  W S +  FSNYQASSIALN  AF
Subjt:  YINIWEDIITLASFWASSTKAFSNYQASSIALNWKAF

KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]; e value: 2.1e-184; % identity: 36.06
Query:  SRLDRFLYSPNWALKFQDHHTRRLGRVTSDHFPIVLENPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWESTYQEGFPGYSFIRRLKQLVSSIKLWKRNNR
        SRLDRFL++  W   F  H ++ L R TSDHFPIVLE+    WGP PFRF N  L + D+ KNI+FWW +T Q G+ GYSF+RRLKQL   IK W R+ +
Subjt:  SRLDRFLYSPNWALKFQDHHTRRLGRVTSDHFPIVLENPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWESTYQEGFPGYSFIRRLKQLVSSIKLWKRNNR

Query:  DLLKSRKQIISDEIAKIDRLENLGGLTDQMCQKRTSLKSDLHQVSMQEIRLWKQRCKKTWLKEGDENTTFFHKGPT------------------------
           ++ K+    EI +ID+LE  G  T+   +KRT+LK+DL Q+++ E ++W Q+CK+ W+ EGDEN++FFHK  T                        
Subjt:  DLLKSRKQIISDEIAKIDRLENLGGLTDQMCQKRTSLKSDLHQVSMQEIRLWKQRCKKTWLKEGDENTTFFHKGPT------------------------

Query:  -----------------SEGWMVSNLNWCPISGPEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFHDFFQ---------
                         +    + NL+WCPIS   +  L +PF+E E++  LK+   NK+PGPDG+ ++F +KSW  ++ ++ ++F DF           
Subjt:  -----------------SEGWMVSNLNWCPISGPEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFHDFFQ---------

Query:  ----------------KDFRPISLTTVLYRLIAKTLAERLK-----------------------------TIDFWKCSCTRGFVIKLDIEKAFEKICWNF
                         DFRPISLTT +Y+LIAKTLA+RLK                              +DFW+    RGFVIKLDIEKAF+K+ W F
Subjt:  ----------------KDFRPISLTTVLYRLIAKTLAERLK-----------------------------TIDFWKCSCTRGFVIKLDIEKAFEKICWNF

Query:  VDKILAFKGYPITWPKR----------------QNQGRKRHSS--RRSD--LPFYFVLAMDYLSRILQSVEQKGLVKGCSLN-SVSVSHLLFADDILLFV
        +D +L  K Y   W K                 + +GR + S   R+ D   PF FVLAMDYLSR+L ++  K  + G   + +++++H+LFADDIL+FV
Subjt:  VDKILAFKGYPITWPKR----------------QNQGRKRHSS--RRSD--LPFYFVLAMDYLSRILQSVEQKGLVKGCSLN-SVSVSHLLFADDILLFV

Query:  QDNDAMLGNLFNIIKVFELSSGLNINFNKSSITGINVEDSRVAQIAANWGCPTTQFPIPYLGSPLGGNPSSSPFWANTIDKIHRKLDSWRYSYISKGGRL
        +D D  + NL  I+ +FE +SGLNIN +KS+I  INV   R   IA +WG      P  YLG PLGG PSSS FW N + KI +KL +W+YS +SKGGR+
Subjt:  QDNDAMLGNLFNIIKVFELSSGLNINFNKSSITGINVEDSRVAQIAANWGCPTTQFPIPYLGSPLGGNPSSSPFWANTIDKIHRKLDSWRYSYISKGGRL

Query:  TLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGQDQSSSIPLVSWDKVAAPIEAGGLGLFKTRVTNSAFQVKWLW------SSFMRRLLCGS--
        TLI +TL  +P Y +S+FK P+ +   I+   R+FLW+G     +I L+ W+++ +P E GGLG+     TN A   KWLW          +RL+     
Subjt:  TLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGQDQSSSIPLVSWDKVAAPIEAGGLGLFKTRVTNSAFQVKWLW------SSFMRRLLCGS--

Query:  ----------ESFQQNIQP---IDRELSQLRRDSLPPERHGHQFS--------------SKPLFFWQIHRKNMTVSDAWDHTTSSWKLYPRRPLFDREAF
                    F  N  P   +   +S   ++       G   S              + P  F     K  +V + W+ +++ W L+  RPL D E  
Subjt:  ----------ESFQQNIQP---IDRELSQLRRDSLPPERHGHQFS--------------SKPLFFWQIHRKNMTVSDAWDHTTSSWKLYPRRPLFDREAF

Query:  AQAECASSFPIPALSGGADHMLWNPDSKGRFSVASARQFYWNLSSPDRPQP-HSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQRIFKSNALNPSC
              +S P P  + G    LWN +S   F  AS ++      +P  P   H  ++  LWK   PKK KFFIWT+I+  +NT DRLQ+   +  L+P+ 
Subjt:  AQAECASSFPIPALSGGADHMLWNPDSKGRFSVASARQFYWNLSSPDRPQP-HSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQRIFKSNALNPSC

Query:  CPLCHAGSDELDHIFIHCGVASDIWRSFHLATGINMPIPRQVNWICLETFAAKATSQREILIQSMAAAILWVIWGERNRRIFQNTSRTYINIWEDIITLA
        C +C+   ++++H+FIHC  +  +W         N   P  V  +     +    +Q+ ++  +  A ILW IW ERN RIF+   +   ++WED +   
Subjt:  CPLCHAGSDELDHIFIHCGVASDIWRSFHLATGINMPIPRQVNWICLETFAAKATSQREILIQSMAAAILWVIWGERNRRIFQNTSRTYINIWEDIITLA

Query:  SFWASSTKAFSNYQASSIALNWKAFL
          W+  +K FSNY   SIALN  AF+
Subjt:  SFWASSTKAFSNYQASSIALNWKAFL

KAA0041397.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]; e value: 2.6e-182; % identity: 35.71
Query:  SRLDRFLYSPNWALKFQDHHTRRLGRVTSDHFPIVLENPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWESTYQEGFPGYSFIRRLKQLVSSIKLWKRNNR
        SRLDRFL+SP W   F  H ++ L R TSDHFPIVLE+    WGP PFRF N  L + D+ +NI+FWW +T Q GF GYSF+RRLKQL   IK W +  +
Subjt:  SRLDRFLYSPNWALKFQDHHTRRLGRVTSDHFPIVLENPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWESTYQEGFPGYSFIRRLKQLVSSIKLWKRNNR

Query:  DLLKSRKQIISDEIAKIDRLENLGGLTDQMCQKRTSLKSDLHQVSMQEIRLWKQRCKKTWLKEGDENTTFFHKGPT------------------------
           +  K+    EI  ID+LE  G  T+    KR +LK+DL Q+++ E ++W Q+CK+ W+ EGDEN++FFHK  T                        
Subjt:  DLLKSRKQIISDEIAKIDRLENLGGLTDQMCQKRTSLKSDLHQVSMQEIRLWKQRCKKTWLKEGDENTTFFHKGPT------------------------

Query:  -----------------SEGWMVSNLNWCPISGPEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFHDFFQ---------
                         +    + NL+WCPIS      L +PF+E E++  LK+   NK+PGPDGFT++F +KSW  ++ ++ ++F DF           
Subjt:  -----------------SEGWMVSNLNWCPISGPEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFHDFFQ---------

Query:  ----------------KDFRPISLTTVLYRLIAKTLAERLK-----------------------------TIDFWKCSCTRGFVIKLDIEKAFEKICWNF
                         DFRPISLTT +Y+LIAK LA+RLK                              +DFW+    RGFVIKLDIEKAF+K+ W F
Subjt:  ----------------KDFRPISLTTVLYRLIAKTLAERLK-----------------------------TIDFWKCSCTRGFVIKLDIEKAFEKICWNF

Query:  VDKILAFKGYPITW------------------PKRQNQGRKRHSSRRSD--LPFYFVLAMDYLSRILQSVEQKGLVKGCSLN-SVSVSHLLFADDILLFV
        +D +L  K Y   W                   + + + +     R+ D   PF FVLAMDYLS +L ++ +KG + G +   +++++H+LFADDIL+FV
Subjt:  VDKILAFKGYPITW------------------PKRQNQGRKRHSSRRSD--LPFYFVLAMDYLSRILQSVEQKGLVKGCSLN-SVSVSHLLFADDILLFV

Query:  QDNDAMLGNLFNIIKVFELSSGLNINFNKSSITGINVEDSRVAQIAANWGCPTTQFPIPYLGSPLGGNPSSSPFWANTIDKIHRKLDSWRYSYISKGGRL
        +D +  + NL  I+ +FE +SGLNIN +KS+I  INV   R   I  +WG    Q P  YLG PLGG PSSS FW N + KI +KL SW+YS +SKGGR+
Subjt:  QDNDAMLGNLFNIIKVFELSSGLNINFNKSSITGINVEDSRVAQIAANWGCPTTQFPIPYLGSPLGGNPSSSPFWANTIDKIHRKLDSWRYSYISKGGRL

Query:  TLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGQDQSSSIPLVSWDKVAAPIEAGGLGLFKTRVTNSAFQVKWLW------SSFMRRLLCGS--
        TLI +TL  +P Y LS+FK P+ +   I+   R+FLW+G     +I L+ W++V +P E GGLG+     TN A   KWLW          +RL+     
Subjt:  TLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGQDQSSSIPLVSWDKVAAPIEAGGLGLFKTRVTNSAFQVKWLW------SSFMRRLLCGS--

Query:  ----------ESFQQNIQP---IDRELSQLRRDSLPPERHGHQFS--------------SKPLFFWQIHRKNMTVSDAWDHTTSSWKLYPRRPLFDREAF
                    +  N  P   +   +S   ++       G   S                P  F     K  +V D W+ +   W ++  RPL D E  
Subjt:  ----------ESFQQNIQP---IDRELSQLRRDSLPPERHGHQFS--------------SKPLFFWQIHRKNMTVSDAWDHTTSSWKLYPRRPLFDREAF

Query:  AQAECASSFPIPALSGGADHMLWNPDSKGRFSVASARQFYWNLSSPDRPQPHSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQRIFKSNALNPSCC
              +S P P    G    LW  +S   F  AS ++   + +S      H  ++  LWK   PKK KFFIWT+I+  +NT DRLQ+   +  L+P+ C
Subjt:  AQAECASSFPIPALSGGADHMLWNPDSKGRFSVASARQFYWNLSSPDRPQPHSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQRIFKSNALNPSCC

Query:  PLCHAGSDELDHIFIHCGVASDIWRSFHLATGINMPIPRQVNWICLETFAAKATSQREILIQSMAAAILWVIWGERNRRIFQNTSRTYINIWEDIITLAS
         +C+   ++++H+FIHC  +  +W         N   P  V  +     +    +Q+ ++  +  A +LW IW ERN RIF+   + + ++WEDI+    
Subjt:  PLCHAGSDELDHIFIHCGVASDIWRSFHLATGINMPIPRQVNWICLETFAAKATSQREILIQSMAAAILWVIWGERNRRIFQNTSRTYINIWEDIITLAS

Query:  FWASSTKAFSNYQASSIALNWKAFL
         W+  +K FSNY   SIALN  AF+
Subjt:  FWASSTKAFSNYQASSIALNWKAFL

KAA0046762.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]; e value: 8.8e-183; % identity: 35.95
Query:  SRLDRFLYSPNWALKFQDHHTRRLGRVTSDHFPIVLE--NPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWESTYQEGFPGYSFIRRLKQLVSSIKLWKRN
        SR+DRFLY+P+W   F  H TR L R TSDHFP+V E  NP+ RWGP PFR ++  L + DF +N+  WW+++ Q+G PGYSFI+RLK L + IK W++ 
Subjt:  SRLDRFLYSPNWALKFQDHHTRRLGRVTSDHFPIVLE--NPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWESTYQEGFPGYSFIRRLKQLVSSIKLWKRN

Query:  NRDLLKSRKQIISDEIAKIDRLENLGGLTDQMCQKRTSLKSDLHQVSMQEIRLWKQRCKKTWLKEGDENTTFFH--------------------------
          +   S K  I  E+  ID+ E    L+ +   +R +LK+DL ++S++E + W QR KK WL+EGDEN++FFH                          
Subjt:  NRDLLKSRKQIISDEIAKIDRLENLGGLTDQMCQKRTSLKSDLHQVSMQEIRLWKQRCKKTWLKEGDENTTFFH--------------------------

Query:  ---------------KGPT-SEGWMVSNLNWCPISGPEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFHDFFQ------
                       +GPT S+ + + NL W PI   E  +L  PF E E+   + ++   K+PGPDGF + FFK  W  ++  +M++F DF+       
Subjt:  ---------------KGPT-SEGWMVSNLNWCPISGPEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFHDFFQ------

Query:  -------------------KDFRPISLTTVLYRLIAKTLAERLKT-----------------------------IDFWKCSCTRGFVIKLDIEKAFEKIC
                           KDFRPISLTT +Y++IAKTL+ RLKT                             +DFWK    +GF++KLDIEKAF+ + 
Subjt:  -------------------KDFRPISLTTVLYRLIAKTLAERLKT-----------------------------IDFWKCSCTRGFVIKLDIEKAFEKIC

Query:  WNFVDKILAFKGYPITWPK------------------RQNQGRKRHSSRRSD--LPFYFVLAMDYLSRILQSVEQKGLVKGCSLNS-VSVSHLLFADDIL
        W+F+D +L  K +PI W K                   Q + +     R+ D   PF FV+AMDYLSR+L  +E  G +KG S +S  ++SH+LFADDIL
Subjt:  WNFVDKILAFKGYPITWPK------------------RQNQGRKRHSSRRSD--LPFYFVLAMDYLSRILQSVEQKGLVKGCSLNS-VSVSHLLFADDIL

Query:  LFVQDNDAMLGNLFNIIKVFELSSGLNINFNKSSITGINVEDSRVAQIAANWGCPTTQFPIPYLGSPLGGNPSSSPFWANTIDKIHRKLDSWRYSYISKG
        LF++DND  L NL   + +FE +SGL IN  KS++  +NV + R  + A+ WG  +   P+ YLG PLGGNP S  FW+N  +KI +KL++W+Y+ ISKG
Subjt:  LFVQDNDAMLGNLFNIIKVFELSSGLNINFNKSSITGINVEDSRVAQIAANWGCPTTQFPIPYLGSPLGGNPSSSPFWANTIDKIHRKLDSWRYSYISKG

Query:  GRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGQDQSSSIPLVSWDKVAAPIEAGGLGLFKTRVTNSAFQVKWLW------SSFMRRLLCG
        GRLTLI++TLS +P Y LS+F+AP   C +I+K  R+FLW G + S    L++W KV      GGLG+ +  VTN A   KWLW       +  RRL+  
Subjt:  GRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGQDQSSSIPLVSWDKVAAPIEAGGLGLFKTRVTNSAFQVKWLW------SSFMRRLLCG

Query:  SESFQQNIQPIDRELSQLRRDSLPPER--------------------------------HGHQFSSKPLFFWQIHRKNMTVSDAWDHTTSSWKLYPRRPL
         +   + I P D   +     S  P R                                 G   ++ P  F     K++TV DAW+   + W +  RR L
Subjt:  SESFQQNIQPIDRELSQLRRDSLPPER--------------------------------HGHQFSSKPLFFWQIHRKNMTVSDAWDHTTSSWKLYPRRPL

Query:  FDREAFAQAECASSFPIPALSGGADHMLWNPDSKGRFSVASARQFYWNLSSPDRPQPHSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQRIFKSNA
         DRE    A+     P P  + G+    W PDSK  FS+ASA+             P S +   +WK+ IP KIKFF+W +I R++NT + +Q    +  
Subjt:  FDREAFAQAECASSFPIPALSGGADHMLWNPDSKGRFSVASARQFYWNLSSPDRPQPHSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQRIFKSNA

Query:  LNPSCCPLCHAGSDELDHIFIHCGVASDIWRSFHLATGINMPIPRQVNWICLETFAAKATSQREILIQSMAAAILWVIWGERNRRIFQNTS--RTYINIW
        L PS C LC   S+   H+F+HC     +W     A  + +      +     +    + +++ +    +  A+ W IW ERNRRIF   S  +T  N+W
Subjt:  LNPSCCPLCHAGSDELDHIFIHCGVASDIWRSFHLATGINMPIPRQVNWICLETFAAKATSQREILIQSMAAAILWVIWGERNRRIFQNTS--RTYINIW

Query:  EDIITLASFWASSTKAFSNYQASSIALNWKAF
        E+   L   W S    F NY A++IALN   F
Subjt:  EDIITLASFWASSTKAFSNYQASSIALNWKAF

XP_016902461.1 PREDICTED: LINE-1 retrotransposable element ORF2 protein [Cucumis melo]; e value: 2.3e-191; % identity: 37.22
Query:  VKSRLDRFLYSPNWALKFQDHHTRRLGRVTSDHFPIVLENPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWESTYQEGFPGYSFIRRLKQLVSSIKLWKRN
        V SR+DRFLY+ NW   F  H+++ L RVTSDHFPIVLE+    WGP PF+  N  L E  F  N+  WW++  QEG PG+SF+R+LKQL + I+  +R 
Subjt:  VKSRLDRFLYSPNWALKFQDHHTRRLGRVTSDHFPIVLENPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWESTYQEGFPGYSFIRRLKQLVSSIKLWKRN

Query:  NRDLLKSRKQIISDEIAKIDRLENLGGLTDQMCQKRTSLKSDLHQVSMQEIRLWKQRCKKTWLKEGDENTTFFHK-------------------------
        N+      K     EI  IDRLE  G L++++  +RT LK+D+     +E ++W Q+ K+ W+ EGDENT+FFHK                         
Subjt:  NRDLLKSRKQIISDEIAKIDRLENLGGLTDQMCQKRTSLKSDLHQVSMQEIRLWKQRCKKTWLKEGDENTTFFHK-------------------------

Query:  -----------------GPTSEGWMVSNLNWCPISGPEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFHDFFQK-----
                         G     W++ NLNW PIS  +A +L   F+E E+ + L A  +NKSPGPDGFT+EF+K +W  ++  ++ +F DF        
Subjt:  -----------------GPTSEGWMVSNLNWCPISGPEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFHDFFQK-----

Query:  --------------------DFRPISLTTVLYRLIAKTLAERLK-----------------------------TIDFWKCSCTRGFVIKLDIEKAFEKIC
                            D+RPISLTT +Y+LIAK +AERLK                              ID+W+    +GFVIKLDIEKAF+K+ 
Subjt:  --------------------DFRPISLTTVLYRLIAKTLAERLK-----------------------------TIDFWKCSCTRGFVIKLDIEKAFEKIC

Query:  WNFVDKILAFKGYPITW--------------------PKRQNQGRKRHSSRRSDLPFYFVLAMDYLSRILQSVEQKGLVKGCSL-NSVSVSHLLFADDIL
        W F+D +L  KGYP  W                    P+ + Q  +         PF FVLAMDY+SR+L SV +K  +KG  L  +++++HLLFADDIL
Subjt:  WNFVDKILAFKGYPITW--------------------PKRQNQGRKRHSSRRSDLPFYFVLAMDYLSRILQSVEQKGLVKGCSL-NSVSVSHLLFADDIL

Query:  LFVQDNDAMLGNLFNIIKVFELSSGLNINFNKSSITGINVEDSRVAQIAANWGCPTTQFPIPYLGSPLGGNPSSSPFWANTIDKIHRKLDSWRYSYISKG
        LFV+D++  + NL NII +F+L+SGL+IN NKS+I+ INV+ SR  QIA+ WG  T   PI YLG PLGG   +  FW N  +KI++KL SW+YS +SKG
Subjt:  LFVQDNDAMLGNLFNIIKVFELSSGLNINFNKSSITGINVEDSRVAQIAANWGCPTTQFPIPYLGSPLGGNPSSSPFWANTIDKIHRKLDSWRYSYISKG

Query:  GRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGQDQSSSIPLVSWDKVAAPIEAGGLGLFKTRVTNSAFQVKWLW------SSFMRRLL--
        G++TLI+++L+ +P Y LSIFK P S C +I+K  R+FLW    ++  + LV+W K+ +  E GGLG+ + + TN A   KWLW      S   ++++  
Subjt:  GRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGQDQSSSIPLVSWDKVAAPIEAGGLGLFKTRVTNSAFQVKWLW------SSFMRRLL--

Query:  -------------CGSESFQQNIQPIDRELSQLRRDSLPPERHGHQFSSKPLFFWQIH-------------------RKNMTVSDAWDHTTSSWKLYPRR
                     C   S +     I + L   +R      ++G  FS     FW  H                    K  ++ D W++T   W L PRR
Subjt:  -------------CGSESFQQNIQPIDRELSQLRRDSLPPERHGHQFSSKPLFFWQIH-------------------RKNMTVSDAWDHTTSSWKLYPRR

Query:  PLFDREAFAQAECASSFPIPALSGGADHMLWNPDSKGRFSVASARQFYWNLSSPDR---PQPHSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQRI
         L + E    AE  +S        G D  +W  +S G ++VAS ++    L  P++          F NLWK  IPKK  FFIWT++Y  +NT ++L + 
Subjt:  PLFDREAFAQAECASSFPIPALSGGADHMLWNPDSKGRFSVASARQFYWNLSSPDR---PQPHSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQRI

Query:  FKSNALNPSCCPLCHAGSDELDHIFIHCGVASDIWRSF--HLATGINMPIPRQVNWICLETFAAKATSQREILIQSMAAAILWVIWGERNRRIFQNTSRT
          +    PS C +C    ++  H+FI C +A  IWRS   HL++ +N   P++   +C+   + K  +++ +++ +  A+ LW IW ERN RIF    +T
Subjt:  FKSNALNPSCCPLCHAGSDELDHIFIHCGVASDIWRSF--HLATGINMPIPRQVNWICLETFAAKATSQREILIQSMAAAILWVIWGERNRRIFQNTSRT

Query:  YINIWEDIITLASFWASSTKAFSNYQASSIALNWKAF
           IWEDI  LA  W S +  FSNYQASSIALN  AF
Subjt:  YINIWEDIITLASFWASSTKAFSNYQASSIALNWKAF

TrEMBL top hits (e value, % identity, alignment)
A0A1S4E2K5 LINE-1 retrotransposable element ORF2 protein; e value: 1.1e-191; % identity: 37.22
Query:  VKSRLDRFLYSPNWALKFQDHHTRRLGRVTSDHFPIVLENPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWESTYQEGFPGYSFIRRLKQLVSSIKLWKRN
        V SR+DRFLY+ NW   F  H+++ L RVTSDHFPIVLE+    WGP PF+  N  L E  F  N+  WW++  QEG PG+SF+R+LKQL + I+  +R 
Subjt:  VKSRLDRFLYSPNWALKFQDHHTRRLGRVTSDHFPIVLENPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWESTYQEGFPGYSFIRRLKQLVSSIKLWKRN

Query:  NRDLLKSRKQIISDEIAKIDRLENLGGLTDQMCQKRTSLKSDLHQVSMQEIRLWKQRCKKTWLKEGDENTTFFHK-------------------------
        N+      K     EI  IDRLE  G L++++  +RT LK+D+     +E ++W Q+ K+ W+ EGDENT+FFHK                         
Subjt:  NRDLLKSRKQIISDEIAKIDRLENLGGLTDQMCQKRTSLKSDLHQVSMQEIRLWKQRCKKTWLKEGDENTTFFHK-------------------------

Query:  -----------------GPTSEGWMVSNLNWCPISGPEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFHDFFQK-----
                         G     W++ NLNW PIS  +A +L   F+E E+ + L A  +NKSPGPDGFT+EF+K +W  ++  ++ +F DF        
Subjt:  -----------------GPTSEGWMVSNLNWCPISGPEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFHDFFQK-----

Query:  --------------------DFRPISLTTVLYRLIAKTLAERLK-----------------------------TIDFWKCSCTRGFVIKLDIEKAFEKIC
                            D+RPISLTT +Y+LIAK +AERLK                              ID+W+    +GFVIKLDIEKAF+K+ 
Subjt:  --------------------DFRPISLTTVLYRLIAKTLAERLK-----------------------------TIDFWKCSCTRGFVIKLDIEKAFEKIC

Query:  WNFVDKILAFKGYPITW--------------------PKRQNQGRKRHSSRRSDLPFYFVLAMDYLSRILQSVEQKGLVKGCSL-NSVSVSHLLFADDIL
        W F+D +L  KGYP  W                    P+ + Q  +         PF FVLAMDY+SR+L SV +K  +KG  L  +++++HLLFADDIL
Subjt:  WNFVDKILAFKGYPITW--------------------PKRQNQGRKRHSSRRSDLPFYFVLAMDYLSRILQSVEQKGLVKGCSL-NSVSVSHLLFADDIL

Query:  LFVQDNDAMLGNLFNIIKVFELSSGLNINFNKSSITGINVEDSRVAQIAANWGCPTTQFPIPYLGSPLGGNPSSSPFWANTIDKIHRKLDSWRYSYISKG
        LFV+D++  + NL NII +F+L+SGL+IN NKS+I+ INV+ SR  QIA+ WG  T   PI YLG PLGG   +  FW N  +KI++KL SW+YS +SKG
Subjt:  LFVQDNDAMLGNLFNIIKVFELSSGLNINFNKSSITGINVEDSRVAQIAANWGCPTTQFPIPYLGSPLGGNPSSSPFWANTIDKIHRKLDSWRYSYISKG

Query:  GRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGQDQSSSIPLVSWDKVAAPIEAGGLGLFKTRVTNSAFQVKWLW------SSFMRRLL--
        G++TLI+++L+ +P Y LSIFK P S C +I+K  R+FLW    ++  + LV+W K+ +  E GGLG+ + + TN A   KWLW      S   ++++  
Subjt:  GRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGQDQSSSIPLVSWDKVAAPIEAGGLGLFKTRVTNSAFQVKWLW------SSFMRRLL--

Query:  -------------CGSESFQQNIQPIDRELSQLRRDSLPPERHGHQFSSKPLFFWQIH-------------------RKNMTVSDAWDHTTSSWKLYPRR
                     C   S +     I + L   +R      ++G  FS     FW  H                    K  ++ D W++T   W L PRR
Subjt:  -------------CGSESFQQNIQPIDRELSQLRRDSLPPERHGHQFSSKPLFFWQIH-------------------RKNMTVSDAWDHTTSSWKLYPRR

Query:  PLFDREAFAQAECASSFPIPALSGGADHMLWNPDSKGRFSVASARQFYWNLSSPDR---PQPHSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQRI
         L + E    AE  +S        G D  +W  +S G ++VAS ++    L  P++          F NLWK  IPKK  FFIWT++Y  +NT ++L + 
Subjt:  PLFDREAFAQAECASSFPIPALSGGADHMLWNPDSKGRFSVASARQFYWNLSSPDR---PQPHSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQRI

Query:  FKSNALNPSCCPLCHAGSDELDHIFIHCGVASDIWRSF--HLATGINMPIPRQVNWICLETFAAKATSQREILIQSMAAAILWVIWGERNRRIFQNTSRT
          +    PS C +C    ++  H+FI C +A  IWRS   HL++ +N   P++   +C+   + K  +++ +++ +  A+ LW IW ERN RIF    +T
Subjt:  FKSNALNPSCCPLCHAGSDELDHIFIHCGVASDIWRSF--HLATGINMPIPRQVNWICLETFAAKATSQREILIQSMAAAILWVIWGERNRRIFQNTSRT

Query:  YINIWEDIITLASFWASSTKAFSNYQASSIALNWKAF
           IWEDI  LA  W S +  FSNYQASSIALN  AF
Subjt:  YINIWEDIITLASFWASSTKAFSNYQASSIALNWKAF

A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein; e value: 1.0e-184; % identity: 36.06
Query:  SRLDRFLYSPNWALKFQDHHTRRLGRVTSDHFPIVLENPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWESTYQEGFPGYSFIRRLKQLVSSIKLWKRNNR
        SRLDRFL++  W   F  H ++ L R TSDHFPIVLE+    WGP PFRF N  L + D+ KNI+FWW +T Q G+ GYSF+RRLKQL   IK W R+ +
Subjt:  SRLDRFLYSPNWALKFQDHHTRRLGRVTSDHFPIVLENPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWESTYQEGFPGYSFIRRLKQLVSSIKLWKRNNR

Query:  DLLKSRKQIISDEIAKIDRLENLGGLTDQMCQKRTSLKSDLHQVSMQEIRLWKQRCKKTWLKEGDENTTFFHKGPT------------------------
           ++ K+    EI +ID+LE  G  T+   +KRT+LK+DL Q+++ E ++W Q+CK+ W+ EGDEN++FFHK  T                        
Subjt:  DLLKSRKQIISDEIAKIDRLENLGGLTDQMCQKRTSLKSDLHQVSMQEIRLWKQRCKKTWLKEGDENTTFFHKGPT------------------------

Query:  -----------------SEGWMVSNLNWCPISGPEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFHDFFQ---------
                         +    + NL+WCPIS   +  L +PF+E E++  LK+   NK+PGPDG+ ++F +KSW  ++ ++ ++F DF           
Subjt:  -----------------SEGWMVSNLNWCPISGPEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFHDFFQ---------

Query:  ----------------KDFRPISLTTVLYRLIAKTLAERLK-----------------------------TIDFWKCSCTRGFVIKLDIEKAFEKICWNF
                         DFRPISLTT +Y+LIAKTLA+RLK                              +DFW+    RGFVIKLDIEKAF+K+ W F
Subjt:  ----------------KDFRPISLTTVLYRLIAKTLAERLK-----------------------------TIDFWKCSCTRGFVIKLDIEKAFEKICWNF

Query:  VDKILAFKGYPITWPKR----------------QNQGRKRHSS--RRSD--LPFYFVLAMDYLSRILQSVEQKGLVKGCSLN-SVSVSHLLFADDILLFV
        +D +L  K Y   W K                 + +GR + S   R+ D   PF FVLAMDYLSR+L ++  K  + G   + +++++H+LFADDIL+FV
Subjt:  VDKILAFKGYPITWPKR----------------QNQGRKRHSS--RRSD--LPFYFVLAMDYLSRILQSVEQKGLVKGCSLN-SVSVSHLLFADDILLFV

Query:  QDNDAMLGNLFNIIKVFELSSGLNINFNKSSITGINVEDSRVAQIAANWGCPTTQFPIPYLGSPLGGNPSSSPFWANTIDKIHRKLDSWRYSYISKGGRL
        +D D  + NL  I+ +FE +SGLNIN +KS+I  INV   R   IA +WG      P  YLG PLGG PSSS FW N + KI +KL +W+YS +SKGGR+
Subjt:  QDNDAMLGNLFNIIKVFELSSGLNINFNKSSITGINVEDSRVAQIAANWGCPTTQFPIPYLGSPLGGNPSSSPFWANTIDKIHRKLDSWRYSYISKGGRL

Query:  TLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGQDQSSSIPLVSWDKVAAPIEAGGLGLFKTRVTNSAFQVKWLW------SSFMRRLLCGS--
        TLI +TL  +P Y +S+FK P+ +   I+   R+FLW+G     +I L+ W+++ +P E GGLG+     TN A   KWLW          +RL+     
Subjt:  TLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGQDQSSSIPLVSWDKVAAPIEAGGLGLFKTRVTNSAFQVKWLW------SSFMRRLLCGS--

Query:  ----------ESFQQNIQP---IDRELSQLRRDSLPPERHGHQFS--------------SKPLFFWQIHRKNMTVSDAWDHTTSSWKLYPRRPLFDREAF
                    F  N  P   +   +S   ++       G   S              + P  F     K  +V + W+ +++ W L+  RPL D E  
Subjt:  ----------ESFQQNIQP---IDRELSQLRRDSLPPERHGHQFS--------------SKPLFFWQIHRKNMTVSDAWDHTTSSWKLYPRRPLFDREAF

Query:  AQAECASSFPIPALSGGADHMLWNPDSKGRFSVASARQFYWNLSSPDRPQP-HSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQRIFKSNALNPSC
              +S P P  + G    LWN +S   F  AS ++      +P  P   H  ++  LWK   PKK KFFIWT+I+  +NT DRLQ+   +  L+P+ 
Subjt:  AQAECASSFPIPALSGGADHMLWNPDSKGRFSVASARQFYWNLSSPDRPQP-HSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQRIFKSNALNPSC

Query:  CPLCHAGSDELDHIFIHCGVASDIWRSFHLATGINMPIPRQVNWICLETFAAKATSQREILIQSMAAAILWVIWGERNRRIFQNTSRTYINIWEDIITLA
        C +C+   ++++H+FIHC  +  +W         N   P  V  +     +    +Q+ ++  +  A ILW IW ERN RIF+   +   ++WED +   
Subjt:  CPLCHAGSDELDHIFIHCGVASDIWRSFHLATGINMPIPRQVNWICLETFAAKATSQREILIQSMAAAILWVIWGERNRRIFQNTSRTYINIWEDIITLA

Query:  SFWASSTKAFSNYQASSIALNWKAFL
          W+  +K FSNY   SIALN  AF+
Subjt:  SFWASSTKAFSNYQASSIALNWKAFL

A0A5A7TTK1 LINE-1 retrotransposable element ORF2 protein; e value: 4.2e-183; % identity: 35.95
Query:  SRLDRFLYSPNWALKFQDHHTRRLGRVTSDHFPIVLE--NPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWESTYQEGFPGYSFIRRLKQLVSSIKLWKRN
        SR+DRFLY+P+W   F  H TR L R TSDHFP+V E  NP+ RWGP PFR ++  L + DF +N+  WW+++ Q+G PGYSFI+RLK L + IK W++ 
Subjt:  SRLDRFLYSPNWALKFQDHHTRRLGRVTSDHFPIVLE--NPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWESTYQEGFPGYSFIRRLKQLVSSIKLWKRN

Query:  NRDLLKSRKQIISDEIAKIDRLENLGGLTDQMCQKRTSLKSDLHQVSMQEIRLWKQRCKKTWLKEGDENTTFFH--------------------------
          +   S K  I  E+  ID+ E    L+ +   +R +LK+DL ++S++E + W QR KK WL+EGDEN++FFH                          
Subjt:  NRDLLKSRKQIISDEIAKIDRLENLGGLTDQMCQKRTSLKSDLHQVSMQEIRLWKQRCKKTWLKEGDENTTFFH--------------------------

Query:  ---------------KGPT-SEGWMVSNLNWCPISGPEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFHDFFQ------
                       +GPT S+ + + NL W PI   E  +L  PF E E+   + ++   K+PGPDGF + FFK  W  ++  +M++F DF+       
Subjt:  ---------------KGPT-SEGWMVSNLNWCPISGPEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFHDFFQ------

Query:  -------------------KDFRPISLTTVLYRLIAKTLAERLKT-----------------------------IDFWKCSCTRGFVIKLDIEKAFEKIC
                           KDFRPISLTT +Y++IAKTL+ RLKT                             +DFWK    +GF++KLDIEKAF+ + 
Subjt:  -------------------KDFRPISLTTVLYRLIAKTLAERLKT-----------------------------IDFWKCSCTRGFVIKLDIEKAFEKIC

Query:  WNFVDKILAFKGYPITWPK------------------RQNQGRKRHSSRRSD--LPFYFVLAMDYLSRILQSVEQKGLVKGCSLNS-VSVSHLLFADDIL
        W+F+D +L  K +PI W K                   Q + +     R+ D   PF FV+AMDYLSR+L  +E  G +KG S +S  ++SH+LFADDIL
Subjt:  WNFVDKILAFKGYPITWPK------------------RQNQGRKRHSSRRSD--LPFYFVLAMDYLSRILQSVEQKGLVKGCSLNS-VSVSHLLFADDIL

Query:  LFVQDNDAMLGNLFNIIKVFELSSGLNINFNKSSITGINVEDSRVAQIAANWGCPTTQFPIPYLGSPLGGNPSSSPFWANTIDKIHRKLDSWRYSYISKG
        LF++DND  L NL   + +FE +SGL IN  KS++  +NV + R  + A+ WG  +   P+ YLG PLGGNP S  FW+N  +KI +KL++W+Y+ ISKG
Subjt:  LFVQDNDAMLGNLFNIIKVFELSSGLNINFNKSSITGINVEDSRVAQIAANWGCPTTQFPIPYLGSPLGGNPSSSPFWANTIDKIHRKLDSWRYSYISKG

Query:  GRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGQDQSSSIPLVSWDKVAAPIEAGGLGLFKTRVTNSAFQVKWLW------SSFMRRLLCG
        GRLTLI++TLS +P Y LS+F+AP   C +I+K  R+FLW G + S    L++W KV      GGLG+ +  VTN A   KWLW       +  RRL+  
Subjt:  GRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGQDQSSSIPLVSWDKVAAPIEAGGLGLFKTRVTNSAFQVKWLW------SSFMRRLLCG

Query:  SESFQQNIQPIDRELSQLRRDSLPPER--------------------------------HGHQFSSKPLFFWQIHRKNMTVSDAWDHTTSSWKLYPRRPL
         +   + I P D   +     S  P R                                 G   ++ P  F     K++TV DAW+   + W +  RR L
Subjt:  SESFQQNIQPIDRELSQLRRDSLPPER--------------------------------HGHQFSSKPLFFWQIHRKNMTVSDAWDHTTSSWKLYPRRPL

Query:  FDREAFAQAECASSFPIPALSGGADHMLWNPDSKGRFSVASARQFYWNLSSPDRPQPHSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQRIFKSNA
         DRE    A+     P P  + G+    W PDSK  FS+ASA+             P S +   +WK+ IP KIKFF+W +I R++NT + +Q    +  
Subjt:  FDREAFAQAECASSFPIPALSGGADHMLWNPDSKGRFSVASARQFYWNLSSPDRPQPHSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQRIFKSNA

Query:  LNPSCCPLCHAGSDELDHIFIHCGVASDIWRSFHLATGINMPIPRQVNWICLETFAAKATSQREILIQSMAAAILWVIWGERNRRIFQNTS--RTYINIW
        L PS C LC   S+   H+F+HC     +W     A  + +      +     +    + +++ +    +  A+ W IW ERNRRIF   S  +T  N+W
Subjt:  LNPSCCPLCHAGSDELDHIFIHCGVASDIWRSFHLATGINMPIPRQVNWICLETFAAKATSQREILIQSMAAAILWVIWGERNRRIFQNTS--RTYINIW

Query:  EDIITLASFWASSTKAFSNYQASSIALNWKAF
        E+   L   W S    F NY A++IALN   F
Subjt:  EDIITLASFWASSTKAFSNYQASSIALNWKAF

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein; e value: 1.2e-182; % identity: 36.68
Query:  SRLDRFLYSPNWALKFQDHHTRRLGRVTSDHFPIVLENPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWESTYQEGFPGYSFIRRLKQLVSSIKLWKRNNR
        SRLDRFL S  W   F  H +R L R  SDHFPI+LE+P+ +WGPCPFR +N  L +K+F KN   WW S+ Q GFPGY+FI+ L  L   IK W+ N  
Subjt:  SRLDRFLYSPNWALKFQDHHTRRLGRVTSDHFPIVLENPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWESTYQEGFPGYSFIRRLKQLVSSIKLWKRNNR

Query:  DLLKSRKQIISDEIAKIDRLENLGGLTDQMCQKRTSLKSDLHQVSMQEIRLWKQRCKKTWLKEGDENTTFFHK---------------------------
        +L  + K+ +  EI  ID+LE  G ++    QKR SLKSDL  +   + ++W QR ++ W   GDEN ++FH+                           
Subjt:  DLLKSRKQIISDEIAKIDRLENLGGLTDQMCQKRTSLKSDLHQVSMQEIRLWKQRCKKTWLKEGDENTTFFHK---------------------------

Query:  --------------GPTSEGWMVSNLNWCPISGPEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFHDFFQ---------
                        + E  ++ NL+W PIS    + L +PF E E+   + +  + K+PGPDG+T+ F+KK W  ++  ++ VF DF +         
Subjt:  --------------GPTSEGWMVSNLNWCPISGPEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFHDFFQ---------

Query:  ----------------KDFRPISLTTVLYRLIAKTLAERLKT-----------------------------IDFWKCSCTRGFVIKLDIEKAFEKICWNF
                         D+RPISLTT LY+++AK LA RLK+                             ID WK    +GFV+KLDIEKAF+KI W+F
Subjt:  ----------------KDFRPISLTTVLYRLIAKTLAERLKT-----------------------------IDFWKCSCTRGFVIKLDIEKAFEKICWNF

Query:  VDKILAFKGYPITWPK------------------RQNQGRKRHSSRRSD--LPFYFVLAMDYLSRILQSVEQKGLVKGCSLNS-VSVSHLLFADDILLFV
        +D +LA K +P  W K                   + + +     R+ D   PF FVLAMDYLSR+L  +E KG +KG S N+  ++SHLLFADD+L+FV
Subjt:  VDKILAFKGYPITWPK------------------RQNQGRKRHSSRRSD--LPFYFVLAMDYLSRILQSVEQKGLVKGCSLNS-VSVSHLLFADDILLFV

Query:  QDNDAMLGNLFNIIKVFELSSGLNINFNKSSITGINVEDSRVAQIAANWGCPTTQFPIPYLGSPLGGNPSSSPFWANTIDKIHRKLDSWRYSYISKGGRL
        +DN+  L NL   + +FE +SGL  N +KS+I+ IN+   R  QIA+ +G  T   P+ YLG PLGGNP S  FW  TI+ IH+KL+ W+YS ISKGGRL
Subjt:  QDNDAMLGNLFNIIKVFELSSGLNINFNKSSITGINVEDSRVAQIAANWGCPTTQFPIPYLGSPLGGNPSSSPFWANTIDKIHRKLDSWRYSYISKGGRL

Query:  TLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGQDQSSSIPLVSWDKVAAPIEAGGLGLFKTRVTNSAFQVKWLW------SSFMRRLLCGSES
        TL++A+LS +P Y LS FKAP SV   I+K  R FLW G +   +  L++W+   +P E GGLG+ K + TN A   KWLW      +S  ++  C    
Subjt:  TLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGQDQSSSIPLVSWDKVAAPIEAGGLGLFKTRVTNSAFQVKWLW------SSFMRRLLCGSES

Query:  FQQNIQ---PIDRELSQLRRDSLPPERHGHQFSSK---------PLFFW------------QIHR-------KNMTVSDAWDHTTSSWKLYPRRPLFDRE
        + +N Q   P+    S         ++    + SK          L FW            QI R       ++ TV + WD  +  W + PRRPL +RE
Subjt:  FQQNIQ---PIDRELSQLRRDSLPPERHGHQFSSK---------PLFFW------------QIHR-------KNMTVSDAWDHTTSSWKLYPRRPLFDRE

Query:  AFAQAECASSFPIPALSGGADHMLWNPDSKGRFSVASARQFYWNLSSPDRPQPHSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQRIFKSNALNPS
                 S P    + G     WNP    +++VASA+   +  SS  +         +LW++ IP+K KFFIWT+++++LNT D +Q+   S +LNPS
Subjt:  AFAQAECASSFPIPALSGGADHMLWNPDSKGRFSVASARQFYWNLSSPDRPQPHSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQRIFKSNALNPS

Query:  CCPLCHAGSDELDHIFIHCGVASDIWRSFHLATGINMPIPRQVNWICLETFAAKATSQREILIQSMAAAILWVIWGERNRRIFQNTSRTYINIWEDIITL
         C  C + +++++H+FI C  A  +W  +   TG  M +   V  +CL+       + + I+  + A A LW IW  RN  IF +   +Y+N WEDI TL
Subjt:  CCPLCHAGSDELDHIFIHCGVASDIWRSFHLATGINMPIPRQVNWICLETFAAKATSQREILIQSMAAAILWVIWGERNRRIFQNTSRTYINIWEDIITL

Query:  ASFWASSTKAFSNYQASSIALNWKA
           W+S +K   NY  ++IALN KA
Subjt:  ASFWASSTKAFSNYQASSIALNWKA

A0A5D3DM72 LINE-1 retrotransposable element ORF2 protein | e value: 3.8e-192 | % identity: 37.32
Query:  VKSRLDRFLYSPNWALKFQDHHTRRLGRVTSDHFPIVLENPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWESTYQEGFPGYSFIRRLKQLVSSIKLWKRN
        V SR+DRFLY+ NW   F  H+++ L RVTSDHFPIVLE+    WGP PF+  N  L E  F  N+  WW++  QEG PG+SF+R+LKQL + I+  +R 
Subjt:  VKSRLDRFLYSPNWALKFQDHHTRRLGRVTSDHFPIVLENPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWESTYQEGFPGYSFIRRLKQLVSSIKLWKRN

Query:  NRDLLKSRKQIISDEIAKIDRLENLGGLTDQMCQKRTSLKSDLHQVSMQEIRLWKQRCKKTWLKEGDENTTFFHK-------------------------
        N+      K     EI  IDRLE  G L++++  +RT LK+D+     +E ++W Q+ K+ W+ EGDENT+FFHK                         
Subjt:  NRDLLKSRKQIISDEIAKIDRLENLGGLTDQMCQKRTSLKSDLHQVSMQEIRLWKQRCKKTWLKEGDENTTFFHK-------------------------

Query:  -----------------GPTSEGWMVSNLNWCPISGPEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFHDFFQK-----
                         G     W++ NLNW PIS  +A +L   F+E E+ + L A  +NKSPGPDGFT+EF+K +W  ++  ++ +F DF        
Subjt:  -----------------GPTSEGWMVSNLNWCPISGPEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFHDFFQK-----

Query:  --------------------DFRPISLTTVLYRLIAKTLAERLK-----------------------------TIDFWKCSCTRGFVIKLDIEKAFEKIC
                            D+RPISLTT +Y+LIAK +AERLK                              ID+W+    +GFVIKLDIEKAF+K+ 
Subjt:  --------------------DFRPISLTTVLYRLIAKTLAERLK-----------------------------TIDFWKCSCTRGFVIKLDIEKAFEKIC

Query:  WNFVDKILAFKGYPITW--------------------PKRQNQGRKRHSSRRSDLPFYFVLAMDYLSRILQSVEQKGLVKGCSL-NSVSVSHLLFADDIL
        W F+D +L  KGYP  W                    P+ + Q  +         PF FVLAMDY+SR+L SV +K  +KG  L  +++++HLLFADDIL
Subjt:  WNFVDKILAFKGYPITW--------------------PKRQNQGRKRHSSRRSDLPFYFVLAMDYLSRILQSVEQKGLVKGCSL-NSVSVSHLLFADDIL

Query:  LFVQDNDAMLGNLFNIIKVFELSSGLNINFNKSSITGINVEDSRVAQIAANWGCPTTQFPIPYLGSPLGGNPSSSPFWANTIDKIHRKLDSWRYSYISKG
        LFV+D++  + NL NII +F+L+SGL+IN NKS+I+ INV+ SR  QIA+ WG  T   PI YLG PLGG   +  FW N  +KI++KL SW+YS +SKG
Subjt:  LFVQDNDAMLGNLFNIIKVFELSSGLNINFNKSSITGINVEDSRVAQIAANWGCPTTQFPIPYLGSPLGGNPSSSPFWANTIDKIHRKLDSWRYSYISKG

Query:  GRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGQDQSSSIPLVSWDKVAAPIEAGGLGLFKTRVTNSAFQVKWLW------SSFMRRLL--
        G++TLI+++L+ +P Y LSIFKAP S C +I+K  R+FLW    ++  + LV+W K+ +  E GGLG+ + + TN A   KWLW      S   ++++  
Subjt:  GRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGQDQSSSIPLVSWDKVAAPIEAGGLGLFKTRVTNSAFQVKWLW------SSFMRRLL--

Query:  -------------CGSESFQQNIQPIDRELSQLRRDSLPPERHGHQFSSKPLFFWQIH-------------------RKNMTVSDAWDHTTSSWKLYPRR
                     C   S +     I + L   +R      ++G  FS     FW  H                    K  ++ D W++T   W L PRR
Subjt:  -------------CGSESFQQNIQPIDRELSQLRRDSLPPERHGHQFSSKPLFFWQIH-------------------RKNMTVSDAWDHTTSSWKLYPRR

Query:  PLFDREAFAQAECASSFPIPALSGGADHMLWNPDSKGRFSVASARQFYWNLSSPDR---PQPHSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQRI
         L + E    AE  +S        G D  +W  +S G ++VAS ++    L  P++          F NLWK  IPKK  FFIWT++Y  +NT ++L + 
Subjt:  PLFDREAFAQAECASSFPIPALSGGADHMLWNPDSKGRFSVASARQFYWNLSSPDR---PQPHSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQRI

Query:  FKSNALNPSCCPLCHAGSDELDHIFIHCGVASDIWRSF--HLATGINMPIPRQVNWICLETFAAKATSQREILIQSMAAAILWVIWGERNRRIFQNTSRT
          +    PS C +C    ++  H+FI C +A  IWRS   HL++ +N   P++   +C+   + K  +++ +++ +  A+ LW IW ERN RIF    +T
Subjt:  FKSNALNPSCCPLCHAGSDELDHIFIHCGVASDIWRSF--HLATGINMPIPRQVNWICLETFAAKATSQREILIQSMAAAILWVIWGERNRRIFQNTSRT

Query:  YINIWEDIITLASFWASSTKAFSNYQASSIALNWKAF
           IWEDI  LA  W S +  FSNYQASSIALN  AF
Subjt:  YINIWEDIITLASFWASSTKAFSNYQASSIALNWKAF

SwissProt top hits | e value | % identity | Alignment
O00370 LINE-1 retrotransposable element ORF2 protein | e value: 1.1e-15 | % identity: 21.06
Query:  EATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVF----------HDFFQ----------------KDFRPISLTTVLYRLIA
        E  SL +P +  E+   + ++   KSPGPDGFT EF+++    + P ++++F          + F++                ++FRPISL  +  +++ 
Subjt:  EATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVF----------HDFFQ----------------KDFRPISLTTVLYRLIA

Query:  KTLAERLKT----------------IDFW--------------KCSCTRGFVIKLDIEKAFEKICWNFVDKILAFKGYPITWPK----------------
        K LA R++                 +  W              +       +I +D EKAF+KI   F+ K L   G    + K                
Subjt:  KTLAERLKT----------------IDFW--------------KCSCTRGFVIKLDIEKAFEKICWNFVDKILAFKGYPITWPK----------------

Query:  -RQNQGRKRHSSRRSDLPFYFVLAMDYLSRILQSVEQKGLVKGCSLNSVSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFNKSSITGIN
         ++ +     +  R   P   +L    L  + +++ Q+  +KG  L    V   LFADD+++++++      NL  +I  F   SG  IN  KS     N
Subjt:  -RQNQGRKRHSSRRSDLPFYFVLAMDYLSRILQSVEQKGLVKGCSLNSVSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFNKSSITGIN

Query:  VEDSRVAQIAANWGCPTTQFPIPYLGSPLGGNPSS--SPFWANTIDKIHRKLDSWRYSYISKGGRLTLIRATLSGIPNYLLSI--FKAPQSVCFSIDKII
              +QI            I YLG  L  +        +   + +I    + W+    S  GR+ +++  +     Y  +    K P +    ++K  
Subjt:  VEDSRVAQIAANWGCPTTQFPIPYLGSPLGGNPSS--SPFWANTIDKIHRKLDSWRYSYISKGGRLTLIRATLSGIPNYLLSI--FKAPQSVCFSIDKII

Query:  RSFLWHGQDQSSSIPLVSWDKVAAPIEAGGLGL--FKTRVTNSAFQVKWLW
          F+W+ +    +  ++S        +AGG+ L  FK     +  +  W W
Subjt:  RSFLWHGQDQSSSIPLVSWDKVAAPIEAGGLGL--FKTRVTNSAFQVKWLW

P08548 LINE-1 reverse transcriptase homolog | e value: 7.3e-15 | % identity: 20.88
Query:  ISGPEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFHD----------FFQ----------------KDFRPISLTTVLY
        +S  E   L +P S  E+   ++ +   KSPGPDGFT EF++     + P ++ +F +          F++                +++RPISL  +  
Subjt:  ISGPEATSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFHD----------FFQ----------------KDFRPISLTTVLY

Query:  RLIAKTLAERLK----------TIDF------W--------------KCSCTRGFVIKLDIEKAFEKICWNFVDKILAFKGYPITWPKR-----------
        +++ K L  R++           + F      W              K       ++ +D EKAF+ I   F+ + L   G   T+ K            
Subjt:  RLIAKTLAERLK----------TIDF------W--------------KCSCTRGFVIKLDIEKAFEKICWNFVDKILAFKGYPITWPKR-----------

Query:  ------QNQGRKRHSSRRSDLPFYFVLAMDYLSRILQSVEQKGLVKGCSLNSVSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFNKSSI
              + +     S  R   P   +L    +  +  ++ ++  +KG  + S  +   LFADD+++++++       L  +IK +   SG  IN +KS  
Subjt:  ------QNQGRKRHSSRRSDLPFYFVLAMDYLSRILQSVEQKGLVKGCSLNSVSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFNKSSI

Query:  TGINVEDSRVAQIAANWGCPTTQFP--IPYLGSPLGGNPSS--SPFWANTIDKIHRKLDSWRYSYISKGGRLTLIRATL--SGIPNYLLSIFKAPQSVCF
              ++  A+       P T  P  + YLG  L  +        +     +I   ++ W+    S  GR+ +++ ++    I N+     KAP S   
Subjt:  TGINVEDSRVAQIAANWGCPTTQFP--IPYLGSPLGGNPSS--SPFWANTIDKIHRKLDSWRYSYISKGGRLTLIRATL--SGIPNYLLSIFKAPQSVCF

Query:  SIDKIIRSFLWHGQDQSSSIPLVSWDKVAAPIEAGGLGLFKTRVTNSAFQVKWLW
         ++KII  F+W+ +      P ++   ++   +AGG+ L   R+   +  +K  W
Subjt:  SIDKIIRSFLWHGQDQSSSIPLVSWDKVAAPIEAGGLGLFKTRVTNSAFQVKWLW

P0C2F6 Putative ribonuclease H protein At1g65750 | e value: 5.2e-21 | % identity: 24.51
Query:  IDKIHRKLDSWRYSYISKGGRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGQDQSSSIPLVSWDKVAAPIEAGGLGLFKTRVTNSAF--Q
        ++++  ++  WR   +S  GRLTL +A LS +P + +S    PQS+   +D++ R+FLW    +     LV W KV +P + GGLG+   +  N A   +
Subjt:  IDKIHRKLDSWRYSYISKGGRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGQDQSSSIPLVSWDKVAAPIEAGGLGLFKTRVTNSAF--Q

Query:  VKW--------LWSSFMR-----------RLLCGSESFQQNIQPID---RELSQLRRDSLPPERHGHQF------SSKPLFFWQIHRK-----NMTVSDA
        V W        LW+  ++           R L    S+    + I    R++       +P +    +F      S KPL       +      +   D 
Subjt:  VKW--------LWSSFMR-----------RLLCGSESFQQNIQPID---RELSQLRRDSLPPERHGHQF------SSKPLFFWQIHRK-----NMTVSDA

Query:  W------------DHTTSSWKLYPRRPLFDREAFAQAECASSFPIPALSGGADHMLWNPDSKGRFSVASARQFYWNLSSPDRPQPHSIIFSN-LWKALIP
        W             +TT++ +L  R  + D                 ++G  D + W     G+FSV SA   Y  L+  + P+P+   F N LWK  +P
Subjt:  W------------DHTTSSWKLYPRRPLFDREAFAQAECASSFPIPALSGGADHMLWNPDSKGRFSVASARQFYWNLSSPDRPQPHSIIFSN-LWKALIP

Query:  KKIKFFIWTVIYRRLNTTDRLQRIFKSNALNPSCCPLCHAGSDELDHIFIHCGVASDIW
        +++K F+W V  + + T +   R   S +   + C +C  G + + H+   C     IW
Subjt:  KKIKFFIWTVIYRRLNTTDRLQRIFKSNALNPSCCPLCHAGSDELDHIFIHCGVASDIW

P11369 LINE-1 retrotransposable element ORF2 protein | e value: 8.9e-13 | % identity: 21.28
Query:  LTDQMCQKRTSLKSDLHQVSMQEIRLWKQRCKKTWLKEGDENTTFFHKGPTSEGWMVSNLNWCPISGPEATSLIQPFSELEVFQNLKAMGHNKSPGPDGF
        L +++  ++  + +D  ++    IR + +R   T L+  DE   F  +      + V  LN       +   L  P S  E+   + ++   KSPGPDGF
Subjt:  LTDQMCQKRTSLKSDLHQVSMQEIRLWKQRCKKTWLKEGDENTTFFHKGPTSEGWMVSNLNWCPISGPEATSLIQPFSELEVFQNLKAMGHNKSPGPDGF

Query:  TVEFFKKSWIAIRPSVMEVFH----------DFF----------QKD------FRPISLTTVLYRLIAKTLAERLKT----------------IDFW---
        + EF++     + P + ++FH           F+          QKD      FRPISL  +  +++ K LA R++                 +  W   
Subjt:  TVEFFKKSWIAIRPSVMEVFH----------DFF----------QKD------FRPISLTTVLYRLIAKTLAERLKT----------------IDFW---

Query:  -----------KCSCTRGFVIKLDIEKAFEKICWNFVDKILAFKG------------YPITWPKRQNQGRKRH-----SSRRSDLPFYFVLAMDYLSRIL
                   K       +I LD EKAF+KI   F+ K+L   G            Y       +  G K       S  R   P    L    L  + 
Subjt:  -----------KCSCTRGFVIKLDIEKAFEKICWNFVDKILAFKG------------YPITWPKRQNQGRKRH-----SSRRSDLPFYFVLAMDYLSRIL

Query:  QSVEQKGLVKGCSLNSVSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFNKSSITGINVEDSRVAQIAANWGCPTTQFPIPYLGSPLGGN
        +++ Q+  +KG  +    V   L ADD+++++ D       L N+I  F    G  IN NKS             +I            I YLG  L   
Subjt:  QSVEQKGLVKGCSLNSVSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFNKSSITGINVEDSRVAQIAANWGCPTTQFPIPYLGSPLGGN

Query:  PSS--SPFWANTIDKIHRKLDSWRYSYISKGGRLTLIRATL--SGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGQDQSSSIPLVSWDKVAAPIEAGGLG
                + +   +I   L  W+    S  GR+ +++  +    I  +     K P      ++  I  F+W+ +    +  L+   + +  I    L 
Subjt:  PSS--SPFWANTIDKIHRKLDSWRYSYISKGGRLTLIRATL--SGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGQDQSSSIPLVSWDKVAAPIEAGGLG

Query:  LFKTRVTNSAFQVKWLW
        L+   +     +  W W
Subjt:  LFKTRVTNSAFQVKWLW

P14381 Transposon TX1 uncharacterized 149 kDa protein | e value: 6.6e-08 | % identity: 22.99
Query:  VVKSRLDRFLYSPNWALKFQDHHTRRLGRVTSDHFPIVLENPEFRWGP--CPFRFDNYLLHEKDFTKNIKFWWES--TYQEGFP--------GYSFIRRL
        V +SR+DR   S +   + Q   T RL    SDH  + L        P    + F+N LL ++ F K+++  W     +Q+ F         G   ++ L
Subjt:  VVKSRLDRFLYSPNWALKFQDHHTRRLGRVTSDHFPIVLENPEFRWGP--CPFRFDNYLLHEKDFTKNIKFWWES--TYQEGFP--------GYSFIRRL

Query:  KQLVSSIKLWKRNNRDLLKSRKQIISDEIAKIDRLENLGGLTDQMCQ-KRTSLKSDLHQVSMQEIRLWKQRCKKTWLKEGDENTTFFH-----KGPTSEG
         Q  +     +RN      +  + ++ E+  +D  + L G  DQ  Q +    K  L  +  ++ R    R +   L + D  + FF+     KG   + 
Subjt:  KQLVSSIKLWKRNNRDLLKSRKQIISDEIAKIDRLENLGGLTDQMCQ-KRTSLKSDLHQVSMQEIRLWKQRCKKTWLKEGDENTTFFH-----KGPTSEG

Query:  WMVSNLNWCPISGPEA--------------------------------------TSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSV
          +   +  P+  PEA                                        L  P +  E+ Q L+ M HNKSPG DG T+EFF+  W  + P  
Subjt:  WMVSNLNWCPISGPEA--------------------------------------TSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSV

Query:  MEVFHDFFQ-------------------------KDFRPISLTTVLYRLIAKTLAERLKTI
          V  + F+                         K++RP+SL +  Y+++AK ++ RLK++
Subjt:  MEVFHDFFQ-------------------------KDFRPISLTTVLYRLIAKTLAERLKTI

Arabidopsis top hits | e value | % identity | Alignment
AT1G43760.1 DNAse I-like superfamily protein | e value: 4.1e-13 | % identity: 23.08
Query:  RLDRFLYSPNWALKFQDHHTRRLGRVTSDHFP--IVLENPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWESTYQEGFPGYSFIRRLKQLVSSIKLWKRNN
        +LDR + + +W   F            SDH P  I+LEN   R   C FR+ ++L     F  ++   WE     G   +S    LK      KL  R  
Subjt:  RLDRFLYSPNWALKFQDHHTRRLGRVTSDHFP--IVLENPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWESTYQEGFPGYSFIRRLKQLVSSIKLWKRNN

Query:  RDLLKSRKQIISDEIAKIDRLENLGGLTDQMCQKRTSLKSDLHQVSMQEIRLWKQRCKKTWLKEGDENTTFFHKG-------------PTSEGWMVSNLN
           ++ + +   D +  I + + L   +D + +     +   +  +      ++Q+ +  WL++GD NT FFHK                 +   V N+ 
Subjt:  RDLLKSRKQIISDEIAKIDRLENLGGLTDQMCQKRTSLKSDLHQVSMQEIRLWKQRCKKTWLKEGDENTTFFHKG-------------PTSEGWMVSNLN

Query:  ---------WCPISGPEATSL----------IQPF--------------SELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFHDFFQKD-
                 +  + G ++  L          I PF              S+ E+   + AM  NK+PGPD FT EFF +SW  ++ S +    +FF+   
Subjt:  ---------WCPISGPEATSL----------IQPF--------------SELEVFQNLKAMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFHDFFQKD-

Query:  ------------------------FRPISLTTVLYRLI
                                FRP+S  TV+Y++I
Subjt:  ------------------------FRPISLTTVLYRLI


Sequences
CDS sequence
ATGAGATGGGAGAGAGTCGTCAAGTCGCGGTTGGACAGATTCCTTTATTCCCCGAATTGGGCGCTGAAATTCCAGGATCATCACACAAGAAGGCTTGGTCGTGTTACATC
CGATCACTTTCCCATAGTCCTTGAAAATCCAGAATTCAGGTGGGGCCCATGTCCATTCCGTTTCGATAACTACCTTCTGCATGAGAAAGATTTCACCAAGAATATTAAAT
TCTGGTGGGAATCAACATATCAAGAAGGCTTCCCTGGTTACTCTTTTATTAGAAGATTAAAGCAGCTAGTTAGCAGCATCAAGCTTTGGAAAAGGAATAACAGAGATCTC
CTCAAATCTAGGAAACAGATCATATCGGATGAGATTGCTAAGATAGACCGCTTGGAGAATTTGGGTGGTCTAACTGATCAAATGTGCCAGAAAAGAACCTCTCTTAAATC
TGATCTTCACCAAGTGTCTATGCAAGAGATCAGGCTGTGGAAGCAAAGATGCAAGAAAACCTGGCTTAAAGAAGGAGATGAAAATACCACCTTTTTCCATAAAGGCCCCA
CTTCTGAAGGATGGATGGTCTCAAATTTAAATTGGTGCCCTATTTCTGGTCCTGAGGCGACATCTTTGATTCAGCCTTTTTCAGAGCTAGAAGTATTTCAGAATCTAAAA
GCGATGGGTCATAATAAGTCCCCTGGCCCGGATGGATTCACTGTTGAATTCTTTAAAAAGTCCTGGATCGCTATCAGGCCTTCAGTTATGGAAGTGTTCCATGACTTCTT
TCAGAAGGACTTTAGACCCATTAGTCTCACCACAGTTCTTTATCGTCTTATCGCTAAGACTCTTGCAGAAAGGCTTAAAACTATTGATTTCTGGAAATGCTCTTGCACGA
GAGGATTCGTTATAAAGCTAGATATTGAAAAGGCTTTTGAGAAGATTTGTTGGAACTTCGTTGATAAGATTCTTGCTTTCAAGGGATACCCTATCACCTGGCCCAAGAGG
CAAAATCAAGGCCGAAAGAGGCATTCGTCAAGGAGATCCGATCTCCCCTTTTATTTTGTCCTAGCTATGGACTATCTTAGTCGAATTCTTCAGTCGGTTGAGCAAAAGGG
GCTTGTTAAGGGTTGTTCTCTCAACTCCGTCTCTGTCTCTCATCTTCTATTCGCAGATGACATTCTCCTTTTTGTTCAAGATAACGATGCTATGTTAGGCAACCTGTTCA
ACATCATCAAAGTATTTGAGCTATCTTCGGGTCTCAATATAAACTTCAACAAATCCTCTATAACGGGTATCAATGTGGAGGATTCCAGAGTTGCTCAAATTGCCGCCAAT
TGGGGATGCCCAACGACCCAATTCCCCATTCCTTATTTAGGCTCTCCCTTGGGGGGTAATCCATCATCGTCTCCGTTTTGGGCTAATACGATTGATAAGATTCATCGTAA
ATTGGATAGTTGGCGCTATTCCTATATTTCAAAGGGAGGAAGACTAACCTTGATTAGAGCAACTTTAAGTGGCATTCCCAACTACTTGTTATCTATCTTTAAAGCTCCGC
AATCGGTTTGCTTTAGTATAGATAAGATTATCAGATCTTTTCTTTGGCATGGGCAAGACCAAAGTAGTAGCATCCCTTTAGTTAGCTGGGATAAGGTGGCTGCGCCTATT
GAGGCTGGGGGTTTGGGCTTATTCAAGACTAGAGTTACAAATAGCGCATTTCAAGTCAAATGGCTTTGGAGTTCTTTCATGAGGAGACTTCTCTGTGGAAGCGAGTCATT
TCAGCAAAATATACAACCCATAGACAGGGAGCTCTCCCAACTCAGGCGCGATTCACTTCCTCCCGAGCGCCATGGACATCAATTCTCAAGCAAGCCTCTTTTTTTCTGGC
AAATACATAGGAAAAATATGACTGTCTCGGATGCCTGGGACCACACCACTTCATCGTGGAAGCTTTATCCTAGACGCCCTTTGTTCGACAGGGAGGCCTTTGCTCAGGCG
GAATGTGCTTCCTCCTTTCCAATTCCTGCATTATCTGGAGGGGCGGACCACATGCTTTGGAACCCGGACTCCAAGGGACGCTTCTCCGTTGCTTCAGCTAGGCAATTCTA
TTGGAATCTGTCATCGCCAGATCGGCCTCAGCCTCATTCCATTATTTTCTCAAATCTATGGAAGGCACTGATTCCAAAAAAGATCAAATTCTTTATTTGGACGGTTATTT
ACAGAAGATTAAATACAACAGATAGGCTTCAACGTATATTTAAATCCAATGCTCTAAATCCGAGCTGTTGCCCCCTTTGCCATGCAGGTTCAGACGAGTTGGATCATATT
TTTATCCATTGTGGGGTTGCTTCAGACATTTGGCGCTCTTTTCATCTAGCTACTGGTATTAATATGCCAATTCCTAGACAGGTGAATTGGATTTGTTTAGAGACCTTTGC
AGCCAAGGCTACTTCGCAGAGGGAGATTCTTATTCAGTCCATGGCGGCAGCAATTTTATGGGTTATTTGGGGTGAGCGTAATAGGAGGATTTTTCAGAACACATCCCGAA
CCTATATCAACATTTGGGAAGATATTATTACGCTTGCATCCTTTTGGGCATCCTCCACAAAAGCTTTCTCTAATTACCAGGCTTCCTCTATAGCTTTGAATTGGAAAGCT
TTTCTGTAA
mRNA sequence
ATGAGATGGGAGAGAGTCGTCAAGTCGCGGTTGGACAGATTCCTTTATTCCCCGAATTGGGCGCTGAAATTCCAGGATCATCACACAAGAAGGCTTGGTCGTGTTACATC
CGATCACTTTCCCATAGTCCTTGAAAATCCAGAATTCAGGTGGGGCCCATGTCCATTCCGTTTCGATAACTACCTTCTGCATGAGAAAGATTTCACCAAGAATATTAAAT
TCTGGTGGGAATCAACATATCAAGAAGGCTTCCCTGGTTACTCTTTTATTAGAAGATTAAAGCAGCTAGTTAGCAGCATCAAGCTTTGGAAAAGGAATAACAGAGATCTC
CTCAAATCTAGGAAACAGATCATATCGGATGAGATTGCTAAGATAGACCGCTTGGAGAATTTGGGTGGTCTAACTGATCAAATGTGCCAGAAAAGAACCTCTCTTAAATC
TGATCTTCACCAAGTGTCTATGCAAGAGATCAGGCTGTGGAAGCAAAGATGCAAGAAAACCTGGCTTAAAGAAGGAGATGAAAATACCACCTTTTTCCATAAAGGCCCCA
CTTCTGAAGGATGGATGGTCTCAAATTTAAATTGGTGCCCTATTTCTGGTCCTGAGGCGACATCTTTGATTCAGCCTTTTTCAGAGCTAGAAGTATTTCAGAATCTAAAA
GCGATGGGTCATAATAAGTCCCCTGGCCCGGATGGATTCACTGTTGAATTCTTTAAAAAGTCCTGGATCGCTATCAGGCCTTCAGTTATGGAAGTGTTCCATGACTTCTT
TCAGAAGGACTTTAGACCCATTAGTCTCACCACAGTTCTTTATCGTCTTATCGCTAAGACTCTTGCAGAAAGGCTTAAAACTATTGATTTCTGGAAATGCTCTTGCACGA
GAGGATTCGTTATAAAGCTAGATATTGAAAAGGCTTTTGAGAAGATTTGTTGGAACTTCGTTGATAAGATTCTTGCTTTCAAGGGATACCCTATCACCTGGCCCAAGAGG
CAAAATCAAGGCCGAAAGAGGCATTCGTCAAGGAGATCCGATCTCCCCTTTTATTTTGTCCTAGCTATGGACTATCTTAGTCGAATTCTTCAGTCGGTTGAGCAAAAGGG
GCTTGTTAAGGGTTGTTCTCTCAACTCCGTCTCTGTCTCTCATCTTCTATTCGCAGATGACATTCTCCTTTTTGTTCAAGATAACGATGCTATGTTAGGCAACCTGTTCA
ACATCATCAAAGTATTTGAGCTATCTTCGGGTCTCAATATAAACTTCAACAAATCCTCTATAACGGGTATCAATGTGGAGGATTCCAGAGTTGCTCAAATTGCCGCCAAT
TGGGGATGCCCAACGACCCAATTCCCCATTCCTTATTTAGGCTCTCCCTTGGGGGGTAATCCATCATCGTCTCCGTTTTGGGCTAATACGATTGATAAGATTCATCGTAA
ATTGGATAGTTGGCGCTATTCCTATATTTCAAAGGGAGGAAGACTAACCTTGATTAGAGCAACTTTAAGTGGCATTCCCAACTACTTGTTATCTATCTTTAAAGCTCCGC
AATCGGTTTGCTTTAGTATAGATAAGATTATCAGATCTTTTCTTTGGCATGGGCAAGACCAAAGTAGTAGCATCCCTTTAGTTAGCTGGGATAAGGTGGCTGCGCCTATT
GAGGCTGGGGGTTTGGGCTTATTCAAGACTAGAGTTACAAATAGCGCATTTCAAGTCAAATGGCTTTGGAGTTCTTTCATGAGGAGACTTCTCTGTGGAAGCGAGTCATT
TCAGCAAAATATACAACCCATAGACAGGGAGCTCTCCCAACTCAGGCGCGATTCACTTCCTCCCGAGCGCCATGGACATCAATTCTCAAGCAAGCCTCTTTTTTTCTGGC
AAATACATAGGAAAAATATGACTGTCTCGGATGCCTGGGACCACACCACTTCATCGTGGAAGCTTTATCCTAGACGCCCTTTGTTCGACAGGGAGGCCTTTGCTCAGGCG
GAATGTGCTTCCTCCTTTCCAATTCCTGCATTATCTGGAGGGGCGGACCACATGCTTTGGAACCCGGACTCCAAGGGACGCTTCTCCGTTGCTTCAGCTAGGCAATTCTA
TTGGAATCTGTCATCGCCAGATCGGCCTCAGCCTCATTCCATTATTTTCTCAAATCTATGGAAGGCACTGATTCCAAAAAAGATCAAATTCTTTATTTGGACGGTTATTT
ACAGAAGATTAAATACAACAGATAGGCTTCAACGTATATTTAAATCCAATGCTCTAAATCCGAGCTGTTGCCCCCTTTGCCATGCAGGTTCAGACGAGTTGGATCATATT
TTTATCCATTGTGGGGTTGCTTCAGACATTTGGCGCTCTTTTCATCTAGCTACTGGTATTAATATGCCAATTCCTAGACAGGTGAATTGGATTTGTTTAGAGACCTTTGC
AGCCAAGGCTACTTCGCAGAGGGAGATTCTTATTCAGTCCATGGCGGCAGCAATTTTATGGGTTATTTGGGGTGAGCGTAATAGGAGGATTTTTCAGAACACATCCCGAA
CCTATATCAACATTTGGGAAGATATTATTACGCTTGCATCCTTTTGGGCATCCTCCACAAAAGCTTTCTCTAATTACCAGGCTTCCTCTATAGCTTTGAATTGGAAAGCT
TTTCTGTAA
Protein sequence
MRWERVVKSRLDRFLYSPNWALKFQDHHTRRLGRVTSDHFPIVLENPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWESTYQEGFPGYSFIRRLKQLVSSIKLWKRNNRDL
LKSRKQIISDEIAKIDRLENLGGLTDQMCQKRTSLKSDLHQVSMQEIRLWKQRCKKTWLKEGDENTTFFHKGPTSEGWMVSNLNWCPISGPEATSLIQPFSELEVFQNLK
AMGHNKSPGPDGFTVEFFKKSWIAIRPSVMEVFHDFFQKDFRPISLTTVLYRLIAKTLAERLKTIDFWKCSCTRGFVIKLDIEKAFEKICWNFVDKILAFKGYPITWPKR
QNQGRKRHSSRRSDLPFYFVLAMDYLSRILQSVEQKGLVKGCSLNSVSVSHLLFADDILLFVQDNDAMLGNLFNIIKVFELSSGLNINFNKSSITGINVEDSRVAQIAAN
WGCPTTQFPIPYLGSPLGGNPSSSPFWANTIDKIHRKLDSWRYSYISKGGRLTLIRATLSGIPNYLLSIFKAPQSVCFSIDKIIRSFLWHGQDQSSSIPLVSWDKVAAPI
EAGGLGLFKTRVTNSAFQVKWLWSSFMRRLLCGSESFQQNIQPIDRELSQLRRDSLPPERHGHQFSSKPLFFWQIHRKNMTVSDAWDHTTSSWKLYPRRPLFDREAFAQA
ECASSFPIPALSGGADHMLWNPDSKGRFSVASARQFYWNLSSPDRPQPHSIIFSNLWKALIPKKIKFFIWTVIYRRLNTTDRLQRIFKSNALNPSCCPLCHAGSDELDHI
FIHCGVASDIWRSFHLATGINMPIPRQVNWICLETFAAKATSQREILIQSMAAAILWVIWGERNRRIFQNTSRTYINIWEDIITLASFWASSTKAFSNYQASSIALNWKA
FL