; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035431 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035431
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
Genome locationchr3:21369596..21374380
RNA-Seq ExpressionLag0035431
SyntenyLag0035431
Gene Ontology termsGO:0050789 - regulation of biological process (biological process)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR025558 - Domain of unknown function DUF4283
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN65484.1 hypothetical protein VITISV_029474 [Vitis vinifera]9.1e-19843.3Show/hide
Query:  KKRNREESRLNVIDRRLVKSIWSSRHIAWATLDAINSAGGILILW---KESEIEVV----------------------------ESEKKHFLQELYDLHG
        K  + +E++    DRR V S+W++R+  WA L A  ++GGILI+W   K S  EVV                             + +K F  EL D+ G
Subjt:  KKRNREESRLNVIDRRLVKSIWSSRHIAWATLDAINSAGGILILW---KESEIEVV----------------------------ESEKKHFLQELYDLHG

Query:  LCQGVWCLVGDFNLIRWVDERLNASNSTRNMKKFNRFIASADLIDPPLGNGKFTWSRMGENVAASRIDRFLLSKQWVDKFSDFRVERLHRPTSDHFPILL
        L    WC+ GDFN+IR   E+L  S  T +MK F+ FI+  +LID PL +  FTWS M  N    R+DRFL S +W   F       L R TSDH+PI+L
Subjt:  LCQGVWCLVGDFNLIRWVDERLNASNSTRNMKKFNRFIASADLIDPPLGNGKFTWSRMGENVAASRIDRFLLSKQWVDKFSDFRVERLHRPTSDHFPILL

Query:  TVGANKWGPTPFRFENIWLENSKFKEKIESWWQNMNPNGWAGFRFMEKLKGLKNQIKSWNAIARSQAVSKKRDIVAQIDLIDRLEEQSEIQSHQIEERKR
             KWGPTPFRFEN+WL++  FKE    WW+    NGW G +FM KL+ +K ++K WN  +  +   +K DI++ +   D LE++  +    + +R  
Subjt:  TVGANKWGPTPFRFENIWLENSKFKEKIESWWQNMNPNGWAGFRFMEKLKGLKNQIKSWNAIARSQAVSKKRDIVAQIDLIDRLEEQSEIQSHQIEERKR

Query:  LKAELLEFALDEQRCLNQKSKIKWLKEGDENSSFFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFFDKLYEKDLGPKFVIEGVEWMPIDSQNR
         K EL E  L E+    QK+++KW+KEGD NS FFH+    +RN+ FI  LEN++G ++ N   I+EEIL +F+KLY    G  + +EG++W PI  ++ 
Subjt:  LKAELLEFALDEQRCLNQKSKIKWLKEGDENSSFFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFFDKLYEKDLGPKFVIEGVEWMPIDSQNR

Query:  INMEMNFSEEEIHKAIREMGSLKSPGPNDMMGEFFKNYWNILKLDLVEVFQEFFQNEIINKRANETYICLIPKKKKASNVKDYRPINLVTSLYKLIAKVL
          +E  F+EEEI KAI +M   K+PGP+      F++ W ++K DLV+VF EF ++ IIN+  N ++I L+PKK  +  + D+RPI+L+TSLYK+IAKVL
Subjt:  INMEMNFSEEEIHKAIREMGSLKSPGPNDMMGEFFKNYWNILKLDLVEVFQEFFQNEIINKRANETYICLIPKKKKASNVKDYRPINLVTSLYKLIAKVL

Query:  AERLKKVLPLTISNWQAAFVHGRQILDAILVAAEAVEDKRCRKDNGVLLKLDLEKAYDMVNWDFLDTILQLKGFGHRWRKWIKGCLTNSNFSVMINGRLR
        A R++ VL  TI + Q AFV GRQILDA+L+A E V++KR   + GV+ K+D EKAYD V+WDFLD +L++KGFG RWRKW++GCL++ +F+V++NG  +
Subjt:  AERLKKVLPLTISNWQAAFVHGRQILDAILVAAEAVEDKRCRKDNGVLLKLDLEKAYDMVNWDFLDTILQLKGFGHRWRKWIKGCLTNSNFSVMINGRLR

Query:  GKIMASRGLRQGDSLSPFLFTIVGDAISRSVQFCLEKKILNGWKIGKDGVMVSLLQYADDTLVFIPNNAADIQKWWDILTLILKGAGLYLNMSKTSIIGV
        G + ASRGLRQGD LSPFLFTIV D +SR +    E+ +L G+K+G++   VS LQ+ADDT+ F  +   D+    ++L +    +GL +N+ K++I G+
Subjt:  GKIMASRGLRQGDSLSPFLFTIVGDAISRSVQFCLEKKILNGWKIGKDGVMVSLLQYADDTLVFIPNNAADIQKWWDILTLILKGAGLYLNMSKTSIIGV

Query:  NVDSVEIATWAKQFGCKADSLPINYLGFSLGGNYRRSIFWDPLLERLNRKLDSWRNFPISKGGRVTFAQSVLNSLPLYQFSLLKAPKSVIKSMEKIIRDF
        N++   ++  A+   CKA   PI YLG  LGGN + S FWDP++ER++R+LD W+   +S GGR+T  QS L  +P Y  SL K P SV   +E++ RDF
Subjt:  NVDSVEIATWAKQFGCKADSLPINYLGFSLGGNYRRSIFWDPLLERLNRKLDSWRNFPISKGGRVTFAQSVLNSLPLYQFSLLKAPKSVIKSMEKIIRDF

Query:  VWSGGSYKPRERI
        +WSG     R+ +
Subjt:  VWSGGSYKPRERI

KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]3.7e-19931.89Show/hide
Query:  IQEVHLKKIYASSAKEIVIVWIIDSIDDLLNSSSTHKFFHKVDCNNGFIWIQKISNKQGSFL--EITKVNNSGGKHNLVVSAGTEFNGWKSFSNLLKEFL
        + EV   K ++       + WI +   DLL++S+T  FF +    +  +W++K  NK  + +  EI +++N G K +++V  G +  GWKSF  L+    
Subjt:  IQEVHLKKIYASSAKEIVIVWIIDSIDDLLNSSSTHKFFHKVDCNNGFIWIQKISNKQGSFL--EITKVNNSGGKHNLVVSAGTEFNGWKSFSNLLKEFL

Query:  NGKDDLQEQSKEKEKGGVGRNEDGKSFADILKSSPNYNPR--AETMTQS--DQQKTTEKAWRNITEIR-----GYKE-EVRNIDWDEVIVITKRDFHDDW
              +  +  K      R E   +F+D   S  + + +  A+ ++ S  D  K   KA  + +  R     G+K   +    +++ ++IT+R FHDDW
Subjt:  NGKDDLQEQSKEKEKGGVGRNEDGKSFADILKSSPNYNPR--AETMTQS--DQQKTTEKAWRNITEIR-----GYKE-EVRNIDWDEVIVITKRDFHDDW

Query:  GRILEVMQHQLMETLVINPFHPDKDLLKCPSRELATLLTKNM---GWVSFGPIILKVEKWDKQIHNRITCIPSYGGWIKIRNLPLHLWNLQTFKAIGNCL
         RI+  ++ Q        PF  DK +L   + + A LL  N    GW + G   +K E WD  +H+  + IPSYGGW++ R +PLHLWN  TF+ IG+  
Subjt:  GRILEVMQHQLMETLVINPFHPDKDLLKCPSRELATLLTKNM---GWVSFGPIILKVEKWDKQIHNRITCIPSYGGWIKIRNLPLHLWNLQTFKAIGNCL

Query:  GGFVEYDEANSLLFQCVEVKMKIKENCCGFILVELKVVDGE-EQFNVQIVTYQEGNLLIDRVAGIHGSFSPAAAHVFHRGPNDALFCTADIWR------I
        GGF++  +    + + ++ K+K++ N  GF+   + + D + E F V  V   E   L++R   +HGSF   AA  F +  + A   T + ++       
Subjt:  GGFVEYDEANSLLFQCVEVKMKIKENCCGFILVELKVVDGE-EQFNVQIVTYQEGNLLIDRVAGIHGSFSPAAAHVFHRGPNDALFCTADIWR------I

Query:  ENGVDYPVVITHQAVKES-RYCGRQLENASENFE--------SSRPKQKGK----INEGAGPSNIEGKSPLKEKDETVNWNGEIGMESDLSFSSPASRGS
            DY +   H + K S  Y  +  +N S   E        S R K+KGK    IN+         K   +  +  V++    G++S+ S +   ++G 
Subjt:  ENGVDYPVVITHQAVKES-RYCGRQLENASENFE--------SSRPKQKGK----INEGAGPSNIEGKSPLKEKDETVNWNGEIGMESDLSFSSPASRGS

Query:  E------NERVGKMSNDEEA------------DYEFPEGYQLCFSNEAESDKEAQPQSNDVKAISGQEDVNREVITH----------DANHRSSPSDYCH
               N++  K  +  +               E  E ++L      E  K+    S D+  IS  E + +    H          D N +S+ S    
Subjt:  E------NERVGKMSNDEEA------------DYEFPEGYQLCFSNEAESDKEAQPQSNDVKAISGQEDVNREVITH----------DANHRSSPSDYCH

Query:  DNTTHSMASPKALAIIAPLIVQRNTLEEGTG-----DFMISKELILTLRRNNLCIRPITGTNSEKGNSTKKKRNREESRLNVIDRRLVKSIWSSRHIAWA
        + T             +    + N+ +  TG     D    ++L++ L+ N L + P   TN    +S                      I S +++  A
Subjt:  DNTTHSMASPKALAIIAPLIVQRNTLEEGTG-----DFMISKELILTLRRNNLCIRPITGTNSEKGNSTKKKRNREESRLNVIDRRLVKSIWSSRHIAWA

Query:  TLDAINSAGGILILWKESEIEVVE------------------------------SEKKHFLQELYDLHGLCQGVWCLVGDFNLIRWVDERLNASNSTRNM
            +   GGIL+LW ++  +V +                              +++     EL  L  LC   W + GDFN++RW  E    S   RNM
Subjt:  TLDAINSAGGILILWKESEIEVVE------------------------------SEKKHFLQELYDLHGLCQGVWCLVGDFNLIRWVDERLNASNSTRNM

Query:  KKFNRFIASADLIDPPLGNGKFTWSRMGENVAASRIDRFLLSKQWVDKFSDFRVERLHRPTSDHFPILLTVGANKWGPTPFRFENIWLENSKFKEKIESW
          FN FI+  +LIDPP  N  FTWS +  N   SR+DRFLLSK W + F       L R  SDHFPILL     KWGP PFR  N  L + +F++   +W
Subjt:  KKFNRFIASADLIDPPLGNGKFTWSRMGENVAASRIDRFLLSKQWVDKFSDFRVERLHRPTSDHFPILLTVGANKWGPTPFRFENIWLENSKFKEKIESW

Query:  WQNMNPNGWAGFRFMEKLKGLKNQIKSWNAIARSQAVSKKRDIVAQIDLIDRLEEQSEIQSHQIEERKRLKAELLEFALDEQRCLNQKSKIKWLKEGDEN
        W +    G+ G+ F++ L  L   IK W     +   + K+ ++ +ID+ID+LE Q E+ +   ++R  LK++LL    ++ +  +Q+++ +W   GDEN
Subjt:  WQNMNPNGWAGFRFMEKLKGLKNQIKSWNAIARSQAVSKKRDIVAQIDLIDRLEEQSEIQSHQIEERKRLKAELLEFALDEQRCLNQKSKIKWLKEGDEN

Query:  SSFFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFFDKLYEKDLGPKFVIEGVEWMPIDSQNRINMEMNFSEEEIHKAIREMGSLKSPGPNDMM
        +S+FHR  T  + K  I  + + +G  L +   I    +S F  +Y K+   + +I+ + W PI    +  +   F E EI   I    + K+PGP+   
Subjt:  SSFFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFFDKLYEKDLGPKFVIEGVEWMPIDSQNRINMEMNFSEEEIHKAIREMGSLKSPGPNDMM

Query:  GEFFKNYWNILKLDLVEVFQEFFQNEIINKRANETYICLIPKKKKASNVKDYRPINLVTSLYKLIAKVLAERLKKVLPLTISNWQAAFVHGRQILDAILV
          F+K +W  LK DL+ VF++F +  I+N   N T+I LI KK+K S   DYRPI+L TSLYK++AK LA RLK  LP TI+  Q AF+ GRQI DAIL+
Subjt:  GEFFKNYWNILKLDLVEVFQEFFQNEIINKRANETYICLIPKKKKASNVKDYRPINLVTSLYKLIAKVLAERLKKVLPLTISNWQAAFVHGRQILDAILV

Query:  AAEAVEDKRCRKDNGVLLKLDLEKAYDMVNWDFLDTILQLKGFGHRWRKWIKGCLTNSNFSVMINGRLRGKIMASRGLRQGDSLSPFLFTIVGDAISRSV
        A EA++  + RK  G +LKLD+EKA+D ++W F+D +L  K F H+WRKWIK C++N  +S+++NG  +G+I A RG+RQGD LSPF+F +  D +SR +
Subjt:  AAEAVEDKRCRKDNGVLLKLDLEKAYDMVNWDFLDTILQLKGFGHRWRKWIKGCLTNSNFSVMINGRLRGKIMASRGLRQGDSLSPFLFTIVGDAISRSV

Query:  QFCLEKKILNGWKIGKDGVMVSLLQYADDTLVFIPNNAADIQKWWDILTLILKGAGLYLNMSKTSIIGVNVDSVEIATWAKQFGCKADSLPINYLGFSLG
             K  + G     +   +S L +ADD L+F+ +N   +      LTL  K +GL  N SK++I  +N+ +      A  FG +   LP+NYLG  LG
Subjt:  QFCLEKKILNGWKIGKDGVMVSLLQYADDTLVFIPNNAADIQKWWDILTLILKGAGLYLNMSKTSIIGVNVDSVEIATWAKQFGCKADSLPINYLGFSLG

Query:  GNYRRSIFWDPLLERLNRKLDSWRNFPISKGGRVTFAQSVLNSLPLYQFSLLKAPKSVIKSMEKIIRDFVWSGGSYK
        GN R   FWD  +E +++KL+ W+   ISKGGR+T  ++ L+SLP YQ S  KAP SV K +EK  RDF+W G   K
Subjt:  GNYRRSIFWDPLLERLNRKLDSWRNFPISKGGRVTFAQSVLNSLPLYQFSLLKAPKSVIKSMEKIIRDFVWSGGSYK

RVW99790.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]1.4e-19843.69Show/hide
Query:  EESRLNVIDRRLVKSIWSSRHIAWATLDAINSAGGILILWKESEI---EVV----------------------------ESEKKHFLQELYDLHGLCQGV
        +E++    DRRLV S+WS R+  WA L A  ++GGILI+W   ++   EVV                             + +K F  EL D+ GL    
Subjt:  EESRLNVIDRRLVKSIWSSRHIAWATLDAINSAGGILILWKESEI---EVV----------------------------ESEKKHFLQELYDLHGLCQGV

Query:  WCLVGDFNLIRWVDERLNASNSTRNMKKFNRFIASADLIDPPLGNGKFTWSRMGENVAASRIDRFLLSKQWVDKFSDFRVERLHRPTSDHFPILLTVGAN
        WC+ GDFN+IR   E+L  S  +  MK F+ FI   +LID PL +  +TWS M EN    R+DRFL S +W   F       L R TSDH+PI+L     
Subjt:  WCLVGDFNLIRWVDERLNASNSTRNMKKFNRFIASADLIDPPLGNGKFTWSRMGENVAASRIDRFLLSKQWVDKFSDFRVERLHRPTSDHFPILLTVGAN

Query:  KWGPTPFRFENIWLENSKFKEKIESWWQNMNPNGWAGFRFMEKLKGLKNQIKSWNAIARSQAVSKKRDIVAQIDLIDRLEEQSEIQSHQIEERKRLKAEL
        KWGPTPF+FEN+WL++S FKE    WW     NGW G +FM KL+ +K ++K WN  +  +   KK+DI+A +   D LE++  +    + +R   K EL
Subjt:  KWGPTPFRFENIWLENSKFKEKIESWWQNMNPNGWAGFRFMEKLKGLKNQIKSWNAIARSQAVSKKRDIVAQIDLIDRLEEQSEIQSHQIEERKRLKAEL

Query:  LEFALDEQRCLNQKSKIKWLKEGDENSSFFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFFDKLYEKDLGPKFVIEGVEWMPIDSQNRINMEM
         E  L E+    QK+++KW+KEGD NS+FFH+    +RN+ FI  LEN+SG +L N   I+EEIL +F+KLY    G  + +EG++W PID ++   +E 
Subjt:  LEFALDEQRCLNQKSKIKWLKEGDENSSFFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFFDKLYEKDLGPKFVIEGVEWMPIDSQNRINMEM

Query:  NFSEEEIHKAIREMGSLKSPGPNDMMGEFFKNYWNILKLDLVEVFQEFFQNEIINKRANETYICLIPKKKKASNVKDYRPINLVTSLYKLIAKVLAERLK
         F+EEEI+KAI +M   K+PGP+D     F++ W+++K DLV VF EF ++ IIN+  N ++I LIPKK  +  + D+RPI+L+TSLY++IAKVLA RL+
Subjt:  NFSEEEIHKAIREMGSLKSPGPNDMMGEFFKNYWNILKLDLVEVFQEFFQNEIINKRANETYICLIPKKKKASNVKDYRPINLVTSLYKLIAKVLAERLK

Query:  KVLPLTISNWQAAFVHGRQILDAILVAAEAVEDKRCRKDNGVLLKLDLEKAYDMVNWDFLDTILQLKGFGHRWRKWIKGCLTNSNFSVMINGRLRGKIMA
         VL  TI + Q AFV GRQILDA+L+A E V++KR   + GV+ K+D EKAYD V+WDFLD +L++KGF  RWRKW++GCL++ +++V++NG  +G + A
Subjt:  KVLPLTISNWQAAFVHGRQILDAILVAAEAVEDKRCRKDNGVLLKLDLEKAYDMVNWDFLDTILQLKGFGHRWRKWIKGCLTNSNFSVMINGRLRGKIMA

Query:  SRGLRQGDSLSPFLFTIVGDAISRSVQFCLEKKILNGWKIGKDGVMVSLLQYADDTLVFIPNNAADIQKWWDILTLILKGAGLYLNMSKTSIIGVNVDSV
        SRGLRQGD LSPFLFTIV D +SR +    E+ +L G+++G++   VS LQ+ADDT+ F      D+     +L +    +GL +N+ K++I G+N++  
Subjt:  SRGLRQGDSLSPFLFTIVGDAISRSVQFCLEKKILNGWKIGKDGVMVSLLQYADDTLVFIPNNAADIQKWWDILTLILKGAGLYLNMSKTSIIGVNVDSV

Query:  EIATWAKQFGCKADSLPINYLGFSLGGNYRRSIFWDPLLERLNRKLDSWRNFPISKGGRVTFAQSVLNSLPLYQFSLLKAPKSVIKSMEKIIRDFVWSGG
         ++  A    CKA   PI YLG  LGGN + S FWDP++ER++R+LD W+   +S GGR+T  QS L  +P Y  SL K P SV   +E++ R+F+WSG 
Subjt:  EIATWAKQFGCKADSLPINYLGFSLGGNYRRSIFWDPLLERLNRKLDSWRNFPISKGGRVTFAQSVLNSLPLYQFSLLKAPKSVIKSMEKIIRDFVWSGG

Query:  SYKPRERI
            R+ +
Subjt:  SYKPRERI

RVX13544.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]4.1e-19843.19Show/hide
Query:  EESRLNVIDRRLVKSIWSSRHIAWATLDAINSAGGILILW---KESEIEVV----------------------------ESEKKHFLQELYDLHGLCQGV
        +E++    DRR V S+W++R+  WATL A  ++GGILI+W   K S  EV+                             + +K    EL D+ GL    
Subjt:  EESRLNVIDRRLVKSIWSSRHIAWATLDAINSAGGILILW---KESEIEVV----------------------------ESEKKHFLQELYDLHGLCQGV

Query:  WCLVGDFNLIRWVDERLNASNSTRNMKKFNRFIASADLIDPPLGNGKFTWSRMGENVAASRIDRFLLSKQWVDKFSDFRVERLHRPTSDHFPILLTVGAN
        WC+ GDFN+IR   E+L  S  T +MK F+ FI+  +LID PL +  FTWS M  N    R+DRFL S +W   F       L R TSDH+PI+L     
Subjt:  WCLVGDFNLIRWVDERLNASNSTRNMKKFNRFIASADLIDPPLGNGKFTWSRMGENVAASRIDRFLLSKQWVDKFSDFRVERLHRPTSDHFPILLTVGAN

Query:  KWGPTPFRFENIWLENSKFKEKIESWWQNMNPNGWAGFRFMEKLKGLKNQIKSWNAIARSQAVSKKRDIVAQIDLIDRLEEQSEIQSHQIEERKRLKAEL
        KWGPTPFRFEN+WL++  FKE    WW+    NGW G +FM KL+ +K ++K WN  +  +   +K DI++ +   D LE++  +    + +R   K EL
Subjt:  KWGPTPFRFENIWLENSKFKEKIESWWQNMNPNGWAGFRFMEKLKGLKNQIKSWNAIARSQAVSKKRDIVAQIDLIDRLEEQSEIQSHQIEERKRLKAEL

Query:  LEFALDEQRCLNQKSKIKWLKEGDENSSFFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFFDKLYEKDLGPKFVIEGVEWMPIDSQNRINMEM
         E  L E+    QK+++KW+KEGD NS FFH+    +RN+ FI  LEN++G ++ N   I+EEIL +F+KLY    G  + +EG++W PI  ++ + +E 
Subjt:  LEFALDEQRCLNQKSKIKWLKEGDENSSFFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFFDKLYEKDLGPKFVIEGVEWMPIDSQNRINMEM

Query:  NFSEEEIHKAIREMGSLKSPGPNDMMGEFFKNYWNILKLDLVEVFQEFFQNEIINKRANETYICLIPKKKKASNVKDYRPINLVTSLYKLIAKVLAERLK
         F+EEEI KAI +M   K+PGP+      F++ W ++K DLV+VF EF ++ IIN+  N ++I L+PKK  +  + D+RPI+L+TSLYK+IAKVLA R++
Subjt:  NFSEEEIHKAIREMGSLKSPGPNDMMGEFFKNYWNILKLDLVEVFQEFFQNEIINKRANETYICLIPKKKKASNVKDYRPINLVTSLYKLIAKVLAERLK

Query:  KVLPLTISNWQAAFVHGRQILDAILVAAEAVEDKRCRKDNGVLLKLDLEKAYDMVNWDFLDTILQLKGFGHRWRKWIKGCLTNSNFSVMINGRLRGKIMA
        +VL  TI + Q AFV GRQILDA+L+A E V++KR   + GV+ K+D EKAYD V+WDFLD ++++KGFG RWRKW++GCL++ +F+V++NG  +G + A
Subjt:  KVLPLTISNWQAAFVHGRQILDAILVAAEAVEDKRCRKDNGVLLKLDLEKAYDMVNWDFLDTILQLKGFGHRWRKWIKGCLTNSNFSVMINGRLRGKIMA

Query:  SRGLRQGDSLSPFLFTIVGDAISRSVQFCLEKKILNGWKIGKDGVMVSLLQYADDTLVFIPNNAADIQKWWDILTLILKGAGLYLNMSKTSIIGVNVDSV
        SRGLRQGD LSPFLFTIV D +SR +    E+ +L G+K+G++   VS LQ+ADDT+ F  +   D+    ++L +    +GL +N+ K++I G+N++  
Subjt:  SRGLRQGDSLSPFLFTIVGDAISRSVQFCLEKKILNGWKIGKDGVMVSLLQYADDTLVFIPNNAADIQKWWDILTLILKGAGLYLNMSKTSIIGVNVDSV

Query:  EIATWAKQFGCKADSLPINYLGFSLGGNYRRSIFWDPLLERLNRKLDSWRNFPISKGGRVTFAQSVLNSLPLYQFSLLKAPKSVIKSMEKIIRDFVWSGG
         ++  A+   CKA   PI YLG  LGGN + S FWDP++ER++R+LD W+   +S GGR+T  QS L  +P Y  SL K P SV   +E++ RDF+WSG 
Subjt:  EIATWAKQFGCKADSLPINYLGFSLGGNYRRSIFWDPLLERLNRKLDSWRNFPISKGGRVTFAQSVLNSLPLYQFSLLKAPKSVIKSMEKIIRDFVWSGG

Query:  SYKPRERI
            R+ +
Subjt:  SYKPRERI

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.1e-19831.82Show/hide
Query:  IQEVHLKKIYASSAKEIVIVWIIDSIDDLLNSSSTHKFFHKVDCNNGFIWIQKISNKQGSFL--EITKVNNSGGKHNLVVSAGTEFNGWKSFSNLLKEFL
        + EV   K ++       + WI +   DLL++S+T  FF +    +  +W++K  NK  + +  EI +++N G K +++V  G +  GWKSF  L+    
Subjt:  IQEVHLKKIYASSAKEIVIVWIIDSIDDLLNSSSTHKFFHKVDCNNGFIWIQKISNKQGSFL--EITKVNNSGGKHNLVVSAGTEFNGWKSFSNLLKEFL

Query:  NGKDDLQEQSKEKEKGGVGRNEDGKSFADILKSSPNYNPR--AETMTQS--DQQKTTEKAWRNITEIR-----GYKE-EVRNIDWDEVIVITKRDFHDDW
              +  +  K      R E   +F+D   S  + + +  A+ ++ S  D  K   KA  + +  R     G+K   +    +++ ++IT+R FHDDW
Subjt:  NGKDDLQEQSKEKEKGGVGRNEDGKSFADILKSSPNYNPR--AETMTQS--DQQKTTEKAWRNITEIR-----GYKE-EVRNIDWDEVIVITKRDFHDDW

Query:  GRILEVMQHQLMETLVINPFHPDKDLLKCPSRELATLLTKNM---GWVSFGPIILKVEKWDKQIHNRITCIPSYGGWIKIRNLPLHLWNLQTFKAIGNCL
         RI+  ++ Q        PF  DK +L   + + A LL  N    GW + G   +K E WD  +H+  + IPSYGGW++ R +PLHLWN  TF+ IG+  
Subjt:  GRILEVMQHQLMETLVINPFHPDKDLLKCPSRELATLLTKNM---GWVSFGPIILKVEKWDKQIHNRITCIPSYGGWIKIRNLPLHLWNLQTFKAIGNCL

Query:  GGFVEYDEANSLLFQCVEVKMKIKENCCGFILVELKVVDGE-EQFNVQIVTYQEGNLLIDRVAGIHGSFSPAAAHVFHRGPNDALFCTADIWR------I
        GGF++  +    + + ++ K+K++ N  GF+   + + D + E F V  V   E   L++R   +HGSF   AA  F +  + A   T + ++       
Subjt:  GGFVEYDEANSLLFQCVEVKMKIKENCCGFILVELKVVDGE-EQFNVQIVTYQEGNLLIDRVAGIHGSFSPAAAHVFHRGPNDALFCTADIWR------I

Query:  ENGVDYPVVITHQAVKES-RYCGRQLENASENFE--------SSRPKQKGK----INEGAGPSNIEGKSPLKEKDETVNWNGEIGMESDLSFSSPASRGS
            DY +   H + K S  Y  +  +N S   E        S R K+KGK    IN+         K   +  +  V++    G++S+ S +   ++G 
Subjt:  ENGVDYPVVITHQAVKES-RYCGRQLENASENFE--------SSRPKQKGK----INEGAGPSNIEGKSPLKEKDETVNWNGEIGMESDLSFSSPASRGS

Query:  E------NERVGKMSNDEEA------------DYEFPEGYQLCFSNEAESDKEAQPQSNDVKAISGQEDVNREVITH----------DANHRSSPSDYCH
               N++  K  +  +               E  E ++L      E  K+    S D+  IS  E + +    H          D N +S+ S    
Subjt:  E------NERVGKMSNDEEA------------DYEFPEGYQLCFSNEAESDKEAQPQSNDVKAISGQEDVNREVITH----------DANHRSSPSDYCH

Query:  DNTTHSMASPKALAIIAPLIVQRNTLEEGTG-----DFMISKELILTLRRNNLCIRPITGTNSEKGNSTKKKRNREESRLNVIDRRLVKSIWSSRHIAWA
        + T             +    + N+ +  TG     D    ++L++ L+ N L + P   TN    +S                      I S +++  A
Subjt:  DNTTHSMASPKALAIIAPLIVQRNTLEEGTG-----DFMISKELILTLRRNNLCIRPITGTNSEKGNSTKKKRNREESRLNVIDRRLVKSIWSSRHIAWA

Query:  TLDAINSAGGILILWKESEIEVVE------------------------------SEKKHFLQELYDLHGLCQGVWCLVGDFNLIRWVDERLNASNSTRNM
            +   GGIL+LW ++  +V +                              +++     EL  L  LC   W + GDFN++RW  E    S   RNM
Subjt:  TLDAINSAGGILILWKESEIEVVE------------------------------SEKKHFLQELYDLHGLCQGVWCLVGDFNLIRWVDERLNASNSTRNM

Query:  KKFNRFIASADLIDPPLGNGKFTWSRMGENVAASRIDRFLLSKQWVDKFSDFRVERLHRPTSDHFPILLTVGANKWGPTPFRFENIWLENSKFKEKIESW
          FN FI+  +LIDPP  N  FTWS +  N   SR+DRFLLSK W + F       L R  SDHFPILL     KWGP PFR  N  L + +F++   +W
Subjt:  KKFNRFIASADLIDPPLGNGKFTWSRMGENVAASRIDRFLLSKQWVDKFSDFRVERLHRPTSDHFPILLTVGANKWGPTPFRFENIWLENSKFKEKIESW

Query:  WQNMNPNGWAGFRFMEKLKGLKNQIKSWNAIARSQAVSKKRDIVAQIDLIDRLEEQSEIQSHQIEERKRLKAELLEFALDEQRCLNQKSKIKWLKEGDEN
        W +    G+ G+ F++ L  L   IK W     +   + K+ ++ +ID+ID+LE Q E+ +   ++R  LK++LL    ++ +  +Q+++ +W   GDEN
Subjt:  WQNMNPNGWAGFRFMEKLKGLKNQIKSWNAIARSQAVSKKRDIVAQIDLIDRLEEQSEIQSHQIEERKRLKAELLEFALDEQRCLNQKSKIKWLKEGDEN

Query:  SSFFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFFDKLYEKDLGPKFVIEGVEWMPIDSQNRINMEMNFSEEEIHKAIREMGSLKSPGPNDMM
        +S+FHR  T  + K  I  + + +G  L +   I    +S F  +Y K+   + +I+ + W PI    +  +   F E EI   I    + K+PGP+   
Subjt:  SSFFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFFDKLYEKDLGPKFVIEGVEWMPIDSQNRINMEMNFSEEEIHKAIREMGSLKSPGPNDMM

Query:  GEFFKNYWNILKLDLVEVFQEFFQNEIINKRANETYICLIPKKKKASNVKDYRPINLVTSLYKLIAKVLAERLKKVLPLTISNWQAAFVHGRQILDAILV
          F+K +W  LK DL+ VF++F +  I+N   N T+I LI KK+K S   DYRPI+L TSLYK++AK LA RLK  LP TI+  Q AF+ GRQI DAIL+
Subjt:  GEFFKNYWNILKLDLVEVFQEFFQNEIINKRANETYICLIPKKKKASNVKDYRPINLVTSLYKLIAKVLAERLKKVLPLTISNWQAAFVHGRQILDAILV

Query:  AAEAVEDKRCRKDNGVLLKLDLEKAYDMVNWDFLDTILQLKGFGHRWRKWIKGCLTNSNFSVMINGRLRGKIMASRGLRQGDSLSPFLFTIVGDAISRSV
        A E ++  + RK  G +LKLD+EKA+D ++W F+D +L  K F H+WRKWIK C++N  +S+++NG  +G+I A RG+RQGD LSPF+F +  D +SR +
Subjt:  AAEAVEDKRCRKDNGVLLKLDLEKAYDMVNWDFLDTILQLKGFGHRWRKWIKGCLTNSNFSVMINGRLRGKIMASRGLRQGDSLSPFLFTIVGDAISRSV

Query:  QFCLEKKILNGWKIGKDGVMVSLLQYADDTLVFIPNNAADIQKWWDILTLILKGAGLYLNMSKTSIIGVNVDSVEIATWAKQFGCKADSLPINYLGFSLG
             K  + G     +   +S L +ADD L+F+ +N   +      LTL  K +GL  N SK++I  +N+ +      A  FG +   LP+NYLG  LG
Subjt:  QFCLEKKILNGWKIGKDGVMVSLLQYADDTLVFIPNNAADIQKWWDILTLILKGAGLYLNMSKTSIIGVNVDSVEIATWAKQFGCKADSLPINYLGFSLG

Query:  GNYRRSIFWDPLLERLNRKLDSWRNFPISKGGRVTFAQSVLNSLPLYQFSLLKAPKSVIKSMEKIIRDFVWSGGSYK
        GN R   FWD  +E +++KL+ W+   ISKGGR+T  ++ L+SLP YQ S  KAP SV K +EK  RDF+W G   K
Subjt:  GNYRRSIFWDPLLERLNRKLDSWRNFPISKGGRVTFAQSVLNSLPLYQFSLLKAPKSVIKSMEKIIRDFVWSGGSYK

TrEMBL top hitse value%identityAlignment
A0A438ISU8 Transposon TX1 uncharacterized 149 kDa protein6.8e-19943.69Show/hide
Query:  EESRLNVIDRRLVKSIWSSRHIAWATLDAINSAGGILILWKESEI---EVV----------------------------ESEKKHFLQELYDLHGLCQGV
        +E++    DRRLV S+WS R+  WA L A  ++GGILI+W   ++   EVV                             + +K F  EL D+ GL    
Subjt:  EESRLNVIDRRLVKSIWSSRHIAWATLDAINSAGGILILWKESEI---EVV----------------------------ESEKKHFLQELYDLHGLCQGV

Query:  WCLVGDFNLIRWVDERLNASNSTRNMKKFNRFIASADLIDPPLGNGKFTWSRMGENVAASRIDRFLLSKQWVDKFSDFRVERLHRPTSDHFPILLTVGAN
        WC+ GDFN+IR   E+L  S  +  MK F+ FI   +LID PL +  +TWS M EN    R+DRFL S +W   F       L R TSDH+PI+L     
Subjt:  WCLVGDFNLIRWVDERLNASNSTRNMKKFNRFIASADLIDPPLGNGKFTWSRMGENVAASRIDRFLLSKQWVDKFSDFRVERLHRPTSDHFPILLTVGAN

Query:  KWGPTPFRFENIWLENSKFKEKIESWWQNMNPNGWAGFRFMEKLKGLKNQIKSWNAIARSQAVSKKRDIVAQIDLIDRLEEQSEIQSHQIEERKRLKAEL
        KWGPTPF+FEN+WL++S FKE    WW     NGW G +FM KL+ +K ++K WN  +  +   KK+DI+A +   D LE++  +    + +R   K EL
Subjt:  KWGPTPFRFENIWLENSKFKEKIESWWQNMNPNGWAGFRFMEKLKGLKNQIKSWNAIARSQAVSKKRDIVAQIDLIDRLEEQSEIQSHQIEERKRLKAEL

Query:  LEFALDEQRCLNQKSKIKWLKEGDENSSFFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFFDKLYEKDLGPKFVIEGVEWMPIDSQNRINMEM
         E  L E+    QK+++KW+KEGD NS+FFH+    +RN+ FI  LEN+SG +L N   I+EEIL +F+KLY    G  + +EG++W PID ++   +E 
Subjt:  LEFALDEQRCLNQKSKIKWLKEGDENSSFFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFFDKLYEKDLGPKFVIEGVEWMPIDSQNRINMEM

Query:  NFSEEEIHKAIREMGSLKSPGPNDMMGEFFKNYWNILKLDLVEVFQEFFQNEIINKRANETYICLIPKKKKASNVKDYRPINLVTSLYKLIAKVLAERLK
         F+EEEI+KAI +M   K+PGP+D     F++ W+++K DLV VF EF ++ IIN+  N ++I LIPKK  +  + D+RPI+L+TSLY++IAKVLA RL+
Subjt:  NFSEEEIHKAIREMGSLKSPGPNDMMGEFFKNYWNILKLDLVEVFQEFFQNEIINKRANETYICLIPKKKKASNVKDYRPINLVTSLYKLIAKVLAERLK

Query:  KVLPLTISNWQAAFVHGRQILDAILVAAEAVEDKRCRKDNGVLLKLDLEKAYDMVNWDFLDTILQLKGFGHRWRKWIKGCLTNSNFSVMINGRLRGKIMA
         VL  TI + Q AFV GRQILDA+L+A E V++KR   + GV+ K+D EKAYD V+WDFLD +L++KGF  RWRKW++GCL++ +++V++NG  +G + A
Subjt:  KVLPLTISNWQAAFVHGRQILDAILVAAEAVEDKRCRKDNGVLLKLDLEKAYDMVNWDFLDTILQLKGFGHRWRKWIKGCLTNSNFSVMINGRLRGKIMA

Query:  SRGLRQGDSLSPFLFTIVGDAISRSVQFCLEKKILNGWKIGKDGVMVSLLQYADDTLVFIPNNAADIQKWWDILTLILKGAGLYLNMSKTSIIGVNVDSV
        SRGLRQGD LSPFLFTIV D +SR +    E+ +L G+++G++   VS LQ+ADDT+ F      D+     +L +    +GL +N+ K++I G+N++  
Subjt:  SRGLRQGDSLSPFLFTIVGDAISRSVQFCLEKKILNGWKIGKDGVMVSLLQYADDTLVFIPNNAADIQKWWDILTLILKGAGLYLNMSKTSIIGVNVDSV

Query:  EIATWAKQFGCKADSLPINYLGFSLGGNYRRSIFWDPLLERLNRKLDSWRNFPISKGGRVTFAQSVLNSLPLYQFSLLKAPKSVIKSMEKIIRDFVWSGG
         ++  A    CKA   PI YLG  LGGN + S FWDP++ER++R+LD W+   +S GGR+T  QS L  +P Y  SL K P SV   +E++ R+F+WSG 
Subjt:  EIATWAKQFGCKADSLPINYLGFSLGGNYRRSIFWDPLLERLNRKLDSWRNFPISKGGRVTFAQSVLNSLPLYQFSLLKAPKSVIKSMEKIIRDFVWSGG

Query:  SYKPRERI
            R+ +
Subjt:  SYKPRERI

A0A438JX47 LINE-1 retrotransposable element ORF2 protein2.0e-19843.19Show/hide
Query:  EESRLNVIDRRLVKSIWSSRHIAWATLDAINSAGGILILW---KESEIEVV----------------------------ESEKKHFLQELYDLHGLCQGV
        +E++    DRR V S+W++R+  WATL A  ++GGILI+W   K S  EV+                             + +K    EL D+ GL    
Subjt:  EESRLNVIDRRLVKSIWSSRHIAWATLDAINSAGGILILW---KESEIEVV----------------------------ESEKKHFLQELYDLHGLCQGV

Query:  WCLVGDFNLIRWVDERLNASNSTRNMKKFNRFIASADLIDPPLGNGKFTWSRMGENVAASRIDRFLLSKQWVDKFSDFRVERLHRPTSDHFPILLTVGAN
        WC+ GDFN+IR   E+L  S  T +MK F+ FI+  +LID PL +  FTWS M  N    R+DRFL S +W   F       L R TSDH+PI+L     
Subjt:  WCLVGDFNLIRWVDERLNASNSTRNMKKFNRFIASADLIDPPLGNGKFTWSRMGENVAASRIDRFLLSKQWVDKFSDFRVERLHRPTSDHFPILLTVGAN

Query:  KWGPTPFRFENIWLENSKFKEKIESWWQNMNPNGWAGFRFMEKLKGLKNQIKSWNAIARSQAVSKKRDIVAQIDLIDRLEEQSEIQSHQIEERKRLKAEL
        KWGPTPFRFEN+WL++  FKE    WW+    NGW G +FM KL+ +K ++K WN  +  +   +K DI++ +   D LE++  +    + +R   K EL
Subjt:  KWGPTPFRFENIWLENSKFKEKIESWWQNMNPNGWAGFRFMEKLKGLKNQIKSWNAIARSQAVSKKRDIVAQIDLIDRLEEQSEIQSHQIEERKRLKAEL

Query:  LEFALDEQRCLNQKSKIKWLKEGDENSSFFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFFDKLYEKDLGPKFVIEGVEWMPIDSQNRINMEM
         E  L E+    QK+++KW+KEGD NS FFH+    +RN+ FI  LEN++G ++ N   I+EEIL +F+KLY    G  + +EG++W PI  ++ + +E 
Subjt:  LEFALDEQRCLNQKSKIKWLKEGDENSSFFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFFDKLYEKDLGPKFVIEGVEWMPIDSQNRINMEM

Query:  NFSEEEIHKAIREMGSLKSPGPNDMMGEFFKNYWNILKLDLVEVFQEFFQNEIINKRANETYICLIPKKKKASNVKDYRPINLVTSLYKLIAKVLAERLK
         F+EEEI KAI +M   K+PGP+      F++ W ++K DLV+VF EF ++ IIN+  N ++I L+PKK  +  + D+RPI+L+TSLYK+IAKVLA R++
Subjt:  NFSEEEIHKAIREMGSLKSPGPNDMMGEFFKNYWNILKLDLVEVFQEFFQNEIINKRANETYICLIPKKKKASNVKDYRPINLVTSLYKLIAKVLAERLK

Query:  KVLPLTISNWQAAFVHGRQILDAILVAAEAVEDKRCRKDNGVLLKLDLEKAYDMVNWDFLDTILQLKGFGHRWRKWIKGCLTNSNFSVMINGRLRGKIMA
        +VL  TI + Q AFV GRQILDA+L+A E V++KR   + GV+ K+D EKAYD V+WDFLD ++++KGFG RWRKW++GCL++ +F+V++NG  +G + A
Subjt:  KVLPLTISNWQAAFVHGRQILDAILVAAEAVEDKRCRKDNGVLLKLDLEKAYDMVNWDFLDTILQLKGFGHRWRKWIKGCLTNSNFSVMINGRLRGKIMA

Query:  SRGLRQGDSLSPFLFTIVGDAISRSVQFCLEKKILNGWKIGKDGVMVSLLQYADDTLVFIPNNAADIQKWWDILTLILKGAGLYLNMSKTSIIGVNVDSV
        SRGLRQGD LSPFLFTIV D +SR +    E+ +L G+K+G++   VS LQ+ADDT+ F  +   D+    ++L +    +GL +N+ K++I G+N++  
Subjt:  SRGLRQGDSLSPFLFTIVGDAISRSVQFCLEKKILNGWKIGKDGVMVSLLQYADDTLVFIPNNAADIQKWWDILTLILKGAGLYLNMSKTSIIGVNVDSV

Query:  EIATWAKQFGCKADSLPINYLGFSLGGNYRRSIFWDPLLERLNRKLDSWRNFPISKGGRVTFAQSVLNSLPLYQFSLLKAPKSVIKSMEKIIRDFVWSGG
         ++  A+   CKA   PI YLG  LGGN + S FWDP++ER++R+LD W+   +S GGR+T  QS L  +P Y  SL K P SV   +E++ RDF+WSG 
Subjt:  EIATWAKQFGCKADSLPINYLGFSLGGNYRRSIFWDPLLERLNRKLDSWRNFPISKGGRVTFAQSVLNSLPLYQFSLLKAPKSVIKSMEKIIRDFVWSGG

Query:  SYKPRERI
            R+ +
Subjt:  SYKPRERI

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein1.8e-19931.89Show/hide
Query:  IQEVHLKKIYASSAKEIVIVWIIDSIDDLLNSSSTHKFFHKVDCNNGFIWIQKISNKQGSFL--EITKVNNSGGKHNLVVSAGTEFNGWKSFSNLLKEFL
        + EV   K ++       + WI +   DLL++S+T  FF +    +  +W++K  NK  + +  EI +++N G K +++V  G +  GWKSF  L+    
Subjt:  IQEVHLKKIYASSAKEIVIVWIIDSIDDLLNSSSTHKFFHKVDCNNGFIWIQKISNKQGSFL--EITKVNNSGGKHNLVVSAGTEFNGWKSFSNLLKEFL

Query:  NGKDDLQEQSKEKEKGGVGRNEDGKSFADILKSSPNYNPR--AETMTQS--DQQKTTEKAWRNITEIR-----GYKE-EVRNIDWDEVIVITKRDFHDDW
              +  +  K      R E   +F+D   S  + + +  A+ ++ S  D  K   KA  + +  R     G+K   +    +++ ++IT+R FHDDW
Subjt:  NGKDDLQEQSKEKEKGGVGRNEDGKSFADILKSSPNYNPR--AETMTQS--DQQKTTEKAWRNITEIR-----GYKE-EVRNIDWDEVIVITKRDFHDDW

Query:  GRILEVMQHQLMETLVINPFHPDKDLLKCPSRELATLLTKNM---GWVSFGPIILKVEKWDKQIHNRITCIPSYGGWIKIRNLPLHLWNLQTFKAIGNCL
         RI+  ++ Q        PF  DK +L   + + A LL  N    GW + G   +K E WD  +H+  + IPSYGGW++ R +PLHLWN  TF+ IG+  
Subjt:  GRILEVMQHQLMETLVINPFHPDKDLLKCPSRELATLLTKNM---GWVSFGPIILKVEKWDKQIHNRITCIPSYGGWIKIRNLPLHLWNLQTFKAIGNCL

Query:  GGFVEYDEANSLLFQCVEVKMKIKENCCGFILVELKVVDGE-EQFNVQIVTYQEGNLLIDRVAGIHGSFSPAAAHVFHRGPNDALFCTADIWR------I
        GGF++  +    + + ++ K+K++ N  GF+   + + D + E F V  V   E   L++R   +HGSF   AA  F +  + A   T + ++       
Subjt:  GGFVEYDEANSLLFQCVEVKMKIKENCCGFILVELKVVDGE-EQFNVQIVTYQEGNLLIDRVAGIHGSFSPAAAHVFHRGPNDALFCTADIWR------I

Query:  ENGVDYPVVITHQAVKES-RYCGRQLENASENFE--------SSRPKQKGK----INEGAGPSNIEGKSPLKEKDETVNWNGEIGMESDLSFSSPASRGS
            DY +   H + K S  Y  +  +N S   E        S R K+KGK    IN+         K   +  +  V++    G++S+ S +   ++G 
Subjt:  ENGVDYPVVITHQAVKES-RYCGRQLENASENFE--------SSRPKQKGK----INEGAGPSNIEGKSPLKEKDETVNWNGEIGMESDLSFSSPASRGS

Query:  E------NERVGKMSNDEEA------------DYEFPEGYQLCFSNEAESDKEAQPQSNDVKAISGQEDVNREVITH----------DANHRSSPSDYCH
               N++  K  +  +               E  E ++L      E  K+    S D+  IS  E + +    H          D N +S+ S    
Subjt:  E------NERVGKMSNDEEA------------DYEFPEGYQLCFSNEAESDKEAQPQSNDVKAISGQEDVNREVITH----------DANHRSSPSDYCH

Query:  DNTTHSMASPKALAIIAPLIVQRNTLEEGTG-----DFMISKELILTLRRNNLCIRPITGTNSEKGNSTKKKRNREESRLNVIDRRLVKSIWSSRHIAWA
        + T             +    + N+ +  TG     D    ++L++ L+ N L + P   TN    +S                      I S +++  A
Subjt:  DNTTHSMASPKALAIIAPLIVQRNTLEEGTG-----DFMISKELILTLRRNNLCIRPITGTNSEKGNSTKKKRNREESRLNVIDRRLVKSIWSSRHIAWA

Query:  TLDAINSAGGILILWKESEIEVVE------------------------------SEKKHFLQELYDLHGLCQGVWCLVGDFNLIRWVDERLNASNSTRNM
            +   GGIL+LW ++  +V +                              +++     EL  L  LC   W + GDFN++RW  E    S   RNM
Subjt:  TLDAINSAGGILILWKESEIEVVE------------------------------SEKKHFLQELYDLHGLCQGVWCLVGDFNLIRWVDERLNASNSTRNM

Query:  KKFNRFIASADLIDPPLGNGKFTWSRMGENVAASRIDRFLLSKQWVDKFSDFRVERLHRPTSDHFPILLTVGANKWGPTPFRFENIWLENSKFKEKIESW
          FN FI+  +LIDPP  N  FTWS +  N   SR+DRFLLSK W + F       L R  SDHFPILL     KWGP PFR  N  L + +F++   +W
Subjt:  KKFNRFIASADLIDPPLGNGKFTWSRMGENVAASRIDRFLLSKQWVDKFSDFRVERLHRPTSDHFPILLTVGANKWGPTPFRFENIWLENSKFKEKIESW

Query:  WQNMNPNGWAGFRFMEKLKGLKNQIKSWNAIARSQAVSKKRDIVAQIDLIDRLEEQSEIQSHQIEERKRLKAELLEFALDEQRCLNQKSKIKWLKEGDEN
        W +    G+ G+ F++ L  L   IK W     +   + K+ ++ +ID+ID+LE Q E+ +   ++R  LK++LL    ++ +  +Q+++ +W   GDEN
Subjt:  WQNMNPNGWAGFRFMEKLKGLKNQIKSWNAIARSQAVSKKRDIVAQIDLIDRLEEQSEIQSHQIEERKRLKAELLEFALDEQRCLNQKSKIKWLKEGDEN

Query:  SSFFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFFDKLYEKDLGPKFVIEGVEWMPIDSQNRINMEMNFSEEEIHKAIREMGSLKSPGPNDMM
        +S+FHR  T  + K  I  + + +G  L +   I    +S F  +Y K+   + +I+ + W PI    +  +   F E EI   I    + K+PGP+   
Subjt:  SSFFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFFDKLYEKDLGPKFVIEGVEWMPIDSQNRINMEMNFSEEEIHKAIREMGSLKSPGPNDMM

Query:  GEFFKNYWNILKLDLVEVFQEFFQNEIINKRANETYICLIPKKKKASNVKDYRPINLVTSLYKLIAKVLAERLKKVLPLTISNWQAAFVHGRQILDAILV
          F+K +W  LK DL+ VF++F +  I+N   N T+I LI KK+K S   DYRPI+L TSLYK++AK LA RLK  LP TI+  Q AF+ GRQI DAIL+
Subjt:  GEFFKNYWNILKLDLVEVFQEFFQNEIINKRANETYICLIPKKKKASNVKDYRPINLVTSLYKLIAKVLAERLKKVLPLTISNWQAAFVHGRQILDAILV

Query:  AAEAVEDKRCRKDNGVLLKLDLEKAYDMVNWDFLDTILQLKGFGHRWRKWIKGCLTNSNFSVMINGRLRGKIMASRGLRQGDSLSPFLFTIVGDAISRSV
        A EA++  + RK  G +LKLD+EKA+D ++W F+D +L  K F H+WRKWIK C++N  +S+++NG  +G+I A RG+RQGD LSPF+F +  D +SR +
Subjt:  AAEAVEDKRCRKDNGVLLKLDLEKAYDMVNWDFLDTILQLKGFGHRWRKWIKGCLTNSNFSVMINGRLRGKIMASRGLRQGDSLSPFLFTIVGDAISRSV

Query:  QFCLEKKILNGWKIGKDGVMVSLLQYADDTLVFIPNNAADIQKWWDILTLILKGAGLYLNMSKTSIIGVNVDSVEIATWAKQFGCKADSLPINYLGFSLG
             K  + G     +   +S L +ADD L+F+ +N   +      LTL  K +GL  N SK++I  +N+ +      A  FG +   LP+NYLG  LG
Subjt:  QFCLEKKILNGWKIGKDGVMVSLLQYADDTLVFIPNNAADIQKWWDILTLILKGAGLYLNMSKTSIIGVNVDSVEIATWAKQFGCKADSLPINYLGFSLG

Query:  GNYRRSIFWDPLLERLNRKLDSWRNFPISKGGRVTFAQSVLNSLPLYQFSLLKAPKSVIKSMEKIIRDFVWSGGSYK
        GN R   FWD  +E +++KL+ W+   ISKGGR+T  ++ L+SLP YQ S  KAP SV K +EK  RDF+W G   K
Subjt:  GNYRRSIFWDPLLERLNRKLDSWRNFPISKGGRVTFAQSVLNSLPLYQFSLLKAPKSVIKSMEKIIRDFVWSGGSYK

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein5.2e-19931.82Show/hide
Query:  IQEVHLKKIYASSAKEIVIVWIIDSIDDLLNSSSTHKFFHKVDCNNGFIWIQKISNKQGSFL--EITKVNNSGGKHNLVVSAGTEFNGWKSFSNLLKEFL
        + EV   K ++       + WI +   DLL++S+T  FF +    +  +W++K  NK  + +  EI +++N G K +++V  G +  GWKSF  L+    
Subjt:  IQEVHLKKIYASSAKEIVIVWIIDSIDDLLNSSSTHKFFHKVDCNNGFIWIQKISNKQGSFL--EITKVNNSGGKHNLVVSAGTEFNGWKSFSNLLKEFL

Query:  NGKDDLQEQSKEKEKGGVGRNEDGKSFADILKSSPNYNPR--AETMTQS--DQQKTTEKAWRNITEIR-----GYKE-EVRNIDWDEVIVITKRDFHDDW
              +  +  K      R E   +F+D   S  + + +  A+ ++ S  D  K   KA  + +  R     G+K   +    +++ ++IT+R FHDDW
Subjt:  NGKDDLQEQSKEKEKGGVGRNEDGKSFADILKSSPNYNPR--AETMTQS--DQQKTTEKAWRNITEIR-----GYKE-EVRNIDWDEVIVITKRDFHDDW

Query:  GRILEVMQHQLMETLVINPFHPDKDLLKCPSRELATLLTKNM---GWVSFGPIILKVEKWDKQIHNRITCIPSYGGWIKIRNLPLHLWNLQTFKAIGNCL
         RI+  ++ Q        PF  DK +L   + + A LL  N    GW + G   +K E WD  +H+  + IPSYGGW++ R +PLHLWN  TF+ IG+  
Subjt:  GRILEVMQHQLMETLVINPFHPDKDLLKCPSRELATLLTKNM---GWVSFGPIILKVEKWDKQIHNRITCIPSYGGWIKIRNLPLHLWNLQTFKAIGNCL

Query:  GGFVEYDEANSLLFQCVEVKMKIKENCCGFILVELKVVDGE-EQFNVQIVTYQEGNLLIDRVAGIHGSFSPAAAHVFHRGPNDALFCTADIWR------I
        GGF++  +    + + ++ K+K++ N  GF+   + + D + E F V  V   E   L++R   +HGSF   AA  F +  + A   T + ++       
Subjt:  GGFVEYDEANSLLFQCVEVKMKIKENCCGFILVELKVVDGE-EQFNVQIVTYQEGNLLIDRVAGIHGSFSPAAAHVFHRGPNDALFCTADIWR------I

Query:  ENGVDYPVVITHQAVKES-RYCGRQLENASENFE--------SSRPKQKGK----INEGAGPSNIEGKSPLKEKDETVNWNGEIGMESDLSFSSPASRGS
            DY +   H + K S  Y  +  +N S   E        S R K+KGK    IN+         K   +  +  V++    G++S+ S +   ++G 
Subjt:  ENGVDYPVVITHQAVKES-RYCGRQLENASENFE--------SSRPKQKGK----INEGAGPSNIEGKSPLKEKDETVNWNGEIGMESDLSFSSPASRGS

Query:  E------NERVGKMSNDEEA------------DYEFPEGYQLCFSNEAESDKEAQPQSNDVKAISGQEDVNREVITH----------DANHRSSPSDYCH
               N++  K  +  +               E  E ++L      E  K+    S D+  IS  E + +    H          D N +S+ S    
Subjt:  E------NERVGKMSNDEEA------------DYEFPEGYQLCFSNEAESDKEAQPQSNDVKAISGQEDVNREVITH----------DANHRSSPSDYCH

Query:  DNTTHSMASPKALAIIAPLIVQRNTLEEGTG-----DFMISKELILTLRRNNLCIRPITGTNSEKGNSTKKKRNREESRLNVIDRRLVKSIWSSRHIAWA
        + T             +    + N+ +  TG     D    ++L++ L+ N L + P   TN    +S                      I S +++  A
Subjt:  DNTTHSMASPKALAIIAPLIVQRNTLEEGTG-----DFMISKELILTLRRNNLCIRPITGTNSEKGNSTKKKRNREESRLNVIDRRLVKSIWSSRHIAWA

Query:  TLDAINSAGGILILWKESEIEVVE------------------------------SEKKHFLQELYDLHGLCQGVWCLVGDFNLIRWVDERLNASNSTRNM
            +   GGIL+LW ++  +V +                              +++     EL  L  LC   W + GDFN++RW  E    S   RNM
Subjt:  TLDAINSAGGILILWKESEIEVVE------------------------------SEKKHFLQELYDLHGLCQGVWCLVGDFNLIRWVDERLNASNSTRNM

Query:  KKFNRFIASADLIDPPLGNGKFTWSRMGENVAASRIDRFLLSKQWVDKFSDFRVERLHRPTSDHFPILLTVGANKWGPTPFRFENIWLENSKFKEKIESW
          FN FI+  +LIDPP  N  FTWS +  N   SR+DRFLLSK W + F       L R  SDHFPILL     KWGP PFR  N  L + +F++   +W
Subjt:  KKFNRFIASADLIDPPLGNGKFTWSRMGENVAASRIDRFLLSKQWVDKFSDFRVERLHRPTSDHFPILLTVGANKWGPTPFRFENIWLENSKFKEKIESW

Query:  WQNMNPNGWAGFRFMEKLKGLKNQIKSWNAIARSQAVSKKRDIVAQIDLIDRLEEQSEIQSHQIEERKRLKAELLEFALDEQRCLNQKSKIKWLKEGDEN
        W +    G+ G+ F++ L  L   IK W     +   + K+ ++ +ID+ID+LE Q E+ +   ++R  LK++LL    ++ +  +Q+++ +W   GDEN
Subjt:  WQNMNPNGWAGFRFMEKLKGLKNQIKSWNAIARSQAVSKKRDIVAQIDLIDRLEEQSEIQSHQIEERKRLKAELLEFALDEQRCLNQKSKIKWLKEGDEN

Query:  SSFFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFFDKLYEKDLGPKFVIEGVEWMPIDSQNRINMEMNFSEEEIHKAIREMGSLKSPGPNDMM
        +S+FHR  T  + K  I  + + +G  L +   I    +S F  +Y K+   + +I+ + W PI    +  +   F E EI   I    + K+PGP+   
Subjt:  SSFFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFFDKLYEKDLGPKFVIEGVEWMPIDSQNRINMEMNFSEEEIHKAIREMGSLKSPGPNDMM

Query:  GEFFKNYWNILKLDLVEVFQEFFQNEIINKRANETYICLIPKKKKASNVKDYRPINLVTSLYKLIAKVLAERLKKVLPLTISNWQAAFVHGRQILDAILV
          F+K +W  LK DL+ VF++F +  I+N   N T+I LI KK+K S   DYRPI+L TSLYK++AK LA RLK  LP TI+  Q AF+ GRQI DAIL+
Subjt:  GEFFKNYWNILKLDLVEVFQEFFQNEIINKRANETYICLIPKKKKASNVKDYRPINLVTSLYKLIAKVLAERLKKVLPLTISNWQAAFVHGRQILDAILV

Query:  AAEAVEDKRCRKDNGVLLKLDLEKAYDMVNWDFLDTILQLKGFGHRWRKWIKGCLTNSNFSVMINGRLRGKIMASRGLRQGDSLSPFLFTIVGDAISRSV
        A E ++  + RK  G +LKLD+EKA+D ++W F+D +L  K F H+WRKWIK C++N  +S+++NG  +G+I A RG+RQGD LSPF+F +  D +SR +
Subjt:  AAEAVEDKRCRKDNGVLLKLDLEKAYDMVNWDFLDTILQLKGFGHRWRKWIKGCLTNSNFSVMINGRLRGKIMASRGLRQGDSLSPFLFTIVGDAISRSV

Query:  QFCLEKKILNGWKIGKDGVMVSLLQYADDTLVFIPNNAADIQKWWDILTLILKGAGLYLNMSKTSIIGVNVDSVEIATWAKQFGCKADSLPINYLGFSLG
             K  + G     +   +S L +ADD L+F+ +N   +      LTL  K +GL  N SK++I  +N+ +      A  FG +   LP+NYLG  LG
Subjt:  QFCLEKKILNGWKIGKDGVMVSLLQYADDTLVFIPNNAADIQKWWDILTLILKGAGLYLNMSKTSIIGVNVDSVEIATWAKQFGCKADSLPINYLGFSLG

Query:  GNYRRSIFWDPLLERLNRKLDSWRNFPISKGGRVTFAQSVLNSLPLYQFSLLKAPKSVIKSMEKIIRDFVWSGGSYK
        GN R   FWD  +E +++KL+ W+   ISKGGR+T  ++ L+SLP YQ S  KAP SV K +EK  RDF+W G   K
Subjt:  GNYRRSIFWDPLLERLNRKLDSWRNFPISKGGRVTFAQSVLNSLPLYQFSLLKAPKSVIKSMEKIIRDFVWSGGSYK

A5BCI7 Reverse transcriptase domain-containing protein4.4e-19843.3Show/hide
Query:  KKRNREESRLNVIDRRLVKSIWSSRHIAWATLDAINSAGGILILW---KESEIEVV----------------------------ESEKKHFLQELYDLHG
        K  + +E++    DRR V S+W++R+  WA L A  ++GGILI+W   K S  EVV                             + +K F  EL D+ G
Subjt:  KKRNREESRLNVIDRRLVKSIWSSRHIAWATLDAINSAGGILILW---KESEIEVV----------------------------ESEKKHFLQELYDLHG

Query:  LCQGVWCLVGDFNLIRWVDERLNASNSTRNMKKFNRFIASADLIDPPLGNGKFTWSRMGENVAASRIDRFLLSKQWVDKFSDFRVERLHRPTSDHFPILL
        L    WC+ GDFN+IR   E+L  S  T +MK F+ FI+  +LID PL +  FTWS M  N    R+DRFL S +W   F       L R TSDH+PI+L
Subjt:  LCQGVWCLVGDFNLIRWVDERLNASNSTRNMKKFNRFIASADLIDPPLGNGKFTWSRMGENVAASRIDRFLLSKQWVDKFSDFRVERLHRPTSDHFPILL

Query:  TVGANKWGPTPFRFENIWLENSKFKEKIESWWQNMNPNGWAGFRFMEKLKGLKNQIKSWNAIARSQAVSKKRDIVAQIDLIDRLEEQSEIQSHQIEERKR
             KWGPTPFRFEN+WL++  FKE    WW+    NGW G +FM KL+ +K ++K WN  +  +   +K DI++ +   D LE++  +    + +R  
Subjt:  TVGANKWGPTPFRFENIWLENSKFKEKIESWWQNMNPNGWAGFRFMEKLKGLKNQIKSWNAIARSQAVSKKRDIVAQIDLIDRLEEQSEIQSHQIEERKR

Query:  LKAELLEFALDEQRCLNQKSKIKWLKEGDENSSFFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFFDKLYEKDLGPKFVIEGVEWMPIDSQNR
         K EL E  L E+    QK+++KW+KEGD NS FFH+    +RN+ FI  LEN++G ++ N   I+EEIL +F+KLY    G  + +EG++W PI  ++ 
Subjt:  LKAELLEFALDEQRCLNQKSKIKWLKEGDENSSFFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFFDKLYEKDLGPKFVIEGVEWMPIDSQNR

Query:  INMEMNFSEEEIHKAIREMGSLKSPGPNDMMGEFFKNYWNILKLDLVEVFQEFFQNEIINKRANETYICLIPKKKKASNVKDYRPINLVTSLYKLIAKVL
          +E  F+EEEI KAI +M   K+PGP+      F++ W ++K DLV+VF EF ++ IIN+  N ++I L+PKK  +  + D+RPI+L+TSLYK+IAKVL
Subjt:  INMEMNFSEEEIHKAIREMGSLKSPGPNDMMGEFFKNYWNILKLDLVEVFQEFFQNEIINKRANETYICLIPKKKKASNVKDYRPINLVTSLYKLIAKVL

Query:  AERLKKVLPLTISNWQAAFVHGRQILDAILVAAEAVEDKRCRKDNGVLLKLDLEKAYDMVNWDFLDTILQLKGFGHRWRKWIKGCLTNSNFSVMINGRLR
        A R++ VL  TI + Q AFV GRQILDA+L+A E V++KR   + GV+ K+D EKAYD V+WDFLD +L++KGFG RWRKW++GCL++ +F+V++NG  +
Subjt:  AERLKKVLPLTISNWQAAFVHGRQILDAILVAAEAVEDKRCRKDNGVLLKLDLEKAYDMVNWDFLDTILQLKGFGHRWRKWIKGCLTNSNFSVMINGRLR

Query:  GKIMASRGLRQGDSLSPFLFTIVGDAISRSVQFCLEKKILNGWKIGKDGVMVSLLQYADDTLVFIPNNAADIQKWWDILTLILKGAGLYLNMSKTSIIGV
        G + ASRGLRQGD LSPFLFTIV D +SR +    E+ +L G+K+G++   VS LQ+ADDT+ F  +   D+    ++L +    +GL +N+ K++I G+
Subjt:  GKIMASRGLRQGDSLSPFLFTIVGDAISRSVQFCLEKKILNGWKIGKDGVMVSLLQYADDTLVFIPNNAADIQKWWDILTLILKGAGLYLNMSKTSIIGV

Query:  NVDSVEIATWAKQFGCKADSLPINYLGFSLGGNYRRSIFWDPLLERLNRKLDSWRNFPISKGGRVTFAQSVLNSLPLYQFSLLKAPKSVIKSMEKIIRDF
        N++   ++  A+   CKA   PI YLG  LGGN + S FWDP++ER++R+LD W+   +S GGR+T  QS L  +P Y  SL K P SV   +E++ RDF
Subjt:  NVDSVEIATWAKQFGCKADSLPINYLGFSLGGNYRRSIFWDPLLERLNRKLDSWRNFPISKGGRVTFAQSVLNSLPLYQFSLLKAPKSVIKSMEKIIRDF

Query:  VWSGGSYKPRERI
        +WSG     R+ +
Subjt:  VWSGGSYKPRERI

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein3.9e-5024.63Show/hide
Query:  QELYDLHGLCQGVWCLVGDFNLIRWVDERLNASNSTRNMKKFNRFIASADLID-----PPLGNGKFTWSRMGENVAASRIDRFLLSKQWVDKFSDFRVER
        Q L DL         ++GDFN    + +R       ++ ++ N  +   DLID      P       +S    +   S+ID  + SK  + K    R E 
Subjt:  QELYDLHGLCQGVWCLVGDFNLIRWVDERLNASNSTRNMKKFNRFIASADLID-----PPLGNGKFTWSRMGENVAASRIDRFLLSKQWVDKFSDFRVER

Query:  LHRPTSDHFPILLTV--------GANKWGPTPFRFENIWLENSKFKEKIESWWQ-NMN-----PNGWAGFRFMEKLKGLKNQIKSWNAIARSQAVSKKRD
        +    SDH  I L +         +  W        + W+ N + K +I+ +++ N N      N W  F+ +      + +  + NA  R Q  SK   
Subjt:  LHRPTSDHFPILLTV--------GANKWGPTPFRFENIWLENSKFKEKIESWWQ-NMN-----PNGWAGFRFMEKLKGLKNQIKSWNAIARSQAVSKKRD

Query:  IVAQIDLIDRLEEQSEIQSHQIEERKRLKAELLEFALDEQRCLNQKSKIKWLKEGDENSSFFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFF
        + +Q+  +++ +EQ+  ++ + +E  +++AEL E    +      +S+  + +  ++      R +  KR K  I  ++ND GD+ T+ T+I+  I  ++
Subjt:  IVAQIDLIDRLEEQSEIQSHQIEERKRLKAELLEFALDEQRCLNQKSKIKWLKEGDENSSFFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFF

Query:  DKLYEKDLGPKFVIEGVEWMP--IDSQN--RINME------MNFSEEEIHKAIREMGSLKSPGPNDMMGEFFKNYWNILKLDLVEVFQEFFQNEIINKRA
          LY   L      E +E M   +D+    R+N E         +  EI   I  + + KSPGP+    EF++ Y   L   L+++FQ   +  I+    
Subjt:  DKLYEKDLGPKFVIEGVEWMP--IDSQN--RINME------MNFSEEEIHKAIREMGSLKSPGPNDMMGEFFKNYWNILKLDLVEVFQEFFQNEIINKRA

Query:  NETYICLIPKKKKASNVKD-YRPINLVTSLYKLIAKVLAERLKKVLPLTISNWQAAFVHGRQILDAILVAAEAVED-KRCRKDNGVLLKLDLEKAYDMVN
         E  I LIPK  + +  K+ +RPI+L+    K++ K+LA R+++ +   I + Q  F+ G Q    I  +   ++   R +  N V++ +D EKA+D + 
Subjt:  NETYICLIPKKKKASNVKD-YRPINLVTSLYKLIAKVLAERLKKVLPLTISNWQAAFVHGRQILDAILVAAEAVED-KRCRKDNGVLLKLDLEKAYDMVN

Query:  WDFLDTILQLKGFGHRWRKWIKGCLTNSNFSVMINGRLRGKIMASRGLRQGDSLSPFLFTIVGDAISRSVQFCLEKKILNGWKIGKDGVMVSLLQYADDT
          F+   L   G    + K I+        ++++NG+         G RQG  LSP LF IV + ++R+++   ++K + G ++GK+ V +SL  +ADD 
Subjt:  WDFLDTILQLKGFGHRWRKWIKGCLTNSNFSVMINGRLRGKIMASRGLRQGDSLSPFLFTIVGDAISRSVQFCLEKKILNGWKIGKDGVMVSLLQYADDT

Query:  LVFIPNNAADIQKWWDILTLILKGAGLYLNMSKTSIIGVNVDSVEIATWAKQFGCKADSLPINYLGFSLGGNYRRSIF---WDPLLERLNRKLDSWRNFP
        +V++ N     Q    +++   K +G  +N+ K+     N +    +    +      S  I YLG  L  +  + +F   + PLL+ +    + W+N P
Subjt:  LVFIPNNAADIQKWWDILTLILKGAGLYLNMSKTSIIGVNVDSVEIATWAKQFGCKADSLPINYLGFSLGGNYRRSIF---WDPLLERLNRKLDSWRNFP

Query:  ISKGGRVTFAQSVLNSLPLYQFSL--LKAPKSVIKSMEKIIRDFVWS
         S  GR+   +  +    +Y+F+   +K P +    +EK    F+W+
Subjt:  ISKGGRVTFAQSVLNSLPLYQFSL--LKAPKSVIKSMEKIIRDFVWS

P08548 LINE-1 reverse transcriptase homolog8.4e-4524.4Show/hide
Query:  FLQE-LYDLHGLCQGVWCLVGDFNLIRWVDERLNASNSTRNMKKFNRFIASADLID------PPLGNGKFTWSRMGENVAASRIDRFLLSKQWVDKFSDF
        F++E L D+  L      +VGDFN    V +R +    ++ +   N  I   DL D      P      F  S  G     S+ID  L  K  + KF   
Subjt:  FLQE-LYDLHGLCQGVWCLVGDFNLIRWVDERLNASNSTRNMKKFNRFIASADLID------PPLGNGKFTWSRMGENVAASRIDRFLLSKQWVDKFSDF

Query:  RVERLHRPTSDHFPILLTVGANK--------WGPTPFRFENIWLENSKFKEKIESWWQNMNPNGWAGFRFMEKLKGLKNQIKSWNAIARSQAVSKKRDIV
        ++E +    SDH  I + +  N+        W       ++ W+ +   KE  +   QN N +      +      L+ +  +  A  +     +  +++
Subjt:  RVERLHRPTSDHFPILLTVGANK--------WGPTPFRFENIWLENSKFKEKIESWWQNMNPNGWAGFRFMEKLKGLKNQIKSWNAIARSQAVSKKRDIV

Query:  AQIDLIDRLEEQSEIQSHQIEERKRLKAELLEFALDEQRCLNQ--KSKIKWLKEGDENSSFFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFF
          +  +++ EE S  +  + +E  +++AEL E  ++ +R + Q  KSK  + ++ ++           KR K+ IS + N + ++ T+ ++I++ +  ++
Subjt:  AQIDLIDRLEEQSEIQSHQIEERKRLKAELLEFALDEQRCLNQ--KSKIKWLKEGDENSSFFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFF

Query:  DKLY-EKDLGPKFVIEGVE--WMPIDSQNRINM-EMNFSEEEIHKAIREMGSLKSPGPNDMMGEFFKNYWNILKLDLVEVFQEFFQNEIINKRANETYIC
         KLY  K    K + + +E   +P  SQ  + M     S  EI   I+ +   KSPGP+    EF++ +   L   L+ +FQ   +  I+     E  I 
Subjt:  DKLY-EKDLGPKFVIEGVE--WMPIDSQNRINM-EMNFSEEEIHKAIREMGSLKSPGPNDMMGEFFKNYWNILKLDLVEVFQEFFQNEIINKRANETYIC

Query:  LIPKK-KKASNVKDYRPINLVTSLYKLIAKVLAERLKKVLPLTISNWQAAFVHGRQILDAILVAAEAVED-KRCRKDNGVLLKLDLEKAYDMVNWDFLDT
        LIPK  K  +  ++YRPI+L+    K++ K+L  R+++ +   I + Q  F+ G Q    I  +   ++   + +  + ++L +D EKA+D +   F+  
Subjt:  LIPKK-KKASNVKDYRPINLVTSLYKLIAKVLAERLKKVLPLTISNWQAAFVHGRQILDAILVAAEAVED-KRCRKDNGVLLKLDLEKAYDMVNWDFLDT

Query:  ILQLKGFGHRWRKWIKGCLTNSNFSVMINGRLRGKIMASRGLRQGDSLSPFLFTIVGDAISRSVQFCLEKKILNGWKIGKDGVMVSLLQYADDTLVFIPN
         L+  G    + K I+   +    ++++NG          G RQG  LSP LF IV + ++ +++   E+K + G  IG + + +SL  +ADD +V++ N
Subjt:  ILQLKGFGHRWRKWIKGCLTNSNFSVMINGRLRGKIMASRGLRQGDSLSPFLFTIVGDAISRSVQFCLEKKILNGWKIGKDGVMVSLLQYADDTLVFIPN

Query:  NAADIQKWWDILTLILKGAGLYLNMSKT-SIIGVNVDSVEIATWAKQFGCKADSLP-------INYLGFSLGGN----YRRSIFWDPLLERLNRKLDSWR
              K  +++      +G  +N  K+ + I  N +  E            DS+P       + YLG  L  +    Y+ +  ++ L + +   ++ W+
Subjt:  NAADIQKWWDILTLILKGAGLYLNMSKT-SIIGVNVDSVEIATWAKQFGCKADSLP-------INYLGFSLGGN----YRRSIFWDPLLERLNRKLDSWR

Query:  NFPISKGGRVTFAQSVLNSLPLYQFSL--LKAPKSVIKSMEKIIRDFVWS
        N P S  GR+   +  +    +Y F+   +KAP S  K +EKII  F+W+
Subjt:  NFPISKGGRVTFAQSVLNSLPLYQFSL--LKAPKSVIKSMEKIIRDFVWS

P11369 LINE-1 retrotransposable element ORF2 protein1.1e-3923.24Show/hide
Query:  LVGDFNLIRWVDERLNASNSTRNMKKFNRFIASADLID-----PPLGNGKFTWSRMGENVAASRIDRFLLSKQWVDKFSDFRVERLHRPTSDH--FPILL
        +VGDFN      +R       R+  K    +   DL D      P   G   +S    +   S+ID  +  K  ++++ +  +E +    SDH    ++ 
Subjt:  LVGDFNLIRWVDERLNASNSTRNMKKFNRFIASADLID-----PPLGNGKFTWSRMGENVAASRIDRFLLSKQWVDKFSDFRVERLHRPTSDH--FPILL

Query:  TVGANKWGPT-PFRFENIWLENSKFKEKIESWWQNM----------NPNGWAGFRFMEKLKGLKNQIKSWNAIARSQAVSKKRDIVAQIDLIDRLEEQSE
            N   PT  ++  N  L ++  KE I+   ++            PN W   +       L+ ++ + +A  + +  +    +   +  +++ E  S 
Subjt:  TVGANKWGPT-PFRFENIWLENSKFKEKIESWWQNM----------NPNGWAGFRFMEKLKGLKNQIKSWNAIARSQAVSKKRDIVAQIDLIDRLEEQSE

Query:  IQSHQIEERKRLKAELLEFALDEQRCLNQKSKIK-WLKEG-DENSSFFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFFDKLYEKDLG-----
         +S + +E  +L+ E+ +  ++ +R + + ++ + W  E  ++      R     R+K  I+ + N+ GD+ T+  +I+  I SF+ +LY   L      
Subjt:  IQSHQIEERKRLKAELLEFALDEQRCLNQKSKIK-WLKEG-DENSSFFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFFDKLYEKDLG-----

Query:  PKFVIEGVEWMPIDSQNRINMEMNFSEEEIHKAIREMGSLKSPGPNDMMGEFFKNYWNILKLDLVEVFQEFFQN-EIINKRANETY---ICLIPK-KKKA
         KF ++  +   ++     ++    S +EI   I  + + KSPGP+    EF++ +    K DL+ +  + F   E+     N  Y   I LIPK +K  
Subjt:  PKFVIEGVEWMPIDSQNRINMEMNFSEEEIHKAIREMGSLKSPGPNDMMGEFFKNYWNILKLDLVEVFQEFFQN-EIINKRANETY---ICLIPK-KKKA

Query:  SNVKDYRPINLVTSLYKLIAKVLAERLKKVLPLTISNWQAAFVHGRQILDAILVAAEAVEDKRCRKD-NGVLLKLDLEKAYDMVNWDFLDTILQLKGFGH
        + ++++RPI+L+    K++ K+LA R+++ +   I   Q  F+ G Q    I  +   +      KD N +++ LD EKA+D +   F+  +L+  G   
Subjt:  SNVKDYRPINLVTSLYKLIAKVLAERLKKVLPLTISNWQAAFVHGRQILDAILVAAEAVEDKRCRKD-NGVLLKLDLEKAYDMVNWDFLDTILQLKGFGH

Query:  RWRKWIKGCLTNSNFSVMINGRLRGKIMASRGLRQGDSLSPFLFTIVGDAISRSVQFCLEKKILNGWKIGKDGVMVSLLQYADDTLVFIPNNAADIQKWW
         +   IK   +    ++ +NG     I    G RQG  LSP+LF IV + ++R+++   ++K + G +IGK+ V +SLL  ADD +V+I +     ++  
Subjt:  RWRKWIKGCLTNSNFSVMINGRLRGKIMASRGLRQGDSLSPFLFTIVGDAISRSVQFCLEKKILNGWKIGKDGVMVSLLQYADDTLVFIPNNAADIQKWW

Query:  DILTLILKGAGLYLNMSKTS--IIGVNVDSVEIATWAKQFGCKADSLPINYLGFSLGGNYR--RSIFWDPLLERLNRKLDSWRNFPISKGGRVTFAQSVL
        +++    +  G  +N +K+   +   N  + +       F    ++  I YLG +L    +      +  L + +   L  W++ P S  GR+   +  +
Subjt:  DILTLILKGAGLYLNMSKTS--IIGVNVDSVEIATWAKQFGCKADSLPINYLGFSLGGNYR--RSIFWDPLLERLNRKLDSWRNFPISKGGRVTFAQSVL

Query:  NSLPLYQFSL--LKAPKSVIKSMEKIIRDFVWSGGSYKPR
            +Y+F+   +K P      +E  I  FVW+  + KPR
Subjt:  NSLPLYQFSL--LKAPKSVIKSMEKIIRDFVWSGGSYKPR

P14381 Transposon TX1 uncharacterized 149 kDa protein6.4e-4526.44Show/hide
Query:  FTWSRMGE-NVAASRIDRFLLSKQWVDKFSDFRVERLHRPTSDHFPILLTVGANKWGPTP--FRFENIWLENSKFKEKIESWWQNMNPNGWAGFRFMEKL
        FT+ R+ + +V+ SRIDR  +S   + +     + RL  P SDH  + L +      P    + F N  LE+  F + +   W+     GW  F+  ++ 
Subjt:  FTWSRMGE-NVAASRIDRFLLSKQWVDKFSDFRVERLHRPTSDHFPILLTVGANKWGPTP--FRFENIWLENSKFKEKIESWWQNMNPNGWAGFRFMEKL

Query:  KGLKNQIKSWNAIAR----------SQAVSKKRDIVAQ------IDLIDRLEEQSEIQSHQIEERKRLKAELLEFALDEQRCLNQKSKIKWLKEGDENSS
          L      W  + +          +++VS +R+   +      +DL  RL   SE Q+ Q E  +R K  L      + R    +S+++ L + D  S 
Subjt:  KGLKNQIKSWNAIAR----------SQAVSKKRDIVAQ------IDLIDRLEEQSEIQSHQIEERKRLKAELLEFALDEQRCLNQKSKIKWLKEGDENSS

Query:  FFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFFDKLYEKD-LGPKFVIEGVEWMPIDSQNR-INMEMNFSEEEIHKAIREMGSLKSPGPNDMM
        FF+     K N+  I+ L  + G  L +   I +   SF+  L+  D + P    E  + +P+ S+ R   +E   + +E+ +A+R M   KSPG + + 
Subjt:  FFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFFDKLYEKD-LGPKFVIEGVEWMPIDSQNR-INMEMNFSEEEIHKAIREMGSLKSPGPNDMM

Query:  GEFFKNYWNILKLDLVEVFQEFFQNEIINKRANETYICLIPKKKKASNVKDYRPINLVTSLYKLIAKVLAERLKKVLPLTISNWQAAFVHGRQILDAILV
         EFF+ +W+ L  D   V  E F+   +        + L+PKK     +K++RP++L+++ YK++AK ++ RLK VL   I   Q+  V GR I D + +
Subjt:  GEFFKNYWNILKLDLVEVFQEFFQNEIINKRANETYICLIPKKKKASNVKDYRPINLVTSLYKLIAKVLAERLKKVLPLTISNWQAAFVHGRQILDAILV

Query:  AAEAVEDKRCRKDNGVLLKLDLEKAYDMVNWDFLDTILQLKGFGHRWRKWIKGCLTNSNFSVMINGRLRGKIMASRGLRQGDSLSPFLFTIVGDAISRSV
          + +   R    +   L LD EKA+D V+  +L   LQ   FG ++  ++K    ++   V IN  L   +   RG+RQG  LS  L+++  +      
Subjt:  AAEAVEDKRCRKDNGVLLKLDLEKAYDMVNWDFLDTILQLKGFGHRWRKWIKGCLTNSNFSVMINGRLRGKIMASRGLRQGDSLSPFLFTIVGDAISRSV

Query:  QFCLEKKILNGWKIGKDGVMVSLLQYADDTLVFIPNNAADIQKWWDILTLILKGAGLYLNMSKTSIIGVNVDSVEIATWAKQF-GCKADSLPINYLGFSL
          CL +K L G  + +  + V L  YADD ++ +  +  D+++  +   +    +   +N SK+S  G+   S+++      F     +S  I YLG  L
Subjt:  QFCLEKKILNGWKIGKDGVMVSLLQYADDTLVFIPNNAADIQKWWDILTLILKGAGLYLNMSKTSIIGVNVDSVEIATWAKQF-GCKADSLPINYLGFSL

Query:  GG-NYRRSIFWDPLLERLNRKLDSWRNFP--ISKGGRVTFAQSVLNSLPLYQFSLLKAPKSVIKSMEKIIRDFVWSG
            Y  S  +  L E +  +L  W+ F   +S  GR      ++ S   Y+   L   +  I  +++ + DF+W G
Subjt:  GG-NYRRSIFWDPLLERLNRKLDSWRNFP--ISKGGRVTFAQSVLNSLPLYQFSLLKAPKSVIKSMEKIIRDFVWSG

Q03274 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)3.7e-1626.29Show/hide
Query:  LIPKKKKASNVKDYRPINLVTSLYKLIAKVLAERLKKVLPLTISNWQAAFVHGRQILDAILVAAEAVEDKRCRKDNGVLLKLDLEKAYDMVNWDFLDTIL
        LIPK     N  ++RPI + ++L +L+ ++LA+RL+  + L  +    A + G  +++++L+       +  RK   V + LD+ KA+D V+   +   L
Subjt:  LIPKKKKASNVKDYRPINLVTSLYKLIAKVLAERLKKVLPLTISNWQAAFVHGRQILDAILVAAEAVEDKRCRKDNGVLLKLDLEKAYDMVNWDFLDTIL

Query:  QLKGFGHRWRKWIKGCLTNSNFSVMIN-GRLRGKIMASRGLRQGDSLSPFLFTIVGDAISRSVQFCLEKKILNGWKIGKDGVMVSLLQYADDTL------
        Q  G       +I G L++S  ++ +  G    KI   RG++QGD LSPFLF  V D +  S    L+     G  IG++ + V  L +ADD L      
Subjt:  QLKGFGHRWRKWIKGCLTNSNFSVMIN-GRLRGKIMASRGLRQGDSLSPFLFTIVGDAISRSVQFCLEKKILNGWKIGKDGVMVSLLQYADDTL------

Query:  VFIPNNAADIQKWWDILTLILKGAGLYLNMSKTSIIGVNVDSVEIATWAKQFGCKADS--LPI-------NYLG--FSLGGNYRRSIFWDPLLERLNRKL
        V +P   A +  ++ +        G+ LN  K+  I V           K F  + D+  LP+        YLG  F L G  +      P +  L+R L
Subjt:  VFIPNNAADIQKWWDILTLILKGAGLYLNMSKTSIIGVNVDSVEIATWAKQFGCKADS--LPI-------NYLG--FSLGGNYRRSIFWDPLLERLNRKL

Query:  DSWRNFPISKGGRVTFAQSVLNSLPLYQFSLLKAPKSVIKSMEKIIRDFV
              P+    +++  +  +    LY           ++  +K+IR  V
Subjt:  DSWRNFPISKGGRVTFAQSVLNSLPLYQFSLLKAPKSVIKSMEKIIRDFV

Arabidopsis top hitse value%identityAlignment
AT1G40390.1 DNAse I-like superfamily protein9.9e-0925Show/hide
Query:  ESEKKHFLQELYDLHG---LCQGVWCLVGDFNLIRWVDER--LNASN-STRNMKKFNRFIASADLIDPPLGNGKFTWS-RMGENVAASRIDRFLLSKQWV
        E+E++    ++  L     LC   W +VGDFN I  V E   L  SN S + ++     +  +DL+D P     +TWS    +N    ++DR +++  W+
Subjt:  ESEKKHFLQELYDLHG---LCQGVWCLVGDFNLIRWVDER--LNASN-STRNMKKFNRFIASADLIDPPLGNGKFTWS-RMGENVAASRIDRFLLSKQWV

Query:  DKFSDFRVERLHRPTSDHFPILLTVGANKWGP----TPFRFENIWLENSKFKEKIESWWQNMNPNGWAGFRFMEKLKGLKNQIKSWN----AIARSQAVS
          F       +  P SD       V  N   P      F++ +    +  F   I + WQ     G   F   E LK  K   +  N    +  ++Q +S
Subjt:  DKFSDFRVERLHRPTSDHFPILLTVGANKWGP----TPFRFENIWLENSKFKEKIESWWQNMNPNGWAGFRFMEKLKGLKNQIKSWN----AIARSQAVS

Query:  KKRDIVAQIDLIDRLEEQSEIQSHQIEERKRLKAELLEFALDEQRCLNQKSKIKWLKEGD
           D + + + + R                        FA   +    QKS+IKWLKEGD
Subjt:  KKRDIVAQIDLIDRLEEQSEIQSHQIEERKRLKAELLEFALDEQRCLNQKSKIKWLKEGD

AT1G43760.1 DNAse I-like superfamily protein2.9e-3227.25Show/hide
Query:  LVGDFNLIRWVDER---LNASNSTRNMKKFNRFIASADLIDPPLGNGKFTWS-RMGENVAASRIDRFLLSKQWVDKFSDFRVERLHRPTSDHFP-ILLTV
        LVGDF+ I    +    L  S   R +++F   +  +DL+D P     +TWS    +N    ++DR + +  W   F            SDH P I++  
Subjt:  LVGDFNLIRWVDER---LNASNSTRNMKKFNRFIASADLIDPPLGNGKFTWS-RMGENVAASRIDRFLLSKQWVDKFSDFRVERLHRPTSDHFP-ILLTV

Query:  GANKWGPTPFRFENIWLENSKFKEKIESWWQNMNPNGWAGFRFMEKLKGLKNQIKSWNAIARSQAVSKKRDIVAQIDLI-DRLEEQSEIQSHQIEERKRL
           K     FR+ +    +  F   +   W+   P G   F   E LK  K   K  N         K ++ +  ++ I  +L         ++E   R 
Subjt:  GANKWGPTPFRFENIWLENSKFKEKIESWWQNMNPNGWAGFRFMEKLKGLKNQIKSWNAIARSQAVSKKRDIVAQIDLI-DRLEEQSEIQSHQIEERKRL

Query:  KAELLEFALDEQRCLNQKSKIKWLKEGDENSSFFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFFDKLYEKD---LGPKFVIEGVEWMPIDSQ
        K      AL+      QKS+IKWL++GD N+ FFH+ + A + K  I  L  D    + N T+++E I++++  L   D   L P  V    +  P    
Subjt:  KAELLEFALDEQRCLNQKSKIKWLKEGDENSSFFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFFDKLYEKD---LGPKFVIEGVEWMPIDSQ

Query:  NRINMEMNF--SEEEIHKAIREMGSLKSPGPNDMMGEFFKNYWNILKLDLVEVFQEFFQNEIINKRANETYICLIPKKKKASNVKDYRPINLVTSLYKLI
        + +   ++   S++EI  A+  M   K+PGP+    EFF   W ++K   +   +EFF+   + KR N T I LIPK      +  +RP++  T +YK+I
Subjt:  NRINMEMNF--SEEEIHKAIREMGSLKSPGPNDMMGEFFKNYWNILKLDLVEVFQEFFQNEIINKRANETYICLIPKKKKASNVKDYRPINLVTSLYKLI

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein8.4e-0829.67Show/hide
Query:  FGCKADSLPINYLGFSLGGNYRRSIFWDPLLERLNRKLDSWRNFPISKGGRVTFAQSVLNSLPLYQFSLLKAPKSVIKSMEKIIRDFVWSG
        F   + +LP+ YLG  L      +  + PL+E++  ++  W    +S  GR+    SV++SL  +  S  + P + IK ++ I   F+WSG
Subjt:  FGCKADSLPINYLGFSLGGNYRRSIFWDPLLERLNRKLDSWRNFPISKGGRVTFAQSVLNSLPLYQFSLLKAPKSVIKSMEKIIRDFVWSG

AT4G20520.1 RNA binding;RNA-directed DNA polymerases3.4e-0943.21Show/hide
Query:  LAERLKKVLPLTISNWQAAFVHGRQILDAILVAAEAVEDKRCRK--DNGVLLKLDLEKAYDMVNWDFLDTILQLKGFGHRW
        + ERLK ++   I   QA+F+ GR   D I+   EAV   R +K     +LLKLDLEKAYD + WD+L+  L   GF   W
Subjt:  LAERLKKVLPLTISNWQAAFVHGRQILDAILVAAEAVEDKRCRK--DNGVLLKLDLEKAYDMVNWDFLDTILQLKGFGHRW

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)2.1e-0638.24Show/hide
Query:  MINGRLRGKIMASRGLRQGDSLSPFLFTIVGDAISRSVQFCLEKKILNGWKIGKDGVMVSLLQYADDT
        +ING  +G +  SRGLRQGD LSP+LF +  + +S   +   E+  L G ++  +   ++ L +ADDT
Subjt:  MINGRLRGKIMASRGLRQGDSLSPFLFTIVGDAISRSVQFCLEKKILNGWKIGKDGVMVSLLQYADDT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATCTCCATCGATCGCACCTAAGCCCCCGTCCGGTAGTTCTTCTGCCATAGTCTGCGATGGCGATTCCAAGAAAATTTTAGAGGGAGAGATTAGGATTCAGGAGGT
GCACTTGAAGAAAATATATGCTTCGTCAGCCAAAGAAATAGTTATAGTGTGGATCATTGACTCCATTGATGATCTCCTCAACTCCTCAAGTACCCATAAATTCTTCCATA
AAGTGGACTGTAACAATGGATTCATTTGGATCCAAAAGATATCGAACAAACAGGGGAGCTTCCTTGAAATCACAAAGGTTAACAACTCTGGTGGCAAACACAACCTCGTA
GTGTCGGCTGGAACTGAGTTCAATGGTTGGAAAAGCTTTTCAAATCTTCTAAAAGAATTCCTCAACGGTAAGGACGACCTGCAGGAACAGAGCAAGGAAAAGGAAAAGGG
AGGGGTCGGAAGAAATGAGGATGGTAAATCGTTCGCAGACATTCTAAAGAGCTCCCCAAATTATAATCCTCGAGCTGAAACAATGACCCAGAGCGACCAGCAAAAAACTA
CGGAGAAAGCTTGGAGAAATATTACTGAGATCAGAGGATATAAAGAAGAAGTGAGAAACATCGATTGGGATGAGGTCATAGTTATTACCAAAAGGGACTTTCACGATGAT
TGGGGAAGAATCTTGGAAGTTATGCAACACCAATTGATGGAGACCCTCGTTATTAATCCCTTTCATCCGGACAAGGACCTTCTTAAATGCCCATCTCGTGAGTTAGCTAC
CCTTTTAACAAAAAATATGGGGTGGGTGAGCTTCGGCCCGATTATTCTGAAGGTTGAAAAGTGGGATAAACAAATTCACAATAGAATCACGTGTATTCCAAGCTATGGGG
GTTGGATTAAAATTCGAAATCTCCCTCTTCATTTATGGAACTTACAGACATTTAAGGCTATCGGAAACTGTCTCGGTGGCTTCGTTGAATATGATGAGGCCAACTCATTG
CTCTTTCAATGCGTGGAAGTCAAAATGAAGATTAAGGAGAACTGCTGTGGTTTCATCCTTGTGGAGTTAAAAGTGGTGGATGGTGAAGAACAATTTAACGTGCAGATCGT
GACCTACCAAGAAGGAAACTTGCTGATCGACAGGGTGGCCGGAATCCATGGAAGTTTCTCGCCGGCGGCTGCACACGTTTTCCATAGAGGCCCAAATGATGCCCTCTTTT
GCACCGCAGATATTTGGAGAATTGAGAATGGGGTTGATTACCCAGTGGTTATTACTCACCAAGCAGTTAAGGAATCCAGATATTGTGGAAGACAGCTGGAAAACGCTAGT
GAAAATTTTGAATCCTCCCGCCCAAAGCAAAAAGGAAAAATTAACGAAGGAGCTGGGCCCTCTAACATAGAAGGGAAAAGCCCTTTAAAGGAAAAAGATGAGACGGTGAA
TTGGAACGGTGAGATTGGCATGGAATCAGACTTATCCTTCTCGAGCCCAGCTAGTCGTGGAAGCGAGAATGAACGGGTAGGTAAGATGAGCAACGACGAGGAAGCCGATT
ATGAATTTCCGGAGGGATATCAGCTTTGCTTCTCTAATGAGGCGGAATCCGATAAGGAAGCTCAGCCGCAAAGCAACGATGTTAAGGCTATTTCGGGTCAAGAGGATGTG
AATAGAGAAGTGATCACTCACGATGCTAACCATCGTTCGTCCCCTAGCGATTATTGCCATGATAATACTACACACTCAATGGCTTCCCCAAAGGCTCTGGCCATCATTGC
TCCTTTGATTGTCCAGAGAAATACTCTCGAAGAGGGCACTGGAGACTTCATGATTAGCAAGGAGTTGATTCTCACCCTTAGAAGAAACAATTTATGTATTAGACCGATAA
CAGGCACAAATTCTGAGAAGGGCAATTCTACTAAAAAGAAACGTAATAGAGAGGAGTCCAGGCTTAACGTCATAGATAGGAGGCTTGTTAAATCTATTTGGAGCTCCAGG
CATATAGCTTGGGCCACCTTAGACGCCATTAATTCTGCTGGGGGCATTCTTATTTTATGGAAAGAGTCAGAGATAGAAGTTGTGGAGTCGGAGAAAAAACACTTCTTACA
AGAGCTGTATGACCTTCACGGGCTTTGCCAGGGTGTTTGGTGCTTGGTAGGTGATTTTAATCTGATTAGATGGGTTGATGAAAGGCTGAACGCTAGTAATTCTACTAGAA
ATATGAAGAAGTTCAACCGCTTCATTGCTTCCGCTGACTTGATTGATCCTCCGCTGGGTAATGGTAAATTTACTTGGTCTAGAATGGGAGAGAATGTGGCTGCTTCTAGG
ATTGATAGATTCCTTTTGTCCAAGCAGTGGGTTGATAAATTTTCTGATTTCAGAGTTGAAAGGCTTCACAGACCGACTTCTGACCACTTCCCTATCTTATTGACTGTTGG
GGCAAACAAATGGGGGCCGACGCCGTTTCGTTTTGAAAATATTTGGCTAGAGAACTCTAAGTTTAAGGAGAAGATTGAGAGTTGGTGGCAGAACATGAACCCAAATGGTT
GGGCGGGTTTTCGATTTATGGAAAAGTTGAAAGGATTGAAAAACCAAATTAAAAGTTGGAATGCTATAGCTCGTTCTCAGGCCGTGTCTAAAAAAAGGGATATTGTGGCT
CAAATTGACCTCATCGATCGCCTAGAAGAACAGAGTGAGATTCAAAGCCATCAGATTGAAGAAAGGAAAAGGCTTAAAGCCGAGTTGCTTGAGTTTGCTTTAGATGAGCA
GAGGTGTCTTAACCAAAAAAGCAAAATTAAATGGCTAAAAGAGGGGGATGAAAACTCTTCATTCTTTCATAGATGGGTCACGGCTAAAAGGAACAAGGCTTTCATCTCGA
TTCTGGAAAATGACAGTGGGGATCTTCTCACGAATGAGACCAAAATTGAGGAGGAAATCCTGTCTTTCTTCGATAAGCTTTATGAGAAGGATTTAGGGCCGAAGTTTGTG
ATAGAAGGGGTGGAATGGATGCCCATTGATTCCCAAAACAGAATCAATATGGAAATGAACTTCAGTGAGGAGGAAATCCACAAGGCTATTCGGGAGATGGGAAGTTTGAA
ATCCCCCGGTCCGAACGACATGATGGGTGAGTTTTTTAAAAATTATTGGAACATCTTGAAGCTGGATTTAGTAGAGGTGTTCCAAGAATTTTTTCAAAACGAAATTATTA
ATAAACGGGCCAATGAAACATATATTTGTTTGATCCCAAAGAAGAAAAAGGCTTCCAACGTCAAAGATTATAGACCGATCAATCTAGTTACCTCCCTTTACAAGCTGATA
GCGAAAGTTCTAGCTGAGAGATTGAAGAAAGTCCTTCCTCTCACCATAAGTAATTGGCAAGCGGCTTTTGTCCATGGTAGACAGATCCTTGATGCTATCTTAGTGGCGGC
TGAAGCAGTAGAAGATAAAAGGTGTAGAAAAGATAATGGCGTGCTCTTAAAGCTTGATCTCGAAAAAGCGTACGACATGGTCAACTGGGATTTCCTCGACACTATTCTTC
AGTTGAAGGGGTTTGGCCATAGATGGAGAAAATGGATCAAAGGCTGTCTCACTAATTCAAATTTTTCGGTCATGATTAACGGGCGTCTGAGGGGAAAAATTATGGCTTCT
AGAGGGTTGAGACAAGGGGACTCGCTATCTCCCTTTTTGTTTACCATTGTCGGGGACGCTATTAGTAGATCGGTTCAGTTTTGTCTTGAGAAGAAAATTCTGAATGGCTG
GAAAATTGGGAAGGATGGTGTGATGGTATCGTTGCTACAATATGCTGATGATACTTTGGTTTTTATCCCCAACAATGCTGCAGATATTCAGAAATGGTGGGACATTTTGA
CTCTCATTCTTAAGGGAGCTGGTCTTTATTTAAATATGTCGAAGACTTCTATCATTGGGGTTAATGTCGATTCTGTGGAGATAGCTACTTGGGCCAAGCAGTTCGGCTGT
AAAGCAGATTCGTTGCCTATCAACTACTTAGGCTTTTCCCTTGGTGGCAATTACCGTAGAAGCATCTTCTGGGACCCGTTACTGGAAAGGCTAAATAGAAAGCTCGATAG
CTGGAGGAACTTCCCTATTTCAAAAGGGGGAAGAGTCACCTTTGCCCAATCCGTTCTTAATAGTCTCCCCCTTTATCAATTCTCTTTGCTTAAAGCCCCTAAATCAGTTA
TCAAGTCCATGGAGAAAATCATCAGAGACTTTGTGTGGAGTGGGGGAAGTTATAAACCCAGAGAGCGAATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAATCTCCATCGATCGCACCTAAGCCCCCGTCCGGTAGTTCTTCTGCCATAGTCTGCGATGGCGATTCCAAGAAAATTTTAGAGGGAGAGATTAGGATTCAGGAGGT
GCACTTGAAGAAAATATATGCTTCGTCAGCCAAAGAAATAGTTATAGTGTGGATCATTGACTCCATTGATGATCTCCTCAACTCCTCAAGTACCCATAAATTCTTCCATA
AAGTGGACTGTAACAATGGATTCATTTGGATCCAAAAGATATCGAACAAACAGGGGAGCTTCCTTGAAATCACAAAGGTTAACAACTCTGGTGGCAAACACAACCTCGTA
GTGTCGGCTGGAACTGAGTTCAATGGTTGGAAAAGCTTTTCAAATCTTCTAAAAGAATTCCTCAACGGTAAGGACGACCTGCAGGAACAGAGCAAGGAAAAGGAAAAGGG
AGGGGTCGGAAGAAATGAGGATGGTAAATCGTTCGCAGACATTCTAAAGAGCTCCCCAAATTATAATCCTCGAGCTGAAACAATGACCCAGAGCGACCAGCAAAAAACTA
CGGAGAAAGCTTGGAGAAATATTACTGAGATCAGAGGATATAAAGAAGAAGTGAGAAACATCGATTGGGATGAGGTCATAGTTATTACCAAAAGGGACTTTCACGATGAT
TGGGGAAGAATCTTGGAAGTTATGCAACACCAATTGATGGAGACCCTCGTTATTAATCCCTTTCATCCGGACAAGGACCTTCTTAAATGCCCATCTCGTGAGTTAGCTAC
CCTTTTAACAAAAAATATGGGGTGGGTGAGCTTCGGCCCGATTATTCTGAAGGTTGAAAAGTGGGATAAACAAATTCACAATAGAATCACGTGTATTCCAAGCTATGGGG
GTTGGATTAAAATTCGAAATCTCCCTCTTCATTTATGGAACTTACAGACATTTAAGGCTATCGGAAACTGTCTCGGTGGCTTCGTTGAATATGATGAGGCCAACTCATTG
CTCTTTCAATGCGTGGAAGTCAAAATGAAGATTAAGGAGAACTGCTGTGGTTTCATCCTTGTGGAGTTAAAAGTGGTGGATGGTGAAGAACAATTTAACGTGCAGATCGT
GACCTACCAAGAAGGAAACTTGCTGATCGACAGGGTGGCCGGAATCCATGGAAGTTTCTCGCCGGCGGCTGCACACGTTTTCCATAGAGGCCCAAATGATGCCCTCTTTT
GCACCGCAGATATTTGGAGAATTGAGAATGGGGTTGATTACCCAGTGGTTATTACTCACCAAGCAGTTAAGGAATCCAGATATTGTGGAAGACAGCTGGAAAACGCTAGT
GAAAATTTTGAATCCTCCCGCCCAAAGCAAAAAGGAAAAATTAACGAAGGAGCTGGGCCCTCTAACATAGAAGGGAAAAGCCCTTTAAAGGAAAAAGATGAGACGGTGAA
TTGGAACGGTGAGATTGGCATGGAATCAGACTTATCCTTCTCGAGCCCAGCTAGTCGTGGAAGCGAGAATGAACGGGTAGGTAAGATGAGCAACGACGAGGAAGCCGATT
ATGAATTTCCGGAGGGATATCAGCTTTGCTTCTCTAATGAGGCGGAATCCGATAAGGAAGCTCAGCCGCAAAGCAACGATGTTAAGGCTATTTCGGGTCAAGAGGATGTG
AATAGAGAAGTGATCACTCACGATGCTAACCATCGTTCGTCCCCTAGCGATTATTGCCATGATAATACTACACACTCAATGGCTTCCCCAAAGGCTCTGGCCATCATTGC
TCCTTTGATTGTCCAGAGAAATACTCTCGAAGAGGGCACTGGAGACTTCATGATTAGCAAGGAGTTGATTCTCACCCTTAGAAGAAACAATTTATGTATTAGACCGATAA
CAGGCACAAATTCTGAGAAGGGCAATTCTACTAAAAAGAAACGTAATAGAGAGGAGTCCAGGCTTAACGTCATAGATAGGAGGCTTGTTAAATCTATTTGGAGCTCCAGG
CATATAGCTTGGGCCACCTTAGACGCCATTAATTCTGCTGGGGGCATTCTTATTTTATGGAAAGAGTCAGAGATAGAAGTTGTGGAGTCGGAGAAAAAACACTTCTTACA
AGAGCTGTATGACCTTCACGGGCTTTGCCAGGGTGTTTGGTGCTTGGTAGGTGATTTTAATCTGATTAGATGGGTTGATGAAAGGCTGAACGCTAGTAATTCTACTAGAA
ATATGAAGAAGTTCAACCGCTTCATTGCTTCCGCTGACTTGATTGATCCTCCGCTGGGTAATGGTAAATTTACTTGGTCTAGAATGGGAGAGAATGTGGCTGCTTCTAGG
ATTGATAGATTCCTTTTGTCCAAGCAGTGGGTTGATAAATTTTCTGATTTCAGAGTTGAAAGGCTTCACAGACCGACTTCTGACCACTTCCCTATCTTATTGACTGTTGG
GGCAAACAAATGGGGGCCGACGCCGTTTCGTTTTGAAAATATTTGGCTAGAGAACTCTAAGTTTAAGGAGAAGATTGAGAGTTGGTGGCAGAACATGAACCCAAATGGTT
GGGCGGGTTTTCGATTTATGGAAAAGTTGAAAGGATTGAAAAACCAAATTAAAAGTTGGAATGCTATAGCTCGTTCTCAGGCCGTGTCTAAAAAAAGGGATATTGTGGCT
CAAATTGACCTCATCGATCGCCTAGAAGAACAGAGTGAGATTCAAAGCCATCAGATTGAAGAAAGGAAAAGGCTTAAAGCCGAGTTGCTTGAGTTTGCTTTAGATGAGCA
GAGGTGTCTTAACCAAAAAAGCAAAATTAAATGGCTAAAAGAGGGGGATGAAAACTCTTCATTCTTTCATAGATGGGTCACGGCTAAAAGGAACAAGGCTTTCATCTCGA
TTCTGGAAAATGACAGTGGGGATCTTCTCACGAATGAGACCAAAATTGAGGAGGAAATCCTGTCTTTCTTCGATAAGCTTTATGAGAAGGATTTAGGGCCGAAGTTTGTG
ATAGAAGGGGTGGAATGGATGCCCATTGATTCCCAAAACAGAATCAATATGGAAATGAACTTCAGTGAGGAGGAAATCCACAAGGCTATTCGGGAGATGGGAAGTTTGAA
ATCCCCCGGTCCGAACGACATGATGGGTGAGTTTTTTAAAAATTATTGGAACATCTTGAAGCTGGATTTAGTAGAGGTGTTCCAAGAATTTTTTCAAAACGAAATTATTA
ATAAACGGGCCAATGAAACATATATTTGTTTGATCCCAAAGAAGAAAAAGGCTTCCAACGTCAAAGATTATAGACCGATCAATCTAGTTACCTCCCTTTACAAGCTGATA
GCGAAAGTTCTAGCTGAGAGATTGAAGAAAGTCCTTCCTCTCACCATAAGTAATTGGCAAGCGGCTTTTGTCCATGGTAGACAGATCCTTGATGCTATCTTAGTGGCGGC
TGAAGCAGTAGAAGATAAAAGGTGTAGAAAAGATAATGGCGTGCTCTTAAAGCTTGATCTCGAAAAAGCGTACGACATGGTCAACTGGGATTTCCTCGACACTATTCTTC
AGTTGAAGGGGTTTGGCCATAGATGGAGAAAATGGATCAAAGGCTGTCTCACTAATTCAAATTTTTCGGTCATGATTAACGGGCGTCTGAGGGGAAAAATTATGGCTTCT
AGAGGGTTGAGACAAGGGGACTCGCTATCTCCCTTTTTGTTTACCATTGTCGGGGACGCTATTAGTAGATCGGTTCAGTTTTGTCTTGAGAAGAAAATTCTGAATGGCTG
GAAAATTGGGAAGGATGGTGTGATGGTATCGTTGCTACAATATGCTGATGATACTTTGGTTTTTATCCCCAACAATGCTGCAGATATTCAGAAATGGTGGGACATTTTGA
CTCTCATTCTTAAGGGAGCTGGTCTTTATTTAAATATGTCGAAGACTTCTATCATTGGGGTTAATGTCGATTCTGTGGAGATAGCTACTTGGGCCAAGCAGTTCGGCTGT
AAAGCAGATTCGTTGCCTATCAACTACTTAGGCTTTTCCCTTGGTGGCAATTACCGTAGAAGCATCTTCTGGGACCCGTTACTGGAAAGGCTAAATAGAAAGCTCGATAG
CTGGAGGAACTTCCCTATTTCAAAAGGGGGAAGAGTCACCTTTGCCCAATCCGTTCTTAATAGTCTCCCCCTTTATCAATTCTCTTTGCTTAAAGCCCCTAAATCAGTTA
TCAAGTCCATGGAGAAAATCATCAGAGACTTTGTGTGGAGTGGGGGAAGTTATAAACCCAGAGAGCGAATTTAG
Protein sequenceShow/hide protein sequence
MESPSIAPKPPSGSSSAIVCDGDSKKILEGEIRIQEVHLKKIYASSAKEIVIVWIIDSIDDLLNSSSTHKFFHKVDCNNGFIWIQKISNKQGSFLEITKVNNSGGKHNLV
VSAGTEFNGWKSFSNLLKEFLNGKDDLQEQSKEKEKGGVGRNEDGKSFADILKSSPNYNPRAETMTQSDQQKTTEKAWRNITEIRGYKEEVRNIDWDEVIVITKRDFHDD
WGRILEVMQHQLMETLVINPFHPDKDLLKCPSRELATLLTKNMGWVSFGPIILKVEKWDKQIHNRITCIPSYGGWIKIRNLPLHLWNLQTFKAIGNCLGGFVEYDEANSL
LFQCVEVKMKIKENCCGFILVELKVVDGEEQFNVQIVTYQEGNLLIDRVAGIHGSFSPAAAHVFHRGPNDALFCTADIWRIENGVDYPVVITHQAVKESRYCGRQLENAS
ENFESSRPKQKGKINEGAGPSNIEGKSPLKEKDETVNWNGEIGMESDLSFSSPASRGSENERVGKMSNDEEADYEFPEGYQLCFSNEAESDKEAQPQSNDVKAISGQEDV
NREVITHDANHRSSPSDYCHDNTTHSMASPKALAIIAPLIVQRNTLEEGTGDFMISKELILTLRRNNLCIRPITGTNSEKGNSTKKKRNREESRLNVIDRRLVKSIWSSR
HIAWATLDAINSAGGILILWKESEIEVVESEKKHFLQELYDLHGLCQGVWCLVGDFNLIRWVDERLNASNSTRNMKKFNRFIASADLIDPPLGNGKFTWSRMGENVAASR
IDRFLLSKQWVDKFSDFRVERLHRPTSDHFPILLTVGANKWGPTPFRFENIWLENSKFKEKIESWWQNMNPNGWAGFRFMEKLKGLKNQIKSWNAIARSQAVSKKRDIVA
QIDLIDRLEEQSEIQSHQIEERKRLKAELLEFALDEQRCLNQKSKIKWLKEGDENSSFFHRWVTAKRNKAFISILENDSGDLLTNETKIEEEILSFFDKLYEKDLGPKFV
IEGVEWMPIDSQNRINMEMNFSEEEIHKAIREMGSLKSPGPNDMMGEFFKNYWNILKLDLVEVFQEFFQNEIINKRANETYICLIPKKKKASNVKDYRPINLVTSLYKLI
AKVLAERLKKVLPLTISNWQAAFVHGRQILDAILVAAEAVEDKRCRKDNGVLLKLDLEKAYDMVNWDFLDTILQLKGFGHRWRKWIKGCLTNSNFSVMINGRLRGKIMAS
RGLRQGDSLSPFLFTIVGDAISRSVQFCLEKKILNGWKIGKDGVMVSLLQYADDTLVFIPNNAADIQKWWDILTLILKGAGLYLNMSKTSIIGVNVDSVEIATWAKQFGC
KADSLPINYLGFSLGGNYRRSIFWDPLLERLNRKLDSWRNFPISKGGRVTFAQSVLNSLPLYQFSLLKAPKSVIKSMEKIIRDFVWSGGSYKPRERI