; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0040826 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0040826
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr13:8711855..8715762
RNA-Seq ExpressionLag0040826
SyntenyLag0040826
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR025558 - Domain of unknown function DUF4283
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039309.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]5.6e-18434.28Show/hide
Query:  KNCIMVPDGYDRSGWNTFLAMLTFKEKKERTVNQMGPPRDQPREHDNRQKSPDSSSSDSSQRSYAD-VLKNPSEVESGPSGGENDLSGIKGQFNKTGETE
        K+CI+VP+G ++S W +FL+M+T K + +     +  PR  P   + R   P     D  +RSYA  V +  S + S  S         +   N   ++ 
Subjt:  KNCIMVPDGYDRSGWNTFLAMLTFKEKKERTVNQMGPPRDQPREHDNRQKSPDSSSSDSSQRSYAD-VLKNPSEVESGPSGGENDLSGIKGQFNKTGETE

Query:  FSFEWNKVVILSRRCFHDDWTRIFKKIRELNDNVLSYSPFHADKAILYLRSSDQAQLLCKNRGWVTIGHFYVKYEKWEPGKHVVPKLIPSYGGWVKFRGI
        F       V+L RR FHDDW +I + +R+  +   +Y+ FHA+K +++  S+  A LLC+N+GW T+G + V++EKW P  H  PKLIPSYGGW  FRGI
Subjt:  FSFEWNKVVILSRRCFHDDWTRIFKKIRELNDNVLSYSPFHADKAILYLRSSDQAQLLCKNRGWVTIGHFYVKYEKWEPGKHVVPKLIPSYGGWVKFRGI

Query:  PLHAWNMKTFTQLGDACGGFIEVSKNTWRKLDLIDVVIKIKENFCGFIPAAICIRDDKGEAYTVRTVAPEKAKWHICRNPKVHGTFMRGAAMEYDEYDVK
        PLH WNM TF Q+G ACGG I+V++ T    +LI+  +KI+ N+ GF+PA + I D +G  + V+ V   + KW + RN ++HGTF R AA  +D+++  
Subjt:  PLHAWNMKTFTQLGDACGGFIEVSKNTWRKLDLIDVVIKIKENFCGFIPAAICIRDDKGEAYTVRTVAPEKAKWHICRNPKVHGTFMRGAAMEYDEYDVK

Query:  SEPFTFRGEAACTVYELNAVADSLNASCLNN----SNNVVRPSKRSLPAATI-----------------------------SKDKGKKSAQAPRRIVSVD
        SE F F G  A +   LN ++ S  +          + +++P+K +    T+                             S DKGK+    P ++ S  
Subjt:  SEPFTFRGEAACTVYELNAVADSLNASCLNN----SNNVVRPSKRSLPAATI-----------------------------SKDKGKKSAQAPRRIVSVD

Query:  LGYL--------SPISDPAFSSPEDPPSSNQHSKDSSPSTIAQDSLKTLLQ-VDMNMDINQRPGSDEINIGQEQNEKRNDL-MYKKQLNEWL-LENNFCL
        + Y         SP +   F +P+  P++  HS +       + S+K     +   +  NQ  G+    I Q      +DL   KK L+  + L N   L
Subjt:  LGYL--------SPISDPAFSSPEDPPSSNQHSKDSSPSTIAQDSLKTLLQ-VDMNMDINQRPGSDEINIGQEQNEKRNDL-MYKKQLNEWL-LENNFCL

Query:  VPTKCLSEDVN------VTVNNHDVMGPDNIVILTETKISSISKKL-------------------------------------IKSLWSSISINWSA---
         P+K   +  +      + + N +V+     + +T+ + S+ S ++                                     +K     +SI+  +   
Subjt:  VPTKCLSEDVN------VTVNNHDVMGPDNIVILTETKISSISKKL-------------------------------------IKSLWSSISINWSA---

Query:  -------LDSIGKS-GGILILWDELNVSAIETLKGIFSLSINFRLADGFCWWLSGIYGPASRRNRKNFWTELHDLGGLCNDCWLLGGDFNVFRWNHETSS
                  +G S GGILILWD  + S +   +G FSLS NF   +   WWL+G+YGP  RR R N W +LH+L  L +  W++GGD NV R   E+++
Subjt:  -------LDSIGKS-GGILILWDELNVSAIETLKGIFSLSINFRLADGFCWWLSGIYGPASRRNRKNFWTELHDLGGLCNDCWLLGGDFNVFRWNHETSS

Query:  PRPAKKSMRRFNSSLM--------------------------------------------IFRLLTRPSLMAITPVLEN--PEFRWGPCPFRFDNYLLHE
           +  S    N  +                                             I R L RP+      V E+     RWGP PFR ++  L++
Subjt:  PRPAKKSMRRFNSSLM--------------------------------------------IFRLLTRPSLMAITPVLEN--PEFRWGPCPFRFDNYLLHE

Query:  KDFTKNIKFWWDSTYQEGFPGYSFIRRLKQLVSNIKLWKRKNRDLLKSQKQIISDEIVKIDRLENLGCLTDQMCQKRTSLKSDLHQVSMQEIRMWKQRCK
         +F +N++ WW+ + Q G PG+ FI+RLK L + IK W+++    L S K+ I  E+  ID+ E    L+ +   +R +LK++L+ +S++E + W QR K
Subjt:  KDFTKNIKFWWDSTYQEGFPGYSFIRRLKQLVSNIKLWKRKNRDLLKSQKQIISDEIVKIDRLENLGCLTDQMCQKRTSLKSDLHQVSMQEIRMWKQRCK

Query:  KTWLKEGDENTTFFHKVCTARRRRCFISEMISGNGVSLVTDSQMELEVINHFSNIYKGPT-SEGWMVSNLNWCPISGSEASSLIQPFSELEVFQNLKAMG
        K WLKEGDEN+ FFH++C++R++R  I E+    G    T++ + L  +NHFS IY+  T  +   + NL W PI  S+ S L  PFSE E+   +K+  
Subjt:  KTWLKEGDENTTFFHKVCTARRRRCFISEMISGNGVSLVTDSQMELEVINHFSNIYKGPT-SEGWMVSNLNWCPISGSEASSLIQPFSELEVFQNLKAMG

Query:  HNKSPGPDGFTVEFFRKSW--------------------------IAITQLWKCSMTSIRKDFRPISLTTVLYRLIAKTLAERLKSTLQGTISENQLAFV
         NK+PGPDGF + FF+  W                            I  + K    S  KDFRPISLTT +Y+ IAKTL+ RLK TL  TIS NQLAF+
Subjt:  HNKSPGPDGFTVEFFRKSW--------------------------IAITQLWKCSMTSIRKDFRPISLTTVLYRLIAKTLAERLKSTLQGTISENQLAFV

Query:  KGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRACITSVSYSILLNGKPRGKIKAERGIRQGDPISPFI
        K RQITDAIL+ANEA+D+WK  + +GF++KLDIEKAFD + WNF+D +L    Y  +WRKWIR CI++V+YSI++NGKP+G+IKA RG+RQGDP+S F+
Subjt:  KGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRACITSVSYSILLNGKPRGKIKAERGIRQGDPISPFI

KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]6.9e-19035.23Show/hide
Query:  GNKNCIMVPDGYDRSGWNTFLAMLTFKEKKERTVNQMGPPRDQPREHDNRQKSPDSSSSDSSQRSYADVLKNPSEVESGPSGGENDLSGIKGQFNKTGET
        G K  I+VP+G D  GW +FLA++TF+     T       R +P    +      SS SDSS++SYA VL + SE ++              + +  G  
Subjt:  GNKNCIMVPDGYDRSGWNTFLAMLTFKEKKERTVNQMGPPRDQPREHDNRQKSPDSSSSDSSQRSYADVLKNPSEVESGPSGGENDLSGIKGQFNKTGET

Query:  EFSFEWN---KVVILSRRCFHDDWTRIFKKIRELNDNVLSYSPFHADKAILYLRSSDQAQLLCKNR---GWVTIGHFYVKYEKWEPGKHVVPKLIPSYGG
         F+   N   K VI++RRCFHDDW RI   +R+ ++   SY PF ADKAIL+L + D A+LLC N+   GW T+G++ VK+E W+   H    +IPSYGG
Subjt:  EFSFEWN---KVVILSRRCFHDDWTRIFKKIRELNDNVLSYSPFHADKAILYLRSSDQAQLLCKNR---GWVTIGHFYVKYEKWEPGKHVVPKLIPSYGG

Query:  WVKFRGIPLHAWNMKTFTQLGDACGGFIEVSKNTWRKLDLIDVVIKIKENFCGFIPAAICIRDDKGEAYTVRTVAPEKAKWHICRNPKVHGTFMRGAAME
        W++FRGIPLH WN  TF  +G ACGGF++V+K T +   LID  IK++ N+ GF+PA+I I D++GE + V TV P +A+W + RN +VHG+F   AA E
Subjt:  WVKFRGIPLHAWNMKTFTQLGDACGGFIEVSKNTWRKLDLIDVVIKIKENFCGFIPAAICIRDDKGEAYTVRTVAPEKAKWHICRNPKVHGTFMRGAAME

Query:  YDEYDVKSEPFTFRGEAAC-----------TVYELNAVADSLNASCLNNSNNVVRPSKRSLPAATISKDKGK-----------KSAQAPRRIVSVDLGYL
        +D+++  +E +T+ G  A            +++  +  + S +     N+++           +   K+KGK             ++  +RI +  + +L
Subjt:  YDEYDVKSEPFTFRGEAAC-----------TVYELNAVADSLNASCLNNSNNVVRPSKRSLPAATISKDKGK-----------KSAQAPRRIVSVDLGYL

Query:  SP-----------------------ISDP---------------AFSSPEDPPSSNQHSKDSSPSTIAQDSLKTLLQVDMN--------MDINQRPGSDE
        SP                       I+D                 +   +DP  S +  K S   T  + S +  L VDM         +      G D 
Subjt:  SP-----------------------ISDP---------------AFSSPEDPPSSNQHSKDSSPSTIAQDSLKTLLQVDMN--------MDINQRPGSDE

Query:  I-NIGQEQNEKRNDLMYKKQLNEWLLE---NNFCLVPTKCLSEDVNVTVNNH---DVMGPDNIVILTETKISSISKKLIKSLWSS----ISINWSALD--
          N   + N K  D    K L   + E    N     +       +    +    D    + +VI  +     +S K    + SS    + ++   +D  
Subjt:  I-NIGQEQNEKRNDLMYKKQLNEWLLE---NNFCLVPTKCLSEDVNVTVNNH---DVMGPDNIVILTETKISSISKKLIKSLWSS----ISINWSALD--

Query:  ---SIGKSGGILILWDELNVSAIETLKGIFSLSINFRLADGFCWWLSGIYGPASRRNRKNFWTELHDLGGLCNDCWLLGGDFNVFRWNHETSSPRPAKKS
            +G  GGIL+LWD+ N    +   G +S+S+N    +G  WWL+ +YGP    +R   W EL  L  LC   WL+ GDFN+ RW  ET++    K++
Subjt:  ---SIGKSGGILILWDELNVSAIETLKGIFSLSINFRLADGFCWWLSGIYGPASRRNRKNFWTELHDLGGLCNDCWLLGGDFNVFRWNHETSSPRPAKKS

Query:  MRRFNSSLMIFRLLTRPSL--------MAITP------------------------------------VLENPEFRWGPCPFRFDNYLLHEKDFTKNIKF
        M  FN+ + +  L+  P L        + + P                                    +LE+P+ +WGPCPFR +N  L +K+F KN   
Subjt:  MRRFNSSLMIFRLLTRPSL--------MAITP------------------------------------VLENPEFRWGPCPFRFDNYLLHEKDFTKNIKF

Query:  WWDSTYQEGFPGYSFIRRLKQLVSNIKLWKRKNRDLLKSQKQIISDEIVKIDRLENLGCLTDQMCQKRTSLKSDLHQVSMQEIRMWKQRCKKTWLKEGDE
        WW+S+ Q GFPGY+FI+ L  L   IK W+    +L  + K+ +  EI  ID+LE  G ++    QKR SLKSDL  +   + ++W QR ++ W   GDE
Subjt:  WWDSTYQEGFPGYSFIRRLKQLVSNIKLWKRKNRDLLKSQKQIISDEIVKIDRLENLGCLTDQMCQKRTSLKSDLHQVSMQEIRMWKQRCKKTWLKEGDE

Query:  NTTFFHKVCTARRRRCFISEMISGNGVSLVTDSQMELEVINHFSNIYKGPTSEGWMVSNLNWCPISGSEASSLIQPFSELEVFQNLKAMGHNKSPGPDGF
        N ++FH++CT  +R+  I  +    G SL +   +    I+HF NIY   + E  ++ NL+W PIS    S L +PF E E+   + +  + K+PGPDG+
Subjt:  NTTFFHKVCTARRRRCFISEMISGNGVSLVTDSQMELEVINHFSNIYKGPTSEGWMVSNLNWCPISGSEASSLIQPFSELEVFQNLKAMGHNKSPGPDGF

Query:  TVEFFRKSW-----------------------------IAITQLWKCSMTSIRKDFRPISLTTVLYRLIAKTLAERLKSTLQGTISENQLAFVKGRQITD
        T+ F++K W                               I++  KCS  S   D+RPISLTT LY+++AK LA RLKS L  TI+ENQ+AF+KGRQI D
Subjt:  TVEFFRKSW-----------------------------IAITQLWKCSMTSIRKDFRPISLTTVLYRLIAKTLAERLKSTLQGTISENQLAFVKGRQITD

Query:  AILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRACITSVSYSILLNGKPRGKIKAERGIRQGDPISPFI
        AIL+ANEAID WK  + +GFV+KLDIEKAFDKI W+F+D +LA K +   WRKWI+ACI++V YSILLNG P+G+IKAERGIRQGDP+SPFI
Subjt:  AILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRACITSVSYSILLNGKPRGKIKAERGIRQGDPISPFI

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.6e-18333.88Show/hide
Query:  KNCIMVPDGYDRSGWNTFLAMLTFKEKKERTVNQMGPPRDQPREHDNRQKSPDSSSSDSSQRSYADVLKNPSEVESGPSGGENDLS-GIKGQFNKTGETE
        K+CI+VP+G D+SGW +FL+M+T K + +        PR  P   D R   P     D  +RSYA  +       +  S    D S       N   ++ 
Subjt:  KNCIMVPDGYDRSGWNTFLAMLTFKEKKERTVNQMGPPRDQPREHDNRQKSPDSSSSDSSQRSYADVLKNPSEVESGPSGGENDLS-GIKGQFNKTGETE

Query:  FSFEWNKVVILSRRCFHDDWTRIFKKIRELNDNVLSYSPFHADKAILYLRSSDQAQLLCKNRGWVTIGHFYVKYEKWEPGKHVVPKLIPSYGGWVKFRGI
         S      V++ RR FHDDW +I + +R+  +   +Y+ FHA+KA+++  S+  A LLC+N+GW T+G + V++EKW P  H  PKLIPSYGGW  FRGI
Subjt:  FSFEWNKVVILSRRCFHDDWTRIFKKIRELNDNVLSYSPFHADKAILYLRSSDQAQLLCKNRGWVTIGHFYVKYEKWEPGKHVVPKLIPSYGGWVKFRGI

Query:  PLHAWNMKTFTQLGDACGGFIEVSKNTWRKLDLIDVVIKIKENFCGFIPAAICIRDDKGEAYTVRTVAPEKAKWHICRNPKVHGTFMRGAAMEYDEYDVK
        PLH WNM TF Q+G AC G I+V++ T    +LI+  IK++ N+ GF+PA + I D++G  + V+ V   + KW I RN ++HGTF R AA  +D+++ +
Subjt:  PLHAWNMKTFTQLGDACGGFIEVSKNTWRKLDLIDVVIKIKENFCGFIPAAICIRDDKGEAYTVRTVAPEKAKWHICRNPKVHGTFMRGAAMEYDEYDVK

Query:  SEPFTFRGEAACTVYELNAVADS----------------------------LNASCLNNSNNVVRPSKRSLPAAT-ISK----DKGKKSAQ---APRRIV
        SE F F G  A +   L+  +D                             LN   +N+SN     +K  L   + IS     DKGK+       P   +
Subjt:  SEPFTFRGEAACTVYELNAVADS----------------------------LNASCLNNSNNVVRPSKRSLPAAT-ISK----DKGKKSAQ---APRRIV

Query:  SVD-----LGYLSPISDPAFSSPEDPPSSNQHS----------------KDSSPST--------------------IAQD------------SLKTLLQV
        ++D     + + SP +     +P+  P+++  S                K  S ST                    +A D             L  L  +
Subjt:  SVD-----LGYLSPISDPAFSSPEDPPSSNQHS----------------KDSSPST--------------------IAQD------------SLKTLLQV

Query:  DMNMDINQRPGSDEINI---------------------------------------------GQEQNEKRNDL-MYKKQLNEWLLENNFCLVPTKCLSED
        D N  +     SD   +                                              +E+ EK  D   +KKQL  WL +N   L      ++ 
Subjt:  DMNMDINQRPGSDEINI---------------------------------------------GQEQNEKRNDL-MYKKQLNEWLLENNFCLVPTKCLSED

Query:  VNVTVNNHDVMGPDNIVILTETKISSISKKLIKSLWSSISINWSALDSIGKSGGILILWDELNVSAIETLKGIFSLSINFRLADGFCWWLSGIYGPASRR
           T + + ++   N      + +   +K++IKSLW S SINW A ++ G SGGILILWD  N S +   +G+FSLS NF L +   WWL+G+YGP  RR
Subjt:  VNVTVNNHDVMGPDNIVILTETKISSISKKLIKSLWSSISINWSALDSIGKSGGILILWDELNVSAIETLKGIFSLSINFRLADGFCWWLSGIYGPASRR

Query:  NRKNFWTELHDLGGLCNDCWLLGGDFNVFRWNHETSSPRPAKKSMRRFN----SSLMIFRLLTR-----------PSLMAITPVL---------------
         R +FW ELH+L  L +  W+LGGD NV R   E++S   +  + R  N    ++L+I   LT            P+   I   L               
Subjt:  NRKNFWTELHDLGGLCNDCWLLGGDFNVFRWNHETSSPRPAKKSMRRFN----SSLMIFRLLTR-----------PSLMAITPVL---------------

Query:  ----------------ENPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWDSTYQEGFPGYSFIRRLKQLVSNIKLWKRKNRDLLKSQKQIISDEIVKIDRL
                         NP+  WGP PFR ++  L + +F +N+  WW+++ Q G+PG+SFI+RLK L + IK W+++    L   K+ I  E+  ID+ 
Subjt:  ----------------ENPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWDSTYQEGFPGYSFIRRLKQLVSNIKLWKRKNRDLLKSQKQIISDEIVKIDRL

Query:  ENLGCLTDQMCQKRTSLKSDLHQVSMQEIRMWKQRCKKTWLKEGDENTTFFHKVCTARRRRCFISEMISGNGVSLVTDSQMELEVINHFSNIYKGPT-SE
        E    LT +   +R +LK+DL ++S++E + W QR KK WL+EGDEN++FFH++C++R++R FI E+    G    T++ +    I  FS IY+  T S+
Subjt:  ENLGCLTDQMCQKRTSLKSDLHQVSMQEIRMWKQRCKKTWLKEGDENTTFFHKVCTARRRRCFISEMISGNGVSLVTDSQMELEVINHFSNIYKGPT-SE

Query:  GWMVSNLNWCPISGSEASSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFRKSWIAITQLWKCSMTSIRKDFRPISLTTVLYRLIAKTLAERLKSTLQ
           + NL+W PI+ SE S L  PF E E+   + +    K+PGPDGF + FF+  W                                      LK+TL 
Subjt:  GWMVSNLNWCPISGSEASSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFRKSWIAITQLWKCSMTSIRKDFRPISLTTVLYRLIAKTLAERLKSTLQ

Query:  GTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRACITSVSYSILLNGKPRGKIKAERGI
         TIS NQLAFVK RQITDAIL+ANEA+D+WK  + +GF++KLDIEKAFD +  +F+D +L  K +   WRKWIR CI++V+YS+++NG+P+G+IKA RG+
Subjt:  GTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRACITSVSYSILLNGKPRGKIKAERGI

Query:  RQGDPISPFI
        RQGDP+SPF+
Subjt:  RQGDPISPFI

TYK00493.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]5.6e-18434.28Show/hide
Query:  KNCIMVPDGYDRSGWNTFLAMLTFKEKKERTVNQMGPPRDQPREHDNRQKSPDSSSSDSSQRSYAD-VLKNPSEVESGPSGGENDLSGIKGQFNKTGETE
        K+CI+VP+G ++S W +FL+M+T K + +     +  PR  P   + R   P     D  +RSYA  V +  S + S  S         +   N   ++ 
Subjt:  KNCIMVPDGYDRSGWNTFLAMLTFKEKKERTVNQMGPPRDQPREHDNRQKSPDSSSSDSSQRSYAD-VLKNPSEVESGPSGGENDLSGIKGQFNKTGETE

Query:  FSFEWNKVVILSRRCFHDDWTRIFKKIRELNDNVLSYSPFHADKAILYLRSSDQAQLLCKNRGWVTIGHFYVKYEKWEPGKHVVPKLIPSYGGWVKFRGI
        F       V+L RR FHDDW +I + +R+  +   +Y+ FHA+K +++  S+  A LLC+N+GW T+G + V++EKW P  H  PKLIPSYGGW  FRGI
Subjt:  FSFEWNKVVILSRRCFHDDWTRIFKKIRELNDNVLSYSPFHADKAILYLRSSDQAQLLCKNRGWVTIGHFYVKYEKWEPGKHVVPKLIPSYGGWVKFRGI

Query:  PLHAWNMKTFTQLGDACGGFIEVSKNTWRKLDLIDVVIKIKENFCGFIPAAICIRDDKGEAYTVRTVAPEKAKWHICRNPKVHGTFMRGAAMEYDEYDVK
        PLH WNM TF Q+G ACGG I+V++ T    +LI+  +KI+ N+ GF+PA + I D +G  + V+ V   + KW + RN ++HGTF R AA  +D+++  
Subjt:  PLHAWNMKTFTQLGDACGGFIEVSKNTWRKLDLIDVVIKIKENFCGFIPAAICIRDDKGEAYTVRTVAPEKAKWHICRNPKVHGTFMRGAAMEYDEYDVK

Query:  SEPFTFRGEAACTVYELNAVADSLNASCLNN----SNNVVRPSKRSLPAATI-----------------------------SKDKGKKSAQAPRRIVSVD
        SE F F G  A +   LN ++ S  +          + +++P+K +    T+                             S DKGK+    P ++ S  
Subjt:  SEPFTFRGEAACTVYELNAVADSLNASCLNN----SNNVVRPSKRSLPAATI-----------------------------SKDKGKKSAQAPRRIVSVD

Query:  LGYL--------SPISDPAFSSPEDPPSSNQHSKDSSPSTIAQDSLKTLLQ-VDMNMDINQRPGSDEINIGQEQNEKRNDL-MYKKQLNEWL-LENNFCL
        + Y         SP +   F +P+  P++  HS +       + S+K     +   +  NQ  G+    I Q      +DL   KK L+  + L N   L
Subjt:  LGYL--------SPISDPAFSSPEDPPSSNQHSKDSSPSTIAQDSLKTLLQ-VDMNMDINQRPGSDEINIGQEQNEKRNDL-MYKKQLNEWL-LENNFCL

Query:  VPTKCLSEDVN------VTVNNHDVMGPDNIVILTETKISSISKKL-------------------------------------IKSLWSSISINWSA---
         P+K   +  +      + + N +V+     + +T+ + S+ S ++                                     +K     +SI+  +   
Subjt:  VPTKCLSEDVN------VTVNNHDVMGPDNIVILTETKISSISKKL-------------------------------------IKSLWSSISINWSA---

Query:  -------LDSIGKS-GGILILWDELNVSAIETLKGIFSLSINFRLADGFCWWLSGIYGPASRRNRKNFWTELHDLGGLCNDCWLLGGDFNVFRWNHETSS
                  +G S GGILILWD  + S +   +G FSLS NF   +   WWL+G+YGP  RR R N W +LH+L  L +  W++GGD NV R   E+++
Subjt:  -------LDSIGKS-GGILILWDELNVSAIETLKGIFSLSINFRLADGFCWWLSGIYGPASRRNRKNFWTELHDLGGLCNDCWLLGGDFNVFRWNHETSS

Query:  PRPAKKSMRRFNSSLM--------------------------------------------IFRLLTRPSLMAITPVLEN--PEFRWGPCPFRFDNYLLHE
           +  S    N  +                                             I R L RP+      V E+     RWGP PFR ++  L++
Subjt:  PRPAKKSMRRFNSSLM--------------------------------------------IFRLLTRPSLMAITPVLEN--PEFRWGPCPFRFDNYLLHE

Query:  KDFTKNIKFWWDSTYQEGFPGYSFIRRLKQLVSNIKLWKRKNRDLLKSQKQIISDEIVKIDRLENLGCLTDQMCQKRTSLKSDLHQVSMQEIRMWKQRCK
         +F +N++ WW+ + Q G PG+ FI+RLK L + IK W+++    L S K+ I  E+  ID+ E    L+ +   +R +LK++L+ +S++E + W QR K
Subjt:  KDFTKNIKFWWDSTYQEGFPGYSFIRRLKQLVSNIKLWKRKNRDLLKSQKQIISDEIVKIDRLENLGCLTDQMCQKRTSLKSDLHQVSMQEIRMWKQRCK

Query:  KTWLKEGDENTTFFHKVCTARRRRCFISEMISGNGVSLVTDSQMELEVINHFSNIYKGPT-SEGWMVSNLNWCPISGSEASSLIQPFSELEVFQNLKAMG
        K WLKEGDEN+ FFH++C++R++R  I E+    G    T++ + L  +NHFS IY+  T  +   + NL W PI  S+ S L  PFSE E+   +K+  
Subjt:  KTWLKEGDENTTFFHKVCTARRRRCFISEMISGNGVSLVTDSQMELEVINHFSNIYKGPT-SEGWMVSNLNWCPISGSEASSLIQPFSELEVFQNLKAMG

Query:  HNKSPGPDGFTVEFFRKSW--------------------------IAITQLWKCSMTSIRKDFRPISLTTVLYRLIAKTLAERLKSTLQGTISENQLAFV
         NK+PGPDGF + FF+  W                            I  + K    S  KDFRPISLTT +Y+ IAKTL+ RLK TL  TIS NQLAF+
Subjt:  HNKSPGPDGFTVEFFRKSW--------------------------IAITQLWKCSMTSIRKDFRPISLTTVLYRLIAKTLAERLKSTLQGTISENQLAFV

Query:  KGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRACITSVSYSILLNGKPRGKIKAERGIRQGDPISPFI
        K RQITDAIL+ANEA+D+WK  + +GF++KLDIEKAFD + WNF+D +L    Y  +WRKWIR CI++V+YSI++NGKP+G+IKA RG+RQGDP+S F+
Subjt:  KGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRACITSVSYSILLNGKPRGKIKAERGIRQGDPISPFI

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.0e-18935.15Show/hide
Query:  GNKNCIMVPDGYDRSGWNTFLAMLTFKEKKERTVNQMGPPRDQPREHDNRQKSPDSSSSDSSQRSYADVLKNPSEVESGPSGGENDLSGIKGQFNKTGET
        G K  I+VP+G D  GW +FLA++TF+     T       R +P    +      SS SDSS++SYA VL + SE ++              + +  G  
Subjt:  GNKNCIMVPDGYDRSGWNTFLAMLTFKEKKERTVNQMGPPRDQPREHDNRQKSPDSSSSDSSQRSYADVLKNPSEVESGPSGGENDLSGIKGQFNKTGET

Query:  EFSFEWN---KVVILSRRCFHDDWTRIFKKIRELNDNVLSYSPFHADKAILYLRSSDQAQLLCKNR---GWVTIGHFYVKYEKWEPGKHVVPKLIPSYGG
         F+   N   K VI++RRCFHDDW RI   +R+ ++   SY PF ADKAIL+L + D A+LLC N+   GW T+G++ VK+E W+   H    +IPSYGG
Subjt:  EFSFEWN---KVVILSRRCFHDDWTRIFKKIRELNDNVLSYSPFHADKAILYLRSSDQAQLLCKNR---GWVTIGHFYVKYEKWEPGKHVVPKLIPSYGG

Query:  WVKFRGIPLHAWNMKTFTQLGDACGGFIEVSKNTWRKLDLIDVVIKIKENFCGFIPAAICIRDDKGEAYTVRTVAPEKAKWHICRNPKVHGTFMRGAAME
        W++FRGIPLH WN  TF  +G ACGGF++V+K T +   LID  IK++ N+ GF+PA+I I D++GE + V TV P +A+W + RN +VHG+F   AA E
Subjt:  WVKFRGIPLHAWNMKTFTQLGDACGGFIEVSKNTWRKLDLIDVVIKIKENFCGFIPAAICIRDDKGEAYTVRTVAPEKAKWHICRNPKVHGTFMRGAAME

Query:  YDEYDVKSEPFTFRGEAAC-----------TVYELNAVADSLNASCLNNSNNVVRPSKRSLPAATISKDKGK-----------KSAQAPRRIVSVDLGYL
        +D+++  +E +T+ G  A            +++  +  + S +     N+++           +   K+KGK             ++  +RI +  + +L
Subjt:  YDEYDVKSEPFTFRGEAAC-----------TVYELNAVADSLNASCLNNSNNVVRPSKRSLPAATISKDKGK-----------KSAQAPRRIVSVDLGYL

Query:  SP-----------------------ISDP---------------AFSSPEDPPSSNQHSKDSSPSTIAQDSLKTLLQVDMN--------MDINQRPGSDE
        SP                       I+D                 +   +DP  S +  K S   T  + S +  L VDM         +      G D 
Subjt:  SP-----------------------ISDP---------------AFSSPEDPPSSNQHSKDSSPSTIAQDSLKTLLQVDMN--------MDINQRPGSDE

Query:  I-NIGQEQNEKRNDLMYKKQLNEWLLE---NNFCLVPTKCLSEDVNVTVNNH---DVMGPDNIVILTETKISSISKKLIKSLWSS----ISINWSALD--
          N   + N K  D    K L   + E    N     +       +    +    D    + +VI  +     +S K    + SS    + ++   +D  
Subjt:  I-NIGQEQNEKRNDLMYKKQLNEWLLE---NNFCLVPTKCLSEDVNVTVNNH---DVMGPDNIVILTETKISSISKKLIKSLWSS----ISINWSALD--

Query:  ---SIGKSGGILILWDELNVSAIETLKGIFSLSINFRLADGFCWWLSGIYGPASRRNRKNFWTELHDLGGLCNDCWLLGGDFNVFRWNHETSSPRPAKKS
            +G  GGIL+LWD+ N    +   G +S+S+N    +G  WWL+ +YGP    +R   W EL  L  LC   WL+ GDFN+ RW  ET++    K++
Subjt:  ---SIGKSGGILILWDELNVSAIETLKGIFSLSINFRLADGFCWWLSGIYGPASRRNRKNFWTELHDLGGLCNDCWLLGGDFNVFRWNHETSSPRPAKKS

Query:  MRRFNSSLMIFRLLTRPSL--------MAITP------------------------------------VLENPEFRWGPCPFRFDNYLLHEKDFTKNIKF
        M  FN+ + +  L+  P L        + + P                                    +LE+P+ +WGPCPFR +N  L +K+F KN   
Subjt:  MRRFNSSLMIFRLLTRPSL--------MAITP------------------------------------VLENPEFRWGPCPFRFDNYLLHEKDFTKNIKF

Query:  WWDSTYQEGFPGYSFIRRLKQLVSNIKLWKRKNRDLLKSQKQIISDEIVKIDRLENLGCLTDQMCQKRTSLKSDLHQVSMQEIRMWKQRCKKTWLKEGDE
        WW+S+ Q GFPGY+FI+ L  L   IK W+    +L  + K+ +  EI  ID+LE  G ++    QKR SLKSDL  +   + ++W QR ++ W   GDE
Subjt:  WWDSTYQEGFPGYSFIRRLKQLVSNIKLWKRKNRDLLKSQKQIISDEIVKIDRLENLGCLTDQMCQKRTSLKSDLHQVSMQEIRMWKQRCKKTWLKEGDE

Query:  NTTFFHKVCTARRRRCFISEMISGNGVSLVTDSQMELEVINHFSNIYKGPTSEGWMVSNLNWCPISGSEASSLIQPFSELEVFQNLKAMGHNKSPGPDGF
        N ++FH++CT  +R+  I  +    G SL +   +    I+HF NIY   + E  ++ NL+W PIS    S L +PF E E+   + +  + K+PGPDG+
Subjt:  NTTFFHKVCTARRRRCFISEMISGNGVSLVTDSQMELEVINHFSNIYKGPTSEGWMVSNLNWCPISGSEASSLIQPFSELEVFQNLKAMGHNKSPGPDGF

Query:  TVEFFRKSW-----------------------------IAITQLWKCSMTSIRKDFRPISLTTVLYRLIAKTLAERLKSTLQGTISENQLAFVKGRQITD
        T+ F++K W                               I++  KCS  S   D+RPISLTT LY+++AK LA RLKS L  TI+ENQ+AF+KGRQI D
Subjt:  TVEFFRKSW-----------------------------IAITQLWKCSMTSIRKDFRPISLTTVLYRLIAKTLAERLKSTLQGTISENQLAFVKGRQITD

Query:  AILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRACITSVSYSILLNGKPRGKIKAERGIRQGDPISPFI
        AIL+ANE ID WK  + +GFV+KLDIEKAFDKI W+F+D +LA K +   WRKWI+ACI++V YSILLNG P+G+IKAERGIRQGDP+SPFI
Subjt:  AILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRACITSVSYSILLNGKPRGKIKAERGIRQGDPISPFI

TrEMBL top hitse value%identityAlignment
A0A5A7TDG1 LINE-1 retrotransposable element ORF2 protein2.7e-18434.28Show/hide
Query:  KNCIMVPDGYDRSGWNTFLAMLTFKEKKERTVNQMGPPRDQPREHDNRQKSPDSSSSDSSQRSYAD-VLKNPSEVESGPSGGENDLSGIKGQFNKTGETE
        K+CI+VP+G ++S W +FL+M+T K + +     +  PR  P   + R   P     D  +RSYA  V +  S + S  S         +   N   ++ 
Subjt:  KNCIMVPDGYDRSGWNTFLAMLTFKEKKERTVNQMGPPRDQPREHDNRQKSPDSSSSDSSQRSYAD-VLKNPSEVESGPSGGENDLSGIKGQFNKTGETE

Query:  FSFEWNKVVILSRRCFHDDWTRIFKKIRELNDNVLSYSPFHADKAILYLRSSDQAQLLCKNRGWVTIGHFYVKYEKWEPGKHVVPKLIPSYGGWVKFRGI
        F       V+L RR FHDDW +I + +R+  +   +Y+ FHA+K +++  S+  A LLC+N+GW T+G + V++EKW P  H  PKLIPSYGGW  FRGI
Subjt:  FSFEWNKVVILSRRCFHDDWTRIFKKIRELNDNVLSYSPFHADKAILYLRSSDQAQLLCKNRGWVTIGHFYVKYEKWEPGKHVVPKLIPSYGGWVKFRGI

Query:  PLHAWNMKTFTQLGDACGGFIEVSKNTWRKLDLIDVVIKIKENFCGFIPAAICIRDDKGEAYTVRTVAPEKAKWHICRNPKVHGTFMRGAAMEYDEYDVK
        PLH WNM TF Q+G ACGG I+V++ T    +LI+  +KI+ N+ GF+PA + I D +G  + V+ V   + KW + RN ++HGTF R AA  +D+++  
Subjt:  PLHAWNMKTFTQLGDACGGFIEVSKNTWRKLDLIDVVIKIKENFCGFIPAAICIRDDKGEAYTVRTVAPEKAKWHICRNPKVHGTFMRGAAMEYDEYDVK

Query:  SEPFTFRGEAACTVYELNAVADSLNASCLNN----SNNVVRPSKRSLPAATI-----------------------------SKDKGKKSAQAPRRIVSVD
        SE F F G  A +   LN ++ S  +          + +++P+K +    T+                             S DKGK+    P ++ S  
Subjt:  SEPFTFRGEAACTVYELNAVADSLNASCLNN----SNNVVRPSKRSLPAATI-----------------------------SKDKGKKSAQAPRRIVSVD

Query:  LGYL--------SPISDPAFSSPEDPPSSNQHSKDSSPSTIAQDSLKTLLQ-VDMNMDINQRPGSDEINIGQEQNEKRNDL-MYKKQLNEWL-LENNFCL
        + Y         SP +   F +P+  P++  HS +       + S+K     +   +  NQ  G+    I Q      +DL   KK L+  + L N   L
Subjt:  LGYL--------SPISDPAFSSPEDPPSSNQHSKDSSPSTIAQDSLKTLLQ-VDMNMDINQRPGSDEINIGQEQNEKRNDL-MYKKQLNEWL-LENNFCL

Query:  VPTKCLSEDVN------VTVNNHDVMGPDNIVILTETKISSISKKL-------------------------------------IKSLWSSISINWSA---
         P+K   +  +      + + N +V+     + +T+ + S+ S ++                                     +K     +SI+  +   
Subjt:  VPTKCLSEDVN------VTVNNHDVMGPDNIVILTETKISSISKKL-------------------------------------IKSLWSSISINWSA---

Query:  -------LDSIGKS-GGILILWDELNVSAIETLKGIFSLSINFRLADGFCWWLSGIYGPASRRNRKNFWTELHDLGGLCNDCWLLGGDFNVFRWNHETSS
                  +G S GGILILWD  + S +   +G FSLS NF   +   WWL+G+YGP  RR R N W +LH+L  L +  W++GGD NV R   E+++
Subjt:  -------LDSIGKS-GGILILWDELNVSAIETLKGIFSLSINFRLADGFCWWLSGIYGPASRRNRKNFWTELHDLGGLCNDCWLLGGDFNVFRWNHETSS

Query:  PRPAKKSMRRFNSSLM--------------------------------------------IFRLLTRPSLMAITPVLEN--PEFRWGPCPFRFDNYLLHE
           +  S    N  +                                             I R L RP+      V E+     RWGP PFR ++  L++
Subjt:  PRPAKKSMRRFNSSLM--------------------------------------------IFRLLTRPSLMAITPVLEN--PEFRWGPCPFRFDNYLLHE

Query:  KDFTKNIKFWWDSTYQEGFPGYSFIRRLKQLVSNIKLWKRKNRDLLKSQKQIISDEIVKIDRLENLGCLTDQMCQKRTSLKSDLHQVSMQEIRMWKQRCK
         +F +N++ WW+ + Q G PG+ FI+RLK L + IK W+++    L S K+ I  E+  ID+ E    L+ +   +R +LK++L+ +S++E + W QR K
Subjt:  KDFTKNIKFWWDSTYQEGFPGYSFIRRLKQLVSNIKLWKRKNRDLLKSQKQIISDEIVKIDRLENLGCLTDQMCQKRTSLKSDLHQVSMQEIRMWKQRCK

Query:  KTWLKEGDENTTFFHKVCTARRRRCFISEMISGNGVSLVTDSQMELEVINHFSNIYKGPT-SEGWMVSNLNWCPISGSEASSLIQPFSELEVFQNLKAMG
        K WLKEGDEN+ FFH++C++R++R  I E+    G    T++ + L  +NHFS IY+  T  +   + NL W PI  S+ S L  PFSE E+   +K+  
Subjt:  KTWLKEGDENTTFFHKVCTARRRRCFISEMISGNGVSLVTDSQMELEVINHFSNIYKGPT-SEGWMVSNLNWCPISGSEASSLIQPFSELEVFQNLKAMG

Query:  HNKSPGPDGFTVEFFRKSW--------------------------IAITQLWKCSMTSIRKDFRPISLTTVLYRLIAKTLAERLKSTLQGTISENQLAFV
         NK+PGPDGF + FF+  W                            I  + K    S  KDFRPISLTT +Y+ IAKTL+ RLK TL  TIS NQLAF+
Subjt:  HNKSPGPDGFTVEFFRKSW--------------------------IAITQLWKCSMTSIRKDFRPISLTTVLYRLIAKTLAERLKSTLQGTISENQLAFV

Query:  KGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRACITSVSYSILLNGKPRGKIKAERGIRQGDPISPFI
        K RQITDAIL+ANEA+D+WK  + +GF++KLDIEKAFD + WNF+D +L    Y  +WRKWIR CI++V+YSI++NGKP+G+IKA RG+RQGDP+S F+
Subjt:  KGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRACITSVSYSILLNGKPRGKIKAERGIRQGDPISPFI

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein3.3e-19035.23Show/hide
Query:  GNKNCIMVPDGYDRSGWNTFLAMLTFKEKKERTVNQMGPPRDQPREHDNRQKSPDSSSSDSSQRSYADVLKNPSEVESGPSGGENDLSGIKGQFNKTGET
        G K  I+VP+G D  GW +FLA++TF+     T       R +P    +      SS SDSS++SYA VL + SE ++              + +  G  
Subjt:  GNKNCIMVPDGYDRSGWNTFLAMLTFKEKKERTVNQMGPPRDQPREHDNRQKSPDSSSSDSSQRSYADVLKNPSEVESGPSGGENDLSGIKGQFNKTGET

Query:  EFSFEWN---KVVILSRRCFHDDWTRIFKKIRELNDNVLSYSPFHADKAILYLRSSDQAQLLCKNR---GWVTIGHFYVKYEKWEPGKHVVPKLIPSYGG
         F+   N   K VI++RRCFHDDW RI   +R+ ++   SY PF ADKAIL+L + D A+LLC N+   GW T+G++ VK+E W+   H    +IPSYGG
Subjt:  EFSFEWN---KVVILSRRCFHDDWTRIFKKIRELNDNVLSYSPFHADKAILYLRSSDQAQLLCKNR---GWVTIGHFYVKYEKWEPGKHVVPKLIPSYGG

Query:  WVKFRGIPLHAWNMKTFTQLGDACGGFIEVSKNTWRKLDLIDVVIKIKENFCGFIPAAICIRDDKGEAYTVRTVAPEKAKWHICRNPKVHGTFMRGAAME
        W++FRGIPLH WN  TF  +G ACGGF++V+K T +   LID  IK++ N+ GF+PA+I I D++GE + V TV P +A+W + RN +VHG+F   AA E
Subjt:  WVKFRGIPLHAWNMKTFTQLGDACGGFIEVSKNTWRKLDLIDVVIKIKENFCGFIPAAICIRDDKGEAYTVRTVAPEKAKWHICRNPKVHGTFMRGAAME

Query:  YDEYDVKSEPFTFRGEAAC-----------TVYELNAVADSLNASCLNNSNNVVRPSKRSLPAATISKDKGK-----------KSAQAPRRIVSVDLGYL
        +D+++  +E +T+ G  A            +++  +  + S +     N+++           +   K+KGK             ++  +RI +  + +L
Subjt:  YDEYDVKSEPFTFRGEAAC-----------TVYELNAVADSLNASCLNNSNNVVRPSKRSLPAATISKDKGK-----------KSAQAPRRIVSVDLGYL

Query:  SP-----------------------ISDP---------------AFSSPEDPPSSNQHSKDSSPSTIAQDSLKTLLQVDMN--------MDINQRPGSDE
        SP                       I+D                 +   +DP  S +  K S   T  + S +  L VDM         +      G D 
Subjt:  SP-----------------------ISDP---------------AFSSPEDPPSSNQHSKDSSPSTIAQDSLKTLLQVDMN--------MDINQRPGSDE

Query:  I-NIGQEQNEKRNDLMYKKQLNEWLLE---NNFCLVPTKCLSEDVNVTVNNH---DVMGPDNIVILTETKISSISKKLIKSLWSS----ISINWSALD--
          N   + N K  D    K L   + E    N     +       +    +    D    + +VI  +     +S K    + SS    + ++   +D  
Subjt:  I-NIGQEQNEKRNDLMYKKQLNEWLLE---NNFCLVPTKCLSEDVNVTVNNH---DVMGPDNIVILTETKISSISKKLIKSLWSS----ISINWSALD--

Query:  ---SIGKSGGILILWDELNVSAIETLKGIFSLSINFRLADGFCWWLSGIYGPASRRNRKNFWTELHDLGGLCNDCWLLGGDFNVFRWNHETSSPRPAKKS
            +G  GGIL+LWD+ N    +   G +S+S+N    +G  WWL+ +YGP    +R   W EL  L  LC   WL+ GDFN+ RW  ET++    K++
Subjt:  ---SIGKSGGILILWDELNVSAIETLKGIFSLSINFRLADGFCWWLSGIYGPASRRNRKNFWTELHDLGGLCNDCWLLGGDFNVFRWNHETSSPRPAKKS

Query:  MRRFNSSLMIFRLLTRPSL--------MAITP------------------------------------VLENPEFRWGPCPFRFDNYLLHEKDFTKNIKF
        M  FN+ + +  L+  P L        + + P                                    +LE+P+ +WGPCPFR +N  L +K+F KN   
Subjt:  MRRFNSSLMIFRLLTRPSL--------MAITP------------------------------------VLENPEFRWGPCPFRFDNYLLHEKDFTKNIKF

Query:  WWDSTYQEGFPGYSFIRRLKQLVSNIKLWKRKNRDLLKSQKQIISDEIVKIDRLENLGCLTDQMCQKRTSLKSDLHQVSMQEIRMWKQRCKKTWLKEGDE
        WW+S+ Q GFPGY+FI+ L  L   IK W+    +L  + K+ +  EI  ID+LE  G ++    QKR SLKSDL  +   + ++W QR ++ W   GDE
Subjt:  WWDSTYQEGFPGYSFIRRLKQLVSNIKLWKRKNRDLLKSQKQIISDEIVKIDRLENLGCLTDQMCQKRTSLKSDLHQVSMQEIRMWKQRCKKTWLKEGDE

Query:  NTTFFHKVCTARRRRCFISEMISGNGVSLVTDSQMELEVINHFSNIYKGPTSEGWMVSNLNWCPISGSEASSLIQPFSELEVFQNLKAMGHNKSPGPDGF
        N ++FH++CT  +R+  I  +    G SL +   +    I+HF NIY   + E  ++ NL+W PIS    S L +PF E E+   + +  + K+PGPDG+
Subjt:  NTTFFHKVCTARRRRCFISEMISGNGVSLVTDSQMELEVINHFSNIYKGPTSEGWMVSNLNWCPISGSEASSLIQPFSELEVFQNLKAMGHNKSPGPDGF

Query:  TVEFFRKSW-----------------------------IAITQLWKCSMTSIRKDFRPISLTTVLYRLIAKTLAERLKSTLQGTISENQLAFVKGRQITD
        T+ F++K W                               I++  KCS  S   D+RPISLTT LY+++AK LA RLKS L  TI+ENQ+AF+KGRQI D
Subjt:  TVEFFRKSW-----------------------------IAITQLWKCSMTSIRKDFRPISLTTVLYRLIAKTLAERLKSTLQGTISENQLAFVKGRQITD

Query:  AILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRACITSVSYSILLNGKPRGKIKAERGIRQGDPISPFI
        AIL+ANEAID WK  + +GFV+KLDIEKAFDKI W+F+D +LA K +   WRKWI+ACI++V YSILLNG P+G+IKAERGIRQGDP+SPFI
Subjt:  AILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRACITSVSYSILLNGKPRGKIKAERGIRQGDPISPFI

A0A5D3BL61 LINE-1 retrotransposable element ORF2 protein2.7e-18434.28Show/hide
Query:  KNCIMVPDGYDRSGWNTFLAMLTFKEKKERTVNQMGPPRDQPREHDNRQKSPDSSSSDSSQRSYAD-VLKNPSEVESGPSGGENDLSGIKGQFNKTGETE
        K+CI+VP+G ++S W +FL+M+T K + +     +  PR  P   + R   P     D  +RSYA  V +  S + S  S         +   N   ++ 
Subjt:  KNCIMVPDGYDRSGWNTFLAMLTFKEKKERTVNQMGPPRDQPREHDNRQKSPDSSSSDSSQRSYAD-VLKNPSEVESGPSGGENDLSGIKGQFNKTGETE

Query:  FSFEWNKVVILSRRCFHDDWTRIFKKIRELNDNVLSYSPFHADKAILYLRSSDQAQLLCKNRGWVTIGHFYVKYEKWEPGKHVVPKLIPSYGGWVKFRGI
        F       V+L RR FHDDW +I + +R+  +   +Y+ FHA+K +++  S+  A LLC+N+GW T+G + V++EKW P  H  PKLIPSYGGW  FRGI
Subjt:  FSFEWNKVVILSRRCFHDDWTRIFKKIRELNDNVLSYSPFHADKAILYLRSSDQAQLLCKNRGWVTIGHFYVKYEKWEPGKHVVPKLIPSYGGWVKFRGI

Query:  PLHAWNMKTFTQLGDACGGFIEVSKNTWRKLDLIDVVIKIKENFCGFIPAAICIRDDKGEAYTVRTVAPEKAKWHICRNPKVHGTFMRGAAMEYDEYDVK
        PLH WNM TF Q+G ACGG I+V++ T    +LI+  +KI+ N+ GF+PA + I D +G  + V+ V   + KW + RN ++HGTF R AA  +D+++  
Subjt:  PLHAWNMKTFTQLGDACGGFIEVSKNTWRKLDLIDVVIKIKENFCGFIPAAICIRDDKGEAYTVRTVAPEKAKWHICRNPKVHGTFMRGAAMEYDEYDVK

Query:  SEPFTFRGEAACTVYELNAVADSLNASCLNN----SNNVVRPSKRSLPAATI-----------------------------SKDKGKKSAQAPRRIVSVD
        SE F F G  A +   LN ++ S  +          + +++P+K +    T+                             S DKGK+    P ++ S  
Subjt:  SEPFTFRGEAACTVYELNAVADSLNASCLNN----SNNVVRPSKRSLPAATI-----------------------------SKDKGKKSAQAPRRIVSVD

Query:  LGYL--------SPISDPAFSSPEDPPSSNQHSKDSSPSTIAQDSLKTLLQ-VDMNMDINQRPGSDEINIGQEQNEKRNDL-MYKKQLNEWL-LENNFCL
        + Y         SP +   F +P+  P++  HS +       + S+K     +   +  NQ  G+    I Q      +DL   KK L+  + L N   L
Subjt:  LGYL--------SPISDPAFSSPEDPPSSNQHSKDSSPSTIAQDSLKTLLQ-VDMNMDINQRPGSDEINIGQEQNEKRNDL-MYKKQLNEWL-LENNFCL

Query:  VPTKCLSEDVN------VTVNNHDVMGPDNIVILTETKISSISKKL-------------------------------------IKSLWSSISINWSA---
         P+K   +  +      + + N +V+     + +T+ + S+ S ++                                     +K     +SI+  +   
Subjt:  VPTKCLSEDVN------VTVNNHDVMGPDNIVILTETKISSISKKL-------------------------------------IKSLWSSISINWSA---

Query:  -------LDSIGKS-GGILILWDELNVSAIETLKGIFSLSINFRLADGFCWWLSGIYGPASRRNRKNFWTELHDLGGLCNDCWLLGGDFNVFRWNHETSS
                  +G S GGILILWD  + S +   +G FSLS NF   +   WWL+G+YGP  RR R N W +LH+L  L +  W++GGD NV R   E+++
Subjt:  -------LDSIGKS-GGILILWDELNVSAIETLKGIFSLSINFRLADGFCWWLSGIYGPASRRNRKNFWTELHDLGGLCNDCWLLGGDFNVFRWNHETSS

Query:  PRPAKKSMRRFNSSLM--------------------------------------------IFRLLTRPSLMAITPVLEN--PEFRWGPCPFRFDNYLLHE
           +  S    N  +                                             I R L RP+      V E+     RWGP PFR ++  L++
Subjt:  PRPAKKSMRRFNSSLM--------------------------------------------IFRLLTRPSLMAITPVLEN--PEFRWGPCPFRFDNYLLHE

Query:  KDFTKNIKFWWDSTYQEGFPGYSFIRRLKQLVSNIKLWKRKNRDLLKSQKQIISDEIVKIDRLENLGCLTDQMCQKRTSLKSDLHQVSMQEIRMWKQRCK
         +F +N++ WW+ + Q G PG+ FI+RLK L + IK W+++    L S K+ I  E+  ID+ E    L+ +   +R +LK++L+ +S++E + W QR K
Subjt:  KDFTKNIKFWWDSTYQEGFPGYSFIRRLKQLVSNIKLWKRKNRDLLKSQKQIISDEIVKIDRLENLGCLTDQMCQKRTSLKSDLHQVSMQEIRMWKQRCK

Query:  KTWLKEGDENTTFFHKVCTARRRRCFISEMISGNGVSLVTDSQMELEVINHFSNIYKGPT-SEGWMVSNLNWCPISGSEASSLIQPFSELEVFQNLKAMG
        K WLKEGDEN+ FFH++C++R++R  I E+    G    T++ + L  +NHFS IY+  T  +   + NL W PI  S+ S L  PFSE E+   +K+  
Subjt:  KTWLKEGDENTTFFHKVCTARRRRCFISEMISGNGVSLVTDSQMELEVINHFSNIYKGPT-SEGWMVSNLNWCPISGSEASSLIQPFSELEVFQNLKAMG

Query:  HNKSPGPDGFTVEFFRKSW--------------------------IAITQLWKCSMTSIRKDFRPISLTTVLYRLIAKTLAERLKSTLQGTISENQLAFV
         NK+PGPDGF + FF+  W                            I  + K    S  KDFRPISLTT +Y+ IAKTL+ RLK TL  TIS NQLAF+
Subjt:  HNKSPGPDGFTVEFFRKSW--------------------------IAITQLWKCSMTSIRKDFRPISLTTVLYRLIAKTLAERLKSTLQGTISENQLAFV

Query:  KGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRACITSVSYSILLNGKPRGKIKAERGIRQGDPISPFI
        K RQITDAIL+ANEA+D+WK  + +GF++KLDIEKAFD + WNF+D +L    Y  +WRKWIR CI++V+YSI++NGKP+G+IKA RG+RQGDP+S F+
Subjt:  KGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRACITSVSYSILLNGKPRGKIKAERGIRQGDPISPFI

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein7.9e-18433.88Show/hide
Query:  KNCIMVPDGYDRSGWNTFLAMLTFKEKKERTVNQMGPPRDQPREHDNRQKSPDSSSSDSSQRSYADVLKNPSEVESGPSGGENDLS-GIKGQFNKTGETE
        K+CI+VP+G D+SGW +FL+M+T K + +        PR  P   D R   P     D  +RSYA  +       +  S    D S       N   ++ 
Subjt:  KNCIMVPDGYDRSGWNTFLAMLTFKEKKERTVNQMGPPRDQPREHDNRQKSPDSSSSDSSQRSYADVLKNPSEVESGPSGGENDLS-GIKGQFNKTGETE

Query:  FSFEWNKVVILSRRCFHDDWTRIFKKIRELNDNVLSYSPFHADKAILYLRSSDQAQLLCKNRGWVTIGHFYVKYEKWEPGKHVVPKLIPSYGGWVKFRGI
         S      V++ RR FHDDW +I + +R+  +   +Y+ FHA+KA+++  S+  A LLC+N+GW T+G + V++EKW P  H  PKLIPSYGGW  FRGI
Subjt:  FSFEWNKVVILSRRCFHDDWTRIFKKIRELNDNVLSYSPFHADKAILYLRSSDQAQLLCKNRGWVTIGHFYVKYEKWEPGKHVVPKLIPSYGGWVKFRGI

Query:  PLHAWNMKTFTQLGDACGGFIEVSKNTWRKLDLIDVVIKIKENFCGFIPAAICIRDDKGEAYTVRTVAPEKAKWHICRNPKVHGTFMRGAAMEYDEYDVK
        PLH WNM TF Q+G AC G I+V++ T    +LI+  IK++ N+ GF+PA + I D++G  + V+ V   + KW I RN ++HGTF R AA  +D+++ +
Subjt:  PLHAWNMKTFTQLGDACGGFIEVSKNTWRKLDLIDVVIKIKENFCGFIPAAICIRDDKGEAYTVRTVAPEKAKWHICRNPKVHGTFMRGAAMEYDEYDVK

Query:  SEPFTFRGEAACTVYELNAVADS----------------------------LNASCLNNSNNVVRPSKRSLPAAT-ISK----DKGKKSAQ---APRRIV
        SE F F G  A +   L+  +D                             LN   +N+SN     +K  L   + IS     DKGK+       P   +
Subjt:  SEPFTFRGEAACTVYELNAVADS----------------------------LNASCLNNSNNVVRPSKRSLPAAT-ISK----DKGKKSAQ---APRRIV

Query:  SVD-----LGYLSPISDPAFSSPEDPPSSNQHS----------------KDSSPST--------------------IAQD------------SLKTLLQV
        ++D     + + SP +     +P+  P+++  S                K  S ST                    +A D             L  L  +
Subjt:  SVD-----LGYLSPISDPAFSSPEDPPSSNQHS----------------KDSSPST--------------------IAQD------------SLKTLLQV

Query:  DMNMDINQRPGSDEINI---------------------------------------------GQEQNEKRNDL-MYKKQLNEWLLENNFCLVPTKCLSED
        D N  +     SD   +                                              +E+ EK  D   +KKQL  WL +N   L      ++ 
Subjt:  DMNMDINQRPGSDEINI---------------------------------------------GQEQNEKRNDL-MYKKQLNEWLLENNFCLVPTKCLSED

Query:  VNVTVNNHDVMGPDNIVILTETKISSISKKLIKSLWSSISINWSALDSIGKSGGILILWDELNVSAIETLKGIFSLSINFRLADGFCWWLSGIYGPASRR
           T + + ++   N      + +   +K++IKSLW S SINW A ++ G SGGILILWD  N S +   +G+FSLS NF L +   WWL+G+YGP  RR
Subjt:  VNVTVNNHDVMGPDNIVILTETKISSISKKLIKSLWSSISINWSALDSIGKSGGILILWDELNVSAIETLKGIFSLSINFRLADGFCWWLSGIYGPASRR

Query:  NRKNFWTELHDLGGLCNDCWLLGGDFNVFRWNHETSSPRPAKKSMRRFN----SSLMIFRLLTR-----------PSLMAITPVL---------------
         R +FW ELH+L  L +  W+LGGD NV R   E++S   +  + R  N    ++L+I   LT            P+   I   L               
Subjt:  NRKNFWTELHDLGGLCNDCWLLGGDFNVFRWNHETSSPRPAKKSMRRFN----SSLMIFRLLTR-----------PSLMAITPVL---------------

Query:  ----------------ENPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWDSTYQEGFPGYSFIRRLKQLVSNIKLWKRKNRDLLKSQKQIISDEIVKIDRL
                         NP+  WGP PFR ++  L + +F +N+  WW+++ Q G+PG+SFI+RLK L + IK W+++    L   K+ I  E+  ID+ 
Subjt:  ----------------ENPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWDSTYQEGFPGYSFIRRLKQLVSNIKLWKRKNRDLLKSQKQIISDEIVKIDRL

Query:  ENLGCLTDQMCQKRTSLKSDLHQVSMQEIRMWKQRCKKTWLKEGDENTTFFHKVCTARRRRCFISEMISGNGVSLVTDSQMELEVINHFSNIYKGPT-SE
        E    LT +   +R +LK+DL ++S++E + W QR KK WL+EGDEN++FFH++C++R++R FI E+    G    T++ +    I  FS IY+  T S+
Subjt:  ENLGCLTDQMCQKRTSLKSDLHQVSMQEIRMWKQRCKKTWLKEGDENTTFFHKVCTARRRRCFISEMISGNGVSLVTDSQMELEVINHFSNIYKGPT-SE

Query:  GWMVSNLNWCPISGSEASSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFRKSWIAITQLWKCSMTSIRKDFRPISLTTVLYRLIAKTLAERLKSTLQ
           + NL+W PI+ SE S L  PF E E+   + +    K+PGPDGF + FF+  W                                      LK+TL 
Subjt:  GWMVSNLNWCPISGSEASSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFRKSWIAITQLWKCSMTSIRKDFRPISLTTVLYRLIAKTLAERLKSTLQ

Query:  GTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRACITSVSYSILLNGKPRGKIKAERGI
         TIS NQLAFVK RQITDAIL+ANEA+D+WK  + +GF++KLDIEKAFD +  +F+D +L  K +   WRKWIR CI++V+YS+++NG+P+G+IKA RG+
Subjt:  GTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRACITSVSYSILLNGKPRGKIKAERGI

Query:  RQGDPISPFI
        RQGDP+SPF+
Subjt:  RQGDPISPFI

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein9.7e-19035.15Show/hide
Query:  GNKNCIMVPDGYDRSGWNTFLAMLTFKEKKERTVNQMGPPRDQPREHDNRQKSPDSSSSDSSQRSYADVLKNPSEVESGPSGGENDLSGIKGQFNKTGET
        G K  I+VP+G D  GW +FLA++TF+     T       R +P    +      SS SDSS++SYA VL + SE ++              + +  G  
Subjt:  GNKNCIMVPDGYDRSGWNTFLAMLTFKEKKERTVNQMGPPRDQPREHDNRQKSPDSSSSDSSQRSYADVLKNPSEVESGPSGGENDLSGIKGQFNKTGET

Query:  EFSFEWN---KVVILSRRCFHDDWTRIFKKIRELNDNVLSYSPFHADKAILYLRSSDQAQLLCKNR---GWVTIGHFYVKYEKWEPGKHVVPKLIPSYGG
         F+   N   K VI++RRCFHDDW RI   +R+ ++   SY PF ADKAIL+L + D A+LLC N+   GW T+G++ VK+E W+   H    +IPSYGG
Subjt:  EFSFEWN---KVVILSRRCFHDDWTRIFKKIRELNDNVLSYSPFHADKAILYLRSSDQAQLLCKNR---GWVTIGHFYVKYEKWEPGKHVVPKLIPSYGG

Query:  WVKFRGIPLHAWNMKTFTQLGDACGGFIEVSKNTWRKLDLIDVVIKIKENFCGFIPAAICIRDDKGEAYTVRTVAPEKAKWHICRNPKVHGTFMRGAAME
        W++FRGIPLH WN  TF  +G ACGGF++V+K T +   LID  IK++ N+ GF+PA+I I D++GE + V TV P +A+W + RN +VHG+F   AA E
Subjt:  WVKFRGIPLHAWNMKTFTQLGDACGGFIEVSKNTWRKLDLIDVVIKIKENFCGFIPAAICIRDDKGEAYTVRTVAPEKAKWHICRNPKVHGTFMRGAAME

Query:  YDEYDVKSEPFTFRGEAAC-----------TVYELNAVADSLNASCLNNSNNVVRPSKRSLPAATISKDKGK-----------KSAQAPRRIVSVDLGYL
        +D+++  +E +T+ G  A            +++  +  + S +     N+++           +   K+KGK             ++  +RI +  + +L
Subjt:  YDEYDVKSEPFTFRGEAAC-----------TVYELNAVADSLNASCLNNSNNVVRPSKRSLPAATISKDKGK-----------KSAQAPRRIVSVDLGYL

Query:  SP-----------------------ISDP---------------AFSSPEDPPSSNQHSKDSSPSTIAQDSLKTLLQVDMN--------MDINQRPGSDE
        SP                       I+D                 +   +DP  S +  K S   T  + S +  L VDM         +      G D 
Subjt:  SP-----------------------ISDP---------------AFSSPEDPPSSNQHSKDSSPSTIAQDSLKTLLQVDMN--------MDINQRPGSDE

Query:  I-NIGQEQNEKRNDLMYKKQLNEWLLE---NNFCLVPTKCLSEDVNVTVNNH---DVMGPDNIVILTETKISSISKKLIKSLWSS----ISINWSALD--
          N   + N K  D    K L   + E    N     +       +    +    D    + +VI  +     +S K    + SS    + ++   +D  
Subjt:  I-NIGQEQNEKRNDLMYKKQLNEWLLE---NNFCLVPTKCLSEDVNVTVNNH---DVMGPDNIVILTETKISSISKKLIKSLWSS----ISINWSALD--

Query:  ---SIGKSGGILILWDELNVSAIETLKGIFSLSINFRLADGFCWWLSGIYGPASRRNRKNFWTELHDLGGLCNDCWLLGGDFNVFRWNHETSSPRPAKKS
            +G  GGIL+LWD+ N    +   G +S+S+N    +G  WWL+ +YGP    +R   W EL  L  LC   WL+ GDFN+ RW  ET++    K++
Subjt:  ---SIGKSGGILILWDELNVSAIETLKGIFSLSINFRLADGFCWWLSGIYGPASRRNRKNFWTELHDLGGLCNDCWLLGGDFNVFRWNHETSSPRPAKKS

Query:  MRRFNSSLMIFRLLTRPSL--------MAITP------------------------------------VLENPEFRWGPCPFRFDNYLLHEKDFTKNIKF
        M  FN+ + +  L+  P L        + + P                                    +LE+P+ +WGPCPFR +N  L +K+F KN   
Subjt:  MRRFNSSLMIFRLLTRPSL--------MAITP------------------------------------VLENPEFRWGPCPFRFDNYLLHEKDFTKNIKF

Query:  WWDSTYQEGFPGYSFIRRLKQLVSNIKLWKRKNRDLLKSQKQIISDEIVKIDRLENLGCLTDQMCQKRTSLKSDLHQVSMQEIRMWKQRCKKTWLKEGDE
        WW+S+ Q GFPGY+FI+ L  L   IK W+    +L  + K+ +  EI  ID+LE  G ++    QKR SLKSDL  +   + ++W QR ++ W   GDE
Subjt:  WWDSTYQEGFPGYSFIRRLKQLVSNIKLWKRKNRDLLKSQKQIISDEIVKIDRLENLGCLTDQMCQKRTSLKSDLHQVSMQEIRMWKQRCKKTWLKEGDE

Query:  NTTFFHKVCTARRRRCFISEMISGNGVSLVTDSQMELEVINHFSNIYKGPTSEGWMVSNLNWCPISGSEASSLIQPFSELEVFQNLKAMGHNKSPGPDGF
        N ++FH++CT  +R+  I  +    G SL +   +    I+HF NIY   + E  ++ NL+W PIS    S L +PF E E+   + +  + K+PGPDG+
Subjt:  NTTFFHKVCTARRRRCFISEMISGNGVSLVTDSQMELEVINHFSNIYKGPTSEGWMVSNLNWCPISGSEASSLIQPFSELEVFQNLKAMGHNKSPGPDGF

Query:  TVEFFRKSW-----------------------------IAITQLWKCSMTSIRKDFRPISLTTVLYRLIAKTLAERLKSTLQGTISENQLAFVKGRQITD
        T+ F++K W                               I++  KCS  S   D+RPISLTT LY+++AK LA RLKS L  TI+ENQ+AF+KGRQI D
Subjt:  TVEFFRKSW-----------------------------IAITQLWKCSMTSIRKDFRPISLTTVLYRLIAKTLAERLKSTLQGTISENQLAFVKGRQITD

Query:  AILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRACITSVSYSILLNGKPRGKIKAERGIRQGDPISPFI
        AIL+ANE ID WK  + +GFV+KLDIEKAFDKI W+F+D +LA K +   WRKWI+ACI++V YSILLNG P+G+IKAERGIRQGDP+SPFI
Subjt:  AILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRACITSVSYSILLNGKPRGKIKAERGIRQGDPISPFI

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.6e-1922.47Show/hide
Query:  DSTYQEGFPGYSFIRRLKQLVSNIKLWKRKNRDLLKSQKQIISDEIVKIDRLENLGCLTDQMCQKRTSLKSDLHQVSMQEIRMWKQRCKKTWLKEGDENT
        D+TYQ  +  +  + R K +  N   +KRK     +S+   ++ ++ ++++ E       +  Q+ T ++++L ++  Q          KT  K  +  +
Subjt:  DSTYQEGFPGYSFIRRLKQLVSNIKLWKRKNRDLLKSQKQIISDEIVKIDRLENLGCLTDQMCQKRTSLKSDLHQVSMQEIRMWKQRCKKTWLKEGDENT

Query:  TFFHKV----------CTARRRRCFISEMISGNGVSLVTDSQMELEVINHFSNIYKGPTSE----GWMVSNLNWCPISGSEASSLIQPFSELEVFQNLKA
         FF ++             +R +  I  + +  G      ++++  +  ++ ++Y             +       ++  E  SL +P +  E+   + +
Subjt:  TFFHKV----------CTARRRRCFISEMISGNGVSLVTDSQMELEVINHFSNIYKGPTSE----GWMVSNLNWCPISGSEASSLIQPFSELEVFQNLKA

Query:  MGHNKSPGPDGFTVEFFR------------------KSWIAITQLWKCSM---------TSIRKDFRPISLTTVLYRLIAKTLAERLKSTLQGTISENQL
        +   KSPGPDGFT EF++                  K  I     ++ S+         T+ +++FRPISL  +  +++ K LA R++  ++  I  +Q+
Subjt:  MGHNKSPGPDGFTVEFFR------------------KSWIAITQLWKCSM---------TSIRKDFRPISLTTVLYRLIAKTLAERLKSTLQGTISENQL

Query:  AFVKGRQITDAILVANEAIDFWKCSRTRGFV-IKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRACITSVSYSILLNGKPRGKIKAERGIRQGDPIS
         F+ G Q    I  +   I     ++ +  V I +D EKAFDKI   F+ K L   G    + K IRA     + +I+LNG+       + G RQG P+S
Subjt:  AFVKGRQITDAILVANEAIDFWKCSRTRGFV-IKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRACITSVSYSILLNGKPRGKIKAERGIRQGDPIS

Query:  PFILS
        P + +
Subjt:  PFILS

P08548 LINE-1 reverse transcriptase homolog2.5e-1723.03Show/hide
Query:  TSLKSDLHQVSMQEIRMWKQRCKKTWLKEGDENTTFFHKVCTARRRRCFISEMISGNGVSLVTDSQMELEVINH-----FSNIYKGPTSEGWMVSNLNWC
        T ++++L+++  + I     + K  + ++ ++       +   +R +  IS + +GN   + TD     +++N      +S+ Y+        +   +  
Subjt:  TSLKSDLHQVSMQEIRMWKQRCKKTWLKEGDENTTFFHKVCTARRRRCFISEMISGNGVSLVTDSQMELEVINH-----FSNIYKGPTSEGWMVSNLNWC

Query:  PISGSEASSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFR------------------KSWIAITQLWKCSMTSI---------RKDFRPISLTTVL
         +S  E   L +P S  E+   ++ +   KSPGPDGFT EF++                  K  I     ++ ++T I         ++++RPISL  + 
Subjt:  PISGSEASSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFR------------------KSWIAITQLWKCSMTSI---------RKDFRPISLTTVL

Query:  YRLIAKTLAERLKSTLQGTISENQLAFVKGRQITDAILVANEAID-FWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRACITSVSY
         +++ K L  R++  ++  I  +Q+ F+ G Q    I  +   I    K       ++ +D EKAFD I   F+ + L   G   T+ K I A  +  + 
Subjt:  YRLIAKTLAERLKSTLQGTISENQLAFVKGRQITDAILVANEAID-FWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRACITSVSY

Query:  SILLNGKPRGKIKAERGIRQGDPISPFILS
        +I+LNG          G RQG P+SP + +
Subjt:  SILLNGKPRGKIKAERGIRQGDPISPFILS

P11369 LINE-1 retrotransposable element ORF2 protein2.1e-1628.38Show/hide
Query:  ISGSEASSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEF---FRKSWIAI---------------TQLWKCSMTSIRK---------DFRPISLTTVLY
        ++  +   L  P S  E+   + ++   KSPGPDGF+ EF   F++  I I                  ++ ++T I K         +FRPISL  +  
Subjt:  ISGSEASSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEF---FRKSWIAI---------------TQLWKCSMTSIRK---------DFRPISLTTVLY

Query:  RLIAKTLAERLKSTLQGTISENQLAFVKGRQITDAILVANEAIDFW-KCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRACITSVSYS
        +++ K LA R++  ++  I  +Q+ F+ G Q    I  +   I +  K       +I LD EKAFDKI   F+ K+L   G    +   I+A  +    +
Subjt:  RLIAKTLAERLKSTLQGTISENQLAFVKGRQITDAILVANEAIDFW-KCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRACITSVSYS

Query:  ILLNGKPRGKIKAERGIRQGDPISPFILS
        I +NG+    I  + G RQG P+SP++ +
Subjt:  ILLNGKPRGKIKAERGIRQGDPISPFILS

P14381 Transposon TX1 uncharacterized 149 kDa protein1.0e-1822.45Show/hide
Query:  MAITPVLENPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWDS--TYQEGFPGYSFIRRLKQLVSNIKLWKRKNRDLLKSQK----QIISDEIVKIDRLENL
        M+I P L    +      + F+N LL ++ F K+++  W     +Q+ F   +    + ++  ++KL  ++    +  Q+    + ++ E++ ++  + L
Subjt:  MAITPVLENPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWDS--TYQEGFPGYSFIRRLKQLVSNIKLWKRKNRDLLKSQK----QIISDEIVKIDRLENL

Query:  GCLTDQMCQ-KRTSLKSDLHQVSMQEIRMWKQRCKKTWLKEGDENTTFFHKVCTARRRRCFISEMISGNGVSLVTDSQMELEVINHFSNIYK-GPTSEGW
            DQ  Q +    K  L  +  ++ R    R +   L + D  + FF+ +   +  R  I+ + + +G  L     +     + + N++   P S   
Subjt:  GCLTDQMCQ-KRTSLKSDLHQVSMQEIRMWKQRCKKTWLKEGDENTTFFHKVCTARRRRCFISEMISGNGVSLVTDSQMELEVINHFSNIYK-GPTSEGW

Query:  MVSNLNWCP-ISGSEASSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFRKSW--------------------------IAITQLWKCSMTSIRKDFR
             +  P +S      L  P +  E+ Q L+ M HNKSPG DG T+EFF+  W                            ++ L K     + K++R
Subjt:  MVSNLNWCP-ISGSEASSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFRKSW--------------------------IAITQLWKCSMTSIRKDFR

Query:  PISLTTVLYRLIAKTLAERLKSTLQGTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRA
        P+SL +  Y+++AK ++ RLKS L   I  +Q   V GR I D + +  + + F + +      + LD EKAFD++   ++   L    +   +  +++ 
Subjt:  PISLTTVLYRLIAKTLAERLKSTLQGTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRA

Query:  CITSVSYSILLNGKPRGKIKAERGIRQGDPIS
           S    + +N      +   RG+RQG P+S
Subjt:  CITSVSYSILLNGKPRGKIKAERGIRQGDPIS

Q03274 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)2.8e-0825.26Show/hide
Query:  SPGPDGFTVEFFRKSWIA------------ITQLWKCSMTS-IRKD--------FRPISLTTVLYRLIAKTLAERLKSTLQGTISENQLAFVKGRQITDA
        +PG DG TV+   ++ +             +   W    T+ I KD        +RPI++ + L RL+ + LA+RL++ ++   ++   A + G  +   
Subjt:  SPGPDGFTVEFFRKSWIA------------ITQLWKCSMTS-IRKD--------FRPISLTTVLYRLIAKTLAERLKSTLQGTISENQLAFVKGRQITDA

Query:  ILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRACITSVSYSILLN-GKPRGKIKAERGIRQGDPISPFILS
        +L  +  I   +  R    V+ LD+ KAFD +  + + + L   G       +I   ++  + +I +  G    KI   RG++QGDP+SPF+ +
Subjt:  ILVANEAIDFWKCSRTRGFVIKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRACITSVSYSILLN-GKPRGKIKAERGIRQGDPISPFILS

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein5.9e-1423.01Show/hide
Query:  FNSSLMIFRLLTRPSLMAITPVLENPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWDSTYQEGFPGYSFIRRLKQLVSNIKLWKRKNRDLL--KSQKQIIS
        F S++ +F L           +LEN   R   C FR+ ++L     F  ++   W+     G   +S    LK      KL  R+    +  K+++ + S
Subjt:  FNSSLMIFRLLTRPSLMAITPVLENPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWDSTYQEGFPGYSFIRRLKQLVSNIKLWKRKNRDLL--KSQKQIIS

Query:  DEIVKIDRLENLGCLTDQMCQKRTSLKSDLHQVSMQEIRMWKQRCKKTWLKEGDENTTFFHKVCTARRRRCFISEMISGNGVSLVTDSQMELEVINHFSN
         E ++   L N    +D + +     +   +  +      ++Q+ +  WL++GD NT FFHKV  A + +  I  +   + V +   +Q++  ++ ++++
Subjt:  DEIVKIDRLENLGCLTDQMCQKRTSLKSDLHQVSMQEIRMWKQRCKKTWLKEGDENTTFFHKVCTARRRRCFISEMISGNGVSLVTDSQMELEVINHFSN

Query:  IYKGPT-----SEGWMVSNLNWCPISGSEASSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFRKSWI--------------------------AITQ
        +    +          + +++    + + AS L    S+ E+   + AM  NK+PGPD FT EFF +SW                           AIT 
Subjt:  IYKGPT-----SEGWMVSNLNWCPISGSEASSLIQPFSELEVFQNLKAMGHNKSPGPDGFTVEFFRKSWI--------------------------AITQ

Query:  LWKCSMTSIRKDFRPISLTTVLYRLI
        + K +       FRP+S  TV+Y++I
Subjt:  LWKCSMTSIRKDFRPISLTTVLYRLI

AT4G20520.1 RNA binding;RNA-directed DNA polymerases3.5e-0633.73Show/hide
Query:  LAERLKSTLQGTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRG----FVIKLDIEKAFDKICWNFVDKILAFKGYLITW
        + ERLK  +   I   Q +F+ GR  TD I+   EA+   +  R +G     ++KLD+EKA+D+I W++++  L   G+   W
Subjt:  LAERLKSTLQGTISENQLAFVKGRQITDAILVANEAIDFWKCSRTRG----FVIKLDIEKAFDKICWNFVDKILAFKGYLITW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATTCTTCTCACGGAGATGGGCTCGTACAGATCTTTTTCAATCAAGATGCATGCAGAGATGATCGACTAGCTAAGGGTAACATTTCAGACTCTCCTAGCTTCCCCAC
GGGAAACAAAAACTGCATTATGGTCCCAGACGGCTATGACCGAAGTGGATGGAATACTTTTTTGGCTATGCTCACTTTTAAGGAAAAAAAGGAGAGAACAGTCAATCAGA
TGGGTCCCCCGAGGGATCAACCCCGGGAACACGACAACAGGCAAAAAAGTCCAGATTCCTCTTCTAGCGATTCCTCTCAAAGATCGTACGCTGATGTGTTAAAAAATCCA
TCTGAAGTCGAATCGGGCCCCTCTGGTGGGGAAAACGACCTAAGTGGAATCAAAGGTCAATTTAACAAGACAGGGGAGACAGAGTTTAGCTTTGAATGGAATAAAGTGGT
TATTCTAAGCCGTAGATGTTTCCACGACGATTGGACAAGAATATTCAAGAAGATCCGAGAGCTTAACGACAATGTATTGTCTTATAGCCCATTTCACGCCGACAAAGCAA
TCTTATACCTCAGATCCTCAGACCAAGCGCAGTTGTTGTGTAAAAATAGAGGATGGGTCACAATAGGGCATTTTTATGTCAAGTATGAAAAATGGGAGCCGGGAAAACAT
GTCGTTCCAAAGCTTATTCCAAGTTATGGGGGATGGGTCAAGTTTCGTGGTATCCCTTTGCATGCGTGGAATATGAAGACCTTCACCCAACTCGGCGACGCCTGTGGAGG
TTTCATCGAAGTGTCCAAGAACACTTGGAGGAAGCTTGATCTGATTGATGTCGTTATAAAAATAAAAGAAAACTTCTGTGGTTTTATTCCGGCCGCGATATGCATCAGAG
ACGATAAAGGAGAAGCATATACGGTGAGAACAGTGGCACCGGAGAAGGCAAAATGGCATATCTGCAGAAACCCTAAGGTCCATGGGACGTTCATGCGCGGGGCAGCAATG
GAATACGATGAGTACGACGTCAAATCTGAGCCGTTCACATTCAGAGGGGAAGCAGCCTGCACAGTGTATGAATTAAATGCGGTTGCAGATTCCTTAAATGCCAGCTGCCT
CAATAATTCAAATAACGTGGTGAGGCCTTCTAAACGGTCTCTTCCAGCTGCCACGATTTCAAAGGATAAGGGGAAAAAATCTGCACAAGCTCCGAGGAGGATAGTAAGTG
TGGACTTGGGTTATTTGTCCCCAATATCCGACCCAGCTTTCTCAAGCCCGGAGGACCCTCCTTCGTCGAATCAACACTCAAAGGATAGTTCCCCTTCAACGATTGCTCAG
GACAGTTTGAAGACATTGCTTCAAGTGGACATGAACATGGATATCAATCAGAGGCCAGGCAGTGATGAGATCAATATAGGACAGGAGCAGAACGAGAAAAGAAATGACTT
GATGTATAAAAAACAACTAAATGAATGGCTCCTTGAGAACAACTTCTGCCTTGTTCCTACTAAATGTTTGTCGGAAGATGTTAATGTTACTGTTAACAACCATGATGTTA
TGGGGCCAGATAATATTGTTATTTTAACAGAAACAAAGATTTCCTCCATCTCTAAGAAGCTTATTAAATCACTATGGAGCTCAATCAGCATTAATTGGTCAGCGTTGGAT
TCTATTGGCAAATCTGGTGGCATTTTAATCCTTTGGGACGAATTGAATGTCTCTGCTATAGAAACTCTCAAAGGCATTTTCTCTCTCTCAATTAATTTTCGATTGGCAGA
TGGTTTTTGTTGGTGGCTTTCGGGTATTTATGGTCCAGCTAGTCGTCGAAACAGGAAGAATTTCTGGACTGAGCTTCATGATCTTGGAGGATTGTGCAATGATTGTTGGC
TATTAGGTGGAGACTTTAATGTCTTTCGGTGGAATCATGAAACTTCCTCTCCTCGTCCGGCTAAGAAGAGTATGAGGCGCTTCAATTCTTCATTAATGATCTTTCGCTTA
TTGACCCGCCCCTCCTTAATGGCAATTACACCTGTCCTTGAAAATCCAGAATTCAGGTGGGGCCCATGTCCCTTCCGTTTTGATAACTACCTTCTGCATGAGAAAGATTT
CACCAAGAATATTAAATTCTGGTGGGACTCAACATATCAAGAAGGCTTCCCTGGTTACTCATTTATTAGAAGATTAAAGCAGCTCGTTAGCAACATCAAGCTTTGGAAAA
GGAAAAACAGAGATCTCCTCAAATCTCAGAAACAGATCATATCGGATGAAATTGTGAAGATAGACCGCTTGGAGAATTTGGGTTGTCTAACCGATCAAATGTGCCAGAAA
AGAACCTCTCTTAAATCTGATCTTCATCAAGTGTCTATGCAAGAGATCAGGATGTGGAAACAAAGATGCAAGAAAACCTGGCTTAAAGAAGGAGATGAAAATACCACCTT
TTTCCATAAAGTCTGTACTGCTCGTCGGCGACGATGCTTCATTTCAGAGATGATTTCCGGAAATGGGGTAAGCCTGGTTACAGATTCTCAAATGGAGTTGGAGGTGATCA
ATCATTTTTCTAATATTTATAAGGGCCCCACTTCTGAAGGATGGATGGTCTCAAATTTAAATTGGTGCCCTATTTCTGGGTCTGAGGCGTCATCTTTGATTCAACCTTTT
TCTGAGCTAGAAGTATTTCAGAATCTAAAAGCGATGGGTCATAATAAGTCCCCTGGCCCGGATGGATTCACTGTTGAATTCTTTAGAAAGTCCTGGATCGCTATCACTCA
GTTATGGAAGTGTTCCATGACTTCTATCAGAAAGGACTTTAGACCCATTAGTCTCACCACAGTTCTTTATCGCCTTATCGCTAAGACTCTGGCAGAAAGGCTTAAAAGTA
CTCTCCAAGGAACAATATCTGAGAATCAACTAGCATTTGTAAAGGGTCGTCAAATTACTGATGCTATTTTGGTGGCTAATGAAGCTATTGATTTCTGGAAATGTTCTCGC
ACGAGAGGATTCGTTATAAAGCTAGATATTGAAAAGGCTTTTGACAAGATCTGTTGGAACTTCGTTGATAAGATTCTCGCTTTCAAGGGATACCTGATTACCTGGCGTAA
GTGGATCCGGGCTTGCATCACCTCAGTTTCATACTCCATATTGCTTAATGGGAAGCCAAGAGGCAAAATCAAGGCCGAAAGAGGCATTCGTCAAGGAGATCCGATCTCCC
CTTTTATTTTGTCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCATTCTTCTCACGGAGATGGGCTCGTACAGATCTTTTTCAATCAAGATGCATGCAGAGATGATCGACTAGCTAAGGGTAACATTTCAGACTCTCCTAGCTTCCCCAC
GGGAAACAAAAACTGCATTATGGTCCCAGACGGCTATGACCGAAGTGGATGGAATACTTTTTTGGCTATGCTCACTTTTAAGGAAAAAAAGGAGAGAACAGTCAATCAGA
TGGGTCCCCCGAGGGATCAACCCCGGGAACACGACAACAGGCAAAAAAGTCCAGATTCCTCTTCTAGCGATTCCTCTCAAAGATCGTACGCTGATGTGTTAAAAAATCCA
TCTGAAGTCGAATCGGGCCCCTCTGGTGGGGAAAACGACCTAAGTGGAATCAAAGGTCAATTTAACAAGACAGGGGAGACAGAGTTTAGCTTTGAATGGAATAAAGTGGT
TATTCTAAGCCGTAGATGTTTCCACGACGATTGGACAAGAATATTCAAGAAGATCCGAGAGCTTAACGACAATGTATTGTCTTATAGCCCATTTCACGCCGACAAAGCAA
TCTTATACCTCAGATCCTCAGACCAAGCGCAGTTGTTGTGTAAAAATAGAGGATGGGTCACAATAGGGCATTTTTATGTCAAGTATGAAAAATGGGAGCCGGGAAAACAT
GTCGTTCCAAAGCTTATTCCAAGTTATGGGGGATGGGTCAAGTTTCGTGGTATCCCTTTGCATGCGTGGAATATGAAGACCTTCACCCAACTCGGCGACGCCTGTGGAGG
TTTCATCGAAGTGTCCAAGAACACTTGGAGGAAGCTTGATCTGATTGATGTCGTTATAAAAATAAAAGAAAACTTCTGTGGTTTTATTCCGGCCGCGATATGCATCAGAG
ACGATAAAGGAGAAGCATATACGGTGAGAACAGTGGCACCGGAGAAGGCAAAATGGCATATCTGCAGAAACCCTAAGGTCCATGGGACGTTCATGCGCGGGGCAGCAATG
GAATACGATGAGTACGACGTCAAATCTGAGCCGTTCACATTCAGAGGGGAAGCAGCCTGCACAGTGTATGAATTAAATGCGGTTGCAGATTCCTTAAATGCCAGCTGCCT
CAATAATTCAAATAACGTGGTGAGGCCTTCTAAACGGTCTCTTCCAGCTGCCACGATTTCAAAGGATAAGGGGAAAAAATCTGCACAAGCTCCGAGGAGGATAGTAAGTG
TGGACTTGGGTTATTTGTCCCCAATATCCGACCCAGCTTTCTCAAGCCCGGAGGACCCTCCTTCGTCGAATCAACACTCAAAGGATAGTTCCCCTTCAACGATTGCTCAG
GACAGTTTGAAGACATTGCTTCAAGTGGACATGAACATGGATATCAATCAGAGGCCAGGCAGTGATGAGATCAATATAGGACAGGAGCAGAACGAGAAAAGAAATGACTT
GATGTATAAAAAACAACTAAATGAATGGCTCCTTGAGAACAACTTCTGCCTTGTTCCTACTAAATGTTTGTCGGAAGATGTTAATGTTACTGTTAACAACCATGATGTTA
TGGGGCCAGATAATATTGTTATTTTAACAGAAACAAAGATTTCCTCCATCTCTAAGAAGCTTATTAAATCACTATGGAGCTCAATCAGCATTAATTGGTCAGCGTTGGAT
TCTATTGGCAAATCTGGTGGCATTTTAATCCTTTGGGACGAATTGAATGTCTCTGCTATAGAAACTCTCAAAGGCATTTTCTCTCTCTCAATTAATTTTCGATTGGCAGA
TGGTTTTTGTTGGTGGCTTTCGGGTATTTATGGTCCAGCTAGTCGTCGAAACAGGAAGAATTTCTGGACTGAGCTTCATGATCTTGGAGGATTGTGCAATGATTGTTGGC
TATTAGGTGGAGACTTTAATGTCTTTCGGTGGAATCATGAAACTTCCTCTCCTCGTCCGGCTAAGAAGAGTATGAGGCGCTTCAATTCTTCATTAATGATCTTTCGCTTA
TTGACCCGCCCCTCCTTAATGGCAATTACACCTGTCCTTGAAAATCCAGAATTCAGGTGGGGCCCATGTCCCTTCCGTTTTGATAACTACCTTCTGCATGAGAAAGATTT
CACCAAGAATATTAAATTCTGGTGGGACTCAACATATCAAGAAGGCTTCCCTGGTTACTCATTTATTAGAAGATTAAAGCAGCTCGTTAGCAACATCAAGCTTTGGAAAA
GGAAAAACAGAGATCTCCTCAAATCTCAGAAACAGATCATATCGGATGAAATTGTGAAGATAGACCGCTTGGAGAATTTGGGTTGTCTAACCGATCAAATGTGCCAGAAA
AGAACCTCTCTTAAATCTGATCTTCATCAAGTGTCTATGCAAGAGATCAGGATGTGGAAACAAAGATGCAAGAAAACCTGGCTTAAAGAAGGAGATGAAAATACCACCTT
TTTCCATAAAGTCTGTACTGCTCGTCGGCGACGATGCTTCATTTCAGAGATGATTTCCGGAAATGGGGTAAGCCTGGTTACAGATTCTCAAATGGAGTTGGAGGTGATCA
ATCATTTTTCTAATATTTATAAGGGCCCCACTTCTGAAGGATGGATGGTCTCAAATTTAAATTGGTGCCCTATTTCTGGGTCTGAGGCGTCATCTTTGATTCAACCTTTT
TCTGAGCTAGAAGTATTTCAGAATCTAAAAGCGATGGGTCATAATAAGTCCCCTGGCCCGGATGGATTCACTGTTGAATTCTTTAGAAAGTCCTGGATCGCTATCACTCA
GTTATGGAAGTGTTCCATGACTTCTATCAGAAAGGACTTTAGACCCATTAGTCTCACCACAGTTCTTTATCGCCTTATCGCTAAGACTCTGGCAGAAAGGCTTAAAAGTA
CTCTCCAAGGAACAATATCTGAGAATCAACTAGCATTTGTAAAGGGTCGTCAAATTACTGATGCTATTTTGGTGGCTAATGAAGCTATTGATTTCTGGAAATGTTCTCGC
ACGAGAGGATTCGTTATAAAGCTAGATATTGAAAAGGCTTTTGACAAGATCTGTTGGAACTTCGTTGATAAGATTCTCGCTTTCAAGGGATACCTGATTACCTGGCGTAA
GTGGATCCGGGCTTGCATCACCTCAGTTTCATACTCCATATTGCTTAATGGGAAGCCAAGAGGCAAAATCAAGGCCGAAAGAGGCATTCGTCAAGGAGATCCGATCTCCC
CTTTTATTTTGTCCTAG
Protein sequenceShow/hide protein sequence
MHSSHGDGLVQIFFNQDACRDDRLAKGNISDSPSFPTGNKNCIMVPDGYDRSGWNTFLAMLTFKEKKERTVNQMGPPRDQPREHDNRQKSPDSSSSDSSQRSYADVLKNP
SEVESGPSGGENDLSGIKGQFNKTGETEFSFEWNKVVILSRRCFHDDWTRIFKKIRELNDNVLSYSPFHADKAILYLRSSDQAQLLCKNRGWVTIGHFYVKYEKWEPGKH
VVPKLIPSYGGWVKFRGIPLHAWNMKTFTQLGDACGGFIEVSKNTWRKLDLIDVVIKIKENFCGFIPAAICIRDDKGEAYTVRTVAPEKAKWHICRNPKVHGTFMRGAAM
EYDEYDVKSEPFTFRGEAACTVYELNAVADSLNASCLNNSNNVVRPSKRSLPAATISKDKGKKSAQAPRRIVSVDLGYLSPISDPAFSSPEDPPSSNQHSKDSSPSTIAQ
DSLKTLLQVDMNMDINQRPGSDEINIGQEQNEKRNDLMYKKQLNEWLLENNFCLVPTKCLSEDVNVTVNNHDVMGPDNIVILTETKISSISKKLIKSLWSSISINWSALD
SIGKSGGILILWDELNVSAIETLKGIFSLSINFRLADGFCWWLSGIYGPASRRNRKNFWTELHDLGGLCNDCWLLGGDFNVFRWNHETSSPRPAKKSMRRFNSSLMIFRL
LTRPSLMAITPVLENPEFRWGPCPFRFDNYLLHEKDFTKNIKFWWDSTYQEGFPGYSFIRRLKQLVSNIKLWKRKNRDLLKSQKQIISDEIVKIDRLENLGCLTDQMCQK
RTSLKSDLHQVSMQEIRMWKQRCKKTWLKEGDENTTFFHKVCTARRRRCFISEMISGNGVSLVTDSQMELEVINHFSNIYKGPTSEGWMVSNLNWCPISGSEASSLIQPF
SELEVFQNLKAMGHNKSPGPDGFTVEFFRKSWIAITQLWKCSMTSIRKDFRPISLTTVLYRLIAKTLAERLKSTLQGTISENQLAFVKGRQITDAILVANEAIDFWKCSR
TRGFVIKLDIEKAFDKICWNFVDKILAFKGYLITWRKWIRACITSVSYSILLNGKPRGKIKAERGIRQGDPISPFILS