; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008557 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008557
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
Genome locationchr9:25334732..25341848
RNA-Seq ExpressionLag0008557
SyntenyLag0008557
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR005135 - Endonuclease/exonuclease/phosphatase
IPR025558 - Domain of unknown function DUF4283
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]3.7e-13024.97Show/hide
Query:  ITESTRDRSFTLTLKIESQGWLSSCFTDLLSAPLNQKFFKQCRLDEYVLWVEKITNKK--GHCAEIARLGVNGGLNKIIIPVGSDKYGWKSFISLLKDTY
        +TE  + +SF++ +  ++  W+ +CF DLL     + FF + R+++  +WV K  NK      AEI R+   G    I++P G D +GWKSF++L+  T+
Subjt:  ITESTRDRSFTLTLKIESQGWLSSCFTDLLSAPLNQKFFKQCRLDEYVLWVEKITNKK--GHCAEIARLGVNGGLNKIIIPVGSDKYGWKSFISLLKDTY

Query:  KPQPPAIKAK--------MTYKEIIELD---------PYMKEEKNDHPKQQDVLTSSIAQDLRELATFFP---------ESSIIIFRKNFHDEWYEILRV
        +   P  + +         T+ +    D           + +   D  K++   TS  +   R  +  F          E ++II R+ FHD+W  I+  
Subjt:  KPQPPAIKAK--------MTYKEIIELD---------PYMKEEKNDHPKQQDVLTSSIAQDLRELATFFP---------ESSIIIFRKNFHDEWYEILRV

Query:  MQQEVSDFGSISPIQPDRALLAVEDKEQGRILCNIK---GWFKVGTFMVRFEPWNTKNFVKEPKVPSYGGWIKIRNLPIDKWSFQTFQKIGDACGGYMET
        ++++     S  P Q D+A+L + + +  ++LC+ K   GW  VG + V+FE W++        +PSYGGW++ R +P+  W++ TFQ IG ACGG+++ 
Subjt:  MQQEVSDFGSISPIQPDRALLAVEDKEQGRILCNIK---GWFKVGTFMVRFEPWNTKNFVKEPKVPSYGGWIKIRNLPIDKWSFQTFQKIGDACGGYMET

Query:  AKKTLSRMDMMEVSIKVKEN---VSPASVCLPSSSTSPITVTVDPFFVADQFIRFISGIHGKAPPTPIVEEPSRADYGPVNLIGASPSQPRSPTLAHDKK
        AK+T+    +++  IKV+ N     PAS+ +  +      VT      A   +     +HG        E                 + P  PT  H   
Subjt:  AKKTLSRMDMMEVSIKVKEN---VSPASVCLPSSSTSPITVTVDPFFVADQFIRFISGIHGKAPPTPIVEEPSRADYGPVNLIGASPSQPRSPTLAHDKK

Query:  VPYYRASASEKMASSVIGEVKKKPYADQPRDRLDEAHFQRQSSPIDFPSAHQPQTNTQIDKPTNMVTQPPNLSVRPTPNQNPSPNKPLQPPHRSRTDKMV
           Y    S+K + S   + KK   ++   D  D+    R+            Q +    K +  ++      + P   Q+ S N  +    +S     +
Subjt:  VPYYRASASEKMASSVIGEVKKKPYADQPRDRLDEAHFQRQSSPIDFPSAHQPQTNTQIDKPTNMVTQPPNLSVRPTPNQNPSPNKPLQPPHRSRTDKMV

Query:  PIMFNYKPTLLIKGTKFSTNPTRSSIDSEDLLSSQLCWSMFFAIFNIQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKA
           F  +                              WS                          P   ++ +L +   +K        +  Q   ED  
Subjt:  PIMFNYKPTLLIKGTKFSTNPTRSSIDSEDLLSSQLCWSMFFAIFNIQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKA

Query:  WHSLARLESACLKASSWCTLSKSFVAFSSQDICFNWHSFIFPLTLEFDHMNSITETEIIARDEISEDDFDKEDRLDNPMYLQSEDPSAYLSLLFPWLARH
         H L+  E+   + S    LS      S             PL           E+ I + +    D F+ +    N     S +            A++
Subjt:  WHSLARLESACLKASSWCTLSKSFVAFSSQDICFNWHSFIFPLTLEFDHMNSITETEIIARDEISEDDFDKEDRLDNPMYLQSEDPSAYLSLLFPWLARH

Query:  GMCIMPIPSKQKQTATKKPRNWDREVQNLQSTINYDK----DLGSWKKRALIKNFLKSCNPTVVILQETKSSSIDRKFIKSIWSSRYIGWSSIDAIGSLG
            +   + Q ++A++     + +     S +  D+     L  W K   +K   K  N       +  SSS    +   I S + +  +    +G  G
Subjt:  GMCIMPIPSKQKQTATKKPRNWDREVQNLQSTINYDK----DLGSWKKRALIKNFLKSCNPTVVILQETKSSSIDRKFIKSIWSSRYIGWSSIDAIGSLG

Query:  DILIMWNELILDIVEVVKGNFSLTLNLSLADGSDLWVTGIYSPNAPTERVHFWQELFDLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKGMKSFNKFID
         IL++W++    + ++  GN+S++LN+   +G + W+T +Y P    +R   W EL  L SLC PNW++ GDFNI RW  E +      + M +FN FI 
Subjt:  DILIMWNELILDIVEVVKGNFSLTLNLSLADGSDLWVTGIYSPNAPTERVHFWQELFDLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKGMKSFNKFID

Query:  LTELIDVPLMNGRYTWSN---NRAKTLIDRFLVTDSCVQKFGNAHVRRLARTTSDHYPIKLTLGKEKWGPTSFKFSNFWLSHNSFEKLLSSWWKNHSMVG
        + ELID P +N  +TWSN   N   + +DRFL++      FG    R L R  SDH+PI L   + KWGP  F+ +N  L    F+K   +WW +    G
Subjt:  LTELIDVPLMNGRYTWSN---NRAKTLIDRFLVTDSCVQKFGNAHVRRLARTTSDHYPIKLTLGKEKWGPTSFKFSNFWLSHNSFEKLLSSWWKNHSMVG

Query:  WLGHGFIQKLKAFKSELKQWSATTFGKQDTMEKELLNELNAIDLEEESRPLDEDKYNRRLSIKADLLSLAARDESLWRQRCKSKWLTEGDENTAYFHRYM
        + G+ FIQ L +    +K+W        D  +K LL E++ ID  E    +    + +R+S+K+DLLS+      +W QR + +W   GDEN +YFHR  
Subjt:  WLGHGFIQKLKAFKSELKQWSATTFGKQDTMEKELLNELNAIDLEEESRPLDEDKYNRRLSIKADLLSLAARDESLWRQRCKSKWLTEGDENTAYFHRYM

Query:  AANRRRNTIMELLSSSGNSLVDDASIETEFVSFYSKLFTIKVGQRYLPDIEDWGIIPAYHNDRLEAPFTEEEIHKAVNDLGTNKSPGPNGYTAEFFKKSW
          N+R+N I  +   +G SL     I   F+S +  ++T +  +  L D   W  I       L  PF E EI   +      K+PGP+GYT  F+KK W
Subjt:  AANRRRNTIMELLSSSGNSLVDDASIETEFVSFYSKLFTIKVGQRYLPDIEDWGIIPAYHNDRLEAPFTEEEIHKAVNDLGTNKSPGPNGYTAEFFKKSW

Query:  NILKEDIIRVFDDFFKSGTINASLNETYICL---------------------------------------------------------------------
          LK+D++ VF DF K+G +N ++N T+I L                                                                     
Subjt:  NILKEDIIRVFDDFFKSGTINASLNETYICL---------------------------------------------------------------------

Query:  -------------IPKSGSKICWIHI--------------------------------------------------------------------------
                     I K+  KI W  I                                                                          
Subjt:  -------------IPKSGSKICWIHI--------------------------------------------------------------------------

Query:  ---------LSISHLQFADDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWPNTYLGLPLNGKPKSTSFW
                  +ISHL FADD ++F   +E +L+NL   +  FE+ASGL  N  KS    I I+      +A  FG +T   P  YLG+PL G P+S SFW
Subjt:  ---------LSISHLQFADDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWPNTYLGLPLNGKPKSTSFW

Query:  DPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPYISYPSSRLP
        D  +E I K+L  W  + +SKGGRLTL++A+LS+LP     + + P
Subjt:  DPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPYISYPSSRLP

RVW93866.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]1.6e-12533.29Show/hide
Query:  KDLGSWKKRALIKNFLKSCNPTVVILQETKSSSIDRKFIKSIWSSRYIGWSSIDAIGSLGDILIMWNELILDIVEVVKGNFSLTLNLSLADGSDLWVTGI
        + LGS  KR +IK+FL+S NP VV++QETK  + DR+F+ S+W+ R   W ++ A G+ G ILI+W+  IL   EVV G+FS+++  SL     LW++ +
Subjt:  KDLGSWKKRALIKNFLKSCNPTVVILQETKSSSIDRKFIKSIWSSRYIGWSSIDAIGSLGDILIMWNELILDIVEVVKGNFSLTLNLSLADGSDLWVTGI

Query:  YSPNAPTERVHFWQELFDLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKGMKSFNKFIDLTELIDVPLMNGRYTWSNNRAKTL---IDRFLVTDSCVQK
        Y PN+P+ R  FW ELFD+  L  P W +GGDFN+ R S EK      T  M+ F+ FI   EL+D PL N  +TWSN +   +   +DRFL ++     
Subjt:  YSPNAPTERVHFWQELFDLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKGMKSFNKFIDLTELIDVPLMNGRYTWSNNRAKTL---IDRFLVTDSCVQK

Query:  FGNAHVRRLARTTSDHYPIKLTLGKEKWGPTSFKFSNFWLSHNSFEKLLSSWWKNHSMVGWLGHGFIQKLKAFKSELKQWSATTFGKQDTMEKELLNELN
        F       L R TSDH+PI +      WGPT F+F N WL H +F++    WW     +GW G+ F+++L+  K++LK+W  ++FG+    +K +LN+L 
Subjt:  FGNAHVRRLARTTSDHYPIKLTLGKEKWGPTSFKFSNFWLSHNSFEKLLSSWWKNHSMVGWLGHGFIQKLKAFKSELKQWSATTFGKQDTMEKELLNELN

Query:  AIDLEEESRPLDEDKYNRRLSIKADLLSLAARDESLWRQRCKSKWLTEGDENTAYFHRYMAANRRRNTIMELLSSSGNSLVDDASIETEFVSFYSKLFTI
          D  E+   L+ D  ++R+S K +L  L  R+E  WRQ+ K KW+ EGD N+ ++H+     R R  I EL +  G  L +  SI  E + ++ KL+T 
Subjt:  AIDLEEESRPLDEDKYNRRLSIKADLLSLAARDESLWRQRCKSKWLTEGDENTAYFHRYMAANRRRNTIMELLSSSGNSLVDDASIETEFVSFYSKLFTI

Query:  KVGQRYLPDIEDWGIIPAYHNDRLEAPFTEEEIHKAVNDLGTNKSPGPNGYTAEFFKKSWNILKEDIIRVFDDFFKSGTINASLNETYICLIPK------
          G+ +  +  DW  I      RLE+PFTEEEI KA+  L  +K+ GP+G+T   F++ W+++KED++RVF +F +SG IN S N ++I LIPK      
Subjt:  KVGQRYLPDIEDWGIIPAYHNDRLEAPFTEEEIHKAVNDLGTNKSPGPNGYTAEFFKKSWNILKEDIIRVFDDFFKSGTINASLNETYICLIPK------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------SGSKICWI---------------------HILS-----------------------ISHLQFADDTILFSSHDESHLDNLFSVI
                        +GS   W+                      +LS                       +SHLQFADDTI FS+  E  L  L S++
Subjt:  ----------------SGSKICWI---------------------HILS-----------------------ISHLQFADDTILFSSHDESHLDNLFSVI

Query:  KQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWPNTYLGLPLNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPYIS
          F   SGL VN +KS   GI +    +S LA+   CK  GWP  YLGLPL G PK+  FWDPVVE+I  RL  W   +LS  GR+TLIQ+ L++LP   
Subjt:  KQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWPNTYLGLPLNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPYIS

Query:  YPSSRLPQRLSTRL
            ++P  ++ ++
Subjt:  YPSSRLPQRLSTRL

RVX04307.1 Beta-arabinofuranosyltransferase RAY1 [Vitis vinifera]6.2e-13035.07Show/hide
Query:  KDLGSWKKRALIKNFLKSCNPTVVILQETKSSSIDRKFIKSIWSSRYIGWSSIDAIGSLGDILIMWNELILDIVEVVKGNFSLTLNLSLADGSDLWVTGI
        + LGS  KR ++K+FL+S NP VV++QETK    DR+F+ S+W++R   W ++ A G+ G ILI+W+   L   EVV G+FS+++  SL     LW++ +
Subjt:  KDLGSWKKRALIKNFLKSCNPTVVILQETKSSSIDRKFIKSIWSSRYIGWSSIDAIGSLGDILIMWNELILDIVEVVKGNFSLTLNLSLADGSDLWVTGI

Query:  YSPNAPTERVHFWQELFDLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKGMKSFNKFIDLTELIDVPLMNGRYTWSNNRAKTL---IDRFLVTDSCVQK
        Y PN+P+ R  FW E+FD+  L  P W +GGDFN+ R S EK      T  M+ F+ FI   EL+D PL N  +TWSN +   +   +DRFL ++     
Subjt:  YSPNAPTERVHFWQELFDLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKGMKSFNKFIDLTELIDVPLMNGRYTWSNNRAKTL---IDRFLVTDSCVQK

Query:  FGNAHVRRLARTTSDHYPIKLTLGKEKWGPTSFKFSNFWLSHNSFEKLLSSWWKNHSMVGWLGHGFIQKLKAFKSELKQWSATTFGKQDTMEKELLNELN
        F       L R TSDH+PI +      WG T F+F N WL H +F++    WW      GW GH F+++L+  K++LK+W+  +FG+    +K +LN+L 
Subjt:  FGNAHVRRLARTTSDHYPIKLTLGKEKWGPTSFKFSNFWLSHNSFEKLLSSWWKNHSMVGWLGHGFIQKLKAFKSELKQWSATTFGKQDTMEKELLNELN

Query:  AIDLEEESRPLDEDKYNRRLSIKADLLSLAARDESLWRQRCKSKWLTEGDENTAYFHRYMAANRRRNTIMELLSSSGNSLVDDASIETEFVSFYSKLFTI
          D  E    L+ D  ++R S K +L  L  R+E  WRQ+ K KW+ EGD N+ ++H+     R R  I EL +  G  L +  SI  E + ++ KL+T 
Subjt:  AIDLEEESRPLDEDKYNRRLSIKADLLSLAARDESLWRQRCKSKWLTEGDENTAYFHRYMAANRRRNTIMELLSSSGNSLVDDASIETEFVSFYSKLFTI

Query:  KVGQRYLPDIEDWGIIPAYHNDRLEAPFTEEEIHKAVNDLGTNKSPGPNGYTAEFFKKSWNILKEDIIRVFDDFFKSGTINASLNETYICLIPK------
          G+ +  +  DW  I      RL++PFTEEEI KA+  L  +K+PGP+G+T   F++ W+++KED++RVF +F +SG IN S N ++I LIPK      
Subjt:  KVGQRYLPDIEDWGIIPAYHNDRLEAPFTEEEIHKAVNDLGTNKSPGPNGYTAEFFKKSWNILKEDIIRVFDDFFKSGTINASLNETYICLIPK------

Query:  ---------------------------------------------------------------------SGSKICWI---------------------HI
                                                                             +GS   W+                      +
Subjt:  ---------------------------------------------------------------------SGSKICWI---------------------HI

Query:  LS-----------------------ISHLQFADDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWPNTYL
        LS                       +SHLQFADDTI FS+  E  L  L S++  F   SGL VN +KS   GI +    +S LA+   CK  GWP  YL
Subjt:  LS-----------------------ISHLQFADDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWPNTYL

Query:  GLPLNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPYISYPSSRLPQRLSTRL
        GLPL G PK+  FWDPVVE+I  RL  W  T+LS GGR+T IQ+ L++LP       ++P  ++ ++
Subjt:  GLPLNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPYISYPSSRLPQRLSTRL

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]8.2e-13026.45Show/hide
Query:  ITESTRDRSFTLTLKIESQGWLSSCFTDLLSAPLNQKFFKQCRLDEYVLWVEKITNKKGHCAEIARLGVNGGLNKIIIPVGSDKYGWKSFISLL------
        +TE+   ++F++ +      W+      L++ P   +FF + R  E  +W+ K  N KG  AEI R+      + I++P G DK GW SF+S++      
Subjt:  ITESTRDRSFTLTLKIESQGWLSSCFTDLLSAPLNQKFFKQCRLDEYVLWVEKITNKKGHCAEIARLGVNGGLNKIIIPVGSDKYGWKSFISLL------

Query:  ----KDTYKPQ--------PPAIKAKMTY-KEIIELDPYMKEEKNDHPKQQDVLTSSIAQDLRELATFFPESSIIIFRKNFHDEWYEILRVMQQEVSDFG
            + T+ P+        PP    K +Y K + E  P+   + +D     D   SS        ++   E++++I R+ FHD+W++IL+ ++++  +  
Subjt:  ----KDTYKPQ--------PPAIKAKMTY-KEIIELDPYMKEEKNDHPKQQDVLTSSIAQDLRELATFFPESSIIIFRKNFHDEWYEILRVMQQEVSDFG

Query:  SISPIQPDRALLAVEDKEQGRILCNIKGWFKVGTFMVRFEPWNTKNFVKEPKVPSYGGWIKIRNLPIDKWSFQTFQKIGDACGGYMETAKKTLSRMDMME
        + +    ++AL+         +LC  KGW  VG + VRFE W+         +PSYGGW   R +P+  W+  TFQ+IG AC G ++ A++T S  +++E
Subjt:  SISPIQPDRALLAVEDKEQGRILCNIKGWFKVGTFMVRFEPWNTKNFVKEPKVPSYGGWIKIRNLPIDKWSFQTFQKIGDACGGYMETAKKTLSRMDMME

Query:  VSIKVKENVS---PASVCLPSSSTSPITVTVDPFFVADQFIRFISGIHGKAPPTPIVEEPSRADYGPVNLIGASPSQPRSPTLAHDKKVPYYRASASEKM
          IKV+ N S   PA+V +  +  +              F++ ++   GK     ++E         V L G    Q  +     + +   +    SE +
Subjt:  VSIKVKENVS---PASVCLPSSSTSPITVTVDPFFVADQFIRFISGIHGKAPPTPIVEEPSRADYGPVNLIGASPSQPRSPTLAHDKKVPYYRASASEKM

Query:  ASSVIGEVKKKPYADQPRDRLDEAHFQRQSSPIDFPSAHQPQTNTQIDKPTNMVTQPPNLSVRPTPNQNPSPNKPLQPPHRSRTDKMVPIMFNY-KPTLL
        +                 D L  +   R+SS  D PSA      + I KP    T P  L+     + N          H +     + I+       +L
Subjt:  ASSVIGEVKKKPYADQPRDRLDEAHFQRQSSPIDFPSAHQPQTNTQIDKPTNMVTQPPNLSVRPTPNQNPSPNKPLQPPHRSRTDKMVPIMFNY-KPTLL

Query:  IKGTKFSTNPTRSSIDSEDLLSSQLCWSMFFAIFNIQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKAWHSLARLESAC
         KG +      +  I  +   +  L  S     FN     S S K N+      P+          N   +L S    E+ Q+V  +++     + +S+ 
Subjt:  IKGTKFSTNPTRSSIDSEDLLSSQLCWSMFFAIFNIQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKAWHSLARLESAC

Query:  LKASSWCTLSKS-FVAFSSQDICFNWHSFIFPLTLEFDHMNSITETEIIARDEISEDDFDKEDRLDNPMYLQSEDPSAYLSLLFPWLARHGMCIMPIPSK
         + +S    +K  F+    Q +  +  +    L+L  D    + +   +  ++  ED  + ++         +E      + + P      M +    + 
Subjt:  LKASSWCTLSKS-FVAFSSQDICFNWHSFIFPLTLEFDHMNSITETEIIARDEISEDDFDKEDRLDNPMYLQSEDPSAYLSLLFPWLARHGMCIMPIPSK

Query:  QKQTATKKPRNWDR---------EVQNLQSTINYDKDLGSWKKRALIK-----NFLKSCNPTVVILQETKS--SSIDRKFIKSIWSSRYIGWSSIDAIGS
          +   +KP++  +         E +    +  + K L SW K+  +K     +   +   T V+L +  S     +++ IKS+W S  I W + +A GS
Subjt:  QKQTATKKPRNWDR---------EVQNLQSTINYDKDLGSWKKRALIK-----NFLKSCNPTVVILQETKS--SSIDRKFIKSIWSSRYIGWSSIDAIGS

Query:  LGDILIMWNELILDIVEVVKGNFSLTLNLSLADGSDLWVTGIYSPNAPTERVHFWQELFDLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKGMKSFNKF
         G ILI+W+     ++   +G FSL+ N  L + S  W+TG+Y P    ER+HFW EL +L  L    WI+GGD N+ R   E ++    +   +  N F
Subjt:  LGDILIMWNELILDIVEVVKGNFSLTLNLSLADGSDLWVTGIYSPNAPTERVHFWQELFDLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKGMKSFNKF

Query:  IDLTELIDVPLMNGRYTWSNNR---AKTLIDRFLVTDSCVQKFGNAHVRRLARTTSDHYPIKLTLGKEK--WGPTSFKFSNFWLSHNSFEKLLSSWWKNH
        I    LID PL N R+TWSN R     + IDRFL   S    F     R L R+TSDH+P+       K  WGP  F+ ++  LS   F++ +  WW+N 
Subjt:  IDLTELIDVPLMNGRYTWSNNR---AKTLIDRFLVTDSCVQKFGNAHVRRLARTTSDHYPIKLTLGKEK--WGPTSFKFSNFWLSHNSFEKLLSSWWKNH

Query:  SMVGWLGHGFIQKLKAFKSELKQWSATTFGKQDTMEKELLNELNAIDLEEESRPLDEDKYNRRLSIKADLLSLAARDESLWRQRCKSKWLTEGDENTAYF
           G+ G  FIQ+LK+  + +K W           ++ ++ E+++ID +E   PL +++ NRRL++KADL  L+ ++   W QR K  WL EGDEN+++F
Subjt:  SMVGWLGHGFIQKLKAFKSELKQWSATTFGKQDTMEKELLNELNAIDLEEESRPLDEDKYNRRLSIKADLLSLAARDESLWRQRCKSKWLTEGDENTAYF

Query:  HRYMAANRRRNTIMELLSSSGNSLVDDASIETEFVSFYSKLF--TIKVGQRYLPDIEDWGIIPAYHNDRLEAPFTEEEIHKAVNDLGTNKSPGPNGYTAE
        HR  ++ ++R+ I E+    G+    + SI T F+ F+S+++  + K    ++ ++ DW  I +     L APF E EI   +N     K+PGP+G+   
Subjt:  HRYMAANRRRNTIMELLSSSGNSLVDDASIETEFVSFYSKLF--TIKVGQRYLPDIEDWGIIPAYHNDRLEAPFTEEEIHKAVNDLGTNKSPGPNGYTAE

Query:  FFKKSW-------------------------------------------NILKEDIIRVFD----DF-------------FKSGTINASLNETYICLIP-
        FFK  W                                            ILK DI + FD    DF             ++        N TY  +I  
Subjt:  FFKKSW-------------------------------------------NILKEDIIRVFD----DF-------------FKSGTINASLNETYICLIP-

Query:  -------------------------------------KSGSKICWIHI---LSISHLQFADDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFM
                                             +S   I  + +    +ISH+ FADD +LF   ++  L NL   +  FE ASGL +N  KS  +
Subjt:  -------------------------------------KSGSKICWIHI---LSISHLQFADDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFM

Query:  GIGIAPQTVSCLADRFGCKTGGWPNTYLGLPLNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLP
         + ++ +     A  +G      P +YLG+PL G PKS  FW  V +KI+K+L +W    +SKGGRLTLI++TLS+LP
Subjt:  GIGIAPQTVSCLADRFGCKTGGWPNTYLGLPLNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLP

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]3.7e-13024.97Show/hide
Query:  ITESTRDRSFTLTLKIESQGWLSSCFTDLLSAPLNQKFFKQCRLDEYVLWVEKITNKK--GHCAEIARLGVNGGLNKIIIPVGSDKYGWKSFISLLKDTY
        +TE  + +SF++ +  ++  W+ +CF DLL     + FF + R+++  +WV K  NK      AEI R+   G    I++P G D +GWKSF++L+  T+
Subjt:  ITESTRDRSFTLTLKIESQGWLSSCFTDLLSAPLNQKFFKQCRLDEYVLWVEKITNKK--GHCAEIARLGVNGGLNKIIIPVGSDKYGWKSFISLLKDTY

Query:  KPQPPAIKAK--------MTYKEIIELD---------PYMKEEKNDHPKQQDVLTSSIAQDLRELATFFP---------ESSIIIFRKNFHDEWYEILRV
        +   P  + +         T+ +    D           + +   D  K++   TS  +   R  +  F          E ++II R+ FHD+W  I+  
Subjt:  KPQPPAIKAK--------MTYKEIIELD---------PYMKEEKNDHPKQQDVLTSSIAQDLRELATFFP---------ESSIIIFRKNFHDEWYEILRV

Query:  MQQEVSDFGSISPIQPDRALLAVEDKEQGRILCNIK---GWFKVGTFMVRFEPWNTKNFVKEPKVPSYGGWIKIRNLPIDKWSFQTFQKIGDACGGYMET
        ++++     S  P Q D+A+L + + +  ++LC+ K   GW  VG + V+FE W++        +PSYGGW++ R +P+  W++ TFQ IG ACGG+++ 
Subjt:  MQQEVSDFGSISPIQPDRALLAVEDKEQGRILCNIK---GWFKVGTFMVRFEPWNTKNFVKEPKVPSYGGWIKIRNLPIDKWSFQTFQKIGDACGGYMET

Query:  AKKTLSRMDMMEVSIKVKEN---VSPASVCLPSSSTSPITVTVDPFFVADQFIRFISGIHGKAPPTPIVEEPSRADYGPVNLIGASPSQPRSPTLAHDKK
        AK+T+    +++  IKV+ N     PAS+ +  +      VT      A   +     +HG        E                 + P  PT  H   
Subjt:  AKKTLSRMDMMEVSIKVKEN---VSPASVCLPSSSTSPITVTVDPFFVADQFIRFISGIHGKAPPTPIVEEPSRADYGPVNLIGASPSQPRSPTLAHDKK

Query:  VPYYRASASEKMASSVIGEVKKKPYADQPRDRLDEAHFQRQSSPIDFPSAHQPQTNTQIDKPTNMVTQPPNLSVRPTPNQNPSPNKPLQPPHRSRTDKMV
           Y    S+K + S   + KK   ++   D  D+    R+            Q +    K +  ++      + P   Q+ S N  +    +S     +
Subjt:  VPYYRASASEKMASSVIGEVKKKPYADQPRDRLDEAHFQRQSSPIDFPSAHQPQTNTQIDKPTNMVTQPPNLSVRPTPNQNPSPNKPLQPPHRSRTDKMV

Query:  PIMFNYKPTLLIKGTKFSTNPTRSSIDSEDLLSSQLCWSMFFAIFNIQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKA
           F  +                              WS                          P   ++ +L +   +K        +  Q   ED  
Subjt:  PIMFNYKPTLLIKGTKFSTNPTRSSIDSEDLLSSQLCWSMFFAIFNIQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKA

Query:  WHSLARLESACLKASSWCTLSKSFVAFSSQDICFNWHSFIFPLTLEFDHMNSITETEIIARDEISEDDFDKEDRLDNPMYLQSEDPSAYLSLLFPWLARH
         H L+  E+   + S    LS      S             PL           E+ I + +    D F+ +    N     S +            A++
Subjt:  WHSLARLESACLKASSWCTLSKSFVAFSSQDICFNWHSFIFPLTLEFDHMNSITETEIIARDEISEDDFDKEDRLDNPMYLQSEDPSAYLSLLFPWLARH

Query:  GMCIMPIPSKQKQTATKKPRNWDREVQNLQSTINYDK----DLGSWKKRALIKNFLKSCNPTVVILQETKSSSIDRKFIKSIWSSRYIGWSSIDAIGSLG
            +   + Q ++A++     + +     S +  D+     L  W K   +K   K  N       +  SSS    +   I S + +  +    +G  G
Subjt:  GMCIMPIPSKQKQTATKKPRNWDREVQNLQSTINYDK----DLGSWKKRALIKNFLKSCNPTVVILQETKSSSIDRKFIKSIWSSRYIGWSSIDAIGSLG

Query:  DILIMWNELILDIVEVVKGNFSLTLNLSLADGSDLWVTGIYSPNAPTERVHFWQELFDLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKGMKSFNKFID
         IL++W++    + ++  GN+S++LN+   +G + W+T +Y P    +R   W EL  L SLC PNW++ GDFNI RW  E +      + M +FN FI 
Subjt:  DILIMWNELILDIVEVVKGNFSLTLNLSLADGSDLWVTGIYSPNAPTERVHFWQELFDLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKGMKSFNKFID

Query:  LTELIDVPLMNGRYTWSN---NRAKTLIDRFLVTDSCVQKFGNAHVRRLARTTSDHYPIKLTLGKEKWGPTSFKFSNFWLSHNSFEKLLSSWWKNHSMVG
        + ELID P +N  +TWSN   N   + +DRFL++      FG    R L R  SDH+PI L   + KWGP  F+ +N  L    F+K   +WW +    G
Subjt:  LTELIDVPLMNGRYTWSN---NRAKTLIDRFLVTDSCVQKFGNAHVRRLARTTSDHYPIKLTLGKEKWGPTSFKFSNFWLSHNSFEKLLSSWWKNHSMVG

Query:  WLGHGFIQKLKAFKSELKQWSATTFGKQDTMEKELLNELNAIDLEEESRPLDEDKYNRRLSIKADLLSLAARDESLWRQRCKSKWLTEGDENTAYFHRYM
        + G+ FIQ L +    +K+W        D  +K LL E++ ID  E    +    + +R+S+K+DLLS+      +W QR + +W   GDEN +YFHR  
Subjt:  WLGHGFIQKLKAFKSELKQWSATTFGKQDTMEKELLNELNAIDLEEESRPLDEDKYNRRLSIKADLLSLAARDESLWRQRCKSKWLTEGDENTAYFHRYM

Query:  AANRRRNTIMELLSSSGNSLVDDASIETEFVSFYSKLFTIKVGQRYLPDIEDWGIIPAYHNDRLEAPFTEEEIHKAVNDLGTNKSPGPNGYTAEFFKKSW
          N+R+N I  +   +G SL     I   F+S +  ++T +  +  L D   W  I       L  PF E EI   +      K+PGP+GYT  F+KK W
Subjt:  AANRRRNTIMELLSSSGNSLVDDASIETEFVSFYSKLFTIKVGQRYLPDIEDWGIIPAYHNDRLEAPFTEEEIHKAVNDLGTNKSPGPNGYTAEFFKKSW

Query:  NILKEDIIRVFDDFFKSGTINASLNETYICL---------------------------------------------------------------------
          LK+D++ VF DF K+G +N ++N T+I L                                                                     
Subjt:  NILKEDIIRVFDDFFKSGTINASLNETYICL---------------------------------------------------------------------

Query:  -------------IPKSGSKICWIHI--------------------------------------------------------------------------
                     I K+  KI W  I                                                                          
Subjt:  -------------IPKSGSKICWIHI--------------------------------------------------------------------------

Query:  ---------LSISHLQFADDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWPNTYLGLPLNGKPKSTSFW
                  +ISHL FADD ++F   +E +L+NL   +  FE+ASGL  N  KS    I I+      +A  FG +T   P  YLG+PL G P+S SFW
Subjt:  ---------LSISHLQFADDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWPNTYLGLPLNGKPKSTSFW

Query:  DPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPYISYPSSRLP
        D  +E I K+L  W  + +SKGGRLTL++A+LS+LP     + + P
Subjt:  DPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPYISYPSSRLP

TrEMBL top hitse value%identityAlignment
A0A438IAX3 Transposon TX1 uncharacterized 149 kDa protein7.7e-12633.29Show/hide
Query:  KDLGSWKKRALIKNFLKSCNPTVVILQETKSSSIDRKFIKSIWSSRYIGWSSIDAIGSLGDILIMWNELILDIVEVVKGNFSLTLNLSLADGSDLWVTGI
        + LGS  KR +IK+FL+S NP VV++QETK  + DR+F+ S+W+ R   W ++ A G+ G ILI+W+  IL   EVV G+FS+++  SL     LW++ +
Subjt:  KDLGSWKKRALIKNFLKSCNPTVVILQETKSSSIDRKFIKSIWSSRYIGWSSIDAIGSLGDILIMWNELILDIVEVVKGNFSLTLNLSLADGSDLWVTGI

Query:  YSPNAPTERVHFWQELFDLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKGMKSFNKFIDLTELIDVPLMNGRYTWSNNRAKTL---IDRFLVTDSCVQK
        Y PN+P+ R  FW ELFD+  L  P W +GGDFN+ R S EK      T  M+ F+ FI   EL+D PL N  +TWSN +   +   +DRFL ++     
Subjt:  YSPNAPTERVHFWQELFDLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKGMKSFNKFIDLTELIDVPLMNGRYTWSNNRAKTL---IDRFLVTDSCVQK

Query:  FGNAHVRRLARTTSDHYPIKLTLGKEKWGPTSFKFSNFWLSHNSFEKLLSSWWKNHSMVGWLGHGFIQKLKAFKSELKQWSATTFGKQDTMEKELLNELN
        F       L R TSDH+PI +      WGPT F+F N WL H +F++    WW     +GW G+ F+++L+  K++LK+W  ++FG+    +K +LN+L 
Subjt:  FGNAHVRRLARTTSDHYPIKLTLGKEKWGPTSFKFSNFWLSHNSFEKLLSSWWKNHSMVGWLGHGFIQKLKAFKSELKQWSATTFGKQDTMEKELLNELN

Query:  AIDLEEESRPLDEDKYNRRLSIKADLLSLAARDESLWRQRCKSKWLTEGDENTAYFHRYMAANRRRNTIMELLSSSGNSLVDDASIETEFVSFYSKLFTI
          D  E+   L+ D  ++R+S K +L  L  R+E  WRQ+ K KW+ EGD N+ ++H+     R R  I EL +  G  L +  SI  E + ++ KL+T 
Subjt:  AIDLEEESRPLDEDKYNRRLSIKADLLSLAARDESLWRQRCKSKWLTEGDENTAYFHRYMAANRRRNTIMELLSSSGNSLVDDASIETEFVSFYSKLFTI

Query:  KVGQRYLPDIEDWGIIPAYHNDRLEAPFTEEEIHKAVNDLGTNKSPGPNGYTAEFFKKSWNILKEDIIRVFDDFFKSGTINASLNETYICLIPK------
          G+ +  +  DW  I      RLE+PFTEEEI KA+  L  +K+ GP+G+T   F++ W+++KED++RVF +F +SG IN S N ++I LIPK      
Subjt:  KVGQRYLPDIEDWGIIPAYHNDRLEAPFTEEEIHKAVNDLGTNKSPGPNGYTAEFFKKSWNILKEDIIRVFDDFFKSGTINASLNETYICLIPK------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------SGSKICWI---------------------HILS-----------------------ISHLQFADDTILFSSHDESHLDNLFSVI
                        +GS   W+                      +LS                       +SHLQFADDTI FS+  E  L  L S++
Subjt:  ----------------SGSKICWI---------------------HILS-----------------------ISHLQFADDTILFSSHDESHLDNLFSVI

Query:  KQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWPNTYLGLPLNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPYIS
          F   SGL VN +KS   GI +    +S LA+   CK  GWP  YLGLPL G PK+  FWDPVVE+I  RL  W   +LS  GR+TLIQ+ L++LP   
Subjt:  KQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWPNTYLGLPLNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPYIS

Query:  YPSSRLPQRLSTRL
            ++P  ++ ++
Subjt:  YPSSRLPQRLSTRL

A0A438J5R7 Beta-arabinofuranosyltransferase RAY13.0e-13035.07Show/hide
Query:  KDLGSWKKRALIKNFLKSCNPTVVILQETKSSSIDRKFIKSIWSSRYIGWSSIDAIGSLGDILIMWNELILDIVEVVKGNFSLTLNLSLADGSDLWVTGI
        + LGS  KR ++K+FL+S NP VV++QETK    DR+F+ S+W++R   W ++ A G+ G ILI+W+   L   EVV G+FS+++  SL     LW++ +
Subjt:  KDLGSWKKRALIKNFLKSCNPTVVILQETKSSSIDRKFIKSIWSSRYIGWSSIDAIGSLGDILIMWNELILDIVEVVKGNFSLTLNLSLADGSDLWVTGI

Query:  YSPNAPTERVHFWQELFDLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKGMKSFNKFIDLTELIDVPLMNGRYTWSNNRAKTL---IDRFLVTDSCVQK
        Y PN+P+ R  FW E+FD+  L  P W +GGDFN+ R S EK      T  M+ F+ FI   EL+D PL N  +TWSN +   +   +DRFL ++     
Subjt:  YSPNAPTERVHFWQELFDLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKGMKSFNKFIDLTELIDVPLMNGRYTWSNNRAKTL---IDRFLVTDSCVQK

Query:  FGNAHVRRLARTTSDHYPIKLTLGKEKWGPTSFKFSNFWLSHNSFEKLLSSWWKNHSMVGWLGHGFIQKLKAFKSELKQWSATTFGKQDTMEKELLNELN
        F       L R TSDH+PI +      WG T F+F N WL H +F++    WW      GW GH F+++L+  K++LK+W+  +FG+    +K +LN+L 
Subjt:  FGNAHVRRLARTTSDHYPIKLTLGKEKWGPTSFKFSNFWLSHNSFEKLLSSWWKNHSMVGWLGHGFIQKLKAFKSELKQWSATTFGKQDTMEKELLNELN

Query:  AIDLEEESRPLDEDKYNRRLSIKADLLSLAARDESLWRQRCKSKWLTEGDENTAYFHRYMAANRRRNTIMELLSSSGNSLVDDASIETEFVSFYSKLFTI
          D  E    L+ D  ++R S K +L  L  R+E  WRQ+ K KW+ EGD N+ ++H+     R R  I EL +  G  L +  SI  E + ++ KL+T 
Subjt:  AIDLEEESRPLDEDKYNRRLSIKADLLSLAARDESLWRQRCKSKWLTEGDENTAYFHRYMAANRRRNTIMELLSSSGNSLVDDASIETEFVSFYSKLFTI

Query:  KVGQRYLPDIEDWGIIPAYHNDRLEAPFTEEEIHKAVNDLGTNKSPGPNGYTAEFFKKSWNILKEDIIRVFDDFFKSGTINASLNETYICLIPK------
          G+ +  +  DW  I      RL++PFTEEEI KA+  L  +K+PGP+G+T   F++ W+++KED++RVF +F +SG IN S N ++I LIPK      
Subjt:  KVGQRYLPDIEDWGIIPAYHNDRLEAPFTEEEIHKAVNDLGTNKSPGPNGYTAEFFKKSWNILKEDIIRVFDDFFKSGTINASLNETYICLIPK------

Query:  ---------------------------------------------------------------------SGSKICWI---------------------HI
                                                                             +GS   W+                      +
Subjt:  ---------------------------------------------------------------------SGSKICWI---------------------HI

Query:  LS-----------------------ISHLQFADDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWPNTYL
        LS                       +SHLQFADDTI FS+  E  L  L S++  F   SGL VN +KS   GI +    +S LA+   CK  GWP  YL
Subjt:  LS-----------------------ISHLQFADDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWPNTYL

Query:  GLPLNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPYISYPSSRLPQRLSTRL
        GLPL G PK+  FWDPVVE+I  RL  W  T+LS GGR+T IQ+ L++LP       ++P  ++ ++
Subjt:  GLPLNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPYISYPSSRLPQRLSTRL

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein1.8e-13024.97Show/hide
Query:  ITESTRDRSFTLTLKIESQGWLSSCFTDLLSAPLNQKFFKQCRLDEYVLWVEKITNKK--GHCAEIARLGVNGGLNKIIIPVGSDKYGWKSFISLLKDTY
        +TE  + +SF++ +  ++  W+ +CF DLL     + FF + R+++  +WV K  NK      AEI R+   G    I++P G D +GWKSF++L+  T+
Subjt:  ITESTRDRSFTLTLKIESQGWLSSCFTDLLSAPLNQKFFKQCRLDEYVLWVEKITNKK--GHCAEIARLGVNGGLNKIIIPVGSDKYGWKSFISLLKDTY

Query:  KPQPPAIKAK--------MTYKEIIELD---------PYMKEEKNDHPKQQDVLTSSIAQDLRELATFFP---------ESSIIIFRKNFHDEWYEILRV
        +   P  + +         T+ +    D           + +   D  K++   TS  +   R  +  F          E ++II R+ FHD+W  I+  
Subjt:  KPQPPAIKAK--------MTYKEIIELD---------PYMKEEKNDHPKQQDVLTSSIAQDLRELATFFP---------ESSIIIFRKNFHDEWYEILRV

Query:  MQQEVSDFGSISPIQPDRALLAVEDKEQGRILCNIK---GWFKVGTFMVRFEPWNTKNFVKEPKVPSYGGWIKIRNLPIDKWSFQTFQKIGDACGGYMET
        ++++     S  P Q D+A+L + + +  ++LC+ K   GW  VG + V+FE W++        +PSYGGW++ R +P+  W++ TFQ IG ACGG+++ 
Subjt:  MQQEVSDFGSISPIQPDRALLAVEDKEQGRILCNIK---GWFKVGTFMVRFEPWNTKNFVKEPKVPSYGGWIKIRNLPIDKWSFQTFQKIGDACGGYMET

Query:  AKKTLSRMDMMEVSIKVKEN---VSPASVCLPSSSTSPITVTVDPFFVADQFIRFISGIHGKAPPTPIVEEPSRADYGPVNLIGASPSQPRSPTLAHDKK
        AK+T+    +++  IKV+ N     PAS+ +  +      VT      A   +     +HG        E                 + P  PT  H   
Subjt:  AKKTLSRMDMMEVSIKVKEN---VSPASVCLPSSSTSPITVTVDPFFVADQFIRFISGIHGKAPPTPIVEEPSRADYGPVNLIGASPSQPRSPTLAHDKK

Query:  VPYYRASASEKMASSVIGEVKKKPYADQPRDRLDEAHFQRQSSPIDFPSAHQPQTNTQIDKPTNMVTQPPNLSVRPTPNQNPSPNKPLQPPHRSRTDKMV
           Y    S+K + S   + KK   ++   D  D+    R+            Q +    K +  ++      + P   Q+ S N  +    +S     +
Subjt:  VPYYRASASEKMASSVIGEVKKKPYADQPRDRLDEAHFQRQSSPIDFPSAHQPQTNTQIDKPTNMVTQPPNLSVRPTPNQNPSPNKPLQPPHRSRTDKMV

Query:  PIMFNYKPTLLIKGTKFSTNPTRSSIDSEDLLSSQLCWSMFFAIFNIQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKA
           F  +                              WS                          P   ++ +L +   +K        +  Q   ED  
Subjt:  PIMFNYKPTLLIKGTKFSTNPTRSSIDSEDLLSSQLCWSMFFAIFNIQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKA

Query:  WHSLARLESACLKASSWCTLSKSFVAFSSQDICFNWHSFIFPLTLEFDHMNSITETEIIARDEISEDDFDKEDRLDNPMYLQSEDPSAYLSLLFPWLARH
         H L+  E+   + S    LS      S             PL           E+ I + +    D F+ +    N     S +            A++
Subjt:  WHSLARLESACLKASSWCTLSKSFVAFSSQDICFNWHSFIFPLTLEFDHMNSITETEIIARDEISEDDFDKEDRLDNPMYLQSEDPSAYLSLLFPWLARH

Query:  GMCIMPIPSKQKQTATKKPRNWDREVQNLQSTINYDK----DLGSWKKRALIKNFLKSCNPTVVILQETKSSSIDRKFIKSIWSSRYIGWSSIDAIGSLG
            +   + Q ++A++     + +     S +  D+     L  W K   +K   K  N       +  SSS    +   I S + +  +    +G  G
Subjt:  GMCIMPIPSKQKQTATKKPRNWDREVQNLQSTINYDK----DLGSWKKRALIKNFLKSCNPTVVILQETKSSSIDRKFIKSIWSSRYIGWSSIDAIGSLG

Query:  DILIMWNELILDIVEVVKGNFSLTLNLSLADGSDLWVTGIYSPNAPTERVHFWQELFDLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKGMKSFNKFID
         IL++W++    + ++  GN+S++LN+   +G + W+T +Y P    +R   W EL  L SLC PNW++ GDFNI RW  E +      + M +FN FI 
Subjt:  DILIMWNELILDIVEVVKGNFSLTLNLSLADGSDLWVTGIYSPNAPTERVHFWQELFDLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKGMKSFNKFID

Query:  LTELIDVPLMNGRYTWSN---NRAKTLIDRFLVTDSCVQKFGNAHVRRLARTTSDHYPIKLTLGKEKWGPTSFKFSNFWLSHNSFEKLLSSWWKNHSMVG
        + ELID P +N  +TWSN   N   + +DRFL++      FG    R L R  SDH+PI L   + KWGP  F+ +N  L    F+K   +WW +    G
Subjt:  LTELIDVPLMNGRYTWSN---NRAKTLIDRFLVTDSCVQKFGNAHVRRLARTTSDHYPIKLTLGKEKWGPTSFKFSNFWLSHNSFEKLLSSWWKNHSMVG

Query:  WLGHGFIQKLKAFKSELKQWSATTFGKQDTMEKELLNELNAIDLEEESRPLDEDKYNRRLSIKADLLSLAARDESLWRQRCKSKWLTEGDENTAYFHRYM
        + G+ FIQ L +    +K+W        D  +K LL E++ ID  E    +    + +R+S+K+DLLS+      +W QR + +W   GDEN +YFHR  
Subjt:  WLGHGFIQKLKAFKSELKQWSATTFGKQDTMEKELLNELNAIDLEEESRPLDEDKYNRRLSIKADLLSLAARDESLWRQRCKSKWLTEGDENTAYFHRYM

Query:  AANRRRNTIMELLSSSGNSLVDDASIETEFVSFYSKLFTIKVGQRYLPDIEDWGIIPAYHNDRLEAPFTEEEIHKAVNDLGTNKSPGPNGYTAEFFKKSW
          N+R+N I  +   +G SL     I   F+S +  ++T +  +  L D   W  I       L  PF E EI   +      K+PGP+GYT  F+KK W
Subjt:  AANRRRNTIMELLSSSGNSLVDDASIETEFVSFYSKLFTIKVGQRYLPDIEDWGIIPAYHNDRLEAPFTEEEIHKAVNDLGTNKSPGPNGYTAEFFKKSW

Query:  NILKEDIIRVFDDFFKSGTINASLNETYICL---------------------------------------------------------------------
          LK+D++ VF DF K+G +N ++N T+I L                                                                     
Subjt:  NILKEDIIRVFDDFFKSGTINASLNETYICL---------------------------------------------------------------------

Query:  -------------IPKSGSKICWIHI--------------------------------------------------------------------------
                     I K+  KI W  I                                                                          
Subjt:  -------------IPKSGSKICWIHI--------------------------------------------------------------------------

Query:  ---------LSISHLQFADDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWPNTYLGLPLNGKPKSTSFW
                  +ISHL FADD ++F   +E +L+NL   +  FE+ASGL  N  KS    I I+      +A  FG +T   P  YLG+PL G P+S SFW
Subjt:  ---------LSISHLQFADDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWPNTYLGLPLNGKPKSTSFW

Query:  DPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPYISYPSSRLP
        D  +E I K+L  W  + +SKGGRLTL++A+LS+LP     + + P
Subjt:  DPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPYISYPSSRLP

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein3.9e-13026.45Show/hide
Query:  ITESTRDRSFTLTLKIESQGWLSSCFTDLLSAPLNQKFFKQCRLDEYVLWVEKITNKKGHCAEIARLGVNGGLNKIIIPVGSDKYGWKSFISLL------
        +TE+   ++F++ +      W+      L++ P   +FF + R  E  +W+ K  N KG  AEI R+      + I++P G DK GW SF+S++      
Subjt:  ITESTRDRSFTLTLKIESQGWLSSCFTDLLSAPLNQKFFKQCRLDEYVLWVEKITNKKGHCAEIARLGVNGGLNKIIIPVGSDKYGWKSFISLL------

Query:  ----KDTYKPQ--------PPAIKAKMTY-KEIIELDPYMKEEKNDHPKQQDVLTSSIAQDLRELATFFPESSIIIFRKNFHDEWYEILRVMQQEVSDFG
            + T+ P+        PP    K +Y K + E  P+   + +D     D   SS        ++   E++++I R+ FHD+W++IL+ ++++  +  
Subjt:  ----KDTYKPQ--------PPAIKAKMTY-KEIIELDPYMKEEKNDHPKQQDVLTSSIAQDLRELATFFPESSIIIFRKNFHDEWYEILRVMQQEVSDFG

Query:  SISPIQPDRALLAVEDKEQGRILCNIKGWFKVGTFMVRFEPWNTKNFVKEPKVPSYGGWIKIRNLPIDKWSFQTFQKIGDACGGYMETAKKTLSRMDMME
        + +    ++AL+         +LC  KGW  VG + VRFE W+         +PSYGGW   R +P+  W+  TFQ+IG AC G ++ A++T S  +++E
Subjt:  SISPIQPDRALLAVEDKEQGRILCNIKGWFKVGTFMVRFEPWNTKNFVKEPKVPSYGGWIKIRNLPIDKWSFQTFQKIGDACGGYMETAKKTLSRMDMME

Query:  VSIKVKENVS---PASVCLPSSSTSPITVTVDPFFVADQFIRFISGIHGKAPPTPIVEEPSRADYGPVNLIGASPSQPRSPTLAHDKKVPYYRASASEKM
          IKV+ N S   PA+V +  +  +              F++ ++   GK     ++E         V L G    Q  +     + +   +    SE +
Subjt:  VSIKVKENVS---PASVCLPSSSTSPITVTVDPFFVADQFIRFISGIHGKAPPTPIVEEPSRADYGPVNLIGASPSQPRSPTLAHDKKVPYYRASASEKM

Query:  ASSVIGEVKKKPYADQPRDRLDEAHFQRQSSPIDFPSAHQPQTNTQIDKPTNMVTQPPNLSVRPTPNQNPSPNKPLQPPHRSRTDKMVPIMFNY-KPTLL
        +                 D L  +   R+SS  D PSA      + I KP    T P  L+     + N          H +     + I+       +L
Subjt:  ASSVIGEVKKKPYADQPRDRLDEAHFQRQSSPIDFPSAHQPQTNTQIDKPTNMVTQPPNLSVRPTPNQNPSPNKPLQPPHRSRTDKMVPIMFNY-KPTLL

Query:  IKGTKFSTNPTRSSIDSEDLLSSQLCWSMFFAIFNIQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKAWHSLARLESAC
         KG +      +  I  +   +  L  S     FN     S S K N+      P+          N   +L S    E+ Q+V  +++     + +S+ 
Subjt:  IKGTKFSTNPTRSSIDSEDLLSSQLCWSMFFAIFNIQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKAWHSLARLESAC

Query:  LKASSWCTLSKS-FVAFSSQDICFNWHSFIFPLTLEFDHMNSITETEIIARDEISEDDFDKEDRLDNPMYLQSEDPSAYLSLLFPWLARHGMCIMPIPSK
         + +S    +K  F+    Q +  +  +    L+L  D    + +   +  ++  ED  + ++         +E      + + P      M +    + 
Subjt:  LKASSWCTLSKS-FVAFSSQDICFNWHSFIFPLTLEFDHMNSITETEIIARDEISEDDFDKEDRLDNPMYLQSEDPSAYLSLLFPWLARHGMCIMPIPSK

Query:  QKQTATKKPRNWDR---------EVQNLQSTINYDKDLGSWKKRALIK-----NFLKSCNPTVVILQETKS--SSIDRKFIKSIWSSRYIGWSSIDAIGS
          +   +KP++  +         E +    +  + K L SW K+  +K     +   +   T V+L +  S     +++ IKS+W S  I W + +A GS
Subjt:  QKQTATKKPRNWDR---------EVQNLQSTINYDKDLGSWKKRALIK-----NFLKSCNPTVVILQETKS--SSIDRKFIKSIWSSRYIGWSSIDAIGS

Query:  LGDILIMWNELILDIVEVVKGNFSLTLNLSLADGSDLWVTGIYSPNAPTERVHFWQELFDLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKGMKSFNKF
         G ILI+W+     ++   +G FSL+ N  L + S  W+TG+Y P    ER+HFW EL +L  L    WI+GGD N+ R   E ++    +   +  N F
Subjt:  LGDILIMWNELILDIVEVVKGNFSLTLNLSLADGSDLWVTGIYSPNAPTERVHFWQELFDLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKGMKSFNKF

Query:  IDLTELIDVPLMNGRYTWSNNR---AKTLIDRFLVTDSCVQKFGNAHVRRLARTTSDHYPIKLTLGKEK--WGPTSFKFSNFWLSHNSFEKLLSSWWKNH
        I    LID PL N R+TWSN R     + IDRFL   S    F     R L R+TSDH+P+       K  WGP  F+ ++  LS   F++ +  WW+N 
Subjt:  IDLTELIDVPLMNGRYTWSNNR---AKTLIDRFLVTDSCVQKFGNAHVRRLARTTSDHYPIKLTLGKEK--WGPTSFKFSNFWLSHNSFEKLLSSWWKNH

Query:  SMVGWLGHGFIQKLKAFKSELKQWSATTFGKQDTMEKELLNELNAIDLEEESRPLDEDKYNRRLSIKADLLSLAARDESLWRQRCKSKWLTEGDENTAYF
           G+ G  FIQ+LK+  + +K W           ++ ++ E+++ID +E   PL +++ NRRL++KADL  L+ ++   W QR K  WL EGDEN+++F
Subjt:  SMVGWLGHGFIQKLKAFKSELKQWSATTFGKQDTMEKELLNELNAIDLEEESRPLDEDKYNRRLSIKADLLSLAARDESLWRQRCKSKWLTEGDENTAYF

Query:  HRYMAANRRRNTIMELLSSSGNSLVDDASIETEFVSFYSKLF--TIKVGQRYLPDIEDWGIIPAYHNDRLEAPFTEEEIHKAVNDLGTNKSPGPNGYTAE
        HR  ++ ++R+ I E+    G+    + SI T F+ F+S+++  + K    ++ ++ DW  I +     L APF E EI   +N     K+PGP+G+   
Subjt:  HRYMAANRRRNTIMELLSSSGNSLVDDASIETEFVSFYSKLF--TIKVGQRYLPDIEDWGIIPAYHNDRLEAPFTEEEIHKAVNDLGTNKSPGPNGYTAE

Query:  FFKKSW-------------------------------------------NILKEDIIRVFD----DF-------------FKSGTINASLNETYICLIP-
        FFK  W                                            ILK DI + FD    DF             ++        N TY  +I  
Subjt:  FFKKSW-------------------------------------------NILKEDIIRVFD----DF-------------FKSGTINASLNETYICLIP-

Query:  -------------------------------------KSGSKICWIHI---LSISHLQFADDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFM
                                             +S   I  + +    +ISH+ FADD +LF   ++  L NL   +  FE ASGL +N  KS  +
Subjt:  -------------------------------------KSGSKICWIHI---LSISHLQFADDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFM

Query:  GIGIAPQTVSCLADRFGCKTGGWPNTYLGLPLNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLP
         + ++ +     A  +G      P +YLG+PL G PKS  FW  V +KI+K+L +W    +SKGGRLTLI++TLS+LP
Subjt:  GIGIAPQTVSCLADRFGCKTGGWPNTYLGLPLNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLP

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein1.8e-13024.97Show/hide
Query:  ITESTRDRSFTLTLKIESQGWLSSCFTDLLSAPLNQKFFKQCRLDEYVLWVEKITNKK--GHCAEIARLGVNGGLNKIIIPVGSDKYGWKSFISLLKDTY
        +TE  + +SF++ +  ++  W+ +CF DLL     + FF + R+++  +WV K  NK      AEI R+   G    I++P G D +GWKSF++L+  T+
Subjt:  ITESTRDRSFTLTLKIESQGWLSSCFTDLLSAPLNQKFFKQCRLDEYVLWVEKITNKK--GHCAEIARLGVNGGLNKIIIPVGSDKYGWKSFISLLKDTY

Query:  KPQPPAIKAK--------MTYKEIIELD---------PYMKEEKNDHPKQQDVLTSSIAQDLRELATFFP---------ESSIIIFRKNFHDEWYEILRV
        +   P  + +         T+ +    D           + +   D  K++   TS  +   R  +  F          E ++II R+ FHD+W  I+  
Subjt:  KPQPPAIKAK--------MTYKEIIELD---------PYMKEEKNDHPKQQDVLTSSIAQDLRELATFFP---------ESSIIIFRKNFHDEWYEILRV

Query:  MQQEVSDFGSISPIQPDRALLAVEDKEQGRILCNIK---GWFKVGTFMVRFEPWNTKNFVKEPKVPSYGGWIKIRNLPIDKWSFQTFQKIGDACGGYMET
        ++++     S  P Q D+A+L + + +  ++LC+ K   GW  VG + V+FE W++        +PSYGGW++ R +P+  W++ TFQ IG ACGG+++ 
Subjt:  MQQEVSDFGSISPIQPDRALLAVEDKEQGRILCNIK---GWFKVGTFMVRFEPWNTKNFVKEPKVPSYGGWIKIRNLPIDKWSFQTFQKIGDACGGYMET

Query:  AKKTLSRMDMMEVSIKVKEN---VSPASVCLPSSSTSPITVTVDPFFVADQFIRFISGIHGKAPPTPIVEEPSRADYGPVNLIGASPSQPRSPTLAHDKK
        AK+T+    +++  IKV+ N     PAS+ +  +      VT      A   +     +HG        E                 + P  PT  H   
Subjt:  AKKTLSRMDMMEVSIKVKEN---VSPASVCLPSSSTSPITVTVDPFFVADQFIRFISGIHGKAPPTPIVEEPSRADYGPVNLIGASPSQPRSPTLAHDKK

Query:  VPYYRASASEKMASSVIGEVKKKPYADQPRDRLDEAHFQRQSSPIDFPSAHQPQTNTQIDKPTNMVTQPPNLSVRPTPNQNPSPNKPLQPPHRSRTDKMV
           Y    S+K + S   + KK   ++   D  D+    R+            Q +    K +  ++      + P   Q+ S N  +    +S     +
Subjt:  VPYYRASASEKMASSVIGEVKKKPYADQPRDRLDEAHFQRQSSPIDFPSAHQPQTNTQIDKPTNMVTQPPNLSVRPTPNQNPSPNKPLQPPHRSRTDKMV

Query:  PIMFNYKPTLLIKGTKFSTNPTRSSIDSEDLLSSQLCWSMFFAIFNIQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKA
           F  +                              WS                          P   ++ +L +   +K        +  Q   ED  
Subjt:  PIMFNYKPTLLIKGTKFSTNPTRSSIDSEDLLSSQLCWSMFFAIFNIQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKA

Query:  WHSLARLESACLKASSWCTLSKSFVAFSSQDICFNWHSFIFPLTLEFDHMNSITETEIIARDEISEDDFDKEDRLDNPMYLQSEDPSAYLSLLFPWLARH
         H L+  E+   + S    LS      S             PL           E+ I + +    D F+ +    N     S +            A++
Subjt:  WHSLARLESACLKASSWCTLSKSFVAFSSQDICFNWHSFIFPLTLEFDHMNSITETEIIARDEISEDDFDKEDRLDNPMYLQSEDPSAYLSLLFPWLARH

Query:  GMCIMPIPSKQKQTATKKPRNWDREVQNLQSTINYDK----DLGSWKKRALIKNFLKSCNPTVVILQETKSSSIDRKFIKSIWSSRYIGWSSIDAIGSLG
            +   + Q ++A++     + +     S +  D+     L  W K   +K   K  N       +  SSS    +   I S + +  +    +G  G
Subjt:  GMCIMPIPSKQKQTATKKPRNWDREVQNLQSTINYDK----DLGSWKKRALIKNFLKSCNPTVVILQETKSSSIDRKFIKSIWSSRYIGWSSIDAIGSLG

Query:  DILIMWNELILDIVEVVKGNFSLTLNLSLADGSDLWVTGIYSPNAPTERVHFWQELFDLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKGMKSFNKFID
         IL++W++    + ++  GN+S++LN+   +G + W+T +Y P    +R   W EL  L SLC PNW++ GDFNI RW  E +      + M +FN FI 
Subjt:  DILIMWNELILDIVEVVKGNFSLTLNLSLADGSDLWVTGIYSPNAPTERVHFWQELFDLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKGMKSFNKFID

Query:  LTELIDVPLMNGRYTWSN---NRAKTLIDRFLVTDSCVQKFGNAHVRRLARTTSDHYPIKLTLGKEKWGPTSFKFSNFWLSHNSFEKLLSSWWKNHSMVG
        + ELID P +N  +TWSN   N   + +DRFL++      FG    R L R  SDH+PI L   + KWGP  F+ +N  L    F+K   +WW +    G
Subjt:  LTELIDVPLMNGRYTWSN---NRAKTLIDRFLVTDSCVQKFGNAHVRRLARTTSDHYPIKLTLGKEKWGPTSFKFSNFWLSHNSFEKLLSSWWKNHSMVG

Query:  WLGHGFIQKLKAFKSELKQWSATTFGKQDTMEKELLNELNAIDLEEESRPLDEDKYNRRLSIKADLLSLAARDESLWRQRCKSKWLTEGDENTAYFHRYM
        + G+ FIQ L +    +K+W        D  +K LL E++ ID  E    +    + +R+S+K+DLLS+      +W QR + +W   GDEN +YFHR  
Subjt:  WLGHGFIQKLKAFKSELKQWSATTFGKQDTMEKELLNELNAIDLEEESRPLDEDKYNRRLSIKADLLSLAARDESLWRQRCKSKWLTEGDENTAYFHRYM

Query:  AANRRRNTIMELLSSSGNSLVDDASIETEFVSFYSKLFTIKVGQRYLPDIEDWGIIPAYHNDRLEAPFTEEEIHKAVNDLGTNKSPGPNGYTAEFFKKSW
          N+R+N I  +   +G SL     I   F+S +  ++T +  +  L D   W  I       L  PF E EI   +      K+PGP+GYT  F+KK W
Subjt:  AANRRRNTIMELLSSSGNSLVDDASIETEFVSFYSKLFTIKVGQRYLPDIEDWGIIPAYHNDRLEAPFTEEEIHKAVNDLGTNKSPGPNGYTAEFFKKSW

Query:  NILKEDIIRVFDDFFKSGTINASLNETYICL---------------------------------------------------------------------
          LK+D++ VF DF K+G +N ++N T+I L                                                                     
Subjt:  NILKEDIIRVFDDFFKSGTINASLNETYICL---------------------------------------------------------------------

Query:  -------------IPKSGSKICWIHI--------------------------------------------------------------------------
                     I K+  KI W  I                                                                          
Subjt:  -------------IPKSGSKICWIHI--------------------------------------------------------------------------

Query:  ---------LSISHLQFADDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWPNTYLGLPLNGKPKSTSFW
                  +ISHL FADD ++F   +E +L+NL   +  FE+ASGL  N  KS    I I+      +A  FG +T   P  YLG+PL G P+S SFW
Subjt:  ---------LSISHLQFADDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVSCLADRFGCKTGGWPNTYLGLPLNGKPKSTSFW

Query:  DPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPYISYPSSRLP
        D  +E I K+L  W  + +SKGGRLTL++A+LS+LP     + + P
Subjt:  DPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPYISYPSSRLP

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein6.1e-1922.95Show/hide
Query:  LGSWKKRALIKNFLKSCNPTVVILQETKSSSIDRKFIKSIWSSRYIGWSSI-DAIG---SLGDILIMWNELILDIVEVVK---GNFSLTLNLSLADGSDL
        L S  KR  + +++KS +P+V  +QET  +  D   +K        GW  I  A G     G  +++ ++      ++ +   G++ +          +L
Subjt:  LGSWKKRALIKNFLKSCNPTVVILQETKSSSIDRKFIKSIWSSRYIGWSSI-DAIG---SLGDILIMWNELILDIVEVVK---GNFSLTLNLSLADGSDL

Query:  WVTGIYSPNAPTERVHFWQELFDLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKGMKSFNKFIDLTELIDV----PLMNGRYTW--SNNRAKTLIDRFL
         +  IY+PN    R    Q L DL    + + ++ GDFN      ++S  +   K  +  N  +  T+LID+       +  YT+  + +   + ID  +
Subjt:  WVTGIYSPNAPTERVHFWQELFDLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKGMKSFNKFIDLTELIDV----PLMNGRYTW--SNNRAKTLIDRFL

Query:  VTDSCVQKFGNAHVRRLARTTSDHYPIKLTLGKEKWGPTSFKFSNFWLSHNSFEKLLSSWWKNHSMVGWLGHGFIQKLKAFKSELKQWSATTFGKQDTME
         + + + K     +  +    SDH  IKL L  +    +    S  W  +N    LL+ +W ++ M               K+E+K +  T   K  T +
Subjt:  VTDSCVQKFGNAHVRRLARTTSDHYPIKLTLGKEKWGPTSFKFSNFWLSHNSFEKLLSSWWKNHSMVGWLGHGFIQKLKAFKSELKQWSATTFGKQDTME

Query:  ----------KELLNELNAIDLEEESRPLDE-DKYNRRLSIKADLLSLAARDESLWRQRCK------SKWLTEGDENTAYFH-----------RYMAANR
                  +     LNA   ++E   +D      + L  +    S A+R + + + R +       K L + +E+ ++F            R +   R
Subjt:  ----------KELLNELNAIDLEEESRPLDE-DKYNRRLSIKADLLSLAARDESLWRQRCK------SKWLTEGDENTAYFH-----------RYMAANR

Query:  RRNTIMELLSSSGNSLVDDASIETEFVSFYSKLFTIKV-GQRYLPDIEDWGIIPAYHNDRLEA---PFTEEEIHKAVNDLGTNKSPGPNGYTAEFFKKSW
         +N I  + +  G+   D   I+T    +Y  L+  K+     +    D   +P  + + +E+   P T  EI   +N L T KSPGP+G+TAEF+++  
Subjt:  RRNTIMELLSSSGNSLVDDASIETEFVSFYSKLFTIKV-GQRYLPDIEDWGIIPAYHNDRLEA---PFTEEEIHKAVNDLGTNKSPGPNGYTAEFFKKSW

Query:  NILKEDIIRVFDDFFKSGTINASLNETYICLIPKSG
          L   ++++F    K G +  S  E  I LIPK G
Subjt:  NILKEDIIRVFDDFFKSGTINASLNETYICLIPKSG

P08548 LINE-1 reverse transcriptase homolog2.0e-1423.43Show/hide
Query:  KRALIKNFLKSCNPTVVILQETKSSSIDRKFIKSIWSSRYIGWSSI---DAIGSLGDILIMWNELI----LDIVEVVKGNFSLTLNLSLADGSDLWVTGI
        KR  + ++++   P +  +QE+  +      +K  +  +  GWSSI   +       I I++ + I      I +   G+F      +  D  ++ +  I
Subjt:  KRALIKNFLKSCNPTVVILQETKSSSIDRKFIKSIWSSRYIGWSSI---DAIGSLGDILIMWNELI----LDIVEVVKGNFSLTLNLSLADGSDLWVTGI

Query:  YSPNAPTERVHFWQELFDLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKGMKSFNKFIDLTELIDVPLM----NGRYTWSNNRAKTL--IDRFLVTDSC
        Y+PN    +    + L D+S+L     I+ GDFN      ++S+ K  +K +   N  I   +L D+          YT+ ++   T   ID  L   S 
Subjt:  YSPNAPTERVHFWQELFDLSSLCEPNWIMGGDFNISRWSWEKSNHKPPTKGMKSFNKFIDLTELIDVPLM----NGRYTWSNNRAKTL--IDRFLVTDSC

Query:  VQKFGNAHVRRLARTTSDHYPIKLTLGKEK--------WGPTSFKFSNFWLSHNSFEKLLSSWWKNHSMVGWLGHGFIQKLKAFKSELKQWSATTFGKQD
        + KF    +  +    SDH+ IK+ L   +        W   +    + W+  +  +K ++ + + ++             KA     K  +   F K+ 
Subjt:  VQKFGNAHVRRLARTTSDHYPIKLTLGKEK--------WGPTSFKFSNFWLSHNSFEKLLSSWWKNHSMVGWLGHGFIQKLKAFKSELKQWSATTFGKQD

Query:  TMEK--ELLNELNAIDLEEESRPLDEDKYNRR---LSIKADLLSLAARDESLWRQRCKSK-WLTEGDENTAYFHRYMAANRRRNTIMELLSS--SGNSLV
          E+   L+  L  ++ EE S P    K +RR     I+A+L  +   ++ + +Q  KSK W     E      + +A   R+  +  L+SS  +GN  +
Subjt:  TMEK--ELLNELNAIDLEEESRPLDEDKYNRR---LSIKADLLSLAARDESLWRQRCKSK-WLTEGDENTAYFHRYMAANRRRNTIMELLSS--SGNSLV

Query:  --DDASIETEFVSFYSKLFTIKVGQRYLPDIEDWGIIPAYHNDR--------LEAPFTEEEIHKAVNDLGTNKSPGPNGYTAEFFKKSWNILKEDIIRVF
          D + I+     +Y KL++ K     L +I+ +  + A H  R        L  P +  EI   + +L   KSPGP+G+T+EF++     L   ++ +F
Subjt:  --DDASIETEFVSFYSKLFTIKVGQRYLPDIEDWGIIPAYHNDR--------LEAPFTEEEIHKAVNDLGTNKSPGPNGYTAEFFKKSWNILKEDIIRVF

Query:  DDFFKSGTINASLNETYICLIPKSG
         +  K G +  +  E  I LIPK G
Subjt:  DDFFKSGTINASLNETYICLIPKSG

P11369 LINE-1 retrotransposable element ORF2 protein5.1e-1823.71Show/hide
Query:  LGSWKKRALIKNFLKSCNPTVVILQETKSSSIDRKFIKSIWSSRYIGWSSI---DAIGSLGDILIMWNELILDIVEVVK----GNFSLTLNLSLADGSDL
        L S  KR  + ++L   +PT   LQET     DR ++      R  GW +I   + +     + I+ ++ I    +V+K    G+F L     L +  +L
Subjt:  LGSWKKRALIKNFLKSCNPTVVILQETKSSSIDRKFIKSIWSSRYIGWSSI---DAIGSLGDILIMWNELILDIVEVVK----GNFSLTLNLSLADGSDL

Query:  WVTGIYSPNAPTERVHFWQELFDLSSLCEPNWIMGGDFNI----SRWSWEKSNHKPPTKGMKSFNKFIDLTELIDV--PLMNGRYTWSNNRAKTL--IDR
         +  IY+PNA          L  L +   P+ I+ GDFN        SW++  ++   K +    K +DLT++     P   G YT+ +    T   ID 
Subjt:  WVTGIYSPNAPTERVHFWQELFDLSSLCEPNWIMGGDFNI----SRWSWEKSNHKPPTKGMKSFNKFIDLTELIDV--PLMNGRYTWSNNRAKTL--IDR

Query:  FLVTDSCVQKFGNAHVRRLARTTSDHYPIKLTLGK--EKWGPT-SFKFSNFWLS----HNSFEKLLSSWWKNHSMVGWLGHGFIQKLKAF-KSELKQWSA
         +   + + ++ N  +  +    SDH+ ++L          PT ++K +N  L+        +K +  + + +             +KAF + +L   SA
Subjt:  FLVTDSCVQKFGNAHVRRLARTTSDHYPIKLTLGK--EKWGPT-SFKFSNFWLS----HNSFEKLLSSWWKNHSMVGWLGHGFIQKLKAF-KSELKQWSA

Query:  TTFGKQDTMEKELLNELNAIDLEEESRPLDEDKYNRR---LSIKADLLSLAARDESLWRQRCKSKWLTEGDENTAYFHRYMAANRRRNTIMELLSSSGNS
        +   ++      L   L A++ +E + P    K +RR   + ++ ++  +  R       + +S +  + ++      R    +R +  I ++ +  G+ 
Subjt:  TTFGKQDTMEKELLNELNAIDLEEESRPLDEDKYNRR---LSIKADLLSLAARDESLWRQRCKSKWLTEGDENTAYFHRYMAANRRRNTIMELLSSSGNS

Query:  LVDDASIETEFVSFYSKLFTIKV-GQRYLPDIEDWGIIPAYHNDR---LEAPFTEEEIHKAVNDLGTNKSPGPNGYTAEFFKKSWNILKEDIIRVFDDFF
          D   I+    SFY +L++ K+     +    D   +P  + D+   L +P + +EI   +N L T KSPGP+G++AEF++      KED+I +    F
Subjt:  LVDDASIETEFVSFYSKLFTIKV-GQRYLPDIEDWGIIPAYHNDR---LEAPFTEEEIHKAVNDLGTNKSPGPNGYTAEFFKKSWNILKEDIIRVFDDFF

Query:  K----SGTINASLNETYICLIPK
              GT+  S  E  I LIPK
Subjt:  K----SGTINASLNETYICLIPK

P14381 Transposon TX1 uncharacterized 149 kDa protein2.5e-2023.08Show/hide
Query:  IKNFLKSCNPTVVILQETKSSSIDRKFIKSIWSSR----YIGWSSIDAIGSLGDILIMWNELILDIVEVVKGNFSLTLNLSLAD-GSDLWVTGIYSPNAP
        + +FL+    +V  LQET ++          W  R    ++ W+S   +    D    +   +L    V+ G     L+L + + G    +  +Y+P   
Subjt:  IKNFLKSCNPTVVILQETKSSSIDRKFIKSIWSSR----YIGWSSIDAIGSLGDILIMWNELILDIVEVVKGNFSLTLNLSLAD-GSDLWVTGIYSPNAP

Query:  TERVHFWQELFDLSSLCEPN--WIMGGDFNISRWSWEKSNHKPPTKGMKSFNKFIDLTELIDVPLMNG----RYTWSNNR----AKTLIDRFLVTDSCVQ
         ER  F++ L       + +   I+GGDFN +  + +++  K          + I    L+DV          +T+   R    +++ IDR  ++   + 
Subjt:  TERVHFWQELFDLSSLCEPN--WIMGGDFNISRWSWEKSNHKPPTKGMKSFNKFIDLTELIDVPLMNG----RYTWSNNR----AKTLIDRFLVTDSCVQ

Query:  KFGNAHVRRLARTTSDHYPIKLTLGKEKWGPTSFKFSNFWLSHNSFEKLLSSWWKNHSMVGWLGHGFIQKLKAFKSE---LKQW--------------SA
        +  ++ +R    +  +   +++++         + F+N  L    F K +   W+     GW         +AF+ E   L QW                
Subjt:  KFGNAHVRRLARTTSDHYPIKLTLGKEKWGPTSFKFSNFWLSHNSFEKLLSSWWKNHSMVGWLGHGFIQKLKAFKSE---LKQW--------------SA

Query:  TTFGKQDTMEKELLNELNAIDLEEE-SRPLDEDKYNRRLSIKADLLSLAARDESLWRQRCKSKWLTEGDENTAYFHRYMAANRRRNTIMELLSSSGNSLV
         +   Q   E E LN    +DLE+  S   D+      L  K  L ++  R       R + + L + D  + +F+        R  I  L +  G  L 
Subjt:  TTFGKQDTMEKELLNELNAIDLEEE-SRPLDEDKYNRRLSIKADLLSLAARDESLWRQRCKSKWLTEGDENTAYFHRYMAANRRRNTIMELLSSSGNSLV

Query:  DDASIETEFVSFYSKLFTIKVGQRYLPDI--EDWGIIPAY---HNDRLEAPFTEEEIHKAVNDLGTNKSPGPNGYTAEFFKKSWNILKEDIIRVFDDFFK
        D  +I     SFY  LF+        PD   E W  +P       +RLE P T +E+ +A+  +  NKSPG +G T EFF+  W+ L  D  RV  + FK
Subjt:  DDASIETEFVSFYSKLFTIKVGQRYLPDI--EDWGIIPAY---HNDRLEAPFTEEEIHKAVNDLGTNKSPGPNGYTAEFFKKSWNILKEDIIRVFDDFFK

Query:  SGTINASLNETYICLIPKSG
         G +  S     + L+PK G
Subjt:  SGTINASLNETYICLIPKSG

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.4e-3129.69Show/hide
Query:  PTKGMKSFNKFIDLTELIDVPLMNGRYTWSNNRAKTLI----DRFLVTDSCVQKFGNAHVRRLARTTSDHYPIKLTL-GKEKWGPTSFKFSNFWLSHNSF
        P +G++ F   +  ++L+D+P     YTWSN++    I    DR +        F +A         SDH P  + L    K     F++ +F  +H +F
Subjt:  PTKGMKSFNKFIDLTELIDVPLMNGRYTWSNNRAKTLI----DRFLVTDSCVQKFGNAHVRRLARTTSDHYPIKLTL-GKEKWGPTSFKFSNFWLSHNSF

Query:  EKLLSSWWKNHSMVGWLGHGFIQKLKAFKSELKQWSATTFGKQDTMEKELLNELNAIDLEEESRPLDEDKYNRRLSIKADLLSLAARDESLWRQRCKSKW
           L+  W+    VG       + LKA K   K  +   FG      KE L+ L +I  +  + P D   +      +      AA  ES +RQ+ + KW
Subjt:  EKLLSSWWKNHSMVGWLGHGFIQKLKAFKSELKQWSATTFGKQDTMEKELLNELNAIDLEEESRPLDEDKYNRRLSIKADLLSLAARDESLWRQRCKSKW

Query:  LTEGDENTAYFHRYMAANRRRNTIMELLSSSGNSLVDDASIETEFVSFYSKLFTIKVGQRYLPD----IEDWGIIPAYHND----RLEAPFTEEEIHKAV
        L +GD NT +FH+ + AN+ +N I  L       + +   ++   V++Y+ L          PD    I+D  I P   ND    RL A  +++EI  AV
Subjt:  LTEGDENTAYFHRYMAANRRRNTIMELLSSSGNSLVDDASIETEFVSFYSKLFTIKVGQRYLPD----IEDWGIIPAYHND----RLEAPFTEEEIHKAV

Query:  NDLGTNKSPGPNGYTAEFFKKSWNILKEDIIRVFDDFFKSGTINASLNETYICLIPK
          +  NK+PGP+ +TAEFF +SW ++K+  I    +FF++G +    N T I LIPK
Subjt:  NDLGTNKSPGPNGYTAEFFKKSWNILKEDIIRVFDDFFKSGTINASLNETYICLIPK

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein8.4e-0837.21Show/hide
Query:  GIAPQTVSCLADRFGCKTGGWPNTYLGLPLNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPYISYPSSRLP
        G+     + +   F   +G  P  YLGLPL  K  +TS + P+VEKI  R+  W + HLS  GRL LI + + +L      + RLP
Subjt:  GIAPQTVSCLADRFGCKTGGWPNTYLGLPLNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPYISYPSSRLP

AT3G49890.1 unknown protein3.5e-0628.68Show/hide
Query:  QKLLENILETTLEVVEHSIAIPCDDSKPSEGGIRLFKNAPVGVVFDHVDRSKG------LRITESTRDRSFTLTLKIESQGWLSSCFTDLLSAPLNQKFF
        + LL  ++E TL+ VE  + IP +D   ++ G+RLFK    G+VFDHVD  +G      LR  +     S     +++S   ++   +D+L+A +     
Subjt:  QKLLENILETTLEVVEHSIAIPCDDSKPSEGGIRLFKNAPVGVVFDHVDRSKG------LRITESTRDRSFTLTLKIESQGWLSSCFTDLLSAPLNQKFF

Query:  KQCRLD-EYVLWVEKITNKKGHCAEIARL
           RLD + V   +K   ++   AE+ ++
Subjt:  KQCRLD-EYVLWVEKITNKKGHCAEIARL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGCACAGAAGCTGTTGGAGAATATTTTGGAAACTACTCTAGAGGTGGTAGAACATTCCATAGCTATTCCTTGTGATGATTCCAAGCCCAGTGAAGGTGGAATTCG
TTTGTTTAAAAATGCTCCCGTTGGGGTTGTGTTTGATCACGTGGATAGAAGCAAAGGGCTGCGTATCACAGAATCCACTCGAGACCGATCTTTCACCCTCACTCTTAAAA
TAGAATCACAAGGCTGGCTTTCATCCTGTTTCACCGACCTCCTGTCAGCTCCTTTAAACCAGAAATTCTTCAAGCAATGTCGACTTGATGAATATGTCTTATGGGTTGAA
AAGATAACCAACAAAAAAGGCCATTGCGCCGAAATAGCCAGACTCGGAGTAAATGGTGGTCTAAACAAGATCATCATCCCTGTTGGTTCTGATAAATATGGGTGGAAAAG
TTTTATCTCTCTTCTTAAAGACACCTACAAACCACAACCCCCCGCCATAAAAGCCAAAATGACTTATAAAGAGATCATAGAGCTTGACCCGTATATGAAGGAGGAGAAGA
ATGACCACCCTAAGCAACAAGATGTACTTACCTCTAGCATTGCTCAAGACCTGCGTGAACTGGCGACCTTCTTTCCTGAATCTTCCATAATCATCTTTCGGAAGAACTTT
CATGATGAATGGTATGAGATTCTTAGAGTTATGCAACAAGAGGTGTCCGATTTTGGCTCCATTAGTCCTATCCAACCAGATAGAGCTCTTCTTGCTGTCGAAGACAAGGA
ACAAGGCCGAATCCTCTGTAATATCAAAGGGTGGTTTAAAGTTGGAACTTTCATGGTTAGATTTGAACCATGGAATACAAAAAATTTCGTTAAGGAACCAAAAGTTCCAT
CATACGGAGGATGGATCAAGATTAGAAACTTACCTATTGATAAATGGTCCTTTCAGACATTTCAAAAGATAGGCGATGCTTGTGGGGGATACATGGAAACAGCCAAAAAA
ACGTTATCCCGTATGGATATGATGGAGGTAAGCATAAAGGTTAAAGAGAATGTAAGCCCAGCTTCAGTATGCCTTCCATCGTCTTCTACCTCCCCCATCACAGTCACGGT
GGATCCCTTCTTCGTGGCCGACCAGTTCATCCGTTTTATATCAGGGATCCATGGCAAAGCTCCACCAACGCCGATCGTTGAGGAGCCATCACGCGCCGATTATGGCCCCG
TGAACTTGATAGGCGCGTCCCCTTCACAACCAAGATCACCCACTCTCGCGCATGACAAGAAAGTACCGTATTATCGTGCATCGGCCAGTGAAAAGATGGCTTCCTCAGTC
ATAGGTGAAGTAAAAAAGAAACCCTATGCTGACCAACCACGTGACAGACTTGATGAGGCCCATTTCCAGAGACAGTCGAGCCCAATTGACTTCCCTTCAGCCCACCAACC
CCAGACCAATACTCAAATTGACAAACCCACCAATATGGTAACCCAACCGCCCAACTTATCGGTTCGACCAACACCCAACCAAAACCCAAGCCCTAATAAACCGCTTCAGC
CACCGCACAGATCGAGAACCGATAAAATGGTTCCGATTATGTTCAATTACAAACCGACTCTCCTCATTAAGGGCACGAAATTCTCGACAAATCCTACCAGATCCTCCATC
GATTCTGAAGATCTCCTATCCTCCCAACTTTGTTGGTCAATGTTCTTTGCTATTTTCAATATACAGTGGGTTTTTTCAAATTCCGTAAAAGAGAATGTGCTTCAACTGCT
TATTGGTCCTTCTTTTTCTTCAAGACCGAGATTATTATGGATTAATGGTGTTAAAGCTTTGATATCAGAAATTTGGTTGGAAAGAAATCAGAGGGTTTTTGAAGATAAAG
CGTGGCATTCTTTAGCTCGTCTGGAATCAGCTTGCTTAAAGGCTTCTTCCTGGTGCACTCTTTCTAAATCTTTTGTAGCTTTCTCTTCACAGGATATTTGTTTTAATTGG
CATTCTTTTATTTTTCCCCTGACACTAGAGTTCGATCACATGAACTCGATCACTGAGACTGAGATTATCGCCAGAGATGAAATCAGTGAAGATGATTTTGACAAAGAGGA
TAGGCTGGATAACCCAATGTATTTGCAATCAGAAGACCCATCGGCATACTTATCATTATTATTTCCTTGGTTGGCTAGGCATGGAATGTGTATCATGCCCATACCAAGCA
AACAGAAACAAACGGCTACTAAAAAGCCAAGGAATTGGGATCGAGAAGTTCAAAATCTTCAATCCACTATCAATTACGACAAAGACCTTGGCTCATGGAAGAAACGAGCC
CTTATTAAGAACTTCCTCAAAAGTTGCAACCCGACCGTTGTTATTCTCCAAGAAACTAAATCCTCATCGATTGACAGGAAATTCATCAAGTCTATATGGAGTTCTCGATA
CATCGGTTGGTCCTCCATTGATGCCATTGGATCATTAGGTGACATCCTCATCATGTGGAATGAACTTATCCTCGATATTGTTGAAGTGGTCAAAGGTAACTTCTCTCTGA
CTCTAAACCTCTCTTTGGCTGATGGTTCCGATCTTTGGGTTACAGGTATTTATAGTCCTAATGCTCCCACAGAGAGAGTTCATTTCTGGCAGGAGCTTTTTGATTTATCC
TCCCTTTGTGAGCCGAATTGGATTATGGGTGGTGACTTTAATATTTCCAGATGGTCATGGGAGAAATCCAATCATAAACCTCCCACTAAAGGCATGAAGAGCTTTAACAA
GTTTATTGACTTGACTGAGTTGATAGATGTTCCCTTGATGAATGGTAGATATACTTGGTCCAATAACCGGGCCAAAACATTGATTGACCGATTCTTGGTAACAGATAGTT
GTGTTCAAAAGTTTGGCAATGCTCATGTTCGTCGTCTAGCTCGCACCACATCTGACCACTATCCCATTAAGCTTACTCTTGGCAAAGAAAAATGGGGCCCGACATCATTC
AAATTCTCTAATTTCTGGTTGTCCCACAATTCCTTCGAGAAGCTGTTATCATCATGGTGGAAAAACCACTCTATGGTAGGATGGCTGGGTCATGGTTTCATTCAAAAGTT
AAAGGCTTTCAAATCTGAATTAAAACAGTGGAGTGCTACAACCTTTGGCAAACAGGACACGATGGAAAAAGAATTACTTAATGAGTTAAATGCCATCGATTTAGAGGAAG
AATCAAGACCTTTGGATGAAGACAAGTACAACCGAAGGTTATCTATCAAAGCTGACCTTCTATCTCTAGCTGCCCGTGATGAATCTCTATGGAGACAAAGATGCAAATCC
AAGTGGCTTACTGAAGGAGATGAAAATACAGCATATTTCCACAGATACATGGCTGCTAACAGAAGAAGAAATACTATTATGGAGCTTTTATCTAGTTCTGGAAACAGTCT
GGTGGATGATGCTAGCATTGAAACTGAATTTGTTAGCTTTTACAGCAAGCTATTCACTATAAAGGTGGGACAAAGATACTTGCCTGATATTGAAGATTGGGGTATTATTC
CAGCATACCATAATGATAGATTGGAAGCTCCTTTTACTGAAGAGGAAATCCATAAAGCTGTCAACGATTTGGGAACCAACAAATCCCCAGGACCGAATGGTTACACTGCC
GAATTCTTTAAAAAATCATGGAACATTCTAAAGGAAGACATTATAAGAGTATTCGATGATTTTTTTAAGAGTGGCACTATTAACGCTAGCCTCAACGAGACATATATTTG
CCTCATTCCAAAAAGTGGGAGCAAAATCTGTTGGATCCATATTCTATCCATCTCGCATCTCCAATTTGCCGATGACACTATCCTTTTCTCATCGCACGACGAATCTCACC
TCGATAATCTTTTCAGTGTTATCAAGCAATTCGAGGAAGCTTCTGGGCTGAATGTAAATTGTCATAAATCTGAATTTATGGGTATTGGTATTGCTCCACAAACAGTTTCT
TGCCTTGCAGATCGATTTGGTTGTAAAACAGGAGGATGGCCGAACACCTACCTTGGCTTGCCGTTGAATGGTAAACCAAAATCCACATCTTTTTGGGATCCAGTGGTCGA
GAAGATTGAAAAGAGGCTCCTTTCATGGGGCTCTACTCATCTTTCTAAAGGAGGGAGGCTCACTCTAATACAGGCTACCTTGTCTAACCTTCCATATATTTCCTATCCCT
CTTCAAGGCTCCCACAAAGGTTATCAACAAGATTGAGAAGCTCTTTCGGAACTATCTATGGAGAGGCAACAGCGAATCTAAAGGCATCCATCTATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATGCACAGAAGCTGTTGGAGAATATTTTGGAAACTACTCTAGAGGTGGTAGAACATTCCATAGCTATTCCTTGTGATGATTCCAAGCCCAGTGAAGGTGGAATTCG
TTTGTTTAAAAATGCTCCCGTTGGGGTTGTGTTTGATCACGTGGATAGAAGCAAAGGGCTGCGTATCACAGAATCCACTCGAGACCGATCTTTCACCCTCACTCTTAAAA
TAGAATCACAAGGCTGGCTTTCATCCTGTTTCACCGACCTCCTGTCAGCTCCTTTAAACCAGAAATTCTTCAAGCAATGTCGACTTGATGAATATGTCTTATGGGTTGAA
AAGATAACCAACAAAAAAGGCCATTGCGCCGAAATAGCCAGACTCGGAGTAAATGGTGGTCTAAACAAGATCATCATCCCTGTTGGTTCTGATAAATATGGGTGGAAAAG
TTTTATCTCTCTTCTTAAAGACACCTACAAACCACAACCCCCCGCCATAAAAGCCAAAATGACTTATAAAGAGATCATAGAGCTTGACCCGTATATGAAGGAGGAGAAGA
ATGACCACCCTAAGCAACAAGATGTACTTACCTCTAGCATTGCTCAAGACCTGCGTGAACTGGCGACCTTCTTTCCTGAATCTTCCATAATCATCTTTCGGAAGAACTTT
CATGATGAATGGTATGAGATTCTTAGAGTTATGCAACAAGAGGTGTCCGATTTTGGCTCCATTAGTCCTATCCAACCAGATAGAGCTCTTCTTGCTGTCGAAGACAAGGA
ACAAGGCCGAATCCTCTGTAATATCAAAGGGTGGTTTAAAGTTGGAACTTTCATGGTTAGATTTGAACCATGGAATACAAAAAATTTCGTTAAGGAACCAAAAGTTCCAT
CATACGGAGGATGGATCAAGATTAGAAACTTACCTATTGATAAATGGTCCTTTCAGACATTTCAAAAGATAGGCGATGCTTGTGGGGGATACATGGAAACAGCCAAAAAA
ACGTTATCCCGTATGGATATGATGGAGGTAAGCATAAAGGTTAAAGAGAATGTAAGCCCAGCTTCAGTATGCCTTCCATCGTCTTCTACCTCCCCCATCACAGTCACGGT
GGATCCCTTCTTCGTGGCCGACCAGTTCATCCGTTTTATATCAGGGATCCATGGCAAAGCTCCACCAACGCCGATCGTTGAGGAGCCATCACGCGCCGATTATGGCCCCG
TGAACTTGATAGGCGCGTCCCCTTCACAACCAAGATCACCCACTCTCGCGCATGACAAGAAAGTACCGTATTATCGTGCATCGGCCAGTGAAAAGATGGCTTCCTCAGTC
ATAGGTGAAGTAAAAAAGAAACCCTATGCTGACCAACCACGTGACAGACTTGATGAGGCCCATTTCCAGAGACAGTCGAGCCCAATTGACTTCCCTTCAGCCCACCAACC
CCAGACCAATACTCAAATTGACAAACCCACCAATATGGTAACCCAACCGCCCAACTTATCGGTTCGACCAACACCCAACCAAAACCCAAGCCCTAATAAACCGCTTCAGC
CACCGCACAGATCGAGAACCGATAAAATGGTTCCGATTATGTTCAATTACAAACCGACTCTCCTCATTAAGGGCACGAAATTCTCGACAAATCCTACCAGATCCTCCATC
GATTCTGAAGATCTCCTATCCTCCCAACTTTGTTGGTCAATGTTCTTTGCTATTTTCAATATACAGTGGGTTTTTTCAAATTCCGTAAAAGAGAATGTGCTTCAACTGCT
TATTGGTCCTTCTTTTTCTTCAAGACCGAGATTATTATGGATTAATGGTGTTAAAGCTTTGATATCAGAAATTTGGTTGGAAAGAAATCAGAGGGTTTTTGAAGATAAAG
CGTGGCATTCTTTAGCTCGTCTGGAATCAGCTTGCTTAAAGGCTTCTTCCTGGTGCACTCTTTCTAAATCTTTTGTAGCTTTCTCTTCACAGGATATTTGTTTTAATTGG
CATTCTTTTATTTTTCCCCTGACACTAGAGTTCGATCACATGAACTCGATCACTGAGACTGAGATTATCGCCAGAGATGAAATCAGTGAAGATGATTTTGACAAAGAGGA
TAGGCTGGATAACCCAATGTATTTGCAATCAGAAGACCCATCGGCATACTTATCATTATTATTTCCTTGGTTGGCTAGGCATGGAATGTGTATCATGCCCATACCAAGCA
AACAGAAACAAACGGCTACTAAAAAGCCAAGGAATTGGGATCGAGAAGTTCAAAATCTTCAATCCACTATCAATTACGACAAAGACCTTGGCTCATGGAAGAAACGAGCC
CTTATTAAGAACTTCCTCAAAAGTTGCAACCCGACCGTTGTTATTCTCCAAGAAACTAAATCCTCATCGATTGACAGGAAATTCATCAAGTCTATATGGAGTTCTCGATA
CATCGGTTGGTCCTCCATTGATGCCATTGGATCATTAGGTGACATCCTCATCATGTGGAATGAACTTATCCTCGATATTGTTGAAGTGGTCAAAGGTAACTTCTCTCTGA
CTCTAAACCTCTCTTTGGCTGATGGTTCCGATCTTTGGGTTACAGGTATTTATAGTCCTAATGCTCCCACAGAGAGAGTTCATTTCTGGCAGGAGCTTTTTGATTTATCC
TCCCTTTGTGAGCCGAATTGGATTATGGGTGGTGACTTTAATATTTCCAGATGGTCATGGGAGAAATCCAATCATAAACCTCCCACTAAAGGCATGAAGAGCTTTAACAA
GTTTATTGACTTGACTGAGTTGATAGATGTTCCCTTGATGAATGGTAGATATACTTGGTCCAATAACCGGGCCAAAACATTGATTGACCGATTCTTGGTAACAGATAGTT
GTGTTCAAAAGTTTGGCAATGCTCATGTTCGTCGTCTAGCTCGCACCACATCTGACCACTATCCCATTAAGCTTACTCTTGGCAAAGAAAAATGGGGCCCGACATCATTC
AAATTCTCTAATTTCTGGTTGTCCCACAATTCCTTCGAGAAGCTGTTATCATCATGGTGGAAAAACCACTCTATGGTAGGATGGCTGGGTCATGGTTTCATTCAAAAGTT
AAAGGCTTTCAAATCTGAATTAAAACAGTGGAGTGCTACAACCTTTGGCAAACAGGACACGATGGAAAAAGAATTACTTAATGAGTTAAATGCCATCGATTTAGAGGAAG
AATCAAGACCTTTGGATGAAGACAAGTACAACCGAAGGTTATCTATCAAAGCTGACCTTCTATCTCTAGCTGCCCGTGATGAATCTCTATGGAGACAAAGATGCAAATCC
AAGTGGCTTACTGAAGGAGATGAAAATACAGCATATTTCCACAGATACATGGCTGCTAACAGAAGAAGAAATACTATTATGGAGCTTTTATCTAGTTCTGGAAACAGTCT
GGTGGATGATGCTAGCATTGAAACTGAATTTGTTAGCTTTTACAGCAAGCTATTCACTATAAAGGTGGGACAAAGATACTTGCCTGATATTGAAGATTGGGGTATTATTC
CAGCATACCATAATGATAGATTGGAAGCTCCTTTTACTGAAGAGGAAATCCATAAAGCTGTCAACGATTTGGGAACCAACAAATCCCCAGGACCGAATGGTTACACTGCC
GAATTCTTTAAAAAATCATGGAACATTCTAAAGGAAGACATTATAAGAGTATTCGATGATTTTTTTAAGAGTGGCACTATTAACGCTAGCCTCAACGAGACATATATTTG
CCTCATTCCAAAAAGTGGGAGCAAAATCTGTTGGATCCATATTCTATCCATCTCGCATCTCCAATTTGCCGATGACACTATCCTTTTCTCATCGCACGACGAATCTCACC
TCGATAATCTTTTCAGTGTTATCAAGCAATTCGAGGAAGCTTCTGGGCTGAATGTAAATTGTCATAAATCTGAATTTATGGGTATTGGTATTGCTCCACAAACAGTTTCT
TGCCTTGCAGATCGATTTGGTTGTAAAACAGGAGGATGGCCGAACACCTACCTTGGCTTGCCGTTGAATGGTAAACCAAAATCCACATCTTTTTGGGATCCAGTGGTCGA
GAAGATTGAAAAGAGGCTCCTTTCATGGGGCTCTACTCATCTTTCTAAAGGAGGGAGGCTCACTCTAATACAGGCTACCTTGTCTAACCTTCCATATATTTCCTATCCCT
CTTCAAGGCTCCCACAAAGGTTATCAACAAGATTGAGAAGCTCTTTCGGAACTATCTATGGAGAGGCAACAGCGAATCTAAAGGCATCCATCTATTGA
Protein sequenceShow/hide protein sequence
MDAQKLLENILETTLEVVEHSIAIPCDDSKPSEGGIRLFKNAPVGVVFDHVDRSKGLRITESTRDRSFTLTLKIESQGWLSSCFTDLLSAPLNQKFFKQCRLDEYVLWVE
KITNKKGHCAEIARLGVNGGLNKIIIPVGSDKYGWKSFISLLKDTYKPQPPAIKAKMTYKEIIELDPYMKEEKNDHPKQQDVLTSSIAQDLRELATFFPESSIIIFRKNF
HDEWYEILRVMQQEVSDFGSISPIQPDRALLAVEDKEQGRILCNIKGWFKVGTFMVRFEPWNTKNFVKEPKVPSYGGWIKIRNLPIDKWSFQTFQKIGDACGGYMETAKK
TLSRMDMMEVSIKVKENVSPASVCLPSSSTSPITVTVDPFFVADQFIRFISGIHGKAPPTPIVEEPSRADYGPVNLIGASPSQPRSPTLAHDKKVPYYRASASEKMASSV
IGEVKKKPYADQPRDRLDEAHFQRQSSPIDFPSAHQPQTNTQIDKPTNMVTQPPNLSVRPTPNQNPSPNKPLQPPHRSRTDKMVPIMFNYKPTLLIKGTKFSTNPTRSSI
DSEDLLSSQLCWSMFFAIFNIQWVFSNSVKENVLQLLIGPSFSSRPRLLWINGVKALISEIWLERNQRVFEDKAWHSLARLESACLKASSWCTLSKSFVAFSSQDICFNW
HSFIFPLTLEFDHMNSITETEIIARDEISEDDFDKEDRLDNPMYLQSEDPSAYLSLLFPWLARHGMCIMPIPSKQKQTATKKPRNWDREVQNLQSTINYDKDLGSWKKRA
LIKNFLKSCNPTVVILQETKSSSIDRKFIKSIWSSRYIGWSSIDAIGSLGDILIMWNELILDIVEVVKGNFSLTLNLSLADGSDLWVTGIYSPNAPTERVHFWQELFDLS
SLCEPNWIMGGDFNISRWSWEKSNHKPPTKGMKSFNKFIDLTELIDVPLMNGRYTWSNNRAKTLIDRFLVTDSCVQKFGNAHVRRLARTTSDHYPIKLTLGKEKWGPTSF
KFSNFWLSHNSFEKLLSSWWKNHSMVGWLGHGFIQKLKAFKSELKQWSATTFGKQDTMEKELLNELNAIDLEEESRPLDEDKYNRRLSIKADLLSLAARDESLWRQRCKS
KWLTEGDENTAYFHRYMAANRRRNTIMELLSSSGNSLVDDASIETEFVSFYSKLFTIKVGQRYLPDIEDWGIIPAYHNDRLEAPFTEEEIHKAVNDLGTNKSPGPNGYTA
EFFKKSWNILKEDIIRVFDDFFKSGTINASLNETYICLIPKSGSKICWIHILSISHLQFADDTILFSSHDESHLDNLFSVIKQFEEASGLNVNCHKSEFMGIGIAPQTVS
CLADRFGCKTGGWPNTYLGLPLNGKPKSTSFWDPVVEKIEKRLLSWGSTHLSKGGRLTLIQATLSNLPYISYPSSRLPQRLSTRLRSSFGTIYGEATANLKASIY