; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG07G006100 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG07G006100
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCG_Chr07:10124141..10130626
RNA-Seq ExpressionClCG07G006100
SyntenyClCG07G006100
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN72141.1 hypothetical protein VITISV_017108 [Vitis vinifera]7.8e-23538.92Show/hide
Query:  IGYLIGEKITLSPDDPLFIVWDAENSMVMTWLVNSMVKDISSNCMCYITAKELWDSVTQMYSDLGNQSQVFELNLKLELDLFDTYEWKSTDDQKNYLKTV
        +GYL GEK   + DDP + +WDAENSM                                                                D +++ KT+
Subjt:  IGYLIGEKITLSPDDPLFIVWDAENSMVMTWLVNSMVKDISSNCMCYITAKELWDSVTQMYSDLGNQSQVFELNLKLELDLFDTYEWKSTDDQKNYLKTV

Query:  EDGSIYKLLASLNVEFDEVRGRILGKSTLPNINDVFSKVRREESRRNVMIGKKAIDSA-ESSALVIENTAM-----------------------------
        ED  I+K L  LNVEFDEVR RI+ +  LP+I + FS+VRREES+RNVM+GKK    A E S LV                                   
Subjt:  EDGSIYKLLASLNVEFDEVRGRILGKSTLPNINDVFSKVRREESRRNVMIGKKAIDSA-ESSALVIENTAM-----------------------------

Query:  -------KASNQSNETHDKPRHA--SNANIVDSNPL-KEQIDQILKLLKSN-SLGNPSFSLAQTGNSRQALSC-LNSSPWIIDFGAADHMTS--------
               K +N   +T DKP  A    AN  +++    EQ++ +L LLKSN + G  S SLA TGN   ALSC   S+PWIID GA+DHMT+        
Subjt:  -------KASNQSNETHDKPRHA--SNANIVDSNPL-KEQIDQILKLLKSN-SLGNPSFSLAQTGNSRQALSC-LNSSPWIIDFGAADHMTS--------

Query:  ---------------------KGIIPLSTKLILRFVLHVPQLACNLL------------------------SDSGEMIEHARMINGLYYFDEVSTSNKKI
                             KG+I +S  + L+FVLHVP+L CNLL                          S + I  ARMINGLYYF++   SNK  
Subjt:  ---------------------KGIIPLSTKLILRFVLHVPQLACNLL------------------------SDSGEMIEHARMINGLYYFDEVSTSNKKI

Query:  QGLSSVSSLSVQETIMLWHRRLGHPNFVYLKYLFPDLFKGIDCSVFQCEDCIFAKHHQSTFSPKSYKSSSPFYLIHTDVWGPSKVLTKNGKRWFVTFIDD
        QGLSS+SSL V++ IM+WH +LG P+F YLK+LFP LF+ +D   FQCE C+ AK  + T+  K Y +S PFYL H+DVWGPSKV T +GK+WFVTFIDD
Subjt:  QGLSSVSSLSVQETIMLWHRRLGHPNFVYLKYLFPDLFKGIDCSVFQCEDCIFAKHHQSTFSPKSYKSSSPFYLIHTDVWGPSKVLTKNGKRWFVTFIDD

Query:  HTRLTWLYLLTKKSNVKEVFVRFYKMIETQFQAKIRILHSDNEAKFFNEPLTTFLHDKHIVHQATCCNNPQQNGIAKRKYRHLLEVSRALMFSMHVPTYL
        HTRL W+YL+ +KS V+ +F  FYKMIE QFQ KI IL SDN  K+FN+ L TF + K I+HQ++C + PQQNGIA+RK +HLLEV+RA+MF M++P YL
Subjt:  HTRLTWLYLLTKKSNVKEVFVRFYKMIETQFQAKIRILHSDNEAKFFNEPLTTFLHDKHIVHQATCCNNPQQNGIAKRKYRHLLEVSRALMFSMHVPTYL

Query:  WGDVGLTAAYLINRIPTK----------------------------------------------------------------------------------
        WGD  LTA+YLINR+PTK                                                                                  
Subjt:  WGDVGLTAAYLINRIPTK----------------------------------------------------------------------------------

Query:  -------------GESSLVEENFWD-TSPLPNII-----SPEIMSFSPSIPSVENSPAGGETLQIDLIGRNPKLQFHTRRNTTQRDRNQTVELTQDQSDT
                     GE  LVE NFW+   P P++I       E     P     E   +  E L++     N +   ++R+  + R ++Q +     Q   
Subjt:  -------------GESSLVEENFWD-TSPLPNII-----SPEIMSFSPSIPSVENSPAGGETLQIDLIGRNPKLQFHTRRNTTQRDRNQTVELTQDQSDT

Query:  LVNDPEN-------------------------------PGMSFSPSSLNMLPDVP----DFDIPIAHRKGGCVGNLLTQNYGCVGILERKYPSYPWTYIN
        L N   N                               P +S     L +   VP    D D+PIA RKG            C   L  KY SY     +
Subjt:  LVNDPEN-------------------------------PGMSFSPSSLNMLPDVP----DFDIPIAHRKGGCVGNLLTQNYGCVGILERKYPSYPWTYIN

Query:  KAAENQIRILRDCLEIDM---LDMQLEGKKLLLNLSKEFLNDSGVFSRRQPETNSSKEKIDLPEDKKAVGCKWVFTIKCNADGIIERYKARLVAKGFTQT
          ++N      +  ++ +   +   L+     L + KE             + N + E +DLP +KK VGCKWVFTIK  ADG +ERYKARLVAKGFTQT
Subjt:  KAAENQIRILRDCLEIDM---LDMQLEGKKLLLNLSKEFLNDSGVFSRRQPETNSSKEKIDLPEDKKAVGCKWVFTIKCNADGIIERYKARLVAKGFTQT

Query:  YGIDYQKTFAPVAKIKSIIILLSIAVNFDWPLYQLDVKNAFLNGELEEEVFMDLPLGFEVDLGINK-------------SPRAWFERFRKAVTSYGFSQS
        YGIDYQ+TFAPVAKI SI +LLS+ VN +WPL+QLDVKNAFLNG+LEEEVFM  P  FE   G+ K             SPRAWFERF K +  YG++QS
Subjt:  YGIDYQKTFAPVAKIKSIIILLSIAVNFDWPLYQLDVKNAFLNGELEEEVFMDLPLGFEVDLGINK-------------SPRAWFERFRKAVTSYGFSQS

Query:  QADHTMFYQHTENDKVVVLIVYVDDIILTGNDETGMS----------IIKDLGSLKYFLGMEFARSKSGILVNQRKYILDLLKKTGLLGCRIVETPIEKG
        QA+HTMFY+H+   KVV+LIVYVDDI+LTG+D   +            IKDLG+LKYFLGMEFARSK GI VNQRKY+ +LL +  +LGC+  E PIE  
Subjt:  QADHTMFYQHTENDKVVVLIVYVDDIILTGNDETGMS----------IIKDLGSLKYFLGMEFARSKSGILVNQRKYILDLLKKTGLLGCRIVETPIEKG

Query:  KVLETCGKTNIP--------------LSHT-PDIAFVVSMV----------------------------------------EVYTDADWAGSTTDRRSTS
          L+     N+               LSHT PDIAF VSMV                                        E YT+ADWAGS  DRRSTS
Subjt:  KVLETCGKTNIP--------------LSHT-PDIAFVVSMV----------------------------------------EVYTDADWAGSTTDRRSTS

Query:  GYCSFVGGKLVTWHSKKQSVVARSSAETEFRALAH-------------------------------------------------VDKNFIKEKIDAGVIC
        GYCSFV G LVTW SKKQ+VVARSSAE EFR +AH                                                 VDK FIKEKID   +C
Subjt:  GYCSFVGGKLVTWHSKKQSVVARSSAETEFRALAH-------------------------------------------------VDKNFIKEKIDAGVIC

Query:  IPYLPTTEQIADVLTKGLPKLQFNKLTDKLAMNDIFKLA
        + Y+PT EQ+A+V TK L K QF+ L  KLAM DIFKLA
Subjt:  IPYLPTTEQIADVLTKGLPKLQFNKLTDKLAMNDIFKLA

CAN76196.1 hypothetical protein VITISV_041073 [Vitis vinifera]2.3e-21836.65Show/hide
Query:  IGYLIGEKITLSPDDPLFIVWDAENSMVMTWLVNSMVKDISSNCMCYITAKELWDSVTQMYSDLGNQSQVFELNLKL----------------------E
        +G+L GE      DDP           + TW    +V  I    +   TAK++W++V  MYSDL N SQ+F+L  KL                      E
Subjt:  IGYLIGEKITLSPDDPLFIVWDAENSMVMTWLVNSMVKDISSNCMCYITAKELWDSVTQMYSDLGNQSQVFELNLKL----------------------E

Query:  LDLFDTYEWKSTDDQKNYLKTVEDGSIYKLLASLNVEFDEVRGRILGKSTLPNINDVFSKVRREESRRNVMI---GKKAIDSAESSALVIENTAMKASNQ
        LDL    EW   +D   + K  E+  +Y  LA+LN   DEVRGRILG+  LP+I +VFS+VRREE+RR VM+      +    ESSALV + + +    +
Subjt:  LDLFDTYEWKSTDDQKNYLKTVEDGSIYKLLASLNVEFDEVRGRILGKSTLPNINDVFSKVRREESRRNVMI---GKKAIDSAESSALVIENTAMKASNQ

Query:  SN-----------------ETHDKPRHASNANIVDS--------------------NPLKEQIDQILKLLKSNSLGNPSFSLAQTGNSR-QALSCLNSS-
                           + H KP++    N  D                     N  KEQ+  + KL +S    NPS SLAQ GN    ALS + S+ 
Subjt:  SN-----------------ETHDKPRHASNANIVDS--------------------NPLKEQIDQILKLLKSNSLGNPSFSLAQTGNSR-QALSCLNSS-

Query:  --PWIIDFGAADHMT-----------------------------SKGIIPLSTKLILRFVLHVPQLACNLLS------------------------DSGE
          PWIID GA DHMT                              KG + +S  L L  VLHVP L+CNLLS                         SG 
Subjt:  --PWIIDFGAADHMT-----------------------------SKGIIPLSTKLILRFVLHVPQLACNLLS------------------------DSGE

Query:  MIEHARMINGLYYFDEVSTSNKKIQGLSSVS-SLSVQETIMLWHRRLGHPNFVYLKYLFPDLFKGIDCSVFQCEDCIFAKHHQSTFSPKSYKSSSPFYLI
         I +AR I GLY+F+  S S K IQ     S S++  + I+LWH RLGHP+F YLK+LFP LF+  + S FQCE C  AKHH+++F  + Y+ S PF LI
Subjt:  MIEHARMINGLYYFDEVSTSNKKIQGLSSVS-SLSVQETIMLWHRRLGHPNFVYLKYLFPDLFKGIDCSVFQCEDCIFAKHHQSTFSPKSYKSSSPFYLI

Query:  HTDVWGPSKVLTKNGKRWFVTFIDDHTRLTWLYLLTKKSNVKEVFVRFYKMIETQFQAKIRILHSDNEAKFFNEPLTTFLHDKHIVHQATCCNNPQQNGI
        H+DVWGPS++ T +GK+WFVTFIDDHTR++W+YLL +KS V+EVF  FY M+ TQFQ KI++  SDN  ++ N+ L  F  +K IVHQ++C + PQQNGI
Subjt:  HTDVWGPSKVLTKNGKRWFVTFIDDHTRLTWLYLLTKKSNVKEVFVRFYKMIETQFQAKIRILHSDNEAKFFNEPLTTFLHDKHIVHQATCCNNPQQNGI

Query:  AKRKYRHLLEVSRALMFSMHVPTYLWGDVGLTAAYLINRIPTK---------------------------------------------------------
        A+RK +HLLEV+RAL F+  VP YLWG+  LTA YLINR+PT+                                                         
Subjt:  AKRKYRHLLEVSRALMFSMHVPTYLWGDVGLTAAYLINRIPTK---------------------------------------------------------

Query:  -------------------------------------GESSLVEENFW--DTSPLPN-------------------------------------------
                                             GES+  + + +  + +P PN                                           
Subjt:  -------------------------------------GESSLVEENFW--DTSPLPN-------------------------------------------

Query:  -IISPEIMSFSPSIPS----VENSPAGGETLQIDLIGRNPKLQFHTRRNTTQRDRN-QTVELTQDQSDTLVNDPENPGMSFSPS-------SLNMLPDVP
         +++ E +  S S+PS      N+  G  T       +N +L  ++RR    ++ N   +   + +     N  E PG + + S       S +      
Subjt:  -IISPEIMSFSPSIPS----VENSPAGGETLQIDLIGRNPKLQFHTRRNTTQRDRN-QTVELTQDQSDTLVNDPENPGMSFSPS-------SLNMLPDVP

Query:  DFDIPIAHRKG--GCVGNLLTQNYGCVGILERKYPSYPWTYINKAAENQI-RILRDCLEIDMLDMQLEGKKLLLNLSKEFLNDSGVFSRRQPETNSSKEK
        D +IPIA RKG   C  + ++ NY     L    PS+ + + +  +  +I + +++ L++       E KK +                R  E N + E 
Subjt:  DFDIPIAHRKG--GCVGNLLTQNYGCVGILERKYPSYPWTYINKAAENQI-RILRDCLEIDMLDMQLEGKKLLLNLSKEFLNDSGVFSRRQPETNSSKEK

Query:  IDLPEDKKAVGCKWVFTIKCNADGIIERYKARLVAKGFTQTYGIDYQKTFAPVAKIKSIIILLSIAVNFDWPLYQLDVKNAFLNGELEEEVFMDLPLGFE
        + LP+ K  VGCKWVFT+K N++G +ERYKARLVAKGFTQTYGIDY +TFAPVAK+ ++ +LLSIA N DWPL QLDVKNAFLNG LEEEV+MD P GF+
Subjt:  IDLPEDKKAVGCKWVFTIKCNADGIIERYKARLVAKGFTQTYGIDYQKTFAPVAKIKSIIILLSIAVNFDWPLYQLDVKNAFLNGELEEEVFMDLPLGFE

Query:  VDL------------GINKSPRAWFERFRKAVTSYGFSQSQADHTMFYQHTENDKVVVLIVYVDDIILTGNDETGMS----------IIKDLGSLKYFLG
                       G+ +SPRAWFERF + V + G+ Q+Q+DHTMF +H+ + K+ +LIVYVDDIILTG+  T M            IKDLGSL+YFLG
Subjt:  VDL------------GINKSPRAWFERFRKAVTSYGFSQSQADHTMFYQHTENDKVVVLIVYVDDIILTGNDETGMS----------IIKDLGSLKYFLG

Query:  MEFARSKSGILVNQRKYILDLLKKTGLLGCRIVETPIEKGKVLETCGKTN--------------IPLSHT-PDIAFVVSMV-------------------
        ME ARSK GI+V+QRKYILDLLK+TG+ GCR  +TPI+  + L      N              I LSHT PDIAF VS+V                   
Subjt:  MEFARSKSGILVNQRKYILDLLKKTGLLGCRIVETPIEKGKVLETCGKTN--------------IPLSHT-PDIAFVVSMV-------------------

Query:  ---------------------EVYTDADWAGSTTDRRSTSGYCSFVGGKLVTWHSKKQSVVARSSAETEFRALAH-------------------------
                             E YTDADWAGS TDRRSTSGYC+++ G LVTW SKKQSV ARSSAE E+RA+AH                         
Subjt:  ---------------------EVYTDADWAGSTTDRRSTSGYCSFVGGKLVTWHSKKQSVVARSSAETEFRALAH-------------------------

Query:  ------------------------VDKNFIKEKIDAGVICIPYLPTTEQIADVLTKGLPKLQFNKLTDKLAMNDIF
                                +D++FIKEK++A +IC+P++PTT+QIAD+LTKGL +  F  L  KL M DI+
Subjt:  ------------------------VDKNFIKEKIDAGVICIPYLPTTEQIADVLTKGLPKLQFNKLTDKLAMNDIF

GAU39772.1 hypothetical protein TSUD_220160 [Trifolium subterraneum]9.4e-27344.1Show/hide
Query:  IGYLIGEKITLSPDDPLFIVWDAENSMVMTWLVNSMVKDISSNCMCYITAKELWDSVTQMYSDLGNQSQVFELNLKL----------------------E
        IGY+ G+K         F  WDAENSMVMTWLVNSM ++IS+N +CY TAK+LWD+V+QMYSDL NQSQV+EL L+L                      +
Subjt:  IGYLIGEKITLSPDDPLFIVWDAENSMVMTWLVNSMVKDISSNCMCYITAKELWDSVTQMYSDLGNQSQVFELNLKL----------------------E

Query:  LDLFDTYEWKSTDDQKNYLKTVEDGSIYKLLASLNVEFDEVRGRILGKSTLPNINDVFSKVRREESRRNVMIGKKAIDS---AESSALVIENTAMKA---
        LDLFD YEWKS +D K+Y+KTV+   ++K LA LNVEFDEVRGRILG++ +P I +VF++VRREESRR VM+GKK + +    E SAL +     K+   
Subjt:  LDLFDTYEWKSTDDQKNYLKTVEDGSIYKLLASLNVEFDEVRGRILGKSTLPNINDVFSKVRREESRRNVMIGKKAIDS---AESSALVIENTAMKA---

Query:  ---------------------SNQSNETHDKP----------RHASNANIVDSNPL-KEQIDQILKLLKSN-SLGNPSFSLAQTGNSRQALSCLN-SSPW
                                  + H +P          R  ++AN   S+P  KEQ+D + KLL+SN SL  P  ++AQTG +  ALS  N S+PW
Subjt:  ---------------------SNQSNETHDKP----------RHASNANIVDSNPL-KEQIDQILKLLKSN-SLGNPSFSLAQTGNSRQALSCLN-SSPW

Query:  IIDFGAADHMTS-----------------------------KGIIPLSTKLILRFVLHVPQLACNLLS------------------------DSGEMIEH
        IID GA++HMT+                             KG I +S  + L+ VLHVP+ ACNLLS                        +SG+MI  
Subjt:  IIDFGAADHMTS-----------------------------KGIIPLSTKLILRFVLHVPQLACNLLS------------------------DSGEMIEH

Query:  ARMINGLYYFDEVSTSNKKIQGLSSVS-SLSVQETIMLWHRRLGHPNFVYLKYLFPDLFKGIDCSVFQCEDCIFAKHHQSTFSPKSYKSSSPFYLIHTDV
        AR INGLYY DE    NKK   L S S  LSV + +MLWHRRLGHP+F YLKYLFP+  K I+ S   CE C  AK H+ +FS K Y +S PFYL H+DV
Subjt:  ARMINGLYYFDEVSTSNKKIQGLSSVS-SLSVQETIMLWHRRLGHPNFVYLKYLFPDLFKGIDCSVFQCEDCIFAKHHQSTFSPKSYKSSSPFYLIHTDV

Query:  WGPSKVLTKNGKRWFVTFIDDHTRLTWLYLLTKKSNVKEVFVRFYKMIETQFQAKIRILHSDNEAKFFNEPLTTFLHDKHIVHQATCCNNPQQNGIAKRK
        WGPSK+ T +GK+WFVTFIDDHTR+ W+YL+ KKS V E F  F++MIETQFQ KI IL SDN  ++FN+ L TFL  K I+HQ+TC + PQQNGIA+RK
Subjt:  WGPSKVLTKNGKRWFVTFIDDHTRLTWLYLLTKKSNVKEVFVRFYKMIETQFQAKIRILHSDNEAKFFNEPLTTFLHDKHIVHQATCCNNPQQNGIAKRK

Query:  YRHLLEVSRALMFSMHVPTYLWGDVGLTAAYLINRIPTK----------------------------------GE---SSLVEENFWDTSP-LPNIIS--
         RHLLEV+RA+M SM+VP YLWG+  LTA YLINR+PT+                                  GE   SS  E+NFW+  P L ++++  
Subjt:  YRHLLEVSRALMFSMHVPTYLWGDVGLTAAYLINRIPTK----------------------------------GE---SSLVEENFWDTSP-LPNIIS--

Query:  -PEIMSFSPSIPSVE-----NSPAGGETLQIDLIGRNPKLQFHTRRNTTQRDRNQTVELTQDQSDTLVNDP-ENPGMSFSPS----SLNMLPD-------
         P      P   + E      S  GGETL  +   RN +L+ + R+   +      +     QSD+    P +N   + SP     S N LPD       
Subjt:  -PEIMSFSPSIPSVE-----NSPAGGETLQIDLIGRNPKLQFHTRRNTTQRDRNQTVELTQDQSDTLVNDP-ENPGMSFSPS----SLNMLPD-------

Query:  --------VPDFDIPIAHRK--GGCVGNLLTQNYGCVGILERKYPSYPWTYINKAAENQIR-ILRDCLEIDMLDMQLEGKKLLLNLSKEFLNDSGVFSRR
                +PD D+PIA RK    C  + ++ NY     L   + +Y     N      ++  L D              KL +    + L  +  +S  
Subjt:  --------VPDFDIPIAHRK--GGCVGNLLTQNYGCVGILERKYPSYPWTYINKAAENQIR-ILRDCLEIDMLDMQLEGKKLLLNLSKEFLNDSGVFSRR

Query:  QPETNSSKEKIDLPEDKKAVGCKWVFTIKCNADGIIERYKARLVAKGFTQTYGIDYQKTFAPVAKIKSIIILLSIAVNFDWPLYQLDVKNAFLNGELEEE
                    LP+ KKAVGCKWVFT+KC ADG +ERYKARLVAKGFTQT+GIDYQ+TFAPVAKI SI ILLS+AVNF+W L+Q DVKNAFLNGEL EE
Subjt:  QPETNSSKEKIDLPEDKKAVGCKWVFTIKCNADGIIERYKARLVAKGFTQTYGIDYQKTFAPVAKIKSIIILLSIAVNFDWPLYQLDVKNAFLNGELEEE

Query:  VFMDLPLGFEVDLGINKSPRAWFERFRKAVTSYGFSQSQADHTMFYQHTENDKVVVLIVYVDDIILTGNDETGMS----------IIKDLGSLKYFLGME
        V+M LP GFE + G            R  +  +GF+QSQADHT+F++H+   K+ +LIVYVDDII+TG+D   ++           IKDLG LKYFLGME
Subjt:  VFMDLPLGFEVDLGINKSPRAWFERFRKAVTSYGFSQSQADHTMFYQHTENDKVVVLIVYVDDIILTGNDETGMS----------IIKDLGSLKYFLGME

Query:  FARSKSGILVNQRKYILDLLKKTGLLGCRIVETPIEKGKVLETCGKTNIP-----------------------LSHTPDIAFVVS-----MVEVYTDADW
        F RSK G  +NQRKYILDLLK+TG++GCR VETP++    L+   +  +                        L  TP    +        VE YTDADW
Subjt:  FARSKSGILVNQRKYILDLLKKTGLLGCRIVETPIEKGKVLETCGKTNIP-----------------------LSHTPDIAFVVS-----MVEVYTDADW

Query:  AGSTTDRRSTSGYCSFVGGKLVTWHSKKQSVVARSSAETEFRALAH-------------------------------------------------VDKNF
        AG+  DRRSTSGYC+FVGG  VTW SKKQSVVARSSAE EFR++AH                                                 VDK+F
Subjt:  AGSTTDRRSTSGYCSFVGGKLVTWHSKKQSVVARSSAETEFRALAH-------------------------------------------------VDKNF

Query:  IKEKIDAGVICIPYLPTTEQIADVLTKGLPKLQFNKLTDKLAMNDIF
        IKEKID+G IC+ Y+PTT Q+ADVLTK LPK QF+ +  KLAMNDIF
Subjt:  IKEKIDAGVICIPYLPTTEQIADVLTKGLPKLQFNKLTDKLAMNDIF

XP_024044151.1 uncharacterized protein LOC18046468 isoform X1 [Citrus clementina]6.8e-24739.39Show/hide
Query:  IGYLIGEKITLSPDDPLFIVWDAENSMVMTWLVNSMVKDISSNCMCYITAKELWDSVTQMYSDLGNQSQVFELNLKL----------------------E
        IGYL G     + DDP F  WDA+NSM+M+WLVNSM ++I    +   TAK+LWD+VT+ YSDLGN +Q+++L  ++                      E
Subjt:  IGYLIGEKITLSPDDPLFIVWDAENSMVMTWLVNSMVKDISSNCMCYITAKELWDSVTQMYSDLGNQSQVFELNLKL----------------------E

Query:  LDLFDTYEWKSTDDQKNYLKTVEDGSIYKLLASLNVEFDEVRGRILGKSTLPNINDVFSKVRREESRRNVMIGKKAIDSAESSALV--------IENTA-
        LD +   EW+   D   Y K +E   +++ LA L+ + DEVRGR+LGK  LP+  +VFS VRREESR+NVM+G     SAE+SAL+        +  T  
Subjt:  LDLFDTYEWKSTDDQKNYLKTVEDGSIYKLLASLNVEFDEVRGRILGKSTLPNINDVFSKVRREESRRNVMIGKKAIDSAESSALV--------IENTA-

Query:  MKASNQSN------------------ETHDKP----------RHASNANIVDSNP-------------LKEQIDQILKLL-KSNSLGNPS--FSLAQTGN
        +K S++ +                  + H KP          +H+    +V  N               KEQ++Q+ + L +S SL NPS   SLAQ GN
Subjt:  MKASNQSN------------------ETHDKP----------RHASNANIVDSNP-------------LKEQIDQILKLL-KSNSLGNPS--FSLAQTGN

Query:  SRQALSCL--NSSPWIIDFGAADHMTS-----------------------------KGIIPLSTKLILRFVLHVPQLACNLLS-----------------
        +  AL  +     PWIID GA DHMTS                             KG IP+ST L+L  VLHVP L+CNLLS                 
Subjt:  SRQALSCL--NSSPWIIDFGAADHMTS-----------------------------KGIIPLSTKLILRFVLHVPQLACNLLS-----------------

Query:  -------DSGEMIEHARMINGLYYFDEVSTSNKKIQGLSSVSSLSVQETIMLWHRRLGHPNFVYLKYLFPDLFKGIDCSVFQCEDCIFAKHHQSTFSPKS
                SG+ I  AR ++GLYYF+E  +   + Q  ++  + S+++ IMLWH RLGHP+F YL++LFP LFK  + S+FQCE C+ +KHH+++F  + 
Subjt:  -------DSGEMIEHARMINGLYYFDEVSTSNKKIQGLSSVSSLSVQETIMLWHRRLGHPNFVYLKYLFPDLFKGIDCSVFQCEDCIFAKHHQSTFSPKS

Query:  YKSSSPFYLIHTDVWGPSKVLTKNGKRWFVTFIDDHTRLTWLYLLTKKSNVKEVFVRFYKMIETQFQAKIRILHSDNEAKFFNEPLTTFLHDKHIVHQAT
        YK S+PF LIH+D+WGPS+V   +G +WF+TFIDDHTR+ W+YLL +KS    VF  F+ MI+TQFQAKI++  +DN  ++F   L  +  +  IVHQ++
Subjt:  YKSSSPFYLIHTDVWGPSKVLTKNGKRWFVTFIDDHTRLTWLYLLTKKSNVKEVFVRFYKMIETQFQAKIRILHSDNEAKFFNEPLTTFLHDKHIVHQAT

Query:  CCNNPQQNGIAKRKYRHLLEVSRALMFSMHVPTYLWGDVGLTAAYLINRIPTK-----------------------------------------------
        C + PQQNG+A+RK RHLLEV+R+LMF+  VP   WG+  LTA+YLINR+PT+                                               
Subjt:  CCNNPQQNGIAKRKYRHLLEVSRALMFSMHVPTYLWGDVGLTAAYLINRIPTK-----------------------------------------------

Query:  ------------------------------------------------GESSLVEENFWDTS-PLPNIISPEIMSFSPSIPSVENSPAGGETLQIDLIGR
                                                        GE    E++FW+ S P+P      IMS  P +PS   S    E   ++ +  
Subjt:  ------------------------------------------------GESSLVEENFWDTS-PLPNIISPEIMSFSPSIPSVENSPAGGETLQIDLIGR

Query:  NPKLQFHTRRNTTQRDRNQTVELTQDQSDTLVNDPENPGMSFSPSSLNMLPDVPDFDIPIAHRKGGCVGNLLTQNYGCVGILERKYPSYPWTYINKAAEN
         P+LQ +TRRN ++R  + +    QD      N      +  +P S ++L    D D+PIA RKG            C      KY SY           
Subjt:  NPKLQFHTRRNTTQRDRNQTVELTQDQSDTLVNDPENPGMSFSPSSLNMLPDVPDFDIPIAHRKGGCVGNLLTQNYGCVGILERKYPSYPWTYINKAAEN

Query:  QIRILRDCLEIDMLDMQLEGKKLLLNLSKEFLNDSGVFSRRQPETNSSKEKIDLPEDKKAVGCKWVFTIKCNADGIIERYKARLVAKGFTQTYGIDYQKT
          R     L +  +   ++       LS     D+        E N + E + LPE+KK VGCKW+FT+K  ADG +ERYKARLVAKGFTQTYGIDYQ+T
Subjt:  QIRILRDCLEIDMLDMQLEGKKLLLNLSKEFLNDSGVFSRRQPETNSSKEKIDLPEDKKAVGCKWVFTIKCNADGIIERYKARLVAKGFTQTYGIDYQKT

Query:  FAPVAKIKSIIILLSIAVNFDWPLYQLDVKNAFLNGELEEEVFMDLPLGFEVDLGINK-------------SPRAWFERFRKAVTSYGFSQSQADHTMFY
        FAPVAK+ SI +LLS+A +  W L QLDVKNAFLNGELEEEV+MDLP GFE + GI K             SPRAWF+RF K+V S+G+ QSQADHTMF+
Subjt:  FAPVAKIKSIIILLSIAVNFDWPLYQLDVKNAFLNGELEEEVFMDLPLGFEVDLGINK-------------SPRAWFERFRKAVTSYGFSQSQADHTMFY

Query:  QHTENDKVVVLIVYVDDIILTGNDETGMSII----------KDLGSLKYFLGMEFARSKSGILVNQRKYILDLLKKTGLLGCRIVETPIE---------K
        +H  +  VV++IVYVDDIILTG++   M  +          KDLG L+YFLGME ARS  GI V+QRKY LDLLK+TG+LGC+  +TPI+         +
Subjt:  QHTENDKVVVLIVYVDDIILTGNDETGMSII----------KDLGSLKYFLGMEFARSKSGILVNQRKYILDLLKKTGLLGCRIVETPIE---------K

Query:  GKVLE------TCGKTNIPLSHT-PDIAFVVSM----------------------------------------VEVYTDADWAGSTTDRRSTSGYCSFVG
        GK ++        GK  I LSHT PDIAF VS+                                        VEVYTDADWAGS  DRRSTSGYC+FVG
Subjt:  GKVLE------TCGKTNIPLSHT-PDIAFVVSM----------------------------------------VEVYTDADWAGSTTDRRSTSGYCSFVG

Query:  GKLVTWHSKKQSVVARSSAETEFRALAH-------------------------------------------------VDKNFIKEKIDAGVICIPYLPTT
        G LVTW SKKQSVVARSSAE EFRA+AH                                                 VD++FIKEKI++G+IC+ ++PTT
Subjt:  GKLVTWHSKKQSVVARSSAETEFRALAH-------------------------------------------------VDKNFIKEKIDAGVICIPYLPTT

Query:  EQIADVLTKGLPKLQFNKLTDKLAMNDIFKLA
        +Q+AD+LTKGL K  F  LT KL M DIF+ A
Subjt:  EQIADVLTKGLPKLQFNKLTDKLAMNDIFKLA

XP_024044152.1 uncharacterized protein LOC18046468 isoform X2 [Citrus clementina]6.8e-24739.39Show/hide
Query:  IGYLIGEKITLSPDDPLFIVWDAENSMVMTWLVNSMVKDISSNCMCYITAKELWDSVTQMYSDLGNQSQVFELNLKL----------------------E
        IGYL G     + DDP F  WDA+NSM+M+WLVNSM ++I    +   TAK+LWD+VT+ YSDLGN +Q+++L  ++                      E
Subjt:  IGYLIGEKITLSPDDPLFIVWDAENSMVMTWLVNSMVKDISSNCMCYITAKELWDSVTQMYSDLGNQSQVFELNLKL----------------------E

Query:  LDLFDTYEWKSTDDQKNYLKTVEDGSIYKLLASLNVEFDEVRGRILGKSTLPNINDVFSKVRREESRRNVMIGKKAIDSAESSALV--------IENTA-
        LD +   EW+   D   Y K +E   +++ LA L+ + DEVRGR+LGK  LP+  +VFS VRREESR+NVM+G     SAE+SAL+        +  T  
Subjt:  LDLFDTYEWKSTDDQKNYLKTVEDGSIYKLLASLNVEFDEVRGRILGKSTLPNINDVFSKVRREESRRNVMIGKKAIDSAESSALV--------IENTA-

Query:  MKASNQSN------------------ETHDKP----------RHASNANIVDSNP-------------LKEQIDQILKLL-KSNSLGNPS--FSLAQTGN
        +K S++ +                  + H KP          +H+    +V  N               KEQ++Q+ + L +S SL NPS   SLAQ GN
Subjt:  MKASNQSN------------------ETHDKP----------RHASNANIVDSNP-------------LKEQIDQILKLL-KSNSLGNPS--FSLAQTGN

Query:  SRQALSCL--NSSPWIIDFGAADHMTS-----------------------------KGIIPLSTKLILRFVLHVPQLACNLLS-----------------
        +  AL  +     PWIID GA DHMTS                             KG IP+ST L+L  VLHVP L+CNLLS                 
Subjt:  SRQALSCL--NSSPWIIDFGAADHMTS-----------------------------KGIIPLSTKLILRFVLHVPQLACNLLS-----------------

Query:  -------DSGEMIEHARMINGLYYFDEVSTSNKKIQGLSSVSSLSVQETIMLWHRRLGHPNFVYLKYLFPDLFKGIDCSVFQCEDCIFAKHHQSTFSPKS
                SG+ I  AR ++GLYYF+E  +   + Q  ++  + S+++ IMLWH RLGHP+F YL++LFP LFK  + S+FQCE C+ +KHH+++F  + 
Subjt:  -------DSGEMIEHARMINGLYYFDEVSTSNKKIQGLSSVSSLSVQETIMLWHRRLGHPNFVYLKYLFPDLFKGIDCSVFQCEDCIFAKHHQSTFSPKS

Query:  YKSSSPFYLIHTDVWGPSKVLTKNGKRWFVTFIDDHTRLTWLYLLTKKSNVKEVFVRFYKMIETQFQAKIRILHSDNEAKFFNEPLTTFLHDKHIVHQAT
        YK S+PF LIH+D+WGPS+V   +G +WF+TFIDDHTR+ W+YLL +KS    VF  F+ MI+TQFQAKI++  +DN  ++F   L  +  +  IVHQ++
Subjt:  YKSSSPFYLIHTDVWGPSKVLTKNGKRWFVTFIDDHTRLTWLYLLTKKSNVKEVFVRFYKMIETQFQAKIRILHSDNEAKFFNEPLTTFLHDKHIVHQAT

Query:  CCNNPQQNGIAKRKYRHLLEVSRALMFSMHVPTYLWGDVGLTAAYLINRIPTK-----------------------------------------------
        C + PQQNG+A+RK RHLLEV+R+LMF+  VP   WG+  LTA+YLINR+PT+                                               
Subjt:  CCNNPQQNGIAKRKYRHLLEVSRALMFSMHVPTYLWGDVGLTAAYLINRIPTK-----------------------------------------------

Query:  ------------------------------------------------GESSLVEENFWDTS-PLPNIISPEIMSFSPSIPSVENSPAGGETLQIDLIGR
                                                        GE    E++FW+ S P+P      IMS  P +PS   S    E   ++ +  
Subjt:  ------------------------------------------------GESSLVEENFWDTS-PLPNIISPEIMSFSPSIPSVENSPAGGETLQIDLIGR

Query:  NPKLQFHTRRNTTQRDRNQTVELTQDQSDTLVNDPENPGMSFSPSSLNMLPDVPDFDIPIAHRKGGCVGNLLTQNYGCVGILERKYPSYPWTYINKAAEN
         P+LQ +TRRN ++R  + +    QD      N      +  +P S ++L    D D+PIA RKG            C      KY SY           
Subjt:  NPKLQFHTRRNTTQRDRNQTVELTQDQSDTLVNDPENPGMSFSPSSLNMLPDVPDFDIPIAHRKGGCVGNLLTQNYGCVGILERKYPSYPWTYINKAAEN

Query:  QIRILRDCLEIDMLDMQLEGKKLLLNLSKEFLNDSGVFSRRQPETNSSKEKIDLPEDKKAVGCKWVFTIKCNADGIIERYKARLVAKGFTQTYGIDYQKT
          R     L +  +   ++       LS     D+        E N + E + LPE+KK VGCKW+FT+K  ADG +ERYKARLVAKGFTQTYGIDYQ+T
Subjt:  QIRILRDCLEIDMLDMQLEGKKLLLNLSKEFLNDSGVFSRRQPETNSSKEKIDLPEDKKAVGCKWVFTIKCNADGIIERYKARLVAKGFTQTYGIDYQKT

Query:  FAPVAKIKSIIILLSIAVNFDWPLYQLDVKNAFLNGELEEEVFMDLPLGFEVDLGINK-------------SPRAWFERFRKAVTSYGFSQSQADHTMFY
        FAPVAK+ SI +LLS+A +  W L QLDVKNAFLNGELEEEV+MDLP GFE + GI K             SPRAWF+RF K+V S+G+ QSQADHTMF+
Subjt:  FAPVAKIKSIIILLSIAVNFDWPLYQLDVKNAFLNGELEEEVFMDLPLGFEVDLGINK-------------SPRAWFERFRKAVTSYGFSQSQADHTMFY

Query:  QHTENDKVVVLIVYVDDIILTGNDETGMSII----------KDLGSLKYFLGMEFARSKSGILVNQRKYILDLLKKTGLLGCRIVETPIE---------K
        +H  +  VV++IVYVDDIILTG++   M  +          KDLG L+YFLGME ARS  GI V+QRKY LDLLK+TG+LGC+  +TPI+         +
Subjt:  QHTENDKVVVLIVYVDDIILTGNDETGMSII----------KDLGSLKYFLGMEFARSKSGILVNQRKYILDLLKKTGLLGCRIVETPIE---------K

Query:  GKVLE------TCGKTNIPLSHT-PDIAFVVSM----------------------------------------VEVYTDADWAGSTTDRRSTSGYCSFVG
        GK ++        GK  I LSHT PDIAF VS+                                        VEVYTDADWAGS  DRRSTSGYC+FVG
Subjt:  GKVLE------TCGKTNIPLSHT-PDIAFVVSM----------------------------------------VEVYTDADWAGSTTDRRSTSGYCSFVG

Query:  GKLVTWHSKKQSVVARSSAETEFRALAH-------------------------------------------------VDKNFIKEKIDAGVICIPYLPTT
        G LVTW SKKQSVVARSSAE EFRA+AH                                                 VD++FIKEKI++G+IC+ ++PTT
Subjt:  GKLVTWHSKKQSVVARSSAETEFRALAH-------------------------------------------------VDKNFIKEKIDAGVICIPYLPTT

Query:  EQIADVLTKGLPKLQFNKLTDKLAMNDIFKLA
        +Q+AD+LTKGL K  F  LT KL M DIF+ A
Subjt:  EQIADVLTKGLPKLQFNKLTDKLAMNDIFKLA

TrEMBL top hitse value%identityAlignment
A0A1S2XBU5 uncharacterized protein LOC1015132063.2e-21836.67Show/hide
Query:  DPLFIVWDAENSMVMTWLVNSMVKDISSNCMCYITAKELWDSVTQMYSDLGNQSQVFELNLKL----------------------ELDLFDTYEWKSTDD
        DP +  W +ENS ++ WL+++M   I    M   TAKE+ ++V + YSD+ N SQ+F+L  +L                      ELDL     WK  +D
Subjt:  DPLFIVWDAENSMVMTWLVNSMVKDISSNCMCYITAKELWDSVTQMYSDLGNQSQVFELNLKL----------------------ELDLFDTYEWKSTDD

Query:  QKNYLKTVEDGSIYKLLASLNVEFDEVRGRILGKSTLPNINDVFSKVRREESRRNVMIGKK-AIDSAESSALVIENTAMKASNQSN--------------
           +LK  E+  ++  L  LN   DEVRGRILGK  L ++ + FS+VRREE+R+ VM+GK  ++   ESS L+ +N   K++++                
Subjt:  QKNYLKTVEDGSIYKLLASLNVEFDEVRGRILGKSTLPNINDVFSKVRREESRRNVMIGKK-AIDSAESSALVIENTAMKASNQSN--------------

Query:  --ETHDKP----------RHASNANIVD--------SNPL-KEQIDQILKLLKSNSLGNPSFSLAQTGN--SRQALSCLNSSPWIIDFGAADHMT-----
          + H KP           HA  A   D        S+P  KEQ+DQ+ KLL+S +    S S+AQ GN  +   LS   S  WIID GA DHMT     
Subjt:  --ETHDKP----------RHASNANIVD--------SNPL-KEQIDQILKLLKSNSLGNPSFSLAQTGN--SRQALSCLNSSPWIIDFGAADHMT-----

Query:  ------------------------SKGIIPLSTKLILRFVLHVPQLACNLLS------------------------DSGEMIEHARMINGLYYFD---EV
                                 KG + LS KL L+ VLHVP L+CNLLS                         +G+MI +A+   GLYY D   + 
Subjt:  ------------------------SKGIIPLSTKLILRFVLHVPQLACNLLS------------------------DSGEMIEHARMINGLYYFD---EV

Query:  STSNKKIQGLSSVSSLSVQETIMLWHRRLGHPNFVYLKYLFPDLFKGIDCSVFQCEDCIFAKHHQSTFSPKSYKSSSPFYLIHTDVWGPSKVLTKNGKRW
            K      S+   S  + IMLWH RLGHP+F YLK+LFP LF   D S+F CE C FAKHH+S+FS + YK S PF +IH+DVWGP+++ T + K+W
Subjt:  STSNKKIQGLSSVSSLSVQETIMLWHRRLGHPNFVYLKYLFPDLFKGIDCSVFQCEDCIFAKHHQSTFSPKSYKSSSPFYLIHTDVWGPSKVLTKNGKRW

Query:  FVTFIDDHTRLTWLYLLTKKSNVKEVFVRFYKMIETQFQAKIRILHSDNEAKFFNEPLTTFLHDKHIVHQATCCNNPQQNGIAKRKYRHLLEVSRALMFS
        F+TFIDDHTR+ W+YLL +KS V +V   F+KM+ TQ+Q  I++  SDN  ++FN  L+ F     +VHQ++C N PQQNGIA+RK RHLLEV+RAL+F+
Subjt:  FVTFIDDHTRLTWLYLLTKKSNVKEVFVRFYKMIETQFQAKIRILHSDNEAKFFNEPLTTFLHDKHIVHQATCCNNPQQNGIAKRKYRHLLEVSRALMFS

Query:  MHVPTYLWGDVGLTAAYLINRIPTK-----------------------------GESSLVEENFWDTSPLPNIISPEIMSFSPS----------------
          VP YLWG+  LTA+YLINR+P+K                             G ++ V E+       P  I      +SP+                
Subjt:  MHVPTYLWGDVGLTAAYLINRIPTK-----------------------------GESSLVEENFWDTSPLPNIISPEIMSFSPS----------------

Query:  ---IPSVENSP----------------------------------------AGGETLQIDLIGRNPKL--------------------------------
           +  VEN P                                        A  E  Q  ++  NP +                                
Subjt:  ---IPSVENSP----------------------------------------AGGETLQIDLIGRNPKL--------------------------------

Query:  ------------QFHT-----------RRNTTQRDRNQTVELTQDQSDTLVNDPENPGMSFSPSSLNMLPDVPDFDIPIAHRK--GGCVGNLLTQNYGCV
                    +F T           RRN  QR+    +   QD S+ + N   +     S S L    + PD D+PIA RK    C    L+ NY   
Subjt:  ------------QFHT-----------RRNTTQRDRNQTVELTQDQSDTLVNDPENPGMSFSPSSLNMLPDVPDFDIPIAHRK--GGCVGNLLTQNYGCV

Query:  GILERKYPSYPWTYINKAAENQI-RILRDCLEIDMLDMQLEGKKLLLNLSKEFLNDSGVFSRRQPETNSSKEKIDLPEDKKAVGCKWVFTIKCNADGIIE
          L   + +    + +K +  +I + +++ L+I       + K+ +L               R  E N +   + LP  K  VGCKWVFT+K N+D  +E
Subjt:  GILERKYPSYPWTYINKAAENQI-RILRDCLEIDMLDMQLEGKKLLLNLSKEFLNDSGVFSRRQPETNSSKEKIDLPEDKKAVGCKWVFTIKCNADGIIE

Query:  RYKARLVAKGFTQTYGIDYQKTFAPVAKIKSIIILLSIAVNFDWPLYQLDVKNAFLNGELEEEVFMDLPLGFEVDLGIN------------KSPRAWFER
        RYKARLVAKGFTQTYGIDY +TFAPVAK+ +I +LLS+AVN DW L QLDVKNAFLNG+LEEEV+MD P GFE   G N            +SPRAWFE+
Subjt:  RYKARLVAKGFTQTYGIDYQKTFAPVAKIKSIIILLSIAVNFDWPLYQLDVKNAFLNGELEEEVFMDLPLGFEVDLGIN------------KSPRAWFER

Query:  FRKAVTSYGFSQSQADHTMFYQHTENDKVVVLIVYVDDIILTGNDETGMS----------IIKDLGSLKYFLGMEFARSKSGILVNQRKYILDLLKKTGL
        F ++V   G+ Q Q+DHT+F + +   K+ +LIVYVDDIILTG+D   M            IKDLG+LKYFLGME ARSK GI+V+QRKYILDLL++TG+
Subjt:  FRKAVTSYGFSQSQADHTMFYQHTENDKVVVLIVYVDDIILTGNDETGMS----------IIKDLGSLKYFLGMEFARSKSGILVNQRKYILDLLKKTGL

Query:  LGCRIVETPI-------EKGKVLETCGKTN------IPLSHT-PDIAFVVSM----------------------------------------VEVYTDAD
         GCR  +TP+       EKG V    G+        I L+HT PDIAF VS+                                        V ++TDAD
Subjt:  LGCRIVETPI-------EKGKVLETCGKTN------IPLSHT-PDIAFVVSM----------------------------------------VEVYTDAD

Query:  WAGSTTDRRSTSGYCSFVGGKLVTWHSKKQSVVARSSAETEFRALA-------------------------------------------------HVDKN
        WAGS TDR+ST+GYC++V G LVTW SKKQ VVARSSAE EFRA+A                                                  +D++
Subjt:  WAGSTTDRRSTSGYCSFVGGKLVTWHSKKQSVVARSSAETEFRALA-------------------------------------------------HVDKN

Query:  FIKEKIDAGVICIPYLPTTEQIADVLTKGLPKLQFNKLTDKLAMNDIF
        FIKEKIDAGVIC+P++ + +Q AD+LTK L +  F +L  KL M DI+
Subjt:  FIKEKIDAGVICIPYLPTTEQIADVLTKGLPKLQFNKLTDKLAMNDIF

A0A2Z6NTX3 Integrase catalytic domain-containing protein4.6e-27344.1Show/hide
Query:  IGYLIGEKITLSPDDPLFIVWDAENSMVMTWLVNSMVKDISSNCMCYITAKELWDSVTQMYSDLGNQSQVFELNLKL----------------------E
        IGY+ G+K         F  WDAENSMVMTWLVNSM ++IS+N +CY TAK+LWD+V+QMYSDL NQSQV+EL L+L                      +
Subjt:  IGYLIGEKITLSPDDPLFIVWDAENSMVMTWLVNSMVKDISSNCMCYITAKELWDSVTQMYSDLGNQSQVFELNLKL----------------------E

Query:  LDLFDTYEWKSTDDQKNYLKTVEDGSIYKLLASLNVEFDEVRGRILGKSTLPNINDVFSKVRREESRRNVMIGKKAIDS---AESSALVIENTAMKA---
        LDLFD YEWKS +D K+Y+KTV+   ++K LA LNVEFDEVRGRILG++ +P I +VF++VRREESRR VM+GKK + +    E SAL +     K+   
Subjt:  LDLFDTYEWKSTDDQKNYLKTVEDGSIYKLLASLNVEFDEVRGRILGKSTLPNINDVFSKVRREESRRNVMIGKKAIDS---AESSALVIENTAMKA---

Query:  ---------------------SNQSNETHDKP----------RHASNANIVDSNPL-KEQIDQILKLLKSN-SLGNPSFSLAQTGNSRQALSCLN-SSPW
                                  + H +P          R  ++AN   S+P  KEQ+D + KLL+SN SL  P  ++AQTG +  ALS  N S+PW
Subjt:  ---------------------SNQSNETHDKP----------RHASNANIVDSNPL-KEQIDQILKLLKSN-SLGNPSFSLAQTGNSRQALSCLN-SSPW

Query:  IIDFGAADHMTS-----------------------------KGIIPLSTKLILRFVLHVPQLACNLLS------------------------DSGEMIEH
        IID GA++HMT+                             KG I +S  + L+ VLHVP+ ACNLLS                        +SG+MI  
Subjt:  IIDFGAADHMTS-----------------------------KGIIPLSTKLILRFVLHVPQLACNLLS------------------------DSGEMIEH

Query:  ARMINGLYYFDEVSTSNKKIQGLSSVS-SLSVQETIMLWHRRLGHPNFVYLKYLFPDLFKGIDCSVFQCEDCIFAKHHQSTFSPKSYKSSSPFYLIHTDV
        AR INGLYY DE    NKK   L S S  LSV + +MLWHRRLGHP+F YLKYLFP+  K I+ S   CE C  AK H+ +FS K Y +S PFYL H+DV
Subjt:  ARMINGLYYFDEVSTSNKKIQGLSSVS-SLSVQETIMLWHRRLGHPNFVYLKYLFPDLFKGIDCSVFQCEDCIFAKHHQSTFSPKSYKSSSPFYLIHTDV

Query:  WGPSKVLTKNGKRWFVTFIDDHTRLTWLYLLTKKSNVKEVFVRFYKMIETQFQAKIRILHSDNEAKFFNEPLTTFLHDKHIVHQATCCNNPQQNGIAKRK
        WGPSK+ T +GK+WFVTFIDDHTR+ W+YL+ KKS V E F  F++MIETQFQ KI IL SDN  ++FN+ L TFL  K I+HQ+TC + PQQNGIA+RK
Subjt:  WGPSKVLTKNGKRWFVTFIDDHTRLTWLYLLTKKSNVKEVFVRFYKMIETQFQAKIRILHSDNEAKFFNEPLTTFLHDKHIVHQATCCNNPQQNGIAKRK

Query:  YRHLLEVSRALMFSMHVPTYLWGDVGLTAAYLINRIPTK----------------------------------GE---SSLVEENFWDTSP-LPNIIS--
         RHLLEV+RA+M SM+VP YLWG+  LTA YLINR+PT+                                  GE   SS  E+NFW+  P L ++++  
Subjt:  YRHLLEVSRALMFSMHVPTYLWGDVGLTAAYLINRIPTK----------------------------------GE---SSLVEENFWDTSP-LPNIIS--

Query:  -PEIMSFSPSIPSVE-----NSPAGGETLQIDLIGRNPKLQFHTRRNTTQRDRNQTVELTQDQSDTLVNDP-ENPGMSFSPS----SLNMLPD-------
         P      P   + E      S  GGETL  +   RN +L+ + R+   +      +     QSD+    P +N   + SP     S N LPD       
Subjt:  -PEIMSFSPSIPSVE-----NSPAGGETLQIDLIGRNPKLQFHTRRNTTQRDRNQTVELTQDQSDTLVNDP-ENPGMSFSPS----SLNMLPD-------

Query:  --------VPDFDIPIAHRK--GGCVGNLLTQNYGCVGILERKYPSYPWTYINKAAENQIR-ILRDCLEIDMLDMQLEGKKLLLNLSKEFLNDSGVFSRR
                +PD D+PIA RK    C  + ++ NY     L   + +Y     N      ++  L D              KL +    + L  +  +S  
Subjt:  --------VPDFDIPIAHRK--GGCVGNLLTQNYGCVGILERKYPSYPWTYINKAAENQIR-ILRDCLEIDMLDMQLEGKKLLLNLSKEFLNDSGVFSRR

Query:  QPETNSSKEKIDLPEDKKAVGCKWVFTIKCNADGIIERYKARLVAKGFTQTYGIDYQKTFAPVAKIKSIIILLSIAVNFDWPLYQLDVKNAFLNGELEEE
                    LP+ KKAVGCKWVFT+KC ADG +ERYKARLVAKGFTQT+GIDYQ+TFAPVAKI SI ILLS+AVNF+W L+Q DVKNAFLNGEL EE
Subjt:  QPETNSSKEKIDLPEDKKAVGCKWVFTIKCNADGIIERYKARLVAKGFTQTYGIDYQKTFAPVAKIKSIIILLSIAVNFDWPLYQLDVKNAFLNGELEEE

Query:  VFMDLPLGFEVDLGINKSPRAWFERFRKAVTSYGFSQSQADHTMFYQHTENDKVVVLIVYVDDIILTGNDETGMS----------IIKDLGSLKYFLGME
        V+M LP GFE + G            R  +  +GF+QSQADHT+F++H+   K+ +LIVYVDDII+TG+D   ++           IKDLG LKYFLGME
Subjt:  VFMDLPLGFEVDLGINKSPRAWFERFRKAVTSYGFSQSQADHTMFYQHTENDKVVVLIVYVDDIILTGNDETGMS----------IIKDLGSLKYFLGME

Query:  FARSKSGILVNQRKYILDLLKKTGLLGCRIVETPIEKGKVLETCGKTNIP-----------------------LSHTPDIAFVVS-----MVEVYTDADW
        F RSK G  +NQRKYILDLLK+TG++GCR VETP++    L+   +  +                        L  TP    +        VE YTDADW
Subjt:  FARSKSGILVNQRKYILDLLKKTGLLGCRIVETPIEKGKVLETCGKTNIP-----------------------LSHTPDIAFVVS-----MVEVYTDADW

Query:  AGSTTDRRSTSGYCSFVGGKLVTWHSKKQSVVARSSAETEFRALAH-------------------------------------------------VDKNF
        AG+  DRRSTSGYC+FVGG  VTW SKKQSVVARSSAE EFR++AH                                                 VDK+F
Subjt:  AGSTTDRRSTSGYCSFVGGKLVTWHSKKQSVVARSSAETEFRALAH-------------------------------------------------VDKNF

Query:  IKEKIDAGVICIPYLPTTEQIADVLTKGLPKLQFNKLTDKLAMNDIF
        IKEKID+G IC+ Y+PTT Q+ADVLTK LPK QF+ +  KLAMNDIF
Subjt:  IKEKIDAGVICIPYLPTTEQIADVLTKGLPKLQFNKLTDKLAMNDIF

A0A2Z7CP84 Beta-galactosidase6.1e-21736.78Show/hide
Query:  IGYLIGEKITLSPDDPLFIVWDAENSMVMTWLVNSMVKDISSNCMCYITAKELWDSVTQMYSDLGNQSQVFELNLKL----------------------E
        +GYL G+       DP +  W +ENSMVM WL+NSM   I    +   TAK++W++V + YSDL N SQ+++L  +L                      E
Subjt:  IGYLIGEKITLSPDDPLFIVWDAENSMVMTWLVNSMVKDISSNCMCYITAKELWDSVTQMYSDLGNQSQVFELNLKL----------------------E

Query:  LDLFDTYEWKSTDDQKNYLKTVEDGSIYKLLASLNVEFDEVRGRILGKSTLPNINDVFSKVRREESRRNVMIGKKAIDSAESSALVIEN----------T
        LDL    +W+  +D   Y K +E   ++  LA LN E DEVRGRILG+  LP++ +VF+++RR E RR VM+ +K     E+SAL+  N          +
Subjt:  LDLFDTYEWKSTDDQKNYLKTVEDGSIYKLLASLNVEFDEVRGRILGKSTLPNINDVFSKVRREESRRNVMIGKKAIDSAESSALVIEN----------T

Query:  AMKASNQSNETHDKPRHA----------------------------------SNANIVDSNPL--KEQIDQILKLLKSNSL-----GNPSFSLAQTGNSR
         MK      E   KP H                                     A +    P   KEQ+ Q+ KL +S           S SLAQ G++ 
Subjt:  AMKASNQSNETHDKPRHA----------------------------------SNANIVDSNPL--KEQIDQILKLLKSNSL-----GNPSFSLAQTGNSR

Query:  QALSCLNSSP---WIIDFGAADHMT-----------------------------SKGIIPLSTKLILRFVLHVPQLACNLL-------------------
           +  + SP   WI+D GA DHMT                              KG I +S  L L  VLHVP L+CNLL                   
Subjt:  QALSCLNSSP---WIIDFGAADHMT-----------------------------SKGIIPLSTKLILRFVLHVPQLACNLL-------------------

Query:  -----SDSGEMIEHARMINGLYYFDEVSTSNKKIQGLSSVSSLSVQ---ETIMLWHRRLGHPNFVYLKYLFPDLFKGIDCSVFQCEDCIFAKHHQSTFSP
             SDSG MI  AR   GLYYF+   +      G +  SSL +    + IML HRRLGHP+  YLK +FP L K  +     CE C FAKHH+++F  
Subjt:  -----SDSGEMIEHARMINGLYYFDEVSTSNKKIQGLSSVSSLSVQ---ETIMLWHRRLGHPNFVYLKYLFPDLFKGIDCSVFQCEDCIFAKHHQSTFSP

Query:  KSYKSSSPFYLIHTDVWGPSKVLTKNGKRWFVTFIDDHTRLTWLYLLTKKSNVKEVFVRFYKMIETQFQAKIRILHSDNEAKFFNEPLTTFLHDKHIVHQ
        KSYK+S+PF L+H+DVWGP  + T +GK+WFVTFIDDHTR++W+YLL  KS V   F  F+ M++TQFQ  I++L SDN  ++ N+ L  F     ++HQ
Subjt:  KSYKSSSPFYLIHTDVWGPSKVLTKNGKRWFVTFIDDHTRLTWLYLLTKKSNVKEVFVRFYKMIETQFQAKIRILHSDNEAKFFNEPLTTFLHDKHIVHQ

Query:  ATCCNNPQQNGIAKRKYRHLLEVSRALMFSMHVPTYLWGDVGLTAAYLINRIPTK----GESSLVEENFWDTSPLPNIISPEI-----------------
        ++C   PQQNG+A+RK RHLLEV+R+LMF+  VP YLWG+  LTA YLINR+P++       SL+    + TS L + + P +                 
Subjt:  ATCCNNPQQNGIAKRKYRHLLEVSRALMFSMHVPTYLWGDVGLTAAYLINRIPTK----GESSLVEENFWDTSPLPNIISPEI-----------------

Query:  -------MSFSPS----------------------------------------IPSVENSPAGGETL------------------QIDLIGRNPKLQF--
               + +SPS                                        + SV   P   E L                  Q ++   +    F  
Subjt:  -------MSFSPS----------------------------------------IPSVENSPAGGETL------------------QIDLIGRNPKLQF--

Query:  -HTRRNTTQRDRNQTVELTQDQSDTLVNDPENPGMSFSPS-----SLNMLPDVPDFDIPIAHRKG--GCVGNLLTQNYGCVGILERKYPSYPWTYINKAA
         + RRNT +R    T+    DQ     N+P  P  + SP       L+        DIPIA RKG   C  + L+ +     I +    SY   +I++  
Subjt:  -HTRRNTTQRDRNQTVELTQDQSDTLVNDPENPGMSFSPS-----SLNMLPDVPDFDIPIAHRKG--GCVGNLLTQNYGCVGILERKYPSYPWTYINKAA

Query:  ENQI-RILRDCLEIDMLDMQLEGKKLLLNLSKEFLNDSGVFSRRQPETNSSKEKIDLPEDKKAVGCKWVFTIKCNADGIIERYKARLVAKGFTQTYGIDY
          +I + +++ L +         K+ +L   K  +             N + E + LP+++K VGC+WVF +K N+DG +ERYK  LVA+G+TQTYGIDY
Subjt:  ENQI-RILRDCLEIDMLDMQLEGKKLLLNLSKEFLNDSGVFSRRQPETNSSKEKIDLPEDKKAVGCKWVFTIKCNADGIIERYKARLVAKGFTQTYGIDY

Query:  QKTFAPVAKIKSIIILLSIAVNFDWPLYQLDVKNAFLNGELEEEVFMDLPLGFEVDL------------GINKSPRAWFERFRKAVTSYGFSQSQADHTM
        Q+TFAPV K+ +I ILLSIA N +WPL+QLDVKNAFLNGELEEEVFMD P GFE               G+ +SPRAWFE+F K+V + G++Q Q+DHTM
Subjt:  QKTFAPVAKIKSIIILLSIAVNFDWPLYQLDVKNAFLNGELEEEVFMDLPLGFEVDL------------GINKSPRAWFERFRKAVTSYGFSQSQADHTM

Query:  FYQHTENDKVVVLIVYVDDIILTGNDE----------TGMSIIKDLGSLKYFLGMEFARSKSGILVNQRKYILDLLKKTGLLGCRIVETPIEKGKVL---
        F + +   KV VLIVYVDDII++GNDE          +    +KDLG L+YFLGME ARSK GI V+QRKYILDLLK TG+ GCR  +TPI+    L   
Subjt:  FYQHTENDKVVVLIVYVDDIILTGNDE----------TGMSIIKDLGSLKYFLGMEFARSKSGILVNQRKYILDLLKKTGLLGCRIVETPIEKGKVL---

Query:  -----------ETCGKTNIPLSHT-PDIAFVVSMV----------------------------------------EVYTDADWAGSTTDRRSTSGYCSFV
                   +   +  I LSHT PDIAF VSMV                                        E +TDADWAGS TDRRSTSGYC F+
Subjt:  -----------ETCGKTNIPLSHT-PDIAFVVSMV----------------------------------------EVYTDADWAGSTTDRRSTSGYCSFV

Query:  GGKLVTWHSKKQSVVARSSAETEFRALAH-------------------------------------------------VDKNFIKEKIDAGVICIPYLPT
         G LVTW SKKQ+VVARSSAE E RA+A+                                                 +D++FIKEK++ G IC+PY+P+
Subjt:  GGKLVTWHSKKQSVVARSSAETEFRALAH-------------------------------------------------VDKNFIKEKIDAGVICIPYLPT

Query:  TEQIADVLTKGLPKLQFNKLTDKLAMNDIF
         +QIAD  TKG  + +F   T KL M DIF
Subjt:  TEQIADVLTKGLPKLQFNKLTDKLAMNDIF

A5B9Y8 Integrase catalytic domain-containing protein3.8e-23538.92Show/hide
Query:  IGYLIGEKITLSPDDPLFIVWDAENSMVMTWLVNSMVKDISSNCMCYITAKELWDSVTQMYSDLGNQSQVFELNLKLELDLFDTYEWKSTDDQKNYLKTV
        +GYL GEK   + DDP + +WDAENSM                                                                D +++ KT+
Subjt:  IGYLIGEKITLSPDDPLFIVWDAENSMVMTWLVNSMVKDISSNCMCYITAKELWDSVTQMYSDLGNQSQVFELNLKLELDLFDTYEWKSTDDQKNYLKTV

Query:  EDGSIYKLLASLNVEFDEVRGRILGKSTLPNINDVFSKVRREESRRNVMIGKKAIDSA-ESSALVIENTAM-----------------------------
        ED  I+K L  LNVEFDEVR RI+ +  LP+I + FS+VRREES+RNVM+GKK    A E S LV                                   
Subjt:  EDGSIYKLLASLNVEFDEVRGRILGKSTLPNINDVFSKVRREESRRNVMIGKKAIDSA-ESSALVIENTAM-----------------------------

Query:  -------KASNQSNETHDKPRHA--SNANIVDSNPL-KEQIDQILKLLKSN-SLGNPSFSLAQTGNSRQALSC-LNSSPWIIDFGAADHMTS--------
               K +N   +T DKP  A    AN  +++    EQ++ +L LLKSN + G  S SLA TGN   ALSC   S+PWIID GA+DHMT+        
Subjt:  -------KASNQSNETHDKPRHA--SNANIVDSNPL-KEQIDQILKLLKSN-SLGNPSFSLAQTGNSRQALSC-LNSSPWIIDFGAADHMTS--------

Query:  ---------------------KGIIPLSTKLILRFVLHVPQLACNLL------------------------SDSGEMIEHARMINGLYYFDEVSTSNKKI
                             KG+I +S  + L+FVLHVP+L CNLL                          S + I  ARMINGLYYF++   SNK  
Subjt:  ---------------------KGIIPLSTKLILRFVLHVPQLACNLL------------------------SDSGEMIEHARMINGLYYFDEVSTSNKKI

Query:  QGLSSVSSLSVQETIMLWHRRLGHPNFVYLKYLFPDLFKGIDCSVFQCEDCIFAKHHQSTFSPKSYKSSSPFYLIHTDVWGPSKVLTKNGKRWFVTFIDD
        QGLSS+SSL V++ IM+WH +LG P+F YLK+LFP LF+ +D   FQCE C+ AK  + T+  K Y +S PFYL H+DVWGPSKV T +GK+WFVTFIDD
Subjt:  QGLSSVSSLSVQETIMLWHRRLGHPNFVYLKYLFPDLFKGIDCSVFQCEDCIFAKHHQSTFSPKSYKSSSPFYLIHTDVWGPSKVLTKNGKRWFVTFIDD

Query:  HTRLTWLYLLTKKSNVKEVFVRFYKMIETQFQAKIRILHSDNEAKFFNEPLTTFLHDKHIVHQATCCNNPQQNGIAKRKYRHLLEVSRALMFSMHVPTYL
        HTRL W+YL+ +KS V+ +F  FYKMIE QFQ KI IL SDN  K+FN+ L TF + K I+HQ++C + PQQNGIA+RK +HLLEV+RA+MF M++P YL
Subjt:  HTRLTWLYLLTKKSNVKEVFVRFYKMIETQFQAKIRILHSDNEAKFFNEPLTTFLHDKHIVHQATCCNNPQQNGIAKRKYRHLLEVSRALMFSMHVPTYL

Query:  WGDVGLTAAYLINRIPTK----------------------------------------------------------------------------------
        WGD  LTA+YLINR+PTK                                                                                  
Subjt:  WGDVGLTAAYLINRIPTK----------------------------------------------------------------------------------

Query:  -------------GESSLVEENFWD-TSPLPNII-----SPEIMSFSPSIPSVENSPAGGETLQIDLIGRNPKLQFHTRRNTTQRDRNQTVELTQDQSDT
                     GE  LVE NFW+   P P++I       E     P     E   +  E L++     N +   ++R+  + R ++Q +     Q   
Subjt:  -------------GESSLVEENFWD-TSPLPNII-----SPEIMSFSPSIPSVENSPAGGETLQIDLIGRNPKLQFHTRRNTTQRDRNQTVELTQDQSDT

Query:  LVNDPEN-------------------------------PGMSFSPSSLNMLPDVP----DFDIPIAHRKGGCVGNLLTQNYGCVGILERKYPSYPWTYIN
        L N   N                               P +S     L +   VP    D D+PIA RKG            C   L  KY SY     +
Subjt:  LVNDPEN-------------------------------PGMSFSPSSLNMLPDVP----DFDIPIAHRKGGCVGNLLTQNYGCVGILERKYPSYPWTYIN

Query:  KAAENQIRILRDCLEIDM---LDMQLEGKKLLLNLSKEFLNDSGVFSRRQPETNSSKEKIDLPEDKKAVGCKWVFTIKCNADGIIERYKARLVAKGFTQT
          ++N      +  ++ +   +   L+     L + KE             + N + E +DLP +KK VGCKWVFTIK  ADG +ERYKARLVAKGFTQT
Subjt:  KAAENQIRILRDCLEIDM---LDMQLEGKKLLLNLSKEFLNDSGVFSRRQPETNSSKEKIDLPEDKKAVGCKWVFTIKCNADGIIERYKARLVAKGFTQT

Query:  YGIDYQKTFAPVAKIKSIIILLSIAVNFDWPLYQLDVKNAFLNGELEEEVFMDLPLGFEVDLGINK-------------SPRAWFERFRKAVTSYGFSQS
        YGIDYQ+TFAPVAKI SI +LLS+ VN +WPL+QLDVKNAFLNG+LEEEVFM  P  FE   G+ K             SPRAWFERF K +  YG++QS
Subjt:  YGIDYQKTFAPVAKIKSIIILLSIAVNFDWPLYQLDVKNAFLNGELEEEVFMDLPLGFEVDLGINK-------------SPRAWFERFRKAVTSYGFSQS

Query:  QADHTMFYQHTENDKVVVLIVYVDDIILTGNDETGMS----------IIKDLGSLKYFLGMEFARSKSGILVNQRKYILDLLKKTGLLGCRIVETPIEKG
        QA+HTMFY+H+   KVV+LIVYVDDI+LTG+D   +            IKDLG+LKYFLGMEFARSK GI VNQRKY+ +LL +  +LGC+  E PIE  
Subjt:  QADHTMFYQHTENDKVVVLIVYVDDIILTGNDETGMS----------IIKDLGSLKYFLGMEFARSKSGILVNQRKYILDLLKKTGLLGCRIVETPIEKG

Query:  KVLETCGKTNIP--------------LSHT-PDIAFVVSMV----------------------------------------EVYTDADWAGSTTDRRSTS
          L+     N+               LSHT PDIAF VSMV                                        E YT+ADWAGS  DRRSTS
Subjt:  KVLETCGKTNIP--------------LSHT-PDIAFVVSMV----------------------------------------EVYTDADWAGSTTDRRSTS

Query:  GYCSFVGGKLVTWHSKKQSVVARSSAETEFRALAH-------------------------------------------------VDKNFIKEKIDAGVIC
        GYCSFV G LVTW SKKQ+VVARSSAE EFR +AH                                                 VDK FIKEKID   +C
Subjt:  GYCSFVGGKLVTWHSKKQSVVARSSAETEFRALAH-------------------------------------------------VDKNFIKEKIDAGVIC

Query:  IPYLPTTEQIADVLTKGLPKLQFNKLTDKLAMNDIFKLA
        + Y+PT EQ+A+V TK L K QF+ L  KLAM DIFKLA
Subjt:  IPYLPTTEQIADVLTKGLPKLQFNKLTDKLAMNDIFKLA

A5BNN1 Integrase catalytic domain-containing protein4.6e-21738.98Show/hide
Query:  IGYLIGEKITLSPDDPLFIVWDAENSMVMTWLVNSMVKDISSNCMCYITAKELWDSVTQMYSDLGNQSQVFELNLKL----------------------E
        +GYL GEK   + DDP + +WDAENSMVMTWLVNSM +DI+ N MCY T +ELW++V QMY DLGNQSQ+FEL LKL                      +
Subjt:  IGYLIGEKITLSPDDPLFIVWDAENSMVMTWLVNSMVKDISSNCMCYITAKELWDSVTQMYSDLGNQSQVFELNLKL----------------------E

Query:  LDLFDTYEWKSTDDQKNYLKTVEDGSIYKLLASLNVEFDEVRGRILGKSTLPNI-NDVFSKVRREESRRNVMIGKKAIDSAESSALVIENTAMKASNQSN
        LD F+TYEWKS +D  ++ KT+ED  I+K LA LNVEFDE +         P    D  +K R        + GK A                   N   
Subjt:  LDLFDTYEWKSTDDQKNYLKTVEDGSIYKLLASLNVEFDEVRGRILGKSTLPNI-NDVFSKVRREESRRNVMIGKKAIDSAESSALVIENTAMKASNQSN

Query:  ETHDKPRHA--SNANIVDSNPL-KEQIDQILKLLKSN-SLGNPSFSLAQTGNSRQALSC-LNSSPWIIDFGAADHMTS----------------------
        +T DKP  A     N  +++P   EQ++  L LLKSN + G  S SLA TGN   ALSC   S+PWI+DFGA+DHMT+                      
Subjt:  ETHDKPRHA--SNANIVDSNPL-KEQIDQILKLLKSN-SLGNPSFSLAQTGNSRQALSC-LNSSPWIIDFGAADHMTS----------------------

Query:  -------KGIIPLSTKLILRFVLHVPQLACNLLSDSGEMIEHARMINGLYYFDEVSTSNKKIQGLSSVSSLSVQETIMLWHRRLGHPNFVYLKYLFPDLF
               KG+I +S  + L+ VLHVP+L       SG+ I  ARMI+GLYYF++   SNK  QGLSS+SSL V++ IM+WH RLGHP+F YLK+LFP LF
Subjt:  -------KGIIPLSTKLILRFVLHVPQLACNLLSDSGEMIEHARMINGLYYFDEVSTSNKKIQGLSSVSSLSVQETIMLWHRRLGHPNFVYLKYLFPDLF

Query:  KGIDCSVFQCEDCIFAKHHQSTFSPKSYKSSSPFYLIHTDVWGPSKVLTKNGKRWFVTFIDDHTRLTWLYLLTKKSNVKEVFVRFYKMIETQFQAKIRIL
        + +D   FQCE C+ AK  + T+ PK Y +S PFYL H+DVWGPSKV T +GK+WFVTFI+DHTRL W+YL+ +KS V+ +F  FY+MIE QFQ KI IL
Subjt:  KGIDCSVFQCEDCIFAKHHQSTFSPKSYKSSSPFYLIHTDVWGPSKVLTKNGKRWFVTFIDDHTRLTWLYLLTKKSNVKEVFVRFYKMIETQFQAKIRIL

Query:  HSDNEAKFFNEPLTTFLHDKHIVHQATCCNNPQQNGIAKRKYRHLLEVSRALMFSMHVPTYLWGDVGLTAAYLINRIPTK--------------------
         SDN  ++FN+ L TF ++K I+HQ++C +  +QNGIA+ K +HLLEV+RA+MF M++P YLW D  LTA+YLINR+PTK                    
Subjt:  HSDNEAKFFNEPLTTFLHDKHIVHQATCCNNPQQNGIAKRKYRHLLEVSRALMFSMHVPTYLWGDVGLTAAYLINRIPTK--------------------

Query:  ---------------------------------------------------------------------------GESSLVEENFWD-TSPLPNII----
                                                                                   GE  LVE NFW+   PL ++I    
Subjt:  ---------------------------------------------------------------------------GESSLVEENFWD-TSPLPNII----

Query:  -SPEIMSFSPSIPSVENSPAGGETLQIDLIGRNPKLQFHTRRNTTQRDRNQTVELTQDQSDTLVNDPEN-------------------------------
           E      +    E   +  E L++     N +   ++R+    R ++Q +     Q   L N   N                               
Subjt:  -SPEIMSFSPSIPSVENSPAGGETLQIDLIGRNPKLQFHTRRNTTQRDRNQTVELTQDQSDTLVNDPEN-------------------------------

Query:  PGMSFSPSSLNMLPDV----PDFDIPIAHRKG--GCVGNLLTQNYGCVGILERKYPSYPWTYINKAAENQIRILRDCLEIDMLDMQLEGKKLLLNLSKEF
        P +S     L + P V     D D+PIA RKG   C  + +++ Y     L   Y ++            I+      E+D    +L   + +  L K  
Subjt:  PGMSFSPSSLNMLPDV----PDFDIPIAHRKG--GCVGNLLTQNYGCVGILERKYPSYPWTYINKAAENQIRILRDCLEIDMLDMQLEGKKLLLNLSKEF

Query:  LNDSGVFSRRQPETNSSKEKIDLPEDKKAVGCKWVFTIKCNADGIIERYKARLVAKGFTQTYGIDYQKTFAPVAKIKSIIILLSIAVNFDWPLYQLDVKN
                      N + E IDLP +KK VGCKWVFTIK   DG +ERYKARLVAK                              VN +WPL+QLDVKN
Subjt:  LNDSGVFSRRQPETNSSKEKIDLPEDKKAVGCKWVFTIKCNADGIIERYKARLVAKGFTQTYGIDYQKTFAPVAKIKSIIILLSIAVNFDWPLYQLDVKN

Query:  AFLNGELEEEVFMDLPLGFEVDLGINK-------------SPRAWFERFRKAVTSYGFSQSQADHTMFYQHTENDKVVVLIVYVDDIILTGNDETGMS--
        AFLNG+LE+EVFM    GFE   G+ K             SPRAWFERF K +  YG++QSQADHTMFY+H+   KVV+LIVYVDDI+LTG+D   +   
Subjt:  AFLNGELEEEVFMDLPLGFEVDLGINK-------------SPRAWFERFRKAVTSYGFSQSQADHTMFYQHTENDKVVVLIVYVDDIILTGNDETGMS--

Query:  --------IIKDLGSLKYFLGMEFARSKSGILVNQRKYILDLLKKTGLLGCRIVETPIEKGKVLETCGKTNIP--------------LSHT-PDIAFVVS
                 IKDLG+LKYFLGMEFARSK GI VNQRKY+LDLL +TG+LGC+  ETPIE    L+     N+               LSHT PDIAF VS
Subjt:  --------IIKDLGSLKYFLGMEFARSKSGILVNQRKYILDLLKKTGLLGCRIVETPIEKGKVLETCGKTNIP--------------LSHT-PDIAFVVS

Query:  MVEVY---------------------TDADWAGSTTDRRSTSGYCSFVGGKLVTWH--SKKQSVVARSSAETEFRALAHVDKNFIKEKIDAGVICIPYLP
        MV  +                     T A      T  R        +G   +  +  +K    VA +    +      VDK+FIKEKID  ++C+ Y+P
Subjt:  MVEVY---------------------TDADWAGSTTDRRSTSGYCSFVGGKLVTWH--SKKQSVVARSSAETEFRALAHVDKNFIKEKIDAGVICIPYLP

Query:  TTEQIADVLTKGLPKLQFNKLTDKLAMNDIFKLA
        T EQ+ADV  KGL K QF+ L  KLAM DIFK A
Subjt:  TTEQIADVLTKGLPKLQFNKLTDKLAMNDIFKLA

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.7e-7124.91Show/hide
Query:  HMTSKGIIPLST--KLILRFVLHVPQLACNLLS-----DSGEMIEHAR-----MINGLYYF-DEVSTSNKKIQGLSSVS-SLSVQETIMLWHRRLGHPN-
        + T +GI+ L    ++ L  VL   + A NL+S     ++G  IE  +       NGL    +    +N  +    + S +   +    LWH R GH + 
Subjt:  HMTSKGIIPLST--KLILRFVLHVPQLACNLLS-----DSGEMIEHAR-----MINGLYYF-DEVSTSNKKIQGLSSVS-SLSVQETIMLWHRRLGHPN-

Query:  ----FVYLKYLFPD--LFKGIDCSVFQCEDCIFAKHHQSTFSPKSYKS--SSPFYLIHTDVWGPSKVLTKNGKRWFVTFIDDHTRLTWLYLLTKKSNVKE
             +  K +F D  L   ++ S   CE C+  K  +  F     K+    P +++H+DV GP   +T + K +FV F+D  T     YL+  KS+V  
Subjt:  ----FVYLKYLFPD--LFKGIDCSVFQCEDCIFAKHHQSTFSPKSYKS--SSPFYLIHTDVWGPSKVLTKNGKRWFVTFIDDHTRLTWLYLLTKKSNVKE

Query:  VFVRFYKMIETQFQAKIRILHSDNEAKFFNEPLTTFLHDKHIVHQATCCNNPQQNGIAKRKYRHLLEVSRALMFSMHVPTYLWGDVGLTAAYLINRIPTK
        +F  F    E  F  K+  L+ DN  ++ +  +  F   K I +  T  + PQ NG+++R  R + E +R ++    +    WG+  LTA YLINRIP++
Subjt:  VFVRFYKMIETQFQAKIRILHSDNEAKFFNEPLTTFLHDKHIVHQATCCNNPQQNGIAKRKYRHLLEVSRALMFSMHVPTYLWGDVGLTAAYLINRIPTK

Query:  G--ESSLVEENFWDTSPLPNIISPEIMSFSPSIPSVENSPA--GGETLQIDLIGRNP---KL------QFHTRRNTTQRDRN---------QTVELTQDQ
           +SS      W     P +    +   +  +  ++N       ++ +   +G  P   KL      +F   R+    + N         +TV L   +
Subjt:  G--ESSLVEENFWDTSPLPNIISPEIMSFSPSIPSVENSPA--GGETLQIDLIGRNP---KL------QFHTRRNTTQRDRN---------QTVELTQDQ

Query:  SDTLVNDP---------ENPGMSFSPSSLNMLPDVPDFD---IPIAHRKGGCVGNLLTQNYGCVGILERKYPSYPWTYI---------------------
             N P         E P  S    ++  L D  + +    P   RK         ++  C  I   K       Y                      
Subjt:  SDTLVNDP---------ENPGMSFSPSSLNMLPDVPDFD---IPIAHRKGGCVGNLLTQNYGCVGILERKYPSYPWTYI---------------------

Query:  -NKAAENQ------------------IRILRDCLEIDMLDMQLEGKKLLLNLSKEFLNDSGVFS---------RRQPETNSSKEKIDL------------
         N++ E++                  I I+    E      Q+   +   +L+K  LN   +F+         + + + +S +E I+             
Subjt:  -NKAAENQ------------------IRILRDCLEIDMLDMQLEGKKLLLNLSKEFLNDSGVFS---------RRQPETNSSKEKIDL------------

Query:  ----PEDKKAVGCKWVFTIKCNADGIIERYKARLVAKGFTQTYGIDYQKTFAPVAKIKSIIILLSIAVNFDWPLYQLDVKNAFLNGELEEEVFMDLPLGF
            PE+K  V  +WVF++K N  G   RYKARLVA+GFTQ Y IDY++TFAPVA+I S   +LS+ + ++  ++Q+DVK AFLNG L+EE++M LP G 
Subjt:  ----PEDKKAVGCKWVFTIKCNADGIIERYKARLVAKGFTQTYGIDYQKTFAPVAKIKSIIILLSIAVNFDWPLYQLDVKNAFLNGELEEEVFMDLPLGF

Query:  EVD-----------LGINKSPRAWFERFRKAVTSYGFSQSQADHTMFYQHTEN-DKVVVLIVYVDDIILTGNDETGMSIIK----------DLGSLKYFL
          +            G+ ++ R WFE F +A+    F  S  D  ++     N ++ + +++YVDD+++   D T M+  K          DL  +K+F+
Subjt:  EVD-----------LGINKSPRAWFERFRKAVTSYGFSQSQADHTMFYQHTEN-DKVVVLIVYVDDIILTGNDETGMSIIK----------DLGSLKYFL

Query:  GMEFARSKSGILVNQRKYILDLLKKTGLLGCRIVETPIEKG---KVLETCGKTNIP------------LSHTPDIAFVVSMVEVYT--------------
        G+     +  I ++Q  Y+  +L K  +  C  V TP+      ++L +    N P            L   PD+   V+++  Y+              
Subjt:  GMEFARSKSGILVNQRKYILDLLKKTGLLGCRIVETPIEKG---KVLETCGKTNIP------------LSHTPDIAFVVSMVEVYT--------------

Query:  ----------------------------DADWAGSTTDRRSTSGYC-SFVGGKLVTWHSKKQSVVARSSAETEFRAL-----------------------
                                    D+DWAGS  DR+ST+GY        L+ W++K+Q+ VA SS E E+ AL                       
Subjt:  ----------------------------DADWAGSTTDRRSTSGYC-SFVGGKLVTWHSKKQSVVARSSAETEFRAL-----------------------

Query:  ------------------------AHVD--KNFIKEKIDAGVICIPYLPTTEQIADVLTKGLPKLQFNKLTDKLAM
                                 H+D   +F +E++   VIC+ Y+PT  Q+AD+ TK LP  +F +L DKL +
Subjt:  ------------------------AHVD--KNFIKEKIDAGVICIPYLPTTEQIADVLTKGLPKLQFNKLTDKLAM

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.4e-7925.83Show/hide
Query:  LILRFVLHVPQLACNLLS--------------------DSGEMIEHARMINGLYYFDEVSTSNKKIQGLSSVSSLSVQETIMLWHRRLGHPNFVYLKYLF
        L+L+ V HVP L  NL+S                      G ++    +  G  Y     T+ +  QG   +++   + ++ LWH+R+GH +   L+ L 
Subjt:  LILRFVLHVPQLACNLLS--------------------DSGEMIEHARMINGLYYFDEVSTSNKKIQGLSSVSSLSVQETIMLWHRRLGHPNFVYLKYLF

Query:  PD--LFKGIDCSVFQCEDCIFAKHHQSTFSPKSYKSSSPFYLIHTDVWGPSKVLTKNGKRWFVTFIDDHTRLTWLYLLTKKSNVKEVFVRFYKMIETQFQ
            +      +V  C+ C+F K H+ +F   S +  +   L+++DV GP ++ +  G ++FVTFIDD +R  W+Y+L  K  V +VF +F+ ++E +  
Subjt:  PD--LFKGIDCSVFQCEDCIFAKHHQSTFSPKSYKSSSPFYLIHTDVWGPSKVLTKNGKRWFVTFIDDHTRLTWLYLLTKKSNVKEVFVRFYKMIETQFQ

Query:  AKIRILHSDNEAKFFNEPLTTFLHDKHIVHQATCCNNPQQNGIAKRKYRHLLEVSRALMFSMHVPTYLWGDVGLTAAYLINRIPTKGESSLVEENFWDTS
         K++ L SDN  ++ +     +     I H+ T    PQ NG+A+R  R ++E  R+++    +P   WG+   TA YLINR P+   +  + E  W   
Subjt:  AKIRILHSDNEAKFFNEPLTTFLHDKHIVHQATCCNNPQQNGIAKRKYRHLLEVSRALMFSMHVPTYLWGDVGLTAAYLINRIPTKGESSLVEENFWDTS

Query:  PLPNIISPEIMSFS----PSIPSVENSPAGGETLQIDLIGRNPKLQFHTR-------RNTTQRD---RNQTVELTQDQSDTLVNDPENPGMSFSPSSLNM
            +    +  F       +P  + +    +++    IG   + +F  R       +    RD   R   V    D S+ + N    P     PS+ N 
Subjt:  PLPNIISPEIMSFS----PSIPSVENSPAGGETLQIDLIGRNPKLQFHTR-------RNTTQRD---RNQTVELTQDQSDTLVNDPENPGMSFSPSSLNM

Query:  LPDVPDFDIPIAHRKGGCVGNLLTQNYGC-VGILERKYPSY-------------PWTYINKAAENQIRILRDCLEIDMLDMQLEGKKLLLNLSKEFLNDS
         P   +        +G   G ++ Q      G+ E ++P+              P     +    +  ++ D  E + L      K++L +  K  L  +
Subjt:  LPDVPDFDIPIAHRKGGCVGNLLTQNYGC-VGILERKYPSY-------------PWTYINKAAENQIRILRDCLEIDMLDMQLEGKKLLLNLSKEFLNDS

Query:  GVFSRRQPETNSSKEKIDLPEDKKAVGCKWVFTIKCNADGIIERYKARLVAKGFTQTYGIDYQKTFAPVAKIKSIIILLSIAVNFDWPLYQLDVKNAFLN
                + N + + ++LP+ K+ + CKWVF +K + D  + RYKARLV KGF Q  GID+ + F+PV K+ SI  +LS+A + D  + QLDVK AFL+
Subjt:  GVFSRRQPETNSSKEKIDLPEDKKAVGCKWVFTIKCNADGIIERYKARLVAKGFTQTYGIDYQKTFAPVAKIKSIIILLSIAVNFDWPLYQLDVKNAFLN

Query:  GELEEEVFMDLPLGFEVD-------------LGINKSPRAWFERFRKAVTSYGFSQSQADHTMFYQHTENDKVVVLIVYVDDIILTGNDETGMSII----
        G+LEEE++M+ P GFEV               G+ ++PR W+ +F   + S  + ++ +D  ++++    +  ++L++YVDD+++ G D+  ++ +    
Subjt:  GELEEEVFMDLPLGFEVD-------------LGINKSPRAWFERFRKAVTSYGFSQSQADHTMFYQHTENDKVVVLIVYVDDIILTGNDETGMSII----

Query:  ------KDLGSLKYFLGMEFARSKSG--ILVNQRKYILDLLKKTGLLGCRIVETPI----------------EKGKVLETCGKTNI------PLSHTPDI
              KDLG  +  LGM+  R ++   + ++Q KYI  +L++  +   + V TP+                EKG + +    + +       +   PDI
Subjt:  ------KDLGSLKYFLGMEFARSKSG--ILVNQRKYILDLLKKTGLLGCRIVETPI----------------EKGKVLETCGKTNI------PLSHTPDI

Query:  AFVVSMV---------------------------------------EVYTDADWAGSTTDRRSTSGYCSFVGGKLVTWHSKKQSVVARSSAETEFRALAH
        A  V +V                                       + YTDAD AG   +R+S++GY     G  ++W SK Q  VA S+ E E+ A   
Subjt:  AFVVSMV---------------------------------------EVYTDADWAGSTTDRRSTSGYCSFVGGKLVTWHSKKQSVVARSSAETEFRALAH

Query:  VDKNFI
          K  I
Subjt:  VDKNFI

P92519 Uncharacterized mitochondrial protein AtMg008104.1e-2129.3Show/hide
Query:  LIVYVDDIILTGNDETGMSII----------KDLGSLKYFLGMEFARSKSGILVNQRKYILDLLKKTGLLGCRIVETPI--------------EKGKVLE
        L++YVDDI+LTG+  T ++++          KDLG + YFLG++     SG+ ++Q KY   +L   G+L C+ + TP+              +      
Subjt:  LIVYVDDIILTGNDETGMSII----------KDLGSLKYFLGMEFARSKSGILVNQRKYILDLLKKTGLLGCRIVETPI--------------EKGKVLE

Query:  TCGKTNIPLSHTPDIAFVVSM----------------------------------------VEVYTDADWAGSTTDRRSTSGYCSFVGGKLVTWHSKKQS
          G         PDI++ V++                                        V+ + D+DWAG T+ RRST+G+C+F+G  +++W +K+Q 
Subjt:  TCGKTNIPLSHTPDIAFVVSM----------------------------------------VEVYTDADWAGSTTDRRSTSGYCSFVGGKLVTWHSKKQS

Query:  VVARSSAETEFRALA
         V+RSS ETE+RALA
Subjt:  VVARSSAETEFRALA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.2e-8625.58Show/hide
Query:  NSSPWIIDFGAADHMTS-----------------------------KGIIPLSTK---LILRFVLHVPQLACNLLS------DSGEMIEHARMINGLYYF
        +S+ W++D GA  H+TS                              G   LSTK   L L  +L+VP +  NL+S       +G  +E        +  
Subjt:  NSSPWIIDFGAADHMTS-----------------------------KGIIPLSTK---LILRFVLHVPQLACNLLS------DSGEMIEHARMINGLYYF

Query:  DEVSTSNKKIQG----------------LSSVSSLSVQETIMLWHRRLGHP-----NFVYLKYLFPDLFKGIDCSVFQCEDCIFAKHHQSTFSPKSYKSS
         +++T    +QG                +S  +S S + T   WH RLGHP     N V   Y    L          C DC+  K ++  FS  +  S+
Subjt:  DEVSTSNKKIQG----------------LSSVSSLSVQETIMLWHRRLGHP-----NFVYLKYLFPDLFKGIDCSVFQCEDCIFAKHHQSTFSPKSYKSS

Query:  SPFYLIHTDVWGPSKVLTKNGKRWFVTFIDDHTRLTWLYLLTKKSNVKEVFVRFYKMIETQFQAKIRILHSDNEAKFFNEPLTTFLHDKHIVHQATCCNN
         P   I++DVW  S +L+ +  R++V F+D  TR TWLY L +KS VKE F+ F  ++E +FQ +I   +SDN  +F    L  +     I H  +  + 
Subjt:  SPFYLIHTDVWGPSKVLTKNGKRWFVTFIDDHTRLTWLYLLTKKSNVKEVFVRFYKMIETQFQAKIRILHSDNEAKFFNEPLTTFLHDKHIVHQATCCNN

Query:  PQQNGIAKRKYRHLLEVSRALMFSMHVPTYLWGDVGLTAAYLINRIPTK------------GES------------------------------------
        P+ NG+++RK+RH++E    L+    +P   W      A YLINR+PT             G S                                    
Subjt:  PQQNGIAKRKYRHLLEVSRALMFSMHVPTYLWGDVGLTAAYLINRIPTK------------GES------------------------------------

Query:  ----------------------------------------SLVEENFWDTS----------------PLPNIISPEIMSFSPSIPSV--ENSPAGGETLQ
                                                S V+E   ++S                P P+   P   +  PS PS    NS      L 
Subjt:  ----------------------------------------SLVEENFWDTS----------------PLPNIISPEIMSFSPSIPSV--ENSPAGGETLQ

Query:  IDLIGRNP----------------------KLQFHTRRNTTQRD--RNQTVELTQDQSDTLVNDPENPGMSFSPSSLNMLPDVPDFDIPIAHRKGGCVGN
               P                      + Q H+ +NT+Q +       +L Q  S    +   +P  + S SS +  P  P   I         V N
Subjt:  IDLIGRNP----------------------KLQFHTRRNTTQRD--RNQTVELTQDQSDTLVNDPENPGMSFSPSSLNMLPDVPDFDIPIAHRKGGCVGN

Query:  -----LLTQNYGC---VGILERKYPSYPWTYINKAAENQIRILRDCLEIDMLDMQLEGKKLLLNLSKEFLNDSGVFSRRQPETNSSKEKIDLPEDKKAVG
             L T + G     GI+ +  P Y    ++ AAE++ R     L+                  + + N  G     Q   ++       P     VG
Subjt:  -----LLTQNYGC---VGILERKYPSYPWTYINKAAENQIRILRDCLEIDMLDMQLEGKKLLLNLSKEFLNDSGVFSRRQPETNSSKEKIDLPEDKKAVG

Query:  CKWVFTIKCNADGIIERYKARLVAKGFTQTYGIDYQKTFAPVAKIKSIIILLSIAVNFDWPLYQLDVKNAFLNGELEEEVFMDLPLGF------------
        C+W+FT K N+DG + RYKARLVAKG+ Q  G+DY +TF+PV K  SI I+L +AV+  WP+ QLDV NAFL G L ++V+M  P GF            
Subjt:  CKWVFTIKCNADGIIERYKARLVAKGFTQTYGIDYQKTFAPVAKIKSIIILLSIAVNFDWPLYQLDVKNAFLNGELEEEVFMDLPLGF------------

Query:  -EVDLGINKSPRAWFERFRKAVTSYGFSQSQADHTMFYQHTENDKVVVLIVYVDDIILTGNDETGMS----------IIKDLGSLKYFLGMEFARSKSGI
         +   G+ ++PRAW+   R  + + GF  S +D ++F        +V ++VYVDDI++TGND T +            +KD   L YFLG+E  R  +G+
Subjt:  -EVDLGINKSPRAWFERFRKAVTSYGFSQSQADHTMFYQHTENDKVVVLIVYVDDIILTGNDETGMS----------IIKDLGSLKYFLGMEFARSKSGI

Query:  LVNQRKYILDLLKKTGLLGCRIVETPIEKGKVLETCGKTNIP--------------LSHT-PDIAFVVS-------------------------------
         ++QR+YILDLL +T ++  + V TP+     L     T +               L+ T PDI++ V+                               
Subjt:  LVNQRKYILDLLKKTGLLGCRIVETPIEKGKVLETCGKTNIP--------------LSHT-PDIAFVVS-------------------------------

Query:  ---------MVEVYTDADWAGSTTDRRSTSGYCSFVGGKLVTWHSKKQSVVARSSAETEFRALAH-----------------------------------
                  +  Y+DADWAG   D  ST+GY  ++G   ++W SKKQ  V RSS E E+R++A+                                   
Subjt:  ---------MVEVYTDADWAGSTTDRRSTSGYCSFVGGKLVTWHSKKQSVVARSSAETEFRALAH-----------------------------------

Query:  --------------VDKNFIKEKIDAGVICIPYLPTTEQIADVLTKGLPKLQFNKLTDKLAMNDI
                      +D +FI+ ++ +G + + ++ T +Q+AD LTK L +  F     K+ +  +
Subjt:  --------------VDKNFIKEKIDAGVICIPYLPTTEQIADVLTKGLPKLQFNKLTDKLAMNDI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.5e-8325.55Show/hide
Query:  QGLSSVSSLSVQETIMLWHRRLGHPNFVYLKYLFPDLFKGI---DCSVFQCEDCIFAKHHQSTFSPKSYKSSSPFYLIHTDVWGPSKVLTKNGKRWFVTF
        Q +S  +S   + T   WH RLGHP+   L  +  +    +      +  C DC   K H+  FS  +  SS P   I++DVW  S +L+ +  R++V F
Subjt:  QGLSSVSSLSVQETIMLWHRRLGHPNFVYLKYLFPDLFKGI---DCSVFQCEDCIFAKHHQSTFSPKSYKSSSPFYLIHTDVWGPSKVLTKNGKRWFVTF

Query:  IDDHTRLTWLYLLTKKSNVKEVFVRFYKMIETQFQAKIRILHSDNEAKFFNEPLTTFLHDKHIVHQATCCNNPQQNGIAKRKYRHLLEVSRALMFSMHVP
        +D  TR TWLY L +KS VK+ F+ F  ++E +FQ +I  L+SDN  +F    L  +L    I H  +  + P+ NG+++RK+RH++E+   L+    VP
Subjt:  IDDHTRLTWLYLLTKKSNVKEVFVRFYKMIETQFQAKIRILHSDNEAKFFNEPLTTFLHDKHIVHQATCCNNPQQNGIAKRKYRHLLEVSRALMFSMHVP

Query:  TYLWGDVGLTAAYLINRIPTK-------------------------------------------------------------------------------
           W      A YLINR+PT                                                                                
Subjt:  TYLWGDVGLTAAYLINRIPTK-------------------------------------------------------------------------------

Query:  ----------GESSLVEENF-----W-----------------------DTSPLPNIISPEIMS---FSPSIPSVE-NSPAGGETLQIDLIGRNPKLQFH
                  G S+  E+       W                       DTSP P      + +    S ++PS   +SP+  E       G  P  Q H
Subjt:  ----------GESSLVEENF-----W-----------------------DTSPLPNIISPEIMS---FSPSIPSVE-NSPAGGETLQIDLIGRNPKLQFH

Query:  TRRNTTQRDRNQTVELTQDQSDTLVNDPENPGMSFSPSSLNMLPDVPDFDIPIAHRKGGCVG----NLLTQNYGCVGILERKYPSYPWTYINKAAENQIR
          +N+             + +  ++N+P NP  S SP+S N    +P   I   H           N  + +      L    P+ P   +N  A     
Subjt:  TRRNTTQRDRNQTVELTQDQSDTLVNDPENPGMSFSPSSLNMLPDVPDFDIPIAHRKGGCVG----NLLTQNYGCVGILERKYPSYPWTYINKAAENQIR

Query:  ILRDCLEIDMLDMQLEGKKLLLNLSKEFLNDSGVFSRRQPETNSSKEKIDL------------------------PEDKKAVGCKWVFTIKCNADGIIER
               ++   M    K  +   ++++   + + +  +P T     K D                         P     VGC+W+FT K N+DG + R
Subjt:  ILRDCLEIDMLDMQLEGKKLLLNLSKEFLNDSGVFSRRQPETNSSKEKIDL------------------------PEDKKAVGCKWVFTIKCNADGIIER

Query:  YKARLVAKGFTQTYGIDYQKTFAPVAKIKSIIILLSIAVNFDWPLYQLDVKNAFLNGELEEEVFMDLPLGF-------------EVDLGINKSPRAWFER
        YKARLVAKG+ Q  G+DY +TF+PV K  SI I+L +AV+  WP+ QLDV NAFL G L +EV+M  P GF             +   G+ ++PRAW+  
Subjt:  YKARLVAKGFTQTYGIDYQKTFAPVAKIKSIIILLSIAVNFDWPLYQLDVKNAFLNGELEEEVFMDLPLGF-------------EVDLGINKSPRAWFER

Query:  FRKAVTSYGFSQSQADHTMFYQHTENDKVVVLIVYVDDIILTGNDETGMS----------IIKDLGSLKYFLGMEFARSKSGILVNQRKYILDLLKKTGL
         R  + + GF  S +D ++F        ++ ++VYVDDI++TGND   +            +K+   L YFLG+E  R   G+ ++QR+Y LDLL +T +
Subjt:  FRKAVTSYGFSQSQADHTMFYQHTENDKVVVLIVYVDDIILTGNDETGMS----------IIKDLGSLKYFLGMEFARSKSGILVNQRKYILDLLKKTGL

Query:  LGCRIVETPIEKGKVLETCGKTNIP--------------LSHT-PDIAFVVS----------------------------------------MVEVYTDA
        L  + V TP+     L     T +P              L+ T PD+++ V+                                         +  Y+DA
Subjt:  LGCRIVETPIEKGKVLETCGKTNIP--------------LSHT-PDIAFVVS----------------------------------------MVEVYTDA

Query:  DWAGSTTDRRSTSGYCSFVGGKLVTWHSKKQSVVARSSAETEFRALAH-------------------------------------------------VDK
        DWAG T D  ST+GY  ++G   ++W SKKQ  V RSS E E+R++A+                                                 +D 
Subjt:  DWAGSTTDRRSTSGYCSFVGGKLVTWHSKKQSVVARSSAETEFRALAH-------------------------------------------------VDK

Query:  NFIKEKIDAGVICIPYLPTTEQIADVLTKGLPKLQFNKLTDKLAM
        +FI+ ++ +G + + ++ T +Q+AD LTK L ++ F   + K+ +
Subjt:  NFIKEKIDAGVICIPYLPTTEQIADVLTKGLPKLQFNKLTDKLAM

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.9e-6637.59Show/hide
Query:  NLSKEFLNDSGVFSRR--QPETNSSKEKIDLPEDKKAVGCKWVFTIKCNADGIIERYKARLVAKGFTQTYGIDYQKTFAPVAKIKSIIILLSIAVNFDWP
        N +KEFL   G         ET  + E   LP +KK +GCKWV+ IK N+DG IERYKARLVAKG+TQ  GID+ +TF+PV K+ S+ ++L+I+  +++ 
Subjt:  NLSKEFLNDSGVFSRR--QPETNSSKEKIDLPEDKKAVGCKWVFTIKCNADGIIERYKARLVAKGFTQTYGIDYQKTFAPVAKIKSIIILLSIAVNFDWP

Query:  LYQLDVKNAFLNGELEEEVFMDLPLGFEVD-----------------LGINKSPRAWFERFRKAVTSYGFSQSQADHTMFYQHTENDKVVVLIVYVDDII
        L+QLD+ NAFLNG+L+EE++M LP G+                     G+ ++ R WF +F   +  +GF QS +DHT F + T    + VL VYVDDII
Subjt:  LYQLDVKNAFLNGELEEEVFMDLPLGFEVD-----------------LGINKSPRAWFERFRKAVTSYGFSQSQADHTMFYQHTENDKVVVLIVYVDDII

Query:  LTGNDETGMSIIK----------DLGSLKYFLGMEFARSKSGILVNQRKYILDLLKKTGLLGCRIVETPIEKGKVLET---------------CGKTNIP
        +  N++  +  +K          DLG LKYFLG+E ARS +GI + QRKY LDLL +TGLLGC+    P++                       G+    
Subjt:  LTGNDETGMSIIK----------DLGSLKYFLGMEFARSKSGILVNQRKYILDLLKKTGLLGCRIVETPIEKGKVLET---------------CGKTNIP

Query:  LSHTPDIAFVVS----------------------------------------MVEVYTDADWAGSTTDRRSTSGYCSFVGGKLVTWHSKKQSVVARSSAE
             DI+F V+                                         ++V++DA +      RRST+GYC F+G  L++W SKKQ VV++SSAE
Subjt:  LSHTPDIAFVVS----------------------------------------MVEVYTDADWAGSTTDRRSTSGYCSFVGGKLVTWHSKKQSVVARSSAE

Query:  TEFRALA
         E+RAL+
Subjt:  TEFRALA

ATMG00810.1 DNA/RNA polymerases superfamily protein2.9e-2229.3Show/hide
Query:  LIVYVDDIILTGNDETGMSII----------KDLGSLKYFLGMEFARSKSGILVNQRKYILDLLKKTGLLGCRIVETPI--------------EKGKVLE
        L++YVDDI+LTG+  T ++++          KDLG + YFLG++     SG+ ++Q KY   +L   G+L C+ + TP+              +      
Subjt:  LIVYVDDIILTGNDETGMSII----------KDLGSLKYFLGMEFARSKSGILVNQRKYILDLLKKTGLLGCRIVETPI--------------EKGKVLE

Query:  TCGKTNIPLSHTPDIAFVVSM----------------------------------------VEVYTDADWAGSTTDRRSTSGYCSFVGGKLVTWHSKKQS
          G         PDI++ V++                                        V+ + D+DWAG T+ RRST+G+C+F+G  +++W +K+Q 
Subjt:  TCGKTNIPLSHTPDIAFVVSM----------------------------------------VEVYTDADWAGSTTDRRSTSGYCSFVGGKLVTWHSKKQS

Query:  VVARSSAETEFRALA
         V+RSS ETE+RALA
Subjt:  VVARSSAETEFRALA

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.5e-1039.56Show/hide
Query:  LNDSGVFSRRQPETNS-SKEK----IDLPEDKKAVGCKWVFTIKCNADGIIERYKARLVAKGFTQTYGIDYQKTFAPVAKIKSIIILLSIA
        L D G     Q E ++ S+ K    +  P ++  +GCKWVF  K ++DG ++R KARLVAKGF Q  GI + +T++PV +  +I  +L++A
Subjt:  LNDSGVFSRRQPETNS-SKEK----IDLPEDKKAVGCKWVFTIKCNADGIIERYKARLVAKGFTQTYGIDYQKTFAPVAKIKSIIILLSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAATGGGTAGGTTGTTGCGCTTGTTGAGATTGAAACTAAGGAAACTGACATTATCCATGAAGAAAGTCGCATTCCCTAGCCGTCGCCGTCGCCGATGCCTACGCTT
CTGCTGTGACCTGGGTTACCTCTTTCTCCACCGGTCGACCAAGTCTGCACCACTGTCCGAGGGATTTTCGGCCAGTTTTGGTGACTCTATGGTTCCTTGTTCTGCCGCTA
AGTGGAAGTTTTTGAGACTAAGGTATCTACCGCCAAAGTCTTCGACAATCAGATCCATTCCAACACTCCCACTGTCCAAATCACCACCATTCGACTTAACGGGGATAACT
TTCTTCATAGGGTATCTCATCGGAGAAAAAATCACTCTAAGTCCAGATGACCCTTTATTCATTGTGTGGGATGCTGAAAACTCCATGGTTATGACATGGCTAGTCAACTC
CATGGTGAAAGATATCAGTAGTAACTGCATGTGCTACATTACGGCCAAGGAATTATGGGACAGTGTGACTCAAATGTATTCTGATTTGGGTAACCAGTCACAAGTGTTCG
AGCTGAATCTTAAGTTGGAACTTGATCTGTTTGATACATATGAGTGGAAGTCCACAGACGACCAAAAAAATTATCTGAAAACTGTTGAAGATGGTAGCATTTACAAACTT
CTTGCTAGTCTCAATGTTGAGTTTGATGAGGTTAGAGGCAGGATACTTGGGAAAAGTACTCTTCCTAATATTAACGATGTTTTTTCTAAAGTTCGCAGGGAGGAAAGTCG
CAGGAATGTTATGATTGGAAAGAAAGCAATTGACTCAGCTGAAAGTTCCGCGTTAGTGATTGAAAATACTGCAATGAAAGCTTCCAATCAATCCAATGAAACTCATGACA
AGCCTCGTCATGCCTCCAATGCAAATATTGTTGATTCCAATCCACTCAAAGAGCAAATTGATCAAATCCTGAAGTTGCTAAAATCCAATTCATTGGGTAATCCTAGTTTT
TCCTTGGCACAAACAGGTAATTCCCGTCAAGCCCTCTCGTGTCTAAATTCCTCTCCGTGGATCATTGATTTCGGAGCTGCTGATCATATGACTAGCAAAGGAATTATTCC
CCTAAGTACAAAACTCATACTACGTTTTGTCCTTCATGTTCCTCAACTAGCTTGTAATTTATTATCTGACTCGGGGGAGATGATTGAACATGCTAGGATGATTAATGGTC
TCTATTACTTTGATGAAGTTTCAACTAGTAATAAAAAGATTCAGGGCTTGAGTAGTGTCAGTTCTCTTTCTGTTCAAGAAACTATTATGCTTTGGCATCGTAGATTAGGA
CATCCTAATTTCGTTTATTTAAAGTACTTGTTTCCTGATTTATTTAAAGGAATTGATTGTTCTGTTTTCCAATGCGAAGATTGCATTTTTGCCAAACATCATCAATCTAC
TTTTTCACCTAAATCTTATAAATCTTCATCACCTTTTTACTTAATTCATACTGATGTTTGGGGTCCGTCTAAGGTTTTGACCAAAAATGGCAAGCGTTGGTTTGTTACCT
TTATCGATGATCACACCCGTTTAACTTGGCTTTACTTACTTACGAAAAAGTCAAATGTAAAAGAGGTCTTTGTTCGTTTTTACAAAATGATTGAGACTCAATTTCAAGCT
AAAATTCGCATTCTTCACTCTGATAATGAAGCTAAATTTTTTAACGAACCATTAACTACCTTTTTACATGACAAGCACATCGTTCATCAAGCTACATGTTGCAATAATCC
TCAACAAAATGGTATTGCTAAACGAAAATATCGACATTTACTTGAAGTTTCTCGTGCCCTTATGTTTTCTATGCATGTTCCAACATATTTGTGGGGGGATGTTGGTCTAA
CTGCTGCTTATCTAATAAATAGAATACCTACTAAGGGGGAGTCATCTCTCGTTGAAGAGAATTTTTGGGACACTTCACCTCTCCCAAACATCATTAGTCCTGAAATTATG
AGCTTTAGTCCTTCGATCCCAAGTGTGGAAAATTCTCCGGCAGGGGGAGAAACACTACAAATAGATTTGATAGGTCGAAATCCTAAACTTCAATTTCATACTAGAAGAAA
CACAACTCAAAGGGATAGAAATCAGACAGTCGAGCTAACACAAGACCAATCTGATACTCTAGTGAATGATCCTGAAAATCCAGGTATGTCTTTTAGTCCTTCCTCTCTTA
ATATGTTGCCCGATGTCCCTGATTTTGATATTCCAATTGCCCATAGAAAAGGTGGATGCGTTGGCAACCTCTTAACGCAGAACTATGGATGCGTTGGCATACTTGAACGC
AAGTACCCAAGTTATCCGTGGACTTATATCAACAAGGCAGCGGAGAATCAAATAAGAATCTTGAGAGATTGTTTGGAGATTGATATGCTTGATATGCAACTTGAAGGCAA
AAAGTTATTATTGAACCTTTCGAAAGAATTCCTAAATGACAGCGGAGTTTTCTCCAGACGACAACCAGAGACAAATAGCTCGAAAGAGAAGATTGATCTACCAGAAGACA
AGAAAGCAGTGGGATGTAAGTGGGTTTTCACGATAAAATGTAATGCTGATGGTATTATCGAAAGGTACAAGGCCAGATTAGTGGCTAAGGGATTCACCCAGACCTATGGA
ATTGATTATCAAAAGACATTTGCCCCTGTAGCTAAAATTAAGTCAATTATAATTTTGCTCTCTATTGCAGTTAATTTTGATTGGCCACTTTATCAACTTGATGTTAAAAA
TGCATTTCTTAATGGGGAACTTGAAGAAGAAGTATTTATGGACTTACCACTTGGTTTTGAAGTTGACCTTGGGATTAACAAGTCTCCTAGAGCTTGGTTTGAACGTTTTA
GAAAGGCAGTCACAAGCTATGGATTCAGCCAAAGTCAAGCCGACCATACTATGTTCTACCAGCATACAGAAAATGACAAGGTAGTTGTTCTGATAGTGTATGTCGATGAT
ATCATTCTTACAGGCAATGATGAGACAGGAATGTCTATTATCAAAGACCTGGGATCCTTAAAGTACTTTCTTGGCATGGAGTTTGCTAGGTCTAAAAGTGGTATCCTTGT
CAATCAAAGAAAGTATATCCTTGATCTACTCAAAAAGACAGGTTTACTTGGTTGCCGAATTGTAGAAACTCCCATTGAAAAAGGGAAAGTACTAGAGACTTGTGGGAAGA
CTAATATACCTCTCTCACACACGCCCGACATTGCTTTTGTAGTTAGTATGGTTGAAGTTTATACTGATGCTGATTGGGCAGGTAGCACGACAGATAGGAGATCAACTTCT
GGGTATTGCTCCTTTGTTGGAGGAAAGTTAGTTACTTGGCATAGCAAAAAACAGAGTGTAGTTGCAAGAAGTAGTGCTGAAACAGAATTTAGAGCATTGGCCCATGTTGA
TAAAAATTTCATAAAAGAAAAGATTGATGCAGGAGTGATATGCATTCCCTACCTCCCAACAACAGAACAAATTGCAGATGTATTAACTAAAGGTCTTCCTAAGTTACAAT
TCAACAAGTTAACAGACAAGCTGGCCATGAATGATATCTTCAAACTAGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTAATGGGTAGGTTGTTGCGCTTGTTGAGATTGAAACTAAGGAAACTGACATTATCCATGAAGAAAGTCGCATTCCCTAGCCGTCGCCGTCGCCGATGCCTACGCTT
CTGCTGTGACCTGGGTTACCTCTTTCTCCACCGGTCGACCAAGTCTGCACCACTGTCCGAGGGATTTTCGGCCAGTTTTGGTGACTCTATGGTTCCTTGTTCTGCCGCTA
AGTGGAAGTTTTTGAGACTAAGGTATCTACCGCCAAAGTCTTCGACAATCAGATCCATTCCAACACTCCCACTGTCCAAATCACCACCATTCGACTTAACGGGGATAACT
TTCTTCATAGGGTATCTCATCGGAGAAAAAATCACTCTAAGTCCAGATGACCCTTTATTCATTGTGTGGGATGCTGAAAACTCCATGGTTATGACATGGCTAGTCAACTC
CATGGTGAAAGATATCAGTAGTAACTGCATGTGCTACATTACGGCCAAGGAATTATGGGACAGTGTGACTCAAATGTATTCTGATTTGGGTAACCAGTCACAAGTGTTCG
AGCTGAATCTTAAGTTGGAACTTGATCTGTTTGATACATATGAGTGGAAGTCCACAGACGACCAAAAAAATTATCTGAAAACTGTTGAAGATGGTAGCATTTACAAACTT
CTTGCTAGTCTCAATGTTGAGTTTGATGAGGTTAGAGGCAGGATACTTGGGAAAAGTACTCTTCCTAATATTAACGATGTTTTTTCTAAAGTTCGCAGGGAGGAAAGTCG
CAGGAATGTTATGATTGGAAAGAAAGCAATTGACTCAGCTGAAAGTTCCGCGTTAGTGATTGAAAATACTGCAATGAAAGCTTCCAATCAATCCAATGAAACTCATGACA
AGCCTCGTCATGCCTCCAATGCAAATATTGTTGATTCCAATCCACTCAAAGAGCAAATTGATCAAATCCTGAAGTTGCTAAAATCCAATTCATTGGGTAATCCTAGTTTT
TCCTTGGCACAAACAGGTAATTCCCGTCAAGCCCTCTCGTGTCTAAATTCCTCTCCGTGGATCATTGATTTCGGAGCTGCTGATCATATGACTAGCAAAGGAATTATTCC
CCTAAGTACAAAACTCATACTACGTTTTGTCCTTCATGTTCCTCAACTAGCTTGTAATTTATTATCTGACTCGGGGGAGATGATTGAACATGCTAGGATGATTAATGGTC
TCTATTACTTTGATGAAGTTTCAACTAGTAATAAAAAGATTCAGGGCTTGAGTAGTGTCAGTTCTCTTTCTGTTCAAGAAACTATTATGCTTTGGCATCGTAGATTAGGA
CATCCTAATTTCGTTTATTTAAAGTACTTGTTTCCTGATTTATTTAAAGGAATTGATTGTTCTGTTTTCCAATGCGAAGATTGCATTTTTGCCAAACATCATCAATCTAC
TTTTTCACCTAAATCTTATAAATCTTCATCACCTTTTTACTTAATTCATACTGATGTTTGGGGTCCGTCTAAGGTTTTGACCAAAAATGGCAAGCGTTGGTTTGTTACCT
TTATCGATGATCACACCCGTTTAACTTGGCTTTACTTACTTACGAAAAAGTCAAATGTAAAAGAGGTCTTTGTTCGTTTTTACAAAATGATTGAGACTCAATTTCAAGCT
AAAATTCGCATTCTTCACTCTGATAATGAAGCTAAATTTTTTAACGAACCATTAACTACCTTTTTACATGACAAGCACATCGTTCATCAAGCTACATGTTGCAATAATCC
TCAACAAAATGGTATTGCTAAACGAAAATATCGACATTTACTTGAAGTTTCTCGTGCCCTTATGTTTTCTATGCATGTTCCAACATATTTGTGGGGGGATGTTGGTCTAA
CTGCTGCTTATCTAATAAATAGAATACCTACTAAGGGGGAGTCATCTCTCGTTGAAGAGAATTTTTGGGACACTTCACCTCTCCCAAACATCATTAGTCCTGAAATTATG
AGCTTTAGTCCTTCGATCCCAAGTGTGGAAAATTCTCCGGCAGGGGGAGAAACACTACAAATAGATTTGATAGGTCGAAATCCTAAACTTCAATTTCATACTAGAAGAAA
CACAACTCAAAGGGATAGAAATCAGACAGTCGAGCTAACACAAGACCAATCTGATACTCTAGTGAATGATCCTGAAAATCCAGGTATGTCTTTTAGTCCTTCCTCTCTTA
ATATGTTGCCCGATGTCCCTGATTTTGATATTCCAATTGCCCATAGAAAAGGTGGATGCGTTGGCAACCTCTTAACGCAGAACTATGGATGCGTTGGCATACTTGAACGC
AAGTACCCAAGTTATCCGTGGACTTATATCAACAAGGCAGCGGAGAATCAAATAAGAATCTTGAGAGATTGTTTGGAGATTGATATGCTTGATATGCAACTTGAAGGCAA
AAAGTTATTATTGAACCTTTCGAAAGAATTCCTAAATGACAGCGGAGTTTTCTCCAGACGACAACCAGAGACAAATAGCTCGAAAGAGAAGATTGATCTACCAGAAGACA
AGAAAGCAGTGGGATGTAAGTGGGTTTTCACGATAAAATGTAATGCTGATGGTATTATCGAAAGGTACAAGGCCAGATTAGTGGCTAAGGGATTCACCCAGACCTATGGA
ATTGATTATCAAAAGACATTTGCCCCTGTAGCTAAAATTAAGTCAATTATAATTTTGCTCTCTATTGCAGTTAATTTTGATTGGCCACTTTATCAACTTGATGTTAAAAA
TGCATTTCTTAATGGGGAACTTGAAGAAGAAGTATTTATGGACTTACCACTTGGTTTTGAAGTTGACCTTGGGATTAACAAGTCTCCTAGAGCTTGGTTTGAACGTTTTA
GAAAGGCAGTCACAAGCTATGGATTCAGCCAAAGTCAAGCCGACCATACTATGTTCTACCAGCATACAGAAAATGACAAGGTAGTTGTTCTGATAGTGTATGTCGATGAT
ATCATTCTTACAGGCAATGATGAGACAGGAATGTCTATTATCAAAGACCTGGGATCCTTAAAGTACTTTCTTGGCATGGAGTTTGCTAGGTCTAAAAGTGGTATCCTTGT
CAATCAAAGAAAGTATATCCTTGATCTACTCAAAAAGACAGGTTTACTTGGTTGCCGAATTGTAGAAACTCCCATTGAAAAAGGGAAAGTACTAGAGACTTGTGGGAAGA
CTAATATACCTCTCTCACACACGCCCGACATTGCTTTTGTAGTTAGTATGGTTGAAGTTTATACTGATGCTGATTGGGCAGGTAGCACGACAGATAGGAGATCAACTTCT
GGGTATTGCTCCTTTGTTGGAGGAAAGTTAGTTACTTGGCATAGCAAAAAACAGAGTGTAGTTGCAAGAAGTAGTGCTGAAACAGAATTTAGAGCATTGGCCCATGTTGA
TAAAAATTTCATAAAAGAAAAGATTGATGCAGGAGTGATATGCATTCCCTACCTCCCAACAACAGAACAAATTGCAGATGTATTAACTAAAGGTCTTCCTAAGTTACAAT
TCAACAAGTTAACAGACAAGCTGGCCATGAATGATATCTTCAAACTAGCTTGA
Protein sequenceShow/hide protein sequence
MVMGRLLRLLRLKLRKLTLSMKKVAFPSRRRRRCLRFCCDLGYLFLHRSTKSAPLSEGFSASFGDSMVPCSAAKWKFLRLRYLPPKSSTIRSIPTLPLSKSPPFDLTGIT
FFIGYLIGEKITLSPDDPLFIVWDAENSMVMTWLVNSMVKDISSNCMCYITAKELWDSVTQMYSDLGNQSQVFELNLKLELDLFDTYEWKSTDDQKNYLKTVEDGSIYKL
LASLNVEFDEVRGRILGKSTLPNINDVFSKVRREESRRNVMIGKKAIDSAESSALVIENTAMKASNQSNETHDKPRHASNANIVDSNPLKEQIDQILKLLKSNSLGNPSF
SLAQTGNSRQALSCLNSSPWIIDFGAADHMTSKGIIPLSTKLILRFVLHVPQLACNLLSDSGEMIEHARMINGLYYFDEVSTSNKKIQGLSSVSSLSVQETIMLWHRRLG
HPNFVYLKYLFPDLFKGIDCSVFQCEDCIFAKHHQSTFSPKSYKSSSPFYLIHTDVWGPSKVLTKNGKRWFVTFIDDHTRLTWLYLLTKKSNVKEVFVRFYKMIETQFQA
KIRILHSDNEAKFFNEPLTTFLHDKHIVHQATCCNNPQQNGIAKRKYRHLLEVSRALMFSMHVPTYLWGDVGLTAAYLINRIPTKGESSLVEENFWDTSPLPNIISPEIM
SFSPSIPSVENSPAGGETLQIDLIGRNPKLQFHTRRNTTQRDRNQTVELTQDQSDTLVNDPENPGMSFSPSSLNMLPDVPDFDIPIAHRKGGCVGNLLTQNYGCVGILER
KYPSYPWTYINKAAENQIRILRDCLEIDMLDMQLEGKKLLLNLSKEFLNDSGVFSRRQPETNSSKEKIDLPEDKKAVGCKWVFTIKCNADGIIERYKARLVAKGFTQTYG
IDYQKTFAPVAKIKSIIILLSIAVNFDWPLYQLDVKNAFLNGELEEEVFMDLPLGFEVDLGINKSPRAWFERFRKAVTSYGFSQSQADHTMFYQHTENDKVVVLIVYVDD
IILTGNDETGMSIIKDLGSLKYFLGMEFARSKSGILVNQRKYILDLLKKTGLLGCRIVETPIEKGKVLETCGKTNIPLSHTPDIAFVVSMVEVYTDADWAGSTTDRRSTS
GYCSFVGGKLVTWHSKKQSVVARSSAETEFRALAHVDKNFIKEKIDAGVICIPYLPTTEQIADVLTKGLPKLQFNKLTDKLAMNDIFKLA