; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041639 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041639
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr13:22648987..22658292
RNA-Seq ExpressionLag0041639
SyntenyLag0041639
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR005162 - Retrotransposon gag domain
IPR012337 - Ribonuclease H-like superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIM97577.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]0.0e+0044.64Show/hide
Query:  SFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTW--DELAENFLSKYFP-----PNRNAKLRSEIVGF---------RQLEDETFSE
        +FL + D+   +GV +DALRL LF +SL   A  W  S    SI TW   E      S++       PN +     ++  F          +L+      
Subjt:  SFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTW--DELAENFLSKYFP-----PNRNAKLRSEIVGF---------RQLEDETFSE

Query:  AWEGLRSFCESVPTMVYLIRHNK--------KVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNP
           G  + C ++   +    + K        K   V+EVD V+ + A +  +  ++KN  V   Q  P               C  CGE H  + CP + 
Subjt:  AWEGLRSFCESVPTMVYLIRHNK--------KVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNP

Query:  ASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGG-----------QGNNVQAQQKMNQ--PGFAKAGI-----APANFDEGTGIASGSVANELKARPQG
         S+ FV N R   NNPYSN YNPGWR HPNFSW             QG   Q QQ M +  P   +  I       ANF +      G +AN + +RPQG
Subjt:  ASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGG-----------QGNNVQAQQKMNQ--PGFAKAGI-----APANFDEGTGIASGSVANELKARPQG

Query:  KLPSDTE-HPRREGKEQVKAVTLRS--ELESG--QGAGGSNKNAGASGSVPDVEPPY------VPPPPYVPPLP-----FPQRQKPKNQDGQFK------
         LPS+TE +PR++GK Q +AVTLR+  EL+    +      K   +     +VE P       +P    +  +P            K   G ++      
Subjt:  KLPSDTE-HPRREGKEQVKAVTLRS--ELESG--QGAGGSNKNAGASGSVPDVEPPY------VPPPPYVPPLP-----FPQRQKPKNQDGQFK------

Query:  --NAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPE------------------------
          +AI++N LPPK KDPGSFTIP +IG    GRALCDLG    L           GEA+PT++TLQLADRS+TYP+                        
Subjt:  --NAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPE------------------------

Query:  ----------GRPFLATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------LETAIQDSADKHSEKHGEVSVED
                  GRPFLATGRTLIDVQ                  KAMK+P+E ++C  + + ++ V            LE A+ D  D+ +E       ED
Subjt:  ----------GRPFLATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------LETAIQDSADKHSEKHGEVSVED

Query:  FEFCSLDRKNEKELFRCEDVFEFLDLDQRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMSEHEEAVIKLLQQYRKAIGWTLADI
        +E   +   +  + F+   V       +R AP   +KPS+ E PTL+LKPLP+HL Y YLGES+TLP+I++S L     E ++++L+ ++ AIGWT+ADI
Subjt:  FEFCSLDRKNEKELFRCEDVFEFLDLDQRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMSEHEEAVIKLLQQYRKAIGWTLADI

Query:  QGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTV----------------------------
        +GISPSFCMHKI LE+    S+E QRRLNP MKEVVKKE+IKWLDAGIIYPI+DS+WVSPVQCVPKKGG+TV                            
Subjt:  QGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTV----------------------------

Query:  --------------MLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGG
                      MLDRLAG+ +YCFLDGYSGYNQI IAPEDQEK TFTCPYGTFAFRRMPFGLCNAPATFQRCM+AIF+DM+E+ +EVFMDDFSV+G 
Subjt:  --------------MLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGG

Query:  SFQNCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGVVLGHKISKNGLQVDRAKVEVIERLEPPNSVKGIRSFLGHAGFYR--------------------
        SF  CL+NL  VLKRCEDT+L+LNWEKCHFMV+EG+VLGHK+S  G++VD+AK+E IE+L PP SVKG+RSFLGHAGFYR                    
Subjt:  SFQNCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGVVLGHKISKNGLQVDRAKVEVIERLEPPNSVKGIRSFLGHAGFYR--------------------

Query:  ----------------------STILCAPNWNLPFEVMCDASDAA-------------------------------------------------------
                              + I+  P+W+ PFE+MCDASD A                                                       
Subjt:  ----------------------STILCAPNWNLPFEVMCDASDAA-------------------------------------------------------

Query:  -------------------------------EFDLEIKDKKGSENVIADHLSRLDPSSSLLEQSAISDSFPDEQLFAVEVKVIRDVPCYAKKSGIARASR
                                       EFDLEI+D+KG+EN IADHLSRL+  +   E + I+D+FPDEQL A+   V  DVP YA          
Subjt:  -------------------------------EFDLEIKDKKGSENVIADHLSRLDPSSSLLEQSAISDSFPDEQLFAVEVKVIRDVPCYAKKSGIARASR

Query:  RCLCSVSMLRRLGAQEKMTSERLDAVLVASQRYACWEPI---KGAFCIHIWCVSGAEAKEILEQCHSSPYGGHFSGQMTAMRILHCGFFWHTLFKDAHRF
           C +     L AQ+K         L  ++RY   +P    +G   I   CV   E  +ILEQCH+SPYGGHF G  TA +IL  GFFW  LFKDAH F
Subjt:  RCLCSVSMLRRLGAQEKMTSERLDAVLVASQRYACWEPI---KGAFCIHIWCVSGAEAKEILEQCHSSPYGGHFSGQMTAMRILHCGFFWHTLFKDAHRF

Query:  YKQCDACQRRGNLGPRDEMALTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVTRFLQSHIFARFGTPRALVSDEGTHF
           CD CQR GN+  R EM L  ILEVELFDVWGIDFMGPF PS GN++IL+AVDYVSKWVEA A   +D+K V  F++ +IF RFGTPRA++SD GTHF
Subjt:  YKQCDACQRRGNLGPRDEMALTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVTRFLQSHIFARFGTPRALVSDEGTHF

Query:  VNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKK
         N     LL+KYG+KH+I+TPYHPQ +GQ E+SNREIK ILEK V  +RKDWS RLDEALWAYRTAYKTP+GMSPYRLV+GKACHLP+ELEH  +WA++K
Subjt:  VNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKK

Query:  LNFDLSRAGAIRMLQLNELEEFRQFSYENAKIYKEKTKLWHDKKIKSKESVKGQKVLLYNSILKLFPGKLKSKWSGPFIVIEVFPNGAITLQDEKDERVF
        LNFD+  AG  R+LQLNEL+EFR  +YENAKIYKEK K WH+KKI  +    GQ VLL+NS LKLFPGKLKS+WSGPF + EVFP+GA+ L+++     F
Subjt:  LNFDLSRAGAIRMLQLNELEEFRQFSYENAKIYKEKTKLWHDKKIKSKESVKGQKVLLYNSILKLFPGKLKSKWSGPFIVIEVFPNGAITLQDEKDERVF

Query:  KVNGQRVKHYWGEEFQSKYPSL
        KVN QR+KHYWGE    ++ S+
Subjt:  KVNGQRVKHYWGEEFQSKYPSL

XP_038972405.1 uncharacterized protein LOC120104748 [Phoenix dactylifera]0.0e+0041.5Show/hide
Query:  NDRTRAIRAYAVPMFNELNPGIACPQIQAANFEMKP------------------------SFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFA
        N   R +  YAVP  N   P I  P + A NFE+KP                        +FL + D+  + GV  DA+RL LFP+SL+D AK+WLNS A
Subjt:  NDRTRAIRAYAVPMFNELNPGIACPQIQAANFEMKP------------------------SFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFA

Query:  PGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWEGLRSFCESVP-----------TMVYLIRHN-----------------------
        P S  TW+ L++ FLSKYFPP + AKLR++I  F Q + E+  EAWE  +      P           T    + H+                       
Subjt:  PGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWEGLRSFCESVP-----------TMVYLIRHN-----------------------

Query:  -------------------KKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN---
                           KKV  + +VDG++ + A +  +      +  ++    P +            +C  CG  H    C      V FV N   
Subjt:  -------------------KKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN---

Query:  --QRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFAKAGIAP-------------------------------ANFDEGTGIASGSVANELKA
          Q+NNPYSN YNPGWRNHPNFSW  QGN   + + ++ PGF      P                               A+ +    I  G +AN + +
Subjt:  --QRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFAKAGIAP-------------------------------ANFDEGTGIASGSVANELKA

Query:  RPQGKLPSDTEHPRREGKEQVKAVTLRSELESGQGAGGS---------NKNAGASGSVPDV---EPPYVPPPPYVPPLPFPQRQKPKNQDGQFK------
        R QG LPS TE      KE  KAVTLRS  + GQ +G +           N   S  V D+     P  P  PYVPP+PFPQR K    D QF+      
Subjt:  RPQGKLPSDTEHPRREGKEQVKAVTLRSELESGQGAGGS---------NKNAGASGSVPDV---EPPYVPPPPYVPPLPFPQRQKPKNQDGQFK------

Query:  ---------------------------------------------NAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGE
                                                     +AI++N LPPK +DPGSF+IP +IG  +  RALCDLGAS++LMPLSV RKLG+ E
Subjt:  ---------------------------------------------NAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGE

Query:  ARPTTVTLQLADRSITYP----------------------------------EGRPFLATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSF
         +PTT++LQLADRS+ YP                                   GRPFLAT   +IDV+ G LT++V  EEV+FN+F+A KYP   +    
Subjt:  ARPTTVTLQLADRSITYP----------------------------------EGRPFLATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSF

Query:  IRILESTVLETAIQDSADKHSE----KHGEVSVEDFEF----CSLDRKNEKELFRCEDVFEFLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGES
        + +++ +  E    ++  +  E      G    ++ E     C+L+    K   R      F D+ + K PP  PS ++AP L+LKPLP HL Y +LGE+
Subjt:  IRILESTVLETAIQDSADKHSE----KHGEVSVEDFEF----CSLDRKNEKELFRCEDVFEFLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGES

Query:  ETLPIIVASDLMSEHEEAVIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQC
         TLP+IV+  L  E  + +I++L+  +KAIGWT++D++GISPS CMH+I +E+     +E QRRLNP MKEVV+ EV+KWLDAGIIYPI+DS W+SPVQ 
Subjt:  ETLPIIVASDLMSEHEEAVIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQC

Query:  VPKKGGVTV------------------------------------------MLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPF
        VPKKGG+TV                                          +L+RLAG AYYCFLDGYSGYNQI+I+PEDQEKTTFTCPYGTFAFRRMPF
Subjt:  VPKKGGVTV------------------------------------------MLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPF

Query:  GLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQNCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGVVLGHKISKNGLQVDRAKVEVIERLEPP
        GLCNAPATFQRCM+AIFSD +E  +EVFMDDFSVFG SF +CLDNL +VL+RCE+T+LVLNWEKCHFMV+EG+VLGHKIS  GL+VDRAK+E+IE+L PP
Subjt:  GLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQNCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGVVLGHKISKNGLQVDRAKVEVIERLEPP

Query:  NSVKGIRSFLGHAGFYR------------------------------------------STILCAPNWNLPFEVMCDASDAA------------------
         +VKG+RSFLGH GFYR                                          + I+ AP+W+LPFE+MCDASD A                  
Subjt:  NSVKGIRSFLGHAGFYR------------------------------------------STILCAPNWNLPFEVMCDASDAA------------------

Query:  --------------------------------------------------------------------EFDLEIKDKKGSENVIADHLSRLDPSSSLLEQ
                                                                            EFDLEI+DK+G ENV+ADHLSRL+   S  ++
Subjt:  --------------------------------------------------------------------EFDLEIKDKKGSENVIADHLSRLDPSSSLLEQ

Query:  SAISDSFPDEQLFAVEVKVIRDVPCYAKKSGIARASRRCLCSVSMLRRLGAQEKMTSERLDAVLVASQRYACWEPIKGAFCIHIW---CVSGAEAKEILE
          I++SFPDEQL AV V     +P YA               V+ L        ++  +    L   + Y   EP+    C       CV   E ++IL+
Subjt:  SAISDSFPDEQLFAVEVKVIRDVPCYAKKSGIARASRRCLCSVSMLRRLGAQEKMTSERLDAVLVASQRYACWEPIKGAFCIHIW---CVSGAEAKEILE

Query:  QCHSSPYGGHFSGQMTAMRILHCGFFWHTLFKDAHRFYKQCDACQRRGNLGPRDEMALTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEA
         CHS   GGHFS   T  ++   GF+W T+++D  ++   CD CQR GN+  ++EM LT ILEVELFD+WGIDFMGPFP S  N +IL+AVDYVSKWVEA
Subjt:  QCHSSPYGGHFSGQMTAMRILHCGFFWHTLFKDAHRFYKQCDACQRRGNLGPRDEMALTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEA

Query:  IACHQSDAKTVTRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAY
         A   +D++ V RF++ +IF+RFG PRA++SDEG+HF N     LL KYG+ H++A  YHPQ NGQ E++NRE+K ILEK V  SRKDW+ +LD+ALWAY
Subjt:  IACHQSDAKTVTRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAY

Query:  RTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKIYKEKTKLWHDKKIKSKESVKGQKVLLYNSIL
        RTA+KTPLGMSPYRLV+GK+CHLP+ELEH+ +WA+K LN DL  AG  R+LQL+ELEEFR  +YEN +IYKEKTK WHDK ++ +    GQ+VLL+NS L
Subjt:  RTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKIYKEKTKLWHDKKIKSKESVKGQKVLLYNSIL

Query:  KLFPGKLKSKWSGPFIVIEVFPNGAITLQDEKDERVFKVNGQRVKHY
        KLFPGKL+S+WSGPF V +V+P GA+ ++ E     FKVNGQR+K Y
Subjt:  KLFPGKLKSKWSGPFIVIEVFPNGAITLQDEKDERVFKVNGQRVKHY

XP_038973683.1 uncharacterized protein LOC120105384 [Phoenix dactylifera]0.0e+0041.5Show/hide
Query:  NDRTRAIRAYAVPMFNELNPGIACPQIQAANFEMKP------------------------SFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFA
        N   R +  YAVP  N   P I  P + A NFE+KP                        +FL + D+  + GV  DA+RL LFP+SL+D AK+WLNS A
Subjt:  NDRTRAIRAYAVPMFNELNPGIACPQIQAANFEMKP------------------------SFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFA

Query:  PGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWEGLRSFCESVP-----------TMVYLIRHN-----------------------
        P S  TW+ L++ FLSKYFPP + AKLR++I  F Q + E+  EAWE  +      P           T    + H+                       
Subjt:  PGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWEGLRSFCESVP-----------TMVYLIRHN-----------------------

Query:  -------------------KKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN---
                           KKV  + +VDG++ + A +  +      +  ++    P +            +C  CG  H    C      V FV N   
Subjt:  -------------------KKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN---

Query:  --QRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFAKAGIAP-------------------------------ANFDEGTGIASGSVANELKA
          Q+NNPYSN YNPGWRNHPNFSW  QGN   + + ++ PGF      P                               A+ +    I  G +AN + +
Subjt:  --QRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFAKAGIAP-------------------------------ANFDEGTGIASGSVANELKA

Query:  RPQGKLPSDTEHPRREGKEQVKAVTLRSELESGQGAGGS---------NKNAGASGSVPDV---EPPYVPPPPYVPPLPFPQRQKPKNQDGQFK------
        R QG LPS TE      KE  KAVTLRS  + GQ +G +           N   S  V D+     P  P  PYVPP+PFPQR K    D QF+      
Subjt:  RPQGKLPSDTEHPRREGKEQVKAVTLRSELESGQGAGGS---------NKNAGASGSVPDV---EPPYVPPPPYVPPLPFPQRQKPKNQDGQFK------

Query:  ---------------------------------------------NAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGE
                                                     +AI++N LPPK +DPGSF+IP +IG  +  RALCDLGAS++LMPLSV RKLG+ E
Subjt:  ---------------------------------------------NAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGE

Query:  ARPTTVTLQLADRSITYP----------------------------------EGRPFLATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSF
         +PTT++LQLADRS+ YP                                   GRPFLAT   +IDV+ G LT++V  EEV+FN+F+A KYP   +    
Subjt:  ARPTTVTLQLADRSITYP----------------------------------EGRPFLATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSF

Query:  IRILESTVLETAIQDSADKHSE----KHGEVSVEDFEF----CSLDRKNEKELFRCEDVFEFLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGES
        + +++ +  E    ++  +  E      G    ++ E     C+L+    K   R      F D+ + K PP  PS ++AP L+LKPLP HL Y +LGE+
Subjt:  IRILESTVLETAIQDSADKHSE----KHGEVSVEDFEF----CSLDRKNEKELFRCEDVFEFLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGES

Query:  ETLPIIVASDLMSEHEEAVIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQC
         TLP+IV+  L  E  + +I++L+  +KAIGWT++D++GISPS CMH+I +E+     +E QRRLNP MKEVV+ EV+KWLDAGIIYPI+DS W+SPVQ 
Subjt:  ETLPIIVASDLMSEHEEAVIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQC

Query:  VPKKGGVTV------------------------------------------MLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPF
        VPKKGG+TV                                          +L+RLAG AYYCFLDGYSGYNQI+I+PEDQEKTTFTCPYGTFAFRRMPF
Subjt:  VPKKGGVTV------------------------------------------MLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPF

Query:  GLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQNCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGVVLGHKISKNGLQVDRAKVEVIERLEPP
        GLCNAPATFQRCM+AIFSD +E  +EVFMDDFSVFG SF +CLDNL +VL+RCE+T+LVLNWEKCHFMV+EG+VLGHKIS  GL+VDRAK+E+IE+L PP
Subjt:  GLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQNCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGVVLGHKISKNGLQVDRAKVEVIERLEPP

Query:  NSVKGIRSFLGHAGFYR------------------------------------------STILCAPNWNLPFEVMCDASDAA------------------
         +VKG+RSFLGH GFYR                                          + I+ AP+W+LPFE+MCDASD A                  
Subjt:  NSVKGIRSFLGHAGFYR------------------------------------------STILCAPNWNLPFEVMCDASDAA------------------

Query:  --------------------------------------------------------------------EFDLEIKDKKGSENVIADHLSRLDPSSSLLEQ
                                                                            EFDLEI+DK+G ENV+ADHLSRL+   S  ++
Subjt:  --------------------------------------------------------------------EFDLEIKDKKGSENVIADHLSRLDPSSSLLEQ

Query:  SAISDSFPDEQLFAVEVKVIRDVPCYAKKSGIARASRRCLCSVSMLRRLGAQEKMTSERLDAVLVASQRYACWEPIKGAFCIHIW---CVSGAEAKEILE
          I++SFPDEQL AV V     +P YA               V+ L        ++  +    L   + Y   EP+    C       CV   E ++IL+
Subjt:  SAISDSFPDEQLFAVEVKVIRDVPCYAKKSGIARASRRCLCSVSMLRRLGAQEKMTSERLDAVLVASQRYACWEPIKGAFCIHIW---CVSGAEAKEILE

Query:  QCHSSPYGGHFSGQMTAMRILHCGFFWHTLFKDAHRFYKQCDACQRRGNLGPRDEMALTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEA
         CHS   GGHFS   T  ++   GF+W T+++D  ++   CD CQR GN+  ++EM LT ILEVELFD+WGIDFMGPFP S  N +IL+AVDYVSKWVEA
Subjt:  QCHSSPYGGHFSGQMTAMRILHCGFFWHTLFKDAHRFYKQCDACQRRGNLGPRDEMALTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEA

Query:  IACHQSDAKTVTRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAY
         A   +D++ V RF++ +IF+RFG PRA++SDEG+HF N     LL KYG+ H++A  YHPQ NGQ E++NRE+K ILEK V  SRKDW+ +LD+ALWAY
Subjt:  IACHQSDAKTVTRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAY

Query:  RTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKIYKEKTKLWHDKKIKSKESVKGQKVLLYNSIL
        RTA+KTPLGMSPYRLV+GK+CHLP+ELEH+ +WA+K LN DL  AG  R+LQL+ELEEFR  +YEN +IYKEKTK WHDK ++ +    GQ+VLL+NS L
Subjt:  RTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKIYKEKTKLWHDKKIKSKESVKGQKVLLYNSIL

Query:  KLFPGKLKSKWSGPFIVIEVFPNGAITLQDEKDERVFKVNGQRVKHY
        KLFPGKL+S+WSGPF V +V+P GA+ ++ E     FKVNGQR+K Y
Subjt:  KLFPGKLKSKWSGPFIVIEVFPNGAITLQDEKDERVFKVNGQRVKHY

XP_038976300.1 uncharacterized protein LOC120107204 [Phoenix dactylifera]0.0e+0041.44Show/hide
Query:  NDRTRAIRAYAVPMFNELNPGIACPQIQAANFEMKP------------------------SFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFA
        N   R +  YAVP  N   P I  P + A NFE+KP                        +FL + D+  + GV  DA+RL LFP+SL+D AK+WLNS A
Subjt:  NDRTRAIRAYAVPMFNELNPGIACPQIQAANFEMKP------------------------SFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFA

Query:  PGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWEGLRSFCESVP-----------TMVYLIRHN-----------------------
        P S  TW+ L++ FLSKYFPP + AKLR++I  F Q + E+  EAWE  +      P           T    + H+                       
Subjt:  PGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWEGLRSFCESVP-----------TMVYLIRHN-----------------------

Query:  -------------------KKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN---
                           KKV  + +VDG++ + A +  +      +  ++    P +            +C  CG  H    C      V FV N   
Subjt:  -------------------KKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN---

Query:  --QRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFAKAGIAP-------------------------------ANFDEGTGIASGSVANELKA
          Q+NNPYSN YNPGWRNHPNFSW  QGN   + + ++ PGF      P                               A+ +    I  G +AN + +
Subjt:  --QRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFAKAGIAP-------------------------------ANFDEGTGIASGSVANELKA

Query:  RPQGKLPSDTEHPRREGKEQVKAVTLRSELESGQGAGGS---------NKNAGASGSVPDV---EPPYVPPPPYVPPLPFPQRQKPKNQDGQFK------
        R QG LPS TE      KE  KAVTLRS  + GQ +G +           N   S  V D+     P  P  PYVPP+PFPQR K    D QF+      
Subjt:  RPQGKLPSDTEHPRREGKEQVKAVTLRSELESGQGAGGS---------NKNAGASGSVPDV---EPPYVPPPPYVPPLPFPQRQKPKNQDGQFK------

Query:  ---------------------------------------------NAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGE
                                                     +AI++N LPPK +DPGSF+IP +IG  +  RALCDLGAS++LMPLSV RKLG+ E
Subjt:  ---------------------------------------------NAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGE

Query:  ARPTTVTLQLADRSITYP----------------------------------EGRPFLATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSF
         +PTT++LQLADRS+ YP                                   GRPFLAT   +IDV+ G LT++V  EEV+FN+F+A KYP   +    
Subjt:  ARPTTVTLQLADRSITYP----------------------------------EGRPFLATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSF

Query:  IRILESTVLETAIQDSADKHSE----KHGEVSVEDFEF----CSLDRKNEKELFRCEDVFEFLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGES
        + +++ +  E    ++  +  E      G    ++ E     C+L+    K   R      F D+ + K PP  PS ++AP L+LKPLP HL Y +LGE+
Subjt:  IRILESTVLETAIQDSADKHSE----KHGEVSVEDFEF----CSLDRKNEKELFRCEDVFEFLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGES

Query:  ETLPIIVASDLMSEHEEAVIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQC
         TLP+IV+  L +E  + +I++L+  +KAIGWT++D++GISPS CMH+I +E+     +E QRRLNP MKEVV+ EV+KWLDAGIIYPI+DS W+SPVQ 
Subjt:  ETLPIIVASDLMSEHEEAVIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQC

Query:  VPKKGGVTV------------------------------------------MLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPF
        VPKKGG+TV                                          +L+RLAG AYYCFLDGYSGYNQI+I+PEDQEKTTFTCPYGTFAFRRMPF
Subjt:  VPKKGGVTV------------------------------------------MLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPF

Query:  GLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQNCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGVVLGHKISKNGLQVDRAKVEVIERLEPP
        GLCNAPATFQRCM+AIFSD +E  +EVFMDDFSVFG SF +CLDNL +VL+RCE+T+LVLNWEKCHFMV+EG++LGHKIS  GL+VDRAK+E+IE+L PP
Subjt:  GLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQNCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGVVLGHKISKNGLQVDRAKVEVIERLEPP

Query:  NSVKGIRSFLGHAGFYR------------------------------------------STILCAPNWNLPFEVMCDASDAA------------------
         +VKG+RSFLGH GFYR                                          + I+ AP+W+LPFE+MCDASD A                  
Subjt:  NSVKGIRSFLGHAGFYR------------------------------------------STILCAPNWNLPFEVMCDASDAA------------------

Query:  --------------------------------------------------------------------EFDLEIKDKKGSENVIADHLSRLDPSSSLLEQ
                                                                            EFDLEI+DK+G ENV+ADHLSRL+   S  ++
Subjt:  --------------------------------------------------------------------EFDLEIKDKKGSENVIADHLSRLDPSSSLLEQ

Query:  SAISDSFPDEQLFAVEVKVIRDVPCYAKKSGIARASRRCLCSVSMLRRLGAQEKMTSERLDAVLVASQRYACWEPIKGAFCIHIW---CVSGAEAKEILE
          I++SFPDEQL AV V     +P YA               V+ L        ++  +    L   + Y   EP+    C       CV   E ++IL+
Subjt:  SAISDSFPDEQLFAVEVKVIRDVPCYAKKSGIARASRRCLCSVSMLRRLGAQEKMTSERLDAVLVASQRYACWEPIKGAFCIHIW---CVSGAEAKEILE

Query:  QCHSSPYGGHFSGQMTAMRILHCGFFWHTLFKDAHRFYKQCDACQRRGNLGPRDEMALTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEA
         CHS   GGHFS   T  ++   GF+W T+++D  ++   CD CQR GN+  ++EM LT ILEVELFD+WGIDFMGPFP S  N +IL+AVDYVSKWVEA
Subjt:  QCHSSPYGGHFSGQMTAMRILHCGFFWHTLFKDAHRFYKQCDACQRRGNLGPRDEMALTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEA

Query:  IACHQSDAKTVTRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAY
         A   +D++ V RF++ +IF+RFG PRA++SDEG+HF N     LL KYG+ H++A  YHPQ NGQ E++NRE+K ILEK V  SRKDW+ +LD+ALWAY
Subjt:  IACHQSDAKTVTRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAY

Query:  RTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKIYKEKTKLWHDKKIKSKESVKGQKVLLYNSIL
        RTA+KTPLGMSPYRLV+GK+CHLP+ELEH+ +WA+K LN DL  AG  R+LQL+ELEEFR  +YEN +IYKEKTK WHDK ++ +    GQ+VLL+NS L
Subjt:  RTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKIYKEKTKLWHDKKIKSKESVKGQKVLLYNSIL

Query:  KLFPGKLKSKWSGPFIVIEVFPNGAITLQDEKDERVFKVNGQRVKHY
        KLFPGKL+S+WSGPF V +V+P GA+ ++ E     FKVNGQR+K Y
Subjt:  KLFPGKLKSKWSGPFIVIEVFPNGAITLQDEKDERVFKVNGQRVKHY

XP_038976409.1 uncharacterized protein LOC113461320 [Phoenix dactylifera]0.0e+0041.33Show/hide
Query:  NDRTRAIRAYAVPMFNELNPGIACPQIQAANFEMKP------------------------SFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFA
        N   R +  YAVP  N   P I  P + A NFE+KP                        +FL + D+  + GV  DA+RL LFP+SL+D AK+WLNS A
Subjt:  NDRTRAIRAYAVPMFNELNPGIACPQIQAANFEMKP------------------------SFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFA

Query:  PGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWEGLRSFCESVP-----------TMVYLIRHN-----------------------
        P S  TW+ L++ FLSKYFPP + AKLR++I  F Q + E+  EAWE  +      P           T    + H+                       
Subjt:  PGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWEGLRSFCESVP-----------TMVYLIRHN-----------------------

Query:  -------------------KKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN---
                           KKV  + +VDG++ + A +  +      +  ++    P +            +C  CG  H      S+   V FV N   
Subjt:  -------------------KKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGN---

Query:  --QRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFAKAGIAP-------------------------------ANFDEGTGIASGSVANELKA
          Q+NNPYSN YNPGWRNHPNFSW  QGN   + + ++ PGF      P                               A+ +    +  G +AN + +
Subjt:  --QRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFAKAGIAP-------------------------------ANFDEGTGIASGSVANELKA

Query:  RPQGKLPSDTEHPRREGKEQVKAVTLRSELESGQGAGGS---------NKNAGASGSVPDV---EPPYVPPPPYVPPLPFPQRQKPKNQDGQFK------
        R QG LPS TE      KE  KAVTLRS  + GQ +  +           N   S  V D+     P  P  PYVPP+PFPQR K    D QF+      
Subjt:  RPQGKLPSDTEHPRREGKEQVKAVTLRSELESGQGAGGS---------NKNAGASGSVPDV---EPPYVPPPPYVPPLPFPQRQKPKNQDGQFK------

Query:  ---------------------------------------------NAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGE
                                                     +AI++N LPPK +DPGSF+IP +IG  +  RALCDLGAS++LMPLSV RKLG+ E
Subjt:  ---------------------------------------------NAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGE

Query:  ARPTTVTLQLADRSITYP----------------------------------EGRPFLATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSF
         +PTT++LQLADRS+ YP                                   GRPFLAT   +IDV+ G LT++V  EEV+FN+F+A KYP   +    
Subjt:  ARPTTVTLQLADRSITYP----------------------------------EGRPFLATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSF

Query:  IRILESTVLETAIQDSADKHSE----KHGEVSVEDFEF----CSLDRKNEKELFRCEDVFEFLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGES
        + +++ +  E    ++  +  E      G    ++ E     C+L+    K   R      F D+ + K PP  PS ++AP L+LKPLP HL Y +LGE+
Subjt:  IRILESTVLETAIQDSADKHSE----KHGEVSVEDFEF----CSLDRKNEKELFRCEDVFEFLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGES

Query:  ETLPIIVASDLMSEHEEAVIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQC
         TLP+IV+  L  E  + +I++L+  +KAIGWT++D++GISPS CMH+I +E+     +E QRRLNP MKEVV+ EV+KWLDAGIIYPI+DS W+SPVQ 
Subjt:  ETLPIIVASDLMSEHEEAVIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQC

Query:  VPKKGGVTV------------------------------------------MLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPF
        VPKKGG+TV                                          +L+RLAG AYYCFLDGYSGYNQI+I+PEDQEKTTFTCPYGTFAFRRMPF
Subjt:  VPKKGGVTV------------------------------------------MLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPF

Query:  GLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQNCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGVVLGHKISKNGLQVDRAKVEVIERLEPP
        GLCNAPATFQRCM+AIFSD +E  +E+FMDDFSVFG SF +CLDNL +VL+RCE+T+LVLNWEKCHFMV+EG+VLGHKIS  GL+VDRAK+E+IE+L PP
Subjt:  GLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQNCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGVVLGHKISKNGLQVDRAKVEVIERLEPP

Query:  NSVKGIRSFLGHAGFYR------------------------------------------STILCAPNWNLPFEVMCDASDAA------------------
         +VKG+RSFLGH GFYR                                          + I+ AP+W+LPFE+MCDASD A                  
Subjt:  NSVKGIRSFLGHAGFYR------------------------------------------STILCAPNWNLPFEVMCDASDAA------------------

Query:  --------------------------------------------------------------------EFDLEIKDKKGSENVIADHLSRLDPSSSLLEQ
                                                                            EFDLEI+DK+G ENV+ADHLSRL+   S  ++
Subjt:  --------------------------------------------------------------------EFDLEIKDKKGSENVIADHLSRLDPSSSLLEQ

Query:  SAISDSFPDEQLFAVEVKVIRDVPCYAKKSGIARASRRCLCSVSMLRRLGAQEKMTSERLDAVLVASQRYACWEPIKGAFCIHIW---CVSGAEAKEILE
          I++SFPDEQL AV V     +P YA               V+ L        ++  +    L   + Y   EP+    C       CV   E ++IL+
Subjt:  SAISDSFPDEQLFAVEVKVIRDVPCYAKKSGIARASRRCLCSVSMLRRLGAQEKMTSERLDAVLVASQRYACWEPIKGAFCIHIW---CVSGAEAKEILE

Query:  QCHSSPYGGHFSGQMTAMRILHCGFFWHTLFKDAHRFYKQCDACQRRGNLGPRDEMALTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEA
         CHS   GGHFS   T  ++   GF+W T+++D  ++   CD CQR GN+  ++EM LT ILEVELFD+WGIDFMGPFP S  N +IL+AVDYVSKWVEA
Subjt:  QCHSSPYGGHFSGQMTAMRILHCGFFWHTLFKDAHRFYKQCDACQRRGNLGPRDEMALTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEA

Query:  IACHQSDAKTVTRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAY
         A   +D++ V RF++ +IF+RFG PRA++SDEG+HF N     LL KYG+ H++A  YHPQ NGQ E++NRE+K ILEK V  SRKDW+ +LD+ALWAY
Subjt:  IACHQSDAKTVTRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAY

Query:  RTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKIYKEKTKLWHDKKIKSKESVKGQKVLLYNSIL
        RTA+KTPLGMSPYRLV+GK+CHLP+ELEH+ +WA+K LN DL  AG  R+LQL+ELEEFR  +YEN +IYKEKTK WHDK ++ +    GQ+VLL+NS L
Subjt:  RTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKIYKEKTKLWHDKKIKSKESVKGQKVLLYNSIL

Query:  KLFPGKLKSKWSGPFIVIEVFPNGAITLQDEKDERVFKVNGQRVKHY
        KLFPGKL+S+WSGPF V +V+P GA+ ++ E     FKVNGQR+K Y
Subjt:  KLFPGKLKSKWSGPFIVIEVFPNGAITLQDEKDERVFKVNGQRVKHY

TrEMBL top hitse value%identityAlignment
A0A2G9FWY3 Reverse transcriptase0.0e+0044.64Show/hide
Query:  SFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTW--DELAENFLSKYFP-----PNRNAKLRSEIVGF---------RQLEDETFSE
        +FL + D+   +GV +DALRL LF +SL   A  W  S    SI TW   E      S++       PN +     ++  F          +L+      
Subjt:  SFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTW--DELAENFLSKYFP-----PNRNAKLRSEIVGF---------RQLEDETFSE

Query:  AWEGLRSFCESVPTMVYLIRHNK--------KVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNP
           G  + C ++   +    + K        K   V+EVD V+ + A +  +  ++KN  V   Q  P               C  CGE H  + CP + 
Subjt:  AWEGLRSFCESVPTMVYLIRHNK--------KVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNP

Query:  ASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGG-----------QGNNVQAQQKMNQ--PGFAKAGI-----APANFDEGTGIASGSVANELKARPQG
         S+ FV N R   NNPYSN YNPGWR HPNFSW             QG   Q QQ M +  P   +  I       ANF +      G +AN + +RPQG
Subjt:  ASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWGG-----------QGNNVQAQQKMNQ--PGFAKAGI-----APANFDEGTGIASGSVANELKARPQG

Query:  KLPSDTE-HPRREGKEQVKAVTLRS--ELESG--QGAGGSNKNAGASGSVPDVEPPY------VPPPPYVPPLP-----FPQRQKPKNQDGQFK------
         LPS+TE +PR++GK Q +AVTLR+  EL+    +      K   +     +VE P       +P    +  +P            K   G ++      
Subjt:  KLPSDTE-HPRREGKEQVKAVTLRS--ELESG--QGAGGSNKNAGASGSVPDVEPPY------VPPPPYVPPLP-----FPQRQKPKNQDGQFK------

Query:  --NAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPE------------------------
          +AI++N LPPK KDPGSFTIP +IG    GRALCDLG    L           GEA+PT++TLQLADRS+TYP+                        
Subjt:  --NAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPE------------------------

Query:  ----------GRPFLATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------LETAIQDSADKHSEKHGEVSVED
                  GRPFLATGRTLIDVQ                  KAMK+P+E ++C  + + ++ V            LE A+ D  D+ +E       ED
Subjt:  ----------GRPFLATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------LETAIQDSADKHSEKHGEVSVED

Query:  FEFCSLDRKNEKELFRCEDVFEFLDLDQRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMSEHEEAVIKLLQQYRKAIGWTLADI
        +E   +   +  + F+   V       +R AP   +KPS+ E PTL+LKPLP+HL Y YLGES+TLP+I++S L     E ++++L+ ++ AIGWT+ADI
Subjt:  FEFCSLDRKNEKELFRCEDVFEFLDLDQRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMSEHEEAVIKLLQQYRKAIGWTLADI

Query:  QGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTV----------------------------
        +GISPSFCMHKI LE+    S+E QRRLNP MKEVVKKE+IKWLDAGIIYPI+DS+WVSPVQCVPKKGG+TV                            
Subjt:  QGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTV----------------------------

Query:  --------------MLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGG
                      MLDRLAG+ +YCFLDGYSGYNQI IAPEDQEK TFTCPYGTFAFRRMPFGLCNAPATFQRCM+AIF+DM+E+ +EVFMDDFSV+G 
Subjt:  --------------MLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGG

Query:  SFQNCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGVVLGHKISKNGLQVDRAKVEVIERLEPPNSVKGIRSFLGHAGFYR--------------------
        SF  CL+NL  VLKRCEDT+L+LNWEKCHFMV+EG+VLGHK+S  G++VD+AK+E IE+L PP SVKG+RSFLGHAGFYR                    
Subjt:  SFQNCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGVVLGHKISKNGLQVDRAKVEVIERLEPPNSVKGIRSFLGHAGFYR--------------------

Query:  ----------------------STILCAPNWNLPFEVMCDASDAA-------------------------------------------------------
                              + I+  P+W+ PFE+MCDASD A                                                       
Subjt:  ----------------------STILCAPNWNLPFEVMCDASDAA-------------------------------------------------------

Query:  -------------------------------EFDLEIKDKKGSENVIADHLSRLDPSSSLLEQSAISDSFPDEQLFAVEVKVIRDVPCYAKKSGIARASR
                                       EFDLEI+D+KG+EN IADHLSRL+  +   E + I+D+FPDEQL A+   V  DVP YA          
Subjt:  -------------------------------EFDLEIKDKKGSENVIADHLSRLDPSSSLLEQSAISDSFPDEQLFAVEVKVIRDVPCYAKKSGIARASR

Query:  RCLCSVSMLRRLGAQEKMTSERLDAVLVASQRYACWEPI---KGAFCIHIWCVSGAEAKEILEQCHSSPYGGHFSGQMTAMRILHCGFFWHTLFKDAHRF
           C +     L AQ+K         L  ++RY   +P    +G   I   CV   E  +ILEQCH+SPYGGHF G  TA +IL  GFFW  LFKDAH F
Subjt:  RCLCSVSMLRRLGAQEKMTSERLDAVLVASQRYACWEPI---KGAFCIHIWCVSGAEAKEILEQCHSSPYGGHFSGQMTAMRILHCGFFWHTLFKDAHRF

Query:  YKQCDACQRRGNLGPRDEMALTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVTRFLQSHIFARFGTPRALVSDEGTHF
           CD CQR GN+  R EM L  ILEVELFDVWGIDFMGPF PS GN++IL+AVDYVSKWVEA A   +D+K V  F++ +IF RFGTPRA++SD GTHF
Subjt:  YKQCDACQRRGNLGPRDEMALTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVTRFLQSHIFARFGTPRALVSDEGTHF

Query:  VNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKK
         N     LL+KYG+KH+I+TPYHPQ +GQ E+SNREIK ILEK V  +RKDWS RLDEALWAYRTAYKTP+GMSPYRLV+GKACHLP+ELEH  +WA++K
Subjt:  VNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKK

Query:  LNFDLSRAGAIRMLQLNELEEFRQFSYENAKIYKEKTKLWHDKKIKSKESVKGQKVLLYNSILKLFPGKLKSKWSGPFIVIEVFPNGAITLQDEKDERVF
        LNFD+  AG  R+LQLNEL+EFR  +YENAKIYKEK K WH+KKI  +    GQ VLL+NS LKLFPGKLKS+WSGPF + EVFP+GA+ L+++     F
Subjt:  LNFDLSRAGAIRMLQLNELEEFRQFSYENAKIYKEKTKLWHDKKIKSKESVKGQKVLLYNSILKLFPGKLKSKWSGPFIVIEVFPNGAITLQDEKDERVF

Query:  KVNGQRVKHYWGEEFQSKYPSL
        KVN QR+KHYWGE    ++ S+
Subjt:  KVNGQRVKHYWGEEFQSKYPSL

A0A2G9G6G2 Reverse transcriptase0.0e+0044.5Show/hide
Query:  SFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTW--DELAENFLSKYFP-----PNRNAKLRSEIVGF---------RQLEDETFSE
        +FL + D+   +GV +DALRL LF +SL   A  W  S    SI TW   E      S++       PN +     ++  F          +L+      
Subjt:  SFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTW--DELAENFLSKYFP-----PNRNAKLRSEIVGF---------RQLEDETFSE

Query:  AWEGLRSFCESVPTMVYLIRHNKKVK--------SVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNP
           G  + C ++   +    + KK +         V+EVD V+ + A +  +  ++KN  V   Q  P               C  CGE H  + CP + 
Subjt:  AWEGLRSFCESVPTMVYLIRHNKKVK--------SVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNP

Query:  ASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWG---GQGNNVQAQQKMNQ----------PGFAKAGI-----APANFDEGTGIASGSVANELKARPQG
         S+ FV N R   NNPYSN YNPGWR HPNFSW    GQG+  + QQ+  Q          P   +  I       ANF +      G +AN + +RPQG
Subjt:  ASVFFVGNQR---NNPYSNFYNPGWRNHPNFSWG---GQGNNVQAQQKMNQ----------PGFAKAGI-----APANFDEGTGIASGSVANELKARPQG

Query:  KLPSDTEHPRREGKEQVKAVTLRS--ELES--GQGAGGSNKNAGASGSVPDVEPPY------VP-------PPPYVPPLPFPQRQKPKNQDGQFK-----
         LPS+TE   R+       VTLR+  EL+    +      K   +     +VE P       +P        P YV  +     +K +  D +       
Subjt:  KLPSDTEHPRREGKEQVKAVTLRS--ELES--GQGAGGSNKNAGASGSVPDVEPPY------VP-------PPPYVPPLPFPQRQKPKNQDGQFK-----

Query:  -NAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPE-------------------------
         + I++N LPPK K+PGSFTIP +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLADRS+TYP+                         
Subjt:  -NAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPE-------------------------

Query:  ---------GRPFLATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILES------------TVLETAIQDSADKHSEKHGEVSVEDF
                 GRPFLATGRTLIDVQ                  KAMK+P+E ++C  + + ++              LE A+ D  D+ +EK  EV     
Subjt:  ---------GRPFLATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILES------------TVLETAIQDSADKHSEKHGEVSVEDF

Query:  EFCSLDRKNEKELFRCEDVFEFLDLDQRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMSEHEEAVIKLLQQYRKAIGWTLADIQ
            +   +  + F+   V       +R AP   +KPS+ E PTL+LKPLP HL Y YLGES+TLP+I++S L     E ++++L+ ++  IGWT+ADI+
Subjt:  EFCSLDRKNEKELFRCEDVFEFLDLDQRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMSEHEEAVIKLLQQYRKAIGWTLADIQ

Query:  GISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTV-----------------------------
        GISPSFCMHKI LE+    SIE QRRLNP MKEVVKKE+IKWLDAGIIYPI+DS+WVSPVQCVPKKGG+TV                             
Subjt:  GISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTV-----------------------------

Query:  -------------MLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGS
                     MLDRLAG+ +YCFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCM+AIF+DM+E+ +EVFMD+FSV+G S
Subjt:  -------------MLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGS

Query:  FQNCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGVVLGHKISKNGLQVDRAKVEVIERLEPPNSVKGIRSFLGHAGFYRSTI-----LCAPNWNL-----
        F  CL+NL  VLKRCEDT+LVLNWEKCHFMV+EG+VLGHK+S  G++VD+AK+E IE+L PP SVKG+RSFLGHAGFYR  I     +  P  NL     
Subjt:  FQNCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGVVLGHKISKNGLQVDRAKVEVIERLEPPNSVKGIRSFLGHAGFYRSTI-----LCAPNWNL-----

Query:  PFEVM------------------------CDASDAA----------------------------------------------------------------
        PF                           CD +  A                                                                
Subjt:  PFEVM------------------------CDASDAA----------------------------------------------------------------

Query:  -------------------EFDLEIKDKKGSENVIADHLSRLDPSSSLLEQSAISDSFPDEQLFAVEVKVIRDVPCYAKKSGIARASRRCLCSVSMLRRL
                           EFDLEI+D+KG+EN IADHLSRL+  +   E + I+D+FPDEQL A+   V  +VP YA             C +      
Subjt:  -------------------EFDLEIKDKKGSENVIADHLSRLDPSSSLLEQSAISDSFPDEQLFAVEVKVIRDVPCYAKKSGIARASRRCLCSVSMLRRL

Query:  GAQEKMTSERLDAVLVASQRYACWEPI---KGAFCIHIWCVSGAEAKEILEQCHSSPYGGHFSGQMTAMRILHCGFFWHTLFKDAHRFYKQCDACQRRGN
             +++++   +L  ++RY   +P    +G   I   CV   E  +ILEQCH+SPYGGHF G  TA +IL  GFFW  LFKDAH F   CD CQR  N
Subjt:  GAQEKMTSERLDAVLVASQRYACWEPI---KGAFCIHIWCVSGAEAKEILEQCHSSPYGGHFSGQMTAMRILHCGFFWHTLFKDAHRFYKQCDACQRRGN

Query:  LGPRDEMALTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVTRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKY
        +  R EM L  ILEVELFDVWGIDFMGPF PS GN++IL+AVDYVSKWVEA A   +D+K V  F++ +IF RFGTPRA++SD  T+F N     LL+KY
Subjt:  LGPRDEMALTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVTRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKY

Query:  GIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIR
        G+KH+I TPYHPQ +G  E+SNREIK ILEK V  +RKDWS RLDEALWAYRTAYKTP+GMSPY L++GKACHLP+ELEH  +WA+ KLNFD+  AG  R
Subjt:  GIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIR

Query:  MLQLNELEEFRQFSYENAKIYKEKTKLWHDKKIKSKESVKGQKVLLYNSILKLFPGKLKSKWSGPFIVIEVFPNGAITLQDEKDERVFKVNGQRVKHYWG
        +LQLNEL+EFR  +YENAKIYKEKTK WHDKKI  +    GQ VLL+NS LKLFPGKLKS+W G F + EVFP+GA+ L++E     FK+N +R+KHYWG
Subjt:  MLQLNELEEFRQFSYENAKIYKEKTKLWHDKKIKSKESVKGQKVLLYNSILKLFPGKLKSKWSGPFIVIEVFPNGAITLQDEKDERVFKVNGQRVKHYWG

A0A2G9HWF8 Reverse transcriptase0.0e+0041.68Show/hide
Query:  AAAKPLFEQNEQQNNQAENPILIAND--RTRAIRAYAVP---------MFNELNPG--IACPQIQ----AANF---------EMKPSFLGVSDSFVIQGV
        A  + L E  EQ+    EN I++  D      +R  A+P         +F EL  G  +  P IQ     A F             +FL + D+   +GV
Subjt:  AAAKPLFEQNEQQNNQAENPILIAND--RTRAIRAYAVP---------MFNELNPG--IACPQIQ----AANF---------EMKPSFLGVSDSFVIQGV

Query:  PRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWEGLRSFCESVPT-------MVYLIRH--
         +DALRL LF +SL   A  W  S    SI TW +L E F+SK+F P + A LR+EI+ FRQ   ET  EAW   R    + P         V+   H  
Subjt:  PRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWEGLRSFCESVPT-------MVYLIRH--

Query:  NKKVKSVLE-VDGVSTIRADLAMIANALKNVTVISHQQ------PPAMEPTAVVNQVT-----------DEACVYCGEDHNYEFCPSNPASVFFVGNQR-
         +  K  L+ ++G S +    A   N L N+ V  +++      PP       V+QVT                 CGE H  + CP +  S+ FV N R 
Subjt:  NKKVKSVLE-VDGVSTIRADLAMIANALKNVTVISHQQ------PPAMEPTAVVNQVT-----------DEACVYCGEDHNYEFCPSNPASVFFVGNQR-

Query:  --NNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFAKAGIAPANFDEGTGIASGSVANELKARPQGKLPSDTEHPRREGKEQVKAVTLRS--ELE
          NNPYSN YNPGWR HPNFSW    NN Q Q           G AP  F +G          +   +P         +PR++GK Q +AVTLR+  +L+
Subjt:  --NNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFAKAGIAPANFDEGTGIASGSVANELKARPQGKLPSDTEHPRREGKEQVKAVTLRS--ELE

Query:  S--GQGAGGSNKNAGASGSVPDVEPPYVPPPPYVPPLPFPQR---QKPKNQDGQFK--------------------------------------------
            +      K   +     +VE P     P     PFPQR   QK K Q  +F                                             
Subjt:  S--GQGAGGSNKNAGASGSVPDVEPPYVPPPPYVPPLPFPQR---QKPKNQDGQFK--------------------------------------------

Query:  ----NAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPE----------------------
            +AI++N LPPK KDPG              RALCDLGASINLMP S+YR LG+ EA+PT++TLQLADRS+TYP+                      
Subjt:  ----NAILKNGLPPKAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPE----------------------

Query:  ------------GRPFLATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------LETAIQDSADKHSEKHGEVSV
                    GRPFLATGRTLIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + + ++              LE A+ D  D+ +E+  EV  
Subjt:  ------------GRPFLATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------LETAIQDSADKHSEKHGEVSV

Query:  EDFEFCSLDRKNEKELFRCEDVFEFLDLDQRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMSEHEEAVIKLLQQYRKAIGWTLA
               +   +  + F+   V       +R AP   +KPS+ E PTL+LKPLP HL Y YLGES+TLP+I++S L     E ++++L+ ++ AIGWT+A
Subjt:  EDFEFCSLDRKNEKELFRCEDVFEFLDLDQRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMSEHEEAVIKLLQQYRKAIGWTLA

Query:  DIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTV--------------------------
        DI+GISPSFCMHKI LE+    S+E QRRLNP MKEVVKKE+IKWLDAGIIYPI+D +W+SPVQCVPKKGG+TV                          
Subjt:  DIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTV--------------------------

Query:  ----------------MLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVF
                        MLDRLAG+ +YCFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAFRR+PF LCNAPATFQRCM+AIF+DM+E+ +EVFMDDFSV+
Subjt:  ----------------MLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVF

Query:  GGSFQNCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGVVLGHKISKNGLQVDRAKVEVIERLEPPNSVKGIRSFLGHAGFYR------------------
        G SF  CL+NL  VLKRCEDT+LVLNWEKCHFMV+EG+VLGHK+S  G++VD+AK+E IE+L P  SVKG+RSFLGHAGFYR                  
Subjt:  GGSFQNCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGVVLGHKISKNGLQVDRAKVEVIERLEPPNSVKGIRSFLGHAGFYR------------------

Query:  ------------------------STILCAPNWNLPFEVMCDASDAA-----------------------------------------------------
                                + I+  P+W+ PFE+MCDASD A                                                     
Subjt:  ------------------------STILCAPNWNLPFEVMCDASDAA-----------------------------------------------------

Query:  ---------------------------------EFDLEIKDKKGSENVIADHLSRLDPSSSLLEQSAISDSFPDEQLFAVEVKVIRDVPCYAKKSGIARA
                                         EFDLEI+D+KG EN IADHLSRL+  +   E + I+D+FPDEQL A+   V  DVP YA        
Subjt:  ---------------------------------EFDLEIKDKKGSENVIADHLSRLDPSSSLLEQSAISDSFPDEQLFAVEVKVIRDVPCYAKKSGIARA

Query:  SRRCLCSVSMLRRLGAQEKMTSERLDAVLVASQRYACWEPI---KGAFCIHIWCVSGAEAKEILEQCHSSPYGGHFSGQMTAMRILHCGFFWHTLFKDAH
             C +     L AQ+K         L  ++RY   +P    +G   I   CV   E  +I EQCH+SPYGGHF    TA +IL  GFFW  LFKD H
Subjt:  SRRCLCSVSMLRRLGAQEKMTSERLDAVLVASQRYACWEPI---KGAFCIHIWCVSGAEAKEILEQCHSSPYGGHFSGQMTAMRILHCGFFWHTLFKDAH

Query:  RFYKQCDACQRRGNLGPRDEMALTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVTRFLQSHIFARFGTPRALVSDEGT
         F   CD CQR GN+  R EM L  ILEVELFDVWGIDFMGPF PS GN++IL+AVDY+SKWVEA+A   +D+K V  F++ +IF RFGTPRA++SD GT
Subjt:  RFYKQCDACQRRGNLGPRDEMALTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVTRFLQSHIFARFGTPRALVSDEGT

Query:  HFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWAL
        HF N     LL+KYG+KH+I+TPYHPQ +GQ E+SNREIK  LEK V  +RKDWS RLDEALWAYRTA+KTP+GMSPY LV+GKACHLP           
Subjt:  HFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWAL

Query:  KKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKIYKEKTKLWHDKKIKSKESVKGQKVLLYNSILKLFPGKLKSKWSGPFIVIEVFPNGAITLQDEKDER
                                                     KI  +    GQ VLL+NS LKLFP KLKS+WS PF + EV P+GA+ L+++    
Subjt:  KKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKIYKEKTKLWHDKKIKSKESVKGQKVLLYNSILKLFPGKLKSKWSGPFIVIEVFPNGAITLQDEKDER

Query:  VFKVNGQRVKHYW
         FKVN QR+KHYW
Subjt:  VFKVNGQRVKHYW

A0A2G9IA86 DNA-directed DNA polymerase0.0e+0045.87Show/hide
Query:  KVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNF
        K   V+EVD V+ + A +  +  ++KN  V   Q  P               C  CGE H    CP++  S+ FV N R   NNPYSN YNPGWR HPNF
Subjt:  KVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQR---NNPYSNFYNPGWRNHPNF

Query:  SWGG-----------QGNNVQAQQKMNQ--PGFAKAGI----APANFDEGTGIASGSVANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSELESGQG
        SW             Q    Q QQ M +  P   +  I    + A   +      G +AN + +RPQG L S+TE +PR++GK Q +AVTLR+  E  + 
Subjt:  SWGG-----------QGNNVQAQQKMNQ--PGFAKAGI----APANFDEGTGIASGSVANELKARPQGKLPSDTE-HPRREGKEQVKAVTLRSELESGQG

Query:  AGGSNKNAG----ASGSVPDVEPP--------------------------------YVPPPPYVPPLPFPQRQKPKNQDGQFK------NAILKNGLPPK
             K+ G    +     +VE P                                    P YV  + +   +K +  D +        +AI++N LPPK
Subjt:  AGGSNKNAG----ASGSVPDVEPP--------------------------------YVPPPPYVPPLPFPQRQKPKNQDGQFK------NAILKNGLPPK

Query:  AKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPE----------------------------------GRP
         KDPGSFTIP +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLA+RS+TYP+                                  GRP
Subjt:  AKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPE----------------------------------GRP

Query:  FLATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV--------LETAIQDSADKHSEKHGEVSVEDFEFCSLDRKNEKELFRC
        FLATGRTLIDVQKG+LTMRV ++++ FNVFKAMK+P+E ++C  + + ++          LE A+ D  D+ +E+  EV         +   +  + F+ 
Subjt:  FLATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV--------LETAIQDSADKHSEKHGEVSVEDFEFCSLDRKNEKELFRC

Query:  EDVFEFLDLDQRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMSEHEEAVIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEG
          V       +R AP   +KPS+ E+PTL+LKPLP HL Y YLGES+TLP+I++S L     E ++++ + ++ AIGWT+ADI+GIS SFCMHKI LE+ 
Subjt:  EDVFEFLDLDQRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMSEHEEAVIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEG

Query:  SFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTV------------------------------------------MLD
           S+E QRRLNP MKEVVKKE+IKW+DAGIIYPI+DS+WVSPVQCVPKKGG+TV                                          MLD
Subjt:  SFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTV------------------------------------------MLD

Query:  RLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQNCLDNLGKVLKRCE
        RLAG+ +YCFLDGY           DQEKTTFTCPYGTFAFRR+PFGLCNAPATFQRCM+AIF+DM+E+ +EVFMDDFSV+G SF  CL+NL  VLKRCE
Subjt:  RLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQNCLDNLGKVLKRCE

Query:  DTHLVLNWEKCHFMVKEGVVLGHKISKNGLQVDRAKVEVIERLEPPNSVKGIRSFLGHAGFYR-------------------------------------
        DT+LVLNW+KCHFMV+EG+VL HK+S  G++V++AK+E IE+L PP SVKGIRSFLGHAGFYR                                     
Subjt:  DTHLVLNWEKCHFMVKEGVVLGHKISKNGLQVDRAKVEVIERLEPPNSVKGIRSFLGHAGFYR-------------------------------------

Query:  -----STILCAPNWNLPFEVMCDASDAA------------------------------------------EFDLEIKDKKGSENVI-ADHLS--------
             + I+  P+W+ PFE+MCDASD A                                           FD       G++ ++  DH +        
Subjt:  -----STILCAPNWNLPFEVMCDASDAA------------------------------------------EFDLEIKDKKGSENVI-ADHLS--------

Query:  ----RLDPSSSLLEQSAISDSFPDEQLFAVEVKVIRDVPCYAKKSGIARASRRCLCSVSMLRRLGAQEKMTSERLDAVLVASQRYACWEPI----KGAFC
            RL+  +   E + I+D+FPDEQL A+   V  DVP Y+             C +     L AQ+K         L  ++RY  W+ +    +G   
Subjt:  ----RLDPSSSLLEQSAISDSFPDEQLFAVEVKVIRDVPCYAKKSGIARASRRCLCSVSMLRRLGAQEKMTSERLDAVLVASQRYACWEPI----KGAFC

Query:  IHIWCVSGAEAKEILEQCHSSPYGGHFSGQMTAMRILHCGFFWHTLFKDAHRFYKQCDACQRRGNLGPRDEMALTYILEVELFDVWGIDFMGPFPPSNGN
        I   CV   E  +ILEQCH+SPYGGHF G  TA +IL  GFFW  LFKDAH F   CD CQR GN+  R EM L  IL+VELFDVWGIDF+GPF PS GN
Subjt:  IHIWCVSGAEAKEILEQCHSSPYGGHFSGQMTAMRILHCGFFWHTLFKDAHRFYKQCDACQRRGNLGPRDEMALTYILEVELFDVWGIDFMGPFPPSNGN

Query:  VFILLAVDYVSKWVEAIACHQSDAKTVTRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHP
        ++IL+AVDYVSKWVEA+A   +D+K V  F++ +IF RFGTPRA++SD G HF N      L+KYG+KH+I TPYHPQ +GQ E+SNREIK ILEK V  
Subjt:  VFILLAVDYVSKWVEAIACHQSDAKTVTRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHP

Query:  SRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKIYKEKTKLWHDKKIKS
        +R DWS RLDEALWAYRT YKTP+GMSPY L++GKACHL +ELEH  +WA++KLNFD+   G  R+LQLNEL+EFR  +YENAKIYKEKTK WHDKKI  
Subjt:  SRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKIYKEKTKLWHDKKIKS

Query:  KESVKGQKVLLYNSILKLFPGKLKSKWSGPFIVIEVFPNGAITLQDEKDERVFKVNGQRVKHYWG
        +    GQ VLL+NS LKLFP KLK +WSGPF + EVFP+GA+ L++E     FKVN QR+KHYWG
Subjt:  KESVKGQKVLLYNSILKLFPGKLKSKWSGPFIVIEVFPNGAITLQDEKDERVFKVNGQRVKHYWG

A0A6P8CBX2 Reverse transcriptase0.0e+0041.98Show/hide
Query:  FLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWEGLRSFCESVP----
        FL   ++  +  V  D +RL LFP+SLRD A++W NS    SI TW +L+  FL ++FPP R A+LR+EI  F +   E+  EAWE  +      P    
Subjt:  FLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWEGLRSFCESVP----

Query:  -----TMVYLIRHNKKVKSVLEV-----------DGVSTIRADLAMIAN------------ALKNVTVISHQQPPAMEPTAVVNQVTD---------EAC
               V+ +  +  ++S+++            D  S +  ++A  A+            ++ ++  I++        T  V+++T            C
Subjt:  -----TMVYLIRHNKKVKSVLEV-----------DGVSTIRADLAMIAN------------ALKNVTVISHQQPPAMEPTAVVNQVTD---------EAC

Query:  VYCGEDHNYEFCPS-NPAS------VFFVGN-QRNN--PYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFAKAGIA--------------------
          C   H+   C S NP++      V FV N QR+N  PYSN YNPGWRNHPNFSW  + N ++       PGF K G A                    
Subjt:  VYCGEDHNYEFCPS-NPAS------VFFVGN-QRNN--PYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFAKAGIA--------------------

Query:  ----------PANFDEGTGIASGSVANELKARPQGKLPSDTEHPRREGKEQVKAVTLRS--ELE----SGQGAGGSNKNAGASGSVPDVEPPYVPPPPYV
                   A      G  S  ++ +L  RP G LPS+TE    E  + V A+ LRS  ELE      Q    S +       V +     +   PYV
Subjt:  ----------PANFDEGTGIASGSVANELKARPQGKLPSDTEHPRREGKEQVKAVTLRS--ELE----SGQGAGGSNKNAGASGSVPDVEPPYVPPPPYV

Query:  PPLPFPQRQKPKNQDGQF------------------------------KNAILKN------------------------GLPPKAKDPGSFTIPVSIGGK
        PP+PFP R K +  D QF                              K+ + K                          LP K +D GSFT+P +IG  
Subjt:  PPLPFPQRQKPKNQDGQF------------------------------KNAILKN------------------------GLPPKAKDPGSFTIPVSIGGK

Query:  ELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPE----------------------------------GRPFLATGRTLIDVQKGEL
             L D GASINLMPLS++RKLG+GE + T +TLQLADRSI YP+                                  GRPFLATG+ LIDV++G+L
Subjt:  ELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPE----------------------------------GRPFLATGRTLIDVQKGEL

Query:  TMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVLETAIQDSADKHSEKHGEVSVEDFEFCSLDRKNEKELFRCEDVFE-----FLDLDQRKAPPIKP
        T+RV NE++ FNV+ A+K  D+ + C  I I++  + E+      +K      E  + D +  S D ++E+E    E V E     + +L      P+  
Subjt:  TMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVLETAIQDSADKHSEKHGEVSVEDFEFCSLDRKNEKELFRCEDVFE-----FLDLDQRKAPPIKP

Query:  SLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMSEHEEAVIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKK
        SL ++P L+LKPLP HLKY YLG  +TLPII++S L  + E+ ++ +L+++++AIGWT+ADI+GISP  C H+I LE      ++ QRRLNP +KEVVKK
Subjt:  SLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMSEHEEAVIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKK

Query:  EVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTV------------------------------------------MLDRLAGQAYYCFLDGYSGYNQIT
        EV+K LDAGIIYPI+DS WVSPVQ VPKKGG+TV                                          ML++LAG  YYCFLDGYSGYNQI 
Subjt:  EVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTV------------------------------------------MLDRLAGQAYYCFLDGYSGYNQIT

Query:  IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQNCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGVVL
        IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCM++IFSDM+E+ +E+FMDDFSVFG SF++CL NLG VLKRC++T+L+LNWEKCHFMV+EG+VL
Subjt:  IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQNCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGVVL

Query:  GHKISKNGLQVDRAKVEVIERLEPPNSVKGIRSFLGHAGFYR------------------------------------------STILCAPNWNLPFEVM
        GHK+SK G++VDRAKVE+IE+L PP S KG+RSFLGHAGFYR                                          + ++ APNW LPFE+M
Subjt:  GHKISKNGLQVDRAKVEVIERLEPPNSVKGIRSFLGHAGFYR------------------------------------------STILCAPNWNLPFEVM

Query:  CDASDAA--------------------------------------------------------------------------------------EFDLEIK
        CDASD A                                                                                      EFDLEI+
Subjt:  CDASDAA--------------------------------------------------------------------------------------EFDLEIK

Query:  DKKGSENVIADHLSRLDPSSSLLEQSAISDSFPDEQLFAVEVKVIRDVPCYAKKSGIARASRRCLCSVSMLRRLGAQEKMTSERLDAVLVASQRYACW-E
        D KG+ENV+ADHLSRL+  S  L+ S I++ FPDEQL   E   I+ +P YA               VS +   G   +   + L  V     +Y  W E
Subjt:  DKKGSENVIADHLSRLDPSSSLLEQSAISDSFPDEQLFAVEVKVIRDVPCYAKKSGIARASRRCLCSVSMLRRLGAQEKMTSERLDAVLVASQRYACW-E

Query:  PIKGAFC---IHIWCVSGAEAKEILEQCHSSPYGGHFSGQMTAMRILHCGFFWHTLFKDAHRFYKQCDACQRRGNLGPRDEMALTYILEVELFDVWGIDF
        P    +C   +   CV   E   I++ CHS   GGHF  + TA +IL CGF+W  +F D   +   C  CQR GN+  R E+    IL +ELFDVWGIDF
Subjt:  PIKGAFC---IHIWCVSGAEAKEILEQCHSSPYGGHFSGQMTAMRILHCGFFWHTLFKDAHRFYKQCDACQRRGNLGPRDEMALTYILEVELFDVWGIDF

Query:  MGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVTRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREI
        MGPFP S  N +IL+AVDYVSKWVEA+A   +DA+ V RFL+ +IF+RFG PRA++SD G+HF N    KLL+KYG+ H+IATPYHPQ  GQ E+SNREI
Subjt:  MGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVTRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREI

Query:  KAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKIYKEKT
        K ILEK V+ SRKDWS +LD+ALWAYRTA+KTP+GMSPY++VYGK+CHLP+ELEHK +WA+K LNFDL  AG  R+LQLN++ E R+ +YENA+IYKE+ 
Subjt:  KAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKIYKEKT

Query:  KLWHDKKIKSKESVKGQKVLLYNSILKLFPGKLKSKWSGPFIVIEVFPNGAITLQDEKDERVFKVNGQRVKHYWGEE
        K WHD+ I  +E + GQKVLLYNS LKLFPGKLKS+WSGPF++  VFP GA+ L+ E D+R FKVNG  +KHY+  E
Subjt:  KLWHDKKIKSKESVKGQKVLLYNSILKLFPGKLKSKWSGPFIVIEVFPNGAITLQDEKDERVFKVNGQRVKHYWGEE

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.8e-2932.5Show/hide
Query:  AMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKK------------------GGVTV-----------MLDRLAGQAYYCFLDGYSGYNQITIAPED
        A ++ V+ ++   L+ GII   ++S + SP+  VPKK                    +TV           +L +L    Y+  +D   G++QI + PE 
Subjt:  AMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKK------------------GGVTV-----------MLDRLAGQAYYCFLDGYSGYNQITIAPED

Query:  QEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQNCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGVVLGHKIS
          KT F+  +G + + RMPFGL NAPATFQRCM  I   ++     V++DD  VF  S    L +LG V ++    +L L  +KC F+ +E   LGH ++
Subjt:  QEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQNCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGVVLGHKIS

Query:  KNGLQVDRAKVEVIERLEPPNSVKGIRSFLGHAGFYRSTI
         +G++ +  K+E I++   P   K I++FLG  G+YR  I
Subjt:  KNGLQVDRAKVEVIERLEPPNSVKGIRSFLGHAGFYRSTI

P10394 Retrovirus-related Pol polyprotein from transposon 4128.7e-4522.07Show/hide
Query:  MLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQNCLDNLGKVLK
        +LD+L    Y+  LD  SG++QI +    ++ T+F+   G++ F R+PFGL  AP +FQR M   FS +  S   ++MDD  V G S ++ L NL +V  
Subjt:  MLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQNCLDNLGKVLK

Query:  RCEDTHLVLNWEKCHFMVKEGVVLGHKISKNGLQVDRAKVEVIERLEPPNSVKGIRSFLGHAGFYR----------------------------------
        +C + +L L+ EKC F + E   LGHK +  G+  D  K +VI+    P+     R F+    +YR                                  
Subjt:  RCEDTHLVLNWEKCHFMVKEGVVLGHKISKNGLQVDRAKVEVIERLEPPNSVKGIRSFLGHAGFYR----------------------------------

Query:  --------STILCAPNWNLPFEVMCDASDAA---------------------------------------------------------------------
                 T+L  P+++  F +  DAS  A                                                                     
Subjt:  --------STILCAPNWNLPFEVMCDASDAA---------------------------------------------------------------------

Query:  -----------------EFDLEIKDKKGSENVIADHLSRL------DPSSSLLE-----QSAISDSFPDEQL--------FAVEVKVIRDVPCYAKKSGI
                         E++  ++  KG +N +AD LSR+      D + ++L+     QS        EQL         A E  V   +     +  +
Subjt:  -----------------EFDLEIKDKKGSENVIADHLSRL------DPSSSLLE-----QSAISDSFPDEQL--------FAVEVKVIRDVPCYAKKSGI

Query:  ARASRRCLCSVSMLRRLGAQEKMTSERLDAVL-----------------VASQRYACWEPI--------------------KGAFCIHIWCVSGAEAKE-
               +C     +++ A+  +     + +L                 ++  + A W+ I                    K A    +  ++  + KE 
Subjt:  ARASRRCLCSVSMLRRLGAQEKMTSERLDAVL-----------------VASQRYACWEPI--------------------KGAFCIHIWCVSGAEAKE-

Query:  ILEQCHSSPYGGHFSGQMTAMRILHCGFFWHTLFKDAHRFYKQCDACQR-RGNLGPRDEMALTYILEVELFDVWGIDFMGPFPPS-NGNVFILLAVDYVS
        IL   H  P  G  +G    +  +   ++W  + K    + ++C  CQ+ +     +  M +T   E   FD   +D +GP P S NGN + +  +  ++
Subjt:  ILEQCHSSPYGGHFSGQMTAMRILHCGFFWHTLFKDAHRFYKQCDACQR-RGNLGPRDEMALTYILEVELFDVWGIDFMGPFPPS-NGNVFILLAVDYVS

Query:  KWVEAIACHQSDAKTVTRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDE
        K++ AI      AKTV + +      ++G  +  ++D GT + N+I+T L     IK+  +T +H Q  G  E S+R +   +   +   + DW   L  
Subjt:  KWVEAIACHQSDAKTVTRFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDE

Query:  ALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLN-----FDLSRAGAIRMLQLNELEEFRQFSYENAK----IYKEKTKLWHDKKIKSKE
         ++ + T         PY LV+G+  +LP     K F  L  +       D ++    R+          + +Y  A+     +KEK K  +D K+K  E
Subjt:  ALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLN-----FDLSRAGAIRMLQLNELEEFRQFSYENAK----IYKEKTKLWHDKKIKSKE

Query:  SVKGQKVLLYNSILKLFPGKLKSKWSGPFIVIEVFPNGAITLQDEKDERVFKVNGQRVKHY
           G KVLL N +      KL  K++GP+ +  +  N  ITL   K+++   V+  R+K +
Subjt:  SVKGQKVLLYNSILKLFPGKLKSKWSGPFIVIEVFPNGAITLQDEKDERVFKVNGQRVKHY

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein4.8e-2730.62Show/hide
Query:  LLQQYRKAIGWTL----ADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGG-----------
        L Q+YR+ I   L    ADI  I      H I ++ G+     Q   +    ++ + K V K LD   I P + S   SPV  VPKK G           
Subjt:  LLQQYRKAIGWTL----ADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGG-----------

Query:  -------------VTVMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVF
                     +  +L R+     +  LD +SGY+QI + P+D+ KT F  P G + +  MPFGL NAP+TF R M   F D+    V V++DD  +F
Subjt:  -------------VTVMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVF

Query:  GGSFQNCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGVVLGHKISKNGLQVDRAKVEVIERLEPPNSVKGIRSFLGHAGFYRSTILCAPNWNLPFEV-MC
          S +    +L  VL+R ++ +L++  +KC F  +E   LG+ I    +   + K   I     P +VK  + FLG   +YR  I        P ++ +C
Subjt:  GGSFQNCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGVVLGHKISKNGLQVDRAKVEVIERLEPPNSVKGIRSFLGHAGFYRSTILCAPNWNLPFEV-MC

Query:  DASDAAE
        D S   E
Subjt:  DASDAAE

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus4.0e-2631.8Show/hide
Query:  MKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKK-----------------------------GGVTVMLDRLAGQAYYCFLDGYSGYNQITIAPEDQ
        M+  V++++ + L  GII P ++S + SP+  VPKK                               +   L  L    Y+  LD  SG++QI +   D 
Subjt:  MKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKK-----------------------------GGVTVMLDRLAGQAYYCFLDGYSGYNQITIAPEDQ

Query:  EKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQNCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGVVLGHKISK
         KT F+   G + F R+PFGL NAPA FQR +  I  + I     V++DD  VF   +     NL  VL      +L +N EK HF+  +   LG+ ++ 
Subjt:  EKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQNCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGVVLGHKISK

Query:  NGLQVDRAKVEVIERLEPPNSVKGIRSFLGHAGFYRSTI
        +G++ D  KV  I  + PP SVK ++ FLG   +YR  I
Subjt:  NGLQVDRAKVEVIERLEPPNSVKGIRSFLGHAGFYRSTI

Q99315 Transposon Ty3-G Gag-Pol polyprotein3.7e-2730.48Show/hide
Query:  LLQQYRKAIGWTL----ADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGG-----------
        L Q+YR+ I   L    ADI  I      H I ++ G+     Q   +    ++ + K V K LD   I P + S   SPV  VPKK G           
Subjt:  LLQQYRKAIGWTL----ADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGG-----------

Query:  -------------VTVMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVF
                     +  +L R+     +  LD +SGY+QI + P+D+ KT F  P G + +  MPFGL NAP+TF R M   F D+    V V++DD  +F
Subjt:  -------------VTVMLDRLAGQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVF

Query:  GGSFQNCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGVVLGHKISKNGLQVDRAKVEVIERLEPPNSVKGIRSFLGHAGFYRSTILCAPNWNLPFEV-MC
          S +    +L  VL+R ++ +L++  +KC F  +E   LG+ I    +   + K   I     P +VK  + FLG   +YR  I        P ++ +C
Subjt:  GGSFQNCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGVVLGHKISKNGLQVDRAKVEVIERLEPPNSVKGIRSFLGHAGFYRSTILCAPNWNLPFEV-MC

Query:  DASDAAEFDLEIKDK
        D S   E   +  DK
Subjt:  DASDAAEFDLEIKDK

Arabidopsis top hitse value%identityAlignment
ATMG00750.1 GAG/POL/ENV polyprotein4.3e-1562.5Show/hide
Query:  ILHCGFFWHTLFKDAHRFYKQCDACQRRGNLGPRDEMALTYILEVELFDVWGIDFM
        +L  GF+W T FKDAH F   CDACQR+GN   R+EM   +ILEVE+FDVWGI FM
Subjt:  ILHCGFFWHTLFKDAHRFYKQCDACQRRGNLGPRDEMALTYILEVELFDVWGIDFM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCGATCCGCCTGGGAAGGAGGGAGCAGCGCAGAAACCAGATGGAGAACGTGCCGCAACTCCCACAGTTCCTGAAGGTCCAGCAGAGCAAACCCCAGCAGAATCCAT
TGCTGCAGCAAAACCCCTGTTTGAGCAAAATGAGCAGCAAAATAATCAGGCTGAGAATCCTATCTTGATAGCGAACGATAGGACCAGAGCCATTCGAGCATATGCTGTCC
CGATGTTTAATGAGTTGAATCCAGGAATTGCATGTCCCCAAATCCAAGCGGCGAATTTTGAAATGAAACCGTCTTTTCTAGGAGTTAGTGATTCTTTTGTAATTCAAGGA
GTGCCTAGAGATGCTCTTAGATTAACTTTGTTCCCGTATTCTCTTAGAGATGGAGCAAAGTCATGGTTAAACTCTTTTGCTCCAGGGTCAATTAGGACGTGGGATGAGTT
AGCTGAAAATTTTTTGAGTAAATATTTCCCACCTAATAGAAATGCTAAATTAAGGAGTGAAATAGTAGGGTTTAGGCAACTTGAGGATGAGACTTTTAGTGAGGCTTGGG
AAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCATGGTTTACCTCATTAGGCACAATAAAAAGGTTAAGAGTGTGTTAGAGGTTGATGGTGTGTCCACCATTAGGGCT
GATCTTGCTATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCACCAGCTATGGAGCCTACTGCAGTGGTGAACCAAGTCACGGACGAAGCATG
TGTCTATTGTGGTGAAGACCACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTTTTTGTAGGTAATCAGAGGAACAACCCTTATTCTAATTTCTATAATCCAG
GTTGGCGCAACCACCCCAACTTCTCATGGGGAGGTCAAGGAAATAACGTACAAGCGCAGCAAAAGATGAACCAGCCAGGATTTGCTAAAGCAGGTATTGCCCCAGCAAAC
TTCGATGAGGGCACTGGAATTGCAAGTGGGTCAGTAGCTAATGAGCTGAAGGCAAGGCCTCAAGGGAAACTTCCATCAGATACTGAACACCCTAGACGGGAAGGTAAGGA
GCAGGTAAAGGCAGTAACTCTTAGGAGTGAGTTGGAGTCTGGTCAGGGTGCTGGAGGCAGCAATAAAAATGCTGGAGCATCTGGTTCTGTGCCAGATGTAGAACCACCAT
ATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAAGCCTAAGAATCAGGATGGTCAATTTAAAAATGCTATTCTTAAGAATGGGCTACCCCCC
AAGGCTAAGGATCCAGGGTCATTTACCATACCTGTGTCTATAGGTGGAAAGGAATTAGGTAGAGCACTCTGTGATTTAGGCGCAAGCATTAACCTTATGCCTCTTTCGGT
CTATCGGAAGTTAGGTATTGGTGAAGCTAGGCCTACCACAGTTACACTCCAACTAGCTGATAGGTCTATCACATATCCAGAGGGTCGTCCATTTTTGGCTACTGGTAGGA
CATTAATAGATGTTCAAAAAGGGGAATTAACAATGAGAGTCTGTAATGAGGAAGTAAAATTTAATGTGTTTAAAGCCATGAAATATCCAGACGAAATGGAGGATTGCTCC
TTCATTAGGATTCTGGAGAGCACAGTTCTTGAGACAGCAATACAAGATTCGGCTGATAAGCATTCAGAAAAGCATGGAGAGGTTAGTGTAGAAGATTTTGAATTTTGTTC
TTTAGATAGAAAAAATGAAAAAGAATTGTTTAGGTGTGAGGATGTTTTTGAGTTTTTAGATTTAGATCAAAGGAAGGCTCCTCCAATTAAGCCATCCCTGATTGAGGCAC
CTACTTTAGATTTGAAACCCTTACCGGATCATCTAAAGTATGTGTATCTTGGGGAAAGTGAGACGTTGCCCATTATTGTTGCATCAGATTTAATGTCGGAGCATGAGGAG
GCCGTAATAAAATTGCTGCAGCAATACCGCAAGGCTATAGGTTGGACATTGGCTGATATTCAGGGAATTAGCCCATCTTTTTGTATGCACAAAATCACTCTAGAGGAGGG
ATCCTTTAGGAGTATTGAGCAACAAAGAAGGCTTAACCCTGCAATGAAAGAGGTTGTTAAAAAGGAGGTGATTAAATGGTTGGATGCTGGGATCATTTATCCAATTGCCG
ATAGCAATTGGGTAAGCCCTGTCCAATGTGTTCCTAAGAAAGGAGGTGTCACTGTGATGTTGGATCGATTGGCTGGTCAGGCCTATTACTGTTTCTTGGATGGTTATTCT
GGGTATAACCAGATTACTATTGCTCCTGAGGATCAGGAGAAAACCACTTTCACCTGTCCTTATGGGACATTCGCTTTTAGGAGAATGCCTTTCGGCCTCTGCAATGCTCC
AGCAACATTTCAGCGGTGTATGTTAGCAATTTTTTCTGATATGATTGAGTCCACTGTTGAGGTATTTATGGATGATTTTTCAGTGTTTGGAGGGTCTTTTCAGAATTGTT
TAGATAATTTAGGAAAGGTATTAAAAAGATGTGAGGATACCCATCTAGTTCTCAATTGGGAGAAATGCCACTTCATGGTAAAGGAGGGCGTAGTGTTAGGACATAAAATT
TCTAAAAATGGTCTACAAGTTGATAGAGCAAAAGTTGAGGTTATTGAAAGGCTAGAGCCACCGAATTCAGTGAAGGGAATTCGGAGTTTTTTAGGACATGCTGGATTTTA
TAGGAGCACCATTCTTTGCGCACCTAATTGGAATTTGCCATTTGAGGTAATGTGTGATGCGAGTGATGCTGCGGAATTTGACTTGGAGATAAAGGACAAGAAAGGATCAG
AAAATGTCATTGCAGATCATTTATCTCGTCTTGATCCGTCATCATCTTTGCTGGAGCAATCTGCCATTTCCGATTCTTTTCCAGATGAACAGCTCTTTGCTGTGGAGGTA
AAGGTAATTAGGGATGTCCCTTGCTATGCGAAAAAATCTGGAATCGCGAGAGCATCTCGACGCTGTCTATGTAGCGTCTCGATGCTGCGACGTCTTGGAGCCCAAGAAAA
GATGACGTCAGAGCGTCTCGACGCTGTCCTTGTAGCGTCTCAACGCTATGCCTGTTGGGAGCCTATAAAAGGAGCCTTTTGCATCCATATTTGGTGTGTTTCAGGTGCTG
AAGCAAAGGAAATCCTGGAGCAATGTCACTCTTCGCCGTATGGAGGTCATTTCAGCGGTCAGATGACAGCTATGAGGATTTTGCATTGCGGATTTTTCTGGCATACGTTA
TTCAAGGATGCCCATCGGTTCTACAAGCAATGTGATGCTTGCCAAAGGAGAGGAAATTTGGGGCCTAGAGATGAAATGGCTCTTACTTATATTTTAGAAGTTGAATTATT
CGATGTATGGGGTATTGACTTTATGGGGCCATTTCCTCCTTCTAATGGCAATGTTTTTATCTTATTGGCAGTTGATTATGTGTCCAAGTGGGTGGAGGCCATTGCATGCC
ATCAGAGTGATGCCAAGACAGTTACAAGGTTTCTTCAATCGCACATCTTTGCGCGGTTTGGGACACCTAGGGCTCTAGTGAGTGATGAGGGTACACATTTTGTTAATAAT
ATCTTAACTAAGCTTTTAGCTAAGTATGGGATTAAGCATAGGATAGCTACCCCTTATCACCCACAAGCAAATGGTCAAGCTGAAATTAGTAATAGGGAAATTAAAGCTAT
CTTAGAGAAAGTAGTCCATCCATCTAGAAAGGATTGGTCCTTTAGGTTGGATGAGGCTCTTTGGGCTTATAGGACAGCTTATAAGACTCCTCTAGGTATGTCTCCCTATA
GGTTAGTATATGGGAAAGCTTGCCATTTACCATTAGAGCTTGAGCATAAAACATTTTGGGCTTTGAAAAAGTTAAATTTTGATCTGAGTCGTGCAGGAGCAATAAGAATG
CTGCAGCTTAATGAGTTAGAAGAATTTCGTCAATTTTCTTACGAAAATGCGAAAATTTATAAGGAAAAGACTAAGTTGTGGCATGACAAGAAAATAAAGTCCAAAGAATC
TGTCAAGGGTCAAAAAGTTTTGCTTTATAATTCTATATTAAAATTATTTCCTGGAAAATTAAAATCTAAATGGTCAGGACCGTTTATTGTGATTGAGGTTTTCCCCAATG
GAGCAATTACTTTGCAGGATGAAAAAGATGAGAGAGTGTTCAAGGTGAATGGACAGCGTGTGAAGCATTATTGGGGTGAGGAGTTTCAGTCGAAATATCCTTCCCTAAGA
CATTGGTACGTTTCTCCTCACTTCGTCTTCTTGATTCTATCTTGCCATCTACATTCTTTCTTTCTCCTTTCATTCTCTGCAAAACCCTTAAAAACTAAGAAGAAAAGAAC
ACCGGAGGAGAAAGAAGCTAAAAGAAGAAGGCGACAACAGCGGGAGGAAGAACAGGGAAAAGTGATAGAGGATGAGGCTGCTGCAGTGGTTGAAGAAAACCTGGAAGAAC
CACAAGAACATACCCCAGGAGAGGCTGATCAGAGACTTGCGGATACAGATACGGCTCGAGAGGAAAATGTTGAGGAGAATCAAGAACAGCAGGCTGAGATGGACCAAGAA
GCTCGTGTTGAGGTAATCATGCCCGAACCACCGAAACTTCGTCGCATAAAGAGAAAGGTCGGCCGTATTCAGGTGATTCGGACTGATACCCCGTCACCACCATCATCATA
TTCTGAGAGAGAAAAAGCAGAGCGAGAGGAAAGAGAGAAGAAAGAAGTTGAAGACAAAGAAAGAGAAGAAACAGAGAAGAAGACTGAGGAAGAGCAGTTGCTCAAGCGCA
GGGCAGAGAAGGGCAAAAATATTGCTGAAGCATCGGAAGAACACGAGGCGATAGAAGAACAGCAGGTTCCATTTGATCGCTTCGTCAATAATTTTGCCAGAGCAAAATAC
ACTGAGCTTCTGGAAAGGGATTTCTTGTTCAAACGCGGTTTCAGTGGTGATCTTCCACAGTTTCTGAGGACCGGTATTGCAGACCACGGCTGGGAGCTGTTTTGTGCGAA
GCCTGAATCTGTAAACGCACAGGTGGTGCGTGAATTTTATGCTAACATTATCAAGGAGGATGGTTTCCAGGTAATTGTCCGAGGAGTCGAGGTGGATTGGAGTCCCAGTG
CCATCAACGCACTGTACCATCTTCAAAACTTCCCCCATGCGGCGTATAATGAGATGGTTGTGGCGTCATCTGATGAGCAACTAAGTGAGGCTGTCAGGGAGGTAGGAATT
GAAGGGGCACAGTGGCAGTTATCCAAGACTCAGAAGAGGACATTCCAGTCGGCTTATTTGAAAAGGGATGCGAATACATGGATGGGATTTATTAGACAGAAGAGGCTTCC
AACAACACATGACTCGACAATCTCCAGTGAACGGGTTCTCCTAGCTTTTGTCATCTTACGGTCTCTCAGTATTAACGTAGGGAGGATCATTGCGAATGAGATTGCTGGAT
GTTGGAAAAAGAATGTGGGAAAACTATTCTTTCCGAACACAATCACGATGCTGTGCAGAAGAGCAGGGGTTCTGGTGGATGAGGGCGATGTGATCCTGTTTGATAAAGGG
ATCATAGACACGCCCAATTTGGCACGGCTCCAGCGTGTGCAGGAGGTACGTCAAGGTGGGCTTATCTACGGCATCAATACGATTCTAGAACAACTGGCACTTTCGGCCAG
TAGGCAGGAGTTTGCTGAGAGGCAGACTTTAACCTTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGCGATCCGCCTGGGAAGGAGGGAGCAGCGCAGAAACCAGATGGAGAACGTGCCGCAACTCCCACAGTTCCTGAAGGTCCAGCAGAGCAAACCCCAGCAGAATCCAT
TGCTGCAGCAAAACCCCTGTTTGAGCAAAATGAGCAGCAAAATAATCAGGCTGAGAATCCTATCTTGATAGCGAACGATAGGACCAGAGCCATTCGAGCATATGCTGTCC
CGATGTTTAATGAGTTGAATCCAGGAATTGCATGTCCCCAAATCCAAGCGGCGAATTTTGAAATGAAACCGTCTTTTCTAGGAGTTAGTGATTCTTTTGTAATTCAAGGA
GTGCCTAGAGATGCTCTTAGATTAACTTTGTTCCCGTATTCTCTTAGAGATGGAGCAAAGTCATGGTTAAACTCTTTTGCTCCAGGGTCAATTAGGACGTGGGATGAGTT
AGCTGAAAATTTTTTGAGTAAATATTTCCCACCTAATAGAAATGCTAAATTAAGGAGTGAAATAGTAGGGTTTAGGCAACTTGAGGATGAGACTTTTAGTGAGGCTTGGG
AAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCATGGTTTACCTCATTAGGCACAATAAAAAGGTTAAGAGTGTGTTAGAGGTTGATGGTGTGTCCACCATTAGGGCT
GATCTTGCTATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCACCAGCTATGGAGCCTACTGCAGTGGTGAACCAAGTCACGGACGAAGCATG
TGTCTATTGTGGTGAAGACCACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTTTTTGTAGGTAATCAGAGGAACAACCCTTATTCTAATTTCTATAATCCAG
GTTGGCGCAACCACCCCAACTTCTCATGGGGAGGTCAAGGAAATAACGTACAAGCGCAGCAAAAGATGAACCAGCCAGGATTTGCTAAAGCAGGTATTGCCCCAGCAAAC
TTCGATGAGGGCACTGGAATTGCAAGTGGGTCAGTAGCTAATGAGCTGAAGGCAAGGCCTCAAGGGAAACTTCCATCAGATACTGAACACCCTAGACGGGAAGGTAAGGA
GCAGGTAAAGGCAGTAACTCTTAGGAGTGAGTTGGAGTCTGGTCAGGGTGCTGGAGGCAGCAATAAAAATGCTGGAGCATCTGGTTCTGTGCCAGATGTAGAACCACCAT
ATGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAAGCCTAAGAATCAGGATGGTCAATTTAAAAATGCTATTCTTAAGAATGGGCTACCCCCC
AAGGCTAAGGATCCAGGGTCATTTACCATACCTGTGTCTATAGGTGGAAAGGAATTAGGTAGAGCACTCTGTGATTTAGGCGCAAGCATTAACCTTATGCCTCTTTCGGT
CTATCGGAAGTTAGGTATTGGTGAAGCTAGGCCTACCACAGTTACACTCCAACTAGCTGATAGGTCTATCACATATCCAGAGGGTCGTCCATTTTTGGCTACTGGTAGGA
CATTAATAGATGTTCAAAAAGGGGAATTAACAATGAGAGTCTGTAATGAGGAAGTAAAATTTAATGTGTTTAAAGCCATGAAATATCCAGACGAAATGGAGGATTGCTCC
TTCATTAGGATTCTGGAGAGCACAGTTCTTGAGACAGCAATACAAGATTCGGCTGATAAGCATTCAGAAAAGCATGGAGAGGTTAGTGTAGAAGATTTTGAATTTTGTTC
TTTAGATAGAAAAAATGAAAAAGAATTGTTTAGGTGTGAGGATGTTTTTGAGTTTTTAGATTTAGATCAAAGGAAGGCTCCTCCAATTAAGCCATCCCTGATTGAGGCAC
CTACTTTAGATTTGAAACCCTTACCGGATCATCTAAAGTATGTGTATCTTGGGGAAAGTGAGACGTTGCCCATTATTGTTGCATCAGATTTAATGTCGGAGCATGAGGAG
GCCGTAATAAAATTGCTGCAGCAATACCGCAAGGCTATAGGTTGGACATTGGCTGATATTCAGGGAATTAGCCCATCTTTTTGTATGCACAAAATCACTCTAGAGGAGGG
ATCCTTTAGGAGTATTGAGCAACAAAGAAGGCTTAACCCTGCAATGAAAGAGGTTGTTAAAAAGGAGGTGATTAAATGGTTGGATGCTGGGATCATTTATCCAATTGCCG
ATAGCAATTGGGTAAGCCCTGTCCAATGTGTTCCTAAGAAAGGAGGTGTCACTGTGATGTTGGATCGATTGGCTGGTCAGGCCTATTACTGTTTCTTGGATGGTTATTCT
GGGTATAACCAGATTACTATTGCTCCTGAGGATCAGGAGAAAACCACTTTCACCTGTCCTTATGGGACATTCGCTTTTAGGAGAATGCCTTTCGGCCTCTGCAATGCTCC
AGCAACATTTCAGCGGTGTATGTTAGCAATTTTTTCTGATATGATTGAGTCCACTGTTGAGGTATTTATGGATGATTTTTCAGTGTTTGGAGGGTCTTTTCAGAATTGTT
TAGATAATTTAGGAAAGGTATTAAAAAGATGTGAGGATACCCATCTAGTTCTCAATTGGGAGAAATGCCACTTCATGGTAAAGGAGGGCGTAGTGTTAGGACATAAAATT
TCTAAAAATGGTCTACAAGTTGATAGAGCAAAAGTTGAGGTTATTGAAAGGCTAGAGCCACCGAATTCAGTGAAGGGAATTCGGAGTTTTTTAGGACATGCTGGATTTTA
TAGGAGCACCATTCTTTGCGCACCTAATTGGAATTTGCCATTTGAGGTAATGTGTGATGCGAGTGATGCTGCGGAATTTGACTTGGAGATAAAGGACAAGAAAGGATCAG
AAAATGTCATTGCAGATCATTTATCTCGTCTTGATCCGTCATCATCTTTGCTGGAGCAATCTGCCATTTCCGATTCTTTTCCAGATGAACAGCTCTTTGCTGTGGAGGTA
AAGGTAATTAGGGATGTCCCTTGCTATGCGAAAAAATCTGGAATCGCGAGAGCATCTCGACGCTGTCTATGTAGCGTCTCGATGCTGCGACGTCTTGGAGCCCAAGAAAA
GATGACGTCAGAGCGTCTCGACGCTGTCCTTGTAGCGTCTCAACGCTATGCCTGTTGGGAGCCTATAAAAGGAGCCTTTTGCATCCATATTTGGTGTGTTTCAGGTGCTG
AAGCAAAGGAAATCCTGGAGCAATGTCACTCTTCGCCGTATGGAGGTCATTTCAGCGGTCAGATGACAGCTATGAGGATTTTGCATTGCGGATTTTTCTGGCATACGTTA
TTCAAGGATGCCCATCGGTTCTACAAGCAATGTGATGCTTGCCAAAGGAGAGGAAATTTGGGGCCTAGAGATGAAATGGCTCTTACTTATATTTTAGAAGTTGAATTATT
CGATGTATGGGGTATTGACTTTATGGGGCCATTTCCTCCTTCTAATGGCAATGTTTTTATCTTATTGGCAGTTGATTATGTGTCCAAGTGGGTGGAGGCCATTGCATGCC
ATCAGAGTGATGCCAAGACAGTTACAAGGTTTCTTCAATCGCACATCTTTGCGCGGTTTGGGACACCTAGGGCTCTAGTGAGTGATGAGGGTACACATTTTGTTAATAAT
ATCTTAACTAAGCTTTTAGCTAAGTATGGGATTAAGCATAGGATAGCTACCCCTTATCACCCACAAGCAAATGGTCAAGCTGAAATTAGTAATAGGGAAATTAAAGCTAT
CTTAGAGAAAGTAGTCCATCCATCTAGAAAGGATTGGTCCTTTAGGTTGGATGAGGCTCTTTGGGCTTATAGGACAGCTTATAAGACTCCTCTAGGTATGTCTCCCTATA
GGTTAGTATATGGGAAAGCTTGCCATTTACCATTAGAGCTTGAGCATAAAACATTTTGGGCTTTGAAAAAGTTAAATTTTGATCTGAGTCGTGCAGGAGCAATAAGAATG
CTGCAGCTTAATGAGTTAGAAGAATTTCGTCAATTTTCTTACGAAAATGCGAAAATTTATAAGGAAAAGACTAAGTTGTGGCATGACAAGAAAATAAAGTCCAAAGAATC
TGTCAAGGGTCAAAAAGTTTTGCTTTATAATTCTATATTAAAATTATTTCCTGGAAAATTAAAATCTAAATGGTCAGGACCGTTTATTGTGATTGAGGTTTTCCCCAATG
GAGCAATTACTTTGCAGGATGAAAAAGATGAGAGAGTGTTCAAGGTGAATGGACAGCGTGTGAAGCATTATTGGGGTGAGGAGTTTCAGTCGAAATATCCTTCCCTAAGA
CATTGGTACGTTTCTCCTCACTTCGTCTTCTTGATTCTATCTTGCCATCTACATTCTTTCTTTCTCCTTTCATTCTCTGCAAAACCCTTAAAAACTAAGAAGAAAAGAAC
ACCGGAGGAGAAAGAAGCTAAAAGAAGAAGGCGACAACAGCGGGAGGAAGAACAGGGAAAAGTGATAGAGGATGAGGCTGCTGCAGTGGTTGAAGAAAACCTGGAAGAAC
CACAAGAACATACCCCAGGAGAGGCTGATCAGAGACTTGCGGATACAGATACGGCTCGAGAGGAAAATGTTGAGGAGAATCAAGAACAGCAGGCTGAGATGGACCAAGAA
GCTCGTGTTGAGGTAATCATGCCCGAACCACCGAAACTTCGTCGCATAAAGAGAAAGGTCGGCCGTATTCAGGTGATTCGGACTGATACCCCGTCACCACCATCATCATA
TTCTGAGAGAGAAAAAGCAGAGCGAGAGGAAAGAGAGAAGAAAGAAGTTGAAGACAAAGAAAGAGAAGAAACAGAGAAGAAGACTGAGGAAGAGCAGTTGCTCAAGCGCA
GGGCAGAGAAGGGCAAAAATATTGCTGAAGCATCGGAAGAACACGAGGCGATAGAAGAACAGCAGGTTCCATTTGATCGCTTCGTCAATAATTTTGCCAGAGCAAAATAC
ACTGAGCTTCTGGAAAGGGATTTCTTGTTCAAACGCGGTTTCAGTGGTGATCTTCCACAGTTTCTGAGGACCGGTATTGCAGACCACGGCTGGGAGCTGTTTTGTGCGAA
GCCTGAATCTGTAAACGCACAGGTGGTGCGTGAATTTTATGCTAACATTATCAAGGAGGATGGTTTCCAGGTAATTGTCCGAGGAGTCGAGGTGGATTGGAGTCCCAGTG
CCATCAACGCACTGTACCATCTTCAAAACTTCCCCCATGCGGCGTATAATGAGATGGTTGTGGCGTCATCTGATGAGCAACTAAGTGAGGCTGTCAGGGAGGTAGGAATT
GAAGGGGCACAGTGGCAGTTATCCAAGACTCAGAAGAGGACATTCCAGTCGGCTTATTTGAAAAGGGATGCGAATACATGGATGGGATTTATTAGACAGAAGAGGCTTCC
AACAACACATGACTCGACAATCTCCAGTGAACGGGTTCTCCTAGCTTTTGTCATCTTACGGTCTCTCAGTATTAACGTAGGGAGGATCATTGCGAATGAGATTGCTGGAT
GTTGGAAAAAGAATGTGGGAAAACTATTCTTTCCGAACACAATCACGATGCTGTGCAGAAGAGCAGGGGTTCTGGTGGATGAGGGCGATGTGATCCTGTTTGATAAAGGG
ATCATAGACACGCCCAATTTGGCACGGCTCCAGCGTGTGCAGGAGGTACGTCAAGGTGGGCTTATCTACGGCATCAATACGATTCTAGAACAACTGGCACTTTCGGCCAG
TAGGCAGGAGTTTGCTGAGAGGCAGACTTTAACCTTCTAG
Protein sequenceShow/hide protein sequence
MSDPPGKEGAAQKPDGERAATPTVPEGPAEQTPAESIAAAKPLFEQNEQQNNQAENPILIANDRTRAIRAYAVPMFNELNPGIACPQIQAANFEMKPSFLGVSDSFVIQG
VPRDALRLTLFPYSLRDGAKSWLNSFAPGSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWEGLRSFCESVPTMVYLIRHNKKVKSVLEVDGVSTIRA
DLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVGNQRNNPYSNFYNPGWRNHPNFSWGGQGNNVQAQQKMNQPGFAKAGIAPAN
FDEGTGIASGSVANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSELESGQGAGGSNKNAGASGSVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFKNAILKNGLPP
KAKDPGSFTIPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSITYPEGRPFLATGRTLIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCS
FIRILESTVLETAIQDSADKHSEKHGEVSVEDFEFCSLDRKNEKELFRCEDVFEFLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMSEHEE
AVIKLLQQYRKAIGWTLADIQGISPSFCMHKITLEEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVMLDRLAGQAYYCFLDGYS
GYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQNCLDNLGKVLKRCEDTHLVLNWEKCHFMVKEGVVLGHKI
SKNGLQVDRAKVEVIERLEPPNSVKGIRSFLGHAGFYRSTILCAPNWNLPFEVMCDASDAAEFDLEIKDKKGSENVIADHLSRLDPSSSLLEQSAISDSFPDEQLFAVEV
KVIRDVPCYAKKSGIARASRRCLCSVSMLRRLGAQEKMTSERLDAVLVASQRYACWEPIKGAFCIHIWCVSGAEAKEILEQCHSSPYGGHFSGQMTAMRILHCGFFWHTL
FKDAHRFYKQCDACQRRGNLGPRDEMALTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVSKWVEAIACHQSDAKTVTRFLQSHIFARFGTPRALVSDEGTHFVNN
ILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRM
LQLNELEEFRQFSYENAKIYKEKTKLWHDKKIKSKESVKGQKVLLYNSILKLFPGKLKSKWSGPFIVIEVFPNGAITLQDEKDERVFKVNGQRVKHYWGEEFQSKYPSLR
HWYVSPHFVFLILSCHLHSFFLLSFSAKPLKTKKKRTPEEKEAKRRRRQQREEEQGKVIEDEAAAVVEENLEEPQEHTPGEADQRLADTDTAREENVEENQEQQAEMDQE
ARVEVIMPEPPKLRRIKRKVGRIQVIRTDTPSPPSSYSEREKAEREEREKKEVEDKEREETEKKTEEEQLLKRRAEKGKNIAEASEEHEAIEEQQVPFDRFVNNFARAKY
TELLERDFLFKRGFSGDLPQFLRTGIADHGWELFCAKPESVNAQVVREFYANIIKEDGFQVIVRGVEVDWSPSAINALYHLQNFPHAAYNEMVVASSDEQLSEAVREVGI
EGAQWQLSKTQKRTFQSAYLKRDANTWMGFIRQKRLPTTHDSTISSERVLLAFVILRSLSINVGRIIANEIAGCWKKNVGKLFFPNTITMLCRRAGVLVDEGDVILFDKG
IIDTPNLARLQRVQEVRQGGLIYGINTILEQLALSASRQEFAERQTLTF