; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038687 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038687
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr2:23345657..23356902
RNA-Seq ExpressionLag0038687
SyntenyLag0038687
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR005162 - Retrotransposon gag domain
IPR012337 - Ribonuclease H-like superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]4.3e-25534.91Show/hide
Query:  NDRTRAIRAYAVPMFNELNLGIARPQIQAANFEMKPVMFQMLQTVGQFH---------------------------------------------------
        N + R ++ Y  P+ N+   GI R  I A NFE+KP +  M+Q   QF                                                    
Subjt:  NDRTRAIRAYAVPMFNELNLGIARPQIQAANFEMKPVMFQMLQTVGQFH---------------------------------------------------

Query:  --GSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWEGLRSFCESVP---------TMVYLIRYNKKVKSVLEVDGVSTIRADLAMIAN
          GSI +W ++AE FL+K+FPP + A+LRSEI  FRQ + E+  EAWE  +      P           ++    N + +++++     T+ +  A  A 
Subjt:  --GSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWEGLRSFCESVP---------TMVYLIRYNKKVKSVLEVDGVSTIRADLAMIAN

Query:  ALKNVTVISHQQPPA-------------MEPTAVV--------NQVTDEACVYCGEDHNYEFC-----PSNPASVFLVG-------NQRNNPYSNFYNPG
        +L      ++ Q P              +EP A +        +QV+        +   Y        P N AS   V        N R NP  N+Y+PG
Subjt:  ALKNVTVISHQQPPA-------------MEPTAVV--------NQVTDEACVYCGEDHNYEFC-----PSNPASVFLVG-------NQRNNPYSNFYNPG

Query:  WRNHPNFSWG------------------------DAAIQ-----------------------SNQ-ASMRALELQVGQLANELKARPQGKLPSDTEHPRR
         RNH NFS+G                        DA +                        SN  A+M+ LE+Q+GQLA  + A+ +G  PS+TE   +
Subjt:  WRNHPNFSWG------------------------DAAIQ-----------------------SNQ-ASMRALELQVGQLANELKARPQGKLPSDTEHPRR

Query:  EGCWG----------------------SNKNAGASGSVPDVE-----------PPYVP----PPPYVPPLPFPQR-------------------------
        E C                        +  N  +   V + E           PP +     PP    PLP+PQR                         
Subjt:  EGCWG----------------------SNKNAGASGSVPDVE-----------PPYVP----PPPYVPPLPFPQR-------------------------

Query:  -------------------QKPKNQDEWAT----------------PKAKDPGSFTIHVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTL
                            K +  +E+ T                 K KDPGSFT+  +IG     + LCDLGASINLMPLSVYRKLG+GE + TT++L
Subjt:  -------------------QKPKNQDEWAT----------------PKAKDPGSFTIHVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTL

Query:  QLADRS-----------------------------------------SIFGTGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV
        QLADRS                                             TGRAL+DVQKGELT+RV  EEV FN+++AMK+P++   C  + ++E  V
Subjt:  QLADRS-----------------------------------------SIFGTGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV

Query:  IETTIQDSADKHSEK------HG-----------------------------------------------------------------------------
        +E   +D    H E+      H                                                                              
Subjt:  IETTIQDSADKHSEK------HG-----------------------------------------------------------------------------

Query:  ----------------------------EQYRKAIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWV
                                     ++R A+GWT++DI+GISPS CMHKI ++E    SIE QRRLNPAMKEVV+ E++K L+AGIIY I+DS+WV
Subjt:  ----------------------------EQYRKAIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWV

Query:  SPVQCVLKKGGVTV------------------------------------------MLDRLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAF
        SPVQ V KKGG+TV                                          MLDRLA  +YYCFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAF
Subjt:  SPVQCVLKKGGVTV------------------------------------------MLDRLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAF

Query:  RRMSFHLCNAPATFQR------------------------------------------------------------------------------------
        RRM F LCNAPATFQR                                                                                    
Subjt:  RRMSFHLCNAPATFQR------------------------------------------------------------------------------------

Query:  -------------------------------------------------YCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQGKFI
                                                          C +AF  +K  LISAP++  P+W+ PFEVMCDASD A+GA+LGQ++ K  
Subjt:  -------------------------------------------------YCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQGKFI

Query:  HPIYYASKVLNEAQVNYTTTENELLAVVFAFEKFRPYLVGSKVTVFTDHATIRYLMTKKDAKPRLIHWVLLLQEFDLEIKDKKGSENVIADHLSRLDPSS
          IYYAS+ LNEAQ+NYTTTE E+LAVVFA +KFR YL+ +KV VFTDHA +RYL +KKDAKPRLI W+LLLQEFDLE++DKKGSEN +ADHLSRL+   
Subjt:  HPIYYASKVLNEAQVNYTTTENELLAVVFAFEKFRPYLVGSKVTVFTDHATIRYLMTKKDAKPRLIHWVLLLQEFDLEIKDKKGSENVIADHLSRLDPSS

Query:  SLLEQYAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDKPFMYKQCSDGIRKGIKVPTQRKRIDWTLRLGPNG
         +     I ++FPDEQLFA E+K    +PWYADI NFL   V P D+ + Q+KKF HD K++ WD+P ++K+C D I +               R  P  
Subjt:  SLLEQYAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDKPFMYKQCSDGIRKGIKVPTQRKRIDWTLRLGPNG

Query:  PADQKLQRYIFSVSTDLDQYRLVNPSRVSVTPSGSNYRFTPGTSLYPLYISPFGGHFSGQRTAMRILHCGFFWPTLFKDAHWFFKQCDACQRRGNLGPRD
          ++++Q                                     L+  + S +GGHF   RTA ++L  GFFWP++F+D++   K CD CQR GN+  R 
Subjt:  PADQKLQRYIFSVSTDLDQYRLVNPSRVSVTPSGSNYRFTPGTSLYPLYISPFGGHFSGQRTAMRILHCGFFWPTLFKDAHWFFKQCDACQRRGNLGPRD

Query:  EMSLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHR
        E+ L  ILEVELFDVWGIDFMGPFPPS G V+ILLAVDYV KWVEAIA   +DAK V +FL  +IF RFGTPRA++SDEGTHF N +   LL+KYG+KH+
Subjt:  EMSLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHR

Query:  IA-PYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPL-------------------------------------GEIRMLQLN
        IA  YHPQ NGQAEISNREIK ILEK V+ +RKDW+ +LD+ALWAYRTA+KTP+                                     GE R+LQLN
Subjt:  IA-PYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPL-------------------------------------GEIRMLQLN

Query:  ELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFIKGQKV
        E++EFR  +YENAK+YKE+TK WHDK+I  +EF  GQ+V
Subjt:  ELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFIKGQKV

XP_038972405.1 uncharacterized protein LOC120104748 [Phoenix dactylifera]2.8e-25434.18Show/hide
Query:  NDRTRAIRAYAVPMFNELNLGIARPQIQAANFEMKPVMFQMLQTVGQFHG--------------------------------------------------
        N   R +  YAVP  N     I RP + A NFE+KP + QM+Q   QF G                                                  
Subjt:  NDRTRAIRAYAVPMFNELNLGIARPQIQAANFEMKPVMFQMLQTVGQFHG--------------------------------------------------

Query:  ---SIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWEGLRSFCESVP------TMVYLIRYN---------------------------
           S  TW+ L++ FLSKYFPP + AKLR++I  F Q + E+  EAWE  +      P       ++    YN                           
Subjt:  ---SIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWEGLRSFCESVP------TMVYLIRYN---------------------------

Query:  --------------------KKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNPASVFLVGN--
                            KKV  + +VDG++ + A +  +      +  ++    P +            +C  CG  H    C      V  V N  
Subjt:  --------------------KKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNPASVFLVGN--

Query:  ---QRNNPYSNFYNPGWRNHPNFSWGD------------------------------------------------AAIQSNQASMRALELQVGQLANELK
           Q+NNPYSN YNPGWRNHPNFSW D                                                A +    +S R +E+Q+GQLAN + 
Subjt:  ---QRNNPYSNFYNPGWRNHPNFSWGD------------------------------------------------AAIQSNQASMRALELQVGQLANELK

Query:  ARPQGKLPSDTEHPRREGCWG----SNKNAG--------------------ASGSVPDV---EPPYVPPPPYVPPLPFPQRQKPKNQDEW----------
        +R QG LPS TE   +E C      S K  G                     S  V D+     P  P  PYVPP+PFPQR K    D+           
Subjt:  ARPQGKLPSDTEHPRREGCWG----SNKNAG--------------------ASGSVPDV---EPPYVPPPPYVPPLPFPQRQKPKNQDEW----------

Query:  --------------------------------------------------ATPKAKDPGSFTIHVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEAR
                                                            PK +DPGSF+I  +IG  +  RALCDLGAS++LMPLSV RKLG+ E +
Subjt:  --------------------------------------------------ATPKAKDPGSFTIHVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEAR

Query:  PTTVTLQLADRS-----------------------------------------SIFGTGRALIDVQKGELTMRVCNEEVKFNVFKAMKYP----------
        PTT++LQLADRS                                             T  A+IDV+ G LT++V  EEV+FN+F+A KYP          
Subjt:  PTTVTLQLADRS-----------------------------------------SIFGTGRALIDVQKGELTMRVCNEEVKFNVFKAMKYP----------

Query:  -------------------------------DEMEDCSFIRILEST-----------------------------------------------------V
                                       D +E       LE+T                                                     +
Subjt:  -------------------------------DEMEDCSFIRILEST-----------------------------------------------------V

Query:  IETTIQDSADKHSEKHGEQYRKAIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVLKKGG
        +  ++ D       +     +KAIGWT++D++GISPS CMH+I +++     +E QRRLNP MKEVV+ EV+KWLDAGIIYPI+DS W+SPVQ V KKGG
Subjt:  IETTIQDSADKHSEKHGEQYRKAIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVLKKGG

Query:  VTV------------------------------------------MLDRLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMSFHLCNAP
        +TV                                          +L+RLA  AYYCFLDGYSGYNQI+I+PEDQEKTTFTCPYGTFAFRRM F LCNAP
Subjt:  VTV------------------------------------------MLDRLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMSFHLCNAP

Query:  ATFQRY----------------------------------------------------------------------------------------------
        ATFQR                                                                                               
Subjt:  ATFQRY----------------------------------------------------------------------------------------------

Query:  ---------------------------------------CRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQGKFIHPIYYASKVLN
                                               C  AF  LK  L+SAPI+ AP+W+LPFE+MCDASD A+GA+LGQ++ + +H IYYAS+VLN
Subjt:  ---------------------------------------CRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQGKFIHPIYYASKVLN

Query:  EAQVNYTTTENELLAVVFAFEKFRPYLVGSKVTVFTDHATIRYLMTKKDAKPRLIHWVLLLQEFDLEIKDKKGSENVIADHLSRLDPSSSLLEQYAISDS
         AQ+NY TTE ELLAVVFAF+KFR YLVGSKV V+TDH+ I+YL+ KKDAKPRLI WVLLLQEFDLEI+DK+G ENV+ADHLSRL+   S  ++  I++S
Subjt:  EAQVNYTTTENELLAVVFAFEKFRPYLVGSKVTVFTDHATIRYLMTKKDAKPRLIHWVLLLQEFDLEIKDKKGSENVIADHLSRLDPSSSLLEQYAISDS

Query:  FPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDKPFMYKQCSDG-IRKGIKVPTQRKRIDWTLRLGPNGPADQKLQRYI
        FPDEQL AV V     +PWYAD+ N+LV G+ P D+ + QKKKF  D K ++W++P +YK C+DG IR+ +        +D+   L              
Subjt:  FPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDKPFMYKQCSDG-IRKGIKVPTQRKRIDWTLRLGPNGPADQKLQRYI

Query:  FSVSTDLDQYRLVNPSRVSVTPSGSNYRFTPGTSLYPLYISPFGGHFSGQRTAMRILHCGFFWPTLFKDAHWFFKQCDACQRRGNLGPRDEMSLTYILEV
                                                   GGHFS  +T  ++   GF+WPT+++D   +   CD CQR GN+  ++EM LT ILEV
Subjt:  FSVSTDLDQYRLVNPSRVSVTPSGSNYRFTPGTSLYPLYISPFGGHFSGQRTAMRILHCGFFWPTLFKDAHWFFKQCDACQRRGNLGPRDEMSLTYILEV

Query:  ELFDVWGIDFMGPFPPSNGNVFILLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIA-PYHPQAN
        ELFD+WGIDFMGPFP S  N +IL+AVDYV KWVEA A   +D++ V RF++ +IF+RFG PRA++SDEG+HF N     LL KYG+ H++A  YHPQ N
Subjt:  ELFDVWGIDFMGPFPPSNGNVFILLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIA-PYHPQAN

Query:  GQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPL-------------------------------------GEIRMLQLNELEEFRQFSY
        GQ E++NRE+K ILEK V  SRKDW+ +LD+ALWAYRTA+KTPL                                     GE R+LQL+ELEEFR  +Y
Subjt:  GQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPL-------------------------------------GEIRMLQLNELEEFRQFSY

Query:  ENAKMYKEKTKLWHDKKIKSKEFIKGQKV
        EN ++YKEKTK WHDK ++ + F  GQ+V
Subjt:  ENAKMYKEKTKLWHDKKIKSKEFIKGQKV

XP_038973683.1 uncharacterized protein LOC120105384 [Phoenix dactylifera]6.3e-25434.3Show/hide
Query:  NDRTRAIRAYAVPMFNELNLGIARPQIQAANFEMKPVMFQMLQTVGQFHG--------------------------------------------------
        N   R +  YAVP  N     I RP + A NFE+KP + QM+Q   QF G                                                  
Subjt:  NDRTRAIRAYAVPMFNELNLGIARPQIQAANFEMKPVMFQMLQTVGQFHG--------------------------------------------------

Query:  ---SIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWEGLRSFCESVP------TMVYLIRYN---------------------------
           S  TW+ L++ FLSKYFPP + AKLR++I  F Q + E+  EAWE  +      P       ++    YN                           
Subjt:  ---SIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWEGLRSFCESVP------TMVYLIRYN---------------------------

Query:  --------------------KKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNPASVFLVGN--
                            KKV  + +VDG++ + A +  +      +  ++    P +            +C  CG  H    C      V  V N  
Subjt:  --------------------KKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNPASVFLVGN--

Query:  ---QRNNPYSNFYNPGWRNHPNFSWGD------------------------------------------------AAIQSNQASMRALELQVGQLANELK
           Q+NNPYSN YNPGWRNHPNFSW D                                                A +    +S R +E+Q+GQLAN + 
Subjt:  ---QRNNPYSNFYNPGWRNHPNFSWGD------------------------------------------------AAIQSNQASMRALELQVGQLANELK

Query:  ARPQGKLPSDTEHPRREGCWG----SNKNAG--------------------ASGSVPDV---EPPYVPPPPYVPPLPFPQRQKPKNQDEW----------
        +R QG LPS TE   +E C      S K  G                     S  V D+     P  P  PYVPP+PFPQR K    D+           
Subjt:  ARPQGKLPSDTEHPRREGCWG----SNKNAG--------------------ASGSVPDV---EPPYVPPPPYVPPLPFPQRQKPKNQDEW----------

Query:  --------------------------------------------------ATPKAKDPGSFTIHVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEAR
                                                            PK +DPGSF+I  +IG  +  RALCDLGAS++LMPLSV RKLG+ E +
Subjt:  --------------------------------------------------ATPKAKDPGSFTIHVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEAR

Query:  PTTVTLQLADRS-----------------------------------------SIFGTGRALIDVQKGELTMRVCNEEVKFNVFKAMKYP----------
        PTT++LQLADRS                                             T  A+IDV+ G LT++V  EEV+FN+F+A KYP          
Subjt:  PTTVTLQLADRS-----------------------------------------SIFGTGRALIDVQKGELTMRVCNEEVKFNVFKAMKYP----------

Query:  -------------------------------DEMEDCSFIRILEST-----------------------------------------------------V
                                       D +E       LE+T                                                     +
Subjt:  -------------------------------DEMEDCSFIRILEST-----------------------------------------------------V

Query:  IETTIQDSADKHSEKHGEQYRKAIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVLKKGG
        +  ++ D       +     +KAIGWT++D++GISPS CMH+I +++     +E QRRLNP MKEVV+ EV+KWLDAGIIYPI+DS W+SPVQ V KKGG
Subjt:  IETTIQDSADKHSEKHGEQYRKAIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVLKKGG

Query:  VTV------------------------------------------MLDRLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMSFHLCNAP
        +TV                                          +L+RLA  AYYCFLDGYSGYNQI+I+PEDQEKTTFTCPYGTFAFRRM F LCNAP
Subjt:  VTV------------------------------------------MLDRLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMSFHLCNAP

Query:  ATFQRY----------------------------------------------------------------------------------------------
        ATFQR                                                                                               
Subjt:  ATFQRY----------------------------------------------------------------------------------------------

Query:  ---------------------------------------CRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQGKFIHPIYYASKVLN
                                               C  AF  LK  L+SAPI+ AP+W+LPFE+MCDASD A+GA+LGQ++ + +H IYYAS+VLN
Subjt:  ---------------------------------------CRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQGKFIHPIYYASKVLN

Query:  EAQVNYTTTENELLAVVFAFEKFRPYLVGSKVTVFTDHATIRYLMTKKDAKPRLIHWVLLLQEFDLEIKDKKGSENVIADHLSRLDPSSSLLEQYAISDS
         AQ+NY TTE ELLAVVFAF+KFR YLVGSKV V+TDH+ I+YL+ KKDAKPRLI WVLLLQEFDLEI+DK+G ENV+ADHLSRL+   S  ++  I++S
Subjt:  EAQVNYTTTENELLAVVFAFEKFRPYLVGSKVTVFTDHATIRYLMTKKDAKPRLIHWVLLLQEFDLEIKDKKGSENVIADHLSRLDPSSSLLEQYAISDS

Query:  FPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDKPFMYKQCSDG-IRKGIKVPTQRKRIDWTLRLGPNGPADQKLQRYI
        FPDEQL AV V     +PWYAD+ N+LV G+ P D+ + QKKKF  D K ++W++P +YK C+DG IR+ +                   P D+      
Subjt:  FPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDKPFMYKQCSDG-IRKGIKVPTQRKRIDWTLRLGPNGPADQKLQRYI

Query:  FSVSTDLDQYRLVNPSRVSVTPSGSNYRFTPGTSLYPLYISPFGGHFSGQRTAMRILHCGFFWPTLFKDAHWFFKQCDACQRRGNLGPRDEMSLTYILEV
          +   LD    +                              GGHFS  +T  ++   GF+WPT+++D   +   CD CQR GN+  ++EM LT ILEV
Subjt:  FSVSTDLDQYRLVNPSRVSVTPSGSNYRFTPGTSLYPLYISPFGGHFSGQRTAMRILHCGFFWPTLFKDAHWFFKQCDACQRRGNLGPRDEMSLTYILEV

Query:  ELFDVWGIDFMGPFPPSNGNVFILLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIA-PYHPQAN
        ELFD+WGIDFMGPFP S  N +IL+AVDYV KWVEA A   +D++ V RF++ +IF+RFG PRA++SDEG+HF N     LL KYG+ H++A  YHPQ N
Subjt:  ELFDVWGIDFMGPFPPSNGNVFILLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIA-PYHPQAN

Query:  GQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPL-------------------------------------GEIRMLQLNELEEFRQFSY
        GQ E++NRE+K ILEK V  SRKDW+ +LD+ALWAYRTA+KTPL                                     GE R+LQL+ELEEFR  +Y
Subjt:  GQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPL-------------------------------------GEIRMLQLNELEEFRQFSY

Query:  ENAKMYKEKTKLWHDKKIKSKEFIKGQKV
        EN ++YKEKTK WHDK ++ + F  GQ+V
Subjt:  ENAKMYKEKTKLWHDKKIKSKEFIKGQKV

XP_038976300.1 uncharacterized protein LOC120107204 [Phoenix dactylifera]8.2e-25434.36Show/hide
Query:  NDRTRAIRAYAVPMFNELNLGIARPQIQAANFEMKPVMFQMLQTVGQFHG--------------------------------------------------
        N   R +  YAVP  N     I RP + A NFE+KP + QM+Q   QF G                                                  
Subjt:  NDRTRAIRAYAVPMFNELNLGIARPQIQAANFEMKPVMFQMLQTVGQFHG--------------------------------------------------

Query:  ---SIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWEGLRSFCESVP------TMVYLIRYN---------------------------
           S  TW+ L++ FLSKYFPP + AKLR++I  F Q + E+  EAWE  +      P       ++    YN                           
Subjt:  ---SIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWEGLRSFCESVP------TMVYLIRYN---------------------------

Query:  --------------------KKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNPASVFLVGN--
                            KKV  + +VDG++ + A +  +      +  ++    P +            +C  CG  H    C      V  V N  
Subjt:  --------------------KKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNPASVFLVGN--

Query:  ---QRNNPYSNFYNPGWRNHPNFSWGD------------------------------------------------AAIQSNQASMRALELQVGQLANELK
           Q+NNPYSN YNPGWRNHPNFSW D                                                A +    +S R +E+Q+GQLAN + 
Subjt:  ---QRNNPYSNFYNPGWRNHPNFSWGD------------------------------------------------AAIQSNQASMRALELQVGQLANELK

Query:  ARPQGKLPSDTEHPRREGCWG----SNKNAG--------------------ASGSVPDV---EPPYVPPPPYVPPLPFPQRQKPKNQDEW----------
        +R QG LPS TE   +E C      S K  G                     S  V D+     P  P  PYVPP+PFPQR K    D+           
Subjt:  ARPQGKLPSDTEHPRREGCWG----SNKNAG--------------------ASGSVPDV---EPPYVPPPPYVPPLPFPQRQKPKNQDEW----------

Query:  --------------------------------------------------ATPKAKDPGSFTIHVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEAR
                                                            PK +DPGSF+I  +IG  +  RALCDLGAS++LMPLSV RKLG+ E +
Subjt:  --------------------------------------------------ATPKAKDPGSFTIHVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEAR

Query:  PTTVTLQLADRS-----------------------------------------SIFGTGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIR
        PTT++LQLADRS                                             T  A+IDV+ G LT++V  EEV+FN+F+A KYP   +    + 
Subjt:  PTTVTLQLADRS-----------------------------------------SIFGTGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIR

Query:  ILESTV------------IETTIQD---SADKHSE----------------KHGEQY-------------------------------------------
        +++ +             +ET +     S D + E                K G  +                                           
Subjt:  ILESTV------------IETTIQD---SADKHSE----------------KHGEQY-------------------------------------------

Query:  --------------------RKAIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVLKKGG
                            +KAIGWT++D++GISPS CMH+I +++     +E QRRLNP MKEVV+ EV+KWLDAGIIYPI+DS W+SPVQ V KKGG
Subjt:  --------------------RKAIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVLKKGG

Query:  VTV------------------------------------------MLDRLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMSFHLCNAP
        +TV                                          +L+RLA  AYYCFLDGYSGYNQI+I+PEDQEKTTFTCPYGTFAFRRM F LCNAP
Subjt:  VTV------------------------------------------MLDRLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMSFHLCNAP

Query:  ATFQRY----------------------------------------------------------------------------------------------
        ATFQR                                                                                               
Subjt:  ATFQRY----------------------------------------------------------------------------------------------

Query:  ---------------------------------------CRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQGKFIHPIYYASKVLN
                                               C  AF  LK  L+SAPI+ AP+W+LPFE+MCDASD A+GA+LGQ++ + +H IYYAS+VLN
Subjt:  ---------------------------------------CRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQGKFIHPIYYASKVLN

Query:  EAQVNYTTTENELLAVVFAFEKFRPYLVGSKVTVFTDHATIRYLMTKKDAKPRLIHWVLLLQEFDLEIKDKKGSENVIADHLSRLDPSSSLLEQYAISDS
         AQ+NY TTE ELLAVVFAF+KFR YLVGSKV V+TDH+ I+YL+ KKDAKPRLI WVLLLQEFDLEI+DK+G ENV+ADHLSRL+   S  ++  I++S
Subjt:  EAQVNYTTTENELLAVVFAFEKFRPYLVGSKVTVFTDHATIRYLMTKKDAKPRLIHWVLLLQEFDLEIKDKKGSENVIADHLSRLDPSSSLLEQYAISDS

Query:  FPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDKPFMYKQCSDG-IRKGIKVPTQRKRIDWTLRLGPNGPADQKLQRYI
        FPDEQL AV V     +PWYAD+ N+LV G+ P D+ + QKKKF  D K ++W++P +YK C+DG IR+ +                   P D+      
Subjt:  FPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDKPFMYKQCSDG-IRKGIKVPTQRKRIDWTLRLGPNGPADQKLQRYI

Query:  FSVSTDLDQYRLVNPSRVSVTPSGSNYRFTPGTSLYPLYISPFGGHFSGQRTAMRILHCGFFWPTLFKDAHWFFKQCDACQRRGNLGPRDEMSLTYILEV
          +   LD    +                              GGHFS  +T  ++   GF+WPT+++D   +   CD CQR GN+  ++EM LT ILEV
Subjt:  FSVSTDLDQYRLVNPSRVSVTPSGSNYRFTPGTSLYPLYISPFGGHFSGQRTAMRILHCGFFWPTLFKDAHWFFKQCDACQRRGNLGPRDEMSLTYILEV

Query:  ELFDVWGIDFMGPFPPSNGNVFILLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIA-PYHPQAN
        ELFD+WGIDFMGPFP S  N +IL+AVDYV KWVEA A   +D++ V RF++ +IF+RFG PRA++SDEG+HF N     LL KYG+ H++A  YHPQ N
Subjt:  ELFDVWGIDFMGPFPPSNGNVFILLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIA-PYHPQAN

Query:  GQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPL-------------------------------------GEIRMLQLNELEEFRQFSY
        GQ E++NRE+K ILEK V  SRKDW+ +LD+ALWAYRTA+KTPL                                     GE R+LQL+ELEEFR  +Y
Subjt:  GQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPL-------------------------------------GEIRMLQLNELEEFRQFSY

Query:  ENAKMYKEKTKLWHDKKIKSKEFIKGQKV
        EN ++YKEKTK WHDK ++ + F  GQ+V
Subjt:  ENAKMYKEKTKLWHDKKIKSKEFIKGQKV

XP_038976409.1 uncharacterized protein LOC113461320 [Phoenix dactylifera]1.1e-25334.2Show/hide
Query:  NDRTRAIRAYAVPMFNELNLGIARPQIQAANFEMKPVMFQMLQTVGQFHG--------------------------------------------------
        N   R +  YAVP  N     I RP + A NFE+KP + QM+Q   QF G                                                  
Subjt:  NDRTRAIRAYAVPMFNELNLGIARPQIQAANFEMKPVMFQMLQTVGQFHG--------------------------------------------------

Query:  ---SIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWEGLRSFCESVP------TMVYLIRYN---------------------------
           S  TW+ L++ FLSKYFPP + AKLR++I  F Q + E+  EAWE  +      P       ++    YN                           
Subjt:  ---SIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWEGLRSFCESVP------TMVYLIRYN---------------------------

Query:  --------------------KKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDH-NYEFCPSNPASVFLVGNQ
                            KKV  + +VDG++ + A +  +      +  ++    P +            +C  CG  H + ++      S +    Q
Subjt:  --------------------KKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDH-NYEFCPSNPASVFLVGNQ

Query:  RNNPYSNFYNPGWRNHPNFSWGD------------------------------------------------AAIQSNQASMRALELQVGQLANELKARPQ
        +NNPYSN YNPGWRNHPNFSW D                                                A +    +S R +E+Q+GQLAN + +R Q
Subjt:  RNNPYSNFYNPGWRNHPNFSWGD------------------------------------------------AAIQSNQASMRALELQVGQLANELKARPQ

Query:  GKLPSDTEHPRREGCWG----SNKNAG--------------------ASGSVPDV---EPPYVPPPPYVPPLPFPQRQKPKNQDEW--------------
        G LPS TE   +E C      S K  G                     S  V D+     P  P  PYVPP+PFPQR K    D+               
Subjt:  GKLPSDTEHPRREGCWG----SNKNAG--------------------ASGSVPDV---EPPYVPPPPYVPPLPFPQRQKPKNQDEW--------------

Query:  ----------------------------------------------ATPKAKDPGSFTIHVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTV
                                                        PK +DPGSF+I  +IG  +  RALCDLGAS++LMPLSV RKLG+ E +PTT+
Subjt:  ----------------------------------------------ATPKAKDPGSFTIHVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTV

Query:  TLQLADRS-----------------------------------------SIFGTGRALIDVQKGELTMRVCNEEVKFNVFKAMKYP--------------
        +LQLADRS                                             T  A+IDV+ G LT++V  EEV+FN+F+A KYP              
Subjt:  TLQLADRS-----------------------------------------SIFGTGRALIDVQKGELTMRVCNEEVKFNVFKAMKYP--------------

Query:  ---------------------------DEMEDCSFIRILEST-----------------------------------------------------VIETT
                                   D +E       LE+T                                                     ++  +
Subjt:  ---------------------------DEMEDCSFIRILEST-----------------------------------------------------VIETT

Query:  IQDSADKHSEKHGEQYRKAIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVLKKGGVTV-
        + D       +     +KAIGWT++D++GISPS CMH+I +++     +E QRRLNP MKEVV+ EV+KWLDAGIIYPI+DS W+SPVQ V KKGG+TV 
Subjt:  IQDSADKHSEKHGEQYRKAIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVLKKGGVTV-

Query:  -----------------------------------------MLDRLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMSFHLCNAPATFQ
                                                 +L+RLA  AYYCFLDGYSGYNQI+I+PEDQEKTTFTCPYGTFAFRRM F LCNAPATFQ
Subjt:  -----------------------------------------MLDRLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMSFHLCNAPATFQ

Query:  RY--------------------------------------------------------------------------------------------------
        R                                                                                                   
Subjt:  RY--------------------------------------------------------------------------------------------------

Query:  -----------------------------------CRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQGKFIHPIYYASKVLNEAQV
                                           C  AF  LK  L+SAPI+ AP+W+LPFE+MCDASD A+GA+LGQ++ + +H IYYAS+VLN AQ+
Subjt:  -----------------------------------CRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQGKFIHPIYYASKVLNEAQV

Query:  NYTTTENELLAVVFAFEKFRPYLVGSKVTVFTDHATIRYLMTKKDAKPRLIHWVLLLQEFDLEIKDKKGSENVIADHLSRLDPSSSLLEQYAISDSFPDE
        NY TTE ELLAVVFAF+KFR YLVGSKV V+TDH+ I+YL+ KKDAKPRLI WVLLLQEFDLEI+DK+G ENV+ADHLSRL+   S  ++  I++SFPDE
Subjt:  NYTTTENELLAVVFAFEKFRPYLVGSKVTVFTDHATIRYLMTKKDAKPRLIHWVLLLQEFDLEIKDKKGSENVIADHLSRLDPSSSLLEQYAISDSFPDE

Query:  QLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDKPFMYKQCSDG-IRKGIKVPTQRKRIDWTLRLGPNGPADQKLQRYIFSVS
        QL AV V     +PWYAD+ N+LV G+ P D+ + QKKKF  D K ++W++P +YK C+DG IR+ +                   P D+        + 
Subjt:  QLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDKPFMYKQCSDG-IRKGIKVPTQRKRIDWTLRLGPNGPADQKLQRYIFSVS

Query:  TDLDQYRLVNPSRVSVTPSGSNYRFTPGTSLYPLYISPFGGHFSGQRTAMRILHCGFFWPTLFKDAHWFFKQCDACQRRGNLGPRDEMSLTYILEVELFD
          LD    +                              GGHFS  +T  ++   GF+WPT+++D   +   CD CQR GN+  ++EM LT ILEVELFD
Subjt:  TDLDQYRLVNPSRVSVTPSGSNYRFTPGTSLYPLYISPFGGHFSGQRTAMRILHCGFFWPTLFKDAHWFFKQCDACQRRGNLGPRDEMSLTYILEVELFD

Query:  VWGIDFMGPFPPSNGNVFILLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIA-PYHPQANGQAE
        +WGIDFMGPFP S  N +IL+AVDYV KWVEA A   +D++ V RF++ +IF+RFG PRA++SDEG+HF N     LL KYG+ H++A  YHPQ NGQ E
Subjt:  VWGIDFMGPFPPSNGNVFILLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIA-PYHPQANGQAE

Query:  ISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPL-------------------------------------GEIRMLQLNELEEFRQFSYENAK
        ++NRE+K ILEK V  SRKDW+ +LD+ALWAYRTA+KTPL                                     GE R+LQL+ELEEFR  +YEN +
Subjt:  ISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPL-------------------------------------GEIRMLQLNELEEFRQFSYENAK

Query:  MYKEKTKLWHDKKIKSKEFIKGQKV
        +YKEKTK WHDK ++ + F  GQ+V
Subjt:  MYKEKTKLWHDKKIKSKEFIKGQKV

TrEMBL top hitse value%identityAlignment
A0A2G9FWY3 Reverse transcriptase1.6e-25039.12Show/hide
Query:  KVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNPASVFLVGNQR---NNPYSNFYNPGWRNHPNF
        K   V+EVD V+ + A +  +  ++KN  V   Q  P               C  CGE H  + CP +  S+  V N R   NNPYSN YNPGWR HPNF
Subjt:  KVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNPASVFLVGNQR---NNPYSNFYNPGWRNHPNF

Query:  SW---------------GDAAIQ-------------------SNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREG------------------
        SW               G   +Q                   S  A+ + ++ Q+GQLAN + +RPQG LPS+TE +PR++G                  
Subjt:  SW---------------GDAAIQ-------------------SNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREG------------------

Query:  ----CWGSNKNAGASGSVPDVEPPY------VPPPPYVPPLPFPQR------QKPKNQDEWAT----------------PKAKDPGSFTIHVSIGGKELG
                 K   +     +VE P       +P    +  +P   +       K ++  ++ T                PK KDPGSFTI  +IG    G
Subjt:  ----CWGSNKNAGASGSVPDVEPPY------VPPPPYVPPLPFPQR------QKPKNQDEWAT----------------PKAKDPGSFTIHVSIGGKELG

Query:  RALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSSIFGTG---RALIDVQK-------GELTMRVCNE-------------EVKFNVFKAMKYPD
        RALCDLG    L           GEA+PT++TLQLADRS  +  G     L+ V K         L M V  E                 +V KAMK+P+
Subjt:  RALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSSIFGTG---RALIDVQK-------GELTMRVCNE-------------EVKFNVFKAMKYPD

Query:  EMEDCSFIRILESTV------------IETTIQDSADKHSEKHGE-------------------------------------------------------
        E ++C  + + ++ V            +E  + D  D+ +E+  E                                                       
Subjt:  EMEDCSFIRILESTV------------IETTIQDSADKHSEKHGE-------------------------------------------------------

Query:  ------------------------QYRKAIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQC
                                 ++ AIGWT+ADI+GISPSFCMHKI L++    S+E QRRLNP MKEVVKKE+IKWLDAGIIYPI+DS+WVSPVQC
Subjt:  ------------------------QYRKAIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQC

Query:  VLKKGGVTV------------------------------------------MLDRLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMSF
        V KKGG+TV                                          MLDRLA + +YCFLDGYSGYNQI IAPEDQEK TFTCPYGTFAFRRM F
Subjt:  VLKKGGVTV------------------------------------------MLDRLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMSF

Query:  HLCNAPATFQR-----------------------------------------------------------------------------------------
         LCNAPATFQR                                                                                         
Subjt:  HLCNAPATFQR-----------------------------------------------------------------------------------------

Query:  --------------------------------------------YCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQGKFIHPIYY
                                                     CR AF  LK  LISAPI+  P+W+ PFE+MCDASD AVGA+LGQ++ K    IYY
Subjt:  --------------------------------------------YCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQGKFIHPIYY

Query:  ASKVLNEAQVNYTTTENELLAVVFAFEKFRPYLVGSKVTVFTDHATIRYLMTKKDAKPRLIHWVLLLQEFDLEIKDKKGSENVIADHLSRLDPSSSLLEQ
        ASK LN+AQ+NYTTTE ELLAVVFAF+KFR YLVG+KV V+TDHA IRYL+ KKDAKPRLI WVLLLQEFDLEI+D+KG+EN IADHLSRL+  +   E 
Subjt:  ASKVLNEAQVNYTTTENELLAVVFAFEKFRPYLVGSKVTVFTDHATIRYLMTKKDAKPRLIHWVLLLQEFDLEIKDKKGSENVIADHLSRLDPSSSLLEQ

Query:  YAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDKPFMYKQCSDGIRKGIKVPTQRKRIDWTLRLGPNGPADQK
          I+D+FPDEQL A+   V  DVPWYADI N+L  G+ P D+  +QKKKF  D + ++WD PF++KQ  D I                            
Subjt:  YAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDKPFMYKQCSDGIRKGIKVPTQRKRIDWTLRLGPNGPADQK

Query:  LQRYIFSVSTDLDQYRLVNPSRVSVTPSGSNYRFTPGTSLYPLYISPFGGHFSGQRTAMRILHCGFFWPTLFKDAHWFFKQCDACQRRGNLGPRDEMSLT
        L+R +  +  +                            L   + SP+GGHF G RTA +IL  GFFWP LFKDAH F   CD CQR GN+  R EM L 
Subjt:  LQRYIFSVSTDLDQYRLVNPSRVSVTPSGSNYRFTPGTSLYPLYISPFGGHFSGQRTAMRILHCGFFWPTLFKDAHWFFKQCDACQRRGNLGPRDEMSLT

Query:  YILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIA-PY
         ILEVELFDVWGIDFMGPF PS GN++IL+AVDYV KWVEA A   +D+K V  F++ +IF RFGTPRA++SD GTHF N     LL+KYG+KH+I+ PY
Subjt:  YILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIA-PY

Query:  HPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPL-------------------------------------GEIRMLQLNELEEF
        HPQ +GQ E+SNREIK ILEK V  +RKDWS RLDEALWAYRTAYKTP+                                     GE R+LQLNEL+EF
Subjt:  HPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPL-------------------------------------GEIRMLQLNELEEF

Query:  RQFSYENAKMYKEKTKLWHDKKIKSKEFIKGQKV
        R  +YENAK+YKEK K WH+KKI  + F  GQ V
Subjt:  RQFSYENAKMYKEKTKLWHDKKIKSKEFIKGQKV

A0A2G9HWF8 Reverse transcriptase1.4e-25138.55Show/hide
Query:  SIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWEGLRSFCESVP-----------TMVYLIRYNKKVKSVLEVDGVSTIRADLAMIANA
        SI TW +L E F+SK+F P + A LR+EI+ FRQ   ET  EAW   R    + P           T  + +    K K +  ++G S +    A   N 
Subjt:  SIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWEGLRSFCESVP-----------TMVYLIRYNKKVKSVLEVDGVSTIRADLAMIANA

Query:  LKNVTVISHQQ------PPAMEPTAVVNQVT-----------DEACVYCGEDHNYEFCPSNPASVFLVGNQR---NNPYSNFYNPGWRNHPNFSWGDAAI
        L N+ V  +++      PP       V+QVT                 CGE H  + CP +  S+  V N R   NNPYSN YNPGWR HPNFSW +   
Subjt:  LKNVTVISHQQ------PPAMEPTAVVNQVT-----------DEACVYCGEDHNYEFCPSNPASVFLVGNQR---NNPYSNFYNPGWRNHPNFSWGDAAI

Query:  QSNQASMRALELQVGQLANELKARPQGKLPSDTEHPR---------REGCWGSNKNAGASGSVPDVEPPYVPPPPYVPPLPFPQR---QKPKNQ------
        Q +    +    Q GQ   +   R  GK        R         +E      K   +     +VE P     P     PFPQR   QK K Q      
Subjt:  QSNQASMRALELQVGQLANELKARPQGKLPSDTEHPR---------REGCWGSNKNAGASGSVPDVEPPYVPPPPYVPPLPFPQR---QKPKNQ------

Query:  -----------------------------------DEWAT----------------PKAKDPGSFTIHVSIGGKELGRALCDLGASINLMPLSVYRKLGI
                                            ++ T                PK KDPG              RALCDLGASINLMP S+YR LG+
Subjt:  -----------------------------------DEWAT----------------PKAKDPGSFTIHVSIGGKELGRALCDLGASINLMPLSVYRKLGI

Query:  GEARPTTVTLQLADRSSIF-----------------------------------------GTGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDC
         EA+PT++TLQLADRS  +                                          TGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C
Subjt:  GEARPTTVTLQLADRSSIF-----------------------------------------GTGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDC

Query:  SFIRILESTV------------IETTIQDSADKHSEKHGE------------------------------------------------------------
          + + ++              +E  + D  D+ +E+  E                                                            
Subjt:  SFIRILESTV------------IETTIQDSADKHSEKHGE------------------------------------------------------------

Query:  -------------------QYRKAIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVLKKG
                            ++ AIGWT+ADI+GISPSFCMHKI L++    S+E QRRLNP MKEVVKKE+IKWLDAGIIYPI+D +W+SPVQCV KKG
Subjt:  -------------------QYRKAIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVLKKG

Query:  GVTV------------------------------------------MLDRLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMSFHLCNA
        G+TV                                          MLDRLA + +YCFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAFRR+ F LCNA
Subjt:  GVTV------------------------------------------MLDRLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMSFHLCNA

Query:  PATFQR----------------------------------------------------------------------------------------------
        PATFQR                                                                                              
Subjt:  PATFQR----------------------------------------------------------------------------------------------

Query:  ---------------------------------------YCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQGKFIHPIYYASKVL
                                                C  AF+ LK  LISAPI+  P+W+ PFE+MCDASD A+GA+LGQ++ K    IYYASK L
Subjt:  ---------------------------------------YCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQGKFIHPIYYASKVL

Query:  NEAQVNYTTTENELLAVVFAFEKFRPYLVGSKVTVFTDHATIRYLMTKKDAKPRLIHWVLLLQEFDLEIKDKKGSENVIADHLSRLDPSSSLLEQYAISD
        N+AQ+NYTTTE ELLAVVFAF+KFR YLVG+KV V+TDHA IRYL+ KKDAKPRLI WVLLLQEFDLEI+D+KG EN IADHLSRL+  +   E   I+D
Subjt:  NEAQVNYTTTENELLAVVFAFEKFRPYLVGSKVTVFTDHATIRYLMTKKDAKPRLIHWVLLLQEFDLEIKDKKGSENVIADHLSRLDPSSSLLEQYAISD

Query:  SFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDKPFMYKQCSDGIRKGIKVPTQRKRIDWTLRLGPNGPADQKLQRYI
        +FPDEQL A+   V  DVPWYADI N+L  G+ P D+  +QKKKF  D + ++WD PF++KQ  D I                            L+R +
Subjt:  SFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDKPFMYKQCSDGIRKGIKVPTQRKRIDWTLRLGPNGPADQKLQRYI

Query:  FSVSTDLDQYRLVNPSRVSVTPSGSNYRFTPGTSLYPLYISPFGGHFSGQRTAMRILHCGFFWPTLFKDAHWFFKQCDACQRRGNLGPRDEMSLTYILEV
          +  +                                + SP+GGHF   RTA +IL  GFFWP LFKD H F   CD CQR GN+  R EM L  ILEV
Subjt:  FSVSTDLDQYRLVNPSRVSVTPSGSNYRFTPGTSLYPLYISPFGGHFSGQRTAMRILHCGFFWPTLFKDAHWFFKQCDACQRRGNLGPRDEMSLTYILEV

Query:  ELFDVWGIDFMGPFPPSNGNVFILLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIA-PYHPQAN
        ELFDVWGIDFMGPF PS GN++IL+AVDY+ KWVEA+A   +D+K V  F++ +IF RFGTPRA++SD GTHF N     LL+KYG+KH+I+ PYHPQ +
Subjt:  ELFDVWGIDFMGPFPPSNGNVFILLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIA-PYHPQAN

Query:  GQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLG
        GQ E+SNREIK  LEK V  +RKDWS RLDEALWAYRTA+KTP+G
Subjt:  GQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLG

A0A2G9I3H5 Reverse transcriptase3.5e-25037.19Show/hide
Query:  DPEIERTFKSSEPPAEPLLQQNPLFEQNEQQNNQAENPILIANDRTRAIRAYAVPMFNELNLGIARPQIQAANFEMKPVMFQMLQTVGQFHG--------
        DP+IERTF++        ++Q    E+N       +N ++  N     +R  A+P   E    +  P++     +++  M +M+Q   QF G        
Subjt:  DPEIERTFKSSEPPAEPLLQQNPLFEQNEQQNNQAENPILIANDRTRAIRAYAVPMFNELNLGIARPQIQAANFEMKPVMFQMLQTVGQFHG--------

Query:  ----SIRTWD-------ELAENFLSKYFPPNRNAKLRSEIVGFRQLE--------------DETFSEAW-EGLRSFCESVPTMVYLIRYNKKVK------
             ++  D       EL E F+SK+F P + A LR+EI+ FRQ+               D    +++  G  + C ++   +    Y KK K      
Subjt:  ----SIRTWD-------ELAENFLSKYFPPNRNAKLRSEIVGFRQLE--------------DETFSEAW-EGLRSFCESVPTMVYLIRYNKKVK------

Query:  --SVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNPASVFLVGN---QRNNPYSNFYNPGWRNHPNFS
           V+E+D V+ + A +  +  ++KN                   Q T   C  C E H  + CP +  S+  V N    +NNPYSN YN GWR HPNFS
Subjt:  --SVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYCGEDHNYEFCPSNPASVFLVGN---QRNNPYSNFYNPGWRNHPNFS

Query:  WGD-------------------AAIQ---------------SNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREG---CW--------------
        W +                    AIQ               S  A+ + +E Q+GQLAN + +RPQG LPS+TE +PR++G   C               
Subjt:  WGD-------------------AAIQ---------------SNQASMRALELQVGQLANELKARPQGKLPSDTE-HPRREG---CW--------------

Query:  -----GSNKNAGASGSVPDVEPP-----YVPPPPYVPPLPFPQRQKPKNQDEWATP---------------KAKDPGSFTIHVSIGGKELGRALCDLGAS
                K   +  +  +VE P         P YV  +     +K +  D                    K KD GSF I  +IG    GRALCDL AS
Subjt:  -----GSNKNAGASGSVPDVEPP-----YVPPPPYVPPLPFPQRQKPKNQDEWATP---------------KAKDPGSFTIHVSIGGKELGRALCDLGAS

Query:  INLMPLSVYRKLG---------------------IGEARPTTVTLQLADRSSI------FGTGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDC
        INLMP S+YR L                      I  A    + +++  +  I        TGR LIDVQKGELTMRV ++++ FN+FKAMK+P+E ++ 
Subjt:  INLMPLSVYRKLG---------------------IGEARPTTVTLQLADRSSI------FGTGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDC

Query:  SFIRILES------------TVIETTIQDSADKHSEKHGE------------------------------------------------------------
          + +L++              +E  + D  D+ +E+  E                                                            
Subjt:  SFIRILES------------TVIETTIQDSADKHSEKHGE------------------------------------------------------------

Query:  QYRKAIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVLKKGGVTV---------------
         ++ AIGWT+ADI+GISPSFCMHKI L+     S+E QRRLNP MKEVVKKE+IKWLD GIIYPI+DS+WVS VQCV KKGG+TV               
Subjt:  QYRKAIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVLKKGGVTV---------------

Query:  ---------------------------MLDRLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMSFHLCNAPATFQR-------------
                                   MLDRLA + +YCFLDGYSGYNQI IAPEDQEKT FTCPYGTFAFRRM F LCNAPATFQR             
Subjt:  ---------------------------MLDRLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMSFHLCNAPATFQR-------------

Query:  ---------------------------------------------------------------------------YCRKAFETLKAALISAPILCAPNWN
                                                                                    C  AF  LK  LISAPI+  P+W 
Subjt:  ---------------------------------------------------------------------------YCRKAFETLKAALISAPILCAPNWN

Query:  LPFEVMCDASDAAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTENELLAVVFAFEKFRPYLVGSKVTVFTDHATIRYLMTKKDAKPRLIHWVLLLQE
         PFE+MCDASD A+GA+LGQ++ K    IYYASK LN+AQ+NYTTT+  LLAVVFAF+KFR YLVG+KV V+TDH+ IRYL+ KKDAKPRLI WVLLLQE
Subjt:  LPFEVMCDASDAAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTENELLAVVFAFEKFRPYLVGSKVTVFTDHATIRYLMTKKDAKPRLIHWVLLLQE

Query:  FDLEIKDKKGSENVIADHLSRLDPSSSLLEQYAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDKPFMYKQCS
        FDLEI+D+KG+EN IADHLSR +  +   E   I+D+FPDEQL A+   V  D                      +QKKKF  D   ++WD PF++KQCS
Subjt:  FDLEIKDKKGSENVIADHLSRLDPSSSLLEQYAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDKPFMYKQCS

Query:  DGIRKGIKVPTQRKRIDWTLRLGPNGPADQKLQRYIFSVSTDLDQYRLVNPSRVSVTPSGSNYRFTPGTSLYPLYISPFGGHFSGQRTAMRILHCGFFWP
        D I +      +   I                                                      L   + SP+GGHF G RTA +IL   F+WP
Subjt:  DGIRKGIKVPTQRKRIDWTLRLGPNGPADQKLQRYIFSVSTDLDQYRLVNPSRVSVTPSGSNYRFTPGTSLYPLYISPFGGHFSGQRTAMRILHCGFFWP

Query:  TLFKDAHWFFKQCDACQRRGNLGPRDEMSLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRA
         LFKD+H F   CD CQR GN+  R  M L  ILEVELFDVWGIDFMGPF PS GN++IL+AVDYV KWVE++A   +D+K V  F++ +IF RFGT RA
Subjt:  TLFKDAHWFFKQCDACQRRGNLGPRDEMSLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRA

Query:  LVSDEGTHFVNNILTKLLAKYGIKHRI-APYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPL--------------------
        ++SD GTHF N     LL+KYG+KH+I  PYHPQ +GQ      EIK ILEK V   RKDWS RLDEALWAYRTA+KTP+                    
Subjt:  LVSDEGTHFVNNILTKLLAKYGIKHRI-APYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPL--------------------

Query:  -----------------GEIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFIKGQKV
                         GE R+LQLNEL+EFR  +YEN K+YKEKTK WHDKKI  + F  GQ V
Subjt:  -----------------GEIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFIKGQKV

A0A5H2XVG6 Transposable element protein1.4e-24337.53Show/hide
Query:  GSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWE---GLRSFC--ESVPTMVYLIRYNKKVKSVLEV-------DGVSTIR---ADLA
        GS+ TW  + + FL KYF   +   LR +I  F Q +DE+F EAWE   GL + C    +P  + + R + K K+  E        +G  T R     ++
Subjt:  GSIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWE---GLRSFC--ESVPTMVYLIRYNKKVKSVLEV-------DGVSTIR---ADLA

Query:  MIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYC-GEDHNYEFCPSNPASVFLVGNQR-----------NNPYSNFYNPGWRNHPNFSWGDAAIQSN
         +   L  +  ++ + P              E C  C   DH    CP   A    V  Q            N+PYS  YNPGWRNHPNFSW +   Q+N
Subjt:  MIANALKNVTVISHQQPPAMEPTAVVNQVTDEACVYC-GEDHNYEFCPSNPASVFLVGNQR-----------NNPYSNFYNPGWRNHPNFSWGDAAIQSN

Query:  -------------------------------QASMRALELQVGQLANELKARPQGKLPSDT-----------EHPRREGCWGSNK---------------
                                       QA++  LE+Q+GQ+A E+  R  GK PS T            H  R G    NK               
Subjt:  -------------------------------QASMRALELQVGQLANELKARPQGKLPSDT-----------EHPRREGCWGSNK---------------

Query:  ---------------NAGASGSVPDVEPP---------------------YVPPPPYVPPLPFPQRQKPKNQDE--------------------------
                       +  A  +  D + P                     Y+PPPPY+PP+PFP R K  NQD+                          
Subjt:  ---------------NAGASGSVPDVEPP---------------------YVPPPPYVPPLPFPQRQKPKNQDE--------------------------

Query:  ----------------------------------WATPKAKDPGSFTIHVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRS---
                                             PK +DPGSF I ++IG K +GRA+ DLGASIN+MP SVY+ LG+   + T++ L+LAD S   
Subjt:  ----------------------------------WATPKAKDPGSFTIHVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRS---

Query:  ------------------------------------------SIFGTGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPD-EMEDCSFIRILESTVIETT
                                                      T   +I V+ G L+M V  E V+F VF A+  P   ++ C  I +++  V    
Subjt:  ------------------------------------------SIFGTGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPD-EMEDCSFIRILESTVIETT

Query:  IQDSADKHSE-------------------------------------------------------------------------------------KHGE-
        +Q  ++   E                                                                                      H E 
Subjt:  IQDSADKHSE-------------------------------------------------------------------------------------KHGE-

Query:  -------QYRKAIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVLKKGGVTVMLDRLASQ
               +++ A+GWT+ADI+GISPS CMH+I ++E S  S + QRRLNP MKEVV+ EV+K LD GIIYPI+DS WVSPVQ V KK G+TV+ +     
Subjt:  -------QYRKAIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVLKKGGVTVMLDRLASQ

Query:  AYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMSFHLCNAPATFQR-----------------------YCRKAFETLKAALISAPILCAPNWN
             +   +GYNQI IAPEDQEKTTFTCP+GTFA+RRM F LCNAPATFQR                        C +AF TLK  L SAPI+ AP+W+
Subjt:  AYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMSFHLCNAPATFQR-----------------------YCRKAFETLKAALISAPILCAPNWN

Query:  LPFEVMCDASDAAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTENELLAVVFAFEKFRPYLVGSKVTVFTDHATIRYLMTKKDAKPRLIHWVLLLQE
        LPFE+MCDASD A+GA+LGQK+ K  H I+YAS+ LN+AQ+NY+TTE ELLAVVFA EKFRPYLVGSKV V++DHA +RYL+TKKDAKPRLI W+LLLQE
Subjt:  LPFEVMCDASDAAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTENELLAVVFAFEKFRPYLVGSKVTVFTDHATIRYLMTKKDAKPRLIHWVLLLQE

Query:  FDLEIKDKKGSENVIADHLSRLDPSSSLLEQYAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDKPFMYKQCS
        FDLEI+DKKG ENV+ADHLSR+           ++++FPDEQL+  +VK     PWYAD  N+L  GV   D+ ++ KKKF    K + WD+PF++K C 
Subjt:  FDLEIKDKKGSENVIADHLSRLDPSSSLLEQYAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDKPFMYKQCS

Query:  DGIRKGIKVPTQRKRIDWTLRLGPNGPADQKLQRYIFSVSTDLDQYRLVNPSRVSVTPSGSNYRFTPGTSLYPLYISPFGGHFSGQRTAMRILHCGFFWP
        D + +   VP +                    Q  I   S +L                                    GGHF  ++TA++IL  GFFWP
Subjt:  DGIRKGIKVPTQRKRIDWTLRLGPNGPADQKLQRYIFSVSTDLDQYRLVNPSRVSVTPSGSNYRFTPGTSLYPLYISPFGGHFSGQRTAMRILHCGFFWP

Query:  TLFKDAHWFFKQCDACQRRGNLGPRDEMSLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRA
        TLFKDA  F  +CD CQR GN+  R+E+ L  IL VELFDVWGIDFMGPFP S G  +IL+AVDYV KWVEAIA   +D K V +FL+ +IF RFGTPRA
Subjt:  TLFKDAHWFFKQCDACQRRGNLGPRDEMSLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRA

Query:  LVSDEGTHFVNNILTKLLAKYGIKHRIA-PYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPL--------------------
        ++SD G+HF N +   L+ KY I HR++ PYHPQ +GQ EISNREIK ILEKVV+ +RKDW+ +L++ALWAYRTAYKTP+                    
Subjt:  LVSDEGTHFVNNILTKLLAKYGIKHRIA-PYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPL--------------------

Query:  -----------------GEIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFIKGQ
                         G +R  QLNELEE R  SYENAK+YKE+TK +HD+ I+ KEF KG+
Subjt:  -----------------GEIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFIKGQ

A0A6P8CBX2 Reverse transcriptase5.7e-24535.23Show/hide
Query:  SIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWEGLRSFCESVP---------TMVYLIRYNKKVKSVLEV-----------DGVSTIR
        SI TW +L+  FL ++FPP R A+LR+EI  F +   E+  EAWE  +      P           V+ +  +  ++S+++            D  S + 
Subjt:  SIRTWDELAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWEGLRSFCESVP---------TMVYLIRYNKKVKSVLEV-----------DGVSTIR

Query:  ADLAMIAN------------ALKNVTVISHQQPPAMEPTAVVNQVTD---------EACVYCGEDHNYEFCPS-NPAS------VFLVGN-QRNN--PYS
         ++A  A+            ++ ++  I++        T  V+++T            C  C   H+   C S NP++      V  V N QR+N  PYS
Subjt:  ADLAMIAN------------ALKNVTVISHQQPPAMEPTAVVNQVTD---------EACVYCGEDHNYEFCPS-NPAS------VFLVGN-QRNN--PYS

Query:  NFYNPGWRNHPNFSW-----------------------------------------GDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDT-EHPR-
        N YNPGWRNHPNFSW                                          D  +Q+ QA++R LE Q+ Q++ +L  RP G LPS+T E+P+ 
Subjt:  NFYNPGWRNHPNFSW-----------------------------------------GDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDT-EHPR-

Query:  ------REG--CWGSNKNAGASGSVPD-------VEPP---YVPPPPYVPPLPFPQRQKPKNQD------------------------------------
              R G      N+ A      P+       VE P    +   PYVPP+PFP R K +  D                                    
Subjt:  ------REG--CWGSNKNAGASGSVPD-------VEPP---YVPPPPYVPPLPFPQRQKPKNQD------------------------------------

Query:  -------EWATP--------------------KAKDPGSFTIHVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRS---------
               + + P                    K +D GSFT+  +IG       L D GASINLMPLS++RKLG+GE + T +TLQLADRS         
Subjt:  -------EWATP--------------------KAKDPGSFTIHVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRS---------

Query:  --------------------------------SIFGTGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IET
                                            TG+ALIDV++G+LT+RV NE++ FNV+ A+K  D+ + C  I I++  +            +E+
Subjt:  --------------------------------SIFGTGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV------------IET

Query:  TIQD-----SADKHSEKHGE-----------------------------------------------------------------------QYRKAIGWT
         ++D       D+H E+  E                                                                       ++++AIGWT
Subjt:  TIQD-----SADKHSEKHGE-----------------------------------------------------------------------QYRKAIGWT

Query:  LADIQGISPSFCMHKITLDEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVLKKGGVTV------------------------
        +ADI+GISP  C H+I L+      ++ QRRLNP +KEVVKKEV+K LDAGIIYPI+DS WVSPVQ V KKGG+TV                        
Subjt:  LADIQGISPSFCMHKITLDEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVLKKGGVTV------------------------

Query:  ------------------MLDRLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMSFHLCNAPATFQR----------------------
                          ML++LA   YYCFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAFRRM F LCNAPATFQR                      
Subjt:  ------------------MLDRLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMSFHLCNAPATFQR----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------YCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTENELLAVVFAFEKFRPYL
                    C +AF  LK  L SAP++ APNW LPFE+MCDASD AVGA+LGQ++GK  H IYYAS+ LNEAQ NY TTE ELLAV+FA +KFRPYL
Subjt:  -----------YCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTENELLAVVFAFEKFRPYL

Query:  VGSKVTVFTDHATIRYLMTKKDAKPRLIHWVLLLQEFDLEIKDKKGSENVIADHLSRLDPSSSLLEQYAISDSFPDEQLFAVEVKVVRDVPWYADIANFL
        +GSK+ V+TDHA ++YL  K DAKPRLI W+LLLQEFDLEI+D KG+ENV+ADHLSRL+  S  L+   I++ FPDEQL   E   ++ +PWYADI N++
Subjt:  VGSKVTVFTDHATIRYLMTKKDAKPRLIHWVLLLQEFDLEIKDKKGSENVIADHLSRLDPSSSLLEQYAISDSFPDEQLFAVEVKVVRDVPWYADIANFL

Query:  VKGVTPIDMDWRQKKKFKHDAKFFYWDKPFMYKQCSDGIRKGIKVPTQRKRIDWTLRLGPNGPADQKLQRYIFSVSTDLDQYRLVNPSRVSVTPSGSNYR
        V  +TP  +  +QKKKF HD K+++WD+P+++K C                            ADQ ++R +     + +Q  ++               
Subjt:  VKGVTPIDMDWRQKKKFKHDAKFFYWDKPFMYKQCSDGIRKGIKVPTQRKRIDWTLRLGPNGPADQKLQRYIFSVSTDLDQYRLVNPSRVSVTPSGSNYR

Query:  FTPGTSLYPLYISPFGGHFSGQRTAMRILHCGFFWPTLFKDAHWFFKQCDACQRRGNLGPRDEMSLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVD
                  +    GGHF  +RTA +IL CGF+WP +F D   +   C  CQR GN+  R E+    IL +ELFDVWGIDFMGPFP S  N +IL+AVD
Subjt:  FTPGTSLYPLYISPFGGHFSGQRTAMRILHCGFFWPTLFKDAHWFFKQCDACQRRGNLGPRDEMSLTYILEVELFDVWGIDFMGPFPPSNGNVFILLAVD

Query:  YVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIA-PYHPQANGQAEISNREIKAILEKVVHPSRKDWSFR
        YV KWVEA+A   +DA+ V RFL+ +IF+RFG PRA++SD G+HF N    KLL+KYG+ H+IA PYHPQ  GQ E+SNREIK ILEK V+ SRKDWS +
Subjt:  YVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIA-PYHPQANGQAEISNREIKAILEKVVHPSRKDWSFR

Query:  LDEALWAYRTAYKTPL-------------------------------------GEIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFIKGQK
        LD+ALWAYRTA+KTP+                                     GE R+LQLN++ E R+ +YENA++YKE+ K WHD+ I  +EF+ GQK
Subjt:  LDEALWAYRTAYKTPL-------------------------------------GEIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFIKGQK

Query:  V
        V
Subjt:  V

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein3.2e-2723.02Show/hide
Query:  KAFETLKAALISAPILCAPNWNLPFEVMCDASDAAVGAMLGQK-QGKFIHPIYYASKVLNEAQVNYTTTENELLAVVFAFEKFRPYLVGS--KVTVFTDH
        +A E +K  L+S P+L   +++    +  DASD AVGA+L QK      +P+ Y S  +++AQ+NY+ ++ E+LA++ + + +R YL  +     + TDH
Subjt:  KAFETLKAALISAPILCAPNWNLPFEVMCDASDAAVGAMLGQK-QGKFIHPIYYASKVLNEAQVNYTTTENELLAVVFAFEKFRPYLVGS--KVTVFTDH

Query:  ATI--RYLMTKKDAKPRLIHWVLLLQEFDLEIKDKKGSENVIADHLSRLDPSSSLLEQYAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDM
          +  R     +    RL  W L LQ+F+ EI  + GS N IAD LSR+   +  + + +  +S          +  V  +    D  N +V        
Subjt:  ATI--RYLMTKKDAKPRLIHWVLLLQEFDLEIKDKKGSENVIADHLSRLDPSSSLLEQYAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDM

Query:  DWRQKKKFKHDAKFFYWDKPFMYKQCSDGIRKGIKVPTQRKRIDWTLRLGPNGPADQKLQRYIFSVSTDLDQYRLVNPSRVSVTPSGSNYRFTPGTSLYP
              ++ +D K                      +  + KR++  ++L      + K            DQ  L N ++++ T     +    G  ++P
Subjt:  DWRQKKKFKHDAKFFYWDKPFMYKQCSDGIRKGIKVPTQRKRIDWTLRLGPNGPADQKLQRYIFSVSTDLDQYRLVNPSRVSVTPSGSNYRFTPGTSLYP

Query:  LYISPFGGHFSGQRTAMRILHCGFFWPTLFKDAHWFFKQCDACQRRGNLGPRDEMSLTYILEVEL-FDVWGIDFMGPFPPSNGNVFILLAVDYVFKWVEA
                   G      I+   F W  + K    + + C  CQ   +   +    L  I   E  ++   +DF+   P S+G   + + VD   K    
Subjt:  LYISPFGGHFSGQRTAMRILHCGFFWPTLFKDAHWFFKQCDACQRRGNLGPRDEMSLTYILEVEL-FDVWGIDFMGPFPPSNGNVFILLAVDYVFKWVEA

Query:  IACHQS-DAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIA-PYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWA
        + C +S  A+  AR     + A FG P+ +++D    F +        KY    + + PY PQ +GQ E +N+ ++ +L  V       W   +     +
Subjt:  IACHQS-DAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIA-PYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWA

Query:  YRTA
        Y  A
Subjt:  YRTA

P0CT41 Transposon Tf2-12 polyprotein3.2e-2723.02Show/hide
Query:  KAFETLKAALISAPILCAPNWNLPFEVMCDASDAAVGAMLGQK-QGKFIHPIYYASKVLNEAQVNYTTTENELLAVVFAFEKFRPYLVGS--KVTVFTDH
        +A E +K  L+S P+L   +++    +  DASD AVGA+L QK      +P+ Y S  +++AQ+NY+ ++ E+LA++ + + +R YL  +     + TDH
Subjt:  KAFETLKAALISAPILCAPNWNLPFEVMCDASDAAVGAMLGQK-QGKFIHPIYYASKVLNEAQVNYTTTENELLAVVFAFEKFRPYLVGS--KVTVFTDH

Query:  ATI--RYLMTKKDAKPRLIHWVLLLQEFDLEIKDKKGSENVIADHLSRLDPSSSLLEQYAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDM
          +  R     +    RL  W L LQ+F+ EI  + GS N IAD LSR+   +  + + +  +S          +  V  +    D  N +V        
Subjt:  ATI--RYLMTKKDAKPRLIHWVLLLQEFDLEIKDKKGSENVIADHLSRLDPSSSLLEQYAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDM

Query:  DWRQKKKFKHDAKFFYWDKPFMYKQCSDGIRKGIKVPTQRKRIDWTLRLGPNGPADQKLQRYIFSVSTDLDQYRLVNPSRVSVTPSGSNYRFTPGTSLYP
              ++ +D K                      +  + KR++  ++L      + K            DQ  L N ++++ T     +    G  ++P
Subjt:  DWRQKKKFKHDAKFFYWDKPFMYKQCSDGIRKGIKVPTQRKRIDWTLRLGPNGPADQKLQRYIFSVSTDLDQYRLVNPSRVSVTPSGSNYRFTPGTSLYP

Query:  LYISPFGGHFSGQRTAMRILHCGFFWPTLFKDAHWFFKQCDACQRRGNLGPRDEMSLTYILEVEL-FDVWGIDFMGPFPPSNGNVFILLAVDYVFKWVEA
                   G      I+   F W  + K    + + C  CQ   +   +    L  I   E  ++   +DF+   P S+G   + + VD   K    
Subjt:  LYISPFGGHFSGQRTAMRILHCGFFWPTLFKDAHWFFKQCDACQRRGNLGPRDEMSLTYILEVEL-FDVWGIDFMGPFPPSNGNVFILLAVDYVFKWVEA

Query:  IACHQS-DAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIA-PYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWA
        + C +S  A+  AR     + A FG P+ +++D    F +        KY    + + PY PQ +GQ E +N+ ++ +L  V       W   +     +
Subjt:  IACHQS-DAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIA-PYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWA

Query:  YRTA
        Y  A
Subjt:  YRTA

P10394 Retrovirus-related Pol polyprotein from transposon 4121.9e-3525.12Show/hide
Query:  FRRMSFHLC--NAPATFQRYCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTENELLAVVFA
        + R    LC  N P  +   C+KAF  LK+ LI+  +L  P+++  F +  DAS  A GA+L Q       P+ YAS+   + + N +TTE EL A+ +A
Subjt:  FRRMSFHLC--NAPATFQRYCRKAFETLKAALISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTENELLAVVFA

Query:  FEKFRPYLVGSKVTVFTDHATIRYLMTKKDAKPRLIHWVLLLQEFDLEIKDKKGSENVIADHLSRL------DPSSSLLE-----QYAISDSFPDEQL-F
           FRPY+ G   TV TDH  + YL +  +   +L    L L+E++  ++  KG +N +AD LSR+      D + ++L+     Q         EQL  
Subjt:  FEKFRPYLVGSKVTVFTDHATIRYLMTKKDAKPRLIHWVLLLQEFDLEIKDKKGSENVIADHLSRL------DPSSSLLE-----QYAISDSFPDEQL-F

Query:  AVEVKVVRDVP-WYADIANFLVKGVTPIDMD-----WRQKKKF--KHDAKFFYWDKPFMYKQCSD--GIRKGIKVPTQRKRIDWTLRLGPNGPADQKLQR
          + K +   P  Y  I N  V+ V  + ++     ++  KK   ++D    Y +      Q      ++ GI   +Q K   W  ++  +   D K + 
Subjt:  AVEVKVVRDVP-WYADIANFLVKGVTPIDMD-----WRQKKKF--KHDAKFFYWDKPFMYKQCSD--GIRKGIKVPTQRKRIDWTLRLGPNGPADQKLQR

Query:  YIFSVSTDLDQYRLVNPSRVSVTPSGSNYRFTPGTSLYPLYISPFGGHFSGQRTAMRILHCGFFWPTLFKDAHWFFKQCDACQR-RGNLGPRDEMSLTYI
            +  +L +  L+NP       +  N        L  L+  P  G  +G    +  +   ++W  + K    + ++C  CQ+ +     +  M++T  
Subjt:  YIFSVSTDLDQYRLVNPSRVSVTPSGSNYRFTPGTSLYPLYISPFGGHFSGQRTAMRILHCGFFWPTLFKDAHWFFKQCDACQR-RGNLGPRDEMSLTYI

Query:  LEVELFDVWGIDFMGPFPPS-NGNVFILLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKH-RIAPYH
         E   FD   +D +GP P S NGN + +  +  + K++ AI      AKTVA+ +      ++G  +  ++D GT + N+I+T L     IK+     +H
Subjt:  LEVELFDVWGIDFMGPFPPS-NGNVFILLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKH-RIAPYH

Query:  PQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTA---------YKTPLGEIRML-----QLNELEEF-------RQFSYENAKMYKEKTKL
         Q  G  E S+R +   +   +   + DW   L   ++ + T          Y+   G    L     +L+ +E         ++  Y     Y    KL
Subjt:  PQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTA---------YKTPLGEIRML-----QLNELEEF-------RQFSYENAKMYKEKTKL

Query:  WHDKKIKSKE--FIKGQKVQKIVAAKIML
            K K+KE   +K + ++  V  K++L
Subjt:  WHDKKIKSKE--FIKGQKVQKIVAAKIML

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.0e-2824.66Show/hide
Query:  KAFETLKAALISAPILCAPNWNLPFEVMCDASDAAVGAMLGQ--KQGKFIHPIYYASKVLNEAQVNYTTTENELLAVVFAFEKFRPYLVGSKVTVFTDHA
        KA E LKAAL ++P+L   N    + +  DAS   +GA+L +   + K +  + Y SK L  AQ NY   E ELL ++ A   FR  L G   T+ TDH 
Subjt:  KAFETLKAALISAPILCAPNWNLPFEVMCDASDAAVGAMLGQ--KQGKFIHPIYYASKVLNEAQVNYTTTENELLAVVFAFEKFRPYLVGSKVTVFTDHA

Query:  TIRYLMTKKDAKPRLIHWVLLLQEFDLEIKDKKGSENVIADHLSR----LDPSSSL-LEQYAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPI
        ++  L  K +   R+  W+  L  +D  ++   G +NV+AD +SR    + P +S  ++  +    +  + L +  +  ++++  +          VTP 
Subjt:  TIRYLMTKKDAKPRLIHWVLLLQEFDLEIKDKKGSENVIADHLSR----LDPSSSL-LEQYAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPI

Query:  DMDWRQKKKFKHDAKFFYWDKPFMYKQCSDGIRKGIKVPTQRKRIDWTLRLGPNGPADQKLQRYIFSVSTDLDQYRLVNPSRVSVTPSGSNYRFTPGTSL
        DM                                     + +K+++ +     N   + ++  Y         Q RLV P    +    +  R     +L
Subjt:  DMDWRQKKKFKHDAKFFYWDKPFMYKQCSDGIRKGIKVPTQRKRIDWTLRLGPNGPADQKLQRYIFSVSTDLDQYRLVNPSRVSVTPSGSNYRFTPGTSL

Query:  YPLYISPFGGHFSGQRTAMRILHCGFFWPTLFKDAHWFFKQCDACQRRGNLGPR--DEMSLTYILEVELFDVWGIDFMGPFPPSNGNV-FILLAVDYVFK
               FGGHF    T  +I    ++WP L      + + C  CQ   +  PR    +    I E    D+  +DF+   PP++ N+  IL+ VD   K
Subjt:  YPLYISPFGGHFSGQRTAMRILHCGFFWPTLFKDAHWFFKQCDACQRRGNLGPR--DEMSLTYILEVELFDVWGIDFMGPFPPSNGNV-FILLAVDYVFK

Query:  WVEAIACHQS-DAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRI-APYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDE
            IA  ++ DA  +   L  +IF+  G PR + SD       +   +L  + GIK  + +  HPQ +GQ+E + + +  +L   V  + ++W   L +
Subjt:  WVEAIACHQS-DAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRI-APYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDE

Query:  ALWAYRTAYKTPLGE
          + Y +     LG+
Subjt:  ALWAYRTAYKTPLGE

Q9UR07 Transposon Tf2-11 polyprotein3.2e-2723.02Show/hide
Query:  KAFETLKAALISAPILCAPNWNLPFEVMCDASDAAVGAMLGQK-QGKFIHPIYYASKVLNEAQVNYTTTENELLAVVFAFEKFRPYLVGS--KVTVFTDH
        +A E +K  L+S P+L   +++    +  DASD AVGA+L QK      +P+ Y S  +++AQ+NY+ ++ E+LA++ + + +R YL  +     + TDH
Subjt:  KAFETLKAALISAPILCAPNWNLPFEVMCDASDAAVGAMLGQK-QGKFIHPIYYASKVLNEAQVNYTTTENELLAVVFAFEKFRPYLVGS--KVTVFTDH

Query:  ATI--RYLMTKKDAKPRLIHWVLLLQEFDLEIKDKKGSENVIADHLSRLDPSSSLLEQYAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDM
          +  R     +    RL  W L LQ+F+ EI  + GS N IAD LSR+   +  + + +  +S          +  V  +    D  N +V        
Subjt:  ATI--RYLMTKKDAKPRLIHWVLLLQEFDLEIKDKKGSENVIADHLSRLDPSSSLLEQYAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDM

Query:  DWRQKKKFKHDAKFFYWDKPFMYKQCSDGIRKGIKVPTQRKRIDWTLRLGPNGPADQKLQRYIFSVSTDLDQYRLVNPSRVSVTPSGSNYRFTPGTSLYP
              ++ +D K                      +  + KR++  ++L      + K            DQ  L N ++++ T     +    G  ++P
Subjt:  DWRQKKKFKHDAKFFYWDKPFMYKQCSDGIRKGIKVPTQRKRIDWTLRLGPNGPADQKLQRYIFSVSTDLDQYRLVNPSRVSVTPSGSNYRFTPGTSLYP

Query:  LYISPFGGHFSGQRTAMRILHCGFFWPTLFKDAHWFFKQCDACQRRGNLGPRDEMSLTYILEVEL-FDVWGIDFMGPFPPSNGNVFILLAVDYVFKWVEA
                   G      I+   F W  + K    + + C  CQ   +   +    L  I   E  ++   +DF+   P S+G   + + VD   K    
Subjt:  LYISPFGGHFSGQRTAMRILHCGFFWPTLFKDAHWFFKQCDACQRRGNLGPRDEMSLTYILEVEL-FDVWGIDFMGPFPPSNGNVFILLAVDYVFKWVEA

Query:  IACHQS-DAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIA-PYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWA
        + C +S  A+  AR     + A FG P+ +++D    F +        KY    + + PY PQ +GQ E +N+ ++ +L  V       W   +     +
Subjt:  IACHQS-DAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIA-PYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWA

Query:  YRTA
        Y  A
Subjt:  YRTA

Arabidopsis top hitse value%identityAlignment
ATMG00750.1 GAG/POL/ENV polyprotein1.5e-1664.29Show/hide
Query:  ILHCGFFWPTLFKDAHWFFKQCDACQRRGNLGPRDEMSLTYILEVELFDVWGIDFM
        +L  GF+WPT FKDAH F   CDACQR+GN   R+EM   +ILEVE+FDVWGI FM
Subjt:  ILHCGFFWPTLFKDAHWFFKQCDACQRRGNLGPRDEMSLTYILEVELFDVWGIDFM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCGATCCGCCTGGGGTACGGTTTGAGCTTGATCCAGAAATCGAAAGGACATTCAAGTCCAGCGAACCCCCAGCAGAACCGTTGCTGCAGCAAAACCCGCTGTTTGA
GCAAAATGAGCAACAAAATAATCAGGCTGAGAATCCTATCTTGATAGCGAACGATAGGACCAGAGCCATTCGAGCATATGCTGTCCCGATGTTTAATGAGTTGAATCTAG
GGATTGCACGTCCCCAAATCCAAGCGGCAAATTTTGAAATGAAACCGGTAATGTTTCAGATGTTGCAAACCGTGGGGCAATTCCATGGATCAATTAGGACGTGGGATGAG
TTAGCTGAAAATTTTTTGAGTAAATATTTCCCACCTAATAGAAATGCTAAATTGAGGAGTGAAATAGTAGGGTTTAGGCAACTTGAGGATGAGACTTTTAGTGAGGCTTG
GGAAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCATGGTTTACCTCATTAGGTACAATAAAAAGGTTAAGAGTGTGTTAGAGGTTGATGGTGTGTCCACCATTAGGG
CTGATCTTGCTATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAACAGCCACCAGCTATGGAGCCTACTGCAGTGGTGAACCAGGTCACAGATGAAGCA
TGTGTGTATTGTGGTGAAGACCACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTTTTGGTAGGTAATCAGAGGAACAACCCTTATTCTAACTTCTATAATCC
AGGTTGGCGCAACCACCCCAACTTCTCATGGGGAGATGCCGCAATTCAAAGTAATCAAGCTTCGATGAGGGCCCTGGAATTGCAAGTGGGTCAGCTAGCTAATGAGCTAA
AGGCGAGGCCTCAAGGGAAACTTCCATCAGATACTGAACACCCTCGAAGGGAAGGGTGCTGGGGCAGCAATAAAAATGCTGGAGCATCTGGTTCTGTGCCAGATGTAGAA
CCACCATACGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAAGCCTAAGAATCAGGATGAATGGGCTACCCCCAAGGCTAAGGATCCAGGGTC
ATTTACCATACATGTGTCTATAGGTGGAAAAGAATTAGGTAGAGCACTCTGTGATTTAGGCGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGAAAGTTGGGTATTG
GTGAAGCTAGGCCTACCACAGTCACACTCCAATTAGCTGATAGGTCGTCCATTTTTGGTACTGGTAGGGCGTTAATAGATGTTCAAAAAGGGGAATTAACAATGAGAGTC
TGTAATGAGGAAGTGAAATTTAATGTGTTTAAAGCCATGAAATATCCAGACGAAATGGAGGATTGCTCATTCATTAGGATTCTGGAGAGCACAGTTATTGAGACAACAAT
ACAGGATTCGGCTGATAAGCATTCGGAAAAGCATGGAGAGCAATACCGCAAAGCTATAGGTTGGACATTGGCTGATATTCAGGGAATTAGCCCATCTTTTTGTATGCACA
AAATCACTCTAGATGAGGGATCCTTTAGGAGTATTGAGCAACAAAGAAGGCTTAACCCTGCAATGAAAGAGGTTGTTAAAAAGGAGGTAATTAAATGGTTGGATGCTGGG
ATCATCTATCCAATTGCCGATAGCAATTGGGTAAGCCCTGTCCAATGTGTTCTTAAGAAAGGAGGTGTCACTGTGATGTTGGATCGATTGGCTAGTCAGGCCTACTACTG
TTTCTTGGATGGTTATTCTGGGTATAACCAGATTACTATTGCTCCTGAGGATCAGGAGAAAACCACTTTCACCTGTCCCTATGGGACATTCGCTTTTAGGAGAATGTCTT
TCCACCTCTGCAATGCTCCAGCAACATTTCAACGGTATTGTAGGAAGGCATTTGAGACTTTAAAGGCTGCTTTAATCTCAGCACCCATTCTTTGCGCACCTAATTGGAAT
TTACCATTTGAGGTAATGTGTGATGCGAGTGATGCTGCGGTAGGTGCTATGCTGGGGCAAAAGCAGGGAAAATTTATCCATCCTATATACTATGCAAGCAAGGTTTTAAA
TGAGGCACAAGTCAACTATACAACTACTGAAAATGAGTTGTTAGCTGTGGTGTTTGCTTTTGAGAAATTCCGACCATATTTGGTTGGATCCAAAGTCACGGTTTTCACGG
ATCATGCAACAATAAGGTACTTAATGACTAAGAAGGATGCAAAGCCTAGACTAATTCATTGGGTTTTACTATTGCAGGAATTCGATTTGGAAATAAAGGATAAGAAGGGA
TCAGAAAATGTCATTGCAGATCATTTATCTCGTCTTGATCCGTCATCATCTTTGCTGGAGCAATATGCCATTTCCGATTCTTTTCCAGATGAACAGCTTTTTGCTGTTGA
GGTAAAGGTAGTTAGGGATGTCCCTTGGTATGCTGATATTGCAAACTTTTTGGTAAAGGGAGTCACTCCTATTGACATGGATTGGAGGCAGAAGAAAAAGTTTAAGCATG
ATGCGAAATTTTTCTATTGGGATAAGCCATTTATGTATAAGCAATGCTCTGACGGTATTAGGAAGGGAATAAAAGTCCCCACGCAGCGGAAGCGCATCGATTGGACCTTA
CGTCTGGGACCTAATGGACCTGCAGATCAGAAGCTCCAACGATACATATTTTCTGTGTCCACGGATCTAGACCAATATCGACTAGTTAATCCTTCACGAGTGTCTGTAAC
ACCATCTGGGTCAAATTACCGTTTTACCCCTGGGACATCCTTATACCCATTATACATTTCGCCGTTTGGAGGTCATTTCAGCGGTCAGAGGACAGCTATGAGGATTTTGC
ATTGCGGATTTTTCTGGCCTACGTTATTCAAGGATGCCCATTGGTTCTTCAAGCAATGTGATGCTTGCCAAAGGAGAGGAAATTTGGGGCCTAGAGATGAAATGTCTCTT
ACTTACATTTTAGAAGTTGAATTATTCGATGTATGGGGTATTGACTTTATGGGGCCATTTCCCCCTTCTAATGGCAATGTTTTTATCTTATTGGCAGTTGATTATGTGTT
CAAGTGGGTGGAGGCCATTGCATGCCATCAGAGTGATGCCAAGACAGTAGCAAGGTTTCTTCAATCACACATTTTTGCGCGGTTTGGGACACCTAGGGCTCTAGTGAGTG
ATGAGGGTACACATTTTGTTAATAATATCTTAACTAAGCTTTTAGCTAAGTATGGGATTAAGCATAGGATAGCCCCTTATCACCCACAAGCAAATGGTCAAGCTGAAATT
AGTAATAGGGAAATTAAAGCTATTTTAGAGAAAGTAGTCCATCCATCTAGAAAGGATTGGTCCTTTAGGTTGGATGAGGCTCTTTGGGCTTATAGGACAGCTTATAAGAC
TCCTCTAGGAGAAATAAGAATGCTGCAGCTTAATGAGTTAGAGGAATTTCGTCAATTTTCTTACGAAAATGCGAAAATGTATAAAGAAAAGACTAAGCTGTGGCATGACA
AGAAAATTAAATCCAAGGAGTTTATCAAGGGTCAAAAAGTTCAGAAGATTGTTGCAGCAAAGATAATGCTAGAGCAAATTTCTCGTACGAATAGGAAGGATTTTAGTGAA
GGGAATACTGAGAAGGATCATGAGACAGAGGCCGAAAGACAAGTCGTTGGTAAGCCTGACGAGGAGAAAACACCGGAGCTGGAGATGCAGGTTGAAGTCATTATGCCTGA
ATCGCCCAAGCGCCGCCGCATCAAGAGGAAGGCGGGTCGCCTGAGGGTGATTCAGAACACTCCGTCGCCTCCGACGTCGGACTCTGAGGAAGAAAAAAGGGAAGCTGAGA
ATAAGGAAAAAGAAGAAGAGGCAAGAAAGACAGAAGAAGAGCGTTTGCGCGAACAGAGAGAAAGCAAGGGCAAAGGAATTGTCGAAGCATCGGGAGAGATTGAGGAGCCG
AGGGCACCATTCATTCGCTTCGTCAATGAACTTGCTAGAGCAAAATATCAGGAAGTACTGAAGCGGGATTTCTTGTTTGAGCGAGGATTTGCCAGTGATCTGCCCAGGTT
TTTGGAATCTGGAATAGCAAGCCTGGGGTGGAGGCAGTTTTGTGCAAAGCCTGATCCTGTCAATGCCAACATCGTTCGGGAATTCTACGCCAATCTTGATGTGAAGGATG
ATTTTGAAGTCATAGTACGAGGAGTGCCTGTCCAATGGAGCCCAGAGGCCATCAATAATTTGTTTGATCTTCAGGACTTTCCACACGCAGTTTTCAATGAAATGATGGTT
GCTCCATCGAGCGACCAATTAAATGCGACGGTCCGAGAGAGGGAAGCTAATGCTTGGATGGGCTTCATTAGACTCCGCTTGCTGCTGACAACGCACGACTCCACTATATC
TCGGGACAGAGTGTTGCTTACTTTTGCCATCCTTCGATCAATGAGCATAGATGTTGGAAAAATAATTGCTACTGAGATTGCTGACTGTTGGCGCAAGAAGGTGGGGAAGC
TGTTCTTTCCAAACACGATTACAATGTTATGCAACAGGGTAGGAGTGCCCACGGTTCCAGAAGATATGATCATGCTTGATAAGGGAATCATTGACACTCCAATAATTCAA
CTTGAGGCAAGGGTATACTGCACCATAAAGTGGGTCATCCCATGCTTAAGAGTTTATGACTTAGAGCTTGGTTTTGCAGAGTGCTCAGGTATAACTCTTCTTCCTCTAGA
AGCTTTTCCAACTCCCCTTCCTTATCCAACATTTCTCTCCTCCATTGGGGACACTGCTGAGAAGACAAACCCAAGCTCATGGATGAGCCCAAATGGATTTCAGGTCCAAG
CCCATGAAGGGGTCCAAATGCAACCAAGCTCAAGTCCAAGCCCACGAAGGTCTTTGGGAAACTCTATAAATAGGAGGCATAACCTCTCATTTTTGGGGGGCAGAAATAAA
GGAAGAAGAGAAAGCTCTCAGAAAACTCTGAAGATTGAAGCTCTGTCTGCAGAATCGAAGATCGAAGCTCAGATCCTCATCAAATTCAGCTGCAAAACTAAAGAACAAGG
CTCCGATCATCTTCAAATCCTGCTGCAGAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCGATCCGCCTGGGGTACGGTTTGAGCTTGATCCAGAAATCGAAAGGACATTCAAGTCCAGCGAACCCCCAGCAGAACCGTTGCTGCAGCAAAACCCGCTGTTTGA
GCAAAATGAGCAACAAAATAATCAGGCTGAGAATCCTATCTTGATAGCGAACGATAGGACCAGAGCCATTCGAGCATATGCTGTCCCGATGTTTAATGAGTTGAATCTAG
GGATTGCACGTCCCCAAATCCAAGCGGCAAATTTTGAAATGAAACCGGTAATGTTTCAGATGTTGCAAACCGTGGGGCAATTCCATGGATCAATTAGGACGTGGGATGAG
TTAGCTGAAAATTTTTTGAGTAAATATTTCCCACCTAATAGAAATGCTAAATTGAGGAGTGAAATAGTAGGGTTTAGGCAACTTGAGGATGAGACTTTTAGTGAGGCTTG
GGAAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCATGGTTTACCTCATTAGGTACAATAAAAAGGTTAAGAGTGTGTTAGAGGTTGATGGTGTGTCCACCATTAGGG
CTGATCTTGCTATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAACAGCCACCAGCTATGGAGCCTACTGCAGTGGTGAACCAGGTCACAGATGAAGCA
TGTGTGTATTGTGGTGAAGACCACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTTTTGGTAGGTAATCAGAGGAACAACCCTTATTCTAACTTCTATAATCC
AGGTTGGCGCAACCACCCCAACTTCTCATGGGGAGATGCCGCAATTCAAAGTAATCAAGCTTCGATGAGGGCCCTGGAATTGCAAGTGGGTCAGCTAGCTAATGAGCTAA
AGGCGAGGCCTCAAGGGAAACTTCCATCAGATACTGAACACCCTCGAAGGGAAGGGTGCTGGGGCAGCAATAAAAATGCTGGAGCATCTGGTTCTGTGCCAGATGTAGAA
CCACCATACGTGCCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAAGCCTAAGAATCAGGATGAATGGGCTACCCCCAAGGCTAAGGATCCAGGGTC
ATTTACCATACATGTGTCTATAGGTGGAAAAGAATTAGGTAGAGCACTCTGTGATTTAGGCGCAAGCATTAACCTTATGCCTCTTTCGGTCTATCGAAAGTTGGGTATTG
GTGAAGCTAGGCCTACCACAGTCACACTCCAATTAGCTGATAGGTCGTCCATTTTTGGTACTGGTAGGGCGTTAATAGATGTTCAAAAAGGGGAATTAACAATGAGAGTC
TGTAATGAGGAAGTGAAATTTAATGTGTTTAAAGCCATGAAATATCCAGACGAAATGGAGGATTGCTCATTCATTAGGATTCTGGAGAGCACAGTTATTGAGACAACAAT
ACAGGATTCGGCTGATAAGCATTCGGAAAAGCATGGAGAGCAATACCGCAAAGCTATAGGTTGGACATTGGCTGATATTCAGGGAATTAGCCCATCTTTTTGTATGCACA
AAATCACTCTAGATGAGGGATCCTTTAGGAGTATTGAGCAACAAAGAAGGCTTAACCCTGCAATGAAAGAGGTTGTTAAAAAGGAGGTAATTAAATGGTTGGATGCTGGG
ATCATCTATCCAATTGCCGATAGCAATTGGGTAAGCCCTGTCCAATGTGTTCTTAAGAAAGGAGGTGTCACTGTGATGTTGGATCGATTGGCTAGTCAGGCCTACTACTG
TTTCTTGGATGGTTATTCTGGGTATAACCAGATTACTATTGCTCCTGAGGATCAGGAGAAAACCACTTTCACCTGTCCCTATGGGACATTCGCTTTTAGGAGAATGTCTT
TCCACCTCTGCAATGCTCCAGCAACATTTCAACGGTATTGTAGGAAGGCATTTGAGACTTTAAAGGCTGCTTTAATCTCAGCACCCATTCTTTGCGCACCTAATTGGAAT
TTACCATTTGAGGTAATGTGTGATGCGAGTGATGCTGCGGTAGGTGCTATGCTGGGGCAAAAGCAGGGAAAATTTATCCATCCTATATACTATGCAAGCAAGGTTTTAAA
TGAGGCACAAGTCAACTATACAACTACTGAAAATGAGTTGTTAGCTGTGGTGTTTGCTTTTGAGAAATTCCGACCATATTTGGTTGGATCCAAAGTCACGGTTTTCACGG
ATCATGCAACAATAAGGTACTTAATGACTAAGAAGGATGCAAAGCCTAGACTAATTCATTGGGTTTTACTATTGCAGGAATTCGATTTGGAAATAAAGGATAAGAAGGGA
TCAGAAAATGTCATTGCAGATCATTTATCTCGTCTTGATCCGTCATCATCTTTGCTGGAGCAATATGCCATTTCCGATTCTTTTCCAGATGAACAGCTTTTTGCTGTTGA
GGTAAAGGTAGTTAGGGATGTCCCTTGGTATGCTGATATTGCAAACTTTTTGGTAAAGGGAGTCACTCCTATTGACATGGATTGGAGGCAGAAGAAAAAGTTTAAGCATG
ATGCGAAATTTTTCTATTGGGATAAGCCATTTATGTATAAGCAATGCTCTGACGGTATTAGGAAGGGAATAAAAGTCCCCACGCAGCGGAAGCGCATCGATTGGACCTTA
CGTCTGGGACCTAATGGACCTGCAGATCAGAAGCTCCAACGATACATATTTTCTGTGTCCACGGATCTAGACCAATATCGACTAGTTAATCCTTCACGAGTGTCTGTAAC
ACCATCTGGGTCAAATTACCGTTTTACCCCTGGGACATCCTTATACCCATTATACATTTCGCCGTTTGGAGGTCATTTCAGCGGTCAGAGGACAGCTATGAGGATTTTGC
ATTGCGGATTTTTCTGGCCTACGTTATTCAAGGATGCCCATTGGTTCTTCAAGCAATGTGATGCTTGCCAAAGGAGAGGAAATTTGGGGCCTAGAGATGAAATGTCTCTT
ACTTACATTTTAGAAGTTGAATTATTCGATGTATGGGGTATTGACTTTATGGGGCCATTTCCCCCTTCTAATGGCAATGTTTTTATCTTATTGGCAGTTGATTATGTGTT
CAAGTGGGTGGAGGCCATTGCATGCCATCAGAGTGATGCCAAGACAGTAGCAAGGTTTCTTCAATCACACATTTTTGCGCGGTTTGGGACACCTAGGGCTCTAGTGAGTG
ATGAGGGTACACATTTTGTTAATAATATCTTAACTAAGCTTTTAGCTAAGTATGGGATTAAGCATAGGATAGCCCCTTATCACCCACAAGCAAATGGTCAAGCTGAAATT
AGTAATAGGGAAATTAAAGCTATTTTAGAGAAAGTAGTCCATCCATCTAGAAAGGATTGGTCCTTTAGGTTGGATGAGGCTCTTTGGGCTTATAGGACAGCTTATAAGAC
TCCTCTAGGAGAAATAAGAATGCTGCAGCTTAATGAGTTAGAGGAATTTCGTCAATTTTCTTACGAAAATGCGAAAATGTATAAAGAAAAGACTAAGCTGTGGCATGACA
AGAAAATTAAATCCAAGGAGTTTATCAAGGGTCAAAAAGTTCAGAAGATTGTTGCAGCAAAGATAATGCTAGAGCAAATTTCTCGTACGAATAGGAAGGATTTTAGTGAA
GGGAATACTGAGAAGGATCATGAGACAGAGGCCGAAAGACAAGTCGTTGGTAAGCCTGACGAGGAGAAAACACCGGAGCTGGAGATGCAGGTTGAAGTCATTATGCCTGA
ATCGCCCAAGCGCCGCCGCATCAAGAGGAAGGCGGGTCGCCTGAGGGTGATTCAGAACACTCCGTCGCCTCCGACGTCGGACTCTGAGGAAGAAAAAAGGGAAGCTGAGA
ATAAGGAAAAAGAAGAAGAGGCAAGAAAGACAGAAGAAGAGCGTTTGCGCGAACAGAGAGAAAGCAAGGGCAAAGGAATTGTCGAAGCATCGGGAGAGATTGAGGAGCCG
AGGGCACCATTCATTCGCTTCGTCAATGAACTTGCTAGAGCAAAATATCAGGAAGTACTGAAGCGGGATTTCTTGTTTGAGCGAGGATTTGCCAGTGATCTGCCCAGGTT
TTTGGAATCTGGAATAGCAAGCCTGGGGTGGAGGCAGTTTTGTGCAAAGCCTGATCCTGTCAATGCCAACATCGTTCGGGAATTCTACGCCAATCTTGATGTGAAGGATG
ATTTTGAAGTCATAGTACGAGGAGTGCCTGTCCAATGGAGCCCAGAGGCCATCAATAATTTGTTTGATCTTCAGGACTTTCCACACGCAGTTTTCAATGAAATGATGGTT
GCTCCATCGAGCGACCAATTAAATGCGACGGTCCGAGAGAGGGAAGCTAATGCTTGGATGGGCTTCATTAGACTCCGCTTGCTGCTGACAACGCACGACTCCACTATATC
TCGGGACAGAGTGTTGCTTACTTTTGCCATCCTTCGATCAATGAGCATAGATGTTGGAAAAATAATTGCTACTGAGATTGCTGACTGTTGGCGCAAGAAGGTGGGGAAGC
TGTTCTTTCCAAACACGATTACAATGTTATGCAACAGGGTAGGAGTGCCCACGGTTCCAGAAGATATGATCATGCTTGATAAGGGAATCATTGACACTCCAATAATTCAA
CTTGAGGCAAGGGTATACTGCACCATAAAGTGGGTCATCCCATGCTTAAGAGTTTATGACTTAGAGCTTGGTTTTGCAGAGTGCTCAGGTATAACTCTTCTTCCTCTAGA
AGCTTTTCCAACTCCCCTTCCTTATCCAACATTTCTCTCCTCCATTGGGGACACTGCTGAGAAGACAAACCCAAGCTCATGGATGAGCCCAAATGGATTTCAGGTCCAAG
CCCATGAAGGGGTCCAAATGCAACCAAGCTCAAGTCCAAGCCCACGAAGGTCTTTGGGAAACTCTATAAATAGGAGGCATAACCTCTCATTTTTGGGGGGCAGAAATAAA
GGAAGAAGAGAAAGCTCTCAGAAAACTCTGAAGATTGAAGCTCTGTCTGCAGAATCGAAGATCGAAGCTCAGATCCTCATCAAATTCAGCTGCAAAACTAAAGAACAAGG
CTCCGATCATCTTCAAATCCTGCTGCAGAACTGA
Protein sequenceShow/hide protein sequence
MSDPPGVRFELDPEIERTFKSSEPPAEPLLQQNPLFEQNEQQNNQAENPILIANDRTRAIRAYAVPMFNELNLGIARPQIQAANFEMKPVMFQMLQTVGQFHGSIRTWDE
LAENFLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWEGLRSFCESVPTMVYLIRYNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPTAVVNQVTDEA
CVYCGEDHNYEFCPSNPASVFLVGNQRNNPYSNFYNPGWRNHPNFSWGDAAIQSNQASMRALELQVGQLANELKARPQGKLPSDTEHPRREGCWGSNKNAGASGSVPDVE
PPYVPPPPYVPPLPFPQRQKPKNQDEWATPKAKDPGSFTIHVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADRSSIFGTGRALIDVQKGELTMRV
CNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIETTIQDSADKHSEKHGEQYRKAIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRRLNPAMKEVVKKEVIKWLDAG
IIYPIADSNWVSPVQCVLKKGGVTVMLDRLASQAYYCFLDGYSGYNQITIAPEDQEKTTFTCPYGTFAFRRMSFHLCNAPATFQRYCRKAFETLKAALISAPILCAPNWN
LPFEVMCDASDAAVGAMLGQKQGKFIHPIYYASKVLNEAQVNYTTTENELLAVVFAFEKFRPYLVGSKVTVFTDHATIRYLMTKKDAKPRLIHWVLLLQEFDLEIKDKKG
SENVIADHLSRLDPSSSLLEQYAISDSFPDEQLFAVEVKVVRDVPWYADIANFLVKGVTPIDMDWRQKKKFKHDAKFFYWDKPFMYKQCSDGIRKGIKVPTQRKRIDWTL
RLGPNGPADQKLQRYIFSVSTDLDQYRLVNPSRVSVTPSGSNYRFTPGTSLYPLYISPFGGHFSGQRTAMRILHCGFFWPTLFKDAHWFFKQCDACQRRGNLGPRDEMSL
TYILEVELFDVWGIDFMGPFPPSNGNVFILLAVDYVFKWVEAIACHQSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIAPYHPQANGQAEI
SNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGEIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKIKSKEFIKGQKVQKIVAAKIMLEQISRTNRKDFSE
GNTEKDHETEAERQVVGKPDEEKTPELEMQVEVIMPESPKRRRIKRKAGRLRVIQNTPSPPTSDSEEEKREAENKEKEEEARKTEEERLREQRESKGKGIVEASGEIEEP
RAPFIRFVNELARAKYQEVLKRDFLFERGFASDLPRFLESGIASLGWRQFCAKPDPVNANIVREFYANLDVKDDFEVIVRGVPVQWSPEAINNLFDLQDFPHAVFNEMMV
APSSDQLNATVREREANAWMGFIRLRLLLTTHDSTISRDRVLLTFAILRSMSIDVGKIIATEIADCWRKKVGKLFFPNTITMLCNRVGVPTVPEDMIMLDKGIIDTPIIQ
LEARVYCTIKWVIPCLRVYDLELGFAECSGITLLPLEAFPTPLPYPTFLSSIGDTAEKTNPSSWMSPNGFQVQAHEGVQMQPSSSPSPRRSLGNSINRRHNLSFLGGRNK
GRRESSQKTLKIEALSAESKIEAQILIKFSCKTKEQGSDHLQILLQN