; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008396 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008396
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr9:20186683..20201601
RNA-Seq ExpressionLag0008396
SyntenyLag0008396
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR005162 - Retrotransposon gag domain
IPR012337 - Ribonuclease H-like superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]0.0e+0043.03Show/hide
Query:  NDRTRAIRAYDVSMFNELNPGIARPQIQAANFEMKPVMFQMLETVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSF
        N + R ++ Y   + N+   GI R  I A NFE+KP +  M++   QF G   +DP++HL  FL + D+  + GV  D +RL LFP+SLRD A+ WL S 
Subjt:  NDRTRAIRAYDVSMFNELNPGIARPQIQAANFEMKPVMFQMLETVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSF

Query:  ALGSIRTWDELAEKFLSKYFPPNRNVKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDKAY
          GSI +W ++AEKFL+K+FPP +  +LRSEI  FRQ + E+  EAWER+K+L+R CP HGLP  +Q++ FYNGLNG T+ +VDA++GG L++KT + A 
Subjt:  ALGSIRTWDELAEKFLSKYFPPNRNVKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDKAY

Query:  EILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPP-------AMEPATVVNQVTDEACVYC-GEDHNYEFCP-----
         +LE ++ N+ QW   R   KKV  + E++  + + A +A +++    V+ ++ Q+ P       A      +N+ + E   Y    ++NY   P     
Subjt:  EILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPP-------AMEPATVVNQVTDEACVYC-GEDHNYEFCP-----

Query:  --------------------------SNP---------ASVFFV----VTFRSGKSLEERIE--------------------------------PSKTQE
                                  S P         A V FV     TF+   S  + IE                                PS T E
Subjt:  --------------------------SNP---------ASVFFV----VTFRSGKSLEERIE--------------------------------PSKTQE

Query:  VNG---------------DRNNNVVEEELESGQGAGGSKKDAGASGFVPDV-----EPPYVP----PPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLH
        VN                +R+ +   E   +    G SK        V D       PP +     PP    PLP+PQR + +  D QF KFL+I K++H
Subjt:  VNG---------------DRNNNVVEEELESGQGAGGSKKDAGASGFVPDV-----EPPYVP----PPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLH

Query:  INIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTRPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPT
        INIP  +A+EQMPNYAKFLKDI++KK+RL EFETV L+EECSAI++  LP K KDPGSFT P +IG     + LCDLGASINLMPLSVYRKLG+GE + T
Subjt:  INIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTRPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPT

Query:  TVTLQLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRIL
        T++LQLAD SI YP G IEDVLVKVDKFIFP DF++LD E D++VP+ILGRPF ATGRAL+DVQKGELT+RV  EEV FN+++AMK+P++   C  + ++
Subjt:  TVTLQLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRIL

Query:  ESTVIEIALQDSADKHSEKHGEVSVEDFEF-----CSLDRKNEKELFR----CEDVFESLDLDQRKAPPIKPSLI------------EAPTLDLKPLPDH
        E  V+E   +D    H E+    S   ++F     C  D     E F       +  + +    ++   ++P +             +  T +LK LP+H
Subjt:  ESTVIEIALQDSADKHSEKHGEVSVEDFEF-----CSLDRKNEKELFR----CEDVFESLDLDQRKAPPIKPSLI------------EAPTLDLKPLPDH

Query:  LKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIDWTLADIQGISPSFCMHKITLEEGSFRIIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIAD
        L+Y +LG+S T P+IVA+ L PE EE L+++L+++R A+ WT++DI+GISPS CMHKI +EE     IE QRRLNPAMKEVV+ E++K L+AGIIY I+D
Subjt:  LKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIDWTLADIQGISPSFCMHKITLEEGSFRIIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIAD

Query:  SNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYYGYNQITIAPEDQEKTTFTCPYG
        S+WVSPVQ VPKKGG+TVV N NNE IPTRTVTGWRVCMDYR+LNKATRKDHFPLPFIDQMLDRLAG +YYCFLDGY GYNQI IAPEDQEKTTFTCPYG
Subjt:  SNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYYGYNQITIAPEDQEKTTFTCPYG

Query:  TFAFRRMPFGLCNAPTTFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSCLDNLD--------------------------------------------
        TFAFRRMPFGLCNAP TFQRCM+AIFSDM+E  +E+FMDDFSVFG SF  CL NL                                             
Subjt:  TFAFRRMPFGLCNAPTTFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSCLDNLD--------------------------------------------

Query:  ------------------------------------------------------CRKAFETLKATLISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQ
                                                              C +AF  +K  LISAP++  P+W+ PFEVMCDASD A+GA+LGQ++
Subjt:  ------------------------------------------------------CRKAFETLKATLISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQ

Query:  -----------------------------AVVFAFEKFRQYLVGSKVTVFTNHAAIR---TRRDQRMSLQIICLVL----------------IHHHL---
                                     AVVFA +KFR YL+ +KV VFT+HAA+R   +++D +  L    L+L                +  HL   
Subjt:  -----------------------------AVVFAFEKFRQYLVGSKVTVFTNHAAIR---TRRDQRMSLQIICLVL----------------IHHHL---

Query:  ----------------------CWSNLPF------------------------------------SILFQMSSFLLLGK---GDEATEILEQCHSLPYGG
                              C   LP+                                     +LF+     ++ +    +E   IL  CHS  YGG
Subjt:  ----------------------CWSNLPF------------------------------------SILFQMSSFLLLGK---GDEATEILEQCHSLPYGG

Query:  HFSGQRTTMRILHSRFFLPTLFKDSHWF------CKRF--------------------------------------------DYVSKWVEAIACHHSDAK
        HF   RT  ++L S FF P++F+DS+        C+R                                             DYVSKWVEAIA   +DAK
Subjt:  HFSGQRTTMRILHSRFFLPTLFKDSHWF------CKRF--------------------------------------------DYVSKWVEAIACHHSDAK

Query:  TVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLG
         V +FL  +IF RFGTPRA++SDEGTHF N +   LL+KYG+KH+IA  YHPQ NGQAEISNREIK ILEK V+ +RKDW+ +LD+ALWAYRTA+KTP+G
Subjt:  TVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLG

Query:  MSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKINLRSLSMSKS-FVYNSRLKLFPGKLKS
        MSPYRLV+GKACHLP+ELEHK +WA+KK N DL  AG  R+LQLNE++EFR  +YENAK+YKE+TK WHDK+I  R  +  +   ++NSRLKLFPGKL+S
Subjt:  MSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKINLRSLSMSKS-FVYNSRLKLFPGKLKS

Query:  KWSRPFIVIEVFPNGAITLQDEKDGRVFK
        +W+ P+ + +V   GAI L+D K G +F+
Subjt:  KWSRPFIVIEVFPNGAITLQDEKDGRVFK

XP_038972405.1 uncharacterized protein LOC120104748 [Phoenix dactylifera]0.0e+0042.48Show/hide
Query:  NDRTRAIRAYDVSMFNELNPGIARPQIQAANFEMKPVMFQMLETVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSF
        N   R +  Y V   N   P I RP + A NFE+KP + QM++   QF G  SEDPH HL +FL + D+  + GV  DA+RL LFP+SL+D AK+WLNS 
Subjt:  NDRTRAIRAYDVSMFNELNPGIARPQIQAANFEMKPVMFQMLETVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSF

Query:  ALGSIRTWDELAEKFLSKYFPPNRNVKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDKAY
        A  S  TW+ L++ FLSKYFPP +  KLR++I  F Q + E+  EAWERFK+L RKCPHHGLP  + ++TFYNGL    +  +DA+AGG L++K+ ++AY
Subjt:  ALGSIRTWDELAEKFLSKYFPPNRNVKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDKAY

Query:  EILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRA---DLAMIANALKNVTVIS------------HQQPPAMEPATVVN------QVTDEACVYCGE
        E+LE ++ N+ QWS+ R   KKV  + +VDG++ + A    L  +   L NV  +S            H     M+   V N      Q    +  Y   
Subjt:  EILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRA---DLAMIANALKNVTVIS------------HQQPPAMEPATVVN------QVTDEACVYCGE

Query:  DHN-----------------------YEFCPSNPAS-----VFFVVTFRSGKSLEERIEPSKTQEVNGDRNNNVVEEELES-----GQGAGGSKKDAG--
          N                       ++  PS P S     +       +     ER+E    Q  + +RN  +   +L +     GQG   SK +    
Subjt:  DHN-----------------------YEFCPSNPAS-----VFFVVTFRSGKSLEERIEPSKTQEVNGDRNNNVVEEELES-----GQGAGGSKKDAG--

Query:  -----------------------------------ASGFVPDV---EPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPN
                                            S  V D+     P  P  PYVPP+PFPQR K    D QF+KFL++ +QLHINIP  +A+ Q+P 
Subjt:  -----------------------------------ASGFVPDV---EPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPN

Query:  YAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTRPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADMSITYP
        Y KFLK+I++KK++L +FET++LTEECSAI++N LPPK +DPGSF+ P +IG  +  RALCDLGAS++LMPLSV RKLG+ E +PTT++LQLAD S+ YP
Subjt:  YAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTRPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADMSITYP

Query:  EGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIEIALQDSAD
         G +E+VL+KV KFI PVDFI+L+ E D ++PIILGRPF AT  A+IDV+ G LT++V  EEV+FN+F+A KYP   +    + +++ +  E    ++  
Subjt:  EGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIEIALQDSAD

Query:  KHSE----KHGEVSVEDFEF----CSLDR---KNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEH
        +  E      G    ++ E     C+L+    K +K     ED+        +  PP  PS ++AP L+LKPLP HL Y +LGE+ TLP+IV+  L  E 
Subjt:  KHSE----KHGEVSVEDFEF----CSLDR---KNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEH

Query:  EEALIKLLQQYRKAIDWTLADIQGISPSFCMHKITLEEGSFRIIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKNN
         + LI++L+  +KAI WT++D++GISPS CMH+I +E+    I+E QRRLNP MKEVV+ EV+KWLDAGIIYPI+DS W+SPVQ VPKKGG+TVV N+NN
Subjt:  EEALIKLLQQYRKAIDWTLADIQGISPSFCMHKITLEEGSFRIIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKNN

Query:  ELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYYGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMLA
        ELIPTRTVTGWRVC+DYR+LN  TRKDHFPLPF+DQ+L+RLAG AYYCFLDGY GYNQI+I+PEDQEKTTFTCPYGTFAFRRMPFGLCNAP TFQRCM+A
Subjt:  ELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYYGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMLA

Query:  IFSDMIESTVEVFMDDFSVFGGSFQSCLDNL---------------------------------------------------------------------
        IFSD +E  +EVFMDDFSVFG SF SCLDNL                                                                     
Subjt:  IFSDMIESTVEVFMDDFSVFGGSFQSCLDNL---------------------------------------------------------------------

Query:  -----------------------------DCRKAFETLKATLISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQ------------------------
                                     DC  AF  LK  L+SAPI+ AP+W+LPFE+MCDASD A+GA+LGQ++                        
Subjt:  -----------------------------DCRKAFETLKATLISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQ------------------------

Query:  -----AVVFAFEKFRQYLVGSKVTVFTNHAAIR---TRRDQRMSL-------------------------------------------------QIICLV
             AVVFAF+KFR YLVGSKV V+T+H+AI+    ++D +  L                                                 Q++ + 
Subjt:  -----AVVFAFEKFRQYLVGSKVTVFTNHAAIR---TRRDQRMSL-------------------------------------------------QIICLV

Query:  LIH------HHLCWSNLPFSILF-QMSSFL-----------LLGK------------GDEATEILEQCHSLPYGGHFSGQRTTMRILHSRFFLPTLFKDS
        +I       ++L    +P  + + Q   FL           LL K             DE  +IL+ CHSL  GGHFS  +T  ++  S F+ PT+++D+
Subjt:  LIH------HHLCWSNLPFSILF-QMSSFL-----------LLGK------------GDEATEILEQCHSLPYGGHFSGQRTTMRILHSRFFLPTLFKDS

Query:  HWF------CKR--------------------------------------------FDYVSKWVEAIACHHSDAKTVARFLQSHIFARFGTPRALVSDEG
          +      C+R                                             DYVSKWVEA A   +D++ V RF++ +IF+RFG PRA++SDEG
Subjt:  HWF------CKR--------------------------------------------FDYVSKWVEAIACHHSDAKTVARFLQSHIFARFGTPRALVSDEG

Query:  THFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWA
        +HF N     LL KYG+ H++A  YHPQ NGQ E++NRE+K ILEK V  SRKDW+ +LD+ALWAYRTA+KTPLGMSPYRLV+GK+CHLP+ELEH+ +WA
Subjt:  THFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWA

Query:  LKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKINLRSLSMSKS-FVYNSRLKLFPGKLKSKWSRPFIVIEVFPNGAITLQDEKDG
        +K LN DL  AG  R+LQL+ELEEFR  +YEN ++YKEKTK WHDK + +R+  + +   ++NSRLKLFPGKL+S+WS PF V +V+P GA+ ++ E  G
Subjt:  LKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKINLRSLSMSKS-FVYNSRLKLFPGKLKSKWSRPFIVIEVFPNGAITLQDEKDG

Query:  RVFKDLNLLEPYLFTN
            +   L+PYL +N
Subjt:  RVFKDLNLLEPYLFTN

XP_038973683.1 uncharacterized protein LOC120105384 [Phoenix dactylifera]0.0e+0042.48Show/hide
Query:  NDRTRAIRAYDVSMFNELNPGIARPQIQAANFEMKPVMFQMLETVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSF
        N   R +  Y V   N   P I RP + A NFE+KP + QM++   QF G  SEDPH HL +FL + D+  + GV  DA+RL LFP+SL+D AK+WLNS 
Subjt:  NDRTRAIRAYDVSMFNELNPGIARPQIQAANFEMKPVMFQMLETVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSF

Query:  ALGSIRTWDELAEKFLSKYFPPNRNVKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDKAY
        A  S  TW+ L++ FLSKYFPP +  KLR++I  F Q + E+  EAWERFK+L RKCPHHGLP  + ++TFYNGL    +  +DA+AGG L++K+ ++AY
Subjt:  ALGSIRTWDELAEKFLSKYFPPNRNVKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDKAY

Query:  EILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRA---DLAMIANALKNVTVIS------------HQQPPAMEPATVVN------QVTDEACVYCGE
        E+LE ++ N+ QWS+ R   KKV  + +VDG++ + A    L  +   L NV  +S            H     M+   V N      Q    +  Y   
Subjt:  EILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRA---DLAMIANALKNVTVIS------------HQQPPAMEPATVVN------QVTDEACVYCGE

Query:  DHN-----------------------YEFCPSNPAS-----VFFVVTFRSGKSLEERIEPSKTQEVNGDRNNNVVEEELES-----GQGAGGSKKDAG--
          N                       ++  PS P S     +       +     ER+E    Q  + +RN  +   +L +     GQG   SK +    
Subjt:  DHN-----------------------YEFCPSNPAS-----VFFVVTFRSGKSLEERIEPSKTQEVNGDRNNNVVEEELES-----GQGAGGSKKDAG--

Query:  -----------------------------------ASGFVPDV---EPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPN
                                            S  V D+     P  P  PYVPP+PFPQR K    D QF+KFL++ +QLHINIP  +A+ Q+P 
Subjt:  -----------------------------------ASGFVPDV---EPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPN

Query:  YAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTRPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADMSITYP
        Y KFLK+I++KK++L +FET++LTEECSAI++N LPPK +DPGSF+ P +IG  +  RALCDLGAS++LMPLSV RKLG+ E +PTT++LQLAD S+ YP
Subjt:  YAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTRPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADMSITYP

Query:  EGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIEIALQDSAD
         G +E+VL+KV KFI PVDFI+L+ E D ++PIILGRPF AT  A+IDV+ G LT++V  EEV+FN+F+A KYP   +    + +++ +  E    ++  
Subjt:  EGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIEIALQDSAD

Query:  KHSE----KHGEVSVEDFEF----CSLDR---KNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEH
        +  E      G    ++ E     C+L+    K +K     ED+        +  PP  PS ++AP L+LKPLP HL Y +LGE+ TLP+IV+  L  E 
Subjt:  KHSE----KHGEVSVEDFEF----CSLDR---KNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEH

Query:  EEALIKLLQQYRKAIDWTLADIQGISPSFCMHKITLEEGSFRIIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKNN
         + LI++L+  +KAI WT++D++GISPS CMH+I +E+    I+E QRRLNP MKEVV+ EV+KWLDAGIIYPI+DS W+SPVQ VPKKGG+TVV N+NN
Subjt:  EEALIKLLQQYRKAIDWTLADIQGISPSFCMHKITLEEGSFRIIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKNN

Query:  ELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYYGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMLA
        ELIPTRTVTGWRVC+DYR+LN  TRKDHFPLPF+DQ+L+RLAG AYYCFLDGY GYNQI+I+PEDQEKTTFTCPYGTFAFRRMPFGLCNAP TFQRCM+A
Subjt:  ELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYYGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMLA

Query:  IFSDMIESTVEVFMDDFSVFGGSFQSCLDNL---------------------------------------------------------------------
        IFSD +E  +EVFMDDFSVFG SF SCLDNL                                                                     
Subjt:  IFSDMIESTVEVFMDDFSVFGGSFQSCLDNL---------------------------------------------------------------------

Query:  -----------------------------DCRKAFETLKATLISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQ------------------------
                                     DC  AF  LK  L+SAPI+ AP+W+LPFE+MCDASD A+GA+LGQ++                        
Subjt:  -----------------------------DCRKAFETLKATLISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQ------------------------

Query:  -----AVVFAFEKFRQYLVGSKVTVFTNHAAIR---TRRDQRMSL-------------------------------------------------QIICLV
             AVVFAF+KFR YLVGSKV V+T+H+AI+    ++D +  L                                                 Q++ + 
Subjt:  -----AVVFAFEKFRQYLVGSKVTVFTNHAAIR---TRRDQRMSL-------------------------------------------------QIICLV

Query:  LIH------HHLCWSNLPFSILF-QMSSFL-----------LLGK------------GDEATEILEQCHSLPYGGHFSGQRTTMRILHSRFFLPTLFKDS
        +I       ++L    +P  + + Q   FL           LL K             DE  +IL+ CHSL  GGHFS  +T  ++  S F+ PT+++D+
Subjt:  LIH------HHLCWSNLPFSILF-QMSSFL-----------LLGK------------GDEATEILEQCHSLPYGGHFSGQRTTMRILHSRFFLPTLFKDS

Query:  HWF------CKR--------------------------------------------FDYVSKWVEAIACHHSDAKTVARFLQSHIFARFGTPRALVSDEG
          +      C+R                                             DYVSKWVEA A   +D++ V RF++ +IF+RFG PRA++SDEG
Subjt:  HWF------CKR--------------------------------------------FDYVSKWVEAIACHHSDAKTVARFLQSHIFARFGTPRALVSDEG

Query:  THFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWA
        +HF N     LL KYG+ H++A  YHPQ NGQ E++NRE+K ILEK V  SRKDW+ +LD+ALWAYRTA+KTPLGMSPYRLV+GK+CHLP+ELEH+ +WA
Subjt:  THFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWA

Query:  LKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKINLRSLSMSKS-FVYNSRLKLFPGKLKSKWSRPFIVIEVFPNGAITLQDEKDG
        +K LN DL  AG  R+LQL+ELEEFR  +YEN ++YKEKTK WHDK + +R+  + +   ++NSRLKLFPGKL+S+WS PF V +V+P GA+ ++ E  G
Subjt:  LKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKINLRSLSMSKS-FVYNSRLKLFPGKLKSKWSRPFIVIEVFPNGAITLQDEKDG

Query:  RVFKDLNLLEPYLFTN
            +   L+PYL +N
Subjt:  RVFKDLNLLEPYLFTN

XP_038976300.1 uncharacterized protein LOC120107204 [Phoenix dactylifera]0.0e+0042.48Show/hide
Query:  NDRTRAIRAYDVSMFNELNPGIARPQIQAANFEMKPVMFQMLETVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSF
        N   R +  Y V   N   P I RP + A NFE+KP + QM++   QF G  SEDPH HL +FL + D+  + GV  DA+RL LFP+SL+D AK+WLNS 
Subjt:  NDRTRAIRAYDVSMFNELNPGIARPQIQAANFEMKPVMFQMLETVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSF

Query:  ALGSIRTWDELAEKFLSKYFPPNRNVKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDKAY
        A  S  TW+ L++ FLSKYFPP +  KLR++I  F Q + E+  EAWERFK+L RKCPHHGLP  + ++TFYNGL    +  +DA+AGG L++K+ ++AY
Subjt:  ALGSIRTWDELAEKFLSKYFPPNRNVKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDKAY

Query:  EILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRA---DLAMIANALKNVTVIS------------HQQPPAMEPATVVN------QVTDEACVYCGE
        E+LE ++ N+ QWS+ R   KKV  + +VDG++ + A    L  + + L NV  +S            H     M+   V N      Q    +  Y   
Subjt:  EILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRA---DLAMIANALKNVTVIS------------HQQPPAMEPATVVN------QVTDEACVYCGE

Query:  DHN-----------------------YEFCPSNPAS-----VFFVVTFRSGKSLEERIEPSKTQEVNGDRNNNVVEEELES-----GQGAGGSKKDAG--
          N                       ++  PS P S     +       +     ER+E    Q  + +RN  +   +L +     GQG   SK +    
Subjt:  DHN-----------------------YEFCPSNPAS-----VFFVVTFRSGKSLEERIEPSKTQEVNGDRNNNVVEEELES-----GQGAGGSKKDAG--

Query:  -----------------------------------ASGFVPDV---EPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPN
                                            S  V D+     P  P  PYVPP+PFPQR K    D QF+KFL++ +QLHINIP  +A+ Q+P 
Subjt:  -----------------------------------ASGFVPDV---EPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPN

Query:  YAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTRPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADMSITYP
        Y KFLK+I++KK++L +FET++LTEECSAI++N LPPK +DPGSF+ P +IG  +  RALCDLGAS++LMPLSV RKLG+ E +PTT++LQLAD S+ YP
Subjt:  YAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTRPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADMSITYP

Query:  EGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIEIALQDSAD
         G +E+VL+KV KFI PVDFI+L+ E D ++PIILGRPF AT  A+IDV+ G LT++V  EEV+FN+F+A KYP   +    + +++ +  E    ++  
Subjt:  EGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIEIALQDSAD

Query:  KHSE----KHGEVSVEDFEF----CSLDR---KNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEH
        +  E      G    ++ E     C+L+    K +K     ED+        +  PP  PS ++AP L+LKPLP HL Y +LGE+ TLP+IV+  L  E 
Subjt:  KHSE----KHGEVSVEDFEF----CSLDR---KNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEH

Query:  EEALIKLLQQYRKAIDWTLADIQGISPSFCMHKITLEEGSFRIIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKNN
         + LI++L+  +KAI WT++D++GISPS CMH+I +E+    I+E QRRLNP MKEVV+ EV+KWLDAGIIYPI+DS W+SPVQ VPKKGG+TVV N+NN
Subjt:  EEALIKLLQQYRKAIDWTLADIQGISPSFCMHKITLEEGSFRIIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKNN

Query:  ELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYYGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMLA
        ELIPTRTVTGWRVC+DYR+LN  TRKDHFPLPF+DQ+L+RLAG AYYCFLDGY GYNQI+I+PEDQEKTTFTCPYGTFAFRRMPFGLCNAP TFQRCM+A
Subjt:  ELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYYGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMLA

Query:  IFSDMIESTVEVFMDDFSVFGGSFQSCLDNL---------------------------------------------------------------------
        IFSD +E  +EVFMDDFSVFG SF SCLDNL                                                                     
Subjt:  IFSDMIESTVEVFMDDFSVFGGSFQSCLDNL---------------------------------------------------------------------

Query:  -----------------------------DCRKAFETLKATLISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQ------------------------
                                     DC  AF  LK  L+SAPI+ AP+W+LPFE+MCDASD A+GA+LGQ++                        
Subjt:  -----------------------------DCRKAFETLKATLISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQ------------------------

Query:  -----AVVFAFEKFRQYLVGSKVTVFTNHAAIR---TRRDQRMSL-------------------------------------------------QIICLV
             AVVFAF+KFR YLVGSKV V+T+H+AI+    ++D +  L                                                 Q++ + 
Subjt:  -----AVVFAFEKFRQYLVGSKVTVFTNHAAIR---TRRDQRMSL-------------------------------------------------QIICLV

Query:  LIH------HHLCWSNLPFSILF-QMSSFL-----------LLGK------------GDEATEILEQCHSLPYGGHFSGQRTTMRILHSRFFLPTLFKDS
        +I       ++L    +P  + + Q   FL           LL K             DE  +IL+ CHSL  GGHFS  +T  ++  S F+ PT+++D+
Subjt:  LIH------HHLCWSNLPFSILF-QMSSFL-----------LLGK------------GDEATEILEQCHSLPYGGHFSGQRTTMRILHSRFFLPTLFKDS

Query:  HWF------CKR--------------------------------------------FDYVSKWVEAIACHHSDAKTVARFLQSHIFARFGTPRALVSDEG
          +      C+R                                             DYVSKWVEA A   +D++ V RF++ +IF+RFG PRA++SDEG
Subjt:  HWF------CKR--------------------------------------------FDYVSKWVEAIACHHSDAKTVARFLQSHIFARFGTPRALVSDEG

Query:  THFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWA
        +HF N     LL KYG+ H++A  YHPQ NGQ E++NRE+K ILEK V  SRKDW+ +LD+ALWAYRTA+KTPLGMSPYRLV+GK+CHLP+ELEH+ +WA
Subjt:  THFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWA

Query:  LKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKINLRSLSMSKS-FVYNSRLKLFPGKLKSKWSRPFIVIEVFPNGAITLQDEKDG
        +K LN DL  AG  R+LQL+ELEEFR  +YEN ++YKEKTK WHDK + +R+  + +   ++NSRLKLFPGKL+S+WS PF V +V+P GA+ ++ E  G
Subjt:  LKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKINLRSLSMSKS-FVYNSRLKLFPGKLKSKWSRPFIVIEVFPNGAITLQDEKDG

Query:  RVFKDLNLLEPYLFTN
            +   L+PYL +N
Subjt:  RVFKDLNLLEPYLFTN

XP_038976409.1 uncharacterized protein LOC113461320 [Phoenix dactylifera]0.0e+0042.13Show/hide
Query:  NDRTRAIRAYDVSMFNELNPGIARPQIQAANFEMKPVMFQMLETVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSF
        N   R +  Y V   N   P I RP + A NFE+KP + QM++   QF G  SEDPH HL +FL + D+  + GV  DA+RL LFP+SL+D AK+WLNS 
Subjt:  NDRTRAIRAYDVSMFNELNPGIARPQIQAANFEMKPVMFQMLETVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSF

Query:  ALGSIRTWDELAEKFLSKYFPPNRNVKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDKAY
        A  S  TW+ L++ FLSKYFPP +  KLR++I  F Q + E+  EAWERFK+L RKCPHHGLP  + ++TFYNGL    +  +DA+AGG L++K+ ++AY
Subjt:  ALGSIRTWDELAEKFLSKYFPPNRNVKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDKAY

Query:  EILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAM--EPATVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVVT
        E+LE ++ N+ QWS+ R   KKV  + +VDG++ + A +  +      +  ++    P +  +     +  +D   V    ++N +   +NP S  +   
Subjt:  EILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAM--EPATVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVVT

Query:  FR---------------------------------SGKSLE--------------ERIEPSKTQEVNGDRNNNVVEEELES-----GQGAGGSKKDAG--
        +R                                 S +S E              ER+E    Q  + +RN  +   +L +     GQG   SK +    
Subjt:  FR---------------------------------SGKSLE--------------ERIEPSKTQEVNGDRNNNVVEEELES-----GQGAGGSKKDAG--

Query:  -----------------------------------ASGFVPDV---EPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPN
                                            S  V D+     P  P  PYVPP+PFPQR K    D QF+KFL++ +QLHINIP  +A+ Q+P 
Subjt:  -----------------------------------ASGFVPDV---EPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPN

Query:  YAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTRPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADMSITYP
        Y KFLK+I++KK++L +FET++LTEECSAI++N LPPK +DPGSF+ P +IG  +  RALCDLGAS++LMPLSV RKLG+ E +PTT++LQLAD S+ YP
Subjt:  YAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTRPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADMSITYP

Query:  EGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIEIALQDSAD
         G +E+VL+KV KFI PVDFI+L+ E D ++PIILGRPF AT  A+IDV+ G LT++V  EEV+FN+F+A KYP   +    + +++ +  E    ++  
Subjt:  EGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIEIALQDSAD

Query:  KHSE----KHGEVSVEDFEF----CSLDR---KNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEH
        +  E      G    ++ E     C+L+    K +K     ED+        +  PP  PS ++AP L+LKPLP HL Y +LGE+ TLP+IV+  L  E 
Subjt:  KHSE----KHGEVSVEDFEF----CSLDR---KNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEH

Query:  EEALIKLLQQYRKAIDWTLADIQGISPSFCMHKITLEEGSFRIIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKNN
         + LI++L+  +KAI WT++D++GISPS CMH+I +E+    I+E QRRLNP MKEVV+ EV+KWLDAGIIYPI+DS W+SPVQ VPKKGG+TVV N+NN
Subjt:  EEALIKLLQQYRKAIDWTLADIQGISPSFCMHKITLEEGSFRIIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKNN

Query:  ELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYYGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMLA
        ELIPTRTVTGWRVC+DYR+LN  TRKDHFPLPF+DQ+L+RLAG AYYCFLDGY GYNQI+I+PEDQEKTTFTCPYGTFAFRRMPFGLCNAP TFQRCM+A
Subjt:  ELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYYGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMLA

Query:  IFSDMIESTVEVFMDDFSVFGGSFQSCLDNL---------------------------------------------------------------------
        IFSD +E  +E+FMDDFSVFG SF SCLDNL                                                                     
Subjt:  IFSDMIESTVEVFMDDFSVFGGSFQSCLDNL---------------------------------------------------------------------

Query:  -----------------------------DCRKAFETLKATLISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQ------------------------
                                     DC  AF  LK  L+SAPI+ AP+W+LPFE+MCDASD A+GA+LGQ++                        
Subjt:  -----------------------------DCRKAFETLKATLISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQ------------------------

Query:  -----AVVFAFEKFRQYLVGSKVTVFTNHAAIR---TRRDQRMSL-------------------------------------------------QIICLV
             AVVFAF+KFR YLVGSKV V+T+H+AI+    ++D +  L                                                 Q++ + 
Subjt:  -----AVVFAFEKFRQYLVGSKVTVFTNHAAIR---TRRDQRMSL-------------------------------------------------QIICLV

Query:  LIH------HHLCWSNLPFSILF-QMSSFL-----------LLGK------------GDEATEILEQCHSLPYGGHFSGQRTTMRILHSRFFLPTLFKDS
        +I       ++L    +P  + + Q   FL           LL K             DE  +IL+ CHSL  GGHFS  +T  ++  S F+ PT+++D+
Subjt:  LIH------HHLCWSNLPFSILF-QMSSFL-----------LLGK------------GDEATEILEQCHSLPYGGHFSGQRTTMRILHSRFFLPTLFKDS

Query:  HWF------CKR--------------------------------------------FDYVSKWVEAIACHHSDAKTVARFLQSHIFARFGTPRALVSDEG
          +      C+R                                             DYVSKWVEA A   +D++ V RF++ +IF+RFG PRA++SDEG
Subjt:  HWF------CKR--------------------------------------------FDYVSKWVEAIACHHSDAKTVARFLQSHIFARFGTPRALVSDEG

Query:  THFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWA
        +HF N     LL KYG+ H++A  YHPQ NGQ E++NRE+K ILEK V  SRKDW+ +LD+ALWAYRTA+KTPLGMSPYRLV+GK+CHLP+ELEH+ +WA
Subjt:  THFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWA

Query:  LKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKINLRSLSMSKS-FVYNSRLKLFPGKLKSKWSRPFIVIEVFPNGAITLQDEKDG
        +K LN DL  AG  R+LQL+ELEEFR  +YEN ++YKEKTK WHDK + +R+  + +   ++NSRLKLFPGKL+S+WS PF V +V+P GA+ ++ E  G
Subjt:  LKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKINLRSLSMSKS-FVYNSRLKLFPGKLKSKWSRPFIVIEVFPNGAITLQDEKDG

Query:  RVFKDLNLLEPYLFTN
            +   L+PYL +N
Subjt:  RVFKDLNLLEPYLFTN

TrEMBL top hitse value%identityAlignment
A0A2G9FWY3 Reverse transcriptase0.0e+0042.59Show/hide
Query:  MLETVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFALGSIRTWDELAEKFLSKYFPPNRNVKLRSEIVGFRQLED
        M++   QF GLS E+P+ H+ +FL + D+   +GV +DALRL LF +SL   A  W  S    SI TW                                
Subjt:  MLETVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFALGSIRTWDELAEKFLSKYFPPNRNVKLRSEIVGFRQLED

Query:  ETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDKAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLA
        ET  EAW RF+++LR CP+H +P  IQ+ TFY+GL    +  +D   G + L+ T  + + +L  +  N  +    R T  K   V+EVD V+ + A + 
Subjt:  ETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDKAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLA

Query:  MIANALKNVTVISHQQPPAMEPATVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVVTFRSGKS--LEERIEPSKTQEVNGDRNNNVVEEELESGQGAGG
         +  ++KN  V   Q  P               C  CGE H  + CP +  S+ FV   R  ++        P   Q  N   NNN  +      Q  G 
Subjt:  MIANALKNVTVISHQQPPAMEPATVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVVTFRSGKS--LEERIEPSKTQEVNGDRNNNVVEEELESGQGAGG

Query:  SKKDAGASGFVPDVEP------------------------------PYVPPPPYVPPLP------------------------FPQRQKPKN--QDGQFK
         +         P +E                               P    P    P P                         P + K K    + + K
Subjt:  SKKDAGASGFVPDVEP------------------------------PYVPPPPYVPPLP------------------------FPQRQKPKN--QDGQFK

Query:  KFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTRPVSIGGKELGRALCDLGASINLMPLSVYR
        +    L++LHINIP  EA+EQMP+Y KF+KDIL+KK+ LG++ETV+LTEECSAI++N LPPK KDPGSFT P +IG    GRALCDLG    L       
Subjt:  KFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTRPVSIGGKELGRALCDLGASINLMPLSVYR

Query:  KLGIGEARPTTVTLQLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDE
            GEA+PT++TLQLAD S+TYP+G IED+LVKVDKFIFP DF++LD E D +VPIILGRPF ATGR LIDVQ                  KAMK+P+E
Subjt:  KLGIGEARPTTVTLQLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDE

Query:  MEDCSFIRILESTV------------IEIALQDSADKHSEKHGEVSVEDFEFCSLDRKNEKELFRCEDVFESLDLDQRKAPP--IKPSLIEAPTLDLKPL
         ++C  + + ++ V            +E AL D  D+ +E       ED+E   +   +  + F+   V ESL   +R AP   +KPS+ E PTL+LKPL
Subjt:  MEDCSFIRILESTV------------IEIALQDSADKHSEKHGEVSVEDFEFCSLDRKNEKELFRCEDVFESLDLDQRKAPP--IKPSLIEAPTLDLKPL

Query:  PDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIDWTLADIQGISPSFCMHKITLEEGSFRIIEQQRRLNPAMKEVVKKEVIKWLDAGIIYP
        P+HL Y YLGES+TLP+I++S L     E L+++L+ ++ AI WT+ADI+GISPSFCMHKI LE+     +E QRRLNP MKEVVKKE+IKWLDAGIIYP
Subjt:  PDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIDWTLADIQGISPSFCMHKITLEEGSFRIIEQQRRLNPAMKEVVKKEVIKWLDAGIIYP

Query:  IADSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYYGYNQITIAPEDQEKTTFTC
        I+DS+WVSPVQCVPKKGG+TVV N +NELIPTRTVTGWRVCMDYR+LNKATRKDHFPL FIDQMLDRLAG+ +YCFLDGY GYNQI IAPEDQEK TFTC
Subjt:  IADSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYYGYNQITIAPEDQEKTTFTC

Query:  PYGTFAFRRMPFGLCNAPTTFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSCLDNLD-----------------------------------------
        PYGTFAFRRMPFGLCNAP TFQRCM+AIF+DM+E+ +EVFMDDFSV+G SF  CL+NL                                          
Subjt:  PYGTFAFRRMPFGLCNAPTTFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSCLDNLD-----------------------------------------

Query:  ---------------------------------------------------------CRKAFETLKATLISAPILCAPNWNLPFEVMCDASDAAVGAMLG
                                                                 CR AF  LK  LISAPI+  P+W+ PFE+MCDASD AVGA+LG
Subjt:  ---------------------------------------------------------CRKAFETLKATLISAPILCAPNWNLPFEVMCDASDAAVGAMLG

Query:  QKQ-----------------------------AVVFAFEKFRQYLVGSKVTVFTNHAAIR---TRRDQRMSL----------------------------
        Q++                             AVVFAF+KFR YLVG+KV V+T+HAAIR    ++D +  L                            
Subjt:  QKQ-----------------------------AVVFAFEKFRQYLVGSKVTVFTNHAAIR---TRRDQRMSL----------------------------

Query:  ----------------------QIICLVL--------IHHHLCWSNLPFSILFQMSS--------------FLLLGKGD----------EATEILEQCHS
                              Q++ +V         I ++L    +PF +  Q                 FL     D          E  +ILEQCH+
Subjt:  ----------------------QIICLVL--------IHHHLCWSNLPFSILFQMSS--------------FLLLGKGD----------EATEILEQCHS

Query:  LPYGGHFSGQRTTMRILHSRFFLPTLFKDSHWF------CKR--------------------------------------------FDYVSKWVEAIACH
         PYGGHF G RT  +IL S FF P LFKD+H F      C+R                                             DYVSKWVEA A  
Subjt:  LPYGGHFSGQRTTMRILHSRFFLPTLFKDSHWF------CKR--------------------------------------------FDYVSKWVEAIACH

Query:  HSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAY
        ++D+K V  F++ +IF RFGTPRA++SD GTHF N     LL+KYG+KH+I+TPYHPQ +GQ E+SNREIK ILEK V  +RKDWS RLDEALWAYRTAY
Subjt:  HSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAY

Query:  KTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKINLRSLSMSK-SFVYNSRLKLFP
        KTP+GMSPYRLV+GKACHLP+ELEH  +WA++KLNFD+  AG  R+LQLNEL+EFR  +YENAK+YKEK K WH+KKI  R     +   ++NSRLKLFP
Subjt:  KTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKINLRSLSMSK-SFVYNSRLKLFP

Query:  GKLKSKWSRPFIVIEVFPNGAITLQDEKDGRVFK
        GKLKS+WS PF + EVFP+GA+ L+++     FK
Subjt:  GKLKSKWSRPFIVIEVFPNGAITLQDEKDGRVFK

A0A2G9G6G2 Reverse transcriptase0.0e+0042.19Show/hide
Query:  MLETVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFALGSIRTWDELAEKFLSKYFPPNRNVKLRSEIVGFRQLED
        M++   QF GLS E+P+ H+ +FL + D+   +GV +DALRL LF +SL   A  W  S    SI TW                                
Subjt:  MLETVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFALGSIRTWDELAEKFLSKYFPPNRNVKLRSEIVGFRQLED

Query:  ETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDKAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLA
        ET  EAW +F+++LR CP+H +P  IQ+ TFY+GL    +  +D   G + L+ T  + + +L  +  N  +    R T  K   V+EVD V+ + A + 
Subjt:  ETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDKAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLA

Query:  MIANALKNVTVISHQQPPAMEPATVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVVTFRSGKS--LEERIEPSKTQEVNGDRNNNVVEEELESGQGAGG
         +  ++KN  V   Q  P               C  CGE H  + CP +  S+ FV   R  ++        P   Q  N   NNN        GQG+  
Subjt:  MIANALKNVTVISHQQPPAMEPATVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVVTFRSGKS--LEERIEPSKTQEVNGDRNNNVVEEELESGQGAGG

Query:  SKKDAGASGFVPDVEPPYVPPPPYV--------------------------------PPLPFPQRQKPKN-QDGQFKKFLEI------------------
          +  G       V+PP     P +                                P    P   +P + QD   +   E+                  
Subjt:  SKKDAGASGFVPDVEPPYVPPPPYV--------------------------------PPLPFPQRQKPKN-QDGQFKKFLEI------------------

Query:  ---------LKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTRPVSIGGKELGRALCDLGASINLMPL
                 L++LHINIP  EA+EQMP+Y KF+KDIL+KK+RLG++E V+LTEECS I++N LPPK K+PGSFT P +IG    GRALCDLGASINLMP 
Subjt:  ---------LKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTRPVSIGGKELGRALCDLGASINLMPL

Query:  SVYRKLGIGEARPTTVTLQLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMK
        S+YR LG+GEA+PT++TLQLAD S+TYP+G I+D+LVKVDKFIFP DF++LD E D +VPIILGRPF ATGR LIDVQ                  KAMK
Subjt:  SVYRKLGIGEARPTTVTLQLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMK

Query:  YPDEMEDCSFIRILES------------TVIEIALQDSADKHSEKHGEVSVEDFEFCSLDRKNEKELFRCEDVFESLDLDQRKAPP--IKPSLIEAPTLD
        +P+E ++C  + + ++              +E AL D  D+ +EK  EV         +   +  + F+   V ESL   +R AP   +KPS+ E PTL+
Subjt:  YPDEMEDCSFIRILES------------TVIEIALQDSADKHSEKHGEVSVEDFEFCSLDRKNEKELFRCEDVFESLDLDQRKAPP--IKPSLIEAPTLD

Query:  LKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIDWTLADIQGISPSFCMHKITLEEGSFRIIEQQRRLNPAMKEVVKKEVIKWLDAG
        LKPLP HL Y YLGES+TLP+I++S L     E L+++L+ ++  I WT+ADI+GISPSFCMHKI LE+     IE QRRLNP MKEVVKKE+IKWLDAG
Subjt:  LKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIDWTLADIQGISPSFCMHKITLEEGSFRIIEQQRRLNPAMKEVVKKEVIKWLDAG

Query:  IIYPIADSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYYGYNQITIAPEDQEKT
        IIYPI+DS+WVSPVQCVPKKGG+TVV N +NELIPTRTVTGWRVCMDYR+LNKATRKDHFPLPFIDQMLDRLAG+ +YCFLDGY GYNQI IAPEDQEKT
Subjt:  IIYPIADSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYYGYNQITIAPEDQEKT

Query:  TFTCPYGTFAFRRMPFGLCNAPTTFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSCLDNLD-------------------------------------
        TFTCPYGTFAFRRMPFGLCNAP TFQRCM+AIF+DM+E+ +EVFMD+FSV+G SF  CL+NL                                      
Subjt:  TFTCPYGTFAFRRMPFGLCNAPTTFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSCLDNLD-------------------------------------

Query:  -------------------------------------------------------------CRKAFETLKATLISAPILCAPNWNLPFEVMCDASDAAVG
                                                                     C  AF  LK  LISAPI+  P+            D AVG
Subjt:  -------------------------------------------------------------CRKAFETLKATLISAPILCAPNWNLPFEVMCDASDAAVG

Query:  AMLGQKQ-----------------------------AVVFAFEKFRQYLVGSKVTVFTNHAAIR---TRRDQRMSLQIICLVL-----------------
        A+LGQ++                             AVVFAF+KFR YLV +KV V+T+HAAIR    ++D    L +  L+L                 
Subjt:  AMLGQKQ-----------------------------AVVFAFEKFRQYLVGSKVTVFTNHAAIR---TRRDQRMSLQIICLVL-----------------

Query:  -----------------------------------------IHHHLCWSNLPFSILFQMSS--------------FLLLGKGD----------EATEILE
                                                 I ++L    +PF +  Q                 FLL    D          E  +ILE
Subjt:  -----------------------------------------IHHHLCWSNLPFSILFQMSS--------------FLLLGKGD----------EATEILE

Query:  QCHSLPYGGHFSGQRTTMRILHSRFFLPTLFKDSHWF------CKR--------------------------------------------FDYVSKWVEA
        QCH+ PYGGHF G RT  +IL S FF P LFKD+H F      C+R                                             DYVSKWVEA
Subjt:  QCHSLPYGGHFSGQRTTMRILHSRFFLPTLFKDSHWF------CKR--------------------------------------------FDYVSKWVEA

Query:  IACHHSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAY
         A  ++D+K V  F++ +IF RFGTPRA++SD  T+F N     LL+KYG+KH+I TPYHPQ +G  E+SNREIK ILEK V  +RKDWS RLDEALWAY
Subjt:  IACHHSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAY

Query:  RTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKINLRSLSMSK-SFVYNSRL
        RTAYKTP+GMSPY L++GKACHLP+ELEH  +WA+ KLNFD+  AG  R+LQLNEL+EFR  +YENAK+YKEKTK WHDKKI  R     +   ++NSRL
Subjt:  RTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKINLRSLSMSK-SFVYNSRL

Query:  KLFPGKLKSKWSRPFIVIEVFPNGAITLQDEKDGRVFK
        KLFPGKLKS+W   F + EVFP+GA+ L++E     FK
Subjt:  KLFPGKLKSKWSRPFIVIEVFPNGAITLQDEKDGRVFK

A0A2G9HWF8 Reverse transcriptase0.0e+0041.09Show/hide
Query:  DPEIERTFRNRRREQRRNQMENAPQLPQVHEGPAAANPQQNPLLQQNPLFEQNEQRNNQAENPILIANDRTRAIRAYDVSMFNELNPGIARPQIQAANFE
        DPEIERTFR RRR+                    A + +Q   +++N +    +   N     + +   R R        +F EL  G+          +
Subjt:  DPEIERTFRNRRREQRRNQMENAPQLPQVHEGPAAANPQQNPLLQQNPLFEQNEQRNNQAENPILIANDRTRAIRAYDVSMFNELNPGIARPQIQAANFE

Query:  MKPVMFQMLETVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFALGSIRTWDELAEKFLSKYFPPNRNVKLRSEIV
        ++  M QM++   QF GLS E+P+ H+ +FL + D+   +GV +DALRL LF +SL   A  W  S    SI TW +L E+F+SK+F P +   LR+EI+
Subjt:  MKPVMFQMLETVGQFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFALGSIRTWDELAEKFLSKYFPPNRNVKLRSEIV

Query:  GFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDKAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVS
         FRQ   ET  EAW RF+++LR CP+H +P  IQ+ TFY+GL    +  +D   G + L+ T  + + +L  + +N  +    R T  K   V+EVD V+
Subjt:  GFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDKAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVS

Query:  TIRADLAMIANALKN------------------VTVISHQQPPAMEPATVV---------------NQVTDEACVY-CGEDHNYEFCPSNPASV-FFVVT
         + A +  +  ++KN                  +  +S+ + P   P +                 NQ    A  +  G  HN +  P          VT
Subjt:  TIRADLAMIANALKN------------------VTVISHQQPPAMEPATVV---------------NQVTDEACVY-CGEDHNYEFCPSNPASV-FFVVT

Query:  FRSGKSLEERI-EPSKTQEVNGDRNNNVVEEELESGQGAGGSKKDAGASGFVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPL
         R+G+ L+E + EP+K++E        V  EE E                   +VE P     P     PFPQR + +    QF KFLE+ K+LHINIP 
Subjt:  FRSGKSLEERI-EPSKTQEVNGDRNNNVVEEELESGQGAGGSKKDAGASGFVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPL

Query:  VEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTRPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQ
         EA+EQMP+Y KF+KDIL+KK+RLG++ETV+LTEECSAI++N LPPK KDPG              RALCDLGASINLMP S+YR LG+ EA+PT++TLQ
Subjt:  VEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTRPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQ

Query:  LADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV-
        LAD S+TYP+G IED+LVKVDKFIFP DF++LD E D +VPIILGRPF ATGR LIDVQKGELTMRV ++++ FNVFKAMK+P+E ++C  + + ++   
Subjt:  LADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV-

Query:  -----------IEIALQDSADKHSEKHGEVSVEDFEFCSLDRKNEKELFRCEDVFESLDLDQRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETL
                   +E AL D  D+ +E+  EV         +   +  + F+   V ESL   +R AP   +KPS+ E PTL+LKPLP HL Y YLGES+TL
Subjt:  -----------IEIALQDSADKHSEKHGEVSVEDFEFCSLDRKNEKELFRCEDVFESLDLDQRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETL

Query:  PIIVASDLMPEHEEALIKLLQQYRKAIDWTLADIQGISPSFCMHKITLEEGSFRIIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPK
        P+I++S L     E L+++L+ ++ AI WT+ADI+GISPSFCMHKI LE+     +E QRRLNP MKEVVKKE+IKWLDAGIIYPI+D +W+SPVQCVPK
Subjt:  PIIVASDLMPEHEEALIKLLQQYRKAIDWTLADIQGISPSFCMHKITLEEGSFRIIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPK

Query:  KGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYYGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLC
        KGG+TVV N +NE IPT+TVTGWRVCMDYR+LNKATRKDHFPLPFIDQMLDRLAG+ +YCFLDGY GYNQI IAPEDQEKTTFTCPYGTFAFRR+PF LC
Subjt:  KGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYYGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLC

Query:  NAPTTFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSCLDNLD--------------------------------------------------------
        NAP TFQRCM+AIF+DM+E+ +EVFMDDFSV+G SF  CL+NL                                                         
Subjt:  NAPTTFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSCLDNLD--------------------------------------------------------

Query:  ------------------------------------------CRKAFETLKATLISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQ------------
                                                  C  AF+ LK  LISAPI+  P+W+ PFE+MCDASD A+GA+LGQ++            
Subjt:  ------------------------------------------CRKAFETLKATLISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQ------------

Query:  -----------------AVVFAFEKFRQYLVGSKVTVFTNHAAIR---TRRDQRMSL-------------------------------------------
                         AVVFAF+KFR YLVG+KV V+T+HAAIR    ++D +  L                                           
Subjt:  -----------------AVVFAFEKFRQYLVGSKVTVFTNHAAIR---TRRDQRMSL-------------------------------------------

Query:  -------QIICLVL--------IHHHLCWSNLPFSILFQMSS--------------FLLLGKGD----------EATEILEQCHSLPYGGHFSGQRTTMR
               Q++ +V         I ++L    +PF +  Q                 FL     D          E  +I EQCH+ PYGGHF   RT  +
Subjt:  -------QIICLVL--------IHHHLCWSNLPFSILFQMSS--------------FLLLGKGD----------EATEILEQCHSLPYGGHFSGQRTTMR

Query:  ILHSRFFLPTLFKDSHWF------CKR--------------------------------------------FDYVSKWVEAIACHHSDAKTVARFLQSHI
        IL S FF P LFKD H F      C+R                                             DY+SKWVEA+A  ++D+K V  F++ +I
Subjt:  ILHSRFFLPTLFKDSHWF------CKR--------------------------------------------FDYVSKWVEAIACHHSDAKTVARFLQSHI

Query:  FARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGK
        F RFGTPRA++SD GTHF N     LL+KYG+KH+I+TPYHPQ +GQ E+SNREIK  LEK V  +RKDWS RLDEALWAYRTA+KTP+GMSPY LV+GK
Subjt:  FARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGK

Query:  ACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKINLRSLSMSKSFVYNSRLKLFPGKLKSKWSRPFIVIEV
        ACHLP  +E +                            F    Y                             ++NSRLKLFP KLKS+WSRPF + EV
Subjt:  ACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKINLRSLSMSKSFVYNSRLKLFPGKLKSKWSRPFIVIEV

Query:  FPNGAITLQDEKDGRVFK
         P+GA+ L+++     FK
Subjt:  FPNGAITLQDEKDGRVFK

A0A2G9IA86 DNA-directed DNA polymerase0.0e+0044.07Show/hide
Query:  FRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDKAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVST
        FRQ   ET  EAW RF+++LR CP+H +P  IQ+ TFY+GL    +  +D   G + L+ T  + + +L  +  N  +    R T  K   V+EVD V+ 
Subjt:  FRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDKAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVST

Query:  IRADLAMIANALKNVTVISHQQPPAMEPATVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVVTFRSGKS--LEERIEPSKTQEVNGDRNNNVVEEELES
        + A +  +  ++KN  V   Q  P               C  CGE H    CP++  S+ FV   R  ++        P   Q  N   NNN  +     
Subjt:  IRADLAMIANALKNVTVISHQQPPAMEPATVVNQVTDEACVYCGEDHNYEFCPSNPASVFFVVTFRSGKS--LEERIEPSKTQEVNGDRNNNVVEEELES

Query:  GQGAGGSKKDAGASGFVPDVEPPYV-------------------------------------PPP-----------------------------------
         Q +G  +         P +E   +                                     P P                                   
Subjt:  GQGAGGSKKDAGASGFVPDVEPPYV-------------------------------------PPP-----------------------------------

Query:  ------PYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTRP
                  PL   Q+QK K    QF KFLE+ K+LHIN P  EA+EQMP+Y KF+K IL+KK+RLG++ETV+LTEECSAI++N LPPK KDPGSFT P
Subjt:  ------PYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTRP

Query:  VSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALID
         +IG    GRALCDLGASINLMP S+YR LG+GEA+PT++TLQLA+ S+TYP+G IED+LVKVDKFIFP DF++LD E D +VPIILGRPF ATGR LID
Subjt:  VSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALID

Query:  VQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV--------IEIALQDSADKHSEKHGEVSVEDFEFCSLDRKNEKELFRCEDVFESLDLD
        VQKG+LTMRV ++++ FNVFKAMK+P+E ++C  + + ++          +E AL D  D+ +E+  EV         +   +  + F+   V ESL   
Subjt:  VQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTV--------IEIALQDSADKHSEKHGEVSVEDFEFCSLDRKNEKELFRCEDVFESLDLD

Query:  QRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIDWTLADIQGISPSFCMHKITLEEGSFRIIEQQRR
        +R AP   +KPS+ E+PTL+LKPLP HL Y YLGES+TLP+I++S L     E L+++ + ++ AI WT+ADI+GIS SFCMHKI LE+     +E QRR
Subjt:  QRKAPP--IKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIDWTLADIQGISPSFCMHKITLEEGSFRIIEQQRR

Query:  LNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCF
        LNP MKEVVKKE+IKW+DAGIIYPI+DS+WVSPVQCVPKKGG+TVV N +NELIPTRTVTGWRVCMDYR+LNKATRKDHFPLPFIDQMLDRLAG+ +YCF
Subjt:  LNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCF

Query:  LDGYYGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSCLDNLD-----------------
        LDGY           DQEKTTFTCPYGTFAFRR+PFGLCNAP TFQRCM+AIF+DM+E+ +EVFMDDFSV+G SF  CL+NL                  
Subjt:  LDGYYGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSCLDNLD-----------------

Query:  ---------------------------------------------------------------------------------CRKAFETLKATLISAPILC
                                                                                         C  AF  LK  LISAPI+ 
Subjt:  ---------------------------------------------------------------------------------CRKAFETLKATLISAPILC

Query:  APNWNLPFEVMCDASDAAVGAMLGQKQ-----------------------------AVVFAFEKFRQYLVGSKVTVFTNHAAIR---TRRDQRMSL----
         P+W+ PFE+MCDASD A+GA+LGQ++                             AVVFAF+KFR YLVG+KV V+T+HAAIR    ++D +  L    
Subjt:  APNWNLPFEVMCDASDAAVGAMLGQKQ-----------------------------AVVFAFEKFRQYLVGSKVTVFTNHAAIR---TRRDQRMSL----

Query:  ---------------QIICLVL--------IHHHLCWSNLPFSILFQMSS--------------FLLLGKGD----------EATEILEQCHSLPYGGHF
                       Q++ +V         I ++L    +PF +  Q                 FL     D          E  +ILEQCH+ PYGGHF
Subjt:  ---------------QIICLVL--------IHHHLCWSNLPFSILFQMSS--------------FLLLGKGD----------EATEILEQCHSLPYGGHF

Query:  SGQRTTMRILHSRFFLPTLFKDSHWF------CKR--------------------------------------------FDYVSKWVEAIACHHSDAKTV
         G RT  +IL S FF P LFKD+H F      C+R                                             DYVSKWVEA+A  ++D+K V
Subjt:  SGQRTTMRILHSRFFLPTLFKDSHWF------CKR--------------------------------------------FDYVSKWVEAIACHHSDAKTV

Query:  ARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMS
          F++ +IF RFGTPRA++SD G HF N      L+KYG+KH+I TPYHPQ +GQ E+SNREIK ILEK V  +R DWS RLDEALWAYRT YKTP+GMS
Subjt:  ARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMS

Query:  PYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKINLRSLSMSK-SFVYNSRLKLFPGKLKSKW
        PY L++GKACHL +ELEH  +WA++KLNFD+   G  R+LQLNEL+EFR  +YENAK+YKEKTK WHDKKI  R     +   ++NSRLKLFP KLK +W
Subjt:  PYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKINLRSLSMSK-SFVYNSRLKLFPGKLKSKW

Query:  SRPFIVIEVFPNGAITLQDEKDGRVFK
        S PF + EVFP+GA+ L++E     FK
Subjt:  SRPFIVIEVFPNGAITLQDEKDGRVFK

A0A6P8CBX2 Reverse transcriptase0.0e+0041.34Show/hide
Query:  QFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFALGSIRTWDELAEKFLSKYFPPNRNVKLRSEIVGFRQLEDETFSEA
        QF G  +E P  H+  FL   ++  +  V  D +RL LFP+SLRD A++W NS    SI TW +L+ KFL ++FPP R  +LR+EI  F +   E+  EA
Subjt:  QFHGLSSEDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFALGSIRTWDELAEKFLSKYFPPNRNVKLRSEIVGFRQLEDETFSEA

Query:  WERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDKAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANAL
        WERFKE +RKCPHHGLP  + +E FY  L+   + +VDA+AGGAL+ K +D+A  ++E ++ ++  W + R +  +V SV ++D ++ +   ++ +   +
Subjt:  WERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASAGGALLAKTFDKAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANAL

Query:  KNVTVISHQQPPAMEPATVVNQVTDEACVYCGEDHNYEFCPS-NPAS------VFFVVTFRSG-------------------------------------
          +T            +   NQV    C  C   H+   C S NP++      V FV  F+                                       
Subjt:  KNVTVISHQQPPAMEPATVVNQVTDEACVYCGEDHNYEFCPS-NPAS------VFFVVTFRSG-------------------------------------

Query:  -------------------------------------KSLEERIE--------------PSKTQE---------VNGDRNNNVVEEELESGQGAGGSKKD
                                             ++LE +I               PS T+E         +   +   +V  + ++ + +   +KD
Subjt:  -------------------------------------KSLEERIE--------------PSKTQE---------VNGDRNNNVVEEELESGQGAGGSKKD

Query:  AGASGFVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKN---
         G    V +     +   PYVPP+PFP R K +  D QF KFL++ K+L INIP  EA++QMP+YA+F+KD+LTKK++    E V LT ECS IL+    
Subjt:  AGASGFVPDVEPPYVPPPPYVPPLPFPQRQKPKNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKN---

Query:  GLPPKAKDPGSFTRPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPI
         LP K +D GSFT P +IG       L D GASINLMPLS++RKLG+GE + T +TLQLAD SI YP+G +E+VLVKVDKFIFPVDFI+L+ E DR+VP+
Subjt:  GLPPKAKDPGSFTRPVSIGGKELGRALCDLGASINLMPLSVYRKLGIGEARPTTVTLQLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPI

Query:  ILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIEIALQDSADKHSEKHGEVSVEDFEFCSLDRKNEKELFRCEDV
        ILGRPF ATG+ALIDV++G+LT+RV NE++ FNV+ A+K  D+ + C  I I++    E+  +   +K      E  + D +  S D ++E+E       
Subjt:  ILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIEIALQDSADKHSEKHGEVSVEDFEFCSLDRKNEKELFRCEDV

Query:  FESLDLDQRKAPPIKP--SLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIDWTLADIQGISPSFCMHKITLEEGSFR
         ++   ++      KP  SL ++P L+LKPLP HLKY YLG  +TLPII++S L  + E+ L+ +L+++++AI WT+ADI+GISP  C H+I LE     
Subjt:  FESLDLDQRKAPPIKP--SLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQYRKAIDWTLADIQGISPSFCMHKITLEEGSFR

Query:  IIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLA
        I++ QRRLNP +KEVVKKEV+K LDAGIIYPI+DS WVSPVQ VPKKGG+TVV N+ N+LIPTRTVTGWRVC+DYR+LN ATRKDHFPLPFIDQML++LA
Subjt:  IIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLA

Query:  GQAYYCFLDGYYGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSCLDNL-----------
        G  YYCFLDGY GYNQI IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAP TFQRCM++IFSDM+E+ +E+FMDDFSVFG SF+SCL NL           
Subjt:  GQAYYCFLDGYYGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSCLDNL-----------

Query:  ---------------------------------------------------------------------------------------DCRKAFETLKATL
                                                                                               +C +AF  LK  L
Subjt:  ---------------------------------------------------------------------------------------DCRKAFETLKATL

Query:  ISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQ-----------------------------AVVFAFEKFRQYLVGSKVTVFTNHAAIR---------
         SAP++ APNW LPFE+MCDASD AVGA+LGQ++                             AV+FA +KFR YL+GSK+ V+T+HAA++         
Subjt:  ISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQ-----------------------------AVVFAFEKFRQYLVGSKVTVFTNHAAIR---------

Query:  -------------------------TRRDQRMSLQIICL--------------VLIHHHLCW---------SNL-PFSI---------------------
                                    D    L+  CL              V     L W         SN+ P+ +                     
Subjt:  -------------------------TRRDQRMSLQIICL--------------VLIHHHLCW---------SNL-PFSI---------------------

Query:  LFQMSSFLLLGKGDEATE---ILEQCHSLPYGGHFSGQRTTMRILHSRFFLPTLFKDSHWF------CKR------------------------------
        LF+  +  ++ +    TE   I++ CHS   GGHF  +RT  +IL   F+ P +F D   +      C+R                              
Subjt:  LFQMSSFLLLGKGDEATE---ILEQCHSLPYGGHFSGQRTTMRILHSRFFLPTLFKDSHWF------CKR------------------------------

Query:  --------------FDYVSKWVEAIACHHSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKA
                       DYVSKWVEA+A   +DA+ V RFL+ +IF+RFG PRA++SD G+HF N    KLL+KYG+ H+IATPYHPQ  GQ E+SNREIK 
Subjt:  --------------FDYVSKWVEAIACHHSDAKTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKA

Query:  ILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKL
        ILEK V+ SRKDWS +LD+ALWAYRTA+KTP+GMSPY++VYGK+CHLP+ELEHK +WA+K LNFDL  AG  R+LQLN++ E R+ +YENA++YKE+ K 
Subjt:  ILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPYRLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKL

Query:  WHDKKINLRS-LSMSKSFVYNSRLKLFPGKLKSKWSRPFIVIEVFPNGAITLQDEKDGRVFK
        WHD+ I  R  L   K  +YNSRLKLFPGKLKS+WS PF++  VFP GA+ L+ E D R FK
Subjt:  WHDKKINLRS-LSMSKSFVYNSRLKLFPGKLKSKWSRPFIVIEVFPNGAITLQDEKDGRVFK

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.6e-2135.2Show/hide
Query:  AMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDG
        A ++ V+ ++   L+ GII   ++S + SP+  VPKK      S K            +R+ +DYR+LN+ T  D  P+P +D++L +L    Y+  +D 
Subjt:  AMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDG

Query:  YYGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSCLDNL
          G++QI + PE   KT F+  +G + + RMPFGL NAP TFQRCM  I   ++     V++DD  VF  S    L +L
Subjt:  YYGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSCLDNL

P10394 Retrovirus-related Pol polyprotein from transposon 4122.6e-2124.21Show/hide
Query:  VKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIEIALQDSADKHSEKHGE
        ++  K++ P DF ++D         I+G  F       ID+ + E    +    +KF ++  + Y   + + + +      V  + +    D     + E
Subjt:  VKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMEDCSFIRILESTVIEIALQDSADKHSEKHGE

Query:  VSVEDFEFCSLDRKNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQY-RKAIDWT
        +    +   ++   +   +     +  + D DQ         L+   TL  +PL ++   V    SE     V S L     E     L+    + ID  
Subjt:  VSVEDFEFCSLDRKNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASDLMPEHEEALIKLLQQY-RKAIDWT

Query:  LADIQGISPSFCMHKITLEEGSFRIIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYR
          + + I+ +    +    +    +  +  R   +  E ++ +V K +   I+ P + S + SP+  VPKK              P      WR+ +DYR
Subjt:  LADIQGISPSFCMHKITLEEGSFRIIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYR

Query:  RLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYYGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMLAIFSDMIESTVEVFMDDFS
        ++NK    D FPLP ID +LD+L    Y+  LD   G++QI +    ++ T+F+   G++ F R+PFGL  AP +FQR M   FS +  S   ++MDD  
Subjt:  RLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYYGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMLAIFSDMIESTVEVFMDDFS

Query:  VFGGSFQSCLDNL
        V G S +  L NL
Subjt:  VFGGSFQSCLDNL

P20825 Retrovirus-related Pol polyprotein from transposon 2975.0e-2034.48Show/hide
Query:  VKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYYGYN
        V+ +V + L+ G+I   ++S + SP   VPKK   +  +NK            +RV +DYR+LN+ T  D +P+P +D++L +L    Y+  +D   G++
Subjt:  VKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKNNELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYYGYN

Query:  QITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSCLDNL
        QI +  E   KT F+   G + + RMPFGL NAP TFQRCM  I   ++     V++DD  +F  S    L+++
Subjt:  QITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMLAIFSDMIESTVEVFMDDFSVFGGSFQSCLDNL

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein7.9e-2636.52Show/hide
Query:  LLQQYRKAIDWTL----ADIQGISPSFCMHKITLEEGSFRIIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKNNEL
        L Q+YR+ I   L    ADI  I      H I ++ G+     Q   +    ++ + K V K LD   I P + S   SPV  VPKK G           
Subjt:  LLQQYRKAIDWTL----ADIQGISPSFCMHKITLEEGSFRIIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKNNEL

Query:  IPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYYGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMLAIF
                +R+C+DYR LNKAT  D FPLP ID +L R+     +  LD + GY+QI + P+D+ KT F  P G + +  MPFGL NAP+TF R M   F
Subjt:  IPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYYGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMLAIF

Query:  SDMIESTVEVFMDDFSVFGGSFQSCLDNLD
         D+    V V++DD  +F  S +    +LD
Subjt:  SDMIESTVEVFMDDFSVFGGSFQSCLDNLD

Q99315 Transposon Ty3-G Gag-Pol polyprotein7.9e-2636.52Show/hide
Query:  LLQQYRKAIDWTL----ADIQGISPSFCMHKITLEEGSFRIIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKNNEL
        L Q+YR+ I   L    ADI  I      H I ++ G+     Q   +    ++ + K V K LD   I P + S   SPV  VPKK G           
Subjt:  LLQQYRKAIDWTL----ADIQGISPSFCMHKITLEEGSFRIIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKNNEL

Query:  IPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYYGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMLAIF
                +R+C+DYR LNKAT  D FPLP ID +L R+     +  LD + GY+QI + P+D+ KT F  P G + +  MPFGL NAP+TF R M   F
Subjt:  IPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYYGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMLAIF

Query:  SDMIESTVEVFMDDFSVFGGSFQSCLDNLD
         D+    V V++DD  +F  S +    +LD
Subjt:  SDMIESTVEVFMDDFSVFGGSFQSCLDNLD

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGCTTCCGCTGCGTGGGGACTTTTATTCCCTTCACTCTAGGCCTTTGGCCCTGGTCGGCCTCGACAGTGGGTCGAGGCCGACCTCATCATCCTACGGTTTTT
AAGAATTCGGAGGTGTCTCGGGACGAACCAGGTGAAACCGAGGCGGTCGGGGACATCAGGGACCGAAGGGAGGTGGCCAAACTCAGCCCGCGCCCGCGAGCCGAA
TGGTCGGCCTCGGCCTTATGTCGAGGTCGGCCTTGGGCCGAGGCCGACCATATGGGTCGGGCCAAATTGGCCCGACCCTTTGGTCTAGTCTTCCTCTGGGTCGGC
TTTCTGGTCCCACCTCTGCCCAATTGTCCTCATGCTTTTAAGAAGCAGGAATTGATCATTCGTACCCTTGGAAATGACTTACAAGAGATTTGCTTGCTTATAAAA
CCGAAGGATAGCCACACAACACCACTGATAACTGCCCAAAAGATTATGCTGCTGAGCGACTGGAGGGAGCAAATTCTATGCCACAGCAAAACTGGGAGCAGAAAC
TGCCACATCACAGCTCGTGTGATTTTGGTGTATGAGCTTGATCCAGAAATTGAAAGGACATTTAGGAACAGAAGGAGGGAGCAGCGCAGAAACCAGATGGAGAAC
GCCCCGCAACTTCCACAGGTTCATGAAGGTCCAGCAGCAGCAAACCCCCAGCAGAATCCGTTGCTGCAGCAAAACCCACTGTTTGAGCAAAATGAGCAGCGAAAT
AATCAGGCTGAGAATCCTATCTTGATAGCGAACGATAGGACCAGAGCCATTCGAGCGTATGATGTCTCGATGTTTAATGAGTTGAATCCAGGGATTGCACGTCCC
CAAATCCAAGCGGCGAATTTTGAAATGAAACCGGTAATGTTTCAGATGTTGGAAACCGTGGGGCAATTCCATGGTTTGTCATCTGAAGACCCTCATTTACATCTT
AAGTCTTTTCTAGGAGTTAGTGATTCTTTTGTAATTCAAGGAGTGCCTAGAGATGCTCTTAGATTAACATTGTTCCCGTATTCTCTTAGAGATGGAGCAAAGTCA
TGGTTAAACTCTTTTGCTCTAGGATCAATTAGGACGTGGGATGAGTTAGCTGAAAAGTTTTTGAGTAAATATTTCCCACCTAATAGAAATGTGAAATTAAGGAGT
GAAATAGTAGGGTTTAGGCAACTTGAAGATGAGACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTACGAAAGTGTCCCCACCATGGGTTACCTCATTGT
ATTCAAATGGAAACATTTTACAATGGTTTAAATGGAGTAACCCAAGGTATGGTCGATGCTTCGGCTGGAGGGGCCCTTTTGGCAAAAACTTTTGATAAAGCCTAT
GAAATTTTAGAAAGAATATCTATTAATAGTTGCCAGTGGTCGGATGTTAGAGGCACAAATAAAAAGGTTAAGAGTGTATTAGAGGTTGATGGTGTGTCCACCATT
AGGGCCGATCTTGCTATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCACCAGCTATGGAGCCTGCAACAGTAGTGAACCAAGTCACG
GACGAAGCATGTGTCTATTGCGGTGAAGACCACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTTTTTGTAGTAACTTTTAGGAGTGGTAAGTCATTA
GAAGAAAGAATTGAGCCTAGTAAAACCCAGGAGGTAAATGGTGATAGAAATAATAATGTTGTCGAGGAAGAGTTGGAGTCTGGTCAGGGTGCTGGAGGCAGCAAA
AAAGATGCTGGAGCATCTGGTTTTGTGCCAGATGTAGAACCACCATATGTGCCGCCTCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAAGCCTAAG
AATCAGGATGGTCAATTTAAAAAGTTTTTAGAGATTCTTAAGCAGTTGCATATAAATATCCCTTTAGTAGAAGCTATAGAGCAAATGCCTAATTATGCTAAATTT
CTTAAGGATATTTTGACTAAAAAGAAAAGGTTAGGTGAATTTGAAACGGTATCTCTTACTGAGGAGTGTAGTGCTATTCTTAAGAATGGGCTACCCCCCAAGGCT
AAGGATCCAGGGTCATTTACTAGACCTGTGTCTATAGGTGGAAAGGAATTAGGTAGAGCACTATGTGATTTAGGTGCAAGCATTAACCTTATGCCTCTTTCGGTC
TATCGAAAGTTAGGTATTGGTGAAGCTAGACCTACCACAGTTACACTCCAACTAGCTGATATGTCTATCACATATCCAGAGGGTAAGATTGAGGATGTCTTAGTA
AAAGTGGATAAATTCATATTTCCTGTTGATTTCATTATTTTAGATTATGAGGCTGATAGAGATGTCCCAATTATTTTAGGACGTCCATTTTTTGCTACTGGTAGG
GCATTAATAGATGTTCAAAAAGGAGAATTAACAATGAGAGTCTGTAATGAGGAAGTGAAATTTAATGTCTTTAAAGCCATGAAATATCCAGACGAAATGGAGGAT
TGCTCCTTCATCAGGATTCTGGAGAGCACAGTTATTGAGATAGCACTACAGGATTCAGCTGATAAGCATTCAGAAAAGCATGGAGAGGTTAGTGTAGAAGATTTT
GAATTTTGTTCTTTAGATAGAAAAAATGAAAAAGAATTGTTTAGGTGCGAGGATGTTTTTGAGTCTTTAGATTTAGATCAAAGAAAGGCTCCCCCAATTAAGCCA
TCCCTAATTGAGGCACCCACTTTAGATTTGAAACCTTTACCGGATCATCTAAAGTATGTGTATCTTGGGGAAAGTGAGACGTTGCCCATTATTGTTGCATCAGAT
TTAATGCCGGAGCATGAAGAGGCCTTAATAAAATTGCTGCAGCAATACCGCAAAGCTATAGATTGGACATTGGCTGATATTCAGGGAATTAGCCCATCTTTTTGT
ATGCACAAAATCACTCTAGAGGAGGGATCCTTTAGGATTATTGAGCAACAAAGAAGGCTTAACCCTGCAATGAAAGAGGTTGTTAAAAAGGAGGTAATTAAATGG
TTGGATGCTGGGATCATTTATCCAATTGCCGATAGCAATTGGGTAAGCCCTGTCCAATGTGTTCCTAAGAAAGGAGGTGTCACTGTGGTGAGCAATAAAAATAAT
GAGTTGATCCCCACAAGGACAGTAACTGGCTGGAGGGTTTGTATGGATTACAGAAGGCTTAATAAAGCCACCCGAAAGGATCATTTCCCTCTACCATTTATCGAC
CAGATGTTGGATCGATTGGCTGGTCAGGCCTATTACTGTTTCTTGGATGGTTATTATGGGTATAACCAGATTACTATTGCTCCTGAAGATCAGGAAAAAACCACT
TTCACCTGCCCTTATGGAACATTCGCTTTTAGGAGAATGCCTTTTGGCCTTTGCAATGCTCCAACAACATTTCAGCGGTGTATGTTAGCAATTTTTTCTGATATG
ATTGAGTCCACTGTTGAAGTCTTTATGGACGATTTTTCAGTTTTTGGAGGGTCTTTTCAGAGTTGTTTAGATAATTTAGATTGTAGGAAGGCTTTTGAAACATTG
AAGGCTACTTTAATCTCAGCACCCATTCTCTGCGCACCTAATTGGAATTTGCCATTCGAGGTAATGTGTGATGCGAGTGATGCTGCAGTAGGTGCAATGCTGGGG
CAGAAGCAGGCTGTTGTGTTTGCTTTTGAGAAATTCCGGCAATATTTGGTTGGATCCAAAGTCACGGTGTTCACGAATCATGCAGCAATAAGGACTAGAAGGGAT
CAGAGAATGTCATTGCAGATCATTTGTCTCGTCTTGATCCATCATCATCTTTGCTGGAGCAATCTGCCATTTTCGATTCTTTTCCAGATGAGCAGCTTTTTGCTG
TTAGGTAAAGGTGATGAAGCAACCGAAATCCTGGAGCAATGTCACTCTTTGCCGTATGGAGGCCATTTCAGCGGTCAAAGGACAACTATGAGGATTTTGCATTCC
AGATTTTTCTTGCCTACGTTATTCAAGGATTCCCATTGGTTCTGCAAGCGATTTGATTACGTGTCCAAATGGGTGGAGGCCATCGCATGCCATCATAGTGATGCC
AAGACAGTAGCAAGGTTTCTTCAATCGCACATCTTTGCGCGGTTTGGGACACCTAGGGCTCTAGTGAGTGATGAGGGTACACATTTTGTTAATAATATCTTAACT
AAGCTTTTAGCTAAGTATGGGATTAAGCATAGGATAGCTACCCCTTATCACCCACAAGCGAATGGTCAAGCTGAAATTAGTAATAGGGAAATTAAAGCTATTTTA
GAGAAAGTAGTCCATCCATCTAGAAAGGATTGGTCCTTTAGGTTGGATGAGGCTCTTTGGGCTTATAGGACAGCTTATAAGACTCCTCTAGGTATGTCTCCCTAT
AGGTTAGTATATGGGAAAGCTTGCCATTTACCATTAGAGCTTGAACATAAAACATTTTGGGCTTTGAAAAAGTTAAATTTTGATCTGAGTCGTGCAGGAGCAATA
AGAATGCTGCAACTTAATGAGTTAGAAGAATTTCGTCAATTTTCTTACGAAAATGCGAAAATGTATAAGGAAAAGACTAAGCTGTGGCATGACAAAAAAATTAAT
CTAAGGAGTTTGTCAATGTCAAAAAGTTTTGTTTATAATTCTAGATTAAAATTATTTCCTGGAAAATTAAAATCTAAATGGTCAAGACCGTTTATTGTGATTGAG
GTTTTCCCCAATGGAGCAATTACTTTGCAGGATGAAAAAGATGGGAGAGTGTTCAAGGATTTGAATCTGCTGGAGCCTTATTTATTCACAAACGCTGGTAAGCTT
CTCCCACTTCGTCTTCTTAATCCCATTTTTTGCCATCTGCATTCTTTTTTCTCTGTCAAAACCGTTCATACCTCCATGGCTAAGACTAGAGCTAGGAAAGAGAGG
GAGAGTGAAGAGGAGGAGGTACCGGTCACACCGGAAGTGCAAAAAGGGAAAACTAAGAAGAAAAGAACGCCAGAAGAGAAGGAGGCGAAGCGAAGACGGAGGCAG
CAGAGGGCTGCGGAACAAGAAGTCATCCAAGAAACAGAGGTTGTTCAGAATCTTGAGACAGAACCGATAGTTTCTGCTACGGTTCAGGAAGGGAATACTGAGAAG
GAACATGAGACAGAGGTCGAAAGACAAGTCGCAGGTATGCCTGAAAAAGAGAAAACACCAGAGCCGGTGCAAGAGGCTCATGTTGAAGTCATTATGCCTGAACTG
CCCAAGCGTCGCCGCATCAAGAGGATGGCGGGTCGCGTGAGGGTGATTCGAAACACTCCATCACCTCCGACGTCGGACTCTGAGGAAGAAAAAAGGGAAGCTGAG
AATAAGGAAAAAGAAGAAGAGGCAAGAAAGGCAGAAGAAGAGCGTTTGCGCGAGCAGAGAGAATGCAAGGGCAAAGGAATTGCGGAAGCATCGGGAGAAATTGAG
GAGCCGAGGGCACCATTCATTCGCTTCGTCAACGATCTTGCTCGAGCAAAATACCAGGAGGTGCTGAAACGAGACTTCTTGTTCGAACGAGGATTTGGCAGTGAT
TTGCCCAGGTTCTTGGAGTATGGAATAGTGAACCTCGGATGGAGGCAATTTTGTGCGAAACCAGAACCTGTCAATTCCAACACTGTTCGAGAGTTTTATGCCAAT
CTTGACGTTAAGAATGATTTTGAGGTTTTCGTTCGAGGAGTGCCTGTACAGTGGAGTCCTGAGGCCATTAATGAATTGTTCGATCTCCAGGATTTTCCGCATGCC
GTTTTTAATGAGATGGTGGTTGCACCATCTAGTGATCAACTGAGTGCGGCTGTCCGGGAGGTAGGCATTGAGGGGGCTCAATGGAGGGTGTCGCAGACGCGGAAG
CATACGTTTCAAGCTGCTTATTTGAAGAGTGAAGCCAACCTTGGATGGGTTTCATCAGGCTACGCTTGCTGTCAACAACACACGACTTCACAGTATCTCAGGACA
GGCAGGGCAGGAGTGCCCACGGTTCCAGAAGATATGATCATGACTGATAAGGGAATCATTGACACACCTAATCTGGCGCGGCTTCAACGTACGCAAGAGGCTCGA
CAGGGAGGGCTGGTGTATGGCGTTAATCAGATCCTAGAGCATCTGACAGTGTTGACCAGTAGGTTAGAATTTGCTGAAAGGCAAGCTCAGACCTATTGGACTTAT
GCTAAAAGGAGAGATGATGCGCTCAGGGGGCCTTGCAAACCAATTTCTCAAGACCATATCAGGCCTTTCCAGTGTTTCCCGATGATTTGTTTAATCTTTGGATAC
CACCCCCGCCTGTTGAACGAGAAGAGGATGTTAATGAAGAGCAGGGTCAGGAAGATTGATGGATGGAATGATCTTCAGCGAGCATTATTTTTTGTTCTTATCATT
TTGTTTGCTCGCGCATATCTGACCCCGAATCATTTTGCTGCAGCAGAGCTTGGTTTTGCAGAGTGCTCAGATTATGCTGCTGAGCGACTGAAGGGAGCAAATTCT
ATGCCGCAGCAAAACTGGGAGCAGAAACTGCCACATCACAGCTCGTTAGCCAACTTCATGAACCGACTTCAGTTGAGTTATTTTCGGGATAAAGAAGCAAGGAGA
GCCCTACACGTGTCCAAGTTGACCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTGCTTCCGCTGCGTGGGGACTTTTATTCCCTTCACTCTAGGCCTTTGGCCCTGGTCGGCCTCGACAGTGGGTCGAGGCCGACCTCATCATCCTACGGTTTTT
AAGAATTCGGAGGTGTCTCGGGACGAACCAGGTGAAACCGAGGCGGTCGGGGACATCAGGGACCGAAGGGAGGTGGCCAAACTCAGCCCGCGCCCGCGAGCCGAA
TGGTCGGCCTCGGCCTTATGTCGAGGTCGGCCTTGGGCCGAGGCCGACCATATGGGTCGGGCCAAATTGGCCCGACCCTTTGGTCTAGTCTTCCTCTGGGTCGGC
TTTCTGGTCCCACCTCTGCCCAATTGTCCTCATGCTTTTAAGAAGCAGGAATTGATCATTCGTACCCTTGGAAATGACTTACAAGAGATTTGCTTGCTTATAAAA
CCGAAGGATAGCCACACAACACCACTGATAACTGCCCAAAAGATTATGCTGCTGAGCGACTGGAGGGAGCAAATTCTATGCCACAGCAAAACTGGGAGCAGAAAC
TGCCACATCACAGCTCGTGTGATTTTGGTGTATGAGCTTGATCCAGAAATTGAAAGGACATTTAGGAACAGAAGGAGGGAGCAGCGCAGAAACCAGATGGAGAAC
GCCCCGCAACTTCCACAGGTTCATGAAGGTCCAGCAGCAGCAAACCCCCAGCAGAATCCGTTGCTGCAGCAAAACCCACTGTTTGAGCAAAATGAGCAGCGAAAT
AATCAGGCTGAGAATCCTATCTTGATAGCGAACGATAGGACCAGAGCCATTCGAGCGTATGATGTCTCGATGTTTAATGAGTTGAATCCAGGGATTGCACGTCCC
CAAATCCAAGCGGCGAATTTTGAAATGAAACCGGTAATGTTTCAGATGTTGGAAACCGTGGGGCAATTCCATGGTTTGTCATCTGAAGACCCTCATTTACATCTT
AAGTCTTTTCTAGGAGTTAGTGATTCTTTTGTAATTCAAGGAGTGCCTAGAGATGCTCTTAGATTAACATTGTTCCCGTATTCTCTTAGAGATGGAGCAAAGTCA
TGGTTAAACTCTTTTGCTCTAGGATCAATTAGGACGTGGGATGAGTTAGCTGAAAAGTTTTTGAGTAAATATTTCCCACCTAATAGAAATGTGAAATTAAGGAGT
GAAATAGTAGGGTTTAGGCAACTTGAAGATGAGACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTACGAAAGTGTCCCCACCATGGGTTACCTCATTGT
ATTCAAATGGAAACATTTTACAATGGTTTAAATGGAGTAACCCAAGGTATGGTCGATGCTTCGGCTGGAGGGGCCCTTTTGGCAAAAACTTTTGATAAAGCCTAT
GAAATTTTAGAAAGAATATCTATTAATAGTTGCCAGTGGTCGGATGTTAGAGGCACAAATAAAAAGGTTAAGAGTGTATTAGAGGTTGATGGTGTGTCCACCATT
AGGGCCGATCTTGCTATGATTGCTAACGCTCTTAAGAATGTGACAGTGATTAGTCATCAGCAGCCACCAGCTATGGAGCCTGCAACAGTAGTGAACCAAGTCACG
GACGAAGCATGTGTCTATTGCGGTGAAGACCACAACTACGAGTTTTGCCCCAGCAATCCAGCTTCTGTGTTTTTTGTAGTAACTTTTAGGAGTGGTAAGTCATTA
GAAGAAAGAATTGAGCCTAGTAAAACCCAGGAGGTAAATGGTGATAGAAATAATAATGTTGTCGAGGAAGAGTTGGAGTCTGGTCAGGGTGCTGGAGGCAGCAAA
AAAGATGCTGGAGCATCTGGTTTTGTGCCAGATGTAGAACCACCATATGTGCCGCCTCCACCTTATGTACCACCTCTACCTTTTCCACAAAGGCAAAAGCCTAAG
AATCAGGATGGTCAATTTAAAAAGTTTTTAGAGATTCTTAAGCAGTTGCATATAAATATCCCTTTAGTAGAAGCTATAGAGCAAATGCCTAATTATGCTAAATTT
CTTAAGGATATTTTGACTAAAAAGAAAAGGTTAGGTGAATTTGAAACGGTATCTCTTACTGAGGAGTGTAGTGCTATTCTTAAGAATGGGCTACCCCCCAAGGCT
AAGGATCCAGGGTCATTTACTAGACCTGTGTCTATAGGTGGAAAGGAATTAGGTAGAGCACTATGTGATTTAGGTGCAAGCATTAACCTTATGCCTCTTTCGGTC
TATCGAAAGTTAGGTATTGGTGAAGCTAGACCTACCACAGTTACACTCCAACTAGCTGATATGTCTATCACATATCCAGAGGGTAAGATTGAGGATGTCTTAGTA
AAAGTGGATAAATTCATATTTCCTGTTGATTTCATTATTTTAGATTATGAGGCTGATAGAGATGTCCCAATTATTTTAGGACGTCCATTTTTTGCTACTGGTAGG
GCATTAATAGATGTTCAAAAAGGAGAATTAACAATGAGAGTCTGTAATGAGGAAGTGAAATTTAATGTCTTTAAAGCCATGAAATATCCAGACGAAATGGAGGAT
TGCTCCTTCATCAGGATTCTGGAGAGCACAGTTATTGAGATAGCACTACAGGATTCAGCTGATAAGCATTCAGAAAAGCATGGAGAGGTTAGTGTAGAAGATTTT
GAATTTTGTTCTTTAGATAGAAAAAATGAAAAAGAATTGTTTAGGTGCGAGGATGTTTTTGAGTCTTTAGATTTAGATCAAAGAAAGGCTCCCCCAATTAAGCCA
TCCCTAATTGAGGCACCCACTTTAGATTTGAAACCTTTACCGGATCATCTAAAGTATGTGTATCTTGGGGAAAGTGAGACGTTGCCCATTATTGTTGCATCAGAT
TTAATGCCGGAGCATGAAGAGGCCTTAATAAAATTGCTGCAGCAATACCGCAAAGCTATAGATTGGACATTGGCTGATATTCAGGGAATTAGCCCATCTTTTTGT
ATGCACAAAATCACTCTAGAGGAGGGATCCTTTAGGATTATTGAGCAACAAAGAAGGCTTAACCCTGCAATGAAAGAGGTTGTTAAAAAGGAGGTAATTAAATGG
TTGGATGCTGGGATCATTTATCCAATTGCCGATAGCAATTGGGTAAGCCCTGTCCAATGTGTTCCTAAGAAAGGAGGTGTCACTGTGGTGAGCAATAAAAATAAT
GAGTTGATCCCCACAAGGACAGTAACTGGCTGGAGGGTTTGTATGGATTACAGAAGGCTTAATAAAGCCACCCGAAAGGATCATTTCCCTCTACCATTTATCGAC
CAGATGTTGGATCGATTGGCTGGTCAGGCCTATTACTGTTTCTTGGATGGTTATTATGGGTATAACCAGATTACTATTGCTCCTGAAGATCAGGAAAAAACCACT
TTCACCTGCCCTTATGGAACATTCGCTTTTAGGAGAATGCCTTTTGGCCTTTGCAATGCTCCAACAACATTTCAGCGGTGTATGTTAGCAATTTTTTCTGATATG
ATTGAGTCCACTGTTGAAGTCTTTATGGACGATTTTTCAGTTTTTGGAGGGTCTTTTCAGAGTTGTTTAGATAATTTAGATTGTAGGAAGGCTTTTGAAACATTG
AAGGCTACTTTAATCTCAGCACCCATTCTCTGCGCACCTAATTGGAATTTGCCATTCGAGGTAATGTGTGATGCGAGTGATGCTGCAGTAGGTGCAATGCTGGGG
CAGAAGCAGGCTGTTGTGTTTGCTTTTGAGAAATTCCGGCAATATTTGGTTGGATCCAAAGTCACGGTGTTCACGAATCATGCAGCAATAAGGACTAGAAGGGAT
CAGAGAATGTCATTGCAGATCATTTGTCTCGTCTTGATCCATCATCATCTTTGCTGGAGCAATCTGCCATTTTCGATTCTTTTCCAGATGAGCAGCTTTTTGCTG
TTAGGTAAAGGTGATGAAGCAACCGAAATCCTGGAGCAATGTCACTCTTTGCCGTATGGAGGCCATTTCAGCGGTCAAAGGACAACTATGAGGATTTTGCATTCC
AGATTTTTCTTGCCTACGTTATTCAAGGATTCCCATTGGTTCTGCAAGCGATTTGATTACGTGTCCAAATGGGTGGAGGCCATCGCATGCCATCATAGTGATGCC
AAGACAGTAGCAAGGTTTCTTCAATCGCACATCTTTGCGCGGTTTGGGACACCTAGGGCTCTAGTGAGTGATGAGGGTACACATTTTGTTAATAATATCTTAACT
AAGCTTTTAGCTAAGTATGGGATTAAGCATAGGATAGCTACCCCTTATCACCCACAAGCGAATGGTCAAGCTGAAATTAGTAATAGGGAAATTAAAGCTATTTTA
GAGAAAGTAGTCCATCCATCTAGAAAGGATTGGTCCTTTAGGTTGGATGAGGCTCTTTGGGCTTATAGGACAGCTTATAAGACTCCTCTAGGTATGTCTCCCTAT
AGGTTAGTATATGGGAAAGCTTGCCATTTACCATTAGAGCTTGAACATAAAACATTTTGGGCTTTGAAAAAGTTAAATTTTGATCTGAGTCGTGCAGGAGCAATA
AGAATGCTGCAACTTAATGAGTTAGAAGAATTTCGTCAATTTTCTTACGAAAATGCGAAAATGTATAAGGAAAAGACTAAGCTGTGGCATGACAAAAAAATTAAT
CTAAGGAGTTTGTCAATGTCAAAAAGTTTTGTTTATAATTCTAGATTAAAATTATTTCCTGGAAAATTAAAATCTAAATGGTCAAGACCGTTTATTGTGATTGAG
GTTTTCCCCAATGGAGCAATTACTTTGCAGGATGAAAAAGATGGGAGAGTGTTCAAGGATTTGAATCTGCTGGAGCCTTATTTATTCACAAACGCTGGTAAGCTT
CTCCCACTTCGTCTTCTTAATCCCATTTTTTGCCATCTGCATTCTTTTTTCTCTGTCAAAACCGTTCATACCTCCATGGCTAAGACTAGAGCTAGGAAAGAGAGG
GAGAGTGAAGAGGAGGAGGTACCGGTCACACCGGAAGTGCAAAAAGGGAAAACTAAGAAGAAAAGAACGCCAGAAGAGAAGGAGGCGAAGCGAAGACGGAGGCAG
CAGAGGGCTGCGGAACAAGAAGTCATCCAAGAAACAGAGGTTGTTCAGAATCTTGAGACAGAACCGATAGTTTCTGCTACGGTTCAGGAAGGGAATACTGAGAAG
GAACATGAGACAGAGGTCGAAAGACAAGTCGCAGGTATGCCTGAAAAAGAGAAAACACCAGAGCCGGTGCAAGAGGCTCATGTTGAAGTCATTATGCCTGAACTG
CCCAAGCGTCGCCGCATCAAGAGGATGGCGGGTCGCGTGAGGGTGATTCGAAACACTCCATCACCTCCGACGTCGGACTCTGAGGAAGAAAAAAGGGAAGCTGAG
AATAAGGAAAAAGAAGAAGAGGCAAGAAAGGCAGAAGAAGAGCGTTTGCGCGAGCAGAGAGAATGCAAGGGCAAAGGAATTGCGGAAGCATCGGGAGAAATTGAG
GAGCCGAGGGCACCATTCATTCGCTTCGTCAACGATCTTGCTCGAGCAAAATACCAGGAGGTGCTGAAACGAGACTTCTTGTTCGAACGAGGATTTGGCAGTGAT
TTGCCCAGGTTCTTGGAGTATGGAATAGTGAACCTCGGATGGAGGCAATTTTGTGCGAAACCAGAACCTGTCAATTCCAACACTGTTCGAGAGTTTTATGCCAAT
CTTGACGTTAAGAATGATTTTGAGGTTTTCGTTCGAGGAGTGCCTGTACAGTGGAGTCCTGAGGCCATTAATGAATTGTTCGATCTCCAGGATTTTCCGCATGCC
GTTTTTAATGAGATGGTGGTTGCACCATCTAGTGATCAACTGAGTGCGGCTGTCCGGGAGGTAGGCATTGAGGGGGCTCAATGGAGGGTGTCGCAGACGCGGAAG
CATACGTTTCAAGCTGCTTATTTGAAGAGTGAAGCCAACCTTGGATGGGTTTCATCAGGCTACGCTTGCTGTCAACAACACACGACTTCACAGTATCTCAGGACA
GGCAGGGCAGGAGTGCCCACGGTTCCAGAAGATATGATCATGACTGATAAGGGAATCATTGACACACCTAATCTGGCGCGGCTTCAACGTACGCAAGAGGCTCGA
CAGGGAGGGCTGGTGTATGGCGTTAATCAGATCCTAGAGCATCTGACAGTGTTGACCAGTAGGTTAGAATTTGCTGAAAGGCAAGCTCAGACCTATTGGACTTAT
GCTAAAAGGAGAGATGATGCGCTCAGGGGGCCTTGCAAACCAATTTCTCAAGACCATATCAGGCCTTTCCAGTGTTTCCCGATGATTTGTTTAATCTTTGGATAC
CACCCCCGCCTGTTGAACGAGAAGAGGATGTTAATGAAGAGCAGGGTCAGGAAGATTGATGGATGGAATGATCTTCAGCGAGCATTATTTTTTGTTCTTATCATT
TTGTTTGCTCGCGCATATCTGACCCCGAATCATTTTGCTGCAGCAGAGCTTGGTTTTGCAGAGTGCTCAGATTATGCTGCTGAGCGACTGAAGGGAGCAAATTCT
ATGCCGCAGCAAAACTGGGAGCAGAAACTGCCACATCACAGCTCGTTAGCCAACTTCATGAACCGACTTCAGTTGAGTTATTTTCGGGATAAAGAAGCAAGGAGA
GCCCTACACGTGTCCAAGTTGACCTAA
Protein sequenceShow/hide protein sequence
MCFRCVGTFIPFTLGLWPWSASTVGRGRPHHPTVFKNSEVSRDEPGETEAVGDIRDRREVAKLSPRPRAEWSASALCRGRPWAEADHMGRAKLARPFGLVFLWVG
FLVPPLPNCPHAFKKQELIIRTLGNDLQEICLLIKPKDSHTTPLITAQKIMLLSDWREQILCHSKTGSRNCHITARVILVYELDPEIERTFRNRRREQRRNQMEN
APQLPQVHEGPAAANPQQNPLLQQNPLFEQNEQRNNQAENPILIANDRTRAIRAYDVSMFNELNPGIARPQIQAANFEMKPVMFQMLETVGQFHGLSSEDPHLHL
KSFLGVSDSFVIQGVPRDALRLTLFPYSLRDGAKSWLNSFALGSIRTWDELAEKFLSKYFPPNRNVKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHC
IQMETFYNGLNGVTQGMVDASAGGALLAKTFDKAYEILERISINSCQWSDVRGTNKKVKSVLEVDGVSTIRADLAMIANALKNVTVISHQQPPAMEPATVVNQVT
DEACVYCGEDHNYEFCPSNPASVFFVVTFRSGKSLEERIEPSKTQEVNGDRNNNVVEEELESGQGAGGSKKDAGASGFVPDVEPPYVPPPPYVPPLPFPQRQKPK
NQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLKDILTKKKRLGEFETVSLTEECSAILKNGLPPKAKDPGSFTRPVSIGGKELGRALCDLGASINLMPLSV
YRKLGIGEARPTTVTLQLADMSITYPEGKIEDVLVKVDKFIFPVDFIILDYEADRDVPIILGRPFFATGRALIDVQKGELTMRVCNEEVKFNVFKAMKYPDEMED
CSFIRILESTVIEIALQDSADKHSEKHGEVSVEDFEFCSLDRKNEKELFRCEDVFESLDLDQRKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGESETLPIIVASD
LMPEHEEALIKLLQQYRKAIDWTLADIQGISPSFCMHKITLEEGSFRIIEQQRRLNPAMKEVVKKEVIKWLDAGIIYPIADSNWVSPVQCVPKKGGVTVVSNKNN
ELIPTRTVTGWRVCMDYRRLNKATRKDHFPLPFIDQMLDRLAGQAYYCFLDGYYGYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPTTFQRCMLAIFSDM
IESTVEVFMDDFSVFGGSFQSCLDNLDCRKAFETLKATLISAPILCAPNWNLPFEVMCDASDAAVGAMLGQKQAVVFAFEKFRQYLVGSKVTVFTNHAAIRTRRD
QRMSLQIICLVLIHHHLCWSNLPFSILFQMSSFLLLGKGDEATEILEQCHSLPYGGHFSGQRTTMRILHSRFFLPTLFKDSHWFCKRFDYVSKWVEAIACHHSDA
KTVARFLQSHIFARFGTPRALVSDEGTHFVNNILTKLLAKYGIKHRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTAYKTPLGMSPY
RLVYGKACHLPLELEHKTFWALKKLNFDLSRAGAIRMLQLNELEEFRQFSYENAKMYKEKTKLWHDKKINLRSLSMSKSFVYNSRLKLFPGKLKSKWSRPFIVIE
VFPNGAITLQDEKDGRVFKDLNLLEPYLFTNAGKLLPLRLLNPIFCHLHSFFSVKTVHTSMAKTRARKERESEEEEVPVTPEVQKGKTKKKRTPEEKEAKRRRRQ
QRAAEQEVIQETEVVQNLETEPIVSATVQEGNTEKEHETEVERQVAGMPEKEKTPEPVQEAHVEVIMPELPKRRRIKRMAGRVRVIRNTPSPPTSDSEEEKREAE
NKEKEEEARKAEEERLREQRECKGKGIAEASGEIEEPRAPFIRFVNDLARAKYQEVLKRDFLFERGFGSDLPRFLEYGIVNLGWRQFCAKPEPVNSNTVREFYAN
LDVKNDFEVFVRGVPVQWSPEAINELFDLQDFPHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANLGWVSSGYACCQQHTTSQYLRT
GRAGVPTVPEDMIMTDKGIIDTPNLARLQRTQEARQGGLVYGVNQILEHLTVLTSRLEFAERQAQTYWTYAKRRDDALRGPCKPISQDHIRPFQCFPMICLIFGY
HPRLLNEKRMLMKSRVRKIDGWNDLQRALFFVLIILFARAYLTPNHFAAAELGFAECSDYAAERLKGANSMPQQNWEQKLPHHSSLANFMNRLQLSYFRDKEARR
ALHVSKLT