; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc11G11210 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc11G11210
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionReverse transcriptase
Genome locationClcChr11:16611685..16617601
RNA-Seq ExpressionClc11G11210
SyntenyClc11G11210
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIM97577.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]0.0e+0059.03Show/hide
Query:  NNPYSKTYNPGWRNHPNFGWGGNQSQMQGTGQQPQRNNNPVYNQGVQGHQVPRNQSANSSSSSLEALMCQMITQNEETRKSLEAMLQNQMGEIKSHAITI
        NNPYS TYNPGWR HPNF W  N    QG G  P+           QG Q    Q       SLE  + Q +                      S A   
Subjt:  NNPYSKTYNPGWRNHPNFGWGGNQSQMQGTGQQPQRNNNPVYNQGVQGHQVPRNQSANSSSSSLEALMCQMITQNEETRKSLEAMLQNQMGEIKSHAITI

Query:  RNLELQVGQMASERNTRPQGALPSNTEA-PRGNGKEQCQAVTLRSEKVLPTEPQEQRMQGNLQPQITPLQTPATSIHEPQSSSKTTTSKEQENGKGREKA
        + ++ Q+GQ+A+  N+RPQG+LPSNTE  PR +GK QCQAVTLR+ +    E QE                   ++ EP  S +     E+   KG+E  
Subjt:  RNLELQVGQMASERNTRPQGALPSNTEA-PRGNGKEQCQAVTLRSEKVLPTEPQEQRMQGNLQPQITPLQTPATSIHEPQSSSKTTTSKEQENGKGREKA

Query:  TQANPTKFLDVLKQLHINIPLVEALEQMLSYVKFLKDILSKKRRLGKFETVALTRECSALFKSNIPAKMKDLGSFTLACSIGGMDVGQALYDLGASINLM
                   L++LHINIP  EALEQM SYVKF+KDILSKKR LG +ETVALT ECSA+ ++ +P K+KD GSFT+ C+IG    G+AL DLG    L 
Subjt:  TQANPTKFLDVLKQLHINIPLVEALEQMLSYVKFLKDILSKKRRLGKFETVALTRECSALFKSNIPAKMKDLGSFTLACSIGGMDVGQALYDLGASINLM

Query:  PLSIFKKLGIGEARPTTVTLQLADRSIKHPEGKIEDVLVKT---------------------------FLSYWMRLIDVQ--------------------
                  GEA+PT++TLQLADRS+ +P+G IED+LVK                            FL+    LIDVQ                    
Subjt:  PLSIFKKLGIGEARPTTVTLQLADRSIKHPEGKIEDVLVKT---------------------------FLSYWMRLIDVQ--------------------

Query:  ---------------EGELTIRVDDQQD-----------------LGNRTTEKT------KSSLEEPPTLELKTLPTHLKYAYLGEESTLPVIISSALTT
                       E  L   +D++ +                  G  + E+T      K S+EEPPTLELK LP HL YAYLGE  TLPVIISS+L+ 
Subjt:  ---------------EGELTIRVDDQQD-----------------LGNRTTEKT------KSSLEEPPTLELKTLPTHLKYAYLGEESTLPVIISSALTT

Query:  ECEKALLDILKNHIKAIGWTLADIRGISPSYCMHKIRLEEGKDGSVERQRRLNPAMKEVVKKEIIKWLDAGVIYPISDSSWVSPVQCVPKKGGMTVIMNS
           + LL +L+NH  AIGWT+ADI+GISPS+CMHKI LE+ +  SVE QRRLNP MKEVVKKEIIKWLDAG+IYPISDSSWVSPVQCVPKKGG+TV+ N 
Subjt:  ECEKALLDILKNHIKAIGWTLADIRGISPSYCMHKIRLEEGKDGSVERQRRLNPAMKEVVKKEIIKWLDAGVIYPISDSSWVSPVQCVPKKGGMTVIMNS

Query:  KNELIPTRTVTGWRICMDYRKLNAATKKDHFPLAFIDQMLDRLAGNEFFCFLDGYSCYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCM
         NELIPTRTVTGWR+CMDYRKLN AT+KDHFPL FIDQMLDRLAG EF+CFLDGYS YNQI IAPEDQEK TFTCPYGTFAFRRMPFGLCNAP TFQRCM
Subjt:  KNELIPTRTVTGWRICMDYRKLNAATKKDHFPLAFIDQMLDRLAGNEFFCFLDGYSCYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCM

Query:  MAIFSDYLEQSVEVFIDDFSVFGKSYDECLTNLERVLKRCEDTNLVLNWEKCHFMVTKGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHA
        MAIF+D +E  +EVF+DDFSV+G S+DECL NL  VLKRCEDTNL+LNWEKCHFMV +GIVLGHK+S  G+EVD+AK++ I KLP PT+VK +RSFLGHA
Subjt:  MAIFSDYLEQSVEVFIDDFSVFGKSYDECLTNLERVLKRCEDTNLVLNWEKCHFMVTKGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHA

Query:  GFYRRFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFKSLRQALISAPILVAPDWSLPFELMCDASNHAVGAVQGQRKEKIMHPIYYASKTLNSSQENYT
        GFYRRFIK FS+++KPL  LLE +  FNFD  C +AF  L+  LISAPI+  PDWS PFELMCDAS+ AVGAV GQRK+KI   IYYASKTLN +Q NYT
Subjt:  GFYRRFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFKSLRQALISAPILVAPDWSLPFELMCDASNHAVGAVQGQRKEKIMHPIYYASKTLNSSQENYT

Query:  TTEKEMLAIVFAVDKFRAYLIGSKVTIYSDHFAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVTDHLSRLENKEVQESWSDIEERFPDEHVM
        TTEKE+LA+VFA DKFR+YL+G+KV +Y+DH AI+YL+ KKDAKPRLIRWVLLLQEFDLEI+DRKGTENQ+ DHLSRLE+    +  + I + FPDE ++
Subjt:  TTEKEMLAIVFAVDKFRAYLIGSKVTIYSDHFAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVTDHLSRLENKEVQESWSDIEERFPDEHVM

Query:  N-AESQEPWYADIVNYLVCNQLPEEFNAQQKKKLRHESKFYCWDEPYLYRLGPDHILRRCVPEYEMHSILRSCHEARYGGHFEGQRTATKVLQSGYFWPA
            S  PWYADIVNYL C  +P + +AQQKKK   +++ Y WD+P+L++ GPD+ILRRCVPE EM+ IL  CH + YGGHF G RTA K+LQSG+FWP 
Subjt:  N-AESQEPWYADIVNYLVCNQLPEEFNAQQKKKLRHESKFYCWDEPYLYRLGPDHILRRCVPEYEMHSILRSCHEARYGGHFEGQRTATKVLQSGYFWPA

Query:  LFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGIDFMGPFLPSCSNQYILVAVDYVSKWVEAAACARNDANVVSKFLKKQIFSQFGTPRAI
        LFKDA ++   CDRCQRTGNIS R+EMPLN++LEVELFDVWGIDFMGPF+PS  N YILVAVDYVSKWVEAAA   ND+ VV  F+KK IF++FGTPRAI
Subjt:  LFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGIDFMGPFLPSCSNQYILVAVDYVSKWVEAAACARNDANVVSKFLKKQIFSQFGTPRAI

Query:  ISDEGTHFINRIITNLLTKFNVSHRVAIAYHPQTNDQAEITNSEIKSILEKVVSTSRKDWTERLDEALWAYRTAFKTPIGMSPYALVFGKACHLPLELEH
        ISD GTHF NR    LL+K+ V H+++  YHPQT+ Q E++N EIK ILEK VS++RKDW++RLDEALWAYRTA+KTPIGMSPY LVFGKACHLP+ELEH
Subjt:  ISDEGTHFINRIITNLLTKFNVSHRVAIAYHPQTNDQAEITNSEIKSILEKVVSTSRKDWTERLDEALWAYRTAFKTPIGMSPYALVFGKACHLPLELEH

Query:  KAIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKQYKERTKKWHDKNISKKILYVGQKVLLFNSRLRLFPGKLKSRWSGPFIIKEVFPHGAIMLT
         A WA++KLN D +A+GE R LQLNEL E+R  AYENAK YKE+ K+WH+K I ++    GQ VLLFNSRL+LFPGKLKSRWSGPF I EVFPHGA+ L 
Subjt:  KAIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKQYKERTKKWHDKNISKKILYVGQKVLLFNSRLRLFPGKLKSRWSGPFIIKEVFPHGAIMLT

Query:  NENGTTSFKVNGQRVKPY
        N+N    FKVN QR+K Y
Subjt:  NENGTTSFKVNGQRVKPY

PIN00904.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]0.0e+0055.67Show/hide
Query:  VGMIETDATSNLSAHIANMTSLLQTKALNNQGGTVA----------------SMDAMNTVN--ATASNNPYSKTYNPGWRNHPNFGWGGNQSQMQGTGQQ
        VG+IE D  + L+A I  +   ++   +N    T                  S++++  V+      NNPYS TYNPGWR HPNF W  N    QG G  
Subjt:  VGMIETDATSNLSAHIANMTSLLQTKALNNQGGTVA----------------SMDAMNTVN--ATASNNPYSKTYNPGWRNHPNFGWGGNQSQMQGTGQQ

Query:  PQRNNNPVYNQGVQGHQVPRNQSANSSSSSLEALMCQMITQNEETRKSLEAMLQNQMGEIKSHAITIRNLELQVGQMASERNTRPQGALPSNTEAPRGNG
        P+        +G Q  Q P                       +E + SLE  L   M    S A   + +E Q+GQ+A+  N+RPQG+LPSNTE      
Subjt:  PQRNNNPVYNQGVQGHQVPRNQSANSSSSSLEALMCQMITQNEETRKSLEAMLQNQMGEIKSHAITIRNLELQVGQMASERNTRPQGALPSNTEAPRGNG

Query:  KEQCQAVTLRSEKVLPTEPQEQRMQGNLQPQITPLQTPATSIHEPQSSSKTTTSKEQENGKGREKATQANPTKFLDVLKQLHINIPLVEALEQMLSYVKF
            Q VTLR+ +    E QE                    + EP      T SKE+E     ++     P      L++LHINIP  EALEQM SYVKF
Subjt:  KEQCQAVTLRSEKVLPTEPQEQRMQGNLQPQITPLQTPATSIHEPQSSSKTTTSKEQENGKGREKATQANPTKFLDVLKQLHINIPLVEALEQMLSYVKF

Query:  LKDILSKKRRLGKFETVALTRECSALFKSNIPAKMKDLGSFTLACSIGGMDVGQALYDLGASINLMPLSIFKKLGIGEARPTTVTLQLADRSIKHPEGKI
        +KDILSKKRRLG +E V LT ECS + ++ +P K+K+ GSFT+ C+IG    G+AL DLGASINLMP SI++ LG+GEA+PT++TLQLADRS+ +P+G I
Subjt:  LKDILSKKRRLGKFETVALTRECSALFKSNIPAKMKDLGSFTLACSIGGMDVGQALYDLGASINLMPLSIFKKLGIGEARPTTVTLQLADRSIKHPEGKI

Query:  EDVLVKT---------------------------FLSYWMRLIDVQ-----------------------------------EGELTIRVDDQQD------
        +D+LVK                            FL+    LIDVQ                                   E  L   +D++ +      
Subjt:  EDVLVKT---------------------------FLSYWMRLIDVQ-----------------------------------EGELTIRVDDQQD------

Query:  -----------LGNRTTEKT------KSSLEEPPTLELKTLPTHLKYAYLGEESTLPVIISSALTTECEKALLDILKNHIKAIGWTLADIRGISPSYCMH
                    G  + E+T      K S+EEPPTLELK LP+HL YAYLGE  TLPVIISS+L+    + LL +LKNH   IGWT+ADI+GISPS+CMH
Subjt:  -----------LGNRTTEKT------KSSLEEPPTLELKTLPTHLKYAYLGEESTLPVIISSALTTECEKALLDILKNHIKAIGWTLADIRGISPSYCMH

Query:  KIRLEEGKDGSVERQRRLNPAMKEVVKKEIIKWLDAGVIYPISDSSWVSPVQCVPKKGGMTVIMNSKNELIPTRTVTGWRICMDYRKLNAATKKDHFPLA
        KI LE+ +  S+E QRRLNP MKEVVKKEIIKWLDAG+IYPISDSSWVSPVQCVPKKGG+TV+ N  NELIPTRTVTGWR+CMDYRKLN AT+KDHFPL 
Subjt:  KIRLEEGKDGSVERQRRLNPAMKEVVKKEIIKWLDAGVIYPISDSSWVSPVQCVPKKGGMTVIMNSKNELIPTRTVTGWRICMDYRKLNAATKKDHFPLA

Query:  FIDQMLDRLAGNEFFCFLDGYSCYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFIDDFSVFGKSYDECLTNLE
        FIDQMLDRLAG EF+CFLDGYS YNQI IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAP TFQRCMMAIF+D +E  +EVF+D+FSV+G S+DECL NL 
Subjt:  FIDQMLDRLAGNEFFCFLDGYSCYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFIDDFSVFGKSYDECLTNLE

Query:  RVLKRCEDTNLVLNWEKCHFMVTKGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELLEVNREFNFDGKCL
         VLKRCEDTNLVLNWEKCHFMV +GIVLGHK+S  G+EVD+AK++ I KLP PT+VK +RSFLGHAGFYRRFIK FS+++KPL  LLE +  FNF+  C 
Subjt:  RVLKRCEDTNLVLNWEKCHFMVTKGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELLEVNREFNFDGKCL

Query:  NAFKSLRQALISAPILVAPDWSLPFELMCDASNHAVGAVQGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSKVTIYSDHFAI
        +AF  L+  LISAPI+  PD        CD    AVGAV GQRK+KI   IYYASKTLN +Q NYTTTEKE+LA+VFA DKFR+YL+ +KV +Y+DH AI
Subjt:  NAFKSLRQALISAPILVAPDWSLPFELMCDASNHAVGAVQGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSKVTIYSDHFAI

Query:  KYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVTDHLSRLENKEVQESWSDIEERFPDEHVM-NAESQEPWYADIVNYLVCNQLPEEFNAQQKKKL
        +YL+ KKDA P LI WVLLLQEFDLEI+DRKGTENQ+ DHLSRLE+    +  + I + FPDE ++    S  PWYADIVNYL C  +P + + QQKKK+
Subjt:  KYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVTDHLSRLENKEVQESWSDIEERFPDEHVM-NAESQEPWYADIVNYLVCNQLPEEFNAQQKKKL

Query:  RHESKFYCWDEPYLYRLGPDHILRRCVPEYEMHSILRSCHEARYGGHFEGQRTATKVLQSGYFWPALFKDARAYAVACDRCQRTGNISNRNEMPLNSMLE
          +++ Y W++P+L + GPD+ILRRCVPE EM+ IL  CH + YGGHF G RTA K+LQSG+FWP LFKDA ++   CDRCQRT NIS R+EMPLN++LE
Subjt:  RHESKFYCWDEPYLYRLGPDHILRRCVPEYEMHSILRSCHEARYGGHFEGQRTATKVLQSGYFWPALFKDARAYAVACDRCQRTGNISNRNEMPLNSMLE

Query:  VELFDVWGIDFMGPFLPSCSNQYILVAVDYVSKWVEAAACARNDANVVSKFLKKQIFSQFGTPRAIISDEGTHFINRIITNLLTKFNVSHRVAIAYHPQT
        VELFDVWGIDFMGPF+PS  N YILVAVDYVSKWVEAAA   ND+ VV  F+KK IF++FGTPRAIISD  T+F NR    LL+K+ V H++   YHPQT
Subjt:  VELFDVWGIDFMGPFLPSCSNQYILVAVDYVSKWVEAAACARNDANVVSKFLKKQIFSQFGTPRAIISDEGTHFINRIITNLLTKFNVSHRVAIAYHPQT

Query:  NDQAEITNSEIKSILEKVVSTSRKDWTERLDEALWAYRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQLNELLEWRHSA
        +   E++N EIK ILEK VS++RKDW++RLDEALWAYRTA+KTPIGMSPY L+FGKACHLP+ELEH A WA+ KLN D +A+GE R LQLNEL E+R  A
Subjt:  NDQAEITNSEIKSILEKVVSTSRKDWTERLDEALWAYRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQLNELLEWRHSA

Query:  YENAKQYKERTKKWHDKNISKKILYVGQKVLLFNSRLRLFPGKLKSRWSGPFIIKEVFPHGAIMLTNENGTTSFKVNGQRVKPYHIGEFEINKTSIDLRE
        YENAK YKE+TK+WHDK I ++    GQ VLLFNSRL+LFPGKLKSRW G F I EVFPHGA+ L NEN    FK+N +R+K Y  G  +   TSI L +
Subjt:  YENAKQYKERTKKWHDKNISKKILYVGQKVLLFNSRLRLFPGKLKSRWSGPFIIKEVFPHGAIMLTNENGTTSFKVNGQRVKPYHIGEFEINKTSIDLRE

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]0.0e+0052.23Show/hide
Query:  MDKTYTQAKEILDRISRNTDEWVDDGYGSRYMDKKRSQVGMIETDATSNLSAHIANMT---SLLQTKALNNQGGTVAS--------------MDAMNTVN
        M KT   A  +L+ ++ N  +W  +    R M KK    G+ E +  + LSA +A+++   S L T+ +      VA+              +  +N  N
Subjt:  MDKTYTQAKEILDRISRNTDEWVDDGYGSRYMDKKRSQVGMIETDATSNLSAHIANMT---SLLQTKALNNQGGTVAS--------------MDAMNTVN

Query:  ATASNNPYSKTYNPGWRNHPNFGWGGNQSQMQGTGQQPQRNNNPVYNQGVQGHQVPRNQSANSSSSSLEALMCQMITQNEETRKSLEAMLQNQMGEIKSH
             NP    Y+PG RNH NF +G  ++ +Q     P  ++ P                 +    SLE  M   + + + T K  ++ L N      + 
Subjt:  ATASNNPYSKTYNPGWRNHPNFGWGGNQSQMQGTGQQPQRNNNPVYNQGVQGHQVPRNQSANSSSSSLEALMCQMITQNEETRKSLEAMLQNQMGEIKSH

Query:  AITIRNLELQVGQMASERNTRPQGALPSNTEAPRGNGKEQCQAVTLRSEKVL-----------PTEPQEQRMQGNLQPQITPLQT-----PATSIHEPQS
          T++NLE+Q+GQ+A+  N + +G  PSNTE    N KEQC+A+TLRS + +           PT P   + +  ++ +     T        SI  P +
Subjt:  AITIRNLELQVGQMASERNTRPQGALPSNTEAPRGNGKEQCQAVTLRSEKVL-----------PTEPQEQRMQGNLQPQITPLQT-----PATSIHEPQS

Query:  SSKTTTSKEQENGKGREKATQANPTKFLDVLKQLHINIPLVEALEQMLSYVKFLKDILSKKRRLGKFETVALTRECSALFKSNIPAKMKDLGSFTLACSI
            +T         ++K  +   +KFLD+ K++HINIP  +ALEQM +Y KFLKDI+SKKRRL +FETV L+ ECSA+ +  +P K+KD GSFTL C+I
Subjt:  SSKTTTSKEQENGKGREKATQANPTKFLDVLKQLHINIPLVEALEQMLSYVKFLKDILSKKRRLGKFETVALTRECSALFKSNIPAKMKDLGSFTLACSI

Query:  GGMDVGQALYDLGASINLMPLSIFKKLGIGEARPTTVTLQLADRSIKHPEGKIEDVLVKT---------------------------FLSYWMRLIDVQE
        G     + L DLGASINLMPLS+++KLG+GE + TT++LQLADRSIK+P G IEDVLVK                            FL+    L+DVQ+
Subjt:  GGMDVGQALYDLGASINLMPLSIFKKLGIGEARPTTVTLQLADRSIKHPEGKIEDVLVKT---------------------------FLSYWMRLIDVQE

Query:  GELTIRVDDQQ----------------------------------------------------DLGNRT-------------------------------
        GELT+RV+ ++                                                    D  N                                 
Subjt:  GELTIRVDDQQ----------------------------------------------------DLGNRT-------------------------------

Query:  --------TEKTKS-----SLEEPPTLELKTLPTHLKYAYLGEESTLPVIISSALTTECEKALLDILKNHIKAIGWTLADIRGISPSYCMHKIRLEEGKD
                  KT S      +++  T ELK LP HL+YA+LG+  T PVI++++LT E E+ LL +L+ H  A+GWT++DI+GISPS CMHKI +EE   
Subjt:  --------TEKTKS-----SLEEPPTLELKTLPTHLKYAYLGEESTLPVIISSALTTECEKALLDILKNHIKAIGWTLADIRGISPSYCMHKIRLEEGKD

Query:  GSVERQRRLNPAMKEVVKKEIIKWLDAGVIYPISDSSWVSPVQCVPKKGGMTVIMNSKNELIPTRTVTGWRICMDYRKLNAATKKDHFPLAFIDQMLDRL
         S+E QRRLNPAMKEVV+ EI+K L+AG+IY ISDSSWVSPVQ VPKKGGMTV+ N  NE IPTRTVTGWR+CMDYRKLN AT+KDHFPL FIDQMLDRL
Subjt:  GSVERQRRLNPAMKEVVKKEIIKWLDAGVIYPISDSSWVSPVQCVPKKGGMTVIMNSKNELIPTRTVTGWRICMDYRKLNAATKKDHFPLAFIDQMLDRL

Query:  AGNEFFCFLDGYSCYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFIDDFSVFGKSYDECLTNLERVLKRCEDT
        AG  ++CFLDGYS YNQI IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAP TFQRCMMAIFSD +E  +E+F+DDFSVFG S+D CL NL  VL+RCED 
Subjt:  AGNEFFCFLDGYSCYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFIDDFSVFGKSYDECLTNLERVLKRCEDT

Query:  NLVLNWEKCHFMVTKGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFKSLRQA
        NLVLNWEKCHFMV +GIVLGH++S  G+EVD+AKI  I KLP P NVK +RSFLGHAGFYRRFIK FS+++KPL  LLE N  F+FD  CL AF ++++ 
Subjt:  NLVLNWEKCHFMVTKGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFKSLRQA

Query:  LISAPILVAPDWSLPFELMCDASNHAVGAVQGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSKVTIYSDHFAIKYLMAKKDA
        LISAP++  PDWS PFE+MCDAS+ A+GAV GQR++K+   IYYAS+TLN +Q NYTTTEKEMLA+VFA DKFR+YLI +KV +++DH A++YL +KKDA
Subjt:  LISAPILVAPDWSLPFELMCDASNHAVGAVQGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSKVTIYSDHFAIKYLMAKKDA

Query:  KPRLIRWVLLLQEFDLEIKDRKGTENQVTDHLSRLENKEVQESWSDIEERFPDEHVMNAESQEPWYADIVNYLVCNQLPEEFNAQQKKKLRHESKFYCWD
        KPRLIRW+LLLQEFDLE++D+KG+EN V DHLSRLE +EV+     I+E FPDE +   E + PWYADIVN+L C  LP +    Q+KK  H+ K+Y WD
Subjt:  KPRLIRWVLLLQEFDLEIKDRKGTENQVTDHLSRLENKEVQESWSDIEERFPDEHVMNAESQEPWYADIVNYLVCNQLPEEFNAQQKKKLRHESKFYCWD

Query:  EPYLYRLGPDHILRRCVPEYEMHSILRSCHEARYGGHFEGQRTATKVLQSGYFWPALFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGID
        EP L++  PD I+RRCVPE EM +IL  CH + YGGHF   RTA KVLQSG+FWP++F+D+      CDRCQR GNIS R E+PL ++LEVELFDVWGID
Subjt:  EPYLYRLGPDHILRRCVPEYEMHSILRSCHEARYGGHFEGQRTATKVLQSGYFWPALFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGID

Query:  FMGPFLPSCSNQYILVAVDYVSKWVEAAACARNDANVVSKFLKKQIFSQFGTPRAIISDEGTHFINRIITNLLTKFNVSHRVAIAYHPQTNDQAEITNSE
        FMGPF PS    YIL+AVDYVSKWVEA A   NDA VV KFL K IF++FGTPRAIISDEGTHF N++  NLL+K+ V H++A+AYHPQTN QAEI+N E
Subjt:  FMGPFLPSCSNQYILVAVDYVSKWVEAAACARNDANVVSKFLKKQIFSQFGTPRAIISDEGTHFINRIITNLLTKFNVSHRVAIAYHPQTNDQAEITNSE

Query:  IKSILEKVVSTSRKDWTERLDEALWAYRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKQYKER
        IK+ILEK V+T+RKDW ++LD+ALWAYRTAFKTPIGMSPY LVFGKACHLP+ELEHKA WA+KK NLD +A+GE R LQLNE+ E+R+ AYENAK YKER
Subjt:  IKSILEKVVSTSRKDWTERLDEALWAYRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKQYKER

Query:  TKKWHDKNISKKILYVGQKVLLFNSRLRLFPGKLKSRWSGPFIIKEVFPHGAIMLTNENGTTSFKVNGQRVKPYHIGEFEINKTSIDLREP
        TKKWHDK I ++    GQ+VLLFNSRL+LFPGKL+SRW+GP+ I +V   GAI L ++ G   F+VNGQR+K Y+  + E N   I L +P
Subjt:  TKKWHDKNISKKILYVGQKVLLFNSRLRLFPGKLKSRWSGPFIIKEVFPHGAIMLTNENGTTSFKVNGQRVKPYHIGEFEINKTSIDLREP

XP_031379021.1 uncharacterized protein LOC116194359 [Punica granatum]0.0e+0052.81Show/hide
Query:  MDKTYTQAKEILDRISRNTDEWVDDGYGSRY-----MD---KKRSQVGMIETDATSNLSAHIANMTSLLQTKALNNQGGTVASM----------DAMNTV
        M K Y +A  +++ ++ +   W ++   SR      MD      +Q+  + T  +   SAH  N   +   +  +    T+  M          + +N V
Subjt:  MDKTYTQAKEILDRISRNTDEWVDDGYGSRY-----MD---KKRSQVGMIETDATSNLSAHIANMTSLLQTKALNNQGGTVASM----------DAMNTV

Query:  NATASNN--PYSKTYNPGWRNHPNFGWGGNQSQMQGTGQQPQRNNNPVYN--QGVQGHQVPRNQSANSSSSSLEALMCQMITQNEETRKSLEAMLQNQMG
        N    +N  PYS TYNPGWRNHPNF W               RN N       G Q     +N     S S +E LM   +       +  + MLQNQ  
Subjt:  NATASNN--PYSKTYNPGWRNHPNFGWGGNQSQMQGTGQQPQRNNNPVYN--QGVQGHQVPRNQSANSSSSSLEALMCQMITQNEETRKSLEAMLQNQMG

Query:  EIKSHAITIRNLELQVGQMASERNTRPQGALPSNTEA-PRGNGKEQCQAVTLRSEKVLPTEPQEQRMQGNLQPQITPLQTPATSIHEPQSSS---KTTTS
               TIRNLE Q+ Q++ + + RP G+LPSNTE  P+G       A+ LRS K L    ++ + Q     +    Q     + EP+  S   K    
Subjt:  EIKSHAITIRNLELQVGQMASERNTRPQGALPSNTEA-PRGNGKEQCQAVTLRSEKVLPTEPQEQRMQGNLQPQITPLQTPATSIHEPQSSS---KTTTS

Query:  KEQENGKGREKATQANPTKFLDVLKQLHINIPLVEALEQMLSYVKFLKDILSKKRRLGKFETVALTRECSALFKS---NIPAKMKDLGSFTLACSIGGMD
             G+ +++   A   KFLDV K+L INIP  EAL+QM SY +F+KD+L+KKR+    E V LT ECS + +    N+P K +D GSFT+ C+IG   
Subjt:  KEQENGKGREKATQANPTKFLDVLKQLHINIPLVEALEQMLSYVKFLKDILSKKRRLGKFETVALTRECSALFKS---NIPAKMKDLGSFTLACSIGGMD

Query:  VGQALYDLGASINLMPLSIFKKLGIGEARPTTVTLQLADRSIKHPEGKIEDVLVKT---------------------------FLSYWMRLIDVQEGELT
            L D GASINLMPLSIF+KLG+GE + T +TLQLADRSIK+P+G +E+VLVK                            FL+    LIDV++G+LT
Subjt:  VGQALYDLGASINLMPLSIFKKLGIGEARPTTVTLQLADRSIKHPEGKIEDVLVKT---------------------------FLSYWMRLIDVQEGELT

Query:  IRV---------------------------------------------------------DDQ-----------------QDLGNRTTEKTKSSLEEPPT
        +RV                                                         DD+                 ++LG   T K  SSL + P 
Subjt:  IRV---------------------------------------------------------DDQ-----------------QDLGNRTTEKTKSSLEEPPT

Query:  LELKTLPTHLKYAYLGEESTLPVIISSALTTECEKALLDILKNHIKAIGWTLADIRGISPSYCMHKIRLEEGKDGSVERQRRLNPAMKEVVKKEIIKWLD
        LELK LP+HLKYAYLG + TLP+IISS+LT + E+ LL +L+ H +AIGWT+ADI+GISP  C H+I LE      V+ QRRLNP +KEVVKKE++K LD
Subjt:  LELKTLPTHLKYAYLGEESTLPVIISSALTTECEKALLDILKNHIKAIGWTLADIRGISPSYCMHKIRLEEGKDGSVERQRRLNPAMKEVVKKEIIKWLD

Query:  AGVIYPISDSSWVSPVQCVPKKGGMTVIMNSKNELIPTRTVTGWRICMDYRKLNAATKKDHFPLAFIDQMLDRLAGNEFFCFLDGYSCYNQIMIAPEDQE
        AG+IYPISDS WVSPVQ VPKKGGMTV+ N  N+LIPTRTVTGWR+C+DYRKLN AT+KDHFPL FIDQML++LAG++++CFLDGYS YNQI IAPEDQE
Subjt:  AGVIYPISDSSWVSPVQCVPKKGGMTVIMNSKNELIPTRTVTGWRICMDYRKLNAATKKDHFPLAFIDQMLDRLAGNEFFCFLDGYSCYNQIMIAPEDQE

Query:  KTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFIDDFSVFGKSYDECLTNLERVLKRCEDTNLVLNWEKCHFMVTKGIVLGHKISKV
        KTTFTCPYGTFAFRRMPFGLCNAP TFQRCMM+IFSD LE  +E+F+DDFSVFGKS++ CLTNL  VLKRC++TNL+LNWEKCHFMV +GIVLGHK+SK 
Subjt:  KTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFIDDFSVFGKSYDECLTNLERVLKRCEDTNLVLNWEKCHFMVTKGIVLGHKISKV

Query:  GLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFKSLRQALISAPILVAPDWSLPFELMCDASNHA
        G+EVD+AK++ I KLP PT+ K +RSFLGHAGFYRRFIK FS++++PL  LLE +  F F+  CL AF  L++ L SAP++VAP+W LPFELMCDAS++A
Subjt:  GLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFKSLRQALISAPILVAPDWSLPFELMCDASNHA

Query:  VGAVQGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSKVTIYSDHFAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTEN
        VGAV GQR+ K+ H IYYAS+TLN +Q+NY TTEKE+LA++FA DKFR YLIGSK+ +Y+DH A+KYL AK DAKPRLIRW+LLLQEFDLEI+D KGTEN
Subjt:  VGAVQGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSKVTIYSDHFAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTEN

Query:  QVTDHLSRLENKEVQESWSDIEERFPDEHVMNAESQE-PWYADIVNYLVCNQLPEEFNAQQKKKLRHESKFYCWDEPYLYRLGPDHILRRCVPEYEMHSI
         V DHLSRLE+  +    S I E+FPDE +  AE Q  PWYADIVNY+V N  P   ++QQKKK  H+ K+Y WDEPYL++   D ++RRCVPE E  SI
Subjt:  QVTDHLSRLENKEVQESWSDIEERFPDEHVMNAESQE-PWYADIVNYLVCNQLPEEFNAQQKKKLRHESKFYCWDEPYLYRLGPDHILRRCVPEYEMHSI

Query:  LRSCHEARYGGHFEGQRTATKVLQSGYFWPALFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGIDFMGPFLPSCSNQYILVAVDYVSKWV
        ++ CH    GGHF  +RTATK+L  G++WP +F D R Y ++C  CQRTGNIS R+E+P NS+L +ELFDVWGIDFMGPF  S SN+YILVAVDYVSKWV
Subjt:  LRSCHEARYGGHFEGQRTATKVLQSGYFWPALFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGIDFMGPFLPSCSNQYILVAVDYVSKWV

Query:  EAAACARNDANVVSKFLKKQIFSQFGTPRAIISDEGTHFINRIITNLLTKFNVSHRVAIAYHPQTNDQAEITNSEIKSILEKVVSTSRKDWTERLDEALW
        EA A   NDA VV +FLKK IFS+FG PRAIISD G+HF NR    LL+K+ V+H++A  YHPQT  Q E++N EIK ILEK V+ SRKDW+ +LD+ALW
Subjt:  EAAACARNDANVVSKFLKKQIFSQFGTPRAIISDEGTHFINRIITNLLTKFNVSHRVAIAYHPQTNDQAEITNSEIKSILEKVVSTSRKDWTERLDEALW

Query:  AYRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKQYKERTKKWHDKNISKKILYVGQKVLLFNS
        AYRTAFKTPIGMSPY +V+GK+CHLP+ELEHKA WA+K LN D +A+GE R LQLN++ E R  AYENA+ YKER K+WHD+NI K+    GQKVLL+NS
Subjt:  AYRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKQYKERTKKWHDKNISKKILYVGQKVLLFNS

Query:  RLRLFPGKLKSRWSGPFIIKEVFPHGAIMLTNENGTTSFKVNGQRVKPYHIGE-FEINKTSIDL
        RL+LFPGKLKSRWSGPF+I  VFP+GA+ L +E+  T FKVNG  +K Y  GE  + + T++DL
Subjt:  RLRLFPGKLKSRWSGPFIIKEVFPHGAIMLTNENGTTSFKVNGQRVKPYHIGE-FEINKTSIDL

XP_038976300.1 uncharacterized protein LOC120107204 [Phoenix dactylifera]0.0e+0052.05Show/hide
Query:  MDKTYTQAKEILDRISRNTDEWVDDGYGSRYMDKKRSQVGMIETDATSNLSAHIANMTSLLQTKALNNQ-----------GGTVASMDAM------NTVN
        M K+  +A E+L+ ++ N  +W ++    R M KK    GM + D  + L+A + ++  +       N            GG   S D M      N   
Subjt:  MDKTYTQAKEILDRISRNTDEWVDDGYGSRYMDKKRSQVGMIETDATSNLSAHIANMTSLLQTKALNNQ-----------GGTVASMDAM------NTVN

Query:  ATASNNPYSKTYNPGWRNHPNFGW--GGNQSQMQGTGQQPQRNNNPVYNQGVQGHQVPRNQSANSSSSSLEALMCQMITQNEETRKSLEAMLQNQMGEIK
            NNPYS TYNPGWRNHPNF W   GNQ         P     P   +  Q  ++   + AN+SS   E L                        ++ 
Subjt:  ATASNNPYSKTYNPGWRNHPNFGW--GGNQSQMQGTGQQPQRNNNPVYNQGVQGHQVPRNQSANSSSSSLEALMCQMITQNEETRKSLEAMLQNQMGEIK

Query:  SHAITIRNLELQVGQMASERNTRPQGALPSNTEAPRGNGKEQCQAVTLRS-------------------EKVLPTEPQEQRMQGNLQPQITPLQTPATSI
          A + RN+E+Q+GQ+A+  N+R QG LPS TE    N KE C+AVTLRS                   E+V     +E          + P++     I
Subjt:  SHAITIRNLELQVGQMASERNTRPQGALPSNTEAPRGNGKEQCQAVTLRS-------------------EKVLPTEPQEQRMQGNLQPQITPLQTPATSI

Query:  HEPQSSSKTTTSKEQENGKGREKATQANPTKFLDVLKQLHINIPLVEALEQMLSYVKFLKDILSKKRRLGKFETVALTRECSALFKSNIPAKMKDLGSFT
          PQ   +    ++ E              KFL V +QLHINIP  +AL Q+ +Y KFLK+I+SKKR+L  FET+ALT ECSA+ ++ +P K++D GSF+
Subjt:  HEPQSSSKTTTSKEQENGKGREKATQANPTKFLDVLKQLHINIPLVEALEQMLSYVKFLKDILSKKRRLGKFETVALTRECSALFKSNIPAKMKDLGSFT

Query:  LACSIGGMDVGQALYDLGASINLMPLSIFKKLGIGEARPTTVTLQLADRSIKHPEGKIEDVLVKT---------------------------FLSYWMRL
        + C+IG +D  +AL DLGAS++LMPLS+ +KLG+ E +PTT++LQLADRS+K+P G +E+VL+K                            FL+    +
Subjt:  LACSIGGMDVGQALYDLGASINLMPLSIFKKLGIGEARPTTVTLQLADRSIKHPEGKIEDVLVKT---------------------------FLSYWMRL

Query:  IDVQEGELTIRVDDQQ---------------------DLGNRTT------EKTKSSLE------------------------------------------
        IDV+ G LT++V +++                     D+ + +T      E TK  LE                                          
Subjt:  IDVQEGELTIRVDDQQ---------------------DLGNRTT------EKTKSSLE------------------------------------------

Query:  ----------EPPTLELKTLPTHLKYAYLGEESTLPVIISSALTTECEKALLDILKNHIKAIGWTLADIRGISPSYCMHKIRLEEGKDGSVERQRRLNPA
                  + P LELK LP+HL YA+LGE +TLPVI+S +L+ E    L+ IL+   KAIGWT++D+RGISPS CMH+I +E+     VE QRRLNP 
Subjt:  ----------EPPTLELKTLPTHLKYAYLGEESTLPVIISSALTTECEKALLDILKNHIKAIGWTLADIRGISPSYCMHKIRLEEGKDGSVERQRRLNPA

Query:  MKEVVKKEIIKWLDAGVIYPISDSSWVSPVQCVPKKGGMTVIMNSKNELIPTRTVTGWRICMDYRKLNAATKKDHFPLAFIDQMLDRLAGNEFFCFLDGY
        MKEVV+ E++KWLDAG+IYPISDS W+SPVQ VPKKGGMTV+ N  NELIPTRTVTGWR+C+DYRKLN+ T+KDHFPL F+DQ+L+RLAG  ++CFLDGY
Subjt:  MKEVVKKEIIKWLDAGVIYPISDSSWVSPVQCVPKKGGMTVIMNSKNELIPTRTVTGWRICMDYRKLNAATKKDHFPLAFIDQMLDRLAGNEFFCFLDGY

Query:  SCYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFIDDFSVFGKSYDECLTNLERVLKRCEDTNLVLNWEKCHFM
        S YNQI I+PEDQEKTTFTCPYGTFAFRRMPFGLCNAP TFQRCMMAIFSD++E+ +EVF+DDFSVFG S+D CL NL RVL+RCE+TNLVLNWEKCHFM
Subjt:  SCYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFIDDFSVFGKSYDECLTNLERVLKRCEDTNLVLNWEKCHFM

Query:  VTKGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFKSLRQALISAPILVAPDW
        V +GI+LGHKIS  GLEVD+AKI+ I KLP PTNVK +RSFLGH GFYRRFIK FS+++KPL  LL  +  F+FD  CLNAF  L+Q L+SAPI+ APDW
Subjt:  VTKGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFKSLRQALISAPILVAPDW

Query:  SLPFELMCDASNHAVGAVQGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSKVTIYSDHFAIKYLMAKKDAKPRLIRWVLLLQ
        SLPFELMCDAS+ A+GAV GQRK++ +H IYYAS+ LN++Q NY TTEKE+LA+VFA DKFR+YL+GSKV +Y+DH AIKYL+ KKDAKPRLIRWVLLLQ
Subjt:  SLPFELMCDASNHAVGAVQGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSKVTIYSDHFAIKYLMAKKDAKPRLIRWVLLLQ

Query:  EFDLEIKDRKGTENQVTDHLSRLENKEVQESWSDIEERFPDEHVMNAESQEPWYADIVNYLVCNQLPEEFNAQQKKKLRHESKFYCWDEPYLYRLGPDHI
        EFDLEI+D++G EN V DHLSRLE +   +    I E FPDE ++ A S  PWYAD+VNYLV   +P + +  QKKK   + K Y W+EP LY+   D +
Subjt:  EFDLEIKDRKGTENQVTDHLSRLENKEVQESWSDIEERFPDEHVMNAESQEPWYADIVNYLVCNQLPEEFNAQQKKKLRHESKFYCWDEPYLYRLGPDHI

Query:  LRRCVPEYEMHSILRSCHEARYGGHFEGQRTATKVLQSGYFWPALFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGIDFMGPFLPSCSNQ
        +RRCVP+ EM  IL  CH    GGHF   +T  KV QSG++WP +++D R Y   CDRCQR GNIS +NEMPL ++LEVELFD+WGIDFMGPF  S +NQ
Subjt:  LRRCVPEYEMHSILRSCHEARYGGHFEGQRTATKVLQSGYFWPALFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGIDFMGPFLPSCSNQ

Query:  YILVAVDYVSKWVEAAACARNDANVVSKFLKKQIFSQFGTPRAIISDEGTHFINRIITNLLTKFNVSHRVAIAYHPQTNDQAEITNSEIKSILEKVVSTS
        YILVAVDYVSKWVEA A   ND+ VV +F+KK IFS+FG PRAIISDEG+HF NR    LL K+ V+H+VA+AYHPQTN Q E+ N E+K ILEK VS+S
Subjt:  YILVAVDYVSKWVEAAACARNDANVVSKFLKKQIFSQFGTPRAIISDEGTHFINRIITNLLTKFNVSHRVAIAYHPQTNDQAEITNSEIKSILEKVVSTS

Query:  RKDWTERLDEALWAYRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKQYKERTKKWHDKNISKK
        RKDW  +LD+ALWAYRTAFKTP+GMSPY LVFGK+CHLP+ELEH+A WA+K LN+D +A+GE R LQL+EL E+R  AYEN + YKE+TK WHDK++  +
Subjt:  RKDWTERLDEALWAYRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKQYKERTKKWHDKNISKK

Query:  ILYVGQKVLLFNSRLRLFPGKLKSRWSGPFIIKEVFPHGAIMLTNENGTTSFKVNGQRVKPY
           +GQ+VLLFNSRL+LFPGKL+SRWSGPF + +V+P+GA+ + +E  T +FKVNGQR+KPY
Subjt:  ILYVGQKVLLFNSRLRLFPGKLKSRWSGPFIIKEVFPHGAIMLTNENGTTSFKVNGQRVKPY

TrEMBL top hitse value%identityAlignment
A0A2G9FWY3 Reverse transcriptase0.0e+0059.03Show/hide
Query:  NNPYSKTYNPGWRNHPNFGWGGNQSQMQGTGQQPQRNNNPVYNQGVQGHQVPRNQSANSSSSSLEALMCQMITQNEETRKSLEAMLQNQMGEIKSHAITI
        NNPYS TYNPGWR HPNF W  N    QG G  P+           QG Q    Q       SLE  + Q +                      S A   
Subjt:  NNPYSKTYNPGWRNHPNFGWGGNQSQMQGTGQQPQRNNNPVYNQGVQGHQVPRNQSANSSSSSLEALMCQMITQNEETRKSLEAMLQNQMGEIKSHAITI

Query:  RNLELQVGQMASERNTRPQGALPSNTEA-PRGNGKEQCQAVTLRSEKVLPTEPQEQRMQGNLQPQITPLQTPATSIHEPQSSSKTTTSKEQENGKGREKA
        + ++ Q+GQ+A+  N+RPQG+LPSNTE  PR +GK QCQAVTLR+ +    E QE                   ++ EP  S +     E+   KG+E  
Subjt:  RNLELQVGQMASERNTRPQGALPSNTEA-PRGNGKEQCQAVTLRSEKVLPTEPQEQRMQGNLQPQITPLQTPATSIHEPQSSSKTTTSKEQENGKGREKA

Query:  TQANPTKFLDVLKQLHINIPLVEALEQMLSYVKFLKDILSKKRRLGKFETVALTRECSALFKSNIPAKMKDLGSFTLACSIGGMDVGQALYDLGASINLM
                   L++LHINIP  EALEQM SYVKF+KDILSKKR LG +ETVALT ECSA+ ++ +P K+KD GSFT+ C+IG    G+AL DLG    L 
Subjt:  TQANPTKFLDVLKQLHINIPLVEALEQMLSYVKFLKDILSKKRRLGKFETVALTRECSALFKSNIPAKMKDLGSFTLACSIGGMDVGQALYDLGASINLM

Query:  PLSIFKKLGIGEARPTTVTLQLADRSIKHPEGKIEDVLVKT---------------------------FLSYWMRLIDVQ--------------------
                  GEA+PT++TLQLADRS+ +P+G IED+LVK                            FL+    LIDVQ                    
Subjt:  PLSIFKKLGIGEARPTTVTLQLADRSIKHPEGKIEDVLVKT---------------------------FLSYWMRLIDVQ--------------------

Query:  ---------------EGELTIRVDDQQD-----------------LGNRTTEKT------KSSLEEPPTLELKTLPTHLKYAYLGEESTLPVIISSALTT
                       E  L   +D++ +                  G  + E+T      K S+EEPPTLELK LP HL YAYLGE  TLPVIISS+L+ 
Subjt:  ---------------EGELTIRVDDQQD-----------------LGNRTTEKT------KSSLEEPPTLELKTLPTHLKYAYLGEESTLPVIISSALTT

Query:  ECEKALLDILKNHIKAIGWTLADIRGISPSYCMHKIRLEEGKDGSVERQRRLNPAMKEVVKKEIIKWLDAGVIYPISDSSWVSPVQCVPKKGGMTVIMNS
           + LL +L+NH  AIGWT+ADI+GISPS+CMHKI LE+ +  SVE QRRLNP MKEVVKKEIIKWLDAG+IYPISDSSWVSPVQCVPKKGG+TV+ N 
Subjt:  ECEKALLDILKNHIKAIGWTLADIRGISPSYCMHKIRLEEGKDGSVERQRRLNPAMKEVVKKEIIKWLDAGVIYPISDSSWVSPVQCVPKKGGMTVIMNS

Query:  KNELIPTRTVTGWRICMDYRKLNAATKKDHFPLAFIDQMLDRLAGNEFFCFLDGYSCYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCM
         NELIPTRTVTGWR+CMDYRKLN AT+KDHFPL FIDQMLDRLAG EF+CFLDGYS YNQI IAPEDQEK TFTCPYGTFAFRRMPFGLCNAP TFQRCM
Subjt:  KNELIPTRTVTGWRICMDYRKLNAATKKDHFPLAFIDQMLDRLAGNEFFCFLDGYSCYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCM

Query:  MAIFSDYLEQSVEVFIDDFSVFGKSYDECLTNLERVLKRCEDTNLVLNWEKCHFMVTKGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHA
        MAIF+D +E  +EVF+DDFSV+G S+DECL NL  VLKRCEDTNL+LNWEKCHFMV +GIVLGHK+S  G+EVD+AK++ I KLP PT+VK +RSFLGHA
Subjt:  MAIFSDYLEQSVEVFIDDFSVFGKSYDECLTNLERVLKRCEDTNLVLNWEKCHFMVTKGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHA

Query:  GFYRRFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFKSLRQALISAPILVAPDWSLPFELMCDASNHAVGAVQGQRKEKIMHPIYYASKTLNSSQENYT
        GFYRRFIK FS+++KPL  LLE +  FNFD  C +AF  L+  LISAPI+  PDWS PFELMCDAS+ AVGAV GQRK+KI   IYYASKTLN +Q NYT
Subjt:  GFYRRFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFKSLRQALISAPILVAPDWSLPFELMCDASNHAVGAVQGQRKEKIMHPIYYASKTLNSSQENYT

Query:  TTEKEMLAIVFAVDKFRAYLIGSKVTIYSDHFAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVTDHLSRLENKEVQESWSDIEERFPDEHVM
        TTEKE+LA+VFA DKFR+YL+G+KV +Y+DH AI+YL+ KKDAKPRLIRWVLLLQEFDLEI+DRKGTENQ+ DHLSRLE+    +  + I + FPDE ++
Subjt:  TTEKEMLAIVFAVDKFRAYLIGSKVTIYSDHFAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVTDHLSRLENKEVQESWSDIEERFPDEHVM

Query:  N-AESQEPWYADIVNYLVCNQLPEEFNAQQKKKLRHESKFYCWDEPYLYRLGPDHILRRCVPEYEMHSILRSCHEARYGGHFEGQRTATKVLQSGYFWPA
            S  PWYADIVNYL C  +P + +AQQKKK   +++ Y WD+P+L++ GPD+ILRRCVPE EM+ IL  CH + YGGHF G RTA K+LQSG+FWP 
Subjt:  N-AESQEPWYADIVNYLVCNQLPEEFNAQQKKKLRHESKFYCWDEPYLYRLGPDHILRRCVPEYEMHSILRSCHEARYGGHFEGQRTATKVLQSGYFWPA

Query:  LFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGIDFMGPFLPSCSNQYILVAVDYVSKWVEAAACARNDANVVSKFLKKQIFSQFGTPRAI
        LFKDA ++   CDRCQRTGNIS R+EMPLN++LEVELFDVWGIDFMGPF+PS  N YILVAVDYVSKWVEAAA   ND+ VV  F+KK IF++FGTPRAI
Subjt:  LFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGIDFMGPFLPSCSNQYILVAVDYVSKWVEAAACARNDANVVSKFLKKQIFSQFGTPRAI

Query:  ISDEGTHFINRIITNLLTKFNVSHRVAIAYHPQTNDQAEITNSEIKSILEKVVSTSRKDWTERLDEALWAYRTAFKTPIGMSPYALVFGKACHLPLELEH
        ISD GTHF NR    LL+K+ V H+++  YHPQT+ Q E++N EIK ILEK VS++RKDW++RLDEALWAYRTA+KTPIGMSPY LVFGKACHLP+ELEH
Subjt:  ISDEGTHFINRIITNLLTKFNVSHRVAIAYHPQTNDQAEITNSEIKSILEKVVSTSRKDWTERLDEALWAYRTAFKTPIGMSPYALVFGKACHLPLELEH

Query:  KAIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKQYKERTKKWHDKNISKKILYVGQKVLLFNSRLRLFPGKLKSRWSGPFIIKEVFPHGAIMLT
         A WA++KLN D +A+GE R LQLNEL E+R  AYENAK YKE+ K+WH+K I ++    GQ VLLFNSRL+LFPGKLKSRWSGPF I EVFPHGA+ L 
Subjt:  KAIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKQYKERTKKWHDKNISKKILYVGQKVLLFNSRLRLFPGKLKSRWSGPFIIKEVFPHGAIMLT

Query:  NENGTTSFKVNGQRVKPY
        N+N    FKVN QR+K Y
Subjt:  NENGTTSFKVNGQRVKPY

A0A2G9G6G2 Reverse transcriptase0.0e+0055.67Show/hide
Query:  VGMIETDATSNLSAHIANMTSLLQTKALNNQGGTVA----------------SMDAMNTVN--ATASNNPYSKTYNPGWRNHPNFGWGGNQSQMQGTGQQ
        VG+IE D  + L+A I  +   ++   +N    T                  S++++  V+      NNPYS TYNPGWR HPNF W  N    QG G  
Subjt:  VGMIETDATSNLSAHIANMTSLLQTKALNNQGGTVA----------------SMDAMNTVN--ATASNNPYSKTYNPGWRNHPNFGWGGNQSQMQGTGQQ

Query:  PQRNNNPVYNQGVQGHQVPRNQSANSSSSSLEALMCQMITQNEETRKSLEAMLQNQMGEIKSHAITIRNLELQVGQMASERNTRPQGALPSNTEAPRGNG
        P+        +G Q  Q P                       +E + SLE  L   M    S A   + +E Q+GQ+A+  N+RPQG+LPSNTE      
Subjt:  PQRNNNPVYNQGVQGHQVPRNQSANSSSSSLEALMCQMITQNEETRKSLEAMLQNQMGEIKSHAITIRNLELQVGQMASERNTRPQGALPSNTEAPRGNG

Query:  KEQCQAVTLRSEKVLPTEPQEQRMQGNLQPQITPLQTPATSIHEPQSSSKTTTSKEQENGKGREKATQANPTKFLDVLKQLHINIPLVEALEQMLSYVKF
            Q VTLR+ +    E QE                    + EP      T SKE+E     ++     P      L++LHINIP  EALEQM SYVKF
Subjt:  KEQCQAVTLRSEKVLPTEPQEQRMQGNLQPQITPLQTPATSIHEPQSSSKTTTSKEQENGKGREKATQANPTKFLDVLKQLHINIPLVEALEQMLSYVKF

Query:  LKDILSKKRRLGKFETVALTRECSALFKSNIPAKMKDLGSFTLACSIGGMDVGQALYDLGASINLMPLSIFKKLGIGEARPTTVTLQLADRSIKHPEGKI
        +KDILSKKRRLG +E V LT ECS + ++ +P K+K+ GSFT+ C+IG    G+AL DLGASINLMP SI++ LG+GEA+PT++TLQLADRS+ +P+G I
Subjt:  LKDILSKKRRLGKFETVALTRECSALFKSNIPAKMKDLGSFTLACSIGGMDVGQALYDLGASINLMPLSIFKKLGIGEARPTTVTLQLADRSIKHPEGKI

Query:  EDVLVKT---------------------------FLSYWMRLIDVQ-----------------------------------EGELTIRVDDQQD------
        +D+LVK                            FL+    LIDVQ                                   E  L   +D++ +      
Subjt:  EDVLVKT---------------------------FLSYWMRLIDVQ-----------------------------------EGELTIRVDDQQD------

Query:  -----------LGNRTTEKT------KSSLEEPPTLELKTLPTHLKYAYLGEESTLPVIISSALTTECEKALLDILKNHIKAIGWTLADIRGISPSYCMH
                    G  + E+T      K S+EEPPTLELK LP+HL YAYLGE  TLPVIISS+L+    + LL +LKNH   IGWT+ADI+GISPS+CMH
Subjt:  -----------LGNRTTEKT------KSSLEEPPTLELKTLPTHLKYAYLGEESTLPVIISSALTTECEKALLDILKNHIKAIGWTLADIRGISPSYCMH

Query:  KIRLEEGKDGSVERQRRLNPAMKEVVKKEIIKWLDAGVIYPISDSSWVSPVQCVPKKGGMTVIMNSKNELIPTRTVTGWRICMDYRKLNAATKKDHFPLA
        KI LE+ +  S+E QRRLNP MKEVVKKEIIKWLDAG+IYPISDSSWVSPVQCVPKKGG+TV+ N  NELIPTRTVTGWR+CMDYRKLN AT+KDHFPL 
Subjt:  KIRLEEGKDGSVERQRRLNPAMKEVVKKEIIKWLDAGVIYPISDSSWVSPVQCVPKKGGMTVIMNSKNELIPTRTVTGWRICMDYRKLNAATKKDHFPLA

Query:  FIDQMLDRLAGNEFFCFLDGYSCYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFIDDFSVFGKSYDECLTNLE
        FIDQMLDRLAG EF+CFLDGYS YNQI IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAP TFQRCMMAIF+D +E  +EVF+D+FSV+G S+DECL NL 
Subjt:  FIDQMLDRLAGNEFFCFLDGYSCYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFIDDFSVFGKSYDECLTNLE

Query:  RVLKRCEDTNLVLNWEKCHFMVTKGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELLEVNREFNFDGKCL
         VLKRCEDTNLVLNWEKCHFMV +GIVLGHK+S  G+EVD+AK++ I KLP PT+VK +RSFLGHAGFYRRFIK FS+++KPL  LLE +  FNF+  C 
Subjt:  RVLKRCEDTNLVLNWEKCHFMVTKGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELLEVNREFNFDGKCL

Query:  NAFKSLRQALISAPILVAPDWSLPFELMCDASNHAVGAVQGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSKVTIYSDHFAI
        +AF  L+  LISAPI+  PD        CD    AVGAV GQRK+KI   IYYASKTLN +Q NYTTTEKE+LA+VFA DKFR+YL+ +KV +Y+DH AI
Subjt:  NAFKSLRQALISAPILVAPDWSLPFELMCDASNHAVGAVQGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSKVTIYSDHFAI

Query:  KYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVTDHLSRLENKEVQESWSDIEERFPDEHVM-NAESQEPWYADIVNYLVCNQLPEEFNAQQKKKL
        +YL+ KKDA P LI WVLLLQEFDLEI+DRKGTENQ+ DHLSRLE+    +  + I + FPDE ++    S  PWYADIVNYL C  +P + + QQKKK+
Subjt:  KYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVTDHLSRLENKEVQESWSDIEERFPDEHVM-NAESQEPWYADIVNYLVCNQLPEEFNAQQKKKL

Query:  RHESKFYCWDEPYLYRLGPDHILRRCVPEYEMHSILRSCHEARYGGHFEGQRTATKVLQSGYFWPALFKDARAYAVACDRCQRTGNISNRNEMPLNSMLE
          +++ Y W++P+L + GPD+ILRRCVPE EM+ IL  CH + YGGHF G RTA K+LQSG+FWP LFKDA ++   CDRCQRT NIS R+EMPLN++LE
Subjt:  RHESKFYCWDEPYLYRLGPDHILRRCVPEYEMHSILRSCHEARYGGHFEGQRTATKVLQSGYFWPALFKDARAYAVACDRCQRTGNISNRNEMPLNSMLE

Query:  VELFDVWGIDFMGPFLPSCSNQYILVAVDYVSKWVEAAACARNDANVVSKFLKKQIFSQFGTPRAIISDEGTHFINRIITNLLTKFNVSHRVAIAYHPQT
        VELFDVWGIDFMGPF+PS  N YILVAVDYVSKWVEAAA   ND+ VV  F+KK IF++FGTPRAIISD  T+F NR    LL+K+ V H++   YHPQT
Subjt:  VELFDVWGIDFMGPFLPSCSNQYILVAVDYVSKWVEAAACARNDANVVSKFLKKQIFSQFGTPRAIISDEGTHFINRIITNLLTKFNVSHRVAIAYHPQT

Query:  NDQAEITNSEIKSILEKVVSTSRKDWTERLDEALWAYRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQLNELLEWRHSA
        +   E++N EIK ILEK VS++RKDW++RLDEALWAYRTA+KTPIGMSPY L+FGKACHLP+ELEH A WA+ KLN D +A+GE R LQLNEL E+R  A
Subjt:  NDQAEITNSEIKSILEKVVSTSRKDWTERLDEALWAYRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQLNELLEWRHSA

Query:  YENAKQYKERTKKWHDKNISKKILYVGQKVLLFNSRLRLFPGKLKSRWSGPFIIKEVFPHGAIMLTNENGTTSFKVNGQRVKPYHIGEFEINKTSIDLRE
        YENAK YKE+TK+WHDK I ++    GQ VLLFNSRL+LFPGKLKSRW G F I EVFPHGA+ L NEN    FK+N +R+K Y  G  +   TSI L +
Subjt:  YENAKQYKERTKKWHDKNISKKILYVGQKVLLFNSRLRLFPGKLKSRWSGPFIIKEVFPHGAIMLTNENGTTSFKVNGQRVKPYHIGEFEINKTSIDLRE

A0A2G9IA86 DNA-directed DNA polymerase0.0e+0055.39Show/hide
Query:  NNPYSKTYNPGWRNHPNFGWGGNQSQMQGTGQQPQRNNNPVYNQGVQGHQVPRNQSANSSSSSLEALMCQMITQNEETRKSLEAMLQNQMGEIKSHAITI
        NNPYS TYNPGWR HPNF W  N +Q QG+  + Q++      Q +Q                             E + SLE  L   M    S A  +
Subjt:  NNPYSKTYNPGWRNHPNFGWGGNQSQMQGTGQQPQRNNNPVYNQGVQGHQVPRNQSANSSSSSLEALMCQMITQNEETRKSLEAMLQNQMGEIKSHAITI

Query:  RNLELQVGQMASERNTRPQGALPSNTEA-PRGNGKEQCQAVTLRSEKVLPTEPQEQRMQGNLQPQITPLQTPATSIHEPQSSSKTTTSKEQENGKGREKA
        + +E Q+GQ+A+  N+RPQG+L SNTE  PR +GK Q QAVTLR+ + L    +E                P  S  +   S K     E    + +++ 
Subjt:  RNLELQVGQMASERNTRPQGALPSNTEA-PRGNGKEQCQAVTLRSEKVLPTEPQEQRMQGNLQPQITPLQTPATSIHEPQSSSKTTTSKEQENGKGREKA

Query:  TQANPTKFLDVLKQLHINIPLVEALEQMLSYVKFLKDILSKKRRLGKFETVALTRECSALFKSNIPAKMKDLGSFTLACSIGGMDVGQALYDLGASINLM
         +    KFL+V K+LHIN P  EALEQM SYVKF+K ILSKKRRLG +ETVALT ECSA+ ++ +P K+KD GSFT+ C+IG    G+AL DLGASINLM
Subjt:  TQANPTKFLDVLKQLHINIPLVEALEQMLSYVKFLKDILSKKRRLGKFETVALTRECSALFKSNIPAKMKDLGSFTLACSIGGMDVGQALYDLGASINLM

Query:  PLSIFKKLGIGEARPTTVTLQLADRSIKHPEGKIEDVLVKT---------------------------FLSYWMRLIDVQEGELTIRVDDQQ--------
        P SI++ LG+GEA+PT++TLQLA+RS+ +P+G IED+LVK                            FL+    LIDVQ+G+LT+RV DQQ        
Subjt:  PLSIFKKLGIGEARPTTVTLQLADRSIKHPEGKIEDVLVKT---------------------------FLSYWMRLIDVQEGELTIRVDDQQ--------

Query:  ----------------------------------DL------------------------GNRTTEKT------KSSLEEPPTLELKTLPTHLKYAYLGE
                                          DL                        G  + E+T      K S+EE PTLELK LP+HL Y YLGE
Subjt:  ----------------------------------DL------------------------GNRTTEKT------KSSLEEPPTLELKTLPTHLKYAYLGE

Query:  ESTLPVIISSALTTECEKALLDILKNHIKAIGWTLADIRGISPSYCMHKIRLEEGKDGSVERQRRLNPAMKEVVKKEIIKWLDAGVIYPISDSSWVSPVQ
          TLPVIISS+L+    + LL + +NH  AIGWT+ADI+GIS S+CMHKI LE+ +  SVE QRRLNP MKEVVKKEIIKW+DAG+IYPISDSSWVSPVQ
Subjt:  ESTLPVIISSALTTECEKALLDILKNHIKAIGWTLADIRGISPSYCMHKIRLEEGKDGSVERQRRLNPAMKEVVKKEIIKWLDAGVIYPISDSSWVSPVQ

Query:  CVPKKGGMTVIMNSKNELIPTRTVTGWRICMDYRKLNAATKKDHFPLAFIDQMLDRLAGNEFFCFLDGYSCYNQIMIAPEDQEKTTFTCPYGTFAFRRMP
        CVPKKGG+TV+ N  NELIPTRTVTGWR+CMDYRKLN AT+KDHFPL FIDQMLDRLAG EF+CFLDGY           DQEKTTFTCPYGTFAFRR+P
Subjt:  CVPKKGGMTVIMNSKNELIPTRTVTGWRICMDYRKLNAATKKDHFPLAFIDQMLDRLAGNEFFCFLDGYSCYNQIMIAPEDQEKTTFTCPYGTFAFRRMP

Query:  FGLCNAPGTFQRCMMAIFSDYLEQSVEVFIDDFSVFGKSYDECLTNLERVLKRCEDTNLVLNWEKCHFMVTKGIVLGHKISKVGLEVDQAKIDAIAKLPA
        FGLCNAP TFQRCMMAIF+D +E  +EVF+DDFSV+G S+DECL NL  VLKRCEDTNLVLNW+KCHFMV +GIVL HK+S  G+EV++AK++ I KLP 
Subjt:  FGLCNAPGTFQRCMMAIFSDYLEQSVEVFIDDFSVFGKSYDECLTNLERVLKRCEDTNLVLNWEKCHFMVTKGIVLGHKISKVGLEVDQAKIDAIAKLPA

Query:  PTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFKSLRQALISAPILVAPDWSLPFELMCDASNHAVGAVQGQRKEKIMHPIY
        PT+VK +RSFLGHAGFYRRFIK FS+++KPL  LLE +  F FD  CL+AF  L+  LISAPI+  PDWS PFELMCDAS+ A+GAV GQRK+KI   IY
Subjt:  PTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFKSLRQALISAPILVAPDWSLPFELMCDASNHAVGAVQGQRKEKIMHPIY

Query:  YASKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSKVTIYSDHFAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVTDHLSRLENKEVQES
        YASKTLN  Q NYTTTEKE+LA+VFA DKFR+YL+G+KV +Y+DH AI+YL+ KKDAKP                               RLE+    + 
Subjt:  YASKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSKVTIYSDHFAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVTDHLSRLENKEVQES

Query:  WSDIEERFPDEHVMN-AESQEPWYADIVNYLVCNQLPEEFNAQQKKKLRHESKFYCWDEPYLYRLGPDHILRRCVPEYEMHSILRSCHEARYGGHFEGQR
         + I + FPDE ++    S  PWY+DIVNYL C  +P + +AQQKKK   +++ Y WD+ +L++ GPD+ILRRCVPE EM+ IL  CH + YGGHF G R
Subjt:  WSDIEERFPDEHVMN-AESQEPWYADIVNYLVCNQLPEEFNAQQKKKLRHESKFYCWDEPYLYRLGPDHILRRCVPEYEMHSILRSCHEARYGGHFEGQR

Query:  TATKVLQSGYFWPALFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGIDFMGPFLPSCSNQYILVAVDYVSKWVEAAACARNDANVVSKFL
        TA K+LQSG+FWP LFKDA ++   CDRCQRTGNIS R+EMPLN++L+VELFDVWGIDF+GPF+PS  N YILVAVDYVSKWVEA A   ND+ VV  F+
Subjt:  TATKVLQSGYFWPALFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGIDFMGPFLPSCSNQYILVAVDYVSKWVEAAACARNDANVVSKFL

Query:  KKQIFSQFGTPRAIISDEGTHFINRIITNLLTKFNVSHRVAIAYHPQTNDQAEITNSEIKSILEKVVSTSRKDWTERLDEALWAYRTAFKTPIGMSPYAL
        KK IF++FGTPRAIISD G HF NR     L+K+ V H++   YHPQT+ Q E++N EIK ILEK VS++R DW++RLDEALWAYRT +KTPIGMSPY L
Subjt:  KKQIFSQFGTPRAIISDEGTHFINRIITNLLTKFNVSHRVAIAYHPQTNDQAEITNSEIKSILEKVVSTSRKDWTERLDEALWAYRTAFKTPIGMSPYAL

Query:  VFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKQYKERTKKWHDKNISKKILYVGQKVLLFNSRLRLFPGKLKSRWSGPF
        +FGKACHL +ELEH A WA++KLN D  A GE R LQLNEL E+R  AYENAK YKE+TK+WHDK I ++    GQ VLLFNSRL+LFP KLK RWSGPF
Subjt:  VFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKQYKERTKKWHDKNISKKILYVGQKVLLFNSRLRLFPGKLKSRWSGPF

Query:  IIKEVFPHGAIMLTNENGTTSFKVNGQRVKPYHIGEFEINKTSIDLRE
         I EVFPHGA+ L NEN    FKVN QR+K Y  G  +   TSI L +
Subjt:  IIKEVFPHGAIMLTNENGTTSFKVNGQRVKPYHIGEFEINKTSIDLRE

A0A5N6MBJ1 Reverse transcriptase0.0e+0049.84Show/hide
Query:  DKTYTQAKEILDRISRNTDEWVDDGYGSRYMDKKRSQVGMIETDATSNLSAHIANMTSLLQTKALNN-----------------QGGTVAS----MDAMN
        DKT  +   +L++++ N  +W       +    K  + G+ + D  ++L A +  MT  +    +N                  Q G + +    +D M 
Subjt:  DKTYTQAKEILDRISRNTDEWVDDGYGSRYMDKKRSQVGMIETDATSNLSAHIANMTSLLQTKALNN-----------------QGGTVAS----MDAMN

Query:  TVNATASNNPYSKTYNPGWRNHPNFGW---GGNQ----------SQMQGTGQQPQRNNNPVYNQGVQGHQVPRNQSANSSS-----SSLEALMCQMIT--
        + N    NNPYS TYNPGW+NHPNF W   G NQ           Q Q   +  Q+N +   NQ    +Q  ++Q ++S S     S+LE +M Q ++  
Subjt:  TVNATASNNPYSKTYNPGWRNHPNFGW---GGNQ----------SQMQGTGQQPQRNNNPVYNQGVQGHQVPRNQSANSSS-----SSLEALMCQMIT--

Query:  -----QNEETRKSLEAMLQNQMGEIKSHAITIRNLELQVGQMASERNTRPQGALPSNTEAPRGNGKEQCQAVTLRSEKVL----------PTEPQEQRMQ
             ++E      +A  Q    E++S    IR +E Q+GQ+A     R +G LPSNTE    N KE C+AVTLRS K            P   +E  +Q
Subjt:  -----QNEETRKSLEAMLQNQMGEIKSHAITIRNLELQVGQMASERNTRPQGALPSNTEAPRGNGKEQCQAVTLRSEKVL----------PTEPQEQRMQ

Query:  GNLQPQITPLQTPATSIHEPQSSSKTTTSKEQENGKGREKATQANPTKFLDVLKQLHINIPLVEALEQMLSYVKFLKDILSKKRRLGKFETVALTRECSA
          ++       T    + EP    K T       G+ + +  + +  KFLD+ KQLHIN+P VEAL QM  Y KFLKD+L+ K++L +   V L  ECSA
Subjt:  GNLQPQITPLQTPATSIHEPQSSSKTTTSKEQENGKGREKATQANPTKFLDVLKQLHINIPLVEALEQMLSYVKFLKDILSKKRRLGKFETVALTRECSA

Query:  LFKSNIPAKMKDLGSFTLACSIGGMDVGQALYDLGASINLMPLSIFKKLGIGEARPTTVTLQLADRSIKHPEGKIEDVLVK-------------------
        + ++ +P KMKD GSFT+ C IGG+ V  AL DLGASINLMP S+F KL +GE +PT +++QLADRS+K+P G +E++LVK                   
Subjt:  LFKSNIPAKMKDLGSFTLACSIGGMDVGQALYDLGASINLMPLSIFKKLGIGEARPTTVTLQLADRSIKHPEGKIEDVLVK-------------------

Query:  --------TFLSYWMRLIDVQEGELTIRVDDQQDL------------------------------------------------------GNRTTE-----
                 FL+    L+DV EG+LT+RVD+++ +                                                       NR  +     
Subjt:  --------TFLSYWMRLIDVQEGELTIRVDDQQDL------------------------------------------------------GNRTTE-----

Query:  ------------------------KTKSSLEEPPTLELKTLPTHLKYAYLGEESTLPVIISSALTTECEKALLDILKNHIKAIGWTLADIRGISPSYCMH
                                K K S+EEPP+LELK LPTHL+YAYL E+S LPVII+S LT + +  LLD+LK H KA+ W + DI+GI+PS+C H
Subjt:  ------------------------KTKSSLEEPPTLELKTLPTHLKYAYLGEESTLPVIISSALTTECEKALLDILKNHIKAIGWTLADIRGISPSYCMH

Query:  KIRLEEGKDGSVERQRRLNPAMKEVVKKEIIKWLDAGVIYPISDSSWVSPVQCVPKKGGMTVIMNSKNELIPTRTVTGWRICMDYRKLNAATKKDHFPLA
        KI +E+     V+ QRRLNP M+EVVKKE+IK LDAG+IYPISDS+WVSPVQ VPKKGGMTV+ N KNELIPTRT+TGWR+C+DYRKLN AT+KDHFPL 
Subjt:  KIRLEEGKDGSVERQRRLNPAMKEVVKKEIIKWLDAGVIYPISDSSWVSPVQCVPKKGGMTVIMNSKNELIPTRTVTGWRICMDYRKLNAATKKDHFPLA

Query:  FIDQMLDRLAGNEFFCFLDGYSCYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFIDDFSVFGKSYDECLTNLE
        FIDQML+RL+GN F+CFLDG+S Y QI IAPEDQEKTTFTCPYGTFA+RRMPFGLCNAP TFQRCM+AIF D +E+S+EVF+DDFSVFG S+D CL+NL+
Subjt:  FIDQMLDRLAGNEFFCFLDGYSCYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFIDDFSVFGKSYDECLTNLE

Query:  RVLKRCEDTNLVLNWEKCHFMVTKGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELLEVNREFNFDGKCL
        ++L RCE++NLVLNWEKCHFMV +GIVLGHKIS  GLEVD+AK+D I+KLP PT+V+ +RSFLGHAGFYRRFIK FS++A+P+++LLE + +F F  +CL
Subjt:  RVLKRCEDTNLVLNWEKCHFMVTKGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELLEVNREFNFDGKCL

Query:  NAFKSLRQALISAPILVAPDWSLPFELMCDASNHAVGAVQGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSKVTIYSDHFAI
         AF  L++ L++API+VAPDW+LPFELMCDAS++AVG V GQRK+K  HPIYYASKTLN +QENYTTTEKE+LA+VFA DKFR+YLI SK  +Y+DH A+
Subjt:  NAFKSLRQALISAPILVAPDWSLPFELMCDASNHAVGAVQGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSKVTIYSDHFAI

Query:  KYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVTDHLSRLENKEVQE-SWSDIEERFPDEHVMNAES--QEPWYADIVNYLVCNQLPEEFNAQQKK
        +YL  K+DAKPRLIRW+LLLQEFD+EI+D+KG EN   DHLSRLEN  ++E   S+I + FP E ++  ++    PW+AD  NYL    L +    QQ++
Subjt:  KYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVTDHLSRLENKEVQE-SWSDIEERFPDEHVMNAES--QEPWYADIVNYLVCNQLPEEFNAQQKK

Query:  KLRHESKFYCWDEPYLYRLGPDHILRRCVPEYEMHSILRSCHEARYGGHFEGQRTATKVLQSGYFWPALFKDARAYAVACDRCQRTGNISNRNEMPLNSM
        K   + K+Y W++PYL+R+  D ++RRCV   E  +IL  CHE   GGH     TA KV  SG++WP +FKDA A   ACD CQR GNIS R+EMP NS+
Subjt:  KLRHESKFYCWDEPYLYRLGPDHILRRCVPEYEMHSILRSCHEARYGGHFEGQRTATKVLQSGYFWPALFKDARAYAVACDRCQRTGNISNRNEMPLNSM

Query:  LEVELFDVWGIDFMGPFLPSCSNQYILVAVDYVSKWVEAAACARNDANVVSKFLKKQIFSQFGTPRAIISDEGTHFINRIITNLLTKFNVSHRVAIAYHP
           E+FDVWGIDFMGPF  S  ++YILVAVDYVSKWVEA A   NDA VV KFL+K +F++FG P+ +ISD GTHF N  +   L+++ V+HR +  YHP
Subjt:  LEVELFDVWGIDFMGPFLPSCSNQYILVAVDYVSKWVEAAACARNDANVVSKFLKKQIFSQFGTPRAIISDEGTHFINRIITNLLTKFNVSHRVAIAYHP

Query:  QTNDQAEITNSEIKSILEKVVSTSRKDWTERLDEALWAYRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQLNELLEWRH
        QT+ Q E+TN E+K ILE+ V  +RK+W ++LD+ALWA+RTA+KTPIG +PY LV+GKACHLP+ELEHKA WA+K +NLD  ++GE R +Q++EL + R+
Subjt:  QTNDQAEITNSEIKSILEKVVSTSRKDWTERLDEALWAYRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQLNELLEWRH

Query:  SAYENAKQYKERTKKWHDKNI-SKKILYVGQKVLLFNSRLRLFPGKLKSRWSGPFIIKEVFPHGAIMLTNENGTTSFKVNGQRVKPY
         AYEN++ YKERTKK HD ++   K   VG +VLL+NSRLRLFPGKLKSRW+GP+++KEVF +G + + + +G   FKVNG R+K Y
Subjt:  SAYENAKQYKERTKKWHDKNI-SKKILYVGQKVLLFNSRLRLFPGKLKSRWSGPFIIKEVFPHGAIMLTNENGTTSFKVNGQRVKPY

A0A6P8CBX2 Reverse transcriptase0.0e+0052.81Show/hide
Query:  MDKTYTQAKEILDRISRNTDEWVDDGYGSRY-----MD---KKRSQVGMIETDATSNLSAHIANMTSLLQTKALNNQGGTVASM----------DAMNTV
        M K Y +A  +++ ++ +   W ++   SR      MD      +Q+  + T  +   SAH  N   +   +  +    T+  M          + +N V
Subjt:  MDKTYTQAKEILDRISRNTDEWVDDGYGSRY-----MD---KKRSQVGMIETDATSNLSAHIANMTSLLQTKALNNQGGTVASM----------DAMNTV

Query:  NATASNN--PYSKTYNPGWRNHPNFGWGGNQSQMQGTGQQPQRNNNPVYN--QGVQGHQVPRNQSANSSSSSLEALMCQMITQNEETRKSLEAMLQNQMG
        N    +N  PYS TYNPGWRNHPNF W               RN N       G Q     +N     S S +E LM   +       +  + MLQNQ  
Subjt:  NATASNN--PYSKTYNPGWRNHPNFGWGGNQSQMQGTGQQPQRNNNPVYN--QGVQGHQVPRNQSANSSSSSLEALMCQMITQNEETRKSLEAMLQNQMG

Query:  EIKSHAITIRNLELQVGQMASERNTRPQGALPSNTEA-PRGNGKEQCQAVTLRSEKVLPTEPQEQRMQGNLQPQITPLQTPATSIHEPQSSS---KTTTS
               TIRNLE Q+ Q++ + + RP G+LPSNTE  P+G       A+ LRS K L    ++ + Q     +    Q     + EP+  S   K    
Subjt:  EIKSHAITIRNLELQVGQMASERNTRPQGALPSNTEA-PRGNGKEQCQAVTLRSEKVLPTEPQEQRMQGNLQPQITPLQTPATSIHEPQSSS---KTTTS

Query:  KEQENGKGREKATQANPTKFLDVLKQLHINIPLVEALEQMLSYVKFLKDILSKKRRLGKFETVALTRECSALFKS---NIPAKMKDLGSFTLACSIGGMD
             G+ +++   A   KFLDV K+L INIP  EAL+QM SY +F+KD+L+KKR+    E V LT ECS + +    N+P K +D GSFT+ C+IG   
Subjt:  KEQENGKGREKATQANPTKFLDVLKQLHINIPLVEALEQMLSYVKFLKDILSKKRRLGKFETVALTRECSALFKS---NIPAKMKDLGSFTLACSIGGMD

Query:  VGQALYDLGASINLMPLSIFKKLGIGEARPTTVTLQLADRSIKHPEGKIEDVLVKT---------------------------FLSYWMRLIDVQEGELT
            L D GASINLMPLSIF+KLG+GE + T +TLQLADRSIK+P+G +E+VLVK                            FL+    LIDV++G+LT
Subjt:  VGQALYDLGASINLMPLSIFKKLGIGEARPTTVTLQLADRSIKHPEGKIEDVLVKT---------------------------FLSYWMRLIDVQEGELT

Query:  IRV---------------------------------------------------------DDQ-----------------QDLGNRTTEKTKSSLEEPPT
        +RV                                                         DD+                 ++LG   T K  SSL + P 
Subjt:  IRV---------------------------------------------------------DDQ-----------------QDLGNRTTEKTKSSLEEPPT

Query:  LELKTLPTHLKYAYLGEESTLPVIISSALTTECEKALLDILKNHIKAIGWTLADIRGISPSYCMHKIRLEEGKDGSVERQRRLNPAMKEVVKKEIIKWLD
        LELK LP+HLKYAYLG + TLP+IISS+LT + E+ LL +L+ H +AIGWT+ADI+GISP  C H+I LE      V+ QRRLNP +KEVVKKE++K LD
Subjt:  LELKTLPTHLKYAYLGEESTLPVIISSALTTECEKALLDILKNHIKAIGWTLADIRGISPSYCMHKIRLEEGKDGSVERQRRLNPAMKEVVKKEIIKWLD

Query:  AGVIYPISDSSWVSPVQCVPKKGGMTVIMNSKNELIPTRTVTGWRICMDYRKLNAATKKDHFPLAFIDQMLDRLAGNEFFCFLDGYSCYNQIMIAPEDQE
        AG+IYPISDS WVSPVQ VPKKGGMTV+ N  N+LIPTRTVTGWR+C+DYRKLN AT+KDHFPL FIDQML++LAG++++CFLDGYS YNQI IAPEDQE
Subjt:  AGVIYPISDSSWVSPVQCVPKKGGMTVIMNSKNELIPTRTVTGWRICMDYRKLNAATKKDHFPLAFIDQMLDRLAGNEFFCFLDGYSCYNQIMIAPEDQE

Query:  KTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFIDDFSVFGKSYDECLTNLERVLKRCEDTNLVLNWEKCHFMVTKGIVLGHKISKV
        KTTFTCPYGTFAFRRMPFGLCNAP TFQRCMM+IFSD LE  +E+F+DDFSVFGKS++ CLTNL  VLKRC++TNL+LNWEKCHFMV +GIVLGHK+SK 
Subjt:  KTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFIDDFSVFGKSYDECLTNLERVLKRCEDTNLVLNWEKCHFMVTKGIVLGHKISKV

Query:  GLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFKSLRQALISAPILVAPDWSLPFELMCDASNHA
        G+EVD+AK++ I KLP PT+ K +RSFLGHAGFYRRFIK FS++++PL  LLE +  F F+  CL AF  L++ L SAP++VAP+W LPFELMCDAS++A
Subjt:  GLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFKSLRQALISAPILVAPDWSLPFELMCDASNHA

Query:  VGAVQGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSKVTIYSDHFAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTEN
        VGAV GQR+ K+ H IYYAS+TLN +Q+NY TTEKE+LA++FA DKFR YLIGSK+ +Y+DH A+KYL AK DAKPRLIRW+LLLQEFDLEI+D KGTEN
Subjt:  VGAVQGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSKVTIYSDHFAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTEN

Query:  QVTDHLSRLENKEVQESWSDIEERFPDEHVMNAESQE-PWYADIVNYLVCNQLPEEFNAQQKKKLRHESKFYCWDEPYLYRLGPDHILRRCVPEYEMHSI
         V DHLSRLE+  +    S I E+FPDE +  AE Q  PWYADIVNY+V N  P   ++QQKKK  H+ K+Y WDEPYL++   D ++RRCVPE E  SI
Subjt:  QVTDHLSRLENKEVQESWSDIEERFPDEHVMNAESQE-PWYADIVNYLVCNQLPEEFNAQQKKKLRHESKFYCWDEPYLYRLGPDHILRRCVPEYEMHSI

Query:  LRSCHEARYGGHFEGQRTATKVLQSGYFWPALFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGIDFMGPFLPSCSNQYILVAVDYVSKWV
        ++ CH    GGHF  +RTATK+L  G++WP +F D R Y ++C  CQRTGNIS R+E+P NS+L +ELFDVWGIDFMGPF  S SN+YILVAVDYVSKWV
Subjt:  LRSCHEARYGGHFEGQRTATKVLQSGYFWPALFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGIDFMGPFLPSCSNQYILVAVDYVSKWV

Query:  EAAACARNDANVVSKFLKKQIFSQFGTPRAIISDEGTHFINRIITNLLTKFNVSHRVAIAYHPQTNDQAEITNSEIKSILEKVVSTSRKDWTERLDEALW
        EA A   NDA VV +FLKK IFS+FG PRAIISD G+HF NR    LL+K+ V+H++A  YHPQT  Q E++N EIK ILEK V+ SRKDW+ +LD+ALW
Subjt:  EAAACARNDANVVSKFLKKQIFSQFGTPRAIISDEGTHFINRIITNLLTKFNVSHRVAIAYHPQTNDQAEITNSEIKSILEKVVSTSRKDWTERLDEALW

Query:  AYRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKQYKERTKKWHDKNISKKILYVGQKVLLFNS
        AYRTAFKTPIGMSPY +V+GK+CHLP+ELEHKA WA+K LN D +A+GE R LQLN++ E R  AYENA+ YKER K+WHD+NI K+    GQKVLL+NS
Subjt:  AYRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKQYKERTKKWHDKNISKKILYVGQKVLLFNS

Query:  RLRLFPGKLKSRWSGPFIIKEVFPHGAIMLTNENGTTSFKVNGQRVKPYHIGE-FEINKTSIDL
        RL+LFPGKLKSRWSGPF+I  VFP+GA+ L +E+  T FKVNG  +K Y  GE  + + T++DL
Subjt:  RLRLFPGKLKSRWSGPFIIKEVFPHGAIMLTNENGTTSFKVNGQRVKPYHIGE-FEINKTSIDL

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein9.0e-7727.81Show/hide
Query:  RQRRLNPAMKEVVKKEIIKWLDAGVIYPISDSSWVSPVQCVPKKGGMTVIMNSKNELIPTRTVTGWRICMDYRKLNAATKKDHFPLAFIDQMLDRLAGNE
        R   L P   + +  EI + L +G+I   S +    PV  VPKK G                    R+ +DY+ LN   K + +PL  I+Q+L ++ G+ 
Subjt:  RQRRLNPAMKEVVKKEIIKWLDAGVIYPISDSSWVSPVQCVPKKGGMTVIMNSKNELIPTRTVTGWRICMDYRKLNAATKKDHFPLAFIDQMLDRLAGNE

Query:  FFCFLDGYSCYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFIDDFSVFGKSYDECLTNLERVLKRCEDTNLVL
         F  LD  S Y+ I +   D+ K  F CP G F +  MP+G+  AP  FQ  +  I  +  E  V  ++DD  +  KS  E + +++ VL++ ++ NL++
Subjt:  FFCFLDGYSCYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFIDDFSVFGKSYDECLTNLERVLKRCEDTNLVL

Query:  NWEKCHFMVTKGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFKSLRQALISA
        N  KC F  ++   +G+ IS+ G    Q  ID + +   P N K LR FLG   + R+FI   SQ+  PL+ LL+ +  + +      A ++++Q L+S 
Subjt:  NWEKCHFMVTKGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFKSLRQALISA

Query:  PILVAPDWSLPFELMCDASNHAVGAVQGQR-KEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGS--KVTIYSDHFAI--KYLMAKKD
        P+L   D+S    L  DAS+ AVGAV  Q+  +   +P+ Y S  ++ +Q NY+ ++KEMLAI+ ++  +R YL  +     I +DH  +  +     + 
Subjt:  PILVAPDWSLPFELMCDASNHAVGAVQGQR-KEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGS--KVTIYSDHFAI--KYLMAKKD

Query:  AKPRLIRWVLLLQEFDLEIKDRKGTENQVTDHLSRL--ENKEVQESWSDIEERFPDEHVMNAESQEPWYAD------IVNYLVCNQLPEEFNAQQKKKLR
           RL RW L LQ+F+ EI  R G+ N + D LSR+  E + + +   D    F ++  +  + +     +      ++N L       E N Q K  L 
Subjt:  AKPRLIRWVLLLQEFDLEIKDRKGTENQVTDHLSRL--ENKEVQESWSDIEERFPDEHVMNAESQEPWYAD------IVNYLVCNQLPEEFNAQQKKKLR

Query:  HESKFYCWDEPYLYRLGPDHILRRCVPEYEMHSILRSCHEARYGGHFEGQRTATKVLQSGYFWPALFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEV
          SK              D IL     +    +I++  HE     H  G    T ++   + W  + K  + Y   C  CQ   + +++   PL  +   
Subjt:  HESKFYCWDEPYLYRLGPDHILRRCVPEYEMHSILRSCHEARYGGHFEGQRTATKVLQSGYFWPALFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEV

Query:  EL-FDVWGIDFMGPFLPSCSNQYILVAVDYVSKWVEAAACARN-DANVVSKFLKKQIFSQFGTPRAIISDEGTHFINRIITNLLTKFNVSHRVAIAYHPQ
        E  ++   +DF+     S     + V VD  SK      C ++  A   ++   +++ + FG P+ II+D    F ++   +   K+N   + ++ Y PQ
Subjt:  EL-FDVWGIDFMGPFLPSCSNQYILVAVDYVSKWVEAAACARN-DANVVSKFLKKQIFSQFGTPRAIISDEGTHFINRIITNLLTKFNVSHRVAIAYHPQ

Query:  TNDQAEITNSEIKSILEKVVSTSRKDWTERLDEALWAYRTAFKTPIGMSPYALVFGKACHL-PLEL
        T+ Q E TN  ++ +L  V ST    W + +     +Y  A  +   M+P+ +V   +  L PLEL
Subjt:  TNDQAEITNSEIKSILEKVVSTSRKDWTERLDEALWAYRTAFKTPIGMSPYALVFGKACHL-PLEL

P0CT41 Transposon Tf2-12 polyprotein9.0e-7727.81Show/hide
Query:  RQRRLNPAMKEVVKKEIIKWLDAGVIYPISDSSWVSPVQCVPKKGGMTVIMNSKNELIPTRTVTGWRICMDYRKLNAATKKDHFPLAFIDQMLDRLAGNE
        R   L P   + +  EI + L +G+I   S +    PV  VPKK G                    R+ +DY+ LN   K + +PL  I+Q+L ++ G+ 
Subjt:  RQRRLNPAMKEVVKKEIIKWLDAGVIYPISDSSWVSPVQCVPKKGGMTVIMNSKNELIPTRTVTGWRICMDYRKLNAATKKDHFPLAFIDQMLDRLAGNE

Query:  FFCFLDGYSCYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFIDDFSVFGKSYDECLTNLERVLKRCEDTNLVL
         F  LD  S Y+ I +   D+ K  F CP G F +  MP+G+  AP  FQ  +  I  +  E  V  ++DD  +  KS  E + +++ VL++ ++ NL++
Subjt:  FFCFLDGYSCYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFIDDFSVFGKSYDECLTNLERVLKRCEDTNLVL

Query:  NWEKCHFMVTKGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFKSLRQALISA
        N  KC F  ++   +G+ IS+ G    Q  ID + +   P N K LR FLG   + R+FI   SQ+  PL+ LL+ +  + +      A ++++Q L+S 
Subjt:  NWEKCHFMVTKGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFKSLRQALISA

Query:  PILVAPDWSLPFELMCDASNHAVGAVQGQR-KEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGS--KVTIYSDHFAI--KYLMAKKD
        P+L   D+S    L  DAS+ AVGAV  Q+  +   +P+ Y S  ++ +Q NY+ ++KEMLAI+ ++  +R YL  +     I +DH  +  +     + 
Subjt:  PILVAPDWSLPFELMCDASNHAVGAVQGQR-KEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGS--KVTIYSDHFAI--KYLMAKKD

Query:  AKPRLIRWVLLLQEFDLEIKDRKGTENQVTDHLSRL--ENKEVQESWSDIEERFPDEHVMNAESQEPWYAD------IVNYLVCNQLPEEFNAQQKKKLR
           RL RW L LQ+F+ EI  R G+ N + D LSR+  E + + +   D    F ++  +  + +     +      ++N L       E N Q K  L 
Subjt:  AKPRLIRWVLLLQEFDLEIKDRKGTENQVTDHLSRL--ENKEVQESWSDIEERFPDEHVMNAESQEPWYAD------IVNYLVCNQLPEEFNAQQKKKLR

Query:  HESKFYCWDEPYLYRLGPDHILRRCVPEYEMHSILRSCHEARYGGHFEGQRTATKVLQSGYFWPALFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEV
          SK              D IL     +    +I++  HE     H  G    T ++   + W  + K  + Y   C  CQ   + +++   PL  +   
Subjt:  HESKFYCWDEPYLYRLGPDHILRRCVPEYEMHSILRSCHEARYGGHFEGQRTATKVLQSGYFWPALFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEV

Query:  EL-FDVWGIDFMGPFLPSCSNQYILVAVDYVSKWVEAAACARN-DANVVSKFLKKQIFSQFGTPRAIISDEGTHFINRIITNLLTKFNVSHRVAIAYHPQ
        E  ++   +DF+     S     + V VD  SK      C ++  A   ++   +++ + FG P+ II+D    F ++   +   K+N   + ++ Y PQ
Subjt:  EL-FDVWGIDFMGPFLPSCSNQYILVAVDYVSKWVEAAACARN-DANVVSKFLKKQIFSQFGTPRAIISDEGTHFINRIITNLLTKFNVSHRVAIAYHPQ

Query:  TNDQAEITNSEIKSILEKVVSTSRKDWTERLDEALWAYRTAFKTPIGMSPYALVFGKACHL-PLEL
        T+ Q E TN  ++ +L  V ST    W + +     +Y  A  +   M+P+ +V   +  L PLEL
Subjt:  TNDQAEITNSEIKSILEKVVSTSRKDWTERLDEALWAYRTAFKTPIGMSPYALVFGKACHL-PLEL

P10394 Retrovirus-related Pol polyprotein from transposon 4121.9e-9027.25Show/hide
Query:  MHKIRLEEGKDGSVERQRRLNP-AMKEVVKKEIIKWLDAGVIYPISDSSWVSPVQCVPKKGGMTVIMNSKNELIPTRTVTGWRICMDYRKLNAATKKDHF
        ++K +L    D  V  +   +P +  E ++ ++ K +   ++ P S S + SP+  VPKK              P      WR+ +DYR++N     D F
Subjt:  MHKIRLEEGKDGSVERQRRLNP-AMKEVVKKEIIKWLDAGVIYPISDSSWVSPVQCVPKKGGMTVIMNSKNELIPTRTVTGWRICMDYRKLNAATKKDHF

Query:  PLAFIDQMLDRLAGNEFFCFLDGYSCYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFIDDFSVFGKSYDECLT
        PL  ID +LD+L   ++F  LD  S ++QI +    ++ T+F+   G++ F R+PFGL  AP +FQR M   FS        +++DD  V G S    L 
Subjt:  PLAFIDQMLDRLAGNEFFCFLDGYSCYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFIDDFSVFGKSYDECLT

Query:  NLERVLKRCEDTNLVLNWEKCHFMVTKGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELLEVNREFNFDG
        NL  V  +C + NL L+ EKC F + +   LGHK +  G+  D  K D I   P P +  + R F+    +YRRFIK F+  ++ ++ L + N  F +  
Subjt:  NLERVLKRCEDTNLVLNWEKCHFMVTKGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELLEVNREFNFDG

Query:  KCLNAFKSLRQALISAPILVAPDWSLPFELMCDASNHAVGAVQGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSKVTIYSDH
        +C  AF  L+  LI+  +L  PD+S  F +  DAS  A GAV  Q       P+ YAS+     + N +TTE+E+ AI +A+  FR Y+ G   T+ +DH
Subjt:  KCLNAFKSLRQALISAPILVAPDWSLPFELMCDASNHAVGAVQGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSKVTIYSDH

Query:  FAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVTDHLSRLENKEV-----------------QESWSDIEERFPDEHVMNAESQEPWYADIVN
          + YL +  +   +L R  L L+E++  ++  KG +N V D LSR+  KE+                 Q+S +  E+    +      S+   Y  I N
Subjt:  FAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVTDHLSRLENKEV-----------------QESWSDIEERFPDEHVMNAESQEPWYADIVN

Query:  YLVCNQLPEEFN--------------------------------------------------AQQKKKLRHES--KFYCWDEPYLYRLGPDHI--LRRCV
          V   +  + N                                                  A  KK   H S  KF       L  L    +  + +  
Subjt:  YLVCNQLPEEFN--------------------------------------------------AQQKKKLRHES--KFYCWDEPYLYRLGPDHI--LRRCV

Query:  PEYEMHSILRSCH-EARYGGHFEGQRTATKVLQSGYFWPALFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGIDFMGPFLPS-CSNQYIL
         E E  +IL + H +   GGH    +T  KV +  Y+W  + K  + Y   C +CQ+     +       +      FD   +D +GP   S   N+Y +
Subjt:  PEYEMHSILRSCH-EARYGGHFEGQRTATKVLQSGYFWPALFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGIDFMGPFLPS-CSNQYIL

Query:  VAVDYVSKWVEAAACARNDANVVSKFLKKQIFSQFGTPRAIISDEGTHFINRIITNLLTKFNVSHRVAIAYHPQTNDQAEITNSEIKSILEKVVSTSRKD
          +  ++K++ A   A   A  V+K + +    ++G  +  I+D GT + N IIT+L     + +  + A+H QT    E ++  +   +   +ST + D
Subjt:  VAVDYVSKWVEAAACARNDANVVSKFLKKQIFSQFGTPRAIISDEGTHFINRIITNLLTKFNVSHRVAIAYHPQTNDQAEITNSEIKSILEKVVSTSRKD

Query:  WTERLDEALWAYRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKQ----YKERTKKWHDKNISK
        W   L   ++ + T         PY LVFG+  +LP    +K        N+D  A     +L++         AY  A++    +KE+ K+ +D  +  
Subjt:  WTERLDEALWAYRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKQ----YKERTKKWHDKNISK

Query:  KILYVGQKVLLFNSRLRLFPGKLKSRWSGPFIIKEVFPHGAI-MLTNENGTTSFKVNGQRVKPYH
          L VG KVLL N        KL  +++GP+ I+ +  +  I +LTN+N      V+  R+K +H
Subjt:  KILYVGQKVLLFNSRLRLFPGKLKSRWSGPFIIKEVFPHGAI-MLTNENGTTSFKVNGQRVKPYH

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein7.1e-8229.67Show/hide
Query:  ADIRGISPSYCMHKIRLEEGKDGSVERQRRLNPAMKEVVKKEIIKWLDAGVIYPISDSSWVSPVQCVPKKGGMTVIMNSKNELIPTRTVTGWRICMDYRK
        ADI  I      H I ++ G      +   +    ++ + K + K LD   I P S S   SPV  VPKK G                   +R+C+DYR 
Subjt:  ADIRGISPSYCMHKIRLEEGKDGSVERQRRLNPAMKEVVKKEIIKWLDAGVIYPISDSSWVSPVQCVPKKGGMTVIMNSKNELIPTRTVTGWRICMDYRK

Query:  LNAATKKDHFPLAFIDQMLDRLAGNEFFCFLDGYSCYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFIDDFSV
        LN AT  D FPL  ID +L R+   + F  LD +S Y+QI + P+D+ KT F  P G + +  MPFGL NAP TF R M   F D   + V V++DD  +
Subjt:  LNAATKKDHFPLAFIDQMLDRLAGNEFFCFLDGYSCYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFIDDFSV

Query:  FGKSYDECLTNLERVLKRCEDTNLVLNWEKCHFMVTKGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELL
        F +S +E   +L+ VL+R ++ NL++  +KC F   +   LG+ I    +   Q K  AI   P P  VK  + FLG   +YRRFI   S++A+P+   L
Subjt:  FGKSYDECLTNLERVLKRCEDTNLVLNWEKCHFMVTKGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELL

Query:  EVNREFNFDGKCLNAFKSLRQALISAPILVAPDWSLPFELMCDASNHAVGAV--QGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFAVDKFRAY
         +  +  +  K   A + L+ AL ++P+LV  +    + L  DAS   +GAV  +   K K++  + Y SK+L S+Q+NY   E E+L I+ A+  FR  
Subjt:  EVNREFNFDGKCLNAFKSLRQALISAPILVAPDWSLPFELMCDASNHAVGAV--QGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFAVDKFRAY

Query:  LIGSKVTIYSDHFAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVTDHLSRLENKEVQESWSDIEERFPDEHVMNAESQEPWYADIVNYLVCN
        L G   T+ +DH ++  L  K +   R+ RW+  L  +D  ++   G +N V D +SR       E+   I+      +  +          +      N
Subjt:  LIGSKVTIYSDHFAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVTDHLSRLENKEVQESWSDIEERFPDEHVMNAESQEPWYADIVNYLVCN

Query:  QLPEE---FNAQQKKKLRHES--KFYCWDEPYLYRLGPDHILRRCVPEYEMHSILRSCHE-ARYGGHFEGQRTATKVLQSGYFWPALFKDARAYAVACDR
          PE+   F + QKK    E+  K Y  ++  +Y     +  R  VP  + ++++R  H+   +GGHF    T  K+    Y+WP L      Y   C +
Subjt:  QLPEE---FNAQQKKKLRHES--KFYCWDEPYLYRLGPDHILRRCVPEYEMHSILRSCHE-ARYGGHFEGQRTATKVLQSGYFWPALFKDARAYAVACDR

Query:  CQRTGNISNRNEMPLNSM--LEVELFDVWGIDFMGPFLPSCSN-QYILVAVDYVSKWVEAAACARN-DANVVSKFLKKQIFSQFGTPRAIISDEGTHFIN
        CQ   +   R    L  +   E    D+  +DF+    P+ +N   ILV VD  SK     A  +  DA  +   L + IFS  G PR I SD       
Subjt:  CQRTGNISNRNEMPLNSM--LEVELFDVWGIDFMGPFLPSCSN-QYILVAVDYVSKWVEAAACARN-DANVVSKFLKKQIFSQFGTPRAIISDEGTHFIN

Query:  RIITNLLTKFNVSHRVAIAYHPQTNDQAEITNSEIKSILEKVVSTSRKDWTERLDEALWAYRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLN
             L  +  +   ++ A HPQT+ Q+E T   +  +L   VST+ ++W   L +  + Y +     +G SP+ +  G   + P       + A     
Subjt:  RIITNLLTKFNVSHRVAIAYHPQTNDQAEITNSEIKSILEKVVSTSRKDWTERLDEALWAYRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLN

Query:  LDQEASGEARKLQLNELLEWRHSAYENAKQYKERTK
        ++     +A  +Q  E LE  H+  E      +R K
Subjt:  LDQEASGEARKLQLNELLEWRHSAYENAKQYKERTK

Q99315 Transposon Ty3-G Gag-Pol polyprotein4.2e-8228.54Show/hide
Query:  ADIRGISPSYCMHKIRLEEGKDGSVERQRRLNPAMKEVVKKEIIKWLDAGVIYPISDSSWVSPVQCVPKKGGMTVIMNSKNELIPTRTVTGWRICMDYRK
        ADI  I      H I ++ G      +   +    ++ + K + K LD   I P S S   SPV  VPKK G                   +R+C+DYR 
Subjt:  ADIRGISPSYCMHKIRLEEGKDGSVERQRRLNPAMKEVVKKEIIKWLDAGVIYPISDSSWVSPVQCVPKKGGMTVIMNSKNELIPTRTVTGWRICMDYRK

Query:  LNAATKKDHFPLAFIDQMLDRLAGNEFFCFLDGYSCYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFIDDFSV
        LN AT  D FPL  ID +L R+   + F  LD +S Y+QI + P+D+ KT F  P G + +  MPFGL NAP TF R M   F D   + V V++DD  +
Subjt:  LNAATKKDHFPLAFIDQMLDRLAGNEFFCFLDGYSCYNQIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFIDDFSV

Query:  FGKSYDECLTNLERVLKRCEDTNLVLNWEKCHFMVTKGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELL
        F +S +E   +L+ VL+R ++ NL++  +KC F   +   LG+ I    +   Q K  AI   P P  VK  + FLG   +YRRFI   S++A+P+   L
Subjt:  FGKSYDECLTNLERVLKRCEDTNLVLNWEKCHFMVTKGIVLGHKISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELL

Query:  EVNREFNFDGKCLNAFKSLRQALISAPILVAPDWSLPFELMCDASNHAVGAV--QGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFAVDKFRAY
         +  +  +  K   A   L+ AL ++P+LV  +    + L  DAS   +GAV  +   K K++  + Y SK+L S+Q+NY   E E+L I+ A+  FR  
Subjt:  EVNREFNFDGKCLNAFKSLRQALISAPILVAPDWSLPFELMCDASNHAVGAV--QGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFAVDKFRAY

Query:  LIGSKVTIYSDHFAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVTDHLSRLENKEVQESWSDIEERFPDEHVMNAESQEPWYADIVNYLVCN
        L G   T+ +DH ++  L  K +   R+ RW+  L  +D  ++   G +N V D +SR       E+   I+      +  +          +      N
Subjt:  LIGSKVTIYSDHFAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVTDHLSRLENKEVQESWSDIEERFPDEHVMNAESQEPWYADIVNYLVCN

Query:  QLPEE---FNAQQKKKLRHES--KFYCWDEPYLYRLGPDHILRRCVPEYEMHSILRSCHE-ARYGGHFEGQRTATKVLQSGYFWPALFKDARAYAVACDR
          PE+   F + QKK    E+  K Y  ++  +Y     +  R  VP  + ++++R  H+   +GGHF    T  K+    Y+WP L      Y   C +
Subjt:  QLPEE---FNAQQKKKLRHES--KFYCWDEPYLYRLGPDHILRRCVPEYEMHSILRSCHE-ARYGGHFEGQRTATKVLQSGYFWPALFKDARAYAVACDR

Query:  CQRTGNISNRNEMPLNSM--LEVELFDVWGIDFMGPFLPSCSN-QYILVAVDYVSKWVEAAACARN-DANVVSKFLKKQIFSQFGTPRAIISDEGTHFIN
        CQ   +   R    L  +   E    D+  +DF+    P+ +N   ILV VD  SK     A  +  DA  +   L + IFS  G PR I SD       
Subjt:  CQRTGNISNRNEMPLNSM--LEVELFDVWGIDFMGPFLPSCSN-QYILVAVDYVSKWVEAAACARN-DANVVSKFLKKQIFSQFGTPRAIISDEGTHFIN

Query:  RIITNLLTKFNVSHRVAIAYHPQTNDQAEITNSEIKSILEKVVSTSRKDWTERLDEALWAYRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLN
             L  +  +   ++ A HPQT+ Q+E T   +  +L    ST+ ++W   L +  + Y +     +G SP+ +  G   + P       + A     
Subjt:  RIITNLLTKFNVSHRVAIAYHPQTNDQAEITNSEIKSILEKVVSTSRKDWTERLDEALWAYRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLN

Query:  LDQEASGEARKLQLNELLEWRHSAYENAKQYKERTKKWHDKNISKKILYVGQKVLLFNSRL--RLFPGKLKSRWSGPF-IIKEVFPHGAIMLTNENGTTS
        ++     +A  +Q  E LE  H+  E      +R K          +L +G  VL+       +    K++  + GPF ++K++  +   +  N +    
Subjt:  LDQEASGEARKLQLNELLEWRHSAYENAKQYKERTKKWHDKNISKKILYVGQKVLLFNSRL--RLFPGKLKSRWSGPF-IIKEVFPHGAIMLTNENGTTS

Query:  FKVNGQRVKPY
          +N Q +K +
Subjt:  FKVNGQRVKPY

Arabidopsis top hitse value%identityAlignment
ATMG00750.1 GAG/POL/ENV polyprotein1.2e-1560.71Show/hide
Query:  VLQSGYFWPALFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGIDFM
        VLQ+G++WP  FKDA  +  +CD CQR GN + RNEMP + +LEVE+FDVWGI FM
Subjt:  VLQSGYFWPALFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGIDFM

ATMG00860.1 DNA/RNA polymerases superfamily protein1.1e-1837.88Show/hide
Query:  LTNLERVLKRCEDTNLVLNWEKCHFMVTKGIVLGHK--ISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELLEVNREF
        + +L  VL+  E      N +KC F   +   LGH+  IS  G+  D AK++A+   P P N   LR FLG  G+YRRF+K + ++ +PL+ELL+ N   
Subjt:  LTNLERVLKRCEDTNLVLNWEKCHFMVTKGIVLGHK--ISKVGLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELLEVNREF

Query:  NFDGKCLNAFKSLRQALISAPILVAPDWSLPF
         +      AFK+L+ A+ + P+L  PD  LPF
Subjt:  NFDGKCLNAFKSLRQALISAPILVAPDWSLPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACAAGACTTACACGCAAGCCAAAGAGATCCTCGACAGAATTTCGAGGAATACTGATGAATGGGTGGATGATGGATATGGATCCAGGTATATGGATAAAAAACGATC
GCAAGTTGGAATGATTGAAACGGATGCGACTAGTAACCTTTCAGCTCATATCGCTAACATGACATCCTTGCTGCAAACCAAAGCGTTGAACAACCAAGGAGGAACGGTTG
CATCAATGGATGCAATGAACACAGTAAATGCAACAGCGTCTAACAACCCGTATTCTAAGACTTATAATCCGGGTTGGAGGAACCATCCTAACTTTGGATGGGGAGGAAAC
CAATCACAGATGCAGGGGACAGGACAACAGCCGCAGAGAAACAATAATCCTGTGTATAACCAAGGTGTTCAAGGGCATCAAGTACCACGAAATCAGTCAGCAAATTCATC
ATCATCATCCCTTGAAGCACTTATGTGTCAAATGATTACTCAAAATGAAGAAACGCGTAAGTCGCTAGAAGCGATGCTCCAAAATCAGATGGGAGAAATAAAGAGCCATG
CCATTACAATTAGAAATCTAGAACTTCAAGTGGGACAAATGGCAAGTGAACGCAATACGAGACCCCAAGGAGCATTGCCTAGTAATACCGAAGCCCCACGAGGTAACGGT
AAGGAACAATGTCAAGCTGTGACATTGAGGAGTGAGAAGGTTCTTCCCACTGAACCGCAAGAGCAGAGGATGCAAGGAAATCTGCAGCCTCAAATCACTCCACTACAAAC
TCCCGCAACATCTATCCATGAACCGCAATCTTCATCCAAAACCACAACCTCTAAAGAACAAGAAAATGGAAAGGGGAGAGAAAAAGCAACGCAGGCAAACCCTACAAAAT
TTCTGGACGTTCTCAAACAACTTCACATCAACATTCCGTTGGTGGAAGCCCTGGAACAAATGCTGAGCTATGTCAAATTTCTGAAAGACATACTATCTAAGAAAAGGAGG
CTCGGAAAGTTTGAGACAGTCGCATTAACGCGGGAATGCAGCGCATTGTTTAAAAGCAATATACCCGCTAAAATGAAAGATCTAGGGAGTTTCACATTGGCCTGTTCTAT
AGGAGGAATGGACGTAGGTCAGGCACTATACGACTTAGGGGCCAGTATAAATTTAATGCCACTGTCTATATTCAAGAAGTTGGGTATAGGGGAAGCACGGCCAACTACAG
TCACACTTCAGCTCGCTGACCGTTCCATTAAGCATCCAGAGGGGAAAATCGAGGATGTTCTTGTGAAGACCTTTCTTAGCTACTGGATGCGCCTGATTGACGTCCAGGAA
GGGGAGTTAACTATTCGAGTGGATGACCAGCAGGACTTAGGCAACAGAACAACGGAGAAAACAAAGTCGTCCTTAGAAGAACCACCAACCCTTGAGCTTAAGACACTGCC
AACGCATTTAAAATATGCATATTTGGGGGAAGAGAGCACGCTACCGGTCATCATATCCTCTGCGTTGACGACAGAATGTGAGAAGGCTTTGCTGGATATCCTGAAAAACC
ATATCAAAGCCATAGGATGGACCTTAGCGGATATCAGAGGAATAAGCCCTTCGTACTGCATGCACAAAATTCGGTTGGAAGAGGGTAAGGATGGTTCAGTTGAGAGGCAG
CGCAGGCTCAACCCCGCAATGAAGGAGGTTGTGAAGAAAGAAATCATCAAATGGCTCGATGCAGGGGTCATTTATCCAATCTCCGACAGCAGCTGGGTGAGCCCAGTCCA
GTGTGTACCAAAGAAAGGGGGGATGACAGTGATCATGAATAGCAAGAATGAACTAATACCAACAAGAACGGTCACTGGCTGGCGCATCTGTATGGACTACCGCAAGCTGA
ATGCGGCAACAAAGAAGGACCACTTTCCACTGGCATTCATCGACCAGATGCTAGATAGGCTGGCAGGTAATGAATTTTTCTGCTTCCTGGATGGATATTCCTGCTATAAC
CAGATCATGATAGCTCCAGAAGATCAAGAGAAGACGACGTTCACATGTCCATATGGCACCTTCGCGTTCAGACGCATGCCATTCGGGCTATGTAATGCACCAGGGACGTT
CCAAAGGTGCATGATGGCAATATTCTCGGACTATTTGGAACAGTCAGTAGAGGTGTTTATAGACGACTTTTCAGTATTTGGGAAATCATATGATGAATGCTTGACAAATC
TAGAACGAGTGCTAAAACGATGCGAAGACACAAATCTAGTCCTCAACTGGGAAAAATGCCATTTTATGGTTACTAAGGGTATTGTGTTGGGGCATAAAATCTCAAAGGTT
GGATTGGAAGTGGATCAGGCAAAAATAGATGCGATAGCAAAACTCCCAGCCCCCACAAATGTTAAGACTTTGCGAAGTTTCCTAGGCCATGCGGGCTTCTATAGAAGATT
CATTAAGGGATTCTCGCAAGTAGCAAAACCACTGAGCGAATTGCTAGAAGTCAACAGGGAATTCAATTTCGACGGTAAATGCTTAAACGCATTTAAGTCTCTAAGGCAAG
CTCTAATTTCAGCACCTATTTTAGTTGCACCGGATTGGTCTCTCCCGTTTGAATTAATGTGCGACGCAAGTAACCATGCGGTGGGAGCAGTACAGGGACAGAGAAAAGAG
AAAATCATGCACCCGATCTATTATGCTAGTAAAACACTGAATTCATCTCAGGAGAACTACACTACTACTGAGAAAGAAATGTTAGCCATAGTCTTTGCGGTTGACAAATT
TAGAGCATACTTGATAGGGTCAAAGGTGACCATCTATAGTGATCATTTTGCAATCAAGTATTTGATGGCGAAAAAGGACGCAAAGCCTAGACTTATCCGTTGGGTCCTAC
TGTTGCAAGAATTCGATTTGGAGATTAAAGATAGAAAGGGAACCGAGAATCAGGTTACGGACCACCTATCTCGGTTGGAAAATAAAGAGGTCCAGGAGAGTTGGAGTGAT
ATAGAGGAACGATTCCCAGACGAGCATGTAATGAACGCAGAGAGTCAGGAACCGTGGTACGCAGACATAGTCAACTATCTGGTTTGCAACCAATTGCCTGAAGAATTTAA
CGCTCAACAAAAGAAAAAGCTCCGACATGAAAGTAAGTTCTACTGCTGGGATGAGCCATATCTATACAGACTTGGCCCTGACCACATACTGCGTCGATGCGTTCCAGAAT
ATGAAATGCATAGCATTCTGAGAAGCTGTCATGAAGCACGTTACGGAGGACACTTTGAAGGGCAGAGAACAGCTACAAAGGTGCTGCAAAGTGGGTATTTCTGGCCCGCA
TTATTCAAAGACGCAAGGGCATATGCGGTAGCTTGCGATCGTTGTCAGAGAACAGGCAACATTTCCAACCGGAATGAGATGCCTCTGAATTCAATGTTAGAAGTTGAGTT
GTTCGACGTATGGGGAATCGACTTCATGGGACCATTCCTTCCTTCTTGCAGCAATCAATATATCTTAGTAGCGGTCGACTACGTATCTAAGTGGGTAGAAGCAGCAGCCT
GTGCAAGGAATGACGCAAACGTAGTGTCCAAGTTCTTAAAGAAACAAATCTTCTCTCAATTTGGGACACCAAGGGCGATAATTAGTGATGAAGGTACGCATTTTATAAAT
CGCATAATCACTAATTTACTGACTAAGTTTAATGTTTCGCACAGGGTAGCAATTGCCTATCACCCACAGACAAACGACCAAGCTGAAATAACAAACAGTGAGATCAAGTC
CATACTAGAAAAGGTCGTGAGCACATCAAGGAAAGATTGGACAGAGAGATTAGATGAAGCTCTATGGGCCTACAGAACGGCATTCAAAACACCGATAGGCATGTCACCCT
ATGCGTTGGTGTTTGGTAAAGCATGCCATCTCCCACTTGAGTTGGAACACAAGGCCATCTGGGCCATGAAGAAGCTCAATTTAGACCAGGAGGCAAGCGGAGAAGCCAGA
AAGCTTCAACTAAATGAACTCCTGGAGTGGAGGCACTCAGCTTATGAAAACGCAAAGCAGTACAAAGAAAGGACCAAGAAATGGCACGACAAGAACATTAGTAAGAAAAT
TCTATACGTTGGCCAGAAGGTCCTGTTATTTAATTCAAGGTTGCGTTTGTTCCCAGGTAAGCTGAAGTCTCGATGGTCGGGTCCATTCATAATCAAGGAAGTGTTTCCGC
ATGGTGCGATCATGCTGACAAATGAAAATGGAACCACATCCTTCAAGGTCAATGGACAAAGGGTAAAGCCCTACCACATTGGAGAGTTCGAAATTAACAAGACTTCCATT
GACCTACGCGAACCTGCGAGGCCTTTTGCTTCTCCGGTCAGACCAAGAAATCCAAACCCCAACCCACCGTTTATCCCTAACTTCTCCGCGTCTAGGATCCGGGCGACTGC
CCAGAGACCACCCATCTACCCATTTTTTCGGCCATTCGCCAGAACCCCTACCCGTCCACCAGTAGTCGACGACTCCGACGACTGTGTGGAAATTCTAAACATCCGTCCTT
TGAGCCAAATCAACCCAATCCCTACCCCAAAACCCAAAAGCCGAGTCAAGTCCTGCATTTCCCGACCCAGGATGAAGAACATGAAGGAATTCTCTGCCGCAAACTACAAT
GTCGGAAGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGACAAGACTTACACGCAAGCCAAAGAGATCCTCGACAGAATTTCGAGGAATACTGATGAATGGGTGGATGATGGATATGGATCCAGGTATATGGATAAAAAACGATC
GCAAGTTGGAATGATTGAAACGGATGCGACTAGTAACCTTTCAGCTCATATCGCTAACATGACATCCTTGCTGCAAACCAAAGCGTTGAACAACCAAGGAGGAACGGTTG
CATCAATGGATGCAATGAACACAGTAAATGCAACAGCGTCTAACAACCCGTATTCTAAGACTTATAATCCGGGTTGGAGGAACCATCCTAACTTTGGATGGGGAGGAAAC
CAATCACAGATGCAGGGGACAGGACAACAGCCGCAGAGAAACAATAATCCTGTGTATAACCAAGGTGTTCAAGGGCATCAAGTACCACGAAATCAGTCAGCAAATTCATC
ATCATCATCCCTTGAAGCACTTATGTGTCAAATGATTACTCAAAATGAAGAAACGCGTAAGTCGCTAGAAGCGATGCTCCAAAATCAGATGGGAGAAATAAAGAGCCATG
CCATTACAATTAGAAATCTAGAACTTCAAGTGGGACAAATGGCAAGTGAACGCAATACGAGACCCCAAGGAGCATTGCCTAGTAATACCGAAGCCCCACGAGGTAACGGT
AAGGAACAATGTCAAGCTGTGACATTGAGGAGTGAGAAGGTTCTTCCCACTGAACCGCAAGAGCAGAGGATGCAAGGAAATCTGCAGCCTCAAATCACTCCACTACAAAC
TCCCGCAACATCTATCCATGAACCGCAATCTTCATCCAAAACCACAACCTCTAAAGAACAAGAAAATGGAAAGGGGAGAGAAAAAGCAACGCAGGCAAACCCTACAAAAT
TTCTGGACGTTCTCAAACAACTTCACATCAACATTCCGTTGGTGGAAGCCCTGGAACAAATGCTGAGCTATGTCAAATTTCTGAAAGACATACTATCTAAGAAAAGGAGG
CTCGGAAAGTTTGAGACAGTCGCATTAACGCGGGAATGCAGCGCATTGTTTAAAAGCAATATACCCGCTAAAATGAAAGATCTAGGGAGTTTCACATTGGCCTGTTCTAT
AGGAGGAATGGACGTAGGTCAGGCACTATACGACTTAGGGGCCAGTATAAATTTAATGCCACTGTCTATATTCAAGAAGTTGGGTATAGGGGAAGCACGGCCAACTACAG
TCACACTTCAGCTCGCTGACCGTTCCATTAAGCATCCAGAGGGGAAAATCGAGGATGTTCTTGTGAAGACCTTTCTTAGCTACTGGATGCGCCTGATTGACGTCCAGGAA
GGGGAGTTAACTATTCGAGTGGATGACCAGCAGGACTTAGGCAACAGAACAACGGAGAAAACAAAGTCGTCCTTAGAAGAACCACCAACCCTTGAGCTTAAGACACTGCC
AACGCATTTAAAATATGCATATTTGGGGGAAGAGAGCACGCTACCGGTCATCATATCCTCTGCGTTGACGACAGAATGTGAGAAGGCTTTGCTGGATATCCTGAAAAACC
ATATCAAAGCCATAGGATGGACCTTAGCGGATATCAGAGGAATAAGCCCTTCGTACTGCATGCACAAAATTCGGTTGGAAGAGGGTAAGGATGGTTCAGTTGAGAGGCAG
CGCAGGCTCAACCCCGCAATGAAGGAGGTTGTGAAGAAAGAAATCATCAAATGGCTCGATGCAGGGGTCATTTATCCAATCTCCGACAGCAGCTGGGTGAGCCCAGTCCA
GTGTGTACCAAAGAAAGGGGGGATGACAGTGATCATGAATAGCAAGAATGAACTAATACCAACAAGAACGGTCACTGGCTGGCGCATCTGTATGGACTACCGCAAGCTGA
ATGCGGCAACAAAGAAGGACCACTTTCCACTGGCATTCATCGACCAGATGCTAGATAGGCTGGCAGGTAATGAATTTTTCTGCTTCCTGGATGGATATTCCTGCTATAAC
CAGATCATGATAGCTCCAGAAGATCAAGAGAAGACGACGTTCACATGTCCATATGGCACCTTCGCGTTCAGACGCATGCCATTCGGGCTATGTAATGCACCAGGGACGTT
CCAAAGGTGCATGATGGCAATATTCTCGGACTATTTGGAACAGTCAGTAGAGGTGTTTATAGACGACTTTTCAGTATTTGGGAAATCATATGATGAATGCTTGACAAATC
TAGAACGAGTGCTAAAACGATGCGAAGACACAAATCTAGTCCTCAACTGGGAAAAATGCCATTTTATGGTTACTAAGGGTATTGTGTTGGGGCATAAAATCTCAAAGGTT
GGATTGGAAGTGGATCAGGCAAAAATAGATGCGATAGCAAAACTCCCAGCCCCCACAAATGTTAAGACTTTGCGAAGTTTCCTAGGCCATGCGGGCTTCTATAGAAGATT
CATTAAGGGATTCTCGCAAGTAGCAAAACCACTGAGCGAATTGCTAGAAGTCAACAGGGAATTCAATTTCGACGGTAAATGCTTAAACGCATTTAAGTCTCTAAGGCAAG
CTCTAATTTCAGCACCTATTTTAGTTGCACCGGATTGGTCTCTCCCGTTTGAATTAATGTGCGACGCAAGTAACCATGCGGTGGGAGCAGTACAGGGACAGAGAAAAGAG
AAAATCATGCACCCGATCTATTATGCTAGTAAAACACTGAATTCATCTCAGGAGAACTACACTACTACTGAGAAAGAAATGTTAGCCATAGTCTTTGCGGTTGACAAATT
TAGAGCATACTTGATAGGGTCAAAGGTGACCATCTATAGTGATCATTTTGCAATCAAGTATTTGATGGCGAAAAAGGACGCAAAGCCTAGACTTATCCGTTGGGTCCTAC
TGTTGCAAGAATTCGATTTGGAGATTAAAGATAGAAAGGGAACCGAGAATCAGGTTACGGACCACCTATCTCGGTTGGAAAATAAAGAGGTCCAGGAGAGTTGGAGTGAT
ATAGAGGAACGATTCCCAGACGAGCATGTAATGAACGCAGAGAGTCAGGAACCGTGGTACGCAGACATAGTCAACTATCTGGTTTGCAACCAATTGCCTGAAGAATTTAA
CGCTCAACAAAAGAAAAAGCTCCGACATGAAAGTAAGTTCTACTGCTGGGATGAGCCATATCTATACAGACTTGGCCCTGACCACATACTGCGTCGATGCGTTCCAGAAT
ATGAAATGCATAGCATTCTGAGAAGCTGTCATGAAGCACGTTACGGAGGACACTTTGAAGGGCAGAGAACAGCTACAAAGGTGCTGCAAAGTGGGTATTTCTGGCCCGCA
TTATTCAAAGACGCAAGGGCATATGCGGTAGCTTGCGATCGTTGTCAGAGAACAGGCAACATTTCCAACCGGAATGAGATGCCTCTGAATTCAATGTTAGAAGTTGAGTT
GTTCGACGTATGGGGAATCGACTTCATGGGACCATTCCTTCCTTCTTGCAGCAATCAATATATCTTAGTAGCGGTCGACTACGTATCTAAGTGGGTAGAAGCAGCAGCCT
GTGCAAGGAATGACGCAAACGTAGTGTCCAAGTTCTTAAAGAAACAAATCTTCTCTCAATTTGGGACACCAAGGGCGATAATTAGTGATGAAGGTACGCATTTTATAAAT
CGCATAATCACTAATTTACTGACTAAGTTTAATGTTTCGCACAGGGTAGCAATTGCCTATCACCCACAGACAAACGACCAAGCTGAAATAACAAACAGTGAGATCAAGTC
CATACTAGAAAAGGTCGTGAGCACATCAAGGAAAGATTGGACAGAGAGATTAGATGAAGCTCTATGGGCCTACAGAACGGCATTCAAAACACCGATAGGCATGTCACCCT
ATGCGTTGGTGTTTGGTAAAGCATGCCATCTCCCACTTGAGTTGGAACACAAGGCCATCTGGGCCATGAAGAAGCTCAATTTAGACCAGGAGGCAAGCGGAGAAGCCAGA
AAGCTTCAACTAAATGAACTCCTGGAGTGGAGGCACTCAGCTTATGAAAACGCAAAGCAGTACAAAGAAAGGACCAAGAAATGGCACGACAAGAACATTAGTAAGAAAAT
TCTATACGTTGGCCAGAAGGTCCTGTTATTTAATTCAAGGTTGCGTTTGTTCCCAGGTAAGCTGAAGTCTCGATGGTCGGGTCCATTCATAATCAAGGAAGTGTTTCCGC
ATGGTGCGATCATGCTGACAAATGAAAATGGAACCACATCCTTCAAGGTCAATGGACAAAGGGTAAAGCCCTACCACATTGGAGAGTTCGAAATTAACAAGACTTCCATT
GACCTACGCGAACCTGCGAGGCCTTTTGCTTCTCCGGTCAGACCAAGAAATCCAAACCCCAACCCACCGTTTATCCCTAACTTCTCCGCGTCTAGGATCCGGGCGACTGC
CCAGAGACCACCCATCTACCCATTTTTTCGGCCATTCGCCAGAACCCCTACCCGTCCACCAGTAGTCGACGACTCCGACGACTGTGTGGAAATTCTAAACATCCGTCCTT
TGAGCCAAATCAACCCAATCCCTACCCCAAAACCCAAAAGCCGAGTCAAGTCCTGCATTTCCCGACCCAGGATGAAGAACATGAAGGAATTCTCTGCCGCAAACTACAAT
GTCGGAAGCTAG
Protein sequenceShow/hide protein sequence
MDKTYTQAKEILDRISRNTDEWVDDGYGSRYMDKKRSQVGMIETDATSNLSAHIANMTSLLQTKALNNQGGTVASMDAMNTVNATASNNPYSKTYNPGWRNHPNFGWGGN
QSQMQGTGQQPQRNNNPVYNQGVQGHQVPRNQSANSSSSSLEALMCQMITQNEETRKSLEAMLQNQMGEIKSHAITIRNLELQVGQMASERNTRPQGALPSNTEAPRGNG
KEQCQAVTLRSEKVLPTEPQEQRMQGNLQPQITPLQTPATSIHEPQSSSKTTTSKEQENGKGREKATQANPTKFLDVLKQLHINIPLVEALEQMLSYVKFLKDILSKKRR
LGKFETVALTRECSALFKSNIPAKMKDLGSFTLACSIGGMDVGQALYDLGASINLMPLSIFKKLGIGEARPTTVTLQLADRSIKHPEGKIEDVLVKTFLSYWMRLIDVQE
GELTIRVDDQQDLGNRTTEKTKSSLEEPPTLELKTLPTHLKYAYLGEESTLPVIISSALTTECEKALLDILKNHIKAIGWTLADIRGISPSYCMHKIRLEEGKDGSVERQ
RRLNPAMKEVVKKEIIKWLDAGVIYPISDSSWVSPVQCVPKKGGMTVIMNSKNELIPTRTVTGWRICMDYRKLNAATKKDHFPLAFIDQMLDRLAGNEFFCFLDGYSCYN
QIMIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPGTFQRCMMAIFSDYLEQSVEVFIDDFSVFGKSYDECLTNLERVLKRCEDTNLVLNWEKCHFMVTKGIVLGHKISKV
GLEVDQAKIDAIAKLPAPTNVKTLRSFLGHAGFYRRFIKGFSQVAKPLSELLEVNREFNFDGKCLNAFKSLRQALISAPILVAPDWSLPFELMCDASNHAVGAVQGQRKE
KIMHPIYYASKTLNSSQENYTTTEKEMLAIVFAVDKFRAYLIGSKVTIYSDHFAIKYLMAKKDAKPRLIRWVLLLQEFDLEIKDRKGTENQVTDHLSRLENKEVQESWSD
IEERFPDEHVMNAESQEPWYADIVNYLVCNQLPEEFNAQQKKKLRHESKFYCWDEPYLYRLGPDHILRRCVPEYEMHSILRSCHEARYGGHFEGQRTATKVLQSGYFWPA
LFKDARAYAVACDRCQRTGNISNRNEMPLNSMLEVELFDVWGIDFMGPFLPSCSNQYILVAVDYVSKWVEAAACARNDANVVSKFLKKQIFSQFGTPRAIISDEGTHFIN
RIITNLLTKFNVSHRVAIAYHPQTNDQAEITNSEIKSILEKVVSTSRKDWTERLDEALWAYRTAFKTPIGMSPYALVFGKACHLPLELEHKAIWAMKKLNLDQEASGEAR
KLQLNELLEWRHSAYENAKQYKERTKKWHDKNISKKILYVGQKVLLFNSRLRLFPGKLKSRWSGPFIIKEVFPHGAIMLTNENGTTSFKVNGQRVKPYHIGEFEINKTSI
DLREPARPFASPVRPRNPNPNPPFIPNFSASRIRATAQRPPIYPFFRPFARTPTRPPVVDDSDDCVEILNIRPLSQINPIPTPKPKSRVKSCISRPRMKNMKEFSAANYN
VGS