; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG11G010280 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG11G010280
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionReverse transcriptase
Genome locationCG_Chr11:19194981..19205327
RNA-Seq ExpressionClCG11G010280
SyntenyClCG11G010280
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR012337 - Ribonuclease H-like superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035816.1 pol protein [Cucumis melo var. makuwa]3.5e-26042.43Show/hide
Query:  EEDFQTLKFKTRTKRQ-NHSAPIPLLVPVPPLVPVPLVILLEFFAMATTKGEADLFVDAKHLRDFRKYNRYTFDGSLKDPIKAKL---------------
        E+ F+ L  + R ++Q    AP P   P P  VPV   ++L+            L  +AKHLRDFRKYN  TFDGSL+DP +A+L               
Subjt:  EEDFQTLKFKTRTKRQ-NHSAPIPLLVPVPPLVPVPLVILLEFFAMATTKGEADLFVDAKHLRDFRKYNRYTFDGSLKDPIKAKL---------------

Query:  -------------------------MIGVGSKLVKWEQFKENFYDKYFFANLIRLKEKEVLSLEQDNMSVEKYDQKFEQLSHFAPTMVATKAERMKRFVQ
                                 M+G  S+ + W+QFKE+FY K+F A+L   K +E L+LEQD+M+VE+YD +F+ LS FAP M+A +A R  +FV+
Subjt:  -------------------------MIGVGSKLVKWEQFKENFYDKYFFANLIRLKEKEVLSLEQDNMSVEKYDQKFEQLSHFAPTMVATKAERMKRFVQ

Query:  GLRDGLQGLVQAFKSATQAEDL----------------------------------------------------------------C--EGDIADKCPRS
        GLR  +QGLV+AF+  T A+ L                                                                C  EG  AD+CP  
Subjt:  GLRDGLQGLVQAFKSATQAEDL----------------------------------------------------------------C--EGDIADKCPRS

Query:  NVTDGGGQHMGSYH------GSSNKSKHSG---DSCIPI-------------SHSFLSFVFV-----------------------------------EIA
           +   Q  G+ H       +  +++ +G      +P+             SHSF+S  FV                                   EIA
Subjt:  NVTDGGGQHMGSYH------GSSNKSKHSG---DSCIPI-------------SHSFLSFVFV-----------------------------------EIA

Query:  NHALEVTLIVLNMHDFDVILSMDWLAANHTNIDCSRKEVTFNPPKEANFKYRVARTIVLPKVISAMKANKLLNQGTWSILASVVGTKEPELALSSEPVVR
         H +EVTL+VL+M DFDVIL MDWLAANH NIDCSRKEVTFNPP   +FK++   +  LP+VISA++A KLL+QG+W ILASVV T+E +++LSSEPVVR
Subjt:  NHALEVTLIVLNMHDFDVILSMDWLAANHTNIDCSRKEVTFNPPKEANFKYRVARTIVLPKVISAMKANKLLNQGTWSILASVVGTKEPELALSSEPVVR

Query:  DYPNVFPEDLLRLPPHREINFTIELELGMASISKAPCRMPLAELKE------------FHTTECVTLGCTNVVLKKKDSSLPL---YKELNKL-------
        DYP++FPE+L  LPPHRE+ F IEL+ G   IS+AP +M  AELKE            F        G   + +KKKD S+ L   Y+ LNK+       
Subjt:  DYPNVFPEDLLRLPPHREINFTIELELGMASISKAPCRMPLAELKE------------FHTTECVTLGCTNVVLKKKDSSLPL---YKELNKL-------

Query:  -----------------------------RIKDSNIPKTTFYSRYEHYEFIVMSFGLTNVPTIFMILMNQVFKDFLDTFIIVFIDDIL------------
                                     RIKD ++PKT F SRY HYEFIVMSFGLTN P +FM LMN+VF++FLDTF+IVFIDDIL            
Subjt:  -----------------------------RIKDSNIPKTTFYSRYEHYEFIVMSFGLTNVPTIFMILMNQVFKDFLDTFIIVFIDDIL------------

Query:  ---VSFLGHVVLKGAVSIEPAKIEAVSSWSHLVTVRGA-SFSRLSRYYLRFVKDFSHIATPLTQLTRKGTTF-----------------VSAPVLIVPDG
           VSFLGHVV K  VS++PAKIEAV+ W+   TV    SF  L+ YY RFV++FS IATPLTQLTRKG  F                 V+APVL VPDG
Subjt:  ---VSFLGHVVLKGAVSIEPAKIEAVSSWSHLVTVRGA-SFSRLSRYYLRFVKDFSHIATPLTQLTRKGTTF-----------------VSAPVLIVPDG

Query:  SGGFVVYSDASKKGLGCVLMQHGKVVAYASCQLKNHEQNYPTHDLELA----------------------------------------------------
        SG FV+YSDASKKGLGCVLMQ GKVVAYAS QLK+HEQNYPTHDLELA                                                    
Subjt:  SGGFVVYSDASKKGLGCVLMQHGKVVAYASCQLKNHEQNYPTHDLELA----------------------------------------------------

Query:  -ILYHPGKANVVEDALSRKTVHLVALITSRPAYVKTLRVQRLQQKIIDAQQGDPYLVEKSHQVKVGQKSEFS----------GRQRVPIDNDVKNELLME
         ILYHPGKANVV DALSRK  H  ALIT +    +      +   IIDAQ  DPYLVEK    +VGQ  EFS           R  VP D+ VK ELL E
Subjt:  -ILYHPGKANVVEDALSRKTVHLVALITSRPAYVKTLRVQRLQQKIIDAQQGDPYLVEKSHQVKVGQKSEFS----------GRQRVPIDNDVKNELLME

Query:  AHSSPFSVHPGSTKMYQELKCYYWWHKMKREMKASAKAGRAIAAFEGARMEVGEGVRMEMGEGVYGFYCCAAMALFPESIRQPQGSIDAALRQGAKDSAA
        AHSSPFS+HPGSTKMYQ+LK  YWW  MK                            ME+ E V     C  +    +++RQ    +   L       + 
Subjt:  AHSSPFSVHPGSTKMYQELKCYYWWHKMKREMKASAKAGRAIAAFEGARMEVGEGVRMEMGEGVYGFYCCAAMALFPESIRQPQGSIDAALRQGAKDSAA

Query:  VALKQKRISMDAEGRKISRKARKLQTERKIQKGRGKEFEIVLVPFEVRRESQGVRDDAASIKNLFSTHKSAHFFPGKSTYSMNKWAQIYMGEDVRLHEVP
           K + +SMD     I+   R L           + F ++ V  +                      KSAHF PGKSTY+ +KWAQ+YM E VRLH VP
Subjt:  VALKQKRISMDAEGRKISRKARKLQTERKIQKGRGKEFEIVLVPFEVRRESQGVRDDAASIKNLFSTHKSAHFFPGKSTYSMNKWAQIYMGEDVRLHEVP

Query:  ILIVFDRDPRFTSSFWKSLQSALGTQLDFSTAFRPQTDDQTEHLNQ-------------------------FAYNNNYQTTIGMTPFEALYGK-------
        + IV DRD RFTS FWK LQ+A+GT+LDFSTAF PQTD QTE LNQ                         FAYNN+YQ TIGM PFEALYGK       
Subjt:  ILIVFDRDPRFTSSFWKSLQSALGTQLDFSTAFRPQTDDQTEHLNQ-------------------------FAYNNNYQTTIGMTPFEALYGK-------

Query:  -------RL---ELVQPNNETIQKSRARMQTAHSKQKSFAN---------VGDKVFLKVAPMKGVLRFGRKRKLSSRFMGPFEILERVGPVAYRLALSLS
               RL   ELVQ  NE IQK R+RM TA S+QKS+A+         VGDKVFLKVAPM+GVLRF R+ KLS RF+GPFEILER+GPVAYRLAL  S
Subjt:  -------RL---ELVQPNNETIQKSRARMQTAHSKQKSFAN---------VGDKVFLKVAPMKGVLRFGRKRKLSSRFMGPFEILERVGPVAYRLALSLS

Query:  LSIVHNVFHVSMLKKYVANPSHVVDYELLQLNEHLSYEEKPAKILAREVK---------------NHQFKEAMWEQEDEM
        LS VH+VFHVSML+KYV +PSHV+DYE L+++E+LSY E+P ++LAREVK               NH+ +EA WE+ED+M
Subjt:  LSIVHNVFHVSMLKKYVANPSHVVDYELLQLNEHLSYEEKPAKILAREVK---------------NHQFKEAMWEQEDEM

KAA0051357.1 pol protein [Cucumis melo var. makuwa]7.2e-25841.43Show/hide
Query:  APIPLLVPVPPLVPVPLVILLEFFAMATTKGEADLFVDAKHLRDFRKYNRYTFDGSLKDPIKAKL-----------------------------------
        AP P+  P P LVPV    + +            L  +AKHLRDFRKYN  TFDGSL+DP +A++                                   
Subjt:  APIPLLVPVPPLVPVPLVILLEFFAMATTKGEADLFVDAKHLRDFRKYNRYTFDGSLKDPIKAKL-----------------------------------

Query:  -----MIGVGSKLVKWEQFKENFYDKYFFANLIRLKEKEVLSLEQDNMSVEKYDQKFEQLSHFAPTMVATKAERMKRFVQGLRDGLQGLVQAFKSATQAE
             M+G     + W+QFKE+FY K+F A+L   K +E L+LEQ +M+VE+YD +F+ LS FAP M+AT+A R  +FV+GLR  +QGLV+AF+ AT A+
Subjt:  -----MIGVGSKLVKWEQFKENFYDKYFFANLIRLKEKEVLSLEQDNMSVEKYDQKFEQLSHFAPTMVATKAERMKRFVQGLRDGLQGLVQAFKSATQAE

Query:  -------------------------------------------------------------------DLC-----------------------EGDIADK
                                                                            LC                       EG  AD+
Subjt:  -------------------------------------------------------------------DLC-----------------------EGDIADK

Query:  CPR-----SNVTDGGGQHMGSYHGSSNKSKHSGDSCI----PI-------------SHSFLSFVFV----------------------------------
        CP      +     G  H G    ++        + +    P+             SHSF+S  FV                                  
Subjt:  CPR-----SNVTDGGGQHMGSYHGSSNKSKHSGDSCI----PI-------------SHSFLSFVFV----------------------------------

Query:  -EIANHALEVTLIVLNMHDFDVILSMDWLAANHTNIDCSRKEVTFNPPKEANFKYRVARTIVLPKVISAMKANKLLNQGTWSILASVVGTKEPELALSSE
         EIA H +EVTLIVL+M DFDVIL MDWLAANH +IDCSRK+VTFNPP  A+FK++   +  LP+VISA++A+KLL+QGTW ILASVV T+E +++LSSE
Subjt:  -EIANHALEVTLIVLNMHDFDVILSMDWLAANHTNIDCSRKEVTFNPPKEANFKYRVARTIVLPKVISAMKANKLLNQGTWSILASVVGTKEPELALSSE

Query:  PVVRDYPNVFPEDLLRLPPHREINFTIELELGMASISKAPCRMPLAELKE------------FHTTECVTLGCTNVVLKKKDSSLPL---YKELNK----
        PVVRDYP+VFPE+L  LPPHRE+ F IELE G   IS+AP RM  AELKE            F        G   + +KKKD S+ L   Y+ELNK    
Subjt:  PVVRDYPNVFPEDLLRLPPHREINFTIELELGMASISKAPCRMPLAELKE------------FHTTECVTLGCTNVVLKKKDSSLPL---YKELNK----

Query:  --------------------------------LRIKDSNIPKTTFYSRYEHYEFIVMSFGLTNVPTIFMILMNQVFKDFLDTFIIVFIDDIL--------
                                        LRIKD ++PKT F SRY HYEFIVMSFGLTN P +FM LMN+VF++FLDTF+IVFIDDIL        
Subjt:  --------------------------------LRIKDSNIPKTTFYSRYEHYEFIVMSFGLTNVPTIFMILMNQVFKDFLDTFIIVFIDDIL--------

Query:  ------------------------------VSFLGHVVLKGAVSIEPAKIEAVSSWSHLVTVRGA-SFSRLSRYYLRFVKDFSHIATPLTQLTRKGTTF-
                                      VSFLGHVV K  VS++PAKIEAV+ W+   TV    SF  L+ YY RFV++FS IATPLTQLTRKG  F 
Subjt:  ------------------------------VSFLGHVVLKGAVSIEPAKIEAVSSWSHLVTVRGA-SFSRLSRYYLRFVKDFSHIATPLTQLTRKGTTF-

Query:  ----------------VSAPVLIVPDGSGGFVVYSDASKKGLGCVLMQHGKVVAYASCQLKNHEQNYPTHDLELA-------------------------
                        V+APVL VPDGSG FV+YSDASKKGLGCVLMQ GKVVAYAS QLK+HEQNYPTHDLELA                         
Subjt:  ----------------VSAPVLIVPDGSGGFVVYSDASKKGLGCVLMQHGKVVAYASCQLKNHEQNYPTHDLELA-------------------------

Query:  ----------------------------ILYHPGKANVVEDALSRKTVHLVALITSR-------------------PAYVKTLRVQ-RLQQKIIDAQQGD
                                    ILYHPGKANVV DALSRK  H  ALIT +                      +  L VQ  L+Q+IIDAQ  D
Subjt:  ----------------------------ILYHPGKANVVEDALSRKTVHLVALITSR-------------------PAYVKTLRVQ-RLQQKIIDAQQGD

Query:  PYLVEKSHQVKVGQKSEFS----------GRQRVPIDNDVKNELLMEAHSSPFSVHPGSTKMYQELKCYYWWHKMKREMKASAKAGRAIAAFEGARMEVG
        PYLVEK    + GQ  EFS          GR  VP D+ VK ELL EAHSSPFS+HPGSTKMYQ+LK  YWW  MKRE          +A F        
Subjt:  PYLVEKSHQVKVGQKSEFS----------GRQRVPIDNDVKNELLMEAHSSPFSVHPGSTKMYQELKCYYWWHKMKREMKASAKAGRAIAAFEGARMEVG

Query:  EGVRMEMGEGVYGFYCCAAMALFPESIRQPQGSIDAALRQGAKDSAAVALKQKRISMDAEGRKISRKARKLQTERKIQKGRGKEFEIVLVPFEVRRESQG
                        C       E  ++P G +        +  +    K + +SMD     I+   R L           + F ++ V  +       
Subjt:  EGVRMEMGEGVYGFYCCAAMALFPESIRQPQGSIDAALRQGAKDSAAVALKQKRISMDAEGRKISRKARKLQTERKIQKGRGKEFEIVLVPFEVRRESQG

Query:  VRDDAASIKNLFSTHKSAHFFPGKSTYSMNKWAQIYMGEDVRLHEVPILIVFDRDPRFTSSFWKSLQSALGTQLDFSTAFRPQTDDQTEHLNQ-------
                       KSAHF PGKSTY+ +KWAQ+YM E VRLH VP+ IV DRD RFTS FWK LQ+A+GT+LDFSTAF PQ D QTE LNQ       
Subjt:  VRDDAASIKNLFSTHKSAHFFPGKSTYSMNKWAQIYMGEDVRLHEVPILIVFDRDPRFTSSFWKSLQSALGTQLDFSTAFRPQTDDQTEHLNQ-------

Query:  ------------------FAYNNNYQTTIGMTPFEALYGK--------------RL---ELVQPNNETIQKSRARMQTAHSKQKSFANV---------GD
                          FAYNN+YQ TIGM PFEALYGK              RL   ELVQ  NE IQK R+RM TA S+QKS+A+V          D
Subjt:  ------------------FAYNNNYQTTIGMTPFEALYGK--------------RL---ELVQPNNETIQKSRARMQTAHSKQKSFANV---------GD

Query:  KVFLKVAPMKGVLRFGRKRKLSSRFMGPFEILERVGPVAYRLALSLSLSIVHNVFHVSMLKKYVANPSHVVDYELLQLNEHLSYEEKPAKILAREVK---
        KVFLKVAPMKGVLRF R+ KLS RF+GPFEILER+GPVAYRLAL  SLS VH+VFHVSML+KYV +PSHVVDYE L+++E+LSY E+P ++LAREVK   
Subjt:  KVFLKVAPMKGVLRFGRKRKLSSRFMGPFEILERVGPVAYRLALSLSLSIVHNVFHVSMLKKYVANPSHVVDYELLQLNEHLSYEEKPAKILAREVK---

Query:  ------------NHQFKEAMWEQEDEMR
                    NH+ +EA WE+ED+MR
Subjt:  ------------NHQFKEAMWEQEDEMR

KAA0051368.1 pol protein [Cucumis melo var. makuwa]1.3e-26242.42Show/hide
Query:  VEVEVGEEDFQTLKFKTRTKRQNHS-----APIPLLVPVPPLVPVPLVILLEFFAMATTKGEADLFVDAKHLRDFRKYNRYTFDGSLKDPIKAKL-----
        V++   E+ F+ L  + R +++  S     AP P   P P   P P+ +  +F           L  +AKHLRDFRKYN  TFDGSL+DP +A++     
Subjt:  VEVEVGEEDFQTLKFKTRTKRQNHS-----APIPLLVPVPPLVPVPLVILLEFFAMATTKGEADLFVDAKHLRDFRKYNRYTFDGSLKDPIKAKL-----

Query:  -----------------------------------MIGVGSKLVKWEQFKENFYDKYFFANLIRLKEKEVLSLEQDNMSVEKYDQKFEQLSHFAPTMVAT
                                           M+      + W+QFKE+FY K+F A+L   K +E L+LEQ +++VE+YD +F+ LS FAP M+AT
Subjt:  -----------------------------------MIGVGSKLVKWEQFKENFYDKYFFANLIRLKEKEVLSLEQDNMSVEKYDQKFEQLSHFAPTMVAT

Query:  KAERMKRFVQGLRDGLQGLVQAFKSATQAE-------------------------------------------------------------DLCEGDIAD
        +A R  +FV+GLR  +QGLV+AF+ AT A+                                                              L  G  +D
Subjt:  KAERMKRFVQGLRDGLQGLVQAFKSATQAE-------------------------------------------------------------DLCEGDIAD

Query:  KCP-----RSNVTDGGGQHMGSYHGSSNKSKHSGDSCI----PI-------------SHSFLSFVFV---------------------------------
        +CP      +     G  H G    ++        + +    P+             SHSF+S  FV                                 
Subjt:  KCP-----RSNVTDGGGQHMGSYHGSSNKSKHSGDSCI----PI-------------SHSFLSFVFV---------------------------------

Query:  --EIANHALEVTLIVLNMHDFDVILSMDWLAANHTNIDCSRKEVTFNPPKEANFKYRVARTIVLPKVISAMKANKLLNQGTWSILASVVGTKEPELALSS
          EIA H +EVTLIVL+M DFDVIL MDWLAANH +IDCSRKEVTFNPP  A+FK++   +  LP+VISA++A+KLL+QGTW ILASVV T+E +++LSS
Subjt:  --EIANHALEVTLIVLNMHDFDVILSMDWLAANHTNIDCSRKEVTFNPPKEANFKYRVARTIVLPKVISAMKANKLLNQGTWSILASVVGTKEPELALSS

Query:  EPVVRDYPNVFPEDLLRLPPHREINFTIELELGMASISKAPCRMPLAELKEFHTTE----CVTLGCTNVVLKKKDSSLPLYKEL----------------
        EPVVRDYP+VFPE+L  LPPHRE+ F IELE G   IS+AP RM  AELKE         C+     N V  K    LP   +L                
Subjt:  EPVVRDYPNVFPEDLLRLPPHREINFTIELELGMASISKAPCRMPLAELKEFHTTE----CVTLGCTNVVLKKKDSSLPLYKEL----------------

Query:  ---NKLRIKDSNIPKTTFYSRYEHYEFIVMSFGLTNVPTIFMILMNQVFKDFLDTFIIVFIDDIL---------------VSFLGHVVLKGAVSIEPAKI
           ++LRIKD ++PKT F SRY HYEFIVMSFGLTN P +FM LMN+VF+ FLDTF+IVFIDDIL               VSFLGHVV K  VS++PAKI
Subjt:  ---NKLRIKDSNIPKTTFYSRYEHYEFIVMSFGLTNVPTIFMILMNQVFKDFLDTFIIVFIDDIL---------------VSFLGHVVLKGAVSIEPAKI

Query:  EAVSSWSHLVTVRGA-SFSRLSRYYLRFVKDFSHIATPLTQLTRKGTTF-----------------VSAPVLIVPDGSGGFVVYSDASKKGLGCVLMQHG
        EAV+ W+   TV    SF  L+ YY RFV++FS IATPLTQLTRKG  F                 V+APVL VPDGSG FV+YSDASKKGLGCVLMQ G
Subjt:  EAVSSWSHLVTVRGA-SFSRLSRYYLRFVKDFSHIATPLTQLTRKGTTF-----------------VSAPVLIVPDGSGGFVVYSDASKKGLGCVLMQHG

Query:  KVVAYASCQLKNHEQNYPTHDLELA-----------------------------------------------------ILYHPGKANVVEDALSRKTVHL
        KVVAYAS QLK+HEQNYPTHDLELA                                                     ILYHPGKANVV DALSRK  H 
Subjt:  KVVAYASCQLKNHEQNYPTHDLELA-----------------------------------------------------ILYHPGKANVVEDALSRKTVHL

Query:  VALITSR-------------------PAYVKTLRVQ-RLQQKIIDAQQGDPYLVEKSHQVKVGQKSEFS----------GRQRVPIDNDVKNELLMEAHS
         ALIT +                      +  L VQ  L+Q+IIDAQ  DPYLVEK    + GQ +EFS           R  VP D+ VK ELL EAHS
Subjt:  VALITSR-------------------PAYVKTLRVQ-RLQQKIIDAQQGDPYLVEKSHQVKVGQKSEFS----------GRQRVPIDNDVKNELLMEAHS

Query:  SPFSVHPGSTKMYQELKCYYWWHKMKREMKASAKAGRAIAAFEGARMEVGEGVRMEMGEGVYGFYCCAAMALFPESIRQPQGSIDAALRQGAKDSAAVAL
        SPFS+HPGSTKMYQ+LK  YWW  MKRE          +A F                          +  L  + ++ P+      L    +  +    
Subjt:  SPFSVHPGSTKMYQELKCYYWWHKMKREMKASAKAGRAIAAFEGARMEVGEGVRMEMGEGVYGFYCCAAMALFPESIRQPQGSIDAALRQGAKDSAAVAL

Query:  KQKRISMDAEGRKISRKARKLQTERKIQKGRGKEFEIVLVPFEVRRESQGVRDDAASIKNLFSTHKSAHFFPGKSTYSMNKWAQIYMGEDVRLHEVPILI
        K + +SMD     I+   R L           + F ++ V  +                      KSAHF PGKSTY+ +KWAQ+YM E VRLH VP+ I
Subjt:  KQKRISMDAEGRKISRKARKLQTERKIQKGRGKEFEIVLVPFEVRRESQGVRDDAASIKNLFSTHKSAHFFPGKSTYSMNKWAQIYMGEDVRLHEVPILI

Query:  VFDRDPRFTSSFWKSLQSALGTQLDFSTAFRPQTDDQTEHLNQ-------------------------FAYNNNYQTTIGMTPFEALYGK----------
        V DRD RFTS FWK LQ+A+GT+LDFSTAF PQTD QTE LNQ                         FAYNN+YQ TIGM PFEALYG+          
Subjt:  VFDRDPRFTSSFWKSLQSALGTQLDFSTAFRPQTDDQTEHLNQ-------------------------FAYNNNYQTTIGMTPFEALYGK----------

Query:  ----RL---ELVQPNNETIQKSRARMQTAHSKQKSFANV---------GDKVFLKVAPMKGVLRFGRKRKLSSRFMGPFEILERVGPVAYRLALSLSLSI
            RL   ELVQ  NE IQK R+RM TA S+QKS+A+V         GDKVFLKVAPMKGVLRF R+ KLS RF+GPFEILER+GPVAYRLAL  SLS 
Subjt:  ----RL---ELVQPNNETIQKSRARMQTAHSKQKSFANV---------GDKVFLKVAPMKGVLRFGRKRKLSSRFMGPFEILERVGPVAYRLALSLSLSI

Query:  VHNVFHVSMLKKYVANPSHVVDYELLQLNEHLSYEEKPAKILAREVK---------------NHQFKEAMWEQEDEMR
        VH+VFHVSML+KYV +PSHVVDYE L+++E+LSY E+P ++LAREVK               NH+ +EA WE+ED+MR
Subjt:  VHNVFHVSMLKKYVANPSHVVDYELLQLNEHLSYEEKPAKILAREVK---------------NHQFKEAMWEQEDEMR

KAA0062141.1 pol protein [Cucumis melo var. makuwa]1.5e-25842.56Show/hide
Query:  EEDFQTLKFKTRTKRQ---NHSAPIPLLVPVPPLVPVPLVILLEFFAMATTKGEADLFVDAKHLRDFRKYNRYTFDGSLKDPIKAKL-------------
        E+ F+ L  + R ++Q      AP P   P P   P P  +      +A       L  +AKHLRDFRKYN  TFDGSL+DP +A+L             
Subjt:  EEDFQTLKFKTRTKRQ---NHSAPIPLLVPVPPLVPVPLVILLEFFAMATTKGEADLFVDAKHLRDFRKYNRYTFDGSLKDPIKAKL-------------

Query:  ---------------------------MIGVGSKLVKWEQFKENFYDKYFFANLIRLKEKEVLSLEQDNMSVEKYDQKFEQLSHFAPTMVATKAERMKRF
                                   M+G     + W+QFKE+FY K+F A+L   K +E L+LEQD+M+VE+YD +F+ LS FAP M+AT+A R  +F
Subjt:  ---------------------------MIGVGSKLVKWEQFKENFYDKYFFANLIRLKEKEVLSLEQDNMSVEKYDQKFEQLSHFAPTMVATKAERMKRF

Query:  VQGLRDGLQGLVQAFKSATQAE-------------------------------------------------------------------DLC--------
        V+GLR  +QGLV+AF+ AT A+                                                                    LC        
Subjt:  VQGLRDGLQGLVQAFKSATQAE-------------------------------------------------------------------DLC--------

Query:  ---------------EGDIADKCP---RSNVTDGGGQ--HMGS-YHGSSNKSKHSG---DSCIPI-------------SHSFLSFVFV-------EIANH
                       EG  AD+CP     N  + G    H G  +  +  +++ +G      +P+             SHSF+S  FV       EIA H
Subjt:  ---------------EGDIADKCP---RSNVTDGGGQ--HMGS-YHGSSNKSKHSG---DSCIPI-------------SHSFLSFVFV-------EIANH

Query:  ALEVTLIVLNMHDFDVILSMDWLAANHTNIDCSRKEVTFNPPKEANFKYRVARTIVLPKVISAMKANKLLNQGTWSILASVVGTKEPELALSSEPVVRDY
         ++VTL+VL+M DFDVIL MDWLAANH +IDCSRKEVTFNPP  A+FK++   +  LP+VISA++A+KLL+QGTW ILASVV T+E +++LSSEPV+RDY
Subjt:  ALEVTLIVLNMHDFDVILSMDWLAANHTNIDCSRKEVTFNPPKEANFKYRVARTIVLPKVISAMKANKLLNQGTWSILASVVGTKEPELALSSEPVVRDY

Query:  PNVFPEDLLRLPPHREINFTIELELGMASISKAPCRMPLAELKE------------FHTTECVTLGCTNVVLKKKDSSLPL---YKELNK----------
        P+VFPE+L  LPPHRE+ F IELE G   IS+AP RM   ELKE            F        G   + +KKKD S+ L   Y+ELNK          
Subjt:  PNVFPEDLLRLPPHREINFTIELELGMASISKAPCRMPLAELKE------------FHTTECVTLGCTNVVLKKKDSSLPL---YKELNK----------

Query:  --------------------------LRIKDSNIPKTTFYSRYEHYEFIVMSFGLTNVPTIFMILMNQVFKDFLDTFIIVFIDDIL--------------
                                  LRIKD ++PKT F SRY H EFIVMSFGLTN P +FM LMN+VF+ FLDTF+IVFIDDIL              
Subjt:  --------------------------LRIKDSNIPKTTFYSRYEHYEFIVMSFGLTNVPTIFMILMNQVFKDFLDTFIIVFIDDIL--------------

Query:  ------------------------VSFLGHVVLKGAVSIEPAKIEAVSSWSHLVTVRGA-SFSRLSRYYLRFVKDFSHIATPLTQLTRKGTTF-------
                                VSFLGHVV K  VS++PAKIEAV+ W+   TV    SF  L+ YY RFV++FS IATPLTQLTRKG  F       
Subjt:  ------------------------VSFLGHVVLKGAVSIEPAKIEAVSSWSHLVTVRGA-SFSRLSRYYLRFVKDFSHIATPLTQLTRKGTTF-------

Query:  ----------VSAPVLIVPDGSGGFVVYSDASKKGLGCVLMQHGKVVAYASCQLKNHEQNYPTHDLELAIL---------YHPGKANVVEDALSRKTVHL
                  V+APVL VPDGSG F++YSDASKKGLGCVLMQ GKVV YAS QLK+HEQNYPTHDLELA +         Y  GKANVV D LSRK  H 
Subjt:  ----------VSAPVLIVPDGSGGFVVYSDASKKGLGCVLMQHGKVVAYASCQLKNHEQNYPTHDLELAIL---------YHPGKANVVEDALSRKTVHL

Query:  VALITSR-------------------PAYVKTLRVQ-RLQQKIIDAQQGDPYLVEKSHQVKVGQKSEFS----------GRQRVPIDNDVKNELLMEAHS
         ALIT +                      +  L VQ  L+Q+IIDAQ  DPYLVEK    +VGQ  EFS           R  VP D+ VK ELL EAHS
Subjt:  VALITSR-------------------PAYVKTLRVQ-RLQQKIIDAQQGDPYLVEKSHQVKVGQKSEFS----------GRQRVPIDNDVKNELLMEAHS

Query:  SPFSVHPGSTKMYQELKCYYWWHKMKREMKASAKAGRAIAAFEGARMEVGEGVRMEMGEGVYGFYCCAAMALFPESIRQPQGSIDAALRQGAKDSAAVAL
        SPFS+HPGS KMYQ LK  YWW  MKRE          +A F                          +  L  + ++ P+    A L Q    S     
Subjt:  SPFSVHPGSTKMYQELKCYYWWHKMKREMKASAKAGRAIAAFEGARMEVGEGVRMEMGEGVYGFYCCAAMALFPESIRQPQGSIDAALRQGAKDSAAVAL

Query:  KQKRISMDAEGRKISRKARKLQTERKIQKGRGKEFEIVLVPFEVRRESQGVRDDAASIKNLFSTHKSAHFFPGKSTYSMNKWAQIYMGEDVRLHEVPILI
        K + +SMD     I+   R L           + F ++ V  +                      KSAHF PGKSTY+ +KWAQ+YM E VRLH V + I
Subjt:  KQKRISMDAEGRKISRKARKLQTERKIQKGRGKEFEIVLVPFEVRRESQGVRDDAASIKNLFSTHKSAHFFPGKSTYSMNKWAQIYMGEDVRLHEVPILI

Query:  VFDRDPRFTSSFWKSLQSALGTQLDFSTAFRPQTDDQTEHLNQ-------------------------FAYNNNYQTTIGMTPFEALYGK----------
        V DRD RFTS FWK LQ+A+GT+LDFSTAF PQTD QTE LNQ                         FAYNN+YQ TIGM PFE LYGK          
Subjt:  VFDRDPRFTSSFWKSLQSALGTQLDFSTAFRPQTDDQTEHLNQ-------------------------FAYNNNYQTTIGMTPFEALYGK----------

Query:  ----RL---ELVQPNNETIQKSRARMQTAHSKQKSFAN---------VGDKVFLKVAPMKGVLRFGRKRKLSSRFMGPFEILERVGPVAYRLALSLSLSI
            RL   ELVQ  NE IQK R+RM TA S+QKS+A+         VGDKVFLKVAPM+GVLRF R+ KLS RF+GPFEILER+GPVAYRLAL  SLS 
Subjt:  ----RL---ELVQPNNETIQKSRARMQTAHSKQKSFAN---------VGDKVFLKVAPMKGVLRFGRKRKLSSRFMGPFEILERVGPVAYRLALSLSLSI

Query:  VHNVFHVSMLKKYVANPSHVVDYELLQLNEHLSYEEKPAKILAREVK---------------NHQFKEAMWEQEDEMR
        VH+VFHVSML+KYV +PSHVVDYE L+++E+ SY E+P ++LAREVK               NH+ +EA WE+ED+MR
Subjt:  VHNVFHVSMLKKYVANPSHVVDYELLQLNEHLSYEEKPAKILAREVK---------------NHQFKEAMWEQEDEMR

KAA0066456.1 pol protein [Cucumis melo var. makuwa]5.0e-25941.79Show/hide
Query:  APIPLLVPVPPLVPVPLVILLEFFAMATTKGEADLFVDAKHLRDFRKYNRYTFDGSLKDPIKAKL-----------------------------------
        AP P   PVP   P P+ +  +F           L  +AKHLRDFRKYN  TFDGSL+DP +A++                                   
Subjt:  APIPLLVPVPPLVPVPLVILLEFFAMATTKGEADLFVDAKHLRDFRKYNRYTFDGSLKDPIKAKL-----------------------------------

Query:  -----MIGVGSKLVKWEQFKENFYDKYFFANLIRLKEKEVLSLEQDNMSVEKYDQKFEQLSHFAPTMVATKAERMKRFVQGLRDGLQGLVQAFKSAT---
             M+G     + W+QFKE+FY K+F A+L   K +E L+LEQ +M+VE+YD +F+ LS FAP M+AT+A R  +FV+GLR  +QGLV+AF+ AT   
Subjt:  -----MIGVGSKLVKWEQFKENFYDKYFFANLIRLKEKEVLSLEQDNMSVEKYDQKFEQLSHFAPTMVATKAERMKRFVQGLRDGLQGLVQAFKSAT---

Query:  ----------------------------------------------------------------QAEDLC-----------------------EGDIADK
                                                                        + + LC                       EG  AD+
Subjt:  ----------------------------------------------------------------QAEDLC-----------------------EGDIADK

Query:  CPR-----SNVTDGGGQHMGSYHGSSNKSKHSGDSCI----PI-------------SHSFLSFVFV----------------------------------
        CP      +     G  H G    ++        + +    P+             SHSF+S  FV                                  
Subjt:  CPR-----SNVTDGGGQHMGSYHGSSNKSKHSGDSCI----PI-------------SHSFLSFVFV----------------------------------

Query:  -EIANHALEVTLIVLNMHDFDVILSMDWLAANHTNIDCSRKEVTFNPPKEANFKYRVARTIVLPKVISAMKANKLLNQGTWSILASVVGTKEPELALSSE
         EIA H +EVTLIVL+M DFDVIL MDWLAANH +IDCSRKEVTFNPP  A+FK++   +  LP+VISA++A+KLL+QGTW ILASVV T+E +++LSSE
Subjt:  -EIANHALEVTLIVLNMHDFDVILSMDWLAANHTNIDCSRKEVTFNPPKEANFKYRVARTIVLPKVISAMKANKLLNQGTWSILASVVGTKEPELALSSE

Query:  PVVRDYPNVFPEDLLRLPPHREINFTIELELGMASISKAPCRMPLAELKEF--HTTECVTLGCTNVVLKKKDSSLPLYKEL-------------------
        PVVRDYP+VFPE+L  LPPHRE+ F IELE G   IS+AP RM  AELK+      E +  G  N V  K    LP   +L                   
Subjt:  PVVRDYPNVFPEDLLRLPPHREINFTIELELGMASISKAPCRMPLAELKEF--HTTECVTLGCTNVVLKKKDSSLPLYKEL-------------------

Query:  NKLRIKDSNIPKTTFYSRYEHYEFIVMSFGLTNVPTIFMILMNQVFKDFLDTFIIVFIDDIL--------------------------------------
        ++LRIKD ++PKTTF SRY HYEFIVMSFGLTN P +FM LMN+VF++FLDTF+IVFIDDIL                                      
Subjt:  NKLRIKDSNIPKTTFYSRYEHYEFIVMSFGLTNVPTIFMILMNQVFKDFLDTFIIVFIDDIL--------------------------------------

Query:  VSFLGHVVLKGAVSIEPAKIEAVSSWSHLVTVRGA-SFSRLSRYYLRFVKDFSHIATPLTQLTRKGTTF-----------------VSAPVLIVPDGSGG
        VSFLGHVV K  VS++PAKIEAV+ W+   TV    SF  L  YY RFV++FS IATPLTQLTRKG  F                 V+APVL VPDGSG 
Subjt:  VSFLGHVVLKGAVSIEPAKIEAVSSWSHLVTVRGA-SFSRLSRYYLRFVKDFSHIATPLTQLTRKGTTF-----------------VSAPVLIVPDGSGG

Query:  FVVYSDASKKGLGCVLMQHGKVVAYASCQLKNHEQNYPTHDLELA-----------------------------------------------------IL
        FV+YSDASKKGLGCVLMQ GKVVAYAS QLK+HEQNYPTHDLELA                                                     IL
Subjt:  FVVYSDASKKGLGCVLMQHGKVVAYASCQLKNHEQNYPTHDLELA-----------------------------------------------------IL

Query:  YHPGKANVVEDALSRKTVHLVALITSRPAYVK--------------TLRVQR------LQQKIIDAQQGDPYLVEKSHQVKVGQKSEFS----------G
        YHPGKANVV DALSRK  H VALIT +    +              T+++ R      L+Q+IIDAQ  DPYLVEK   V+ GQ +EFS           
Subjt:  YHPGKANVVEDALSRKTVHLVALITSRPAYVK--------------TLRVQR------LQQKIIDAQQGDPYLVEKSHQVKVGQKSEFS----------G

Query:  RQRVPIDNDVKNELLMEAHSSPFSVHPGSTKMYQELKCYYWWHKMKREMKASAKAGRAIAAFEGARMEVGEGVRMEMGEGVYGFYCCAAMALFPESIRQP
        R  VP D+ VK ELL EAHSSPFS+HPGSTK+YQ+LK  YWW  MKRE          +A F                          +  L  + ++ P
Subjt:  RQRVPIDNDVKNELLMEAHSSPFSVHPGSTKMYQELKCYYWWHKMKREMKASAKAGRAIAAFEGARMEVGEGVRMEMGEGVYGFYCCAAMALFPESIRQP

Query:  QGSIDAALRQGAKDSAAVALKQKRISMDAEGRKISRKARKLQTERKIQKGRGKEFEIVLVPFEVRRESQGVRDDAASIKNLFSTHKSAHFFPGKSTYSMN
        +      L    +  +    K + +SMD     I+   R L           + F ++ V  +                      KSAHF PGKSTY+ +
Subjt:  QGSIDAALRQGAKDSAAVALKQKRISMDAEGRKISRKARKLQTERKIQKGRGKEFEIVLVPFEVRRESQGVRDDAASIKNLFSTHKSAHFFPGKSTYSMN

Query:  KWAQIYMGEDVRLHEVPILIVFDRDPRFTSSFWKSLQSALGTQLDFSTAFRPQTDDQTEHLNQ-------------------------FAYNNNYQTTIG
        KWAQ+YM E VRLH VP+ IV DRD RFTS FWK LQ+A+GT+LDFSTAF PQTD QTE LNQ                         FAYNN+YQ TIG
Subjt:  KWAQIYMGEDVRLHEVPILIVFDRDPRFTSSFWKSLQSALGTQLDFSTAFRPQTDDQTEHLNQ-------------------------FAYNNNYQTTIG

Query:  MTPFEALYGK--------------RL---ELVQPNNETIQKSRARMQTAHSKQKSFAN---------VGDKVFLKVAPMKGVLRFGRKRKLSSRFMGPFE
        M PFEALYG+              RL   ELVQ  NE IQK R+RM TA S+QKS+A+         VGDKVFLKVAPM+GV+RF R+ KLS RF+GPFE
Subjt:  MTPFEALYGK--------------RL---ELVQPNNETIQKSRARMQTAHSKQKSFAN---------VGDKVFLKVAPMKGVLRFGRKRKLSSRFMGPFE

Query:  ILERVGPVAYRLALSLSLSIVHNVFHVSMLKKYVANPSHVVDYELLQLNEHLSYEEKPAKILAREVK---------------NHQFKEAMWEQEDEMR
        ILER+GPVAYRLAL  SLS VH+VFHVSML+KYV +PSHVVDYE L+++E+LSY E+P K+LAREVK               NH+ +EA WE ED+MR
Subjt:  ILERVGPVAYRLALSLSLSIVHNVFHVSMLKKYVANPSHVVDYELLQLNEHLSYEEKPAKILAREVK---------------NHQFKEAMWEQEDEMR

TrEMBL top hitse value%identityAlignment
A0A5A7T0S7 Reverse transcriptase1.7e-26042.43Show/hide
Query:  EEDFQTLKFKTRTKRQ-NHSAPIPLLVPVPPLVPVPLVILLEFFAMATTKGEADLFVDAKHLRDFRKYNRYTFDGSLKDPIKAKL---------------
        E+ F+ L  + R ++Q    AP P   P P  VPV   ++L+            L  +AKHLRDFRKYN  TFDGSL+DP +A+L               
Subjt:  EEDFQTLKFKTRTKRQ-NHSAPIPLLVPVPPLVPVPLVILLEFFAMATTKGEADLFVDAKHLRDFRKYNRYTFDGSLKDPIKAKL---------------

Query:  -------------------------MIGVGSKLVKWEQFKENFYDKYFFANLIRLKEKEVLSLEQDNMSVEKYDQKFEQLSHFAPTMVATKAERMKRFVQ
                                 M+G  S+ + W+QFKE+FY K+F A+L   K +E L+LEQD+M+VE+YD +F+ LS FAP M+A +A R  +FV+
Subjt:  -------------------------MIGVGSKLVKWEQFKENFYDKYFFANLIRLKEKEVLSLEQDNMSVEKYDQKFEQLSHFAPTMVATKAERMKRFVQ

Query:  GLRDGLQGLVQAFKSATQAEDL----------------------------------------------------------------C--EGDIADKCPRS
        GLR  +QGLV+AF+  T A+ L                                                                C  EG  AD+CP  
Subjt:  GLRDGLQGLVQAFKSATQAEDL----------------------------------------------------------------C--EGDIADKCPRS

Query:  NVTDGGGQHMGSYH------GSSNKSKHSG---DSCIPI-------------SHSFLSFVFV-----------------------------------EIA
           +   Q  G+ H       +  +++ +G      +P+             SHSF+S  FV                                   EIA
Subjt:  NVTDGGGQHMGSYH------GSSNKSKHSG---DSCIPI-------------SHSFLSFVFV-----------------------------------EIA

Query:  NHALEVTLIVLNMHDFDVILSMDWLAANHTNIDCSRKEVTFNPPKEANFKYRVARTIVLPKVISAMKANKLLNQGTWSILASVVGTKEPELALSSEPVVR
         H +EVTL+VL+M DFDVIL MDWLAANH NIDCSRKEVTFNPP   +FK++   +  LP+VISA++A KLL+QG+W ILASVV T+E +++LSSEPVVR
Subjt:  NHALEVTLIVLNMHDFDVILSMDWLAANHTNIDCSRKEVTFNPPKEANFKYRVARTIVLPKVISAMKANKLLNQGTWSILASVVGTKEPELALSSEPVVR

Query:  DYPNVFPEDLLRLPPHREINFTIELELGMASISKAPCRMPLAELKE------------FHTTECVTLGCTNVVLKKKDSSLPL---YKELNKL-------
        DYP++FPE+L  LPPHRE+ F IEL+ G   IS+AP +M  AELKE            F        G   + +KKKD S+ L   Y+ LNK+       
Subjt:  DYPNVFPEDLLRLPPHREINFTIELELGMASISKAPCRMPLAELKE------------FHTTECVTLGCTNVVLKKKDSSLPL---YKELNKL-------

Query:  -----------------------------RIKDSNIPKTTFYSRYEHYEFIVMSFGLTNVPTIFMILMNQVFKDFLDTFIIVFIDDIL------------
                                     RIKD ++PKT F SRY HYEFIVMSFGLTN P +FM LMN+VF++FLDTF+IVFIDDIL            
Subjt:  -----------------------------RIKDSNIPKTTFYSRYEHYEFIVMSFGLTNVPTIFMILMNQVFKDFLDTFIIVFIDDIL------------

Query:  ---VSFLGHVVLKGAVSIEPAKIEAVSSWSHLVTVRGA-SFSRLSRYYLRFVKDFSHIATPLTQLTRKGTTF-----------------VSAPVLIVPDG
           VSFLGHVV K  VS++PAKIEAV+ W+   TV    SF  L+ YY RFV++FS IATPLTQLTRKG  F                 V+APVL VPDG
Subjt:  ---VSFLGHVVLKGAVSIEPAKIEAVSSWSHLVTVRGA-SFSRLSRYYLRFVKDFSHIATPLTQLTRKGTTF-----------------VSAPVLIVPDG

Query:  SGGFVVYSDASKKGLGCVLMQHGKVVAYASCQLKNHEQNYPTHDLELA----------------------------------------------------
        SG FV+YSDASKKGLGCVLMQ GKVVAYAS QLK+HEQNYPTHDLELA                                                    
Subjt:  SGGFVVYSDASKKGLGCVLMQHGKVVAYASCQLKNHEQNYPTHDLELA----------------------------------------------------

Query:  -ILYHPGKANVVEDALSRKTVHLVALITSRPAYVKTLRVQRLQQKIIDAQQGDPYLVEKSHQVKVGQKSEFS----------GRQRVPIDNDVKNELLME
         ILYHPGKANVV DALSRK  H  ALIT +    +      +   IIDAQ  DPYLVEK    +VGQ  EFS           R  VP D+ VK ELL E
Subjt:  -ILYHPGKANVVEDALSRKTVHLVALITSRPAYVKTLRVQRLQQKIIDAQQGDPYLVEKSHQVKVGQKSEFS----------GRQRVPIDNDVKNELLME

Query:  AHSSPFSVHPGSTKMYQELKCYYWWHKMKREMKASAKAGRAIAAFEGARMEVGEGVRMEMGEGVYGFYCCAAMALFPESIRQPQGSIDAALRQGAKDSAA
        AHSSPFS+HPGSTKMYQ+LK  YWW  MK                            ME+ E V     C  +    +++RQ    +   L       + 
Subjt:  AHSSPFSVHPGSTKMYQELKCYYWWHKMKREMKASAKAGRAIAAFEGARMEVGEGVRMEMGEGVYGFYCCAAMALFPESIRQPQGSIDAALRQGAKDSAA

Query:  VALKQKRISMDAEGRKISRKARKLQTERKIQKGRGKEFEIVLVPFEVRRESQGVRDDAASIKNLFSTHKSAHFFPGKSTYSMNKWAQIYMGEDVRLHEVP
           K + +SMD     I+   R L           + F ++ V  +                      KSAHF PGKSTY+ +KWAQ+YM E VRLH VP
Subjt:  VALKQKRISMDAEGRKISRKARKLQTERKIQKGRGKEFEIVLVPFEVRRESQGVRDDAASIKNLFSTHKSAHFFPGKSTYSMNKWAQIYMGEDVRLHEVP

Query:  ILIVFDRDPRFTSSFWKSLQSALGTQLDFSTAFRPQTDDQTEHLNQ-------------------------FAYNNNYQTTIGMTPFEALYGK-------
        + IV DRD RFTS FWK LQ+A+GT+LDFSTAF PQTD QTE LNQ                         FAYNN+YQ TIGM PFEALYGK       
Subjt:  ILIVFDRDPRFTSSFWKSLQSALGTQLDFSTAFRPQTDDQTEHLNQ-------------------------FAYNNNYQTTIGMTPFEALYGK-------

Query:  -------RL---ELVQPNNETIQKSRARMQTAHSKQKSFAN---------VGDKVFLKVAPMKGVLRFGRKRKLSSRFMGPFEILERVGPVAYRLALSLS
               RL   ELVQ  NE IQK R+RM TA S+QKS+A+         VGDKVFLKVAPM+GVLRF R+ KLS RF+GPFEILER+GPVAYRLAL  S
Subjt:  -------RL---ELVQPNNETIQKSRARMQTAHSKQKSFAN---------VGDKVFLKVAPMKGVLRFGRKRKLSSRFMGPFEILERVGPVAYRLALSLS

Query:  LSIVHNVFHVSMLKKYVANPSHVVDYELLQLNEHLSYEEKPAKILAREVK---------------NHQFKEAMWEQEDEM
        LS VH+VFHVSML+KYV +PSHV+DYE L+++E+LSY E+P ++LAREVK               NH+ +EA WE+ED+M
Subjt:  LSIVHNVFHVSMLKKYVANPSHVVDYELLQLNEHLSYEEKPAKILAREVK---------------NHQFKEAMWEQEDEM

A0A5A7U7V9 Reverse transcriptase6.2e-26342.42Show/hide
Query:  VEVEVGEEDFQTLKFKTRTKRQNHS-----APIPLLVPVPPLVPVPLVILLEFFAMATTKGEADLFVDAKHLRDFRKYNRYTFDGSLKDPIKAKL-----
        V++   E+ F+ L  + R +++  S     AP P   P P   P P+ +  +F           L  +AKHLRDFRKYN  TFDGSL+DP +A++     
Subjt:  VEVEVGEEDFQTLKFKTRTKRQNHS-----APIPLLVPVPPLVPVPLVILLEFFAMATTKGEADLFVDAKHLRDFRKYNRYTFDGSLKDPIKAKL-----

Query:  -----------------------------------MIGVGSKLVKWEQFKENFYDKYFFANLIRLKEKEVLSLEQDNMSVEKYDQKFEQLSHFAPTMVAT
                                           M+      + W+QFKE+FY K+F A+L   K +E L+LEQ +++VE+YD +F+ LS FAP M+AT
Subjt:  -----------------------------------MIGVGSKLVKWEQFKENFYDKYFFANLIRLKEKEVLSLEQDNMSVEKYDQKFEQLSHFAPTMVAT

Query:  KAERMKRFVQGLRDGLQGLVQAFKSATQAE-------------------------------------------------------------DLCEGDIAD
        +A R  +FV+GLR  +QGLV+AF+ AT A+                                                              L  G  +D
Subjt:  KAERMKRFVQGLRDGLQGLVQAFKSATQAE-------------------------------------------------------------DLCEGDIAD

Query:  KCP-----RSNVTDGGGQHMGSYHGSSNKSKHSGDSCI----PI-------------SHSFLSFVFV---------------------------------
        +CP      +     G  H G    ++        + +    P+             SHSF+S  FV                                 
Subjt:  KCP-----RSNVTDGGGQHMGSYHGSSNKSKHSGDSCI----PI-------------SHSFLSFVFV---------------------------------

Query:  --EIANHALEVTLIVLNMHDFDVILSMDWLAANHTNIDCSRKEVTFNPPKEANFKYRVARTIVLPKVISAMKANKLLNQGTWSILASVVGTKEPELALSS
          EIA H +EVTLIVL+M DFDVIL MDWLAANH +IDCSRKEVTFNPP  A+FK++   +  LP+VISA++A+KLL+QGTW ILASVV T+E +++LSS
Subjt:  --EIANHALEVTLIVLNMHDFDVILSMDWLAANHTNIDCSRKEVTFNPPKEANFKYRVARTIVLPKVISAMKANKLLNQGTWSILASVVGTKEPELALSS

Query:  EPVVRDYPNVFPEDLLRLPPHREINFTIELELGMASISKAPCRMPLAELKEFHTTE----CVTLGCTNVVLKKKDSSLPLYKEL----------------
        EPVVRDYP+VFPE+L  LPPHRE+ F IELE G   IS+AP RM  AELKE         C+     N V  K    LP   +L                
Subjt:  EPVVRDYPNVFPEDLLRLPPHREINFTIELELGMASISKAPCRMPLAELKEFHTTE----CVTLGCTNVVLKKKDSSLPLYKEL----------------

Query:  ---NKLRIKDSNIPKTTFYSRYEHYEFIVMSFGLTNVPTIFMILMNQVFKDFLDTFIIVFIDDIL---------------VSFLGHVVLKGAVSIEPAKI
           ++LRIKD ++PKT F SRY HYEFIVMSFGLTN P +FM LMN+VF+ FLDTF+IVFIDDIL               VSFLGHVV K  VS++PAKI
Subjt:  ---NKLRIKDSNIPKTTFYSRYEHYEFIVMSFGLTNVPTIFMILMNQVFKDFLDTFIIVFIDDIL---------------VSFLGHVVLKGAVSIEPAKI

Query:  EAVSSWSHLVTVRGA-SFSRLSRYYLRFVKDFSHIATPLTQLTRKGTTF-----------------VSAPVLIVPDGSGGFVVYSDASKKGLGCVLMQHG
        EAV+ W+   TV    SF  L+ YY RFV++FS IATPLTQLTRKG  F                 V+APVL VPDGSG FV+YSDASKKGLGCVLMQ G
Subjt:  EAVSSWSHLVTVRGA-SFSRLSRYYLRFVKDFSHIATPLTQLTRKGTTF-----------------VSAPVLIVPDGSGGFVVYSDASKKGLGCVLMQHG

Query:  KVVAYASCQLKNHEQNYPTHDLELA-----------------------------------------------------ILYHPGKANVVEDALSRKTVHL
        KVVAYAS QLK+HEQNYPTHDLELA                                                     ILYHPGKANVV DALSRK  H 
Subjt:  KVVAYASCQLKNHEQNYPTHDLELA-----------------------------------------------------ILYHPGKANVVEDALSRKTVHL

Query:  VALITSR-------------------PAYVKTLRVQ-RLQQKIIDAQQGDPYLVEKSHQVKVGQKSEFS----------GRQRVPIDNDVKNELLMEAHS
         ALIT +                      +  L VQ  L+Q+IIDAQ  DPYLVEK    + GQ +EFS           R  VP D+ VK ELL EAHS
Subjt:  VALITSR-------------------PAYVKTLRVQ-RLQQKIIDAQQGDPYLVEKSHQVKVGQKSEFS----------GRQRVPIDNDVKNELLMEAHS

Query:  SPFSVHPGSTKMYQELKCYYWWHKMKREMKASAKAGRAIAAFEGARMEVGEGVRMEMGEGVYGFYCCAAMALFPESIRQPQGSIDAALRQGAKDSAAVAL
        SPFS+HPGSTKMYQ+LK  YWW  MKRE          +A F                          +  L  + ++ P+      L    +  +    
Subjt:  SPFSVHPGSTKMYQELKCYYWWHKMKREMKASAKAGRAIAAFEGARMEVGEGVRMEMGEGVYGFYCCAAMALFPESIRQPQGSIDAALRQGAKDSAAVAL

Query:  KQKRISMDAEGRKISRKARKLQTERKIQKGRGKEFEIVLVPFEVRRESQGVRDDAASIKNLFSTHKSAHFFPGKSTYSMNKWAQIYMGEDVRLHEVPILI
        K + +SMD     I+   R L           + F ++ V  +                      KSAHF PGKSTY+ +KWAQ+YM E VRLH VP+ I
Subjt:  KQKRISMDAEGRKISRKARKLQTERKIQKGRGKEFEIVLVPFEVRRESQGVRDDAASIKNLFSTHKSAHFFPGKSTYSMNKWAQIYMGEDVRLHEVPILI

Query:  VFDRDPRFTSSFWKSLQSALGTQLDFSTAFRPQTDDQTEHLNQ-------------------------FAYNNNYQTTIGMTPFEALYGK----------
        V DRD RFTS FWK LQ+A+GT+LDFSTAF PQTD QTE LNQ                         FAYNN+YQ TIGM PFEALYG+          
Subjt:  VFDRDPRFTSSFWKSLQSALGTQLDFSTAFRPQTDDQTEHLNQ-------------------------FAYNNNYQTTIGMTPFEALYGK----------

Query:  ----RL---ELVQPNNETIQKSRARMQTAHSKQKSFANV---------GDKVFLKVAPMKGVLRFGRKRKLSSRFMGPFEILERVGPVAYRLALSLSLSI
            RL   ELVQ  NE IQK R+RM TA S+QKS+A+V         GDKVFLKVAPMKGVLRF R+ KLS RF+GPFEILER+GPVAYRLAL  SLS 
Subjt:  ----RL---ELVQPNNETIQKSRARMQTAHSKQKSFANV---------GDKVFLKVAPMKGVLRFGRKRKLSSRFMGPFEILERVGPVAYRLALSLSLSI

Query:  VHNVFHVSMLKKYVANPSHVVDYELLQLNEHLSYEEKPAKILAREVK---------------NHQFKEAMWEQEDEMR
        VH+VFHVSML+KYV +PSHVVDYE L+++E+LSY E+P ++LAREVK               NH+ +EA WE+ED+MR
Subjt:  VHNVFHVSMLKKYVANPSHVVDYELLQLNEHLSYEEKPAKILAREVK---------------NHQFKEAMWEQEDEMR

A0A5A7UAA8 Reverse transcriptase3.5e-25841.43Show/hide
Query:  APIPLLVPVPPLVPVPLVILLEFFAMATTKGEADLFVDAKHLRDFRKYNRYTFDGSLKDPIKAKL-----------------------------------
        AP P+  P P LVPV    + +            L  +AKHLRDFRKYN  TFDGSL+DP +A++                                   
Subjt:  APIPLLVPVPPLVPVPLVILLEFFAMATTKGEADLFVDAKHLRDFRKYNRYTFDGSLKDPIKAKL-----------------------------------

Query:  -----MIGVGSKLVKWEQFKENFYDKYFFANLIRLKEKEVLSLEQDNMSVEKYDQKFEQLSHFAPTMVATKAERMKRFVQGLRDGLQGLVQAFKSATQAE
             M+G     + W+QFKE+FY K+F A+L   K +E L+LEQ +M+VE+YD +F+ LS FAP M+AT+A R  +FV+GLR  +QGLV+AF+ AT A+
Subjt:  -----MIGVGSKLVKWEQFKENFYDKYFFANLIRLKEKEVLSLEQDNMSVEKYDQKFEQLSHFAPTMVATKAERMKRFVQGLRDGLQGLVQAFKSATQAE

Query:  -------------------------------------------------------------------DLC-----------------------EGDIADK
                                                                            LC                       EG  AD+
Subjt:  -------------------------------------------------------------------DLC-----------------------EGDIADK

Query:  CPR-----SNVTDGGGQHMGSYHGSSNKSKHSGDSCI----PI-------------SHSFLSFVFV----------------------------------
        CP      +     G  H G    ++        + +    P+             SHSF+S  FV                                  
Subjt:  CPR-----SNVTDGGGQHMGSYHGSSNKSKHSGDSCI----PI-------------SHSFLSFVFV----------------------------------

Query:  -EIANHALEVTLIVLNMHDFDVILSMDWLAANHTNIDCSRKEVTFNPPKEANFKYRVARTIVLPKVISAMKANKLLNQGTWSILASVVGTKEPELALSSE
         EIA H +EVTLIVL+M DFDVIL MDWLAANH +IDCSRK+VTFNPP  A+FK++   +  LP+VISA++A+KLL+QGTW ILASVV T+E +++LSSE
Subjt:  -EIANHALEVTLIVLNMHDFDVILSMDWLAANHTNIDCSRKEVTFNPPKEANFKYRVARTIVLPKVISAMKANKLLNQGTWSILASVVGTKEPELALSSE

Query:  PVVRDYPNVFPEDLLRLPPHREINFTIELELGMASISKAPCRMPLAELKE------------FHTTECVTLGCTNVVLKKKDSSLPL---YKELNK----
        PVVRDYP+VFPE+L  LPPHRE+ F IELE G   IS+AP RM  AELKE            F        G   + +KKKD S+ L   Y+ELNK    
Subjt:  PVVRDYPNVFPEDLLRLPPHREINFTIELELGMASISKAPCRMPLAELKE------------FHTTECVTLGCTNVVLKKKDSSLPL---YKELNK----

Query:  --------------------------------LRIKDSNIPKTTFYSRYEHYEFIVMSFGLTNVPTIFMILMNQVFKDFLDTFIIVFIDDIL--------
                                        LRIKD ++PKT F SRY HYEFIVMSFGLTN P +FM LMN+VF++FLDTF+IVFIDDIL        
Subjt:  --------------------------------LRIKDSNIPKTTFYSRYEHYEFIVMSFGLTNVPTIFMILMNQVFKDFLDTFIIVFIDDIL--------

Query:  ------------------------------VSFLGHVVLKGAVSIEPAKIEAVSSWSHLVTVRGA-SFSRLSRYYLRFVKDFSHIATPLTQLTRKGTTF-
                                      VSFLGHVV K  VS++PAKIEAV+ W+   TV    SF  L+ YY RFV++FS IATPLTQLTRKG  F 
Subjt:  ------------------------------VSFLGHVVLKGAVSIEPAKIEAVSSWSHLVTVRGA-SFSRLSRYYLRFVKDFSHIATPLTQLTRKGTTF-

Query:  ----------------VSAPVLIVPDGSGGFVVYSDASKKGLGCVLMQHGKVVAYASCQLKNHEQNYPTHDLELA-------------------------
                        V+APVL VPDGSG FV+YSDASKKGLGCVLMQ GKVVAYAS QLK+HEQNYPTHDLELA                         
Subjt:  ----------------VSAPVLIVPDGSGGFVVYSDASKKGLGCVLMQHGKVVAYASCQLKNHEQNYPTHDLELA-------------------------

Query:  ----------------------------ILYHPGKANVVEDALSRKTVHLVALITSR-------------------PAYVKTLRVQ-RLQQKIIDAQQGD
                                    ILYHPGKANVV DALSRK  H  ALIT +                      +  L VQ  L+Q+IIDAQ  D
Subjt:  ----------------------------ILYHPGKANVVEDALSRKTVHLVALITSR-------------------PAYVKTLRVQ-RLQQKIIDAQQGD

Query:  PYLVEKSHQVKVGQKSEFS----------GRQRVPIDNDVKNELLMEAHSSPFSVHPGSTKMYQELKCYYWWHKMKREMKASAKAGRAIAAFEGARMEVG
        PYLVEK    + GQ  EFS          GR  VP D+ VK ELL EAHSSPFS+HPGSTKMYQ+LK  YWW  MKRE          +A F        
Subjt:  PYLVEKSHQVKVGQKSEFS----------GRQRVPIDNDVKNELLMEAHSSPFSVHPGSTKMYQELKCYYWWHKMKREMKASAKAGRAIAAFEGARMEVG

Query:  EGVRMEMGEGVYGFYCCAAMALFPESIRQPQGSIDAALRQGAKDSAAVALKQKRISMDAEGRKISRKARKLQTERKIQKGRGKEFEIVLVPFEVRRESQG
                        C       E  ++P G +        +  +    K + +SMD     I+   R L           + F ++ V  +       
Subjt:  EGVRMEMGEGVYGFYCCAAMALFPESIRQPQGSIDAALRQGAKDSAAVALKQKRISMDAEGRKISRKARKLQTERKIQKGRGKEFEIVLVPFEVRRESQG

Query:  VRDDAASIKNLFSTHKSAHFFPGKSTYSMNKWAQIYMGEDVRLHEVPILIVFDRDPRFTSSFWKSLQSALGTQLDFSTAFRPQTDDQTEHLNQ-------
                       KSAHF PGKSTY+ +KWAQ+YM E VRLH VP+ IV DRD RFTS FWK LQ+A+GT+LDFSTAF PQ D QTE LNQ       
Subjt:  VRDDAASIKNLFSTHKSAHFFPGKSTYSMNKWAQIYMGEDVRLHEVPILIVFDRDPRFTSSFWKSLQSALGTQLDFSTAFRPQTDDQTEHLNQ-------

Query:  ------------------FAYNNNYQTTIGMTPFEALYGK--------------RL---ELVQPNNETIQKSRARMQTAHSKQKSFANV---------GD
                          FAYNN+YQ TIGM PFEALYGK              RL   ELVQ  NE IQK R+RM TA S+QKS+A+V          D
Subjt:  ------------------FAYNNNYQTTIGMTPFEALYGK--------------RL---ELVQPNNETIQKSRARMQTAHSKQKSFANV---------GD

Query:  KVFLKVAPMKGVLRFGRKRKLSSRFMGPFEILERVGPVAYRLALSLSLSIVHNVFHVSMLKKYVANPSHVVDYELLQLNEHLSYEEKPAKILAREVK---
        KVFLKVAPMKGVLRF R+ KLS RF+GPFEILER+GPVAYRLAL  SLS VH+VFHVSML+KYV +PSHVVDYE L+++E+LSY E+P ++LAREVK   
Subjt:  KVFLKVAPMKGVLRFGRKRKLSSRFMGPFEILERVGPVAYRLALSLSLSIVHNVFHVSMLKKYVANPSHVVDYELLQLNEHLSYEEKPAKILAREVK---

Query:  ------------NHQFKEAMWEQEDEMR
                    NH+ +EA WE+ED+MR
Subjt:  ------------NHQFKEAMWEQEDEMR

A0A5A7V8X5 Pol protein7.1e-25942.56Show/hide
Query:  EEDFQTLKFKTRTKRQ---NHSAPIPLLVPVPPLVPVPLVILLEFFAMATTKGEADLFVDAKHLRDFRKYNRYTFDGSLKDPIKAKL-------------
        E+ F+ L  + R ++Q      AP P   P P   P P  +      +A       L  +AKHLRDFRKYN  TFDGSL+DP +A+L             
Subjt:  EEDFQTLKFKTRTKRQ---NHSAPIPLLVPVPPLVPVPLVILLEFFAMATTKGEADLFVDAKHLRDFRKYNRYTFDGSLKDPIKAKL-------------

Query:  ---------------------------MIGVGSKLVKWEQFKENFYDKYFFANLIRLKEKEVLSLEQDNMSVEKYDQKFEQLSHFAPTMVATKAERMKRF
                                   M+G     + W+QFKE+FY K+F A+L   K +E L+LEQD+M+VE+YD +F+ LS FAP M+AT+A R  +F
Subjt:  ---------------------------MIGVGSKLVKWEQFKENFYDKYFFANLIRLKEKEVLSLEQDNMSVEKYDQKFEQLSHFAPTMVATKAERMKRF

Query:  VQGLRDGLQGLVQAFKSATQAE-------------------------------------------------------------------DLC--------
        V+GLR  +QGLV+AF+ AT A+                                                                    LC        
Subjt:  VQGLRDGLQGLVQAFKSATQAE-------------------------------------------------------------------DLC--------

Query:  ---------------EGDIADKCP---RSNVTDGGGQ--HMGS-YHGSSNKSKHSG---DSCIPI-------------SHSFLSFVFV-------EIANH
                       EG  AD+CP     N  + G    H G  +  +  +++ +G      +P+             SHSF+S  FV       EIA H
Subjt:  ---------------EGDIADKCP---RSNVTDGGGQ--HMGS-YHGSSNKSKHSG---DSCIPI-------------SHSFLSFVFV-------EIANH

Query:  ALEVTLIVLNMHDFDVILSMDWLAANHTNIDCSRKEVTFNPPKEANFKYRVARTIVLPKVISAMKANKLLNQGTWSILASVVGTKEPELALSSEPVVRDY
         ++VTL+VL+M DFDVIL MDWLAANH +IDCSRKEVTFNPP  A+FK++   +  LP+VISA++A+KLL+QGTW ILASVV T+E +++LSSEPV+RDY
Subjt:  ALEVTLIVLNMHDFDVILSMDWLAANHTNIDCSRKEVTFNPPKEANFKYRVARTIVLPKVISAMKANKLLNQGTWSILASVVGTKEPELALSSEPVVRDY

Query:  PNVFPEDLLRLPPHREINFTIELELGMASISKAPCRMPLAELKE------------FHTTECVTLGCTNVVLKKKDSSLPL---YKELNK----------
        P+VFPE+L  LPPHRE+ F IELE G   IS+AP RM   ELKE            F        G   + +KKKD S+ L   Y+ELNK          
Subjt:  PNVFPEDLLRLPPHREINFTIELELGMASISKAPCRMPLAELKE------------FHTTECVTLGCTNVVLKKKDSSLPL---YKELNK----------

Query:  --------------------------LRIKDSNIPKTTFYSRYEHYEFIVMSFGLTNVPTIFMILMNQVFKDFLDTFIIVFIDDIL--------------
                                  LRIKD ++PKT F SRY H EFIVMSFGLTN P +FM LMN+VF+ FLDTF+IVFIDDIL              
Subjt:  --------------------------LRIKDSNIPKTTFYSRYEHYEFIVMSFGLTNVPTIFMILMNQVFKDFLDTFIIVFIDDIL--------------

Query:  ------------------------VSFLGHVVLKGAVSIEPAKIEAVSSWSHLVTVRGA-SFSRLSRYYLRFVKDFSHIATPLTQLTRKGTTF-------
                                VSFLGHVV K  VS++PAKIEAV+ W+   TV    SF  L+ YY RFV++FS IATPLTQLTRKG  F       
Subjt:  ------------------------VSFLGHVVLKGAVSIEPAKIEAVSSWSHLVTVRGA-SFSRLSRYYLRFVKDFSHIATPLTQLTRKGTTF-------

Query:  ----------VSAPVLIVPDGSGGFVVYSDASKKGLGCVLMQHGKVVAYASCQLKNHEQNYPTHDLELAIL---------YHPGKANVVEDALSRKTVHL
                  V+APVL VPDGSG F++YSDASKKGLGCVLMQ GKVV YAS QLK+HEQNYPTHDLELA +         Y  GKANVV D LSRK  H 
Subjt:  ----------VSAPVLIVPDGSGGFVVYSDASKKGLGCVLMQHGKVVAYASCQLKNHEQNYPTHDLELAIL---------YHPGKANVVEDALSRKTVHL

Query:  VALITSR-------------------PAYVKTLRVQ-RLQQKIIDAQQGDPYLVEKSHQVKVGQKSEFS----------GRQRVPIDNDVKNELLMEAHS
         ALIT +                      +  L VQ  L+Q+IIDAQ  DPYLVEK    +VGQ  EFS           R  VP D+ VK ELL EAHS
Subjt:  VALITSR-------------------PAYVKTLRVQ-RLQQKIIDAQQGDPYLVEKSHQVKVGQKSEFS----------GRQRVPIDNDVKNELLMEAHS

Query:  SPFSVHPGSTKMYQELKCYYWWHKMKREMKASAKAGRAIAAFEGARMEVGEGVRMEMGEGVYGFYCCAAMALFPESIRQPQGSIDAALRQGAKDSAAVAL
        SPFS+HPGS KMYQ LK  YWW  MKRE          +A F                          +  L  + ++ P+    A L Q    S     
Subjt:  SPFSVHPGSTKMYQELKCYYWWHKMKREMKASAKAGRAIAAFEGARMEVGEGVRMEMGEGVYGFYCCAAMALFPESIRQPQGSIDAALRQGAKDSAAVAL

Query:  KQKRISMDAEGRKISRKARKLQTERKIQKGRGKEFEIVLVPFEVRRESQGVRDDAASIKNLFSTHKSAHFFPGKSTYSMNKWAQIYMGEDVRLHEVPILI
        K + +SMD     I+   R L           + F ++ V  +                      KSAHF PGKSTY+ +KWAQ+YM E VRLH V + I
Subjt:  KQKRISMDAEGRKISRKARKLQTERKIQKGRGKEFEIVLVPFEVRRESQGVRDDAASIKNLFSTHKSAHFFPGKSTYSMNKWAQIYMGEDVRLHEVPILI

Query:  VFDRDPRFTSSFWKSLQSALGTQLDFSTAFRPQTDDQTEHLNQ-------------------------FAYNNNYQTTIGMTPFEALYGK----------
        V DRD RFTS FWK LQ+A+GT+LDFSTAF PQTD QTE LNQ                         FAYNN+YQ TIGM PFE LYGK          
Subjt:  VFDRDPRFTSSFWKSLQSALGTQLDFSTAFRPQTDDQTEHLNQ-------------------------FAYNNNYQTTIGMTPFEALYGK----------

Query:  ----RL---ELVQPNNETIQKSRARMQTAHSKQKSFAN---------VGDKVFLKVAPMKGVLRFGRKRKLSSRFMGPFEILERVGPVAYRLALSLSLSI
            RL   ELVQ  NE IQK R+RM TA S+QKS+A+         VGDKVFLKVAPM+GVLRF R+ KLS RF+GPFEILER+GPVAYRLAL  SLS 
Subjt:  ----RL---ELVQPNNETIQKSRARMQTAHSKQKSFAN---------VGDKVFLKVAPMKGVLRFGRKRKLSSRFMGPFEILERVGPVAYRLALSLSLSI

Query:  VHNVFHVSMLKKYVANPSHVVDYELLQLNEHLSYEEKPAKILAREVK---------------NHQFKEAMWEQEDEMR
        VH+VFHVSML+KYV +PSHVVDYE L+++E+ SY E+P ++LAREVK               NH+ +EA WE+ED+MR
Subjt:  VHNVFHVSMLKKYVANPSHVVDYELLQLNEHLSYEEKPAKILAREVK---------------NHQFKEAMWEQEDEMR

A0A5A7VJE2 Reverse transcriptase2.4e-25941.79Show/hide
Query:  APIPLLVPVPPLVPVPLVILLEFFAMATTKGEADLFVDAKHLRDFRKYNRYTFDGSLKDPIKAKL-----------------------------------
        AP P   PVP   P P+ +  +F           L  +AKHLRDFRKYN  TFDGSL+DP +A++                                   
Subjt:  APIPLLVPVPPLVPVPLVILLEFFAMATTKGEADLFVDAKHLRDFRKYNRYTFDGSLKDPIKAKL-----------------------------------

Query:  -----MIGVGSKLVKWEQFKENFYDKYFFANLIRLKEKEVLSLEQDNMSVEKYDQKFEQLSHFAPTMVATKAERMKRFVQGLRDGLQGLVQAFKSAT---
             M+G     + W+QFKE+FY K+F A+L   K +E L+LEQ +M+VE+YD +F+ LS FAP M+AT+A R  +FV+GLR  +QGLV+AF+ AT   
Subjt:  -----MIGVGSKLVKWEQFKENFYDKYFFANLIRLKEKEVLSLEQDNMSVEKYDQKFEQLSHFAPTMVATKAERMKRFVQGLRDGLQGLVQAFKSAT---

Query:  ----------------------------------------------------------------QAEDLC-----------------------EGDIADK
                                                                        + + LC                       EG  AD+
Subjt:  ----------------------------------------------------------------QAEDLC-----------------------EGDIADK

Query:  CPR-----SNVTDGGGQHMGSYHGSSNKSKHSGDSCI----PI-------------SHSFLSFVFV----------------------------------
        CP      +     G  H G    ++        + +    P+             SHSF+S  FV                                  
Subjt:  CPR-----SNVTDGGGQHMGSYHGSSNKSKHSGDSCI----PI-------------SHSFLSFVFV----------------------------------

Query:  -EIANHALEVTLIVLNMHDFDVILSMDWLAANHTNIDCSRKEVTFNPPKEANFKYRVARTIVLPKVISAMKANKLLNQGTWSILASVVGTKEPELALSSE
         EIA H +EVTLIVL+M DFDVIL MDWLAANH +IDCSRKEVTFNPP  A+FK++   +  LP+VISA++A+KLL+QGTW ILASVV T+E +++LSSE
Subjt:  -EIANHALEVTLIVLNMHDFDVILSMDWLAANHTNIDCSRKEVTFNPPKEANFKYRVARTIVLPKVISAMKANKLLNQGTWSILASVVGTKEPELALSSE

Query:  PVVRDYPNVFPEDLLRLPPHREINFTIELELGMASISKAPCRMPLAELKEF--HTTECVTLGCTNVVLKKKDSSLPLYKEL-------------------
        PVVRDYP+VFPE+L  LPPHRE+ F IELE G   IS+AP RM  AELK+      E +  G  N V  K    LP   +L                   
Subjt:  PVVRDYPNVFPEDLLRLPPHREINFTIELELGMASISKAPCRMPLAELKEF--HTTECVTLGCTNVVLKKKDSSLPLYKEL-------------------

Query:  NKLRIKDSNIPKTTFYSRYEHYEFIVMSFGLTNVPTIFMILMNQVFKDFLDTFIIVFIDDIL--------------------------------------
        ++LRIKD ++PKTTF SRY HYEFIVMSFGLTN P +FM LMN+VF++FLDTF+IVFIDDIL                                      
Subjt:  NKLRIKDSNIPKTTFYSRYEHYEFIVMSFGLTNVPTIFMILMNQVFKDFLDTFIIVFIDDIL--------------------------------------

Query:  VSFLGHVVLKGAVSIEPAKIEAVSSWSHLVTVRGA-SFSRLSRYYLRFVKDFSHIATPLTQLTRKGTTF-----------------VSAPVLIVPDGSGG
        VSFLGHVV K  VS++PAKIEAV+ W+   TV    SF  L  YY RFV++FS IATPLTQLTRKG  F                 V+APVL VPDGSG 
Subjt:  VSFLGHVVLKGAVSIEPAKIEAVSSWSHLVTVRGA-SFSRLSRYYLRFVKDFSHIATPLTQLTRKGTTF-----------------VSAPVLIVPDGSGG

Query:  FVVYSDASKKGLGCVLMQHGKVVAYASCQLKNHEQNYPTHDLELA-----------------------------------------------------IL
        FV+YSDASKKGLGCVLMQ GKVVAYAS QLK+HEQNYPTHDLELA                                                     IL
Subjt:  FVVYSDASKKGLGCVLMQHGKVVAYASCQLKNHEQNYPTHDLELA-----------------------------------------------------IL

Query:  YHPGKANVVEDALSRKTVHLVALITSRPAYVK--------------TLRVQR------LQQKIIDAQQGDPYLVEKSHQVKVGQKSEFS----------G
        YHPGKANVV DALSRK  H VALIT +    +              T+++ R      L+Q+IIDAQ  DPYLVEK   V+ GQ +EFS           
Subjt:  YHPGKANVVEDALSRKTVHLVALITSRPAYVK--------------TLRVQR------LQQKIIDAQQGDPYLVEKSHQVKVGQKSEFS----------G

Query:  RQRVPIDNDVKNELLMEAHSSPFSVHPGSTKMYQELKCYYWWHKMKREMKASAKAGRAIAAFEGARMEVGEGVRMEMGEGVYGFYCCAAMALFPESIRQP
        R  VP D+ VK ELL EAHSSPFS+HPGSTK+YQ+LK  YWW  MKRE          +A F                          +  L  + ++ P
Subjt:  RQRVPIDNDVKNELLMEAHSSPFSVHPGSTKMYQELKCYYWWHKMKREMKASAKAGRAIAAFEGARMEVGEGVRMEMGEGVYGFYCCAAMALFPESIRQP

Query:  QGSIDAALRQGAKDSAAVALKQKRISMDAEGRKISRKARKLQTERKIQKGRGKEFEIVLVPFEVRRESQGVRDDAASIKNLFSTHKSAHFFPGKSTYSMN
        +      L    +  +    K + +SMD     I+   R L           + F ++ V  +                      KSAHF PGKSTY+ +
Subjt:  QGSIDAALRQGAKDSAAVALKQKRISMDAEGRKISRKARKLQTERKIQKGRGKEFEIVLVPFEVRRESQGVRDDAASIKNLFSTHKSAHFFPGKSTYSMN

Query:  KWAQIYMGEDVRLHEVPILIVFDRDPRFTSSFWKSLQSALGTQLDFSTAFRPQTDDQTEHLNQ-------------------------FAYNNNYQTTIG
        KWAQ+YM E VRLH VP+ IV DRD RFTS FWK LQ+A+GT+LDFSTAF PQTD QTE LNQ                         FAYNN+YQ TIG
Subjt:  KWAQIYMGEDVRLHEVPILIVFDRDPRFTSSFWKSLQSALGTQLDFSTAFRPQTDDQTEHLNQ-------------------------FAYNNNYQTTIG

Query:  MTPFEALYGK--------------RL---ELVQPNNETIQKSRARMQTAHSKQKSFAN---------VGDKVFLKVAPMKGVLRFGRKRKLSSRFMGPFE
        M PFEALYG+              RL   ELVQ  NE IQK R+RM TA S+QKS+A+         VGDKVFLKVAPM+GV+RF R+ KLS RF+GPFE
Subjt:  MTPFEALYGK--------------RL---ELVQPNNETIQKSRARMQTAHSKQKSFAN---------VGDKVFLKVAPMKGVLRFGRKRKLSSRFMGPFE

Query:  ILERVGPVAYRLALSLSLSIVHNVFHVSMLKKYVANPSHVVDYELLQLNEHLSYEEKPAKILAREVK---------------NHQFKEAMWEQEDEMR
        ILER+GPVAYRLAL  SLS VH+VFHVSML+KYV +PSHVVDYE L+++E+LSY E+P K+LAREVK               NH+ +EA WE ED+MR
Subjt:  ILERVGPVAYRLALSLSLSIVHNVFHVSMLKKYVANPSHVVDYELLQLNEHLSYEEKPAKILAREVK---------------NHQFKEAMWEQEDEMR

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.69.5e-1928.74Show/hide
Query:  SSLPLYKELNKLRIKDSNIPKTTFYSRYEHYEFIVMSFGLTNVPTIFMILMNQVFKDFLDTFIIVFIDDILV----------------------------
        +++ L K  +++ +   ++ KT F +++ HYE++ M FGL N P  F   MN + +  L+   +V++DDI+V                            
Subjt:  SSLPLYKELNKLRIKDSNIPKTTFYSRYEHYEFIVMSFGLTNVPTIFMILMNQVFKDFLDTFIIVFIDDILV----------------------------

Query:  ----------SFLGHVVLKGAVSIEPAKIEAVSSWSHLVTVRG-ASFSRLSRYYLRFVKDFSHIATPLTQLTRKG----TT---FVSA-----------P
                  +FLGHV+    +   P KIEA+  +      +   +F  L+ YY +F+ +F+ IA P+T+  +K     TT   + SA           P
Subjt:  ----------SFLGHVVLKGAVSIEPAKIEAVSSWSHLVTVRG-ASFSRLSRYYLRFVKDFSHIATPLTQLTRKG----TT---FVSA-----------P

Query:  VLIVPDGSGGFVVYSDASKKGLGCVLMQHGKVVAYASCQLKNHEQNYPTHDLEL
        +L VPD +  F + +DAS   LG VL Q G  ++Y S  L  HE NY T + EL
Subjt:  VLIVPDGSGGFVVYSDASKKGLGCVLMQHGKVVAYASCQLKNHEQNYPTHDLEL

P10401 Retrovirus-related Pol polyprotein from transposon gypsy2.3e-1225.18Show/hide
Query:  SSLPLYKELNKLRIKDSNIPKTTFYSRYEHYEFIVMSFGLTNVPTIFMILMNQVFKDFLDTFIIVFIDDIL-----------------------------
        ++L L    +++ + + +  KT+F      YEF  + FGL N  +IF   ++ V ++ +     V++DD++                             
Subjt:  SSLPLYKELNKLRIKDSNIPKTTFYSRYEHYEFIVMSFGLTNVPTIFMILMNQVFKDFLDTFIIVFIDDIL-----------------------------

Query:  ---------VSFLGHVVLKGAVSIEPAKIEAVSSWSHLVTV-RGASFSRLSRYYLRFVKDFSHIATPLTQLTRKGTTFVSA-------------------
                 V +LG +V K     +P K++A+  +     V +  SF  L+ YY  F+KDF+ IA P+T + +     VS                    
Subjt:  ---------VSFLGHVVLKGAVSIEPAKIEAVSSWSHLVTV-RGASFSRLSRYYLRFVKDFSHIATPLTQLTRKGTTFVSA-------------------

Query:  ----------PVLIVPDGSGGFVVYSDASKKGLGCVLMQHGKVVAYASCQLKNHEQNYPTHDLE-LAILYHPGK
                   +L  PD    F + +DAS  G+G VL Q G+ +   S  LK  EQNY T++ E LAI++  GK
Subjt:  ----------PVLIVPDGSGGFVVYSDASKKGLGCVLMQHGKVVAYASCQLKNHEQNYPTHDLE-LAILYHPGK

P20825 Retrovirus-related Pol polyprotein from transposon 2972.1e-1825.98Show/hide
Query:  SSLPLYKELNKLRIKDSNIPKTTFYSRYEHYEFIVMSFGLTNVPTIFMILMNQVFKDFLDTFIIVFIDDILV----------------------------
        +++ L K  +++ + + +I KT F ++  HYE++ M FGL N P  F   MN + +  L+   +V++DDI++                            
Subjt:  SSLPLYKELNKLRIKDSNIPKTTFYSRYEHYEFIVMSFGLTNVPTIFMILMNQVFKDFLDTFIIVFIDDILV----------------------------

Query:  ----------SFLGHVVLKGAVSIEPAKIEAVSSWSHLVTVRG-ASFSRLSRYYLRFVKDFSHIATPLTQLTRKGT------------------TFVSAP
                  +FLGH+V    +   P K++A+ S+      +   +F  L+ YY +F+ +++ IA P+T   +K T                    +  P
Subjt:  ----------SFLGHVVLKGAVSIEPAKIEAVSSWSHLVTVRG-ASFSRLSRYYLRFVKDFSHIATPLTQLTRKGT------------------TFVSAP

Query:  VLIVPDGSGGFVVYSDASKKGLGCVLMQHGKVVAYASCQLKNHEQNYPTHDLEL
        +L +PD    FV+ +DAS   LG VL Q+G  +++ S  L +HE NY   + EL
Subjt:  VLIVPDGSGGFVVYSDASKKGLGCVLMQHGKVVAYASCQLKNHEQNYPTHDLEL

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.1e-1425.27Show/hide
Query:  SSLPLYKELNKLRIKDSNIPKTTFYSRYEHYEFIVMSFGLTNVPTIFMILMNQVFKDFLDTFIIVFIDDIL-----------------------------
        ++L L    +++ +K+S+IPKT F +    YEF+ + FGL N P IF  +++ + ++ +     V+IDDI+                             
Subjt:  SSLPLYKELNKLRIKDSNIPKTTFYSRYEHYEFIVMSFGLTNVPTIFMILMNQVFKDFLDTFIIVFIDDIL-----------------------------

Query:  ---------VSFLGHVVLKGAVSIEPAKIEAVSSWSHLVTVRGAS-FSRLSRYYLRFVKDFSHIATPLTQLTR---------------------------
                 V FLG++V    +  +P K+ A+S      +V+    F  ++ YY +F++D++ +A PLT LTR                           
Subjt:  ---------VSFLGHVVLKGAVSIEPAKIEAVSSWSHLVTVRGAS-FSRLSRYYLRFVKDFSHIATPLTQLTR---------------------------

Query:  -KGTTFVSAPVLIVPDGSGGFVVYSDASKKGLGCVLMQ----HGKVVAYASCQLKNHEQNYPTHDLE-LAILY
           +   S+ +L  P  +  F + +DAS   +G VL Q      + +AY S  L   E+NY T + E LAI++
Subjt:  -KGTTFVSAPVLIVPDGSGGFVVYSDASKKGLGCVLMQ----HGKVVAYASCQLKNHEQNYPTHDLE-LAILY

Q9UR07 Transposon Tf2-11 polyprotein8.1e-1027.49Show/hide
Query:  KSAHFFPGKSTYSMNKWAQIYMGEDVRLHEVPILIVFDRDPRFTSSFWKSLQSALGTQLDFSTAFRPQTDDQTEHLNQF---------------------
        K A   P   + +  + A+++    +     P  I+ D D  FTS  WK         + FS  +RPQTD QTE  NQ                      
Subjt:  KSAHFFPGKSTYSMNKWAQIYMGEDVRLHEVPILIVFDRDPRFTSSFWKSLQSALGTQLDFSTAFRPQTDDQTEHLNQF---------------------

Query:  ----AYNNNYQTTIGMTPFEALYG-----KRLELVQPNNETIQKSRARMQ----------TAHSKQKSFANV----------GDKVFLKVAPMKGVLRFG
            +YNN   +   MTPFE ++        LEL   +++T + S+  +Q          T + K K + ++          GD V +K     G L   
Subjt:  ----AYNNNYQTTIGMTPFEALYG-----KRLELVQPNNETIQKSRARMQ----------TAHSKQKSFANV----------GDKVFLKVAPMKGVLRFG

Query:  RKRKLSSRFMGPFEILERVGPVAYRLALSLSLS-IVHNVFHVSMLKKYVAN
        +  KL+  F GPF +L++ GP  Y L L  S+  +  + FHVS L+KY  N
Subjt:  RKRKLSSRFMGPFEILERVGPVAYRLALSLSLS-IVHNVFHVSMLKKYVAN

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein6.1e-0538.03Show/hide
Query:  VSFLG--HVVLKGAVSIEPAKIEAVSSW---SHLVTVRGASFSRLSRYYLRFVKDFSHIATPLTQLTRKGT
        +++LG  H++    VS +PAK+EA+  W    +   +RG  F  L+ YY RFVK++  I  PLT+L +K +
Subjt:  VSFLG--HVVLKGAVSIEPAKIEAVSSW---SHLVTVRGASFSRLSRYYLRFVKDFSHIATPLTQLTRKGT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTACCGGAAGTCGAGGGGTTTTCTACTATGCTACCAGAAGCCTATGGGTTTCCTACTATATTACTGGAGACCTACAAGTTTCCTACTATATTTCGGGTGGGACAGAC
TGACTTACTAGGTAAGTCTAGACTATCCCTATGGCCATTAAGGGGTCCTTCGTCATCACCAGGATTTATTAGCAAGTGTTCATATATGTATATTAGGAAATGGCTCGAGG
AGGCAGAGGAGGCAGAACGGGTAGAGGGGGTAGTGGAAGTAGAGGTCGGAGAGGAGGATTTCCAAACCCTCAAGTTCAAAACCCGGACAAAAAGGCAGAACCATAGTGCA
CCAATTCCCCTTCTAGTGCCAGTTCCTCCTCTGGTGCCCGTTCCACTAGTCATCTTGCTCGAGTTTTTTGCAATGGCAACAACGAAAGGAGAGGCTGATCTATTTGTGGA
TGCAAAGCACTTGCGGGATTTTAGGAAGTACAACCGCTACACTTTTGATGGGTCACTAAAGGACCCTATAAAGGCTAAACTGATGATTGGGGTCGGTTCTAAACTAGTGA
AGTGGGAGCAATTCAAGGAGAACTTCTACGACAAATACTTCTTTGCGAATCTAATACGTCTCAAGGAGAAGGAGGTTCTGAGTTTAGAGCAGGACAACATGTCTGTAGAG
AAGTATGACCAGAAGTTCGAGCAGCTGTCCCATTTTGCCCCTACCATGGTTGCCACCAAGGCAGAGAGAATGAAGAGATTTGTCCAGGGGCTAAGGGATGGGTTGCAAGG
ACTGGTGCAAGCCTTTAAATCGGCCACCCAAGCAGAAGATCTCTGTGAAGGGGATATAGCGGACAAATGCCCTAGGAGTAACGTAACCGATGGTGGAGGTCAGCACATGG
GTTCTTACCATGGGAGTTCCAATAAGTCAAAGCATAGTGGTGACAGCTGCATCCCTATATCGCATTCATTTTTATCTTTTGTGTTTGTAGAAATAGCAAATCATGCGCTA
GAGGTAACCTTGATAGTCTTAAACATGCATGACTTCGATGTAATATTAAGCATGGATTGGTTGGCTGCTAACCATACTAATATAGATTGTTCTCGTAAGGAGGTAACTTT
CAACCCTCCTAAGGAAGCCAATTTCAAGTATAGAGTGGCTAGAACAATAGTCTTGCCCAAGGTAATCTCAGCCATGAAGGCTAACAAGCTACTTAACCAAGGTACCTGGA
GTATTTTGGCTAGTGTGGTCGGCACTAAAGAACCTGAACTCGCCTTATCTTCGGAACCAGTGGTGAGAGACTACCCAAATGTTTTTCCAGAGGACCTTCTAAGATTGCCA
CCGCATAGAGAGATAAACTTCACTATCGAGTTGGAGCTAGGTATGGCTTCCATCTCCAAGGCTCCTTGTAGGATGCCTCTAGCGGAGCTAAAAGAGTTTCATACGACCGA
GTGTGTCACCTTGGGGTGCACCAATGTTGTTCTGAAGAAGAAGGATAGTTCGTTACCTCTCTACAAAGAGTTGAACAAGTTGAGGATAAAAGACAGCAATATACCTAAGA
CAACTTTCTATTCCAGATATGAGCATTACGAGTTCATTGTTATGTCTTTTGGCCTGACAAATGTCCCTACAATATTCATGATTTTGATGAACCAGGTGTTCAAGGATTTT
CTTGACACTTTTATCATAGTATTCATAGATGATATACTGGTGTCCTTTTTGGGGCACGTGGTGTTGAAGGGAGCAGTTTCCATAGAACCTGCAAAGATAGAGGCAGTTTC
GAGTTGGTCTCATCTTGTGACAGTTAGAGGTGCGTCGTTTTCTAGGCTTAGCAGGTACTACCTTCGGTTTGTTAAGGACTTTTCTCACATAGCCACTCCTTTGACTCAGT
TGACCAGGAAGGGTACTACTTTTGTTTCTGCACCAGTACTTATAGTTCCAGACGGGTCTGGAGGTTTTGTTGTCTACAGTGATGCCTCTAAGAAAGGGTTGGGGTGTGTG
TTAATGCAACATGGTAAGGTGGTTGCTTATGCCTCTTGCCAGTTAAAGAACCACGAACAAAATTACCCGACTCATGATTTAGAGTTAGCAATTTTGTACCACCCTGGCAA
GGCAAATGTGGTAGAAGATGCCTTGAGTAGGAAGACAGTCCATTTGGTAGCTCTTATTACTAGCAGGCCTGCCTATGTAAAGACTTTGAGAGTGCAGAGATTGCAGCAAA
AGATTATTGATGCACAACAAGGTGATCCGTACCTGGTAGAGAAGTCTCACCAGGTGAAGGTGGGACAGAAAAGTGAGTTCTCAGGACGCCAACGTGTACCGATAGATAAT
GATGTTAAGAATGAATTGTTGATGGAAGCTCATAGTTCTCCATTCTCCGTGCATCCCGGAAGCACCAAGATGTACCAAGAACTAAAATGTTATTACTGGTGGCACAAAAT
GAAAAGAGAAATGAAAGCTTCGGCAAAAGCTGGCAGGGCTATTGCAGCCTTTGAAGGTGCCAGAATGGAAGTGGGAGAAGGTGTCAGAATGGAAATGGGAGAAGGTGTCT
ATGGATTTTATTGCTGCGCTGCAATGGCGCTTTTTCCCGAGAGCATCAGACAACCTCAGGGCAGCATCGACGCGGCATTGCGACAAGGCGCTAAGGATAGCGCTGCCGTG
GCGCTGAAGCAGAAACGCATCAGCATGGACGCAGAGGGGAGAAAAATATCAAGAAAAGCAAGAAAGTTGCAGACTGAAAGAAAGATCCAAAAAGGAAGAGGAAAAGAGTT
CGAGATAGTCTTGGTGCCTTTTGAAGTTCGACGTGAGAGCCAAGGAGTTCGTGACGATGCTGCCTCAATCAAGAACCTCTTTTCAACTCACAAGTCAGCACATTTTTTTC
CAGGCAAGTCTACCTATTCAATGAACAAGTGGGCACAGATTTACATGGGAGAAGATGTGAGGTTGCATGAAGTGCCAATTTTGATAGTGTTTGACAGAGACCCTCGCTTC
ACGTCCAGTTTTTGGAAGAGTCTCCAGTCAGCACTTGGCACACAGTTAGATTTCAGCACAGCCTTCCGCCCACAAACTGATGACCAAACTGAGCATTTGAATCAGTTTGC
TTATAACAACAACTACCAGACTACTATTGGCATGACACCATTTGAGGCCCTGTATGGGAAGAGGCTTGAGTTGGTACAACCTAATAATGAGACTATTCAGAAAAGTAGGG
CAAGGATGCAGACAGCTCATAGCAAACAGAAAAGTTTCGCCAATGTAGGTGACAAGGTATTCCTGAAAGTGGCGCCTATGAAGGGTGTTTTGAGGTTTGGAAGGAAGAGG
AAGTTGAGCTCTAGATTCATGGGCCCATTTGAGATATTGGAACGAGTTGGCCCAGTAGCTTATCGTTTGGCTTTGTCGCTGTCACTATCTATAGTTCATAATGTCTTTCA
TGTTTCTATGTTGAAGAAGTATGTCGCAAATCCGTCTCATGTAGTAGACTATGAACTCTTGCAACTAAATGAGCATTTGAGTTATGAGGAGAAGCCTGCTAAAATTCTTG
CAAGGGAAGTGAAGAATCATCAATTCAAGGAAGCAATGTGGGAACAGGAGGATGAAATGAGATTAAATCGTACGTCTTCGCCATTAGATCTGCCACCTCGTCTTCCTTCA
GGTGTCGTCGGTAGTCGTGTGAGGAAGCTTCGAACGACTGAAGCAGTGGCTGTCGTGTACTGCAGATCGTTGCTTCGACTTGGAAAATTGGAATCGACGAGTTTCCTACT
ATGTTATGGAAGTCAACGAGTTTTCTACTATGCTACTGGAAGCCTACGGGTTTCCTACTATGTTACGGGTGCCGACAGACCACGTCTAAATATAGGATGCAGTTCAAGTC
AACCGCCTTGTCTTATCCTCAAAACTCACGAGGAAACCCCAGATAGCCCAAAAAGCCAAAATCCAAAGACTAAAAGGAAAAGGAGAGCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTACCGGAAGTCGAGGGGTTTTCTACTATGCTACCAGAAGCCTATGGGTTTCCTACTATATTACTGGAGACCTACAAGTTTCCTACTATATTTCGGGTGGGACAGAC
TGACTTACTAGGTAAGTCTAGACTATCCCTATGGCCATTAAGGGGTCCTTCGTCATCACCAGGATTTATTAGCAAGTGTTCATATATGTATATTAGGAAATGGCTCGAGG
AGGCAGAGGAGGCAGAACGGGTAGAGGGGGTAGTGGAAGTAGAGGTCGGAGAGGAGGATTTCCAAACCCTCAAGTTCAAAACCCGGACAAAAAGGCAGAACCATAGTGCA
CCAATTCCCCTTCTAGTGCCAGTTCCTCCTCTGGTGCCCGTTCCACTAGTCATCTTGCTCGAGTTTTTTGCAATGGCAACAACGAAAGGAGAGGCTGATCTATTTGTGGA
TGCAAAGCACTTGCGGGATTTTAGGAAGTACAACCGCTACACTTTTGATGGGTCACTAAAGGACCCTATAAAGGCTAAACTGATGATTGGGGTCGGTTCTAAACTAGTGA
AGTGGGAGCAATTCAAGGAGAACTTCTACGACAAATACTTCTTTGCGAATCTAATACGTCTCAAGGAGAAGGAGGTTCTGAGTTTAGAGCAGGACAACATGTCTGTAGAG
AAGTATGACCAGAAGTTCGAGCAGCTGTCCCATTTTGCCCCTACCATGGTTGCCACCAAGGCAGAGAGAATGAAGAGATTTGTCCAGGGGCTAAGGGATGGGTTGCAAGG
ACTGGTGCAAGCCTTTAAATCGGCCACCCAAGCAGAAGATCTCTGTGAAGGGGATATAGCGGACAAATGCCCTAGGAGTAACGTAACCGATGGTGGAGGTCAGCACATGG
GTTCTTACCATGGGAGTTCCAATAAGTCAAAGCATAGTGGTGACAGCTGCATCCCTATATCGCATTCATTTTTATCTTTTGTGTTTGTAGAAATAGCAAATCATGCGCTA
GAGGTAACCTTGATAGTCTTAAACATGCATGACTTCGATGTAATATTAAGCATGGATTGGTTGGCTGCTAACCATACTAATATAGATTGTTCTCGTAAGGAGGTAACTTT
CAACCCTCCTAAGGAAGCCAATTTCAAGTATAGAGTGGCTAGAACAATAGTCTTGCCCAAGGTAATCTCAGCCATGAAGGCTAACAAGCTACTTAACCAAGGTACCTGGA
GTATTTTGGCTAGTGTGGTCGGCACTAAAGAACCTGAACTCGCCTTATCTTCGGAACCAGTGGTGAGAGACTACCCAAATGTTTTTCCAGAGGACCTTCTAAGATTGCCA
CCGCATAGAGAGATAAACTTCACTATCGAGTTGGAGCTAGGTATGGCTTCCATCTCCAAGGCTCCTTGTAGGATGCCTCTAGCGGAGCTAAAAGAGTTTCATACGACCGA
GTGTGTCACCTTGGGGTGCACCAATGTTGTTCTGAAGAAGAAGGATAGTTCGTTACCTCTCTACAAAGAGTTGAACAAGTTGAGGATAAAAGACAGCAATATACCTAAGA
CAACTTTCTATTCCAGATATGAGCATTACGAGTTCATTGTTATGTCTTTTGGCCTGACAAATGTCCCTACAATATTCATGATTTTGATGAACCAGGTGTTCAAGGATTTT
CTTGACACTTTTATCATAGTATTCATAGATGATATACTGGTGTCCTTTTTGGGGCACGTGGTGTTGAAGGGAGCAGTTTCCATAGAACCTGCAAAGATAGAGGCAGTTTC
GAGTTGGTCTCATCTTGTGACAGTTAGAGGTGCGTCGTTTTCTAGGCTTAGCAGGTACTACCTTCGGTTTGTTAAGGACTTTTCTCACATAGCCACTCCTTTGACTCAGT
TGACCAGGAAGGGTACTACTTTTGTTTCTGCACCAGTACTTATAGTTCCAGACGGGTCTGGAGGTTTTGTTGTCTACAGTGATGCCTCTAAGAAAGGGTTGGGGTGTGTG
TTAATGCAACATGGTAAGGTGGTTGCTTATGCCTCTTGCCAGTTAAAGAACCACGAACAAAATTACCCGACTCATGATTTAGAGTTAGCAATTTTGTACCACCCTGGCAA
GGCAAATGTGGTAGAAGATGCCTTGAGTAGGAAGACAGTCCATTTGGTAGCTCTTATTACTAGCAGGCCTGCCTATGTAAAGACTTTGAGAGTGCAGAGATTGCAGCAAA
AGATTATTGATGCACAACAAGGTGATCCGTACCTGGTAGAGAAGTCTCACCAGGTGAAGGTGGGACAGAAAAGTGAGTTCTCAGGACGCCAACGTGTACCGATAGATAAT
GATGTTAAGAATGAATTGTTGATGGAAGCTCATAGTTCTCCATTCTCCGTGCATCCCGGAAGCACCAAGATGTACCAAGAACTAAAATGTTATTACTGGTGGCACAAAAT
GAAAAGAGAAATGAAAGCTTCGGCAAAAGCTGGCAGGGCTATTGCAGCCTTTGAAGGTGCCAGAATGGAAGTGGGAGAAGGTGTCAGAATGGAAATGGGAGAAGGTGTCT
ATGGATTTTATTGCTGCGCTGCAATGGCGCTTTTTCCCGAGAGCATCAGACAACCTCAGGGCAGCATCGACGCGGCATTGCGACAAGGCGCTAAGGATAGCGCTGCCGTG
GCGCTGAAGCAGAAACGCATCAGCATGGACGCAGAGGGGAGAAAAATATCAAGAAAAGCAAGAAAGTTGCAGACTGAAAGAAAGATCCAAAAAGGAAGAGGAAAAGAGTT
CGAGATAGTCTTGGTGCCTTTTGAAGTTCGACGTGAGAGCCAAGGAGTTCGTGACGATGCTGCCTCAATCAAGAACCTCTTTTCAACTCACAAGTCAGCACATTTTTTTC
CAGGCAAGTCTACCTATTCAATGAACAAGTGGGCACAGATTTACATGGGAGAAGATGTGAGGTTGCATGAAGTGCCAATTTTGATAGTGTTTGACAGAGACCCTCGCTTC
ACGTCCAGTTTTTGGAAGAGTCTCCAGTCAGCACTTGGCACACAGTTAGATTTCAGCACAGCCTTCCGCCCACAAACTGATGACCAAACTGAGCATTTGAATCAGTTTGC
TTATAACAACAACTACCAGACTACTATTGGCATGACACCATTTGAGGCCCTGTATGGGAAGAGGCTTGAGTTGGTACAACCTAATAATGAGACTATTCAGAAAAGTAGGG
CAAGGATGCAGACAGCTCATAGCAAACAGAAAAGTTTCGCCAATGTAGGTGACAAGGTATTCCTGAAAGTGGCGCCTATGAAGGGTGTTTTGAGGTTTGGAAGGAAGAGG
AAGTTGAGCTCTAGATTCATGGGCCCATTTGAGATATTGGAACGAGTTGGCCCAGTAGCTTATCGTTTGGCTTTGTCGCTGTCACTATCTATAGTTCATAATGTCTTTCA
TGTTTCTATGTTGAAGAAGTATGTCGCAAATCCGTCTCATGTAGTAGACTATGAACTCTTGCAACTAAATGAGCATTTGAGTTATGAGGAGAAGCCTGCTAAAATTCTTG
CAAGGGAAGTGAAGAATCATCAATTCAAGGAAGCAATGTGGGAACAGGAGGATGAAATGAGATTAAATCGTACGTCTTCGCCATTAGATCTGCCACCTCGTCTTCCTTCA
GGTGTCGTCGGTAGTCGTGTGAGGAAGCTTCGAACGACTGAAGCAGTGGCTGTCGTGTACTGCAGATCGTTGCTTCGACTTGGAAAATTGGAATCGACGAGTTTCCTACT
ATGTTATGGAAGTCAACGAGTTTTCTACTATGCTACTGGAAGCCTACGGGTTTCCTACTATGTTACGGGTGCCGACAGACCACGTCTAAATATAGGATGCAGTTCAAGTC
AACCGCCTTGTCTTATCCTCAAAACTCACGAGGAAACCCCAGATAGCCCAAAAAGCCAAAATCCAAAGACTAAAAGGAAAAGGAGAGCCTAA
Protein sequenceShow/hide protein sequence
MVPEVEGFSTMLPEAYGFPTILLETYKFPTIFRVGQTDLLGKSRLSLWPLRGPSSSPGFISKCSYMYIRKWLEEAEEAERVEGVVEVEVGEEDFQTLKFKTRTKRQNHSA
PIPLLVPVPPLVPVPLVILLEFFAMATTKGEADLFVDAKHLRDFRKYNRYTFDGSLKDPIKAKLMIGVGSKLVKWEQFKENFYDKYFFANLIRLKEKEVLSLEQDNMSVE
KYDQKFEQLSHFAPTMVATKAERMKRFVQGLRDGLQGLVQAFKSATQAEDLCEGDIADKCPRSNVTDGGGQHMGSYHGSSNKSKHSGDSCIPISHSFLSFVFVEIANHAL
EVTLIVLNMHDFDVILSMDWLAANHTNIDCSRKEVTFNPPKEANFKYRVARTIVLPKVISAMKANKLLNQGTWSILASVVGTKEPELALSSEPVVRDYPNVFPEDLLRLP
PHREINFTIELELGMASISKAPCRMPLAELKEFHTTECVTLGCTNVVLKKKDSSLPLYKELNKLRIKDSNIPKTTFYSRYEHYEFIVMSFGLTNVPTIFMILMNQVFKDF
LDTFIIVFIDDILVSFLGHVVLKGAVSIEPAKIEAVSSWSHLVTVRGASFSRLSRYYLRFVKDFSHIATPLTQLTRKGTTFVSAPVLIVPDGSGGFVVYSDASKKGLGCV
LMQHGKVVAYASCQLKNHEQNYPTHDLELAILYHPGKANVVEDALSRKTVHLVALITSRPAYVKTLRVQRLQQKIIDAQQGDPYLVEKSHQVKVGQKSEFSGRQRVPIDN
DVKNELLMEAHSSPFSVHPGSTKMYQELKCYYWWHKMKREMKASAKAGRAIAAFEGARMEVGEGVRMEMGEGVYGFYCCAAMALFPESIRQPQGSIDAALRQGAKDSAAV
ALKQKRISMDAEGRKISRKARKLQTERKIQKGRGKEFEIVLVPFEVRRESQGVRDDAASIKNLFSTHKSAHFFPGKSTYSMNKWAQIYMGEDVRLHEVPILIVFDRDPRF
TSSFWKSLQSALGTQLDFSTAFRPQTDDQTEHLNQFAYNNNYQTTIGMTPFEALYGKRLELVQPNNETIQKSRARMQTAHSKQKSFANVGDKVFLKVAPMKGVLRFGRKR
KLSSRFMGPFEILERVGPVAYRLALSLSLSIVHNVFHVSMLKKYVANPSHVVDYELLQLNEHLSYEEKPAKILAREVKNHQFKEAMWEQEDEMRLNRTSSPLDLPPRLPS
GVVGSRVRKLRTTEAVAVVYCRSLLRLGKLESTSFLLCYGSQRVFYYATGSLRVSYYVTGADRPRLNIGCSSSQPPCLILKTHEETPDSPKSQNPKTKRKRRA