; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0016367 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0016367
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionGag-pol polyprotein
Genome locationchr08:14122344..14126878
RNA-Seq ExpressionPay0016367
SyntenyPay0016367
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAO73521.1 gag-pol polyprotein [Glycine max]0.0e+0041.7Show/hide
Query:  MEIIREGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYD-PPMITVNGVLVP--KPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLIN
        M + +EG   +RPPILDG NY YWK RM+ F+K+LD + W+A++ G++ P M+   G      KPE  WT  E++ ++GN++ALNA+FNGV+ NIF+LIN
Subjt:  MEIIREGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYD-PPMITVNGVLVP--KPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLIN

Query:  S-------------SYEGTSKVKIVRLQ----EFLKLQM-----------------NLCCSV-KKISDTKIVRKVLRSLPRKFDMKVTAIEEAHCITTLR
        +             ++EGTSKVKI RLQ    +F  L+M                 N C ++ ++I+D K+VRK+LRSLP++FDMKVTAIEEA  I  +R
Subjt:  S-------------SYEGTSKVKIVRLQ----EFLKLQM-----------------NLCCSV-KKISDTKIVRKVLRSLPRKFDMKVTAIEEAHCITTLR

Query:  LDELFSSLLTFEMATADRENKKGKGIDFKSTHEGEAAVSD--TEANMDESIALLTKQFINVLRKF-----------------------------------
        +DEL  SL TFE+  +DR  KK K + F S  EGE    D  T+  +  ++ LL KQF  VL +                                    
Subjt:  LDELFSSLLTFEMATADRENKKGKGIDFKSTHEGEAAVSD--TEANMDESIALLTKQFINVLRKF-----------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------KNTNATGTMKIL------IEEVMVILKKRKVTRGF--------------------------------------------
                             K  N T ++K+L      ++EV+++ K     RG                                             
Subjt:  ---------------------KNTNATGTMKIL------IEEVMVILKKRKVTRGF--------------------------------------------

Query:  -------SGVGNLEV-----------------------------------------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGKIMA
                  G+++                                                      HMTG + +  N++ C T +VTF DG+KGKI+ 
Subjt:  -------SGVGNLEV-----------------------------------------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGKIMA

Query:  KGNIDKNNLPRLNDVRYVDGLKANLICISQLCDQGYKVSFDDIGCVVMNKENQICMRGKRQTDNCYHWN-SNTSY--TCHLTRSDQTLSNW---------
         G +  + LP LN V  V GL ANLI ISQLCD+G+ V+F    C+V N+++++ M+G R  DNCY W    TSY  TC  ++ D+ +  W         
Subjt:  KGNIDKNNLPRLNDVRYVDGLKANLICISQLCDQGYKVSFDDIGCVVMNKENQICMRGKRQTDNCYHWN-SNTSY--TCHLTRSDQTLSNW---------

Query:  --------------------------------QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEICK
                                        ++ + +H+ L+   T+RVL+LLHMDLM PMQ ESLGGKRY  VVVDD+SR+TWV F++ K++T E+ K
Subjt:  --------------------------------QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEICK

Query:  NLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRT
         L L+LQREK+  I RIRSD+G+EF+N     FC  EGI HEFSA ITPQQNG+VERKNRTLQE ARVM+HAK LP   WAEA+NTAC+IHNRVT+R  T
Subjt:  NLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRT

Query:  TVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVMID----------EED-ETSNMSEARTT
          TLYE+WK RKP+VK+FH+FGS CYILADRE R+K D +S+ GIFLGYS NSRAYRVFN+++ +VME+INV++D          EED  TS  + A   
Subjt:  TVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVMID----------EED-ETSNMSEARTT

Query:  STVE-VSKADNPSDDLGKNLEKPSKEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALKDE--------
         + E    +D+ +D+   N+ +P K   T+         ++K HP   IIGDP+ G+ T+ +E    +++V + C+   I+P  V  AL DE        
Subjt:  STVE-VSKADNPSDDLGKNLEKPSKEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALKDE--------

Query:  ------RNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLNGYL
              RN +W LV +PEG N+IGTKW+FKNKTNE G +T+NKARLVAQGYTQ+EGVDFDETFAPVARLE+IRLLL ++CI KFKLYQMDVKSAFLNGYL
Subjt:  ------RNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLNGYL

Query:  NEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDDIIFGGFPQDFVHNFINIMQSEF
        NEEVYV QPKGF D  HP HVY+L KALYGLKQAPRAWYERLT +L  +GY +G IDKTLF+ +++  L++AQIYVDDI+FGG   + + +F+  MQSEF
Subjt:  NEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDDIIFGGFPQDFVHNFINIMQSEF

Query:  EMSMVGELSCFL---------------EKYAKDMIKKFGLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSIIGSLLYLTASRPDISYAVGICARYQAD
        EMS+VGEL+ FL                +YAK+++KKFG+  A +K T   TH+KL ++  G  VD  LYRS+IGSLLYLTASRPDI+YAVG+CARYQA+
Subjt:  EMSMVGELSCFL---------------EKYAKDMIKKFGLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSIIGSLLYLTASRPDISYAVGICARYQAD

Query:  PRISHLEAVKRIL------------------------------------------------------NKKRNCVSLSTVEAEYIAVGSGCTQLIWMKNML
        P+ISHL  VKRIL                                                      +KK+NCVSLST EAEYIA GS C+QL+WMK ML
Subjt:  PRISHLEAVKRIL------------------------------------------------------NKKRNCVSLSTVEAEYIAVGSGCTQLIWMKNML

Query:  HEYGFDQDTMMLYCDNMSAVDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIHSNLQLVDIFTKPLDANSFEYLHAGLGVC
         EY  +QD M LYCDNMSA++ISKNPVQHSRTKHIDIRHH+IR+LV++KVI L H+ +  Q+ DIFTK LDAN FE L   LG+C
Subjt:  HEYGFDQDTMMLYCDNMSAVDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIHSNLQLVDIFTKPLDANSFEYLHAGLGVC

AAO73523.1 gag-pol polyprotein [Glycine max]0.0e+0041.71Show/hide
Query:  MEIIREGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYD-PPMITVNGVLVP--KPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLIN
        M + +EG   +RPPILDG NY YWK RM+ F+K+LD + W+A++ G++ P M+   G      KPE  WT  E++ ++GN++ALNA+FNGV+ NIF+LIN
Subjt:  MEIIREGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYD-PPMITVNGVLVP--KPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLIN

Query:  -------------SSYEGTSKVKIVRLQ----EFLKLQM-----------------NLCCSV-KKISDTKIVRKVLRSLPRKFDMKVTAIEEAHCITTLR
                     S++EGTSKVK+ RLQ    +F  L+M                 N C ++ ++I+D K+VRK+LRSLP++FDMKVTAIEEA  I  +R
Subjt:  -------------SSYEGTSKVKIVRLQ----EFLKLQM-----------------NLCCSV-KKISDTKIVRKVLRSLPRKFDMKVTAIEEAHCITTLR

Query:  LDELFSSLLTFEMATADRENKKGKGIDFKSTHEGEAAV--SDTEANMDESIALLTKQFINVLRKF-----------------------------------
        +DEL  SL TFE+  +DR  KK K + F S  EGE      DT+  +  ++ LL KQF  VL +                                    
Subjt:  LDELFSSLLTFEMATADRENKKGKGIDFKSTHEGEAAV--SDTEANMDESIALLTKQFINVLRKF-----------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------KNTNATGTMKIL------IEEVMVILKKRKVTRGF--------------------------------------------
                             K  N T ++K+L      ++EV+++ K     RG                                             
Subjt:  ---------------------KNTNATGTMKIL------IEEVMVILKKRKVTRGF--------------------------------------------

Query:  -------SGVGNLEV-------------------------------------------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGKI
                  G+++                                                        HMTG + +  N++ C T +VTF DG+KGKI
Subjt:  -------SGVGNLEV-------------------------------------------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGKI

Query:  MAKGNIDKNNLPRLNDVRYVDGLKANLICISQLCDQGYKVSFDDIGCVVMNKENQICMRGKRQTDNCYHWN-SNTSY--TCHLTRSDQTLSNW-------
        +  G +  + LP LN V  V GL ANLI ISQLCD+G+ V+F    C+V N+++++ M+G R  DNCY W    TSY  TC  ++ D+ +  W       
Subjt:  MAKGNIDKNNLPRLNDVRYVDGLKANLICISQLCDQGYKVSFDDIGCVVMNKENQICMRGKRQTDNCYHWN-SNTSY--TCHLTRSDQTLSNW-------

Query:  ----------------------------------QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEI
                                          ++ + +H+ L+   T+RVL+LLHMDLM PMQ ESLGGKRY  VVVDD+SR+TWV F++ K+ T E+
Subjt:  ----------------------------------QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEI

Query:  CKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRT
         K L L+LQREK+  I RIRSD+G+EF+N  F  FC  EGI HEFSA ITPQQNG+VERKNRTLQE ARVM+HAK LP   WAEA+NTAC+IHNRVT+R 
Subjt:  CKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRT

Query:  RTTVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVMID----------EED-ETSNMSEAR
         T  TLYE+WK RKP+VK+FH+FGS CYILADRE R+K D +S+ GIFLGYS NSRAYRVFN+++ +VME+INV++D          EED  TS  + A 
Subjt:  RTTVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVMID----------EED-ETSNMSEAR

Query:  TTSTVE-VSKADNPSDDLGKNLEKPSKEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALKDE------
           + E    +D+ +D+   N+ +P K   T+         ++K HP   IIGDP+ G+ T+ +E    +++V + C+   I+P  V  AL DE      
Subjt:  TTSTVE-VSKADNPSDDLGKNLEKPSKEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALKDE------

Query:  --------RNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLNG
                RN +W LV +PEG N+IGTKW+FKNKTNE G +T+NKARLVAQGYTQ+EGVDFDETFAPVARLE+IRLLL ++CI KFKLYQMDVKSAFLNG
Subjt:  --------RNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLNG

Query:  YLNEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDDIIFGGFPQDFVHNFINIMQS
        YLNEEVYV QPKGF D  HP HVY+L KALYGLKQAPRAWYERLT +L  +GY +G IDKTLF+ +++  L++AQIYVDDI+FGG   + + +F+  MQS
Subjt:  YLNEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDDIIFGGFPQDFVHNFINIMQS

Query:  EFEMSMVGELSCFL---------------EKYAKDMIKKFGLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSIIGSLLYLTASRPDISYAVGICARYQ
        EFEMS+VGEL+ FL                +YAK+++KKFG+  A +K T   TH+KL ++  G  VD K YRS+IGSLLYLTASRPDI+YAVG+CARYQ
Subjt:  EFEMSMVGELSCFL---------------EKYAKDMIKKFGLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSIIGSLLYLTASRPDISYAVGICARYQ

Query:  ADPRISHLEAVKRIL------------------------------------------------------NKKRNCVSLSTVEAEYIAVGSGCTQLIWMKN
        A+P+ISHL  VKRIL                                                      +KK+NCVSLST EAEYIA GS C+QL+WMK 
Subjt:  ADPRISHLEAVKRIL------------------------------------------------------NKKRNCVSLSTVEAEYIAVGSGCTQLIWMKN

Query:  MLHEYGFDQDTMMLYCDNMSAVDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIHSNLQLVDIFTKPLDANSFEYLHAGLGVC
        ML EY  +QD M LYCDNMSA++ISKNPVQHSRTKHIDIRHH+IR+LV++KVI L H+ +  Q+ DIFTK LDAN FE L   LG+C
Subjt:  MLHEYGFDQDTMMLYCDNMSAVDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIHSNLQLVDIFTKPLDANSFEYLHAGLGVC

AAO73527.1 gag-pol polyprotein [Glycine max]0.0e+0041.69Show/hide
Query:  MEIIREGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYD-PPMITVNGVLVP--KPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLIN
        M + +EG   +RPPILDG NY YWK RM+ F+K+LD + W+A++ G++ P M+   G      KPE  WT  E++ ++GN++ALNA+FNGV+ NIF+LIN
Subjt:  MEIIREGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYD-PPMITVNGVLVP--KPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLIN

Query:  S-------------SYEGTSKVKIVRLQ----EFLKLQM-----------------NLCCSV-KKISDTKIVRKVLRSLPRKFDMKVTAIEEAHCITTLR
        +             ++EGTSKVK+ RLQ    +F  L+M                 N C ++ ++I+D K+VRK+LRSLP++FDMKVTAIEEA  I  +R
Subjt:  S-------------SYEGTSKVKIVRLQ----EFLKLQM-----------------NLCCSV-KKISDTKIVRKVLRSLPRKFDMKVTAIEEAHCITTLR

Query:  LDELFSSLLTFEMATADRENKKGKGIDFKSTHEGEAAV--SDTEANMDESIALLTKQFINVLRKF-----------------------------------
        +DEL  SL TFE+  +DR  KK K + F S  EGE      DT+  +  ++ LL KQF  VL +                                    
Subjt:  LDELFSSLLTFEMATADRENKKGKGIDFKSTHEGEAAV--SDTEANMDESIALLTKQFINVLRKF-----------------------------------

Query:  ------------------------------------------KNTNA-TG--------------------------------------------------
                                                  ++ NA TG                                                  
Subjt:  ------------------------------------------KNTNA-TG--------------------------------------------------

Query:  ----------------------------TMKIL------IEEVMVILKKRKVTRG------FSG------------------------------------
                                    ++K+L      ++EV+++ K     RG      F+G                                    
Subjt:  ----------------------------TMKIL------IEEVMVILKKRKVTRG------FSG------------------------------------

Query:  ---------VGNLEV-------------------------------------------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGKI
                  G+++                                                        HMTG + +  N++ C T +VTF DG+KGKI
Subjt:  ---------VGNLEV-------------------------------------------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGKI

Query:  MAKGNIDKNNLPRLNDVRYVDGLKANLICISQLCDQGYKVSFDDIGCVVMNKENQICMRGKRQTDNCYHWN-SNTSY--TCHLTRSDQTLSNW-------
        +  G +  + LP LN V  V GL ANLI ISQLCD+G+ V+F    C+V N+++++ M+G R  DNCY W    TSY  TC  ++ D+ +  W       
Subjt:  MAKGNIDKNNLPRLNDVRYVDGLKANLICISQLCDQGYKVSFDDIGCVVMNKENQICMRGKRQTDNCYHWN-SNTSY--TCHLTRSDQTLSNW-------

Query:  ----------------------------------QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEI
                                          ++ + +H+ L+   T+RVL+LLHMDLM PMQ ESLGGKRY  VVVDD+SR+TWV F++ K++T E+
Subjt:  ----------------------------------QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEI

Query:  CKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRT
         K L L+LQREK+  I RIRSD+G+EF+N  F  FC  EGI HEFSA ITPQQNG+VERKNRTLQE ARVM+HAK LP   WAEA+NTAC+IHNRVT+R 
Subjt:  CKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRT

Query:  RTTVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVMIDEEDETSNMSEARTTSTVEVSKAD
         T  TLYE+WK RKP+VK+FH+FGS CYILADRE R+K D +S+ GIFLGYS NSRAYRVFN+++ +VME+INV++D+              T+  + AD
Subjt:  RTTVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVMIDEEDETSNMSEARTTSTVEVSKAD

Query:  -NPSDDLGKNLEKPSKEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALKDE--------------RNN
           S +  +N +  + E    + +  +   ++K HP   IIGDP+ G+ T+ +E    +++V + C+   I+P  V  AL DE              RN 
Subjt:  -NPSDDLGKNLEKPSKEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALKDE--------------RNN

Query:  IWTLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQP
        +W LV +PEG N+IGTKW+FKNKTNE G +T+NKARLVAQGYTQ+EGVDFDETFAPVARLE+IRLLL ++CI KFKLYQMDVKSAFLNGYLNEEVYV QP
Subjt:  IWTLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQP

Query:  KGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDDIIFGGFPQDFVHNFINIMQSEFEMSMVGELS
        KGF D  HP HVY+L KALYGLKQAPRAWYERLT +L  +GY +G IDKTLF+ +++  L++AQIYVDDI+FGG   + + +F+  MQSEFEMS+VGEL+
Subjt:  KGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDDIIFGGFPQDFVHNFINIMQSEFEMSMVGELS

Query:  CFL---------------EKYAKDMIKKFGLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSIIGSLLYLTASRPDISYAVGICARYQADPRISHLEAV
         FL                +YAK+++KKFG+  A +K T   TH+KL ++  G  VD  LYRS+IGSLLYLTASRPDI+YAVG+CARYQA+P+ISHL  V
Subjt:  CFL---------------EKYAKDMIKKFGLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSIIGSLLYLTASRPDISYAVGICARYQADPRISHLEAV

Query:  KRIL------------------------------------------------------NKKRNCVSLSTVEAEYIAVGSGCTQLIWMKNMLHEYGFDQDT
        KRIL                                                      +KK+NCVSLST EAEYIA GS C+QL+WMK ML EY  +QD 
Subjt:  KRIL------------------------------------------------------NKKRNCVSLSTVEAEYIAVGSGCTQLIWMKNMLHEYGFDQDT

Query:  MMLYCDNMSAVDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIHSNLQLVDIFTKPLDANSFEYLHAGLGVC
        M LYCDNMSA++ISKNPVQHSRTKHIDIRHH+IR+LV++KVI L H+ +  Q+ DIFTK LDAN FE L   LG+C
Subjt:  MMLYCDNMSAVDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIHSNLQLVDIFTKPLDANSFEYLHAGLGVC

AAO73529.1 gag-pol polyprotein [Glycine max]0.0e+0041.69Show/hide
Query:  MEIIREGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYD-PPMITVNGVLVP--KPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLIN
        M + +EG   +RPPILDG NY YWK RM+ F+K+LD + W+A++ G++ P M+   G      KPE  WT  E++ ++GN++ALNA+FNGV+ NIF+LIN
Subjt:  MEIIREGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYD-PPMITVNGVLVP--KPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLIN

Query:  -------------SSYEGTSKVKIVRLQ----EFLKLQM-----------------NLCCSV-KKISDTKIVRKVLRSLPRKFDMKVTAIEEAHCITTLR
                     +++EGTSKVK+ RLQ    +F  L+M                 N C ++ ++++D K+VRK+LRSLP++FDMKVTAIEEA  I  +R
Subjt:  -------------SSYEGTSKVKIVRLQ----EFLKLQM-----------------NLCCSV-KKISDTKIVRKVLRSLPRKFDMKVTAIEEAHCITTLR

Query:  LDELFSSLLTFEMATADRENKKGKGIDFKSTHEGEAAV--SDTEANMDESIALLTKQFINVLRKF-----------------------------------
        +DEL  SL TFE+  +DR  KK K + F S  EGE      DT+  +  ++  L KQF  VL +                                    
Subjt:  LDELFSSLLTFEMATADRENKKGKGIDFKSTHEGEAAV--SDTEANMDESIALLTKQFINVLRKF-----------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------KNTNATGTMKIL------IEEVMVILKKRKVTRGF-------------------------------------------
                              K  N T ++K+L      +++V+ + KK    RG                                            
Subjt:  ----------------------KNTNATGTMKIL------IEEVMVILKKRKVTRGF-------------------------------------------

Query:  ---------------------------------SG----------VGNLEV--------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGK
                                         SG          + +L V                    HMTG + +  N++ C T +VTF DG+KGK
Subjt:  ---------------------------------SG----------VGNLEV--------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGK

Query:  IMAKGNIDKNNLPRLNDVRYVDGLKANLICISQLCDQGYKVSFDDIGCVVMNKENQICMRGKRQTDNCYHW---NSNTSYTCHLTRSDQTLSNW------
        I   G +    LP LN V  V GL  NLI ISQLCD+G+ V+F    C+V N+++++ M+G R  DNCY W    S+ S TC  ++ D+ +  W      
Subjt:  IMAKGNIDKNNLPRLNDVRYVDGLKANLICISQLCDQGYKVSFDDIGCVVMNKENQICMRGKRQTDNCYHW---NSNTSYTCHLTRSDQTLSNW------

Query:  -----------------------------------QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVE
                                           ++ + +H+ L+   T+RVL+LLHMDLM PMQ ESLGGKRY  VVVDD+SR+TWV F++ K+DT E
Subjt:  -----------------------------------QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVE

Query:  ICKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIR
        + K L L+LQREK+  I RIRSD+G+EF+N  F  FC  EGI HEFSA ITPQQNG+VERKNRTLQE ARVM+HAK LP   WAEA+NTAC+IHNRVT+R
Subjt:  ICKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIR

Query:  TRTTVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVMID----------EED-ETSNMSEA
          T  TLYE+WK RKP VK+FH+FGS CYILADRE R+K D +S+ GIFLGYS NSRAYRVFN+++ +VME+INV++D          EED  TS  + A
Subjt:  TRTTVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVMID----------EED-ETSNMSEA

Query:  RTTSTVE-VSKADNPSDDLGKNLEKPSKEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALKDE-----
         T  + E    +D+ +D+   N+ +P K              ++K HP   IIGDP+ G+ T+ +E    +++V + C+   I+P  V  AL DE     
Subjt:  RTTSTVE-VSKADNPSDDLGKNLEKPSKEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALKDE-----

Query:  ---------RNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLN
                 RN +W LV +PEG N+IGTKW+FKNKTNE G +T+NKARLVAQGYTQ+EGVDFDETFAPVARLE+IRLLL ++CI KFKLYQMDVKSAFLN
Subjt:  ---------RNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLN

Query:  GYLNEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDDIIFGGFPQDFVHNFINIMQ
        GYLNEE YV QPKGFVD  HP HVY+L KALYGLKQAPRAWYERLT +L  +GY +G IDKTLF+ +++  L++AQIYVDDI+FGG   + + +F+  MQ
Subjt:  GYLNEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDDIIFGGFPQDFVHNFINIMQ

Query:  SEFEMSMVGELSCFL---------------EKYAKDMIKKFGLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSIIGSLLYLTASRPDISYAVGICARY
        SEFEMS+VGEL+ FL                KYAK+++KKFG+  A +K T   TH+KL ++  G  VD  LYRS+IGSLLYLTASRPDI+YAVG+CARY
Subjt:  SEFEMSMVGELSCFL---------------EKYAKDMIKKFGLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSIIGSLLYLTASRPDISYAVGICARY

Query:  QADPRISHLEAVKRIL------------------------------------------------------NKKRNCVSLSTVEAEYIAVGSGCTQLIWMK
        QA+P+ISHL  VKRIL                                                      +KK+NCVSLST EAEYIA GS C+QL+WMK
Subjt:  QADPRISHLEAVKRIL------------------------------------------------------NKKRNCVSLSTVEAEYIAVGSGCTQLIWMK

Query:  NMLHEYGFDQDTMMLYCDNMSAVDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIHSNLQLVDIFTKPLDANSFEYLHAGLGVC
         ML EY  +QD M LYCDNMSA++ISKNPVQHSRTKHIDIRHH+IRELV++KVI L+H+ +  Q+ DIFTK LDA  FE L   LG+C
Subjt:  NMLHEYGFDQDTMMLYCDNMSAVDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIHSNLQLVDIFTKPLDANSFEYLHAGLGVC

GAU46010.1 hypothetical protein TSUD_401320 [Trifolium subterraneum]0.0e+0044.62Show/hide
Query:  REGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYDPPMI-TVNG--VLVPKPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLI-----
        REG   +RPP+LD  NY+ WK RMI F+K++D + W+A++ G++ P +   NG    V KPE  WT AE+  ++GN++ALNA+FNGV+ N+F+LI     
Subjt:  REGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYDPPMI-TVNG--VLVPKPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLI-----

Query:  --------NSSYEGTSKVKIVRLQ----EFLKLQMNLCCSV------------------KKISDTKIVRKVLRSLPRKFDMKVTAIEEAHCITTLRLDEL
                 ++ EGT+KVKI RLQ    +F  L+M    SV                  +K+SD KIVRK+LRSL +KFDMKV A+EEA  I+T+++DEL
Subjt:  --------NSSYEGTSKVKIVRLQ----EFLKLQMNLCCSV------------------KKISDTKIVRKVLRSLPRKFDMKVTAIEEAHCITTLRLDEL

Query:  FSSLLTFEMATADRENKKGKGIDFKSTHEGEAAVSDTEA--NMDESIALLTKQFINVLRKF--------------KNTNATGTMKILIEE----------
          SL T+E +  +R  KK K I F S  E     SD E+  ++ E+I LL +QF  VL++                + N   + K  I+E          
Subjt:  FSSLLTFEMATADRENKKGKGIDFKSTHEGEAAVSDTEA--NMDESIALLTKQFINVLRKF--------------KNTNATGTMKILIEE----------

Query:  -----------------------VMVILKKRKVTRGFSGVGNLEV-------------------------------------------------------
                               +  IL+K      F G+    V                                                       
Subjt:  -----------------------VMVILKKRKVTRGFSGVGNLEV-------------------------------------------------------

Query:  --------------------------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGKIMAKGNIDKNNLPRLNDVRYVDGLKANLICISQ
                                              HMTG   +  ++K   T  VTF DGAKG+I+  G +  NNLP+L++V  V GL ANLI ISQ
Subjt:  --------------------------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGKIMAKGNIDKNNLPRLNDVRYVDGLKANLICISQ

Query:  LCDQGYKVSFDDIGCVVMNKENQICMRGKRQTDNCYHW--NSNTSYTCHLTRSDQTLSNW----------------------------------------
        LCDQG KV+F    C+V + + ++ M+G    DNCY W    +TS +  L   +  +  W                                        
Subjt:  LCDQGYKVSFDDIGCVVMNKENQICMRGKRQTDNCYHW--NSNTSYTCHLTRSDQTLSNW----------------------------------------

Query:  -QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNEGFN
         ++T++ H+ L+   T RVL+LLHMDLM PMQ  SLGGKRY  VVVDD+SRYTW+ F+K K+DT ++ K+LC++LQREK + I RIRSD+GKEF N  F+
Subjt:  -QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNEGFN

Query:  SFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRTTVTLYELWKERKPNVKYFHVFGSICYILADR
         FC  EGI HEFS+PITPQQNGVVERKNRTLQE AR M+H K L   FWAEA+NTAC+IHNRVT+R+ TT TLYELWK RKP VKYFHVFGS CYIL DR
Subjt:  SFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRTTVTLYELWKERKPNVKYFHVFGSICYILADR

Query:  EYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVMIDE--EDETSNMSEARTTSTVEVSKADNPSDD---LGKNLEKPSKEIITKKSELITFA
        E R+K D +S++GIFLGYS NSR+YRV+N+++  +ME+INV+ID+  E  T+++++  TTS  +  + +   +D   +  ++   S   ++KK   I   
Subjt:  EYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVMIDE--EDETSNMSEARTTSTVEVSKADNPSDD---LGKNLEKPSKEIITKKSELITFA

Query:  HVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALKDE--------------RNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGC
         V+KNHP   IIGDPS G+ T+ K  +     V + C+   I+P  V  AL DE              RN +W LV +PE VN+IGTKWV+KNK++E G 
Subjt:  HVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALKDE--------------RNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGC

Query:  VTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAW
        VT+NKARLVAQGY Q+EGVDFDETFAPVA LE+IRLLL ++CI KF+L+QMDVKSAFLNGYLNEEVYV QPKGFVD   P HVYKL KALYGLKQAPRAW
Subjt:  VTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAW

Query:  YERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDDIIFGGFPQDFVHNFINIMQSEFEMSMVGELSCFL---------------EKYAKDMIKKF
        YERLT +L  +GY +G  DKTLF+  E  +L++AQIYVDDIIFGG     V +F+  MQSEFEMS+VGEL+ FL                KYAK+++KKF
Subjt:  YERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDDIIFGGFPQDFVHNFINIMQSEFEMSMVGELSCFL---------------EKYAKDMIKKF

Query:  GLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSIIGSLLYLTAS------------RPDISYAVGICARYQADPRISHLEAVKRIL-------------
        G    ++K T  ATH+KL ++ +G +VD  LY+S+IGSLLYLTAS            RPDI++AVG+CARYQA+P+ISHL  VKRIL             
Subjt:  GLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSIIGSLLYLTAS------------RPDISYAVGICARYQADPRISHLEAVKRIL-------------

Query:  -----------------------------------------NKKRNCVSLSTVEAEYIAVGSGCTQLIWMKNMLHEYGFDQDTMMLYCDNMSAVDISKNP
                                                 +KK+NC+SLST EAEYI  GS C+QL+WMK ML EY  +QD M LYCDN+SA++ISKNP
Subjt:  -----------------------------------------NKKRNCVSLSTVEAEYIAVGSGCTQLIWMKNMLHEYGFDQDTMMLYCDNMSAVDISKNP

Query:  VQHSRTKHIDIRHHFIRELVENKVIRLDHIHSNLQLVDIFTKPLDANSFEYLHAGLGVC
        +QHSRTKHIDIRHHFIR LVE KVI L+H+ ++ QL DIFTK LDA  FE L + LGVC
Subjt:  VQHSRTKHIDIRHHFIRELVENKVIRLDHIHSNLQLVDIFTKPLDANSFEYLHAGLGVC

TrEMBL top hitse value%identityAlignment
A0A2Z6P936 Integrase catalytic domain-containing protein0.0e+0044.62Show/hide
Query:  REGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYDPPMI-TVNG--VLVPKPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLI-----
        REG   +RPP+LD  NY+ WK RMI F+K++D + W+A++ G++ P +   NG    V KPE  WT AE+  ++GN++ALNA+FNGV+ N+F+LI     
Subjt:  REGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYDPPMI-TVNG--VLVPKPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLI-----

Query:  --------NSSYEGTSKVKIVRLQ----EFLKLQMNLCCSV------------------KKISDTKIVRKVLRSLPRKFDMKVTAIEEAHCITTLRLDEL
                 ++ EGT+KVKI RLQ    +F  L+M    SV                  +K+SD KIVRK+LRSL +KFDMKV A+EEA  I+T+++DEL
Subjt:  --------NSSYEGTSKVKIVRLQ----EFLKLQMNLCCSV------------------KKISDTKIVRKVLRSLPRKFDMKVTAIEEAHCITTLRLDEL

Query:  FSSLLTFEMATADRENKKGKGIDFKSTHEGEAAVSDTEA--NMDESIALLTKQFINVLRKF--------------KNTNATGTMKILIEE----------
          SL T+E +  +R  KK K I F S  E     SD E+  ++ E+I LL +QF  VL++                + N   + K  I+E          
Subjt:  FSSLLTFEMATADRENKKGKGIDFKSTHEGEAAVSDTEA--NMDESIALLTKQFINVLRKF--------------KNTNATGTMKILIEE----------

Query:  -----------------------VMVILKKRKVTRGFSGVGNLEV-------------------------------------------------------
                               +  IL+K      F G+    V                                                       
Subjt:  -----------------------VMVILKKRKVTRGFSGVGNLEV-------------------------------------------------------

Query:  --------------------------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGKIMAKGNIDKNNLPRLNDVRYVDGLKANLICISQ
                                              HMTG   +  ++K   T  VTF DGAKG+I+  G +  NNLP+L++V  V GL ANLI ISQ
Subjt:  --------------------------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGKIMAKGNIDKNNLPRLNDVRYVDGLKANLICISQ

Query:  LCDQGYKVSFDDIGCVVMNKENQICMRGKRQTDNCYHW--NSNTSYTCHLTRSDQTLSNW----------------------------------------
        LCDQG KV+F    C+V + + ++ M+G    DNCY W    +TS +  L   +  +  W                                        
Subjt:  LCDQGYKVSFDDIGCVVMNKENQICMRGKRQTDNCYHW--NSNTSYTCHLTRSDQTLSNW----------------------------------------

Query:  -QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNEGFN
         ++T++ H+ L+   T RVL+LLHMDLM PMQ  SLGGKRY  VVVDD+SRYTW+ F+K K+DT ++ K+LC++LQREK + I RIRSD+GKEF N  F+
Subjt:  -QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNEGFN

Query:  SFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRTTVTLYELWKERKPNVKYFHVFGSICYILADR
         FC  EGI HEFS+PITPQQNGVVERKNRTLQE AR M+H K L   FWAEA+NTAC+IHNRVT+R+ TT TLYELWK RKP VKYFHVFGS CYIL DR
Subjt:  SFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRTTVTLYELWKERKPNVKYFHVFGSICYILADR

Query:  EYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVMIDE--EDETSNMSEARTTSTVEVSKADNPSDD---LGKNLEKPSKEIITKKSELITFA
        E R+K D +S++GIFLGYS NSR+YRV+N+++  +ME+INV+ID+  E  T+++++  TTS  +  + +   +D   +  ++   S   ++KK   I   
Subjt:  EYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVMIDE--EDETSNMSEARTTSTVEVSKADNPSDD---LGKNLEKPSKEIITKKSELITFA

Query:  HVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALKDE--------------RNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGC
         V+KNHP   IIGDPS G+ T+ K  +     V + C+   I+P  V  AL DE              RN +W LV +PE VN+IGTKWV+KNK++E G 
Subjt:  HVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALKDE--------------RNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGC

Query:  VTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAW
        VT+NKARLVAQGY Q+EGVDFDETFAPVA LE+IRLLL ++CI KF+L+QMDVKSAFLNGYLNEEVYV QPKGFVD   P HVYKL KALYGLKQAPRAW
Subjt:  VTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAW

Query:  YERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDDIIFGGFPQDFVHNFINIMQSEFEMSMVGELSCFL---------------EKYAKDMIKKF
        YERLT +L  +GY +G  DKTLF+  E  +L++AQIYVDDIIFGG     V +F+  MQSEFEMS+VGEL+ FL                KYAK+++KKF
Subjt:  YERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDDIIFGGFPQDFVHNFINIMQSEFEMSMVGELSCFL---------------EKYAKDMIKKF

Query:  GLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSIIGSLLYLTAS------------RPDISYAVGICARYQADPRISHLEAVKRIL-------------
        G    ++K T  ATH+KL ++ +G +VD  LY+S+IGSLLYLTAS            RPDI++AVG+CARYQA+P+ISHL  VKRIL             
Subjt:  GLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSIIGSLLYLTAS------------RPDISYAVGICARYQADPRISHLEAVKRIL-------------

Query:  -----------------------------------------NKKRNCVSLSTVEAEYIAVGSGCTQLIWMKNMLHEYGFDQDTMMLYCDNMSAVDISKNP
                                                 +KK+NC+SLST EAEYI  GS C+QL+WMK ML EY  +QD M LYCDN+SA++ISKNP
Subjt:  -----------------------------------------NKKRNCVSLSTVEAEYIAVGSGCTQLIWMKNMLHEYGFDQDTMMLYCDNMSAVDISKNP

Query:  VQHSRTKHIDIRHHFIRELVENKVIRLDHIHSNLQLVDIFTKPLDANSFEYLHAGLGVC
        +QHSRTKHIDIRHHFIR LVE KVI L+H+ ++ QL DIFTK LDA  FE L + LGVC
Subjt:  VQHSRTKHIDIRHHFIRELVENKVIRLDHIHSNLQLVDIFTKPLDANSFEYLHAGLGVC

Q84VH6 Gag-pol polyprotein0.0e+0041.69Show/hide
Query:  MEIIREGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYD-PPMITVNGVLVP--KPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLIN
        M + +EG   +RPPILDG NY YWK RM+ F+K+LD + W+A++ G++ P M+   G      KPE  WT  E++ ++GN++ALNA+FNGV+ NIF+LIN
Subjt:  MEIIREGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYD-PPMITVNGVLVP--KPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLIN

Query:  -------------SSYEGTSKVKIVRLQ----EFLKLQM-----------------NLCCSV-KKISDTKIVRKVLRSLPRKFDMKVTAIEEAHCITTLR
                     +++EGTSKVK+ RLQ    +F  L+M                 N C ++ ++++D K+VRK+LRSLP++FDMKVTAIEEA  I  +R
Subjt:  -------------SSYEGTSKVKIVRLQ----EFLKLQM-----------------NLCCSV-KKISDTKIVRKVLRSLPRKFDMKVTAIEEAHCITTLR

Query:  LDELFSSLLTFEMATADRENKKGKGIDFKSTHEGEAAV--SDTEANMDESIALLTKQFINVLRKF-----------------------------------
        +DEL  SL TFE+  +DR  KK K + F S  EGE      DT+  +  ++  L KQF  VL +                                    
Subjt:  LDELFSSLLTFEMATADRENKKGKGIDFKSTHEGEAAV--SDTEANMDESIALLTKQFINVLRKF-----------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------KNTNATGTMKIL------IEEVMVILKKRKVTRGF-------------------------------------------
                              K  N T ++K+L      +++V+ + KK    RG                                            
Subjt:  ----------------------KNTNATGTMKIL------IEEVMVILKKRKVTRGF-------------------------------------------

Query:  ---------------------------------SG----------VGNLEV--------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGK
                                         SG          + +L V                    HMTG + +  N++ C T +VTF DG+KGK
Subjt:  ---------------------------------SG----------VGNLEV--------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGK

Query:  IMAKGNIDKNNLPRLNDVRYVDGLKANLICISQLCDQGYKVSFDDIGCVVMNKENQICMRGKRQTDNCYHW---NSNTSYTCHLTRSDQTLSNW------
        I   G +    LP LN V  V GL  NLI ISQLCD+G+ V+F    C+V N+++++ M+G R  DNCY W    S+ S TC  ++ D+ +  W      
Subjt:  IMAKGNIDKNNLPRLNDVRYVDGLKANLICISQLCDQGYKVSFDDIGCVVMNKENQICMRGKRQTDNCYHW---NSNTSYTCHLTRSDQTLSNW------

Query:  -----------------------------------QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVE
                                           ++ + +H+ L+   T+RVL+LLHMDLM PMQ ESLGGKRY  VVVDD+SR+TWV F++ K+DT E
Subjt:  -----------------------------------QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVE

Query:  ICKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIR
        + K L L+LQREK+  I RIRSD+G+EF+N  F  FC  EGI HEFSA ITPQQNG+VERKNRTLQE ARVM+HAK LP   WAEA+NTAC+IHNRVT+R
Subjt:  ICKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIR

Query:  TRTTVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVMID----------EED-ETSNMSEA
          T  TLYE+WK RKP VK+FH+FGS CYILADRE R+K D +S+ GIFLGYS NSRAYRVFN+++ +VME+INV++D          EED  TS  + A
Subjt:  TRTTVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVMID----------EED-ETSNMSEA

Query:  RTTSTVE-VSKADNPSDDLGKNLEKPSKEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALKDE-----
         T  + E    +D+ +D+   N+ +P K              ++K HP   IIGDP+ G+ T+ +E    +++V + C+   I+P  V  AL DE     
Subjt:  RTTSTVE-VSKADNPSDDLGKNLEKPSKEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALKDE-----

Query:  ---------RNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLN
                 RN +W LV +PEG N+IGTKW+FKNKTNE G +T+NKARLVAQGYTQ+EGVDFDETFAPVARLE+IRLLL ++CI KFKLYQMDVKSAFLN
Subjt:  ---------RNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLN

Query:  GYLNEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDDIIFGGFPQDFVHNFINIMQ
        GYLNEE YV QPKGFVD  HP HVY+L KALYGLKQAPRAWYERLT +L  +GY +G IDKTLF+ +++  L++AQIYVDDI+FGG   + + +F+  MQ
Subjt:  GYLNEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDDIIFGGFPQDFVHNFINIMQ

Query:  SEFEMSMVGELSCFL---------------EKYAKDMIKKFGLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSIIGSLLYLTASRPDISYAVGICARY
        SEFEMS+VGEL+ FL                KYAK+++KKFG+  A +K T   TH+KL ++  G  VD  LYRS+IGSLLYLTASRPDI+YAVG+CARY
Subjt:  SEFEMSMVGELSCFL---------------EKYAKDMIKKFGLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSIIGSLLYLTASRPDISYAVGICARY

Query:  QADPRISHLEAVKRIL------------------------------------------------------NKKRNCVSLSTVEAEYIAVGSGCTQLIWMK
        QA+P+ISHL  VKRIL                                                      +KK+NCVSLST EAEYIA GS C+QL+WMK
Subjt:  QADPRISHLEAVKRIL------------------------------------------------------NKKRNCVSLSTVEAEYIAVGSGCTQLIWMK

Query:  NMLHEYGFDQDTMMLYCDNMSAVDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIHSNLQLVDIFTKPLDANSFEYLHAGLGVC
         ML EY  +QD M LYCDNMSA++ISKNPVQHSRTKHIDIRHH+IRELV++KVI L+H+ +  Q+ DIFTK LDA  FE L   LG+C
Subjt:  NMLHEYGFDQDTMMLYCDNMSAVDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIHSNLQLVDIFTKPLDANSFEYLHAGLGVC

Q84VH8 Gag-pol polyprotein0.0e+0041.69Show/hide
Query:  MEIIREGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYD-PPMITVNGVLVP--KPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLIN
        M + +EG   +RPPILDG NY YWK RM+ F+K+LD + W+A++ G++ P M+   G      KPE  WT  E++ ++GN++ALNA+FNGV+ NIF+LIN
Subjt:  MEIIREGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYD-PPMITVNGVLVP--KPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLIN

Query:  S-------------SYEGTSKVKIVRLQ----EFLKLQM-----------------NLCCSV-KKISDTKIVRKVLRSLPRKFDMKVTAIEEAHCITTLR
        +             ++EGTSKVK+ RLQ    +F  L+M                 N C ++ ++I+D K+VRK+LRSLP++FDMKVTAIEEA  I  +R
Subjt:  S-------------SYEGTSKVKIVRLQ----EFLKLQM-----------------NLCCSV-KKISDTKIVRKVLRSLPRKFDMKVTAIEEAHCITTLR

Query:  LDELFSSLLTFEMATADRENKKGKGIDFKSTHEGEAAV--SDTEANMDESIALLTKQFINVLRKF-----------------------------------
        +DEL  SL TFE+  +DR  KK K + F S  EGE      DT+  +  ++ LL KQF  VL +                                    
Subjt:  LDELFSSLLTFEMATADRENKKGKGIDFKSTHEGEAAV--SDTEANMDESIALLTKQFINVLRKF-----------------------------------

Query:  ------------------------------------------KNTNA-TG--------------------------------------------------
                                                  ++ NA TG                                                  
Subjt:  ------------------------------------------KNTNA-TG--------------------------------------------------

Query:  ----------------------------TMKIL------IEEVMVILKKRKVTRG------FSG------------------------------------
                                    ++K+L      ++EV+++ K     RG      F+G                                    
Subjt:  ----------------------------TMKIL------IEEVMVILKKRKVTRG------FSG------------------------------------

Query:  ---------VGNLEV-------------------------------------------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGKI
                  G+++                                                        HMTG + +  N++ C T +VTF DG+KGKI
Subjt:  ---------VGNLEV-------------------------------------------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGKI

Query:  MAKGNIDKNNLPRLNDVRYVDGLKANLICISQLCDQGYKVSFDDIGCVVMNKENQICMRGKRQTDNCYHWN-SNTSY--TCHLTRSDQTLSNW-------
        +  G +  + LP LN V  V GL ANLI ISQLCD+G+ V+F    C+V N+++++ M+G R  DNCY W    TSY  TC  ++ D+ +  W       
Subjt:  MAKGNIDKNNLPRLNDVRYVDGLKANLICISQLCDQGYKVSFDDIGCVVMNKENQICMRGKRQTDNCYHWN-SNTSY--TCHLTRSDQTLSNW-------

Query:  ----------------------------------QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEI
                                          ++ + +H+ L+   T+RVL+LLHMDLM PMQ ESLGGKRY  VVVDD+SR+TWV F++ K++T E+
Subjt:  ----------------------------------QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEI

Query:  CKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRT
         K L L+LQREK+  I RIRSD+G+EF+N  F  FC  EGI HEFSA ITPQQNG+VERKNRTLQE ARVM+HAK LP   WAEA+NTAC+IHNRVT+R 
Subjt:  CKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRT

Query:  RTTVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVMIDEEDETSNMSEARTTSTVEVSKAD
         T  TLYE+WK RKP+VK+FH+FGS CYILADRE R+K D +S+ GIFLGYS NSRAYRVFN+++ +VME+INV++D+              T+  + AD
Subjt:  RTTVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVMIDEEDETSNMSEARTTSTVEVSKAD

Query:  -NPSDDLGKNLEKPSKEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALKDE--------------RNN
           S +  +N +  + E    + +  +   ++K HP   IIGDP+ G+ T+ +E    +++V + C+   I+P  V  AL DE              RN 
Subjt:  -NPSDDLGKNLEKPSKEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALKDE--------------RNN

Query:  IWTLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQP
        +W LV +PEG N+IGTKW+FKNKTNE G +T+NKARLVAQGYTQ+EGVDFDETFAPVARLE+IRLLL ++CI KFKLYQMDVKSAFLNGYLNEEVYV QP
Subjt:  IWTLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQP

Query:  KGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDDIIFGGFPQDFVHNFINIMQSEFEMSMVGELS
        KGF D  HP HVY+L KALYGLKQAPRAWYERLT +L  +GY +G IDKTLF+ +++  L++AQIYVDDI+FGG   + + +F+  MQSEFEMS+VGEL+
Subjt:  KGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDDIIFGGFPQDFVHNFINIMQSEFEMSMVGELS

Query:  CFL---------------EKYAKDMIKKFGLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSIIGSLLYLTASRPDISYAVGICARYQADPRISHLEAV
         FL                +YAK+++KKFG+  A +K T   TH+KL ++  G  VD  LYRS+IGSLLYLTASRPDI+YAVG+CARYQA+P+ISHL  V
Subjt:  CFL---------------EKYAKDMIKKFGLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSIIGSLLYLTASRPDISYAVGICARYQADPRISHLEAV

Query:  KRIL------------------------------------------------------NKKRNCVSLSTVEAEYIAVGSGCTQLIWMKNMLHEYGFDQDT
        KRIL                                                      +KK+NCVSLST EAEYIA GS C+QL+WMK ML EY  +QD 
Subjt:  KRIL------------------------------------------------------NKKRNCVSLSTVEAEYIAVGSGCTQLIWMKNMLHEYGFDQDT

Query:  MMLYCDNMSAVDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIHSNLQLVDIFTKPLDANSFEYLHAGLGVC
        M LYCDNMSA++ISKNPVQHSRTKHIDIRHH+IR+LV++KVI L H+ +  Q+ DIFTK LDAN FE L   LG+C
Subjt:  MMLYCDNMSAVDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIHSNLQLVDIFTKPLDANSFEYLHAGLGVC

Q84VI2 Gag-pol polyprotein0.0e+0041.71Show/hide
Query:  MEIIREGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYD-PPMITVNGVLVP--KPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLIN
        M + +EG   +RPPILDG NY YWK RM+ F+K+LD + W+A++ G++ P M+   G      KPE  WT  E++ ++GN++ALNA+FNGV+ NIF+LIN
Subjt:  MEIIREGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYD-PPMITVNGVLVP--KPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLIN

Query:  -------------SSYEGTSKVKIVRLQ----EFLKLQM-----------------NLCCSV-KKISDTKIVRKVLRSLPRKFDMKVTAIEEAHCITTLR
                     S++EGTSKVK+ RLQ    +F  L+M                 N C ++ ++I+D K+VRK+LRSLP++FDMKVTAIEEA  I  +R
Subjt:  -------------SSYEGTSKVKIVRLQ----EFLKLQM-----------------NLCCSV-KKISDTKIVRKVLRSLPRKFDMKVTAIEEAHCITTLR

Query:  LDELFSSLLTFEMATADRENKKGKGIDFKSTHEGEAAV--SDTEANMDESIALLTKQFINVLRKF-----------------------------------
        +DEL  SL TFE+  +DR  KK K + F S  EGE      DT+  +  ++ LL KQF  VL +                                    
Subjt:  LDELFSSLLTFEMATADRENKKGKGIDFKSTHEGEAAV--SDTEANMDESIALLTKQFINVLRKF-----------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------KNTNATGTMKIL------IEEVMVILKKRKVTRGF--------------------------------------------
                             K  N T ++K+L      ++EV+++ K     RG                                             
Subjt:  ---------------------KNTNATGTMKIL------IEEVMVILKKRKVTRGF--------------------------------------------

Query:  -------SGVGNLEV-------------------------------------------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGKI
                  G+++                                                        HMTG + +  N++ C T +VTF DG+KGKI
Subjt:  -------SGVGNLEV-------------------------------------------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGKI

Query:  MAKGNIDKNNLPRLNDVRYVDGLKANLICISQLCDQGYKVSFDDIGCVVMNKENQICMRGKRQTDNCYHWN-SNTSY--TCHLTRSDQTLSNW-------
        +  G +  + LP LN V  V GL ANLI ISQLCD+G+ V+F    C+V N+++++ M+G R  DNCY W    TSY  TC  ++ D+ +  W       
Subjt:  MAKGNIDKNNLPRLNDVRYVDGLKANLICISQLCDQGYKVSFDDIGCVVMNKENQICMRGKRQTDNCYHWN-SNTSY--TCHLTRSDQTLSNW-------

Query:  ----------------------------------QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEI
                                          ++ + +H+ L+   T+RVL+LLHMDLM PMQ ESLGGKRY  VVVDD+SR+TWV F++ K+ T E+
Subjt:  ----------------------------------QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEI

Query:  CKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRT
         K L L+LQREK+  I RIRSD+G+EF+N  F  FC  EGI HEFSA ITPQQNG+VERKNRTLQE ARVM+HAK LP   WAEA+NTAC+IHNRVT+R 
Subjt:  CKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRT

Query:  RTTVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVMID----------EED-ETSNMSEAR
         T  TLYE+WK RKP+VK+FH+FGS CYILADRE R+K D +S+ GIFLGYS NSRAYRVFN+++ +VME+INV++D          EED  TS  + A 
Subjt:  RTTVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVMID----------EED-ETSNMSEAR

Query:  TTSTVE-VSKADNPSDDLGKNLEKPSKEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALKDE------
           + E    +D+ +D+   N+ +P K   T+         ++K HP   IIGDP+ G+ T+ +E    +++V + C+   I+P  V  AL DE      
Subjt:  TTSTVE-VSKADNPSDDLGKNLEKPSKEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALKDE------

Query:  --------RNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLNG
                RN +W LV +PEG N+IGTKW+FKNKTNE G +T+NKARLVAQGYTQ+EGVDFDETFAPVARLE+IRLLL ++CI KFKLYQMDVKSAFLNG
Subjt:  --------RNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLNG

Query:  YLNEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDDIIFGGFPQDFVHNFINIMQS
        YLNEEVYV QPKGF D  HP HVY+L KALYGLKQAPRAWYERLT +L  +GY +G IDKTLF+ +++  L++AQIYVDDI+FGG   + + +F+  MQS
Subjt:  YLNEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDDIIFGGFPQDFVHNFINIMQS

Query:  EFEMSMVGELSCFL---------------EKYAKDMIKKFGLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSIIGSLLYLTASRPDISYAVGICARYQ
        EFEMS+VGEL+ FL                +YAK+++KKFG+  A +K T   TH+KL ++  G  VD K YRS+IGSLLYLTASRPDI+YAVG+CARYQ
Subjt:  EFEMSMVGELSCFL---------------EKYAKDMIKKFGLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSIIGSLLYLTASRPDISYAVGICARYQ

Query:  ADPRISHLEAVKRIL------------------------------------------------------NKKRNCVSLSTVEAEYIAVGSGCTQLIWMKN
        A+P+ISHL  VKRIL                                                      +KK+NCVSLST EAEYIA GS C+QL+WMK 
Subjt:  ADPRISHLEAVKRIL------------------------------------------------------NKKRNCVSLSTVEAEYIAVGSGCTQLIWMKN

Query:  MLHEYGFDQDTMMLYCDNMSAVDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIHSNLQLVDIFTKPLDANSFEYLHAGLGVC
        ML EY  +QD M LYCDNMSA++ISKNPVQHSRTKHIDIRHH+IR+LV++KVI L H+ +  Q+ DIFTK LDAN FE L   LG+C
Subjt:  MLHEYGFDQDTMMLYCDNMSAVDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIHSNLQLVDIFTKPLDANSFEYLHAGLGVC

Q84VI4 Gag-pol polyprotein0.0e+0041.7Show/hide
Query:  MEIIREGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYD-PPMITVNGVLVP--KPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLIN
        M + +EG   +RPPILDG NY YWK RM+ F+K+LD + W+A++ G++ P M+   G      KPE  WT  E++ ++GN++ALNA+FNGV+ NIF+LIN
Subjt:  MEIIREGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYD-PPMITVNGVLVP--KPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLIN

Query:  S-------------SYEGTSKVKIVRLQ----EFLKLQM-----------------NLCCSV-KKISDTKIVRKVLRSLPRKFDMKVTAIEEAHCITTLR
        +             ++EGTSKVKI RLQ    +F  L+M                 N C ++ ++I+D K+VRK+LRSLP++FDMKVTAIEEA  I  +R
Subjt:  S-------------SYEGTSKVKIVRLQ----EFLKLQM-----------------NLCCSV-KKISDTKIVRKVLRSLPRKFDMKVTAIEEAHCITTLR

Query:  LDELFSSLLTFEMATADRENKKGKGIDFKSTHEGEAAVSD--TEANMDESIALLTKQFINVLRKF-----------------------------------
        +DEL  SL TFE+  +DR  KK K + F S  EGE    D  T+  +  ++ LL KQF  VL +                                    
Subjt:  LDELFSSLLTFEMATADRENKKGKGIDFKSTHEGEAAVSD--TEANMDESIALLTKQFINVLRKF-----------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------KNTNATGTMKIL------IEEVMVILKKRKVTRGF--------------------------------------------
                             K  N T ++K+L      ++EV+++ K     RG                                             
Subjt:  ---------------------KNTNATGTMKIL------IEEVMVILKKRKVTRGF--------------------------------------------

Query:  -------SGVGNLEV-----------------------------------------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGKIMA
                  G+++                                                      HMTG + +  N++ C T +VTF DG+KGKI+ 
Subjt:  -------SGVGNLEV-----------------------------------------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGKIMA

Query:  KGNIDKNNLPRLNDVRYVDGLKANLICISQLCDQGYKVSFDDIGCVVMNKENQICMRGKRQTDNCYHWN-SNTSY--TCHLTRSDQTLSNW---------
         G +  + LP LN V  V GL ANLI ISQLCD+G+ V+F    C+V N+++++ M+G R  DNCY W    TSY  TC  ++ D+ +  W         
Subjt:  KGNIDKNNLPRLNDVRYVDGLKANLICISQLCDQGYKVSFDDIGCVVMNKENQICMRGKRQTDNCYHWN-SNTSY--TCHLTRSDQTLSNW---------

Query:  --------------------------------QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEICK
                                        ++ + +H+ L+   T+RVL+LLHMDLM PMQ ESLGGKRY  VVVDD+SR+TWV F++ K++T E+ K
Subjt:  --------------------------------QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEICK

Query:  NLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRT
         L L+LQREK+  I RIRSD+G+EF+N     FC  EGI HEFSA ITPQQNG+VERKNRTLQE ARVM+HAK LP   WAEA+NTAC+IHNRVT+R  T
Subjt:  NLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRT

Query:  TVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVMID----------EED-ETSNMSEARTT
          TLYE+WK RKP+VK+FH+FGS CYILADRE R+K D +S+ GIFLGYS NSRAYRVFN+++ +VME+INV++D          EED  TS  + A   
Subjt:  TVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVMID----------EED-ETSNMSEARTT

Query:  STVE-VSKADNPSDDLGKNLEKPSKEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALKDE--------
         + E    +D+ +D+   N+ +P K   T+         ++K HP   IIGDP+ G+ T+ +E    +++V + C+   I+P  V  AL DE        
Subjt:  STVE-VSKADNPSDDLGKNLEKPSKEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALKDE--------

Query:  ------RNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLNGYL
              RN +W LV +PEG N+IGTKW+FKNKTNE G +T+NKARLVAQGYTQ+EGVDFDETFAPVARLE+IRLLL ++CI KFKLYQMDVKSAFLNGYL
Subjt:  ------RNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLNGYL

Query:  NEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDDIIFGGFPQDFVHNFINIMQSEF
        NEEVYV QPKGF D  HP HVY+L KALYGLKQAPRAWYERLT +L  +GY +G IDKTLF+ +++  L++AQIYVDDI+FGG   + + +F+  MQSEF
Subjt:  NEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDDIIFGGFPQDFVHNFINIMQSEF

Query:  EMSMVGELSCFL---------------EKYAKDMIKKFGLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSIIGSLLYLTASRPDISYAVGICARYQAD
        EMS+VGEL+ FL                +YAK+++KKFG+  A +K T   TH+KL ++  G  VD  LYRS+IGSLLYLTASRPDI+YAVG+CARYQA+
Subjt:  EMSMVGELSCFL---------------EKYAKDMIKKFGLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSIIGSLLYLTASRPDISYAVGICARYQAD

Query:  PRISHLEAVKRIL------------------------------------------------------NKKRNCVSLSTVEAEYIAVGSGCTQLIWMKNML
        P+ISHL  VKRIL                                                      +KK+NCVSLST EAEYIA GS C+QL+WMK ML
Subjt:  PRISHLEAVKRIL------------------------------------------------------NKKRNCVSLSTVEAEYIAVGSGCTQLIWMKNML

Query:  HEYGFDQDTMMLYCDNMSAVDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIHSNLQLVDIFTKPLDANSFEYLHAGLGVC
         EY  +QD M LYCDNMSA++ISKNPVQHSRTKHIDIRHH+IR+LV++KVI L H+ +  Q+ DIFTK LDAN FE L   LG+C
Subjt:  HEYGFDQDTMMLYCDNMSAVDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIHSNLQLVDIFTKPLDANSFEYLHAGLGVC

SwissProt top hitse value%identityAlignment
P04146 Copia protein9.6e-9227.64Show/hide
Query:  NWQKTRSTHKSLKE-CYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNEG
        N ++ R   K LK+  +  R L ++H D+  P+   +L  K Y ++ VD ++ Y   + +K K+D   + ++   K +     K+  +  DNG+E+ +  
Subjt:  NWQKTRSTHKSLKE-CYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNEG

Query:  FNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIR--TRTTVTLYELWKERKPNVKYFHVFGSICYI
           FC+ +GI +  + P TPQ NGV ER  RT+ E AR M+    L   FW EAV TA ++ NR+  R    ++ T YE+W  +KP +K+  VFG+  Y+
Subjt:  FNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIR--TRTTVTLYELWKERKPNVKYFHVFGSICYI

Query:  LADREYRQKWDARSEQGIFLGYSQNS-RAYRVFNNKSVSV-----------------METINVMIDEEDETSNM-------------SEARTTSTVEVSK
           +  + K+D +S + IF+GY  N  + +   N K +                    ET+ +   +E E  N              +E++    ++  K
Subjt:  LADREYRQKWDARSEQGIFLGYSQNS-RAYRVFNNKSVSV-----------------METINVMIDEEDETSNM-------------SEARTTSTVEVSK

Query:  ADNPSDDLGKNLEKPSKEII-------------------------------------------------TKKSELITFAHVKK---NHPTSS----IIGD
            S++  KN    S++II                                                  +  E  T  H+K+   ++PT +    II  
Subjt:  ADNPSDDLGKNLEKPSKEII-------------------------------------------------TKKSELITFAHVKK---NHPTSS----IIGD

Query:  PSAGMQTK-----RKEKIVYLKMVDDLCYTFTIKPSTVDS-ALKDER----------------NNIWTLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKA
         S  ++TK      +E     K+V +    F   P++ D    +D++                NN WT+  +PE  NI+ ++WVF  K NE G   + KA
Subjt:  PSAGMQTK-----RKEKIVYLKMVDDLCYTFTIKPSTVDS-ALKDER----------------NNIWTLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKA

Query:  RLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTI
        RLVA+G+TQ   +D++ETFAPVAR+ + R +L +      K++QMDVK+AFLNG L EE+Y+  P+G   S +  +V KL+KA+YGLKQA R W+E    
Subjt:  RLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTI

Query:  YLRGKGYSRGEIDKTLFIHRES--NQLLVAQIYVDDIIFGGFPQDFVHNFINIMQSEFEMSMVGELSCFL---------------EKYAKDMIKKFGLGQ
         L+   +    +D+ ++I  +   N+ +   +YVDD++        ++NF   +  +F M+ + E+  F+                 Y K ++ KF +  
Subjt:  YLRGKGYSRGEIDKTLFIHRES--NQLLVAQIYVDDIIFGGFPQDFVHNFINIMQSEFEMSMVGELSCFL---------------EKYAKDMIKKFGLGQ

Query:  AQNKWTLTATHVKL-IRNTDGAEVDHKLYRSIIGSLLY-LTASRPDISYAVGICARYQADPRISHLEAVKRIL---------------------------
             T   + +   + N+D  E  +   RS+IG L+Y +  +RPD++ AV I +RY +       + +KR+L                           
Subjt:  AQNKWTLTATHVKL-IRNTDGAEVDHKLYRSIIGSLLY-LTASRPDISYAVGICARYQADPRISHLEAVKRIL---------------------------

Query:  ------------------------------NKKRNCVSLSTVEAEYIAVGSGCTQLIWMKNMLHEYGFD-QDTMMLYCDNMSAVDISKNPVQHSRTKHID
                                       K++N V+ S+ EAEY+A+     + +W+K +L       ++ + +Y DN   + I+ NP  H R KHID
Subjt:  ------------------------------NKKRNCVSLSTVEAEYIAVGSGCTQLIWMKNMLHEYGFD-QDTMMLYCDNMSAVDISKNPVQHSRTKHID

Query:  IRHHFIRELVENKVIRLDHIHSNLQLVDIFTKPLDANSFEYLHAGLGV
        I++HF RE V+N VI L++I +  QL DIFTKPL A  F  L   LG+
Subjt:  IRHHFIRELVENKVIRLDHIHSNLQLVDIFTKPLDANSFEYLHAGLGV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.1e-11130.02Show/hide
Query:  VLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEGIHHEFSAPITP
        +L L++ D+  PM+ ES+GG +Y +  +DD SR  WV+ LK K    ++ +     ++RE  +K+ R+RSDNG E+ +  F  +C   GI HE + P TP
Subjt:  VLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEGIHHEFSAPITP

Query:  QQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRTTVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGY
        Q NGV ER NRT+ E  R M+    LP  FW EAV TAC++ NR             +W  ++ +  +  VFG   +    +E R K D +S   IF+GY
Subjt:  QQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRTTVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGY

Query:  SQNSRAYRVFNNKSVSVMETINVMIDEED--ETSNMSEARTTSTV-----------EVSKADNPSDDLGKNLEKPSKEIITKKSELITFAHVKKNHPTSS
              YR+++     V+ + +V+  E +    ++MSE      +             + A++ +D++ +  E+P +  + ++ E +     +  HPT  
Subjt:  SQNSRAYRVFNNKSVSVMETINVMIDEED--ETSNMSEARTTSTV-----------EVSKADNPSDDLGKNLEKPSKEIITKKSELITFAHVKKNHPTSS

Query:  ------IIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALKD-----------------ERNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGCV
              +       ++++R     Y+ + DD       +P ++   L                   ++N  + LV  P+G   +  KWVFK K +    +
Subjt:  ------IIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALKD-----------------ERNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGCV

Query:  TKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAWY
         + KARLV +G+ Q +G+DFDE F+PV ++ +IR +L ++     ++ Q+DVK+AFL+G L EE+Y+ QP+GF  +     V KL+K+LYGLKQAPR WY
Subjt:  TKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAWY

Query:  ERLTIYLRGKGYSRGEIDKTLFIHRES-NQLLVAQIYVDDIIFGGFPQDFVHNFINIMQSEFEMSMVGELSCFL-----------------EKYAKDMIK
         +   +++ + Y +   D  ++  R S N  ++  +YVDD++  G  +  +      +   F+M  +G     L                 EKY + +++
Subjt:  ERLTIYLRGKGYSRGEIDKTLFIHRES-NQLLVAQIYVDDIIFGGFPQDFVHNFINIMQSEFEMSMVGELSCFL-----------------EKYAKDMIK

Query:  KFGLGQAQNKWTLTATHVKLIRNTDGAEVDHK------LYRSIIGSLLY-LTASRPDISYAVGICARYQADPRISHLEAVKRIL----------------
        +F +  A+   T  A H+KL +      V+ K       Y S +GSL+Y +  +RPDI++AVG+ +R+  +P   H EAVK IL                
Subjt:  KFGLGQAQNKWTLTATHVKLIRNTDGAEVDHK------LYRSIIGSLLY-LTASRPDISYAVGICARYQADPRISHLEAVKRIL----------------

Query:  -------------------------------------NKKRNCVSLSTVEAEYIAVGSGCTQLIWMKNMLHEYGFDQDTMMLYCDNMSAVDISKNPVQHS
                                             +K + CV+LST EAEYIA      ++IW+K  L E G  Q   ++YCD+ SA+D+SKN + H+
Subjt:  -------------------------------------NKKRNCVSLSTVEAEYIAVGSGCTQLIWMKNMLHEYGFDQDTMMLYCDNMSAVDISKNPVQHS

Query:  RTKHIDIRHHFIRELVENKVIRLDHIHSNLQLVDIFTKPLDANSFE
        RTKHID+R+H+IRE+V+++ +++  I +N    D+ TK +  N FE
Subjt:  RTKHIDIRHHFIRELVENKVIRLDHIHSNLQLVDIFTKPLDANSFE

P25600 Putative transposon Ty5-1 protein YCL074W1.5e-2331.72Show/hide
Query:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDDIIFGGFPQDF
        MDV +AFLN  ++E +YV QP GFV+  +P +V++L   +YGLKQAP  W E +   L+  G+ R E +  L+    S+  +   +YVDD++        
Subjt:  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDDIIFGGFPQDF

Query:  VHNFINIMQSEFEMSMVGELSCFL------------EKYAKDMIKKFGLGQAQNKWTLTATHV----KLIRNTDGAEVDHKLYRSIIGSLLY-LTASRPD
               +   + M  +G++  FL                +D I K       N + LT T +     L   T     D   Y+SI+G LL+     RPD
Subjt:  VHNFINIMQSEFEMSMVGELSCFL------------EKYAKDMIKKFGLGQAQNKWTLTATHV----KLIRNTDGAEVDHKLYRSIIGSLLY-LTASRPD

Query:  ISYAVGICARYQADPRISHLEAVKRIL
        ISY V + +R+  +PR  HLE+ +R+L
Subjt:  ISYAVGICARYQADPRISHLEAVKRIL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.9e-8826.76Show/hide
Query:  VTFSDGAKGKIMAKGNIDKNNLPR---LNDVRYVDGLKANLICISQLCD-QGYKVSFDDIGCVVMNKENQICMRGKRQTDNCYHWNSNTSYTCHL---TR
        V  +DG+   I   G+   +   R   L+++ YV  +  NLI + +LC+  G  V F      V +    + +   +  D  Y W   +S    L     
Subjt:  VTFSDGAKGKIMAKGNIDKNNLPR---LNDVRYVDGLKANLICISQLCD-QGYKVSFDDIGCVVMNKENQICMRGKRQTDNCYHWNSNTSYTCHL---TR

Query:  SDQTLSNWQK---------------------TRSTHK--SLKECYTNRVLKL----LHMDLMRPMQ----------TESLGGKRYVLVVVDDYSRYTWVF
        S  T S+W                          +HK  S  +C  N+  K+      ++  RP++            S    RY ++ VD ++RYTW++
Subjt:  SDQTLSNWQK---------------------TRSTHK--SLKECYTNRVLKL----LHMDLMRPMQ----------TESLGGKRYVLVVVDDYSRYTWVF

Query:  FLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTA
         LK K+   E        L+   + +I    SDNG EF       +    GI H  S P TP+ NG+ ERK+R + E    ++   ++P  +W  A   A
Subjt:  FLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTA

Query:  CHIHNRVTIRTRTTVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAY--------RVFNNKSV----------SVMET
         ++ NR+        + ++      PN     VFG  CY       + K D +S Q +FLGYS    AY        R++ ++ V          + + T
Subjt:  CHIHNRVTIRTRTTVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAY--------RVFNNKSV----------SVMET

Query:  INVMIDEEDETSNMSEARTT--------------------------------STVEVSKAD---------------------------------------
        ++ + ++  E+S +    TT                                S V  S  D                                       
Subjt:  INVMIDEEDETSNMSEARTT--------------------------------STVEVSKAD---------------------------------------

Query:  ----NPSDD----LGKNLEKPSKEIITKKSELITFAHVKKNHPT--SSIIGDP----------------SAGMQTKRKEKIVYLKMVDDLCYTFTI--KP
            NP+++    L ++L  P++   +  S   T A      PT  S +I  P                +  M T+ K  I+       L  +     +P
Subjt:  ----NPSDD----LGKNLEKPSKEIITKKSELITFAHVKKNHPT--SSIIGDP----------------SAGMQTKRKEKIVYLKMVDDLCYTFTI--KP

Query:  STVDSALKDER--------------NNIWTLVSKPEG-VNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCI
         T   ALKDER              N+ W LV  P   V I+G +W+F  K N  G + + KARLVA+GY Q  G+D+ ETF+PV +  +IR++L ++  
Subjt:  STVDSALKDER--------------NNIWTLVSKPEG-VNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCI

Query:  QKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDDIIF
        + + + Q+DV +AFL G L ++VY++QP GF+D + P +V KL KALYGLKQAPRAWY  L  YL   G+     D +LF+ +    ++   +YVDDI+ 
Subjt:  QKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDDIIF

Query:  GGFPQDFVHNFINIMQSEFEMSMVGELSCFL---------------EKYAKDMIKKFGLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSIIGSLLYLT
         G     +HN ++ +   F +    EL  FL                +Y  D++ +  +  A+   T  A   KL   +     D   YR I+GSL YL 
Subjt:  GGFPQDFVHNFINIMQSEFEMSMVGELSCFL---------------EKYAKDMIKKFGLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSIIGSLLYLT

Query:  ASRPDISYAVGICARYQADPRISHLEAVKRILN------------KKRNCVSL------------------------------------------STVEA
         +RPDISYAV   +++   P   HL+A+KRIL             KK N +SL                                          S+ EA
Subjt:  ASRPDISYAVGICARYQADPRISHLEAVKRILN------------KKRNCVSL------------------------------------------STVEA

Query:  EYIAVGSGCTQLIWMKNMLHEYGFD-QDTMMLYCDNMSAVDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIHSNLQLVDIFTKPLDANSFEYLHA
        EY +V +  +++ W+ ++L E G       ++YCDN+ A  +  NPV HSR KHI I +HFIR  V++  +R+ H+ ++ QL D  TKPL   +F+   +
Subjt:  EYIAVGSGCTQLIWMKNMLHEYGFD-QDTMMLYCDNMSAVDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIHSNLQLVDIFTKPLDANSFEYLHA

Query:  GLGVCR
         +GV R
Subjt:  GLGVCR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.2e-8726.3Show/hide
Query:  VTFSDGAKGKIMAKGNIDKNNLPR---LNDVRYVDGLKANLICISQLCDQG-YKVSFDDIGCVVMNKENQICMRGKRQTDNCYHWNSNTSYTCHLTR---
        V  +DG+   I   G+       R   LN V YV  +  NLI + +LC+     V F      V +    + +   +  D  Y W   +S    +     
Subjt:  VTFSDGAKGKIMAKGNIDKNNLPR---LNDVRYVDGLKANLICISQLCDQG-YKVSFDDIGCVVMNKENQICMRGKRQTDNCYHWNSNTSYTCHLTR---

Query:  SDQTLSNWQK---------------------TRSTHK--SLKECYTNRVLKL----LHMDLMRPMQ----------TESLGGKRYVLVVVDDYSRYTWVF
        S  T S+W                          +HK  S  +C+ N+  K+      +   +P++            S+   RY ++ VD ++RYTW++
Subjt:  SDQTLSNWQK---------------------TRSTHK--SLKECYTNRVLKL----LHMDLMRPMQ----------TESLGGKRYVLVVVDDYSRYTWVF

Query:  FLKGKT---DTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAV
         LK K+   DT  I K+L   ++   + +I  + SDNG EF       +    GI H  S P TP+ NG+ ERK+R + EM   ++   ++P  +W  A 
Subjt:  FLKGKT---DTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAV

Query:  NTACHIHNRVTIRTRTTVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAY--------RVFNNKSVSVME------TI
        + A ++ NR+        + ++    + PN +   VFG  CY       R K + +S+Q  F+GYS    AY        R++ ++ V   E      T 
Subjt:  NTACHIHNRVTIRTRTTVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAY--------RVFNNKSVSVME------TI

Query:  NVMIDEEDETSNMSEARTTS-----------------------------------TVEVSKADNPSDDLGK-----------------------------
        N  +    E  + S     S                                   T +VS ++ PS  +                               
Subjt:  NVMIDEEDETSNMSEARTTS-----------------------------------TVEVSKADNPSDDLGK-----------------------------

Query:  -------NLEKPSKEIITKKSEL----ITFAHV--------KKNHPTSSIIGDP---------------------SAGMQTKRKEKIVYLKMVDDLCYTF
               N   PS     + S L    I+  H+        + N P+SS    P                     +  M T+ K+ I   K      Y  
Subjt:  -------NLEKPSKEIITKKSEL----ITFAHV--------KKNHPTSSIIGDP---------------------SAGMQTKRKEKIVYLKMVDDLCYTF

Query:  TI----KPSTVDSALKDER--------------NNIWTLV-SKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIR
        ++    +P T   A+KD+R              N+ W LV   P  V I+G +W+F  K N  G + + KARLVA+GY Q  G+D+ ETF+PV +  +IR
Subjt:  TI----KPSTVDSALKDER--------------NNIWTLV-SKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIR

Query:  LLLDISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQ
        ++L ++  + + + Q+DV +AFL G L +EVY++QP GFVD + P +V +L KA+YGLKQAPRAWY  L  YL   G+     D +LF+ +    ++   
Subjt:  LLLDISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQ

Query:  IYVDDIIFGGFPQDFVHNFINIMQSEFEMSMVGELSCFL---------------EKYAKDMIKKFGLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSI
        +YVDDI+  G     + + ++ +   F +    +L  FL                +Y  D++ +  +  A+   T  AT  KL  ++     D   YR I
Subjt:  IYVDDIIFGGFPQDFVHNFINIMQSEFEMSMVGELSCFL---------------EKYAKDMIKKFGLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSI

Query:  IGSLLYLTASRPDISYAVGICARYQADPRISHLEAVKRILN------------KKRNCVSL---------------------------------------
        +GSL YL  +RPD+SYAV   ++Y   P   H  A+KR+L             KK N +SL                                       
Subjt:  IGSLLYLTASRPDISYAVGICARYQADPRISHLEAVKRILN------------KKRNCVSL---------------------------------------

Query:  ---STVEAEYIAVGSGCTQLIWMKNMLHEYGFD-QDTMMLYCDNMSAVDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIHSNLQLVDIFTKPLDA
           S+ EAEY +V +  ++L W+ ++L E G       ++YCDN+ A  +  NPV HSR KHI + +HFIR  V++  +R+ H+ ++ QL D  TKPL  
Subjt:  ---STVEAEYIAVGSGCTQLIWMKNMLHEYGFD-QDTMMLYCDNMSAVDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIHSNLQLVDIFTKPLDA

Query:  NSFEYLHAGLGVCR
         +F+     +GV +
Subjt:  NSFEYLHAGLGVCR

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.3e-5929.21Show/hide
Query:  SNMSEARTTSTVEVSKADNPSDDLGKNLEKPSKEIITKKSELITFAHVKKNHPTSSI-IGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALK-
        S+   + ++S++++     PS ++  ++ +PS     +++    +      H  +S+ I D S  +  ++   + +  +V   C     +PST + A + 
Subjt:  SNMSEARTTSTVEVSKADNPSDDLGKNLEKPSKEIITKKSELITFAHVKKNHPTSSI-IGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALK-

Query:  -------------DERNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDV
                      E  + W + + P     IG KWV+K K N  G + + KARLVA+GYTQ EG+DF ETF+PV +L +++L+L IS I  F L+Q+D+
Subjt:  -------------DERNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDV

Query:  KSAFLNGYLNEEVYVAQPKGFV----DSEHPKHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDDIIFGGFPQD
         +AFLNG L+EE+Y+  P G+     DS  P  V  L K++YGLKQA R W+ + ++ L G G+ +   D T F+   +   L   +YVDDII       
Subjt:  KSAFLNGYLNEEVYVAQPKGFV----DSEHPKHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDDIIFGGFPQD

Query:  FVHNFINIMQSEFEMSMVGELSCFL---------------EKYAKDMIKKFGLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSIIGSLLYLTASRPDI
         V    + ++S F++  +G L  FL                KYA D++ + GL   +         V    ++ G  VD K YR +IG L+YL  +R DI
Subjt:  FVHNFINIMQSEFEMSMVGELSCFL---------------EKYAKDMIKKFGLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSIIGSLLYLTASRPDI

Query:  SYAVGICARYQADPRISHLEAVKRIL------------------------------------------------------NKKRNCVSLSTVEAEYIAVG
        S+AV   +++   PR++H +AV +IL                                                      +KK+  VS S+ EAEY A+ 
Subjt:  SYAVGICARYQADPRISHLEAVKRIL------------------------------------------------------NKKRNCVSLSTVEAEYIAVG

Query:  SGCTQLIWMKNMLHEYGFD-QDTMMLYCDNMSAVDISKNPVQHSRTKHIDIRHHFIRE
            +++W+     E         +L+CDN +A+ I+ N V H RTKHI+   H +RE
Subjt:  SGCTQLIWMKNMLHEYGFD-QDTMMLYCDNMSAVDISKNPVQHSRTKHIDIRHHFIRE

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein5.3e-0530.7Show/hide
Query:  NRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRTTVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRV
        NRT+ E  R M+    LP  F A+A NTA HI N+            E+W +  P   Y   FG + YI  D     K   R+++G      +   +Y +
Subjt:  NRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRTTVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRV

Query:  FNNKSVSVMETINV
          N+ VS++ TI +
Subjt:  FNNKSVSVMETINV

ATMG00810.1 DNA/RNA polymerases superfamily protein4.7e-0931.21Show/hide
Query:  IYVDDIIFGGFPQDFVHNFINIMQSEFEMSMVGELSCFL---------------EKYAKDMIKKFGLGQAQNKWTLTATHVKLIRNTDGAEV-DHKLYRS
        +YVDDI+  G     ++  I  + S F M  +G +  FL                KYA+ ++   G+     K   T   +KL  +   A+  D   +RS
Subjt:  IYVDDIIFGGFPQDFVHNFINIMQSEFEMSMVGELSCFL---------------EKYAKDMIKKFGLGQAQNKWTLTATHVKLIRNTDGAEV-DHKLYRS

Query:  IIGSLLYLTASRPDISYAVGICARYQADPRISHLEAVKRIL
        I+G+L YLT +RPDISYAV I  +   +P ++  + +KR+L
Subjt:  IIGSLLYLTASRPDISYAVGICARYQADPRISHLEAVKRIL

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)5.7e-1540.8Show/hide
Query:  MQTKRKEKIVYLKMVDDLCYTFTIK--PSTVDSALKD--------------ERNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQV
        M T+ K  I  L     L  T TIK  P +V  ALKD               RN  W LV  P   NI+G KWVFK K +  G + + KARLVA+G+ Q 
Subjt:  MQTKRKEKIVYLKMVDDLCYTFTIK--PSTVDSALKD--------------ERNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQV

Query:  EGVDFDETFAPVARLEAIRLLLDIS
        EG+ F ET++PV R   IR +L+++
Subjt:  EGVDFDETFAPVARLEAIRLLLDIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGATAATCAGAGAAGGATCATCTGCATCACGTCCTCCTATTTTAGATGGTAAAAACTACTCATATTGGAAACCTCGAATGATCTTCTTCATTAAGACGTTAGACAG
AAAAGCTTGGAGAGCTCTTGTTGCGGGTTATGATCCTCCTATGATTACCGTAAATGGTGTTTTGGTTCCAAAACCTGAAGTTGGTTGGACCAATGCTGAAGAGCAAGCTT
CCGTTGGGAATGCTAGAGCACTTAACGCAATATTCAATGGTGTTAACCTGAACATTTTCAAGTTAATAAATTCTTCGTATGAAGGTACTTCCAAAGTAAAGATCGTAAGA
TTACAAGAGTTCTTGAAATTGCAAATGAATCTTTGCTGCTCGGTGAAAAAAATATCTGACACTAAAATAGTACGAAAAGTACTTCGATCCTTGCCCAGAAAATTTGATAT
GAAAGTAACTGCCATTGAGGAAGCTCATTGTATTACAACATTGAGACTTGATGAATTGTTTAGTTCGTTGCTTACGTTTGAGATGGCCACTGCTGATAGAGAAAATAAGA
AAGGCAAGGGAATTGATTTTAAGTCCACACATGAAGGTGAGGCGGCAGTAAGTGACACTGAAGCAAACATGGATGAGTCAATAGCTTTGTTAACAAAACAATTCATAAAC
GTCCTCCGAAAATTTAAAAATACGAATGCCACAGGAACAATGAAAATTCTAATAGAAGAAGTAATGGTTATATTAAAAAAAAGGAAGGTGACGAGAGGATTTTCAGGTGT
AGGGAATCTTGAGGTACATATGACTGGAAACAGATCCTACTTTACGAACTTAAAAGACTGTGTCACTGGACATGTTACCTTTAGTGATGGTGCAAAAGGAAAAATTATGG
CTAAAGGTAACATAGATAAAAATAATCTGCCACGTTTAAATGATGTTAGGTATGTGGATGGACTAAAAGCAAATTTGATCTGTATAAGTCAACTATGTGATCAAGGCTAC
AAAGTCAGTTTTGATGATATTGGTTGTGTTGTTATGAATAAAGAAAATCAGATTTGTATGAGAGGTAAACGACAAACTGATAACTGTTATCACTGGAACTCAAATACGTC
ATACACCTGTCATTTGACAAGATCAGATCAAACACTGTCAAATTGGCAAAAGACAAGGTCTACTCACAAAAGTCTGAAAGAATGTTATACCAATAGAGTCTTGAAACTGT
TACATATGGATCTCATGAGACCAATGCAAACAGAAAGTCTGGGAGGAAAGAGGTACGTGTTGGTTGTTGTTGATGATTACTCAAGATATACTTGGGTTTTCTTTCTCAAA
GGAAAAACAGATACTGTTGAAATATGTAAAAATCTGTGTTTGAAGCTACAACGTGAAAAAGAGAAAAAGATAACGAGGATCCGAAGTGATAATGGTAAGGAGTTTGATAA
TGAGGGCTTTAACAGTTTTTGTCTGTTAGAAGGAATACACCATGAATTTTCTGCACCTATAACTCCTCAACAAAATGGTGTAGTAGAAAGAAAAAACAGGACGTTACAAG
AGATGGCACGTGTTATGATACATGCAAAAAATTTACCTCTATGTTTTTGGGCAGAAGCTGTAAATACTGCCTGTCACATTCATAACAGGGTAACTATTAGGACTAGAACG
ACTGTTACTCTTTATGAACTTTGGAAAGAGAGAAAGCCAAATGTTAAATACTTCCATGTGTTTGGAAGTATATGTTATATCTTAGCTGACAGGGAATACCGTCAGAAATG
GGATGCTAGATCAGAACAAGGAATCTTTCTCGGGTACTCTCAGAACAGTCGGGCCTATAGAGTCTTCAATAACAAATCTGTGAGTGTTATGGAAACGATCAATGTAATGA
TAGATGAGGAAGATGAGACTTCAAACATGTCTGAAGCTAGAACTACGAGTACTGTAGAAGTTTCTAAAGCTGATAACCCATCAGATGATCTAGGTAAAAATTTGGAAAAA
CCATCAAAAGAAATTATCACTAAAAAATCAGAACTAATTACATTTGCTCATGTGAAGAAAAATCATCCAACAAGCTCTATAATAGGTGATCCGTCAGCTGGGATGCAGAC
CAAAAGGAAAGAAAAGATTGTTTACTTAAAGATGGTTGATGATTTATGTTATACTTTTACCATTAAACCTTCTACTGTTGACTCTGCTCTCAAGGATGAACGAAACAATA
TATGGACGTTAGTTTCCAAGCCAGAAGGTGTAAACATTATTGGCACTAAATGGGTATTTAAAAATAAGACTAATGAAGCTGGATGTGTGACAAAAAATAAAGCCAGATTA
GTAGCTCAAGGCTATACTCAAGTTGAAGGTGTTGACTTTGATGAAACGTTTGCTCCAGTTGCTCGACTTGAAGCCATTCGATTGTTACTTGATATATCATGCATACAGAA
ATTTAAATTATATCAGATGGATGTAAAGAGTGCTTTCTTAAATGGATATTTGAATGAAGAGGTTTATGTTGCTCAACCAAAAGGTTTTGTTGATTCCGAGCACCCAAAGC
ATGTGTATAAGCTCAGCAAAGCTCTATATGGGCTAAAGCAAGCTCCGAGAGCTTGGTATGAACGGCTAACTATTTACTTAAGAGGTAAAGGATATTCCAGAGGAGAAATT
GACAAGACCTTGTTTATACACAGGGAATCTAATCAACTTTTGGTTGCTCAAATTTATGTTGATGACATCATTTTTGGAGGTTTTCCTCAAGATTTTGTCCATAATTTCAT
TAACATCATGCAGTCAGAATTCGAAATGAGTATGGTTGGAGAACTTTCGTGCTTTTTGGAAAAGTATGCCAAGGATATGATTAAAAAGTTTGGTTTGGGACAGGCTCAAA
ATAAGTGGACTCTAACTGCGACACATGTTAAACTTATAAGAAATACTGATGGTGCTGAAGTTGATCACAAACTCTACAGGAGTATAATAGGCAGCTTATTGTATTTAACA
GCAAGTCGACCTGACATATCTTATGCTGTGGGAATATGTGCCCGTTATCAGGCTGATCCTCGCATCTCTCATCTAGAAGCTGTTAAACGAATTCTTAATAAGAAGCGAAA
CTGTGTCTCTTTATCTACAGTTGAAGCTGAATATATAGCAGTAGGCAGTGGTTGTACTCAGTTGATTTGGATGAAAAATATGTTGCATGAATATGGCTTTGATCAAGACA
CTATGATGTTGTATTGTGACAATATGAGCGCAGTTGATATATCGAAGAATCCTGTTCAACATAGTCGAACAAAGCACATTGACATAAGACATCATTTTATTCGAGAACTT
GTTGAAAATAAAGTAATTAGGCTTGATCATATTCATTCCAACTTACAATTAGTCGATATTTTCACTAAGCCTTTGGATGCAAATTCATTTGAATACTTACATGCTGGTTT
AGGAGTGTGTCGCACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGATAATCAGAGAAGGATCATCTGCATCACGTCCTCCTATTTTAGATGGTAAAAACTACTCATATTGGAAACCTCGAATGATCTTCTTCATTAAGACGTTAGACAG
AAAAGCTTGGAGAGCTCTTGTTGCGGGTTATGATCCTCCTATGATTACCGTAAATGGTGTTTTGGTTCCAAAACCTGAAGTTGGTTGGACCAATGCTGAAGAGCAAGCTT
CCGTTGGGAATGCTAGAGCACTTAACGCAATATTCAATGGTGTTAACCTGAACATTTTCAAGTTAATAAATTCTTCGTATGAAGGTACTTCCAAAGTAAAGATCGTAAGA
TTACAAGAGTTCTTGAAATTGCAAATGAATCTTTGCTGCTCGGTGAAAAAAATATCTGACACTAAAATAGTACGAAAAGTACTTCGATCCTTGCCCAGAAAATTTGATAT
GAAAGTAACTGCCATTGAGGAAGCTCATTGTATTACAACATTGAGACTTGATGAATTGTTTAGTTCGTTGCTTACGTTTGAGATGGCCACTGCTGATAGAGAAAATAAGA
AAGGCAAGGGAATTGATTTTAAGTCCACACATGAAGGTGAGGCGGCAGTAAGTGACACTGAAGCAAACATGGATGAGTCAATAGCTTTGTTAACAAAACAATTCATAAAC
GTCCTCCGAAAATTTAAAAATACGAATGCCACAGGAACAATGAAAATTCTAATAGAAGAAGTAATGGTTATATTAAAAAAAAGGAAGGTGACGAGAGGATTTTCAGGTGT
AGGGAATCTTGAGGTACATATGACTGGAAACAGATCCTACTTTACGAACTTAAAAGACTGTGTCACTGGACATGTTACCTTTAGTGATGGTGCAAAAGGAAAAATTATGG
CTAAAGGTAACATAGATAAAAATAATCTGCCACGTTTAAATGATGTTAGGTATGTGGATGGACTAAAAGCAAATTTGATCTGTATAAGTCAACTATGTGATCAAGGCTAC
AAAGTCAGTTTTGATGATATTGGTTGTGTTGTTATGAATAAAGAAAATCAGATTTGTATGAGAGGTAAACGACAAACTGATAACTGTTATCACTGGAACTCAAATACGTC
ATACACCTGTCATTTGACAAGATCAGATCAAACACTGTCAAATTGGCAAAAGACAAGGTCTACTCACAAAAGTCTGAAAGAATGTTATACCAATAGAGTCTTGAAACTGT
TACATATGGATCTCATGAGACCAATGCAAACAGAAAGTCTGGGAGGAAAGAGGTACGTGTTGGTTGTTGTTGATGATTACTCAAGATATACTTGGGTTTTCTTTCTCAAA
GGAAAAACAGATACTGTTGAAATATGTAAAAATCTGTGTTTGAAGCTACAACGTGAAAAAGAGAAAAAGATAACGAGGATCCGAAGTGATAATGGTAAGGAGTTTGATAA
TGAGGGCTTTAACAGTTTTTGTCTGTTAGAAGGAATACACCATGAATTTTCTGCACCTATAACTCCTCAACAAAATGGTGTAGTAGAAAGAAAAAACAGGACGTTACAAG
AGATGGCACGTGTTATGATACATGCAAAAAATTTACCTCTATGTTTTTGGGCAGAAGCTGTAAATACTGCCTGTCACATTCATAACAGGGTAACTATTAGGACTAGAACG
ACTGTTACTCTTTATGAACTTTGGAAAGAGAGAAAGCCAAATGTTAAATACTTCCATGTGTTTGGAAGTATATGTTATATCTTAGCTGACAGGGAATACCGTCAGAAATG
GGATGCTAGATCAGAACAAGGAATCTTTCTCGGGTACTCTCAGAACAGTCGGGCCTATAGAGTCTTCAATAACAAATCTGTGAGTGTTATGGAAACGATCAATGTAATGA
TAGATGAGGAAGATGAGACTTCAAACATGTCTGAAGCTAGAACTACGAGTACTGTAGAAGTTTCTAAAGCTGATAACCCATCAGATGATCTAGGTAAAAATTTGGAAAAA
CCATCAAAAGAAATTATCACTAAAAAATCAGAACTAATTACATTTGCTCATGTGAAGAAAAATCATCCAACAAGCTCTATAATAGGTGATCCGTCAGCTGGGATGCAGAC
CAAAAGGAAAGAAAAGATTGTTTACTTAAAGATGGTTGATGATTTATGTTATACTTTTACCATTAAACCTTCTACTGTTGACTCTGCTCTCAAGGATGAACGAAACAATA
TATGGACGTTAGTTTCCAAGCCAGAAGGTGTAAACATTATTGGCACTAAATGGGTATTTAAAAATAAGACTAATGAAGCTGGATGTGTGACAAAAAATAAAGCCAGATTA
GTAGCTCAAGGCTATACTCAAGTTGAAGGTGTTGACTTTGATGAAACGTTTGCTCCAGTTGCTCGACTTGAAGCCATTCGATTGTTACTTGATATATCATGCATACAGAA
ATTTAAATTATATCAGATGGATGTAAAGAGTGCTTTCTTAAATGGATATTTGAATGAAGAGGTTTATGTTGCTCAACCAAAAGGTTTTGTTGATTCCGAGCACCCAAAGC
ATGTGTATAAGCTCAGCAAAGCTCTATATGGGCTAAAGCAAGCTCCGAGAGCTTGGTATGAACGGCTAACTATTTACTTAAGAGGTAAAGGATATTCCAGAGGAGAAATT
GACAAGACCTTGTTTATACACAGGGAATCTAATCAACTTTTGGTTGCTCAAATTTATGTTGATGACATCATTTTTGGAGGTTTTCCTCAAGATTTTGTCCATAATTTCAT
TAACATCATGCAGTCAGAATTCGAAATGAGTATGGTTGGAGAACTTTCGTGCTTTTTGGAAAAGTATGCCAAGGATATGATTAAAAAGTTTGGTTTGGGACAGGCTCAAA
ATAAGTGGACTCTAACTGCGACACATGTTAAACTTATAAGAAATACTGATGGTGCTGAAGTTGATCACAAACTCTACAGGAGTATAATAGGCAGCTTATTGTATTTAACA
GCAAGTCGACCTGACATATCTTATGCTGTGGGAATATGTGCCCGTTATCAGGCTGATCCTCGCATCTCTCATCTAGAAGCTGTTAAACGAATTCTTAATAAGAAGCGAAA
CTGTGTCTCTTTATCTACAGTTGAAGCTGAATATATAGCAGTAGGCAGTGGTTGTACTCAGTTGATTTGGATGAAAAATATGTTGCATGAATATGGCTTTGATCAAGACA
CTATGATGTTGTATTGTGACAATATGAGCGCAGTTGATATATCGAAGAATCCTGTTCAACATAGTCGAACAAAGCACATTGACATAAGACATCATTTTATTCGAGAACTT
GTTGAAAATAAAGTAATTAGGCTTGATCATATTCATTCCAACTTACAATTAGTCGATATTTTCACTAAGCCTTTGGATGCAAATTCATTTGAATACTTACATGCTGGTTT
AGGAGTGTGTCGCACTTAA
Protein sequenceShow/hide protein sequence
MEIIREGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYDPPMITVNGVLVPKPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLINSSYEGTSKVKIVR
LQEFLKLQMNLCCSVKKISDTKIVRKVLRSLPRKFDMKVTAIEEAHCITTLRLDELFSSLLTFEMATADRENKKGKGIDFKSTHEGEAAVSDTEANMDESIALLTKQFIN
VLRKFKNTNATGTMKILIEEVMVILKKRKVTRGFSGVGNLEVHMTGNRSYFTNLKDCVTGHVTFSDGAKGKIMAKGNIDKNNLPRLNDVRYVDGLKANLICISQLCDQGY
KVSFDDIGCVVMNKENQICMRGKRQTDNCYHWNSNTSYTCHLTRSDQTLSNWQKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLK
GKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRT
TVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVMIDEEDETSNMSEARTTSTVEVSKADNPSDDLGKNLEK
PSKEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALKDERNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKARL
VAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEI
DKTLFIHRESNQLLVAQIYVDDIIFGGFPQDFVHNFINIMQSEFEMSMVGELSCFLEKYAKDMIKKFGLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSIIGSLLYLT
ASRPDISYAVGICARYQADPRISHLEAVKRILNKKRNCVSLSTVEAEYIAVGSGCTQLIWMKNMLHEYGFDQDTMMLYCDNMSAVDISKNPVQHSRTKHIDIRHHFIREL
VENKVIRLDHIHSNLQLVDIFTKPLDANSFEYLHAGLGVCRT