; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0004819 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0004819
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionGag-pol polyprotein
Genome locationchr04:18658048..18662974
RNA-Seq ExpressionPay0004819
SyntenyPay0004819
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAO73521.1 gag-pol polyprotein [Glycine max]0.0e+0044.62Show/hide
Query:  MDSIREGNSTSRPPLLDGENYGYWKSQMEAFLMSLNMRSWRAVISGWEYPTEKDEACQTV--RKSELKWTKDEDDAPVGNSRTLNALFNVVNPNIFKLIN
        M+  +EG   +RPP+LDG NY YWK++M AFL SL+ R+W+AVI GWE+P   D   +     K E  WTK+ED+  +GNS+ LNALFN V+ NIF+LIN
Subjt:  MDSIREGNSTSRPPLLDGENYGYWKSQMEAFLMSLNMRSWRAVISGWEYPTEKDEACQTV--RKSELKWTKDEDDAPVGNSRTLNALFNVVNPNIFKLIN

Query:  TCKSAKAAWDILEI---------------LTSQFEALQMGEDETITEFNVPVLDIANESDALGEKMYDSKLVRKVLRSLPFKFNMKVTAIKEANDLSKMK
        TC  AK AW+IL+I               L ++FE L+M E+E I +F++ +L+IAN   ALGE++ D KLVRK+LRSLP +F+MKVTAI+EA D+  M+
Subjt:  TCKSAKAAWDILEI---------------LTSQFEALQMGEDETITEFNVPVLDIANESDALGEKMYDSKLVRKVLRSLPFKFNMKVTAIKEANDLSKMK

Query:  LDELFGQSRI--------------------SDTSSSGHY----------------------------RKKEHERGKGTEASKSDKF----------GKGI
        +DEL G  +                     +D      Y                            R+K H +    +  K  K+           KGI
Subjt:  LDELFGQSRI--------------------SDTSSSGHY----------------------------RKKEHERGKGTEASKSDKF----------GKGI

Query:  RCHECEGFGHIQTECATYLKRKKKGMVATLSDEEDYSESDDEDLGMALISIC----TMNDEENVQTHDQPKSNNSTEDAEDRKKTKDQEVILQQQER---
        +CH CEG+GHI  EC T+LK+ +KG+    SD E   ESD +    AL  I       +D ++  T D+  ++        RK     E ILQQ+ +   
Subjt:  RCHECEGFGHIQTECATYLKRKKKGMVATLSDEEDYSESDDEDLGMALISIC----TMNDEENVQTHDQPKSNNSTEDAEDRKKTKDQEVILQQQER---

Query:  -IQDLVEENQSFLSSIVTLKEELAETKHQFEELLKFARMPTNGTSKLDDILDQGRRADDKRGLGFTERD---------TPGRGTEISRMS---------M
         I DL  E ++    I  LK E+     + E + K  +M   G+  LD++L  G+ A ++RGLGF  +           P +    + MS          
Subjt:  -IQDLVEENQSFLSSIVTLKEELAETKHQFEELLKFARMPTNGTSKLDDILDQGRRADDKRGLGFTERD---------TPGRGTEISRMS---------M

Query:  KSLNKRTRRICYFCG---------------------------------------------------WYFDSGCCRHMTGNADFFSDLIECKVGLVVFEDG
        +  +KR +  C++CG                                                   WY DSGC RHMTG  +F  ++  C    V F DG
Subjt:  KSLNKRTRRICYFCG---------------------------------------------------WYFDSGCCRHMTGNADFFSDLIECKVGLVVFEDG

Query:  GKGKIIGKGTINRLGLPFLLNVRLLQGLAANLISISQLCDKAI-----KSVSIKIDDAKVTL-----------------------CNLSKVEEAGLWHKR
         KGKIIG G +   GLP L  V L++GL ANLISISQLCD+       KS  +  ++    L                       C  SK +E  +WH+R
Subjt:  GKGKIIGKGTINRLGLPFLLNVRLLQGLAANLISISQLCDKAI-----KSVSIKIDDAKVTL-----------------------CNLSKVEEAGLWHKR

Query:  LGQLSGSTISKVTKADAIIDLPPLSFSSLERCSECPVGKQVKSVHKPVNIASTSHILELLHIDLMGPMQTKSLGRKRCVVVCVDDFSRYTWIKHV-----
         G L    + K+    A+  +P L       C EC +GKQVK  H+ +   +TS +LELLH+DLMGPMQ +SLG KR   V VDDFSR+TW+K +     
Subjt:  LGQLSGSTISKVTKADAIIDLPPLSFSSLERCSECPVGKQVKSVHKPVNIASTSHILELLHIDLMGPMQTKSLGRKRCVVVCVDDFSRYTWIKHV-----

Query:  -----KPYSLN-SKEKNTGIGRIQTDYGREFENQHFAEFYDNEGIFHEFSAPLKPQQNVVVERRNRTLQEMARVMIHAKHLPIQFWVEALNTACHIHNRV
             K  SL   +EK+  I RI++D+GREFEN    EF  +EGI HEFSA + PQQN +VER+NRTLQE ARVM+HAK LP   W EA+NTAC+IHNRV
Subjt:  -----KPYSLN-SKEKNTGIGRIQTDYGREFENQHFAEFYDNEGIFHEFSAPLKPQQNVVVERRNRTLQEMARVMIHAKHLPIQFWVEALNTACHIHNRV

Query:  ILRSGTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRNWDSKSDHGIFLGYSANSRAYRVYNQSSKTVMESINVIIDDLGKEPNRNLDDE------
         LR GT TT YE+WKGRKP+VK+FHIFGS C+IL+DR+ RR  D KSD GIFLGYS NSRAYRV+N  ++TVMESINV++DDL     ++++++      
Subjt:  ILRSGTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRNWDSKSDHGIFLGYSANSRAYRVYNQSSKTVMESINVIIDDLGKEPNRNLDDE------

Query:  --------------DDKLRVQLILQSGDLIPPTHITKNNPSSFIIGDIHSEIITRKKERKDYAKMVSNVCYTSSLEPTTISAALSDEHWILAMQEELLQF
                       D    +  +   D    T I K +P   IIGD +  + TR +E     ++VSN C+ S +EP  +  AL+DE WI AMQEEL QF
Subjt:  --------------DDKLRVQLILQSGDLIPPTHITKNNPSSFIIGDIHSEIITRKKERKDYAKMVSNVCYTSSLEPTTISAALSDEHWILAMQEELLQF

Query:  KRNQVWELMPKPPYANIIGTKWIFKNKMDEEGRVIR----------------------------KTIRLLLSYACFRRFKLFQMDVKSVFLNGYLFEEVY
        KRN+VWEL+P+P   N+IGTKWIFKNK +EEG + R                            ++IRLLL  AC  +FKL+QMDVKS FLNGYL EEVY
Subjt:  KRNQVWELMPKPPYANIIGTKWIFKNKMDEEGRVIR----------------------------KTIRLLLSYACFRRFKLFQMDVKSVFLNGYLFEEVY

Query:  VAQPKGFVDPVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIVQIYVDDILFGGTSS-----------GEFEMSMV
        V QPKGF DP H DHVY+L+KALYGLKQAPRAWYERL+ +L QQGY++G  D+T+F+ +   + +I QIYVDDI+FGG S+            EFEMS+V
Subjt:  VAQPKGFVDPVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIVQIYVDDILFGGTSS-----------GEFEMSMV

Query:  GELTFFLRFQIKQENIGIFFSQEKYAKNLISKFGMDKARSKRTPAATYLKMTKDTNGERVDTNLYRSIIGSLLYLTASRPDIAFAVGVCARYQADPRTSH
        GELT+FL  Q+KQ    IF SQ +YAKN++ KFGM+ A  KRTPA T+LK++KD  G  VD +LYRS+IGSLLYLTASRPDI +AVGVCARYQA+P+ SH
Subjt:  GELTFFLRFQIKQENIGIFFSQEKYAKNLISKFGMDKARSKRTPAATYLKMTKDTNGERVDTNLYRSIIGSLLYLTASRPDIAFAVGVCARYQADPRTSH

Query:  LHCAKRILKYISGTFNYGIWYTYDTTGTLVGNYDADWAGCTDDRKSTSGGCFFLGNNVTACFSKKQN---------------SYYSQLLWMKQMLDEYRI
        L   KRILKY++GT +YGI Y + +   LVG  DADWAG  DDRKSTSGGCF+LGNN+ + FSKKQN               S  SQL+WMKQML EY +
Subjt:  LHCAKRILKYISGTFNYGIWYTYDTTGTLVGNYDADWAGCTDDRKSTSGGCFFLGNNVTACFSKKQN---------------SYYSQLLWMKQMLDEYRI

Query:  TQSSMILYCDYLSAISISKNPVQHSLTKHINIRHHFTRELVEANIIRLEHVQSAFQLADIFTKPLDFATFEGLRASVGVC
         Q  M LYCD +SAI+ISKNPVQHS TKHI+IRHH+ R+LV+  +I L+HV +  Q+ADIFTK LD   FE LR  +G+C
Subjt:  TQSSMILYCDYLSAISISKNPVQHSLTKHINIRHHFTRELVEANIIRLEHVQSAFQLADIFTKPLDFATFEGLRASVGVC

AAO73527.1 gag-pol polyprotein [Glycine max]0.0e+0044.68Show/hide
Query:  MDSIREGNSTSRPPLLDGENYGYWKSQMEAFLMSLNMRSWRAVISGWEYPTEKDEACQTV--RKSELKWTKDEDDAPVGNSRTLNALFNVVNPNIFKLIN
        M+  +EG   +RPP+LDG NY YWK++M AFL SL+ R+W+AVI GWE+P   D   +     K E  WTK+ED+  +GNS+ LNALFN V+ NIF+LIN
Subjt:  MDSIREGNSTSRPPLLDGENYGYWKSQMEAFLMSLNMRSWRAVISGWEYPTEKDEACQTV--RKSELKWTKDEDDAPVGNSRTLNALFNVVNPNIFKLIN

Query:  TCKSAKAAWDILEI---------------LTSQFEALQMGEDETITEFNVPVLDIANESDALGEKMYDSKLVRKVLRSLPFKFNMKVTAIKEANDLSKMK
        TC  AK AW+IL+I               L ++FE L+M E+E I +F++ +L+IAN   ALGE++ D KLVRK+LRSLP +F+MKVTAI+EA D+  M+
Subjt:  TCKSAKAAWDILEI---------------LTSQFEALQMGEDETITEFNVPVLDIANESDALGEKMYDSKLVRKVLRSLPFKFNMKVTAIKEANDLSKMK

Query:  LDELFGQSRI--------------------SDTSSSGHY----------------------------RKKEHERGKGTEASKSDKF----------GKGI
        +DEL G  +                     +D      Y                            R+K H +    +  K  K+           KGI
Subjt:  LDELFGQSRI--------------------SDTSSSGHY----------------------------RKKEHERGKGTEASKSDKF----------GKGI

Query:  RCHECEGFGHIQTECATYLKRKKKGMVATLSDEEDYSESDDEDLGMALISICTMNDEENVQTHDQPKSNNSTED--AEDRKKTKDQEVILQQQER----I
        +CH CEG+GHI  EC T+LK+ +KG+    SD E   ESD +    AL  I      E  +      S  + ++  A  RK     E ILQQ+ +    I
Subjt:  RCHECEGFGHIQTECATYLKRKKKGMVATLSDEEDYSESDDEDLGMALISICTMNDEENVQTHDQPKSNNSTED--AEDRKKTKDQEVILQQQER----I

Query:  QDLVEENQSFLSSIVTLKEELAETKHQFEELLKFARMPTNGTSKLDDILDQGRRADDKRGLGFTER---------DTPGR---GTEISRM------SMKS
         DL  E ++    I  LK E+     + E + K  +M   G+  LD++L  G+ A ++RGLGF  +           P +   GT +S+       + + 
Subjt:  QDLVEENQSFLSSIVTLKEELAETKHQFEELLKFARMPTNGTSKLDDILDQGRRADDKRGLGFTER---------DTPGR---GTEISRM------SMKS

Query:  LNKRTRRICYFCG-----------------------------------------------------WYFDSGCCRHMTGNADFFSDLIECKVGLVVFEDG
         +KR +  C++CG                                                     WY DSGC RHMTG  +F  ++  C    V F DG
Subjt:  LNKRTRRICYFCG-----------------------------------------------------WYFDSGCCRHMTGNADFFSDLIECKVGLVVFEDG

Query:  GKGKIIGKGTINRLGLPFLLNVRLLQGLAANLISISQLCDKAI-----KSVSIKIDDAKVTL-----------------------CNLSKVEEAGLWHKR
         KGKIIG G +   GLP L  V L++GL ANLISISQLCD+       KS  +  ++    L                       C  SK +E  +WH+R
Subjt:  GKGKIIGKGTINRLGLPFLLNVRLLQGLAANLISISQLCDKAI-----KSVSIKIDDAKVTL-----------------------CNLSKVEEAGLWHKR

Query:  LGQLSGSTISKVTKADAIIDLPPLSFSSLERCSECPVGKQVKSVHKPVNIASTSHILELLHIDLMGPMQTKSLGRKRCVVVCVDDFSRYTWIKHV-----
         G L    + K+    A+  +P L       C EC +GKQVK  H+ +   +TS +LELLH+DLMGPMQ +SLG KR   V VDDFSR+TW+  +     
Subjt:  LGQLSGSTISKVTKADAIIDLPPLSFSSLERCSECPVGKQVKSVHKPVNIASTSHILELLHIDLMGPMQTKSLGRKRCVVVCVDDFSRYTWIKHV-----

Query:  -----KPYSLN-SKEKNTGIGRIQTDYGREFENQHFAEFYDNEGIFHEFSAPLKPQQNVVVERRNRTLQEMARVMIHAKHLPIQFWVEALNTACHIHNRV
             K  SL   +EK+  I RI++D+GREFEN  F EF  +EGI HEFSA + PQQN +VER+NRTLQE ARVM+HAK LP   W EA+NTAC+IHNRV
Subjt:  -----KPYSLN-SKEKNTGIGRIQTDYGREFENQHFAEFYDNEGIFHEFSAPLKPQQNVVVERRNRTLQEMARVMIHAKHLPIQFWVEALNTACHIHNRV

Query:  ILRSGTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRNWDSKSDHGIFLGYSANSRAYRVYNQSSKTVMESINVIIDDLGKEPNRNLDDE------
         LR GT TT YE+WKGRKP+VK+FHIFGS C+IL+DR+ RR  D KSD GIFLGYS NSRAYRV+N  ++TVMESINV++DDL     ++++++      
Subjt:  ILRSGTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRNWDSKSDHGIFLGYSANSRAYRVYNQSSKTVMESINVIIDDLGKEPNRNLDDE------

Query:  --------------DDKLRVQLILQSGDLIPPTHITKNNPSSFIIGDIHSEIITRKKERKDYAKMVSNVCYTSSLEPTTISAALSDEHWILAMQEELLQF
                       D    +  +   D    T I K +P   IIGD +  + TR +E     ++VSN C+ S +EP  +  AL+DE WI AMQEEL QF
Subjt:  --------------DDKLRVQLILQSGDLIPPTHITKNNPSSFIIGDIHSEIITRKKERKDYAKMVSNVCYTSSLEPTTISAALSDEHWILAMQEELLQF

Query:  KRNQVWELMPKPPYANIIGTKWIFKNKMDEEGRVIR----------------------------KTIRLLLSYACFRRFKLFQMDVKSVFLNGYLFEEVY
        KRN+VWEL+P+P   N+IGTKWIFKNK +EEG + R                            ++IRLLL  AC  +FKL+QMDVKS FLNGYL EEVY
Subjt:  KRNQVWELMPKPPYANIIGTKWIFKNKMDEEGRVIR----------------------------KTIRLLLSYACFRRFKLFQMDVKSVFLNGYLFEEVY

Query:  VAQPKGFVDPVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIVQIYVDDILFGGTSS-----------GEFEMSMV
        V QPKGF DP H DHVY+L+KALYGLKQAPRAWYERL+ +L QQGY++G  D+T+F+ +   + +I QIYVDDI+FGG S+            EFEMS+V
Subjt:  VAQPKGFVDPVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIVQIYVDDILFGGTSS-----------GEFEMSMV

Query:  GELTFFLRFQIKQENIGIFFSQEKYAKNLISKFGMDKARSKRTPAATYLKMTKDTNGERVDTNLYRSIIGSLLYLTASRPDIAFAVGVCARYQADPRTSH
        GELT+FL  Q+KQ    IF SQ +YAKN++ KFGM+ A  KRTPA T+LK++KD  G  VD +LYRS+IGSLLYLTASRPDI +AVGVCARYQA+P+ SH
Subjt:  GELTFFLRFQIKQENIGIFFSQEKYAKNLISKFGMDKARSKRTPAATYLKMTKDTNGERVDTNLYRSIIGSLLYLTASRPDIAFAVGVCARYQADPRTSH

Query:  LHCAKRILKYISGTFNYGIWYTYDTTGTLVGNYDADWAGCTDDRKSTSGGCFFLGNNVTACFSKKQN---------------SYYSQLLWMKQMLDEYRI
        L   KRILKY++GT +YGI Y + +   LVG  DADWAG  DDRKSTSGGCF+LGNN+ + FSKKQN               S  SQL+WMKQML EY +
Subjt:  LHCAKRILKYISGTFNYGIWYTYDTTGTLVGNYDADWAGCTDDRKSTSGGCFFLGNNVTACFSKKQN---------------SYYSQLLWMKQMLDEYRI

Query:  TQSSMILYCDYLSAISISKNPVQHSLTKHINIRHHFTRELVEANIIRLEHVQSAFQLADIFTKPLDFATFEGLRASVGVC
         Q  M LYCD +SAI+ISKNPVQHS TKHI+IRHH+ R+LV+  +I L+HV +  Q+ADIFTK LD   FE LR  +G+C
Subjt:  TQSSMILYCDYLSAISISKNPVQHSLTKHINIRHHFTRELVEANIIRLEHVQSAFQLADIFTKPLDFATFEGLRASVGVC

AAO73529.1 gag-pol polyprotein [Glycine max]0.0e+0044.42Show/hide
Query:  MDSIREGNSTSRPPLLDGENYGYWKSQMEAFLMSLNMRSWRAVISGWEYPTEKDEACQTVR--KSELKWTKDEDDAPVGNSRTLNALFNVVNPNIFKLIN
        M+  +EG   +RPP+LDG NY YWK++M AFL SL+ R+W+AVI GWE+P   D   +     K E  WTK+ED+  +GNS+ LNALFN V+ NIF+LIN
Subjt:  MDSIREGNSTSRPPLLDGENYGYWKSQMEAFLMSLNMRSWRAVISGWEYPTEKDEACQTVR--KSELKWTKDEDDAPVGNSRTLNALFNVVNPNIFKLIN

Query:  TCKSAKAAWDI---------------LEILTSQFEALQMGEDETITEFNVPVLDIANESDALGEKMYDSKLVRKVLRSLPFKFNMKVTAIKEANDLSKMK
        TC  AK AW+I               L++L ++FE L+M E+E I +F++ +L+IAN   ALGE+M D KLVRK+LRSLP +F+MKVTAI+EA D+  M+
Subjt:  TCKSAKAAWDI---------------LEILTSQFEALQMGEDETITEFNVPVLDIANESDALGEKMYDSKLVRKVLRSLPFKFNMKVTAIKEANDLSKMK

Query:  LDELFGQSRI--------------------SDTSSSGHY----------------------------RKKEHERG------KGTE-ASKSDK---FGKGI
        +DEL G  +                     +D      Y                            R+K H R       KG+E   KSD+     KGI
Subjt:  LDELFGQSRI--------------------SDTSSSGHY----------------------------RKKEHERG------KGTE-ASKSDK---FGKGI

Query:  RCHECEGFGHIQTECATYLKRKKKGMVATLSDEEDYSESDDEDLGMALISICTMNDEENVQTHDQPKSNNSTEDAEDRKKTKDQEVILQQQER----IQD
        +C  CEG+GHI+ EC T+LK+++KG+    SD+ +  +  D D  +  ++    + E++  T  +   +        R+     E ILQQ+ +    I +
Subjt:  RCHECEGFGHIQTECATYLKRKKKGMVATLSDEEDYSESDDEDLGMALISICTMNDEENVQTHDQPKSNNSTEDAEDRKKTKDQEVILQQQER----IQD

Query:  LVEENQSFLSSIVTLKEELAETKHQFEELLKFARMPTNGTSKLDDILDQGRRADDKRGLGFTERD---------TPGRGTEISRMS---------MKSLN
        L  E ++    I  LK E+     + E + K  +M   G+  LD++L  G++  ++RGLGF  +           P + +  + MS          +  +
Subjt:  LVEENQSFLSSIVTLKEELAETKHQFEELLKFARMPTNGTSKLDDILDQGRRADDKRGLGFTERD---------TPGRGTEISRMS---------MKSLN

Query:  KRTRRICYFCG-----------------------------------------------------WYFDSGCCRHMTGNADFFSDLIECKVGLVVFEDGGK
        KR +  C++CG                                                     WY DSGC RHMTG  +F  ++  C    V F DG K
Subjt:  KRTRRICYFCG-----------------------------------------------------WYFDSGCCRHMTGNADFFSDLIECKVGLVVFEDGGK

Query:  GKIIGKGTINRLGLPFLLNVRLLQGLAANLISISQLCDKAI-----KSVSIKIDDAKVTL-----------------------CNLSKVEEAGLWHKRLG
        GKI G G +   GLP L  V L++GL  NLISISQLCD+       KS  +  ++    L                       C  SK +E  +WH+R G
Subjt:  GKIIGKGTINRLGLPFLLNVRLLQGLAANLISISQLCDKAI-----KSVSIKIDDAKVTL-----------------------CNLSKVEEAGLWHKRLG

Query:  QLSGSTISKVTKADAIIDLPPLSFSSLERCSECPVGKQVKSVHKPVNIASTSHILELLHIDLMGPMQTKSLGRKRCVVVCVDDFSRYTWIKHV-------
         L    + K+    A+  +P L       C EC +GKQVK  H+ +   +TS +LELLH+DLMGPMQ +SLG KR   V VDDFSR+TW+  +       
Subjt:  QLSGSTISKVTKADAIIDLPPLSFSSLERCSECPVGKQVKSVHKPVNIASTSHILELLHIDLMGPMQTKSLGRKRCVVVCVDDFSRYTWIKHV-------

Query:  ---KPYSLN-SKEKNTGIGRIQTDYGREFENQHFAEFYDNEGIFHEFSAPLKPQQNVVVERRNRTLQEMARVMIHAKHLPIQFWVEALNTACHIHNRVIL
           K  SL   +EK+  I RI++D+GREFEN  F EF  +EGI HEFSA + PQQN +VER+NRTLQE ARVM+HAK LP   W EA+NTAC+IHNRV L
Subjt:  ---KPYSLN-SKEKNTGIGRIQTDYGREFENQHFAEFYDNEGIFHEFSAPLKPQQNVVVERRNRTLQEMARVMIHAKHLPIQFWVEALNTACHIHNRVIL

Query:  RSGTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRNWDSKSDHGIFLGYSANSRAYRVYNQSSKTVMESINVIIDDLGKEPNRNLDDE--------
        R GT TT YE+WKGRKP VK+FHIFGS C+IL+DR+ RR  D KSD GIFLGYS NSRAYRV+N  ++TVMESINV++DDL     ++++++        
Subjt:  RSGTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRNWDSKSDHGIFLGYSANSRAYRVYNQSSKTVMESINVIIDDLGKEPNRNLDDE--------

Query:  ------------DDKLRVQLILQSGDLIPPTHITKNNPSSFIIGDIHSEIITRKKERKDYAKMVSNVCYTSSLEPTTISAALSDEHWILAMQEELLQFKR
                     D    +  +   D  P   I K +P   IIGD +  + TR +E     ++VSN C+ S +EP  +  AL+DE WI AMQEEL QFKR
Subjt:  ------------DDKLRVQLILQSGDLIPPTHITKNNPSSFIIGDIHSEIITRKKERKDYAKMVSNVCYTSSLEPTTISAALSDEHWILAMQEELLQFKR

Query:  NQVWELMPKPPYANIIGTKWIFKNKMDEEGRVIR----------------------------KTIRLLLSYACFRRFKLFQMDVKSVFLNGYLFEEVYVA
        N+VWEL+P+P   N+IGTKWIFKNK +EEG + R                            ++IRLLL  AC  +FKL+QMDVKS FLNGYL EE YV 
Subjt:  NQVWELMPKPPYANIIGTKWIFKNKMDEEGRVIR----------------------------KTIRLLLSYACFRRFKLFQMDVKSVFLNGYLFEEVYVA

Query:  QPKGFVDPVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIVQIYVDDILFGGTSS-----------GEFEMSMVGE
        QPKGFVDP H DHVY+L+KALYGLKQAPRAWYERL+ +L QQGY++G  D+T+F+ +   + +I QIYVDDI+FGG S+            EFEMS+VGE
Subjt:  QPKGFVDPVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIVQIYVDDILFGGTSS-----------GEFEMSMVGE

Query:  LTFFLRFQIKQENIGIFFSQEKYAKNLISKFGMDKARSKRTPAATYLKMTKDTNGERVDTNLYRSIIGSLLYLTASRPDIAFAVGVCARYQADPRTSHLH
        LT+FL  Q+KQ    IF SQ KYAKN++ KFGM+ A  KRTPA T+LK++KD  G  VD +LYRS+IGSLLYLTASRPDI +AVGVCARYQA+P+ SHL+
Subjt:  LTFFLRFQIKQENIGIFFSQEKYAKNLISKFGMDKARSKRTPAATYLKMTKDTNGERVDTNLYRSIIGSLLYLTASRPDIAFAVGVCARYQADPRTSHLH

Query:  CAKRILKYISGTFNYGIWYTYDTTGTLVGNYDADWAGCTDDRKSTSGGCFFLGNNVTACFSKKQN---------------SYYSQLLWMKQMLDEYRITQ
          KRILKY++GT +YGI Y + +   LVG  DADWAG  DDRKSTSGGCF+LGNN+ + FSKKQN               S  SQL+WMKQML EY + Q
Subjt:  CAKRILKYISGTFNYGIWYTYDTTGTLVGNYDADWAGCTDDRKSTSGGCFFLGNNVTACFSKKQN---------------SYYSQLLWMKQMLDEYRITQ

Query:  SSMILYCDYLSAISISKNPVQHSLTKHINIRHHFTRELVEANIIRLEHVQSAFQLADIFTKPLDFATFEGLRASVGVC
          M LYCD +SAI+ISKNPVQHS TKHI+IRHH+ RELV+  +I LEHV +  Q+ADIFTK LD   FE LR  +G+C
Subjt:  SSMILYCDYLSAISISKNPVQHSLTKHINIRHHFTRELVEANIIRLEHVQSAFQLADIFTKPLDFATFEGLRASVGVC

KAA0059225.1 gag-pol polyprotein [Cucumis melo var. makuwa]0.0e+0065.71Show/hide
Query:  MDSIREGNSTSRPPLLDGENYGYWKSQMEAFLMSLNMRSWRAVISGWEYPTEKDEACQTVRKSELKWTKDEDDAPVGNSRTLNALFNVVNPNIFKLINTC
        MD IRE NSTSRP LLDG NYGYWKS+M+AFLMSL+MR                               +EDDA +GNSR LNAL NVV+PNIFKLINTC
Subjt:  MDSIREGNSTSRPPLLDGENYGYWKSQMEAFLMSLNMRSWRAVISGWEYPTEKDEACQTVRKSELKWTKDEDDAPVGNSRTLNALFNVVNPNIFKLINTC

Query:  KSAKAAWDILE----------------ILTSQFEALQMGEDETITEFNVPVLDIANESDALGEKMYDSKLVRKVLRSLPFKFNMKVTAIKEANDLSKMKL
        KSAKA WDILE                ILTS+FEALQMGE ETI EFNV VLDIANESDALGEKM DSKLVRKVLRSLP KFNMKVTAI+EANDLSKMKL
Subjt:  KSAKAAWDILE----------------ILTSQFEALQMGEDETITEFNVPVLDIANESDALGEKMYDSKLVRKVLRSLPFKFNMKVTAIKEANDLSKMKL

Query:  DELFG-------------------------------------------------QSRISDTSSSGHYRKKEHERGKGTEASKSDKFGKGIRCHECEGFGH
        DELFG                                                 Q RISDTSSSGH RKKEHERGK  +ASKSDK+GKGIRCHECEGFGH
Subjt:  DELFG-------------------------------------------------QSRISDTSSSGHYRKKEHERGKGTEASKSDKFGKGIRCHECEGFGH

Query:  IQTECATYLKRKKKGMVATLSDEEDYSESDDEDLGMALISICTMNDEENVQTHDQPKSNNSTEDAEDRKKTKDQEVILQQQERIQDLVEENQSFLSSIVT
        IQ ECATYLKRKKKGMVAT SDEEDYSESDDEDLGMALIS+CTMNDEENVQTHDQ +S N T D  +R K +DQEVILQQQERIQDLVEENQSFLSSIVT
Subjt:  IQTECATYLKRKKKGMVATLSDEEDYSESDDEDLGMALISICTMNDEENVQTHDQPKSNNSTEDAEDRKKTKDQEVILQQQERIQDLVEENQSFLSSIVT

Query:  LKEELAETKHQFEELLKFARMPTNGTSKLDDILDQGRRADDKRGLGFTERDTP---------------------GRGTEISRMSMKSLNKRTRRIC----
        LKEELA+TKHQFEELLKFARM T GTSKLDDILDQG RADDKRGL F ERDTP                     G+GTEI+ M  K L       C    
Subjt:  LKEELAETKHQFEELLKFARMPTNGTSKLDDILDQGRRADDKRGLGFTERDTP---------------------GRGTEISRMSMKSLNKRTRRIC----

Query:  --------------YFCGWYFDSGCCRHMTGNADFFSDLIECKVGLVVFEDGGKGKIIGKGTINRLGLPFLLNVRLLQGLAANLISISQLCDKAIKSVSI
                          WYFDSGC RHMTGNADFFS+L ECKVG VVF DGGKGKIIGKGTIN  GLPFLL+VRL+QGLAANLISISQLCD+  + VS 
Subjt:  --------------YFCGWYFDSGCCRHMTGNADFFSDLIECKVGLVVFEDGGKGKIIGKGTINRLGLPFLLNVRLLQGLAANLISISQLCDKAIKSVSI

Query:  KID-------------------------DAKVTLCNLSKVEEAGLWHKRLGQLSGSTISKVTKADAIIDLPPLSFSSLERCSECPVGKQVKSVHKPVNIA
          D                         DA+VTLCNLSKVEEA LWHKRLG LSG+TISKVTK DAII LPPL+F SLE CSEC  GKQVKSVHKPVNI+
Subjt:  KID-------------------------DAKVTLCNLSKVEEAGLWHKRLGQLSGSTISKVTKADAIIDLPPLSFSSLERCSECPVGKQVKSVHKPVNIA

Query:  STSHILELLHIDLMGPMQTKSLGRKRCVVVCVDDFSRYTWIKHV--KPYSLNS---------KEKNTGIGRIQTDYGREFENQHFAEFYDNEGIFHEFSA
        STSHILELLHIDLMGPMQT+SLGRK   VVCVDDFSRYTWIK +  KP +  +         +EKNTGIG+IQTD+G EFENQHFAEF DNEGIFHEFSA
Subjt:  STSHILELLHIDLMGPMQTKSLGRKRCVVVCVDDFSRYTWIKHV--KPYSLNS---------KEKNTGIGRIQTDYGREFENQHFAEFYDNEGIFHEFSA

Query:  PLKPQQNVVVERRNRTLQEMARVMIHAKHLPIQFWVEALNTACHIHNRVILRSGTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRNWDSKSDHGI
        PL  QQN V                           EALNTACHIHNRVILR GTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRR WDSKSD GI
Subjt:  PLKPQQNVVVERRNRTLQEMARVMIHAKHLPIQFWVEALNTACHIHNRVILRSGTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRNWDSKSDHGI

Query:  FLGYSANSRAYRVYNQSSKTVMESINVIIDDLG----KEPNRN-----LDDEDDKLRVQLILQSG---------------------------------DL
        FLGY ANSRAYRVYNQ SK VMESINVIIDDL     + P R      L       R+ +   S                                  DL
Subjt:  FLGYSANSRAYRVYNQSSKTVMESINVIIDDLG----KEPNRN-----LDDEDDKLRVQLILQSG---------------------------------DL

Query:  IPPTHITKNNPSSFIIGDIHSEIITRKKERKDYAKMVSNVCYTSSLEPTTISAALSDEHWILAMQEELLQFKRNQVWELMPKPPYANIIGTKWIFKNKMD
        IPPTH  KN+PSSFII DIHS IITRKKERKDYAKMV+NVCYTS LEPTT+SAALSDEHWIL +QEELLQF+RNQVWEL+PKPPYANIIGTKWIFKNK D
Subjt:  IPPTHITKNNPSSFIIGDIHSEIITRKKERKDYAKMVSNVCYTSSLEPTTISAALSDEHWILAMQEELLQFKRNQVWELMPKPPYANIIGTKWIFKNKMD

Query:  EEGRVIR----------------------------KTIRLLLSYACFRRFKLFQMDVKSVFLNGYLFEEVYVAQPKGFVDPVHQDHVYKLRKALYGLKQA
        EEGRVIR                            + IRLLLSYACF RFKLFQMDVKS FLNGYL EEVYVAQPKGFVDPVH+DHVYKLRKALY LKQA
Subjt:  EEGRVIR----------------------------KTIRLLLSYACFRRFKLFQMDVKSVFLNGYLFEEVYVAQPKGFVDPVHQDHVYKLRKALYGLKQA

Query:  PRAWYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIVQIYVDDILFGGTS
        PRAWYERLSTYLLQQGYQRGSADQTMFIYRQGT+FLIVQIYVD I+FG TS
Subjt:  PRAWYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIVQIYVDDILFGGTS

TYK07190.1 gag-pol polyprotein [Cucumis melo var. makuwa]0.0e+0082.26Show/hide
Query:  MGEDETITEFNVPVLDIANESDALGEKMYDSKLVRKVLRSLPFKFNMKVTAIKEANDLSKMKLDELFGQSRISDTSSSGHYRKKEHERGKGTEASKSDKF
        MGEDETITEFNVPVLDIANESDALGEKMYDSKLVRKVLRSLPFKFNMKVTAIKEANDLSKMKLDELFG S  +D   +GHYRKKEHERGKGTEASKSDKF
Subjt:  MGEDETITEFNVPVLDIANESDALGEKMYDSKLVRKVLRSLPFKFNMKVTAIKEANDLSKMKLDELFGQSRISDTSSSGHYRKKEHERGKGTEASKSDKF

Query:  GKGIRCHECEGFGHIQTECATYLKRKKKGMVATLSDEEDYSESDDEDLGMALISICTMNDEENVQTHDQPKSNNSTEDAEDRKKTKDQEVILQQQERIQD
        GKGIRCHECEGFGHIQTECATYLKRKKKGMVATLSDEEDYSESDDEDLGMALISICTMNDEENVQTHDQPKSNNSTEDAEDRKKTKDQEVILQQQERIQD
Subjt:  GKGIRCHECEGFGHIQTECATYLKRKKKGMVATLSDEEDYSESDDEDLGMALISICTMNDEENVQTHDQPKSNNSTEDAEDRKKTKDQEVILQQQERIQD

Query:  LVEENQSFLSSIVTLKEELAETKHQFEELLKFARMPTNGTSKLDDILDQGRRADDKRGLGFTERDTPGRGTEISRMSMKSLNKRTRRICYFCGWYFDSGC
        LVEENQSFLSSIVTLKEELAETKHQFEELLKFARMPTNGTSKLDDILDQGRRADDKRGLGFTERDTP       ++  +S N+ T +  +          
Subjt:  LVEENQSFLSSIVTLKEELAETKHQFEELLKFARMPTNGTSKLDDILDQGRRADDKRGLGFTERDTPGRGTEISRMSMKSLNKRTRRICYFCGWYFDSGC

Query:  CRHMTGNADFFSDLIECKVGLVVFEDGGKGKIIGKGTINRLGLPFLLNVRLLQGLAANLISISQLCDKAIKSVSIKIDDAKVTLCNLSKVEEAGLWHKRL
         R + GNADFFSDLIECKVGLVVFEDGGKGKIIGKGTINRLGLPFLLNVRLLQGLAANLISISQLCDKAIKSVSIKIDDAKVTLCNLSKVEEAGLWHKRL
Subjt:  CRHMTGNADFFSDLIECKVGLVVFEDGGKGKIIGKGTINRLGLPFLLNVRLLQGLAANLISISQLCDKAIKSVSIKIDDAKVTLCNLSKVEEAGLWHKRL

Query:  GQLSGSTISKVTKADAIIDLPPLSFSSLERCSECPVGKQVKSVHKPVNIASTSHILELLHIDLMGPMQTKSLGRKRCVVVCVDDFSRYTWIKHVKPYSLN
        GQLSGSTISKVTKADAIIDLPPLSFSSLERCSECPVGKQVKSVHKP                                             +HVKPYSLN
Subjt:  GQLSGSTISKVTKADAIIDLPPLSFSSLERCSECPVGKQVKSVHKPVNIASTSHILELLHIDLMGPMQTKSLGRKRCVVVCVDDFSRYTWIKHVKPYSLN

Query:  SKEKNTGIGRIQTDYGREFENQHFAEFYDNEGIFHEFSAPLKPQQNVVVERRNRTLQEMARVMIHAKHLPIQFWVEALNTACHIHNRVILRSGTTTTSYE
        SKEKNTGIGRIQTDYGREFENQHFAEFYDNEGIFHEFSAPLKPQQNVVVERRNRTLQEMARVMIHAKHLPIQFWVEALNTACHIHNRVILRSGTTTTSYE
Subjt:  SKEKNTGIGRIQTDYGREFENQHFAEFYDNEGIFHEFSAPLKPQQNVVVERRNRTLQEMARVMIHAKHLPIQFWVEALNTACHIHNRVILRSGTTTTSYE

Query:  LWKGRKPNVKYFHIFGSTCFILSDRDHRRNWDSKSDHGIFLGYSANSRAYRVYNQSSKTVMESINVIIDDLGKEPNRNLDDEDDKLRVQLILQSGDLIPP
        LWKGRKPNVKYFHIFGSTCFILSDRDHRRNWDSKSDHGIFLGYSANSRAYRVYNQSSKTVMESINVIIDDLG+  +    +E   L   L     D+   
Subjt:  LWKGRKPNVKYFHIFGSTCFILSDRDHRRNWDSKSDHGIFLGYSANSRAYRVYNQSSKTVMESINVIIDDLGKEPNRNLDDEDDKLRVQLILQSGDLIPP

Query:  T----HITKNNPSSFIIGDIHS-------------EIITRKKERKDYAKMVSNVCYTSSLEPTTISAALSDEHWILAMQEELLQFKRNQVWELMPKPPYA
        +    H   +   + +    H+             EIITRKKERKDYAKMVSNVCYTSSLEPTTISAALSDEHWILAMQEELLQFKRNQVWELMPKPPYA
Subjt:  T----HITKNNPSSFIIGDIHS-------------EIITRKKERKDYAKMVSNVCYTSSLEPTTISAALSDEHWILAMQEELLQFKRNQVWELMPKPPYA

Query:  NIIGTKWIFKNKMDEEGRVIRKT
        NIIGTKWIFKNKMDEEGRVIRK+
Subjt:  NIIGTKWIFKNKMDEEGRVIRKT

TrEMBL top hitse value%identityAlignment
A0A5A7V046 Gag-pol polyprotein0.0e+0065.71Show/hide
Query:  MDSIREGNSTSRPPLLDGENYGYWKSQMEAFLMSLNMRSWRAVISGWEYPTEKDEACQTVRKSELKWTKDEDDAPVGNSRTLNALFNVVNPNIFKLINTC
        MD IRE NSTSRP LLDG NYGYWKS+M+AFLMSL+MR                               +EDDA +GNSR LNAL NVV+PNIFKLINTC
Subjt:  MDSIREGNSTSRPPLLDGENYGYWKSQMEAFLMSLNMRSWRAVISGWEYPTEKDEACQTVRKSELKWTKDEDDAPVGNSRTLNALFNVVNPNIFKLINTC

Query:  KSAKAAWDILE----------------ILTSQFEALQMGEDETITEFNVPVLDIANESDALGEKMYDSKLVRKVLRSLPFKFNMKVTAIKEANDLSKMKL
        KSAKA WDILE                ILTS+FEALQMGE ETI EFNV VLDIANESDALGEKM DSKLVRKVLRSLP KFNMKVTAI+EANDLSKMKL
Subjt:  KSAKAAWDILE----------------ILTSQFEALQMGEDETITEFNVPVLDIANESDALGEKMYDSKLVRKVLRSLPFKFNMKVTAIKEANDLSKMKL

Query:  DELFG-------------------------------------------------QSRISDTSSSGHYRKKEHERGKGTEASKSDKFGKGIRCHECEGFGH
        DELFG                                                 Q RISDTSSSGH RKKEHERGK  +ASKSDK+GKGIRCHECEGFGH
Subjt:  DELFG-------------------------------------------------QSRISDTSSSGHYRKKEHERGKGTEASKSDKFGKGIRCHECEGFGH

Query:  IQTECATYLKRKKKGMVATLSDEEDYSESDDEDLGMALISICTMNDEENVQTHDQPKSNNSTEDAEDRKKTKDQEVILQQQERIQDLVEENQSFLSSIVT
        IQ ECATYLKRKKKGMVAT SDEEDYSESDDEDLGMALIS+CTMNDEENVQTHDQ +S N T D  +R K +DQEVILQQQERIQDLVEENQSFLSSIVT
Subjt:  IQTECATYLKRKKKGMVATLSDEEDYSESDDEDLGMALISICTMNDEENVQTHDQPKSNNSTEDAEDRKKTKDQEVILQQQERIQDLVEENQSFLSSIVT

Query:  LKEELAETKHQFEELLKFARMPTNGTSKLDDILDQGRRADDKRGLGFTERDTP---------------------GRGTEISRMSMKSLNKRTRRIC----
        LKEELA+TKHQFEELLKFARM T GTSKLDDILDQG RADDKRGL F ERDTP                     G+GTEI+ M  K L       C    
Subjt:  LKEELAETKHQFEELLKFARMPTNGTSKLDDILDQGRRADDKRGLGFTERDTP---------------------GRGTEISRMSMKSLNKRTRRIC----

Query:  --------------YFCGWYFDSGCCRHMTGNADFFSDLIECKVGLVVFEDGGKGKIIGKGTINRLGLPFLLNVRLLQGLAANLISISQLCDKAIKSVSI
                          WYFDSGC RHMTGNADFFS+L ECKVG VVF DGGKGKIIGKGTIN  GLPFLL+VRL+QGLAANLISISQLCD+  + VS 
Subjt:  --------------YFCGWYFDSGCCRHMTGNADFFSDLIECKVGLVVFEDGGKGKIIGKGTINRLGLPFLLNVRLLQGLAANLISISQLCDKAIKSVSI

Query:  KID-------------------------DAKVTLCNLSKVEEAGLWHKRLGQLSGSTISKVTKADAIIDLPPLSFSSLERCSECPVGKQVKSVHKPVNIA
          D                         DA+VTLCNLSKVEEA LWHKRLG LSG+TISKVTK DAII LPPL+F SLE CSEC  GKQVKSVHKPVNI+
Subjt:  KID-------------------------DAKVTLCNLSKVEEAGLWHKRLGQLSGSTISKVTKADAIIDLPPLSFSSLERCSECPVGKQVKSVHKPVNIA

Query:  STSHILELLHIDLMGPMQTKSLGRKRCVVVCVDDFSRYTWIKHV--KPYSLNS---------KEKNTGIGRIQTDYGREFENQHFAEFYDNEGIFHEFSA
        STSHILELLHIDLMGPMQT+SLGRK   VVCVDDFSRYTWIK +  KP +  +         +EKNTGIG+IQTD+G EFENQHFAEF DNEGIFHEFSA
Subjt:  STSHILELLHIDLMGPMQTKSLGRKRCVVVCVDDFSRYTWIKHV--KPYSLNS---------KEKNTGIGRIQTDYGREFENQHFAEFYDNEGIFHEFSA

Query:  PLKPQQNVVVERRNRTLQEMARVMIHAKHLPIQFWVEALNTACHIHNRVILRSGTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRNWDSKSDHGI
        PL  QQN V                           EALNTACHIHNRVILR GTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRR WDSKSD GI
Subjt:  PLKPQQNVVVERRNRTLQEMARVMIHAKHLPIQFWVEALNTACHIHNRVILRSGTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRNWDSKSDHGI

Query:  FLGYSANSRAYRVYNQSSKTVMESINVIIDDLG----KEPNRN-----LDDEDDKLRVQLILQSG---------------------------------DL
        FLGY ANSRAYRVYNQ SK VMESINVIIDDL     + P R      L       R+ +   S                                  DL
Subjt:  FLGYSANSRAYRVYNQSSKTVMESINVIIDDLG----KEPNRN-----LDDEDDKLRVQLILQSG---------------------------------DL

Query:  IPPTHITKNNPSSFIIGDIHSEIITRKKERKDYAKMVSNVCYTSSLEPTTISAALSDEHWILAMQEELLQFKRNQVWELMPKPPYANIIGTKWIFKNKMD
        IPPTH  KN+PSSFII DIHS IITRKKERKDYAKMV+NVCYTS LEPTT+SAALSDEHWIL +QEELLQF+RNQVWEL+PKPPYANIIGTKWIFKNK D
Subjt:  IPPTHITKNNPSSFIIGDIHSEIITRKKERKDYAKMVSNVCYTSSLEPTTISAALSDEHWILAMQEELLQFKRNQVWELMPKPPYANIIGTKWIFKNKMD

Query:  EEGRVIR----------------------------KTIRLLLSYACFRRFKLFQMDVKSVFLNGYLFEEVYVAQPKGFVDPVHQDHVYKLRKALYGLKQA
        EEGRVIR                            + IRLLLSYACF RFKLFQMDVKS FLNGYL EEVYVAQPKGFVDPVH+DHVYKLRKALY LKQA
Subjt:  EEGRVIR----------------------------KTIRLLLSYACFRRFKLFQMDVKSVFLNGYLFEEVYVAQPKGFVDPVHQDHVYKLRKALYGLKQA

Query:  PRAWYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIVQIYVDDILFGGTS
        PRAWYERLSTYLLQQGYQRGSADQTMFIYRQGT+FLIVQIYVD I+FG TS
Subjt:  PRAWYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIVQIYVDDILFGGTS

A0A5D3C778 Gag-pol polyprotein0.0e+0082.26Show/hide
Query:  MGEDETITEFNVPVLDIANESDALGEKMYDSKLVRKVLRSLPFKFNMKVTAIKEANDLSKMKLDELFGQSRISDTSSSGHYRKKEHERGKGTEASKSDKF
        MGEDETITEFNVPVLDIANESDALGEKMYDSKLVRKVLRSLPFKFNMKVTAIKEANDLSKMKLDELFG S  +D   +GHYRKKEHERGKGTEASKSDKF
Subjt:  MGEDETITEFNVPVLDIANESDALGEKMYDSKLVRKVLRSLPFKFNMKVTAIKEANDLSKMKLDELFGQSRISDTSSSGHYRKKEHERGKGTEASKSDKF

Query:  GKGIRCHECEGFGHIQTECATYLKRKKKGMVATLSDEEDYSESDDEDLGMALISICTMNDEENVQTHDQPKSNNSTEDAEDRKKTKDQEVILQQQERIQD
        GKGIRCHECEGFGHIQTECATYLKRKKKGMVATLSDEEDYSESDDEDLGMALISICTMNDEENVQTHDQPKSNNSTEDAEDRKKTKDQEVILQQQERIQD
Subjt:  GKGIRCHECEGFGHIQTECATYLKRKKKGMVATLSDEEDYSESDDEDLGMALISICTMNDEENVQTHDQPKSNNSTEDAEDRKKTKDQEVILQQQERIQD

Query:  LVEENQSFLSSIVTLKEELAETKHQFEELLKFARMPTNGTSKLDDILDQGRRADDKRGLGFTERDTPGRGTEISRMSMKSLNKRTRRICYFCGWYFDSGC
        LVEENQSFLSSIVTLKEELAETKHQFEELLKFARMPTNGTSKLDDILDQGRRADDKRGLGFTERDTP       ++  +S N+ T +  +          
Subjt:  LVEENQSFLSSIVTLKEELAETKHQFEELLKFARMPTNGTSKLDDILDQGRRADDKRGLGFTERDTPGRGTEISRMSMKSLNKRTRRICYFCGWYFDSGC

Query:  CRHMTGNADFFSDLIECKVGLVVFEDGGKGKIIGKGTINRLGLPFLLNVRLLQGLAANLISISQLCDKAIKSVSIKIDDAKVTLCNLSKVEEAGLWHKRL
         R + GNADFFSDLIECKVGLVVFEDGGKGKIIGKGTINRLGLPFLLNVRLLQGLAANLISISQLCDKAIKSVSIKIDDAKVTLCNLSKVEEAGLWHKRL
Subjt:  CRHMTGNADFFSDLIECKVGLVVFEDGGKGKIIGKGTINRLGLPFLLNVRLLQGLAANLISISQLCDKAIKSVSIKIDDAKVTLCNLSKVEEAGLWHKRL

Query:  GQLSGSTISKVTKADAIIDLPPLSFSSLERCSECPVGKQVKSVHKPVNIASTSHILELLHIDLMGPMQTKSLGRKRCVVVCVDDFSRYTWIKHVKPYSLN
        GQLSGSTISKVTKADAIIDLPPLSFSSLERCSECPVGKQVKSVHKP                                             +HVKPYSLN
Subjt:  GQLSGSTISKVTKADAIIDLPPLSFSSLERCSECPVGKQVKSVHKPVNIASTSHILELLHIDLMGPMQTKSLGRKRCVVVCVDDFSRYTWIKHVKPYSLN

Query:  SKEKNTGIGRIQTDYGREFENQHFAEFYDNEGIFHEFSAPLKPQQNVVVERRNRTLQEMARVMIHAKHLPIQFWVEALNTACHIHNRVILRSGTTTTSYE
        SKEKNTGIGRIQTDYGREFENQHFAEFYDNEGIFHEFSAPLKPQQNVVVERRNRTLQEMARVMIHAKHLPIQFWVEALNTACHIHNRVILRSGTTTTSYE
Subjt:  SKEKNTGIGRIQTDYGREFENQHFAEFYDNEGIFHEFSAPLKPQQNVVVERRNRTLQEMARVMIHAKHLPIQFWVEALNTACHIHNRVILRSGTTTTSYE

Query:  LWKGRKPNVKYFHIFGSTCFILSDRDHRRNWDSKSDHGIFLGYSANSRAYRVYNQSSKTVMESINVIIDDLGKEPNRNLDDEDDKLRVQLILQSGDLIPP
        LWKGRKPNVKYFHIFGSTCFILSDRDHRRNWDSKSDHGIFLGYSANSRAYRVYNQSSKTVMESINVIIDDLG+  +    +E   L   L     D+   
Subjt:  LWKGRKPNVKYFHIFGSTCFILSDRDHRRNWDSKSDHGIFLGYSANSRAYRVYNQSSKTVMESINVIIDDLGKEPNRNLDDEDDKLRVQLILQSGDLIPP

Query:  T----HITKNNPSSFIIGDIHS-------------EIITRKKERKDYAKMVSNVCYTSSLEPTTISAALSDEHWILAMQEELLQFKRNQVWELMPKPPYA
        +    H   +   + +    H+             EIITRKKERKDYAKMVSNVCYTSSLEPTTISAALSDEHWILAMQEELLQFKRNQVWELMPKPPYA
Subjt:  T----HITKNNPSSFIIGDIHS-------------EIITRKKERKDYAKMVSNVCYTSSLEPTTISAALSDEHWILAMQEELLQFKRNQVWELMPKPPYA

Query:  NIIGTKWIFKNKMDEEGRVIRKT
        NIIGTKWIFKNKMDEEGRVIRK+
Subjt:  NIIGTKWIFKNKMDEEGRVIRKT

Q84VH6 Gag-pol polyprotein0.0e+0044.42Show/hide
Query:  MDSIREGNSTSRPPLLDGENYGYWKSQMEAFLMSLNMRSWRAVISGWEYPTEKDEACQTVR--KSELKWTKDEDDAPVGNSRTLNALFNVVNPNIFKLIN
        M+  +EG   +RPP+LDG NY YWK++M AFL SL+ R+W+AVI GWE+P   D   +     K E  WTK+ED+  +GNS+ LNALFN V+ NIF+LIN
Subjt:  MDSIREGNSTSRPPLLDGENYGYWKSQMEAFLMSLNMRSWRAVISGWEYPTEKDEACQTVR--KSELKWTKDEDDAPVGNSRTLNALFNVVNPNIFKLIN

Query:  TCKSAKAAWDI---------------LEILTSQFEALQMGEDETITEFNVPVLDIANESDALGEKMYDSKLVRKVLRSLPFKFNMKVTAIKEANDLSKMK
        TC  AK AW+I               L++L ++FE L+M E+E I +F++ +L+IAN   ALGE+M D KLVRK+LRSLP +F+MKVTAI+EA D+  M+
Subjt:  TCKSAKAAWDI---------------LEILTSQFEALQMGEDETITEFNVPVLDIANESDALGEKMYDSKLVRKVLRSLPFKFNMKVTAIKEANDLSKMK

Query:  LDELFGQSRI--------------------SDTSSSGHY----------------------------RKKEHERG------KGTE-ASKSDK---FGKGI
        +DEL G  +                     +D      Y                            R+K H R       KG+E   KSD+     KGI
Subjt:  LDELFGQSRI--------------------SDTSSSGHY----------------------------RKKEHERG------KGTE-ASKSDK---FGKGI

Query:  RCHECEGFGHIQTECATYLKRKKKGMVATLSDEEDYSESDDEDLGMALISICTMNDEENVQTHDQPKSNNSTEDAEDRKKTKDQEVILQQQER----IQD
        +C  CEG+GHI+ EC T+LK+++KG+    SD+ +  +  D D  +  ++    + E++  T  +   +        R+     E ILQQ+ +    I +
Subjt:  RCHECEGFGHIQTECATYLKRKKKGMVATLSDEEDYSESDDEDLGMALISICTMNDEENVQTHDQPKSNNSTEDAEDRKKTKDQEVILQQQER----IQD

Query:  LVEENQSFLSSIVTLKEELAETKHQFEELLKFARMPTNGTSKLDDILDQGRRADDKRGLGFTERD---------TPGRGTEISRMS---------MKSLN
        L  E ++    I  LK E+     + E + K  +M   G+  LD++L  G++  ++RGLGF  +           P + +  + MS          +  +
Subjt:  LVEENQSFLSSIVTLKEELAETKHQFEELLKFARMPTNGTSKLDDILDQGRRADDKRGLGFTERD---------TPGRGTEISRMS---------MKSLN

Query:  KRTRRICYFCG-----------------------------------------------------WYFDSGCCRHMTGNADFFSDLIECKVGLVVFEDGGK
        KR +  C++CG                                                     WY DSGC RHMTG  +F  ++  C    V F DG K
Subjt:  KRTRRICYFCG-----------------------------------------------------WYFDSGCCRHMTGNADFFSDLIECKVGLVVFEDGGK

Query:  GKIIGKGTINRLGLPFLLNVRLLQGLAANLISISQLCDKAI-----KSVSIKIDDAKVTL-----------------------CNLSKVEEAGLWHKRLG
        GKI G G +   GLP L  V L++GL  NLISISQLCD+       KS  +  ++    L                       C  SK +E  +WH+R G
Subjt:  GKIIGKGTINRLGLPFLLNVRLLQGLAANLISISQLCDKAI-----KSVSIKIDDAKVTL-----------------------CNLSKVEEAGLWHKRLG

Query:  QLSGSTISKVTKADAIIDLPPLSFSSLERCSECPVGKQVKSVHKPVNIASTSHILELLHIDLMGPMQTKSLGRKRCVVVCVDDFSRYTWIKHV-------
         L    + K+    A+  +P L       C EC +GKQVK  H+ +   +TS +LELLH+DLMGPMQ +SLG KR   V VDDFSR+TW+  +       
Subjt:  QLSGSTISKVTKADAIIDLPPLSFSSLERCSECPVGKQVKSVHKPVNIASTSHILELLHIDLMGPMQTKSLGRKRCVVVCVDDFSRYTWIKHV-------

Query:  ---KPYSLN-SKEKNTGIGRIQTDYGREFENQHFAEFYDNEGIFHEFSAPLKPQQNVVVERRNRTLQEMARVMIHAKHLPIQFWVEALNTACHIHNRVIL
           K  SL   +EK+  I RI++D+GREFEN  F EF  +EGI HEFSA + PQQN +VER+NRTLQE ARVM+HAK LP   W EA+NTAC+IHNRV L
Subjt:  ---KPYSLN-SKEKNTGIGRIQTDYGREFENQHFAEFYDNEGIFHEFSAPLKPQQNVVVERRNRTLQEMARVMIHAKHLPIQFWVEALNTACHIHNRVIL

Query:  RSGTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRNWDSKSDHGIFLGYSANSRAYRVYNQSSKTVMESINVIIDDLGKEPNRNLDDE--------
        R GT TT YE+WKGRKP VK+FHIFGS C+IL+DR+ RR  D KSD GIFLGYS NSRAYRV+N  ++TVMESINV++DDL     ++++++        
Subjt:  RSGTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRNWDSKSDHGIFLGYSANSRAYRVYNQSSKTVMESINVIIDDLGKEPNRNLDDE--------

Query:  ------------DDKLRVQLILQSGDLIPPTHITKNNPSSFIIGDIHSEIITRKKERKDYAKMVSNVCYTSSLEPTTISAALSDEHWILAMQEELLQFKR
                     D    +  +   D  P   I K +P   IIGD +  + TR +E     ++VSN C+ S +EP  +  AL+DE WI AMQEEL QFKR
Subjt:  ------------DDKLRVQLILQSGDLIPPTHITKNNPSSFIIGDIHSEIITRKKERKDYAKMVSNVCYTSSLEPTTISAALSDEHWILAMQEELLQFKR

Query:  NQVWELMPKPPYANIIGTKWIFKNKMDEEGRVIR----------------------------KTIRLLLSYACFRRFKLFQMDVKSVFLNGYLFEEVYVA
        N+VWEL+P+P   N+IGTKWIFKNK +EEG + R                            ++IRLLL  AC  +FKL+QMDVKS FLNGYL EE YV 
Subjt:  NQVWELMPKPPYANIIGTKWIFKNKMDEEGRVIR----------------------------KTIRLLLSYACFRRFKLFQMDVKSVFLNGYLFEEVYVA

Query:  QPKGFVDPVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIVQIYVDDILFGGTSS-----------GEFEMSMVGE
        QPKGFVDP H DHVY+L+KALYGLKQAPRAWYERL+ +L QQGY++G  D+T+F+ +   + +I QIYVDDI+FGG S+            EFEMS+VGE
Subjt:  QPKGFVDPVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIVQIYVDDILFGGTSS-----------GEFEMSMVGE

Query:  LTFFLRFQIKQENIGIFFSQEKYAKNLISKFGMDKARSKRTPAATYLKMTKDTNGERVDTNLYRSIIGSLLYLTASRPDIAFAVGVCARYQADPRTSHLH
        LT+FL  Q+KQ    IF SQ KYAKN++ KFGM+ A  KRTPA T+LK++KD  G  VD +LYRS+IGSLLYLTASRPDI +AVGVCARYQA+P+ SHL+
Subjt:  LTFFLRFQIKQENIGIFFSQEKYAKNLISKFGMDKARSKRTPAATYLKMTKDTNGERVDTNLYRSIIGSLLYLTASRPDIAFAVGVCARYQADPRTSHLH

Query:  CAKRILKYISGTFNYGIWYTYDTTGTLVGNYDADWAGCTDDRKSTSGGCFFLGNNVTACFSKKQN---------------SYYSQLLWMKQMLDEYRITQ
          KRILKY++GT +YGI Y + +   LVG  DADWAG  DDRKSTSGGCF+LGNN+ + FSKKQN               S  SQL+WMKQML EY + Q
Subjt:  CAKRILKYISGTFNYGIWYTYDTTGTLVGNYDADWAGCTDDRKSTSGGCFFLGNNVTACFSKKQN---------------SYYSQLLWMKQMLDEYRITQ

Query:  SSMILYCDYLSAISISKNPVQHSLTKHINIRHHFTRELVEANIIRLEHVQSAFQLADIFTKPLDFATFEGLRASVGVC
          M LYCD +SAI+ISKNPVQHS TKHI+IRHH+ RELV+  +I LEHV +  Q+ADIFTK LD   FE LR  +G+C
Subjt:  SSMILYCDYLSAISISKNPVQHSLTKHINIRHHFTRELVEANIIRLEHVQSAFQLADIFTKPLDFATFEGLRASVGVC

Q84VH8 Gag-pol polyprotein0.0e+0044.68Show/hide
Query:  MDSIREGNSTSRPPLLDGENYGYWKSQMEAFLMSLNMRSWRAVISGWEYPTEKDEACQTV--RKSELKWTKDEDDAPVGNSRTLNALFNVVNPNIFKLIN
        M+  +EG   +RPP+LDG NY YWK++M AFL SL+ R+W+AVI GWE+P   D   +     K E  WTK+ED+  +GNS+ LNALFN V+ NIF+LIN
Subjt:  MDSIREGNSTSRPPLLDGENYGYWKSQMEAFLMSLNMRSWRAVISGWEYPTEKDEACQTV--RKSELKWTKDEDDAPVGNSRTLNALFNVVNPNIFKLIN

Query:  TCKSAKAAWDILEI---------------LTSQFEALQMGEDETITEFNVPVLDIANESDALGEKMYDSKLVRKVLRSLPFKFNMKVTAIKEANDLSKMK
        TC  AK AW+IL+I               L ++FE L+M E+E I +F++ +L+IAN   ALGE++ D KLVRK+LRSLP +F+MKVTAI+EA D+  M+
Subjt:  TCKSAKAAWDILEI---------------LTSQFEALQMGEDETITEFNVPVLDIANESDALGEKMYDSKLVRKVLRSLPFKFNMKVTAIKEANDLSKMK

Query:  LDELFGQSRI--------------------SDTSSSGHY----------------------------RKKEHERGKGTEASKSDKF----------GKGI
        +DEL G  +                     +D      Y                            R+K H +    +  K  K+           KGI
Subjt:  LDELFGQSRI--------------------SDTSSSGHY----------------------------RKKEHERGKGTEASKSDKF----------GKGI

Query:  RCHECEGFGHIQTECATYLKRKKKGMVATLSDEEDYSESDDEDLGMALISICTMNDEENVQTHDQPKSNNSTED--AEDRKKTKDQEVILQQQER----I
        +CH CEG+GHI  EC T+LK+ +KG+    SD E   ESD +    AL  I      E  +      S  + ++  A  RK     E ILQQ+ +    I
Subjt:  RCHECEGFGHIQTECATYLKRKKKGMVATLSDEEDYSESDDEDLGMALISICTMNDEENVQTHDQPKSNNSTED--AEDRKKTKDQEVILQQQER----I

Query:  QDLVEENQSFLSSIVTLKEELAETKHQFEELLKFARMPTNGTSKLDDILDQGRRADDKRGLGFTER---------DTPGR---GTEISRM------SMKS
         DL  E ++    I  LK E+     + E + K  +M   G+  LD++L  G+ A ++RGLGF  +           P +   GT +S+       + + 
Subjt:  QDLVEENQSFLSSIVTLKEELAETKHQFEELLKFARMPTNGTSKLDDILDQGRRADDKRGLGFTER---------DTPGR---GTEISRM------SMKS

Query:  LNKRTRRICYFCG-----------------------------------------------------WYFDSGCCRHMTGNADFFSDLIECKVGLVVFEDG
         +KR +  C++CG                                                     WY DSGC RHMTG  +F  ++  C    V F DG
Subjt:  LNKRTRRICYFCG-----------------------------------------------------WYFDSGCCRHMTGNADFFSDLIECKVGLVVFEDG

Query:  GKGKIIGKGTINRLGLPFLLNVRLLQGLAANLISISQLCDKAI-----KSVSIKIDDAKVTL-----------------------CNLSKVEEAGLWHKR
         KGKIIG G +   GLP L  V L++GL ANLISISQLCD+       KS  +  ++    L                       C  SK +E  +WH+R
Subjt:  GKGKIIGKGTINRLGLPFLLNVRLLQGLAANLISISQLCDKAI-----KSVSIKIDDAKVTL-----------------------CNLSKVEEAGLWHKR

Query:  LGQLSGSTISKVTKADAIIDLPPLSFSSLERCSECPVGKQVKSVHKPVNIASTSHILELLHIDLMGPMQTKSLGRKRCVVVCVDDFSRYTWIKHV-----
         G L    + K+    A+  +P L       C EC +GKQVK  H+ +   +TS +LELLH+DLMGPMQ +SLG KR   V VDDFSR+TW+  +     
Subjt:  LGQLSGSTISKVTKADAIIDLPPLSFSSLERCSECPVGKQVKSVHKPVNIASTSHILELLHIDLMGPMQTKSLGRKRCVVVCVDDFSRYTWIKHV-----

Query:  -----KPYSLN-SKEKNTGIGRIQTDYGREFENQHFAEFYDNEGIFHEFSAPLKPQQNVVVERRNRTLQEMARVMIHAKHLPIQFWVEALNTACHIHNRV
             K  SL   +EK+  I RI++D+GREFEN  F EF  +EGI HEFSA + PQQN +VER+NRTLQE ARVM+HAK LP   W EA+NTAC+IHNRV
Subjt:  -----KPYSLN-SKEKNTGIGRIQTDYGREFENQHFAEFYDNEGIFHEFSAPLKPQQNVVVERRNRTLQEMARVMIHAKHLPIQFWVEALNTACHIHNRV

Query:  ILRSGTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRNWDSKSDHGIFLGYSANSRAYRVYNQSSKTVMESINVIIDDLGKEPNRNLDDE------
         LR GT TT YE+WKGRKP+VK+FHIFGS C+IL+DR+ RR  D KSD GIFLGYS NSRAYRV+N  ++TVMESINV++DDL     ++++++      
Subjt:  ILRSGTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRNWDSKSDHGIFLGYSANSRAYRVYNQSSKTVMESINVIIDDLGKEPNRNLDDE------

Query:  --------------DDKLRVQLILQSGDLIPPTHITKNNPSSFIIGDIHSEIITRKKERKDYAKMVSNVCYTSSLEPTTISAALSDEHWILAMQEELLQF
                       D    +  +   D    T I K +P   IIGD +  + TR +E     ++VSN C+ S +EP  +  AL+DE WI AMQEEL QF
Subjt:  --------------DDKLRVQLILQSGDLIPPTHITKNNPSSFIIGDIHSEIITRKKERKDYAKMVSNVCYTSSLEPTTISAALSDEHWILAMQEELLQF

Query:  KRNQVWELMPKPPYANIIGTKWIFKNKMDEEGRVIR----------------------------KTIRLLLSYACFRRFKLFQMDVKSVFLNGYLFEEVY
        KRN+VWEL+P+P   N+IGTKWIFKNK +EEG + R                            ++IRLLL  AC  +FKL+QMDVKS FLNGYL EEVY
Subjt:  KRNQVWELMPKPPYANIIGTKWIFKNKMDEEGRVIR----------------------------KTIRLLLSYACFRRFKLFQMDVKSVFLNGYLFEEVY

Query:  VAQPKGFVDPVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIVQIYVDDILFGGTSS-----------GEFEMSMV
        V QPKGF DP H DHVY+L+KALYGLKQAPRAWYERL+ +L QQGY++G  D+T+F+ +   + +I QIYVDDI+FGG S+            EFEMS+V
Subjt:  VAQPKGFVDPVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIVQIYVDDILFGGTSS-----------GEFEMSMV

Query:  GELTFFLRFQIKQENIGIFFSQEKYAKNLISKFGMDKARSKRTPAATYLKMTKDTNGERVDTNLYRSIIGSLLYLTASRPDIAFAVGVCARYQADPRTSH
        GELT+FL  Q+KQ    IF SQ +YAKN++ KFGM+ A  KRTPA T+LK++KD  G  VD +LYRS+IGSLLYLTASRPDI +AVGVCARYQA+P+ SH
Subjt:  GELTFFLRFQIKQENIGIFFSQEKYAKNLISKFGMDKARSKRTPAATYLKMTKDTNGERVDTNLYRSIIGSLLYLTASRPDIAFAVGVCARYQADPRTSH

Query:  LHCAKRILKYISGTFNYGIWYTYDTTGTLVGNYDADWAGCTDDRKSTSGGCFFLGNNVTACFSKKQN---------------SYYSQLLWMKQMLDEYRI
        L   KRILKY++GT +YGI Y + +   LVG  DADWAG  DDRKSTSGGCF+LGNN+ + FSKKQN               S  SQL+WMKQML EY +
Subjt:  LHCAKRILKYISGTFNYGIWYTYDTTGTLVGNYDADWAGCTDDRKSTSGGCFFLGNNVTACFSKKQN---------------SYYSQLLWMKQMLDEYRI

Query:  TQSSMILYCDYLSAISISKNPVQHSLTKHINIRHHFTRELVEANIIRLEHVQSAFQLADIFTKPLDFATFEGLRASVGVC
         Q  M LYCD +SAI+ISKNPVQHS TKHI+IRHH+ R+LV+  +I L+HV +  Q+ADIFTK LD   FE LR  +G+C
Subjt:  TQSSMILYCDYLSAISISKNPVQHSLTKHINIRHHFTRELVEANIIRLEHVQSAFQLADIFTKPLDFATFEGLRASVGVC

Q84VI4 Gag-pol polyprotein0.0e+0044.62Show/hide
Query:  MDSIREGNSTSRPPLLDGENYGYWKSQMEAFLMSLNMRSWRAVISGWEYPTEKDEACQTV--RKSELKWTKDEDDAPVGNSRTLNALFNVVNPNIFKLIN
        M+  +EG   +RPP+LDG NY YWK++M AFL SL+ R+W+AVI GWE+P   D   +     K E  WTK+ED+  +GNS+ LNALFN V+ NIF+LIN
Subjt:  MDSIREGNSTSRPPLLDGENYGYWKSQMEAFLMSLNMRSWRAVISGWEYPTEKDEACQTV--RKSELKWTKDEDDAPVGNSRTLNALFNVVNPNIFKLIN

Query:  TCKSAKAAWDILEI---------------LTSQFEALQMGEDETITEFNVPVLDIANESDALGEKMYDSKLVRKVLRSLPFKFNMKVTAIKEANDLSKMK
        TC  AK AW+IL+I               L ++FE L+M E+E I +F++ +L+IAN   ALGE++ D KLVRK+LRSLP +F+MKVTAI+EA D+  M+
Subjt:  TCKSAKAAWDILEI---------------LTSQFEALQMGEDETITEFNVPVLDIANESDALGEKMYDSKLVRKVLRSLPFKFNMKVTAIKEANDLSKMK

Query:  LDELFGQSRI--------------------SDTSSSGHY----------------------------RKKEHERGKGTEASKSDKF----------GKGI
        +DEL G  +                     +D      Y                            R+K H +    +  K  K+           KGI
Subjt:  LDELFGQSRI--------------------SDTSSSGHY----------------------------RKKEHERGKGTEASKSDKF----------GKGI

Query:  RCHECEGFGHIQTECATYLKRKKKGMVATLSDEEDYSESDDEDLGMALISIC----TMNDEENVQTHDQPKSNNSTEDAEDRKKTKDQEVILQQQER---
        +CH CEG+GHI  EC T+LK+ +KG+    SD E   ESD +    AL  I       +D ++  T D+  ++        RK     E ILQQ+ +   
Subjt:  RCHECEGFGHIQTECATYLKRKKKGMVATLSDEEDYSESDDEDLGMALISIC----TMNDEENVQTHDQPKSNNSTEDAEDRKKTKDQEVILQQQER---

Query:  -IQDLVEENQSFLSSIVTLKEELAETKHQFEELLKFARMPTNGTSKLDDILDQGRRADDKRGLGFTERD---------TPGRGTEISRMS---------M
         I DL  E ++    I  LK E+     + E + K  +M   G+  LD++L  G+ A ++RGLGF  +           P +    + MS          
Subjt:  -IQDLVEENQSFLSSIVTLKEELAETKHQFEELLKFARMPTNGTSKLDDILDQGRRADDKRGLGFTERD---------TPGRGTEISRMS---------M

Query:  KSLNKRTRRICYFCG---------------------------------------------------WYFDSGCCRHMTGNADFFSDLIECKVGLVVFEDG
        +  +KR +  C++CG                                                   WY DSGC RHMTG  +F  ++  C    V F DG
Subjt:  KSLNKRTRRICYFCG---------------------------------------------------WYFDSGCCRHMTGNADFFSDLIECKVGLVVFEDG

Query:  GKGKIIGKGTINRLGLPFLLNVRLLQGLAANLISISQLCDKAI-----KSVSIKIDDAKVTL-----------------------CNLSKVEEAGLWHKR
         KGKIIG G +   GLP L  V L++GL ANLISISQLCD+       KS  +  ++    L                       C  SK +E  +WH+R
Subjt:  GKGKIIGKGTINRLGLPFLLNVRLLQGLAANLISISQLCDKAI-----KSVSIKIDDAKVTL-----------------------CNLSKVEEAGLWHKR

Query:  LGQLSGSTISKVTKADAIIDLPPLSFSSLERCSECPVGKQVKSVHKPVNIASTSHILELLHIDLMGPMQTKSLGRKRCVVVCVDDFSRYTWIKHV-----
         G L    + K+    A+  +P L       C EC +GKQVK  H+ +   +TS +LELLH+DLMGPMQ +SLG KR   V VDDFSR+TW+K +     
Subjt:  LGQLSGSTISKVTKADAIIDLPPLSFSSLERCSECPVGKQVKSVHKPVNIASTSHILELLHIDLMGPMQTKSLGRKRCVVVCVDDFSRYTWIKHV-----

Query:  -----KPYSLN-SKEKNTGIGRIQTDYGREFENQHFAEFYDNEGIFHEFSAPLKPQQNVVVERRNRTLQEMARVMIHAKHLPIQFWVEALNTACHIHNRV
             K  SL   +EK+  I RI++D+GREFEN    EF  +EGI HEFSA + PQQN +VER+NRTLQE ARVM+HAK LP   W EA+NTAC+IHNRV
Subjt:  -----KPYSLN-SKEKNTGIGRIQTDYGREFENQHFAEFYDNEGIFHEFSAPLKPQQNVVVERRNRTLQEMARVMIHAKHLPIQFWVEALNTACHIHNRV

Query:  ILRSGTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRNWDSKSDHGIFLGYSANSRAYRVYNQSSKTVMESINVIIDDLGKEPNRNLDDE------
         LR GT TT YE+WKGRKP+VK+FHIFGS C+IL+DR+ RR  D KSD GIFLGYS NSRAYRV+N  ++TVMESINV++DDL     ++++++      
Subjt:  ILRSGTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRNWDSKSDHGIFLGYSANSRAYRVYNQSSKTVMESINVIIDDLGKEPNRNLDDE------

Query:  --------------DDKLRVQLILQSGDLIPPTHITKNNPSSFIIGDIHSEIITRKKERKDYAKMVSNVCYTSSLEPTTISAALSDEHWILAMQEELLQF
                       D    +  +   D    T I K +P   IIGD +  + TR +E     ++VSN C+ S +EP  +  AL+DE WI AMQEEL QF
Subjt:  --------------DDKLRVQLILQSGDLIPPTHITKNNPSSFIIGDIHSEIITRKKERKDYAKMVSNVCYTSSLEPTTISAALSDEHWILAMQEELLQF

Query:  KRNQVWELMPKPPYANIIGTKWIFKNKMDEEGRVIR----------------------------KTIRLLLSYACFRRFKLFQMDVKSVFLNGYLFEEVY
        KRN+VWEL+P+P   N+IGTKWIFKNK +EEG + R                            ++IRLLL  AC  +FKL+QMDVKS FLNGYL EEVY
Subjt:  KRNQVWELMPKPPYANIIGTKWIFKNKMDEEGRVIR----------------------------KTIRLLLSYACFRRFKLFQMDVKSVFLNGYLFEEVY

Query:  VAQPKGFVDPVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIVQIYVDDILFGGTSS-----------GEFEMSMV
        V QPKGF DP H DHVY+L+KALYGLKQAPRAWYERL+ +L QQGY++G  D+T+F+ +   + +I QIYVDDI+FGG S+            EFEMS+V
Subjt:  VAQPKGFVDPVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIVQIYVDDILFGGTSS-----------GEFEMSMV

Query:  GELTFFLRFQIKQENIGIFFSQEKYAKNLISKFGMDKARSKRTPAATYLKMTKDTNGERVDTNLYRSIIGSLLYLTASRPDIAFAVGVCARYQADPRTSH
        GELT+FL  Q+KQ    IF SQ +YAKN++ KFGM+ A  KRTPA T+LK++KD  G  VD +LYRS+IGSLLYLTASRPDI +AVGVCARYQA+P+ SH
Subjt:  GELTFFLRFQIKQENIGIFFSQEKYAKNLISKFGMDKARSKRTPAATYLKMTKDTNGERVDTNLYRSIIGSLLYLTASRPDIAFAVGVCARYQADPRTSH

Query:  LHCAKRILKYISGTFNYGIWYTYDTTGTLVGNYDADWAGCTDDRKSTSGGCFFLGNNVTACFSKKQN---------------SYYSQLLWMKQMLDEYRI
        L   KRILKY++GT +YGI Y + +   LVG  DADWAG  DDRKSTSGGCF+LGNN+ + FSKKQN               S  SQL+WMKQML EY +
Subjt:  LHCAKRILKYISGTFNYGIWYTYDTTGTLVGNYDADWAGCTDDRKSTSGGCFFLGNNVTACFSKKQN---------------SYYSQLLWMKQMLDEYRI

Query:  TQSSMILYCDYLSAISISKNPVQHSLTKHINIRHHFTRELVEANIIRLEHVQSAFQLADIFTKPLDFATFEGLRASVGVC
         Q  M LYCD +SAI+ISKNPVQHS TKHI+IRHH+ R+LV+  +I L+HV +  Q+ADIFTK LD   FE LR  +G+C
Subjt:  TQSSMILYCDYLSAISISKNPVQHSLTKHINIRHHFTRELVEANIIRLEHVQSAFQLADIFTKPLDFATFEGLRASVGVC

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.3e-8626.31Show/hide
Query:  CGWYFDSGCCRHMTGNADFFSDLIECKVGL-VVFEDGGKGKIIGKGTINRLGLPF---LLNVRLLQGLAANLISISQLCDKAIKSVSIKIDDAKVTLC--
        CG+  DSG   H+  +   ++D +E    L +     G+     K  I RL       L +V   +  A NL+S+ +L +     +SI+ D + VT+   
Subjt:  CGWYFDSGCCRHMTGNADFFSDLIECKVGL-VVFEDGGKGKIIGKGTINRLGLPF---LLNVRLLQGLAANLISISQLCDKAIKSVSIKIDDAKVTLC--

Query:  NLSKVEEAG------------------------LWHKRLGQLSGSTISKVTKADAIIDLPPLSFSSL--ERCSECPVGKQVKSVHKPVNIASTSHI---L
         L  V+ +G                        LWH+R G +S   + ++ + +   D   L+   L  E C  C  GKQ +   K   +   +HI   L
Subjt:  NLSKVEEAG------------------------LWHKRLGQLSGSTISKVTKADAIIDLPPLSFSSL--ERCSECPVGKQVKSVHKPVNIASTSHI---L

Query:  ELLHIDLMGPMQTKSLGRKRCVVVCVDDFSRY--TWIKHVKP---------YSLNSKEKNTGIGRIQTDYGREFENQHFAEFYDNEGIFHEFSAPLKPQQ
         ++H D+ GP+   +L  K   V+ VD F+ Y  T++   K           + +    N  +  +  D GRE+ +    +F   +GI +  + P  PQ 
Subjt:  ELLHIDLMGPMQTKSLGRKRCVVVCVDDFSRY--TWIKHVKP---------YSLNSKEKNTGIGRIQTDYGREFENQHFAEFYDNEGIFHEFSAPLKPQQ

Query:  NVVVERRNRTLQEMARVMIHAKHLPIQFWVEALNTACHIHNRVILRS--GTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRNWDSKSDHGIFLGY
        N V ER  RT+ E AR M+    L   FW EA+ TA ++ NR+  R+   ++ T YE+W  +KP +K+  +FG+T ++   ++ +  +D KS   IF+GY
Subjt:  NVVVERRNRTLQEMARVMIHAKHLPIQFWVEALNTACHIHNRVILRS--GTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRNWDSKSDHGIFLGY

Query:  S---------------------------ANSRAYRVYNQSSKTVMESIN-------------------------VIIDDLGKEPNRNLDDEDDKL-----
                                     NSRA +      K   ES N                           + D  +  N+N  ++  K+     
Subjt:  S---------------------------ANSRAYRVYNQSSKTVMESIN-------------------------VIIDDLGKEPNRNLDDEDDKL-----

Query:  --------RVQLILQSGDLIP-----------PTHITKN----NPSSFIIGDIHS----------------EIITRKKERKDYAKMVSNVCYTSSLEPTT
                 +Q +  S +                H+ ++    NP+     +                   EII R+ ER      +S     +SL    
Subjt:  --------RVQLILQSGDLIP-----------PTHITKN----NPSSFIIGDIHS----------------EIITRKKERKDYAKMVSNVCYTSSLEPTT

Query:  ISAAL----------------SDEHWILAMQEELLQFKRNQVWELMPKPPYANIIGTKWIFKNKMDEEGRVIR---------------------------
        ++A                      W  A+  EL   K N  W +  +P   NI+ ++W+F  K +E G  IR                           
Subjt:  ISAAL----------------SDEHWILAMQEELLQFKRNQVWELMPKPPYANIIGTKWIFKNKMDEEGRVIR---------------------------

Query:  -KTIRLLLSYACFRRFKLFQMDVKSVFLNGYLFEEVYVAQPKGFVDPVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGYQRGSADQTMFIYRQG--
          + R +LS       K+ QMDVK+ FLNG L EE+Y+  P+G     + D+V KL KA+YGLKQA R W+E     L +  +   S D+ ++I  +G  
Subjt:  -KTIRLLLSYACFRRFKLFQMDVKSVFLNGYLFEEVYVAQPKGFVDPVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGYQRGSADQTMFIYRQG--

Query:  TDFLIVQIYVDDIL-----------FGGTSSGEFEMSMVGELTFFLRFQIKQENIGIFFSQEKYAKNLISKFGMDKARSKRTPAATYLKMTKDTNGERVD
         + + V +YVDD++           F      +F M+ + E+  F+  +I+ +   I+ SQ  Y K ++SKF M+   +  TP  + +      + E  +
Subjt:  TDFLIVQIYVDDIL-----------FGGTSSGEFEMSMVGELTFFLRFQIKQENIGIFFSQEKYAKNLISKFGMDKARSKRTPAATYLKMTKDTNGERVD

Query:  TNLYRSIIGSLLY-LTASRPDIAFAVGVCARYQADPRTSHLHCAKRILKYISGTFNYGIWYTYDTT--GTLVGNYDADWAGCTDDRKSTSGGCFFLGNNV
        T   RS+IG L+Y +  +RPD+  AV + +RY +   +      KR+L+Y+ GT +  + +  +      ++G  D+DWAG   DRKST+G  F + +  
Subjt:  TNLYRSIIGSLLY-LTASRPDIAFAVGVCARYQADPRTSHLHCAKRILKYISGTFNYGIWYTYDTT--GTLVGNYDADWAGCTDDRKSTSGGCFFLGNNV

Query:  TACF-SKKQNSYYS---------------QLLWMKQMLDEYRITQSSMI-LYCDYLSAISISKNPVQHSLTKHINIRHHFTRELVEANIIRLEHVQSAFQ
          C+ +K+QNS  +               + LW+K +L    I   + I +Y D    ISI+ NP  H   KHI+I++HF RE V+ N+I LE++ +  Q
Subjt:  TACF-SKKQNSYYS---------------QLLWMKQMLDEYRITQSSMI-LYCDYLSAISISKNPVQHSLTKHINIRHHFTRELVEANIIRLEHVQSAFQ

Query:  LADIFTKPLDFATFEGLRASVGVCQ
        LADIFTKPL  A F  LR  +G+ Q
Subjt:  LADIFTKPLDFATFEGLRASVGVCQ

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.1e-9727.58Show/hide
Query:  WYFDSGCCRHMTGNADFFSDLIECKVGLVVFEDGGKGKIIGKGTI---NRLGLPFLL-NVRLLQGLAANLIS----------------------ISQLCD
        W  D+    H T   D F   +    G V   +    KI G G I     +G   +L +VR +  L  NLIS                       S +  
Subjt:  WYFDSGCCRHMTGNADFFSDLIECKVGLVVFEDGGKGKIIGKGTI---NRLGLPFLL-NVRLLQGLAANLIS----------------------ISQLCD

Query:  KAIKSVSIKIDDAKVTLCNLSKVEE---AGLWHKRLGQLSGSTISKVTKADAIIDLPPLSFSSLERCSECPVGKQVKSVHKPVNIASTSHILELLHIDLM
        K +   ++   +A++    L+  ++     LWHKR+G +S   +  + K   I        ++++ C  C  GKQ + V    +     +IL+L++ D+ 
Subjt:  KAIKSVSIKIDDAKVTLCNLSKVEE---AGLWHKRLGQLSGSTISKVTKADAIIDLPPLSFSSLERCSECPVGKQVKSVHKPVNIASTSHILELLHIDLM

Query:  GPMQTKSLGRKRCVVVCVDDFSRYTWIKHVKP-----------YSLNSKEKNTGIGRIQTDYGREFENQHFAEFYDNEGIFHEFSAPLKPQQNVVVERRN
        GPM+ +S+G  +  V  +DD SR  W+  +K            ++L  +E    + R+++D G E+ ++ F E+  + GI HE + P  PQ N V ER N
Subjt:  GPMQTKSLGRKRCVVVCVDDFSRYTWIKHVKP-----------YSLNSKEKNTGIGRIQTDYGREFENQHFAEFYDNEGIFHEFSAPLKPQQNVVVERRN

Query:  RTLQEMARVMIHAKHLPIQFWVEALNTACHIHNRVILRSGTTTTSYEL----WKGRKPNVKYFHIFGSTCFILSDRDHRRNWDSKSDHGIFLGYSANSRA
        RT+ E  R M+    LP  FW EA+ TAC++ N    RS +   ++E+    W  ++ +  +  +FG   F    ++ R   D KS   IF+GY      
Subjt:  RTLQEMARVMIHAKHLPIQFWVEALNTACHIHNRVILRSGTTTTSYEL----WKGRKPNVKYFHIFGSTCFILSDRDHRRNWDSKSDHGIFLGYSANSRA

Query:  YRVYNQSSKTVMESINVIIDDLGKEPNRNLDDEDDKLRVQLILQSGDLIPPTHITKNNPSSFII--------GDIHSEIITRKKERKDYAKMVSN-----
        YR+++   K V+ S +V+  +      R   D  +K++   I+ +   IP    T NNP+S           G+   E+I + ++  +  + V +     
Subjt:  YRVYNQSSKTVMESINVIIDDLGKEPNRNLDDEDDKLRVQLILQSGDLIPPTHITKNNPSSFII--------GDIHSEIITRKKERKDYAKMVSN-----

Query:  ------------------------VCYTSSLEPTTISAALS---DEHWILAMQEELLQFKRNQVWELMPKPPYANIIGTKWIFKNKMDEEGRVIR-----
                                V  +   EP ++   LS       + AMQEE+   ++N  ++L+  P     +  KW+FK K D + +++R     
Subjt:  ------------------------VCYTSSLEPTTISAALS---DEHWILAMQEELLQFKRNQVWELMPKPPYANIIGTKWIFKNKMDEEGRVIR-----

Query:  -----------------------KTIRLLLSYACFRRFKLFQMDVKSVFLNGYLFEEVYVAQPKGFVDPVHQDHVYKLRKALYGLKQAPRAWYERLSTYL
                                +IR +LS A     ++ Q+DVK+ FL+G L EE+Y+ QP+GF     +  V KL K+LYGLKQAPR WY +  +++
Subjt:  -----------------------KTIRLLLSYACFRRFKLFQMDVKSVFLNGYLFEEVYVAQPKGFVDPVHQDHVYKLRKALYGLKQAPRAWYERLSTYL

Query:  LQQGYQRGSADQTMFIYR-QGTDFLIVQIYVDDILFGGTSSG-----------EFEMSMVGELTFFLRFQIKQENIG--IFFSQEKYAKNLISKFGMDKA
          Q Y +  +D  ++  R    +F+I+ +YVDD+L  G   G            F+M  +G     L  +I +E     ++ SQEKY + ++ +F M  A
Subjt:  LQQGYQRGSADQTMFIYR-QGTDFLIVQIYVDDILFGGTSSG-----------EFEMSMVGELTFFLRFQIKQENIG--IFFSQEKYAKNLISKFGMDKA

Query:  RSKRTPAATYLKMTKDTNGERVDTN------LYRSIIGSLLY-LTASRPDIAFAVGVCARYQADPRTSHLHCAKRILKYISGTFNYGIWYTYDTTGTLVG
        +   TP A +LK++K      V+         Y S +GSL+Y +  +RPDIA AVGV +R+  +P   H    K IL+Y+ GT    + +   +   L G
Subjt:  RSKRTPAATYLKMTKDTNGERVDTN------LYRSIIGSLLY-LTASRPDIAFAVGVCARYQADPRTSHLHCAKRILKYISGTFNYGIWYTYDTTGTLVG

Query:  NYDADWAGCTDDRKSTSGGCFFLGNNVTACFSKKQNSYY---------------SQLLWMKQMLDEYRITQSSMILYCDYLSAISISKNPVQHSLTKHIN
          DAD AG  D+RKS++G  F       +  SK Q                    +++W+K+ L E  + Q   ++YCD  SAI +SKN + H+ TKHI+
Subjt:  NYDADWAGCTDDRKSTSGGCFFLGNNVTACFSKKQNSYY---------------SQLLWMKQMLDEYRITQSSMILYCDYLSAISISKNPVQHSLTKHIN

Query:  IRHHFTRELVEANIIRLEHVQSAFQLADIFTKPLDFATFEGLRASVGV
        +R+H+ RE+V+   +++  + +    AD+ TK +    FE  +  VG+
Subjt:  IRHHFTRELVEANIIRLEHVQSAFQLADIFTKPLDFATFEGLRASVGV

P92519 Uncharacterized mitochondrial protein AtMg008101.9e-2735.86Show/hide
Query:  IYVDDILFGGTS-----------SGEFEMSMVGELTFFLRFQIKQENIGIFFSQEKYAKNLISKFGMDKARSKRTPAATYLKMTKDTNGERVDTNLYRSI
        +YVDDIL  G+S           S  F M  +G + +FL  QIK    G+F SQ KYA+ +++  GM   +   TP    L  +  T  +  D + +RSI
Subjt:  IYVDDILFGGTS-----------SGEFEMSMVGELTFFLRFQIKQENIGIFFSQEKYAKNLISKFGMDKARSKRTPAATYLKMTKDTNGERVDTNLYRSI

Query:  IGSLLYLTASRPDIAFAVGVCARYQADPRTSHLHCAKRILKYISGTFNYGIWYTYDTTGTLVGNYDADWAGCTDDRKSTSGGCFFLGNNVTACFSKKQ
        +G+L YLT +RPDI++AV +  +   +P  +     KR+L+Y+ GT  +G++   ++   +    D+DWAGCT  R+ST+G C FLG N+ +  +K+Q
Subjt:  IGSLLYLTASRPDIAFAVGVCARYQADPRTSHLHCAKRILKYISGTFNYGIWYTYDTTGTLVGNYDADWAGCTDDRKSTSGGCFFLGNNVTACFSKKQ

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.2e-7735.14Show/hide
Query:  VCYTSSLEPTTISAALSDEHWILAMQEELLQFKRNQVWELMPKPP-YANIIGTKWIFKNKMDEEG-------RVIRK---------------------TI
        V   +  EP T   AL DE W  AM  E+     N  W+L+P PP +  I+G +WIF  K + +G       R++ K                     +I
Subjt:  VCYTSSLEPTTISAALSDEHWILAMQEELLQFKRNQVWELMPKPP-YANIIGTKWIFKNKMDEEG-------RVIRK---------------------TI

Query:  RLLLSYACFRRFKLFQMDVKSVFLNGYLFEEVYVAQPKGFVDPVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIV
        R++L  A  R + + Q+DV + FL G L ++VY++QP GF+D    ++V KLRKALYGLKQAPRAWY  L  YLL  G+    +D ++F+ ++G   + +
Subjt:  RLLLSYACFRRFKLFQMDVKSVFLNGYLFEEVYVAQPKGFVDPVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIV

Query:  QIYVDDILFGGTS-----------SGEFEMSMVGELTFFLRFQIKQENIGIFFSQEKYAKNLISKFGMDKARSKRTPAATYLKMTKDTNGERVDTNLYRS
         +YVDDIL  G             S  F +    EL +FL  + K+   G+  SQ +Y  +L+++  M  A+   TP A   K++  +  +  D   YR 
Subjt:  QIYVDDILFGGTS-----------SGEFEMSMVGELTFFLRFQIKQENIGIFFSQEKYAKNLISKFGMDKARSKRTPAATYLKMTKDTNGERVDTNLYRS

Query:  IIGSLLYLTASRPDIAFAVGVCARYQADPRTSHLHCAKRILKYISGTFNYGIWYTYDTTGTLVGNYDADWAGCTDDRKSTSGGCFFLGNNVTACFSKKQN
        I+GSL YL  +RPDI++AV   +++   P   HL   KRIL+Y++GT N+GI+     T +L    DADWAG  DD  ST+G   +LG++  +  SKKQ 
Subjt:  IIGSLLYLTASRPDIAFAVGVCARYQADPRTSHLHCAKRILKYISGTFNYGIWYTYDTTGTLVGNYDADWAGCTDDRKSTSGGCFFLGNNVTACFSKKQN

Query:  SYY---------------SQLLWMKQMLDE--YRITQSSMILYCDYLSAISISKNPVQHSLTKHINIRHHFTRELVEANIIRLEHVQSAFQLADIFTKPL
                          S++ W+  +L E   R+T+   ++YCD + A  +  NPV HS  KHI I +HF R  V++  +R+ HV +  QLAD  TKPL
Subjt:  SYY---------------SQLLWMKQMLDE--YRITQSSMILYCDYLSAISISKNPVQHSLTKHINIRHHFTRELVEANIIRLEHVQSAFQLADIFTKPL

Query:  DFATFEGLRASVGVCQRP
            F+   + +GV + P
Subjt:  DFATFEGLRASVGVCQRP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE28.2e-7934.83Show/hide
Query:  TRKKE--RKDYAKMVSNVCYTSSLEPTTISAALSDEHWILAMQEELLQFKRNQVWELM-PKPPYANIIGTKWIFKNKMDEEG-------RVIRK------
        TR K+  RK   K        ++ EP T   A+ D+ W  AM  E+     N  W+L+ P PP   I+G +WIF  K + +G       R++ K      
Subjt:  TRKKE--RKDYAKMVSNVCYTSSLEPTTISAALSDEHWILAMQEELLQFKRNQVWELM-PKPPYANIIGTKWIFKNKMDEEG-------RVIRK------

Query:  ---------------TIRLLLSYACFRRFKLFQMDVKSVFLNGYLFEEVYVAQPKGFVDPVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGYQRGS
                       +IR++L  A  R + + Q+DV + FL G L +EVY++QP GFVD    D+V +LRKA+YGLKQAPRAWY  L TYLL  G+    
Subjt:  ---------------TIRLLLSYACFRRFKLFQMDVKSVFLNGYLFEEVYVAQPKGFVDPVHQDHVYKLRKALYGLKQAPRAWYERLSTYLLQQGYQRGS

Query:  ADQTMFIYRQGTDFLIVQIYVDDILFGGTS-----------SGEFEMSMVGELTFFLRFQIKQENIGIFFSQEKYAKNLISKFGMDKARSKRTPAATYLK
        +D ++F+ ++G   + + +YVDDIL  G             S  F +    +L +FL  + K+   G+  SQ +Y  +L+++  M  A+   TP AT  K
Subjt:  ADQTMFIYRQGTDFLIVQIYVDDILFGGTS-----------SGEFEMSMVGELTFFLRFQIKQENIGIFFSQEKYAKNLISKFGMDKARSKRTPAATYLK

Query:  MTKDTNGERVDTNLYRSIIGSLLYLTASRPDIAFAVGVCARYQADPRTSHLHCAKRILKYISGTFNYGIWYTYDTTGTLVGNYDADWAGCTDDRKSTSGG
        +T  +  +  D   YR I+GSL YL  +RPD+++AV   ++Y   P   H +  KR+L+Y++GT ++GI+     T +L    DADWAG TDD  ST+G 
Subjt:  MTKDTNGERVDTNLYRSIIGSLLYLTASRPDIAFAVGVCARYQADPRTSHLHCAKRILKYISGTFNYGIWYTYDTTGTLVGNYDADWAGCTDDRKSTSGG

Query:  CFFLGNNVTACFSKKQNSYY---------------SQLLWMKQMLDEYRITQS-SMILYCDYLSAISISKNPVQHSLTKHINIRHHFTRELVEANIIRLE
          +LG++  +  SKKQ                   S+L W+  +L E  I  S   ++YCD + A  +  NPV HS  KHI + +HF R  V++  +R+ 
Subjt:  CFFLGNNVTACFSKKQNSYY---------------SQLLWMKQMLDEYRITQS-SMILYCDYLSAISISKNPVQHSLTKHINIRHHFTRELVEANIIRLE

Query:  HVQSAFQLADIFTKPLDFATFEGLRASVGVCQRP
        HV +  QLAD  TKPL    F+     +GV + P
Subjt:  HVQSAFQLADIFTKPLDFATFEGLRASVGVCQRP

Arabidopsis top hitse value%identityAlignment
AT4G05360.1 Zinc knuckle (CCHC-type) family protein1.9e-0630.57Show/hide
Query:  KFGKGIRCHECEGFGHIQTECATYLKRKKKGMVATLSDEEDYSESDDEDLGMALISICTMNDEENVQTHDQPKSNNST----------EDAEDRKKTKDQ
        K  KG RC EC+GF H+ +ECA  +K K+K  +  +SD E   +SDD +    L++  T        +   P S ++T              D  ++ D 
Subjt:  KFGKGIRCHECEGFGHIQTECATYLKRKKKGMVATLSDEEDYSESDDEDLGMALISICTMNDEENVQTHDQPKSNNST----------EDAEDRKKTKDQ

Query:  EVILQQQERIQD----------LVEENQSFLSS--------IVTLK--EELAETKHQFEELLKFARMPTNGTSKLDDILDQGRRADDKRGLGF
        ++ +  +E  ++          +VEEN              + TLK   E  E   Q EE  K  RM  NGT KL  IL  G+   DK GLGF
Subjt:  EVILQQQERIQD----------LVEENQSFLSS--------IVTLK--EELAETKHQFEELLKFARMPTNGTSKLDDILDQGRRADDKRGLGF

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.7e-5829.96Show/hide
Query:  SSFIIGDIHSEIITRKKERKDYAKMVSNVCYTSSLEPTTISAALSDEHWILAMQEELLQFKRNQVWELMPKPPYANIIGTKWIFKNKMDEEGRVIR----
        +S  I DI S+ ++ +K    Y   +  VC   + EP+T + A     W  AM +E+   +    WE+   PP    IG KW++K K + +G + R    
Subjt:  SSFIIGDIHSEIITRKKERKDYAKMVSNVCYTSSLEPTTISAALSDEHWILAMQEELLQFKRNQVWELMPKPPYANIIGTKWIFKNKMDEEGRVIR----

Query:  ------------------------KTIRLLLSYACFRRFKLFQMDVKSVFLNGYLFEEVYVAQPKGFV----DPVHQDHVYKLRKALYGLKQAPRAWYER
                                 +++L+L+ +    F L Q+D+ + FLNG L EE+Y+  P G+     D +  + V  L+K++YGLKQA R W+ +
Subjt:  ------------------------KTIRLLLSYACFRRFKLFQMDVKSVFLNGYLFEEVYVAQPKGFV----DPVHQDHVYKLRKALYGLKQAPRAWYER

Query:  LSTYLLQQGYQRGSADQTMFIYRQGTDFLIVQIYVDDILFGGTSSGE-----------FEMSMVGELTFFLRFQIKQENIGIFFSQEKYAKNLISKFGMD
         S  L+  G+ +  +D T F+    T FL V +YVDDI+    +              F++  +G L +FL  +I +   GI   Q KYA +L+ + G+ 
Subjt:  LSTYLLQQGYQRGSADQTMFIYRQGTDFLIVQIYVDDILFGGTSSGE-----------FEMSMVGELTFFLRFQIKQENIGIFFSQEKYAKNLISKFGMD

Query:  KARSKRTPAATYLKMTKDTNGERVDTNLYRSIIGSLLYLTASRPDIAFAVGVCARYQADPRTSHLHCAKRILKYISGTFNYGIWYTYDTTGTLVGNYDAD
          +    P    +  +  + G+ VD   YR +IG L+YL  +R DI+FAV   +++   PR +H     +IL YI GT   G++Y+      L    DA 
Subjt:  KARSKRTPAATYLKMTKDTNGERVDTNLYRSIIGSLLYLTASRPDIAFAVGVCARYQADPRTSHLHCAKRILKYISGTFNYGIWYTYDTTGTLVGNYDAD

Query:  WAGCTDDRKSTSGGCFFLGNNVTACFSKKQ---------------NSYYSQLLWMKQMLDEYRITQSS-MILYCDYLSAISISKNPVQHSLTKHINIRHH
        +  C D R+ST+G C FLG ++ +  SKKQ               +    +++W+ Q   E ++  S   +L+CD  +AI I+ N V H  TKHI    H
Subjt:  WAGCTDDRKSTSGGCFFLGNNVTACFSKKQ---------------NSYYSQLLWMKQMLDEYRITQSS-MILYCDYLSAISISKNPVQHSLTKHINIRHH

Query:  FTRE
          RE
Subjt:  FTRE

ATMG00240.1 Gag-Pol-related retrotransposon family protein2.9e-1035.23Show/hide
Query:  LYLTASRPDIAFAVGVCARYQADPRTSHLHCAKRILKYISGTFNYGIWYTYDTTGTLVGNYDADWAGCTDDRKSTSGGC-----FFLG
        +YLT +RPD+ FAV   +++ +  RT+ +    ++L Y+ GT   G++Y+  +   L    D+DWA C D R+S +G C     +FLG
Subjt:  LYLTASRPDIAFAVGVCARYQADPRTSHLHCAKRILKYISGTFNYGIWYTYDTTGTLVGNYDADWAGCTDDRKSTSGGC-----FFLG

ATMG00810.1 DNA/RNA polymerases superfamily protein1.4e-2835.86Show/hide
Query:  IYVDDILFGGTS-----------SGEFEMSMVGELTFFLRFQIKQENIGIFFSQEKYAKNLISKFGMDKARSKRTPAATYLKMTKDTNGERVDTNLYRSI
        +YVDDIL  G+S           S  F M  +G + +FL  QIK    G+F SQ KYA+ +++  GM   +   TP    L  +  T  +  D + +RSI
Subjt:  IYVDDILFGGTS-----------SGEFEMSMVGELTFFLRFQIKQENIGIFFSQEKYAKNLISKFGMDKARSKRTPAATYLKMTKDTNGERVDTNLYRSI

Query:  IGSLLYLTASRPDIAFAVGVCARYQADPRTSHLHCAKRILKYISGTFNYGIWYTYDTTGTLVGNYDADWAGCTDDRKSTSGGCFFLGNNVTACFSKKQ
        +G+L YLT +RPDI++AV +  +   +P  +     KR+L+Y+ GT  +G++   ++   +    D+DWAGCT  R+ST+G C FLG N+ +  +K+Q
Subjt:  IGSLLYLTASRPDIAFAVGVCARYQADPRTSHLHCAKRILKYISGTFNYGIWYTYDTTGTLVGNYDADWAGCTDDRKSTSGGCFFLGNNVTACFSKKQ

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.7e-1038.71Show/hide
Query:  IITRKKE--RKDYAKMVSNVCYTSSLEPTTISAALSDEHWILAMQEELLQFKRNQVWELMPKPPYANIIGTKWIFKNKMDEEGRVIRKTIRLL
        ++TR K    K   K    +  T   EP ++  AL D  W  AMQEEL    RN+ W L+P P   NI+G KW+FK K+  +G + R   RL+
Subjt:  IITRKKE--RKDYAKMVSNVCYTSSLEPTTISAALSDEHWILAMQEELLQFKRNQVWELMPKPPYANIIGTKWIFKNKMDEEGRVIRKTIRLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACAGTATCCGAGAAGGAAACTCAACTAGCAGACCTCCCCTTTTGGACGGAGAAAATTATGGCTACTGGAAGTCACAAATGGAAGCCTTCCTGATGTCACTCAACAT
GAGGAGTTGGAGAGCCGTTATATCCGGATGGGAGTATCCCACTGAAAAGGACGAAGCATGCCAAACTGTTCGAAAATCTGAGTTGAAATGGACAAAAGATGAAGATGATG
CTCCTGTAGGCAACTCTCGTACACTGAATGCCTTGTTTAATGTCGTCAATCCTAACATATTCAAACTAATCAACACTTGTAAATCGGCCAAGGCTGCCTGGGATATCTTG
GAAATCTTGACGTCTCAATTTGAAGCCCTTCAAATGGGTGAAGATGAGACTATAACCGAATTCAATGTCCCTGTCCTTGACATTGCCAACGAATCAGATGCACTCGGGGA
AAAAATGTATGACTCCAAACTTGTTCGAAAGGTTCTCAGATCCCTGCCCTTCAAGTTTAACATGAAAGTCACAGCGATCAAAGAAGCTAATGACCTATCTAAAATGAAAT
TGGATGAACTATTTGGACAGTCAAGAATATCAGATACCTCCTCTTCAGGACACTATCGAAAGAAGGAACACGAACGAGGAAAAGGAACTGAAGCCTCGAAATCTGACAAA
TTTGGCAAAGGGATCAGGTGTCATGAATGTGAAGGATTTGGACATATCCAAACCGAGTGTGCTACTTATCTTAAACGCAAGAAAAAGGGTATGGTAGCTACCTTATCTGA
TGAAGAAGATTATTCCGAAAGCGACGATGAAGACCTTGGAATGGCCTTGATTAGTATCTGCACCATGAATGACGAAGAAAATGTTCAAACACATGACCAGCCGAAATCAA
ATAACTCAACTGAAGACGCAGAAGACAGAAAGAAGACAAAAGATCAAGAAGTTATCCTGCAACAACAAGAACGAATTCAAGATCTAGTGGAAGAGAACCAAAGCTTCCTA
TCCTCTATAGTAACTCTAAAAGAAGAACTAGCGGAAACCAAGCATCAATTCGAAGAACTCCTAAAATTTGCAAGGATGCCGACAAACGGAACCTCGAAACTAGATGACAT
ACTCGACCAAGGAAGGAGGGCTGACGACAAAAGAGGCCTCGGATTTACAGAAAGGGACACACCTGGAAGGGGCACTGAGATTTCTAGAATGTCTATGAAATCCCTAAACA
AGCGAACACGTAGAATTTGTTATTTTTGTGGATGGTACTTTGACAGTGGGTGTTGCAGACACATGACAGGTAATGCAGATTTCTTTTCTGATCTGATTGAATGTAAAGTC
GGGTTAGTAGTATTTGAAGATGGAGGAAAAGGAAAAATAATTGGTAAAGGAACGATTAACCGTCTGGGTCTACCGTTTCTTCTTAATGTTCGACTATTACAAGGACTGGC
TGCAAACCTCATAAGCATCAGCCAACTATGTGACAAGGCTATCAAGTCAGTTTCAATAAAGATAGATGATGCAAAGGTGACCTTATGCAATTTATCAAAAGTAGAAGAAG
CTGGACTCTGGCACAAACGACTTGGACAACTTAGTGGCTCTACTATCTCCAAGGTCACCAAAGCTGATGCCATCATCGATCTTCCCCCGCTATCATTCTCGTCACTAGAA
AGATGTTCGGAGTGCCCAGTTGGCAAGCAAGTCAAGTCTGTGCACAAGCCTGTAAATATCGCCTCGACGTCCCATATTTTGGAACTTCTTCATATAGACCTAATGGGGCC
CATGCAAACAAAAAGCTTGGGAAGAAAACGGTGTGTAGTAGTGTGTGTAGATGATTTCTCCCGCTACACCTGGATAAAACATGTCAAACCCTATTCACTCAACTCCAAAG
AGAAAAATACCGGTATTGGCCGAATACAAACTGATTATGGGCGTGAATTTGAGAATCAGCACTTTGCTGAGTTCTATGATAATGAAGGCATCTTTCATGAGTTCTCTGCT
CCTTTAAAACCACAGCAAAATGTAGTTGTAGAGAGAAGGAACCGAACCTTACAGGAGATGGCCCGAGTGATGATTCATGCAAAGCACCTACCAATCCAATTCTGGGTGGA
AGCTCTAAACACTGCATGCCATATACATAACAGAGTCATTCTCCGTTCAGGGACCACTACAACCTCATATGAGCTGTGGAAAGGGAGAAAACCAAATGTGAAGTATTTTC
ACATCTTTGGCAGCACGTGCTTTATCTTGAGTGATAGGGATCATCGTAGAAACTGGGACTCAAAGTCTGATCATGGAATATTTCTGGGATATTCTGCTAACAGCCGAGCC
TACAGGGTCTACAACCAGAGTTCCAAAACAGTAATGGAATCCATTAACGTCATTATTGATGACCTTGGTAAGGAACCCAACAGAAATCTTGATGATGAAGATGACAAACT
GCGGGTGCAACTGATTCTTCAAAGTGGTGACCTCATACCTCCTACGCATATAACCAAAAACAATCCCTCCAGCTTCATTATTGGAGATATTCACAGTGAAATCATAACTC
GGAAGAAGGAGAGGAAAGATTATGCAAAAATGGTTTCCAATGTGTGCTACACATCTTCACTAGAACCAACCACGATCTCTGCAGCACTTTCCGATGAACACTGGATCTTG
GCTATGCAGGAAGAGCTACTCCAGTTTAAAAGAAACCAAGTATGGGAATTAATGCCAAAGCCACCTTATGCTAACATAATTGGTACCAAATGGATCTTTAAGAACAAAAT
GGATGAAGAAGGTAGAGTTATCCGTAAAACCATCCGACTACTGCTAAGCTACGCATGTTTTCGAAGGTTCAAACTGTTCCAAATGGATGTAAAGAGTGTGTTCCTAAATG
GGTACTTATTTGAGGAAGTATATGTGGCCCAGCCAAAAGGATTTGTTGATCCAGTACATCAAGATCATGTTTACAAACTTCGAAAGGCACTCTATGGACTTAAACAAGCT
CCTAGAGCTTGGTATGAGAGACTCTCCACTTACTTGTTACAACAAGGATATCAAAGGGGTAGTGCGGATCAAACTATGTTTATATATCGTCAAGGCACTGACTTTCTGAT
CGTTCAGATCTATGTTGACGACATTTTATTTGGTGGTACGTCCTCAGGAGAATTTGAAATGAGCATGGTTGGGGAACTAACCTTCTTCCTGAGGTTTCAAATTAAGCAAG
AAAATATAGGGATCTTTTTCTCTCAAGAAAAATATGCAAAAAACCTCATATCTAAGTTTGGTATGGATAAGGCCAGATCTAAAAGAACGCCCGCCGCTACATATCTGAAA
ATGACTAAGGACACAAATGGTGAAAGAGTTGATACAAACCTGTACCGAAGCATCATTGGGAGCTTGCTCTATTTAACAGCCAGCAGACCGGATATAGCCTTTGCAGTAGG
AGTTTGTGCTCGGTATCAGGCTGACCCGCGCACGTCACATCTTCACTGTGCAAAACGAATACTCAAATATATATCAGGTACATTTAACTATGGTATTTGGTATACTTATG
ACACAACTGGTACTCTTGTTGGCAACTATGATGCAGACTGGGCAGGGTGCACAGATGATAGGAAGAGCACATCTGGAGGGTGCTTCTTCTTAGGGAATAATGTAACTGCT
TGTTTCAGCAAGAAACAAAATAGTTACTACTCTCAACTGTTGTGGATGAAACAAATGCTTGATGAATATAGGATAACTCAGTCTTCCATGATTCTCTACTGTGATTATCT
GAGCGCAATAAGCATCTCCAAAAATCCAGTTCAACATAGTCTAACAAAGCACATAAATATCCGACATCACTTTACTCGAGAACTCGTTGAAGCCAATATTATAAGATTAG
AACATGTTCAAAGTGCCTTTCAGCTAGCAGATATATTTACAAAGCCTCTGGATTTTGCAACATTTGAAGGACTGAGGGCCAGTGTTGGAGTCTGTCAACGGCCCACATGA
mRNA sequenceShow/hide mRNA sequence
ATGGACAGTATCCGAGAAGGAAACTCAACTAGCAGACCTCCCCTTTTGGACGGAGAAAATTATGGCTACTGGAAGTCACAAATGGAAGCCTTCCTGATGTCACTCAACAT
GAGGAGTTGGAGAGCCGTTATATCCGGATGGGAGTATCCCACTGAAAAGGACGAAGCATGCCAAACTGTTCGAAAATCTGAGTTGAAATGGACAAAAGATGAAGATGATG
CTCCTGTAGGCAACTCTCGTACACTGAATGCCTTGTTTAATGTCGTCAATCCTAACATATTCAAACTAATCAACACTTGTAAATCGGCCAAGGCTGCCTGGGATATCTTG
GAAATCTTGACGTCTCAATTTGAAGCCCTTCAAATGGGTGAAGATGAGACTATAACCGAATTCAATGTCCCTGTCCTTGACATTGCCAACGAATCAGATGCACTCGGGGA
AAAAATGTATGACTCCAAACTTGTTCGAAAGGTTCTCAGATCCCTGCCCTTCAAGTTTAACATGAAAGTCACAGCGATCAAAGAAGCTAATGACCTATCTAAAATGAAAT
TGGATGAACTATTTGGACAGTCAAGAATATCAGATACCTCCTCTTCAGGACACTATCGAAAGAAGGAACACGAACGAGGAAAAGGAACTGAAGCCTCGAAATCTGACAAA
TTTGGCAAAGGGATCAGGTGTCATGAATGTGAAGGATTTGGACATATCCAAACCGAGTGTGCTACTTATCTTAAACGCAAGAAAAAGGGTATGGTAGCTACCTTATCTGA
TGAAGAAGATTATTCCGAAAGCGACGATGAAGACCTTGGAATGGCCTTGATTAGTATCTGCACCATGAATGACGAAGAAAATGTTCAAACACATGACCAGCCGAAATCAA
ATAACTCAACTGAAGACGCAGAAGACAGAAAGAAGACAAAAGATCAAGAAGTTATCCTGCAACAACAAGAACGAATTCAAGATCTAGTGGAAGAGAACCAAAGCTTCCTA
TCCTCTATAGTAACTCTAAAAGAAGAACTAGCGGAAACCAAGCATCAATTCGAAGAACTCCTAAAATTTGCAAGGATGCCGACAAACGGAACCTCGAAACTAGATGACAT
ACTCGACCAAGGAAGGAGGGCTGACGACAAAAGAGGCCTCGGATTTACAGAAAGGGACACACCTGGAAGGGGCACTGAGATTTCTAGAATGTCTATGAAATCCCTAAACA
AGCGAACACGTAGAATTTGTTATTTTTGTGGATGGTACTTTGACAGTGGGTGTTGCAGACACATGACAGGTAATGCAGATTTCTTTTCTGATCTGATTGAATGTAAAGTC
GGGTTAGTAGTATTTGAAGATGGAGGAAAAGGAAAAATAATTGGTAAAGGAACGATTAACCGTCTGGGTCTACCGTTTCTTCTTAATGTTCGACTATTACAAGGACTGGC
TGCAAACCTCATAAGCATCAGCCAACTATGTGACAAGGCTATCAAGTCAGTTTCAATAAAGATAGATGATGCAAAGGTGACCTTATGCAATTTATCAAAAGTAGAAGAAG
CTGGACTCTGGCACAAACGACTTGGACAACTTAGTGGCTCTACTATCTCCAAGGTCACCAAAGCTGATGCCATCATCGATCTTCCCCCGCTATCATTCTCGTCACTAGAA
AGATGTTCGGAGTGCCCAGTTGGCAAGCAAGTCAAGTCTGTGCACAAGCCTGTAAATATCGCCTCGACGTCCCATATTTTGGAACTTCTTCATATAGACCTAATGGGGCC
CATGCAAACAAAAAGCTTGGGAAGAAAACGGTGTGTAGTAGTGTGTGTAGATGATTTCTCCCGCTACACCTGGATAAAACATGTCAAACCCTATTCACTCAACTCCAAAG
AGAAAAATACCGGTATTGGCCGAATACAAACTGATTATGGGCGTGAATTTGAGAATCAGCACTTTGCTGAGTTCTATGATAATGAAGGCATCTTTCATGAGTTCTCTGCT
CCTTTAAAACCACAGCAAAATGTAGTTGTAGAGAGAAGGAACCGAACCTTACAGGAGATGGCCCGAGTGATGATTCATGCAAAGCACCTACCAATCCAATTCTGGGTGGA
AGCTCTAAACACTGCATGCCATATACATAACAGAGTCATTCTCCGTTCAGGGACCACTACAACCTCATATGAGCTGTGGAAAGGGAGAAAACCAAATGTGAAGTATTTTC
ACATCTTTGGCAGCACGTGCTTTATCTTGAGTGATAGGGATCATCGTAGAAACTGGGACTCAAAGTCTGATCATGGAATATTTCTGGGATATTCTGCTAACAGCCGAGCC
TACAGGGTCTACAACCAGAGTTCCAAAACAGTAATGGAATCCATTAACGTCATTATTGATGACCTTGGTAAGGAACCCAACAGAAATCTTGATGATGAAGATGACAAACT
GCGGGTGCAACTGATTCTTCAAAGTGGTGACCTCATACCTCCTACGCATATAACCAAAAACAATCCCTCCAGCTTCATTATTGGAGATATTCACAGTGAAATCATAACTC
GGAAGAAGGAGAGGAAAGATTATGCAAAAATGGTTTCCAATGTGTGCTACACATCTTCACTAGAACCAACCACGATCTCTGCAGCACTTTCCGATGAACACTGGATCTTG
GCTATGCAGGAAGAGCTACTCCAGTTTAAAAGAAACCAAGTATGGGAATTAATGCCAAAGCCACCTTATGCTAACATAATTGGTACCAAATGGATCTTTAAGAACAAAAT
GGATGAAGAAGGTAGAGTTATCCGTAAAACCATCCGACTACTGCTAAGCTACGCATGTTTTCGAAGGTTCAAACTGTTCCAAATGGATGTAAAGAGTGTGTTCCTAAATG
GGTACTTATTTGAGGAAGTATATGTGGCCCAGCCAAAAGGATTTGTTGATCCAGTACATCAAGATCATGTTTACAAACTTCGAAAGGCACTCTATGGACTTAAACAAGCT
CCTAGAGCTTGGTATGAGAGACTCTCCACTTACTTGTTACAACAAGGATATCAAAGGGGTAGTGCGGATCAAACTATGTTTATATATCGTCAAGGCACTGACTTTCTGAT
CGTTCAGATCTATGTTGACGACATTTTATTTGGTGGTACGTCCTCAGGAGAATTTGAAATGAGCATGGTTGGGGAACTAACCTTCTTCCTGAGGTTTCAAATTAAGCAAG
AAAATATAGGGATCTTTTTCTCTCAAGAAAAATATGCAAAAAACCTCATATCTAAGTTTGGTATGGATAAGGCCAGATCTAAAAGAACGCCCGCCGCTACATATCTGAAA
ATGACTAAGGACACAAATGGTGAAAGAGTTGATACAAACCTGTACCGAAGCATCATTGGGAGCTTGCTCTATTTAACAGCCAGCAGACCGGATATAGCCTTTGCAGTAGG
AGTTTGTGCTCGGTATCAGGCTGACCCGCGCACGTCACATCTTCACTGTGCAAAACGAATACTCAAATATATATCAGGTACATTTAACTATGGTATTTGGTATACTTATG
ACACAACTGGTACTCTTGTTGGCAACTATGATGCAGACTGGGCAGGGTGCACAGATGATAGGAAGAGCACATCTGGAGGGTGCTTCTTCTTAGGGAATAATGTAACTGCT
TGTTTCAGCAAGAAACAAAATAGTTACTACTCTCAACTGTTGTGGATGAAACAAATGCTTGATGAATATAGGATAACTCAGTCTTCCATGATTCTCTACTGTGATTATCT
GAGCGCAATAAGCATCTCCAAAAATCCAGTTCAACATAGTCTAACAAAGCACATAAATATCCGACATCACTTTACTCGAGAACTCGTTGAAGCCAATATTATAAGATTAG
AACATGTTCAAAGTGCCTTTCAGCTAGCAGATATATTTACAAAGCCTCTGGATTTTGCAACATTTGAAGGACTGAGGGCCAGTGTTGGAGTCTGTCAACGGCCCACATGA
Protein sequenceShow/hide protein sequence
MDSIREGNSTSRPPLLDGENYGYWKSQMEAFLMSLNMRSWRAVISGWEYPTEKDEACQTVRKSELKWTKDEDDAPVGNSRTLNALFNVVNPNIFKLINTCKSAKAAWDIL
EILTSQFEALQMGEDETITEFNVPVLDIANESDALGEKMYDSKLVRKVLRSLPFKFNMKVTAIKEANDLSKMKLDELFGQSRISDTSSSGHYRKKEHERGKGTEASKSDK
FGKGIRCHECEGFGHIQTECATYLKRKKKGMVATLSDEEDYSESDDEDLGMALISICTMNDEENVQTHDQPKSNNSTEDAEDRKKTKDQEVILQQQERIQDLVEENQSFL
SSIVTLKEELAETKHQFEELLKFARMPTNGTSKLDDILDQGRRADDKRGLGFTERDTPGRGTEISRMSMKSLNKRTRRICYFCGWYFDSGCCRHMTGNADFFSDLIECKV
GLVVFEDGGKGKIIGKGTINRLGLPFLLNVRLLQGLAANLISISQLCDKAIKSVSIKIDDAKVTLCNLSKVEEAGLWHKRLGQLSGSTISKVTKADAIIDLPPLSFSSLE
RCSECPVGKQVKSVHKPVNIASTSHILELLHIDLMGPMQTKSLGRKRCVVVCVDDFSRYTWIKHVKPYSLNSKEKNTGIGRIQTDYGREFENQHFAEFYDNEGIFHEFSA
PLKPQQNVVVERRNRTLQEMARVMIHAKHLPIQFWVEALNTACHIHNRVILRSGTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRRNWDSKSDHGIFLGYSANSRA
YRVYNQSSKTVMESINVIIDDLGKEPNRNLDDEDDKLRVQLILQSGDLIPPTHITKNNPSSFIIGDIHSEIITRKKERKDYAKMVSNVCYTSSLEPTTISAALSDEHWIL
AMQEELLQFKRNQVWELMPKPPYANIIGTKWIFKNKMDEEGRVIRKTIRLLLSYACFRRFKLFQMDVKSVFLNGYLFEEVYVAQPKGFVDPVHQDHVYKLRKALYGLKQA
PRAWYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIVQIYVDDILFGGTSSGEFEMSMVGELTFFLRFQIKQENIGIFFSQEKYAKNLISKFGMDKARSKRTPAATYLK
MTKDTNGERVDTNLYRSIIGSLLYLTASRPDIAFAVGVCARYQADPRTSHLHCAKRILKYISGTFNYGIWYTYDTTGTLVGNYDADWAGCTDDRKSTSGGCFFLGNNVTA
CFSKKQNSYYSQLLWMKQMLDEYRITQSSMILYCDYLSAISISKNPVQHSLTKHINIRHHFTRELVEANIIRLEHVQSAFQLADIFTKPLDFATFEGLRASVGVCQRPT