; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0014211 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0014211
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionEnzymatic polyprotein
Genome locationchr04:20534491..20538840
RNA-Seq ExpressionIVF0014211
SyntenyIVF0014211
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001878 - Zinc finger, CCHC-type
IPR001995 - Peptidase A2A, retrovirus, catalytic
IPR018061 - Retropepsins
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0056776.1 Enzymatic polyprotein [Cucumis melo var. makuwa]0.091.18Show/hide
Query:  MICVNLCTENKHTTKVIKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSSRKFFRKNK--DQESPPRRRRHYYKGKGKKKYSSKTNTICFKCNQKGHY
        MICVNLCTENKHTTKVIKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSS+KFFRK+K  DQESP RRRRHY KGK KK YSSKT+TICFKCNQKGHY
Subjt:  MICVNLCTENKHTTKVIKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSSRKFFRKNK--DQESPPRRRRHYYKGKGKKKYSSKTNTICFKCNQKGHY

Query:  ANRCPLKDKINALTVDEETKQSLLYAIRTDDETSSQTESSSEEDCINILQEEESSFEEEFYSQSNSSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDL
        ANRCPLKDKINA+T+DEETKQSLLYAIR+DD+T+SQTESSSEED INILQEE SS EEEFYSQS+SSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDL
Subjt:  ANRCPLKDKINALTVDEETKQSLLYAIRTDDETSSQTESSSEEDCINILQEEESSFEEEFYSQSNSSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDL

Query:  IEQIPDEGAKRTCLLKLKQSLEEQAPQKTIQNPIMYSYQDILNRVKGEAKVPIQVEDLHHEVKILKKEVAENKQRLIYLETAFQAFQKSQALKEESTSDF
        IEQIPDE AKRTCLLKLKQSLEEQAPQK IQNPIMYSYQDILNRVKGEAK+PIQVEDLHHEVK LK+EVAENKQRLIYLETAFQAFQ SQA KEESTSDF
Subjt:  IEQIPDEGAKRTCLLKLKQSLEEQAPQKTIQNPIMYSYQDILNRVKGEAKVPIQVEDLHHEVKILKKEVAENKQRLIYLETAFQAFQKSQALKEESTSDF

Query:  KRKSAGKALLIEEIGTINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGAGGNPLNIQFKLSK-----------
        +RKSAGKALLIEEIGTINSIS+IQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGA GNPLNIQFKLSK           
Subjt:  KRKSAGKALLIEEIGTINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGAGGNPLNIQFKLSK-----------

Query:  -------NLNEGIILGTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLK
               NLNEGIILGTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIA KEKHIEFLQDDIKAGKVAIEIQKP VQ K
Subjt:  -------NLNEGIILGTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLK

Query:  IQNFQKQLEKEVCSTLPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVI
        IQNFQ+QLEKEVCSTLPNAFWDRKKHMV LPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEIT+LLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVI
Subjt:  IQNFQKQLEKEVCSTLPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVI

Query:  NYKPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDD
        NYKPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIH SDRYKTAF VPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDD
Subjt:  NYKPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDD

Query:  VL--GMFVGRKFSH
        VL     V + F H
Subjt:  VL--GMFVGRKFSH

KAA0059217.1 Enzymatic polyprotein [Cucumis melo var. makuwa]0.090.84Show/hide
Query:  VIKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSSRKFFRKNK--DQESPPRRRRHYYKGKGKKKYSSKTNTICFKCNQKGHYANRCPLKDKINALTV
        +IKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSS+KFFRK+K  DQESP RR+RHY KGK KK YSSKT+TICFKCNQKGHYANRCPLKDKINA+T+
Subjt:  VIKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSSRKFFRKNK--DQESPPRRRRHYYKGKGKKKYSSKTNTICFKCNQKGHYANRCPLKDKINALTV

Query:  DEETKQSLLYAIRTDDETSSQTESSSEEDCINILQEEESSFEEEFYSQSNSSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDLIEQIPDEGAKRTCLL
        DEETKQSLLYAIR+DD+T+SQTESSSEED INILQEE SS EEEFYSQS+SSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDLIEQIPDE AKRTCLL
Subjt:  DEETKQSLLYAIRTDDETSSQTESSSEEDCINILQEEESSFEEEFYSQSNSSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDLIEQIPDEGAKRTCLL

Query:  KLKQSLEEQAPQKTIQNPIMYSYQDILNRVKGEAKVPIQVEDLHHEVKILKKEVAENKQRLIYLETAFQAFQKSQALKEESTSDFKRKSAGKALLIEEIG
        KLKQSLEEQAPQK IQNPIMYSYQDILNRVKGEAK+PIQVEDLHHEVK LK+EVAENKQRLIYLETAFQAFQ SQA KEESTSDF+RKSAGKALLIEEIG
Subjt:  KLKQSLEEQAPQKTIQNPIMYSYQDILNRVKGEAKVPIQVEDLHHEVKILKKEVAENKQRLIYLETAFQAFQKSQALKEESTSDFKRKSAGKALLIEEIG

Query:  TINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGAGGNPLNIQFKLSK------------------NLNEGIIL
        TINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGA GNPLNIQFKLSK                  NLNEGIIL
Subjt:  TINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGAGGNPLNIQFKLSK------------------NLNEGIIL

Query:  GTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLKIQNFQKQLEKEVCST
        GTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIA KEKHIEFLQDDIKAGKVAIEIQKP VQ KIQNFQ+QLEKEVCST
Subjt:  GTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLKIQNFQKQLEKEVCST

Query:  LPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYP
        LPNAFWDRKKHMV LPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEIT+LLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYP
Subjt:  LPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYP

Query:  IPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDDVL--GMFVGRKFSH
        IPNRQDLLKRITLAKVFSKFDMKSGFWQIQIH SDRYKTAF VPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDDVL     V + F H
Subjt:  IPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDDVL--GMFVGRKFSH

KAA0068171.1 Enzymatic polyprotein [Cucumis melo var. makuwa]0.091.71Show/hide
Query:  MICVNLCTENKHTTKVIKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSSRKFFRKNKDQESPPRRRRHYYKGKGKKKYSSKTNTICFKCNQKGHYAN
        MICVNLCTENKHTTKVIKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSSRKFFRKNKDQESPPRRRRHYYKGKGKKKYSSKTNTICFKCNQKGHYAN
Subjt:  MICVNLCTENKHTTKVIKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSSRKFFRKNKDQESPPRRRRHYYKGKGKKKYSSKTNTICFKCNQKGHYAN

Query:  RCPLKDKINALTVDEETKQSLLYAIRTDDETSSQTESSSEEDCINILQEEESSFEEEFYSQSNSSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDLIE
        RCPLKDKINALTVDEETKQSLLYAIRTDDETSSQTESSSEEDCINILQEEESSFEEEFYSQSNSSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDLIE
Subjt:  RCPLKDKINALTVDEETKQSLLYAIRTDDETSSQTESSSEEDCINILQEEESSFEEEFYSQSNSSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDLIE

Query:  QIPDEGAKRTCLLKLKQSLEEQAPQKTIQNPIMYSYQDILNRVKGEAKVPIQVEDLHHEVKILKKEVAENKQRLIYLETAFQAFQKSQALKEESTSDFKR
        QIPDEGAKRTCLLKLKQSLEEQAPQKTIQNPIMYSYQDILNRVKGEAKVPIQVEDLHHEVKILKKEVAENKQRLIYLETAFQAFQKSQALKEESTSDFKR
Subjt:  QIPDEGAKRTCLLKLKQSLEEQAPQKTIQNPIMYSYQDILNRVKGEAKVPIQVEDLHHEVKILKKEVAENKQRLIYLETAFQAFQKSQALKEESTSDFKR

Query:  KSAGKALLIEEIGTINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGAGGNPLNIQFKLSK-------------
        KSAGKALLIEEIGTINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGAGGNPLNIQFKLSK             
Subjt:  KSAGKALLIEEIGTINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGAGGNPLNIQFKLSK-------------

Query:  -----NLNEGIILGTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLKIQ
             NLNEGIILGTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLKIQ
Subjt:  -----NLNEGIILGTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLKIQ

Query:  NFQKQLEKEVCSTLPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVINY
        NFQKQLEKEVCSTLPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVK                                AEKERGVPRLVINY
Subjt:  NFQKQLEKEVCSTLPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVINY

Query:  KPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDDVL
        KPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDDVL
Subjt:  KPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDDVL

Query:  --GMFVGRKFSH
             V + F H
Subjt:  --GMFVGRKFSH

TYJ97599.1 Enzymatic polyprotein [Cucumis melo var. makuwa]0.091.18Show/hide
Query:  MICVNLCTENKHTTKVIKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSSRKFFRKNK--DQESPPRRRRHYYKGKGKKKYSSKTNTICFKCNQKGHY
        MICVNLCTENKHTTKVIKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSS+KFFRK+K  DQESP RR+RHY KGK KK YSSKT+TICFKCNQKGHY
Subjt:  MICVNLCTENKHTTKVIKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSSRKFFRKNK--DQESPPRRRRHYYKGKGKKKYSSKTNTICFKCNQKGHY

Query:  ANRCPLKDKINALTVDEETKQSLLYAIRTDDETSSQTESSSEEDCINILQEEESSFEEEFYSQSNSSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDL
        ANRCPLKDKINA+T+DEETKQSLLYAIR+DD+T+SQTESSSEED INILQEE SS EEEFYSQS+SSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDL
Subjt:  ANRCPLKDKINALTVDEETKQSLLYAIRTDDETSSQTESSSEEDCINILQEEESSFEEEFYSQSNSSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDL

Query:  IEQIPDEGAKRTCLLKLKQSLEEQAPQKTIQNPIMYSYQDILNRVKGEAKVPIQVEDLHHEVKILKKEVAENKQRLIYLETAFQAFQKSQALKEESTSDF
        IEQIPDE AKRTCLLKLKQSLEEQAPQK IQNPIMYSYQDILNRVKGEAK+PIQVEDLHHEVK LK+EVAENKQRLIYLETAFQAFQ SQA KEESTSDF
Subjt:  IEQIPDEGAKRTCLLKLKQSLEEQAPQKTIQNPIMYSYQDILNRVKGEAKVPIQVEDLHHEVKILKKEVAENKQRLIYLETAFQAFQKSQALKEESTSDF

Query:  KRKSAGKALLIEEIGTINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGAGGNPLNIQFKLSK-----------
        +RKSAGKALLIEEIGTINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGA GNPLNIQFKLSK           
Subjt:  KRKSAGKALLIEEIGTINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGAGGNPLNIQFKLSK-----------

Query:  -------NLNEGIILGTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLK
               NLNEGIILGTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIA KEKHIEFLQDDIKAGKVAIEIQKP VQ K
Subjt:  -------NLNEGIILGTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLK

Query:  IQNFQKQLEKEVCSTLPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVI
        IQNFQ+QLEKEVCSTLPNAFWDRKKHMV LPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEIT+LLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVI
Subjt:  IQNFQKQLEKEVCSTLPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVI

Query:  NYKPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDD
        NYKPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIH SDRYKTAF VPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDD
Subjt:  NYKPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDD

Query:  VL--GMFVGRKFSH
        VL     V + F H
Subjt:  VL--GMFVGRKFSH

TYK21091.1 Enzymatic polyprotein [Cucumis melo var. makuwa]0.096.21Show/hide
Query:  MICVNLCTENKHTTKVIKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSSRKFFRKNKDQESPPRRRRHYYKGKGKKKYSSKTNTICFKCNQKGHYAN
        MICVNLCTENKHTTKVIKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSSRKFFRKNKDQESPPRRRRHYYKGKGKKKYSSKTNTICFKCNQKGHYAN
Subjt:  MICVNLCTENKHTTKVIKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSSRKFFRKNKDQESPPRRRRHYYKGKGKKKYSSKTNTICFKCNQKGHYAN

Query:  RCPLKDKINALTVDEETKQSLLYAIRTDDETSSQTESSSEEDCINILQEEESSFEEEFYSQSNSSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDLIE
        RCPLKDKINALTVDEETKQSLLYAIRTDDETSSQTESSSEEDCINILQEEESSFEEEFYSQSNSSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDLIE
Subjt:  RCPLKDKINALTVDEETKQSLLYAIRTDDETSSQTESSSEEDCINILQEEESSFEEEFYSQSNSSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDLIE

Query:  QIPDEGAKRTCLLKLKQSLEEQAPQKTIQNPIMYSYQDILNRVKGEAKVPIQVEDLHHEVKILKKEVAENKQRLIYLETAFQAFQKSQALKEESTSDFKR
        QIPDEGAKRTCLLKLKQSLEEQAPQKTIQNPIMYSYQDILNRVKGEAKVPIQVEDLHHEVKILKKEVAENKQRLIYLETAFQAFQKSQALKEESTSDFKR
Subjt:  QIPDEGAKRTCLLKLKQSLEEQAPQKTIQNPIMYSYQDILNRVKGEAKVPIQVEDLHHEVKILKKEVAENKQRLIYLETAFQAFQKSQALKEESTSDFKR

Query:  KSAGKALLIEEIGTINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGAGGNPLNIQFKLSK-------------
        KSAGKALLIEEIGTINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGAGGNPLNIQFKLSK             
Subjt:  KSAGKALLIEEIGTINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGAGGNPLNIQFKLSK-------------

Query:  -----NLNEGIILGTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLKIQ
             NLNEGIILGTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLKIQ
Subjt:  -----NLNEGIILGTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLKIQ

Query:  NFQKQLEKEVCSTLPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVINY
        NFQKQLEKEVCSTLPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVINY
Subjt:  NFQKQLEKEVCSTLPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVINY

Query:  KPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDDVL
        KPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDDVL
Subjt:  KPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDDVL

Query:  --GMFVGRKFSH
             V + F H
Subjt:  --GMFVGRKFSH

TrEMBL top hitse value%identityAlignment
A0A5A7UR29 Enzymatic polyprotein0.0e+0092.31Show/hide
Query:  MICVNLCTENKHTTKVIKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSSRKFFRKN--KDQESPPRRRRHYYKGKGKKKYSSKTNTICFKCNQKGHY
        MICVNLCTENKHTTKVIKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSS+KFFRK+  KDQESP RRRRHY KGK KK YSSKT+TICFKCNQKGHY
Subjt:  MICVNLCTENKHTTKVIKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSSRKFFRKN--KDQESPPRRRRHYYKGKGKKKYSSKTNTICFKCNQKGHY

Query:  ANRCPLKDKINALTVDEETKQSLLYAIRTDDETSSQTESSSEEDCINILQEEESSFEEEFYSQSNSSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDL
        ANRCPLKDKINA+T+DEETKQSLLYAIR+DD+T+SQTESSSEED INILQEE SS EEEFYSQS+SSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDL
Subjt:  ANRCPLKDKINALTVDEETKQSLLYAIRTDDETSSQTESSSEEDCINILQEEESSFEEEFYSQSNSSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDL

Query:  IEQIPDEGAKRTCLLKLKQSLEEQAPQKTIQNPIMYSYQDILNRVKGEAKVPIQVEDLHHEVKILKKEVAENKQRLIYLETAFQAFQKSQALKEESTSDF
        IEQIPDE AKRTCLLKLKQSLEEQAPQK IQNPIMYSYQDILNRVKGEAK+PIQVEDLHHEVK LK+EVAENKQRLIYLETAFQAFQ SQA KEESTSDF
Subjt:  IEQIPDEGAKRTCLLKLKQSLEEQAPQKTIQNPIMYSYQDILNRVKGEAKVPIQVEDLHHEVKILKKEVAENKQRLIYLETAFQAFQKSQALKEESTSDF

Query:  KRKSAGKALLIEEIGTINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGAGGNPLNIQFKLS------------
        +RKSAGKALLIEEIGTINSIS+IQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGA GNPLNIQFKLS            
Subjt:  KRKSAGKALLIEEIGTINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGAGGNPLNIQFKLS------------

Query:  ------KNLNEGIILGTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLK
              KNLNEGIILGTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIA KEKHIEFLQDDIKAGKVAIEIQKP VQ K
Subjt:  ------KNLNEGIILGTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLK

Query:  IQNFQKQLEKEVCSTLPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVI
        IQNFQ+QLEKEVCSTLPNAFWDRKKHMV LPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEIT+LLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVI
Subjt:  IQNFQKQLEKEVCSTLPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVI

Query:  NYKPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDD
        NYKPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIH SDRYKTAF VPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDD
Subjt:  NYKPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDD

Query:  VL
        VL
Subjt:  VL

A0A5A7UX67 Enzymatic polyprotein0.0e+0091.99Show/hide
Query:  VIKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSSRKFFRKN--KDQESPPRRRRHYYKGKGKKKYSSKTNTICFKCNQKGHYANRCPLKDKINALTV
        +IKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSS+KFFRK+  KDQESP RR+RHY KGK KK YSSKT+TICFKCNQKGHYANRCPLKDKINA+T+
Subjt:  VIKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSSRKFFRKN--KDQESPPRRRRHYYKGKGKKKYSSKTNTICFKCNQKGHYANRCPLKDKINALTV

Query:  DEETKQSLLYAIRTDDETSSQTESSSEEDCINILQEEESSFEEEFYSQSNSSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDLIEQIPDEGAKRTCLL
        DEETKQSLLYAIR+DD+T+SQTESSSEED INILQEE SS EEEFYSQS+SSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDLIEQIPDE AKRTCLL
Subjt:  DEETKQSLLYAIRTDDETSSQTESSSEEDCINILQEEESSFEEEFYSQSNSSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDLIEQIPDEGAKRTCLL

Query:  KLKQSLEEQAPQKTIQNPIMYSYQDILNRVKGEAKVPIQVEDLHHEVKILKKEVAENKQRLIYLETAFQAFQKSQALKEESTSDFKRKSAGKALLIEEIG
        KLKQSLEEQAPQK IQNPIMYSYQDILNRVKGEAK+PIQVEDLHHEVK LK+EVAENKQRLIYLETAFQAFQ SQA KEESTSDF+RKSAGKALLIEEIG
Subjt:  KLKQSLEEQAPQKTIQNPIMYSYQDILNRVKGEAKVPIQVEDLHHEVKILKKEVAENKQRLIYLETAFQAFQKSQALKEESTSDFKRKSAGKALLIEEIG

Query:  TINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGAGGNPLNIQFKLS------------------KNLNEGIIL
        TINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGA GNPLNIQFKLS                  KNLNEGIIL
Subjt:  TINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGAGGNPLNIQFKLS------------------KNLNEGIIL

Query:  GTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLKIQNFQKQLEKEVCST
        GTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIA KEKHIEFLQDDIKAGKVAIEIQKP VQ KIQNFQ+QLEKEVCST
Subjt:  GTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLKIQNFQKQLEKEVCST

Query:  LPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYP
        LPNAFWDRKKHMV LPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEIT+LLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYP
Subjt:  LPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYP

Query:  IPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDDVL
        IPNRQDLLKRITLAKVFSKFDMKSGFWQIQIH SDRYKTAF VPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDDVL
Subjt:  IPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDDVL

A0A5A7VRE0 Reverse transcriptase0.0e+0092.86Show/hide
Query:  MICVNLCTENKHTTKVIKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSSRKFFRKNKDQESPPRRRRHYYKGKGKKKYSSKTNTICFKCNQKGHYAN
        MICVNLCTENKHTTKVIKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSSRKFFRKNKDQESPPRRRRHYYKGKGKKKYSSKTNTICFKCNQKGHYAN
Subjt:  MICVNLCTENKHTTKVIKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSSRKFFRKNKDQESPPRRRRHYYKGKGKKKYSSKTNTICFKCNQKGHYAN

Query:  RCPLKDKINALTVDEETKQSLLYAIRTDDETSSQTESSSEEDCINILQEEESSFEEEFYSQSNSSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDLIE
        RCPLKDKINALTVDEETKQSLLYAIRTDDETSSQTESSSEEDCINILQEEESSFEEEFYSQSNSSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDLIE
Subjt:  RCPLKDKINALTVDEETKQSLLYAIRTDDETSSQTESSSEEDCINILQEEESSFEEEFYSQSNSSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDLIE

Query:  QIPDEGAKRTCLLKLKQSLEEQAPQKTIQNPIMYSYQDILNRVKGEAKVPIQVEDLHHEVKILKKEVAENKQRLIYLETAFQAFQKSQALKEESTSDFKR
        QIPDEGAKRTCLLKLKQSLEEQAPQKTIQNPIMYSYQDILNRVKGEAKVPIQVEDLHHEVKILKKEVAENKQRLIYLETAFQAFQKSQALKEESTSDFKR
Subjt:  QIPDEGAKRTCLLKLKQSLEEQAPQKTIQNPIMYSYQDILNRVKGEAKVPIQVEDLHHEVKILKKEVAENKQRLIYLETAFQAFQKSQALKEESTSDFKR

Query:  KSAGKALLIEEIGTINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGAGGNPLNIQFKLS--------------
        KSAGKALLIEEIGTINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGAGGNPLNIQFKLS              
Subjt:  KSAGKALLIEEIGTINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGAGGNPLNIQFKLS--------------

Query:  ----KNLNEGIILGTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLKIQ
            KNLNEGIILGTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLKIQ
Subjt:  ----KNLNEGIILGTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLKIQ

Query:  NFQKQLEKEVCSTLPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVINY
        NFQKQLEKEVCSTLPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVK                                AEKERGVPRLVINY
Subjt:  NFQKQLEKEVCSTLPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVINY

Query:  KPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDDVL
        KPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDDVL
Subjt:  KPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDDVL

A0A5D3BEY3 Enzymatic polyprotein0.0e+0092.31Show/hide
Query:  MICVNLCTENKHTTKVIKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSSRKFFRKN--KDQESPPRRRRHYYKGKGKKKYSSKTNTICFKCNQKGHY
        MICVNLCTENKHTTKVIKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSS+KFFRK+  KDQESP RR+RHY KGK KK YSSKT+TICFKCNQKGHY
Subjt:  MICVNLCTENKHTTKVIKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSSRKFFRKN--KDQESPPRRRRHYYKGKGKKKYSSKTNTICFKCNQKGHY

Query:  ANRCPLKDKINALTVDEETKQSLLYAIRTDDETSSQTESSSEEDCINILQEEESSFEEEFYSQSNSSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDL
        ANRCPLKDKINA+T+DEETKQSLLYAIR+DD+T+SQTESSSEED INILQEE SS EEEFYSQS+SSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDL
Subjt:  ANRCPLKDKINALTVDEETKQSLLYAIRTDDETSSQTESSSEEDCINILQEEESSFEEEFYSQSNSSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDL

Query:  IEQIPDEGAKRTCLLKLKQSLEEQAPQKTIQNPIMYSYQDILNRVKGEAKVPIQVEDLHHEVKILKKEVAENKQRLIYLETAFQAFQKSQALKEESTSDF
        IEQIPDE AKRTCLLKLKQSLEEQAPQK IQNPIMYSYQDILNRVKGEAK+PIQVEDLHHEVK LK+EVAENKQRLIYLETAFQAFQ SQA KEESTSDF
Subjt:  IEQIPDEGAKRTCLLKLKQSLEEQAPQKTIQNPIMYSYQDILNRVKGEAKVPIQVEDLHHEVKILKKEVAENKQRLIYLETAFQAFQKSQALKEESTSDF

Query:  KRKSAGKALLIEEIGTINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGAGGNPLNIQFKLS------------
        +RKSAGKALLIEEIGTINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGA GNPLNIQFKLS            
Subjt:  KRKSAGKALLIEEIGTINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGAGGNPLNIQFKLS------------

Query:  ------KNLNEGIILGTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLK
              KNLNEGIILGTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIA KEKHIEFLQDDIKAGKVAIEIQKP VQ K
Subjt:  ------KNLNEGIILGTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLK

Query:  IQNFQKQLEKEVCSTLPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVI
        IQNFQ+QLEKEVCSTLPNAFWDRKKHMV LPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEIT+LLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVI
Subjt:  IQNFQKQLEKEVCSTLPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVI

Query:  NYKPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDD
        NYKPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIH SDRYKTAF VPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDD
Subjt:  NYKPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDD

Query:  VL
        VL
Subjt:  VL

A0A5D3DBS1 Reverse transcriptase0.0e+0097.43Show/hide
Query:  MICVNLCTENKHTTKVIKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSSRKFFRKNKDQESPPRRRRHYYKGKGKKKYSSKTNTICFKCNQKGHYAN
        MICVNLCTENKHTTKVIKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSSRKFFRKNKDQESPPRRRRHYYKGKGKKKYSSKTNTICFKCNQKGHYAN
Subjt:  MICVNLCTENKHTTKVIKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSSRKFFRKNKDQESPPRRRRHYYKGKGKKKYSSKTNTICFKCNQKGHYAN

Query:  RCPLKDKINALTVDEETKQSLLYAIRTDDETSSQTESSSEEDCINILQEEESSFEEEFYSQSNSSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDLIE
        RCPLKDKINALTVDEETKQSLLYAIRTDDETSSQTESSSEEDCINILQEEESSFEEEFYSQSNSSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDLIE
Subjt:  RCPLKDKINALTVDEETKQSLLYAIRTDDETSSQTESSSEEDCINILQEEESSFEEEFYSQSNSSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDLIE

Query:  QIPDEGAKRTCLLKLKQSLEEQAPQKTIQNPIMYSYQDILNRVKGEAKVPIQVEDLHHEVKILKKEVAENKQRLIYLETAFQAFQKSQALKEESTSDFKR
        QIPDEGAKRTCLLKLKQSLEEQAPQKTIQNPIMYSYQDILNRVKGEAKVPIQVEDLHHEVKILKKEVAENKQRLIYLETAFQAFQKSQALKEESTSDFKR
Subjt:  QIPDEGAKRTCLLKLKQSLEEQAPQKTIQNPIMYSYQDILNRVKGEAKVPIQVEDLHHEVKILKKEVAENKQRLIYLETAFQAFQKSQALKEESTSDFKR

Query:  KSAGKALLIEEIGTINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGAGGNPLNIQFKLS--------------
        KSAGKALLIEEIGTINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGAGGNPLNIQFKLS              
Subjt:  KSAGKALLIEEIGTINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGAGGNPLNIQFKLS--------------

Query:  ----KNLNEGIILGTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLKIQ
            KNLNEGIILGTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLKIQ
Subjt:  ----KNLNEGIILGTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLKIQ

Query:  NFQKQLEKEVCSTLPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVINY
        NFQKQLEKEVCSTLPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVINY
Subjt:  NFQKQLEKEVCSTLPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVINY

Query:  KPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDDVL
        KPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDDVL
Subjt:  KPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDDVL

SwissProt top hitse value%identityAlignment
P03554 Enzymatic polyprotein1.0e-3537.45Show/hide
Query:  EEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLKIQNFQKQLEKEVCSTLPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCK
        E V    N+I    + I  L +    G+   E +    Q ++Q  ++ LEK VCS  P    D  K    +       +     K +P++ +    +   
Subjt:  EEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLKIQNFQKQLEKEVCSTLPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCK

Query:  DEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVP
         +I +LL+  +I PSKSP    AF VNN+AEK RG  R+V+NYK +NK      Y +PN+ +LL  I   K+FS FD KSGFWQ+ +    R  TAF  P
Subjt:  DEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVP

Query:  FGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDDVL
         G ++WNV+PFGLK APS FQ+ M++ F  +++F  VY+DD+L
Subjt:  FGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDDVL

P03556 Enzymatic polyprotein1.6e-3635.1Show/hide
Query:  ILGTPFLTQLYPF-HVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVR----QFINRIARKEKHIEFLQDDIKAGKVAI-------EIQKPFV-QLK
        I+G  F     PF   TD+ I    F K+ T+       P +I+ +   VR     F+  + ++ K  +    +I   K+AI         +K F+ Q +
Subjt:  ILGTPFLTQLYPF-HVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVR----QFINRIARKEKHIEFLQDDIKAGKVAI-------EIQKPFV-QLK

Query:  IQNFQKQLEKEVCSTLPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVI
        +Q  ++ LEK VCS  P    D  K    +       +     K +P++ +    +    +I +LL+  +I PSKSP    AF VNN+AEK RG  R+V+
Subjt:  IQNFQKQLEKEVCSTLPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVI

Query:  NYKPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDD
        NYK +NK      Y  PN+ +LL  I   K+FS FD KSGFWQ+ +    R  TAF  P G ++WNV+PFGLK APS FQ+ M++ F  +++F  VY+DD
Subjt:  NYKPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDD

Query:  VL
        +L
Subjt:  VL

Q02964 Enzymatic polyprotein4.6e-3637.86Show/hide
Query:  EEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLKIQNFQKQLEKEVCSTLPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCK
        E V    N+I    K I  L +    G+   E +    Q ++Q  ++ LEK VCS  P    D  K    +       +     K +P++ +    +   
Subjt:  EEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLKIQNFQKQLEKEVCSTLPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCK

Query:  DEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVP
         +I +LL+  +I PSKSP    AF VNN+AEK RG  R+V+NYK +NK      Y +PN+ +LL  I   K+FS FD KSGFWQ+ +    R  TAF  P
Subjt:  DEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVP

Query:  FGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDDVL
         G ++WNV+PFGLK APS FQ+ M++ F  +++F  VY+DD+L
Subjt:  FGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDDVL

Q6XKE6 Genome polyprotein1.6e-4132.91Show/hide
Query:  IEEIGTINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGAGGNPLNIQFKLSKNLNEGIILGT-----PFLTQL
        + +I TI  I + + + + SK      D  +  +A  D+GA  +++   ++PS Y+                  F+     ++GI+  T     P   + 
Subjt:  IEEIGTINSISRIQNQKWMSKIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGAGGNPLNIQFKLSKNLNEGIILGT-----PFLTQL

Query:  YP-FHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLKIQNFQKQLE----KEVC-----
        +P F  T K + S    K++   F             +  RQ  N++      I +        K   EI + F QL   N  +QLE     ++C     
Subjt:  YP-FHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEEEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLKIQNFQKQLE----KEVC-----

Query:  ---STLPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVINYKPLNKVLK
           S   +  W  +   + LP++    E   PTKA    MN + +++   E  +L    LI PS S W+C AFYVN ++E+ RG  RLVINY+PLN  L+
Subjt:  ---STLPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVINYKPLNKVLK

Query:  WIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDDVL
          ++PIPN+  L   ++ AK+FSKFD+KSGFWQ+ IH ++R KT F +P   FQW VMPFGLK APS FQK M  IF       +VYIDD+L
Subjt:  WIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDDVL

Q91DM0 Genome polyprotein2.8e-4133.6Show/hide
Query:  KIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGAGGNPLNIQFKLSKNLNEGIILGT-----PFLTQLYP-FHVTDKGIVSKKFDKEI
        KI     D  +  +A  D+GA  +++   ++PS Y+                  F+     ++GI+  T     P   + +P F  T K + S    K++
Subjt:  KIVFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGAGGNPLNIQFKLSKNLNEGIILGT-----PFLTQLYP-FHVTDKGIVSKKFDKEI

Query:  TFEFTHPVTPKYISNIEEEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLKIQNFQKQLE----KEVC--------STLPNAFWDRKKHMVIL
           F             +  RQ  N++      I +        K   EI + F QL   N  +QLE     ++C        S   +  W  +   + L
Subjt:  TFEFTHPVTPKYISNIEEEVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLKIQNFQKQLE----KEVC--------STLPNAFWDRKKHMVIL

Query:  PYEDGFKEAQIPTKARPIQMNKDLVKVCKDEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNRQDLLKRITLAK
        P++    E   PTKA    MN + +++   E  +L    LI PS S W+C AFYVN ++E+ RG  RLVINY+PLN  L+  ++PIPN+  L   ++ AK
Subjt:  PYEDGFKEAQIPTKARPIQMNKDLVKVCKDEITDLLNKGLISPSKSPWSCSAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNRQDLLKRITLAK

Query:  VFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDDVL
        +FSKFD+KSGFWQ+ IH ++R KT F +P   FQW VMPFGLK APS FQK M  IF       +VYIDD+L
Subjt:  VFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQKIMNDIFNQYQEFTIVYIDDVL

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATATGTGTTAATCTCTGCACGGAGAATAAACACACAACGAAGGTGATCAAGGATTCTGACTACAGAAAAGAGTTAGGCACCTTCTGCAAGCAGTATGGCCTGTCTCA
AGGGCCAAAGGAAGAGAAGAAGAAGAAAAAGAAAAGGTACTCTTCCAGAAAATTCTTTAGGAAAAACAAAGACCAAGAGTCTCCTCCAAGAAGAAGACGTCATTACTACA
AGGGCAAAGGCAAGAAAAAATACTCCTCAAAGACCAACACGATATGCTTCAAGTGCAATCAGAAAGGTCACTATGCAAATCGTTGTCCCCTAAAGGATAAAATCAATGCC
CTGACAGTGGATGAAGAGACAAAGCAATCCCTTCTTTATGCCATAAGAACGGATGACGAGACCTCTTCTCAAACGGAATCTTCATCGGAAGAAGACTGCATCAACATTCT
ACAAGAAGAAGAATCCTCTTTTGAAGAAGAATTCTACTCTCAAAGCAACTCAAGCGATGACGAAGGAGCAATACCTTGCACAGGACGATGTGCTGGCAAGTGTTCTGGCC
ACATCAACGTCATCACAAAGGATCAGGAAACTCTTTTTGATCTCATCGAGCAGATCCCAGATGAAGGAGCAAAGAGAACCTGTCTTCTCAAATTAAAACAGAGTCTAGAG
GAACAAGCACCTCAGAAGACGATTCAAAATCCGATTATGTATTCGTATCAAGACATACTGAATCGGGTAAAAGGCGAAGCTAAAGTGCCTATTCAAGTTGAAGATCTTCA
TCATGAAGTAAAGATTCTCAAAAAGGAAGTTGCTGAAAACAAACAACGACTCATCTACCTTGAAACTGCATTTCAAGCCTTTCAAAAATCTCAAGCTTTGAAAGAAGAAT
CAACAAGCGACTTTAAAAGAAAAAGCGCGGGAAAAGCACTATTGATTGAAGAAATAGGCACAATCAATAGTATTAGCAGAATCCAAAATCAAAAGTGGATGTCGAAGATA
GTCTTTAAAGTCAAAGACTTCCAGCTTGAAGCTCTAGCTCTCATTGATTCAGGAGCCGATCAAAACGTTATACAAGAAGGACTTGTCCCTTCTAGGTATTTCGAAAAAAC
AAAGGAGTCCCTATCCGGAGCTGGCGGGAATCCCCTCAACATTCAATTCAAATTGTCAAAGAACCTTAATGAAGGAATCATCCTAGGAACTCCATTCCTAACCCAATTAT
ATCCGTTTCATGTTACTGATAAGGGTATAGTCTCAAAAAAGTTTGACAAAGAAATTACTTTTGAGTTCACTCATCCAGTAACTCCTAAGTACATTTCAAATATTGAGGAA
GAAGTTCGTCAATTCATCAACAGGATCGCCAGGAAAGAAAAGCATATTGAATTCCTTCAAGACGACATCAAAGCAGGTAAAGTGGCTATTGAAATCCAGAAGCCATTCGT
TCAATTGAAGATCCAAAACTTCCAAAAGCAGCTCGAAAAGGAAGTTTGTTCAACTCTCCCAAATGCCTTTTGGGATCGAAAGAAGCATATGGTAATATTGCCATATGAGG
ACGGATTCAAAGAGGCCCAAATCCCTACAAAGGCCCGGCCTATTCAGATGAATAAAGATCTGGTCAAAGTCTGTAAGGACGAAATCACGGATCTTCTCAATAAAGGCCTA
ATAAGCCCATCAAAAAGCCCATGGTCGTGTTCGGCCTTTTACGTCAACAACCAGGCCGAAAAGGAACGTGGCGTCCCAAGGCTTGTCATCAATTACAAGCCTCTCAACAA
GGTTCTCAAATGGATTAGGTACCCAATTCCTAATCGCCAAGACTTGCTAAAGAGAATTACTTTAGCAAAGGTCTTCTCAAAGTTCGACATGAAATCTGGATTTTGGCAAA
TCCAAATCCACTCATCTGACCGGTACAAGACGGCCTTCAAGGTTCCATTCGGACAGTTTCAATGGAATGTCATGCCATTCGGATTGAAGAACGCTCCATCCGAATTCCAG
AAAATAATGAACGATATCTTCAACCAATATCAAGAGTTCACAATAGTCTATATTGATGATGTCTTGGGGATGTTTGTGGGAAGAAAATTCTCTCACCCTTTCAAACCACC
CTCATACAATTACCGTGACTATACAAAGGCATGGTACATTGTATTTTGGCTCCAAGCTTACAACCATTCTTGGTTTGTGACATTTTGCAAGCAAGCTTACAAAATGCACT
TCCCAAACTGGTTTCAAAACTGGTGGATGTATTTTGGACTCTCCGAAGAAATATTTCCGGTAGAAGTTCAACGTTCGTACCACCTTTTCCAACAAAGTATCTATTCGTCA
CCTCTTTCAAAGACGTTTAGGTTTGCTTTGTATTTCCAAATTCCCTGGATCTTTTGCTGGAATTTCCAACTAGGACCTTCAGGAAATTTCAAAGCCCTGAGTAAATCCCT
CAGGATCAAATGGTGGGAAAAGTTCAATTACTCCCATCTTGAAGTCAACAAGATCAAGGATTGGTTTAAAGCCAATATCCATCTCCAAGATATGACTAGGCAAGAGGACG
AAGCCTTCCTCCTCACCAAAAACGCAGTCATGTCAACCCTTGCCGGAGCAAGCACTCAACAGGAATTCAATTCTGTTGTCAACAACGTCGTTGTAAACCTCTCAGATGAT
AACGATGATTTACCAGAGGACGTTTCACCAGCCTCCGTCAATAATGTTGATGACCTTGACTATGACCCATACGAAGGCCTCGACATCAACGACCCATTTTTGGACACTCA
ACCTTATTAG
mRNA sequenceShow/hide mRNA sequence
ATGATATGTGTTAATCTCTGCACGGAGAATAAACACACAACGAAGGTGATCAAGGATTCTGACTACAGAAAAGAGTTAGGCACCTTCTGCAAGCAGTATGGCCTGTCTCA
AGGGCCAAAGGAAGAGAAGAAGAAGAAAAAGAAAAGGTACTCTTCCAGAAAATTCTTTAGGAAAAACAAAGACCAAGAGTCTCCTCCAAGAAGAAGACGTCATTACTACA
AGGGCAAAGGCAAGAAAAAATACTCCTCAAAGACCAACACGATATGCTTCAAGTGCAATCAGAAAGGTCACTATGCAAATCGTTGTCCCCTAAAGGATAAAATCAATGCC
CTGACAGTGGATGAAGAGACAAAGCAATCCCTTCTTTATGCCATAAGAACGGATGACGAGACCTCTTCTCAAACGGAATCTTCATCGGAAGAAGACTGCATCAACATTCT
ACAAGAAGAAGAATCCTCTTTTGAAGAAGAATTCTACTCTCAAAGCAACTCAAGCGATGACGAAGGAGCAATACCTTGCACAGGACGATGTGCTGGCAAGTGTTCTGGCC
ACATCAACGTCATCACAAAGGATCAGGAAACTCTTTTTGATCTCATCGAGCAGATCCCAGATGAAGGAGCAAAGAGAACCTGTCTTCTCAAATTAAAACAGAGTCTAGAG
GAACAAGCACCTCAGAAGACGATTCAAAATCCGATTATGTATTCGTATCAAGACATACTGAATCGGGTAAAAGGCGAAGCTAAAGTGCCTATTCAAGTTGAAGATCTTCA
TCATGAAGTAAAGATTCTCAAAAAGGAAGTTGCTGAAAACAAACAACGACTCATCTACCTTGAAACTGCATTTCAAGCCTTTCAAAAATCTCAAGCTTTGAAAGAAGAAT
CAACAAGCGACTTTAAAAGAAAAAGCGCGGGAAAAGCACTATTGATTGAAGAAATAGGCACAATCAATAGTATTAGCAGAATCCAAAATCAAAAGTGGATGTCGAAGATA
GTCTTTAAAGTCAAAGACTTCCAGCTTGAAGCTCTAGCTCTCATTGATTCAGGAGCCGATCAAAACGTTATACAAGAAGGACTTGTCCCTTCTAGGTATTTCGAAAAAAC
AAAGGAGTCCCTATCCGGAGCTGGCGGGAATCCCCTCAACATTCAATTCAAATTGTCAAAGAACCTTAATGAAGGAATCATCCTAGGAACTCCATTCCTAACCCAATTAT
ATCCGTTTCATGTTACTGATAAGGGTATAGTCTCAAAAAAGTTTGACAAAGAAATTACTTTTGAGTTCACTCATCCAGTAACTCCTAAGTACATTTCAAATATTGAGGAA
GAAGTTCGTCAATTCATCAACAGGATCGCCAGGAAAGAAAAGCATATTGAATTCCTTCAAGACGACATCAAAGCAGGTAAAGTGGCTATTGAAATCCAGAAGCCATTCGT
TCAATTGAAGATCCAAAACTTCCAAAAGCAGCTCGAAAAGGAAGTTTGTTCAACTCTCCCAAATGCCTTTTGGGATCGAAAGAAGCATATGGTAATATTGCCATATGAGG
ACGGATTCAAAGAGGCCCAAATCCCTACAAAGGCCCGGCCTATTCAGATGAATAAAGATCTGGTCAAAGTCTGTAAGGACGAAATCACGGATCTTCTCAATAAAGGCCTA
ATAAGCCCATCAAAAAGCCCATGGTCGTGTTCGGCCTTTTACGTCAACAACCAGGCCGAAAAGGAACGTGGCGTCCCAAGGCTTGTCATCAATTACAAGCCTCTCAACAA
GGTTCTCAAATGGATTAGGTACCCAATTCCTAATCGCCAAGACTTGCTAAAGAGAATTACTTTAGCAAAGGTCTTCTCAAAGTTCGACATGAAATCTGGATTTTGGCAAA
TCCAAATCCACTCATCTGACCGGTACAAGACGGCCTTCAAGGTTCCATTCGGACAGTTTCAATGGAATGTCATGCCATTCGGATTGAAGAACGCTCCATCCGAATTCCAG
AAAATAATGAACGATATCTTCAACCAATATCAAGAGTTCACAATAGTCTATATTGATGATGTCTTGGGGATGTTTGTGGGAAGAAAATTCTCTCACCCTTTCAAACCACC
CTCATACAATTACCGTGACTATACAAAGGCATGGTACATTGTATTTTGGCTCCAAGCTTACAACCATTCTTGGTTTGTGACATTTTGCAAGCAAGCTTACAAAATGCACT
TCCCAAACTGGTTTCAAAACTGGTGGATGTATTTTGGACTCTCCGAAGAAATATTTCCGGTAGAAGTTCAACGTTCGTACCACCTTTTCCAACAAAGTATCTATTCGTCA
CCTCTTTCAAAGACGTTTAGGTTTGCTTTGTATTTCCAAATTCCCTGGATCTTTTGCTGGAATTTCCAACTAGGACCTTCAGGAAATTTCAAAGCCCTGAGTAAATCCCT
CAGGATCAAATGGTGGGAAAAGTTCAATTACTCCCATCTTGAAGTCAACAAGATCAAGGATTGGTTTAAAGCCAATATCCATCTCCAAGATATGACTAGGCAAGAGGACG
AAGCCTTCCTCCTCACCAAAAACGCAGTCATGTCAACCCTTGCCGGAGCAAGCACTCAACAGGAATTCAATTCTGTTGTCAACAACGTCGTTGTAAACCTCTCAGATGAT
AACGATGATTTACCAGAGGACGTTTCACCAGCCTCCGTCAATAATGTTGATGACCTTGACTATGACCCATACGAAGGCCTCGACATCAACGACCCATTTTTGGACACTCA
ACCTTATTAG
Protein sequenceShow/hide protein sequence
MICVNLCTENKHTTKVIKDSDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSSRKFFRKNKDQESPPRRRRHYYKGKGKKKYSSKTNTICFKCNQKGHYANRCPLKDKINA
LTVDEETKQSLLYAIRTDDETSSQTESSSEEDCINILQEEESSFEEEFYSQSNSSDDEGAIPCTGRCAGKCSGHINVITKDQETLFDLIEQIPDEGAKRTCLLKLKQSLE
EQAPQKTIQNPIMYSYQDILNRVKGEAKVPIQVEDLHHEVKILKKEVAENKQRLIYLETAFQAFQKSQALKEESTSDFKRKSAGKALLIEEIGTINSISRIQNQKWMSKI
VFKVKDFQLEALALIDSGADQNVIQEGLVPSRYFEKTKESLSGAGGNPLNIQFKLSKNLNEGIILGTPFLTQLYPFHVTDKGIVSKKFDKEITFEFTHPVTPKYISNIEE
EVRQFINRIARKEKHIEFLQDDIKAGKVAIEIQKPFVQLKIQNFQKQLEKEVCSTLPNAFWDRKKHMVILPYEDGFKEAQIPTKARPIQMNKDLVKVCKDEITDLLNKGL
ISPSKSPWSCSAFYVNNQAEKERGVPRLVINYKPLNKVLKWIRYPIPNRQDLLKRITLAKVFSKFDMKSGFWQIQIHSSDRYKTAFKVPFGQFQWNVMPFGLKNAPSEFQ
KIMNDIFNQYQEFTIVYIDDVLGMFVGRKFSHPFKPPSYNYRDYTKAWYIVFWLQAYNHSWFVTFCKQAYKMHFPNWFQNWWMYFGLSEEIFPVEVQRSYHLFQQSIYSS
PLSKTFRFALYFQIPWIFCWNFQLGPSGNFKALSKSLRIKWWEKFNYSHLEVNKIKDWFKANIHLQDMTRQEDEAFLLTKNAVMSTLAGASTQQEFNSVVNNVVVNLSDD
NDDLPEDVSPASVNNVDDLDYDPYEGLDINDPFLDTQPY