CuGenDBv2

IVF0003022 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0003022
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: Gag/pol protein
Genome location: chr01:25085658..25092671
RNA-Seq Expression: IVF0003022
Synteny: IVF0003022
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR001878 - Zinc finger, CCHC-type
  IPR012337 - Ribonuclease H-like superfamily
  IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
  IPR036397 - Ribonuclease H superfamily
  IPR036875 - Zinc finger, CCHC-type superfamily
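
The genome location in this record is written as `chrom:start..end`. As a minimal sketch, the string can be split into its parts and the span length computed as below. The names `Location` and `parse_location` are hypothetical helpers introduced here for illustration, and the length calculation assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval parsed from a 'chrom:start..end' string."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location string such as 'chr01:25085658..25092671'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("chr01:25085658..25092671")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 7014 bp on chr01.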


Homology
GenBank top hits (e value, % identity, alignment)
KAA0026233.1 gag/pol protein [Cucumis melo var. makuwa]; e value 0.0; identity 74.35%
Query:  QVPTANATRTVREAYERWAKANEKARAYILASLSEVLTKKHESMLTAREIMDSLQEI-------------------------------------------
        QVP ANATRTVRE YERWAKANEKARAYILASLSEVL KKHESMLTAREIMDSLQE+                                           
Subjt:  QVPTANATRTVREAYERWAKANEKARAYILASLSEVLTKKHESMLTAREIMDSLQEI-------------------------------------------

Query:  ------------------------------------------------------------------------GSTSGTKSVPSSSGNKKWKKKKGGQGNK
                                                                                GSTSGTKS+PSSSGNKKWKKKKGGQGNK
Subjt:  ------------------------------------------------------------------------GSTSGTKSVPSSSGNKKWKKKKGGQGNK

Query:  ANPATAKTRKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQEWASKRVRRNSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARG
        AN A AKT KKAKAAKGICF CNQEGHWKRNCPKYLA+KKKAKQ                    GKMTKRPFTGKGH +KEPLELVHSDLCGPMN KARG
Subjt:  ANPATAKTRKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQEWASKRVRRNSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARG

Query:  GFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWS
        GFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVEN LSKTIKTFRSDRGGEYMDLKFQNYLMEC IVSQLSAP TPQQNGV ERRNRTLLDMV S
Subjt:  GFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWS

Query:  MMSYAHLPNSFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTN
        MMSYAHLPNSF GYAVQ  VYILNCVPSKS+SE PLKLW GHKGSLRHFRIWGC AHVLE NPKKLEPRSKLCLFVGYPKGTR GYFY  KDNKVFVSTN
Subjt:  MMSYAHLPNSFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTN

Query:  ATFLEEDHIREYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAM
        ATFLEEDHIRE+KPRSKIVLNELS E+TE STRVVEEPS L RVVHV SS R  QP+SL EPRRSGRVTNLPI YMSLTETLTVISDG+I DPLTFKKAM
Subjt:  ATFLEEDHIREYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAM

Query:  EDVDKDEWIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEI
        EDVDKDEWIKAMNLELESMYFN VWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYE+ FS VAMLKSIRILLSIAAYFDYEI
Subjt:  EDVDKDEWIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEI

Query:  WQMYVKTTILNDNLEETIYMQQPEGFIIP
        WQM VKT  LN NLEETIYMQQPEGFIIP
Subjt:  WQMYVKTTILNDNLEETIYMQQPEGFIIP

KAA0034863.1 gag/pol protein [Cucumis melo var. makuwa]; e value 0.0; identity 86.89%
Query:  SRVKEYLQVPTANATRTVREAYERWAKANEKARAYILASLSEVLTKKHESMLTA---RE---------------------------IMDSLQEI------
        S VKEYLQVPTANATRTVREAYERWAKANEKARAYILASLSEVLTKKHESMLTA   RE                           I++SL E       
Subjt:  SRVKEYLQVPTANATRTVREAYERWAKANEKARAYILASLSEVLTKKHESMLTA---RE---------------------------IMDSLQEI------

Query:  ---------GSTSGTKSVPSSSGNKKWKKKKGGQGNKANPATAKTRKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQEWASKRVR-----------
                 GSTSGTKSVPSSSGNKKWKKKKGGQGNKANPATAKTRKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQ   +KR+            
Subjt:  ---------GSTSGTKSVPSSSGNKKWKKKKGGQGNKANPATAKTRKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQEWASKRVR-----------

Query:  ------------------------RNSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEAL
                                 NSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEAL
Subjt:  ------------------------RNSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEAL

Query:  EKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPS
        EKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPS
Subjt:  EKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPS

Query:  KSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHIREYKPRSKIVLNELSNEST
        KSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHIREYKPRSKIVLNELSNEST
Subjt:  KSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHIREYKPRSKIVLNELSNEST

Query:  ELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLV
        ELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLV
Subjt:  ELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLV

Query:  DQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMYVKTTILNDNLEETIYMQQPEGFII
        DQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMYVKTTILNDNLEETIYMQQPEGFII
Subjt:  DQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMYVKTTILNDNLEETIYMQQPEGFII

Query:  PC
        PC
Subjt:  PC

KAA0062993.1 gag/pol protein [Cucumis melo var. makuwa]; e value 0.0; identity 61.18%
Query:  VKEYLQVPTANATRTVREAYERWAKANEKARAYILASLSEVLTKKHESMLTAREIMDSLQEI--------------------------------------
        VKE  QVP ANATRTVRE YERWAKANEKARAYILASLSEVL KKHESMLTAREIMDSLQE+                                      
Subjt:  VKEYLQVPTANATRTVREAYERWAKANEKARAYILASLSEVLTKKHESMLTAREIMDSLQEI--------------------------------------

Query:  -----------------------------------------------------------------------------GSTSGTKSVPSSSGNKKWKKKKG
                                                                                     GSTSGTKS+PSSSGNKKWKKKKG
Subjt:  -----------------------------------------------------------------------------GSTSGTKSVPSSSGNKKWKKKKG

Query:  GQGNKANPATAKTRKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQ--------------------------------------EWA----------
        GQGNKAN A AKT KKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQ                                       W           
Subjt:  GQGNKANPATAKTRKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQ--------------------------------------EWA----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  SKRVR-----------------------------------RNSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDY
        +KR++                                    NSLPVCESCLEGKMTKRPFTGKGH +KEPLELVHSDLCGPMN KARGGFEYFITFTDDY
Subjt:  SKRVR-----------------------------------RNSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDY

Query:  SRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFR
        SRYGYVYLMQHKSEALEKFKEYKAEVEN LSKTIKTFRSDRGGEYMDLKFQNYLMEC IVSQLSAPGTPQQNGV ERRNRTLLDMV SMMSYAHLPNSF 
Subjt:  SRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFR

Query:  GYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHIREY
        GYAVQ  VYILNCVPSKS+SE PLKLW G KGSLRHFRIWGC AHVLE NPKKLEPRSKLCLFVGYPKGTR GYFY  KDNKVFVSTNATFLEEDHIRE+
Subjt:  GYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHIREY

Query:  KPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAM
        KPRSKIVLNELS E+TE STRVVEEPS L RVVHV SS R  QP+SL EPRRSGRVTNLPI YMSLTETLTVISDG+I DPLTFKKAMEDVDKDEWIKAM
Subjt:  KPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAM

Query:  NLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMYVKTTILND
        NLELESMYFN VWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYE+ FS VAMLKSIRILLSIAAYFDYEIWQM VKT  LN 
Subjt:  NLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMYVKTTILND

Query:  NLEETIYMQQPEGFIIP
        NLEETIYMQQPEGFIIP
Subjt:  NLEETIYMQQPEGFIIP

TYJ96675.1 gag/pol protein [Cucumis melo var. makuwa]; e value 0.0; identity 86.89%
Query:  SRVKEYLQVPTANATRTVREAYERWAKANEKARAYILASLSEVLTKKHESMLTA---RE---------------------------IMDSLQEI------
        S VKEYLQVPTANATRTVREAYERWAKANEKARAYILASLSEVLTKKHESMLTA   RE                           I++SL E       
Subjt:  SRVKEYLQVPTANATRTVREAYERWAKANEKARAYILASLSEVLTKKHESMLTA---RE---------------------------IMDSLQEI------

Query:  ---------GSTSGTKSVPSSSGNKKWKKKKGGQGNKANPATAKTRKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQEWASKRVR-----------
                 GSTSGTKSVPSSSGNKKWKKKKGGQGNKANPATAKTRKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQ   +KR+            
Subjt:  ---------GSTSGTKSVPSSSGNKKWKKKKGGQGNKANPATAKTRKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQEWASKRVR-----------

Query:  ------------------------RNSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEAL
                                 NSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEAL
Subjt:  ------------------------RNSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEAL

Query:  EKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPS
        EKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPS
Subjt:  EKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPS

Query:  KSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHIREYKPRSKIVLNELSNEST
        KSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHIREYKPRSKIVLNELSNEST
Subjt:  KSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHIREYKPRSKIVLNELSNEST

Query:  ELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLV
        ELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLV
Subjt:  ELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLV

Query:  DQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMYVKTTILNDNLEETIYMQQPEGFII
        DQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMYVKTTILNDNLEETIYMQQPEGFII
Subjt:  DQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMYVKTTILNDNLEETIYMQQPEGFII

Query:  PC
        PC
Subjt:  PC

TYJ97618.1 gag/pol protein [Cucumis melo var. makuwa]; e value 0.0; identity 78.35%
Query:  VKEYLQVPTANATRTVREAYERWAKANEKARAYILASLSEVLTKKHESMLTAREIMDSLQEI--------------------------------------
        V+E  QVP ANAT+TVRE YERWAK NEK RAYILASLSEVL KKHESMLTAREIMDSLQE+                                      
Subjt:  VKEYLQVPTANATRTVREAYERWAKANEKARAYILASLSEVLTKKHESMLTAREIMDSLQEI--------------------------------------

Query:  --------------------------------GSTSGTKSVPSSSGNKKWKKKKGGQGNKANPATAKTRKKAKAAKGICFHCNQEGHWKRNCPKYLAEKK
                                        GSTSGTKS+PSSSGNKKWKKKKGGQGNKAN A AKT KK+KA KGICFH NQEGHWKRNCPKYLAEKK
Subjt:  --------------------------------GSTSGTKSVPSSSGNKKWKKKKGGQGNKANPATAKTRKKAKAAKGICFHCNQEGHWKRNCPKYLAEKK

Query:  KAKQEWAS-KRVRR------------NSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEA
        KAKQ   +  R+ R            NSLP+CESCLEGKMTKRPFTGKGH +KEPLELVHSDLCGPMN KARG FEYFITFTDDYSRYGYVYLMQHKSEA
Subjt:  KAKQEWAS-KRVRR------------NSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEA

Query:  LEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVP
        LEKFKEYKAEVEN LSKTIKTFRSDRGGEYMDLKFQNYLMECEI+SQLSAPGTPQQNGV ERRNRTLLDMV SM+SYAHLPNSF GYAVQ  VYILNCVP
Subjt:  LEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVP

Query:  SKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHIREYKPRSKIVLNELSNES
        SKS+SE PLKLW G KGSLRHFRIWGC AHVLE NPKKLEPRSKLCLFVGYPKGTR GYFY  KDNKVFVSTNATFLEEDHIRE+KPRSKIVLNELS E+
Subjt:  SKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHIREYKPRSKIVLNELSNES

Query:  TELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDL
        TE STRVVEEPS L RVVHV SS R  QP+SL EPRRSGRVTNLPI YMSLTETLTVISDG+I DPLTFKKAMEDVDKDEWIKAMNLELESMYFN VWDL
Subjt:  TELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDL

Query:  VDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMYVKTTILNDNLEETIYMQQPEGFI
        VDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYE+ FS VAMLKSIRILLSIAAYFDYEIWQM VKT  LN NLEETIYMQQPEGFI
Subjt:  VDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMYVKTTILNDNLEETIYMQQPEGFI

Query:  IP
        IP
Subjt:  IP

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SNP8 Gag/pol protein; e value 5.8e-294; identity 73.98%
Query:  VKEYLQVPTANATRTVREAYERWAKANEKARAYILASLSEVLTKKHESMLTAREIMDSLQEI--------------------------------------
        V++  QVP ANATRTVRE YERWAKANEKARAYILASLSEVL KKHESMLTAREIMDSLQE+                                      
Subjt:  VKEYLQVPTANATRTVREAYERWAKANEKARAYILASLSEVLTKKHESMLTAREIMDSLQEI--------------------------------------

Query:  -----------------------------------------------------------------------------GSTSGTKSVPSSSGNKKWKKKKG
                                                                                     GSTSGTKS+PSSSGNKKWKKKKG
Subjt:  -----------------------------------------------------------------------------GSTSGTKSVPSSSGNKKWKKKKG

Query:  GQGNKANPATAKTRKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQEWASKRVRRNSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMN
        GQGNKAN A AKT KKAKAAKGICF CNQEGHWKRNCPKYLA+KKKAKQ                    GKMTKRPFTGKGH +KEPLELVHSDLCGPMN
Subjt:  GQGNKANPATAKTRKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQEWASKRVRRNSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMN

Query:  AKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLL
         KARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVEN LSKTIKTFRSDRGGEYMDLKFQNYLMEC IVSQLSAP TPQQNGV ERRNRTLL
Subjt:  AKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLL

Query:  DMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKV
        DMV SMMSYAHLPNSF GYAVQ  VYILNCVPSKS+SE PLKLW GHKGSLRHFRIWGC AHVLE NPKKLEPRSKLCLFVGYPKGTR GYFY  KDNKV
Subjt:  DMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKV

Query:  FVSTNATFLEEDHIREYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLT
        FVSTNATFLEEDHIRE+KPRSKIVLNELS E+TE STRVVEEPS L RVVHV SS R  QP+SL EPRRSGRVTNLPI YMSLTETLTVISDG+I DPLT
Subjt:  FVSTNATFLEEDHIREYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLT

Query:  FKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAY
        FKKAMEDVDKDEWIKAMNLELESMYFN VWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYE+ FS VAMLKSIRILLSIAAY
Subjt:  FKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAY

Query:  FDYEIWQMYVKTTILNDNLEETIYMQQPEGFIIP
        FDYEIWQM VKT  LN NLEETIYMQQPEGFIIP
Subjt:  FDYEIWQMYVKTTILNDNLEETIYMQQPEGFIIP

A0A5A7SWF4 Gag/pol protein; e value 0.0e+00; identity 86.89%
Query:  SRVKEYLQVPTANATRTVREAYERWAKANEKARAYILASLSEVLTKKHESMLTA---RE---------------------------IMDSLQEI------
        S VKEYLQVPTANATRTVREAYERWAKANEKARAYILASLSEVLTKKHESMLTA   RE                           I++SL E       
Subjt:  SRVKEYLQVPTANATRTVREAYERWAKANEKARAYILASLSEVLTKKHESMLTA---RE---------------------------IMDSLQEI------

Query:  ---------GSTSGTKSVPSSSGNKKWKKKKGGQGNKANPATAKTRKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQEWASKR-------------
                 GSTSGTKSVPSSSGNKKWKKKKGGQGNKANPATAKTRKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQ   +KR             
Subjt:  ---------GSTSGTKSVPSSSGNKKWKKKKGGQGNKANPATAKTRKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQEWASKR-------------

Query:  ----------------------VRRNSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEAL
                              +  NSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEAL
Subjt:  ----------------------VRRNSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEAL

Query:  EKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPS
        EKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPS
Subjt:  EKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPS

Query:  KSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHIREYKPRSKIVLNELSNEST
        KSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHIREYKPRSKIVLNELSNEST
Subjt:  KSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHIREYKPRSKIVLNELSNEST

Query:  ELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLV
        ELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLV
Subjt:  ELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLV

Query:  DQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMYVKTTILNDNLEETIYMQQPEGFII
        DQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMYVKTTILNDNLEETIYMQQPEGFII
Subjt:  DQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMYVKTTILNDNLEETIYMQQPEGFII

Query:  PC
        PC
Subjt:  PC

A0A5A7V4M1 Gag/pol protein; e value 8.4e-285; identity 61.18%
Query:  VKEYLQVPTANATRTVREAYERWAKANEKARAYILASLSEVLTKKHESMLTAREIMDSLQEI--------------------------------------
        VKE  QVP ANATRTVRE YERWAKANEKARAYILASLSEVL KKHESMLTAREIMDSLQE+                                      
Subjt:  VKEYLQVPTANATRTVREAYERWAKANEKARAYILASLSEVLTKKHESMLTAREIMDSLQEI--------------------------------------

Query:  -----------------------------------------------------------------------------GSTSGTKSVPSSSGNKKWKKKKG
                                                                                     GSTSGTKS+PSSSGNKKWKKKKG
Subjt:  -----------------------------------------------------------------------------GSTSGTKSVPSSSGNKKWKKKKG

Query:  GQGNKANPATAKTRKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQ--------------------------------------EW-----------
        GQGNKAN A AKT KKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQ                                       W           
Subjt:  GQGNKANPATAKTRKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQ--------------------------------------EW-----------

Query:  ---------------------------------------------------------------------------------------------------A
                                                                                                            
Subjt:  ---------------------------------------------------------------------------------------------------A

Query:  SKRVR-----------------------------------RNSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDY
        +KR++                                    NSLPVCESCLEGKMTKRPFTGKGH +KEPLELVHSDLCGPMN KARGGFEYFITFTDDY
Subjt:  SKRVR-----------------------------------RNSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDY

Query:  SRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFR
        SRYGYVYLMQHKSEALEKFKEYKAEVEN LSKTIKTFRSDRGGEYMDLKFQNYLMEC IVSQLSAPGTPQQNGV ERRNRTLLDMV SMMSYAHLPNSF 
Subjt:  SRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFR

Query:  GYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHIREY
        GYAVQ  VYILNCVPSKS+SE PLKLW G KGSLRHFRIWGC AHVLE NPKKLEPRSKLCLFVGYPKGTR GYFY  KDNKVFVSTNATFLEEDHIRE+
Subjt:  GYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHIREY

Query:  KPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAM
        KPRSKIVLNELS E+TE STRVVEEPS L RVVHV SS R  QP+SL EPRRSGRVTNLPI YMSLTETLTVISDG+I DPLTFKKAMEDVDKDEWIKAM
Subjt:  KPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAM

Query:  NLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMYVKTTILND
        NLELESMYFN VWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYE+ FS VAMLKSIRILLSIAAYFDYEIWQM VKT  LN 
Subjt:  NLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMYVKTTILND

Query:  NLEETIYMQQPEGFIIP
        NLEETIYMQQPEGFIIP
Subjt:  NLEETIYMQQPEGFIIP

A0A5D3BDY3 Gag/pol protein; e value 0.0e+00; identity 86.89%
Query:  SRVKEYLQVPTANATRTVREAYERWAKANEKARAYILASLSEVLTKKHESMLTA---RE---------------------------IMDSLQEI------
        S VKEYLQVPTANATRTVREAYERWAKANEKARAYILASLSEVLTKKHESMLTA   RE                           I++SL E       
Subjt:  SRVKEYLQVPTANATRTVREAYERWAKANEKARAYILASLSEVLTKKHESMLTA---RE---------------------------IMDSLQEI------

Query:  ---------GSTSGTKSVPSSSGNKKWKKKKGGQGNKANPATAKTRKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQEWASKR-------------
                 GSTSGTKSVPSSSGNKKWKKKKGGQGNKANPATAKTRKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQ   +KR             
Subjt:  ---------GSTSGTKSVPSSSGNKKWKKKKGGQGNKANPATAKTRKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQEWASKR-------------

Query:  ----------------------VRRNSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEAL
                              +  NSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEAL
Subjt:  ----------------------VRRNSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEAL

Query:  EKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPS
        EKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPS
Subjt:  EKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPS

Query:  KSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHIREYKPRSKIVLNELSNEST
        KSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHIREYKPRSKIVLNELSNEST
Subjt:  KSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHIREYKPRSKIVLNELSNEST

Query:  ELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLV
        ELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLV
Subjt:  ELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLV

Query:  DQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMYVKTTILNDNLEETIYMQQPEGFII
        DQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMYVKTTILNDNLEETIYMQQPEGFII
Subjt:  DQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMYVKTTILNDNLEETIYMQQPEGFII

Query:  PC
        PC
Subjt:  PC

A0A5D3BHG7 Gag/pol protein; e value 2.8e-304; identity 78.35%
Query:  VKEYLQVPTANATRTVREAYERWAKANEKARAYILASLSEVLTKKHESMLTAREIMDSLQEI--------------------------------------
        V+E  QVP ANAT+TVRE YERWAK NEK RAYILASLSEVL KKHESMLTAREIMDSLQE+                                      
Subjt:  VKEYLQVPTANATRTVREAYERWAKANEKARAYILASLSEVLTKKHESMLTAREIMDSLQEI--------------------------------------

Query:  --------------------------------GSTSGTKSVPSSSGNKKWKKKKGGQGNKANPATAKTRKKAKAAKGICFHCNQEGHWKRNCPKYLAEKK
                                        GSTSGTKS+PSSSGNKKWKKKKGGQGNKAN A AKT KK+KA KGICFH NQEGHWKRNCPKYLAEKK
Subjt:  --------------------------------GSTSGTKSVPSSSGNKKWKKKKGGQGNKANPATAKTRKKAKAAKGICFHCNQEGHWKRNCPKYLAEKK

Query:  KAKQEWAS-KRVRR------------NSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEA
        KAKQ   +  R+ R            NSLP+CESCLEGKMTKRPFTGKGH +KEPLELVHSDLCGPMN KARG FEYFITFTDDYSRYGYVYLMQHKSEA
Subjt:  KAKQEWAS-KRVRR------------NSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEA

Query:  LEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVP
        LEKFKEYKAEVEN LSKTIKTFRSDRGGEYMDLKFQNYLMECEI+SQLSAPGTPQQNGV ERRNRTLLDMV SM+SYAHLPNSF GYAVQ  VYILNCVP
Subjt:  LEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVP

Query:  SKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHIREYKPRSKIVLNELSNES
        SKS+SE PLKLW G KGSLRHFRIWGC AHVLE NPKKLEPRSKLCLFVGYPKGTR GYFY  KDNKVFVSTNATFLEEDHIRE+KPRSKIVLNELS E+
Subjt:  SKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHIREYKPRSKIVLNELSNES

Query:  TELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDL
        TE STRVVEEPS L RVVHV SS R  QP+SL EPRRSGRVTNLPI YMSLTETLTVISDG+I DPLTFKKAMEDVDKDEWIKAMNLELESMYFN VWDL
Subjt:  TELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDL

Query:  VDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMYVKTTILNDNLEETIYMQQPEGFI
        VDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYE+ FS VAMLKSIRILLSIAAYFDYEIWQM VKT  LN NLEETIYMQQPEGFI
Subjt:  VDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMYVKTTILNDNLEETIYMQQPEGFI

Query:  IP
        IP
Subjt:  IP

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein; e value 7.8e-54; identity 27.87%
Query:  SLPVCESCLEGKMTKRPF---TGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRS
        S  +CE CL GK  + PF     K HI K PL +VHSD+CGP+         YF+ F D ++ Y   YL+++KS+    F+++ A+ E   +  +     
Subjt:  SLPVCESCLEGKMTKRPF---TGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRS

Query:  DRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSI---SEIPLKLWIGHKGSLRH
        D G EY+  + + + ++  I   L+ P TPQ NGV ER  RT+ +   +M+S A L  SF G AV    Y++N +PS+++   S+ P ++W   K  L+H
Subjt:  DRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSI---SEIPLKLWIGHKGSLRH

Query:  FRIWGCQAHV-LETNPKKLEPRSKLCLFVGY-PKGTR-----DGYFYGLKD---------NKVFVSTNATFLEEDHIREYK----PRSKIVLNELSNEST
         R++G   +V ++    K + +S   +FVGY P G +     +  F   +D         N   V     FL++    E K       KI+  E  NES 
Subjt:  FRIWGCQAHV-LETNPKKLEPRSKLCLFVGY-PKGTR-----DGYFYGLKD---------NKVFVSTNATFLEEDHIREYK----PRSKIVLNELSNEST

Query:  E-------------------------LSTRVVEEPSTLARVVHVSSS-----------------IRIRQPKSLGEP------------------------
        E                         + T    E      +  +  S                   + + K  G P                        
Subjt:  E-------------------------LSTRVVEEPSTLARVVHVSSS-----------------IRIRQPKSLGEP------------------------

Query:  ------RRSGRVTNLP-IHYMSLTETL--TVISDGNIGD--PLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADG
              RRS R+   P I Y     +L   V++   I +  P +F +     DK  W +A+N EL +   N  W +  +P+    +  +W++  K    G
Subjt:  ------RRSGRVTNLP-IHYMSLTETL--TVISDGNIGD--PLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADG

Query:  KVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMYVKTTILNDNLEETIYMQQPEG
            +KARLVA+G+TQ   +DYE+ F+ VA + S R +LS+   ++ ++ QM VKT  LN  L+E IYM+ P+G
Subjt:  KVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMYVKTTILNDNLEETIYMQQPEG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value 2.0e-78; identity 36.29%
Query:  CESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYM
        C+ CL GK  +  F          L+LV+SD+CGPM  ++ GG +YF+TF DD SR  +VY+++ K +  + F+++ A VE +  + +K  RSD GGEY 
Subjt:  CESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYM

Query:  DLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSIS-EIPLKLWIGHKGSLRHFRIWGCQ--
          +F+ Y     I  + + PGTPQ NGV ER NRT+++ V SM+  A LP SF G AVQ   Y++N  PS  ++ EIP ++W   + S  H +++GC+  
Subjt:  DLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSIS-EIPLKLWIGHKGSLRHFRIWGCQ--

Query:  AHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKD---NKVFVSTNATFLEEDHIREYKPRSKIVLNEL---------------SNEST--ELSTRVV
        AHV +    KL+ +S  C+F+GY     + + Y L D    KV  S +  F  E  +R     S+ V N +               S EST  E+S +  
Subjt:  AHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKD---NKVFVSTNATFLEEDHIREYKPRSKIVLNEL---------------SNEST--ELSTRVV

Query:  EEPSTLARVVHVSSSIRIRQPKSLGEP-----RRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQ
        +    + +   +   +   +  + GE      RRS R       Y S TE + +  D    +P + K+ +   +K++ +KAM  E+ES+  N  + LV+ 
Subjt:  EEPSTLARVVHVSSSIRIRQPKSLGEP-----RRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQ

Query:  PDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMYVKTTILNDNLEETIYMQQPEGF
        P G +P+ CKW++K K+  D K+  +KARLV KG+ Q +G+D+++IFS V  + SIR +LS+AA  D E+ Q+ VKT  L+ +LEE IYM+QPEGF
Subjt:  PDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMYVKTTILNDNLEETIYMQQPEGF

Q12490 Transposon Ty1-BL Gag-Pol polyprotein; e value 1.9e-15; identity 30.11%
Query:  CESCLEGKMTK-RPFTG---KGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSE--ALEKFKEYKAEVENKLSKTIKTFRSD
        C  CL GK TK R   G   K   S EP + +H+D+ GP++   +    YFI+FTD+ +++ +VY +  + E   L+ F    A ++N+   ++   + D
Subjt:  CESCLEGKMTK-RPFTG---KGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSE--ALEKFKEYKAEVENKLSKTIKTFRSD

Query:  RGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPS
        RG EY +     +L +  I    +     + +GV ER NRTLLD   + +  + LPN     A++ +  + N + S
Subjt:  RGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value 1.6e-46; identity 25.34%
Query:  CESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYM
        C  CL  K  K PF+     S  PLE ++SD+       +   + Y++ F D ++RY ++Y ++ KS+  E F  +K  +EN+    I TF SD GGE++
Subjt:  CESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYM

Query:  DLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSIS-EIPLKLWIGHKGSLRHFRIWGCQAH
         L    Y  +  I    S P TP+ NG+ ER++R +++   +++S+A +P ++  YA    VY++N +P+  +  E P +   G   +    R++GC  +
Subjt:  DLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSIS-EIPLKLWIGHKGSLRHFRIWGCQAH

Query:  --VLETNPKKLEPRSKLCLFVGYPKGTRDGYF-YGLKDNKVFVSTNATFLEE-----------DHIREYKPRSKIV------------------------
          +   N  KL+ +S+ C+F+GY   T+  Y    L+ +++++S +  F E              ++E +  S  V                        
Subjt:  --VLETNPKKLEPRSKLCLFVGYPKGTRDGYF-YGLKDNKVFVSTNATFLEE-----------DHIREYKPRSKIV------------------------

Query:  ------------------------------------------------------LNELSNESTELSTRVVEEPSTLARVVHVSS----------------
                                                                  S+++T  +    E PS LA+ +   +                
Subjt:  ------------------------------------------------------LNELSNESTELSTRVVEEPSTLARVVHVSS----------------

Query:  -------SIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVI----------SDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQPD
               SI I  P  L +   +     L  H M       +I          S     +P T  +A++D   + W  AM  E+ +   N  WDLV  P 
Subjt:  -------SIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVI----------SDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQPD

Query:  G-VKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMYVKTTILNDNLEETIYMQQPEGFI
          V  +GC+WI+ +K  +DG +  +KARLVAKGY Q  G+DY + FS V    SIRI+L +A    + I Q+ V    L   L + +YM QP GFI
Subjt:  G-VKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMYVKTTILNDNLEETIYMQQPEGFI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 2.3e-45 | 24.92
Query:  VRRNSLPV---------CESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENK
        +  +SLPV         C  C   K  K PF+     S +PLE ++SD+       +   + Y++ F D ++RY ++Y ++ KS+  + F  +K+ VEN+
Subjt:  VRRNSLPV---------CESCLEGKMTKRPFTGKGHISKEPLELVHSDLCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENK

Query:  LSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSIS-EIPLKLWI
            I T  SD GGE++ L+  +YL +  I    S P TP+ NG+ ER++R +++M  +++S+A +P ++  YA    VY++N +P+  +  + P +   
Subjt:  LSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSIS-EIPLKLWI

Query:  GHKGSLRHFRIWGCQAH--VLETNPKKLEPRSKLCLFVGYPKGTRDGYF-YGLKDNKVFVSTNATFLEE-------------------------------
        G   +    +++GC  +  +   N  KLE +SK C F+GY   T+  Y    +   +++ S +  F E                                
Subjt:  GHKGSLRHFRIWGCQAH--VLETNPKKLEPRSKLCLFVGYPKGTRDGYF-YGLKDNKVFVSTNATFLEE-------------------------------

Query:  --------------DHIREYKPR-----SKIVLNELSNE---STELSTRVVEEPSTLAR-------VVHVSSSIRIRQP---------KSLGEPRRSGRV
                       H+ +  PR     S +   ++S+    S+ +S+    EP+  +          H + +     P          S   P ++  +
Subjt:  --------------DHIREYKPR-----SKIVLNELSNE---STELSTRVVEEPSTLAR-------VVHVSSSIRIRQP---------KSLGEPRRSGRV

Query:  TNLPIHYMSLTETLTVISDGNI----------------------------------------------------------GDPLTFKKAMEDVDKDEWIK
           PI    +    T IS+ N                                                            +P T  +AM+D   D W +
Subjt:  TNLPIHYMSLTETLTVISDGNI----------------------------------------------------------GDPLTFKKAMEDVDKDEWIK

Query:  AMNLELESMYFNLVWDLV-DQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMYVKTTI
        AM  E+ +   N  WDLV   P  V  +GC+WI+ +K  +DG +  +KARLVAKGY Q  G+DY + FS V    SIRI+L +A    + I Q+ V    
Subjt:  AMNLELESMYFNLVWDLV-DQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMYVKTTI

Query:  LNDNLEETIYMQQPEGFI
        L   L + +YM QP GF+
Subjt:  LNDNLEETIYMQQPEGFI

Arabidopsis top hits | e value | %identity | Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | 8.4e-27 | 40.74
Query:  DPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLS
        +P T+ +A E +    W  AM+ E+ +M     W++   P   KPIGCKW+YK K  +DG ++ +KARLVAKGYTQ EG+D+ + FS V  L S++++L+
Subjt:  DPLTFKKAMEDVDKDEWIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLS

Query:  IAAYFDYEIWQMYVKTTILNDNLEETIYMQQPEGF
        I+A +++ + Q+ +    LN +L+E IYM+ P G+
Subjt:  IAAYFDYEIWQMYVKTTILNDNLEETIYMQQPEGF

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | 3.3e-07 | 34.15
Query:  NRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSIS-EIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSK
        NRT+++ V SM+    LP +FR  A    V+I+N  PS +I+  +P ++W     +  + R +GC A++   +  KL+PR+K
Subjt:  NRTLLDMVWSMMSYAHLPNSFRGYAVQATVYILNCVPSKSIS-EIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSK

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | 1.8e-13 | 40.7
Query:  WIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIA
        W +AM  EL+++  N  W LV  P     +GCKW++K K  +DG +   KARLVAKG+ Q EG+ + + +S V    +IR +L++A
Subjt:  WIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIA
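
The %identity column can be checked against an alignment's midline. The sketch below uses the ATMG00820.1 hit and assumes the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch) and that %identity is identities divided by alignment columns. The function name `percent_identity` is illustrative and not part of the database.

```python
# Query row and midline row copied from the ATMG00820.1 alignment above.
query = "WIKAMNLELESMYFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIA"
midline = "W +AM  EL+++  N  W LV  P     +GCKW++K K  +DG +   KARLVAKG+ Q EG+ + + +S V    +IR +L++A"


def percent_identity(query_row: str, midline_row: str) -> float:
    """Return identities / alignment columns as a percentage.

    Assumption: letters in the midline mark identical residues, and the
    query row (including any gap characters) spans every alignment column.
    """
    identities = sum(1 for ch in midline_row if ch.isalpha())
    columns = len(query_row)
    return 100.0 * identities / columns


identities = sum(1 for ch in midline if ch.isalpha())
print(identities, len(query), round(percent_identity(query, midline), 2))
# prints: 35 86 40.7
```

Under these assumptions the result agrees with the 40.7 reported for this hit.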


Sequences
CDS sequence
ATGGCGAAAAAGGGAATGAGCTTAGAGTATGGAGGAGACTTGGTACTTCCAGAGGACATAATTGGTAGAGTCAAGACAAATGTGCAGGAGATTGCAAATGTTGGACAACA
AGAAGAGAGCAGAGTTAAGGAGTATCTTCAAGTCCCAACTGCTAATGCAACTCGAACTGTTCGAGAAGCATATGAGCGCTGGGCCAAGGCAAATGAAAAAGCCCGAGCAT
ACATTTTGGCAAGCTTATCTGAAGTATTGACCAAGAAACATGAATCAATGCTCACTGCTCGTGAGATCATGGACTCCTTGCAGGAGATAGGTTCGACCTCTGGAACCAAG
TCTGTGCCTTCTTCATCTGGCAATAAGAAGTGGAAGAAGAAGAAGGGTGGCCAAGGAAATAAAGCTAACCCCGCTACTGCTAAAACGAGAAAGAAAGCTAAGGCTGCAAA
GGGAATATGTTTCCATTGCAACCAAGAGGGACATTGGAAGAGAAACTGTCCCAAGTACTTGGCAGAAAAGAAGAAGGCGAAACAAGAATGGGCTTCTAAGCGAGTTAGAA
GAAATTCCTTACCTGTATGTGAGTCATGCCTTGAAGGTAAGATGACCAAAAGACCTTTTACTGGAAAAGGTCATATTTCCAAAGAACCCCTAGAACTTGTACATTCAGAT
CTATGTGGTCCTATGAATGCTAAAGCAAGAGGAGGATTTGAATATTTCATCACTTTTACTGATGATTATTCAAGATATGGGTATGTTTATTTAATGCAACATAAGTCTGA
AGCCCTTGAAAAGTTCAAGGAATACAAGGCTGAAGTTGAAAACAAATTAAGTAAAACTATTAAAACATTTCGGTCGGATCGAGGTGGAGAGTATATGGATTTGAAATTTC
AAAACTATTTGATGGAATGTGAAATTGTATCTCAACTCTCAGCACCTGGTACACCTCAACAGAATGGTGTATTAGAAAGGAGAAATCGAACCTTGTTGGACATGGTTTGG
TCTATGATGAGTTACGCTCACTTACCTAATTCGTTTCGGGGTTATGCAGTGCAAGCTACAGTCTATATTTTGAATTGTGTTCCATCTAAAAGTATTTCTGAAATACCTTT
GAAATTATGGATTGGTCATAAAGGTAGTTTACGTCATTTCAGAATCTGGGGTTGTCAAGCACACGTGCTTGAGACGAATCCAAAGAAATTGGAACCTCGTTCAAAATTAT
GTTTATTTGTAGGCTACCCCAAAGGAACTAGAGATGGTTATTTCTATGGTCTTAAAGATAATAAAGTGTTTGTATCGACAAATGCTACATTTTTAGAAGAGGACCACATA
AGGGAGTACAAACCGCGTAGTAAGATAGTATTAAATGAACTTTCCAATGAAAGTACTGAACTTTCAACAAGAGTTGTTGAAGAGCCTAGTACATTAGCAAGAGTTGTTCA
TGTCAGTTCATCTATTAGGATACGTCAACCTAAATCATTGGGCGAACCTCGACGAAGTGGGAGGGTTACAAACTTACCTATTCATTATATGAGTTTAACGGAAACCTTAA
CTGTCATATCTGATGGCAACATTGGGGATCCATTGACTTTTAAGAAGGCAATGGAGGATGTGGATAAAGATGAATGGATCAAAGCTATGAATCTTGAATTGGAATCTATG
TACTTCAATTTAGTCTGGGATCTTGTAGATCAACCTGATGGGGTAAAACCTATAGGTTGTAAATGGATCTACAAGAGAAAAAGAGGTGCAGATGGTAAGGTACAAACTTT
TAAAGCTAGACTAGTGGCAAAGGGTTATACCCAAGTTGAGGGAGTTGACTATGAGAAGATTTTCTCATCTGTTGCCATGTTAAAGTCTATTCGAATACTTTTGTCCATTG
CTGCATATTTTGACTATGAGATTTGGCAAATGTATGTGAAAACTACCATTTTGAATGACAATCTTGAGGAGACCATTTATATGCAACAACCAGAAGGATTCATAATTCCA
TGTTAA
mRNA sequence
ATGGCGAAAAAGGGAATGAGCTTAGAGTATGGAGGAGACTTGGTACTTCCAGAGGACATAATTGGTAGAGTCAAGACAAATGTGCAGGAGATTGCAAATGTTGGACAACA
AGAAGAGAGCAGAGTTAAGGAGTATCTTCAAGTCCCAACTGCTAATGCAACTCGAACTGTTCGAGAAGCATATGAGCGCTGGGCCAAGGCAAATGAAAAAGCCCGAGCAT
ACATTTTGGCAAGCTTATCTGAAGTATTGACCAAGAAACATGAATCAATGCTCACTGCTCGTGAGATCATGGACTCCTTGCAGGAGATAGGTTCGACCTCTGGAACCAAG
TCTGTGCCTTCTTCATCTGGCAATAAGAAGTGGAAGAAGAAGAAGGGTGGCCAAGGAAATAAAGCTAACCCCGCTACTGCTAAAACGAGAAAGAAAGCTAAGGCTGCAAA
GGGAATATGTTTCCATTGCAACCAAGAGGGACATTGGAAGAGAAACTGTCCCAAGTACTTGGCAGAAAAGAAGAAGGCGAAACAAGAATGGGCTTCTAAGCGAGTTAGAA
GAAATTCCTTACCTGTATGTGAGTCATGCCTTGAAGGTAAGATGACCAAAAGACCTTTTACTGGAAAAGGTCATATTTCCAAAGAACCCCTAGAACTTGTACATTCAGAT
CTATGTGGTCCTATGAATGCTAAAGCAAGAGGAGGATTTGAATATTTCATCACTTTTACTGATGATTATTCAAGATATGGGTATGTTTATTTAATGCAACATAAGTCTGA
AGCCCTTGAAAAGTTCAAGGAATACAAGGCTGAAGTTGAAAACAAATTAAGTAAAACTATTAAAACATTTCGGTCGGATCGAGGTGGAGAGTATATGGATTTGAAATTTC
AAAACTATTTGATGGAATGTGAAATTGTATCTCAACTCTCAGCACCTGGTACACCTCAACAGAATGGTGTATTAGAAAGGAGAAATCGAACCTTGTTGGACATGGTTTGG
TCTATGATGAGTTACGCTCACTTACCTAATTCGTTTCGGGGTTATGCAGTGCAAGCTACAGTCTATATTTTGAATTGTGTTCCATCTAAAAGTATTTCTGAAATACCTTT
GAAATTATGGATTGGTCATAAAGGTAGTTTACGTCATTTCAGAATCTGGGGTTGTCAAGCACACGTGCTTGAGACGAATCCAAAGAAATTGGAACCTCGTTCAAAATTAT
GTTTATTTGTAGGCTACCCCAAAGGAACTAGAGATGGTTATTTCTATGGTCTTAAAGATAATAAAGTGTTTGTATCGACAAATGCTACATTTTTAGAAGAGGACCACATA
AGGGAGTACAAACCGCGTAGTAAGATAGTATTAAATGAACTTTCCAATGAAAGTACTGAACTTTCAACAAGAGTTGTTGAAGAGCCTAGTACATTAGCAAGAGTTGTTCA
TGTCAGTTCATCTATTAGGATACGTCAACCTAAATCATTGGGCGAACCTCGACGAAGTGGGAGGGTTACAAACTTACCTATTCATTATATGAGTTTAACGGAAACCTTAA
CTGTCATATCTGATGGCAACATTGGGGATCCATTGACTTTTAAGAAGGCAATGGAGGATGTGGATAAAGATGAATGGATCAAAGCTATGAATCTTGAATTGGAATCTATG
TACTTCAATTTAGTCTGGGATCTTGTAGATCAACCTGATGGGGTAAAACCTATAGGTTGTAAATGGATCTACAAGAGAAAAAGAGGTGCAGATGGTAAGGTACAAACTTT
TAAAGCTAGACTAGTGGCAAAGGGTTATACCCAAGTTGAGGGAGTTGACTATGAGAAGATTTTCTCATCTGTTGCCATGTTAAAGTCTATTCGAATACTTTTGTCCATTG
CTGCATATTTTGACTATGAGATTTGGCAAATGTATGTGAAAACTACCATTTTGAATGACAATCTTGAGGAGACCATTTATATGCAACAACCAGAAGGATTCATAATTCCA
TGTTAA
Protein sequence
MAKKGMSLEYGGDLVLPEDIIGRVKTNVQEIANVGQQEESRVKEYLQVPTANATRTVREAYERWAKANEKARAYILASLSEVLTKKHESMLTAREIMDSLQEIGSTSGTK
SVPSSSGNKKWKKKKGGQGNKANPATAKTRKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQEWASKRVRRNSLPVCESCLEGKMTKRPFTGKGHISKEPLELVHSD
LCGPMNAKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENKLSKTIKTFRSDRGGEYMDLKFQNYLMECEIVSQLSAPGTPQQNGVLERRNRTLLDMVW
SMMSYAHLPNSFRGYAVQATVYILNCVPSKSISEIPLKLWIGHKGSLRHFRIWGCQAHVLETNPKKLEPRSKLCLFVGYPKGTRDGYFYGLKDNKVFVSTNATFLEEDHI
REYKPRSKIVLNELSNESTELSTRVVEEPSTLARVVHVSSSIRIRQPKSLGEPRRSGRVTNLPIHYMSLTETLTVISDGNIGDPLTFKKAMEDVDKDEWIKAMNLELESM
YFNLVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEKIFSSVAMLKSIRILLSIAAYFDYEIWQMYVKTTILNDNLEETIYMQQPEGFIIP
C
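
The CDS and protein sequences above can be cross-checked by translation. This is a minimal sketch, assuming the standard nuclear genetic code; the `translate` helper and the variable names are illustrative. Only the first 30 and last 27 nucleotides of the CDS are used here, both in frame.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGCGAAAAAGGGAATGAGCTTAGAGTAT"  # first 30 nt of the CDS
cds_end = "CCAGAAGGATTCATAATTCCATGTTAA"      # last 27 nt, including the stop codon

print(translate(cds_start))  # prints: MAKKGMSLEY
print(translate(cds_end))    # prints: PEGFIIPC
```

Both fragments match the corresponding ends of the protein sequence (MAKKGMSLEY... and ...PEGFIIPC). Passing the full CDS, joined into one string, to the same function would extend the check to the whole sequence.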