; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0016329 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0016329
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr12:36415927..36427252
RNA-Seq ExpressionLag0016329
SyntenyLag0016329
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]9.1e-12332.34Show/hide
Query:  TGR-WRAKEPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSEALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQDYLIEHEIT
        TG+ +RAKEPLEL+HSDLCG MNVKARGG+EYF+SFIDDYSRY YLYLM HKSEALEKFKEYKTEVEN L K IK LRSD+GGEYMDL+FQDY+IEH I 
Subjt:  TGR-WRAKEPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSEALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQDYLIEHEIT

Query:  SQLSAPATPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILNMVYVK--------VWKGRK-----------------------------
        SQLSAP TPQQNGVSE+RNRTLLDMVRSMMSYAQLP SFWGYAVET V+ILN V  K        +W+GRK                             
Subjt:  SQLSAPATPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILNMVYVK--------VWKGRK-----------------------------

Query:  ----GYPKETKGGIFYDPQDNKVIVSTNATFLEEDHIKDHRPRSKLVLDEISRKTTKESTKVVDQAGPSKVVDQT------HPSQIVREPRRSGRVIDNC
            GYPKET+GG+F+DPQ+N+V VSTNATFLEEDH+++H+PRSKLVL E     T EST+VVD+ GPS  VD+T      HPSQ +R PRRSGRV+   
Subjt:  ----GYPKETKGGIFYDPQDNKVIVSTNATFLEEDHIKDHRPRSKLVLDEISRKTTKESTKVVDQAGPSKVVDQT------HPSQIVREPRRSGRVIDNC

Query:  PKATAPPPQPPRTEPPQSAAQPPRTVAVVFLLASWFSLAVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPCRVTPSKCSIRFE
                                                                                                            
Subjt:  PKATAPPPQPPRTEPPQSAAQPPRTVAVVFLLASWFSLAVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPCRVTPSKCSIRFE

Query:  TLQLGYPLPKERSNTLLEVRITRLSESESVCSLVIKLDALLDTKHVVLMFIKRDAEITCGILLWWLFVGWWLTGCELLTSTVVVLIPLPFISPNISDYAG
                                                                                                            
Subjt:  TLQLGYPLPKERSNTLLEVRITRLSESESVCSLVIKLDALLDTKHVVLMFIKRDAEITCGILLWWLFVGWWLTGCELLTSTVVVLIPLPFISPNISDYAG

Query:  DELYVPGDVDEEVTRGDHRLFCEVVEWIESIRSVLSLQVRELDSGVIVRMFINDPSCAAIQNLGMVSENAFDEKKVVERNALEKKKLRSTSIIQTTAPPP
                                                                                                            
Subjt:  DELYVPGDVDEEVTRGDHRLFCEVVEWIESIRSVLSLQVRELDSGVIVRMFINDPSCAAIQNLGMVSENAFDEKKVVERNALEKKKLRSTSIIQTTAPPP

Query:  QPPRTEPPQSAAQPPRPAAVVFLLAPWFSLPVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPSPSKCSIRFETLQLGYPLPKE
                                                                                                            
Subjt:  QPPRTEPPQSAAQPPRPAAVVFLLAPWFSLPVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPSPSKCSIRFETLQLGYPLPKE

Query:  RSNTLLEFVCSWKLVLGACRAIPASVDYAGDELYVPGDVDEEVLLLVGDARSDSYPVCSCSVKLWSGSNQFRSVVEFNRCSKILGLSYVRRYAAEIFGAL
                                                                                                            
Subjt:  RSNTLLEFVCSWKLVLGACRAIPASVDYAGDELYVPGDVDEEVLLLVGDARSDSYPVCSCSVKLWSGSNQFRSVVEFNRCSKILGLSYVRRYAAEIFGAL

Query:  GCLNRSLVMSLDYAVERLEGANSVLQQNWEQNCHITAHCNSYSYAYYELSSLRNPTVIIQPDRYMGLTETQTIISDDDVEDPLSFNQAMNDVDKDQWIKV
                                                                   QP+RY+GLTETQ +I DD VEDPLS+ QAMNDVDKDQW+K 
Subjt:  GCLNRSLVMSLDYAVERLEGANSVLQQNWEQNCHITAHCNSYSYAYYELSSLRNPTVIIQPDRYMGLTETQTIISDDDVEDPLSFNQAMNDVDKDQWIKV

Query:  MDLEMESMYFNSIWDLVEQPDG------------------------------GFTQREGVDFEETFSPVAMLKSIRILLSIATYYDYEIWQMD-------
        MDLEMESMYFNS+W+LV+ P+G                              G+TQREGVD+EETFSPVAMLKSIRILLSIAT+YDYEIWQMD       
Subjt:  MDLEMESMYFNSIWDLVEQPDG------------------------------GFTQREGVDFEETFSPVAMLKSIRILLSIATYYDYEIWQMD-------

Query:  -------------GF-EQGDEQKVCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVYKKIVKSTVAFLVLYVDDILLIGNNVGFLTDVKQWLA
                     GF  QG EQKVCKL RSIYGLKQASRSWNIRFDTAIK+YGF+QNV EPCVYKKI K  VAFLVLYVDDILLIGN+VG+LTDVK WLA
Subjt:  -------------GF-EQGDEQKVCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVYKKIVKSTVAFLVLYVDDILLIGNNVGFLTDVKQWLA

Query:  TQFK
         QF+
Subjt:  TQFK

KAA0033121.1 gag/pol protein [Cucumis melo var. makuwa]3.0e-11831.52Show/hide
Query:  TGR-WRAKEPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSEALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQDYLIEHEIT
        TG+ +RAKEPLEL+HSDLCG MNVKARGG+EYF+SFIDDYSRY YLYLM HKSEALEKFKEYK EVEN L K IK LRSD+GGEYMDL+FQDY+IEH I 
Subjt:  TGR-WRAKEPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSEALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQDYLIEHEIT

Query:  SQLSAPATPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILNMVYVK--------VWKGRK-----------------------------
         QLSAP TPQQNGV E+RNRT+LDMVRSMMSYAQLP SFWGYAVET V+ILN V  K        +W+GRK                             
Subjt:  SQLSAPATPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILNMVYVK--------VWKGRK-----------------------------

Query:  ----GYPKETKGGIFYDPQDNKVIVSTNATFLEEDHIKDHRPRSKLVLDEISRKTTKESTKVVDQAGPSKVVDQT------HPSQIVREPRRSGRVIDNC
            GYPKET+GG+F+DPQ N+V+VSTNATFLEEDH++DH+P++KLVL+E       EST+VVD+ GPS  V++T      HPSQ +R PRRSGR++   
Subjt:  ----GYPKETKGGIFYDPQDNKVIVSTNATFLEEDHIKDHRPRSKLVLDEISRKTTKESTKVVDQAGPSKVVDQT------HPSQIVREPRRSGRVIDNC

Query:  PKATAPPPQPPRTEPPQSAAQPPRTVAVVFLLASWFSLAVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPCRVTPSKCSIRFE
                                                                                                            
Subjt:  PKATAPPPQPPRTEPPQSAAQPPRTVAVVFLLASWFSLAVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPCRVTPSKCSIRFE

Query:  TLQLGYPLPKERSNTLLEVRITRLSESESVCSLVIKLDALLDTKHVVLMFIKRDAEITCGILLWWLFVGWWLTGCELLTSTVVVLIPLPFISPNISDYAG
                                                                                                            
Subjt:  TLQLGYPLPKERSNTLLEVRITRLSESESVCSLVIKLDALLDTKHVVLMFIKRDAEITCGILLWWLFVGWWLTGCELLTSTVVVLIPLPFISPNISDYAG

Query:  DELYVPGDVDEEVTRGDHRLFCEVVEWIESIRSVLSLQVRELDSGVIVRMFINDPSCAAIQNLGMVSENAFDEKKVVERNALEKKKLRSTSIIQTTAPPP
                                                                                                            
Subjt:  DELYVPGDVDEEVTRGDHRLFCEVVEWIESIRSVLSLQVRELDSGVIVRMFINDPSCAAIQNLGMVSENAFDEKKVVERNALEKKKLRSTSIIQTTAPPP

Query:  QPPRTEPPQSAAQPPRPAAVVFLLAPWFSLPVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPSPSKCSIRFETLQLGYPLPKE
                                                                                                            
Subjt:  QPPRTEPPQSAAQPPRPAAVVFLLAPWFSLPVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPSPSKCSIRFETLQLGYPLPKE

Query:  RSNTLLEFVCSWKLVLGACRAIPASVDYAGDELYVPGDVDEEVLLLVGDARSDSYPVCSCSVKLWSGSNQFRSVVEFNRCSKILGLSYVRRYAAEIFGAL
                                                                                                            
Subjt:  RSNTLLEFVCSWKLVLGACRAIPASVDYAGDELYVPGDVDEEVLLLVGDARSDSYPVCSCSVKLWSGSNQFRSVVEFNRCSKILGLSYVRRYAAEIFGAL

Query:  GCLNRSLVMSLDYAVERLEGANSVLQQNWEQNCHITAHCNSYSYAYYELSSLRNPTVIIQPDRYMGLTETQTIISDDDVEDPLSFNQAMNDVDKDQWIKV
                                                                   QP+RY+GLTETQ +I DD VEDPLS+NQAMNDVDKDQW+K 
Subjt:  GCLNRSLVMSLDYAVERLEGANSVLQQNWEQNCHITAHCNSYSYAYYELSSLRNPTVIIQPDRYMGLTETQTIISDDDVEDPLSFNQAMNDVDKDQWIKV

Query:  MDLEMESMYFNSIWDLVEQPDG------------------------------GFTQREGVDFEETFSPVAMLKSIRILLSIATYYDYEIWQMD-------
        MDLEMESMYFN +W+LV+ P+G                              G+TQREGVD+EETFSPVAMLKSIRILLSIAT+YDYEIW+MD       
Subjt:  MDLEMESMYFNSIWDLVEQPDG------------------------------GFTQREGVDFEETFSPVAMLKSIRILLSIATYYDYEIWQMD-------

Query:  -------------GF-EQGDEQKVCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVYKKIVKSTVAFLVLYVDDILLIGNNVGFLTDVKQWLA
                     GF  QG EQKVCKL RSIYGLKQASRSWNIRFDTAIK+YGFEQNV EPCVYKKI K  V FLVLYVDDILLIGN+VG+LTDVK WLA
Subjt:  -------------GF-EQGDEQKVCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVYKKIVKSTVAFLVLYVDDILLIGNNVGFLTDVKQWLA

Query:  TQFK
         QF+
Subjt:  TQFK

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]3.0e-11831.7Show/hide
Query:  TGR-WRAKEPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSEALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQDYLIEHEIT
        TG+ +RAKEPLEL+HSDLCG MNVKARG +EYF+SFIDDYSRY YLYLM HKSEALEKFKEYKTEVEN L K IK  RSD+GGEYMDL FQDY+IEH I 
Subjt:  TGR-WRAKEPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSEALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQDYLIEHEIT

Query:  SQLSAPATPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILNMVYVK--------VWKGRK-----------------------------
        SQLSAP TPQQNGVSE+RNRTLLDMVRSMMSYAQLP SFWGYAVET V+ILN V  K        +W+GRK                             
Subjt:  SQLSAPATPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILNMVYVK--------VWKGRK-----------------------------

Query:  ----GYPKETKGGIFYDPQDNKVIVSTNATFLEEDHIKDHRPRSKLVLDEISRKTTKESTKVVDQAGPSKVVDQT------HPSQIVREPRRSGRVIDNC
            GYPKET+GG+F+DP++N+V VSTNATFLEEDH+++H+PRSKLVL E     T EST+VVD+ GPS  VD+T      HPSQ +R PRRSGRV+   
Subjt:  ----GYPKETKGGIFYDPQDNKVIVSTNATFLEEDHIKDHRPRSKLVLDEISRKTTKESTKVVDQAGPSKVVDQT------HPSQIVREPRRSGRVIDNC

Query:  PKATAPPPQPPRTEPPQSAAQPPRTVAVVFLLASWFSLAVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPCRVTPSKCSIRFE
                                                                                                            
Subjt:  PKATAPPPQPPRTEPPQSAAQPPRTVAVVFLLASWFSLAVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPCRVTPSKCSIRFE

Query:  TLQLGYPLPKERSNTLLEVRITRLSESESVCSLVIKLDALLDTKHVVLMFIKRDAEITCGILLWWLFVGWWLTGCELLTSTVVVLIPLPFISPNISDYAG
                                                                                                            
Subjt:  TLQLGYPLPKERSNTLLEVRITRLSESESVCSLVIKLDALLDTKHVVLMFIKRDAEITCGILLWWLFVGWWLTGCELLTSTVVVLIPLPFISPNISDYAG

Query:  DELYVPGDVDEEVTRGDHRLFCEVVEWIESIRSVLSLQVRELDSGVIVRMFINDPSCAAIQNLGMVSENAFDEKKVVERNALEKKKLRSTSIIQTTAPPP
                                                                                                            
Subjt:  DELYVPGDVDEEVTRGDHRLFCEVVEWIESIRSVLSLQVRELDSGVIVRMFINDPSCAAIQNLGMVSENAFDEKKVVERNALEKKKLRSTSIIQTTAPPP

Query:  QPPRTEPPQSAAQPPRPAAVVFLLAPWFSLPVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPSPSKCSIRFETLQLGYPLPKE
                                                                                                            
Subjt:  QPPRTEPPQSAAQPPRPAAVVFLLAPWFSLPVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPSPSKCSIRFETLQLGYPLPKE

Query:  RSNTLLEFVCSWKLVLGACRAIPASVDYAGDELYVPGDVDEEVLLLVGDARSDSYPVCSCSVKLWSGSNQFRSVVEFNRCSKILGLSYVRRYAAEIFGAL
                                                                                                            
Subjt:  RSNTLLEFVCSWKLVLGACRAIPASVDYAGDELYVPGDVDEEVLLLVGDARSDSYPVCSCSVKLWSGSNQFRSVVEFNRCSKILGLSYVRRYAAEIFGAL

Query:  GCLNRSLVMSLDYAVERLEGANSVLQQNWEQNCHITAHCNSYSYAYYELSSLRNPTVIIQPDRYMGLTETQTIISDDDVEDPLSFNQAMNDVDKDQWIKV
                                                                   QP+RY+GLTETQ +I DD VEDPLS+ QAMNDVDKDQW+K 
Subjt:  GCLNRSLVMSLDYAVERLEGANSVLQQNWEQNCHITAHCNSYSYAYYELSSLRNPTVIIQPDRYMGLTETQTIISDDDVEDPLSFNQAMNDVDKDQWIKV

Query:  MDLEMESMYFNSIWDLVEQPDG------------------------------GFTQREGVDFEETFSPVAMLKSIRILLSIATYYDYEIWQMD-------
        MDLEMESMYFNS+W+LV+ P+G                              G+T++EGVD+EETFS VAMLKSIRILLSIA +YDYEIWQMD       
Subjt:  MDLEMESMYFNSIWDLVEQPDG------------------------------GFTQREGVDFEETFSPVAMLKSIRILLSIATYYDYEIWQMD-------

Query:  -------------GF-EQGDEQKVCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVYKKIVKSTVAFLVLYVDDILLIGNNVGFLTDVKQWLA
                     GF  QG EQKVCKL RSIYGLKQASRSWNIRFDTAIK+YGF+QNV EPCVYKKI K  VAFLVLYVDDILLIGN+VG+LTDVK WLA
Subjt:  -------------GF-EQGDEQKVCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVYKKIVKSTVAFLVLYVDDILLIGNNVGFLTDVKQWLA

Query:  TQFK
         QF+
Subjt:  TQFK

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]9.1e-12332.34Show/hide
Query:  TGR-WRAKEPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSEALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQDYLIEHEIT
        TG+ +RAKEPLEL+HSDLCG MNVKARGG+EYF+SFIDDYSRY YLYLM HKSEALEKFKEYKTEVEN L K IK LRSD+GGEYMDL+FQDY+IEH I 
Subjt:  TGR-WRAKEPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSEALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQDYLIEHEIT

Query:  SQLSAPATPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILNMVYVK--------VWKGRK-----------------------------
        SQLSAP TPQQNGVSE+RNRTLLDMVRSMMSYAQLP SFWGYAVET V+ILN V  K        +W+GRK                             
Subjt:  SQLSAPATPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILNMVYVK--------VWKGRK-----------------------------

Query:  ----GYPKETKGGIFYDPQDNKVIVSTNATFLEEDHIKDHRPRSKLVLDEISRKTTKESTKVVDQAGPSKVVDQT------HPSQIVREPRRSGRVIDNC
            GYPKET+GG+F+DPQ+N+V VSTNATFLEEDH+++H+PRSKLVL E     T EST+VVD+ GPS  VD+T      HPSQ +R PRRSGRV+   
Subjt:  ----GYPKETKGGIFYDPQDNKVIVSTNATFLEEDHIKDHRPRSKLVLDEISRKTTKESTKVVDQAGPSKVVDQT------HPSQIVREPRRSGRVIDNC

Query:  PKATAPPPQPPRTEPPQSAAQPPRTVAVVFLLASWFSLAVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPCRVTPSKCSIRFE
                                                                                                            
Subjt:  PKATAPPPQPPRTEPPQSAAQPPRTVAVVFLLASWFSLAVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPCRVTPSKCSIRFE

Query:  TLQLGYPLPKERSNTLLEVRITRLSESESVCSLVIKLDALLDTKHVVLMFIKRDAEITCGILLWWLFVGWWLTGCELLTSTVVVLIPLPFISPNISDYAG
                                                                                                            
Subjt:  TLQLGYPLPKERSNTLLEVRITRLSESESVCSLVIKLDALLDTKHVVLMFIKRDAEITCGILLWWLFVGWWLTGCELLTSTVVVLIPLPFISPNISDYAG

Query:  DELYVPGDVDEEVTRGDHRLFCEVVEWIESIRSVLSLQVRELDSGVIVRMFINDPSCAAIQNLGMVSENAFDEKKVVERNALEKKKLRSTSIIQTTAPPP
                                                                                                            
Subjt:  DELYVPGDVDEEVTRGDHRLFCEVVEWIESIRSVLSLQVRELDSGVIVRMFINDPSCAAIQNLGMVSENAFDEKKVVERNALEKKKLRSTSIIQTTAPPP

Query:  QPPRTEPPQSAAQPPRPAAVVFLLAPWFSLPVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPSPSKCSIRFETLQLGYPLPKE
                                                                                                            
Subjt:  QPPRTEPPQSAAQPPRPAAVVFLLAPWFSLPVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPSPSKCSIRFETLQLGYPLPKE

Query:  RSNTLLEFVCSWKLVLGACRAIPASVDYAGDELYVPGDVDEEVLLLVGDARSDSYPVCSCSVKLWSGSNQFRSVVEFNRCSKILGLSYVRRYAAEIFGAL
                                                                                                            
Subjt:  RSNTLLEFVCSWKLVLGACRAIPASVDYAGDELYVPGDVDEEVLLLVGDARSDSYPVCSCSVKLWSGSNQFRSVVEFNRCSKILGLSYVRRYAAEIFGAL

Query:  GCLNRSLVMSLDYAVERLEGANSVLQQNWEQNCHITAHCNSYSYAYYELSSLRNPTVIIQPDRYMGLTETQTIISDDDVEDPLSFNQAMNDVDKDQWIKV
                                                                   QP+RY+GLTETQ +I DD VEDPLS+ QAMNDVDKDQW+K 
Subjt:  GCLNRSLVMSLDYAVERLEGANSVLQQNWEQNCHITAHCNSYSYAYYELSSLRNPTVIIQPDRYMGLTETQTIISDDDVEDPLSFNQAMNDVDKDQWIKV

Query:  MDLEMESMYFNSIWDLVEQPDG------------------------------GFTQREGVDFEETFSPVAMLKSIRILLSIATYYDYEIWQMD-------
        MDLEMESMYFNS+W+LV+ P+G                              G+TQREGVD+EETFSPVAMLKSIRILLSIAT+YDYEIWQMD       
Subjt:  MDLEMESMYFNSIWDLVEQPDG------------------------------GFTQREGVDFEETFSPVAMLKSIRILLSIATYYDYEIWQMD-------

Query:  -------------GF-EQGDEQKVCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVYKKIVKSTVAFLVLYVDDILLIGNNVGFLTDVKQWLA
                     GF  QG EQKVCKL RSIYGLKQASRSWNIRFDTAIK+YGF+QNV EPCVYKKI K  VAFLVLYVDDILLIGN+VG+LTDVK WLA
Subjt:  -------------GF-EQGDEQKVCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVYKKIVKSTVAFLVLYVDDILLIGNNVGFLTDVKQWLA

Query:  TQFK
         QF+
Subjt:  TQFK

TYK15984.1 gag/pol protein [Cucumis melo var. makuwa]1.9e-12031.97Show/hide
Query:  TGR-WRAKEPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSEALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQDYLIEHEIT
        TG+ +RAKEPL+L+HSDLCG MNVKARG +EYF+SFIDDYSRY YLYLM HKSEALEKFKEYK EVEN L K IK LRSD+GGEYMDL+FQDY+IEH I 
Subjt:  TGR-WRAKEPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSEALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQDYLIEHEIT

Query:  SQLSAPATPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILNMVYVK--------VWKGRK-----------------------------
        SQLSAP TPQQNGVSE+RNRTLLDMV SMMSYAQLP SFWGYAVET V+ILN V  K        +W+GRK                             
Subjt:  SQLSAPATPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILNMVYVK--------VWKGRK-----------------------------

Query:  ----GYPKETKGGIFYDPQDNKVIVSTNATFLEEDHIKDHRPRSKLVLDEISRKTTKESTKVVDQAGPSKVVDQT------HPSQIVREPRRSGRVIDNC
            GYPKET+GG+F+DPQ+N+V VSTNATFLEEDH+++H+PRSKLVL E     T EST+VVD+ GPS  VD+T      HPSQ +R PRRSGRV+   
Subjt:  ----GYPKETKGGIFYDPQDNKVIVSTNATFLEEDHIKDHRPRSKLVLDEISRKTTKESTKVVDQAGPSKVVDQT------HPSQIVREPRRSGRVIDNC

Query:  PKATAPPPQPPRTEPPQSAAQPPRTVAVVFLLASWFSLAVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPCRVTPSKCSIRFE
                                                                                                            
Subjt:  PKATAPPPQPPRTEPPQSAAQPPRTVAVVFLLASWFSLAVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPCRVTPSKCSIRFE

Query:  TLQLGYPLPKERSNTLLEVRITRLSESESVCSLVIKLDALLDTKHVVLMFIKRDAEITCGILLWWLFVGWWLTGCELLTSTVVVLIPLPFISPNISDYAG
                                                                                                            
Subjt:  TLQLGYPLPKERSNTLLEVRITRLSESESVCSLVIKLDALLDTKHVVLMFIKRDAEITCGILLWWLFVGWWLTGCELLTSTVVVLIPLPFISPNISDYAG

Query:  DELYVPGDVDEEVTRGDHRLFCEVVEWIESIRSVLSLQVRELDSGVIVRMFINDPSCAAIQNLGMVSENAFDEKKVVERNALEKKKLRSTSIIQTTAPPP
                                                                                                            
Subjt:  DELYVPGDVDEEVTRGDHRLFCEVVEWIESIRSVLSLQVRELDSGVIVRMFINDPSCAAIQNLGMVSENAFDEKKVVERNALEKKKLRSTSIIQTTAPPP

Query:  QPPRTEPPQSAAQPPRPAAVVFLLAPWFSLPVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPSPSKCSIRFETLQLGYPLPKE
                                                                                                            
Subjt:  QPPRTEPPQSAAQPPRPAAVVFLLAPWFSLPVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPSPSKCSIRFETLQLGYPLPKE

Query:  RSNTLLEFVCSWKLVLGACRAIPASVDYAGDELYVPGDVDEEVLLLVGDARSDSYPVCSCSVKLWSGSNQFRSVVEFNRCSKILGLSYVRRYAAEIFGAL
                                                                                                            
Subjt:  RSNTLLEFVCSWKLVLGACRAIPASVDYAGDELYVPGDVDEEVLLLVGDARSDSYPVCSCSVKLWSGSNQFRSVVEFNRCSKILGLSYVRRYAAEIFGAL

Query:  GCLNRSLVMSLDYAVERLEGANSVLQQNWEQNCHITAHCNSYSYAYYELSSLRNPTVIIQPDRYMGLTETQTIISDDDVEDPLSFNQAMNDVDKDQWIKV
                                                                   QP+RY+GLTETQ +I DD VEDPLS+ QAMNDVDKDQW+K 
Subjt:  GCLNRSLVMSLDYAVERLEGANSVLQQNWEQNCHITAHCNSYSYAYYELSSLRNPTVIIQPDRYMGLTETQTIISDDDVEDPLSFNQAMNDVDKDQWIKV

Query:  MDLEMESMYFNSIWDLVEQPDG------------------------------GFTQREGVDFEETFSPVAMLKSIRILLSIATYYDYEIWQMD-------
        MDLEMESMYFNS+W+LV+ P+G                              G+TQREGVD+EETFSPVAMLKSIRILLSIAT+YDYEIWQMD       
Subjt:  MDLEMESMYFNSIWDLVEQPDG------------------------------GFTQREGVDFEETFSPVAMLKSIRILLSIATYYDYEIWQMD-------

Query:  -------------GF-EQGDEQKVCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVYKKIVKSTVAFLVLYVDDILLIGNNVGFLTDVKQWLA
                     GF  QG EQKVCKL RSIYGLKQASRSWNIRFDTAIK+YGF+QNV EPCVYKKI K  VAFLVLYVDDILLIGN+VG+LTDVK WLA
Subjt:  -------------GF-EQGDEQKVCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVYKKIVKSTVAFLVLYVDDILLIGNNVGFLTDVKQWLA

Query:  TQFK
         QF+
Subjt:  TQFK

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein1.5e-11831.7Show/hide
Query:  TGR-WRAKEPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSEALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQDYLIEHEIT
        TG+ +RAKEPLEL+HSDLCG MNVKARG +EYF+SFIDDYSRY YLYLM HKSEALEKFKEYKTEVEN L K IK  RSD+GGEYMDL FQDY+IEH I 
Subjt:  TGR-WRAKEPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSEALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQDYLIEHEIT

Query:  SQLSAPATPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILNMVYVK--------VWKGRK-----------------------------
        SQLSAP TPQQNGVSE+RNRTLLDMVRSMMSYAQLP SFWGYAVET V+ILN V  K        +W+GRK                             
Subjt:  SQLSAPATPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILNMVYVK--------VWKGRK-----------------------------

Query:  ----GYPKETKGGIFYDPQDNKVIVSTNATFLEEDHIKDHRPRSKLVLDEISRKTTKESTKVVDQAGPSKVVDQT------HPSQIVREPRRSGRVIDNC
            GYPKET+GG+F+DP++N+V VSTNATFLEEDH+++H+PRSKLVL E     T EST+VVD+ GPS  VD+T      HPSQ +R PRRSGRV+   
Subjt:  ----GYPKETKGGIFYDPQDNKVIVSTNATFLEEDHIKDHRPRSKLVLDEISRKTTKESTKVVDQAGPSKVVDQT------HPSQIVREPRRSGRVIDNC

Query:  PKATAPPPQPPRTEPPQSAAQPPRTVAVVFLLASWFSLAVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPCRVTPSKCSIRFE
                                                                                                            
Subjt:  PKATAPPPQPPRTEPPQSAAQPPRTVAVVFLLASWFSLAVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPCRVTPSKCSIRFE

Query:  TLQLGYPLPKERSNTLLEVRITRLSESESVCSLVIKLDALLDTKHVVLMFIKRDAEITCGILLWWLFVGWWLTGCELLTSTVVVLIPLPFISPNISDYAG
                                                                                                            
Subjt:  TLQLGYPLPKERSNTLLEVRITRLSESESVCSLVIKLDALLDTKHVVLMFIKRDAEITCGILLWWLFVGWWLTGCELLTSTVVVLIPLPFISPNISDYAG

Query:  DELYVPGDVDEEVTRGDHRLFCEVVEWIESIRSVLSLQVRELDSGVIVRMFINDPSCAAIQNLGMVSENAFDEKKVVERNALEKKKLRSTSIIQTTAPPP
                                                                                                            
Subjt:  DELYVPGDVDEEVTRGDHRLFCEVVEWIESIRSVLSLQVRELDSGVIVRMFINDPSCAAIQNLGMVSENAFDEKKVVERNALEKKKLRSTSIIQTTAPPP

Query:  QPPRTEPPQSAAQPPRPAAVVFLLAPWFSLPVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPSPSKCSIRFETLQLGYPLPKE
                                                                                                            
Subjt:  QPPRTEPPQSAAQPPRPAAVVFLLAPWFSLPVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPSPSKCSIRFETLQLGYPLPKE

Query:  RSNTLLEFVCSWKLVLGACRAIPASVDYAGDELYVPGDVDEEVLLLVGDARSDSYPVCSCSVKLWSGSNQFRSVVEFNRCSKILGLSYVRRYAAEIFGAL
                                                                                                            
Subjt:  RSNTLLEFVCSWKLVLGACRAIPASVDYAGDELYVPGDVDEEVLLLVGDARSDSYPVCSCSVKLWSGSNQFRSVVEFNRCSKILGLSYVRRYAAEIFGAL

Query:  GCLNRSLVMSLDYAVERLEGANSVLQQNWEQNCHITAHCNSYSYAYYELSSLRNPTVIIQPDRYMGLTETQTIISDDDVEDPLSFNQAMNDVDKDQWIKV
                                                                   QP+RY+GLTETQ +I DD VEDPLS+ QAMNDVDKDQW+K 
Subjt:  GCLNRSLVMSLDYAVERLEGANSVLQQNWEQNCHITAHCNSYSYAYYELSSLRNPTVIIQPDRYMGLTETQTIISDDDVEDPLSFNQAMNDVDKDQWIKV

Query:  MDLEMESMYFNSIWDLVEQPDG------------------------------GFTQREGVDFEETFSPVAMLKSIRILLSIATYYDYEIWQMD-------
        MDLEMESMYFNS+W+LV+ P+G                              G+T++EGVD+EETFS VAMLKSIRILLSIA +YDYEIWQMD       
Subjt:  MDLEMESMYFNSIWDLVEQPDG------------------------------GFTQREGVDFEETFSPVAMLKSIRILLSIATYYDYEIWQMD-------

Query:  -------------GF-EQGDEQKVCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVYKKIVKSTVAFLVLYVDDILLIGNNVGFLTDVKQWLA
                     GF  QG EQKVCKL RSIYGLKQASRSWNIRFDTAIK+YGF+QNV EPCVYKKI K  VAFLVLYVDDILLIGN+VG+LTDVK WLA
Subjt:  -------------GF-EQGDEQKVCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVYKKIVKSTVAFLVLYVDDILLIGNNVGFLTDVKQWLA

Query:  TQFK
         QF+
Subjt:  TQFK

A0A5A7TZD0 Gag/pol protein4.4e-12332.34Show/hide
Query:  TGR-WRAKEPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSEALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQDYLIEHEIT
        TG+ +RAKEPLEL+HSDLCG MNVKARGG+EYF+SFIDDYSRY YLYLM HKSEALEKFKEYKTEVEN L K IK LRSD+GGEYMDL+FQDY+IEH I 
Subjt:  TGR-WRAKEPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSEALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQDYLIEHEIT

Query:  SQLSAPATPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILNMVYVK--------VWKGRK-----------------------------
        SQLSAP TPQQNGVSE+RNRTLLDMVRSMMSYAQLP SFWGYAVET V+ILN V  K        +W+GRK                             
Subjt:  SQLSAPATPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILNMVYVK--------VWKGRK-----------------------------

Query:  ----GYPKETKGGIFYDPQDNKVIVSTNATFLEEDHIKDHRPRSKLVLDEISRKTTKESTKVVDQAGPSKVVDQT------HPSQIVREPRRSGRVIDNC
            GYPKET+GG+F+DPQ+N+V VSTNATFLEEDH+++H+PRSKLVL E     T EST+VVD+ GPS  VD+T      HPSQ +R PRRSGRV+   
Subjt:  ----GYPKETKGGIFYDPQDNKVIVSTNATFLEEDHIKDHRPRSKLVLDEISRKTTKESTKVVDQAGPSKVVDQT------HPSQIVREPRRSGRVIDNC

Query:  PKATAPPPQPPRTEPPQSAAQPPRTVAVVFLLASWFSLAVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPCRVTPSKCSIRFE
                                                                                                            
Subjt:  PKATAPPPQPPRTEPPQSAAQPPRTVAVVFLLASWFSLAVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPCRVTPSKCSIRFE

Query:  TLQLGYPLPKERSNTLLEVRITRLSESESVCSLVIKLDALLDTKHVVLMFIKRDAEITCGILLWWLFVGWWLTGCELLTSTVVVLIPLPFISPNISDYAG
                                                                                                            
Subjt:  TLQLGYPLPKERSNTLLEVRITRLSESESVCSLVIKLDALLDTKHVVLMFIKRDAEITCGILLWWLFVGWWLTGCELLTSTVVVLIPLPFISPNISDYAG

Query:  DELYVPGDVDEEVTRGDHRLFCEVVEWIESIRSVLSLQVRELDSGVIVRMFINDPSCAAIQNLGMVSENAFDEKKVVERNALEKKKLRSTSIIQTTAPPP
                                                                                                            
Subjt:  DELYVPGDVDEEVTRGDHRLFCEVVEWIESIRSVLSLQVRELDSGVIVRMFINDPSCAAIQNLGMVSENAFDEKKVVERNALEKKKLRSTSIIQTTAPPP

Query:  QPPRTEPPQSAAQPPRPAAVVFLLAPWFSLPVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPSPSKCSIRFETLQLGYPLPKE
                                                                                                            
Subjt:  QPPRTEPPQSAAQPPRPAAVVFLLAPWFSLPVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPSPSKCSIRFETLQLGYPLPKE

Query:  RSNTLLEFVCSWKLVLGACRAIPASVDYAGDELYVPGDVDEEVLLLVGDARSDSYPVCSCSVKLWSGSNQFRSVVEFNRCSKILGLSYVRRYAAEIFGAL
                                                                                                            
Subjt:  RSNTLLEFVCSWKLVLGACRAIPASVDYAGDELYVPGDVDEEVLLLVGDARSDSYPVCSCSVKLWSGSNQFRSVVEFNRCSKILGLSYVRRYAAEIFGAL

Query:  GCLNRSLVMSLDYAVERLEGANSVLQQNWEQNCHITAHCNSYSYAYYELSSLRNPTVIIQPDRYMGLTETQTIISDDDVEDPLSFNQAMNDVDKDQWIKV
                                                                   QP+RY+GLTETQ +I DD VEDPLS+ QAMNDVDKDQW+K 
Subjt:  GCLNRSLVMSLDYAVERLEGANSVLQQNWEQNCHITAHCNSYSYAYYELSSLRNPTVIIQPDRYMGLTETQTIISDDDVEDPLSFNQAMNDVDKDQWIKV

Query:  MDLEMESMYFNSIWDLVEQPDG------------------------------GFTQREGVDFEETFSPVAMLKSIRILLSIATYYDYEIWQMD-------
        MDLEMESMYFNS+W+LV+ P+G                              G+TQREGVD+EETFSPVAMLKSIRILLSIAT+YDYEIWQMD       
Subjt:  MDLEMESMYFNSIWDLVEQPDG------------------------------GFTQREGVDFEETFSPVAMLKSIRILLSIATYYDYEIWQMD-------

Query:  -------------GF-EQGDEQKVCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVYKKIVKSTVAFLVLYVDDILLIGNNVGFLTDVKQWLA
                     GF  QG EQKVCKL RSIYGLKQASRSWNIRFDTAIK+YGF+QNV EPCVYKKI K  VAFLVLYVDDILLIGN+VG+LTDVK WLA
Subjt:  -------------GF-EQGDEQKVCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVYKKIVKSTVAFLVLYVDDILLIGNNVGFLTDVKQWLA

Query:  TQFK
         QF+
Subjt:  TQFK

A0A5A7UYE8 Gag/pol protein4.4e-12332.34Show/hide
Query:  TGR-WRAKEPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSEALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQDYLIEHEIT
        TG+ +RAKEPLEL+HSDLCG MNVKARGG+EYF+SFIDDYSRY YLYLM HKSEALEKFKEYKTEVEN L K IK LRSD+GGEYMDL+FQDY+IEH I 
Subjt:  TGR-WRAKEPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSEALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQDYLIEHEIT

Query:  SQLSAPATPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILNMVYVK--------VWKGRK-----------------------------
        SQLSAP TPQQNGVSE+RNRTLLDMVRSMMSYAQLP SFWGYAVET V+ILN V  K        +W+GRK                             
Subjt:  SQLSAPATPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILNMVYVK--------VWKGRK-----------------------------

Query:  ----GYPKETKGGIFYDPQDNKVIVSTNATFLEEDHIKDHRPRSKLVLDEISRKTTKESTKVVDQAGPSKVVDQT------HPSQIVREPRRSGRVIDNC
            GYPKET+GG+F+DPQ+N+V VSTNATFLEEDH+++H+PRSKLVL E     T EST+VVD+ GPS  VD+T      HPSQ +R PRRSGRV+   
Subjt:  ----GYPKETKGGIFYDPQDNKVIVSTNATFLEEDHIKDHRPRSKLVLDEISRKTTKESTKVVDQAGPSKVVDQT------HPSQIVREPRRSGRVIDNC

Query:  PKATAPPPQPPRTEPPQSAAQPPRTVAVVFLLASWFSLAVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPCRVTPSKCSIRFE
                                                                                                            
Subjt:  PKATAPPPQPPRTEPPQSAAQPPRTVAVVFLLASWFSLAVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPCRVTPSKCSIRFE

Query:  TLQLGYPLPKERSNTLLEVRITRLSESESVCSLVIKLDALLDTKHVVLMFIKRDAEITCGILLWWLFVGWWLTGCELLTSTVVVLIPLPFISPNISDYAG
                                                                                                            
Subjt:  TLQLGYPLPKERSNTLLEVRITRLSESESVCSLVIKLDALLDTKHVVLMFIKRDAEITCGILLWWLFVGWWLTGCELLTSTVVVLIPLPFISPNISDYAG

Query:  DELYVPGDVDEEVTRGDHRLFCEVVEWIESIRSVLSLQVRELDSGVIVRMFINDPSCAAIQNLGMVSENAFDEKKVVERNALEKKKLRSTSIIQTTAPPP
                                                                                                            
Subjt:  DELYVPGDVDEEVTRGDHRLFCEVVEWIESIRSVLSLQVRELDSGVIVRMFINDPSCAAIQNLGMVSENAFDEKKVVERNALEKKKLRSTSIIQTTAPPP

Query:  QPPRTEPPQSAAQPPRPAAVVFLLAPWFSLPVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPSPSKCSIRFETLQLGYPLPKE
                                                                                                            
Subjt:  QPPRTEPPQSAAQPPRPAAVVFLLAPWFSLPVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPSPSKCSIRFETLQLGYPLPKE

Query:  RSNTLLEFVCSWKLVLGACRAIPASVDYAGDELYVPGDVDEEVLLLVGDARSDSYPVCSCSVKLWSGSNQFRSVVEFNRCSKILGLSYVRRYAAEIFGAL
                                                                                                            
Subjt:  RSNTLLEFVCSWKLVLGACRAIPASVDYAGDELYVPGDVDEEVLLLVGDARSDSYPVCSCSVKLWSGSNQFRSVVEFNRCSKILGLSYVRRYAAEIFGAL

Query:  GCLNRSLVMSLDYAVERLEGANSVLQQNWEQNCHITAHCNSYSYAYYELSSLRNPTVIIQPDRYMGLTETQTIISDDDVEDPLSFNQAMNDVDKDQWIKV
                                                                   QP+RY+GLTETQ +I DD VEDPLS+ QAMNDVDKDQW+K 
Subjt:  GCLNRSLVMSLDYAVERLEGANSVLQQNWEQNCHITAHCNSYSYAYYELSSLRNPTVIIQPDRYMGLTETQTIISDDDVEDPLSFNQAMNDVDKDQWIKV

Query:  MDLEMESMYFNSIWDLVEQPDG------------------------------GFTQREGVDFEETFSPVAMLKSIRILLSIATYYDYEIWQMD-------
        MDLEMESMYFNS+W+LV+ P+G                              G+TQREGVD+EETFSPVAMLKSIRILLSIAT+YDYEIWQMD       
Subjt:  MDLEMESMYFNSIWDLVEQPDG------------------------------GFTQREGVDFEETFSPVAMLKSIRILLSIATYYDYEIWQMD-------

Query:  -------------GF-EQGDEQKVCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVYKKIVKSTVAFLVLYVDDILLIGNNVGFLTDVKQWLA
                     GF  QG EQKVCKL RSIYGLKQASRSWNIRFDTAIK+YGF+QNV EPCVYKKI K  VAFLVLYVDDILLIGN+VG+LTDVK WLA
Subjt:  -------------GF-EQGDEQKVCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVYKKIVKSTVAFLVLYVDDILLIGNNVGFLTDVKQWLA

Query:  TQFK
         QF+
Subjt:  TQFK

A0A5D3CYF4 Gag/pol protein9.2e-12131.97Show/hide
Query:  TGR-WRAKEPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSEALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQDYLIEHEIT
        TG+ +RAKEPL+L+HSDLCG MNVKARG +EYF+SFIDDYSRY YLYLM HKSEALEKFKEYK EVEN L K IK LRSD+GGEYMDL+FQDY+IEH I 
Subjt:  TGR-WRAKEPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSEALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQDYLIEHEIT

Query:  SQLSAPATPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILNMVYVK--------VWKGRK-----------------------------
        SQLSAP TPQQNGVSE+RNRTLLDMV SMMSYAQLP SFWGYAVET V+ILN V  K        +W+GRK                             
Subjt:  SQLSAPATPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILNMVYVK--------VWKGRK-----------------------------

Query:  ----GYPKETKGGIFYDPQDNKVIVSTNATFLEEDHIKDHRPRSKLVLDEISRKTTKESTKVVDQAGPSKVVDQT------HPSQIVREPRRSGRVIDNC
            GYPKET+GG+F+DPQ+N+V VSTNATFLEEDH+++H+PRSKLVL E     T EST+VVD+ GPS  VD+T      HPSQ +R PRRSGRV+   
Subjt:  ----GYPKETKGGIFYDPQDNKVIVSTNATFLEEDHIKDHRPRSKLVLDEISRKTTKESTKVVDQAGPSKVVDQT------HPSQIVREPRRSGRVIDNC

Query:  PKATAPPPQPPRTEPPQSAAQPPRTVAVVFLLASWFSLAVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPCRVTPSKCSIRFE
                                                                                                            
Subjt:  PKATAPPPQPPRTEPPQSAAQPPRTVAVVFLLASWFSLAVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPCRVTPSKCSIRFE

Query:  TLQLGYPLPKERSNTLLEVRITRLSESESVCSLVIKLDALLDTKHVVLMFIKRDAEITCGILLWWLFVGWWLTGCELLTSTVVVLIPLPFISPNISDYAG
                                                                                                            
Subjt:  TLQLGYPLPKERSNTLLEVRITRLSESESVCSLVIKLDALLDTKHVVLMFIKRDAEITCGILLWWLFVGWWLTGCELLTSTVVVLIPLPFISPNISDYAG

Query:  DELYVPGDVDEEVTRGDHRLFCEVVEWIESIRSVLSLQVRELDSGVIVRMFINDPSCAAIQNLGMVSENAFDEKKVVERNALEKKKLRSTSIIQTTAPPP
                                                                                                            
Subjt:  DELYVPGDVDEEVTRGDHRLFCEVVEWIESIRSVLSLQVRELDSGVIVRMFINDPSCAAIQNLGMVSENAFDEKKVVERNALEKKKLRSTSIIQTTAPPP

Query:  QPPRTEPPQSAAQPPRPAAVVFLLAPWFSLPVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPSPSKCSIRFETLQLGYPLPKE
                                                                                                            
Subjt:  QPPRTEPPQSAAQPPRPAAVVFLLAPWFSLPVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPSPSKCSIRFETLQLGYPLPKE

Query:  RSNTLLEFVCSWKLVLGACRAIPASVDYAGDELYVPGDVDEEVLLLVGDARSDSYPVCSCSVKLWSGSNQFRSVVEFNRCSKILGLSYVRRYAAEIFGAL
                                                                                                            
Subjt:  RSNTLLEFVCSWKLVLGACRAIPASVDYAGDELYVPGDVDEEVLLLVGDARSDSYPVCSCSVKLWSGSNQFRSVVEFNRCSKILGLSYVRRYAAEIFGAL

Query:  GCLNRSLVMSLDYAVERLEGANSVLQQNWEQNCHITAHCNSYSYAYYELSSLRNPTVIIQPDRYMGLTETQTIISDDDVEDPLSFNQAMNDVDKDQWIKV
                                                                   QP+RY+GLTETQ +I DD VEDPLS+ QAMNDVDKDQW+K 
Subjt:  GCLNRSLVMSLDYAVERLEGANSVLQQNWEQNCHITAHCNSYSYAYYELSSLRNPTVIIQPDRYMGLTETQTIISDDDVEDPLSFNQAMNDVDKDQWIKV

Query:  MDLEMESMYFNSIWDLVEQPDG------------------------------GFTQREGVDFEETFSPVAMLKSIRILLSIATYYDYEIWQMD-------
        MDLEMESMYFNS+W+LV+ P+G                              G+TQREGVD+EETFSPVAMLKSIRILLSIAT+YDYEIWQMD       
Subjt:  MDLEMESMYFNSIWDLVEQPDG------------------------------GFTQREGVDFEETFSPVAMLKSIRILLSIATYYDYEIWQMD-------

Query:  -------------GF-EQGDEQKVCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVYKKIVKSTVAFLVLYVDDILLIGNNVGFLTDVKQWLA
                     GF  QG EQKVCKL RSIYGLKQASRSWNIRFDTAIK+YGF+QNV EPCVYKKI K  VAFLVLYVDDILLIGN+VG+LTDVK WLA
Subjt:  -------------GF-EQGDEQKVCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVYKKIVKSTVAFLVLYVDDILLIGNNVGFLTDVKQWLA

Query:  TQFK
         QF+
Subjt:  TQFK

A0A5D3CZY3 Gag/pol protein1.5e-11831.52Show/hide
Query:  TGR-WRAKEPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSEALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQDYLIEHEIT
        TG+ +RAKEPLEL+HSDLCG MNVKARGG+EYF+SFIDDYSRY YLYLM HKSEALEKFKEYK EVEN L K IK LRSD+GGEYMDL+FQDY+IEH I 
Subjt:  TGR-WRAKEPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSEALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQDYLIEHEIT

Query:  SQLSAPATPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILNMVYVK--------VWKGRK-----------------------------
         QLSAP TPQQNGV E+RNRT+LDMVRSMMSYAQLP SFWGYAVET V+ILN V  K        +W+GRK                             
Subjt:  SQLSAPATPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILNMVYVK--------VWKGRK-----------------------------

Query:  ----GYPKETKGGIFYDPQDNKVIVSTNATFLEEDHIKDHRPRSKLVLDEISRKTTKESTKVVDQAGPSKVVDQT------HPSQIVREPRRSGRVIDNC
            GYPKET+GG+F+DPQ N+V+VSTNATFLEEDH++DH+P++KLVL+E       EST+VVD+ GPS  V++T      HPSQ +R PRRSGR++   
Subjt:  ----GYPKETKGGIFYDPQDNKVIVSTNATFLEEDHIKDHRPRSKLVLDEISRKTTKESTKVVDQAGPSKVVDQT------HPSQIVREPRRSGRVIDNC

Query:  PKATAPPPQPPRTEPPQSAAQPPRTVAVVFLLASWFSLAVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPCRVTPSKCSIRFE
                                                                                                            
Subjt:  PKATAPPPQPPRTEPPQSAAQPPRTVAVVFLLASWFSLAVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPCRVTPSKCSIRFE

Query:  TLQLGYPLPKERSNTLLEVRITRLSESESVCSLVIKLDALLDTKHVVLMFIKRDAEITCGILLWWLFVGWWLTGCELLTSTVVVLIPLPFISPNISDYAG
                                                                                                            
Subjt:  TLQLGYPLPKERSNTLLEVRITRLSESESVCSLVIKLDALLDTKHVVLMFIKRDAEITCGILLWWLFVGWWLTGCELLTSTVVVLIPLPFISPNISDYAG

Query:  DELYVPGDVDEEVTRGDHRLFCEVVEWIESIRSVLSLQVRELDSGVIVRMFINDPSCAAIQNLGMVSENAFDEKKVVERNALEKKKLRSTSIIQTTAPPP
                                                                                                            
Subjt:  DELYVPGDVDEEVTRGDHRLFCEVVEWIESIRSVLSLQVRELDSGVIVRMFINDPSCAAIQNLGMVSENAFDEKKVVERNALEKKKLRSTSIIQTTAPPP

Query:  QPPRTEPPQSAAQPPRPAAVVFLLAPWFSLPVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPSPSKCSIRFETLQLGYPLPKE
                                                                                                            
Subjt:  QPPRTEPPQSAAQPPRPAAVVFLLAPWFSLPVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPSPSKCSIRFETLQLGYPLPKE

Query:  RSNTLLEFVCSWKLVLGACRAIPASVDYAGDELYVPGDVDEEVLLLVGDARSDSYPVCSCSVKLWSGSNQFRSVVEFNRCSKILGLSYVRRYAAEIFGAL
                                                                                                            
Subjt:  RSNTLLEFVCSWKLVLGACRAIPASVDYAGDELYVPGDVDEEVLLLVGDARSDSYPVCSCSVKLWSGSNQFRSVVEFNRCSKILGLSYVRRYAAEIFGAL

Query:  GCLNRSLVMSLDYAVERLEGANSVLQQNWEQNCHITAHCNSYSYAYYELSSLRNPTVIIQPDRYMGLTETQTIISDDDVEDPLSFNQAMNDVDKDQWIKV
                                                                   QP+RY+GLTETQ +I DD VEDPLS+NQAMNDVDKDQW+K 
Subjt:  GCLNRSLVMSLDYAVERLEGANSVLQQNWEQNCHITAHCNSYSYAYYELSSLRNPTVIIQPDRYMGLTETQTIISDDDVEDPLSFNQAMNDVDKDQWIKV

Query:  MDLEMESMYFNSIWDLVEQPDG------------------------------GFTQREGVDFEETFSPVAMLKSIRILLSIATYYDYEIWQMD-------
        MDLEMESMYFN +W+LV+ P+G                              G+TQREGVD+EETFSPVAMLKSIRILLSIAT+YDYEIW+MD       
Subjt:  MDLEMESMYFNSIWDLVEQPDG------------------------------GFTQREGVDFEETFSPVAMLKSIRILLSIATYYDYEIWQMD-------

Query:  -------------GF-EQGDEQKVCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVYKKIVKSTVAFLVLYVDDILLIGNNVGFLTDVKQWLA
                     GF  QG EQKVCKL RSIYGLKQASRSWNIRFDTAIK+YGFEQNV EPCVYKKI K  V FLVLYVDDILLIGN+VG+LTDVK WLA
Subjt:  -------------GF-EQGDEQKVCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVYKKIVKSTVAFLVLYVDDILLIGNNVGFLTDVKQWLA

Query:  TQFK
         QF+
Subjt:  TQFK

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.1e-2230Show/hide
Query:  KEPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSEALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQDYLIEHEITSQLSAPA
        K PL +VHSD+CG +         YFV F+D ++ Y   YL+ +KS+    F+++  + E H    +  L  D G EY+  + + + ++  I+  L+ P 
Subjt:  KEPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSEALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQDYLIEHEITSQLSAPA

Query:  TPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILNMV-----------YVKVWKGRKGYPKETK--GGIFYDPQDNKVIVSTNATFLEED
        TPQ NGVSE+  RT+ +  R+M+S A+L  SFWG AV T  Y++N +             ++W  +K Y K  +  G   Y    NK     + +F  + 
Subjt:  TPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILNMV-----------YVKVWKGRKGYPKETK--GGIFYDPQDNKVIVSTNATFLEED

Query:  HIKDHRPRSKLVLDEISRKTTKESTKVVDQ
            + P    + D ++ K       VVD+
Subjt:  HIKDHRPRSKLVLDEISRKTTKESTKVVDQ

P04146 Copia protein2.0e-1927.23Show/hide
Query:  PLSFNQAMNDVDKDQWIKVMDLEMESMYFNSIWDLVEQPDG------------------------------GFTQREGVDFEETFSPVAMLKSIRILLSI
        P SF++     DK  W + ++ E+ +   N+ W + ++P+                               GFTQ+  +D+EETF+PVA + S R +LS+
Subjt:  PLSFNQAMNDVDKDQWIKVMDLEMESMYFNSIWDLVEQPDG------------------------------GFTQREGVDFEETFSPVAMLKSIRILLSI

Query:  ATYYDYEIWQMD---GFEQG----------------DEQKVCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVY---KKIVKSTVAFLVLYVD
           Y+ ++ QMD    F  G                +   VCKL ++IYGLKQA+R W   F+ A+K   F  + V+ C+Y   K  +   + +++LYVD
Subjt:  ATYYDYEIWQMD---GFEQG----------------DEQKVCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVY---KKIVKSTVAFLVLYVD

Query:  DILLIGNNVGFLTDVKQWLATQFK
        D+++   ++  + + K++L  +F+
Subjt:  DILLIGNNVGFLTDVKQWLATQFK

P0C2I6 Transposon Ty1-LR3 Gag-Pol polyprotein1.9e-1425.63Show/hide
Query:  EPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSE--ALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQDYLIEHEITSQLSAP
        EP + +H+D+ G ++   +    YF+SF D+ +++ ++Y +H + E   L+ F      ++N    ++  ++ D+G EY +     +L ++ IT   +  
Subjt:  EPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSE--ALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQDYLIEHEITSQLSAP

Query:  ATPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILNMVYVKVWKGRKGYPKETKGGIFYDPQDNKVIVSTNATFLEEDHIKDHRPRSKL
        A  + +GV+E+ NRTLLD  R+ +  + LP+  W  A+E +  + N +           PK  K    +      + +ST   F +   + DH P SK+
Subjt:  ATPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILNMVYVKVWKGRKGYPKETKGGIFYDPQDNKVIVSTNATFLEEDHIKDHRPRSKL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.4e-3343.59Show/hide
Query:  RSTMATGRWRAKEPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSEALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQDYLIE
        R +  T   R    L+LV+SD+CG M +++ GG +YFV+FIDD SR  ++Y++  K +  + F+++   VE   G+ +K LRSD GGEY   +F++Y   
Subjt:  RSTMATGRWRAKEPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSEALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQDYLIE

Query:  HEITSQLSAPATPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILN
        H I  + + P TPQ NGV+E+ NRT+++ VRSM+  A+LP SFWG AV+T  Y++N
Subjt:  HEITSQLSAPATPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.3e-2733.62Show/hide
Query:  IISDDDVEDPLSFNQAMNDVDKDQWIKVMDLEMESMYFNSIWDLVEQPDG------------------------------GFTQREGVDFEETFSPVAML
        +ISDD   +P S  + ++  +K+Q +K M  EMES+  N  + LVE P G                              GF Q++G+DF+E FSPV  +
Subjt:  IISDDDVEDPLSFNQAMNDVDKDQWIKVMDLEMESMYFNSIWDLVEQPDG------------------------------GFTQREGVDFEETFSPVAML

Query:  KSIRILLSIATYYDYEIWQMD--------------------GFE-QGDEQKVCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVY-KKIVKST
         SIR +LS+A   D E+ Q+D                    GFE  G +  VCKL +S+YGLKQA R W ++FD+ +K+  + +   +PCVY K+  ++ 
Subjt:  KSIRILLSIATYYDYEIWQMD--------------------GFE-QGDEQKVCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVY-KKIVKST

Query:  VAFLVLYVDDILLIGNNVGFLTDVKQWLATQF
           L+LYVDD+L++G + G +  +K  L+  F
Subjt:  VAFLVLYVDDILLIGNNVGFLTDVKQWLATQF

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.3e-2031.68Show/hide
Query:  VTDSQRSTMATGRWRAKEPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSEALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQ
        +  S +   +     +  PLE ++SD+     + +   Y Y+V F+D ++RY +LY +  KS+  E F  +K  +EN     I T  SD GGE++ L   
Subjt:  VTDSQRSTMATGRWRAKEPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSEALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQ

Query:  DYLIEHEITSQLSAPATPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILN
        +Y  +H I+   S P TP+ NG+SE+++R +++   +++S+A +P ++W YA    VY++N
Subjt:  DYLIEHEITSQLSAPATPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILN

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.1e-1729.02Show/hide
Query:  DPLSFNQAMNDVDKDQWIKVMDLEMESMYFNSIWDLVEQP-------------------DG------------GFTQREGVDFEETFSPVAMLKSIRILL
        +P +  QA+ D   ++W   M  E+ +   N  WDLV  P                   DG            G+ QR G+D+ ETFSPV    SIRI+L
Subjt:  DPLSFNQAMNDVDKDQWIKVMDLEMESMYFNSIWDLVEQP-------------------DG------------GFTQREGVDFEETFSPVAMLKSIRILL

Query:  SIATYYDYEIWQMD--------------------GFEQGDEQK-VCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVYKKIVKSTVAFLVLYV
         +A    + I Q+D                    GF   D    VCKL++++YGLKQA R+W +     + T GF  +V +  ++      ++ ++++YV
Subjt:  SIATYYDYEIWQMD--------------------GFEQGDEQK-VCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVYKKIVKSTVAFLVLYV

Query:  DDILLIGNNVGFLTDVKQWLATQF
        DDIL+ GN+   L +    L+ +F
Subjt:  DDILLIGNNVGFLTDVKQWLATQF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.5e-2234.16Show/hide
Query:  VTDSQRSTMATGRWRAKEPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSEALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQ
        +  S +   +     + +PLE ++SD+     + +   Y Y+V F+D ++RY +LY +  KS+  + F  +K+ VEN     I TL SD GGE++ L  +
Subjt:  VTDSQRSTMATGRWRAKEPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSEALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQ

Query:  DYLIEHEITSQLSAPATPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILN
        DYL +H I+   S P TP+ NG+SE+++R +++M  +++S+A +P ++W YA    VY++N
Subjt:  DYLIEHEITSQLSAPATPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.8e-1829.91Show/hide
Query:  DPLSFNQAMNDVDKDQWIKVMDLEMESMYFNSIWDLVEQP-------------------DG------------GFTQREGVDFEETFSPVAMLKSIRILL
        +P +  QAM D   D+W + M  E+ +   N  WDLV  P                   DG            G+ QR G+D+ ETFSPV    SIRI+L
Subjt:  DPLSFNQAMNDVDKDQWIKVMDLEMESMYFNSIWDLVEQP-------------------DG------------GFTQREGVDFEETFSPVAMLKSIRILL

Query:  SIATYYDYEIWQMD--------------------GFEQGDE-QKVCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVYKKIVKSTVAFLVLYV
         +A    + I Q+D                    GF   D    VC+L+++IYGLKQA R+W +   T + T GF  ++ +  ++      ++ ++++YV
Subjt:  SIATYYDYEIWQMD--------------------GFEQGDE-QKVCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVYKKIVKSTVAFLVLYV

Query:  DDILLIGNNVGFLTDVKQWLATQF
        DDIL+ GN+   L      L+ +F
Subjt:  DDILLIGNNVGFLTDVKQWLATQF

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.1e-2129.26Show/hide
Query:  EDPLSFNQAMNDVDKDQWIKVMDLEMESMYFNSIWDLVEQP------------------DG------------GFTQREGVDFEETFSPVAMLKSIRILL
        ++P ++N+A   +    W   MD E+ +M     W++   P                  DG            G+TQ+EG+DF ETFSPV  L S++++L
Subjt:  EDPLSFNQAMNDVDKDQWIKVMDLEMESMYFNSIWDLVEQP------------------DG------------GFTQREGVDFEETFSPVAMLKSIRILL

Query:  SIATYYDYEIWQMD----------------------GFEQGDE---QKVCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVYKKIVKSTVAFL
        +I+  Y++ + Q+D                         QGD      VC LK+SIYGLKQASR W ++F   +  +GF Q+  +   + KI  +    +
Subjt:  SIATYYDYEIWQMD----------------------GFEQGDE---QKVCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVYKKIVKSTVAFL

Query:  VLYVDDILLIGNNVGFLTDVKQWLATQFK
        ++YVDDI++  NN   + ++K  L + FK
Subjt:  VLYVDDILLIGNNVGFLTDVKQWLATQFK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGCGGCTGGTCGGCAGAGGTCGACGGTGACTGATAGTCAGAGGTCAACGATGGCGACCGGTCGGTGGAGGGCCAAAGAACCTTTAGAACTCGTGCATTCAGATCT
CTGTGGTCTTATGAATGTCAAGGCACGAGGAGGGTATGAATATTTCGTCAGCTTCATTGATGATTATTCAAGGTATGACTACCTTTACCTAATGCATCACAAGTCTGAAG
CCCTCGAAAAGTTCAAAGAGTATAAGACCGAGGTTGAGAACCACTTAGGTAAAACGATTAAAACACTACGATCAGATCAAGGTGGAGAATATATGGACCTACAATTCCAA
GACTATTTGATAGAACATGAAATCACGTCCCAACTCTCAGCCCCTGCTACACCACAACAAAACGGTGTATCAGAGAAGAGAAACCGAACCTTGTTAGACATGGTTCGATC
AATGATGAGCTATGCTCAGTTGCCTGATTCGTTCTGGGGATATGCAGTTGAGACTACCGTATACATTTTGAACATGGTTTACGTAAAGGTATGGAAAGGGCGTAAAGGAT
ATCCTAAAGAAACGAAAGGTGGTATATTTTACGACCCTCAGGATAACAAAGTGATTGTATCGACAAACGCTACTTTCCTTGAGGAAGATCACATAAAAGATCATCGACCT
CGCAGTAAACTAGTATTAGATGAGATTTCAAGAAAAACTACAAAAGAATCTACAAAAGTTGTAGATCAAGCTGGTCCATCAAAAGTTGTTGATCAAACACATCCATCTCA
AATAGTGAGAGAGCCTCGACGTAGTGGGAGGGTTATTGATAACTGCCCAAAAGCAACCGCGCCGCCGCCGCAGCCTCCACGCACAGAGCCGCCGCAGTCCGCCGCCCAGC
CGCCTCGCACAGTCGCCGTCGTCTTCCTCCTCGCGTCGTGGTTTTCGCTTGCCGTTGGTCTCGTCTCACGCGCGCCGCCGCTCTTTTTCCCCTCTTTTCCTTGCGTTTCA
ACAAGATTCGCGCGCGTCCAGCAGTCCGAGCCTCGCTTTTGTGCGATTTTGCTTCTGTCCAGCAAGCGATTTGGCCTCGAATCTCCTTGTCGGGTAACGCCGTCTAAGTG
TTCGATAAGGTTCGAAACACTTCAGCTTGGATACCCATTGCCCAAGGAGCGTTCTAACACGTTGTTAGAGGTTAGAATAACCCGCCTTTCTGAAAGCGAGTCCGTTTGCT
CTCTTGTGATCAAACTTGATGCTTTGTTGGATACTAAGCATGTTGTGTTAATGTTCATAAAGCGTGACGCGGAAATCACGTGTGGGATCTTGTTGTGGTGGTTGTTTGTT
GGTTGGTGGCTCACCGGGTGTGAGCTACTTACCAGTACGGTGGTTGTACTGATACCCCTCCCCTTTATTTCCCCCAACATTTCAGATTATGCAGGTGATGAGCTGTATGT
GCCCGGTGATGTAGACGAGGAGGTCACGAGGGGTGACCACAGGTTGTTCTGTGAAGTTGTGGAGTGGATCGAATCAATTAGAAGTGTTTTGAGTTTACAGGTTCGAGAAC
TTGATTCAGGAGTCATTGTACGGATGTTTATCAATGATCCTTCTTGTGCCGCAATTCAAAACCTTGGCATGGTATCTGAAAATGCTTTCGATGAGAAGAAAGTTGTTGAA
AGAAATGCCTTGGAAAAGAAAAAGTTGAGATCAACGTCAATCATTCAAACAACCGCGCCGCCGCCGCAGCCTCCACGCACAGAGCCGCCGCAGTCCGCCGCCCAGCCGCC
TCGCCCAGCCGCCGTCGTCTTCCTCCTCGCGCCGTGGTTTTCGCTTCCCGTTGGTCTCGTCTCACGCGCGCCGCCGCTCTTTTTCCCCTCTTTTCCTTGCGTTTCAACAA
GATTCGCGCGCGTCCAGCAGTCTGAGCCTCGCTTTTGTGCGATTTTGCTTCTGTCCAGCAAGCGATTTGGCCTCGAATCTCCTTCGCCGTCTAAGTGTTCGATAAGGTTC
GAAACACTTCAGCTTGGATACCCATTGCCCAAGGAGCGTTCTAACACGTTGTTAGAGTTTGTTTGTAGCTGGAAGCTCGTTTTGGGAGCGTGTCGTGCCATTCCAGCTAG
CGTTGATTATGCAGGTGATGAGCTGTATGTGCCCGGTGATGTAGACGAGGAGGTTCTGTTGTTGGTTGGAGATGCGAGATCTGACTCGTATCCTGTTTGCAGTTGTTCTG
TGAAGTTGTGGAGTGGATCGAATCAATTTAGAAGTGTTGTTGAGTTTAACAGGTGTTCTAAAATTTTGGGTTTGTCCTATGTACGCCGTTATGCTGCCGAAATTTTCGGA
GCTCTCGGTTGTTTGAATAGGAGCTTAGTAATGTCACTAGATTATGCTGTTGAGCGACTGGAGGGAGCAAATTCTGTGTTGCAGCAAAACTGGGAACAGAATTGCCACAT
CACAGCTCATTGTAATTCGTATTCCTATGCCTATTATGAACTAAGTAGCTTAAGGAACCCAACAGTTATAATACAACCCGACCGCTACATGGGTTTAACAGAAACTCAAA
CCATCATATCTGATGATGATGTAGAGGATCCATTGTCCTTTAATCAGGCAATGAATGATGTTGATAAGGACCAATGGATCAAAGTCATGGATCTCGAAATGGAGTCAATG
TACTTCAATTCGATCTGGGATCTTGTAGAGCAGCCGGATGGGGGTTTTACCCAACGGGAAGGAGTGGATTTTGAAGAAACTTTTTCCCCTGTTGCCATGCTTAAGTCGAT
TAGAATACTCTTATCCATTGCCACATATTATGACTATGAAATATGGCAAATGGATGGCTTTGAGCAGGGTGATGAGCAAAAGGTTTGCAAACTTAAACGATCCATTTATG
GATTAAAACAAGCATCAAGATCCTGGAATATACGTTTTGACACTGCGATCAAAACGTATGGCTTTGAGCAGAATGTTGTCGAACCTTGTGTTTACAAGAAAATCGTCAAA
TCCACTGTAGCTTTTCTAGTTCTATACGTAGATGATATCCTACTTATTGGGAATAATGTAGGATTCCTGACTGACGTAAAACAATGGCTAGCGACTCAATTCAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGGCGGCTGGTCGGCAGAGGTCGACGGTGACTGATAGTCAGAGGTCAACGATGGCGACCGGTCGGTGGAGGGCCAAAGAACCTTTAGAACTCGTGCATTCAGATCT
CTGTGGTCTTATGAATGTCAAGGCACGAGGAGGGTATGAATATTTCGTCAGCTTCATTGATGATTATTCAAGGTATGACTACCTTTACCTAATGCATCACAAGTCTGAAG
CCCTCGAAAAGTTCAAAGAGTATAAGACCGAGGTTGAGAACCACTTAGGTAAAACGATTAAAACACTACGATCAGATCAAGGTGGAGAATATATGGACCTACAATTCCAA
GACTATTTGATAGAACATGAAATCACGTCCCAACTCTCAGCCCCTGCTACACCACAACAAAACGGTGTATCAGAGAAGAGAAACCGAACCTTGTTAGACATGGTTCGATC
AATGATGAGCTATGCTCAGTTGCCTGATTCGTTCTGGGGATATGCAGTTGAGACTACCGTATACATTTTGAACATGGTTTACGTAAAGGTATGGAAAGGGCGTAAAGGAT
ATCCTAAAGAAACGAAAGGTGGTATATTTTACGACCCTCAGGATAACAAAGTGATTGTATCGACAAACGCTACTTTCCTTGAGGAAGATCACATAAAAGATCATCGACCT
CGCAGTAAACTAGTATTAGATGAGATTTCAAGAAAAACTACAAAAGAATCTACAAAAGTTGTAGATCAAGCTGGTCCATCAAAAGTTGTTGATCAAACACATCCATCTCA
AATAGTGAGAGAGCCTCGACGTAGTGGGAGGGTTATTGATAACTGCCCAAAAGCAACCGCGCCGCCGCCGCAGCCTCCACGCACAGAGCCGCCGCAGTCCGCCGCCCAGC
CGCCTCGCACAGTCGCCGTCGTCTTCCTCCTCGCGTCGTGGTTTTCGCTTGCCGTTGGTCTCGTCTCACGCGCGCCGCCGCTCTTTTTCCCCTCTTTTCCTTGCGTTTCA
ACAAGATTCGCGCGCGTCCAGCAGTCCGAGCCTCGCTTTTGTGCGATTTTGCTTCTGTCCAGCAAGCGATTTGGCCTCGAATCTCCTTGTCGGGTAACGCCGTCTAAGTG
TTCGATAAGGTTCGAAACACTTCAGCTTGGATACCCATTGCCCAAGGAGCGTTCTAACACGTTGTTAGAGGTTAGAATAACCCGCCTTTCTGAAAGCGAGTCCGTTTGCT
CTCTTGTGATCAAACTTGATGCTTTGTTGGATACTAAGCATGTTGTGTTAATGTTCATAAAGCGTGACGCGGAAATCACGTGTGGGATCTTGTTGTGGTGGTTGTTTGTT
GGTTGGTGGCTCACCGGGTGTGAGCTACTTACCAGTACGGTGGTTGTACTGATACCCCTCCCCTTTATTTCCCCCAACATTTCAGATTATGCAGGTGATGAGCTGTATGT
GCCCGGTGATGTAGACGAGGAGGTCACGAGGGGTGACCACAGGTTGTTCTGTGAAGTTGTGGAGTGGATCGAATCAATTAGAAGTGTTTTGAGTTTACAGGTTCGAGAAC
TTGATTCAGGAGTCATTGTACGGATGTTTATCAATGATCCTTCTTGTGCCGCAATTCAAAACCTTGGCATGGTATCTGAAAATGCTTTCGATGAGAAGAAAGTTGTTGAA
AGAAATGCCTTGGAAAAGAAAAAGTTGAGATCAACGTCAATCATTCAAACAACCGCGCCGCCGCCGCAGCCTCCACGCACAGAGCCGCCGCAGTCCGCCGCCCAGCCGCC
TCGCCCAGCCGCCGTCGTCTTCCTCCTCGCGCCGTGGTTTTCGCTTCCCGTTGGTCTCGTCTCACGCGCGCCGCCGCTCTTTTTCCCCTCTTTTCCTTGCGTTTCAACAA
GATTCGCGCGCGTCCAGCAGTCTGAGCCTCGCTTTTGTGCGATTTTGCTTCTGTCCAGCAAGCGATTTGGCCTCGAATCTCCTTCGCCGTCTAAGTGTTCGATAAGGTTC
GAAACACTTCAGCTTGGATACCCATTGCCCAAGGAGCGTTCTAACACGTTGTTAGAGTTTGTTTGTAGCTGGAAGCTCGTTTTGGGAGCGTGTCGTGCCATTCCAGCTAG
CGTTGATTATGCAGGTGATGAGCTGTATGTGCCCGGTGATGTAGACGAGGAGGTTCTGTTGTTGGTTGGAGATGCGAGATCTGACTCGTATCCTGTTTGCAGTTGTTCTG
TGAAGTTGTGGAGTGGATCGAATCAATTTAGAAGTGTTGTTGAGTTTAACAGGTGTTCTAAAATTTTGGGTTTGTCCTATGTACGCCGTTATGCTGCCGAAATTTTCGGA
GCTCTCGGTTGTTTGAATAGGAGCTTAGTAATGTCACTAGATTATGCTGTTGAGCGACTGGAGGGAGCAAATTCTGTGTTGCAGCAAAACTGGGAACAGAATTGCCACAT
CACAGCTCATTGTAATTCGTATTCCTATGCCTATTATGAACTAAGTAGCTTAAGGAACCCAACAGTTATAATACAACCCGACCGCTACATGGGTTTAACAGAAACTCAAA
CCATCATATCTGATGATGATGTAGAGGATCCATTGTCCTTTAATCAGGCAATGAATGATGTTGATAAGGACCAATGGATCAAAGTCATGGATCTCGAAATGGAGTCAATG
TACTTCAATTCGATCTGGGATCTTGTAGAGCAGCCGGATGGGGGTTTTACCCAACGGGAAGGAGTGGATTTTGAAGAAACTTTTTCCCCTGTTGCCATGCTTAAGTCGAT
TAGAATACTCTTATCCATTGCCACATATTATGACTATGAAATATGGCAAATGGATGGCTTTGAGCAGGGTGATGAGCAAAAGGTTTGCAAACTTAAACGATCCATTTATG
GATTAAAACAAGCATCAAGATCCTGGAATATACGTTTTGACACTGCGATCAAAACGTATGGCTTTGAGCAGAATGTTGTCGAACCTTGTGTTTACAAGAAAATCGTCAAA
TCCACTGTAGCTTTTCTAGTTCTATACGTAGATGATATCCTACTTATTGGGAATAATGTAGGATTCCTGACTGACGTAAAACAATGGCTAGCGACTCAATTCAAATGA
Protein sequenceShow/hide protein sequence
MVAAGRQRSTVTDSQRSTMATGRWRAKEPLELVHSDLCGLMNVKARGGYEYFVSFIDDYSRYDYLYLMHHKSEALEKFKEYKTEVENHLGKTIKTLRSDQGGEYMDLQFQ
DYLIEHEITSQLSAPATPQQNGVSEKRNRTLLDMVRSMMSYAQLPDSFWGYAVETTVYILNMVYVKVWKGRKGYPKETKGGIFYDPQDNKVIVSTNATFLEEDHIKDHRP
RSKLVLDEISRKTTKESTKVVDQAGPSKVVDQTHPSQIVREPRRSGRVIDNCPKATAPPPQPPRTEPPQSAAQPPRTVAVVFLLASWFSLAVGLVSRAPPLFFPSFPCVS
TRFARVQQSEPRFCAILLLSSKRFGLESPCRVTPSKCSIRFETLQLGYPLPKERSNTLLEVRITRLSESESVCSLVIKLDALLDTKHVVLMFIKRDAEITCGILLWWLFV
GWWLTGCELLTSTVVVLIPLPFISPNISDYAGDELYVPGDVDEEVTRGDHRLFCEVVEWIESIRSVLSLQVRELDSGVIVRMFINDPSCAAIQNLGMVSENAFDEKKVVE
RNALEKKKLRSTSIIQTTAPPPQPPRTEPPQSAAQPPRPAAVVFLLAPWFSLPVGLVSRAPPLFFPSFPCVSTRFARVQQSEPRFCAILLLSSKRFGLESPSPSKCSIRF
ETLQLGYPLPKERSNTLLEFVCSWKLVLGACRAIPASVDYAGDELYVPGDVDEEVLLLVGDARSDSYPVCSCSVKLWSGSNQFRSVVEFNRCSKILGLSYVRRYAAEIFG
ALGCLNRSLVMSLDYAVERLEGANSVLQQNWEQNCHITAHCNSYSYAYYELSSLRNPTVIIQPDRYMGLTETQTIISDDDVEDPLSFNQAMNDVDKDQWIKVMDLEMESM
YFNSIWDLVEQPDGGFTQREGVDFEETFSPVAMLKSIRILLSIATYYDYEIWQMDGFEQGDEQKVCKLKRSIYGLKQASRSWNIRFDTAIKTYGFEQNVVEPCVYKKIVK
STVAFLVLYVDDILLIGNNVGFLTDVKQWLATQFK