; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G08800 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G08800
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr7:6635098..6637765
RNA-Seq ExpressionCSPI07G08800
SyntenyCSPI07G08800
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN66323.1 hypothetical protein VITISV_007384 [Vitis vinifera]5.2e-22849.43Show/hide
Query:  MEDILYVNDLHLPVFSNEKPDDKTDREWELCHRKVCGFMRLWIEDNFLNHICEETHARTMWNKLESLCAPKTGNNKIFLIKQMMELKYQDGAPMLDHLNT
        MED+LYV D + PVF++E+P++K D EW L HR+VCG++R W++DN LNH+ EE HAR++WNKLE L A KTGNNK+ LIK+MM LKYQDG PM DHLNT
Subjt:  MEDILYVNDLHLPVFSNEKPDDKTDREWELCHRKVCGFMRLWIEDNFLNHICEETHARTMWNKLESLCAPKTGNNKIFLIKQMMELKYQDGAPMLDHLNT

Query:  FQGIMNQFSRMNIKFEDEIHGLWVLGTLSDSWKIFRTSLSNSTPNGVLSMDLVKSSMLNKEMRRKSRSSSSQSDVLVTEKEG------------------
        FQGI+NQ   MNIKFE+E+ GLW+LGTL + W+ FRTSLSNS  +G+++MDLVKS +LN+EMRRKS+ SSSQS+VLVTEK+G                  
Subjt:  FQGIMNQFSRMNIKFEDEIHGLWVLGTLSDSWKIFRTSLSNSTPNGVLSMDLVKSSMLNKEMRRKSRSSSSQSDVLVTEKEG------------------

Query:  -----------------------DKDSKNHKGKEKNNDDDSDADTIIVATEDFYILSDGDVINLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVR
                                +D K  K KEK ND+  + D +   T DF I+ D DV+N A Q++SWVIDSGASIHAT +++FF SYT GDFGSVR
Subjt:  -----------------------DKDSKNHKGKEKNNDDDSDADTIIVATEDFYILSDGDVINLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVR

Query:  MGNDGSANTIGIGD--LDDEGFCNTFDNDIWKLTKGSMVIAKGQKFSSLYYIDAKIMDSDIKTVNDEANVELWHKRLSHISDKGLKILTKKNHLPDLKST
        MGNDGSA  IG+GD  L  +G             +GSMVIAKG K SSLY + A+++DS I  V+D++  ELWH RL H+S+KGL IL K N L  +K  
Subjt:  MGNDGSANTIGIGD--LDDEGFCNTFDNDIWKLTKGSMVIAKGQKFSSLYYIDAKIMDSDIKTVNDEANVELWHKRLSHISDKGLKILTKKNHLPDLKST

Query:  PLKRCPHCLAEKQTRVTFKSSQHSRKPNVLELVHSNVCSPMKTKSLGGALYFVTFTEDHSKKILVYTLKTKDLVLQAFKQFHAFVERETGEKLKCVRTDN
         LKRC HCLA KQTRV FK+ +H+RKP + +LV+S+VC PMKTK+LGG+LYFVTF +DHS+KI VYTLKTKD VL  FKQFHA VER++GEKLKC+RTDN
Subjt:  PLKRCPHCLAEKQTRVTFKSSQHSRKPNVLELVHSNVCSPMKTKSLGGALYFVTFTEDHSKKILVYTLKTKDLVLQAFKQFHAFVERETGEKLKCVRTDN

Query:  GGEYCGPFDEYCRNYGIRHQKTPPKTPHLNKIAERLNRTLVER----------------------------------------------DISYSHLRVFG
        GGEY GPFDEYCR + IRHQKTPPKTP LN +AER+NRTLVER                                              +ISY HLRVFG
Subjt:  GGEYCGPFDEYCRNYGIRHQKTPPKTPHLNKIAERLNRTLVER----------------------------------------------DISYSHLRVFG

Query:  CKAFVHVPKDERSKLDAKTKACEFLGYGQNGFGYRLYDPAKKKLIRSRDV--------------------------------------------------
        CKAFVH+PKDERSKLD KT+ C F+GYGQ+  GYR YDP +KKL+RSRDV                                                  
Subjt:  CKAFVHVPKDERSKLDAKTKACEFLGYGQNGFGYRLYDPAKKKLIRSRDV--------------------------------------------------

Query:  ---------------------------SSTQIDDECSRQLAETVVPTNVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNKWNDAMKDE
                                   +  ++DD+   Q      P+++ LRR  RDR PSTRYS ++Y+LLTDG EPESY EA++DE+K KW D M+DE
Subjt:  ---------------------------SSTQIDDECSRQLAETVVPTNVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNKWNDAMKDE

Query:  MD-------------------------------------------------QKKGIDFDEIFAPVVKMSSIRVVLGLAASLD
        M+                                                 QKKGIDFDEIF+PVVKMSSIRVVLGLAASLD
Subjt:  MD-------------------------------------------------QKKGIDFDEIFAPVVKMSSIRVVLGLAASLD

KAA0040427.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]4.5e-22458.82Show/hide
Query:  KVCGFMRLWIEDNFLNHICEETHARTMWNKLESLCAPKTGNNKIFLIKQMMELKYQDGAPMLDHLNTFQGIMNQFSRMNIKFEDEIHGLWVLGTLSDSWK
        KVCGFMRLW+EDNFLNHICEET  +TMWNKLESLCAPKT                                        IKFEDEI GLWVLGTL DSW+
Subjt:  KVCGFMRLWIEDNFLNHICEETHARTMWNKLESLCAPKTGNNKIFLIKQMMELKYQDGAPMLDHLNTFQGIMNQFSRMNIKFEDEIHGLWVLGTLSDSWK

Query:  IFRTSLSNSTPNGVLSMDLVKSSMLNKEMRRKSRSSSSQSDVLVTEKEGDKDSKNHKGKEKNNDDDSDADTIIVATEDFYILSDGDVINLATQQSSWVID
        IFRTSLSNS PNG+LSMDLVKSS+LN+EMRRKS+SSS QSD LVTE+ G   SK                              G  +NL  QQSSWVID
Subjt:  IFRTSLSNSTPNGVLSMDLVKSSMLNKEMRRKSRSSSSQSDVLVTEKEGDKDSKNHKGKEKNNDDDSDADTIIVATEDFYILSDGDVINLATQQSSWVID

Query:  SGASIHATSKREFFASYTPGDFGSVRMGNDGSANTIGIGD-------------------------------LDDEGFCNTFDNDIWKLTKGSMVIAKGQK
        SGAS+HATSKREFFASYTPGDFGSVRMGNDG  N +GIGD                               LDDEGFCNTFDN IWKLTKGSMVIA GQK
Subjt:  SGASIHATSKREFFASYTPGDFGSVRMGNDGSANTIGIGD-------------------------------LDDEGFCNTFDNDIWKLTKGSMVIAKGQK

Query:  FSSLYYIDAKIMDSDIKTVNDEANVELWHKRLSHISDKGLKILTKKNHLPDLKSTPLKRCPHCLAEKQTRVTFKSSQHSRKPNVLELVHSNVCSPMKTKS
        FSSLYY+DAKI+D DI TVNDEANVELWHKRLSH+S+KGLKILTKKNHL DLKSTPLKRCPHCLA KQTRVTFKSSQHSRK NVLELVHSNVC  MKTKS
Subjt:  FSSLYYIDAKIMDSDIKTVNDEANVELWHKRLSHISDKGLKILTKKNHLPDLKSTPLKRCPHCLAEKQTRVTFKSSQHSRKPNVLELVHSNVCSPMKTKS

Query:  LGGALYFVTFTEDHSKKILVYTLKTKDLVLQAFKQFHAFVERETGEKLKCVRTDNGGEYCGPFDEYCRNYGIRHQKTPPKTPHLNKIAERLNRTLVER--
        LGGALYFVTFT+DHS+KI VYTLKTKD   Q FKQFHA VERETGEKLKC+RTDNGGEYCGPFDEYCRN+GIRHQKTPPK+  LN IA+RLNRTLVER  
Subjt:  LGGALYFVTFTEDHSKKILVYTLKTKDLVLQAFKQFHAFVERETGEKLKCVRTDNGGEYCGPFDEYCRNYGIRHQKTPPKTPHLNKIAERLNRTLVER--

Query:  --------------------------------------------DISYSHLRVFGCKAFVHVPKDERSKLDAKTKACEFLGYGQNGFGYRLYDPAKKKLI
                                                    DISYSHLRVFGCKAFVHVPKDERSKLDAKTK C FLGYGQ+ FGYR+Y   KKKLI
Subjt:  --------------------------------------------DISYSHLRVFGCKAFVHVPKDERSKLDAKTKACEFLGYGQNGFGYRLYDPAKKKLI

Query:  RSRDVSSTQIDDECSRQLAETVVPTNVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNKWNDAMKDEM---------------------
        RSRDV    ++D+    + +                 P +++            +PESYEEAIEDEHK +WNDAMKDEM                     
Subjt:  RSRDVSSTQIDDECSRQLAETVVPTNVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNKWNDAMKDEM---------------------

Query:  ----------------------------DQKKGIDFDEIFAPVVKMSSIRVVLGLAASLD
                                    +QKK IDFDEIFAPVVKMSSIRVVLGLAASLD
Subjt:  ----------------------------DQKKGIDFDEIFAPVVKMSSIRVVLGLAASLD

RVW85908.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]5.7e-24350.49Show/hide
Query:  MEDILYVNDLHLPVFSNEKPDDKTDREWELCHRKVCGFMRLWIEDNFLNHICEETHARTMWNKLESLCAPKTGNNKIFLIKQMMELKYQDGAPMLDHLNT
        MED+LYV D +LPVF++E+P++KTD EW L HR+VCGF+R W++DN LNH+ EE HAR++WNKLE L A KTGNNK+FLIK+MM LKYQDG P+ DHLNT
Subjt:  MEDILYVNDLHLPVFSNEKPDDKTDREWELCHRKVCGFMRLWIEDNFLNHICEETHARTMWNKLESLCAPKTGNNKIFLIKQMMELKYQDGAPMLDHLNT

Query:  FQGIMNQFSRMNIKFEDEIHGLWVLGTLSDSWKIFRTSLSNSTPNGVLSMDLVKSSMLNKEMRRKSRSSSSQSDVLVTEKEG------------------
        FQGI+NQ + MNIKFE+E+ GLW+LGTL DSW+ FRTSLSNS P+G+++MDLVKS +LN+EMRRKS+ SSSQS VLV EK G                  
Subjt:  FQGIMNQFSRMNIKFEDEIHGLWVLGTLSDSWKIFRTSLSNSTPNGVLSMDLVKSSMLNKEMRRKSRSSSSQSDVLVTEKEG------------------

Query:  -----------------------DKDSKNHKGKEKNNDDDSDADTIIVATEDFYILSDGDVINLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVR
                                +D K  K KEK ND+  + D +   T DF I+ D DV+N A Q++SWVIDSGASIHAT +++FF SYT GDFGSVR
Subjt:  -----------------------DKDSKNHKGKEKNNDDDSDADTIIVATEDFYILSDGDVINLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVR

Query:  MGNDGSA-------------------------------NTIGIGDLDDEGFCNTFDNDIWKLTKGSMVIAKGQKFSSLYYIDAKIMDSDIKTVNDEANVE
        MGNDGSA                               N I  G LDDEGFCNTF +  WKLT+GSMVIAKG K SSLY + A+++DS I  V+D++  E
Subjt:  MGNDGSA-------------------------------NTIGIGDLDDEGFCNTFDNDIWKLTKGSMVIAKGQKFSSLYYIDAKIMDSDIKTVNDEANVE

Query:  LWHKRLSHISDKGLKILTKKNHLPDLKSTPLKRCPHCLAEKQTRVTFKSSQHSRKPNVLELVHSNVCSPMKTKSLGGALYFVTFTEDHSKKILVYTLKTK
        LWH RL H+S+KGL IL KKN L  +K   LKRC HCLA KQTRV FK+ +H+RKP +L+LV+S+VC PMKTK+LGG+LYFVTF +DHS+KI VYTLKTK
Subjt:  LWHKRLSHISDKGLKILTKKNHLPDLKSTPLKRCPHCLAEKQTRVTFKSSQHSRKPNVLELVHSNVCSPMKTKSLGGALYFVTFTEDHSKKILVYTLKTK

Query:  DLVLQAFKQFHAFVERETGEKLKCVRTDNGGEYCGPFDEYCRNYGIRHQKTPPKTPHLNKIAERLNRTLVER----------------------------
        D VL  FKQFHA VER++GEKLKC+RTDNGGEY GPFDEYCR +GIRHQKTPPKTP LN +AER+NRTLVER                            
Subjt:  DLVLQAFKQFHAFVERETGEKLKCVRTDNGGEYCGPFDEYCRNYGIRHQKTPPKTPHLNKIAERLNRTLVER----------------------------

Query:  ------------------DISYSHLRVFGCKAFVHVPKDERSKLDAKTKACEFLGYGQNGFGYRLYDPAKKKLIRSRDV---------------------
                          +ISY HLRVFGCKAFVH+PKDERSKLDAKT+ C F+GYGQ+  GYR YDP +KKL+RSRDV                     
Subjt:  ------------------DISYSHLRVFGCKAFVHVPKDERSKLDAKTKACEFLGYGQNGFGYRLYDPAKKKLIRSRDV---------------------

Query:  --------------------------------------------------------SSTQIDDECSRQLAETVVPTNVSLRRSVRDRRPSTRYSPNEYLL
                                                                   ++DD+   Q      P+++ LRRS RDR PSTRYS ++Y+L
Subjt:  --------------------------------------------------------SSTQIDDECSRQLAETVVPTNVSLRRSVRDRRPSTRYSPNEYLL

Query:  LTDGGEPESYEEAIEDEHKNKWNDAMKDEMD-------------------------------------------------QKKGIDFDEIFAPVVKMSSI
        LTDGGEPESY EA+EDE+K KW DAM+DEM+                                                 QKKGIDFDEIF+PVVKMSSI
Subjt:  LTDGGEPESYEEAIEDEHKNKWNDAMKDEMD-------------------------------------------------QKKGIDFDEIFAPVVKMSSI

Query:  RVVLGLAASLD
        RVVLGLAASLD
Subjt:  RVVLGLAASLD

RVW88205.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]8.2e-24250.38Show/hide
Query:  MEDILYVNDLHLPVFSNEKPDDKTDREWELCHRKVCGFMRLWIEDNFLNHICEETHARTMWNKLESLCAPKTGNNKIFLIKQMMELKYQDGAPMLDHLNT
        MED+LYV D +LPVF++E+P++KTD EW L HR+VCG++R W++DN LNH+ EE HAR++WNKLE L A KTGNNK+FLIK+MM LKYQDG PM DHLNT
Subjt:  MEDILYVNDLHLPVFSNEKPDDKTDREWELCHRKVCGFMRLWIEDNFLNHICEETHARTMWNKLESLCAPKTGNNKIFLIKQMMELKYQDGAPMLDHLNT

Query:  FQGIMNQFSRMNIKFEDEIHGLWVLGTLSDSWKIFRTSLSNSTPNGVLSMDLVKSSMLNKEMRRKSRSSSSQSDVLVTEKEG------------------
        FQGI+NQ + MNIKFE+E+ GLW+LGTL DSW+ FRTSLSNS P+G+++MDLVKS +LN+EMRRKS+ SSSQS VLV EK G                  
Subjt:  FQGIMNQFSRMNIKFEDEIHGLWVLGTLSDSWKIFRTSLSNSTPNGVLSMDLVKSSMLNKEMRRKSRSSSSQSDVLVTEKEG------------------

Query:  -----------------------DKDSKNHKGKEKNNDDDSDADTIIVATEDFYILSDGDVINLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVR
                                +D K  K KEK ND+  + D +   T DF I+ D DV+N A Q++SWVIDSGASIHAT +++FF SYT GDFGSV 
Subjt:  -----------------------DKDSKNHKGKEKNNDDDSDADTIIVATEDFYILSDGDVINLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVR

Query:  MGNDGSANTIGIGD-------------------------------LDDEGFCNTFDNDIWKLTKGSMVIAKGQKFSSLYYIDAKIMDSDIKTVNDEANVE
        MGNDGSA  IG+GD                               LDDEGFCNTF +  WKLT+GSMVIAKG K SSLY + A+++DS I  V+D++  E
Subjt:  MGNDGSANTIGIGD-------------------------------LDDEGFCNTFDNDIWKLTKGSMVIAKGQKFSSLYYIDAKIMDSDIKTVNDEANVE

Query:  LWHKRLSHISDKGLKILTKKNHLPDLKSTPLKRCPHCLAEKQTRVTFKSSQHSRKPNVLELVHSNVCSPMKTKSLGGALYFVTFTEDHSKKILVYTLKTK
        LWH RL H+S+KGL IL KKN L  +K   LKRC HCLA KQTRV FK+ +H+RKP +L+LV+S+VC PMKTK+LGG+LYFVTF +DHS+KI VYTLKTK
Subjt:  LWHKRLSHISDKGLKILTKKNHLPDLKSTPLKRCPHCLAEKQTRVTFKSSQHSRKPNVLELVHSNVCSPMKTKSLGGALYFVTFTEDHSKKILVYTLKTK

Query:  DLVLQAFKQFHAFVERETGEKLKCVRTDNGGEYCGPFDEYCRNYGIRHQKTPPKTPHLNKIAERLNRTLVER----------------------------
        D VL  FKQFHA VER++GEKLKC+RTDNGGEY GPFDEYCR +GIRHQKTPPKTP LN +AER+NRTLVER                            
Subjt:  DLVLQAFKQFHAFVERETGEKLKCVRTDNGGEYCGPFDEYCRNYGIRHQKTPPKTPHLNKIAERLNRTLVER----------------------------

Query:  ------------------DISYSHLRVFGCKAFVHVPKDERSKLDAKTKACEFLGYGQNGFGYRLYDPAKKKLIRSRDV---------------------
                          +ISY HLRVFGCKAFVH+PKDERSKLDAKT+ C F+GYGQ+  GYR YDP +KKL+RSRDV                     
Subjt:  ------------------DISYSHLRVFGCKAFVHVPKDERSKLDAKTKACEFLGYGQNGFGYRLYDPAKKKLIRSRDV---------------------

Query:  --------------------------------------------------------SSTQIDDECSRQLAETVVPTNVSLRRSVRDRRPSTRYSPNEYLL
                                                                   ++DD+   Q     VP+++ LRRS RDR PST YS ++Y+L
Subjt:  --------------------------------------------------------SSTQIDDECSRQLAETVVPTNVSLRRSVRDRRPSTRYSPNEYLL

Query:  LTDGGEPESYEEAIEDEHKNKWNDAMKDEMD-------------------------------------------------QKKGIDFDEIFAPVVKMSSI
        LTDGGE ESY EA+EDE+K KW DAM+DEM+                                                 QKKGIDFDEIF+PVVKMSSI
Subjt:  LTDGGEPESYEEAIEDEHKNKWNDAMKDEMD-------------------------------------------------QKKGIDFDEIFAPVVKMSSI

Query:  RVVLGLAASLD
        RVVLGLAASLD
Subjt:  RVVLGLAASLD

TYJ98688.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.4e-22559.08Show/hide
Query:  KVCGFMRLWIEDNFLNHICEETHARTMWNKLESLCAPKTGNNKIFLIKQMMELKYQDGAPMLDHLNTFQGIMNQFSRMNIKFEDEIHGLWVLGTLSDSWK
        KVCGFMRLW+EDNFLNHICEET  +TMWNKLESLCAPKT                                        IKFEDEI GLWVLGTL DSW+
Subjt:  KVCGFMRLWIEDNFLNHICEETHARTMWNKLESLCAPKTGNNKIFLIKQMMELKYQDGAPMLDHLNTFQGIMNQFSRMNIKFEDEIHGLWVLGTLSDSWK

Query:  IFRTSLSNSTPNGVLSMDLVKSSMLNKEMRRKSRSSSSQSDVLVTEKEGDKDSKNHKGKEKNNDDDSDADTIIVATEDFYILSDGDVINLATQQSSWVID
        IFRTSLSNS PNG+LSMDLVKSS+LN+EMRRKS+SSS QSD LVTE+ G   SK                              G  +NLA QQSSWVID
Subjt:  IFRTSLSNSTPNGVLSMDLVKSSMLNKEMRRKSRSSSSQSDVLVTEKEGDKDSKNHKGKEKNNDDDSDADTIIVATEDFYILSDGDVINLATQQSSWVID

Query:  SGASIHATSKREFFASYTPGDFGSVRMGNDGSANTIGIGD-------------------------------LDDEGFCNTFDNDIWKLTKGSMVIAKGQK
        SGAS+HATSKREFFASYTPGDFGSVRMGNDG  N +GIGD                               LDDEGFCNTFDN IWKLTKGSMVIA GQK
Subjt:  SGASIHATSKREFFASYTPGDFGSVRMGNDGSANTIGIGD-------------------------------LDDEGFCNTFDNDIWKLTKGSMVIAKGQK

Query:  FSSLYYIDAKIMDSDIKTVNDEANVELWHKRLSHISDKGLKILTKKNHLPDLKSTPLKRCPHCLAEKQTRVTFKSSQHSRKPNVLELVHSNVCSPMKTKS
        FSSLYY+DAKI+D DI TVNDEANVELWHKRLSH+S+KGLKILTKKNHL DLKSTPLKRCPHCLA KQTRVTFKSSQHSRK NVLELVHSNVC  MKTKS
Subjt:  FSSLYYIDAKIMDSDIKTVNDEANVELWHKRLSHISDKGLKILTKKNHLPDLKSTPLKRCPHCLAEKQTRVTFKSSQHSRKPNVLELVHSNVCSPMKTKS

Query:  LGGALYFVTFTEDHSKKILVYTLKTKDLVLQAFKQFHAFVERETGEKLKCVRTDNGGEYCGPFDEYCRNYGIRHQKTPPKTPHLNKIAERLNRTLVER--
        LGGALYFVTFT+DHS+KI VYTLKTKD   Q FKQFHA VERETGEKLKC+RTDNGGEYCGPFDEYCRN+GIRHQKTPPK+  LN IA+RLNRTLVER  
Subjt:  LGGALYFVTFTEDHSKKILVYTLKTKDLVLQAFKQFHAFVERETGEKLKCVRTDNGGEYCGPFDEYCRNYGIRHQKTPPKTPHLNKIAERLNRTLVER--

Query:  --------------------------------------------DISYSHLRVFGCKAFVHVPKDERSKLDAKTKACEFLGYGQNGFGYRLYDPAKKKLI
                                                    DISYSHLRVFGCKAFVHVPKDERSKLDAKTK C FLGYGQ+ FGYR+YD  KKKLI
Subjt:  --------------------------------------------DISYSHLRVFGCKAFVHVPKDERSKLDAKTKACEFLGYGQNGFGYRLYDPAKKKLI

Query:  RSRDVSSTQIDDECSRQLAETVVPTNVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNKWNDAMKDEM---------------------
        RSRDV    ++D+    + +                 P +++            +PESYEEAIEDEHK +WNDAMKDEM                     
Subjt:  RSRDVSSTQIDDECSRQLAETVVPTNVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNKWNDAMKDEM---------------------

Query:  ----------------------------DQKKGIDFDEIFAPVVKMSSIRVVLGLAASLD
                                    +QKK IDFDEIFAPVVKMSSIRVVLGLAASLD
Subjt:  ----------------------------DQKKGIDFDEIFAPVVKMSSIRVVLGLAASLD

TrEMBL top hitse value%identityAlignment
A0A438HN89 Retrovirus-related Pol polyprotein from transposon TNT 1-942.8e-24350.49Show/hide
Query:  MEDILYVNDLHLPVFSNEKPDDKTDREWELCHRKVCGFMRLWIEDNFLNHICEETHARTMWNKLESLCAPKTGNNKIFLIKQMMELKYQDGAPMLDHLNT
        MED+LYV D +LPVF++E+P++KTD EW L HR+VCGF+R W++DN LNH+ EE HAR++WNKLE L A KTGNNK+FLIK+MM LKYQDG P+ DHLNT
Subjt:  MEDILYVNDLHLPVFSNEKPDDKTDREWELCHRKVCGFMRLWIEDNFLNHICEETHARTMWNKLESLCAPKTGNNKIFLIKQMMELKYQDGAPMLDHLNT

Query:  FQGIMNQFSRMNIKFEDEIHGLWVLGTLSDSWKIFRTSLSNSTPNGVLSMDLVKSSMLNKEMRRKSRSSSSQSDVLVTEKEG------------------
        FQGI+NQ + MNIKFE+E+ GLW+LGTL DSW+ FRTSLSNS P+G+++MDLVKS +LN+EMRRKS+ SSSQS VLV EK G                  
Subjt:  FQGIMNQFSRMNIKFEDEIHGLWVLGTLSDSWKIFRTSLSNSTPNGVLSMDLVKSSMLNKEMRRKSRSSSSQSDVLVTEKEG------------------

Query:  -----------------------DKDSKNHKGKEKNNDDDSDADTIIVATEDFYILSDGDVINLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVR
                                +D K  K KEK ND+  + D +   T DF I+ D DV+N A Q++SWVIDSGASIHAT +++FF SYT GDFGSVR
Subjt:  -----------------------DKDSKNHKGKEKNNDDDSDADTIIVATEDFYILSDGDVINLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVR

Query:  MGNDGSA-------------------------------NTIGIGDLDDEGFCNTFDNDIWKLTKGSMVIAKGQKFSSLYYIDAKIMDSDIKTVNDEANVE
        MGNDGSA                               N I  G LDDEGFCNTF +  WKLT+GSMVIAKG K SSLY + A+++DS I  V+D++  E
Subjt:  MGNDGSA-------------------------------NTIGIGDLDDEGFCNTFDNDIWKLTKGSMVIAKGQKFSSLYYIDAKIMDSDIKTVNDEANVE

Query:  LWHKRLSHISDKGLKILTKKNHLPDLKSTPLKRCPHCLAEKQTRVTFKSSQHSRKPNVLELVHSNVCSPMKTKSLGGALYFVTFTEDHSKKILVYTLKTK
        LWH RL H+S+KGL IL KKN L  +K   LKRC HCLA KQTRV FK+ +H+RKP +L+LV+S+VC PMKTK+LGG+LYFVTF +DHS+KI VYTLKTK
Subjt:  LWHKRLSHISDKGLKILTKKNHLPDLKSTPLKRCPHCLAEKQTRVTFKSSQHSRKPNVLELVHSNVCSPMKTKSLGGALYFVTFTEDHSKKILVYTLKTK

Query:  DLVLQAFKQFHAFVERETGEKLKCVRTDNGGEYCGPFDEYCRNYGIRHQKTPPKTPHLNKIAERLNRTLVER----------------------------
        D VL  FKQFHA VER++GEKLKC+RTDNGGEY GPFDEYCR +GIRHQKTPPKTP LN +AER+NRTLVER                            
Subjt:  DLVLQAFKQFHAFVERETGEKLKCVRTDNGGEYCGPFDEYCRNYGIRHQKTPPKTPHLNKIAERLNRTLVER----------------------------

Query:  ------------------DISYSHLRVFGCKAFVHVPKDERSKLDAKTKACEFLGYGQNGFGYRLYDPAKKKLIRSRDV---------------------
                          +ISY HLRVFGCKAFVH+PKDERSKLDAKT+ C F+GYGQ+  GYR YDP +KKL+RSRDV                     
Subjt:  ------------------DISYSHLRVFGCKAFVHVPKDERSKLDAKTKACEFLGYGQNGFGYRLYDPAKKKLIRSRDV---------------------

Query:  --------------------------------------------------------SSTQIDDECSRQLAETVVPTNVSLRRSVRDRRPSTRYSPNEYLL
                                                                   ++DD+   Q      P+++ LRRS RDR PSTRYS ++Y+L
Subjt:  --------------------------------------------------------SSTQIDDECSRQLAETVVPTNVSLRRSVRDRRPSTRYSPNEYLL

Query:  LTDGGEPESYEEAIEDEHKNKWNDAMKDEMD-------------------------------------------------QKKGIDFDEIFAPVVKMSSI
        LTDGGEPESY EA+EDE+K KW DAM+DEM+                                                 QKKGIDFDEIF+PVVKMSSI
Subjt:  LTDGGEPESYEEAIEDEHKNKWNDAMKDEMD-------------------------------------------------QKKGIDFDEIFAPVVKMSSI

Query:  RVVLGLAASLD
        RVVLGLAASLD
Subjt:  RVVLGLAASLD

A0A438HUT4 Retrovirus-related Pol polyprotein from transposon TNT 1-944.0e-24250.38Show/hide
Query:  MEDILYVNDLHLPVFSNEKPDDKTDREWELCHRKVCGFMRLWIEDNFLNHICEETHARTMWNKLESLCAPKTGNNKIFLIKQMMELKYQDGAPMLDHLNT
        MED+LYV D +LPVF++E+P++KTD EW L HR+VCG++R W++DN LNH+ EE HAR++WNKLE L A KTGNNK+FLIK+MM LKYQDG PM DHLNT
Subjt:  MEDILYVNDLHLPVFSNEKPDDKTDREWELCHRKVCGFMRLWIEDNFLNHICEETHARTMWNKLESLCAPKTGNNKIFLIKQMMELKYQDGAPMLDHLNT

Query:  FQGIMNQFSRMNIKFEDEIHGLWVLGTLSDSWKIFRTSLSNSTPNGVLSMDLVKSSMLNKEMRRKSRSSSSQSDVLVTEKEG------------------
        FQGI+NQ + MNIKFE+E+ GLW+LGTL DSW+ FRTSLSNS P+G+++MDLVKS +LN+EMRRKS+ SSSQS VLV EK G                  
Subjt:  FQGIMNQFSRMNIKFEDEIHGLWVLGTLSDSWKIFRTSLSNSTPNGVLSMDLVKSSMLNKEMRRKSRSSSSQSDVLVTEKEG------------------

Query:  -----------------------DKDSKNHKGKEKNNDDDSDADTIIVATEDFYILSDGDVINLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVR
                                +D K  K KEK ND+  + D +   T DF I+ D DV+N A Q++SWVIDSGASIHAT +++FF SYT GDFGSV 
Subjt:  -----------------------DKDSKNHKGKEKNNDDDSDADTIIVATEDFYILSDGDVINLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVR

Query:  MGNDGSANTIGIGD-------------------------------LDDEGFCNTFDNDIWKLTKGSMVIAKGQKFSSLYYIDAKIMDSDIKTVNDEANVE
        MGNDGSA  IG+GD                               LDDEGFCNTF +  WKLT+GSMVIAKG K SSLY + A+++DS I  V+D++  E
Subjt:  MGNDGSANTIGIGD-------------------------------LDDEGFCNTFDNDIWKLTKGSMVIAKGQKFSSLYYIDAKIMDSDIKTVNDEANVE

Query:  LWHKRLSHISDKGLKILTKKNHLPDLKSTPLKRCPHCLAEKQTRVTFKSSQHSRKPNVLELVHSNVCSPMKTKSLGGALYFVTFTEDHSKKILVYTLKTK
        LWH RL H+S+KGL IL KKN L  +K   LKRC HCLA KQTRV FK+ +H+RKP +L+LV+S+VC PMKTK+LGG+LYFVTF +DHS+KI VYTLKTK
Subjt:  LWHKRLSHISDKGLKILTKKNHLPDLKSTPLKRCPHCLAEKQTRVTFKSSQHSRKPNVLELVHSNVCSPMKTKSLGGALYFVTFTEDHSKKILVYTLKTK

Query:  DLVLQAFKQFHAFVERETGEKLKCVRTDNGGEYCGPFDEYCRNYGIRHQKTPPKTPHLNKIAERLNRTLVER----------------------------
        D VL  FKQFHA VER++GEKLKC+RTDNGGEY GPFDEYCR +GIRHQKTPPKTP LN +AER+NRTLVER                            
Subjt:  DLVLQAFKQFHAFVERETGEKLKCVRTDNGGEYCGPFDEYCRNYGIRHQKTPPKTPHLNKIAERLNRTLVER----------------------------

Query:  ------------------DISYSHLRVFGCKAFVHVPKDERSKLDAKTKACEFLGYGQNGFGYRLYDPAKKKLIRSRDV---------------------
                          +ISY HLRVFGCKAFVH+PKDERSKLDAKT+ C F+GYGQ+  GYR YDP +KKL+RSRDV                     
Subjt:  ------------------DISYSHLRVFGCKAFVHVPKDERSKLDAKTKACEFLGYGQNGFGYRLYDPAKKKLIRSRDV---------------------

Query:  --------------------------------------------------------SSTQIDDECSRQLAETVVPTNVSLRRSVRDRRPSTRYSPNEYLL
                                                                   ++DD+   Q     VP+++ LRRS RDR PST YS ++Y+L
Subjt:  --------------------------------------------------------SSTQIDDECSRQLAETVVPTNVSLRRSVRDRRPSTRYSPNEYLL

Query:  LTDGGEPESYEEAIEDEHKNKWNDAMKDEMD-------------------------------------------------QKKGIDFDEIFAPVVKMSSI
        LTDGGE ESY EA+EDE+K KW DAM+DEM+                                                 QKKGIDFDEIF+PVVKMSSI
Subjt:  LTDGGEPESYEEAIEDEHKNKWNDAMKDEMD-------------------------------------------------QKKGIDFDEIFAPVVKMSSI

Query:  RVVLGLAASLD
        RVVLGLAASLD
Subjt:  RVVLGLAASLD

A0A5A7TFU1 Retrovirus-related Pol polyprotein from transposon TNT 1-942.2e-22458.82Show/hide
Query:  KVCGFMRLWIEDNFLNHICEETHARTMWNKLESLCAPKTGNNKIFLIKQMMELKYQDGAPMLDHLNTFQGIMNQFSRMNIKFEDEIHGLWVLGTLSDSWK
        KVCGFMRLW+EDNFLNHICEET  +TMWNKLESLCAPKT                                        IKFEDEI GLWVLGTL DSW+
Subjt:  KVCGFMRLWIEDNFLNHICEETHARTMWNKLESLCAPKTGNNKIFLIKQMMELKYQDGAPMLDHLNTFQGIMNQFSRMNIKFEDEIHGLWVLGTLSDSWK

Query:  IFRTSLSNSTPNGVLSMDLVKSSMLNKEMRRKSRSSSSQSDVLVTEKEGDKDSKNHKGKEKNNDDDSDADTIIVATEDFYILSDGDVINLATQQSSWVID
        IFRTSLSNS PNG+LSMDLVKSS+LN+EMRRKS+SSS QSD LVTE+ G   SK                              G  +NL  QQSSWVID
Subjt:  IFRTSLSNSTPNGVLSMDLVKSSMLNKEMRRKSRSSSSQSDVLVTEKEGDKDSKNHKGKEKNNDDDSDADTIIVATEDFYILSDGDVINLATQQSSWVID

Query:  SGASIHATSKREFFASYTPGDFGSVRMGNDGSANTIGIGD-------------------------------LDDEGFCNTFDNDIWKLTKGSMVIAKGQK
        SGAS+HATSKREFFASYTPGDFGSVRMGNDG  N +GIGD                               LDDEGFCNTFDN IWKLTKGSMVIA GQK
Subjt:  SGASIHATSKREFFASYTPGDFGSVRMGNDGSANTIGIGD-------------------------------LDDEGFCNTFDNDIWKLTKGSMVIAKGQK

Query:  FSSLYYIDAKIMDSDIKTVNDEANVELWHKRLSHISDKGLKILTKKNHLPDLKSTPLKRCPHCLAEKQTRVTFKSSQHSRKPNVLELVHSNVCSPMKTKS
        FSSLYY+DAKI+D DI TVNDEANVELWHKRLSH+S+KGLKILTKKNHL DLKSTPLKRCPHCLA KQTRVTFKSSQHSRK NVLELVHSNVC  MKTKS
Subjt:  FSSLYYIDAKIMDSDIKTVNDEANVELWHKRLSHISDKGLKILTKKNHLPDLKSTPLKRCPHCLAEKQTRVTFKSSQHSRKPNVLELVHSNVCSPMKTKS

Query:  LGGALYFVTFTEDHSKKILVYTLKTKDLVLQAFKQFHAFVERETGEKLKCVRTDNGGEYCGPFDEYCRNYGIRHQKTPPKTPHLNKIAERLNRTLVER--
        LGGALYFVTFT+DHS+KI VYTLKTKD   Q FKQFHA VERETGEKLKC+RTDNGGEYCGPFDEYCRN+GIRHQKTPPK+  LN IA+RLNRTLVER  
Subjt:  LGGALYFVTFTEDHSKKILVYTLKTKDLVLQAFKQFHAFVERETGEKLKCVRTDNGGEYCGPFDEYCRNYGIRHQKTPPKTPHLNKIAERLNRTLVER--

Query:  --------------------------------------------DISYSHLRVFGCKAFVHVPKDERSKLDAKTKACEFLGYGQNGFGYRLYDPAKKKLI
                                                    DISYSHLRVFGCKAFVHVPKDERSKLDAKTK C FLGYGQ+ FGYR+Y   KKKLI
Subjt:  --------------------------------------------DISYSHLRVFGCKAFVHVPKDERSKLDAKTKACEFLGYGQNGFGYRLYDPAKKKLI

Query:  RSRDVSSTQIDDECSRQLAETVVPTNVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNKWNDAMKDEM---------------------
        RSRDV    ++D+    + +                 P +++            +PESYEEAIEDEHK +WNDAMKDEM                     
Subjt:  RSRDVSSTQIDDECSRQLAETVVPTNVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNKWNDAMKDEM---------------------

Query:  ----------------------------DQKKGIDFDEIFAPVVKMSSIRVVLGLAASLD
                                    +QKK IDFDEIFAPVVKMSSIRVVLGLAASLD
Subjt:  ----------------------------DQKKGIDFDEIFAPVVKMSSIRVVLGLAASLD

A0A5D3BKF7 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-22559.08Show/hide
Query:  KVCGFMRLWIEDNFLNHICEETHARTMWNKLESLCAPKTGNNKIFLIKQMMELKYQDGAPMLDHLNTFQGIMNQFSRMNIKFEDEIHGLWVLGTLSDSWK
        KVCGFMRLW+EDNFLNHICEET  +TMWNKLESLCAPKT                                        IKFEDEI GLWVLGTL DSW+
Subjt:  KVCGFMRLWIEDNFLNHICEETHARTMWNKLESLCAPKTGNNKIFLIKQMMELKYQDGAPMLDHLNTFQGIMNQFSRMNIKFEDEIHGLWVLGTLSDSWK

Query:  IFRTSLSNSTPNGVLSMDLVKSSMLNKEMRRKSRSSSSQSDVLVTEKEGDKDSKNHKGKEKNNDDDSDADTIIVATEDFYILSDGDVINLATQQSSWVID
        IFRTSLSNS PNG+LSMDLVKSS+LN+EMRRKS+SSS QSD LVTE+ G   SK                              G  +NLA QQSSWVID
Subjt:  IFRTSLSNSTPNGVLSMDLVKSSMLNKEMRRKSRSSSSQSDVLVTEKEGDKDSKNHKGKEKNNDDDSDADTIIVATEDFYILSDGDVINLATQQSSWVID

Query:  SGASIHATSKREFFASYTPGDFGSVRMGNDGSANTIGIGD-------------------------------LDDEGFCNTFDNDIWKLTKGSMVIAKGQK
        SGAS+HATSKREFFASYTPGDFGSVRMGNDG  N +GIGD                               LDDEGFCNTFDN IWKLTKGSMVIA GQK
Subjt:  SGASIHATSKREFFASYTPGDFGSVRMGNDGSANTIGIGD-------------------------------LDDEGFCNTFDNDIWKLTKGSMVIAKGQK

Query:  FSSLYYIDAKIMDSDIKTVNDEANVELWHKRLSHISDKGLKILTKKNHLPDLKSTPLKRCPHCLAEKQTRVTFKSSQHSRKPNVLELVHSNVCSPMKTKS
        FSSLYY+DAKI+D DI TVNDEANVELWHKRLSH+S+KGLKILTKKNHL DLKSTPLKRCPHCLA KQTRVTFKSSQHSRK NVLELVHSNVC  MKTKS
Subjt:  FSSLYYIDAKIMDSDIKTVNDEANVELWHKRLSHISDKGLKILTKKNHLPDLKSTPLKRCPHCLAEKQTRVTFKSSQHSRKPNVLELVHSNVCSPMKTKS

Query:  LGGALYFVTFTEDHSKKILVYTLKTKDLVLQAFKQFHAFVERETGEKLKCVRTDNGGEYCGPFDEYCRNYGIRHQKTPPKTPHLNKIAERLNRTLVER--
        LGGALYFVTFT+DHS+KI VYTLKTKD   Q FKQFHA VERETGEKLKC+RTDNGGEYCGPFDEYCRN+GIRHQKTPPK+  LN IA+RLNRTLVER  
Subjt:  LGGALYFVTFTEDHSKKILVYTLKTKDLVLQAFKQFHAFVERETGEKLKCVRTDNGGEYCGPFDEYCRNYGIRHQKTPPKTPHLNKIAERLNRTLVER--

Query:  --------------------------------------------DISYSHLRVFGCKAFVHVPKDERSKLDAKTKACEFLGYGQNGFGYRLYDPAKKKLI
                                                    DISYSHLRVFGCKAFVHVPKDERSKLDAKTK C FLGYGQ+ FGYR+YD  KKKLI
Subjt:  --------------------------------------------DISYSHLRVFGCKAFVHVPKDERSKLDAKTKACEFLGYGQNGFGYRLYDPAKKKLI

Query:  RSRDVSSTQIDDECSRQLAETVVPTNVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNKWNDAMKDEM---------------------
        RSRDV    ++D+    + +                 P +++            +PESYEEAIEDEHK +WNDAMKDEM                     
Subjt:  RSRDVSSTQIDDECSRQLAETVVPTNVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNKWNDAMKDEM---------------------

Query:  ----------------------------DQKKGIDFDEIFAPVVKMSSIRVVLGLAASLD
                                    +QKK IDFDEIFAPVVKMSSIRVVLGLAASLD
Subjt:  ----------------------------DQKKGIDFDEIFAPVVKMSSIRVVLGLAASLD

A5C3L0 Integrase catalytic domain-containing protein2.5e-22849.43Show/hide
Query:  MEDILYVNDLHLPVFSNEKPDDKTDREWELCHRKVCGFMRLWIEDNFLNHICEETHARTMWNKLESLCAPKTGNNKIFLIKQMMELKYQDGAPMLDHLNT
        MED+LYV D + PVF++E+P++K D EW L HR+VCG++R W++DN LNH+ EE HAR++WNKLE L A KTGNNK+ LIK+MM LKYQDG PM DHLNT
Subjt:  MEDILYVNDLHLPVFSNEKPDDKTDREWELCHRKVCGFMRLWIEDNFLNHICEETHARTMWNKLESLCAPKTGNNKIFLIKQMMELKYQDGAPMLDHLNT

Query:  FQGIMNQFSRMNIKFEDEIHGLWVLGTLSDSWKIFRTSLSNSTPNGVLSMDLVKSSMLNKEMRRKSRSSSSQSDVLVTEKEG------------------
        FQGI+NQ   MNIKFE+E+ GLW+LGTL + W+ FRTSLSNS  +G+++MDLVKS +LN+EMRRKS+ SSSQS+VLVTEK+G                  
Subjt:  FQGIMNQFSRMNIKFEDEIHGLWVLGTLSDSWKIFRTSLSNSTPNGVLSMDLVKSSMLNKEMRRKSRSSSSQSDVLVTEKEG------------------

Query:  -----------------------DKDSKNHKGKEKNNDDDSDADTIIVATEDFYILSDGDVINLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVR
                                +D K  K KEK ND+  + D +   T DF I+ D DV+N A Q++SWVIDSGASIHAT +++FF SYT GDFGSVR
Subjt:  -----------------------DKDSKNHKGKEKNNDDDSDADTIIVATEDFYILSDGDVINLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVR

Query:  MGNDGSANTIGIGD--LDDEGFCNTFDNDIWKLTKGSMVIAKGQKFSSLYYIDAKIMDSDIKTVNDEANVELWHKRLSHISDKGLKILTKKNHLPDLKST
        MGNDGSA  IG+GD  L  +G             +GSMVIAKG K SSLY + A+++DS I  V+D++  ELWH RL H+S+KGL IL K N L  +K  
Subjt:  MGNDGSANTIGIGD--LDDEGFCNTFDNDIWKLTKGSMVIAKGQKFSSLYYIDAKIMDSDIKTVNDEANVELWHKRLSHISDKGLKILTKKNHLPDLKST

Query:  PLKRCPHCLAEKQTRVTFKSSQHSRKPNVLELVHSNVCSPMKTKSLGGALYFVTFTEDHSKKILVYTLKTKDLVLQAFKQFHAFVERETGEKLKCVRTDN
         LKRC HCLA KQTRV FK+ +H+RKP + +LV+S+VC PMKTK+LGG+LYFVTF +DHS+KI VYTLKTKD VL  FKQFHA VER++GEKLKC+RTDN
Subjt:  PLKRCPHCLAEKQTRVTFKSSQHSRKPNVLELVHSNVCSPMKTKSLGGALYFVTFTEDHSKKILVYTLKTKDLVLQAFKQFHAFVERETGEKLKCVRTDN

Query:  GGEYCGPFDEYCRNYGIRHQKTPPKTPHLNKIAERLNRTLVER----------------------------------------------DISYSHLRVFG
        GGEY GPFDEYCR + IRHQKTPPKTP LN +AER+NRTLVER                                              +ISY HLRVFG
Subjt:  GGEYCGPFDEYCRNYGIRHQKTPPKTPHLNKIAERLNRTLVER----------------------------------------------DISYSHLRVFG

Query:  CKAFVHVPKDERSKLDAKTKACEFLGYGQNGFGYRLYDPAKKKLIRSRDV--------------------------------------------------
        CKAFVH+PKDERSKLD KT+ C F+GYGQ+  GYR YDP +KKL+RSRDV                                                  
Subjt:  CKAFVHVPKDERSKLDAKTKACEFLGYGQNGFGYRLYDPAKKKLIRSRDV--------------------------------------------------

Query:  ---------------------------SSTQIDDECSRQLAETVVPTNVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNKWNDAMKDE
                                   +  ++DD+   Q      P+++ LRR  RDR PSTRYS ++Y+LLTDG EPESY EA++DE+K KW D M+DE
Subjt:  ---------------------------SSTQIDDECSRQLAETVVPTNVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNKWNDAMKDE

Query:  MD-------------------------------------------------QKKGIDFDEIFAPVVKMSSIRVVLGLAASLD
        M+                                                 QKKGIDFDEIF+PVVKMSSIRVVLGLAASLD
Subjt:  MD-------------------------------------------------QKKGIDFDEIFAPVVKMSSIRVVLGLAASLD

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.3e-3423.8Show/hide
Query:  EDILYVNDLHLPVFSNEKPDDKTDREWELCHRKVCGFMRLWIEDNFLNHICEETHARTMWNKLESLCAPKTGNNKIFLIKQMMELKYQDGAPMLDHLNTF
        +D+L V D  +P        ++ D  W+   R     +  ++ D+FLN    +  AR +   L+++   K+  +++ L K+++ LK      +L H + F
Subjt:  EDILYVNDLHLPVFSNEKPDDKTDREWELCHRKVCGFMRLWIEDNFLNHICEETHARTMWNKLESLCAPKTGNNKIFLIKQMMELKYQDGAPMLDHLNTF

Query:  QGIMNQFSRMNIKFEDEIHGLWVLGTLSDSWKIFRTSLSNSTPNGVLSMDLVKSSMLNKEMRRKSRSSSSQSDVL--------------VTEKEGDKDSK
          ++++      K E+      +L TL   +    T++   +    L++  VK+ +L++E++ K+  + +   V+              + +    K  K
Subjt:  QGIMNQFSRMNIKFEDEIHGLWVLGTLSDSWKIFRTSLSNSTPNGVLSMDLVKSSMLNKEMRRKSRSSSSQSDVL--------------VTEKEGDKDSK

Query:  NHKGKEK-------------------------NNDDDSDADTIIVATEDFYILSDGDVINLATQQS-SWVIDSGASIHATSKREFF------------AS
          KG  K                         NN +  +   +  AT         +V N +   +  +V+DSGAS H  +    +            A 
Subjt:  NHKGKEK-------------------------NNDDDSDADTIIVATEDFYILSDGDVINLATQQS-SWVIDSGASIHATSKREFF------------AS

Query:  YTPGDF------GSVRMGND-------------GSANTIGIGDLDDEGFCNTFDNDIWKLTK-GSMVIAKGQKFSSLYYIDAKIMDSDIKTVNDEANVEL
           G+F      G VR+ ND              + N + +  L + G    FD     ++K G MV+      +++  I+ +    + K  N   N  L
Subjt:  YTPGDF------GSVRMGND-------------GSANTIGIGDLDDEGFCNTFDNDIWKLTK-GSMVIAKGQKFSSLYYIDAKIMDSDIKTVNDEANVEL

Query:  WHKRLSHISDKGLKILTKKNHLPD---LKSTPL--KRCPHCLAEKQTRVTF---KSSQHSRKPNVLELVHSNVCSPMKTKSLGGALYFVTFTEDHSKKIL
        WH+R  HISD  L  + +KN   D   L +  L  + C  CL  KQ R+ F   K   H ++P  L +VHS+VC P+   +L    YFV F +  +   +
Subjt:  WHKRLSHISDKGLKILTKKNHLPD---LKSTPL--KRCPHCLAEKQTRVTF---KSSQHSRKPNVLELVHSNVCSPMKTKSLGGALYFVTFTEDHSKKIL

Query:  VYTLKTKDLVLQAFKQFHAFVERETGEKLKCVRTDNGGEY-CGPFDEYCRNYGIRHQKTPPKTPHLNKIAERLNRTLVER----------DISY------
         Y +K K  V   F+ F A  E     K+  +  DNG EY      ++C   GI +  T P TP LN ++ER+ RT+ E+          D S+      
Subjt:  VYTLKTKDLVLQAFKQFHAFVERETGEKLKCVRTDNGGEY-CGPFDEYCRNYGIRHQKTPPKTPHLNKIAERLNRTLVER----------DISY------

Query:  --------------------------------SHLRVFGCKAFVHVPKDERSKLDAKTKACEFLGYGQNGFGYRLYDPAKKKLIRSRDVSSTQIDDECSR
                                         HLRVFG   +VH+ K+++ K D K+    F+GY  NGF  +L+D   +K I +RDV    + DE + 
Subjt:  --------------------------------SHLRVFGCKAFVHVPKDERSKLDAKTKACEFLGYGQNGFGYRLYDPAKKKLIRSRDVSSTQIDDECSR

Query:  QLAETVVPTNVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEE---------AIEDEHKNKWNDAMK
          +  V    V L+ S   +    +  PN+   +     P   +E         + E E+KN  ND+ K
Subjt:  QLAETVVPTNVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEE---------AIEDEHKNKWNDAMK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.9e-13335.11Show/hide
Query:  MEDILYVNDLHLPV-FSNEKPDDKTDREWELCHRKVCGFMRLWIEDNFLNHICEETHARTMWNKLESLCAPKTGNNKIFLIKQMMELKYQDGAPMLDHLN
        M D+L    LH  +   ++KPD     +W     +    +RL + D+ +N+I +E  AR +W +LESL   KT  NK++L KQ+  L   +G   L HLN
Subjt:  MEDILYVNDLHLPV-FSNEKPDDKTDREWELCHRKVCGFMRLWIEDNFLNHICEETHARTMWNKLESLCAPKTGNNKIFLIKQMMELKYQDGAPMLDHLN

Query:  TFQGIMNQFSRMNIKFEDEIHGLWVLGTLSDSWKIFRTSLSNSTPNGVLSMDLVKSSMLNKEMRRKSRSSSSQSDVLVTEKEG---DKDSKNH-----KG
         F G++ Q + + +K E+E   + +L +L  S+    T++ +      L  D+  + +LN++MR+K     +Q   L+TE  G    + S N+     +G
Subjt:  TFQGIMNQFSRMNIKFEDEIHGLWVLGTLSDSWKIFRTSLSNSTPNGVLSMDLVKSSMLNKEMRRKSRSSSSQSDVLVTEKEG---DKDSKNH-----KG

Query:  KEKN------------------------------------NDDDSDADTIIVATED---FYILSDGDVINLATQQSSWVIDSGASIHATSKREFFASYTP
        K KN                                    NDD++ A   +V   D    +I  + + ++L+  +S WV+D+ AS HAT  R+ F  Y  
Subjt:  KEKN------------------------------------NDDDSDADTIIVATED---FYILSDGDVINLATQQSSWVIDSGASIHATSKREFFASYTP

Query:  GDFGSVRMGNDGSANTIGIGD-------------------------------LDDEGFCNTFDNDIWKLTKGSMVIAKGQKFSSLYYIDAKIMDSDIKTV
        GDFG+V+MGN   +   GIGD                               LD +G+ + F N  W+LTKGS+VIAKG    +LY  +A+I   ++   
Subjt:  GDFGSVRMGNDGSANTIGIGD-------------------------------LDDEGFCNTFDNDIWKLTKGSMVIAKGQKFSSLYYIDAKIMDSDIKTV

Query:  NDEANVELWHKRLSHISDKGLKILTKKNHLPDLKSTPLKRCPHCLAEKQTRVTFKSSQHSRKPNVLELVHSNVCSPMKTKSLGGALYFVTFTEDHSKKIL
         DE +V+LWHKR+ H+S+KGL+IL KK+ +   K T +K C +CL  KQ RV+F++S   RK N+L+LV+S+VC PM+ +S+GG  YFVTF +D S+K+ 
Subjt:  NDEANVELWHKRLSHISDKGLKILTKKNHLPDLKSTPLKRCPHCLAEKQTRVTFKSSQHSRKPNVLELVHSNVCSPMKTKSLGGALYFVTFTEDHSKKIL

Query:  VYTLKTKDLVLQAFKQFHAFVERETGEKLKCVRTDNGGEYCG-PFDEYCRNYGIRHQKTPPKTPHLNKIAERLNRTLVE---------------------
        VY LKTKD V Q F++FHA VERETG KLK +R+DNGGEY    F+EYC ++GIRH+KT P TP  N +AER+NRT+VE                     
Subjt:  VYTLKTKDLVLQAFKQFHAFVERETGEKLKCVRTDNGGEYCG-PFDEYCRNYGIRHQKTPPKTPHLNKIAERLNRTLVE---------------------

Query:  -------------------------RDISYSHLRVFGCKAFVHVPKDERSKLDAKTKACEFLGYGQNGFGYRLYDPAKKKLIRSRDV-------------
                                 +++SYSHL+VFGC+AF HVPK++R+KLD K+  C F+GYG   FGYRL+DP KKK+IRSRDV             
Subjt:  -------------------------RDISYSHLRVFGCKAFVHVPKDERSKLDAKTKACEFLGYGQNGFGYRLYDPAKKKLIRSRDV-------------

Query:  -----------------------SSTQIDDECSRQ-----------------LAETVVPT-----NVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYE
                               S+    DE S Q                 + E   PT     +  LRRS R R  S RY   EY+L++D  EPES +
Subjt:  -----------------------SSTQIDDECSRQ-----------------LAETVVPT-----NVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYE

Query:  EAIEDEHKNKWNDAMKDEM-------------------------------------------------DQKKGIDFDEIFAPVVKMSSIRVVLGLAASLD
        E +    KN+   AM++EM                                                 +QKKGIDFDEIF+PVVKM+SIR +L LAASLD
Subjt:  EAIEDEHKNKWNDAMKDEM-------------------------------------------------DQKKGIDFDEIFAPVVKMSSIRVVLGLAASLD

P93293 Uncharacterized mitochondrial protein AtMg003002.6e-1236.94Show/hide
Query:  IWKLTKGSMVIAKGQKFSSLYYIDAKIMDSD---IKTVNDEANVELWHKRLSHISDKGLKILTKKNHLPDLKSTPLKRCPHCLAEKQTRVTFKSSQHSRK
        + K+ KG   I KG +  SLY +   +   +    +T  DE    LWH RL+H+S +G+++L KK  L   K + LK C  C+  K  RV F + QH+ K
Subjt:  IWKLTKGSMVIAKGQKFSSLYYIDAKIMDSD---IKTVNDEANVELWHKRLSHISDKGLKILTKKNHLPDLKSTPLKRCPHCLAEKQTRVTFKSSQHSRK

Query:  PNVLELVHSNV
         N L+ VHS++
Subjt:  PNVLELVHSNV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.7e-1727.04Show/hide
Query:  NDEANVELWHKRLSHISDKGLKILTKKNHLPDLK-STPLKRCPHCLAEKQTRVTF-KSSQHSRKPNVLELVHSNVCSPMKTKSLGGALYFVTFTEDHSKK
        + +A    WH RL H +   L  +     L  L  S     C  CL  K  +V F +S+ +S +P  LE ++S+V S     S     Y+V F +  ++ 
Subjt:  NDEANVELWHKRLSHISDKGLKILTKKNHLPDLK-STPLKRCPHCLAEKQTRVTF-KSSQHSRKPNVLELVHSNVCSPMKTKSLGGALYFVTFTEDHSKK

Query:  ILVYTLKTKDLVLQAFKQFHAFVERETGEKLKCVRTDNGGEYCGPFDEYCRNYGIRHQKTPPKTPHLNKIAERLNRTLVERDI-----------------
          +Y LK K  V + F  F   +E     ++    +DNGGE+   + EY   +GI H  +PP TP  N ++ER +R +VE  +                 
Subjt:  ILVYTLKTKDLVLQAFKQFHAFVERETGEKLKCVRTDNGGEYCGPFDEYCRNYGIRHQKTPPKTPHLNKIAERLNRTLVERDI-----------------

Query:  -----------------------------SYSHLRVFGCKAFVHVPKDERSKLDAKTKACEFLGYGQNGFGYRLYDPAKKKLIRSRDVSSTQIDDEC---
                                     +Y  LRVFGC  +  +    + KLD K++ C FLGY      Y        +L  SR V   + D+ C   
Subjt:  -----------------------------SYSHLRVFGCKAFVHVPKDERSKLDAKTKACEFLGYGQNGFGYRLYDPAKKKLIRSRDVSSTQIDDEC---

Query:  SRQLAETVVPTNVSLRRS
        S  LA T+ P     R S
Subjt:  SRQLAETVVPTNVSLRRS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.3e-1924.88Show/hide
Query:  INLATQQSSWVIDSGASIHATS---KREFFASYTPGDFGSVRMGNDGSANTIGIGDLDDEG-----------------------FCNTFDNDI-------
        +N     ++W++DSGA+ H TS      F   YT GD   +  G+       G   L                            CNT    +       
Subjt:  INLATQQSSWVIDSGASIHATS---KREFFASYTPGDFGSVRMGNDGSANTIGIGDLDDEG-----------------------FCNTFDNDI-------

Query:  -WKLTKGSMVIAKGQKFSSLYY--IDAKIMDSDIKTVNDEANVELWHKRLSHISDKGLKILTKKNHLPDLK-STPLKRCPHCLAEKQTRVTFKSSQ-HSR
          K     + + +G+    LY   I +    S   +   +A    WH RL H S   L  +   + LP L  S  L  C  C   K  +V F +S   S 
Subjt:  -WKLTKGSMVIAKGQKFSSLYY--IDAKIMDSDIKTVNDEANVELWHKRLSHISDKGLKILTKKNHLPDLK-STPLKRCPHCLAEKQTRVTFKSSQ-HSR

Query:  KPNVLELVHSNVCSPMKTKSLGGALYFVTFTEDHSKKILVYTLKTKDLVLQAFKQFHAFVERETGEKLKCVRTDNGGEYCGPFDEYCRNYGIRHQKTPPK
        KP  LE ++S+V S     S+    Y+V F +  ++   +Y LK K  V   F  F + VE     ++  + +DNGGE+     +Y   +GI H  +PP 
Subjt:  KPNVLELVHSNVCSPMKTKSLGGALYFVTFTEDHSKKILVYTLKTKDLVLQAFKQFHAFVERETGEKLKCVRTDNGGEYCGPFDEYCRNYGIRHQKTPPK

Query:  TPHLNKIAERLNRTLVERDI----------------------------------------------SYSHLRVFGCKAFVHVPKDERSKLDAKTKACEFL
        TP  N ++ER +R +VE  +                                              +Y  L+VFGC  +  +    R KL+ K+K C F+
Subjt:  TPHLNKIAERLNRTLVERDI----------------------------------------------SYSHLRVFGCKAFVHVPKDERSKLDAKTKACEFL

Query:  GYGQNGFGYRLYDPAKKKLIRSRDVSSTQIDDEC
        GY      Y        +L  SR V   Q D+ C
Subjt:  GYGQNGFGYRLYDPAKKKLIRSRDVSSTQIDDEC

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein1.8e-1336.94Show/hide
Query:  IWKLTKGSMVIAKGQKFSSLYYIDAKIMDSD---IKTVNDEANVELWHKRLSHISDKGLKILTKKNHLPDLKSTPLKRCPHCLAEKQTRVTFKSSQHSRK
        + K+ KG   I KG +  SLY +   +   +    +T  DE    LWH RL+H+S +G+++L KK  L   K + LK C  C+  K  RV F + QH+ K
Subjt:  IWKLTKGSMVIAKGQKFSSLYYIDAKIMDSD---IKTVNDEANVELWHKRLSHISDKGLKILTKKNHLPDLKSTPLKRCPHCLAEKQTRVTFKSSQHSRK

Query:  PNVLELVHSNV
         N L+ VHS++
Subjt:  PNVLELVHSNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGACATATTGTATGTAAATGACTTGCACCTTCCTGTTTTTTCTAATGAGAAGCCTGACGACAAAACTGATAGAGAATGGGAATTATGTCATAGAAAAGTGTGTGG
ATTTATGAGGCTATGGATAGAAGATAACTTTCTAAACCATATTTGTGAAGAAACTCATGCGCGAACTATGTGGAATAAGCTTGAATCGCTATGTGCCCCTAAAACTGGTA
ATAATAAAATATTTCTGATTAAACAGATGATGGAGTTAAAGTATCAAGATGGAGCGCCTATGTTAGATCACTTGAATACATTTCAAGGTATTATGAATCAGTTCTCTAGA
ATGAATATCAAGTTTGAGGATGAGATACATGGGTTATGGGTGCTTGGTACATTGTCGGACTCATGGAAAATATTTCGAACTTCCTTATCGAACTCAACCCCAAATGGTGT
ACTAAGTATGGACCTAGTAAAAAGTAGCATGTTGAACAAGGAGATGAGAAGAAAGTCTCGAAGTTCTTCTTCACAGTCAGATGTTCTGGTTACTGAAAAAGAGGGAGACA
AAGACAGTAAAAATCATAAGGGCAAGGAAAAGAATAATGACGATGATAGTGATGCTGATACAATCATTGTAGCCACTGAAGATTTTTACATCTTGTCTGATGGTGATGTT
ATAAATCTTGCCACACAACAGAGCAGTTGGGTGATTGATAGTGGTGCATCAATTCATGCTACTTCGAAGAGGGAATTTTTTGCATCCTATACTCCTGGTGATTTTGGCAG
TGTTAGGATGGGTAATGACGGATCAGCAAATACAATTGGCATCGGAGATCTTGATGATGAAGGTTTCTGCAATACCTTCGACAATGACATATGGAAGCTTACTAAAGGTT
CAATGGTTATAGCAAAGGGACAAAAATTTTCTTCACTGTACTACATAGATGCAAAAATCATGGATTCTGATATAAAAACAGTGAATGATGAAGCAAATGTTGAGCTTTGG
CATAAGAGACTTAGCCATATAAGTGATAAGGGTTTAAAGATTTTAACTAAGAAAAATCATCTTCCTGATTTAAAGAGTACACCATTAAAACGATGTCCTCATTGTTTGGC
AGAAAAGCAGACGAGAGTTACATTTAAATCATCTCAACATTCAAGGAAGCCAAATGTACTAGAGTTGGTACATTCTAATGTGTGTAGTCCCATGAAAACAAAGTCGCTTG
GGGGTGCTTTGTATTTTGTGACATTTACTGAAGATCATTCAAAGAAAATATTGGTTTACACCTTGAAGACTAAAGATCTAGTGTTGCAAGCGTTTAAACAGTTTCATGCC
TTTGTTGAGAGAGAAACTGGCGAAAAGCTCAAGTGTGTTAGAACTGATAATGGAGGTGAGTATTGCGGACCTTTTGATGAATATTGTAGAAATTATGGCATTCGACATCA
AAAGACGCCTCCTAAGACCCCGCATTTAAATAAGATAGCGGAAAGATTGAATAGAACATTGGTTGAGAGAGATATATCTTACAGTCACCTACGTGTCTTTGGTTGTAAAG
CTTTTGTCCATGTACCTAAAGATGAGAGATCAAAGCTTGATGCAAAAACTAAAGCATGTGAGTTTCTTGGTTATGGCCAAAATGGGTTTGGTTATAGATTATATGATCCA
GCTAAGAAAAAGCTTATAAGAAGTCGAGATGTTTCTTCTACACAGATAGATGATGAGTGTTCAAGACAGTTAGCTGAAACAGTTGTTCCTACAAATGTTTCACTCAGGAG
ATCTGTCAGAGATCGACGTCCGTCAACAAGATATTCACCTAATGAATATTTGCTATTGACTGACGGGGGAGAACCTGAGAGTTATGAAGAGGCTATAGAGGATGAGCACA
AAAATAAGTGGAACGATGCAATGAAAGATGAGATGGATCAGAAGAAAGGTATCGACTTTGATGAAATTTTTGCTCCAGTTGTCAAGATGTCCTCCATACGTGTTGTTCTG
GGTTTAGCAGCCAGTCTTGACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGACATATTGTATGTAAATGACTTGCACCTTCCTGTTTTTTCTAATGAGAAGCCTGACGACAAAACTGATAGAGAATGGGAATTATGTCATAGAAAAGTGTGTGG
ATTTATGAGGCTATGGATAGAAGATAACTTTCTAAACCATATTTGTGAAGAAACTCATGCGCGAACTATGTGGAATAAGCTTGAATCGCTATGTGCCCCTAAAACTGGTA
ATAATAAAATATTTCTGATTAAACAGATGATGGAGTTAAAGTATCAAGATGGAGCGCCTATGTTAGATCACTTGAATACATTTCAAGGTATTATGAATCAGTTCTCTAGA
ATGAATATCAAGTTTGAGGATGAGATACATGGGTTATGGGTGCTTGGTACATTGTCGGACTCATGGAAAATATTTCGAACTTCCTTATCGAACTCAACCCCAAATGGTGT
ACTAAGTATGGACCTAGTAAAAAGTAGCATGTTGAACAAGGAGATGAGAAGAAAGTCTCGAAGTTCTTCTTCACAGTCAGATGTTCTGGTTACTGAAAAAGAGGGAGACA
AAGACAGTAAAAATCATAAGGGCAAGGAAAAGAATAATGACGATGATAGTGATGCTGATACAATCATTGTAGCCACTGAAGATTTTTACATCTTGTCTGATGGTGATGTT
ATAAATCTTGCCACACAACAGAGCAGTTGGGTGATTGATAGTGGTGCATCAATTCATGCTACTTCGAAGAGGGAATTTTTTGCATCCTATACTCCTGGTGATTTTGGCAG
TGTTAGGATGGGTAATGACGGATCAGCAAATACAATTGGCATCGGAGATCTTGATGATGAAGGTTTCTGCAATACCTTCGACAATGACATATGGAAGCTTACTAAAGGTT
CAATGGTTATAGCAAAGGGACAAAAATTTTCTTCACTGTACTACATAGATGCAAAAATCATGGATTCTGATATAAAAACAGTGAATGATGAAGCAAATGTTGAGCTTTGG
CATAAGAGACTTAGCCATATAAGTGATAAGGGTTTAAAGATTTTAACTAAGAAAAATCATCTTCCTGATTTAAAGAGTACACCATTAAAACGATGTCCTCATTGTTTGGC
AGAAAAGCAGACGAGAGTTACATTTAAATCATCTCAACATTCAAGGAAGCCAAATGTACTAGAGTTGGTACATTCTAATGTGTGTAGTCCCATGAAAACAAAGTCGCTTG
GGGGTGCTTTGTATTTTGTGACATTTACTGAAGATCATTCAAAGAAAATATTGGTTTACACCTTGAAGACTAAAGATCTAGTGTTGCAAGCGTTTAAACAGTTTCATGCC
TTTGTTGAGAGAGAAACTGGCGAAAAGCTCAAGTGTGTTAGAACTGATAATGGAGGTGAGTATTGCGGACCTTTTGATGAATATTGTAGAAATTATGGCATTCGACATCA
AAAGACGCCTCCTAAGACCCCGCATTTAAATAAGATAGCGGAAAGATTGAATAGAACATTGGTTGAGAGAGATATATCTTACAGTCACCTACGTGTCTTTGGTTGTAAAG
CTTTTGTCCATGTACCTAAAGATGAGAGATCAAAGCTTGATGCAAAAACTAAAGCATGTGAGTTTCTTGGTTATGGCCAAAATGGGTTTGGTTATAGATTATATGATCCA
GCTAAGAAAAAGCTTATAAGAAGTCGAGATGTTTCTTCTACACAGATAGATGATGAGTGTTCAAGACAGTTAGCTGAAACAGTTGTTCCTACAAATGTTTCACTCAGGAG
ATCTGTCAGAGATCGACGTCCGTCAACAAGATATTCACCTAATGAATATTTGCTATTGACTGACGGGGGAGAACCTGAGAGTTATGAAGAGGCTATAGAGGATGAGCACA
AAAATAAGTGGAACGATGCAATGAAAGATGAGATGGATCAGAAGAAAGGTATCGACTTTGATGAAATTTTTGCTCCAGTTGTCAAGATGTCCTCCATACGTGTTGTTCTG
GGTTTAGCAGCCAGTCTTGACTAA
Protein sequenceShow/hide protein sequence
MEDILYVNDLHLPVFSNEKPDDKTDREWELCHRKVCGFMRLWIEDNFLNHICEETHARTMWNKLESLCAPKTGNNKIFLIKQMMELKYQDGAPMLDHLNTFQGIMNQFSR
MNIKFEDEIHGLWVLGTLSDSWKIFRTSLSNSTPNGVLSMDLVKSSMLNKEMRRKSRSSSSQSDVLVTEKEGDKDSKNHKGKEKNNDDDSDADTIIVATEDFYILSDGDV
INLATQQSSWVIDSGASIHATSKREFFASYTPGDFGSVRMGNDGSANTIGIGDLDDEGFCNTFDNDIWKLTKGSMVIAKGQKFSSLYYIDAKIMDSDIKTVNDEANVELW
HKRLSHISDKGLKILTKKNHLPDLKSTPLKRCPHCLAEKQTRVTFKSSQHSRKPNVLELVHSNVCSPMKTKSLGGALYFVTFTEDHSKKILVYTLKTKDLVLQAFKQFHA
FVERETGEKLKCVRTDNGGEYCGPFDEYCRNYGIRHQKTPPKTPHLNKIAERLNRTLVERDISYSHLRVFGCKAFVHVPKDERSKLDAKTKACEFLGYGQNGFGYRLYDP
AKKKLIRSRDVSSTQIDDECSRQLAETVVPTNVSLRRSVRDRRPSTRYSPNEYLLLTDGGEPESYEEAIEDEHKNKWNDAMKDEMDQKKGIDFDEIFAPVVKMSSIRVVL
GLAASLD