; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G14310 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G14310
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionGag/pol protein
Genome locationChr5:14692872..14695259
RNA-Seq ExpressionCSPI05G14310
SyntenyCSPI05G14310
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]0.0e+0088.5Show/hide
Query:  MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYL
        MTKRSFTGKGLRAK PLEL+HSDLCGPMNVKARGGYEYFISFIDD+SRYGH+YL+HHKS S EKFKEYKAEVENE+GKTIK LRSDRGGEYMD +F+DYL
Subjt:  MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYL

Query:  IENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKL
        IE GIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMS++Q+ DSFWGYALETA +ILNNVPSKSV ETPYELWKGRK SLR+FRIWGCPAHVLVQNPKKL
Subjt:  IENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKL

Query:  EHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQP
        E RSKLC F+GYPKESRGGLFY PQENK+FVSTNATFLEEDH R+HQPRSK+VLKE+ K+A DKPSSSTKVVDK   S QSH SQ+LR PRRSGRVVHQP
Subjt:  EHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQP

Query:  DRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVD
        +RYLGL+ETQ++IPDDG+EDPLTYKQAM DVDRDQWIKAM+LEMESMYFNSVWTLVD P+DVKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQ+EGVD
Subjt:  DRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVD

Query:  YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNIDEP
        YEETFSPVAMLKSIRILLSIATFY+YEIWQMDVKTAFLNGNLEESIYM QPEGFI QDQEQKVCKL+KSIYGLKQASRSWNIRFDTAIKSYGFEQN+DEP
Subjt:  YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNIDEP

Query:  CVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHL
        CVYKK+VNS++AFL+LYVDDILLIGNDV YLTD+KKWL  QFQMKDLG+AQY+LGIQIVRNRKNKTLAMSQASYIDK+LSRYKMQNSKKG LP+R+GIHL
Subjt:  CVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHL

Query:  SKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARK
        SKEQCPKTPQEVEDMRNIPY+SAVGSLMYAMLCTRPDICYSVG+VSRYQSNPGRDHWTAVKNILKYLRRT++YML+YG KDLILTGYTDSDFQ+DKDARK
Subjt:  SKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARK

Query:  STSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSRNIRIADQISHPKGQ
        STSGSVFTLNGGAVVWRS+KQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAV NS+  R     SH +G+
Subjt:  STSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSRNIRIADQISHPKGQ

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0083.75Show/hide
Query:  MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYL
        MTKR FTGKG RAK PLELIHSDLCGPMNVKARGG+EYFISFIDDYSRYG++YL+ HKS +LEKFKEYK EVEN L K IKILRSDRGGEYMDLRF+DY+
Subjt:  MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYL

Query:  IENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKL
        IE+GIQSQLSAP TPQQNGVSERRNRTLLDMVRSMMS++Q+  SFWGYA+ETA +ILNNVPSKSVSETP+ELW+GRK SL HFRIWGCPAHVLV NPKKL
Subjt:  IENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKL

Query:  EHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVV
        E RS+LC F+GYPKE+RGGLF+DPQEN++FVSTNATFLEEDH+R+H+PRSKLVL E    S   +D+   S++ VD+T  SGQSHPSQ LR PRRSGRVV
Subjt:  EHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVV

Query:  HQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQRE
         QP+RYLGL ETQVVIPDDG+EDPL+YKQAM DVD+DQW+KAMDLEMESMYFNSVW LVD P  VKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQRE
Subjt:  HQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQRE

Query:  GVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNI
        GVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESI+MSQPEGFI Q QEQKVCKL +SIYGLKQASRSWNIRFDTAIKSYGF+QN+
Subjt:  GVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNI

Query:  DEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYG
        DEPCVYKK+    +AFLVLYVDDILLIGNDVGYLTD+K WLA QFQMKDLG+AQYVLGIQI+R+RKNKTLA+SQA+YIDK+L RY MQNSKKGLLP+R+G
Subjt:  DEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYG

Query:  IHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKD
        +HLSKEQ PKTPQEVEDMR IPYASAVGSLMYAMLCTRPDICY+VG+VSRYQSNPG DHWTAVK +LKYLRRT+DYML+YG KDLILTGYTDSDFQTDKD
Subjt:  IHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKD

Query:  ARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSRNIRIADQISHPKGQ
        +RKSTSGSVFTLNGGAVVWRSIKQ CIADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNM+LPITLYCDNSGAV NS+  R     SH +G+
Subjt:  ARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSRNIRIADQISHPKGQ

KAA0033121.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0081.01Show/hide
Query:  MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYL
        MTKR FTGKG RAK PLELIHSDLCGPMNVKARGG+EYFISFIDDYSRYG++YL+ HKS +LEKFKEYK EVEN L K IKILRSDRGGEYMDLRF+DY+
Subjt:  MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYL

Query:  IENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKL
        IE+GIQ QLSAP TPQQNGV ERRNRT+LDMVRSMMS++Q+  SFWGYA+ETA +ILNNV SKSVSETP+ELW+GRK SL HF+I GCPAHVLV NPKKL
Subjt:  IENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKL

Query:  EHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVV
        E RS+LC F+GYPKE+RGGLF+DPQ+N++ VSTNATFLEEDH+RDH+P++KLVL E    S   +D+   S++ V++T  SGQSHPSQ LR PRRSGR+V
Subjt:  EHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVV

Query:  HQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQRE
         QP+RYLGL ETQVVIPDDG+EDPL+Y QAM DVD+DQW+KAMDLEMESMYFN +W LVD P  VKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQRE
Subjt:  HQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQRE

Query:  GVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNI
        GVDYEETFSPVAMLKSIRILLSIATFYDYEIW+MDV TAFLNGNLEESI+MSQPEGFI Q QEQKVCKL +SIYGLKQASRSWNIRFDTAIKSYGFEQN+
Subjt:  GVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNI

Query:  DEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYG
        DEPCVYKK+    + FLVLYVDDILLIGNDVGYLTD+K WLA QFQMKDLG+AQYVLGIQI+R+RKNKTLA+SQA+YIDKML RY MQNSKKGLLP+R+G
Subjt:  DEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYG

Query:  IHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKD
        +HLSKEQCPKTPQEVEDMR IPYASAVGSLMY + CTR +ICY+V +VSRYQSN G DHWTAVK ILKYLRRT+DYML+YG KDLILTGYTDSDFQT+KD
Subjt:  IHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKD

Query:  ARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSRNIRIADQISH
        +RKSTS SVFTLNGGA+VWRSIKQ CIADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNM+LPITLYCDNSGAV NS+  R   +  H
Subjt:  ARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSRNIRIADQISH

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0082.37Show/hide
Query:  MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYL
        MTKR FTGKG RAK PLELIHSDLCGPMNVKARG +EYFISFIDDYSRYG++YL+ HKS +LEKFKEYK EVEN L K IKI RSDRGGEYMDL F+DY+
Subjt:  MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYL

Query:  IENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKL
        IE+GIQSQLSAP TPQQNGVSERRNRTLLDMVRSMMS++Q+  SFWGYA+ETA +ILNNVPSKSVSETP+ELW+GRK SL HFRIWGCPAHVLV NPKKL
Subjt:  IENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKL

Query:  EHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVV
        E RS+LC F+GYPKE+RGGLF+DP+EN++FVSTNATFLEEDH+R+H+PRSKLVL E    S   +D+   S++ VD+T  SGQSHPSQ LR PRRSGRVV
Subjt:  EHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVV

Query:  HQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQRE
         QP+RYLGL ETQVVIPDDG+EDPL+YKQAM DVD+DQW+KAMDLEMESMYFNSVW LVD P  VKPIGCKWIYKRKRD AGKVQTFKARLVAKGYT++E
Subjt:  HQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQRE

Query:  GVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNI
        GVDYEETFS VAMLKSIRILLSIA FYDYEIWQMDVKTAFLNGNLEESI+MSQPEGFI Q QEQKVCKL +SIYGLKQASRSWNIRFDTAIKSYGF+QN+
Subjt:  GVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNI

Query:  DEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYG
        DEPCVYKK+    +AFLVLYVDDILLIGNDVGYLTD+K WLA QFQMKDLG+ QYVLGIQI+R+RKNKTLA+SQA+YIDK+L RY MQNSKKGLLP+R+G
Subjt:  DEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYG

Query:  IHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKD
        +HLSKEQ PKTPQEVEDMR IPYASAVGSLMYAMLCTRPDICY+VG+VSRYQSNPG DHWTAVK ILKYLRRT+DYML+YG KDLILTGYT+SDFQTDKD
Subjt:  IHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKD

Query:  ARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSRNIRIADQISHPKGQ
        +RKSTS SVFTLNGGAVVWRSIKQ CIADSTMEAEYVAACEAAKEAVWL+KFL DLEVVPNM+LPITLYCDNSGAV NS+  R     SH +G+
Subjt:  ARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSRNIRIADQISHPKGQ

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0083.75Show/hide
Query:  MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYL
        MTKR FTGKG RAK PLELIHSDLCGPMNVKARGG+EYFISFIDDYSRYG++YL+ HKS +LEKFKEYK EVEN L K IKILRSDRGGEYMDLRF+DY+
Subjt:  MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYL

Query:  IENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKL
        IE+GIQSQLSAP TPQQNGVSERRNRTLLDMVRSMMS++Q+  SFWGYA+ETA +ILNNVPSKSVSETP+ELW+GRK SL HFRIWGCPAHVLV NPKKL
Subjt:  IENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKL

Query:  EHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVV
        E RS+LC F+GYPKE+RGGLF+DPQEN++FVSTNATFLEEDH+R+H+PRSKLVL E    S   +D+   S++ VD+T  SGQSHPSQ LR PRRSGRVV
Subjt:  EHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVV

Query:  HQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQRE
         QP+RYLGL ETQVVIPDDG+EDPL+YKQAM DVD+DQW+KAMDLEMESMYFNSVW LVD P  VKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQRE
Subjt:  HQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQRE

Query:  GVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNI
        GVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESI+MSQPEGFI Q QEQKVCKL +SIYGLKQASRSWNIRFDTAIKSYGF+QN+
Subjt:  GVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNI

Query:  DEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYG
        DEPCVYKK+    +AFLVLYVDDILLIGNDVGYLTD+K WLA QFQMKDLG+AQYVLGIQI+R+RKNKTLA+SQA+YIDK+L RY MQNSKKGLLP+R+G
Subjt:  DEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYG

Query:  IHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKD
        +HLSKEQ PKTPQEVEDMR IPYASAVGSLMYAMLCTRPDICY+VG+VSRYQSNPG DHWTAVK +LKYLRRT+DYML+YG KDLILTGYTDSDFQTDKD
Subjt:  IHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKD

Query:  ARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSRNIRIADQISHPKGQ
        +RKSTSGSVFTLNGGAVVWRSIKQ CIADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNM+LPITLYCDNSGAV NS+  R     SH +G+
Subjt:  ARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSRNIRIADQISHPKGQ

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein0.0e+0082.37Show/hide
Query:  MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYL
        MTKR FTGKG RAK PLELIHSDLCGPMNVKARG +EYFISFIDDYSRYG++YL+ HKS +LEKFKEYK EVEN L K IKI RSDRGGEYMDL F+DY+
Subjt:  MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYL

Query:  IENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKL
        IE+GIQSQLSAP TPQQNGVSERRNRTLLDMVRSMMS++Q+  SFWGYA+ETA +ILNNVPSKSVSETP+ELW+GRK SL HFRIWGCPAHVLV NPKKL
Subjt:  IENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKL

Query:  EHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVV
        E RS+LC F+GYPKE+RGGLF+DP+EN++FVSTNATFLEEDH+R+H+PRSKLVL E    S   +D+   S++ VD+T  SGQSHPSQ LR PRRSGRVV
Subjt:  EHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVV

Query:  HQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQRE
         QP+RYLGL ETQVVIPDDG+EDPL+YKQAM DVD+DQW+KAMDLEMESMYFNSVW LVD P  VKPIGCKWIYKRKRD AGKVQTFKARLVAKGYT++E
Subjt:  HQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQRE

Query:  GVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNI
        GVDYEETFS VAMLKSIRILLSIA FYDYEIWQMDVKTAFLNGNLEESI+MSQPEGFI Q QEQKVCKL +SIYGLKQASRSWNIRFDTAIKSYGF+QN+
Subjt:  GVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNI

Query:  DEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYG
        DEPCVYKK+    +AFLVLYVDDILLIGNDVGYLTD+K WLA QFQMKDLG+ QYVLGIQI+R+RKNKTLA+SQA+YIDK+L RY MQNSKKGLLP+R+G
Subjt:  DEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYG

Query:  IHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKD
        +HLSKEQ PKTPQEVEDMR IPYASAVGSLMYAMLCTRPDICY+VG+VSRYQSNPG DHWTAVK ILKYLRRT+DYML+YG KDLILTGYT+SDFQTDKD
Subjt:  IHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKD

Query:  ARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSRNIRIADQISHPKGQ
        +RKSTS SVFTLNGGAVVWRSIKQ CIADSTMEAEYVAACEAAKEAVWL+KFL DLEVVPNM+LPITLYCDNSGAV NS+  R     SH +G+
Subjt:  ARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSRNIRIADQISHPKGQ

A0A5A7TZD0 Gag/pol protein0.0e+0083.75Show/hide
Query:  MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYL
        MTKR FTGKG RAK PLELIHSDLCGPMNVKARGG+EYFISFIDDYSRYG++YL+ HKS +LEKFKEYK EVEN L K IKILRSDRGGEYMDLRF+DY+
Subjt:  MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYL

Query:  IENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKL
        IE+GIQSQLSAP TPQQNGVSERRNRTLLDMVRSMMS++Q+  SFWGYA+ETA +ILNNVPSKSVSETP+ELW+GRK SL HFRIWGCPAHVLV NPKKL
Subjt:  IENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKL

Query:  EHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVV
        E RS+LC F+GYPKE+RGGLF+DPQEN++FVSTNATFLEEDH+R+H+PRSKLVL E    S   +D+   S++ VD+T  SGQSHPSQ LR PRRSGRVV
Subjt:  EHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVV

Query:  HQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQRE
         QP+RYLGL ETQVVIPDDG+EDPL+YKQAM DVD+DQW+KAMDLEMESMYFNSVW LVD P  VKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQRE
Subjt:  HQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQRE

Query:  GVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNI
        GVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESI+MSQPEGFI Q QEQKVCKL +SIYGLKQASRSWNIRFDTAIKSYGF+QN+
Subjt:  GVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNI

Query:  DEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYG
        DEPCVYKK+    +AFLVLYVDDILLIGNDVGYLTD+K WLA QFQMKDLG+AQYVLGIQI+R+RKNKTLA+SQA+YIDK+L RY MQNSKKGLLP+R+G
Subjt:  DEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYG

Query:  IHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKD
        +HLSKEQ PKTPQEVEDMR IPYASAVGSLMYAMLCTRPDICY+VG+VSRYQSNPG DHWTAVK +LKYLRRT+DYML+YG KDLILTGYTDSDFQTDKD
Subjt:  IHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKD

Query:  ARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSRNIRIADQISHPKGQ
        +RKSTSGSVFTLNGGAVVWRSIKQ CIADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNM+LPITLYCDNSGAV NS+  R     SH +G+
Subjt:  ARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSRNIRIADQISHPKGQ

A0A5A7UYE8 Gag/pol protein0.0e+0083.75Show/hide
Query:  MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYL
        MTKR FTGKG RAK PLELIHSDLCGPMNVKARGG+EYFISFIDDYSRYG++YL+ HKS +LEKFKEYK EVEN L K IKILRSDRGGEYMDLRF+DY+
Subjt:  MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYL

Query:  IENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKL
        IE+GIQSQLSAP TPQQNGVSERRNRTLLDMVRSMMS++Q+  SFWGYA+ETA +ILNNVPSKSVSETP+ELW+GRK SL HFRIWGCPAHVLV NPKKL
Subjt:  IENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKL

Query:  EHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVV
        E RS+LC F+GYPKE+RGGLF+DPQEN++FVSTNATFLEEDH+R+H+PRSKLVL E    S   +D+   S++ VD+T  SGQSHPSQ LR PRRSGRVV
Subjt:  EHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVV

Query:  HQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQRE
         QP+RYLGL ETQVVIPDDG+EDPL+YKQAM DVD+DQW+KAMDLEMESMYFNSVW LVD P  VKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQRE
Subjt:  HQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQRE

Query:  GVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNI
        GVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESI+MSQPEGFI Q QEQKVCKL +SIYGLKQASRSWNIRFDTAIKSYGF+QN+
Subjt:  GVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNI

Query:  DEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYG
        DEPCVYKK+    +AFLVLYVDDILLIGNDVGYLTD+K WLA QFQMKDLG+AQYVLGIQI+R+RKNKTLA+SQA+YIDK+L RY MQNSKKGLLP+R+G
Subjt:  DEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYG

Query:  IHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKD
        +HLSKEQ PKTPQEVEDMR IPYASAVGSLMYAMLCTRPDICY+VG+VSRYQSNPG DHWTAVK +LKYLRRT+DYML+YG KDLILTGYTDSDFQTDKD
Subjt:  IHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKD

Query:  ARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSRNIRIADQISHPKGQ
        +RKSTSGSVFTLNGGAVVWRSIKQ CIADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNM+LPITLYCDNSGAV NS+  R     SH +G+
Subjt:  ARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSRNIRIADQISHPKGQ

A0A5D3CZY3 Gag/pol protein0.0e+0081.01Show/hide
Query:  MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYL
        MTKR FTGKG RAK PLELIHSDLCGPMNVKARGG+EYFISFIDDYSRYG++YL+ HKS +LEKFKEYK EVEN L K IKILRSDRGGEYMDLRF+DY+
Subjt:  MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYL

Query:  IENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKL
        IE+GIQ QLSAP TPQQNGV ERRNRT+LDMVRSMMS++Q+  SFWGYA+ETA +ILNNV SKSVSETP+ELW+GRK SL HF+I GCPAHVLV NPKKL
Subjt:  IENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKL

Query:  EHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVV
        E RS+LC F+GYPKE+RGGLF+DPQ+N++ VSTNATFLEEDH+RDH+P++KLVL E    S   +D+   S++ V++T  SGQSHPSQ LR PRRSGR+V
Subjt:  EHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVV

Query:  HQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQRE
         QP+RYLGL ETQVVIPDDG+EDPL+Y QAM DVD+DQW+KAMDLEMESMYFN +W LVD P  VKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQRE
Subjt:  HQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQRE

Query:  GVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNI
        GVDYEETFSPVAMLKSIRILLSIATFYDYEIW+MDV TAFLNGNLEESI+MSQPEGFI Q QEQKVCKL +SIYGLKQASRSWNIRFDTAIKSYGFEQN+
Subjt:  GVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNI

Query:  DEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYG
        DEPCVYKK+    + FLVLYVDDILLIGNDVGYLTD+K WLA QFQMKDLG+AQYVLGIQI+R+RKNKTLA+SQA+YIDKML RY MQNSKKGLLP+R+G
Subjt:  DEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYG

Query:  IHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKD
        +HLSKEQCPKTPQEVEDMR IPYASAVGSLMY + CTR +ICY+V +VSRYQSN G DHWTAVK ILKYLRRT+DYML+YG KDLILTGYTDSDFQT+KD
Subjt:  IHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKD

Query:  ARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSRNIRIADQISH
        +RKSTS SVFTLNGGA+VWRSIKQ CIADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNM+LPITLYCDNSGAV NS+  R   +  H
Subjt:  ARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSRNIRIADQISH

E2GK51 Gag/pol protein (Fragment)0.0e+0088.5Show/hide
Query:  MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYL
        MTKRSFTGKGLRAK PLEL+HSDLCGPMNVKARGGYEYFISFIDD+SRYGH+YL+HHKS S EKFKEYKAEVENE+GKTIK LRSDRGGEYMD +F+DYL
Subjt:  MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYL

Query:  IENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKL
        IE GIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMS++Q+ DSFWGYALETA +ILNNVPSKSV ETPYELWKGRK SLR+FRIWGCPAHVLVQNPKKL
Subjt:  IENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKL

Query:  EHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQP
        E RSKLC F+GYPKESRGGLFY PQENK+FVSTNATFLEEDH R+HQPRSK+VLKE+ K+A DKPSSSTKVVDK   S QSH SQ+LR PRRSGRVVHQP
Subjt:  EHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQP

Query:  DRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVD
        +RYLGL+ETQ++IPDDG+EDPLTYKQAM DVDRDQWIKAM+LEMESMYFNSVWTLVD P+DVKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQ+EGVD
Subjt:  DRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVD

Query:  YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNIDEP
        YEETFSPVAMLKSIRILLSIATFY+YEIWQMDVKTAFLNGNLEESIYM QPEGFI QDQEQKVCKL+KSIYGLKQASRSWNIRFDTAIKSYGFEQN+DEP
Subjt:  YEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNIDEP

Query:  CVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHL
        CVYKK+VNS++AFL+LYVDDILLIGNDV YLTD+KKWL  QFQMKDLG+AQY+LGIQIVRNRKNKTLAMSQASYIDK+LSRYKMQNSKKG LP+R+GIHL
Subjt:  CVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHL

Query:  SKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARK
        SKEQCPKTPQEVEDMRNIPY+SAVGSLMYAMLCTRPDICYSVG+VSRYQSNPGRDHWTAVKNILKYLRRT++YML+YG KDLILTGYTDSDFQ+DKDARK
Subjt:  SKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARK

Query:  STSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSRNIRIADQISHPKGQ
        STSGSVFTLNGGAVVWRS+KQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAV NS+  R     SH +G+
Subjt:  STSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSRNIRIADQISHPKGQ

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.0e-11630.51Show/hide
Query:  KGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPS
        K PL ++HSD+CGP+         YF+ F+D ++ Y   YLI +KS+    F+++ A+ E      +  L  D G EY+    R + ++ GI   L+ P 
Subjt:  KGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPS

Query:  TPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSV---SETPYELWKGRKGSLRHFRIWGCPAHVLVQNPK-KLEHRSKLCFF
        TPQ NGVSER  RT+ +  R+M+S +++  SFWG A+ TA Y++N +PS+++   S+TPYE+W  +K  L+H R++G   +V ++N + K + +S    F
Subjt:  TPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSV---SETPYELWKGRKGSLRHFRIWGCPAHVLVQNPK-KLEHRSKLCFF

Query:  IGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDK--PSSSTKVV------------------DKTRKSGQSHPS-----
        +GY  E  G   +D    K  V+ +    E + +     + + V  + SK + +K  P+ S K++                  D      ++ P+     
Subjt:  IGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDK--PSSSTKVV------------------DKTRKSGQSHPS-----

Query:  ---------------QQLREPRRSGRVV------HQPDRYLG------------LIETQVVIPDDGIEDP---------------------LTYKQ----
                       Q L++ + S +         + D +L               ET   + + GI++P                     ++Y +    
Subjt:  ---------------QQLREPRRSGRVV------HQPDRYLG------------LIETQVVIPDDGIEDP---------------------LTYKQ----

Query:  ----------AMKDV-----------DRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETF
                     DV           D+  W +A++ E+ +   N+ WT+  +P +   +  +W++  K +  G    +KARLVA+G+TQ+  +DYEETF
Subjt:  ----------AMKDV-----------DRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETF

Query:  SPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNIDEPCVY--
        +PVA + S R +LS+   Y+ ++ QMDVKTAFLNG L+E IYM  P+G         VCKL K+IYGLKQA+R W   F+ A+K   F  +  + C+Y  
Subjt:  SPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNIDEPCVY--

Query:  -KKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLP----YRYGI
         K  +N  I +++LYVDD+++   D+  + + K++L  +F+M DL + ++ +GI+I    +   + +SQ++Y+ K+LS++ M+N      P      Y +
Subjt:  -KKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLP----YRYGI

Query:  HLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLI----LTGYTDSDFQT
          S E C           N P  S +G LMY MLCTRPD+  +V ++SRY S    + W  +K +L+YL+ T D  L++  K+L     + GY DSD+  
Subjt:  HLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTKDLI----LTGYTDSDFQT

Query:  DKDARKSTSGSVFTL-NGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSRN
         +  RKST+G +F + +   + W + +Q  +A S+ EAEY+A  EA +EA+WL+  LT + +   +  PI +Y DN G +  + N
Subjt:  DKDARKSTSGSVFTL-NGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSRN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-948.0e-17841.69Show/hide
Query:  SFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENG
        SF     R    L+L++SD+CGPM +++ GG +YF++FIDD SR   +Y++  K    + F+++ A VE E G+ +K LRSD GGEY    F +Y   +G
Subjt:  SFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENG

Query:  IQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCP--AHVLVQNPKKLE
        I+ + + P TPQ NGV+ER NRT+++ VRSM+  +++  SFWG A++TA Y++N  PS  ++ E P  +W  ++ S  H +++GC   AHV  +   KL+
Subjt:  IQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCP--AHVLVQNPKKLE

Query:  HRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI------SKSAIDKPSSSTKVVDKTRKSGQ---------------
         +S  C FIGY  E  G   +DP + K+  S +  F  E  +R     S+ V   I        S  + P+S+    D+  + G+               
Subjt:  HRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI------SKSAIDKPSSSTKVVDKTRKSGQ---------------

Query:  ----SHPSQ---QLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKR
             HP+Q   Q +  RRS R   +  RY       V+I DD   +P + K+ +   +++Q +KAM  EMES+  N  + LV+ P   +P+ CKW++K 
Subjt:  ----SHPSQ---QLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKR

Query:  KRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGL
        K+D   K+  +KARLV KG+ Q++G+D++E FSPV  + SIR +LS+A   D E+ Q+DVKTAFL+G+LEE IYM QPEGF    ++  VCKL KS+YGL
Subjt:  KRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGL

Query:  KQASRSWNIRFDTAIKSYGFEQNIDEPCVY-KKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQA
        KQA R W ++FD+ +KS  + +   +PCVY K+   +    L+LYVDD+L++G D G +  +K  L+  F MKDLG AQ +LG++IVR R ++ L +SQ 
Subjt:  KQASRSWNIRFDTAIKSYGFEQNIDEPCVY-KKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQA

Query:  SYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKD
         YI+++L R+ M+N+K    P    + LSK+ CP T +E  +M  +PY+SAVGSLMYAM+CTRPDI ++VG+VSR+  NPG++HW AVK IL+YLR T  
Subjt:  SYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKD

Query:  YMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGA
          L +G  D IL GYTD+D   D D RKS++G +FT +GGA+ W+S  Q C+A ST EAEY+AA E  KE +WL++FL +L +    ++   +YCD+  A
Subjt:  YMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGA

Query:  VENSRN
        ++ S+N
Subjt:  VENSRN

P25600 Putative transposon Ty5-1 protein YCL074W1.7e-3432.8Show/hide
Query:  MDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNIDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGY
        MDV TAFLN  ++E IY+ QP GF+ +     V +L   +YGLKQA   WN   +  +K  GF ++  E  +Y +  +    ++ +YVDD+L+       
Subjt:  MDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNIDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGY

Query:  LTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYA
           +K+ L   + MKDLG     LG+ I     N  + +S   YI K  S  ++   K    P    +  SK     T   ++D+   PY S VG L++ 
Subjt:  LTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYA

Query:  MLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGT-KDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSIK-QTCIADST
            RPDI Y V ++SR+   P   H  + + +L+YL  T+   L Y +   L LT Y D+      D   ST G V  L G  V W S K +  I   +
Subjt:  MLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGT-KDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSIK-QTCIADST

Query:  MEAEYVAACEAAKE
         EAEY+ A E   E
Subjt:  MEAEYVAACEAAKE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.7e-9828.54Show/hide
Query:  KRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIE
        K  F+   + +  PLE I+SD+     + +   Y Y++ F+D ++RY  +Y +  KS   E F  +K  +EN     I    SD GGE++ L   +Y  +
Subjt:  KRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIE

Query:  NGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQ--NPKK
        +GI    S P TP+ NG+SER++R +++   +++S + +  ++W YA   A Y++N +P+  +  E+P++   G   +    R++GC  +  ++  N  K
Subjt:  NGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQ--NPKK

Query:  LEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE-----------DHIRDHQPRSKLVL----------------------------------
        L+ +S+ C F+GY       L    Q +++++S +  F E              +++ +  S  V                                   
Subjt:  LEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE-----------DHIRDHQPRSKLVL----------------------------------

Query:  ---KEISKSAIDKPSSST----------------KVVDKTRKSGQSHPS-----------------QQLREPRRSGR---------------------VV
            ++S S +D   SS+                     T+   Q+H S                 Q L  P +S                       ++
Subjt:  ---KEISKSAIDKPSSST----------------KVVDKTRKSGQSHPS-----------------QQLREPRRSGR---------------------VV

Query:  HQPDRYLGLIETQVVIP----------DDGI----------------EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLV-DQPNDVKPIGCKWI
        H P     ++      P            GI                 +P T  QA+KD   ++W  AM  E+ +   N  W LV   P+ V  +GC+WI
Subjt:  HQPDRYLGLIETQVVIP----------DDGI----------------EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLV-DQPNDVKPIGCKWI

Query:  YKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSI
        + +K +  G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A    + I Q+DV  AFL G L + +YMSQP GFI++D+   VCKL+K++
Subjt:  YKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSI

Query:  YGLKQASRSWNIRFDTAIKSYGFEQNIDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMS
        YGLKQA R+W +     + + GF  ++ +  ++       I ++++YVDDIL+ GND   L +    L+ +F +KD  +  Y LGI+    R    L +S
Subjt:  YGLKQASRSWNIRFDTAIKSYGFEQNIDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMS

Query:  QASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRT
        Q  YI  +L+R  M  +K    P      LS     K     E      Y   VGSL Y +  TRPDI Y+V  +S++   P  +H  A+K IL+YL  T
Subjt:  QASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRT

Query:  KDY-MLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDN
         ++ + +     L L  Y+D+D+  DKD   ST+G +  L    + W S KQ  +  S+ EAEY +    + E  W+   LT+L +   +  P  +YCDN
Subjt:  KDY-MLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDN

Query:  SGAVENSRNIRIADQISH
         GA     N     ++ H
Subjt:  SGAVENSRNIRIADQISH

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE29.9e-10429.58Show/hide
Query:  KRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIE
        K  F+   + +  PLE I+SD+     + +   Y Y++ F+D ++RY  +Y +  KS   + F  +K+ VEN     I  L SD GGE++ L  RDYL +
Subjt:  KRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIE

Query:  NGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQ--NPKK
        +GI    S P TP+ NG+SER++R +++M  +++S + +  ++W YA   A Y++N +P+  +  ++P++   G+  +    +++GC  +  ++  N  K
Subjt:  NGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQ--NPKK

Query:  LEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE---------------------------------------------DHIRDHQPR-----
        LE +SK C F+GY       L       +++ S +  F E                                               H+ D  PR     
Subjt:  LEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE---------------------------------------------DHIRDHQPR-----

Query:  SKLVLKEIS-----KSAIDKPSSSTKVV---DKTRKSGQSHPSQQ-------LREPRRSGRVVHQPDRYLGLIETQVVIP--------------------
        S L   ++S      S+I  PSSS       +  + + Q H +Q        L  P  +    + P++   L ++ +  P                    
Subjt:  SKLVLKEIS-----KSAIDKPSSSTKVV---DKTRKSGQSHPSQQ-------LREPRRSGRVVHQPDRYLGLIETQVVIP--------------------

Query:  --------------------------------DDGI----------------EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLV-DQPNDVKPI
                                         DGI                 +P T  QAMKD   D+W +AM  E+ +   N  W LV   P  V  +
Subjt:  --------------------------------DDGI----------------EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLV-DQPNDVKPI

Query:  GCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCK
        GC+WI+ +K +  G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A    + I Q+DV  AFL G L + +YMSQP GF+++D+   VC+
Subjt:  GCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCK

Query:  LKKSIYGLKQASRSWNIRFDTAIKSYGFEQNIDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNK
        L+K+IYGLKQA R+W +   T + + GF  +I +  ++       I ++++YVDDIL+ GND   L      L+ +F +K+  D  Y LGI+    R  +
Subjt:  LKKSIYGLKQASRSWNIRFDTAIKSYGFEQNIDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNK

Query:  TLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILK
         L +SQ  Y   +L+R  M  +K    P      L+     K P   E      Y   VGSL Y +  TRPD+ Y+V  +S+Y   P  DHW A+K +L+
Subjt:  TLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILK

Query:  YLRRTKDY-MLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPIT
        YL  T D+ + +     L L  Y+D+D+  D D   ST+G +  L    + W S KQ  +  S+ EAEY +    + E  W+   LT+L +   +  P  
Subjt:  YLRRTKDY-MLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPIT

Query:  LYCDNSGAVENSRNIRIADQISH
        +YCDN GA     N     ++ H
Subjt:  LYCDNSGAVENSRNIRIADQISH

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.5e-7834.39Show/hide
Query:  EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL
        ++P TY +A + +    W  AMD E+ +M     W +   P + KPIGCKW+YK K +  G ++ +KARLVAKGYTQ+EG+D+ ETFSPV  L S++++L
Subjt:  EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL

Query:  SIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQE----QKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNIDEPCVYKKVVNSIIAFL
        +I+  Y++ + Q+D+  AFLNG+L+E IYM  P G+  +  +      VC LKKSIYGLKQASR W ++F   +  +GF Q+  +   + K+  ++   +
Subjt:  SIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQE----QKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNIDEPCVYKKVVNSIIAFL

Query:  VLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVED
        ++YVDDI++  N+   + ++K  L   F+++DLG  +Y LG++I R+     + + Q  Y   +L    +   K   +P    +  S      +  +  D
Subjt:  VLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVED

Query:  MRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTK-DLILTGYTDSDFQTDKDARKSTSGSVFTLNGGA
         +   Y   +G LMY  + TR DI ++V  +S++   P   H  AV  IL Y++ T    L Y ++ ++ L  ++D+ FQ+ KD R+ST+G    L    
Subjt:  MRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMYGTK-DLILTGYTDSDFQTDKDARKSTSGSVFTLNGGA

Query:  VVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSRNIRIADQISH
        + W+S KQ  ++ S+ EAEY A   A  E +WL +F  +L++   +  P  L+CDN+ A+  + N    ++  H
Subjt:  VVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVENSRNIRIADQISH

ATMG00240.1 Gag-Pol-related retrotransposon family protein2.4e-0434.72Show/hide
Query:  TRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMY-GTKDLILTGYTDSDFQTDKDARKSTSG
        TRPD+ ++V  +S++ S        AV  +L Y++ T    L Y  T DL L  + DSD+ +  D R+S +G
Subjt:  TRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDYMLMY-GTKDLILTGYTDSDFQTDKDARKSTSG

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.9e-0431.71Show/hide
Query:  NRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK
        NRT+++ VRSM+    +  +F   A  TA +I+N  PS +++   P E+W     +  + R +GC A++   +  KL+ R+K
Subjt:  NRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK

ATMG00810.1 DNA/RNA polymerases superfamily protein4.1e-2031.78Show/hide
Query:  FLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSK--KGLLPYRYGIHLSKEQCPKTPQ
        +L+LYVDDILL G+    L  +   L+  F MKDLG   Y LGIQI  +     L +SQ  Y +++L+   M + K     LP +    +S  + P    
Subjt:  FLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSK--KGLLPYRYGIHLSKEQCPKTPQ

Query:  EVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDY-MLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTL
        +  D R+I     VG+L Y  L TRPDI Y+V +V +    P    +  +K +L+Y++ T  + + ++    L +  + DSD+      R+ST+G    L
Subjt:  EVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAVKNILKYLRRTKDY-MLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTL

Query:  NGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVW
            + W + +Q  ++ S+ E EY A    A E  W
Subjt:  NGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.3e-1538.83Show/hide
Query:  EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL
        ++P +   A+KD     W +AM  E++++  N  W LV  P +   +GCKW++K K    G +   KARLVAKG+ Q EG+ + ET+SPV    +IR +L
Subjt:  EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL

Query:  SIA
        ++A
Subjt:  SIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCAAGAGATCTTTTACTGGAAAAGGTCTAAGAGCCAAAGGACCCTTAGAGCTCATACATTCGGACCTTTGTGGACCAATGAATGTCAAAGCTCGAGGTGGATATGA
ATATTTCATTAGCTTCATTGATGATTATTCAAGGTATGGTCATATTTACCTAATACATCATAAGTCTAATAGTCTTGAAAAGTTCAAAGAATATAAGGCTGAAGTAGAAA
ACGAATTAGGTAAAACAATAAAAATACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTACGATTCCGAGACTATTTAATAGAAAATGGAATCCAGTCACAACTCTCT
GCACCTAGTACACCTCAACAGAACGGTGTATCAGAAAGAAGAAACCGGACCTTGTTAGACATGGTTCGCTCTATGATGAGTTTTTCTCAGATGTCAGATTCTTTTTGGGG
ATATGCTTTAGAAACAGCTGCTTATATTTTGAATAATGTTCCCTCTAAAAGTGTTTCAGAAACACCTTATGAGCTATGGAAAGGGCGTAAAGGAAGTTTACGTCATTTTA
GAATTTGGGGTTGTCCAGCACACGTGTTGGTACAAAATCCAAAGAAATTGGAACATCGTTCAAAATTATGCTTTTTTATAGGTTATCCAAAAGAATCAAGAGGTGGTTTG
TTTTATGATCCTCAAGAAAATAAAATATTTGTGTCAACAAATGCCACATTCTTAGAGGAAGACCACATCAGGGATCATCAACCTCGTAGTAAACTAGTATTAAAAGAAAT
TTCCAAAAGTGCTATAGATAAACCTAGTTCATCCACTAAGGTAGTTGATAAGACTAGGAAATCTGGTCAATCACATCCTTCTCAACAGTTGAGAGAGCCTCGACGTAGTG
GGAGGGTTGTTCATCAGCCTGATCGCTATTTGGGTTTAATTGAAACTCAAGTCGTCATACCTGACGATGGCATAGAGGATCCATTAACCTATAAACAGGCAATGAAAGAT
GTAGATCGTGACCAATGGATCAAAGCCATGGACCTCGAAATGGAGTCTATGTACTTTAATTCTGTCTGGACTCTAGTAGATCAACCAAATGACGTAAAACCTATTGGTTG
TAAATGGATCTACAAGAGAAAACGAGACCATGCCGGTAAAGTACAGACTTTCAAGGCTCGACTTGTGGCAAAGGGTTATACCCAGAGAGAGGGAGTAGACTATGAGGAAA
CTTTCTCTCCTGTTGCCATGTTAAAGTCAATTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAGCTTTTTTGAATGGT
AATCTTGAAGAGAGTATCTATATGTCTCAACCAGAGGGGTTTATAGAACAAGATCAAGAACAAAAGGTTTGTAAGCTTAAAAAATCCATTTATGGATTAAAACAAGCTTC
TAGATCCTGGAATATAAGATTTGATACTGCGATCAAATCTTATGGCTTTGAACAAAATATTGACGAGCCTTGTGTTTACAAAAAGGTCGTCAATTCCATTATAGCATTTT
TAGTCTTATATGTAGATGATATTCTACTTATTGGAAATGACGTAGGATATCTTACTGATATCAAGAAATGGCTAGCTATGCAATTTCAAATGAAAGATCTGGGAGATGCA
CAATACGTTCTCGGAATCCAAATTGTTCGAAACCGTAAGAACAAAACACTAGCCATGTCTCAAGCATCTTACATAGACAAAATGTTGTCTAGATATAAAATGCAGAATTC
CAAAAAGGGTCTGCTGCCGTACAGATATGGAATTCATTTGTCAAAGGAACAATGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGAAATATTCCCTATGCTTCCGCTG
TTGGAAGTTTAATGTATGCAATGTTATGTACTAGACCTGACATTTGCTACTCAGTAGGGATGGTCAGTAGGTATCAATCCAATCCTGGACGTGATCACTGGACAGCCGTT
AAAAACATTCTAAAATATCTTCGAAGAACAAAAGACTACATGCTCATGTATGGTACAAAGGATCTGATCCTTACTGGATACACTGATTCAGATTTCCAAACTGATAAAGA
TGCTAGAAAGTCTACATCAGGATCAGTATTTACTCTAAATGGAGGAGCAGTAGTTTGGAGAAGCATAAAGCAAACTTGTATAGCTGATTCCACAATGGAAGCTGAATACG
TAGCGGCTTGTGAAGCAGCAAAAGAAGCAGTATGGCTAAGAAAATTCTTGACAGATTTGGAAGTCGTTCCAAATATGCATCTACCAATCACTTTATACTGTGACAACAGT
GGTGCAGTTGAAAATTCAAGAAACATAAGGATCGCAGACCAGATTTCCCATCCAAAAGGACAAGAAAAGAACAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGACCAAGAGATCTTTTACTGGAAAAGGTCTAAGAGCCAAAGGACCCTTAGAGCTCATACATTCGGACCTTTGTGGACCAATGAATGTCAAAGCTCGAGGTGGATATGA
ATATTTCATTAGCTTCATTGATGATTATTCAAGGTATGGTCATATTTACCTAATACATCATAAGTCTAATAGTCTTGAAAAGTTCAAAGAATATAAGGCTGAAGTAGAAA
ACGAATTAGGTAAAACAATAAAAATACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTACGATTCCGAGACTATTTAATAGAAAATGGAATCCAGTCACAACTCTCT
GCACCTAGTACACCTCAACAGAACGGTGTATCAGAAAGAAGAAACCGGACCTTGTTAGACATGGTTCGCTCTATGATGAGTTTTTCTCAGATGTCAGATTCTTTTTGGGG
ATATGCTTTAGAAACAGCTGCTTATATTTTGAATAATGTTCCCTCTAAAAGTGTTTCAGAAACACCTTATGAGCTATGGAAAGGGCGTAAAGGAAGTTTACGTCATTTTA
GAATTTGGGGTTGTCCAGCACACGTGTTGGTACAAAATCCAAAGAAATTGGAACATCGTTCAAAATTATGCTTTTTTATAGGTTATCCAAAAGAATCAAGAGGTGGTTTG
TTTTATGATCCTCAAGAAAATAAAATATTTGTGTCAACAAATGCCACATTCTTAGAGGAAGACCACATCAGGGATCATCAACCTCGTAGTAAACTAGTATTAAAAGAAAT
TTCCAAAAGTGCTATAGATAAACCTAGTTCATCCACTAAGGTAGTTGATAAGACTAGGAAATCTGGTCAATCACATCCTTCTCAACAGTTGAGAGAGCCTCGACGTAGTG
GGAGGGTTGTTCATCAGCCTGATCGCTATTTGGGTTTAATTGAAACTCAAGTCGTCATACCTGACGATGGCATAGAGGATCCATTAACCTATAAACAGGCAATGAAAGAT
GTAGATCGTGACCAATGGATCAAAGCCATGGACCTCGAAATGGAGTCTATGTACTTTAATTCTGTCTGGACTCTAGTAGATCAACCAAATGACGTAAAACCTATTGGTTG
TAAATGGATCTACAAGAGAAAACGAGACCATGCCGGTAAAGTACAGACTTTCAAGGCTCGACTTGTGGCAAAGGGTTATACCCAGAGAGAGGGAGTAGACTATGAGGAAA
CTTTCTCTCCTGTTGCCATGTTAAAGTCAATTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAGCTTTTTTGAATGGT
AATCTTGAAGAGAGTATCTATATGTCTCAACCAGAGGGGTTTATAGAACAAGATCAAGAACAAAAGGTTTGTAAGCTTAAAAAATCCATTTATGGATTAAAACAAGCTTC
TAGATCCTGGAATATAAGATTTGATACTGCGATCAAATCTTATGGCTTTGAACAAAATATTGACGAGCCTTGTGTTTACAAAAAGGTCGTCAATTCCATTATAGCATTTT
TAGTCTTATATGTAGATGATATTCTACTTATTGGAAATGACGTAGGATATCTTACTGATATCAAGAAATGGCTAGCTATGCAATTTCAAATGAAAGATCTGGGAGATGCA
CAATACGTTCTCGGAATCCAAATTGTTCGAAACCGTAAGAACAAAACACTAGCCATGTCTCAAGCATCTTACATAGACAAAATGTTGTCTAGATATAAAATGCAGAATTC
CAAAAAGGGTCTGCTGCCGTACAGATATGGAATTCATTTGTCAAAGGAACAATGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGAAATATTCCCTATGCTTCCGCTG
TTGGAAGTTTAATGTATGCAATGTTATGTACTAGACCTGACATTTGCTACTCAGTAGGGATGGTCAGTAGGTATCAATCCAATCCTGGACGTGATCACTGGACAGCCGTT
AAAAACATTCTAAAATATCTTCGAAGAACAAAAGACTACATGCTCATGTATGGTACAAAGGATCTGATCCTTACTGGATACACTGATTCAGATTTCCAAACTGATAAAGA
TGCTAGAAAGTCTACATCAGGATCAGTATTTACTCTAAATGGAGGAGCAGTAGTTTGGAGAAGCATAAAGCAAACTTGTATAGCTGATTCCACAATGGAAGCTGAATACG
TAGCGGCTTGTGAAGCAGCAAAAGAAGCAGTATGGCTAAGAAAATTCTTGACAGATTTGGAAGTCGTTCCAAATATGCATCTACCAATCACTTTATACTGTGACAACAGT
GGTGCAGTTGAAAATTCAAGAAACATAAGGATCGCAGACCAGATTTCCCATCCAAAAGGACAAGAAAAGAACAAATGA
Protein sequenceShow/hide protein sequence
MTKRSFTGKGLRAKGPLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLS
APSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGL
FYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKD
VDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNG
NLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFDTAIKSYGFEQNIDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDA
QYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLPYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPDICYSVGMVSRYQSNPGRDHWTAV
KNILKYLRRTKDYMLMYGTKDLILTGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNS
GAVENSRNIRIADQISHPKGQEKNK