; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G24730 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G24730
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionGag/pol protein
Genome locationChr3:21827229..21830875
RNA-Seq ExpressionCSPI03G24730
SyntenyCSPI03G24730
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]1.6e-27865.69Show/hide
Query:  IGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEV
        I RLVKSG+L+ LEDNSLPPCESCLEGKMTKRSFTGKGLRAK PLELVHSDLCGPMNVKARGGYEYFISFIDD+SRYGH+YL+HHKS S EKFKEYKAEV
Subjt:  IGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEV

Query:  ENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDM---------MSDSFWGYALETAAYILNNVPSKSVSETPYEL
        ENE+GKTIK LRSDRGGEYMD +F+DYLIE GIQSQLSAPSTPQQNGVSERRNRTLLDM         + DSFWGYALETA +ILNNVPSKSV ETPYEL
Subjt:  ENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDM---------MSDSFWGYALETAAYILNNVPSKSVSETPYEL

Query:  WKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVV
        WKGRK SLR+FRIWGCPAHVLVQNPKKLE RSKLC F+GYPKESRGGLFY PQENK+FVSTNATFLEEDH R+HQPRSK+VLKE+ K+A DKPSSSTKVV
Subjt:  WKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVV

Query:  DKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYK
        DK   S QSH SQ+LR PRRSGRVVHQP+RYLGL+ETQ++IPDDG+EDPLTYKQAM DVDRDQWIKAM+LEMESMYFNSVWTLVD P+DVKPIGCKWIYK
Subjt:  DKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYK

Query:  RKRDHAGK--------------------------------------------------MDVKTAFLNGNLEESIYMSQPEGFIEQDQEQK----------
        RKRD AGK                                                  MDVKTAFLNGNLEESIYM QPEGFI QDQEQK          
Subjt:  RKRDHAGK--------------------------------------------------MDVKTAFLNGNLEESIYMSQPEGFIEQDQEQK----------

Query:  ---------------------------------------HFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQA
                                                FL+LYVDDILLIGNDV YLTD+KKWL  QFQMKDLG+AQY+LGIQIVRNRKNKTLAMSQA
Subjt:  ---------------------------------------HFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQA

Query:  SYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPNICYSV-------------RWSVV------------
        SYIDK+LSRYKMQNSKKG L +R+GIHLSKEQCPKTPQEVEDMRNIPY+SAVGSLMYAMLCTRP+ICYSV              W+ V            
Subjt:  SYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPNICYSV-------------RWSVV------------

Query:  -----------------------------------------------------DSTMEVEYVAACEAAKEALQIQENLEAIKVV
                                                             DSTME EYVAACEAAKEA+ +++ L  ++VV
Subjt:  -----------------------------------------------------DSTMEVEYVAACEAAKEALQIQENLEAIKVV

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]4.0e-25861.5Show/hide
Query:  IGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEV
        IGRLVK+GLL+ L+D SLPPCESCLEGKMTKR FTGKG RAK PLEL+HSDLCGPMNVKARGG+EYFISFIDDYSRYG++YL+ HKS +LEKFKEYK EV
Subjt:  IGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEV

Query:  ENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDM---------MSDSFWGYALETAAYILNNVPSKSVSETPYEL
        EN L K IKILRSDRGGEYMDLRF+DY+IE+GIQSQLSAP TPQQNGVSERRNRTLLDM         +  SFWGYA+ETA +ILNNVPSKSVSETP+EL
Subjt:  ENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDM---------MSDSFWGYALETAAYILNNVPSKSVSETPYEL

Query:  WKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSST
        W+GRK SL HFRIWGCPAHVLV NPKKLE RS+LC F+GYPKE+RGGLF+DPQEN++FVSTNATFLEEDH+R+H+PRSKLVL E    S   +D+   S+
Subjt:  WKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSST

Query:  KVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKW
        + VD+T  SGQSHPSQ LR PRRSGRVV QP+RYLGL ETQVVIPDDG+EDPL+YKQAM DVD+DQW+KAMDLEMESMYFNSVW LVD P  VKPIGCKW
Subjt:  KVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKW

Query:  IYKRKRDHAGK--------------------------------------------------MDVKTAFLNGNLEESIYMSQPEGFIEQDQEQK-------
        IYKRKRD AGK                                                  MDVKTAFLNGNLEESI+MSQPEGFI Q QEQK       
Subjt:  IYKRKRDHAGK--------------------------------------------------MDVKTAFLNGNLEESIYMSQPEGFIEQDQEQK-------

Query:  ------------------------------------------HFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAM
                                                   FLVLYVDDILLIGNDVGYLTD+K WLA QFQMKDLG+AQYVLGIQI+R+RKNKTLA+
Subjt:  ------------------------------------------HFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAM

Query:  SQASYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPNICYSV-------------RWSVV---------
        SQA+YIDK+L RY MQNSKKGLL +R+G+HLSKEQ PKTPQEVEDMR IPYASAVGSLMYAMLCTRP+ICY+V              W+ V         
Subjt:  SQASYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPNICYSV-------------RWSVV---------

Query:  --------------------------------------------------------DSTMEVEYVAACEAAKEALQIQENLEAIKVV
                                                                DSTME EYVAACEAAKEA+ +++ L  ++VV
Subjt:  --------------------------------------------------------DSTMEVEYVAACEAAKEALQIQENLEAIKVV

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]5.4e-25560.86Show/hide
Query:  IGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEV
        IGRLVK GLL+ L+D SLPPCESCLEGKMTKR FTGKG RAK PLEL+HSDLCGPMNVKARG +EYFISFIDDYSRYG++YL+ HKS +LEKFKEYK EV
Subjt:  IGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEV

Query:  ENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDM---------MSDSFWGYALETAAYILNNVPSKSVSETPYEL
        EN L K IKI RSDRGGEYMDL F+DY+IE+GIQSQLSAP TPQQNGVSERRNRTLLDM         +  SFWGYA+ETA +ILNNVPSKSVSETP+EL
Subjt:  ENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDM---------MSDSFWGYALETAAYILNNVPSKSVSETPYEL

Query:  WKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSST
        W+GRK SL HFRIWGCPAHVLV NPKKLE RS+LC F+GYPKE+RGGLF+DP+EN++FVSTNATFLEEDH+R+H+PRSKLVL E    S   +D+   S+
Subjt:  WKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSST

Query:  KVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKW
        + VD+T  SGQSHPSQ LR PRRSGRVV QP+RYLGL ETQVVIPDDG+EDPL+YKQAM DVD+DQW+KAMDLEMESMYFNSVW LVD P  VKPIGCKW
Subjt:  KVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKW

Query:  IYKRKRDHAGK--------------------------------------------------MDVKTAFLNGNLEESIYMSQPEGFIEQDQEQK-------
        IYKRKRD AGK                                                  MDVKTAFLNGNLEESI+MSQPEGFI Q QEQK       
Subjt:  IYKRKRDHAGK--------------------------------------------------MDVKTAFLNGNLEESIYMSQPEGFIEQDQEQK-------

Query:  ------------------------------------------HFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAM
                                                   FLVLYVDDILLIGNDVGYLTD+K WLA QFQMKDLG+ QYVLGIQI+R+RKNKTLA+
Subjt:  ------------------------------------------HFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAM

Query:  SQASYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPNICYSV-------------RWSVV---------
        SQA+YIDK+L RY MQNSKKGLL +R+G+HLSKEQ PKTPQEVEDMR IPYASAVGSLMYAMLCTRP+ICY+V              W+ V         
Subjt:  SQASYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPNICYSV-------------RWSVV---------

Query:  --------------------------------------------------------DSTMEVEYVAACEAAKEALQIQENLEAIKVV
                                                                DSTME EYVAACEAAKEA+ +++ L  ++VV
Subjt:  --------------------------------------------------------DSTMEVEYVAACEAAKEALQIQENLEAIKVV

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]7.9e-23857.4Show/hide
Query:  IGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEV
        I RLVK+GLLS LE+NSLP CESCLEGKMTKR FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEV
Subjt:  IGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEV

Query:  ENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDM---------MSDSFWGYALETAAYILNNVPSKSVSETPYEL
        EN L KTIK  RSDRGGEYMDL+F++YL+E GI SQLSAP TPQQNGVSERRNRTLLDM         + +SFWGYA++TA YILN VPSKSVSETP +L
Subjt:  ENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDM---------MSDSFWGYALETAAYILNNVPSKSVSETPYEL

Query:  WKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKP
        W GRKGSLRHFRIWGCPAHVL  NPKKLE RSKLC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         +++P
Subjt:  WKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKP

Query:  SSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPI
        S+ T+VV     S ++H  Q LREPRRSGRV + P RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPI
Subjt:  SSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPI

Query:  GCKWIYKRKRDHAGK--------------------------------------------------MDVKTAFLNGNLEESIYMSQPEGFIEQDQEQK---
        GCKWIYKRKR   GK                                                  MDVKTAFLNGNLEE+IYM QPEGFI   QEQK   
Subjt:  GCKWIYKRKRDHAGK--------------------------------------------------MDVKTAFLNGNLEESIYMSQPEGFIEQDQEQK---

Query:  ----------------------------------------------HFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNK
                                                       FLVLYVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK
Subjt:  ----------------------------------------------HFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNK

Query:  TLAMSQASYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPNICYSV-------------RWSVV-----
         LA+SQASYIDK++ +Y MQNSK+GLL +R+G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYAMLCTRP+ICY+V              W+ V     
Subjt:  TLAMSQASYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPNICYSV-------------RWSVV-----

Query:  ------------------------------------------------------------DSTMEVEYVAACEAAKEALQIQENLEAIKVV
                                                                    DSTME EYVAACEAAKEA+ ++  L  ++VV
Subjt:  ------------------------------------------------------------DSTMEVEYVAACEAAKEALQIQENLEAIKVV

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]4.0e-25861.5Show/hide
Query:  IGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEV
        IGRLVK+GLL+ L+D SLPPCESCLEGKMTKR FTGKG RAK PLEL+HSDLCGPMNVKARGG+EYFISFIDDYSRYG++YL+ HKS +LEKFKEYK EV
Subjt:  IGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEV

Query:  ENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDM---------MSDSFWGYALETAAYILNNVPSKSVSETPYEL
        EN L K IKILRSDRGGEYMDLRF+DY+IE+GIQSQLSAP TPQQNGVSERRNRTLLDM         +  SFWGYA+ETA +ILNNVPSKSVSETP+EL
Subjt:  ENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDM---------MSDSFWGYALETAAYILNNVPSKSVSETPYEL

Query:  WKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSST
        W+GRK SL HFRIWGCPAHVLV NPKKLE RS+LC F+GYPKE+RGGLF+DPQEN++FVSTNATFLEEDH+R+H+PRSKLVL E    S   +D+   S+
Subjt:  WKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSST

Query:  KVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKW
        + VD+T  SGQSHPSQ LR PRRSGRVV QP+RYLGL ETQVVIPDDG+EDPL+YKQAM DVD+DQW+KAMDLEMESMYFNSVW LVD P  VKPIGCKW
Subjt:  KVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKW

Query:  IYKRKRDHAGK--------------------------------------------------MDVKTAFLNGNLEESIYMSQPEGFIEQDQEQK-------
        IYKRKRD AGK                                                  MDVKTAFLNGNLEESI+MSQPEGFI Q QEQK       
Subjt:  IYKRKRDHAGK--------------------------------------------------MDVKTAFLNGNLEESIYMSQPEGFIEQDQEQK-------

Query:  ------------------------------------------HFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAM
                                                   FLVLYVDDILLIGNDVGYLTD+K WLA QFQMKDLG+AQYVLGIQI+R+RKNKTLA+
Subjt:  ------------------------------------------HFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAM

Query:  SQASYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPNICYSV-------------RWSVV---------
        SQA+YIDK+L RY MQNSKKGLL +R+G+HLSKEQ PKTPQEVEDMR IPYASAVGSLMYAMLCTRP+ICY+V              W+ V         
Subjt:  SQASYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPNICYSV-------------RWSVV---------

Query:  --------------------------------------------------------DSTMEVEYVAACEAAKEALQIQENLEAIKVV
                                                                DSTME EYVAACEAAKEA+ +++ L  ++VV
Subjt:  --------------------------------------------------------DSTMEVEYVAACEAAKEALQIQENLEAIKVV

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein3.8e-23857.4Show/hide
Query:  IGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEV
        I RLVK+GLLS LE+NSLP CESCLEGKMTKR FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DDYSRYG++YL+ HKS +LEKFKEYKAEV
Subjt:  IGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEV

Query:  ENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDM---------MSDSFWGYALETAAYILNNVPSKSVSETPYEL
        EN L KTIK  RSDRGGEYMDL+F++YL+E GI SQLSAP TPQQNGVSERRNRTLLDM         + +SFWGYA++TA YILN VPSKSVSETP +L
Subjt:  ENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDM---------MSDSFWGYALETAAYILNNVPSKSVSETPYEL

Query:  WKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKP
        W GRKGSLRHFRIWGCPAHVL  NPKKLE RSKLC F+GYPK +RGG FYDP++NK+FVSTNATFLEEDHIR+H+PRSK+VL E+SK         +++P
Subjt:  WKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKS-------AIDKP

Query:  SSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPI
        S+ T+VV     S ++H  Q LREPRRSGRV + P RY+ L ET  VI D  IEDPLT+K+AM+DVD+D+WIKAM+LE+ESMYFNSVW LVDQP+ VKPI
Subjt:  SSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPI

Query:  GCKWIYKRKRDHAGK--------------------------------------------------MDVKTAFLNGNLEESIYMSQPEGFIEQDQEQK---
        GCKWIYKRKR   GK                                                  MDVKTAFLNGNLEE+IYM QPEGFI   QEQK   
Subjt:  GCKWIYKRKRDHAGK--------------------------------------------------MDVKTAFLNGNLEESIYMSQPEGFIEQDQEQK---

Query:  ----------------------------------------------HFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNK
                                                       FLVLYVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK
Subjt:  ----------------------------------------------HFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNK

Query:  TLAMSQASYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPNICYSV-------------RWSVV-----
         LA+SQASYIDK++ +Y MQNSK+GLL +R+G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYAMLCTRP+ICY+V              W+ V     
Subjt:  TLAMSQASYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPNICYSV-------------RWSVV-----

Query:  ------------------------------------------------------------DSTMEVEYVAACEAAKEALQIQENLEAIKVV
                                                                    DSTME EYVAACEAAKEA+ ++  L  ++VV
Subjt:  ------------------------------------------------------------DSTMEVEYVAACEAAKEALQIQENLEAIKVV

A0A5A7T2V9 Gag/pol protein2.6e-25560.86Show/hide
Query:  IGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEV
        IGRLVK GLL+ L+D SLPPCESCLEGKMTKR FTGKG RAK PLEL+HSDLCGPMNVKARG +EYFISFIDDYSRYG++YL+ HKS +LEKFKEYK EV
Subjt:  IGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEV

Query:  ENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDM---------MSDSFWGYALETAAYILNNVPSKSVSETPYEL
        EN L K IKI RSDRGGEYMDL F+DY+IE+GIQSQLSAP TPQQNGVSERRNRTLLDM         +  SFWGYA+ETA +ILNNVPSKSVSETP+EL
Subjt:  ENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDM---------MSDSFWGYALETAAYILNNVPSKSVSETPYEL

Query:  WKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSST
        W+GRK SL HFRIWGCPAHVLV NPKKLE RS+LC F+GYPKE+RGGLF+DP+EN++FVSTNATFLEEDH+R+H+PRSKLVL E    S   +D+   S+
Subjt:  WKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSST

Query:  KVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKW
        + VD+T  SGQSHPSQ LR PRRSGRVV QP+RYLGL ETQVVIPDDG+EDPL+YKQAM DVD+DQW+KAMDLEMESMYFNSVW LVD P  VKPIGCKW
Subjt:  KVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKW

Query:  IYKRKRDHAGK--------------------------------------------------MDVKTAFLNGNLEESIYMSQPEGFIEQDQEQK-------
        IYKRKRD AGK                                                  MDVKTAFLNGNLEESI+MSQPEGFI Q QEQK       
Subjt:  IYKRKRDHAGK--------------------------------------------------MDVKTAFLNGNLEESIYMSQPEGFIEQDQEQK-------

Query:  ------------------------------------------HFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAM
                                                   FLVLYVDDILLIGNDVGYLTD+K WLA QFQMKDLG+ QYVLGIQI+R+RKNKTLA+
Subjt:  ------------------------------------------HFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAM

Query:  SQASYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPNICYSV-------------RWSVV---------
        SQA+YIDK+L RY MQNSKKGLL +R+G+HLSKEQ PKTPQEVEDMR IPYASAVGSLMYAMLCTRP+ICY+V              W+ V         
Subjt:  SQASYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPNICYSV-------------RWSVV---------

Query:  --------------------------------------------------------DSTMEVEYVAACEAAKEALQIQENLEAIKVV
                                                                DSTME EYVAACEAAKEA+ +++ L  ++VV
Subjt:  --------------------------------------------------------DSTMEVEYVAACEAAKEALQIQENLEAIKVV

A0A5A7TZD0 Gag/pol protein1.9e-25861.5Show/hide
Query:  IGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEV
        IGRLVK+GLL+ L+D SLPPCESCLEGKMTKR FTGKG RAK PLEL+HSDLCGPMNVKARGG+EYFISFIDDYSRYG++YL+ HKS +LEKFKEYK EV
Subjt:  IGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEV

Query:  ENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDM---------MSDSFWGYALETAAYILNNVPSKSVSETPYEL
        EN L K IKILRSDRGGEYMDLRF+DY+IE+GIQSQLSAP TPQQNGVSERRNRTLLDM         +  SFWGYA+ETA +ILNNVPSKSVSETP+EL
Subjt:  ENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDM---------MSDSFWGYALETAAYILNNVPSKSVSETPYEL

Query:  WKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSST
        W+GRK SL HFRIWGCPAHVLV NPKKLE RS+LC F+GYPKE+RGGLF+DPQEN++FVSTNATFLEEDH+R+H+PRSKLVL E    S   +D+   S+
Subjt:  WKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSST

Query:  KVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKW
        + VD+T  SGQSHPSQ LR PRRSGRVV QP+RYLGL ETQVVIPDDG+EDPL+YKQAM DVD+DQW+KAMDLEMESMYFNSVW LVD P  VKPIGCKW
Subjt:  KVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKW

Query:  IYKRKRDHAGK--------------------------------------------------MDVKTAFLNGNLEESIYMSQPEGFIEQDQEQK-------
        IYKRKRD AGK                                                  MDVKTAFLNGNLEESI+MSQPEGFI Q QEQK       
Subjt:  IYKRKRDHAGK--------------------------------------------------MDVKTAFLNGNLEESIYMSQPEGFIEQDQEQK-------

Query:  ------------------------------------------HFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAM
                                                   FLVLYVDDILLIGNDVGYLTD+K WLA QFQMKDLG+AQYVLGIQI+R+RKNKTLA+
Subjt:  ------------------------------------------HFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAM

Query:  SQASYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPNICYSV-------------RWSVV---------
        SQA+YIDK+L RY MQNSKKGLL +R+G+HLSKEQ PKTPQEVEDMR IPYASAVGSLMYAMLCTRP+ICY+V              W+ V         
Subjt:  SQASYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPNICYSV-------------RWSVV---------

Query:  --------------------------------------------------------DSTMEVEYVAACEAAKEALQIQENLEAIKVV
                                                                DSTME EYVAACEAAKEA+ +++ L  ++VV
Subjt:  --------------------------------------------------------DSTMEVEYVAACEAAKEALQIQENLEAIKVV

A0A5A7UYE8 Gag/pol protein1.9e-25861.5Show/hide
Query:  IGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEV
        IGRLVK+GLL+ L+D SLPPCESCLEGKMTKR FTGKG RAK PLEL+HSDLCGPMNVKARGG+EYFISFIDDYSRYG++YL+ HKS +LEKFKEYK EV
Subjt:  IGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEV

Query:  ENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDM---------MSDSFWGYALETAAYILNNVPSKSVSETPYEL
        EN L K IKILRSDRGGEYMDLRF+DY+IE+GIQSQLSAP TPQQNGVSERRNRTLLDM         +  SFWGYA+ETA +ILNNVPSKSVSETP+EL
Subjt:  ENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDM---------MSDSFWGYALETAAYILNNVPSKSVSETPYEL

Query:  WKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSST
        W+GRK SL HFRIWGCPAHVLV NPKKLE RS+LC F+GYPKE+RGGLF+DPQEN++FVSTNATFLEEDH+R+H+PRSKLVL E    S   +D+   S+
Subjt:  WKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSST

Query:  KVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKW
        + VD+T  SGQSHPSQ LR PRRSGRVV QP+RYLGL ETQVVIPDDG+EDPL+YKQAM DVD+DQW+KAMDLEMESMYFNSVW LVD P  VKPIGCKW
Subjt:  KVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKW

Query:  IYKRKRDHAGK--------------------------------------------------MDVKTAFLNGNLEESIYMSQPEGFIEQDQEQK-------
        IYKRKRD AGK                                                  MDVKTAFLNGNLEESI+MSQPEGFI Q QEQK       
Subjt:  IYKRKRDHAGK--------------------------------------------------MDVKTAFLNGNLEESIYMSQPEGFIEQDQEQK-------

Query:  ------------------------------------------HFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAM
                                                   FLVLYVDDILLIGNDVGYLTD+K WLA QFQMKDLG+AQYVLGIQI+R+RKNKTLA+
Subjt:  ------------------------------------------HFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAM

Query:  SQASYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPNICYSV-------------RWSVV---------
        SQA+YIDK+L RY MQNSKKGLL +R+G+HLSKEQ PKTPQEVEDMR IPYASAVGSLMYAMLCTRP+ICY+V              W+ V         
Subjt:  SQASYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPNICYSV-------------RWSVV---------

Query:  --------------------------------------------------------DSTMEVEYVAACEAAKEALQIQENLEAIKVV
                                                                DSTME EYVAACEAAKEA+ +++ L  ++VV
Subjt:  --------------------------------------------------------DSTMEVEYVAACEAAKEALQIQENLEAIKVV

E2GK51 Gag/pol protein (Fragment)7.6e-27965.69Show/hide
Query:  IGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEV
        I RLVKSG+L+ LEDNSLPPCESCLEGKMTKRSFTGKGLRAK PLELVHSDLCGPMNVKARGGYEYFISFIDD+SRYGH+YL+HHKS S EKFKEYKAEV
Subjt:  IGRLVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEV

Query:  ENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDM---------MSDSFWGYALETAAYILNNVPSKSVSETPYEL
        ENE+GKTIK LRSDRGGEYMD +F+DYLIE GIQSQLSAPSTPQQNGVSERRNRTLLDM         + DSFWGYALETA +ILNNVPSKSV ETPYEL
Subjt:  ENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDM---------MSDSFWGYALETAAYILNNVPSKSVSETPYEL

Query:  WKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVV
        WKGRK SLR+FRIWGCPAHVLVQNPKKLE RSKLC F+GYPKESRGGLFY PQENK+FVSTNATFLEEDH R+HQPRSK+VLKE+ K+A DKPSSSTKVV
Subjt:  WKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVV

Query:  DKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYK
        DK   S QSH SQ+LR PRRSGRVVHQP+RYLGL+ETQ++IPDDG+EDPLTYKQAM DVDRDQWIKAM+LEMESMYFNSVWTLVD P+DVKPIGCKWIYK
Subjt:  DKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYK

Query:  RKRDHAGK--------------------------------------------------MDVKTAFLNGNLEESIYMSQPEGFIEQDQEQK----------
        RKRD AGK                                                  MDVKTAFLNGNLEESIYM QPEGFI QDQEQK          
Subjt:  RKRDHAGK--------------------------------------------------MDVKTAFLNGNLEESIYMSQPEGFIEQDQEQK----------

Query:  ---------------------------------------HFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQA
                                                FL+LYVDDILLIGNDV YLTD+KKWL  QFQMKDLG+AQY+LGIQIVRNRKNKTLAMSQA
Subjt:  ---------------------------------------HFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQA

Query:  SYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPNICYSV-------------RWSVV------------
        SYIDK+LSRYKMQNSKKG L +R+GIHLSKEQCPKTPQEVEDMRNIPY+SAVGSLMYAMLCTRP+ICYSV              W+ V            
Subjt:  SYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPNICYSV-------------RWSVV------------

Query:  -----------------------------------------------------DSTMEVEYVAACEAAKEALQIQENLEAIKVV
                                                             DSTME EYVAACEAAKEA+ +++ L  ++VV
Subjt:  -----------------------------------------------------DSTMEVEYVAACEAAKEALQIQENLEAIKVV

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.2e-5223.89Show/hide
Query:  CESCLEGKMTKRSFTGKGLR----AKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRG
        CE CL GK  +  F  K L+     K PL +VHSD+CGP+         YF+ F+D ++ Y   YLI +KS+    F+++ A+ E      +  L  D G
Subjt:  CESCLEGKMTKRSFTGKGLR----AKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRG

Query:  GEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLD---------MMSDSFWGYALETAAYILNNVPSKSV---SETPYELWKGRKGSLRHFRI
         EY+    R + ++ GI   L+ P TPQ NGVSER  RT+ +          +  SFWG A+ TA Y++N +PS+++   S+TPYE+W  +K  L+H R+
Subjt:  GEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLD---------MMSDSFWGYALETAAYILNNVPSKSV---SETPYELWKGRKGSLRHFRI

Query:  WGCPAHVLVQNPK-KLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDK--PSSSTKVV----------
        +G   +V ++N + K + +S    F+GY  E  G   +D    K  V+ +    E + +     + + V  + SK + +K  P+ S K++          
Subjt:  WGCPAHVLVQNPK-KLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDK--PSSSTKVV----------

Query:  --------DKTRKSGQSHPS--------------------QQLREPRRSGRVV------HQPDRYLG------------LIETQVVIPDDGIEDP-----
                D      ++ P+                    Q L++ + S +         + D +L               ET   + + GI++P     
Subjt:  --------DKTRKSGQSHPS--------------------QQLREPRRSGRVV------HQPDRYLG------------LIETQVVIPDDGIEDP-----

Query:  ----------------LTYKQ--------------AMKDV-----------DRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAG-
                        ++Y +                 DV           D+  W +A++ E+ +   N+ WT+  +P +   +  +W++  K +  G 
Subjt:  ----------------LTYKQ--------------AMKDV-----------DRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAG-

Query:  -------------------------------------------------KMDVKTAFLNGNLEESIYMSQPEGF--------------------------
                                                         +MDVKTAFLNG L+E IYM  P+G                           
Subjt:  -------------------------------------------------KMDVKTAFLNGNLEESIYMSQPEGF--------------------------

Query:  -IEQDQEQKHF----------------------LVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLS
          EQ  ++  F                      ++LYVDD+++   D+  + + K++L  +F+M DL + ++ +GI+I    +   + +SQ++Y+ K+LS
Subjt:  -IEQDQEQKHF----------------------LVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLS

Query:  RYKMQN----SKKGLLSYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPNICYSV
        ++ M+N    S        Y +  S E C           N P  S +G LMY MLCTRP++  +V
Subjt:  RYKMQN----SKKGLLSYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPNICYSV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-948.2e-9733.05Show/hide
Query:  LVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENE
        L K  L+S  +  ++ PC+ CL GK  + SF     R    L+LV+SD+CGPM +++ GG +YF++FIDD SR   +Y++  K    + F+++ A VE E
Subjt:  LVKSGLLSPLEDNSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENE

Query:  LGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLD---------MMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWK
         G+ +K LRSD GGEY    F +Y   +GI+ + + P TPQ NGV+ER NRT+++          +  SFWG A++TA Y++N  PS  ++ E P  +W 
Subjt:  LGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLD---------MMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWK

Query:  GRKGSLRHFRIWGCP--AHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI------SKSAIDKPS
         ++ S  H +++GC   AHV  +   KL+ +S  C FIGY  E  G   +DP + K+  S +  F  E  +R     S+ V   I        S  + P+
Subjt:  GRKGSLRHFRIWGCP--AHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI------SKSAIDKPS

Query:  SSTKVVDKTRKSGQ-------------------SHPSQ---QLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEM
        S+    D+  + G+                    HP+Q   Q +  RRS R   +  RY       V+I DD   +P + K+ +   +++Q +KAM  EM
Subjt:  SSTKVVDKTRKSGQ-------------------SHPSQ---QLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEM

Query:  ESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRD--------------------------------------------------HAGKMDVKTAFLNGNLEE
        ES+  N  + LV+ P   +P+ CKW++K K+D                                                     ++DVKTAFL+G+LEE
Subjt:  ESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRD--------------------------------------------------HAGKMDVKTAFLNGNLEE

Query:  SIYMSQPEGF------------------IEQDQEQ-------------------------KHF-------LVLYVDDILLIGNDVGYLTDIKKWLAMQFQ
         IYM QPEGF                  ++Q   Q                         K F       L+LYVDD+L++G D G +  +K  L+  F 
Subjt:  SIYMSQPEGF------------------IEQDQEQ-------------------------KHF-------LVLYVDDILLIGNDVGYLTDIKKWLAMQFQ

Query:  MKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPNICYSV
        MKDLG AQ +LG++IVR R ++ L +SQ  YI+++L R+ M+N+K         + LSK+ CP T +E  +M  +PY+SAVGSLMYAM+CTRP+I ++V
Subjt:  MKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPNICYSV

Q12490 Transposon Ty1-BL Gag-Pol polyprotein2.6e-1826.14Show/hide
Query:  CESCLEGKMTK-RSFTGKGLRAKG---PLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIH--HKSNSLEKFKEYKAEVENELGKTIKILRSD
        C  CL GK TK R   G  L+ +    P + +H+D+ GP++   +    YFISF D+ +++  +Y +H   + + L+ F    A ++N+   ++ +++ D
Subjt:  CESCLEGKMTK-RSFTGKGLRAKG---PLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIH--HKSNSLEKFKEYKAEVENELGKTIKILRSD

Query:  RGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLD---------MMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRI-
        RG EY +     +L +NGI    +  +  + +GV+ER NRTLLD          + +  W  A+E +  + N++ S           K +K + +H  + 
Subjt:  RGGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLD---------MMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRI-

Query:  ---------WGCPAHVLVQNPKKLEHRSKLCFFIGYP-KESRGGLFYDPQENKIFVSTNATFLE
                 +G P  V   NP    H   +  +  +P + S G + Y P   K   +TN   L+
Subjt:  ---------WGCPAHVLVQNPKKLEHRSKLCFFIGYP-KESRGGLFYDPQENKIFVSTNATFLE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.3e-3622.89Show/hide
Query:  CESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYM
        C  CL  K  K  F+   + +  PLE ++SD+     + +   Y Y++ F+D ++RY  +Y +  KS   E F  +K  +EN     I    SD GGE++
Subjt:  CESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYM

Query:  DLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNR-------TLLDMMS--DSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAH
         L   +Y  ++GI    S P TP+ NG+SER++R       TLL   S   ++W YA   A Y++N +P+  +  E+P++   G   +    R++GC  +
Subjt:  DLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNR-------TLLDMMS--DSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAH

Query:  VLVQ--NPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE-----------DHIRDHQPRSKLVL------------------------
          ++  N  KL+ +S+ C F+GY       L    Q +++++S +  F E              +++ +  S  V                         
Subjt:  VLVQ--NPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE-----------DHIRDHQPRSKLVL------------------------

Query:  -------------KEISKSAIDKPSSST----------------KVVDKTRKSGQSHPS-----------------QQLREPRRSGR-------------
                      ++S S +D   SS+                     T+   Q+H S                 Q L  P +S               
Subjt:  -------------KEISKSAIDKPSSST----------------KVVDKTRKSGQSHPS-----------------QQLREPRRSGR-------------

Query:  --------VVHQPDRYLGLIETQVVIP----------DDGI----------------EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLV-DQPN
                ++H P     ++      P            GI                 +P T  QA+KD   ++W  AM  E+ +   N  W LV   P+
Subjt:  --------VVHQPDRYLGLIETQVVIP----------DDGI----------------EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLV-DQPN

Query:  DVKPIGCKWIYKRKRDHAG--------------------------------------------------KMDVKTAFLNGNLEESIYMSQPEGFIEQDQ-
         V  +GC+WI+ +K +  G                                                  ++DV  AFL G L + +YMSQP GFI++D+ 
Subjt:  DVKPIGCKWIYKRKRDHAG--------------------------------------------------KMDVKTAFLNGNLEESIYMSQPEGFIEQDQ-

Query:  ----------------------EQKHFL--------------------------VLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVR
                              E +++L                          ++YVDDIL+ GND   L +    L+ +F +KD  +  Y LGI+   
Subjt:  ----------------------EQKHFL--------------------------VLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVR

Query:  NRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPNICYSV
         R    L +SQ  YI  +L+R  M  +K           LS     K     E      Y   VGSL Y +  TRP+I Y+V
Subjt:  NRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPNICYSV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.0e-3822.73Show/hide
Query:  NSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDR
        + L  C  C   K  K  F+   + +  PLE ++SD+     + +   Y Y++ F+D ++RY  +Y +  KS   + F  +K+ VEN     I  L SD 
Subjt:  NSLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDR

Query:  GGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDM---------MSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIW
        GGE++ L  RDYL ++GI    S P TP+ NG+SER++R +++M         +  ++W YA   A Y++N +P+  +  ++P++   G+  +    +++
Subjt:  GGEYMDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDM---------MSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIW

Query:  GCPAHVLVQ--NPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE--------------------------------------------
        GC  +  ++  N  KLE +SK C F+GY       L       +++ S +  F E                                             
Subjt:  GCPAHVLVQ--NPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE--------------------------------------------

Query:  -DHIRDHQPR-----SKLVLKEIS-----KSAIDKPSSSTKVV---DKTRKSGQSHPSQQ-------LREPRRSGRVVHQPDRYLGLIETQVVIP-----
          H+ D  PR     S L   ++S      S+I  PSSS       +  + + Q H +Q        L  P  +    + P++   L ++ +  P     
Subjt:  -DHIRDHQPR-----SKLVLKEIS-----KSAIDKPSSSTKVV---DKTRKSGQSHPSQQ-------LREPRRSGRVVHQPDRYLGLIETQVVIP-----

Query:  -----------------------------------------------DDGI----------------EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNS
                                                        DGI                 +P T  QAMKD   D+W +AM  E+ +   N 
Subjt:  -----------------------------------------------DDGI----------------EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNS

Query:  VWTLV-DQPNDVKPIGCKWIYKRKRDHAG--------------------------------------------------KMDVKTAFLNGNLEESIYMSQ
         W LV   P  V  +GC+WI+ +K +  G                                                  ++DV  AFL G L + +YMSQ
Subjt:  VWTLV-DQPNDVKPIGCKWIYKRKRDHAG--------------------------------------------------KMDVKTAFLNGNLEESIYMSQ

Query:  PEGFIEQDQ-------------------------------------------------EQKHFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDA
        P GF+++D+                                                     ++++YVDDIL+ GND   L      L+ +F +K+  D 
Subjt:  PEGFIEQDQ-------------------------------------------------EQKHFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDA

Query:  QYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPNICYSV
         Y LGI+    R  + L +SQ  Y   +L+R  M  +K           L+     K P   E      Y   VGSL Y +  TRP++ Y+V
Subjt:  QYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPNICYSV

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 88.4e-1221.65Show/hide
Query:  EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAG---------------------------------------
        ++P TY +A + +    W  AMD E+ +M     W +   P + KPIGCKW+YK K +  G                                       
Subjt:  EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAG---------------------------------------

Query:  -----------KMDVKTAFLNGNLEESIYMSQPE-------------------------------------------GFIEQDQEQKHFL----------
                   ++D+  AFLNG+L+E IYM  P                                            GF++   +  +FL          
Subjt:  -----------KMDVKTAFLNGNLEESIYMSQPE-------------------------------------------GFIEQDQEQKHFL----------

Query:  VLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEVED
        ++YVDDI++  N+   + ++K  L   F+++DLG  +Y LG++I R+     + + Q  Y   +L    +   K   +     +  S      +  +  D
Subjt:  VLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEVED

Query:  MRNIPYASAVGSLMYAMLCTRPNICYSV
         +   Y   +G LMY  + TR +I ++V
Subjt:  MRNIPYASAVGSLMYAMLCTRPNICYSV

ATMG00810.1 DNA/RNA polymerases superfamily protein1.1e-0831.79Show/hide
Query:  FLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEV
        +L+LYVDDILL G+    L  +   L+  F MKDLG   Y LGIQI  +     L +SQ  Y +++L+   M + K   +S    + L+         + 
Subjt:  FLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEV

Query:  EDMRNIPYASAVGSLMYAMLCTRPNICYSVRWSVVDSTMEVEYVAACEAAKEALQIQENLEAIKVVYHELHNH
         D R+I     VG+L Y  L TRP+I Y+V  ++V   M    +A  +  K  L+  +       ++H L+ H
Subjt:  EDMRNIPYASAVGSLMYAMLCTRPNICYSVRWSVVDSTMEVEYVAACEAAKEALQIQENLEAIKVVYHELHNH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAACTGGGAGATGCACAATACGTTCTCGGAATCCAAATTGTTCGAAACCGTAAGAACAAAACACTAGCTATGTCTCAAGCATCTTACATAGACAAAATTTTGTCTAG
GTATAAAATGCAGAATTCCAAAAAGGGTCTATTGTCGTACAGATATGGAATTCATTTGTCAAATGAACAATGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGAAATA
TACCCTATGCTTCCGCTGTTGGAAGTTTGATGATTGGGAGATTGGTTAAAAGTGGGCTTCTAAGTCCGTTAGAAGATAACTCTTTACCTCCTTGTGAATCTTGTCTTGAA
GGAAAAATGACCAAGAGATCTTTTACTGGAAAAGGTCTAAGAGCCAAAGGACCCTTAGAGCTCGTACATTCGGACCTTTGTGGACCAATGAATGTCAAAGCTCGAGGTGG
ATATGAATATTTCATTAGCTTCATTGATGATTATTCAAGGTATGGTCATATTTACCTAATACATCATAAGTCTAATAGTCTTGAAAAGTTCAAAGAATATAAGGCTGAAG
TAGAAAACGAATTAGGTAAAACAATAAAAATACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTACGATTCCGAGACTATTTAATAGAAAATGGAATCCAGTCACAA
CTCTCTGCACCTAGTACACCTCAACAGAACGGTGTATCAGAAAGAAGAAACCGGACCTTGTTAGACATGATGTCAGATTCTTTTTGGGGATATGCTTTAGAAACAGCTGC
TTATATTTTGAATAATGTTCCCTCTAAAAGTGTTTCAGAAACACCTTATGAGCTATGGAAAGGGCGTAAAGGAAGTTTACGTCATTTTAGAATTTGGGGTTGTCCAGCAC
ACGTGTTGGTACAAAATCCAAAGAAATTGGAACATCGTTCAAAATTATGCTTTTTCATAGGTTATCCAAAAGAATCAAGAGGTGGTTTGTTTTATGATCCTCAAGAAAAT
AAAATATTTGTGTCAACAAATGCCACATTCTTAGAGGAAGACCACATCAGGGATCATCAACCTCGTAGTAAACTAGTATTAAAAGAAATTTCCAAAAGTGCTATAGATAA
ACCTAGTTCATCCACTAAGGTAGTTGATAAGACTAGGAAATCTGGTCAATCACATCCTTCTCAACAGTTGAGAGAGCCTCGACGTAGTGGGAGGGTTGTTCATCAGCCTG
ATCGCTATTTGGGTTTAATTGAAACTCAAGTCGTCATACCTGACGATGGCATAGAGGATCCATTAACCTATAAACAGGCAATGAAAGATGTAGATCGTGACCAATGGATC
AAAGCCATGGACCTCGAAATGGAGTCTATGTACTTTAATTCTGTCTGGACTCTAGTAGATCAACCAAATGACGTAAAACCTATTGGTTGTAAATGGATCTACAAGAGAAA
ACGAGACCATGCCGGTAAAATGGATGTCAAGACAGCTTTTTTGAATGGTAATCTTGAAGAGAGTATCTATATGTCTCAACCAGAGGGGTTTATAGAACAAGATCAAGAAC
AAAAGCATTTTTTAGTCTTATATGTAGATGATATTCTACTTATTGGAAATGACGTAGGATATCTTACTGATATCAAGAAATGGCTAGCTATGCAATTTCAAATGAAAGAT
CTGGGAGATGCACAATACGTTCTCGGAATCCAAATTGTTCGAAACCGTAAGAACAAAACACTAGCCATGTCTCAAGCATCTTACATAGACAAAATGTTGTCTAGATATAA
AATGCAGAATTCCAAAAAGGGTCTATTGTCGTACAGATATGGAATTCATTTGTCAAAGGAACAATGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGAAATATTCCGT
ATGCTTCCGCTGTTGGAAGTTTAATGTATGCAATGTTATGTACTAGGCCTAACATTTGCTACTCAGTAAGGTGGTCAGTAGTTGATTCCACAATGGAAGTTGAATACGTA
GCGGCTTGTGAAGCAGCAAAAGAAGCATTGCAAATTCAAGAGAACCTAGAAGCCATAAAAGTTGTGTATCATGAACTACACAATCATGTGAAGGCAGCTCATCAGCGTTG
GCCCAATAAGCCTCCCATTTCAGGGGTTGGGTGGATAGCTCGGGACATAGGGTGCATTACGAAATTCAATCTTATCCGCTTTAGGGTTAGTAGATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAACTGGGAGATGCACAATACGTTCTCGGAATCCAAATTGTTCGAAACCGTAAGAACAAAACACTAGCTATGTCTCAAGCATCTTACATAGACAAAATTTTGTCTAG
GTATAAAATGCAGAATTCCAAAAAGGGTCTATTGTCGTACAGATATGGAATTCATTTGTCAAATGAACAATGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGAAATA
TACCCTATGCTTCCGCTGTTGGAAGTTTGATGATTGGGAGATTGGTTAAAAGTGGGCTTCTAAGTCCGTTAGAAGATAACTCTTTACCTCCTTGTGAATCTTGTCTTGAA
GGAAAAATGACCAAGAGATCTTTTACTGGAAAAGGTCTAAGAGCCAAAGGACCCTTAGAGCTCGTACATTCGGACCTTTGTGGACCAATGAATGTCAAAGCTCGAGGTGG
ATATGAATATTTCATTAGCTTCATTGATGATTATTCAAGGTATGGTCATATTTACCTAATACATCATAAGTCTAATAGTCTTGAAAAGTTCAAAGAATATAAGGCTGAAG
TAGAAAACGAATTAGGTAAAACAATAAAAATACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTACGATTCCGAGACTATTTAATAGAAAATGGAATCCAGTCACAA
CTCTCTGCACCTAGTACACCTCAACAGAACGGTGTATCAGAAAGAAGAAACCGGACCTTGTTAGACATGATGTCAGATTCTTTTTGGGGATATGCTTTAGAAACAGCTGC
TTATATTTTGAATAATGTTCCCTCTAAAAGTGTTTCAGAAACACCTTATGAGCTATGGAAAGGGCGTAAAGGAAGTTTACGTCATTTTAGAATTTGGGGTTGTCCAGCAC
ACGTGTTGGTACAAAATCCAAAGAAATTGGAACATCGTTCAAAATTATGCTTTTTCATAGGTTATCCAAAAGAATCAAGAGGTGGTTTGTTTTATGATCCTCAAGAAAAT
AAAATATTTGTGTCAACAAATGCCACATTCTTAGAGGAAGACCACATCAGGGATCATCAACCTCGTAGTAAACTAGTATTAAAAGAAATTTCCAAAAGTGCTATAGATAA
ACCTAGTTCATCCACTAAGGTAGTTGATAAGACTAGGAAATCTGGTCAATCACATCCTTCTCAACAGTTGAGAGAGCCTCGACGTAGTGGGAGGGTTGTTCATCAGCCTG
ATCGCTATTTGGGTTTAATTGAAACTCAAGTCGTCATACCTGACGATGGCATAGAGGATCCATTAACCTATAAACAGGCAATGAAAGATGTAGATCGTGACCAATGGATC
AAAGCCATGGACCTCGAAATGGAGTCTATGTACTTTAATTCTGTCTGGACTCTAGTAGATCAACCAAATGACGTAAAACCTATTGGTTGTAAATGGATCTACAAGAGAAA
ACGAGACCATGCCGGTAAAATGGATGTCAAGACAGCTTTTTTGAATGGTAATCTTGAAGAGAGTATCTATATGTCTCAACCAGAGGGGTTTATAGAACAAGATCAAGAAC
AAAAGCATTTTTTAGTCTTATATGTAGATGATATTCTACTTATTGGAAATGACGTAGGATATCTTACTGATATCAAGAAATGGCTAGCTATGCAATTTCAAATGAAAGAT
CTGGGAGATGCACAATACGTTCTCGGAATCCAAATTGTTCGAAACCGTAAGAACAAAACACTAGCCATGTCTCAAGCATCTTACATAGACAAAATGTTGTCTAGATATAA
AATGCAGAATTCCAAAAAGGGTCTATTGTCGTACAGATATGGAATTCATTTGTCAAAGGAACAATGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGAAATATTCCGT
ATGCTTCCGCTGTTGGAAGTTTAATGTATGCAATGTTATGTACTAGGCCTAACATTTGCTACTCAGTAAGGTGGTCAGTAGTTGATTCCACAATGGAAGTTGAATACGTA
GCGGCTTGTGAAGCAGCAAAAGAAGCATTGCAAATTCAAGAGAACCTAGAAGCCATAAAAGTTGTGTATCATGAACTACACAATCATGTGAAGGCAGCTCATCAGCGTTG
GCCCAATAAGCCTCCCATTTCAGGGGTTGGGTGGATAGCTCGGGACATAGGGTGCATTACGAAATTCAATCTTATCCGCTTTAGGGTTAGTAGATAG
Protein sequenceShow/hide protein sequence
MKLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKILSRYKMQNSKKGLLSYRYGIHLSNEQCPKTPQEVEDMRNIPYASAVGSLMIGRLVKSGLLSPLEDNSLPPCESCLE
GKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSNSLEKFKEYKAEVENELGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQ
LSAPSTPQQNGVSERRNRTLLDMMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQEN
KIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWI
KAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKHFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKD
LGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLSYRYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAMLCTRPNICYSVRWSVVDSTMEVEYV
AACEAAKEALQIQENLEAIKVVYHELHNHVKAAHQRWPNKPPISGVGWIARDIGCITKFNLIRFRVSR