; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI06G15660 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI06G15660
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionGag/pol protein
Genome locationChr6:14003487..14005298
RNA-Seq ExpressionCSPI06G15660
SyntenyCSPI06G15660
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]4.3e-30686.51Show/hide
Query:  MDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAH
        MD +F+DYLIE GIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMS++Q+ DSFWGYALETA +ILNNVPSKSV ETPYELWKGRK SLR+FRIWGCPAH
Subjt:  MDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAH

Query:  VLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPR
        VLVQNPKKLE RSKLC F+GYPKESRGGLFY PQENK+FVSTNATFLEEDH R+HQPRSK+VLKE+ K+A DKPSSSTKVVDK   S QSH SQ+LR PR
Subjt:  VLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPR

Query:  RSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAK
        RSGRVVHQP+RYLGL+ETQ++IPDDG+EDPLTYKQAM DVDRDQWIKAM+LEMESMYFNSVWTLVD P+DVKPIGCKWIYKRKRD AGKVQTFKARLVAK
Subjt:  RSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAK

Query:  GYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAIKSY
        GYTQ+EGVDYEETFSPVAMLKSIRILLSIATFY+YEIWQMDVKT F+NGNLEESIYM Q +GFI QDQEQKVCKL+KSIYGLKQASRSWN+RFDTAIKSY
Subjt:  GYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAIKSY

Query:  GFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGL
        GFEQNVDEPCVYKK+VNS++AFL+LYVDDILLIGNDVEYL D+KKWL  QFQMKDLG+AQY+LGIQIVRNRKNKTLAMSQASYIDK+LSRYKMQNSKKG 
Subjt:  GFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGL

Query:  LSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKML
        L + +GIHLSKEQCPKTPQEVEDMRNIPY+S VGSLMYA LCTRPDICYSVG+VSRYQSNPGRDHWT VKNILKYLRRT++YML+Y  KD +L
Subjt:  LSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKML

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]9.3e-28580.87Show/hide
Query:  MDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAH
        MDLRF+DY+IE+GIQSQLSAP TPQQNGVSERRNRTLLDMVRSMMS++Q+  SFWGYA+ETA +ILNNVPSKSVSETP+ELW+GRK SL HFRIWGCPAH
Subjt:  MDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAH

Query:  VLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLR
        VLV NPKKLE RS+LC F+GYPKE+RGGLF+DPQEN++FVSTNATFLEEDH+R+H+PRSKLVL E    S   +D+   S++ VD+T  SGQSHPSQ LR
Subjt:  VLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLR

Query:  EPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARL
         PRRSGRVV QP+RYLGL ETQVVIPDDG+EDPL+YKQAM DVD+DQW+KAMDLEMESMYFNSVW LVD P  VKPIGCKWIYKRKRD AGKVQTFKARL
Subjt:  EPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARL

Query:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAI
        VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKT F+NGNLEESI+M Q +GFI Q QEQKVCKL +SIYGLKQASRSWN+RFDTAI
Subjt:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAI

Query:  KSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSK
        KSYGF+QNVDEPCVYKK+    +AFLVLYVDDILLIGNDV YL D+K WLA QFQMKDLG+AQYVLGIQI+R+RKNKTLA+SQA+YIDK+L RY MQNSK
Subjt:  KSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSK

Query:  KGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKML
        KGLL + +G+HLSKEQ PKTPQEVEDMR IPYAS VGSLMYA LCTRPDICY+VG+VSRYQSNPG DHWT VK +LKYLRRT+DYML+Y  KD +L
Subjt:  KGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKML

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-28079.87Show/hide
Query:  MDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAH
        MDL F+DY+IE+GIQSQLSAP TPQQNGVSERRNRTLLDMVRSMMS++Q+  SFWGYA+ETA +ILNNVPSKSVSETP+ELW+GRK SL HFRIWGCPAH
Subjt:  MDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAH

Query:  VLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLR
        VLV NPKKLE RS+LC F+GYPKE+RGGLF+DP+EN++FVSTNATFLEEDH+R+H+PRSKLVL E    S   +D+   S++ VD+T  SGQSHPSQ LR
Subjt:  VLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLR

Query:  EPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARL
         PRRSGRVV QP+RYLGL ETQVVIPDDG+EDPL+YKQAM DVD+DQW+KAMDLEMESMYFNSVW LVD P  VKPIGCKWIYKRKRD AGKVQTFKARL
Subjt:  EPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARL

Query:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAI
        VAKGYT++EGVDYEETFS VAMLKSIRILLSIA FYDYEIWQMDVKT F+NGNLEESI+M Q +GFI Q QEQKVCKL +SIYGLKQASRSWN+RFDTAI
Subjt:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAI

Query:  KSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSK
        KSYGF+QNVDEPCVYKK+    +AFLVLYVDDILLIGNDV YL D+K WLA QFQMKDLG+ QYVLGIQI+R+RKNKTLA+SQA+YIDK+L RY MQNSK
Subjt:  KSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSK

Query:  KGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKML
        KGLL + +G+HLSKEQ PKTPQEVEDMR IPYAS VGSLMYA LCTRPDICY+VG+VSRYQSNPG DHWT VK ILKYLRRT+DYML+Y  KD +L
Subjt:  KGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKML

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]9.3e-28580.87Show/hide
Query:  MDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAH
        MDLRF+DY+IE+GIQSQLSAP TPQQNGVSERRNRTLLDMVRSMMS++Q+  SFWGYA+ETA +ILNNVPSKSVSETP+ELW+GRK SL HFRIWGCPAH
Subjt:  MDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAH

Query:  VLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLR
        VLV NPKKLE RS+LC F+GYPKE+RGGLF+DPQEN++FVSTNATFLEEDH+R+H+PRSKLVL E    S   +D+   S++ VD+T  SGQSHPSQ LR
Subjt:  VLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLR

Query:  EPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARL
         PRRSGRVV QP+RYLGL ETQVVIPDDG+EDPL+YKQAM DVD+DQW+KAMDLEMESMYFNSVW LVD P  VKPIGCKWIYKRKRD AGKVQTFKARL
Subjt:  EPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARL

Query:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAI
        VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKT F+NGNLEESI+M Q +GFI Q QEQKVCKL +SIYGLKQASRSWN+RFDTAI
Subjt:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAI

Query:  KSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSK
        KSYGF+QNVDEPCVYKK+    +AFLVLYVDDILLIGNDV YL D+K WLA QFQMKDLG+AQYVLGIQI+R+RKNKTLA+SQA+YIDK+L RY MQNSK
Subjt:  KSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSK

Query:  KGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKML
        KGLL + +G+HLSKEQ PKTPQEVEDMR IPYAS VGSLMYA LCTRPDICY+VG+VSRYQSNPG DHWT VK +LKYLRRT+DYML+Y  KD +L
Subjt:  KGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKML

TYK03644.1 gag/pol protein [Cucumis melo var. makuwa]9.9e-29584.49Show/hide
Query:  MDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAH
        MDLRF+DYLIE+GIQSQ SAPS PQQNGV +RRNR LLDMVRSMMSF+Q+ DSFW YALET  YILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAH
Subjt:  MDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAH

Query:  VLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPR
        V VQNPKKLE RSKLC F+GYPKES+GGLFYDPQENK+FVSTNATFLEEDHIR+HQ RSKLVL+EISK+  D+PSS TKVVDKTR  GQ+H  Q+L +PR
Subjt:  VLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPR

Query:  RSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAK
        RSGRVV Q DRYLGL E Q++IPDDGIEDPLTYK AM DVDRDQWIKAMDLEMESMY NSVWTLVDQPNDVKPIGCKWIYKRKRD AGKVQTFKARLVAK
Subjt:  RSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAK

Query:  GYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAIKSY
        GYTQ+EG+DYEE FS  AM+KSIRILLSIATFYDYEIWQMDVKTTF+N NLEESIYM Q + FI++ QEQK+CKL+KSIYGLKQASRS N+RFDTAIKSY
Subjt:  GYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAIKSY

Query:  GFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGL
        G EQNVDEPCVYK+++NS +AFLVLYVDDILLIGNDV +L DIKKWLAMQFQMKDLG+AQYVLG+QIVRNRKNKTLAMSQ SYIDKMLSRYKM NSKKGL
Subjt:  GFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGL

Query:  LSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKML
        L Y YGIHLSKEQCPKTPQEVEDM NIPYAS VGSLMY  LCTRP+ICYSVG+VSR QS PGRDHWTTVKNILKYLRRTKDYML+Y +KD +L
Subjt:  LSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKML

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein6.7e-28179.87Show/hide
Query:  MDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAH
        MDL F+DY+IE+GIQSQLSAP TPQQNGVSERRNRTLLDMVRSMMS++Q+  SFWGYA+ETA +ILNNVPSKSVSETP+ELW+GRK SL HFRIWGCPAH
Subjt:  MDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAH

Query:  VLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLR
        VLV NPKKLE RS+LC F+GYPKE+RGGLF+DP+EN++FVSTNATFLEEDH+R+H+PRSKLVL E    S   +D+   S++ VD+T  SGQSHPSQ LR
Subjt:  VLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLR

Query:  EPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARL
         PRRSGRVV QP+RYLGL ETQVVIPDDG+EDPL+YKQAM DVD+DQW+KAMDLEMESMYFNSVW LVD P  VKPIGCKWIYKRKRD AGKVQTFKARL
Subjt:  EPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARL

Query:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAI
        VAKGYT++EGVDYEETFS VAMLKSIRILLSIA FYDYEIWQMDVKT F+NGNLEESI+M Q +GFI Q QEQKVCKL +SIYGLKQASRSWN+RFDTAI
Subjt:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAI

Query:  KSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSK
        KSYGF+QNVDEPCVYKK+    +AFLVLYVDDILLIGNDV YL D+K WLA QFQMKDLG+ QYVLGIQI+R+RKNKTLA+SQA+YIDK+L RY MQNSK
Subjt:  KSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSK

Query:  KGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKML
        KGLL + +G+HLSKEQ PKTPQEVEDMR IPYAS VGSLMYA LCTRPDICY+VG+VSRYQSNPG DHWT VK ILKYLRRT+DYML+Y  KD +L
Subjt:  KGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKML

A0A5A7TZD0 Gag/pol protein4.5e-28580.87Show/hide
Query:  MDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAH
        MDLRF+DY+IE+GIQSQLSAP TPQQNGVSERRNRTLLDMVRSMMS++Q+  SFWGYA+ETA +ILNNVPSKSVSETP+ELW+GRK SL HFRIWGCPAH
Subjt:  MDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAH

Query:  VLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLR
        VLV NPKKLE RS+LC F+GYPKE+RGGLF+DPQEN++FVSTNATFLEEDH+R+H+PRSKLVL E    S   +D+   S++ VD+T  SGQSHPSQ LR
Subjt:  VLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLR

Query:  EPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARL
         PRRSGRVV QP+RYLGL ETQVVIPDDG+EDPL+YKQAM DVD+DQW+KAMDLEMESMYFNSVW LVD P  VKPIGCKWIYKRKRD AGKVQTFKARL
Subjt:  EPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARL

Query:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAI
        VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKT F+NGNLEESI+M Q +GFI Q QEQKVCKL +SIYGLKQASRSWN+RFDTAI
Subjt:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAI

Query:  KSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSK
        KSYGF+QNVDEPCVYKK+    +AFLVLYVDDILLIGNDV YL D+K WLA QFQMKDLG+AQYVLGIQI+R+RKNKTLA+SQA+YIDK+L RY MQNSK
Subjt:  KSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSK

Query:  KGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKML
        KGLL + +G+HLSKEQ PKTPQEVEDMR IPYAS VGSLMYA LCTRPDICY+VG+VSRYQSNPG DHWT VK +LKYLRRT+DYML+Y  KD +L
Subjt:  KGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKML

A0A5A7UYE8 Gag/pol protein4.5e-28580.87Show/hide
Query:  MDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAH
        MDLRF+DY+IE+GIQSQLSAP TPQQNGVSERRNRTLLDMVRSMMS++Q+  SFWGYA+ETA +ILNNVPSKSVSETP+ELW+GRK SL HFRIWGCPAH
Subjt:  MDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAH

Query:  VLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLR
        VLV NPKKLE RS+LC F+GYPKE+RGGLF+DPQEN++FVSTNATFLEEDH+R+H+PRSKLVL E    S   +D+   S++ VD+T  SGQSHPSQ LR
Subjt:  VLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI---SKSAIDKPSSSTKVVDKTRKSGQSHPSQQLR

Query:  EPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARL
         PRRSGRVV QP+RYLGL ETQVVIPDDG+EDPL+YKQAM DVD+DQW+KAMDLEMESMYFNSVW LVD P  VKPIGCKWIYKRKRD AGKVQTFKARL
Subjt:  EPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARL

Query:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAI
        VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKT F+NGNLEESI+M Q +GFI Q QEQKVCKL +SIYGLKQASRSWN+RFDTAI
Subjt:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAI

Query:  KSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSK
        KSYGF+QNVDEPCVYKK+    +AFLVLYVDDILLIGNDV YL D+K WLA QFQMKDLG+AQYVLGIQI+R+RKNKTLA+SQA+YIDK+L RY MQNSK
Subjt:  KSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSK

Query:  KGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKML
        KGLL + +G+HLSKEQ PKTPQEVEDMR IPYAS VGSLMYA LCTRPDICY+VG+VSRYQSNPG DHWT VK +LKYLRRT+DYML+Y  KD +L
Subjt:  KGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKML

A0A5D3BX45 Gag/pol protein4.8e-29584.49Show/hide
Query:  MDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAH
        MDLRF+DYLIE+GIQSQ SAPS PQQNGV +RRNR LLDMVRSMMSF+Q+ DSFW YALET  YILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAH
Subjt:  MDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAH

Query:  VLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPR
        V VQNPKKLE RSKLC F+GYPKES+GGLFYDPQENK+FVSTNATFLEEDHIR+HQ RSKLVL+EISK+  D+PSS TKVVDKTR  GQ+H  Q+L +PR
Subjt:  VLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPR

Query:  RSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAK
        RSGRVV Q DRYLGL E Q++IPDDGIEDPLTYK AM DVDRDQWIKAMDLEMESMY NSVWTLVDQPNDVKPIGCKWIYKRKRD AGKVQTFKARLVAK
Subjt:  RSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAK

Query:  GYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAIKSY
        GYTQ+EG+DYEE FS  AM+KSIRILLSIATFYDYEIWQMDVKTTF+N NLEESIYM Q + FI++ QEQK+CKL+KSIYGLKQASRS N+RFDTAIKSY
Subjt:  GYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAIKSY

Query:  GFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGL
        G EQNVDEPCVYK+++NS +AFLVLYVDDILLIGNDV +L DIKKWLAMQFQMKDLG+AQYVLG+QIVRNRKNKTLAMSQ SYIDKMLSRYKM NSKKGL
Subjt:  GFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGL

Query:  LSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKML
        L Y YGIHLSKEQCPKTPQEVEDM NIPYAS VGSLMY  LCTRP+ICYSVG+VSR QS PGRDHWTTVKNILKYLRRTKDYML+Y +KD +L
Subjt:  LSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKML

E2GK51 Gag/pol protein (Fragment)2.1e-30686.51Show/hide
Query:  MDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAH
        MD +F+DYLIE GIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMS++Q+ DSFWGYALETA +ILNNVPSKSV ETPYELWKGRK SLR+FRIWGCPAH
Subjt:  MDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAH

Query:  VLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPR
        VLVQNPKKLE RSKLC F+GYPKESRGGLFY PQENK+FVSTNATFLEEDH R+HQPRSK+VLKE+ K+A DKPSSSTKVVDK   S QSH SQ+LR PR
Subjt:  VLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPR

Query:  RSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAK
        RSGRVVHQP+RYLGL+ETQ++IPDDG+EDPLTYKQAM DVDRDQWIKAM+LEMESMYFNSVWTLVD P+DVKPIGCKWIYKRKRD AGKVQTFKARLVAK
Subjt:  RSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAK

Query:  GYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAIKSY
        GYTQ+EGVDYEETFSPVAMLKSIRILLSIATFY+YEIWQMDVKT F+NGNLEESIYM Q +GFI QDQEQKVCKL+KSIYGLKQASRSWN+RFDTAIKSY
Subjt:  GYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAIKSY

Query:  GFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGL
        GFEQNVDEPCVYKK+VNS++AFL+LYVDDILLIGNDVEYL D+KKWL  QFQMKDLG+AQY+LGIQIVRNRKNKTLAMSQASYIDK+LSRYKMQNSKKG 
Subjt:  GFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGL

Query:  LSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKML
        L + +GIHLSKEQCPKTPQEVEDMRNIPY+S VGSLMYA LCTRPDICYSVG+VSRYQSNPGRDHWT VKNILKYLRRT++YML+Y  KD +L
Subjt:  LSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKML

SwissProt top hitse value%identityAlignment
P04146 Copia protein6.9e-8129.27Show/hide
Query:  RDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSV---SETPYELWKGRKGSLRHFRIWGCPAHVL
        R + ++ GI   L+ P TPQ NGVSER  RT+ +  R+M+S +++  SFWG A+ TA Y++N +PS+++   S+TPYE+W  +K  L+H R++G   +V 
Subjt:  RDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSV---SETPYELWKGRKGSLRHFRIWGCPAHVL

Query:  VQNPK-KLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDK--PSSSTKVV------------------
        ++N + K + +S    F+GY  E  G   +D    K  V+ +    E + +     + + V  + SK + +K  P+ S K++                  
Subjt:  VQNPK-KLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDK--PSSSTKVV------------------

Query:  DKTRKSGQSHPS--------------------QQLREPRRSGRVV------HQPDRYLG------------LIETQVVIPDDGIEDP-------------
        D      ++ P+                    Q L++ + S +         + D +L               ET   + + GI++P             
Subjt:  DKTRKSGQSHPS--------------------QQLREPRRSGRVV------HQPDRYLG------------LIETQVVIPDDGIEDP-------------

Query:  --------LTYKQ--------------AMKDV-----------DRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARL
                ++Y +                 DV           D+  W +A++ E+ +   N+ WT+  +P +   +  +W++  K +  G    +KARL
Subjt:  --------LTYKQ--------------AMKDV-----------DRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARL

Query:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAI
        VA+G+TQ+  +DYEETF+PVA + S R +LS+   Y+ ++ QMDVKT F+NG L+E IYM   +G         VCKL K+IYGLKQA+R W   F+ A+
Subjt:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAI

Query:  KSYGFEQNVDEPCVY---KKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQ
        K   F  +  + C+Y   K  +N  I +++LYVDD+++   D+  + + K++L  +F+M DL + ++ +GI+I    +   + +SQ++Y+ K+LS++ M+
Subjt:  KSYGFEQNVDEPCVY---KKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQ

Query:  NS---KKGLLSYI-YGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYR
        N       L S I Y +  S E C           N P  S++G LMY  LCTRPD+  +V ++SRY S    + W  +K +L+YL+ T D  L+++
Subjt:  NS---KKGLLSYI-YGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.7e-12239.39Show/hide
Query:  FRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCP--AHV
        F +Y   +GI+ + + P TPQ NGV+ER NRT+++ VRSM+  +++  SFWG A++TA Y++N  PS  ++ E P  +W  ++ S  H +++GC   AHV
Subjt:  FRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCP--AHV

Query:  LVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI------SKSAIDKPSSSTKVVDKTRKSGQ------
          +   KL+ +S  C FIGY  E  G   +DP + K+  S +  F  E  +R     S+ V   I        S  + P+S+    D+  + G+      
Subjt:  LVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEI------SKSAIDKPSSSTKVVDKTRKSGQ------

Query:  -------------SHPSQ---QLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKP
                      HP+Q   Q +  RRS R   +  RY       V+I DD   +P + K+ +   +++Q +KAM  EMES+  N  + LV+ P   +P
Subjt:  -------------SHPSQ---QLREPRRSGRVVHQPDRYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKP

Query:  IGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVC
        + CKW++K K+D   K+  +KARLV KG+ Q++G+D++E FSPV  + SIR +LS+A   D E+ Q+DVKT F++G+LEE IYM Q +GF    ++  VC
Subjt:  IGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVC

Query:  KLKKSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVY-KKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRK
        KL KS+YGLKQA R W M+FD+ +KS  + +   +PCVY K+   +    L+LYVDD+L++G D   +  +K  L+  F MKDLG AQ +LG++IVR R 
Subjt:  KLKKSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVY-KKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRK

Query:  NKTLAMSQASYIDKMLSRYKMQNSKKGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNI
        ++ L +SQ  YI+++L R+ M+N+K         + LSK+ CP T +E  +M  +PY+S VGSLMYA +CTRPDI ++VG+VSR+  NPG++HW  VK I
Subjt:  NKTLAMSQASYIDKMLSRYKMQNSKKGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNI

Query:  LKYLRRTKDYMLMYRTKDKMLE
        L+YLR T    L +   D +L+
Subjt:  LKYLRRTKDYMLMYRTKDKMLE

P25600 Putative transposon Ty5-1 protein YCL074W6.3e-2630Show/hide
Query:  MDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVEY
        MDV T F+N  ++E IY+ Q  GF+ +     V +L   +YGLKQA   WN   +  +K  GF ++  E  +Y +  +    ++ +YVDD+L+     + 
Subjt:  MDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVEY

Query:  LIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYA
           +K+ L   + MKDLG     LG+ I     N  + +S   YI K  S  ++   K         +  SK     T   ++D+   PY SIVG L++ 
Subjt:  LIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYA

Query:  KLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKMLESLHHD
            RPDI Y V ++SR+   P   H  + + +L+YL  T+   L YR+  ++  +++ D
Subjt:  KLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKMLESLHHD

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.5e-6727.25Show/hide
Query:  DYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQ-
        +Y  ++GI    S P TP+ NG+SER++R +++   +++S + +  ++W YA   A Y++N +P+  +  E+P++   G   +    R++GC  +  ++ 
Subjt:  DYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQ-

Query:  -NPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE-----------DHIRDHQPRSKLVL-----------------------------
         N  KL+ +S+ C F+GY       L    Q +++++S +  F E              +++ +  S  V                              
Subjt:  -NPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE-----------DHIRDHQPRSKLVL-----------------------------

Query:  --------KEISKSAIDKPSSST----------------KVVDKTRKSGQSHPS-----------------QQLREPRRSGR------------------
                 ++S S +D   SS+                     T+   Q+H S                 Q L  P +S                    
Subjt:  --------KEISKSAIDKPSSST----------------KVVDKTRKSGQSHPS-----------------QQLREPRRSGR------------------

Query:  ---VVHQPDRYLGLIETQVVIP----------DDGI----------------EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLV-DQPNDVKPI
           ++H P     ++      P            GI                 +P T  QA+KD   ++W  AM  E+ +   N  W LV   P+ V  +
Subjt:  ---VVHQPDRYLGLIETQVVIP----------DDGI----------------EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLV-DQPNDVKPI

Query:  GCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCK
        GC+WI+ +K +  G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A    + I Q+DV   F+ G L + +YM Q  GFI++D+   VCK
Subjt:  GCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCK

Query:  LKKSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNK
        L+K++YGLKQA R+W +     + + GF  +V +  ++       I ++++YVDDIL+ GND   L +    L+ +F +KD  +  Y LGI+    R   
Subjt:  LKKSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNK

Query:  TLAMSQASYIDKMLSRYKMQNSKKGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILK
         L +SQ  YI  +L+R  M  +K           LS     K     E      Y  IVGSL Y    TRPDI Y+V  +S++   P  +H   +K IL+
Subjt:  TLAMSQASYIDKMLSRYKMQNSKKGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILK

Query:  YLRRTKDYMLMYRTKDKMLESLH
        YL  T ++ +  +  + +  SLH
Subjt:  YLRRTKDYMLMYRTKDKMLESLH

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.2e-7228.26Show/hide
Query:  RDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQ
        RDYL ++GI    S P TP+ NG+SER++R +++M  +++S + +  ++W YA   A Y++N +P+  +  ++P++   G+  +    +++GC  +  ++
Subjt:  RDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQ

Query:  --NPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE---------------------------------------------DHIRDHQP
          N  KLE +SK C F+GY       L       +++ S +  F E                                               H+ D  P
Subjt:  --NPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEE---------------------------------------------DHIRDHQP

Query:  R-----SKLVLKEIS-----KSAIDKPSSSTKVV---DKTRKSGQSHPSQQ-------LREPRRSGRVVHQPDRYLGLIETQVVIP--------------
        R     S L   ++S      S+I  PSSS       +  + + Q H +Q        L  P  +    + P++   L ++ +  P              
Subjt:  R-----SKLVLKEIS-----KSAIDKPSSSTKVV---DKTRKSGQSHPSQQ-------LREPRRSGRVVHQPDRYLGLIETQVVIP--------------

Query:  --------------------------------------DDGI----------------EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLV-DQP
                                               DGI                 +P T  QAMKD   D+W +AM  E+ +   N  W LV   P
Subjt:  --------------------------------------DDGI----------------EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLV-DQP

Query:  NDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQ
          V  +GC+WI+ +K +  G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A    + I Q+DV   F+ G L + +YM Q  GF+++D+
Subjt:  NDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQ

Query:  EQKVCKLKKSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIV
           VC+L+K+IYGLKQA R+W +   T + + GF  ++ +  ++       I ++++YVDDIL+ GND   L      L+ +F +K+  D  Y LGI+  
Subjt:  EQKVCKLKKSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIV

Query:  RNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTT
          R  + L +SQ  Y   +L+R  M  +K           L+     K P   E      Y  IVGSL Y    TRPD+ Y+V  +S+Y   P  DHW  
Subjt:  RNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTT

Query:  VKNILKYLRRTKDYMLMYRTKDKMLESLH
        +K +L+YL  T D+ +  +  + +  SLH
Subjt:  VKNILKYLRRTKDYMLMYRTKDKMLESLH

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.2e-5733.6Show/hide
Query:  EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL
        ++P TY +A + +    W  AMD E+ +M     W +   P + KPIGCKW+YK K +  G ++ +KARLVAKGYTQ+EG+D+ ETFSPV  L S++++L
Subjt:  EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL

Query:  SIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQE----QKVCKLKKSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFL
        +I+  Y++ + Q+D+   F+NG+L+E IYM    G+  +  +      VC LKKSIYGLKQASR W ++F   +  +GF Q+  +   + K+  ++   +
Subjt:  SIATFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQE----QKVCKLKKSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFL

Query:  VLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLSYIYGIHLSKEQCPKTPQEVED
        ++YVDDI++  N+   + ++K  L   F+++DLG  +Y LG++I R+     + + Q  Y   +L    +   K   +     +  S      +  +  D
Subjt:  VLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLSYIYGIHLSKEQCPKTPQEVED

Query:  MRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKM
         +   Y  ++G LMY ++ TR DI ++V  +S++   P   H   V  IL Y++ T    L Y ++ +M
Subjt:  MRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKM

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.4e-0431.71Show/hide
Query:  NRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK
        NRT+++ VRSM+    +  +F   A  TA +I+N  PS +++   P E+W     +  + R +GC A++   +  KL+ R+K
Subjt:  NRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSK

ATMG00810.1 DNA/RNA polymerases superfamily protein3.1e-1233.33Show/hide
Query:  FLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLSYIYGIHLSKEQCPKTPQEV
        +L+LYVDDILL G+    L  +   L+  F MKDLG   Y LGIQI  +     L +SQ  Y +++L+   M + K   +S    + L+         + 
Subjt:  FLVLYVDDILLIGNDVEYLIDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLSYIYGIHLSKEQCPKTPQEV

Query:  EDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKM
         D R     SIVG+L Y  L TRPDI Y+V +V +    P    +  +K +L+Y++ T  + L      K+
Subjt:  EDMRNIPYASIVGSLMYAKLCTRPDICYSVGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKM

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.8e-1538.83Show/hide
Query:  EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL
        ++P +   A+KD     W +AM  E++++  N  W LV  P +   +GCKW++K K    G +   KARLVAKG+ Q EG+ + ET+SPV    +IR +L
Subjt:  EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL

Query:  SIA
        ++A
Subjt:  SIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTTACGATTCCGAGACTATTTAATAGAAAATGGAATCCAGTCACAACTCTCTGCACCTAGTACACCTCAACAGAACGGTGTATCAGAAAGAAGAAACCGGACCTT
GTTAGACATGGTTCGCTCTATGATGAGTTTTTCTCAGATGTCAGATTCTTTTTGGGGATATGCTTTAGAAACAGCTGCTTATATTTTGAATAATGTTCCCTCTAAAAGTG
TTTCAGAAACACCTTATGAGCTATGGAAAGGGCGTAAAGGAAGTTTACGTCATTTTAGAATTTGGGGTTGTCCAGCACACGTGTTGGTACAAAATCCTAAGAAGTTGGAA
CATCGTTCAAAATTATGCTTTTTCATAGGTTATCCAAAAGAATCAAGAGGTGGTTTGTTTTATGATCCTCAAGAAAATAAAATATTTGTGTCAACAAATGCCACATTCTT
AGAGGAAGACCACATCAGGGATCATCAACCTCGTAGTAAACTAGTATTAAAAGAAATTTCCAAAAGTGCTATAGATAAACCTAGTTCATCCACTAAGGTAGTTGATAAGA
CTAGGAAATCTGGTCAATCACATCCTTCTCAACAGTTGAGAGAGCCTCGACGTAGTGGGAGGGTTGTTCATCAGCCTGATCGCTATTTGGGTTTAATTGAAACTCAAGTC
GTCATACCTGACGATGGCATAGAGGATCCATTAACCTATAAACAGGCAATGAAAGATGTAGATCGTGACCAATGGATCAAAGCCATGGACCTCGAAATGGAGTCTATGTA
CTTTAATTCTGTCTGGACTCTAGTAGATCAACCAAATGACGTAAAACCTATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCATGCCGGTAAAGTACAGACTTTCA
AGGCTCGACTTGTGGCAAAGGGTTATACCCAGAGAGAGGGAGTAGACTATGAGGAAACTTTCTCTCCCGTTGCCATGTTAAAGTCAATTAGAATACTCTTATCCATCGCC
ACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAACTTTTATGAACGGTAATCTTGAAGAGAGTATCTATATGTGTCAAACAAAGGGGTTTATAGAACAAGA
TCAAGAACAAAAGGTTTGTAAGCTTAAAAAATCCATTTATGGATTAAAACAAGCATCTAGATCCTGGAATATGAGATTTGATACTGCGATAAAATCTTATGGCTTTGAAC
AAAATGTTGACGAGCCTTGTGTTTACAAAAAGGTCGTCAATTCCATTATAGCATTTTTAGTCTTATATGTAGATGATATTCTACTTATTGGAAATGACGTAGAATATCTT
ATTGATATCAAGAAATGGCTAGCTATGCAATTTCAAATGAAAGATCTGGGAGATGCACAATACGTTCTCGGAATCCAAATTGTTCGAAACCGTAAGAACAAAACACTAGC
CATGTCTCAAGCATCTTACATAGACAAAATGTTGTCTAGATATAAAATGCAGAATTCCAAAAAGGGTCTGCTGTCGTACATATATGGAATTCATTTGTCAAAAGAACAAT
GTCCTAAGACACCTCAAGAAGTTGAGGATATGAGAAATATTCCCTATGCTTCCATTGTTGGAAGTTTAATGTATGCAAAGTTATGTACCAGACCTGACATTTGCTACTCA
GTAGGGATGGTCAGTAGGTATCAATCCAATCCTGGACGTGATCACTGGACAACCGTTAAAAACATTCTAAAATATCTTCGAAGAACAAAAGACTACATGCTCATGTATCG
TACAAAGGATAAGATGCTAGAAAGTCTACATCATGATCAGTATTTAGTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGACTTACGATTCCGAGACTATTTAATAGAAAATGGAATCCAGTCACAACTCTCTGCACCTAGTACACCTCAACAGAACGGTGTATCAGAAAGAAGAAACCGGACCTT
GTTAGACATGGTTCGCTCTATGATGAGTTTTTCTCAGATGTCAGATTCTTTTTGGGGATATGCTTTAGAAACAGCTGCTTATATTTTGAATAATGTTCCCTCTAAAAGTG
TTTCAGAAACACCTTATGAGCTATGGAAAGGGCGTAAAGGAAGTTTACGTCATTTTAGAATTTGGGGTTGTCCAGCACACGTGTTGGTACAAAATCCTAAGAAGTTGGAA
CATCGTTCAAAATTATGCTTTTTCATAGGTTATCCAAAAGAATCAAGAGGTGGTTTGTTTTATGATCCTCAAGAAAATAAAATATTTGTGTCAACAAATGCCACATTCTT
AGAGGAAGACCACATCAGGGATCATCAACCTCGTAGTAAACTAGTATTAAAAGAAATTTCCAAAAGTGCTATAGATAAACCTAGTTCATCCACTAAGGTAGTTGATAAGA
CTAGGAAATCTGGTCAATCACATCCTTCTCAACAGTTGAGAGAGCCTCGACGTAGTGGGAGGGTTGTTCATCAGCCTGATCGCTATTTGGGTTTAATTGAAACTCAAGTC
GTCATACCTGACGATGGCATAGAGGATCCATTAACCTATAAACAGGCAATGAAAGATGTAGATCGTGACCAATGGATCAAAGCCATGGACCTCGAAATGGAGTCTATGTA
CTTTAATTCTGTCTGGACTCTAGTAGATCAACCAAATGACGTAAAACCTATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCATGCCGGTAAAGTACAGACTTTCA
AGGCTCGACTTGTGGCAAAGGGTTATACCCAGAGAGAGGGAGTAGACTATGAGGAAACTTTCTCTCCCGTTGCCATGTTAAAGTCAATTAGAATACTCTTATCCATCGCC
ACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAACTTTTATGAACGGTAATCTTGAAGAGAGTATCTATATGTGTCAAACAAAGGGGTTTATAGAACAAGA
TCAAGAACAAAAGGTTTGTAAGCTTAAAAAATCCATTTATGGATTAAAACAAGCATCTAGATCCTGGAATATGAGATTTGATACTGCGATAAAATCTTATGGCTTTGAAC
AAAATGTTGACGAGCCTTGTGTTTACAAAAAGGTCGTCAATTCCATTATAGCATTTTTAGTCTTATATGTAGATGATATTCTACTTATTGGAAATGACGTAGAATATCTT
ATTGATATCAAGAAATGGCTAGCTATGCAATTTCAAATGAAAGATCTGGGAGATGCACAATACGTTCTCGGAATCCAAATTGTTCGAAACCGTAAGAACAAAACACTAGC
CATGTCTCAAGCATCTTACATAGACAAAATGTTGTCTAGATATAAAATGCAGAATTCCAAAAAGGGTCTGCTGTCGTACATATATGGAATTCATTTGTCAAAAGAACAAT
GTCCTAAGACACCTCAAGAAGTTGAGGATATGAGAAATATTCCCTATGCTTCCATTGTTGGAAGTTTAATGTATGCAAAGTTATGTACCAGACCTGACATTTGCTACTCA
GTAGGGATGGTCAGTAGGTATCAATCCAATCCTGGACGTGATCACTGGACAACCGTTAAAAACATTCTAAAATATCTTCGAAGAACAAAAGACTACATGCTCATGTATCG
TACAAAGGATAAGATGCTAGAAAGTCTACATCATGATCAGTATTTAGTCTAA
Protein sequenceShow/hide protein sequence
MDLRFRDYLIENGIQSQLSAPSTPQQNGVSERRNRTLLDMVRSMMSFSQMSDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLE
HRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHIRDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDRYLGLIETQV
VIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIA
TFYDYEIWQMDVKTTFMNGNLEESIYMCQTKGFIEQDQEQKVCKLKKSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVYKKVVNSIIAFLVLYVDDILLIGNDVEYL
IDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKMLSRYKMQNSKKGLLSYIYGIHLSKEQCPKTPQEVEDMRNIPYASIVGSLMYAKLCTRPDICYS
VGMVSRYQSNPGRDHWTTVKNILKYLRRTKDYMLMYRTKDKMLESLHHDQYLV