CuGenDBv2

IVF0009663 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0009663
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: Gag/pol protein
Genome location: chr03:11238420..11246668
RNA-Seq Expression: IVF0009663
Synteny: IVF0009663
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR038765 - Papain-like cysteine peptidase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
ADJ18449.1 gag/pol protein, partial [Bryonia dioica] (e value: 0.0, % identity: 81.55)
Query:  LRLGHINLDRIGRLVKNGLLNKLEDDSLPPCESCLECKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEAL
        LRLGHINL+RI RLVK+G+LN+LED+SLPPCESCLE KMTKR FTGKG RAK PLEL+HSDLCGPMNVKARGG+EYFISFIDD+SRYG++YL+ HKSE+ 
Subjt:  LRLGHINLDRIGRLVKNGLLNKLEDDSLPPCESCLECKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEAL

Query:  EKFKEYKNEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVSS
        EKFKEYK EVEN + K IK LRSDRGGEYMD +FQDY+IE GIQ QLSAP TPQQNGV ERRNRT+LDMVRSMMSYAQLP SFWGYA+ETA+HILNNV S
Subjt:  EKFKEYKNEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVSS

Query:  KSVSETPFELWRGRKPSLSHFKILGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLEEDHMRDHKPQNKLVLNEAIDEST
        KSV ETP+ELW+GRK SL +F+I GCPAHVLV NPKKLEPRS+LC FVGYPKE+RGGLF+ PQ+N+V VSTNATFLEEDH R+H+P++K+VL E    +T
Subjt:  KSVSETPFELWRGRKPSLSHFKILGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLEEDHMRDHKPQNKLVLNEAIDEST

Query:  RVVDEVGPSSRV-NETTTSGQSHPSQSLRMPRRSGRIVSQPNRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLP
           D+   S++V ++   S QSH SQ LR+PRRSGR+V QPNRYLGL ETQ++IPDDGVEDPL+Y QAMNDVD+DQW+KAM+LEMESMYFN +W LVDLP
Subjt:  RVVDEVGPSSRV-NETTTSGQSHPSQSLRMPRRSGRIVSQPNRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLP

Query:  EGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFITQGQ
          VKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQ+EGVDYEETFSPVAMLKSIRILLSIATFY+YEIW+MDV TAFLNGNLEESI+M QPEGFI Q Q
Subjt:  EGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFITQGQ

Query:  EQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQII
        EQKVCKL +SIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKI    V FL+LYVDDILLIGNDV YLTDVK WL  QFQMKDLGEAQY+LGIQI+
Subjt:  EQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQII

Query:  RDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTA
        R+RKNKTLA+SQA+YIDK+L RY MQNSKKG LPFRHG+HLSKEQCPKTPQEVEDMR IPY+SAVGSLMY + CTR +ICY+V IVSRYQSN G DHWTA
Subjt:  RDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTA

Query:  VKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGAIVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNM
        VK ILKYLRRTR+YMLVYGAKDLILTGYTDSDFQ++KD+RKSTS SVFTLNGGA+VWRS+KQ CIADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNM
Subjt:  VKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGAIVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNM

Query:  NLPITLYCDNSGAVANSKEPRSHKREKHIERKYHLIQEIV
        +LPITLYCDNSGAVANSKEPRSHKR KHIERKYHLI+EIV
Subjt:  NLPITLYCDNSGAVANSKEPRSHKREKHIERKYHLIQEIV

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 0.0, % identity: 95.12)
Query:  LRLGHINLDRIGRLVKNGLLNKLEDDSLPPCESCLECKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEAL
        LRLGHINLDRIGRLVKNGLLNKL+D SLPPCESCLE KMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEAL
Subjt:  LRLGHINLDRIGRLVKNGLLNKLEDDSLPPCESCLECKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEAL

Query:  EKFKEYKNEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVSS
        EKFKEYK EVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQ QLSAPGTPQQNGV ERRNRT+LDMVRSMMSYAQLPSSFWGYAVETAVHILNNV S
Subjt:  EKFKEYKNEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVSS

Query:  KSVSETPFELWRGRKPSLSHFKILGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLEEDHMRDHKPQNKLVLNEAIDEST
        KSVSETPFELWRGRKPSLSHF+I GCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQ+NRV VSTNATFLEEDHMR+HKP++KLVL+EA DEST
Subjt:  KSVSETPFELWRGRKPSLSHFKILGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLEEDHMRDHKPQNKLVLNEAIDEST

Query:  RVVDEVGPSSRVNETTTSGQSHPSQSLRMPRRSGRIVSQPNRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLPE
        RVVDEVGPSSRV+ETTTSGQSHPSQSLRMPRRSGR+VSQPNRYLGLTETQVVIPDDGVEDPLSY QAMNDVDKDQWVKAMDLEMESMYFN +WELVDLPE
Subjt:  RVVDEVGPSSRVNETTTSGQSHPSQSLRMPRRSGRIVSQPNRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLPE

Query:  GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFITQGQE
        GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIW+MDV TAFLNGNLEESIFMSQPEGFITQGQE
Subjt:  GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFITQGQE

Query:  QKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIR
        QKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGF+QNVDEPCVYKKINKGKV FLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIR
Subjt:  QKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIR

Query:  DRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAV
        DRKNKTLALSQATYIDK+LVRYSMQNSKKGLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYASAVGSLMY + CTR +ICYAV IVSRYQSN GLDHWTAV
Subjt:  DRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAV

Query:  KIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGAIVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMN
        KI+LKYLRRTRDYMLVYGAKDLILTGYTDSDFQT+KDSRKSTS SVFTLNGGA+VWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMN
Subjt:  KIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGAIVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMN

Query:  LPITLYCDNSGAVANSKEPRSHKREKHIERKYHLIQEIVQ
        LPITLYCDNSGAVANSKEPRSHKR KHIERKYHLI+EIVQ
Subjt:  LPITLYCDNSGAVANSKEPRSHKREKHIERKYHLIQEIVQ

KAA0033121.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 0.0, % identity: 100)
Query:  MTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKNEVENLLSKKIKILRSDRGGEYMDLRFQDYM
        MTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKNEVENLLSKKIKILRSDRGGEYMDLRFQDYM
Subjt:  MTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKNEVENLLSKKIKILRSDRGGEYMDLRFQDYM

Query:  IEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVSSKSVSETPFELWRGRKPSLSHFKILGCPAHVLVTNPKKL
        IEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVSSKSVSETPFELWRGRKPSLSHFKILGCPAHVLVTNPKKL
Subjt:  IEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVSSKSVSETPFELWRGRKPSLSHFKILGCPAHVLVTNPKKL

Query:  EPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLEEDHMRDHKPQNKLVLNEAIDESTRVVDEVGPSSRVNETTTSGQSHPSQSLRMPRRSGRIVS
        EPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLEEDHMRDHKPQNKLVLNEAIDESTRVVDEVGPSSRVNETTTSGQSHPSQSLRMPRRSGRIVS
Subjt:  EPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLEEDHMRDHKPQNKLVLNEAIDESTRVVDEVGPSSRVNETTTSGQSHPSQSLRMPRRSGRIVS

Query:  QPNRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG
        QPNRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG
Subjt:  QPNRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG

Query:  VDYEETFSPVAMLKSIRILLSIATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVD
        VDYEETFSPVAMLKSIRILLSIATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVD
Subjt:  VDYEETFSPVAMLKSIRILLSIATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVD

Query:  EPCVYKKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGV
        EPCVYKKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGV
Subjt:  EPCVYKKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGV

Query:  HLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTNKDS
        HLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTNKDS
Subjt:  HLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTNKDS

Query:  RKSTSRSVFTLNGGAIVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKREKHIERKYHLIQEI
        RKSTSRSVFTLNGGAIVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKREKHIERKYHLIQEI
Subjt:  RKSTSRSVFTLNGGAIVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKREKHIERKYHLIQEI

Query:  VQ
        VQ
Subjt:  VQ

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 0.0, % identity: 93.93)
Query:  LRLGHINLDRIGRLVKNGLLNKLEDDSLPPCESCLECKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEAL
        LRLGHINLDRIGRLVK+GLLNKL+D SLPPCESCLE KMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARG FEYFISFIDDYSRYGYLYLMEHKSEAL
Subjt:  LRLGHINLDRIGRLVKNGLLNKLEDDSLPPCESCLECKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEAL

Query:  EKFKEYKNEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVSS
        EKFKEYK EVENLLSKKIKI RSDRGGEYMDL FQDYMIEHGIQ QLSAPGTPQQNGV ERRNRT+LDMVRSMMSYAQLPSSFWGYAVETAVHILNNV S
Subjt:  EKFKEYKNEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVSS

Query:  KSVSETPFELWRGRKPSLSHFKILGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLEEDHMRDHKPQNKLVLNEAIDEST
        KSVSETPFELWRGRKPSLSHF+I GCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDP++NRV VSTNATFLEEDHMR+HKP++KLVL+EA DEST
Subjt:  KSVSETPFELWRGRKPSLSHFKILGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLEEDHMRDHKPQNKLVLNEAIDEST

Query:  RVVDEVGPSSRVNETTTSGQSHPSQSLRMPRRSGRIVSQPNRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLPE
        RVVDEVGPSSRV+ETTTSGQSHPSQSLRMPRRSGR+VSQPNRYLGLTETQVVIPDDGVEDPLSY QAMNDVDKDQWVKAMDLEMESMYFN +WELVDLPE
Subjt:  RVVDEVGPSSRVNETTTSGQSHPSQSLRMPRRSGRIVSQPNRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLPE

Query:  GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFITQGQE
        GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYT++EGVDYEETFS VAMLKSIRILLSIA FYDYEIW+MDV TAFLNGNLEESIFMSQPEGFITQGQE
Subjt:  GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFITQGQE

Query:  QKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIR
        QKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGF+QNVDEPCVYKKINKGKV FLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGE QYVLGIQIIR
Subjt:  QKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIR

Query:  DRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAV
        DRKNKTLALSQATYIDK+LVRYSMQNSKKGLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYASAVGSLMY + CTR +ICYAV IVSRYQSN GLDHWTAV
Subjt:  DRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAV

Query:  KIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGAIVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMN
        KIILKYLRRTRDYMLVYGAKDLILTGYT+SDFQT+KDSRKSTSRSVFTLNGGA+VWRSIKQGCIADSTMEAEYVAACEAAKEAVWL+KFLHDLEVVPNMN
Subjt:  KIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGAIVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMN

Query:  LPITLYCDNSGAVANSKEPRSHKREKHIERKYHLIQEIVQ
        LPITLYCDNSGAVANSKEPRSHKR KHIERKYHLI+EIVQ
Subjt:  LPITLYCDNSGAVANSKEPRSHKREKHIERKYHLIQEIVQ

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 0.0, % identity: 88.32)
Query:  WKTVKCPLQVGVVECGYYVMRYMRDIITNGSIVVTDLIDTRTSYSQLELDEVRMELADFLGGHMDKTECGAGNIITQYGIHSFPPLRLGHINLDRIGRLV
        WKT  CP         Y V +  ++  TN   V + L +T +S+ QLE  E+ +++             G G++I+   +      +LGHINLDRIGRLV
Subjt:  WKTVKCPLQVGVVECGYYVMRYMRDIITNGSIVVTDLIDTRTSYSQLELDEVRMELADFLGGHMDKTECGAGNIITQYGIHSFPPLRLGHINLDRIGRLV

Query:  KNGLLNKLEDDSLPPCESCLECKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKNEVENLLS
        KNGLLNKL+D SLPPCESCLE KMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYK EVENLLS
Subjt:  KNGLLNKLEDDSLPPCESCLECKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKNEVENLLS

Query:  KKIKILRSDRGGEYMDLRFQDYMIEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVSSKSVSETPFELWRGRK
        KKIKILRSDRGGEYMDLRFQDYMIEHGIQ QLSAPGTPQQNGV ERRNRT+LDMVRSMMSYAQLPSSFWGYAVETAVHILNNV SKSVSETPFELWRGRK
Subjt:  KKIKILRSDRGGEYMDLRFQDYMIEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVSSKSVSETPFELWRGRK

Query:  PSLSHFKILGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLEEDHMRDHKPQNKLVLNEAIDESTRVVDEVGPSSRVNET
        PSLSHF+I GCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQ+NRV VSTNATFLEEDHMR+HKP++KLVL+EA DESTRVVDEVGPSSRV+ET
Subjt:  PSLSHFKILGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLEEDHMRDHKPQNKLVLNEAIDESTRVVDEVGPSSRVNET

Query:  TTSGQSHPSQSLRMPRRSGRIVSQPNRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLPEGVKPIGCKWIYKRKR
        TTSGQSHPSQSLRMPRRSGR+VSQPNRYLGLTETQVVIPDDGVEDPLSY QAMNDVDKDQWVKAMDLEMESMYFN +WELVDLPEGVKPIGCKWIYKRKR
Subjt:  TTSGQSHPSQSLRMPRRSGRIVSQPNRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLPEGVKPIGCKWIYKRKR

Query:  DSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQ
        DSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIW+MDV TAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQ
Subjt:  DSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQ

Query:  ASRSWNIRFDTAIKSYGFEQNVDEPCVYKKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYI
        ASRSWNIRFDTAIKSYGF+QNVDEPCVYKKINKGKV FLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYI
Subjt:  ASRSWNIRFDTAIKSYGFEQNVDEPCVYKKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYI

Query:  DKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAVKIILKYLRRTRDYML
        DK+LVRYSMQNSKKGLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYASAVGSLMY + CTR +ICYAV IVSRYQSN GLDHWTAVKI+LKYLRRTRDYML
Subjt:  DKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAVKIILKYLRRTRDYML

Query:  VYGAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGAIVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVAN
        VYGAKDLILTGYTDSDFQT+KDSRKSTS SVFTLNGGA+VWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVAN
Subjt:  VYGAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGAIVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVAN

Query:  SKEPRSHKREKHIERKYHLIQEIVQ
        SKEPRSHKR KHIERKYHLI+EIVQ
Subjt:  SKEPRSHKREKHIERKYHLIQEIVQ

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T2V9 Gag/pol protein (e value: 0.0e+00, % identity: 93.93)
Query:  LRLGHINLDRIGRLVKNGLLNKLEDDSLPPCESCLECKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEAL
        LRLGHINLDRIGRLVK+GLLNKL+D SLPPCESCLE KMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARG FEYFISFIDDYSRYGYLYLMEHKSEAL
Subjt:  LRLGHINLDRIGRLVKNGLLNKLEDDSLPPCESCLECKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEAL

Query:  EKFKEYKNEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVSS
        EKFKEYK EVENLLSKKIKI RSDRGGEYMDL FQDYMIEHGIQ QLSAPGTPQQNGV ERRNRT+LDMVRSMMSYAQLPSSFWGYAVETAVHILNNV S
Subjt:  EKFKEYKNEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVSS

Query:  KSVSETPFELWRGRKPSLSHFKILGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLEEDHMRDHKPQNKLVLNEAIDEST
        KSVSETPFELWRGRKPSLSHF+I GCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDP++NRV VSTNATFLEEDHMR+HKP++KLVL+EA DEST
Subjt:  KSVSETPFELWRGRKPSLSHFKILGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLEEDHMRDHKPQNKLVLNEAIDEST

Query:  RVVDEVGPSSRVNETTTSGQSHPSQSLRMPRRSGRIVSQPNRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLPE
        RVVDEVGPSSRV+ETTTSGQSHPSQSLRMPRRSGR+VSQPNRYLGLTETQVVIPDDGVEDPLSY QAMNDVDKDQWVKAMDLEMESMYFN +WELVDLPE
Subjt:  RVVDEVGPSSRVNETTTSGQSHPSQSLRMPRRSGRIVSQPNRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLPE

Query:  GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFITQGQE
        GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYT++EGVDYEETFS VAMLKSIRILLSIA FYDYEIW+MDV TAFLNGNLEESIFMSQPEGFITQGQE
Subjt:  GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFITQGQE

Query:  QKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIR
        QKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGF+QNVDEPCVYKKINKGKV FLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGE QYVLGIQIIR
Subjt:  QKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIR

Query:  DRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAV
        DRKNKTLALSQATYIDK+LVRYSMQNSKKGLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYASAVGSLMY + CTR +ICYAV IVSRYQSN GLDHWTAV
Subjt:  DRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAV

Query:  KIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGAIVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMN
        KIILKYLRRTRDYMLVYGAKDLILTGYT+SDFQT+KDSRKSTSRSVFTLNGGA+VWRSIKQGCIADSTMEAEYVAACEAAKEAVWL+KFLHDLEVVPNMN
Subjt:  KIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGAIVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMN

Query:  LPITLYCDNSGAVANSKEPRSHKREKHIERKYHLIQEIVQ
        LPITLYCDNSGAVANSKEPRSHKR KHIERKYHLI+EIVQ
Subjt:  LPITLYCDNSGAVANSKEPRSHKREKHIERKYHLIQEIVQ

A0A5A7TZD0 Gag/pol protein (e value: 0.0e+00, % identity: 95.12)
Query:  LRLGHINLDRIGRLVKNGLLNKLEDDSLPPCESCLECKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEAL
        LRLGHINLDRIGRLVKNGLLNKL+D SLPPCESCLE KMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEAL
Subjt:  LRLGHINLDRIGRLVKNGLLNKLEDDSLPPCESCLECKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEAL

Query:  EKFKEYKNEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVSS
        EKFKEYK EVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQ QLSAPGTPQQNGV ERRNRT+LDMVRSMMSYAQLPSSFWGYAVETAVHILNNV S
Subjt:  EKFKEYKNEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVSS

Query:  KSVSETPFELWRGRKPSLSHFKILGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLEEDHMRDHKPQNKLVLNEAIDEST
        KSVSETPFELWRGRKPSLSHF+I GCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQ+NRV VSTNATFLEEDHMR+HKP++KLVL+EA DEST
Subjt:  KSVSETPFELWRGRKPSLSHFKILGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLEEDHMRDHKPQNKLVLNEAIDEST

Query:  RVVDEVGPSSRVNETTTSGQSHPSQSLRMPRRSGRIVSQPNRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLPE
        RVVDEVGPSSRV+ETTTSGQSHPSQSLRMPRRSGR+VSQPNRYLGLTETQVVIPDDGVEDPLSY QAMNDVDKDQWVKAMDLEMESMYFN +WELVDLPE
Subjt:  RVVDEVGPSSRVNETTTSGQSHPSQSLRMPRRSGRIVSQPNRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLPE

Query:  GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFITQGQE
        GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIW+MDV TAFLNGNLEESIFMSQPEGFITQGQE
Subjt:  GVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFITQGQE

Query:  QKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIR
        QKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGF+QNVDEPCVYKKINKGKV FLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIR
Subjt:  QKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIR

Query:  DRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAV
        DRKNKTLALSQATYIDK+LVRYSMQNSKKGLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYASAVGSLMY + CTR +ICYAV IVSRYQSN GLDHWTAV
Subjt:  DRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAV

Query:  KIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGAIVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMN
        KI+LKYLRRTRDYMLVYGAKDLILTGYTDSDFQT+KDSRKSTS SVFTLNGGA+VWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMN
Subjt:  KIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGAIVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMN

Query:  LPITLYCDNSGAVANSKEPRSHKREKHIERKYHLIQEIVQ
        LPITLYCDNSGAVANSKEPRSHKR KHIERKYHLI+EIVQ
Subjt:  LPITLYCDNSGAVANSKEPRSHKREKHIERKYHLIQEIVQ

A0A5A7UYE8 Gag/pol protein (e value: 0.0e+00, % identity: 88.11)
Query:  WKTVKCPLQVGVVECGYYVMRYMRDIITNGSIVVTDLIDTRTSYSQLELDEVRMELADFLGGHMDKTECGAGNIITQYGIHSFPPLRLGHINLDRIGRLV
        WKT  CP         Y V +  ++  TN    V   +   +S+ QLE  E+ +++             G G++I+   +      +LGHINLDRIGRLV
Subjt:  WKTVKCPLQVGVVECGYYVMRYMRDIITNGSIVVTDLIDTRTSYSQLELDEVRMELADFLGGHMDKTECGAGNIITQYGIHSFPPLRLGHINLDRIGRLV

Query:  KNGLLNKLEDDSLPPCESCLECKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKNEVENLLS
        KNGLLNKL+D SLPPCESCLE KMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYK EVENLLS
Subjt:  KNGLLNKLEDDSLPPCESCLECKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKNEVENLLS

Query:  KKIKILRSDRGGEYMDLRFQDYMIEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVSSKSVSETPFELWRGRK
        KKIKILRSDRGGEYMDLRFQDYMIEHGIQ QLSAPGTPQQNGV ERRNRT+LDMVRSMMSYAQLPSSFWGYAVETAVHILNNV SKSVSETPFELWRGRK
Subjt:  KKIKILRSDRGGEYMDLRFQDYMIEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVSSKSVSETPFELWRGRK

Query:  PSLSHFKILGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLEEDHMRDHKPQNKLVLNEAIDESTRVVDEVGPSSRVNET
        PSLSHF+I GCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQ+NRV VSTNATFLEEDHMR+HKP++KLVL+EA DESTRVVDEVGPSSRV+ET
Subjt:  PSLSHFKILGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLEEDHMRDHKPQNKLVLNEAIDESTRVVDEVGPSSRVNET

Query:  TTSGQSHPSQSLRMPRRSGRIVSQPNRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLPEGVKPIGCKWIYKRKR
        TTSGQSHPSQSLRMPRRSGR+VSQPNRYLGLTETQVVIPDDGVEDPLSY QAMNDVDKDQWVKAMDLEMESMYFN +WELVDLPEGVKPIGCKWIYKRKR
Subjt:  TTSGQSHPSQSLRMPRRSGRIVSQPNRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLPEGVKPIGCKWIYKRKR

Query:  DSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQ
        DSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIW+MDV TAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQ
Subjt:  DSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQ

Query:  ASRSWNIRFDTAIKSYGFEQNVDEPCVYKKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYI
        ASRSWNIRFDTAIKSYGF+QNVDEPCVYKKINKGKV FLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYI
Subjt:  ASRSWNIRFDTAIKSYGFEQNVDEPCVYKKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYI

Query:  DKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAVKIILKYLRRTRDYML
        DK+LVRYSMQNSKKGLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYASAVGSLMY + CTR +ICYAV IVSRYQSN GLDHWTAVKI+LKYLRRTRDYML
Subjt:  DKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAVKIILKYLRRTRDYML

Query:  VYGAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGAIVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVAN
        VYGAKDLILTGYTDSDFQT+KDSRKSTS SVFTLNGGA+VWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVAN
Subjt:  VYGAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGAIVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVAN

Query:  SKEPRSHKREKHIERKYHLIQEIVQ
        SKEPRSHKR KHIERKYHLI+EIVQ
Subjt:  SKEPRSHKREKHIERKYHLIQEIVQ

A0A5D3CZY3 Gag/pol protein (e value: 0.0e+00, % identity: 100)
Query:  MTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKNEVENLLSKKIKILRSDRGGEYMDLRFQDYM
        MTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKNEVENLLSKKIKILRSDRGGEYMDLRFQDYM
Subjt:  MTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKNEVENLLSKKIKILRSDRGGEYMDLRFQDYM

Query:  IEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVSSKSVSETPFELWRGRKPSLSHFKILGCPAHVLVTNPKKL
        IEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVSSKSVSETPFELWRGRKPSLSHFKILGCPAHVLVTNPKKL
Subjt:  IEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVSSKSVSETPFELWRGRKPSLSHFKILGCPAHVLVTNPKKL

Query:  EPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLEEDHMRDHKPQNKLVLNEAIDESTRVVDEVGPSSRVNETTTSGQSHPSQSLRMPRRSGRIVS
        EPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLEEDHMRDHKPQNKLVLNEAIDESTRVVDEVGPSSRVNETTTSGQSHPSQSLRMPRRSGRIVS
Subjt:  EPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLEEDHMRDHKPQNKLVLNEAIDESTRVVDEVGPSSRVNETTTSGQSHPSQSLRMPRRSGRIVS

Query:  QPNRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG
        QPNRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG
Subjt:  QPNRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG

Query:  VDYEETFSPVAMLKSIRILLSIATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVD
        VDYEETFSPVAMLKSIRILLSIATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVD
Subjt:  VDYEETFSPVAMLKSIRILLSIATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVD

Query:  EPCVYKKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGV
        EPCVYKKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGV
Subjt:  EPCVYKKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGV

Query:  HLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTNKDS
        HLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTNKDS
Subjt:  HLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTNKDS

Query:  RKSTSRSVFTLNGGAIVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKREKHIERKYHLIQEI
        RKSTSRSVFTLNGGAIVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKREKHIERKYHLIQEI
Subjt:  RKSTSRSVFTLNGGAIVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKREKHIERKYHLIQEI

Query:  VQ
        VQ
Subjt:  VQ

E2GK51 Gag/pol protein (Fragment) (e value: 0.0e+00, % identity: 81.55)
Query:  LRLGHINLDRIGRLVKNGLLNKLEDDSLPPCESCLECKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEAL
        LRLGHINL+RI RLVK+G+LN+LED+SLPPCESCLE KMTKR FTGKG RAK PLEL+HSDLCGPMNVKARGG+EYFISFIDD+SRYG++YL+ HKSE+ 
Subjt:  LRLGHINLDRIGRLVKNGLLNKLEDDSLPPCESCLECKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEAL

Query:  EKFKEYKNEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVSS
        EKFKEYK EVEN + K IK LRSDRGGEYMD +FQDY+IE GIQ QLSAP TPQQNGV ERRNRT+LDMVRSMMSYAQLP SFWGYA+ETA+HILNNV S
Subjt:  EKFKEYKNEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVSS

Query:  KSVSETPFELWRGRKPSLSHFKILGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLEEDHMRDHKPQNKLVLNEAIDEST
        KSV ETP+ELW+GRK SL +F+I GCPAHVLV NPKKLEPRS+LC FVGYPKE+RGGLF+ PQ+N+V VSTNATFLEEDH R+H+P++K+VL E    +T
Subjt:  KSVSETPFELWRGRKPSLSHFKILGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLEEDHMRDHKPQNKLVLNEAIDEST

Query:  RVVDEVGPSSR-VNETTTSGQSHPSQSLRMPRRSGRIVSQPNRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLP
           D+   S++ V++   S QSH SQ LR+PRRSGR+V QPNRYLGL ETQ++IPDDGVEDPL+Y QAMNDVD+DQW+KAM+LEMESMYFN +W LVDLP
Subjt:  RVVDEVGPSSR-VNETTTSGQSHPSQSLRMPRRSGRIVSQPNRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLP

Query:  EGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFITQGQ
          VKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQ+EGVDYEETFSPVAMLKSIRILLSIATFY+YEIW+MDV TAFLNGNLEESI+M QPEGFI Q Q
Subjt:  EGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFITQGQ

Query:  EQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQII
        EQKVCKL +SIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKI    V FL+LYVDDILLIGNDV YLTDVK WL  QFQMKDLGEAQY+LGIQI+
Subjt:  EQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQII

Query:  RDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTA
        R+RKNKTLA+SQA+YIDK+L RY MQNSKKG LPFRHG+HLSKEQCPKTPQEVEDMR IPY+SAVGSLMY + CTR +ICY+V IVSRYQSN G DHWTA
Subjt:  RDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTA

Query:  VKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGAIVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNM
        VK ILKYLRRTR+YMLVYGAKDLILTGYTDSDFQ++KD+RKSTS SVFTLNGGA+VWRS+KQ CIADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNM
Subjt:  VKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGAIVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNM

Query:  NLPITLYCDNSGAVANSKEPRSHKREKHIERKYHLIQEIV
        +LPITLYCDNSGAVANSKEPRSHKR KHIERKYHLI+EIV
Subjt:  NLPITLYCDNSGAVANSKEPRSHKREKHIERKYHLIQEIV

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 2.1e-122, % identity: 30.26)
Query:  RLGHIN------LDRIGRLVKNGLLNKLEDDSLPPCESCLECKMTKRPFTGKGYRA--KEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLM
        R GHI+      + R        LLN LE  S   CE CL  K  + PF     +   K PL ++HSD+CGP+         YF+ F+D ++ Y   YL+
Subjt:  RLGHIN------LDRIGRLVKNGLLNKLEDDSLPPCESCLECKMTKRPFTGKGYRA--KEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLM

Query:  EHKSEALEKFKEYKNEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVH
        ++KS+    F+++  + E   + K+  L  D G EY+    + + ++ GI   L+ P TPQ NGV ER  RT+ +  R+M+S A+L  SFWG AV TA +
Subjt:  EHKSEALEKFKEYKNEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVH

Query:  ILNNVSSKSV---SETPFELWRGRKPSLSHFKILGCPAHVLVTNPK-KLEPRSRLCQFVGYPKETRGGLFFD----------------------------
        ++N + S+++   S+TP+E+W  +KP L H ++ G   +V + N + K + +S    FVGY  E  G   +D                            
Subjt:  ILNNVSSKSV---SETPFELWRGRKPSLSHFKILGCPAHVLVTNPK-KLEPRSRLCQFVGYPKETRGGLFFD----------------------------

Query:  ---------------PQKNRVLVST----------NATFLEE-------------------------------DHMRDHKPQNKLVLNEAIDESTRVVDE
                       P  +R ++ T          N  FL++                                 ++D K  NK  LNE+  +  +  D 
Subjt:  ---------------PQKNRVLVST----------NATFLEE-------------------------------DHMRDHKPQNKLVLNEAIDESTRVVDE

Query:  VGPSSRVNETTTSGQSHPSQSLR---------------MPRRSGRIVSQPNRYLGLTE---TQVVIPDDGV--EDPLSYNQAMNDVDKDQWVKAMDLEME
        +  S        S +S  ++ L+               + RRS R+ ++P       +    +VV+    +  + P S+++     DK  W +A++ E+ 
Subjt:  VGPSSRVNETTTSGQSHPSQSLR---------------MPRRSGRIVSQPNRYLGLTE---TQVVIPDDGV--EDPLSYNQAMNDVDKDQWVKAMDLEME

Query:  SMYFNLMWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWKMDVNTAFLNGNLEES
        +   N  W +   PE    +  +W++  K +  G    +KARLVA+G+TQ+  +DYEETF+PVA + S R +LS+   Y+ ++ +MDV TAFLNG L+E 
Subjt:  SMYFNLMWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWKMDVNTAFLNGNLEES

Query:  IFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVY--KKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQ
        I+M  P+G         VCKLN++IYGLKQA+R W   F+ A+K   F  +  + C+Y   K N  + ++++LYVDD+++   D+  + + K +L  +F+
Subjt:  IFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVY--KKINKGKVVFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQ

Query:  MKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHL----SKEQCPKTPQEVEDMRRIPYASAVGSLMYVIFCTRLEIC
        M DL E ++ +GI+I  + +   + LSQ+ Y+ K+L +++M+N      P    ++     S E C             P  S +G LMY++ CTR ++ 
Subjt:  MKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHL----SKEQCPKTPQEVEDMRRIPYASAVGSLMYVIFCTRLEIC

Query:  YAVRIVSRYQSNLGLDHWTAVKIILKYLRRTRDYMLVYG---AKDLILTGYTDSDFQTNKDSRKSTSRSVFTL-NGGAIVWRSIKQGCIADSTMEAEYVA
         AV I+SRY S    + W  +K +L+YL+ T D  L++    A +  + GY DSD+  ++  RKST+  +F + +   I W + +Q  +A S+ EAEY+A
Subjt:  YAVRIVSRYQSNLGLDHWTAVKIILKYLRRTRDYMLVYG---AKDLILTGYTDSDFQTNKDSRKSTSRSVFTL-NGGAIVWRSIKQGCIADSTMEAEYVA

Query:  ACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKREKHIERKYHLIQEIVQ
          EA +EA+WL+  L  + +   +  PI +Y DN G ++ +  P  HKR KHI+ KYH  +E VQ
Subjt:  ACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKREKHIERKYHLIQEIVQ

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 2.5e-187 | % identity: 40.95
Query:  RLGHINLDRIGRLVKNGLLNKLEDDSLPPCESCLECKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALE
        R+GH++   +  L K  L++  +  ++ PC+ CL  K  +  F     R    L+L++SD+CGPM +++ GG +YF++FIDD SR  ++Y+++ K +  +
Subjt:  RLGHINLDRIGRLVKNGLLNKLEDDSLPPCESCLECKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALE

Query:  KFKEYKNEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVSSK
         F+++   VE    +K+K LRSD GGEY    F++Y   HGI+ + + PGTPQ NGV ER NRT+++ VRSM+  A+LP SFWG AV+TA +++N   S 
Subjt:  KFKEYKNEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVSSK

Query:  SVS-ETPFELWRGRKPSLSHFKILGCP--AHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLEEDHM----RDHKPQNKLVLNE
         ++ E P  +W  ++ S SH K+ GC   AHV      KL+ +S  C F+GY  E  G   +DP K +V+ S +  F E +         K +N ++ N 
Subjt:  SVS-ETPFELWRGRKPSLSHFKILGCP--AHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLEEDHM----RDHKPQNKLVLNE

Query:  AIDEST--------RVVDEVG-PSSRVNETTTSGQ---------SHPSQSLRMP---RRSGRIVSQPNRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDK
            ST           DEV     +  E    G+          HP+Q        RRS R   +  RY   +   V+I DD   +P S  + ++  +K
Subjt:  AIDEST--------RVVDEVG-PSSRVNETTTSGQ---------SHPSQSLRMP---RRSGRIVSQPNRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDK

Query:  DQWVKAMDLEMESMYFNLMWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWKMDV
        +Q +KAM  EMES+  N  ++LV+LP+G +P+ CKW++K K+D   K+  +KARLV KG+ Q++G+D++E FSPV  + SIR +LS+A   D E+ ++DV
Subjt:  DQWVKAMDLEMESMYFNLMWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWKMDV

Query:  NTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVY-KKINKGKVVFLVLYVDDILLIGNDVGYLT
         TAFL+G+LEE I+M QPEGF   G++  VCKLN+S+YGLKQA R W ++FD+ +KS  + +   +PCVY K+ ++   + L+LYVDD+L++G D G + 
Subjt:  NTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVY-KKINKGKVVFLVLYVDDILLIGNDVGYLT

Query:  DVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVIF
         +K  L+  F MKDLG AQ +LG++I+R+R ++ L LSQ  YI+++L R++M+N+K    P    + LSK+ CP T +E  +M ++PY+SAVGSLMY + 
Subjt:  DVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVIF

Query:  CTRLEICYAVRIVSRYQSNLGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGAIVWRSIKQGCIADSTMEAE
        CTR +I +AV +VSR+  N G +HW AVK IL+YLR T    L +G  D IL GYTD+D   + D+RKS++  +FT +GGAI W+S  Q C+A ST EAE
Subjt:  CTRLEICYAVRIVSRYQSNLGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGAIVWRSIKQGCIADSTMEAE

Query:  YVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKREKHIERKYHLIQEIV
        Y+AA E  KE +WL++FL +L +         +YCD+  A+  SK    H R KHI+ +YH I+E+V
Subjt:  YVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKREKHIERKYHLIQEIV

P25600 Putative transposon Ty5-1 protein YCL074W | e value: 5.9e-32 | % identity: 31.85
Query:  MDVNTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKINKGKVVFLVLYVDDILLIGNDVGY
        MDV+TAFLN  ++E I++ QP GF+ +     V +L   +YGLKQA   WN   +  +K  GF ++  E  +Y +      +++ +YVDD+L+       
Subjt:  MDVNTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKINKGKVVFLVLYVDDILLIGNDVGY

Query:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYV
           VK  L   + MKDLG+    LG+  I    N  + LS   YI K      +   K    P  +    SK     T   ++D+   PY S VG L++ 
Subjt:  LTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYV

Query:  IFCTRLEICYAVRIVSRYQSNLGLDHWTAVKIILKYLRRTRDYMLVY-GAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGAIVWRSIK-QGCIADST
            R +I Y V ++SR+       H  + + +L+YL  TR   L Y     L LT Y D+      D   ST   V  L G  + W S K +G I   +
Subjt:  IFCTRLEICYAVRIVSRYQSNLGLDHWTAVKIILKYLRRTRDYMLVY-GAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGAIVWRSIK-QGCIADST

Query:  MEAEYVAACEAAKE
         EAEY+ A E   E
Subjt:  MEAEYVAACEAAKE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 8.7e-108 | % identity: 29.36
Query:  HSFPPLRLGHINLDRIGRLVKNGLLNKLE-DDSLPPCESCLECKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLME
        HS    RLGH     +  ++ N  L+ L        C  CL  K  K PF+     +  PLE I+SD+     + +   + Y++ F+D ++RY +LY ++
Subjt:  HSFPPLRLGHINLDRIGRLVKNGLLNKLE-DDSLPPCESCLECKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLME

Query:  HKSEALEKFKEYKNEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHI
         KS+  E F  +KN +EN    +I    SD GGE++ L   +Y  +HGI    S P TP+ NG+ ER++R +++   +++S+A +P ++W YA   AV++
Subjt:  HKSEALEKFKEYKNEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHI

Query:  LNNVSSKSVS-ETPFELWRGRKPSLSHFKILGCPAHVLVT--NPKKLEPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLE--------------
        +N + +  +  E+PF+   G  P+    ++ GC  +  +   N  KL+ +SR C F+GY       L    Q +R+ +S +  F E              
Subjt:  LNNVSSKSVS-ETPFELWRGRKPSLSHFKILGCPAHVLVT--NPKKLEPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLE--------------

Query:  -------------------------------EDHMRDHKP-------QNKLVLNEAIDES----------TRVVDEVGPSSRVNETTTSGQSHPS-----
                                       + H     P       +N  V +  +D S               + GP      T T  Q+H S     
Subjt:  -------------------------------EDHMRDHKP-------QNKLVLNEAIDES----------TRVVDEVGPSSRVNETTTSGQSHPS-----

Query:  ------------QSLRMPRRSGRIVSQPNRYLGLTETQVVIPDDGVEDPLSYNQAMND------------------------------------------
                    QSL  P +S      P      + T    P   +  P    Q +N+                                          
Subjt:  ------------QSLRMPRRSGRIVSQPNRYLGLTETQVVIPDDGVEDPLSYNQAMND------------------------------------------

Query:  --VDKDQWVKAMDLEMESMYFNLMWELVDLPEG-VKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYE
          +  ++W  AM  E+ +   N  W+LV  P   V  +GC+WI+ +K +S G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A    + 
Subjt:  --VDKDQWVKAMDLEMESMYFNLMWELVDLPEG-VKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYE

Query:  IWKMDVNTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKINKGKVVFLVLYVDDILLIGND
        I ++DVN AFL G L + ++MSQP GFI + +   VCKL +++YGLKQA R+W +     + + GF  +V +  ++       +V++++YVDDIL+ GND
Subjt:  IWKMDVNTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKINKGKVVFLVLYVDDILLIGND

Query:  VGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSL
           L +    L+ +F +KD  E  Y LGI+    R    L LSQ  YI  +L R +M  +K    P      LS     K     E      Y   VGSL
Subjt:  VGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSL

Query:  MYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAVKIILKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGAIVWRSIKQGCIAD
         Y+ F TR +I YAV  +S++      +H  A+K IL+YL  T ++ + +     L L  Y+D+D+  +KD   ST+  +  L    I W S KQ  +  
Subjt:  MYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAVKIILKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGAIVWRSIKQGCIAD

Query:  STMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKREKHIERKYHLIQEIVQ
        S+ EAEY +    + E  W+   L +L +   +  P  +YCDN GA      P  H R KHI   YH I+  VQ
Subjt:  STMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKREKHIERKYHLIQEIVQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 6.4e-111 | % identity: 29.77
Query:  HSFPPLRLGHINLDRIGRLVKNGLLNKLE-DDSLPPCESCLECKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLME
        HS    RLGH +L  +  ++ N  L  L     L  C  C   K  K PF+     + +PLE I+SD+     + +   + Y++ F+D ++RY +LY ++
Subjt:  HSFPPLRLGHINLDRIGRLVKNGLLNKLE-DDSLPPCESCLECKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLME

Query:  HKSEALEKFKEYKNEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHI
         KS+  + F  +K+ VEN    +I  L SD GGE++ LR  DY+ +HGI    S P TP+ NG+ ER++R +++M  +++S+A +P ++W YA   AV++
Subjt:  HKSEALEKFKEYKNEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHI

Query:  LNNVSSKSVS-ETPFELWRGRKPSLSHFKILGCPAHVLVT--NPKKLEPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLE--------------
        +N + +  +  ++PF+   G+ P+    K+ GC  +  +   N  KLE +S+ C F+GY       L       R+  S +  F E              
Subjt:  LNNVSSKSVS-ETPFELWRGRKPSLSHFKILGCPAHVLVT--NPKKLEPRSRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLE--------------

Query:  -EDHMRDHKPQ---------NKLV------LNEAIDES------------TRVVDEVGPSSRVNETTTSGQSHPSQS--------------------LRM
         ++   D  P            LV      L   +D S            T+V     PSS ++  ++S  + PS +                    L  
Subjt:  -EDHMRDHKPQ---------NKLV------LNEAIDES------------TRVVDEVGPSSRVNETTTSGQSHPSQS--------------------LRM

Query:  PRRSGRIVSQPNRYLGLTETQVVIP----------------------------------------------------DDGVEDP---------LSYN---
        P  +    + PN+   L ++ +  P                                                     DG+  P         L+ N   
Subjt:  PRRSGRIVSQPNRYLGLTETQVVIP----------------------------------------------------DDGVEDP---------LSYN---

Query:  ----QAMNDVDKDQWVKAMDLEMESMYFNLMWELV-DLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSI
            QAM D   D+W +AM  E+ +   N  W+LV   P  V  +GC+WI+ +K +S G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +
Subjt:  ----QAMNDVDKDQWVKAMDLEMESMYFNLMWELV-DLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSI

Query:  ATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKINKGKVVFLVLYVDD
        A    + I ++DVN AFL G L + ++MSQP GF+ + +   VC+L ++IYGLKQA R+W +   T + + GF  ++ +  ++       ++++++YVDD
Subjt:  ATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKINKGKVVFLVLYVDD

Query:  ILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPY
        IL+ GND   L      L+ +F +K+  +  Y LGI+    R  + L LSQ  Y   +L R +M  +K    P      L+     K P   E      Y
Subjt:  ILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPY

Query:  ASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAVKIILKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGAIVWRSI
           VGSL Y+ F TR ++ YAV  +S+Y      DHW A+K +L+YL  T D+ + +     L L  Y+D+D+  + D   ST+  +  L    I W S 
Subjt:  ASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAVKIILKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGAIVWRSI

Query:  KQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKREKHIERKYHLIQEIVQ
        KQ  +  S+ EAEY +    + E  W+   L +L +   ++ P  +YCDN GA      P  H R KHI   YH I+  VQ
Subjt:  KQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKREKHIERKYHLIQEIVQ

Arabidopsis top hits | e value | % identity | Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | e value: 4.6e-80 | % identity: 35.74
Query:  EDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL
        ++P +YN+A   +    W  AMD E+ +M     WE+  LP   KPIGCKW+YK K +S G ++ +KARLVAKGYTQ+EG+D+ ETFSPV  L S++++L
Subjt:  EDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILL

Query:  SIATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFIT-QGQE---QKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKINKGKVVFL
        +I+  Y++ + ++D++ AFLNG+L+E I+M  P G+   QG       VC L +SIYGLKQASR W ++F   +  +GF Q+  +   + KI     + +
Subjt:  SIATFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFIT-QGQE---QKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKINKGKVVFL

Query:  VLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVED
        ++YVDDI++  N+   + ++K+ L + F+++DLG  +Y LG++I R      + + Q  Y   +L    +   K   +P    V  S      +  +  D
Subjt:  VLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVED

Query:  MRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAVKIILKYLRRTRDYMLVYGAK-DLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGA
         +   Y   +G LMY +  TRL+I +AV  +S++     L H  AV  IL Y++ T    L Y ++ ++ L  ++D+ FQ+ KD+R+ST+     L    
Subjt:  MRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAVKIILKYLRRTRDYMLVYGAK-DLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGA

Query:  IVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKREKHIERKYHLIQE
        I W+S KQ  ++ S+ EAEY A   A  E +WL +F  +L++   ++ P  L+CDN+ A+  +     H+R KHIE   H ++E
Subjt:  IVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKREKHIERKYHLIQE

ATMG00300.1 Gag-Pol-related retrotransposon family protein | e value: 8.3e-05 | % identity: 35.82
Query:  RLGHINLDRIGRLVKNGLLNKLEDDSLPPCESCLECKMTKRPFTGKGYRAKEPLELIHSDLCGPMNV
        RL H++   +  LVK G L+  +  SL  CE C+  K  +  F+   +  K PL+ +HSDL G  +V
Subjt:  RLGHINLDRIGRLVKNGLLNKLEDDSLPPCESCLECKMTKRPFTGKGYRAKEPLELIHSDLCGPMNV
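
The % identity reported for a hit can be reproduced from the alignment block itself. The following is a minimal sketch, assuming the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch) and an ungapped, single-block alignment such as the ATMG00300.1 hit above. The variable names are illustrative only.

```python
# Query and midline copied from the ATMG00300.1 alignment block.
query = "RLGHINLDRIGRLVKNGLLNKLEDDSLPPCESCLECKMTKRPFTGKGYRAKEPLELIHSDLCGPMNV"
midline = "RL H++   +  LVK G L+  +  SL  CE C+  K  +  F+   +  K PL+ +HSDL G  +V"

# Identical positions are the ones where the midline repeats the residue letter.
identical = sum(1 for ch in midline if ch.isalpha())

# With no gaps, the alignment length equals the query segment length.
alignment_length = len(query)

percent_identity = 100.0 * identical / alignment_length
print(f"{identical}/{alignment_length} = {percent_identity:.2f}%")
```

For alignments that span several blocks or contain gaps (`-`), the counts would have to be summed over all blocks and the gap columns included in the alignment length.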

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | e value: 6.1e-08 | % identity: 36.59
Query:  NRTMLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVSSKSVS-ETPFELWRGRKPSLSHFKILGCPAHVLVTNPKKLEPRSR
        NRT+++ VRSM+    LP +F   A  TAVHI+N   S +++   P E+W    P+ S+ +  GC A++   +  KL+PR++
Subjt:  NRTMLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVSSKSVS-ETPFELWRGRKPSLSHFKILGCPAHVLVTNPKKLEPRSR

ATMG00810.1 DNA/RNA polymerases superfamily protein | e value: 3.2e-17 | % identity: 31.65
Query:  VFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSK--KGLLPFRHGVHLSKEQCPKTP
        ++L+LYVDDILL G+    L  +   L++ F MKDLG   Y LGIQI        L LSQ  Y +++L    M + K     LP +    +S  + P   
Subjt:  VFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSK--KGLLPFRHGVHLSKEQCPKTP

Query:  QEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAVKIILKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTNKDSRKSTSRSVFT
         +  D R     S VG+L Y+   TR +I YAV IV +      L  +  +K +L+Y++ T  + + ++    L +  + DSD+     +R+ST+     
Subjt:  QEVEDMRRIPYASAVGSLMYVIFCTRLEICYAVRIVSRYQSNLGLDHWTAVKIILKYLRRTRDY-MLVYGAKDLILTGYTDSDFQTNKDSRKSTSRSVFT

Query:  LNGGAIVWRSIKQGCIADSTMEAEYVAACEAAKEAVW
        L    I W + +Q  ++ S+ E EY A    A E  W
Subjt:  LNGGAIVWRSIKQGCIADSTMEAEYVAACEAAKEAVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | e value: 8.8e-15 | % identity: 36.09
Query:  MPRRSGRIVSQPNRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARL
        M  RS   +++ N    LT T  +      ++P S   A+ D     W +AM  E++++  N  W LV  P     +GCKW++K K  S G +   KARL
Subjt:  MPRRSGRIVSQPNRYLGLTETQVVIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARL

Query:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIA
        VAKG+ Q EG+ + ET+SPV    +IR +L++A
Subjt:  VAKGYTQREGVDYEETFSPVAMLKSIRILLSIA


Sequences
CDS sequence
ATGGACAAAGATTCAGGGATTCGTTACCAACTACCATTCTCGTTGTTTGGTATCGGTAGGAAGGCGTGTGTTCTTCGAGAAGATATTATTGATTTTTGTAACATGCGAGA
AGTGAAGACTTTGACATTGGTTGCATATATGGCGTATTTGGATTCTCTGTTGGACACAACACCCAAGAATGGTTATTGGGCACTTCTAGCTATTAACGCATATGATGAGA
CAGTCTATTATCTTGACTCACTTCGAACAACATCAAGAGTAGATATTAGATATGTCACTGACACGGCTATTACTATCTTCGGATCTCAGAAGAATATTCAAACAAGTTGT
AAACAACCAATATGGAAAACAGTGAAGTGTCCTTTGCAAGTTGGAGTAGTTGAATGTGGATACTATGTGATGAGATACATGAGAGATATAATCACCAATGGGAGCATAGT
AGTCACAGATTTGATTGATACTAGGACCTCATACAGTCAACTCGAGTTGGACGAAGTACGAATGGAGCTTGCTGATTTTTTGGGTGGCCACATGGACAAGACCGAGTGTG
GAGCTGGGAACATAATCACACAATATGGAATTCACTCATTCCCACCTTTAAGATTAGGTCACATAAATCTCGATCGGATCGGGAGATTGGTAAAGAATGGACTTCTAAAT
AAGTTAGAAGATGATTCATTACCTCCATGTGAATCTTGTCTTGAATGTAAAATGACAAAGAGACCTTTTACTGGAAAAGGTTATAGAGCCAAAGAGCCTTTAGAACTTAT
ACATTCAGACCTCTGTGGTCCGATGAATGTAAAAGCTAGAGGGGGTTTTGAATACTTCATCTCTTTTATAGACGATTATTCTAGGTATGGTTATTTATACTTAATGGAGC
ATAAGTCTGAAGCTCTTGAAAAGTTCAAGGAGTATAAGAATGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCGAGGTGGAGAGTACATGGAT
TTGAGATTCCAGGACTATATGATAGAACATGGAATCCAAATCCAACTCTCAGCACCTGGTACACCTCAACAAAATGGTGTATTAGAAAGGAGAAATAGAACCATGTTAGA
CATGGTTCGTTCAATGATGAGTTACGCTCAATTGCCTAGCTCGTTTTGGGGGTATGCAGTAGAGACTGCAGTTCATATCTTGAACAATGTTTCCTCGAAGAGTGTTTCTG
AAACACCTTTCGAGTTATGGAGAGGACGTAAACCTAGTTTAAGTCATTTCAAAATTTTGGGTTGTCCAGCACACGTGTTAGTGACAAATCCCAAGAAGTTGGAACCTCGT
TCAAGGTTATGCCAATTTGTTGGTTACCCTAAAGAGACGAGAGGTGGTCTATTCTTCGATCCACAAAAAAATAGAGTGCTTGTATCGACAAATGCTACTTTCTTGGAAGA
AGACCACATGAGAGATCATAAACCACAAAACAAATTAGTATTAAATGAAGCTATTGATGAATCAACAAGGGTTGTTGATGAAGTTGGTCCCTCATCAAGAGTTAATGAAA
CCACCACATCAGGTCAATCTCATCCTTCTCAATCGTTGAGAATGCCTCGACGCAGTGGGAGGATTGTATCACAACCTAATCGTTATTTGGGTTTAACTGAAACTCAGGTT
GTCATACCAGATGATGGTGTTGAGGATCCATTGTCCTATAACCAGGCAATGAATGATGTAGATAAGGACCAATGGGTCAAAGCTATGGACCTTGAAATGGAGTCTATGTA
CTTCAATTTAATGTGGGAGCTTGTAGATCTACCTGAAGGGGTAAAACCTATAGGGTGTAAATGGATCTATAAGAGAAAGAGAGATTCAGCTGGGAAGGTACAGACCTTCA
AAGCTAGACTTGTAGCAAAAGGGTATACCCAAAGGGAAGGGGTTGACTATGAGGAAACTTTTTCTCCTGTTGCTATGTTAAAGTCTATAAGGATTCTCTTGTCCATCGCC
ACTTTTTATGATTATGAAATATGGAAAATGGATGTCAACACTGCTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGTTCATAACCCAAGG
TCAAGAGCAAAAAGTTTGTAAGTTGAATCGATCCATTTATGGATTGAAACAAGCATCTAGATCTTGGAACATTAGGTTTGATACTGCTATCAAATCCTACGGTTTTGAAC
AAAACGTTGATGAACCTTGTGTATATAAGAAAATTAATAAAGGAAAAGTAGTTTTCTTAGTACTTTATGTGGACGATATCCTCCTCATTGGGAATGATGTGGGATACCTT
ACTGACGTTAAAGCTTGGCTAGCAGCCCAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCGTAAGAACAAAACGTTAGC
ACTGTCTCAAGCAACCTATATCGACAAAATGTTGGTTCGATATTCGATGCAGAACTCTAAGAAGGGTTTATTACCTTTCAGACATGGGGTTCACTTGTCTAAGGAACAGT
GTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACGTATTCCCTATGCCTCAGCTGTGGGCAGCTTAATGTATGTTATATTCTGCACTAGACTAGAAATTTGTTATGCA
GTGAGAATAGTCAGTAGGTATCAGTCCAACCTAGGGTTAGACCACTGGACGGCGGTTAAAATTATTCTCAAGTATCTTAGGAGAACGAGAGACTACATGCTTGTGTATGG
AGCTAAGGATTTGATCCTTACAGGATACACTGACTCTGATTTCCAAACCAATAAGGATTCTAGAAAATCTACTTCGAGATCAGTGTTCACCCTAAATGGGGGAGCTATAG
TATGGCGTAGCATCAAGCAAGGATGCATTGCAGACTCTACAATGGAGGCTGAATACGTCGCTGCTTGTGAAGCAGCAAAAGAAGCAGTTTGGCTTAGGAAGTTCCTACAT
GATTTGGAAGTTGTTCCAAACATGAACTTGCCCATCACTCTATATTGTGATAACAGTGGGGCAGTAGCCAATTCTAAAGAACCTCGCAGCCATAAACGAGAGAAACACAT
AGAGAGGAAGTATCACCTGATACAGGAGATTGTGCAATGA
mRNA sequence
ATGGACAAAGATTCAGGGATTCGTTACCAACTACCATTCTCGTTGTTTGGTATCGGTAGGAAGGCGTGTGTTCTTCGAGAAGATATTATTGATTTTTGTAACATGCGAGA
AGTGAAGACTTTGACATTGGTTGCATATATGGCGTATTTGGATTCTCTGTTGGACACAACACCCAAGAATGGTTATTGGGCACTTCTAGCTATTAACGCATATGATGAGA
CAGTCTATTATCTTGACTCACTTCGAACAACATCAAGAGTAGATATTAGATATGTCACTGACACGGCTATTACTATCTTCGGATCTCAGAAGAATATTCAAACAAGTTGT
AAACAACCAATATGGAAAACAGTGAAGTGTCCTTTGCAAGTTGGAGTAGTTGAATGTGGATACTATGTGATGAGATACATGAGAGATATAATCACCAATGGGAGCATAGT
AGTCACAGATTTGATTGATACTAGGACCTCATACAGTCAACTCGAGTTGGACGAAGTACGAATGGAGCTTGCTGATTTTTTGGGTGGCCACATGGACAAGACCGAGTGTG
GAGCTGGGAACATAATCACACAATATGGAATTCACTCATTCCCACCTTTAAGATTAGGTCACATAAATCTCGATCGGATCGGGAGATTGGTAAAGAATGGACTTCTAAAT
AAGTTAGAAGATGATTCATTACCTCCATGTGAATCTTGTCTTGAATGTAAAATGACAAAGAGACCTTTTACTGGAAAAGGTTATAGAGCCAAAGAGCCTTTAGAACTTAT
ACATTCAGACCTCTGTGGTCCGATGAATGTAAAAGCTAGAGGGGGTTTTGAATACTTCATCTCTTTTATAGACGATTATTCTAGGTATGGTTATTTATACTTAATGGAGC
ATAAGTCTGAAGCTCTTGAAAAGTTCAAGGAGTATAAGAATGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCGAGGTGGAGAGTACATGGAT
TTGAGATTCCAGGACTATATGATAGAACATGGAATCCAAATCCAACTCTCAGCACCTGGTACACCTCAACAAAATGGTGTATTAGAAAGGAGAAATAGAACCATGTTAGA
CATGGTTCGTTCAATGATGAGTTACGCTCAATTGCCTAGCTCGTTTTGGGGGTATGCAGTAGAGACTGCAGTTCATATCTTGAACAATGTTTCCTCGAAGAGTGTTTCTG
AAACACCTTTCGAGTTATGGAGAGGACGTAAACCTAGTTTAAGTCATTTCAAAATTTTGGGTTGTCCAGCACACGTGTTAGTGACAAATCCCAAGAAGTTGGAACCTCGT
TCAAGGTTATGCCAATTTGTTGGTTACCCTAAAGAGACGAGAGGTGGTCTATTCTTCGATCCACAAAAAAATAGAGTGCTTGTATCGACAAATGCTACTTTCTTGGAAGA
AGACCACATGAGAGATCATAAACCACAAAACAAATTAGTATTAAATGAAGCTATTGATGAATCAACAAGGGTTGTTGATGAAGTTGGTCCCTCATCAAGAGTTAATGAAA
CCACCACATCAGGTCAATCTCATCCTTCTCAATCGTTGAGAATGCCTCGACGCAGTGGGAGGATTGTATCACAACCTAATCGTTATTTGGGTTTAACTGAAACTCAGGTT
GTCATACCAGATGATGGTGTTGAGGATCCATTGTCCTATAACCAGGCAATGAATGATGTAGATAAGGACCAATGGGTCAAAGCTATGGACCTTGAAATGGAGTCTATGTA
CTTCAATTTAATGTGGGAGCTTGTAGATCTACCTGAAGGGGTAAAACCTATAGGGTGTAAATGGATCTATAAGAGAAAGAGAGATTCAGCTGGGAAGGTACAGACCTTCA
AAGCTAGACTTGTAGCAAAAGGGTATACCCAAAGGGAAGGGGTTGACTATGAGGAAACTTTTTCTCCTGTTGCTATGTTAAAGTCTATAAGGATTCTCTTGTCCATCGCC
ACTTTTTATGATTATGAAATATGGAAAATGGATGTCAACACTGCTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGTTCATAACCCAAGG
TCAAGAGCAAAAAGTTTGTAAGTTGAATCGATCCATTTATGGATTGAAACAAGCATCTAGATCTTGGAACATTAGGTTTGATACTGCTATCAAATCCTACGGTTTTGAAC
AAAACGTTGATGAACCTTGTGTATATAAGAAAATTAATAAAGGAAAAGTAGTTTTCTTAGTACTTTATGTGGACGATATCCTCCTCATTGGGAATGATGTGGGATACCTT
ACTGACGTTAAAGCTTGGCTAGCAGCCCAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCGTAAGAACAAAACGTTAGC
ACTGTCTCAAGCAACCTATATCGACAAAATGTTGGTTCGATATTCGATGCAGAACTCTAAGAAGGGTTTATTACCTTTCAGACATGGGGTTCACTTGTCTAAGGAACAGT
GTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACGTATTCCCTATGCCTCAGCTGTGGGCAGCTTAATGTATGTTATATTCTGCACTAGACTAGAAATTTGTTATGCA
GTGAGAATAGTCAGTAGGTATCAGTCCAACCTAGGGTTAGACCACTGGACGGCGGTTAAAATTATTCTCAAGTATCTTAGGAGAACGAGAGACTACATGCTTGTGTATGG
AGCTAAGGATTTGATCCTTACAGGATACACTGACTCTGATTTCCAAACCAATAAGGATTCTAGAAAATCTACTTCGAGATCAGTGTTCACCCTAAATGGGGGAGCTATAG
TATGGCGTAGCATCAAGCAAGGATGCATTGCAGACTCTACAATGGAGGCTGAATACGTCGCTGCTTGTGAAGCAGCAAAAGAAGCAGTTTGGCTTAGGAAGTTCCTACAT
GATTTGGAAGTTGTTCCAAACATGAACTTGCCCATCACTCTATATTGTGATAACAGTGGGGCAGTAGCCAATTCTAAAGAACCTCGCAGCCATAAACGAGAGAAACACAT
AGAGAGGAAGTATCACCTGATACAGGAGATTGTGCAATGA
Protein sequence
MDKDSGIRYQLPFSLFGIGRKACVLREDIIDFCNMREVKTLTLVAYMAYLDSLLDTTPKNGYWALLAINAYDETVYYLDSLRTTSRVDIRYVTDTAITIFGSQKNIQTSC
KQPIWKTVKCPLQVGVVECGYYVMRYMRDIITNGSIVVTDLIDTRTSYSQLELDEVRMELADFLGGHMDKTECGAGNIITQYGIHSFPPLRLGHINLDRIGRLVKNGLLN
KLEDDSLPPCESCLECKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKNEVENLLSKKIKILRSDRGGEYMD
LRFQDYMIEHGIQIQLSAPGTPQQNGVLERRNRTMLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVSSKSVSETPFELWRGRKPSLSHFKILGCPAHVLVTNPKKLEPR
SRLCQFVGYPKETRGGLFFDPQKNRVLVSTNATFLEEDHMRDHKPQNKLVLNEAIDESTRVVDEVGPSSRVNETTTSGQSHPSQSLRMPRRSGRIVSQPNRYLGLTETQV
VIPDDGVEDPLSYNQAMNDVDKDQWVKAMDLEMESMYFNLMWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIA
TFYDYEIWKMDVNTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKINKGKVVFLVLYVDDILLIGNDVGYL
TDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLVRYSMQNSKKGLLPFRHGVHLSKEQCPKTPQEVEDMRRIPYASAVGSLMYVIFCTRLEICYA
VRIVSRYQSNLGLDHWTAVKIILKYLRRTRDYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSRSVFTLNGGAIVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLH
DLEVVPNMNLPITLYCDNSGAVANSKEPRSHKREKHIERKYHLIQEIVQ
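
The CDS and protein records can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) and using only the first 24 and last 12 nucleotides of the CDS listed above. The `translate` helper is a hypothetical illustration, not part of any database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate an uppercase DNA string in frame 1; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

cds_start = "ATGGACAAAGATTCAGGGATTCGT"  # first 24 nt of the CDS
cds_end = "ATTGTGCAATGA"                # last 12 nt of the CDS

print(translate(cds_start))  # MDKDSGIR, the start of the protein sequence
print(translate(cds_end))    # IVQ*, the protein's last residues plus the stop
```

Running the same function over the full CDS (with line breaks removed) and dropping the trailing `*` should give the protein sequence shown above.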