; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0014574 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0014574
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionGag/pol protein
Genome locationchr02:12251536..12253683
RNA-Seq ExpressionPay0014574
SyntenyPay0014574
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0080Show/hide
Query:  MVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVF
        MVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVF
Subjt:  MVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVF

Query:  VSTNATFLEEDHMRNHKPRSKLLV------------------------------PHQGLMKPPHQVNL--------------ILLNHDGVEDPLSYKQAM
        VSTNATFLEEDHMRNHKPRSKL++                              P Q L  P     +              +++  DGVEDPLSYKQAM
Subjt:  VSTNATFLEEDHMRNHKPRSKLLV------------------------------PHQGLMKPPHQVNL--------------ILLNHDGVEDPLSYKQAM

Query:  NDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG------------------------------
        NDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG                              
Subjt:  NDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG------------------------------

Query:  ------TAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDV
              TAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDV
Subjt:  ------TAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDV

Query:  GYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLM
        GYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLM
Subjt:  GYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLM

Query:  YAMLCTRPDICYAVGIVSRYQSNPG----------------------------------------------KSTSGSVFTLNGGAVVWRSIKQGCIADST
        YAMLCTRPDICYAVGIVSRYQSNPG                                              KSTSGSVFTLNGGAVVWRSIKQGCIADST
Subjt:  YAMLCTRPDICYAVGIVSRYQSNPG----------------------------------------------KSTSGSVFTLNGGAVVWRSIKQGCIADST

Query:  MEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVF
        MEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVF
Subjt:  MEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVF

Query:  EGHLESLGLRDMYIR
        EGHLESLGLRDMYIR
Subjt:  EGHLESLGLRDMYIR

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0079.16Show/hide
Query:  MVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVF
        MVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDP+ENRVF
Subjt:  MVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVF

Query:  VSTNATFLEEDHMRNHKPRSKLLV------------------------------PHQGLMKPPHQVNL--------------ILLNHDGVEDPLSYKQAM
        VSTNATFLEEDHMRNHKPRSKL++                              P Q L  P     +              +++  DGVEDPLSYKQAM
Subjt:  VSTNATFLEEDHMRNHKPRSKLLV------------------------------PHQGLMKPPHQVNL--------------ILLNHDGVEDPLSYKQAM

Query:  NDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG------------------------------
        NDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYT++EG                              
Subjt:  NDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG------------------------------

Query:  ------TAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDV
              TAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDV
Subjt:  ------TAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDV

Query:  GYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLM
        GYLTDVKAWLAAQFQMKDLGE QYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLM
Subjt:  GYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLM

Query:  YAMLCTRPDICYAVGIVSRYQSNPG----------------------------------------------KSTSGSVFTLNGGAVVWRSIKQGCIADST
        YAMLCTRPDICYAVGIVSRYQSNPG                                              KSTS SVFTLNGGAVVWRSIKQGCIADST
Subjt:  YAMLCTRPDICYAVGIVSRYQSNPG----------------------------------------------KSTSGSVFTLNGGAVVWRSIKQGCIADST

Query:  MEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVF
        MEAEYVAACEAAKEAVWL+KFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVF
Subjt:  MEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVF

Query:  EGHLESLGLRDMYIR
        EGHLESLGLRDMYIR
Subjt:  EGHLESLGLRDMYIR

KAA0051680.1 gag/pol protein [Cucumis melo var. makuwa]6.3e-27883.33Show/hide
Query:  VETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPR
        VET VHILNNVPS SVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLE RSRLCQFVGYPKETRGGLFFDPQ NRVFVSTNATFLEEDHMR+HKPR
Subjt:  VETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPR

Query:  SKLLVPHQ--------GLMKPPHQVNLILLNHDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK-VQ
        SKL++             + P  +VN    + DGVED L Y+QA NDVDKDQWVKAMDLEMESMYFNS+WELVDLPEGVKPIGCKWIYKRKRDSAGK ++
Subjt:  SKLLVPHQ--------GLMKPPHQVNLILLNHDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK-VQ

Query:  TFKARLVAKGY---TQREGTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLV
           +  +   Y         A+LN N EESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS  FDQNVDEP VYKKINK KVAFLV
Subjt:  TFKARLVAKGY---TQREGTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLV

Query:  LYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDM
        LYVDDILLIGNDV YLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTL LSQATYIDK+LVRYSMQNSK  LLPFRHGVHLSKEQ PKTPQEVEDM
Subjt:  LYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDM

Query:  RRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG------------KSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKF
        RRI Y SAVGSLMY MLCT+ DI  AVGIVSRYQSNPG            KSTSGSVFTLNGGAV W SIKQGCIADSTMEA YV ACEAAKE VWLRKF
Subjt:  RRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG------------KSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKF

Query:  LHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
        LHDLEVVPNMNLPITLYCDNSG V NSKEPRSHKRGKHIERKYHLIREIVQR DVIVTKIASEHNIAD FTKTLTAKVFEGHLESLGLRDMYIR
Subjt:  LHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0080Show/hide
Query:  MVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVF
        MVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVF
Subjt:  MVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVF

Query:  VSTNATFLEEDHMRNHKPRSKLLV------------------------------PHQGLMKPPHQVNL--------------ILLNHDGVEDPLSYKQAM
        VSTNATFLEEDHMRNHKPRSKL++                              P Q L  P     +              +++  DGVEDPLSYKQAM
Subjt:  VSTNATFLEEDHMRNHKPRSKLLV------------------------------PHQGLMKPPHQVNL--------------ILLNHDGVEDPLSYKQAM

Query:  NDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG------------------------------
        NDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG                              
Subjt:  NDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG------------------------------

Query:  ------TAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDV
              TAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDV
Subjt:  ------TAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDV

Query:  GYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLM
        GYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLM
Subjt:  GYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLM

Query:  YAMLCTRPDICYAVGIVSRYQSNPG----------------------------------------------KSTSGSVFTLNGGAVVWRSIKQGCIADST
        YAMLCTRPDICYAVGIVSRYQSNPG                                              KSTSGSVFTLNGGAVVWRSIKQGCIADST
Subjt:  YAMLCTRPDICYAVGIVSRYQSNPG----------------------------------------------KSTSGSVFTLNGGAVVWRSIKQGCIADST

Query:  MEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVF
        MEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVF
Subjt:  MEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVF

Query:  EGHLESLGLRDMYIR
        EGHLESLGLRDMYIR
Subjt:  EGHLESLGLRDMYIR

TYJ97931.1 gag/pol protein [Cucumis melo var. makuwa]2.2e-27883.9Show/hide
Query:  VETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPR
        VET VHILNNVPS SVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLE RSRLCQFVGYPKETRGGLFFDPQ NRVFVSTNATFLEEDHMR+HKPR
Subjt:  VETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPR

Query:  SKLLVPHQ--------GLMKPPHQVNLILLNHDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK-VQ
        SKL++             + P  +VN    + DGVED L Y+QA NDVDKDQWVKAMDLEMESMYFNS+WELVDLPEGVKPIGCKWIYKRKRDSAGK ++
Subjt:  SKLLVPHQ--------GLMKPPHQVNLILLNHDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK-VQ

Query:  TFKARLVAKGY---TQREGTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLV
           +  +   Y         A+LN N EESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS  FDQNVDEP VYKKINK KVAFLV
Subjt:  TFKARLVAKGY---TQREGTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLV

Query:  LYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDM
        LYVDDILLIGNDV YLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTL LSQATYIDK+LVRYSMQNSK  LLPFRHGVHLSKEQ PKTPQEVEDM
Subjt:  LYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDM

Query:  RRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG--------KSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDL
        RRI Y SAVGSLMY MLCT+ DI  AVGIVSRYQSNPG        KSTSGSVFTLNGGAV W SIKQGCIADSTMEA YV ACEAAKE VWLRKFLHDL
Subjt:  RRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG--------KSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDL

Query:  EVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
        EVVPNMNLPITLYCDNSG V NSKEPRSHKRGKHIERKYHLIREIVQR DVIVTKIASEHNIAD FTKTLTAKVFEGHLESLGLRDMYIR
Subjt:  EVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein0.0e+0079.16Show/hide
Query:  MVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVF
        MVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDP+ENRVF
Subjt:  MVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVF

Query:  VSTNATFLEEDHMRNHKPRSKLLV------------------------------PHQGLMKPPHQVNL--------------ILLNHDGVEDPLSYKQAM
        VSTNATFLEEDHMRNHKPRSKL++                              P Q L  P     +              +++  DGVEDPLSYKQAM
Subjt:  VSTNATFLEEDHMRNHKPRSKLLV------------------------------PHQGLMKPPHQVNL--------------ILLNHDGVEDPLSYKQAM

Query:  NDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG------------------------------
        NDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYT++EG                              
Subjt:  NDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG------------------------------

Query:  ------TAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDV
              TAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDV
Subjt:  ------TAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDV

Query:  GYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLM
        GYLTDVKAWLAAQFQMKDLGE QYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLM
Subjt:  GYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLM

Query:  YAMLCTRPDICYAVGIVSRYQSNPG----------------------------------------------KSTSGSVFTLNGGAVVWRSIKQGCIADST
        YAMLCTRPDICYAVGIVSRYQSNPG                                              KSTS SVFTLNGGAVVWRSIKQGCIADST
Subjt:  YAMLCTRPDICYAVGIVSRYQSNPG----------------------------------------------KSTSGSVFTLNGGAVVWRSIKQGCIADST

Query:  MEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVF
        MEAEYVAACEAAKEAVWL+KFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVF
Subjt:  MEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVF

Query:  EGHLESLGLRDMYIR
        EGHLESLGLRDMYIR
Subjt:  EGHLESLGLRDMYIR

A0A5A7TZD0 Gag/pol protein0.0e+0080Show/hide
Query:  MVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVF
        MVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVF
Subjt:  MVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVF

Query:  VSTNATFLEEDHMRNHKPRSKLLV------------------------------PHQGLMKPPHQVNL--------------ILLNHDGVEDPLSYKQAM
        VSTNATFLEEDHMRNHKPRSKL++                              P Q L  P     +              +++  DGVEDPLSYKQAM
Subjt:  VSTNATFLEEDHMRNHKPRSKLLV------------------------------PHQGLMKPPHQVNL--------------ILLNHDGVEDPLSYKQAM

Query:  NDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG------------------------------
        NDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG                              
Subjt:  NDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG------------------------------

Query:  ------TAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDV
              TAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDV
Subjt:  ------TAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDV

Query:  GYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLM
        GYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLM
Subjt:  GYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLM

Query:  YAMLCTRPDICYAVGIVSRYQSNPG----------------------------------------------KSTSGSVFTLNGGAVVWRSIKQGCIADST
        YAMLCTRPDICYAVGIVSRYQSNPG                                              KSTSGSVFTLNGGAVVWRSIKQGCIADST
Subjt:  YAMLCTRPDICYAVGIVSRYQSNPG----------------------------------------------KSTSGSVFTLNGGAVVWRSIKQGCIADST

Query:  MEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVF
        MEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVF
Subjt:  MEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVF

Query:  EGHLESLGLRDMYIR
        EGHLESLGLRDMYIR
Subjt:  EGHLESLGLRDMYIR

A0A5A7U951 Gag/pol protein3.0e-27883.33Show/hide
Query:  VETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPR
        VET VHILNNVPS SVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLE RSRLCQFVGYPKETRGGLFFDPQ NRVFVSTNATFLEEDHMR+HKPR
Subjt:  VETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPR

Query:  SKLLVPHQ--------GLMKPPHQVNLILLNHDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK-VQ
        SKL++             + P  +VN    + DGVED L Y+QA NDVDKDQWVKAMDLEMESMYFNS+WELVDLPEGVKPIGCKWIYKRKRDSAGK ++
Subjt:  SKLLVPHQ--------GLMKPPHQVNLILLNHDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK-VQ

Query:  TFKARLVAKGY---TQREGTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLV
           +  +   Y         A+LN N EESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS  FDQNVDEP VYKKINK KVAFLV
Subjt:  TFKARLVAKGY---TQREGTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLV

Query:  LYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDM
        LYVDDILLIGNDV YLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTL LSQATYIDK+LVRYSMQNSK  LLPFRHGVHLSKEQ PKTPQEVEDM
Subjt:  LYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDM

Query:  RRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG------------KSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKF
        RRI Y SAVGSLMY MLCT+ DI  AVGIVSRYQSNPG            KSTSGSVFTLNGGAV W SIKQGCIADSTMEA YV ACEAAKE VWLRKF
Subjt:  RRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG------------KSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKF

Query:  LHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
        LHDLEVVPNMNLPITLYCDNSG V NSKEPRSHKRGKHIERKYHLIREIVQR DVIVTKIASEHNIAD FTKTLTAKVFEGHLESLGLRDMYIR
Subjt:  LHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR

A0A5A7UYE8 Gag/pol protein0.0e+0080Show/hide
Query:  MVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVF
        MVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVF
Subjt:  MVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVF

Query:  VSTNATFLEEDHMRNHKPRSKLLV------------------------------PHQGLMKPPHQVNL--------------ILLNHDGVEDPLSYKQAM
        VSTNATFLEEDHMRNHKPRSKL++                              P Q L  P     +              +++  DGVEDPLSYKQAM
Subjt:  VSTNATFLEEDHMRNHKPRSKLLV------------------------------PHQGLMKPPHQVNL--------------ILLNHDGVEDPLSYKQAM

Query:  NDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG------------------------------
        NDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG                              
Subjt:  NDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG------------------------------

Query:  ------TAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDV
              TAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDV
Subjt:  ------TAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDV

Query:  GYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLM
        GYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLM
Subjt:  GYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLM

Query:  YAMLCTRPDICYAVGIVSRYQSNPG----------------------------------------------KSTSGSVFTLNGGAVVWRSIKQGCIADST
        YAMLCTRPDICYAVGIVSRYQSNPG                                              KSTSGSVFTLNGGAVVWRSIKQGCIADST
Subjt:  YAMLCTRPDICYAVGIVSRYQSNPG----------------------------------------------KSTSGSVFTLNGGAVVWRSIKQGCIADST

Query:  MEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVF
        MEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVF
Subjt:  MEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVF

Query:  EGHLESLGLRDMYIR
        EGHLESLGLRDMYIR
Subjt:  EGHLESLGLRDMYIR

A0A5D3BH07 Gag/pol protein1.0e-27883.9Show/hide
Query:  VETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPR
        VET VHILNNVPS SVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLE RSRLCQFVGYPKETRGGLFFDPQ NRVFVSTNATFLEEDHMR+HKPR
Subjt:  VETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPR

Query:  SKLLVPHQ--------GLMKPPHQVNLILLNHDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK-VQ
        SKL++             + P  +VN    + DGVED L Y+QA NDVDKDQWVKAMDLEMESMYFNS+WELVDLPEGVKPIGCKWIYKRKRDSAGK ++
Subjt:  SKLLVPHQ--------GLMKPPHQVNLILLNHDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK-VQ

Query:  TFKARLVAKGY---TQREGTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLV
           +  +   Y         A+LN N EESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS  FDQNVDEP VYKKINK KVAFLV
Subjt:  TFKARLVAKGY---TQREGTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLV

Query:  LYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDM
        LYVDDILLIGNDV YLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTL LSQATYIDK+LVRYSMQNSK  LLPFRHGVHLSKEQ PKTPQEVEDM
Subjt:  LYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDM

Query:  RRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG--------KSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDL
        RRI Y SAVGSLMY MLCT+ DI  AVGIVSRYQSNPG        KSTSGSVFTLNGGAV W SIKQGCIADSTMEA YV ACEAAKE VWLRKFLHDL
Subjt:  RRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG--------KSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDL

Query:  EVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR
        EVVPNMNLPITLYCDNSG V NSKEPRSHKRGKHIERKYHLIREIVQR DVIVTKIASEHNIAD FTKTLTAKVFEGHLESLGLRDMYIR
Subjt:  EVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.1e-7026.25Show/hide
Query:  RSMMSYAQLPSSFWGYAVETAVHILNNVPSKSV---SETPFELWRGRKPSLSHFRIWGCPAHVLVTNPK-KLEPRSRLCQFVGYPKETRGGLFFD-----
        R+M+S A+L  SFWG AV TA +++N +PS+++   S+TP+E+W  +KP L H R++G   +V + N + K + +S    FVGY  E  G   +D     
Subjt:  RSMMSYAQLPSSFWGYAVETAVHILNNVPSKSV---SETPFELWRGRKPSLSHFRIWGCPAHVLVTNPK-KLEPRSRLCQFVGYPKETRGGLFFD-----

Query:  --------------------------------------PQENRVFVST----------NATFL---EEDHMRNHKPRSKLLV----PHQG-------LMK
                                              P ++R  + T          N  FL   +E   +N    S+ ++    P++         +K
Subjt:  --------------------------------------PQENRVFVST----------NATFL---EEDHMRNHKPRSKLLV----PHQG-------LMK

Query:  PPHQVNLILLN------------------------------------------HDGVE------------DPLSYKQ--------------AMNDV----
           + N   LN                                          +DG+E              +SY +                NDV    
Subjt:  PPHQVNLILLN------------------------------------------HDGVE------------DPLSYKQ--------------AMNDV----

Query:  -------DKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQR----------------------------
               DK  W +A++ E+ +   N+ W +   PE    +  +W++  K +  G    +KARLVA+G+TQ+                            
Subjt:  -------DKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQR----------------------------

Query:  --------EGTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVY--KKINKGKVAFLVLYVDDIL
                  TAFLNG L+E I+M  P+G         VCKLN++IYGLKQA+R W   F+ A+K   F  +  + C+Y   K N  +  +++LYVDD++
Subjt:  --------EGTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVY--KKINKGKVAFLVLYVDDIL

Query:  LIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYAS
        +   D+  + + K +L  +F+M DL E ++ +GI+I  + +   + LSQ+ Y+ K+L +++M+N      P    ++     S       ++    P  S
Subjt:  LIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYAS

Query:  AVGSLMYAMLCTRPDICYAVGIVSRYQSNPG-------------------------------------------------KSTSGSVFTL-NGGAVVWRS
         +G LMY MLCTRPD+  AV I+SRY S                                                    KST+G +F + +   + W +
Subjt:  AVGSLMYAMLCTRPDICYAVGIVSRYQSNPG-------------------------------------------------KSTSGSVFTL-NGGAVVWRS

Query:  IKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADP
         +Q  +A S+ EAEY+A  EA +EA+WL+  L  + +   +  PI +Y DN G ++ +  P  HKR KHI+ KYH  RE VQ   + +  I +E+ +AD 
Subjt:  IKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADP

Query:  FTKTLTAKVFEGHLESLGL
        FTK L A  F    + LGL
Subjt:  FTKTLTAKVFEGHLESLGL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.6e-11734.56Show/hide
Query:  VRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCP--AHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENR
        VRSM+  A+LP SFWG AV+TA +++N  PS  ++ E P  +W  ++ S SH +++GC   AHV      KL+ +S  C F+GY  E  G   +DP + +
Subjt:  VRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCP--AHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENR

Query:  VFVSTNATFLE---------EDHMRNHKPRSKLLVP-------------------------------------------------HQGLMKPPH------
        V  S +  F E          + ++N    + + +P                                                 HQ L +         
Subjt:  VFVSTNATFLE---------EDHMRNHKPRSKLLVP-------------------------------------------------HQGLMKPPH------

Query:  ---QVNLILLNHDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG-----
               +L++ D   +P S K+ ++  +K+Q +KAM  EMES+  N  ++LV+LP+G +P+ CKW++K K+D   K+  +KARLV KG+ Q++G     
Subjt:  ---QVNLILLNHDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG-----

Query:  -------------------------------TAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVY
                                       TAFL+G+LEE I+M QPEGF   G++  VCKLN+S+YGLKQA R W ++FD+ +KS  + +   +PCVY
Subjt:  -------------------------------TAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVY

Query:  -KKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSK
         K+ ++     L+LYVDD+L++G D G +  +K  L+  F MKDLG AQ +LG++I+R+R ++ L LSQ  YI+++L R++M+N+K    P    + LSK
Subjt:  -KKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSK

Query:  EQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG----------------------------------------------KST
        +  P T +E  +M ++PY+SAVGSLMYAM+CTRPDI +AVG+VSR+  NPG                                              KS+
Subjt:  EQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG----------------------------------------------KST

Query:  SGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRG
        +G +FT +GGA+ W+S  Q C+A ST EAEY+AA E  KE +WL++FL +L +         +YCD+  A+  SK    H R KHI+ +YH IRE+V   
Subjt:  SGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRG

Query:  DVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGL
         + V KI++  N AD  TK +    FE   E +G+
Subjt:  DVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGL

P25600 Putative transposon Ty5-1 protein YCL074W2.2e-2328.39Show/hide
Query:  TAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDV
        TAFLN  ++E I++ QP GF+ +     V +L   +YGLKQA   WN   +  +K  GF ++  E  +Y +       ++ +YVDD+L+          V
Subjt:  TAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDV

Query:  KAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCT
        K  L   + MKDLG+    LG+  I    N  + LS   YI K      +   K    P  +   L +  SP     ++D+   PY S VG L++     
Subjt:  KAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCT

Query:  RPDICYAVGIVSRYQSNP-----------------------------------------------GKSTSGSVFTLNGGAVVWRSIK-QGCIADSTMEAE
        RPDI Y V ++SR+   P                                                 ST G V  L G  V W S K +G I   + EAE
Subjt:  RPDICYAVGIVSRYQSNP-----------------------------------------------GKSTSGSVFTLNGGAVVWRSIK-QGCIADSTMEAE

Query:  YVAACEAAKE
        Y+ A E   E
Subjt:  YVAACEAAKE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.0e-5729.81Show/hide
Query:  GLMKPPHQVNLILLNHDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEG-VKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG
        G++KP  + +L  ++     +P +  QA+ D   ++W  AM  E+ +   N  W+LV  P   V  +GC+WI+ +K +S G +  +KARLVAKGY QR G
Subjt:  GLMKPPHQVNLILLNHDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEG-VKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG

Query:  ------------------------------------TAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVD
                                             AFL G L + ++MSQP GFI + +   VCKL +++YGLKQA R+W +     + + GF  +V 
Subjt:  ------------------------------------TAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVD

Query:  EPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGV
        +  ++       + ++++YVDDIL+ GND   L +    L+ +F +KD  E  Y LGI+    R    L LSQ  YI  LL R +M  +K    P     
Subjt:  EPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGV

Query:  HLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGK--------------------------------------------
         LS     K     E      Y   VGSL Y +  TRPDI YAV  +S++   P +                                            
Subjt:  HLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGK--------------------------------------------

Query:  ---STSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIRE
           ST+G +  L    + W S KQ  +  S+ EAEY +    + E  W+   L +L +   +  P  +YCDN GA      P  H R KHI   YH IR 
Subjt:  ---STSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIRE

Query:  IVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGL
         VQ G + V  +++   +AD  TK L+   F+     +G+
Subjt:  IVQRGDVIVTKIASEHNIADPFTKTLTAKVFEGHLESLGL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.4e-5729.81Show/hide
Query:  DPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELV-DLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG--------------------
        +P +  QAM D   D+W +AM  E+ +   N  W+LV   P  V  +GC+WI+ +K +S G +  +KARLVAKGY QR G                    
Subjt:  DPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELV-DLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG--------------------

Query:  ----------------TAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYV
                         AFL G L + ++MSQP GF+ + +   VC+L ++IYGLKQA R+W +   T + + GF  ++ +  ++       + ++++YV
Subjt:  ----------------TAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYV

Query:  DDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRI
        DDIL+ GND   L      L+ +F +K+  +  Y LGI+    R  + L LSQ  Y   LL R +M  +K    P      L+     K P   E     
Subjt:  DDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRI

Query:  PYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGK-----------------------------------------------STSGSVFTLNGGAVVWR
         Y   VGSL Y +  TRPD+ YAV  +S+Y   P                                                 ST+G +  L    + W 
Subjt:  PYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGK-----------------------------------------------STSGSVFTLNGGAVVWR

Query:  SIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIAD
        S KQ  +  S+ EAEY +    + E  W+   L +L +   ++ P  +YCDN GA      P  H R KHI   YH IR  VQ G + V  +++   +AD
Subjt:  SIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIAD

Query:  PFTKTLTAKVFEGHLESLGL
          TK L+   F+     +G+
Subjt:  PFTKTLTAKVFEGHLESLGL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.7e-5329.96Show/hide
Query:  EDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG--------------------
        ++P +Y +A   +    W  AMD E+ +M     WE+  LP   KPIGCKW+YK K +S G ++ +KARLVAKGYTQ+EG                    
Subjt:  EDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREG--------------------

Query:  ----------------TAFLNGNLEESIFMSQPEGFIT-QGQE---QKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFL
                         AFLNG+L+E I+M  P G+   QG       VC L +SIYGLKQASR W ++F   +  +GF Q+  +   + KI       +
Subjt:  ----------------TAFLNGNLEESIFMSQPEGFIT-QGQE---QKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFL

Query:  VLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVED
        ++YVDDI++  N+   + ++K+ L + F+++DLG  +Y LG++I R      + + Q  Y   LL    +   K   +P    V  S         +  D
Subjt:  VLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVED

Query:  MRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP-----------------------------------------------GKSTSGSVFTLNGGA
         +   Y   +G LMY  + TR DI +AV  +S++   P                                                +ST+G    L    
Subjt:  MRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP-----------------------------------------------GKSTSGSVFTLNGGA

Query:  VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIRE
        + W+S KQ  ++ S+ EAEY A   A  E +WL +F  +L++   ++ P  L+CDN+ A+  +     H+R KHIE   H +RE
Subjt:  VVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIRE

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.7e-0738.67Show/hide
Query:  VRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSR
        VRSM+    LP +F   A  TAVHI+N  PS +++   P E+W    P+ S+ R +GC A++   +  KL+PR++
Subjt:  VRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSR

ATMG00810.1 DNA/RNA polymerases superfamily protein1.2e-1129.06Show/hide
Query:  FLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEV
        +L+LYVDDILL G+    L  +   L++ F MKDLG   Y LGIQI        L LSQ  Y +++L    M + K    P    + L+   S     + 
Subjt:  FLVLYVDDILLIGNDVGYLTDVKAWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEV

Query:  EDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP-----------------------------------------------GKSTSGSVFTLNG
         D R     S VG+L Y  L TRPDI YAV IV +    P                                                +ST+G    L  
Subjt:  EDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP-----------------------------------------------GKSTSGSVFTLNG

Query:  GAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVW
          + W + +Q  ++ S+ E EY A    A E  W
Subjt:  GAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)4.9e-1043.28Show/hide
Query:  WVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGTAFL
        W +AM  E++++  N  W LV  P     +GCKW++K K  S G +   KARLVAKG+ Q EG  F+
Subjt:  WVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGTAFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCGTTCAATGATGAGTTACGCTCAATTGCCTAGCTCGTTTTGGGGGTATGCAGTAGAGACTGCAGTTCATATCTTGAACAATGTTCCCTCGAAGAGTGTTTCTGA
AACACCTTTCGAGCTATGGAGAGGACGTAAACCTAGTTTAAGTCATTTCAGAATTTGGGGTTGTCCAGCACACGTATTAGTGACAAATCCCAAGAAGTTGGAACCTCGTT
CAAGGTTATGCCAATTTGTTGGTTACCCTAAAGAGACGAGAGGTGGTCTATTCTTTGATCCACAAGAAAATAGAGTGTTTGTATCGACAAATGCTACTTTCTTGGAAGAA
GACCACATGAGAAATCATAAACCACGAAGCAAATTATTGGTCCCTCATCAAGGGTTGATGAAACCACCACATCAGGTCAATCTCATCCTTCTCAATCATGATGGTGTTGA
GGATCCATTGTCCTATAAACAGGCAATGAATGATGTAGATAAGGACCAATGGGTCAAAGCCATGGACCTTGAAATGGAGTCTATGTACTTCAATTCAGTGTGGGAGCTTG
TAGATCTACCTGAAGGGGTAAAACCTATAGGGTGCAAATGGATCTATAAGAGAAAGAGAGATTCAGCTGGGAAGGTACAGACCTTTAAAGCTAGACTTGTGGCAAAAGGG
TATACCCAAAGGGAAGGGACTGCTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTTGCAA
GCTGAATCGATCCATTTATGGGTTGAAACAAGCATCAAGATCTTGGAACATTAGGTTTGATACTGCAATCAAATCCTATGGTTTTGACCAGAATGTTGATGAACCTTGTG
TATATAAGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATGTGGACGATATCCTCCTCATTGGGAATGATGTGGGTTACCTTACTGACGTTAAAGCTTGGCTA
GCAGCCCAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCGTAAGAACAAAACGCTAGCACTGTCTCAAGCAACCTATAT
CGACAAATTGTTGGTTCGATATTCGATGCAGAACTCTAAGAAGGGTTTATTACCTTTCAGGCATGGAGTTCACTTGTCTAAGGAACAGAGTCCTAAGACACCTCAAGAAG
TTGAGGATATGAGACGTATTCCCTATGCCTCAGCTGTGGGCAGCTTAATGTATGCTATGCTCTGCACTAGGCCAGACATTTGTTATGCAGTGGGAATAGTCAGTAGGTAT
CAGTCCAACCCAGGGAAATCCACATCGGGATCAGTGTTCACCCTAAATGGGGGAGCTGTAGTATGGCGTAGCATCAAGCAAGGATGCATTGCAGACTCTACAATGGAGGC
TGAATACGTCGCTGCTTGTGAAGCAGCAAAAGAAGCAGTTTGGCTTAGGAAGTTCCTACATGATTTGGAAGTTGTTCCAAATATGAACTTGCCCATCACTCTATATTGTG
ATAACAGTGGGGCAGTAGCCAATTCTAAAGAACCTCGCAGCCATAAACGAGGGAAACACATAGAGAGGAAGTATCATCTGATACGGGAGATTGTGCAACGAGGGGATGTG
ATCGTCACCAAGATCGCTTCGGAGCACAACATTGCTGATCCATTTACGAAGACTCTCACGGCTAAAGTGTTCGAGGGTCATCTAGAAAGTCTAGGTCTACGAGATATGTA
CATTAGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCGTTCAATGATGAGTTACGCTCAATTGCCTAGCTCGTTTTGGGGGTATGCAGTAGAGACTGCAGTTCATATCTTGAACAATGTTCCCTCGAAGAGTGTTTCTGA
AACACCTTTCGAGCTATGGAGAGGACGTAAACCTAGTTTAAGTCATTTCAGAATTTGGGGTTGTCCAGCACACGTATTAGTGACAAATCCCAAGAAGTTGGAACCTCGTT
CAAGGTTATGCCAATTTGTTGGTTACCCTAAAGAGACGAGAGGTGGTCTATTCTTTGATCCACAAGAAAATAGAGTGTTTGTATCGACAAATGCTACTTTCTTGGAAGAA
GACCACATGAGAAATCATAAACCACGAAGCAAATTATTGGTCCCTCATCAAGGGTTGATGAAACCACCACATCAGGTCAATCTCATCCTTCTCAATCATGATGGTGTTGA
GGATCCATTGTCCTATAAACAGGCAATGAATGATGTAGATAAGGACCAATGGGTCAAAGCCATGGACCTTGAAATGGAGTCTATGTACTTCAATTCAGTGTGGGAGCTTG
TAGATCTACCTGAAGGGGTAAAACCTATAGGGTGCAAATGGATCTATAAGAGAAAGAGAGATTCAGCTGGGAAGGTACAGACCTTTAAAGCTAGACTTGTGGCAAAAGGG
TATACCCAAAGGGAAGGGACTGCTTTTCTGAATGGCAATCTTGAAGAGAGTATCTTTATGTCTCAGCCCGAGGGGTTCATAACCCAAGGTCAAGAGCAAAAAGTTTGCAA
GCTGAATCGATCCATTTATGGGTTGAAACAAGCATCAAGATCTTGGAACATTAGGTTTGATACTGCAATCAAATCCTATGGTTTTGACCAGAATGTTGATGAACCTTGTG
TATATAAGAAAATCAACAAAGGAAAAGTAGCTTTCTTAGTACTTTATGTGGACGATATCCTCCTCATTGGGAATGATGTGGGTTACCTTACTGACGTTAAAGCTTGGCTA
GCAGCCCAATTCCAAATGAAAGATTTAGGAGAGGCACAATATGTTCTTGGGATCCAAATCATAAGGGATCGTAAGAACAAAACGCTAGCACTGTCTCAAGCAACCTATAT
CGACAAATTGTTGGTTCGATATTCGATGCAGAACTCTAAGAAGGGTTTATTACCTTTCAGGCATGGAGTTCACTTGTCTAAGGAACAGAGTCCTAAGACACCTCAAGAAG
TTGAGGATATGAGACGTATTCCCTATGCCTCAGCTGTGGGCAGCTTAATGTATGCTATGCTCTGCACTAGGCCAGACATTTGTTATGCAGTGGGAATAGTCAGTAGGTAT
CAGTCCAACCCAGGGAAATCCACATCGGGATCAGTGTTCACCCTAAATGGGGGAGCTGTAGTATGGCGTAGCATCAAGCAAGGATGCATTGCAGACTCTACAATGGAGGC
TGAATACGTCGCTGCTTGTGAAGCAGCAAAAGAAGCAGTTTGGCTTAGGAAGTTCCTACATGATTTGGAAGTTGTTCCAAATATGAACTTGCCCATCACTCTATATTGTG
ATAACAGTGGGGCAGTAGCCAATTCTAAAGAACCTCGCAGCCATAAACGAGGGAAACACATAGAGAGGAAGTATCATCTGATACGGGAGATTGTGCAACGAGGGGATGTG
ATCGTCACCAAGATCGCTTCGGAGCACAACATTGCTGATCCATTTACGAAGACTCTCACGGCTAAAGTGTTCGAGGGTCATCTAGAAAGTCTAGGTCTACGAGATATGTA
CATTAGGTAA
Protein sequenceShow/hide protein sequence
MVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEE
DHMRNHKPRSKLLVPHQGLMKPPHQVNLILLNHDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKG
YTQREGTAFLNGNLEESIFMSQPEGFITQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQNVDEPCVYKKINKGKVAFLVLYVDDILLIGNDVGYLTDVKAWL
AAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKLLVRYSMQNSKKGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRPDICYAVGIVSRY
QSNPGKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLHDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDV
IVTKIASEHNIADPFTKTLTAKVFEGHLESLGLRDMYIR