; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028607 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028607
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr8:26208701..26217525
RNA-Seq ExpressionLag0028607
SyntenyLag0028607
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]8.8e-28576.7Show/hide
Query:  VSHRVTRISVSKTPFELWKGRKPSLRYFRIWGCPTHVLVTNPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHVRDHKPRSKLVLG
        + + V   SV +TP+ELWKGRK SLRYFRIWGCP HVLV NPKKLEPR+++C FVGYPKE+RGGLFY PQ+NKVFVSTNATFLEEDH R+H+PRSK+VL 
Subjt:  VSHRVTRISVSKTPFELWKGRKPSLRYFRIWGCPTHVLVTNPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHVRDHKPRSKLVLG

Query:  ESTEGSTRVVDEPGPSTRVAGESSSSRQSSPPHVVGELRRSGRVVIQPNRYLGLTETQVVIPDDGVEDPLSYRQAMNDVDKDEWAKAMDLEMESMYFNQV
        E  + +T   D+P  ST+V  +++ S QS     +   RRSGRVV QPNRYLGL ETQ++IPDDGVEDPL+Y+QAMNDVD+D+W KAM+LEMESMYFN V
Subjt:  ESTEGSTRVVDEPGPSTRVAGESSSSRQSSPPHVVGELRRSGRVVIQPNRYLGLTETQVVIPDDGVEDPLSYRQAMNDVDKDEWAKAMDLEMESMYFNQV

Query:  WELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIFMSQPE
        W LVD P  VKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQ+EGVDYEETFSPVAML SIRILLSIATFY+YEIWQMDVKTAFLNGNL+ESI+M QPE
Subjt:  WELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIFMSQPE

Query:  GFIIQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFLVLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLGEAQY
        GFI Q QEQKVCKL +SIYGLKQASRSWNIRFDTAIKS+GF+QNVDEPCVYKKI  + VAFL+LYVDDILLI NDV YL+DVK+WL  QFQMKDLGEAQY
Subjt:  GFIIQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFLVLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLGEAQY

Query:  VLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRYQSNP
        +LGIQI+R+RKNKTLA+SQA+YIDK+L+RY MQN+K+G LPFRHG+HLSKEQ PKTPQEVEDMR IPY+SAVGSLMYAMLCTR DICY+VGIVSRYQSNP
Subjt:  VLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRYQSNP

Query:  GLDHWTTVKGILKYLRRTRDYMLVFGAKEVSV-----------------------PLNGGAVVWRSIKQGCIANSTMEAKYVAACEAAKEAVWLRKFLTD
        G DHWT VK ILKYLRRTR+YMLV+GAK++ +                        LNGGAVVWRS+KQ CIA+STMEA+YVAACEAAKEAVWLRKFLTD
Subjt:  GLDHWTTVKGILKYLRRTRDYMLVFGAKEVSV-----------------------PLNGGAVVWRSIKQGCIANSTMEAKYVAACEAAKEAVWLRKFLTD

Query:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHK
        LEVVPNM+LPITLYCDNSGAVANSKEPRSHK
Subjt:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHK

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0085.1Show/hide
Query:  VSHRVTRISVSKTPFELWKGRKPSLRYFRIWGCPTHVLVTNPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHVRDHKPRSKLVLG
        + + V   SVS+TPFELW+GRKPSL +FRIWGCP HVLVTNPKKLEPR+R+CQFVGYPKETRGGLF+DPQ+N+VFVSTNATFLEEDH+R+HKPRSKLVL 
Subjt:  VSHRVTRISVSKTPFELWKGRKPSLRYFRIWGCPTHVLVTNPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHVRDHKPRSKLVLG

Query:  ESTEGSTRVVDEPGPSTRVAGESSSSRQSSPPHVVGELRRSGRVVIQPNRYLGLTETQVVIPDDGVEDPLSYRQAMNDVDKDEWAKAMDLEMESMYFNQV
        E+T+ STRVVDE GPS+RV  E+++S QS P   +   RRSGRVV QPNRYLGLTETQVVIPDDGVEDPLSY+QAMNDVDKD+W KAMDLEMESMYFN V
Subjt:  ESTEGSTRVVDEPGPSTRVAGESSSSRQSSPPHVVGELRRSGRVVIQPNRYLGLTETQVVIPDDGVEDPLSYRQAMNDVDKDEWAKAMDLEMESMYFNQV

Query:  WELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIFMSQPE
        WELVD PEGVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYTQREGVDYEETFSPVAML SIRILLSIATFYDYEIWQMDVKTAFLNGNL+ESIFMSQPE
Subjt:  WELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIFMSQPE

Query:  GFIIQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFLVLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLGEAQY
        GFI QGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS+GFDQNVDEPCVYKKINK KVAFLVLYVDDILLI NDVGYL+DVK WLAAQFQMKDLGEAQY
Subjt:  GFIIQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFLVLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLGEAQY

Query:  VLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRYQSNP
        VLGIQIIRDRKNKTLALSQATYIDK+L RYSMQN+K+GLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTR DICYAVGIVSRYQSNP
Subjt:  VLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRYQSNP

Query:  GLDHWTTVKGILKYLRRTRDYMLVFGAKEVSV-----------------------PLNGGAVVWRSIKQGCIANSTMEAKYVAACEAAKEAVWLRKFLTD
        GLDHWT VK +LKYLRRTRDYMLV+GAK++ +                        LNGGAVVWRSIKQGCIA+STMEA+YVAACEAAKEAVWLRKFL D
Subjt:  GLDHWTTVKGILKYLRRTRDYMLVFGAKEVSV-----------------------PLNGGAVVWRSIKQGCIANSTMEAKYVAACEAAKEAVWLRKFLTD

Query:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHK
        LEVVPNMNLPITLYCDNSGAVANSKEPRSHK
Subjt:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHK

KAA0033121.1 gag/pol protein [Cucumis melo var. makuwa]4.2e-30382.41Show/hide
Query:  VSHRVTRISVSKTPFELWKGRKPSLRYFRIWGCPTHVLVTNPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHVRDHKPRSKLVLG
        + + V+  SVS+TPFELW+GRKPSL +F+I GCP HVLVTNPKKLEPR+R+CQFVGYPKETRGGLF+DPQ N+V VSTNATFLEEDH+RDHKP++KLVL 
Subjt:  VSHRVTRISVSKTPFELWKGRKPSLRYFRIWGCPTHVLVTNPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHVRDHKPRSKLVLG

Query:  ESTEGSTRVVDEPGPSTRVAGESSSSRQSSPPHVVGELRRSGRVVIQPNRYLGLTETQVVIPDDGVEDPLSYRQAMNDVDKDEWAKAMDLEMESMYFNQV
        E+ + STRVVDE GPS+RV  E+++S QS P   +   RRSGR+V QPNRYLGLTETQVVIPDDGVEDPLSY QAMNDVDKD+W KAMDLEMESMYFN +
Subjt:  ESTEGSTRVVDEPGPSTRVAGESSSSRQSSPPHVVGELRRSGRVVIQPNRYLGLTETQVVIPDDGVEDPLSYRQAMNDVDKDEWAKAMDLEMESMYFNQV

Query:  WELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIFMSQPE
        WELVD PEGVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYTQREGVDYEETFSPVAML SIRILLSIATFYDYEIW+MDV TAFLNGNL+ESIFMSQPE
Subjt:  WELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIFMSQPE

Query:  GFIIQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFLVLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLGEAQY
        GFI QGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS+GF+QNVDEPCVYKKINK KV FLVLYVDDILLI NDVGYL+DVK WLAAQFQMKDLGEAQY
Subjt:  GFIIQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFLVLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLGEAQY

Query:  VLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRYQSNP
        VLGIQIIRDRKNKTLALSQATYIDKML RYSMQN+K+GLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYASAVGSLMY + CTR +ICYAV IVSRYQSN 
Subjt:  VLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRYQSNP

Query:  GLDHWTTVKGILKYLRRTRDYMLVFGAKEVSV-----------------------PLNGGAVVWRSIKQGCIANSTMEAKYVAACEAAKEAVWLRKFLTD
        GLDHWT VK ILKYLRRTRDYMLV+GAK++ +                        LNGGA+VWRSIKQGCIA+STMEA+YVAACEAAKEAVWLRKFL D
Subjt:  GLDHWTTVKGILKYLRRTRDYMLVFGAKEVSV-----------------------PLNGGAVVWRSIKQGCIANSTMEAKYVAACEAAKEAVWLRKFLTD

Query:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHK
        LEVVPNMNLPITLYCDNSGAVANSKEPRSHK
Subjt:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHK

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0084.15Show/hide
Query:  VSHRVTRISVSKTPFELWKGRKPSLRYFRIWGCPTHVLVTNPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHVRDHKPRSKLVLG
        + + V   SVS+TPFELW+GRKPSL +FRIWGCP HVLVTNPKKLEPR+R+CQFVGYPKETRGGLF+DP++N+VFVSTNATFLEEDH+R+HKPRSKLVL 
Subjt:  VSHRVTRISVSKTPFELWKGRKPSLRYFRIWGCPTHVLVTNPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHVRDHKPRSKLVLG

Query:  ESTEGSTRVVDEPGPSTRVAGESSSSRQSSPPHVVGELRRSGRVVIQPNRYLGLTETQVVIPDDGVEDPLSYRQAMNDVDKDEWAKAMDLEMESMYFNQV
        E+T+ STRVVDE GPS+RV  E+++S QS P   +   RRSGRVV QPNRYLGLTETQVVIPDDGVEDPLSY+QAMNDVDKD+W KAMDLEMESMYFN V
Subjt:  ESTEGSTRVVDEPGPSTRVAGESSSSRQSSPPHVVGELRRSGRVVIQPNRYLGLTETQVVIPDDGVEDPLSYRQAMNDVDKDEWAKAMDLEMESMYFNQV

Query:  WELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIFMSQPE
        WELVD PEGVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYT++EGVDYEETFS VAML SIRILLSIA FYDYEIWQMDVKTAFLNGNL+ESIFMSQPE
Subjt:  WELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIFMSQPE

Query:  GFIIQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFLVLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLGEAQY
        GFI QGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS+GFDQNVDEPCVYKKINK KVAFLVLYVDDILLI NDVGYL+DVK WLAAQFQMKDLGE QY
Subjt:  GFIIQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFLVLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLGEAQY

Query:  VLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRYQSNP
        VLGIQIIRDRKNKTLALSQATYIDK+L RYSMQN+K+GLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTR DICYAVGIVSRYQSNP
Subjt:  VLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRYQSNP

Query:  GLDHWTTVKGILKYLRRTRDYMLVFGAKEVSV-----------------------PLNGGAVVWRSIKQGCIANSTMEAKYVAACEAAKEAVWLRKFLTD
        GLDHWT VK ILKYLRRTRDYMLV+GAK++ +                        LNGGAVVWRSIKQGCIA+STMEA+YVAACEAAKEAVWL+KFL D
Subjt:  GLDHWTTVKGILKYLRRTRDYMLVFGAKEVSV-----------------------PLNGGAVVWRSIKQGCIANSTMEAKYVAACEAAKEAVWLRKFLTD

Query:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHK
        LEVVPNMNLPITLYCDNSGAVANSKEPRSHK
Subjt:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHK

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0085.1Show/hide
Query:  VSHRVTRISVSKTPFELWKGRKPSLRYFRIWGCPTHVLVTNPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHVRDHKPRSKLVLG
        + + V   SVS+TPFELW+GRKPSL +FRIWGCP HVLVTNPKKLEPR+R+CQFVGYPKETRGGLF+DPQ+N+VFVSTNATFLEEDH+R+HKPRSKLVL 
Subjt:  VSHRVTRISVSKTPFELWKGRKPSLRYFRIWGCPTHVLVTNPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHVRDHKPRSKLVLG

Query:  ESTEGSTRVVDEPGPSTRVAGESSSSRQSSPPHVVGELRRSGRVVIQPNRYLGLTETQVVIPDDGVEDPLSYRQAMNDVDKDEWAKAMDLEMESMYFNQV
        E+T+ STRVVDE GPS+RV  E+++S QS P   +   RRSGRVV QPNRYLGLTETQVVIPDDGVEDPLSY+QAMNDVDKD+W KAMDLEMESMYFN V
Subjt:  ESTEGSTRVVDEPGPSTRVAGESSSSRQSSPPHVVGELRRSGRVVIQPNRYLGLTETQVVIPDDGVEDPLSYRQAMNDVDKDEWAKAMDLEMESMYFNQV

Query:  WELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIFMSQPE
        WELVD PEGVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYTQREGVDYEETFSPVAML SIRILLSIATFYDYEIWQMDVKTAFLNGNL+ESIFMSQPE
Subjt:  WELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIFMSQPE

Query:  GFIIQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFLVLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLGEAQY
        GFI QGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS+GFDQNVDEPCVYKKINK KVAFLVLYVDDILLI NDVGYL+DVK WLAAQFQMKDLGEAQY
Subjt:  GFIIQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFLVLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLGEAQY

Query:  VLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRYQSNP
        VLGIQIIRDRKNKTLALSQATYIDK+L RYSMQN+K+GLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTR DICYAVGIVSRYQSNP
Subjt:  VLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRYQSNP

Query:  GLDHWTTVKGILKYLRRTRDYMLVFGAKEVSV-----------------------PLNGGAVVWRSIKQGCIANSTMEAKYVAACEAAKEAVWLRKFLTD
        GLDHWT VK +LKYLRRTRDYMLV+GAK++ +                        LNGGAVVWRSIKQGCIA+STMEA+YVAACEAAKEAVWLRKFL D
Subjt:  GLDHWTTVKGILKYLRRTRDYMLVFGAKEVSV-----------------------PLNGGAVVWRSIKQGCIANSTMEAKYVAACEAAKEAVWLRKFLTD

Query:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHK
        LEVVPNMNLPITLYCDNSGAVANSKEPRSHK
Subjt:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHK

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein0.0e+0084.15Show/hide
Query:  VSHRVTRISVSKTPFELWKGRKPSLRYFRIWGCPTHVLVTNPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHVRDHKPRSKLVLG
        + + V   SVS+TPFELW+GRKPSL +FRIWGCP HVLVTNPKKLEPR+R+CQFVGYPKETRGGLF+DP++N+VFVSTNATFLEEDH+R+HKPRSKLVL 
Subjt:  VSHRVTRISVSKTPFELWKGRKPSLRYFRIWGCPTHVLVTNPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHVRDHKPRSKLVLG

Query:  ESTEGSTRVVDEPGPSTRVAGESSSSRQSSPPHVVGELRRSGRVVIQPNRYLGLTETQVVIPDDGVEDPLSYRQAMNDVDKDEWAKAMDLEMESMYFNQV
        E+T+ STRVVDE GPS+RV  E+++S QS P   +   RRSGRVV QPNRYLGLTETQVVIPDDGVEDPLSY+QAMNDVDKD+W KAMDLEMESMYFN V
Subjt:  ESTEGSTRVVDEPGPSTRVAGESSSSRQSSPPHVVGELRRSGRVVIQPNRYLGLTETQVVIPDDGVEDPLSYRQAMNDVDKDEWAKAMDLEMESMYFNQV

Query:  WELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIFMSQPE
        WELVD PEGVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYT++EGVDYEETFS VAML SIRILLSIA FYDYEIWQMDVKTAFLNGNL+ESIFMSQPE
Subjt:  WELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIFMSQPE

Query:  GFIIQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFLVLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLGEAQY
        GFI QGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS+GFDQNVDEPCVYKKINK KVAFLVLYVDDILLI NDVGYL+DVK WLAAQFQMKDLGE QY
Subjt:  GFIIQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFLVLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLGEAQY

Query:  VLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRYQSNP
        VLGIQIIRDRKNKTLALSQATYIDK+L RYSMQN+K+GLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTR DICYAVGIVSRYQSNP
Subjt:  VLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRYQSNP

Query:  GLDHWTTVKGILKYLRRTRDYMLVFGAKEVSV-----------------------PLNGGAVVWRSIKQGCIANSTMEAKYVAACEAAKEAVWLRKFLTD
        GLDHWT VK ILKYLRRTRDYMLV+GAK++ +                        LNGGAVVWRSIKQGCIA+STMEA+YVAACEAAKEAVWL+KFL D
Subjt:  GLDHWTTVKGILKYLRRTRDYMLVFGAKEVSV-----------------------PLNGGAVVWRSIKQGCIANSTMEAKYVAACEAAKEAVWLRKFLTD

Query:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHK
        LEVVPNMNLPITLYCDNSGAVANSKEPRSHK
Subjt:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHK

A0A5A7TZD0 Gag/pol protein0.0e+0085.1Show/hide
Query:  VSHRVTRISVSKTPFELWKGRKPSLRYFRIWGCPTHVLVTNPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHVRDHKPRSKLVLG
        + + V   SVS+TPFELW+GRKPSL +FRIWGCP HVLVTNPKKLEPR+R+CQFVGYPKETRGGLF+DPQ+N+VFVSTNATFLEEDH+R+HKPRSKLVL 
Subjt:  VSHRVTRISVSKTPFELWKGRKPSLRYFRIWGCPTHVLVTNPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHVRDHKPRSKLVLG

Query:  ESTEGSTRVVDEPGPSTRVAGESSSSRQSSPPHVVGELRRSGRVVIQPNRYLGLTETQVVIPDDGVEDPLSYRQAMNDVDKDEWAKAMDLEMESMYFNQV
        E+T+ STRVVDE GPS+RV  E+++S QS P   +   RRSGRVV QPNRYLGLTETQVVIPDDGVEDPLSY+QAMNDVDKD+W KAMDLEMESMYFN V
Subjt:  ESTEGSTRVVDEPGPSTRVAGESSSSRQSSPPHVVGELRRSGRVVIQPNRYLGLTETQVVIPDDGVEDPLSYRQAMNDVDKDEWAKAMDLEMESMYFNQV

Query:  WELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIFMSQPE
        WELVD PEGVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYTQREGVDYEETFSPVAML SIRILLSIATFYDYEIWQMDVKTAFLNGNL+ESIFMSQPE
Subjt:  WELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIFMSQPE

Query:  GFIIQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFLVLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLGEAQY
        GFI QGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS+GFDQNVDEPCVYKKINK KVAFLVLYVDDILLI NDVGYL+DVK WLAAQFQMKDLGEAQY
Subjt:  GFIIQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFLVLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLGEAQY

Query:  VLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRYQSNP
        VLGIQIIRDRKNKTLALSQATYIDK+L RYSMQN+K+GLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTR DICYAVGIVSRYQSNP
Subjt:  VLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRYQSNP

Query:  GLDHWTTVKGILKYLRRTRDYMLVFGAKEVSV-----------------------PLNGGAVVWRSIKQGCIANSTMEAKYVAACEAAKEAVWLRKFLTD
        GLDHWT VK +LKYLRRTRDYMLV+GAK++ +                        LNGGAVVWRSIKQGCIA+STMEA+YVAACEAAKEAVWLRKFL D
Subjt:  GLDHWTTVKGILKYLRRTRDYMLVFGAKEVSV-----------------------PLNGGAVVWRSIKQGCIANSTMEAKYVAACEAAKEAVWLRKFLTD

Query:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHK
        LEVVPNMNLPITLYCDNSGAVANSKEPRSHK
Subjt:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHK

A0A5A7UYE8 Gag/pol protein0.0e+0085.1Show/hide
Query:  VSHRVTRISVSKTPFELWKGRKPSLRYFRIWGCPTHVLVTNPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHVRDHKPRSKLVLG
        + + V   SVS+TPFELW+GRKPSL +FRIWGCP HVLVTNPKKLEPR+R+CQFVGYPKETRGGLF+DPQ+N+VFVSTNATFLEEDH+R+HKPRSKLVL 
Subjt:  VSHRVTRISVSKTPFELWKGRKPSLRYFRIWGCPTHVLVTNPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHVRDHKPRSKLVLG

Query:  ESTEGSTRVVDEPGPSTRVAGESSSSRQSSPPHVVGELRRSGRVVIQPNRYLGLTETQVVIPDDGVEDPLSYRQAMNDVDKDEWAKAMDLEMESMYFNQV
        E+T+ STRVVDE GPS+RV  E+++S QS P   +   RRSGRVV QPNRYLGLTETQVVIPDDGVEDPLSY+QAMNDVDKD+W KAMDLEMESMYFN V
Subjt:  ESTEGSTRVVDEPGPSTRVAGESSSSRQSSPPHVVGELRRSGRVVIQPNRYLGLTETQVVIPDDGVEDPLSYRQAMNDVDKDEWAKAMDLEMESMYFNQV

Query:  WELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIFMSQPE
        WELVD PEGVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYTQREGVDYEETFSPVAML SIRILLSIATFYDYEIWQMDVKTAFLNGNL+ESIFMSQPE
Subjt:  WELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIFMSQPE

Query:  GFIIQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFLVLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLGEAQY
        GFI QGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS+GFDQNVDEPCVYKKINK KVAFLVLYVDDILLI NDVGYL+DVK WLAAQFQMKDLGEAQY
Subjt:  GFIIQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFLVLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLGEAQY

Query:  VLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRYQSNP
        VLGIQIIRDRKNKTLALSQATYIDK+L RYSMQN+K+GLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTR DICYAVGIVSRYQSNP
Subjt:  VLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRYQSNP

Query:  GLDHWTTVKGILKYLRRTRDYMLVFGAKEVSV-----------------------PLNGGAVVWRSIKQGCIANSTMEAKYVAACEAAKEAVWLRKFLTD
        GLDHWT VK +LKYLRRTRDYMLV+GAK++ +                        LNGGAVVWRSIKQGCIA+STMEA+YVAACEAAKEAVWLRKFL D
Subjt:  GLDHWTTVKGILKYLRRTRDYMLVFGAKEVSV-----------------------PLNGGAVVWRSIKQGCIANSTMEAKYVAACEAAKEAVWLRKFLTD

Query:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHK
        LEVVPNMNLPITLYCDNSGAVANSKEPRSHK
Subjt:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHK

A0A5D3CZY3 Gag/pol protein2.0e-30382.41Show/hide
Query:  VSHRVTRISVSKTPFELWKGRKPSLRYFRIWGCPTHVLVTNPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHVRDHKPRSKLVLG
        + + V+  SVS+TPFELW+GRKPSL +F+I GCP HVLVTNPKKLEPR+R+CQFVGYPKETRGGLF+DPQ N+V VSTNATFLEEDH+RDHKP++KLVL 
Subjt:  VSHRVTRISVSKTPFELWKGRKPSLRYFRIWGCPTHVLVTNPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHVRDHKPRSKLVLG

Query:  ESTEGSTRVVDEPGPSTRVAGESSSSRQSSPPHVVGELRRSGRVVIQPNRYLGLTETQVVIPDDGVEDPLSYRQAMNDVDKDEWAKAMDLEMESMYFNQV
        E+ + STRVVDE GPS+RV  E+++S QS P   +   RRSGR+V QPNRYLGLTETQVVIPDDGVEDPLSY QAMNDVDKD+W KAMDLEMESMYFN +
Subjt:  ESTEGSTRVVDEPGPSTRVAGESSSSRQSSPPHVVGELRRSGRVVIQPNRYLGLTETQVVIPDDGVEDPLSYRQAMNDVDKDEWAKAMDLEMESMYFNQV

Query:  WELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIFMSQPE
        WELVD PEGVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYTQREGVDYEETFSPVAML SIRILLSIATFYDYEIW+MDV TAFLNGNL+ESIFMSQPE
Subjt:  WELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIFMSQPE

Query:  GFIIQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFLVLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLGEAQY
        GFI QGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKS+GF+QNVDEPCVYKKINK KV FLVLYVDDILLI NDVGYL+DVK WLAAQFQMKDLGEAQY
Subjt:  GFIIQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFLVLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLGEAQY

Query:  VLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRYQSNP
        VLGIQIIRDRKNKTLALSQATYIDKML RYSMQN+K+GLLPFRHGVHLSKEQ PKTPQEVEDMRRIPYASAVGSLMY + CTR +ICYAV IVSRYQSN 
Subjt:  VLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRYQSNP

Query:  GLDHWTTVKGILKYLRRTRDYMLVFGAKEVSV-----------------------PLNGGAVVWRSIKQGCIANSTMEAKYVAACEAAKEAVWLRKFLTD
        GLDHWT VK ILKYLRRTRDYMLV+GAK++ +                        LNGGA+VWRSIKQGCIA+STMEA+YVAACEAAKEAVWLRKFL D
Subjt:  GLDHWTTVKGILKYLRRTRDYMLVFGAKEVSV-----------------------PLNGGAVVWRSIKQGCIANSTMEAKYVAACEAAKEAVWLRKFLTD

Query:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHK
        LEVVPNMNLPITLYCDNSGAVANSKEPRSHK
Subjt:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHK

E2GK51 Gag/pol protein (Fragment)4.2e-28576.7Show/hide
Query:  VSHRVTRISVSKTPFELWKGRKPSLRYFRIWGCPTHVLVTNPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHVRDHKPRSKLVLG
        + + V   SV +TP+ELWKGRK SLRYFRIWGCP HVLV NPKKLEPR+++C FVGYPKE+RGGLFY PQ+NKVFVSTNATFLEEDH R+H+PRSK+VL 
Subjt:  VSHRVTRISVSKTPFELWKGRKPSLRYFRIWGCPTHVLVTNPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHVRDHKPRSKLVLG

Query:  ESTEGSTRVVDEPGPSTRVAGESSSSRQSSPPHVVGELRRSGRVVIQPNRYLGLTETQVVIPDDGVEDPLSYRQAMNDVDKDEWAKAMDLEMESMYFNQV
        E  + +T   D+P  ST+V  +++ S QS     +   RRSGRVV QPNRYLGL ETQ++IPDDGVEDPL+Y+QAMNDVD+D+W KAM+LEMESMYFN V
Subjt:  ESTEGSTRVVDEPGPSTRVAGESSSSRQSSPPHVVGELRRSGRVVIQPNRYLGLTETQVVIPDDGVEDPLSYRQAMNDVDKDEWAKAMDLEMESMYFNQV

Query:  WELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIFMSQPE
        W LVD P  VKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQ+EGVDYEETFSPVAML SIRILLSIATFY+YEIWQMDVKTAFLNGNL+ESI+M QPE
Subjt:  WELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIFMSQPE

Query:  GFIIQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFLVLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLGEAQY
        GFI Q QEQKVCKL +SIYGLKQASRSWNIRFDTAIKS+GF+QNVDEPCVYKKI  + VAFL+LYVDDILLI NDV YL+DVK+WL  QFQMKDLGEAQY
Subjt:  GFIIQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFLVLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLGEAQY

Query:  VLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRYQSNP
        +LGIQI+R+RKNKTLA+SQA+YIDK+L+RY MQN+K+G LPFRHG+HLSKEQ PKTPQEVEDMR IPY+SAVGSLMYAMLCTR DICY+VGIVSRYQSNP
Subjt:  VLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRYQSNP

Query:  GLDHWTTVKGILKYLRRTRDYMLVFGAKEVSV-----------------------PLNGGAVVWRSIKQGCIANSTMEAKYVAACEAAKEAVWLRKFLTD
        G DHWT VK ILKYLRRTR+YMLV+GAK++ +                        LNGGAVVWRS+KQ CIA+STMEA+YVAACEAAKEAVWLRKFLTD
Subjt:  GLDHWTTVKGILKYLRRTRDYMLVFGAKEVSV-----------------------PLNGGAVVWRSIKQGCIANSTMEAKYVAACEAAKEAVWLRKFLTD

Query:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHK
        LEVVPNM+LPITLYCDNSGAVANSKEPRSHK
Subjt:  LEVVPNMNLPITLYCDNSGAVANSKEPRSHK

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.6e-7928.03Show/hide
Query:  ISVSKTPFELWKGRKPSLRYFRIWGCPTHVLVTNPK-KLEPRTRICQFVGYPKETRGGLFYD--------------------------------------
        +  SKTP+E+W  +KP L++ R++G   +V + N + K + ++    FVGY  E  G   +D                                      
Subjt:  ISVSKTPFELWKGRKPSLRYFRIWGCPTHVLVTNPK-KLEPRTRICQFVGYPKETRGGLFYD--------------------------------------

Query:  -----PQDNKVFVST----------NATFLEE-------------------------------DHVRDHKPRSKLVLGESTEGSTRVVDEPGPSTRVAGE
             P D++  + T          N  FL++                                 ++D K  +K  L ES +   R  D+    ++ +G 
Subjt:  -----PQDNKVFVST----------NATFLEE-------------------------------DHVRDHKPRSKLVLGESTEGSTRVVDEPGPSTRVAGE

Query:  SSSSRQSSPPHVVGEL---------------RRSGRVVIQPNRYLGLTE---TQVVIPDDGV--EDPLSYRQAMNDVDKDEWAKAMDLEMESMYFNQVWE
         + SR+S     + E+               RRS R+  +P       +    +VV+    +  + P S+ +     DK  W +A++ E+ +   N  W 
Subjt:  SSSSRQSSPPHVVGEL---------------RRSGRVVIQPNRYLGLTE---TQVVIPDDGV--EDPLSYRQAMNDVDKDEWAKAMDLEMESMYFNQVWE

Query:  LVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIFMSQPEGF
        +   PE    +  +W++  K +  G    +KARLVA+G+TQ+  +DYEETF+PVA ++S R +LS+   Y+ ++ QMDVKTAFLNG L E I+M  P+G 
Subjt:  LVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIFMSQPEGF

Query:  IIQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVY--KKINKNKVAFLVLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLGEAQY
         I      VCKLN++IYGLKQA+R W   F+ A+K   F  +  + C+Y   K N N+  +++LYVDD+++   D+  +++ K +L  +F+M DL E ++
Subjt:  IIQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVY--KKINKNKVAFLVLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLGEAQY

Query:  VLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRYQSNP
         +GI+I  + +   + LSQ+ Y+ K+L++++M+N      P    ++     S       ++    P  S +G LMY MLCTR D+  AV I+SRY S  
Subjt:  VLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRYQSNP

Query:  GLDHWTTVKGILKYLRRTRDYMLVF------------------GAKEVSVPLNGG---------AVVWRSIKQGCIANSTMEAKYVAACEAAKEAVWLRK
          + W  +K +L+YL+ T D  L+F                     E+      G          + W + +Q  +A S+ EA+Y+A  EA +EA+WL+ 
Subjt:  GLDHWTTVKGILKYLRRTRDYMLVF------------------GAKEVSVPLNGG---------AVVWRSIKQGCIANSTMEAKYVAACEAAKEAVWLRK

Query:  FLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHK
         LT + +   +  PI +Y DN G ++ +  P  HK
Subjt:  FLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.4e-12038.45Show/hide
Query:  PFELWKGRKPSLRYFRIWGCP--THVLVTNPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHVRDHKPRSKLVLG-----------
        P  +W  ++ S  + +++GC    HV      KL+ ++  C F+GY  E  G   +DP   KV  S +  F  E  VR     S+ V             
Subjt:  PFELWKGRKPSLRYFRIWGCP--THVLVTNPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHVRDHKPRSKLVLG-----------

Query:  -----ESTEGSTRVVDEPGPSTRVAGESSSSRQSSPPHV----VGE-----LRRSGRVVIQPNRYLGLTETQVVIPDDGVEDPLSYRQAMNDVDKDEWAK
              S E +T  V E G       E           V     GE     LRRS R  ++  RY   +   V+I DD   +P S ++ ++  +K++  K
Subjt:  -----ESTEGSTRVVDEPGPSTRVAGESSSSRQSSPPHV----VGE-----LRRSGRVVIQPNRYLGLTETQVVIPDDGVEDPLSYRQAMNDVDKDEWAK

Query:  AMDLEMESMYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIATFYDYEIWQMDVKTAFL
        AM  EMES+  N  ++LV+ P+G +P+ CKW++K K+D   K+  +KARLV KG+ Q++G+D++E FSPV  + SIR +LS+A   D E+ Q+DVKTAFL
Subjt:  AMDLEMESMYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIATFYDYEIWQMDVKTAFL

Query:  NGNLDESIFMSQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVY-KKINKNKVAFLVLYVDDILLIENDVGYLSDVKEW
        +G+L+E I+M QPEGF + G++  VCKLN+S+YGLKQA R W ++FD+ +KS  + +   +PCVY K+ ++N    L+LYVDD+L++  D G ++ +K  
Subjt:  NGNLDESIFMSQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVY-KKINKNKVAFLVLYVDDILLIENDVGYLSDVKEW

Query:  LAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRSD
        L+  F MKDLG AQ +LG++I+R+R ++ L LSQ  YI+++L R++M+N K    P    + LSK+  P T +E  +M ++PY+SAVGSLMYAM+CTR D
Subjt:  LAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRSD

Query:  ICYAVGIVSRYQSNPGLDHWTTVKGILKYLRRTRDYMLVFGAKEVSV-----------------------PLNGGAVVWRSIKQGCIANSTMEAKYVAAC
        I +AVG+VSR+  NPG +HW  VK IL+YLR T    L FG  +  +                         +GGA+ W+S  Q C+A ST EA+Y+AA 
Subjt:  ICYAVGIVSRYQSNPGLDHWTTVKGILKYLRRTRDYMLVFGAKEVSV-----------------------PLNGGAVVWRSIKQGCIANSTMEAKYVAAC

Query:  EAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSH
        E  KE +WL++FL +L +         +YCD+  A+  SK    H
Subjt:  EAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSH

P25600 Putative transposon Ty5-1 protein YCL074W3.4e-2929.62Show/hide
Query:  MDVKTAFLNGNLDESIFMSQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFLVLYVDDILLIENDVGY
        MDV TAFLN  +DE I++ QP GF+ +     V +L   +YGLKQA   WN   +  +K  GF ++  E  +Y +   +   ++ +YVDD+L+       
Subjt:  MDVKTAFLNGNLDESIFMSQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFLVLYVDDILLIENDVGY

Query:  LSDVKEWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYA
           VK+ L   + MKDLG+    LG+  I    N  + LS   YI K  +   +   K    P  +   L +  SP     ++D+   PY S VG L++ 
Subjt:  LSDVKEWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYA

Query:  MLCTRSDICYAVGIVSRYQSNPGLDHWTTVKGILKYLRRTRDYMLVF-GAKEVSVP-----------------------LNGGAVVWRSIK-QGCIANST
            R DI Y V ++SR+   P   H  + + +L+YL  TR   L +    ++++                        L G  V W S K +G I   +
Subjt:  MLCTRSDICYAVGIVSRYQSNPGLDHWTTVKGILKYLRRTRDYMLVF-GAKEVSVP-----------------------LNGGAVVWRSIK-QGCIANST

Query:  MEAKYVAACEAAKE
         EA+Y+ A E   E
Subjt:  MEAKYVAACEAAKE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.8e-6832.82Show/hide
Query:  VVDEPGPSTRVAGESSSSRQSSPPHVVGELRRSGRVVIQPNRYLGLTETQVVIPDDGVEDPLSYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDPPE
        ++  P P  ++   ++ +  ++  H +G   ++G  +I+PN    L  +          +P +  QA+ D   + W  AM  E+ +   N  W+LV PP 
Subjt:  VVDEPGPSTRVAGESSSSRQSSPPHVVGELRRSGRVVIQPNRYLGLTETQVVIPDDGVEDPLSYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDPPE

Query:  G-VKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIFMSQPEGFIIQGQ
          V  +GC+WI+ +K ++ G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A    + I Q+DV  AFL G L + ++MSQP GFI + +
Subjt:  G-VKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIFMSQPEGFIIQGQ

Query:  EQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFLVLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLGEAQYVLGIQII
           VCKL +++YGLKQA R+W +     + + GF  +V +  ++       + ++++YVDDIL+  ND   L +  + L+ +F +KD  E  Y LGI+  
Subjt:  EQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFLVLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLGEAQYVLGIQII

Query:  RDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRYQSNPGLDHWTT
          R    L LSQ  YI  +LAR +M   K    P      LS     K     E      Y   VGSL Y +  TR DI YAV  +S++   P  +H   
Subjt:  RDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRYQSNPGLDHWTT

Query:  VKGILKYLRRTRDYMLVF-----------------GAKEVSVPLNGGAVV-------WRSIKQGCIANSTMEAKYVAACEAAKEAVWLRKFLTDLEVVPN
        +K IL+YL  T ++ +                   G K+  V  NG  V        W S KQ  +  S+ EA+Y +    + E  W+   LT+L +   
Subjt:  VKGILKYLRRTRDYMLVF-----------------GAKEVSVPLNGGAVV-------WRSIKQGCIANSTMEAKYVAACEAAKEAVWLRKFLTDLEVVPN

Query:  MNLPITLYCDNSGAVANSKEPRSH
        +  P  +YCDN GA      P  H
Subjt:  MNLPITLYCDNSGAVANSKEPRSH

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.0e-7033.27Show/hide
Query:  PGPSTRVAGESS-SSRQSSPPHVVGELRRSGRVVIQPNRYLGL-TETQVVIPDDGVEDP---LSY----------RQAMNDVDKDEWAKAMDLEMESMYF
        P PST ++  +S SS  +S P +   L      +IQ N    + T +      DG+  P    SY          R A+  +  D W +AM  E+ +   
Subjt:  PGPSTRVAGESS-SSRQSSPPHVVGELRRSGRVVIQPNRYLGL-TETQVVIPDDGVEDP---LSY----------RQAMNDVDKDEWAKAMDLEMESMYF

Query:  NQVWELV-DPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIFM
        N  W+LV  PP  V  +GC+WI+ +K ++ G +  +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A    + I Q+DV  AFL G L + ++M
Subjt:  NQVWELV-DPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIFM

Query:  SQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFLVLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLG
        SQP GF+ + +   VC+L ++IYGLKQA R+W +   T + + GF  ++ +  ++       + ++++YVDDIL+  ND   L    + L+ +F +K+  
Subjt:  SQPEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFLVLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLG

Query:  EAQYVLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRY
        +  Y LGI+    R  + L LSQ  Y   +LAR +M   K    P      L+     K P   E      Y   VGSL Y +  TR D+ YAV  +S+Y
Subjt:  EAQYVLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRY

Query:  QSNPGLDHWTTVKGILKYLRRTRDYMLVF-----------------GAKEVSVPLNGGAVV-------WRSIKQGCIANSTMEAKYVAACEAAKEAVWLR
           P  DHW  +K +L+YL  T D+ +                   G  +  V  NG  V        W S KQ  +  S+ EA+Y +    + E  W+ 
Subjt:  QSNPGLDHWTTVKGILKYLRRTRDYMLVF-----------------GAKEVSVPLNGGAVV-------WRSIKQGCIANSTMEAKYVAACEAAKEAVWLR

Query:  KFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSH
          LT+L +   ++ P  +YCDN GA      P  H
Subjt:  KFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSH

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.5e-6933.7Show/hide
Query:  EDPLSYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILL
        ++P +Y +A   +    W  AMD E+ +M     WE+   P   KPIGCKW+YK K ++ G ++ +KARLVAKGYTQ+EG+D+ ETFSPV  L S++++L
Subjt:  EDPLSYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILL

Query:  SIATFYDYEIWQMDVKTAFLNGNLDESIFMSQPEGFII-QGQE---QKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFL
        +I+  Y++ + Q+D+  AFLNG+LDE I+M  P G+   QG       VC L +SIYGLKQASR W ++F   +  FGF Q+  +   + KI       +
Subjt:  SIATFYDYEIWQMDVKTAFLNGNLDESIFMSQPEGFII-QGQE---QKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFL

Query:  VLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVED
        ++YVDDI++  N+   + ++K  L + F+++DLG  +Y LG++I R      + + Q  Y   +L    +   K   +P    V  S         +  D
Subjt:  VLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVED

Query:  MRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRYQSNPGLDHWTTVKGILKYLRRTRDYMLVFGA-----------------KEVSVPLNG-------GA
         +   Y   +G LMY  + TR DI +AV  +S++   P L H   V  IL Y++ T    L + +                 K+     NG         
Subjt:  MRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRYQSNPGLDHWTTVKGILKYLRRTRDYMLVFGA-----------------KEVSVPLNG-------GA

Query:  VVWRSIKQGCIANSTMEAKYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAV
        + W+S KQ  ++ S+ EA+Y A   A  E +WL +F  +L++   ++ P  L+CDN+ A+
Subjt:  VVWRSIKQGCIANSTMEAKYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAV

ATMG00810.1 DNA/RNA polymerases superfamily protein7.7e-1329.06Show/hide
Query:  FLVLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEV
        +L+LYVDDILL  +    L+ +   L++ F MKDLG   Y LGIQI        L LSQ  Y +++L    M + K    P    + L+   S     + 
Subjt:  FLVLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLGEAQYVLGIQIIRDRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEV

Query:  EDMRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRYQSNPGLDHWTTVKGILKYLRRTRDY-MLVFGAKEVSVP-----------------------LNG
         D R     S VG+L Y  L TR DI YAV IV +    P L  +  +K +L+Y++ T  + + +    +++V                        L  
Subjt:  EDMRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRYQSNPGLDHWTTVKGILKYLRRTRDY-MLVFGAKEVSVP-----------------------LNG

Query:  GAVVWRSIKQGCIANSTMEAKYVAACEAAKEAVW
          + W + +Q  ++ S+ E +Y A    A E  W
Subjt:  GAVVWRSIKQGCIANSTMEAKYVAACEAAKEAVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.4e-1643.02Show/hide
Query:  WAKAMDLEMESMYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIA
        W +AM  E++++  N+ W LV PP     +GCKW++K K  + G +   KARLVAKG+ Q EG+ + ET+SPV    +IR +L++A
Subjt:  WAKAMDLEMESMYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGCAGAAGAAGAAGGTCTCAGTTGCAAATTTGGCAGGTTGGGTGGTCGGGTATCTGAACGCGTTCTGTGATTCGGGGAGGAGGGAGATGGAGTTGTTAGGGGGGAG
ACCTAGTCGGGTTGTGTGCAAGTGGGCGTCGCTAGAGGTTGATGGGTTCAAGGTGAGCTTTGCAGCTACGATTTTCCACGAGAATATCAGAGATGCTGACTTTGTAGAAG
GGTTGGCGGCTGCTGTCGGTCTGAGGCTTGCTGTGGAGACGGCTACTCGACCTTTTGTACTGGAGACAGACTCTATGCGGGTGTTCAAGCTGCTGCAGCGTGATAGGGAG
GAGGTGTCGGAGTTGGGGATGTTGGTGGAGGATGCAGTGAGAGGCATTCCAACGGGGTGGTTTGTTGGTGGCAGCTTTACGTTTAGGGAAGGCAACTGCGTTGCCCATCG
TTTGGCAAGGTTGGCGATGGATGACAGGCGTGATCGAGCTAGTGGATTGGGGCTAAGTCTGGATGCACCTGAGCTTCAGGCTAATTTATTAATTTATACATGTTATGCCG
CTTCTGGTTTTGTTCTGCTTGGTGGGGTTGAGGCCTCTTCCTTAGGTAGTCTCGAAGGCTCTGGTCAGTTTTATGAGTTCTTAAGATCCGTTGTCACTCCGATAGGTGCC
CTAGGTTCTCTTTTGTGTCATTTAGCTTCTGGGCGAGACATTAGTGCCGGTGTAGGGGGAAGATGGGTGGAGGAGTTTATAAGCCTTTCTTGGGGCTCACCACGAGCTTG
TGTACTGGGTAATTTCATTCATCCCCAAATTGATTCTCCTCGCATTTCAGATTTGCTGCGAAATTGTCAGATCAGATTGAATTCACCAACTGCTTGTGAAATAATCACGA
ACTACTGCGAACACCACCACTATGGCTACACCAGTATAACTCTGAGACTTCTAGAGGCAGGAGACTGGTGGGAGTCTAGAGGGAGCCTTCAGGGGAATTCTTTGAGACCC
CCATTCGGTCTTGCCCCTGATATGGATACCCCCACTCGCATGTATCCTACATGGATGCTTTGGATCGTTGCATCTATATCGAATACAAGGTGGGTCGTATCACATCGTGT
CACCAGGATAAGTGTTTCTAAAACTCCTTTTGAGCTATGGAAGGGGCGTAAACCTAGTTTGCGTTACTTTCGTATCTGGGGTTGTCCTACACATGTGTTAGTGACAAATC
CTAAGAAACTGGAACCTCGTACGAGAATATGCCAATTTGTTGGGTACCCGAAAGAAACGAGAGGTGGTCTTTTCTACGACCCTCAAGACAATAAAGTGTTTGTATCGACA
AATGCAACTTTCTTGGAGGAAGACCACGTTCGAGATCATAAACCACGAAGTAAGTTAGTATTAGGAGAATCTACAGAAGGATCAACAAGGGTTGTTGATGAACCTGGTCC
TTCAACAAGAGTTGCTGGAGAATCAAGTTCTTCTCGTCAATCTAGTCCTCCTCACGTAGTGGGAGAGCTTCGACGCAGTGGGAGGGTTGTGATACAACCTAACCGCTACT
TGGGTTTAACAGAAACACAAGTAGTCATACCTGATGACGGCGTAGAAGATCCATTGTCTTATCGTCAGGCAATGAATGACGTAGATAAGGACGAGTGGGCCAAAGCCATG
GACCTTGAGATGGAGTCTATGTACTTCAATCAAGTTTGGGAACTTGTAGATCCACCTGAAGGGGTCAAACCCATAGGGTGTAAATGGATCTATAAGAGGAAAAGAGATGC
CGCTGGGAAAGTACAAACTTTCAAAGCTAGACTTGTAGCAAAGGGTTATACCCAACGAGAAGGGGTGGACTATGAAGAAACCTTTTCTCCAGTTGCTATGCTAAATTCCA
TAAGAATTCTCTTATCCATTGCCACATTTTATGATTATGAAATTTGGCAAATGGATGTCAAGACAGCTTTTTTGAATGGCAACCTTGACGAGAGTATTTTTATGTCTCAA
CCCGAAGGGTTCATAATCCAAGGTCAGGAGCAAAAGGTTTGTAAACTGAATCGATCCATTTATGGGTTGAAACAGGCCTCCCGATCTTGGAATATTAGATTTGATACTGC
GATAAAATCTTTTGGCTTTGACCAGAACGTTGATGAGCCTTGTGTATACAAGAAGATCAACAAGAATAAAGTAGCTTTCCTCGTACTTTATGTTGACGACATCCTACTCA
TTGAGAATGATGTAGGGTATCTGTCTGACGTAAAAGAATGGCTAGCAGCTCAATTCCAAATGAAAGATTTGGGCGAGGCCCAGTATGTTCTTGGCATCCAGATTATTCGG
GATAGAAAGAACAAAACGCTAGCTCTGTCTCAAGCAACGTATATCGACAAGATGTTGGCTCGATATTCGATGCAGAACACCAAGAGGGGCTTATTGCCTTTCAGGCATGG
GGTTCACCTGTCTAAGGAACAGTCTCCTAAGACACCTCAAGAGGTTGAGGATATGAGACGGATTCCCTACGCCTCTGCAGTAGGTAGCTTAATGTATGCCATGCTCTGCA
CGAGGTCTGACATCTGTTATGCTGTAGGGATTGTCAGCAGATATCAGTCAAATCCAGGGTTAGACCACTGGACCACCGTTAAAGGAATCCTCAAGTATCTTAGGAGAACG
AGGGACTACATGCTGGTGTTTGGGGCTAAAGAGGTCAGTGTTCCCCTTAACGGGGGAGCTGTAGTTTGGAGAAGTATAAAGCAAGGATGCATAGCAAACTCCACGATGGA
GGCTAAGTATGTAGCTGCTTGTGAAGCAGCTAAGGAAGCTGTTTGGCTAAGGAAGTTCTTGACAGATTTGGAAGTTGTTCCAAATATGAACTTGCCCATTACGTTATACT
GTGACAACAGTGGGGCTGTAGCCAATTCAAAGGAACCTCGCAGCCACAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGGCAGAAGAAGAAGGTCTCAGTTGCAAATTTGGCAGGTTGGGTGGTCGGGTATCTGAACGCGTTCTGTGATTCGGGGAGGAGGGAGATGGAGTTGTTAGGGGGGAG
ACCTAGTCGGGTTGTGTGCAAGTGGGCGTCGCTAGAGGTTGATGGGTTCAAGGTGAGCTTTGCAGCTACGATTTTCCACGAGAATATCAGAGATGCTGACTTTGTAGAAG
GGTTGGCGGCTGCTGTCGGTCTGAGGCTTGCTGTGGAGACGGCTACTCGACCTTTTGTACTGGAGACAGACTCTATGCGGGTGTTCAAGCTGCTGCAGCGTGATAGGGAG
GAGGTGTCGGAGTTGGGGATGTTGGTGGAGGATGCAGTGAGAGGCATTCCAACGGGGTGGTTTGTTGGTGGCAGCTTTACGTTTAGGGAAGGCAACTGCGTTGCCCATCG
TTTGGCAAGGTTGGCGATGGATGACAGGCGTGATCGAGCTAGTGGATTGGGGCTAAGTCTGGATGCACCTGAGCTTCAGGCTAATTTATTAATTTATACATGTTATGCCG
CTTCTGGTTTTGTTCTGCTTGGTGGGGTTGAGGCCTCTTCCTTAGGTAGTCTCGAAGGCTCTGGTCAGTTTTATGAGTTCTTAAGATCCGTTGTCACTCCGATAGGTGCC
CTAGGTTCTCTTTTGTGTCATTTAGCTTCTGGGCGAGACATTAGTGCCGGTGTAGGGGGAAGATGGGTGGAGGAGTTTATAAGCCTTTCTTGGGGCTCACCACGAGCTTG
TGTACTGGGTAATTTCATTCATCCCCAAATTGATTCTCCTCGCATTTCAGATTTGCTGCGAAATTGTCAGATCAGATTGAATTCACCAACTGCTTGTGAAATAATCACGA
ACTACTGCGAACACCACCACTATGGCTACACCAGTATAACTCTGAGACTTCTAGAGGCAGGAGACTGGTGGGAGTCTAGAGGGAGCCTTCAGGGGAATTCTTTGAGACCC
CCATTCGGTCTTGCCCCTGATATGGATACCCCCACTCGCATGTATCCTACATGGATGCTTTGGATCGTTGCATCTATATCGAATACAAGGTGGGTCGTATCACATCGTGT
CACCAGGATAAGTGTTTCTAAAACTCCTTTTGAGCTATGGAAGGGGCGTAAACCTAGTTTGCGTTACTTTCGTATCTGGGGTTGTCCTACACATGTGTTAGTGACAAATC
CTAAGAAACTGGAACCTCGTACGAGAATATGCCAATTTGTTGGGTACCCGAAAGAAACGAGAGGTGGTCTTTTCTACGACCCTCAAGACAATAAAGTGTTTGTATCGACA
AATGCAACTTTCTTGGAGGAAGACCACGTTCGAGATCATAAACCACGAAGTAAGTTAGTATTAGGAGAATCTACAGAAGGATCAACAAGGGTTGTTGATGAACCTGGTCC
TTCAACAAGAGTTGCTGGAGAATCAAGTTCTTCTCGTCAATCTAGTCCTCCTCACGTAGTGGGAGAGCTTCGACGCAGTGGGAGGGTTGTGATACAACCTAACCGCTACT
TGGGTTTAACAGAAACACAAGTAGTCATACCTGATGACGGCGTAGAAGATCCATTGTCTTATCGTCAGGCAATGAATGACGTAGATAAGGACGAGTGGGCCAAAGCCATG
GACCTTGAGATGGAGTCTATGTACTTCAATCAAGTTTGGGAACTTGTAGATCCACCTGAAGGGGTCAAACCCATAGGGTGTAAATGGATCTATAAGAGGAAAAGAGATGC
CGCTGGGAAAGTACAAACTTTCAAAGCTAGACTTGTAGCAAAGGGTTATACCCAACGAGAAGGGGTGGACTATGAAGAAACCTTTTCTCCAGTTGCTATGCTAAATTCCA
TAAGAATTCTCTTATCCATTGCCACATTTTATGATTATGAAATTTGGCAAATGGATGTCAAGACAGCTTTTTTGAATGGCAACCTTGACGAGAGTATTTTTATGTCTCAA
CCCGAAGGGTTCATAATCCAAGGTCAGGAGCAAAAGGTTTGTAAACTGAATCGATCCATTTATGGGTTGAAACAGGCCTCCCGATCTTGGAATATTAGATTTGATACTGC
GATAAAATCTTTTGGCTTTGACCAGAACGTTGATGAGCCTTGTGTATACAAGAAGATCAACAAGAATAAAGTAGCTTTCCTCGTACTTTATGTTGACGACATCCTACTCA
TTGAGAATGATGTAGGGTATCTGTCTGACGTAAAAGAATGGCTAGCAGCTCAATTCCAAATGAAAGATTTGGGCGAGGCCCAGTATGTTCTTGGCATCCAGATTATTCGG
GATAGAAAGAACAAAACGCTAGCTCTGTCTCAAGCAACGTATATCGACAAGATGTTGGCTCGATATTCGATGCAGAACACCAAGAGGGGCTTATTGCCTTTCAGGCATGG
GGTTCACCTGTCTAAGGAACAGTCTCCTAAGACACCTCAAGAGGTTGAGGATATGAGACGGATTCCCTACGCCTCTGCAGTAGGTAGCTTAATGTATGCCATGCTCTGCA
CGAGGTCTGACATCTGTTATGCTGTAGGGATTGTCAGCAGATATCAGTCAAATCCAGGGTTAGACCACTGGACCACCGTTAAAGGAATCCTCAAGTATCTTAGGAGAACG
AGGGACTACATGCTGGTGTTTGGGGCTAAAGAGGTCAGTGTTCCCCTTAACGGGGGAGCTGTAGTTTGGAGAAGTATAAAGCAAGGATGCATAGCAAACTCCACGATGGA
GGCTAAGTATGTAGCTGCTTGTGAAGCAGCTAAGGAAGCTGTTTGGCTAAGGAAGTTCTTGACAGATTTGGAAGTTGTTCCAAATATGAACTTGCCCATTACGTTATACT
GTGACAACAGTGGGGCTGTAGCCAATTCAAAGGAACCTCGCAGCCACAAATGA
Protein sequenceShow/hide protein sequence
MRQKKKVSVANLAGWVVGYLNAFCDSGRREMELLGGRPSRVVCKWASLEVDGFKVSFAATIFHENIRDADFVEGLAAAVGLRLAVETATRPFVLETDSMRVFKLLQRDRE
EVSELGMLVEDAVRGIPTGWFVGGSFTFREGNCVAHRLARLAMDDRRDRASGLGLSLDAPELQANLLIYTCYAASGFVLLGGVEASSLGSLEGSGQFYEFLRSVVTPIGA
LGSLLCHLASGRDISAGVGGRWVEEFISLSWGSPRACVLGNFIHPQIDSPRISDLLRNCQIRLNSPTACEIITNYCEHHHYGYTSITLRLLEAGDWWESRGSLQGNSLRP
PFGLAPDMDTPTRMYPTWMLWIVASISNTRWVVSHRVTRISVSKTPFELWKGRKPSLRYFRIWGCPTHVLVTNPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVST
NATFLEEDHVRDHKPRSKLVLGESTEGSTRVVDEPGPSTRVAGESSSSRQSSPPHVVGELRRSGRVVIQPNRYLGLTETQVVIPDDGVEDPLSYRQAMNDVDKDEWAKAM
DLEMESMYFNQVWELVDPPEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLNSIRILLSIATFYDYEIWQMDVKTAFLNGNLDESIFMSQ
PEGFIIQGQEQKVCKLNRSIYGLKQASRSWNIRFDTAIKSFGFDQNVDEPCVYKKINKNKVAFLVLYVDDILLIENDVGYLSDVKEWLAAQFQMKDLGEAQYVLGIQIIR
DRKNKTLALSQATYIDKMLARYSMQNTKRGLLPFRHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYAMLCTRSDICYAVGIVSRYQSNPGLDHWTTVKGILKYLRRT
RDYMLVFGAKEVSVPLNGGAVVWRSIKQGCIANSTMEAKYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHK