; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022142 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022142
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr7:19635201..19643479
RNA-Seq ExpressionLag0022142
SyntenyLag0022142
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-28848.61Show/hide
Query:  SSWQQLDAGKMTLRVGTGELISAKVVGDIKMFFSR-------------------------EH-----------------------------YML------
        SS++QL+  +MTL+VGTG++ISA+ VGD K+FF                           EH                             Y+L      
Subjt:  SSWQQLDAGKMTLRVGTGELISAKVVGDIKMFFSR-------------------------EH-----------------------------YML------

Query:  -------------------------------------LDNDWRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKGYCAKEPLELVHTDLCGPMNVRA
                                             LD   RLVKNGLL++L+D +LPPCESCLEGKMTKRPFTGKGY AKEPLEL+H+DLCGPMNV+A
Subjt:  -------------------------------------LDNDWRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKGYCAKEPLELVHTDLCGPMNVRA

Query:  RGGYEYFISFIDDYSR---------------------TEVENLLGKTIKI--------------------------------------------------
        RGG+EYFISFIDDYSR                     TEVENLL K IKI                                                  
Subjt:  RGGYEYFISFIDDYSR---------------------TEVENLLGKTIKI--------------------------------------------------

Query:  ---------------------------------------------PEL-----------------------------------------FDRTWN--LVS
                                                     P L                                         FD   N   VS
Subjt:  ---------------------------------------------PEL-----------------------------------------FDRTWN--LVS

Query:  TLSTRVVDE----------------VGPSTRVVDEPSSEAGPSTRVADEPSTSGQAHPSQELRMPRRSGMVVIQPERYLGLAETPTVIPDGGVEDPLTFR
        T +T + ++                   STRVVD    E GPS+RV DE +TSGQ+HPSQ LRMPRRSG VV QP RYLGL ET  VIPD GVEDPL+++
Subjt:  TLSTRVVDE----------------VGPSTRVVDEPSSEAGPSTRVADEPSTSGQAHPSQELRMPRRSGMVVIQPERYLGLAETPTVIPDGGVEDPLTFR

Query:  HAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIRCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLFIATFYD
         AMNDVD DQWVKAMDLEMESMYFN VW+LVDLPEGVKPI CKWIYKRKRD+AGKVQTFKARLVAKGYTQ+EGVDYEETFSPVAMLKSIRILL IATFYD
Subjt:  HAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIRCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLFIATFYD

Query:  YEIWQMDVKIAFLNGNLDENIFMSQPKGFVTQGQEQKVCKLRRSIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVYKRIINTSVAFLVLYVDDILLIG
        YEIWQMDVK AFLNGNL+E+IFMSQP+GF+TQGQEQKVCKL RSI+GLKQ SRSWNI+FDT +KSYGFDQNVDEPCVYK+I    VAFLVLYVDDILLIG
Subjt:  YEIWQMDVKIAFLNGNLDENIFMSQPKGFVTQGQEQKVCKLRRSIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVYKRIINTSVAFLVLYVDDILLIG

Query:  NDVEYLHNKKFKFCRRFFSRWQKIGQKSPSRQPYFPRHHIFNPLPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATVADSSRRLHTQPPPT
        NDV Y                                                                                               
Subjt:  NDVEYLHNKKFKFCRRFFSRWQKIGQKSPSRQPYFPRHHIFNPLPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATVADSSRRLHTQPPPT

Query:  TISPPSIVVVGFLDYQSPPLPFHILLVLFLLHLVAGAFLSSIVIHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDVLSDFYYKDSESSLT
                                                                                                            
Subjt:  TISPPSIVVVGFLDYQSPPLPFHILLVLFLLHLVAGAFLSSIVIHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDVLSDFYYKDSESSLT

Query:  SPTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVASFIQQKSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRKNKTLALSQATYIDK
                                                             L D+K WL+ +FQ+KDLGEAQYVLGIQIIR+RKNKTLALSQATYIDK
Subjt:  SPTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVASFIQQKSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRKNKTLALSQATYIDK

Query:  MLSRYKMHNSKRGLLPFRHGVHLSKEQCPKTPQEIEDMRRIPYASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILKYLRRTRDYHLVY
        +L RY M NSK+GLLPFRHGVHLSKEQ PKTPQE+EDMRRIPYAS VGSLMY MLCTRPDI YAVGI+SRYQSNP LDHWTAVK +LKYLRRTRDY LVY
Subjt:  MLSRYKMHNSKRGLLPFRHGVHLSKEQCPKTPQEIEDMRRIPYASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILKYLRRTRDYHLVY

Query:  GSKDLILTGYTNSDFQTDKASRKSTSGSVFTLNGGAVVWRSIKQGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLNLRITLYCDNSGAVANSK
        G+KDLILTGYT+SDFQTDK SRKSTSGSVFTLNGGAVVWRSIKQGCIA STMEAEYVAACEAAKEAVWLRKFL DLEVVPN+NL ITLYCDNSGAVANSK
Subjt:  GSKDLILTGYTNSDFQTDKASRKSTSGSVFTLNGGAVVWRSIKQGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLNLRITLYCDNSGAVANSK

Query:  EPRSHKRGKHIERKYHLIREIVQRGDVIVTKVASAHNIADPFTKALTAKVFEGHLKGLG
        EPRSHKRGKHIERKYHLIREIVQRGDVIVTK+AS HNIADPFTK LTAKVFEGHL+ LG
Subjt:  EPRSHKRGKHIERKYHLIREIVQRGDVIVTKVASAHNIADPFTKALTAKVFEGHLKGLG

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]1.6e-28448.21Show/hide
Query:  SSWQQLDAGKMTLRVGTGELISAKVVGDIKMFFSR-------------------------EH-----------------------------YML------
        SS++QL+  +MTL+VGTG++ISA+ VGD K+FF                           EH                             Y+L      
Subjt:  SSWQQLDAGKMTLRVGTGELISAKVVGDIKMFFSR-------------------------EH-----------------------------YML------

Query:  -------------------------------------LDNDWRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKGYCAKEPLELVHTDLCGPMNVRA
                                             LD   RLVK+GLL++L+D +LPPCESCLEGKMTKRPFTGKGY AKEPLEL+H+DLCGPMNV+A
Subjt:  -------------------------------------LDNDWRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKGYCAKEPLELVHTDLCGPMNVRA

Query:  RGGYEYFISFIDDYSR---------------------TEVENLLGKTIKI--------------------------------------------------
        RG +EYFISFIDDYSR                     TEVENLL K IKI                                                  
Subjt:  RGGYEYFISFIDDYSR---------------------TEVENLLGKTIKI--------------------------------------------------

Query:  ---------------------------------------------PEL-----------------------------------------FDRTWN--LVS
                                                     P L                                         FD   N   VS
Subjt:  ---------------------------------------------PEL-----------------------------------------FDRTWN--LVS

Query:  TLSTRVVDE----------------VGPSTRVVDEPSSEAGPSTRVADEPSTSGQAHPSQELRMPRRSGMVVIQPERYLGLAETPTVIPDGGVEDPLTFR
        T +T + ++                   STRVVD    E GPS+RV DE +TSGQ+HPSQ LRMPRRSG VV QP RYLGL ET  VIPD GVEDPL+++
Subjt:  TLSTRVVDE----------------VGPSTRVVDEPSSEAGPSTRVADEPSTSGQAHPSQELRMPRRSGMVVIQPERYLGLAETPTVIPDGGVEDPLTFR

Query:  HAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIRCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLFIATFYD
         AMNDVD DQWVKAMDLEMESMYFN VW+LVDLPEGVKPI CKWIYKRKRD+AGKVQTFKARLVAKGYT+KEGVDYEETFS VAMLKSIRILL IA FYD
Subjt:  HAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIRCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLFIATFYD

Query:  YEIWQMDVKIAFLNGNLDENIFMSQPKGFVTQGQEQKVCKLRRSIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVYKRIINTSVAFLVLYVDDILLIG
        YEIWQMDVK AFLNGNL+E+IFMSQP+GF+TQGQEQKVCKL RSI+GLKQ SRSWNI+FDT +KSYGFDQNVDEPCVYK+I    VAFLVLYVDDILLIG
Subjt:  YEIWQMDVKIAFLNGNLDENIFMSQPKGFVTQGQEQKVCKLRRSIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVYKRIINTSVAFLVLYVDDILLIG

Query:  NDVEYLHNKKFKFCRRFFSRWQKIGQKSPSRQPYFPRHHIFNPLPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATVADSSRRLHTQPPPT
        NDV Y                                                                                               
Subjt:  NDVEYLHNKKFKFCRRFFSRWQKIGQKSPSRQPYFPRHHIFNPLPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATVADSSRRLHTQPPPT

Query:  TISPPSIVVVGFLDYQSPPLPFHILLVLFLLHLVAGAFLSSIVIHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDVLSDFYYKDSESSLT
                                                                                                            
Subjt:  TISPPSIVVVGFLDYQSPPLPFHILLVLFLLHLVAGAFLSSIVIHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDVLSDFYYKDSESSLT

Query:  SPTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVASFIQQKSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRKNKTLALSQATYIDK
                                                             L D+K WL+ +FQ+KDLGE QYVLGIQIIR+RKNKTLALSQATYIDK
Subjt:  SPTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVASFIQQKSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRKNKTLALSQATYIDK

Query:  MLSRYKMHNSKRGLLPFRHGVHLSKEQCPKTPQEIEDMRRIPYASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILKYLRRTRDYHLVY
        +L RY M NSK+GLLPFRHGVHLSKEQ PKTPQE+EDMRRIPYAS VGSLMY MLCTRPDI YAVGI+SRYQSNP LDHWTAVK ILKYLRRTRDY LVY
Subjt:  MLSRYKMHNSKRGLLPFRHGVHLSKEQCPKTPQEIEDMRRIPYASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILKYLRRTRDYHLVY

Query:  GSKDLILTGYTNSDFQTDKASRKSTSGSVFTLNGGAVVWRSIKQGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLNLRITLYCDNSGAVANSK
        G+KDLILTGYTNSDFQTDK SRKSTS SVFTLNGGAVVWRSIKQGCIA STMEAEYVAACEAAKEAVWL+KFL DLEVVPN+NL ITLYCDNSGAVANSK
Subjt:  GSKDLILTGYTNSDFQTDKASRKSTSGSVFTLNGGAVVWRSIKQGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLNLRITLYCDNSGAVANSK

Query:  EPRSHKRGKHIERKYHLIREIVQRGDVIVTKVASAHNIADPFTKALTAKVFEGHLKGLG
        EPRSHKRGKHIERKYHLIREIVQRGDVIVTK+AS HNIADPFTK LTAKVFEGHL+ LG
Subjt:  EPRSHKRGKHIERKYHLIREIVQRGDVIVTKVASAHNIADPFTKALTAKVFEGHLKGLG

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]9.2e-26448.92Show/hide
Query:  RLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKGYCAKEPLELVHTDLCGPMNVRARGGYEYFISFIDDYSR---------------------TEVEN
        RLVKNGLLS+LE+ +LP CESCLEGKMTKRPFTGKG+ AKEPLELVH+DLCGPMNV+ARGG+EYFI+F DDYSR                      EVEN
Subjt:  RLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKGYCAKEPLELVHTDLCGPMNVRARGGYEYFISFIDDYSR---------------------TEVEN

Query:  LLGKTIK--------------------------------------IPELFDRT---------------------------------------------WN
         L KTIK                                      + E  +RT                                             WN
Subjt:  LLGKTIK--------------------------------------IPELFDRT---------------------------------------------WN

Query:  -------------------------------------------------------LVSTLSTRV----VDEVGPSTRVV-DEPSSE-AGPSTRVADEPS-
                                                                VST +T +    + E  P +++V +E S E   PSTRV +EPS 
Subjt:  -------------------------------------------------------LVSTLSTRV----VDEVGPSTRVV-DEPSSE-AGPSTRVADEPS-

Query:  --------TSGQAHPSQELRMPRRSGMVVIQPERYLGLAETPTVIPDGGVEDPLTFRHAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIRCK
                +S + H  Q LR PRRSG V   P RY+ L ET TVI DG +EDPLTF+ AM DVD D+W+KAM+LE+ESMYFN VWDLVD P+GVKPI CK
Subjt:  --------TSGQAHPSQELRMPRRSGMVVIQPERYLGLAETPTVIPDGGVEDPLTFRHAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIRCK

Query:  WIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLFIATFYDYEIWQMDVKIAFLNGNLDENIFMSQPKGFVTQGQEQKVCKLRR
        WIYKRKR A GKVQTFKARLVAKGYTQ EGVDYEETFSPVAMLKSIRILL IA ++DYEIWQMDVK AFLNGNL+E I+M QP+GF+  GQEQK+CKL R
Subjt:  WIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLFIATFYDYEIWQMDVKIAFLNGNLDENIFMSQPKGFVTQGQEQKVCKLRR

Query:  SIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVYKRIINTSVAFLVLYVDDILLIGNDVEYLHNKKFKFCRRFFSRWQKIGQKSPSRQPYFPRHHIFNP
        SI+GLKQ SRSWNI+FDT +KSYGFDQ VDEPCVYKRIIN SVAFLVLYVDDILLIGND+                                        
Subjt:  SIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVYKRIINTSVAFLVLYVDDILLIGNDVEYLHNKKFKFCRRFFSRWQKIGQKSPSRQPYFPRHHIFNP

Query:  LPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATVADSSRRLHTQPPPTTISPPSIVVVGFLDYQSPPLPFHILLVLFLLHLVAGAFLSSIV
                                                                                                            
Subjt:  LPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATVADSSRRLHTQPPPTTISPPSIVVVGFLDYQSPPLPFHILLVLFLLHLVAGAFLSSIV

Query:  IHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDVLSDFYYKDSESSLTSPTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVASFIQQ
                                                                                                            
Subjt:  IHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDVLSDFYYKDSESSLTSPTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVASFIQQ

Query:  KSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRKNKTLALSQATYIDKMLSRYKMHNSKRGLLPFRHGVHLSKEQCPKTPQEIEDMRRIPY
                 +L DIK+WL+T+FQ+KDLGEAQ+VLGIQI R+RKNK LALSQA+YIDK++ +Y M NSKRGLLPFRHGV LSKEQCPKTPQ++E+MR IPY
Subjt:  KSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRKNKTLALSQATYIDKMLSRYKMHNSKRGLLPFRHGVHLSKEQCPKTPQEIEDMRRIPY

Query:  ASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILKYLRRTRDYHLVYGSKDLILTGYTNSDFQTDKASRKSTSGSVFTLNGGAVVWRSIK
        AS VGSLMY MLCTRPDI YAVGI+SRYQSNP L HWTAVK ILKYLRRTRDY LVYGSKDLILTGYT+SDFQTD+ SRKSTSGSVFTLNGGAVVWRSIK
Subjt:  ASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILKYLRRTRDYHLVYGSKDLILTGYTNSDFQTDKASRKSTSGSVFTLNGGAVVWRSIK

Query:  QGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLNLRITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKVASAHNIADPFT
        QGCIA STMEAEYVAACEAAKEAVWLR FL+DLEVVPN++  ITLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV RGDVIVT++AS HN+ADPFT
Subjt:  QGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLNLRITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKVASAHNIADPFT

Query:  KALTAKVFEGHLKGLG
        K LTAKVFEGHL+ LG
Subjt:  KALTAKVFEGHLKGLG

KAA0050437.1 gag/pol protein [Cucumis melo var. makuwa]1.6e-27153.61Show/hide
Query:  SSWQQLDAGKMTLRVGTGELISAKVVGDIKMFFSREHYMLLDNDWRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKGYCAKEPLELVHTDLCGPMN
        SS++QL+  +MTL VGTG++ISA+VVGD K+      ++ LD   RLVKNGLL++L+D +LPPCESCLEGKMTKRPFTGKGY AKEPLEL+H+ LCGPMN
Subjt:  SSWQQLDAGKMTLRVGTGELISAKVVGDIKMFFSREHYMLLDNDWRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKGYCAKEPLELVHTDLCGPMN

Query:  VRARGGYEYFISFIDDYSRTEVENLLGKTIKIPELFDRTWNLVSTLST----------------------------------------------RVV---
        V+ARGG+EYFIS +DDYSR     +LG       LFDRTWN + TLST                                              RV    
Subjt:  VRARGGYEYFISFIDDYSRTEVENLLGKTIKIPELFDRTWNLVSTLST----------------------------------------------RVV---

Query:  ----------------------DEVGPSTRVVDEPSSEAGPSTRVADEPSTSGQAHPSQELRMPRRSGMVVIQPERYLGLAETPTVIPDGGVEDPLTFRH
                              +    STRVVD    E GPS+RV DE +TSGQ+HPSQ LRMP+RSG +V QP RYLGL ET  VIPD GVEDPL+++ 
Subjt:  ----------------------DEVGPSTRVVDEPSSEAGPSTRVADEPSTSGQAHPSQELRMPRRSGMVVIQPERYLGLAETPTVIPDGGVEDPLTFRH

Query:  AMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIRCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLFIATFYDY
        AMNDVD DQWVKAMDLEMESMYFN VW+LVDLPEGVKPI CKWIYKRKRD+AGKVQTFKARLVAKGYTQ+EGVDYEETFS VAML+SIRILL IATFYDY
Subjt:  AMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIRCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLFIATFYDY

Query:  EIWQMDVKIAFLNGNLDENIFMSQPKGFVTQGQEQKVCKLRRSIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVYKRIINTSVAFLVLYVDDILLIGN
        EIW+MDVK A LNGNL+E+IFMSQ +GF+TQGQEQKVCKL RSI+GLKQ SRSWNI+FDT +K YGFDQNVDEPCVYK+I    VAFLVLYVDDILLIGN
Subjt:  EIWQMDVKIAFLNGNLDENIFMSQPKGFVTQGQEQKVCKLRRSIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVYKRIINTSVAFLVLYVDDILLIGN

Query:  DVEYLHNKKFKFCRRFFSRWQKIGQKSPSRQPYFPRHHIFNPLPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATVADSSRRLHTQPPPTT
        D+ Y                                                                                                
Subjt:  DVEYLHNKKFKFCRRFFSRWQKIGQKSPSRQPYFPRHHIFNPLPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATVADSSRRLHTQPPPTT

Query:  ISPPSIVVVGFLDYQSPPLPFHILLVLFLLHLVAGAFLSSIVIHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDVLSDFYYKDSESSLTS
                                                                                                            
Subjt:  ISPPSIVVVGFLDYQSPPLPFHILLVLFLLHLVAGAFLSSIVIHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDVLSDFYYKDSESSLTS

Query:  PTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVASFIQQKSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRKNKTLALSQATYIDKM
                                                            L D+K WL+ +FQ+KDLGEAQYVLGIQIIR+RKNK LALSQ TYIDKM
Subjt:  PTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVASFIQQKSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRKNKTLALSQATYIDKM

Query:  LSRYKMHNSKRGLLPFRHGVHLSKEQCPKTPQEIEDMRRIPYASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILKYLRRTRDYHLVYG
        L RY M NSK+GLLPFRHGVHLSKEQC KTPQE+EDMRRIPYAS VGSLMYVMLCTRP+I YAVGI+SRYQSNP LDHWTAVK ILKYLRRTRDY LVYG
Subjt:  LSRYKMHNSKRGLLPFRHGVHLSKEQCPKTPQEIEDMRRIPYASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILKYLRRTRDYHLVYG

Query:  SKDLILTGYTNSDFQTDKASRKSTSGSVFTLNGGAVVWRSIKQGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLNLRITLYCDNSGAVANSKE
        +KDLILTGYT+ DFQTDK SRKSTSGSVFTLNGG VVWRSIKQGCIA  TMEAEYVAACEAAKEAVWLRKFL DLEVVPN+NL ITLYCDNSGAVAN KE
Subjt:  SKDLILTGYTNSDFQTDKASRKSTSGSVFTLNGGAVVWRSIKQGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLNLRITLYCDNSGAVANSKE

Query:  PRSHKRGKHIE
        PRSHKRGK  E
Subjt:  PRSHKRGKHIE

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]3.7e-29752.15Show/hide
Query:  SSWQQLDAGKMTLRVGTGELISAKVVGDIKMFFSREHYMLLDNDWRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKGYCAKEPLELVHTDLCGPMN
        SS++QL+  +MTL+VGTG++ISA+ VGD K+      ++ LD   RLVKNGLL++L+D +LPPCESCLEGKMTKRPFTGKGY AKEPLEL+H+DLCGPMN
Subjt:  SSWQQLDAGKMTLRVGTGELISAKVVGDIKMFFSREHYMLLDNDWRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKGYCAKEPLELVHTDLCGPMN

Query:  VRARGGYEYFISFIDDYSR---------------------TEVENLLGKTIKI-----------------------------------------------
        V+ARGG+EYFISFIDDYSR                     TEVENLL K IKI                                               
Subjt:  VRARGGYEYFISFIDDYSR---------------------TEVENLLGKTIKI-----------------------------------------------

Query:  ------------------------------------------------PEL-----------------------------------------FDRTWN--
                                                        P L                                         FD   N  
Subjt:  ------------------------------------------------PEL-----------------------------------------FDRTWN--

Query:  LVSTLSTRVVDE----------------VGPSTRVVDEPSSEAGPSTRVADEPSTSGQAHPSQELRMPRRSGMVVIQPERYLGLAETPTVIPDGGVEDPL
         VST +T + ++                   STRVVD    E GPS+RV DE +TSGQ+HPSQ LRMPRRSG VV QP RYLGL ET  VIPD GVEDPL
Subjt:  LVSTLSTRVVDE----------------VGPSTRVVDEPSSEAGPSTRVADEPSTSGQAHPSQELRMPRRSGMVVIQPERYLGLAETPTVIPDGGVEDPL

Query:  TFRHAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIRCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLFIAT
        +++ AMNDVD DQWVKAMDLEMESMYFN VW+LVDLPEGVKPI CKWIYKRKRD+AGKVQTFKARLVAKGYTQ+EGVDYEETFSPVAMLKSIRILL IAT
Subjt:  TFRHAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIRCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLFIAT

Query:  FYDYEIWQMDVKIAFLNGNLDENIFMSQPKGFVTQGQEQKVCKLRRSIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVYKRIINTSVAFLVLYVDDIL
        FYDYEIWQMDVK AFLNGNL+E+IFMSQP+GF+TQGQEQKVCKL RSI+GLKQ SRSWNI+FDT +KSYGFDQNVDEPCVYK+I    VAFLVLYVDDIL
Subjt:  FYDYEIWQMDVKIAFLNGNLDENIFMSQPKGFVTQGQEQKVCKLRRSIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVYKRIINTSVAFLVLYVDDIL

Query:  LIGNDVEYLHNKKFKFCRRFFSRWQKIGQKSPSRQPYFPRHHIFNPLPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATVADSSRRLHTQP
        LIGNDV Y                                                                                            
Subjt:  LIGNDVEYLHNKKFKFCRRFFSRWQKIGQKSPSRQPYFPRHHIFNPLPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATVADSSRRLHTQP

Query:  PPTTISPPSIVVVGFLDYQSPPLPFHILLVLFLLHLVAGAFLSSIVIHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDVLSDFYYKDSES
                                                                                                            
Subjt:  PPTTISPPSIVVVGFLDYQSPPLPFHILLVLFLLHLVAGAFLSSIVIHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDVLSDFYYKDSES

Query:  SLTSPTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVASFIQQKSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRKNKTLALSQATY
                                                                L D+K WL+ +FQ+KDLGEAQYVLGIQIIR+RKNKTLALSQATY
Subjt:  SLTSPTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVASFIQQKSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRKNKTLALSQATY

Query:  IDKMLSRYKMHNSKRGLLPFRHGVHLSKEQCPKTPQEIEDMRRIPYASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILKYLRRTRDYH
        IDK+L RY M NSK+GLLPFRHGVHLSKEQ PKTPQE+EDMRRIPYAS VGSLMY MLCTRPDI YAVGI+SRYQSNP LDHWTAVK +LKYLRRTRDY 
Subjt:  IDKMLSRYKMHNSKRGLLPFRHGVHLSKEQCPKTPQEIEDMRRIPYASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILKYLRRTRDYH

Query:  LVYGSKDLILTGYTNSDFQTDKASRKSTSGSVFTLNGGAVVWRSIKQGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLNLRITLYCDNSGAVA
        LVYG+KDLILTGYT+SDFQTDK SRKSTSGSVFTLNGGAVVWRSIKQGCIA STMEAEYVAACEAAKEAVWLRKFL DLEVVPN+NL ITLYCDNSGAVA
Subjt:  LVYGSKDLILTGYTNSDFQTDKASRKSTSGSVFTLNGGAVVWRSIKQGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLNLRITLYCDNSGAVA

Query:  NSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKVASAHNIADPFTKALTAKVFEGHLKGLG
        NSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTK+AS HNIADPFTK LTAKVFEGHL+ LG
Subjt:  NSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKVASAHNIADPFTKALTAKVFEGHLKGLG

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein4.5e-26448.92Show/hide
Query:  RLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKGYCAKEPLELVHTDLCGPMNVRARGGYEYFISFIDDYSR---------------------TEVEN
        RLVKNGLLS+LE+ +LP CESCLEGKMTKRPFTGKG+ AKEPLELVH+DLCGPMNV+ARGG+EYFI+F DDYSR                      EVEN
Subjt:  RLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKGYCAKEPLELVHTDLCGPMNVRARGGYEYFISFIDDYSR---------------------TEVEN

Query:  LLGKTIK--------------------------------------IPELFDRT---------------------------------------------WN
         L KTIK                                      + E  +RT                                             WN
Subjt:  LLGKTIK--------------------------------------IPELFDRT---------------------------------------------WN

Query:  -------------------------------------------------------LVSTLSTRV----VDEVGPSTRVV-DEPSSE-AGPSTRVADEPS-
                                                                VST +T +    + E  P +++V +E S E   PSTRV +EPS 
Subjt:  -------------------------------------------------------LVSTLSTRV----VDEVGPSTRVV-DEPSSE-AGPSTRVADEPS-

Query:  --------TSGQAHPSQELRMPRRSGMVVIQPERYLGLAETPTVIPDGGVEDPLTFRHAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIRCK
                +S + H  Q LR PRRSG V   P RY+ L ET TVI DG +EDPLTF+ AM DVD D+W+KAM+LE+ESMYFN VWDLVD P+GVKPI CK
Subjt:  --------TSGQAHPSQELRMPRRSGMVVIQPERYLGLAETPTVIPDGGVEDPLTFRHAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIRCK

Query:  WIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLFIATFYDYEIWQMDVKIAFLNGNLDENIFMSQPKGFVTQGQEQKVCKLRR
        WIYKRKR A GKVQTFKARLVAKGYTQ EGVDYEETFSPVAMLKSIRILL IA ++DYEIWQMDVK AFLNGNL+E I+M QP+GF+  GQEQK+CKL R
Subjt:  WIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLFIATFYDYEIWQMDVKIAFLNGNLDENIFMSQPKGFVTQGQEQKVCKLRR

Query:  SIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVYKRIINTSVAFLVLYVDDILLIGNDVEYLHNKKFKFCRRFFSRWQKIGQKSPSRQPYFPRHHIFNP
        SI+GLKQ SRSWNI+FDT +KSYGFDQ VDEPCVYKRIIN SVAFLVLYVDDILLIGND+                                        
Subjt:  SIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVYKRIINTSVAFLVLYVDDILLIGNDVEYLHNKKFKFCRRFFSRWQKIGQKSPSRQPYFPRHHIFNP

Query:  LPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATVADSSRRLHTQPPPTTISPPSIVVVGFLDYQSPPLPFHILLVLFLLHLVAGAFLSSIV
                                                                                                            
Subjt:  LPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATVADSSRRLHTQPPPTTISPPSIVVVGFLDYQSPPLPFHILLVLFLLHLVAGAFLSSIV

Query:  IHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDVLSDFYYKDSESSLTSPTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVASFIQQ
                                                                                                            
Subjt:  IHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDVLSDFYYKDSESSLTSPTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVASFIQQ

Query:  KSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRKNKTLALSQATYIDKMLSRYKMHNSKRGLLPFRHGVHLSKEQCPKTPQEIEDMRRIPY
                 +L DIK+WL+T+FQ+KDLGEAQ+VLGIQI R+RKNK LALSQA+YIDK++ +Y M NSKRGLLPFRHGV LSKEQCPKTPQ++E+MR IPY
Subjt:  KSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRKNKTLALSQATYIDKMLSRYKMHNSKRGLLPFRHGVHLSKEQCPKTPQEIEDMRRIPY

Query:  ASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILKYLRRTRDYHLVYGSKDLILTGYTNSDFQTDKASRKSTSGSVFTLNGGAVVWRSIK
        AS VGSLMY MLCTRPDI YAVGI+SRYQSNP L HWTAVK ILKYLRRTRDY LVYGSKDLILTGYT+SDFQTD+ SRKSTSGSVFTLNGGAVVWRSIK
Subjt:  ASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILKYLRRTRDYHLVYGSKDLILTGYTNSDFQTDKASRKSTSGSVFTLNGGAVVWRSIK

Query:  QGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLNLRITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKVASAHNIADPFT
        QGCIA STMEAEYVAACEAAKEAVWLR FL+DLEVVPN++  ITLYCDNSGAVANS+EPRSHKRGKHIERKYHLIREIV RGDVIVT++AS HN+ADPFT
Subjt:  QGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLNLRITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKVASAHNIADPFT

Query:  KALTAKVFEGHLKGLG
        K LTAKVFEGHL+ LG
Subjt:  KALTAKVFEGHLKGLG

A0A5A7T2V9 Gag/pol protein7.8e-28548.21Show/hide
Query:  SSWQQLDAGKMTLRVGTGELISAKVVGDIKMFFSR-------------------------EH-----------------------------YML------
        SS++QL+  +MTL+VGTG++ISA+ VGD K+FF                           EH                             Y+L      
Subjt:  SSWQQLDAGKMTLRVGTGELISAKVVGDIKMFFSR-------------------------EH-----------------------------YML------

Query:  -------------------------------------LDNDWRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKGYCAKEPLELVHTDLCGPMNVRA
                                             LD   RLVK+GLL++L+D +LPPCESCLEGKMTKRPFTGKGY AKEPLEL+H+DLCGPMNV+A
Subjt:  -------------------------------------LDNDWRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKGYCAKEPLELVHTDLCGPMNVRA

Query:  RGGYEYFISFIDDYSR---------------------TEVENLLGKTIKI--------------------------------------------------
        RG +EYFISFIDDYSR                     TEVENLL K IKI                                                  
Subjt:  RGGYEYFISFIDDYSR---------------------TEVENLLGKTIKI--------------------------------------------------

Query:  ---------------------------------------------PEL-----------------------------------------FDRTWN--LVS
                                                     P L                                         FD   N   VS
Subjt:  ---------------------------------------------PEL-----------------------------------------FDRTWN--LVS

Query:  TLSTRVVDE----------------VGPSTRVVDEPSSEAGPSTRVADEPSTSGQAHPSQELRMPRRSGMVVIQPERYLGLAETPTVIPDGGVEDPLTFR
        T +T + ++                   STRVVD    E GPS+RV DE +TSGQ+HPSQ LRMPRRSG VV QP RYLGL ET  VIPD GVEDPL+++
Subjt:  TLSTRVVDE----------------VGPSTRVVDEPSSEAGPSTRVADEPSTSGQAHPSQELRMPRRSGMVVIQPERYLGLAETPTVIPDGGVEDPLTFR

Query:  HAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIRCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLFIATFYD
         AMNDVD DQWVKAMDLEMESMYFN VW+LVDLPEGVKPI CKWIYKRKRD+AGKVQTFKARLVAKGYT+KEGVDYEETFS VAMLKSIRILL IA FYD
Subjt:  HAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIRCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLFIATFYD

Query:  YEIWQMDVKIAFLNGNLDENIFMSQPKGFVTQGQEQKVCKLRRSIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVYKRIINTSVAFLVLYVDDILLIG
        YEIWQMDVK AFLNGNL+E+IFMSQP+GF+TQGQEQKVCKL RSI+GLKQ SRSWNI+FDT +KSYGFDQNVDEPCVYK+I    VAFLVLYVDDILLIG
Subjt:  YEIWQMDVKIAFLNGNLDENIFMSQPKGFVTQGQEQKVCKLRRSIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVYKRIINTSVAFLVLYVDDILLIG

Query:  NDVEYLHNKKFKFCRRFFSRWQKIGQKSPSRQPYFPRHHIFNPLPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATVADSSRRLHTQPPPT
        NDV Y                                                                                               
Subjt:  NDVEYLHNKKFKFCRRFFSRWQKIGQKSPSRQPYFPRHHIFNPLPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATVADSSRRLHTQPPPT

Query:  TISPPSIVVVGFLDYQSPPLPFHILLVLFLLHLVAGAFLSSIVIHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDVLSDFYYKDSESSLT
                                                                                                            
Subjt:  TISPPSIVVVGFLDYQSPPLPFHILLVLFLLHLVAGAFLSSIVIHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDVLSDFYYKDSESSLT

Query:  SPTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVASFIQQKSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRKNKTLALSQATYIDK
                                                             L D+K WL+ +FQ+KDLGE QYVLGIQIIR+RKNKTLALSQATYIDK
Subjt:  SPTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVASFIQQKSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRKNKTLALSQATYIDK

Query:  MLSRYKMHNSKRGLLPFRHGVHLSKEQCPKTPQEIEDMRRIPYASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILKYLRRTRDYHLVY
        +L RY M NSK+GLLPFRHGVHLSKEQ PKTPQE+EDMRRIPYAS VGSLMY MLCTRPDI YAVGI+SRYQSNP LDHWTAVK ILKYLRRTRDY LVY
Subjt:  MLSRYKMHNSKRGLLPFRHGVHLSKEQCPKTPQEIEDMRRIPYASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILKYLRRTRDYHLVY

Query:  GSKDLILTGYTNSDFQTDKASRKSTSGSVFTLNGGAVVWRSIKQGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLNLRITLYCDNSGAVANSK
        G+KDLILTGYTNSDFQTDK SRKSTS SVFTLNGGAVVWRSIKQGCIA STMEAEYVAACEAAKEAVWL+KFL DLEVVPN+NL ITLYCDNSGAVANSK
Subjt:  GSKDLILTGYTNSDFQTDKASRKSTSGSVFTLNGGAVVWRSIKQGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLNLRITLYCDNSGAVANSK

Query:  EPRSHKRGKHIERKYHLIREIVQRGDVIVTKVASAHNIADPFTKALTAKVFEGHLKGLG
        EPRSHKRGKHIERKYHLIREIVQRGDVIVTK+AS HNIADPFTK LTAKVFEGHL+ LG
Subjt:  EPRSHKRGKHIERKYHLIREIVQRGDVIVTKVASAHNIADPFTKALTAKVFEGHLKGLG

A0A5A7TZD0 Gag/pol protein6.8e-28948.61Show/hide
Query:  SSWQQLDAGKMTLRVGTGELISAKVVGDIKMFFSR-------------------------EH-----------------------------YML------
        SS++QL+  +MTL+VGTG++ISA+ VGD K+FF                           EH                             Y+L      
Subjt:  SSWQQLDAGKMTLRVGTGELISAKVVGDIKMFFSR-------------------------EH-----------------------------YML------

Query:  -------------------------------------LDNDWRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKGYCAKEPLELVHTDLCGPMNVRA
                                             LD   RLVKNGLL++L+D +LPPCESCLEGKMTKRPFTGKGY AKEPLEL+H+DLCGPMNV+A
Subjt:  -------------------------------------LDNDWRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKGYCAKEPLELVHTDLCGPMNVRA

Query:  RGGYEYFISFIDDYSR---------------------TEVENLLGKTIKI--------------------------------------------------
        RGG+EYFISFIDDYSR                     TEVENLL K IKI                                                  
Subjt:  RGGYEYFISFIDDYSR---------------------TEVENLLGKTIKI--------------------------------------------------

Query:  ---------------------------------------------PEL-----------------------------------------FDRTWN--LVS
                                                     P L                                         FD   N   VS
Subjt:  ---------------------------------------------PEL-----------------------------------------FDRTWN--LVS

Query:  TLSTRVVDE----------------VGPSTRVVDEPSSEAGPSTRVADEPSTSGQAHPSQELRMPRRSGMVVIQPERYLGLAETPTVIPDGGVEDPLTFR
        T +T + ++                   STRVVD    E GPS+RV DE +TSGQ+HPSQ LRMPRRSG VV QP RYLGL ET  VIPD GVEDPL+++
Subjt:  TLSTRVVDE----------------VGPSTRVVDEPSSEAGPSTRVADEPSTSGQAHPSQELRMPRRSGMVVIQPERYLGLAETPTVIPDGGVEDPLTFR

Query:  HAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIRCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLFIATFYD
         AMNDVD DQWVKAMDLEMESMYFN VW+LVDLPEGVKPI CKWIYKRKRD+AGKVQTFKARLVAKGYTQ+EGVDYEETFSPVAMLKSIRILL IATFYD
Subjt:  HAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIRCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLFIATFYD

Query:  YEIWQMDVKIAFLNGNLDENIFMSQPKGFVTQGQEQKVCKLRRSIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVYKRIINTSVAFLVLYVDDILLIG
        YEIWQMDVK AFLNGNL+E+IFMSQP+GF+TQGQEQKVCKL RSI+GLKQ SRSWNI+FDT +KSYGFDQNVDEPCVYK+I    VAFLVLYVDDILLIG
Subjt:  YEIWQMDVKIAFLNGNLDENIFMSQPKGFVTQGQEQKVCKLRRSIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVYKRIINTSVAFLVLYVDDILLIG

Query:  NDVEYLHNKKFKFCRRFFSRWQKIGQKSPSRQPYFPRHHIFNPLPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATVADSSRRLHTQPPPT
        NDV Y                                                                                               
Subjt:  NDVEYLHNKKFKFCRRFFSRWQKIGQKSPSRQPYFPRHHIFNPLPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATVADSSRRLHTQPPPT

Query:  TISPPSIVVVGFLDYQSPPLPFHILLVLFLLHLVAGAFLSSIVIHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDVLSDFYYKDSESSLT
                                                                                                            
Subjt:  TISPPSIVVVGFLDYQSPPLPFHILLVLFLLHLVAGAFLSSIVIHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDVLSDFYYKDSESSLT

Query:  SPTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVASFIQQKSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRKNKTLALSQATYIDK
                                                             L D+K WL+ +FQ+KDLGEAQYVLGIQIIR+RKNKTLALSQATYIDK
Subjt:  SPTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVASFIQQKSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRKNKTLALSQATYIDK

Query:  MLSRYKMHNSKRGLLPFRHGVHLSKEQCPKTPQEIEDMRRIPYASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILKYLRRTRDYHLVY
        +L RY M NSK+GLLPFRHGVHLSKEQ PKTPQE+EDMRRIPYAS VGSLMY MLCTRPDI YAVGI+SRYQSNP LDHWTAVK +LKYLRRTRDY LVY
Subjt:  MLSRYKMHNSKRGLLPFRHGVHLSKEQCPKTPQEIEDMRRIPYASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILKYLRRTRDYHLVY

Query:  GSKDLILTGYTNSDFQTDKASRKSTSGSVFTLNGGAVVWRSIKQGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLNLRITLYCDNSGAVANSK
        G+KDLILTGYT+SDFQTDK SRKSTSGSVFTLNGGAVVWRSIKQGCIA STMEAEYVAACEAAKEAVWLRKFL DLEVVPN+NL ITLYCDNSGAVANSK
Subjt:  GSKDLILTGYTNSDFQTDKASRKSTSGSVFTLNGGAVVWRSIKQGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLNLRITLYCDNSGAVANSK

Query:  EPRSHKRGKHIERKYHLIREIVQRGDVIVTKVASAHNIADPFTKALTAKVFEGHLKGLG
        EPRSHKRGKHIERKYHLIREIVQRGDVIVTK+AS HNIADPFTK LTAKVFEGHL+ LG
Subjt:  EPRSHKRGKHIERKYHLIREIVQRGDVIVTKVASAHNIADPFTKALTAKVFEGHLKGLG

A0A5A7U7T0 Gag/pol protein7.6e-27253.61Show/hide
Query:  SSWQQLDAGKMTLRVGTGELISAKVVGDIKMFFSREHYMLLDNDWRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKGYCAKEPLELVHTDLCGPMN
        SS++QL+  +MTL VGTG++ISA+VVGD K+      ++ LD   RLVKNGLL++L+D +LPPCESCLEGKMTKRPFTGKGY AKEPLEL+H+ LCGPMN
Subjt:  SSWQQLDAGKMTLRVGTGELISAKVVGDIKMFFSREHYMLLDNDWRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKGYCAKEPLELVHTDLCGPMN

Query:  VRARGGYEYFISFIDDYSRTEVENLLGKTIKIPELFDRTWNLVSTLST----------------------------------------------RVV---
        V+ARGG+EYFIS +DDYSR     +LG       LFDRTWN + TLST                                              RV    
Subjt:  VRARGGYEYFISFIDDYSRTEVENLLGKTIKIPELFDRTWNLVSTLST----------------------------------------------RVV---

Query:  ----------------------DEVGPSTRVVDEPSSEAGPSTRVADEPSTSGQAHPSQELRMPRRSGMVVIQPERYLGLAETPTVIPDGGVEDPLTFRH
                              +    STRVVD    E GPS+RV DE +TSGQ+HPSQ LRMP+RSG +V QP RYLGL ET  VIPD GVEDPL+++ 
Subjt:  ----------------------DEVGPSTRVVDEPSSEAGPSTRVADEPSTSGQAHPSQELRMPRRSGMVVIQPERYLGLAETPTVIPDGGVEDPLTFRH

Query:  AMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIRCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLFIATFYDY
        AMNDVD DQWVKAMDLEMESMYFN VW+LVDLPEGVKPI CKWIYKRKRD+AGKVQTFKARLVAKGYTQ+EGVDYEETFS VAML+SIRILL IATFYDY
Subjt:  AMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIRCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLFIATFYDY

Query:  EIWQMDVKIAFLNGNLDENIFMSQPKGFVTQGQEQKVCKLRRSIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVYKRIINTSVAFLVLYVDDILLIGN
        EIW+MDVK A LNGNL+E+IFMSQ +GF+TQGQEQKVCKL RSI+GLKQ SRSWNI+FDT +K YGFDQNVDEPCVYK+I    VAFLVLYVDDILLIGN
Subjt:  EIWQMDVKIAFLNGNLDENIFMSQPKGFVTQGQEQKVCKLRRSIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVYKRIINTSVAFLVLYVDDILLIGN

Query:  DVEYLHNKKFKFCRRFFSRWQKIGQKSPSRQPYFPRHHIFNPLPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATVADSSRRLHTQPPPTT
        D+ Y                                                                                                
Subjt:  DVEYLHNKKFKFCRRFFSRWQKIGQKSPSRQPYFPRHHIFNPLPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATVADSSRRLHTQPPPTT

Query:  ISPPSIVVVGFLDYQSPPLPFHILLVLFLLHLVAGAFLSSIVIHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDVLSDFYYKDSESSLTS
                                                                                                            
Subjt:  ISPPSIVVVGFLDYQSPPLPFHILLVLFLLHLVAGAFLSSIVIHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDVLSDFYYKDSESSLTS

Query:  PTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVASFIQQKSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRKNKTLALSQATYIDKM
                                                            L D+K WL+ +FQ+KDLGEAQYVLGIQIIR+RKNK LALSQ TYIDKM
Subjt:  PTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVASFIQQKSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRKNKTLALSQATYIDKM

Query:  LSRYKMHNSKRGLLPFRHGVHLSKEQCPKTPQEIEDMRRIPYASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILKYLRRTRDYHLVYG
        L RY M NSK+GLLPFRHGVHLSKEQC KTPQE+EDMRRIPYAS VGSLMYVMLCTRP+I YAVGI+SRYQSNP LDHWTAVK ILKYLRRTRDY LVYG
Subjt:  LSRYKMHNSKRGLLPFRHGVHLSKEQCPKTPQEIEDMRRIPYASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILKYLRRTRDYHLVYG

Query:  SKDLILTGYTNSDFQTDKASRKSTSGSVFTLNGGAVVWRSIKQGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLNLRITLYCDNSGAVANSKE
        +KDLILTGYT+ DFQTDK SRKSTSGSVFTLNGG VVWRSIKQGCIA  TMEAEYVAACEAAKEAVWLRKFL DLEVVPN+NL ITLYCDNSGAVAN KE
Subjt:  SKDLILTGYTNSDFQTDKASRKSTSGSVFTLNGGAVVWRSIKQGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLNLRITLYCDNSGAVANSKE

Query:  PRSHKRGKHIE
        PRSHKRGK  E
Subjt:  PRSHKRGKHIE

A0A5A7UYE8 Gag/pol protein1.8e-29752.15Show/hide
Query:  SSWQQLDAGKMTLRVGTGELISAKVVGDIKMFFSREHYMLLDNDWRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKGYCAKEPLELVHTDLCGPMN
        SS++QL+  +MTL+VGTG++ISA+ VGD K+      ++ LD   RLVKNGLL++L+D +LPPCESCLEGKMTKRPFTGKGY AKEPLEL+H+DLCGPMN
Subjt:  SSWQQLDAGKMTLRVGTGELISAKVVGDIKMFFSREHYMLLDNDWRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKGYCAKEPLELVHTDLCGPMN

Query:  VRARGGYEYFISFIDDYSR---------------------TEVENLLGKTIKI-----------------------------------------------
        V+ARGG+EYFISFIDDYSR                     TEVENLL K IKI                                               
Subjt:  VRARGGYEYFISFIDDYSR---------------------TEVENLLGKTIKI-----------------------------------------------

Query:  ------------------------------------------------PEL-----------------------------------------FDRTWN--
                                                        P L                                         FD   N  
Subjt:  ------------------------------------------------PEL-----------------------------------------FDRTWN--

Query:  LVSTLSTRVVDE----------------VGPSTRVVDEPSSEAGPSTRVADEPSTSGQAHPSQELRMPRRSGMVVIQPERYLGLAETPTVIPDGGVEDPL
         VST +T + ++                   STRVVD    E GPS+RV DE +TSGQ+HPSQ LRMPRRSG VV QP RYLGL ET  VIPD GVEDPL
Subjt:  LVSTLSTRVVDE----------------VGPSTRVVDEPSSEAGPSTRVADEPSTSGQAHPSQELRMPRRSGMVVIQPERYLGLAETPTVIPDGGVEDPL

Query:  TFRHAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIRCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLFIAT
        +++ AMNDVD DQWVKAMDLEMESMYFN VW+LVDLPEGVKPI CKWIYKRKRD+AGKVQTFKARLVAKGYTQ+EGVDYEETFSPVAMLKSIRILL IAT
Subjt:  TFRHAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIRCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLFIAT

Query:  FYDYEIWQMDVKIAFLNGNLDENIFMSQPKGFVTQGQEQKVCKLRRSIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVYKRIINTSVAFLVLYVDDIL
        FYDYEIWQMDVK AFLNGNL+E+IFMSQP+GF+TQGQEQKVCKL RSI+GLKQ SRSWNI+FDT +KSYGFDQNVDEPCVYK+I    VAFLVLYVDDIL
Subjt:  FYDYEIWQMDVKIAFLNGNLDENIFMSQPKGFVTQGQEQKVCKLRRSIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVYKRIINTSVAFLVLYVDDIL

Query:  LIGNDVEYLHNKKFKFCRRFFSRWQKIGQKSPSRQPYFPRHHIFNPLPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATVADSSRRLHTQP
        LIGNDV Y                                                                                            
Subjt:  LIGNDVEYLHNKKFKFCRRFFSRWQKIGQKSPSRQPYFPRHHIFNPLPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATVADSSRRLHTQP

Query:  PPTTISPPSIVVVGFLDYQSPPLPFHILLVLFLLHLVAGAFLSSIVIHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDVLSDFYYKDSES
                                                                                                            
Subjt:  PPTTISPPSIVVVGFLDYQSPPLPFHILLVLFLLHLVAGAFLSSIVIHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDVLSDFYYKDSES

Query:  SLTSPTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVASFIQQKSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRKNKTLALSQATY
                                                                L D+K WL+ +FQ+KDLGEAQYVLGIQIIR+RKNKTLALSQATY
Subjt:  SLTSPTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVASFIQQKSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRKNKTLALSQATY

Query:  IDKMLSRYKMHNSKRGLLPFRHGVHLSKEQCPKTPQEIEDMRRIPYASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILKYLRRTRDYH
        IDK+L RY M NSK+GLLPFRHGVHLSKEQ PKTPQE+EDMRRIPYAS VGSLMY MLCTRPDI YAVGI+SRYQSNP LDHWTAVK +LKYLRRTRDY 
Subjt:  IDKMLSRYKMHNSKRGLLPFRHGVHLSKEQCPKTPQEIEDMRRIPYASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILKYLRRTRDYH

Query:  LVYGSKDLILTGYTNSDFQTDKASRKSTSGSVFTLNGGAVVWRSIKQGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLNLRITLYCDNSGAVA
        LVYG+KDLILTGYT+SDFQTDK SRKSTSGSVFTLNGGAVVWRSIKQGCIA STMEAEYVAACEAAKEAVWLRKFL DLEVVPN+NL ITLYCDNSGAVA
Subjt:  LVYGSKDLILTGYTNSDFQTDKASRKSTSGSVFTLNGGAVVWRSIKQGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLNLRITLYCDNSGAVA

Query:  NSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKVASAHNIADPFTKALTAKVFEGHLKGLG
        NSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTK+AS HNIADPFTK LTAKVFEGHL+ LG
Subjt:  NSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKVASAHNIADPFTKALTAKVFEGHLKGLG

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.4e-5723.96Show/hide
Query:  PLTFRHAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIRCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLFI
        P +F       D   W +A++ E+ +   N  W +   PE    +  +W++  K +  G    +KARLVA+G+TQK  +DYEETF+PVA + S R +L +
Subjt:  PLTFRHAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIRCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLFI

Query:  ATFYDYEIWQMDVKIAFLNGNLDENIFMSQPKGFVTQGQEQKVCKLRRSIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVY---KRIINTSVAFLVLY
           Y+ ++ QMDVK AFLNG L E I+M  P+G         VCKL ++I+GLKQ +R W   F+  +K   F  +  + C+Y   K  IN ++ +++LY
Subjt:  ATFYDYEIWQMDVKIAFLNGNLDENIFMSQPKGFVTQGQEQKVCKLRRSIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVY---KRIINTSVAFLVLY

Query:  VDDILLIGNDVEYLHNKKFKFCRRFFSRWQKIGQKSPSRQPYFPRHHIFNPLPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATVADSSRR
        VDD+++   D+  ++N                                                                                    
Subjt:  VDDILLIGNDVEYLHNKKFKFCRRFFSRWQKIGQKSPSRQPYFPRHHIFNPLPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATVADSSRR

Query:  LHTQPPPTTISPPSIVVVGFLDYQSPPLPFHILLVLFLLHLVAGAFLSSIVIHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDVLSDFYY
                                                                                                            
Subjt:  LHTQPPPTTISPPSIVVVGFLDYQSPPLPFHILLVLFLLHLVAGAFLSSIVIHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDVLSDFYY

Query:  KDSESSLTSPTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVASFIQQKSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRKNKTLAL
                                                                         K++L  +F++ DL E ++ +GI+I    +   + L
Subjt:  KDSESSLTSPTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVASFIQQKSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRKNKTLAL

Query:  SQATYIDKMLSRYKMHNSKRGLLPFRHGVHL----SKEQCPKTPQEIEDMRRIPYASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILK
        SQ+ Y+ K+LS++ M N      P    ++     S E C             P  S +G LMY+MLCTRPD+  AV I+SRY S    + W  +K +L+
Subjt:  SQATYIDKMLSRYKMHNSKRGLLPFRHGVHL----SKEQCPKTPQEIEDMRRIPYASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILK

Query:  YLRRTRDYHLVYGSKDLI----LTGYTNSDFQTDKASRKSTSGSVFTL-NGGAVVWRSIKQGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLN
        YL+ T D  L++  K+L     + GY +SD+   +  RKST+G +F + +   + W + +Q  +A S+ EAEY+A  EA +EA+WL+  L  + +   L 
Subjt:  YLRRTRDYHLVYGSKDLI----LTGYTNSDFQTDKASRKSTSGSVFTL-NGGAVVWRSIKQGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLN

Query:  LRITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKVASAHNIADPFTKALTAKVF
          I +Y DN G ++ +  P  HKR KHI+ KYH  RE VQ   + +  + + + +AD FTK L A  F
Subjt:  LRITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKVASAHNIADPFTKALTAKVF

P0CV72 Secreted RxLR effector protein 1611.8e-2848.12Show/hide
Query:  MRRIPYASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILKYLRRTRDYHLVYGSKDLI-LTGYTNSDFQTDKASRKSTSGSVFTLNGGA
        M+ +PY S VG++MY+M+ TRPD+  AVG++S++ S+P   HW A+K +L+YL+ T+ Y L +       L GY+++D+  D  SR+STSG +F LNGG 
Subjt:  MRRIPYASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILKYLRRTRDYHLVYGSKDLI-LTGYTNSDFQTDKASRKSTSGSVFTLNGGA

Query:  VVWRSIKQGCIAYSTMEAEYVAACEAAKEAVWL
        V WRS KQ  +A S+ E EY+A  EA +EAVWL
Subjt:  VVWRSIKQGCIAYSTMEAEYVAACEAAKEAVWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.0e-10029.45Show/hide
Query:  DYSRTEVENLLGKTIKIPELFDRTWNLVSTLSTRVVDEVGPSTRVVDEPSSEAGPSTRVADEPSTSGQAHPSQELRMPRRSGMVVIQPERYLGLAETPTV
        D S      ++   + IP   +   +  ST  T  V E G     V E   +        + P+   + H  Q L   RRS    ++  RY    E   +
Subjt:  DYSRTEVENLLGKTIKIPELFDRTWNLVSTLSTRVVDEVGPSTRVVDEPSSEAGPSTRVADEPSTSGQAHPSQELRMPRRSGMVVIQPERYLGLAETPTV

Query:  IPDGGVEDPLTFRHAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIRCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLK
          D    +P + +  ++  + +Q +KAM  EMES+  N  + LV+LP+G +P++CKW++K K+D   K+  +KARLV KG+ QK+G+D++E FSPV  + 
Subjt:  IPDGGVEDPLTFRHAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIRCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLK

Query:  SIRILLFIATFYDYEIWQMDVKIAFLNGNLDENIFMSQPKGFVTQGQEQKVCKLRRSIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVY-KRIINTSV
        SIR +L +A   D E+ Q+DVK AFL+G+L+E I+M QP+GF   G++  VCKL +S++GLKQ  R W +KFD+ +KS  + +   +PCVY KR    + 
Subjt:  SIRILLFIATFYDYEIWQMDVKIAFLNGNLDENIFMSQPKGFVTQGQEQKVCKLRRSIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVY-KRIINTSV

Query:  AFLVLYVDDILLIGNDVEYLHNKKFKFCRRFFSRWQKIGQKSPSRQPYFPRHHIFNPLPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATV
          L+LYVDD+L++G D                                                                                    
Subjt:  AFLVLYVDDILLIGNDVEYLHNKKFKFCRRFFSRWQKIGQKSPSRQPYFPRHHIFNPLPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATV

Query:  ADSSRRLHTQPPPTTISPPSIVVVGFLDYQSPPLPFHILLVLFLLHLVAGAFLSSIVIHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDV
                                                                                                            
Subjt:  ADSSRRLHTQPPPTTISPPSIVVVGFLDYQSPPLPFHILLVLFLLHLVAGAFLSSIVIHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDV

Query:  LSDFYYKDSESSLTSPTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVASFIQQKSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRK
                                          KG                              ++A +K  LS  F +KDLG AQ +LG++I+R R 
Subjt:  LSDFYYKDSESSLTSPTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVASFIQQKSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRK

Query:  NKTLALSQATYIDKMLSRYKMHNSKRGLLPFRHGVHLSKEQCPKTPQEIEDMRRIPYASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNI
        ++ L LSQ  YI+++L R+ M N+K    P    + LSK+ CP T +E  +M ++PY+S VGSLMY M+CTRPDI +AVG++SR+  NP  +HW AVK I
Subjt:  NKTLALSQATYIDKMLSRYKMHNSKRGLLPFRHGVHLSKEQCPKTPQEIEDMRRIPYASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNI

Query:  LKYLRRTRDYHLVYGSKDLILTGYTNSDFQTDKASRKSTSGSVFTLNGGAVVWRSIKQGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLNLRI
        L+YLR T    L +G  D IL GYT++D   D  +RKS++G +FT +GGA+ W+S  Q C+A ST EAEY+AA E  KE +WL++FL +L +        
Subjt:  LKYLRRTRDYHLVYGSKDLILTGYTNSDFQTDKASRKSTSGSVFTLNGGAVVWRSIKQGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLNLRI

Query:  TLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKVASAHNIADPFTKALTAKVFE
         +YCD+  A+  SK    H R KHI+ +YH IRE+V    + V K+++  N AD  TK +    FE
Subjt:  TLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKVASAHNIADPFTKALTAKVFE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.5e-1131.75Show/hide
Query:  GELISAKVVGDIKMFFSREHYMLLDNDWRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKGYCAKEPLELVHTDLCGPMNVRARGGYEYFISFIDDY
        GEL +A+    + ++  R  +M       L K  L+S  +  T+ PC+ CL GK  +  F          L+LV++D+CGPM + + GG +YF++FIDD 
Subjt:  GELISAKVVGDIKMFFSREHYMLLDNDWRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKGYCAKEPLELVHTDLCGPMNVRARGGYEYFISFIDDY

Query:  SRTEVENLLGKTIKIPELFDRTWNLV
        SR     +L    ++ ++F +   LV
Subjt:  SRTEVENLLGKTIKIPELFDRTWNLV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.4e-5224.52Show/hide
Query:  PSTRVVDEPSSEAGPSTRVADEP------STSGQAHPSQELRMPRRSGMVVIQPERYLGLAETPTVIPDGGVEDPLTFRHAMNDVDHDQWVKAMDLEMES
        P+T      +S   PS  +   P      + + QA P     M  R+   +I+P     LA     +      +P T   A+ D   ++W  AM  E+ +
Subjt:  PSTRVVDEPSSEAGPSTRVADEP------STSGQAHPSQELRMPRRSGMVVIQPERYLGLAETPTVIPDGGVEDPLTFRHAMNDVDHDQWVKAMDLEMES

Query:  MYFNQVWDLVDLPEG-VKPIRCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLFIATFYDYEIWQMDVKIAFLNGNLDEN
           N  WDLV  P   V  + C+WI+ +K ++ G +  +KARLVAKGY Q+ G+DY ETFSPV    SIRI+L +A    + I Q+DV  AFL G L ++
Subjt:  MYFNQVWDLVDLPEG-VKPIRCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLFIATFYDYEIWQMDVKIAFLNGNLDEN

Query:  IFMSQPKGFVTQGQEQKVCKLRRSIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVYKRIINTSVAFLVLYVDDILLIGNDVEYLHNKKFKFCRRFFSR
        ++MSQP GF+ + +   VCKLR++++GLKQ  R+W ++    + + GF  +V +  ++      S+ ++++YVDDIL+ GND   LHN            
Subjt:  IFMSQPKGFVTQGQEQKVCKLRRSIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVYKRIINTSVAFLVLYVDDILLIGNDVEYLHNKKFKFCRRFFSR

Query:  WQKIGQKSPSRQPYFPRHHIFNPLPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATVADSSRRLHTQPPPTTISPPSIVVVGFLDYQSPPL
                                                                                                            
Subjt:  WQKIGQKSPSRQPYFPRHHIFNPLPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATVADSSRRLHTQPPPTTISPPSIVVVGFLDYQSPPL

Query:  PFHILLVLFLLHLVAGAFLSSIVIHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDVLSDFYYKDSESSLTSPTTNDEEQDEKAKAPIIQQ
                                                              T+ N SQ                                       
Subjt:  PFHILLVLFLLHLVAGAFLSSIVIHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDVLSDFYYKDSESSLTSPTTNDEEQDEKAKAPIIQQ

Query:  KGCCKVTSENVDVDKAVASFIQQKSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRKNKTLALSQATYIDKMLSRYKMHNSKRGLLPFRHG
                                                    F +KD  E  Y LGI+    R    L LSQ  YI  +L+R  M  +K    P    
Subjt:  KGCCKVTSENVDVDKAVASFIQQKSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRKNKTLALSQATYIDKMLSRYKMHNSKRGLLPFRHG

Query:  VHLSKEQCPKTPQEIEDMRRIPYASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILKYLRRTRDYHL-VYGSKDLILTGYTNSDFQTDK
          LS     K     E      Y   VGSL Y+   TRPDI YAV  +S++   P  +H  A+K IL+YL  T ++ + +     L L  Y+++D+  DK
Subjt:  VHLSKEQCPKTPQEIEDMRRIPYASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILKYLRRTRDYHL-VYGSKDLILTGYTNSDFQTDK

Query:  ASRKSTSGSVFTLNGGAVVWRSIKQGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLNLRITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIR
            ST+G +  L    + W S KQ  +  S+ EAEY +    + E  W+   L +L +   L     +YCDN GA      P  H R KHI   YH IR
Subjt:  ASRKSTSGSVFTLNGGAVVWRSIKQGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLNLRITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIR

Query:  EIVQRGDVIVTKVASAHNIADPFTKALTAKVFEGHLKGLG
          VQ G + V  V++   +AD  TK L+   F+     +G
Subjt:  EIVQRGDVIVTKVASAHNIADPFTKALTAKVFEGHLKGLG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.2e-5324.64Show/hide
Query:  DPLTFRHAMNDVDHDQWVKAMDLEMESMYFNQVWDLV-DLPEGVKPIRCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILL
        +P T   AM D   D+W +AM  E+ +   N  WDLV   P  V  + C+WI+ +K ++ G +  +KARLVAKGY Q+ G+DY ETFSPV    SIRI+L
Subjt:  DPLTFRHAMNDVDHDQWVKAMDLEMESMYFNQVWDLV-DLPEGVKPIRCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILL

Query:  FIATFYDYEIWQMDVKIAFLNGNLDENIFMSQPKGFVTQGQEQKVCKLRRSIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVYKRIINTSVAFLVLYV
         +A    + I Q+DV  AFL G L + ++MSQP GFV + +   VC+LR++I+GLKQ  R+W ++  T + + GF  ++ +  ++      S+ ++++YV
Subjt:  FIATFYDYEIWQMDVKIAFLNGNLDENIFMSQPKGFVTQGQEQKVCKLRRSIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVYKRIINTSVAFLVLYV

Query:  DDILLIGNDVEYLHNKKFKFCRRFFSRWQKIGQKSPSRQPYFPRHHIFNPLPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATVADSSRRL
        DDIL+ GND                                                                                           
Subjt:  DDILLIGNDVEYLHNKKFKFCRRFFSRWQKIGQKSPSRQPYFPRHHIFNPLPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATVADSSRRL

Query:  HTQPPPTTISPPSIVVVGFLDYQSPPLPFHILLVLFLLHLVAGAFLSSIVIHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDVLSDFYYK
                                                                                                            
Subjt:  HTQPPPTTISPPSIVVVGFLDYQSPPLPFHILLVLFLLHLVAGAFLSSIVIHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDVLSDFYYK

Query:  DSESSLTSPTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVASFIQQKSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRKNKTLALS
                                                                  V+L      LS  F +K+  +  Y LGI+    R  + L LS
Subjt:  DSESSLTSPTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVASFIQQKSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRKNKTLALS

Query:  QATYIDKMLSRYKMHNSKRGLLPFRHGVHLSKEQCPKTPQEIEDMRRIPYASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILKYLRRT
        Q  Y   +L+R  M  +K    P      L+     K P   E      Y   VGSL Y+   TRPD+ YAV  +S+Y   P  DHW A+K +L+YL  T
Subjt:  QATYIDKMLSRYKMHNSKRGLLPFRHGVHLSKEQCPKTPQEIEDMRRIPYASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILKYLRRT

Query:  RDYHL-VYGSKDLILTGYTNSDFQTDKASRKSTSGSVFTLNGGAVVWRSIKQGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLNLRITLYCDN
         D+ + +     L L  Y+++D+  D     ST+G +  L    + W S KQ  +  S+ EAEY +    + E  W+   L +L +   L+    +YCDN
Subjt:  RDYHL-VYGSKDLILTGYTNSDFQTDKASRKSTSGSVFTLNGGAVVWRSIKQGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLNLRITLYCDN

Query:  SGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKVASAHNIADPFTKALTAKVFEGHLKGLG
         GA      P  H R KHI   YH IR  VQ G + V  V++   +AD  TK L+   F+   + +G
Subjt:  SGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKVASAHNIADPFTKALTAKVFEGHLKGLG

Arabidopsis top hitse value%identityAlignment
AT1G79640.1 Protein kinase superfamily protein3.6e-0839.56Show/hide
Query:  NETEKNAANGGTIGNTSQAV--DDVLSDFYYKDSESSLTSPTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVA-SFIQQKSHNMQIL
        ++T++   NG     T   V  DDV ++   K  +++     ++ +E D+K+K P++QQ+G  KVTSEN+D++K VA S I QKSH+MQ+L
Subjt:  NETEKNAANGGTIGNTSQAV--DDVLSDFYYKDSESSLTSPTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVA-SFIQQKSHNMQIL

AT1G79640.2 Protein kinase superfamily protein3.6e-0839.56Show/hide
Query:  NETEKNAANGGTIGNTSQAV--DDVLSDFYYKDSESSLTSPTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVA-SFIQQKSHNMQIL
        ++T++   NG     T   V  DDV ++   K  +++     ++ +E D+K+K P++QQ+G  KVTSEN+D++K VA S I QKSH+MQ+L
Subjt:  NETEKNAANGGTIGNTSQAV--DDVLSDFYYKDSESSLTSPTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVA-SFIQQKSHNMQIL

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.8e-5024.04Show/hide
Query:  EDPLTFRHAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIRCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILL
        ++P T+  A   +    W  AMD E+ +M     W++  LP   KPI CKW+YK K ++ G ++ +KARLVAKGYTQ+EG+D+ ETFSPV  L S++++L
Subjt:  EDPLTFRHAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIRCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILL

Query:  FIATFYDYEIWQMDVKIAFLNGNLDENIFMSQPKGFVT-QGQE---QKVCKLRRSIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVYKRIINTSVAFL
         I+  Y++ + Q+D+  AFLNG+LDE I+M  P G+   QG       VC L++SI+GLKQ SR W +KF  T+  +GF Q+  +   + +I  T    +
Subjt:  FIATFYDYEIWQMDVKIAFLNGNLDENIFMSQPKGFVT-QGQE---QKVCKLRRSIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVYKRIINTSVAFL

Query:  VLYVDDILLIGNDVEYLHNKKFKFCRRFFSRWQKIGQKSPSRQPYFPRHHIFNPLPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATVADS
        ++YVDDI++  N                                                                                        
Subjt:  VLYVDDILLIGNDVEYLHNKKFKFCRRFFSRWQKIGQKSPSRQPYFPRHHIFNPLPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATVADS

Query:  SRRLHTQPPPTTISPPSIVVVGFLDYQSPPLPFHILLVLFLLHLVAGAFLSSIVIHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDVLSD
                                                                                                            
Subjt:  SRRLHTQPPPTTISPPSIVVVGFLDYQSPPLPFHILLVLFLLHLVAGAFLSSIVIHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDVLSD

Query:  FYYKDSESSLTSPTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVASFIQQKSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRKNKT
                       ND   DE                                             +K  L + F+++DLG  +Y LG++I R+     
Subjt:  FYYKDSESSLTSPTTNDEEQDEKAKAPIIQQKGCCKVTSENVDVDKAVASFIQQKSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRKNKT

Query:  LALSQATYIDKMLSRYKMHNSKRGLLPFRHGVHLSKEQCPKTPQEIEDMRRIPYASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILKY
        + + Q  Y   +L    +   K   +P    V  S      +  +  D +   Y   +G LMY+ + TR DI +AV  +S++   P L H  AV  IL Y
Subjt:  LALSQATYIDKMLSRYKMHNSKRGLLPFRHGVHLSKEQCPKTPQEIEDMRRIPYASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILKY

Query:  LRRTRDYHLVYGSK-DLILTGYTNSDFQTDKASRKSTSGSVFTLNGGAVVWRSIKQGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLNLRITL
        ++ T    L Y S+ ++ L  ++++ FQ+ K +R+ST+G    L    + W+S KQ  ++ S+ EAEY A   A  E +WL +F  +L++   L+    L
Subjt:  LRRTRDYHLVYGSK-DLILTGYTNSDFQTDKASRKSTSGSVFTLNGGAVVWRSIKQGCIAYSTMEAEYVAACEAAKEAVWLRKFLMDLEVVPNLNLRITL

Query:  YCDNSGAVANSKEPRSHKRGKHIERKYHLIRE
        +CDN+ A+  +     H+R KHIE   H +RE
Subjt:  YCDNSGAVANSKEPRSHKRGKHIERKYHLIRE

ATMG00810.1 DNA/RNA polymerases superfamily protein1.2e-1631.1Show/hide
Query:  LSTEFQIKDLGEAQYVLGIQIIRNRKNKTLALSQATYIDKMLSRYKMHNSKRGLLPFRHGVHLSKEQCPKTPQEIEDMRRIPYASTVGSLMYVMLCTRPD
        LS+ F +KDLG   Y LGIQI  +     L LSQ  Y +++L+   M + K    P    ++ S     K P   +      + S VG+L Y+ L TRPD
Subjt:  LSTEFQIKDLGEAQYVLGIQIIRNRKNKTLALSQATYIDKMLSRYKMHNSKRGLLPFRHGVHLSKEQCPKTPQEIEDMRRIPYASTVGSLMYVMLCTRPD

Query:  IFYAVGIISRYQSNPELDHWTAVKNILKYLRRTRDYHL-VYGSKDLILTGYTNSDFQTDKASRKSTSGSVFTLNGGAVVWRSIKQGCIAYSTMEAEYVAA
        I YAV I+ +    P L  +  +K +L+Y++ T  + L ++ +  L +  + +SD+    ++R+ST+G    L    + W + +Q  ++ S+ E EY A 
Subjt:  IFYAVGIISRYQSNPELDHWTAVKNILKYLRRTRDYHL-VYGSKDLILTGYTNSDFQTDKASRKSTSGSVFTLNGGAVVWRSIKQGCIAYSTMEAEYVAA

Query:  CEAAKEAVW
           A E  W
Subjt:  CEAAKEAVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.2e-1340.7Show/hide
Query:  WVKAMDLEMESMYFNQVWDLVDLPEGVKPIRCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLFIA
        W +AM  E++++  N+ W LV  P     + CKW++K K  + G +   KARLVAKG+ Q+EG+ + ET+SPV    +IR +L +A
Subjt:  WVKAMDLEMESMYFNQVWDLVDLPEGVKPIRCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLFIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCTCCTGGCAACAGTTGGATGCTGGGAAAATGACTCTTAGAGTTGGGACTGGGGAGCTCATTTCAGCTAAAGTAGTGGGAGATATTAAAATGTTCTTCTCCAGGGA
ACACTATATGTTATTAGATAATGATTGGAGATTGGTTAAGAACGGACTTCTAAGCCAGCTAGAAGATGGTACTTTACCTCCGTGTGAGTCATGTCTCGAAGGTAAGATGA
CCAAACGACCTTTTACTGGAAAAGGTTATTGCGCCAAGGAGCCCTTAGAACTCGTACATACGGATCTCTGTGGTCCAATGAATGTAAGAGCTCGAGGAGGGTATGAATAT
TTCATCTCTTTCATAGATGATTATTCAAGGACTGAAGTTGAGAACTTGTTAGGTAAGACCATTAAAATTCCAGAACTATTTGATAGAACATGGAATCTCGTCTCAACTCT
CAGCACCAGAGTTGTTGATGAGGTTGGACCTTCAACAAGAGTTGTTGATGAGCCCAGTTCAGAGGCTGGACCTTCAACAAGAGTTGCTGATGAGCCCAGTACATCTGGTC
AAGCCCATCCTTCTCAAGAGTTGAGAATGCCTCGACGTAGTGGGATGGTTGTTATTCAACCTGAACGCTATCTGGGTTTAGCAGAAACTCCAACTGTCATACCTGATGGT
GGGGTTGAGGATCCATTGACCTTTAGACATGCAATGAATGATGTGGACCATGACCAGTGGGTCAAAGCTATGGACCTAGAGATGGAGTCTATGTACTTCAATCAAGTCTG
GGATCTTGTAGACCTACCTGAAGGGGTTAAACCCATACGGTGCAAATGGATCTACAAGAGAAAACGAGACGCAGCTGGGAAAGTACAAACTTTCAAAGCTCGACTTGTGG
CAAAGGGTTATACCCAAAAGGAGGGGGTGGACTATGAGGAAACCTTCTCCCCAGTTGCCATGTTAAAGTCGATAAGAATTCTCTTGTTCATTGCCACTTTTTATGATTAT
GAAATTTGGCAAATGGACGTCAAAATTGCCTTTCTGAATGGCAATCTTGATGAGAACATCTTTATGTCTCAACCAAAGGGGTTCGTTACTCAAGGTCAGGAGCAAAAAGT
TTGCAAGCTTAGACGATCCATTTTTGGGTTGAAGCAAACTTCCAGATCTTGGAATATAAAGTTTGACACTACTGTCAAATCTTATGGTTTTGACCAGAACGTTGATGAGC
CTTGTGTTTACAAAAGGATTATCAACACTTCAGTAGCTTTTCTGGTGTTGTATGTAGATGATATTCTTCTCATTGGGAATGATGTAGAATATCTACACAACAAGAAATTT
AAGTTTTGCCGACGATTTTTCTCCCGATGGCAAAAAATTGGTCAGAAAAGCCCTAGCCGCCAACCATACTTCCCTCGCCACCACATCTTCAACCCTCTTCCTTCCACCAT
CGTCGACTCTTCTCGTCGCCTCCGTTCACAACCACTGCCCCCGACTCTTCTCGTCGCCTCCGTTCACATCCTCCACCGTCAACTTTTCTCGTCGCCTCTATTCACAGCCA
CCGTCGCCGACTCTTCTCGTCGCCTCCATACACAGCCTCCGCCGACCACTATTTCGCCTCCATCCATAGTCGTTGTTGGGTTCCTAGATTACCAGTCGCCGCCCCTCCCA
TTTCACATTCTTCTTGTTCTTTTCCTCCTTCACCTTGTTGCCGGTGCTTTCCTTTCTTCCATTGTCATCCACTCCTTCTTCCTCCCTCTGTTCTGCTGCAGAGCACTACC
GTGTCATTACCCACTTAATGAGACAGAGAAAAATGCTGCAAATGGTGGCACAATTGGAAACACTTCTCAGGCGGTAGATGATGTGCTTTCGGATTTTTATTATAAAGATT
CCGAGTCATCATTAACAAGTCCAACAACAAATGATGAGGAACAAGATGAGAAAGCAAAGGCACCCATTATTCAGCAGAAAGGGTGTTGCAAAGTTACTTCGGAGAATGTT
GATGTAGATAAGGCAGTTGCATCCTTTATTCAACAAAAAAGTCACAATATGCAGATTCTTGTAGTGCTAGCTGACATTAAAAAATGGCTATCGACTGAGTTCCAAATTAA
AGATTTGGGTGAAGCTCAGTATGTTCTAGGGATTCAAATTATTAGGAATCGTAAGAACAAAACGTTAGCCCTGTCTCAAGCAACATATATAGACAAGATGTTGTCTAGAT
ATAAGATGCATAATTCCAAGAGGGGTTTACTACCCTTCAGGCATGGAGTTCATTTGTCTAAGGAACAGTGTCCTAAGACACCTCAAGAAATTGAGGATATGAGACGCATC
CCTTATGCTTCAACTGTGGGCAGTCTTATGTATGTCATGTTGTGTACTAGGCCTGACATTTTCTATGCAGTTGGGATAATCAGCAGGTACCAGTCCAATCCAGAGTTAGA
CCACTGGACTGCAGTTAAAAACATCCTCAAGTATCTTCGGAGAACGAGGGACTATCACCTTGTGTATGGCTCTAAGGATTTGATCCTTACAGGATACACAAACTCTGACT
TTCAGACTGATAAAGCTTCTAGGAAATCCACATCAGGGTCAGTATTTACCCTCAACGGGGGAGCTGTAGTCTGGCGTAGCATCAAGCAAGGATGCATTGCATACTCCACC
ATGGAGGCCGAATATGTCGCTGCTTGTGAAGCAGCTAAGGAGGCTGTATGGCTACGGAAGTTCTTGATGGATTTGGAAGTTGTTCCAAATTTGAACTTGCGAATCACCCT
TTACTGTGATAATAGTGGTGCAGTGGCAAATTCTAAAGAACCTCGCAGTCATAAAAGAGGCAAGCACATAGAGCGCAAATATCATCTCATTAGAGAGATCGTGCAACGAG
GGGACGTGATCGTCACGAAGGTAGCCTCTGCGCACAATATTGCTGATCCATTTACGAAGGCTCTCACGGCTAAAGTGTTCGAGGGTCATCTAAAGGGTCTAGGTAACAAC
CTAAAAGGTCTATTATATAGGGATAAGGCTGGGTACCTTATCCTGGTGACACTATGTGATACGGTCCACCTTGTATTCGATACAGATGCAATGATCCAAAGCATCCATGT
AGGAGACATGCGGGTGGGGGTATCCTATGCAATGAATTTGCATAAGACCGGACCACGAAATAGTAACCACTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGCTCCTGGCAACAGTTGGATGCTGGGAAAATGACTCTTAGAGTTGGGACTGGGGAGCTCATTTCAGCTAAAGTAGTGGGAGATATTAAAATGTTCTTCTCCAGGGA
ACACTATATGTTATTAGATAATGATTGGAGATTGGTTAAGAACGGACTTCTAAGCCAGCTAGAAGATGGTACTTTACCTCCGTGTGAGTCATGTCTCGAAGGTAAGATGA
CCAAACGACCTTTTACTGGAAAAGGTTATTGCGCCAAGGAGCCCTTAGAACTCGTACATACGGATCTCTGTGGTCCAATGAATGTAAGAGCTCGAGGAGGGTATGAATAT
TTCATCTCTTTCATAGATGATTATTCAAGGACTGAAGTTGAGAACTTGTTAGGTAAGACCATTAAAATTCCAGAACTATTTGATAGAACATGGAATCTCGTCTCAACTCT
CAGCACCAGAGTTGTTGATGAGGTTGGACCTTCAACAAGAGTTGTTGATGAGCCCAGTTCAGAGGCTGGACCTTCAACAAGAGTTGCTGATGAGCCCAGTACATCTGGTC
AAGCCCATCCTTCTCAAGAGTTGAGAATGCCTCGACGTAGTGGGATGGTTGTTATTCAACCTGAACGCTATCTGGGTTTAGCAGAAACTCCAACTGTCATACCTGATGGT
GGGGTTGAGGATCCATTGACCTTTAGACATGCAATGAATGATGTGGACCATGACCAGTGGGTCAAAGCTATGGACCTAGAGATGGAGTCTATGTACTTCAATCAAGTCTG
GGATCTTGTAGACCTACCTGAAGGGGTTAAACCCATACGGTGCAAATGGATCTACAAGAGAAAACGAGACGCAGCTGGGAAAGTACAAACTTTCAAAGCTCGACTTGTGG
CAAAGGGTTATACCCAAAAGGAGGGGGTGGACTATGAGGAAACCTTCTCCCCAGTTGCCATGTTAAAGTCGATAAGAATTCTCTTGTTCATTGCCACTTTTTATGATTAT
GAAATTTGGCAAATGGACGTCAAAATTGCCTTTCTGAATGGCAATCTTGATGAGAACATCTTTATGTCTCAACCAAAGGGGTTCGTTACTCAAGGTCAGGAGCAAAAAGT
TTGCAAGCTTAGACGATCCATTTTTGGGTTGAAGCAAACTTCCAGATCTTGGAATATAAAGTTTGACACTACTGTCAAATCTTATGGTTTTGACCAGAACGTTGATGAGC
CTTGTGTTTACAAAAGGATTATCAACACTTCAGTAGCTTTTCTGGTGTTGTATGTAGATGATATTCTTCTCATTGGGAATGATGTAGAATATCTACACAACAAGAAATTT
AAGTTTTGCCGACGATTTTTCTCCCGATGGCAAAAAATTGGTCAGAAAAGCCCTAGCCGCCAACCATACTTCCCTCGCCACCACATCTTCAACCCTCTTCCTTCCACCAT
CGTCGACTCTTCTCGTCGCCTCCGTTCACAACCACTGCCCCCGACTCTTCTCGTCGCCTCCGTTCACATCCTCCACCGTCAACTTTTCTCGTCGCCTCTATTCACAGCCA
CCGTCGCCGACTCTTCTCGTCGCCTCCATACACAGCCTCCGCCGACCACTATTTCGCCTCCATCCATAGTCGTTGTTGGGTTCCTAGATTACCAGTCGCCGCCCCTCCCA
TTTCACATTCTTCTTGTTCTTTTCCTCCTTCACCTTGTTGCCGGTGCTTTCCTTTCTTCCATTGTCATCCACTCCTTCTTCCTCCCTCTGTTCTGCTGCAGAGCACTACC
GTGTCATTACCCACTTAATGAGACAGAGAAAAATGCTGCAAATGGTGGCACAATTGGAAACACTTCTCAGGCGGTAGATGATGTGCTTTCGGATTTTTATTATAAAGATT
CCGAGTCATCATTAACAAGTCCAACAACAAATGATGAGGAACAAGATGAGAAAGCAAAGGCACCCATTATTCAGCAGAAAGGGTGTTGCAAAGTTACTTCGGAGAATGTT
GATGTAGATAAGGCAGTTGCATCCTTTATTCAACAAAAAAGTCACAATATGCAGATTCTTGTAGTGCTAGCTGACATTAAAAAATGGCTATCGACTGAGTTCCAAATTAA
AGATTTGGGTGAAGCTCAGTATGTTCTAGGGATTCAAATTATTAGGAATCGTAAGAACAAAACGTTAGCCCTGTCTCAAGCAACATATATAGACAAGATGTTGTCTAGAT
ATAAGATGCATAATTCCAAGAGGGGTTTACTACCCTTCAGGCATGGAGTTCATTTGTCTAAGGAACAGTGTCCTAAGACACCTCAAGAAATTGAGGATATGAGACGCATC
CCTTATGCTTCAACTGTGGGCAGTCTTATGTATGTCATGTTGTGTACTAGGCCTGACATTTTCTATGCAGTTGGGATAATCAGCAGGTACCAGTCCAATCCAGAGTTAGA
CCACTGGACTGCAGTTAAAAACATCCTCAAGTATCTTCGGAGAACGAGGGACTATCACCTTGTGTATGGCTCTAAGGATTTGATCCTTACAGGATACACAAACTCTGACT
TTCAGACTGATAAAGCTTCTAGGAAATCCACATCAGGGTCAGTATTTACCCTCAACGGGGGAGCTGTAGTCTGGCGTAGCATCAAGCAAGGATGCATTGCATACTCCACC
ATGGAGGCCGAATATGTCGCTGCTTGTGAAGCAGCTAAGGAGGCTGTATGGCTACGGAAGTTCTTGATGGATTTGGAAGTTGTTCCAAATTTGAACTTGCGAATCACCCT
TTACTGTGATAATAGTGGTGCAGTGGCAAATTCTAAAGAACCTCGCAGTCATAAAAGAGGCAAGCACATAGAGCGCAAATATCATCTCATTAGAGAGATCGTGCAACGAG
GGGACGTGATCGTCACGAAGGTAGCCTCTGCGCACAATATTGCTGATCCATTTACGAAGGCTCTCACGGCTAAAGTGTTCGAGGGTCATCTAAAGGGTCTAGGTAACAAC
CTAAAAGGTCTATTATATAGGGATAAGGCTGGGTACCTTATCCTGGTGACACTATGTGATACGGTCCACCTTGTATTCGATACAGATGCAATGATCCAAAGCATCCATGT
AGGAGACATGCGGGTGGGGGTATCCTATGCAATGAATTTGCATAAGACCGGACCACGAAATAGTAACCACTAG
Protein sequenceShow/hide protein sequence
MSSWQQLDAGKMTLRVGTGELISAKVVGDIKMFFSREHYMLLDNDWRLVKNGLLSQLEDGTLPPCESCLEGKMTKRPFTGKGYCAKEPLELVHTDLCGPMNVRARGGYEY
FISFIDDYSRTEVENLLGKTIKIPELFDRTWNLVSTLSTRVVDEVGPSTRVVDEPSSEAGPSTRVADEPSTSGQAHPSQELRMPRRSGMVVIQPERYLGLAETPTVIPDG
GVEDPLTFRHAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIRCKWIYKRKRDAAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLFIATFYDY
EIWQMDVKIAFLNGNLDENIFMSQPKGFVTQGQEQKVCKLRRSIFGLKQTSRSWNIKFDTTVKSYGFDQNVDEPCVYKRIINTSVAFLVLYVDDILLIGNDVEYLHNKKF
KFCRRFFSRWQKIGQKSPSRQPYFPRHHIFNPLPSTIVDSSRRLRSQPLPPTLLVASVHILHRQLFSSPLFTATVADSSRRLHTQPPPTTISPPSIVVVGFLDYQSPPLP
FHILLVLFLLHLVAGAFLSSIVIHSFFLPLFCCRALPCHYPLNETEKNAANGGTIGNTSQAVDDVLSDFYYKDSESSLTSPTTNDEEQDEKAKAPIIQQKGCCKVTSENV
DVDKAVASFIQQKSHNMQILVVLADIKKWLSTEFQIKDLGEAQYVLGIQIIRNRKNKTLALSQATYIDKMLSRYKMHNSKRGLLPFRHGVHLSKEQCPKTPQEIEDMRRI
PYASTVGSLMYVMLCTRPDIFYAVGIISRYQSNPELDHWTAVKNILKYLRRTRDYHLVYGSKDLILTGYTNSDFQTDKASRKSTSGSVFTLNGGAVVWRSIKQGCIAYST
MEAEYVAACEAAKEAVWLRKFLMDLEVVPNLNLRITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKVASAHNIADPFTKALTAKVFEGHLKGLGNN
LKGLLYRDKAGYLILVTLCDTVHLVFDTDAMIQSIHVGDMRVGVSYAMNLHKTGPRNSNH