; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032332 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032332
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr11:30635763..30644024
RNA-Seq ExpressionLag0032332
SyntenyLag0032332
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0055.32Show/hide
Query:  MSSSIISLLKNEQLNGENFQQWKSNLNTILMVEDLWFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE
        MSSSII+LLK +QL GEN+  WKS LN IL++ DL FVL EECPP P + A+Q+V+DAY+RWTKAN+K +++IL S+S++L+K++E + TAR+IM+SL+E
Subjt:  MSSSIISLLKNEQLNGENFQQWKSNLNTILMVEDLWFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE

Query:  MFGLPSYQLHHDALKNVFNAKMLEVQSVREHVLDMINQFNIAEANGGVVCERSQVAFILHLLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKG
        MFG PS Q                                                                                            
Subjt:  MFGLPSYQLHHDALKNVFNAKMLEVQSVREHVLDMINQFNIAEANGGVVCERSQVAFILHLLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKG

Query:  KNVVKGEANVAHSKKKFLKGSSLGTKSVPQASSSKQIQKRKGDKGKASAKAMQGKGKAKIVADKGRCFHCNADGHWKRNCPRYLAEKKRE----------
           +K EANVAHSK++F          VP  S S++IQKRK  KGK    A++ KGKAK VA K +CFHCN D HWK NCP+YL +KK +          
Subjt:  KNVVKGEANVAHSKKKFLKGSSLGTKSVPQASSSKQIQKRKGDKGKASAKAMQGKGKAKIVADKGRCFHCNADGHWKRNCPRYLAEKKRE----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------KKAETQNKRQKISPLSNNTYLWHLRLDHINIDQIDRLVKNGLLTDIEDTFLPPCESCLEGKMTK
                                            + A TQNKRQ+ISP +NNTYLWHLRL HIN+D+I RLVKNGLL  ++D  LPPCESCLEGKMTK
Subjt:  ------------------------------------KKAETQNKRQKISPLSNNTYLWHLRLDHINIDQIDRLVKNGLLTDIEDTFLPPCESCLEGKMTK

Query:  RPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYSISFIDDYSQYGYLYLMGHKSEALEKFKEFKAEVENLLGKTIKILRSDRGGEYLDQRFQDYMIEH
        RPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEY ISFIDDYS+YGYLYLM HKSEALEKFKE+K EVENLL K IKILRSDRGGEY+D RFQDYMIEH
Subjt:  RPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYSISFIDDYSQYGYLYLMGHKSEALEKFKEFKAEVENLLGKTIKILRSDRGGEYLDQRFQDYMIEH

Query:  GIQSQLSTPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLRYFRIWGCPAHVLVTNPKKLEPR
        GIQSQLS PGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSL +FRIWGCPAHVLVTNPKKLEPR
Subjt:  GIQSQLSTPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLRYFRIWGCPAHVLVTNPKKLEPR

Query:  TRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHIRDHRPRSKLVLGEATEGSTRVVDEPGPSTRVVGESIAATLFFYLQLPQRPFFNLCRRDSH
        +R+CQFVGYPKETRGGLF+DPQ+N+VFVSTNATFLEEDH+R+H+PRSKLVL EAT+ STRVVDE GPS+RV                             
Subjt:  TRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHIRDHRPRSKLVLGEATEGSTRVVDEPGPSTRVVGESIAATLFFYLQLPQRPFFNLCRRDSH

Query:  SSTFSRPNSSSSTFGATTPLLQPSAPQLLFFPSVVVFPFPPVKILQTFDRGFRPWKNIWDQNKTLAQRKMILSMSHSGDLNSPDKDPIKSLHFRISNPIY
                                                                   D+  T                                    
Subjt:  SSTFSRPNSSSSTFGATTPLLQPSAPQLLFFPSVVVFPFPPVKILQTFDRGFRPWKNIWDQNKTLAQRKMILSMSHSGDLNSPDKDPIKSLHFRISNPIY

Query:  RAFDLEVKDGHDLLIGLNSGDGSSSLMYEQDKNESGMYEQDKNESGMYENFSFLVIFVVVETQVVIPDDDVEDPLSYRQAMNDIDKDKWAKAMDFEMESM
                          SG    S      +    +  Q     G+             ETQVVIPDD VEDPLSY+QAMND+DKD+W KAMD EMESM
Subjt:  RAFDLEVKDGHDLLIGLNSGDGSSSLMYEQDKNESGMYEQDKNESGMYENFSFLVIFVVVETQVVIPDDDVEDPLSYRQAMNDIDKDKWAKAMDFEMESM

Query:  YFNQVWELVDPPEGVKPIGCKWIYKRKRDVAGKVQTFKASLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM----------------
        YFN VWELVD PEGVKPIGCKWIYKRKRD AGKVQTFKA LVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM                
Subjt:  YFNQVWELVDPPEGVKPIGCKWIYKRKRDVAGKVQTFKASLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM----------------

Query:  ---------------------------EASRSWNIRFDTTIKSFGFDQNVDEPCVYKKINKNKIAFLVLYVDDILLIGNDVGYLSNVKEWIAAQFQIKDL
                                   +ASRSWNIRFDT IKS+GFDQNVDEPCVYKKINK K+AFLVLYVDDILLIGNDVGYL++VK W+AAQFQ+KDL
Subjt:  ---------------------------EASRSWNIRFDTTIKSFGFDQNVDEPCVYKKINKNKIAFLVLYVDDILLIGNDVGYLSNVKEWIAAQFQIKDL

Query:  GEAQ--------------------------------------------HGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTTPDICYAVGIVSR
        GEAQ                                            HGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMY MLCT PDICYAVGIVSR
Subjt:  GEAQ--------------------------------------------HGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTTPDICYAVGIVSR

Query:  YQSSPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEVAKEAVWLR
        YQS+PGLDHWTAVK +LKYLRRTRDYMLV+GAK+L+LTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACE AKEAVWLR
Subjt:  YQSSPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEVAKEAVWLR

Query:  KFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIEKKYHLIREIVQRGDVVVTKIASEHNIVDPFTKALSAKVFEGHLESLGLRDMYI
        KFL DLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIE+KYHLIREIVQRGDV+VTKIASEHNI DPFTK L+AKVFEGHLESLGLRDMYI
Subjt:  KFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIEKKYHLIREIVQRGDVVVTKIASEHNIVDPFTKALSAKVFEGHLESLGLRDMYI

KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0054.24Show/hide
Query:  MSSSIISLLKNEQLNGENFQQWKSNLNTILMVEDLWFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE
        M+S+ +++L  ++LNG N+  WK+ +NT+L+++DL FVL EECP VP   A + V++ YERW KANEK + YIL SLSEVLAK++E++ TAREIM+SLQE
Subjt:  MSSSIISLLKNEQLNGENFQQWKSNLNTILMVEDLWFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE

Query:  MFGLPSYQLHHDALKNVFNAKMLEVQSVREHVLDMINQFNIAEANGGVVCERSQVAFILHLLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKG
        MFG  SYQ+ HDALK ++NA+M E  SVREHVL+M+  FN+AE NG V+ E SQV+FIL  LP S+L FR+NA MNKI + LTTLL+ELQ +ESL K KG
Subjt:  MFGLPSYQLHHDALKNVFNAKMLEVQSVREHVLDMINQFNIAEANGGVVCERSQVAFILHLLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKG

Query:  KNVVKGEANVAHSKKKFLKGSSLGTKSVPQASSSKQIQKRKGDKG-KASAKAMQGKGKAKIVADKGRCFHCNADGHWKRNCPRYLAEKKREKK-------
        +   KGEANVA S +KF +GS+ GTKS+P +S +K+ +K+KG +G KA+  A +   KAK  A KG CFHCN +GHWKRNCP+YLAEKK+ K+       
Subjt:  KNVVKGEANVAHSKKKFLKGSSLGTKSVPQASSSKQIQKRKGDKG-KASAKAMQGKGKAKIVADKGRCFHCNADGHWKRNCPRYLAEKKREKK-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------AETQNKRQKISPLSNNTYLWHLRLDHINIDQIDRLVKNGLLTDIEDTFLPPCESCLEGKM
                                                A TQNKR KISP   N +LWHLRL HIN+++I+RLVKNGLL+++E+  LP CESCLEGKM
Subjt:  ----------------------------------------AETQNKRQKISPLSNNTYLWHLRLDHINIDQIDRLVKNGLLTDIEDTFLPPCESCLEGKM

Query:  TKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYSISFIDDYSQYGYLYLMGHKSEALEKFKEFKAEVENLLGKTIKILRSDRGGEYLDQRFQDYMI
        TKRPFTGKG+RAKEPLEL+HSDLCGPMNVKARGGFEY I+F DDYS+YGY+YLM HKSEALEKFKE+KAEVEN L KTIK  RSDRGGEY+D +FQ+Y++
Subjt:  TKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYSISFIDDYSQYGYLYLMGHKSEALEKFKEFKAEVENLLGKTIKILRSDRGGEYLDQRFQDYMI

Query:  EHGIQSQLSTPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLRYFRIWGCPAHVLVTNPKKLE
        E GI SQLS PGTPQQNGVSERRNRTLLDMVRSMMSYA LP+SFWGYAV+TAV+ILN VPSKSVSETP +LW GRK SLR+FRIWGCPAHVL  NPKKLE
Subjt:  EHGIQSQLSTPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLRYFRIWGCPAHVLVTNPKKLE

Query:  PRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHIRDHRPRSKLVLG----EATEGSTRVVDEPGPSTRVVGESIAATLFFYLQLPQRPFFNL
        PR+++C FVGYPK TRGG FYDP+DNKVFVSTNATFLEEDHIR+H+PRSK+VL     E TE STRVV+EP   TRVV                      
Subjt:  PRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHIRDHRPRSKLVLG----EATEGSTRVVDEPGPSTRVVGESIAATLFFYLQLPQRPFFNL

Query:  CRRDSHSSTFSRPNSSSSTFGATTPLLQPSAPQLLFFPSVVVFPFPPVKILQTFDRGFRPWKNIWDQNKTLAQRKMILSMSHSGDLNSPDKDPIKSLHFR
                            G++T   QP +                                                      L  P +        R
Subjt:  CRRDSHSSTFSRPNSSSSTFGATTPLLQPSAPQLLFFPSVVVFPFPPVKILQTFDRGFRPWKNIWDQNKTLAQRKMILSMSHSGDLNSPDKDPIKSLHFR

Query:  ISNPIYRAFDLEVKDGHDLLIGLNSGDGSSSLMYEQDKNESGMYEQDKNESGMYENFSFLVIFVVVETQVVIPDDDVEDPLSYRQAMNDIDKDKWAKAMD
        ++N   R   L                                                       ET  VI D D+EDPL++++AM D+DKD+W KAM+
Subjt:  ISNPIYRAFDLEVKDGHDLLIGLNSGDGSSSLMYEQDKNESGMYEQDKNESGMYENFSFLVIFVVVETQVVIPDDDVEDPLSYRQAMNDIDKDKWAKAMD

Query:  FEMESMYFNQVWELVDPPEGVKPIGCKWIYKRKRDVAGKVQTFKASLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM----------
         E+ESMYFN VW+LVD P+GVKPIGCKWIYKRKR   GKVQTFKA LVAKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQM          
Subjt:  FEMESMYFNQVWELVDPPEGVKPIGCKWIYKRKRDVAGKVQTFKASLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM----------

Query:  ---------------------------------EASRSWNIRFDTTIKSFGFDQNVDEPCVYKKINKNKIAFLVLYVDDILLIGNDVGYLSNVKEWIAAQ
                                         +ASRSWNIRFDT IKS+GFDQ VDEPCVYK+I    +AFLVLYVDDILLIGND+G L+++K+W+A Q
Subjt:  ---------------------------------EASRSWNIRFDTTIKSFGFDQNVDEPCVYKKINKNKIAFLVLYVDDILLIGNDVGYLSNVKEWIAAQ

Query:  FQIKDLGEAQ--------------------------------------------HGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTTPDICYA
        FQ+KDLGEAQ                                            HGV LSKEQ PKTPQ+VE+MR IPYASAVGSLMY MLCT PDICYA
Subjt:  FQIKDLGEAQ--------------------------------------------HGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTTPDICYA

Query:  VGIVSRYQSSPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEVAK
        VGIVSRYQS+PGL HWTAVK ILKYLRRTRDY LV+G+K+L+LTGYTDSDFQTD+DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACE AK
Subjt:  VGIVSRYQSSPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEVAK

Query:  EAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIEKKYHLIREIVQRGDVVVTKIASEHNIVDPFTKALSAKVFEGHLESLGLRDM
        EAVWLR FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHKRGKHIE+KYHLIREIV RGDV+VT+IAS HN+ DPFTK L+AKVFEGHLESLGLRDM
Subjt:  EAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIEKKYHLIREIVQRGDVVVTKIASEHNIVDPFTKALSAKVFEGHLESLGLRDM

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0054.07Show/hide
Query:  MSSSIISLLKNEQLNGENFQQWKSNLNTILMVEDLWFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE
        M+S+ +++L  ++LNG N+  WK+ +NT+L+++DL FVL EECP VP   A + V++ YERW KANEK + YIL SLSEVLAK++E++ TAREIM+SLQE
Subjt:  MSSSIISLLKNEQLNGENFQQWKSNLNTILMVEDLWFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE

Query:  MFGLPSYQLHHDALKNVFNAKMLEVQSVREHVLDMINQFNIAEANGGVVCERSQVAFILHLLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKG
        MFG  SYQ+ HDALK ++NA+M E  SVREHVL+M+  FN+AE NG V+ E SQV+FIL  LP S+L FR+NA MNKI + LTTLL+ELQ +ESL K KG
Subjt:  MFGLPSYQLHHDALKNVFNAKMLEVQSVREHVLDMINQFNIAEANGGVVCERSQVAFILHLLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKG

Query:  KNVVKGEANVAHSKKKFLKGSSLGTKSVPQASSSKQIQKRKGDKGKASAKAMQGKGKAKIVADKGRCFHCNADGHWKRNCPRYLAEKKREKK--------
        +   KGEANVA S +KF +GS+ GTKS+P +S +K+ +K+KG +G   A     K   K  A KG CFHCN +GHWKRNCP+YLAEKK+ K+        
Subjt:  KNVVKGEANVAHSKKKFLKGSSLGTKSVPQASSSKQIQKRKGDKGKASAKAMQGKGKAKIVADKGRCFHCNADGHWKRNCPRYLAEKKREKK--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------AETQNKRQKISPLSNNTYLWHLRLDHINIDQIDRLVKNGLLTDIEDTFLPPCESCLEGKMT
                                               A TQNKR KISP   N +LWHLRL HIN+++I+RLVKNGLL+++E+  LP CESCLEGKMT
Subjt:  ---------------------------------------AETQNKRQKISPLSNNTYLWHLRLDHINIDQIDRLVKNGLLTDIEDTFLPPCESCLEGKMT

Query:  KRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYSISFIDDYSQYGYLYLMGHKSEALEKFKEFKAEVENLLGKTIKILRSDRGGEYLDQRFQDYMIE
        KRPFTGKG+RAKEPLEL+HSDLCGPMNVKARGGFEY I+F DDYS+YGY+YLM HKSEALEKFKE+KAEVEN L KTIK  RSDRGGEY+D +FQ+Y++E
Subjt:  KRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYSISFIDDYSQYGYLYLMGHKSEALEKFKEFKAEVENLLGKTIKILRSDRGGEYLDQRFQDYMIE

Query:  HGIQSQLSTPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLRYFRIWGCPAHVLVTNPKKLEP
         GI SQLS PGTPQQNGVSERRNRTLLDMVRSMMSYA LP+SFWGYAV+TAV+ILN VPSKSVSETP +LW GRK SLR+FRIWGCPAHVL  NPKKLEP
Subjt:  HGIQSQLSTPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLRYFRIWGCPAHVLVTNPKKLEP

Query:  RTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHIRDHRPRSKLVLG----EATEGSTRVVDEPGPSTRVVGESIAATLFFYLQLPQRPFFNLC
        R+++C FVGYPK TRGG FYDP+DNKVFVSTNATFLEEDHIR+H+PRSK+VL     E TE STRVV+EP   TRVV                       
Subjt:  RTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHIRDHRPRSKLVLG----EATEGSTRVVDEPGPSTRVVGESIAATLFFYLQLPQRPFFNLC

Query:  RRDSHSSTFSRPNSSSSTFGATTPLLQPSAPQLLFFPSVVVFPFPPVKILQTFDRGFRPWKNIWDQNKTLAQRKMILSMSHSGDLNSPDKDPIKSLHFRI
                           G++T   QP +                                                      L  P +        R+
Subjt:  RRDSHSSTFSRPNSSSSTFGATTPLLQPSAPQLLFFPSVVVFPFPPVKILQTFDRGFRPWKNIWDQNKTLAQRKMILSMSHSGDLNSPDKDPIKSLHFRI

Query:  SNPIYRAFDLEVKDGHDLLIGLNSGDGSSSLMYEQDKNESGMYEQDKNESGMYENFSFLVIFVVVETQVVIPDDDVEDPLSYRQAMNDIDKDKWAKAMDF
        +N   R   L                                                       ET  VI D D+EDPL++++AM D+DKD+W KAM+ 
Subjt:  SNPIYRAFDLEVKDGHDLLIGLNSGDGSSSLMYEQDKNESGMYEQDKNESGMYENFSFLVIFVVVETQVVIPDDDVEDPLSYRQAMNDIDKDKWAKAMDF

Query:  EMESMYFNQVWELVDPPEGVKPIGCKWIYKRKRDVAGKVQTFKASLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM-----------
        E+ESMYFN VW+LVD P+GVKPIGCKWIYKRKR   GKVQTFKA LVAKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQM           
Subjt:  EMESMYFNQVWELVDPPEGVKPIGCKWIYKRKRDVAGKVQTFKASLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM-----------

Query:  --------------------------------EASRSWNIRFDTTIKSFGFDQNVDEPCVYKKINKNKIAFLVLYVDDILLIGNDVGYLSNVKEWIAAQF
                                        +ASRSWNIRFDT IKS+GFDQ VDEPCVYK+I    +AFLVLYVDDILLIGND+G L+++K+W+A QF
Subjt:  --------------------------------EASRSWNIRFDTTIKSFGFDQNVDEPCVYKKINKNKIAFLVLYVDDILLIGNDVGYLSNVKEWIAAQF

Query:  QIKDLGEAQ--------------------------------------------HGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTTPDICYAV
        Q+KDLGEAQ                                            HGV LSKEQ PKTPQ+VE+MR IPYASAVGSLMY MLCT PDICYAV
Subjt:  QIKDLGEAQ--------------------------------------------HGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTTPDICYAV

Query:  GIVSRYQSSPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEVAKE
        GIVSRYQS+PGL HWTAVK ILKYLRRTRDY LV+G+K+L+LTGYTDSDFQTD+DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACE AKE
Subjt:  GIVSRYQSSPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEVAKE

Query:  AVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIEKKYHLIREIVQRGDVVVTKIASEHNIVDPFTKALSAKVFEGHLESLGLRDM
        AVWLR FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHKRGKHIE+KYHLIREIV RGDV+VT+IAS HN+ DPFTK L+AKVFEGHLESLGLRDM
Subjt:  AVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIEKKYHLIREIVQRGDVVVTKIASEHNIVDPFTKALSAKVFEGHLESLGLRDM

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0054.3Show/hide
Query:  MSSSIISLLKNEQLNGENFQQWKSNLNTILMVEDLWFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE
        M+S+ +++L  ++LNG N+  WK N NT+L+++DL FVL EECP VP   A + V++ YERW KANEK + YIL SLSEVLAK++E++ TAREIM+SLQE
Subjt:  MSSSIISLLKNEQLNGENFQQWKSNLNTILMVEDLWFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE

Query:  MFGLPSYQLHHDALKNVFNAKMLEVQSVREHVLDMINQFNIAEANGGVVCERSQVAFILHLLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKG
        MFG  SYQ+ HDALK ++NA+M E  SVREHVL+M+  FN+AE NG V+ E SQV+FIL  LP S+L FR+NA MNKI + LTTLL+ELQ +ESL K KG
Subjt:  MFGLPSYQLHHDALKNVFNAKMLEVQSVREHVLDMINQFNIAEANGGVVCERSQVAFILHLLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKG

Query:  KNVVKGEANVAHSKKKFLKGSSLGTKSVPQASSSKQIQKRKGDKG-KASAKAMQGKGKAKIVADKGRCFHCNADGHWKRNCPRYLAEKKREKK-------
        +   KGEANVA S +KF +GS+ GTKS+P +S +K+ +K+KG +G KA+  A +   KAK  A KG CFHCN +GHWKRNCP+YLAEKK+ K+       
Subjt:  KNVVKGEANVAHSKKKFLKGSSLGTKSVPQASSSKQIQKRKGDKG-KASAKAMQGKGKAKIVADKGRCFHCNADGHWKRNCPRYLAEKKREKK-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------AETQNKRQKISPLSNNTYLWHLRLDHINIDQIDRLVKNGLLTDIEDTFLPPCESCLEGKM
                                                A TQNKR KISP   N +LWHLRL HIN+++I+RLVKNGLL+++E+  LP CESCLEGKM
Subjt:  ----------------------------------------AETQNKRQKISPLSNNTYLWHLRLDHINIDQIDRLVKNGLLTDIEDTFLPPCESCLEGKM

Query:  TKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYSISFIDDYSQYGYLYLMGHKSEALEKFKEFKAEVENLLGKTIKILRSDRGGEYLDQRFQDYMI
        TKRPFTGKG+RAKEPLEL+HSDLCGPMNVKARGGFEY I+F DDYS+YGY+YLM HKSEALEKFKE+KAEVEN L KTIK  RSDRGGEY+D +FQ+Y++
Subjt:  TKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYSISFIDDYSQYGYLYLMGHKSEALEKFKEFKAEVENLLGKTIKILRSDRGGEYLDQRFQDYMI

Query:  EHGIQSQLSTPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLRYFRIWGCPAHVLVTNPKKLE
        E GI SQLS PGTPQQNGVSERRNRTLLDMVRSMMSYA LP+SFWGYAV+TAV+ILN VPSKSVSETP +LW GRK SLR+FRIWGCPAHVL  NPKKLE
Subjt:  EHGIQSQLSTPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLRYFRIWGCPAHVLVTNPKKLE

Query:  PRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHIRDHRPRSKLVLG----EATEGSTRVVDEPGPSTRVVGESIAATLFFYLQLPQRPFFNL
        PR+++C FVGYPK TRGG FYDP+DNKVFVSTNATFLEEDHIR+H+PRSK+VL     E TE STRVV+EP   TRVV                      
Subjt:  PRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHIRDHRPRSKLVLG----EATEGSTRVVDEPGPSTRVVGESIAATLFFYLQLPQRPFFNL

Query:  CRRDSHSSTFSRPNSSSSTFGATTPLLQPSAPQLLFFPSVVVFPFPPVKILQTFDRGFRPWKNIWDQNKTLAQRKMILSMSHSGDLNSPDKDPIKSLHFR
                            G++T   QP +                                                      L  P +        R
Subjt:  CRRDSHSSTFSRPNSSSSTFGATTPLLQPSAPQLLFFPSVVVFPFPPVKILQTFDRGFRPWKNIWDQNKTLAQRKMILSMSHSGDLNSPDKDPIKSLHFR

Query:  ISNPIYRAFDLEVKDGHDLLIGLNSGDGSSSLMYEQDKNESGMYEQDKNESGMYENFSFLVIFVVVETQVVIPDDDVEDPLSYRQAMNDIDKDKWAKAMD
        ++N   R   L                                                       ET  VI D D+EDPL++++AM D+DKD+W KAM+
Subjt:  ISNPIYRAFDLEVKDGHDLLIGLNSGDGSSSLMYEQDKNESGMYEQDKNESGMYENFSFLVIFVVVETQVVIPDDDVEDPLSYRQAMNDIDKDKWAKAMD

Query:  FEMESMYFNQVWELVDPPEGVKPIGCKWIYKRKRDVAGKVQTFKASLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM----------
         E+ESMYFN VW+LVD P+GVKPIGCKWIYKRKR   GKVQTFKA LVAKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQM          
Subjt:  FEMESMYFNQVWELVDPPEGVKPIGCKWIYKRKRDVAGKVQTFKASLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM----------

Query:  ---------------------------------EASRSWNIRFDTTIKSFGFDQNVDEPCVYKKINKNKIAFLVLYVDDILLIGNDVGYLSNVKEWIAAQ
                                         +ASRSWNIRFDT IKS+GFDQ VDEPCVYK+I    +AFLVLYVDDILLIGND+G L+++K+W+A Q
Subjt:  ---------------------------------EASRSWNIRFDTTIKSFGFDQNVDEPCVYKKINKNKIAFLVLYVDDILLIGNDVGYLSNVKEWIAAQ

Query:  FQIKDLGEAQ--------------------------------------------HGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTTPDICYA
        FQ+KDLGEAQ                                            HGV LSKEQ PKTPQ+VE+MR IPYASAVGSLMY MLCT PDICYA
Subjt:  FQIKDLGEAQ--------------------------------------------HGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTTPDICYA

Query:  VGIVSRYQSSPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEVAK
        VGIVSRYQS+PGL HWTAVK ILKYLRRTRDY LV+G+K+L+LTGYTDSDFQTD+DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACE AK
Subjt:  VGIVSRYQSSPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEVAK

Query:  EAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIEKKYHLIREIVQRGDVVVTKIASEHNIVDPFTKALSAKVFEGHLESLGLRDM
        EAVWLR FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHKRGKHIE+KYHLIREIV RGDV+VT+IAS HN+ DPFTK L+AKVFEGHLESLGLRDM
Subjt:  EAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIEKKYHLIREIVQRGDVVVTKIASEHNIVDPFTKALSAKVFEGHLESLGLRDM

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0059.39Show/hide
Query:  MSSSIISLLKNEQLNGENFQQWKSNLNTILMVEDLWFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE
        MSSSII+LLK +QL GEN+  WKS LN IL++ DL FVL EECPP P + A+Q+V+DAY+RWTKAN+K +++IL S+S++L+K++E + TAR+IM+SL+E
Subjt:  MSSSIISLLKNEQLNGENFQQWKSNLNTILMVEDLWFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE

Query:  MFGLPSYQLHHDALKNVFNAKMLEVQSVREHVLDMINQFNIAEANGGVVCERSQVAFILHLLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKG
        MFG PS Q                                                                                            
Subjt:  MFGLPSYQLHHDALKNVFNAKMLEVQSVREHVLDMINQFNIAEANGGVVCERSQVAFILHLLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKG

Query:  KNVVKGEANVAHSKKKFLKGSSLGTKSVPQASSSKQIQKRKGDKGKASAKAMQGKGKAKIVADKGRCFHCNADGHWKRNCPRYLAEKKREKKAETQNKRQ
           +K EANVAHSK++F          VP  S S++IQKRK  KGK    A++ KGKAK VA K +CFHCN D HWK NCP+YL  KK+EK+  T +   
Subjt:  KNVVKGEANVAHSKKKFLKGSSLGTKSVPQASSSKQIQKRKGDKGKASAKAMQGKGKAKIVADKGRCFHCNADGHWKRNCPRYLAEKKREKKAETQNKRQ

Query:  KISPLSNNTYL--------------------WHLRLDHINIDQIDRLVKNGLLTDIEDTFLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMN
         +   S+   L                       +L HIN+D+I RLVKNGLL  ++D  LPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMN
Subjt:  KISPLSNNTYL--------------------WHLRLDHINIDQIDRLVKNGLLTDIEDTFLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMN

Query:  VKARGGFEYSISFIDDYSQYGYLYLMGHKSEALEKFKEFKAEVENLLGKTIKILRSDRGGEYLDQRFQDYMIEHGIQSQLSTPGTPQQNGVSERRNRTLL
        VKARGGFEY ISFIDDYS+YGYLYLM HKSEALEKFKE+K EVENLL K IKILRSDRGGEY+D RFQDYMIEHGIQSQLS PGTPQQNGVSERRNRTLL
Subjt:  VKARGGFEYSISFIDDYSQYGYLYLMGHKSEALEKFKEFKAEVENLLGKTIKILRSDRGGEYLDQRFQDYMIEHGIQSQLSTPGTPQQNGVSERRNRTLL

Query:  DMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLRYFRIWGCPAHVLVTNPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKV
        DMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSL +FRIWGCPAHVLVTNPKKLEPR+R+CQFVGYPKETRGGLF+DPQ+N+V
Subjt:  DMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLRYFRIWGCPAHVLVTNPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKV

Query:  FVSTNATFLEEDHIRDHRPRSKLVLGEATEGSTRVVDEPGPSTRVVGESIAATLFFYLQLPQRPFFNLCRRDSHSSTFSRPNSSSSTFGATTPLLQPSAP
        FVSTNATFLEEDH+R+H+PRSKLVL EAT+ STRVVDE GPS+RV                                                       
Subjt:  FVSTNATFLEEDHIRDHRPRSKLVLGEATEGSTRVVDEPGPSTRVVGESIAATLFFYLQLPQRPFFNLCRRDSHSSTFSRPNSSSSTFGATTPLLQPSAP

Query:  QLLFFPSVVVFPFPPVKILQTFDRGFRPWKNIWDQNKTLAQRKMILSMSHSGDLNSPDKDPIKSLHFRISNPIYRAFDLEVKDGHDLLIGLNSGDGSSSL
                                         D+  T                                                      SG    S 
Subjt:  QLLFFPSVVVFPFPPVKILQTFDRGFRPWKNIWDQNKTLAQRKMILSMSHSGDLNSPDKDPIKSLHFRISNPIYRAFDLEVKDGHDLLIGLNSGDGSSSL

Query:  MYEQDKNESGMYEQDKNESGMYENFSFLVIFVVVETQVVIPDDDVEDPLSYRQAMNDIDKDKWAKAMDFEMESMYFNQVWELVDPPEGVKPIGCKWIYKR
             +    +  Q     G+             ETQVVIPDD VEDPLSY+QAMND+DKD+W KAMD EMESMYFN VWELVD PEGVKPIGCKWIYKR
Subjt:  MYEQDKNESGMYEQDKNESGMYENFSFLVIFVVVETQVVIPDDDVEDPLSYRQAMNDIDKDKWAKAMDFEMESMYFNQVWELVDPPEGVKPIGCKWIYKR

Query:  KRDVAGKVQTFKASLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM------------------------------------------
        KRD AGKVQTFKA LVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM                                          
Subjt:  KRDVAGKVQTFKASLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM------------------------------------------

Query:  -EASRSWNIRFDTTIKSFGFDQNVDEPCVYKKINKNKIAFLVLYVDDILLIGNDVGYLSNVKEWIAAQFQIKDLGEAQ----------------------
         +ASRSWNIRFDT IKS+GFDQNVDEPCVYKKINK K+AFLVLYVDDILLIGNDVGYL++VK W+AAQFQ+KDLGEAQ                      
Subjt:  -EASRSWNIRFDTTIKSFGFDQNVDEPCVYKKINKNKIAFLVLYVDDILLIGNDVGYLSNVKEWIAAQFQIKDLGEAQ----------------------

Query:  ----------------------HGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTTPDICYAVGIVSRYQSSPGLDHWTAVKGILKYLRRTRDY
                              HGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMY MLCT PDICYAVGIVSRYQS+PGLDHWTAVK +LKYLRRTRDY
Subjt:  ----------------------HGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTTPDICYAVGIVSRYQSSPGLDHWTAVKGILKYLRRTRDY

Query:  MLVFGAKELVLTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAV
        MLV+GAK+L+LTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACE AKEAVWLRKFL DLEVVPNMNLPITLYCDNSGAV
Subjt:  MLVFGAKELVLTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAV

Query:  ANSKEPRSHKRGKHIEKKYHLIREIVQRGDVVVTKIASEHNIVDPFTKALSAKVFEGHLESLGLRDMYI
        ANSKEPRSHKRGKHIE+KYHLIREIVQRGDV+VTKIASEHNI DPFTK L+AKVFEGHLESLGLRDMYI
Subjt:  ANSKEPRSHKRGKHIEKKYHLIREIVQRGDVVVTKIASEHNIVDPFTKALSAKVFEGHLESLGLRDMYI

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein0.0e+0054.24Show/hide
Query:  MSSSIISLLKNEQLNGENFQQWKSNLNTILMVEDLWFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE
        M+S+ +++L  ++LNG N+  WK+ +NT+L+++DL FVL EECP VP   A + V++ YERW KANEK + YIL SLSEVLAK++E++ TAREIM+SLQE
Subjt:  MSSSIISLLKNEQLNGENFQQWKSNLNTILMVEDLWFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE

Query:  MFGLPSYQLHHDALKNVFNAKMLEVQSVREHVLDMINQFNIAEANGGVVCERSQVAFILHLLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKG
        MFG  SYQ+ HDALK ++NA+M E  SVREHVL+M+  FN+AE NG V+ E SQV+FIL  LP S+L FR+NA MNKI + LTTLL+ELQ +ESL K KG
Subjt:  MFGLPSYQLHHDALKNVFNAKMLEVQSVREHVLDMINQFNIAEANGGVVCERSQVAFILHLLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKG

Query:  KNVVKGEANVAHSKKKFLKGSSLGTKSVPQASSSKQIQKRKGDKG-KASAKAMQGKGKAKIVADKGRCFHCNADGHWKRNCPRYLAEKKREKK-------
        +   KGEANVA S +KF +GS+ GTKS+P +S +K+ +K+KG +G KA+  A +   KAK  A KG CFHCN +GHWKRNCP+YLAEKK+ K+       
Subjt:  KNVVKGEANVAHSKKKFLKGSSLGTKSVPQASSSKQIQKRKGDKG-KASAKAMQGKGKAKIVADKGRCFHCNADGHWKRNCPRYLAEKKREKK-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------AETQNKRQKISPLSNNTYLWHLRLDHINIDQIDRLVKNGLLTDIEDTFLPPCESCLEGKM
                                                A TQNKR KISP   N +LWHLRL HIN+++I+RLVKNGLL+++E+  LP CESCLEGKM
Subjt:  ----------------------------------------AETQNKRQKISPLSNNTYLWHLRLDHINIDQIDRLVKNGLLTDIEDTFLPPCESCLEGKM

Query:  TKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYSISFIDDYSQYGYLYLMGHKSEALEKFKEFKAEVENLLGKTIKILRSDRGGEYLDQRFQDYMI
        TKRPFTGKG+RAKEPLEL+HSDLCGPMNVKARGGFEY I+F DDYS+YGY+YLM HKSEALEKFKE+KAEVEN L KTIK  RSDRGGEY+D +FQ+Y++
Subjt:  TKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYSISFIDDYSQYGYLYLMGHKSEALEKFKEFKAEVENLLGKTIKILRSDRGGEYLDQRFQDYMI

Query:  EHGIQSQLSTPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLRYFRIWGCPAHVLVTNPKKLE
        E GI SQLS PGTPQQNGVSERRNRTLLDMVRSMMSYA LP+SFWGYAV+TAV+ILN VPSKSVSETP +LW GRK SLR+FRIWGCPAHVL  NPKKLE
Subjt:  EHGIQSQLSTPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLRYFRIWGCPAHVLVTNPKKLE

Query:  PRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHIRDHRPRSKLVLG----EATEGSTRVVDEPGPSTRVVGESIAATLFFYLQLPQRPFFNL
        PR+++C FVGYPK TRGG FYDP+DNKVFVSTNATFLEEDHIR+H+PRSK+VL     E TE STRVV+EP   TRVV                      
Subjt:  PRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHIRDHRPRSKLVLG----EATEGSTRVVDEPGPSTRVVGESIAATLFFYLQLPQRPFFNL

Query:  CRRDSHSSTFSRPNSSSSTFGATTPLLQPSAPQLLFFPSVVVFPFPPVKILQTFDRGFRPWKNIWDQNKTLAQRKMILSMSHSGDLNSPDKDPIKSLHFR
                            G++T   QP +                                                      L  P +        R
Subjt:  CRRDSHSSTFSRPNSSSSTFGATTPLLQPSAPQLLFFPSVVVFPFPPVKILQTFDRGFRPWKNIWDQNKTLAQRKMILSMSHSGDLNSPDKDPIKSLHFR

Query:  ISNPIYRAFDLEVKDGHDLLIGLNSGDGSSSLMYEQDKNESGMYEQDKNESGMYENFSFLVIFVVVETQVVIPDDDVEDPLSYRQAMNDIDKDKWAKAMD
        ++N   R   L                                                       ET  VI D D+EDPL++++AM D+DKD+W KAM+
Subjt:  ISNPIYRAFDLEVKDGHDLLIGLNSGDGSSSLMYEQDKNESGMYEQDKNESGMYENFSFLVIFVVVETQVVIPDDDVEDPLSYRQAMNDIDKDKWAKAMD

Query:  FEMESMYFNQVWELVDPPEGVKPIGCKWIYKRKRDVAGKVQTFKASLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM----------
         E+ESMYFN VW+LVD P+GVKPIGCKWIYKRKR   GKVQTFKA LVAKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQM          
Subjt:  FEMESMYFNQVWELVDPPEGVKPIGCKWIYKRKRDVAGKVQTFKASLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM----------

Query:  ---------------------------------EASRSWNIRFDTTIKSFGFDQNVDEPCVYKKINKNKIAFLVLYVDDILLIGNDVGYLSNVKEWIAAQ
                                         +ASRSWNIRFDT IKS+GFDQ VDEPCVYK+I    +AFLVLYVDDILLIGND+G L+++K+W+A Q
Subjt:  ---------------------------------EASRSWNIRFDTTIKSFGFDQNVDEPCVYKKINKNKIAFLVLYVDDILLIGNDVGYLSNVKEWIAAQ

Query:  FQIKDLGEAQ--------------------------------------------HGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTTPDICYA
        FQ+KDLGEAQ                                            HGV LSKEQ PKTPQ+VE+MR IPYASAVGSLMY MLCT PDICYA
Subjt:  FQIKDLGEAQ--------------------------------------------HGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTTPDICYA

Query:  VGIVSRYQSSPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEVAK
        VGIVSRYQS+PGL HWTAVK ILKYLRRTRDY LV+G+K+L+LTGYTDSDFQTD+DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACE AK
Subjt:  VGIVSRYQSSPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEVAK

Query:  EAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIEKKYHLIREIVQRGDVVVTKIASEHNIVDPFTKALSAKVFEGHLESLGLRDM
        EAVWLR FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHKRGKHIE+KYHLIREIV RGDV+VT+IAS HN+ DPFTK L+AKVFEGHLESLGLRDM
Subjt:  EAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIEKKYHLIREIVQRGDVVVTKIASEHNIVDPFTKALSAKVFEGHLESLGLRDM

A0A5A7TZD0 Gag/pol protein0.0e+0055.32Show/hide
Query:  MSSSIISLLKNEQLNGENFQQWKSNLNTILMVEDLWFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE
        MSSSII+LLK +QL GEN+  WKS LN IL++ DL FVL EECPP P + A+Q+V+DAY+RWTKAN+K +++IL S+S++L+K++E + TAR+IM+SL+E
Subjt:  MSSSIISLLKNEQLNGENFQQWKSNLNTILMVEDLWFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE

Query:  MFGLPSYQLHHDALKNVFNAKMLEVQSVREHVLDMINQFNIAEANGGVVCERSQVAFILHLLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKG
        MFG PS Q                                                                                            
Subjt:  MFGLPSYQLHHDALKNVFNAKMLEVQSVREHVLDMINQFNIAEANGGVVCERSQVAFILHLLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKG

Query:  KNVVKGEANVAHSKKKFLKGSSLGTKSVPQASSSKQIQKRKGDKGKASAKAMQGKGKAKIVADKGRCFHCNADGHWKRNCPRYLAEKKRE----------
           +K EANVAHSK++F          VP  S S++IQKRK  KGK    A++ KGKAK VA K +CFHCN D HWK NCP+YL +KK +          
Subjt:  KNVVKGEANVAHSKKKFLKGSSLGTKSVPQASSSKQIQKRKGDKGKASAKAMQGKGKAKIVADKGRCFHCNADGHWKRNCPRYLAEKKRE----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------KKAETQNKRQKISPLSNNTYLWHLRLDHINIDQIDRLVKNGLLTDIEDTFLPPCESCLEGKMTK
                                            + A TQNKRQ+ISP +NNTYLWHLRL HIN+D+I RLVKNGLL  ++D  LPPCESCLEGKMTK
Subjt:  ------------------------------------KKAETQNKRQKISPLSNNTYLWHLRLDHINIDQIDRLVKNGLLTDIEDTFLPPCESCLEGKMTK

Query:  RPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYSISFIDDYSQYGYLYLMGHKSEALEKFKEFKAEVENLLGKTIKILRSDRGGEYLDQRFQDYMIEH
        RPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEY ISFIDDYS+YGYLYLM HKSEALEKFKE+K EVENLL K IKILRSDRGGEY+D RFQDYMIEH
Subjt:  RPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYSISFIDDYSQYGYLYLMGHKSEALEKFKEFKAEVENLLGKTIKILRSDRGGEYLDQRFQDYMIEH

Query:  GIQSQLSTPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLRYFRIWGCPAHVLVTNPKKLEPR
        GIQSQLS PGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSL +FRIWGCPAHVLVTNPKKLEPR
Subjt:  GIQSQLSTPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLRYFRIWGCPAHVLVTNPKKLEPR

Query:  TRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHIRDHRPRSKLVLGEATEGSTRVVDEPGPSTRVVGESIAATLFFYLQLPQRPFFNLCRRDSH
        +R+CQFVGYPKETRGGLF+DPQ+N+VFVSTNATFLEEDH+R+H+PRSKLVL EAT+ STRVVDE GPS+RV                             
Subjt:  TRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHIRDHRPRSKLVLGEATEGSTRVVDEPGPSTRVVGESIAATLFFYLQLPQRPFFNLCRRDSH

Query:  SSTFSRPNSSSSTFGATTPLLQPSAPQLLFFPSVVVFPFPPVKILQTFDRGFRPWKNIWDQNKTLAQRKMILSMSHSGDLNSPDKDPIKSLHFRISNPIY
                                                                   D+  T                                    
Subjt:  SSTFSRPNSSSSTFGATTPLLQPSAPQLLFFPSVVVFPFPPVKILQTFDRGFRPWKNIWDQNKTLAQRKMILSMSHSGDLNSPDKDPIKSLHFRISNPIY

Query:  RAFDLEVKDGHDLLIGLNSGDGSSSLMYEQDKNESGMYEQDKNESGMYENFSFLVIFVVVETQVVIPDDDVEDPLSYRQAMNDIDKDKWAKAMDFEMESM
                          SG    S      +    +  Q     G+             ETQVVIPDD VEDPLSY+QAMND+DKD+W KAMD EMESM
Subjt:  RAFDLEVKDGHDLLIGLNSGDGSSSLMYEQDKNESGMYEQDKNESGMYENFSFLVIFVVVETQVVIPDDDVEDPLSYRQAMNDIDKDKWAKAMDFEMESM

Query:  YFNQVWELVDPPEGVKPIGCKWIYKRKRDVAGKVQTFKASLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM----------------
        YFN VWELVD PEGVKPIGCKWIYKRKRD AGKVQTFKA LVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM                
Subjt:  YFNQVWELVDPPEGVKPIGCKWIYKRKRDVAGKVQTFKASLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM----------------

Query:  ---------------------------EASRSWNIRFDTTIKSFGFDQNVDEPCVYKKINKNKIAFLVLYVDDILLIGNDVGYLSNVKEWIAAQFQIKDL
                                   +ASRSWNIRFDT IKS+GFDQNVDEPCVYKKINK K+AFLVLYVDDILLIGNDVGYL++VK W+AAQFQ+KDL
Subjt:  ---------------------------EASRSWNIRFDTTIKSFGFDQNVDEPCVYKKINKNKIAFLVLYVDDILLIGNDVGYLSNVKEWIAAQFQIKDL

Query:  GEAQ--------------------------------------------HGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTTPDICYAVGIVSR
        GEAQ                                            HGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMY MLCT PDICYAVGIVSR
Subjt:  GEAQ--------------------------------------------HGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTTPDICYAVGIVSR

Query:  YQSSPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEVAKEAVWLR
        YQS+PGLDHWTAVK +LKYLRRTRDYMLV+GAK+L+LTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACE AKEAVWLR
Subjt:  YQSSPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEVAKEAVWLR

Query:  KFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIEKKYHLIREIVQRGDVVVTKIASEHNIVDPFTKALSAKVFEGHLESLGLRDMYI
        KFL DLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIE+KYHLIREIVQRGDV+VTKIASEHNI DPFTK L+AKVFEGHLESLGLRDMYI
Subjt:  KFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIEKKYHLIREIVQRGDVVVTKIASEHNIVDPFTKALSAKVFEGHLESLGLRDMYI

A0A5A7TZD7 Gag/pol protein0.0e+0054.3Show/hide
Query:  MSSSIISLLKNEQLNGENFQQWKSNLNTILMVEDLWFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE
        M+S+ +++L  ++LNG N+  WK N NT+L+++DL FVL EECP VP   A + V++ YERW KANEK + YIL SLSEVLAK++E++ TAREIM+SLQE
Subjt:  MSSSIISLLKNEQLNGENFQQWKSNLNTILMVEDLWFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE

Query:  MFGLPSYQLHHDALKNVFNAKMLEVQSVREHVLDMINQFNIAEANGGVVCERSQVAFILHLLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKG
        MFG  SYQ+ HDALK ++NA+M E  SVREHVL+M+  FN+AE NG V+ E SQV+FIL  LP S+L FR+NA MNKI + LTTLL+ELQ +ESL K KG
Subjt:  MFGLPSYQLHHDALKNVFNAKMLEVQSVREHVLDMINQFNIAEANGGVVCERSQVAFILHLLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKG

Query:  KNVVKGEANVAHSKKKFLKGSSLGTKSVPQASSSKQIQKRKGDKG-KASAKAMQGKGKAKIVADKGRCFHCNADGHWKRNCPRYLAEKKREKK-------
        +   KGEANVA S +KF +GS+ GTKS+P +S +K+ +K+KG +G KA+  A +   KAK  A KG CFHCN +GHWKRNCP+YLAEKK+ K+       
Subjt:  KNVVKGEANVAHSKKKFLKGSSLGTKSVPQASSSKQIQKRKGDKG-KASAKAMQGKGKAKIVADKGRCFHCNADGHWKRNCPRYLAEKKREKK-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------AETQNKRQKISPLSNNTYLWHLRLDHINIDQIDRLVKNGLLTDIEDTFLPPCESCLEGKM
                                                A TQNKR KISP   N +LWHLRL HIN+++I+RLVKNGLL+++E+  LP CESCLEGKM
Subjt:  ----------------------------------------AETQNKRQKISPLSNNTYLWHLRLDHINIDQIDRLVKNGLLTDIEDTFLPPCESCLEGKM

Query:  TKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYSISFIDDYSQYGYLYLMGHKSEALEKFKEFKAEVENLLGKTIKILRSDRGGEYLDQRFQDYMI
        TKRPFTGKG+RAKEPLEL+HSDLCGPMNVKARGGFEY I+F DDYS+YGY+YLM HKSEALEKFKE+KAEVEN L KTIK  RSDRGGEY+D +FQ+Y++
Subjt:  TKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYSISFIDDYSQYGYLYLMGHKSEALEKFKEFKAEVENLLGKTIKILRSDRGGEYLDQRFQDYMI

Query:  EHGIQSQLSTPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLRYFRIWGCPAHVLVTNPKKLE
        E GI SQLS PGTPQQNGVSERRNRTLLDMVRSMMSYA LP+SFWGYAV+TAV+ILN VPSKSVSETP +LW GRK SLR+FRIWGCPAHVL  NPKKLE
Subjt:  EHGIQSQLSTPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLRYFRIWGCPAHVLVTNPKKLE

Query:  PRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHIRDHRPRSKLVLG----EATEGSTRVVDEPGPSTRVVGESIAATLFFYLQLPQRPFFNL
        PR+++C FVGYPK TRGG FYDP+DNKVFVSTNATFLEEDHIR+H+PRSK+VL     E TE STRVV+EP   TRVV                      
Subjt:  PRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHIRDHRPRSKLVLG----EATEGSTRVVDEPGPSTRVVGESIAATLFFYLQLPQRPFFNL

Query:  CRRDSHSSTFSRPNSSSSTFGATTPLLQPSAPQLLFFPSVVVFPFPPVKILQTFDRGFRPWKNIWDQNKTLAQRKMILSMSHSGDLNSPDKDPIKSLHFR
                            G++T   QP +                                                      L  P +        R
Subjt:  CRRDSHSSTFSRPNSSSSTFGATTPLLQPSAPQLLFFPSVVVFPFPPVKILQTFDRGFRPWKNIWDQNKTLAQRKMILSMSHSGDLNSPDKDPIKSLHFR

Query:  ISNPIYRAFDLEVKDGHDLLIGLNSGDGSSSLMYEQDKNESGMYEQDKNESGMYENFSFLVIFVVVETQVVIPDDDVEDPLSYRQAMNDIDKDKWAKAMD
        ++N   R   L                                                       ET  VI D D+EDPL++++AM D+DKD+W KAM+
Subjt:  ISNPIYRAFDLEVKDGHDLLIGLNSGDGSSSLMYEQDKNESGMYEQDKNESGMYENFSFLVIFVVVETQVVIPDDDVEDPLSYRQAMNDIDKDKWAKAMD

Query:  FEMESMYFNQVWELVDPPEGVKPIGCKWIYKRKRDVAGKVQTFKASLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM----------
         E+ESMYFN VW+LVD P+GVKPIGCKWIYKRKR   GKVQTFKA LVAKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQM          
Subjt:  FEMESMYFNQVWELVDPPEGVKPIGCKWIYKRKRDVAGKVQTFKASLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM----------

Query:  ---------------------------------EASRSWNIRFDTTIKSFGFDQNVDEPCVYKKINKNKIAFLVLYVDDILLIGNDVGYLSNVKEWIAAQ
                                         +ASRSWNIRFDT IKS+GFDQ VDEPCVYK+I    +AFLVLYVDDILLIGND+G L+++K+W+A Q
Subjt:  ---------------------------------EASRSWNIRFDTTIKSFGFDQNVDEPCVYKKINKNKIAFLVLYVDDILLIGNDVGYLSNVKEWIAAQ

Query:  FQIKDLGEAQ--------------------------------------------HGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTTPDICYA
        FQ+KDLGEAQ                                            HGV LSKEQ PKTPQ+VE+MR IPYASAVGSLMY MLCT PDICYA
Subjt:  FQIKDLGEAQ--------------------------------------------HGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTTPDICYA

Query:  VGIVSRYQSSPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEVAK
        VGIVSRYQS+PGL HWTAVK ILKYLRRTRDY LV+G+K+L+LTGYTDSDFQTD+DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACE AK
Subjt:  VGIVSRYQSSPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEVAK

Query:  EAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIEKKYHLIREIVQRGDVVVTKIASEHNIVDPFTKALSAKVFEGHLESLGLRDM
        EAVWLR FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHKRGKHIE+KYHLIREIV RGDV+VT+IAS HN+ DPFTK L+AKVFEGHLESLGLRDM
Subjt:  EAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIEKKYHLIREIVQRGDVVVTKIASEHNIVDPFTKALSAKVFEGHLESLGLRDM

A0A5A7UYE8 Gag/pol protein0.0e+0059.39Show/hide
Query:  MSSSIISLLKNEQLNGENFQQWKSNLNTILMVEDLWFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE
        MSSSII+LLK +QL GEN+  WKS LN IL++ DL FVL EECPP P + A+Q+V+DAY+RWTKAN+K +++IL S+S++L+K++E + TAR+IM+SL+E
Subjt:  MSSSIISLLKNEQLNGENFQQWKSNLNTILMVEDLWFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE

Query:  MFGLPSYQLHHDALKNVFNAKMLEVQSVREHVLDMINQFNIAEANGGVVCERSQVAFILHLLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKG
        MFG PS Q                                                                                            
Subjt:  MFGLPSYQLHHDALKNVFNAKMLEVQSVREHVLDMINQFNIAEANGGVVCERSQVAFILHLLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKG

Query:  KNVVKGEANVAHSKKKFLKGSSLGTKSVPQASSSKQIQKRKGDKGKASAKAMQGKGKAKIVADKGRCFHCNADGHWKRNCPRYLAEKKREKKAETQNKRQ
           +K EANVAHSK++F          VP  S S++IQKRK  KGK    A++ KGKAK VA K +CFHCN D HWK NCP+YL  KK+EK+  T +   
Subjt:  KNVVKGEANVAHSKKKFLKGSSLGTKSVPQASSSKQIQKRKGDKGKASAKAMQGKGKAKIVADKGRCFHCNADGHWKRNCPRYLAEKKREKKAETQNKRQ

Query:  KISPLSNNTYL--------------------WHLRLDHINIDQIDRLVKNGLLTDIEDTFLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMN
         +   S+   L                       +L HIN+D+I RLVKNGLL  ++D  LPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMN
Subjt:  KISPLSNNTYL--------------------WHLRLDHINIDQIDRLVKNGLLTDIEDTFLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMN

Query:  VKARGGFEYSISFIDDYSQYGYLYLMGHKSEALEKFKEFKAEVENLLGKTIKILRSDRGGEYLDQRFQDYMIEHGIQSQLSTPGTPQQNGVSERRNRTLL
        VKARGGFEY ISFIDDYS+YGYLYLM HKSEALEKFKE+K EVENLL K IKILRSDRGGEY+D RFQDYMIEHGIQSQLS PGTPQQNGVSERRNRTLL
Subjt:  VKARGGFEYSISFIDDYSQYGYLYLMGHKSEALEKFKEFKAEVENLLGKTIKILRSDRGGEYLDQRFQDYMIEHGIQSQLSTPGTPQQNGVSERRNRTLL

Query:  DMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLRYFRIWGCPAHVLVTNPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKV
        DMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSL +FRIWGCPAHVLVTNPKKLEPR+R+CQFVGYPKETRGGLF+DPQ+N+V
Subjt:  DMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLRYFRIWGCPAHVLVTNPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKV

Query:  FVSTNATFLEEDHIRDHRPRSKLVLGEATEGSTRVVDEPGPSTRVVGESIAATLFFYLQLPQRPFFNLCRRDSHSSTFSRPNSSSSTFGATTPLLQPSAP
        FVSTNATFLEEDH+R+H+PRSKLVL EAT+ STRVVDE GPS+RV                                                       
Subjt:  FVSTNATFLEEDHIRDHRPRSKLVLGEATEGSTRVVDEPGPSTRVVGESIAATLFFYLQLPQRPFFNLCRRDSHSSTFSRPNSSSSTFGATTPLLQPSAP

Query:  QLLFFPSVVVFPFPPVKILQTFDRGFRPWKNIWDQNKTLAQRKMILSMSHSGDLNSPDKDPIKSLHFRISNPIYRAFDLEVKDGHDLLIGLNSGDGSSSL
                                         D+  T                                                      SG    S 
Subjt:  QLLFFPSVVVFPFPPVKILQTFDRGFRPWKNIWDQNKTLAQRKMILSMSHSGDLNSPDKDPIKSLHFRISNPIYRAFDLEVKDGHDLLIGLNSGDGSSSL

Query:  MYEQDKNESGMYEQDKNESGMYENFSFLVIFVVVETQVVIPDDDVEDPLSYRQAMNDIDKDKWAKAMDFEMESMYFNQVWELVDPPEGVKPIGCKWIYKR
             +    +  Q     G+             ETQVVIPDD VEDPLSY+QAMND+DKD+W KAMD EMESMYFN VWELVD PEGVKPIGCKWIYKR
Subjt:  MYEQDKNESGMYEQDKNESGMYENFSFLVIFVVVETQVVIPDDDVEDPLSYRQAMNDIDKDKWAKAMDFEMESMYFNQVWELVDPPEGVKPIGCKWIYKR

Query:  KRDVAGKVQTFKASLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM------------------------------------------
        KRD AGKVQTFKA LVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM                                          
Subjt:  KRDVAGKVQTFKASLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM------------------------------------------

Query:  -EASRSWNIRFDTTIKSFGFDQNVDEPCVYKKINKNKIAFLVLYVDDILLIGNDVGYLSNVKEWIAAQFQIKDLGEAQ----------------------
         +ASRSWNIRFDT IKS+GFDQNVDEPCVYKKINK K+AFLVLYVDDILLIGNDVGYL++VK W+AAQFQ+KDLGEAQ                      
Subjt:  -EASRSWNIRFDTTIKSFGFDQNVDEPCVYKKINKNKIAFLVLYVDDILLIGNDVGYLSNVKEWIAAQFQIKDLGEAQ----------------------

Query:  ----------------------HGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTTPDICYAVGIVSRYQSSPGLDHWTAVKGILKYLRRTRDY
                              HGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMY MLCT PDICYAVGIVSRYQS+PGLDHWTAVK +LKYLRRTRDY
Subjt:  ----------------------HGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTTPDICYAVGIVSRYQSSPGLDHWTAVKGILKYLRRTRDY

Query:  MLVFGAKELVLTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAV
        MLV+GAK+L+LTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACE AKEAVWLRKFL DLEVVPNMNLPITLYCDNSGAV
Subjt:  MLVFGAKELVLTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAV

Query:  ANSKEPRSHKRGKHIEKKYHLIREIVQRGDVVVTKIASEHNIVDPFTKALSAKVFEGHLESLGLRDMYI
        ANSKEPRSHKRGKHIE+KYHLIREIVQRGDV+VTKIASEHNI DPFTK L+AKVFEGHLESLGLRDMYI
Subjt:  ANSKEPRSHKRGKHIEKKYHLIREIVQRGDVVVTKIASEHNIVDPFTKALSAKVFEGHLESLGLRDMYI

A0A5D3CPJ6 Gag/pol protein0.0e+0054.07Show/hide
Query:  MSSSIISLLKNEQLNGENFQQWKSNLNTILMVEDLWFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE
        M+S+ +++L  ++LNG N+  WK+ +NT+L+++DL FVL EECP VP   A + V++ YERW KANEK + YIL SLSEVLAK++E++ TAREIM+SLQE
Subjt:  MSSSIISLLKNEQLNGENFQQWKSNLNTILMVEDLWFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQE

Query:  MFGLPSYQLHHDALKNVFNAKMLEVQSVREHVLDMINQFNIAEANGGVVCERSQVAFILHLLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKG
        MFG  SYQ+ HDALK ++NA+M E  SVREHVL+M+  FN+AE NG V+ E SQV+FIL  LP S+L FR+NA MNKI + LTTLL+ELQ +ESL K KG
Subjt:  MFGLPSYQLHHDALKNVFNAKMLEVQSVREHVLDMINQFNIAEANGGVVCERSQVAFILHLLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKG

Query:  KNVVKGEANVAHSKKKFLKGSSLGTKSVPQASSSKQIQKRKGDKGKASAKAMQGKGKAKIVADKGRCFHCNADGHWKRNCPRYLAEKKREKK--------
        +   KGEANVA S +KF +GS+ GTKS+P +S +K+ +K+KG +G   A     K   K  A KG CFHCN +GHWKRNCP+YLAEKK+ K+        
Subjt:  KNVVKGEANVAHSKKKFLKGSSLGTKSVPQASSSKQIQKRKGDKGKASAKAMQGKGKAKIVADKGRCFHCNADGHWKRNCPRYLAEKKREKK--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------AETQNKRQKISPLSNNTYLWHLRLDHINIDQIDRLVKNGLLTDIEDTFLPPCESCLEGKMT
                                               A TQNKR KISP   N +LWHLRL HIN+++I+RLVKNGLL+++E+  LP CESCLEGKMT
Subjt:  ---------------------------------------AETQNKRQKISPLSNNTYLWHLRLDHINIDQIDRLVKNGLLTDIEDTFLPPCESCLEGKMT

Query:  KRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYSISFIDDYSQYGYLYLMGHKSEALEKFKEFKAEVENLLGKTIKILRSDRGGEYLDQRFQDYMIE
        KRPFTGKG+RAKEPLEL+HSDLCGPMNVKARGGFEY I+F DDYS+YGY+YLM HKSEALEKFKE+KAEVEN L KTIK  RSDRGGEY+D +FQ+Y++E
Subjt:  KRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYSISFIDDYSQYGYLYLMGHKSEALEKFKEFKAEVENLLGKTIKILRSDRGGEYLDQRFQDYMIE

Query:  HGIQSQLSTPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLRYFRIWGCPAHVLVTNPKKLEP
         GI SQLS PGTPQQNGVSERRNRTLLDMVRSMMSYA LP+SFWGYAV+TAV+ILN VPSKSVSETP +LW GRK SLR+FRIWGCPAHVL  NPKKLEP
Subjt:  HGIQSQLSTPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLRYFRIWGCPAHVLVTNPKKLEP

Query:  RTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHIRDHRPRSKLVLG----EATEGSTRVVDEPGPSTRVVGESIAATLFFYLQLPQRPFFNLC
        R+++C FVGYPK TRGG FYDP+DNKVFVSTNATFLEEDHIR+H+PRSK+VL     E TE STRVV+EP   TRVV                       
Subjt:  RTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHIRDHRPRSKLVLG----EATEGSTRVVDEPGPSTRVVGESIAATLFFYLQLPQRPFFNLC

Query:  RRDSHSSTFSRPNSSSSTFGATTPLLQPSAPQLLFFPSVVVFPFPPVKILQTFDRGFRPWKNIWDQNKTLAQRKMILSMSHSGDLNSPDKDPIKSLHFRI
                           G++T   QP +                                                      L  P +        R+
Subjt:  RRDSHSSTFSRPNSSSSTFGATTPLLQPSAPQLLFFPSVVVFPFPPVKILQTFDRGFRPWKNIWDQNKTLAQRKMILSMSHSGDLNSPDKDPIKSLHFRI

Query:  SNPIYRAFDLEVKDGHDLLIGLNSGDGSSSLMYEQDKNESGMYEQDKNESGMYENFSFLVIFVVVETQVVIPDDDVEDPLSYRQAMNDIDKDKWAKAMDF
        +N   R   L                                                       ET  VI D D+EDPL++++AM D+DKD+W KAM+ 
Subjt:  SNPIYRAFDLEVKDGHDLLIGLNSGDGSSSLMYEQDKNESGMYEQDKNESGMYENFSFLVIFVVVETQVVIPDDDVEDPLSYRQAMNDIDKDKWAKAMDF

Query:  EMESMYFNQVWELVDPPEGVKPIGCKWIYKRKRDVAGKVQTFKASLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM-----------
        E+ESMYFN VW+LVD P+GVKPIGCKWIYKRKR   GKVQTFKA LVAKGYTQ EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQM           
Subjt:  EMESMYFNQVWELVDPPEGVKPIGCKWIYKRKRDVAGKVQTFKASLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM-----------

Query:  --------------------------------EASRSWNIRFDTTIKSFGFDQNVDEPCVYKKINKNKIAFLVLYVDDILLIGNDVGYLSNVKEWIAAQF
                                        +ASRSWNIRFDT IKS+GFDQ VDEPCVYK+I    +AFLVLYVDDILLIGND+G L+++K+W+A QF
Subjt:  --------------------------------EASRSWNIRFDTTIKSFGFDQNVDEPCVYKKINKNKIAFLVLYVDDILLIGNDVGYLSNVKEWIAAQF

Query:  QIKDLGEAQ--------------------------------------------HGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTTPDICYAV
        Q+KDLGEAQ                                            HGV LSKEQ PKTPQ+VE+MR IPYASAVGSLMY MLCT PDICYAV
Subjt:  QIKDLGEAQ--------------------------------------------HGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTTPDICYAV

Query:  GIVSRYQSSPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEVAKE
        GIVSRYQS+PGL HWTAVK ILKYLRRTRDY LV+G+K+L+LTGYTDSDFQTD+DSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACE AKE
Subjt:  GIVSRYQSSPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEVAKE

Query:  AVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIEKKYHLIREIVQRGDVVVTKIASEHNIVDPFTKALSAKVFEGHLESLGLRDM
        AVWLR FL DLEVVPNM+ PITLYCDNSGAVANS+EPRSHKRGKHIE+KYHLIREIV RGDV+VT+IAS HN+ DPFTK L+AKVFEGHLESLGLRDM
Subjt:  AVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIEKKYHLIREIVQRGDVVVTKIASEHNIVDPFTKALSAKVFEGHLESLGLRDM

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.8e-10827.07Show/hide
Query:  NNTYLWHLRLDHINIDQIDRLVKNGLLTDIE-----DTFLPPCESCLEGKMTKRPFTGKGYRA--KEPLELIHSDLCGPMNVKARGGFEYSISFIDDYSQ
        NN  LWH R  HI+  ++  + +  + +D       +     CE CL GK  + PF     +   K PL ++HSD+CGP+         Y + F+D ++ 
Subjt:  NNTYLWHLRLDHINIDQIDRLVKNGLLTDIE-----DTFLPPCESCLEGKMTKRPFTGKGYRA--KEPLELIHSDLCGPMNVKARGGFEYSISFIDDYSQ

Query:  YGYLYLMGHKSEALEKFKEFKAEVENLLGKTIKILRSDRGGEYLDQRFQDYMIEHGIQSQLSTPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGY
        Y   YL+ +KS+    F++F A+ E      +  L  D G EYL    + + ++ GI   L+ P TPQ NGVSER  RT+ +  R+M+S A+L  SFWG 
Subjt:  YGYLYLMGHKSEALEKFKEFKAEVENLLGKTIKILRSDRGGEYLDQRFQDYMIEHGIQSQLSTPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGY

Query:  AVETAVHILNNVPSKSV---SETPFELWRGRKPSLRYFRIWGCPAHVLVTNPK-KLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHIR
        AV TA +++N +PS+++   S+TP+E+W  +KP L++ R++G   +V + N + K + ++    FVGY  E  G   +D  + K  V+ +    E + + 
Subjt:  AVETAVHILNNVPSKSV---SETPFELWRGRKPSLRYFRIWGCPAHVLVTNPK-KLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHIR

Query:  DHRPRSKLV-LGEATEGSTRVVDEPGPSTRVVGESIAATLFFYLQLPQRPFFNLCRRDSHSSTFSRPNSSSSTFGATTPLLQPSAPQLLFFPSVVVFPFP
            + + V L ++ E   +  + P  S +++         F  +  +       +    S   + PN S        P        + F          
Subjt:  DHRPRSKLV-LGEATEGSTRVVDEPGPSTRVVGESIAATLFFYLQLPQRPFFNLCRRDSHSSTFSRPNSSSSTFGATTPLLQPSAPQLLFFPSVVVFPFP

Query:  PVKILQTFDRGFRPWKNIWDQNKTLAQRKMILSMSHSGDLN-SPDKDPIKSL-HFRISNPIYRAFDLEVKDGHDLLIGLNSGDGSSSLMYEQDKNESGM-
                       K   +++K   +   +     SG+ N S + +  + L    I NP          DG +++          +   E+ K +  + 
Subjt:  PVKILQTFDRGFRPWKNIWDQNKTLAQRKMILSMSHSGDLN-SPDKDPIKSL-HFRISNPIYRAFDLEVKDGHDLLIGLNSGDGSSSLMYEQDKNESGM-

Query:  YEQDKNESGMYENFSFLVIFVVVETQVVIPD-DDVEDPLSYRQAMNDIDKDKWAKAMDFEMESMYFNQVWELVDPPEGVKPIGCKWIYKRKRDVAGKVQT
        Y ++ N              VV+    +  D  +  D + YR      DK  W +A++ E+ +   N  W +   PE    +  +W++  K +  G    
Subjt:  YEQDKNESGMYENFSFLVIFVVVETQVVIPD-DDVEDPLSYRQAMNDIDKDKWAKAMDFEMESMYFNQVWELVDPPEGVKPIGCKWIYKRKRDVAGKVQT

Query:  FKASLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM-----------------------------------------EASRSWNIRFD
        +KA LVA+G+TQ+  +DYEETF+PVA + S R +LS+   Y+ ++ QM                                         +A+R W   F+
Subjt:  FKASLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM-----------------------------------------EASRSWNIRFD

Query:  TTIKSFGFDQNVDEPCVY--KKINKNKIAFLVLYVDDILLIGNDVGYLSNVKEWIAAQFQIKDLGEAQH--GVHLSKEQ---------------------
          +K   F  +  + C+Y   K N N+  +++LYVDD+++   D+  ++N K ++  +F++ DL E +H  G+ +  ++                     
Subjt:  TTIKSFGFDQNVDEPCVY--KKINKNKIAFLVLYVDDILLIGNDVGYLSNVKEWIAAQFQIKDLGEAQH--GVHLSKEQ---------------------

Query:  -----SPKTPQEV-------EDMRRIPYASAVGSLMYVMLCTTPDICYAVGIVSRYQSSPGLDHWTAVKGILKYLRRTRDYMLVFG---AKELVLTGYTD
             S   P ++       ++    P  S +G LMY+MLCT PD+  AV I+SRY S    + W  +K +L+YL+ T D  L+F    A E  + GY D
Subjt:  -----SPKTPQEV-------EDMRRIPYASAVGSLMYVMLCTTPDICYAVGIVSRYQSSPGLDHWTAVKGILKYLRRTRDYMLVFG---AKELVLTGYTD

Query:  SDFQTDKDSRKSTSGSVFTL-NGGAVVWRSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHI
        SD+   +  RKST+G +F + +   + W + +Q  +A S+ EAEY+A  E  +EA+WL+  LT + +   +  PI +Y DN G ++ +  P  HKR KHI
Subjt:  SDFQTDKDSRKSTSGSVFTL-NGGAVVWRSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHI

Query:  EKKYHLIREIVQRGDVVVTKIASEHNIVDPFTKALSAKVFEGHLESLGL
        + KYH  RE VQ   + +  I +E+ + D FTK L A  F    + LGL
Subjt:  EKKYHLIREIVQRGDVVVTKIASEHNIVDPFTKALSAKVFEGHLESLGL

P0CV72 Secreted RxLR effector protein 1615.7e-2949.62Show/hide
Query:  MRRIPYASAVGSLMYVMLCTTPDICYAVGIVSRYQSSPGLDHWTAVKGILKYLRRTRDYMLVF-GAKELVLTGYTDSDFQTDKDSRKSTSGSVFTLNGGA
        M+ +PY SAVG++MY+M+ T PD+  AVG++S++ S P   HW A+K +L+YL+ T+ Y L F  A    L GY+D+D+  D +SR+STSG +F LNGG 
Subjt:  MRRIPYASAVGSLMYVMLCTTPDICYAVGIVSRYQSSPGLDHWTAVKGILKYLRRTRDYMLVF-GAKELVLTGYTDSDFQTDKDSRKSTSGSVFTLNGGA

Query:  VVWRSIKQGCIADSTMEAEYVAACEVAKEAVWL
        V WRS KQ  +A S+ E EY+A  E  +EAVWL
Subjt:  VVWRSIKQGCIADSTMEAEYVAACEVAKEAVWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.4e-14431.02Show/hide
Query:  LWHLRLDHINIDQIDRLVKNGLLTDIEDTFLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYSISFIDDYSQYGYLYLMGHKS
        LWH R+ H++   +  L K  L++  + T + PC+ CL GK  +  F     R    L+L++SD+CGPM +++ GG +Y ++FIDD S+  ++Y++  K 
Subjt:  LWHLRLDHINIDQIDRLVKNGLLTDIEDTFLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYSISFIDDYSQYGYLYLMGHKS

Query:  EALEKFKEFKAEVENLLGKTIKILRSDRGGEYLDQRFQDYMIEHGIQSQLSTPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNN
        +  + F++F A VE   G+ +K LRSD GGEY  + F++Y   HGI+ + + PGTPQ NGV+ER NRT+++ VRSM+  A+LP SFWG AV+TA +++N 
Subjt:  EALEKFKEFKAEVENLLGKTIKILRSDRGGEYLDQRFQDYMIEHGIQSQLSTPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNN

Query:  VPSKSVS-ETPFELWRGRKPSLRYFRIWGCP--AHVLVTNPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHIRDHRPRSKLVLGE
         PS  ++ E P  +W  ++ S  + +++GC   AHV      KL+ ++  C F+GY  E  G   +DP   KV  S +  F E +            +  
Subjt:  VPSKSVS-ETPFELWRGRKPSLRYFRIWGCP--AHVLVTNPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHIRDHRPRSKLVLGE

Query:  ATEGSTRVVDEPGPSTRVVGESIAATLFFYLQLPQRPFFNLCRRDSHSSTFSRPNSSSSTFGATTPLLQPSAPQLLFFPSVVVFPFPPVKILQTFDRGFR
        A + S +V +   P+              ++ +P              ST + P S+ ST                                +  ++G +
Subjt:  ATEGSTRVVDEPGPSTRVVGESIAATLFFYLQLPQRPFFNLCRRDSHSSTFSRPNSSSSTFGATTPLLQPSAPQLLFFPSVVVFPFPPVKILQTFDRGFR

Query:  PWKNIWDQNKTLAQRKMILSMSHSGDLNSPDKDPIKSLHFRISNPIYRAFDLEVKDGHDLLIGLNSGDGSSSLMYEQDKNESGMYEQDKNESGMYENFSF
        P                       G++                          ++ G  L  G+   +  +    E+        E+ + ES  Y +  +
Subjt:  PWKNIWDQNKTLAQRKMILSMSHSGDLNSPDKDPIKSLHFRISNPIYRAFDLEVKDGHDLLIGLNSGDGSSSLMYEQDKNESGMYEQDKNESGMYENFSF

Query:  LVIFVVVETQVVIPDDDVEDPLSYRQAMNDIDKDKWAKAMDFEMESMYFNQVWELVDPPEGVKPIGCKWIYKRKRDVAGKVQTFKASLVAKGYTQREGVD
        ++I             D  +P S ++ ++  +K++  KAM  EMES+  N  ++LV+ P+G +P+ CKW++K K+D   K+  +KA LV KG+ Q++G+D
Subjt:  LVIFVVVETQVVIPDDDVEDPLSYRQAMNDIDKDKWAKAMDFEMESMYFNQVWELVDPPEGVKPIGCKWIYKRKRDVAGKVQTFKASLVAKGYTQREGVD

Query:  YEETFSPVAMLKSIRILLSIATFYDYEIWQM-------------------------------------------EASRSWNIRFDTTIKSFGFDQNVDEP
        ++E FSPV  + SIR +LS+A   D E+ Q+                                           +A R W ++FD+ +KS  + +   +P
Subjt:  YEETFSPVAMLKSIRILLSIATFYDYEIWQM-------------------------------------------EASRSWNIRFDTTIKSFGFDQNVDEP

Query:  CVY-KKINKNKIAFLVLYVDDILLIGNDVGYLSNVKEWIAAQFQIKDLGEAQH--------------------------------------------GVH
        CVY K+ ++N    L+LYVDD+L++G D G ++ +K  ++  F +KDLG AQ                                              + 
Subjt:  CVY-KKINKNKIAFLVLYVDDILLIGNDVGYLSNVKEWIAAQFQIKDLGEAQH--------------------------------------------GVH

Query:  LSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTTPDICYAVGIVSRYQSSPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLTGYTDSDFQTDKDSR
        LSK+  P T +E  +M ++PY+SAVGSLMY M+CT PDI +AVG+VSR+  +PG +HW AVK IL+YLR T    L FG  + +L GYTD+D   D D+R
Subjt:  LSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTTPDICYAVGIVSRYQSSPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLTGYTDSDFQTDKDSR

Query:  KSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIEKKYHLIREIV
        KS++G +FT +GGA+ W+S  Q C+A ST EAEY+AA E  KE +WL++FL +L +         +YCD+  A+  SK    H R KHI+ +YH IRE+V
Subjt:  KSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIEKKYHLIREIV

Query:  QRGDVVVTKIASEHNIVDPFTKALSAKVFEGHLESLGL
            + V KI++  N  D  TK +    FE   E +G+
Subjt:  QRGDVVVTKIASEHNIVDPFTKALSAKVFEGHLESLGL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE19.0e-9926.69Show/hide
Query:  SPLSNNTY-LWHLRLDHINIDQIDRLVKNGLLTDIEDT--FLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYSISFIDDYSQ
        SP S  T+  WH RL H     ++ ++ N  L+ +  +  FL  C  CL  K  K PF+     +  PLE I+SD+     + +   + Y + F+D +++
Subjt:  SPLSNNTY-LWHLRLDHINIDQIDRLVKNGLLTDIEDT--FLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYSISFIDDYSQ

Query:  YGYLYLMGHKSEALEKFKEFKAEVENLLGKTIKILRSDRGGEYLDQRFQDYMIEHGIQSQLSTPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGY
        Y +LY +  KS+  E F  FK  +EN     I    SD GGE++     +Y  +HGI    S P TP+ NG+SER++R +++   +++S+A +P ++W Y
Subjt:  YGYLYLMGHKSEALEKFKEFKAEVENLLGKTIKILRSDRGGEYLDQRFQDYMIEHGIQSQLSTPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGY

Query:  AVETAVHILNNVPSKSVS-ETPFELWRGRKPSLRYFRIWGCPAHVLVT--NPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEE-----
        A   AV+++N +P+  +  E+PF+   G  P+    R++GC  +  +   N  KL+ ++R C F+GY       L    Q +++++S +  F E      
Subjt:  AVETAVHILNNVPSKSVS-ETPFELWRGRKPSLRYFRIWGCPAHVLVT--NPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEE-----

Query:  ------DHIRDHRPRSKLVLGEATEGSTRVVDEPGPSTRVVGESIAATLFFYLQLPQRPFFNLCRRDSHSSTFSRPNSSSSTFGATTPLLQPSAPQLLFF
                +++ R  S  V    T   TR    P PS      +           P  PF     R+S  S+ +  +S SS+F ++    +P+AP+    
Subjt:  ------DHIRDHRPRSKLVLGEATEGSTRVVDEPGPSTRVVGESIAATLFFYLQLPQRPFFNLCRRDSHSSTFSRPNSSSSTFGATTPLLQPSAPQLLFF

Query:  PSVVVFPFPPVKILQTFDRGFRPWKNIWDQNKTLAQRKMILSMSHSGDLNSPDKDPIKSLHFRISNPIYRAFDLEVKDGHDLLIGLNSGDGSSSLMYEQD
              P P  +  QT  +         +     +  ++  S+S     +S    P  S     ++P   +  +        ++  N+     +      
Subjt:  PSVVVFPFPPVKILQTFDRGFRPWKNIWDQNKTLAQRKMILSMSHSGDLNSPDKDPIKSLHFRISNPIYRAFDLEVKDGHDLLIGLNSGDGSSSLMYEQD

Query:  KNESGMYEQDKNESGMYENFSFLVIFVVVETQVVIPDDDVEDPLSYRQAMNDIDKDKWAKAMDFEMESMYFNQVWELVDPPEG-VKPIGCKWIYKRKRDV
        + ++G+ + +   S        L + +  E+          +P +  QA+ D   ++W  AM  E+ +   N  W+LV PP   V  +GC+WI+ +K + 
Subjt:  KNESGMYEQDKNESGMYENFSFLVIFVVVETQVVIPDDDVEDPLSYRQAMNDIDKDKWAKAMDFEMESMYFNQVWELVDPPEG-VKPIGCKWIYKRKRDV

Query:  AGKVQTFKASLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM-------------------------------------------EAS
         G +  +KA LVAKGY QR G+DY ETFSPV    SIRI+L +A    + I Q+                                           +A 
Subjt:  AGKVQTFKASLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM-------------------------------------------EAS

Query:  RSWNIRFDTTIKSFGFDQNVDEPCVYKKINKNKIAFLVLYVDDILLIGNDVGYLSNVKEWIAAQFQIKDLGEAQH-----------GVHLSKEQ------
        R+W +     + + GF  +V +  ++       I ++++YVDDIL+ GND   L N  + ++ +F +KD  E  +           G+HLS+ +      
Subjt:  RSWNIRFDTTIKSFGFDQNVDEPCVYKKINKNKIAFLVLYVDDILLIGNDVGYLSNVKEWIAAQFQIKDLGEAQH-----------GVHLSKEQ------

Query:  ------------SPKTPQEVEDMRR-------IPYASAVGSLMYVMLCTTPDICYAVGIVSRYQSSPGLDHWTAVKGILKYLRRTRDY-MLVFGAKELVL
                    +P  P     +           Y   VGSL Y+   T PDI YAV  +S++   P  +H  A+K IL+YL  T ++ + +     L L
Subjt:  ------------SPKTPQEVEDMRR-------IPYASAVGSLMYVMLCTTPDICYAVGIVSRYQSSPGLDHWTAVKGILKYLRRTRDY-MLVFGAKELVL

Query:  TGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKR
          Y+D+D+  DKD   ST+G +  L    + W S KQ  +  S+ EAEY +    + E  W+   LT+L +   +  P  +YCDN GA      P  H R
Subjt:  TGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKR

Query:  GKHIEKKYHLIREIVQRGDVVVTKIASEHNIVDPFTKALSAKVFEGHLESLGL
         KHI   YH IR  VQ G + V  +++   + D  TK LS   F+     +G+
Subjt:  GKHIEKKYHLIREIVQRGDVVVTKIASEHNIVDPFTKALSAKVFEGHLESLGL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.4e-9727.6Show/hide
Query:  AETQNKRQKISPLSNNTY-LWHLRLDHINIDQIDRLVKNGLLTDIEDTF-LPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYS
        A +Q      SP S  T+  WH RL H ++  ++ ++ N  L  +  +  L  C  C   K  K PF+     + +PLE I+SD+     + +   + Y 
Subjt:  AETQNKRQKISPLSNNTY-LWHLRLDHINIDQIDRLVKNGLLTDIEDTF-LPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYS

Query:  ISFIDDYSQYGYLYLMGHKSEALEKFKEFKAEVENLLGKTIKILRSDRGGEYLDQRFQDYMIEHGIQSQLSTPGTPQQNGVSERRNRTLLDMVRSMMSYA
        + F+D +++Y +LY +  KS+  + F  FK+ VEN     I  L SD GGE++  R  DY+ +HGI    S P TP+ NG+SER++R +++M  +++S+A
Subjt:  ISFIDDYSQYGYLYLMGHKSEALEKFKEFKAEVENLLGKTIKILRSDRGGEYLDQRFQDYMIEHGIQSQLSTPGTPQQNGVSERRNRTLLDMVRSMMSYA

Query:  QLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLRYFRIWGCPAHVLVT--NPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNAT
         +P ++W YA   AV+++N +P+  +  ++PF+   G+ P+    +++GC  +  +   N  KLE +++ C F+GY       L       +++ S +  
Subjt:  QLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLRYFRIWGCPAHVLVT--NPKKLEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNAT

Query:  FLEEDHIRDHRPRSKLVLGEATEGSTRVVDEPG-------PSTRVVGESIAATLFFYLQLPQRPFFN---LCRRDSHSSTFSRPNSSSSTFGATTPLL-Q
        F E        P S    G +T    R    P        P+T +V  +    L  +L    RP  +   LC     SS    P+SS S+  ++ P    
Subjt:  FLEEDHIRDHRPRSKLVLGEATEGSTRVVDEPG-------PSTRVVGESIAATLFFYLQLPQRPFFN---LCRRDSHSSTFSRPNSSSSTFGATTPLL-Q

Query:  PSAPQLLFFPSVVVFPFPPVKILQTFDRGFRPWKNIWDQNKTLAQRKMILSMSH----SGDLNSPDKDPIKSLHFRISNPIYRAFDLEVKDGHDLLIGLN
         + PQ    P           IL   +    P  N  +QN  L Q    +S  H    S  ++ P+     S       P+  A           +I +N
Subjt:  PSAPQLLFFPSVVVFPFPPVKILQTFDRGFRPWKNIWDQNKTLAQRKMILSMSH----SGDLNSPDKDPIKSLHFRISNPIYRAFDLEVKDGHDLLIGLN

Query:  SGDGSSSLMYEQDKNESGMYEQDKNE-SGMYENFSFLVIFVVVETQVVIPDDDVEDPLSYRQAMNDIDKDKWAKAMDFEMESMYFNQVWELV-DPPEGVK
        +         +   N   M  + K+      + +S+                   +P +  QAM D   D+W +AM  E+ +   N  W+LV  PP  V 
Subjt:  SGDGSSSLMYEQDKNESGMYEQDKNE-SGMYENFSFLVIFVVVETQVVIPDDDVEDPLSYRQAMNDIDKDKWAKAMDFEMESMYFNQVWELV-DPPEGVK

Query:  PIGCKWIYKRKRDVAGKVQTFKASLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM--------------------------------
         +GC+WI+ +K +  G +  +KA LVAKGY QR G+DY ETFSPV    SIRI+L +A    + I Q+                                
Subjt:  PIGCKWIYKRKRDVAGKVQTFKASLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQM--------------------------------

Query:  -----------EASRSWNIRFDTTIKSFGFDQNVDEPCVYKKINKNKIAFLVLYVDDILLIGNDVGYLSNVKEWIAAQFQIKD-------LG----EAQH
                   +A R+W +   T + + GF  ++ +  ++       I ++++YVDDIL+ GND   L +  + ++ +F +K+       LG        
Subjt:  -----------EASRSWNIRFDTTIKSFGFDQNVDEPCVYKKINKNKIAFLVLYVDDILLIGNDVGYLSNVKEWIAAQFQIKD-------LG----EAQH

Query:  GVHLSKEQ-----------------------SPKTPQEVEDMRRIP----YASAVGSLMYVMLCTTPDICYAVGIVSRYQSSPGLDHWTAVKGILKYLRR
        G+HLS+ +                       SPK    +    ++P    Y   VGSL Y+   T PD+ YAV  +S+Y   P  DHW A+K +L+YL  
Subjt:  GVHLSKEQ-----------------------SPKTPQEVEDMRRIP----YASAVGSLMYVMLCTTPDICYAVGIVSRYQSSPGLDHWTAVKGILKYLRR

Query:  TRDY-MLVFGAKELVLTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVVPNMNLPITLYCD
        T D+ + +     L L  Y+D+D+  D D   ST+G +  L    + W S KQ  +  S+ EAEY +    + E  W+   LT+L +   ++ P  +YCD
Subjt:  TRDY-MLVFGAKELVLTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVVPNMNLPITLYCD

Query:  NSGAVANSKEPRSHKRGKHIEKKYHLIREIVQRGDVVVTKIASEHNIVDPFTKALSAKVFEGHLESLGL
        N GA      P  H R KHI   YH IR  VQ G + V  +++   + D  TK LS   F+     +G+
Subjt:  NSGAVANSKEPRSHKRGKHIEKKYHLIREIVQRGDVVVTKIASEHNIVDPFTKALSAKVFEGHLESLGL

Arabidopsis top hitse value%identityAlignment
AT2G37160.1 Transducin/WD40 repeat-like superfamily protein2.1e-1048.65Show/hide
Query:  DLNSPDKDPIKSLHFRISNPIYRAFDLEVKDGHDLLIGLNSGDGSSSLMYE--QDKNESGMYEQDKNESGMYEN
        DLNS DKDP+KS+H   SNP+  AF  + KDGHDLLIGLNSGD  +  + +  QD  +  +  Q  N+ G   N
Subjt:  DLNSPDKDPIKSLHFRISNPIYRAFDLEVKDGHDLLIGLNSGDGSSSLMYE--QDKNESGMYEQDKNESGMYEN

AT2G37160.2 Transducin/WD40 repeat-like superfamily protein2.1e-1048.65Show/hide
Query:  DLNSPDKDPIKSLHFRISNPIYRAFDLEVKDGHDLLIGLNSGDGSSSLMYE--QDKNESGMYEQDKNESGMYEN
        DLNS DKDP+KS+H   SNP+  AF  + KDGHDLLIGLNSGD  +  + +  QD  +  +  Q  N+ G   N
Subjt:  DLNSPDKDPIKSLHFRISNPIYRAFDLEVKDGHDLLIGLNSGDGSSSLMYE--QDKNESGMYEQDKNESGMYEN

AT3G53390.1 Transducin/WD40 repeat-like superfamily protein1.5e-1172.09Show/hide
Query:  DLNSPDKDPIKSLHFRISNPIYRAFDLEVKDGHDLLIGLNSGD
        DLNS +KDP+KS+HF  SNP+  AFD + KDGHDLLIGLNSGD
Subjt:  DLNSPDKDPIKSLHFRISNPIYRAFDLEVKDGHDLLIGLNSGD

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 85.6e-5628.78Show/hide
Query:  EDPLSYRQAMNDIDKDKWAKAMDFEMESMYFNQVWELVDPPEGVKPIGCKWIYKRKRDVAGKVQTFKASLVAKGYTQREGVDYEETFSPVAMLKSIRILL
        ++P +Y +A   +    W  AMD E+ +M     WE+   P   KPIGCKW+YK K +  G ++ +KA LVAKGYTQ+EG+D+ ETFSPV  L S++++L
Subjt:  EDPLSYRQAMNDIDKDKWAKAMDFEMESMYFNQVWELVDPPEGVKPIGCKWIYKRKRDVAGKVQTFKASLVAKGYTQREGVDYEETFSPVAMLKSIRILL

Query:  SIATFYDYEIWQM-----------------------------------------------EASRSWNIRFDTTIKSFGFDQNVDEPCVYKKINKNKIAFL
        +I+  Y++ + Q+                                               +ASR W ++F  T+  FGF Q+  +   + KI       +
Subjt:  SIATFYDYEIWQM-----------------------------------------------EASRSWNIRFDTTIKSFGFDQNVDEPCVYKKINKNKIAFL

Query:  VLYVDDILLIGNDVGYLSNVKEWIAAQFQIKDLGEAQH--GVHLSKEQS---------------------------PKTPQEV-------EDMRRIPYAS
        ++YVDDI++  N+   +  +K  + + F+++DLG  ++  G+ +++  +                           P  P          + +    Y  
Subjt:  VLYVDDILLIGNDVGYLSNVKEWIAAQFQIKDLGEAQH--GVHLSKEQS---------------------------PKTPQEV-------EDMRRIPYAS

Query:  AVGSLMYVMLCTTPDICYAVGIVSRYQSSPGLDHWTAVKGILKYLRRTRDYMLVFGAK-ELVLTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQ
         +G LMY+ + T  DI +AV  +S++  +P L H  AV  IL Y++ T    L + ++ E+ L  ++D+ FQ+ KD+R+ST+G    L    + W+S KQ
Subjt:  AVGSLMYVMLCTTPDICYAVGIVSRYQSSPGLDHWTAVKGILKYLRRTRDYMLVFGAK-ELVLTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQ

Query:  GCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIEKKYHLIRE
          ++ S+ EAEY A      E +WL +F  +L++   ++ P  L+CDN+ A+  +     H+R KHIE   H +RE
Subjt:  GCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIEKKYHLIRE

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)7.4e-1638.68Show/hide
Query:  WAKAMDFEMESMYFNQVWELVDPPEGVKPIGCKWIYKRKRDVAGKVQTFKASLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMEASR
        W +AM  E++++  N+ W LV PP     +GCKW++K K    G +   KA LVAKG+ Q EG+ + ET+SPV    +IR +L++A        Q+E  +
Subjt:  WAKAMDFEMESMYFNQVWELVDPPEGVKPIGCKWIYKRKRDVAGKVQTFKASLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMEASR

Query:  SWNIRF
        S N  F
Subjt:  SWNIRF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTCCTCAATTATCTCCTTGCTTAAAAATGAACAATTAAATGGCGAAAACTTTCAACAATGGAAATCGAATCTTAATACCATTCTCATGGTAGAGGATCTATGGTT
CGTCTTAACGGAGGAATGTCCTCCTGTTCCCCCTCGCACTGCCGCTCAGGCAGTAAAAGACGCCTACGAACGTTGGACTAAGGCCAATGAAAAGGTCAAAGTCTATATAC
TGGTCAGCTTGTCTGAAGTATTGGCCAAGCGTTACGAGAACGTGGAAACTGCCAGAGAGATTATGAATTCCCTGCAAGAGATGTTTGGACTTCCGTCCTACCAGCTCCAC
CACGACGCCTTGAAGAACGTCTTCAATGCCAAGATGCTGGAAGTTCAATCTGTTCGGGAACATGTCCTGGACATGATTAACCAATTCAATATAGCTGAGGCAAATGGCGG
GGTAGTCTGCGAGCGCAGTCAGGTTGCGTTCATCCTTCACTTGCTCCCAGCGAGCTATCTGTCATTCAGGACGAACGCGAGCATGAATAAGATTCAGTTCAACCTGACTA
CCCTCCTCTCGGAGTTACAGATTTATGAGTCCTTGCAAAAGAGCAAGGGCAAAAATGTGGTTAAAGGAGAGGCCAATGTGGCCCATTCCAAAAAGAAGTTCCTGAAGGGT
TCATCCTTAGGGACTAAATCTGTACCCCAGGCTTCTTCATCGAAGCAAATACAGAAGAGGAAGGGAGACAAGGGGAAGGCTTCTGCAAAGGCTATGCAAGGTAAGGGAAA
GGCCAAGATCGTGGCTGACAAAGGCAGGTGCTTCCACTGCAACGCAGATGGACATTGGAAGCGCAATTGTCCCCGTTACCTCGCTGAGAAGAAGAGAGAAAAAAAAGCCG
AAACGCAAAACAAAAGGCAAAAGATTTCTCCTTTAAGTAACAATACGTATCTTTGGCACCTTCGTCTCGATCATATTAACATCGATCAGATCGATCGTTTGGTTAAAAAT
GGACTTCTAACTGATATAGAAGATACATTTTTACCACCCTGTGAATCGTGTCTCGAGGGTAAAATGACCAAGAGGCCGTTTACTGGAAAAGGTTATAGGGCCAAAGAACC
ACTTGAACTGATACATTCGGATCTTTGTGGTCCGATGAATGTAAAGGCTCGAGGTGGTTTTGAATATTCCATCTCTTTCATAGATGATTATTCTCAATACGGTTACTTAT
ACCTGATGGGCCATAAGTCTGAGGCTCTTGAAAAGTTCAAAGAGTTTAAGGCTGAGGTAGAAAATCTGTTAGGTAAAACTATTAAAATACTTCGATCAGATCGAGGTGGA
GAGTATTTGGATCAGAGATTCCAGGACTATATGATAGAACATGGAATCCAATCCCAACTCTCAACACCTGGTACACCTCAGCAGAATGGTGTATCAGAAAGGAGAAATAG
AACCTTGTTAGACATGGTTCGATCTATGATGAGTTATGCTCAATTGCCTAGCTCGTTTTGGGGGTATGCAGTAGAGACTGCAGTACATATTCTTAACAACGTTCCCTCAA
AAAGTGTTTCTGAAACTCCTTTTGAGTTATGGAGGGGGCGTAAACCTAGTTTGCGTTACTTCCGGATCTGGGGTTGTCCTGCACATGTGTTAGTGACAAATCCTAAGAAA
TTGGAACCACGTACTAGAATATGCCAATTTGTTGGATACCCGAAAGAAACGAGAGGTGGTCTTTTCTACGACCCTCAAGACAATAAAGTATTTGTATCGACAAATGCAAC
TTTCTTGGAGGAAGACCATATTCGAGATCATAGACCACGAAGTAAGTTAGTATTAGGAGAAGCTACAGAAGGATCAACAAGGGTTGTTGATGAACCTGGTCCTTCAACAA
GAGTTGTTGGAGAATCCATCGCCGCAACCCTCTTCTTCTACCTTCAGCTGCCCCAACGCCCCTTCTTCAACCTTTGCCGCCGCGACTCCCATTCTTCTACCTTCAGCCGT
CCCAACTCCTCTTCTTCAACCTTCGGCGCAACAACTCCCCTTCTTCAACCTTCAGCGCCACAACTCTTGTTCTTCCCTTCGGTTGTGGTCTTTCCTTTTCCTCCGGTGAA
GATTTTGCAAACATTCGACCGTGGGTTTCGACCGTGGAAGAATATATGGGATCAAAACAAAACTTTAGCTCAGAGGAAGATGATCCTGAGTATGTCTCATTCAGGTGACC
TTAATTCACCAGATAAGGATCCTATTAAGTCTTTGCATTTTAGAATTTCAAATCCTATATACCGTGCGTTTGATCTTGAAGTTAAGGATGGTCACGACCTACTTATTGGA
TTGAATTCTGGAGATGGTTCTTCAAGTTTGATGTACGAGCAAGACAAGAATGAAAGTGGCATGTACGAGCAAGACAAGAATGAAAGTGGCATGTACGAGAATTTTTCTTT
TTTGGTGATTTTTGTAGTAGTGGAAACACAAGTAGTCATACCTGATGACGACGTAGAAGATCCATTGTCTTATCGTCAGGCAATGAATGACATAGATAAAGACAAGTGGG
CCAAAGCCATGGACTTTGAGATGGAGTCTATGTACTTCAATCAAGTTTGGGAACTTGTAGATCCACCTGAAGGGGTCAAACCCATTGGGTGTAAATGGATCTATAAGAGG
AAAAGAGATGTTGCTGGGAAAGTACAGACTTTCAAAGCCAGTCTTGTAGCAAAGGGTTATACCCAACGAGAAGGGGTGGACTATGAAGAAACCTTTTCTCCAGTTGCTAT
GCTTAAGTCCATAAGAATTCTCTTATCCATTGCCACATTTTATGACTATGAAATATGGCAAATGGAGGCCTCCCGATCTTGGAATATTAGATTTGATACTACGATCAAAT
CTTTTGGTTTTGACCAGAATGTTGATGAGCCTTGTGTATACAAGAAGATCAACAAGAATAAAATAGCTTTCCTCGTACTTTATGTTGACGACATCCTGCTCATTGGGAAT
GATGTAGGATATCTGTCTAACGTAAAAGAATGGATAGCAGCTCAATTCCAAATTAAGGATTTGGGTGAGGCCCAGCATGGGGTTCACTTGTCTAAGGAACAGTCTCCTAA
GACACCTCAAGAAGTTGAGGACATGAGACGGATTCCCTACGCCTCTGCAGTGGGTAGCTTAATGTATGTCATGCTATGCACGACGCCTGACATATGTTATGCTGTAGGGA
TTGTCAGCAGATATCAGTCAAGTCCAGGGTTAGACCACTGGACCGCCGTTAAAGGAATCCTCAAGTATCTTAGGAGAACGAGGGACTACATGCTGGTGTTTGGGGCTAAA
GAGTTAGTTCTCACTGGATACACCGACTCTGACTTCCAAACTGACAAGGATTCTAGGAAATCCACGTCAGGGTCAGTTTTCACCCTTAACGGGGGAGCTGTAGTTTGGAG
AAGTATAAAGCAAGGATGCATAGCAGACTCCACGATGGAGGCTGAGTATGTAGCTGCTTGTGAAGTAGCTAAGGAAGCTGTTTGGCTAAGAAAGTTTTTGACAGATTTGG
AAGTTGTTCCAAATATGAACTTGCCCATTACGTTATATTGTGACAACAGTGGGGCTGTAGCCAATTCAAAGGAACCTCGCAGCCACAAACGAGGTAAACACATAGAGAAG
AAGTATCATCTGATTCGTGAGATTGTGCAAAGAGGAGATGTGGTCGTGACCAAGATTGCTTCCGAGCACAACATTGTTGATCCGTTTACGAAGGCCCTCTCGGCTAAAGT
GTTCGAGGGTCATCTAGAAAGTCTAGGTCTGCGAGACATGTATATAGCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTTCCTCAATTATCTCCTTGCTTAAAAATGAACAATTAAATGGCGAAAACTTTCAACAATGGAAATCGAATCTTAATACCATTCTCATGGTAGAGGATCTATGGTT
CGTCTTAACGGAGGAATGTCCTCCTGTTCCCCCTCGCACTGCCGCTCAGGCAGTAAAAGACGCCTACGAACGTTGGACTAAGGCCAATGAAAAGGTCAAAGTCTATATAC
TGGTCAGCTTGTCTGAAGTATTGGCCAAGCGTTACGAGAACGTGGAAACTGCCAGAGAGATTATGAATTCCCTGCAAGAGATGTTTGGACTTCCGTCCTACCAGCTCCAC
CACGACGCCTTGAAGAACGTCTTCAATGCCAAGATGCTGGAAGTTCAATCTGTTCGGGAACATGTCCTGGACATGATTAACCAATTCAATATAGCTGAGGCAAATGGCGG
GGTAGTCTGCGAGCGCAGTCAGGTTGCGTTCATCCTTCACTTGCTCCCAGCGAGCTATCTGTCATTCAGGACGAACGCGAGCATGAATAAGATTCAGTTCAACCTGACTA
CCCTCCTCTCGGAGTTACAGATTTATGAGTCCTTGCAAAAGAGCAAGGGCAAAAATGTGGTTAAAGGAGAGGCCAATGTGGCCCATTCCAAAAAGAAGTTCCTGAAGGGT
TCATCCTTAGGGACTAAATCTGTACCCCAGGCTTCTTCATCGAAGCAAATACAGAAGAGGAAGGGAGACAAGGGGAAGGCTTCTGCAAAGGCTATGCAAGGTAAGGGAAA
GGCCAAGATCGTGGCTGACAAAGGCAGGTGCTTCCACTGCAACGCAGATGGACATTGGAAGCGCAATTGTCCCCGTTACCTCGCTGAGAAGAAGAGAGAAAAAAAAGCCG
AAACGCAAAACAAAAGGCAAAAGATTTCTCCTTTAAGTAACAATACGTATCTTTGGCACCTTCGTCTCGATCATATTAACATCGATCAGATCGATCGTTTGGTTAAAAAT
GGACTTCTAACTGATATAGAAGATACATTTTTACCACCCTGTGAATCGTGTCTCGAGGGTAAAATGACCAAGAGGCCGTTTACTGGAAAAGGTTATAGGGCCAAAGAACC
ACTTGAACTGATACATTCGGATCTTTGTGGTCCGATGAATGTAAAGGCTCGAGGTGGTTTTGAATATTCCATCTCTTTCATAGATGATTATTCTCAATACGGTTACTTAT
ACCTGATGGGCCATAAGTCTGAGGCTCTTGAAAAGTTCAAAGAGTTTAAGGCTGAGGTAGAAAATCTGTTAGGTAAAACTATTAAAATACTTCGATCAGATCGAGGTGGA
GAGTATTTGGATCAGAGATTCCAGGACTATATGATAGAACATGGAATCCAATCCCAACTCTCAACACCTGGTACACCTCAGCAGAATGGTGTATCAGAAAGGAGAAATAG
AACCTTGTTAGACATGGTTCGATCTATGATGAGTTATGCTCAATTGCCTAGCTCGTTTTGGGGGTATGCAGTAGAGACTGCAGTACATATTCTTAACAACGTTCCCTCAA
AAAGTGTTTCTGAAACTCCTTTTGAGTTATGGAGGGGGCGTAAACCTAGTTTGCGTTACTTCCGGATCTGGGGTTGTCCTGCACATGTGTTAGTGACAAATCCTAAGAAA
TTGGAACCACGTACTAGAATATGCCAATTTGTTGGATACCCGAAAGAAACGAGAGGTGGTCTTTTCTACGACCCTCAAGACAATAAAGTATTTGTATCGACAAATGCAAC
TTTCTTGGAGGAAGACCATATTCGAGATCATAGACCACGAAGTAAGTTAGTATTAGGAGAAGCTACAGAAGGATCAACAAGGGTTGTTGATGAACCTGGTCCTTCAACAA
GAGTTGTTGGAGAATCCATCGCCGCAACCCTCTTCTTCTACCTTCAGCTGCCCCAACGCCCCTTCTTCAACCTTTGCCGCCGCGACTCCCATTCTTCTACCTTCAGCCGT
CCCAACTCCTCTTCTTCAACCTTCGGCGCAACAACTCCCCTTCTTCAACCTTCAGCGCCACAACTCTTGTTCTTCCCTTCGGTTGTGGTCTTTCCTTTTCCTCCGGTGAA
GATTTTGCAAACATTCGACCGTGGGTTTCGACCGTGGAAGAATATATGGGATCAAAACAAAACTTTAGCTCAGAGGAAGATGATCCTGAGTATGTCTCATTCAGGTGACC
TTAATTCACCAGATAAGGATCCTATTAAGTCTTTGCATTTTAGAATTTCAAATCCTATATACCGTGCGTTTGATCTTGAAGTTAAGGATGGTCACGACCTACTTATTGGA
TTGAATTCTGGAGATGGTTCTTCAAGTTTGATGTACGAGCAAGACAAGAATGAAAGTGGCATGTACGAGCAAGACAAGAATGAAAGTGGCATGTACGAGAATTTTTCTTT
TTTGGTGATTTTTGTAGTAGTGGAAACACAAGTAGTCATACCTGATGACGACGTAGAAGATCCATTGTCTTATCGTCAGGCAATGAATGACATAGATAAAGACAAGTGGG
CCAAAGCCATGGACTTTGAGATGGAGTCTATGTACTTCAATCAAGTTTGGGAACTTGTAGATCCACCTGAAGGGGTCAAACCCATTGGGTGTAAATGGATCTATAAGAGG
AAAAGAGATGTTGCTGGGAAAGTACAGACTTTCAAAGCCAGTCTTGTAGCAAAGGGTTATACCCAACGAGAAGGGGTGGACTATGAAGAAACCTTTTCTCCAGTTGCTAT
GCTTAAGTCCATAAGAATTCTCTTATCCATTGCCACATTTTATGACTATGAAATATGGCAAATGGAGGCCTCCCGATCTTGGAATATTAGATTTGATACTACGATCAAAT
CTTTTGGTTTTGACCAGAATGTTGATGAGCCTTGTGTATACAAGAAGATCAACAAGAATAAAATAGCTTTCCTCGTACTTTATGTTGACGACATCCTGCTCATTGGGAAT
GATGTAGGATATCTGTCTAACGTAAAAGAATGGATAGCAGCTCAATTCCAAATTAAGGATTTGGGTGAGGCCCAGCATGGGGTTCACTTGTCTAAGGAACAGTCTCCTAA
GACACCTCAAGAAGTTGAGGACATGAGACGGATTCCCTACGCCTCTGCAGTGGGTAGCTTAATGTATGTCATGCTATGCACGACGCCTGACATATGTTATGCTGTAGGGA
TTGTCAGCAGATATCAGTCAAGTCCAGGGTTAGACCACTGGACCGCCGTTAAAGGAATCCTCAAGTATCTTAGGAGAACGAGGGACTACATGCTGGTGTTTGGGGCTAAA
GAGTTAGTTCTCACTGGATACACCGACTCTGACTTCCAAACTGACAAGGATTCTAGGAAATCCACGTCAGGGTCAGTTTTCACCCTTAACGGGGGAGCTGTAGTTTGGAG
AAGTATAAAGCAAGGATGCATAGCAGACTCCACGATGGAGGCTGAGTATGTAGCTGCTTGTGAAGTAGCTAAGGAAGCTGTTTGGCTAAGAAAGTTTTTGACAGATTTGG
AAGTTGTTCCAAATATGAACTTGCCCATTACGTTATATTGTGACAACAGTGGGGCTGTAGCCAATTCAAAGGAACCTCGCAGCCACAAACGAGGTAAACACATAGAGAAG
AAGTATCATCTGATTCGTGAGATTGTGCAAAGAGGAGATGTGGTCGTGACCAAGATTGCTTCCGAGCACAACATTGTTGATCCGTTTACGAAGGCCCTCTCGGCTAAAGT
GTTCGAGGGTCATCTAGAAAGTCTAGGTCTGCGAGACATGTATATAGCGTAA
Protein sequenceShow/hide protein sequence
MSSSIISLLKNEQLNGENFQQWKSNLNTILMVEDLWFVLTEECPPVPPRTAAQAVKDAYERWTKANEKVKVYILVSLSEVLAKRYENVETAREIMNSLQEMFGLPSYQLH
HDALKNVFNAKMLEVQSVREHVLDMINQFNIAEANGGVVCERSQVAFILHLLPASYLSFRTNASMNKIQFNLTTLLSELQIYESLQKSKGKNVVKGEANVAHSKKKFLKG
SSLGTKSVPQASSSKQIQKRKGDKGKASAKAMQGKGKAKIVADKGRCFHCNADGHWKRNCPRYLAEKKREKKAETQNKRQKISPLSNNTYLWHLRLDHINIDQIDRLVKN
GLLTDIEDTFLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYSISFIDDYSQYGYLYLMGHKSEALEKFKEFKAEVENLLGKTIKILRSDRGG
EYLDQRFQDYMIEHGIQSQLSTPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLRYFRIWGCPAHVLVTNPKK
LEPRTRICQFVGYPKETRGGLFYDPQDNKVFVSTNATFLEEDHIRDHRPRSKLVLGEATEGSTRVVDEPGPSTRVVGESIAATLFFYLQLPQRPFFNLCRRDSHSSTFSR
PNSSSSTFGATTPLLQPSAPQLLFFPSVVVFPFPPVKILQTFDRGFRPWKNIWDQNKTLAQRKMILSMSHSGDLNSPDKDPIKSLHFRISNPIYRAFDLEVKDGHDLLIG
LNSGDGSSSLMYEQDKNESGMYEQDKNESGMYENFSFLVIFVVVETQVVIPDDDVEDPLSYRQAMNDIDKDKWAKAMDFEMESMYFNQVWELVDPPEGVKPIGCKWIYKR
KRDVAGKVQTFKASLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMEASRSWNIRFDTTIKSFGFDQNVDEPCVYKKINKNKIAFLVLYVDDILLIGN
DVGYLSNVKEWIAAQFQIKDLGEAQHGVHLSKEQSPKTPQEVEDMRRIPYASAVGSLMYVMLCTTPDICYAVGIVSRYQSSPGLDHWTAVKGILKYLRRTRDYMLVFGAK
ELVLTGYTDSDFQTDKDSRKSTSGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEVAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIEK
KYHLIREIVQRGDVVVTKIASEHNIVDPFTKALSAKVFEGHLESLGLRDMYIA