; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0012935 (gene) of Snake gourd v1 genome

Gene IDTan0012935
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG04:57519181..57523071
RNA-Seq ExpressionTan0012935
SyntenyTan0012935
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0056.47Show/hide
Query:  MSSAIITLLASDKLVGDNYQSWKNNINTILVIDDIKFVFVEECPRCRPDRLLRSFCDAYDRWIRANEKAKVYIIASMSDILAKKHELMVTTKEIMESLQE
        M+SA + +LA+DKL G+NY SWKN INT+L+IDD++FV VEECP+       R+  + Y+RW +ANEKA+ YI+AS+S++LAKKHE M+T +EIM+SLQE
Subjt:  MSSAIITLLASDKLVGDNYQSWKNNINTILVIDDIKFVFVEECPRCRPDRLLRSFCDAYDRWIRANEKAKVYIIASMSDILAKKHELMVTTKEIMESLQE

Query:  MFGQQSFQVRHDSLKYVFNARMKEGRSVREHVLDMMTHFNLAEMNEASIDESSQVSFILETLPKSFLDLRAIFVMNKINYTLTTLLNELQNFQSLNRVKA
        MFGQ S+Q++HD+LKY++NARM EG SVREHVL+MM HFN+AEMN A IDE+SQVSFILE+LP+SFL  R+  VMNKI YTLTTLLNELQ F+SL ++K 
Subjt:  MFGQQSFQVRHDSLKYVFNARMKEGRSVREHVLDMMTHFNLAEMNEASIDESSQVSFILETLPKSFLDLRAIFVMNKINYTLTTLLNELQNFQSLNRVKA

Query:  SESEANVA--YRSYHRGSTSGTKHVAPS----RPKGKKRMKRDKTNRGAAHKARRSRT----------------------LQRKES---------VSTAM
         + EANVA   R +HRGSTSGTK +  S    + K KK  + +K N  AA   ++++                        ++K++         + T +
Subjt:  SESEANVA--YRSYHRGSTSGTKHVAPS----RPKGKKRMKRDKTNRGAAHKARRSRT----------------------LQRKES---------VSTAM

Query:  EADTGRGTVPNSWPRGRI----KGIDSWQPQREGE-----------------------------------------------------------------
          +     + +S     +    +GI SW+    GE                                                                 
Subjt:  EADTGRGTVPNSWPRGRI----KGIDSWQPQREGE-----------------------------------------------------------------

Query:  ----------------------------------TAETRTKKAKVSPKENVHLWHLRLGHINLNRIEKLVRSGLLNELEENSLPVCESCLEGKMTKRPIS
                                          TA T+ K+ K+SPKEN HLWHLRLGHINLNRIE+LV++GLL+ELEENSLPVCESCLEGKMTKRP +
Subjt:  ----------------------------------TAETRTKKAKVSPKENVHLWHLRLGHINLNRIEKLVRSGLLNELEENSLPVCESCLEGKMTKRPIS

Query:  GKGYRAKEPLELVHSDFCSPMNVKARGGYEYFVSFIDDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITS
        GKG+RAKEPLELVHSD C PMNVKARGG+EYF++F DDYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E GI S
Subjt:  GKGYRAKEPLELVHSDFCSPMNVKARGGYEYFVSFIDDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITS

Query:  QLSAPGMPQQNGVSERRNRILLDMVRSMMSYARLPDSFWGYVVETAVYILNNVLSKSVCETPFELWSGRKGNLHHFRIWGCPTHVLVSNPKSWNPVQIV-
        QLSAPG PQQNGVSERRNR LLDMVRSMMSYA LP+SFWGY V+TAVYILN V SKSV ETP +LW+GRKG+L HFRIWGCP HVL +NPK   P   + 
Subjt:  QLSAPGMPQQNGVSERRNRILLDMVRSMMSYARLPDSFWGYVVETAVYILNNVLSKSVCETPFELWSGRKGNLHHFRIWGCPTHVLVSNPKSWNPVQIV-

Query:  ----------------PIIRRVLVSSNATFLEEDHVRDHLPRSKIVLNEMD----STSARVADVASTSTSVVDPSTSSQI-RSQELGMPRRSGRAVRQLD
                        P   +V VS+NATFLEEDH+R+H PRSKIVLNE+       S RV +  S  T VV   +S++  + Q L  PRRSGR      
Subjt:  ----------------PIIRRVLVSSNATFLEEDHVRDHLPRSKIVLNEMD----STSARVADVASTSTSVVDPSTSSQI-RSQELGMPRRSGRAVRQLD

Query:  RYMDLAETSVVAPDDNCEDPLTYDQAMVDVDKDDWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDY
        RYM L ET  V  D + EDPLT+ +AM DVDKD+WIKAM+ E+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR  D KVQTF ARLVAKG+TQVEGVDY
Subjt:  RYMDLAETSVVAPDDNCEDPLTYDQAMVDVDKDDWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDY

Query:  EETFSPVVMVKSIRILLAIPAYYVYEVWQMDAKIAFLNGKLDETIYIDQPKGFIAQGQEQKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPC
        EETFSPV M+KSIRILL+I AY+ YE+WQMD K AFLNG L+ETIY+ QP+GFI  GQEQK+C+L+RSIYGLKQ SRSWNIRFD AIKSYGFDQ VDEPC
Subjt:  EETFSPVVMVKSIRILLAIPAYYVYEVWQMDAKIAFLNGKLDETIYIDQPKGFIAQGQEQKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPC

Query:  VYKKIVDKTVAFLVLYVDDILLIGNEVEFLIDVKKLLASQFQMKDLGEA------------------------------RYKMQNSKKDLLPFRHGVHLS
        VYK+I++K+VAFLVLYVDDILLIGN++  L D+K+ LA+QFQMKDLGEA                              +Y MQNSK+ LLPFRHGV LS
Subjt:  VYKKIVDKTVAFLVLYVDDILLIGNEVEFLIDVKKLLASQFQMKDLGEA------------------------------RYKMQNSKKDLLPFRHGVHLS

Query:  KDQCPKTPQDVEDMRRISYASAVGSLMYPMLCTRPDIHLCGKESSSRYRSNPRLAHRTEIH---------------FGSPSSRM--------------RE
        K+QCPKTPQDVE+MR I YASAVGSLMY MLCTRPDI        SRY+SNP LAH T +                +GS    +              R+
Subjt:  KDQCPKTPQDVEDMRRISYASAVGSLMYPMLCTRPDIHLCGKESSSRYRSNPRLAHRTEIH---------------FGSPSSRM--------------RE

Query:  SEYG------------RSIKRGCIVDSTMEAEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIREIVH
        S  G            RSIK+GCI DSTMEAEYVA CEAAKEAVWLR F+ DLEVVPNM+ PITL+CDNSGAVANSREPRSHK+GKHIE KYHLIREIVH
Subjt:  SEYG------------RSIKRGCIVDSTMEAEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIREIVH

Query:  RGDVTVT
        RGDV VT
Subjt:  RGDVTVT

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0056.47Show/hide
Query:  MSSAIITLLASDKLVGDNYQSWKNNINTILVIDDIKFVFVEECPRCRPDRLLRSFCDAYDRWIRANEKAKVYIIASMSDILAKKHELMVTTKEIMESLQE
        M+SA + +LA+DKL G+NY SWKN INT+L+IDD++FV VEECP+       R+  + Y+RW +ANEKA+ YI+AS+S++LAKKHE M+T +EIM+SLQE
Subjt:  MSSAIITLLASDKLVGDNYQSWKNNINTILVIDDIKFVFVEECPRCRPDRLLRSFCDAYDRWIRANEKAKVYIIASMSDILAKKHELMVTTKEIMESLQE

Query:  MFGQQSFQVRHDSLKYVFNARMKEGRSVREHVLDMMTHFNLAEMNEASIDESSQVSFILETLPKSFLDLRAIFVMNKINYTLTTLLNELQNFQSLNRVKA
        MFGQ S+Q++HD+LKY++NARM EG SVREHVL+MM HFN+AEMN A IDE+SQVSFILE+LP+SFL  R+  VMNKI YTLTTLLNELQ F+SL ++K 
Subjt:  MFGQQSFQVRHDSLKYVFNARMKEGRSVREHVLDMMTHFNLAEMNEASIDESSQVSFILETLPKSFLDLRAIFVMNKINYTLTTLLNELQNFQSLNRVKA

Query:  SESEANVA--YRSYHRGSTSGTKHVAPS----RPKGKKRMKRDKTNRGAAHKARRSRT----------------------LQRKES---------VSTAM
         + EANVA   R +HRGSTSGTK +  S    + K KK  + +K N  AA   ++++                        ++K++         + T +
Subjt:  SESEANVA--YRSYHRGSTSGTKHVAPS----RPKGKKRMKRDKTNRGAAHKARRSRT----------------------LQRKES---------VSTAM

Query:  EADTGRGTVPNSWPRGRI----KGIDSWQPQREGE-----------------------------------------------------------------
          +     + +S     +    +GI SW+    GE                                                                 
Subjt:  EADTGRGTVPNSWPRGRI----KGIDSWQPQREGE-----------------------------------------------------------------

Query:  ----------------------------------TAETRTKKAKVSPKENVHLWHLRLGHINLNRIEKLVRSGLLNELEENSLPVCESCLEGKMTKRPIS
                                          TA T+ K+ K+SPKEN HLWHLRLGHINLNRIE+LV++GLL+ELEENSLPVCESCLEGKMTKRP +
Subjt:  ----------------------------------TAETRTKKAKVSPKENVHLWHLRLGHINLNRIEKLVRSGLLNELEENSLPVCESCLEGKMTKRPIS

Query:  GKGYRAKEPLELVHSDFCSPMNVKARGGYEYFVSFIDDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITS
        GKG+RAKEPLELVHSD C PMNVKARGG+EYF++F DDYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E GI S
Subjt:  GKGYRAKEPLELVHSDFCSPMNVKARGGYEYFVSFIDDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITS

Query:  QLSAPGMPQQNGVSERRNRILLDMVRSMMSYARLPDSFWGYVVETAVYILNNVLSKSVCETPFELWSGRKGNLHHFRIWGCPTHVLVSNPKSWNPVQIV-
        QLSAPG PQQNGVSERRNR LLDMVRSMMSYA LP+SFWGY V+TAVYILN V SKSV ETP +LW+GRKG+L HFRIWGCP HVL +NPK   P   + 
Subjt:  QLSAPGMPQQNGVSERRNRILLDMVRSMMSYARLPDSFWGYVVETAVYILNNVLSKSVCETPFELWSGRKGNLHHFRIWGCPTHVLVSNPKSWNPVQIV-

Query:  ----------------PIIRRVLVSSNATFLEEDHVRDHLPRSKIVLNEMD----STSARVADVASTSTSVVDPSTSSQI-RSQELGMPRRSGRAVRQLD
                        P   +V VS+NATFLEEDH+R+H PRSKIVLNE+       S RV +  S  T VV   +S++  + Q L  PRRSGR      
Subjt:  ----------------PIIRRVLVSSNATFLEEDHVRDHLPRSKIVLNEMD----STSARVADVASTSTSVVDPSTSSQI-RSQELGMPRRSGRAVRQLD

Query:  RYMDLAETSVVAPDDNCEDPLTYDQAMVDVDKDDWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDY
        RYM L ET  V  D + EDPLT+ +AM DVDKD+WIKAM+ E+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR  D KVQTF ARLVAKG+TQVEGVDY
Subjt:  RYMDLAETSVVAPDDNCEDPLTYDQAMVDVDKDDWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDY

Query:  EETFSPVVMVKSIRILLAIPAYYVYEVWQMDAKIAFLNGKLDETIYIDQPKGFIAQGQEQKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPC
        EETFSPV M+KSIRILL+I AY+ YE+WQMD K AFLNG L+ETIY+ QP+GFI  GQEQK+C+L+RSIYGLKQ SRSWNIRFD AIKSYGFDQ VDEPC
Subjt:  EETFSPVVMVKSIRILLAIPAYYVYEVWQMDAKIAFLNGKLDETIYIDQPKGFIAQGQEQKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPC

Query:  VYKKIVDKTVAFLVLYVDDILLIGNEVEFLIDVKKLLASQFQMKDLGEA------------------------------RYKMQNSKKDLLPFRHGVHLS
        VYK+I++K+VAFLVLYVDDILLIGN++  L D+K+ LA+QFQMKDLGEA                              +Y MQNSK+ LLPFRHGV LS
Subjt:  VYKKIVDKTVAFLVLYVDDILLIGNEVEFLIDVKKLLASQFQMKDLGEA------------------------------RYKMQNSKKDLLPFRHGVHLS

Query:  KDQCPKTPQDVEDMRRISYASAVGSLMYPMLCTRPDIHLCGKESSSRYRSNPRLAHRTEIH---------------FGSPSSRM--------------RE
        K+QCPKTPQDVE+MR I YASAVGSLMY MLCTRPDI        SRY+SNP LAH T +                +GS    +              R+
Subjt:  KDQCPKTPQDVEDMRRISYASAVGSLMYPMLCTRPDIHLCGKESSSRYRSNPRLAHRTEIH---------------FGSPSSRM--------------RE

Query:  SEYG------------RSIKRGCIVDSTMEAEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIREIVH
        S  G            RSIK+GCI DSTMEAEYVA CEAAKEAVWLR F+ DLEVVPNM+ PITL+CDNSGAVANSREPRSHK+GKHIE KYHLIREIVH
Subjt:  SEYG------------RSIKRGCIVDSTMEAEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIREIVH

Query:  RGDVTVT
        RGDV VT
Subjt:  RGDVTVT

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0056.47Show/hide
Query:  MSSAIITLLASDKLVGDNYQSWKNNINTILVIDDIKFVFVEECPRCRPDRLLRSFCDAYDRWIRANEKAKVYIIASMSDILAKKHELMVTTKEIMESLQE
        M+SA + +LA+DKL G+NY SWKN INT+L+IDD++FV VEECP+       R+  + Y+RW +ANEKA+ YI+AS+S++LAKKHE M+T +EIM+SLQE
Subjt:  MSSAIITLLASDKLVGDNYQSWKNNINTILVIDDIKFVFVEECPRCRPDRLLRSFCDAYDRWIRANEKAKVYIIASMSDILAKKHELMVTTKEIMESLQE

Query:  MFGQQSFQVRHDSLKYVFNARMKEGRSVREHVLDMMTHFNLAEMNEASIDESSQVSFILETLPKSFLDLRAIFVMNKINYTLTTLLNELQNFQSLNRVKA
        MFGQ S+Q++HD+LKY++NARM EG SVREHVL+MM HFN+AEMN A IDE+SQVSFILE+LP+SFL  R+  VMNKI YTLTTLLNELQ F+SL ++K 
Subjt:  MFGQQSFQVRHDSLKYVFNARMKEGRSVREHVLDMMTHFNLAEMNEASIDESSQVSFILETLPKSFLDLRAIFVMNKINYTLTTLLNELQNFQSLNRVKA

Query:  SESEANVA--YRSYHRGSTSGTKHVAPS----RPKGKKRMKRDKTNRGAAHKARRSRT----------------------LQRKES---------VSTAM
         + EANVA   R +HRGSTSGTK +  S    + K KK  + +K N  AA   ++++                        ++K++         + T +
Subjt:  SESEANVA--YRSYHRGSTSGTKHVAPS----RPKGKKRMKRDKTNRGAAHKARRSRT----------------------LQRKES---------VSTAM

Query:  EADTGRGTVPNSWPRGRI----KGIDSWQPQREGE-----------------------------------------------------------------
          +     + +S     +    +GI SW+    GE                                                                 
Subjt:  EADTGRGTVPNSWPRGRI----KGIDSWQPQREGE-----------------------------------------------------------------

Query:  ----------------------------------TAETRTKKAKVSPKENVHLWHLRLGHINLNRIEKLVRSGLLNELEENSLPVCESCLEGKMTKRPIS
                                          TA T+ K+ K+SPKEN HLWHLRLGHINLNRIE+LV++GLL+ELEENSLPVCESCLEGKMTKRP +
Subjt:  ----------------------------------TAETRTKKAKVSPKENVHLWHLRLGHINLNRIEKLVRSGLLNELEENSLPVCESCLEGKMTKRPIS

Query:  GKGYRAKEPLELVHSDFCSPMNVKARGGYEYFVSFIDDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITS
        GKG+RAKEPLELVHSD C PMNVKARGG+EYF++F DDYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E GI S
Subjt:  GKGYRAKEPLELVHSDFCSPMNVKARGGYEYFVSFIDDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITS

Query:  QLSAPGMPQQNGVSERRNRILLDMVRSMMSYARLPDSFWGYVVETAVYILNNVLSKSVCETPFELWSGRKGNLHHFRIWGCPTHVLVSNPKSWNPVQIV-
        QLSAPG PQQNGVSERRNR LLDMVRSMMSYA LP+SFWGY V+TAVYILN V SKSV ETP +LW+GRKG+L HFRIWGCP HVL +NPK   P   + 
Subjt:  QLSAPGMPQQNGVSERRNRILLDMVRSMMSYARLPDSFWGYVVETAVYILNNVLSKSVCETPFELWSGRKGNLHHFRIWGCPTHVLVSNPKSWNPVQIV-

Query:  ----------------PIIRRVLVSSNATFLEEDHVRDHLPRSKIVLNEMD----STSARVADVASTSTSVVDPSTSSQI-RSQELGMPRRSGRAVRQLD
                        P   +V VS+NATFLEEDH+R+H PRSKIVLNE+       S RV +  S  T VV   +S++  + Q L  PRRSGR      
Subjt:  ----------------PIIRRVLVSSNATFLEEDHVRDHLPRSKIVLNEMD----STSARVADVASTSTSVVDPSTSSQI-RSQELGMPRRSGRAVRQLD

Query:  RYMDLAETSVVAPDDNCEDPLTYDQAMVDVDKDDWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDY
        RYM L ET  V  D + EDPLT+ +AM DVDKD+WIKAM+ E+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR  D KVQTF ARLVAKG+TQVEGVDY
Subjt:  RYMDLAETSVVAPDDNCEDPLTYDQAMVDVDKDDWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDY

Query:  EETFSPVVMVKSIRILLAIPAYYVYEVWQMDAKIAFLNGKLDETIYIDQPKGFIAQGQEQKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPC
        EETFSPV M+KSIRILL+I AY+ YE+WQMD K AFLNG L+ETIY+ QP+GFI  GQEQK+C+L+RSIYGLKQ SRSWNIRFD AIKSYGFDQ VDEPC
Subjt:  EETFSPVVMVKSIRILLAIPAYYVYEVWQMDAKIAFLNGKLDETIYIDQPKGFIAQGQEQKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPC

Query:  VYKKIVDKTVAFLVLYVDDILLIGNEVEFLIDVKKLLASQFQMKDLGEA------------------------------RYKMQNSKKDLLPFRHGVHLS
        VYK+I++K+VAFLVLYVDDILLIGN++  L D+K+ LA+QFQMKDLGEA                              +Y MQNSK+ LLPFRHGV LS
Subjt:  VYKKIVDKTVAFLVLYVDDILLIGNEVEFLIDVKKLLASQFQMKDLGEA------------------------------RYKMQNSKKDLLPFRHGVHLS

Query:  KDQCPKTPQDVEDMRRISYASAVGSLMYPMLCTRPDIHLCGKESSSRYRSNPRLAHRTEIH---------------FGSPSSRM--------------RE
        K+QCPKTPQDVE+MR I YASAVGSLMY MLCTRPDI        SRY+SNP LAH T +                +GS    +              R+
Subjt:  KDQCPKTPQDVEDMRRISYASAVGSLMYPMLCTRPDIHLCGKESSSRYRSNPRLAHRTEIH---------------FGSPSSRM--------------RE

Query:  SEYG------------RSIKRGCIVDSTMEAEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIREIVH
        S  G            RSIK+GCI DSTMEAEYVA CEAAKEAVWLR F+ DLEVVPNM+ PITL+CDNSGAVANSREPRSHK+GKHIE KYHLIREIVH
Subjt:  SEYG------------RSIKRGCIVDSTMEAEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIREIVH

Query:  RGDVTVT
        RGDV VT
Subjt:  RGDVTVT

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0056.47Show/hide
Query:  MSSAIITLLASDKLVGDNYQSWKNNINTILVIDDIKFVFVEECPRCRPDRLLRSFCDAYDRWIRANEKAKVYIIASMSDILAKKHELMVTTKEIMESLQE
        M+SA + +LA+DKL G+NY SWKN INT+L+IDD++FV VEECP+       R+  + Y+RW +ANEKA+ YI+AS+S++LAKKHE M+T +EIM+SLQE
Subjt:  MSSAIITLLASDKLVGDNYQSWKNNINTILVIDDIKFVFVEECPRCRPDRLLRSFCDAYDRWIRANEKAKVYIIASMSDILAKKHELMVTTKEIMESLQE

Query:  MFGQQSFQVRHDSLKYVFNARMKEGRSVREHVLDMMTHFNLAEMNEASIDESSQVSFILETLPKSFLDLRAIFVMNKINYTLTTLLNELQNFQSLNRVKA
        MFGQ S+Q++HD+LKY++NARM EG SVREHVL+MM HFN+AEMN A IDE+SQVSFILE+LP+SFL  R+  VMNKI YTLTTLLNELQ F+SL ++K 
Subjt:  MFGQQSFQVRHDSLKYVFNARMKEGRSVREHVLDMMTHFNLAEMNEASIDESSQVSFILETLPKSFLDLRAIFVMNKINYTLTTLLNELQNFQSLNRVKA

Query:  SESEANVA--YRSYHRGSTSGTKHVAPS----RPKGKKRMKRDKTNRGAAHKARRSRT----------------------LQRKES---------VSTAM
         + EANVA   R +HRGSTSGTK +  S    + K KK  + +K N  AA   ++++                        ++K++         + T +
Subjt:  SESEANVA--YRSYHRGSTSGTKHVAPS----RPKGKKRMKRDKTNRGAAHKARRSRT----------------------LQRKES---------VSTAM

Query:  EADTGRGTVPNSWPRGRI----KGIDSWQPQREGE-----------------------------------------------------------------
          +     + +S     +    +GI SW+    GE                                                                 
Subjt:  EADTGRGTVPNSWPRGRI----KGIDSWQPQREGE-----------------------------------------------------------------

Query:  ----------------------------------TAETRTKKAKVSPKENVHLWHLRLGHINLNRIEKLVRSGLLNELEENSLPVCESCLEGKMTKRPIS
                                          TA T+ K+ K+SPKEN HLWHLRLGHINLNRIE+LV++GLL+ELEENSLPVCESCLEGKMTKRP +
Subjt:  ----------------------------------TAETRTKKAKVSPKENVHLWHLRLGHINLNRIEKLVRSGLLNELEENSLPVCESCLEGKMTKRPIS

Query:  GKGYRAKEPLELVHSDFCSPMNVKARGGYEYFVSFIDDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITS
        GKG+RAKEPLELVHSD C PMNVKARGG+EYF++F DDYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E GI S
Subjt:  GKGYRAKEPLELVHSDFCSPMNVKARGGYEYFVSFIDDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITS

Query:  QLSAPGMPQQNGVSERRNRILLDMVRSMMSYARLPDSFWGYVVETAVYILNNVLSKSVCETPFELWSGRKGNLHHFRIWGCPTHVLVSNPKSWNPVQIV-
        QLSAPG PQQNGVSERRNR LLDMVRSMMSYA LP+SFWGY V+TAVYILN V SKSV ETP +LW+GRKG+L HFRIWGCP HVL +NPK   P   + 
Subjt:  QLSAPGMPQQNGVSERRNRILLDMVRSMMSYARLPDSFWGYVVETAVYILNNVLSKSVCETPFELWSGRKGNLHHFRIWGCPTHVLVSNPKSWNPVQIV-

Query:  ----------------PIIRRVLVSSNATFLEEDHVRDHLPRSKIVLNEMD----STSARVADVASTSTSVVDPSTSSQI-RSQELGMPRRSGRAVRQLD
                        P   +V VS+NATFLEEDH+R+H PRSKIVLNE+       S RV +  S  T VV   +S++  + Q L  PRRSGR      
Subjt:  ----------------PIIRRVLVSSNATFLEEDHVRDHLPRSKIVLNEMD----STSARVADVASTSTSVVDPSTSSQI-RSQELGMPRRSGRAVRQLD

Query:  RYMDLAETSVVAPDDNCEDPLTYDQAMVDVDKDDWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDY
        RYM L ET  V  D + EDPLT+ +AM DVDKD+WIKAM+ E+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR  D KVQTF ARLVAKG+TQVEGVDY
Subjt:  RYMDLAETSVVAPDDNCEDPLTYDQAMVDVDKDDWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDY

Query:  EETFSPVVMVKSIRILLAIPAYYVYEVWQMDAKIAFLNGKLDETIYIDQPKGFIAQGQEQKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPC
        EETFSPV M+KSIRILL+I AY+ YE+WQMD K AFLNG L+ETIY+ QP+GFI  GQEQK+C+L+RSIYGLKQ SRSWNIRFD AIKSYGFDQ VDEPC
Subjt:  EETFSPVVMVKSIRILLAIPAYYVYEVWQMDAKIAFLNGKLDETIYIDQPKGFIAQGQEQKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPC

Query:  VYKKIVDKTVAFLVLYVDDILLIGNEVEFLIDVKKLLASQFQMKDLGEA------------------------------RYKMQNSKKDLLPFRHGVHLS
        VYK+I++K+VAFLVLYVDDILLIGN++  L D+K+ LA+QFQMKDLGEA                              +Y MQNSK+ LLPFRHGV LS
Subjt:  VYKKIVDKTVAFLVLYVDDILLIGNEVEFLIDVKKLLASQFQMKDLGEA------------------------------RYKMQNSKKDLLPFRHGVHLS

Query:  KDQCPKTPQDVEDMRRISYASAVGSLMYPMLCTRPDIHLCGKESSSRYRSNPRLAHRTEIH---------------FGSPSSRM--------------RE
        K+QCPKTPQDVE+MR I YASAVGSLMY MLCTRPDI        SRY+SNP LAH T +                +GS    +              R+
Subjt:  KDQCPKTPQDVEDMRRISYASAVGSLMYPMLCTRPDIHLCGKESSSRYRSNPRLAHRTEIH---------------FGSPSSRM--------------RE

Query:  SEYG------------RSIKRGCIVDSTMEAEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIREIVH
        S  G            RSIK+GCI DSTMEAEYVA CEAAKEAVWLR F+ DLEVVPNM+ PITL+CDNSGAVANSREPRSHK+GKHIE KYHLIREIVH
Subjt:  SEYG------------RSIKRGCIVDSTMEAEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIREIVH

Query:  RGDVTVT
        RGDV VT
Subjt:  RGDVTVT

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0056.47Show/hide
Query:  MSSAIITLLASDKLVGDNYQSWKNNINTILVIDDIKFVFVEECPRCRPDRLLRSFCDAYDRWIRANEKAKVYIIASMSDILAKKHELMVTTKEIMESLQE
        M+SA + +LA+DKL G+NY SWKN INT+L+IDD++FV VEECP+       R+  + Y+RW +ANEKA+ YI+AS+S++LAKKHE M+T +EIM+SLQE
Subjt:  MSSAIITLLASDKLVGDNYQSWKNNINTILVIDDIKFVFVEECPRCRPDRLLRSFCDAYDRWIRANEKAKVYIIASMSDILAKKHELMVTTKEIMESLQE

Query:  MFGQQSFQVRHDSLKYVFNARMKEGRSVREHVLDMMTHFNLAEMNEASIDESSQVSFILETLPKSFLDLRAIFVMNKINYTLTTLLNELQNFQSLNRVKA
        MFGQ S+Q++HD+LKY++NARM EG SVREHVL+MM HFN+AEMN A IDE+SQVSFILE+LP+SFL  R+  VMNKI YTLTTLLNELQ F+SL ++K 
Subjt:  MFGQQSFQVRHDSLKYVFNARMKEGRSVREHVLDMMTHFNLAEMNEASIDESSQVSFILETLPKSFLDLRAIFVMNKINYTLTTLLNELQNFQSLNRVKA

Query:  SESEANVA--YRSYHRGSTSGTKHVAPS----RPKGKKRMKRDKTNRGAAHKARRSRT----------------------LQRKES---------VSTAM
         + EANVA   R +HRGSTSGTK +  S    + K KK  + +K N  AA   ++++                        ++K++         + T +
Subjt:  SESEANVA--YRSYHRGSTSGTKHVAPS----RPKGKKRMKRDKTNRGAAHKARRSRT----------------------LQRKES---------VSTAM

Query:  EADTGRGTVPNSWPRGRI----KGIDSWQPQREGE-----------------------------------------------------------------
          +     + +S     +    +GI SW+    GE                                                                 
Subjt:  EADTGRGTVPNSWPRGRI----KGIDSWQPQREGE-----------------------------------------------------------------

Query:  ----------------------------------TAETRTKKAKVSPKENVHLWHLRLGHINLNRIEKLVRSGLLNELEENSLPVCESCLEGKMTKRPIS
                                          TA T+ K+ K+SPKEN HLWHLRLGHINLNRIE+LV++GLL+ELEENSLPVCESCLEGKMTKRP +
Subjt:  ----------------------------------TAETRTKKAKVSPKENVHLWHLRLGHINLNRIEKLVRSGLLNELEENSLPVCESCLEGKMTKRPIS

Query:  GKGYRAKEPLELVHSDFCSPMNVKARGGYEYFVSFIDDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITS
        GKG+RAKEPLELVHSD C PMNVKARGG+EYF++F DDYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E GI S
Subjt:  GKGYRAKEPLELVHSDFCSPMNVKARGGYEYFVSFIDDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITS

Query:  QLSAPGMPQQNGVSERRNRILLDMVRSMMSYARLPDSFWGYVVETAVYILNNVLSKSVCETPFELWSGRKGNLHHFRIWGCPTHVLVSNPKSWNPVQIV-
        QLSAPG PQQNGVSERRNR LLDMVRSMMSYA LP+SFWGY V+TAVYILN V SKSV ETP +LW+GRKG+L HFRIWGCP HVL +NPK   P   + 
Subjt:  QLSAPGMPQQNGVSERRNRILLDMVRSMMSYARLPDSFWGYVVETAVYILNNVLSKSVCETPFELWSGRKGNLHHFRIWGCPTHVLVSNPKSWNPVQIV-

Query:  ----------------PIIRRVLVSSNATFLEEDHVRDHLPRSKIVLNEMD----STSARVADVASTSTSVVDPSTSSQI-RSQELGMPRRSGRAVRQLD
                        P   +V VS+NATFLEEDH+R+H PRSKIVLNE+       S RV +  S  T VV   +S++  + Q L  PRRSGR      
Subjt:  ----------------PIIRRVLVSSNATFLEEDHVRDHLPRSKIVLNEMD----STSARVADVASTSTSVVDPSTSSQI-RSQELGMPRRSGRAVRQLD

Query:  RYMDLAETSVVAPDDNCEDPLTYDQAMVDVDKDDWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDY
        RYM L ET  V  D + EDPLT+ +AM DVDKD+WIKAM+ E+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR  D KVQTF ARLVAKG+TQVEGVDY
Subjt:  RYMDLAETSVVAPDDNCEDPLTYDQAMVDVDKDDWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDY

Query:  EETFSPVVMVKSIRILLAIPAYYVYEVWQMDAKIAFLNGKLDETIYIDQPKGFIAQGQEQKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPC
        EETFSPV M+KSIRILL+I AY+ YE+WQMD K AFLNG L+ETIY+ QP+GFI  GQEQK+C+L+RSIYGLKQ SRSWNIRFD AIKSYGFDQ VDEPC
Subjt:  EETFSPVVMVKSIRILLAIPAYYVYEVWQMDAKIAFLNGKLDETIYIDQPKGFIAQGQEQKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPC

Query:  VYKKIVDKTVAFLVLYVDDILLIGNEVEFLIDVKKLLASQFQMKDLGEA------------------------------RYKMQNSKKDLLPFRHGVHLS
        VYK+I++K+VAFLVLYVDDILLIGN++  L D+K+ LA+QFQMKDLGEA                              +Y MQNSK+ LLPFRHGV LS
Subjt:  VYKKIVDKTVAFLVLYVDDILLIGNEVEFLIDVKKLLASQFQMKDLGEA------------------------------RYKMQNSKKDLLPFRHGVHLS

Query:  KDQCPKTPQDVEDMRRISYASAVGSLMYPMLCTRPDIHLCGKESSSRYRSNPRLAHRTEIH---------------FGSPSSRM--------------RE
        K+QCPKTPQDVE+MR I YASAVGSLMY MLCTRPDI        SRY+SNP LAH T +                +GS    +              R+
Subjt:  KDQCPKTPQDVEDMRRISYASAVGSLMYPMLCTRPDIHLCGKESSSRYRSNPRLAHRTEIH---------------FGSPSSRM--------------RE

Query:  SEYG------------RSIKRGCIVDSTMEAEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIREIVH
        S  G            RSIK+GCI DSTMEAEYVA CEAAKEAVWLR F+ DLEVVPNM+ PITL+CDNSGAVANSREPRSHK+GKHIE KYHLIREIVH
Subjt:  SEYG------------RSIKRGCIVDSTMEAEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIREIVH

Query:  RGDVTVT
        RGDV VT
Subjt:  RGDVTVT

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein0.0e+0056.47Show/hide
Query:  MSSAIITLLASDKLVGDNYQSWKNNINTILVIDDIKFVFVEECPRCRPDRLLRSFCDAYDRWIRANEKAKVYIIASMSDILAKKHELMVTTKEIMESLQE
        M+SA + +LA+DKL G+NY SWKN INT+L+IDD++FV VEECP+       R+  + Y+RW +ANEKA+ YI+AS+S++LAKKHE M+T +EIM+SLQE
Subjt:  MSSAIITLLASDKLVGDNYQSWKNNINTILVIDDIKFVFVEECPRCRPDRLLRSFCDAYDRWIRANEKAKVYIIASMSDILAKKHELMVTTKEIMESLQE

Query:  MFGQQSFQVRHDSLKYVFNARMKEGRSVREHVLDMMTHFNLAEMNEASIDESSQVSFILETLPKSFLDLRAIFVMNKINYTLTTLLNELQNFQSLNRVKA
        MFGQ S+Q++HD+LKY++NARM EG SVREHVL+MM HFN+AEMN A IDE+SQVSFILE+LP+SFL  R+  VMNKI YTLTTLLNELQ F+SL ++K 
Subjt:  MFGQQSFQVRHDSLKYVFNARMKEGRSVREHVLDMMTHFNLAEMNEASIDESSQVSFILETLPKSFLDLRAIFVMNKINYTLTTLLNELQNFQSLNRVKA

Query:  SESEANVA--YRSYHRGSTSGTKHVAPS----RPKGKKRMKRDKTNRGAAHKARRSRT----------------------LQRKES---------VSTAM
         + EANVA   R +HRGSTSGTK +  S    + K KK  + +K N  AA   ++++                        ++K++         + T +
Subjt:  SESEANVA--YRSYHRGSTSGTKHVAPS----RPKGKKRMKRDKTNRGAAHKARRSRT----------------------LQRKES---------VSTAM

Query:  EADTGRGTVPNSWPRGRI----KGIDSWQPQREGE-----------------------------------------------------------------
          +     + +S     +    +GI SW+    GE                                                                 
Subjt:  EADTGRGTVPNSWPRGRI----KGIDSWQPQREGE-----------------------------------------------------------------

Query:  ----------------------------------TAETRTKKAKVSPKENVHLWHLRLGHINLNRIEKLVRSGLLNELEENSLPVCESCLEGKMTKRPIS
                                          TA T+ K+ K+SPKEN HLWHLRLGHINLNRIE+LV++GLL+ELEENSLPVCESCLEGKMTKRP +
Subjt:  ----------------------------------TAETRTKKAKVSPKENVHLWHLRLGHINLNRIEKLVRSGLLNELEENSLPVCESCLEGKMTKRPIS

Query:  GKGYRAKEPLELVHSDFCSPMNVKARGGYEYFVSFIDDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITS
        GKG+RAKEPLELVHSD C PMNVKARGG+EYF++F DDYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E GI S
Subjt:  GKGYRAKEPLELVHSDFCSPMNVKARGGYEYFVSFIDDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITS

Query:  QLSAPGMPQQNGVSERRNRILLDMVRSMMSYARLPDSFWGYVVETAVYILNNVLSKSVCETPFELWSGRKGNLHHFRIWGCPTHVLVSNPKSWNPVQIV-
        QLSAPG PQQNGVSERRNR LLDMVRSMMSYA LP+SFWGY V+TAVYILN V SKSV ETP +LW+GRKG+L HFRIWGCP HVL +NPK   P   + 
Subjt:  QLSAPGMPQQNGVSERRNRILLDMVRSMMSYARLPDSFWGYVVETAVYILNNVLSKSVCETPFELWSGRKGNLHHFRIWGCPTHVLVSNPKSWNPVQIV-

Query:  ----------------PIIRRVLVSSNATFLEEDHVRDHLPRSKIVLNEMD----STSARVADVASTSTSVVDPSTSSQI-RSQELGMPRRSGRAVRQLD
                        P   +V VS+NATFLEEDH+R+H PRSKIVLNE+       S RV +  S  T VV   +S++  + Q L  PRRSGR      
Subjt:  ----------------PIIRRVLVSSNATFLEEDHVRDHLPRSKIVLNEMD----STSARVADVASTSTSVVDPSTSSQI-RSQELGMPRRSGRAVRQLD

Query:  RYMDLAETSVVAPDDNCEDPLTYDQAMVDVDKDDWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDY
        RYM L ET  V  D + EDPLT+ +AM DVDKD+WIKAM+ E+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR  D KVQTF ARLVAKG+TQVEGVDY
Subjt:  RYMDLAETSVVAPDDNCEDPLTYDQAMVDVDKDDWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDY

Query:  EETFSPVVMVKSIRILLAIPAYYVYEVWQMDAKIAFLNGKLDETIYIDQPKGFIAQGQEQKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPC
        EETFSPV M+KSIRILL+I AY+ YE+WQMD K AFLNG L+ETIY+ QP+GFI  GQEQK+C+L+RSIYGLKQ SRSWNIRFD AIKSYGFDQ VDEPC
Subjt:  EETFSPVVMVKSIRILLAIPAYYVYEVWQMDAKIAFLNGKLDETIYIDQPKGFIAQGQEQKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPC

Query:  VYKKIVDKTVAFLVLYVDDILLIGNEVEFLIDVKKLLASQFQMKDLGEA------------------------------RYKMQNSKKDLLPFRHGVHLS
        VYK+I++K+VAFLVLYVDDILLIGN++  L D+K+ LA+QFQMKDLGEA                              +Y MQNSK+ LLPFRHGV LS
Subjt:  VYKKIVDKTVAFLVLYVDDILLIGNEVEFLIDVKKLLASQFQMKDLGEA------------------------------RYKMQNSKKDLLPFRHGVHLS

Query:  KDQCPKTPQDVEDMRRISYASAVGSLMYPMLCTRPDIHLCGKESSSRYRSNPRLAHRTEIH---------------FGSPSSRM--------------RE
        K+QCPKTPQDVE+MR I YASAVGSLMY MLCTRPDI        SRY+SNP LAH T +                +GS    +              R+
Subjt:  KDQCPKTPQDVEDMRRISYASAVGSLMYPMLCTRPDIHLCGKESSSRYRSNPRLAHRTEIH---------------FGSPSSRM--------------RE

Query:  SEYG------------RSIKRGCIVDSTMEAEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIREIVH
        S  G            RSIK+GCI DSTMEAEYVA CEAAKEAVWLR F+ DLEVVPNM+ PITL+CDNSGAVANSREPRSHK+GKHIE KYHLIREIVH
Subjt:  SEYG------------RSIKRGCIVDSTMEAEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIREIVH

Query:  RGDVTVT
        RGDV VT
Subjt:  RGDVTVT

A0A5A7TWB9 Gag/pol protein0.0e+0056.47Show/hide
Query:  MSSAIITLLASDKLVGDNYQSWKNNINTILVIDDIKFVFVEECPRCRPDRLLRSFCDAYDRWIRANEKAKVYIIASMSDILAKKHELMVTTKEIMESLQE
        M+SA + +LA+DKL G+NY SWKN INT+L+IDD++FV VEECP+       R+  + Y+RW +ANEKA+ YI+AS+S++LAKKHE M+T +EIM+SLQE
Subjt:  MSSAIITLLASDKLVGDNYQSWKNNINTILVIDDIKFVFVEECPRCRPDRLLRSFCDAYDRWIRANEKAKVYIIASMSDILAKKHELMVTTKEIMESLQE

Query:  MFGQQSFQVRHDSLKYVFNARMKEGRSVREHVLDMMTHFNLAEMNEASIDESSQVSFILETLPKSFLDLRAIFVMNKINYTLTTLLNELQNFQSLNRVKA
        MFGQ S+Q++HD+LKY++NARM EG SVREHVL+MM HFN+AEMN A IDE+SQVSFILE+LP+SFL  R+  VMNKI YTLTTLLNELQ F+SL ++K 
Subjt:  MFGQQSFQVRHDSLKYVFNARMKEGRSVREHVLDMMTHFNLAEMNEASIDESSQVSFILETLPKSFLDLRAIFVMNKINYTLTTLLNELQNFQSLNRVKA

Query:  SESEANVA--YRSYHRGSTSGTKHVAPS----RPKGKKRMKRDKTNRGAAHKARRSRT----------------------LQRKES---------VSTAM
         + EANVA   R +HRGSTSGTK +  S    + K KK  + +K N  AA   ++++                        ++K++         + T +
Subjt:  SESEANVA--YRSYHRGSTSGTKHVAPS----RPKGKKRMKRDKTNRGAAHKARRSRT----------------------LQRKES---------VSTAM

Query:  EADTGRGTVPNSWPRGRI----KGIDSWQPQREGE-----------------------------------------------------------------
          +     + +S     +    +GI SW+    GE                                                                 
Subjt:  EADTGRGTVPNSWPRGRI----KGIDSWQPQREGE-----------------------------------------------------------------

Query:  ----------------------------------TAETRTKKAKVSPKENVHLWHLRLGHINLNRIEKLVRSGLLNELEENSLPVCESCLEGKMTKRPIS
                                          TA T+ K+ K+SPKEN HLWHLRLGHINLNRIE+LV++GLL+ELEENSLPVCESCLEGKMTKRP +
Subjt:  ----------------------------------TAETRTKKAKVSPKENVHLWHLRLGHINLNRIEKLVRSGLLNELEENSLPVCESCLEGKMTKRPIS

Query:  GKGYRAKEPLELVHSDFCSPMNVKARGGYEYFVSFIDDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITS
        GKG+RAKEPLELVHSD C PMNVKARGG+EYF++F DDYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E GI S
Subjt:  GKGYRAKEPLELVHSDFCSPMNVKARGGYEYFVSFIDDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITS

Query:  QLSAPGMPQQNGVSERRNRILLDMVRSMMSYARLPDSFWGYVVETAVYILNNVLSKSVCETPFELWSGRKGNLHHFRIWGCPTHVLVSNPKSWNPVQIV-
        QLSAPG PQQNGVSERRNR LLDMVRSMMSYA LP+SFWGY V+TAVYILN V SKSV ETP +LW+GRKG+L HFRIWGCP HVL +NPK   P   + 
Subjt:  QLSAPGMPQQNGVSERRNRILLDMVRSMMSYARLPDSFWGYVVETAVYILNNVLSKSVCETPFELWSGRKGNLHHFRIWGCPTHVLVSNPKSWNPVQIV-

Query:  ----------------PIIRRVLVSSNATFLEEDHVRDHLPRSKIVLNEMD----STSARVADVASTSTSVVDPSTSSQI-RSQELGMPRRSGRAVRQLD
                        P   +V VS+NATFLEEDH+R+H PRSKIVLNE+       S RV +  S  T VV   +S++  + Q L  PRRSGR      
Subjt:  ----------------PIIRRVLVSSNATFLEEDHVRDHLPRSKIVLNEMD----STSARVADVASTSTSVVDPSTSSQI-RSQELGMPRRSGRAVRQLD

Query:  RYMDLAETSVVAPDDNCEDPLTYDQAMVDVDKDDWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDY
        RYM L ET  V  D + EDPLT+ +AM DVDKD+WIKAM+ E+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR  D KVQTF ARLVAKG+TQVEGVDY
Subjt:  RYMDLAETSVVAPDDNCEDPLTYDQAMVDVDKDDWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDY

Query:  EETFSPVVMVKSIRILLAIPAYYVYEVWQMDAKIAFLNGKLDETIYIDQPKGFIAQGQEQKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPC
        EETFSPV M+KSIRILL+I AY+ YE+WQMD K AFLNG L+ETIY+ QP+GFI  GQEQK+C+L+RSIYGLKQ SRSWNIRFD AIKSYGFDQ VDEPC
Subjt:  EETFSPVVMVKSIRILLAIPAYYVYEVWQMDAKIAFLNGKLDETIYIDQPKGFIAQGQEQKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPC

Query:  VYKKIVDKTVAFLVLYVDDILLIGNEVEFLIDVKKLLASQFQMKDLGEA------------------------------RYKMQNSKKDLLPFRHGVHLS
        VYK+I++K+VAFLVLYVDDILLIGN++  L D+K+ LA+QFQMKDLGEA                              +Y MQNSK+ LLPFRHGV LS
Subjt:  VYKKIVDKTVAFLVLYVDDILLIGNEVEFLIDVKKLLASQFQMKDLGEA------------------------------RYKMQNSKKDLLPFRHGVHLS

Query:  KDQCPKTPQDVEDMRRISYASAVGSLMYPMLCTRPDIHLCGKESSSRYRSNPRLAHRTEIH---------------FGSPSSRM--------------RE
        K+QCPKTPQDVE+MR I YASAVGSLMY MLCTRPDI        SRY+SNP LAH T +                +GS    +              R+
Subjt:  KDQCPKTPQDVEDMRRISYASAVGSLMYPMLCTRPDIHLCGKESSSRYRSNPRLAHRTEIH---------------FGSPSSRM--------------RE

Query:  SEYG------------RSIKRGCIVDSTMEAEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIREIVH
        S  G            RSIK+GCI DSTMEAEYVA CEAAKEAVWLR F+ DLEVVPNM+ PITL+CDNSGAVANSREPRSHK+GKHIE KYHLIREIVH
Subjt:  SEYG------------RSIKRGCIVDSTMEAEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIREIVH

Query:  RGDVTVT
        RGDV VT
Subjt:  RGDVTVT

A0A5A7UGV2 Gag/pol protein0.0e+0056.47Show/hide
Query:  MSSAIITLLASDKLVGDNYQSWKNNINTILVIDDIKFVFVEECPRCRPDRLLRSFCDAYDRWIRANEKAKVYIIASMSDILAKKHELMVTTKEIMESLQE
        M+SA + +LA+DKL G+NY SWKN INT+L+IDD++FV VEECP+       R+  + Y+RW +ANEKA+ YI+AS+S++LAKKHE M+T +EIM+SLQE
Subjt:  MSSAIITLLASDKLVGDNYQSWKNNINTILVIDDIKFVFVEECPRCRPDRLLRSFCDAYDRWIRANEKAKVYIIASMSDILAKKHELMVTTKEIMESLQE

Query:  MFGQQSFQVRHDSLKYVFNARMKEGRSVREHVLDMMTHFNLAEMNEASIDESSQVSFILETLPKSFLDLRAIFVMNKINYTLTTLLNELQNFQSLNRVKA
        MFGQ S+Q++HD+LKY++NARM EG SVREHVL+MM HFN+AEMN A IDE+SQVSFILE+LP+SFL  R+  VMNKI YTLTTLLNELQ F+SL ++K 
Subjt:  MFGQQSFQVRHDSLKYVFNARMKEGRSVREHVLDMMTHFNLAEMNEASIDESSQVSFILETLPKSFLDLRAIFVMNKINYTLTTLLNELQNFQSLNRVKA

Query:  SESEANVA--YRSYHRGSTSGTKHVAPS----RPKGKKRMKRDKTNRGAAHKARRSRT----------------------LQRKES---------VSTAM
         + EANVA   R +HRGSTSGTK +  S    + K KK  + +K N  AA   ++++                        ++K++         + T +
Subjt:  SESEANVA--YRSYHRGSTSGTKHVAPS----RPKGKKRMKRDKTNRGAAHKARRSRT----------------------LQRKES---------VSTAM

Query:  EADTGRGTVPNSWPRGRI----KGIDSWQPQREGE-----------------------------------------------------------------
          +     + +S     +    +GI SW+    GE                                                                 
Subjt:  EADTGRGTVPNSWPRGRI----KGIDSWQPQREGE-----------------------------------------------------------------

Query:  ----------------------------------TAETRTKKAKVSPKENVHLWHLRLGHINLNRIEKLVRSGLLNELEENSLPVCESCLEGKMTKRPIS
                                          TA T+ K+ K+SPKEN HLWHLRLGHINLNRIE+LV++GLL+ELEENSLPVCESCLEGKMTKRP +
Subjt:  ----------------------------------TAETRTKKAKVSPKENVHLWHLRLGHINLNRIEKLVRSGLLNELEENSLPVCESCLEGKMTKRPIS

Query:  GKGYRAKEPLELVHSDFCSPMNVKARGGYEYFVSFIDDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITS
        GKG+RAKEPLELVHSD C PMNVKARGG+EYF++F DDYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E GI S
Subjt:  GKGYRAKEPLELVHSDFCSPMNVKARGGYEYFVSFIDDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITS

Query:  QLSAPGMPQQNGVSERRNRILLDMVRSMMSYARLPDSFWGYVVETAVYILNNVLSKSVCETPFELWSGRKGNLHHFRIWGCPTHVLVSNPKSWNPVQIV-
        QLSAPG PQQNGVSERRNR LLDMVRSMMSYA LP+SFWGY V+TAVYILN V SKSV ETP +LW+GRKG+L HFRIWGCP HVL +NPK   P   + 
Subjt:  QLSAPGMPQQNGVSERRNRILLDMVRSMMSYARLPDSFWGYVVETAVYILNNVLSKSVCETPFELWSGRKGNLHHFRIWGCPTHVLVSNPKSWNPVQIV-

Query:  ----------------PIIRRVLVSSNATFLEEDHVRDHLPRSKIVLNEMD----STSARVADVASTSTSVVDPSTSSQI-RSQELGMPRRSGRAVRQLD
                        P   +V VS+NATFLEEDH+R+H PRSKIVLNE+       S RV +  S  T VV   +S++  + Q L  PRRSGR      
Subjt:  ----------------PIIRRVLVSSNATFLEEDHVRDHLPRSKIVLNEMD----STSARVADVASTSTSVVDPSTSSQI-RSQELGMPRRSGRAVRQLD

Query:  RYMDLAETSVVAPDDNCEDPLTYDQAMVDVDKDDWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDY
        RYM L ET  V  D + EDPLT+ +AM DVDKD+WIKAM+ E+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR  D KVQTF ARLVAKG+TQVEGVDY
Subjt:  RYMDLAETSVVAPDDNCEDPLTYDQAMVDVDKDDWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDY

Query:  EETFSPVVMVKSIRILLAIPAYYVYEVWQMDAKIAFLNGKLDETIYIDQPKGFIAQGQEQKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPC
        EETFSPV M+KSIRILL+I AY+ YE+WQMD K AFLNG L+ETIY+ QP+GFI  GQEQK+C+L+RSIYGLKQ SRSWNIRFD AIKSYGFDQ VDEPC
Subjt:  EETFSPVVMVKSIRILLAIPAYYVYEVWQMDAKIAFLNGKLDETIYIDQPKGFIAQGQEQKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPC

Query:  VYKKIVDKTVAFLVLYVDDILLIGNEVEFLIDVKKLLASQFQMKDLGEA------------------------------RYKMQNSKKDLLPFRHGVHLS
        VYK+I++K+VAFLVLYVDDILLIGN++  L D+K+ LA+QFQMKDLGEA                              +Y MQNSK+ LLPFRHGV LS
Subjt:  VYKKIVDKTVAFLVLYVDDILLIGNEVEFLIDVKKLLASQFQMKDLGEA------------------------------RYKMQNSKKDLLPFRHGVHLS

Query:  KDQCPKTPQDVEDMRRISYASAVGSLMYPMLCTRPDIHLCGKESSSRYRSNPRLAHRTEIH---------------FGSPSSRM--------------RE
        K+QCPKTPQDVE+MR I YASAVGSLMY MLCTRPDI        SRY+SNP LAH T +                +GS    +              R+
Subjt:  KDQCPKTPQDVEDMRRISYASAVGSLMYPMLCTRPDIHLCGKESSSRYRSNPRLAHRTEIH---------------FGSPSSRM--------------RE

Query:  SEYG------------RSIKRGCIVDSTMEAEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIREIVH
        S  G            RSIK+GCI DSTMEAEYVA CEAAKEAVWLR F+ DLEVVPNM+ PITL+CDNSGAVANSREPRSHK+GKHIE KYHLIREIVH
Subjt:  SEYG------------RSIKRGCIVDSTMEAEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIREIVH

Query:  RGDVTVT
        RGDV VT
Subjt:  RGDVTVT

A0A5D3CPJ6 Gag/pol protein0.0e+0056.47Show/hide
Query:  MSSAIITLLASDKLVGDNYQSWKNNINTILVIDDIKFVFVEECPRCRPDRLLRSFCDAYDRWIRANEKAKVYIIASMSDILAKKHELMVTTKEIMESLQE
        M+SA + +LA+DKL G+NY SWKN INT+L+IDD++FV VEECP+       R+  + Y+RW +ANEKA+ YI+AS+S++LAKKHE M+T +EIM+SLQE
Subjt:  MSSAIITLLASDKLVGDNYQSWKNNINTILVIDDIKFVFVEECPRCRPDRLLRSFCDAYDRWIRANEKAKVYIIASMSDILAKKHELMVTTKEIMESLQE

Query:  MFGQQSFQVRHDSLKYVFNARMKEGRSVREHVLDMMTHFNLAEMNEASIDESSQVSFILETLPKSFLDLRAIFVMNKINYTLTTLLNELQNFQSLNRVKA
        MFGQ S+Q++HD+LKY++NARM EG SVREHVL+MM HFN+AEMN A IDE+SQVSFILE+LP+SFL  R+  VMNKI YTLTTLLNELQ F+SL ++K 
Subjt:  MFGQQSFQVRHDSLKYVFNARMKEGRSVREHVLDMMTHFNLAEMNEASIDESSQVSFILETLPKSFLDLRAIFVMNKINYTLTTLLNELQNFQSLNRVKA

Query:  SESEANVA--YRSYHRGSTSGTKHVAPS----RPKGKKRMKRDKTNRGAAHKARRSRT----------------------LQRKES---------VSTAM
         + EANVA   R +HRGSTSGTK +  S    + K KK  + +K N  AA   ++++                        ++K++         + T +
Subjt:  SESEANVA--YRSYHRGSTSGTKHVAPS----RPKGKKRMKRDKTNRGAAHKARRSRT----------------------LQRKES---------VSTAM

Query:  EADTGRGTVPNSWPRGRI----KGIDSWQPQREGE-----------------------------------------------------------------
          +     + +S     +    +GI SW+    GE                                                                 
Subjt:  EADTGRGTVPNSWPRGRI----KGIDSWQPQREGE-----------------------------------------------------------------

Query:  ----------------------------------TAETRTKKAKVSPKENVHLWHLRLGHINLNRIEKLVRSGLLNELEENSLPVCESCLEGKMTKRPIS
                                          TA T+ K+ K+SPKEN HLWHLRLGHINLNRIE+LV++GLL+ELEENSLPVCESCLEGKMTKRP +
Subjt:  ----------------------------------TAETRTKKAKVSPKENVHLWHLRLGHINLNRIEKLVRSGLLNELEENSLPVCESCLEGKMTKRPIS

Query:  GKGYRAKEPLELVHSDFCSPMNVKARGGYEYFVSFIDDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITS
        GKG+RAKEPLELVHSD C PMNVKARGG+EYF++F DDYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E GI S
Subjt:  GKGYRAKEPLELVHSDFCSPMNVKARGGYEYFVSFIDDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITS

Query:  QLSAPGMPQQNGVSERRNRILLDMVRSMMSYARLPDSFWGYVVETAVYILNNVLSKSVCETPFELWSGRKGNLHHFRIWGCPTHVLVSNPKSWNPVQIV-
        QLSAPG PQQNGVSERRNR LLDMVRSMMSYA LP+SFWGY V+TAVYILN V SKSV ETP +LW+GRKG+L HFRIWGCP HVL +NPK   P   + 
Subjt:  QLSAPGMPQQNGVSERRNRILLDMVRSMMSYARLPDSFWGYVVETAVYILNNVLSKSVCETPFELWSGRKGNLHHFRIWGCPTHVLVSNPKSWNPVQIV-

Query:  ----------------PIIRRVLVSSNATFLEEDHVRDHLPRSKIVLNEMD----STSARVADVASTSTSVVDPSTSSQI-RSQELGMPRRSGRAVRQLD
                        P   +V VS+NATFLEEDH+R+H PRSKIVLNE+       S RV +  S  T VV   +S++  + Q L  PRRSGR      
Subjt:  ----------------PIIRRVLVSSNATFLEEDHVRDHLPRSKIVLNEMD----STSARVADVASTSTSVVDPSTSSQI-RSQELGMPRRSGRAVRQLD

Query:  RYMDLAETSVVAPDDNCEDPLTYDQAMVDVDKDDWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDY
        RYM L ET  V  D + EDPLT+ +AM DVDKD+WIKAM+ E+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR  D KVQTF ARLVAKG+TQVEGVDY
Subjt:  RYMDLAETSVVAPDDNCEDPLTYDQAMVDVDKDDWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDY

Query:  EETFSPVVMVKSIRILLAIPAYYVYEVWQMDAKIAFLNGKLDETIYIDQPKGFIAQGQEQKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPC
        EETFSPV M+KSIRILL+I AY+ YE+WQMD K AFLNG L+ETIY+ QP+GFI  GQEQK+C+L+RSIYGLKQ SRSWNIRFD AIKSYGFDQ VDEPC
Subjt:  EETFSPVVMVKSIRILLAIPAYYVYEVWQMDAKIAFLNGKLDETIYIDQPKGFIAQGQEQKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPC

Query:  VYKKIVDKTVAFLVLYVDDILLIGNEVEFLIDVKKLLASQFQMKDLGEA------------------------------RYKMQNSKKDLLPFRHGVHLS
        VYK+I++K+VAFLVLYVDDILLIGN++  L D+K+ LA+QFQMKDLGEA                              +Y MQNSK+ LLPFRHGV LS
Subjt:  VYKKIVDKTVAFLVLYVDDILLIGNEVEFLIDVKKLLASQFQMKDLGEA------------------------------RYKMQNSKKDLLPFRHGVHLS

Query:  KDQCPKTPQDVEDMRRISYASAVGSLMYPMLCTRPDIHLCGKESSSRYRSNPRLAHRTEIH---------------FGSPSSRM--------------RE
        K+QCPKTPQDVE+MR I YASAVGSLMY MLCTRPDI        SRY+SNP LAH T +                +GS    +              R+
Subjt:  KDQCPKTPQDVEDMRRISYASAVGSLMYPMLCTRPDIHLCGKESSSRYRSNPRLAHRTEIH---------------FGSPSSRM--------------RE

Query:  SEYG------------RSIKRGCIVDSTMEAEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIREIVH
        S  G            RSIK+GCI DSTMEAEYVA CEAAKEAVWLR F+ DLEVVPNM+ PITL+CDNSGAVANSREPRSHK+GKHIE KYHLIREIVH
Subjt:  SEYG------------RSIKRGCIVDSTMEAEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIREIVH

Query:  RGDVTVT
        RGDV VT
Subjt:  RGDVTVT

A0A5D3CSZ6 Gag/pol protein0.0e+0056.47Show/hide
Query:  MSSAIITLLASDKLVGDNYQSWKNNINTILVIDDIKFVFVEECPRCRPDRLLRSFCDAYDRWIRANEKAKVYIIASMSDILAKKHELMVTTKEIMESLQE
        M+SA + +LA+DKL G+NY SWKN INT+L+IDD++FV VEECP+       R+  + Y+RW +ANEKA+ YI+AS+S++LAKKHE M+T +EIM+SLQE
Subjt:  MSSAIITLLASDKLVGDNYQSWKNNINTILVIDDIKFVFVEECPRCRPDRLLRSFCDAYDRWIRANEKAKVYIIASMSDILAKKHELMVTTKEIMESLQE

Query:  MFGQQSFQVRHDSLKYVFNARMKEGRSVREHVLDMMTHFNLAEMNEASIDESSQVSFILETLPKSFLDLRAIFVMNKINYTLTTLLNELQNFQSLNRVKA
        MFGQ S+Q++HD+LKY++NARM EG SVREHVL+MM HFN+AEMN A IDE+SQVSFILE+LP+SFL  R+  VMNKI YTLTTLLNELQ F+SL ++K 
Subjt:  MFGQQSFQVRHDSLKYVFNARMKEGRSVREHVLDMMTHFNLAEMNEASIDESSQVSFILETLPKSFLDLRAIFVMNKINYTLTTLLNELQNFQSLNRVKA

Query:  SESEANVA--YRSYHRGSTSGTKHVAPS----RPKGKKRMKRDKTNRGAAHKARRSRT----------------------LQRKES---------VSTAM
         + EANVA   R +HRGSTSGTK +  S    + K KK  + +K N  AA   ++++                        ++K++         + T +
Subjt:  SESEANVA--YRSYHRGSTSGTKHVAPS----RPKGKKRMKRDKTNRGAAHKARRSRT----------------------LQRKES---------VSTAM

Query:  EADTGRGTVPNSWPRGRI----KGIDSWQPQREGE-----------------------------------------------------------------
          +     + +S     +    +GI SW+    GE                                                                 
Subjt:  EADTGRGTVPNSWPRGRI----KGIDSWQPQREGE-----------------------------------------------------------------

Query:  ----------------------------------TAETRTKKAKVSPKENVHLWHLRLGHINLNRIEKLVRSGLLNELEENSLPVCESCLEGKMTKRPIS
                                          TA T+ K+ K+SPKEN HLWHLRLGHINLNRIE+LV++GLL+ELEENSLPVCESCLEGKMTKRP +
Subjt:  ----------------------------------TAETRTKKAKVSPKENVHLWHLRLGHINLNRIEKLVRSGLLNELEENSLPVCESCLEGKMTKRPIS

Query:  GKGYRAKEPLELVHSDFCSPMNVKARGGYEYFVSFIDDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITS
        GKG+RAKEPLELVHSD C PMNVKARGG+EYF++F DDYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E GI S
Subjt:  GKGYRAKEPLELVHSDFCSPMNVKARGGYEYFVSFIDDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITS

Query:  QLSAPGMPQQNGVSERRNRILLDMVRSMMSYARLPDSFWGYVVETAVYILNNVLSKSVCETPFELWSGRKGNLHHFRIWGCPTHVLVSNPKSWNPVQIV-
        QLSAPG PQQNGVSERRNR LLDMVRSMMSYA LP+SFWGY V+TAVYILN V SKSV ETP +LW+GRKG+L HFRIWGCP HVL +NPK   P   + 
Subjt:  QLSAPGMPQQNGVSERRNRILLDMVRSMMSYARLPDSFWGYVVETAVYILNNVLSKSVCETPFELWSGRKGNLHHFRIWGCPTHVLVSNPKSWNPVQIV-

Query:  ----------------PIIRRVLVSSNATFLEEDHVRDHLPRSKIVLNEMD----STSARVADVASTSTSVVDPSTSSQI-RSQELGMPRRSGRAVRQLD
                        P   +V VS+NATFLEEDH+R+H PRSKIVLNE+       S RV +  S  T VV   +S++  + Q L  PRRSGR      
Subjt:  ----------------PIIRRVLVSSNATFLEEDHVRDHLPRSKIVLNEMD----STSARVADVASTSTSVVDPSTSSQI-RSQELGMPRRSGRAVRQLD

Query:  RYMDLAETSVVAPDDNCEDPLTYDQAMVDVDKDDWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDY
        RYM L ET  V  D + EDPLT+ +AM DVDKD+WIKAM+ E+ESMYFNSVW+LVDQPDGVKPIGCKWIYKRKR  D KVQTF ARLVAKG+TQVEGVDY
Subjt:  RYMDLAETSVVAPDDNCEDPLTYDQAMVDVDKDDWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDY

Query:  EETFSPVVMVKSIRILLAIPAYYVYEVWQMDAKIAFLNGKLDETIYIDQPKGFIAQGQEQKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPC
        EETFSPV M+KSIRILL+I AY+ YE+WQMD K AFLNG L+ETIY+ QP+GFI  GQEQK+C+L+RSIYGLKQ SRSWNIRFD AIKSYGFDQ VDEPC
Subjt:  EETFSPVVMVKSIRILLAIPAYYVYEVWQMDAKIAFLNGKLDETIYIDQPKGFIAQGQEQKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPC

Query:  VYKKIVDKTVAFLVLYVDDILLIGNEVEFLIDVKKLLASQFQMKDLGEA------------------------------RYKMQNSKKDLLPFRHGVHLS
        VYK+I++K+VAFLVLYVDDILLIGN++  L D+K+ LA+QFQMKDLGEA                              +Y MQNSK+ LLPFRHGV LS
Subjt:  VYKKIVDKTVAFLVLYVDDILLIGNEVEFLIDVKKLLASQFQMKDLGEA------------------------------RYKMQNSKKDLLPFRHGVHLS

Query:  KDQCPKTPQDVEDMRRISYASAVGSLMYPMLCTRPDIHLCGKESSSRYRSNPRLAHRTEIH---------------FGSPSSRM--------------RE
        K+QCPKTPQDVE+MR I YASAVGSLMY MLCTRPDI        SRY+SNP LAH T +                +GS    +              R+
Subjt:  KDQCPKTPQDVEDMRRISYASAVGSLMYPMLCTRPDIHLCGKESSSRYRSNPRLAHRTEIH---------------FGSPSSRM--------------RE

Query:  SEYG------------RSIKRGCIVDSTMEAEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIREIVH
        S  G            RSIK+GCI DSTMEAEYVA CEAAKEAVWLR F+ DLEVVPNM+ PITL+CDNSGAVANSREPRSHK+GKHIE KYHLIREIVH
Subjt:  SEYG------------RSIKRGCIVDSTMEAEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIREIVH

Query:  RGDVTVT
        RGDV VT
Subjt:  RGDVTVT

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.9e-8626.83Show/hide
Query:  KENVHLWHLRLGHINLNRIEKLVR------SGLLNELEENSLPVCESCLEGKMTKRPISGKGYRA--KEPLELVHSDFCSPMNVKARGGYEYFVSFIDDY
        K N  LWH R GHI+  ++ ++ R        LLN L E S  +CE CL GK  + P      +   K PL +VHSD C P+         YFV F+D +
Subjt:  KENVHLWHLRLGHINLNRIEKLVR------SGLLNELEENSLPVCESCLEGKMTKRPISGKGYRA--KEPLELVHSDFCSPMNVKARGGYEYFVSFIDDY

Query:  SRYGYIYLMHRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAPGMPQQNGVSERRNRILLDMVRSMMSYARLPDSFW
        + Y   YL+  KS+    F+++  + E      +  L  D G EY+  E + + ++ GI+  L+ P  PQ NGVSER  R + +  R+M+S A+L  SFW
Subjt:  SRYGYIYLMHRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAPGMPQQNGVSERRNRILLDMVRSMMSYARLPDSFW

Query:  GYVVETAVYILNNVLSKSVCE---TPFELWSGRKGNLHHFRIWGCPTHVLVSNP---------------------KSWNPVQIVPIIRR-------VLVS
        G  V TA Y++N + S+++ +   TP+E+W  +K  L H R++G   +V + N                      K W+ V    I+ R        +V+
Subjt:  GYVVETAVYILNNVLSKSVCE---TPFELWSGRKGNLHHFRIWGCPTHVLVSNP---------------------KSWNPVQIVPIIRR-------VLVS

Query:  SNATFLEEDHVRD-------HLPRS--KIVLNEM--------------DSTSARVADVASTSTSVVD---PSTSSQI-----------------------
        S A   E   ++D       + P    KI+  E               DS  +   +  + S  ++    P+ S +                        
Subjt:  SNATFLEEDHVRD-------HLPRS--KIVLNEM--------------DSTSARVADVASTSTSVVD---PSTSSQI-----------------------

Query:  -RSQELGMPRRSG--RAVRQLDRYMDLAETSVVAP--------------------------DDNC-------------EDPLTYDQAMVDVDKDDWIKAM
         R   L   + SG     R+ +    L E  +  P                          +DN              + P ++D+     DK  W +A+
Subjt:  -RSQELGMPRRSG--RAVRQLDRYMDLAETSVVAP--------------------------DDNC-------------EDPLTYDQAMVDVDKDDWIKAM

Query:  DQEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDYEETFSPVVMVKSIRILLAIPAYYVYEVWQMDAKIAFLNG
        + E+ +   N+ W +  +P+    +  +W++  K +       + ARLVA+GFTQ   +DYEETF+PV  + S R +L++   Y  +V QMD K AFLNG
Subjt:  DQEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDYEETFSPVVMVKSIRILLAIPAYYVYEVWQMDAKIAFLNG

Query:  KLDETIYIDQPKGFIAQGQEQKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPCVYKKIVDK----TVAFLVLYVDDILLIGNEVEFLIDVKK
         L E IY+  P+G         VC+L+++IYGLKQ +R W   F++A+K   F  +  + C+Y  I+DK       +++LYVDD+++   ++  + + K+
Subjt:  KLDETIYIDQPKGFIAQGQEQKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPCVYKKIVDK----TVAFLVLYVDDILLIGNEVEFLIDVKK

Query:  LLASQFQMKDLGE----------------------------ARYKMQNSKKDLLPFRHGVH---LSKDQCPKTPQDVEDMRRISYASAVGSLMYPMLCTR
         L  +F+M DL E                            +++ M+N      P    ++   L+ D+   TP            S +G LMY MLCTR
Subjt:  LLASQFQMKDLGE----------------------------ARYKMQNSKKDLLPFRHGVH---LSKDQCPKTPQDVEDMRRISYASAVGSLMYPMLCTR

Query:  PDI--------HLCGKESSSRYRSNPR------------------LAHRTEIHFGSPSSRMRESEYGRSIKRG-------------------CIVDSTME
        PD+            K +S  +++  R                  LA   +I  G   S    SE  R    G                    +  S+ E
Subjt:  PDI--------HLCGKESSSRYRSNPR------------------LAHRTEIHFGSPSSRMRESEYGRSIKRG-------------------CIVDSTME

Query:  AEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIREIV
        AEY+A+ EA +EA+WL+  +T + +   +  PI ++ DN G ++ +  P  HK+ KHI+ KYH  RE V
Subjt:  AEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIREIV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.4e-12932.31Show/hide
Query:  NVHLWHLRLGHINLNRIEKLVRSGLLNELEENSLPVCESCLEGKMTKRPISGKGYRAKEPLELVHSDFCSPMNVKARGGYEYFVSFIDDYSRYGYIYLMH
        +V LWH R+GH++   ++ L +  L++  +  ++  C+ CL GK  +        R    L+LV+SD C PM +++ GG +YFV+FIDD SR  ++Y++ 
Subjt:  NVHLWHLRLGHINLNRIEKLVRSGLLNELEENSLPVCESCLEGKMTKRPISGKGYRAKEPLELVHSDFCSPMNVKARGGYEYFVSFIDDYSRYGYIYLMH

Query:  RKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAPGMPQQNGVSERRNRILLDMVRSMMSYARLPDSFWGYVVETAVYI
         K +  + F+++   VE   G+ LK LRSD GGEY   EF++Y   HGI  + + PG PQ NGV+ER NR +++ VRSM+  A+LP SFWG  V+TA Y+
Subjt:  RKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAPGMPQQNGVSERRNRILLDMVRSMMSYARLPDSFWGYVVETAVYI

Query:  LNNVLSKSVC-ETPFELWSGRKGNLHHFRIWGCPTHVLVSNP------------------------KSWNPVQIVPIIRRVLVSSNATFLEEDHVRDHLP
        +N   S  +  E P  +W+ ++ +  H +++GC     V                           + W+PV+     ++V+ S +  F  E  VR    
Subjt:  LNNVLSKSVC-ETPFELWSGRKGNLHHFRIWGCPTHVLVSNP------------------------KSWNPVQIVPIIRRVLVSSNATFLEEDHVRDHLP

Query:  RSKIVLN-----------------EMDSTSARVADVASTSTSVVDPSTSSQIRSQELGMP----------RRSGRAVRQLDRYMDLAETSVVAPDDNCED
         S+ V N                   +ST+  V++       V++         +E+  P          RRS R   +  RY     T  V   D+  +
Subjt:  RSKIVLN-----------------EMDSTSARVADVASTSTSVVDPSTSSQIRSQELGMP----------RRSGRAVRQLDRYMDLAETSVVAPDDNCED

Query:  PLTYDQAMVDVDKDDWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDYEETFSPVVMVKSIRILLAI
        P +  + +   +K+  +KAM +EMES+  N  ++LV+ P G +P+ CKW++K K+D D K+  + ARLV KGF Q +G+D++E FSPVV + SIR +L++
Subjt:  PLTYDQAMVDVDKDDWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDYEETFSPVVMVKSIRILLAI

Query:  PAYYVYEVWQMDAKIAFLNGKLDETIYIDQPKGFIAQGQEQKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPCVY-KKIVDKTVAFLVLYVD
         A    EV Q+D K AFL+G L+E IY++QP+GF   G++  VC+L++S+YGLKQ  R W ++FD  +KS  + +   +PCVY K+  +     L+LYVD
Subjt:  PAYYVYEVWQMDAKIAFLNGKLDETIYIDQPKGFIAQGQEQKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPCVY-KKIVDKTVAFLVLYVD

Query:  DILLIGNEVEFLIDVKKLLASQFQMKDLGEA------------------------------RYKMQNSKKDLLPFRHGVHLSKDQCPKTPQDVEDMRRIS
        D+L++G +   +  +K  L+  F MKDLG A                              R+ M+N+K    P    + LSK  CP T ++  +M ++ 
Subjt:  DILLIGNEVEFLIDVKKLLASQFQMKDLGEA------------------------------RYKMQNSKKDLLPFRHGVHLSKDQCPKTPQDVEDMRRIS

Query:  YASAVGSLMYPMLCTRPDI-HLCGKESSSRYRSNPRLAHRTEIHF-------------------------------GSPSSRMRESEYGRSIKRG-----
        Y+SAVGSLMY M+CTRPDI H  G    SR+  NP   H   + +                               G   +R   + Y  +   G     
Subjt:  YASAVGSLMYPMLCTRPDI-HLCGKESSSRYRSNPRLAHRTEIHF-------------------------------GSPSSRMRESEYGRSIKRG-----

Query:  -----CIVDSTMEAEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIREIV
             C+  ST EAEY+A  E  KE +WL++F+ +L +         ++CD+  A+  S+    H + KHI+ +YH IRE+V
Subjt:  -----CIVDSTMEAEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIREIV

Q07791 Transposon Ty2-DR3 Gag-Pol polyprotein6.4e-2127.92Show/hide
Query:  PQREGETAETRTKKAKVSPKENVHLWHLRLGHINLNRIEKLVRSGLLNELEENSLP-------VCESCLEGKMTK-RPISG---KGYRAKEPLELVHSDF
        P    +       K+K   K    L H  LGH N   I+K ++   +  L+E+ +         C  CL GK TK R I G   K   + EP + +H+D 
Subjt:  PQREGETAETRTKKAKVSPKENVHLWHLRLGHINLNRIEKLVRSGLLNELEENSLP-------VCESCLEGKMTK-RPISG---KGYRAKEPLELVHSDF

Query:  CSPMNVKARGGYEYFVSFIDDYSRYGYIYLMH--RKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAPGMPQQNGVSE
          P++   +    YF+SF D+ +R+ ++Y +H  R+   L  F      ++N     +  ++ DRG EY +     +    GIT+  +     + +GV+E
Subjt:  CSPMNVKARGGYEYFVSFIDDYSRYGYIYLMH--RKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAPGMPQQNGVSE

Query:  RRNRILLDMVRSMMSYARLPDSFWGYVVETAVYILNNVLS
        R NR LL+  R+++  + LP+  W   VE +  I N+++S
Subjt:  RRNRILLDMVRSMMSYARLPDSFWGYVVETAVYILNNVLS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.1e-7224.69Show/hide
Query:  AKVSPKENVHLWHLRLGHINLNRIEKLVRSGLLNELE-ENSLPVCESCLEGKMTKRPISGKGYRAKEPLELVHSDFCSPMNVKARGGYEYFVSFIDDYSR
        A  S K     WH RLGH   + +  ++ +  L+ L   +    C  CL  K  K P S     +  PLE ++SD  S   + +   Y Y+V F+D ++R
Subjt:  AKVSPKENVHLWHLRLGHINLNRIEKLVRSGLLNELE-ENSLPVCESCLEGKMTKRPISGKGYRAKEPLELVHSDFCSPMNVKARGGYEYFVSFIDDYSR

Query:  YGYIYLMHRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAPGMPQQNGVSERRNRILLDMVRSMMSYARLPDSFWGY
        Y ++Y + +KS+  E F  +K  +EN     + T  SD GGE++     +Y  +HGI+   S P  P+ NG+SER++R +++   +++S+A +P ++W Y
Subjt:  YGYIYLMHRKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAPGMPQQNGVSERRNRILLDMVRSMMSYARLPDSFWGY

Query:  VVETAVYILNNVLSKSV-CETPFELWSGRKGNLHHFRIWGCP----------------------------------THVLVSN--------------PKS
            AVY++N + +  +  E+PF+   G   N    R++GC                                    H+  S               P S
Subjt:  VVETAVYILNNVLSKSV-CETPFELWSGRKGNLHHFRIWGCP----------------------------------THVLVSN--------------PKS

Query:  -------------------WNPVQIVPIIRRVLVSSNATFLEEDHVRDHLP-----RSKIVLNEMDS--------------------------TSARVAD
                           W+P   +P    VL + + +           P      S++  + +DS                          T  +   
Subjt:  -------------------WNPVQIVPIIRRVLVSSNATFLEEDHVRDHLP-----RSKIVLNEMDS--------------------------TSARVAD

Query:  VASTSTSVVDPSTSSQIR-SQELGMPRRSGRAVRQLDRYMDLAETSVVAP--------------DDNCEDPLT------------------YDQAM----
         +S +TS  +P+  S  + +Q L  P +S  +          + TS   P              ++N + PL                   Y  A+    
Subjt:  VASTSTSVVDPSTSSQIR-SQELGMPRRSGRAVRQLDRYMDLAETSVVAP--------------DDNCEDPLT------------------YDQAM----

Query:  -------VDVDKDD-WIKAMDQEMESMYFNSVWELVDQPDG-VKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDYEETFSPVVMVKSIRILLA
               +   KD+ W  AM  E+ +   N  W+LV  P   V  +GC+WI+ +K + D  +  + ARLVAKG+ Q  G+DY ETFSPV+   SIRI+L 
Subjt:  -------VDVDKDD-WIKAMDQEMESMYFNSVWELVDQPDG-VKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDYEETFSPVVMVKSIRILLA

Query:  IPAYYVYEVWQMDAKIAFLNGKLDETIYIDQPKGFIAQGQEQKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVD
        +     + + Q+D   AFL G L + +Y+ QP GFI + +   VC+L +++YGLKQ  R+W +     + + GF  +V +  ++     K++ ++++YVD
Subjt:  IPAYYVYEVWQMDAKIAFLNGKLDETIYIDQPKGFIAQGQEQKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVD

Query:  DILLIGNEVEFLIDVKKLLASQFQMKDLGEARYKMQNSKK---------------DLLPFRHGVHLSKDQCPKTPQDVEDMRR-------ISYASAVGSL
        DIL+ GN+   L +    L+ +F +KD  E  Y +    K               DLL   + +       P  P     +           Y   VGSL
Subjt:  DILLIGNEVEFLIDVKKLLASQFQMKDLGEARYKMQNSKK---------------DLLPFRHGVHLSKDQCPKTPQDVEDMRR-------ISYASAVGSL

Query:  MYPMLCTRPDI-----------------HLCGKESSSRYRSNP-------RLAHRTEIHFGSPSSRMRESE-----------------YGRSIKRGCIVD
         Y +  TRPDI                 HL   +   RY +         +  +   +H  S +    + +                    S K+  +V 
Subjt:  MYPMLCTRPDI-----------------HLCGKESSSRYRSNP-------RLAHRTEIHFGSPSSRMRESE-----------------YGRSIKRGCIVD

Query:  STMEAEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIREIVHRGDVTV
        S+ EAEY +V   + E  W+   +T+L +   +  P  ++CDN GA      P  H + KHI   YH IR  V  G + V
Subjt:  STMEAEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIREIVHRGDVTV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.9e-7223.88Show/hide
Query:  TNRGAAHKARRSRTLQ----------RKESVSTAMEADTGRGTVPNSWPRGRIKGIDSWQPQREGET---------AETRTKKAKVSP--KENVHLWHLR
        T+ G+A     SR+L            K  +S     +T R +V       ++K +++  P  +G+T         A ++      SP  K     WH R
Subjt:  TNRGAAHKARRSRTLQ----------RKESVSTAMEADTGRGTVPNSWPRGRIKGIDSWQPQREGET---------AETRTKKAKVSP--KENVHLWHLR

Query:  LGHINLNRIEKLVRSGLLNELE-ENSLPVCESCLEGKMTKRPISGKGYRAKEPLELVHSDFCSPMNVKARGGYEYFVSFIDDYSRYGYIYLMHRKSETLE
        LGH +L  +  ++ +  L  L   + L  C  C   K  K P S     + +PLE ++SD  S   + +   Y Y+V F+D ++RY ++Y + +KS+  +
Subjt:  LGHINLNRIEKLVRSGLLNELE-ENSLPVCESCLEGKMTKRPISGKGYRAKEPLELVHSDFCSPMNVKARGGYEYFVSFIDDYSRYGYIYLMHRKSETLE

Query:  KFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAPGMPQQNGVSERRNRILLDMVRSMMSYARLPDSFWGYVVETAVYILNNVLSK
         F  +K+ VEN     + TL SD GGE++    +DY+ +HGI+   S P  P+ NG+SER++R +++M  +++S+A +P ++W Y    AVY++N + + 
Subjt:  KFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAPGMPQQNGVSERRNRILLDMVRSMMSYARLPDSFWGYVVETAVYILNNVLSK

Query:  SV-CETPFELWSGRKGNLHHFRIWGC--------------------------------------------------------------------------
         +  ++PF+   G+  N    +++GC                                                                          
Subjt:  SV-CETPFELWSGRKGNLHHFRIWGC--------------------------------------------------------------------------

Query:  -----PTHVLVSN---------------------PKSWNPVQIVPIIRRVLVS--------------------------------SNATFLEEDHVRDHL
             P+H  +                       P S +P+    +    L S                                SN+  L   +     
Subjt:  -----PTHVLVSN---------------------PKSWNPVQIVPIIRRVLVS--------------------------------SNATFLEEDHVRDHL

Query:  PRSKIVLNEMDSTSARVADVASTSTSVVDPSTSS-------------------QIRSQ----ELGMPRRSGRAVRQLDRYMDLAETSVVAPDDNCEDPLT
        P S    + +  +      + + STS+ +P++ S                   Q+ +Q       M  R+   +R+ ++    A TS+ A      +P T
Subjt:  PRSKIVLNEMDSTSARVADVASTSTSVVDPSTSS-------------------QIRSQ----ELGMPRRSGRAVRQLDRYMDLAETSVVAPDDNCEDPLT

Query:  YDQAMVDVDKDDWIKAMDQEMESMYFNSVWELV-DQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDYEETFSPVVMVKSIRILLAIPA
          QAM D   D W +AM  E+ +   N  W+LV   P  V  +GC+WI+ +K + D  +  + ARLVAKG+ Q  G+DY ETFSPV+   SIRI+L +  
Subjt:  YDQAMVDVDKDDWIKAMDQEMESMYFNSVWELV-DQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDYEETFSPVVMVKSIRILLAIPA

Query:  YYVYEVWQMDAKIAFLNGKLDETIYIDQPKGFIAQGQEQKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDIL
           + + Q+D   AFL G L + +Y+ QP GF+ + +   VCRL ++IYGLKQ  R+W +     + + GF  ++ +  ++     +++ ++++YVDDIL
Subjt:  YYVYEVWQMDAKIAFLNGKLDETIYIDQPKGFIAQGQEQKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDIL

Query:  LIGNEVEFLIDVKKLLASQFQMKD--------------------LGEARY--------KMQNSKKDLLPFRHGVHLSKDQCPKTPQDVEDMRRISYASAV
        + GN+   L      L+ +F +K+                    L + RY         M  +K    P      L+     K P   E      Y   V
Subjt:  LIGNEVEFLIDVKKLLASQFQMKD--------------------LGEARY--------KMQNSKKDLLPFRHGVHLSKDQCPKTPQDVEDMRRISYASAV

Query:  GSLMYPMLCTRPDIHLCGKESSSRYRSNPRLAHRTEIHFGSPSSRMR----ESEYGRSIKRG--------------------------------------
        GSL Y +  TRPD+        S+Y   P     T+ H+ +    +R      ++G  +K+G                                      
Subjt:  GSLMYPMLCTRPDIHLCGKESSSRYRSNPRLAHRTEIHFGSPSSRMR----ESEYGRSIKRG--------------------------------------

Query:  -----CIVDSTMEAEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIREIVHRGDVTV
              +V S+ EAEY +V   + E  W+   +T+L +   ++ P  ++CDN GA      P  H + KHI   YH IR  V  G + V
Subjt:  -----CIVDSTMEAEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIREIVHRGDVTV

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.2e-5632.36Show/hide
Query:  EDPLTYDQAMVDVDKDDWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDYEETFSPVVMVKSIRILL
        ++P TY++A   +    W  AMD E+ +M     WE+   P   KPIGCKW+YK K + D  ++ + ARLVAKG+TQ EG+D+ ETFSPV  + S++++L
Subjt:  EDPLTYDQAMVDVDKDDWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDYEETFSPVVMVKSIRILL

Query:  AIPAYYVYEVWQMDAKIAFLNGKLDETIYIDQPKGFIA-QGQE---QKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPCVYKKIVDKTVAFL
        AI A Y + + Q+D   AFLNG LDE IY+  P G+ A QG       VC L +SIYGLKQ SR W ++F   +  +GF Q+  +   + KI       +
Subjt:  AIPAYYVYEVWQMDAKIAFLNGKLDETIYIDQPKGFIA-QGQE---QKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPCVYKKIVDKTVAFL

Query:  VLYVDDILLIGNEVEFLIDVKKLLASQFQMKDLGEARYKM---------------QNSKKDLLPFRHGVHLSKDQCPKTPQ---------DVEDMRRISY
        ++YVDDI++  N    + ++K  L S F+++DLG  +Y +               +    DLL     +       P  P          D  D +  +Y
Subjt:  VLYVDDILLIGNEVEFLIDVKKLLASQFQMKDLGEARYKM---------------QNSKKDLLPFRHGVHLSKDQCPKTPQ---------DVEDMRRISY

Query:  ASAVGSLMYPMLCTRPDIHLCGKESSSRYRSNPRLAHRTEIH--------------FGSPSSRM----------------RESEYG------------RS
           +G LMY +  TR DI        S++   PRLAH+  +               F S  + M                R S  G            +S
Subjt:  ASAVGSLMYPMLCTRPDIHLCGKESSSRYRSNPRLAHRTEIH--------------FGSPSSRM----------------RESEYG------------RS

Query:  IKRGCIVDSTMEAEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIRE
         K+  +  S+ EAEY A+  A  E +WL +F  +L++   ++ P  LFCDN+ A+  +     H++ KHIE   H +RE
Subjt:  IKRGCIVDSTMEAEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIRE

ATMG00300.1 Gag-Pol-related retrotransposon family protein2.2e-0835.37Show/hide
Query:  TAETRTKKAKVSPKENVHLWHLRLGHINLNRIEKLVRSGLLNELEENSLPVCESCLEGKMTKRPISGKGYRAKEPLELVHSD
        + ET       + K+   LWH RL H++   +E LV+ G L+  + +SL  CE C+ GK  +   S   +  K PL+ VHSD
Subjt:  TAETRTKKAKVSPKENVHLWHLRLGHINLNRIEKLVRSGLLNELEENSLPVCESCLEGKMTKRPISGKGYRAKEPLELVHSD

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.7e-1442.35Show/hide
Query:  WIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDYEETFSPVVMVKSIRILLAI
        W +AM +E++++  N  W LV  P     +GCKW++K K   D  +    ARLVAKGF Q EG+ + ET+SPVV   +IR +L +
Subjt:  WIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDYEETFSPVVMVKSIRILLAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTAGCGCAATAATAACTCTACTCGCCTCCGATAAATTAGTGGGAGATAATTACCAAAGTTGGAAAAACAACATAAACACAATTTTGGTAATCGACGACATTAAGTT
CGTCTTTGTCGAGGAGTGTCCTCGATGTCGGCCCGATCGGCTATTGCGAAGTTTTTGCGATGCATACGATCGATGGATCAGGGCCAATGAAAAGGCCAAGGTCTACATCA
TTGCTAGCATGTCTGATATTTTGGCAAAGAAGCATGAGCTGATGGTCACTACCAAGGAGATCATGGAGTCCTTGCAGGAAATGTTTGGACAACAATCCTTTCAGGTCCGG
CATGACTCGCTCAAATACGTCTTCAACGCACGGATGAAAGAGGGGAGGTCTGTCCGTGAACATGTTCTAGACATGATGACCCACTTTAATCTGGCTGAGATGAACGAGGC
TTCGATCGATGAGTCGAGCCAAGTCAGTTTTATTCTGGAGACTCTTCCGAAGAGTTTCCTTGATTTAAGAGCAATCTTTGTTATGAACAAAATTAACTACACTCTGACTA
CCCTCCTCAATGAGCTACAGAATTTTCAGTCATTGAACAGGGTCAAGGCATCGGAATCTGAGGCAAATGTTGCCTACAGGTCTTACCACAGGGGTTCGACCTCTGGGACG
AAACATGTTGCTCCTTCACGTCCGAAAGGGAAGAAGAGAATGAAGAGGGATAAAACTAACCGAGGTGCCGCCCACAAGGCAAGAAGGTCAAGGACGTTGCAGAGAAAAGA
AAGTGTTTCCACTGCAATGGAGGCGGACACTGGAAGAGGAACTGTCCCAAATTCCTGGCCGAGAGGAAGAATCAAGGGGATTGATTCCTGGCAGCCGCAACGAGAGGGTG
AGACTGCAGAAACACGAACTAAGAAAGCGAAAGTTTCTCCTAAAGAAAATGTCCATCTTTGGCATCTACGGTTAGGCCACATTAATCTCAATAGGATTGAGAAACTAGTG
AGGAGTGGACTTCTAAACGAGCTGGAAGAAAACTCTTTGCCGGTGTGTGAGTCATGCCTTGAGGGCAAGATGACCAAACGTCCTATTAGTGGAAAAGGATATAGAGCCAA
AGAGCCTCTTGAGTTAGTACATTCTGACTTTTGTAGTCCGATGAATGTTAAAGCTCGGGGCGGTTATGAGTACTTCGTGTCTTTCATAGACGATTACTCGAGGTATGGGT
ATATTTACCTAATGCACAGGAAGTCTGAAACTCTTGAAAAGTTCAAGGAGTACAAGACTGAGGTTGAGAACCTCTTAGGTAAATCGCTTAAAACACTTCGATCGGATCGA
GGTGGAGAGTACATGGACACTGAATTCCAGGACTATATGATAGAACACGGAATTACGTCCCAACTCTCAGCGCCTGGTATGCCACAGCAAAATGGTGTATCAGAGAGGAG
AAACAGAATCTTGTTGGACATGGTTCGATCGATGATGAGCTATGCTCGTCTCCCTGATTCTTTTTGGGGTTACGTAGTGGAGACTGCAGTCTATATTTTGAACAACGTTC
TGTCGAAGAGTGTTTGTGAAACACCTTTCGAGCTCTGGAGTGGACGTAAAGGCAATTTACATCACTTCAGGATTTGGGGATGCCCGACCCACGTGTTGGTGTCAAACCCA
AAAAGTTGGAACCCCGTTCAAATTGTGCCTATTATTCGTAGGGTGCTTGTGTCGTCTAACGCCACTTTCCTAGAGGAAGACCACGTCAGGGATCATTTACCAAGGAGTAA
AATTGTGTTAAATGAAATGGACAGTACATCAGCAAGAGTTGCTGATGTGGCTAGTACGTCAACAAGTGTTGTTGATCCTAGCACGTCTAGTCAAATCCGTTCCCAAGAGT
TGGGAATGCCTCGACGTAGTGGGAGGGCTGTGAGACAGCTTGATCGCTACATGGATTTAGCTGAAACCTCAGTTGTCGCTCCTGATGATAACTGTGAGGATCCATTGACC
TATGATCAGGCAATGGTTGATGTTGACAAAGACGATTGGATTAAAGCAATGGACCAGGAAATGGAGTCTATGTACTTCAATTCTGTATGGGAGCTTGTGGATCAACCGGA
TGGGGTAAAACCTATTGGTTGCAAATGGATCTACAAGCGTAAACGTGACGTAGATGAGAAGGTGCAAACCTTCAATGCCCGACTAGTGGCAAAGGGTTTTACCCAGGTTG
AAGGGGTTGACTATGAGGAGACCTTTTCACCTGTTGTCATGGTAAAGTCGATCAGGATCCTTCTGGCCATTCCCGCGTATTATGTCTACGAGGTATGGCAGATGGACGCC
AAGATAGCCTTTCTGAATGGCAAACTTGATGAGACCATCTACATAGACCAGCCCAAAGGGTTCATTGCCCAAGGCCAAGAGCAAAAGGTTTGCCGGCTTCATAGGTCCAT
TTATGGGCTGAAACAAGTTTCGAGGTCTTGGAATATAAGGTTTGATGAGGCGATCAAATCTTATGGCTTTGATCAAAATGTCGACGAGCCTTGTGTCTACAAGAAAATCG
TTGACAAAACTGTCGCATTTTTGGTGTTGTATGTGGATGATATTCTTCTCATTGGGAATGAGGTAGAATTTCTTATTGACGTTAAGAAGTTGCTAGCTTCGCAGTTCCAA
ATGAAAGATTTGGGAGAAGCTCGATATAAGATGCAGAACTCCAAGAAAGACTTGCTGCCTTTCAGGCATGGGGTTCACTTGTCTAAGGATCAATGTCCTAAGACTCCTCA
AGATGTTGAGGATATGAGACGGATTTCATATGCTTCGGCTGTAGGGAGCCTGATGTATCCCATGTTGTGTACTAGGCCCGACATCCATTTATGCGGTAAAGAATCGTCGA
GTAGATATCGGTCCAATCCGAGATTAGCTCACCGGACGGAAATCCACTTCGGGTCACCTTCATCCCGAATGCGGGAGTCAGAGTATGGGCGAAGTATCAAGCGAGGATGC
ATCGTTGATTCCACTATGGAAGCAGAGTATGTTGCGGTTTGTGAAGCTGCAAAGGAAGCTGTTTGGCTTAGGAAGTTCATGACGGATTTGGAAGTTGTTCCAAATATGAA
CTTGCCGATCACACTATTCTGTGATAACAGTGGTGCAGTAGCCAACTCACGTGAGCCTCGGAGCCATAAGAAAGGCAAACACATTGAGTGCAAGTATCATTTGATACGGG
AGATTGTGCACCGTGGAGACGTGACAGTCACGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTAGCGCAATAATAACTCTACTCGCCTCCGATAAATTAGTGGGAGATAATTACCAAAGTTGGAAAAACAACATAAACACAATTTTGGTAATCGACGACATTAAGTT
CGTCTTTGTCGAGGAGTGTCCTCGATGTCGGCCCGATCGGCTATTGCGAAGTTTTTGCGATGCATACGATCGATGGATCAGGGCCAATGAAAAGGCCAAGGTCTACATCA
TTGCTAGCATGTCTGATATTTTGGCAAAGAAGCATGAGCTGATGGTCACTACCAAGGAGATCATGGAGTCCTTGCAGGAAATGTTTGGACAACAATCCTTTCAGGTCCGG
CATGACTCGCTCAAATACGTCTTCAACGCACGGATGAAAGAGGGGAGGTCTGTCCGTGAACATGTTCTAGACATGATGACCCACTTTAATCTGGCTGAGATGAACGAGGC
TTCGATCGATGAGTCGAGCCAAGTCAGTTTTATTCTGGAGACTCTTCCGAAGAGTTTCCTTGATTTAAGAGCAATCTTTGTTATGAACAAAATTAACTACACTCTGACTA
CCCTCCTCAATGAGCTACAGAATTTTCAGTCATTGAACAGGGTCAAGGCATCGGAATCTGAGGCAAATGTTGCCTACAGGTCTTACCACAGGGGTTCGACCTCTGGGACG
AAACATGTTGCTCCTTCACGTCCGAAAGGGAAGAAGAGAATGAAGAGGGATAAAACTAACCGAGGTGCCGCCCACAAGGCAAGAAGGTCAAGGACGTTGCAGAGAAAAGA
AAGTGTTTCCACTGCAATGGAGGCGGACACTGGAAGAGGAACTGTCCCAAATTCCTGGCCGAGAGGAAGAATCAAGGGGATTGATTCCTGGCAGCCGCAACGAGAGGGTG
AGACTGCAGAAACACGAACTAAGAAAGCGAAAGTTTCTCCTAAAGAAAATGTCCATCTTTGGCATCTACGGTTAGGCCACATTAATCTCAATAGGATTGAGAAACTAGTG
AGGAGTGGACTTCTAAACGAGCTGGAAGAAAACTCTTTGCCGGTGTGTGAGTCATGCCTTGAGGGCAAGATGACCAAACGTCCTATTAGTGGAAAAGGATATAGAGCCAA
AGAGCCTCTTGAGTTAGTACATTCTGACTTTTGTAGTCCGATGAATGTTAAAGCTCGGGGCGGTTATGAGTACTTCGTGTCTTTCATAGACGATTACTCGAGGTATGGGT
ATATTTACCTAATGCACAGGAAGTCTGAAACTCTTGAAAAGTTCAAGGAGTACAAGACTGAGGTTGAGAACCTCTTAGGTAAATCGCTTAAAACACTTCGATCGGATCGA
GGTGGAGAGTACATGGACACTGAATTCCAGGACTATATGATAGAACACGGAATTACGTCCCAACTCTCAGCGCCTGGTATGCCACAGCAAAATGGTGTATCAGAGAGGAG
AAACAGAATCTTGTTGGACATGGTTCGATCGATGATGAGCTATGCTCGTCTCCCTGATTCTTTTTGGGGTTACGTAGTGGAGACTGCAGTCTATATTTTGAACAACGTTC
TGTCGAAGAGTGTTTGTGAAACACCTTTCGAGCTCTGGAGTGGACGTAAAGGCAATTTACATCACTTCAGGATTTGGGGATGCCCGACCCACGTGTTGGTGTCAAACCCA
AAAAGTTGGAACCCCGTTCAAATTGTGCCTATTATTCGTAGGGTGCTTGTGTCGTCTAACGCCACTTTCCTAGAGGAAGACCACGTCAGGGATCATTTACCAAGGAGTAA
AATTGTGTTAAATGAAATGGACAGTACATCAGCAAGAGTTGCTGATGTGGCTAGTACGTCAACAAGTGTTGTTGATCCTAGCACGTCTAGTCAAATCCGTTCCCAAGAGT
TGGGAATGCCTCGACGTAGTGGGAGGGCTGTGAGACAGCTTGATCGCTACATGGATTTAGCTGAAACCTCAGTTGTCGCTCCTGATGATAACTGTGAGGATCCATTGACC
TATGATCAGGCAATGGTTGATGTTGACAAAGACGATTGGATTAAAGCAATGGACCAGGAAATGGAGTCTATGTACTTCAATTCTGTATGGGAGCTTGTGGATCAACCGGA
TGGGGTAAAACCTATTGGTTGCAAATGGATCTACAAGCGTAAACGTGACGTAGATGAGAAGGTGCAAACCTTCAATGCCCGACTAGTGGCAAAGGGTTTTACCCAGGTTG
AAGGGGTTGACTATGAGGAGACCTTTTCACCTGTTGTCATGGTAAAGTCGATCAGGATCCTTCTGGCCATTCCCGCGTATTATGTCTACGAGGTATGGCAGATGGACGCC
AAGATAGCCTTTCTGAATGGCAAACTTGATGAGACCATCTACATAGACCAGCCCAAAGGGTTCATTGCCCAAGGCCAAGAGCAAAAGGTTTGCCGGCTTCATAGGTCCAT
TTATGGGCTGAAACAAGTTTCGAGGTCTTGGAATATAAGGTTTGATGAGGCGATCAAATCTTATGGCTTTGATCAAAATGTCGACGAGCCTTGTGTCTACAAGAAAATCG
TTGACAAAACTGTCGCATTTTTGGTGTTGTATGTGGATGATATTCTTCTCATTGGGAATGAGGTAGAATTTCTTATTGACGTTAAGAAGTTGCTAGCTTCGCAGTTCCAA
ATGAAAGATTTGGGAGAAGCTCGATATAAGATGCAGAACTCCAAGAAAGACTTGCTGCCTTTCAGGCATGGGGTTCACTTGTCTAAGGATCAATGTCCTAAGACTCCTCA
AGATGTTGAGGATATGAGACGGATTTCATATGCTTCGGCTGTAGGGAGCCTGATGTATCCCATGTTGTGTACTAGGCCCGACATCCATTTATGCGGTAAAGAATCGTCGA
GTAGATATCGGTCCAATCCGAGATTAGCTCACCGGACGGAAATCCACTTCGGGTCACCTTCATCCCGAATGCGGGAGTCAGAGTATGGGCGAAGTATCAAGCGAGGATGC
ATCGTTGATTCCACTATGGAAGCAGAGTATGTTGCGGTTTGTGAAGCTGCAAAGGAAGCTGTTTGGCTTAGGAAGTTCATGACGGATTTGGAAGTTGTTCCAAATATGAA
CTTGCCGATCACACTATTCTGTGATAACAGTGGTGCAGTAGCCAACTCACGTGAGCCTCGGAGCCATAAGAAAGGCAAACACATTGAGTGCAAGTATCATTTGATACGGG
AGATTGTGCACCGTGGAGACGTGACAGTCACGTAG
Protein sequenceShow/hide protein sequence
MSSAIITLLASDKLVGDNYQSWKNNINTILVIDDIKFVFVEECPRCRPDRLLRSFCDAYDRWIRANEKAKVYIIASMSDILAKKHELMVTTKEIMESLQEMFGQQSFQVR
HDSLKYVFNARMKEGRSVREHVLDMMTHFNLAEMNEASIDESSQVSFILETLPKSFLDLRAIFVMNKINYTLTTLLNELQNFQSLNRVKASESEANVAYRSYHRGSTSGT
KHVAPSRPKGKKRMKRDKTNRGAAHKARRSRTLQRKESVSTAMEADTGRGTVPNSWPRGRIKGIDSWQPQREGETAETRTKKAKVSPKENVHLWHLRLGHINLNRIEKLV
RSGLLNELEENSLPVCESCLEGKMTKRPISGKGYRAKEPLELVHSDFCSPMNVKARGGYEYFVSFIDDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLGKSLKTLRSDR
GGEYMDTEFQDYMIEHGITSQLSAPGMPQQNGVSERRNRILLDMVRSMMSYARLPDSFWGYVVETAVYILNNVLSKSVCETPFELWSGRKGNLHHFRIWGCPTHVLVSNP
KSWNPVQIVPIIRRVLVSSNATFLEEDHVRDHLPRSKIVLNEMDSTSARVADVASTSTSVVDPSTSSQIRSQELGMPRRSGRAVRQLDRYMDLAETSVVAPDDNCEDPLT
YDQAMVDVDKDDWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKRKRDVDEKVQTFNARLVAKGFTQVEGVDYEETFSPVVMVKSIRILLAIPAYYVYEVWQMDA
KIAFLNGKLDETIYIDQPKGFIAQGQEQKVCRLHRSIYGLKQVSRSWNIRFDEAIKSYGFDQNVDEPCVYKKIVDKTVAFLVLYVDDILLIGNEVEFLIDVKKLLASQFQ
MKDLGEARYKMQNSKKDLLPFRHGVHLSKDQCPKTPQDVEDMRRISYASAVGSLMYPMLCTRPDIHLCGKESSSRYRSNPRLAHRTEIHFGSPSSRMRESEYGRSIKRGC
IVDSTMEAEYVAVCEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNSGAVANSREPRSHKKGKHIECKYHLIREIVHRGDVTVT