; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018415 (gene) of Snake gourd v1 genome

Gene IDTan0018415
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG02:38838866..38840989
RNA-Seq ExpressionTan0018415
SyntenyTan0018415
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0005488 - binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]9.8e-25867.68Show/hide
Query:  MMDMVQSMMSYAHLPSSFWGYTVETAVYILNNVPSKSVSETPFKLWKGRK---------------------------------GYPKETRGGLFYDPKED
        ++DMV+SMMSYA LPSSFWGY VETAV+ILNNVPSKSVSETPF+LW+GRK                                 GYPKETRGGLF+DP+E+
Subjt:  MMDMVQSMMSYAHLPSSFWGYTVETAVYILNNVPSKSVSETPFKLWKGRK---------------------------------GYPKETRGGLFYDPKED

Query:  KVFVSTNATFLEEDHIRDHKPRSKIVLSELDGTIAKVANKTSTSTRVFDTRLSNQERPSQELSMPRRSGRVITQPDRYMGLSETQVVIPDDDCEDPSTYS
        +VFVSTNATFLEEDH+R+HKPRSK+VLSE      +V ++   S+RV +T  S Q  PSQ L MPRRSGRV++QP+RY+GL+ETQVVIPDD  EDP +Y 
Subjt:  KVFVSTNATFLEEDHIRDHKPRSKIVLSELDGTIAKVANKTSTSTRVFDTRLSNQERPSQELSMPRRSGRVITQPDRYMGLSETQVVIPDDDCEDPSTYS

Query:  QAMVDKDKDKWVIAMDQKMESMYFNFVWDLVDKPDGVKPIGCKWIYKRKRGVDGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILLAIVAYYD
        QAM D DKD+WV AMD +MESMYFN VW+LVD P+GVKPIGCKWIYKRKR   GKVQT KARLVAKGYTQ EGVDYEETFSPVAM+KSI ILL+I  +YD
Subjt:  QAMVDKDKDKWVIAMDQKMESMYFNFVWDLVDKPDGVKPIGCKWIYKRKRGVDGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILLAIVAYYD

Query:  YEVWQMDVKTNFLNGNLEENIYMGKPKGFIELGQKSK------RFAGLK----------------------------------------VLYVDDILLIG
        YE+WQMDVKT FLNGNLEE+I+M +P+GFI  GQ+ K         GLK                                        VLYVDDILLIG
Subjt:  YEVWQMDVKTNFLNGNLEENIYMGKPKGFIELGQKSK------RFAGLK----------------------------------------VLYVDDILLIG

Query:  NEVGYLTDIKNWLATQFLMKDLGKAQFVLGIQIVRNRKNKTLALSQTSYIDKVLLRFKMQDSKKGLLPFRHRVHFSKEKCPKTPQGVEDMRRFPYASVVG
        N+VGYLTD+K WLA QF MKDLG+AQ+VLGIQI+R+RKNKTLALSQ +YIDK+L+R+ MQ+SKKGLLPFRH VH SKE+ PKTPQ VEDMRR PYAS VG
Subjt:  NEVGYLTDIKNWLATQFLMKDLGKAQFVLGIQIVRNRKNKTLALSQTSYIDKVLLRFKMQDSKKGLLPFRHRVHFSKEKCPKTPQGVEDMRRFPYASVVG

Query:  SLMYAMLCTRPGICFAVVMVSRYQSNPGHEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSG--------------VKQGCIA
        SLMYAMLCTRP IC+AV +VSRYQSNPG +HWT VK +LKYLRRTR+YMLVYGAKDL LTGYTDSDFQTDKDSRKSTSG              +KQGCIA
Subjt:  SLMYAMLCTRPGICFAVVMVSRYQSNPGHEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSG--------------VKQGCIA

Query:  DSTMEAEYVAACEAAKEIVWLRKFMLNFEVVPNMILFITLYCDNSGAVENSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKIAQEHNV
        DSTMEAEYVAACEAAKE VWLRKF+ + EVVPNM L ITLYCDNSGAV NS+EPRSHKRGKHIERKYHLIREIV RGDVIVTKIA EHN+
Subjt:  DSTMEAEYVAACEAAKEIVWLRKFMLNFEVVPNMILFITLYCDNSGAVENSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKIAQEHNV

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]7.8e-25567.1Show/hide
Query:  MMDMVQSMMSYAHLPSSFWGYTVETAVYILNNVPSKSVSETPFKLWKGRK---------------------------------GYPKETRGGLFYDPKED
        ++DMV+SMMSYA LPSSFWGY VETAV+ILNNVPSKSVSETPF+LW+GRK                                 GYPKETRGGLF+DPKE+
Subjt:  MMDMVQSMMSYAHLPSSFWGYTVETAVYILNNVPSKSVSETPFKLWKGRK---------------------------------GYPKETRGGLFYDPKED

Query:  KVFVSTNATFLEEDHIRDHKPRSKIVLSELDGTIAKVANKTSTSTRVFDTRLSNQERPSQELSMPRRSGRVITQPDRYMGLSETQVVIPDDDCEDPSTYS
        +VFVSTNATFLEEDH+R+HKPRSK+VLSE      +V ++   S+RV +T  S Q  PSQ L MPRRSGRV++QP+RY+GL+ETQVVIPDD  EDP +Y 
Subjt:  KVFVSTNATFLEEDHIRDHKPRSKIVLSELDGTIAKVANKTSTSTRVFDTRLSNQERPSQELSMPRRSGRVITQPDRYMGLSETQVVIPDDDCEDPSTYS

Query:  QAMVDKDKDKWVIAMDQKMESMYFNFVWDLVDKPDGVKPIGCKWIYKRKRGVDGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILLAIVAYYD
        QAM D DKD+WV AMD +MESMYFN VW+LVD P+GVKPIGCKWIYKRKR   GKVQT KARLVAKGYT+ EGVDYEETFS VAM+KSI ILL+I  +YD
Subjt:  QAMVDKDKDKWVIAMDQKMESMYFNFVWDLVDKPDGVKPIGCKWIYKRKRGVDGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILLAIVAYYD

Query:  YEVWQMDVKTNFLNGNLEENIYMGKPKGFIELGQKSK------RFAGLK----------------------------------------VLYVDDILLIG
        YE+WQMDVKT FLNGNLEE+I+M +P+GFI  GQ+ K         GLK                                        VLYVDDILLIG
Subjt:  YEVWQMDVKTNFLNGNLEENIYMGKPKGFIELGQKSK------RFAGLK----------------------------------------VLYVDDILLIG

Query:  NEVGYLTDIKNWLATQFLMKDLGKAQFVLGIQIVRNRKNKTLALSQTSYIDKVLLRFKMQDSKKGLLPFRHRVHFSKEKCPKTPQGVEDMRRFPYASVVG
        N+VGYLTD+K WLA QF MKDLG+ Q+VLGIQI+R+RKNKTLALSQ +YIDK+L+R+ MQ+SKKGLLPFRH VH SKE+ PKTPQ VEDMRR PYAS VG
Subjt:  NEVGYLTDIKNWLATQFLMKDLGKAQFVLGIQIVRNRKNKTLALSQTSYIDKVLLRFKMQDSKKGLLPFRHRVHFSKEKCPKTPQGVEDMRRFPYASVVG

Query:  SLMYAMLCTRPGICFAVVMVSRYQSNPGHEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTS--------------GVKQGCIA
        SLMYAMLCTRP IC+AV +VSRYQSNPG +HWT VK ILKYLRRTR+YMLVYGAKDL LTGYT+SDFQTDKDSRKSTS               +KQGCIA
Subjt:  SLMYAMLCTRPGICFAVVMVSRYQSNPGHEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTS--------------GVKQGCIA

Query:  DSTMEAEYVAACEAAKEIVWLRKFMLNFEVVPNMILFITLYCDNSGAVENSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKIAQEHNV
        DSTMEAEYVAACEAAKE VWL+KF+ + EVVPNM L ITLYCDNSGAV NS+EPRSHKRGKHIERKYHLIREIV RGDVIVTKIA EHN+
Subjt:  DSTMEAEYVAACEAAKEIVWLRKFMLNFEVVPNMILFITLYCDNSGAVENSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKIAQEHNV

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]9.2e-24865.99Show/hide
Query:  MMDMVQSMMSYAHLPSSFWGYTVETAVYILNNVPSKSVSETPFKLWKGRK---------------------------------GYPKETRGGLFYDPKED
        ++DMV+SMMSYAHLP+SFWGY V+TAVYILN VPSKSVSETP KLW GRK                                 GYPK TRGG FYDPK++
Subjt:  MMDMVQSMMSYAHLPSSFWGYTVETAVYILNNVPSKSVSETPFKLWKGRK---------------------------------GYPKETRGGLFYDPKED

Query:  KVFVSTNATFLEEDHIRDHKPRSKIVLSELDGTI----AKVANKTSTSTRVFDTRLSNQERPSQELSMPRRSGRVITQPDRYMGLSETQVVIPDDDCEDP
        KVFVSTNATFLEEDHIR+HKPRSKIVL+EL         +V  + S  TRV     S +    Q L  PRRSGRV   P RYM L+ET  VI D D EDP
Subjt:  KVFVSTNATFLEEDHIRDHKPRSKIVLSELDGTI----AKVANKTSTSTRVFDTRLSNQERPSQELSMPRRSGRVITQPDRYMGLSETQVVIPDDDCEDP

Query:  STYSQAMVDKDKDKWVIAMDQKMESMYFNFVWDLVDKPDGVKPIGCKWIYKRKRGVDGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILLAIV
         T+ +AM D DKD+W+ AM+ ++ESMYFN VWDLVD+PDGVKPIGCKWIYKRKRG DGKVQT KARLVAKGYTQVEGVDYEETFSPVAM+KSI ILL+I 
Subjt:  STYSQAMVDKDKDKWVIAMDQKMESMYFNFVWDLVDKPDGVKPIGCKWIYKRKRGVDGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILLAIV

Query:  AYYDYEVWQMDVKTNFLNGNLEENIYMGKPKGFIELGQKSK------RFAGLK----------------------------------------VLYVDDI
        AY+DYE+WQMDVKT FLNGNLEE IYM +P+GFI  GQ+ K         GLK                                        VLYVDDI
Subjt:  AYYDYEVWQMDVKTNFLNGNLEENIYMGKPKGFIELGQKSK------RFAGLK----------------------------------------VLYVDDI

Query:  LLIGNEVGYLTDIKNWLATQFLMKDLGKAQFVLGIQIVRNRKNKTLALSQTSYIDKVLLRFKMQDSKKGLLPFRHRVHFSKEKCPKTPQGVEDMRRFPYA
        LLIGN++G LTDIK WLATQF MKDLG+AQFVLGIQI R+RKNK LALSQ SYIDK+++++ MQ+SK+GLLPFRH V  SKE+CPKTPQ VE+MR  PYA
Subjt:  LLIGNEVGYLTDIKNWLATQFLMKDLGKAQFVLGIQIVRNRKNKTLALSQTSYIDKVLLRFKMQDSKKGLLPFRHRVHFSKEKCPKTPQGVEDMRRFPYA

Query:  SVVGSLMYAMLCTRPGICFAVVMVSRYQSNPGHEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSG--------------VKQ
        S VGSLMYAMLCTRP IC+AV +VSRYQSNPG  HWT VKTILKYLRRTR+YMLVYG+KDL LTGYTDSDFQTD+DSRKSTSG              +KQ
Subjt:  SVVGSLMYAMLCTRPGICFAVVMVSRYQSNPGHEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSG--------------VKQ

Query:  GCIADSTMEAEYVAACEAAKEIVWLRKFMLNFEVVPNMILFITLYCDNSGAVENSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKIAQEHNV
        GCIADSTMEAEYVAACEAAKE VWLR F+++ EVVPNM   ITLYCDNSGAV NSREPRSHKRGKHIERKYHLIREIVHRGDVIVT+IA  HNV
Subjt:  GCIADSTMEAEYVAACEAAKEIVWLRKFMLNFEVVPNMILFITLYCDNSGAVENSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKIAQEHNV

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]9.8e-25867.68Show/hide
Query:  MMDMVQSMMSYAHLPSSFWGYTVETAVYILNNVPSKSVSETPFKLWKGRK---------------------------------GYPKETRGGLFYDPKED
        ++DMV+SMMSYA LPSSFWGY VETAV+ILNNVPSKSVSETPF+LW+GRK                                 GYPKETRGGLF+DP+E+
Subjt:  MMDMVQSMMSYAHLPSSFWGYTVETAVYILNNVPSKSVSETPFKLWKGRK---------------------------------GYPKETRGGLFYDPKED

Query:  KVFVSTNATFLEEDHIRDHKPRSKIVLSELDGTIAKVANKTSTSTRVFDTRLSNQERPSQELSMPRRSGRVITQPDRYMGLSETQVVIPDDDCEDPSTYS
        +VFVSTNATFLEEDH+R+HKPRSK+VLSE      +V ++   S+RV +T  S Q  PSQ L MPRRSGRV++QP+RY+GL+ETQVVIPDD  EDP +Y 
Subjt:  KVFVSTNATFLEEDHIRDHKPRSKIVLSELDGTIAKVANKTSTSTRVFDTRLSNQERPSQELSMPRRSGRVITQPDRYMGLSETQVVIPDDDCEDPSTYS

Query:  QAMVDKDKDKWVIAMDQKMESMYFNFVWDLVDKPDGVKPIGCKWIYKRKRGVDGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILLAIVAYYD
        QAM D DKD+WV AMD +MESMYFN VW+LVD P+GVKPIGCKWIYKRKR   GKVQT KARLVAKGYTQ EGVDYEETFSPVAM+KSI ILL+I  +YD
Subjt:  QAMVDKDKDKWVIAMDQKMESMYFNFVWDLVDKPDGVKPIGCKWIYKRKRGVDGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILLAIVAYYD

Query:  YEVWQMDVKTNFLNGNLEENIYMGKPKGFIELGQKSK------RFAGLK----------------------------------------VLYVDDILLIG
        YE+WQMDVKT FLNGNLEE+I+M +P+GFI  GQ+ K         GLK                                        VLYVDDILLIG
Subjt:  YEVWQMDVKTNFLNGNLEENIYMGKPKGFIELGQKSK------RFAGLK----------------------------------------VLYVDDILLIG

Query:  NEVGYLTDIKNWLATQFLMKDLGKAQFVLGIQIVRNRKNKTLALSQTSYIDKVLLRFKMQDSKKGLLPFRHRVHFSKEKCPKTPQGVEDMRRFPYASVVG
        N+VGYLTD+K WLA QF MKDLG+AQ+VLGIQI+R+RKNKTLALSQ +YIDK+L+R+ MQ+SKKGLLPFRH VH SKE+ PKTPQ VEDMRR PYAS VG
Subjt:  NEVGYLTDIKNWLATQFLMKDLGKAQFVLGIQIVRNRKNKTLALSQTSYIDKVLLRFKMQDSKKGLLPFRHRVHFSKEKCPKTPQGVEDMRRFPYASVVG

Query:  SLMYAMLCTRPGICFAVVMVSRYQSNPGHEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSG--------------VKQGCIA
        SLMYAMLCTRP IC+AV +VSRYQSNPG +HWT VK +LKYLRRTR+YMLVYGAKDL LTGYTDSDFQTDKDSRKSTSG              +KQGCIA
Subjt:  SLMYAMLCTRPGICFAVVMVSRYQSNPGHEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSG--------------VKQGCIA

Query:  DSTMEAEYVAACEAAKEIVWLRKFMLNFEVVPNMILFITLYCDNSGAVENSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKIAQEHNV
        DSTMEAEYVAACEAAKE VWLRKF+ + EVVPNM L ITLYCDNSGAV NS+EPRSHKRGKHIERKYHLIREIV RGDVIVTKIA EHN+
Subjt:  DSTMEAEYVAACEAAKEIVWLRKFMLNFEVVPNMILFITLYCDNSGAVENSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKIAQEHNV

KAA0061339.1 gag/pol protein [Cucumis melo var. makuwa]2.9e-24967.77Show/hide
Query:  MMDMVQSMMSYAHLPSSFWGYTVETAVYILNNVPSKSVSETPFKLWKGRKGYPKETRGGLFYDPKEDKVFVSTNATFLEEDHIRDHKPRSKIVLSELDGT
        ++DMV+SMMSYAHLP+SFWGY V+TAVYILN VPSKSVSETP KLW G KGYPK TRGG FYDPK++KVFVSTNATFLEEDHIR+HKPRSKIVL+EL   
Subjt:  MMDMVQSMMSYAHLPSSFWGYTVETAVYILNNVPSKSVSETPFKLWKGRKGYPKETRGGLFYDPKEDKVFVSTNATFLEEDHIRDHKPRSKIVLSELDGT

Query:  IAK----VANKTSTSTRVFDTRLSNQERPSQELSMPRRSGRVITQPDRYMGLSETQVVIPDDDCEDPSTYSQAMVDKDKDKWVIAMDQKMESMYFNFVWD
          K    V  + S  TRV     S +    Q L  PR+SGRV   P RYM L+ET  VI D D EDP T+ +AM D DKD+W+ AM+ ++ESMYFN VWD
Subjt:  IAK----VANKTSTSTRVFDTRLSNQERPSQELSMPRRSGRVITQPDRYMGLSETQVVIPDDDCEDPSTYSQAMVDKDKDKWVIAMDQKMESMYFNFVWD

Query:  LVDKPDGVKPIGCKWIYKRKRGVDGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILLAIVAYYDYEVWQMDVKTNFLNGNLEENIYMGKPKGF
        L+D+PDGVKPIGCKWIYKRKRG DGKVQT KARLVAKGYTQVEGVDYEETFSPVAM+KSI ILL+I AY+DYE+WQMDVKT FLNGNLEE IYM +P+GF
Subjt:  LVDKPDGVKPIGCKWIYKRKRGVDGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILLAIVAYYDYEVWQMDVKTNFLNGNLEENIYMGKPKGF

Query:  IELGQKSK------RFAGLK----------------------------------------VLYVDDILLIGNEVGYLTDIKNWLATQFLMKDLGKAQFVL
        I  GQ+ K         GLK                                        VLYVDDILLIGN++G LTDIK WLATQF MKDLG+AQFVL
Subjt:  IELGQKSK------RFAGLK----------------------------------------VLYVDDILLIGNEVGYLTDIKNWLATQFLMKDLGKAQFVL

Query:  GIQIVRNRKNKTLALSQTSYIDKVLLRFKMQDSKKGLLPFRHRVHFSKEKCPKTPQGVEDMRRFPYASVVGSLMYAMLCTRPGICFAVVMVSRYQSNPGH
        GIQI R+RKNK LALSQ SYIDK+++++ MQ+SK+GLLPFRH V  SKE+CPKTPQ VE+MR  PYAS VGSLMYAMLCTRP IC+AV +VSRYQSNPG 
Subjt:  GIQIVRNRKNKTLALSQTSYIDKVLLRFKMQDSKKGLLPFRHRVHFSKEKCPKTPQGVEDMRRFPYASVVGSLMYAMLCTRPGICFAVVMVSRYQSNPGH

Query:  EHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSG--------------VKQGCIADSTMEAEYVAACEAAKEIVWLRKFMLNFE
         HWT VKTILKYLRR R+Y LVYG+KDL LTGYTDSDFQTD+DSRKST G              +KQGCIADSTMEAEYV ACEAAKE VWLR F+++ E
Subjt:  EHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSG--------------VKQGCIADSTMEAEYVAACEAAKEIVWLRKFMLNFE

Query:  VVPNMILFITLYCDNSGAVENSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKIAQEHNVLILLQR
        VVPNM   ITLYCDNSGAV NSREPRSHKRGKHIERKYHLIREIVHRGDVIVT+IA  HN+  L  +
Subjt:  VVPNMILFITLYCDNSGAVENSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKIAQEHNVLILLQR

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein2.2e-24765.85Show/hide
Query:  MMDMVQSMMSYAHLPSSFWGYTVETAVYILNNVPSKSVSETPFKLWKGRK---------------------------------GYPKETRGGLFYDPKED
        ++DMV+SMMSYAHLP+SFWGY V+TAVYILN VPSKSVSETP KLW GRK                                 GYPK TRGG FYDPK++
Subjt:  MMDMVQSMMSYAHLPSSFWGYTVETAVYILNNVPSKSVSETPFKLWKGRK---------------------------------GYPKETRGGLFYDPKED

Query:  KVFVSTNATFLEEDHIRDHKPRSKIVLSELDGTI----AKVANKTSTSTRVFDTRLSNQERPSQELSMPRRSGRVITQPDRYMGLSETQVVIPDDDCEDP
        KVFVSTNATFLEEDHIR+HKPRSKIVL+EL         +V  + S  TRV     S +    Q L  PRRSGRV   P RYM L+ET  VI D D EDP
Subjt:  KVFVSTNATFLEEDHIRDHKPRSKIVLSELDGTI----AKVANKTSTSTRVFDTRLSNQERPSQELSMPRRSGRVITQPDRYMGLSETQVVIPDDDCEDP

Query:  STYSQAMVDKDKDKWVIAMDQKMESMYFNFVWDLVDKPDGVKPIGCKWIYKRKRGVDGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILLAIV
         T+ +AM D DKD+W+ AM+ ++ESMYFN VWDLVD+PDGVKPIGCKWIYKRKRG DGKVQT KARLVAKGYTQVEGVDYEETFSPVAM+KSI ILL+I 
Subjt:  STYSQAMVDKDKDKWVIAMDQKMESMYFNFVWDLVDKPDGVKPIGCKWIYKRKRGVDGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILLAIV

Query:  AYYDYEVWQMDVKTNFLNGNLEENIYMGKPKGFIELGQKSK------RFAGLK----------------------------------------VLYVDDI
        AY+DYE+WQMDVKT FLNGNLEE IYM +P+GFI  GQ+ K         GLK                                        VLYVDDI
Subjt:  AYYDYEVWQMDVKTNFLNGNLEENIYMGKPKGFIELGQKSK------RFAGLK----------------------------------------VLYVDDI

Query:  LLIGNEVGYLTDIKNWLATQFLMKDLGKAQFVLGIQIVRNRKNKTLALSQTSYIDKVLLRFKMQDSKKGLLPFRHRVHFSKEKCPKTPQGVEDMRRFPYA
        LLIGN++G LTDIK WLATQF MKDLG+AQFVLGIQI R+RKNK LALSQ SYIDK+++++ MQ+SK+GLLPFRH V  SKE+CPKTPQ VE+MR  PYA
Subjt:  LLIGNEVGYLTDIKNWLATQFLMKDLGKAQFVLGIQIVRNRKNKTLALSQTSYIDKVLLRFKMQDSKKGLLPFRHRVHFSKEKCPKTPQGVEDMRRFPYA

Query:  SVVGSLMYAMLCTRPGICFAVVMVSRYQSNPGHEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSG--------------VKQ
        S VGSLMYAMLCTRP IC+AV +VSRYQSNPG  HWT VKTILKYLRRTR+Y LVYG+KDL LTGYTDSDFQTD+DSRKSTSG              +KQ
Subjt:  SVVGSLMYAMLCTRPGICFAVVMVSRYQSNPGHEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSG--------------VKQ

Query:  GCIADSTMEAEYVAACEAAKEIVWLRKFMLNFEVVPNMILFITLYCDNSGAVENSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKIAQEHNV
        GCIADSTMEAEYVAACEAAKE VWLR F+++ EVVPNM   ITLYCDNSGAV NSREPRSHKRGKHIERKYHLIREIVHRGDVIVT+IA  HNV
Subjt:  GCIADSTMEAEYVAACEAAKEIVWLRKFMLNFEVVPNMILFITLYCDNSGAVENSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKIAQEHNV

A0A5A7TWB9 Gag/pol protein4.5e-24865.99Show/hide
Query:  MMDMVQSMMSYAHLPSSFWGYTVETAVYILNNVPSKSVSETPFKLWKGRK---------------------------------GYPKETRGGLFYDPKED
        ++DMV+SMMSYAHLP+SFWGY V+TAVYILN VPSKSVSETP KLW GRK                                 GYPK TRGG FYDPK++
Subjt:  MMDMVQSMMSYAHLPSSFWGYTVETAVYILNNVPSKSVSETPFKLWKGRK---------------------------------GYPKETRGGLFYDPKED

Query:  KVFVSTNATFLEEDHIRDHKPRSKIVLSELDGTI----AKVANKTSTSTRVFDTRLSNQERPSQELSMPRRSGRVITQPDRYMGLSETQVVIPDDDCEDP
        KVFVSTNATFLEEDHIR+HKPRSKIVL+EL         +V  + S  TRV     S +    Q L  PRRSGRV   P RYM L+ET  VI D D EDP
Subjt:  KVFVSTNATFLEEDHIRDHKPRSKIVLSELDGTI----AKVANKTSTSTRVFDTRLSNQERPSQELSMPRRSGRVITQPDRYMGLSETQVVIPDDDCEDP

Query:  STYSQAMVDKDKDKWVIAMDQKMESMYFNFVWDLVDKPDGVKPIGCKWIYKRKRGVDGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILLAIV
         T+ +AM D DKD+W+ AM+ ++ESMYFN VWDLVD+PDGVKPIGCKWIYKRKRG DGKVQT KARLVAKGYTQVEGVDYEETFSPVAM+KSI ILL+I 
Subjt:  STYSQAMVDKDKDKWVIAMDQKMESMYFNFVWDLVDKPDGVKPIGCKWIYKRKRGVDGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILLAIV

Query:  AYYDYEVWQMDVKTNFLNGNLEENIYMGKPKGFIELGQKSK------RFAGLK----------------------------------------VLYVDDI
        AY+DYE+WQMDVKT FLNGNLEE IYM +P+GFI  GQ+ K         GLK                                        VLYVDDI
Subjt:  AYYDYEVWQMDVKTNFLNGNLEENIYMGKPKGFIELGQKSK------RFAGLK----------------------------------------VLYVDDI

Query:  LLIGNEVGYLTDIKNWLATQFLMKDLGKAQFVLGIQIVRNRKNKTLALSQTSYIDKVLLRFKMQDSKKGLLPFRHRVHFSKEKCPKTPQGVEDMRRFPYA
        LLIGN++G LTDIK WLATQF MKDLG+AQFVLGIQI R+RKNK LALSQ SYIDK+++++ MQ+SK+GLLPFRH V  SKE+CPKTPQ VE+MR  PYA
Subjt:  LLIGNEVGYLTDIKNWLATQFLMKDLGKAQFVLGIQIVRNRKNKTLALSQTSYIDKVLLRFKMQDSKKGLLPFRHRVHFSKEKCPKTPQGVEDMRRFPYA

Query:  SVVGSLMYAMLCTRPGICFAVVMVSRYQSNPGHEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSG--------------VKQ
        S VGSLMYAMLCTRP IC+AV +VSRYQSNPG  HWT VKTILKYLRRTR+YMLVYG+KDL LTGYTDSDFQTD+DSRKSTSG              +KQ
Subjt:  SVVGSLMYAMLCTRPGICFAVVMVSRYQSNPGHEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSG--------------VKQ

Query:  GCIADSTMEAEYVAACEAAKEIVWLRKFMLNFEVVPNMILFITLYCDNSGAVENSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKIAQEHNV
        GCIADSTMEAEYVAACEAAKE VWLR F+++ EVVPNM   ITLYCDNSGAV NSREPRSHKRGKHIERKYHLIREIVHRGDVIVT+IA  HNV
Subjt:  GCIADSTMEAEYVAACEAAKEIVWLRKFMLNFEVVPNMILFITLYCDNSGAVENSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKIAQEHNV

A0A5A7TZD0 Gag/pol protein4.8e-25867.68Show/hide
Query:  MMDMVQSMMSYAHLPSSFWGYTVETAVYILNNVPSKSVSETPFKLWKGRK---------------------------------GYPKETRGGLFYDPKED
        ++DMV+SMMSYA LPSSFWGY VETAV+ILNNVPSKSVSETPF+LW+GRK                                 GYPKETRGGLF+DP+E+
Subjt:  MMDMVQSMMSYAHLPSSFWGYTVETAVYILNNVPSKSVSETPFKLWKGRK---------------------------------GYPKETRGGLFYDPKED

Query:  KVFVSTNATFLEEDHIRDHKPRSKIVLSELDGTIAKVANKTSTSTRVFDTRLSNQERPSQELSMPRRSGRVITQPDRYMGLSETQVVIPDDDCEDPSTYS
        +VFVSTNATFLEEDH+R+HKPRSK+VLSE      +V ++   S+RV +T  S Q  PSQ L MPRRSGRV++QP+RY+GL+ETQVVIPDD  EDP +Y 
Subjt:  KVFVSTNATFLEEDHIRDHKPRSKIVLSELDGTIAKVANKTSTSTRVFDTRLSNQERPSQELSMPRRSGRVITQPDRYMGLSETQVVIPDDDCEDPSTYS

Query:  QAMVDKDKDKWVIAMDQKMESMYFNFVWDLVDKPDGVKPIGCKWIYKRKRGVDGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILLAIVAYYD
        QAM D DKD+WV AMD +MESMYFN VW+LVD P+GVKPIGCKWIYKRKR   GKVQT KARLVAKGYTQ EGVDYEETFSPVAM+KSI ILL+I  +YD
Subjt:  QAMVDKDKDKWVIAMDQKMESMYFNFVWDLVDKPDGVKPIGCKWIYKRKRGVDGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILLAIVAYYD

Query:  YEVWQMDVKTNFLNGNLEENIYMGKPKGFIELGQKSK------RFAGLK----------------------------------------VLYVDDILLIG
        YE+WQMDVKT FLNGNLEE+I+M +P+GFI  GQ+ K         GLK                                        VLYVDDILLIG
Subjt:  YEVWQMDVKTNFLNGNLEENIYMGKPKGFIELGQKSK------RFAGLK----------------------------------------VLYVDDILLIG

Query:  NEVGYLTDIKNWLATQFLMKDLGKAQFVLGIQIVRNRKNKTLALSQTSYIDKVLLRFKMQDSKKGLLPFRHRVHFSKEKCPKTPQGVEDMRRFPYASVVG
        N+VGYLTD+K WLA QF MKDLG+AQ+VLGIQI+R+RKNKTLALSQ +YIDK+L+R+ MQ+SKKGLLPFRH VH SKE+ PKTPQ VEDMRR PYAS VG
Subjt:  NEVGYLTDIKNWLATQFLMKDLGKAQFVLGIQIVRNRKNKTLALSQTSYIDKVLLRFKMQDSKKGLLPFRHRVHFSKEKCPKTPQGVEDMRRFPYASVVG

Query:  SLMYAMLCTRPGICFAVVMVSRYQSNPGHEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSG--------------VKQGCIA
        SLMYAMLCTRP IC+AV +VSRYQSNPG +HWT VK +LKYLRRTR+YMLVYGAKDL LTGYTDSDFQTDKDSRKSTSG              +KQGCIA
Subjt:  SLMYAMLCTRPGICFAVVMVSRYQSNPGHEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSG--------------VKQGCIA

Query:  DSTMEAEYVAACEAAKEIVWLRKFMLNFEVVPNMILFITLYCDNSGAVENSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKIAQEHNV
        DSTMEAEYVAACEAAKE VWLRKF+ + EVVPNM L ITLYCDNSGAV NS+EPRSHKRGKHIERKYHLIREIV RGDVIVTKIA EHN+
Subjt:  DSTMEAEYVAACEAAKEIVWLRKFMLNFEVVPNMILFITLYCDNSGAVENSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKIAQEHNV

A0A5A7UYE8 Gag/pol protein4.8e-25867.68Show/hide
Query:  MMDMVQSMMSYAHLPSSFWGYTVETAVYILNNVPSKSVSETPFKLWKGRK---------------------------------GYPKETRGGLFYDPKED
        ++DMV+SMMSYA LPSSFWGY VETAV+ILNNVPSKSVSETPF+LW+GRK                                 GYPKETRGGLF+DP+E+
Subjt:  MMDMVQSMMSYAHLPSSFWGYTVETAVYILNNVPSKSVSETPFKLWKGRK---------------------------------GYPKETRGGLFYDPKED

Query:  KVFVSTNATFLEEDHIRDHKPRSKIVLSELDGTIAKVANKTSTSTRVFDTRLSNQERPSQELSMPRRSGRVITQPDRYMGLSETQVVIPDDDCEDPSTYS
        +VFVSTNATFLEEDH+R+HKPRSK+VLSE      +V ++   S+RV +T  S Q  PSQ L MPRRSGRV++QP+RY+GL+ETQVVIPDD  EDP +Y 
Subjt:  KVFVSTNATFLEEDHIRDHKPRSKIVLSELDGTIAKVANKTSTSTRVFDTRLSNQERPSQELSMPRRSGRVITQPDRYMGLSETQVVIPDDDCEDPSTYS

Query:  QAMVDKDKDKWVIAMDQKMESMYFNFVWDLVDKPDGVKPIGCKWIYKRKRGVDGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILLAIVAYYD
        QAM D DKD+WV AMD +MESMYFN VW+LVD P+GVKPIGCKWIYKRKR   GKVQT KARLVAKGYTQ EGVDYEETFSPVAM+KSI ILL+I  +YD
Subjt:  QAMVDKDKDKWVIAMDQKMESMYFNFVWDLVDKPDGVKPIGCKWIYKRKRGVDGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILLAIVAYYD

Query:  YEVWQMDVKTNFLNGNLEENIYMGKPKGFIELGQKSK------RFAGLK----------------------------------------VLYVDDILLIG
        YE+WQMDVKT FLNGNLEE+I+M +P+GFI  GQ+ K         GLK                                        VLYVDDILLIG
Subjt:  YEVWQMDVKTNFLNGNLEENIYMGKPKGFIELGQKSK------RFAGLK----------------------------------------VLYVDDILLIG

Query:  NEVGYLTDIKNWLATQFLMKDLGKAQFVLGIQIVRNRKNKTLALSQTSYIDKVLLRFKMQDSKKGLLPFRHRVHFSKEKCPKTPQGVEDMRRFPYASVVG
        N+VGYLTD+K WLA QF MKDLG+AQ+VLGIQI+R+RKNKTLALSQ +YIDK+L+R+ MQ+SKKGLLPFRH VH SKE+ PKTPQ VEDMRR PYAS VG
Subjt:  NEVGYLTDIKNWLATQFLMKDLGKAQFVLGIQIVRNRKNKTLALSQTSYIDKVLLRFKMQDSKKGLLPFRHRVHFSKEKCPKTPQGVEDMRRFPYASVVG

Query:  SLMYAMLCTRPGICFAVVMVSRYQSNPGHEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSG--------------VKQGCIA
        SLMYAMLCTRP IC+AV +VSRYQSNPG +HWT VK +LKYLRRTR+YMLVYGAKDL LTGYTDSDFQTDKDSRKSTSG              +KQGCIA
Subjt:  SLMYAMLCTRPGICFAVVMVSRYQSNPGHEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSG--------------VKQGCIA

Query:  DSTMEAEYVAACEAAKEIVWLRKFMLNFEVVPNMILFITLYCDNSGAVENSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKIAQEHNV
        DSTMEAEYVAACEAAKE VWLRKF+ + EVVPNM L ITLYCDNSGAV NS+EPRSHKRGKHIERKYHLIREIV RGDVIVTKIA EHN+
Subjt:  DSTMEAEYVAACEAAKEIVWLRKFMLNFEVVPNMILFITLYCDNSGAVENSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKIAQEHNV

A0A5A7V6N0 Gag/pol protein1.4e-24967.77Show/hide
Query:  MMDMVQSMMSYAHLPSSFWGYTVETAVYILNNVPSKSVSETPFKLWKGRKGYPKETRGGLFYDPKEDKVFVSTNATFLEEDHIRDHKPRSKIVLSELDGT
        ++DMV+SMMSYAHLP+SFWGY V+TAVYILN VPSKSVSETP KLW G KGYPK TRGG FYDPK++KVFVSTNATFLEEDHIR+HKPRSKIVL+EL   
Subjt:  MMDMVQSMMSYAHLPSSFWGYTVETAVYILNNVPSKSVSETPFKLWKGRKGYPKETRGGLFYDPKEDKVFVSTNATFLEEDHIRDHKPRSKIVLSELDGT

Query:  IAK----VANKTSTSTRVFDTRLSNQERPSQELSMPRRSGRVITQPDRYMGLSETQVVIPDDDCEDPSTYSQAMVDKDKDKWVIAMDQKMESMYFNFVWD
          K    V  + S  TRV     S +    Q L  PR+SGRV   P RYM L+ET  VI D D EDP T+ +AM D DKD+W+ AM+ ++ESMYFN VWD
Subjt:  IAK----VANKTSTSTRVFDTRLSNQERPSQELSMPRRSGRVITQPDRYMGLSETQVVIPDDDCEDPSTYSQAMVDKDKDKWVIAMDQKMESMYFNFVWD

Query:  LVDKPDGVKPIGCKWIYKRKRGVDGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILLAIVAYYDYEVWQMDVKTNFLNGNLEENIYMGKPKGF
        L+D+PDGVKPIGCKWIYKRKRG DGKVQT KARLVAKGYTQVEGVDYEETFSPVAM+KSI ILL+I AY+DYE+WQMDVKT FLNGNLEE IYM +P+GF
Subjt:  LVDKPDGVKPIGCKWIYKRKRGVDGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILLAIVAYYDYEVWQMDVKTNFLNGNLEENIYMGKPKGF

Query:  IELGQKSK------RFAGLK----------------------------------------VLYVDDILLIGNEVGYLTDIKNWLATQFLMKDLGKAQFVL
        I  GQ+ K         GLK                                        VLYVDDILLIGN++G LTDIK WLATQF MKDLG+AQFVL
Subjt:  IELGQKSK------RFAGLK----------------------------------------VLYVDDILLIGNEVGYLTDIKNWLATQFLMKDLGKAQFVL

Query:  GIQIVRNRKNKTLALSQTSYIDKVLLRFKMQDSKKGLLPFRHRVHFSKEKCPKTPQGVEDMRRFPYASVVGSLMYAMLCTRPGICFAVVMVSRYQSNPGH
        GIQI R+RKNK LALSQ SYIDK+++++ MQ+SK+GLLPFRH V  SKE+CPKTPQ VE+MR  PYAS VGSLMYAMLCTRP IC+AV +VSRYQSNPG 
Subjt:  GIQIVRNRKNKTLALSQTSYIDKVLLRFKMQDSKKGLLPFRHRVHFSKEKCPKTPQGVEDMRRFPYASVVGSLMYAMLCTRPGICFAVVMVSRYQSNPGH

Query:  EHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSG--------------VKQGCIADSTMEAEYVAACEAAKEIVWLRKFMLNFE
         HWT VKTILKYLRR R+Y LVYG+KDL LTGYTDSDFQTD+DSRKST G              +KQGCIADSTMEAEYV ACEAAKE VWLR F+++ E
Subjt:  EHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSG--------------VKQGCIADSTMEAEYVAACEAAKEIVWLRKFMLNFE

Query:  VVPNMILFITLYCDNSGAVENSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKIAQEHNVLILLQR
        VVPNM   ITLYCDNSGAV NSREPRSHKRGKHIERKYHLIREIVHRGDVIVT+IA  HN+  L  +
Subjt:  VVPNMILFITLYCDNSGAVENSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKIAQEHNVLILLQR

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.8e-6024.88Show/hide
Query:  DMVQSMMSYAHLPSSFWGYTVETAVYILNNVPSKSV---SETPFKLWKGRKGYPKETR-------------GGLF-----------YDPKEDKVFVSTNA
        +  ++M+S A L  SFWG  V TA Y++N +PS+++   S+TP+++W  +K Y K  R              G F           Y+P   K++ + N 
Subjt:  DMVQSMMSYAHLPSSFWGYTVETAVYILNNVPSKSV---SETPFKLWKGRKGYPKETR-------------GGLF-----------YDPKEDKVFVSTNA

Query:  TFL--------------------EEDHIRDHKPRS---------KIVLSEL--------------DGTIAKVANKTSTSTRVFDTRLSNQERPSQELS--
         F+                    E   ++D K            KI+ +E               D   ++  N  + S ++  T   N+ +    +   
Subjt:  TFL--------------------EEDHIRDHKPRS---------KIVLSEL--------------DGTIAKVANKTSTSTRVFDTRLSNQERPSQELS--

Query:  ------------------------------------------------------------MPRRSGRVITQP-----DRYMGLSETQVVIPDDDCEDPST
                                                                    + RRS R+ T+P     +    L++  +       + P++
Subjt:  ------------------------------------------------------------MPRRSGRVITQP-----DRYMGLSETQVVIPDDDCEDPST

Query:  YSQAMVDKDKDKWVIAMDQKMESMYFNFVWDLVDKPDGVKPIGCKWIYKRKRGVDGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILLAIVAY
        + +     DK  W  A++ ++ +   N  W +  +P+    +  +W++  K    G     KARLVA+G+TQ   +DYEETF+PVA + S   +L++V  
Subjt:  YSQAMVDKDKDKWVIAMDQKMESMYFNFVWDLVDKPDGVKPIGCKWIYKRKRGVDGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILLAIVAY

Query:  YDYEVWQMDVKTNFLNGNLEENIYMGKPKGFI----ELGQKSKRFAGLK------------------------------------------VLYVDDILL
        Y+ +V QMDVKT FLNG L+E IYM  P+G       + + +K   GLK                                          +LYVDD+++
Subjt:  YDYEVWQMDVKTNFLNGNLEENIYMGKPKGFI----ELGQKSKRFAGLK------------------------------------------VLYVDDILL

Query:  IGNEVGYLTDIKNWLATQFLMKDLGKAQFVLGIQIVRNRKNKTLALSQTSYIDKVLLRFKMQDSKKGLLPFRHRVHF----SKEKCPKTPQGVEDMRRFP
           ++  + + K +L  +F M DL + +  +GI+I    +   + LSQ++Y+ K+L +F M++      P   ++++    S E C             P
Subjt:  IGNEVGYLTDIKNWLATQFLMKDLGKAQFVLGIQIVRNRKNKTLALSQTSYIDKVLLRFKMQDSKKGLLPFRHRVHF----SKEKCPKTPQGVEDMRRFP

Query:  YASVVGSLMYAMLCTRPGICFAVVMVSRYQSNPGHEHWTTVKTILKYLRRTRNYMLVYG---AKDLTLTGYTDSDFQTDKDSRKSTSGV-----------
          S++G LMY MLCTRP +  AV ++SRY S    E W  +K +L+YL+ T +  L++    A +  + GY DSD+   +  RKST+G            
Subjt:  YASVVGSLMYAMLCTRPGICFAVVMVSRYQSNPGHEHWTTVKTILKYLRRTRNYMLVYG---AKDLTLTGYTDSDFQTDKDSRKSTSGV-----------

Query:  ----KQGCIADSTMEAEYVAACEAAKEIVWLRKFMLNFEVVPNMILFITLYCDNSGAVENSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKIAQEHNV
            +Q  +A S+ EAEY+A  EA +E +WL+  + +  +   +   I +Y DN G +  +  P  HKR KHI+ KYH  RE V    + +  I  E+ +
Subjt:  ----KQGCIADSTMEAEYVAACEAAKEIVWLRKFMLNFEVVPNMILFITLYCDNSGAVENSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKIAQEHNV

P0CV72 Secreted RxLR effector protein 1612.4e-1739.85Show/hide
Query:  MRRFPYASVVGSLMYAMLCTRPGICFAVVMVSRYQSNPGHEHWTTVKTILKYLRRTRNYMLVY-GAKDLTLTGYTDSDFQTDKDSRKSTSGV--------
        M+  PY S VG++MY M+ TRP +  AV ++S++ S+P   HW  +K +L+YL+ T+ Y L +  A    L GY+D+D+  D +SR+STSG         
Subjt:  MRRFPYASVVGSLMYAMLCTRPGICFAVVMVSRYQSNPGHEHWTTVKTILKYLRRTRNYMLVY-GAKDLTLTGYTDSDFQTDKDSRKSTSGV--------

Query:  ------KQGCIADSTMEAEYVAACEAAKEIVWL
              KQ  +A S+ E EY+A  EA +E VWL
Subjt:  ------KQGCIADSTMEAEYVAACEAAKEIVWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.6e-10134.11Show/hide
Query:  MMDMVQSMMSYAHLPSSFWGYTVETAVYILNNVPSKSVS-ETPFKLWKGRK-----------------------------------GYPKETRGGLFYDP
        +++ V+SM+  A LP SFWG  V+TA Y++N  PS  ++ E P ++W  ++                                   GY  E  G   +DP
Subjt:  MMDMVQSMMSYAHLPSSFWGYTVETAVYILNNVPSKSVS-ETPFKLWKGRK-----------------------------------GYPKETRGGLFYDP

Query:  KEDKVFVSTNATFLEEDHIRDHKPRSKIVLSELDG---TIAKVANKTSTSTRVFDTRLSNQERPS-------------QELSMP----------RRSGRV
         + KV  S +  F  E  +R     S+ V + +     TI   +N  +++    D      E+P              +E+  P          RRS R 
Subjt:  KEDKVFVSTNATFLEEDHIRDHKPRSKIVLSELDG---TIAKVANKTSTSTRVFDTRLSNQERPS-------------QELSMP----------RRSGRV

Query:  ITQPDRYMGLSETQVVIPDDDCEDPSTYSQAMVDKDKDKWVIAMDQKMESMYFNFVWDLVDKPDGVKPIGCKWIYKRKRGVDGKVQTVKARLVAKGYTQV
          +  RY     T+ V+  DD  +P +  + +   +K++ + AM ++MES+  N  + LV+ P G +P+ CKW++K K+  D K+   KARLV KG+ Q 
Subjt:  ITQPDRYMGLSETQVVIPDDDCEDPSTYSQAMVDKDKDKWVIAMDQKMESMYFNFVWDLVDKPDGVKPIGCKWIYKRKRGVDGKVQTVKARLVAKGYTQV

Query:  EGVDYEETFSPVAMVKSICILLAIVAYYDYEVWQMDVKTNFLNGNLEENIYMGKPKGFIELGQK------SKRFAGLK----------------------
        +G+D++E FSPV  + SI  +L++ A  D EV Q+DVKT FL+G+LEE IYM +P+GF   G+K      +K   GLK                      
Subjt:  EGVDYEETFSPVAMVKSICILLAIVAYYDYEVWQMDVKTNFLNGNLEENIYMGKPKGFIELGQK------SKRFAGLK----------------------

Query:  -------------------VLYVDDILLIGNEVGYLTDIKNWLATQFLMKDLGKAQFVLGIQIVRNRKNKTLALSQTSYIDKVLLRFKMQDSKKGLLPFR
                           +LYVDD+L++G + G +  +K  L+  F MKDLG AQ +LG++IVR R ++ L LSQ  YI++VL RF M+++K    P  
Subjt:  -------------------VLYVDDILLIGNEVGYLTDIKNWLATQFLMKDLGKAQFVLGIQIVRNRKNKTLALSQTSYIDKVLLRFKMQDSKKGLLPFR

Query:  HRVHFSKEKCPKTPQGVEDMRRFPYASVVGSLMYAMLCTRPGICFAVVMVSRYQSNPGHEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTD
          +  SK+ CP T +   +M + PY+S VGSLMYAM+CTRP I  AV +VSR+  NPG EHW  VK IL+YLR T    L +G  D  L GYTD+D   D
Subjt:  HRVHFSKEKCPKTPQGVEDMRRFPYASVVGSLMYAMLCTRPGICFAVVMVSRYQSNPGHEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTD

Query:  KDSRKSTSGV--------------KQGCIADSTMEAEYVAACEAAKEIVWLRKFMLNFEVVPNMILFITLYCDNSGAVENSREPRSHKRGKHIERKYHLI
         D+RKS++G                Q C+A ST EAEY+AA E  KE++WL++F+    +     +   +YCD+  A++ S+    H R KHI+ +YH I
Subjt:  KDSRKSTSGV--------------KQGCIADSTMEAEYVAACEAAKEIVWLRKFMLNFEVVPNMILFITLYCDNSGAVENSREPRSHKRGKHIERKYHLI

Query:  REIVHRGDVIVTKIAQEHNVLILLQRL
        RE+V    + V KI+   N   +L ++
Subjt:  REIVHRGDVIVTKIAQEHNVLILLQRL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.4e-5130.47Show/hide
Query:  SNQERPSQELSMPRRSGRVITQPDRYMGLSETQVVIPDDDCEDPSTYSQAMVDKDKDKWVIAMDQKMESMYFNFVWDLVDKPDG-VKPIGCKWIYKRKRG
        +N + P    SM  R+   I +P+    L+ +          +P T  QA+ D   ++W  AM  ++ +   N  WDLV  P   V  +GC+WI+ +K  
Subjt:  SNQERPSQELSMPRRSGRVITQPDRYMGLSETQVVIPDDDCEDPSTYSQAMVDKDKDKWVIAMDQKMESMYFNFVWDLVDKPDG-VKPIGCKWIYKRKRG

Query:  VDGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILLAIVAYYDYEVWQMDVKTNFLNGNLEENIYMGKPKGFIELGQKS------KRFAGLK--
         DG +   KARLVAKGY Q  G+DY ETFSPV    SI I+L +     + + Q+DV   FL G L +++YM +P GFI+  + +      K   GLK  
Subjt:  VDGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILLAIVAYYDYEVWQMDVKTNFLNGNLEENIYMGKPKGFIELGQKS------KRFAGLK--

Query:  --------------------------------------VLYVDDILLIGNEVGYLTDIKNWLATQFLMKDLGKAQFVLGIQIVRNRKNKTLALSQTSYID
                                              ++YVDDIL+ GN+   L +  + L+ +F +KD  +  + LGI+    R    L LSQ  YI 
Subjt:  --------------------------------------VLYVDDILLIGNEVGYLTDIKNWLATQFLMKDLGKAQFVLGIQIVRNRKNKTLALSQTSYID

Query:  KVLLRFKMQDSKKGLLPFRHRVHFSKEKCPKTPQGVEDMRRFPYASVVGSLMYAMLCTRPGICFAVVMVSRYQSNPGHEHWTTVKTILKYLRRTRNY-ML
         +L R  M  +K    P       S     K     E      Y  +VGSL Y +  TRP I +AV  +S++   P  EH   +K IL+YL  T N+ + 
Subjt:  KVLLRFKMQDSKKGLLPFRHRVHFSKEKCPKTPQGVEDMRRFPYASVVGSLMYAMLCTRPGICFAVVMVSRYQSNPGHEHWTTVKTILKYLRRTRNY-ML

Query:  VYGAKDLTLTGYTDSDFQTDKDSRKSTSGV--------------KQGCIADSTMEAEYVAACEAAKEIVWLRKFMLNFEVVPNMILFITLYCDNSGAVEN
        +     L+L  Y+D+D+  DKD   ST+G               KQ  +  S+ EAEY +    + E+ W+   +    +   +     +YCDN GA   
Subjt:  VYGAKDLTLTGYTDSDFQTDKDSRKSTSGV--------------KQGCIADSTMEAEYVAACEAAKEIVWLRKFMLNFEVVPNMILFITLYCDNSGAVEN

Query:  SREPRSHKRGKHIERKYHLIREIVHRGDVIVTKIA
           P  H R KHI   YH IR  V  G + V  ++
Subjt:  SREPRSHKRGKHIERKYHLIREIVHRGDVIVTKIA

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.5e-5129.45Show/hide
Query:  NQERPSQELSMPRRSGRVITQPDRYMGLSETQVVIPDDDCEDPSTYSQAMVDKDKDKWVIAMDQKMESMYFNFVWDLV-DKPDGVKPIGCKWIYKRKRGV
        N + P    SM  R+   I +P++    + +          +P T  QAM D   D+W  AM  ++ +   N  WDLV   P  V  +GC+WI+ +K   
Subjt:  NQERPSQELSMPRRSGRVITQPDRYMGLSETQVVIPDDDCEDPSTYSQAMVDKDKDKWVIAMDQKMESMYFNFVWDLV-DKPDGVKPIGCKWIYKRKRGV

Query:  DGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILLAIVAYYDYEVWQMDVKTNFLNGNLEENIYMGKPKGFIELGQKS------KRFAGLK---
        DG +   KARLVAKGY Q  G+DY ETFSPV    SI I+L +     + + Q+DV   FL G L + +YM +P GF++  +        K   GLK   
Subjt:  DGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILLAIVAYYDYEVWQMDVKTNFLNGNLEENIYMGKPKGFIELGQKS------KRFAGLK---

Query:  -------------------------------------VLYVDDILLIGNEVGYLTDIKNWLATQFLMKDLGKAQFVLGIQIVRNRKNKTLALSQTSYIDK
                                             ++YVDDIL+ GN+   L    + L+ +F +K+     + LGI+    R  + L LSQ  Y   
Subjt:  -------------------------------------VLYVDDILLIGNEVGYLTDIKNWLATQFLMKDLGKAQFVLGIQIVRNRKNKTLALSQTSYIDK

Query:  VLLRFKMQDSKKGLLPFRHRVHFSKEKCPKTPQGVEDMRRFPYASVVGSLMYAMLCTRPGICFAVVMVSRYQSNPGHEHWTTVKTILKYLRRTRNY-MLV
        +L R  M  +K    P       +     K P   E      Y  +VGSL Y +  TRP + +AV  +S+Y   P  +HW  +K +L+YL  T ++ + +
Subjt:  VLLRFKMQDSKKGLLPFRHRVHFSKEKCPKTPQGVEDMRRFPYASVVGSLMYAMLCTRPGICFAVVMVSRYQSNPGHEHWTTVKTILKYLRRTRNY-MLV

Query:  YGAKDLTLTGYTDSDFQTDKDSRKSTSGV--------------KQGCIADSTMEAEYVAACEAAKEIVWLRKFMLNFEVVPNMILFITLYCDNSGAVENS
             L+L  Y+D+D+  D D   ST+G               KQ  +  S+ EAEY +    + E+ W+   +    +   +     +YCDN GA    
Subjt:  YGAKDLTLTGYTDSDFQTDKDSRKSTSGV--------------KQGCIADSTMEAEYVAACEAAKEIVWLRKFMLNFEVVPNMILFITLYCDNSGAVENS

Query:  REPRSHKRGKHIERKYHLIREIVHRGDVIVTKIAQEHNVL-ILLQRLSRL
          P  H R KHI   YH IR  V  G + V  ++    +   L + LSR+
Subjt:  REPRSHKRGKHIERKYHLIREIVHRGDVIVTKIAQEHNVL-ILLQRLSRL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.2e-5631.4Show/hide
Query:  EDPSTYSQAMVDKDKDKWVIAMDQKMESMYFNFVWDLVDKPDGVKPIGCKWIYKRKRGVDGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILL
        ++PSTY++A   K+   W  AMD ++ +M     W++   P   KPIGCKW+YK K   DG ++  KARLVAKGYTQ EG+D+ ETFSPV  + S+ ++L
Subjt:  EDPSTYSQAMVDKDKDKWVIAMDQKMESMYFNFVWDLVDKPDGVKPIGCKWIYKRKRGVDGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILL

Query:  AIVAYYDYEVWQMDVKTNFLNGNLEENIYMGKPKGFIELGQKS----------KRFAGLK----------------------------------------
        AI A Y++ + Q+D+   FLNG+L+E IYM  P G+      S          K   GLK                                        
Subjt:  AIVAYYDYEVWQMDVKTNFLNGNLEENIYMGKPKGFIELGQKS----------KRFAGLK----------------------------------------

Query:  VLYVDDILLIGNEVGYLTDIKNWLATQFLMKDLGKAQFVLGIQIVRNRKNKTLALSQTSYIDKVLLRFKMQDSKKGLLPFRHRVHFSKEKCPKTPQGVED
        ++YVDDI++  N    + ++K+ L + F ++DLG  ++ LG++I R+     + + Q  Y   +L    +   K   +P    V FS         G + 
Subjt:  VLYVDDILLIGNEVGYLTDIKNWLATQFLMKDLGKAQFVLGIQIVRNRKNKTLALSQTSYIDKVLLRFKMQDSKKGLLPFRHRVHFSKEKCPKTPQGVED

Query:  MRRFPYASVVGSLMYAMLCTRPGICFAVVMVSRYQSNPGHEHWTTVKTILKYLRRTRNYMLVYGAK-DLTLTGYTDSDFQTDKDSRKSTSGV--------
        +    Y  ++G LMY  + TR  I FAV  +S++   P   H   V  IL Y++ T    L Y ++ ++ L  ++D+ FQ+ KD+R+ST+G         
Subjt:  MRRFPYASVVGSLMYAMLCTRPGICFAVVMVSRYQSNPGHEHWTTVKTILKYLRRTRNYMLVYGAK-DLTLTGYTDSDFQTDKDSRKSTSGV--------

Query:  ------KQGCIADSTMEAEYVAACEAAKEIVWLRKFMLNFEVVPNMILFITLYCDNSGAVENSREPRSHKRGKHIERKYHLIRE
              KQ  ++ S+ EAEY A   A  E++WL +F    ++  +      L+CDN+ A+  +     H+R KHIE   H +RE
Subjt:  ------KQGCIADSTMEAEYVAACEAAKEIVWLRKFMLNFEVVPNMILFITLYCDNSGAVENSREPRSHKRGKHIERKYHLIRE

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)6.4e-1340Show/hide
Query:  WVIAMDQKMESMYFNFVWDLVDKPDGVKPIGCKWIYKRKRGVDGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILLAI
        W  AM ++++++  N  W LV  P     +GCKW++K K   DG +  +KARLVAKG+ Q EG+ + ET+SPV    +I  +L +
Subjt:  WVIAMDQKMESMYFNFVWDLVDKPDGVKPIGCKWIYKRKRGVDGKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILLAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGACATGGTTCAATCAATGATGAGTTATGCTCATCTCCCTAGTTCCTTTTGGGGTTACACAGTGGAGACTGCGGTATACATTTTGAACAATGTGCCATCAAAAAG
TGTTTCTGAAACACCTTTTAAACTTTGGAAAGGACGTAAAGGATATCCAAAAGAAACAAGAGGTGGTTTATTCTATGATCCTAAGGAAGATAAGGTTTTTGTGTCGACAA
ATGCCACTTTCTTAGAGGAGGACCACATAAGGGACCACAAACCAAGAAGTAAAATTGTGTTGAGTGAGTTAGATGGAACGATAGCAAAGGTTGCTAATAAAACTAGTACG
TCAACAAGAGTTTTTGATACTAGGTTGTCTAATCAAGAGAGACCATCTCAAGAGTTGAGTATGCCTCGACGTAGTGGGAGGGTTATAACACAACCTGACCGTTACATGGG
TTTATCTGAAACTCAAGTTGTTATACCAGATGACGACTGTGAGGATCCATCGACTTATAGTCAAGCGATGGTTGACAAAGACAAAGACAAATGGGTCATAGCCATGGACC
AAAAAATGGAGTCTATGTACTTCAATTTTGTTTGGGATCTTGTAGATAAACCTGATGGGGTAAAACCTATAGGTTGTAAGTGGATCTACAAGAGGAAGCGTGGTGTAGAT
GGAAAGGTGCAAACCGTTAAAGCTAGACTAGTAGCAAAGGGTTATACCCAGGTTGAAGGGGTTGACTATGAGGAAACCTTTTCACCTGTTGCTATGGTAAAGTCTATCTG
TATCCTACTTGCCATTGTCGCATATTATGACTATGAGGTATGGCAAATGGATGTCAAGACAAACTTTTTGAATGGCAATCTTGAGGAAAATATCTACATGGGAAAACCCA
AAGGGTTCATTGAACTAGGACAGAAGAGCAAAAGGTTTGCAGGCTTAAAAGTTCTATATGTGGATGATATCTTACTCATTGGGAATGAGGTAGGGTATCTTACTGACATT
AAGAATTGGCTAGCTACGCAATTCCTAATGAAAGATTTGGGTAAAGCGCAGTTTGTTCTTGGGATCCAGATTGTTCGGAACCGCAAGAATAAAACACTAGCCCTGTCTCA
GACATCTTACATCGACAAAGTGTTGTTGAGATTTAAGATGCAAGACTCCAAAAAAGGTTTATTGCCTTTTAGACATAGAGTTCATTTTTCTAAGGAAAAATGTCCTAAGA
CACCTCAAGGAGTTGAGGATATGAGACGGTTTCCTTATGCATCTGTTGTTGGGAGCCTGATGTACGCCATGTTGTGTACTAGGCCTGGCATCTGTTTTGCAGTTGTTATG
GTCAGTAGGTATCAATCCAATCCAGGACATGAACACTGGACAACGGTTAAAACAATCCTTAAGTATCTACGGAGAACAAGGAACTACATGCTTGTGTATGGGGCTAAGGA
TCTGACCCTTACAGGATACACGGATTCTGACTTTCAGACTGATAAAGATTCTCGGAAATCTACATCAGGCGTCAAGCAAGGATGCATCGCTGATTCCACCATGGAGGCCG
AATATGTAGCAGCTTGTGAAGCAGCTAAAGAGATCGTTTGGTTAAGGAAATTCATGCTAAATTTTGAAGTTGTTCCAAATATGATTTTGTTCATCACGCTGTATTGCGAC
AACAGTGGTGCAGTGGAAAATTCGAGGGAACCCCGAAGTCACAAGAGGGGAAAGCACATTGAGCGGAAATATCATCTCATCCGGGAGATCGTACATCGAGGAGACGTGAT
TGTCACGAAGATCGCCCAAGAGCACAACGTGCTGATCCTTTTACAAAGGCTCTCACGGCTAAAGTGTTTGAGAGTCACCTAG
mRNA sequenceShow/hide mRNA sequence
ATGATGGACATGGTTCAATCAATGATGAGTTATGCTCATCTCCCTAGTTCCTTTTGGGGTTACACAGTGGAGACTGCGGTATACATTTTGAACAATGTGCCATCAAAAAG
TGTTTCTGAAACACCTTTTAAACTTTGGAAAGGACGTAAAGGATATCCAAAAGAAACAAGAGGTGGTTTATTCTATGATCCTAAGGAAGATAAGGTTTTTGTGTCGACAA
ATGCCACTTTCTTAGAGGAGGACCACATAAGGGACCACAAACCAAGAAGTAAAATTGTGTTGAGTGAGTTAGATGGAACGATAGCAAAGGTTGCTAATAAAACTAGTACG
TCAACAAGAGTTTTTGATACTAGGTTGTCTAATCAAGAGAGACCATCTCAAGAGTTGAGTATGCCTCGACGTAGTGGGAGGGTTATAACACAACCTGACCGTTACATGGG
TTTATCTGAAACTCAAGTTGTTATACCAGATGACGACTGTGAGGATCCATCGACTTATAGTCAAGCGATGGTTGACAAAGACAAAGACAAATGGGTCATAGCCATGGACC
AAAAAATGGAGTCTATGTACTTCAATTTTGTTTGGGATCTTGTAGATAAACCTGATGGGGTAAAACCTATAGGTTGTAAGTGGATCTACAAGAGGAAGCGTGGTGTAGAT
GGAAAGGTGCAAACCGTTAAAGCTAGACTAGTAGCAAAGGGTTATACCCAGGTTGAAGGGGTTGACTATGAGGAAACCTTTTCACCTGTTGCTATGGTAAAGTCTATCTG
TATCCTACTTGCCATTGTCGCATATTATGACTATGAGGTATGGCAAATGGATGTCAAGACAAACTTTTTGAATGGCAATCTTGAGGAAAATATCTACATGGGAAAACCCA
AAGGGTTCATTGAACTAGGACAGAAGAGCAAAAGGTTTGCAGGCTTAAAAGTTCTATATGTGGATGATATCTTACTCATTGGGAATGAGGTAGGGTATCTTACTGACATT
AAGAATTGGCTAGCTACGCAATTCCTAATGAAAGATTTGGGTAAAGCGCAGTTTGTTCTTGGGATCCAGATTGTTCGGAACCGCAAGAATAAAACACTAGCCCTGTCTCA
GACATCTTACATCGACAAAGTGTTGTTGAGATTTAAGATGCAAGACTCCAAAAAAGGTTTATTGCCTTTTAGACATAGAGTTCATTTTTCTAAGGAAAAATGTCCTAAGA
CACCTCAAGGAGTTGAGGATATGAGACGGTTTCCTTATGCATCTGTTGTTGGGAGCCTGATGTACGCCATGTTGTGTACTAGGCCTGGCATCTGTTTTGCAGTTGTTATG
GTCAGTAGGTATCAATCCAATCCAGGACATGAACACTGGACAACGGTTAAAACAATCCTTAAGTATCTACGGAGAACAAGGAACTACATGCTTGTGTATGGGGCTAAGGA
TCTGACCCTTACAGGATACACGGATTCTGACTTTCAGACTGATAAAGATTCTCGGAAATCTACATCAGGCGTCAAGCAAGGATGCATCGCTGATTCCACCATGGAGGCCG
AATATGTAGCAGCTTGTGAAGCAGCTAAAGAGATCGTTTGGTTAAGGAAATTCATGCTAAATTTTGAAGTTGTTCCAAATATGATTTTGTTCATCACGCTGTATTGCGAC
AACAGTGGTGCAGTGGAAAATTCGAGGGAACCCCGAAGTCACAAGAGGGGAAAGCACATTGAGCGGAAATATCATCTCATCCGGGAGATCGTACATCGAGGAGACGTGAT
TGTCACGAAGATCGCCCAAGAGCACAACGTGCTGATCCTTTTACAAAGGCTCTCACGGCTAAAGTGTTTGAGAGTCACCTAG
Protein sequenceShow/hide protein sequence
MMDMVQSMMSYAHLPSSFWGYTVETAVYILNNVPSKSVSETPFKLWKGRKGYPKETRGGLFYDPKEDKVFVSTNATFLEEDHIRDHKPRSKIVLSELDGTIAKVANKTST
STRVFDTRLSNQERPSQELSMPRRSGRVITQPDRYMGLSETQVVIPDDDCEDPSTYSQAMVDKDKDKWVIAMDQKMESMYFNFVWDLVDKPDGVKPIGCKWIYKRKRGVD
GKVQTVKARLVAKGYTQVEGVDYEETFSPVAMVKSICILLAIVAYYDYEVWQMDVKTNFLNGNLEENIYMGKPKGFIELGQKSKRFAGLKVLYVDDILLIGNEVGYLTDI
KNWLATQFLMKDLGKAQFVLGIQIVRNRKNKTLALSQTSYIDKVLLRFKMQDSKKGLLPFRHRVHFSKEKCPKTPQGVEDMRRFPYASVVGSLMYAMLCTRPGICFAVVM
VSRYQSNPGHEHWTTVKTILKYLRRTRNYMLVYGAKDLTLTGYTDSDFQTDKDSRKSTSGVKQGCIADSTMEAEYVAACEAAKEIVWLRKFMLNFEVVPNMILFITLYCD
NSGAVENSREPRSHKRGKHIERKYHLIREIVHRGDVIVTKIAQEHNVLILLQRLSRLKCLRVT