; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039126 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039126
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr2:36482116..36493648
RNA-Seq ExpressionLag0039126
SyntenyLag0039126
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR005162 - Retrotransposon gag domain
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0081.61Show/hide
Query:  MDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSVSETPFELWKGRKASLRHFRIWGCPAY
        MDL FQDY+IEHGI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP+SF GYAVETAV+ILN VPSKSVSETPFELW+GRK SL HFRIWGCPA+
Subjt:  MDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSVSETPFELWKGRKASLRHFRIWGCPAY

Query:  VLEANPKKLEPRSRFCLFIGYPKETRGGYLFDPEENKVLVSTNATFLEEDHLRDHKPRSKVVLSELSNEATETSTRV-----VDDTGTSSQSHPSQEIRE
        VL  NPKKLEPRSR C F+GYPKETRGG  FDP+EN+V VSTNATFLEEDH+R+HKPRSK+VLSE ++E+T     V     VD+T TS QSHPSQ +R 
Subjt:  VLEANPKKLEPRSRFCLFIGYPKETRGGYLFDPEENKVLVSTNATFLEEDHLRDHKPRSKVVLSELSNEATETSTRV-----VDDTGTSSQSHPSQEIRE

Query:  PRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNEWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLV
        PRRSGRVV QP+RYLGLTET VVIPDDG+EDPL++KQAM+DVDK++W+KAMDLE+ESM+FNSVW+LVD PEGV+PIGCKWIYKRKRD+AGKVQTFKARLV
Subjt:  PRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNEWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLV

Query:  AKGFTQREGIDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQDQEQKVCKLKRSIYGLKQASRSWNMRFDTAIK
        AKG+TQREG+DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESI+MSQPEGFI Q QEQKVCKL RSIYGLKQASRSWN+RFDTAIK
Subjt:  AKGFTQREGIDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQDQEQKVCKLKRSIYGLKQASRSWNMRFDTAIK

Query:  SYGFEQNVDEPCVYKRIINSSIAFLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTLALSQASYIDKLLVRYKMQNSKK
        SYGF+QNVDEPCVYK+I    +AFLVLYVDDILLIGNDVGYLTD+K WLA QFQMKDLGEAQ+VLGIQI+R+RKNKTLALSQA+YIDKLLVRY MQNSKK
Subjt:  SYGFEQNVDEPCVYKRIINSSIAFLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTLALSQASYIDKLLVRYKMQNSKK

Query:  GLLPFRHGINLSKEQCPQTPQEVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDHWTAV-----------------------------
        GLLPFRHG++LSKEQ P+TPQEVEDMRR PYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG DHWTAV                             
Subjt:  GLLPFRHGINLSKEQCPQTPQEVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDHWTAV-----------------------------

Query:  ------KNSRKSTSGSVFTLNGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPRSHKRGKHIE
              K+SRKSTSGSVFTLNGGA+VWR+IKQGCIADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNM+LPITLYCDNSGAVANSKEPRSHKRGKHIE
Subjt:  ------KNSRKSTSGSVFTLNGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPRSHKRGKHIE

Query:  RKYHLIREIVHRGDVIVTKIASEHNIADPFTKALTAKVFESHLEGLALVTREFYTK
        RKYHLIREIV RGDVIVTKIASEHNIADPFTK LTAKVFE HLE L L  R+ Y +
Subjt:  RKYHLIREIVHRGDVIVTKIASEHNIADPFTKALTAKVFESHLEGLALVTREFYTK

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0078.72Show/hide
Query:  MDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSVSETPFELWKGRKASLRHFRIWGCPAY
        MDL FQ+YL+E GI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA LPNSF GYAV+TAVYILN VPSKSVSETP +LW GRK SLRHFRIWGCPA+
Subjt:  MDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSVSETPFELWKGRKASLRHFRIWGCPAY

Query:  VLEANPKKLEPRSRFCLFIGYPKETRGGYLFDPEENKVLVSTNATFLEEDHLRDHKPRSKVVLSELSNEATETSTRVVDD---------TGTSSQSHPSQ
        VLE NPKKLEPRS+ CLF+GYPK TRGGY +DP++NKV VSTNATFLEEDH+R+HKPRSK+VL+ELS E TE STRVV++          G+S+++H  Q
Subjt:  VLEANPKKLEPRSRFCLFIGYPKETRGGYLFDPEENKVLVSTNATFLEEDHLRDHKPRSKVVLSELSNEATETSTRVVDD---------TGTSSQSHPSQ

Query:  EIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNEWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFK
         +REPRRSGRV   P RY+ LTET  VI D  IEDPLTFK+AM+DVDK+EWIKAM+LE+ESM+FNSVWDLVDQP+GV+PIGCKWIYKRKR A GKVQTFK
Subjt:  EIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNEWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFK

Query:  ARLVAKGFTQREGIDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQDQEQKVCKLKRSIYGLKQASRSWNMRFD
        ARLVAKG+TQ EG+DYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL RSIYGLKQASRSWN+RFD
Subjt:  ARLVAKGFTQREGIDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQDQEQKVCKLKRSIYGLKQASRSWNMRFD

Query:  TAIKSYGFEQNVDEPCVYKRIINSSIAFLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTLALSQASYIDKLLVRYKMQ
        TAIKSYGF+Q VDEPCVYKRIIN S+AFLVLYVDDILLIGND+G LTDIK WLATQFQMKDLGEAQFVLGIQI R+RKNK LALSQASYIDK++V+Y MQ
Subjt:  TAIKSYGFEQNVDEPCVYKRIINSSIAFLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTLALSQASYIDKLLVRYKMQ

Query:  NSKKGLLPFRHGINLSKEQCPQTPQEVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDHWTAVK------------------------
        NSK+GLLPFRHG+ LSKEQCP+TPQ+VE+MR  PYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG  HWTAVK                        
Subjt:  NSKKGLLPFRHGINLSKEQCPQTPQEVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDHWTAVK------------------------

Query:  -----------NSRKSTSGSVFTLNGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPRSHKRG
                   +SRKSTSGSVFTLNGGA+VWR+IKQGCIADSTMEAEYVAACEAAKEAVWLR FL DLEVVPNM  PITLYCDNSGAVANS+EPRSHKRG
Subjt:  -----------NSRKSTSGSVFTLNGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPRSHKRG

Query:  KHIERKYHLIREIVHRGDVIVTKIASEHNIADPFTKALTAKVFESHLEGLAL
        KHIERKYHLIREIVHRGDVIVT+IAS HN+ADPFTK LTAKVFE HLE L L
Subjt:  KHIERKYHLIREIVHRGDVIVTKIASEHNIADPFTKALTAKVFESHLEGLAL

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0080.69Show/hide
Query:  MDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSVSETPFELWKGRKASLRHFRIWGCPAY
        MDL FQDY+IEHGI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP+SF GYAVETAV+ILN VPSKSVSETPFELW+GRK SL HFRIWGCPA+
Subjt:  MDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSVSETPFELWKGRKASLRHFRIWGCPAY

Query:  VLEANPKKLEPRSRFCLFIGYPKETRGGYLFDPEENKVLVSTNATFLEEDHLRDHKPRSKVVLSELSNEATETSTRV-----VDDTGTSSQSHPSQEIRE
        VL  NPKKLEPRSR C F+GYPKETRGG  FDP+EN+V VSTNATFLEEDH+R+HKPRSK+VLSE ++E+T     V     VD+T TS QSHPSQ +R 
Subjt:  VLEANPKKLEPRSRFCLFIGYPKETRGGYLFDPEENKVLVSTNATFLEEDHLRDHKPRSKVVLSELSNEATETSTRV-----VDDTGTSSQSHPSQEIRE

Query:  PRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNEWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLV
        PRRSGRVV QP+RYLGLTET VVIPDDG+EDPL++KQAM+DVDK++W+KAMDLE+ESM+FNSVW+LVD PEGV+PIGCKWIYKRKRD+AGKVQTFKARLV
Subjt:  PRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNEWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLV

Query:  AKGFTQREGIDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQDQEQKVCKLKRSIYGLKQASRSWNMRFDTAIK
        AKG+T++EG+DYEETFS VAMLKSIRILLSIA FYDYEIWQMDVKTAFLNGNLEESI+MSQPEGFI Q QEQKVCKL RSIYGLKQASRSWN+RFDTAIK
Subjt:  AKGFTQREGIDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQDQEQKVCKLKRSIYGLKQASRSWNMRFDTAIK

Query:  SYGFEQNVDEPCVYKRIINSSIAFLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTLALSQASYIDKLLVRYKMQNSKK
        SYGF+QNVDEPCVYK+I    +AFLVLYVDDILLIGNDVGYLTD+K WLA QFQMKDLGE Q+VLGIQI+R+RKNKTLALSQA+YIDKLLVRY MQNSKK
Subjt:  SYGFEQNVDEPCVYKRIINSSIAFLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTLALSQASYIDKLLVRYKMQNSKK

Query:  GLLPFRHGINLSKEQCPQTPQEVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDHWTAV-----------------------------
        GLLPFRHG++LSKEQ P+TPQEVEDMRR PYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG DHWTAV                             
Subjt:  GLLPFRHGINLSKEQCPQTPQEVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDHWTAV-----------------------------

Query:  ------KNSRKSTSGSVFTLNGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPRSHKRGKHIE
              K+SRKSTS SVFTLNGGA+VWR+IKQGCIADSTMEAEYVAACEAAKEAVWL+KFL DLEVVPNM+LPITLYCDNSGAVANSKEPRSHKRGKHIE
Subjt:  ------KNSRKSTSGSVFTLNGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPRSHKRGKHIE

Query:  RKYHLIREIVHRGDVIVTKIASEHNIADPFTKALTAKVFESHLEGLALVTREFYTK
        RKYHLIREIV RGDVIVTKIASEHNIADPFTK LTAKVFE HLE L L  R+ Y +
Subjt:  RKYHLIREIVHRGDVIVTKIASEHNIADPFTKALTAKVFESHLEGLALVTREFYTK

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0078.72Show/hide
Query:  MDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSVSETPFELWKGRKASLRHFRIWGCPAY
        MDL FQ+YL+E GI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA LPNSF GYAV+TAVYILN VPSKSVSETP +LW GRK SLRHFRIWGCPA+
Subjt:  MDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSVSETPFELWKGRKASLRHFRIWGCPAY

Query:  VLEANPKKLEPRSRFCLFIGYPKETRGGYLFDPEENKVLVSTNATFLEEDHLRDHKPRSKVVLSELSNEATETSTRVVDD---------TGTSSQSHPSQ
        VLE NPKKLEPRS+ CLF+GYPK TRGGY +DP++NKV VSTNATFLEEDH+R+HKPRSK+VL+ELS E TE STRVV++          G+S+++H  Q
Subjt:  VLEANPKKLEPRSRFCLFIGYPKETRGGYLFDPEENKVLVSTNATFLEEDHLRDHKPRSKVVLSELSNEATETSTRVVDD---------TGTSSQSHPSQ

Query:  EIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNEWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFK
         +REPRRSGRV   P RY+ LTET  VI D  IEDPLTFK+AM+DVDK+EWIKAM+LE+ESM+FNSVWDLVDQP+GV+PIGCKWIYKRKR A GKVQTFK
Subjt:  EIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNEWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFK

Query:  ARLVAKGFTQREGIDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQDQEQKVCKLKRSIYGLKQASRSWNMRFD
        ARLVAKG+TQ EG+DYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL RSIYGLKQASRSWN+RFD
Subjt:  ARLVAKGFTQREGIDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQDQEQKVCKLKRSIYGLKQASRSWNMRFD

Query:  TAIKSYGFEQNVDEPCVYKRIINSSIAFLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTLALSQASYIDKLLVRYKMQ
        TAIKSYGF+Q VDEPCVYKRIIN S+AFLVLYVDDILLIGND+G LTDIK WLATQFQMKDLGEAQFVLGIQI R+RKNK LALSQASYIDK++V+Y MQ
Subjt:  TAIKSYGFEQNVDEPCVYKRIINSSIAFLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTLALSQASYIDKLLVRYKMQ

Query:  NSKKGLLPFRHGINLSKEQCPQTPQEVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDHWTAVK------------------------
        NSK+GLLPFRHG+ LSKEQCP+TPQ+VE+MR  PYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG  HWTAVK                        
Subjt:  NSKKGLLPFRHGINLSKEQCPQTPQEVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDHWTAVK------------------------

Query:  -----------NSRKSTSGSVFTLNGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPRSHKRG
                   +SRKSTSGSVFTLNGGA+VWR+IKQGCIADSTMEAEYVAACEAAKEAVWLR FL DLEVVPNM  PITLYCDNSGAVANS+EPRSHKRG
Subjt:  -----------NSRKSTSGSVFTLNGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPRSHKRG

Query:  KHIERKYHLIREIVHRGDVIVTKIASEHNIADPFTKALTAKVFESHLEGLAL
        KHIERKYHLIREIVHRGDVIVT+IAS HN+ADPFTK LTAKVFE HLE L L
Subjt:  KHIERKYHLIREIVHRGDVIVTKIASEHNIADPFTKALTAKVFESHLEGLAL

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0081.42Show/hide
Query:  MDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSVSETPFELWKGRKASLRHFRIWGCPAY
        MDL FQDY+IEHGI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP+SF GYAVETAV+ILN VPSKSVSETPFELW+GRK SL HFRIWGCPA+
Subjt:  MDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSVSETPFELWKGRKASLRHFRIWGCPAY

Query:  VLEANPKKLEPRSRFCLFIGYPKETRGGYLFDPEENKVLVSTNATFLEEDHLRDHKPRSKVVLSELSNEATETSTRV-----VDDTGTSSQSHPSQEIRE
        VL  NPKKLEPRSR C F+GYPKETRGG  FDP+EN+V VSTNATFLEEDH+R+HKPRSK+VLSE ++E+T     V     VD+T TS QSHPSQ +R 
Subjt:  VLEANPKKLEPRSRFCLFIGYPKETRGGYLFDPEENKVLVSTNATFLEEDHLRDHKPRSKVVLSELSNEATETSTRV-----VDDTGTSSQSHPSQEIRE

Query:  PRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNEWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLV
        PRRSGRVV QP+RYLGLTET VVIPDDG+EDPL++KQAM+DVDK++W+KAMDLE+ESM+FNSVW+LVD PEGV+PIGCKWIYKRKRD+AGKVQTFKARLV
Subjt:  PRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNEWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLV

Query:  AKGFTQREGIDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQDQEQKVCKLKRSIYGLKQASRSWNMRFDTAIK
        AKG+TQREG+DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESI+MSQPEGFI Q QEQKVCKL RSIYGLKQASRSWN+RFDTAIK
Subjt:  AKGFTQREGIDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQDQEQKVCKLKRSIYGLKQASRSWNMRFDTAIK

Query:  SYGFEQNVDEPCVYKRIINSSIAFLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTLALSQASYIDKLLVRYKMQNSKK
        SYGF+QNVDEPCVYK+I    +AFLVLYVDDILLIGNDVGYLTD+K WLA QFQMKDLGEAQ+VLGIQI+R+RKNKTLALSQA+YIDKLLVRY MQNSKK
Subjt:  SYGFEQNVDEPCVYKRIINSSIAFLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTLALSQASYIDKLLVRYKMQNSKK

Query:  GLLPFRHGINLSKEQCPQTPQEVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDHWTAV-----------------------------
        GLLPFRHG++LSKEQ P+TPQEVEDMRR PYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG DHWTAV                             
Subjt:  GLLPFRHGINLSKEQCPQTPQEVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDHWTAV-----------------------------

Query:  ------KNSRKSTSGSVFTLNGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPRSHKRGKHIE
              K+SRKSTSGSVFTLNGGA+VWR+IKQGCIADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNM+LPITLYCDNSGAVANSKEPRSHKRGKHIE
Subjt:  ------KNSRKSTSGSVFTLNGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPRSHKRGKHIE

Query:  RKYHLIREIVHRGDVIVTKIASEHNIADPFTKALTAKVFESHLEGLALVTREFYTKRST
        RKYHLIREIV RGDVIVTKIASEHNIADPFTK LTAKVFE HLE L L  R+ Y +  T
Subjt:  RKYHLIREIVHRGDVIVTKIASEHNIADPFTKALTAKVFESHLEGLALVTREFYTKRST

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein0.0e+0078.72Show/hide
Query:  MDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSVSETPFELWKGRKASLRHFRIWGCPAY
        MDL FQ+YL+E GI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA LPNSF GYAV+TAVYILN VPSKSVSETP +LW GRK SLRHFRIWGCPA+
Subjt:  MDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSVSETPFELWKGRKASLRHFRIWGCPAY

Query:  VLEANPKKLEPRSRFCLFIGYPKETRGGYLFDPEENKVLVSTNATFLEEDHLRDHKPRSKVVLSELSNEATETSTRVVDD---------TGTSSQSHPSQ
        VLE NPKKLEPRS+ CLF+GYPK TRGGY +DP++NKV VSTNATFLEEDH+R+HKPRSK+VL+ELS E TE STRVV++          G+S+++H  Q
Subjt:  VLEANPKKLEPRSRFCLFIGYPKETRGGYLFDPEENKVLVSTNATFLEEDHLRDHKPRSKVVLSELSNEATETSTRVVDD---------TGTSSQSHPSQ

Query:  EIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNEWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFK
         +REPRRSGRV   P RY+ LTET  VI D  IEDPLTFK+AM+DVDK+EWIKAM+LE+ESM+FNSVWDLVDQP+GV+PIGCKWIYKRKR A GKVQTFK
Subjt:  EIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNEWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFK

Query:  ARLVAKGFTQREGIDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQDQEQKVCKLKRSIYGLKQASRSWNMRFD
        ARLVAKG+TQ EG+DYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL RSIYGLKQASRSWN+RFD
Subjt:  ARLVAKGFTQREGIDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQDQEQKVCKLKRSIYGLKQASRSWNMRFD

Query:  TAIKSYGFEQNVDEPCVYKRIINSSIAFLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTLALSQASYIDKLLVRYKMQ
        TAIKSYGF+Q VDEPCVYKRIIN S+AFLVLYVDDILLIGND+G LTDIK WLATQFQMKDLGEAQFVLGIQI R+RKNK LALSQASYIDK++V+Y MQ
Subjt:  TAIKSYGFEQNVDEPCVYKRIINSSIAFLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTLALSQASYIDKLLVRYKMQ

Query:  NSKKGLLPFRHGINLSKEQCPQTPQEVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDHWTAVK------------------------
        NSK+GLLPFRHG+ LSKEQCP+TPQ+VE+MR  PYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG  HWTAVK                        
Subjt:  NSKKGLLPFRHGINLSKEQCPQTPQEVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDHWTAVK------------------------

Query:  -----------NSRKSTSGSVFTLNGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPRSHKRG
                   +SRKSTSGSVFTLNGGA+VWR+IKQGCIADSTMEAEYVAACEAAKEAVWLR FL DLEVVPNM  PITLYCDNSGAVANS+EPRSHKRG
Subjt:  -----------NSRKSTSGSVFTLNGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPRSHKRG

Query:  KHIERKYHLIREIVHRGDVIVTKIASEHNIADPFTKALTAKVFESHLEGLAL
        KHIERKYHLIREIVHRGDVIVT+IAS HN+ADPFTK LTAKVFE HLE L L
Subjt:  KHIERKYHLIREIVHRGDVIVTKIASEHNIADPFTKALTAKVFESHLEGLAL

A0A5A7T2V9 Gag/pol protein0.0e+0080.69Show/hide
Query:  MDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSVSETPFELWKGRKASLRHFRIWGCPAY
        MDL FQDY+IEHGI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP+SF GYAVETAV+ILN VPSKSVSETPFELW+GRK SL HFRIWGCPA+
Subjt:  MDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSVSETPFELWKGRKASLRHFRIWGCPAY

Query:  VLEANPKKLEPRSRFCLFIGYPKETRGGYLFDPEENKVLVSTNATFLEEDHLRDHKPRSKVVLSELSNEATETSTRV-----VDDTGTSSQSHPSQEIRE
        VL  NPKKLEPRSR C F+GYPKETRGG  FDP+EN+V VSTNATFLEEDH+R+HKPRSK+VLSE ++E+T     V     VD+T TS QSHPSQ +R 
Subjt:  VLEANPKKLEPRSRFCLFIGYPKETRGGYLFDPEENKVLVSTNATFLEEDHLRDHKPRSKVVLSELSNEATETSTRV-----VDDTGTSSQSHPSQEIRE

Query:  PRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNEWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLV
        PRRSGRVV QP+RYLGLTET VVIPDDG+EDPL++KQAM+DVDK++W+KAMDLE+ESM+FNSVW+LVD PEGV+PIGCKWIYKRKRD+AGKVQTFKARLV
Subjt:  PRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNEWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLV

Query:  AKGFTQREGIDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQDQEQKVCKLKRSIYGLKQASRSWNMRFDTAIK
        AKG+T++EG+DYEETFS VAMLKSIRILLSIA FYDYEIWQMDVKTAFLNGNLEESI+MSQPEGFI Q QEQKVCKL RSIYGLKQASRSWN+RFDTAIK
Subjt:  AKGFTQREGIDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQDQEQKVCKLKRSIYGLKQASRSWNMRFDTAIK

Query:  SYGFEQNVDEPCVYKRIINSSIAFLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTLALSQASYIDKLLVRYKMQNSKK
        SYGF+QNVDEPCVYK+I    +AFLVLYVDDILLIGNDVGYLTD+K WLA QFQMKDLGE Q+VLGIQI+R+RKNKTLALSQA+YIDKLLVRY MQNSKK
Subjt:  SYGFEQNVDEPCVYKRIINSSIAFLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTLALSQASYIDKLLVRYKMQNSKK

Query:  GLLPFRHGINLSKEQCPQTPQEVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDHWTAV-----------------------------
        GLLPFRHG++LSKEQ P+TPQEVEDMRR PYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG DHWTAV                             
Subjt:  GLLPFRHGINLSKEQCPQTPQEVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDHWTAV-----------------------------

Query:  ------KNSRKSTSGSVFTLNGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPRSHKRGKHIE
              K+SRKSTS SVFTLNGGA+VWR+IKQGCIADSTMEAEYVAACEAAKEAVWL+KFL DLEVVPNM+LPITLYCDNSGAVANSKEPRSHKRGKHIE
Subjt:  ------KNSRKSTSGSVFTLNGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPRSHKRGKHIE

Query:  RKYHLIREIVHRGDVIVTKIASEHNIADPFTKALTAKVFESHLEGLALVTREFYTK
        RKYHLIREIV RGDVIVTKIASEHNIADPFTK LTAKVFE HLE L L  R+ Y +
Subjt:  RKYHLIREIVHRGDVIVTKIASEHNIADPFTKALTAKVFESHLEGLALVTREFYTK

A0A5A7TZD0 Gag/pol protein0.0e+0081.61Show/hide
Query:  MDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSVSETPFELWKGRKASLRHFRIWGCPAY
        MDL FQDY+IEHGI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP+SF GYAVETAV+ILN VPSKSVSETPFELW+GRK SL HFRIWGCPA+
Subjt:  MDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSVSETPFELWKGRKASLRHFRIWGCPAY

Query:  VLEANPKKLEPRSRFCLFIGYPKETRGGYLFDPEENKVLVSTNATFLEEDHLRDHKPRSKVVLSELSNEATETSTRV-----VDDTGTSSQSHPSQEIRE
        VL  NPKKLEPRSR C F+GYPKETRGG  FDP+EN+V VSTNATFLEEDH+R+HKPRSK+VLSE ++E+T     V     VD+T TS QSHPSQ +R 
Subjt:  VLEANPKKLEPRSRFCLFIGYPKETRGGYLFDPEENKVLVSTNATFLEEDHLRDHKPRSKVVLSELSNEATETSTRV-----VDDTGTSSQSHPSQEIRE

Query:  PRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNEWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLV
        PRRSGRVV QP+RYLGLTET VVIPDDG+EDPL++KQAM+DVDK++W+KAMDLE+ESM+FNSVW+LVD PEGV+PIGCKWIYKRKRD+AGKVQTFKARLV
Subjt:  PRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNEWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLV

Query:  AKGFTQREGIDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQDQEQKVCKLKRSIYGLKQASRSWNMRFDTAIK
        AKG+TQREG+DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESI+MSQPEGFI Q QEQKVCKL RSIYGLKQASRSWN+RFDTAIK
Subjt:  AKGFTQREGIDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQDQEQKVCKLKRSIYGLKQASRSWNMRFDTAIK

Query:  SYGFEQNVDEPCVYKRIINSSIAFLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTLALSQASYIDKLLVRYKMQNSKK
        SYGF+QNVDEPCVYK+I    +AFLVLYVDDILLIGNDVGYLTD+K WLA QFQMKDLGEAQ+VLGIQI+R+RKNKTLALSQA+YIDKLLVRY MQNSKK
Subjt:  SYGFEQNVDEPCVYKRIINSSIAFLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTLALSQASYIDKLLVRYKMQNSKK

Query:  GLLPFRHGINLSKEQCPQTPQEVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDHWTAV-----------------------------
        GLLPFRHG++LSKEQ P+TPQEVEDMRR PYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG DHWTAV                             
Subjt:  GLLPFRHGINLSKEQCPQTPQEVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDHWTAV-----------------------------

Query:  ------KNSRKSTSGSVFTLNGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPRSHKRGKHIE
              K+SRKSTSGSVFTLNGGA+VWR+IKQGCIADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNM+LPITLYCDNSGAVANSKEPRSHKRGKHIE
Subjt:  ------KNSRKSTSGSVFTLNGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPRSHKRGKHIE

Query:  RKYHLIREIVHRGDVIVTKIASEHNIADPFTKALTAKVFESHLEGLALVTREFYTK
        RKYHLIREIV RGDVIVTKIASEHNIADPFTK LTAKVFE HLE L L  R+ Y +
Subjt:  RKYHLIREIVHRGDVIVTKIASEHNIADPFTKALTAKVFESHLEGLALVTREFYTK

A0A5A7UYE8 Gag/pol protein0.0e+0081.42Show/hide
Query:  MDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSVSETPFELWKGRKASLRHFRIWGCPAY
        MDL FQDY+IEHGI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLP+SF GYAVETAV+ILN VPSKSVSETPFELW+GRK SL HFRIWGCPA+
Subjt:  MDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSVSETPFELWKGRKASLRHFRIWGCPAY

Query:  VLEANPKKLEPRSRFCLFIGYPKETRGGYLFDPEENKVLVSTNATFLEEDHLRDHKPRSKVVLSELSNEATETSTRV-----VDDTGTSSQSHPSQEIRE
        VL  NPKKLEPRSR C F+GYPKETRGG  FDP+EN+V VSTNATFLEEDH+R+HKPRSK+VLSE ++E+T     V     VD+T TS QSHPSQ +R 
Subjt:  VLEANPKKLEPRSRFCLFIGYPKETRGGYLFDPEENKVLVSTNATFLEEDHLRDHKPRSKVVLSELSNEATETSTRV-----VDDTGTSSQSHPSQEIRE

Query:  PRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNEWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLV
        PRRSGRVV QP+RYLGLTET VVIPDDG+EDPL++KQAM+DVDK++W+KAMDLE+ESM+FNSVW+LVD PEGV+PIGCKWIYKRKRD+AGKVQTFKARLV
Subjt:  PRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNEWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLV

Query:  AKGFTQREGIDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQDQEQKVCKLKRSIYGLKQASRSWNMRFDTAIK
        AKG+TQREG+DYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESI+MSQPEGFI Q QEQKVCKL RSIYGLKQASRSWN+RFDTAIK
Subjt:  AKGFTQREGIDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQDQEQKVCKLKRSIYGLKQASRSWNMRFDTAIK

Query:  SYGFEQNVDEPCVYKRIINSSIAFLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTLALSQASYIDKLLVRYKMQNSKK
        SYGF+QNVDEPCVYK+I    +AFLVLYVDDILLIGNDVGYLTD+K WLA QFQMKDLGEAQ+VLGIQI+R+RKNKTLALSQA+YIDKLLVRY MQNSKK
Subjt:  SYGFEQNVDEPCVYKRIINSSIAFLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTLALSQASYIDKLLVRYKMQNSKK

Query:  GLLPFRHGINLSKEQCPQTPQEVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDHWTAV-----------------------------
        GLLPFRHG++LSKEQ P+TPQEVEDMRR PYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG DHWTAV                             
Subjt:  GLLPFRHGINLSKEQCPQTPQEVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDHWTAV-----------------------------

Query:  ------KNSRKSTSGSVFTLNGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPRSHKRGKHIE
              K+SRKSTSGSVFTLNGGA+VWR+IKQGCIADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNM+LPITLYCDNSGAVANSKEPRSHKRGKHIE
Subjt:  ------KNSRKSTSGSVFTLNGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPRSHKRGKHIE

Query:  RKYHLIREIVHRGDVIVTKIASEHNIADPFTKALTAKVFESHLEGLALVTREFYTKRST
        RKYHLIREIV RGDVIVTKIASEHNIADPFTK LTAKVFE HLE L L  R+ Y +  T
Subjt:  RKYHLIREIVHRGDVIVTKIASEHNIADPFTKALTAKVFESHLEGLALVTREFYTKRST

A0A5D3CPJ6 Gag/pol protein0.0e+0078.72Show/hide
Query:  MDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSVSETPFELWKGRKASLRHFRIWGCPAY
        MDL FQ+YL+E GI SQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA LPNSF GYAV+TAVYILN VPSKSVSETP +LW GRK SLRHFRIWGCPA+
Subjt:  MDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSVSETPFELWKGRKASLRHFRIWGCPAY

Query:  VLEANPKKLEPRSRFCLFIGYPKETRGGYLFDPEENKVLVSTNATFLEEDHLRDHKPRSKVVLSELSNEATETSTRVVDD---------TGTSSQSHPSQ
        VLE NPKKLEPRS+ CLF+GYPK TRGGY +DP++NKV VSTNATFLEEDH+R+HKPRSK+VL+ELS E TE STRVV++          G+S+++H  Q
Subjt:  VLEANPKKLEPRSRFCLFIGYPKETRGGYLFDPEENKVLVSTNATFLEEDHLRDHKPRSKVVLSELSNEATETSTRVVDD---------TGTSSQSHPSQ

Query:  EIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNEWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFK
         +REPRRSGRV   P RY+ LTET  VI D  IEDPLTFK+AM+DVDK+EWIKAM+LE+ESM+FNSVWDLVDQP+GV+PIGCKWIYKRKR A GKVQTFK
Subjt:  EIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNEWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFK

Query:  ARLVAKGFTQREGIDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQDQEQKVCKLKRSIYGLKQASRSWNMRFD
        ARLVAKG+TQ EG+DYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI   QEQK+CKL RSIYGLKQASRSWN+RFD
Subjt:  ARLVAKGFTQREGIDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQDQEQKVCKLKRSIYGLKQASRSWNMRFD

Query:  TAIKSYGFEQNVDEPCVYKRIINSSIAFLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTLALSQASYIDKLLVRYKMQ
        TAIKSYGF+Q VDEPCVYKRIIN S+AFLVLYVDDILLIGND+G LTDIK WLATQFQMKDLGEAQFVLGIQI R+RKNK LALSQASYIDK++V+Y MQ
Subjt:  TAIKSYGFEQNVDEPCVYKRIINSSIAFLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTLALSQASYIDKLLVRYKMQ

Query:  NSKKGLLPFRHGINLSKEQCPQTPQEVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDHWTAVK------------------------
        NSK+GLLPFRHG+ LSKEQCP+TPQ+VE+MR  PYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPG  HWTAVK                        
Subjt:  NSKKGLLPFRHGINLSKEQCPQTPQEVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDHWTAVK------------------------

Query:  -----------NSRKSTSGSVFTLNGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPRSHKRG
                   +SRKSTSGSVFTLNGGA+VWR+IKQGCIADSTMEAEYVAACEAAKEAVWLR FL DLEVVPNM  PITLYCDNSGAVANS+EPRSHKRG
Subjt:  -----------NSRKSTSGSVFTLNGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPRSHKRG

Query:  KHIERKYHLIREIVHRGDVIVTKIASEHNIADPFTKALTAKVFESHLEGLAL
        KHIERKYHLIREIVHRGDVIVT+IAS HN+ADPFTK LTAKVFE HLE L L
Subjt:  KHIERKYHLIREIVHRGDVIVTKIASEHNIADPFTKALTAKVFESHLEGLAL

SwissProt top hitse value%identityAlignment
P04146 Copia protein6.1e-10330.78Show/hide
Query:  QDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSV---SETPFELWKGRKASLRHFRIWGCPAYVL
        + + ++ GI+  L+ P TPQ NGVSER  RT+ +  R+M+S A+L  SF G AV TA Y++N +PS+++   S+TP+E+W  +K  L+H R++G   YV 
Subjt:  QDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSV---SETPFELWKGRKASLRHFRIWGCPAYVL

Query:  EANPK-KLEPRSRFCLFIGYPKETRGGYLFDPEENKV------------LVSTNATFLEEDHLRDHKPRS---------KVVLSELSNEATETST--RVV
          N + K + +S   +F+GY  E  G  L+D    K             +V++ A   E   L+D K            K++ +E  NE+ E      + 
Subjt:  EANPK-KLEPRSRFCLFIGYPKETRGGYLFDPEENKV------------LVSTNATFLEEDHLRDHKPRS---------KVVLSELSNEATETST--RVV

Query:  DDTGTSSQSHPS--------------------QEIREPRRSGRVV------RQPDRYLG------------LTETPVVIPDDGIEDP-------------
        D   + +++ P+                    Q +++ + S +        R+ D +L              +ET   + + GI++P             
Subjt:  DDTGTSSQSHPS--------------------QEIREPRRSGRVV------RQPDRYLG------------LTETPVVIPDDGIEDP-------------

Query:  ----------------------LTFKQAMDDV-----------DKNEWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARL
                              L      +DV           DK+ W +A++ E+ +   N+ W +  +PE    +  +W++  K +  G    +KARL
Subjt:  ----------------------LTFKQAMDDV-----------DKNEWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARL

Query:  VAKGFTQREGIDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQDQEQKVCKLKRSIYGLKQASRSWNMRFDTAI
        VA+GFTQ+  IDYEETF+PVA + S R +LS+   Y+ ++ QMDVKTAFLNG L+E IYM  P+G         VCKL ++IYGLKQA+R W   F+ A+
Subjt:  VAKGFTQREGIDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQDQEQKVCKLKRSIYGLKQASRSWNMRFDTAI

Query:  KSYGFEQNVDEPCVY---KRIINSSIAFLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTLALSQASYIDKLLVRYKMQ
        K   F  +  + C+Y   K  IN +I +++LYVDD+++   D+  + + K +L  +F+M DL E +  +GI+I    +   + LSQ++Y+ K+L ++ M+
Subjt:  KSYGFEQNVDEPCVY---KRIINSSIAFLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTLALSQASYIDKLLVRYKMQ

Query:  NSKKGLLPFRHGINL----SKEQCPQTPQEVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDHWTAVKN-------------------
        N      P    IN     S E C             P  S +G LMY MLCTRPD+  AV I+SRY S    + W  +K                    
Subjt:  NSKKGLLPFRHGINL----SKEQCPQTPQEVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDHWTAVKN-------------------

Query:  -------------------SRKSTSGSVFTL-NGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSK
                            RKST+G +F + +   I W   +Q  +A S+ EAEY+A  EA +EA+WL+  LT + +   +  PI +Y DN G ++ + 
Subjt:  -------------------SRKSTSGSVFTL-NGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSK

Query:  EPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASEHNIADPFTKALTAKVFESHLEGLALV
         P  HKR KHI+ KYH  RE V    + +  I +E+ +AD FTK L A  F    + L L+
Subjt:  EPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASEHNIADPFTKALTAKVFESHLEGLALV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.6e-15741.36Show/hide
Query:  FQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSVS-ETPFELWKGRKASLRHFRIWGCPAY--V
        F++Y   HGI  + + PGTPQ NGV+ER NRT+++ VRSM+  A+LP SF G AV+TA Y++N  PS  ++ E P  +W  ++ S  H +++GC A+  V
Subjt:  FQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSVS-ETPFELWKGRKASLRHFRIWGCPAY--V

Query:  LEANPKKLEPRSRFCLFIGYPKETRGGYLFDPEENKVLVSTNATFLEED--HLRDHKPRSK-------VVLSELSNEAT--ETSTRVVD-----------
         +    KL+ +S  C+FIGY  E  G  L+DP + KV+ S +  F E +     D   + K       V +   SN  T  E++T  V            
Subjt:  LEANPKKLEPRSRFCLFIGYPKETRGGYLFDPEENKVLVSTNATFLEED--HLRDHKPRSK-------VVLSELSNEAT--ETSTRVVD-----------

Query:  -----DTGTSSQSHPSQ--EIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNEWIKAMDLEIESMHFNSVWDLVDQPEGVRPIG
             D G     HP+Q  E  +P R     R   R    TE  V+I DD   +P + K+ +   +KN+ +KAM  E+ES+  N  + LV+ P+G RP+ 
Subjt:  -----DTGTSSQSHPSQ--EIREPRRSGRVVRQPDRYLGLTETPVVIPDDGIEDPLTFKQAMDDVDKNEWIKAMDLEIESMHFNSVWDLVDQPEGVRPIG

Query:  CKWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQDQEQKVCKL
        CKW++K K+D   K+  +KARLV KGF Q++GID++E FSPV  + SIR +LS+A   D E+ Q+DVKTAFL+G+LEE IYM QPEGF    ++  VCKL
Subjt:  CKWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQDQEQKVCKL

Query:  KRSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVY-KRIINSSIAFLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNK
         +S+YGLKQA R W M+FD+ +KS  + +   +PCVY KR   ++   L+LYVDD+L++G D G +  +K  L+  F MKDLG AQ +LG++IVR R ++
Subjt:  KRSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVY-KRIINSSIAFLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNK

Query:  TLALSQASYIDKLLVRYKMQNSKKGLLPFRHGINLSKEQCPQTPQEVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDHWTAVK----
         L LSQ  YI+++L R+ M+N+K    P    + LSK+ CP T +E  +M + PY+SAVGSLMYAM+CTRPDI +AVG+VSR+  NPG +HW AVK    
Subjt:  TLALSQASYIDKLLVRYKMQNSKKGLLPFRHGINLSKEQCPQTPQEVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDHWTAVK----

Query:  -------------------------------NSRKSTSGSVFTLNGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITL
                                       ++RKS++G +FT +GGAI W++  Q C+A ST EAEY+AA E  KE +WL++FL +L +    ++   +
Subjt:  -------------------------------NSRKSTSGSVFTLNGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITL

Query:  YCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASEHNIADPFTKALTAKVFE
        YCD+  A+  SK    H R KHI+ +YH IRE+V    + V KI++  N AD  TK +    FE
Subjt:  YCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASEHNIADPFTKALTAKVFE

P25600 Putative transposon Ty5-1 protein YCL074W7.9e-2629.3Show/hide
Query:  MDVKTAFLNGNLEESIYMSQPEGFIAQDQEQKVCKLKRSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVYKRIINSSIAFLVLYVDDILLIGNDVGY
        MDV TAFLN  ++E IY+ QP GF+ +     V +L   +YGLKQA   WN   +  +K  GF ++  E  +Y R  +    ++ +YVDD+L+       
Subjt:  MDVKTAFLNGNLEESIYMSQPEGFIAQDQEQKVCKLKRSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVYKRIINSSIAFLVLYVDDILLIGNDVGY

Query:  LTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTLALSQASYIDKLLVRYKMQNSKKGLLPFRHGINLSKEQCPQTPQEVEDMRRFPYASAVGSLMYA
           +K  L   + MKDLG+    LG+ I     N  + LS   YI K     ++   K    P  +    SK     T   ++D+   PY S VG L++ 
Subjt:  LTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTLALSQASYIDKLLVRYKMQNSKKGLLPFRHGINLSKEQCPQTPQEVEDMRRFPYASAVGSLMYA

Query:  MLCTRPDICYAVGIVSRYQSNPGFDHWT------------------------------------AVKNSRKSTSGSVFTLNGGAIVWRNIK-QGCIADST
            RPDI Y V ++SR+   P   H                                      A+ +   ST G V  L G  + W + K +G I   +
Subjt:  MLCTRPDICYAVGIVSRYQSNPGFDHWT------------------------------------AVKNSRKSTSGSVFTLNGGAIVWRNIK-QGCIADST

Query:  MEAEYVAACEAAKE
         EAEY+ A E   E
Subjt:  MEAEYVAACEAAKE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE19.8e-8528.36Show/hide
Query:  DYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSVS-ETPFELWKGRKASLRHFRIWGCPAY--VLE
        +Y  +HGI+   S P TP+ NG+SER++R +++   +++S+A +P ++  YA   AVY++N +P+  +  E+PF+   G   +    R++GC  Y  +  
Subjt:  DYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSVS-ETPFELWKGRKASLRHFRIWGCPAY--VLE

Query:  ANPKKLEPRSRFCLFIGYPKETRGGYL-FDPEENKVLVSTNATFLE-----EDHLRDHKP----------------------------------------
         N  KL+ +SR C+F+GY   T+  YL    + +++ +S +  F E      ++L    P                                        
Subjt:  ANPKKLEPRSRFCLFIGYPKETRGGYL-FDPEENKVLVSTNATFLE-----EDHLRDHKP----------------------------------------

Query:  --------RSKVVLSEL------------------------SNEATETSTRVVDDTGTS-------SQSHPSQEIREPRRSGRVVRQPDRYLGLTETPVV
                 S+V  S L                        + + T+T T+      TS       S S  +Q +  P +S      P      + T   
Subjt:  --------RSKVVLSEL------------------------SNEATETSTRVVDDTGTS-------SQSHPSQEIREPRRSGRVVRQPDRYLGLTETPVV

Query:  IPDDGIEDPLTFKQ--------------------------------------------AMDDVDKNEWIKAMDLEIESMHFNSVWDLVDQPEG-VRPIGC
         P   I  P    Q                                            A+  +    W  AM  EI +   N  WDLV  P   V  +GC
Subjt:  IPDDGIEDPLTFKQ--------------------------------------------AMDDVDKNEWIKAMDLEIESMHFNSVWDLVDQPEG-VRPIGC

Query:  KWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQDQEQKVCKLK
        +WI+ +K ++ G +  +KARLVAKG+ QR G+DY ETFSPV    SIRI+L +A    + I Q+DV  AFL G L + +YMSQP GFI +D+   VCKL+
Subjt:  KWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQDQEQKVCKLK

Query:  RSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVYKRIINSSIAFLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTL
        +++YGLKQA R+W +     + + GF  +V +  ++      SI ++++YVDDIL+ GND   L +    L+ +F +KD  E  + LGI+    R    L
Subjt:  RSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVYKRIINSSIAFLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTL

Query:  ALSQASYIDKLLVRYKMQNSKKGLLPFRHGINLSKEQCPQTPQEVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDH-----------
         LSQ  YI  LL R  M  +K    P      LS     +     E      Y   VGSL Y +  TRPDI YAV  +S++   P  +H           
Subjt:  ALSQASYIDKLLVRYKMQNSKKGLLPFRHGINLSKEQCPQTPQEVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDH-----------

Query:  -------------------------WTAVKNSRKSTSGSVFTLNGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLY
                                 W   K+   ST+G +  L    I W + KQ  +  S+ EAEY +    + E  W+   LT+L +   +  P  +Y
Subjt:  -------------------------WTAVKNSRKSTSGSVFTLNGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLY

Query:  CDNSGAVANSKEPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASEHNIADPFTKALTAKVFES
        CDN GA      P  H R KHI   YH IR  V  G + V  +++   +AD  TK L+   F++
Subjt:  CDNSGAVANSKEPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASEHNIADPFTKALTAKVFES

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.6e-8728.26Show/hide
Query:  QDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSVS-ETPFELWKGRKASLRHFRIWGCPAY--VL
        +DYL +HGI+   S P TP+ NG+SER++R +++M  +++S+A +P ++  YA   AVY++N +P+  +  ++PF+   G+  +    +++GC  Y  + 
Subjt:  QDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSVS-ETPFELWKGRKASLRHFRIWGCPAY--VL

Query:  EANPKKLEPRSRFCLFIGYPKETRGGYL-FDPEENKVLVSTNATFLEE---------------------------------------------DHLRDHK
          N  KLE +S+ C F+GY   T+  YL       ++  S +  F E                                               HL D  
Subjt:  EANPKKLEPRSRFCLFIGYPKETRGGYL-FDPEENKVLVSTNATFLEE---------------------------------------------DHLRDHK

Query:  PR-------------------SKVVLSELSNEATETSTRVVDDTGTSSQSHPSQE----IREPRRSGRVVRQPDRYLGLTETPVVIP-------------
        PR                   S  + S  S+E T  S      T    Q+  S      +  P  +      P++   L ++P+  P             
Subjt:  PR-------------------SKVVLSELSNEATETSTRVVDDTGTSSQSHPSQE----IREPRRSGRVVRQPDRYLGLTETPVVIP-------------

Query:  ---------------------------------------DDGI----------------EDPLTFKQAMDDVDKNEWIKAMDLEIESMHFNSVWDLV-DQ
                                                DGI                 +P T  QAM D   + W +AM  EI +   N  WDLV   
Subjt:  ---------------------------------------DDGI----------------EDPLTFKQAMDDVDKNEWIKAMDLEIESMHFNSVWDLV-DQ

Query:  PEGVRPIGCKWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQD
        P  V  +GC+WI+ +K ++ G +  +KARLVAKG+ QR G+DY ETFSPV    SIRI+L +A    + I Q+DV  AFL G L + +YMSQP GF+ +D
Subjt:  PEGVRPIGCKWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQD

Query:  QEQKVCKLKRSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVYKRIINSSIAFLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQI
        +   VC+L+++IYGLKQA R+W +   T + + GF  ++ +  ++      SI ++++YVDDIL+ GND   L      L+ +F +K+  +  + LGI+ 
Subjt:  QEQKVCKLKRSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVYKRIINSSIAFLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQI

Query:  VRNRKNKTLALSQASYIDKLLVRYKMQNSKKGLLPFRHGINLSKEQCPQTPQEVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDHWT
           R  + L LSQ  Y   LL R  M  +K    P      L+     + P   E      Y   VGSL Y +  TRPD+ YAV  +S+Y   P  DHW 
Subjt:  VRNRKNKTLALSQASYIDKLLVRYKMQNSKKGLLPFRHGINLSKEQCPQTPQEVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDHWT

Query:  AVKNSRK------------------------------------STSGSVFTLNGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVP
        A+K   +                                    ST+G +  L    I W + KQ  +  S+ EAEY +    + E  W+   LT+L +  
Subjt:  AVKNSRK------------------------------------STSGSVFTLNGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVP

Query:  NMHLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASEHNIADPFTKALTAKVFESHLEGLALV
         +  P  +YCDN GA      P  H R KHI   YH IR  V  G + V  +++   +AD  TK L+   F++    + ++
Subjt:  NMHLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVHRGDVIVTKIASEHNIADPFTKALTAKVFESHLEGLALV

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 84.4e-7232.85Show/hide
Query:  EDPLTFKQAMDDVDKNEWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPVAMLKSIRILL
        ++P T+ +A + +    W  AMD EI +M     W++   P   +PIGCKW+YK K ++ G ++ +KARLVAKG+TQ+EGID+ ETFSPV  L S++++L
Subjt:  EDPLTFKQAMDDVDKNEWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPVAMLKSIRILL

Query:  SIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQDQE----QKVCKLKRSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVYKRIINSSIAFL
        +I+  Y++ + Q+D+  AFLNG+L+E IYM  P G+ A+  +      VC LK+SIYGLKQASR W ++F   +  +GF Q+  +   + +I  +    +
Subjt:  SIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQDQE----QKVCKLKRSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVYKRIINSSIAFL

Query:  VLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTLALSQASYIDKLLVRYKMQNSKKGLLPFRHGINLSKEQCPQTPQEVED
        ++YVDDI++  N+   + ++K  L + F+++DLG  ++ LG++I R+     + + Q  Y   LL    +   K   +P    +  S      +  +  D
Subjt:  VLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTLALSQASYIDKLLVRYKMQNSKKGLLPFRHGINLSKEQCPQTPQEVED

Query:  MRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDHWTAV------------------------------------KNSRKSTSGSVFTLNGGA
         +   Y   +G LMY  + TR DI +AV  +S++   P   H  AV                                    K++R+ST+G    L    
Subjt:  MRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGFDHWTAV------------------------------------KNSRKSTSGSVFTLNGGA

Query:  IVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIRE
        I W++ KQ  ++ S+ EAEY A   A  E +WL +F  +L++   +  P  L+CDN+ A+  +     H+R KHIE   H +RE
Subjt:  IVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIRE

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.2e-0834Show/hide
Query:  NRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSVS-ETPFELWKGRKASLRHFRIWGCPAYVLEANPKKLEPRSRFCLFIGYPKETRGGYLFD
        NRT+++ VRSM+    LP +F   A  TAV+I+N  PS +++   P E+W     +  + R +GC AY+   +  KL+PR++         E +G YL +
Subjt:  NRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSVS-ETPFELWKGRKASLRHFRIWGCPAYVLEANPKKLEPRSRFCLFIGYPKETRGGYLFD

ATMG00810.1 DNA/RNA polymerases superfamily protein3.2e-1430.51Show/hide
Query:  FLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTLALSQASYIDKLLVRYKMQNSK--KGLLPFRHGINLSKEQCPQTPQ
        +L+LYVDDILL G+    L  +   L++ F MKDLG   + LGIQI  +     L LSQ  Y +++L    M + K     LP +   ++S  + P  P 
Subjt:  FLVLYVDDILLIGNDVGYLTDIKLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTLALSQASYIDKLLVRYKMQNSK--KGLLPFRHGINLSKEQCPQTPQ

Query:  EVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP---GFD---------------------------------HWTAVKNSRKSTSGSVFTL
        +        + S VG+L Y  L TRPDI YAV IV +    P    FD                                  W    ++R+ST+G    L
Subjt:  EVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNP---GFD---------------------------------HWTAVKNSRKSTSGSVFTL

Query:  NGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVW
            I W   +Q  ++ S+ E EY A    A E  W
Subjt:  NGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)7.6e-1644.19Show/hide
Query:  WIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPVAMLKSIRILLSIA
        W +AM  E++++  N  W LV  P     +GCKW++K K  + G +   KARLVAKGF Q EGI + ET+SPV    +IR +L++A
Subjt:  WIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPVAMLKSIRILLSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTTAAGTTTTCAAGACTATTTGATAGAGCATGGAATTACGTCCCAACTCTCAGCCCCTGGTACACCTCAGCAAAACGGTGTATCAGAAAGGAGAAACAGAACCTT
GTTGGACATGGTTCGTTCGATGATGAGTTATGCTCAGTTGCCTAACTCGTTTTGCGGTTATGCAGTAGAGACTGCAGTTTATATCTTGAACGTAGTTCCCTCAAAGAGTG
TTTCTGAAACACCATTTGAGCTCTGGAAAGGTCGTAAAGCTAGTTTACGACATTTCAGAATTTGGGGATGTCCTGCATACGTGCTAGAAGCAAACCCAAAGAAATTGGAG
CCTCGTTCTAGATTTTGCCTATTTATAGGCTATCCCAAAGAAACGAGAGGTGGATACCTCTTCGATCCAGAGGAAAATAAAGTACTTGTATCGACAAATGCTACATTCCT
AGAAGAAGACCACCTCAGAGATCACAAGCCACGTAGTAAAGTCGTTTTAAGTGAACTATCTAACGAAGCTACAGAAACATCAACAAGAGTTGTTGATGATACTGGCACCT
CGAGTCAATCACATCCTTCTCAAGAAATAAGAGAACCTCGACGTAGTGGGAGGGTTGTTAGACAGCCTGATCGCTATTTGGGTTTAACTGAAACTCCAGTCGTCATACCT
GATGACGGGATCGAGGATCCATTAACCTTTAAACAGGCAATGGATGACGTTGACAAGAACGAGTGGATTAAAGCCATGGACTTAGAGATCGAGTCCATGCATTTTAATTC
AGTATGGGATCTTGTAGATCAACCTGAAGGGGTTAGACCCATAGGGTGCAAATGGATCTACAAGAGAAAACGAGACGCTGCTGGAAAGGTACAGACCTTTAAGGCACGAC
TTGTGGCAAAGGGTTTTACCCAGCGGGAGGGGATTGACTATGAAGAAACCTTCTCCCCTGTTGCCATGTTAAAGTCGATCCGGATACTTTTATCCATTGCCACGTTTTAC
GATTATGAAATATGGCAAATGGATGTCAAGACTGCCTTTCTGAACGGTAATCTTGAAGAAAGTATCTACATGTCTCAGCCAGAGGGGTTCATAGCTCAGGATCAAGAGCA
AAAAGTTTGCAAGCTTAAACGATCCATTTATGGATTAAAACAAGCGTCTAGATCCTGGAATATGAGATTTGATACTGCGATCAAATCTTATGGCTTTGAACAAAACGTTG
ATGAGCCTTGTGTTTACAAAAGGATCATCAACAGCTCTATAGCTTTCTTAGTACTGTATGTGGATGATATCCTACTCATCGGGAATGATGTAGGTTATCTAACTGACATT
AAACTATGGCTAGCAACTCAATTCCAAATGAAAGATTTGGGTGAGGCACAATTTGTTCTAGGAATCCAAATTGTTCGAAACCGAAAGAACAAAACGCTAGCCCTGTCTCA
AGCGTCTTATATCGACAAATTGTTGGTTAGATATAAAATGCAAAATTCCAAGAAAGGACTATTACCTTTCAGGCACGGAATTAACTTGTCTAAGGAACAGTGTCCTCAGA
CACCTCAAGAAGTTGAGGACATGAGACGTTTTCCCTATGCCTCTGCAGTAGGTAGCCTAATGTATGCCATGTTGTGTACAAGGCCTGACATTTGTTATGCAGTAGGGATT
GTCAGTAGATATCAATCCAATCCAGGATTTGATCACTGGACAGCCGTTAAAAATTCGAGGAAATCTACATCGGGGTCAGTATTCACTCTAAATGGAGGAGCAATAGTATG
GCGCAACATCAAGCAAGGATGCATTGCAGACTCGACCATGGAGGCCGAATATGTAGCCGCTTGCGAAGCAGCAAAAGAGGCAGTATGGCTCAGGAAGTTCCTAACTGACT
TGGAAGTCGTTCCAAATATGCATCTGCCTATCACCTTGTATTGTGATAATAGCGGCGCAGTTGCAAACTCAAAGGAACCTCGCAGCCATAAGAGAGGCAAGCACATCGAA
AGGAAATACCATCTCATTAGGGAGATTGTGCACCGAGGAGACGTGATCGTCACGAAGATAGCGTCGGAGCACAACATTGCTGATCCATTCACGAAGGCTCTCACGGCTAA
AGTGTTCGAGAGTCATCTAGAAGGTCTAGCTTTAGTGACTAGAGAATTCTATACTAAAAGGAGCACTTCCAGGATTTGCACGACCATTTGTGTTCAACGTAGAGTCGTGT
TGAAGATGGATGAAAACAGAAGCAAAGGATGCTGCGGGAGCCTTGGTGACCTTAGCTATTATGCTGCTGAGCGACTGGAGGGAGCAATTCTATGCTGCAGCAAAACTGGG
AGCAGAAACTGCCACATCACAGCTCGTGTGATTTTGGTGATGAGCGATCCGCTGGGTACGGAAATGGTTGATTGGAGCTATAATGCCATATTAGAGTTAATCGGGTGCTC
GGGGCGTGAAAAGATGCAAAGGAATGAAAAGAGTTTTTGGGGAGATCGATTTGGGACCTTTTGGAGCCGTAAACAGAGCAGAACAGAGCACTTGGAGGCTGAAACAAAGG
AGGAACTTGGAAATCAACCCATTGTTCGTGGGGATCGTGACGGGGACGTCGGGACTCGGCCTACATTGAGGGCAACAGTCGGGATTGTCTATGCACCGATTAATGCCAAC
AACTTTGAGCTGAAGACCGGTCTCATTCAGATGGCTCGAGACTGTGCATACAGAGGATCGCCCACTGAGGATCCAAATTCTCATCTTAAATCATTCTTGGACATTTGTGG
GACGGTAAAGATTAATGGTGTCTCTGAGGATGCTATTCGCTTACGTTTATTTCCCTTTTCTTTGCAGGATAAAGCACGAGATTGGTTGCAGTCTATTACCCCTGGGAGCA
TCACCACTTGGGATGCTTTGGTCCAGGCCTTTTTAAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACATTCCAACAACAATATGATGAG
CAGCTGTTCGAAGCTTGGGAGCGATTCAAAGAGCTACTGAGGAAGTGTCCTCAGCATGGTTACCCCGATTGGCTTCAGGTACAGTTGTTTTATAATGGTTTAACTCCTAG
TACAAAAACGATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACCGTGGAAAACGCTCGCACACTTTTAGAGGATATGGCCACCAACAGCTATCAGTGGCCAT
CTGAGCGGTCTACACCTAAAAAGATTGCTGCTGGAGTGTTTGAGGTTGATAAAGTAAGTGCACTCCAGGCCCAGATGACTTCCCTTGCCAATGCTTTTATGAAATTTTCA
GGTACAGGGAGTGCACAGTCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGAGGAGACCATCGAGCAGGTTCAGTATGTATCAAATTTTAATTCTAGGGGATA
TAATAATAGTTCTACACCTACACATTATCACCCTAACAATAGGAACCATGAAAATTTCTCTTATGCTAATACTAAGAATGTTCTTAATCCTCCTGGTTTTGCCCCTCAAA
CTCAAGATAATAAAAAGTTAGAAGATCTTGTTGGAGCTTTCATTGCAGAGTCTAGTAACAGGACAACCAAGTTAGAGGAGGCAGTCATTGCTATCAACACCATGGTGAAT
GGCCACAGTGCAGCCATAAAGAACATTGAGACTCAGCTGGGACAGTTGGTGAATGTTGTAAGTACAATGAACAAAGGTAAGGCCCCAGCTGAACAAGAGAAACCCCAGAT
GGAGTATTGTAAGGCAATCACTGTGCACCAGGAGGAATCTGAAGAGGAACCTGAATCTGAGGATTATGAAACGCCTACAGGGGAAGCTGAGGAGGACACATCACAGATGA
GGCTGAAAAGCCTAACCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACTTAAGTTTTCAAGACTATTTGATAGAGCATGGAATTACGTCCCAACTCTCAGCCCCTGGTACACCTCAGCAAAACGGTGTATCAGAAAGGAGAAACAGAACCTT
GTTGGACATGGTTCGTTCGATGATGAGTTATGCTCAGTTGCCTAACTCGTTTTGCGGTTATGCAGTAGAGACTGCAGTTTATATCTTGAACGTAGTTCCCTCAAAGAGTG
TTTCTGAAACACCATTTGAGCTCTGGAAAGGTCGTAAAGCTAGTTTACGACATTTCAGAATTTGGGGATGTCCTGCATACGTGCTAGAAGCAAACCCAAAGAAATTGGAG
CCTCGTTCTAGATTTTGCCTATTTATAGGCTATCCCAAAGAAACGAGAGGTGGATACCTCTTCGATCCAGAGGAAAATAAAGTACTTGTATCGACAAATGCTACATTCCT
AGAAGAAGACCACCTCAGAGATCACAAGCCACGTAGTAAAGTCGTTTTAAGTGAACTATCTAACGAAGCTACAGAAACATCAACAAGAGTTGTTGATGATACTGGCACCT
CGAGTCAATCACATCCTTCTCAAGAAATAAGAGAACCTCGACGTAGTGGGAGGGTTGTTAGACAGCCTGATCGCTATTTGGGTTTAACTGAAACTCCAGTCGTCATACCT
GATGACGGGATCGAGGATCCATTAACCTTTAAACAGGCAATGGATGACGTTGACAAGAACGAGTGGATTAAAGCCATGGACTTAGAGATCGAGTCCATGCATTTTAATTC
AGTATGGGATCTTGTAGATCAACCTGAAGGGGTTAGACCCATAGGGTGCAAATGGATCTACAAGAGAAAACGAGACGCTGCTGGAAAGGTACAGACCTTTAAGGCACGAC
TTGTGGCAAAGGGTTTTACCCAGCGGGAGGGGATTGACTATGAAGAAACCTTCTCCCCTGTTGCCATGTTAAAGTCGATCCGGATACTTTTATCCATTGCCACGTTTTAC
GATTATGAAATATGGCAAATGGATGTCAAGACTGCCTTTCTGAACGGTAATCTTGAAGAAAGTATCTACATGTCTCAGCCAGAGGGGTTCATAGCTCAGGATCAAGAGCA
AAAAGTTTGCAAGCTTAAACGATCCATTTATGGATTAAAACAAGCGTCTAGATCCTGGAATATGAGATTTGATACTGCGATCAAATCTTATGGCTTTGAACAAAACGTTG
ATGAGCCTTGTGTTTACAAAAGGATCATCAACAGCTCTATAGCTTTCTTAGTACTGTATGTGGATGATATCCTACTCATCGGGAATGATGTAGGTTATCTAACTGACATT
AAACTATGGCTAGCAACTCAATTCCAAATGAAAGATTTGGGTGAGGCACAATTTGTTCTAGGAATCCAAATTGTTCGAAACCGAAAGAACAAAACGCTAGCCCTGTCTCA
AGCGTCTTATATCGACAAATTGTTGGTTAGATATAAAATGCAAAATTCCAAGAAAGGACTATTACCTTTCAGGCACGGAATTAACTTGTCTAAGGAACAGTGTCCTCAGA
CACCTCAAGAAGTTGAGGACATGAGACGTTTTCCCTATGCCTCTGCAGTAGGTAGCCTAATGTATGCCATGTTGTGTACAAGGCCTGACATTTGTTATGCAGTAGGGATT
GTCAGTAGATATCAATCCAATCCAGGATTTGATCACTGGACAGCCGTTAAAAATTCGAGGAAATCTACATCGGGGTCAGTATTCACTCTAAATGGAGGAGCAATAGTATG
GCGCAACATCAAGCAAGGATGCATTGCAGACTCGACCATGGAGGCCGAATATGTAGCCGCTTGCGAAGCAGCAAAAGAGGCAGTATGGCTCAGGAAGTTCCTAACTGACT
TGGAAGTCGTTCCAAATATGCATCTGCCTATCACCTTGTATTGTGATAATAGCGGCGCAGTTGCAAACTCAAAGGAACCTCGCAGCCATAAGAGAGGCAAGCACATCGAA
AGGAAATACCATCTCATTAGGGAGATTGTGCACCGAGGAGACGTGATCGTCACGAAGATAGCGTCGGAGCACAACATTGCTGATCCATTCACGAAGGCTCTCACGGCTAA
AGTGTTCGAGAGTCATCTAGAAGGTCTAGCTTTAGTGACTAGAGAATTCTATACTAAAAGGAGCACTTCCAGGATTTGCACGACCATTTGTGTTCAACGTAGAGTCGTGT
TGAAGATGGATGAAAACAGAAGCAAAGGATGCTGCGGGAGCCTTGGTGACCTTAGCTATTATGCTGCTGAGCGACTGGAGGGAGCAATTCTATGCTGCAGCAAAACTGGG
AGCAGAAACTGCCACATCACAGCTCGTGTGATTTTGGTGATGAGCGATCCGCTGGGTACGGAAATGGTTGATTGGAGCTATAATGCCATATTAGAGTTAATCGGGTGCTC
GGGGCGTGAAAAGATGCAAAGGAATGAAAAGAGTTTTTGGGGAGATCGATTTGGGACCTTTTGGAGCCGTAAACAGAGCAGAACAGAGCACTTGGAGGCTGAAACAAAGG
AGGAACTTGGAAATCAACCCATTGTTCGTGGGGATCGTGACGGGGACGTCGGGACTCGGCCTACATTGAGGGCAACAGTCGGGATTGTCTATGCACCGATTAATGCCAAC
AACTTTGAGCTGAAGACCGGTCTCATTCAGATGGCTCGAGACTGTGCATACAGAGGATCGCCCACTGAGGATCCAAATTCTCATCTTAAATCATTCTTGGACATTTGTGG
GACGGTAAAGATTAATGGTGTCTCTGAGGATGCTATTCGCTTACGTTTATTTCCCTTTTCTTTGCAGGATAAAGCACGAGATTGGTTGCAGTCTATTACCCCTGGGAGCA
TCACCACTTGGGATGCTTTGGTCCAGGCCTTTTTAAAGAAATTTTTCCCTCCTGCAAAGACGGTCAAGCTGAGGACCGAGATTGGGACATTCCAACAACAATATGATGAG
CAGCTGTTCGAAGCTTGGGAGCGATTCAAAGAGCTACTGAGGAAGTGTCCTCAGCATGGTTACCCCGATTGGCTTCAGGTACAGTTGTTTTATAATGGTTTAACTCCTAG
TACAAAAACGATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACCGTGGAAAACGCTCGCACACTTTTAGAGGATATGGCCACCAACAGCTATCAGTGGCCAT
CTGAGCGGTCTACACCTAAAAAGATTGCTGCTGGAGTGTTTGAGGTTGATAAAGTAAGTGCACTCCAGGCCCAGATGACTTCCCTTGCCAATGCTTTTATGAAATTTTCA
GGTACAGGGAGTGCACAGTCAATTGAATCAGCTGCTGCTTTAGCATCTAGACCTCAGGAGGAGACCATCGAGCAGGTTCAGTATGTATCAAATTTTAATTCTAGGGGATA
TAATAATAGTTCTACACCTACACATTATCACCCTAACAATAGGAACCATGAAAATTTCTCTTATGCTAATACTAAGAATGTTCTTAATCCTCCTGGTTTTGCCCCTCAAA
CTCAAGATAATAAAAAGTTAGAAGATCTTGTTGGAGCTTTCATTGCAGAGTCTAGTAACAGGACAACCAAGTTAGAGGAGGCAGTCATTGCTATCAACACCATGGTGAAT
GGCCACAGTGCAGCCATAAAGAACATTGAGACTCAGCTGGGACAGTTGGTGAATGTTGTAAGTACAATGAACAAAGGTAAGGCCCCAGCTGAACAAGAGAAACCCCAGAT
GGAGTATTGTAAGGCAATCACTGTGCACCAGGAGGAATCTGAAGAGGAACCTGAATCTGAGGATTATGAAACGCCTACAGGGGAAGCTGAGGAGGACACATCACAGATGA
GGCTGAAAAGCCTAACCTGA
Protein sequenceShow/hide protein sequence
MDLSFQDYLIEHGITSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAQLPNSFCGYAVETAVYILNVVPSKSVSETPFELWKGRKASLRHFRIWGCPAYVLEANPKKLE
PRSRFCLFIGYPKETRGGYLFDPEENKVLVSTNATFLEEDHLRDHKPRSKVVLSELSNEATETSTRVVDDTGTSSQSHPSQEIREPRRSGRVVRQPDRYLGLTETPVVIP
DDGIEDPLTFKQAMDDVDKNEWIKAMDLEIESMHFNSVWDLVDQPEGVRPIGCKWIYKRKRDAAGKVQTFKARLVAKGFTQREGIDYEETFSPVAMLKSIRILLSIATFY
DYEIWQMDVKTAFLNGNLEESIYMSQPEGFIAQDQEQKVCKLKRSIYGLKQASRSWNMRFDTAIKSYGFEQNVDEPCVYKRIINSSIAFLVLYVDDILLIGNDVGYLTDI
KLWLATQFQMKDLGEAQFVLGIQIVRNRKNKTLALSQASYIDKLLVRYKMQNSKKGLLPFRHGINLSKEQCPQTPQEVEDMRRFPYASAVGSLMYAMLCTRPDICYAVGI
VSRYQSNPGFDHWTAVKNSRKSTSGSVFTLNGGAIVWRNIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMHLPITLYCDNSGAVANSKEPRSHKRGKHIE
RKYHLIREIVHRGDVIVTKIASEHNIADPFTKALTAKVFESHLEGLALVTREFYTKRSTSRICTTICVQRRVVLKMDENRSKGCCGSLGDLSYYAAERLEGAILCCSKTG
SRNCHITARVILVMSDPLGTEMVDWSYNAILELIGCSGREKMQRNEKSFWGDRFGTFWSRKQSRTEHLEAETKEELGNQPIVRGDRDGDVGTRPTLRATVGIVYAPINAN
NFELKTGLIQMARDCAYRGSPTEDPNSHLKSFLDICGTVKINGVSEDAIRLRLFPFSLQDKARDWLQSITPGSITTWDALVQAFLKKFFPPAKTVKLRTEIGTFQQQYDE
QLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPSERSTPKKIAAGVFEVDKVSALQAQMTSLANAFMKFS
GTGSAQSIESAAALASRPQEETIEQVQYVSNFNSRGYNNSSTPTHYHPNNRNHENFSYANTKNVLNPPGFAPQTQDNKKLEDLVGAFIAESSNRTTKLEEAVIAINTMVN
GHSAAIKNIETQLGQLVNVVSTMNKGKAPAEQEKPQMEYCKAITVHQEESEEEPESEDYETPTGEAEEDTSQMRLKSLT