CuGenDBv2

Tan0002710 (gene) of Snake gourd v1 genome

Gene ID: Tan0002710
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG08:42066280..42068429
RNA-Seq Expression: Tan0002710
Synteny: Tan0002710
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology
GenBank top hits (e value, % identity, alignment)
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]; e value 1.6e-241; identity 61.55%
Query:  MVLNTELFKTANGQTKKRKISPKENSHLWHLRLGHINLNRIERLVKNGLLNELEENSLSVCESCLEGKMTKRPFSGKGYRAEDTLELIHSDLCGPMNIKA
        +VLNTE+F+T   Q KK+K+S   N++LWHLRLGHINLNRIERLVK+G+LN+LE+NSL  CESCLEGKMTKR F+GKG RA+  LEL+HSDLCGPMN+KA
Subjt:  MVLNTELFKTANGQTKKRKISPKENSHLWHLRLGHINLNRIERLVKNGLLNELEENSLSVCESCLEGKMTKRPFSGKGYRAEDTLELIHSDLCGPMNIKA

Query:  KGGYEYFISFIDDYSRYGHIYLMHHKFEALEKFKEFKTEVENQLGKRVKTLRSDRGGEYMDLKFQNYMIENGITSQLSAPGTLHQNGVSE----------
        +GGYEYFISFIDD+SRYGH+YL+HHK E+ EKFKE+K EVEN++GK +KTLRSDRGGEYMD KFQ+Y+IE GI SQLSAP T  QNGVSE          
Subjt:  KGGYEYFISFIDDYSRYGHIYLMHHKFEALEKFKEFKTEVENQLGKRVKTLRSDRGGEYMDLKFQNYMIENGITSQLSAPGTLHQNGVSE----------

Query:  ---------------------------------------------------RIWGCPTHVLVTNPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVS
                                                           RIWGCP HVLV NPKKL+ RSKLCLFVGYPKE+RGGLFY P+E+KVFVS
Subjt:  ---------------------------------------------------RIWGCPTHVLVTNPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVS

Query:  TNATFMEENHIRDHKPKSKVVLSELDGTIAKVANKNTSTSTRVVD-TSLSSQEGPSQELSMPRRSGRVVIQPDRFIGLAETQVVIPDDNCEDPLTYNQAM
        TNATF+EE+H R+H+P+SK+VL E+     K A    S+ST+VVD  ++S Q   SQEL +PRRSGRVV QP+R++GL ETQ++IPDD  EDPLTY QAM
Subjt:  TNATFMEENHIRDHKPKSKVVLSELDGTIAKVANKNTSTSTRVVD-TSLSSQEGPSQELSMPRRSGRVVIQPDRFIGLAETQVVIPDDNCEDPLTYNQAM

Query:  VDIDKDKWVIAMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAVVAYYDYE-
         D+D+D+W+ AM+ EMESM+FNSVW LVD P  VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVDYEETFSPVAM+KSIRILL++  +Y+YE 
Subjt:  VDIDKDKWVIAMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAVVAYYDYE-

Query:  -------------------------------------------------------------------NDDEPCVYKKIINSSVAFLILYVDDILLIGNDV
                                                                           N DEPCVYKKI+NS VAFLILYVDDILLIGNDV
Subjt:  -------------------------------------------------------------------NDDEPCVYKKIINSSVAFLILYVDDILLIGNDV

Query:  GYLTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGVEDMRRIPYASAVGSLM
         YLTD+K+WL TQFQMKDLG+AQ++LGIQIVRNRKN+TLA+SQASYI KVLSR+KMQ+SKKG LPFRHGIHL KEQCPKTPQ VEDMR IPY+SAVGSLM
Subjt:  GYLTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGVEDMRRIPYASAVGSLM

Query:  YAMLCTKRDI
        YAMLCT+ DI
Subjt:  YAMLCTKRDI

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]; e value 6.3e-246; identity 62.43%
Query:  VLNTELFKTANGQTKKRKISPKENSHLWHLRLGHINLNRIERLVKNGLLNELEENSLSVCESCLEGKMTKRPFSGKGYRAEDTLELIHSDLCGPMNIKAK
        VLN E+F+TAN Q K+++ISP  N++LWHLRLGHINL+RI RLVKNGLLN+L++ SL  CESCLEGKMTKRPF+GKGYRA++ LELIHSDLCGPMN+KA+
Subjt:  VLNTELFKTANGQTKKRKISPKENSHLWHLRLGHINLNRIERLVKNGLLNELEENSLSVCESCLEGKMTKRPFSGKGYRAEDTLELIHSDLCGPMNIKAK

Query:  GGYEYFISFIDDYSRYGHIYLMHHKFEALEKFKEFKTEVENQLGKRVKTLRSDRGGEYMDLKFQNYMIENGITSQLSAPGTLHQNGVSE-----------
        GG+EYFISFIDDYSRYG++YLM HK EALEKFKE+KTEVEN L K++K LRSDRGGEYMDL+FQ+YMIE+GI SQLSAPGT  QNGVSE           
Subjt:  GGYEYFISFIDDYSRYGHIYLMHHKFEALEKFKEFKTEVENQLGKRVKTLRSDRGGEYMDLKFQNYMIENGITSQLSAPGTLHQNGVSE-----------

Query:  --------------------------------------------------RIWGCPTHVLVTNPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVST
                                                          RIWGCP HVLVTNPKKL+ RS+LC FVGYPKETRGGLF+DP+E++VFVST
Subjt:  --------------------------------------------------RIWGCPTHVLVTNPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVST

Query:  NATFMEENHIRDHKPKSKVVLSELDGTIAKVANKNTSTSTRVVDTSLSSQEGPSQELSMPRRSGRVVIQPDRFIGLAETQVVIPDDNCEDPLTYNQAMVD
        NATF+EE+H+R+HKP+SK+VLSE      +V ++    S+RV +T+ S Q  PSQ L MPRRSGRVV QP+R++GL ETQVVIPDD  EDPL+Y QAM D
Subjt:  NATFMEENHIRDHKPKSKVVLSELDGTIAKVANKNTSTSTRVVDTSLSSQEGPSQELSMPRRSGRVVIQPDRFIGLAETQVVIPDDNCEDPLTYNQAMVD

Query:  IDKDKWVIAMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAVVAYYDYE---
        +DKD+WV AMD EMESM+FNSVW+LVD P+GVKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVDYEETFSPVAM+KSIRILL++  +YDYE   
Subjt:  IDKDKWVIAMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAVVAYYDYE---

Query:  -----------------------------------------------------------------NDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGY
                                                                         N DEPCVYKKI    VAFL+LYVDDILLIGNDVGY
Subjt:  -----------------------------------------------------------------NDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGY

Query:  LTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGVEDMRRIPYASAVGSLMYA
        LTD+K WLA QFQMKDLG+AQ+VLGIQI+R+RKN+TLALSQA+YI K+L R+ MQ+SKKGLLPFRHG+HL KEQ PKTPQ VEDMRRIPYASAVGSLMYA
Subjt:  LTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGVEDMRRIPYASAVGSLMYA

Query:  MLCTKRDI
        MLCT+ DI
Subjt:  MLCTKRDI

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]; e value 3.3e-242; identity 61.72%
Query:  VLNTELFKTANGQTKKRKISPKENSHLWHLRLGHINLNRIERLVKNGLLNELEENSLSVCESCLEGKMTKRPFSGKGYRAEDTLELIHSDLCGPMNIKAK
        VLN E+F+TAN Q K+++ISP  N++LWHLRLGHINL+RI RLVK+GLLN+L++ SL  CESCLEGKMTKRPF+GKGYRA++ LELIHSDLCGPMN+KA+
Subjt:  VLNTELFKTANGQTKKRKISPKENSHLWHLRLGHINLNRIERLVKNGLLNELEENSLSVCESCLEGKMTKRPFSGKGYRAEDTLELIHSDLCGPMNIKAK

Query:  GGYEYFISFIDDYSRYGHIYLMHHKFEALEKFKEFKTEVENQLGKRVKTLRSDRGGEYMDLKFQNYMIENGITSQLSAPGTLHQNGVSE-----------
        G +EYFISFIDDYSRYG++YLM HK EALEKFKE+KTEVEN L K++K  RSDRGGEYMDL FQ+YMIE+GI SQLSAPGT  QNGVSE           
Subjt:  GGYEYFISFIDDYSRYGHIYLMHHKFEALEKFKEFKTEVENQLGKRVKTLRSDRGGEYMDLKFQNYMIENGITSQLSAPGTLHQNGVSE-----------

Query:  --------------------------------------------------RIWGCPTHVLVTNPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVST
                                                          RIWGCP HVLVTNPKKL+ RS+LC FVGYPKETRGGLF+DPKE++VFVST
Subjt:  --------------------------------------------------RIWGCPTHVLVTNPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVST

Query:  NATFMEENHIRDHKPKSKVVLSELDGTIAKVANKNTSTSTRVVDTSLSSQEGPSQELSMPRRSGRVVIQPDRFIGLAETQVVIPDDNCEDPLTYNQAMVD
        NATF+EE+H+R+HKP+SK+VLSE      +V ++    S+RV +T+ S Q  PSQ L MPRRSGRVV QP+R++GL ETQVVIPDD  EDPL+Y QAM D
Subjt:  NATFMEENHIRDHKPKSKVVLSELDGTIAKVANKNTSTSTRVVDTSLSSQEGPSQELSMPRRSGRVVIQPDRFIGLAETQVVIPDDNCEDPLTYNQAMVD

Query:  IDKDKWVIAMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAVVAYYDYE---
        +DKD+WV AMD EMESM+FNSVW+LVD P+GVKPIGCKWIYKRKR   GKVQTFKARLVAKGYT+ EGVDYEETFS VAM+KSIRILL++  +YDYE   
Subjt:  IDKDKWVIAMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAVVAYYDYE---

Query:  -----------------------------------------------------------------NDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGY
                                                                         N DEPCVYKKI    VAFL+LYVDDILLIGNDVGY
Subjt:  -----------------------------------------------------------------NDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGY

Query:  LTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGVEDMRRIPYASAVGSLMYA
        LTD+K WLA QFQMKDLG+ Q+VLGIQI+R+RKN+TLALSQA+YI K+L R+ MQ+SKKGLLPFRHG+HL KEQ PKTPQ VEDMRRIPYASAVGSLMYA
Subjt:  LTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGVEDMRRIPYASAVGSLMYA

Query:  MLCTKRDI
        MLCT+ DI
Subjt:  MLCTKRDI

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]; e value 4.7e-241; identity 61.18%
Query:  VLNTELFKTANGQTKKRKISPKENSHLWHLRLGHINLNRIERLVKNGLLNELEENSLSVCESCLEGKMTKRPFSGKGYRAEDTLELIHSDLCGPMNIKAK
        +LNTE+FKTA  Q K+ KISPKEN+HLWHLRLGHINLNRIERLVKNGLL+ELEENSL VCESCLEGKMTKRPF+GKG+RA++ LEL+HSDLCGPMN+KA+
Subjt:  VLNTELFKTANGQTKKRKISPKENSHLWHLRLGHINLNRIERLVKNGLLNELEENSLSVCESCLEGKMTKRPFSGKGYRAEDTLELIHSDLCGPMNIKAK

Query:  GGYEYFISFIDDYSRYGHIYLMHHKFEALEKFKEFKTEVENQLGKRVKTLRSDRGGEYMDLKFQNYMIENGITSQLSAPGTLHQNGVSE-----------
        GG+EYFI+F DDYSRYG++YLM HK EALEKFKE+K EVEN L K +KT RSDRGGEYMDLKFQNY++E GI SQLSAPGT  QNGVSE           
Subjt:  GGYEYFISFIDDYSRYGHIYLMHHKFEALEKFKEFKTEVENQLGKRVKTLRSDRGGEYMDLKFQNYMIENGITSQLSAPGTLHQNGVSE-----------

Query:  --------------------------------------------------RIWGCPTHVLVTNPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVST
                                                          RIWGCP HVL  NPKKL+ RSKLCLFVGYPK TRGG FYDPK++KVFVST
Subjt:  --------------------------------------------------RIWGCPTHVLVTNPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVST

Query:  NATFMEENHIRDHKPKSKVVLSELDGTIAKVANK---NTSTSTRVVDTSLSSQEGPSQELSMPRRSGRVVIQPDRFIGLAETQVVIPDDNCEDPLTYNQA
        NATF+EE+HIR+HKP+SK+VL+EL     + + +     S  TRVV    S++    Q L  PRRSGRV   P R++ L ET  VI D + EDPLT+ +A
Subjt:  NATFMEENHIRDHKPKSKVVLSELDGTIAKVANK---NTSTSTRVVDTSLSSQEGPSQELSMPRRSGRVVIQPDRFIGLAETQVVIPDDNCEDPLTYNQA

Query:  MVDIDKDKWVIAMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAVVAYYDYE
        M D+DKD+W+ AM+ E+ESM+FNSVWDLVD+PDGVKPIGCKWIYKRKRG +GKVQTFKARLVAKGYTQVEGVDYEETFSPVAM+KSIRILL++ AY+DYE
Subjt:  MVDIDKDKWVIAMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAVVAYYDYE

Query:  --------------------------------------------------------------------NDDEPCVYKKIINSSVAFLILYVDDILLIGND
                                                                              DEPCVYK+IIN SVAFL+LYVDDILLIGND
Subjt:  --------------------------------------------------------------------NDDEPCVYKKIINSSVAFLILYVDDILLIGND

Query:  VGYLTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGVEDMRRIPYASAVGSL
        +G LTDIK+WLATQFQMKDLG+AQFVLGIQI R+RKN+ LALSQASYI K++ ++ MQ+SK+GLLPFRHG+ L KEQCPKTPQ VE+MR IPYASAVGSL
Subjt:  VGYLTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGVEDMRRIPYASAVGSL

Query:  MYAMLCTKRDI
        MYAMLCT+ DI
Subjt:  MYAMLCTKRDI

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]; e value 4.7e-241; identity 61.18%
Query:  VLNTELFKTANGQTKKRKISPKENSHLWHLRLGHINLNRIERLVKNGLLNELEENSLSVCESCLEGKMTKRPFSGKGYRAEDTLELIHSDLCGPMNIKAK
        +LNTE+FKTA  Q K+ KISPKEN+HLWHLRLGHINLNRIERLVKNGLL+ELEENSL VCESCLEGKMTKRPF+GKG+RA++ LEL+HSDLCGPMN+KA+
Subjt:  VLNTELFKTANGQTKKRKISPKENSHLWHLRLGHINLNRIERLVKNGLLNELEENSLSVCESCLEGKMTKRPFSGKGYRAEDTLELIHSDLCGPMNIKAK

Query:  GGYEYFISFIDDYSRYGHIYLMHHKFEALEKFKEFKTEVENQLGKRVKTLRSDRGGEYMDLKFQNYMIENGITSQLSAPGTLHQNGVSE-----------
        GG+EYFI+F DDYSRYG++YLM HK EALEKFKE+K EVEN L K +KT RSDRGGEYMDLKFQNY++E GI SQLSAPGT  QNGVSE           
Subjt:  GGYEYFISFIDDYSRYGHIYLMHHKFEALEKFKEFKTEVENQLGKRVKTLRSDRGGEYMDLKFQNYMIENGITSQLSAPGTLHQNGVSE-----------

Query:  --------------------------------------------------RIWGCPTHVLVTNPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVST
                                                          RIWGCP HVL  NPKKL+ RSKLCLFVGYPK TRGG FYDPK++KVFVST
Subjt:  --------------------------------------------------RIWGCPTHVLVTNPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVST

Query:  NATFMEENHIRDHKPKSKVVLSELDGTIAKVANK---NTSTSTRVVDTSLSSQEGPSQELSMPRRSGRVVIQPDRFIGLAETQVVIPDDNCEDPLTYNQA
        NATF+EE+HIR+HKP+SK+VL+EL     + + +     S  TRVV    S++    Q L  PRRSGRV   P R++ L ET  VI D + EDPLT+ +A
Subjt:  NATFMEENHIRDHKPKSKVVLSELDGTIAKVANK---NTSTSTRVVDTSLSSQEGPSQELSMPRRSGRVVIQPDRFIGLAETQVVIPDDNCEDPLTYNQA

Query:  MVDIDKDKWVIAMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAVVAYYDYE
        M D+DKD+W+ AM+ E+ESM+FNSVWDLVD+PDGVKPIGCKWIYKRKRG +GKVQTFKARLVAKGYTQVEGVDYEETFSPVAM+KSIRILL++ AY+DYE
Subjt:  MVDIDKDKWVIAMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAVVAYYDYE

Query:  --------------------------------------------------------------------NDDEPCVYKKIINSSVAFLILYVDDILLIGND
                                                                              DEPCVYK+IIN SVAFL+LYVDDILLIGND
Subjt:  --------------------------------------------------------------------NDDEPCVYKKIINSSVAFLILYVDDILLIGND

Query:  VGYLTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGVEDMRRIPYASAVGSL
        +G LTDIK+WLATQFQMKDLG+AQFVLGIQI R+RKN+ LALSQASYI K++ ++ MQ+SK+GLLPFRHG+ L KEQCPKTPQ VE+MR IPYASAVGSL
Subjt:  VGYLTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGVEDMRRIPYASAVGSL

Query:  MYAMLCTKRDI
        MYAMLCT+ DI
Subjt:  MYAMLCTKRDI

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein; e value 2.3e-241; identity 61.18%
Query:  VLNTELFKTANGQTKKRKISPKENSHLWHLRLGHINLNRIERLVKNGLLNELEENSLSVCESCLEGKMTKRPFSGKGYRAEDTLELIHSDLCGPMNIKAK
        +LNTE+FKTA  Q K+ KISPKEN+HLWHLRLGHINLNRIERLVKNGLL+ELEENSL VCESCLEGKMTKRPF+GKG+RA++ LEL+HSDLCGPMN+KA+
Subjt:  VLNTELFKTANGQTKKRKISPKENSHLWHLRLGHINLNRIERLVKNGLLNELEENSLSVCESCLEGKMTKRPFSGKGYRAEDTLELIHSDLCGPMNIKAK

Query:  GGYEYFISFIDDYSRYGHIYLMHHKFEALEKFKEFKTEVENQLGKRVKTLRSDRGGEYMDLKFQNYMIENGITSQLSAPGTLHQNGVSE-----------
        GG+EYFI+F DDYSRYG++YLM HK EALEKFKE+K EVEN L K +KT RSDRGGEYMDLKFQNY++E GI SQLSAPGT  QNGVSE           
Subjt:  GGYEYFISFIDDYSRYGHIYLMHHKFEALEKFKEFKTEVENQLGKRVKTLRSDRGGEYMDLKFQNYMIENGITSQLSAPGTLHQNGVSE-----------

Query:  --------------------------------------------------RIWGCPTHVLVTNPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVST
                                                          RIWGCP HVL  NPKKL+ RSKLCLFVGYPK TRGG FYDPK++KVFVST
Subjt:  --------------------------------------------------RIWGCPTHVLVTNPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVST

Query:  NATFMEENHIRDHKPKSKVVLSELDGTIAKVANK---NTSTSTRVVDTSLSSQEGPSQELSMPRRSGRVVIQPDRFIGLAETQVVIPDDNCEDPLTYNQA
        NATF+EE+HIR+HKP+SK+VL+EL     + + +     S  TRVV    S++    Q L  PRRSGRV   P R++ L ET  VI D + EDPLT+ +A
Subjt:  NATFMEENHIRDHKPKSKVVLSELDGTIAKVANK---NTSTSTRVVDTSLSSQEGPSQELSMPRRSGRVVIQPDRFIGLAETQVVIPDDNCEDPLTYNQA

Query:  MVDIDKDKWVIAMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAVVAYYDYE
        M D+DKD+W+ AM+ E+ESM+FNSVWDLVD+PDGVKPIGCKWIYKRKRG +GKVQTFKARLVAKGYTQVEGVDYEETFSPVAM+KSIRILL++ AY+DYE
Subjt:  MVDIDKDKWVIAMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAVVAYYDYE

Query:  --------------------------------------------------------------------NDDEPCVYKKIINSSVAFLILYVDDILLIGND
                                                                              DEPCVYK+IIN SVAFL+LYVDDILLIGND
Subjt:  --------------------------------------------------------------------NDDEPCVYKKIINSSVAFLILYVDDILLIGND

Query:  VGYLTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGVEDMRRIPYASAVGSL
        +G LTDIK+WLATQFQMKDLG+AQFVLGIQI R+RKN+ LALSQASYI K++ ++ MQ+SK+GLLPFRHG+ L KEQCPKTPQ VE+MR IPYASAVGSL
Subjt:  VGYLTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGVEDMRRIPYASAVGSL

Query:  MYAMLCTKRDI
        MYAMLCT+ DI
Subjt:  MYAMLCTKRDI

A0A5A7T2V9 Gag/pol protein; e value 1.6e-242; identity 61.72%
Query:  VLNTELFKTANGQTKKRKISPKENSHLWHLRLGHINLNRIERLVKNGLLNELEENSLSVCESCLEGKMTKRPFSGKGYRAEDTLELIHSDLCGPMNIKAK
        VLN E+F+TAN Q K+++ISP  N++LWHLRLGHINL+RI RLVK+GLLN+L++ SL  CESCLEGKMTKRPF+GKGYRA++ LELIHSDLCGPMN+KA+
Subjt:  VLNTELFKTANGQTKKRKISPKENSHLWHLRLGHINLNRIERLVKNGLLNELEENSLSVCESCLEGKMTKRPFSGKGYRAEDTLELIHSDLCGPMNIKAK

Query:  GGYEYFISFIDDYSRYGHIYLMHHKFEALEKFKEFKTEVENQLGKRVKTLRSDRGGEYMDLKFQNYMIENGITSQLSAPGTLHQNGVSE-----------
        G +EYFISFIDDYSRYG++YLM HK EALEKFKE+KTEVEN L K++K  RSDRGGEYMDL FQ+YMIE+GI SQLSAPGT  QNGVSE           
Subjt:  GGYEYFISFIDDYSRYGHIYLMHHKFEALEKFKEFKTEVENQLGKRVKTLRSDRGGEYMDLKFQNYMIENGITSQLSAPGTLHQNGVSE-----------

Query:  --------------------------------------------------RIWGCPTHVLVTNPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVST
                                                          RIWGCP HVLVTNPKKL+ RS+LC FVGYPKETRGGLF+DPKE++VFVST
Subjt:  --------------------------------------------------RIWGCPTHVLVTNPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVST

Query:  NATFMEENHIRDHKPKSKVVLSELDGTIAKVANKNTSTSTRVVDTSLSSQEGPSQELSMPRRSGRVVIQPDRFIGLAETQVVIPDDNCEDPLTYNQAMVD
        NATF+EE+H+R+HKP+SK+VLSE      +V ++    S+RV +T+ S Q  PSQ L MPRRSGRVV QP+R++GL ETQVVIPDD  EDPL+Y QAM D
Subjt:  NATFMEENHIRDHKPKSKVVLSELDGTIAKVANKNTSTSTRVVDTSLSSQEGPSQELSMPRRSGRVVIQPDRFIGLAETQVVIPDDNCEDPLTYNQAMVD

Query:  IDKDKWVIAMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAVVAYYDYE---
        +DKD+WV AMD EMESM+FNSVW+LVD P+GVKPIGCKWIYKRKR   GKVQTFKARLVAKGYT+ EGVDYEETFS VAM+KSIRILL++  +YDYE   
Subjt:  IDKDKWVIAMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAVVAYYDYE---

Query:  -----------------------------------------------------------------NDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGY
                                                                         N DEPCVYKKI    VAFL+LYVDDILLIGNDVGY
Subjt:  -----------------------------------------------------------------NDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGY

Query:  LTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGVEDMRRIPYASAVGSLMYA
        LTD+K WLA QFQMKDLG+ Q+VLGIQI+R+RKN+TLALSQA+YI K+L R+ MQ+SKKGLLPFRHG+HL KEQ PKTPQ VEDMRRIPYASAVGSLMYA
Subjt:  LTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGVEDMRRIPYASAVGSLMYA

Query:  MLCTKRDI
        MLCT+ DI
Subjt:  MLCTKRDI

A0A5A7TZD0 Gag/pol protein; e value 3.1e-246; identity 62.43%
Query:  VLNTELFKTANGQTKKRKISPKENSHLWHLRLGHINLNRIERLVKNGLLNELEENSLSVCESCLEGKMTKRPFSGKGYRAEDTLELIHSDLCGPMNIKAK
        VLN E+F+TAN Q K+++ISP  N++LWHLRLGHINL+RI RLVKNGLLN+L++ SL  CESCLEGKMTKRPF+GKGYRA++ LELIHSDLCGPMN+KA+
Subjt:  VLNTELFKTANGQTKKRKISPKENSHLWHLRLGHINLNRIERLVKNGLLNELEENSLSVCESCLEGKMTKRPFSGKGYRAEDTLELIHSDLCGPMNIKAK

Query:  GGYEYFISFIDDYSRYGHIYLMHHKFEALEKFKEFKTEVENQLGKRVKTLRSDRGGEYMDLKFQNYMIENGITSQLSAPGTLHQNGVSE-----------
        GG+EYFISFIDDYSRYG++YLM HK EALEKFKE+KTEVEN L K++K LRSDRGGEYMDL+FQ+YMIE+GI SQLSAPGT  QNGVSE           
Subjt:  GGYEYFISFIDDYSRYGHIYLMHHKFEALEKFKEFKTEVENQLGKRVKTLRSDRGGEYMDLKFQNYMIENGITSQLSAPGTLHQNGVSE-----------

Query:  --------------------------------------------------RIWGCPTHVLVTNPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVST
                                                          RIWGCP HVLVTNPKKL+ RS+LC FVGYPKETRGGLF+DP+E++VFVST
Subjt:  --------------------------------------------------RIWGCPTHVLVTNPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVST

Query:  NATFMEENHIRDHKPKSKVVLSELDGTIAKVANKNTSTSTRVVDTSLSSQEGPSQELSMPRRSGRVVIQPDRFIGLAETQVVIPDDNCEDPLTYNQAMVD
        NATF+EE+H+R+HKP+SK+VLSE      +V ++    S+RV +T+ S Q  PSQ L MPRRSGRVV QP+R++GL ETQVVIPDD  EDPL+Y QAM D
Subjt:  NATFMEENHIRDHKPKSKVVLSELDGTIAKVANKNTSTSTRVVDTSLSSQEGPSQELSMPRRSGRVVIQPDRFIGLAETQVVIPDDNCEDPLTYNQAMVD

Query:  IDKDKWVIAMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAVVAYYDYE---
        +DKD+WV AMD EMESM+FNSVW+LVD P+GVKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVDYEETFSPVAM+KSIRILL++  +YDYE   
Subjt:  IDKDKWVIAMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAVVAYYDYE---

Query:  -----------------------------------------------------------------NDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGY
                                                                         N DEPCVYKKI    VAFL+LYVDDILLIGNDVGY
Subjt:  -----------------------------------------------------------------NDDEPCVYKKIINSSVAFLILYVDDILLIGNDVGY

Query:  LTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGVEDMRRIPYASAVGSLMYA
        LTD+K WLA QFQMKDLG+AQ+VLGIQI+R+RKN+TLALSQA+YI K+L R+ MQ+SKKGLLPFRHG+HL KEQ PKTPQ VEDMRRIPYASAVGSLMYA
Subjt:  LTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGVEDMRRIPYASAVGSLMYA

Query:  MLCTKRDI
        MLCT+ DI
Subjt:  MLCTKRDI

A0A5D3CPJ6 Gag/pol protein; e value 2.3e-241; identity 61.18%
Query:  VLNTELFKTANGQTKKRKISPKENSHLWHLRLGHINLNRIERLVKNGLLNELEENSLSVCESCLEGKMTKRPFSGKGYRAEDTLELIHSDLCGPMNIKAK
        +LNTE+FKTA  Q K+ KISPKEN+HLWHLRLGHINLNRIERLVKNGLL+ELEENSL VCESCLEGKMTKRPF+GKG+RA++ LEL+HSDLCGPMN+KA+
Subjt:  VLNTELFKTANGQTKKRKISPKENSHLWHLRLGHINLNRIERLVKNGLLNELEENSLSVCESCLEGKMTKRPFSGKGYRAEDTLELIHSDLCGPMNIKAK

Query:  GGYEYFISFIDDYSRYGHIYLMHHKFEALEKFKEFKTEVENQLGKRVKTLRSDRGGEYMDLKFQNYMIENGITSQLSAPGTLHQNGVSE-----------
        GG+EYFI+F DDYSRYG++YLM HK EALEKFKE+K EVEN L K +KT RSDRGGEYMDLKFQNY++E GI SQLSAPGT  QNGVSE           
Subjt:  GGYEYFISFIDDYSRYGHIYLMHHKFEALEKFKEFKTEVENQLGKRVKTLRSDRGGEYMDLKFQNYMIENGITSQLSAPGTLHQNGVSE-----------

Query:  --------------------------------------------------RIWGCPTHVLVTNPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVST
                                                          RIWGCP HVL  NPKKL+ RSKLCLFVGYPK TRGG FYDPK++KVFVST
Subjt:  --------------------------------------------------RIWGCPTHVLVTNPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVST

Query:  NATFMEENHIRDHKPKSKVVLSELDGTIAKVANK---NTSTSTRVVDTSLSSQEGPSQELSMPRRSGRVVIQPDRFIGLAETQVVIPDDNCEDPLTYNQA
        NATF+EE+HIR+HKP+SK+VL+EL     + + +     S  TRVV    S++    Q L  PRRSGRV   P R++ L ET  VI D + EDPLT+ +A
Subjt:  NATFMEENHIRDHKPKSKVVLSELDGTIAKVANK---NTSTSTRVVDTSLSSQEGPSQELSMPRRSGRVVIQPDRFIGLAETQVVIPDDNCEDPLTYNQA

Query:  MVDIDKDKWVIAMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAVVAYYDYE
        M D+DKD+W+ AM+ E+ESM+FNSVWDLVD+PDGVKPIGCKWIYKRKRG +GKVQTFKARLVAKGYTQVEGVDYEETFSPVAM+KSIRILL++ AY+DYE
Subjt:  MVDIDKDKWVIAMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAVVAYYDYE

Query:  --------------------------------------------------------------------NDDEPCVYKKIINSSVAFLILYVDDILLIGND
                                                                              DEPCVYK+IIN SVAFL+LYVDDILLIGND
Subjt:  --------------------------------------------------------------------NDDEPCVYKKIINSSVAFLILYVDDILLIGND

Query:  VGYLTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGVEDMRRIPYASAVGSL
        +G LTDIK+WLATQFQMKDLG+AQFVLGIQI R+RKN+ LALSQASYI K++ ++ MQ+SK+GLLPFRHG+ L KEQCPKTPQ VE+MR IPYASAVGSL
Subjt:  VGYLTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGVEDMRRIPYASAVGSL

Query:  MYAMLCTKRDI
        MYAMLCT+ DI
Subjt:  MYAMLCTKRDI

E2GK51 Gag/pol protein (Fragment); e value 7.8e-242; identity 61.55%
Query:  MVLNTELFKTANGQTKKRKISPKENSHLWHLRLGHINLNRIERLVKNGLLNELEENSLSVCESCLEGKMTKRPFSGKGYRAEDTLELIHSDLCGPMNIKA
        +VLNTE+F+T   Q KK+K+S   N++LWHLRLGHINLNRIERLVK+G+LN+LE+NSL  CESCLEGKMTKR F+GKG RA+  LEL+HSDLCGPMN+KA
Subjt:  MVLNTELFKTANGQTKKRKISPKENSHLWHLRLGHINLNRIERLVKNGLLNELEENSLSVCESCLEGKMTKRPFSGKGYRAEDTLELIHSDLCGPMNIKA

Query:  KGGYEYFISFIDDYSRYGHIYLMHHKFEALEKFKEFKTEVENQLGKRVKTLRSDRGGEYMDLKFQNYMIENGITSQLSAPGTLHQNGVSE----------
        +GGYEYFISFIDD+SRYGH+YL+HHK E+ EKFKE+K EVEN++GK +KTLRSDRGGEYMD KFQ+Y+IE GI SQLSAP T  QNGVSE          
Subjt:  KGGYEYFISFIDDYSRYGHIYLMHHKFEALEKFKEFKTEVENQLGKRVKTLRSDRGGEYMDLKFQNYMIENGITSQLSAPGTLHQNGVSE----------

Query:  ---------------------------------------------------RIWGCPTHVLVTNPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVS
                                                           RIWGCP HVLV NPKKL+ RSKLCLFVGYPKE+RGGLFY P+E+KVFVS
Subjt:  ---------------------------------------------------RIWGCPTHVLVTNPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVS

Query:  TNATFMEENHIRDHKPKSKVVLSELDGTIAKVANKNTSTSTRVVD-TSLSSQEGPSQELSMPRRSGRVVIQPDRFIGLAETQVVIPDDNCEDPLTYNQAM
        TNATF+EE+H R+H+P+SK+VL E+     K A    S+ST+VVD  ++S Q   SQEL +PRRSGRVV QP+R++GL ETQ++IPDD  EDPLTY QAM
Subjt:  TNATFMEENHIRDHKPKSKVVLSELDGTIAKVANKNTSTSTRVVD-TSLSSQEGPSQELSMPRRSGRVVIQPDRFIGLAETQVVIPDDNCEDPLTYNQAM

Query:  VDIDKDKWVIAMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAVVAYYDYE-
         D+D+D+W+ AM+ EMESM+FNSVW LVD P  VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ EGVDYEETFSPVAM+KSIRILL++  +Y+YE 
Subjt:  VDIDKDKWVIAMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAVVAYYDYE-

Query:  -------------------------------------------------------------------NDDEPCVYKKIINSSVAFLILYVDDILLIGNDV
                                                                           N DEPCVYKKI+NS VAFLILYVDDILLIGNDV
Subjt:  -------------------------------------------------------------------NDDEPCVYKKIINSSVAFLILYVDDILLIGNDV

Query:  GYLTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGVEDMRRIPYASAVGSLM
         YLTD+K+WL TQFQMKDLG+AQ++LGIQIVRNRKN+TLA+SQASYI KVLSR+KMQ+SKKG LPFRHGIHL KEQCPKTPQ VEDMR IPY+SAVGSLM
Subjt:  GYLTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGVEDMRRIPYASAVGSLM

Query:  YAMLCTKRDI
        YAMLCT+ DI
Subjt:  YAMLCTKRDI

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein; e value 3.2e-43; identity 22.5%
Query:  KENSHLWHLRLGHIN------LNRIERLVKNGLLNELEENSLSVCESCLEGKMTKRPFSGKGYRA--EDTLELIHSDLCGPMNIKAKGGYEYFISFIDDY
        K N  LWH R GHI+      + R        LLN L E S  +CE CL GK  + PF     +   +  L ++HSD+CGP+         YF+ F+D +
Subjt:  KENSHLWHLRLGHIN------LNRIERLVKNGLLNELEENSLSVCESCLEGKMTKRPFSGKGYRA--EDTLELIHSDLCGPMNIKAKGGYEYFISFIDDY

Query:  SRYGHIYLMHHKFEALEKFKEFKTEVENQLGKRVKTLRSDRGGEYMDLKFQNYMIENGITSQLSAPGTLHQNGVSERI---------------------W
        + Y   YL+ +K +    F++F  + E     +V  L  D G EY+  + + + ++ GI+  L+ P T   NGVSER+                     W
Subjt:  SRYGHIYLMHHKFEALEKFKEFKTEVENQLGKRVKTLRSDRGGEYMDLKFQNYMIENGITSQLSAPGTLHQNGVSERI---------------------W

Query:  G------------CPTHVLVTNPK--------------------------------KLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVSTNATFMEENH
        G             P+  LV + K                                K D +S   +FVGY  E  G   +D   +K  V+ +    E N 
Subjt:  G------------CPTHVLVTNPK--------------------------------KLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVSTNATFMEENH

Query:  IRDHKPK---------------------SKVVLSELDGTIAKV------------ANKNTSTSTRVV---------------------------------
        +     K                      K++ +E      +              NKN    +R +                                 
Subjt:  IRDHKPK---------------------SKVVLSELDGTIAKV------------ANKNTSTSTRVV---------------------------------

Query:  ----DTSLSSQEG---PSQELS-----------------------MPRRSGRVVIQPDRFIGLAE---TQVVIPDDNC--EDPLTYNQAMVDIDKDKWVI
            D  L+  +G   P++                          + RRS R+  +P       +    +VV+       + P ++++     DK  W  
Subjt:  ----DTSLSSQEG---PSQELS-----------------------MPRRSGRVVIQPDRFIGLAE---TQVVIPDDNC--EDPLTYNQAMVDIDKDKWVI

Query:  AMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAVVAYYDYE-----------
        A++ E+ +   N+ W +  +P+    +  +W++  K    G    +KARLVA+G+TQ   +DYEETF+PVA I S R +L++V  Y+ +           
Subjt:  AMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAVVAYYDYE-----------

Query:  ------------------NDDEPCVYKK------------------------IINSSV---------------AFLILYVDDILLIGNDVGYLTDIKEWL
                          N D  C   K                         +NSSV                +++LYVDD+++   D+  + + K +L
Subjt:  ------------------NDDEPCVYKK------------------------IINSSV---------------AFLILYVDDILLIGNDVGYLTDIKEWL

Query:  ATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGVEDMRRIPYASAVGSLMYAMLCTKRDI
          +F+M DL + +  +GI+I    +   + LSQ++Y+ K+LS+F M++      P    I+       +     ED    P  S +G LMY MLCT+ D+
Subjt:  ATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGVEDMRRIPYASAVGSLMYAMLCTKRDI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value 2.8e-87; identity 30.38%
Query:  LWHLRLGHINLNRIERLVKNGLLNELEENSLSVCESCLEGKMTKRPFSGKGYRAEDTLELIHSDLCGPMNIKAKGGYEYFISFIDDYSRYGHIYLMHHKF
        LWH R+GH++   ++ L K  L++  +  ++  C+ CL GK  +  F     R  + L+L++SD+CGPM I++ GG +YF++FIDD SR   +Y++  K 
Subjt:  LWHLRLGHINLNRIERLVKNGLLNELEENSLSVCESCLEGKMTKRPFSGKGYRAEDTLELIHSDLCGPMNIKAKGGYEYFISFIDDYSRYGHIYLMHHKF

Query:  EALEKFKEFKTEVENQLGKRVKTLRSDRGGEYMDLKFQNYMIENGITSQLSAPGTLHQNGVSER------------------------------------
        +  + F++F   VE + G+++K LRSD GGEY   +F+ Y   +GI  + + PGT   NGV+ER                                    
Subjt:  EALEKFKEFKTEVENQLGKRVKTLRSDRGGEYMDLKFQNYMIENGITSQLSAPGTLHQNGVSER------------------------------------

Query:  --------------------------IWGCP--THVLVTNPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVSTNATFMEENHIR---DHKPKSKVV
                                  ++GC    HV      KLD +S  C+F+GY  E  G   +DP + KV  S +  F  E+ +R   D   K K  
Subjt:  --------------------------IWGCP--THVLVTNPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVSTNATFMEENHIR---DHKPKSKVV

Query:  LSELDGTIAKVANKNTSTSTRV-------------------VDTSLSSQEGPSQ--ELSMP-RRSGRVVIQPDRFIGLAETQVVIPDDNCEDPLTYNQAM
        +     TI   +N  TS  +                     +D  +   E P+Q  E   P RRS R  ++  R+     T+ V+  D+  +P +  + +
Subjt:  LSELDGTIAKVANKNTSTSTRV-------------------VDTSLSSQEGPSQ--ELSMP-RRSGRVVIQPDRFIGLAETQVVIPDDNCEDPLTYNQAM

Query:  VDIDKDKWVIAMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAVVAYYDYEN
           +K++ + AM +EMES+  N  + LV+ P G +P+ CKW++K K+  + K+  +KARLV KG+ Q +G+D++E FSPV  + SIR +L++ A  D E 
Subjt:  VDIDKDKWVIAMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAVVAYYDYEN

Query:  D--------------------------------------------------------------------DEPCVY-KKIINSSVAFLILYVDDILLIGND
        +                                                                     +PCVY K+   ++   L+LYVDD+L++G D
Subjt:  D--------------------------------------------------------------------DEPCVY-KKIINSSVAFLILYVDDILLIGND

Query:  VGYLTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGVEDMRRIPYASAVGSL
         G +  +K  L+  F MKDLG AQ +LG++IVR R +R L LSQ  YI +VL RF M+++K    P    + L K+ CP T +   +M ++PY+SAVGSL
Subjt:  VGYLTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGVEDMRRIPYASAVGSL

Query:  MYAMLCTKRDI
        MYAM+CT+ DI
Subjt:  MYAMLCTKRDI

P92520 Uncharacterized mitochondrial protein AtMg00820; e value 2.0e-13; identity 42.35%
Query:  WVIAMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAV
        W  AM +E++++  N  W LV  P     +GCKW++K K   +G +   KARLVAKG+ Q EG+ + ET+SPV    +IR +L V
Subjt:  WVIAMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value 2.4e-38; identity 22.86%
Query:  SPKENSHLWHLRLGHINLNRIERLVKNGLLNELE-ENSLSVCESCLEGKMTKRPFSGKGYRAEDTLELIHSDLCGPMNIKAKGGYEYFISFIDDYSRYGH
        S K     WH RLGH   + +  ++ N  L+ L   +    C  CL  K  K PFS     +   LE I+SD+     I +   Y Y++ F+D ++RY  
Subjt:  SPKENSHLWHLRLGHINLNRIERLVKNGLLNELE-ENSLSVCESCLEGKMTKRPFSGKGYRAEDTLELIHSDLCGPMNIKAKGGYEYFISFIDDYSRYGH

Query:  IYLMHHKFEALEKFKEFKTEVENQLGKRVKTLRSDRGGEYMDLKFQNYMIENGITSQLSAPGTLHQNGVSE-----------------------------
        +Y +  K +  E F  FK  +EN+   R+ T  SD GGE++ L    Y  ++GI+   S P T   NG+SE                             
Subjt:  IYLMHHKFEALEKFKEFKTEVENQLGKRVKTLRSDRGGEYMDLKFQNYMIENGITSQLSAPGTLHQNGVSE-----------------------------

Query:  ---------------------------------RIWGCPTHVLVT--NPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVSTNATFME-----ENHI
                                         R++GC  +  +   N  KLD +S+ C+F+GY       L    +  ++++S +  F E      N++
Subjt:  ---------------------------------RIWGCPTHVLVT--NPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVSTNATFME-----ENHI

Query:  RDHKP------------------------------------------------KSKVVLSELDGTIA-----------------KVANKNTSTSTRVVDT
            P                                                 S+V  S LD + +                 +   + T T T+   +
Subjt:  RDHKP------------------------------------------------KSKVVLSELDGTIA-----------------KVANKNTSTSTRVVDT

Query:  SLSSQEGP--------SQELSMPRRSGRVVIQPDRFIGLAETQVVIP--------------DDNCEDPLTYNQ---------------------------
          +SQ  P        +Q LS P +S      P      + T    P              ++N + PL  +                            
Subjt:  SLSSQEGP--------SQELSMPRRSGRVVIQPDRFIGLAETQVVIP--------------DDNCEDPLTYNQ---------------------------

Query:  ---AMVDIDKDKWVIAMDQEMESMHFNSVWDLVDKPDG-VKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAVV-
           A+  +  ++W  AM  E+ +   N  WDLV  P   V  +GC+WI+ +K   +G +  +KARLVAKGY Q  G+DY ETFSPV    SIRI+L V  
Subjt:  ---AMVDIDKDKWVIAMDQEMESMHFNSVWDLVDKPDG-VKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAVV-

Query:  ------------------------------AYYDYENDDEPCVYKK------------------------IINS-------------SVAFLILYVDDIL
                                       + D +  +  C  +K                         +NS             S+ ++++YVDDIL
Subjt:  ------------------------------AYYDYENDDEPCVYKK------------------------IINS-------------SVAFLILYVDDIL

Query:  LIGNDVGYLTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGVEDMRRIPYAS
        + GND   L +  + L+ +F +KD  +  + LGI+    R    L LSQ  YI+ +L+R  M  +K    P      L      K     E      Y  
Subjt:  LIGNDVGYLTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGVEDMRRIPYAS

Query:  AVGSLMYAMLCTKRDIRF
         VGSL Y +  T+ DI +
Subjt:  AVGSLMYAMLCTKRDIRF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 2.9e-36; %identity: 22.66)
Query:  ANGQTKKRKISP--KENSHLWHLRLGHINLNRIERLVKNGLLNELE-ENSLSVCESCLEGKMTKRPFSGKGYRAEDTLELIHSDLCGPMNIKAKGGYEYF
        A+ Q      SP  K     WH RLGH +L  +  ++ N  L  L   + L  C  C   K  K PFS     +   LE I+SD+     I +   Y Y+
Subjt:  ANGQTKKRKISP--KENSHLWHLRLGHINLNRIERLVKNGLLNELE-ENSLSVCESCLEGKMTKRPFSGKGYRAEDTLELIHSDLCGPMNIKAKGGYEYF

Query:  ISFIDDYSRYGHIYLMHHKFEALEKFKEFKTEVENQLGKRVKTLRSDRGGEYMDLKFQNYMIENGITSQLSAPGTLHQNGVSER----------------
        + F+D ++RY  +Y +  K +  + F  FK+ VEN+   R+ TL SD GGE++ L+  +Y+ ++GI+   S P T   NG+SER                
Subjt:  ISFIDDYSRYGHIYLMHHKFEALEKFKEFKTEVENQLGKRVKTLRSDRGGEYMDLKFQNYMIENGITSQLSAPGTLHQNGVSER----------------

Query:  ----------------------------------------------IWGCPTHVLVT--NPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVSTNAT
                                                      ++GC  +  +   N  KL+ +SK C F+GY       L       +++ S +  
Subjt:  ----------------------------------------------IWGCPTHVLVT--NPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVSTNAT

Query:  FME---------------ENHIRDHKPK---------SKVVLSELD--GTIAKVANKNTSTSTRVVDTSLSSQEGPSQELSMPRRS---------GRVVI
        F E               +    D  P          + +VL      G     + +  S+ + +  T +SS   PS  +S P  S          +   
Subjt:  FME---------------ENHIRDHKPK---------SKVVLSELD--GTIAKVANKNTSTSTRVVDTSLSSQEGPSQELSMPRRS---------GRVVI

Query:  QPDRFIGLAETQVVI--PDDNCEDPLTYNQ----------------------------------------------------------------------
        QP +         ++  P+ N   P + NQ                                                                      
Subjt:  QPDRFIGLAETQVVI--PDDNCEDPLTYNQ----------------------------------------------------------------------

Query:  -------------------AMVDIDKDKWVIAMDQEMESMHFNSVWDLV-DKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFS
                           A+  +  D+W  AM  E+ +   N  WDLV   P  V  +GC+WI+ +K   +G +  +KARLVAKGY Q  G+DY ETFS
Subjt:  -------------------AMVDIDKDKWVIAMDQEMESMHFNSVWDLV-DKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFS

Query:  PVAMIKSIRILLAVV-------------------------------AYYDYENDDEPCVYKKII------------------------------------
        PV    SIRI+L V                                 + D +  D  C  +K I                                    
Subjt:  PVAMIKSIRILLAVV-------------------------------AYYDYENDDEPCVYKKII------------------------------------

Query:  -NSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCP
           S+ ++++YVDDIL+ GND   L    + L+ +F +K+  D  + LGI+    R  + L LSQ  Y + +L+R  M  +K    P      L      
Subjt:  -NSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCP

Query:  KTPQGVEDMRRIPYASAVGSLMYAMLCTKRDIRF
        K P   E      Y   VGSL Y +  T+ D+ +
Subjt:  KTPQGVEDMRRIPYASAVGSLMYAMLCTKRDIRF

Arabidopsis top hits (e value, %identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 1.4e-28; %identity: 26.99)
Query:  EDPLTYNQAMVDIDKDKWVIAMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILL
        ++P TYN+A   +    W  AMD E+ +M     W++   P   KPIGCKW+YK K   +G ++ +KARLVAKGYTQ EG+D+ ETFSPV  + S++++L
Subjt:  EDPLTYNQAMVDIDKDKWVIAMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILL

Query:  AVVAYYDY-------------------------------ENDDEP----CVYKKIIN-----------------------------------SSVAFL--
        A+ A Y++                               + D  P    C  KK I                                    ++  FL  
Subjt:  AVVAYYDY-------------------------------ENDDEP----CVYKKIIN-----------------------------------SSVAFL--

Query:  ILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGVED
        ++YVDDI++  N+   + ++K  L + F+++DLG  ++ LG++I R+     + + Q  Y + +L    +   K   +P    +            G + 
Subjt:  ILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGVED

Query:  MRRIPYASAVGSLMYAMLCTKRDIRF
        +    Y   +G LMY  + T+ DI F
Subjt:  MRRIPYASAVGSLMYAMLCTKRDIRF

ATMG00300.1 Gag-Pol-related retrotransposon family protein (e value: 2.4e-09; %identity: 36.84)
Query:  KENSHLWHLRLGHINLNRIERLVKNGLLNELEENSLSVCESCLEGKMTKRPFSGKGYRAEDTLELIHSDLCGPMNI
        K+ + LWH RL H++   +E LVK G L+  + +SL  CE C+ GK  +  FS   +  ++ L+ +HSDL G  ++
Subjt:  KENSHLWHLRLGHINLNRIERLVKNGLLNELEENSLSVCESCLEGKMTKRPFSGKGYRAEDTLELIHSDLCGPMNI

ATMG00810.1 DNA/RNA polymerases superfamily protein (e value: 3.8e-07; %identity: 34.38)
Query:  FLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGV
        +L+LYVDDILL G+    L  +   L++ F MKDLG   + LGIQI  +     L LSQ  Y  ++L+   M D K    P    ++       K P   
Subjt:  FLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKTPQGV

Query:  EDMRRIPYASAVGSLMYAMLCTKRDIRF
        +      + S VG+L Y  L T+ DI +
Subjt:  EDMRRIPYASAVGSLMYAMLCTKRDIRF

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) (e value: 1.5e-14; %identity: 42.35)
Query:  WVIAMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAV
        W  AM +E++++  N  W LV  P     +GCKW++K K   +G +   KARLVAKG+ Q EG+ + ET+SPV    +IR +L V
Subjt:  WVIAMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAV
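
To illustrate how the %identity column relates to the alignment text, here is a minimal Python sketch. It assumes the usual BLAST-style layout, in which the middle row of each block shows a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. It counts the letters in that row and divides by the number of aligned columns. The function name `percent_identity` and the toy strings are hypothetical and not part of CuGenDB or BLAST.

```python
def percent_identity(query_row: str, match_row: str) -> float:
    """Return percent identity for one aligned block.

    query_row: the aligned query sequence (gaps included), used only for
               its length, i.e. the number of alignment columns.
    match_row: the middle row of the alignment; letters mark identical
               residues, '+' and spaces do not count as identities.
    """
    alignment_length = len(query_row)
    if alignment_length == 0:
        raise ValueError("empty alignment")
    identical = sum(1 for ch in match_row if ch.isalpha())
    return round(100.0 * identical / alignment_length, 2)


# Toy example (hypothetical strings): 3 identical positions out of 5 columns.
print(percent_identity("MKVLA", "MK+ A"))
```

For the ATMG00820.1 block above, 36 identical positions over 85 aligned columns give 42.35, which agrees with the value listed for that hit. For hits split across several blocks, the counts would need to be summed over all blocks before dividing.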


Sequences
CDS sequence
ATGGTTTTGAACACTGAATTATTTAAAACTGCAAACGGACAAACTAAGAAACGTAAAATTTCTCCGAAAGAAAATTCTCATCTTTGGCATCTAAGATTGGGTCATATTAA
TCTCAATAGGATTGAGAGATTGGTAAAGAACGGACTTCTAAACGAGTTAGAAGAAAATTCCTTGTCAGTATGTGAATCCTGCCTAGAAGGCAAGATGACTAAAAGACCTT
TTAGTGGAAAAGGTTATAGAGCCGAAGATACTCTAGAGCTTATACATTCAGACCTTTGTGGTCCGATGAATATTAAAGCCAAGGGAGGATATGAATATTTCATCTCTTTT
ATAGATGATTATTCCAGATATGGGCATATTTACCTAATGCACCACAAGTTTGAAGCACTTGAAAAGTTCAAGGAATTCAAGACTGAAGTTGAAAATCAATTAGGTAAAAG
AGTTAAAACACTTCGATCGGATCGAGGTGGAGAGTACATGGATTTAAAATTCCAGAACTATATGATAGAAAATGGAATTACATCCCAACTCTCAGCTCCCGGCACACTAC
ATCAAAACGGTGTATCGGAAAGAATATGGGGTTGCCCAACACATGTGCTTGTGACAAATCCAAAGAAGTTGGATTCACGTTCAAAGTTGTGCCTATTCGTAGGATACCCA
AAAGAAACAAGAGGTGGTTTATTCTATGATCCTAAGGAAGATAAGGTCTTTGTGTCGACAAATGCCACTTTCATGGAGGAGAACCACATAAGGGACCACAAACCAAAAAG
TAAGGTAGTGTTGAGTGAGTTAGACGGAACAATAGCAAAGGTTGCTAATAAGAACACTAGTACGTCAACAAGAGTTGTTGATACTAGTTTGTCTAGTCAAGAGGGTCCAT
CTCAAGAGTTGAGTATGCCTCGACGTAGTGGGAGGGTTGTGATACAGCCTGACCGTTTCATAGGTTTAGCTGAAACCCAAGTTGTTATACCAGATGACAACTGCGAGGAT
CCATTGACTTATAATCAAGCAATGGTTGACATTGACAAAGACAAATGGGTCATAGCCATGGACCAAGAAATGGAGTCTATGCACTTCAATTCTGTTTGGGATCTTGTAGA
TAAGCCTGATGGGGTAAAACCTATAGGTTGTAAGTGGATCTACAAGAGAAAACGTGGTGTAAATGGGAAGGTGCAAACCTTTAAAGCTAGACTAGTAGCAAAGGGTTATA
CCCAGGTTGAAGGGGTTGACTATGAGGAGACCTTTTCACCTGTTGCTATGATAAAGTCTATCCGTATCCTTCTTGCCGTTGTTGCATATTATGACTATGAGAATGATGAT
GAACCTTGTGTATACAAGAAAATCATCAATAGTTCTGTCGCATTCCTAATTCTCTATGTGGATGATATCCTACTCATTGGGAATGATGTAGGTTATCTTACTGACATCAA
GGAATGGCTAGCTACGCAATTCCAAATGAAAGATTTGGGTGATGCGCAGTTTGTTCTTGGGATCCAGATTGTCCGAAACCGCAAGAATAGAACACTAGCCTTGTCTCAAG
CATCATACATAGTCAAAGTGTTGTCAAGATTTAAGATGCAAGATTCCAAAAAGGGCTTGTTGCCTTTTAGACATGGAATCCATTTGTTTAAGGAACAGTGTCCTAAGACA
CCTCAAGGAGTTGAGGATATGAGACGGATTCCTTACGCATCAGCTGTTGGGAGCCTGATGTACGCCATGTTGTGTACTAAGCGTGACATCCGCTTTTTGCGATTGGGATG
GTGA
mRNA sequence
ATGGTTTTGAACACTGAATTATTTAAAACTGCAAACGGACAAACTAAGAAACGTAAAATTTCTCCGAAAGAAAATTCTCATCTTTGGCATCTAAGATTGGGTCATATTAA
TCTCAATAGGATTGAGAGATTGGTAAAGAACGGACTTCTAAACGAGTTAGAAGAAAATTCCTTGTCAGTATGTGAATCCTGCCTAGAAGGCAAGATGACTAAAAGACCTT
TTAGTGGAAAAGGTTATAGAGCCGAAGATACTCTAGAGCTTATACATTCAGACCTTTGTGGTCCGATGAATATTAAAGCCAAGGGAGGATATGAATATTTCATCTCTTTT
ATAGATGATTATTCCAGATATGGGCATATTTACCTAATGCACCACAAGTTTGAAGCACTTGAAAAGTTCAAGGAATTCAAGACTGAAGTTGAAAATCAATTAGGTAAAAG
AGTTAAAACACTTCGATCGGATCGAGGTGGAGAGTACATGGATTTAAAATTCCAGAACTATATGATAGAAAATGGAATTACATCCCAACTCTCAGCTCCCGGCACACTAC
ATCAAAACGGTGTATCGGAAAGAATATGGGGTTGCCCAACACATGTGCTTGTGACAAATCCAAAGAAGTTGGATTCACGTTCAAAGTTGTGCCTATTCGTAGGATACCCA
AAAGAAACAAGAGGTGGTTTATTCTATGATCCTAAGGAAGATAAGGTCTTTGTGTCGACAAATGCCACTTTCATGGAGGAGAACCACATAAGGGACCACAAACCAAAAAG
TAAGGTAGTGTTGAGTGAGTTAGACGGAACAATAGCAAAGGTTGCTAATAAGAACACTAGTACGTCAACAAGAGTTGTTGATACTAGTTTGTCTAGTCAAGAGGGTCCAT
CTCAAGAGTTGAGTATGCCTCGACGTAGTGGGAGGGTTGTGATACAGCCTGACCGTTTCATAGGTTTAGCTGAAACCCAAGTTGTTATACCAGATGACAACTGCGAGGAT
CCATTGACTTATAATCAAGCAATGGTTGACATTGACAAAGACAAATGGGTCATAGCCATGGACCAAGAAATGGAGTCTATGCACTTCAATTCTGTTTGGGATCTTGTAGA
TAAGCCTGATGGGGTAAAACCTATAGGTTGTAAGTGGATCTACAAGAGAAAACGTGGTGTAAATGGGAAGGTGCAAACCTTTAAAGCTAGACTAGTAGCAAAGGGTTATA
CCCAGGTTGAAGGGGTTGACTATGAGGAGACCTTTTCACCTGTTGCTATGATAAAGTCTATCCGTATCCTTCTTGCCGTTGTTGCATATTATGACTATGAGAATGATGAT
GAACCTTGTGTATACAAGAAAATCATCAATAGTTCTGTCGCATTCCTAATTCTCTATGTGGATGATATCCTACTCATTGGGAATGATGTAGGTTATCTTACTGACATCAA
GGAATGGCTAGCTACGCAATTCCAAATGAAAGATTTGGGTGATGCGCAGTTTGTTCTTGGGATCCAGATTGTCCGAAACCGCAAGAATAGAACACTAGCCTTGTCTCAAG
CATCATACATAGTCAAAGTGTTGTCAAGATTTAAGATGCAAGATTCCAAAAAGGGCTTGTTGCCTTTTAGACATGGAATCCATTTGTTTAAGGAACAGTGTCCTAAGACA
CCTCAAGGAGTTGAGGATATGAGACGGATTCCTTACGCATCAGCTGTTGGGAGCCTGATGTACGCCATGTTGTGTACTAAGCGTGACATCCGCTTTTTGCGATTGGGATG
GTGA
Protein sequence
MVLNTELFKTANGQTKKRKISPKENSHLWHLRLGHINLNRIERLVKNGLLNELEENSLSVCESCLEGKMTKRPFSGKGYRAEDTLELIHSDLCGPMNIKAKGGYEYFISF
IDDYSRYGHIYLMHHKFEALEKFKEFKTEVENQLGKRVKTLRSDRGGEYMDLKFQNYMIENGITSQLSAPGTLHQNGVSERIWGCPTHVLVTNPKKLDSRSKLCLFVGYP
KETRGGLFYDPKEDKVFVSTNATFMEENHIRDHKPKSKVVLSELDGTIAKVANKNTSTSTRVVDTSLSSQEGPSQELSMPRRSGRVVIQPDRFIGLAETQVVIPDDNCED
PLTYNQAMVDIDKDKWVIAMDQEMESMHFNSVWDLVDKPDGVKPIGCKWIYKRKRGVNGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMIKSIRILLAVVAYYDYENDD
EPCVYKKIINSSVAFLILYVDDILLIGNDVGYLTDIKEWLATQFQMKDLGDAQFVLGIQIVRNRKNRTLALSQASYIVKVLSRFKMQDSKKGLLPFRHGIHLFKEQCPKT
PQGVEDMRRIPYASAVGSLMYAMLCTKRDIRFLRLGW
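
The relationship between the CDS and protein sequences above can be checked with a short translation routine. The following is a minimal sketch that assumes the standard nuclear genetic code and a CDS starting in frame at its first base. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence
    lines can be pasted in as-is. Unknown codons (e.g. containing N)
    are rendered as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS listed above.
print(translate("ATGGTTTTGAACACTGAA"))  # MVLNTE
```

Pasting the full CDS into `translate` should give the protein sequence listed here, which begins `MVLNTE` and ends `LRLGW` just before the terminal `TGA` stop codon.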