CuGenDBv2

Pay0019924 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0019924
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Reverse transcriptase
Genome location: chr08:12031543..12047052
RNA-Seq Expression: Pay0019924
Synteny: Pay0019924
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0043565 - sequence-specific DNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domains:
IPR043502 - DNA/RNA polymerase superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR041588 - Integrase zinc-binding domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR004242 - Transposon, En/Spm-like
IPR000477 - Reverse transcriptase domain
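
The genome location in this record is written as `chromosome:start..end`. A minimal sketch of parsing that string and computing the span length follows. It assumes the coordinates are 1-based and inclusive, which the page does not state explicitly; the names `GenomicInterval` and `parse_location` are hypothetical helpers, not part of any CuGenDB tooling.

```python
import re
from typing import NamedTuple


class GenomicInterval(NamedTuple):
    """A chromosome name with start/end coordinates (assumed 1-based, inclusive)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the span.
        return self.end - self.start + 1


# Matches strings such as "chr08:12031543..12047052".
_LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomicInterval:
    """Parse a 'chrom:start..end' string into a GenomicInterval."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return GenomicInterval(match["chrom"], start, end)


loc = parse_location("chr08:12031543..12047052")
print(loc.chrom, loc.length)  # chr08 15510
```

Under the inclusive-coordinate assumption, the gene spans 15,510 bp on chr08.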


Homology
GenBank top hits    e value    % identity    Alignment
KAA0037244.1 reverse transcriptase [Cucumis melo var. makuwa]    9.9e-280    82.64
Query:  ASVVDAREVDVSLSSEPVVRDYPDVFPEELPRLPPHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQD----------VSPWGAPILFVKKKDGS
        ASVVD REVDVSLS EPVVRDYPDVFPEELP LPPHRE+EFAIELEPG VPISR PYRMAPAELKELKVQLQ+          VSPWGAP+LFVKKKDGS
Subjt:  ASVVDAREVDVSLSSEPVVRDYPDVFPEELPRLPPHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQD----------VSPWGAPILFVKKKDGS

Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFV
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAT+FSKIDLRS YHQLRIK+ D+PKTAF SRYG+ EFIVMSFGLTNAP VFMDLMNRVF+EFLDTFV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFV

Query:  IVFIDDILTYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSR
        IVFIDDIL YSKTE EHEEHLRMVLQTLRDNKLYAKF K EFWL +VSFLGHVVSKA VSVDPAKIE VT W +PSTVSEV SFLGL GYYRRFVENFSR
Subjt:  IVFIDDILTYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSR

Query:  IATPLPQLTRKGDPFVWRKA------------LTAPVLTVPDCSGSFVIYNDASKKGLGCVLMQQGKVVAYASRRLKSHEQNYPTHDLELVAVVFALKIW
        IATPL QLTRKG PFVW KA            +TAPVLTVPD SGSFVIY+DASKKGLGCVLMQQGKVVAYASR+LKSHEQNYPTHDLEL AVVFALKIW
Subjt:  IATPLPQLTRKGDPFVWRKA------------LTAPVLTVPDCSGSFVIYNDASKKGLGCVLMQQGKVVAYASRRLKSHEQNYPTHDLELVAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKVSHSTALIE-------------------AVTL------
        RHYLYGEKIQIFTDHKSLKYFFTQKELNMR+RRWLELVKDYDCEILYHPGKANVV DALSRKVSHS ALI                    AVT+      
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKVSHSTALIE-------------------AVTL------

Query:  ------QKIIDAQSNDPYLVEKCGLAEAGQAVEFFISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWCNMKREVVEFVS
              Q+IIDAQSNDPYLVEK GLAE GQAVEF +SSDGGLLFERRLCVPSDSA+KTELLSEAHSSPFSMHPGSTKMYQDLKRVYWW NMKREV EFVS
Subjt:  ------QKIIDAQSNDPYLVEKCGLAEAGQAVEFFISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWCNMKREVVEFVS

Query:  KCLVC
        KCLVC
Subjt:  KCLVC

KAA0040689.1 pol protein [Cucumis melo var. makuwa]    2.0e-280    82.81
Query:  ASVVDAREVDVSLSSEPVVRDYPDVFPEELPRLPPHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQD----------VSPWGAPILFVKKKDGS
        ASVVD REVDVSLSSEPVVRDYPDVFPEELP LPPHRE+EFAIELEPG VPISR PYRMAPAELKELKVQLQ+          +SPWGAP+LFVKKKDGS
Subjt:  ASVVDAREVDVSLSSEPVVRDYPDVFPEELPRLPPHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQD----------VSPWGAPILFVKKKDGS

Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFV
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAT+FSKIDLRS YHQLRIK+ D+PKTAF SRYG+ EFIVMSFGLTNAP VFMDLMNRVF+EFLDTFV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFV

Query:  IVFIDDILTYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSR
        IVFIDDIL YSKTEAEHEEHLRMVLQTLRDNKLYAKFSK EFWL +VSFLGHVVSKA VSVDP KIE VT W +PSTVSEV SFLGL GYYRRFVENFSR
Subjt:  IVFIDDILTYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSR

Query:  IATPLPQLTRKGDPFVWRKA------------LTAPVLTVPDCSGSFVIYNDASKKGLGCVLMQQGKVVAYASRRLKSHEQNYPTHDLELVAVVFALKIW
        IATPL QLTRKG PFVW KA            +TAPVL VPD SGSFVIY+DASKKGLGCVLMQQGKVVAYASR+LKSHEQNYPTHDLEL AVVFALKIW
Subjt:  IATPLPQLTRKGDPFVWRKA------------LTAPVLTVPDCSGSFVIYNDASKKGLGCVLMQQGKVVAYASRRLKSHEQNYPTHDLELVAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKVSHSTALIE-------------------AVTL------
        RHYLYGEKIQIFTDHKSLKYFFTQKELNMR+RRWLELVKDYDCEILYHPGKANVV DALSRKVSHS ALI                    AVT+      
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKVSHSTALIE-------------------AVTL------

Query:  ------QKIIDAQSNDPYLVEKCGLAEAGQAVEFFISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWCNMKREVVEFVS
              Q+IIDAQSNDPYLVEK GLAEAGQAVEF ISSDGGLLFERRLCVPSDSA+KTELLSEAHSSPFSMHPGSTKMYQDLKRVYWW NMKREV EFVS
Subjt:  ------QKIIDAQSNDPYLVEKCGLAEAGQAVEFFISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWCNMKREVVEFVS

Query:  KCLVC
        +CLVC
Subjt:  KCLVC

KAA0048687.1 pol protein [Cucumis melo var. makuwa]    4.5e-280    82.64
Query:  ASVVDAREVDVSLSSEPVVRDYPDVFPEELPRLPPHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQD----------VSPWGAPILFVKKKDGS
        ASVVD RE DVSLSSEPVVRDYPDVFPEELP LPPHRE+EFAIELEPG VPISR PYRMAPAELKELKVQLQ+          VSPWGAP+LFVKKKDGS
Subjt:  ASVVDAREVDVSLSSEPVVRDYPDVFPEELPRLPPHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQD----------VSPWGAPILFVKKKDGS

Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFV
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAT+FSKIDLRS YHQLRIK+ D+PKTAF SRYG+ EFIVMSFGLTNAP VFMDLMNRVF+EFLDTFV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFV

Query:  IVFIDDILTYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSR
        IVFIDDIL YSKTEAEHEEHLRMVLQTLRDNKLYAKFSK EFWL +VSFLGHVVSKA VSVDPAKIE VT W +PSTVSEV SFLGL GYYRRFVENFSR
Subjt:  IVFIDDILTYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSR

Query:  IATPLPQLTRKGDPFVWRKA------------LTAPVLTVPDCSGSFVIYNDASKKGLGCVLMQQGKVVAYASRRLKSHEQNYPTHDLELVAVVFALKIW
        IATPL QLTRKG PFVW KA            +TAPVLTVPD SGSFVIY+DASKKGLGCVLMQQGKVVAYASR+LKSHEQNYPTHDLEL AVVFALKIW
Subjt:  IATPLPQLTRKGDPFVWRKA------------LTAPVLTVPDCSGSFVIYNDASKKGLGCVLMQQGKVVAYASRRLKSHEQNYPTHDLELVAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKVSHSTALIE-------------------AVTL------
        RHYLYGEKIQIFTDHKSLKYFFTQKELNMR+RRWLELVKDYDCEILYHPGKANVV DALSRKVSHS ALI                    AVT+      
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKVSHSTALIE-------------------AVTL------

Query:  ------QKIIDAQSNDPYLVEKCGLAEAGQAVEFFISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWCNMKREVVEFVS
              Q+IIDAQSNDPYLVEK GLAEAGQAVEF +SSDGGLLFERRLCVPSDS VKTELLSEAHSSPFSMHPGSTKMY+D+KRVYWW NMKREV EFVS
Subjt:  ------QKIIDAQSNDPYLVEKCGLAEAGQAVEFFISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWCNMKREVVEFVS

Query:  KCLVC
        +CLVC
Subjt:  KCLVC

KAA0056702.1 pol protein [Cucumis melo var. makuwa]    7.6e-280    82.81
Query:  ASVVDAREVDVSLSSEPVVRDYPDVFPEELPRLPPHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQ----------DVSPWGAPILFVKKKDGS
        ASVVD REVDVSLSSEPVVRDYPDVFPEELPRLPPHRE+EFAIE EPG VPISR PYRMAPAELKELKVQLQ           VSPWGAP+LFVKKKDGS
Subjt:  ASVVDAREVDVSLSSEPVVRDYPDVFPEELPRLPPHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQ----------DVSPWGAPILFVKKKDGS

Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFV
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAT+FSKIDLRS YHQLRIK+ D+PKTAF SRYG+ EFIVMSFGLTNAP VFMDLMNRVF+EFLDTFV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFV

Query:  IVFIDDILTYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSR
        IVFIDDIL YSKTEAEHEEHLRMVLQTLRDNKLYA+FSK EFWL +VSFLGHVVSKA VSVDPAKIE V  W +PSTVSEV SFLGLVGYYRRFVENFSR
Subjt:  IVFIDDILTYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSR

Query:  IATPLPQLTRKGDPFVWRKA------------LTAPVLTVPDCSGSFVIYNDASKKGLGCVLMQQGKVVAYASRRLKSHEQNYPTHDLELVAVVFALKIW
        IATPL QLTRKG PFVW KA            +TAPV+TVPD SGSFVIY+DASKKGLGCVLMQQGKVVAYASR+LKSHEQNYPTHDLEL AVVFALKIW
Subjt:  IATPLPQLTRKGDPFVWRKA------------LTAPVLTVPDCSGSFVIYNDASKKGLGCVLMQQGKVVAYASRRLKSHEQNYPTHDLELVAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKVSHSTALIE-------------------AVTL------
        RHYLYGEKIQIFTDHKSLKYFFTQKELNMR+RRWLELVKDYDCEILYHPGKANVV DALSRKVSHS ALI                    AVT+      
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKVSHSTALIE-------------------AVTL------

Query:  ------QKIIDAQSNDPYLVEKCGLAEAGQAVEFFISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWCNMKREVVEFVS
              Q+IIDAQSNDPYLVEK GLAEAGQAVEF +SSDGGLLFER LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWW NMKREV EFVS
Subjt:  ------QKIIDAQSNDPYLVEKCGLAEAGQAVEFFISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWCNMKREVVEFVS

Query:  KCLVC
        +CLVC
Subjt:  KCLVC

TYK01613.1 pol protein [Cucumis melo var. makuwa]    9.0e-281    82.98
Query:  ASVVDAREVDVSLSSEPVVRDYPDVFPEELPRLPPHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQD----------VSPWGAPILFVKKKDGS
        ASVVD RE DVSLSSEPVVRDYPDVFPEELP LPPHRE+EFAIELEPG VPISR PYRMAPAELKELKVQLQ+          VSPWGAP+LFVKKKDGS
Subjt:  ASVVDAREVDVSLSSEPVVRDYPDVFPEELPRLPPHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQD----------VSPWGAPILFVKKKDGS

Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFV
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAT+FSKIDLRS YHQLRIK+ D+PKTAF SRYG+ EFIVMSFGLTNAP VFMDLMNRVF+EFLDTFV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFV

Query:  IVFIDDILTYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSR
        IVFIDDIL YSKTEAEHEEHLRMVLQTLRDNKLYAKFSK EFWL +VSFLGHVVSKA VSVDPAKIE VT W +PSTVSEV SFLGL GYYRRFVENFSR
Subjt:  IVFIDDILTYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSR

Query:  IATPLPQLTRKGDPFVWRKA------------LTAPVLTVPDCSGSFVIYNDASKKGLGCVLMQQGKVVAYASRRLKSHEQNYPTHDLELVAVVFALKIW
        IATPL QLTRKG PFVW KA            +TAPVLTVPD SGSFVIY+DASKKGLGCVLMQQGKVVAYASR+LKSHEQNYPTHDLEL AVVFALKIW
Subjt:  IATPLPQLTRKGDPFVWRKA------------LTAPVLTVPDCSGSFVIYNDASKKGLGCVLMQQGKVVAYASRRLKSHEQNYPTHDLELVAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKVSHSTALIE-------------------AVTL------
        RHYLYGEKIQIFTDHKSLKYFFTQKELNMR+RRWLELVKDYDCEILYHPGKANVV DALSRKVSHS ALI                    AVT+      
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKVSHSTALIE-------------------AVTL------

Query:  ------QKIIDAQSNDPYLVEKCGLAEAGQAVEFFISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWCNMKREVVEFVS
              Q+IIDAQSNDPYLVEK GLAEAGQ  EF +SSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWW NMKREV EFVS
Subjt:  ------QKIIDAQSNDPYLVEKCGLAEAGQAVEFFISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWCNMKREVVEFVS

Query:  KCLVC
        KCLVC
Subjt:  KCLVC
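
In the alignment blocks above, the unlabelled middle row marks each column: a letter means the query and subject residues are identical, `+` marks a conservative substitution, and a space marks a mismatch or gap. A minimal sketch of tallying one block from its query row and middle row is given below. The function name `count_midline` is a hypothetical helper, and the per-block tallies are only illustrative: the % identity column in this record is reported by the search tool over the full alignment and is not recomputed here.

```python
from typing import NamedTuple


class BlockCounts(NamedTuple):
    identical: int  # columns where the middle row shows a residue letter
    positive: int   # columns marked '+' (conservative substitution)
    columns: int    # total columns in the block, including gap columns


def count_midline(query: str, midline: str) -> BlockCounts:
    """Tally one alignment block using only its query row and middle row."""
    # Trailing spaces of the middle row are often lost in plain-text copies,
    # so pad it back to the width of the query row.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("middle row is longer than the query row")
    identical = sum(1 for ch in midline if ch.isalpha())
    positive = midline.count("+")
    return BlockCounts(identical, positive, len(query))


# Final block of the KAA0040689.1 alignment: query "KCLVC", middle row "+CLVC".
block = count_midline("KCLVC", "+CLVC")
print(block.identical, block.positive, block.columns)  # 4 1 5
```

Summing `identical` and `columns` over all blocks of one hit gives an identity fraction for that hit. Whether gap columns belong in the denominator depends on the search tool's convention, so the result may differ slightly from the reported figure.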

TrEMBL top hits    e value    % identity    Alignment
A0A5A7T190 Reverse transcriptase    4.8e-280    82.64
Query:  ASVVDAREVDVSLSSEPVVRDYPDVFPEELPRLPPHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQD----------VSPWGAPILFVKKKDGS
        ASVVD REVDVSLS EPVVRDYPDVFPEELP LPPHRE+EFAIELEPG VPISR PYRMAPAELKELKVQLQ+          VSPWGAP+LFVKKKDGS
Subjt:  ASVVDAREVDVSLSSEPVVRDYPDVFPEELPRLPPHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQD----------VSPWGAPILFVKKKDGS

Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFV
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAT+FSKIDLRS YHQLRIK+ D+PKTAF SRYG+ EFIVMSFGLTNAP VFMDLMNRVF+EFLDTFV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFV

Query:  IVFIDDILTYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSR
        IVFIDDIL YSKTE EHEEHLRMVLQTLRDNKLYAKF K EFWL +VSFLGHVVSKA VSVDPAKIE VT W +PSTVSEV SFLGL GYYRRFVENFSR
Subjt:  IVFIDDILTYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSR

Query:  IATPLPQLTRKGDPFVWRKA------------LTAPVLTVPDCSGSFVIYNDASKKGLGCVLMQQGKVVAYASRRLKSHEQNYPTHDLELVAVVFALKIW
        IATPL QLTRKG PFVW KA            +TAPVLTVPD SGSFVIY+DASKKGLGCVLMQQGKVVAYASR+LKSHEQNYPTHDLEL AVVFALKIW
Subjt:  IATPLPQLTRKGDPFVWRKA------------LTAPVLTVPDCSGSFVIYNDASKKGLGCVLMQQGKVVAYASRRLKSHEQNYPTHDLELVAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKVSHSTALIE-------------------AVTL------
        RHYLYGEKIQIFTDHKSLKYFFTQKELNMR+RRWLELVKDYDCEILYHPGKANVV DALSRKVSHS ALI                    AVT+      
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKVSHSTALIE-------------------AVTL------

Query:  ------QKIIDAQSNDPYLVEKCGLAEAGQAVEFFISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWCNMKREVVEFVS
              Q+IIDAQSNDPYLVEK GLAE GQAVEF +SSDGGLLFERRLCVPSDSA+KTELLSEAHSSPFSMHPGSTKMYQDLKRVYWW NMKREV EFVS
Subjt:  ------QKIIDAQSNDPYLVEKCGLAEAGQAVEFFISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWCNMKREVVEFVS

Query:  KCLVC
        KCLVC
Subjt:  KCLVC

A0A5A7THE6 Reverse transcriptase    9.7e-281    82.81
Query:  ASVVDAREVDVSLSSEPVVRDYPDVFPEELPRLPPHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQD----------VSPWGAPILFVKKKDGS
        ASVVD REVDVSLSSEPVVRDYPDVFPEELP LPPHRE+EFAIELEPG VPISR PYRMAPAELKELKVQLQ+          +SPWGAP+LFVKKKDGS
Subjt:  ASVVDAREVDVSLSSEPVVRDYPDVFPEELPRLPPHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQD----------VSPWGAPILFVKKKDGS

Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFV
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAT+FSKIDLRS YHQLRIK+ D+PKTAF SRYG+ EFIVMSFGLTNAP VFMDLMNRVF+EFLDTFV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFV

Query:  IVFIDDILTYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSR
        IVFIDDIL YSKTEAEHEEHLRMVLQTLRDNKLYAKFSK EFWL +VSFLGHVVSKA VSVDP KIE VT W +PSTVSEV SFLGL GYYRRFVENFSR
Subjt:  IVFIDDILTYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSR

Query:  IATPLPQLTRKGDPFVWRKA------------LTAPVLTVPDCSGSFVIYNDASKKGLGCVLMQQGKVVAYASRRLKSHEQNYPTHDLELVAVVFALKIW
        IATPL QLTRKG PFVW KA            +TAPVL VPD SGSFVIY+DASKKGLGCVLMQQGKVVAYASR+LKSHEQNYPTHDLEL AVVFALKIW
Subjt:  IATPLPQLTRKGDPFVWRKA------------LTAPVLTVPDCSGSFVIYNDASKKGLGCVLMQQGKVVAYASRRLKSHEQNYPTHDLELVAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKVSHSTALIE-------------------AVTL------
        RHYLYGEKIQIFTDHKSLKYFFTQKELNMR+RRWLELVKDYDCEILYHPGKANVV DALSRKVSHS ALI                    AVT+      
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKVSHSTALIE-------------------AVTL------

Query:  ------QKIIDAQSNDPYLVEKCGLAEAGQAVEFFISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWCNMKREVVEFVS
              Q+IIDAQSNDPYLVEK GLAEAGQAVEF ISSDGGLLFERRLCVPSDSA+KTELLSEAHSSPFSMHPGSTKMYQDLKRVYWW NMKREV EFVS
Subjt:  ------QKIIDAQSNDPYLVEKCGLAEAGQAVEFFISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWCNMKREVVEFVS

Query:  KCLVC
        +CLVC
Subjt:  KCLVC

A0A5A7U330 Reverse transcriptase    2.2e-280    82.64
Query:  ASVVDAREVDVSLSSEPVVRDYPDVFPEELPRLPPHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQD----------VSPWGAPILFVKKKDGS
        ASVVD RE DVSLSSEPVVRDYPDVFPEELP LPPHRE+EFAIELEPG VPISR PYRMAPAELKELKVQLQ+          VSPWGAP+LFVKKKDGS
Subjt:  ASVVDAREVDVSLSSEPVVRDYPDVFPEELPRLPPHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQD----------VSPWGAPILFVKKKDGS

Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFV
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAT+FSKIDLRS YHQLRIK+ D+PKTAF SRYG+ EFIVMSFGLTNAP VFMDLMNRVF+EFLDTFV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFV

Query:  IVFIDDILTYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSR
        IVFIDDIL YSKTEAEHEEHLRMVLQTLRDNKLYAKFSK EFWL +VSFLGHVVSKA VSVDPAKIE VT W +PSTVSEV SFLGL GYYRRFVENFSR
Subjt:  IVFIDDILTYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSR

Query:  IATPLPQLTRKGDPFVWRKA------------LTAPVLTVPDCSGSFVIYNDASKKGLGCVLMQQGKVVAYASRRLKSHEQNYPTHDLELVAVVFALKIW
        IATPL QLTRKG PFVW KA            +TAPVLTVPD SGSFVIY+DASKKGLGCVLMQQGKVVAYASR+LKSHEQNYPTHDLEL AVVFALKIW
Subjt:  IATPLPQLTRKGDPFVWRKA------------LTAPVLTVPDCSGSFVIYNDASKKGLGCVLMQQGKVVAYASRRLKSHEQNYPTHDLELVAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKVSHSTALIE-------------------AVTL------
        RHYLYGEKIQIFTDHKSLKYFFTQKELNMR+RRWLELVKDYDCEILYHPGKANVV DALSRKVSHS ALI                    AVT+      
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKVSHSTALIE-------------------AVTL------

Query:  ------QKIIDAQSNDPYLVEKCGLAEAGQAVEFFISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWCNMKREVVEFVS
              Q+IIDAQSNDPYLVEK GLAEAGQAVEF +SSDGGLLFERRLCVPSDS VKTELLSEAHSSPFSMHPGSTKMY+D+KRVYWW NMKREV EFVS
Subjt:  ------QKIIDAQSNDPYLVEKCGLAEAGQAVEFFISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWCNMKREVVEFVS

Query:  KCLVC
        +CLVC
Subjt:  KCLVC

A0A5A7ULI8 Pol protein    3.7e-280    82.81
Query:  ASVVDAREVDVSLSSEPVVRDYPDVFPEELPRLPPHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQ----------DVSPWGAPILFVKKKDGS
        ASVVD REVDVSLSSEPVVRDYPDVFPEELPRLPPHRE+EFAIE EPG VPISR PYRMAPAELKELKVQLQ           VSPWGAP+LFVKKKDGS
Subjt:  ASVVDAREVDVSLSSEPVVRDYPDVFPEELPRLPPHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQ----------DVSPWGAPILFVKKKDGS

Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFV
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAT+FSKIDLRS YHQLRIK+ D+PKTAF SRYG+ EFIVMSFGLTNAP VFMDLMNRVF+EFLDTFV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFV

Query:  IVFIDDILTYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSR
        IVFIDDIL YSKTEAEHEEHLRMVLQTLRDNKLYA+FSK EFWL +VSFLGHVVSKA VSVDPAKIE V  W +PSTVSEV SFLGLVGYYRRFVENFSR
Subjt:  IVFIDDILTYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSR

Query:  IATPLPQLTRKGDPFVWRKA------------LTAPVLTVPDCSGSFVIYNDASKKGLGCVLMQQGKVVAYASRRLKSHEQNYPTHDLELVAVVFALKIW
        IATPL QLTRKG PFVW KA            +TAPV+TVPD SGSFVIY+DASKKGLGCVLMQQGKVVAYASR+LKSHEQNYPTHDLEL AVVFALKIW
Subjt:  IATPLPQLTRKGDPFVWRKA------------LTAPVLTVPDCSGSFVIYNDASKKGLGCVLMQQGKVVAYASRRLKSHEQNYPTHDLELVAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKVSHSTALIE-------------------AVTL------
        RHYLYGEKIQIFTDHKSLKYFFTQKELNMR+RRWLELVKDYDCEILYHPGKANVV DALSRKVSHS ALI                    AVT+      
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKVSHSTALIE-------------------AVTL------

Query:  ------QKIIDAQSNDPYLVEKCGLAEAGQAVEFFISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWCNMKREVVEFVS
              Q+IIDAQSNDPYLVEK GLAEAGQAVEF +SSDGGLLFER LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWW NMKREV EFVS
Subjt:  ------QKIIDAQSNDPYLVEKCGLAEAGQAVEFFISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWCNMKREVVEFVS

Query:  KCLVC
        +CLVC
Subjt:  KCLVC

A0A5D3BPI1 Reverse transcriptase    4.3e-281    82.98
Query:  ASVVDAREVDVSLSSEPVVRDYPDVFPEELPRLPPHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQD----------VSPWGAPILFVKKKDGS
        ASVVD RE DVSLSSEPVVRDYPDVFPEELP LPPHRE+EFAIELEPG VPISR PYRMAPAELKELKVQLQ+          VSPWGAP+LFVKKKDGS
Subjt:  ASVVDAREVDVSLSSEPVVRDYPDVFPEELPRLPPHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQD----------VSPWGAPILFVKKKDGS

Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFV
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAT+FSKIDLRS YHQLRIK+ D+PKTAF SRYG+ EFIVMSFGLTNAP VFMDLMNRVF+EFLDTFV
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFV

Query:  IVFIDDILTYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSR
        IVFIDDIL YSKTEAEHEEHLRMVLQTLRDNKLYAKFSK EFWL +VSFLGHVVSKA VSVDPAKIE VT W +PSTVSEV SFLGL GYYRRFVENFSR
Subjt:  IVFIDDILTYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSR

Query:  IATPLPQLTRKGDPFVWRKA------------LTAPVLTVPDCSGSFVIYNDASKKGLGCVLMQQGKVVAYASRRLKSHEQNYPTHDLELVAVVFALKIW
        IATPL QLTRKG PFVW KA            +TAPVLTVPD SGSFVIY+DASKKGLGCVLMQQGKVVAYASR+LKSHEQNYPTHDLEL AVVFALKIW
Subjt:  IATPLPQLTRKGDPFVWRKA------------LTAPVLTVPDCSGSFVIYNDASKKGLGCVLMQQGKVVAYASRRLKSHEQNYPTHDLELVAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKVSHSTALIE-------------------AVTL------
        RHYLYGEKIQIFTDHKSLKYFFTQKELNMR+RRWLELVKDYDCEILYHPGKANVV DALSRKVSHS ALI                    AVT+      
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKVSHSTALIE-------------------AVTL------

Query:  ------QKIIDAQSNDPYLVEKCGLAEAGQAVEFFISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWCNMKREVVEFVS
              Q+IIDAQSNDPYLVEK GLAEAGQ  EF +SSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWW NMKREV EFVS
Subjt:  ------QKIIDAQSNDPYLVEKCGLAEAGQAVEFFISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWCNMKREVVEFVS

Query:  KCLVC
        KCLVC
Subjt:  KCLVC

SwissProt top hits    e value    % identity    Alignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.6    2.0e-73    35
Query:  VVRDYPDVFPEELPRLPPHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQDV----------SPWGAPILFV-KKKDGS----MRLCIDYRELNK
        +++ Y D+   E  +L    + +  I  +  +   S+  Y    A  +E++ Q+QD+          SP+ +PI  V KK+D S     R+ IDYR+LN+
Subjt:  VVRDYPDVFPEELPRLPPHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQDV----------SPWGAPILFV-KKKDGS----MRLCIDYRELNK

Query:  VTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFVIVFIDDILTYSK
        +TV +R+P+P +D++  +L     F+ IDL   +HQ+ +    + KTAF +++G+ E++ M FGL NAP  F   MN + +  L+   +V++DDI+ +S 
Subjt:  VTVKNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFVIVFIDDILTYSK

Query:  TEAEHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSRIATPLPQLTRKG
        +  EH + L +V + L    L  +  K EF   E +FLGHV++   +  +P KIE +  +P P+   E+ +FLGL GYYR+F+ NF+ IA P+ +  +K 
Subjt:  TEAEHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSRIATPLPQLTRKG

Query:  -----------DPFVWRKALTA--PVLTVPDCSGSFVIYNDASKKGLGCVLMQQGKVVAYASRRLKSHEQNYPTHDLELVAVVFALKIWRHYLYGEKIQI
                     F   K L +  P+L VPD +  F +  DAS   LG VL Q G  ++Y SR L  HE NY T + EL+A+V+A K +RHYL G   +I
Subjt:  -----------DPFVWRKALTA--PVLTVPDCSGSFVIYNDASKKGLGCVLMQQGKVVAYASRRLKSHEQNYPTHDLELVAVVFALKIWRHYLYGEKIQI

Query:  FTDHKSLKYFFTQKELNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKVSHSTALIE
         +DH+ L + +  K+ N +  RW   + ++D +I Y  GK N V DALSR     T L E
Subjt:  FTDHKSLKYFFTQKELNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKVSHSTALIE

P0CT34 Transposon Tf2-1 polyprotein    4.4e-73    29.86
Query:  VVRDYPDVFPE-ELPRLP-PHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQDVSPWG----------APILFVKKKDGSMRLCIDYRELNKVTV
        + +++ D+  E    +LP P + +EF +EL      +    Y + P +++ +  ++      G           P++FV KK+G++R+ +DY+ LNK   
Subjt:  VVRDYPDVFPE-ELPRLP-PHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQDVSPWG----------APILFVKKKDGSMRLCIDYRELNKVTV

Query:  KNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFVIVFIDDILTYSKTEA
         N YPLP I+ L  ++QG+TIF+K+DL+S YH +R++  D  K AF    G  E++VM +G++ AP  F   +N +  E  ++ V+ ++DDIL +SK+E+
Subjt:  KNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFVIVFIDDILTYSKTEA

Query:  EHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSRIATPLPQLTRKGDPF
        EH +H++ VLQ L++  L    +K EF   +V F+G+ +S+   +     I+ V  W QP    E+  FLG V Y R+F+   S++  PL  L +K   +
Subjt:  EHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSRIATPLPQLTRKGDPF

Query:  VWRKALT------------APVLTVPDCSGSFVIYNDASKKGLGCVLMQQGK-----VVAYASRRLKSHEQNYPTHDLELVAVVFALKIWRHYLYG--EK
         W    T             PVL   D S   ++  DAS   +G VL Q+        V Y S ++   + NY   D E++A++ +LK WRHYL    E 
Subjt:  VWRKALT------------APVLTVPDCSGSFVIYNDASKKGLGCVLMQQGK-----VVAYASRRLKSHEQNYPTHDLELVAVVFALKIWRHYLYG--EK

Query:  IQIFTDHKSLKYFFTQKE--LNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKV-----------SHSTALIEAVTL-----QKIIDAQSNDPYLVE
         +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN + DALSR V            +S   +  +++      +++   +ND  L+ 
Subjt:  IQIFTDHKSLKYFFTQKE--LNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKV-----------SHSTALIEAVTL-----QKIIDAQSNDPYLVE

Query:  KCGLAEAGQAVEFFISSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWCNMKREVVEFVSKCLVC
           L    + VE  I    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W  +++++ E+V  C  C
Subjt:  KCGLAEAGQAVEFFISSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWCNMKREVVEFVSKCLVC

P0CT35 Transposon Tf2-2 polyprotein    4.4e-73    29.86
Query:  VVRDYPDVFPE-ELPRLP-PHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQDVSPWG----------APILFVKKKDGSMRLCIDYRELNKVTV
        + +++ D+  E    +LP P + +EF +EL      +    Y + P +++ +  ++      G           P++FV KK+G++R+ +DY+ LNK   
Subjt:  VVRDYPDVFPE-ELPRLP-PHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQDVSPWG----------APILFVKKKDGSMRLCIDYRELNKVTV

Query:  KNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFVIVFIDDILTYSKTEA
         N YPLP I+ L  ++QG+TIF+K+DL+S YH +R++  D  K AF    G  E++VM +G++ AP  F   +N +  E  ++ V+ ++DDIL +SK+E+
Subjt:  KNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFVIVFIDDILTYSKTEA

Query:  EHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSRIATPLPQLTRKGDPF
        EH +H++ VLQ L++  L    +K EF   +V F+G+ +S+   +     I+ V  W QP    E+  FLG V Y R+F+   S++  PL  L +K   +
Subjt:  EHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSRIATPLPQLTRKGDPF

Query:  VWRKALT------------APVLTVPDCSGSFVIYNDASKKGLGCVLMQQGK-----VVAYASRRLKSHEQNYPTHDLELVAVVFALKIWRHYLYG--EK
         W    T             PVL   D S   ++  DAS   +G VL Q+        V Y S ++   + NY   D E++A++ +LK WRHYL    E 
Subjt:  VWRKALT------------APVLTVPDCSGSFVIYNDASKKGLGCVLMQQGK-----VVAYASRRLKSHEQNYPTHDLELVAVVFALKIWRHYLYG--EK

Query:  IQIFTDHKSLKYFFTQKE--LNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKV-----------SHSTALIEAVTL-----QKIIDAQSNDPYLVE
         +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN + DALSR V            +S   +  +++      +++   +ND  L+ 
Subjt:  IQIFTDHKSLKYFFTQKE--LNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKV-----------SHSTALIEAVTL-----QKIIDAQSNDPYLVE

Query:  KCGLAEAGQAVEFFISSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWCNMKREVVEFVSKCLVC
           L    + VE  I    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W  +++++ E+V  C  C
Subjt:  KCGLAEAGQAVEFFISSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWCNMKREVVEFVSKCLVC

P0CT36 Transposon Tf2-3 polyprotein    4.4e-73    29.86
Query:  VVRDYPDVFPE-ELPRLP-PHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQDVSPWG----------APILFVKKKDGSMRLCIDYRELNKVTV
        + +++ D+  E    +LP P + +EF +EL      +    Y + P +++ +  ++      G           P++FV KK+G++R+ +DY+ LNK   
Subjt:  VVRDYPDVFPE-ELPRLP-PHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQDVSPWG----------APILFVKKKDGSMRLCIDYRELNKVTV

Query:  KNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFVIVFIDDILTYSKTEA
         N YPLP I+ L  ++QG+TIF+K+DL+S YH +R++  D  K AF    G  E++VM +G++ AP  F   +N +  E  ++ V+ ++DDIL +SK+E+
Subjt:  KNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFVIVFIDDILTYSKTEA

Query:  EHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSRIATPLPQLTRKGDPF
        EH +H++ VLQ L++  L    +K EF   +V F+G+ +S+   +     I+ V  W QP    E+  FLG V Y R+F+   S++  PL  L +K   +
Subjt:  EHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSRIATPLPQLTRKGDPF

Query:  VWRKALT------------APVLTVPDCSGSFVIYNDASKKGLGCVLMQQGK-----VVAYASRRLKSHEQNYPTHDLELVAVVFALKIWRHYLYG--EK
         W    T             PVL   D S   ++  DAS   +G VL Q+        V Y S ++   + NY   D E++A++ +LK WRHYL    E 
Subjt:  VWRKALT------------APVLTVPDCSGSFVIYNDASKKGLGCVLMQQGK-----VVAYASRRLKSHEQNYPTHDLELVAVVFALKIWRHYLYG--EK

Query:  IQIFTDHKSLKYFFTQKE--LNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKV-----------SHSTALIEAVTL-----QKIIDAQSNDPYLVE
         +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN + DALSR V            +S   +  +++      +++   +ND  L+ 
Subjt:  IQIFTDHKSLKYFFTQKE--LNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKV-----------SHSTALIEAVTL-----QKIIDAQSNDPYLVE

Query:  KCGLAEAGQAVEFFISSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWCNMKREVVEFVSKCLVC
           L    + VE  I    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W  +++++ E+V  C  C
Subjt:  KCGLAEAGQAVEFFISSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWCNMKREVVEFVSKCLVC

P0CT41 Transposon Tf2-12 polyprotein    4.4e-73    29.86
Query:  VVRDYPDVFPE-ELPRLP-PHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQDVSPWG----------APILFVKKKDGSMRLCIDYRELNKVTV
        + +++ D+  E    +LP P + +EF +EL      +    Y + P +++ +  ++      G           P++FV KK+G++R+ +DY+ LNK   
Subjt:  VVRDYPDVFPE-ELPRLP-PHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQDVSPWG----------APILFVKKKDGSMRLCIDYRELNKVTV

Query:  KNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFVIVFIDDILTYSKTEA
         N YPLP I+ L  ++QG+TIF+K+DL+S YH +R++  D  K AF    G  E++VM +G++ AP  F   +N +  E  ++ V+ ++DDIL +SK+E+
Subjt:  KNRYPLPRIDDLFDQLQGATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFVIVFIDDILTYSKTEA

Query:  EHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSRIATPLPQLTRKGDPF
        EH +H++ VLQ L++  L    +K EF   +V F+G+ +S+   +     I+ V  W QP    E+  FLG V Y R+F+   S++  PL  L +K   +
Subjt:  EHEEHLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSRIATPLPQLTRKGDPF

Query:  VWRKALT------------APVLTVPDCSGSFVIYNDASKKGLGCVLMQQGK-----VVAYASRRLKSHEQNYPTHDLELVAVVFALKIWRHYLYG--EK
         W    T             PVL   D S   ++  DAS   +G VL Q+        V Y S ++   + NY   D E++A++ +LK WRHYL    E 
Subjt:  VWRKALT------------APVLTVPDCSGSFVIYNDASKKGLGCVLMQQGK-----VVAYASRRLKSHEQNYPTHDLELVAVVFALKIWRHYLYG--EK

Query:  IQIFTDHKSLKYFFTQKE--LNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKV-----------SHSTALIEAVTL-----QKIIDAQSNDPYLVE
         +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN + DALSR V            +S   +  +++      +++   +ND  L+ 
Subjt:  IQIFTDHKSLKYFFTQKE--LNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKV-----------SHSTALIEAVTL-----QKIIDAQSNDPYLVE

Query:  KCGLAEAGQAVEFFISSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWCNMKREVVEFVSKCLVC
           L    + VE  I    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W  +++++ E+V  C  C
Subjt:  KCGLAEAGQAVEFFISSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWCNMKREVVEFVSKCLVC

Arabidopsis top hits    e value    % identity    Alignment
ATMG00860.1 DNA/RNA polymerases superfamily protein    3.4e-20    41.22
Query:  HLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLG--HVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSRIATPLPQLTRKGDPFVW
        HL MVLQ    ++ YA   K  F   ++++LG  H++S   VS DPAK+E +  WP+P   +E+  FLGL GYYRRFV+N+ +I  PL +L +K +   W
Subjt:  HLRMVLQTLRDNKLYAKFSKSEFWLMEVSFLG--HVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSRIATPLPQLTRKGDPFVW

Query:  R-------KAL-----TAPVLTVPDCSGSFV
                KAL     T PVL +PD    FV
Subjt:  R-------KAL-----TAPVLTVPDCSGSFV


Sequences
CDS sequence
ATGGGAAAAGAGGACATCATTCATGACAATTCTACATCAAGGAAGGAAAACAAATTTTCGCAAAAGGTGGAAGAGGCAAATACACCATTGTATGGTGGTTGTACGAAGTA
TACAAAGATGTCAGCAGTTGTAGCATTGTACAAACTGAAAACTTTTAATGGTTGGTCAGATACAAGCTTCACTAGCCTTTTGGGGCTTTTGCATGACATGCTCCCAATGG
ACAATGTTATTTCAAGATCCATTTATGAAGTTAGAAAATTATTTAAGGAATTTGATTTAGGTTACCAAAAAATTCATGCATGTGTTAAAGACTGTTGCCTATTTAGAAAT
GAGAATGAAAAGTTAGAAAGTTGTCCTCATTGTGCAAGTTCAAGATGGAAGATCGATGAACGAACAAACCAAATCAAACAAGGTGTGCCCGCCAAGGTATTGAGATACTT
TCCTATCATTCCACGACTTAAACGTATGTTTAAAATAAATGAAGTTAGTGAAAGTTTACGGTGGCATTTGAGTCATAAAAGTACTGATGGAAAGATCAGACATCCTGTTG
ACTCTGTTGCATGGGAAACAATTGATAAAAAATGGCCTGAGTTTTCAATGGATCCACGTAATCTTAGGTTGGGCCTTGCTACAGACGGGTTTAACCCCTTCTCCAATTTA
AGTAGTCGATATAGTTGTTGGCCGGTCATGCTTGTTACTTACAATCTTCCTCCTTGGTTATGCATGAAAAAAGAAAACATAATGTTGACACTGTTGATTCCTGGTCCCAG
ACAACCCGGAAATGATATTGATGTATATCTACAACCCCTTGTGGAAGATTTACAACAACTATGGAAAGGAATACAAGTTTATGATATTGTAGGCAACACACATTTTAATT
TGAGATCAATTCTTATGTGGACTATAAATGATTTTCCAGCATATGGAAATCTTGCCGGATGCACTACAAAAGCGAGTGTGGTGGATGCTAGAGAGGTTGATGTGTCCCTG
TCATCAGAACCAGTGGTGAGGGACTATCCGGATGTCTTTCCTGAAGAACTTCCAAGGTTACCTCCTCACAGAGAGATTGAGTTTGCCATAGAGCTGGAGCCGGGTATGGT
TCCTATATCCAGACCTCCATACAGAATGGCCCCAGCAGAATTGAAAGAGCTGAAAGTACAGTTACAGGATGTGTCACCTTGGGGTGCACCAATTTTATTTGTTAAGAAGA
AGGATGGATCGATGCGCCTATGTATTGACTATAGGGAGTTGAATAAGGTAACCGTTAAGAACAGATATCCCTTGCCCAGGATCGACGATCTATTTGACCAGTTACAGGGA
GCTACAATATTCTCTAAGATCGACCTTCGGTCGAGATATCATCAGCTGAGGATTAAGAATAGGGATTTACCGAAGACAGCCTTTTATTCCAGATATGGATATAATGAGTT
TATTGTGATGTCTTTTGGTTTGACGAATGCTCCGACAGTGTTTATGGATTTGATGAACAGAGTGTTTAAGGAGTTCCTGGACACTTTTGTGATCGTGTTTATTGATGATA
TTTTGACATATTCCAAGACAGAGGCCGAGCATGAGGAGCATTTACGTATGGTTCTACAAACCCTTAGAGACAATAAATTGTATGCAAAGTTCTCGAAATCTGAGTTTTGG
TTGATGGAGGTATCCTTTCTAGGCCATGTGGTTTCTAAGGCTAGAGTTTCTGTGGATCCAGCTAAGATAGAGGGAGTCACCAGTTGGCCCCAACCTTCCACGGTCAGTGA
GGTTCATAGCTTTCTTGGTTTAGTAGGTTATTATCGACGGTTTGTGGAGAACTTTTCCCGTATAGCTACTCCTCTTCCTCAGTTGACCAGGAAGGGAGATCCTTTTGTTT
GGAGAAAGGCACTTACTGCACCGGTTCTTACTGTACCTGATTGTTCCGGGAGTTTTGTGATTTACAATGATGCTTCTAAGAAGGGTTTGGGTTGTGTATTGATGCAGCAA
GGTAAGGTAGTCGCTTATGCTTCTCGTCGGTTGAAGAGTCATGAGCAGAATTACCCTACACATGATTTAGAGTTGGTAGCAGTGGTTTTTGCACTGAAAATATGGAGGCA
TTACTTGTATGGTGAAAAGATACAGATATTCACGGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAGGAATTGAATATGAGACGGCGAAGATGGCTTGAATTAGTGA
AGGATTACGATTGTGAGATATTATATCATCCAGGCAAGGCGAATGTGGTAGTTGATGCTCTTAGTAGAAAGGTATCACATTCAACAGCACTTATTGAGGCAGTCACATTG
CAGAAGATCATTGATGCTCAGAGTAATGATCCTTACTTGGTTGAGAAGTGTGGCCTAGCAGAAGCAGGGCAAGCTGTTGAGTTCTTCATATCCTCTGATGGTGGACTTTT
GTTTGAGAGGCGCCTCTGTGTGCCATCAGATAGTGCGGTTAAAACAGAATTATTATCTGAGGCTCACAGTTCCCCATTTTCCATGCACCCGGGTAGTACGAAGATGTATC
AGGACCTAAAAAGGGTTTATTGGTGGTGTAATATGAAGAGAGAGGTGGTCGAATTTGTTAGTAAATGCTTGGTGTGTTAG
mRNA sequence
ATGGGAAAAGAGGACATCATTCATGACAATTCTACATCAAGGAAGGAAAACAAATTTTCGCAAAAGGTGGAAGAGGCAAATACACCATTGTATGGTGGTTGTACGAAGTA
TACAAAGATGTCAGCAGTTGTAGCATTGTACAAACTGAAAACTTTTAATGGTTGGTCAGATACAAGCTTCACTAGCCTTTTGGGGCTTTTGCATGACATGCTCCCAATGG
ACAATGTTATTTCAAGATCCATTTATGAAGTTAGAAAATTATTTAAGGAATTTGATTTAGGTTACCAAAAAATTCATGCATGTGTTAAAGACTGTTGCCTATTTAGAAAT
GAGAATGAAAAGTTAGAAAGTTGTCCTCATTGTGCAAGTTCAAGATGGAAGATCGATGAACGAACAAACCAAATCAAACAAGGTGTGCCCGCCAAGGTATTGAGATACTT
TCCTATCATTCCACGACTTAAACGTATGTTTAAAATAAATGAAGTTAGTGAAAGTTTACGGTGGCATTTGAGTCATAAAAGTACTGATGGAAAGATCAGACATCCTGTTG
ACTCTGTTGCATGGGAAACAATTGATAAAAAATGGCCTGAGTTTTCAATGGATCCACGTAATCTTAGGTTGGGCCTTGCTACAGACGGGTTTAACCCCTTCTCCAATTTA
AGTAGTCGATATAGTTGTTGGCCGGTCATGCTTGTTACTTACAATCTTCCTCCTTGGTTATGCATGAAAAAAGAAAACATAATGTTGACACTGTTGATTCCTGGTCCCAG
ACAACCCGGAAATGATATTGATGTATATCTACAACCCCTTGTGGAAGATTTACAACAACTATGGAAAGGAATACAAGTTTATGATATTGTAGGCAACACACATTTTAATT
TGAGATCAATTCTTATGTGGACTATAAATGATTTTCCAGCATATGGAAATCTTGCCGGATGCACTACAAAAGCGAGTGTGGTGGATGCTAGAGAGGTTGATGTGTCCCTG
TCATCAGAACCAGTGGTGAGGGACTATCCGGATGTCTTTCCTGAAGAACTTCCAAGGTTACCTCCTCACAGAGAGATTGAGTTTGCCATAGAGCTGGAGCCGGGTATGGT
TCCTATATCCAGACCTCCATACAGAATGGCCCCAGCAGAATTGAAAGAGCTGAAAGTACAGTTACAGGATGTGTCACCTTGGGGTGCACCAATTTTATTTGTTAAGAAGA
AGGATGGATCGATGCGCCTATGTATTGACTATAGGGAGTTGAATAAGGTAACCGTTAAGAACAGATATCCCTTGCCCAGGATCGACGATCTATTTGACCAGTTACAGGGA
GCTACAATATTCTCTAAGATCGACCTTCGGTCGAGATATCATCAGCTGAGGATTAAGAATAGGGATTTACCGAAGACAGCCTTTTATTCCAGATATGGATATAATGAGTT
TATTGTGATGTCTTTTGGTTTGACGAATGCTCCGACAGTGTTTATGGATTTGATGAACAGAGTGTTTAAGGAGTTCCTGGACACTTTTGTGATCGTGTTTATTGATGATA
TTTTGACATATTCCAAGACAGAGGCCGAGCATGAGGAGCATTTACGTATGGTTCTACAAACCCTTAGAGACAATAAATTGTATGCAAAGTTCTCGAAATCTGAGTTTTGG
TTGATGGAGGTATCCTTTCTAGGCCATGTGGTTTCTAAGGCTAGAGTTTCTGTGGATCCAGCTAAGATAGAGGGAGTCACCAGTTGGCCCCAACCTTCCACGGTCAGTGA
GGTTCATAGCTTTCTTGGTTTAGTAGGTTATTATCGACGGTTTGTGGAGAACTTTTCCCGTATAGCTACTCCTCTTCCTCAGTTGACCAGGAAGGGAGATCCTTTTGTTT
GGAGAAAGGCACTTACTGCACCGGTTCTTACTGTACCTGATTGTTCCGGGAGTTTTGTGATTTACAATGATGCTTCTAAGAAGGGTTTGGGTTGTGTATTGATGCAGCAA
GGTAAGGTAGTCGCTTATGCTTCTCGTCGGTTGAAGAGTCATGAGCAGAATTACCCTACACATGATTTAGAGTTGGTAGCAGTGGTTTTTGCACTGAAAATATGGAGGCA
TTACTTGTATGGTGAAAAGATACAGATATTCACGGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAGGAATTGAATATGAGACGGCGAAGATGGCTTGAATTAGTGA
AGGATTACGATTGTGAGATATTATATCATCCAGGCAAGGCGAATGTGGTAGTTGATGCTCTTAGTAGAAAGGTATCACATTCAACAGCACTTATTGAGGCAGTCACATTG
CAGAAGATCATTGATGCTCAGAGTAATGATCCTTACTTGGTTGAGAAGTGTGGCCTAGCAGAAGCAGGGCAAGCTGTTGAGTTCTTCATATCCTCTGATGGTGGACTTTT
GTTTGAGAGGCGCCTCTGTGTGCCATCAGATAGTGCGGTTAAAACAGAATTATTATCTGAGGCTCACAGTTCCCCATTTTCCATGCACCCGGGTAGTACGAAGATGTATC
AGGACCTAAAAAGGGTTTATTGGTGGTGTAATATGAAGAGAGAGGTGGTCGAATTTGTTAGTAAATGCTTGGTGTGTTAG
Protein sequence
MGKEDIIHDNSTSRKENKFSQKVEEANTPLYGGCTKYTKMSAVVALYKLKTFNGWSDTSFTSLLGLLHDMLPMDNVISRSIYEVRKLFKEFDLGYQKIHACVKDCCLFRN
ENEKLESCPHCASSRWKIDERTNQIKQGVPAKVLRYFPIIPRLKRMFKINEVSESLRWHLSHKSTDGKIRHPVDSVAWETIDKKWPEFSMDPRNLRLGLATDGFNPFSNL
SSRYSCWPVMLVTYNLPPWLCMKKENIMLTLLIPGPRQPGNDIDVYLQPLVEDLQQLWKGIQVYDIVGNTHFNLRSILMWTINDFPAYGNLAGCTTKASVVDAREVDVSL
SSEPVVRDYPDVFPEELPRLPPHREIEFAIELEPGMVPISRPPYRMAPAELKELKVQLQDVSPWGAPILFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQG
ATIFSKIDLRSRYHQLRIKNRDLPKTAFYSRYGYNEFIVMSFGLTNAPTVFMDLMNRVFKEFLDTFVIVFIDDILTYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKSEFW
LMEVSFLGHVVSKARVSVDPAKIEGVTSWPQPSTVSEVHSFLGLVGYYRRFVENFSRIATPLPQLTRKGDPFVWRKALTAPVLTVPDCSGSFVIYNDASKKGLGCVLMQQ
GKVVAYASRRLKSHEQNYPTHDLELVAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRRRRWLELVKDYDCEILYHPGKANVVVDALSRKVSHSTALIEAVTL
QKIIDAQSNDPYLVEKCGLAEAGQAVEFFISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWCNMKREVVEFVSKCLVC
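
The CDS and protein sequences listed above can be checked against each other by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the database record: the function name `translate` and the choice to stop at the first stop codon are assumptions made for illustration, and the wrapped sequence lines are assumed to be joined into one string before use.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (the third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS string into a protein string.

    Whitespace and line breaks are ignored, translation stops at the
    first stop codon, and unrecognized codons are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First five codons of the CDS sequence shown above.
print(translate("ATGGGAAAAGAGGAC"))  # MGKED
```

Applied to the full CDS, the result can be compared with the listed protein sequence, which begins with MGKED and ends with KCLVC.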