CuGenDBv2

Cmc04g0101961 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc04g0101961
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr04:18541809..18543873
RNA-Seq Expression: Cmc04g0101961
Synteny: Cmc04g0101961
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment):
KAA0048687.1 pol protein [Cucumis melo var. makuwa] | e value: 4.0e-270 | % identity: 78.65
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAIVFSKIDLRSGYHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFL
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGA VFSKIDLRSGYHQLRIKD DVPKT FRSRYGHYEFIVMSFGL NAP VFMDLMN+VFREFLDTF+
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAIVFSKIDLRSGYHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFL

Query:  IVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSL
        IVFIDDILIYSKTEAEHEEHLRMVL+TLR NKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDP KIEAVT W +PSTVSEVRSFLGLAGYYRRFVENFS 
Subjt:  IVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSL

Query:  IATPLTQLTRKGAPFAWSKACEDSFQNLKT-------------------------------------------------EASYCTDSYELAAVVFALKIW
        IATPLTQLTRKGAPF WSKACEDSFQNLK                                                  E +Y T   ELAAVVFALKIW
Subjt:  IATPLTQLTRKGAPFAWSKACEDSFQNLKT-------------------------------------------------EASYCTDSYELAAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFERAEIVVSVGAVTMHLAQLT
        RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRD ERAEI VSVGAVTM LAQLT
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFERAEIVVSVGAVTMHLAQLT

Query:  VQPTLRQKIIDAQSNDPYLVEKRGLAEVGKL----LSSPYPLMVD--FCLRGASVCHQIVRLKQNYYLRLNSSPFFMHSDSTKMYQDLKRVYWWRNMKRE
        VQPTLRQ+IIDAQSNDPYLVEKRGLAE G+     LSS   L+ +   C+   SV      +K       +SSPF MH  STKMY+D+KRVYWWRNMKRE
Subjt:  VQPTLRQKIIDAQSNDPYLVEKRGLAEVGKL----LSSPYPLMVD--FCLRGASVCHQIVRLKQNYYLRLNSSPFFMHSDSTKMYQDLKRVYWWRNMKRE

Query:  VEEFVSKCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFISGKSTYIASKWAQLYMSEIVRLHGV
        V EFVS+CLVCQQVKAPR     QKP GLLQPLS+P+WKWENVSMDFITGLPRTLR FTVIWVVVDRLTKSA F+ GKSTY ASKWAQLYMSEIVRLHGV
Subjt:  VEEFVSKCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFISGKSTYIASKWAQLYMSEIVRLHGV

Query:  PVSIVSDRDARFTSKFWKGLHAA
        PVSIVSDRDARFTSKFWK L  A
Subjt:  PVSIVSDRDARFTSKFWKGLHAA

KAA0051357.1 pol protein [Cucumis melo var. makuwa] | e value: 1.5e-269 | % identity: 78.61
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAIVFSKIDLRSGYHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFL
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGA VFSKIDLRSGY+QLRIKD DVPKT FRSRYGHYEFIVMSFGL NAP VFMDLMN+VFREFLDTF+
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAIVFSKIDLRSGYHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFL

Query:  IVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSL
        IVFIDDILIYSKTEAEHEEHLRMVL+TLR NKLYAKFSKCEFWLKQVSFLGHVVSKA VSVDP KIEAVT W +PSTVSEVRSFLGLAGYYRRFVENFS 
Subjt:  IVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSL

Query:  IATPLTQLTRKGAPFAWSKACEDSFQNLKT-------------------------------------------------EASYCTDSYELAAVVFALKIW
        IATPLTQLTRKGAPF WSKACEDSFQNLK                                                  E +Y T   ELAAVVFALKIW
Subjt:  IATPLTQLTRKGAPFAWSKACEDSFQNLKT-------------------------------------------------EASYCTDSYELAAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFERAEIVVSVGAVTMHLAQLT
        RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRD ERAEI VSVGAVTM LAQLT
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFERAEIVVSVGAVTMHLAQLT

Query:  VQPTLRQKIIDAQSNDPYLVEKRGLAEVGKLLSSPYPLMVDFCLRGASVCHQIVRLKQNYYLRLNSSPFFMHSDSTKMYQDLKRVYWWRNMKREVEEFVS
        VQPTLRQ+IIDAQSNDPYLVEKRGLAE G+ +             G         +K       +SSPF MH  STKMYQDLKRVYWWRNMKREV EFVS
Subjt:  VQPTLRQKIIDAQSNDPYLVEKRGLAEVGKLLSSPYPLMVDFCLRGASVCHQIVRLKQNYYLRLNSSPFFMHSDSTKMYQDLKRVYWWRNMKREVEEFVS

Query:  KCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFISGKSTYIASKWAQLYMSEIVRLHGVPVSIVS
        KCLVCQQVK PR     QKP GLLQPLS+P+WKWENVSMDFITGLPRTLR FTVIWVVVDRLTKSA F+ GKSTY ASKWAQLYMSEIVRLHGVPVSIVS
Subjt:  KCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFISGKSTYIASKWAQLYMSEIVRLHGVPVSIVS

Query:  DRDARFTSKFWKGLHAA
        DRDARFTSKFWKGL  A
Subjt:  DRDARFTSKFWKGLHAA

KAA0059792.1 pol protein [Cucumis melo var. makuwa] | e value: 1.5e-269 | % identity: 74.03
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAIVFSKIDLRSGYHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFL
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGA VFSKIDLRSGYHQLRIKD DVPKT FRSRYGHYEF+VMSFGL NAPTVFMDLMN+VFREFLDTF+
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAIVFSKIDLRSGYHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFL

Query:  IVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSL
        IVFIDDILIYSKTEAEHEEHLR VL+TLR NKLYAKFSKCEFWLKQVSFLGHV+SK GVSVDP KIEAVT W +PSTVSEVRSFLGLAGYYRRFVENFS 
Subjt:  IVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSL

Query:  IATPLTQLTRKGAPFAWSKACEDSFQNLKT-------------------------------------------------EASYCTDSYELAAVVFALKIW
        IATPLTQLTRKGAPF WSKACEDSFQNLK                                                  E +Y T   ELAAVVFALKIW
Subjt:  IATPLTQLTRKGAPFAWSKACEDSFQNLKT-------------------------------------------------EASYCTDSYELAAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFERAEIVVSVGAVTMHLAQLT
        RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRD ERAEI VSVGAVTM LAQLT
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFERAEIVVSVGAVTMHLAQLT

Query:  VQPTLRQKIIDAQSNDPYLVEKRGLAEVGKL----LSSPYPLMVD--FCLRGASVCHQIVRLKQNYYLRLNSSPFFMHSDSTKMYQDLKRVYWWRNMKRE
        VQPTLRQ+IIDAQSNDPYLVEKRGLAE G+     LSS   L+ +   C+   SV      +K        SSPF MH  STKMYQDL+RVYWWRNMKRE
Subjt:  VQPTLRQKIIDAQSNDPYLVEKRGLAEVGKL----LSSPYPLMVD--FCLRGASVCHQIVRLKQNYYLRLNSSPFFMHSDSTKMYQDLKRVYWWRNMKRE

Query:  VEEFVSKCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFISGKSTYIASKWAQLYMSEIVRLHGV
        V EFVS+CLVCQQVKAPR     QKP GLLQPLS+P+WKWENVSMDFITGLPRTLR FTVIWVVVDRLTKSA F+ GKSTY ASKWAQLYMSEIVRLHGV
Subjt:  VEEFVSKCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFISGKSTYIASKWAQLYMSEIVRLHGV

Query:  PVSIVSDRDARFTSKFWKGLHAA----FQFNASLQPPHVASFVC-GSGSPTVVQTPSILLPCCLTPHLHL
        PV IVSDRDARFTSKFWKGL  A      F+ +  P       C       +++  ++  P     HLHL
Subjt:  PVSIVSDRDARFTSKFWKGLHAA----FQFNASLQPPHVASFVC-GSGSPTVVQTPSILLPCCLTPHLHL

TYK01613.1 pol protein [Cucumis melo var. makuwa] | e value: 8.1e-271 | % identity: 78.97
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAIVFSKIDLRSGYHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFL
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGA VFSKIDLRSGYHQLRIKD DVPKT FRSRYGHYEFIVMSFGL NAP VFMDLMN+VFREFLDTF+
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAIVFSKIDLRSGYHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFL

Query:  IVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSL
        IVFIDDILIYSKTEAEHEEHLRMVL+TLR NKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDP KIEAVT W +PSTVSEVRSFLGLAGYYRRFVENFS 
Subjt:  IVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSL

Query:  IATPLTQLTRKGAPFAWSKACEDSFQNLKT-------------------------------------------------EASYCTDSYELAAVVFALKIW
        IATPLTQLTRKGAPF WSKACEDSFQ LK                                                  E +Y T   ELAAVVFALKIW
Subjt:  IATPLTQLTRKGAPFAWSKACEDSFQNLKT-------------------------------------------------EASYCTDSYELAAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFERAEIVVSVGAVTMHLAQLT
        RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRD ERAEI VSVGAVTM LAQLT
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFERAEIVVSVGAVTMHLAQLT

Query:  VQPTLRQKIIDAQSNDPYLVEKRGLAEVGKL----LSSPYPLMVD--FCLRGASVCHQIVRLKQNYYLRLNSSPFFMHSDSTKMYQDLKRVYWWRNMKRE
        VQPTLRQ+IIDAQSNDPYLVEKRGLAE G+     LSS   L+ +   C+   S       +K       +SSPF MH  STKMYQDLKRVYWWRNMKRE
Subjt:  VQPTLRQKIIDAQSNDPYLVEKRGLAEVGKL----LSSPYPLMVD--FCLRGASVCHQIVRLKQNYYLRLNSSPFFMHSDSTKMYQDLKRVYWWRNMKRE

Query:  VEEFVSKCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFISGKSTYIASKWAQLYMSEIVRLHGV
        V EFVSKCLVCQQVKAPR     QKP GLLQPLS+P+WKWENVSMDFITGLPRTLR FTVIWVVVDRLTKSA F+ GKSTY ASKWAQLYMSEIVRLHGV
Subjt:  VEEFVSKCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFISGKSTYIASKWAQLYMSEIVRLHGV

Query:  PVSIVSDRDARFTSKFWKGLHAA
        PVSIVSDRDARFTSKFWKGL  A
Subjt:  PVSIVSDRDARFTSKFWKGLHAA

TYK07181.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | e value: 4.6e-290 | % identity: 86.26
Query:  YHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLK
        YHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAP VFMDLMNKVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLK
Subjt:  YHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLK

Query:  QVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSLIATPLTQLTRKGAPFAWSKACEDSFQNLKT---------------
        QVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSLIATPLTQLTRKGAPFAWSKACEDSFQNLK                
Subjt:  QVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSLIATPLTQLTRKGAPFAWSKACEDSFQNLKT---------------

Query:  ---------EASYCTDSYELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALI
                 E +Y T   ELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQ+RWLELVKDYDCEILYHPGK NVVADALSRKVSHSAALI
Subjt:  ---------EASYCTDSYELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALI

Query:  TRQAPLHRDFERAEIVVSVGAVTMHLAQLTVQPTLRQKIIDAQSNDPYLVEKRGLAEVGKLLSSPYPLMVDFCLRGASVCHQIVRLKQNYYLRLNSSPFF
        TRQAPLHRDFERAEIVVSVGAVTMHLAQLTVQ TLRQKIIDAQSNDPYL                       C+   S       +K       +SSPFF
Subjt:  TRQAPLHRDFERAEIVVSVGAVTMHLAQLTVQPTLRQKIIDAQSNDPYLVEKRGLAEVGKLLSSPYPLMVDFCLRGASVCHQIVRLKQNYYLRLNSSPFF

Query:  MHSDSTKMYQDLKRVYWWRNMKREVEEFVSKCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFIS
        MHSDSTKMYQDLKRVYWWRNMKREVEEFVSKCLVCQQVKAPRQKPP QKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFIS
Subjt:  MHSDSTKMYQDLKRVYWWRNMKREVEEFVSKCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFIS

Query:  GKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLHAAFQFNASLQPPHVASFVCGSGSPTVVQTPSILLPCCLTPHLHLRFRLVSPSALP
        GKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDA FTSKFWKGLHAAFQFNASLQ PHVASFVCGSGSPTVVQTPSILLPC  TPHLHLRFRLVSPSALP
Subjt:  GKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLHAAFQFNASLQPPHVASFVCGSGSPTVVQTPSILLPCCLTPHLHLRFRLVSPSALP

Query:  ISTS
        ISTS
Subjt:  ISTS

TrEMBL top hits (e value, % identity, alignment):
A0A5A7U330 Reverse transcriptase | e value: 1.9e-270 | % identity: 78.65
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAIVFSKIDLRSGYHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFL
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGA VFSKIDLRSGYHQLRIKD DVPKT FRSRYGHYEFIVMSFGL NAP VFMDLMN+VFREFLDTF+
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAIVFSKIDLRSGYHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFL

Query:  IVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSL
        IVFIDDILIYSKTEAEHEEHLRMVL+TLR NKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDP KIEAVT W +PSTVSEVRSFLGLAGYYRRFVENFS 
Subjt:  IVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSL

Query:  IATPLTQLTRKGAPFAWSKACEDSFQNLKT-------------------------------------------------EASYCTDSYELAAVVFALKIW
        IATPLTQLTRKGAPF WSKACEDSFQNLK                                                  E +Y T   ELAAVVFALKIW
Subjt:  IATPLTQLTRKGAPFAWSKACEDSFQNLKT-------------------------------------------------EASYCTDSYELAAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFERAEIVVSVGAVTMHLAQLT
        RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRD ERAEI VSVGAVTM LAQLT
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFERAEIVVSVGAVTMHLAQLT

Query:  VQPTLRQKIIDAQSNDPYLVEKRGLAEVGKL----LSSPYPLMVD--FCLRGASVCHQIVRLKQNYYLRLNSSPFFMHSDSTKMYQDLKRVYWWRNMKRE
        VQPTLRQ+IIDAQSNDPYLVEKRGLAE G+     LSS   L+ +   C+   SV      +K       +SSPF MH  STKMY+D+KRVYWWRNMKRE
Subjt:  VQPTLRQKIIDAQSNDPYLVEKRGLAEVGKL----LSSPYPLMVD--FCLRGASVCHQIVRLKQNYYLRLNSSPFFMHSDSTKMYQDLKRVYWWRNMKRE

Query:  VEEFVSKCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFISGKSTYIASKWAQLYMSEIVRLHGV
        V EFVS+CLVCQQVKAPR     QKP GLLQPLS+P+WKWENVSMDFITGLPRTLR FTVIWVVVDRLTKSA F+ GKSTY ASKWAQLYMSEIVRLHGV
Subjt:  VEEFVSKCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFISGKSTYIASKWAQLYMSEIVRLHGV

Query:  PVSIVSDRDARFTSKFWKGLHAA
        PVSIVSDRDARFTSKFWK L  A
Subjt:  PVSIVSDRDARFTSKFWKGLHAA

A0A5A7UAA8 Reverse transcriptase | e value: 7.4e-270 | % identity: 78.61
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAIVFSKIDLRSGYHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFL
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGA VFSKIDLRSGY+QLRIKD DVPKT FRSRYGHYEFIVMSFGL NAP VFMDLMN+VFREFLDTF+
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAIVFSKIDLRSGYHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFL

Query:  IVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSL
        IVFIDDILIYSKTEAEHEEHLRMVL+TLR NKLYAKFSKCEFWLKQVSFLGHVVSKA VSVDP KIEAVT W +PSTVSEVRSFLGLAGYYRRFVENFS 
Subjt:  IVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSL

Query:  IATPLTQLTRKGAPFAWSKACEDSFQNLKT-------------------------------------------------EASYCTDSYELAAVVFALKIW
        IATPLTQLTRKGAPF WSKACEDSFQNLK                                                  E +Y T   ELAAVVFALKIW
Subjt:  IATPLTQLTRKGAPFAWSKACEDSFQNLKT-------------------------------------------------EASYCTDSYELAAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFERAEIVVSVGAVTMHLAQLT
        RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRD ERAEI VSVGAVTM LAQLT
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFERAEIVVSVGAVTMHLAQLT

Query:  VQPTLRQKIIDAQSNDPYLVEKRGLAEVGKLLSSPYPLMVDFCLRGASVCHQIVRLKQNYYLRLNSSPFFMHSDSTKMYQDLKRVYWWRNMKREVEEFVS
        VQPTLRQ+IIDAQSNDPYLVEKRGLAE G+ +             G         +K       +SSPF MH  STKMYQDLKRVYWWRNMKREV EFVS
Subjt:  VQPTLRQKIIDAQSNDPYLVEKRGLAEVGKLLSSPYPLMVDFCLRGASVCHQIVRLKQNYYLRLNSSPFFMHSDSTKMYQDLKRVYWWRNMKREVEEFVS

Query:  KCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFISGKSTYIASKWAQLYMSEIVRLHGVPVSIVS
        KCLVCQQVK PR     QKP GLLQPLS+P+WKWENVSMDFITGLPRTLR FTVIWVVVDRLTKSA F+ GKSTY ASKWAQLYMSEIVRLHGVPVSIVS
Subjt:  KCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFISGKSTYIASKWAQLYMSEIVRLHGVPVSIVS

Query:  DRDARFTSKFWKGLHAA
        DRDARFTSKFWKGL  A
Subjt:  DRDARFTSKFWKGLHAA

A0A5A7UV42 Reverse transcriptase | e value: 7.4e-270 | % identity: 74.03
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAIVFSKIDLRSGYHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFL
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGA VFSKIDLRSGYHQLRIKD DVPKT FRSRYGHYEF+VMSFGL NAPTVFMDLMN+VFREFLDTF+
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAIVFSKIDLRSGYHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFL

Query:  IVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSL
        IVFIDDILIYSKTEAEHEEHLR VL+TLR NKLYAKFSKCEFWLKQVSFLGHV+SK GVSVDP KIEAVT W +PSTVSEVRSFLGLAGYYRRFVENFS 
Subjt:  IVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSL

Query:  IATPLTQLTRKGAPFAWSKACEDSFQNLKT-------------------------------------------------EASYCTDSYELAAVVFALKIW
        IATPLTQLTRKGAPF WSKACEDSFQNLK                                                  E +Y T   ELAAVVFALKIW
Subjt:  IATPLTQLTRKGAPFAWSKACEDSFQNLKT-------------------------------------------------EASYCTDSYELAAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFERAEIVVSVGAVTMHLAQLT
        RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRD ERAEI VSVGAVTM LAQLT
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFERAEIVVSVGAVTMHLAQLT

Query:  VQPTLRQKIIDAQSNDPYLVEKRGLAEVGKL----LSSPYPLMVD--FCLRGASVCHQIVRLKQNYYLRLNSSPFFMHSDSTKMYQDLKRVYWWRNMKRE
        VQPTLRQ+IIDAQSNDPYLVEKRGLAE G+     LSS   L+ +   C+   SV      +K        SSPF MH  STKMYQDL+RVYWWRNMKRE
Subjt:  VQPTLRQKIIDAQSNDPYLVEKRGLAEVGKL----LSSPYPLMVD--FCLRGASVCHQIVRLKQNYYLRLNSSPFFMHSDSTKMYQDLKRVYWWRNMKRE

Query:  VEEFVSKCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFISGKSTYIASKWAQLYMSEIVRLHGV
        V EFVS+CLVCQQVKAPR     QKP GLLQPLS+P+WKWENVSMDFITGLPRTLR FTVIWVVVDRLTKSA F+ GKSTY ASKWAQLYMSEIVRLHGV
Subjt:  VEEFVSKCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFISGKSTYIASKWAQLYMSEIVRLHGV

Query:  PVSIVSDRDARFTSKFWKGLHAA----FQFNASLQPPHVASFVC-GSGSPTVVQTPSILLPCCLTPHLHL
        PV IVSDRDARFTSKFWKGL  A      F+ +  P       C       +++  ++  P     HLHL
Subjt:  PVSIVSDRDARFTSKFWKGLHAA----FQFNASLQPPHVASFVC-GSGSPTVVQTPSILLPCCLTPHLHL

A0A5D3BPI1 Reverse transcriptase | e value: 3.9e-271 | % identity: 78.97
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAIVFSKIDLRSGYHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFL
        MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGA VFSKIDLRSGYHQLRIKD DVPKT FRSRYGHYEFIVMSFGL NAP VFMDLMN+VFREFLDTF+
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAIVFSKIDLRSGYHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFL

Query:  IVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSL
        IVFIDDILIYSKTEAEHEEHLRMVL+TLR NKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDP KIEAVT W +PSTVSEVRSFLGLAGYYRRFVENFS 
Subjt:  IVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSL

Query:  IATPLTQLTRKGAPFAWSKACEDSFQNLKT-------------------------------------------------EASYCTDSYELAAVVFALKIW
        IATPLTQLTRKGAPF WSKACEDSFQ LK                                                  E +Y T   ELAAVVFALKIW
Subjt:  IATPLTQLTRKGAPFAWSKACEDSFQNLKT-------------------------------------------------EASYCTDSYELAAVVFALKIW

Query:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFERAEIVVSVGAVTMHLAQLT
        RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRD ERAEI VSVGAVTM LAQLT
Subjt:  RHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFERAEIVVSVGAVTMHLAQLT

Query:  VQPTLRQKIIDAQSNDPYLVEKRGLAEVGKL----LSSPYPLMVD--FCLRGASVCHQIVRLKQNYYLRLNSSPFFMHSDSTKMYQDLKRVYWWRNMKRE
        VQPTLRQ+IIDAQSNDPYLVEKRGLAE G+     LSS   L+ +   C+   S       +K       +SSPF MH  STKMYQDLKRVYWWRNMKRE
Subjt:  VQPTLRQKIIDAQSNDPYLVEKRGLAEVGKL----LSSPYPLMVD--FCLRGASVCHQIVRLKQNYYLRLNSSPFFMHSDSTKMYQDLKRVYWWRNMKRE

Query:  VEEFVSKCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFISGKSTYIASKWAQLYMSEIVRLHGV
        V EFVSKCLVCQQVKAPR     QKP GLLQPLS+P+WKWENVSMDFITGLPRTLR FTVIWVVVDRLTKSA F+ GKSTY ASKWAQLYMSEIVRLHGV
Subjt:  VEEFVSKCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFISGKSTYIASKWAQLYMSEIVRLHGV

Query:  PVSIVSDRDARFTSKFWKGLHAA
        PVSIVSDRDARFTSKFWKGL  A
Subjt:  PVSIVSDRDARFTSKFWKGLHAA

A0A5D3C5J7 Ty3-gypsy retrotransposon protein | e value: 2.2e-290 | % identity: 86.26
Query:  YHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLK
        YHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAP VFMDLMNKVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLK
Subjt:  YHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFLIVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLK

Query:  QVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSLIATPLTQLTRKGAPFAWSKACEDSFQNLKT---------------
        QVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSLIATPLTQLTRKGAPFAWSKACEDSFQNLK                
Subjt:  QVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSLIATPLTQLTRKGAPFAWSKACEDSFQNLKT---------------

Query:  ---------EASYCTDSYELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALI
                 E +Y T   ELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQ+RWLELVKDYDCEILYHPGK NVVADALSRKVSHSAALI
Subjt:  ---------EASYCTDSYELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALI

Query:  TRQAPLHRDFERAEIVVSVGAVTMHLAQLTVQPTLRQKIIDAQSNDPYLVEKRGLAEVGKLLSSPYPLMVDFCLRGASVCHQIVRLKQNYYLRLNSSPFF
        TRQAPLHRDFERAEIVVSVGAVTMHLAQLTVQ TLRQKIIDAQSNDPYL                       C+   S       +K       +SSPFF
Subjt:  TRQAPLHRDFERAEIVVSVGAVTMHLAQLTVQPTLRQKIIDAQSNDPYLVEKRGLAEVGKLLSSPYPLMVDFCLRGASVCHQIVRLKQNYYLRLNSSPFF

Query:  MHSDSTKMYQDLKRVYWWRNMKREVEEFVSKCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFIS
        MHSDSTKMYQDLKRVYWWRNMKREVEEFVSKCLVCQQVKAPRQKPP QKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFIS
Subjt:  MHSDSTKMYQDLKRVYWWRNMKREVEEFVSKCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFIS

Query:  GKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLHAAFQFNASLQPPHVASFVCGSGSPTVVQTPSILLPCCLTPHLHLRFRLVSPSALP
        GKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDA FTSKFWKGLHAAFQFNASLQ PHVASFVCGSGSPTVVQTPSILLPC  TPHLHLRFRLVSPSALP
Subjt:  GKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLHAAFQFNASLQPPHVASFVCGSGSPTVVQTPSILLPCCLTPHLHLRFRLVSPSALP

Query:  ISTS
        ISTS
Subjt:  ISTS

SwissProt top hits (e value, % identity, alignment):
P0CT34 Transposon Tf2-1 polyprotein | e value: 8.3e-77 | % identity: 28.19
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAIVFSKIDLRSGYHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFL
        +R+ +DY+ LNK    N YPLP I+ L  ++QG+ +F+K+DL+S YH +R++  D  K  FR   G +E++VM +G+  AP  F   +N +  E  ++ +
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAIVFSKIDLRSGYHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFL

Query:  IVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSL
        + ++DDILI+SK+E+EH +H++ VL+ L+   L    +KCEF   QV F+G+ +S+ G +     I+ V  W QP    E+R FLG   Y R+F+   S 
Subjt:  IVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSL

Query:  IATPLTQLTRKGAPFAWSKACEDSFQNLK---------------------TEAS---------------------------------YCTDSYELAAVVF
        +  PL  L +K   + W+     + +N+K                     T+AS                                 Y     E+ A++ 
Subjt:  IATPLTQLTRKGAPFAWSKACEDSFQNLK---------------------TEAS---------------------------------YCTDSYELAAVVF

Query:  ALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFERAEI--VVSV
        +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I  V  +
Subjt:  ALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFERAEI--VVSV

Query:  GAVTMHLAQLTVQPTLRQKIIDAQSNDPYLVEKRGLAEVGKLLSSPYPLMVDFCLRGASVCHQIVRLKQNYYLRLNSSPFFMHSDSTKMYQDLKRVYWWR
                Q+  + T   K+++  +N+   VE+    + G L++S   +++              +L +    + +     +H     +   + R + W+
Subjt:  GAVTMHLAQLTVQPTLRQKIIDAQSNDPYLVEKRGLAEVGKLLSSPYPLMVDFCLRGASVCHQIVRLKQNYYLRLNSSPFFMHSDSTKMYQDLKRVYWWR

Query:  NMKREVEEFVSKCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFISGKSTYIASKWAQLYMSEIV
         ++++++E+V  C  CQ       K    KP G LQP+   +  WE++SMDFIT LP +   +  ++VVVDR +K A  +    +  A + A+++   ++
Subjt:  NMKREVEEFVSKCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFISGKSTYIASKWAQLYMSEIV

Query:  RLHGVPVSIVSDRDARFTSKFWKGLHAAFQFNASLQPPHVASFVCGSGSPTVVQTPSILLPCCLTPH
           G P  I++D D  FTS+ WK     + F      P+         +    QT   LL C  + H
Subjt:  RLHGVPVSIVSDRDARFTSKFWKGLHAAFQFNASLQPPHVASFVCGSGSPTVVQTPSILLPCCLTPH

P0CT35 Transposon Tf2-2 polyprotein | e value: 8.3e-77 | % identity: 28.19
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAIVFSKIDLRSGYHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFL
        +R+ +DY+ LNK    N YPLP I+ L  ++QG+ +F+K+DL+S YH +R++  D  K  FR   G +E++VM +G+  AP  F   +N +  E  ++ +
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAIVFSKIDLRSGYHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFL

Query:  IVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSL
        + ++DDILI+SK+E+EH +H++ VL+ L+   L    +KCEF   QV F+G+ +S+ G +     I+ V  W QP    E+R FLG   Y R+F+   S 
Subjt:  IVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSL

Query:  IATPLTQLTRKGAPFAWSKACEDSFQNLK---------------------TEAS---------------------------------YCTDSYELAAVVF
        +  PL  L +K   + W+     + +N+K                     T+AS                                 Y     E+ A++ 
Subjt:  IATPLTQLTRKGAPFAWSKACEDSFQNLK---------------------TEAS---------------------------------YCTDSYELAAVVF

Query:  ALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFERAEI--VVSV
        +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I  V  +
Subjt:  ALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFERAEI--VVSV

Query:  GAVTMHLAQLTVQPTLRQKIIDAQSNDPYLVEKRGLAEVGKLLSSPYPLMVDFCLRGASVCHQIVRLKQNYYLRLNSSPFFMHSDSTKMYQDLKRVYWWR
                Q+  + T   K+++  +N+   VE+    + G L++S   +++              +L +    + +     +H     +   + R + W+
Subjt:  GAVTMHLAQLTVQPTLRQKIIDAQSNDPYLVEKRGLAEVGKLLSSPYPLMVDFCLRGASVCHQIVRLKQNYYLRLNSSPFFMHSDSTKMYQDLKRVYWWR

Query:  NMKREVEEFVSKCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFISGKSTYIASKWAQLYMSEIV
         ++++++E+V  C  CQ       K    KP G LQP+   +  WE++SMDFIT LP +   +  ++VVVDR +K A  +    +  A + A+++   ++
Subjt:  NMKREVEEFVSKCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFISGKSTYIASKWAQLYMSEIV

Query:  RLHGVPVSIVSDRDARFTSKFWKGLHAAFQFNASLQPPHVASFVCGSGSPTVVQTPSILLPCCLTPH
           G P  I++D D  FTS+ WK     + F      P+         +    QT   LL C  + H
Subjt:  RLHGVPVSIVSDRDARFTSKFWKGLHAAFQFNASLQPPHVASFVCGSGSPTVVQTPSILLPCCLTPH

P0CT36 Transposon Tf2-3 polyprotein | e value: 8.3e-77 | % identity: 28.19
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAIVFSKIDLRSGYHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFL
        +R+ +DY+ LNK    N YPLP I+ L  ++QG+ +F+K+DL+S YH +R++  D  K  FR   G +E++VM +G+  AP  F   +N +  E  ++ +
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAIVFSKIDLRSGYHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFL

Query:  IVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSL
        + ++DDILI+SK+E+EH +H++ VL+ L+   L    +KCEF   QV F+G+ +S+ G +     I+ V  W QP    E+R FLG   Y R+F+   S 
Subjt:  IVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSL

Query:  IATPLTQLTRKGAPFAWSKACEDSFQNLK---------------------TEAS---------------------------------YCTDSYELAAVVF
        +  PL  L +K   + W+     + +N+K                     T+AS                                 Y     E+ A++ 
Subjt:  IATPLTQLTRKGAPFAWSKACEDSFQNLK---------------------TEAS---------------------------------YCTDSYELAAVVF

Query:  ALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFERAEI--VVSV
        +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I  V  +
Subjt:  ALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFERAEI--VVSV

Query:  GAVTMHLAQLTVQPTLRQKIIDAQSNDPYLVEKRGLAEVGKLLSSPYPLMVDFCLRGASVCHQIVRLKQNYYLRLNSSPFFMHSDSTKMYQDLKRVYWWR
                Q+  + T   K+++  +N+   VE+    + G L++S   +++              +L +    + +     +H     +   + R + W+
Subjt:  GAVTMHLAQLTVQPTLRQKIIDAQSNDPYLVEKRGLAEVGKLLSSPYPLMVDFCLRGASVCHQIVRLKQNYYLRLNSSPFFMHSDSTKMYQDLKRVYWWR

Query:  NMKREVEEFVSKCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFISGKSTYIASKWAQLYMSEIV
         ++++++E+V  C  CQ       K    KP G LQP+   +  WE++SMDFIT LP +   +  ++VVVDR +K A  +    +  A + A+++   ++
Subjt:  NMKREVEEFVSKCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFISGKSTYIASKWAQLYMSEIV

Query:  RLHGVPVSIVSDRDARFTSKFWKGLHAAFQFNASLQPPHVASFVCGSGSPTVVQTPSILLPCCLTPH
           G P  I++D D  FTS+ WK     + F      P+         +    QT   LL C  + H
Subjt:  RLHGVPVSIVSDRDARFTSKFWKGLHAAFQFNASLQPPHVASFVCGSGSPTVVQTPSILLPCCLTPH

P0CT37 Transposon Tf2-4 polyprotein | e value: 8.3e-77 | % identity: 28.19
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAIVFSKIDLRSGYHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFL
        +R+ +DY+ LNK    N YPLP I+ L  ++QG+ +F+K+DL+S YH +R++  D  K  FR   G +E++VM +G+  AP  F   +N +  E  ++ +
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAIVFSKIDLRSGYHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFL

Query:  IVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSL
        + ++DDILI+SK+E+EH +H++ VL+ L+   L    +KCEF   QV F+G+ +S+ G +     I+ V  W QP    E+R FLG   Y R+F+   S 
Subjt:  IVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSL

Query:  IATPLTQLTRKGAPFAWSKACEDSFQNLK---------------------TEAS---------------------------------YCTDSYELAAVVF
        +  PL  L +K   + W+     + +N+K                     T+AS                                 Y     E+ A++ 
Subjt:  IATPLTQLTRKGAPFAWSKACEDSFQNLK---------------------TEAS---------------------------------YCTDSYELAAVVF

Query:  ALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFERAEI--VVSV
        +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I  V  +
Subjt:  ALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFERAEI--VVSV

Query:  GAVTMHLAQLTVQPTLRQKIIDAQSNDPYLVEKRGLAEVGKLLSSPYPLMVDFCLRGASVCHQIVRLKQNYYLRLNSSPFFMHSDSTKMYQDLKRVYWWR
                Q+  + T   K+++  +N+   VE+    + G L++S   +++              +L +    + +     +H     +   + R + W+
Subjt:  GAVTMHLAQLTVQPTLRQKIIDAQSNDPYLVEKRGLAEVGKLLSSPYPLMVDFCLRGASVCHQIVRLKQNYYLRLNSSPFFMHSDSTKMYQDLKRVYWWR

Query:  NMKREVEEFVSKCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFISGKSTYIASKWAQLYMSEIV
         ++++++E+V  C  CQ       K    KP G LQP+   +  WE++SMDFIT LP +   +  ++VVVDR +K A  +    +  A + A+++   ++
Subjt:  NMKREVEEFVSKCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFISGKSTYIASKWAQLYMSEIV

Query:  RLHGVPVSIVSDRDARFTSKFWKGLHAAFQFNASLQPPHVASFVCGSGSPTVVQTPSILLPCCLTPH
           G P  I++D D  FTS+ WK     + F      P+         +    QT   LL C  + H
Subjt:  RLHGVPVSIVSDRDARFTSKFWKGLHAAFQFNASLQPPHVASFVCGSGSPTVVQTPSILLPCCLTPH

P0CT41 Transposon Tf2-12 polyprotein | e value: 8.3e-77 | % identity: 28.19
Query:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAIVFSKIDLRSGYHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFL
        +R+ +DY+ LNK    N YPLP I+ L  ++QG+ +F+K+DL+S YH +R++  D  K  FR   G +E++VM +G+  AP  F   +N +  E  ++ +
Subjt:  MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAIVFSKIDLRSGYHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFL

Query:  IVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSL
        + ++DDILI+SK+E+EH +H++ VL+ L+   L    +KCEF   QV F+G+ +S+ G +     I+ V  W QP    E+R FLG   Y R+F+   S 
Subjt:  IVFIDDILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSL

Query:  IATPLTQLTRKGAPFAWSKACEDSFQNLK---------------------TEAS---------------------------------YCTDSYELAAVVF
        +  PL  L +K   + W+     + +N+K                     T+AS                                 Y     E+ A++ 
Subjt:  IATPLTQLTRKGAPFAWSKACEDSFQNLK---------------------TEAS---------------------------------YCTDSYELAAVVF

Query:  ALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFERAEI--VVSV
        +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I  V  +
Subjt:  ALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDFERAEI--VVSV

Query:  GAVTMHLAQLTVQPTLRQKIIDAQSNDPYLVEKRGLAEVGKLLSSPYPLMVDFCLRGASVCHQIVRLKQNYYLRLNSSPFFMHSDSTKMYQDLKRVYWWR
                Q+  + T   K+++  +N+   VE+    + G L++S   +++              +L +    + +     +H     +   + R + W+
Subjt:  GAVTMHLAQLTVQPTLRQKIIDAQSNDPYLVEKRGLAEVGKLLSSPYPLMVDFCLRGASVCHQIVRLKQNYYLRLNSSPFFMHSDSTKMYQDLKRVYWWR

Query:  NMKREVEEFVSKCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFISGKSTYIASKWAQLYMSEIV
         ++++++E+V  C  CQ       K    KP G LQP+   +  WE++SMDFIT LP +   +  ++VVVDR +K A  +    +  A + A+++   ++
Subjt:  NMKREVEEFVSKCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFISGKSTYIASKWAQLYMSEIV

Query:  RLHGVPVSIVSDRDARFTSKFWKGLHAAFQFNASLQPPHVASFVCGSGSPTVVQTPSILLPCCLTPH
           G P  I++D D  FTS+ WK     + F      P+         +    QT   LL C  + H
Subjt:  RLHGVPVSIVSDRDARFTSKFWKGLHAAFQFNASLQPPHVASFVCGSGSPTVVQTPSILLPCCLTPH

Arabidopsis top hits    e value    %identity    Alignment
ATMG00860.1 DNA/RNA polymerases superfamily protein    1.6e-22    44.64
Query:  HLRMVLETLRANKLYAKFSKCEFWLKQVSFLG--HVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSLIATPLTQLTRKGAPFAW
        HL MVL+    ++ YA   KC F   Q+++LG  H++S  GVS DP K+EA+  WP+P   +E+R FLGL GYYRRFV+N+  I  PLT+L +K +   W
Subjt:  HLRMVLETLRANKLYAKFSKCEFWLKQVSFLG--HVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSLIATPLTQLTRKGAPFAW

Query:  SKACEDSFQNLK
        ++    +F+ LK
Subjt:  SKACEDSFQNLK


Sequences
CDS sequence
ATGCGCCTATGCATTGACTATAGGGAGTTGAATAAGGTAACCGTTAAGAATAGATATCCCTTGCCCAGGATCGACGATCTATTTGACCAGTTACAGGGAGCTATA
GTGTTTTCTAAGATTGACCTTCGGTCAGGATATCATCAGCTGAGGATTAAGGATAGGGATGTACCAAAGACAACCTTTCGTTCCAGATATGGACACTATGAGTTT
ATTGTGATGTCTTTTGGTTTGATGAATGCTCCGACAGTGTTTATGGATTTGATGAACAAAGTGTTTAGGGAGTTCCTAGACACTTTTCTGATCGTGTTTATCGAT
GATATTTTGATATATTCCAAGACAGAGGCCGAGCATGAGGAGCATTTACGCATGGTTCTAGAAACCCTTCGAGCTAATAAATTGTATGCAAAGTTCTCGAAATGT
GAGTTTTGGTTGAAGCAGGTATCCTTTCTAGGCCATGTGGTTTCTAAGGCTGGAGTTTCTGTGGATCCAACTAAGATAGAGGCAGTCACCAGTTGGCCCCAACCT
TCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGTTATTATCGACGTTTTGTGGAGAACTTTTCCCTTATAGCTACTCCTCTTACTCAGTTGACCAGA
AAGGGAGCTCCTTTTGCTTGGAGCAAGGCATGTGAGGACAGTTTCCAGAACCTTAAAACAGAAGCTAGTTACTGTACCGATTCTTACGAGTTGGCAGCAGTGGTT
TTTGCATTGAAGATATGGAGGCATTACTTGTATGGTGAAAAGATACAAATCTTCACGGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAGGAATTGAATATG
AGACAGCGAAGATGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATATTATATCATCCAGGCAAGGCGAATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCA
CATTCAGCAGCACTTATTACCCGACAGGCCCCATTGCATCGAGATTTTGAGAGGGCTGAGATTGTAGTGTCAGTAGGGGCAGTCACCATGCATTTAGCCCAGTTG
ACGGTACAGCCGACCTTGAGGCAGAAGATCATTGATGCTCAGAGTAACGATCCTTATCTGGTTGAGAAGCGTGGCCTAGCAGAAGTAGGCAAGCTGTTGAGTTCT
CCATATCCTCTGATGGTGGACTTTTGTTTGAGAGGCGCCTCTGTGTGCCATCAGATAGTGCGGTTAAAACAGAATTATTATCTGAGGCTCAACAGTTCCCCATTT
TTTATGCACTCGGATAGTACGAAGATGTATCAGGACCTGAAACGGGTTTATTGGTGGCGTAATATGAAGAGAGAGGTGGAAGAATTTGTTAGTAAATGCTTGGTG
TGTCAGCAGGTTAAGGCACCAAGGCAGAAACCACCAACGCAGAAACCAACGGGTTTATTACAACCCTTGAGCGTACCGAAATGGAAGTGGGAAAACGTGTCCATG
GATTTCATTACGGGACTACCTAGAACTTTGAGGAGTTTTACAGTGATTTGGGTTGTGGTTGACAGGCTTACCAAATCAGCACGCTTCATTTCGGGTAAATCCACC
TATATAGCTAGTAAGTGGGCACAGTTGTACATGTCTGAAATAGTGAGACTACATGGAGTGCCAGTGTCGATTGTTTCTGATAGAGATGCCCGTTTCACTTCCAAA
TTCTGGAAGGGTTTGCATGCTGCTTTTCAGTTCAACGCTAGTCTGCAGCCGCCGCACGTTGCCTCGTTCGTCTGCGGTAGTGGAAGCCCGACCGTCGTTCAAACT
CCGTCAATCCTTTTGCCTTGCTGTTTGACGCCTCACCTTCACCTTCGCTTTCGTTTGGTTTCGCCGTCGGCCCTCCCAATCTCCACTTCGTATCCAGCCAATGCC
ACCGTCGTCACTGTCCGTTCGTTTTGA
mRNA sequence
ATGCGCCTATGCATTGACTATAGGGAGTTGAATAAGGTAACCGTTAAGAATAGATATCCCTTGCCCAGGATCGACGATCTATTTGACCAGTTACAGGGAGCTATA
GTGTTTTCTAAGATTGACCTTCGGTCAGGATATCATCAGCTGAGGATTAAGGATAGGGATGTACCAAAGACAACCTTTCGTTCCAGATATGGACACTATGAGTTT
ATTGTGATGTCTTTTGGTTTGATGAATGCTCCGACAGTGTTTATGGATTTGATGAACAAAGTGTTTAGGGAGTTCCTAGACACTTTTCTGATCGTGTTTATCGAT
GATATTTTGATATATTCCAAGACAGAGGCCGAGCATGAGGAGCATTTACGCATGGTTCTAGAAACCCTTCGAGCTAATAAATTGTATGCAAAGTTCTCGAAATGT
GAGTTTTGGTTGAAGCAGGTATCCTTTCTAGGCCATGTGGTTTCTAAGGCTGGAGTTTCTGTGGATCCAACTAAGATAGAGGCAGTCACCAGTTGGCCCCAACCT
TCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGTTATTATCGACGTTTTGTGGAGAACTTTTCCCTTATAGCTACTCCTCTTACTCAGTTGACCAGA
AAGGGAGCTCCTTTTGCTTGGAGCAAGGCATGTGAGGACAGTTTCCAGAACCTTAAAACAGAAGCTAGTTACTGTACCGATTCTTACGAGTTGGCAGCAGTGGTT
TTTGCATTGAAGATATGGAGGCATTACTTGTATGGTGAAAAGATACAAATCTTCACGGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAGGAATTGAATATG
AGACAGCGAAGATGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATATTATATCATCCAGGCAAGGCGAATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCA
CATTCAGCAGCACTTATTACCCGACAGGCCCCATTGCATCGAGATTTTGAGAGGGCTGAGATTGTAGTGTCAGTAGGGGCAGTCACCATGCATTTAGCCCAGTTG
ACGGTACAGCCGACCTTGAGGCAGAAGATCATTGATGCTCAGAGTAACGATCCTTATCTGGTTGAGAAGCGTGGCCTAGCAGAAGTAGGCAAGCTGTTGAGTTCT
CCATATCCTCTGATGGTGGACTTTTGTTTGAGAGGCGCCTCTGTGTGCCATCAGATAGTGCGGTTAAAACAGAATTATTATCTGAGGCTCAACAGTTCCCCATTT
TTTATGCACTCGGATAGTACGAAGATGTATCAGGACCTGAAACGGGTTTATTGGTGGCGTAATATGAAGAGAGAGGTGGAAGAATTTGTTAGTAAATGCTTGGTG
TGTCAGCAGGTTAAGGCACCAAGGCAGAAACCACCAACGCAGAAACCAACGGGTTTATTACAACCCTTGAGCGTACCGAAATGGAAGTGGGAAAACGTGTCCATG
GATTTCATTACGGGACTACCTAGAACTTTGAGGAGTTTTACAGTGATTTGGGTTGTGGTTGACAGGCTTACCAAATCAGCACGCTTCATTTCGGGTAAATCCACC
TATATAGCTAGTAAGTGGGCACAGTTGTACATGTCTGAAATAGTGAGACTACATGGAGTGCCAGTGTCGATTGTTTCTGATAGAGATGCCCGTTTCACTTCCAAA
TTCTGGAAGGGTTTGCATGCTGCTTTTCAGTTCAACGCTAGTCTGCAGCCGCCGCACGTTGCCTCGTTCGTCTGCGGTAGTGGAAGCCCGACCGTCGTTCAAACT
CCGTCAATCCTTTTGCCTTGCTGTTTGACGCCTCACCTTCACCTTCGCTTTCGTTTGGTTTCGCCGTCGGCCCTCCCAATCTCCACTTCGTATCCAGCCAATGCC
ACCGTCGTCACTGTCCGTTCGTTTTGA
Protein sequence
MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGAIVFSKIDLRSGYHQLRIKDRDVPKTTFRSRYGHYEFIVMSFGLMNAPTVFMDLMNKVFREFLDTFLIVFID
DILIYSKTEAEHEEHLRMVLETLRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPTKIEAVTSWPQPSTVSEVRSFLGLAGYYRRFVENFSLIATPLTQLTR
KGAPFAWSKACEDSFQNLKTEASYCTDSYELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVS
HSAALITRQAPLHRDFERAEIVVSVGAVTMHLAQLTVQPTLRQKIIDAQSNDPYLVEKRGLAEVGKLLSSPYPLMVDFCLRGASVCHQIVRLKQNYYLRLNSSPF
FMHSDSTKMYQDLKRVYWWRNMKREVEEFVSKCLVCQQVKAPRQKPPTQKPTGLLQPLSVPKWKWENVSMDFITGLPRTLRSFTVIWVVVDRLTKSARFISGKST
YIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLHAAFQFNASLQPPHVASFVCGSGSPTVVQTPSILLPCCLTPHLHLRFRLVSPSALPISTSYPANA
TVVTVRSF